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ISOLATED DROSOPHILA PROTEINS ESSENTIAL FOR SURVIVAL, 
NUCLEIC ACID MOLECULES ENCODING ESSENTIAL DROSOPHILA 
PROTEINS, AND USES THEREOF AS INSECTICIDAL TARGETS 

INVENTOR 

Mark D. YANDELL, 9805 D Gable Ridge Terrace, Rockville, MD 20850, 
Citizenship: United States. 

RELATED APPLICATIONS 

The present application claims priority to US Serial Nos. 60/171,590 and 
60/171,627, both filed December 23, 1999; 06/175,763 and 60/175,685, both filed 
January 12, 2000; and 06/86,663 and 60/187,241, both filed March 3, 2000. 

FIELD OF THE INVENTION 

The present invention is in the field of Drosophila proteins that are essential for 
survival, recombinant DNA molecules and protein production. The present invention 
specifically provides novel Drosophila proteins and nucleic acid molecules encoding such 
protein molecules, for use in the development of insecticide and insecticide targets. 

BACKGROUND OF THE INVENTION 

Drosophila melanogaster 

The Drosophila melanogaster genome is 165 Mb, with about 120 Mb of this 
being euchromatic. The genome is organized in 4 chromosome pairs and is estimated to 
contain 10 - 12,000 genes. Model organisms, such as Drosophila melanogaster, share 
many genes with humans whose sequences and functions have been conserved. In 
addition to myriad similarities in cellular structure and function, humans and Drosophila 
share pathways for intercellular signaling, developmental patterning, learning and 
behavior, as well as tumor formation and metastasis. 

The genes involved in the development of Drosophila, with few exceptions, are 
the same as those involved in the development of higher organisms. Developmental 
biology studies the sequential activation and interaction of genes, in relation to 
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developing morphology. Right now, Drosophila is the only organism for which one can 
begin with a list of genes active in the egg and follow the morphological changes and 
gene activations through to adulthood. 

Drosophila studies have provided the widest knowledge base available for any 
single organism; accordingly, developmental biologists use the fly to ferret out the 
activity of genes with similar functions in higher organisms. Despite its small size, the fly 
is by no means a small developmental problem. If you know the genes involved in the 
development of the fly, you also know, to a reasonable approximation, the genes involved 
in the development of the worm, the fish, the mouse, and humans. 

A major goal in insecticide development is to understand and elucidate the 
molecular mechanisms that govern cell signaling and cell-cell interactions in higher 
eukaryotes. Many proteins identified in Drosophila form major links in cellular 
j communication/response systems. A complete list of proteins from Drosophila would 

therefore be invaluable in developing human therapeutic compounds and insecticidal 
f agents. Not only will the proteins serve as models for human and invertebrate cellular 

; signaling and response, such molecules will also serve as molecular keys in identifying 
therapeutically important human and other invertebrate orthologs. 
Insecticides 

About 10,000 species of the more than 1 million species of insects are crop- 
eating, and of these, approximately 700 species worldwide cause most of the insect 
damage to man's crops, in the field and in storage. 

A detailed study of novel proteins from Drosophila and invertebrate orthologs 
thereof, will serve as targets for identifying new members of the known classes of 
insecticides as well as aiding in the identification of new classes of compounds. 

SUMMARY OF THE INVENTION 

The present invention is based in part on the identification of amino acid 
sequences of 51 1 proteins that are produced by Drosophila melanogaster and are 
essential for survival, many of which look to be insect specific having no knowN 
homolog in mammalian genomes (See SEQ ID NOs:l-39). These unique protein 
sequences, and nucleic acid sequences that encode these proteins, can be used as targets 
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for the development of insecticidal agents and to identify invertebrate and vertebrate 
orthologs thereof. 

DESCRIPTION OF THE FIGURE SHEETS 

FIGURE SHEETS 1-836 provide genomic nucleic acid sequences from 
Drosophila melanogaster ^ predicted transcript, and predicted amino acid sequences of the 
proteins of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

General Description 

The present invention is based on the sequencing of the Drosophila melanogaster 
genome. During the sequencing and assembly of the Drosophila melanogaster genome, 
! analysis of the sequence information revealed previously unidentified peptides that do not 
I share structural and/or sequence homology to any presently known proteins, peptides, or 
f domains. In addition, the specific subset of genes, transcripts, and proteins of the present 

: invention are essential for survival: when altered by way of a P-element insertion, it 
produces a lethal phenotype. Based on this analysis, the present invention provides 
amino acid sequences of proteins produced by Drosophila melanogaster and are essential 
for survival, nucleic acid sequences that encode these Drosophila proteins and methods of 
using these proteins for insecticidal development and gene target development. 

In addition to being previously unknown, the proteins that are provided in the 
present invention are selected based on their ability to be used for the development of 
commercially important products and services. Specifically, the present proteins are 
selected based the need for the protein to be present to produce a viable insect. Some of 
the more specific features of the proteins of the present invention, and the uses thereof, 
are described in detail below. 

Specific Embodiments 
Protein Molecules 

In Figure sheets 1-836, the present invention provides nucleic acid molecules, 
provided in the form of genomic sequences and transcript sequences, that encode 5 1 1 
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protein molecules that have been identified as being essential for Drosophila survival. 
The protein sequences provided herein will be referred to as the Drosophila proteins or 
proteins of the present invention, Drosophila proteins or peptides, or peptides/proteins of 
the present invention. 

The present invention provides isolated peptide/protein and protein molecules that 
consist of, consist essentially of or are comprised of the amino acid sequences of the 
Drosophila proteins encoded by the nucleic acid sequences disclosed in the Figure Sheets 
(the amino acid sequences are provided in SEQ ID NOS: 3, 6, 9, . . . 1527, 1530 and 
1533336, 339 and 342, the genomic sequences are provided in SEQ ID NOS: 1, 4, 7, ... 
334, 337 and 340 and the predicted transcript sequences are provided in SEQ ID NOS: 2, 
5, 8, . . . 1526, 1529 and 1532335, 338 and 341), as well as all obvious variants of these 
peptides that are within the art to make and use. Some of these variants are described in 
3 detail below. 

I As used herein, a peptide is said to be "isolated" or "purified" when it is 

I substantially free of cellular material or free of chemical precursors or other chemicals. 
The proteins of the present invention can be purified to homogeneity or other degrees of 
purity. The level of purification will be based on the intended use. The critical feature is 
; that the preparation allows for the desired function of the peptide, even if in the presence of 
considerable amounts of other components. 

In some uses, "substantially free of cellular material" includes preparations of the 
peptide having less than about 30% (by dry weight) other proteins (i.e., contaminating 
protein), less than about 20% other proteins, less than about 10% other proteins, or less than 
about 5% other proteins. When the peptide is recombinantly produced, it can also be 
substantially free of culture medium, i.e., culture medium represents less than about 20% of 
the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the peptide in which it is separated from chemical precursors or other 
chemicals that are involved in its synthesis. In one embodiment, the language "substantially 
free of chemical precursors or other chemicals" includes preparations of the Drosophila 
protein having less than about 30% (by dry weight) chemical precursors or other chemicals, 
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less than about 20% chemical precursors or other chemicals, less than about 10% chemical 
precursors or other chemicals, or less than about 5% chemical precursors or other chemicals. 

The isolated Drosophila protein can be purified from cells that naturally express it, 
purified from cells that have been altered to express it (recombinant), or synthesized using 
known protein synthesis methods. For example, a nucleic acid molecule encoding the 
Drosophila protein is cloned into an expression vector, the expression vector introduced into 
a host cell and the protein expressed in the host cell. The protein can then be isolated from 
the cells by an appropriate purification scheme using standard protein purification 
techniques. Many of these techniques are described in detail below. 

Accordingly, the present invention provides proteins that consist of one of the amino 
acid sequences encoded by the nucleic acid sequences shown in Figure sheets 1-836. The 
amino acid sequences of such proteins are provided in the SEQ ID NO:3, 6, 9, ... 1 527, 
1530 and 1533, encoded by genomic sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 
1531, or transcript sequences SEQ ID NO: 2, 5, 8, . . . 1526, 1529 and 1532. A protein 
consists of an amino acid sequence when the amino acid sequence is the final amino acid 
sequence of the protein. 

The present invention further provides proteins that consist essentially of one of the 
amino acid sequences encoded by the nucleic acid sequences shown in Figure sheets 1-836, 
SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533. A protein consists essentially of an amino 
acid sequence when such an amino acid sequence is present with only a few additional 
amino acid residues in the final protein. 

The present invention further provides proteins that are comprised of one of the 
amino acid sequences encoded by the nucleic acid sequences shown in Figure sheets 1-836, 
SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533. A protein is comprised of an amino acid 
sequence when the amino acid sequence is at least part of the final amino acid sequence of 
the protein. In such a fashion, the protein can be only the peptide or have additional amino 
acid molecules, such as amino acid residues (contiguous encoded sequence) that are 
naturally associated with it or heterologous amino acid residues/peptide sequences. Such a 
protein can have a few additional amino acid residues or can comprise several hundred or 
more additional amino acids. The preferred classes of proteins that are comprised of the 
Drosophila proteins of the present invention are the naturally occurring mature proteins. A 
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brief description of how various types of these proteins can be made/isolated is provided 
below. 

The Drosophila proteins of the present invention can be attached to heterologous 
sequences to form chimeric or fusion proteins. Such chimeric and fusion proteins comprise 
a Drosophila protein operatively linked to a heterologous protein having an amino acid 
sequence not substantially homologous to the Drosophila protein. "Operatively linked" 
indicates that the Drosophila protein and the heterologous protein are fused in-frame. The 
heterologous protein can be fused to the N-terminus or C-terminus of the Drosophila 
protein. 

In some uses, the fusion protein does not affect the activity of the Drosophila protein 
per se. For example, the fusion protein can include, but is not limited to, enzymatic fusion 
proteins, for example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His 
fusions, MYC-tagged, Hi-tagged and Ig fusions. Such fusion proteins, particularly poly-His 
fusions, can facilitate the purification of recombinant Drosophila protein. In certain host 
cells (e.g., mammalian host cells), expression and/or secretion of a protein can be increased 
by using a heterologous signal sequence. 

A chimeric or fusion protein can be produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the different protein sequences are 
ligated together in-frame in accordance with conventional techniques. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and re-amplified to 
generate a chimeric gene sequence (see Ausubel et al, Current Protocols in Molecular 
Biology, 1992). Moreover, many expression vectors are commercially available that already 
encode a fusion moiety (e.g., a GST protein). A Drosophila protein-encoding nucleic acid 
can be cloned into such an expression vector such that the fusion moiety is linked in-frame 
to the Drosophila protein. 

As mentioned above, the present invention also provides and enables obvious 
variants of the amino acid sequence of the proteins of the present invention, such as 
naturally occurring mature forms of the proteins, allelic/sequence variants of the proteins, 
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non-naturally occurring recombinantly derived variants of the proteins, and orthologs and 
paralogs of the proteins. Such variants can readily be generated using art know techniques 
in the fields of recombinant nucleic acid technology and protein biochemistry. It is 
understood, however, that variants exclude any amino acid sequences disclosed prior to the 
invention. 

Such variants can readily be identified/made using molecular techniques and the 
sequence information disclosed herein. Further, such variants can readily be distinguished 
from other peptides based on sequence and/or structural homology to the Drosophila 
proteins of the present invention. The degree of homology/identity present will be based 
primarily on whether the peptide is a functional variant or non-functional variant, the 
amount of divergence present in the paralog family and the evolutionary distance between 
the orthologs. 

To determine the percent identity of two amino acid sequences or two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a preferred embodiment, the length of a reference sequence 
aligned for comparison purposes is at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or 
more of the length of the reference sequence. The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as 
the corresponding position in the second sequence, then the molecules are identical at that 
position (as used herein amino acid or nucleic acid "identity" is equivalent to amino acid 
or nucleic acid "homology"). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into account 
the number of gaps, and the length of each gap, which need to be introduced for optimal 
alignment of the two sequences. 

The comparison of sequences and determination of percent identity and similarity 
between two sequences can be accomplished using a mathematical algorithm. 
{Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, 



7 



CL000735 

New York, 1993; Computer Analysis of Sequence Data, Part 7, Griffin, A.M., and Griffin, 
H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von 
Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and 
Devereux, J., eds., M Stockton Press, New York, 1991). In a preferred embodiment, the 
percent identity between two amino acid sequences is determined using the Needleman 
and Wunsch (J. Mol Biol (48):444-453 (1970)) algorithm which has been incorporated 
into the GAP program in the GCG software package (available at http://www.gcg.com), 
using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using 
the GAP program in the GCG software package (Devereux, J., et ah, Nucleic Acids Res. 
I2(1):3S7 (1984)) (available at http://www.gcg.com), using a NWSgapdna.CMP matrix 
and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In 
another embodiment, the percent identity between two amino acid or nucleotide 
sequences is determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:1 1- 
17 (1989)) which has been incorporated into the ALIGN program (version 2.0), using a 
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can farther be 
used as a "query sequence" to perform a search against sequence databases to, for 
example, identify other family members or related sequences. Such searches can be 
performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (J. 
Mol Biol 215:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, word length = 12 to obtain nucleotide sequences 
homologous to the nucleic acid molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, word length = 3 to obtain amino 
acid sequences homologous to the proteins of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (Nucleic Acids Res, 25(17):3389-3402 (1997)). When utilizing BLAST 
and gapped BLAST programs, the default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov . 
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Full-length pre-processed forms, as well as mature processed forms, of proteins that 
comprise one of the proteins of the present invention can readily be identified as having 
complete sequence identity to one of the Drosophila proteins of the present invention as well 
as being encoded by the same genetic locus as the Drosophila protein provided herein. 

Allelic variants of a Drosophila protein can readily be identified as having a high 
degree (significant) of sequence homology/identity to at least a portion of the Drosophila 
protein as well as being encoded by the same genetic locus as the Drosophila protein 
provided herein. As used herein, two proteins (or a region of the proteins) have significant 
homology when the amino acid sequences are typically at least about 70-75%, 80-85%, 
and more typically at least about 90-95% or more homologous. A significantly 
homologous amino acid sequence, according to the present invention, will be encoded by 
a nucleic acid sequence that will hybridize to a Drosophila protein encoding nucleic acid 
molecule under stringent conditions as more fully described below. 

Paralogs of a Drosophila protein can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the Drosophila protein, as 
being encoded by a gene from Drosophila, and as having similar activity or function. Two 
proteins will typically be considered paralogs when the amino acid sequences are 
typically at least about 70-75%, 80-85%, and more typically at least about 90-95% or 
more homologous through a given region or domain. Such paralogs will be encoded by a 
nucleic acid sequence that will hybridize to a Drosophila protein encoding nucleic acid 
molecule under stringent conditions as more fully described below. 

Orthologs of a Drosophila protein can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the Drosophila protein as well 
as being encoded by a gene from another organism. Preferred orthologs will be isolated 
from other invertebrates, particularly insects of economical/agriculture importance, e.g. 
members of the Lepidopteran and Coleopteran orders, for the development of insecticides 
and insecticidal targets, or vertebrate counterparts, such as from humans. Such orthologs 
will be encoded by a nucleic acid sequence that will hybridize to a Drosophila protein 
encoding nucleic acid molecule under moderate to stringent conditions, as more fully 
described below, depending on the degree of relatedness of the two organisms yielding 
the proteins. 
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Non-naturally occurring variants of the Drosophila proteins of the present invention 
can readily be generated using recombinant techniques. Such variants include, but are not 
limited to deletions, additions and substitutions in the amino acid sequence of the 
Drosophila protein. For example, one class of substitutions is conserved amino acid 
substitution. Such substitutions are those that substitute a given amino acid in a Drosophila 
protein by another amino acid of like characteristics. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids Ala, 
Val, Leu, and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic 
residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of 
the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. 
Guidance concerning which amino acid changes are likely to be phenotypically silent are 
found in Bowie et aL, Science 247:1306-1310 (1990). 

Variant Drosophila proteins can be fully functional or can lack function in one or 
5 more activities. Fully functional variants typically contain only conservative variation or 

variation in non-critical residues or in non-critical regions. Functional variants can also 
contain substitution of similar amino acids that result in no change or an insignificant 
change in function. Alternatively, such substitutions may positively or negatively affect 
function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, 
inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in 
the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et 
aL , Science 244: 1 08 1 -1 085 (1 989)). The latter procedure introduces single alanine 
mutations at every residue in the molecule. The resulting mutant molecules are then tested 
for biological activity such as receptor binding or in vitro proliferative activity. Sites that 
are critical for ligand-receptor binding can also be determined by structural analysis such as 
crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et aL, J. MoL 
Biol. 224:899-904 (1992); de Vos et aL Science 255:306-312 (1992)). 

Polypeptides often contain amino acids other than the 20 amino acids commonly 
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including 
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the terminal amino acids, may be modified by natural processes, such as processing and 
other post-translational modifications, or by chemical modification techniques well known 
in the art. Common modifications that occur naturally in polypeptides are described in basic 
texts, detailed monographs, and the research literature, and they are well known to those of 
skill in the art. 

Accordingly, the polypeptides also encompass derivatives or analogs in which a 
substituted amino acid residue is not one encoded by the genetic code, in which a substituent 
group is included, in which the mature polypeptide is fused with another compound, such as 
a compound to increase the half-life of the polypeptide (for example, polyethylene glycol), 
or in which the additional amino acids are fused to the mature polypeptide, such as a leader 
or secretory sequence or a sequence for purification of the mature polypeptide or a pro- 
protein sequence. 

The present invention further provides fragments of the Drosophila proteins, in 
addition to proteins and peptides that comprise and consist of such fragments. The 
fragments to which the invention pertains, however, are not to be construed as 
encompassing fragments that may be disclosed publicly prior to the present invention. 

As used herein, a fragment comprises at least 8 or more contiguous amino acid 
residues from a Drosophila protein. Such fragments can be chosen based on the ability to 
retain one or more of the biological activities of the Drosophila protein or could be chosen 
for the ability to perform a function, e.g. act as an immunogen. Particularly important 
fragments are biologically active fragments, peptides that are, for example about 8 or more 
amino acids in length. Such fragments will typically comprise a domain or motif of the 
Drosophila protein, e.g., active site. Further, possible fragments include, but are not limited 
to, domain or motif containing fragments, soluble peptide fragments, and fragments 
containing immunogenic structures. Predicted domains and functional sites are readily 
identifiable by computer programs well known and readily available to those of skill in the 
art (e.g., PROSITE analysis). 

Polypeptides often contain amino acids other than the 20 amino acids commonly 
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including 
the terminal amino acids, may be modified by natural processes, such as processing and 
other post-translational modifications, or by chemical modification techniques well known 
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in the art. Common modifications that occur naturally in Drosophila proteins are described 
in basic texts, detailed monographs, and the research literature, and they are well known to 
those of skill in the art. 

Accordingly, the Drosophila proteins of the present invention also encompass 
derivatives or analogs in which a substituted amino acid residue is not one encoded by the 
genetic code, in which a substituent group is included, in which the mature Drosophila 
protein is fused with another compound, such as a compound to increase the half-life of the 
Drosophila protein (for example, polyethylene glycol), or in which the additional amino 
acids are fused to the mature Drosophila protein, such as a leader or secretory sequence or a 
sequence for purification of the mature Drosophila protein or a pro-protein sequence. 

Known modifications include, but are not limited to, acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 

3 moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of 
a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 

3 cyclization, disulfide bond formation, demethylation, formation of covalent crosslinks, 

2 formation of cystine, formation of pyroglutamate, formylation, gamma carboxylation, 

3 glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 

j myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 

* racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 

I proteins such as arginylation, and ubiquitination. 

I Such modifications are well known to those of skill in the art and have been 

described in great detail in the scientific literature. Several particularly common 
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic 
acid residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic 
texts, such as Proteins - Structure and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. 
Freeman and Company, New York (1993). Many detailed reviews are available on this 
subject, such as by Wold, F., Posttranslational Covalent Modification of Proteins, B.C. 
Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter et ah (Metk Enzymol. 182: 
626-646 (1990)) and Rattan et al (Ann N. Y. Acad Sci 663:48-62 (1992)). 

Protein/Peptide Uses 
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The proteins of the present invention most importantly can be used as insecticide 
targets without even knowing the specific biology of the target. First, the proteins of the 
present invention are essential for Drosophila survival. P-element mutation has shown 
that these transcripts are essential for survival. Second, these proteins do not have known 
homologs. Such proteins can be routinely configured in assays to identify small molecule 
inhibitors of the target protein. 

Further, the Drosophila proteins of the present invention can be used in assays to 
determine the biological activity of the protein, including in a panel of multiple proteins 
for high-throughput screening; to raise antibodies or to elicit another immune response; 
as a reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its binding partner or receptor) in biological fluids; and as 
markers for tissues in which the corresponding protein is preferentially expressed (either 
i constitutively or at a particular stage of tissue differentiation or development). Where the 

Hi, ^ 

* protein binds or potentially binds to another protein (such as, for example, in a receptor- 
ligand interaction), the protein can be used to identify the binding partner so as to develop 
a system to identify inhibitors of the binding interaction. Since the proteins of the present 
invention are selected base on the need for them for survival, such proteins serve as 
excellent targets for the development of insecticidal agents. Any or all of these research 
utilities are capable of being developed into reagent grade or kit format for 
commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include "Molecular Cloning: A Laboratory 
Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. 
Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

The potential uses of the proteins of the present invention are based primarily on 
the source of the protein as well as the class/action of the protein. For example, proteins 
isolated from Drosophila and other invertebrates serve as a target for identifying anti- 
invertebrate compounds, e.g. insecticides. A combination of the invertebrate and 
mammalian orthologs can be used in selective screening methods to find agents specific 
for invertebrates. 
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The Drosophila proteins of the present invention (excluding variants and fragments 
that may have been disclosed prior to the present invention) are useful for biological assays 
related to the class of proteins of which it is a member. Such assays involve any of the 
known protein functions or activities or properties mediated by the protein. 

The Drosophila proteins are particularly useful in insecticide screening assays, in 
cell-based or cell-free systems. Cell-based systems can be native, i.e., cells that normally 
express the Drosophila protein, as cell of tissue sample or culture. In one embodiment, 
however, cell-based assays involve recombinant host cells expressing the Drosophila 
protein. 

The proteins can be used to identify compounds that modulate the protein activity. 
Both the proteins of the present invention and appropriate variants and fragments can be 
used in high-throughput screens to assay candidate compounds for the ability to bind to the 
t protein. These compounds can be further screened against a functional protein to determine 
* the effect of the compound on the protein activity. Further, these compounds can be tested 
I in animal or invertebrate systems to determine activity/effectiveness. Compounds can be 
I identified that activate (agonist) or inactivate (antagonist) the protein to a desired degree, 
Candidate compounds include, for example, 1) peptides such as soluble peptides, 
including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam 
et aU Nature 354:82-84 (1991); Houghten et al, Nature 354:84-86 (1991)) and 
combinatorial chemistry-derived molecular libraries made of D- and/or L- configuration 
amino acids; 2) phosphopeptides (e.g., members of random and partially degenerate, 
directed phosphopeptide libraries, see, e.g., Songyang etaL, Cell 72:161-118 (1993)); 3) 
antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single 
chain antibodies as well as Fab, F(ab')2, Fab expression library fragments, and epitope- 
binding fragments of antibodies); and 4) small organic and inorganic molecules (e.g., 
molecules obtained from combinatorial and natural product libraries). 

One candidate compound is a soluble fragment of the Drosophila protein that 
competes for ligand binding. Other candidate compounds include mutant Drosophila 
proteins or appropriate fragments containing mutations that affect protein function and thus 
compete for ligand and small molecules. Accordingly, a fragment that competes for ligand, 
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for example with a higher affinity, or a fragment that binds ligand but does not allow 
release, is encompassed by the invention. 

The invention further includes other end point assays to identify compounds that 
modulate (stimulate or inhibit) protein Drosophila protein activity. The assays typically 
involve an assay of events in the signal transduction pathway that indicate Drosophila 
protein activity. Thus, the expression of genes that are up- or down-regulated in response to 
the Drosophila protein can be assayed. In one embodiment, the regulatory region of such 
genes can be operably linked to a marker that is easily detectable, such as luciferase. 
Alternatively, phosphorylation of the Drosophila protein, or a Drosophila protein target, 
could also be measured. 

Any of the biological or biochemical functions mediated by the Drosophila protein 
can be used as an endpoint assay. These include all of the biochemical or 
I biochemical/biological events described herein, in the references cited herein, incorporated 

by reference for these endpoint assay targets, and other functions known to those of ordinary 
skill in the art. 

i Binding and/or activating compounds can also be screened by using chimeric 

Drosophila proteins in which any of the protein's domains, or parts thereof, can be replaced 
by heterologous domains or subregions. Accordingly, a different set of signal transduction 
components is available as an end-point assay for activation. This allows for assays to be 
performed in other than the specific host cell from which the Drosophila protein is derived. 

To perform cell free insecticide screening assays, it is sometimes desirable to 
immobilize either the Drosophila protein, or fragment, or its target molecule to facilitate 
separation of complexes from uncomplexed forms of one or both of the proteins, as well as 
to accommodate automation of the assay. 

Techniques for immobilizing proteins on matrices can be used in the insecticide 
screening assays. In one embodiment, a fusion protein can be provided which adds a 
domain that allows the protein to be bound to a matrix. For example, glutathione-S- 
transferase/15625 fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are then 
combined with the cell lysates (e.g., 35 S-labeled) and the candidate compound, and the 
mixture incubated under conditions conducive to complex formation (e.g., at physiological 
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conditions for salt and pH). Following incubation, the beads are washed to remove any 
unbound label, and the matrix immobilized and radiolabel determined directly, or in the 
supernatant after the complexes are dissociated. Alternatively, the complexes can be 
dissociated from the matrix, separated by SDS-PAGE, and the level of Drosophila-binding 
protein found in the bead fraction quantitated from the gel using standard electrophoretic 
techniques. For example, either the polypeptide or its target molecule can be immobilized 
utilizing conjugation of biotin and streptavidin using techniques well known in the ait. 

Agents that modulate one of the Drosophila proteins of the present invention can be 
identified using one or more of the above assays, alone or in combination. It is generally 
preferable to use a cell-based or cell free system first and then confirm activity in an 
animal/insect model system. Such model systems are well known in the art and can readily 
be employed in this context. 

In yet another aspect of the invention, the Drosophila proteins can be used as "bait 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol Chem. 
268:12046-12054; Bartel et al (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) 
Oncogene 8:1693-125696; and Brent WO94/10300), to identify other proteins, which 
bind to or interact with the Drosophila protein and are involved in Drosophila protein 
activity. Such Drosophila-binding proteins are also likely to be involved in the 
propagation of signals by the Drosophila proteins or Drosophila targets as, for example, 
downstream elements of a Drosophila-mediated signaling pathway. Alternatively, such 
Drosophila-binding proteins are likely to be Drosophila protein inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for a 
Drosophila protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library 
of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a 
gene that codes for the activation domain of the known transcription factor. If the "bait" 
and the "prey" proteins are able to interact, in vivo, forming a Drosophila protein- 
dependent complex, the DNA-binding and activation domains of the transcription factor 
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are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the 
cloned gene which encodes the protein which interacts with the Drosophila protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. For example, an agent identified as described herein (e.g., a Drosophila 
protein modulating agent, an antisense Drosophila nucleic acid molecule, a Drosophila 
protein-specific antibody, or a Drosophila protein-binding partner) can be used in an 
insect model to determine the efficacy and toxicity of such an agent. Alternatively, an 
agent identified as described herein can be used in an insect model to determine the 
mechanism of action of such an agent. Furthermore, this invention pertains to uses of 
novel agents identified by the above-described screening assays for insecticidal use 
described herein. 

Antibodies 

The invention also provides antibodies that selectively bind to one of the proteins of 
the present invention, a fragment of such proteins, as well as variants thereof. As used 
herein, an antibody selectively binds a target peptide when it binds the target peptide and 
does not significantly bind to unrelated proteins. An antibody is still considered to 
selectively bind a peptide even if it also binds to other proteins that are not substantially 
homologous with the target peptide so long as such proteins share homology with a 
fragment or domain of the peptide target of the antibody. In this case, it would be 
understood that antibody binding to the peptide is still selective despite some degree of 
cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized 
within the art: they are multi-subunit proteins produced by a mammalian organism in 
response to an antigen challenge. The antibodies of the present invention include polyclonal 
antibodies and monoclonal antibodies, as well as fragments of such antibodies, including, 
but not limited to, Fab or F(ab')2, and Fv fragments. 
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Many methods are known for generating and/or identifying antibodies to a given 
target peptide. Several such methods are described by Harlow, Antibodies, Cold Spring 
Harbor Press, (1989). 

In general, to generate antibodies, an isolated peptide is used as an immunogen and 
is administered to a mammalian organism, such as a rat, rabbit or mouse. Either the full- 
length protein, an antigenic peptide fragment or a fusion protein can be used. 

Antibodies are preferably prepared from regions or discrete fragments of the 
Drosophila proteins. Antibodies can be prepared from any region of the peptide as 
described herein. However, preferred regions will include those involved in 
function/activity and/or receptor/binding partner interaction. 

An antigenic fragment will typically comprise at least 10 contiguous amino acid 
residues. The antigenic peptide can comprise, however, at least 12, 14, 20 or more amino 
acid residues. Such fragments can be selected on a physical property, such as fragments 
correspond to regions that are located on the surface of the protein, e.g., hydrophilic regions 
or can be selected based on sequence uniqueness. 

Detection on an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, 
and examples of suitable radioactive material include 12d I, 13 1 1, 35 S or 3 H. 

Antibody Uses 

The antibodies can be used to isolate one of the proteins of the present invention by 
standard techniques, such as affinity chromatography or immunoprecipitation. The 
antibodies can facilitate the purification of the natural protein from cells and recombinantly 
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produced protein expressed in host cells. In addition, such antibodies are useful to detect the 
presence of one of the proteins of the present invention in cells or tissues to determine the 
pattern of expression of the protein among various tissues in an organism and over the 
course of normal development. Further, such antibodies can be used to detect protein in 
situ, in vitro, or in a cell lysate or supernatant in order to evaluate the abundance and pattern 
of expression. Also, such antibodies can be used to assess abnormal tissue distribution or 
abnormal expression during development. Antibody detection of circulating fragments of 
the full length protein can be used to identify turnover. 

The antibodies are also useful for inhibiting protein function, for example, blocking 
the binding of the Drosophila protein to a binding partner such as a receptor or ligand. 
These uses can also be applied in an insecticidal context in which treatment involves 
inhibiting the protein's function. An antibody can be used, for example, to block binding, 
thus modulating (agonizing or antagonizing) the peptides activity. Antibodies can be 
prepared against specific fragments containing sites required for function or against intact 
associated with a cell. 

The invention also encompasses kits for using antibodies to detect the presence of a 
protein in a biological sample. The kit can comprise antibodies such as a labeled or 
labelable antibody and a compound or agent for detecting protein in a biological sample; 
means for determining the amount of protein in the sample; means for comparing the 
amount of protein in the sample with a standard; and instructions for use. 

Nucleic Acid Molecules 

The present invention further provides isolated nucleic acid molecules that encode a 
Drosophila protein of the present invention. Such nucleic acid molecules will consist of, 
consist essentially of, or comprise a nucleotide sequence that encodes one of the Drosophila 
proteins of the present invention, an allelic variant thereof, or an ortholog or paralog thereof. 

As used herein, an "isolated" nucleic acid molecule is one that is separated from 
other nucleic acid present in the natural source of the nucleic acid. Preferably, an "isolated" 
nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences 
located at the 5' and 3 ? ends of the nucleic acid) in the genomic DNA of the organism from 
which the nucleic acid is derived. However, there can be some flanking nucleotide 
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sequences, for example up to about 5KB, particularly contiguous peptide encoding 
sequences and peptide encoding sequences within the same gene but separated by introns in 
the genomic sequence. The important point is that the nucleic acid is isolated from remote 
and unimportant flanking sequences such that it can be subjected to the specific 
manipulations described herein such as recombinant expression, preparation of probes and 
primers, and other uses specific to the nucleic acid sequences. 

Moreover, an "isolated 11 nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or chemical precursors or other chemicals when chemically 
synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

For example, recombinant DNA molecules contained in a vector are considered 
isolated. Further examples of isolated DNA molecules include recombinant DNA 
molecules maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts 
of the isolated DNA molecules of the present invention. Isolated nucleic acid molecules 
according to the present invention further include such molecules produced synthetically. 

Accordingly, the present invention provides nucleic acid molecules that consist of 
one of the nucleotide sequences shown in Figure sheets 1-836: genomic sequences SEQ ID 
NO: 1, 4, 7, . . . 1525, 1528 and 1531, and transcript sequences SEQ ID NO: 2, 5, 8, . . . 

1526, 1529 and 1532 and/or encode a protein comprising of SEQ ID NO:3, 6, 9, . . . 
1527, 1530 and 1533. A nucleic acid molecule consists of a nucleotide sequence when the 
nucleotide sequence is the complete nucleotide sequence of the nucleic acid molecule. 

The present invention further provides nucleic acid molecules that consist essentially 
of one of the nucleotide sequences shown in Figure sheets 1-836, genomic sequences SEQ 
ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, and transcript sequences SEQ ID NO: 2, 5, 8, . . 
. 1526, 1529 and 1532 and/or encode a protein comprising of SEQ ID NO:3, 6, 9, . . . 

1527, 1530 and 1533. A nucleic acid molecule consists essentially of a nucleotide sequence 
when such a nucleotide sequence is present with only a few additional nucleic acid residues 
in the final nucleic acid molecule. 
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The present invention further provides nucleic acid molecules that are comprised of 
one of the nucleotide sequences shown in Figure sheets 1-836, genomic sequences SEQ ID 
NO: 1, 4, 7, . . . 1525, 1528 and 1531, and transcript sequences SEQ ID NO: 2, 5, 8, . . . 
1526, 1529 and 1532 and/or encode a protein comprising of SEQ ID NO:3, 6, 9, . . . 
1527, 1530 and 1533. A nucleic acid molecule is comprised of a nucleotide sequence when 
the nucleotide sequence is at least part of the final nucleotide sequence of the nucleic acid 
molecule. In such a fashion, the nucleic acid molecule can be only the nucleotide sequence 
or have additional nucleic acid residues, such as nucleic acid residues that are naturally 
associated with it or heterologous nucleotide sequences. Such a nucleic acid molecule can 
have a few additional nucleotides or can comprises several hundred or more additional 
nucleotides. The preferred classes of nucleic acid molecules that are comprised of the 
nucleotide sequences of the present are the naturally occurring full-length cDNA molecules 

f and genes and genomic sequences. A brief description of how various types of these nucleic 

:ri acid molecules can be readily made/isolated is provided below. 

«% In the Figures, both coding and non-coding sequences are provided for each 

3 peptide encoding nucleic acid sequence (both genomic and transcript sequences). 

i Because of the source of the present invention, Drosophila genomic sequences, the 

genomic nucleic acid molecules in the figures will contain genomic intronic sequences, 

4 5 7 and 3' non-coding sequences, gene regulatory regions and non-coding intergenic 

n sequences. In general such sequence features are either noted or can readily be identified 
f using computational tools known in the art. As discussed below, some of the non-coding 
regions, particularly gene regulatory elements such as promoters, are useful for a variety 
of purposes, e.g. control of heterologous gene expression, target for identifying gene 
activity modulating compounds. 

The isolated nucleic acid molecules can encode the mature protein plus additional 
amino or carboxyl-terminal amino acids, or amino acids interior to the mature peptide (when 
the mature form has more than one peptide chain, for instance). Such sequences may play a 
role in processing of a protein from precursor to a mature form, facilitate protein trafficking, 
prolong or shorten protein half-life or facilitate manipulation of a protein for assay or 
production, among other things. As generally is the case in situ, the additional amino acids 
may be processed away from the mature protein by cellular enzymes. 
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As mentioned above, the isolated nucleic acid molecules include, but are not limited 
to, the sequence encoding the Drosophila protein alone, the sequence encoding the mature 
peptide and additional coding sequences, such as a leader or secretory sequence (e.g., a pre- 
pro or pro-protein sequence), the sequence encoding the mature peptide, with or without the 
additional coding sequences, plus additional non-coding sequences, for example introns and 
non-coding 5' and 3 J sequences such as transcribed but non-translated sequences that play a 
role in transcription, mRNA processing (including splicing and polyadenylation signals), 
ribosome binding and stability of mRNA. In addition, the nucleic acid molecule may be 
fused to a marker sequence encoding, for example, a peptide that facilitates purification. 

Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the 
form DNA, including cDNA and genomic DNA obtained by cloning or produced by 
chemical synthetic techniques or by a combination thereof. The nucleic acid, especially 
DNA, can be double-stranded or single-stranded. Single-stranded nucleic acid can be the 
coding strand (sense strand) or the non-coding strand (anti-sense strand). 

The invention further provides nucleic acid molecules that encode fragments of the 
proteins of the present invention and that encode obvious variants of the Drosophila proteins 
of the present invention that are described above. Such nucleic acid molecules may be 
naturally occurring, such as allelic variants (same locus), paralogs (different locus), and 
orthologs (different organism), or may be constructed by recombinant DNA methods or by 
chemical synthesis. Such non-naturally occurring variants may be made by mutagenesis 
techniques, including those applied to nucleic acid molecules, cells, or organisms. 
Accordingly, as discussed above, the variants can contain nucleotide substitutions, deletions, 
inversions and insertions. Variation can occur in either or both the coding and non-coding 
regions. The variations can produce both conservative and non-conservative amino acid 
substitutions. 

The present invention further provides non-coding fragments of the nucleic acid 
molecules provided in the Figures. Preferred non-coding fragments include, but are not 
limited to, promoter sequences, enhancer sequences, gene modulating sequences and gene 
termination sequences. Such fragments are useful in controlling heterologous gene 
expression and in developing screens to identify gene modulating agents. Such regulatory 
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sequences are readily identifiable using know methods as well as by isolating the non- 
coding sequences found 5' to the transcription start site identified in the Figures. 

A fragment comprises a contiguous nucleotide sequence greater than 12 or more 
nucleotides. Further, a fragment could at least 30, 40, 50, 100, 250 or 500 nucleotides in 
length. The length of the fragment will be based on its intended use. For example, the 
fragment can encode epitope bearing regions of the peptide, or can be useful as DNA probes 
and primers. Such fragments can be isolated using the known nucleotide sequence to 
synthesize an oligonucleotide probe. A labeled probe can then be used to screen a cDNA 
library, genomic DNA library, or mRNA to isolate nucleic acid corresponding to the coding 
region. Further, primers can be used in PCR reactions to clone specific regions of gene. 

A probe/primer typically comprises substantially a purified oligonucleotide or 
oligonucleotide pair. The oligonucleotide typically comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least about 12, 20, 25, 40, 50 or 
more consecutive nucleotides. 

Orthologs, homologs, and allelic variants can be identified using methods well 
known in the art. As described in the Peptide Section, these variants comprise a nucleotide 
sequence encoding a peptide that is typically 60-65%, 70-75%, 80-85%, and more typically 
at least about 90-95% or more homologous to the nucleotide sequence shown in the Figure 
sheets or a fragment of this sequence. Such nucleic acid molecules can readily be identified 
as being able to hybridize under moderate to stringent conditions, to the nucleotide sequence 
shown in the Figure sheets or a fragment of the sequence. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences 
encoding a peptide at least 50-55% homologous to each other typically remain hybridized to 
each other. The conditions can be such that sequences at least about 65%, at least about 
70%o, or at least about 75% or more homologous to each other typically remain hybridized to 
each other. Such stringent conditions are known to those skilled in the art and can be found 
in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 
One example of stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45C, followed by one or more washes in 0.2 X SSC, 
0.1%SDSat50-65C. 
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Nucleic Acid Molecule Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, 
chemical intermediates, and in biological assays. The nucleic acid molecules are useful as a 
hybridization probe for cDNA and genomic DNA to isolate full-length cDNA and genomic 
clones encoding the peptide described in the Figures and to isolate cDNA and genomic 
clones that correspond to variants (alleles, orthologs, etc.) producing the same or related 
peptides shown in the Figures. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the Figures. Accordingly, it could be derived from 5' noncoding 
regions, the coding region, and 3' noncoding regions. However, as discussed, fragments are 
not to be construed as those which may encompass fragments disclosed prior to the present 
invention. 

The nucleic acid molecules are also useful as primers for PCR to amplify any given 
region of a nucleic acid molecule and are useful to synthesize antisense molecules of desired 
length and sequence. 

The nucleic acid molecules are also useful for constructing recombinant vectors. 
Such vectors include expression vectors that express a portion of, or all of, the peptide 
sequences. Vectors also include insertion vectors, used to integrate into another nucleic acid 
molecule sequence, such as into the cellular genome, to alter in situ expression of a gene 
and/or gene product. For example, an endogenous coding sequence can be replaced via 
homologous recombination with all or part of the coding region containing one or more 
specifically introduced mutations. 

The nucleic acid molecules are also useful for expressing antigenic portions of the 
proteins. 

The nucleic acid molecules are also useful in making vectors containing the gene 
regulatory regions of the nucleic acid molecules of the present invention. 

The nucleic acid molecules are also useful for designing ribozymes corresponding to 
all, or a part, of the mRNA produced from the nucleic acid molecules described herein. 

The nucleic acid molecules are also useful for constructing host cells expressing a 
part, or all, of the nucleic acid molecules and peptides. 
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The nucleic acid molecules are also useful for constructing transgenic animals 
expressing all, or a part, of the nucleic acid molecules and peptides. 

The nucleic acid molecules are also useful for making vectors that express part, or 
all, of the peptides. 

The nucleic acid molecules are also useful as hybridization probes for determining 
the presence, level, form and distribution of nucleic acid expression. Accordingly, the 
probes can be used to detect the presence of, or to determine levels of, a specific nucleic acid 
molecule in cells, tissues, and in organisms. The nucleic acid whose level is determined can 
be DNA or RNA. Accordingly, probes corresponding to the peptides described herein can 
be used to assess expression and/or gene copy number in a given cell, tissue, or organism. 
These uses are relevant for identifying insecticide susceptibility or tolerance involving an 
increase or decrease in Drosophila protein expression relative to normal results. 

In vitro techniques for detection of mRNA include Northern hybridizations and in 
situ hybridizations. In vitro techniques for detecting DNA includes Southern hybridizations 
and in situ hybridization. 

Probes can be used as a part of a diagnostic test kit for identifying cells or tissues 
that express a Drosophila protein, such as by measuring a level of a receptor-encoding 
nucleic acid in a sample of cells from a subject e.g., mRNA or genomic DNA, or 
determining if a receptor gene has been mutated. 

Nucleic acid expression assays are useful for insecticide screening to identify 
compounds that modulate Drosophila nucleic acid expression. 

The invention thus provides a method for identifying a compound that can be used 
to alter (inhibit or enhance) expression of the Drosophila protein gene. The method 
typically includes assaying the ability of the compound to modulate the expression of the 
Drosophila nucleic acid and thus identifying a compound that can be used to kill an insect. 
The assays can be performed in cell-based and cell-free systems. Cell-based assays include 
cells naturally expressing the Drosophila nucleic acid or recombinant cells genetically 
engineered to express specific nucleic acid sequences. 

The assay for Drosophila nucleic acid expression can involve direct assay of nucleic 
acid levels, such as mRNA levels, or on collateral compounds involved in the signal 
pathway. Further, the expression of genes that are up- or down-regulated in response to the 
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Drosophila protein signal pathway can also be assayed. In this embodiment the regulatory 
regions of these genes can be operably linked to a reporter gene such as luciferase. 

Thus, modulators of Drosophila protein gene expression can be identified in a 
method wherein a cell is contacted with a candidate compound and the expression of mRNA 
determined. The level of expression of Drosophila protein mRNA in the presence of the 
candidate compound is compared to the level of expression of Drosophila protein mRNA in 
the absence of the candidate compound. The candidate compound can then be identified as 
a modulator of nucleic acid expression based on this comparison and be used, for example 
to alter nucleic acid expression. When expression of mRNA is statistically significantly 
greater in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of nucleic acid expression. When nucleic acid 
expression is statistically significantly less in the presence of the candidate compound than 
in its absence, the candidate compound is identified as an inhibitor of nucleic acid 
expression. 

The invention also encompasses kits for detecting the presence of a Drosophila 
nucleic acid in a biological sample. For example, the kit can comprise reagents such as a 
labeled or labelable nucleic acid or agent capable of detecting Drosophila nucleic acid in a 
biological sample; means for determining the amount of Drosophila nucleic acid in the 
sample; and means for comparing the amount of Drosophila nucleic acid in the sample with 
a standard. The compound or agent can be packaged in a suitable container. The kit can 
further comprise instructions for using the kit to detect Drosophila mRNA or DNA. 

Nucleic Acid Arrays 

The present invention further provides arrays or microarrays of nucleic acid 
molecules that are based on the sequence information provided in the Figure Sheets, 
genomic sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 1531 , or transcript 
sequences SEQ ID NO: 2, 5, 8, . . . 1526, 1529 and 1532. 

As used herein "Arrays" or "Microarrays" refers to an array of distinct 
polynucleotides or oligonucleotides synthesized on a substrate, such as paper, nylon or 
other type of membrane, filter, chip, glass slide, or any other suitable solid support. In 
one embodiment, the microarray is prepared and used according to the methods described 
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in US Patent 5,837,832, Chee et aL, PCT application W095/1 1995 (Chee et al.), 
Lockhart, D. J. et al. (1996; Nat. Biotech. 14: 1675-125680) and Schena, M. et al. (1996; 
Proc. Natl Acad. Sci. 93: 10614-10619), all of which are incorporated herein in their 
entirety by reference. In other embodiments, such arrays are produced by the methods 
described by Brown et. al., US Patent No. 5,807,522. 

The microarray is preferably composed of a large number of unique, single- 
stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 
6-60 nucleotides in length, more preferably 15-30 nucleotides in length, and most 
preferably about 20-25 nucleotides in length. For a certain type of microarray, it may be 
preferable to use oligonucleotides that are only 7-20 nucleotides in length. The 
microarray may contain oligonucleotides that cover the known 5', or 3', sequence, 
sequential oligonucleotides which cover the full length sequence; or unique 
oligonucleotides selected from particular areas along the length of the sequence. 
Polynucleotides used in the microarray may be oligonucleotides that are specific to a 
gene or genes of interest. 

In order to produce oligonucleotides to a known sequence for a microarray, the 
gene(s) of interest (or an ORF identified from the contigs of the present invention) is 
typically examined using a computer algorithm which starts at the 5' or at the 3' end of 
the nucleotide sequence. Typical algorithms will then identify oligomers of defined 
length that are unique to the gene, have a GC content within a range suitable for 
hybridization, and lack predicted secondary structure that may interfere with 
hybridization. In certain situations it may be appropriate to use pairs of oligonucleotides 
on a microarray. The "pairs" will be identical, except for one nucleotide that preferably is 
located in the center of the sequence. The second oligonucleotide in the pair 
(mismatched by one) serves as a control. The number of oligonucleotide pairs may range 
from two to one million. The oligomers are synthesized at designated areas on a 
substrate using a light-directed chemical process. The substrate may be paper, nylon or 
other type of membrane, filter, chip, glass slide or any other suitable solid support. 

In another aspect, an oligonucleotide may be synthesized on the surface of the 
substrate by using a chemical coupling procedure and an ink jet application apparatus, as 
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described in PCT application W095/251 1 16 (Baldeschweiler et al.) which is incorporated 
herein in its entirety by reference. In another aspect, a "gridded" array analogous to a dot 
(or slot) blot may be used to arrange and link cDNA fragments or oligonucleotides to the 
surface of a substrate using a vacuum system, thermal, UV, mechanical or chemical 
bonding procedures. An array, such as those described above, may be produced by hand 
or by using available devices (slot blot or dot blot apparatus), materials (any suitable 
solid support), and machines (including robotic instruments), and may contain 8, 24, 96, 
384, 1536, 6144 or more oligonucleotides, or any other number between two and one 
million which lends itself to the efficient use of commercially available instrumentation. 

In order to conduct sample analysis using a microarray, the RNA or DNA from a 
biological sample is made into hybridization probes. The mRNA is isolated, and cDNA is 
produced and used as a template to make antisense RNA (aRNA). The aRNA is 
amplified in the presence of fluorescent nucleotides, and labeled probes are incubated 
with the microarray so that the probe sequences hybridize to complementary 
oligonucleotides of the microarray. Incubation conditions are adjusted so that 
hybridization occurs with precise complementary matches or with various degrees of less 
complementarity. After removal of nonhybridized probes, a scanner is used to determine 
the levels and patterns of fluorescence. The scanned images are examined to determine 
degree of complementarity and the relative abundance of each oligonucleotide sequence 
on the microarray. The biological samples may be obtained from any bodily fluids (such 
as blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or other 
tissue preparations. A detection system may be used to measure the absence, presence, 
and amount of hybridization for all of the distinct sequences simultaneously. This data 
may be used for large scale correlation studies on the sequences, expression patterns, 
mutations, variants, or polymorphisms among samples. 

Using such arrays, the present invention provides methods to identify the 
expression of one or more of the secreted proteins/proteins of the present invention. In 
detail, such methods comprise incubating a test sample with one or more nucleic acid 
molecules and assaying for binding of the nucleic acid molecule with components within 
the test sample. Such assays will typically involve arrays comprising many genes, at 
least one of which is a gene of the present invention. 



28 



Conditions for incubating a nucleic acid molecule with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid molecule used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or array assay formats can readily be adapted to employ the novel 
fragments of the human genome disclosed herein. Examples of such assays can be found 
in Chard, T, An Introduction to Radioimmunoassay and Related Techniques, Elsevier 
Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G. R. etal y 
Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1 982), Vol. 
2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of Enzyme Immunoassays: 
Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include cells, protein or membrane 
extracts of cells. The test sample used in the above-described method will vary based on 
the assay format, nature of the detection method and the tissues, cells or extracts used as 
the sample to be assayed. Methods for preparing nucleic acid extracts or of cells are well 
known in the art and can be readily be adapted in order to obtain a sample that is 
compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain 
the necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising 
one of the nucleic acid molecules that can bind to a fragment of the human genome 
disclosed herein; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of a bound nucleic acid. 
Preferred kits will include chips that are capable of detecting the expression of all of the 
genes expressed provided herein. 

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers, 
strips of plastic, glass or paper, or arraying material such as silica. Such containers 
allows one to efficiently transfer reagents from one compartment to another compartment 
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such that the samples and reagents are not cross-contaminated, and the agents or solutions 
of each container can be added in a quantitative fashion from one compartment to 
another. Such containers will include a container which will accept the test sample, a 
container which contains the nucleic acid probe, containers which contain wash reagents 
(such as phosphate buffered saline, Tris-buffers, etc.), and containers which contain the 
reagents used to detect the bound probe. One skilled in the art will readily recognize that 
the previously unidentified secreted protein genes of the present invention can be 
routinely identified using the sequence information disclosed herein can be readily 
incorporated into one of the established kit formats which are well known in the art, 
particularly expression arrays. 

Vectors/host cells 

The invention also provides vectors containing the nucleic acid molecules described 
herein. The term "vector" refers to a vehicle, preferably a nucleic acid molecule, which can 
transport the nucleic acid molecules. When the vector is a nucleic acid molecule, the 
nucleic acid molecules are covalently linked to the vector nucleic acid. With this aspect of 
the invention, the vector includes a plasmid, single or double stranded phage, a single or 
double stranded RNA or DNA viral vector, or artificial chromosome, such as a B AC, PAC, 
YAC, OR MAC. 

A vector can be maintained in the host cell as an extrachromosomal element where it 
replicates and produces additional copies of the nucleic acid molecules. Alternatively, the 
vector may integrate into the host cell genome and produce additional copies of the nucleic 
acid molecules when the host cell replicates. 

The invention provides vectors for the maintenance (cloning vectors) or vectors for 
expression (expression vectors) of the nucleic acid molecules. The vectors can function in 
procaryotic or eukaryotic cells or in both (shuttle vectors). 

Expression vectors contain cis-acting regulatory regions that are operably linked in 
the vector to the nucleic acid molecules such that transcription of the nucleic acid molecules 
is allowed in a host cell. The nucleic acid molecules can be introduced into the host cell 
with a separate nucleic acid molecule capable of affecting transcription. Thus, the second 
nucleic acid molecule may provide a trans-acting factor interacting with the cis-regulatory 
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control region to allow transcription of the nucleic acid molecules from the vector. 
Alternatively, a trans-acting factor may be supplied by the host cell. Finally, a trans-acting 
factor can be produced from the vector itself. It is understood, however, that in some 
embodiments, transcription and/or translation of the nucleic acid molecules can occur in a 
cell-free system. 

The regulatory sequence to which the nucleic acid molecules described herein can 
be operably linked include promoters for directing mRNA transcription. These include, but 
are not limited to, the left promoter from bacteriophage A,, the lac, TRP, and TAC promoters 
from E. colU the early and late promoters from S V40, the CMV immediate early promoter, 
the adenovirus early and late promoters, and retrovirus long-terminal repeats. 

In addition to control regions that promote transcription, expression vectors may also 
include regions that modulate transcription, such as repressor binding sites and enhancers. 
•3 Examples include the SV40 enhancer, the cytomegalovirus immediate early enhancer, 
Mi polyoma enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 
l Z In addition to containing sites for transcription initiation and control, expression 

\ 3 vectors can also contain sequences necessary for transcription termination and, in the 
2 transcribed region a ribosome binding site for translation. Other regulatory control elements 
; a for expression include initiation and termination codons as well as polyadenylation signals. 
: 4 The person of ordinary skill in the art would be aware of the numerous regulatory sequences 
that are useful in expression vectors. Such regulatory sequences are described, for example, 
1 3 in Sambrook et ah, Molecular Cloning: A Laboratory Manual 2nd. ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, (1989). 

A variety of expression vectors can be used to express a nucleic acid molecule. 
Such vectors include chromosomal, episomal, and virus-derived vectors, for example 
vectors derived from bacterial plasmids, from bacteriophage, from yeast episomes, from 
yeast chromosomal elements, including yeast artificial chromosomes, from viruses such as 
baculoviruses, papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, 
pseudorabies viruses, and retroviruses. Vectors may also be derived from combinations of 
these sources such as those derived from plasmid and bacteriophage genetic elements, e.g. 
cosmids and phagemids. Appropriate cloning and expression vectors for prokaryotic and 



31 



eukaryotic hosts are described in Sambrook et al. 9 Molecular Cloning: A Laboratory 
Manual 2nd. ed. 9 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989). 

The regulatory sequence may provide constitutive expression in one or more host 
cells (i.e. tissue specific) or may provide for inducible expression in one or more cell types 
such as by temperature, nutrient additive, or exogenous factor such as a hormone or other 
ligand. A variety of vectors providing for constitutive and inducible expression in 
prokaryotic and eukaryotic hosts are well known to those of ordinary skill in the art. 

The nucleic acid molecules can be inserted into the vector nucleic acid by well- 
known methodology. Generally, the DNA sequence that will ultimately be expressed is 
joined to an expression vector by cleaving the DNA sequence and the expression vector 
with one or more restriction enzymes and then ligating the fragments together. Procedures 
for restriction enzyme digestion and ligation are well known to those of ordinary skill in the 
art. 

The vector containing the appropriate nucleic acid molecule can be introduced into 
an appropriate host cell for propagation or expression using well-known techniques. 
Bacterial cells include, but are not limited to, E. coli, Streptomyces, and Salmonella 
typhimurium. Eukaryotic cells include, but are not limited to, yeast, insect cells such as 
Drosophila, animal cells such as COS and CHO cells, and plant cells. 

As described herein, it may be desirable to express the peptide as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of the 
peptides. Fusion vectors can increase the expression of a recombinant protein, increase the 
solubility of the recombinant protein, and aid in the purification of the protein by acting for 
example as a ligand for affinity purification. A proteolytic cleavage site may be introduced 
at the junction of the fusion moiety so that the desired peptide can ultimately be separated 
from the fusion moiety. Proteolytic enzymes include, but are not limited to, factor Xa, and 
thrombin. Typical fusion expression vectors include pGEX (Smith et ah, Gene 57:31-40 
(1988)), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, 
NJ) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, 
respectively, to the target recombinant protein. Examples of suitable inducible non-fusion 
£ coli expression vectors include pTrc (Amann etal, Gene <59:301-315 (1988)) and pET 
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1 Id (Studier et aL, Gene Expression Technology; Methods in Enzymology 755:60-89 
(1990)). 

Recombinant protein expression can be maximized in a host bacteria by providing a 
genetic background wherein the host cell has an impaired capacity to proteolytically cleave 
the recombinant protein. (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 1 85, Academic Press, San Diego, California (1 990) 1 1 9-128). Alternatively, 
the sequence of the nucleic acid molecule of interest can be altered to provide preferential 
codon usage for a specific host cell, for example E. coli. (Wada et aL, Nucleic Acids Res, 
20:2111-2118(1992)). 

The nucleic acid molecules can also be expressed by expression vectors that are 
operative in yeast. Examples of vectors for expression in yeast e.g., S. cerevisiae include 
pYepSecl (Baldari, et aU EMBO 1 6:229-234 (1987)), pMFa (Kurjan et aU Cell 50:933- 
943(1982)), pJRY88 (Schultz etal, Gene 54:113-123 (1987)), and pYES2 (Invitrogen 
Corporation, San Diego, CA). 

The nucleic acid molecules can also be expressed in insect cells using, for example, 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al, Mol Cell Biol 
5:2156-2165 ( 1983)) and the pVL series (Lucklow etal, Virology 770:31-39 (1989)). 

In certain embodiments of the invention, the nucleic acid molecules described herein 
are expressed in mammalian cells using mammalian expression vectors. Examples of 
mammalian expression vectors include pCDM8 (Seed, B. Nature 522:840(1987)) and 
pMT2PC(Kaufinan^a/. ? £M50J 6:187-195 (1987)). 

The expression vectors listed herein are provided by way of example only of the 
well-known vectors available to those of ordinary skill in the art that would be useful to 
express the nucleic acid molecules. The person of ordinary skill in fee art would be aware 
of other vectors suitable for maintenance propagation or expression of the nucleic acid 
molecules described herein. These are found for example in Sambrook, J., Fritsh, E. F., and 
Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

The invention also encompasses vectors in which the nucleic acid sequences 
described herein are cloned into the vector in reverse orientation, but operably linked to a 
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regulatory sequence that permits transcription of antisense RNA. Thus, an antisense 
transcript can be produced to all, or to a portion, of the nucleic acid molecule sequences 
described herein, including both coding and non-coding regions. Expression of this 
antisense RNA is subject to each of the parameters described above in relation to expression 
of the sense RNA (regulatory sequences, constitutive or inducible expression, tissue-specific 
expression). 

The invention also relates to recombinant host cells containing the vectors described 
herein. Host cells therefore include prokaryotic cells, lower eukaryotic cells such as yeast, 
other eukaryotic cells such as insect cells, and higher eukaryotic cells such as mammalian 
cells. 

The recombinant host cells are prepared by introducing the vector constructs 
described herein into the cells by techniques readily available to the person of ordinary skill 
in the art. These include, but are not limited to, calcium phosphate transfection, DEAE- 
dextran-mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection, lipofection, and other techniques such as those found in Sambrook, 
etal. (Molecular Cloning; A Laboratory Manual 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

Host cells can contain more than one vector. Thus, different nucleotide sequences 
can be introduced on different vectors of the same cell Similarly, the nucleic acid 
molecules can be introduced either alone or with other nucleic acid molecules that are not 
related to the nucleic acid molecules such as those providing trans-acting factors for 
expression vectors. When more than one vector is introduced into a cell, the vectors can be 
introduced independently, co-introduced or joined to the nucleic acid molecule vector. 

In the case of bacteriophage and viral vectors, these can be introduced into cells as 
packaged or encapsulated virus by standard procedures for infection and transduction. Viral 
vectors can be replication-competent or replication-defective. In the case in which viral 
replication is defective, replication will occur in host cells providing functions that 
complement the defects. 

Vectors generally include selectable markers that enable the selection of the 
subpopulation of cells that contain the recombinant vector constructs. The marker can be 
contained in the same vector that contains the nucleic acid molecules described herein or 
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may be on a separate vector. Markers include tetracycline or ampicillin-resistance genes for 
prokaryotic host cells and dihydrofolate reductase or neomycin resistance for eukaryotic 
host cells. However, any marker that provides selection for a phenotypic trait will be 
effective. 

While the mature proteins can be produced in bacteria, yeast, mammalian cells, and 
other cells under the control of the appropriate regulatory sequences, cell- free transcription 
and translation systems can also be used to produce these proteins using RNA derived from 
the DNA constructs described herein. 

Where secretion of the peptide is desired, appropriate secretion signals are 
incorporated into the vector. The signal sequence can be endogenous to the peptides or 
heterologous to these peptides. 

Where the peptide is not secreted into the medium, the protein can be isolated from 
the host cell by standard disruption procedures, including freeze thaw, sonication, 
mechanical disruption, use of lysing agents and the like. The peptide can then be recovered 
and purified by well-known purification methods including ammonium sulfate precipitation, 
acid extraction, anion or cationic exchange chromatography, phosphocellulose 
chromatography, hydrophobic-interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography, lectin chromatography, or high performance liquid 
chromatography. 

It is also understood that depending upon the host cell in recombinant production of 
the peptides described herein, the peptides can have various glycosylation patterns, 
depending upon the cell, or maybe non-glycosylated as when produced in bacteria. In 
addition, the peptides may include an initial modified methionine in some cases as a result 
of a host-mediated process. 

Uses of vectors and host cells 

The recombinant host cells expressing the peptides described herein have a variety 
of uses. First, the cells are useful for producing a Drosophila protein or peptide that can be 
further purified to produce desired amounts of Drosophila protein or fragments. Thus, host 
cells containing expression vectors are useful for peptide production. 

Host cells are also useful for conducting cell-based assays involving the Drosophila 
protein or Drosophila protein fragments. Thus, a recombinant host cell expressing a native 
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Drosophila protein is useful for assaying compounds that stimulate or inhibit Drosophila 
protein function. 

Host cells are also useful for identifying Drosophila protein mutants in which these 
functions are affected. If the mutants naturally occur and give rise to a pathology, host cells 
containing the mutations are useful to assay compounds that have a desired effect on the 
mutant Drosophila protein (for example, stimulating or inhibiting function) which may not 
be indicated by their effect on the native Drosophila protein. 

Genetically engineered host cells can be further used to produce non-human 
transgenic animals. A transgenic animal is preferably a mammal, for example a rodent, such 
as a rat or mouse, in which one or more of the cells of the animal include a transgene. A 
transgene is exogenous DNA which is integrated into the genome of a cell from which a 
transgenic animal develops and which remains in the genome of the mature animal in one or 
more cell types or tissues of the transgenic animal. These animals are useful for studying 
the function of a Drosophila protein and identifying and evaluating modulators of 
Drosophila protein activity. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, and amphibians. 

A transgenic animal can be produced by introducing nucleic acid into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster animal. Any of the Drosophila protein 
nucleotide sequences can be introduced as a transgene into the genome of a non-human 
animal, such as a mouse. 

Any of the regulatory or other sequences useful in expression vectors can form part 
of the transgenic sequence. This includes intronic sequences and polyadenylation signals, if 
not already included. A tissue-specific regulatory sequence(s) can be operably linked to the 
transgene to direct expression of the Drosophila protein to particular cells. 

Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et 
ah, U.S. Patent No. 4,873,191 by Wagner et al and in Hogan, B., Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar 
methods are used for production of other transgenic animals. A transgenic founder animal 
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can be identified based upon the presence of the transgene in its genome and/or expression 
of transgenic mRNA in tissues or cells of the animals. A transgenic founder animal can then 
be used to breed additional animals carrying the transgene. Moreover, transgenic animals 
carrying a transgene can further be bred to other transgenic animals carrying other 
transgenes. A transgenic animal also includes animals in which the entire animal or tissues 
in the animal have been produced using the homologously recombinant host cells described 
herein. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/IoxP recombinase system of bacteriophage PI. For a 
description of the crefloxP recombinase system, see, e.g., Lakso et ah PNAS 89:6232-6236 
(1992). Another example of a recombinase system is the FLP recombinase system of S. 
t cerevisiae (O'Gorman etal Science 257:1351-1355 (1991). If a cre/loxP recombinase 
1 system is used to regulate expression of the transgene, animals containing transgenes 
S encoding both the Cre recombinase and a selected protein is required. Such animals can be 
I provided through the construction of "double" transgenic animals, e.g., by mating two 
I transgenic animals, one containing a transgene encoding a selected protein and the other 

s containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, L et al Nature 555:810-813 (1997) and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a 
somatic cell, from the transgenic animal can be isolated and induced to exit the growth cycle 
and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 
morula or blastocyst and then transferred to pseudopregnant female foster animal. The 
offspring born of this female foster animal will be a clone of the animal from which the cell, 
e.g., the somatic cell, is isolated. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method 
and system of the invention will be apparent to those skilled in the art without departing 
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from the scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such specific embodiments. Indeed, 
various modifications of the above-described modes for carrying out the invention which 
are obvious to those skilled in the field of molecular biology or related fields are intended 
to be within the scope of the following claims. 
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Claims 

That which is claimed is: 

1 . An isolated peptide consisting of an amino acid sequence selected from the 
group consisting of: 

(a) the amino acid sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 
1530 and 1533; 

(b) the amino acid sequence of an allelic variant of the amino acid 
sequence shown SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said allelic variant 
is encoded by a nucleic acid molecule that hybridizes to the nucleic acid molecule shown in 
genomic sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript 
sequences SEQ ID NO: 2, 5, 8, . . . 1526, 1529 and 1532, under stringent conditions; 

(c) the amino acid sequence of an ortholog of an amino acid sequence 
shown in SEQ IDNO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said ortholog is encoded 
by a nucleic acid molecule that hybridizes to the nucleic acid molecule shown in genomic 
sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript sequences SEQ ID 
NO: 2, 5, 8, ... 1 526, 1 529 and 1 532, under stringent condition; 

(d) a fragment of the amino acid sequence shown in SEQ ID NO:3, 6, 9, 
. . . 1527, 1530 and 1533, wherein the fragment comprises at least 10 contiguous amino 
acids. 

2. An isolated peptide comprising an amino acid sequence selected from the 
group consisting of: 

(a) the amino acid sequence shown in SEQ ID NO:3, 6, 9, . . . 1 527, 
1530 and 1533; 

(b) the amino acid sequence of an allelic variant of the amino acid 
sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said allelic 
variant is encoded by a nucleic acid molecule that hybridizes to the nucleic acid molecule 
shown in genomic sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript 
sequences SEQ ID NO: 2, 5, 8, . . . 1526, 1529 and 1532, under stringent conditions; 
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(c) the amino acid sequence of an ortholog of an amino acid sequence 
shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said ortholog is encoded 
by a nucleic acid molecule that hybridizes to the nucleic acid molecule shown in genomic 
sequences SEQ ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript sequences SEQ ID 
NO; 2, 5, 8, . . . 1526, 1529 and 1532, under stringent condition; 

(d) a fragment of the amino acid sequence shown in SEQ ID NO: 3, 6, 9, 
. . . 1527, 1530 and 1533, wherein the fragment comprises at least 10 contiguous amino 
acids. 

3 . An isolated antibody that selectively binds to a peptide of claim 1 . 

4. An isolated nucleic acid molecule consisting of a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533; 

(b) a nucleotide sequence that encodes of an allelic variant of the amino 
acid sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said nucleic 
acid molecule hybridizes to the nucleic acid molecule shown in genomic sequences SEQ 
ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript sequences SEQ ID NO: 2, 5, 8, . . . 
1526, 1529 and 1532, under stringent conditions; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid 
sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said nucleic acid 
molecule hybridizes to the nucleic acid molecule shown in genomic sequences SEQ ID 
NO: 1,4,7, . 1525, 1528 and 1531, or transcript sequences SEQ ID NO: 2, 5, 8, . . . 
1526, 1529 and 1532, under stringent condition; 

(d) a nucleotide sequence that encodes a fragment of the amino acid 
sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said the 
fragment comprises at least 10 contiguous amino acids; and 

(e) A nucleic acid molecule that is the complement of a nucleic acid 
molecule of (a)-(d). 
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5. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

(a) a nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533; 

(b) a nucleotide sequence that encodes of an allelic variant of the amino 
acid sequence shown in SEQ ID NO:3, 6, 9, . . , 1527, 1530 and 1533, wherein said nucleic 
acid molecule hybridizes to the nucleic acid molecule shown in genomic sequences SEQ 
ID NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript sequences SEQ ID NO: 2, 5, 8, . . . 
1526, 1529 and 1532, under stringent conditions; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid 
sequence shown in SEQ ID NO:3, 6, 9, . . . 1527, 1530 and 1533, wherein said nucleic acid 
molecule hybridizes to the nucleic acid molecule shown in genomic sequences SEQ ID 
NO: 1, 4, 7, . . . 1525, 1528 and 1531, or transcript sequences SEQ ID NO: 2, 5, 8, . . . 
1526, 1529 and 1532, under stringent condition; 

(d) a nucleotide sequence that encodes a fragment of the amino acid 
sequence shown in SEQ ID NO:3, 6, 9 5 . . . 1527, 1530 and 1533, wherein said the 
fragment comprises at least 10 contiguous amino acids; and 

(f) A nucleic acid molecule that is the complement of a nucleic acid 
molecule of (a)-(d), 

6. A nucleic acid vector comprising the nucleic acid sequences of claim 4. 

7. A nucleic acid vector comprising the nucleic acid sequences of claim 5. 

8. A host cell containing the vector of claim 6. 

9. A host cell containing the vector of claim 7. 

10. A method for producing any of the peptides of claim 1 comprising 
introducing a nucleotide sequence encoding any of the peptide sequences in (a)-(d) into a 
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host cell, and culturing the host cell under conditions in which the proteins are expressed 
from the nucleic acid. 

11. A method for producing any of the peptides of claim 2 comprising 
introducing a nucleotide sequence encoding any of the peptide sequences in (a)-(d) into a 
host cell, and culturing the host cell under conditions in which the proteins are expressed 
from the nucleic acid. 

12. A method for detecting the presence of any of the peptides of claim 1 in a 
sample, said method comprising contacting said sample with an agent that specifically 
allows detection of the presence of the peptide in the sample and then detecting the presence 
of the peptide. 

13. A method for detecting the presence of any of the peptides of claim 2 in a 
sample, said method comprising contacting said sample with an agent that specifically 
allows detection of the presence of the peptide in the sample and then detecting the presence 
of the peptide. 

14. A kit comprising reagents used for the method of claim 12, wherein the 
reagents comprise an agent that specifically binds to said peptide. 

15. A method for detecting the presence of a nucleic acid sequence of claim 4 in 
a sample, the method comprising contacting the sample with an oligonucleotide that 
hybridizes to the nucleic acid sequences under stringent conditions and determining whether 
the oligonucleotide binds to the nucleic acid sequence in the sample. 

16. A method for detecting the presence of a nucleic acid sequence of claim 5 in 
a sample, the method comprising contacting the sample with an oligonucleotide that 
hybridizes to the nucleic acid sequences under stringent conditions and determining whether 
the oligonucleotide binds to the nucleic acid sequence in the sample. 
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1 7. A kit comprising reagents used for the method of claim 15, wherein the 
reagents comprise a compound that hybridizes under stringent conditions to any of the 
nucleic acid molecules. 

18. A kit comprising reagents used for the method of claim 1 6, wherein the 
reagents comprise a compound that hybridizes under stringent conditions to any of the 
nucleic acid molecules. 

1 9. A method for identifying an agent that binds to any of the peptides of 
claim 1, said method comprising contacting the peptide with an agent and assaying the 
contacted mixture to determine whether a complex is formed with the agent bound to the 
peptide. 

20. A method for identifying an agent that binds to any of the peptides of 
claim 2, said method comprising contacting the peptide with an agent and assaying the 
contacted mixture to determine whether a complex is formed with the agent bound to the 
peptide. 
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ABSTRACT 

The present invention provides amino acid sequences and transcript sequences 
that are encoded by 51 1 genes within the Drosophila melanogaster genome that are 
essential for survival, the Drosophila proteins of the present invention. The present 
invention specifically provides isolated protein and nucleic acid molecules, methods of 
identifying orthologs and paralogs of the Drosophila proteins and methods of identifying 
modulators of the Drosophila proteins (e.g. insecticides). 
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Celera Sequence No. : 142000013384069 

CCGGCCAGCTGAACGTGGTGGGCGCCGTGGGACTGGGCATCGCCTTTGTCCAGCTGTTCGGGCTGATCACCTCGATGCTGCTGTTCTGCACGGTG 

AAGCACAAGCGGGCCTCGGACACCTACAAGTCATACTCCCCGTCGATTGATCCTCAGACCCGCACCAGCAGTTGGGAAGATTGAACGCGCCTGTG 

ACGGTGGTAGGCGGACTTCTAAATTGGGGGGTTGCAGTCGGCCTCCCGAACTCCCTGTCGCTATCTCTATTCCTTTTGTTAACTCTCTTAGTTTC 

TAATTTATTCTGCGAAATGTGCCAAAAATTAATTTAATTCAAAGTGTATTTTTTTTTGTTTTTTTGTTTGTAATCTGTAGGCGGAGACAAGTCGA 

AACAAAATTGAACTGGCAGCAAAATGACAAATTCGCACTGGGCACAGTGTGAACATGTCATATTGCTGCCGGTGAAACAATCACATATTTAGTTA 

GCAGTGTATCCATATTGTGATGTGCTCTCTAGTAGTATAGTATAAGGCCCAAGGCCCGAAACGCACCTGGAACAGGGTATAGTGCGTTCCATACT 

TATCAGGCTGGCTGCTTAGCTGCAAGAGTACCTGACACATATATACAGCTAGTGTGAGTCGGAATCCTTGGACCTGGATATGCTTATGCTCGTAA 

CCCTTCAACTAAACCAATATATCAACTACAATATATATTTATATATAATTATATAATACTTATGTTACTTTATGGAAATCTGTAGTATTCGAAAT 

AGCAGAAACCGCAACTGCCCGAAAAGCGCGAGGAAAGCGAGAAGAGAAGAGAAAAGAAAAGAAGAGAAGACGAAGTGGAGAAAGAGGGAGCAAAT 

AGTTAAAGCAAATATTACATTACTACTATGCGTATGATTATTTTATTCATACATACAAATATTTATATCGAAATGCATTCCTAAACAACATGTTC 

GTGTTAAGTTACAAAATAAATTAAAGTATTTATTTAAATTACACGCCTTCTCAAAGGCCCACCAATTTGCCTCCTATGATATCCAATTGAGCCGT 

GATCCCCTTCGTCACGCTGCGCGCCAACTCTTTGGGCTTCTGCAAGAGAGAAATGGGGGGAAGAGAGTGGTCAAGAGAGGGAAAGCTGGCACTGG 

AACTGACCTTGCTCAAAGCGAAGCGCAAGCGCACACGACGAGCGACTAATTGCATAAGCAGTGCCAGCGGGTCGTGGTACTGTCAATCTGGCTTG 

AAAATAGCCAAGACTCAAGGACTTGCAGTAATACTGACAGCTAAGGCCGTAAGTTGATAGGCTCTTTCGTACGCATGTTTAAAGTATTGTTTAAA 

TTTTGTTTGGCTGCTGAGCCGGTGCATCGCTGGGACCATTATGCAACGTGTGTGCGAATGGATTGACCAAGAAACAGGGCAAACAAACAAACCAA 

ACACAATGGTCGCAATTTCGGCACTCCCTCACCTGAATAAAGGTGTTGAGCAGGTTGGAAAAGGCCTCGTAGCCGTTGACAATGGATCGCGAGAT 

GCCGTCGAACATGTCACCGTTGTTGCCGTACTTCTCGTGCTCACTGATGGAGTCCAAAATGGGAACGGGCTGAAATCAAAACTGGGTCCTAGTGG 

CTCTGGTCTTTGGTGGATATATATATGTATCACATCACTTACCTGCGCCAAATCGGCCAGCCTCTTTTGACTGTTGTCGTACACGCTCAGCACCT 

TCGAGGGAATGGCCTCCTGCTCTAGATTCACCGATTCCGCCGTTTTAGCCGGAGTGGGCAGGAGAGAGATCTGCCGAAAAATAAGGGGAAGTCAC 

TTGGATTTTTAAAGTATACCTCGGTTACTCACCTTGAACCCGGCGTCGCTGGCCTGCGCGTGGGTGGATTCCGCGGAGCTGTCGGCGGCATCGAA 

AGCATCCTGGGCTCCGAAGTCGGCGGCCGAGACGCGGGCGTCGATGGGCGTGGCGCCATCGGGCGTGGCAGGAGCTATGGGCGTGGCACAGATGC 

AGCGCATCATGGCCAAGAGCAGCAGCAGAGTATTCATCTTTTGAAGTTGCAGCATTTTCGTTAGTATGTGCGGAGAAGGGGGAAAGGGCCCAACT 

ATGTGGTTTTATAACAAGGTGAAAACTTCCAAATCCAAATCCCAGGTTGAGCTTCGATCCGAAGTCTCTCGAGTCCAAAATTCAACTGAATAGCT 

GCGTCTGCGCAACGATTTTATTGGTTTAACCCGACCACGAGAGTCGCCGCTTTGGACATTGAGCCACGATCGTTTTGGCTTTAGTTTGGGTTTCA 

GTTGAGCACAGGTTTTCAGGGGGCAATCGGGGGTTAGCGCCTCGTGGAGCCGCTGCATAGTTATGTGTGATTATTACGTTGCATAAAATGCAAAC 

TCTTTCAACGAAAATATTTTATTAAAATAATATAAATTTAATAAGATTGACCACAGAAGATTTAATTTGAATGTTTAAAATTATCTGGGATATTT 

GATTCATTTAAGTAGTTCACTTAGTTCACTCTTGTTCAAAGCTCCAAGGCTTGTTCAACTTTAGGAAAACTAAAATTTTATCTAATATCTTCTTT 

ATATCGCGAGAGCCGCCTAAATTTTTTCCTACAGTGTAGTGTATTGATCGTTATCGTAATGACAACAACAAATAGTCATAAAACTGTAACAATAA 

CAACATAGGCGAAATCTGCAACTTGTTGCTGTCAACAAATGTTGCCGTCATTCTGTCGCAACAATGAGGAAGGAAACTCGGCTTACTTGGTTTAC 

TCAGTTAGTTGGTCAATCGGTAGCCTCATCCCCCATCTACTCCCGCCAGTCCATCAGTAAAACCCCCGCGACACATATGCTGGTTGTGCAATTAC 

GACCACACTGAAAGAAATTCAATACAATGACGTAAACATCGCCTGAACTAAAACACTTTTTATTGATAGGAAAATGAGTTTTCTCGTTTGCACGG 

ATTTTTTGCCAGTGAATTTGTCAGGCAACAACACCACCGCCGCCCACTTTTACATTTCATGCAACTCAATCAATTAAGAGAGAGAGAGTTACTCA 

GCAACAAGGC 

(SEQ ID NO: 1) 

Exon: 2050.. 1838 
Exon: 1780 . .1658 
Exon: 1589. .1458 
Exon: 1084.. 1001 

Start ATG: 2050 {Reverse strand: CAT) 
Transcript No. : CT18623 

ATGCTGCAACTTCAAAAGATGAATACTCTGCTGCTGCTCTTGGCCATGATGCGCTGCATCTGTGCCACGCCCATAGCTCCTGCCACGCCCGATGG 
CGCCACGCCCATCGACGCCCGCGTCTCGGCCGCCGACTTCGGAGCCCAGGATGCTTTCGATGCCGCCGACAGCTCCGCGGAATCCACCCACGCGC 
AGGCCAGCGACGCCGGGTTCAAGATCTCTCTCCTGCCCACTCCGGCTAAAACGGCGGAATCGGTGAATCTAGAGCAGGAGGCCATTCCCTCGAAG 
GTGCTGAGCGTGTACGACAACAGTCAAAAGAGGCTGGCCGATTTGGCGCAGCCCGTTCCCATTTTGGACTCCATCAGTGAGCACGAGAAGTACGG 
CAACAACGGTGACATGTTCGACGGCATCTCGCGATCCATTGTCAACGGCTACGAGGCCTTTTCCAACCTGCTCAACACCTTTATTCAGAAGCCCA 
AAGAGTTGGCGCGCAGCGTGACGAAGGGGATCACGGCTCAATTGGATATCATAGGAGGCAAATTGGTGGGCCTTTGA 
(SEQ ID NO: 2) 

Start ATG: 1 (Reverse strand: CAT) 

MLQLQKMNTLLLLLAMMRCICATPIAPATPDGATPIDARVSAADFGAQDAFDAADSSAESTHAQASDAGFKISLLPTPAKTAESVNLEQEAIPSK 
VLSVYDNSQKRLADLAQPVPILDSISEHEKYGNNGDMFDGISRSIVNGYEAFSNLLNTFIQKPKELARSVTKGITAQLDIIGGKLVGL* 
(SEQ ID NO: 3) 



Celera Sequence No. : 142000013384666 

GCTGTCATAATGGCATAAAAGATTGAGATGGAAAACAAATTCATAGATAGCGAACAAATTGATGGTTACACTGCTAGAAAATTTTTTTTTAGATT 
TAGTAGCATACTTTAAAGAAATATGCAGTTATATCCTAATAGGATTAGTTATTAAGCTCCGTTTTTTTTTTGTTCTCTTTAAAAGTTTTAGTTTT 
TTTAGATATAAATAATGGCGTTCTTTGTACACATTTTTCTCTGTGCACATTAAACGGCTCTATTTGCAATTGAACTGCAATAGTAACAGTCACGA 
AAGGCAGTCAAAAACAAAGAAAAACGGCACTTCAAGTGCGCAATGGCCTCATTTTTGGGGGCAGTAAATGTAGCGCGTAGCCAAATTGCTCTGCA 
TTTCGAAATATCAGACATTGCAATTATCTATCGAGTATTCACTAAATGTAGTTAATTTTGTAAGCTTTGGCTTTCGAGCTTTCAAAATTGGCGCG 
TACTTTTAACAGTAACTGTTAGCAGTGTACTGTTAAGTCTGCATGCGATCTGGCAGCTCAGTTATAGAATTATTTTTGGCCAAACCAGAGTAACT 
CATTTCGCTCTTGGATTGAAGATGTTTTTAAGGCCGCTTTAATGGCGTTGACAATTACAAAATTTCGAACTTCGAGGGGGATGGTGGTGGCCGTA 
CAGCCCACAGTGGTGTTGAGCCCCAAAATACCAGATTCCCAGGATAACGAAAAGGTAATCAAAAATCTGTCAAATGTATAAACTCTCATCCTATT 
TGTTACAATTCGGCGTTGCCAAATCGTGAAAAGGTGCTATTGACCAGCTGTTTTGTGAAATACGTGATACGTACGGGCTGAGCACAATTTGTTTA 
AAATTTAAATCATGATAAAGTAGGTAATTATATTTACTTATTTGCACAGAAACCGTTTTCAAAATCGGAGCCGGGAAATGCAACTGTGAGAGTTA 
ATGCAGGTGGTATAGTTCTTAACAAAATACCAGCGATTATAAGGCCAAGTATGAAGAGAGCAGCAACAACAAAGATGACTGGAGCCACGGCTGCG 
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GGAGCAACAACAACAACATCGTCAACAGGTGCAGTGGGATGTGAGTAACAACCAATCGAACACTAATATGAAATATAAACAAATGGTTTTTAAAT 
ATTGTTCTTATACTTTAGATCCCGTTCTCAAAACACCCAAGTATGTGGTTCAGACTAGTCCGAGTGGATCCTCTGGCCATCAGCTCCAGATGCTG 
GCGAGGAAGGACACTCAAAGTCTGGGAGTGGCCATCAATTCACTGCCGCCCAACACAATCATCAAAGCAACCACAAGACCTTCACAAACAGCGCC 
TTTGACACCAAACTCAGCGGCTGTCACGCCAAGCACGCCGAGCAGTAGCAGGAATTCCACTCAGTCCACACCAACTGTGGTACCTGATGCGAGAG 
TTTCCTCCGCCGTGCGCCAAGCTGTGTTCATCAAGAGGGAGCTACCCCAGCCGCAGAGGAGCATGCGAAATATGACACTTGGTTTGGTGGAACAG 
GCGCCACTGCTTCATTTGGGTGTTGCGCCACAGCACCTGTCACTGCTGAAACGCCATATCTGCCGCAATGCTAATGTCACCCACTTGGACTGTTG 
CTTGACTCTAAGGAAACTCAAACAAAACGAGCACTTCGCCCTGTTGGCCGAGCACTTTGAGCTGAGCGAATCAGATGTCGAGGACACATTTAAGC 
GCACCCTTATCAAGCTGGCCCGTTACCTCCGTCCACTGATTCGTTGGCCAGATGCACGGCATCACAACGAGCGCTTCAAACATACCCCACTGAAC 
TACCGAGCCAACCTGTTGCATGTACGCTCGTTGATCGAGTGTGTGGAAACGGACGTGCCGATAGATCTGGGATTGGGCAGCGGCAGCTATAAGTT 
CATATTGTGCATCAATACAAATGGTAGTTTTTAGCGTAAACAGCGTACTTTCATTAACACTTCCCTCGCTTTTCAGGCATCATCAGCTATGTGTC 
TAGCGCCTTTCCTGGTAGTTGCGATGATCTTCAATTGTTTGAGGCCAGCAGATTTCGGGATGTCATTCCCAATTACCTAACACTATGCGCGGAAC 
CAGGCAAAGCAGTACGCCGTGCTCGCAGGTCGGGCTTCGGAGATCCTCACGACTCAGCGGATGAGGATGAGGCGGCGGCGGAACCAAAGCGATCA 
CTTACCAAATTCGAGGCACAGCGTTTGAGTGGCCAGCTAGCAAGCCAGCAATCCCTATCCGTTGTAGACGGAGCACTGACTTCCAAGCGGGCTCC 
AGCGATTCAACTACCCACATTCAACGCACAAGAACCCGCCTGTAGAGCCCAAATGAGAGATATGATAGATTATTTAAGGGAATTCCGCATGCTGG 
ATAATTCGGCTATTAAGCAAAAGTCATTGCTGGGTTATCTTGATGAAATGATCGTGGTGGCTGCGGGTCTATGCAACCTTAAGCGCCAAGAGTTG 
GAATCTTAAATAAGAAGTGCCATCCGCGAAGGTCTGTTTAAAGGCCAACCAGATGTTATTGTCCTAGTTATTTTAAGAATTTAATTTAATGTATT 
GTTAATGCAATGTAGAAACTGACTATTAACAAGTACGATTAAAAGGGTATATTTTAAAGTATTTCAAAGAATAAACACTACGTGGTCATTCAAGA 
GGTTTTTGCTTTTATCTGGAAACATACTCAAGTGCCTTATTTTCGAAAACATTATTAGTTTCTCGATTTCCACAGTCGTACTTGTCCTCTTGTTT 
GTTCAGCTTCCCCATAGTTAAACACTTGCAGTTGTATTTCTTTTATATTTTTGCTTTATTTACTTTACAAAAATACAAGCAAAACTAAGTACATA 
TTCAATGAAAGTTGTTCAATGAACCAGGGCAACATTGCCAATCGGAGTTTGCAATATTTGAATTTGAAGAATACGCGTATGTCGTACAATGTATA 
GATATTCCAAGGGTAGGTGCTCCATCATATCCGTTTACTCTACCATCGTTGGTCGCTCTTTCTCTTAGGGGGTAATAATTAAAAACTTGCTTAGT 
TAATAAGTGACTCTTCGTACTAATAGCTAAACATATTGTATATTGTGTGGTGTGGTGTATGCCTGACAAAACCAGCTCTAGCTCCTCATCCTGAC 
ATTATAACAAGTAGTTGGTTGGGCTACCCTTTTCGTCTCAGAGCTTCTGAATGATGTCGAACTTCTTCTTGAGGTCCACACTCTCGTTCTTGGGA 
TCGTACTTCTTCTTGGATGCCAATTTGGCGCGCATTTCGGCGGCCGTCATTGGTAAATGCAAGGTGCCCGCCTTATAGCCACTGGTCGGTGGGGG 
TGGTGTAGTGGGTGTGGTCATGCCATTTGTACCATTTCCATTGGTGTTATAAACACTGCCATTGTTGTTGATGCCAGCCTTAGTAGCGGACGGCG 
GTGGCGGTGTGCTCATCAAAGTGTTATTATTAATCGGCGATGAGGGTTTACTTCCATTT 
(SEQ ID NO: 4) 

Exon: 1001.. 1085 
Exon: 1159.. 1923 
Exon: 1977. .2479 
Start ATG: 1001 

Transcript No. : CT20444 

ATGAAGAGAGCAGCAACAACAAAGATGACTGGAGCCACGGCTGCGGGAGCAACAACAACAACATCGTCAACAGGTGCAGTGGGATATCCCGTTCT 
CAAAACACCCAAGTATGTGGTTCAGACTAGTCCGAGTGGATCCTCTGGCCATCAGCTCCAGATGCTGGCGAGGAAGGACACTCAAAGTCTGGGAG 
TGGCCATCAATTCACTGCCGCCCAACACAATCATCAAAGCAACCACAAGACCTTCACAAACAGCGCCTTTGACACCAAACTCAGCGGCTGTCACG 
CCAAGCACGCCGAGCAGTAGCAGGAATTCCACTCAGTCCACACCAACTGTGGTACCTGATGCGAGAGTTTCCTCCGCCGTGCGCCAAGCTGTGTT 
CATCAAGAGGGAGCTACCCCAGCCGCAGAGGAGCATGCGAAATATGACACTTGGTTTGGTGGAACAGGCGCCACTGCTTCATTTGGGTGTTGCGC 
CACAGCACCTGTCACTGCTGAAACGCCATATCTGCCGCAATGCTAATGTCACCCACTTGGACTGTTGCTTGACTCTAAGGAAACTCAAACAAAAC 
GAGCACTTCGCCCTGTTGGCCGAGCACTTTGAGCTGAGCGAATCAGATGTCGAGGACACATTTAAGCGCACCCTTATCAAGCTGGCCCGTTACCT 
CCGTCCACTGATTCGTTGGCCAGATGCACGGCATCACAACGAGCGCTTCAAACATACCCCACTGAACTACCGAGCCAACCTGTTGCATGTACGCT 
CGTTGATCGAGTGTGTGGAAACGGACGTGCCGATAGATCTGGGATTGGGCAGCGGCAGCTATAAGTTCATATTGTGCATCAATACAAATGGCATC 
ATCAGCTATGTGTCTAGCGCCTTTCCTGGTAGTTGCGATGATCTTCAATTGTTTGAGGCCAGCAGATTTCGGGATGTCATTCCCAATTACCTAAC 
ACTATGCGCGGAACCAGGCAAAGCAGTACGCCGTGCTCGCAGGTCGGGCTTCGGAGATCCTCACGACTCAGCGGATGAGGATGAGGCGGCGGCGG 
AACCAAAGCGATCACTTACCAAATTCGAGGCACAGCGTTTGAGTGGCCAGCTAGCAAGCCAGCAATCCCTATCCGTTGTAGACGGAGCACTGACT 
TCCAAGCGGGCTCCAGCGATTCAACTACCCACATTCAACGCACAAGAACCCGCCTGTAGAGCCCAAATGAGAGATATGATAGATTATTTAAGGGA 
ATTCCGCATGCTGGATAATTCGGCTATTAAGCAAAAGTCATTGCTGGGTTATCTTGATGAAATGATCGTGGTGGCTGCGGGTCTATGCAACCTTA 
AGCGCCAAGAGTTGGAATCTTAA 
(SEQ ID NO: 5) 

Start ATG: 1 

MKRAATTKMTGATAAGATTTTSSTGAVGYPVLKTPKYVVQTSPSGSSGHQLQMLARKDTQSLGVAINSLPPNTIIKATTRPSQTAPLTPNSAAVT 

PSTPSSSRNSTQSTPTVVPDARVSSAVRQAVFIKRELPQPQRSMRNMTLGLVEQAPLLHLGVAPQHLSLLKRHICRNANVTHLDCCLTLRKLKQN 

EHFALLAEHFELSESDVEDTFKRTLIKLARYLRPLIRWPDARHHNERFKHTPLNYRANLLHVRSLIECVETDVPIDLGLGSGSYKFILCINTNGI 

ISYVSSAFPGSCDDLQLFEASRFRDVIPNYLTLCAEPGKAVRRARRSGFGDPHDSADEDEAAAEPKRSLTKFEAQRLSGQLASQQSLSVVDGALT 

S KRAPAIQL PTFNAQEPACRAQMRDMI DYLREFRMLDN S AI KQKSLLGYLDEMI VVAAGLCNLKRQELES * 

(SEQ ID NO: 6) 



Celera Sequence No. : 142000013384697 

TCGATTTGTTTATAGTCTTTAGCGGCTCATTGTCACCAATTTTTGCTCTCTTGCGAAAGATGTGAACTTGGGCGCAATATTGATTTTTCGATTTT 
GGCTCATTGCTCGCATCACCGGGTGTTGTGCAAGATTCCCATTTGGGTCAGGCTCGGGAAAATGGGAAAGCTCTGGAATGGGGAAATGCGCATCT 
TTTTCACTTACCACAATGAGAATGCAGGTGACCAGTAGGCCGTACTTCCAGGGTGTTATGGGAAACGACTTCTCCATGTTGCCACAATGTCGCAC 
GTGAGTTTCCTTGGCAGTTATGCTTGTGAACCAAAGTATCTGGCTGCCGATAGAATGGAAGATATGACCTCCGAGTTCCGGAAACTCGCTCGATT 
CGCACTATGCCACTGGCTTAGGGCAGGGCATTGTTTACTTGCGCCCAGAGTAGGTCTACACCCACATTAAACTGGGCTCGGAGCTCTGAGCCCTA 
TTTCGGATTGGAACTCGTGGTGTTGATGCCACAAAAGCTGCCCTGCGACTGAGCGAAAAAATGAGAGACCCAAGGGCCCCTCTCGAATCTGAACT 
TTTTGGTCTGGCGCGTTCTGCGTTCGCTGGCAAACTGAAAGCGGGAAAAGTTTTGTTTTTCTCTCTTCCCGTGAGGACTGTTGTTCCCCAATCAA 
AATAAACCACACAGAGCGCACTCGCTCTCTCAAAAGAGAGCGTTGCACTTGGGTCAGAGCTCCAAGTCATGTCGACTGGCGTTTCCACACGGTGG 
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TGTTTTCGAATTCTTATCGGGTCTGTCATGAGCGTACATATGTGAACTGCTGAATGATTGAATGGATTCGAGCTTAAGCAATTAGGAAAATATTA 

ATGTATCCAGAGGTTTTTAGCACCGATGAAGCTATTAAACAACACAACCAAGGTTTAATTCTTTTTTCTTTCTTTTACGAGAGTGATGATGATGG 

TGACAGTTTTGTGGAAGATTGCCTAGAATGCTGTGAATACCAGCAACGATTCACCGAGTTGCTATGCATTCTAGCGCTCCTTTAGCGGCTTTGAT 

CTCCAGCTCCATTTTGATGGTGTTTATCAGGTGCACATTGGTGGCTCGATCCTTTCCTTCTCGGTGGCGGATGTCAGACTTGCCGGTGCCAATGT 

GCACATTCAAGGCGTCGTTTCTGATTAAATTCTTGGTGCACAGATAACAAGGATAGGCCACCTCGCCAATGACCTCGATCTGGATGATTGAATTA 

TTGGCCTCACTCAGGGACTTTTGCAGCGTGTCAATCTCCTCCAGGGCCGCCGTATGCGCCTGAGGCTGGCATACTGAGACATCACAAAGGACTTG 

ACAATGTCACTGTCCAGAATGTGTCCAGTGAAGCCTCCGAAAGGTGGGCACTAGCTCGAAGTTTTGCTCCCTGAGAATCCTCTCCGTTTCATGGG 

ATCCTGCCAGAAATGCCGTTGTGAAGTAAGTGATAACTATTATTTTTATAATTTTCCATACCCATAAGCCTCCAATCCAGTGGACCATCAGGATA 

CTATCTATTGAATATTTGCCCTTAAATCCCATTTCTCTGAAAATAAATCAGCCACTTCTTCTTCGCCTTTTTTTTTTTTTTGGTTTGGGTATTGA 

CCGATATTTCTTTTTGTGTATCGATATTGGGTCGAAAAACAAGGCATATCGATATATTATTATCTAATTATCAAGCCGCCAACGCCAAAATGTCT 

GCTACGTTAAACTTAGAAGACGCTACAAAAAATCAACCAACAGTGCGAGGTTGCAATTGCAACGCTAAAAGGAAGGAGTTTTTTCCTCCAATTTA 

AAAAAAGAGTCTGGCGGCTGAACACATCAGAATCACTTTGAGCAGACTCCTGAGCTTTTTAAGGGATAAACACATGCTTTTGGCAATTATTGTCG 

ACGACGAAAGCATGAACATCCCCAGCCGCACGCTGCGTGTCCACTGCTCTATGACATGGAGCACTGTCTGGCAAACTTTCGGAGATACTGTGAAG 

GCCACTGGACTAACCCCAGGAATGGGCCAACTGTGTGTACGTGGAGGAACAATGCGGAAATTTAAAGCGGAGATGATAATGATTCGCAACTGGAG 

AC T C T T C AG T GT AAG C GG AG AAG AT T T C T T C T T AAG C T T A T T G AAA GT GAAAC A AC T T AAAG ACT T C GAG T T T CG AT GC AGT GT T GC G AT G AAG A 

AACAGGGGTTTGGTCCTCTTCCTCCTATTAGGGCAAGTAAAAAACGTTTGCAAAGTTAAAATATTTGTATACTTTCAGGAAGAGCTTCCAGCTTC 

CGCCTGCGAGGCTGAGCCTTCCTTTGTGTCCGTCGCTGATCCTGACCACACCTAAACATATTCTGGCACCGACCTGCGATCAGCATATGTAAGCG 

GTCCGTTGCTCCAGCAGCCACATTTTGTACCAGTACCGCCCCCGTCCGACCACGAGCAGCAGCACCACCAGCAGATGCAACCGCGACCCCTACAG 

CAGCAGGTCAGTTACGTCAATCCAACTTGCCACCCAATTCCGT 

(SEQ ID NO: 7) 

Exon: 1513.. 1487 
Exon: 1300.. 1001 

Start ATG: 1513 (Reverse strand: CAT) 
Transcript No. : CT31974 

ATGGTCCACTGGATTGGAGGCTTATGGCCTCAGGCGCATACGGCGGCCCTGGAGGAGATTGACACGCTGCAAAAGTCCCTGAGTGAGGCCAATAA 
T T C AAT C AT C C AG AT C G AGGT C AT TGGC GAG GT G GC C T AT C CT T GT TAT CTGTGCAC C AAG AATT T AAT C AGAAAC G AC GC C T T G AAT GT G C AC A 
T T GG C AC C GG C AAGT C T G A C AT C C GC C AC C G AG AAGG AAAG GAT C G AGC C AC C AAT GT GC AC C TG AT AAAC AC C AT C AAAAT GG AGC T GG AGAT C 
AAAG C C G C T AAAG GAG C GC T AG AAT GC AT AG C AAC T C G GT G A 
(SEQ ID NO: 8) 

Start ATG: 1 (Reverse strand: CAT) 

MVHWIGGLWPQAHTAALEEIDTLQKSLSEANNSIIQIEVIGEVAYPCYLCTKNLIRNDALNVHIGTGKSDIRHREGKDRATNVHLINTIKMELEI 
K AAKG AL E C I AT R* 
(SEQ ID NO: 9) 



Celera Sequence No. : 142000013384626 

GCAGCAACTAAGTTTGGAGGCATCGGAGAAAAACGAGGAGCGACTGTTGCTGGCCAAGCATTTGTCCGAAAAGACGAAGATGTACGAACTGACCA 
AGCAGAAGCTGGAGGATGTTCAGGGTGACTTTGAGGCTACCCAGCACAAGCATGCCACCGTGCTGAAAGAGCTCCATCGCGAGCTGAACAAGTAC 
AAAAGGGGAATTACGGAACCGAAGACACCAATCAGCTACTGCAGCAACTGTCAGCAGGCGATCAATGGGTATCCAACGGAGAATCCTCAACAACG 
GAGTCATAGTCGCTCCAGTAGCCACGGCAGCATGCACAGCGGCAGTCGACGGGCCAGCGAGTCTTCCGAATCGGAAACGGTGGCCAGTAGTGCCA 
CCACCGTGCAACAGCCGCCACCGCAGCAAGATCTCCAGGCGGTACCGTCCAAAAAAGTCCTCGTAGAGCGAATTCTGCGACTGCAGCAGGCCACT 
GCCCGCCAAACCGAACGGATCGAGTTCCTGGAGAACCACACGGCCGCTCTGGTGGCCGAGGTGCAGAAGAAATCCAAGGTGGTGCAGCACTACAT 
GCTCAGGGATCAGACTGCCGGCGCATTAACCACATCGCGGAGCGATCAGAACAAAAGCGAGTTGGTCAAGTACGGAAATGGTATCATGGCAGCCA 
TCTATGGTGGTGGATCATCGAAGACGGGAGGCGAGAATAAAGCCATGTCCCTCGAACTTTCTCTAGAGATTAACAAGAAACTGCAAGCTGTGCTC 
GAGGATACGCTCTTGAAGAATATTACGCTGAAAGAGAACCTCGATGTGTTGGGACTCGAAGTGGATAATCTCACGAGGAAATTGCGCTCCCTGGA 
GGGCAGTTGCAAGTAAATGCCAGTCTTCTTATTCCATAGTATTCCATTCCAGAGTCTAACCCGCGTGTAGATAGTGTACAAATCAAATGTTCGTA 
TGTTTAAGTGTATAATATATGTATTTAATATTTCAACAACGTCCCCAGGCTTATAATAATCCTTCGTAAAGCTTGGCGCTCACGGGTTTCGGCTG 
GCCCACACCAAGGGCAAAGTTAACAGTTGGCTTTTCTCCATCCTTGGCTAATCTGGCCATTTTGTTGTCGGTCAGTTTGTAGTGGACCTTCATCT 
GGGGATTGGGGTTGCCCGTTTCCTCCGATACCAGTTCGCGGAGCAGAGGCGGAAGTTCAATCTTGGTGGGCAGTATCACTGGTTCAACCTTCTTC 
TTGTTGTTCAGATACTTTGCTTCCTGATCCTGCGGCACCAGTTCGTAGTCCGCCTTATAGCTGGTGCTGTAGATTTCCACAGGCTTTGGCTGCGT 
TACGCCGCGCCACGTTTTCTCCACAGTGACCCGCACCTTGCGGATCTCGCCGGGCTGCTCGTCCACGGACTGCACCTTCAGAATCCGCATGTAGC 
AGGGCTCCGGATATCGCTGGAACATGTTACGTATTAGCAGACGGCCCACACCCCAGTTGGGCAGGTTTGAGACCAGCTCCCACAGCGTGTTGCCG 
CGAAAGTCTGTGGTGCGACCGATGTACTTGATCACTTTTTGGGCCATTTTACTGGGTTATTTCACTATTTTAATTGAATTTTTGTTGCAAGTTAT 
AAAAACAGTGAGTTTGCCACTCACAGGGAAGGTCGAACAGCTGTGCCGGCTGGCTTAACAGGGCTGTTAAAACTTAACAGAAGGCTGTTATGAAC 
AGTAAATTGTGTGTGTTGGCAATGCTGCGCGGCGATTTTAAAACCAGTTAAATTTTGAAAAAATTAAATAATTTTTTAACTTCCCAAATAATAAA 
TGAAAACACACACACACATTTAAAATTTTAGTAACTAATAATCGCTTTAAAAGGTTTTAATCATGTTCTTGTCTCCTAGATCCAGATGTTTTGTC 
TAGTATAATATTTATTTTTAAAACAATTTGGTTTTTTTCAGAGATTTAAAGATCACCTAATATTTAAGGAATATTTAAATAATATTTATACATCT 
TTAAGTGCTGATAACTCACAGCCGCATGAGTCTGACTTGAATGCATTCATTATATTTACATAAATATGACTTTCAAACTAAGTTTAATATATAAA 
ATCTAGTGATAGTTAATCCCTCACTAATATGGAGCTCGAATATGTTCGGATAGTTCCTCTCCTCCGCCAGTGTGTCCCACTTCCCCGCCACTGGT 
GTCTAGGTGCTCATCGTAGGACTCCTCCTCGCCACCCAACTCCTGACCATCGTGCCCGTGGGCTGATGGATTCGTGCTCTCATGCAATCCGATCG 
TGGCATGCTGGGCCGCCTTGGATGCATTCACCTGAGCCGCAGCTGCGGAATTGGCCGCCTTAAGGGCGGACTCCTTGGTGGCCGCATAGTCTACG 
CGCGCCTGGTGCAACTGCTCCTCAACCTGCTCTAATCGGTTCTTGGACTGACCCACCATCTGGGATTGGGAGGCCAGTTGATTGTTCACCTCCGT 
GGAGGTTTGCTCAGCCTGTTCGGCCACGGATTTGGCATTATTTACCGCAGCTGTGAGCACGGATATGTGATGATGCGCCGCTTGTGCCGTTTGCT 
GTGCCAATCTCGCGGAGATCTTGGCCGCCTTCAGTTGCTCTAGCTCACGGGACAGAGATCTTTGGGCCTCGGCTGCCTGCTGCTCCAACTCCTGG 
AGAACAACCTGCTTGCCCACCAGAGCCGCTTGTGCAGTGGCCGCCGCCTGAGAAGCCGCTTGGGCCAGTGTATTCTGGGCTATATAGGCCGCCTG 
CTTGGCAGCTGCATGCTGGTTGGTCACTGCACTGTGAGCCTGATCCGCCGATCCCTGGGCAATGGAACGCAGGCCGCTGCCAATGCTATAGCCAC 
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CGGATCCCTTGCTGGAAACTGAGGAGCTACCTCCCGATGTGGTGTAGTAGCTTTTGGTGCTATGTTGTGGCTTCTCTGGCTCGGGAGGCTCTACT 
GCCGGCAGATGGGGCTCAAAGTAACCACCTTGGTGGATTGGTGGCGGTTGCTGCGGTCGGTAGTGATAGCCATTCCTCTTGTGCAGGCGACCCAA 
GGCATTGGACAAAAGGAGCACTAAAGGTATTTGGATATAAAATAAGATTTAACGTGGTTTTTTAACCGAATAGGAATCCTAAGTTAGGACCTAAG 
TAGGATTTAAGTTTTCAATATCCTGGTGTCTTTGTATCTATTTCTTTATAATTTGTATCTCTATATGTATATAAGACAAGTTCAAGGGATGTTGA 
AGATTTCCGAAGAGAATCTATTCGATTCGGGTATGAGAATAGTAATCTTCACTTTTCAACTCTTGGAATTTAATTTTTAAAGAACGGATCACTAC 
AACTTTTCACTTTGTTATAAAAGGGTATTCCCCGGCTTACCCAAAACAAGCTGGGCGAGAAGAAAACGTAATCCTTGGCCGTACATTTCGCTAGC 
GCTGCTGGAACTGCATTGAACTTCAAATTGAATGGAAGTGTACAATTTATAAGTCTTTCAGAGCCGCCAGCCGAGCATAAAGTTTATCAAGTTGC 
GGAAAGTCTGTTGCGGTTTTTAAATTTAACTGATCGCGGGCATGTTATGATTTTCCATCTCCAAGCTATTTGAATAGCTCTGCCCACTATTTGGA 
GAGCTGGGTCCACACACATGGCACCTTTACACCCATTTCCATCCATACCATCCCATCGAGGAAGTTTAAAGTGACCGCCTTTGAGCCACAAAATA 
TGGTGGCGTTTTTTTTTTAATGGTTCGTTGGCACGCTTGTTTGAACCAAACGGACAGTGTAAAGTGGTCGGTCTGCTCGGGTTTGTTAATGTTTT 
AAATTTAATTTGCATTTTATGTGCTGCACTACTTTGTAATTGAACCTTAAGTGGCTCTTTTGACGGGATTCAAGGTGCAACGATGCAGGGAAAGC 
CATTTGAGGGATTACGCTTTCGTTTAGATTAACTAGCCCACGAAATTTGGCAATTTACGAACGGGGGAATTTTCCAATCAAAATTTGAATATTTT 
T TT AATT GAAAAATT AT AT AAT GCTAATAT 
(SEQ ID NO: 10} 

Exon: 3411.. 3366 
Exon: 3060.. 2115 
Exon: 1571.. 1001 

Start ATG: 3411 {Reverse strand: CAT) 
Transcript No. : CT32256 

ATGTACGGCCAAGGATTACGTTTTCTTCTCGCCCAGCTTGTTTTGGTGCTCCTTTTGTCCAATGCCTTGGGTCGCCTGCACAAGAGGAATGGCTA 
TCACTACCGACCGCAGCAACCGCCACCAATCCACCAAGGTGGTTACTTTGAGCCCCATCTGCCGGCAGTAGAGCCTCCCGAGCCAGAGAAGCCAC 
AACATAGCACCAAAAGCTACTACACCACATCGGGAGGTAGCTCCTCAGTTTCCAGCAAGGGATCCGGTGGCTATAGCATTGGCAGCGGCCTGCGT 
TCCATTGCCCAGGGATCGGCGGATCAGGCTCACAGTGCAGTGACCAACCAGCATGCAGCTGCCAAGCAGGCGGCCTATATAGCCCAGAATACACT 
GGCCCAAGCGGCTTCTCAGGCGGCGGCCACTGCACAAGCGGCTCTGGTGGGCAAGCAGGTTGTTCTCCAGGAGTTGGAGCAGCAGGCAGCCGAGG 
CCCAAAGATCTCTGTCCCGTGAGCTAGAGCAACTGAAGGCGGCCAAGATCTCCGCGAGATTGGCACAGCAAACGGCACAAGCGGCGCATCATCAC 
ATATCCGTGCTCACAGCTGCGGTAAATAATGCCAAATCCGTGGCCGAACAGGCTGAGCAAACCTCCACGGAGGTGAACAATCAACTGGCCTCCCA 
ATCCCAGATGGTGGGTCAGTCCAAGAACCGATTAGAGCAGGTTGAGGAGCAGTTGCACCAGGCGCGCGTAGACTATGCGGCCACCAAGGAGTCCG 
CCCTTAAGGCGGCCAATTCCGCAGCTGCGGCTCAGGTGAATGCATCCAAGGCGGCCCAGCATGCCACGATCGGATTGCATGAGAGCACGAATCCA 
TCAGCCCACGGGCACGATGGTCAGGAGTTGGGTGGCGAGGAGGAGTCCTACGATGAGCACCTAGACACCAGTGGCGGGGAAGTGGGACACACTGG 
CGGAGGAGAGGAACTATCCGAACATATTCGAGCTCCATATTATAAAATGGCCCAAAAAGTGATCAAGTACATCGGTCGCACCACAGACTTTCGCG 
GCAACACGCTGTGGGAGCTGGTCTCAAACCTGCCCAACTGGGGTGTGGGCCGTCTGCTAATACGTAACATGTTCCAGCGATATCCGGAGCCCTGC 
TACATGCGGATTCTGAAGGTGCAGTCCGTGGACGAGCAGCCCGGCGAGATCCGCAAGGTGCGGGTCACTGTGGAGAAAACGTGGCGCGGCGTAAC 
GCAGCCAAAGCCTGTGGAAATCTACAGCACCAGCTATAAGGCGGACTACGAACTGGTGCCGCAGGATCAGGAAGCAAAGTATCTGAACAACAAGA 
AGAAGGTTGAACCAGTGATACTGCCCACCAAGATTGAACTTCCGCCTCTGCTCCGCGAACTGGTATCGGAGGAAACGGGCAACCCCAATCCCCAG 
ATGAAGGTCCACTACAAACTGACCGACAACAAAATGGCCAGATTAGCCAAGGATGGAGAAAAGCCAACTGTTAACTTTGCCCTTGGTGTGGGCCA 
GCCGAAACCCGTGAGCGCCAAGCTTTACGAAGGATTATTATAA 
(SEQ ID NO: 11) 

Start ATG: 1 (Reverse strand: CAT) 

MYGQGLRFLLAQLVLVLLLSMALGRLHKRNGYHYRPQQPPPIHQGGYFEPHLPAVEPPEPEKPQHSTKSYYTTSGGSSSVSSKGSGGYSIGSGLR 
SIAQGSADQAHSAVTNQHAAAKQAAYIAQNTLAQAASQAAATAQAALVGKQVVLQELEQQAAEAQRSLSRELEQLKAAKISARLAQQTAQAAHHH 
I S VL T A A VNN AK S V AE Q AE QT STEVNNQLASQS QM VG Q S KN RL EQ V EE QL H QARVD Y AAT KE S AL K AAN S AAAAQ VN A S K AAQH AT IGLHESTNP 
SAHGHDGQELGGEEESYDEHLDTSGGEVGHTGGGEELSEHIRAPYYKMAQKVIKYIGRTTDFRGNTLWELVSNLPKWGVGRLLIRNMFQRYPEPC 
YMRILKVQSVDEQPGEIRKVRVTVEKTWRGVTQPKPVEIYSTSYKADYELVPQDQEAKYLNNKKKVEPVILPTKIELPPLLRELVSEETGNPNPQ 
MKVHYKLTDMKMARLAKDGEKPTVMFALGVGQPKPVSAKLYEGLL* 
(SEQ ID NO: 12) 



Celera Sequence No. : 142000013384830 

TATAGTATGGTAGACTTAATAACAGTTGGCGTTCTTTGTTAACCCATTAGTGCCGCTTTGTCAACTGATGTATGCGATTGGCCGCTCAAACGCCA 
CGTAATTGCTTCAATCTGAGAGCCCCGCTATAACCTACAAAGTGCCGCTGATTAGCGGCATCTTTTGCTGGCCTAGCGGTACCTGGCACCCTACT 
ATGTTGGGGTATTTTCGGTACTCAAGTACCCTACTCACCCCTCGCCCTGCGGCTTCAGCGCCAGCTCCTTGAAGTTCTCCACCGTCTTGGGCACC 
GTCTTGCCGAACAGACCGATCTCGATGCGACCAGCGGGCTCGCCGCCAATGGTGATGTCAAAGAAAACCTACGCAATCGGAAGCGGTTTATTATT 
AACAAGTCATCTCACTTGATTTATGGCGCAGCAAATGTGTGTGTGTGTTTGTGTGTGTATGTACGTGGTGTATGCAGAATGTAAGGACGAAAGTC 
CAAGGAGCACGCACCTTCTCGGTCACTTTGGGACCCTTGCTATCGTCGGCAACAACGACGCCGGCCACCAGGGCTACCACGAAAACGGATAAGAA 
CAGCTTCATCGTTCTAAATATTCACAGCGCTTCAGTTTATTCCTCAAGCGACGTCGCCAATGGATATTTCACGACTGAATTTTCGGTTTCTGTTA 
GTGATGTAAGAAAACTTTGTTGCATCGCCAATCGATGTTTTCCATCGTTTTTTATGTTTTTGTCGAAATATCGATGCTTAAGTGTGACCATTCGA 
TGACGATGTCTGATGATCACACATTCGGGCGGAAAATTCAAATTCAAATAGTTAATACGCCGAAAGCATAAGTTAAGGTTTTCGTACGAAGGAAC 
AAAGTAAGTTCGAAGAGTTGACTTTATTGGTATAGTTTTTAATATTTTAGGGATTAGAAGTCCAGTTTGCATTAGCACGTAGCATTCTCTCTTGT 
TTTATTGACGGTTCTGTACACAAAAAGTAAGTAAAAAAATATGCTTTTATTTATTGGTGTACTGCTATTTGCGTCTGAGATTTGTCCATATCCTT 
GGCTGCCTTCTCTGCTCTCTTCTTCTCCCTGCTGGCCAGAACTTTCTCCCTGGATTTCTGGGCCCTTTCAGCGCAATCCTTGGGTCGTGTCTTCA 
GGGTCATATCATCCACGGCCGCCTGTTTACGGGCTGATCTCGAAATGCATCGTTCCATGCAGCTGATGGGCACAAATCTTCCAATGAAGCAACCG 
CAGTGTGAACAATAGTGATTGATGTCCTGGCGGCCAGACCATTTTTTACTGAAATAAAGTGAACATTACGAAAGATTTATAAAGCTTCATGCAGA 
TGGCCAACCAAAGTTTTGTGAATCCCAGAAATCTCTGCAGGCAAGTGACCAACTCCAGTTGCACCAAACTGGTTCCAGACTTCTCACACGAAGGA 
CAACGGATCCAGGTGGGCTCCTCCTTCAGGTATCCCACTTGTGGCTTGTGGCTGACAATGATGGCCACAATCTGCGGTTCGTCCACGGTCATGAT 
GCGAGTTCCAGCGAGAACGATCGGTGACTGAAAGATGGATCAAATTTGGAGAGATCATGGGAGGGAAAATCAAAATCAGAATCACTCTCCCGATT 
TCTCTTAACCGAGGCGTTGGTCTGGTAATTAAGTTAGTGCATGGAAATCAAATGAGTTATTCTCTCTGCGTTACAAATGCCATTGGATGTACAGT 
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GTAAAAAGTATAGTGTACATTTTGAGCATTACACATTAAATAGTAAACTTTTATTCGTTTCAAACAGCAGTATATTTATTA.TTATTATTGAATAT 
AAATTAAATGACAAAACTGATCTACATTTGTTCATTGTGTTATAATAATTCGGATCCGATGATCCTTCTTCAGTGGGTATCTAAAGTTGTGGACG 
GCGGTATACGCCCAGGTACTCCCCGCAGGCACCGCAGTGATGCCTCTTGGTGCAGCAACGCAGCCAGGAACGCACCACACGGGTCTTGGCCTTGG 
TCCTGCAATAAGGACACGTAATGTCGTTGGGGCCGGGGCCCACGGCAAAGTAAAGTGGTTTCGGATCCTGAGCCATTTCGCACCGAAAGATGTTG 
ACTGCTTTAAAAACAAACGCAGACTGAGGCTAGCGMGAGAMTTGTAGGAATAAAATGTTCAAATGATATGCAAAGAGAAAAAAATCGGCCCAA 
AAAAGAGAAAATGTGATATTTTTGGCTTTATTTTTGTTTCGATGGTATTGCAGTAGTGGGTGGGAAATTGAGTGTTTAGAAGAATTTTAATAAAT 
ATGTGAACAATGCTGCCTAGGCGGTGGTTGCCTTCAATTTGAGTGGTTGCCTCCTCTTCATGGAACGTCCAAAGTAACAGCCGCAATTGCCGCAG 
TAATGACTTCGATCGGTTTGTTTGCACGGCACATAGTAATTAAAGCAGCAGCAGCAGCAGAAGGTCCTGTAAAAGTATTTTATAAGTTTACGTTT 
CCAAGAAATAGGAGAAAGTAAGGCAACCTACACAAAAAGACATCCCACAATGCGGTTTAGTTCACTGGCCCACCACTTCAGATCATTTTGCACCA 
CACTCAGATCCTTGATTCCGCAGGCGGGGCACTCGATCTCCACGGGCTCGCGTGGCAGGTTGAGAAAAGTGCCTGCCAGGGGCGTGGCAAAGTTC 
TCGTAGATCTTGCGCTCCTTTCGGTCCTCCAGAGCCTTGGCCTCCCGCTCCCTGATCTCCTGCTGCGTTTCCGCTTCCTTCTCCGGCAGCGTTGT 
CGTCATGATCGCTCAACAATCGAACTGAAAGCGAAACGCTGCCAAGCACGCTGGCAACCTGTTTCCAAACTCAAAAGAGAACGTGAGTGCACAGA 
GAGAAATAATATGGAATTTTTGGCCAATATATTCCCCCGGTTTTCCGCCCAAAAATATGCAGTTTTTTTGATCGAAAAAGTAAAAATAATGAGCC 
AAATATGGAATGGGTTACATCGTTGAATTCATAATACAATTTTCTATCGACTGGTGTTGAAGCCTTAGATTTTCAAATTTTCGCCTTTTTTTGCA 
TAATATCATGATGTTACCCCTTATAAAAAATGCGAAAATTAAAAAAAAAAAAATTTTGAAATCAAACTAAAAAGTATTTTCCTTCGTTAGTTTGC 
TTAATTAGGTGCACAAAACAGTATTTATCATAGCATTGTGACTCTTTTTTCTGAACTTATGGAAAAAACACTGTTTATTGAAAATGTACTTGTGA 
TAACTTTGAGTTTTTGAAAAAAAAAAAAACTATGAATAAATTCTCAACAGTTAAGCTATATAAAAAAAGTATACTTTTGTTCTCAAATAGTGTTT 
AAACAAGTACAGGTTTTATTTCTGTAAGTGTATTTTCCAACTCTGGCAAAGAGTATGAGTATGATTTTTGAAAGAGCATAGAAAGAGCAGCGGCG 
AAATTGATTTACAGTGTGTCTTTAATTGAGTTAGACGCGTAATAGAATTATAGAACAGATAAATATTTGGCATTAATTTCGATTTAATTGGTCCG 
CGGGATGGGCTACTTGGCCTTGCGCTGGATTCCCACTTTGCCGCCACACGAGCTGCAGTAAACGCCCTTGGTGGTCCACTCCCGATTGCAACCGC 
ACCAGGTGCACACGCAGCAGATCCAGAAGAGCGGAAAGATGCTGGAAAATGAAATGGAATAGTTGGTGCAAGGAACGAGATCCCAGACAGTAGAT 
GTCCCCTTACCAGGATAGCAGGCTGAGGCAGAAACTGGCGTCCTTGCAGCTGGCTG 
(SEQ ID NO: 13) 

Exon: 2761.. 2519 
Exon: 2119.. 1910 
Exon: 1547.. 1339 
Exon: 1283.. 1001 

Start ATG: 2761 (Reverse strand: CAT) 
Transcript No. : CT32884 

ATGACGACAACGCTGCCGGAGAAGGAAGCGGAAACGCAGCAGGAGATCAGGGAGCGGGAGGCCAAGGCTCTGGAGGACCGAAAGGAGCGCAAGAT 
CTACGAGAACTTTGCCACGCCCCTGGCAGGCACTTTTCTCAACCTGCCACGCGAGCCCGTGGAGATCGAGTGCCCCGCCTGCGGAATCAAGGATC 
TGAGTGTGGTGCAAAATGATCTGAAGTGGTGGGCCAGTGAACTAAACCGCATTCCTCAGTCTGCGTTTGTTTTTAAAGCAGTCAACATCTTTCGG 
TGCGAAATGGCTCAGGATCCGAAACCACTTTACTTTGCCGTGGGCCCCGGCCCCAACGACATTACGTGTCCTTATTGCAGGACCAAGGCCAAGAC 
CCGTGTGGTGCGTTCCTGGCTGCGTTGCTGCACCAAGAGGCATCACTGCGGTGCCTGCGGGGAGTACCTGGGCTCACCGATCGTTCTCGCTGGAA 
CTCGCATCATGACCGTGGACGAACCGCAGATTGTGGCCATCATTGTCAGCCACAAGCCACAAGTGGGATACCTGAAGGAGGAGCCCACCTGGATC 
CGTTGTCCTTCGTGTGAGAAGTCTGGAACCAGTTTGGTGCAACTGGAGTTGGTCACTTGCCTGCAGAGATTTCTGGGATTCACAAAACTTTGTAA 
AAAATGGTCTGGCCGCCAGGACATCAATCACTATTGTTCACACTGCGGTTGCTTCATTGGAAGATTTGTGCCCATCAGCTGCATGGAACGATGCA 
TTTCGAGATCAGCCCG T AAAC AG GCG GCC G T GGAT GAT AT G AC C C T G AAG AC AC G AC C C AAGG AT T GC G C T G AAAGG GC C C AG AAAT C C AGGG AG 

AAAGTTCTGGCCAGCAGGGAGAAGAAGAGAGCAGAGAAGGCAGCCAAGGATATGGACAAATCTCAGACGCAAATAGCAGTACACCAATAA 
(SEQ ID NO: 14) 

Start ATG: 1 (Reverse strand: CAT) 

MTTTLPEKEAETQQEIREREAKALEDRKERKIYENFATPLAGTFLNLPREPVEIECPACGIKDLSVVQNDLKWWASELNRIPQSAFVFKAVNIFR 
CEMAQDPKPLYFAVGPGPNDITCPYCRTKAKTRVVRSWLRCCTKRHHCGACGEYLGSPIVLAGTRIMTVDEPQIVAIIVSHKPQVGYLKEEPTWI 

RCPSCEKSGTSLVQLELVTCLQRFLGFTKLCKKWSGRQDINHYCSHCGCFIGRFVPISCMERCISRSARKQAAVDDMTLKTRPKDCAERAOKSRE 
KVLASREKKRAEKAAKDMDKSQTQIAVHQ* w 
( SEQ ID NO: 15) 



Celera Sequence No. : 142000013384826 

GGTCACCTGCTGGCGGATCTGTTTTTGCAACTTCTGCATCTGCAGCATGAGCAGAGCCTCGTTTCGCATCTGGCGGTAGGAGTAATCCATTTTGG 
CCAGTACCAACAGGATCTTAGTATTATGCCTGCGACGAATTTAACTAGTGCATGAGCTTTCAGTTCATTTCCACCTTACTTACTGCAGACGCGGC 
TCGGTGAGGATTGAATAGAAAAGAACGCCGGCGGCTGAAATCTGACAGAGATTAGAGGTGTCCACGACGTAGATCAGCTTCTTGACATCCTCGAA 
ATACTGCCTCCACAAGGGGGCCATACTCCCTCCAATTTCCAGGATCTGTATGGACTTGGGCAGATTTTTACCGCCATGAGGAATGTTAGCTGGAG 
CTTCTGAGGGAGGAGGCTTATTTTTATCCCCATTTGGCGATTTTGTTGGGAAATGGATTCGGTAAATCCCAGTGCCAATGGTGGGCATGGAAAAC 
GTGGTCTCATCGATGGATTCCGGATCCTGTAAAGCTTTCAAAAGGTGCGTTTTGCCCGCTCTTTTGGGGCCCAAGCAAATGCAGGTGGACATTTT 
TGCTTTTACTTGGATTTTATTTATGGATCTAACAAAGGTAGCCTTGGTTACCAAAGTTTATTAATTAATTAATTAAATTAAAATTGTAAAAGTTA 
GTTCAGTAAGCACGTACATGTTGTTATTTCTGGCTTATTTTATATATAAAATATTTACAAATGGTTGAACATTTAATATCAATCGCGTAATCGAA 
ATCTTGTGTCAGTAGTTCAATTATTTTCTGCATGATGGCGCTCTCAGGAAATCAGTGCAAACGTTTGAACTGCGTGCTTCGGGAGTCACGAAACT 
CGGAACCATTCGGCCACACTTCATATTAGCACATCAATGCATAATGACTATCGATAAAACATCGATGTATTTTTCATCGTTTTTTCATGCTGTCT 

CAGTTCTCAGCGGACGTGAACGGCGTTCCCACGGAGATAGTGTTCCACTGCTTTGCCAAGAAGTGGTTCTTAGTCATAACGCAGCTCGGCAAGAT 
TCCCGGCATCTACAACGTGCACTTCGATGTCAAAAAGGACGAAAGAGTGGTTCCCTATTTGCACGGACCCGTGGACAATCCGGAGTTCCACGTAT 
CGGTGCCAATCACGATGAACTGCTGCTTGGGCTTGGACACCGACGAGACTCGCAGCGCCATACAGTTTCTGGTCAACAGAACCGGACTCCACAAG 
TGCCCGACGGAGTTCGTGGTGGGCTTGGGTCTCAAGAAAATCGACGGACCCAATCTTCGTGCCCTAGCTAAGGTCCTCGAGGAGACGTTATTCTA 
GTCTGCGTTCGTTATATCCAATAAATATTAGCTGTGACAGTTTTTGTAAAAAAAACCTTGAACATTTTAATCGTAGCCTTCAACCCGGCTGACCA 
AATTATTGGCCATTCTACTGCAACCTCTTTCATGTGCTTAAAAACCCAAAATAGAATCTACTCACAAAAAGGCAGTGGAATGACCAATAACATAG 
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TAAACCATAATAATTCTGCACCGTTGTGAAAGGAATGAGTCTCGATAATTTTTTAGCAAACGTCTTCTATATTTCCTCACATTTGTACATTAAAA 
ACGTGTACGCAGGTCGGGAAACGGGTACACGTCCACCTCTCGAGGATTACAGCACAAACACAACGGATGTCCGGTTGTCGAGAAGTTCTCTAGAG 
GATTTAGTCGAACAGACCGAAGCCCATGTCGTCGTCAGACTGGTCGGACTCCTCCTCCTTCTTCTTCTCCTCCTTCTTGGACTCGGCGGCTGGAG 
CAGCGGCGGCGGCAGCAGGGGCAGCACCACCGGCGGGAGCGGCACCAACTCCGGATCCGATGTTGGTGATCAGGTCCTTGACGTTGATGCCCTCC 
AGGGCCTTGGCGAAGAGACCGGGCCAGTAGGGCTCCACCTCGACGTTGGCGGCCTTCAGGATGGTGTTGATCTTCTCACCCTGGTGAATCCAAAA 
AATAAGATCAGCACAAGTTGCAGCAGTGCGAAAGAAACCATAGGACCCGCAAATTGTCCAATACTTCCGGCCGATGGCCGCGGCTTTGGAGGATT 
TGCTTCGTCCCCGATTGCAATATCGAAAAACTTACGGTGACGGCGACATCGTCATCGACGAGGATGAGGGAGGCGTAGACGCAGGCGAGCTCGGC 
TTTGGTGGACATGTCGAAGTGCTGTGGCTAATCACTGATTGTGCAAGTTGCCAAAAATTAATTGTTTATAACAAATGGCCTTAGCTCGGGACGAG 
AACAAGCACGCTGGTCAAAATCCTAAACCGGAAAGAGCAGTTGATATGCAG 
(SEQ ID NO: 16) 

Exon: 1001.. 1426 
Start ATG: 1001 

Transcript No. : CT33132 

ATGAACCACACAGCAAACAAACCTCGACAAATCGGATTCGACAGGCAGTTCTCAGCGGACGTGAACGGCGTTCCCACGGAGATAGTGTTCCACTG 

CTTTGCCAAGAAGTGGTTCTTAGTCATAACGCAGCTCGGCAAGATTCCCGGCATCTACAACGTGCACTTCGATGTCAAAAAGGACGAAAGAGTGG 

TTCCCTATTTGCACGGACCCGTGGACAATCCGGAGTTCCACGTATCGGTGCCAATCACGATGAACTGCTGCTTGGGCTTGGACACCGACGAGACT 

CGCAGCGCCATACAGTTTCTGGTCAACAGAACCGGACTCCACAAGTGCCCGACGGAGTTCGTGGTGGGCTTGGGTCTCAAGAAAATCGACGGACC 

CAATCTTCGTGCCCTAGCTAAGGTCCTCGAGGAGACGTTATTCTAG 

(SEQ ID NO: 17) 

Start ATG: 1 

MNHTANKPRQIGFDRQFSADVNGVPTEIVFHCFAKKWFLVITQLGKIPGIYNVHFDVKKDERVVPYLHGPVDNPEFHVSVPITMNCCLGLDTDET 

RSAIQFLVNRTGLHKCPTEFVVGLGLKKIDGPNLRALAKVLEETLF* 

(SEQ ID HO: 18) 



Celera Sequence No. : 142000013384809 

TACAACTCAGTCGCCGCACTTTAACAACAATATTTTATTTTTGTGTGTATTATTCAAATAGGTATACCTTCCGACAGCTCTAAGTCCAGTTCGGC 

GAGCTTCTCGCGGACGTAGCCAGGCAGCGAGGCAGTGTGCTCCATGGCAGATGAAAAGTGTGGCTTTAGGGTTTTGTTCTATCCTCTTCCTTGGA 

GGATTCGTCTAAACTTTGGCTGGTCGCAGTACACAAACACACCCAGACACCGCTACTTTCCCACTTCGTTGTCAAAAATCGATCGATTTGCCATT 

TATTCTATTTCACATTTTGCCGACGCAAATAAATACATCCGTACTTGCGCAGCACCAGTACTTTGGGTTAGTTAGAAAACACGATATTACGAATG 

TCTTTTACGAATTTGCACAATGAAAGTAAACAAAGAAAAATAGCAGTACAAATCGATTTTATGAACTAGCGAAAAATTGATAACTATTTCTTCAT 

ACTCTTGGGACAACGGCCTTTTTCAATTTTCGAGATGCGGTCACACTGCCCAAGTAATCGAACCACGAATTTAAGTGTTATTTTTAATGATATTA 

GGCATAGAACCATTACAATTAAAAAGCTCTATTATTTATTACTTTAATCGCTTGTCTCTTAAAATTCGATTCAACAAAAGCAATTGGTGGCATTT 

TGCTAGGTAGCCCTGAAATACTGTAGAGACTATCGATATCTGCACCTAAGGCGCCTATTATCACATTCGGATTACGAAGACAAAGACATGCGAAA 

CCTGGATACGATTAGTGGCGAAGAAAAGCAAAGTCCTGCCAACTTAGTCTAAATTGTTTCTAAGCCGGCTGCCTATACGCCTGCGTTTTGAGCAA 

AGCGGTAGTATCTTTTTCGGAACCATCGATAGCTTGTCATCTCCAGTAAAAATGCGTGAGTTTATTTAAAAAAAACGCTAGGCATACAACTAATT 

ATTCTTATGATGCAGAGAACTTGAAGATCCAAGAGGAAGTCAATTCATTGATGCGTCTAGGCCAGCACTTCGATGACCAGTTAAAGCTGGCTAGC 

GTCGAGTTGGGCGATTTCTCGGACGATGACCTGGCGCTCCTTGACAAATGCGCCCAATATTATTCACTTTTGCACATTCACGACATCAATCTGAA 

CTACTTGCGCGACTTCTATTGTGCCAAGAAGAGGGAATGCATAGAAAACCGGCAGACCACAGTGCAGCAACGCGTAGAGCTGCAGCGCATCCTCT 

CGTCCATCGAGGAGGCGACTAGGGACGTAGTTATGTTGGAGAGGTATTGGTGAACAAATCCCTTTAATAGTTGAATCCCTGACCAAAACCATTCT 

TCTGCAGATTTAATGCTGCTGCGGAAGAGCGGCTCATCCCGGATATCGTCGTAATGCAGCGAAACGCCCAACAGCTTGCAACAAAACAGGCCTTA 

CTGGACCGGCAAAAAACCCTTAAGATCCCAAAGGATTTTAGCATTGAAAGTGTCATCGAGAAGGTAGATTCGCTGGAGCAGCGCTAAAGAAGTGA 

ATTGCTTTCGTTTTTAAAAAATTATTTAATTGTATTTACTCAGTAATGCAAAACACTTTACAAAAACATAAAATCTGCAAATTTTAAAGATGAAT 

GTTGAGGTTTACAACTTGGAACACTACACAATCAGAGGGTGCACTTATTTTGGTTATTGTACGCATTTTAAAAACGTTCCAATTGTTGCTAGGTT 

GT GT C AAA T AT T TAT T T A AAGT ATC AT G AT AGG A AAACT AGAAGACC AAAT AAT AAT T TGCCAGCTGC T T AAA G C A T AC AAAAAGT GT AC C GT G T 

TCTTCCATGCCATTGCTCATATGTATGTATTTGTGTATTTCCAATGTGGTATATGTATCATATATATTGGTGTATAGATAGAATTAGTAATTTCA 

TATTCTTTAAGTTCTTGCTTGTTGTTATGTATATAAAGGGCGGGTATGTGGCAGACAATCTTTTCGGCTAGCAACGCGCTCCACGCCACAGGCCT 

ACTCCACGTTTCCGGGCATCTTTTAGCGGAACTCGGGCGCATCATCTTGCGTTATGTCACCGTACTTCCAGATGGCCAGATGCGCGACCCGAGCT 

GCTTCCTGCGCGGCTTTGTACTGGTCCACTAGTTCCTTGAGCTTCCGCTCTCCACGCTGTTCGGCCAGAACAAGGCCCTCGGCTACAAGCTGCTT 

TCCAAAATCGACTTTAGTAGTTGGATCGCGAAGAGTGGCCAAGTTCGGCGACCCAGTCACCTTTAGCTCGACATTGAGTTGAACCTTATGGTTCA 

AAACGTCTTCGGAAAAGGCACGCAAGGCTTCTTCCTTATCTTCGTTGTCAGTTGGCAGGGCGACTAAGGCCAAAGCATACTCCGTAGCGTAGGGC 

TTTTCGCTGCTGAAGGCTGGTGGCAGGGCTGCCAAACGGTTGGTCGGAAGTGTCTAAAAATGGAAACGAGTGTTAATACTGCCAGGAAAAAATCA 

C AAT GGT T TT GGT AT G AGT GT GT C G C AACT C G AAT AC TT T T G 

(SEQ ID NO: 19) 

Exon: 1001.. 1278 
Exon: 1338.. 1512 
Start ATG: 1001 

Transcript No. : CT33406 

ATGCGTCTAGGCCAGCACTTCGATGACCAGTTAAAGCTGGCTAGCGTCGAGTTGGGCGATTTCTCGGACGATGACCTGGCGCTCCTTGACAAATG 
CGCCCAATATTATTCACTTTTGCACATTCACGACATCAATCTGAACTACTTGCGCGACTTCTATTGTGCCAAGAAGAGGGAATGCATAGAAAACC 
GGCAGACCACAGTGCAGCAACGCGTAGAGCTGCAGCGCATCCTCTCGTCCATCGAGGAGGCGACTAGGGACGTAGTTATGTTGGAGAGATTTAAT 
GCTGCTGCGGAAGAGCGGCTCATCCCGGATATCGTCGTAATGCAGCGAAACGCCCAACAGCTTGCAACAAAACAGGCCTTACTGGACCGGCAAAA 
AACCCTTAAGATCCCAAAGGATTTTAGCATTGAAAGTGTCATCGAGAAGGTAGATTCGCTGGAGCAGCGCTAA 
(SEQ ID NO: 20) 
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Start ATG: 1 



MRLGQHFDDQLKLASVELGDFSDDDLALLDKCAQYYSLLHIHDINLNYLRDFYCAKKRECIENRQTTVQQRVELQRILSSIEEATRDVVMLERFN 
AAAEERLIPDIVVMQRNAQQLATKQALLDRQKTLKIPKDFSIESVIEKVDSLEQR* 
(SEQ ID NO: 21) 



Celera Sequence No. : 142000013384556 

CATTCCCGAATTGGAGGAAGTATTCACGAAGGCATTCCTCATCGTGGTTATCCTTAAGACCGCCTACGAACAGCTTCTTCACGGATATATTGGTC 
TCGCGCGATTCGCGTTCGGGACGGGGCAGAGCACGCTTGGCTTCCACAGTTCTGGAAGCGAAAGGCCTCATGGTATAGATCTATTCTTGGTACGA 
TATGATCCCCCACCAAGTCTATACTCACTTGCCGTCGATGATGTGCGGACGGTTCTCCTGGGCCCTGTCGACCATGAGAGACTTTGTGTAGGTGA 
TGAAGCCGAAGCCGCGAGATCTCTTGGTGGCAGCATCGCGCATCACCACAACATCGACCACCTTGCCCCACTGGCCGTAGAACAACTTCAGGTTC 
TCCTCGGTGGTGTACGGGGCCAGCCCGCCGATGAAGAGCTTACGCAAGTGCTCCAGCTCGCATATGTCCTACCGGTGCAGAGGTGGTAAGTGGGT 
TAGCCAAGTGGGTTGGCATGTGGTGAAGGGTGGTATGTGTGAAGGGAAACAAATATAGGTGCGTCACTTGCCAAGCCGCATACATGCACAGAAAG 
GTACGCGTATGTACTTACATCTTCCTCGCGATCTGCAAGCACAATCACATCGGCGGACTCGTCCGATAGGGGCTCATCCTTCACCATTGTGTTAG 
CAAATTAATTATCACTAAATTTAACCGAAATTAAGAAACGAAAACGACCAGTTTGCACGTCGAACGGTTGAAGCGAAGTGAAAATTGTGCTGTCA 
GGAAGCGAATCATTGATTCGATACACCCACTCTCTCTGTTTGTTATCGATAAGTTGGTTAGGGGTATTCTTGAAATTTCTAAAGTGCCAAGTTTC 
TTTACTATCGTAATAATAAATACATACACGAATTGGATATGCGATAGCTGTCAAATTGTAATATAGTTTTTCAGAAGCAAAGCGAAAACCGCCAA 
CGTATACTAGCCCCTTTATGCATTCAATTGAGTATTCAACTTAAGTTCTAATGTCGGATTCGCCCCATGAGGAGCACCGCTTGTGCGGTCTGGGG 
AATCACACGCTGAGGTCCCCGTACCGCGTAATCAGAAATCCGCACCTGCTGAAGTTTGGCCAGCTGATAAACTCGGAGCTGCGGTCTCAGCCACT 
GTGCCTCTCGTGCTACACGCAACTGATCCAGCTCTACCGGCTGAAGAACAACAATGCGAAGAGACACCTGGCAAATAGACAGGCAGCCAGTGCTA 
TTCTCTCGAGGAGCAGTAGTGAGAAATCTCATGAGTCCGCGGCTAGTGTGTCCACAACACAATCCTCGGAGGAGCAAGAATCAACCACTAGCGCA 
GCGCCGGCGGCGGCGGCTCAAAGAGCAGGAAGCCAGTTGGTTGAGTCCAATTCTTCCGACGATACACTTTTCTCCGATGACTACGATCCCAGCTC 
GAATCTGAGTTTGAATGCCGTGAATGGAACGCGGCTACCGAATGTCCAACCAATTCCGAAAAGGCGCCAGTTTGTGCATCTAAATCGTGAAGCTA 
TGGCCATTTATCTGGCGGGAACTACTGGGGGATAATGTTTGTTTCATTTAGGATATTTTCATAAATTAAATTTGAAATAACACTGTTACCATCTA 
TGTAACCATCTATCGATAGTTTCCGCCATGTTATTAGCAGCAGAAACAGCTGGCTGAGCACCAGGGCTGCATTATTGTTTGTTAGTTTTGCTGTT 
TGGTTTTTAACAAATTTCCTGGTTTTCCCTCAAAAACCCTCTCTTATATAAATAAATCAAGTGCAAAATGCTGAGTCCATAACAACACACGTAAG 
TTTGTTGGCAAGCAGTGAGTGATTTCTTCGAAAAGTGCAATTCCTGCAATCTGTTATGTTTTCACAATGACTTTTATTCGATCTTAATCTAAACA 
GTTTAAAGCTCAAATAAATGTAATCAATTAGCAAGTATTAGCAAATGTTAATAGGCGTTGTTGTTTCTCGACGTCGCTGCTGCTGCTGCTGTTGT 
TGTTGTTTTTGTTGTTGCTGGCGCTGGAACTGGTCATCGATGTCGATGAAATTGGCATCCTGCGTGTTATCCTTGGGCAGCGCTGCCGGCGTCGC 
TGCCGCGGCATAGTAGTTTAGGCCCACGCCCACCGCCACGCCCCCAGTGACGCCTCCGGGGGGCATGGTCGTGGCAGCCTGACCCATTATAGAGC 
T C AT AAAAC C AT T GGC C AC G AC AAAGT T T C TG AAAAAT T G AT T ACAT CAAT GT AC AAAAT AAT T T C G AGC AA C C C C C T C C GGC CC AAAAAAAAAA 
AAAAAAAAAAAAAAAAAATATGGAAAAAAATGATATCGTCTAATTGGGTCTGTGTGCGCATAAATGTCGTAAAATTAATTTCGTGTTAATTTCAT 
GTTGGCCATCAATTAAACTAGTGACATTTCACTAAATTACTTCCGTTTCGAAAATGCAGGTATAAATACCCAGTTAAAATCCTCTTCACGAGCTA 
TT CAT CAACCATTAATTTATGTTCATT T AATATTATTTAACTC AAT CAAAATCGATGATATCTTTGT ACAT ATGAAACAGAAAAA 
(SEQ ID NO: 22) 

Exon: 1001.. 1555 
Start ATG: 1001 

Transcript No. : CT33871 

ATGTCGGATTCGCCCCATGAGGAGCACCGCTTGTGCGGTCTGGGGAATCACACGCTGAGGTCCCCGTACCGCGTAATCAGAAATCCGCACCTGCT 

GAAGTTTGGCCAGCTGATAAACTCGGAGCTGCGGTCTCAGCCACTGTGCCTCTCGTGCTACACGCAACTGATCCAGCTCTACCGGCTGAAGAACA 

ACAATGCGAAGAGACACCTGGCAAATAGACAGGCAGCCAGTGCTATTCTCTCGAGGAGCAGTAGTGAGAAATCTCATGAGTCCGCGGCTAGTGTG 

TCCACAACACAATCCTCGGAGGAGCAAGAATCAACCACTAGCGCAGCGCCGGCGGCGGCGGCTCAAAGAGCAGGAAGCCAGTTGGTTGAGTCCT^A 

TTCTTCCGACGATACACTTTTCTCCGATGACTACGATCCCAGCTCGAATCTGAGTTTGAATGCCGTGAATGGAACGCGGCTACCGAATGTCCAAC 

CAATTCCGAAAAGGCGCCAGTTTGTGCATCTAAATCGTGAAGCTATGGCCATTTATCTGGCGGGAACTACTGGGGGATAA 

{SEQ ID NO: 23) 

Start ATG: 1 

MSDSPHEEHRLCGLGNHTLRSPYRVIRNPHLLKFGQLINSELRSQPLCLSCYTQLIQLYRLKNKNAKRHLANRQAASAILSRSSSEKSHESAASV 

STTQSSEEQESTTSAAPAAAAQRAGSQLVESNSSDDTLFSDDYDPSSNLSLNAVNGTRLPNVQPIPKRRQFVHLNREAMAIYLAGTTGG* 
(SEQ ID NO: 24) 



Celera Sequence No. : 142000013384172 

TATCTATACTCGCAGATGCACTAGCCTAAAAACGCAGCTATGTTATTTTTATATCTGTCACCCATACAGCAAAAGGGTACACGACTTGCAGAAAA 
GTATGTTCAAGAAGAAGGACTCTTTAAAATGCATATATACATCACTAACTATGTATATATATCATATAGATTCTATAGAATTTGTTATGTACAAT 
GGTTTTCTACATCTTAGCTAGGAAAATACGCAAGCAACAACTTTAGAACATTGTCTAATAACCAGATATTAAGAATCAATTACATATGCTGAAAG 
TTATCTTTTAGGATTTGTCATTTTATAGTCAGCTTAAATATGTGGCTGCACCGCCTTGAATGATATTTATTACTTAGCTATTCCCACTCACCAAT 
CCCAAAAATTCACTCCAGAACGAGGGAGCCACCAATGCGCTACTGAAACTGGACGGAACCAGCGTGGGAAATCGCTCGATAGCCGTGCGTTTGGC 
CAAGAACATTAAATATGTAAGCAAAGACCTTCGAGTCACCAAGCTAGCTATCCAAACTTCAAATCGATTCTGTTTCCTATATATATATAGGATGA 
AACGCAGAAGCCAAAGCCGCGCCTGGATATACCGGCCTTGGGCACAGGTAAACGCGAGGAGAAGGTCAGCAAAACGGAGGCCATTCGGGCCATTG 
AGGCCAAGCTGAAAATGTTGGAGCGCCAGACAGACGACAATCTCGAGCTGAATACAGCGGGCAGGGGCGAGGCCAACGTGCCCTTCATCCAGCGC 
TATCAGTTTAACAAGGATCGCGACGGTTCGCAGCGCTACGGAAAGTCATCGGCTCCATATCATCACCAGCAGCGTCCCAAGCGCCACTAGGCTCA 
ATCGACCCACACTCGACTTTGCCACCGTCTCACGCATTAGTCTGTAAAAATCATTAGTACTTAGGGCGCTACCTCAGCAGACGAGCCTTGCAGGT 
GTAATTTACAATTAATTTATTAAAAAAAAAGGTACAATCAGAGCGAACGTCTATTTGGGTGCGAACAAGATCTTGCCGCGCATGTGATCGGGCAG 
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CGGCACGGTTCCGGGACAGGGCACCTGGCCGGGGATCTCGCAGATGCAGTGCCTGCCGCATCCGTAGCCAAAGTTGGCCGGATTGTCCTTGCTCT 
CCTTTAGGCGCTCCTCTGCGTCCAGTTGCTCCCTGAAATTATGGAATAGGATGTGTATTAAGTTATAGCTTATGATGCTGAAAAGGTATCAGAAC 
TGAGCTAGCTTCTTCTTAACTCCCAACTCATAGTTCTACAAAATGTGATACACGAATAGAATCTTGATGTGTGTTCTCACCGGGTTTTGCCCACC 
ACCTTAACCAGGTGATCGATGATGTCGTTCCGATTCCGACTGTCCAAATCTATAAGCATATCGCGTCCATCGTCGAAGTAGCAGCGCACAAACGG 
CGAGGGCGTCATGTTCTTCAGCGTGAGCACCTGGACCTCGGGATTCTTGAACTGGATCTGCGGTATGTTCCAGMAACAAAGTCCCTGTGCGCAT 
TGCATTTATTAGCGTATATTTCAGTTATTAGACAATTCTATGCCATAACTTACCGAGCGCCCGCGTGATGAGCACCATATGTGTTGTAGTTCACG 
CTGAAAATGCGCACCTTGTCCTTCAGCACCAGCTTACCGGCATTCAGATACTTCAGGGTGCGGCGGATGGGCTCACGGCCCTTCATGAACGGCAT 
TTTGCACTTTTAATTCAAATACGCCTAAGTAAAACAAAAGAAATCCAATAAAAGAGCGGCTGCTTGTGTAATGCGAACAGCTGATCGGTCGGCTA 
GTGGCTCCCACAAGGCGCAAGCGGAACTATCGATATAAGCGCGGTATCGATGCAGACCAGAAATGTTTCCACTTAAAAGAAAAAATACGAATATT 
TATATGTTTTTATATGGGCGTGGCAGTTTTGGGTGGTTTGTAGACGTAATAAAGAGACATCAGGAAGAAACCTGAGCTGCTTGCATAATCTTAAC 
TTTCCAGCATTTGCTCTGACGAGCGTTAGAGTGGGTGTGGTAACATTTGGAAACAAACTTGCGTCTACGTCTCTAGAAGCTGCATGCTTCATCTC 
AATATTCTAGCTTTTATAGTTCCTGAGATCGAGGCGTTCATACGGACAAATGGACAGACGGACAGGGTCGGAACAACTCGGCTATTGATCCTGAT 
CAAGAATATATATACTATACATACGGAAACACTTCCTTCTGCCACTTACATACTTTTCAACGAATTTAAAATACCCTTTACTCCACGCGTAACGG 
GTCTAAAAATGGGTAGAACTATTATTGACTAACATAATTAGAAAACGCGTTAAATCTTCAATCTTACATTTTTTAAATTAAAAAAAAATTTTTTT 
TTTTGAGCATGGTCTAAAAAAGATAAATGCTTGCTTATATTTCATTTCAGTGGAGTACGTCTATTAAATGAAACAACACTTTCGATTTCTCTGGT 
CATATATCGATAGATCCGTTAAGCCGAACCGGCTGGCTAATTTGTTTACCTCTTAAACTTAAACTTTAAAGCTTCTTATTCTACCACAATCAGCA 
GGACATGGAAATAACACTCGCTCTGATAAAGCCGCACGTGCTGCGGAATACCTATGCAATGCAGCAGATCCGAGCCCTGATCTCCCAGAACTTCA 
CCATTCTTGACCAGAAGGAGGTTTGCATTACGAAAGAACTATCTGAGCGA 
{SEQ ID NO: 25) 

Exon: 1710.. 1574 
Exon: 1510.. 1316 
Exon: 1172.. 1001 

Start ATG: 1710 {Reverse strand: CAT) 
Transcript No. : CT34069 

ATGCCGTTCATGAAGGGCCGTGAGCCCATCCGCCGCACCCTGAAGTATCTGAATGCCGGTAAGCTGGTGCTGAAGGACAAGGTGCGCATTTTCAG 
CGTGAACTACAACACATATGGTGCTCATCACGCGGGCGCTCGGGACTTTGTTTTCTGGAACATACCGCAGATCCAGTTCAAGAATCCCGAGGTCC 
AGGTGCTCACGCTGAAGAACATGACGCCCTCGCCGTTTGTGCGCTGCTACTTCGACGATGGACGCGATATGCTTATAGATTTGGACAGTCGGAAT 
CGGAACGACATCATCGATCACCTGGTTAAGGTGGTGGGCAAAACCCGGGAGCAACTGGACGCAGAGGAGCGCCTAAAGGAGAGCAAGGACAATCC 
GGCCAACTTTGGCTACGGATGCGGCAGGCACTGCATCTGCGAGATCCCCGGCCAGGTGCCCTGTCCCGGAACCGTGCCGCTGCCCGATCACATGC 
GCGGCAAGATCTTGTTCGCACCCAAATAG 
(SEQ ID NO: 26) 

Start ATG: 1 (Reverse strand: CAT) 

MPFMKGREPIRRTLKYLNAGKLVLKDKVRIFSVNYNTYGAHHAGARDFVFWNIPQIQFKNPEVQVLTLKNMTPSPFVRCYFDDGRDMLIDLDSRN 
RNDIIDHLVKVVGKTREQLDAEERLKESKDNPANFGYGCGRHCICEIPGQVPCPGTVPLPDHMRGKILFAPK* 

(SEQ ID NO: 27) 



Celera Sequence No. : 142000013384525 

TTCGGCGCCACCGGCCCCTCTTCCTCAGTGGTCTCAGCTGCGGCGGGGTCTTCCTTTTTATCACTCGGCGTGCCAGGTCTTGCTGTGGATGGCGT 
CGATTGTGTGGACTTTCGGTGGGCCCGGCTGGTTGTAGAGTTGGCGGTGCTGTTAGTAGTCGTGGTGCCAGTGCTCCCGGCTTCCGATTTGCTTC 
CCGGTCCGGCGCTAGTTGTTGGTGTGCTGCTAGCGGTCGATTGCGAGGTGTTGCTATCCTTAGACGGCGTCGAGGCACGAGATTCGTTCCGCATC 
TGTTCCATTTTCTTGGCCTTGGCGCTGCCTGGTATGTAAACACCTTAATTTACAACTCCAGGAATTGCGTTTGTTACTGAACTCACCTTTCTTAT 
TATCCTTCTTGGAGCGCTCGCCGCACTGGCGAGCCAGTTCCTGACGTCGCTTGACGTTGTCATCGTTGTACTTGAGGACTCGGTTCTCGGGCACC 
CACTCGTCCCAGCTGAAAATGAAATGGCTTTCATGCAGATCGGTGTTCATACTAGTAGCTTTGCTCACTTTTTACTCCAGCCCGCGTAGTGGATG 
TAGTACTCAACCGGCGTCGCATCGGGTTTTGTTTTCAGAACCTTGGCCTCGTAGATGAGCGGTCCGTGGAAGCACAGAACCCGTTCCCCTGCAAA 
TGGCGCCAATGTGTTTAATTTGATTTATAGTTGGGCTTAAGTTATTGCCCTACCATCCACGAACAGAGTGTTTGCATCGGTCCCCGTGCTATAGT 
TTTCCACTTTAGCGGGTTTTACTTCTCCCATTTTATTTTCAATAATAGATAATTAACAAATCCTAGAGTGACCGTGCGTGACGCATAGGGCTATC 
GGATGGGACAGTACCGATGCACTTATCGATTATTTACAGGGTATTGTCTTGGTAGATATTAACAATATTGATTTATTTATGACTATGTACATCAA 
ACAGTCTGTTTTTATGGGTTGTTTTCTTAATGATTTTGAAGGCATCAAGTCTAGTGCTTCGCTGTCGTCGGCTGATCTTCCTGCTCCTCCCAGGG 
ACGTGGACCGCGTTTGTTCTCCCAGTTCTCTATGTCCACGGACTTGACTTTGTCGTACTCCGACTCCAACGTCACATCCTTGCGGTTTTTCATGC 
TTACGCCGTACTTTTTCATCTCCTCCGGCGTTACCGGCTGCTTCTTGGCGTACTGATACCTGTAAAGGGAGGAGACTCATGAGTTAGCTAAGGAT 
TTCCCGGAGATTCTTGAATGCTAGCCTGAGGTTGGAGAACTGCTGCAGTCCAAAGGAGCCAGCCACCATCATGATGAGGAAGGGTATGCCATATT 
TGAACGATTTTCGTGTGCTGTAATAGTTTAATTTATCCACGAGAGACATTGTTGTTATTTTGTGTTATGACATTTTGGCACATGCGGTTACACTG 
CAAGTTGAGAACGAAGTGTGACCAGTTATATGATAATAAAAAAATTGGCGCAAAATTTGAACACCGATTTCTTATCATATGCAATGTAGTTATAT 
TTTATTAAAATAACTTGGCGATAGTTTTAAAAATAATTGAATTTAAGCTAAACTATTAACAAGAATATTTAAAACATAACATTTTGTAATCGAAA 
AAAGTTATTTTCAATTTGGCACACAGCACTTTATTTTAGCCACACACACTAGGCGCCTATGTCAATCAATTTGTGCAGCTGGTGTCGTCTTTGAA 
AATAGTTGTCCTTGAATCCAAATCGAAAGATTTGTTGTCCCTGCCAAGTGCGAGCAAGTTCTGAATAATTGCAAAACATTGTTTTCCGGCGCACG 
CCTAATGCCCACCATTTGCATGGGGTTCGACGATAAGTTCAAGGTCCCCGCCGTCTCTTTGGTTTTCAAATAATTGACACTGACAATGTTGAAGA 
GCGCCAAAACGCTGTTCACATGCACACGACATTCGATCCGGATCACCTTGGCCAATTGCAGGCTTTGGCAAGGCAAGTAGCTCGCGGCGAAGATT 
TCCTTTTGCTGCCCACAGCTCCAACCAGTTCGCGATCGCTGCGGCTTCATTCACAAGGAGCCGGCGCCAATTGAGTCATCAGAGGCACCCCCCCC 
CCCCCTCCCGAGAGACTGGGATAGTCCCACACCCCTTTCCCAATAGATTATATTTCAGTTTGCAATGCACCCTACGCACAGTCTTTTAGTTGAGT 
GATTTGCCCAATGGGAAATAATATCATTACAATTAAAAGACCAAGGTTTTCATTTAGTATTGAACCTGATGAGCCTACAATGAGCTACATACCTA 
AGAAGTAAAACTTCTCCAACAACCCGT 
(SEQ ID NO: 28) 

Exon: 1307.. 1276 
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Exon: 1199.. 1001 

Start ATG: 1307 (Reverse strand: CAT) 
Transcript No. : CT34684 

ATGATGGTGGCTGGCTCCTTTGGACTGCAGCAGTATCAGTACGCCAAGAAGCAGCCGGTAACGCCGGAGGAGATGAAAAAGTACGGCGTAAGCAT 
GAAAAACCGCAAGGATGTGACGTTGGAGTCGGAGTACGACAAAGTCAAGTCCGTGGACATAGAGAACTGGGAGAACl^AACGCGGTCCACGTCCCT 
GGGAGGAGCAGGAAGATCAGCCGACGACAGCGAAGCACTAG 
(SEQ ID NO: 29) 

Start ATG: 1 (Reverse strand: CAT) 

MMVAGSFGLQQYQYAKKQPVTPEEMKKYGVSMKNRKDVTLESEYDKVKSVDIENWENKRGPRPWEEQEDQPTTAKH* 
(SEQ ID NO: 30) 



Celera Sequence No. : 1420000133837 69 

CAAAATATATGTGTATATCCATATTTCTATATATATATATATATATACATATAAATATAAAGGAAAACAAGAAAATGAGCAGTGCGAAATGTAGC 

AAGTGGAATAGAAAATACTCTCACGCTCAACACTTTTCTAACCCGTTTGTAATGCATATCTGTAGGGCCTACCGTTTTTAATCTTCTATAAGAAC 

TTTCCAACACTACCACGTCGACTGTGTGATGGAATGAGACTGTACTGAAGTTTTTTTTTTGAAATTTATGTAAATAGCAGCAATAGTAAATAGTA 

TCTAAACATGTCTGCAATAGAGCATTCCCAAGTCGTCGCAAGGCAATTTTACATTTATTTATACATTTTTTTTTTTTGCAATTCTCTGTTGCATG 

AT GGAAAAAGAAAGGC AACAAG AACAT GC AAAGAAACGGCG AAAT AAGAAGT GAAGAAGAAAAGC AGCGGCT AT G AATGTGCAGT GTGTCCGTGT 

GCATTTATGTATATGTATATGTAAGACTTTTGTGAGCGGGGGAGTAGTAGTGAAATGTAAGAAAATGAAGAGAAAATAGCACAACAATAATAAAC 

AAGTGCAGCAAAAGGAGCAGGAGTCGAACTCCCTGGGAAGTAAAGCACTCTTCTGAAAACGGTTGATGGGGGTCTTCAAACCCCTATAATTCCAA 

ATGCAATGCATCTACTGAGGATTGCAGTGAACTCTGTAATTAGGTTTAAATTGTCAATTTTTTTGAATCATTTATAGTTTGATGGGCAATTGCAT 

TGATTGGATTCCTAAACCATAGAACACTTTTCGTGGACTATTAAACATTAGGATTAAATTTTAAATTTATTCATAGATATTAATATACATATTCA 

AGACGTTCATATTTTAGTTGAAAATAAGAACGACCTTCAGTTATTTACTTTATTTATCTCCTTTAGTAAGCCCTTTGCAGCACTGTATTCGCTGC 

CACTTCTTGCCATGCCTCTCGTAACACTACACGTAGTGCAACCCACTCACTATGCCCTTTGCAACACTCTCCCTGCCTTGGTATCCCAATCCCTG 

AAATGGACTTACGTCTTCGGAGTCGTCCTGCGACTGTAGGTTCTCGGAAGAACTGGATGATTGCATTTGCTGCTGTTGTTGTTGTTGCTGTTGCG 

AGTTCTGATGCTGCTGCTGCTGCGGATTCTGATGCTGCTGCTGCTGCGCGATGGCGTCGCTGTCGCTGGAGGAGGCCGTCGACGAGGAGGTGGCC 

GCCGCTGCTGCAGAGGCGGCAACTGCGGCCGAGGCAGAGGCAGCGACGGCTGCAGCGGCCGCCGCCGACTGATGGAAGTTGGAATTATTTGTCAA 

CTGCTCCGAGCAGCGGACGCACGTGGGTATGCTATAGCCGGGCACCTGGAACAACTGTAGGGCGGGGGAATAGAATAGAACAGAACAGGACACGT 

GAATTAGAGAGTGCTTCCAATTGGAAAAAAAAAACCAATAATGCTCTTTATTTATTGTGGCAACTCAATCTTATTCCACACATACCCTCCCCCTT 

CC AT GC C AAC T AG CCCTTTCTCTTTTCCAC C AAC T AG AAAT AAG AGC C AAG AAG AT AAAC AC AC GC AC A C ACAC AGG C AC GC AC AC AT G GAG AGC 

GAAAAAGAAAGAAAAGAAACTACAATAACAACAGCACCAGCAGCTCGTTGCTTTGTTGTTGTTGTTTTCATTTCATGTTATTTTTTTTCTTCGGT 

ACAACGTTTTGTTATTCTTTCGCTTTTTGTTTTGCTTTGCCGCCGCTCTGCCAGCGGCGGCGGCAGAGGCGTCGCCTACAAAAAAGGGGTTATTT 

GGTAATTGTTTAAATAATGAACTAAAAACCAAGCAAAGTTAGATCTGTTTTGTTAGTTTTATAACCGCTTAACTTTAAATTTTTGGGCTTAGATG 

CCCAATATAGTAAACCCTCTGTCACTAAATTAACGCGCAGCCACTGCGTATTTGGCTGCCTTGGCGCAATTTCTGTCATTTTGTTTTAATATGTA 

TTTCTTTTTGTGCGGTTCATGGGCGGCGGGAGGGGGCTGAAAGAGGGGGGCCTGTTCGGGGGAATAGGTTGAAAAGGAGTCAAAGTGAAAAGCGA 

GTGGCGAGCGTAAGCAAAATGTAAAAATAATATTTTTTCAAGTTGTCGGCTTGAACAACTTTTCGGGCTGAGTTGGCTAAAAATAAAAATTGTTC 

AGCAATAGTTTGGGGTCGCATGTGTATGTGTATTTCTTATTTCTGTGTGTGTGCGTTTGTATGTGTGTGGGGAAAGGGGGGCAATGCGCGCGCTT 

TTGAACCCCGCCCCCTGCGCCAACAACCCGCCCGCACAATAACAGTAGCGCGAATTCAATTGTTTAAACTCCACCAAAAGAAAAAAAATCAATAG 

AAATAACAAAGTGTATATACCTAAATCACATGTGTGTATATCTTTTTTTTTTTGTTAACATACCGTTGCCGTTTGGAAGCACAATCTACAGATGG 

TGTCCATTATAACAGTAAATAAGGAATAGGATGGGGAGCAGGAGAAGGAGGATGCACTTTGTTGCTGACTTTTGCCCCCTCTGCGGTCGTTGTTG 

TTGTTGTTGTCGTCCTTCTCCCTCGCAATTTACCGCTATCTTTACCGGTTAACATTCAGCGCCGTTTTTCTTTTTCGTTTATTATTATTTAGTTC 

Q'P'P'PTTTTTCTATCTTTTTCCTTTTCCTTCCTACTATAAGTTTGTGAACACGATCCAAACGATGTGTGGTTGTATTTGAAACTAACTTTTTGGAG 

TTTGGTTTGCGTTTGGATTTCGGTGATCGCTATATGGCATATTATGATGGCAAAAACAAAAAAAAAACTATATAAATGATATAGAAATTTGATTT 

ATACGATATTTTCGTGGTACACACACACACGCGCGTTGTTTATGTTTGCCTTGCACTTGCACATACAAGCAAAAATGCGGCGCTGTTGTTGTTGC 

TGCTTCGAAGTTCTGCTTCTTTTTTTTGCTTAATTCCTTTGCCCTTTTTTTGGCGTAGTTACACGTGAAAAAAAAAACAAACCAGCGCGTTCGAA 

TGCGTTTTTTTTTTGTATTTTACAATTCACGACGAAGTTTTTGTCCGTCGTCTTTTTTCCCTTTTTTTTGGGGCCCGATGTTATTGTTGCGCTGT 

ATGCGTGAGCCGTGTGTGTGTATGTGTGGGGAAAAAACGAATACGATAAGAATACGCGGTATAGTTGTGCTGCTCTAACCCACAGGTTGGCTTGG 

CCGGTGCTCCTTGGCCTGGCCTATCGATATTTCTATCGATTTCCGTAATCGAAAACGAGGCGGGTAATGGAATCGTTGTTGAAAACAAGTTTTTT 

AATTTATTATTTATTTAATATTTAATTGCAAGTCCGAAAGATTTATTTTATCATAGGTTATTTTATTTAAATAATTGATGGTGTTTTTTAGGCAG 

CCATAGAATCATGAACTTTATTTTACAACAGAATAAATATAAAATCATTTTACAGAA 

(SEQ ID NO: 31) 

Exon: 2477.. 2439 
Exon: 1384.. 977 

Start ATG: 2477 (Reverse strand: CAT) 
Transcript No. : CT35541 

ATGGACACCATCTGTAGATTGTGCTTCCAAACGGCAACGTTGTTCCAGGTGCCCGGCTATAGCATACCCACGTGCGTCCGCTGCTCGGAGCAGTT 
GACAAATAATTCCAACTTCCATCAGTCGGCGGCGGCCGCTGCAGCCGTCGCTGCCTCTGCCTCGGCCGCAGTTGCCGCCTCTGCAGCAGCGGCGG 
CCACCTCCTCGTCGACGGCCTCCTCCAGCGACAGCGACGCCATCGCGCAGCAGCAGCAGCATCAGAATCCGCAGCAGCAGCAGCATCAGAACTCG 
CAACAGCAACAACAACAACAGCAGCAAATGCAATCATCCAGTTCTTCCGAGAACCTACAGTCGCAGGACGACTCCGAAGACGTAAGTCCATTTCA 
GGGATTGGGATACCAAGGCAGGGAGAGTGTTGCAAAGGGCATAGTGAGTGGGTTGCACTACGTGTAG 
{SEQ ID NO: 32) 

Start ATG: 1 (Reverse strand: CAT) 
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MDTICRLCFQTATLFQVPGYSIPTCVRCSEQLTNNSNFHQSAAAAAAVAASASAAVAASAAAAATSSSTASSSDSDAIAQQQQHQNPQQQQHQNS 
QQQQQQQQQMQSSSSSENLQSQDDSEDVSPFQGLGYQGRESVAKGIVSGLHYV* 
(SEQ ID NO: 33) 



Celera Sequence No. : 142000013384157 

CACCTCCATCGCACCCACACACACCTGGCCGATGTTACAGCACGCACACACGGACACAAACACAGGCAAAGACACACAGTCAGGCAGTCGCAGAA 
ATGGTAAAATTGCATGTGTGTTGCGCTGCGCACATTATTGAATTTTCCACGGCGAGATGACGACGACGACGGTGTGCAATTACTCACCTTGGCGT 
CGATTGGACTTGACTGACAATCAAGGATGATGGGCGAGAGCAGTGCAAAGGAGTGTCCTTTGAAGTTCGCGAAGGATAACGGTACGGCGACGTCT 
ACGGTAACGGAACGTAACGGAACAGATCGGTTCGGTTCGGATCTCGCACACACACACGCACGCACGGAGCAGCTGCTGAAAAGTGCCTGCTGCCT 
GTTACCGTTTTCGGTAGAAAAAGAGAGAGACGACTGAACGGAACAGAACGGGAGAAAGGCTCCACGTGGTGGCGATGGCGTTTGGCACGGAACCC 
GCGAAGCGGGAGAGGATTCGGTCGCCAATGCGTTAAGGTCACACTGGCACACGGGCGAGGGGCACACTGGCAGGGCCAGCGGATCAGCTAGTATC 
GCTACGATTCAAGTCTTAAATAATTTGTAAAATGAAAGAAACATATAATATTTAATAGCATAAATAAGTTTAATTTCATAAAAGCAATAGTTTTG 
ATTCGCTCAAAGCTGAATGCATTATATATTTCTATGTAATATAATATTGATATGCTGTATTCAGTACAAAGAAATTGAGACACTGAGACAATTTC 
ATAAAAATTTGAAATACTTGTCAAACTTATTCTTTTAAGTGTTTCCAAAATATCTATAAATATTAAATAGCTTGAAGTGAACGTTAGAATATTTG 
TTTTGGACACTAAACGATATATCAGCAGATAACGCCTTCGAAATTCGGTCACGTCCAAGTCCCGCATCGTCTGGCCTCACTAACCAAGTGTAAAC 
AAAAGTTGTGTTGTGAAGTTGTTTTCCTTTGGTCCAAAAAGGAATTGGTCATGTCGAAGTTGAGCGACACTAAGATCCCGATCACGGAGTTCCTG 
GAGGCATACCGTCGCCAGCCGTGCCTCTACAACACACTGCTGGACTCGTACAAGAATCGTGTGTCCCGGGAGGAGGCCTACGGAGCGATCATCCG 
GTCCCTGAAGATACCCCAACTGACAGTGTCGGACATAAAGCTAAAGATCAAGAGCGTGCGTACCGTGTACTCTAAGGAGCTGCGCATCTGGATGC 
GTGAGAAGGAGCTGGGCAGGACCTACGAACCAAAGTTGTTTTGGTTTAGGTTGGCCGACTCCTTTCTACGCTCCGTTTCCCTCTCCCACTGCAAA 
AGGGTAGGATTTCACGATGTTCTTATGTCAATGTTAACCTCTGACCCACCACCAGCAGGGAAAAAATAACAGTTCCAGTGCTCAATTAACGACGA 
TCAAATCGGATGAGACGAGCAAGCTTCTTTGCACTGCCGCCGCAGATATCACTATGAGCGAGGATGCGCTAGAGGAAGAGGATGCAGAGGTAAAT 
GGCGAGCCGGAGGAGTGTCCTTTGGAGGAAAGCAGGCCAACAGCAAGCATCTGCAAAGATGATTCCACGCTCTGCCTGGCGGATCAACCACAGCA 
GGAGCACTATTCTCAGGGGTGCAGCAGCTCCCAGCAGCTTCCTCACACGATGGCACAACGAAAATCAAAGTACATTACAAGTCTGGACTCGGCCG 
GAGAGGATGATCTTATTATATTCGGCCAGAGCATCGCCTCTCAGCTGCGAACCATTCCGGATTCATATTCTCGGTCCGTGGCTAAGCTGCGTATC 
CAGCAGGTGCTGTTCGAGGCTGAAACCGGGCAGTTTCAGAGCACCGAGGTCAATAGCACGCAACTCCAGAATACCTTTTAAACCATAAGTCTAAA 
TTTAAACACTGTGCTCAGTTCTGGCAAAATCCCCAAAAATATGCCACCCTGGCCTAATTTCCAGGGGATATTTATCAACCCGTTTGCCATTCAGT 
ATATGTAGTCAAGCTAGTTCTAAGGATTGACGGTTAAGTATAAAATGTAAATAAAAATGCTATCAAGTAATTACAAATTGCTAGCTAAAACAAGT 
ACTTTGTAAGTAGTAATACATATGTGGGTTTAATTTTTGTATGGCGGAGGTTAATTCGCCTGAACTCCAAGTAATAATAAGCACATTTTAAAAAT 

C AAC GC AT AAAAT AAAAT GAC T AAT T T AGTC AT AC AT AAAAAACAT AT AAC AAT C AG AGT T GT AC AAT TAT T AAAC T T C C T C AAT T T T C AAAT C A 
AAACCCGAATTTCAAAAAAAAGCAAATTATAGTGAATTATAACTATCGATACTTTTTGAACATTTATTCAAAAGTCACCATGCTGCGACTTACAA 
CTTAAATTCGATTAAAGTGTTGTTTCATGCTCACACAACAAAATTTGTTGTTATTTTGCTAACATATTTATTAACACTTAACCATATCGATAGGC 
AAAGCCGATTGTAATTCACAGTTGTCGTGGATTGGAGCGTTTGCAGTGTTTGGTCACACTATTTATGCTTGCATTTCGTCTGTGTAAATTTCATT 

TC AC AAAAAC C GAC AT T GTT AAAT T AAGC AAAT AAG AGT GT T T AAGAT C G AAGT G AGAAG AGC AGC AAT AT GC T G CAT T T AAAT AT AAT T GT G AG 
TATGGCTAGTGGTATTTCCGAGAACGCATAAACACGCACACATACATTGGTGCGAGAAAAGACTCGCGGAAACAGGCACACGCACGCACACAGCC 
AAATTCTCACCCACTGCATTGTTTCTGTTCTGACAGTGGCAATTTCTTTTCGCTACTCTCTTTTCGTCTATTTTTTTTCTTGTGCTGTATTACAG 

TCACCAATGTAGTTTTCTTTAACTTTTAAATACGCT 
(SEQ ID NO: 34) 

Exon: 1001.. 1333 
Exon: 1386.. 1886 
Start ATG: 1001 

Transcript No. : CT35724 

ATGTCGAAGTTGAGCGACACTAAGATCCCGATCACGGAGTTCCTGGAGGCATACCGTCGCCAGCCGTGCCTCTACAACACACTGCTGGACTCGTA 
CAAGAATCGTGTGTCCCGGGAGGAGGCCTACGGAGCGATCATCCGGTCCCTGAAGATACCCCAACTGACAGTGTCGGACATAAAGCTAAAGATCA 
AGAGCGTGCGTACCGTGTACTCTAAGGAGCTGCGCATCTGGATGCGTGAGAAGGAGCTGGGCAGGACCTACGAACCAAAGTTGTTTTGGTTTAGG 
TTGGCCGACTCCTTTCTACGCTCCGTTTCCCTCTCCCACTGCAAAAGGCAGGGAAAAAATAACAGTTCCAGTGCTCAATTAACGACGATCAAATC 
GGATGAGACGAGCAAGCTTCTTTGCACTGCCGCCGCAGATATCACTATGAGCGAGGATGCGCTAGAGGAAGAGGATGCAGAGGTAAATGGCGAGC 
CGGAGGAGTGTCCTTTGGAGGAAAGCAGGCCAACAGCAAGCATCTGCAAAGATGATTCCACGCTCTGCCTGGCGGATCAACCACAGCAGGAGCAC 
TATTCTCAGGGGTGCAGCAGCTCCCAGCAGCTTCCTCACACGATGGCACAACGAAAATCAAAGTACATTACAAGTCTGGACTCGGCCGGAGAGGA 
TGATCTTATTATATTCGGCCAGAGCATCGCCTCTCAGCTGCGAACCATTCCGGATTCATATTCTCGGTCCGTGGCTAAGCTGCGTATCCAGCAGG 
TGCTGTTCGAGGCTGAAACCGGGCAGTTTCAGAGCACCGAGGTCAATAGCACGCAACTCCAGAATACCTTTTAA 

(SEQ ID NO: 35) 
Start ATG: 1 

MSKLSDTKIPITEFLEAYRRQPCLYNTLLDSYKNRVSREEAYGAIIRSLKIPQLTVSDIKLKIKSVRTVYSKELRIWMREKELGRTYEPKLFWFR 
LADSFLRSVSLSHCKRQGKNNSSSAQLTTIKSDETSKLLCTAAADITMSEDALEEEDAEVNGEPEECPLEESRPTASICKDDSTLCLADQPQQEH 
YSQGCSSSQQLPHTMAQRKSKYITSLDSAGEDDLIIFGQSIASQLRTIPDSYSRSVAKLRIQQVLFEAETGQFQSTEVNSTQLQNTF* 

(SEQ ID NO: 36) 



Celera Sequence No. : 142000013384734 

TACCTTTGGACCGACCGGGAGCTGTTGATGCACCTCCAGAACTACACTCCCCTGATCCTTCTCATCGATTTCGTGGAGAAGACCCGAACCAAGCG 
CTTCTATGAGAGCTCCGAGCGCTATGAGATACTCATGCTGGTCTTCATCATGCGCAAGGGAGCGCCGTTTTGTGAGAACAAGCGCTTTCCAGCGG 
AGTACTGGGTTAATCTATCGGTGGGACCAATTGCCGAGGCGTTTGACCGTCTGCAGGCGGCCATCGATATACCGGATCCCCAGCTGCCGATCCAC 
ATGAGCGTTACAGATTTGACCAGCTGGAAGCAAATGTTCGACCTGGCCATGCTGGATATACGGCGTTTTGCGTACTACACCGATCCCTTGCAGCT 
GGCCGATGCGGGCGTCTTTAACCGGATCACCTTTGAGCAGCGATTCGGAATGCAGTGGCAGGAGTAATGGGAATTGATGCACGTTACTGGATATA 
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TATTCACGGGAAACAGCATGTAAGCACACCATCAGCACCAACACACTTTCCCAAATTGGGATGGATCGTGGGGAAGGCTACTGCATAAATCCGAA 
CGAAAGCATTTAGTTAAGTATTAATAGGACGCACTAGAGCGCAACCGATCCATCCAATCCCACATAGACAGAGTTATTTGTTGCGTTAATGTCCC 
TGCATAGCGAAATCTACTGTGCCTTCTCCAAAAACTCCAGATCCTTCTGCATTCCTTTCCCTCCAATCCGTCAACTTCATCTCTCCTCCGACTCC 
TCAAATGTATCTGTAATACAAGCGACGTGTAGGACCCAAAATGCTTTAATGGAAATAAACGTTAGTTACTGTTCAAACGCCTGCCTATTTATCTT 
TGTAAAATAGCGATAAATCATTCACAACACGCATTACTCACTATAAAAATAGCACTATGTACACTATGAAACTACACCTATGTTCCTAATGCGAA 
TTCAACGAGATCCTCGACAGTCCCATTCTTAGGGATCGGATCATCCAGCCTCATGGCTGCCGCTTAGGAGCGCGATCCCGGTCAAAGATGCCCAG 
CAGTTTCCCACGAGTCATCTCCATTGCCTCACGCAGTCCCAGCTGGTAAATTTCATCATGCATTGACTTTATTTTATCGATCAGATCCTCGGGAG 
GGAGCTGGGCTAGTTCTGGAATAGAAAAGTAGGTAATGGATATGCTATTCTTTAATATACTATTCTTAAATCTCATTGACTTTGCAGCTACTTAC 
CATTAATCTTAACCATATCATCATCGGTCAGATTGCTGGTGTAGTTGGTGCTGCGCGTGTACATCGGCGTGTCGCTCATGGAGAGGAACTCGTCC 
ATTCCGGGAACTGGAGCAATCGAAAATAAAGATATGCGTTAAAAATGGCGCCAATTTCTTCGTGCTCACGTTTTCTACACTTACAACGCTCTGGC 
CATTGGACAGGTGAGGGTCGCAGAGTTTCCATGGAGATGAGCTCGTCTTCCGGCGTTATGGGCCGCTCCTTGACCTCCGGGACATCCTTTTCGGC 
CAATAACGTACTACTGCTGGACGCTGAAGCCTTCTCCACTTCCTCGTCCGCCGCTTTTAGAATGGTTTCTGCAAGAAAACCTCGTTTTATGGCGC 
CTTTCATGTGATTTAAATATTTACTCACCTGTAGAAAGCAAATCCGTAGTTGTCTCTGCGTTGCTCATCGCGGATAATGTTGAAAAACTCCAAAA 

AC TAT T G AAAAT AAAC AC G G AAAG AT G C C AAAAAAT C CG T GG GCT AAT C AC T T GC C AC C T C G TT AAAC AAAAACT AT AG C GT T GC C ACT T T C AAC 
AATCACTATCGAATACTATCGATTTACCGCCAAGCCAGTGCTAGAAAGCACCAGGAAGTATAAAACAACTGATATTTTTGGTAAACAAAGAACTG 
TTGTTATTGCTTTTCCTGGTTTTATTAATAGAAGTTAAAAATAAATGGCACAACAATTAGGAAATGAGCGTGCCTCGACTTATACTAAATTCAGA 
GTGGAGAGCAGGCGTCGTAGCAGAGAAACGCGAACCTCTCCGCGACCTCTATCGTGCTCTCTTTTCCACTCACGGTCGTTTAACGGCGCATAGAG 
ACAACTCTGCGCTAGGTCGCCTGCTCTCGTTCGCTCTGGCGCGCAAACGGCACTTGGCACACTTCGCACAGTGGCGCCAAAAAATATGTGCCCGT 

AAAAAGC AC AT GTGC T GC GC AGT AGT T GT C C AG AAC AAC AC AAC AC GG AT G T C T A AC C AGT T AT T GAAAT AAACT T AT T C AAGAAC AT C T T AAT A 
ATAATATAAAATCTTAAGGAATAACAATATCTTATCAAGTTTTTATCAATTGGCGCCGCCGCCTGGTAGCACTGTTCGCATACGCAGACTTTTGG 
CCACCACTGTTTTCCCATGCGCCCTCGCACTTTCTTTCGGTTTTTCATTCGCTGCTCCGCTCTCACTCAGTCAACAAGCACTCATACACACGCGC 
TCACCTGTTCCGCCGCCCCGCTACCCCGTCTCTCGACATTACTGCGACCGCTCCCGTTCCGTTTCATTATTTTTTTTGTCGCTGTCATTCAGTTG 
CCGTTTTAACTTCGCCGCGCGAGAAGCGGAAACTTTTGCCAGTCCGCTCGAGGAAAATAGAAAAATACTGCAGTGCTGCTGCAGTCGTGCAGCTC 

GTACATTTCGTTTCGTTTCGTTT 
(SEQ ID NO: 37) 

Exon: 1683.. 1644 
Exon: 1588. .1415 
Exon: 1340.. 1236 
Exon: 1155.. 1001 

Start ATG: 1683 (Reverse strand: CAT) 
ATGAGCAACGCAGAGACAACT 

TAGTACGTTATTGGCCGAAAAGGATGTCCCGGAGGTCAAGGAGCGGCCCATAACGCCGGAAGACGAGCTCATCTCCATGGAAACTCTGCGACCCT 
CACCTGTCCAATGGCCAGAGCGTTTTCCCGGAATGGACGAGTTCCTCTCCATGAGCGACACGCCGATGTACACGCGCAGCACCAACTACACCAGC 
AATCTGACC GAT GATGATATGGTTAAGATTAATGAACTAGCCCAGCTCCCTCCCGAGGATCT GAT CGAT AAAAT AAAGTC AAT GCATGATGAAAT 
TTACCAGCTGGGACTGCGTGAGGCAATGGAGATGACTCGTGGGAAACTGCTGGGCATCTTTGACCGGGATCGCGCTCCTAAGCGGCAGCCATGA 

(SEQ ID NO: 38) 

Start ATG: 1 (Reverse strand: CAT) 

MSNAETTTDLLSTETILKAADEEVEKASASSSSTLLAEKDVPEVKERPITPEDELISMETLRPSPVQWPERFPGMDEFLSMSDTPMYTRSTNYTS 
NLTDDDMVKINELAQLPPEDLIDKIKSMHDEIYQLGLREAMEMTRGKLLGIFDRDRAPKRQP* 
( SEQ ID NO: 39) 



Celera Sequence No. : 142000012757 653 

GGAACTGGCCAGGGCGCCAATAACTGCGTGTAAAGATTTGGCCTGACTTTCGGCCGAGGGCGGGTATTCCGTAGACTGTATCAGATCCTTTGTGC 
CCAGGTGGGTCGAGACGTGAGCAAGCGTTTCGGTGCTAAGTAGATAACTGGCGATCGCCTGGAGTCCTTCATTAGGGACCTTCGGTAACTGGTGC 
TGCAGGAAGGCCTCAACGTAGGCACGGGCAATTTGTTGGCCCTTCTCGACCAGATCGCTATTGTGAGGCATCTGGAGGTCAGACTCCTCAATGCC 
TAGCTGCCGACGACGATCCTCCTCCCGCTCTGCAAACGATTTCTCTGTAAAAGCACGCTGCAGCTCCGACAATTCAAAGGATTCGCCCAAACGCT 
TGCCGAACGCGAAGAGCTCCGATCGCTGGTTCCACTCCACGAACCCAGAACGCCGCTCTGGCTTTTGTGGTCCTAGTTTCTTCTGCCGGTGGGCG 
AGCTCACGTAACGTTGGCGAAACCCATCGCTTGATGTGGCGCCTCGTTTGCACTGCCGAAAATCACGATTACTGACTGGCATTAGGTGAATCCGG 
GTTACTCACCAGTCGGTGAGCCCTCCAAGAGCTTGACACGAGCCAGGAGACCAACAGCACTGCGTAAGAATGACATCTTTAAAGCGATCTTAAAA 
ATAAGTAACGAATATGTTTTCGCGCATGGGAACTAGTGTTGCACGGGTTTTTGTGGCAGTGATGGCAGTCTAACCGTAAATATAATTTTATAACT 
TATAACTTAACTTATACATTTTAATAATATTTGTTTCTTGTTGTTTCTATATTTATTCAGAGATGTATAATTAAAAAAAAATAATAATAACTAAA 
TATGTTTCTTTATGAATTATAATTTTATCATTTAATCGAGTAATTATAACGAAAAATACTTTCAGCTTTTTATTTAGCGTTTTGTAGCTAAAAGG 
CGCCAGTACTAAATTGCGTTGGTGGCACCCGCTCGTTTTCCAACACTCCCGCAATAGACGCCATGTACATGTCTGTGTTTGTGTATGCGCCAAGA 
AAATTTAAGTCGCGGAATATTAACAAGGAATCCCTGCCAGAAACAGGGCATTGAGAGACCACAGAAGAAAATATACCCAACATGGCACTGCAGAC 
ATACGGGGACAAGCCGGTGGCCTTCCAGCTGGAGGAGGGCGGCGAGTACTACTACGTGGGCTCGGAAGTGGGCAACTACATGCGCCACTTCCGCG 
GCATTCTGTACAAGAAGTACCCGGGAATGACCCGCATCGTCCTGTCCAACGAGGAGCGCAAGCGGCTGGCTGAGTCCGGCCTCAGCTCCCACATC 
TTAGCCAGCTCTGTATCGCTGCTCCGCGCCGTAGAGGTGGATGATATCATGGCCGGCAACGATGAAAAGTAAGTGGGCGTCTTACCTAAGTAATG 
TCCCCAGAATAGTAACTAAATAGGGCCTTTCCTAAAAATTCCAATTAACTTCGTTGATATCTACTTCCTACTAGTCCTACTTAAACGTTTGTTTT 
TAAATGTTTAATCCAATGGTATCAAGTAATCCTTTTATCAAATATAAGACATGTGTACATTTGGCTGAAGATTCTTCATTTTAGTAACATTGTTA 
ATGACCACTTAGTTAAAAACCACTTATCATTCTGAAGTGAAGACGTATACAAACACACATAGCCACACACTGCCCAGTTATTTTGAAAAAATATG 
GAAAAATGTCATAGACATTTTTGGCAAAGCTGCCACAATAGTCTGTATGATTGACAACGATTTCCAATTGAGCACGGAGAATTGTATTATTGTCA 
GAATGTTGGGTAAATTTTGCCTGCTTGCAGGGCCAATCAGTCAATTTGCTAACAGTTGATTTAACTAACCATGAATCTCCGTCCCAGGTATCGCG 
CCGTCTCCGTGAACACTTCCGATACGCCAGTGCCGCGGGAGAGCAAGTCAAAGAAGCAGCCACAGTATGTGCCCACGATGCCGAACTCCAGCCAC 
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CTGGACGCAGTGCCACAGGCCACGCCAATCAACCGAAACCGAGTGCACACGAAGAAGGTTCGCACATTCCCGATGTGTTTCGACGACACGGATCC 

CACGGCTAGCCTGGAGAATGCGGCGCAGAAGGAGTGCCTGGTGCCCATTCGACTGGACATGGAGCTAGAGGGTCAAAAGCTGCGCGACACCTTCA 

CGTGGAACAAGAACGAGAGCATGATTACTCCGGAGCAGTTTGCCGAGGTGCTGTGCGACGACCTGGACCTCAATCCCCTGCCCTTTGTGCCGGCT 

ATTGCACAGGCCATCCGACAGCAGATCGAAGCCTTTCCCAACGATCCCCCCATCCTCGAGGAGACCTGCGACCAGCGGGTCATTGTTAAGCTGAA 

CATTCACGTGGGCAACACCTCGCTCGTCGACCAGGTCGAGTGGGACATGTCCGAGAAGAACMCMCCCCGAGGAGTTTGCCATTAAACTCTGTG 

CGGAATTGGGATTGGGAGGAGAGTTTGTTACGGCCATTGCCTACAGCATTAGGGGTCAGCTATCGTGGCACTGTCGAACGTACGCCTTCAGCGAG 

GCCCCTCTATCAACGATTGATGTGCCCTTCCGGAATCCCAGCGACGCTGACGCATGGGCGCCATTCCTAGAGACGCTTACCGACGCCGAAATGGA 

GAAGAAAATCCGCGACCAGGACCGCAACACGCGCAGAATGCGACGACTGGCCAATACCACAACTGGTTGGTGATCTTCCGCCGATCCAGCAATGT 

GTCACTAATGTAATCCTCCTATTAAGTACCATTATGCATCCCAAATAAATGTGTGTCTGTCGATTTTAAAACCTATTGTCTTCTACGCGCTACTT 

CCAAAAAAAATAGTTCCAAACGATGTTACAAAATAAGCTGATACTCTTTACTGCTTACGCTAAACTCGAATCCTTATTCTACATTATCCAATACT 

AACTTTCTGCCGAATCGGGCTTGTCTTGGTCCTGATCGCCGTCGTAGAACTCGTTCTTAGCATCAACCAGCTTGTCCTCATCGGCTACGGATATT 

CGCACGTCGGGATCACTCGCCTCCTCCCGGTTGCTGCGCAGCTCCTCCAAGTCCACATCAGGCGGGGTCTGCACCATTTGGACGCTGGGCGAGTG 

CTGGCACATCTTAAGGTTTTCATACACATGCTTAACGATGAGTTCAAGATACTGCTTTGAGTTAGCATTATCTTGGCGTGAGTTAATCTCTGGAT 

GGAGTGTAAAGTCGGGGGCGAAAAAGTCGTAATACTCGGTGGCGGGTAGGTCATTTTCAATGTCTTGGTCCACCAGCAGCGAGGTCTCGTGCGTC 

CAGCAGCGGGCCACGTTCCGTAAGGTGTAGCCACCGCCTCCGACCACCAATGTGGGTACGTTCAGTTCCTTCACAAATTTGACACACTCACCATG 

TCCCTTGGTGCTCAGGGAGAAGCAACCTAGTCGATCTCCAGCTAGTGAATCCGCGCCACACTGCAAAACGATAGCCGTGGGCCGGTAGAAGTCCA 

TAATGGCAGAAATTATGGGTTTGAACACCTGAAAATAGCTCTGGTCATCAATGCCCTCCTTTAGCGGCACATTAACGCTGTAGTATCGTCCCGAT 

TCGGCGCCTATCTCGTACATGTCTCCCGTGCCGGGAAAAAAGTAGTTTCCGTATTTGTGGAACGAAGCCGTCATCACTCGGTCCGTAAGGTAGAA 

CGCCTCCTGCACCCCGTCGCCGTGGTGCACATCGATGTCGATGTAGAGCACCCGAGGATGGTACTTAAGCAGCTCCAGAATGCCGATGACGATGT 

CATTCACGTAGCAGAATCCGCTTGCC 

{SEQ ID NO: 40) 

Exon: 1001.. 1398 
Exon: 1893.. 2826 
Start ATG: 1127 

Transcript Wo. : CT1273 

GCAATAGACGCCATGTACATGTCTGTGTTTGTGTATGCGCCAAGAAAATTTAAGTCGCGGAATATTAACAAGGAATCCCTGCCAGAAACAGGGCA 
TTGAGAGACCACAGAAGAAAATATACCCAACATGGCACTGCAGACATACGGGGACAAGCCGGTGGCCTTCCAGCTGGAGGAGGGCGGCGAGTACT 
ACTACGTGGGCTCGGAAGTGGGCAACTACATGCGCCACTTCCGCGGCATTCTGTACAAGAAGTACCCGGGAATGACCCGCATCGTCCTGTCCAAC 
GAGGAGCGCAAGCGGCTGGCTGAGTCCGGCCTCAGCTCCCACATCTTAGCCAGCTCTGTATCGCTGCTCCGCGCCGTAGAGGTGGATGATATCAT 
GGCCGGCAACGATGAAAAGTATCGCGCCGTCTCCGTGAACACTTCCGATACGCCAGTGCCGCGGGAGAGCAAGTCAAAGAAGCAGCCACAGTATG 
TGCCCACGATGCCGAACTCCAGCCACCTGGACGCAGTGCCACAGGCCACGCCAATCAACCGAAACCGAGTGCACACGAAGAAGGTTCGCACATTC 
CCGATGTGTTTCGACGACACGGATCCCACGGCTAGCCTGGAGAATGCGGCGCAGAAGGAGTGCCTGGTGCCCATTCGACTGGACATGGAGCTAGA 
GGGTCAAAAGCTGCGCGACACCTTCACGTGGAACAAGAACGAGAGCATGATTACTCCGGAGCAGTTTGCCGAGGTGCTGTGCGACGACCTGGACC 
TCAATCCCCTGCCCTTTGTGCCGGCTATTGCACAGGCCATCCGACAGCAGATCGAAGCCTTTCCCAACGATCCCCCCATCCTCGAGGAGACCTGC 
GACCAGCGGGTCATTGTTAAGCTGAACATTCACGTGGGCAACACCTCGCTCGTCGACCAGGTCGAGTGGGACATGTCCGAGAAGAACAACAACCC 
CGAGGAGTTTGCCATTAAACTCTGTGCGGAATTGGGATTGGGAGGAGAGTTTGTTACGGCCATTGCCTACAGCATTAGGGGTCAGCTATCGTGGC 
ACTGTCGAACGTACGCCTTCAGCGAGGCCCCTCTATCAACGATTGATGTGCCCTTCCGGAATCCCAGCGACGCTGACGCATGGGCGCCATTCCTA 
GAGACGCTTACCGACGCCGAAATGGAGAAGAAAATCCGCGACCAGGACCGCAACACGCGCAGAATGCGACGACTGGCCAATACCACAACTGGTTG 
GTGATCTTCCGCCGATCCAGCAATGTGTCACTAATGTAATCCTCCTATTAAGTACCATTATGCATCCCAAATAAATGTGTGTCTGTCGATTTTAA 
AA 

(SEQ ID NO: 41) 
Start ATG: 127 

MALQTYGDKPVAFQLEEGGEYYYVGSEVGNyMRHFRGILYKKYPGMTRIVLSNEERKRLAESGLSSHILASSVSLLRAVEVDDIMAGNDEKYRAV 

SVNTSDTPVPRESKSKKQPQYVPTMPNSSHLDAVPQATPINRNRVHTKKVRTFPMCFDDTDPTASLENAAQKECLVPIRLDMELEGQKLRDTFTW 

WKNESMITPEQFAEVLCDDLDLNPLPFVPAIAQAIRQQIEAFPNDPPILEETCDQRVIVKLNIHVGNTSLVDQVEWDMSEKNNNPEEFAIKLCAE 

LGLGGEFVTAIAYSIRGQLSWHCRTYAFSEAPLSTIDVPFRNPSDADAWAPFLETLTDAEMEKKIRDQDRNTRRMRRLAKTTTGW* 

(SEQ ID NO: 42} 

Name: Snf 5-related 1 
Classification : transcription_f actor 
Gene Symbol: Snrl 
FlyBase ID: FBgn0011715 



Celera Sequence No. : 142000013384244 

GTCGTTGAGGGTCTCTATGTATATGCGGTACTCGCGGTTCTTGCGCTCTCTCGGAATATCGAAGACGTGGTCAAAGGACATAAGGTAAGTGATAT 
AGTCCAGTTTCTCCACCGACCGCAAGTTGAGGTACAGCTCGTAGCACTCGTTGAGGTCCAGATAGCGACCGCCGCCCTCCTCGTCAGTGAACTCC 
ACTAGGGCACTCATGTCGTCCGGATTGTTGTACACCCGAATCATCTCGTCGAACTCGACGGACAGCGGCACGCTGACCTCCGCCGGATGGGACTT 
GTAGAACTGCTTGATTTGCTTGAGGCGGGCGTAGAACTCATTGAACTCGTTGGGCCCCGAAAGAGCGGCAATCTCCGCCTTTCGTTCATTGTCCT 
TGTCCTCGTAGAGATCGCGTAGCTGGGACGTAGAGTTGTGATGCAGTTCCATGAGGTATTTAAGCCGGTGCTCCGAATGGATGCGCTCCTTTTCG 
CCCGGCTTTTTGGTCGCATGCTCGTCCACCATCAGTTTGACCAGGCGCTCGCGCTCCTCGTGTAGGCGCCGCTGTTGCTCCAGGAGCGTCTCCAT 
GGCTCTTGAATGCGAGGATTTAATGCGGGGAAATATCAAATGGAGGCCTTTCACTTAGCGAAATCCCGAAGAAAACACAACTTTTCGTCAGCGTG 
TGTACATTAAAAAATACACTTACAATAAAGCAGGGATGGCTTATATTGTCACATCATATTATTGTCAGGTTATTATACTCCGATCACACTGGCAA 
TTAATGTGGATTTCGGGAATGCAGATGTATCAAATTAACGCTCGATTATTCGAGTAATATAGTAATGATATTTAAAGAGCCAAATTGATTTTTCA 
TTATCATATATAGATTATATCGCATGATTTTTCTACCTAGTCGATTAAAGTGAAGCCTCTGTTACAAATGCGATAACAATTACATCGTCTATCGG 
TGGTCTGACAGCGACAACAAATACACATTTTTACCCTCGCAATCGCAGGGTCACACTTTCGTGAAATCATATGATCGATTTGCAGTGAAAATTTT 
CAGACGTTGGGCAGAAGGCAAAAGTAACTTATCGTTTTCCACTTTCCTCGTGTTGGGCCGCCGTTTCCAACTCAGTTCGGCTGTGAATGTATTAG 
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CTTAATTAAATTTCAATTATTTCCAGGCACGGTTGTTGTACGTGGGCTTCTTTAAAACACTTGAATTTCCTTTCGGTTTGTGCAGTGAAAAAAAT 
CAGTCAAAATGGCACTGAGCGATGCTGATGTACAAAAGCAGGTAATTGAAAACTTGGATTGGGAACGGGCAGGCGATCAAGGTCGTAGGGAAACA 
AGCAAAACGAGAGGCTTCGTTTGCCTTTTTGCCTTTGCAATTTGCCTTTGCAATAAAGATGGCGAAGTCATGGGATCTCCCAGGTCATGTGAACT 
TTTCACCGCCAGTAGTCCAATTAGACTGACATCCTTCCAAATCGGCCCGGTCATTTGGGAGTTGCCGGAGTTTTGACATATTTGTTGGCTAATGA 
AGACACATCAATTTATTTGTCCAGATAGTTTGCGTAAAAAGTGAGTAAAAATTCGTGCTGGTCATGTGACACGGCCCCCGCATTGGAGCAATGTG 
TTGGAGCGAGACGACTAGCCCTGCACCCCACACTCGTACTCTCTGTCACACGACCAGCGGACCCCTTACGTTATCAAAACTTTAACGAAAATAAA 
TAGAGGCTAGGGTCTTGGACGTCTCCCTTTTCCATTTATCATGTCCAGTTATCATGTGACACACAGGCAACTACTAAACAGGACGACTGTTTCAG 
ATCAAGCACATGATGGCGTTCATTGAGCAGGAGGCCAATGAGAAAGCCGAGGAGATCGATGCCAAGGCCGAGGAGGAGTTCAACATTGAGAAGGG 
ACGCCTGGTCCAGCAGCAGCGTCTCAAGATCATGGAATACTACGAGAAGAAGGAGAAGCAAGTTGAGCTGCAGAAGAAGATTCAGTCCTCCAACA 
TGCTCAACCAGGCTCGTCTGAAGGTGCGTGTCGTCCAGTTGGTGGCCCTAACATATACCGGAAAACACCTTATTTTTAATCATTCGTMTGTACC 
CTGTAGGTGCTGAAAGTGCGCGAGGACCATGTGAGCAGCGTGCTGGATGATGCCCGCAAGCGTCTCGGCGAGGTCACCAAGAATCAGTCCGAGTA 
CGAGACTGTGCTGACCAAGCTCATCGTCCAGGGCCTGTTCCAGATCATGGAGCCCAAGGTGATCCTGCGCTGCCGCGAGGTGGACGTCCCCCTGG 
TACGTAACGTCCTGCCTGCCGCTGTGGAGCAATACAAGGCCCAGATCAATCAGAACGTCGAGCTGTTCATCGACGAGAAAGACTTCCTCTCTGCT 
GATACCTGCGGTGGTGTTGAGCTGCTGGCCCTCAACGGACGCATCAAGGTGAGTACTGTCCTTTCGGTGGAGAGAGAGCAATCCCAACTGATCTA 
ACAAACCACTTCAGGTGCCCAATACGCTGGAGTCCAGATTAGACCTCATTTCGCAGCAGCTGGTGCCCGAGATTCGTAACGCACTTTTCGGCCGC 
AACGTCAATCGCAAATTCACCGACTAAATTCTATAAGTGCAAAACAAAACATAACTAACCAGAAAGAGAACCAGCATCAACACCTATTCAGCAGG 
AACAGTTCAAGTTATTACAACAGAAGCTCCACCCACTAAATATTGAACCCAAGTAAACTTATCCTTTGGCAGTCAGGAGGCAACAGCTAGGATTA 
TATTTGATTGCAAAATACTTTTGCCGTTGTCTTTGTAAAGTGAAATTGAAACACTCAAGAACATTTCGGTCCTTGTGTACGCAACAGTTTTAATA 
GTAACCACACTAAACGCGCATATATATTCTCCGATATATATGTCTGTATGCCAATACTTATTATATAGTTTAGAGGACACGATCCTAGGAGCATA 
CGAAAGCATAATACGAGTTTGTTAAAGTTTGTTCGTTTTTTTTTTACATATGCACATATTTCTGAGCAGTAGGTCTAGATATGTGCTTATATTGT 
ATACATACACTTTAAAATTTTGCATACATTCCTGTCCAAGAATTTTTATTTCAGTTTTCCCCTCGTTTATTGTACATTATTTTCTGTAGTCTTTG 
TTAACTTTTTATATGTCTATGTCGTTTATGTTCGTAATTATCAAGTGCAGCGTTCAGGAGGAACAACGGCAGTGGATCGCCCCTTTTACAGAGCC 
GCTGGCAGGTTGCGCATGCGACCACACAGCATTGTTGCTCAGCGAAGCACCGAAATGGACCTAAACCCCCGATTTCGCTTCTTCGAGGGCAACGG 
ACGCTTGTGCAACTGCCACTGGCTCAACGAAAGCCCCGAAAATCATCAATGTCTGTTGTTGTTGAGATACCGAGAGTAGAGAATACACACTGCTT 
AGCACGCGACACTTAATACCCATTCATTACACATGCACCAACGACGATGAAGTTTTGCCAAGTAGCTAAGTTTGTTGACCTGACCATCAAGTGCA 
GCTTTCACACCCTCATATAACTACTTAAAGAAAATATAGAAAAATGGAAATTAGTTTTGCAATTTAGGGGACTGCCGAACTGCCACCGTTTCCAC 
CTGACGCTGCGCCATCATATCAGGCTCTAAAAATCAACACACCATGTTCAAACACACGACTAGCATACAGGAGCAGGAGCTACAGTAAATTTGAA 
CCTTGTATTCGCATGTTCGCCAATGTTCATAGTGTATTCTTCAAGCTCATTTTCTAACCAAGTTACCAAGTT 
(SEQ ID NO: 43) 

Exon: 1001.. 1068 
Exon; 1167.. 1276 
Exon: 1806.. 2018 
Exon: 2097.. 2423 
Exon: 2485.-2777 
Start ATG: 1244 

Transcript No. : CT1531 

TCACACTTTCGTGAAATCATATGATCGATTTGCAGTGAAAATTTTCAGACGTTGGGCAGAAGGCAAAAGCACGGTTGTTGTACGTGGGCTTCTTT 
AAAACACTTGAATTTCCTTTCGGTTTGTGCAGTGAAAAAAATCAGTCAAAATGGCACTGAGCGATGCTGATGTACAAAAGCAGATCAAGCACATG 
ATGGCGTTCATTGAGCAGGAGGCCAATGAGAAAGCCGAGGAGATCGATGCCAAGGCCGAGGAGGAGTTCAACATTGAGAAGGGACGCCTGGTCCA 
GCAGCAGCGTCTCAAGATCATGGAATACTACGAGAAGAAGGAGAAGCAAGTTGAGCTGCAGAAGAAGATTCAGTCCTCCAACATGCTCAACCAGG 
CTCGTCTGAAGGTGCTGAAAGTGCGCGAGGACCATGTGAGCAGCGTGCTGGATGATGCCCGCAAGCGTCTCGGCGAGGTCACCAAGAATCAGTCC 
GAGTACGAGACTGTGCTGACCAAGCTCATCGTCCAGGGCCTGTTCCAGATCATGGAGCCCAAGGTGATCCTGCGCTGCCGCGAGGTGGACGTCCC 
CCTGGTACGTAACGTCCTGCCTGCCGCTGTGGAGCAATACAAGGCCCAGATCAATCAGAACGTCGAGCTGTTCATCGACGAGAAAGACTTCCTCT 
CTGCTGATACCTGCGGTGGTGTTGAGCTGCTGGCCCTCAACGGACGCATCAAGGTGCCCAATACGCTGGAGTCCAGATTAGACCTCATTTCGCAG 
CAGCTGGTGCCCGAGATTCGTAACGCACTTTTCGGCCGCAACGTCAATCGCAAATTCACCGACTAAATTCTATAAGTGCAAAACAAAACATAACT 
AACCAGAAAGAGAACCAGCATCAACACCTATTCAGCAGGAACAGTTCAAGTTATTACAACAGAAGCTCCACCCACTAAATATTGAACCCAAGTAA 
ACTTATCCTTTGGCAGTCAGGAGGCAACAGCTAGGATTATATTTGATTGCAAAATACTTTT 
(SEQ ID NO: 44) 

Start ATG: 146 

MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQQQRLKIMEYYEKKEKQVELQKKIQSSNMLNQARLKVLKVREDHVSSVL 
DDARKRLGEVTKNQSEYETVLTKLIVQGLFQIMEPKVILRCREVDVPLVRNVLPAAVEQYKAQINQNVELFIDEKDFLSADTCGGVELLALNGRI 
KVPNTLESRLDLISQQLVPEIRNALFGRNVNRKFTD* 
(SEQ ID NO: 45) 

Name: vacuolar H+-ATPase E subunit-like 
Classification: enzyme 
Gene Symbol: Vha26 
FlyBase ID: FBgnO015324 



Celera Sequence No. : 142000013384244 

AATATCGAAGACGTGGTCAAAGGACATAAGGTAAGTGATATAGTCCAGTTTCTCCACCGACCGCAAGTTGAGGTACAGCTCGTAGCACTCGTTGA 
GGTCCAGATAGCGACCGCCGCCCTCCTCGTCAGTGAACTCCACTAGGGCACTCATGTCGTCCGGATTGTTGTACACCCGAATCATCTCGTCGAAC 
TCGACGGACAGCGGCACGCTGACCTCCGCCGGATGGGACTTGTAGAACTGCTTGATTTGCTTGAGGCGGGCGTAGAACTCATTGAACTCGTTGGG 
CCCCGAAAGAGCGGCAATCTCCGCCTTTCGTTCATTGTCCTTGTCCTCGTAGAGATCGCGTAGCTGGGACGTAGAGTTGTGATGCAGTTCCATGA 
GGTATTTAAGCCGGTGCTCCGAATGGATGCGCTCCTTTTCGCCCGGCTTTTTGGTCGCATGCTCGTCCACCATCAGTTTGACCAGGCGCTCGCGC 
TCCTCGTGTAGGCGCCGCTGTTGCTCCAGGAGCGTCTCCATGGCTCTTGAATGCGAGGATTTAATGCGGGGAAATATCAAATGGAGGCCTTTCAC 
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TTAGCGAAATCCCGAAGAAAACACAACTTTTCGTCAGCGTGTGTACATTAAAAAATACACTTACAATAAAGCAGGGATGGCTTATATTGTCACAT 

CATATTATTGTCAGGTTATTATACTCCGATCACACTGGCAATTAATGTGGATTTCGGGAATGCAGATGTATCAAATTAACGCTCGATTATTCGAG 

TAATATAGTAATGATATTTAAAGAGCCAAATTGATTTTTCATTATCATATATAGATTATATCGCATGATTTTTCTACCTAGTCGATTAAAGTGAA 

GCCTCTGTTACAAATGCGATAACAATTACATCGTCTATCGGTGGTCTGACAGCGACAACAAATACACATTTTTACCCTCGCAATCGCAGGGTCAC 

ACTTTCGTGAAATCATATGATCGATTTGCAGTGAAAATTTTCAGACGTTGGGCAGAAGGCAAAAGTAACTTATCGTTTTCCACTTTCCTCGTGTT 

GGGCCGCCGTTTCCAACTCAGTTCGGCTGTGAJ^TGTATTAGCTTAATTAAATTTCAATTATTTCCAGGCACGGTTGTTGTACGTGGGCTTCTTTA 

AAACACTTGAATTTCCTTTCGGTTTGTGCAGTGAAAAAAATCAGTCAAAATGGCACTGAGCGATGCTGATGTACAAAAGCAGGTAATTGAAAACT 

TGGATTGGGAACGGGCAGGCGATCAAGGTCGTAGGGAAACAAGCAAAACGAGAGGCTTCGTTTGCCTTTTTGCCTTTGCAATTTGCCTTTGCAAT 

AAAGATGGCGAAGTCATGGGATCTCCCAGGTCATGTGAACTTTTCACCGCCAGTAGTCCAATTAGACTGACATCCTTCCAAATCGGCCCGGTCAT 

TTGGGAGTTGCCGGAGTTTTGACATATTTGTTGGCTAATGAAGACACATCAATTTATTTGTCCAGATAGTTTGCGTAAAAAGTGAGTAAAAATTC 

GTGCTGGTCATGTGACACGGCCCCCGCATTGGAGCAATGTGTTGGAGCGAGACGACTAGCCCTGCACCCCACACTCGTACTCTCTGTCACACGAC 

CAGCGGACCCCTTACGTTATCAAAACTTTAACGAAAATAAATAGAGGCTAGGGTCTTGGACGTCTCCCTTTTCCATTTATCATGTCCAGTTATCA 

TGTGACACACAGGCAACTACTAAACAGGACGACTGTTTCAGATCAAGCACATGATGGCGTTCATTGAGCAGGAGGCCAATGAGAAAGCCGAGGAG 

ATCGATGCCAAGGCCGAGGAGGAGTTCAACATTGAGAAGGGACGCCTGGTCCAGCAGCAGCGTCTCAAGATCATGGAATACTACGAGAAGAAGGA 

GAAGCAAGTTGAGCTGCAGAAGAAGATTCAGTCCTCCAACATGCTCAACCAGGCTCGTCTGAAGGTGCGTGTCGTCCAGTTGGTGGCCCTAACAT 

ATACCGGAAAACACCTTATTTTTAATCATTCGTAATGTACCCTGTAGGTGCTGAAAGTGCGCGAGGACCATGTGAGCAGCGTGCTGGATGATGCC 

CGCAAGCGTCTCGGCGAGGTCACCAAGAATCAGTCCGAGTACGAGACTGTGCTGACCAAGCTCATCGTCCAGGGCCTGTTCCAGATCATGGAGCC 

CAAGGTGATCCTGCGCTGCCGCGAGGTGGACGTCCCCCTGGTACGTAACGTCCTGCCTGCCGCTGTGGAGCAATACAAGGCCCAGATCAATCAGA 

ACGTCGAGCTGTTCATCGACGAGAAAGACTTCCTCTCTGCTGATACCTGCGGTGGTGTTGAGCTGCTGGCCCTCAACGGACGCATCAAGGTGAGT 

ACTGTCCTTTCGGTGGAGAGAGAGCAATCCCAACTGA7CTAACAAACCACTTCAGGTGCCCAATACGCTGGAGTCCAGATTAGACCTCATTTCGC 

AGCAGCTGGTGCCCGAGATTCGTAACGCACTTTTCGGCCGCAACGTCAATCGCAAATTCACCGACTAAATTCTATAAGTGCAAAACAAAACATAA 

CTAACCAGAAAGAGAACCAGCATCAACACCTATTCAGCAGGAACAGTTCAAGTTATTACAACAGAAGCTCCACCCACTAAATATTGAACCCAAGT 

AAACTTATCCTTTGGCAGTCAGGAGGCAACAGCTAGGATTATATTTGATTGCAAAATACTTTTGCCGTTGTCTTTGTAAAGTGAAATTGAAACAC 

TCAAGAACATTTCGGTCCTTGTGTACGCAACAGTTTTAATAGTAACCACACTAAACGCGCATATATATTCTCCGATATATATGTCTGTATGCCAA 

TACTTATTATATAGTTTAGAGGACACGATCCTAGGAGCATACGAAAGCATAATACGAGTTTGTTAAAGTTTGTTCGTTTTTTTTTTACATATGCA 

CATATTTCTGAGCAGTAGGTCTAGATATGTGCTTATATTGTATACATACACTTTAAAATTTTGCATACATTCCTGTCCAAGAATTTTTATTTCAG 

TTTTCCCCTCGTTTATTGTACATTATTTTCTGTAGTCTTTGTTAACTTTTTATATGTCTATGTCGTTTATGTTCGTAATTATCAAGTGCAGCGTT 

CAGGAGGAACAACGGCAGTGGATCGCCCCTTTTACAGAGCCGCTGGCAGGTTGCGCATGCGACCACACAGCATTGTTGCTCAGCGAAGCACCGAA 

ATGGACCTAAACCCCCGATTTCGCTTCTTCGAGGGCAACGGACGCTTGTGCAACTGCCACTGGCTCAACGAAAGCCCCGAAAATCATCAATGTCT 

GTTGTTGTTGAGATACCGAGAGTAGAGAATACACACTGCTTAGCACGCGACACTTAATACCCATTCATTACACATGCACCAACGACGATGAAGTT 

TTGCCAAGTAGCTAAGTTTGTTGACCTGACCATCAAGTGCAGCTTTCACACCCTCATATAACTACTTAAAGAAAATATAGAAAAATGGAAATTAG 

TTTTGCAATTTAGGGGACXGCCGAACTGCCACCGTTTCCACCTGACGCTGCGCCATCATATCAGGCTCTAAAAATCAACACACCATGTTCAAACA 

CACGACTAGCATACAGGAGCAGGAGCTACAGTAAATTTGAACCTTGTATTCGCATGTTCGCCAATGTTCATAGTGTATTCTTCAAGCTCATTTTC 

TAACCAAGTTACCAAGTTCAATATGATGAATAACTACAAGATTAGCAAACAAATACAAGTAGCATATGCGTTATTATATAACATCGAGTACTATA 

T AC AT TAC ATGAAATACAAAATGCAAGAAAAATTACTTT T AAAC AAAATTTATGTTGAATAAAAAAC AGTATTTCCAAAAACT AAAC TT AAC T GT 

ATAACAACTTCCTTTTGCAATGTTCTAATGATCCTAAAAACAAGACATGGGGTAAACTATTTTAAGAAATTCAATCTAGGACTTCAAATAAGTCT 

TAATTAAGTACACCCATATTCTTGATCAGCAGAGTCAACTTGCTCATGTTTGTATGAAAACCTAGAACATAAAAAAGCCAGAAGGTCTGGACTAA 

ACTCGCAGATTCCTGTAACTTGGAAGCACCACGAGTTTGCTTTAGAAGATGCCTTTCCGTTGAACTAAAACTAGACGAGTAAAGCGTATCTCAAA 

CATTGTATCTTCCGAACCAAAATAGTGCTGCATACTTTTCGGGATATGTCTTATTCAATGTAAGTAATACAAAAAGAAAAAAGGTAACTTCTTAA 

AACTTGTAATATTTGTCACAAATGAATGACGACTAGATTTGGCCATGTCTGTAACAATAAACGTACAAAATATTCGAAGTTGCAGAAATGGGGAT 

CCAGCATTCGGATTTGGTTTTTTGTCAAACTTTTAAATATAGAATCGGACCACATCTTTTTAAGACCTTAAAAAACTGATTTGAAGTTAGACCCC 

TCATAATGCCAACTAAATTTGTTAAACAAAAATATGCCCCTATATGCCAATCAAATTTAAATACAATGCTTTCTCAGCGTTTCAGTTAAATACTT 

TGGGTTGTTTAATGGTTTTATACGTATACATGTGCTAAGTCCATGTATCTTATATTTCGTAAATCCATAGATTTTGTTCGACGAAACTTTGCTAA 

ATGATATTGATCTTTCAAATATAACTTGTTAATTTCCTTTCATCGCCGTGGCAAATCCTTCGAGAGTGAGTACTACTTGTTAAACGCATATATGA 

GGAAAATTTATTGATGAATAATGCGAACATGCAAACGGAACACATCCCCGAATTCACACAAAGCATCTCTGTGGTATCTCTTTTGGTTTTTTATT 

TGTTGCATCAACTCCGCGGAACTCCAGCTAC 

(SEQ ID NO: 46} 

Exon: 1001.. 1222 
Exon: 1752.. 1964 
Exon: 2043.. 2369 
Exon: 2431. .3876 
Start ATG: 1190 

Transcript No. : CT1535 

GGCAGAAGGCAAAAGTAACTTATCGTTTTCCACTTTCCTCGTGTTGGGCCGCCGTTTCCAACTCAGTTCGGCTGTGAATGTATTAGCTTAATTAA 
ATTTCAATTATTTCCAGGCACGGTTGTTGTACGTGGGCTTCTTTAAAACACTTGAATTTCCTTTCGGTTTGTGCAGTGAAAAAAATCAGTCAAAA 
TGGCACTGAGCGATGCTGATGTACAAAAGCAGATCAAGCACATGATGGCGTTCATTGAGCAGGAGGCCAATGAGAAAGCCGAGGAGATCGATGCC 
AAGGCCGAGGAGGAGTTCAACATTGAGAAGGGACGCCTGGTCCAGCAGCAGCGTCTCAAGATCATGGAATACTACGAGAAGAAGGAGAAGCAAGT 
TGAGCTGCAGAAGAAGATTCAGTCCTCCAACATGCTCAACCAGGCTCGTCTGAAGGTGCTGAAAGTGCGCGAGGACCATGTGAGCAGCGTGCTGG 
ATGATGCCCGCAAGCGTCTCGGCGAGGTCACCAAGAATCAGTCCGAGTACGAGACTGTGCTGACCAAGCTCATCGTCCAGGGCCTGTTCCAGATC 
ATGGAGCCCAAGGTGATCCTGCGCTGCCGCGAGGTGGACGTCCCCCTGGTACGTAACGTCCTGCCTGCCGCTGTGGAGCAATACAAGGCCCAGAT 
CAATCAGAACGTCGAGCTGTTCATCGACGAGAAAGACTTCCTCTCTGCTGATACCTGCGGTGGTGTTGAGCTGCTGGCCCTCAACGGACGCATCA 
AGGTGCCCAATACGCTGGAGTCCAGATTAGACCTCATTTCGCAGCAGCTGGTGCCCGAGATTCGTAACGCACTTTTCGGCCGCAACGTCAATCGC 
AAATTCACCGACTAAATTCTATAAGTGCAAAACAAAACATAACTAACCAGAAAGAGAACCAGCATCAACACCTATTCAGCAGGAACAGTTCAAGT 
TATTACAACAGAAGCTCCACCCACTAAATATTGAACCCAAGTAAACTTATCCTTTGGCAGTCAGGAGGCAACAGCTAGGATTATATTTGATTGCA 
AAATACTTTTGCCGTTGTCTTTGTAAAGTGAAATTGAAACACTCAAGAACATTTCGGTCCTTGTGTACGCAACAGTTTTAATAGTAACCACACTA 
AACGCGCATATATATTCTCCGATATATATGTCTGTATGCCAATACTTATTATATAGTTTAGAGGACACGATCCTAGGAGCATACGAAAGCATAAT 
ACGAGTTTGTTAAAGTTTGTTCGTTTTTTTTTTACATATGCACATATTTCTGAGCAGTAGGTCTAGATATGTGCTTATATTGTATACATACACTT 
TAAAATTTTGCATACATTCCTGTCCAAGAATTTTTATTTCAGTTTTCCCCTCGTTTATTGTACATTATTTTCTGTAGTCTTTGTTAACTTTTTAT 
ATGTCTATGTCGTTTATGTTCGTAATTATCAAGTGCAGCGTTCAGGAGGAACAACGGCAGTGGATCGCCCCTTTTACAGAGCCGCTGGCAGGTTG 
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CGCATGCGACCACACAGCATTGTTGCTCAGCGAAGCACCGAAATGGACCTAAACCCCCGATTTCGCTTCTTCGAGGGCAACGGACGCTTGTGCAA 

CTGCCACTGGGTCAACGAAAGCCCCGAAAATCATCAATGTCTGTTGTTGTTGAGATACCGAGAGTAGAGAATACACACTGCTTAGCACGCGACAC 

TTAATACCCATTCATTACACATGCACCAACGACGATGAAGTTTTGCCAAGTAGCTAAGTTTGTTGACCTGACCATCAAGTGCAGCTTTCACACCC 

TCATATAACTACTTAAAGAAAATATAGAAAAATGGAAATTAGTTTTGCAATTTAGGGGACTGCCGAACTGCCACCGTTTCCACCTGACGCTGCGC 

CATCATATCAGGCTCTAAAAATCAACACACCATGTTCAAACACACGACTAGCATACAGGAGCAGGAGCTACAGTAAATTTGAACCTTGTATTCGC 

ATGTTCGCCAATGTTCATAGTGTATTCTTCAAGCTCATTTTCTAACCAAGTTACCAAGTTCAATATGATGAATAACTACAAGATTAGCAAACAAA 

TACAAGTAGCATATGCGTTATTATATAACATCGAGTACTATATACATTACATGAAATACAAAATGCAAGAAAAATTACTTTTAAACAAAATTTAT 

GTTGAATAAAAAACAGTATTTCC 

(SEQ ID NO: 47) 

Start ATG: 190 

MALSDADVQKQIKHMyiAFIEQEANEKAEEIDAKAEEEFNIEKGRLVQQQRLKIMEYYEKKEKQVELQKKIQSSNMLNQARLKVLKVREDHVSSVL 
DDARKRLGEVTKNQSEYETVLTKLIVQGLFQIMEPKVILRCREVDVPLVRNVLPAAVEQYKAQINQNVELFIDEKDFLSADTCGGVELLALNGRI 
KVPNTLESRLDLISQQLVPEIRNALFGRNVNRKFTD* 
(SEQ ID NO: 48) 

Name: vacuolarH+-ATPase E subunit 
Classification: enzyme 
Gene Symbol: Vha26 
FlyBase ID: FBgn0015324 



Celera Sequence No. : 142000013384852 

CAAAGCCTGCAGATCGCTGTAAATTAAATTTTTACTAAAGGTATCAAAAGACTAAGGTCGCTATGAATCGTTGAAAATATCCAAAATGGCCAGAG 

TTTGGATCAATGAAATTCGTTTAGTGTAAATTGAATAATACCATTTACTCGTAGAGTACAAGTGTACACTAGATTCATTGCAATATAAACTCCGA 

GATCTCGGATTCATCTTTGTTTTAAGTTTTATATCCATTTTTATAATTTGAATACAAAGTCTAGTGTAAGACAGTTATACCCGTTTCTCGTAGAG 

TAAAAGGGTATACTAGATTCGTTTAAAAGTATTTTATAGATAGAAGGAAATTTTTTCCGACCATAAAAAGTATATTAATTCTTGATCAGGATCAA 

TAGCCGAGTCGATCTGGCCTGGTCCGTCTGTCTGTCCGTCTGTGTGAACGTCGAGATCTCTGGTACTATAAGGTTGAGATTAAGCATACAGATTC 

TTAAGACATAGAGGCAGCGCAAGTTTGTTGACCCATATTTTGAAAAATGTTTTGATATGTTTTTATAATTTTATTAGACTTGTAAATTTCTATCG 

ATTTGCCAAAAAAATTTGCCACGCCCACTCTAACGCCCTAAAACCGCCCAAAACTGCCACGCCCACACTATTGCAAAATGTTTACATTTTTTTCA 

TAATTTTATAAGTCTTGTAATATTCTATCGGTTAGCCAAACAATTTTTTTCACGTCCACTCTAACGACCTTAAGCTTCCAAACCGGTCACGCCCA 

CACTTTTAAGCCATTTTTAATTTGTTTTTCATTTTATTCCCCATCTATCGATATCCCAGAAAAATTATGAAATTTCGCGTTCGCATTCGCACTAG 

CTGAGTAACGGGTATCTGATAGTCGTGCAACTCGACTTAAGCATTCTCTCTTGTTTTACATTTCAGTTTTTGGCCACAAGCCGCCCGTATGCATG 

ACCCACGCTTTCAAAATTTTTTATTATATATTTCTTTTTTTTATTTTTTTCCCAACTTCAATCGATATGGAAGAAATTTCTCGTTTGCTCTTTCA 

CTAGCTGAGTAACGGGTATCTGATATTTGGGTATCTTAACTTCGCTTGTTTTTTAGTGAAAGCCATTTTTTTCAAATGGTTTGTACACCTTTATG 

AGAATGTTATTGCTTTTATTGTATCTCGTAAACTAAATATATGGTAGAAATGGTCTTGATCTTTTAAACACAAGTCATAGGGTTTTCCGTTTGGA 

ATCCACAAGAGTATATGTCGTAAAAGTCCTTGTTCTGTTTAAGGCCCAGATAGAAGACATGCTCGGCCAGATCCTTCCCAAACTCACAGAAGTTC 

TGAGCGTAAAGAGTAAAAAATATACGCTTGTTGTCAAGATGGTACTTTTGAAGGACGCCATTTTCCTGATCGTATATCCCACTGTTCTGTGGAGG 

CGAGTTGATCCAGGCGCTGTAGTCTATCCAGGCCGCTCGGAAGCTCTTTAAGTAGATCACAGCCAGCATTCTGGCCTCATCCAGAACATTAACGA 

ACTGCGGCAGCCTTAAACACTCCACATCTTCGCGAAACCGCGTGAAATCACGCTGACGCTCCAGACTATTAAAAAGCATCATGTACAAATGGACC 

GTAAACGTGGAGTAGAAGTAGGACATGGGCTTGTTGCGGCTCTGAAAGAGGTACATGCTCCAATCGAGCATGGTGCGCTGAACCCCCAATTCGTA 

TGTTGAATTCAGTTTTCTGTGCTGCTCCAGATTGCGAACGTTGTGTTCGTAGCTCTCCCGGTAGATCCGCAGGTCCTTGTACTCCCGGCTCAGGT 

CTCTAGCATGGAAGTAGACTCGCGGCGTTTTAAATTTCTGCTTATACGTGGCCAGCTTGCTGAGAATGGGATTCGACTCTCTGTCCTTCTGAAAC 

CTGACGACGAAGCGGTCCAAAATGCCCAGAAAGTCATCCAGCTCGTGGGATTCTCCATTGAGCATGTTTTTTGTAACTATATACCCGAAAAGTGG 

CTTGAAATACTCATAGGTCTCACGATAGCAGTCTTCCAGCGCGTGGTGCACGTTGCAGGAGATGAAGAAGTCCAGCGTCTTCTGCTTAGCCTCGA 

ATGGCTCTGGATCGTTCTGCAGCTCGGTCAGCAACTGCATAAGATCGCCCATTTGCGGCATGTGTCCTTGGATAGGGTATATGCATTAGGCACTC 

CCCCAACACAGGCTTTCCTCACTTACCCATGATCGAAAAGAGGGGTCGATTGCGGCTACAGACGTAGTCGAAATAGCTCTCGCAGGCATACTTTT 

CGGTGTTGATGGAGCCCTGTATACGAACCGACAATTTTTCAAGTTCCCTTATCCGCAGCTCCGACAGGTGCCGCACGTATCCCACTTGGCAATGC 

CCTGAACCAGTCAACATCAACAGAATAAGCGCTACGGATGCGCCGATTGCAAGACTTGAGGTTGTCATCGCAGAGCTCAAGTCACTGGCAACCCA 

GCTTTTGCGCCAGCCTATCGGACCCCTCAACGGATGTCGGATTTCGTTGCACAGTAGTTCGAGAGAGAGTCACTTTTCAAATACGTCTATCACTG 

GGCAACGAAACTATTTAACTTCAATTGAAATGTAAGCACAAAACTCCGTATCCCCTGGTATTAAATAACTAAAATAACTTATTGAATGTTTCAAT 

TTTTAGTAAAAACTTTTTTATTACTTTTGACAATATTTAAAAAGATTTTCTGTCCAAGTGGACAATAATTAAATGATTGTGGGCTGACATTGACA 

TAATTTATXTTTATCATAATGAACGTTTGAAAAGCATAGATAGTATTGAAAAATGCCCTATTAGTGCAACTCAGTATATGTTTTTATGGTGTGTT 

TTCAACATTTTCAACTTATGCTTCTTCAACGTGTTTAACCAAAAATTTATCCGGACTTTAGAAGAACGTTTTGTTGCAGACAGTTACAAACAGTT 

CTCCAAAATCACAATAATCAATCTTTACCGTTTTACCTTTTTTATTTTATCGCGCACATATAAACGGTCTAAAAACCAATTAAACGCTCACATTC 

CTATTTTATCATTTTTACTTGAAAATGCAAAAACACCCGAAAGTTCTCGGGTTGTTTGTTGTTTTTATATTGTTGAATTATTAGAACGGATTTAT 

ATATGATCTTATTACACTCGCACCACAGTGCATTGCTCAATTCGAGGTCTTATTTTAATTCAGTGAGTGAGTTCCTATGCTTAATTATATTTCCG 

AAGCGATAAGAATGTCAAGTCCTATCAGGAAAATTTCTCTCCCACTGTTTTTACGATCGCGCTATTCTGGAATATGGGCTTTGCATCGTTTCACA 

AGTATTACGAGTTATATCTGACTAAGCGTAGACAACAGTGTGCAGTTGCTTATGAAAGGAATGATTTAAATATACATATACATACATAATATACA 

TATATTATATTATATTTGTTGGT 

(SEQ ID NO: 49) 

Exon: 2443.. 2212 
Exon: 2156.. 1001 

Start ATG: 2443 (Reverse strand: CAT) 
Transcript No. : CT1569 

ATGACAACCTCAAGTCTTGCAATCGGCGCATCCGTAGCGCTTATTCTGTTGATGTTGACTGGTTCAGGGCATTGCCAAGTGGGATACGTGCGGCA 
CCTGTCGGAGCTGCGGATAAGGGAACTTGAAAAATTGTCGGTTCGTATACAGGGCTCCATCAACACCGAAAAGTATGCCTGCGAGAGCTATTTCG 
ACTACGTCTGTAGCCGCAATCGACCCCTCTTTTCGATCATGGGACACATGCCGCAAATGGGCGATCTTATGCAGTTGCTGACCGAGCTGCAGAAC 
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GATCCAGAGCCATTCGAGGCTAAGCAGAAGACGCTGGACTTCTTCATCTCCTGCAACGTGCACCACGCGCTGGMGACTGCTATCGTGAGACCTA 
TGAGTATTTCAAGCCACTTTTCGGGTATATAGTTACAAAAAACATGCTCAATGGAGAATCCCACGAGCTGGATGACTTTCTGGGCATTTTGGACC 
GCTTCGTCGTCAGGTTTCAGAAGGACAGAGAGTCGAATCCCATTCTCAGCAAGCTGGCCACGTATAAGCAGAAATTTAAAACGCCGCGAGTCTAC 
TTCCATGCTAGAGACCTGAGCCGGGAGTACAAGGACCTGCGGATCTACCGGGAGAGCTACGAACACAACGTTCGCAATCTGGAGCAGCACAGAAA 
ACTGAATTCAACATACGAATTGGGGGTTCAGCGCACCATGCTGGATTGGAGCATGTACCTCTTTCAGAGCCGCAACAAGCCCATGTCCTACTTCT 
ACTCCACGTTTACGGTCCATTTGTACATGATGCTTTTTAATAGTCTGGAGCGTCAGCGTGATTTCACGCGGTTTCGCGAAGATGTGGAGTGTTTA 
AGGCTGCCGCAGTTCGTTAATGTTCTGGATGAGGCCAGAATGCTGGCTGTGATCTACTTAAAGAGCTTCCGAGCGGCCTGGATAGACTACAGCGC 
CTGGATCAACTCGCCTCCACAGAACAGTGGGATATACGATCAGGAAAATGGCGTCCTTCAAAAGTACCATCTTGACAACAAGCGTATATTTTTTA 
CTCTTTACGGTCAGAACTTCTGTGAGTTTGGGAAGGATCTGGCCGAGCATGTCTTCTATCTGGGCCTTAAACAGAACAAGGACTTTTACGACATA 
TACTCTTGTGGATTCCAAACGGAAAACCCTATGACTTGTGTTTAAAAGATCAAGACCATTTCTACCATATATTTAGTTTACGAGATACAATAAAA 
GCAATAACATTCTCATAAAGGTGTACAAACCATTTGAAAAAAATGGCTTTCACTAAAAAACAAGCGAAGTTAAGATACCCAAATATCAGATACCC 
GTTACTCAGCTAGTGAAAGAGCAAACGAGAAATTTCTTCCATATCGATTGAAGTTGGG 
(SEQ ID NO: 50) 

Start ATG: 1 (Reverse strand: CAT) 

MTTSSLAIGASVALILLMLTGSGHCQVGYVRHLSELRIRELEKLSVRIQGSINTEKYACESYFDYVCSRNRPLFSIMGHMPQMGDLMQLLTELQN 

DPEPFEAKQKTLDFFISCNVHHALEDCYRETYEYFKPLFGYIVTKNMLNGESHELDDFLGILDRFWRFQKDRESNPILSKLATYKQKFKTPRVY 

FHARDLSREYKDLRIYRESYEHNVRNLEQHRKLNSTYELGVQRTMLDWSMYLFQSRNKPMSYFYSTFTVHLYMMLFNSLERQRDFTRFREDVECL 

RLPQFVNVLDEAP^LAVIYLKSFRAAWIDYSAWIWSPPQNSGIYDQENGVLQKYHLDNKRIFFTLYAQNFCEFGKDLAEHVFYLGLKQNKDFYDI 

YSCGFQTENPMTCV* 

(SEQ ID NO: 51) 



Celera Sequence No. : 142000013385194 

GCGTCCTCCGGCGATGTGTCTGCCGAGGCGTTGGCATTCTCCCCCTCGAGCTCCGCCTCATCGTCGTCGTCTGGCAAGGGGCCGGGAATCGGTGT 
CTCACCGTTCTTCAGGCAATTGTGGATGTAGGCGGCCTTCCACTTGGCGTACTTCCTGTTGTGCAGCGCCTCTTCGCTCAGTTCACCGAAGGTCT 
GGAGAATGTCGTAGAGCACACCACTGGAGTAGAAGGCCTTAACGACGTTTCTGCGAAGAAGCACAATCTTATGAATCAGGAAAATCGATTCAGGG 
TATTTCTACGAAGGGAACGCACTTTCCGAAGTTCTCCTCGCGATCCTGTTTGTCGGCGTACAGGAATAGTTTCAAGGCGTAGTTCTCGATGTGTG 
CCTGGGCAGCCACCTCGTTGGTGATGGCCTCGTTCTCAGCATACTGTTTCTTCATCTGCTCCAGCCAGTCCATGATGCCTGGCGGTCGTAGGGAA 
AACCAAAACAAAACAAACCACGGATGAGTCATAGGCTGGTAATATGCGTATCGGATTTTTCACTCACCCAGCAGCAGTTTGGTTTCTTCACCCGT 
TTGGGTGGAGGCTTTGAGGCCCACCTGGAGAGCACTGGATTCGGAAGATGAGTAAATACGGGCAAAACAAGAAAAACTAATCTGTAGATACTCAC 
TAGAGTCTCGCCCAGTAGGCGATGACCACATCCCGGGTATCATGTTCCTGGGCCAGCTTCAGAAAGTGTTGGATGGACTTCAGCGAGGGAGGACA 
AGGCGGAAACTCCATCGTGGTGGAAAGAGGGCTTAATCTTTAGTTCTATGCTTAATTACGGATCTGGAAAATTATGCCGACAACAATGGAAATGA 
CGAGCACGCAGAGTGACCAGATATCGCAGTTTCTCGAATGCCATGGAACCACCCCAGTTCCAAACTCTGGAACTAGCGAAATACTAAAGTTTGTA 
CCTTCGGCTAAAGTTGTTGGTTCGGCGCTCAGTTGCGAATCTGCGACCAAAACGTTTGGAGTTTCTCAGGTAAGCACTGGACTCTGGGAACTGGT 
TTTCGCTGTTATCAGTGCGACCAGTTGCACTTTGCACTTTGACCTGCATCCTTCACACCAGTCACATTCCAGGCACATCTCTGCACCACCGGCAA 
CATGGTTCTCTCCAAGCCCCTGTACTCGCTCTTCGGCACTTATCTGGAGCAGCTCTTCAACCACCCGGTCCGCACCAAATCCATTACGGCGTAAG 
TACACTATTCGCCTTATCATCCGGTTCGCCTCACCCACCGGCTGCTCCTTACTGCAGATGCGTTCTGGCCACCTCGGCTAATGTCACCTCCGAGC 
GTTTGGCGGGCGCCAAGACCCTCAACCAGCAGAGCGTATTCGCCTACGGACTCTTTGGACTGATCTTCGGCGGTAGTGTGCCGCACTACTTCTAC 
ACGACGGTGGAGCGACTCTTCAGCCAGGATGTGCGCTTCCGAAGATTCTTCCTGTTCCTGTCCGAGCGACTGGTCTATGCTCCGATTTACCAGGC 
ACTGTCCCTGTTCTTCTTGGCCCTGTTCGAGGTGAGAATCCATAATCCGCAATCTCAGTTCCTAGAACTATAATTTGTAATCCCTTACAGGGCAA 
GTCTCCCAGCACTGCATTGAAGAATGTTGAGAAGCTCTACTGGCCTCTGCTGAAGGCCAACTGGCAGTACCTGTCCGTGTTCGTGTACCTCAACT 
TCGCCTACGTGCCCCCGATGTTCCGGTCCATCAGCATGGCCATCATCTCCTTCATCTGGGTGGTGTACATCGCCCAGAAGCGTCGCCGCTTCCAG 
GACAAGCTGGCCGCCAAAGAGGCAGCCAAATAGGCTATACGACCCTTAAACCCTGTGGTAGTCGTCTTGGCCTAATATTCCAATTACTGCTTAGA 
TTACGGCTGAACTGCTTCAAGTTCTTTATCTTCCCGCGTTCCGGCGGATTGCAAAAAGTCCGTGTGTATTTGAACACTTTAGATAAGCTAAAACT 
AACTGGAGGGGCGCCTTATCAACCGACTGTTGTTATACCTTAAACCCTAAATTTTGTAAACTTTTCTTACCAAAATAAACTAATATACAATGTAT 
AAGATATCTAGATATCCATGGCCAAATTTGGCCAACAGAAATGCTTAACAAGTAGCGACAATCGTCGATCTAACATCCCATTTCCATTGCGCACA 
ATGGATCCCAATTTGAGAAATTCCAAGGCAACCTTTTGAGATGCAACAAATAATGTCTGTTTTTTTTTTATTAAAACGCAGCTTAGAACTAGCAA 
ATAATCGAGCAAAGGCTTACAAACGAGAATTATTGGCTTTTTATCGGCGCTACAATATAATTTCGCTAGTAAATTCATGGGCAGATGACTAACTT 
AGGGCTACGGAATGCAGTGTAAAACAAGTTAAATGCGCATAACAACTATCTTAAAACTAGGATTCTGACTAGAGGAGGTGACTACGATGATGGTG 
ATGATTGTGGGGGGCACCTGTGCCTCAAGACTCGCCGCACATCCTGCGCAGCGACGCGGTTAACACTCAGAAGGCGGCCATCGACGGACTCCGGC 
TGTTGCGCAGGGGCGTGAGAAGGGGCGTGGCAGGACTGCTGGCGCGGCTAAAGCAACTGGAGCAGCAACTAGAGGCATCCTCCACCTCGCCCTCG 
GCGTTCGCCTTGTGGTGATAGGGAATCGGTCGCTTGCGAGCGCTGAAAGCATAATGAATAGATTAGTATGGGATTCGTGTAGCAAGCTTAAAAAC 
CAAAGGGCAAATATGAAAGTAACTCACCAAACTGAAAATCGTAAGGAATAGAAAAGCAGTGGCTAAAAATCAAACAAAAGAAATCCTATAAAACC 
ATATAATCGGTAAAGTGACAA 
(SEQ ID NO: 52) 

Exon: 1001.. 1019 

Exon: 1109.. 1230 

Exon: 1293.. 1551 

Exon: 1611. .1871 

Start ATG: 1142 

Transcript No. : CT1579 

AACGTTTGGAGTTTCTCAGACATTCCAGGCACATCTCTGCACCACCGGCAACATGGTTCTCTCCAAGCCCCTGTACTCGCTCTTCGGCACTTATC 
TGGAGCAGCTCTTCAACCACCCGGTCCGCACCAAATCCATTACGGCATGCGTTCTGGCCACCTCGGCTAATGTCACCTCCCAGCGTTTGGCGGGC 
GCCAAGACCCTCAACCAGCAGAGCGTATTCGCCTACGGACTCTTTGGACTGATCTTCGGCGGTAGTGTGCCGCACTACTTCTACACGACGGTGGA 
GCGACTCTTCAGCCAGGATGTGCGCTTCCGAAGATTCTTCCTGTTCCTGTCCGAGCGACTGGTCTATGCTCCGATTTACCAGGCACTGTCCCTGT 
TCTTCTTGGCCCTGTTCGAGGGCAAGTCTCCCAGCACTGCATTGAAGAATGTTGAGAAGCTCTACTGGCCTCTGCTGAAGGCCAACTGGCAGTAC 
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CTGTCCGTGTTCGTGTACCTCAACTTCGCCTACGTGCCCCCGATGTTCCGGTCCATCAGCATGGCCATCATCTCCTTCATCTGGGTGGTGTACAT 
CGCCCAGAAGCGTCGCCGCTTCCAGGACAAGCTGGCCGCCAAAGAGGCAGCCAAATAGGCTATACGACCCTTAAACCCTGTGGTAGTCGTC 
(SEQ ID NO: 53) 

Start ATG: 53 

MVLSKPLYSLFGTYLEQLFNHPVRTKSITACVLATSANVTSQRLAGAKTLNQQSVFAyGLFGLIFGGSVPHYFYTTVERLFSQDVRFRRFFLFLS 
ERLVYAPIYQALSLFFLALFEGKSPSTALKNVEKLYWPLLKANWQYLSVFVYLNFAYVPPMFRSISMAIISFIWVVYIAQKRRRFQDKLAAKEAA 

{SEQ ID NO: 54) 

Name : putative transporter 
Classification: transporter 

Celera Sequence No. : 142000013384359 

ATTAAAGAGCTCCGCCATCTCCATCTTCATTCGCCATCCCAACCCATCGAACCCAAGCTCGACCTCGAAGAAAGGCCGCCACAGAAAGTCATCGA 

ACTTTCTCCGTGTTCTTTCGCCGACTAATTTCTATATTAGGTGTCATGGTCTCGGCTGTCGGTGGTTCTATTAATTACCCCAACAGCTGCGATCA 

TTTATAACTTAGTCGGGGTAGTCAATCTTGCGATCTGTTACAATTTGTAAGTGGAAAACCCGTGGAGTCAGTGGAGCTCCCAAGGTGCGCAAGGT 

CGAACAAATTTGCCTAAACACAATCGCAAAGATTAGCAGCCAAATTGCCGTGTCAAGTGAACCTCTCGAAGTTGGGAACCCAGTGGCTCCTCCAG 

AAGAAAGAAAAGATCCGACCCGACCCGGCTATTATACAATCGCGTGACTCTTCTCAGAGCTGCTGCGAATATTTGTTGCCCAATTCACCTTTGTA 

AACGGTGACATGCGAACAGAATGGTTATGGATAGAATGGTTAGATGGATGGAGCTGCGGATGGTGCCAATGGTGGCCCCAAGCAGCGTCTGAGCT 

GTAATTAATAGCAGCTTACGGGTTAGTCGCAAATTAGCCACATGACACAGCGCCAGCGAAGGCGTAGGAGTCTGGAGACATAACCGAGACGGTTC 

CTCGTGGGCGGAACTGACCCCATCGGGGAGAACTCCTAAGCTCTTTAGCACTTCCCTTCGTTTTTACCAGCGACGGTTGAAAAAGTCGCTAACAG 

CATCCGTTCGATTGGAGGCGTTGCGTTTGACGAGTTTTACAGCGTGGGTGTGTGTGCTTACCACTCTAATTTACTCGAAATACATTAAGTTTGAT 

TTAAGTCGTAAATATGCTAATAAAAAGCATTTAACTCGGATTAAAACTCATTATTTATACTGCAAAACTTTACACTGTCTATTAGTCAACTTCTA 

TAAACGAAGTGTGGATAAAATATTAAAACAAAAATATTTATGATTTATACATATGTCCATATGCTTTTTATTTCAGAAGTTTGTTTTTTTGGGGT 

TTACTAAGTACTAGAATATATTAAACACTCGCAGCCAGCCTGCTCTGATCGCAACTAGTTGAGGGGATTCGATGTCTTGGCGCTCTCCTGGCGGA 

TGGCGTTGATCAGCTCGTAGTTGACCAGGAAGGTGAAGCGCAGACCGGATATTTCCACTGTGCAGTTGTGACCGAGTCGAGCTTTGTTGGCGGAT 

AGCAAAGGAGTGCCGTCGATAAAGATGGCCCTCTTTCCCTCGTTAGCGATGAAGAAATCCCCATTGCTGCGCAGCTTAATGGTTCCTTGGCGGCG 

AGAGATCTTCGCAGCCGGTCCCTCGAGCCCCAGATCCACGTCCACCACACAGTCCTTGGCATCGCGACCAAATGTTATTTCCTTGGAACGCATTA 

GGTACCGCACATGGCGGCCACACAAGCAGGCCAGXGTCTGGTTGTCGAACTCGGAGGCAGCGGTCGGACTAAGCACGGAATCCACCAGGACGGCC 

CAGCGCGAGAGCTCGTTCTCCAGCAGGCGAATATTACGTTTGTTGCGACGATCTGCCAGAGCTCGCTCCATCTCGAGAGCTTCGTCACGTGGCTC 

GTTCAGGXCGTGCTCGAAGATTTGATCTTCTGCATCCGAGAAGCTGAGTGGCTGCTGATCCGTGCCATATATAGGCTTCACTGACTGATCCGGCA 

ACAGGGTGTACTGCTTGAGAAGAAGCCAATGGTTTTGCAAGGATTTGGCAGTGCGGGCGCAGTAGAAAACGGAGGCATTCTTATCCAGTAGCTCC 

TGGAACTGCTCAAGTTTTGGTTGTTCCGACTGTGAATTGAATTACTGATTAGAAAAAGCTTAAATCATGATGAAAATAGTGTACTTACGCTCTTA 

ATGGTGCCAAGCAGATCCTCCTCCTGCACGCTGTACAAAGCTTTGCGCTGAACGGACTCCACGAGTTCGGGGTGCAGGTTCCTTATGGCGGACAC 

TGCAATCCTGGACACCGCAGGCTCATACAGCAGAGCATACCAGCGCTGCTGTAGTTCCTGCAGAGTAAACTTGCAGGAAAACTTAACGCCGCGAT 

GGATTATCCGCAGATCGTTGGTCTGCTGGATGCCAATTATGAGGGCCAAGTCGTCAATGGGCTTCCAGCGGCCCAAATCCTTGGTGGCCATCTGA 

CCCATGGGACGACCATTTCTACGCTGCGCCTTTTTGCTGGCAGGACGCACTGGGCGCGAACGGCGCTCGCTGGTCGTGGAGCGCTCCATGGCCGG 

TGGTTTGGGATGCGCGACGGCAGTCACAATTGGGGCAACTGGAAGGGGAGTGGCCAAGGGAGCTGGCGTTGTGGGCGTGGCCAGTGACAGCGATG 

CAACAGTGCTAGGTGTCCCCGGCAGCAGGGAATCAACTGAAAGCGGCGTCTGTGGCGTGGAAGTGGGTATGTTAAGAGGCGCTAACGTGGTGTGC 

GGCACGCCCACCAAAGCTGGATAATTCTGGGAGGTGGTGCGAGGCCTGCTGCTGCGATTGGCGTCTGTTCCGCTGCGATTTGTGGGCACCGCAAT 

GTTGTACTCAACTATTTCGTCATCAAAACGCTTCCTTTTAATCGTCCGCGATGAACTGTTTCGATGCAAAAGGAGCTGCCACTTAATCTGTCAAT 

TTTGACATATGTCATCGTCTCACCTTCTTCGCTTTTGATCGTTTTGCTGTTCTATCTGCAGATTTGAAATGGGATTTCGTATCGGCGTTGAAATT 

CCAACGGCACTGCTAGCTGTACTTCCTATTGTGGTTGTTGTTGTTGTGGCTGCTGGTGTTGGCATGGTTGTTGTTGCTGGCGACACGCCTACCTG 

GATTAGTGTGCTCGCAGCTGCTGTGGGAATGGTGGACACTGTGGGGGGATTGGGAGCGGTAACGGACACTGCACTGCTGGCGATCGCGGTTATTC 

TTGATGCCTCCATCGCTCTAGCGGGCTGTTTATGGGCTAAAATGTATTTATTTTCGCGACGCACAGCTCAGCAGTGTGACCGCACCAGCAGGAAA 

AAGTATATCACTTGCAGAGTTGCCACGTCTTCGAAATAAAGTGTCACTGCAAGAAAAACATTTGTTAATAGCTTATATAAATATATATAAAATTA 

ATTTTCTTTCTTAACAGTTCACTTGCTTAATATATAATTAGCGAAAGTGAGAACATTAAGCTTGACTTTACCATTTTAAAAGTTACCATCATAGT 

GTCACTGGCCTGCCAGATCTCCATGGGAAACACATGTTTCGCAACACTGCTACCCACACAAACGCATTCGCGAATATCTGGTAACCTCGCCACTG 

GAGCCCTCTCGAAATTTTGTTTACTGGCATTTAAAATCAGCGAATTCGGGCTAAAAAGTCTTACTTATGCACCCAGCACCGATGCCCGGAGCGAC 

TAGATGAGTGTCTGGATGAGCGGGCCCCGAAATTTCGCACTAGCCACACAAATTATACGGGAAACTATGAGGTAAATGGTTCTTTGCACGAAAAC 

AAAAAGGCGACTGACTAAGATATGACTAATCGATTTGCAGTTTGCACTACGCGGAATCGGAATCTTCGCCGTCAGAGAGCAAAATGGACATCTCG 

GAGACGCCGACGGACTCAACGGGCGTCTCGCCGCTGGCGGATGATGTTTTGCCCACCGCGCCGGAGAAGAATATCCTGAAATTGGAGCAGGCCAA 

GCAGGATCACGATAAGGAGGAACTGGAGTTCAAGACGACCTCCGCGCTGGACAACAGAATACTGCGTCCAGAGCTGCTGCCCATGGAGGCGATGC 

CACAACGACACAACATCATCGAGCGCACCCTGGCCAACAGCCATCCGCTGATGTCGCCAACCAACACGGACTCAGACTCGGAACCCTTTCAGCTG 

AAGAAGCAGGAGAAACAGCTGCCGAGGAGCAACTTTCCGGACGTTGTGCCCACCACGGAGACGGTGCAGGTGAAGAACAACCTGAACCAATGCCG 

GAACAAGCCGCAGGACGCCAAGGTCAGCCTGCTAAGG 

(SEQ ID NO: 55) 

Exon: 3027., 2684 
Exon: 2620. .1894 
Exon: 1834.. 1001 

Start ATG: 2958 {Reverse strand: CAT) 
Transcript No. : CT1834 

CGGTCACACTGCTGAGCTGTGCGTCGCGAAAATAAATACATTTTAGCCCATAAACAGCCCGCTAGAGCGATGGAGGCATCAAGAATAACCGCGAT 
CGCCAGCAGTGCAGTGTCCGTTACCGCTCCCAATCCCCCCACAGTGTCCACCATTCCCACAGCAGCTGCGAGCACACTAATCCAGGTAGGCGTGT 
CGCCAGCAACAACAACCATGCCAACACCAGCAGCCACAACAACAACAACCACAATAGGAAGTACAGCTAGCAGTGCCGTTGGAATTTCAACGCCG 
ATACGAAATCCCATTTCAAATCTGCAGATAGAACAGCAAAACGATCAAAAGCGAAGAAGTTCATCGCGGACGATTAAAAGGAAGCGTTTTGATGA 
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CGAAATAGTTGAGTACAACATTGCGGTGCCCACAAATCGCAGCGGAACAGACGCCAATCGCAGCAGCAGGCCTCGCACCACCTCCCAGAATTATC 
CAGCTTTGGTGGGCGTGCCGCACACCACGTTAGCGCCTCTTAACATACCCACTTCCACGCCACAGACGCCGCTTTCAGTTGATTCCCTGCTGCCG 
GGGACACCTAGCACTGTTGCATCGCTGTCACTGGCCACGCCCACAACGCCAGCTCCCTTGGCCACTCCCCTTCCAGTTGCCCCAATTGTGACTGC 
CGTCGCGCATCCCAAACCACCGGCCATGGAGCGCTCCACGACCAGCGAGCGCCGTTCGCGCCCAGTGCGTCCTGCCAGCAAAAAGGCGCAGCGTA 
GAAATGGTCGTCCCATGGGTCAGATGGCCACCAAGGATTTGGGCCGCTGGAAGCCCATTGACGACTTGGCCCTCATAATTGGCATCCAGCAGACC 
AACGATCTGCGGATAATCCATCGCGGCGTTAAGTTTTCCTGCAAGTTTACTCTGCAGGAACTACAGCAGCGCTGGTATGCTCTGCTGTATGAGCC 
TGCGGTGTCCAGGATTGCAGTGTCCGCCATAAGGAACCTGCACCCCGAACTCGTGGAGTCCGTTCAGCGCAAAGCTTTGTACAGCGTGCAGGAGG 
AGGATCTGCTTGGCACCATTAAGAGCTCGGAACAACCAAAACTTGAGCAGTTCCAGGAGCTACTGGATAAGAATGCCTCCGTTTTCTACTGCGCC 
CGCACTGCCAAATCCTTGCAAAACCATTGGCTTCTTCTCAAGCAGTACACCCTGTTGCCGGATCAGTCAGTGAAGCCTATATATGGCACGGATCA 
GCAGCCACTCAGCTTCTCGGATGCAGAAGATCAAATCTTCGAGCACGACCTGAACGAGCCACGTGACGAAGCTCTCGAGATGGAGGGAGCTCTGG 
CAGATCGTCGCAACAAACGTAATATTCGCCTGCTGGAGAACGAGCTCTCGCGCTGGGCCGTCCTGGTGGATTCCGTGCTTAGTCCGACCGCTGCC 
TCCGAGTTCGACAACCAGACACTGGCCTGCTTGTGTGGCCGCCATGTGCGGTACCTAATGCGTTCCAAGGAAATAACATTTGGTCGCGATGCCAA 
GGACTGTGTGGTGGACGTGGATCTGGGGCTCGAGGGACCGGCTGCGAAGATCTCTCGCCGCCAAGGAACCATTAAGCTGCGCAGCAATGGGGATT 
TCTTCATCGCTAACGAGGGAAAGAGGGCCATCTTTATCGACGGCACTCCTTTGCTATCCGCCAACAAAGCTCGACTCGGTCACAACTGCACAGTG 
GAAATATCCGGTCTGCGCTTCACCTTCCTGGTCAACTACGAGCTGATCAACGCCATCCGCCAGGAGAGCGCCAAGACATCGAATCCCCTCAACTA 
GTTGCGATCAGAGCAGGCTGGCTGCGAGTGTTTAATATATTCTAGTACTTAGTAAACCCCAAAAAAACAAACTTCTGAAATAAAAAGCATATGGA 
CATAT 

(SEQ ID NO: 56) 

Start ATG: 70 (Reverse strand: CAT) 

MEASRITAIASSAVSVTAPNPPTVSTIPTAAASTLIQVGVSPATTTMPTPAATTTTTTIGSTASSAVGISTPIRNPISNLQIEQQNDQKRRSSSR 
TIKRKRFDDEIVEYNIAVPTWRSGTDANRSSRPRTTSQNYPALVGVPHTTLAPLNIPTSTPQTPLSVDSLLPGTPSTVASLSLATPTTPAPLATP 
LPVAPIVTAVAHPKPPAiyiERSTTSERRSRPVRPASKKAQRRNGRPMGQMATKDLGRWKPIDDLALIIGIQQTNDLRIIHRGVKFSCKFTLQELQQ 
RWYALLYEPAVSRIAVSAIRNLHPELVESVQRKALYSVQEEDLLGTIKSSEQPKLEQFQELLDKNASVFYCARTAKSLQNHWLLLKQYTLLPDQS 
VKPIYGTDQQPLSFSDAEDQIFEHDLNEPRDEALEMERALADRRNKRKIRLLENELSRWAVLVDSVLSPTAASEFDNQTLACLCGRHVRYLMRSK 
EITFGRDAKDCVVDVDLGLEGPAAKISRRQGTIKLRSNGDFFIANEGKRAIFIDGTPLLSANKARLGHNCTVEISGLRFTFLVNYELINAIRQES 
AKTSNPLN* 
(SEQ ID NO: 57) 

Classification: hypothetical 



Celera Sequence No. : 142000013384447 

AGGTGGTATTAGGTGTTGCTGCTTTGGAGGCGTCAACTTCACGAGAAGAGCTTCACCGTTCTGGTCGAGATTTTCATAGTCTAGTGCCAGATCCC 
TGTTGATCCTGAACAAAGTGGGCGTCTGCATTGGGATGAGTCTGTCCATTTTGTAGTTCATGAAGCATTCCGCTCGAAAGTGCCGGGCACAAACT 
GTTTTGGCTGCCAGTTTGGACGGCGGTGCATTTGAAACGTCAGGGTTTGAACAATTCCTGACCCAATCTTTCAACCTCTTCGGATCCTTAACGGG 
GAACTTGAAGAAGTGCATATTGGGGTTGCGGCTAGAACCCACCGGACAATCACGAAAACAACACTTAGTGCCGCCCATTTGTTTAAATATAAGCA 
AACAACTTTTATGTTATTAGTGGTGGAAGTGTTAGCGTCAGCTGGTGATATCGATGGGAGGCATCGATAACAGAATTGACCGAAACCAAATGATC 
GATATGACACTTCTTAATTAATGAGAGATTTTTTACTTGACTGGGCTTGTAGCAGGTTTTGCACAGAAGCAATTATTTTCCGGAATGTGAAATGT 
CTGCTTTTTAGCTAATTACAACAAAAACTTTCCAATTTTTGTTCCCCAAACCCACTCAAGTGATTTCAAATTTTACCGTCCGCTTAAAATGGAAC 
TAGTTTATTCCAGAGGAACCAGCTTGCACCACCAAGTCTCTGAAACTCTGGAAATATCGATAGTCTGGTGGAGAAAAGTATTCATAAATATAAAT 
AAAAATTAACAGGTCATAAGCTGATTTGTTTATTATTTACTGTTAAAACAAGTAAAATAATATTGGGAACAATTAAATTTTCCATTTTCCTAATT 
ACAGTATAAGCCTAGTGGGCGTTTTGATATCCAATTGTAATGTTTTAAGCAATCCCAGTGGGCTTTGCTCAATCGTTCGGACCACTTAGACGAAT 
TTCCACCAAACTTAGTTCAGTATAATTTTTGAATTCGCCCGCACAGGTTGCGCACTTTTCGACCGTATCACAACACTGATCTACCCTAGTATTCA 
CAGGAAGTTGCATCCCTGGCATCCAGAAGCCTCTAGAAGTTTCTAGAGACTTCCAGTTCGGGTCGGGTTTTTCTATAAAAGCAGACGCGCGGCGT 
TTGCCGGTTCGAGTCTTGAAAAAAATTTCGTACGGTGTGCGTCGTAACAACAAGCAGCGTCTGAAAAGTTTTGTGAATTTCCAATTCTATACAAA 
GCAAAGTGAAAATATCTGTATTTTTACCTTTATTCTGTGAATAGAACGAAAAACATACATACAAGGTGAGTAATGCAAATTACAAGAAAAGAGTG 
AATAGTTTATCAGTGGCTATGGCCAAAATGTGCATTTTGCGTGGTCCTGTGCATCTCGAATGTTCTTGACCCAAATGTGAGATATTGATTTTAAA 
TTTCTAGGAGCCAAGTTTAAGAATTTTTTTTGATTAATTAAGAGGTGGCAACGTGCAAATTAACTCAAAATGCCGGTTTCTTTTATTTTTTGTCG 
CTTGGACGCATCTTCCAGAGGTTTCTATGCTTTAGCATGAATTAAACATCGTGCCAAATAGGCCTTTTAATTATATATTACTGTCCTTTATTTAC 
AATTACATGTGGTTTATAGAATACAAGATTAATTTTTGTTTAATGCAATGGTCTTTTAGCGCTAAATCGAATTATGCCGCTCTTTTTAGGGGGTG 
ACAATGCGCAAATCACATTTGCCGCTCGAGAATGTTCTAGAGGTTTCTATGCTTTAGCATGAATTAAACAGCGTGCCAAATAGGCTTTTTAATTA 
TAAATTACTGTCCTTCATTTACAATTACACGTGCTTTCTTTGCATTCCCATTTTAAATTGCACATGCCGCATACGCACACATGCACGCCCATGAC 
TAATACTTTCAAGTAAAAATGTGGCGTCAGTAAGCAAATTCTGTTAAATCGGTTTTTTAATCCTATTTGCTCTATTTTAATTGGTTGGTTGCTAC 
TAGCAACTTGCTAGGCGAATTAGTTTTCCTTTGGCTTCTAGATGCTTCCACAAACTTCCTTGGTGAAGTAGGAATTTTCAATGCAATGCTCACTC 
ACACAGACACGAGTTTGCACACAGCAGGGGTAGAAAAATTATCAACCGACCCAATTTGCATAATTATATTTAAATATTTAAATTTAAAACATTAT 
TTTGCAACTTAAAATCAATTCTGTTGGCTAATTGAAATTAAAATTCCATTTTACGGGTTGCAAAGTGAATGCTATAATTTTGACCACCACTGTAC 
TTGTATATGCGCATGTTAAATGAGGCATGTGCAAAAGAGAAAGAAAGAAAAAGAATAAAACCGGAGCAGCTGCTGAAAATGCAGCTGCTTTTCCT 
TAGTGTTGAACCCACAGACTATAACTAATCCTAATGATTTTGTAAATCCATTGCAGATGCCAGAAGAAGCAGAGACCTTTGCATTCCAGGCTGAG 
ATTGCTCAGCTGATGTCCCTGATCATCAACACATTCTACTCGAACAAGGAGATTTTCCTGCGCGAGTTGATCTCGAACGCTTCCGATGCCCTGGA 
CAAGATCCGCTATGAGTCCCTTACTGACCCCAGCAAGCTGGACTCTGGCAAGGAGCTGTACATCAAGCTGATCCCTAACAAGACGGCTGGTACTC 
TGACCATCATTGATACCGGTATCGGTATGACCAAGTCCGACCTGGTCAACAACTTGGGAACCATTGCCAAGTCCGGAACCAAGGCCTTCATGGAG 
GCTCTGCAGGCTGGTGCCGACATTTCCATGATCGGTCAGTTCGGTGTGGGTTTCTACTCCGCCTACCTGGTCGCCGACAAGGTGACTGTCACCTC 
CAAGAACAACGATGACGAGCAGTACGTGTGGGAGTCCTCTGCCGGAGGCTCTTTCACAGTCCGTGCCGACAACTCTGAGCCCCTGGGCCGTGGCA 
CCAAGATCGTGCTGTACATCAAGGAGGACCAGACCGACTATCTGGAGGAGAGCAAGATCAAGGAGATTGTTAACAAGCACTCCCAGTTCATTGGC 
TACCCCATCAAGCTGCTCGTAGAGAAGGAGCGCGAGAAGGAGGTCAGCGACGATGAGGCTGATGATGAGAAGAAGGAAGGTGATGAGAAGAAGGA 
GATGGAGACTGATGAGCCCAAAATCGAGGATGTTGGCGAGGATGAGGATGCCGACAAGAAGGACAAGGATGCCAAGAAGAAGAAGACCATCAAGG 
AGAAGTACACTGAGGATGAGGAGCTGAACAAGACCAAGCCCATCTGGACCCGCAATCCCGATGATATCTCCCAGGAGGAGTACGGCGAGTTCTAC 
AAATCCCTGACCAACGACTGGGAGGATCATCTGGCCGTCAAGCACTTCTCCGTGGAGGGTCAGCTGGAGTTCCGTGCTCTGCTCTTCATTCCCCG 
TCGCACGCCCTTCGATCTCTTCGAGAACCAGAAGAAGCGCAACAACATCAAGCTGTACGTGCGTCGTGTCTTCATCATGGACAACTGCGAGGACC 
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TCATTCCAGAGTACTTGAACTTCATGAAGGGTGTGGTCGACTCCGAGGATCTGCCCCTCAACATCTCACGTGAGATGCTGCAGCAGAACAAGGTC 

CTAAAGGTGATCCGCAAGAACCTGGTCAAGAAGACCATGGAGCTGATTGAGGAGCTCACCGAGGACAAGGAGAACTACAAGAAGTTCTATGACCA 

GTTCAGCAAGAACCTGAAGCTGGGTGTGCACGAGGACAGCAACAACCGTGCCAAGTTGGCCGACTTCCTTCGCTTCCACACCTCTGCCTCCGGCG 

ACGATTTCTGCTCCCTGGCCGACTACGTGTCGCGCATGAAGGATAACCAGAAGCACGTGTACTTCATCACTGGCGAGTCCAAGGACCAGGTCAGC 

AACTCTGCCTTCGTGGAGCGCGTCAAGGCCCGTGGCTTCGAGGTGGTCTACATGACCGAGCCCATCGATGAGTATGTCATCCAGCACTTGAAGGA 

GTACAAGGGCAAGCAGCTGGTCTCTGTCACCAAGGAGGGTCTGGAGCTGCCTGAGGATGAGAGCGAGAAGAAGAAGCGCGAGGAGGACAAGGCCA 

AGTTCGAGAGCCTGTGCAAGCTGATGAAGTCCATCCTGGACAACAAGGTCGAGAAGGTGGTGGTGTCCAACCGCGTGGTGGATTCGCCCTGCTGC 

ATTGTCACTTCGCAGTTCGGCTGGTCCGCTAACATGGAGCGCATCATGAAGGCCCAGGCTCTGCGTGATACCGCCACAATGGGCTACATGGCCGG 

CAAGAAGCAGCTGGAGATCAACCCCGATCACCCAATTGTGGAGACTCTCCGCCAGAAGGCCGATGCCGACAAGAACGATAAGGCCGTCAAGGATC 

TGGTCATCCTGCTGTTCGAGACCTCTCTGCTGTCCTCTGGATTCTCGCTGGACAGCCCCCAGGTGCACGCCAGCCGCATCTACCGCATGATCAAG 

CTGGGCTTGGGAATCGACGAGGACGAGCCTATGACTACCGACGATGCCCAGAGCGCCGGAGATGCCCCCTCGCTGGTTGAGGACACAGAGGACGC 

TTCCCACATGGAGGAGGTCGATTAAGCGACCAGTCGAAACAAACAACCAAAATTCATTCTATCACTCGCATTCACATACACAATTTACTTGCGTT 

TCGAACTTTTATACTGAGTTTACTACGGCCGAGTTAAATTTTGTATTCATTAACATTTTGCCGCGTTATAAGCGACAGACATACGCTTAACTCAT 

AAAAAAGCAGGAATAACTCGTTAAATGGTTAGGTTCTCACAGAACATTCAAGAGCAGTTGTCGTTTTAAGAACTTATAATTTAGAATCCAAGTAA 

TTTATGTAAAAAACTAAAGACTACATACGCGCCCTAGTTGGTAGAGCTATATAAAGAATCGAGTATATATATAATTAAGGTTTGATGACCCGATC 

GATGATAAACATAAAACCAAATAAACAACAAGCAAATGTGTTTTAAAAATCTAACTTCTGAGCGAGTATTTATTGGGGGGAATAAACAATCTATG 

AATCGGATTCTTTGCGCAGCAGCTGCTCAATGGCCTCCACCGTGGACACTCCCTTGGTTATCATTATTATCTTGTTTCGCGATCGAGATCCCTTG 

TCCAAAGAAACGTCGCTCTTTCGAAGACCTAGAACTTTCGACAGAAACTTGACCAGTTCGGCGTTAGCTTCTCCCTCGCTGGGCGGAGCGGCGAT 

TTGGACGCCCACTCCTTCAAAGCCAATTCCTGTGATTCCGTTCTGCTTAGCCCCCGGCTTGGCAAGGATTTGTATGCATATATTGCCGCTCTTAT 

CGACTGAAATCGGACTGGCTTCCTTAGCCGGCATCGCTTTTGCATCGTTTTTCGCACTTTCAACGCCGGCCTTTGATTTCCCCTTGCTCTTCGAC 

ATTATTTGGGCGTTGACCGATGTAAACAAGCGGCGCATTAGCTTCTTGGCAATTATTTTAAATATTTTGTGTTTTTAAATGCATTTTAATAGCAA 

CTAGTGTTTAATTTTGCGAGGTCTACTGGCGTCTATCAGCTGATTCGTGGAGAAAACCAGGGATGCGCACATATAGGGATGAGCATATCTCGTGA 

GTTCTCACTCACGAAACGAAGATAAAACTACATTTTTACATTCCATGTAACTTACAGATTGGACATGAATAGTACGATTCAAAAATGTATTTTTA 

TTTTCCTGTTGAGTATTATTTAACGGTTTTCTAAACACATTCCCCCCGAAGTCACCACAACATGTAAATAAAAGTATTAAAACAAAATATGATTA 

GGTAAAAAATTATCATATATTCATCATTGTTATGTACTCATAGTTTCTTCTCTACTCGTCTTTGCATTAGAAAACAACGCTTTAACACTTTGCAC 

TATTTAAAACTGCGCCGAGAAGCATCTTAAAAATGCATTTATCGCCTATAAATGAT 

(SEQ ID NO: 58) 

Exon: 1001.. 1300 
Exon: 2432.. 4946 
Start ATG: 2432 

Transcript No. : CT2354 

CGCACTTTTCGACCGTATCACAACACTGATCTACCCTAGTATTCACAGGAAGTTGCATCCCTGGCATCCAGAAGCCTCTAGAAGTTTCTAGAGAC 
TTCCAGTTCGGGTCGGGTTTTTCTATAAAAGCAGACGCGCGGCGTTTGCCGGTTCGAGTCTTGAAAAAAATTTCGTACGGTGTGCGTCGTAACAA 
CAAGCAGGGTCTGAAAAGTTTTGTGAATTTCCAATTCTATACAAAGCAAAGTGAAAATATCTGTATTTTTACCTTTATTCTGTGAATAGAACGAA 
AAACATACATACAAGATGCCAGAAGAAGCAGAGACCTTTGCATTCCAGGCTGAGATTGCTCAGCTGATGTCCCTGATCATCAACACATTCTACTC 
GAACAAGGAGATTTTCCTGCGCGAGTTGATCTCGAACGCTTCCGATGCCCTGGACAAGATCCGCTATGAGTCCCTTACTGACCCCAGCAAGCTGG 
ACTCTGGCAAGGAGCTGTACATCAAGCTGATCCCTAACAAGACGGCTGGTACTCTGACCATCATTGATACCGGTATCGGTATGACCAAGTCCGAC 
CTGGTCAACAACTTGGGAACCATTGCCAAGTCCGGAACCAAGGCCTTCATGGAGGCTCTGCAGGCTGGTGCCGACATTTCCATGATCGGTCAGTT 
CGGTGTGGGTTTCTACTCCGCCTACCTGGTCGCCGACAAGGTGACTGTCACCTCCAAGAACAACGATGACGAGCAGTACGTGTGGGAGTCCTCTG 
CCGGAGGCTCTTTCACAGTCCGTGCCGACAACTCTGAGCCCCTGGGCCGTGGCACCAAGATCGTGCTGTACATCAAGGAGGACCAGACCGACTAT 
CTGGAGGAGAGCAAGATCAAGGAGATTGTTAACAAGCACTCCCAGTTCATTGGCTACCCCATCAAGCTGCTCGTAGAGAAGGAGCGCGAGAAGGA 
GGTCAGCGACGATGAGGCTGATGATGAGAAGAAGGAAGGTGATGAGAAGAAGGAGATGGAGACTGATGAGCCCAAAATCGAGGATGTTGGCGAGG 
ATGAGGATGCCGACAAGAAGGACAAGGATGCCZ\AGAAGAAGAAGACCATCAAGGAGAAGTACACTGAGGATGAGGAGCTGAACAAGACCAAGCCC 
ATCTGGACCCGCAATCCCGATGATATCTCCCAGGAGGAGTACGGCGAGTTCTACAAATCCCTGACCAACGACTGGGAGGATCATCTGGCCGTCAA 
GCACTTCTCCGTGGAGGGTCAGCTGGAGTTCCGTGCTCTGCTCTTCATTCCCCGTCGCACGCCCTTCGATCTCTTCGAGAACCAGAAGAAGCGCA 
ACAACATCAAGCTGTACGTGCGTCGTGTCTTCATCATGGACAACTGCGAGGACCTCATTCCAGAGTACTTGAACTTCATGAAGGGTGTGGTCGAC 
TCCGAGGATCTGCCCCTCAACATCTCACGTGAGATGCTGCAGCAGAACAAGGTCCTAAAGGTGATCCGCAAGAACCTGGTCAAGAAGACCATGGA 
GCTGATTGAGGAGCTCACCGAGGACAAGGAGAACTACAAGAAGTTCTATGACCAGTTCAGCAAGAACCTGAAGCTGGGTGTGCACGAGGACAGCA 
ACAACCGTGCCAAGTTGGCCGACTTCCTTCGCTTCCACACCTCTGCCTCCGGCGACGATTTCTGCTCCCTGGCCGACTACGTGTCGCGCATGAAG 
GATAACCAGAAGCACGTGTACTTCATCACTGGCGAGTCCAAGGACCAGGTCAGCAACTCTGCCTTCGTGGAGCGCGTCAAGGCCCGTGGCTTCGA 
GGTGGTCTACATGACCGAGCCCATCGATGAGTATGTCATCCAGCACTTGAAGGAGTACAAGGGCAAGCAGCTGGTCTCTGTCACCAAGGAGGGTC 
TGGAGCTGCCTGAGGATGAGAGCGAGAAGAAGAAGCGCGAGGAGGACAAGGCCAAGTTCGAGAGCCTGTGCAAGCTGATGAAGTCCATCCTGGAC 
AACAAGGTCGAGAAGGTGGTGGTGTCCAACCGCCTGGTGGATTCGCCCTGCTGCATTGTCACTTCGCAGTTCGGCTGGTCCGCTAACATGGAGCG 
CATCATGAAGGCCCAGGCTCTGCGTGATACCGCCACAATGGGCTACATGGCCGGCAAGAAGCAGCTGGAGATCAACCCCGATCACCCAATTGTGG 
AGACTCTCCGCCAGAAGGCCGATGCCGACAAGAACGATAAGGCCGTCAAGGATCTGGTCATCCTGCTGTTCGAGACCTCTCTGCTGTCCTCTGGA 
TTCTCGCTGGACAGCCCCCAGGTGCACGCCAGCCGCATCTACCGCATGATCAAGCTGGGCTTGGGAATCGACGAGGACGAGCCTATGACTACCGA 
CGATGCCCAGAGCGCCGGAGATGCCCCCTCGCTGGTTGAGGACACAGAGGACGCTTCCCACATGGAGGAGGTCGATTAAGCGACCAGTCGAAACA 
AACAACCAAAATTCATTCTATCACTCGCATTCACATACACAATTTACTTGCGTTTCGAACTTTTATACTGAGTTTACTACGGCCGAGTTAAATTT 
TGTATTCATTAACATTTTGCCGCGTTATAAGCGACAGACATACGCTTAACTCATAAAAAAGCAGGAATAACTCGTTAAATGGTTAGGTTCTCACA 
GAACATTCAAGAGCAGTTGTCGTTTTAAGAACTTATAATTTAGAATCCAAGTAATTTATGTAAAAAACTAAAGACTACATACGCGCCCTAGTTGG 
TAGAGCTATATAAAGAATCGAGTATATATATAATTAAGGTTTGATGACCCGATCGATGAT 
(SEQ ID NO: 59) 

Start ATG: 301 

MPEEAETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNASDALDKIRYESLTDPSKLDSGKELYIKLIPNKTAGTLTIIDTGIGMTKSDLVNNL 
GT I AKS GTK AFMEALQAG ADI SMI GQFGVGFY S AYL VADKVTVT SKNNDDEQYVWES S AGGS FTVRADNS E PLGRGTKIVLY I KEDQTDYLEES K 
IKEIVNKHSQFIGYPIKLLVEKEREKEVSDDEADDEKKEGDEKKEMETDEPKIEDVGEDEDADKKDKDAKKKKTIKEKYTEDEELWKTKPIWTRN 
PDDISQEEYGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFIPRRTPFDLFENQKKRNNIKLYVRRVFIMDNCEDLIPEYLNFMKGVVDSEDLP 
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LNISREMLQQNKVLKVIRKNLVKKTMELIEELTEDKENYKKFYDQFSKNLKLGVHEDSNNRAKLADFLRFHTSASGDDFCSLADYVSEIMKDNQKH 

VYFITGESKDQVSNSAFVERVKARGFEVVYMTEPIDEYVIQHLKEYKGKQLVSVTKEGLELPEDESEKKKREEDKAKFESLCKLMKSILDNKVEK 

VVVSNRLVDSPCCIVTSQFGWSANMERIMKAQALRDTATMGYMAGKKQLEINPDHPIVETLRQKADADKNDKAVKDLVILLFETSLLSSGFSLDS 

PQVHASRIYRMIKLGLGIDEDEPMTTDDAQSAGDAPSLVEDTEDASHMEEVD* 

(SEQ ID NO: 60) 

Name: Heat shock protein 83 
Classification: chaperone 
Gene Symbol: Hsp83 
FlyBase ID: FBgn00G1233 



Celera Sequence No. : 142000013384244 

CACGCTCTCGGACACTAGATAATTGCCAGCCCCAATATCGGTGATAGGCGTCGACTTTTGCCCCCACCCACCCCCGTCTCATATATACGTATATA 
ACTATCGATGACGTCATGTGGGAAGTGCAGGTGCATTTGAAATGCAAATTGCTTTTGCTGCAAACCATCTGTCCATATACCGCTTATATAAGGCG 
GGCACAGATCGCTCAGACTCAGTTCTGCATAATTTGAACGGAGTTTTTTAGAACACGTACATATAGTTCGTAGATTTGGAATTTTTATAGAAATG 
ACGAATCGAAACTAAGCGAATCCATTTTGGCAATCCATGGTCTTGGACCCGAAATGCTTTTAGCACTCCAGTGCGATGTTGTTGTTTGCATTGCT 
CGTGGCCCAATTAATTTATATCCAAGTGCGTATTTCGATTTAGTACGGCGCATTGAAAGTCTCCGGTTGAAAAAGCCCAAGGTTATCACACATCC 
CGACCAGAAATCGTACTCACAGCCAGTCTCCTCCTGGATTTGCGCGATCTCCTCGTTCCGCAGGAAAAGCGACGACTTATTGCCCATGGTGACGG 
TGTGTTGTGCGCCGATCGGTTTGTTTAAACATTCAACAATCGCAGACGAGCATTGAATGGCATTCGATTTGGATTGGACCACAAGTCCGATTTGT 
GTACAGGCTGAATGCAGGCGGAGTGACGGGAAAATGCCGAAAATCGGGTGCAAAAAAATACTGAACAGCCAGCGATGGTTGCACTAGAGACGGAC 
TTTGTCAAGTGGTGGATGGTTTCCCTATAAATATCGATTCACATATTTTTTTGTATCTATTTAAAGTGCACATTTTTTAAGGGGTCATTTGTAAG 
CATTTCCTTACAAACAACGACCAAAAATTTGATATATTAATTAATTTCTTAGAAATATCCAATCGTTTATGCAAAGCCTACAGTAATTTCGTCTC 
ATCACTAGCTCGTAACTGGCAATCTCCCATCTCTATTGGCATTCGCAAAGAAAACAAAGTAAACATTTTTTTTTGCCGGAACGAGAATTACCGTT 
ATGGATAAGTTAAATTCCACTCTAACTGCCGTCAAAAACCTGCGGTCCAACGTAAGGCTGTGCTTCGAGCACCTGGCCGATGGAACCGATGGGGA 
GAGCGGCGAGGAGAGCCGGAATAAGTTCGTAAATGACTTCCAGGAGAGATTCGCAGCCATCAACTCGCAAATACGGTTAGTGACCAGTTAACCTA 
TTCCAGTGAAGAATACATTTGAATTAAACCACTTTTAGTGAGGTAGAGCAGCTGATTAATGGACTGCCGGTGCCACCAACACCCTATTCTCTCGG 
GAACACCGCCTACCTGGCGCAGGAAACTTCGCAGGATCGACAGGCTCTATACCCTCAGCTGGTGAACAGCTACAAGTGGATGGACAAGGTGCACG 
ACCACAGCTTCTTGGCCTTCAACAATCTCAACCAGAACACACTCAGGCGCTCCTACAATTACTGCTCGCAGAAGCGCCAGCGACTGCCCTTCTCG 
TCCTTCAACAACGATCCAGAGTAAGTACGAAGTGCGCAGCTTCACTTTAATATTAACATCTTTCTTCCTCTGATTCTTAGCCACATAGATAAGCT 
CCTGAGTGAGATCAACACCCCTCCGCATACTTCGTACAGAATTTTTCGTCCCTTCGGCACAAACGCTGTCACCATCGTAACAATAAGTAACGTGA 
TGAAGGCGGCTATTGTCTTCAAAGGAGTGCTTATAGAGTGGGTTACGGTTAAGGGATTCGATGAGCCGCTGGAGCATGACGACCTGTGGGCCGAG 
TCACGTTACGAGGTGTTCCGCAAAGTCCAGGATCATGCTCACTCAGCAATGCTGCATTTCTTTTCGCCGACGTTGCCCGATCTGGCAGTGAAAAG 
CTATATAACCTGGCTGAACAGCCACGTCAAACTTTTCCTGGAGCCCTGCAAGCGTTGTGGGAAATTCGTGGTGAACGGATTGCCTCCGACGTGGC 
GGGACCTGCGCACTCTAGAACCCTTCCACGAGGACTGCCGAAATTGCTAAGGATTTTTAAGTAATTTAACGCCGATAGTTGTAACTTGTATACTT 
CATTTATTTCAAAAACAAAAATTATATGTACTTTGAATGCATGCGGGCGGATTTTTTAAAATCGCTGCGCCCGACTGAATAACTACAACGGTTTA 
ATGCGTTTCATTGGCCAACCAGTGCAGAAGTCGTCCGCTTCGATTTCTGCACCTGCGTCTGAACGCCGTGCTGCACGCACAGTCGCTCGAAGTTG 
GCGCGCAGTTTTGCGCGGAGCTCCTCTCGCTTTCGCTGCACTTGCTCCCGGTGGCACTTGAAGGCCATAACCACCTCCTGGGGAATGTAATATGT 
CTCCTCGTCTTTGTCGTCGTATTCGTCGAAGTCGTCCTGCAATTAATAAAGTTTTATTACTTTAATTTCGATATGGCAAATCATTTGCATAGCCC 
ACTCTGTCTAGAATATTTTCTGGTAATTAATTCCGACCCTAAGCTCTTGATGATAAGAGTCGACAGGGCTTGCCGCAAGTCTTAGATTATGTTTA 
AAAAACAATAATTACAATTTTCTTTTTCTATTTTTCCACTTCCTATTTTTATAGAGACAATCTTGCGACTCACCCACATCTGCAACAATGAAAAC 
TTGTGATCCAACTGCGACTGGTGACTGTAGAAGAGCAAGTCGTCTGAGTCCTCATCGTGGACGTCGTGATTACACAGTCCATCCGAGCAAGACAC 
TTCCGAGCCTTCTGGTGACGAAATCGCGCTAGATGTCCGAGAGCCGGCATGCGAAGGTGGATCGCTGCCGTATCCAGAATCGCGCACATCCTGCT 
CGCACGGGCAGTTAGGCATGGACGCCAGGCACTCCTCGCACTTATCGCCAACCGTTTTGGCGATGCTTGACGAAATGCAACTGGCCACGTCAAGT 
TGATCGTGATCAGCATCCAGCGGCTGGACCCTAGCCGTGGCTTGCATCTGCTTTTGGTTGCCCATCTTTGTGAAGGCCGCTTTCTTCTGCTTCTT 
TTTCT 

(SEQ ID NO: 61) 

Exon: 1001. .1215 
Exon: 1274. .1540 
Exon; 1601.. 2045 
Start ATG: 1046 

Transcrxpt No. : CT2432 

AAAACAAAGTAAACATTTTTTTTTGCCGGAACGAGAATTACCGTTATGGATAAGTTAAATTCCACTCTAACTGCCGTCAAAAACCTGCGGTCCAA 
CGTAAGGCTGTGCTTCGAGCACCTGGCCGATGGAACCGATGGGGAGAGCGGCGAGGAGAGCCGGAATAAGTTCGTAAATGACTTCCAGGAGAGAT 
TCGCAGCCATCAACTCGCAAATACGTGAGGTAGAGCAGCTGATTAATGGACTGCCGGTGCCACCAACACCCTATTCTCTCGGGAACACCGCCTAC 
CTGGCGCAGGAAACTTCGCAGGATCGACAGGCTCTATACCCTCAGCTGGTGAACAGCTACAAGTGGATGGACAAGGTGCACGACCACAGCTTCTT 
GGCCTTCAACAATCTCAACCAGAACACACTCAGGCGCTCCTACAATTACTGCTCGCAGAAGCGCCAGCGACTGCCCTTCTCGTCCTTCAACAACG 
ATCCAGACCACATAGATAAGCTCCTGAGTGAGATCAACACCCCTCCGCATACTTCGTACAGAATTTTTCGTCCCTTCGGCACAAACGCTGTCACC 
ATCGTAACAATAAGTAACGTGATGAAGGCGGCTATTGTCTTCAAAGGAGTGCTTATAGAGTGGGTTACGGTTAAGGGATTCGATGAGCCGCTGGA 
GCATGACGACCTGTGGGCCGAGTCACGTTACGAGGTGTTCCGCAAAGTCCAGGATCATGCTCACTCAGCAATGCTGCATTTCTTTTCGCCGACGT 
TGCCCGATCTGGCAGTGAAAAGCTATATAACCTGGCTGAACAGCCACGTCAAACTTTTCCTGGAGCCCTGCAAGCGTTGTGGGAAATTCGTGGTG 
AACGGATTGCCTCCGACGTGGCGGGACCTGCGCACTCTAGAACCCTTCCACGAGGACTGCCGAAATTGCTAA 
(SEQ ID NO: 62} 

Start ATG: 4 6 

MDKLNSTLTAVKNLRSNVRLCFEHLADGTDGESGEESRNKFVNDFQERFAAINSQIREVEQLINGLPVPPTPYSLGNTAYLAQETSQDRQALYPQ 
LVNSYKWMDKVHDHSFLAFNNLNQNTLRRSYNYCSQKRQRLPFSSFNNDPDHIDKLLSEINTPPHTSYRIFRPFGTNAVTIVTISNVMKAAIVFK 
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GVLIEWVTVKGFDEPLEHDDLWAESRYEVFRKVQDHAHSAMLHFFS PTLPDLAVKSYITWLNSHVKLFLEPCKRCGKFVVNGLPPTWRDLRTLEP 

FHEDCRNC* 

(SEQ ID NO: 63) 

Name: transcriptional co-activator 
Classification : transcription_f actor__binding 



Celera Sequence No. : 142000013385192 

TGCTTCGACGACCTCCTGGTTGACTACCAGACCAGGTACAAAATCGAAAAGATCAAGGTAATGGGTTGGACCTATATGGTGGCCTGTGGCCTCGA 
AACGGACCACTACACGGACTTCTCCATCGACATTCCGGTCAAGCAACCGGAAGCTGATTCCGAAATTAGACGAGGATCCAGCGGTGACTATTTCC 
CACGACTAGATTAGAGTACAATTGTAAAAGGTACATCTATTCAGTTCTGACTGTCCATTTTGGAAGTACCGAGGACGATGAGATGAGCGGCGATA 
ATGTCAGCCAGCCCTACGCCCAAGTGCAAGATGTCGCCGTCCTGGTGATGACCGAGTTTGCCCTGGACTTACTTCGCATCATGCACGACATTCGG 
TACAACTACGTGTTCTCGGAGTACGATACATTCTTAACGGGGAGCTTGAAGATAGGTGGGCTGCATTACCTTCACCGTTATCCTAATCCATCCAA 
TGATATCTAAGGCATCTCCCATGGATCCGTGATGGCCGGCGTTGTTGGACTCTCGAAACCACACTATGATATCTGGGGTCATACCGTCAATATGG 
CCTCCCGGATGTCCTCCACTGGATTGCTGGACAACATCCAAGTAACTCGGCACACGGCAAAGGTGCTGCGACAATTTAACATCCGCTGCAACTAC 
CGAGGGCACACGGAGGTAAAGGGAGTGGGCAAGGTGCCCACCTACCTGGTGGTCGTAGACCCGGATCTCACATTCCAAGATCACGACCAAGTGAG 
CATGAGTATGAAGGACTCAAAAAGCTGGGTCATCGACACTCTTTCACTGTCATTCACCCCGAGCTTCGTGCCCCCAAAGTCTTCCACTTCGGATG 
AAGAGGAGGCTGAAGGAGAGCTCGAGGAAAATGTATCTGAAGGCGATGAACTGGAGGTCTCGATACATGAGCGCCAAAAGGGAGGCATCTTTCTG 
GCCGGGAACCTTCATTAATTTAAATTTATTTTAATAGATATATATATATATATATATTTATTTAAGCATATATAACACACATATATAGATGTATT 
CGTACAGTTCAAAGCAAATGTTTTTCTTGAGGTTAAATACGTTATGATTTGAGCACTCGCTGAAATCATTGCACAACTGGTGACCAAAGTATACG 
TATGTTCATAATTCAGAGTCATACATAGTCGTTAGGTTATGTACATGCTACAGATAATACACTCATCTTGACGAATCTCCATCTCCATCACTAAG 
AAAGCACCATCCGTGTGATCAGGTTCGGTTTCGTTTGGTTTTGTGCACAGTTTTTAGCTTAAGTAAGCATTGCAGTTGATTTGGGGCATGGGCAT 
AGCCCTTACTCGTTGACAGAACTGGAGTTCAGCAGAGCCGTGTCCTCGGGAATGGGCTTCCGCTTGTAGGAGGGCTCAGTGGCTAGGTAGACAAC 
CAGCGCCCCGAAGACCACGTACATTACTCCAATCACATTGGCCAACACACCAGCCGGTGGAAGAGTGGAGTAAGCCGGAGTTTTGCTAGAAATGT 
CGTCATGGATTAGTCAAGTGATCGATCTAGAATGCATATACACCTACATGGCGAAGATGGCCTTCTCGGTTATGCCCATCAGGGCACTAGCAATA 
GCCAGAACGAATCCAAAGAGTCCGAAGTAGATGTGCAGCGGCATCATGGCGATCCTGTAGTTCTCCCTCAACCCCGGCGCCAAAAAGGCCACAAA 
TCCGGCCACATACTGCAGCGAGAAGACGATCACCGCGGAGAGCCCCAGCCAAGAGTGCAGGGAGTACATGTTGGGAATGGGCGGATTGGCCAAGT 
TGTGGGAATCGAAGACCGTCTTCAGGGCAATCACCGTCAGAATGAATGCACCCATGTGGATGCCGGCGTGGGTAAGCTTCAGAGTCTTCTTGCGC 
GTGGTGCGGAAACCACGATAGATCAGAATGGCTGTGGGATTGAATTGAGTTGGATTACCATTTCGAAGCTAGTCAATGGGAGTGGCGTGGCACCT 
CTTCCGCTGCATTTACTCACAGTTGCCATACAGGTAAATGAAGCCAATGGTCATGAACAGCGGATGCCAGTTGAACTCCACGCCGGGATTTGAGG 
TGCCGGCCAGACCGCCGAAGTGCTGGCCAATCCAGGTGGCCACCAGGACGA7CATGGTCAGGCCGCACAGCTGGGTGAGCACATAGAGCACCTTA 
AAGTTGATCAGCGCTGGATCCATGTTGGCCTTGCCGCTGCCTGAAAAATCCAATAAAAGCTGGGGATGAGCTAAAAATAGAGGCCATTTTACGCA 
CAGCACGCATTGGCGATCTAATCGCATAACTCATTCAAGGGTAGAGATATCGGGGCATCGAAAGGGGAAGGGGATCTCAAGCACTCGTTTATATT 
TTCCATACTTTTGGTTTATAGCCTCTGCTACGTAAACATGATTCAATTACATGTTCCACCTGCCCGCAGATTAAAATAACCCCCAAGCACTTGCT 
GATAACATGCGACCAGAAGTTTAGGGAGCCAATAGGCTTCCCAAAAACCCACTTTAACAGTTGTACATCTGATCCGCTTCGATGGCAGCTGCACT 
CGTTGGGTAAACAAACAGTTCCTTTGCCGCCAGCCATCCGATACTGCGATAAACAAGATCTCATCATATCGACTGGGCCCTGAGTTAATGGGAAC 
GGCCTGTTCCCAGGCACTGGGACACACCCACGACCCCTTTCGCCGGCCACTCACCAGGCGTAATTTGGCTZ\ATTCATGCGTAAACATTTGCGCTT 
GTTTACCCAACATTAATTTTCAACGCCAGAGCGTTCTGCCCACTTTATCTGTCGCCTTATCAGTATGTTCTCCAGATAGTCCGAACATCTGATAT 
TAAATCGCTTTAAATAGCATCGTACAAGTTCCTCGTGCTTGATGTTGCAGCATGATACATTAAATAAATTAAAGTTCGATTCATTTTACGGAACG 
TACGGGTAGTGAAAGTAGCCGTGTTCATAGTTTGGTATAGTATAGATTGGACTTGACTAAGATAGTTTAAAAGGCTAGTAAACTTGTAGGTGCAG 
ATGCACTCGATAAGGGCCGCTATCAGCCGCCCTAATTGGCGTTTGGTATGCAAATGGTGTTTTAATTTCGACTCGGGCGAATAATZVAGCAAATGA 
GCCAAACTGCCAAGTGTTGGCTTGTAAATTGGCTGATAAGTGCGCCGGAGACACTTGTAGCTGGCAATCGCAATTATAAATGCTGGTAACTGGCA 
GCAGCCACGCGTCCCCGGTTACTCAATGGGCTCCAGTTGGAGTGCTCCATTTCGACGCAGCTCTGGGTTCCGAGTGAAATTAATTGGCACATTAG 
CTGTGACTATGACTGTTAATTATTAGGGTTAAGTGGGCGTGCCAAAGACGCTGACCAGCAAAGCGTTGCAGCAGATCGTGCGCAAAATTTGAAAT 
AAACAATCCACATAGGCGCACCAAAAGACCGAAAGTAAAACCCGGGCAGATGACTTTGGATGAATTTGGAAAAGCGTACAAGTCCGAAGACAGCG 
GAGCAAAATAAATAAAGTCTGGAAAACCAAAGTGAAATTCAGCTGGTTACTGGTTACTGGTTACTAGTGGCTGCTTGCTGATCGCGGATCGTGAT 
GCCGATGGTGATGGTGAAAACCGCGGCGGGGAAAGGGGACGGCTGCAGTCTGTCTTTCGGTGCGTTTTGCTTTAACTTTTACTATAACTAAACAC 
TCGGCTTGTTGGTTTATTTTACTTGGCACTAATCAAACTTGCAGCCAAAATCCAATGCCATGCACAGTCTACCTGTTGGTGGTGAAGTTGCCGCC 
GCCGTTGGAGTGGCCAACTCAATGCCAGCGATGGTTGTTGCAGGTTGCACTTGTTCTGCTGTCGCAGGTGTTGCAGTTGTTGCTGGTGTCGCTGC 
CGCTGGCTTTTCCCCATTAGTCACCGCGATCACTGTGGTCGCCGTGGAGATGGTGGTCGGAACGGCGGCCTGTGGCTGCTCCGGAATCGCTGGCT 
CCACCTGGACAACTCCGCTCTCCGGCTTGGTCGCCTGCTCCACGTCCGCGTCCTTCGACATGGTCTGTGCACTTTAAATCAATAATTCACCAATG 
GTTCAACCACAAAAAGACATGCAACGTATAATGGACGGTGTTAATTGCCCTAAGACACAGCACAAAAACACTGGTAAATGCTAGTAACTAACTAC 
TAGTAACTAATCCGCACGACACCCCGTTAATACAACCGAACACTTCGGCGTGCAATGCGCAACTGAATTTAAAGTTTTCCAGCCGCCGCCCGGCG 
ATGGTTAGGAAAAGCTTTTTCCGCTCGCAAATCTCTCGCATGCGACATCGCACACACATATTGCTCATTTCGGTGTGGATAATTGTTTGGTTCTC 
GATGCGGATAGGATTCCAGTGGGGTTGGGTTTGCTTTTGACTTTGCCTTTGGCTTTGGGTGCCAACGAATGCTGGTTAGTATGCCGGACCAGGTC 
GATCGCTTCTGGTTGGGAAGGGGGTGCATTGATGATTATTTGCATTTCGCATTCCTCGGATCTATTCAAACCATTGTCACTGCGCGGCATTTCAA 
TTGCGACTTTGACTTTGATGTTCTGTGCATAGTAATGGGATGGTGCAGTGGAACTGCTTCAATTGGAACTACTTTTTGCCAGAAGGAGTGGAATC 
CAAAACGTAAAAGGGTTTACAAAAAGTTTTTGGGTAAAATTTGTTAGTAACAACTATTTCAATATTGTTCTTAGTTTCGAAAGTACATCTACAGT 
AAAGTGAAACCTTTAAGGACAACAAGTAGCTTTCCCTCTTAAAGAGCTTAAGTTCATGCTGTGCAATAATGACTGTATGTGCACTTCAACTCACC 
CAGTGAGAACAACATGGCGTATACTTGATTGATGCTTCACTCTGCTCAGATAATGCCTGTTTCAATACCGCCTGCTCTTATCTTTAGCAGCAATT 
AGTGACTTTAAAGCGATGACGATGGGTTTTTCGCGATTACTTCTGATTTTCGTGGCTACTGCCACGCACTTTACTCTACTTGTTGAACAAACTGC 
TTAAAATGCAGGAAGCTGGGCCTTCCATATGGAGGAAAAGCCGTGGAAATTCTGCACAGGCTGCGTGGGCGTGAAAGCCCTAAAGCCCCGCCCAC 
TCTGCCGCACTTGGCCATTTCATTTGGCATTTCCACGAGCTTTTCGTTGGAAGCTCAGAACTGTAAACGCCGAAGCACACATGCAAACAAACTTT 
CGGGCTCACACCACTGTAGAAAAACCACTTGTTGGATATTATATTAATGGAAAGCCACAACCAAATTATAACTGTTTGTAAACTACATTTAAAGA 
CGTAGTTGAAATAGAAACAGGTAAACTTACAGATTTGAAATGAATCAGTAATCCAATAATGTGTTTTGTTTGGAATATTTCCAAAATGTCTTCAA 
CGGAAGAGGCAAGAACACAAACAGAAACGGCACAAACACAAAGAGATAATTTGGCAGCATAAGAGAGCGGAATGCTGTGCGAATTTACTACGTCA 
GCATTATTATTAACAAGTTAAACTTAAAGCGGCACATTCATTGATACTTAATTTAGTACAGCAAGTCAAGGACACGGTCAAACTTTCACTATCAA 
AGGGTTTTAAACTGTAACTCGATCTGTTAACCGCTTCGATTTTCTGTTTACTCATTTGGAAAAAAGCGGGACAACCAGGGATGCGAGAATCATAC 
TGTAGCTGCGTGTGCTGAAAAAATAGTGTGCATT7AATATACGTTCACATGATGATAGAAAGTACTGACTGTGTGAGTTTTAATTTTAAAAATTT 
ACAAACTTTATATTCATATTTAACTAGAGCTTGGAAAACATTTACTTCGAGACCCTGGATTCCCTTATGCCGCCAGTGTGGCCCATTGAGCACGA 
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CTGAAACACGCTAACGCCTTTAGCTCGGCAGCACTGGCAGCTCCAACAAAAAGCTTCAATTTAAAATAATATTTAGGTAGTCCCACAATAAATGC 

TAACAAAATGCTCTCCAACCTGGTGGTGACCAAGCAGAAAGTGGTCCTCGTAATAGGTGAGTACACACTGACTACCCCATCGTCTGCCAAATAAT 

GCGACCACCAATCTCCCGCCTCATCCACAGATGAGCTAAATCGGGAGCGCATTGCCCCCAAGTTCATCGGCAGTCTGCTGCACGAGCAGGGACAG 

GGGGCGGATACCATCAAGGCCCTGCCCACCGGCGTCAGCCTTAAGCACGTGGCCACCTTCGAGGCACTGATCGACAAGTACGCCAACAACAACAC 

TGGGAGTACTACGGATTCCAACTCCACTGGCTTTAACGTCATCCTGCCAACATTGGCAGACCTGCTGTGCTACCAGACGCCTGCTTTCATATTTG 

GCTTCCTCAACCGCCTACGCCGCTCGGACAATGTGCGTCGGGTTTTCCTTTGGGCCTCCCCGCAGCACCTGCAGGATCCGCATGCCGACTACATC 

CTGGCTGGCTGCGAGTACCTGGCCGAGCTGGTCCTCCGACTGGAATCGGACAAACTACTCTCGCTGATATCACGCAAGCCGGGCGGAGGGGTGTC 

CAACAGGCGCTACTCCTGCGAAGTCAGCAAAACCCAGTTCAAGGTGACGCCGCTGGACGGAGGACTGCCTGCAGGTGCCTCCCCCAAGCAGC 

(SEQ ID NO: 64) 

Exon: 5647.. 5351 
Exon: 4061.. 3781 
Exon: 2225.. 2016 
Exon: 1931.. 1568 
Exon: 1510.. 1001 

Start ATG: 4051 {Reverse strand: CAT) 
Transcript No. : CT2537 

AAATCGAAGCGGTTAACAGATCGAGTTACAGTTTAAAACCCTTTGATAGTGAAAGTTTGACCGTGTCCTTGACTTGCTGTACTAAATTAAGTATC 
AATGAATGTGCCGCTTTAAGTTTAACTTGTTAATAATAATGCTGACGTAGTAAATTCGCACAGCATTCCGCTCTCTTATGCTGCCAAATTATCTC 
TTTGTGTTTGTGCCGTTTCTGTTTGTGTTCTTGCCTCTTCCGTTGAAGACATTTTGGAAATATTCCAAACAAAACACATTATTGGATTACTGATT 
CATTTCAAATCTTGCACAGACCATGTCGAAGGACGCGGACGTGGAGCAGGCGACCAAGCCGGAGAGCGGAGTTGTCCAGGTGGAGCCAGCGATTC 
CGGAGCAGCCACAGGCCGCCGTTCCGACCACCATCTCCACGGCGACCACAGTGATCGCGGTGACTAATGGGGAAAAGCCAGCGGCAGCGACACCA 
GC AAC AAC TGCAACACCT GCG AC AGC AG AAC AAGT GC AACCT GC AAC AAC CAT C GC T GG C AT T G AGTTGGCC AC T C C AAC GGCGGCGGC AAC T TC 
ACCACCAAGCAGCGGCAAGGCCAACATGGATCCAGCGCTGATCAACTTTAAGGTGCTCTATGTGCTCACCCAGCTGTGCGGCCTGACCATGATCG 
TCCTGGTGGCCACCTGGATTGGCCAGCACTTCGGCGGTCTGGCCGGCACCTCAAATCCCGGCGTGGAGTTCAACTGGCATCCGCTGTTCATGACC 
ATTGGCTTCATTTACCTGTATGGCAACTCCATTCTGATCTATCGTGGTTTCCGCACCACGCGCAAGAAGACTCTGAAGCTTACCCACGCCGGCAT 
CCACATGGGTGCATTCATTCTGACGGTGATTGCCCTGAAGACGGTCTTCGATTCCCACAACTTGGCCAATCCGCCCATTCCCAACATGTACTCCC 
TGCACTCTTGGCTGGGGCTCTCCGCGGTGATCGTCTTCTCGCTGCAGTATGTGGCCGGATTTGTGGCCTTTTTGGCGCCGGGGTTGAGGGAGAAC 
TACAGGATCGCCATGATGCCGCTGCACATCTACTTCGGACTCTTTGGATTCGTTCTGGCTATTGCTAGTGCCCTGATGGGCATAACCGAGAAGGC 
CATCTTCGCCATCAAAACTCCGGCTTACTCCACTCTTCCACCGGCTGGTGTGTTGGCCAATGTGATTGGAGTAATGTACGTGGTCTTCGGGGCGC 
TGGTTGTCTACCTAGCCACTGAGCCCTCCTACAAGCGGAAGCCCATTCCCGAGGACACGGCTCTGCTGAACTCCAGTTCTGTCAACGAGTAAGGG 
CTATGCCCATGCCCCAAATCAACTGCAATGCTTACTTAAGCTAAAAACTGTGCACAAAACCAAACGAAACCGAACCTGATCACACGGATGGTGCT 
TTCTTAGTGATGGAGATGGAGATTCGTCAAGATGAGTGTATTATCTGTAGCATGTACATAACCTAACGACTATGTATGACTCTGAATTATGAACA 
TACGTATACTTTGGTCACCAGTTGTGCAATGATTTCAGCGAGTGCTCAAATCATAACGTATTTAACCTCAAGAAAAACATTTGCTTTGAACTGTA 
CGAATACATCTATATATGTGTGTTATATATGCTTAAATAAATATATA 
{SEQ ID NO: 65) 

Start ATG: 308 (Reverse strand: CAT) 

MSKDADVEQATKPESGVVQVEPAIPEQPQAAVPTTISTATTVIAVTNGEKPAAATPATTATPATAEQVQPATTIAGIELATPTAAATSPPSSGKA 
NMDPALINFKVLYVLTQLCGLTMIVLVATWIGQHFGGLAGTSNPGVEFNWHPLFMTIGFiyLYGNSILIYRGFRTTRKKTLKLTHAGIHMGAFIL 
TVIALKTVFDSHNLAKPPIPNMYSLHSWLGLSAVIVFSLQYVAGFVAFLAPGLRENYRIAMMPLHIYFGLFGFVLAIASALMGITEKAIFAIKTP 
AYSTLPPAGVLANVIGVMYVVFGALVVYLATEPSYKRKPIPEDTALLNSSSVNE* 
(SEQ ID NO: 66) 

Name: cytochrome B561-like 
Classification: electron transfer 



Celera Sequence No. : 142000013384816 

AGCCCCATAATTCGTGGAGCAAACAACAAAAGAAAAATGTGGACATTGTTGTACACACAGGCAGAGTCACGCACACAGACGAACCCACTCGCTCG 
CTCGCTCGCAGCTAAAGTTTGAGTTCGCGTGGTGCCTTGCACATTGTTTATAAACTTTAAGGGCAGCCAACGTGGAGGGGGCACTAAAAAAACAC 
ACCCCACTTTAGCGGGCACAATTTGTTTTCAGGTTTTCAACTGCACATTTAACGCGTGTAAAACTTTTTGCGTTTGGGGCCCACTCTCACACATA 
CAGGCGCAACACAGCACACACACGTTCAATCGAATTTTTCACATACGTTTTAAGGACGTTGCTCCTCGGAGATTGCCTAATTTTTGCGTTTTTCA 
GATGACTCTGTACCCAGTCTATCTAAATTTCTGGTTTTCAGATACTAGAATATGCGTGCAATTTGCGGATTTGGACAGATTTTGGACCCGAAAAA 
AAACTAATGCGGATCCAGTGTGACCACCGCTCGACCGTTCAAATATACCATGGGAATTCACGGCTTAAGTTAGGGATGTGATGGTCTCCGCGTGA 
CTCGAGCTTGGTTTTAAAAAATGTGATACTCTGCAAGTTGTTTATATGTATGGATTTCATGAATTATTTATGTGTTGTAATAATAAACGTGTTTC 
TTACACAATTTAATTGCTCAGCAATTCTACTTTTCTTTTTTGATTGCTTATATTTTATATTATATTTTTGATTGCTTAATTACTAAAATTATTAT 
ATGTTTAAAAAACTTGTTTCTATTTTCATAGTTTTTAATTTTTATACTTATATTTTATAACTTATAGAAATTTCTTGAAAACCCTAAAAATGACA 
TGAGTTTGTTATTTATTGCTATGTTATAAAATAATTAGAAACCATTACAGCAAATATCGTTAATTTGTTTACTAGTGATAGAGTGTGGTAACACT 
AGCATTTCCGTCCACACAATGTTAAGGTCACACTCAGAAGCTGCAAATTTCGAATTTCGACGGTCGCTATGAGTGGAGCTCTCGGAGATGTGGTG 
TCCGAATACCTGGAGCGCGGCCCAGTGCTCGTGGTGGCGGACCTCCTTAGCCTGATAACGGTCTCGTCCTGCCTGGTGATCAAGGTTCCGCAGAT 
AAACACGATACGGGCGAACGAGTCCTCGAAAGGTGAGCACCGCCCACACAACATCTGCAACCTATCCTCCCACTCATATGATACGGATTTATGAA 
TTAGGCATCAGTGTTTTGGGACTGTGCCTGGAGCTGTTCAGTTATACGGTGATGTTGTCGTACAACTACACCAGTGGCTACGACTTCCTCTCGTA 
CATGGAGTATCCTGTTTTGCTGCTGCAGGAGTACGCCCTGATTTACTACGCCTTCAAGTACCAGGATCTGCTGGGAAGGAGGACGCAAGTGGTGG 
CCATACTGTATAGTATAGTGGCCACTCTCATATACATGAAGCTCTTTCCCATTATCATCCTTAAGTTCCTGGTGGTGAGTAAGCCTGTATTGATA 
TCACCTAGTGACTGATTAGTCGGGTATTCTTGCCCAGATAACGACTTCGGCGCAGTTCAAACCCGTTCCCCGCTCTCTTTTCCAATTAACTAATT 
ATGTTTCCGGGCAAACGGGTTTGGTTTGATTTGTTATTTGTTTTAGTTTGAAAAGGCGGTGCTCTTAACTGGGCAGCATCTTCGCTCAAGGTCAC 
CGATGTTTGTTGTATAATATTTGAGCGGCTATTAATACTTTCAGTGCTCGGGCAATATATATAAATCTCGTGAAATTCATCTAAATGTTTGATTT 
CCCGTCAGCCTTTCTGCACTCCCATTGGAGCCACGAGTAAGGTCCTCCAGCTGCTGGCTATTCTGAGGACCAAGGATGCCAGTTCCGTCAGTAGA 
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ACCACATGGGCGCTCTCCGCCTTTACAAACATGAGTGAGATCACATCTTGCATTCCATCACTTGATGTCATCAATTAACACATCATATATGTATC 
CCTTACAGCTCGAATCTACACGGTGTTCTTCCAGTCCCACGATTGGATGTTGCTATCAAACTTCCTGATTTCGACGTTCCTTAGCGCCTCCGTTT 
TCACCGCCGCTTGCGTCTACAAGAAGAAGGCCAAGGCGGAGTAATTGGCCAGCCTGATTAGAAAAGCTCAACATAGACAAACACTGCGCTGGCTG 
GAAATGCTTATCAGTCATTGTTTTAATTCTATTTGCTTATCTGTTTAAATTGATATTACAAATTCTTGTTTGAATCGGTTGTTGTTACTCGTTTA 
ACTGTCCGCGTAGAAAGCCACCACAATTGCACCCACAACTGTTAGAATGGATACTGCATAGATTGCTCTACGTTGCCAATCGATCTTGGTCTTGG 
CCGGTCCAGAACCCACAGTTTTGCGCTCCTTGTGCTTGCGTTTGCCGAACTTGGCCTCGAACTTCTCGAGTTCCAGGCGGTTTCTTTCTTCCTCG 
TCGCGACGCTTTTGTTCCAGCTGCAGTTGCTCTGCCGCCTGGACGCGAATTTGCTCCGCCTTGCAGTTCGAGTCGCAATCCAAGAATGTCTGACC 
AGCACGGTGTTTGTCACAGGCAATCTCCTGCTTGAGTCGCTTGCAGGCGCAGAAAATGCGAACCTTCTTGCGGCACAACTCGGGATTTGGGCACT 
TGCCTGTGTGGCAGATGGCTGTGCAGCGATGTCCGCAAGGATACTGAAAAAATATATATATCACTGTAAGGCTTGATCGACAATGGTATATTACA 
GCACTTACATTCTTCAAGCATCGATTACCGCAGCTCCGTAGCTTTTCTCTTCGCTCGATGATCTCCTGGACCGATCCCGTTTCATCATAATACTC 
GTTGCACTTGTAAACCAACTGGTTGAGTCCACAGTGACACTTGGTCTTAATCACAAAGTTGCAGGGCGCACAGGGCGGTGGATGACATCCTTTGG 
GACAGGCGTGAATACATCCGGTTGGTCGGGGAACGGTGCAACCCTCCTCGCAGTTGGCACAGCCAGTTTGCGCAGACATATCCTGGGGAAGCGGT 
ACGAAATGGCAAACCAATGAACACTTGTGATTGCCGCACTTAAGCTGACGTGCACACGATCGCTGACAGGAAGTGGGTTTGGAGTTCCAGCATG 
(SEQ ID NO: 67) 

Exon: 1001.. 1172 
Exon: 1240.. 1503 
Exon: 2004. .2134 
Start ATG: 1019 

Transcript No. : CT2591 

CGAATTTCGACGGTCGCTATGAGTGGAGCTCTCGGAGATGTGGTGTCCGAATACCTGGAGCGCGGCCCAGTGCTCGTGGTGGCGGACCTCCTTAG 
CCTGATAACGGTCTCGTCCTGCCTGGTGATCAAGGTTCCGCAGATAAACACGATACGGGCGAACGAGTCCTCGAAAGGCATCAGTGTTTTGGGAC 
TGTGCCTGGAGCTGTTCAGTTATACGGTGATGTTGTCGTACAACTACACCAGTGGCTACGACTTCCTCTCGTACATGGAGTATCCTGTTTTGCTG 
CTGCAGGAGTACGCCCTGATTTACTACGCCTTCAAGTACCAGGATCTGCTGGGAAGGAGGACGCAAGTGGTGGCCATACTGTATAGTATAGTGGC 
CACTCTCATATACATGAAGCTCTTTCCCATTATCATCCTTAAGTTCCTGGTGGTGACTCGAATCTACACGGTGTTCTTCCAGTCCCACGATTGGA 
TGTTGCTATCAAACTTCCTGATTTCGACGTTCCTTAGCGCCTCCGTTTTCACCGCCGCTTGCGTCTACAAGAAGAAGGCCAAGGCGGAGTAA 
{SEQ ID NO: 68) 

Start ATG: 19 

MSGALGDVVSEYLERGPVLVVADLLSLITVSSCLVIKVPQINTIRANESSKGISVLGLCLELFSYTVMLSYNYTSGYDFLSYMEYPVLLLQEYAL 

IYYAFKYQDLLGRRTQVVAILYSIVATLIYMKLFPIIILKFLVVTRIYTVFFQSHDWMLLSNFLISTFLSASVFTAACVYKKKAKAE* 

(SEQ ID NO: 69) 



Celera Sequence No. : 142000013384610 

AGTATTTATTCTACTTTTTACTCAGCCAATGCATCTTAAGCCTTCGGGACCTACTCTCGAAAAGAATCCGTCTAGAAACCTGCGACATGTTCAGG 
AAAAACGTAAACGTACGCCCATTAGAGAAAAGATGGCTTCGCTTGAGAAGTTGAGGAATCTCCAAAATAGCAAGTTTGTTCTCTCCCGCGACGCT 
TTTCAAAAAATTCAAAAAATTACTAGTGGCGGTCCCTATTACTTGACAAAAACAAAGATTAGACCATAGGCCTTGATTTGAATATCTCAAAATAA 
AAAAATGCAATGCAAACACATTTGAATCGATTTTTCGCTTTATTACATTTCTTGTATCATAATCGTTTTGTTTTTCTGGTTTTGCCATTTTTTAA 
ATATTGACGTTTTATATAACAATTCATAATTTTAGCTTACAATATTTTTGTTTTGTATTCTTAGAATCGTTTATGCCTTTTGCCATGAAAACACT 
TCTCTCTCTAGCAAATATACAATTTTCGAAACTTGTGATTTTGCAATTATTTATTATGTATATTATATATATTATATGTATATAACTAAAAATGC 
GTAAATTAATTTTTTTTTGTTTGTTACGTGGTTTGTGGGCACATTCCTTTTATCAAAAGGAACAAAACATGAAAATGTACATAAGTGTTATTGGC 
AAATAGTTTACCATCTGTTTTAGATCTTGATACACATACGTGCATCATCGATATAAAGAACCTGAATTGTTTTTAAATTTCGAAGCGGCAGTTGC 
AGTTCTGTATTTGCGCCTCATGAATTGCTCGTTTAACGTGTCTCTCTAAAAGGCAGGAGCAGCTGTGAAGTGCTCGAAGGCAAGGGTCGATCCGT 
CCGCCGTTGGATCTTTATCCTTCCTGACCGTCCATTGATTGGCCATTGTCCTATTATCGAGAGCCCGCCGCTTATTGGTTGGTTGATTATTTTTT 
CTCTCATCTATTTTTATGAAATGGCTGTTAAAAATTTTGGTTTGAACGCATCATTCCCATTCGGTGGTGCCCGGCGGCTTGGCTGTCAAGTCGTA 
CAGCACCCTCGAGATTCCATCCAGCTTGGATATATCGCGTACAATTTGATTTAGGACCTTTGAGCAGGAAAAGATTAAATATACATTTTTCAGAA 
AGATTGAAAACTGACTCACTTGCAAAGGCATTTGCACGGATCCGGGCTCAGCCGGCAGACCAGTCATAAAGTCGTTCGTTATGAACGGACGCAGC 
ACTACCGACCTTCTGCACGAGGGTGTGCGGTTAATGGGATCGCGGTCAAAGTGCACGGGTATGAGAACAACAGGCATCTGCGAGATCTTCCGGTA 
TAGTCCAGCTTGCATTATGATCTAAAACGGTTATGATATAATACATTTTATTTATTTCTATGACAAATCCACTTACTTCATTGGCAATATCATCG 
GCTTGCCTCAGCTGCGAAAGAACTACAGTATTCAGTGTGGTGTGCGTAATATCCGTCACTAGATACTGTACGGGTTCGCCGAAGATATAGCACAC 
CCTGTTCACGTTGTGCAGAATTCGCGGTATGATTTTGGCGAGGAAGAGAAGATCCTGCCAGTTGGGCTCCTGGCTCGTGGATAGGCCTACTACGT 
AGCTATATGACCGTTTATCACCTTGCACGCCCACTGATCGGATGGGCAGCAAAGTTGCCTGGATCTGCGAGTTCGATGAGATACGCATAAGGTCT 
TTCTGTTCCGCCTCGCTCGTGGCCGCCGTTACGCGGTTGATCAAAGCATGGTTCTTCTGCAGTTTATTCTTGTAGTCTACAATCACGCGGATAAT 
AACCTACAATTAAATAGTATATTTATTATAATAACTTTGACAGCAATTAATTGTTAACATGCCTGAGTTTCTGAGTAGTCCTTTTCCATGTATGC 
CTCCTCAGCGCAAAGGACGCGGATTGCCAGGCCAGGACCCGGAAAGGGTTGCCTCTCCACAAGCTCTTGAGGCAGGCCAAGATCATTGCCAAGGT 
CGCGCACTTCATCCTTATGAAAGTCGCAAAGGGGCTCAACCACACGTCCTGCGTTACGAAGCTCTCTGAAGAAATGAATTACAAAAAATAAGCTG 
ATCACTAAAATTTGGTTAGGAATTTAGACTTTTGCTTTTTTACCTGATCAGATCCGTGTCATTGTGGTGCGTTTTGATTGTTTCTGCATTCGTGC 
TCACCATGCTAGAGGCGGACTCGATCAGATCTGGTCGGAGGGTTCCCTGGGCCAACATAACTTCTTCGGGCTTTAGTTTCAATTCGGCTACTACA 
TCATTGGTCACCTTGACGAATATATCACCAATTATCTTGCGTTTTTCCTCCGGATTGTATGTCTGACATAACATCGGCGTTTCCACCACGGAGTA 
CTGTCCGGGCCTCTTGACCTGCGTGGTGCCTTTAAGGAACGTGTAGCCTTCTTTTCGGACGATTAAATCAATGCCAATATCGCGCAGTGAACGCT 
CCACCTTTTCACTTTCCTTCTTGCGCATGAAACCTTGAAGTAAGGGATAAAATATTAGGTTTTAAAGTACAAATGATATAAATTCGGTATTCACC 
GAGTTAATGAATTATTTGACGTAGAAAAAATATAAATGAATAAATTTCGCTAATCTGTCTACTGTCATACCATTATCTACATGCACGGCAATTAT 
CTGATGAGGGTACAAAGCACGGCGGAGCAAAGCTGCACAGACACTCGAATCCACGCCGCCGCTGACCAGGAGCTGTAAAATGTAGATTGTATTGA 
GATATTCATGTCTTGATTGAAAATTTATTTTCCCAAACTCACCAACACCTTATTGTTGCCCACTTTCTCACGGATATAGCGTATGCACTCCTCCT 
TTCGACTACCCATGGTAAAGTTAGGTGTCAGTTCGCAGATTTCGTACAGGAAGTTCGATAGCATCTGTTTGCCATTGATAGTGAGGTCCACCTCA 
GGATGGAACTGTACGCCGTAGATGCGTAGTACTTCATTGTAAATGGCTGTCACAATGCGGTTTGTAGACCAGCCACCAATCTTCAGATTCTCGCC 
TACCCTCTCAACGCTATCTCCGTGGGTTAACAGCACGGACTGTGTGCGACTGAGGCGACTGCGGGAAACGAAGATGTTATGTTTGTTAATTGCAT 
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GTATTAGAATAATTAATGGTAATGATAGATAATATGTTAGGCCACGTGTAGATCAGTTTATAAACTACTCTAAACTAAACTGTTCTTGGCTTTAA 

AATTGTTCTGGCAGCATTGTATCAGCGCTTAGGTAAACGCATTTAAGCCTAAGCACTTTCTTGTTGCTGCTTGTTTTGGGAATCAATATTTCCTG 

GCTCACCTAAAGAGCGGGCACGAGGTCTCGATCTCGATATTTTGTTGGCCATCCTCTCGAACATCCTTCTTGAGCACTGTGCCCCCGAACTCTTT 

GTTGATTAGCTGCATGCCGTAGCAGATGCCCAGCATAGGTATTTTTAGCTTGAACAGATCGGGATCATAGCTGGGCGCATCCTCAGCGTAGACTG 

AGTTGGGTCCGCCGGAGATGATGATGCCTCGATAGCCATTGTTGCGTATCGTGGCAGCTGGCGTATCCAGAGGAAGGATATCCGTCTCAACGAAG 

AGTTCGCGTACCTTACGGTCGATAACCTGCGGAGAAACCAGGACCATTTTAAGAAGAGGTTAAAACTACTCTTGGTTTTAAGCTTACCTTGCCGT 

ACTGTGCTCCCGCATCAAGTATAACAATCTTATCATGCCGCAGGCCATTCTCTGCTGTGCCCAGAAAAATGTTTGAGTTCATTTCTTTATCGTTG 

CAGCACTGAAAAGAGCAAAGATTTGAGTGTCAGTCAAGCAATCGCAGAAACAAACACCGAATCTCGACTGGCTGGTAAACGCGCTTCTGCTGCTT 

ATTTATACTCCAATTGTGGCTGCAATCGACGCTACAGAAACAAGGCTAAAAGCTGGCTAGGAGGTGGGGAAAATAGGTTTCGGATTGGGGCAATC 

GCGGAGATTGAGATTTGTGGTACAGAAGGCTCACATACCGGCAATATCTTGTGGAGCACTCGCAGAAACATTCAACATGCAGCTGGCCAGGACGC 

TCTACGATAGACAATAAACATTGGCAAATATTTTCTTGTCACAGAAAGATGATTTGTGCAATGTGGAAAGAAATTTACAAGCGACGAAAAACAAG 

GCCGAGAGACGCCTACCAAAAATGATTCGCTGTGCATCGCGACGATTACGTTCTGTGGGTATTATTTATTTGACCAAATATAAAACATTTACGTA 

TATGAATAAGCCTATAACTTAATCTTAATAATACCTGTTTATTTAACCGTGGTTAGAATTTAAGACAGAGTGTATTCACAGCCAGCATAAAATCA 

TAGATACCATAGGAAAAAGTGCTCCAAATGATCATAGTTTGGTTTTTTAAAATAATATTAAATAGATATAATTTTTGTATAATTTTGGTCAACAG 

ACTGAATTTGGATTAACAACATGAAAAACGAATTTTTGTGCCAGGAGTATCTGTCAATGCTATATACTTCTACATACTAAAGTAGATTATCAAAG 

CATGGAAGTTAACATTTAATTTTATAGACACATTTGTCCCATCAAAGAATTTAGAAAAACAGCAGGTCAGGAACATGCTTTTCGAATTCCCGAAG 

TAATCGGCGACTTAGTACATATATTTTAAAATACAAATTTCGATGCATGAGGATGAGGATGTGAGGAGGAAACGGAACGTAATGGTGATGGGGTC 

AGGGGGGAGGGCGCCGGCTACTGACCGCCAGATGACAGGCGAAAATACTAAGAGGAGAGTGGCAGCGGAAGCGAGAGAGAGCGAAAGATGAGCGG 

AGAAAAA.TGGCAGGTAAACGAAAACAGAGCTGAAAAGTGCACAAGTAAAATAGAAAACAAAAATAAACAACAACAATCCAG 

GTAGCAAGTCTTCTTGTTTGCCATTTATTTTCTCTTGAAGTGTGTAAAAACGCAGCAGCAGCAATATGAAAAGCCTCTCCGAAGGTTAGGAGTGC 

ATTAGGTCACTGAAGCGATGCGATGTCTGTGATGTAAGCCGTGCTCCTCCTTTGGCTCTCTCTCATTTTTGATAACAACATGCAAGAGGTTATCA 

GAAACGGCACAAACCACAGCAACAACAACTAAAGAGGGCGGCCGCGTGGTTGTAATTTAATACACATGCAGAAGGCACGCACACTTACACACGTG 

GAGCCGCCAACAAGCGGCGCTTATAACCACAAAATGGCCACGGGATATATTTGCAAAACATTTAAAAAATTGCAAATGCGGAGAACAAAAAATCG 

CAACAAAAGCAAAGGCAAATGCGGTTGAAGCTAATTTGCGAATTCAACGGGCGGGGTGCGGAATTGTGGGCGGAGCAGCACCGCATGGCGATTTG 

GCTGGGATCGGGGATCTTTTTCGCCATATTGCGACTCTCACCTTCTCTCGTTTTAGCCTGGCTGCGGTTTTCCAGAACCGGGAAGTCCGTCGGCA 

AGGCCGTGGAATACCGTGACAAGCAGCGCAAGACTTTGTCTTAAGACGAAATTCGCGATCCGTTGTTTGCGGTCCCCCGTTTTCCTTCCTCACGC 

ACACACTCAAAATTTCTTCGGCTGCTGCAGCTTCTTTTCCGCGTATCCAGGGCTGCTCGACACAGCTGGCGATGAAATCCCTGTGATTTTCACGG 

CTGCGTTATTATGGCCGCTTCCAGGGACTTTCCTTTTGGATTTCCTGCAGTTTTTTCGCGAATTAAAGCCGAAAGATGACTTATTGACGAGCGGA 

TGACCATATTTCGGATTTGGGAAAAATCCAGCTGTGCTGCAAACGAAAAATACCAGCTGTGAACGTTTTTGGTATTAATATTTACCAAATAAATA 

AATTTATATTTATTTCGAAAACAATGAAAATTCCTTAATAACATTACATTACTTCTTTATTAGGAGTGCTTAAGTATTCTTTTAAATAATGAATT 

ATAATTAAAATTATAATCTGTTTGTACAGTGTTGACAAACGAGCACTTTTTGTAGAACCATATACATTACCCGGTCACACTGAGCCCGCCAAAAC 

AGATATGTTATTGCGCTTATTTAGAAAACCAAGAAAAAACACGAGAACACGTGAAAATACAAATCTACCCAAATGAAATGGGTCCTGCTCAGAAA 

TCCGGAACAGATATTAGTATCGATGATGAGGAGGAAATACTGGCTCTGGAAAAACTACTGGGTGCAGCAGAAAACGAAAATACAAAATCTGCAGA 

GTCAGAAAAAGCAAAACCCACCGCACCCATTTTGGTGCCAAAACTACGAGAAGACAACAGTTTTGCTAATGCCTTCACCTTCGAGAAGATCGTGA 

AACCGGAAAAGCAGAAGAATGCTGCTATCATTAAGGAACCAGAGCTGGACTCGTCCGACGACGAGGAGGTAAAGAACTTCCTGGAACGAAAGTAC 

AATGAGTACGGCAGTGATATAAACAAGAGACTGAAGCAGCAGCAGGAGAACGCCTACGAGTCCAAGGTGGCGAGGGAGGTGGATCAGGAGCTTAA 

GAAGTCTATCCACGTGGTTACATCCACCCCGCAACCCCTGAAAAATCCGCATAATCCTATTAAACGGCAATCGGCGGTGAGCACCACGTTTCAAC 

GTCCTCCGCCAGTCGCTGCCGCCGTGGCATCTACATCCCAGTCAAGTGCTCCCGTATCTGCTGTTTTTACGGATCCAGTCTTCGGACTGCGCATG 

ATCAATCCGCTAGTCTCCAGCTCACTGCTGCAGGAGCGCATGACGGGCAGGAAACCTGT 



(SEQ 


ID NO. 


70) 


Exon: 


5804 


.5457 


Exon: 


3805 


.3698 


Exon: 


3636 


.3332 


Exon : 


3098 


,2798 


Exon: 


2732 


.2636 


Exon : 


2503 


.2134 


Exon ; 


2060 


.1868 


Exon : 


1808 


.1407 


Exon : 


1351 


.1160 


Exon : 


1102 


.1001 


Start 


ATG: 


3787 



(Reverse strand: CAT) 



Transcript No. : CT2988 

ATATGGTCATCCGCTCGTCAATAAGTCATCTTTCGGCTTTAATTCGCGAAAAAACTGCAGGAAATCCAAAAGGAAAGTCCCTGGAAGCGGCCATA 
ATAACGCAGCCGTGAAAATCACAGGGATTTCATCGCCAGCTGTGTCGAGCAGCCCTGGATACGCGGAAAAGAAGCTGCAGCAGCCGAAGAAATTT 
TGAGTGTGTGCGTGAGGAAGGAAAACGGGGGACCGCAAACAACGGATCGCGAATTTCGTCTTAAGACAAAGTCTTGCGCTGCTTGTCACGGTATT 
CCACGGCCTTGCCGACGGACTTCCCGGTTCTGGAAAACCGCAGCCAGGCTAAAACGAGAGAAGTGCTGCAACGATAAAGAAATGAACTCAAACAT 
TTTTCTGGGCACAGCAGAGAATGGCCTGCGGCATGATAAGATTGTTATACTTGATGCGGGAGCACAGTACGGCAAGGTTATCGACCGTAAGGTAC 
GCGAACTCTTCGTTGAGACGGATATCCTTCCTCTGGATACGCCAGCTGCCACGATACGCAACAATGGCTATCGAGGCATCATCATCTCCGGCGGA 
CCCAACTCAGTCTACGCTGAGGATGCGCCCAGCTATGATCCCGATCTGTTCAAGCTAAAAATACCTATGCTGGGCATCTGCTACGGCATGCAGCT 
AATCAACAAAGAGTTCGGGGGCACAGTGCTCAAGAAGGATGTTCGAGAGGATGGCCAACAAAATATCGAGATCGAGACCTCGTGCCCGCTCTTTA 
GTCGCCTCAGTCGCACACAGTCCGTGCTGTTAACCCACGGAGATAGCGTTGAGAGGGTAGGCGAGAATCTGAAGATTGGTGGCTGGTCTACAAAC 
CGCATTGTGACAGCCATTTACAATGAAGTACTACGCATCTACGGCGTAGAGTTCCATCCTGAGGTGGACCTCACTATCAATGGCAAACAGATGCT 
ATCGAACTTCCTGTACGAAATCTGCGAACTGACACCTAACTTTACCATGGGTAGTCGAAAGGAGGAGTGCATACGCTATATCCGTGAGAAAGTGG 
GCAACAATAAGGTGTTGCTCCTGGTCAGCGGCGGCGTGGATTCGAGTGTCTGTGCAGCTTTGCTCCGCCGTGCTTTGTACCCTCATCAGATAATT 
GCCGTGCATGTAGATAATGGTTTCATGCGCAAGAAGGAAAGTGAAAAGGTGGAGCGTTCACTGCGCGATATTGGCATTGATTTAATCGTCCGAAA 
AGAAGGCTACACGTTCCTTAAAGGCACCACGCAGGTCAAGAGGCCCGGACAGTACTCCGTGGTGGAAACGCCGATGTTATGTCAGACATACAATC 
CGGAGGAAAAACGCAAGATAATTGGTGATATATTCGTCAAGGTGACCAATGATGTAGTAGCCGAATTGAAACTAAAGCCCGAAGAAGTTATGTTG 
GCCCAGGGAACCCTCCGACCAGATCTGATCGAGTCCGCCTCTAGCATGGTGAGCACGAATGCAGAAACAATCAAAACGCACCACAATGACACGGA 
TCTGATCAGAGAGCTTCGTAACGCAGGACGTGTGGTTGAGCCCCTTTGCGACTTTCATAAGGATGAAGTGCGCGACCTTGGCAATGATCTTGGCC 
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TGCCTCAAGAGCTTGTGGAGAGGCAACCCTTTCCGGGTCCTGGCCTGGCAATCCGCGTCCTTTGCGCTGAGGAGGCATACATGGAAAAGGACTAC 

TCAGAAACTCAGGTTATTATCCGCGTGATTGTAGACTACAAGAATAAACTGCAGAAGAACCATGCTTTGATCAACCGCGTAACGGCGGCCACGAG 

CGAGGCGGAACAGAAAGACCTTATGCGTATCTCATCGAACTCGCAGATCCAGGCAACTTTGCTGCCCATCCGATCAGTGGGCGTGCAAGGTGATA 

AACGGTCATATAGCTACGTAGTAGGCCTATCCACGAGCCAGGAGCCCAACTGGCAGGATCTTCTCTTCCTCGCCAAAATCATACCGCGAATTCTG 

CACAACGTGAACAGGGTGTGCTATATCTTCGGCGAACCCGTACAGTATCTAGTGACGGATATTACGCACACCACACTGAATACTGTAGTTCTTTC 

GCAGCTGAGGCAAGCCGATGATATTGCCAATGAAATCATAATGCAAGCTGGACTATACCGGAAGATCTCGCAGATGCCTGTTGTTCTCATACCCG 

TGCACTTTGACCGCGATCCCATTAACCGCACACCCTCGTGCAGAAGGTCGGTAGTGCTGCGTCCGTTCATAACGAACGACTTTATGACTGGTGTG 

CCGGCTGAGCCCGGATCCGTGCAAATGCCTTTGCAAGTCCTAAATCAAATTGTACGCGATATATCCAAGCTGGATGGAATCTCGAGGGTGCTGTA 

CGACTTGACAGCCAAGCCGCCGGGCACCACCGAATGGGAATGA 

(SEQ ID NO: 71) 

Start ATG: 367 (Reverse strand; CAT) 

MNSNIFLGTAEKGLRHDKIVILDAGAQYGKVIDRKVRELFVETDILPLDTPAATIRNNGYRGIIISGGPNSVYAEDAPSYDPDLFKLKIPMLGIC 

YGMQLINKEFGGTVLKKDVREDGQQNIEIETSCPLFSRLSRTQSVLLTHGDSVERVGENLKIGGWSTNRIVTAIYNEVLRIYGVQFHPEVDLTIN 

GKQMLSNFLYEICELTPNFTMGSRKEECIRYIREKVGNHKVLLLVSGGVDSSVCAALLRRALYPHQriAVHVDWGFMRKKESEKVERSLRDIGID 

LIVRKEGYTFLKGTTQVKRPGQYSVVETPMLCQTYNPEEKRKIIGDIFVKVTNDVVAELKLKPEEVMLAQGTLRPDLIESASSMVSTNAETIKTH 

HNDTDLIRELRNAGRVVEPLCDFHKDEVRDLGNDLGLPQELVERQPFPGPGLAIRVLCAEEAYMEKDYSETQVIIRVIVDYKNKLQKNHALINRV 

TAATSEAEQKDLMRISSNSQIQATLLPIRSVGVQGDKRSYSYVVGLSTSQEPNWQDLLFUVKIIPRILHNVNRVCYIFGEPVQYLVTDITHTTLN 

TVVLSQLRQADDIANEIIMQAGLYRKISQMPVVLIPVHFDRDPIKRTPSCRRSVVLRPFITNDFMTGVPAEPGSVQMPLQVLNQIVRDISKLDGI 

SRVLYDLTAKPPGTTEWE* 

(SEQ ID NO: 72} 

Name: GMP SYNTHASE LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384340 

CAGGTTTTTTGTTTTCTTTATTTTTTGGTTTCTCTAGATCCTCGAAGAAGCGGAGTTGAGTCCGCTTATATGGCCAGTCATGTGGCACGCATCAT 
CATTTATCAAAGAACGCGGCAACAATTCGCAGGGCTGACAACAAGCCAAAAAACAACCTCCAATCTTTCGTAACGAAAGACACCCAACTTGTTCC 
ACCCACAAAAGACGCCGCCGTTTTCGTGTCGCTACTATTTTTACCCACTCCCAGAAATTTTTGTTTTTGTTTTTTTCTTTTTTTTTCTGTCTATT 
CGGTCTCCTGCGTTTCCTTGGATGCTTTATTGTTGTTGTCGACGCTTATCTGCCGTTTGACATTGATGTGCTGGGAGTACAGTGGTCGAAATGGA 
GAAGTACACTTGATTATCTACCAAAAGTAAAAAAAAACCGTTAGTTTAAGTAGATTTATCGGATAGTTGCTTTATAATAGACCACACTGCGTATG 
AGTAATATTTTACAGAGCTTATACATAAAATATTGCAAGTGGAATAGTTAACATTTCTCTAAATGATTTAAGTATTCAATTATTTTAGCCACTTT 
GGTAGTAGGGGTTTCACTGTACGCGGCGCTTGTTGGTCCTGTCATGTAACGGTTAAATCTGCATGAAAAATTAATGCTGTGTTGTTGATACGGGG 
AAAGTGGAGAGGGAAAATGGTCGCCGTATTGCGTATGTATGTATGTACATACATATGTACATTTATCTGCCTCTGCGAATTGTTGTTGTGTCGAA 
ATTAGCATCTCGAACAGCTTGCGTTACTATTCCACCCGCAGATTTACAGTTACCCACTTACATATGCACACTTGGTTTACCCTTGCATTTGCTGT 
TTTTAAAGACATTGATTTCGTGAGATTTTATTGTACTATTAAGTTGACCGTTACATTGGATTTATCTCTGTCTGGCTCATGGGTGGGAACAATTT 
GCGAGAGTAAGTGGGTGAGCACGGGCGGCATAACGGGATACAGTGAACACTCAGTAGCTGGGACGCCATGGGGGATATGGACAGTGAAACTGGCG 
CCGCTGCTGCGTCGACTGCGACGCCGACGGCGACTGCGACTGCAACTGCAACTGCAACTGCGACAGCGGCTCGTCTGCGTCGCGTCGCTGCAGCA 
GCGCTGCCGCTGCGCGGTTAACCGTTTCGCGCCGATAGCCAAGCAGCGAGTTACCGTTACGCGATTGCTGCTGCTGAGCTGCCACCGTCAGCGTT 
TCCCTTCCCGCTCTCCGAACAGCTCCACCAACTCCAACTCCCCTTCCCCCTCCACCACCAAAAATCGCAGCTCCATTTCCCCTTTTCGTTGTTGT 
TTTTGCCGTTAACACTATATGCTGCGCAGTCCGCTTCGCTGTTAGGGGTGCTGCTGTTGCAGTAGCAGCTGTAAAGAGTTAACATGATTTTTTTT 
TMCTAAAAGAAGTACAAAATGTTAAAGGAATTTATCAAAATCCGAAATTGTACCCTCGTAAACCGGCTTACCTGCTGTCTTCTTCATCTGTGAT 
CGATTTAATCCCTTCGTTTTCAGAACCATTCCATTAGATTCCGGCTTGGCTGTCAACGAAGAATGAGTTCCACATACCGCTTTGGATGACAGCTG 
ATCGTCCAACGGCGGAGCCTTGCCACCCGAACCGGCTGCACTGCGTCTGACCAACGTTCGAACATTGGTCCGATTAGAGATAGATTGTTGTATCG 
AGCCCTGGTGGCTCTGCAGCTTGGCGGTTTGTTTCGATCCCTCGATGGCAGACGGACTTATCGGGGACTCCTGCAGATGCTGCGCCTGCTGTTGG 
AGGATCTCCGCCTTGCGTATCTGCTGCTGGGTGCGGAAGGCACGCTTGTAGGCCGAAACCTGTCAAGGATCATGGGATATCTATGAGATATATGT 
GAGTTAGACATATTACCTACCTGCGAGGCACTCGCCACCAGCAGCTGATTGGCTCCCTTGACCTGGCCTGGTGTCCTGTCCTCCCGCTTCTGATT 
CGCCTTTCTTTCCGGTTTCTTTCCGAGCTTTGGATGCGATTTCTTGGAGGCATTAGCCGCCGTCACGTCATCATCTTTGGCCGGCTGCAGGAAGG 
ATCGATGGCTGGAGCGGCTAACCTGGTTGCTGGTCACCTTGGTCACTCGCGGCTTATCATGCAGAGTCTCAAGCAGCTGCTTCTTCTTGACATTT 
TCCTTAGCCTTCTCCTTGACCGTGTCCTTGACCTCGTGGGGACTGATTGGCACATCCTCGCACTTGGGTTCCTCCACTTTGGTTGAGTTCTGCAG 
ACTCTGTTGCTGCAGGTGGACGCGACGCACGGGTGCCAGGGTGGCTCTATTGGACTCATCTTGAACTGAAGCCGCTTCCGAATTTGCTTGCGAAG 
TGTCGTGGGAAGTGGAACCGGAATTGGAATTGGCTTTACCATGCTCCGAATTCAATTTCGTCGATTCCAAAGTATCGTCTTTGGTTTTTCCCTCA 
ATCGAGCGTAATTTTGTCGCAGATCCCGATGGGATTGCCCAATTTTGACTACTGGCATTTGAGTTACCCACCGAAGCGTTGTCTGATTTTGCTAC 
ATTTGACATCAAACCCTTGCCGCACGGGCTACTTGATCTGGGGTACTTGGGCCAATCTTCACCTGGATTAACATCCCAATCGCCTGATGTGTCTT 
CCGAACTCTTTGCATATCCGAAATCGTTAACCGACCTTTTACCTAAGTCTTCTGCATATACAATTATTATAATTATTATCACTCACATACATATA 
TGATTATATCATCTAGTTGACTTACCTTCATATCTACCCAAAGAATTATCCCAAGTGATTGTCGAGTTGCTGGGCCTGAAACTAGTCCGCGAACT 
GCTGTCCAAGCTGGATCCCCGAGTGGTTACATCGAAGCCCCGTGGACCGGGTGGAGTGGCGGGTCGCGCTATGGATTCCAGCGATTCCATTAGCT 
TAACGCCTCCGGACTCCAGGCTTCGGCTCCTCTGGCGATAGTTGGCGTGCAGGACATTGTGAAAGCTAAGCTGGCGCGGCTGTTGTCTCACCTCC 
GAACGTCGCGTGGTGGCATTGGTTTTGGTGGTATTAATTGACATATCCGAGCTCTTCTCAATTCGGCTCCTTGGCGAATCACGGATACTGGCATT 
GATTGGTCTCTGCTCGTTCCGATTGAAGGCATATGTGGCCACGGGTCCATCGGCGAATTGGATGTACTCGAACTGCTTGATGCGATCGCCCAAAG 
TCTGAAAGCGGTGGCGCATCTCCGACGGCTGTTGTCCGTTTCTGGTGGTGGTGCTGTCATTTCCAGTGCCTAGTGGACCGAGTCCAGACCGCTTT 
CGTCCACCGTCCAGCTTGGTCAGCACCGAAATGCGACGCTTGAGGGCGGCATCCACTGGCATCAGGGAGTCGCCATCGCTACGCGGCATTTGGAT 
GCCCTGCGCCAGTGAGCTCTTCGCCATGGCGGCTATGTGGTGTTGCCTGCTTGGCGGCATCATTGCTACGGATCCCAAAATGGCTGCAGATGGAC 
GGGGTGGCATGCTCATGGAACGGCCACTGAACATGCCCATATCGACCAGATGAGACTCCTCTTTGGAGTCCTTGCTTTTGCGCATCGATCGGCTC 
TGGCGCTCCCGTTTCGCTTTGCTTTTGCGGAAGGATCCGACCAGGAATCCGGTGCGGGATGACTCGCTACTGCGCCCGCTGGAGCAGCTCGAACT 
GGCGACCCTACGCACGGAACTTTGACCGTAAACGGAGGCGTAAGGACGAGCCGCCGTAAAGGACATGGAGCGTGAACGATTCGCCTGCATCATCG 
AGTACAGAAATTGGCCGGAATTGCTATTGCTGGCCGCCGATGCCTGATCCGCTTGCATGATGTGGCTGCGCGGATAGATATCGTTGGCACTGGGC 
TTCGCCTCCGCCCTCTTGATCCGTCGCAAAGGACCAGCGTTCTCAGTACGGAAACGTGGGTGTCGATTGTCTACAAATTGATTGTATACGATGTA 
TTTTACTTTACTTATATATAAGAAATGTTAGTTATAACGGTGAACTTACGTGAACTATAGCCGCGCTCGTAATAGGATCTAACGGCACTTTCGTA 
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CAACTCGCTAGTGTTGCGCAAAAACCAGGGTTGCATTCGGGTATTCACTTTGATCAAACGATTTATCTGGTCGGGTTTCATCGTAACGCCGGATA 

ATTTCTACGTCCGAATGTTGGCAATATTCGAATTTGGATAAATGGCACAGGTATTTGTGGCCTATAATAATAACTAGTTTGGCTTCTATCTATTG 

GCTTGAACTTGAACAATTTTGCTTTCCAAATGCACTGAATCGAAGTTAATGTTTGAATTTTTATTTGCCTGTTGCTCAGACCACTTTTGTATTGC 

TTGACAATTTAATGCTCTTAAAACTAATGGCTTGGGGCAAAACGGGAGAAAAGTAATCGAAAACGTAACTAACTGCGGTGTACTAAAACATGCCG 

GTGCGTGAAAATGGATATGTCTGTCTGTATTTTGAAAAGAAAAAAAAGAAAATCACACGCACGAAACTCAAGTAACGTCAACGTTAAAGCCGAGT 

AAAAAGAAGTGTTACATTTATTTCGATTTTGACACAACAATCGGAGTCTGCCGAGTTTATTCCCTCTATATCTATATATATTTCTAACACGCGCT 

ACAAATCAATAAGTCAATATGAAGTCAGGCTAACTTTAATCCAAGGAATTCATCTGTGCATCAATGATCTTTGGTTAGTCCATATGAAGTCCAGC 

AATCTTGAATCCAAGAATTCCCACAGTGAATGAACACACGAGCACCCAGCTTAATTTAACCTCGGTTGGCTCAGTTTGCGGATTTCGCGTGTCCG 

TAACGAACGAAACGCCGAACTTAGTTGTATAATTATGTGTGTTTCAATCAAATTAGCCCACACCGGTCATACATAATGCACTACCATGCTCACTC 

ATTCGTTAGTCAACGCTGCTAATCCGAGTCAACCACTCACTCCAACCCGCCGGGTAGCCAACGTCATGGACTCACGAAAAAGCAGTCATTTGACA 

TTGATTTATCAGTGGCTGTCAAAGTGCCGTCATCCGCCACCCCCGCTTTTCCAGCCCGCTTTTCCAGCCCGCTTTTCCACCCACTTCCAAACACA 

CCCCCCCACCACTTTGGCCGCTGGGCGTGGTCAAGGTAAAAATGTAATAATTAGCCGCAGTGATGTGATGATGTTCGGGGTCACCCCCTCGTTAG 

CCATGGTATCGTATCTCTGGCCCATCGTGGTGCCTCGATTTTGAGCTTGGCCAGCAGCAATTTGTCAAAGTTTGGCCTGAAAAATGAATGCTGGT 

CGTCCTGTTTGTGCACTCGAAGAAATTACGCAGTGAAAGTCATTAAGAGATATCCTTAAACTTCTGTTAAACTTCTTCTGGGGACACAAGAAAGG 

TTTCATAGGCTTACTATACTATTTTTCATTACACACAGTCTTAATTATGCAAAATAATAAATAGTTTAAAGGTATATGCCTATTTTAAATTATAT 

TCCTGATATTTAAGCGAATTTTTTAGTGATTCATAGCTAGTTTTCGGGCTTATTACCTGACATA 

(SEQ ID NO: 73} 

Exon: 4574.. 4040 
Exon: 3965. .2781 
Exon: 2711. .1921 
Exon: 1864.. 1498 
Exon: 1398.. 1001 

Start ATG: 4166 (Reverse strand: CAT) 
Transcript No. : CT3184 

TAACACTTCTTTTTACTCGGCTTTAACGTTGACGTTACTTGAGTTTCGTGCGTGTGATTTTCTTTTTTTTCTTTTCAAAATACAGACAGACATAT 

CCATTTTCACGCACCGGCATGTTTTAGTACACCGCAGTTAGTTACGTTTTCGATTACTTTTCTCCCGTTTTGCCCCAAGCCATTAGTTTTAAGAG 

CATT AAAT TGT CAAGCAAT AC AAAAGTGGTCT GAGCAACAGGC AAATAAAAATTCAAACATT AACTTCGATTCAGT GCATTTGGAAAGC AAAAT T 

GTTCAAGTTCAAGCCAATAGATAGAAGCCAAACTAGTTATTATTATAGGCCACAAATACCTGTGCCATTTATCCAAATTCGAATATTGCCAACAT 

TCGGACGTAGAAATTATCCGGCGTTACGATGAAACCCGACCAGATAAATCGTTTGATCAAAGTGAATACCCGAATGCAACCCTGGTTTTTGCGCA 

ACACTAGCGAGTTGTACGAAAGTGCCGTTAGATCCTATTACGAGCGCGGCTATAGTTCACACAATCGACACCCACGTTTCCGTACTGAGAACGCT 

GGTCCTTTGCGACGGATCAAGAGGGCGGAGGCGAAGCCCAGTGCCAACGATATCTATCCGCGCAGCCACATCATGCAAGCGGATCAGGCATCGGC 

GGCCAGCAATAGCAATTCCGGCCAATTTCTGTACTCGATGATGCAGGCGAATCGTTCACGCTCCATGTCCTTTACGGCGGCTCGTCCTTACGCCT 

CCGTTTACGGTCAAAGTTCCGTGCGTAGGGTCGCCAGTTCGAGCTGCTCCAGCGGGCGCAGTAGCGAGTCATCCCGCACCGGATTCCTGGTCGGA 

TCCTTCCGCAAAAGCAAAGCGAAACGGGAGCGCCAGAGCCGATCGATGCGCAAAAGCAAGGACTCCAAAGAGGAGTCTCATCTGGTCGATATGGG 

CATGTTCAGTGGCCGTTCCATGAGCATGCCACCCCGTCCATCTGCAGCCATTTTGGGATCCGTAGCAATGATGCCGCCAAGCAGGCAACACCACA 

TAGCCGCCATGGCGAAGAGCTCACTGGCGCAGGGCATCCAAATGCCGCGTAGCGATGGCGACTCCCTGATGCCAGTGGATGCCGCCCTCAAGCGT 

CGCATTTCGGTGCTGACCAAGCTGGACGGTGGACGAAAGCGGTCTGGACTCGGTCCACTAGGCACTGGAAATGACAGCACCACCACCAGAAACGG 

ACAACAGCCGTCGGAGATGCGCCACCGCTTTCAGACTTTGGGCGATCGCATCAAGCAGTTCGAGTACATCCAATTCGCCGATGGACCCGTGGCCA 

CATATGCCTTCAATCGGAACGAGCAGAGACCAATCAATGCCAGTATCCGTGATTCGCCAAGGAGCCGAATTGAGAAGAGCTCGGATATGTCAATT 

AATACCACCAAAACCAATGCCACCACGCGACGTTCGGAGGTGAGACAACAGCCGCGCCAGCTTAGCTTTCACAATGTCCTGCACGCCAACTATCG 

CCAGAGGAGCCGAAGCCTGGAGTCCGGAGGCGTTAAGCTAATGGAATCGCTGGAATCCATAGCGCGACCCGCCACTCCACCCGGTCCACGGGGCT 

TCGATGTAACCACTCGGGGATCCAGCTTGGACAGCAGTTCGCGGACTAGTTTCAGGCCCAGCAACTCGACAATCACTTGGGATAATTCTTTGGGT 

AGATATGAAGAAGACTTAGGTAAAAGGTCGGTTAACGATTTCGGATATGCAAAGAGTTCGGAAGACACATCAGGCGATTGGGATGTTAATCCAGG 

TGAAGATTGGCCCAAGTACCCCAGATCAAGTAGCCCGTGCGGCAAGGGTTTGATGTCAAATGTAGCAAAATCAGACAACGCTTCGGTGGGTAACT 

CAAATGCCAGTAGTCAAAATTGGGCAATCCCATCGGGATCTGCGACAAAATTACGCTCGATTGAGGGAAAAACCAAAGACGATACTTTGGAATCG 

ACGAAATTGAATTCGGAGCATGGTAAAGCCAATTCCAATTCCGGTTCCACTTCCCACGACACTTCGCAAGCAAATTCGGAAGCGGCTTCAGTTCA 

AGATGAGTCCAATAGAGCCACCCTGGCACCCGTGCGTCGCGTCCACCTGCAGCAACAGAGTCTGCAGAACTCAACCAAAGTGGAGGAACCCAAGT 

GCGAGGATGTGCCAATCAGTCCCCACGAGGTCAAGGACACGGTCAAGGAGAAGGCTAAGGAAAATGTCAAGAAGAAGCAGCTGCTTGAGACTCTG 

CATGATAAGCCGCGAGTGACCAAGGTGACCAGCAACCAGGTTAGCCGCTCCAGCCATCGATCCTTCCTGCAGCCGGCCAAAGATGATGACGTGAC 

GGCGGCTAATGCCTCCAAGAAATCGCATCCAAAGCTCGGAAAGAAACCGGAAAGAAAGGCGAATCAGAAGCGGGAGGACAGGACACCAGGCCAGG 

TCAAGGGAGCCAATCAGCTGCTGGTGGCGAGTGCCTCGCAGGTTTCGGCCTACAAGCGTGCCTTCCGCACCCAGCAGCAGATACGCAAGGCGGAG 

ATCCTCCAACAGCAGGCGCAGCATCTGCAGGAGTCCCCGATAAGTCCGTCTGCCATCGAGGGATCGAAACAAACCGCCAAGCTGCAGAGCCACCA 

GGGCTCGATACAACAATCTATCTCTAATCGGACCAATGTTCGAACGTTGGTCAGACGCAGTGCAGCCGGTTCGGGTGGCAAGGCTCCGCCGTTGG 

ACGATCAGCTGTCATCCAAAGCGGTATGTGGAACTCATTCTTCGTTGACAGCCAAGCCGGAATCTAATGGAATGGTTCTGAAAACGAAGGGATTA 

AATCGATCACAGATGAAGAAGACAGCAGCTGCTACTGCAACAGCAGCACCCCTAACAGCGAAGCGGACTGCGCAGCATATAGTGTTAACGGCAAA 

AACAACAACGAAAAGGGGAAATGGAGCTGCGATTTTTGGTGGTGGAGGGGGAAGGGGAGTTGGAGTTGGTGGAGCTGTTCGGAGAGCGGGAAGGG 

AAACGCTGACGGTGGCAGCTCAGCAGCAGCAATCGCGTAACGGTAACTCGCTGCTTGGCTATCGGCGCGAAACGGTTAACCGCGCAGCGGCAGCG 

CTGCTGCAGCGACGCGACGCAGACGAGCCGCTGTCGCAGTTGCAGTTGCAGTTGCAGTCGCAGTCGCCGTCGGCGTCGCAGTCGACGCAGCAGCG 

GCGCCAGTTTCACTGTCCATATCCCCCATGGCGTCCCAGCTACTGA 

(SEQ ID NO: 74) 

Start ATG: 409 (Reverse strand: CAT) 

MKPDQINRLIKVNTRMQPWFLRNTSELyESAVRSYYERGYSSHNRHPRFRTENAGPLRRIKRAEAKPSANDIYPRSHIMQADQASAASNSNSGQF 
LYSMMQANRSRSMSETAARPYASVYGQSSVRRVASSSCSSGRSSESSRTGFLVGSFRKSKAKRERQSRSMRKSKDSKEESHLVDMGMFSGRSMSM 
PPRPSAAILGSVAM^PPSRQHHIAAMAKSSLAQGIQMPRSDGDSLMPVDAALKRRISVLTKLDGGRKRSGLGPLGTGNDSTTTRNGQQPSEMRHR 
FQTLGDRIKQFEYIQFADGPVATYAFNRNEQRPINASIRDSPRSRIEKSSDMSINTTKTNATTRRSEVRQQPRQLSFHNVLHANYRQRSRSLESG 
GVKLMESLESIARPATPPGPRGFDVTTRGSSLDSS5RTSFRPSNSTITWDNSLGRYEEDLGKRSVNDFGYAKSSEDTSGDWDVNPGEDWPKYPRS 
SSPCGKGLMSNVAKSDWASVGNSNASSQNWAIPSGSATKLRSIEGKTKDDTLESTKLNSEHGKANSNSGSTSHDTSQANSEAASVQDESNRATLA 
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PVRRVHLQQQSLQNSTKVEEPKCEDVPISPHEVKDTVKEKAKENVKKKQLLETLHDKPRVTKVTSNQVSRSSHRSFLQPAKDDDVTAANASKKSH 
PKLGKKPERKANQKREDRTPGQVKGANQLLVASASQVSAYKRAFRTQQQIRKAEILQQQAQHLQESPISPSAIEGSKQTAKLQSHQGSIQQSISN 
RTNVRTLVRRSAAGSGGKAPPLDDQLSSKAVCGTHSSLTAKPESNGMVLKTKGLWRSQMKKTAAATATAAPLTAKRTAQHIVLTAKTTTKRGNGA 
AIFGGGGGRGVGVGGAVRRAGRETLTVAAQQQQSRNGNSLLGYRRETVNRAAAALLQRRDADEPLSQLQLQLQSQSPSASQSTQQRRQFHCPYPP 
WRPSY* 

(SEQ ID NO: 75) 



Celera Sequence No. : 142000013384562 

TACTGTACTGGATTAACCAGATTGCAATCCCTGCTGCTGGTTTTTATTTTTCTGGACTTCTTTCAGCTTCACTCTTAATTATGCAAAAGTTGTTC 
TCTTGGTTTGGCTTTCTGGGTTCTTTGGTTGCATATAAATAAATGTAAATGCGGGTCAATGAACTTTGCAGATTATCATATGATAACCAAATGTA 
GATTGGGTCCGCATAGAGAAAGTATAATCTAATATATGTTTTTGATCAGGTAAGTATCAATAAATAGTTCTGATCAATAAAAGTTCATTGAATTC 
ATTGTTTTAGTGTTTTAATTAAATACAGGAAATACAGGAAGAAAACGTGAAAATAAATTCCCGAATCGAAAGACAATAAAACTGCCCGCTTGAAA 
TTGAATTAAATACAATTGTTAACTTGTTGTGATTTCTCAAAAAATTTCAAACACGTGCTGCTGTGACATTGACCCATCAATTGATCCCTTCGAGT 
TAACTGGGAGTAAGTACAGCAGCGGTCAAGAAACTAGCTTCCAGAACATTGAATAATTTCAAAGTCAATAAAGCAAGTAAAAGTGCTTTGAAGTG 
CTAATACATGTGAAAGGTATTTTAAAGGTAATTATAATTTAATACAATGAGTTCAAAAAACTTCTTAATATGCACATTGATAATTTCTTTCTAAT 
TGATAATTTCTTTCTAATTGATAATTTCTTTCTAATTTATAATTTCTTCCTGGATTTGCGGACTTTTGTCTTAACAGAGGACAGTTTTAATATAA 
AAAATATTCACTAAGAAGACGTAATGGAATTATTTTTGACAACACATTATTTATGGTGCACAGTTCAGCACTTAAACTTGTTTTTTCTTGGCTTT 
GGGTATATGTATGTACATATATTCTTTTGGTTTGTGTTCTTAACACACTTGTGTCACTCTATTCCTTTTTTCGGCACTTGTTTCGAGTGCGTAGT 
TGTACTTGTCTCGTGATTTCGGGGTCAAATCTGGCGTTCTTTTCTTTTTTTTTTTTTTTTGCGATTCAATTCCGTTCAAGCTGCACTCTCGAGGT 
GCTTGACACACAAAGATTGTTGGGACCAACCGTTGCCAGAGCCTCAGTCACAGCCACAGAAAAGCGAAAAAAACACACGCAGGAGTGCGACCTGT 
TCGACGGTCGACGCGAATCTAAAGTTTACTGAGCACGGCGCTTTTATCTTATACGATAGCCCGTGGTTACACCGCTCACCCATACAGATGGAAAA 
GCCGAAAGCCGAGAAATGGAGAGCCAGTGAAACGTCGACGGCTTAGAGATTGAAACGCTGCTTTGCGTCGGCGCAGAGAATCGCAGATTCAGTAT 
GCTAGTTCTATCGTAGAGCATGCAATGTAGTTGAACAGCCAAGCAAAGGAGGTAAAAAAAACAATAAAAAAGGAAGAAAAAAACAGCACATGTGT 
GACCCACACGAAACGCACAGCGGGCGATTGCAATGAAATAATGGCAGGGCTTACCCGGCGTATGCGCAATTTTGATTCACGCTGCACACATACGC 
AGCGATGTCACTACGTGTTTTTGACCCTAATTGCGCTGATTAACCCATTTCTTTCTATTTCGTATGCCACTGGGCACTTGAGCCACTCGACCCTA 
GAGTTCACCCAGTCACTCAACCTGAAACACCCCCTACTTTTGGACCACCCCCCGTGGCGGAAGCGCCGAAGATAAGGGGCCACAAAAAGTGGCAA 
AAAACAAACCATAATAAACCAAGTGCAGACGGACAGACAAACGCAGCATGCGGGCTCACAGGAGGGGCTTTAGCGGGTTAACCCTTTGGGGAAAG 
CCTCATATCTAGGTCACCGATTAAAGTATTTATCAAATTTTGAATGTTTTATTTATACATTTACTTGTTGGTTCGAATTAACTTTGATTATTTGA 
ATTGCTAATTATAATTTATTATATTTACTACATGCCAGTCTTCAGAGGGTTAATTTGGCATGGAAAATGGCATGTAACTCGAAATATTACTGCGG 
CTAGGGCCCTGCCTTTGGCTAATTGCCACAAACCTCGTTTGGCTTTCATCCACACTTTTTTTGAGGATTAAAGATTTGATAGTTTTAAGGAGAAA 
TTGAAATGGGTGTGAAAGGGAATTGGAACATAAGCTATAAACGAAAATAATCTGCTTTATTTACTGAAATTCGATTATCATAAGCCGATTTTCGG 
AAAGTGTTAGGCTTTGCCTTCAATTTATAGAACAAGTTAACAAATTTAAAAGAGTATTTGTATATCGTATGCGTGGGTTTAAAAAAAAAAGCATC 
ACGTGTCCATCTCTAATTTAAAGAAATATAAAGTGACATACAAATTTTCGTATAACAGTCTGCCAAAAAGTCGCCGCTGTTATACTCAATTTTTT 
ACGGACACAGCGCCCTCTGACATTTACAAAGTGCAACTACTCGCTTTGCGCGCCAATTTCAAGTTGTTAATTCTGTGAAAACCCATAGAGTACCG 
AAATCCCGCACATAATGCGCATTTGGCGCCATTTGAATGATAAAAATTCCAGGAATTTCAGTTCTTTTTCGTCGTGTATTAATGTATATTTAAAA 
TTGATGGTCTTTGTTCAATAATATTGCCTTTTTTGATGCAAAAATAAGCTTTGCTTAAATAATAATCATTACATGGTAAAATATTTGTTTTCTTG 
TTTGCATTTGCGTTTCGGTGCCTTGTTGTTCGGTTACGGATTACATAAAAATAAACAAATAATAAGCGTAAATTGTACCTAGCAGAAGTTTACAA 
CTTTCAACTCTTAAACGATAGATCTTATAAATAAAGTTTTATGGATGAAATTAATGAGCGTGTTTGTTTTTGTGTGATATTTGTTTTCTTTTTTT 
TTTTTTGTTTTTAGCCATTTTTTGTTCTGCTTATCATCTACAAATGATTAAGGGGTCTGATTTCGGGATCGGCCTCTTAAATTTAAATGACATGG 
TTCCTCACGTTTTTAGATTATAATGCACATTTTTCGTTTGCTTGTTCGTTGGTTATCTTTAAAATTTGAACTAGAGTGCTTAAATAATTGAACTA 
A AGC C G AAAC C AT AAAAAC AAT AAT AACT AAAC AAAAGC GT T AAAT T C C AAAT GAAAT T GT C T GT T TT GC AT G AAT GC GAGT GT C G G GT G T GT GT 
CTGCGTCGTGTGCGAGAAACTTAGTCTACGGATTAACAAATGAAAATCAACAAAAATCAACTTAAAAGCAGCATGAAATAAATACCCCTATACGT 
CTATATATACAATACGATAACACCCAAACTTAGATAACGAACTGCACGATCATCCCACAACCTACAGCATCAGTCGCGAGCGCGACTTCTTTAGC 
CGCCCGGTGGCACCGCCCAGGCCATCGCCGTTCCAGGACTTGGACTTGGCGCGAAAGTGGCGATAGGCCTCGTTGTAGGCGGGATCCAGCATGGT 
GCGGTAGGCATGTGGTTCGCACAGCTCAACGTGCGACTCGAAGAACTCCTTGTAGCCGTTGTGCAGCAGATAGATCTCGGGATAGTGCAACGCCG 
GATAGGCGTTGGTATTCCTCTCGCGATCCAGATTACGCAGGAAGCGGGACATTTTCGGTCCACGCTCCGAGGAGAATTCGCAGTGGAAGATAATG 
ATGTTGCGCTTGTGTCCCGATTCGGCGTTCTGCTGCTGCTGCAGCTCAGTCTGTTGGACGGTGAGGAACTCATCGAGGATCTGCTCGGTGGTGTA 
CAGGTTCTTGGCTCCCTCGATGTGGCCGCCCTCGAATTCGTAGGGGTAGCGGCAGTCGATGATGCGGTAGCTGGCCACCTTATCGCTGAACTCGC 
CCTTAAGCAGGCGAGCCACTGTTTCGCTGGAGATGCTCTTCAGATCCCGATGACGACCCTCCATCAGGGGCAGGGCATAGGCTTTGCTGAAGTCG 
CCGATTAGCTCGGGCTCGTTGCGGTTCTCGGAGCGGGCCAGGGCGGACATGATCTCGGCGTCGTTCAGGGACATGCACTTCCTCAGTGGAGGGGG 
GTGGCTAATCGTGACCTGGCTGAGTGGGCTGGGTGCGGGGCAGTTCTCCTTCTCAACGGCGGCGCAGCGATGGCGTTTGCTCTGGATGGGCGAGC 
AGTTGGCCGATGCTGGCGGTTCCGGGCGCTTGAAGCAATCCCGGGCAGTCTCTGGGGTCTTTGGTGGTGGGGTGGTGGTGCTGTTGGTGTTGCTC 
TCCGTCATGCTGAGGCACCTTCTGACCGAAGGGCGGCGCATGGACAGACCCGCTGGCGATTTGGCAGCAGGCTGCTCCTTGATCTGGCCGCTGAT 
CAGCGAGTTTAGGCCACTGGGGAAGCCCAGGGCGGTCTGTTGGCTCTGCGACTCCATCTCGAAGAGCTCCATGTACTCATCGTCCATGGAGGACT 
CCATGGAGCATGTGGAGGACAGGCTGTTGAAGATGCGGAAGCTGCGCGCCGGCGTGTGATCACTTGATACTCCCATAGCTGGCAGAATTTTCGGC 
TGGCGGACGATCTGGAAGCGCTGGGGCGAGCCCTCCGGCGAGAGCAGACCCATCAGCTCCGGACTGGCCGATCGCTGCTGATCCTGGGGCACAAC 
GTCGTCGTCGTAGAACGACAGCTCCTCCTGGTCCATGCTCATCAGTTCCAGGGAACGGCGAGCACGACGAGATCCACTCATTTTGTTGATGCTAC 
TCGAACTGCTGGTGTTATTACTGCAAGAGCAAACAAAGAGAGGGGGAGGATTAGTCAACTGTTCGGCATCGAGGAAGAGAGCTGCTTTTGCTTCC 
CTAAACTCTCGCAAGAGCGGGTCATTAAACCCAGCTAAAAATACTGACAAACTGGTGGTGGTGGTGGCGTTGGTGGTGTAAAAACAACAAAAATA 
CCCGAGAAAATACAGAGAGGGGTTGTTGTAATTGGGTGAAGAGGGAGCTGAACTAGAAATTTACTTTAATGCAGCTTCGAAAACTGGAATTTTTG 
GGATTCGACTTGCGCACACGATATCATTGATACCATTTTTTTTTTTTTTGATAATTTTTACAATTCCCTAAAATTGCGAGGGAACCGTTGCGTAC 
CGCCTTATCTTCTTCTTCTTGATTCTACTTAAGCATTAGCGGTTCTCCAAATGGAATTCCCTTTTGCTTTTCGCATGCATCTGCTTTTCAGCTTG 
ACTTTTGCTTTTGGGTCACACGTCACGCAAGCGCAGCGATAATACAACAAAAACACGCCACAAAGAACCGCAAAGAACACGGAGGAAAAAAGGAA 
CAAAAATCGAAAAAAAAAAAATGACTGACTGACGGACGACGGACAAAAAGCATTTTTTGCTGCTGCTTTTTTTTGGAGCCTCGCGGAAGCCGGTT 
TAATGAATGAAAAACGGCAAGCGAGCAACAAAACAACAGCTGAGCGCCAAAATAAAAATCGGGCAAAAAGCAAACGTGCCGAGCGTGATAAAAAG 
AACTCTTACTTTTGTTGGCCCCATCGTTTACATTTCTGTTTTTTGTATTTTTTTTTTGCTTTGCTGTTGTTGCTGTTGCTCTTGAACTTGCACTT 
TTTATCGCGTATTTTCAACTGGTATTCCTGGTGCAAAAGGGGTTTGCTGTTGCTAATCCTTGGGCGGCATAATTACCTGATATTGCAATCCATGC 
TGCAGTTGTTTTCCTCCACAATAGTTTCCCACAGCATTTTGTTGGTTTTGTTGTGTTATGTGTTGCGTTGGTTGCTTCGAAGAGCCGAAGATGTT 
TGATTTCGCTGGCGAGTCTGAAGCCGAAGAAATTGCTAAAACACAATTTGTTCTTGGTCACTCTGCACTCATTTACAGTTTGTTTTTAGAATTTG 
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AGATGGTAGTCCTTGGTTTTTGGCTAGTATTGCATTTTAGTTAATTTAGGTTTTTTTGTTTTTTTTTTTTGTTAATGTGTCCGTTTGAGCATGAT 
TTTTTGTTTTGTATTTCGGAGTGTGGTTGTGCGAGGTTTTTCTTTTTCTTCAATAATTTGAACCGCACAATCTCAATTCACCGAACGAGGAACCG 
AAAGGCTGAACAAGAAGAGCTGTTGACTCGACGACGAACTGAGAACGAATGTGCTGAATATGCCGAGCAGCTGGTTTTTATAACCGCACCCCGGC 
CGATCACGATCAGCTGATGAGCTAATGGGCGCGAGCGAGATGGCGCGCGTTTTTTGTTCTCTCTGCTTCGCTCTTTAGCTCTTAGTTCTCAGCTC 
CTTTCCTTTGCCTTCCCATCCTATCCTTTCCTTCCTATCCTTCCCAGAAGTGCCAATAGAAGAGGAAGGGAGACGGGCAACGAGCGACACCAACA 
CACATAACGGGGACGGGGCCAGAGCCATGTGCGATGCGAAAAGGAAAACTGGGAAAATGGGAGAATCAGGATCATATGGACTCAGTTTTGGAAAT 
CTCTGCTCATGTGGAGCACAAGAGTGCCGGCAACAGGTAGAATCCTTTCCCCTGCACTTCCCCTATCGCAAAATCCCGCACCGAATACCAATGTC 
CCGAGAAACGAGGAGAATCTGGCGCCTTGGTGGTCGCCAGCTCTATGGAGGCCACTGATTTTGGCGCGCGAGCTTTTATACCGCGACGCGTACCG 
CCGAAGGGGATCCGAAGTCGCCACAATTGGAGCCGAAGTGGAGCCGAGGAGGCTCCGAACCGAACACATGTGAAAATAGTGTGGAGCCCCAAGAA 
GCGAAGAGGAAACGCTGAAGAACTGCGCGAATCTTTAAGCAGAATTTCAAGGCGGCATCGATTGTGTCGTTTGCAGGTAATAATGTGAGATCCAC 
GCATTCCTGCTGCCCCCTGTCCCCCCTTGTAACCCCTCCCTGAAAACTACATATCAGCAGGTAGTGGCCGCAGTCATATGATCGCCGCACCGGCG 
GGATGGGCGGTCGTGGAGGGGGCGTGGCTCCTTTGTGTTCTGCAGACGTAGACGCCAATGTCGTTAAATGAGCAAAGCTTCCAGCCACAGAGATC 
CATTCACAGGTAGCACTCTACACGATGGACCGTGTGTCTTTAATGAGAATCCGTACAAACACAGATCTTAAGCTAGATAAACTACTACTAACAGT 
AGCAAACACGTGGAAAATCATGATGTCAACCATGCGATATGACCAATAATCGATAATGTGAAGCTCAGACTAGTTGCCCACGCCGAAGCTCCACT 
GTAGCCCCTCGTTTGTGTGCAATCGCGGCAAAGTTAAAGCAAAAATCGAGTTACATATGCGCCGATGCCCTGCGGGAGTCATTCTGGAGGATCCG 
AGGGGTTCACTAAGGAGGGTTGGGGGTTTGCGAACACCCATGGGCTGTGCTGTGCAGTTCGAGTGTGTTTTTGTAAGGTTCGCGGCAAACAAATG 
AAGGACAGGCCAGTGCAACCAGGCCATATCATTCACACCGCTTGAAGAGCAGGCGAAAGTAAATGATTTCGGGACCAACCACAGCCGAAGCCACC 
AACCAAATGAGCTACGGTTTCATTCCGCTGAATTCCTTTGGCCAGCTTTTGTTGTTGCTGCAGGAAGA 
{ SEQ ID NO: 76) 

Exon: 6478.. 5587 
Exon: 4675.. 2532 
Exon: 1011.. 1001 

Start ATG: 5642 (Reverse strand: CAT) 
Transcript Wo. : CT3224 

ATTCTCCTCGTTTCTCGGGACATTGGTATTCGGTGCGGGATTTTGCGATAGGGGAAGTGCAGGGGAAAGGATTCTACCTGTTGCCGGCACTCTTG 

TGCTCCACATGAGCAGAGATTTCCAAAACTGAGTCCATATGATCCTGATTCTCCCATTTTCCCAGTTTTCCTTTTCGCATCGCACATGGCTCTGG 

CCCCGTCCCCGTTATGTGTGTTGGTGTCGCTCGTTGCCCGTCTCCCTTCCTCTTCTATTGGCACTTCTGGGAAGGATAGGAAGGAAAGGATAGGA 

TGGGAAGGCAAAGGAAAGGAGCTGAGAACTAAGAGCTAAAGAGCGAAGCAGAGAGAACAAAAAACGCGCGCCATCTCGCTCGCGCCCATTAGCTC 

ATCAGCTGATCGTGATCGGCCGGGGTGCGGTTATAAAAACCAGCTGCTCGGCATATTCAGCACATTCGTTCTCAGTTCGTCGTCGAGTCAACAGC 

TCTTCTTGTTCAGCCTTTCGGTTCCTCGTTCGGTGAATTGAGATTGTGCGGTTCAAATTATTGAAGAAAAAGAAAAACCTCGCACAACCACACTC 

CGAAATACAAAACAAAAAATCATGCTCAAACGGACACATTAACAAAAAAAAAAAACAAAAAAACCTAAATTAACTAAAATGCAAT 

AACCAAGGACTACCATCTCAAATTCTAAAAACAAACTGTAAATGAGTGCAGAGTGACCAAGAACAAATTGTGTTTTAGCAATTTCTTCGGCTTCA 

GACTCGCCAGCGAAATCAAACATCTTCGGCTCTTCGAAGCAACCAACGCAACACATAACACAACAAAACCAACAAAATGCTGTGGGAAACTATTG 

TGGAGGAAAACAACTGCAGCATGGATTGCAATATCAGTAATAACACCAGCAGTTCGAGTAGCATCAACAAAATGAGTGGATCTCGTCGTGCTCGC 

CGTTCCCT GG AAC T G AT G AGC AT GG AC C AGGAGG AGC T GT C GTT CT AC G AC G AC GACGT T GT GC C C C AGG AT C AGC AGC G ATC GGCC AGT C C G GA 

GCTGATGGGTCTGCTCTCGCCGGAGGGCTCGCCCCAGCGCTTCCAGATCGTCCGCCAGCCGAAAATTCTGCCAGCTATGGGAGTATCAAGTGATC 

ACACGCCGGCGCGCAGCTTCCGCATCTTCAACAGCCTGTCCTCCACATGCTCCATGGAGTCCTCCATGGACGATGAGTACATGGAGCTCTTCGAG 

ATGGAGTCGCAGAGCCAACAGACCGCCCTGGGCTTCCCCAGTGGCCTAAACTCGCTGATCAGCGGCCAGATCAAGGAGCAGCCTGCTGCCAAATC 

GCCAGCGGGTCTGTCCATGCGCCGCCCTTCGGTCAGAAGGTGCCTCAGCATGACGGAGAGCAACACCAACAGCACCACCACCCCACCACCAAAGA 

CCCCAGAGACTGCCCGGGATTGCTTCAAGCGCCCGGAACCGCCAGCATCGGCCAACTGCTCGCCCATCCAGAGCAAACGCCATCGCTGCGCCGCC 

GTTGAGAAGGAGAACTGCCCCGCACCCAGCCCACTCAGCCAGGTCACGATTAGCCACCCCCCTCCACTGAGGAAGTGCATGTCCCTGAACGACGC 

CGAGATCATGTCCGCCCTGGCCCGCTCCGAGAACCGCAACGAGCCCGAGCTAATCGGCGACTTCAGCAAAGCCTATGCCCTGCCCCTGATGGAGG 

GTCGTCATCGGGATCTGAAGAGCATCTCCAGCGAAACAGTGGCTCGCCTGCTTAAGGGCGAGTTCAGCGATAAGGTGGCCAGCTACCGCATCATC 

GACTGCCGCTACCCCTACGAATTCGAGGGCGGCCACATCGAGGGAGCCAAGAACCTGTACACCACCGAGCAGATCCTCGATGAGTTCCTCACCGT 

CCAACAGACTGAGCTGCAGCAGCAGCAGAACGCCGAATCGGGACACAAGCGCAACATCATTATCTTCCACTGCGAATTCTCCTCGGAGCGTGGAC 

CGAAAATGTCCCGCTTCCTGCGTAATCTGGATCGCGAGAGGAATACCAACGCCTATCCGGCGTTGCACTATCCCGAGATCTATCTGCTGCACAAC 

GGCTACAAGGAGTTCTTCGAGTCGCACGTTGAGCTGTGCGAACCACATGCCTACCGCACCATGCTGGATCCCGCCTACAACGAGGCCTATCGCCA 

CTTTCGCGCCAAGTCCAAGTCCTGGAACGGCGATGGCCTGGGCGGTGCCACCGGGCGGCTAAAGAAGTCGCGCTCGCGACTGATGCTGTAGGTTG 

TGGGATGATCGTGCAGTTCGTTATCTAAGTTTGGGTGTTATCGTATTGTATATATAGACGTATAGGGGTATTTATTTCATGCTGCTTTTAAGTTG 

ATTTTTGTTGATTTTCATTTGTTAATCCGTAGACTAAGTTTCTCGCACACGACGCAGACACACACCCGACACTCGCATTCATGCAAAACAGACAA 

TTTCATTTGGAATTTAACGCTTTTGTTTAGTTATTATTGTTTTTATGGTTTCGGCTTTAGTTCAATTATTTAAGCACTCTAGTTCAAATTTTAAA 

GAT AAC C AAC G A AC AAGC AAAC G AA AAAT GT GC AT T AT AAT C T AAAAAC GT G AGG AAC C AT GT CAT T T AAAT T TAAGAGGCC GAT C C CG AAAT C A 

GACCCCTTAATCATTTGTAGATGATAAGCAGAACAAAAAATGGCTAAAAACAAAAAAAAAAAAAGAAAACAAATATCACACAAAAACAAACACGC 

TCATTAATTTCATCCATAAAACTTTATTTATAAGATCTATCGTTTAAGAGTTGAAAGTTGTAAACTTCTGCTAGGTACAATTTACGCTTATTATT 

TGTTTATTTTTATGTAATCCGTAACCGAACAACAAGGCACCGAAACGCAAATGCAAACAAGAAAACAAATATTTTACCATGTAATGATTATTATT 

T AAGC AAAGC T TATTTTTG CAT CAA AAA AGG C AAT AT TAT T G AA C AAAG AC C AT C AAT T T T AAAT AT AC AT T AAT AC AC G AC G AAAAAG AAC AAA 

AAAAAAA 

(SEQ ID NO: 77) 

Start ATG: 837 (Reverse strand: CAT) 

MLWETIVEENNCSMDCNISNNTSSSSSINKMSGSRRARRSLELMSMDQEELSFYDDDVVPQDQQRSASPELMGLLSPEGSPQRFQIVRQPKILPA 
MGVSSDHTPARSFRI FNSLSSTCSMESSMDDEYMELFEMESQSQQTALGFPSGLNSLISGQIKEQPAAKSPAGLSMRRPSVRRCLSMTESNTNST 
TTPPPKTPETARDCFKRPEPPASANCSPIQSKRHRCAAVEKENCPAPSPLSQVTISHPPPLRKCMSLNDAEIMSALARSENRNEPELIGDFSKAY 
ALPLMEGRHRDLKSISSETVARLLKGEFSDKVASYRIIDCRYPYEFEGGHIEGAKNLYTTEQILDEFLTVQQTELQQQQNAESGHKRNIIIFHCE 
FSSERGPKMSRFLRNLDRERNTNAYPALHYPEIYLLHNGYKEFFESHVELCEPHAYRTMLDPAYNEAYRHFRAKSKSWNGDGLGGATGRLKKSRS 
RLML* 

{SEQ ID NO: 78) 
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Name: string 

Classification : protein_phosphatase 
Gene Symbol: stg 
FlyBase ID: FBgn0003525 



Celera Sequence No. : 142000013384859 
GCGACCCACGAACAACAACAAAATGTATGG 

TACATTGTTTTCCTCAGTGCGTAGTGTGCATTACTAATAATCTATACAAATAGTTTTAATTCATTTTACTGAAACTAAACGAATTTGTATGATGC 
TATCGAACACGATTTTTTGAAAAGTTATTAATAATCCAAGTTCCTTAGAGATTACTCAGATTACTCGGTTCTGTTCAATGTCCTCTAGTTAACTT 
GAACCTAATTTCCTCCACATTTGAAACTAAACTAAACATGCGCATGTTCAATCCCAAGTCTCCTGTAGTTTCAGAGATCAGGACGTTTATACGGA 
CGACAACGAATCTACCAGATCATTTATTTATAAAAATATAAAAGAATTAATATCGATTGGCTGCGACAAACACTCCCCACTCTCACTGCGCTACC 
TCTTGGTTAGCTTTGTGCTAGTTTTAGTCTAGTTTCTAATGTTAATGAACTTTGAAAAAGGGGCGTTCCACAACCGCGCCCAATTCCGCGAGTAC 
ATTATCATAAGACGGAATAATTATTCTATTTGCGACCAAGAGCCAAACAATGATAAGACGTATCGCACAAGCCGCGGAAACTCGGGAGGAAAATG 
GGGGCGGTTGAAGGGTACATAATGATAAGCGACAGCCGCACCCTCAAGTATTTTTTAAACGATACTAAATATGTAGGCTTTTGTCAAATATTTCC 
ATTTAAATTAGGGAAAAGAACACCACAATGTTGTTTGTTGTTTAATAAAGTATGGATTGGAGAGACCCAAAAATCCACTTTAAGGTTATCAGTTG 
ACTGAATAGTACAAGCTATTCCTAGCATTACTGTCCAACTACTATTAATTGAAATATTAGGTGCCCATTTTTGGAATGGCAACAATCACTCACCT 
AAATTGTCCACAAATAGCTTTGAGAAGCGATGTTCTGGTCGGGATTGATCCTACCGCTTGAGCACATTGATGAAGGTCTCGTGAGGAACGCGGAT 
GTTGGCGAACATGCGCATCTTCTTCTTGCCCTCGGCCTGCTGCTTGAGCAGCTTCATCCGCCGGGTGACATCCCCACCATAAAGCTTGGCTGTCA 
CATCCTTGCGATAGGCCTTGATGGTCTCGCGCGCTAGCACCTTGCTGCCCACGCATGCCTGGATTGCGATCTGTACCATCTGCTTTGGAATAAGC 
TCCCGGAGTTTGAGAACCATCTGCCGTGCCACGCCCGTGGCCTTGGACACATGGACAATGCGGCAGAGCTCCTCCACGGGCTTGCCGTTCAGATG 
GATGTCCAATCGAACCAAGTGCGATGGATGATAGCCGTGGTCCTCGTAGCTGAAACTGGCGTAGCCAGAACTTAACGACTTGAGGCGGTCGTGGA 
AGTCCAGGATGATCTCGCTTAACGGCAGTACGTACTTCATTAGCACACGGGTGTCGTCGATGTTCACCGAGCTTTGCTGCAATCCCCGACGCTCC 
ACACACAGGCTGATCACCTGGCCAACATACTCCGTGGGCGTGATGATTGTGCCCAAAACCAAGGGCTCATAGTACTCCTTGATGGAATGCGGTTC 
CGGGAAGAGGGCTGCATTGGAAATGTCCATTGTATCGCGACCCTGTTGCTTGATCATCTTTGGATTACTCAAGACCAAACGATATGTGACGGAAG 
GCGCGGTGATTATGGGCTCAGCGCCATGCTCCTGCTCCAGGCGCTGACAGAAGACCTCCATGTGCAGAAGTCCCAGGAAGCCCAAACGCCAACCT 
TGTCCCAAAGCTGGGCTGGAATCAATTTTTACAGTTACCGCTGAGTCATTGAGTACCATTTTGTCGATGGCACTGCGCAGGGCGACATGTTTTGA 
CTGGTCGGCGGGAAAAACTCCAGCAAAAACAAGCGGCTGTTGCGGTCGATAGCTTCCTGCGGCGGCGACTGCTTGGTTCTTCAGGTGAATGGTAT 
CGCCCACAATGGATTCCTTGCTGTTGCGCATATTGCAAGCAATCAAGCCCACTTGACCAGCGGATACATCCGGAACAGGACACTCGGCTGGTCTC 
AACACAGAGATGCTCTTGACGGAATACACCTTCTTGGTGGCTAGCGACTGAATATCCTGATTCTGCTCTAGCTTGCCATTCAGCACATAAATTAG 
ATTAAGGGCTCCGCGATACTTGTCGAACCAGCTGTCGAAGATTAGGGCCCGAAAGTCGCTGTCCCGTTGAACTTGTGGTGGCGGCACAGTTTCTA 
TGACCCTTTCTAAGACTTCAGATACGCCGGTGCCCAGCTTGGCGGACACGCGCAGCACCTCGTCCGGATCTATGCCAAACAGCAGCTTTAGATCC 
TGACAAACCTGATCAGGGTTTGCATGCTTGATGTCTATCTTGTTGAGCACGGGAACCACTGCCAACTGACGCTGCTTGGCTAAGTGATAGTTGGC 
CACGGTTTGAGCCTGGACACCATGGCATGCGTCCACCAGGAGTACCACGCCGTCGCAGGCAGCCAGAGATCGCGAAACCTGGGTAATGGGCAAAT 
GATTAGCAGAAGAATTAGCAGACTTTTGACATGAAACACACTTCATTGGAAAAATCCACGTGACCAGGAGTGTCAATGAGGTTCAGCAGATAGAG 
CTGTCCCTTGTGACGGTGGAAGATCGATGCGGTCTGGGCCTTGACCGTGATCCCACGCTCCCGCTCCACCTGCAGGTTGTCCAGTACCTGGTGCT 
GCCCGCCATTACGCGCAATGGCTCCCGTGAGCTCCAGCAACCGGTCAGCCAGCGTGCTCTTGCCGTGATCGACATGCGCAATGATGCTGAAGTTC 
CGAATGCGCTCCACCGGCATGTGGGCAAACTCGCGCAACAGATCCGCCTGCGAGGGCTCCTCCGTTTCGCCCTTGACCTGATTGGTGGTGCTCAG 
GTTGCGGACGAGTAAGCTCTTTGACCTGGCCACCAGCCAGGCGGAGTGTCGTTTGTCCCCCGGGCGCTGCAGCAGCCAACGGATTGAAATCGCAC 
GAATCATGGCTGAATTTACATTTCTTTGCCGGCAAAAAAAAATTGTTACCGCACACCATGTGCGCCGTGTGACCGTATGATGAAAAAATACCGAG 
TAGCCGACAATCATGTGTAAGACAGCAGGAAAATACTGTCCATCTTGTGACAAACAGCGCCATCTGTTGGCAATCTACGAAAACCGATGTTTACA 
CATGAACAAATATTTACAACTCAATAATCCTTTACTAATTTGTTCTCGGATGCAAATTAAACATATTCTTAAGCGAAAGATCGTAATGCTCCTAG 
ATCTCAGATTGCCTATAGGTTTATATTCATAAATAAACAAGTAAGAAAGTCCGAAATATTGAGCTATATATCATAAATATTCTGTTTTTTCAACT 
GCTTTCGTGAGGAAGTCATCGCTGCGAGATTCGATTTCTAGAAGCCGAGGTTCTTTTCTTGGAACAGCTTACAATGGTAAATGGTAATTGTAATA 
GACCAGCAAGGGAGCTGGAGGAACTGCTGGTTCCAGAAAAAGGGGGTTTGACCTCAGCACAGTCAGAGAAAAAGCCGTTCTCAACGATCTCGATT 
TAAGAACTTCGATCTCAAAATCTAGCCAAGATCGAATCATGCTCAGCATACTCAGGTCGAGATCGAACCATTCTCAGATTGCAACTGACTTTGCT 
CTTATTTTTCGATCTCTTTCTTGGTTTTTTAACCTTACCCACACATCATCTTCTACAATCTGAAACCCGATACTGCTTCCGTCTAAAATATGTAT 
CCAACATAGAGATCGTTTTCGTCTCGAAAAAATAAGTCTTCTTTTCATGAGAATGATTGATGGGATGGTAGGGATTATCCTGGCTGTGTCCCTGT 
ACCGATTCCACAAGACCTTGATGCTAGACGTGTTGAGGAGACCATACACACGGACAATACTCAAGTATGGGTTGAACTAGGGATACTTGAGGTTG 
TGAGGATCCTTAAGCTCTTTCGACGAACGCTTGACAAATATGTACAAGTAAAACTTTTGTTCTACTAACAATAATGCCAATGTGATTGGAAAAGC 

TGATAGTGAAATCTCGCAATGTTACTCTTGTGCTTGCTTGTGTTTTCGAATGCATACGTTGAAT 

(SEQ ID NO: 79) 

Exon: 3149. .2607 
Exon: 2548.. 1181 
Exon: 1129.. 1001 

Start ATG: 3149 (Reverse strand: CAT) 
Transcript No. : CT3280 

ATGATTGTCGGCTACTCGGTATTTTTTCATCATACGGTCACACGGCGCACATGGTGTGCGGTAACAATTTTTTTTTGCCGGCAAAGAAATGTAAA 
TTCAGCCATGATTCGTGCGATTTCAATCCGTTGGCTGCTGCAGCGCCCGGGGGACAAACGACACTCCGCCTGGCTGGTGGCCAGGTCAAAGAGCT 
TACTCGTCCGCAACCTGAGCACCACCAATCAGGTCAAGGGCGAAACGGAGGAGCCCTCGCAGGCGGATCTGTTGCGCGAGTTTGCCCACATGCCG 
GTGGAGCGCATTCGGAACTTCAGCATCATTGCGCATGTCGATCACGGCAAGAGCACGCTGGCTGACCGGTTGCTGGAGCTCACGGGAGCCATTGC 
GCGTAATGGCGGGCAGCACCAGGTACTGGACAACCTGCAGGTGGAGCGGGAGCGTGGGATCACGGTCAAGGCCCAGACCGCATCGATCTTCCACC 
GTCACAAGGGACAGCTCTATCTGCTGAACCTCATTGACACTCCTGGTCACGTGGATTTTTCCAATGAAGTTTCGCGATCTCTGGCTGCCTGCGAC 
GGCGTGGTACTCCTGGTGGACGCATGCCATGGTGTCCAGGCTCAAACCGTGGCCAACTATCACTTAGCCAAGCAGCGTCAGTTGGCAGTGGTTCC 
CGTGCTCAACAAGATAGACATCAAGCATGCAAACCCTGATCAGGTTTGTCAGGATCTAAAGCTGCTGTTTGGCATAGATCCGGACGAGGTGCTGC 
GCGTGTCCGCCAAGCTGGGCACCGGCGTATCTGAAGTCTTAGAAAGGGTCATAGAAACTGTGCCGCCACCACAAGTTCAACGGGACAGCGACTTT 
CGGGCCCTAATCTTCGACAGCTGGTTCGACAAGTATCGCGGAGCCCTTAATCTAATTTATGTGCTGAATGGCAAGCTAGAGCAGAATCAGGATAT 
TCAGTCGCTAGCCACCAAGAAGGTGTATTCCGTCAAGAGCATCTCTGTGTTGAGACCAGCCGAGTGTCCTGTTCCGGATGTATCCGCTGGTCAAG 
TGGGCTTGATTGCTTGCAATATGCGCAACAGCAAGGAATCCATTGTGGGCGATACCATTCACCTGAAGAACCAAGCAGTCGCCGCCGCAGGAAGC 
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TATCGACCGCAACAGCCGCTTGTTTTTGCTGGAGTTTTTCCCGCCGACCAGTCAAAACATGTCGCCCTGCGCAGTGCCATCGACAAAATGGTACT 
CAATGACTCAGCGGTAACTGTAAAAATTGATTCCAGCCCAGCTTTGGGACAAGGTTGGCGTTTGGGCTTCCTGGGACTTCTGCACATGGAGGTCT 
TCTGTCAGCGCCTGGAGCAGGAGCATGGCGCTGAGCCCATAATCACCGCGCCTTCCGTCACATATCGTTTGGTCTTGAGTAATCCAAAGATGATC 
AAGCAACAGGGTCGCGATACAATGGACATTTCCAATGCAGCCCTCTTCCCGGAACCGCATTCCATCAAGGAGTACTATGAGCCCTTGGTTTTGGG 
CACAATCATCACGCCCACGGAGTATGTTGGCCAGGTGATCAGCCTGTGTGTGGAGCGTCGGGGATTGCAGCAAAGCTCGGTGAACATCGACGACA 
CCCGTGTGCTAATGAAGTACGTACTGCCGTTAAGCGAGATCATCCTGGACTTCCACGACCGCCTCAAGTCGTTAAGTTCTGGCTACGCCAGTTTC 
AGCTACGAGGACCACGGCTATCATCCATCGCACTTGGTTCGATTGGACATCCATCTGAACGGCAAGCCCGTGGAGGAGCTCTGCCGCATTGTCCA 
TGTGTCCAAGGCCACGGGCGTGGCACGGCAGATGGTTCTCAAACTCCGGGAGCTTATTCCAAAGCAGATGGTACAGATCGCAATCCAGGCATGCG 
TGGGCAGCAAGCTTTATGGTGGGGATGTCACCCGGCGGATGAAGCTGCTCAAGCAGCAGGCCGAGGGCAAGAAGAAGATGCGCATGTTCGCCAAC 
ATCCGCGTTCCTCACGAGACCTTCATCAATGTGCTCAAGCGGTAG 
(SEQ ID NO: 80) 

Start ATG; 1 {Reverse strand: CAT) 

MIVGYSVFFHHTVTRRTWCAVTIFFCRQRWVNSAMIRAISIRWLLQRPGDKRHSAWLVARSKSLLVRNLSTTNQVKGETEEPSQADLLREFAHMP 

VERIRNFSIIAHVDHGKSTLADRLLELTGAIARNGGQHQVLDNLQVERERGITVKAQTASIFHRHKGQLYLLNLIDTPGHVDFSNEVSRSLAACD 

GVVLLVDACHGVQAQTVANYHLAKQRQLAVVPVLNKIDIKHANPDQVCQDLKLLFGIDPDEVLRVSAKLGTGVSEVLERVIETVPPPQVQRDSDF 

RALIFDSWFDKYRGALNLIYVLNGKLEQNQDIQSLATKKVYSVKSISVLRPAECPVPDVSAGQVGLIACNMRNSKESIVGDTIHLKNQAVAAAGS 

YRPQQPLVFAGVFPADQSKHVALRSAIDKMVLNDSAVTVKIDSSPALGQGWRLGFLGLLHMEVFCQRLEQEHGAEPIITAPSVTYRLVLSNPKMI 

KQQGRDTMDISNAALFPEPHSIKEYYEPLVLGTIITPTEYVGQVISLCVERRGLQQSSVNIDDTRVLMKYVLPLSEIILDFHDRLKSLSSGYASF 

SYEDHGYHPSHLVRLDIHLNGKPVEELCRIVHVSKATGVARQMVLKL^ 

I RVPHET FINVLKR* 

(SEQ ID NO: 81) 

Name: EF-Tu-like 

Classification: translation_f actor 
Gene Symbol : waw 
FlyBase ID: FBgnO024182 



Celera Sequence No. : 142000013384340 

TTAATATTATAATCGCATTCAAACGCATTTCGTTTCGTTTGTTATGCGCCCACACGCAGCACTTATGTATAGCATTTATTATGTAGGACTTATAG 
CGAAAACAAAAAAAATTTCCTATGAATATACAAGAAATTAATCAGTTGTTCTTCAGATAATTGATAACTATCCTAGTGGGCTTTTCTCTGGCCCC 
ACCTCCTCTCCACGGATTTTTACTTGAGAATTGTAGTTTGCTTGGCCAGAACTGAATTGGAGTTCAGCTTCAAAAAAAAATTTAAAATAAGATAG 
GAAGCTGCGAGATCCGATCCGCCTGGGCACCTTGCTCGTATCGTCCTCTTCCTGTCCTGGTGATTAGTTCTACATGTGTGTTCTCTTACTTTCAC 
CACAATTGATTGCAATTGCGATGATTGATGGTGACTCATCCAGGCTATCCACAGATTTCGGTTATATACGTAAAACGCTGTCGCATACGTATTTT 
CTTTAAGTCTAAACTTATGTTTACCGTGTGGTAAGCTCCCTTGCTTACCATCTAGATGTTATTGAGGTTTCTGGGATACTTCGGGATCGAAATCA 
TCTATTCGCTATCTCTGCTTGTTTGTTTCTGGTGTGCTCGCTTAACTTACAAAGTAGTTTAAACGTAAATCTGCAAGAAAATGTGCGCAGTTTTG 
AAAACTATTCAACGATTCAACAACGACTCACTTACCTTATATTTTAAATATTTGGCTCTTGATTTTGTTGTTGCTGCGGTGTTTTGTTTGTTGTT 
TGTTGTTGTTGCTTCGCTGTGGGTGCTTTTCGCTTCGGTGTCCAGGCTTTCTCGAGTTTAGTTTAAATGCGGCAAGCACACGAAAATAAATGTGA 
TGAAAAATGACTTGAACGAATAACCAACAAATTCGAAAAGAAGCTCGTGGCTATTTGTGAGTGTGGCGTGTTTAGTGTTGTAGTGTTTCATCATC 
GATTTGAACCATTTTAGTTGAAATTTTTTTTTGGCACGGGCAGCGCTAGCTTAGAAGGGCATATCCAATTCGTTGTTCTCCTCCTCCTCGATCAA 
GTCATCAACATCGAATTCGCTGGGCGAGAAGCTGGTCTGTTGCTGCTGCGGTTGCAGCTGTTGCTGCTGCTGCTGCATTTGGATACGTTGCGGCT 
GCTGCACGATTATGCTGTTGCGTGTGCATGTTGTCACCTTGGCGCCTAACATACGTGCATTGGCCAACGCGGCCGTCTTGAGAGGATTACGCAGA 
TGTGTCGCCTGGGAGGCTAACGAAGATGTGGCGGCTGATGCTGTGGACGATGCGGCGTTAATTGTGCCCGTGGTACCGGGATCATTGTAACGTGG 
ACGCTTATTCTGCAGCTTGGTTGCCCAGCATTCGGTGGCCCGGCGACGGGCATTGGCACAGATATTATCCCCGGATGAGGACGATGAGGAGGAGG 
TGGAGGGAGCTGCTGCCGCCGATGAGGTGGAGGCCGGATTGGCCTTCTCCCCATTAAACATTATATGGCGGCGCTCTTCCTGGGTTTGCTTCATC 
ATCTGGTGATAGTTGACCATCTGTTTCAGACGAGGCATACTATCAACTGTGCTGGACAAGATGGTCGCGGGTGCTGATGTTTGGTCATTCGACGA 
TGATTTAGAGTGCGCCGCTGTTGTATTAACAAAGATATTATCAAGGGAAGCAGTCTTGTTATCGGCCAAGACATTTTTCTCCAGTTCGTGGGGAT 
CACCGCCGGCTTGCAGACGCTGCTTTTGACGCAGATGCTGCAGCTCCTCCGCCGAACGTTGCTTGCGGAAATCAAAGACCAGCGGGTATATGTGC 
TGGATAGCCGATTCAACGTGGTTTACGCTAGCCGCTAAATAAATATGACGAAAATTAGAATTAGAATTAGTTTTAGTACAGTTTGAGGATAACAA 
CAGTACTCACCGGTGACTGTTACGCTGCCGGTGGAGAAGATCTTCAGGGTGGCCTTTGGATCGGGATCGCGCATCTTGTAAGTCACTCCAGGATG 
CAGCTCGGGCTCGTAGCTGGCGTTCTCGCGATGGCGCTCCGAAAAGTTGACGATCTTGATGGCCCACGGCATGCTGCAGGTGCCCAGTACGTTGA 
CAATGCGAAAGTTGAGGAAACGAGTGGGAAATCCCAGCTTGCCAAGGCATCTGGCATAGCGGCGTGCAGCGACCTTGGCCTAAAAAGATCATCAA 
CTATTGAGATAACAAAAATGTATAATTTAATGCTAAATCATACCATTGACTCGGAAGTGGCACCAGTGCAGGTAATCCTCCCCGACGACCAAATC 
GAGGCAGTGGTGTATGGATGGCGGAGCTTCATTGTCACCATTCCGTTCTCGCGACGGTACTCCACATTTGATCCCTGCAGCGCAATTTCGCGCAG 
CTTCAGGTGACAGCCGACGCTAAACGAGCAGACCACATTGTTTATCACAATGTCCAGCTCGTGCTCATTATCAGCAATTGGCTGGTGAAGGTCAA 
TGGGTTTGGCATTATTGCTGCTCTCTTCTTCGTCTTCAGCGACTTCCTTTTTCTCCTCGTCATCGTCATCAATCACAGTTATCGACTGCATGACC 
TCCTCCTCCAATAGATCCATTTTGGCTGTGATCAGACCATTGCCATTGCTGGCGACGGTCGGCAGATTAGCCTTATTTTCGCTGCTAGTAAGGAA 
CATATTTTTACTGCCGGTTGCCAGAACAATGGTATCCCCGTTGGCAAGTTTCCTTTGCAAACTGGGTCCGCCGGCCGTTTGCATTTTGCTGAATT 
GCGAAAAATAGCGAACGGTACCGGCTGTTGAGGCTGTGCCAGATGAACCACCGACGACTTGTCCGCTCATTCTTCCGGTGGACGCCACTCCCGCC 
ATTTGGCGATTGGTCATGAATGTGGAGGAGGGCGTTAGGTATACACGTGGGGCATTGGCCAGCACCGCCGAGGATACGACGCCGCCGGAGGTCAC 
TACACCAACTCCGGAACCGCTGCTGGCCGCTTTAAGGCCGCCGTTCAAATTGGCCACCGGTATGCTGACCATATCGTTTTGCATCAGCTTGTGCT 
TTCGATTCCCCTTCAACGAGCGCTGACTGGCCTTCTCCAGCTGCTTAATCTTGATCAGATAATGCTCGCGCTGCAAACGCAACTGACGTTTACAG 
TCCTCCCTATATTCCTTCAAGAATCGTTCGTATTCCAAGCGCTGTTTCTCCTTTTTCTGGCGCAATTTACGTTCGCGTTTTACGCGATTTTGCTG 
ACGACGATGCTGCTCCTTCTGTTGGCGCAATTTTTCTAGGACAAACTGTTTCTGTTGCTCAAAGAGGTGGCGCTTTCGCTCTCGCTGCTCCTCCA 
GTTCTTTCTTCTGTTGCAACTGTCGTAGCTGTAGGAAGACTTTATCCTGTTGCAACTGTCGCTCCTGTGGCAAGTGTTTCTCCTGTTGCAATTGT 
TTCTCCTGTTGCAAGAGTTTCTCCTGTTGCAAGAGTTTCTCCTGTTGCAAGAGTTTCTGCTGTTGCAAGAGTTTCTCCTGTTGCAAGAGTTTCTG 
CTGTTGCAAGAGTTTCTGCTGTTGCAACCGTTTCTCCTGTTGCAACCGTTTCTCCTGTTGCAACCGTTTCTCCTGTTGCAACCGTTTCTCCTGTT 
GCAACCGTTTCTCCTGTTGCAAGAGTTTCTCCTGTTGCAAAAGTTTCTCCTGTTGCAACCGTTTCTCCTGTTGCTCTTGTGGAAAGAATTTCTGC 
AAAAGTAGTTTCAGATACTCATATAGCAACAGTTGCTCCTTTGAAAAGAATTTCTCCTGGTGCCAGAATACCACCAGTTGCAACTTTTGATATTT 
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TTTTAAATGTTTCTGTTGCTGCTTCAGTTGCAGGTCCTGTTCTGGTTGGCCTTCCTCATCAATACTGCTGTATACGTCTTCCAGTTTGTAAGCTC 
CAAATACCTTATAGAGCTGTATGCTAAGGTCCAAACTAGATTTATCATCCCCTAAAGGATTTGTTACGAATTTCAAGAACTTCCTTTCCTTTGAG 
AAGATAAATGGTTTATAATCCTCCTCAGAATCACAAATGTGATCGTCGTAATAAGAAGCTGGATCCTGATCGTCGTAATAAGGAGCTGGATCCTG 
ATCGTTGTAATAAGGATGTGGATCTTGATCGGTTCCGAATTCAGGAGCTGGATCTTGATCGACTCCGCCTTCAGGAGCTAGATCTTGATCGGATT 
CGCCTTCAGGAGCTGGATCTTGATCGACTTCGCCTTCAGGAGCTGGATCTTGATCGACTCCGCCTTCAGGAGCTGGATCTTGATCGACTCCGCCT 
TCAGGAGCTGGATCTTGATCGGCTTCGCCTTCAGGAGCTGGATCTTGATCGGTTTCGCCTTCAGGAGCTGGATCCTGGGCAGATTCCTCAGAACT 
TGAAGATGATTCGAAATCGGAATGGGAATCGGAATCGGAATCTAAATCTAAATTTGAATCTGAATCTGAATTTGAATCTGTATCGAAACTTGAAC 
TATCTGAGTCTTCTTCGTCAGAGCATTCGCTTTCCGTATCTTCACGTTTTTCCTCCTCAGATTTGTCTTCCTCACTCTGCAAGTTTGGTTTAAAT 
TTCTTAGAGTCTGTACAATCAAAAGGCCATTTACGCTTTAAGGATCTGCGACTTCCCAGAGACGATAAATGTGAACGTGGGTCTCCTTTGTCACC 
GTCCAATAATGGTTTCTTCGAATACGAAGAATTCCCAGTGCCTAACGATTTTCGCTTATTCTGTTGCTTGTCTGGCGAATGATCGCGTCTATCAG 
AAGAAGTATGTTCAGCCGGCGACTGTTGTTCGACGTCGTCGCTTTCTTCCTTCTGTTCGTCTGTAATTTCCTCTGACGACTCCTGTTGGCTTTGG 
TCGCTTTTTTCCTCGCGTTCGGTAGAAGTTTCTCGCCGGCGCGATAGCCACCCAGTAACTGGATCGATTAAGTAGTCCGAGTCCCCAGAAGACGA 
TTCCTCATCTGAAGCTTGTCGCGACTCGCTTTCAGAGTCTTGTCTCTCCGGCGAAGATGACTCTAAAGATGAATCCGATTTTTGATTTGGCCTAT 
TGATTGGCTGATTGAATGGCTGATTGATTGGGTTATGGCGACGATCGTCGCTATCAACTTCTGATTCTCCTAAACGAGGGAGATGTACCTGACAA 
GCGTAGGATCGTAAACGGGTATATATGTTTCTTTTTCTAGCGGCCTCTTCCTCATCAGATTCAGGATATTGGTACTCTCGATTCAAGGCCTGATA 
CTCGTGATTTGAGTCCTGGTATTCGTGATTCCAAGCCTGGTACTCGTGATTTGAGTCTTGGTTGACGTGATTCAAATCCTGGTACTCGTCACTTG 
AGTCCTGATCGCCGTGATTCAACTCCTGGTACTCGTCATTTGAGTCCTGGTTGCTGTGATTCAACTCCTGGTACTCGTCATTTGAGTCCTGGTTG 
CCGTGATTTGATTCCTGGTACTCGTCATTTGAGTCCTGGTTGCCGTGATTCAACTCCTGGTACTCGTCATTTGAGTCCTGATCGCCGTGATTCAA 
CTCCTGGTACTCGGTATTCAAGTGCTGGTACTCTTGGTTTGAAACCTGGTACTCGTTATTAGACTCCTGGTACTCGCGATTAACTTGATTCAAGT 
CCTGGTTACCGATATTCAAGTCCTGGTACCCGGGATTCAAGTCAGATTGTGCATATTCCCTTGTTCTTCCTCTTGATGAAGTTGAATATTCTTGG 
TGCAGCTGTTCGTCCTCGCTATTAGAAGAACTTAAGCGATCTGCGTATGATGAAAAGAATTTGTTTAAAAAACATTTTTATAATTCTCACATTAT 
TTTACCACTAGAATTGCAGGCATCCCCGTACGACTCGTTACCGCTGCTGTTGGCTTCGTTCTTCTCAACTTCTTCGTTCGAAAACTGGTCAGCTT 
CGTTGCGACTTGAAGTATCTTTACCAACGTCAGAGCCTTGCTCACGATTTCCGATGGTGCTTCCGCTTGATGAAGTTGAATATTCTTGGTTAAGC 
TGTTCGTCCTCGCTATGAGAGGAACTTAAACGATCTGCGTATGTTGAAAAGAATTTGTGTAAAAAACATTTATATAAATTAACAATTCATTTTAC 
CTCTAGAATTGCGAGCATCCCCGCCCGACTCGTTACAGCTGCTGTTGGCTTCGTTCTTCTCAACTTCTTCGCTCGAAAACTGGTCAGCTTCGTTG 
GGATTTAGAGTATCTTTACCAATGTCAGGTCCTTGCTCACAATCACCGATGGCTCTTGGATTGGGGGATTGATTCGGCGATTGGTTCTCCTTATC 
CGAATCTTCGTCCGGATGTTCAACCGGATGTTCCGGAAACAAATCCTCGTTCCTTAAGTCTCTAGCTACTTGGTATTCTTCAAGTATTCGAGTCG 
GGGAACCATCCCCCTTTTTGACGCTACAGTTGTTTTAGGGAGACGAAGTTCAATTATATTTCATTGGGTATTTATGCTTACTGTTTGTGTGACTG 
TGGCGGTGTGGATATGCGGCTCTTTATATCTACTGTATATCGTGTAATAGTTCTTTTTTTTTCTCTGTATGCCTACTTACGCACTGCCCATCGTT 
AAAAAAATATTTCTTGTTATCTTTTTGGGTTTCTTGATAGGACTCTTGCTCTGCGTTTTCTTGCTAGCTTCTTAATTATATTTAATTGTATATAA 
GGCTAAAGGTGATTCTTCCTTGGTGTTGTATTACATTCTTTCCATTGTTTCGTTCATGTGGATGTGTATGTATGTGTTTGTATGTGAGTATGTTT 
GCACTTAAGGCCAAAAAATACTTTTTTATGTGTGATTTTGTATTAGAATTAATGTTTTCTTAGTGTTCTAGTTGTTGGGTTTGCTGGTTCTCTAA 
TTCTATCTACTATGTTTATGATTATGTCTAGTCTTCCGTTCTTTTCTCCTGGTTAAATTTGCTTGTAGTTTTTAGTTGTTCCGTTTAGCAGTCTG 
CAAGTAGAAGAGAATATAATAAAAGTTAATAATCGTAAAGTTGACATTTGGGAAAATGAAAACATGCAAATTGCCTTGATAAATACAAATGCATT 
TATTAATTGGTTTGATAGCGCACCTAATAAACACGATGACACTATTTAATCTAATTTCCACTTTTATATATCAACCCGACCCCCGACAACCGGCA 
CAAGTGAAGGGTCTAGGTCTAACCCTCTGGGAGTTTTCGTGGAATATTAGTAGTTCCTGTCATGTTAATAGAGATGGAACTCTGAGCCGGCCCTT 
ATAGAGTGGCATGATTCACGAAGACTCACGAAGACTCACGCTTTGCCTACCCAAAAAAAAAAAAATAATACGTCTTATACATTTTTAAAATGTAA 
T T T T T AAAAC AAAAAAAAT G AG AAG AGAT CAT G AC C T AC T T AT AG T AGT T T T AC AAAC T AAAAT AT T GT AT T TT AC AGAC G AC T T C AC T GT T AAC 
GGGAGCTGCGTTTAACAGAGAGCGCTGTTCGTTAACAGTCCTTTTTATTAGGATCGCCTAAGTGCGCTGACCTTAACATAGATTTTCGCATGCCT 
GCTAAGAGTATGCACGAAGAAGGGGCGGAGGAAAACGAAAAGGCGA 
(SEQ ID NO: 82) 

Exon: 6551.. 6541 
Exon: 6388 . . 6081 
Exon: 6019.. 5801 
Exon: 5740.. 2229 
Exon: 2169.. 1911 
Exon: 1839.. 1001 

Start ATG: 6551 {Reverse strand: CAT) 
Transcript No. : CT3310 

ATGGGCAGTGCCGTCAAAAAGGGGGATGGTTCCCCGACTCGAATACTTGAAGAATACCAAGTAGCTAGAGACTTAAGGAACGAGGATTTGTTTCC 
GGAACATCCGGTTGAACATCCGGACGAAGATTCGGATAAGGAGAACCAATCGCCGAATCAATCCCCCAATCCAAGAGCCATCGGTGATTGTGAGC 
AAGGACCTGACATTGGTAAAGATACTCTAAATCCCAACGAAGCTGACCAGTTTTCGAGCGAAGAAGTTGAGAAGAACGAAGCCAACAGCAGCTGT 
AACGAGTCGGGCGGGGATGCTCGCAATTCTAGAGATCGTTTAAGTTCCTCTCATAGCGAGGACGAACAGCTTAACCAAGAATATTCAACTTCATC 
AAGCGGAAGCACCATCGGAAATCGTGAGCAAGGCTCTGACGTTGGTAAAGATACTTCAAGTCGCAACGAAGCTGACCAGTTTTCGAACGAAGAAG 
TTGAGAAGAACGAAGCCAACAGCAGCGGTAACGAGTCGTACGGGGATGCCTGCAATTCTAGTGATCGCTTAAGTTCTTCTAATAGCGAGGACGAA 
CAGCTGCACCAAGAATATTCAACTTCATCAAGAGGAAGAACAAGGGAATATGCACAATCTGACTTGAATCCCGGGTACCAGGACTTGAATATCGG 
TAACCAGGACTTGAATCAAGTTAATCGCGAGTACCAGGAGTCTAATAACGAGTACCAGGTTTCAAACCAAGAGTACCAGCACTTGAATACCGAGT 
ACCAGGAGTTGAATCACGGCGATCAGGACTCAAATGACGAGTACCAGGAGTTGAATCACGGCAACCAGGACTCAAATGACGAGTACCAGGAATCA 
AATCACGGCAACCAGGACTCAAATGACGAGTACCAGGAGTTGAATCACAGCAACCAGGACTCAAATGACGAGTACCAGGAGTTGAATCACGGCGA 
TCAGGACTCAAGTGACGAGTACCAGGATTTGAATCACGTCAACCAAGACTCAAATCACGAGTACCAGGCTTGGAATCACGAATACCAGGACTCAA 
ATCACGAGTATCAGGCCTTGAATCGAGAGTACCAATATCCTGAATCTGATGAGGAAGAGGCCGCTAGAAAAAGAAACATATATACCCGTTTACGA 
TCCTACGCTTGTCAGGTACATCTCCCTCGTTTAGGAGAATCAGAAGTTGATAGCGACGATCGTCGCCATAACCCAATCAATCAGCCATTCAATCA 
GCCAATCAATAGGCCAAATCAAAAATCGGATTCATCTTTAGAGTCATCTTCGCCGGAGAGACAAGACTCTGAAAGCGAGTCGCGACAAGCTTCAG 
ATGAGGAATCGTCTTCTGGGGACTCGGACTACTTAATCGATCCAGTTACTGGGTGGCTATCGCGCCGGCGAGAAACTTCTACCGAACGCGAGGAA 
AAAAGCGACCAAAGCCAACAGGAGTCGTCAGAGGAAATTACAGACGAACAGAAGGAAGAAAGCGACGACGTCGAACAACAGTCGCCGGCTGAACA 
TACTTCTTCTGATAGACGCGATCATTCGCCAGACAAGCAACAGAATAAGCGAAAATCGTTAGGCACTGGGAATTCTTCGTATTCGAAGAAACCAT 
TATTGGACGGTGACAAAGGAGACCCACGTTCACATTTATCGTCTCTGGGAAGTCGCAGATCCTTAAAGCGTAAATGGCCTTTTGATTGTACAGAC 
TCTAAGAAATTTAAACCAAACTTGCAGAGTGAGGAAGACAAATCTGAGGAGGAAAAACGTGAAGATACGGAAAGCGAATGCTCTGACGAAGAAGA 
CTCAGATAGTTCAAGTTTCGATACAGATTCAAATTCAGATTCAGATTCAAATTTAGATTTAGATTCCGATTCCGATTCCCATTCCGATTTCGAAT 
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CATCTTCAAGTTCTGAGGAATCTGCCCAGGATCCAGCTCCTGAAGGCGAAACCGATCAAGATCCAGCTCCTGAAGGCGAAGCCGATCAAGATCCA 
GCTCCTGAAGGCGGAGTCGATCAAGATCCAGCTCCTGAAGGCGGAGTCGATCAAGATCCAGCTCCTGAAGGCGAAGTCGATCAAGATCCAGCTCC 
TGAAGGCGAATCCGATCAAGATCTAGCTCCTGAAGGCGGAGTCGATCAAGATCCAGCTCCTGAATTCGGAACCGATCAAGATCCACATCCTTATT 
ACAACGATCAGGATCCAGCTCCTTATTACGACGATCAGGATCCAGCTTCTTATTACGACGATCACATTTGTGATTCTGAGGAGGATTATAAACCA 
TTTATCTTCTCAAAGGAAAGGAAGTTCTTGAAATTCGTAACAAATCCTTTAGGGGATGATAAATCTAGTTTGGACCTTAGCATACAGCTCTATAA 
GGTATTTGGAGCTTACAAACTGGAAGACGTATACAGCAGTATTGATGAGGAAGGCCAACCAGAACAGGACCTGCAACTGAAGCAGCAACAGAAAC 
ATTTAAAAAAATATCAAAAGTTGCAACTGGTGGTATTCTGGCACCAGGAGAAATTCTTTTCAAAGGAGCAACTGTTGCTATATGAGTATCTGAAA 
CTACTTTTGCAGAAATTCTTTCCACAAGAGCAACAGGAGAAACGGTTGCAACAGGAGAAACTTTTGCAACAGGAGAAACTCTTGCAACAGGAGAA 
ACGGTTGCAACAGGAGAAACGGTTGCAACAGGAGAAACGGTTGCAACAGGAGAAACGGTTGCAACAGGAGAAACGGTTGCAACAGCAGAAACTCT 
TGCAACAGCAGAAACTCTTGCAACAGGAGAAACTCTTGCAACAGCAGAAACTCTTGCAACAGGAGAAACTCTTGCAACAGGAGAAACTCTTGCAA 
CAGGAGAAACAATTGCAACAGGAGAAACACTTGCCACAGGAGCGACAGTTGCAACAGGATAAAGTCTTCCTACAGCTACGACAGTTGCAACAGAA 
GAAAGAACTGGAGGAGCAGCGAGAGCGAAAGCGCCACCTCTTTGAGCAACAGAAACAGTTTGTCCTAGAAAAATTGCGCCAACAGAAGGAGCAGC 
ATCGTCGTCAGCAAAATCGCGTAAAACGCGAACGTAAATTGCGCCAGAAAAAGGAGAAACAGCGCTTGGAATACGAACGATTCTTGAAGGAATAT 
AGGGAGGACTGTAAACGTCAGTTGCGTTTGCAGCGCGAGCATTATCTGATCAAGATTAAGCAGCTGGAGAAGGCCAGTCAGCGCTCGTTGAAGGG 
GAATCGAAAGCACAAGCTGATGCAAAACGATATGGTCAGCATACCGGTGGCCAATTTGAACGGCGGCCTTAAAGCGGCCAGCAGCGGTTCCGGAG 
TTGGTGTAGTGACCTCCGGCGGCGTCGTATCCTCGGCGGTGCTGGCCAATGCCCCACGTGTATACCTAACGCCCTCCTCCACATTCATGACCAAT 
CGCCAAATGGCGGGAGTGGCGTCCACCGGAAGAATGAGCGGACAAGTCGTCGGTGGTTCATCTGGCACAGCCTCAACAGCCGGTACCGTTCGCTA 
TTTTTCGCAATTCAGCAAAATGCAAACGGCCGGCGGACCCAGTTTGCAAAGGAAACTTGCCAACGGGGATACCATTGTTCTGGCAACCGGCAGTA 
AAAATATGTTCCTTACTAGCAGCGAAAATAAGGCTAATCTGCCGACCGTCGCCAGCAATGGCAATGGTCTGATCACAGCCAAAATGGATCTATTG 
GAGGAGGAGGTCATGCAGTCGATAACTGTGATTGATGACGATGACGAGGAGAAAAAGGAAGTCGCTGAAGACGAAGAAGAGAGCAGCAATAATGC 
CAAACCCATTGACCTTCACCAGCCAATTGCTGATAATGAGCACGAGCTGGACATTGTGATAAACAATGTGGTCTGCTCGTTTAGCGTCGGCTGTC 

AC CT G AAG C T G C GC G AAAT T GC GCTGC AGG GAT C AAAT GT GG AGT AC C GT C GCG AGAAC GG AAT G GTG AC AAT G AAGC T C CGC C AT C CAT AC AC C 
ACTGCCTCGATTTGGTCGTCGGGGAGGATTACCTGCACTGGTGCCACTTCCGAGTCAATGGCCAAGGTCGCTGCACGCCGCTATGCCAGATGCCT 
TGGCAAGCTGGGATTTCCCACTCGTTTCCTCAACTTTCGCATTGTCAACGTACTGGGCACCTGCAGCATGCCGTGGGCCATCAAGATCGTCAACT 
TTTCGGAGCGCCATCGCGAGAACGCCAGCTACGAGCCCGAGCTGCATCCTGGAGTGACTTACAAGATGCGCGATCCCGATCCAAAGGCCACCCTG 
AAGATCTTCTCCACCGGCAGCGTAACAGTCACCGCGGCTAGCGTAAACCACGTTGAATCGGCTATCCAGCACATATACCCGCTGGTCTTTGATTT 
CCGCAAGCAACGTTCGGCGGAGGAGCTGCAGCATCTGCGTCAAAAGCAGCGTCTGCAAGCCGGCGGTGATCCCCACGAACTGGAGAAAAATGTCT 
TGGCCGATAACAAGACTGCTTCCCTTGATAATATCTTTGTTAATACAACAGCGGCGCACTCTAAATCATCGTCGAATGACCAAACATCAGCACCC 
GCGACCATCTTGT CC AGC AC AGT T G AT AG T AT GC C T C GT CT GAAAC AG AT GGT C AAC T AT C AC C AG AT G AT G AAGC AAAC CC AGG AAG AGC GC C G 
CCATATAATGTTTAATGGGGAGAAGGCCAATCCGGCCTCCACCTCATCGGCGGCAGCAGCTCCCTCCACCTCCTCCTCATCGTCCTCATCCGGGG 
ATAATATCTGTGCCAATGCCCGTCGCCGGGCCACCGAATGCTGGGCAACCAAGCTGCAGAATAAGCGTCCACGTTACAATGATCCCGGTACCACG 
GGCACAATTAACGCCGCATCGTCCACAGCATCAGCCGCCACATCTTCGTTAGCCTCCCAGGCGACACATCTGCGTAATCCTCTCAAGACGGCCGC 
GTTGGCCAATGCACGTATGTTAGGCGCCAAGGTGACAACATGCACACGCAACAGCATAATCGTGCAGCAGCCGCAACGTATCCAAATGCAGCAGC 
AGCAGCAACAGCTGCAACCGCAGCAGCAACAGACCAGCTTCTCGCCCAGCGAATTCGATGTTGATGACTTGATCGAGGAGGAGGAGAACAACGAA 

TTGGATATGCCCTTCTAA 
(SEQ ID NO: 83) 

Start ATG: 1 (Reverse strand: CAT) 

MGSAVKKGDGSPTRILEEYQVARDLRNEDLFPEHPVEHPDEDSDKENQSPNQSPNPRAIGDCEQGPDIGKDTLNPNEADQFSSEEVEKNEANSSC 
NESGGDARNSRDRLSSSHSEDEQLNQEYSTSSSGSTIGNREQGSDVGKDTSSRNEADQFSNEEVEKNEANSSGNESYGDACNSSDRLSSSNSEDE 
QLHQEYSTSSRGRTREYAQSDLNPGYQDLNIGNQDLNQVNREYQESNNEYQVSNQEYQHLNTEYQELNHGDQDSNDEYQELNHGNQDSNDEYQES 
NHGNQDSNDEYQELNHSNQDSNDEYQELNHGDQDSSDEYQDLNHVNQDSNHEYQAWNHEYQDSNHEYQALNREYQYPESDEEEAARKRNIYTRLR 
SYACQVHLPRLGESEVDSDDRRHNPINQPFNQPINRPNQKSDSSLESSSPERQDSESESRQASDEESSSGDSDYLIDPVTGWLSRRRETSTEREE 
KSDQSQQESSEEITDEQKEESDDVEQQSPAEHTSSDRRDHSPDKQQWKRKSLGTGNSSYSKKPLLDGDKGDPRSHLSSLGSRRSLKRKWPFDCTD 
SKKFKPNLQSEEDKSEEEKREDTESECSDEEDSDSSSFDTDSNSDSDSNLDLDSDSDSHSDFESSSSSEESAQDPAPEGETDQDPAPEGEADQDP 
APEGGVDQDPAPEGGVDQDPAPEGEVDQDPAPEGESDQDLAPEGGVDQDPAPEFGTDQDPHPYYNDQDPAPYYDDQDPASYYDDHICDSEEDYKP 
FIFSKERKFLKFVTNPLGDDKSSLDLSIQLYKVFGAYKLEDVYSSIDEEGQPEQDLQLKQQQKHLKKYQKLQLVVFWHQEKFFSKEQLLLYEYLK 
LLLQKFFPQEQQEKRLQQEKLLQQEKLLQQEKRLQQEKRLQQEKRLQQEKRLQQEKRLQQQKLLQQQKLLQQEKLLQQQKLLQQEKLLQQEKLLQ 
QEKQLQQEKHLPQERQLQQDKVFLQLRQLQQKKELEEQRERKRHLFEQQKQFVLEKLRQQKEQHRRQQNRVKRERKLRQKKEKQRLEYERFLKEY 
REDCKRQLRLQREHYLIKIKQLEKASQRSLKGNRKHKLMQNDMVSIPVANLNGGLKAASSGSGVGVVTSGGVVSSAVLANAPRVYLTPSSTFMTN 
RQMAGVASTGRMSGQVVGGSSGTASTAGTVRYFSQFSK^ 

EEEVMQSITVIDDDDEEKKEVAEDEEESSNNAKPIDLHQPIADNEHELDIVINNVVCSFSVGCHLKLREIALQGSNVEYRRENGMVTMKLRHPYT 
TASIWSSGRITCTGATSESMAKVAARRYARCLGKLGFPTRFLNFRIVNVLGTCSMPWAIKIVWFSERHRENASYEPELHPGVTYKMRDPDPKATL 
KIFSTGSVTVTAASVKTHVESAIQHIYPLVFDFRKQRSAEELQHLRQKQRLQAGGDPHELEKNVLADNKTASLDNIFVNTTAAHSKSSSNDQTSAP 
ATILSSTVDSMPRLKQMVNYHQI^KQTQEERRHIMFNGEKAWPASTSSAAAAPSTSSSSSSSGDNICANARRRATECWATKLQNKRPRYNDPGTT 
GTINAASSTASAATSSLASQATHLRNPLKTAALANARMLGAKVTTCTRNSIIVQQPQRIQMQQQQQQLQPQQQQTSFSPSEFDVDDLIEEEENNE 
LDMPF* 

(SEQ ID NO: 84) 

Gene Symbol: Trf2 
FlyBase ID: FBgn0026758 



Celera Sequence No. : 142000013384055 

TATTTTTTTTTTTTAACATAATTTTGAATAGCATTAAAATTATTAGCTACAAATGAATAAAAAACGAGCGGGGGTGGGGGGAAACAGAGGCTGCG 
GGGCAGCTGGAAAAAAAATTAGTGCGAAGGACTAGAAGTAAAGGAAAGAAACGGGCGTACCATCCTTATGTCGGGGGAAGCCCTCTGGAACCCCC 
ATTCTATCCCGTCCAGCACAAATGCGCATAATGAGCTCTGGTCAGCCAGCCTAGACGGCTGTGGGAGGTTGCCATCGCAGAGATTTACGCGCGAC 
AAGTTTTTCTTGCCCTTTTCCACAACGTAATTCTTTTTCATGCCCTATCTATAGAATTTTGGGTGTATGCCATTTTGTATCAAAATAAAGGGTAT 
GTCTTAGATTTTGATACTATATGTTACAACAAAATATGACACACCACTTATATGATGGAGATATATATTTATTCCTAGCATTTCATTAAAGTGTA 
TAGTCTTTTAAAAATGTAGAGTAGAGTATTGGAAAAGTAGCTCAACCCGTCTTCTATTTTAGGTAACATTTTTTGTAGACTCGCAACATCCAGCT 
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A AAC AC T T T T C AGT T GAT AT C G C A AATC C TT T TCGCTGGCGCT T AC AT AAT GAT AT G T G GC C GC AG AT GC C AAG GG A GGG AAC AT T T AAAAAAAA 
TGCATTTAAAAATAAAATGAGAAGGAAGTAGCGTCTGCTGTTTGCCTTTCTCCATTGAATATTGAGAATATGAATTAAATATAAACCTGCAGCAG 
GATAATCATAAGCAAATCACTATATGCAATTAGGGAAATAGTTTAAACATAATAAACTAAAATCTATAAGTGGACAGAGTCGCAAAGATTTAGGA 
AAACCATTTTTGATCGCTTGATCACCCAATAATTGAAAATATATGCTCGTATTATTTATAGTATATTTAAAAATCATTTTTATTCCGTTATATCA 
TGCTTGCTGTATTGTGTAATATATCAGTTAAACATAACTCTTAACTAGACAAAAAAAAAATAATCGATTATATGGTAAATAGGGTGACCGTGAAT 
AGTATGTAGCTCCACTTTTCAGAGTTTATAAATGGTGCATTTGATCCTCGACCAGCTTTTTGGGCCTTCTCTTGCGGCGTTTTCTTGCCTTTCGT 
CGTAGGCGATCGAATTTCCGGATCCGGTCGCAAATGAAATGCTTGAGGTCATCGCCCTCGGCGGCCGATTGACGACTGCACGACATTGTGAATAT 
TATTAGCGAGTTAAGGTAGACAAAAAATCTTATATCGTAGTCATCTAATGATGGTTTGCCCTTGGTTCGATGATTATTATTATTATTATCTTTCT 
GTCGGTCTTCCCTTTCCCGTTCCCGTTCCTTTTGATAATTACTAAAGTTGTGCTTGCGCAACTTGGAGGCGGGATACGTAGTGGGCGATGGATGG 
GGCTCAAAGGTCGAGCGCTGGCGCTTCGAACTGGACAAGTGTTGCCGAAAATAATCGTAAATGCTCATGTTCCTTATAACGATCGTGGTGGTGTT 
GCTAATGTTTATTAACGGTCCCGATTTATGAGGCGCCACATGCTGTAGATTTAGCCAACTGCCGCCTGCCGCCTGCACGCGAGTTGGCAACGAAG 
TGGCCACTATCTTATTGTAATTGTTATTGGCCAACATCGTTGTCGTATTCAATGTATAATGTAGCATTGGCTCATTGGTGTTATTGTTGCCGATG 
TTATGGCATCGGACGTTGTTGTGTTGATGGTTCTGATAATCTTGATTGTAGTAAGGATTTTTTTGTTGGGTTATTGTTGATTTGGTTGTATTGTA 

AT CT AC AGC AGT GAC T G C AAC CC G T C AAT GATCAGCGACCAGCGGTC C CCAG AAC AT T T T C GAT AT C T T C GAGT C GT CC T C GGT T AAAAAG AAC T 
GACGGCCCAAATGCACAAACTCCTTGAGCGACAATTGGCCATCGCCGTTCTTGTCTATCACAGCGAATGAAACGGCAGCATCGTCTTCGGTCAGT 

C C C AG AC AAC GG AAG AAG AGC T TGTATTGTTC GAG AT C C AAAT GGCCAGTGT G AT C AAAG T C C A C AGC C T AAG AAAAAAAAAAC C GG AAAAAT T A 
TTTTTCTGTTTATCGATAATAGCAACTTAACAATGAAACTAACCTTAAACAAGTAGGGCAGGAAGCGACCGATGCGTTTGGCCATCTTCTTGTCG 
TTGAGACGATGGTGCAGATCGGTGAGGAACTGCTCGGCAGTTACCAGATTGTATGGGGTTATCTCCCCGAACTGCTCTTCCCAAGTATGCTTGAT 
CACATCGTTGAATTCGGCAGCGACCTCTGGGGTCAAATGGCCCAGATCGCTAAAGCGTTTGGCCAGCAAACTAAAGTCGTCGAAGGACACGACGC 
CGTCGTGATTAACGTCCAAAATGCGGTGGAGAGTACGCATTTTGCGGCGCCAGAATGCTGAGTCGCCCTAAAAATGAGGAAAAATATACATATTA 
ATTGGTTTGCCTTGCGATTTTATTTTAAATTTTAATTATATTTGATTTAAAACATTCACAGGGCGTGCTAAAAGCTTCTTAAACTCTAAAGCTCA 
TTGCAATTCTGCAACCAAGCCTTTTGCTCTTTTCAAACTTTAAAAGGACTTGAAAAGTAAAAGCTTGAAAGCTTTGCAGGACTTGACTGTCCTTA 
TTAAAATGCTTTTATTAACATCAGCTTATCTCATAGACGCGCAGCGACTGCTTGTGAGCTTTTAGAAGCTTTAACCAAAAGGCTTTGAAAGGACT 
TTAAGGAACTGCTTGATGGCAAGTATAGTCAGGATACAAGCAACTGCTTTAAGCTTCAATTCGCTTTTTTTGAGAAAGCTAAGATTTCGTATTAA 
TAACCAAACAACAAAGCTTTGATCTTAAAACAGTCTGCACTCAAACCAGTTACTACTGTTAAATGGCCACTGTTAACCAGGTTACAATGTTGGAC 
TATTTTTTAAATGCCGTAGAGCAGGTTCAAAAAACGACATTGAGCACGATAAACAAATTTCGAAACACGTTTCTCGGCCTGTCGAGCAAACACAA 
TTGAAATTTCTCAGCCACGTGGACCGTGTGTTTTATGGATCTATTTAATGGATTATGCATTTCTCAAGCAGTTCACTCACCCGTTCGCTGCGCTT 
GGAATCCATTTTCTGGCGACGACGCTCGAACTCATCATCCGAGTCGGAATCGCTATCACTTTCCACCTCCCGTCGCAATTTGGCCGCTCCGGCCT 
TCTTTGAATATTGACGGACAGCCTGGACCACGGCAGTTGGCATTTGATGCCAAAGCAGTGAATGTGCTCCATGTCCACAGATAATGCCGCCAGAT 
GCCAGGCGCTGAAATCAGAAAAAAAAAGGAAATTAGAAAAATGTGAAAATGCAAAATGTATTGGATAAGTTTCTCTATCATAATAATCATTTTGA 
TTGTGCTTTAAAAAAAGAGATGCTAACGTGTCATTAATAATGTTACGCAATCATTATGCTATTGTTTTGTATTGTTTTTGTGTATATTTCGGTTT 
TCACAGCCCCATGCTGAATGCTCTTTTTGTTGTTGATATGGTTGCTAGACGCCCTTCGGTGTGTAAGCAATCATTATGTTTTTAATTACAACTCA 
GCGCACTGCTGTTTTCCTGTGTTGTTTTTGTTGTGCGTGTTATTTTCTCATTGTTAAGGTCAGTTCTGTTTACTCACAAAACAATTCTAAACCTG 
ATTCGTCGTTATCTGTTTTTTTTTACGAGTTTTCCATCTTTGGTTCGCCTTAAAAGTGCCAAAGAGTGACTGTCAAAATGTATCTCAAAAAACTC 
GGTCTACAGTTTGAGAGGCGCTTGGTAATGGTAAAAAACCACGTGGCGATCATGTATGTAACGAATTTAAAAGAAGAAATTGTAAACCAGCAGGA 
AAATAGCTTCCAATATATATAAAATATATGTAAATATAGGACACGGTATTTTACACATATATAATACATTCAATGTCCTTTGCTGCCAACATTGC 

AAC T T T GAAAAT T AAAAAAG AT AAAT AT T GAT GCC AAT AAC C AGT T C GG AG AAAG GT T T T T T C C AT C AC C T GT GC C AT T AGG AAAAGAT AAAAAA 
CAAAATTCGCTACTCTTTTAATTACAAGAAACCGAATTCTCTTGTAAACGTTTTGAATTTCAATGGCTCTCGATAAGAAAGAATCCAAGTACAGA 
TGTTGCCAGCCACCGACGACGATGATATCATGCATCATACCATATAACATTATCATAAAGCCCGCACACTTATTGTATTGGACTGACTGCTTTGG 
GACCTTTTTTCGGTTCCAGTTGCTTCGTTTGGTTAAACGAACTTTGGATTGTTTCTGTTTTTCGGTGTTTTTTTTTTCGTAAACAAGTTTCTCCA 
AAATACGAATTAATGAATGCAAAAAACGGGCCTCCTTTCATTTGAATGTCTGCGCGTCTCGAAAATAAAGAGAAACACGTTTCGACCATAAGAAA 
ATTAAACAAAAAACAATGCCAATGATTTATATGTGGAAGATGCAGAGAGTAAGAAGACGGCAAATAAAAATCAAAGTCGAGCAACAGGTCAATGT 
TAATTTAGCAGCAAACAAGAAGTAGAAAAATTGCGAATAAAATCACTTTAATCATTTCCTGTGATTCTAAAACTTCTTTTTCCGTGGCGGTTAGA 
GAAAAAGTAAAGGGAGGTGGGGTACTTTTGTTGTTATAATTTCCAATGCTTTTGCGCAAGATTTTCCTGCTACAGAATTTGCGCAGTTTTCCGAC 
TTTCAAAATGAGGGGGTTGTTGTCGGTGAGAAGGGAAAACAAGCTTTCACCTGAATACACGGTGAAAAATACCAGTAAGTATTAATATTGGCTTA 
GCTGGTTAAATTAATGTATGTTGTGTCAACATAATATCATAAATTTCGTTTCAAAACCGAAACTTTTATGCAAATACTATTGGACGGTATTTTTA 

AAGC T AAC AAC AT AGG AGT AAG AT G AAAC AT G T T TAT AT AT T AAC T G AAC TAT T T T GT T GAAT C T AT TG AT T C AAT AC CAAC AT T T C T T T GT GT G 
TAATTTCCACTTGTTTATCTTACTGTATCTTGTTTTATTTGTTTCATTGAGGGATATGCCTCTTTCGGCGCCGGGTTTTCAATTTCTTATTGATT 
GATTGAAGTGCTGCGCCCTTTAATGATTGATGTGCAAGAAATTTGCCAGCACCTAAACGTATTTTATAGTTAATCATCAATTGAAACCAAAAATC 
AGCAATATATAATGCTGCTGCTAACAACAATAAATTAATAATGCGGGAATGAAAAGTTAATAAAAATTAGAGAATTAATAATCCCAAACTCTCTT 
GTATTGCTAGAATTATAGTTGGGAATATCTATTAATATTAATCAAGAATTGAACTCGAGTTCTTACAGTTTCGAATGCACTGCTACAGTTTACTT 
AGCACTGGCCTCGTTTCGTGAACATTCCCGCTCCCAATTTTCATTCTCCACTTCCCTTGCTTGTCTGTATTCACCCCTCTTCCTCACTATTTACA 
GTTCTCCTTCCACCACTTACTGTTTTTCGAATTGCGATTCTATTTTTCTGCTTTGTTTTGAGTTTATATCATGCGGAACTGCTTTTTGGTTGGAA 
CATCTTTTGTGGAACTGGTTTTAAACTCTTTATTTTATAACTGTTGCTTTTAAGTTTGTTTTAGGAACTGCTCTTTCCTGTCCGCTTTGTGATGC 
TAAGTTTTTCTAACTGCTTGTTTCTGTCAATTCAATTAGTAACCTTAACTACAATGCACTGAAAGAAAAGATGCACTTACGTGACAATCGCTGTG 
CGAAATGGGCGTGTTTGGGGGCGGGGAGGCATTCTGCTGCAGAACGCCGACAGTTGCGGCCATGTTGCGTTCCAGAATTTCTCTTGAAGTGCGCA 
AAATACCACGGCTCACGATGGAGAATGCCATTTTGTTTTTCGCTCAAGTTTTAACCTCGGTTTTCTTAAATTTTTCCTCAATTTTCGCTACGTTT 
TCCTTACAGTTTGAACTGCACTCGAGAAAATGTGCAACGTTGCTTTTTTTTAGGGATTTTGTTGTTGAACTCTTGTTATTATTTCGATTGCAACG 
CGCATTTAACAACCGGTTTTTTCGAAGATGCACCACAAAAAAATGTTTGCCTTTATTTTTCTGCTTTGTTTTGTTACACTTTGTTTTTGAGTCCG 
CTAATTTCTATTTTGATTTCTCTGTGGTTTCAGTTTTTTTTTTTTTTGGATATCTTTGCTTGGGAGTAAACACAACCGTTCGCTTAAGCGTTCGA 
ATTCGTTTTGAAAACCATCGCTGCGCAAAGAAAAGAAGTCGACGTCGACGCCTGTAAAAGCAGAGCGGTCTTTCTGAGTGTGTGTGAGCGGGAGT 
GCGCGTGTGTGTGTGTGGTTTTGTGTGATGAAGAGGGAGAGGACAGCATAAGAAGGGAGCAGGGAGAACAGCTGCTACTTGGAATGGTTACCACA 
AGCAGGCAGTACACACTATTGAAGCCAAAATATTAGTAGTTACATTTTCAAATGTTCACGAAATCAAACTTTGAGCTGCATTAATATACACATGC 
ATATGTGTCGCAAGAATATTTTTGAATATGAGCGTTCCCAAATGTTAAAACCAGCTTTAAGAGATGCGGAGGAATTAGTATTTAAGTTTTTTAGT 
TCATATTTTTGCGCCCTTGGCTGTATATATGTAAGTACAAACACAAAGGCAAGACCATGTAAGCCGGTTGAAGAAGAAAATGGAGAGGAAGCAGG 
AGCACAAGAAGAAGCAAAAACAAACACAAAACCCAGTGATTGTCAAACCTCGCTTAAATGGAGAACTGTATTCGTTTTCAAAATAAAGCCCAAAG 
AAAATGCATTACATAAACATGGGATTACACACACATATACATACACGGACTACAGTTTTCCAATTGATAACGTAAACGAAGGAATCCAAAAACAA 
AAGCCAAAACTCAAAATGAAAATATCATAAAATGTGCGTATTTAGGTCGCTGCGTGTGACAAACTGTCGGTCCAATACAATATGCCCCAAGACAC 
CCTTCCCCCCTCGCTACGCACTTCCCCGAAAAGTGCCCTCTACGCACACACATACAAATGAAGTAACACAAATACCTACGCACTAATAGTTTCTG 
TTTTTCCGTTGCTGTTCTACTTCATTGCTCCACAGCAAAACGCATGCTTACACCCACACGTGTGCGGTAAATATTATAGGATCATTACAACACTT 
TAAAACTAATAACATTAAAAGATCGTTTTCAGTTACATACCAATTAA 
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(SEQ ID NO: 85) 



Exon: 6267.. 5781 
Exon: 3333.. 3121 
Exon: 2442. .2134 
Exon: 2063.. 924 

Start ATG: 5921 (Reverse strand: CAT) 
Transcript No. : CT3375 

AACGCTTAAGCGAACGGTTGTGTTTACTCCCAAGCAAAGATA 

ACTCAAAAACAAAGTGTAACAAAACAAAGCAGAAAAATAAAGGCAAACATTTTTTTGTGGTGCATCTTCGAAAAAACCGGTTGTTAAATGCGCGT 
TGCAATCGAAATAATAACAAGAGTTCAACAACAAAATCCCTAAAAAAAAGCAACGTTGCACATTTTCTCGAGTGCAGTTCAAACTGTAAGGAAAA 

C G TAG C G AAA AT T G AGG AAAAAT T T AAG AAAAC C GAG G T T AAAAC T T G AGC G AAAAAC AAAAT GGC AT T C T C C ATC GTG AGC C GT GGTATTTTGC 
GCACTTCAAGAGAAATTCTGGAACGCAACATGGCCGCAACTGTCGGCGTTCTGCAGCAGAATGCCTCCCCGCCCCCAAACACGCCCATTTCGCAC 
AGCGATTGTCACCGCCTGGCATCTGGCGGCATTATCTGTGGACATGGAGCACATTCACTGCTTTGGCATCAAATGCCAACTGCCGTGGTCCAGGC 
TGTCCGTCAATATTCAAAGAAGGCCGGAGCGGCCAAATTGCGACGGGAGGTGGAAAGTGATAGCGATTCCGACTCGGATGATGAGTTCGAGCGTC 
GTCGCCAGAAAATGGATTCCAAGCGCAGCGAACGGGGCGACTCAGCATTCTGGCGCCGCAAAATGCGTACTCTCCACCGCATTTTGGACGTTAAT 
CACGACGGCGTCGTGTCCTTCGACGACTTTAGTTTGCTGGCCAAACGCTTTAGCGATCTGGGCCATTTGACCCCAGAGGTCGCTGCCGAATTCAA 
CGATGTGATCAAGCATACTTGGGAAGAGCAGTTCGGGGAGATAACCCCATACAATCTGGTAACTGCCGAGCAGTTCCTCACCGATCTGCACCATC 
GTCTCAACGACAAGAAGATGGCCAAACGCATCGGTCGCTTCCTGCCCTACTTGTTTAAGGCTGTGGACTTTGATCACACTGGCCATTTGGATCTC 
GAACAATACAAGCTCTTCTTCCGTTGTCTGGGACTGACCGAAGACGATGCTGCCGTTTCATTCGCTGTGATAGACAAGAACGGCGATGGCCAATT 
GTCGCTCAAGGAGTTTGTGCATTTGGGCCGTCAGTTCTTTTTAACCGAGGACGACTCGAAGATATCGAAAATGTTCTGGGGACCGCTGGTCGCTG 
ATCATTGACGGGTTGCAGTCACTGCTGTAGATTACAATACAACCAAATCAACAATAACCCAACAAAAAAATCCTTACTACAATCAAGATTATCAG 
AACCATCAACACAACAACGTCCGATGCCATAACATCGGCAACAATAACACCAATGAGCCAATGCTACATTATACATTGAATACGACAACGATGTT 
GGCCAATAACAATTACAATAAGATAGTGGCCACTTCGTTGCCAACTCGCGTGCAGGCGGCAGGCGGCAGTTGGCTAAATCTACAGCATGTGGCGC 
CTCATAAATCGGGACCGTTAATAAACATTAGCAACACCACCACGATCGTTATAAGGAACATGAGCATTTACGATTATTTTCGGCAACACTTGTCC 
AGTTCGAAGCGCCAGCGCTCGACCTTTGAGCCCCATCCATCGCCCACTACGTATCCCGCCTCCAAGTTGCGCAAGCACAACTTTAGTAATTATCA 

AAAG GAAC GGG AAC G GG AAAGG GAAGAC C G AC AG AAAG AT AAT AAT AAT AAT AAT C AT CG AACC AAGGG C AAAC CAT CAT TAG AT G AC T AC GAT A 
TAAGATTTTTTGTCTACCTTAACTCGCTAATAATATTCACAATGTCGTGCAGTCGTCAATCGGCCGCCGAGGGCGATGACCTCAAGCATTTCATT 
TGCGACCGGATCCGGAAATTCGATCGCCTACGACGAAAGGCAAGAAAACGCCGCAAGAGAAGGCCCAAAAAGCTGGTCGAGGATCAAATGCACCA 
TTTATAAACTCTGAAAAGTGGAGCTACATACTATTCACGGTCACCCTATTTACCATATAATCGATTATTTTTTTTTTGTCTAGTTAAGAGTTATG 
TTTAACTGATATATTACACAATACAGCAAGCATGATATAACGGAATAAAAATGATTTTT 
(SEQ ID NO: 86) 

Start ATG: 347 (Reverse strand: CAT) 

MAFSIVSRGILRTSREILERNMAATVGVLQQNASPPPNTPISHSDCHRLASGGIICGHGAHSLLWHQMPTAVVQAVRQYSKKAGAAKLRREVESD 
SDSDSDDEFERRRQKMDSKRSERGDSAFWRRKMRTLHRILDVNHDGVVSFDDFSLLAKRFSDLGHLTPEVAAEFNDVIKHTWEEQFGEITPYNLV 
TAEQFLTDLHHRLKDKKMAKRIGRFLPYLFKAVDFDHTGHLDLEQYKLFFRCLGLTEDDAAVSFAVIDKNGDGQLSLKEFVHLGRQFFLTEDDSK 

ISKMFWGPLVADH* 
(SEQ ID NO: 87} 

Name: SARCOPLASMIC CALCIUM-BINDING PROTEIN 
Classification : ligand_binding_or_carrier 
Gene Symbol: CBP 
FlyBase ID: FBgn0026144 



Celera Sequence No. : 142000013384404 mm , mm „ 
TCCGTCGGTTGTTGTGGTAACTTAACTATGTCCACCTTTCCGTTTTCCTGGCACACAAAACTACTAGTTTTTAGCGTTTCTTTTTTGGTTTATTG 

TTTTTTGTGATATGATATCGATATCGATATCAGGAGGCGTTATTACGCTTTCCCTTCAACGCACAAACAAATACACCCCCACCAACAAAAGTCGA 

TGGTTCGGATTATAGTATCGATAGACAAGCGCGATGCAAAGTGCCCACCTATTCGCGTCTCGAATGTGGGTGTTTCCACTGTACAAGCAATTTGT 

ATACATGTGGAAAGCTCTTTGTCCAAATTGGTAAATGGCTAAATGTTTTTGAAAAAAAGATAATTAACAACCATAAAACTACAAGAAAATTACAA 

AATTTAACCCAAGTGCATATGTATACATGAATGACAAAACGGTAAAACCATGCAAATGAATAACCATCCACGTCGAATAATACCTAGTTCACCCT 

GATAAGGCACTCCCAATTTGTCTTATTAAACAGCTAATTTTGTTAGGGCTTAAAAATACCCAACGTTATGTATGTCTAGTGTTGTCTAGTTTCGG 

GGGGCGGAAAAGCCAATTGCACACAAATATACAAAGTATTGCCGAAACGTTGATTCTTTCAATAAATAAGGCACATTTTAATGAACTTATGCTTA 

TGCCAACAAAAGCGGTCGCTTATTTTTATTACTTATTGATATAAATTCCAAAACAGCTGACATTTTTAACGGTTACTTTGCCTAGTGTTGGCCAA 

GACATCAATCTTAATAATAAACTCTCAAGTTGTCAAAATTATACTTGTATTTCTACTAAAAAGTCATAAATATACCCAAAAACTACTTAGCCAAT 

AGTAACCCTGCTTAATATGGATTTTATTCACTTATTAACACGCCAATTGTCTCACTTTTCAAACACAACGGCTGATTGCCAGCACTGTTTGCGCT 

GCAGTCCAGTGCTGCGCAACGCGGCGGGGACAAAAAACAATGGCGAGGCGAAACTAAAAAATTGTGTTGCTGACATCTGGTCGCTTGCAAAACTA 

TTTCTAGCAGATTTTGTGATATTTCGTTGTGATCGGTCGATAAATCCGCCAGTTTTTTTTTTAATGGAAAGTGCTAACACATTGTAGCGGTTGGG 

AAGATAGCAGGAAAGGTGAGCGAAAGGCAGAAGATTGCGAAAAATTCAATTAAATTTCCGCGACCGTCTGTTTTCTGCATCGGTGTGTGTTTGTG 

TTTGCTCGCAAGTGTGTGTGGGTGCTCTGTGTATGACGTAAAGCGATAAAGCAAACGCATTCCCCCCACTCCTTGCATATGTGTGTGTGTGCAAG 

TGTTGTGTTTCTCTCTCATCAGTTTTGCATCTAATTAGGAGCGGAAAGCATTGATTGGCTGCCAAACGCCGCGGTTTGTTGTTTTTTTCGCTGCA 

TTTCGGCCAAAAGGCTTGCCGCAGCTGCATATGTGTCGCTATGTGTGTGTCAGTGTTGCACAGTGCCGCCAGTGCCCACAAAATGGGCGCACAAC 

AACAAAAGCAGCAACTCGGTTGGGATTTTCGCGATCCTACGCTGGCTCGGAGCTCCCCTTTTCCTATTTCCAATTCCCAGCCAGCGGGCTGCCGT 

TTTTCCTTTTTGTTATCCGTTGCCAGACGCAACGAAAACGACAGTTGGCATTTGAATTCAGCACAAACACACATACTAACGCCGACCCGCAAGCA 

GCACACACACACACACTGGGACACTCGAAAAAAAAAAAACAGACGCTGTCGGCGACCTCGACAAGCAGTTGGGTTCGATTTAGTTGTCAATGCCT 

TGAATTCGGTTCGGGGCTTAGTTTCCACAAGTTTATCGCTCGTCAAGAAACAACGAAATAAAATTATTTTCGACCTAAAAAATCTGACTAAATTG 

TGTTTTTTGTTTATGTATTTATTTAGGCACATTTTGCACACCACAACGTAGTTACTACATCTACGACTAACGGAACTCCTCCTGCAAGCAGTGGA 

AGTTGCTGTCCATCAAGCAGTACTCGGAGTTAACGCAGGATAAGCCGGGAGAAAGAGAAAGAGATCGGTGGAGAATAGAGATATACAGGTGGAGT 
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CAAAGAGGAAGGATCATGGACATGATTACGGTGGGGCAGAGCGTCAAGATCAAGCGGACGGATGGCCGCGTCCACATGGCCGTGGTGGCGGTGAT 
CAACCAGTCGGGCAAGTGCATCACAGTCGAATGGTACGAGCGCGGCGAAACGAAGGGCAAGGAGGTAGAACTGGACGCCATACTCACGCTCAATC 
CGGAGCTAATGCAAGATACTGTCGAACAGCACGCCGCCCCGGAGCCCAAGAAACAAGCCACCGCGCCGATGAACCTCTCGCGTAATCCCACACAA 
TCGGCTATCGGTGGCAATCTCACCAGCCGTATGACCATGGCCGGAAACATGCTGAACAAGATCCAGGAAAGCCAGTCGATTCCCAATCCGATTGT 
CAGCAGCAATAGCGTGAATACAAACAGCAACTCCAACACTACGGCCGGCGGAGGTGGTGGCACCACAACGTCGACGACCACTGGATTACAGCGTC 
CACGGTACTCGCAAGCTGCTACCGGCCAGCAGCAGACAAGGATCGCCTCGGCGGTGCCTAATAACACATTGCCCAATCCCAGCGCGGCAGCCAGT 
GCTGGTCCGGCGGCACAAGGAGTCGCCACTGCGGCCACAACCCAGGGAGCTGGCGGCGCTAGTACCCGGCGATCGCACGCATTGAAAGAGGTGGA 
GCGACTGAAGGAGAATCGCGAGAAGCGACGCGCCCGACAGGCCGAGATGAAGGAGGAGAAGGTGGCGCTGATGAACCAGGATCCGGGCAATCCAA 
ACTGGGAGACGGCGCAAATGATACGCGAATATCAGAGCACGCTGGAATTTGTGCCGCTGCTCGATGGCCAGGCCGTCGATGACCATCAGATCACA 
GTGTGCGTGCGCAAGCGTCCCATTAGCCGCAAGGAGGTCAATCGCAAGGAGATCGATGTCATTTCGGTGCCGCGCAAGGACATGCTCATCGTGCA 
CGAGCCGCGCAGCAAGGTCGACCTCACCAAGTTCCTGGAGAACCACAAGTTTCGCTTCGACTACGCCTTCAACGACACGTGCGACAATGCCATGG 
TATACAAGTGAGTATCCAATCCAAATAACAAGCTCCAAAACAACTCCATATATGGTCTGTTTGAATTACTAGTAGTGATTCTAAACAAACGCTCG 
ATATGGTGACCACACATTCCAATGTCTTGGGGAGGCTTCTTCAATTACTAATGCGTATCCTTGCTAGCAAACGGCTTTATTTCAATTGAACTGAA 
CTATAGAAAATCTTATTTCGTTATAGATGACCCTCTATAGCTATCATTCAATCTTAAACCTAATTTGTTGATGCAGTGGAATCATATATAGTATA 
AAGCAGCATATCCTTTAAAGAAATCATTGCCCCATTTATCAGATTTCCATCGAGTAACCAAATGTGCCAGTCACTTTTCGAGCTCGTTTCGTTGG 
CGCATCTCTGTTAGAGCGACCCATTGAGTCATGCCGACAGAGACGGCAACAGAGCCACACACATGCATCATGAAAGCGGTTGCTCCCGAAATAGC 
CAAGTTCGTTGAATGACTAACTTGGAAGCCCCAGACGCTTCACCTGCCCCCGCCCGGCCACTCCGTCCGCCGATCTGCACACGAGCTCTCTAAAC 
CCAATTCTCCAGCGGAAGAGATTTCGAAAAAGTAACTGCCTTGTGGCCCGGCATTTGTCCATTAGCAATTGTTTGGCTTCAATTGAATTAAAGCT 
GCTACGGCTGCTGCTGCCCCTTTCCTTTCTCCGCCAACTTCTTGCTTAGCTTCTCTTTCCGGTTTTCGCACTCGTATGTATTGTGTGGCTGTCGG 
GAATTGGTAATGCCATTAGTTGCGTCACGGCAATCGCTTAAGTATAGCCCAAAAAGTATACCCCATGAGTAATATAGCCCGGCATTAATGCACAA 

AATCTCGCC GAT GAT AAC G T AGT CTCGTGTT GAC AAAT CC AAAT G GGT GAAT C AAC T GC T G AC AT C GT GC GT TAT C AAT ACT C AGT AT T GGC AG A 
AAACAATGCAACTTACACATTTATTGATTACATTTCAATATGCGTTCCTTAATTAGTCGTATGTATAATTCTTATCGATAATCATTTTTACTAAG 
CAACTGCTAACTTTGAATGCAAATCGATGTATTTATCTATGATTAATGCTAAAGTCTGAAACACAACAGTCCATATAGTATTGAATGCTTGCATA 
TAGCACATATCTGCATTGAATAACCATGCTAATCTATAATCTTGCACTTCATCCGCGCAGATACACAGCCAAGCCGTTGGTGAAAACCATTTTCG 
AGGGCGGAATGGCGACGTGCTTCGCCTACGGCCAGACGGGATCGGGCAAAACGCACACCATGGGCGGTGAGTTTAATGGAAAGGTGCAGGACTGC 
AAGAACGGCATCTACGCCATGGCGGCCAAGGATGTCTTTGTGACCCTGAATATGCCGCGTTACCGCGCCATGAATCTAGTCGTCTCGGCCAGTTT 
CTTTGAGATTTACAGTGGCAAGGTGAGTGAGACAACATACAACTTGGCATGATTATCATACTTAGCTGTATTAATCGATCGTTGTGTTGTTGCCA 
TAGGTCTTCGATCTTCTGTCCGACAAGCAGAAACTGCGCGTCCTGGAGGATGGTAAACAGCAAGTGCAGGTGGTGGGACTCACCGAGAAGGTGGT 
CGATGGCGTCGAGGAGGTACTGAAGCTCATCCAGCACGGCAATGCTGCCCGAACATCCGGCCAGACGTCGGCCAACTCCAATTCGTCGCGTTCGC 
ACGCCGTTTTCCAGATTGTGCTGCGGCCGCAGGGCTCGACGAAGATCCATGGCAAGTTCTCGTTCATCGATCTGGCGGGCAATGAGCGGGGCGTG 
GAC ACTTCCTCGGCCGATCGGCAGACGCGTATGGAGGGTGCCGAGATT AAC AAAT CGCTGCTGGCCCTCAAGGAGTGCATTCGTGCGTTGGGCAA 
ACAGTCGGCCCACTTGCCCTTCCGTGTCTCCAAACTCACCCAGGTGCTGCGCGACTCGTTCATTGGCGAGAAGAGCAAGACGTGCATGATAGCCA 
TGATCTCGCCGGGACTTAGCTCCTGCGAGCACACGCTCAACACGCTGCGCTATGCGGATCGTGTCAAGGAGCTGGTGGTCAAGGATATCGTCGAA 
GTTTGCCCTGGCGGCGACACCGAGCCCATCGAGATCACGGACGACGAGGAGGAGGAGGAGCTCAACATGGTGCATCCGCACTCGCATCAGCTGCA 
TCCCAATTCGCATGCACCGGCCAGCCAGTCGAATAATCAGCGTGCTCCGGCCTCTCATCACTCGGGGGCGGTCATTCACAACAATAATAATAACA 
ACAACAAGAACGGAAACGCCGGCAACATGGACCTGGCCATGCTGAGTTCGCTGAGCGTAAGTATTTGCATATTTCTCACCTAGAAGTTACACACA 
AGGCCATAATATAATATATATTCTATAATATATGTTATATAGCTTAGTAGGCTTAGTTTAGGGCATTTGGCATTGTCACTCACTTCCGTCAGTCA 
GCATCTAGTCCCAGCTCAGTTTGCAACTCCCTTTAGTTCGGGTTACAAGTTATATGTTATAAATACATGTCGATACCATCACCATCCAACACAAT 
GCTCCAGCTAGGCAAAGACAAGGCAAAGGCGAGAGCACGCGATAGCCACACGCCAAATGTGGCTGTTTACACCAATGTACAGACCCTGCGATCCG 
TGTACAGCATGGTCAGCCAGGCGGCAGGCAGGAAAGCTGCTGCTGCTGCTGCCACCTCGATTCCCATGGCGCAAGTCAAAGATCACGAGGAACCG 
TCACGCGTTTGGAACTGCAACAAGCCCAAGAAAAAGGAATATAAAACATATAACGTCTACTCCAAACAAGCGGACAAGTTAAGCGGCGAAGAGGG 
ATCCAAGATGGGCATTCGACAGTGGGTCCATCCCCCGTATCCCGTTCATCGTGAGCGCAAGAAGCGCGACGGCCCGGGCAACTACGAATCCTTCT 
ATGTGCCCTACGAACAAGACGATGGGATAATAGTGAAAGACAATCCCTGGACCGAGACAAAGGAGCTGGCAGTGAATCTACGTAGCCGATCATTT 
AAGCGCTTGCCGACGGCGGGCGAACGGTATGCGCTAAATATCACCAAAATGGCCAGTCTGATTGAGAAGGTCAAGTCGCATCCGAATCACGACGG 
TGTGTCGAAGGCTGCGAGGCAGATGTGTCCGTTGTTGCCGCCGCCGCCGCAGTTCGCCGAGGATCCCATTGATTCCGGGCGGCGCTTCGCCCTGC 
TTCGCTCACGAACGGAGTTGTGCTTCTCGATTTTGGATTGCGATGACTTGCTGTACAAGTACGCGTGTCATCTGCCCTTTGTGCGCGATCCATTG 
CAGCCAGATGCCGTGCGGCCAAAGATGAAACGATTGTCGTGCTCAATTGGGGGGAATTCGTGCACCATACTCAACTAGCCATCAATGATCTTATA 
TATTTTATAACTTGTTCTCGTTGAATTCCAGTAAAATTCACGTGTTCCCCTTTGGGTCATACTCGTATGGAAATAAGTATAGGGCTTTTTCCCCA 
TAGACAGATCCCTAATGAAACATACAACCTATAATGTTCATTTATACACACTGCACATTGCCATTTTGCAGGAACACGAGATGTCCGACGAGCTG 
ATTGTGCAGCACCAGGCCATCGACGACCTGCAGCAGACGGAGGAGATGGTGGTGGAGTATCATCGCACCGTTAATGCCACACTGGAGACCTTCCT 
CGCCGAGTCGAAGGCGCTGTACAATCTGACCAACTATGTGGACTACGACCAGGACTCGTACTGCAAACGGGGCGAGTCGATGTTCTCGCAGCTGC 
TGGACATCGCCATCCAGTGCCGCGACATGATGGCCGAATATCGCGCCAAGTTGGCCAAGGAGGAGATGCTGTCGTGCAGCTTCAATTCGCCGAAT 
GGCAAGCGTTAGTTCATTCCCGCATTCGACCCGTTCATCCACAATCACCAACAATCATCCACCACACCCAACCAGGCGTACACTTTACATTACAA 
ATTCCCTCTGGCTTCCCAATGGACGTGTCTGTCGTGCAGTCCCCTAAAACAATCTCTCTATCACCTTTGCCCCAAACCGTCGAGTTTATGTCGCT 
GCGAGTTTATGGTTTATTAAGTTTTTAACTGATAAATAATAGTCGAGAGCCTATATTTTACAAACGAACTAAGCACGGGAACAGACCAGATACCA 
CAGCATTAATACCATTATGAGAGCAACGCAAACCAATAAAAAGGAACTCAACTGAAAACTGAAGACAGGCTAGGCTGGCTGGCCACGACTTTGTT 
TGGATTCGCCGTGAACTATGATTATATGAATATAATTGTAATTTCAGCGCAGTCGATACTCTAATTGTGTAAAAGAAAAAGACATTTACTATACA 
ATATATATATATTTACTACTTACAAACATACGTTCTACTCTAAACAATGATATGAGATTCCATTTAATTTTTAACAAATAACCTTATGCGATTCG 
ATTCGATGCGATTTGTTTCCCTCATTTTTGTTCATTTTGTAGAATTACTCGACCAGAGTTGTTTGTTCATGCCACGGCACAAATCTACTAGTTTT 
TAAATTACGAACTAATCGCGCATTGTATTTATTTAAAGGCCACAAAACTGTATTTAATTTGTATTAATTTATTAAAGTACTACTGCACCAAATCC 
GATTTACATACGTATTCCCATAGCCAGTAACTATTAATGATTTTCACACGCAAACCGAACTGCAATCTTAGACAGCAAGTGTTTTTTTTTGTGAA 
ACTTTTTATGCCCCTTCTTTATAAATTTATATATCTACATATAAATTATTTACATTATATTTTTTTTTGTCATTTTAAGCGACTTCGGTGAAGAA 
TTTAAAATGCAAACTGTTTTTGCTATACGATTATTATGATTTAGAGATGATTTGCATTTCATT 
(SEQ ID NO: 88) 

Exon: 1001. .1155 
Exon: 1599.. 3142 
Exon: 4336.. 4582 
Exon: 4659. .5471 
Exon: 6722.. 7043 
Start ATG: 2112 
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Transcript No. : CT3411 

AAACTAAAAAATTGTGTTGCTGACATCTGGTCGCTTGCAAAACTATTTCTAGCAGATTTTGTGATATTTCGTTGTGATCGGTCGATAAATCCGCC 
AGTTTTTTTTTTAATGGAAAGTGCTAACACATTGTAGCGGTTGGGAAGATAGCAGGAAAGAGCCAGCGGGCTGCCGTTTTTCCTTTTTGTTATCC 
G T T GC C AGAC GC A AC G A AAACGAC AGT T GGC AT T T G AAT T C AG C AC AAAC AC AC AT ACT AAC GCC GAC C CG C AAGC AGC AC AC AC AC AC A C AC TG 
GGACACTCGAAAAAAAAAAAACAGACGCTGTCGGCGACCTCGACAAGCAGTTGGGTTCGATTTAGTTGTCAATGCCTTGAATTCGGTTCGGGGCT 
TAGTTTCCACAAGTTTATCGCTCGTCAAGAAACAACGAAATAAAATTATTTTCGACCTAAAAAATCTGACTAAATTGTGTTTTTTGTTTATGTAT 
TTATTTAGGCACATTTTGCACACCACAACGTAGTTACTACATCTACGACTAACGGAACTCCTCCTGCAAGCAGTGGAAGTTGCTGTCCATCAAGC 
AGTACTCGGAGTTAACGCAGGATAAGCCGGGAGAAAGAGAAAGAGATCGGTGGAGAATAGAGATATACAGGTGGAGTCAAAGAGGAAGGATCATG 
GACATGATTACGGTGGGGCAGAGCGTCAAGATCAAGCGGACGGATGGCCGCGTCCACATGGCCGTGGTGGCGGTGATCAACCAGTCGGGCAAGTG 
CATCACAGTCGAATGGTACGAGCGCGGCGAAACGAAGGGCAAGGAGGTAGAACTGGACGCCATACTCACGCTCAATCCGGAGCTAATGCAAGATA 
CTGTCGAACAGCACGCCGCCCCGGAGCCCAAGAAACAAGCCACCGCGCCGATGAACCTCTCGCGTAATCCCACACAATCGGCTATCGGTGGCAAT 
CTCACCAGCCGTATGACCATGGCCGGAAACATGCTGAACAAGATCCAGGAAAGCCAGTCGATTCCCAATCCGATTGTCAGCAGCAATAGCGTGAA 
TACAAACAGCAACTCCAACACTACGGCCGGCGGAGGTGGTGGCACCACAACGTCGACGACCACTGGATTACAGCGTCCACGGTACTCGCAAGCTG 
CTACCGGCCAGCAGCAGACAAGGATCGCCTCGGCGGTGCCTAATAACACATTGCCCAATCCCAGCGCGGCAGCCAGTGCTGGTCCGGCGGCACAA 
GGAGTCGCCACTGCGGCCACAACCCAGGGAGCTGGCGGCGCTAGTACCCGGCGATCGCACGCATTGAAAGAGGTGGAGCGACTGAAGGAGAATCG 
CGAGAAGCGACGCGCCCGACAGGCCGAGATGAAGGAGGAGAAGGTGGCGCTGATGAACCAGGATCCGGGCAATCCAAACTGGGAGACGGCGCAAA 
TGATACGCGAATATCAGAGCACGCTGGAATTTGTGCCGCTGCTCGATGGCCAGGCCGTCGATGACCATCAGATCACAGTGTGCGTGCGCAAGCGT 
CCCATTAGCCGCAAGGAGGTCAATCGCAAGGAGATCGATGTCATTTCGGTGCCGCGCAAGGACATGCTCATCGTGCACGAGCCGCGCAGCAAGGT 
CGACCTCACCAAGTTCCTGGAGAACCACAAGTTTCGCTTCGACTACGCCTTCAACGACACGTGCGACAATGCCATGGTATACAAATACACAGCCA 
AGCCGTTGGTGAAAACCATTTTCGAGGGCGGAATGGCGACGTGCTTCGCCTACGGCCAGACGGGATCGGGCAAAACGCACACCATGGGCGGTGAG 
TTTAATGGAAAGGTGCAGGACTGCAAGAACGGCATCTACGCCATGGCGGCCAAGGATGTCTTTGTGACCCTGAATATGCCGCGTTACCGCGCCAT 
GAATCTAGTCGTCTCGGCCAGTTTCTTTGAGATTTACAGTGGCAAGGTCTTCGATCTTCTGTCCGACAAGCAGAAACTGCGCGTCCTGGAGGATG 
GTAAACAGCAAGTGCAGGTGGTGGGACTCACCGAGAAGGTGGTCGATGGCGTCGAGGAGGTACTGAAGCTCATCCAGCACGGCAATGCTGCCCGA 
ACATCCGGCCAGACGTCGGCCAACTCCAATTCGTCGCGTTCGCACGCCGTTTTCCAGATTGTGCTGCGGCCGCAGGGCTCGACGAAGATCCATGG 
CAAGTTCTCGTTCATCGATCTGGCGGGCAATGAGCGGGGCGTGGACACTTCCTCGGCCGATCGGCAGACGCGTATGGAGGGTGCCGAGATTAACA 
AATCGCTGCTGGCCCTCAAGGAGTGCATTCGTGCGTTGGGCAAACAGTCGGCCCACTTGCCCTTCCGTGTCTCCAAACTCACCCAGGTGCTGCGC 
GACTCGTTCATTGGCGAGAAGAGCAAGACGTGCATGATAGCCATGATCTCGCCGGGACTTAGCTCCTGCGAGCACACGCTCAACACGCTGCGCTA 
TGCGGATCGTGTCAAGGAGCTGGTGGTCAAGGATATCGTCGAAGTTTGCCCTGGCGGCGACACCGAGCCCATCGAGATCACGGACGACGAGGAGG 
AGGAGGAGCTCAACATGGTGCATCCGCACTCGCATCAGCTGCATCCCAATTCGCATGCACCGGCCAGCCAGTCGAATAATCAGCGTGCTCCGGCC 
TCTCATCACTCGGGGGCGGTCATTCACAACAATAATAATAACAACAACAAGAACGGAAACGCCGGCAACATGGACCTGGCCATGCTGAGTTCGCT 
GAGCGAACACGAGATGTCCGACGAGCTGATTGTGCAGCACCAGGCCATCGACGACCTGCAGCAGACGGAGGAGATGGTGGTGGAGTATCATCGCA 
CCGTTAATGCCACACTGGAGACCTTCCTCGCCGAGTCGAAGGCGCTGTACAATCTGACCAACTATGTGGACTACGACCAGGACTCGTACTGCAAA 
CGGG GCG AGT C GAT GT T CT C GC AGC T GC T GG AC AT C GC C AT CCAGTGCCGC GAC AT GAT GGC C GAAT AT C GC GC C AAGT T G GC C AAGGAGGAG AT 
GCTGTCGTGCAGCTTCAATTCGCCGAATGGCAAGCGTTAGT 
(SEQ ID NO: 89) 

Start ATG: 669 

MITVGQSVKIKRTDGRVHMAVVAVINQSGKCITVEWYERGETKGKEVELDAILTLNPELMQDTVEQHAAPEPKKQATAPMNLSRNPTQSAIGGNL 

TSRMTMAGNMLNKIQESQSIPNPIVSSNSVNTNSNSNTTAGGGGGTTTSTTTGLQRPRYSQAATGQQQTRIASAVPNNTLPNPSAAASAGPAAQG 

VATAATTQGAGGASTRRSHALKEVERLKENREKRRARQAEMKEEKVALMNQDPGKPNWETAQMIREYQSTLEFVPLLDGQAVDDHQITVCVRKRP 

I SRKEVNRKEI DVI SVPRKDMLIVHEPRSKVDLTKFLENHKFRFDYAFNDTCDNAMVYKYTAKPLVKTI FEGGMATCFAYGQTGSGKTHTMGGEF 

NGKVQDCKNGIYAMAAKDVFVTLNMPRYRAMNLVVSASFFEIYSGKVFDLLSDKQKLRVLEDGKQQVQVVGLTEKVVDGVEEVLKLIQHGNAART 

SGQTSANSNSSRSHAVFQIVLRPQGSTKIHGKFSFIDLAGNERGVDTSSADRQTRMEGAEIHKSLLALKECIRALGKQSAHLPFRVSKLTQVLRD 

SFIGEKSKTCMIAMISPGLSSCEHTLNTLRYADRVKELVVKDIVEVCPGGDTEPIEITDDEEEEELNMVHPHSHQLHPNSHAPASQSNNQRAPAS 

HHSGAVIHNNNNNNNKWGNAGNMDLAMLSSLSEHEMSDELIVQHQAIDDLQQTEEMVVEYHRTVNATLETFLAESKALYNLTNYVDYDQDSYCKR 

GESMFSQLLDIAIQCRDMMAEYRAKLAKEEMLSCSFNSPNGKR* 

(SEQ ID NO: 90) 

Name: KINESIN-LIKE PROTEIN 
Classification: motorjprotein 



Celera Sequence No, : 1420000133845 62 

AAATAAGTTGAATTTTTGACTTTAAAGCTTAAAATTAAGGGTTTTTTGCTTAATTAAACTCAATTTTTTTATAAAATATAATTAAACAATATTTA 
TTTTACTTATAAATCAAAAAACAAATTAAAAATATTAAATATACAAGAAAATAAACAACAAATTCCAAGTTTACACACTTTTGAGACTGTCAAGA 
AACTCTTTACACATTTTGTTTTCCTACTTTGTTTTGTTCTTTTTCTTAGAACGAACTAGATTTTTCCGTTATTTTTGGCTTTGCATTGCTTTTTA 
CAACGCTTTTATTGAAATTTGTTGACTTTGCCTGTGAAATTTTGCTGATCAAATTATTAAATTTAATAAAATGCCTGGAAAGAGATTAACTTTTG 
AAGTTACCCAATTAATAAACTATAACCACCATGTATATATTTATCATTAACTAGGGCCTGCAGCAAGTCAAGAACTTCTATCTCAGGTCCGAGAA 
GCTGCCCACGCCCATTCCCACTGGAGAATATCTGCTGATGATTGACTGGGTGTTCAACAAGAAACCGCAGGCTGCCACAAATGTTTACTTCACAT 
TTGTGGAGGACCTAAAAGATAGTTAACGAATCAATCGTTCAGACCAAAATGTTTAAACTGAACTGTAATTACAAAATAAACAATATAAGTCTATA 
TTGTTTTAGCCGATGAAAATTATATGGAATTTTGGCTTTGCCATCAACGAGAAATTGTTTCTATCCGTGATCTGTGGAATACAAATGCGACTATT 
AAGTCTTTGTTCCTGAGACCATCAAGAGCACTTCCAGCAGCCAGCGCAATATCGTAATCGTAATTTAAAACAAACAATACAAAAATAATTACCAA 
TTAAGCCCATTTTAAATTCACATTCCGTATTGCAAAGCACAAAATAGCCTTGTGTCCAGTGGTATCCAGAAAACCGGCCCATTTAATAAGCTTTT 
AAAAACGTTGAGGGATGTGCAATAGTTAGTTTATTCGTCTCTTTGGGATTTTAGCACTGTTCGTTACACTCATCGCCCGCGTCTGCCTTTGTCCA 
TACGGAGATTTTGTTGATATTTTTGCGAATCCTGACAAAAGCCCCACAAATCTGATCGCAGTGATCGCAGGCCTCGCCAATCATGCAAAGAAGCA 
CGTCCAGCCAAGTTTTGTCCAGTTCGGCTGTGGACCTTTTATCCACTTTATATGTCCAACGACCTCCGCCCTTGTTCGGCGGATCCTCCCACATG 
GGGCGAATGCCCTTCTTGAAGAGCATGTAATCACAGCCGCGGTTGAGCTTGGATGGCGTATCGATGCGGAAATACAGGTTCCAGAAGTCCTCGAC 
GGTATTGAAGCTGTCAATCTCCTTCAGCATGTCCTCCCATGATATTTCAGGGTCGTACTTTACGCCCCACAGGGTCCATGTGTTCTGCAGCCGGT 
GCTTGTTTGTGGCACCCAACGATGCCAGTTTGTTATTATCCAGAGAAGTCCCCATTTTAAATTCCGATGACTCTTTGGTTTGCCTTAAGTTTTCC 
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ATTTCGCAGATACTCTATGCTGAAGCCCCCTTTTAAAATGCATTCGTAGAATTCGTAGAATTGTTTTTAATGGACTTACAATATTTACCGACGAA 
TGTGGTCAAGCTTAGAGCATTTGATCATTTAATTATTGACGTTAAAGCAATGCACTGTCAATTAGGAAACAATCGTGATTACTAATGGCTGCATT 
TATATGTAAAATACATTTGACAGGCAATCGAAACATATCGATATATCGCGTGATGCTATCGTGAGAGGTTGGCCCAGCGCTAGTATTGATTGTTA 
AACAAAAAGGAATTTTGCACTAACTTGTGGGCCGCAGTTACTGCAAATTCGCGCCAGGATGGACATGATAAACAAGTTCTACTCATCCGCCGTAC 
AGACGGTGTCCACACTGTCCGGCGTTCTGCCCGGGAACAATGTGACGCGGGAGTACGAGGTGCTGGAACAGGTGTGCACTGCCGGAGTGGGTGAG 
TTTACCTGCATTGCTAGTTGAACATATTGTGGCTTGCTGATACCGAGTAATCCCACTGCCAGGTCTGATGTGGAAGGTGTACAATGGCCACAAGA 
AGAGCACCAAACAGGAAGTGTCCGTCTTCGTGTTCGAGAAAAAGTCGTTGGAGCGCTGGTCGAAGGATGACCGGGAGACCATGCTGGAAACACTG 
CGTCGCGGGGTTCAACAGCTAACAAAAATTCGACATCCTCACGTGCTAACCGTACAGCATCCACTGGAGGAGAGTCGGGACTCGCTTGCCTTCGC 
CACGGAGCCGGTTTTTGCCTCCCTAGCCAACGTGGTCGGCGACAACGTCCGCTCCGAAAAGAAGCTCTATGATGTAGAGATTCGACACGGTCTGC 
TGCAGCTCTTCGATGGCCTGCAGTTCCTTCACCAAGACGCCAAGATTGTGCACCGCAACATTAGCGCCGAGACAATCGTGATCAACAAGAACCGC 
AGCTGGAAGCTGTTCGGATTTGATTTCTGCATTGCCAACCAACCGGCCACCG 
(SEQ ID NO: 91) 

Exon: 1522.. 1001 

Start ATG: 1522 (Reverse strand: CAT) 
Transcript No. : CT3489 

ATGGAAAACTTAAGGCAAACCAAAGAGTCATCGGAATTTAAAATGGGGACTTCTCTGGATAATAACAAACTGGCA'TCGTTGGGTGCCACAAACAA 

GCACCGGCTGCAGAACACATGGACCCTGTGGGGCGTAAAGTACGACCCTGAAATATCATGGGAGGACATGCTGAAGGAGATTGACAGCTTCAATA 

CCGTCGAGGACTTCTGGAACCTGTATTTCCGCATCGATACGCCATCCAAGCTCAACCGCGGCTGTGATTACATGCTCTTCAAGAAGGGCATTCGC 

CCCATGTGGGAGGATCCGCCGAACAAGGGCGGAGGTCGTTGGACATATAAAGTGGATAAAAGGTCCACAGCCGAACTGGACAAAACTTGGCTGGA 

CGTGCTTCTTTGCATGATTGGCGAGGCCTGCGATCACTGCGATCAGATTTGTGGGGCTTTTGTCAGGATTCGCAAAAATATCAACAAAATCTCCG 

TATGGACAAAGGCAGACGCGGGCGATGAGTGTAACGAACAGTGCTAA 

(SEQ ID NO: 92) 

Start ATG: 1 (Reverse strand: CAT) 

MENLRQTKESSEFKMGTSLDNNKLASLGATNKHRLQNTWTLWGVKYDPEISWEDMLKEIDSFNTVEDFWNLYFRIDTPSKLNRGCDYMLFKKGIR 

PMWEDPPNKGGGRWTYKVDKRSTAELDKTWLDVLLCMIGEACDHCDQICGAFVRIRKNINKISVWTKADAGDECNEQC* 

(SEQ ID NO: 93) 

Name: eIF4E-related 
Classification: translation factor 



Celera Sequence No. : 142000013384566 

AGGCCTTTCTACGTTGTGTTTTAGCGAACTCGAAAGCGAACTGTCGCTCCGAACGGCGCACAAAACACTTTTAAGCGCGCGCGCTCTCGGTTTTT 
CTTCGTGTGTGTGTGAGTGATGGGAGAGTGTGCACTGTTATCTTTTCAACATGTCGCCCGTTCTTTGTTAACCGTTTAAATGGCTGATAATTGAA 
TATTCCCGATAATTGCCGCACTCAATGTGATACACACGTCCTCAATATGGGTATACTTTGTATAAAGTCGGGTTTCCTAGTCAGATAATCCCCAC 
TTACTAAGAAGTAAATTTATCAAATGCAATATGGCCGCTCTTTTGAGATGCGCATCAGCTGTTCTTTGTTTTTGTTTTGGCCGTGGGAGAGCACC 
ACGTCATGTGTTGCCAGGCGGTTTAATTATTGCTGACAATTTAATTCACATTTTGTCAAGTTGATTTGCACGTTCGAAGGGGGGATCAGTCACCC 
ACTTTGATAACTGCTAATATGGCGACGGCTCTTTTTTACGTGTCGCTTGCAGAGCTCCAAGAATGACACTTGATAAGAAGGATACATGGTGGGAA 
AACTTACAAATTACATAATTTGTTTGTTGATTGGGGAAACCCTACATTATCTCCGAACCCAGCAATAATATGTATTGTAAATAGCAACAGTCTCA 
AGCTCGATGCTGCTGTGAATTACCAATATATCAAAACTTAAAACATTATCTTAGGAGGTAGATGGTTTATCTGCAGTTTATTTTCAATTCTCATC 
GTTAGCTGATAAAACTCTCAGGAAACCACTTGAAATCCAAAGTGCAGCGTCCATATTTTGCACACGCTGACAACCCTGTCGTGAGTCACCTTCCA 
GCAGGCGCAGCAACAACAGCACTGGCAGCGGCCATATTTGAATGACACCGAAATCCCGCACTGAGCGCAAAACAACACATGCCCATGCACGCTCT 
CCAAAACCACACACACACAGCCATATCGATCGCATAGAAAGAGACGTACGCGACGCTATAAACAGCAGTTGATTTTCAGAGTTTCACCAAGAAGC 
ACATAAAGCGATTGAATTATTAATAAAAGCACAGGCACTGGAAGTGAGTTTTAATATTACTAACTAACTACTTTGTGAGTGTTTAGTTACGAAAA 
CTAGAAAATAATATTCTGGAAACTGTGTTTTTAAACCGACCACCATATAAATTTAAAAACCAAAACATTCCATTGGTGGTGGTTTTGTTGTTAAA 
AAAGAGCAC AGAAAATT CAAGAAAACGCT ACGACGC AAAACGAAAAAACT CGC CCAAAATGAAAAC AAAAGTGT CT GT T TT CCAATTGTCAATT G 
TTGTGAAACTAATTTCCTTTGTTTTCGCCTCTCGAATTTGATTTTAATTTACAGTAAACCGTCGACATGAAATTTTTCGTTGCACTCGCTCTCTT 
TGCGTGCCTCGGCTCTCTTGCCTTGGCCAAGGATGATGAATATTGCCAGAACACTGTAATCACCGCATGCTCTACGTCAGCCTTTTCGGGTGAGT 
AAAGAGAGCGCCTCTAAGAGAGAGTGGCTAATCTGTGTTGTTGCTTTTTGTATCTTTGTAAAACATAATAATAAAAACAAGCGCTCACGGGCTTT 
TTCACAGGCAAGAGTAAGTGAAATATAGATTGGATGTATTTCTGCACTATCTCAGATTTCGTGAAAATGAACACTTTCCAATTCTTACTGCATTA 
CGGCCCTTACATTATTATTTTAAAGATACAAATACAAATTCAAAATTCTATTGCCTTGTTTAGTATGGCACTACCTAGTTAGGAACCATTAAGAC 
TATGTCATTACTATTGGATGTGCTTCTGCTCTCCTATTCTAAGAGTGTGATCTTTATTATTTGCTTTTTCCCCGTAGCCAACTCCATTTGTAATG 
CCCGTTTTGCTGGCATTGACCATATTGAGCCCGAGATCCAGTCCTACATCAACGCCAACTTGGCCAAGTCGTACGACTACCTCCTGCTGGCCACC 
CACTTCAACTCCTACCAGAAGAACCGCCCCGGCTTCCAGAAGCTGTACCAGGGACTCTCGGACCGCTCCTTCGAGGACAGCATTGCCCTGATCAA 
GCAGGTGACCCGTCGCGGAGGAATCGTTGACTTCAACACCCGCCACGAGTCTTCCGGCTCCGTGAGCACCAAGCGCGGCACTCTTGAGGTCGACG 
AACTGCACTCCCTGGCTCTGGCTCTGGACACCGAGAAGCAGCTGGCCACCGGAGCCACTCACGTGCACTCCCGTGCCACCCACGCCACCGACGCC 
GAGAGGGATCCCGAGCTGGCCCACTACTTCGAGGAGAACTTCCTGGGCAAGCAGGCCGAGAGCGTGCGCAAGCTGTCCGGCTATGCCAACGACCT 
CGCCAAGCTGATGAAGGTCCCCGACCCATCCCTGTCCGTCTACCTGTTCGACGAGTATCTGCAGAAGCAGTAAGGAGTGGAGACCACCTCCTTTT 
CCTCCTTCTTTTTCTAACACGCTCCTACACTGTCTCCTCAAAATTGCTCTCATAATTCCCTTATAAATTTGTTTCCTTGGATTGTTCATCAATAA 
ATCGCAGTGCAGCAGCAGCATTATCAAAGCAGCCACAAGTGAATAATTTACACCGCGAAATTCACCTTGATTTACCAGCACACCAGATTGTGATA 
ATGTTATCATAAATAAGTTAGCAAGTTCGCCCCAGCAAATGTATCTTTAGTATGTACGTTACCGCTTGAACAAATAATAAACACATCAAAGAACC 
TAATTATCAACTAGTAAAACAGCAGCATTTTTGCCTCTCTGATAAGGTTTGGGTTTGGGTCGAAATGTATCTGGTCCTCAATGAGGGGCGTCTGT 
TTGCGTTTTTTCTCCGAGTGCTGCTGATTGCGGGCTGGCTGGCAGTGTACTTTGGATATCATATGGAACAATGAAATATACAAAGTCGAAATGGT 
ATCGGAAACTACTCAGTTCTGCGATCACAACGGGTGTAATGTTCCATAATGTTGTGGTTTCTGATATTCTTAGAATGTGCCTTCAACTAAAAACC 
ATATGTTTATGTACAAGCCATCAACGCTTGTCTAAAAATAAATAAATAAATGAGAATGTAAACCCATGTTGTAAATATCGTAGTATTGTATGCAA 
ACTACGTTACCATTTTAAAATCATCTTTTTGACCATTCAGAGACTGGAATCACTTAGGATCTAGTGGGATCTATCTATCTCAATATCAATTTATT 
ACTTTCTTTCAATTGACAACCTAAATCGATGGTGAAAAGTGAAACCTATCTCTCGCAGTGATCTGTGGGGCTTAAAGGAGCCCGACTATCTAATC 
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AGTGATACACTCCTCCGCCGCTGATACGAAGGGGCCACGTTTGTCATTCCATGCGAGATTCTCTCTGTCGCCTTATCACTGCCTGGTGCACACGG 

CTTTGCCGCTCCGGAACCGGAGCAGATCCGCCGGGCGCCATCTTGGCCGTCTGCTGACTAAGTCGGAGACTCCACGATGACGTGGATGTCACATT 

GTTGCGTAAGCCGATGCAATCTCGGATTTTAATTGTCCCCTGAATTTGCATCGCAGTTTCCCTATCTTCCTCTGCTCCCTCCTCACTCTCCTGAC 

CTACAATAATTGTTGTGAATCCCTATTTGTGAATATGAGTTTTTTTTTTATAATCCACATGGAAGCTCTATTAATTCTAGGAAATTCCGTAAGAT 

CCTATAACAGCTGCTCAATTAACGTGTGAAATAGGTAAAAATGTATCTTTAATCACTTTTGAAAGTGGT 

(SEQ ID NO: 94) 

Exon: 1001.. 1088 
Exon: 1385. .1514 
Exon: 1883.. 2774 
Start ATG: 1397 

Transcript No. : CT3604 

CGACGCTATAAACAGCAGTTGATTTTCAGAGTTTCACCAAGAAGCACATAAAGCGATTGAATTATTAATAAAAGCACAGGCACTGGAATAAACCG 

TCGACATGAAATTTTTCGTTGCACTCGCTCTCTTTGCGTGCCTCGGCTCTCTTGCCTTGGCCAAGGATGATGAATATTGCCAGAACACTGTAATC 

ACCGCATGCTCTACGTCAGCCTTTTCGGCCAACTCCATTTGTAATGCCCGTTTTGCTGGCATTGACCATATTGAGCCCGAGATCCAGTCCTACAT 

CAACGCCAACTTGGCCAAGTCGTACGACTACCTCCTGCTGGCCACCCACTTCAACTCCTACCAGAAGAACCGCCCCGGCTTCCAGAAGCTGTACC 

AGGGACTCTCGGACCGCTCCTTCGAGGACAGCATTGCCCTGATCAAGCAGGTGACCCGTCGCGGAGGAATCGTTGACTTCAACACCCGCCACGAG 

TCTTCCGGCTCCGTGAGCACCAAGCGCGGCACTCTTGAGGTCGACGAACTGCACTCCCTGGCTCTGGCTCTGGACACCGAGAAGCAGCTGGCCAC 

CGGAGCCACTCACGTGCACTCCCGTGCCACCCACGCCACCGACGCCGAGAGGGATCCCGAGCTGGCCCACTACTTCGAGGAGAACTTCCTGGGCA 

AGCAGGCCGAGAGCGTGCGCAAGCTGTCCGGCTATGCCAACGACCTCGCCAAGCTGATGAAGGTCCCCGACCCATCCCTGTCCGTCTACCTGTTC 

GACGAGTATCTGCAGAAGCAGTAAGGAGTGGAGACCACCTCCTTTTCCTCCTTCTTTTTCTAACACGCTCCTACACTGTCTCCTCAAAATTGCTC 

T C AT AAT T C C C T TAT AAAT T T GT T T CC T TG GAT T GT T C ATC AAT AAAT C GC AGT G C AG C AG C AGC AT T AT C AAAGC AGC C AC AAGT G AAT AAT T T 

ACACCGCGAAATTCACCTTGATTTACCAGCACACCAGATTGTGATAATGTTATCATAAATAAGTTAGCAAGTTCGCCCCAGCAAATGTATCTTTA 

GTATGTACGTTACCGCTTGAACAAATAATAAACACATCAAAGAACCTAATTATCAACTAGTAAAA 

(SEQ ID NO: 95) 

Start ATG: 101 

MKFFVALALFACLGSLALAKDDEYCQNTVITACSTSAFSANSICNARFAGIDHIEPEIQSYINANLAKSYDYLLLATHFNSYQKNRPGFQKLYQG 
LSDRSFEDSIALIKQVTRRGGIVDFNTRHESSGSVSTKRGTLEVDELHSLALALDTEKQLATGATHVHSRATHATDAERDPELAHYFEENFLGKQ 
AESVRKLSGYANDLAKLMKVPDPSLSVYLFDEYLQKQ* 
(SEQ ID NO: 96) 

Name: Ferritin 2 light chain homologue, FER2LCH 
Classification: ligand__binding_or_carrier 

Celera Sequence No. : 142000013384859 

C AAC AATT GT AAAT T T T T C G AT C AAAC G AT T AAG AT T T GT AAAAAAAGGT T G AGAAAAT C G G AT AAAAAGTT T GG AAAC T AT CG AG T T AAGT T T G 
GT GT AT G AAAAAAAAAAAC T T AT C AC ATTGC C ACG T T TC T AAACG AAAAC A T AAC AAT TAAAAAAA T AAC AT T TT T AAT CG GTT TAG GG AAAT AC 
AAACCAAAAGGGGAAGAT ATT ACCATTTAGAGGCGAAAGGTCAACAAG AAAC ACATGCTTTCTCTTTCT AAAT GAT TTCACATTTT ATT TTGT AT 
AGTTTCCTTTTATCATATTTTGTGCAGATATTTTCTTATAGCAGCAATTTTTGTATAATAAATATATTTTTGTCATTTAATTATTTTCTTTCTCC 
GCGTTTTCTTTAATTTCTTTAGTTTACTGTCACTTAGCTTAACGAATAAGAATAGCGCAATTTATTGCTTCGCACAATTGAACATTGAAGAATTG 
CCTATATATATATAAATGTATATATATTATGTCTCTCAAAGGTACTTGCATCTGAATAATTATAAAAACAAAAAAATAAAAGCGTAAAGAGGACG 
AAGTTACGCACATTTCCCTCTGATCATAATCAACAATAATGGTGGGAGTAAAGTATAAGGTAAAATAATATAATATATAATGGAGATATATAAAA 
GATCATAGAAATAGCCATATACATATAAATACATAAAAGCAATTCAATTGGATCATATCATCATCATCTTTTCATCTTTTCGGGCGAACAATCAT 
CCTCTTTGGAATTCTTTTCGTTTAAGTAGGGAAAGTGGAAATGAAGCCGTCCATATATATGGTAATCTGTCCAGTTTTTGTCTACGCGTTTCTGT 
TCAGTTCAAGCCTTGTTTGATATTGCTGCGTTTTAAGGCCTCTCCCACTGTGGAATCCATATTGCTGGCTCTGCCTTGATCTAAGTGTTTTCCTC 
TGCCGTTCAGTTATAGTTAATTTTGAAATATACTCGTACTTGTGTTATTGTTAGTTATTGTTAATTGCACTGCTGGCGGCCACTAACTCCGGAAC 
GGTGGACGAGGCGGCACTGCTCTGACTGACGGTGCGGGCATGATCCACATGGTTTTGGGTCGGCGCCGCAGACGGAGCCGCTCCCGAGGCTGCTG 
GCATCGATCCCGAAGCTTCTCCCGACTGCACTGGCGTGCCCTCGCTCGCTCCGGATCCGTTTCCATTTCCGGGTACGGGCTCCACCGTGGGAAAG 
ACGGGCGGCAAGGCAGGCGCGACCACGGGCGTCTGCACCATGGGTGTGGGAGGCGAACGATTCGCTGGCAACTGATGAGCAGTGGCTCCGGTCAT 
CTGCATGTGGTACCTTTGGAGTGTAGATTGATGAACATTAAACAAGGACACAATATCAAACGATCTTTGGAACCCACTTGTAAATGTTCTCGGTG 
CTTTCGACAACACCTTGGGTAAGATTGAGGCTCCGCCCCTTGATGCCCGACTCCTTGGCCGGTGGGGACCACCAGTTGAAGACGCGTCCCACCAC 
AGGCACGTGGCGCAGGATGCCCTCCTGCCAGAGCTTCTCCTCCGATTCGCGCTGCAGATCAGCGGTAACCGCCTCAATGCCCTGAGCGTACAAAT 
AAATTAACACATGTTACATTAATCCAGTACGAGGTGGTAGCCAACCTTTTTAACAATCTCCGACATGGTGTCCAATACCCGCGAGTTTTCCTGGA 
CGCTCTTGCCCACCGTGACCACCAAGTCTAACAACTCTTCGACCTCCGTTTCATGGGTCACCATTCCGAATCGCACACAAATCAGACCATCGGTG 
CCTTCGCCTAATGAAAAGGCATTGTCCGTGGATTTCAGGGCCTCCACCAGGTCGATGTTCAGCTTGTTTAGCTCTGTTTTGGCCTGGTCCGTCAG 
CAGGGACTCCCAGCCCTCGGGCACGAAGCGAACACCGCCCATGCCCGCCCATTCGGGCAGTGGAACGAGGCGCAGCACGTCGCTGCACTCCACCA 
GATGAATGAACCGTGCCTTGTGCTTGATGGTCGCGCGCAGGATGTCCACATGTGCCTCTAAGCTCTGAGCGAAGCTCTCCAGGTTCTCGGAGCTT 
GGCGGCTGCTCACCCAACCCCAATTCCAGCGGACAGTAGCGAATGCAGCTGCCGTGCGTCGGGTGTTCAATTACCTCAAAGTCAAACTGGAAGAA 
CATGGTGATTAGGCTCATACAAGGAGATGATTAATTCACACTGCATACATTGGGACAGTCGCGTTGCAAGATTTGACCCAGCCAGGAGTTGAGCC 
GGTCGAAGTAGGAAGCATTGTTAATCTTCTCTAGCGGCTTGAGGCCTTCGGCCGTTTCCCGCTCAGCAGCGGCAGCGGCGGCGGAAGAGCCACTC 
CCGCCCAGTGGAATGGTGGTGCTGCCGTCGAACTGATAGGCGACCACGGGGGCGGCGGCATCAAAAAGCGCCTGCACATCAAAGGGGCTCTGAAT 
GACGTCCGACAAAGAGGCTCCGGTCTGCGCTCCTGGAGTATGGCTCTGAAAATGCCAAGGTCCTTTGGTAAACTAAAAACATTATCCATAAACGT 
GTGTGCAATAATTTACCAACACCCGGATGCCCTCACACTTGGAGGCAATCTCGAAGAGAATACTGCACGTCTGGAAGGCCACATGCAGACGCTCC 
GCAATCGCCTTCCGGCCCAGCGCCTGCAAACTGGTCCACAAGGACAGGGCATTAAGACGACGCGACAGTATGGGATCGGATTCGAAGGCACTCAG 
AGCGCTGTTCTGCAGCGGACGGTGCAGCAGGACGATGGGCAGACTGGGCACACCCAGCCAGCTGCCCAAATTCAGGGCCATCGAGTGGAGTACCT 
CCTCCACATGGCCCTGATTTTGGGCGCACACCAAGGCGGCCAGTCCGTGGCCACTGGCGTGCAGCCACATGTTGTGCGCCTTGCATACGTCCCTC 
AGCCGGAGCAGGTTGTCCACATAGCCACACAACGAGGCTCCAATGTCGGCGACCACCAGCAGCGGAGTCCTGTTATTGCCCACATCCTGCTGAAT 
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TTGCTTTTGCAGCAGCGTCACATCCATTGTCCCGGACTGACTGTGCTCCGGAATGACCTTAATAGCCTCCAGCGGTATGCCCAACTGGCGGCATG 

CGTAGTGCAGCCTCATCGGCGTCGTATTCTCCGAGATGTAAAACGTAGGCTGCGCCAGGGCGGGTATGCCACCCTCCAAGTACCCCGGACACCTG 

GCGACGATGGCCAGTCGCACCGTACGCAGCACCGCGTCTGCATTGTCCGCATGGAAGCTGCTCGCCGGGTGGGCAAAGTGAAACAGCGTGCCCAA 

CCAGCGGGTGGCGTCCCCGGAAATGCTGTTCGTCACCCGGAGCAGTTGCTGACGGTCCAGATGCGACAAATAGGCCACAATGCTGTGCGATATGA 

GAGCCAAGTGGGAGACGTCATCCAGGGCGGGCAGCGGGAACTCCGGATCATCGTCGCTGTCCGTGTAAGAAACCAATTGCTCCAGCGACTTAAGA 

ATGTCGGAGGGCACGCGATGCTTCGCTGGCAGGAGGTCCTCGCTGGTCACCCGCTCCCCAGCCTCTTCCTCGTTCTCGACGGGATTTGGGCTCTC 

CGGCGTGGTCGACACCTTAACGTTCTCTAAGCGCTGCAGCACCTGCGAAGATCGCAGCTCCAGCTCCGCCAGGCCCGATCGAATCTATGTGGCGA 

ATACATGGCTAATTAGGTGGGATAGTGGACAAGGTAAACAACAGCGCTCCAGAAGTTGACGTGTCTTCCGTCGTCGTCAGCAAAAAAAAAAAAGA 

TAGAAGCAGGGCGAAAAAAACCAAACACTCACACTGGAAGCGGCGATTTCGGCCACCGCTGGCGTTGATTGGCCCGGATCCGGCTCTTGGTCTAC 

GAGTGGGCTCTGGCCGGGATCGTCCACCGATCTACCAGCCGCTGACATGGTTTCAGTGCGGTCTAAATCACTTTTAACTTATCGCCTGCGATGCA 

AAATAATTGTTTTCGTTTCGTGCCACTGCGTCTGTGTGGGTGGGTGTGTGTGCGTGTGTGTGTGGGGCGGGCGAGTGTGTGTGTGTGTTGATGGA 

AGAGAGGCTGATGTGGAATGGTGTGACCGTAGTGGGTGCGTCTGACTAAGATCGCAAATAGTGTGCGATTGTTGGCCCATGGTTCTGGTGATGGA 

ATAGGTCGCAAGGAAGTTGGAAAACTATTATAATATCCGCCACTAAAAGACGTTGGACAGCAGCACACAAACGTACATACATAGCACCTACTCGT 

AACCTTAAACGACGAGTTTTCACAGTGGTTTGTGTCACGGTTTGTGTCTCAGCGATTTTGAAGACAGCACCGCACGGTCATACTACACAGCCATC 

GATAGTTGCGGAAAACTCATCGATGCTGCGAAAGTGCGATAGTATCGAATAAACATGAGTGTGTGCATGAGTGTGGGAATTTATTAAACAAAAAC 

GAAACGCGGACAAACTATATTTATGTAATAAACACTAAGCCGCAGCGCCAACGAGTAATGAACAGTCCACGGCCAGGTCGTACTATTCAGGCGZVA 

CGCACCTCGCAATCGACTGCAATCAAAGTGCAATAGCTCAATCAATTGATTCGTTTTGCTCAACCAAAAACAAAATCTATTCCCAAATCGGTGCG 

ATAGTTGCCAAAATATAAAAACTACACTACGCTAAAAAAAAAACAATACACTCACACACTGGCGTACAAGACAACAAAAGAGAAGAAGAAGAGCA 

GACGCCAGATATAAAAAGCCCCCAAAAGAATTGGAAATAAGACCATACCCCTCCTTCTCCCTTGAAAAGGGACCTTAAAACTAGGCGACACCGAA 

TAATTGAACTCAAGTAAAAAACCGGGAAAAGAGAAAAACACTTTCAACAAAATATCTAGAAGCCTTGTTATCGATTTTGTTCCGGGTTTTTTTTG 

TGTGAGTGTGTGTTGTGTGAAGCGCGCCCGCGGGTGTGTGGGTGAGTGTGCGTGTGGCTCTCGGCGCG 

(SEQ ID NO: 97) 

Exon: 4008.. 3833 
Exon: 3694.. 2582 
Exon: 2515.. 2234 
Exon: 2176.. 1661 
Exon: 1601. .1408 
Exon: 1343.. 1001 

Start ATG: 3943 (Reverse strand: CAT) 
Transcript No. : CT3725 

AAACGAAAACAATTATTTTGCATCGCAGGCGATAAGTTAAAAGTGATTTAGACCGCACTGAAACCATGTCAGCGGCTGGTAGATCGGTGGACGAT 

CCCGGCCAGAGCCCACTCGTAGACCAAGAGCCGGATCCGGGCCAATCAACGCCAGCGGTGGCCGAAATCGCCGCTTCCAGTATTCGATCGGGCCT 

GGCGGAGCTGGAGCTGCGATCTTCGCAGGTGCTGCAGCGCTTAGAGAACGTTAAGGTGTCGACCACGCCGGAGAGCCCAAATCCCGTCGAGAACG 

AGGAAGAGGCTGGGGAGCGGGTGACCAGCGAGGACCTCCTGCCAGCGAAGCATCGCGTGCCCTCCGACATTCTTAAGTCGCTGGAGCAATTGGTT 

TCTTACACGGACAGCGACGATGATCCGGAGTTCCCGCTGCCCGCCCTGGATGACGTCTCCCACTTGGCTCTCATATCGCACAGCATTGTGGCCTA 

TTTGTCGCATCTGGACCGTCAGCAACTGCTCCGGGTGACGAACAGCATTTCCGGGGACGCCACCCGCTGGTTGGGCACGCTGTTTCACTTTGCCC 

ACCCGGCGAGCAGCTTCCATGCGGACAATGCAGACGCGGTGCTGCGTACGGTGCGACTGGCCATCGTCGCCAGGTGTCCGGGGTACTTGGAGGGT 

GGCATACCCGCCCTGGCGCAGCCTACGTTTTACATCTCGGAGAATACGACGCCGATGAGGCTGCACTACGCATGCCGCCAGTTGGGCATACCGCT 

GGAGGCTATTAAGGTCATTCCGGAGCACAGTCAGTCCGGGACAATGGATGTGACGCTGCTGCAAAAGCAAATTCAGCAGGATGTGGGCAATAACA 

GGACTCCGCTGCTGGTGGTCGCCGACATTGGAGCCTCGTTGTGTGGCTATGTGGACAACCTGCTCCGGCTGAGGGACGTATGCAAGGCGCACAAC 

ATGTGGCTGCACGCCAGTGGCCACGGACTGGCCGCCTTGGTGTGCGCCCAAAATCAGGGCCATGTGGAGGAGGTACTCCACTCGATGGCCCTGAA 

TTTGGGCAGCTGGCTGGGTGTGCCCAGTCTGCCCATCGTCCTGCTGCACCGTCCGCTGCAGAACAGCGCTCTGAGTGCCTTCGAATCCGATCCCA 

TACTGTCGCGTCGTCTTAATGCCCTGTCCTTGTGGACCAGTTTGCAGGCGCTGGGCCGGAAGGCGATTGCGGAGCGTCTGCATGTGGCCTTCCAG 

ACGTGCAGTATTCTCTTCGAGATTGCCTCCAAGTGTGAGGGCATCCGGGTGTTGAGCCATACTCCAGGAGCGCAGACCGGAGCCTCTTTGTCGGA 

CGTCATTCAGAGCCCCTTTGATGTGCAGGCGCTTTTTGATGCCGCCGCCCCCGTGGTCGCCTATCAGTTCGACGGCAGCACCACCATTCCACTGG 

GCGGGAGTGGCTCTTCCGCCGCCGCTGCCGCTGCTGAGCGGGAAACGGCCGAAGGCCTCAAGCCGCTAGAGAAGATTAACAATGCTTCCTACTTC 

GACCGGCTCAACTCCTGGCTGGGTCAAATCTTGCAACGCGACTGTCCCAATTTTGACTTTGAGGTAATTGAACACCCGACGCACGGCAGCTGCAT 

TCGCTACTGTCCGCTGGAATTGGGGTTGGGTGAGCAGCCGCCAAGCTCCGAGAACCTGGAGAGCTTCGCTCAGAGCTTAGAGGCACATGTGGACA 

TCCTGCGCGCGACCATCAAGCACAAGGCACGGTTCATTCATCTGGTGGAGTGCAGCGACGTGCTGCGCCTCGTTCCACTGCCCGAATGGGCGGGC 

ATGGGCGGTGTTCGCTTCGTGCCCGAGGGCTGGGAGTCCCTGCTGACGGACCAGGCCAAAACAGAGCTAAACAAGCTGAACATCGACCTGGTGGA 

GGCCCTGAAATCCACGGACAATGCCTTTTCATTAGGCGAAGGCACCGATGGTCTGATTTGTGTGCGATTCGGAATGGTGACCCATGAAACGGAGG 

TCGAAGAGTTGTTAGACTTGGTGGTCACGGTGGGCAAGAGCGTCCAGGAAAACTCGCGGGTATTGGACACCATGTCGGAGATTGTTAAAAAGGGC 

ATTGAGGCGGTTACCGCTGATCTGCAGCGCGAATCGGAGGAGAAGCTCTGGCAGGAGGGCATCCTGCGCCACGTGCCTGTGGTGGGACGCGTCTT 

CAACTGGTGGTCCCCACCGGCCAAGGAGTCGGGCATCAAGGGGCGGAGCCTCAATCTTACCCAAGGTGTTGTCGAAAGCACCGAGAACATTTACA 

AGTACCACATGCAGATGACCGGAGCCACTGCTCATCAGTTGCCAGCGAATCGTTCGCCTCCCACACCCATGGTGCAGACGCCCGTGGTCGCGCCT 

GCCTTGCCGCCCGTCTTTCCCACGGTGGAGCCCGTACCCGGAAATGGAAACGGATCCGGAGCGAGCGAGGGCACGCCAGTGCAGTCGGGAGAAGC 

TTCGGGATCGATGCCAGCAGCCTCGGGAGCGGCTCCGTCTGCGGCGCCGACCCAAAACCATGTGGATCATGCCCGCACCGTCAGTCAGAGCAGTG 

CCGCCTCGTCCACCGTTCCGGAGTTAGTGGCCGCCAGCAGTGCAATTAACAATAACTAA 

(SEQ ID NO: 98) 

Start ATG : 66 (Reverse strand: CAT) 

MSAAGRSVDDPGQSPLVDQEPDPGQSTPAVAEIAASSIRSGLAELELRSSQVLQRLENVKVSTTPESPNPVENEEEAGERVTSEDLLPAKHRVPS 
DILKSLEQLVSYTDSDDDPEFPLPALDDVSHLALISHSIVAYLSHLDRQQLLRVTNSISGDATRWLGTLFHFAHPASSFHADNADAVLRTVRLAI 
VARCPGYLEGGIPALAQPTFYISENTTPMRLHYACRQLGIPLEAIKVIPEHSQSGTMDVTLLQKQ1QQDVGNNRTPLLVVADIGASLCGYVDNLL 
RLRDVCKAHNMWLHASGHGLAALVCAQNQGHVEEVLHSMALNLGSWLGVPSLPIVLLHRPLQWSALSAFESDPILSRRLNALSLWTSLQALGRKA 
IAERLHVAFQTCSILFEIASKCEGIRVLSHTPGAQTGASLSDVIQSPFDVQALFDAAAPVVAYQFDGSTTIPLGGSGSSAAAAAAERETAEGLKP 
LEKIKNASYFDRLNSWLGQILQRDCPNFDFEVIEHPTHGSCIRYCPLELGLGEQPPSSENLESFAQSLEAHVDILRATIKHKARFIHLVECSDVL 
RLVPLPEWAGMGGVRFVPEGWESLLTDQAKTELNKLWIDLVEALKSTDNAFSLGEGTDGLICVRFGMVTHETEVEELLDLVVTVGKSVQENSRVL 
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DTMSEIVKKGIEAVTADLQRESEEKLWQEGILRHVPVVGRVFNWWSPPAKESGIKGRSLNLTQGVVESTENIYKYHMQMTGATAHQLPANRSPPT 
PMVQTPVVAPALPPVFPTVEPVPGNGNGSGASEGTPVQSGEASGSMPAASGAAPSAAPTQNHVDHARTVSQSSAASSTVPELVAASSAINNN* 

(SEQ ID NO: 99) 
Classification: enzyme 



Celera Sequence No. : 142000013384455 

GGTTGCCAAACGGAGAGCAGCATACCTTCCGGATAGCTGGCATCGATCACGATTACTTCCAAGCACTGTAGCAGCTCCTGGACCTCACTTAGTGG 
CCCCTGCAGCTTTCGCTTTATCTGACGCACTGGACGCACTGCAGCCACTACTACCAGTAGATTCACATTGGTGGTGATTATCTTATCGGGAAATC 
CCAAGCCGGAACGCACATCCGCCAGCTTAAGGACCGCACCAAGTGGCCTTATGGGCTGATGGAGTAACTGCTGGAGAATTGTTATTTGGCCAAAG 
TCGTCATTGTCGTGGCGCACCACATAGCCGGATCCCTCGTTGACCACCAGGGCGCAGGAGACCGTCGCTTGGGGCTGATAACGCTGTTCTCCCGG 
AGCAGCAAACGGTATGGACATCACTTTAGCGCCCACAATGTCTACCACATGGCCGATCTGCAGCATGGCCGCGTACTCATCCACACAGTCCCGCT 
GGCCCCAGCACTTGCAATTGGTCAGGTGATTGGGGGAATCTCGGATGGTCAAGCTGAGAACCCCTCGTTCGGTGCCACTCATCTTGTCATAGAAG 
ATGTTCGGCGAGGATTTGGAGACAATCAGGGCAACGGTGGAGAAACGGGTCATTGTGGGACGCATCTCTGCCAACCGCTGGAACTTAATACGCCT 
TGCCATTGAAATTATTTAAGAAATCGCGACCAGTCAGATTGCAGTGCTTAAGAAAACCGTTACATTGCCGTTACCGCCGATTAGTAGTTATCGAT 
GTTTGTAGTGATGGTACACCACAAATGGCCGTAGATTCAGTTAGTAACAGAAAGTATTGCTAACAACTTAGTTTGATAAAAACCACAATTAAAAC 
ATTTACAGAAAGTTGCGCTTATTAATTGTAACCATTGGAATCGATAATAATCGGATGGCATCCGAATACCAGACGTACCATCCCTAATAAGCTGA 
CACACACTACGCATATGTGCACCCGCGCACACACACACGCACACACAGAGGGGAACGGCACAGATTTACTGCGTTAAACAAACTCACTTGTCGCA 
CTTTTCCACGAATGCCAGTGCGCTGAAAGTGAAACCCATCCGCCGGAGCATAGTAGGATTAGCCAGGACAACCGTTCAGCTCAGTGCCCCAGAAG 
CATTTAATTGGAGGAGCAACAGTCGAGCAACCCGCGACCCGCAGACCCAGTCTTGGCAGCCCACAGCCACTCAAAATGGTCAAACTATTCGCGTT 
GAGCATCTTTCACAAGGGCGCCAGCGAGGCGCGTCTCCTGAAGACCGCCTCGGACCTGCAGTCCTTCTCCTTTTTCCAACGTGGCACCGTCAACG 

AGT T CAT GAC GT G AGTAT G C AT GT GG ATG CC AAAG AAC GGC GC AG GC T C AAAG AGT C AT C AAT G AC G AAC ACCC C C C T AAAC AAC TCCTCCTCTT 
GCAGTTTCGCTTCGAAGACGATTGTGGAGCGCACACAGCCGGCCCTGCGGCAATCGGTGAAACAGGATGCCTACATGTGTCATGTCTATGTGCGG 

GC GG AC AAT C T GGC C GG T GT AC T GAT C GCC GAT CAT GAGT AT C CGC AT C GCG T GG C GC AC ACGC T T AT C AC AAAG AT T C T GG AT G ACT T T ACGGC 
CAAGGTGTCGGCCGATCAGTGGCCCAATGGCACGGAGGCCACTATCTCATTTGACTTGCTGCCAGCGTTTCTAGCACGCTATCAAAATCCCGTCG 

AAGC AG AT C C GC T G AC G AAAAT GC AAAAC G AC CT GG AC G AAAC GAAG AT CAT AC T G AAG AAT AC C AT C G AGG C GGT GCT GG AGCGGGGC GAG AAG 
CTGGACGACATGGTCAGCAAGTCGGAGAAATTGTCGCTGCAGAGCAAGGCGTTCTACAAGACGGCGAAAAAGACTAACTCCTGCTGCAGCTTCAC 
CTAGACGACAGCTGCACCTCTCATCACCCCCCACCACACGCTAATCCTCTCAAGCAACAACCACAACAGGAAACACCAGAAAATGCAAATCAAAG 
ACAACACAAAGGATAATGGATAACACGATAACACCAACAAACTACAACCAAACAAACTCCAAGGGAGCGGTGCGACTTGGGCTTTGATGATTGCT 
TTTTATAAATGAATTAGTTAATTTTATATACTATATATATATATTATATTATATACTGTATGAATAGGATGGCCAGTTGCCAGTTACGATAGTCC 
GCTAACCCAATGTCTACCTGTCTGTATATGTCTACGCTTTGGATATCTAAATGTTAATTACTACCAATACAGATTACGATCTAAACCCATGTGCT 
TAACATAATCTTCTCTTCACTTATGTGTTGGCACGGCGGTCCAAAATTGGCCCTTTGCTAGCTATTGATGTTGTGGGCCTTCTTATCGCCTCCAC 
GGCGGATCTATGTATACCAATCGGATCGGAAGGCTGCACCTATGTATACACATGCAGTCGCACTTCCGATCCGGGCAACTGGCGGTCGGAATGTG 
TGGGCGTTTCAATAAAGAAGTATTATGGGAAATGTGAACCAAGCTCGTTTTCATTGCTCACATATACAAACCTTATCTTTGTCAATTTTGGCAAA 
ATGTACAAACTATTTGTACGAGCTCTGTACATAGCACATGAACTTATTTTTATTTATGTAATACGTACCACTTTATCTTAAGTGCAAATTCCATT 
CCTCCCAAAAAAGCAAAATCAATTTAAAAGTGATTTTCTGTTGGCGTTTATTTATTATATAGAAGACACCTATTTCGCTTCTTGTAAAGATTTCC 
GTTACATTTTTCATATTTATGCGTTTTCCTTACGTTTTCGTGTTTAACCTTTTGGATTTTGGTTATGTTTAATGGGTTCGGTTTGTATAGCCTAG 
TATAAGATAGGTAAATCGATTACAATTTAGTTTTCGCCCCCTCTCCATCGCAAT 
(SEQ ID NO: 100) 

Exon: 1001.. 1340 
Exon: 1430.. 1904 
Start ATG: 1216 

Transcript No. : CT3893 

GGGAACGGCACAGATTTACTGCGTTAAACAAACTCACTTGTCGCACTTTTCCACGAATGCCAGTGCGCTGAAAGTGAAACCCATCCGCCGGAGCA 

TAGTAGGATTAGCCAGGACAACCGTTCAGCTCAGTGCCCCAGAAGCATTTAATTGGAGGAGCAACAGTCGAGCAACCCGCGACCCGCAGACCCAG 

TCTTGGCAGCCCACAGCCACTCAAAATGGTCAAACTATTCGCGTTGAGCATCTTTCACAAGGGCGCCAGCGAGGCGCGTCTCCTGAAGACCGCCT 

CGGACCTGCAGTCCTTCTCCTTTTTCCAACGTGGCACCGTCAACGAGTTCATGACTTTCGCTTCGAAGACGATTGTGGAGCGCACACAGCCGGCC 

CTGCGGCAATCGGTGAAACAGGATGCCTACATGTGTCATGTCTATGTGCGGGCGGACAATCTGGCCGGTGTACTGATCGCCGATCATGAGTATCC 

GCATCGCGTGGCGCACACGCTTATCACAAAGATTCTGGATGACTTTACGGCCAAGGTGTCGGCCGATCAGTGGCCCAATGGCACGGAGGCCACTA 

TCTC AT TTGACTTGCTGCCAGCGTTTCTAGCACGCTATC AAAAT CCCGTCGAAGCAGATCCGCTGAC G AAAATGC AAAAC GACCTGGACGAAACG 

AAGATCATACTGAAGAATACCATCGAGGCGGTGCTGGAGCGGGGCGAGAAGCTGGACGACATGGTCAGCAAGTCGGAGAAATTGTCGCTGCAGAG 

CAAGGCGTTCTACAAGACGGCGAAAAAGACTAACTCCTGCTGCAGCTTCACCTAG 

(SEQ ID NO: 101) 

Start ATG: 216 

MVKLFALSIFHKGASEARLLKTASDLQSFSFFQRGTVNEFMTFASKTIVERTQPALRQSVKQDAYMCHVYVRADNLAGVLIADHEYPHRVAHTLI 
TKILDDFTAKVSADQWPNGTEATISFDLLPAFLARYQNPVEADPLTKMQNDLDETKIILKNTIEAVLERGEKLDDMVSKSEKLSLQSKAFYKTAK 

KTNSCCSFT* 
(SEQ ID NO: 102) 

Classification: hypothetical 



Celera Sequence No. : 142000013384012 

TGTGTGGATTCGTGGAGCAAATCGAAGTTGGCAGGATTAAAGGCATTCGCGAACGCAGAGGCAGCAGCTAAAGGGCCACCCATGCACCAAAGAAA 
CAAACTAGCTGGCCCAGCGGTTCATTTATCTGCGACTAACAAGCTAAAATCGGCGTAATTGGCAAGTTCAATGCACGACTAATATATTTGGCCAC 
ACTTGCCAGTGTATGTTTGTTCGTGTGCGTGCCTGTTTGTGTGGTGTCGTCTCCGTACGTGTTTGTGTGTGTGTCCGCGACGAGTATGTGTGCGC 
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ACATATTCGTATATCTGCAAGACTCGCGGATGAAACAGCGTGAAAAGTGAGCAAAATATCGGGGAAAAACGGCACGGTACGGCCACAGAACGGAA 
CGGTACGGTGCGATACGATATCGAGAAAAGGCCATCAGCCAGAGTACTCCTCGATCCGGAAACGGATAATGGGGATATAGATGATACAGCCAATA 
GCTCCCCGAAAGCCACAGCACACAATAGGTCGGTCGGTCGGTCGGTTGGTTGGGCGACGTCGCCGAGAGTTAAACATCAAAAATCAAACCCAGGA 
ACCGACGGAGCTAAGTGTCAAGTGTTTGGTGTTGTGAAATTTTTCGTTTCCAGCTCTCCGTTCAGCGTTCAGCCAATTTGCCAACCTTTTCCCCT 
CTTTAATTTGTTATCCCCGGACACCACAAGAGTTTCGATCAAAATTTTCAATTGAACGCAACATCGCCATAAACAAAAACATAAAGGTAAAATTC 
GGCCACTACCCAAACAACAAAAAAATAAAATCATAAGAAAAAATCACGCAGTAACGTAGAAGAAGCAGCAGCCGAAGCCGAAAAAAGACGAAAAG 
AAAACTTGTTTGAAAAGGCCGAACCGAAAGAAGAAAAAGAGCAGCAGCAGCAGCACAGCGGCCAAGTTTGGAAGAATTTCAGAATTTCAGCGTGC 
TAGGAACGGAGCAGCGGACCGGCGACGTCACCAAATAAACGCCCATCACCATGGACGAGGATGACAACCTGTCCAATGCCGACATAAGCATAGAT 
GACGAGGAGGTGGTGCGGGAGCAGACCCACCATCCGCCCATGCAAGAGGATCAGGAGCTGGATAACGAAGATGGGTCATCCGACGTGGATCTGTC 
TTCGGTGTACGTCGTGCCACCCATAGTCTCGGCACCATCGCCCGCATCAGTGCGCATCGCTGGTGGATCCACAATTGGTGGCGGAGGGGTCGCAG 
CTGGAGCAGTTCCAACAACCACCGATAACTCGCATTCCCCCTTCCACGATTGGGATTCCAGTGTGGCGGCCGCTGCTGCTCCACCGCCGCCACCG 
CCCAGGACAGGACCAACTACAACTACAAGCTCCGGCACAGCCGCGGAATCGGGAGCCGCGTCCGCCTTATCCGACGATGCCGGCGCCAAGCCAAG 
TTTCTTCTTTGGCGCCTCCTCGTTCAGTTTGCGCAGCCGCAAGGAGGTGGCCGCCCTGATCAACACGGAATGCTGCCGTGGCAGCCCCACACCCG 
ATCTCGACTCCATAATGGATACCCTATTCAATCCGGGGACGCCCATCGACAATCTGGACAACATCGAGTGGATACGATGGCTCATTGCCGGCGGA 
CGGACGCCGCAGGAGTTTGTTAAAATTGGTACGTCCCAATGTGTGCATATCGATAAGTGTGTTGCATTTTCTATCGCCAAACAGCTGATCGCAAG 
TCCGTTGCCCCTGAAACTCTGACCGAGCATGTGTGTGTTACCTGCGTTATGTGTGTGTGTGTCTGTGAAATGTAATGGTTGTAGGGCACGCAAAG 
AAAAGGTAGAGAGGAAGAGAGCATAAAAGCAAGGTGCAAGGCAGCGTGAAGAGTGCGAGTGCACCACCATGGCAACAGCAATCAAGCGCACTCTC 
TCGCACACACAGCACACTCACTACACACACATATGCACTGATTAGGGGAGTCAACTTTAGCCCAATTTAATCTCAACTGCTCATCTATTTGCAAT 
GCTGTATTTGTGCATAATTTTCATCACTACTTATTACTTTTTGTGCCAGGCTGCGTATGTCGCCAAAGTTGCTTTTATCGTTTATGGTTGTTTGT 
CGCGTCTAGTCTTGTCTTGTCTTGTCCCACTTGTCAGCTCTGTCACACGCACACATACACACACTCGTGCAAACTGGCGGGCGGAGACGTACACG 
CACAGTGGAATGCAGCTTGCACCTTGTGGCTTTGTATCAGCTGTTTCGGTACGCCTTGCGTAATAAATGTTTCCGCAGCATCCGTCCCAAAACCA 
CACGAACACGAATGTAGATGGTGCTTGATTATATGATGGCACCTTTTGTCACCGCTAGGTGCTCCTTTTTTTTTTTTGTGTTATATTTCTTTCAC 
CTTTTTGTTTGTTACTCGTTGCAAGAACAATTGCAAAATTGCACTGGTGCGGAAAAAGTTGATTGACCGTCATTGTTGTTGATTTGACAAAAACA 
CGCACAGTGGTCGCGAGGGGCGGATGTGATGTGGCCCACTGTAAGCTGTCAACGGCAATTGACCGAAAGACCGTTATGTTATTGGGTGGTGTGTA 
TGTGTGTGCGTGCGTGCGTGTATGGGTGAGGGTGAGAGAGTGCACCTTTTTTCCGTTCCCATGTCATGTCACCTCCCCTTCCCTTCGCCACACCC 
TTGCTCCTCCATTCGGACCTTGTCCAAAAGTTCTCAGTTAGAAAGAGACTGCTGAGCCAAGATAGTGAGAGCGAGAGAGAGTTGCTAACTTTCCG 
TTGATCGATCGCTATTACAATTTTACAACATTGTCAATCAACTTTCCGCTGCACTGTGGTTGTTATTGGCCAATCCATTCGATTTCCTTCCGCAT 
GCGGCGGCCGACAACGTTGACCTTTTTAGTCAACAAGGTCCGAGCGGAATACGTACATACATATGAACATACATATGTGTATAAACTACGAAACA 
TAAAACAATCGTATGTACATAAATACATGTACATACGCCCCAGAACGCAGGCCGAAGACAAAAAATCAAAGAGTCACGCAACTAAAAGCAAGAAA 
TTAACTAAGCAATGAGTGCATCATTTTATTTTTGGAACTCAAATCAAATCAATAGCAATAGCAAGGCAATGCGGTCGGTCAGAAAAAATCAGCTA 
GCCTCGACTGGTTTTACAATTTCACCTCATCGCTCAAATTTGCAAGCGATATTTAATCGTTTTGTGATTAGAGGCGGGAACAATGCTTTGCAATC 
GACCGATTATTTCGATGATTTTGCATATGAATTCGTTGAGTTCAATTGGTTAAAAACCCCAAATACAACTTCTTAGGTCAAATTACCGTGGATAT 
ATTAATAAATTATTATTATTATACACATATACAAATCATGATTAACGTACCAACAAGATTGTCTTACGCATGCCTATAAAATTGTGCTTGTGCAC 
TGATTTCTTGATGTCCTTTGTCTGTAATAACCGGATTCCAATGACCCATAATATTTAAGAAAATCGATAGTTGGCCAGGGCCTCTCACATAAATA 
CAGAGCAAGGTCTTAACTTTTTAAGAACTTGCGGATGTGTCTGTTGGTTAAACATAACAAATTTTTTAGCTCTGTGATTCAATGTGGCGATGCCC 
TTGCTCCTTATGTGTCCAAACCTATTTTATAGCTGGCTTCTCTTCTGCGTCTGCCATATACTCAATGAATTGCTTAATGGTTTTTAATTTTTGAA 
ATTTTATCTTGCAGTGCGCAGCTATGACAACCACGCCAAATGCGGTCTGGTATGGGTACCGCATGTGGTGGCCTACAGGTGCCGCACTTGCGGCA 
TATCGCCCTGCATGTCCATATGCCGGGACTGCTTCAAGAAGGGCAACCACACGAACCACGACTTCAATATGTTCCTGTCACAGGCGGGCGGTGCC 
TGCGATTGTGGTGACACCTCGGTCATGAAGGCGGAGGGTTTTTGCAGTGATCATGGCATCAATAATCGGGTGAATCGCGATCCGGTGCCAAACAA 
TCTTTTGGCAGTGGCCGAGGCCATAATGCCCAAGCTGCTCTTCCGCCTGCTGCAGCACTTCCGTGAGCACAGCGACACCCCACTGGAGGTGCAAG 
CGATAACGTCGTACTCCTGCGAGGAGTTCGCCAACATGCTGATCGATCTGAACAACATGGGCGAAATAATGCGCAAAGTGATGACGCGCACGCTG 
ATAAATCCGGAGGTGTATGCCTTCTTCATGGAAGCACCGTGCCAGGACACCCGGAATGGACGCTTCTTGAAGGCGAACCGAGAGAAGTACGAGGA 
TGCCGTCAATAGATTCCCAAACCCAGAGCCACCCGATGAGTACAGAGATCTGCCCGCGCTGGGCGACAAATTGGTCCACACCACGCTACTCGAAG 
AGTTCATCTTCTGGACATTTAAGTTCGAGTTTCCACAGACTCTGGTCTGCTTTCTGCTGAACATGCTGCCTGATCAGGATTACAAGGTTAGTCTT 
GATGGGATCATATGATTTTTACATATCTATATCAATATTTATTCGTTTTCTGCCCTAAAAGGAACACCTCACCCGCACATTTGTGATGCACTACA 
GCCGCATACCGTCCGTGTTGGAAATGTCCCGTGATCCGGACACGCTCAGCAACCGGGTGGTCCACATGAGCGTCCAACTCTTCTCTAACGAAAGT 
CTGGCCCTCAAGATGGTCAACGAGCTGTCGCTGCTGCACGTAATGATTATTAGTCTGAAGCTAATGATGTCCAAGATACTCATACAGAATACATT 
GCATGGTGAGAATTATTTTATATATCCGACTGTAGCTCTTTGAACATTTCCATTTTTCGTTCAGATCCCAACAAGAATTTTCATTTCGTCATCGA 
CTGCACGCGTCAGGTGATGAAGGACCACTGCTACTGGCCACTAGTCTCGGACTTTAACAACGTCCTTTCACACGAATCGGTGGCCCTGGTCTTTC 
TGCGGGACGACAATCTCATCGATATGTGGTTCCAGTTTCTCCAGATGCTCCAGGGCATGAATGTCAATGTGCGCGAGACGGCTTCTCATGTGGAA 
TTCGAGCCAAATAGCTACTATGCGGCGTTCTCCTGTGAGCTTGAGGCCAGTGCCTATCCCATGTGGTCGATTATCTCGCATCTGCAGGATGGCAC 
ACATGCCCATCTGGCAAAGAAGATCATCAACTACTGTGTGACGACGCTGCACGAGTGGCTAGACTCTATTTACTTTATGGAGGCGCGTCTATCTA 
TGGTTGGTATTTTCAATCCGTTTATTGCAGTTTCAATAAAATCCTTTTATTTATCCGTGCAGGAGGAGATGATGCAGGCTTCGTTCCACTTTCCC 
CTGCATCGTTATCTGGCTGCCTTTGTTTGCCAGGCGGTAACTAAAATGGGAATCAGTCTGAACGATGTGCTGCCCTCACGCCCCTATCTGCTGCC 
CCTGCTGATGATCCATCCACTTCGCGTCCAGGTGAGTCTTCAATCCTTTATTTCGCTTATTTTCGCTTTAAATAGTAAGAAGGTCCTCGGTTCTC 
TGGTTTCTCAGATTCACATTCGTTTTTATTTCTTTTGTTATTTTCAATATAATTTGTTTTTATATTCTTGTCTTGCCGGATTCTGCATCTCCTGC 
GGTTCGTTCTCCTCTGATCTCACTTAGACTATGCGACAGTTAAACGCAAGTGTGTATTGATAGGTAGTGTACAACTAAAAAAAAGCTATGACATA 
TGTGCGGGATGGAAACAGCACCGCTTCTCTCCGCTTTAGAATCTATTCTTTGGCACCTGAACGTGGGGCATGAAGTCGTAGGTGATGATCTTGGG 
GCCCAGGAAGCCGCCGTAGCCGTTGCGGCCCTTGTAGTAAATGCCAATGCCGGCTAGTGGTACTGGTGGATTCGAGACCACGTCCTGGATGTCCA 
TAAACGGCACTGTGCTCTGGGCGGCATCCTTGTCCAGTCCAGTGTTAGTGAAGTCGATGTACTGATTGTGGCGCGACAGAGGTATCGAGTGTGCT 
ATTGTCCGCGTAGACACGTCGGCGTTGCTCAGACGCAGTTTCTCCCTGAAATCAAGGGGTTAGTTATTTAGTTTTTGCTATCCGAAAATAGAGGA 
CACACTTACCTGCGTGCGCCGCTGACATCGGTATTGTCGTTGGACTTCCAATAGCTATTGTACTCCGGCCGGATGAGCTGACCCGTCCGGAAATC 
AAACTCACTGTAGTAGGCCTCCAAATTCAGATGTGTGCCCACCACACGGAATCGCACGCCGGTAACGACAAAGGAATTGTCATCCGTATCCACAT 
CGTCCAGATCGATGGTGCGTTTCTCGTAGCTCAACTTATGATAATCAACGCCATTCTTCACATGACGATCAAAGACGTTGTACTTTTCCACGGGT 
TTCCATTCCAGAGTGCTTTGATTGACAATACCACGAGGCAGGAGTTCGCCCTCCTGGATTTGCAGATGGAAAATGCGATTTTGCTTTACAAAACG 
CAAACCAGTCACGACTCTGAAGAATAATTAGTAAATATGTGGTTGATATTTCAAAATCAACAGATTCAACTAGAACTTACCGGTTGCGCTTGACA 
TCGGCGATCGTGTCCCGCAGATTGAAGAAACGATCAGACTTCAGACCCTCCTCGTCGCACAGGCAGAAGCAGTAGCTGCAGTGCCAGAGAAGGTA 
ACGCCACCAGCTGTCCACCTTGGTGGTGCCCCTGGTGCACTTACCCCGCTGGCCAAGGACGCGTCCGTTTTCGTACTCGATGAACTCATAGCGCC 
GCGTGCTATTCTGTGGCGACGGGCACACCCACATGTCGGAGTCTACGAACCTGCAGTTGTACAGACGACCACTACATTTCGGCTGACGGGAACAG 
TAGAGATCCTTAAAGCAGCCTTCACTGCGGGTGGACTGATAAAAGTCACACGTCTCTCGGCAAGTTTCCTCTTTATTCAAATCCACCTCGTTTTC 
AATGTAGCCTTGAAGGAGGCGCGTGACCTC 
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(SEQ ID NO: 103) 



Exon: 1001.. 1643 
Exon: 3720. .4456 
Exon: 4527.. 4755 
Exon: 4815.. 5446 
Exon: 5522.. 5965 
Start ATG: 1001 

Transcript No. : CT3955 

ATGGACGAGGATGACAACCTGTCCAATGCCGACATAAGCATAGATGACGAGGAGGTGGTGCGGGAGCAGACCCACCATCCGCCCATGCAAGAGGA 
TCAGGAGCTGGATAACGAAGATGGGTCATCCGACGTGGATCTGTCTTCGGTGTACGTCGTGCCACCCATAGTCTCGGCACCATCGCCCGCATCAG 
TGCGCATCGCTGGTGGATCCACAATTGGTGGCGGAGGGGTCGCAGCTGGAGCAGTTCCAACAACCACCGATAACTCGCATTCCCCCTTCCACGAT 
TGGGATTCCAGTGTGGCGGCCGCTGCTGCTCCACCGCCGCCACCGCCCAGGACAGGACCAACTACAACTACAAGCTCCGGCACAGCCGCGGAATC 
GGGAGCCGCGTCCGCCTTATCCGACGATGCCGGCGCCAAGCCAAGTTTCTTCTTTGGCGCCTCCTCGTTCAGTTTGCGCAGCCGCAAGGAGGTGG 
CCGCCCTGATCAACACGGAATGCTGCCGTGGCAGCCCCACACCCGATCTCGACTCCATAATGGATACCCTATTCAATCCGGGGACGCCCATCGAC 
AATCTGGACAACATCGAGTGGATACGATGGCTCATTGCCGGCGGACGGACGCCGCAGGAGTTTGTTAAAATTGTGCGCAGCTATGACAACCACGC 
CAAATGCGGTCTGGTATGGGTACCGCATGTGGTGGCCTACAGGTGCCGCACTTGCGGCATATCGCCCTGCATGTCCATATGCCGGGACTGCTTCA 
AGAAGGGCAACCACACGAACCACGACTTCAATATGTTCCTGTCACAGGCGGGCGGTGCCTGCGATTGTGGTGACACCTCGGTCATGAAGGCGGAG 
GGTTTTTGCAGTGATCATGGCATCAATAATCGGGTGAATCGCGATCCGGTGCCAAACAATCTTTTGGCAGTGGCCGAGGCCATAATGCCCAAGCT 
GCTCTTCCGCCTGCTGCAGCACTTCCGTGAGCACAGCGACACCCCACTGGAGGTGCAAGCGATAACGTCGTACTCCTGCGAGGAGTTCGCCAACA 

T GC T G ATC G AT C T G AAC AAC AT GGGC G AAAT AAT GC GC AAAGT GAT G ACG C GC AC GC T G AT AAAT C C GG AGG T GT AT GC C T T C T T C ATGG AAGC A 
CCGTGCCAGGACACCCGGAATGGACGCTTCTTGAAGGCGAACCGAGAGAAGTACGAGGATGCCGTCAATAGATTCCCAAACCCAGAGCCACCCGA 
TGAGTACAGAGATCTGCCCGCGCTGGGCGACAAATTGGTCCACACCACGCTACTCGAAGAGTTCATCTTCTGGACATTTAAGTTCGAGTTTCCAC 
AGACTCTGGTCTGCTTTCTGCTGAACATGCTGCCTGATCAGGATTACAAGGAACACCTCACCCGCACATTTGTGATGCACTACAGCCGCATACCG 
TCCGTGTTGGAAATGTCCCGTGATCCGGACACGCTCAGCAACCGGGTGGTCCACATGAGCGTCCAACTCTTCTCTAACGAAAGTCTGGCCCTCAA 
GATGGTCAACGAGCTGTCGCTGCTGCACGTAATGATTATTAGTCTGAAGCTAATGATGTCCAAGATACTCATACAGAATACATTGCATGATCCCA 
ACAAGAATTTTCATTTCGTCATCGACTGCACGCGTCAGGTGATGAAGGACCACTGCTACTGGCCACTAGTCTCGGACTTTAACAACGTCCTTTCA 
CACGAATCGGTGGCCCTGGTCTTTCTGCGGGACGACAATCTCATCGATATGTGGTTCCAGTTTCTCCAGATGCTCCAGGGCATGAATGTCAATGT 
GCGCGAGACGGCTTCTCATGTGGAATTCGAGCCAAATAGCTACTATGCGGCGTTCTCCTGTGAGCTTGAGGCCAGTGCCTATCCCATGTGGTCGA 
TTATCTCGCATCTGCAGGATGGCACACATGCCCATCTGGCAAAGAAGATCATCAACTACTGTGTGACGACGCTGCACGAGTGGCTAGACTCTATT 
T AC T T TAT GG AGGC GC GT C T AT C T AT GGT T GGT AT T T TC AAT C C GT T TAT T GC AGT T T C AAT AAAAT C C T T T TAT T T AT CC GT GC AG GAG GAG AT 
GATGCAGGCTTCGTTCCACTTTCCCCTGCATCGTTATCTGGCTGCCTTTGTTTGCCAGGCGGTAACTAAAATGGGAATCAGTCTGAACGATGTGC 
TGCCCTCACGCCCCTATCTGCTGCCCCTGCTGATGATCCATCCACTTCGCGTCCAGATTCACATTCGTTTTTATTTCTTTTGTTATTTTCAATAT 
AATTTGTTTTTATATTCTTGTCTTGCCGGATTCTGCATCTCCTGCGGTTCGTTCTCCTCTGATCTCACTTAGACTATGCGACAGTTAAACGCAAG 
TGTGTATTGATAGGTAGTGTACAACTAAAAAAAAGCTATGACATATGTGCGGGATGGAAACAGCACCGCTTCTCTCCGCTTTAGAATCTATTCTT 
TGGCACCTGAACGTGGGGCATGAAGTCGTAGGTGATGATCTTGGGGCCCAGGAAGCCGCCGTAGCCGTTGCGGCCCTTGTAGTAAATGCCAATGC 
CGGCTAGTGGTACTGGTGGATTCGAGACCACGTCCTGGATGTCCATAAACGGCACTGTGCTCTGGGCGGCATCCTTGTCCAGTCCAGTGTTAGTG 
AAGTCGATGTACTGATTGTGGCGCG 
(SEQ ID NO: 104) 

Start ATG: 1 

MDEDDNLSNADISIDDEEVVREQTHHPPMQEDQELDNEDGSSDVDLSSVYVVPPIVSAPSPASVRIAGGSTIGGGGVAAGAVPTTTDNSHSPFHD 
WDSSVAAAAAPPPPPPRTGPTTTTSSGTAAESGAASALSDDAGAKPSFFFGASSFSLRSRKEVAALINTECCRGSPTPDLDSIMDTLFNPGTPID 
NLDNIEWIRWLIAGGRTPQEFVKIVRSYDNHAKCGLVWVPHVVAYRCRTCGISPCMSICRDCFKKGNHTNHDFNMFLSQAGGACDCGDTSVMKAE 
GFCSDHGINNRVNRDPVPNNLLAVAEAIMPKLLFRLLQHFREHSDTPLEVQAITSYSCEEFANMLIDLNNMGEIMRKVMTRTLINPEVYAFFMEA 
PCQDTRNGRFLKANREKYEDAVNRFPNPEPPDEYRDLPALGDKLVHTTLLEEFIFWTFKFEFPQTLVCFLLNMLPDQDYKEHLTRTFVMHYSRIP 
SVLEMSRDPDTLSNRVVHMSVQLFSNESLALmVNELSLLHVMIISLKLMMSKILIQNTLHDPNKNFHFVIDCTRQVMKDHCYWPLVSDFNNVLS 
HESVALVFLRDDNLIDMWFQFLQMLQGIWVNVRETASHVEFEPNSYYAAFSCELEASAYPMWSIISHLQDGTHAHLAKKIINYCVTTLHEWLDSI 
YFMEARLSMVGIFNPFIAVSIKSFYLSVQEEMMQASFHFPLHRYLAAFVCQAVTKMGISLNDVLPSRPYLLPLLMIHPLRVQIHIRFYFFCYFQY 
NLFLYSCLAGFCISCGSFSSDLT* 
(SEQ ID NO: 105) 

Classification: hypothetical 



Celera Sequence No. : 142000013384425 

TTATCCGGGCACTTACACATCACATTATAATCTTATATCTCTTACTCTCTTAATCCAATAGTATGATATTAATTTTTGCTCGATTTTTATAGAAA 
GGCGATTCCCTGAAAAGATGCCAAACCGTGCTCATGGAAAGCGATGAAATGGAGCAGAAAACCTGCGTTGGGTATCGAAGCAAAGTCTCCATGGA 
GTCCGCCGTTCTGTTCTCGGTACTGTTGTCCACCAACCATGAGCTTTCTATTGGAAAGATGCTCGATATGAGTGAGATTCACGAAAGAATGCGGC 
GCGCCAAACAGTCATTCGAGGATACTGCTAGAGTTACTCCGTTCCGAACGAACATTGATGACCTAATCTCCGATTTGGATGTCTGTGCTAAGGTC 
AAATCCAAACTTTGTCGAGTAATAAGTGATTGTAAAAAGTACTAGAAAGGATTAGCTATATCTTTTATCTATATTTAATACTTTTATTTCATTTT 
AATCAATAGTATAAAATTATTTCTCTGGATTTTAGGTGACTATAAGAAGACTATTTTTGTAATATTTTTGAACCATTTAAGCTCAAAATACAACT 
TACTATTCATATTTTAATATATAATTCCATGTTATCATGAATTTATTAGAATTTATATATACAAGTAAAGATTCGAAATTCAGCTTCGACCCGCT 
CCTTACGATTGTTATCTATATTATCAATTAATACATTAAGGTTGCTCATCATAATTGTTAATTGTTTGCTCATAAAAGCGTTTAAAAATGAAAGA 
TGGGAAATACTTTTTTAGCTTTCCCTTATTGTCTCTAAAGTAAGATCGTAGTCATGATACAAGCTCTTCCTAAATGTGTAGAAATAATGATACAT 
TTGTATTATATATACATAAACGCGAAATATTAAAAAGAAACATTCCGAAAATTAAAGAATTGCATTAGTTTACAACAAAAACTATCGAATCTCTC 
TAATCAAAACGACGCGCCAAGCCAAATTTAGTACATTATCTGTGCTGAATCATTTTCGCTGTGCCGCAATATTTAATTAACTGCAGCGATCATTG 
C AGT TTTAGCCACCAGAGAAGCGT AAC ATAG AAAT C AAAAT GCAGAAGGTTGCACTGCTTCTAGTCGCCTTCCTGGCTGCGGCCGTGGCCCACCC 
AAATTCGCAGGAGAAGCCCGGATTGCTGCGCGTACCGCTCCACAAATTTCAGTCGGCCCGCCGGCACTTTGCGGATGTGGGCACGGAGCTGCAGC 
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AATTGCGCATTAGGTACGGCGGAGGTGATGTGCCGGAACCGCTGTCCAACTATATGGACGCGCAGTACTATGGGCCCATTGCCATTGGCTCGCCA 

CCGCAGAACTTCCGTGTGGTTTTCGACACCGGCTCCTCGAACTTGTGGGTGCCGTCCAAGAAGTGCCACCTGACCAACATCGCCTGTTTGATGCA 

CAACAAGTACGATGCCTCCAAGTCGAAGACCTACACCAAGAACGGCACTGAGTTCGCCATCCAGTATGGCTCGGGCAGCTTGTCCGGATACCTGT 

CCACGGATACGGTGTCCATTGCCGGCCTGGACATCAAGGATCAGACATTCGCTGAGGCGCTCAGTGAGCCGGGCCTGGTCTTCGTAGCTGCCAAG 

TTCGACGGCATCCTAGGCCTGGGCTACAACTCCATCTCGGTGGACAAGGTGAAGCCGCCATTTTACGCCATGTACGAACAGGGTTTGATCTCCGC 

TCCGGTGTTCTCATTCTACCTGAATCGCGATCCCGCCTCGCCCGAGGGCGGTGAGATCATCTTTGGCGGCTCCGATCCCAACCACTACACTGGTG 

AATTCACCTACTTGCCCGTTACCCGCAAGGCCTACTGGCAAATCAAGATGGATGCCGCCTCCATTGGCGATTTGCAGCTGTGCAAGGGCGGTTGC 

CAGGTGATCGCTGACACCGGTACCTCACTAATTGCTGCGCCCTTGGAAGAGGCCACCTCTATTAACCAGAAGATTGGTGGAACTCCCATCATCGG 

TGGTCAGGTGAGTACCATGCTCTAAGTTCTCACATGAAGCACTGCTAATCCTCTTAATTTCTTACTTTAGTATGTGGTTTCTTGCGATCTCATTC 

CCCAATTGCCGGTAATCAAGTTTGTGCTGGGCGGAAAGACCTTCGAGCTCGAGGGCAAGGACTATATTCTCCGTGTGGCCCAGATGGGTAAGACC 

ATCTGTCTGTCCGGCTTTATGGGCTTGGACATCCCCCCTCCAAACGGACCCTTGTGGATCCTCGGTGACGTTTTCATTGGCAAATACTACACCGA 

GTTTGACATGGGCAACGATCGTGTGGGCTTCGCCGATGCCAAATAATCGAACAAAGAAAAGTATACTTAAAATTTAAATTGTAAACATTATCGAT 

CGATCCCATTGCGGGTTTCTCCGAAGAATAATATATAAATTATCATTTTAGTAGATTTTCCATATATTTATTAGCGTAAGTTTATTAATTACAAT 

AGGGCTTGCGCGGTCTGCACTTAATGAGACGAATAGTAAGTACGTGGCGATGACCGACGCTTCTAATCTTCCAAAATCGACTAACTTAAGTATAG 

CAAGAGTTCTTATCTCTAAAGCTTAACAAGTTACAGTTTCTTATAGGTAAAGCTCGATCTATAGATGGGAAAAGCACTTGAGGCTTACGACTATA 

TTTCGGGTGTTTTGTGATTGTGTTAACTCCTCCCCTTCTAAGACGAAGGCTGTCCACAGACTTTTTACAATTGGCAAATCTATTTGAAAGCGGAT 

CTTACGGGTAACATTGGTTGTGCACTAAGTTCCCATACTGCCAACCTGGTACTGCGTTAAACCATCGAAGGGAAGGCAGCTCTGCGTGTCTGGAT 

GATCCGAGAAGTTCGTAAATGCTGCTGATTCTCCATGTCATGGGTTCGCAACAAGGTTCTGGAGATGATCGTAG 

(SEQ ID NO: 106} 

Exon: 1001. .2002 
Exon: 2066. .2441 
Start ATG: 1085 

Transcript No. : CT3996 

CATTTTCGCTGTGCCGCAATATTTAATTAACTGCAGCGATCATTGCAGTTTTAGCCACCAGAGAAGCGTAACATAGAAATCAAAATGCAGAAGGT 
TGCACTGCTTCTAGTCGCCTTCCTGGCTGCGGCCGTGGCCCACCCAAATTCGCAGGAGAAGCCCGGATTGCTGCGCGTACCGCTCCACAAATTTC 
AGTCGGCCCGCCGGCACTTTGCGGATGTGGGCACGGAGCTGCAGCAATTGCGCATTAGGTACGGCGGAGGTGATGTGCCGGAACCGCTGTCCAAC 
TATATGGACGCGCAGTACTATGGGCCCATTGCCATTGGCTCGCCACCGCAGAACTTCCGTGTGGTTTTCGACACCGGCTCCTCGAACTTGTGGGT 
GCCGTCCAAGAAGTGCCACCTGACCAACATCGCCTGTTTGATGCACAACAAGTACGATGCCTCCAAGTCGAAGACCTACACCAAGAACGGCACTG 
AGTTCGCCATCCAGTATGGCTCGGGCAGCTTGTCCGGATACCTGTCCACGGATACGGTGTCCATTGCCGGCCTGGACATCAAGGATCAGACATTC 
GCTGAGGCGCTCAGTGAGCCGGGCCTGGTCTTCGTAGCTGCCAAGTTCGACGGCATCCTAGGCCTGGGCTACAACTCCATCTCGGTGGACAAGGT 
GAAGCCGCCATTTTACGCCATGTACGAACAGGGTTTGATCTCCGCTCCGGTGTTCTCATTCTACCTGAATCGCGATCCCGCCTCGCCCGAGGGCG 
GTGAGATCATCTTTGGCGGCTCCGATCCCAACCACTACACTGGTGAATTCACCTACTTGCCCGTTACCCGCAAGGCCTACTGGCAAATCAAGATG 

G AT GC C GC C T C CAT T GGC GAT T T G C AGCT GT GC AAGGGC GGT T GC C AGG T G AT CGC T G AC AC C G GT AC CT C ACT AAT TGCTGCGCCCTT GG AAG A 
GGCCACCTCTATTAACCAGAAGATTGGTGGAACTCCCATCATCGGTGGTCAGTATGTGGTTTCTTGCGATCTCATTCCCCAATTGCCGGTAATCA 
AGTTTGTGCTGGGCGGAAAGACCTTCGAGCTCGAGGGCAAGGACTATATTCTCCGTGTGGCCCAGATGGGTAAGACCATCTGTCTGTCCGGCTTT 
ATGGGCTTGGACATCCCCCCTCCAAACGGACCCTTGTGGATCCTCGGTGACGTTTTCATTGGCAAATACTACACCGAGTTTGACATGGGCAACGA 
TCGTGTGGGCTTCGCCGATGCCAAATAATCGAACAAAGAAAAGTATACTTAAAATTTAAATTGTAAACATTATCGATCGATCCCATTGCGGGTTT 

CTCCGAAGAATAATATATAAATTATCATTTTAGTAGATTTTCCATATA 
(SEQ ID NO: 107) 

Start ATG: 85 

MQKVALLLVAFLAAAVAHPNSQEKPGLLRVPLHKFQSARRHFADVGTELQQLRIRYGGGDVPEPLSNYMDAQYYGPIAIGSPPQNFRVVFDTGSS 
NLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYNSIS 
VDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIADTGTSLIAA 
PLEEATSINQKIGGTPIIGGQYVVSCDLIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGLDIPPPNGPLWILGDVFIGKYYTEFD 

MGNDRVGFADAK* 
(SEQ ID NO: 108) 

Name: Cathepsin D 
Classification: endopeptidase 



Celera Sequence No. : 142000013383905 

CTTATTCCTTGTTCCAGGCTGCCAGGTATTTCATGAGGTCCGTGAGACTCTCGAAATCGGTGCGGCTGGCTGGTGGTTCGTCCTTGGTGATTCTT 
GGCAGTGCATTGCCCATCTTGAGGCGGAACTCCTGGAACTGCGAACGAAACGAAAGATTTAAAGCGAAATTCATTGAACAAATCGGAGGCATGCT 
CACATCCTTGGCTCCGGATAGCTTCCAGAAGCCGTAGCAGAAGGCGCCGGTGCCCATCACGGCGTACAGTGTGCCCCAGCCCAGCGCCCGCAGGG 
CCAGGGTGGCGCCCTCGTCCATCAGGATCATGCCTTGCCGCGTTCCAGCCTGCTGGAGGACCTTGCTATCCGTCTTCTTGGCGGTGGCCAGCGTG 
CGCGAGAAGCCGAACAGAGCGGACACCCCGCCCACCAAACCCAGAAAGGCGGCGGCTAGAAAGGAGGATGAAATGGAGTTGTACTTCAGTCAGTT 
GCTGGCTGATATATATGCGTGTGTCTGTATGTGTTTGGAGAACCACCTACCCTGGATGCGGAACTTGCGCTCCTTCTCGGCGGCCACTGCATCCG 
CGCTGGTGCCGGCCTCGCTCATCCTTCTCAATTGCGATTTATTGTTGTCGTAACTACTAATGCCCGAATTGGATCGCACAACACAGCTGATTGCT 
TGACCGCTCTTGACAACCAACACTGGTGCCGCAGCTGCCACTGCCACTGACGCGTTCGCAGAGGTGCTAAGATGGCGGGAGGGAAGCTAGAAGTG 
AAGTGCTAAAAGATAGTTCAAAATATTCAACAGGTACTGGCCTTTCTCACTGACTATGAGAGTCAATTTGATTTTAATGTGCTGCATTTAATTTA 
TTTGTTAAATTACTAACTAATACTAATAGATATATTTATATAAAAGATATAGCTAAAAAACGTCAGTCTGGCAACCCTGCTACAACGACCGCAAT 
ATGGTATGGTATTTCGCCGCTCGCCCTGCAGTCACACTTTTCGGTCGGCGTTGTCCGCATTTTCACATCGCATCGCGTCTGAAAAAACGGGCTAA 
TTCGCATTCGCAATCGCCTGTAGGGCCCGACTGGTATAGAATGCACGCCCAGCGTGCCAGCAACAAACAGTGATTGTTTACCCAGCCTTCTATTT 
GGGCTTTCGCGTTTTTCGAGCTCCTTCTTTTCCCTTCCCGGTCGTATCCCTTGCCTTTTGCACGAGGTGAAAAGTGTGACGACGCAGCAGAAAAA 
GCAAAAGCAAAAAAAAGGTGGAACAAAAAAAAACCGCACACACACACCGAAAGCAGGCTCCACTTTGGTCTAGTGGCCACCAGTATCGCTGACGA 
CCAGGATGAGCACCCCCACGGACTCGAAGGCGCCGTTGCGCCAGGTGAAGCGCGTACAGTTCGGCATTTTGTCCCCTGATGAAATTGTAAGTGCG 
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GCTCACAGATTCATTCGflTCGACGCCATATTGAAAaATTGAACAATATTGTAAACGCAAAAATAGCATTCCATCAATTCGCAATTCGTATATCGT 
GGTTTTATATGGTCTATATGGGAAATGCTTATTTCATGTTCACATGCATTCCAGTAACGCGAGCAGGACAACAAATAACAGATAGTTCAAGTCCT 
GTATCTTTAGGATCTTAAGCGAAACTCACTAAATATTCGATTAACTCGTAAAACCCTTGTATCAGAAATGTTTTCCAGCTGAATATCATTACTTC 
TCAATAAAGTTTATGAATATATTACGTAAAGAAATTGTATATTTGTTATTAGTTTCCTGGGAAAGTTCCAAGTCATAGCAAAATAAAACACAATT 
ATACCCTGCTTAGATTTTGGGAACTTGGAATCTATTATTTCGAATATCAAGTTATTGATGCGTATATCTAAAGGGTTCAAGACAAGGGAATTCAG 
ATATATATATTCTATCTATGATCCTTCAAGCACCTGTTGGCCATATCAATATGACACGCACTTTAAGAATTTGACTTTATTTGATCCACATGTGT 
TACTTCTTTCCTGCTTGCAGCGTCGCATGTCCGTCACCGAGGGTGGCGTCCAGTTTGCGGAGACAATGGAGGGCGGACGACCAAAGCTGGGCGGA 
TTGATGGATCCGCGCCAGGGTGTCATAGACAGGACGTCGCGCTGTCAGACGTGCGCCGGCAATATGACCGAGTGCCCCGGTCACTTTGGACACAT 
CGACCTGGCCAAACCAGTGTTCCACATCGGCTTCATCACCAAGACAATTAAGATTTTGCGATGTGTGTGCTTCTACTGCTCCAAAATGTTGGTCT 
CGCCACACAATCCAAAGATCAAGGAGATCGTTATGAAGTCTAGGGGACAGCCGCGTAAGCGCCTGGCCTACGTCTATGATCTGTGCAAGGGCAAG 

ACGATTTGCGAGGGCGGCGAGGACATG^^^ 

CCAGCCGTCGATCCGGAGAACAGGACTGGATCTCACCGCCGAGTGGAAGCATCAGAACGAGGATTCGCAGGAGAAGAAGATCGTGGTCTCGGCAG 
AGCGGGTTTGGGAAATCCTTAAGCATATCACCGATGAGGAGTGCTTTATTCTGGGCATGGATCCCAAGTATGCGCGTCCCGATTGGATGATCGTG 
ACCGTCCTGCCTGTTCCACCGTTGGCCGTACGGCCAGCGGTCGTGATGTTTGGTGCTGCCAAGAATCAGGATGATTTGACGCACAAGCTGTCCGA 
TATCATCAAGGCAAACAATGAGCTTCGCAAGAACGAGGCCAGTGGTGCCGCAGCGCATGTGATCCAGGAGAACATCAAGATGCTGCAATTCCACG 
TCGCCACACTAGTGGACAACGATATGCCCGGCATGCCCAGGGCTATGCAAAAGTCGGGAAAACCCCTTAAAGCCATCAAGGCGCGACTCAAGGGT 
AAGGAGGGTAGGATTCGTGGCAACTTGATGGGCAAACGTGTCGACTTCTCCGCCCGTACTGTCATCACACCCGATCCAAATCTGCGTATCGATCA 

^tc^tctgccgcgttccattgcccag^^ 

GTAATTCGCAGTATCCGGGTGCCAAGTACATTGTGCGCGACAATGGCGAGCGTATTGATCTTCGCTTTCATCCTAAATCCTCTGATTTACATCTG 

CAGTGCGGCTACAAGGTAGAGCGGCATTTGCGCGACGACGATCTGGTGATCTTCAACCGACAGCCCACGCTGCACAAGATGAGTATGATGGGTCA 

CAGGGTGAAAGTGTTGCCCTGGTCGACTTTCCGCATGAACCTGTCGTGTACATCGCCCTACAATGCTGATTTCGACGGCGATGAAATGAATTTGC 

ACGTTCCGCAGTCCATGGAGACGCGCGCCGAGGTGGAGAACATCCACATCACACCCAGACAGATCATCACGCCGCAGGCGAACAAACCTGTCATG 

GGTATCGTGCAGGACACTCTGACTGCGGTGCGAAAGATGACCAAGCGCGACGTATTCATCACGCGCGAGCAGGTGATGAATCTGCTCATGTTCCT 

GCCCACGTGGGACGCTAAAATGCCGCAACCCTGCATCCTGAAACCGCGTCCCCTGTGGACGGGCAAACAGATCTTCTCCCTGATCATCCCCGGCA 

ACGTGAACATGATACGCACACATTCCACGCATCCGGACGAGGAGGATGAAGGACCATACMGTGGATCTCGCCCGGTGACACCAAGGTAATGGTG 

GAGCACGGTGAACTTATCATGGGGATCCTCTGCAAGAAGTCACTGGGTACATCAGCCGGATCACTGCTGCATATTTGTTTCCTGGAACTTGGTCA 

CGATATCGCTGGTCGGTTCTATGGCAACATTCAGACCGTGATCAACAATTGGTTGCTTTTCGAGGGCCATAGTATCGGTATTGGTGACACTATTG 

CCGATCCCCAGACCTACAACGAGATCCAGCAGGCCATCAAGAAGGCAAAGGACGACGTGATAAACGTTATCCAAAAGGCCCACAACATGGAGCTG 

GAGCCCACGCCGGGTAATACTCTGCGTCAGACGTTCGAGAACAAGGTGAACCGTATCCTAAACGATGCTCGTGACAAGACTGGTGGTTCGGCCAA 

GAAATCTCTCACTGAGTACAACAATCTAAAGGCTATGGTGGTGTCTGGTTCCAAGGGTTCCAACATTAATATATCCCAGGTAAGCTACTGACTAG 

CGATTTTCATTCGAACTCATTTAATGTGCTGGCATTTTTATCTTCCAGGTTATTGCTTGTGTGGGTCAACAGAACGTGGAGGGTAAGCGAATCCC 

CTACGGTTTCCGCAAACGCACTCTTCCCCACTTTATTAAGGACGATTATGGTCCTGAGTCGCGCGGTTTCGTGGAGAATTCGTATCTTGCCGGCC 

TGACACCCTCTGAGTTCTATTTCCACGCTATGGGTGGTCGTGAAGGTCTTATCGATACTGCTGTAAAGACAGCTGAAACCGGTTACATCCAGCGT 

CGTCTTATAAAGGCTATGGAGTCGGTGATGGTTAACTACGACGGAACAGTGCGTAACTCGGTGGGCCAGCTTATTCAGCTGCGTTACGGCGAGGA 

CGGTCTTTGCGGCGAGCTGGTTGAGTTCCAGAACATGCCAACGGTGAAACTGTCGAACAAGTCGTTTGAAAAGCGCTTCAAATTTGACTGGAGCA 

ACGAGCGATTAATGAAAAAGGTATTTACGGATGATGTGATCAAGGAGATGACCGACAGCAGCGAAGCCATCCAGGAACTGGAGGCAGAGTGGGAT 

CGTTTGGTTTCCGATCGCGACAGTTTGAGACAAATCTTCCCTAACGGCGAATCCAAGGTGGTACTGCCGTGCAACCTGCAGCGTATGATCTGGAA 

TGTGCAGAAGATCTTTCACATCAACAAGCGCCTGCCCACGGATCTCTCGCCCATTCGGGTGATCAAGGGCGTTAAGACATTGCTCGAACGCTGTG 

TTATTGTCACGGGAAATGATCGAATTTCAAAGCAGGCCAACGAGAACGCCACGTTGCTATTCCAGTGCCTAATCCGTTCGACACTATGTACGAAA 

TACGTGTCGGAGGAGTTCCGCCTGTCCACAGAGGCGTTTGAGTGGTTGGTCGGAGAAATCGAGACGCGTTTCCAACAGGCCCAGGCCAATCCCGG 

CGAGATGGTGGGTGCCCTGGCTGCCCAGAGTTTGGGTGAGCCCGCTACTCAGATGACACTGAACACCTTCCATTTTGCTGGTGTGTCTTCAAAGA 

ACGTAACATTGGGTGTGCCGCGTCTCAAGGAGATTATCAACATATCCAAAAAGCCCAAAGCGCCGTCGCTAACCGTTTTCCTTACTGGCGGCGCT 

GCTCGCGATGCGGAGAAGGCCAAGAACGTACTGTGCCGCCTGGAGCATACCACGCTGCGCAAGGTTACGGCCAATACGGCTATTTACTACGACCC 

GGATCCGCAGAGAACGGTGATATCCGAGGATCAAGAATTTGTGAATGTCTACTACGAAATGCCCGACTTTGATCCCACACGCATTTCGCCCTGGT 

TGCTACGTATTGAATTGGACCGCAAGCGGATGACGGACAAGAAGCTGACAATGGAACAGATCGCCGAGAAGATTAACGTGGGCTTCGGGGAGGAT 

CTCAATTGTATTTTCAATGACGACAATGCCGATAAGCTGGTGCTGCGCATTAGGATCATGAACAACGAAGAGAACAAGTTCCAGGACGAAGACGA 

GGCCGTCGACAAGATGGAGGACGACATGTTCTTGCGCTGCATTGAGGCCAACATGCTGTCGGACATGACACTGCAGGGTATCGAAGCCATCGGCA 

AGGTGTACATGCATCTGCCACAGACCGACAGCAAGAAGCGTATCGTGATCACTGAGACCGGAGAGTTTAAGGCCATCGGCGAGTGGCTGCTCGAG 

ACGGACGGCACATCGATGATGAAAGTGCTGTCTGAGCGTGACGTGGACCCAATCCGAACATCCTCCAACGATATTTGCGAGATATTCCAGGTGCT 

GGGCATCGAGGCGGTGCGAAAGTCTGTCGAAAAGGAGATGAACGCCGTGTTGCAGTTCTACGGCCTGTACGTGAACTATCGTCACTTGGCCCTGT 

TGTGCGATGTGATGACGGCCAAGGGCCATCTGATGGCCATCACCCGTCACGGCATTAACAGGCAAGACACTGGTGCCCTTATGCGTTGCTCCTTT 

GAGGAGACAGTGGATGTGCTAATGGATGCCGCCGCTCACGCCGAAACGGATCCCATGAGGGGCGTCTCTGAGAACATTATCATGGGACAGCTGCC 

CAAAATGGGTACCGGCTGCTTTGACCTTCTGCTCGATGCAGAGAAGTGTCGTTTCGGCATCGAGATTCCCAATACGTTGGGCAACAGTATGCTAG 

GTGGCGCCGCTATGTTCATTGGCGGTGGATCGACACCGAGCATGACGCCACCGATGACGCCGTGGGCTAACTGCAACACGCCGCGATACTTCTCT 

CCACCCGGCCACGGTAAGTAAATCTATCCATTATGGGTTTTTGATTTTACTCTCGACTCAGCAACGCTATATCTTTTGTTAATTAACTTTTTTGA 

TGCGATTGCCTAATATATTCAATCTCTTTCTTTCTTGTGCTTGCAGTAAGTGCCATGACTCCTGGCGGTCCCAGTTTCTCGCCTTCGGCTGCATC 

GGATGCGTCCGGAATGTCGCCTAGCTGGTCGCCGGCTCATCCGGGCTCATCGCCCAGTTCACCAGGACCTTCGATGTCGCCGTATTTCCCAGCCT 

CGCCGAGTGTTTCTCCCTCTTATTCGCCAACGAGTCCGAACTACACGGCATCTTCl'CCCGGTGGAGCCTCGCCGAATTACTCGCCCTCGAGTCCG 

AACTATTCGCCGACGTCGCCGCTTTATGCAAGTCCACGTTACGCATCGACAACGCCAAATTTCAATCCACAGTCGACGGGTTACTCGCCATCTTC 

ATCGGGATACTCGCCAACATCCCCGGTCTACTCGCCCACGGTGCAATTCCAGTCGAGTCCGTCGTTTGCGGGCAGCGGTAGCAACATTTACTCGC 

CGGGCAATGCGTACTCGCCCAGCTCGTCCAACTACTCCCCCAATTCACCATCCTACTCGCCGACATCACCATCGTACTCGCCGTCAAGTCCTTCG 

TACTCGCCAACGTCGCCTTGCTATTCGCCCACATCGCCTTCGTACTCGCCAACGAGTCCGAACTACACACCCGTAACACCCTCATACTCGCCGAC 

AAGTCCGAACTATTCAGCGTCGCCGCAATATTCTCCAGCCTCGCCAGCTTACTCGCAAACGGGGGTGAAGTACTCACCGACATCGCCGACGTACT 

CGCCGCCGTCACCATCGTACGATGGGTCTCCCGGATCACCACAATATACGCCAGGATCTCCGCAGTACTCGCCGGCCTCGCCTAAGTACTCGCCG 

ACCTCACCGCTGTACTCGCCCAGCTCGCCGCAGCACTCGCCCTCAAACCAGTACAGCCCAACAGGATCGACCTATTCGGCGACGAGTCCGCGGTA 

CTCGCCGAACATGTCCATCTACTCGCCGAGCAGCACCAAGTACTCGCCCACCTCGCCAACGTACACACCGACGGCCCGCAACTACTCGCCCACGT 

CACCGATGTACTCGCCAACGGCTCCATCGCACTACAGTCCCACGAGTCCGGCCTACTCGCCCAGCAGTCCCACGTTCGAGGAGAGCGAAGACTGA 

GGAAGGGAGGACGGGGGTAGCTCCCCCAGCACGTCCCGATCTCCGGGGCGGTCAAGCCGTAGTTAAGTACCGACTACGCTCGATCGCCCGAACGG 

GAAAGGTGAATTTAATAATATTATGTTTGTATACAGCAAATAGGTCTCTCATACAAAATTCAATACTGTTTGCAAAAGTAGCTTTAAACAAGTTG 

TATAAATTAGCCTACAGTTTCGTATTTGGTTTTGTGGTAAAGTAAACATTTTGAATTGTTTGCGTTAACCAGAAAAGCGAATTCGAAGCCATTTA 

TATTTGCACAAATTGTTTCAAGTTGCAATTCTCGAACTGTATTCGATAATGCGAATCTATTTCTTTGCACAACCCACTGTATGAAATTGTTTACT 
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AACTGGAAGACTCTTGTCACTCTTCTGCAAAGTACTTTCACATTTTTTTGAATATCCCTATTCTTATTTTGCCTAATGTTAGTATAAAGACCCTA 
GTTATTAAAAACAACAACAAAACAAGCCAACGTCAGACACTATTTGTAGCGTTTTCCATATACCCTTTAAATCCCTAATTAACCCGATCAGTTGT 
TTTCGCCTGCACACAAATGTTACTTATAAAGCATAAGAATGCAGGCACCCACCCATACACTACACTATATATTCATATATATTTTTACGTCCGAC 
ACATTTTTGTAAATTATGACAGAAAACAAAATAGAAAATTATAAAATAAAAACCACAAAAAAATACTCAACAACATTGGTTTATTTACTTTACTA 
AACTGATATGCTCATTCTGGCAGACTACTGCAATTACTTGTAGTTGATTTAGGGATTTGAGAGCCAGTGCGGCCACTCTTGGATCTCGTTGTATA 
GAGTCAGTCCGGGACTCTGGGTGCTGATCACCTGGGAGCCGGCGATCAGGGCGTAATCCTCGCTCAGCTCACCCTGCTGCACTCCGCAGGCCAGC 
AGATCTTTGGCCGCCTGGAGAGCCGCATCCGATGGAAGTTTATCTATCGGAGATGTTGGGCATGTAG 
(SEQ ID NO: 109} 

Exon: 1001.. 1416 
Exon: 2016. . 4259 
Exon: 4324 . .6568 
Exon: 6697.. 7807 
Start ATG: 1336 

Transcript No. : CT4016 

TTGTCCGCATTTTCACATCGCATCGCGTCTGAAAAAACGGGCTAATTCGCATTCGCAATCGCCTGTAGGGCCCGACTGGTATAGAATGCACGCCC 
AGCGTGCCAGCAACAAACAGTGATTGTTTACCCAGCCTTCTATTTGGGCTTTCGCGTTTTTCGAGCTCCTTCTTTTCCCTTCCCGGTCGTATCCC 
TTGCCTTTTGCACGAGGTGAAAAGTGTGACGACGCAGCAGAAAAAGCAAAAGCAAAAAAAAGGTGGAACAAAAAAAAACCGCACACACACACCGA 
AAGCAGGCTCCACTTTGGTCTAGTGGCCACCAGTATCGCTGACGACCAGGATGAGCACCCCCACGGACTCGAAGGCGCCGTTGCGCCAGGTGAAG 
CGCGTACAGTTCGGCATTTTGTCCCCTGATGAAATTCGTCGCATGTCCGTCACCGAGGGTGGCGTCCAGTTTGCGGAGACAATGGAGGGCGGACG 
ACCAAAGCTGGGCGGATTGATGGATCCGCGCCAGGGTGTCATAGACAGGACGTCGCGCTGTCAGACGTGCGCCGGCAATATGACCGAGTGCCCCG 
GTCACTTTGGACACATCGACCTGGCCAAACCAGTGTTCCACATCGGCTTCATCACCAAGACAATTAAGATTTTGCGATGTGTGTGCTTCTACTGC 
TCCAAAATGTTGGTCTCGCCACACAATCCAAAGATCAAGGAGATCGTTATGAAGTCTAGGGGACAGCCGCGTAAGCGCCTGGCCTACGTCTATGA 
TCTGTGCAAGGGCAAGACGATTTGCGAGGGCGGCGAGGACATGGATCTCACCAAGGAGAACCAGCAGCCGGATCCAAACAAAAAGCCCGGCCACG 
GCGGTTGCGGTCACTACCAGCCGTCGATCCGGAGAACAGGACTGGATCTCACCGCCGAGTGGAAGCATCAGAACGAGGATTCGCAGGAGAAGAAG 
ATCGTGGTCTCGGCAGAGCGGGTTTGGGAAATCCTTAAGCATATCACCGATGAGGAGTGCTTTATTCTGGGCATGGATCCCAAGTATGCGCGTCC 
CGATTGGATGATCGTGACCGTCCTGCCTGTTCCACCGTTGGCCGTACGGCCAGCGGTCGTGATGTTTGGTGCTGCCAAGAATCAGGATGATTTGA 
CGCACAAGCTGTCCGATATCATCAAGGCAAACAATGAGCTTCGCAAGAACGAGGCCAGTGGTGCCGCAGCGCATGTGATCCAGGAGAACATCAAG 
ATGCTGCAATTCCACGTCGCCACACTAGTGGACAACGATATGCCCGGCATGCCCAGGGCTATGCAAAAGTCGGGAAAACCCCTTAAAGCCATCAA 
GGCGCGACTCAAGGGTAAGGAGGGTAGGATTCGTGGCAACTTGATGGGCAAACGTGTCGACTTCTCCGCCCGTACTGTCATCACACCCGATCCAA 
ATCTGCGTATCGATCAGGTCGGTGTGCCGCGTTCCATTGCCCAGAATCTAACCTTCCCCGAGCTGGTCACTCCCTTCAACATCGATCGCATGCAG 
GAGCTGGTGCGCAGGGGTAATTCGCAGTATCCGGGTGCCAAGTACATTGTGCGCGACAATGGCGAGCGTATTGATCTTCGCTTTCATCCTAAATC 
CTCTGATTTACATCTGCAGTGCGGCTACAAGGTAGAGCGGCATTTGCGCGACGACGATCTGGTGATCTTCAACCGACAGCCCACGCTGCACAAGA 
TGAGTATGATGGGTCACAGGGTGAAAGTGTTGCCCTGGTCGACTTTCCGCATGAACCTGTCGTGTACATCGCCCTACAATGCTGATTTCGACGGC 
G ATGAAATG AATT TGC ACG T TCC GC AGT C C AT GGAGAC GCGCGCCGAGGT GG AG AAC ATC C AC ATC AC AC CC AG AC AG AT CAT C AC G C C G C AGGC 
GAACAAACCTGTCATGGGTATCGTGCAGGACACTCTGACTGCGGTGCGAAAGATGACCAAGCGCGACGTATTCATCACGCGCGAGCAGGTGATGA 
ATCTGCTCATGTTCCTGCCCACGTGGGACGCTAAAATGCCGCAACCCTGCATCCTGAAACCGCGTCCCCTGTGGACGGGCAAACAGATCTTCTCC 
C TG ATC ATC C CCGGC AAC GT GAAC AT GA T AC GC A CAC AT T C C AC GC AT CC GG AC GAGG AGGAT G AAGG AC C AT AC AAGT G GAT C T C GC C C GG T G A 
CACCAAGGTAATGGTGGAGCACGGTGAACTTATCATGGGGATCCTCTGCAAGAAGTCACTGGGTACATCAGCCGGATCACTGCTGCATATTTGTT 
TCCTGGAACTTGGTCACGATATCGCTGGTCGGTTCTATGGCAACATTCAGACCGTGATCAACAATTGGTTGCTTTTCGAGGGCCATAGTATCGGT 
ATTGGTGACACTATTGCCGATCCCCAGACCTACAACGAGATCCAGCAGGCCATCAAGAAGGCAAAGGACGACGTGATAAACGTTATCCAAAAGGC 
CCACAACATGGAGCTGGAGCCCACGCCGGGTAATACTCTGCGTCAGACGTTCGAGAACAAGGTGAACCGTATCCTAAACGATGCTCGTGACAAGA 
CTGGTGGTTCGGCCAAGAAATCTCTCACTGAGTACAACAATCTAAAGGCTATGGTGGTGTCTGGTTCCAAGGGTTCCAACATTAATATATCCCAG 
GTTATTGCTTGTGTGGGTCAACAGAACGTGGAGGGTAAGCGAATCCCCTACGGTTTCCGCAAACGCACTCTTCCCCACTTTATTAAGGACGATTA 
TGGTCCTGAGTCGCGCGGTTTCGTGGAGAATTCGTATCTTGCCGGCCTGACACCCTCTGAGTTCTATTTCCACGCTATGGGTGGTCGTGAAGGTC 
TTATCGATACTGCTGTAAAGACAGCTGAAACCGGTTACATCCAGCGTCGTCTTATAAAGGCTATGGAGTCGGTGATGGTTAACTACGACGGAACA 
GTGCGTAACTCGGTGGGCCAGCTTATTCAGCTGCGTTACGGCGAGGACGGTCTTTGCGGCGAGCTGGTTGAGTTCCAGAACATGCCAACGGTGAA 
ACTGTCGAACAAGTCGTTTGAAAAGCGCTTCAAATTTGACTGGAGCAACGAGCGATTAATGAAAAAGGTATTTACGGATGATGTGATCAAGGAGA 
TGACCGACAGCAGCGAAGCCATCCAGGAACTGGAGGCAGAGTGGGATCGTTTGGTTTCCGATCGCGACAGTTTGAGACAAATCTTCCCTAACGGC 
GAATCCAAGGTGGTACTGCCGTGCAACCTGCAGCGTATGATCTGGAATGTGCAGAAGATCTTTCACATCAACAAGCGCCTGCCCACGGATCTCTC 
GCCCATTCGGGTGATCAAGGGCGTTAAGACATTGCTCGAACGCTGTGTTATTGTCACGGGAAATGATCGAATTTCAAAGCAGGCCAACGAGAACG 
CCACGTTGCTATTCCAGTGCCTAATCCGTTCGACACTATGTACGAAATACGTGTCGGAGGAGTTCCGCCTGTCCACAGAGGCGTTTGAGTGGTTG 
GTCGGAGAAATCGAGACGCGTTTCCAACAGGCCCAGGCCAATCCCGGCGAGATGGTGGGTGCCCTGGCTGCCCAGAGTTTGGGTGAGCCCGCTAC 
TCAGATGACACTGAACACCTTCCATTTTGCTGGTGTGTCTTCAAAGAACGTAACATTGGGTGTGCCGCGTCTCAAGGAGATTATCAACATATCCA 
AAAAGCCCAAAGCGCCGTCGCTAACCGTTTTCCTTACTGGCGGCGCTGCTCGCGATGCGGAGAAGGCCAAGAACGTACTGTGCCGCCTGGAGCAT 
ACCACGCTGCGCAAGGTTACGGCCAATACGGCTATTTACTACGACCCGGATCCGCAGAGAACGGTGATATCCGAGGATCAAGAATTTGTGAATGT 
CTACTACGAAATGCCCGACTTTGATCCCACACGCATTTCGCCCTGGTTGCTACGTATTGAATTGGACCGCAAGCGGATGACGGACAAGAAGCTGA 
CAATGGAACAGATCGCCGAGAAGATTAACGTGGGCTTCGGGGAGGATCTCAATTGTATTTTCAATGACGACAATGCCGATAAGCTGGTGCTGCGC 
ATTAGGATCATGAACAACGAAGAGAACAAGTTCCAGGACGAAGAC-GAGGCCGTCGACAAGATGGAGGACGACATGTTCTTGCGCTGCATTGAGGC 
CAACATGCTGTCGGACATGACACTGCAGGGTATCGAAGCCATCGGCAAGGTGTACATGCATCTGCCACAGACCGACAGCAAGAAGCGTATCGTGA 
TCACTGAGACCGGAGAGTTTAAGGCCATCGGCGAGTGGCTGCTCGAGACGGACGGCACATCGATGATGAAAGTGCTGTCTGAGCGTGACGTGGAC 
CCAATCCGAACATCCTCCAACGATATTTGCGAGATATTCCAGGTGCTGGGCATCGAGGCGGTGCGAAAGTCTGTCGAAAAGGAGATGAACGCCGT 
GTTGCAGTTCTACGGCCTGTACGTGAACTATCGTCACTTGGCCCTGTTGTGCGATGTGATGACGGCCAAGGGCCATCTGATGGCCATCACCCGTC 
ACGGCATTAACAGGCAAGACACTGGTGCCCTTATGCGTTGCTCCTTTGAGGAGACAGTGGATGTGCTAATGGATGCCGCCGCTCACGCCGAAACG 
GATCCCATGAGGGGCGTCTCTGAGAACATTATCATGGGACAGCTGCCCAAAATGGGTACCGGCTGCTTTGACCTTCTGCTCGATGCAGAGAAGTG 
TCGTTTCGGCATCGAGATTCCCAATACGTTGGGCAACAGTATGCTAGGTGGCGCCGCTATGTTCATTGGCGGTGGATCGACACCGAGCATGACGC 
CACCGATGACGCCGTGGGCTAACTGCAACACGCCGCGATACTTCTCTCCACCCGGCCACGTAAGTGCCATGACTCCTGGCGGTCCCAGTTTCTCG 
CCTTCGGCTGCATCGGATGCGTCCGGAATGTCGCCTAGCTGGTCGCCGGCTCATCCGGGCTCATCGCCCAGTTCACCAGGACCTTCGATGTCGCC 
GTATTTCCCAGCCTCGCCGAGTGTTTCTCCCTCTTATTCGCCAACGAGTCCGAACTACACGGCATCTTCTCCCGGTGGAGCCTCGCCGAATTACT 
CGCCCTCGAGTCCGAACTATTCGCCGACGTCGCCGCTTTATGCAAGTCCACGTTACGCATCGACAACGCCAAATTTCAATCCACAGTCGACGGGT 
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TACTCGCCATCTTCATCGGGATACTCGCCAACATCCCCGGTCTACTCGCCCACGGTGCAATTCCAGTCGAGTCCGTCGTTTGCGGGCAGCGGTAG 
CAACATTTACTCGCCGGGCAATGCGTACTCGCCCAGCTCGTCCAACTACTCCCCCAATTCACCATCCTACTCGCCGACATCACCATCGTACTCGC 
CGTCAAGTCCTTCGTACTCGCCAACGTCGCCTTGCTATTCGCCCACATCGCCTTCGTACTCGCCAACGAGTCCGAACTACACACCCGTAACACCC 
TCATACTCGCCGACAAGTCCGAACTATTCAGCGTCGCCGCAATATTCTCCAGCCTCGCCAGCTTACTCGCAAACGGGGGTGAAGTACTCACCGAC 
ATCGCCGACGTACTCGCCGCCGTCACCATCGTACGATGGGTCTCCCGGATCACCACAATATACGCCAGGATCTCCGCAGTACTCGCCGGCCTCGC 
CTAAGTACTCGCCGACCTCACCGCTGTACTCGCCCAGCTCGCCGCAGCACTCGCCCTCAAACCAGTACAGCCCAACAGGATCGACCTATTCGGCG 
ACGAGTCCGCGGTACTCGCCGAACATGTCCATCTACTCGCCGAGCAGCACCAAGTACTCGCCCACCTCGCCAACGTACACACCGACGGCCCGCAA 
CTACTCGCCCACGTCACCGATGTACTCGCCAACGGCTCCATCGCACTACAGTCCCACGAGTCCGGCCTACTCGCCCAGCAGTCCCACGTTCGAGG 
AGAGCGAAGACTGAGGAAGGGAGGACGGGGG 
(SEQ ID NO: 110) 

Start ATG: 336 

MSTPTDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGVQFAETMEGGRPKLGGLMDPRQGVIDRTSRCQTCAGNMTECPGHFGHIDLAKPVFHIGF 

ITKTIKILRCVCFYCSKMLVSPHNPKIKEIVMKSRGQPRKRLAYVYDLCKGKTICEGGEDMDLTKENQQPDPNKKPGHGGCGHYQPSIRRTGLDL 

TAEWKHQNEDSQEKKIVVSAERVWEILKHITDEECFILGMDPKYARPDWMIVTVLPVPPLAVRPAVVMFGAAKMQDDLTHKLSDIIKANWELRKN 

EASGAAAHVIQENIKMLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVITPDPNLRIDQVGVPRSIAQNL 

TFPELVTPFNID^QELVRRGNSQYPGAKYIVRDNGERIDLRFHPKSSDLHLQCGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFR 

MNLSCT SP YNADFDGDEMNLHVPQSMETRAEVENIH I T PRQI I T P Q AN KP VMG IVQDTLTAV RKMT KRD V FIT REQ VMNLLM FL P T W D AKM PQ P C 

ILKPRPLWTGKQIFSLIIPGNVNMIRTHSTHPDEEDEGPYKWISPGDTKVMVEHGELIMGILCKKSLGTSAGSLLHICFLELGHDIAGRFYGNIQ 

TVINNWLLFEGHSIGIGDTIADPQTYNEIQQAIKKAKDDVINVIQKAHNMELEPTPGNTLRQTFENKVNRILNDARDKTGGSAKKSLTEYNNLKA 

MVVSGSKGSNIWISQVIACVGQQNVEGKRIPYGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGYIQRR 

LIKAMESVMVNYDGTVRNSVGQLIQLRYGEDGLCGELVEFQNMPTVKLSNKSFEKRFKFDWSNERLMKKVFTDDVIKEMTDSSEAIQELEAEWDR 

LVSDRDSLRQIFPNGESKVVLPCNLQRMIWNVQKIFHINKRLPTDLSPIRVIKGVKTLLERCVIVTGWDRISKQANENATLLFQCLIRSTLCTKY 

VSEEFRLSTEAFEWLVGEIETRFQQAQANPGEMVGALAAQSLGEPATQMTLNTFHFAGVSSKHVTLGVPRLKEIINISKKPKAPSLTVFLTGGAA 

RDAEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQRTVISEDQEFVNVYYEMPDFDPTRISPWLLRIELDRKRMTDKKLTMEQIAEKINVGFGEDL 

NCIFNDDNADKLVLRIRIMNNEENKFQDEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKVYMHLPQTDSKKRIVITETGEFKAIGEWLLET 

DGTSMMKVLSERDVDPIRTSSNDICEIFQVLGIEAVRKSVEKEMNAVLQFYGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFE 

ETVDVLMDAAAHAETDPMRGVSENIIMGQLPKMGTGCFDLLLDAEKCRFGIEIPNTLGNSMLGGAAMFIGGGSTPSMTPPMTPWANCNTPRYFSP 

PGHVSAMTPGGPSFSPSAASDASGMSPSWSPAHPGSSPSSPGPSMSPYFPASPSVSPSYSPTSPNYTASSPGGASPNYSPSSPNYSPTSPLYASP 

RYASTTPNFNPQSTGYSPSSSGYSPTSPVYSPTVQFQSSPSFAGSGSNIYSPGNAYSPSSSNYSPNSPSYSPTSPSYSPSSPSYSPTSPCYSPTS 

PSYSPTSPNYTPVTPSYSPTSPNYSASPQYSPASPAYSQTGVKYSPTSPTYSPPSPSYDGSPGSPQYTPGSPQYSPASPKYSPTSPLYSPSSPQH 

SPSNQYSPTGSTYSATSPRYSPNMSIYSPSSTKYSPTSPTYTPTARNYSPTSPMYSPTAPSHYSPTSPAYSPSSPTFEESED* 

(SEQ ID NO: 111) 

Classification: enzyme 
Gene Symbol: Rpll215 
FlyBase ID: FBgn0003277 



Celera Sequence No. : 142000013383748 

CTCAATAAATTCGTCCCCCTCCATATAATAACCCCCCCCCCCCCACAGAGAACAAAACCAAAAAACAAAAAAAAAAAAGGTATCGAGCAACTTCA 
CAAAAGACCACCACACAACCCAGCCATCTCTTCGATCCGAATGGTTCGCTAGTATAGCCACAGTACAACTATGGTACACATTGGTTTTTAGGCCT 
TAAGACTTTGGTAGATTTTAATTCAGAAAATGAAAAGATAAAAAGAACAAGCCACCAGCGCAACTACAACTACAACACACAAAGAGTGAGGCATC 
CGATGAACAATGGATGTCAAAGGTAAATTTATTGTATGTCCGCGAACCTAAATTATTTATCCACTTGGTTTTTACCCTAAATCTGTCGCAACACA 
AAACATGTCAAGAGAAAAGTGAAAAAAAGGXACTATATGCGATTGTTTGGCGTAAGTGCAAGTAGTTACATAAGTCGTGAAATGAGAATCAATAA 
ATTGAACCTAATTTAAAACCGAAAATATACACAATGGGTGTTTTTAATGGGAATGTCGGGATTTTCAGCTTTTCTTAAAAACAAATTACAAAAGG 
ATACCCATTGCCCAAATTGAAATTAACGCAGCGCTGGCTTAACAATAATAATTATTTATTTATTTTCAAAGTTTCTTCTTGTCTATAACAATTAG 
TAATGACTTGAACGCACACACATCATATGATTTTCGTGATCGATTCGCGGGGGATAAGCACATGCAGGGCTATATATATTTATGTGTAGATAAAT 
ATAAATATACACAATGCAAATAACTTTCGTTTAGAACAATATCAGAATTTTCAATTATGCGCCGACCAAAAGATGGTAAACTACTTTTGGTTCTA 
ACAAAAATTATTGTCAAATCTCCGCCAGGAGTTTTCGTTCAATTAAGAATCAAAGACTACATGGTTAAGTGGGTTAATTGCTAACTGGAAACGTT 
ATCGTGGCGATAAAACCTGCGCGGGTTGGGTTCACAAATCATAGGGTTTTAAAAATGTTTTTATATATTTCTCATATATATCTATATATGCATAT 
AAAGAAGAGCTTAGAAGCATTCAGCTATGACACAGTAACAAAGAGGAAAAAAAAAAAACAATTAGGGTAATTTAAATTTTTAATCCTAAGCCGGA 
GGATTGACACTTAACACAGGGATCGAGGACGGGGAAGTTGGCAGCTGATCTACTTGTACTTCTTCATGAACTCGGCGTGGAACACACGGAACTTC 
TCCATCATCAGCTTCAGCTTGTCCGTTTGCGGGAGGATCGTGCTTCGGAAATGCCACGTGCCGGGCTTCTGACCAAATCCACTGCCAGGAACAAT 
GCAAATGCCTGGGAGAAAATTAAACAATAAATTGAGAGACTAACAAACGAGGTCAGCAAAGAAAACTCACCGCTCGTCTCGAGCAGCTCAAATGC 
GTAGAAAACATCGGGGGCCATGCCCTTGGCCTTGGCCGCCTCGATCGCCTTGGGCGGGATCTCGATCTGTGGAAAGACGTACATGGCGCCCTGTA 
CGGGGTTAACCTTGTAGCCTTCGAAGCTGTTCAGTGCCTTGTGGACGAGCTCGGCCCGTTCTTTTAGAGCGGCCAGAATGCCATCGCGCTCCTTC 
TTGTACAGATCGTATGATGGCTCTCCGGGCTGCGGTGGATTGACCAGGGCACTCACGGCCACCTGGCCAGCGGTGGTGCTGCAAAGCGCCGCCGT 
TATCGACTTGGTCAGCATGGCCTTGACCTTGGGATCAAGATTGAGAACCTCCATGTAACCGCCGCGAATGCCGCACTCGCCCAGATAGCCCTTCG 
AGGTGGACAGGAAACTGACCATTTCCAGATTACGATAGGGGTCGCCCATTTCGTAGGCCACCTTCTTGAACGACCAGAACTTGGAGTTCTTGTCG 
TAGACATTGTCCTGGTACACCTCATCGGCCAGCACCAGCACCTTGTTATCGTGTGCGAACTTAATGATCTCCTCGATGTTCTCGCGGGTCAGTAC 
CTGTCCGGTGGGATTGCCCGGATTGATCACGACCAGGGCACGCGGATTGCAGACCTTCTTCGCCTCATCGTAGGACCGTTGCAGCTCCTTCCTGT 
CCAGGCTCCAACCGGTCTCCTCCTCCAGATAGTAATCCACCTTGGTCATGCCGTATTCGGAGATGGTGGCCGAGTACAGTGGGTACTGCGGAATG 
GGCACCATGACACCAGGCGCCTTGCATCCCACCTCGGCGCTGAAAAAAGAAACGTAAAGGTAAGCTTCAAATGTATTTAGAATTCCTTAGTAAAT 
GCTCAGTAGTATTTCATAAACAAAGTGTTTGAATAATACTACTGAGCAATAAACAATAATAATATTTATCTATCTGGCTGCCACATAAGTTGTCA 
ACCCAAAAAATGGCACAATTTTGGGCCATATCTCTGGCAAATCTCTGGTTTTAACTGATAAGTCTCGACAGAAGAAGCGCCACATCATGGACAAT 
AAAAATGTTGGCCCTGGCGACATACTTGATCATGGAGAGAATGCTCTTGATGCCGGGAGAGGCACCGCCGGTTAGATAGATGTCCTGCCAATTGG 
AGGCGATGCCGCCATCCCTTTTCTCGATGTACTGAGCAACCTGGCGACGCACCACCTCCAGACCGGCGGAGTCGGTGTACGAGCCCACCGATTGA 
CCCTGGCAGCCGTTCAAAATGGCACAGGCGCGCTTCTTGACGTCCTCGGGATAATCGGGTGAATCCAGCAGACGTGTCTCGAAGGTCAGCGCCAA 
CAGCTGATAAGTAAAATATACAATACATTTAATGAAATCATATCATAAATGAAGGGGCCAAAACTCACCTGTCGCAGGAAGGTCAAGGGCTGCTG 



FIGURE SHEET 46 



GCCCATCGCATGGCAATCACCGATGTTGGCACGGATCACCTGGTCGAATGGCTTCTTGACACCCTGCAATCAAATTCAATCGGAACGGAATTACA 
AATTAGCCATATAAAATAACTCATTGTTTGCTCGCTGGCCAAATGAAAGTGTAATCGCTGCAGGCTGGCGGATCAAATGGGCATTTAATTTGATT 
TGTGGCTTTCGCCATGCAAATTACACTCAAGTGGATTGCCAAGCTAAATTGAAGACCAGCCAGAGTACAGAGTAATCGATTAAGTATCAAATTCA 
ATCAAATATTTATCAGTAATGAGCCATTGTGGATTCAGCAAATGGTATTCTCTTTCAAATAGACGCCTGCTAGAAATAACTATCAAATGACCATC 
ATTATAGTTACTTAACAATGTGAACTAGTTTACATTCAGTAGCAAATCAATGGTGCAATCTGTTACGATCTTTATAATCCAGCACACTTTGGGGA 
ATCCCAAATTAAAGTTTCTAAACGAATTAAAGTCATTGTAGACACAGGCAACCAGTAGTTTAGTTTCCCGCACGAATAGAGGTCACAATAATCGG 
CTGTGGAAATTGTCAGCTGAACAAAACACACAATTTGCAGGGCACATGCTAGTTGCACTCGTTCATGGAAGACAACATCTCATCGTCTCGGGTTC 
CATTTAACCACGCTCAAATGGCCAATTTACCACATAATGTCACGGAGGTTCGAGGAGCTGCGATGATATTGTGTGTTTTTTTTTTTTTTTTGAGA 
GGCGAAACAATGCAAGTTGCGCCATGTAAGCGAACTGCAGCGCTGCCATTGACGTCACTCGGCAGTGGATGTGTTTTGCCTGACAATGTGATAAT 
TGGAAAATTAAGGAATTCTGGGCTGTGGAAAAATCATCCAGTGCTCGAGTTTGGTTACTAAGAGTATAAAATCGTCAAGTTCTCACCGTGCAAAC 
GTGTGTCACCTTATCGAATTCAGTTAATTATTTATTATTTATTTTTTTTTTTTTTAATTTCGAAAATTACTGATAAAAGTGTTGCCGAGTCGCGG 
CAACTGAACTGGCTTGTGTTTGTAGTTCGTGTGGAAATATGTATAGAAATGGGTCCTCTATCGCATCGCAAATCGAACAGTGCTGGTGAAAAAAA 
ATGATATGCCAAAGTAGTTTCGAATATTTCAAAACCACAAAACCGCAGAGATAGTGTTAAGAAATCAATTAATAGCCCATCTAAACTCAGTTAGA 
AATTGCTAGTTAAGTTTTTAGCATCGTTTTGGAAATGGCTACTAATTAACTGACCAGCGCTGTTCGCACCTAAAGAATATTGCTTGCATATTCCT 
TTCTACCCTTCACTTCGCACTTGAAATCGTTGTAAACAAACTTTTTGTGCTAAGTGGCAATATAATTGATAACTACATATATATGCACAACATAT 
ATACTCGTGCAGGCAGGTAAGAGTCGTATAATCATATAGGCGCTGTATAAAAGGCGTTAGCAACTCAAACATTATCGCCAATACGAATCGAAGGA 
AAGTATAATAATATGTTGGCAAATGTTAATGCCAAAAGGAAGTCAAAGTAAACAGAGGGAAATGGGCATGATTAGGAATTAGAAATGATTATTAC 
ACGGGCACAGAATTGTTAATAATCCTATATCGTAGTGAACAGGACGAAGTGAAGTACATAAATGCCAGTGATAAGAATGCCTGCTGCTGACAGAT 
GATGAGTTTCTTTTGGACCATT GC C AAAT C C GT AT GT T G AAT CT GT GGGC AAGT GG AGG AAT T T G AT AAAAC GC AG AAT G AAC AGC AC AGC AG AT 
TCAGTCGTGCGGATAAGGAAATCAAATCGCAGCGGGAAAACTGGTGGGGGAAATCAACTGGCAGCGGCATTAAAAACGCAAACACAGAATGTGAA 
AAGTCAGCGGGGCTCACAATGGATTTAACTATATGTATATACATATGTATATCTTTAAATATTCATCACATATTTGTGTACATATGTACATATAG 
TATCGTTGCGGGTCACAAAAGTTCGAAGAGAATTTTGCAAAACAAATGAAGGAAAAAAAAAATGGAAAGCAAGAAGTGCAGAGAGAAAGCATCGT 
GTGTATACAACATGTTTTCCCCACAGACTCATGCGCATGTTCGGCCTTTGTTATGATTTTTCACTTTCCCAGTGATAAGAAAAGGTCCACGGAAT 
CAGAATCCGAATCAGAATCACGCCCACGCTCGCCCACAATTTCCGCACTTATGTAAGCCAAGTTGAAGGTCGTTCAGTGAACCCGCATCGCATCG 
CGTCGTTCGCTCTTCATCTAAATACTTTATTTAATTGATCCAAGAAGCACAATTTAGAATAATTACAAACATTAACTGGCAATTAAGAGCATGAC 
TCGTTATGGAAAACGGCGAAGAAACGCCGTTTTTTTTGTTTTTTTTTTTTAGATGAGAAAACCGGTTAATTTGGGCCAACTGCCTGTGGAAAAAT 
CATTCACATGCCGAGCAGTACTTAAATTAATTAGTTTATTGACTTGTTGTTTGCTCTAACCCAAAAAGTGTAATTTCCCCTTAGCCCAAAGTTCA 
CTTATAAAGATATATATTCCGTTTTAGATAAGTCTCGATATTTTTATAGCGGTCTGATTACAATTTCAAGTCTTATCTGAACTTTTGGATTACTT 
CTGATAGCTTTATTGTTGCCCAAAAATTGTTTGTGTTTAGTTGAAGTAAATAAAATTCCGAGCTCTATTTTTGGCAAATTTTAATGGGTTTAGTT 
CAAGTGGTTCAAACACTACTCTAAATTCTTGATGTGTAATAAAAAACAAACAATTGTTTTGATCAACTTTGTGCTCTTTGGTAAACATTTTTCGC 
TCAAAACTGAAATTAAACATATGGCATTTTACTTTCGTAAGCTTTTTATGCTCGTGGAATAAGAAATGGGTTTGGAAACTGGTTTTTTTTTTATT 
CGACAAGCTTATTGTTTTTGCAAAAATGTGTTTTTTGCAAAACAAAACCTTATCGCATTGAATAAACAAAAGCTAAAGTTTTCTATGTTTGTTTG 
TTGAAACTTTATACATTCGATAGATTTTTTATTGGATTGCTAGCTGAATTGGTACTCCCCTGGCTGGTTCTTATCACACTTCCATTCTTATCAAT 
CCAAATTACTTCATTTCGTGGGCGTATTTTCCAACAGGTGTCCTTGAGAAAGGGGTGACCACGATTGCCAGCCGAAGGTGAATGATTCAGCGCTG 
CGTGGCCGCTCCATTCAAGCGCGAATAAAAAAATGAAAAGGGCGGCAAAAAACAAACACAGACGCGCCTCACACTTGATACATACATACATACAT 
AAACGTGCATGTGTATATAAGTAGATATACATAGAGACATAGCGTGCTTAGATATTAAAATGCGGGCTTTGAATTGCCATTAAATTGCCATTGAA 
TAGAGGGAAGGGTAGGGGGGTGTGTGCAAAACACTAGTTAATGGCGTCGCACTTTGCCGCTTTGTTTGCCCGATTACTTGTGGATCAATTTCCCA 
TGGCATTAACTAACACTCACCTTTTCCAGTTCCTTCTCGATTTCCCCAGCACGAATCACCAGAGGACCGCGAACGGCATATTCCATGGCAATAAA 
GTTGGGATTTATGTTATCCAGTGTGAGCGCTTTCGACGACGACGGCATTTTGTGGCTGGTTGAAAAGGAACTGAGAAAAAAAAAGAAGGGGATGG 
ATATTCACTTCAGCAATCAACAATCACTTCTTGTTCGATATATAACGTGTTTTAAATGTGTTTTAAAATATATTGTTTTTGTTATGCGGCTTTTT 
CAGCGAACGAAACTAACCGGCGGACGTCTAACGGTTGAATGAAATCGCCGCTTGTTCCATGGCGCTGCGACGTCGCTGCTGCCGTCACCGTTGCT 
GTTGCCGTTGATTTCGCCGCACGTCGATTGGCATTGTTGCTGTTGTTGTGCAGGAACGATTTCCAGCGGCGAATAACGTACTGTTGCTGCTGGCG 
CTGGTGCTGCTGCAATGCGGCCGAAACTGTGGCGGCAGCAGGTTTAATTAGCATTCGACTCATTATTTCTGCAACGAGAAGAGCGTCTCAGTAAG 
T AAAT T ACTC AAAAT AC AAAAAAAAGGAAAGAGAC AGACCGCGC TTTAAT CGCAAGAATT TAGCACC ATGAGAAC AGCGGAAAGAAGAAGAAGCA 
GCGAAATGAAGGGTGTGTGCGTGTATAAGAGTCCGGCTCACAATTATAAGAGCGGCGAATTGAAAGGCTCAGCTGTTATCACTACACCGAAAGCA 
ATTAGTTTCGAAGCAGGGAATCATTTAAATAACTTTGTAGCTAGGAAAACTTAAGCAACTATAGATATTTCGATGTGAAATATATATCATATTAT 
ATTAAGCATTTTTATATGGTATTGTTAGCACGCGCTCAATATAATATTCAAGTACCTAAATACATTTTCTTCACTGTGCATTTTTGTATCTTATC 
AAAGACAAGGCGTGGTATCATTTACTTTTTCAGAGCTTATCAATGTGATTGCTTGTGGTTCGATACTTTATTGCCTTTCTGTTTATCAAGAAGCT 
AAAGTGTAAATATTTACGACCTTTGGAAACTTTGCAATTAAATTCGCCGTGCATTTATCCTAGACTCAAACCCAATTGCACTCGCCGCAAGGCGC 
GCATTTCAAAACTGCAATCGAGCATTTCTTATCAGAGTTTTGGGGCTTTGCACAAGCAACTGCAAAGCTGAAAAATATATATTTTATTTGAGGAG 
GAATATATATTTTTGGAAAATATAAAAACGGTTACCGGCTGAATAAAATGAATGGCCATAAAGCCTAATCTCAAGAACATTTATTGCTTGACCAT 
CCAGAAAATTTAATTAAAAATGGGACCGTTTAGAATCTACTTGTGTGTGCTGCTATATATTTCACTATATTTCTTATCTTGGTGGAACTTTTTTT 
TTAGAGGCAGTTAACGGATTTGTGGGCCATTTACCCAGGCATTATGTATACATATGTATGATTTAATAGCTATATTTCTCCATTAAATCCACTGG 
CAGGGCATCCATTGGTGGGGGCCACTATCAGTTGAGCGTACTTTTGACTTTGAGCCAAATCGGCGCAGCTTATCAGTGACGTCCCATTCGCAGCT 
TACGATTCCCTGTCTTTCAATTGAATATATGTATATAGCTTTTCTGATTTCTGCACTTTGAAACGCCAATTGCGTAATGCAAAAAATGAGCAAAC 
GAACAAAGAATCTTTTAGAAACATGACAATGACCAAAGATCGACATTACCTCCAAAATACTATACTTTTCGCACTCAAAAATCAGCATCAATCAG 
CCGCAGATTGTTTGCTTTCTTTTTAGCGACAGCGAATTTATTTGTTTTTAAATTAAAAATGTACTAAATGCCGGTCCCGAAAAATACACAAATGT 
ATGCTATTTTATATACGCACATATGTATCTTTGGAATATATGAATGTATGTGTATATATATAGGTATATTGTTTAGACGAGACAAACTGAAACTT 
GTTGGCAATAACTTTTTGGAAAACCGGTTGCGGTTTTGATGGCATTGATATAGACGAAAGGGTTAAGAATTAGGGGAAAGGGGTAAGAATTTGGG 
AATGAGAAAACATCGATACAGTGATCTCTGCTTAATTAAGACAAATTCAATACGATGCCGATATATAGATATAGATAGATATAAAATAATATATC 
CAATAGAATTTTTATATGATAACCATCTAAAGCAGCTGTGCATAGAACTTTTGCGTATTAATTTAAGAAACATTACCTCCAAATATTTATTTTAA 
ATAAATTTTTCAAATTTTAACATTTTTTTTTTTTTGAGACAACTTTTGTTAGTCACTGTTTATCTGCGATATATATATATGCAGCGTATCGCTTT 
TGTTTGTGATTGTTTTGACAACAAAACTGGCTTATGCAACGAACTGTTAAAAAAAAATCTGAAACAGAAAATTATGTGAACTGGCTGCACTTTGC 
TTCAAAGACCCCGCTCCCCCCTGCACTTTCACCCCCCGCATTGGGGGGATTATCAGACAAACCAACTGGATAAGATAGTAACGATAAGGAATCCA 
ATTAGAACACGGCCATTCGGATTTACTAAAATCACAACTTTTGCCAAACGTTAGTAAATTATATTAAATTATATTGTAACTCCGATTGACACGTA 
GAGCGCAACGAACAACAGCAACACAAAAAGAAGCAGAAATAACGGGTAAAGGGGGTGTTTTGATGTGCTAATTGGGTTGCTAAACAGCTGTTGAC 
GCCGCACAAATTGAGCACAAAAAAAAAAACAATAACACGGCACGCAATTCAATTAGGGAAACGCACGCAGAACGATAAACAAAATCAGGGTACTA 
TTGTGTATTAGATTTTTTTTTGGGCGCACAAATCACTTTCACTGTCACTTCCTCGCTGCTCCCAATTAGAATTTCTCGGCCACTGTGTTTATATA 
TATGTACATATATGCACATGCATATATACATATATACGTCGGCAGTTCAAGAGCAAAAAATCGATTTTTCGCTGCACTTTTTAGTCGTTTCACCT 
TGATGGGCGCAAGTGGGCGTGGCGATTATCTAACGGCGTTTGTTGTGATCCCAAGAACTTGCATAAACTTATTAATAAACGCGATATGTCTGTAT 
ATAAAATACCTAATAAAAACGTGTGTGCTTGCTTGGCTGCGTGTGTGTGCGCGTACGTGTGTGTGTGTGTGCGAATATGTGCGGCGTGCGAGCTC 
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CAATCAGAATCGTATCAAAAATTCAAAGCTCTTTGGCCGCTAGACGCAATTAAGGAACTGACAACAAGAACTGAAACCGCACCGAGATGGAAAAT 
GAAGTGCTGGCAGTATTTACCGACTATATGTTTTTCTTGCGGGCAAAAGTCACGATGAAGCGGGCAGCGATTAAGTTTTAAAAATATCACGTCTG 
CAGCCCTCGCAGCCAAACTTTCGATAGACGGCGATAGATCGATAGACCAGCGATTTGCACCAACACTGGAAAGCTCAGAGCATTCAAAAACAAAC 
TAATGTGTTTTATAAATAGCTCTTTTTAAATATCGATTAATTGCGATCTTAACTAACTTTCGTTTATTTCTATTAATTATGAACTTACTTCTGTT 
TTTAAACATCTTTGCTCTTTGCTTGTTTTTCGTGTAAGGCTTTAAATTCACTTAAACAATTTTGCTGGCTTTGTGCGCATCTTCATATATCACTG 
AAATTGATTGTTATAGGAGTGTCCCATTGAATTAGTTTTGTGTCGATTCAATCCTGGCCCGAAAATGTATGAACAAGCTATGCCCTACATTTTAC 
CCATAGTCTACTTAGGTTTGTTTATATAGTATTCTTTTCAAGTTCGTTATGATAGCTCTTAGAAAATAAGATTTCTGTGAATATCTCCTCCGTAT 
AAAAGAATTTTTATCAATAGTTTGGTAAGCTTTTTTTTCTCTTCTACAAAACCTTTTCGCATGTTTTATTTCACTCGAAACTTATTGCTATTCAA 
AAAAATGGGTTTTCTAACGCTAGAATTATGTACAGAGTGCATTTGGAAAAAATGATGGGTAGTAAGCCCTAGAAAGCCATGGCTACGTGCTGATG 
CCCATGATGGATTCGATGGAGAAACTCAGCTTGGGCTTCTTGCCACTGCCGGTGGAGGCAGTGGTGACTGCGGGCGTGGTCATCGCTGGATTTGT 
GGGCGGGATCGTTAGCGGTATCGTCGGCGGCATCGGGTGATAAATGTGCTGCTCCTGATGAACGGGTGCGCAGGCGGCGGCGGCGGCGGCAACGG 
TGGCAAAGTACAAAGCGTTACCGCCCAGCGTCGCCTCCATGACATGGTTCAGTTCCGGATCGGCGAATGCCCGCCGTTTGCGCATGCGCTCCATC 
AATG 



(SEQ 


ID NO: 


112) 


Exon ; 


9739. 


.9404 


Exon: 


6908. 


.6668 


Exon: 


6530. 


.6386 


Exon : 


2913. 


.2824 


Exon: 


2758. 


, .2496 


Exon: 


2224. 


.1401 


Exon: 


1338. 


.1001 


Start 


ATG: 


6508 (Reverse strand: CAT) 



Transcript No. : CT4436 

CTTTTGCCCGCAAGAAAAACATATAGTCGGTAAATACTGCCAGCACTTCATTTTCCATCTCGGTGCGGTTTCAGTTCTTGTTGTCAGTTCCTTAA 

TTGCGTCTAGCGGCCAAAGAGCTTTGAATTTTTGATACGATTCTGATTGGAGCTCGCACGCCGCACATATTCGCACACACACACACACGTACGCG 

CACACACACGCAGCCAAGCAAGCACACACGTTTTTATTAGGTATTTTATATACAGACATATCGCGTTTATTAATAAGTTTATGCAAGTTCTTGGG 

ATCACAACAAACGCCGTTAGATAATCGCCACGCCCACTTGCGCCCATCAAGAAATAATGAGTCGAATGCTAATTAAACCTGCTGCCGCCACAGTT 

TCGGCCGCATTGCAGCAGCACCAGCGCCAGCAGCAACAGTACGTTATTCGCCGCTGGAAATCGTTCCTGCACAACAACAGCAACAATGCCAATCG 

ACGTGCGGCGAAATCAACGGCAACAGCAACGGTGACGGCAGCAGCGACGTCGCAGCGCCATGGAACAAGCGGCGATTTCATTCAACCGTTAGACG 

TCCGCCGTTCCTTTTCAACCAGCCACAAAATGCCGTCGTCGTCGAAAGCGCTCACACTGGATAACATAAATCCCAACTTTATTGCCATGGAATAT 

GCCGTTCGCGGTCCTCTGGTGATTCGTGCTGGGGAAATCGAGAAGGAACTGGAAAAGGGTGTCAAGAAGCCATTCGACCAGGTGATCCGTGCCAA 

CATCGGTGATTGCCATGCGATGGGCCAGCAGCCCTTGACCTTCCTGCGACAGCTGTTGGCGCTGACCTTCGAGACACGTCTGCTGGATTCACCCG 

ATTATCCCGAGGACGTCAAGAAGCGCGCCTGTGCCATTTTGAACGGCTGCCAGGGTCAATCGGTGGGCTCGTACACCGACTCCGCCGGTCTGGAG 

GTGGTGCGTCGCCAGGTTGCTCAGTACATCGAGAAAAGGGATGGCGGCATCGCCTCCAATTGGCAGGACATCTATCTAACCGGCGGTGCCTCTCC 

CGGCATCAAGAGCATTCTCTCCATGATCAACGCCGAGGTGGGATGCAAGGCGCCTGGTGTCATGGTGCCCATTCCGCAGTACCCACTGTACTCGG 

CCACCATCTCCGAATACGGCATGACCAAGGTGGATTACTATCTGGAGGAGGAGACCGGTTGGAGCCTGGACAGGAAGGAGCTGCAACGGTCCTAC 

GATGAGGCGAAGAAGGTCTGCAATCCGCGTGCCCTGGTCGTGATCAATCCGGGCAATCCCACCGGACAGGTACTGACCCGCGAGAACATCGAGGA 

GATCATTAAGTTCGCACACGATAACAAGGTGCTGGTGCTGGCCGATGAGGTGTACCAGGACAATGTCTACGACAAGAACTCCAAGTTCTGGTCGT 

TCAAGAAGGTGGCCTACGAAATGGGCGACCCCTATCGTAATCTGGAAATGGTCAGTTTCCTGTCCACCTCGAAGGGCTATCTGGGCGAGTGCGGC 

ATTCGCGGCGGTTACATGGAGGTTCTCAATCTTGATCCCAAGGTCAAGGCCATGCTGACCAAGTCGATAACGGCGGCGCTTTGCAGCACCACCGC 

TGGCCAGGTGGCCGTGAGTGCCCTGGTCAATCCACCGCAGCCCGGAGAGCCATCATACGATCTGTACAAGAAGGAGCGCGATGGCATTCTGGCCG 

CTCTAAAAGAACGGGCCGAGCTCGTCCACAAGGCACTGAACAGCTTCGAAGGCTACAAGGTTAACCCCGTACAGGGCGCCATGTACGTCTTTCCA 

CAGATCGAGATCCCGCCCAAGGCGATCGAGGCGGCCAAGGCCAAGGGCATGGCCCCCGATGTTTTCTACGCATTTGAGCTGCTCGAGACGAGCGG 

CATTTGCATTGTTCCTGGCAGTGGATTTGGTCAGAAGCCCGGCACGTGGCATTTCCGAAGCACGATCCTCCCGCAAACGGACAAGCTGAAGCTGA 

TGATGGAGAAGTTCCGTGTGTTCCACGCCGAGTTCATGAAGAAGTACAAGTAGATCAGCTGCCAACTTCCCCGTCCTCGATCCCTGTGTTAAGTG 

TCAATCCTCCGGCTTAGGATTAAAAATTTAAATTACCCTAATTGTTTTTTTTTTTTCCTCTTTGTTACTGTGTCATAGCTGAATGCTTCTAAGCT 

C T T C T TT AT AT GC AT AT AT AG AT AT AT AT GAG AAAT AT AT AAAAAC AT T T TT 

(SEQ ID NO: 113) 

Start ATG: 600 (Reverse strand: CAT) 

MPSSSKALTLDNINPNFIAMEYAVRGPLVIRAGEIEKELEKGVKKPFDQVIRANIGDCHAMGQQPLTFLRQLLALTFETRLLDSPDYPEDVKKRA 
CAILNGCQGQSVGSYTDSAGLEVVRRQVAQYIEKRDGGIASNWQDIYLTGGASPGIKSILSMINAEVGCKAPGVMVPIPQYPLYSATISEYGMTK 
VD Y YLEEETGWSLDRKELQRS YDEAKKVCNPRALVVINPGNPTGQVLTREN I EE 1 1 KFAHDNKVLVL ADEVYQDN V YDKNS KFWS FKKVAYEMGD 
PYRNLEMVSFLSTSKGYLGECGIRGGYMEVLNLDPKVKAMLTKSITAALCSTTAGQVAVSALVNPPQPGEPSYDLYKKERDGILAALKERAELVH 
KALNSFEGYKVNPVQGAMYVFPQIEIPPKAIEAAKAKGMAPDVFYAFELLETSGICIVPGSGFGQKPGTWHFRSTILPQTDKLKLMMEKFRVFHA 
EFMKKYK* 
(SEQ ID NO: 114) 

Name: putative aminotransferase 
Classification: enzyme 



Celera Sequence No. : 142000013383748 

CTCAATAAATTCGTCCCCCTCCATATAATAACCCCCCCCCCCCCACAGAGAACAAAACCAAAAAACAAAAAAAAAAAAGGTATCGAGCAACTTCA 
CAAAAGACCACCACACAACCCAGCCATCTCTTCGATCCGAATGGTTCGCTAGTATAGCCACAGTACAACTATGGTACACATTGGTTTTTAGGCCT 
TAAGACTTTGGTAGATTTTAATTCAGAAAATGAAAAGATAAAAAGAACAAGCCACCAGCGCAACTACAACTACAACACACAAAGAGTGAGGCATC 
CGATGAACAATGGATGTCAAAGGTAAATTTATTGTATGTCCGCGAACCTAAATTATTTATCCACTTGGTTTTTACCCTAAATCTGTCGCAACACA 
AAACATGTCAAGAGAAAAGTGAAAAAAAGGTACTATATGCGATTGTTTGGCGTAAGTGCAAGTAGTTACATAAGTCGTGAAATGAGAATCAATAA 
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ATTGAACCTAATTTAAAACCGAAAATATACACAATGGGTGTTTTTAATGGGAATGTCGGGATTTTCAGCTTTTCTTAAAAACAAATTACAAAAGG 
ATACCCATTGCCCAAATTGAAATTAACGCAGCGCTGGCTTAACAATAATAATTATTTATTTATTTTCAAAGTTTCTTCTTGTCTATAACAATTAG 
TAATGACTTGAACGCACACACATCATATGATTTTCGTGATCGATTCGCGGGGGATAAGCACATGCAGGGCTATATATATTTATGTGTAGATAAAT 
ATAAATATACACAATGCAAATAACTTTCGTTTAGAACAATATCAGAATTTTCAATTATGCGCCGACCAAAAGATGGTAAACTACTTTTGGTTCTA 
ACAAAAATTATTGTCAAATCTCCGCCAGGAGTTTTCGTTCAATTAAGAATCAAAGACTACATGGTTAAGTGGGTTAATTGC'TAACTGGAAACGTT 
ATCGTGGCGATAAAACCTGCGCGGGTTGGGTTCACAAATCATAGGGTTTTAAAAATGTTTTTATATATTTCTCATATATATCTATATATGCATAT 
AAAGAAGAGCTTAGAAGCATTCAGCTATGACACAGTAACAAAGAGGAAAAAAAAAAAACAATTAGGGTAATTTAAATTTTTAATCCTAAGCCGGA 
GGATTGACACTTAACACAGGGATCGAGGACGGGGAAGTTGGCAGCTGATCTACTTGTACTTCTTCATGAACTCGGCGTGGAACACACGGAACTTC 
TCCATCATCAGCTTCAGCTTGTCCGTTTGCGGGAGGATCGTGCTTCGGAAATGCCACGTGCCGGGCTTCTGACCAAATCCACTGCCAGGAACAAT 
GCAAATGCCTGGGAGAAAATTAAACAATAAATTGAGAGACTAACAAACGAGGTCAGCAAAGAAAACTCACCGCTCGTCTCGAGCAGCTCAAATGC 
GTAGAAAACATCGGGGGCCATGCCCTTGGCCTTGGCCGCCTCGATCGCCTTGGGCGGGATCTCGATCTGTGGAAAGACGTACATGGCGCCCTGTA 
CGGGGTTAACCTTGTAGCCTTCGAAGCTGTTCAGTGCCTTGTGGACGAGCTCGGCCCGTTCTTTTAGAGCGGCCAGAATGCCATCGCGCTCCTTC 
TTGTACAGATCGTATGATGGCTCTCCGGGCTGCGGTGGATTGACCAGGGCACTCACGGCCACCTGGCCAGCGGTGGTGCTGCAAAGCGCCGCCGT 
TATCGACTTGGTCAGCATGGCCTTGACCTTGGGATCAAGATTGAGAACCTCCATGTAACCGCCGCGAATGCCGCACTCGCCCAGATAGCCCTTCG 
AGGTGGACAGGAAACTGACCATTTCCAGATTACGATAGGGGTCGCCCATTTCGTAGGCCACCTTCTTGAACGACCAGAACTTGGAGTTCTTGTCG 
TAGACATTGTCCTGGTACACCTCATCGGCCAGCACCAGCACCTTGTTATCGTGTGCGAACTTAATGATCTCCTCGATGTTCTCGCGGGTCAGTAC 
CTGTCCGGTGGGATTGCCCGGATTGATCACGACCAGGGCACGCGGATTGCAGACCTTCTTCGCCTCATCGTAGGACCGTTGCAGCTCCTTCCTGT 
CCAGGCTCCAACCGGTCTCCTCCTCCAGATAGTAATCCACCTTGGTCATGCCGTATTCGGAGATGGTGGCCGAGTACAGTGGGTACTGCGGAATG 
GGCACCATGACACCAGGCGCCTTGCATCCCACCTCGGCGCTGAAAAAAGAAACGTAAAGGTAAGCTTCAAATGTATTTAGAATTCCTTAGTAAAT 
GCTCAGTAGTATTTCATAAACAAAGTGTTTGAATAATACTACTGAGCAATAAACAATAATAATATTTATCTATCTGGCTGCCACATAAGTTGTCA 
ACCCAAAAAATGGCACAATTTTGGGCCATATCTCTGGCAAATCTCTGGTTTTAACTGATAAGTCTCGACAGAAGAAGCGCCACATCATGGACAAT 
AAAAATGTTGGCCCTGGCGACATACTTGATCATGGAGAGAATGCTCTTGATGCCGGGAGAGGCACCGCCGGTTAGATAGATGTCCTGCCAATTGG 
AGGCGATGCCGCCATCCCTTTTCTCGATGTACTGAGCAACCTGGCGACGCACCACCTCCAGACCGGCGGAGTCGGTGTACGAGCCCACCGATTGA 
CCCTGGCAGCCGTTCAAAATGGCACAGGCGCGCTTCTTGACGTCCTCGGGATAATCGGGTGAATCCAGCAGACGTGTCTCGAAGGTCAGCGCCAA 
C AGC TGAT AAGT AAAAT AT AC AAT AC AT T T AAT G AAAT CAT AT C AT AAAT G AAGGGGCC AAAAC T C ACCTGT CGC AGG AAGG T C AAGGGCT GC T G 
GCCCATCGCATGGCAATCACCGATGTTGGCACGGATCACCTGGTCGAATGGCTTCTTGACACCCTGCAATCAAATTCAATCGGAACGGAATTACA 
AATTAGCCATATAAAATAACTCATTGTTTGCTCGCTGGCCAAATGAAAGTGTAATCGCTGCAGGCTGGCGGATCAAATGGGCATTTAATTTGATT 
TGTGGCTTTCGCCATGCAAATTACACTCAAGTGGATTGCCAAGCTAAATTGAAGACCAGCCAGAGTACAGAGTAATCGATTAAGTATCAAATTCA 
ATCAAATATTTATCAGTAATGAGCCATTGTGGATTCAGCAAATGGTATTCTCTTTCAAATAGACGCCTGCTAGAAATAACTATCAAATGACCATC 
ATTATAGTTACTTAACAATGTGAACTAGTTTACATTCAGTAGCAAATCAATGGTGCAATCTGTTACGATCTTTATAATCCAGCACACTTTGGGGA 
ATCCCAAATTAAAGTTTCTAAACGAATTAAAGTCATTGTAGACACAGGCAACCAGTAGTTTAGTTTCCCGCACGAATAGAGGTCACAATAATCGG 
CTGTGGAAATTGTCAGCTGAACAAAACACACAATTTGCAGGGCACATGCTAGTTGCACTCGTTCATGGAAGACAACATCTCATCGTCTCGGGTTC 
CATTTAACCACGCTCAAATGGCCAATTTACCACATAATGTCACGGAGGTTCGAGGAGCTGCGATGATATTGTGTGTTTTTTTTTTTTTTTTGAGA 
GGCGAAACAATGCAAGTTGCGCCATGTAAGCGAACTGCAGCGCTGCCATTGACGTCACTCGGCAGTGGATGTGTTTTGCCTGACAATGTGATAAT 
TGGAAAATTAAGGAATTCTGGGCTGTGGAAAAATCATCCAGTGCTCGAGTTTGGTTACTAAGAGTATAAAATCGTCAAGTTCTCACCGTGCAAAC 
GTGTGTCACCTTATCGAATTCAGTTAATTATTTATTATTTATTTTTTTTTTTTTTAATTTCGAAAATTACTGATAAAAGTGTTGCCGAGTCGCGG 
CAACTGAACTGGCTTGTGTTTGTAGTTCGTGTGGAAATATGTATAGAAATGGGTCCTCTATCGCATCGCAAATCGAACAGTGCTGGTGAAAAAAA 
ATGATATGCCAAAGTAGTTTCGAATATTTCAAAACCACAAAACCGCAGAGATAGTGTTAAGAAATCAATTAATAGCCCATCTAAACTCAGTTAGA 
AATTGCTAGTTAAGTTTTTAGCATCGTTTTGGAAATGGCTACTAATTAACTGACCAGCGCTGTTCGCACCTAAAGAATATTGCTTGCATATTCCT 
TTCTACCCTTCACTTCGCACTTGAAATCGTTGTAAACAAACTTTTTGTGCTAAGTGGCAATATAATTGATAACTACATATATATGCACAACATAT 
ATACTCGTGCAGGCAGGTAAGAGTCGTATAATCATATAGGCGCTGTATAAAAGGCGTTAGCAACTCAAACATTATCGCCAATACGAATCGAAGGA 
AAGT AT AA T AA T A TGTT GGC AAAT GT T AATGC C AAAAGG AAGT C AAAGT AAAC AG AG GG AAAT GGG C AT GAT T AGG AAT TAG AAAT GATT AT T AC 
ACGGGCACAGAATTGTTAATAATCCTATATCGTAGTGAACAGGACGAAGTGAAGTACATAAATGCCAGTGATAAGAATGCCTGCTGCTGACAGAT 
GATGAGTTTCTTTTGGACCATTGCCAAATCCGTATGTTGAATCTGTGGGCAAGTGGAGGAATTTGATAAAACGCAGAATGAACAGCACAGCAGAT 
TCAGTCGTGCGGATAAGGAAATCAAATCGCAGCGGGAAAACTGGTGGGGGAAATCAACTGGCAGCGGCATTAAAAACGCAAACACAGAATGTGAA 
AAGTCAGCGGGGCTCACAATGGATTTAACTATATGTATATACATATGTATATCTTTAAATATTCATCACATATTTGTGTACATATGTACATATAG 
TATCGTTGCGGGTCACAAAAGTTCGAAGAGAATTTTGCAAAACAAATGAAGGAAAAAAAAAATGGAAAGCAAGAAGTGCAGAGAGAAAGCATCGT 
GTGTATACAACATGTTTTCCCCACAGACTCATGCGCATGTTCGGCCTTTGTTATGATTTTTCACTTTCCCAGTGATAAGAAAAGGTCCACGGAAT 
CAGAATCCGAATCAGAATCACGCCCACGCTCGCCCACAATTTCCGCACTTATGTAAGCCAAGTTGAAGGTCGTTCAGTGAACCCGCATCGCATCG 
CGTCGTTCGCTCTTCATCTAAATACTTTATTTAATTGATCCAAGAAGCACAATTTAGAATAATTACAAACATTAACTGGCAATTAAGAGCATGAC 
TCGTTATGGAAAACGGCGAAGAAACGCCGTTTTTTTTGTTTTTTTTTTTTAGATGAGAAAACCGGTTAATTTGGGCCAACTGCCTGTGGAAAAAT 
CATTCACATGCCGAGCAGTACTTAAATTAATTAGTTTATTGACTTGTTGTTTGCTCTAACCCAAAAAGTGTAATTTCCCCTTAGCCCAAAGTTCA 
CTTATAAAGATATATATTCCGTTTTAGATAAGTCTCGATATTTTTATAGCGGTCTGATTACAATTTCAAGTCTTATCTGAACTTTTGGATTACTT 
CTGATAGCTTTATTGTTGCCCAAAAATTGTTTGTGTTTAGTTGAAGTAAATAAAATTCCGAGCTCTATTTTTGGCAAATTTTAATGGGTTTAGTT 
CAAGTGGTTCAAACACTACTCTAAATTCTTGATGTGTAATAAAAAACAAACAATTGTTTTGATCAACTTTGTGCTCTTTGGTAAACATTTTTCGC 
TCAAAACTGAAATTAAACATATGGCATTTTACTTTCGTAAGCTTTTTATGCTCGTGGAATAAGAAATGGGTTTGGAAACTGGTTTTTTTTTTATT 
CGACAAGCTTATTGTTTTTGCAAAAATGTGTTTTTTGCAAAACAAAACCTTATCGCATTGAATAAACAAAAGCTAAAGTTTTCTATGTTTGTTTG 
TTGAAACTTTATACATTCGATAGATTTTTTATTGGATTGCTAGCTGAATTGGTACTCCCCTGGCTGGTTCTTATCACACTTCCATTCTTATCAAT 
CCAAATTACTTCATTTCGTGGGCGTATTTTCCAACAGGTGTCCTTGAGAAAGGGGTGACCACGATTGCCAGCCGAAGGTGAATGATTCAGCGCTG 
CGTGGCCGCTCCATTCAAGCGCGAATAAAAAAATGAAAAGGGCGGCAAAAAACAAACACAGACGCGCCTCACACTTGATACATACATACATACAT 
AAACGTGCATGTGTATATAAGTAGATATACATAGAGACATAGCGTGCTTAGATATTAAAATGCGGGCTTTGAATTGCCATTAAATTGCCATTGAA 
TAGAGGGAAGGGTAGGGGGGTGTGTGCAAAACACTAGTTAATGGCGTCGCACTTTGCCGCTTTGTTTGCCCGATTACTTGTGGATCAATTTCCCA 
TGGCATTAACTAACACTCACCTTTTCCAGTTCCTTCTCGATTTCCCCAGCACGAATCACCAGAGGACCGCGAACGGCATATTCCATGGCAATAAA 
GTTGGGATTTATGTTATCCAGTGTGAGCGCTTTCGACGACGACGGCATTTTGTGGCTGGTTGAAAAGGAACTGAGAAAAAAAAAGAAGGGGATGG 
ATATTCACTTCAGCAATCAACAATCACTTCTTGTTCGATATATAACGTGTTTTAAATGTGTTTTAAAATATATTGTTTTTGTTATGCGGCTTTTT 
CAGCGAACGAAACTAACCGGCGGACGTCTAACGGTTGAATGAAATCGCCGCTTGTTCCATGGCGCTGCGACGTCGCTGCTGCCGTCACCGTTGCT 
GTTGCCGTTGATTTCGCCGCACGTCGATTGGCATTGTTGCTGTTGTTGTGCAGGAACGATTTCCAGCGGCGAATAACGTACTGTTGCTGCTGGCG 
CTGGTGCTGCTGCAATGCGGCCGAAACTGTGGCGGCAGCAGGTTTAATTAGCATTCGACTCATTATTTCTGCAACGAGAAGAGCGTCTCAGTAAG 
TAAATTACTCAAAATACAAAAAAAAGGAAAGAGACAGACCGCGCTTTAATCGCAAGAATTTAGCACCATGAGAACAGCGGAAAGAAGAAGAAGCA 
GCGAAATGAAGGGTGTGTGCGTGTATAAGAGTCCGGCTCACAATTATAAGAGCGGCGAATTGAAAGGCTCAGCTGTTATCACTACACCGAAAGCA 
ATTAGTTTCGAAGCAGGGAATCATTTAAATAACTTTGTAGCTAGGAAAACTTAAGCAACTATAGATATTTCGATGTGAAATATATATCATATTAT 
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ATTAAGCATTTTTATATGGTATTGTTAGCACGCGCTCAATATAATATTCAAGTACCTAAATACATTTTCTTCACTGTGCATTTTTGTATCTTATC 
AAAGACAAGGCGTGGTATCATTTACTTTTTCAGAGCTTATCAATGTGATTGCTTGTGGTTCGATACTTTATTGCCTTTCTGTTTATCAAGAAGCT 
AAAGTGTAAATATTTACGACCTTTGGAAACTTTGCAATTAAATTCGCCGTGCATTTATCCTAGACTCAAACCCAATTGCACTCGCCGCAAGGCGC 
GCATTTCAAAACTGCAATCGAGCATTTCTTATCAGAGTTTTGGGGCTTTGCACAAGCAACTGCAAAGCTGAAAAATATATATTTTATTTGAGGAG 
G AAT AT AT ATT T T T GG AAAAT AT AAAAAC G GT T AC C GG CT G AAT AAAAT G AATGGC C AT AAAG C CT AAT C TC AAG AAC A T TTATTGCTTGACCAT 
CCAGAAAATTTAATTAAAAATGGGACCGTTTAGAATCTACTTGTGTGTGCTGCTATATATTTCACTATATTTCTTATCTTGGTGGAACTTTTTTT 
TTAGAGGCAGTTAACGGATTTGTGGGCCATTTACCCAGGCATTATGTATACATATGTATGATTTAATAGCTATATTTCTCCATTAAATCCACTGG 
CAGGGCATCCATTGGTGGGGGCCACTATCAGTTGAGCGTACTTTTGACTTTGAGCCAAATCGGCGCAGCTTATCAGTGACGTCCCATTCGCAGCT 
TACGATTCCCTGTCTTTCAATTGAATATATGTATATAGCTTTTCTGATTTCTGCACTTTGAAACGCCAATTGCGTAATGCAAAAAATGAGCAAAC 
GAACAAAGAATCTTTTAGAAACATGACAATGACCAAAGATCGACATTACCTCCAAAATACTATACTTTTCGCACTCAAAAATCAGCATCAATCAG 
CCGCAGATTGTTTGCTTTCTTTTTAGCGACAGCGAATTTATTTGTTTTTAAATTAAAAATGTACTAAATGCCGGTCCCGAAAAATACACAAATGT 
ATGCTATTTTATATACGCACATATGTATCTTTGGAATATATGAATGTATGTGTATATATATAGGTATATTGTTTAGACGAGACAAACTGAAACTT 
GTTGGCAATAACTTTTTGGAAAACCGGTTGCGGTTTTGATGGCATTGATATAGACGAAAGGGTTAAGAATTAGGGGAAAGGGGTAAGAATTTGGG 
AATGAGAAAACATCGATACAGTGATCTCTGCTTAATTAAGACAAATTCAATACGATGCCGATATATAGATATAGATAGATATAAAATAATATATC 
CAATAGAATTTTTATATGATAACCATCTAAAGCAGCTGTGCATAGAACTTTTGCGTATTAATTTAAGAAACATTACCTCCAAATATTTATTTTAA 
ATAAATTTTTCAAATTTTAACATTTTTTTTTTTTTGAGACAACTTTTGTTAGTCACTGTTTATCTGCGATATATATATATGCAGCGTATCGCTTT 
TGTTTGTGATTGTTTTGACAACAAAACTGGCTTATGCAACGAACTGTTAAAAAAAAATCTGAAACAGAAAATTATGTGAACTGGCTGCACTTTGC 
TTCAAAGACCCCGCTCCCCCCTGCACTTTCACCCCCCGCATTGGGGGGATTATCAGACAAACCAACTGGATAAGATAGTAACGATAAGGAATCCA 
ATTAGAACACGGCCATTCGGATTTACTAAAATCACAACTTTTGCCAAACGTTAGTAAATTATATTAAATTATATTGTAACTCCGATTGACACGTA 
GAGCGCAACGAACAACAGCAACACAAAAAGAAGCAGAAATAACGGGTAAAGGGGGTGTTTTGATGTGCTAATTGGGTTGCTAAACAGCTGTTGAC 
GCCGCACAAATTGAGCACAAAAAAAAAAACAATAACACGGCACGCAATTCAATTAGGGAAACGCACGCAGAACGATAAACAAAATCAGGGTACTA 
TTGTGTATTAGATTTTTTTTTGGGCGCACAAATCACTTTCACTGTCACTTCCTCGCTGCTCCCAATTAGAATTTCTCGGCCACTGTGTTTATATA 
TATGTACATATATGCACATGCATATATACATATATACGTCGGCAGTTCAAGAGCAAAAAATCGATTTTTCGCTGCACTTTTTAGTCGTTTCACCT 
TGATGGGCGCAAGTGGGCGTGGCGATTATCTAACGGCGTTTGTTGTGATCCCAAGAACTTGCATAAACTTATTAATAAACGCGATATGTCTGTAT 
ATAAAATACCTAATAAAAACGTGTGTGCTTGCTTGGCTGCGTGTGTGTGCGCGTACGTGTGTGTGTGTGTGCGAATATGTGCGGCGTGCGAGCTC 
CAATCAGAATCGTATCAAAAATTCAAAGCTCTTTGGCCGCTAGACGCAATTAAGGAACTGACAACAAGAACTGAAACCGCACCGAGATGGAAAAT 
GAAGTGCTGGCAGTATTTACCGACTATATGTTTTTCTTGCGGGCAAAAGTCACGATGAAGCGGGCAGCGATTAAGTTTTAAAAATATCACGTCTG 
CAGCCCTCGCAGCCAAACTTTCGATAGACGGCGATAGATCGATAGACCAGCGATTTGCACCAACACTGGAAAGCTCAGAGCATTCAAAAACAAAC 
TAATGTGTTTTATAAATAGCTCTTTTTAAATATCGATTAATTGCGATCTTAACTAACTTTCGTTTATTTCTATTAATTATGAACTTACTTCTGTT 
TTTAAACATCTTTGCTCTTTGCTTGTTTTTCGTGTAAGGCTTTAAATTCACTTAAACAATTTTGCTGGCTTTGTGCGCATCTTCATATATCACTG 
AAATTGATTGTTATAGGAGTGTCCCATTGAATTAGTTTTGTGTCGATTCAATCCTGGCCCGAAAATGTATGAACAAGCTATGCCCTACATTTTAC 
CCATAGTCTACTTAGGTTTGTTTATATAGTATTCTTTTCAAGTTCGTTATGATAGCTCTTAGAAAATAAGATTTCTGTGAATATCTCCTCCGTAT 
AAAAGAATTTTTATCAATAGTTTGGTAAGCTTTTTTTTCTCTTCTACAAAACCTTTTCGCATGTTTTATTTCACTCGAAACTTATTGCTATTCAA 
AAAAATGGGTTTTCTAACGCTAGAATTATGTACAGAGTGCATTTGGAAAAAATGATGGGTAGTAAGCCCTAGAAAGCCATGGCTACGTGCTGATG 
CCCATGATGGATTCGATGGAGAAACTCAGCTTGGGCTTCTTGCCACTGCCGGTGGAGGCAGTGGTGACTGCGGGCGTGGTCATCGCTGGATTTGT 
GGGCGGGATCGTTAGCGGTATCGTCGGCGGCATCGGGTGATAAATGTGCTGCTCCTGATGAACGGGTGCGCAGGCGGCGGCGGCGGCGGCAACGG 
TGGCAAAGTACAAAGCGTTACCGCCCAGCGTCGCCTCCATGACATGGTTCAGTTCCGGATCGGCGAATGCCCGCCGTTTGCGCATGCGCTCCATC 
AATGGCGAGCTGAACGTGGGAAATCGCTGTTTGTATTCTGAAAGAGAACGAACCAAAAGGAATGGTTGAAATTAGTGAGGGTTTTGGTGGGGGCG 
TTATCACTACAAATTGCCGCGATTCGTTATGAATGGGTATTCCAAATTGAAACAATTTATTATTGGGTAATCCGAGACACGCGTCGCCTGACCAG 
ACAGCCTGGCACCGTTTTTTTTTTTTTGTTTTTTTGTTTCAACTTCGGACCGGTTTCGGGATCTCCACTTATTCGGATGGAACCAGATGGAAAAC 
CCAGAAAGAAGTCAATTGCTGTTGCCACATTTGGCTCTTTTTGTGAGTCGGATGGGTTTTTCCGTTTTTGGTCGCGTCTCAGCTTACCGAATGTG 
ACTC 

(SEQ ID NO; 115} 

Exon: 10119.. 9969 
Exon: 6908.. 6668 
Exon: 6530.. 6386 
Exon: 2913. .2824 
Exon: 2758. .2496 
Exon: 2224. .1401 
Exon: 1338.. 1001 

Start ATG: 6508 (Reverse strand: CAT) 
Transcript No. : CT4458 

GAATCGACACAAAACTAATTCAATGGGACACTCCTATAACAATCAATTTCAGTGATATATGAAGATGCGCACAAAGCCAGCAAAATTGTTTAAGT 
GAATTTAAAGCCTTACACGAAAAACAAGCAAAGAGCAAAGATGTTTAAAAACAGAAAAATAATGAGTCGAATGCTAATTAAACCTGCTGCCGCCA 
CAGTTTCGGCCGCATTGCAGCAGCACCAGCGCCAGCAGCAACAGTACGTTATTCGCCGCTGGAAATCGTTCCTGCACAACAACAGCAACAATGCC 
AATCGACGTGCGGCGAAATCAACGGCAACAGCAACGGTGACGGCAGCAGCGACGTCGCAGCGCCATGGAACAAGCGGCGATTTCATTCAACCGTT 
AGACGTCCGCCGTTCCTTTTCAACCAGCCACAAAATGCCGTCGTCGTCGAAAGCGCTCACACTGGATAACATAAATCCCAACTTTATTGCCATGG 
AATATGCCGTTCGCGGTCCTCTGGTGATTCGTGCTGGGGAAATCGAGAAGGAACTGGAAAAGGGTGTCAAGAAGCCATTCGACCAGGTGATCCGT 
GCCAACATCGGTGATTGCCATGCGATGGGCCAGCAGCCCTTGACCTTCCTGCGACAGCTGTTGGCGCTGACCTTCGAGACACGTCTGCTGGATTC 
ACCCGATTATCCCGAGGACGTCAAGAAGCGCGCCTGTGCCATTTTGAACGGCTGCCAGGGTCAATCGGTGGGCTCGTACACCGACTCCGCCGGTC 
TGGAGGTGGTGCGTCGCCAGGTTGCTCAGTACATCGAGAAAAGGGATGGCGGCATCGCCTCCAATTGGCAGGACATCTATCTAACCGGCGGTGCC 
TCTCCCGGCATCAAGAGCATTCTCTCCATGATCAACGCCGAGGTGGGATGCAAGGCGCCTGGTGTCATGGTGCCCATTCCGCAGTACCCACTGTA 
CTCGGCCACCATCTCCGAATACGGCATGACCAAGGTGGATTACTATCTGGAGGAGGAGACCGGTTGGAGCCTGGACAGGAAGGAGCTGCAACGGT 
CCTACGATGAGGCGAAGAAGGTCTGCAATCCGCGTGCCCTGGTCGTGATCAATCCGGGCAATCCCACCGGACAGGTACTGACCCGCGAGAACATC 
GAGGAGATCATTAAGTTCGCACACGATAACAAGGTGCTGGTGCTGGCCGATGAGGTGTACCAGGACAATGTCTACGACAAGAACTCCAAGTTCTG 
GTCGTTCAAGAAGGTGGCCTACGAAATGGGCGACCCCTATCGTAATCTGGAAATGGTCAGTTTCCTGTCCACCTCGAAGGGCTATCTGGGCGAGT 
GCGGCATTCGCGGCGGTTACATGGAGGTTCTCAATCTTGATCCCAAGGTCAAGGCCATGCTGACCAAGTCGATAACGGCGGCGCTTTGCAGCACC 
ACCGCTGGCCAGGTGGCCGTGAGTGCCCTGGTCAATCCACCGCAGCCCGGAGAGCCATCATACGATCTGTACAAGAAGGAGCGCGATGGCATTCT 
GGCCGCTCTAAAAGAACGGGCCGAGCTCGTCCACAAGGCACTGAACAGCTTCGAAGGCTACAAGGTTAACCCCGTACAGGGCGCCATGTACGTCT 
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TTCCACAGATCGAGATCCCGCCCAAGGCGATCGAGGCGGCCAAGGCCAAGGGCATGGCCCCCGATGTTTTCTACGCATTTGAGCTGCTCGAGACG 
AGCGGCATTTGCATTGTTCCTGGCAGTGGATTTGGTCAGAAGCCCGGCACGTGGCATTTCCGAAGCACGATCCTCCCGCAAACGGACAAGCTGAA 
GCTGATGATGGAGAAGTTCCGTGTGTTCCACGCCGAGTTCATGAAGAAGTACAAGTAGATCAGCTGCCAACTTCCCCGTCCTCGATCCCTGTGTT 
AAGTGTCAATCCTCCGGCTTAGGATTAAAAATTTAAATTACCCTAATTGTTTTTTTTTTTTCCTCTTTGTTACTGTGTCATAGCTGAATGCTTCT 
AAGCTCTTCTTTATATGCATATATAGATATATATGAGAAATATATAAAAACATTTTT 
(SEQ ID NO: 116) 

Start ATG: 415 {Reverse strand: CAT) 

MPSSSKALTLDNINPNFIAMEYAVRGPLVIRAGEIEKELEKGVKKPFDQVIRANIGDCHAMGQQPLTFLRQLLALTFETRLLDSPDYPEDVKKRA 

CAILNGCQGQSVGSYTDSAGLEVVRRQVAQYIEKRDGGIASNWQDIYLTGGASPGIKSILSMINAEVGCKAPGVMVPIPQYPLYSATISEYGMTK 

VDYYLEEETGWSLDRKELQRSYDEAKKVCWPRALVVINPGNPTGQVLTRENIEEIIKFAHDNKVLVLADEVYQDNVYDKNSKFWSFKKVAYEMGD 

PYRNLEMVSFLSTSKGYLGECGIRGGYMEVLNLDPKVKAMLTKSITAALCSTTAGQVAVSALVNPPQPGEPSYDLYKKERDGILAALKERAELVH 

KALNSFEGYKVNPVQGAMYVFPQIEIPPKAIEAAKAKGMAPDVFYAFELLETSGICIVPGSGFGQKPGTWHFRSTILPQTDKLKLMMEKFRVFHA 

EFMKKYK* 

(SEQ ID NO: 117) 

Name: putative aminotransferase 
Classification: enzyme 

Celera Sequence No. : 142000013384690 

GAATCTCCGGTGTTCGCGACACTTTGCCGATTCAAATTTGAAAAAGTCGTTTTGCTGTTTAACATAGACGCATCCTTCGTTGTTCACCAGCTTTG 
AATTGTGCGGCGAGTACTCAATGGACTTGTTGTTCGAAACGTATCTGTAGGTGGGCTTATCGTGTGCATTCAATCCAAACCAGTACTCGTGGTCG 
TCTTGGTTTATTTTGCTTTTCATCTGGAATAGGGTAACTTCGGTGTTTAGATCAGCCAAGCACATGTGGAGGCTTTTGCAGATGGTCAGGGCACT 
AAACCAAGAAGCCTATGGTATTGTAATTTTGAAATTTCCAATCTCAATGACTTAAGTTAAGTCTACTCACTGTTTTTTCACTGAAATACAGGCAT 
ATGCCCTTACCGTTTGCCTCATCAGCACGCAAAAGCAGCAGCAGCCAAATGGGAAGACAGTAAAGCATGCGTAGTATTTGACTCATTTCTTTCGG 
TTTGTTGTATTCACCAGTACTGATGGATTTCGAAAGTTGCTCGTTTTGGTATAGCTCACTAATGCCATTATACCATGGTGCAAACAATTAGGGTT 
TGTGGTAAATTACCTTTCCATATTATTTGATATACCTTAACCTAAGCAGGTACCACAATTTTTTGCTAAATTATAATTTATATATTAATTCCCCA 
TTTTTTTTTTTTGCAAAATTTCAAATCTAACTGATTTTTTTCTTGACAAAAGTTGGCAGCACCGTATATGTCTAACCAGTGGTTCGGTTGTAGTA 
CCCCGACTATCGCTCTCGAAATAGAGCGCCACCTAGCCGATAAGAGTGCAACCACTTTACATCCACCAAAGGCATTGCCAACTGTAAATTTTTAG 
ACACTTGTGGACTATATTTCTTACGAGTGGCGCTCTTCTCTTAGTTTTTAGTTTGTGTGAAACGTAAACGTAATTTAGATCATTGAACGTTACGT 
TGAACGTACTGTGACTACGACTCCACACGAGTTTATACGTTTCTGTGCTCTCGACGCAGTCGGACGTTTAAAAATAAACGCGCTCCACCAAAAAT 
ACCTAGCCCGCTGGCCGCAAACGCAAATAAATCTGAACAAAAATTCGAATTCCATCCGGATCAAACAAACTTACAAACCCGAAAATTAGTTCTTC 
GACTCCAACGGCCGAGAAAACCGCAACAGTTTCAAAATGTTCAGCTCGATTTCAGGTAAAAAAGAAAAGCCCAGCCGCCAATCGAAGTTTATGTG 
AAAGTTTGGAAATTAGAAATCTTAGGTCTAAGCATGTGTACATATGTATGTATATATATGTACATATGATATACGAATGATATATGTATGTGTGC 
AGTGAGTGGGTTACCGCACTGGAGGAAGGAAGTCACTCGACTCCAACTACAACTCCAACTGCACTTCCAAGTTACATCATCATCACCAGAGCAGC 
GGTGGAGTCGGACAGTTGGAGCAGCTGGTGACCACGCGGGATTTGGATCCACGGTCGGGTGGGGTTGGCCAACCGGGCGTGTTCGATCCGATACG 
GGTCGCAGTAGCTAAAACCTAATTATCCGCTACCGCGACTCTGGTGATGGCAATGATGGTGATGACGACGATGTGGATGATACGTAAGCCAATTA 
TGGCGCAGTTATATGTAGCTCCAAATGGACTACAAGTTGCCGTTCTCGCTGGCACATCCCACACCCAAAACGATGATACGTATATAATTATAGCT 
GCCCAGATACACACTACACACAACACGTATCCGAGGCATCCATATATACTTTGTATCTTCGAGCGGTTACGCTGCTGCTCTTTCCTTGGGGCTCT 
GGCACACTTATATGTTCCTGATGTTCGGGGAAGTGGGTGCAAAGAACCAGAGTTCCATCAGCCAGCTCCGCTTTCGATCTCTGAAATCCTCCGGC 
AGTAGATGCCACTTGAGAACCGCATGGTCAAGCATTGGCACTTGTGAGTTAGGCACTTTTGTTCTATTATACATAGATTGTATTCATACCAATTA 
ATATTATCTGTTCCCACTCAATGCCAAGCAGATGCATCGAAATGTAAGTAACCCCAAATGGTGAACTTTCCACTCAATTACCTTGATAATGTTAC 
TCAGTATTAGACATTCCTCACCGACTGCGTAGTTCAACATTTAGTAGCTGCATATTGTTCCTATTCAGCACTTTGATCTTATATGTTTAGCATGT 
GTTTTGGGTCTATCAGCTAAAAACATTCCTTTGGAAACCAATGATTAAGAGCTTTTAAGTACATTCATTTTTTGATTTCTGCTTGAACAGATTGT 
TTAAAAAACACTATTCTAGAACAAGTGGTTATCACTGTGGGGATTTTTAACAGCATGTTTCTGAAATATGAAAATATAATAGTTTTAAATTTTAA 
ATTTATAATAAAGTGACTTCTTACAGGAATCAGCCATGTTAAGAGATACCATTATATATCATTCTTTGGGATCTTTGGATTATTGTTTTTTGTAA 
CTTGAATGGAATGAAACCCAGTTGATCTTCATGTTGTTAAAACTCAAATTTATTTGGTGAAAGCTCAAATATTCCTCGGTTCCCTAAATGTCCAT 
TGGTCCGCCCGTTTGGTGGTGGGTCCGTGCGTTCAGGCCGCCGGAGCTGGTGTTGCTTCGCCCTCGGGAGCTGCTGGTGCAGGAGTGGCGGCCTC 
GGCGGCTGGAGCTCCTTCCGCTGGTGGAGCTGCCGCCGGAGTGGCTCCTTCCGCTGCCGCGGCATCCGGTGCGGGCGTGGCTGCTCCAGTTGCGT 
CTGCCGCCGCGGGTGGACCCGTGCCGCCTGGCGGCACCGTTTGCTCCTGCTCGGCAATGGGTTCCGCGGCGTTTTCCATGCCATTCTCGAACACG 
GGCTCCGTGGTGCCACCACCATCACCTGGAGTCGGCGCGCCCTCCGCTTCCTTCTCCTTCGAGACCCCGATTTCCGTTGAATTCGAATTGTCAGC 
TGGCCGTGAATCGCTCTCCACACTTTGCTCTTTGATCCCCGTCTCTTGCTTATCCTTCTGGCCCGCGTCGTGGTGGAAGTGATGCAAGTGCTCGT 
GGCTGTGTTTCTGGCCATCCCCACTGTCCTCGGCGGGAACGGCGCACTTCATCTGGTTCTGTTCGCAGATGAAGTACTTCTTCTCCTGGCAGTTC 
ACGTCCAGGACGACGGTGACATTGGGCCTGATCCTGATCTCCAGGCAGTCGTCCAGGTTGGATCGTTGAGTCGGCTCCACAATATTGGAGTCGCC 
CATGTAGCGCACCAGTTTACCGCTGCTGATGTACTTGAAGCGCCCCTCCGACTGCAGATCGTTGCCACCGAACCAGAAGTCATCGAAGCCCACCT 
GTGACGTCACGTAGTGTACCACCGCCTTGAAGTCCTCCATCGTGGACACGTCGGCCAGGTTGAGGCCCTTGCGCAGGCAGTTGTTCTGGGCCCCG 
AACCAGTTGATCTGCCAATACATTTCAAGCGATATATAGTTATTTCGGGACTCTATGGATGGGCTTATTCGAATCTATGATAGCTTTATTGGTAC 
TAAAATGGCTGGCCAAGTATATTACAAGTGATGGGATGCTCTCTCACCTTTTTGATGCCCACATAGAAGCACTTTCCGTTTAGCTCCCTGAGATA 
CGGTTTGCCACAGGGACCCTTGTCCTTTTCCTTCTTGCTCTGCTTCTTTCCGCAGCCGGAATGGACCGTCATCAAGGTCAGAATCACAATTGGTA 
GGATCCTCATGGCTGCCAGCAAAACTGAGGCACCTGTCCGGTCAAACTGTAAGCATGCGTGCCGCCCCCTGTTCTATATATACCCTTTCGTGGGC 
GTGGCATCTGCATTATCGAGGAGTGACCAATTTCCATTAGGGAATCGTCGTGTATTGTTCGCACAGAATTTTTATCGACTGCTGCAGTCCATCAA 
ATTGATTAATTCTAATTGGACATGGTGATGCCCAGGGGTTGGTGCATCTGTCGTTGGTTTCTCAATCGCCCACAGTATTGAAACTATGCTGGTAC 
TTGTCAGACTTCCATCTGGGATGAAGAAGTTCCCGAATTCACTGACATCTTAAAAGGGTTCAGTAAATAAATTACTCAATTGCTATCCGATTAAG 
TGGTCAATAGGTGACCCTCCACTTAAAGTTCATCCCCATTATTTTGTATTTTTCAGAGATATAATTTCAATGCTCTGTATACTTATTACATGTAA 
AATTCGAATAATTACATATTCATCATCAGGATGAATTGAACAGAAACTAAACTTTCATATAAGAAACTATAAAGTATCTTTGCATGTCAATAATG 
CCATCTTTAAGAACTTATCTTATACACTGGCCCTTGATTTATTTGAAAGTTAGAAACTTGAACTTCGATTGAAATTTATTTAACTGCTCGTATAT 
TTAATTTCATTCGTGTACCCAAGCCGACCTGCACTTGAATATATATTTTAAAATAACTTGTTATATTATTGCTGTCATTTATTTTTAATCGATAA 
TAGTTGTGGGCTTCAGCTGGGCAAGTTGTGGAGATCAGGCTGCTGGCGGTGCCGGAGTCTCGGCTGCTGCGGCGGCGGGTGCTTCTGCTCCTGGC 
GCGGGAGTGGGTGCGGGCGTGGCTCCGTCGGCGGCAGGAGTGGCGGCACCTTCGGCAGCTGGAGCTGCTCCGGCGGGTGCGGCTCCTTCTGCTGC 
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TGGAGAAACAGCTTCAGCTCCGGCGGCAGGGGTGGCAGTTGCGCCCGGCTCACCAGTTCCGCCAACGGTATTTGTTTCTTCCGCTCCCGGTGTCA 
CCGAGACGCCATTCTGACCAGGTTGATCGTCATCGGGCACGTCGAGCGCCCCGACTGCATCCTCCGAGTCCGGCTCCGGGTGCTCCACTGAGCCA 
CTGGCAATGGGGTCCTGGCGGTCGTGGTCGCCATCGCCATCGCCATCGTCCTCTCCTTCGGCACTTCCGCCAATGTCGTGGTGGAAGTGATGCAA 
GTGCTCGTGACTGTGGTGCTTCGGCTTCTTGCCACTATCCGTGCCCTGGCAGTAGCGTTCCGAGCAGATGAAGTATTGCCGCTCGTGGCAGTTGT 
CTGCCGACACCATGTTGATCTCGGAGCGGATCCTCACCTCCAGGCAGTCGTCGCACTCGGAGTGCTCCAGTGGCAGGACGTTGCTATAGTTGCTG 
TAGTAGCGCACCAACCGACCGTTGCTGATGTACTGGAAGCGACCCTCGTGGTACAGATCGTTGCCCCCGAACCAGAAATCCTCCGTGTTGCCCAG 
GCCACTCAGGAACCCAATGGCTCCGTCGAAGTCCCTTTGGTTACTCAAGTCCGCCAGGGTCAGTCCTTTGCGTAGGCAGTTGTTCAGGGCGCCAA 
ACCAGTTCATCTGAATGGGTTTCATATTATTTCATAAGCCCATGCGACAGATCAACTCTTACCTTTTTCACCGAAAAGTAGTAGCACTTGCCGTT 
GATCCTGCGCAAATAGCGACGCGGGCAGGGGGGTCCTGATCCCAGGAGCTCGCCCCGCGGCAGGATGCTCAGCAGGGCCACGAGTGCAGTCAGGA 
GATGCTTCATTTCGCTCGCTTCCGCCGGTGCACAGTGAGCCGGCTCAAATCGCCTCCTGTACTTATATCCTCCGCTGGCCAGCAGTGCGCCCAAT 
CGATTTGGAGCTCGCTGGAGCTGGAGCGGCATCTTCGGCAGCATGGCCAATTCAATTACAAGTCCATGCCGCCGGAATGCCTACGCCAATGTCGT 
GGGCTAATTTGCTCACCCGCCAGTGGGTCAGGTTTAGGGTCATTCAGTTTAATTGGCTATTAGATACACTCGAAGAAAACGAGCCGGAAAAGTCG 
CATGGGGTGCTGTAATTGCCAGTAATGTTAACCCTCTAAATTCCTTGATGGCAATTCATCCGCCTAATAATGGGCATATTGCGAGTAACTATTAT 
ATCATATATAGATGTTATAAAATTGTAATACAAGTTTAATGCAAATAAATTAAAAGATTATATATATGGTTTTGAAATAAGCAATAATTCTTAGC 
AGTATTCTAACTCATTCGAAGGTAGCAGAGCTTTCTTCTTCTGGAGTATGATGATGATAGATGGATATAAAATGCTTCAGTTCCTTTGACCCTGA 
TCTGTCCTCCCATTTAGTTTGTATCTGCCGAAGTGAGTTAATTGCTGGCTATGAACTCGCTGGTAATCTTCAACAGCTCAAAAACGCTAAGAATA 
TTAAGTTCTGTGATTCGAGCAACTGGAGTGACTTCATAGTCATTAGCATAACCCGCCCGAAGGGCTAGATGTCTGACCCGGCCTTGTTCTTTCCG 
ATTATTGATTTACCTGGCAGGGTCGTGACATCAGCCATTGGATTGTCCGTGGCCAGGACACGAATCGCCCGCCTTTCCTTCTGTTGCCAGCTGAC 
TCAGACGACCTTCGGTCGGATGACGATTGCGCCGGGTGGCCACAACAATTCGTGGGGCGTGTGAGTACCACGCCACCCCTTTCTGCCACGCGCGC 
TACTCTCACATATATAATTCTTTCGTATATTGTATCTCGATTGTGCAGTTGTATCTCACATGACATCATCAGCAATGGCAACAACAACGGCGGCC 
CCAGTAGCTCCATCGATTCATCACGGATTGTAGCTCTTTTTTTTAAGGCATCCAGAACTGGTGTGGAACGTGAGTCGGCAGACCTCAAATCGCAG 
TAACTCATAGATATTCGTTCCATGCCCGTATGTGTGTCCGTTCCGAGCTGGGTTCCCATGTAGCCCAAGCTACAGCTACAGCTACAGGTTTTCCT 
CTATCCGAGGCTTCTGCGCATTATTTCAGCTGCGAGTGCGACATCGACGAAAACAAAATCAGAGAGCAAAATCAAAATCCAAAACTAAAACCAAA 
CCAAAACCAAAACAAAAACGACGCCAGAGGCTGTGAGAGAATCTGGGCTTATTGTGTTACCTTTTGGCCATGCTTCTAGCCGGTTTTCAGTGTTG 
ACCACAAGTTGCCACAGTTTTGAATAAAGACTGAAGAACAAAATGCGTCGTATTACGCAAATGCCTTAAATAGGGCCGCTGGCCTTAGGAATTCC 
GCGTGTGTAAGCCArTAAGATATATATGTACATATATCCAAAGGATTTGGCGGGGAATATTAAACACTGCGGTTGCCTTATACGTAAGGCCACCT 
TTTTTAAGTTTAATAACTTATCATTCTACGAATGTATTTTAAGTATCTTTAAATACGATTTTAATTTGGAAAACAGCACAATATACTTAGCATGC 
ATCTCTATACATATAGTTACATATTGTACAGTGTAGATTGCACGATTCCTTCCACTTAAATTGCTATTCTAGATTCTAGATTGTTAAGCAAGTCC 
AATGCCACATACATACATTCCTTAGCCATTAGCCATTAGCATTTGAAATGTCTGAATTCCCTTTGATTCACCCTGTTTTTTTTGAGCAGTAAAAG 
TAATTTTCAACACCTCGGCCGAGCCCACACATTCCATAATCTCGATACCCAGCAAGCGTAGAATAGAAACTATTATTTTCGCACAATTTTCGGCT 
CCTGCGGTGGCGATTGGCTCATTAAATGCGAAATGCGAAATGCGAAATTGGCCGAAACTAAATTATTTGGAAATGAGCCGAGTGTATGGGGCCAT 
TAAAGTATCTCTGCGGAGAGAACCGAATCAGAATAAACACATTCCCGATCACAATCAGAAAGCGTGAATAGACCGCACTAGAAACTTGGGGGCGA 
TTTCATTCGCCGGCTGCCCATTATGACATCACATACGCCGAATAATTTGAATACTCGAGCTTTGTTTGCCTCATTCGCATACAAGCATACTGTAT 
ATTACAAAGAAATTCCTCGGCTATTTCCGCCCGTTTCCGTTTCGCCCAACCGATTCGCACCCGGTTCCGAGTTGAGCGGCACAATGTTTCTATTT 
GAGTGGACGCACACGTCACTCCCAGTGAATGCAGATTCGAGGGGACAGCAAAAGCTGGCGCTCGTTAATTTCTATTAAATTGCGAATATAAGTCT 
CAAGTCTCGAATTTTAAGGCTGACGCCATGGGGCTGGCGATCACGATTTCAACAGCCCCTAGTTGCGCAGTTTCTATTAATAGGAAAGTGTCCCA 
GTCTAATCGATCCGATTTAGTCATGTAAATAGTTCATGCTCCAGCTGCTTTGTTTATGTTTCGGCAATTACCTGTCTGGAATTTGGAATCTGAAA 
TCTCAAATCTCGCATCTTCGATCTGATCGGCGCTCCACCCTGCTGATTGCAATCAGGCTATAAATTATAATTAGCGCCAGGCCAGAGCTTAGTTA 
GCATTCAGAGGGCCAGTCAGCATGTTTCCTAGTCGCCAAAACCAAAGTAATCCCATCCAGCCAGCACAGTCAGGATCTAAAATGCACTCCTCATT 
CAATCCCCAGCTTCCTTGAAGAACTTCGGTGACAAGACCGCCAACCTCTTCAGCAAGAAGAAGGATGAGGCCGAGAAGCTGGCCAACGAGAAGGC 
TGCCGAGGCCCAGAAGCTAGCCGAGGAGCAGGCCAAGAAGGTGGGCCAGTCCGTACAGCAGACCAAGGGCGAGGCCGAGCAGTTGGCCGCCAGCA 
CGGGTGAGT AACG AAC ATCC AAAAAAT AATGAAT GT AAC AC AT AAAAGATGTCTT AGAGAATTAATTGGAGTTATTAAT TCTAAAGAC T TC CAAA 
AACTCTTGCAACGAAGAATTGAATGAGTCCCGTTTGAGATAACTTTCAAAATCAACTTTCAGTTCTTGCTAAAAATGTTTTCTCATTAGCATTCC 
ATTTCCAATTTATCTAAAACGCAAATCTTTTAAGCATATCAAACACAAGAGAACTTAATAAATGGAATTCTTTAGTGACGCGTTGACGTAAAACA 
TGTTTCCAAGAATAAGATTCGCATTTTTGAAAACATTTCTTCATAAATATTGAAATATTTCTATAATTGTGAAAAGCAATTTAATGTGCTAGAGA 
TTTATAACTAAACAGAACATCAATGCTATGTTTTATGCTGGAGCAAAGCCCGACCAATTCAATAAATATCTTTAGGAGCACTTTTAAAGATATTT 
TGTAAGAATTTTTCCCACTAATATCTCTCTTTATCATACAGCCAAGGAAGCCGCTGCTCTGGCCACCGCGGAGGCTCAGAAGGCTGGACAGGCGA 
TTGACCAGGGTGTGAACCGTGCCGCCGGAGCCGTCAACCAAGGCAAGCAGGCGGTGGACAACACGGTGGCCCAGGCGGCGGCCGTGGCCCAGAAC 
TCCAAGCAGGTGGCGGCCAATGTGGCGGAGGCCTCCCAGCGAGCGGCCGCCAATGCGGTGGACCAGACCAAGAAGGCGGCCAACGACGCCATCAA 
CAAGAGCGTGAAGGCCGCCGAGAACGTGGCGGACCAGAAGCTGAAGCAGGCGGAGGGCGCCATCGATGGGGCACTGAAGCAGACCAGTCAAACGG 

TGGACCAGAAGCTGCAGGAGGCCAACCAGTATGTCGACCAGAAGCGCCAGTCCGTCGAGAAGACCGTCCAGGATGCGGCCGGACAGGCCCAGGAG 
TCTGCCGGACAGCAGGCCAACGCCCTGTTGGGC^lAGCTGCATCTGGGTCAGAAGTAGAGGCGTCTCCCTTCGCGGTGCCACCTAAAATATCTTGA 
TGT G AAAAT GC T T T AGAT AC AAAAAC AAAAAAAAT AG AAAGG AAT T T GC T 

(SEQ ID NO: 118} 

Exon: 1001.. 1195 

Exon: 8371.. 8553 

Exon: 9067.. 9557 

Start ATG: 1177 

Transcript No. : CT4516 

TCGACGCAGTCGGACGTTTAAAAATAAACGCGCTCCACCAAAAATACCTAGCCCGCTGGCCGCAAACGCAAATAAATCTGAACAAAAATTCGAAT 
TCCATCCGGATCAAACAAACTTACAAACCCGAAAATTAGTTCTTCGACTCCAACGGCCGAGAAAACCGCAACAGTTTCAAAATGTTCAGCTCGAT 
TTCAGCTTCCTTGAAGAACTTCGGTGACAAGACCGCCAACCTCTTCAGCAAGAAGAAGGATGAGGCCGAGAAGCTGGCCAACGAGAAGGCTGCCG 
AGGCCCAGAAGCTAGCCGAGGAGCAGGCCAAGAAGGTGGGCCAGTCCGTACAGCAGACCAAGGGCGAGGCCGAGCAGTTGGCCGCCAGCACGGCC 
AAGGAAGCCGCTGCTCTGGCCACCGCGGAGGCTCAGAAGGCTGGACAGGCGATTGACCAGGGTGTGAACCGTGCCGCCGGAGCCGTCAACCAAGG 
CAAGCAGGCGGTGGACAACACGGTGGCCCAGGCGGCGGCCGTGGCCCAGAACTCCAAGCAGGTGGCGGCCAATGTGGCGGAGGCCTCCCAGCGAG 
CGGCCGCCAATGCGGTGGACCAGACCAAGAAGGCGGCCAACGACGCCATCAACAAGAGCGTGAAGGCCGCCGAGAACGTGGCGGACCAGAAGCTG 
AAGCAGGCGGAGGGCGCCATCGATGGGGCACTGAAGCAGACCAGTCAAACGGTGGACCAGAAGCTGCAGGAGGCCAACCAGTATGTCGACCAGAA 
GCGCCAGTCCGTCGAGAAGACCGTCCAGGATGCGGCCGGACAGGCCCAGGAGTCTGCCGGACAGCAGGCCAACGCCCTGTTGGGCAAGCTGCATC 
TGGGTCAGAAGTAG 
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(SEQ ID NO: 119) 
Start ATG: 177 

MFSSISASLKKFGDKTANLFSKKKDEAEKLANEKAAEAQKLAEEQAKKVGQSVQQTKGEAEQLAASTAKEAAALATAEAQKAGQAIDQGVNRAAG 
AVKQGKQAVDNTVAQAAAVAQNSKQVAANVAEASQRAAANAVDQTKKAANDAINKSVKAAENVADQKLKQAEGAIDGALKQTSQTVDQKLQEANQ 
YVDQKRQSVEKTVQDAAGQAQESAGQQANALLGKLHLGQK* 
(SEQ ID NO: 120) 



Celera Sequence No. : 142000013383916 

AATT AAAT TAT T AAAT TAT T C AAAAC AAGC AAAAT AC AAC AAC AT AC AAC AACAAAAC AAAAC AAAAG AAGT AAAAC T AAAC AC T AC CAGC T AAT 
TATGTTAGATAACTGAATGAGAAAAGCTAAAACAACAGCAATAGCAACAGCAAAGGCATAAGCTAAGTTTGATCTGCAACCAACACCTCTTACCA 
CGAACAACGGCAAGCAATAACAGCAAGAACCCTTCCTTCCTCACCCCGATATAATTATGAAATTATATTATTTATAAAAACTGGGAACTAATTTA 
CGGAACGATGGCCAGCGTCAATAGAACAACAATACTACAGCAAACAGCAATGCTCGGGGCAGAGAGAGGATGCGACTCAGCACATGAGACTAGAC 
CCACTACACTGAAAGACGAACGAACATAGCGACGATATCGATTAGTATTGTTAAGTTTGGGTTAGGCTATATACTATAGATCCAGTCATGTTGCT 
GGGTCGTCCCACATCTCCCCAGCCAAATCAGTGTTATCGATCGGATGCAAAGAATCGAACCGAAGACCTGTAGATTCTTTCGCCTCCGAAACGAC 
AAACCAACAAACAAACTCAACTTCGGTTGGTTTAAAAATATATAAAAATTGATAAAAATAGTACAATGTTGTGTTTTTCTTTCTGCGCACGTACA 
TATGTATGTATGTATATCGATCTTTCATATTTCGACTTGAACCCAATTATTGCAATGAAAGTACATATGTATTTGTAATTCAAAAATTATATTAG 
CATACTTACACAGACCATTTGTGCAGTCAACCAATTGGTCGTCTCAATTTAATCTTCGGCAGTTCGAAGCTAAATTATTATAATTATTTTCTGTT 
GTTAAAAGAGGAAAAAAACGTACGCTCATCGCTTCTAAAATTGTTGGTCAAAGTAGGCCACGATAAATACATCGAATGTCATTTCGACCGACTGG 
CACACAACCTCGTTAGCCAATCTCAGTTCAGATCTCAACGTTGTAACATCAAAATTAAAAATTGTAAAAAGTGAAAGTCCGTTTTTATCCTTTTT 
TACCAACCAACTTTGAAGCTAAACTAAATCGTTTTAAAATCATTAAACAGTAAACTAACATGGGCGATGAAGGCGGTGGCGGTGGTCATGGCAAG 
GGAGACCTTAAGTCTTTCCTCATGGACTTCATGATGGGCGGCGTTTCAGCTGCCATAGCCAAAACGGCAGTGGCTCCTATCGAGCGCGTGAAGCT 
CATTCTGCAGGTGCAGGAGGTTTCCAAGCAGATCGCGGCAGATCAGCGCTACAAGGGCATAGTCGATTGCTTCATTCGCATTCCCAAGGAGCAGG 
GCTTCTCGTCGTTCTGGCGCGGCAACTTGGCCAACGTAATCCGATACTTCCCAACTCAAGCCCTTAACTTTGCCTTCAAGGATGTGTACAAATCG 
GTTAGTTGTTTGGACAATCCTTATAATAATCATTTCACATACGCCTATAACATATAAAACATAAAACATACTACAGGTATTCCTCGGGGGCGTGG 
ACAAGCACAAGCAGTTCTGGCGCCACTTTGCAGGAAACTTGGCCTCTGGCGGTGCAGCAGGTGCCACATCTCTGTGCTTCGTTTATCCACTGGAC 
TTTGCCAGAACCCGGTAAGTTGATTCATTATATACACATTCAATTGGTTTTTTCATTTATACACATATACATTAAAAATTGATAATTTTGAGTAA 
TTGAGGAAAACAATTCGAACCTTTACATACTTACTATAGTCGTTCAAAATAATCGATGTCCAAATGAGATTATACATTAAGACGCTTTTTCTACA 
ACTGCATTATTCACCCACATGAACTGTTTTCCCCATTTGATTTGAATTTGATTTACTTTCATTGGATTCGATTTTGTTATTAACTATTTATTACT 
ATTCCAATTCAATCTTCCGCCGACAGTTTGGCCGCCGATGTGGGCAAGGGCGGCAATCGGGAGTTCAACGGCCTGATCGATTGTCTAATGAAGGT 
GATCAAGAGTGACGGACCCATCGGTCTGTACCGTGGATTCATCGTGTCGGTGCAGGGCATCGTCATCTACCGAGCAGCATACTTTGGATTCTATG 
ACACTTGCCGCGACTTTCTGCCGAATCCGAAGAGCACACCGTTCTATGTGAGCTGGGCGATTGCCCAGGTGGTCACAACCGTTGCCGGTATTGCC 
TCCTATCCCTTCGATACTGTGCGTCGCCGCATGATGATGCAGTCTGGCCTGAAGAAGTCCGAGATGGTGTACAAGAACACAGCCCATTGCTGGCT 
GGTGATTGCCAAGCAGGAGGGCATTGGAGCCTTTTTCAAGGGCGCCCTATCGAACATAATTCGCGGCACGGGCGGTGCTCTGGTTCTCGCCCTTT 
ACGACGAGATGAAGAAGTACTTTTAGCCCCAGTGTGCCGCCGTTTGTGGTGGGCCAATTCGATCGCCAGCAGCAGCAGCAGCAGCCCCGGCCACA 
GC AAATGTCACACCACGATGGCGTAGCACCAACCGAACACACTGTCCACCTCTTCGTCGAGGAGGACTTGCTCCCACGAGCCCAGTAGTAAATTA 
ATAATTTGATTTGATAGGTCGCCCCACATGGTGGTCGGTCCCTCATTTCAGATTGTGTTCACTTTCGCAGTGGTTGAGAAAATACAAATAATATA 
TTGAACTCAAATCTTTGTTGTGCGCTCTTTGTTTGTTTTCATCATCGCGTGTTTTTTGCTTTTTGGTGCATGTGTGTGTACGGATTAGGGTGGGC 
CACATCTAACTAGGATCTAATTATCCTCTCAACAAGCTAAGCTAGTCGTTATATTCAACTATATGTGAAGGACAAACAAAAAAAATTTAATTAAT 
TTACGTATATTATTTTGTCTCGATTGAATACATTTCAAAAAAATTACAAAATCATTATATGTACGCCATGAAATGATAAAAGGTTATGTACTATA 
TGGGCTGCCCACCCTACTCGTTTTCGTAGATATGGTCTTGTGCGGGTGGATTTAAGACTTAAACGTTAGTTAGCCATTTAAATAAAAATAACATT 
AAAAAC AGAAAGCC ATGTC GCTT TCCATTGTC AATC TGCT GC ATACGT TAATTAAC AAC AAAAAACAAAAGT ACAAAAAAAAG AGGCAAAATT AC 
AAGTACTCGATCTGCACTTATTCTGTAAACAATTAGAAATAACTCTAGACTAAGATCCAGTCACTGAGTTGGCTGCACTTAGTGGGTAGCCCCGG 
ACCTGTGGGCCAGGAACAGGAAGTACGTGCCCGCCGCGTAGGCCAGGAAATTGAAGAGTCCAAAGACACCGGCGGCGGTATTCGAACCGGAACCC 
TTGCCCGTGGCATCCGACCATCTGGCCAGCTGGACGATGAATCCGATGAAGTACAGCAGCGTCAGCACAGCGGTGGTGATC 
(SEQ ID NO: 121) 

Exon: 1001.. 1425 

Exon: 1502.. 1629 

Exon: 1927.. 2406 

Start ATG: 1105 

Transcript No. : CT47 08 

AAAATTAAAAATTGTAAAAAGTGAAAGTCCGTTTTTATCCTTTTTTACCAACCAACTTTGAAGCTAAACTAAATCGTTTTAAAATCATTAAACAG 
TAAACTAACATGGGCGATGAAGGCGGTGGCGGTGGTCATGGCAAGGGAGACCTTAAGTCTTTCCTCATGGACTTCATGATGGGCGGCGTTTCAGC 
TGCCATAGCCAAAACGGCAGTGGCTCCTATCGAGCGCGTGAAGCTCATTCTGCAGGTGCAGGAGGTTTCCAAGCAGATCGCGGCAGATCAGCGCT 
ACAAGGGCATAGTCGATTGCTTCATTCGCATTCCCAAGGAGCAGGGCTTCTCGTCGTTCTGGCGCGGCAACTTGGCCAACGTAATCCGATACTTC 
CCAACTCAAGCCCTTAACTTTGCCTTCAAGGATGTGTACAAATCGGTATTCCTCGGGGGCGTGGACAAGCACAAGCAGTTCTGGCGCCACTTTGC 
AGGAAACTTGGCCTCTGGCGGTGCAGCAGGTGCCACATCTCTGTGCTTCGTTTATCCACTGGACTTTGCCAGAACCCGTTTGGCCGCCGATGTGG 
GCAAGGGCGGCAATCGGGAGTTCAACGGCCTGATCGATTGTCTAATGAAGGTGATCAAGAGTGACGGACCCATCGGTCTGTACCGTGGATTCATC 
GTGTCGGTGCAGGGCATCGTCATCTACCGAGCAGCATACTTTGGATTCTATGACACTTGCCGCGACTTTCTGCCGAATCCGAAGAGCACACCGTT 
CTATGTGAGCTGGGCGATTGCCCAGGTGGTCACAACCGTTGCCGGTATTGCCTCCTATCCCTTCGATACTGTGCGTCGCCGCATGATGATGCAGT 
CTGGCCTGAAGAAGTCCGAGATGGTGTACAAGAACACAGCCCATTGCTGGCTGGTGATTGCCAAGCAGGAGGGCATTGGAGCCTTTTTCAAGGGC 
GCCCTATCGAACATAATTCGCGGCACGGGCGGTGCTCTGGTTCTCGCCCTTTACGACGAGATGAAGAAGTACTTTTAGCCCCA 
(SEQ ID MO: 122) 

Start ATG: 105 
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MGDEGGGGGHGKGDLKSFLMDFMMGGVSAAIAKTAVAPIERVKLILQVQEVSKQIAADQRYKGIVDCFIRIPKEQGFSSFWRGNLANVIRYFPTQ 
ALWFAFKDVYKSVFLGGVDKHKQFWRHFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGGNREFNGLIDCLMKVIKSDGPIGLYRGFIVSV 
QGIVIYRAAYFGFYDTCRDFLPNPKSTPFYVSWAIAQVVTTVAGIASYPFDTVRRRMMMQSGLKKSEMVYKNTAHCWLVIAKQEGIGAFFKGALS 
NIIRGTGGALVLALYDEMKKYF* 
{SEQ ID NO: 123) 

Name: mitochondrial carrier protein; ADP/ATP translocase 
Classification: transporter 
Gene Symbol: Ant2 
FlyBase ID: FBgn0025111 



Celera Sequence No. : 142000013384605 

GATGCGGCACGACGAGGCACATCCGCGGGAAATGGCGTCGTCGAGGAGGATTTGAGGCACCTGAGTGGTCTTGCCACATCCAGTGCTTCCTACAA 
TGAGTATTACTTGGTTTTCCCGCACAGCCTGGATAATATCGTCGGCGTATTTCATGGTGGGCAGTTTTTTCCGTGCTTCTAGCCGTTTCTTGGCA 
TTCTCCTCCAACTGGCGTTGCCCCAGTTCCAGCTGCAGTCTCTCGTCCAGCTTGGGGTTTACCCAGTCTAAGTCCGCGTTCCGTTCCTTGGTCTC 
CGCAACGAATTCCTCAAAGTTTACACTCAAAAGATGGCGAAACTGTTGCTGAAATTTGGCATCCACATTTTTGTCATCCACATTTTGTCTGGACG 
GTGCCCGGCTGTAGTCCTCCATTAACTCCTTCACACGCTCAAGCACTCCACTGGGAGCACTGACATTGCAGCCCAAGCGGATTTGCGGCTCCTTT 
GATTCCGCATTCTCGCCTCTGTCCTTCTTTTGCTGCCGGGCAAGATTGCGATAGTACAAACCGATGTCCTTGCCCCTGAGACCCGGTGGGCGGTT 
GCCCTTGCGGGCGTTGGAGCCGGATGAGTCCCTATCGCGCTGCATCCTGTCCAATTTATATAAAAGAATTCAGAAGAAGAATTCCCAACCCGAAG 
TCAACAAAATTCATAGTGCTGTCGGACTGTCAAAATGCGCAAATCAGATAGACTAGGACTGCCAACTTAATTATATACACCTATTGTGTTACTTT 
AAT AAT TAATTTTCAAATATAAAT ATTT TAT TCAAAC AAAAT T GGTTGCATTTAGAACACGAAGCAAAAT AAAATGAAAATC AT AAAC AAAGTTT 
TAAAAATAAGTGAAAAACACCGGCTTGCTGCTTATTGTTTGGTTCGAATACAGGCTAAATCCCAAAAGTATATTTCCATAACCCTTGAAAATGAA 
GCGGTTTGATTGAGCGATCCCGCTGCGGTCACACTAATTTCGATCAACTCAACGCGCTGAGTAAATCGAACCGCTTGTGCTTAGAATTTGTTTTT 
TCGCAATTAACGGCCGTGAAGAGCCAGCGATCTGCATGCCAAGTGTCCAATAACTAGGCAGAACAGTGTTTGCGTTGCACAAAAGCCCGCTAGCG 
GAGTGTGTTCCATGTGTCATTCCCGGCAAAGGAGCAACAAGTGGCCGAAAGTGGAAAGCAAGAGCAACAACTAAATAAGTAAGGAGTACCACACT 
GCAAGAAAAAGTGTTCGGAGGGAACGGTGTGCACAGCTTGTCAGCTTGTTTTTTGTACAGAGCGTCGCGCAATAAAACAAACAAAAAGACATAGT 
CCAAAAACAACAACCACAACAAAAGCCGTTTGATCGCGAGTTTGTGTGCGTGTGCGAGTGTGGCTCTATTTGTGTGAGAGTGGCAAGCCATTATT 
TTTCTGGTCTTATGACGTAAACCACCGAAAAAAATCAAAAGTGAGCAGGCGACACAGATATCAAAACGAAGGTGAGCGCATTCTTACCAATTCTC 
GTTTCGTTTCTTGGAATTCCAGCGGTTATTAGACGGTTTTATACTTGTTTACATTTTCGCAGTTGCAGCACACAACTAATGCCAGAGCGGGAGCG 
AGAAAGACATATGTAGATCGTTCTGTGGAAGGAGTGTTGGTCGCGGAGGGTTGTGTTGTGATGTTGTGGAGGGGAGGGAAAGAGATGCAGTGCAT 
GTAAACAAAATAGAGAAAGAGGCAATTGCAGTAAATTCTTTGGCAAAAGAAATCTTGTTGAAAGCATTTCCTCCGCAGATGCTTATGTGTACTGG 
CGACCTGAAAAGAAATTGTTGCCCTGCTTGAACTTGTCGCTCCACTCTCCACTACCCACCTCCTCGCCCCCTCCCTCGCACACTCCTCTGCCCAC 
ACACACACACACACACAACACTCCATTCACAATTGGCCTTTTGCGGAGTCATATATTAAATTTGATTTCTCTAGCCGAGCACTGTGAATGATTTG 
CCGCCAGTTGGCACGCCCGTTCGCCACACTGCAGAAAATACCAGGGCACATATATATCTATATCTATATATCTATTAAATTTATCCTTGCATGCC 
AAATATTGAAACGCAGCACTTTATAGCTCTCTTAGCTTAAAATGTAAATTTTACTCAATTTAAGAGCACTAATAACGAACGTTTATTTCATTTCC 
TAATCAAATCTTACAGTTTAAGTACTTTGTTTTCCGTGTAGCGAGCCAGAACCATCCGCAAAACCTGTTGCTCGTTTGCCTTTTTATTGGATTTT 
CTGTTTTCTACCTCTGCTGCTCAGCTGCTGCGATTCCGATTTCATTAGTGATTCATTAGATGCGTTGGACAAGCTAACGCAAAAAGCCAAAGCTT 
GTTCCCAACGCATCCTTCAATACAAGGCAATTCAAATGTTCAAACGCTAACGGCCAGCCTCTCAGCCTGGTCAAATGAATCACTCCATATTCTCC 
ACATATTCTAACCAATTTCTCTCGTGTTTTGTTTGGCTCTCAGTTTTGCGCTGTGAATAACGTGCTATTTCCTATTTTGGCTGGCCGCGATTATC 
GCATCGTTTTGTAAATTTAGTGCGCATTATTAATGAATTTGTCCAATTTGCTGCGGTATACTTTTATCGTTTTTGTTTATAATTATAATCAGTCG 
TTTTCACGCGGTGAGAGTTTTGGATAGCACAGTGGTCGGATTCTAATTTGGAAAATGGTTAAGCTTGAGAGTGTTTTAATCTGTAATATAAACTT 
ATTGTACAAACCTAATTATAAATTGATCATTTTTGAGCTTACCAGTAATATTCACACAGTATTAAAATTATAATTCCTATTAGATTTCCACCCTT 
AAAAAATTTTAAATGAAAGCCAACTCTCATGTGTTAATGTTCTTAAGCATGTAAAAGGAAAACAGCACCCTTTCCAAGTCGAAGAACCACTGTAA 
TGGGCCTCGTTCCCTCAAAATACGTACATATGTACGTGTTAATTGAAAATTCCACACTCCGTTGAGGCTGAATCGACATTACGGCACCCCAGATA 
ATTGTAAATAAACCCAAGCTGATAATGATCGACCACCTCCCAGCCATCCGTCCATCGATTGGATGCATTCGCTGGGAAGCCATCAGAAACCGCGG 
TCTGTGCGCTGCTCTCGAACACTTGTAAACAAGGAACAGCTGTGCAAGTTGTCTCCCCCAAAATATTGAAGACTGCGCAGGCCCCAAAAATCGTT 
GTTTGGTCTTTGTTTAGTTTTCGCCTAGCTATCCAACGAAATGCGCCTCAAAAAGCTGGGGTTAGACTTGAAATCTTTTGCATAGTTTATGAAAT 
CGAACCCAATATTTCTGTTCTGTTCTTTGCAGAATGTCTCGTCACGAGAAAACGAAATCCACGGGCGGCGGGCTCCTGGACAGTCTGTTCGGAAG 
ACCCTCGAAGTCCAAGGGAGGAACCATCAGCAGTGGCACCCTGGCCCATGGCGGACGACCCGTGTCCGCGGACAACTATGTGGTGCCGGGCGTGG 
AGGACTTTGAGCAGTACATCCAGCAGCTAAGCGTTGCGGAGCTGGATGCGAAGTTTCTGGAGATCATCGAGGACATGAACATTCCGAAGGACAAG 
AGGGAGCCCCTGTTGGCCAAATCGAAGGAGGAGCGACAGAAGATGATTATGTGGCACTTGAAAGGTAAAGAACATGGTGCAAGTACACGTTTTCA 
AACAGAAAACCGCATTTGTGTTTGAAGATTGAGTATCGTTTGGCTAAGAACTATAGTTCTGGAATAGAAAGTCTAAGAATAATCTACATACATAT 
ACGAATAATCTAAACTCGTCTGAACCGTAATGTTCCAAGCAACTTTGAAAGGCCAACCGGATTGGCAAAACTTTAACACCCTTTCCCTTTGCAAG 
GGTATACAAACAGCAGACCATCGATTTTCGTGCAACTGGGTGACTCTGCCTTCTACTTTTTTTTTTTGCTTTTGTTGTGCTGCGGAATATTGCAT 
TCCACTTGTGATAAGATAAGCAGGTAACGGAACTCGTGTCCGTGTGCTTTCCCCGTTTCGCTGTGTGCGTGACACTGCAAATTTCGCTTTCTGAT 
TGCTTAGCGCCTTCTTAGCTGCCAATTTGCTCTGTGGCCAGGTTCTGTTCTATATTTCTGGCCAATTCCTAAACCGATTGAAGTGATGTCAACGA 
TCCTTTGGATAAGTAAGCTCATAACATACATAAGTCTCTAAATGGTTTCCACTTCAATTAGTCAAGAGGTGATTACTTTGTTTGTTAACTACATA 
TACTTGCTTAACTCTACTTCAATTTTGAGAATACTTTATTTCTAGAATTATTATAACGATATTATGTGCATCGCAGCGCGGTAATGATTCAGCTG 
GAATCTTCCCTTGTGTTTGCTTCATTCATATCCAGCGAGATTAGTACGGTGTACTCTACTGTACTACAGTGATACTACCAATTATCACTGTAGCA 
CATCGCCAGCTGTATCTTGAGCACATTTCAATGCCAGCTCAGCTTTCAGCTTCAGCTTCGTTCTTTGGGTCTCGCTCCAGTTAGTTCACCATGGC 
CATGGTCCATTATCAGTCTTGCGGAGTGTGGAATCAAAGTTCCCTTATCGTTTGTAACGCCTAATTGCGCCACAGCCGCGGACTATCTGCTTGCC 
CATTTTATCAAGTAATACCAAGTGCTATATTATACATAGTATGTATGTAAGTGGTACGCGGCGAATGCGTGACAAAGTCGAGCGGCAAAAAATGA 
AAATGCAAAGACATTGAATTTCCACCACCGTGCACATTATACAATTTAAATTACGTCAAGTGGCAGTGAGAAAATGAAATGATTCTTTTATTTCG 
CGCTTAGGTCTTTTTGTTGTATAAGCAAGTGCATCGTGAAGGCGAGAGAGAGATGTACTATGTACACTGAAAGTATTTTTACTAAACATCATATA 
TAAATATAGCTAATGATTATAACCTACTATTTATAAATCTCAATGGCCTTGTTATTGAATATAAATTTAATATTTCAAATTTTAATTCAAATTTC 
CTGTACCGACGAGGCTTAGTTCTCAGTGTAGCAGAGGCAGAGGGTGATTTTCCTTACACTTTGGAACGGCCATAAAATGCAAATATGCGAAAACA 
TTAAACACATTATGACAAAATCGAGCTGAAAATAAATTGTGATGCCTGTCAGTAAACTTGAGCTGCGGCACAATCGTATCTCATAGGTGCAGATA 
GCTGCGAGTGAGAGGTAAAAGCGACTGAAGTAAGTATTTGTGGGTCGTGTTTATTGTGGCAGAACCCACTTGTTGCGAAATTAAATTTAAAACAC 
AAACAATGGACTGGCCGACAACAAAAAAGGCTCGCACAAAAAGCTGTTCATTAACACGCTTCGCTTGCCGTTTTGAGTTAATTAATAAAACAGTT 
AGCGTAGTTTTGCTTTGACGGATAAACGTATAAGTGGTGGGCAGGTTTTGGCCAATTTGAGTGATGTCAGTGGTGCAGGTCAATCAGAAAAAGGG 



FIGURE SHEET 54 



GTGGGTGGTCGCACCGAAGGCCTCCCTTCCAAAGGTGGGACATGATGATAGCCTTTTAACTACCCCTATCTCTCACAGTTTTAGGGCAAAGTTAT 
CACATAAAGCTGTGCAGCAACTATTTAGAACCTCCAAAGTGTAATCATTGAGCCACCTATATAGAATATGTATCATCCTTTATCATGAAATTTAC 
AAACTTCAAGTTGGAAAATCTGGAGTGATGGCAAATAAACTTAAAAACACATTTGCGTATTATAATGTGTTGATATGTGTATAATGTTACACCTT 
TGTTTAACATAACTTGTTAACGATTATATTCCAGATTAGAGCTTAAGTGGTAATTTCTCGTTAATTAAATGGTTTTCTTATCAAATTGCCATTGT 
ACGTTTGTGGGAAAGTATTCCGATTTAGCAGACTAATGATAGGGTAGAGTGAATCATCGCTGATATGACTAAATTCAGGTGTGTTGTCGGTGGGA 
GGCAAACCTCTTTCTTTCAAAATATAAGCCTTGAGCTATTTTGGAATGAACCCGTATACTTTTTATCATCGAAACAGCACAAACGTGAGTTGAAT 
GGCAAAATGTACTTTAATTTACTATTGTGCCAACTTTTAATTTATGTCAAAAGTGTAACAAATCAAAATGTCCAATATTAGCCATAATTATCAAT 
TGCTTAAGTTACGAACTGATCGGATTAGATATTAATAGGTGTTTATAGAACCATATTCCTATGTCTACTATGTAAAATACCGCTGACCACTAAAT 
AGCTTTACAGAAATTCAATACGTGCGCTCATCCGACTCCAACTCCAATTTCTCTTCCAGGTAAAAACTCACTGGAGCGTAGCGCCAACTCCCGCT 
TCGAGAAGCCCATAGACTATGTGGAATACCTGCAGAATGGGGAGCACAGCACGCACAAGGTGTACCAATGTGTGGAATCTCTGCGCGTGGCGCTC 
ACCAGCAATCCGATCTCGTGGATCAAGGAGTTTGGAGTGGCGGGCATCGGGACGATTGAGAAGCTGCTGGCCCGGTCAAAGAATAATGCCAGCTA 
CGAGAAGATCGAGTTCGAGGCGATTCGGTGCCTGAAGGCGATCATGAACAACACATGGGGTCTGAACGTGGTGCTCAATCCGGATCAGCATAGTG 
TGGTGCTGCTGCTGGCGCAATCCCTGGATCCCCGGAAGCCGCAGACGATGTGTGAAGCCCTCAAGCTGCTGGCCTCGTTCTGCATTGTCTATGAG 
CGGAATGGCTACGAGAAGGTTCTCCGAGCCATAACCACTATTGCAGCCACATCCTTTAAAGCGAGCGAGCGCTTTCGACCCATAGTGGATGCCTT 
GTTTGCATCGGATCAGCAGGATCCCAAACGGGACTTGGCATGCCACAGCCTGATTTTTATTAATACTCTCACCAATACCCCCACGGATTTGAACT 
TCCGCCTGCACTTGCGATGTGAGATTATGCGCATGGGTCTATACGATCGCCTAGATGAGTTCACCAAAATCGTGGAGGCCAGCAATAATGAGAAC 
CTGCAGCAGCACTTCAAGATCTTCAACGAGATCCGCGAGGATGACTTCGAGGAGTTTGTGCAGCGCTTCGATAATGTCACCTTCAACATGGACGA 
CGCCACCGATTGCTTCGATGTGCTGAAGAACCTGGTGACTGACACCACTTCCGAGCCCTACTTCCTGTCCATCCTGCAGCATTTGCTGTACATCA 
GGGATGACTTCTACTTCCGACCTGCCTATTATCAGCTGATTGAGGAGTGCATCTCACAAATCGTCTTCCACAAGGGTTACTGTGATCCGAATTTC 
GAGAACCGGAACTTTAATATAGACACCTCGCTACTGCTGGACGACATTGTGGAGAAGGCCAAGGCCAAGGAGTCGAAGCGGTCGGAGGAGTACGA 
GAAGAAGATCGAGCAGCTGGAGAGTGCCAAGCAAGAGGCGGAAGCGAAGGCGGCTCATCTGGAGGAGAAAGTCAAGTTGATGGAGGCTAATGGTG 
TGGCGGCTCCGTCGCCCAATAAGCTACCCAAGGTAAACATACCCATGCCACCGCCACCACCAGGAGGAGGAGGAGCACCTCCCCCACCGCCGCCA 
CCTATGCCGGGACGAGCAGGTGGTGGACCTCCGCCTCCACCACCACCTCCCATGCCGGGAAGGGCAGGTGGACCGCCACCACCTCCTCCACCGCC 
CGGAATGGGAGGCCCACCGCCTCCACCCATGCCTGGCATGATGCGCCCAGGAGGGGGACCTCCACCACCGCCCATGATGATGGGGCCCATGGTTC 
CCGTTCTGCCTCATGGCTTAAAACCGAAGAAGAAGTGGGACGTCAAGAATCCCATGAAGCGGGCCAACTGGAAAGCCATTGTCCCGGCCAAAATG 
TCCGACAAGGCATTCTGGGTCAAGTGCCAGGAGGATAAGCTGGCCCAGGATGACTTCCTCGCAGAACTGGCCGTGAAATTCTCTTCTAAACCTGT 
AAAGAAAGAACAGAAGGATGCGGTGGACAAGCCAACGACGCTGACAAAGAAAAATGTCGATCTTCGTGTGCTCGACTCGAAGACTGCTCAGAACT 
TGGCTATTATGTTGGGAGGATCGCTGAAGCATCTGTCCTACGAACAGATCAAGATCTGCTTGCTGCGCTGCGACACCGACATCCTGTCCTCCAAT 
ATCCTGCAGCAACTTATCCAGTACCTTCCGCCGCCAGAGCACCTCAAGCGTTTGCAGGAGATCAAGGCAAAGGGCGAACCGCTACCGCCGATTGA 
ACAGTTTGCAGCCACAATAGGTAAGGCGAATGGAGTTTGTAGTGCAGAAGGCGCAGAATAACTTTACTTAATTTCAGGGGAAATTAAACGCCTTT 
CGCCGCGACTTCACAACCTGAACTTCAAGCTGACCTACGCGGACATGGTGCAGGATATCAAACCCGACATTGTGGCAGGAACGGCAGCATGCGAA 
GAGATCCGGAATAGCAAAAAGTTCTCAAAGATCTTGGAGCTGATTCTGCTGCTTGGAAATTACATGAACTCGGGCTCCAAAAACGAGGCCGCCTT 
TGGCTTTGAGATCAGTTATTTAACCAAACTGTCCAATACGAAGGATGCGGATAATAAGCAGACATTGCTGCACTACCTGGCTGACCTGGTGGAAA 
AGAAATTCCCAGATGCACTAAACTTCTACGACGATCTGTCGCATGTTAATAAAGCGTCGCGGGTCAACATGGATGCCATCCAAAAGGCCATGCGG 
CAAATGAATTCGGCGGTTAAGAACCTGGAAACTGATCTCCAGAACAACAAGGTGCCGCAGTGTGATGATGACAAGTTTAGCGAGGTGATGGGCAA 
GTTTGCCGAGGAGTGCAGACAACAAGTGGACGTGCTGGGTAAGATTATCTCATCTGATCGGAACGAATATTACATGAGCAAAATCGAATTCTCTA 
ACATCTCGTTTTAATTTCAGGCAAAATGCAGCTGCAAATGGAGAAGCTGTACAAGGACCTCAGCGAGTACTATGCCTTCGATCCCAGCAAATACA 
CAATGGAGGAGTTCTTTGCGGACATCAAGACTTTCAAGGATGCCTTCCAAGCGGCCCACAACGACAATGTCCGGGTACGCGAGGAGCTGGAGAAG 
AAGCGTCGTCTGCAGGAGGCCCGAGAGCAGTCTGCTCGAGAGCAACAGGAGCGCCAACAGCGTAAGAAGGCAGTGGTTGACATGGATGCCCCGCA 
GACGCAGGAAGGCGTGATGGACAGTCTGTTGGAGGCGCTGCAAACGGGCTCAGCCTTTGGCCAACGAAATCGACAGGCCCGGCGACAACGACCGG 
CGGGAGCGGAGCGGAGGGCACAGCTCAGCCGGAGTCGATCGCGGACGCGTGTCACCAACGGACAACTAATGACCCGCGAAATGATCCTCAACGAG 
GTTCTAGGCTCCGCGTAGAAGTAATGGAACGAGCAATTCCCCTCTGTACATATATAAATAGATATCTGTAGCGATATAAGGGCCTGGGCTCATTA 
GCCGTGAGTCCGTGTGCGTCGCCTTGCTTGCATATCAACTCAATTGGCGTGTCCAAGAGATTTGACGAATTGGATGCCCAGCAGGAAGGACTGTA 
TCAAGAGCCGCAAAGTACTAATTAACTGGAACTAGAGTCACAGGTGGATTTTCCGAAATTATTAATTGCTGTAACGGAGTCCTTGGTTTATAGGA 
TTGAGTTTTCCGTTTGAATTGATTGGAGTATTTATTTAATTGATTGTTTAATAAATAACATTTTGTTTAGAATTTATAATATGCAAATAACACAT 
ACAAGATCACTTTTATTTAATTAAAATGATTGGCAGGCCTTTTAATTTCAGCTGTACATTTTATTTTACTGCTATGACTGTCCCAATATTAAAGT 
ACCAATTTTACAAGTCGAATATACTATAAAAAAACTTCTAGTTGAAACTAAATTCATGTCAAGAAATTAGCAATGCATATCGATGAAAACTCCAA 
CTTCGCTACCTATACGAAAAAACTAAATATATAGATATTTTTGATACGCTTTTACTTCTGAAATTATGTCGTTGTAAGTCTAAATACTTAAAAAT 
GAAACTGAAAGCAACTTTAAGATTTATCTGTATCTTGTGGTCTTTACATACTGTATTCCTATTTTACTGTGGCATGTTCCAATCATTTTGCTGTG 
CACTTCGTTTTCGAGGCACTCAGTAGAGCAGAGGCGAGATTTATATCAAAGCAAAATTCCTTTTGTATTTAATTTGTTTTTAAGCGAAACTTTTT 
TGAACTTGTTTACCTTAAAATCCTATTTTAATCCGCGCTATACATAACATTGTTAATAAATGTCTATTTTTCGACAATGACGGTATATTGAAAAT 
AAAACTACCAAATCAACTGCTTCTCAACCAATTGATTGGCAATAATCCTGGAATATTCCACAAGAAGGCTTGGCTGCGAATCCGAAAGCAATTTG 
GGCTTTCCACCCATGTGTGACTTGCCGCCTTTTGTTGTTCTTCTCCACTCTAATTTGTTTTATTAACAAAGATATTTTATCGCCAATGTCGCGGC 
ACCCATAAAGGTGCTTGATTGCGACGGCGAAGGAGGCGGTTGCTCGGCTCGTACAACGCAAAATGCACTTGCTGCGGGATCGGCTGGGGGTTGCG 
GGTGGGCGGAAGGGGTTTCTGGAGGGGTTTGGCCCGGCAAACGGCCGTAAAACTCTCAAAAACGTTAAACGAACCGACAAACAGCGAATGCCGCA 
CGACAATGATACGTCTACCATCCATACATATCCATCCATCCACACTCAAATCCGTCTGGCGTCGACGCTCCACCAAGCCACAGATATAGGTTGTT 
AGATCGGGACTGGAACTTGGACATATCTGTAATGGTCTGATTGCCCCTTGAGTTTTATTTCGGAGTCTGCTTTATGGGCTATGGCGCTTGCTTTG 
CTTTCTTTATGGATTTATTAACATAATTGAGGGAATATGTATAAATACCATTGAATTCGTAGCAGGTTGTTAATGTCAATGGCGGGGACAGATTC 
TAATTTACGATGGCCAGATAGAGATGCATTAGAGTGTTTTACAGATTGAAATAAATCTTTCATTGCCGGAAGGGCAATTCGAAATGGATTCTCTG 
CTTTCGATTCAACTAATCGTCGGGGAACGTTTGTATATTATAATATTATGTATTATGTTTGTAATTGATCAACACTTTCTATTTGATGAAAAAAT 
ATTACAATCTTCTATCAAATCTTGCGTTTGTACGACCAAAATTCATGATTCATCTCGAATTTGCCGCCCAGGGGGAGTTACCATATGTTTACATA 
GATCACTTAGGGATTAGCCTTTTTGTGCGCCCCTAATGACTTCCAGGTAAC 
(SEQ ID NO: 124) 

Exon: 1001.. 1496 
Exon: 3358.. 3674 
Exon: 6330.. 8285 
Exon: 8343. .8873 
Exon; 8951.. 10356 
Start ATG: 3359 
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AACGCGCTGAGTAAATCGAACCGCTTGTGCTTAGAATTTGTTTTTTCGCAATTAACGGCCGTGAAGAGCCAGCGATCTGCATGCCAAGTGTCCAA 
TAACTAGGCAGAACAGTGTTTGCGTTGCACAAAAGCCCGCTAGCGGAGTGTGTTCCATGTGTCATTCCCGGCAAAGGAGCAACAAGTGGCCGAAA 
GTGGAAAGCAAGAGCAACAACTAAATAAGTAAGGAGTACCACACTGCAAGAAAAAGTGTTCGGAGGGAACGGTGTGCACAGCTTGTCAGCTTGTT 
TTTTGTACAGAGCGTCGCGCAATAAAACAAACAAAAAGACATAGTCCAAAAACAACAACCACAACAAAAGCCGTTTGATCGCGAGTTTGTGTGCG 
TGTGCGAGTGTGGCTCTATTTGTGTGAGAGTGGCAAGCCATTATTTTTCTGGTCTTATGACGTAAACCACCGAAAAAAATCAAAAGTGAGCAGGC 
GACACAGATATCAAAACGAAGAATGTCTCGTCACGAGAAAACGAAATCCACGGGCGGCGGGCTCCTGGACAGTCTGTTCGGAAGACCCTCGAAGT 
CCAAGGGAGGAACCATCAGCAGTGGCACCCTGGCCCATGGCGGACGACCCGTGTCCGCGGACAACTATGTGGTGCCGGGCGTGGAGGACTTTGAG 
CAGTACATCCAGCAGCTAAGCGTTGCGGAGCTGGATGCGAAGTTTCTGGAGATCATCGAGGACATGAACATTCCGAAGGACAAGAGGGAGCCCCT 
GTTGGCCZ^AATCGAAGGAGGAGCGACAGAAGATGATTATGTGGCACTTGAAAGGTAAAAACTCACTGGAGCGTAGCGCCAACTCCCGCTTCGAGA 
AGCCCATAGACTATGTGGAATACCTGCAGAATGGGGAGCACAGCACGCACAAGGTGTACCAATGTGTGGAATCTCTGCGCGTGGCGCTCACCAGC 
AATCCGATCTCGTGGATCAAGGAGTTTGGAGTGGCGGGCATCGGGACGATTGAGAAGCTGCTGGCCCGGTCAAAGAATAATGCCAGCTACGAGAA 
GATCGAGTTCGAGGCGATTCGGTGCCTGAAGGCGATCATGAACAACACATGGGGTCTGAACGTGGTGCTCAATCCGGATCAGCATAGTGTGGTGC 
TGCTGCTGGCGCAATCCCTGGATCCCCGGAAGCCGCAGACGATGTGTGAAGCCCTCAAGCTGCTGGCCTCGTTCTGCATTGTCTATGAGCGGAAT 
GGCTACGAGAAGGTTCTCCGAGCCATAACCACTATTGCAGCCACATCCTTTAAAGCGAGCGAGCGCTTTCGACCCATAGTGGATGCCTTGTTTGC 
ATCGGATCAGCAGGATCCCAAACGGGACTTGGCATGCCACAGCCTGATTTTTATTAATACTCTCACCAATACCCCCACGGATTTGAACTTCCGCC 
TGCACTTGCGATGTGAGATTATGCGCATGGGTCTATACGATCGCCTAGATGAGTTCACCAAAATCGTGGAGGCCAGCAATAATGAGAACCTGCAG 
CAGCACTTCAAGATCTTCAACGAGATCCGCGAGGATGACTTCGAGGAGTTTGTGCAGCGCTTCGATAATGTCACCTTCAACATGGACGACGCCAC 
CGATTGCTTCGATGTGCTGAAGAACCTGGTGACTGACACCACTTCCGAGCCCTACTTCCTGTCCATCCTGCAGCATTTGCTGTACATCAGGGATG 
ACTTCTACTTCCGACCTGCCTATTATCAGCTGATTGAGGAGTGCATCTCACAAATCGTCTTCCACAAGGGTTACTGTGATCCGAATTTCGAGAAC 
CGGAACTTTAATATAGACACCTCGCTACTGCTGGACGACATTGTGGAGAAGGCCAAGGCCAAGGAGTCGAAGCGGTCGGAGGAGTACGAGAAGAA 
GATCGAGCAGCTGGAGAGTGCCAAGCAAGAGGCGGAAGCGAAGGCGGCTCATCTGGAGGAGAAAGTCAAGTTGATGGAGGCTAATGGTGTGGCGG 
CTCCGTCGCCCAATAAGCTACCCAAGGTAAACATACCCATGCCACCGCCACCACCAGGAGGAGGAGGAGCACCTCCCCCACCGCCGCCACCTATG 
CCGGGACGAGCAGGTGGTGGACCTCCGCCTCCACCACCACCTCCCATGCCGGGAAGGGCAGGTGGACCGCCACCACCTCCTCCACCGCCCGGAAT 
GGGAGGCCCACCGCCTCCACCCATGCCTGGCATGATGCGCCCAGGAGGGGGACCTCCACCACCGCCCATGATGATGGGGCCCATGGTTCCCGTTC 
TGCCTCATGGCTTAAAACCGAAGAAGAAGTGGGACGTCAAGAATCCCATGAAGCGGGCCAACTGGAAAGCCATTGTCCCGGCCAAAATGTCCGAC 
AAGGCATTCTGGGTCAAGTGCCAGGAGGATAAGCTGGCCCAGGATGACTTCCTCGCAGAACTGGCCGTGAAATTCTCTTCTAAACCTGTAAAGAA 
AGAACAGAAGGATGCGGTGGACAAGCCAACGACGCTGACAAAGAAAAATGTCGATCTTCGTGTGCTCGACTCGAAGACTGCTCAGAACTTGGCTA 
TTATGTTGGGAGGATCGCTGAAGCATCTGTCCTACGAACAGATCAAGATCTGCTTGCTGCGCTGCGACACCGACATCCTGTCCTCCAATATCCTG 
CAGCAACTTATCCAGTACCTTCCGCCGCCAGAGCACCTCAAGCGTTTGCAGGAGATCAAGGCAAAGGGCGAACCGCTACCGCCGATTGAACAGTT 
TGCAGCCACAATAGGGGAAATTAAACGCCTTTCGCCGCGACTTCACAACCTGAACTTCAAGCTGACCTACGCGGACATGGTGCAGGATATCAAAC 
CCGACATTGTGGCAGGAACGGCAGCATGCGAAGAGATCCGGAATAGCAAAAAGTTCTCAAAGATCTTGGAGCTGATTCTGCTGCTTGGAAATTAC 
ATGAACTCGGGCTCCAAAAACGAGGCCGCCTTTGGCTTTGAGATCAGTTATTTAACCAAACTGTCCAATACGAAGGATGCGGATAATAAGCAGAC 
ATTGCTGCACTACCTGGCTGACCTGGTGGAAAAGAAATTCCCAGATGCACTAAACTTCTACGACGATCTGTCGCATGTTAATAAAGCGTCGCGGG 
TCAACATGGATGCCATCCAAAAGGCCATGCGGCAAATGAATTCGGCGGTTAAGAACCTGGAAACTGATCTCCAGAACAACAAGGTGCCGCAGTGT 
GATGATGACAAGTTTAGCGAGGTGATGGGCAAGTTTGCCGAGGAGTGCAGACAACAAGTGGACGTGCTGGGCAAAATGCAGCTGCAAATGGAGAA 
GCTGTACAAGGACCTCAGCGAGTACTATGCCTTCGATCCCAGCAAATACACAATGGAGGAGTTCTTTGCGGACATCAAGACTTTCAAGGATGCCT 
TCCAAGCGGCCCACAACGACAATGTCCGGGTACGCGAGGAGCTGGAGAAGAAGCGTCGTCTGCAGGAGGCCCGAGAGCAGTCTGCTCGAGAGCAA 
CAGGAGCGCCAACAGCGTAAGAAGGCAGTGGTTGACATGGATGCCCCGCAGACGCAGGAAGGCGTGATGGACAGTCTGTTGGAGGCGCTGCAAAC 
GGGCTCAGCCTTTGGCCAACGAAATCGACAGGCCCGGCGACAACGACCGGCGGGAGCGGAGCGGAGGGCACAGCTCAGCCGGAGTCGATCGCGGA 
CGCGTGTCACCAACGGACAACTAATGACCCGCGAAATGATCCTCAACGAGGTTCTAGGCTCCGCGTAGAAGTAATGGAACGAGCAATTCCCCTCT 
GTACATATATAAATAGATATCTGTAGCGATATAAGGGCCTGGGCTCATTAGCCGTGAGTCCGTGTGCGTCGCCTTGCTTGCATATCAACTCAATT 
GGCGTGTCCAAGAGATTTGACGAATTGGATGCCCAGCAGGAAGGACTGTATCAAGAGCCGCAAAGTACTAATTAACTGGAACTAGAGTCACAGGT 
GGATTTTCCGAAATTATTAATTGCTGTAACGGAGTCCTTGGTTTATAGGATTGAGTTTTCCGTTTGAATTGATTGGAGTATTTATTTAATTGATT 
GTTTAATAAATAACATTTTGTTTAGAATTTATAATATGCAAATAACACATACAAGATCACTTTTATTTAATTAAAATGATTGGCAGGCCTTTTAA 
TTTCAGCTGTACATTTTATTTTACTGCTATGACTGTCCCAATATTAAAGTACCAATTTTACAAGTCGAATATACTATAAAAAAACTTCTAGTTGA 
AACTAAATTCATGTCAAGAAATTAGCAATGCATATCGATGAAAACTCCAACTTCGCTACCTATACGAAAAAACTAAATATATAGATATTTTTGAT 
ACGCTTTTACTTCTGAAATTATGTCGTTGTAAGTCTAAATACTTAAAAATGAAACTGAAAGCAACTTTAAGATTTATCTGTATCTTGTGGTCTTT 
ACATACTGTATTCCTATTTTACTGTGGCATGTTCCAATCATTTTGCTGTGCACTTCGTTTTCGAGGCACTCAGTAGAGCAGAGGCGAGATTTATA 
TCAAAGCAAAATTCCTTTTGTATTTAATTTGTTTTTAAGCGAAACTTTTTTGAACTTGTTTACCTTAAAATCCTATTTTAATCCGCGCTATACAT 
AACATTGTTAATAAATGTCTATTTTTCGACAATGACGGTATATTGAAAATA 
{SEQ ID NO: 125) 

Start ATG: 498 

MSRHEKTKSTGGGLLDSLFGRPSKSKGGTISSGTLAHGGRPVSADNYVVPGVEDFEQYIQQLSVAELDAKFLEIIEDMNIPKDKREPLLAKSKEE 
RQKMIMWHLKGKNSLERSANSRFEKPIDYVEYLQNGEHSTHKVYQCVESLRVALTSNPISWIKEFGVAGIGTIEKLIjARSKNNASYEKIEFEAIR 
CLKAIMNNTWGLNVVLNPDQHSVVLLLAQSLDPRKPQTMCEALKLLASFCIVYERNGYEKVLRAITTIAATSFKASERFRPIVDALFASDQQDPK 
RDLACHSLIFINTLTNTPTDLNFRLHLRCEIMRMGLYDRLDEFTKIVEASNNENLQQHFKIFNEIREDDFEEFVQRFDNVTFNMDDATDCFDVLK 
NLVTDTTSEPYFLSILQHLLYIRDDFYFRPAYYQLIEECISQIVFHKGYCDPNFENRNFNIDTSLLLDDIVEKAKAKESKRSEEYEKKIEQLESA 
KQEAEAKAAHLEEKVKLMEANGVAAPSPNKLPKVNIPMPPPPPGGGGAPPPPPPPMPGRAGGGPPPPPPPPMPGRAGGPPPPPPPPGMGGPPPPP 
MPGI^RPGGGPPPPPiynyiMGPMVPVLPHGLKPKKKWDVKNPMKRANWKAIVPAKMSDKAFW 

KPTTLTKKNVDLRVLDSKTAQNLAIMLGGSLKHLSYEQIKICLLRCDTDILSSNILQQLIQYLPPPEHLKRLQEIKAKGEPLPPIEQFAATIGEI 
KRLSPRLHNLNFKLTYADMVQDIKPDIVAGTAACEEIRNSKKFSKILELILLLGNYMNSGSKNEAAFGFEISYLTKLSNTKDADNKQTLLHYLAD 
LVEKKFPDALNFYDDLSHVNKASRVNMDAIQKAMRQMNSAVKNLETDLQNNKVPQCDDDKFSEVMGKFAEECRQQVDVLGKMQLQMEKLYKDLSE 
YYAFDPSKYTMEEFFADIKTFKDAFQAAHNDNVRVREELEKKRRLQEAREQSAREQQERQQRKKAVVDMDAPQTQEGVMDSLLEALQTGSAFGQR 
NRQARRQRPAGAERRAQLSRSRSRTRVTNGQLMTREMILNEVLGSA* 
(SEQ ID NO: 126) 

Name: DIAPHANOUS /FORMIN HOMOLOGY 
Classification: actin_binding 



FIGURE SHEET 56 



Gene Symbol: dia 
FlyBase ID: FBgn0011202 



Celera Sequence No, : 142000013384695 

CTCGCGCAACAACATACCATCGGTCATGTACTTGAGCAGCGTCTTGGCCGTGGAGCAGTCCTCGAAACGGATGGAGTAGCCGACCTCTTCGCCCA 
GTTTTACGTCCATCTCCTCCGAAACGCGCTGGGCCACGGACATGGCGGCAACTCGACGTGGCTGTGTGCAGGAGACTCCCTTGCGCCCCTTGGAT 
ACTGCGAAGTCCACGCACCACTGCGGGATCTGGGTCGTCTTGCCGGAGCCAGTCTCGCCCACCAGCACAATACACTGGTGCAGGCTGAGCAGCCG 
CATAAAGTCCGCCTGGTACTCGAACACGGGCAGCGCAATGCGCTTCTTGTAGAGATTCTGGTACCGTTGTGAGTACGGCGTGTTGGTCAGCGGGT 
TCATCGTGGGCGGTTGCTTGTTGGGCACGAATCCGCCCAGGATTTGAGCCACTGTGCCGGCAGCGGCTGCCGCCGACGAGGAGGACGCCTCCGGG 
TCCCTGGTGAGAAAGGAGCGTTACATTACTGCCGTATGTTGGCCTCGGGTCACTTACTTCTTCGCCTTGCTGCCGTAGGTCTCCCCCACTTCGAT 
TCGACGCTTGGACATGCTAATTTATGGTGATTACTGCACTTAATTCTCAAAACGAAGCGTTTAGCACAAAAAAAAACACGCGCTCAGTGAGACCT 
TTAGCATATACCGACCATGCTGATCGCAACTTTTAAAATACCAAAGTTTACTGTCTGTGTATACTATATAAGCCCATGGTTAGCGAAGATCACAA 
CCAACAGCACGGCAGCAATTTGGCATTTAAAAAAAAACAGCAGTTCAAGCTTAAGTCTTTTGACGAGAATCTATCTATAGAATTCGGTTTTACTC 
ACCAGTTTGAAGAATAGTTGTGAGCTCACAGTCTTAAAATTGACATTTGACAGCAATTTGATTGGAAACTGATTTTGATTTTTACAGTAACAAAC 
TGTTACATGGATTCGGTATTTTCGGCTTGGGTTGGCGCTTTAGCGTCGGCCACACTATCACATTGTATCGATTCGGAGGGAGTTGAATTTTTGCA 
CGTATTTGAAAGCGCAAAGATGCGAAACAAAGAGCAGGATATCGAATGAAAGAGCTCCTGCGAGACCAGGACTAGTTGCCTAACCACCATGGAAA 
TACCCATTCAGGTAGCGGTGCGCATCTTCCCGCATAGAGAGCTCAAGGACTTGTTGAGGAGCTTCGGCCCCACAGAGCCCAAAAAGGATGCGCAG 
GCGGTGGATGAGGGGGCGGACTCCAAGGACTCCGAGGCTCAAGTTCCGGCGGCTGAGAAGGACAATCCGTCAATTTCAGAGACGGACCCAAACGG 
GAATGCTGAGCAGGACAGTGCGGCCGATTCGAAGACAATTCCAGATGCCAATGGCAACGACAGTGGTCAGAAGGATTACCCAGACTCCGCATATT 
GCGTTCAGGCTATTCCCATCAGCGCTTCGGCCCTGGGATTGCCCAGTGCCCTGCCAGGCGGAGATCCCATGGACAGCATTGCAGCTGGACTGATA 
CAAGTCGGTCCCCACACGGTTCCGGTCACCCACGCCCTGCCCAGCAGCAGCTCCCAGGAGCAAGTGTACCACCAGACGGTCTTTCCGCTCATCAC 
CCTGTTCCTGGAAGGTTTCGACGCATCTGTAGTCACCTATGGCCAGAGGGGCCAGGGCAAAAGCTACACACTCTACGGAAACGTCCAGGACCCTA 
CTCTGACGGATTCCACCGAAGGAGTCGTCCAACTTTGTGTCCGTGACATTTTCTCACACATATCGTTGCACCCAGGTGAGTTTACTAAAAGGATC 
TTATAAGCATAGTTATCCATTTCCATTTTCACTTTAAGAACGCACCTATGCCATCAACGTGGGATTCGTGGAAATTTGCGGAGGTGATGTCTGCG 
ACTTGTTGGGCATGGGAAACATACACTGCACCAACGTAGATGCCGTCTTCCACTGGTTGCAGGTGGGTCTGTCGGCCCGCCAATCGTTGCCGGCG 
CACACCCTGTTCACGCTTACCCTGGAGCAGCAGTGGGTGTCCAAGGAGGGGCTGTTGCAGCACCGCTTGTCCACGGCCAGTTTCTCGGATCTCTG 
CGGCACAGAGAGATGTGGAGACCAGCCACCGGGACGTCCTCTCGATGCTGGCCTGTGTATGCTGGAGCAGGTGATCAGTACTCTCACAGATCCAG 
GCCTCATGTACGGGGTTAATGGCAACATTCCGTACGGTCAGACCACGCTCACCACTCTCCTGAAGGACTCGTTCGGCGGACGGGCTCAGACGCTC 
GTGATCCTGTGCGTGTCACCGCTGGAAGAGCACTTGCCCGAAACTCTTGGCAACCTGCAGTTCGCCTTTAAGGTGCAGTGCGTACGTAATTTTGT 
AATTATGAACACCTACTCCGACGACAACACGATGATCGTTCAGCCGGCTGAGCCTGTTCCCGAATCCAATTCCTCTGCTGGACCCTTGTCGCAGG 
CGGGACCAGGGGACAACTTTGGCCTACAATTCGCAGCAAGCCAATGGTCCAAACTAGTTACCAACGCCGAGGGGCTATTTTCCAAGTAAGTGTAC 
ACATTATTTCCGATGTGGGCCTTTTGACGACAAAAGAAATTTATAGGCTGATAGACTCCAAGCTAATTACTGAAGTGGAGAAGGAGCAGATCGAG 
GAGTGGCTCTTCCTCAAGCAGGAGTGCGAGGAGTGTCTCAGCTCAACAGAGGCTATGCGTCAGCAAAAACAGTTGGTTCCCATTCTGGAGGCCGA 
GGAGCCCGAGGACGTGAATTCTGAAGCAGCCAATTCGGAGTCGCCAAACTCCGACAACGAAAACGACACAGACAATGAGTCGCATCGGCCCGATC 
TGGACGACAAGATAGAAAGTCTAATGGAAGAGTTTCGCGACAAAACAGACGCTCTTATACTTGAAAAACACGCTGAATATCTATCCAAGCATCCG 
AAGGCGGTTATGCAAAGCCAAGACCGCGAGATTGAGGCACAGCCGCCAGAAGAAAATGGTGATGATCGAAAAGTCAGCATTGGCAGTAAGTCTTT 
GCCTTGAATAACATTTTTTTATACAGTAACATGTTGATTCCTTGCTTTAATAGGTCGCAGGAGAAGTGTTCAGCCAGGTGCTAGCTTAAGTACTG 
CTGAGCTTGCCATGCTTAATCGGGTAGCTTCCCAGCAGCCGCCTCCGCCCATCGATCCTGAGTCGGTCGTCGATCCTCTGGAAAGTTCTTCGGGC 
GAAGGAATCCGTCAGGCGGCTCTCGCTGCCGCCGCCGCCACTGCTCCTATTGAACAGCTGCAGAAAAAATTGCGCAAACTGGTCGCTGAGATCGA 
GGGCAAGCAACGACAGTTACGGGAAATCGAAGAGACAATCCAGGTAAAACAAAATATAATCGCCGAATTGGTCAAGAACAGCGATACACGCAGCC 
ATGCAAAGCAAAGATTTCACAAGAAACGTGCCAAACTTGAGGCCGAGTGCGACAAGGCCAAGAAGCAGTTAGGTAAGGCGCTAGTTCAAGGCCGG 
GATCAGTCGGAGATTGAGCGATGGACCACGATAATCGGACATCTCGAGCGTCGACTAGAAGACCTCAGCTCAATGAAGCATATTGCGGGTGAGAG 
CGGACAGAAGGTGAAGAAGCTACAGCAATCGGTGGGCGAGTCGCGAAAACAGGCCGATGATTTACAGAAAAAGCTTCGAAAGGAGTGCAAGCTGC 
GCTGCCAGATGGAGGCGGAGCTGGCCAAACTACGAGAATCCAGGGAGACTGGCAAAGAGCTAGTGAAGGCGCAAGGTTCTCCCGAGCAACAAGGC 
CGCCAGTTAAAGGCAGTACAGGCTAGGATTACGCACCTTAATCACATTTTACGCGAGAAGTCGGATAACCTGGAGGAGCAGCCGGGACCAGAACA 
GCAGGAGACCTTGCGTCATGAGATCCGCAACTTGCGCGGAACTCGTGACTTGTTGTTGGAAGAACGCTGTCATTTGGACCGCAAACTTAAGCGGG 
ACAAGGTGCTGACGCAAAAGGAGGAGCGCAAGCTGCTCGAGTGCGATGAGGCCATCGAGGCCATAGATGCGGCCATAGAATTCAAGAACGAGATG 
ATCACGGGCCACCGCTCCATCGACACGAGCGACCGAATTCAGCGGGAGAAGGGAGAACAGATGCTGATGGCACGCCTAAATCGTCTCTCAACGGA 
GGAGATGCGAACACTTCTGTACAAATACTTCACGAAGGTTATCGATTTGCGCGACTCTTCACGAAAGCTGGAGCTGCAGCTGGTGCAGTTGGAGC 
GTGAGCGGGATGCCTGGGAGTGGAAGGAGCGTGTTCTGTCCAATGCCGTGCGCCAGGCTAGACTGGAAGGCGAACGGAATGCGGTGCTGCTGCAG 
CGCCAGCACGAAATGAAACTCACTTTGATGCTGCGTCACATGGCGGAGGAAACGTCGGCCAGTTCGGCCAGCTACGGAGAACGAGCTTTGGCCCC 
TGCCTGTGTCGCCCCGCCGGTGCAGGCCAGTAGTGATTTCGACTACGATCATTTCTACAAAGGTGGCGGCAATCCAAGCAAGGCACTGATCAAAG 
CGCCAAAGCCGATGCCCACCGGCTCGGCGCTAGACAAATACAAGGACAAAGAGCAACGCAGCGGACGCAACATCTTTGCCAAGTTCCATGTGCTC 
ACCAGATATGCGTCAGCTGCCGCAGCCGGTTCCTCAGGGTCCACGGCCGAGGAATCCACGGCCCTGATTGAGTCAACCACCACGGCCACGGCAAC 
CACTACGTCGACAACCACCACTGGAGCCGTAGGAAAAGTGAAGGACAAGGCCCTGGTCAGCTTCAGGCCGGAGCAGCTGAAGCGTCTGATGCCAG 
CTCCGACGGCCACGAAAGTGACGCGTCAGAAGAACAAGATAATTATCCAGGACGCAAGTCGTCGAAACTAAATACGAATTTAACCATTTCCAGGA 
CGATATTATAAGGTTAAGCCAAATAGCCATATACAATTAATAGCCATGAGCTGAAATCCAGCCACTTTAATAATTAAGGCGATAATATAAAAAAA 
TGCTTAAAAATGTATAACAGATTACTCCCATATACATGCATACATTTTAATCAATAGTCCGACATATATACCAGCTTGAATTTATTAGTTTACTG 
CTTGTCGTTTAACTTATTAAAGTGGCTGGAGTTCGATATCTTTTTAAGTGTGAAGCAATAGCTCAGATCCTGGTAACAAGAACTGAACAAGTTTC 
GTCCACAGTCATGACACAAATAGACTACAACACATGGTTCTCATTCCGGCTCTTTAGCTGGGGAGCATTTATTTTAGAAACCCCCACACGTACAT 
ACACCTAGCATATGAAAGTAACTATCCTCATGAGATTCAACCGTAAAGACTTGACCTGCTACTTAAGATACTCGCAAGACAAACCGTACACAAAA 
CTCGCTCAAAGTGTTGACCTAAGTCTCCGCATTGTTTAACTTGTACATAACAATAATTATTAGGTAACCACTTAATTTTAATTGATTTTGTTAGT 
TTTAGCTTAGTTGTAAACACTATTGTGTCTTCCCTCGAATTTTAATTGACCTTCGATACCAATTACAACATACGTACTCCATACCCAAGTGTTAA 
CCTTTAATATTCTATCAAAAGCATTTACTCTGTACAACTTATAAATGCTTCAAATCGCAGTATTTGCGCTAATCCAATACTCATTTTGCATTAAC 
TTAAAATTAAAATTCAATACTGTTTGGTTGTGTACAACGTATTTTTTATTGTCCATTGATATAGATGCAGTTTGCACTGTACTCCGAGTAACTTA 
AATTGATATTAATGTATGTATGTTCCCTCAATATCAGCGTGTGAAAAATTATAAACACATTTGTAAGCATACATATTCCGCCAAATCGCTCGGCA 
TGGCTTTGTGTGGATATTGTTGGTTAAGCAAATAAATATATACAAACTAAACTAATCAATCAGTGTTTAATCAGTGTTTTATATTCAAAATAAAA 
TAGTTATCGTCTTATAAAGATAATTAAGTTCTACATTTGCAGCTGAAATCCAAGCAAACACAGTCCACAAGGCAACAGGTGAGAAATGTCCCAGA 
AATTTCCCAAACAATCGCCCCTTCGGACACACACACACACAGAAAGCGCAAAGAGAGAGGATGAGAGGGAGAGAGAGAGAGCGCCGGTTTAAGCG 
TTGTATCTAAATCCGATGGCGAGTATCTGAGCATCCGATTCAGTCTTGTTTGGACCGTCGAGCGATCGAGAACCGGATTTCAGCGCTAGTCGGCG 
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AATCGCGAGAGCGTGCGAGTATTTGTGTGTGCGTGTGAGCGTTTGTTTTGGGACACCGATACGGATACGGGTACGGATAGACGGATATACGGCAG 
CGGTACGCGATTAGCAATCCGCCAACCGGTTGCAGCGGTGCTAGTGCACCACCCTACAAAAAAGTGAATAATAGTATGCAAAGTATCTGAAAGAT 
AGTCAAGAGCAGAGCAAAGTATCTGTGTTTGTGTGTGCCGTGCTTTTGTGCGGAGTCGCCACCCCCGTGGATTAGTGGCACCCTGCTGCAAATAG 
CCCACCACTCTCACAAGTGTGTGTGTGTGAGTGTTAACAATTAATCGCATTTAAAAAACAATTTGGCCAGCCGCAGTAACAATAACGGTAAGAAA 
AGTCGGCGGTACCGTTGGGAAATTTTCACCGTTGTCTGGCAACTTCTTTGTGCCCGTCGTGTATGTATAGGTTGGGTGCAGAAGTGTGCGTGAGT 
CCTGCCCACGCCCACCGAGTAGCGCTAGTCTGCGAAATCAACTGCAGGATGCTGTAGGTTGATATTTTATCGGTTAGATGATAATAGTCTTTGGT 
CTTCCTTATGATTTTTCGCCAGACGTTAAGTTTTTTCAATTTGGGAATGCAAAAACGGTTACTTTATGCACCGTATCACATCACACACACTTGCA 
AAAGGGT GTGAAAA 
(SEQ ID NO: 127) 

Exon: 1001.. 1785 
Exon: 1844.. 2555 
Exon: 2612. .3030 
Exon: 3094.. 5949 
Start ATG: 1134 

Transcript No. : CT4888 

CACACTATCACATTGTATCGATTCGGAGGGAGTTGAATTTTTGCACGTATTTGAAAGCGCAAAGATGCGAAACAAAGAGCAGGATATCGAATGAA 
AGAGCTCCTGCGAGACCAGGACTAGTTGCCTAACCACCATGGAAATACCCATTCAGGTAGCGGTGCGCATCTTCCCGCATAGAGAGCTCAAGGAC 
TTGTTGAGGAGCTTCGGCCCCACAGAGCCCAAAAAGGATGCGCAGGCGGTGGATGAGGGGGCGGACTCCAAGGACTCCGAGGCTCAAGTTCCGGC 
GGCTGAGAAGGACAATCCGTCAATTTCAGAGACGGACCCAAACGGGAATGCTGAGCAGGACAGTGCGGCCGATTCGAAGACAATTCCAGATGCCA 
ATGGCAACGACAGTGGTCAGAAGGATTACCCAGACTCCGCATATTGCGTTCAGGCTATTCCCATCAGCGCTTCGGCCCTGGGATTGCCCAGTGCC 
CTGCCAGGCGGAGATCCCATGGACAGCATTGCAGCTGGACTGATACAAGTCGGTCCCCACACGGTTCCGGTCACCCACGCCCTGCCCAGCAGCAG 
CTCCCAGGAGCAAGTGTACCACCAGACGGTCTTTCCGCTCATCACCCTGTTCCTGGAAGGTTTCGACGCATCTGTAGTCACCTATGGCCAGAGGG 
GCCAGGGCAAAAGCTACACACTCTACGGAAACGTCCAGGACCCTACTCTGACGGATTCCACCGAAGGAGTCGTCCAACTTTGTGTCCGTGACATT 
TTCTCACACATATCGTTGCACCCAGAACGCACCTATGCCATCAACGTGGGATTCGTGGAAATTTGCGGAGGTGATGTCTGCGACTTGTTGGGCAT 
GGGAAACATACACTGCACCAACGTAGATGCCGTCTTCCACTGGTTGCAGGTGGGTCTGTCGGCCCGCCAATCGTTGCCGGCGCACACCCTGTTCA 
CGCTTACCCTGGAGCAGCAGTGGGTGTCCAAGGAGGGGCTGTTGCAGCACCGCTTGTCCACGGCCAGTTTCTCGGATCTCTGCGGCACAGAGAGA 
TGTGGAGACCAGCCACCGGGACGTCCTCTCGATGCTGGCCTGTGTATGCTGGAGCAGGTGATCAGTACTCTCACAGATCCAGGCCTCATGTACGG 
GGTTAATGGCAACATTCCGTACGGTCAGACCACGCTCACCACTCTCCTGAAGGACTCGTTCGGCGGACGGGCTCAGACGCTCGTGATCCTGTGCG 
TGTCACCGCTGGAAGAGCACTTGCCCGAAACTCTTGGCAACCTGCAGTTCGCCTTTAAGGTGCAGTGCGTACGTAATTTTGTAATTATGAACACC 
TACTCCGACGACAACACGATGATCGTTCAGCCGGCTGAGCCTGTTCCCGAATCCAATTCCTCTGCTGGACCCTTGTCGCAGGCGGGACCAGGGGA 
CAACTTTGGCCTACAATTCGCAGCAAGCCAATGGTCCAAACTAGTTACCAACGCCGAGGGGCTATTTTCCAAGCTGATAGACTCCAAGCTAATTA 
CTG AAGT GGAG AAGGAGC AG AT CG AGG AGTG GCT CTTC CTC AAGC AGG AG T GC GAGGAGT GT C T CAGCT C AAC AG AGGC TAT G C GT C AG C AAAAA 
CAGTTGGTTCCCATTCTGGAGGCCGAGGAGCCCGAGGACGTGAATTCTGAAGCAGCCAATTCGGAGTCGCCAAACTCCGACAACGAAAACGACAC 
AGACAATGAGTCGCATCGGCCCGATCTGGACGACAAGATAGAAAGTCTAATGGAAGAGTTTCGCGACAAAACAGACGCTCTTATACTTGAAAAAC 
ACGCTGAATATCTATCCAAGCATCCGAAGGCGGTTATGCAAAGCCAAGACCGCGAGATTGAGGCACAGCCGCCAGAAGAAAATGGTGATGATCGA 
AAAGTCAGCATTGGCAGTCGCAGGAGAAGTGTTCAGCCAGGTGCTAGCTTAAGTACTGCTGAGCTTGCCATGCTTAATCGGGTAGCTTCCCAGCA 
GCCGCCTCCGCCCATCGATCCTGAGTCGGTCGTCGATCCTCTGGAAAGTTCTTCGGGCGAAGGAATCCGTCAGGCGGCTCTCGCTGCCGCCGCCG 
CCACTGCTCCTATTGAACAGCTGCAGAAAAAATTGCGCAAACTGGTCGCTGAGATCGAGGGCAAGCAACGACAGTTACGGGAAATCGAAGAGACA 
ATCCAGGTAAAACAAAATATAATCGCCGAATTGGTCAAGAACAGCGATACACGCAGCCATGCAAAGCAAAGATTTCACAAGAAACGTGCCAAACT 
TGAGGCCGAGTGCGACAAGGCCAAGAAGCAGTTAGGTAAGGCGCTAGTTCAAGGCCGGGATCAGTCGGAGATTGAGCGATGGACCACGATAATCG 
GACATCTCGAGCGTCGACTAGAAGACCTCAGCTCAATGAAGCATATTGCGGGTGAGAGCGGACAGAAGGTGAAGAAGCTACAGCAATCGGTGGGC 
GAGTCGCGAAAACAGGCCGATGATTTACAGAAAAAGCTTCGAAAGGAGTGCAAGCTGCGCTGCCAGATGGAGGCGGAGCTGGCCAAACTACGAGA 
ATCCAGGGAGACTGGCAAAGAGCTAGTGAAGGCGCAAGGTTCTCCCGAGCAACAAGGCCGCCAGTTAAAGGCAGTACAGGCTAGGATTACGCACC 
T T AA T C AC AT T T T AC GC GAG AAGT C GG AT AAC CT GG AGG AGC AGCC GGG AC C AGAAC AGC AG GAG AC C T T GC GT C AT GAG AT C C GC AACT T GC GC 
GGAACTCGTGACTTGTTGTTGGAAGAACGCTGTCATTTGGACCGCAAACTTAAGCGGGACAAGGTGCTGACGCAAAAGGAGGAGCGCAAGCTGCT 
CGAGTGCGATGAGGCCATCGAGGCCATAGATGCGGCCATAGAATTCAAGAACGAGATGATCACGGGCCACCGCTCCATCGACACGAGCGACCGAA 
TTCAGCGGGAGAAGGGAGAACAGATGCTGATGGCACGCCTAAATCGTCTCTCAACGGAGGAGATGCGAACACTTCTGTACAAATACTTCACGAAG 
GTTATCGATTTGCGCGACTCTTCACGAAAGCTGGAGCTGCAGCTGGTGCAGTTGGAGCGTGAGCGGGATGCCTGGGAGTGGAAGGAGCGTGTTCT 
GTCCAATGCCGTGCGCCAGGCTAGACTGGAAGGCGAACGGAATGCGGTGCTGCTGCAGCGCCAGCACGAAATGAAACTCACTTTGATGCTGCGTC 
ACATGGCGGAGGAAACGTCGGCCAGTTCGGCCAGCTACGGAGAACGAGCTTTGGCCCCTGCCTGTGTCGCCCCGCCGGTGCAGGCCAGTAGTGAT 
TTCGACTACGATCATTTCTACAAAGGTGGCGGCAATCCAAGCAAGGCACTGATCAAAGCGCCAAAGCCGATGCCCACCGGCTCGGCGCTAGACAA 
ATACAAGGACAAAGAGCAACGCAGCGGACGCAACATCTTTGCCAAGTTCCATGTGCTCACCAGATATGCGTCAGCTGCCGCAGCCGGTTCCTCAG 
GGTCCACGGCCGAGGAATCCACGGCCCTGATTGAGTCAACCACCACGGCCACGGCAACCACTACGTCGACAACCACCACTGGAGCCGTAGGAAAA 
GTGAAGGACAAGGCCCTGGTCAGCTTCAGGCCGGAGCAGCTGAAGCGTCTGATGCCAGCTCCGACGGCCACGAAAGTGACGCGTCAGAAGAACAA 
GATAATTATCCAGGACGCAAGTCGTCGAAACTAAATACGAATTTAACCATTTCCAGGACGATATTATAAGGTTAAGCCAAATAGCCATATACAAT 
TAATAGCCATGAGCTGAAATCCAGCCACTTTAATAATTAAGGCGATAATATAAAAAAATGCTTAAAAATGTATAACAGATTACTCCCATATACAT 
GCATACATTTTAATCAATAGTCCGACATATATACCAGCTTGAATTTATTAGTTTACTGCTTGTCGTTTAACTTATTAAAGTGGCTGGAGTTCGAT 
ATCTTTTTAAGTGTGAAGCAATAGCTCAGATCCTGGTAACAAGAACTGAACAAGTTTCGTCCACAGTCATGACACAAATAGACTACAACACATGG 
TTCTCATTCCGGCTCTTTAGCTGGGGAGCATTTATTTTAGAAACCCCCACACGTACATACACCTAGCATATGAAAGTAACTATCCTCATGAGATT 
CAACCGTAAAGACTTGACCTGCTACTTAAGATACTCGCAAGACAAACCGTACACAAAACTCGCTCAAAGTGTTGACCTAAGTCTCCGCATTGTTT 
AACTTGTACATAACAATAATTATTAGGTAACCACTTAATTTTAATTGATTTTGTTAGTTTTAGCTTAGTTGTAAACACTATTGTGTCTTCCCTCG 
AATTTTAATTGACCTTCGATACCAATTACAACATACGTACTCCATACCCAAGTGTTAACCTTTAATATTCTATCAAAAGCATTTACTCTGTACAA 
CTTATAAATGCTTCAAATCGCAGTATTTGCGCTAATCCAATACTCATTTTGCATTAACTTAAAATTAAAATTCAATACTGTTTGGTTGTGTACAA 
CGTATTTTTTATTGTCCATTGATATAGATGCAGTTTGCACTGTACTCCGAGTAACTTAAATTGATATTAATGTATGTATGTTCCCTCAATATCAG 
CGTGTGAAAAATTATAAACACATTTGTAAGCATACATATTCCGCCAAATCGCTCGGCATGGCTTTGTGTGGATATTGTTGGTTAAGCAAATAAAT 
ATATACAAACTAAACTAATCAA 
{SEQ ID NO: 128) 

Start ATG: 134 
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MEIPIQVAVRIFPHRELKDLLRSFGPTEPKKDAQAVDEGADSKDSEAQVPAAEKDNPSISETDPNGNAEQDSAADSKTIPDANGNDSGQKDYPDS 
AYCVQAIPISASALGLPSALPGGDPMDSIAAGLIQVGPHTVPVTHALPSSSSQEQVYHQTVFPLITLFLEGFDASVVTYGQRGQGKSYTLYGNVQ 
DPTLTDSTEGVVQLCVRDIFSHISLHPERTYAINVGFVEICGGDVCDLLGMGNIHCTNVDAVFHWLQVGLSARQSLPAHTLFTLTLEQQWVSKEG 
LLQHRLSTASFSDLCGTERCGDQPPGRPLDAGLCMLEQVISTLTDPGLMYGVNGNIPYGQTTLTTLLKDSFGGRAQTLVILCVSPLEEHLPETLG 
NLQFAFKVQCVRNFVIMNTYSDDNTMIVQPAEPVPESNSSAGPLSQAGPGDNFGLQFAASQWSKLVTNAEGLFSKLIDSKLITEVEKEQIEEWLF 
LKQECEECLSSTEAMRQQKQLVPILEAEEPEDVNSEAANSESPNSDNENDTDNESHRPDLDDKIESLMEEFRDKTDALILEKHAEYLSKHPKAVM 
QSQDREIEAQPPEENGDDRKVSIGSRRRSVQPGASLSTAELAMLWRVASQQPPPPIDPESVVDPLESSSGEGIRQAALAAAAATAPIEQLQKKLR 
KLVAEI EGKQRQLRE I EET I QVKQN I I AELVKNSDTRSHAKQRFHKKRAKLEAECDKAKKQLGKAL VQGRDQSEI ERWTT I I GHLERRLEDLSSM 
KHIAGESGQKVKKLQQSVGESRKQADDLQKKLRKECKLRCQMEAELAKLRESRETGKELVKAQGSPEQQGRQLKAVQARITHLNHILREKSDHLE 
EQPGPEQQETLRHEIRNLRGTRDLLLEERCHLDRKLKRDKVLTQKEERKLLECDEAIEAIDAAIEFKNEMITGHRSIDTSDRIQREKGEQMLMAR 
LNRLSTEEMRTLLYKYFTKVIDLRDSSRKLELQLVQLERERDAWEWKERVLSNAVRQARLEGERNAVLLQRQHEMKLTLMLPJiMAEETSASSASY 
GERALAPACVAPPVQASSDFDYDHFYKGGGNPSKALIKAPKPMPTGSALDKYKDKEQRSGRNIFAKFHVLTRYASAAAAGSSGSTAEESTALIES 
TTTATATTTSTTTTGAVGKVKDKALVSFRPEQLKRLMPAPTATKVTRQKNKIIIQDASRRN* 
( SEQ ID HO: 129) 

Name: KINESIN-LIKE PROTEIN 
Classification: motor ^protein 
Gene Symbol: cos 
FlyBase ID: FBgn0000352 



Celera Sequence No. : 142000013384776 

ACGTTTTCAAGGGCAGGCAGCTTTGTAATTTAATTCAATTTCTATTTAATTGAACCAAAGGCGTCTTTAATGTGCAACTTTTTGATTTATGAGTC 
CGCAACAGCAAGCGGCTGTCAAGCAACTTGTTCATTTCACTCCACTTCTTTCTTAGTTTCCCTCTCTTGCCTTCGGGAAAATTTAAAATCTTACT 
ACGAAATTTCCATGCAAACGAAACATTTTCCGCCAGCCATGCCGCATTTTGCGTTTCCTCTTCTTTTTCTTCCTGTTTTTTTGGCACAAATTTAA 
TTTGATTTGATTTTGGCCGCTGGACATGGCAAGCCCCAGGCAATTTGTCAAAAAGTTATATAGCTACTGTGTATTTTTATGTGCCGGCATATTTA 
ATGACATTTTCCTCATTGCGACTTTTTTTTTTTGAATTTTCAAAAAATACAAAATACAATATCTTCTAAAATCCATGAGCAATGCCCAATTCTTG 
AAGTTTTTGATGGCCATGCAAGTGATGTGCCATTTGTTGAGTAAGTATATCTGTGGGCAAAGATTTCTAGCTTAGGCGTGTGCAATGTATGAGGA 
ATTAATTTGTGTAAAATTGGCAAACAATGGAGGAGATGGGCCGAGTAAAGGTAAGCAATTTGTTTGAATTCGCTTGGTAAATTAATTGCACAGGT 
TTGGAAAATTAATGGACACCTGCAGATCCACCAGTGGCATTGTATGTTGGCCATTGTGTGGTTATTGAATTTAAGTTCCATAAGCATTCAAATGA 
AGATGCTGTAATTTTGCAGGAATATATAAATAAGTATATGGCCGTTATTAAATGCTGACCCACATTTTGTGTACTGGCAAAGCCAGTTTTTTTTT 
TGTTGGTTGTTAAAAACCATTTTGATTCTTGGCAAATCTGGTTCAATTAACTGCAGCCCAAAATATCATTTAATATCGCGATATGACCACAAAAA 
TCATATATAAGAAGTGGCGAGCCTGGTCACCACAGCAAAGCGACTGAAAGCGAAGTCTGATAAAGAACAATCCTTACAAAATACAAAAAAATATA 
TATATTTAAAAAATACAATTGAGCAACCAAACATGGCCAACATTCTAAGTTTTACCATCTTTATGGCTGCCATTGCTATTGGACTGGCTTTAGCC 
ACAACAACGACAGCAAGTACAAGGACACCCATGCCCACTTCCACTGAAAAATCCATGGAAAAGGACGATCCCATGGAGTCCTTGGAACGCCGTGA 
CAAACGCCAATTGGTAAGTGATTATTGAAGCAAACTAGTCGTGGCAAACTTAGGTGTAGATTTTAGGTTAAATCAATCAATTTTTAAACATTTAC 
CTTTCCTCTTAATCGTATTCCCAGACCGCCAATGGTGGCCAGAGCAGCCTGGCTGGTTCCAGCGGCAACTATCCCGTTTTCTTCGATGGCCTTAA 
TGTGAATCCCCCGCTGCAGCCACTGCCGCCACTGCAGCAACTTCAGCCTCTGCAGCCATTGCAGCCGATTACGGTGGTGACTCCTGTGGCCTCCA 
GTTCGAATGCCCAGAGCCAGCAGCCGCAGTCCGGCTGGCTGCCTCAGTTTAGCCAGAATCTTCCACTTATCGGAACCTGGGTGGGTGGCCTGCCC 
AATTTGATCAACGGCCTGGGACTCGGTTCGCTAAACGGCGGCTTCGGCACTCTGGGCGGCCTGAATCTGGGCAACCTGGGACTGGGTGCAGTGGC 
GCCGGTGGCCGCGGTGCCGGGCCAACTGCTCGGCTCGTCACCCAATCCCCAGACCACATGCCCACTCACCCAGAAGCTCAACTGCCGCTGTGAGC 
CGCTCGTTCAGCTGCCGGGCCTGAAGCCCAGTTCCAATCCACTGAGGACGCAGCTGGTGCAGATCATGCGGCAGAATGCGCGGAAGAACGAGGAC 
GGTTCGCAGGAGGTGCGCGTAATCCTGAGCAGCGGACATGTGATCTACCAGCGGACGGCCAAGGATCGCGGCCAGAGTGGCTACCTTGCCATGAG 
AATTCAGTCGGGCAGGTACTTCAACATCTACTACAGTGTCAACGAGAAGGAGTACACGGTGCGGGCGGATGTCAAGGACACGCCACCCACCTCCG 
ATTTTGACGACGAGCTGAGTGGTCAGTTCTAAAAATTGGTCAAAGGGATTTCCCTTTACTTACCTACAGTTTTTTTTTCTATACTTAATACTTAA 
CTAGCCATTTTTTATATGAAATGCACTTAAACATATGTATTTGAATAAGCATTTAAACATATTTATGCTTATATGTTAAGCGTGCCCGATTTGAA 
CCAAGCCGCCGAAATTCTTCGTAAACTCCATTAATCCCTTTTTTATTCGCAACTTTTTTTTTGAACTTAAACAACTACAAACTAGCTGCGAAAGA 
AATAAACTACAATGAAACACATTTTCTGACGATTGCTCCTTGGGGACAAAAAGTTTAGCTACAAAGACAAATATTCATGGGGGTGCGGCAATAAA 
CTTGTTTTTATTGCCGAAAATGTTGTTAAAGGGTTAACAAGGTGGCAACGGGGGGGAGGTTGTTTCTTAGCCCTTAGCCCAATTTCCAAAAAAGG 
AACTTTCCTAATTACTGAAGCCACTTGGAAAAGAGATAAATAAATATTTACTCAGGCAATATTTAAGCAATAATTGTTCGCTTATTATTATCATT 
TTTCAGTATCAACTGATAATTATTCGTGAATAAAAGTTTTAACAATGGATATAGAAGTCTTTGTATTTTATAACTATTAAGTGCTCAAGTTTGTA 
TATATGGCATAAATAGTTATAATTTTGTTTGTTTATTAGTTGCAGCAAGCCGACACCAGTTGAATCTGCTGGTCATGGAAAAACCCACAGAGAAA 
GAGGCCAGCAAACAAACAAAGAGCCAATGGGCAAATGGAAATCAAGCTGTAAAATGCCATATCCCTATCCCCCTCTCCACCGCCCCCCCCCCCCT 
CACATCTGCGCTTTTCCGCACTTCACTTTTCCCCATTTTGCCTTTCAAGTTGCGCTCAGAAGTTTCGCAATGAGCCGGGAAATGGAAACTGGCAG 
GATATGCCCGGAAAAGCGAAGTCGAGTGCATGGTGGTAGAGGGGGGCGGGTTGCGGTGCGGTGTGTTGTATAATTAAAAGTTCCCTTTTGTGCCG 
AGATTGGTGGATCCTCCCAAAAGGCAGCTACATTTTGATCCTTGCCAGCTGTGATTAGTCAGTGAACAGCAGGAGTTCTCAAATATAGTAAAGTA 
GTCTTCCTTTTTTTTTGTTAACTGGAGGTTAACTGGAAGTTTAATATTAAGTAAAGGTTAAGTTGTAAGAAAAAAAAATAGATTACATTTTAAAC 
TGCACATTAAAATTAAATTGTAAACATAAAAGAAACCAAGAACTATTATTTAGTTTATTATTTAGCTTTTCAGATTGCTTTTGAAAAG7^AGCTAA 
GTTTTGGTGTTTAAATTGTGAAATTGCGTGATTTTCGATTATGGGGCCCGATTTGTGGGGTGCTTAAAGCACTTTCGGTAAATAATTTACACAGT 
TTGTGCATAAAAGTTGCTTCTGCTGGGGCCTTTTAACCGGGGCATGGAAGGGGGCGTGACTGGCAATGAAACCGCATTGCTCGGTGCACACGAAA 
CTTTTCACAAATCATAAATGGTCCCAGGACACACACACAAACACACACATTGCAGTTTAGTTCAGTGACTACCCAGCTTTTCAGCCACCCGTTTC 
CCCCCTGTATTTAAAAAGTCCCCCATTTCAGAATGACCACCTCCCCCCCCCCCCCCACTCCACCTTTGAGGGAGCCCCTCTCAACCAACCGCCTC 
ATTGGTTATGCCAATTGCAAAAAGTCCTGGGCAACAATTGAAATTGCCGGACTCGTTGATGAGACTTTAAGCTCTGGCTGCAGTTGAAACCCCGC 
CCCTAATTACGTATTACCAGATGACTGCCATTAAGTGTTCCACAACCCCCCGCCCCTCACTCCGCACCCCTTCCCCTGACCACAGCCCCTCTTTT 
AAAGCTGAATCATCTTAAAGTGCTGGACAACAAGGACGCACCATTAAATCCACTGAATGCTCCACATTTCGGTTGGCTTTGGTTTGTTGATGTTG 
CCAGTGTTGGGTGGGCATGGCCCAAGGGTTGGGGGGCTGATTCCTGCGGGTTATATAAGCGTTGGCGCTGCTTCTGCACCCATTAGCTGATACGG 
CGGATCAGCGCGAGGGAGGAGGAATCCAAACCCAAGCCCAAAGCCAAAGCCAGACCAGTCGCAACCGCAGACCCACGTGAGTGACCGGCGACGAC 
TAGACAGTGCGAGTCCATGCTTGGATGCATACATACATACATACATATGTATGTGTGCGTCTGGATGTGACCCACGGCGATGGCCCCTGAAACCA 
GGATGCACCTAGCCCAGAAGCAACCCTCCTGCGTTCGCCGCCCCTGGCTACCCTCGAAAAACTGCCGCATTGTGTGGCCAGAAAGCGAAACCGCG 
ACTGCAACTGGCCCTGCAATAGAAAATGCACTGCAAAAAAACTGGGAGTATGTATTTAAAGTACTTGATAAAGTTGGATTATTAAATAACACAAT 
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TAAATTTAATATTAATAGAGAAGTAGTATTCTTATCAAGATGAGAACTAAGGAGCTATGTTAATCTTGAACCAGGTGTGGCCAACTTTATTTTTC 

ACAGTGTGGCAAGGATAAGCAAAGCGACCAGAAAAGTATAACCAGCAAAGATGGACCCATGGAGAACGGATGCGAATCGAGAGCGGAAAGCGTGA 

CAGATAGATGGGTTGGAGCAGCAAGGGGTTAATGAAGGAGGGCATTTAGTTCGTGGTACTGTGGAAATGATTAAAATGTATTAGGTGAAATTAAG 

TTGGCCAAACAGAGAAATGGGGGGGGGGGGTTTAAATAAAGAGGCCAAAATGAGGTTAAAACGTTGCACAAAGCCAAATGGAGCCTGGCTGGGCG 

AAAACAAATTCATTAGGCTGCAGTGGCATAAAAACATCATACCCACAAAAAGAAGATGCACAACGAAAACCGAACAAAGTTGATTTAAAGTATTA 

TAATCTGCAACATTTTTGCGTATTTCGAGAATGCAGCTTGTTTACCCAGTTTGAAGTCAAGTTAGACTTATCTCTGGTTAATTGGTTTGCTCGAG 

AGCAGGGCGCGCAAAGCTGCTGGGAAATCGTGGAAAAATCGTAAAGCTTAATAATGACACTCCCCACACAGCCCCTACCACCAGTCCAGCATTGG 

GAATTCACAAGCCACTGTAAACAGGTGCAAACGTTGCGTTGCACAGCAATTTCAGCAACCCCTCTTAACTTTTGCGCCCAGCTTTCCACGTTTTC 

C C TTGTTC AAAT AGTTGT AAATGGGCGAAAAT GC AAACAAAAAAC AGTGATCC AC T TGAAAGGC AAAAC AAGAAAAAT C ATTCAAAGAGGC AAC A 

ATGCTGGGAAAACAAAAGGAAAACGAACGAAAACGAACTGCAAAGACAGCGACCACTAAATAAGTGAAGTGCGTGGACGACTGGACGAGTGGAAA 

TGCGGAAAAGTGGGAAAGCGGAAAAGCGCGCGCAAGAAATTTGCAATGTGGCGAATAAATAAAGGATCAAGCGATTCCCCCATTCAAATTGGCGA 

ATATTGGGCGGTTAAATGGAGATACATGCACGGATGTAAGGCAACTCGAAAAGGTGGCACAACAAATGAACGAGATAACTGGTTAACCACGAGTT 

GTAAACGCGCAAGCTATTGTTATTAGTATGTTAAAAAGCGGAATAAACTTTTTGTGTTTAACTACAGCTCCATGAAATATGTATATGCTATATGT 

AACAAACAATAAGCGAGAAAGTTTGTCGGCAACAAAAAAAAAAGTAArAACCATTATGATATTATGAATGCAATTACATTCTTGACGACTCGAAA 

GAATTTTAAATGTATAAATTATAACGAGATAAAAAAATAAAATATAAATCAAATTTACTTAACATTTATAACATTTCTAGTGCAATATTTACTTT 

CACTATATGGGTTAATAAAATAAACTTGCCTGTTTCTCGATGTTAAACCCCATTACTTCCAATTGGCCAATCGATTAGAAATTCAACAAGAGTTT 

GCCAAGATCTGAGTCACTTGAGGTCGGGCTTCTTATCTGGCCATGTAGGTATATTCCCGATAAGCCCTATCAGCGACATCGAGGCGGZVAACACAA 

AAAGACCATCAAATCGACCACTTTCACAGTTGAAAGTCAGAGCTGCTGAAGAACGGACAAGAAAATGGCAGGTAAATAGGTCGTAAGGCCAAGAT 

GCCATAACAAAAAATGAAAACATTTAACTAGGTGGATGATATAAAACAGTTACCAATCTTTAAAGTACTTAATCAGTTGAAACTCAAAATAAATT 

AATTCAATTTTGGAAACTAACTTGGCCTAGCCAAATAAACTAGTTTACTTGCCATGAACAACAGAATTAAAGAGTTATACCATTCGGTGTGCTGT 

TAAATAATCGTTTCATTTTTCCATTATCCCCCAGACCACAGATGGATGCACTTCAGCAACGGCAGCGTGCCGCCGAATGCCGTGGTGGCTGGTCA 

CGATTCGGACGGGGACACCATCTACGTGGGCCGTGCCTTCTTCTCCAACGACATGCTGCCGGCGAAGGTGATTCCCAACAAGGGCAAGGCCTACG 

TGGCGTATGCGCGCGAGGAGCACGAGCTGGAGAACTACGAGGTGCTTTCCGGCTACAACTACGAGTGGCTGTCGGCGGAGAACGGGGAGGTGCCG 

CCGGGCGCCGTCAAAGTTGGTCGGAATGTGGACGGCGAGTATTTGTATGCGGGCAGGGGTTATCATGCCGGCAGCTTAACCATGGGCAAGGTGCA 

TCCATCCCACGGCTGCCTGTATATTCCCTACGATTCCGATGAGGTTAAGATCTTTGCCTACGAGGTGCTGTGCCAGCCGGAACGTTGGATCGACA 

CCACCGCAACAAATATTCCGGATGGAGCTCTGGTTGCTGGGCACGATTCCAACGGGGACACCATCTACGTGGGCAGGGTGTTCCGCAATGGTGAT 

CTGCTGCCAGCCAAAGTGGTGCCGGCGAAGGGGAAAGCCTATGCCGCCTATGCCCAGGCGGAGCACGAGCTCACCGACGTCCAGGTGCTGACGGG 

ATCCGGATTCCGTTGGGTTCCGGCCTCGCACGGAAATGTGGCTCCCGGAGCCCTGTCCTCCGGTCCGAATGTCGACGGGGAGCCCCTCTACGTCG 

GCAGGGCCATCTACTGCGACAGCTTGAGTGTGGGCAAGATCCATCCCTCGCACGGCTGCATCTACATCCCATTCGGTGGCGAGGAGGTGCGTCTG 

GAGAATTACGAGGTTCTGGTGAGGATCTAGACGGACAGCTTTGCCTTTATTCCATTTTTTTTTTAATAAATCTATTTTTTACACACACAACAGAA 

GT AT ACAG AT T AAAG TAT AAAT T T AAGT T T AAT T C G AT CG AC AT C G AGC AACGT AACT G AT AAAAT T T AT G GT AAT GC AC CC AGT AAT TC T AAAA 

AAAAAGTACCCCTTTCATAGAGTTTTGGTGATAATAGTTCAAAGTGGTTGTCATAGAAATTTCGAGGTATTGCGGATGTGAGGAATTTCTTCTAT 

ATGCGGGCCTTATGTTATCACATCGGTTATCAATTAGCAACAATCAATTGATTAGTTGGCCATAAATTTGCGTCAACTTGTAACATAATTAACCA 

TTTGCCTTCGCCCACTGACTTAACAAACACTGCATCGTAAAACTCAGACAGATATCGCAGTTATCAGAATGTGTACATATATGCACTCGATGCTT 

TATAACTCGACCTTTCGTCTGCCGAGGTAAAAAGTTATAGGTATCGAAACTCTTTGTAACTTTCTTTGTATATTGAAAAGTTTATTAAAACGTTT 

CATAAAGTTAAAGTCAATTCATTAGTGGCGGGCGTAGCGTTTTATTTCATAGCTTATTAATGTGCACACGTTGTAATCAATACATGTTTTAAGCA 

CTTATTTGACAGCACAAAGTTTCCCATGGAAATACTTTTTTTCGCAATTTATTTTATAGGAAAGTTGCTAAAAGGCCTTAATCTTAACACTATCA 

TTGGCGCCACCTATGAGATGATTATCTGCACTAACCGTGGCGCCACCTATGACGAGCTG 

<SEQ ID NO: 130) 

Exon: 1001.. 1248 
Exon: 1355. .1799 
Exon: 3884.-4585 
Exon: 6212.. 6246 
Exon: 6495.. 7134 
Start ATG: 1078 

Transcript No. : CT4928 

CGAAGTCTGATAAAGAACAATCCTTACAAAATACAAAAAAATATATATATTTAAAAAATACAATTGAGCAACCAAACATGGCCAACATTCTAAGT 

TTTACCATCTTTATGGCTGCCATTGCTATTGGACTGGCTTTAGCCACAACAACGACAGCAAGTACAAGGACACCCATGCCCACTTCCACTGAAAA 

ATCCATGGAAAAGGACGATCCCATGGAGTCCTTGGAACGCCGTGACAAACGCCAATTGACCGCCAATGGTGGCCAGAGCAGCCTGGCTGGTTCCA 

GCGGCAACTATCCCGTTTTCTTCGATGGCCTTAATGTGAATCCCCCGCTGCAGCCACTGCCGCCACTGCAGCAACTTCAGCCTCTGCAGCCATTG 

CAGCCGATTACGGTGGTGACTCCTGTGGCCTCCAGTTCGAATGCCCAGAGCCAGCAGCCGCAGTCCGGCTGGCTGCCTCAGTTTAGCCAGAATCT 

TCCACTTATCGGAACCTGGGTGGGTGGCCTGCCCAATTTGATCAACGGCCTGGGACTCGGTTCGCTAAACGGCGGCTTCGGCACTCTGGGCGGCC 

TGAATCTGGGCAACCTGGGACTGGGTGCAGTGGCGCCGGTGGCCGCGGTGCCGGGCCAACTGCTCGGCTCGTCACCCAATCCCCAGACCACATGC 

CCACTCACCCAGAAGCTCAACTGCCGCTTTGAAACCCCGCCCCTAATTACGTATTACCAGATGACTGCCATTAAGTGTTCCACAACCCCCCGCCC 

CTCACTCCGCACCCCTTCCCCTGACCACAGCCCCTCTTTTAAAGCTGAATCATCTTAAAGTGCTGGACAACAAGGACGCACCATTAAATCCACTG 

AATGCTCCACATTTCGGTTGGCTTTGGTTTGTTGATGTTGCCAGTGTTGGGTGGGCATGGCCCAAGGGTTGGGGGGCTGATTCCTGCGGGTTATA 

TAAGCGTTGGCGCTGCTTCTGCACCCATTAGCTGATACGGCGGATCAGCGCGAGGGAGGAGGAATCCAAACCCAAGCCCAAAGCCAAAGCCAGAC 

CAGTCGCAACCGCAGACCCACGTGAGTGACCGGCGACGACTAGACAGTGCGAGTCCATGCTTGGATGCATACATACATACATACATATGTATGTG 

TGCGTCTGGATGTGACCCACGGCGATGGCCCCTGAAACCAGGATGCACCTAGCCCAGAAGCAACCCTCCTGCGTTCGCCGCCCCTGGCTACCCTC 

GAAAAACTGCCGCATTGTGTGGCCAGAAAGCGAAACCGCGACTGCAACTGGCCCTGCAATAGAAAATGCACTGCAAAAAAACTGGGAGTATGTAT 

TTAAAGTACTTGATAAAGTTGGATTATTAAATAACACAATTAAATTTAATATTAATAGAGAAGTATCAGAGCTGCTGAAGAACGGACAAGAAAAT 

GGCAGACCACAGATGGATGCACTTCAGCAACGGCAGCGTGCCGCCGAATGCCGTGGTGGCTGGTCACGATTCGGACGGGGACACCATCTACGTGG 

GCCGTGCCTTCTTCTCCAACGACATGCTGCCGGCGAAGGTGATTCCCAACAAGGGCAAGGCCTACGTGGCGTATGCGCGCGAGGAGCACGAGCTG 

GAGAACTACGAGGTGCTTTCCGGCTACAACTACGAGTGGCTGTCGGCGGAGAACGGGGAGGTGCCGCCGGGCGCCGTCAAAGTTGGTCGGAATGT 

GGACGGCGAGTATTTGTATGCGGGCAGGGGTTATCATGCCGGCAGCTTAACCATGGGCAAGGTGCATCCATCCCACGGCTGCCTGTATATTCCCT 

ACGATTCCGATGAGGTTAAGATCTTTGCCTACGAGGTGCTGTGCCAGCCGGAACGTTGGATCGACACCACCGCAACAAATATTCCGGATGGAGCT 

CTGGTTGCTGGGCACGATTCCAACGGGGACACCATCTACGTGGGCAGGGTGTTCCGCAATGGTGATCTGCTGCCAGCCAAAGTGGTGCCGGCGAA 

GGGGAAAGCCTATGCCGCCTATGCCCAGGCGGAGCACGAGCTCACCGACGTCCAGGTGCTGACGGGATCCGGATT 

(SEQ ID NO: 131) 
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Start ATG: 78 



MANILSFTIFMAAIAIGLALATTTTASTRTPMPTSTEKSMEKDDPMESLERRDKRQLTANGGQSSLAGSSGNYPVFFDGLNVNPPLQPLPPLQQL 
QPLQPLQPITWTPVASSSNAQSQQPQSGWLPQFSQNLPLIGTWVGGLPNLINGLGLGSLKGGFGTLGGLNLGNLGLGAVAPVAAVPGQLLGSSP 
NPQTTCPLTQKLNCRFETPPLITYYQMTAIKCSTTPRPSLRTPSPDHSPSFKAESS* 
(SEQ ID NO: 132) 



Celera Sequence No. : 142000013384694 

GAGCTCCGCTGGCGATGATCGCACCCAGGATTCGACTGGCGCATCCGCCTCGCTGAGGGCGTTCATCTTCTTCACATAGTCCTCTGGTTTTTCGC 

AGAAGTTAGAACACCACTGTCTCAACAGACGGTCCCTTTCGGTCTCCACCTTCGGTTTGGCCCTAGTCTTCTCTAAGGTCTCATTTCTCTCCCTT 

TCGTGATGGCCACGTGGCGAATAGTCATCGCTGACTTGGTTTCTGGCTGCAGATCCATAGTATCCGTGTTGCGACCCCTTTGCCTGCCTGTGGGC 

ATAGTGTCCGTGTCGTGGCTCGTGGGCATATCGACTGTCCTGAATTTGACGGTGTTCCTGATCATGTCTTTGTTCATACCGCCTACCCGAGCTAT 

AGCCGGGATAATGGTGGTAGCCTTGACTGGAGCTGTTGGAAGTGTACCTCTGCCTTTCGGATGGAGGTCGGTAGTTGGACGAATAAGCAGGATAG 

CGGTCTGGGTATTTATAGCTTTCATAGAGGAATGATGATCCGGATGCCGGGTACTGGTATTGGTAATGGGACTCCTGGCCACCATACTTCTGTGC 

TTGTTCGTTGTAAGCATATCCCTCGTAGCCGTACGCTGGTGGTTGCTGGTACGGATCATAGCTAGGTACTGAGCTGGCATAAGGAGCCGGTGTCT 

CCGGGTAGACGTAATCCAAAGACTTCGAGCTCTGAGAATGGGATTCATTTGAAGAATAGTTGTGAGAGGGAACACCTACTCCCGGTGGAGATAGA 

TCTTCTTCTGGCGGAGGTGGCGGGGGCGGAGGAGCAGGCTGGACTGGAGGCGGTGGCAAAGGCGGCTGGTACATGGCGAAAAGTTTTGGGAAAAT 

CTTTGGCAAAATCCACAATTATACTTTTTTCCACCCACACCTTTTACCGCGATTCGAATGATCTGGCAACTCGACCGGTTTGTTTACATTCGCAC 

ATCCGGCAAGGCCGTGCCACATATGTTTCGATAGACAACGTAAGAAAATTATGTTACTTCCGCGGTATTTTTGCTGGAACCTCCAACGCTGCACG 

ATGGCGTTTTACGCTGTAGCCAGTGGGCGGCGGTCTGGAGTCTACGGCTCTTGGGCAGAGTGCGAGGAGCAGGTCAAGGGTTTCAAGAACGCCAA 

ATACAAGAAGTTTAAGACACGCCAGGAAGCGGATCAGTTCGTTAATGGCTGCAAGTCGTATGCTCCGCAGGATGTGGCCGTGCCGCTGGGCAAGG 

AAAAGGCGTCACTGGCCAGTTGGAAAAACAGTATCGAAGTGAATAAGAACCCCAAGTACACAGATGAGTGGCCCGAAGAGGATCACGACCTAGCC 

GAGGATGATCTGGTAGGCATGAATATACATACGTTTGCAAAGTCCTGACCTTTCATCCTTTCAGAATGCTGCCATGAACGAGGTTGAGGGAGATC 

CAAAGCCATCTAACAGTAGCAATTTAGTATGAGACCGTAGTACATACTGTAGAAAAAGTTAAACATAGCTTCTTTCAGCCAGACATTCTCAATCG 

CAAGCGGAAGGGCACAACCAGCGGCGATAAGCGGAACAAGATCCCACGTCATGCCTCTCAGGTCTCAGAAGCCACAGGACTCAAGCAAGTGGGTG 

CCTTCCAGTTTGAAATCGATGCAGAGGGCTATGTCATTGTGTACACAGATGGCTCCTGCATAGGCAACGGACGAGCCGGAGCCTGTGCCGGCTAT 

GGCGTTTATTTCGGCAAGAATCACCAGCTGTAAGTGGTAACTGCAGCACAATAACTTTGCTTTTAACCTATATGTTCTTTGCAGAAACGCAGCCA 

AGCCCGTGGAAGGACGCGTTACAAATAATGTAGGTGAAATACAAGCGGCCATTCATGCCATTAAAACAGCTCTTGACTTGGGAATACAGAAGCTG 

TGCATCAGCACAGACTCTCAGTTTTTGATCAACTCCATAACGCTGTGGGTTGCGGGTTGGAAAAAAAGGGATTGGAAGCTAAAGAACAATCAGCC 

TGTTAAAAACGTTGTTGACTTCAAGGAACTGGATAAACTGCTTCAGGAGAACAACATCACCGTGAAATGGGTGCGTAACTAAATTTAAATAAGGT 

CCTGGAAAATACTTGACTTCAGTTAATATCCCCAGAACTATGTGGAGGCCCACAAGGGCATAGAGGGCAATGAAATGGCAGACAAATTGGCGCGG 

CAAGGATCCGCCTTGTATAAGCAGAAAAATGGTTGAAAAGCTATTTTAAATTTACCGTATTTTTAGTATGTAATTGATATGGCAACGCTGAATAA 

ATTTTAGTTACGCGACAGCCCTGGTCCAAATGTTTTTATTTACAATTTACTTCAGTCGCTTTTAATATAAGTTAGTTTGCAAAAAATGCATTCCA 

AGGGAAATAACCAGTGCTTACTGTTCACCTTGGTTTTGGGCTTACTGGGAGGCGCGGGACATTGTTATGTAGTCGAACCACCTATCGTTCACGTG 

GAAATGGAAAAAGCCGGCATGCACAGGTTTTTATATTGGAGAGCGGTTTATGGATTACCTGAATTAAAGTGATAAATGTTTATGCAGGACCCTTA 

ATTACCGTCTGCGGTTCGATATACCCTTGGTGGGCAAGGACTGCGAGTATGCGCTGGTCCAGGATTTGCCCGCATCCGTTTACATAAGCACAGAC 

GAACTGGATGACCTGCAGCGTTTGAAGAGGGTGCGTTATCTTGGGAATTAGTTACAAGGGTTACTATATAAAAAAATGTTCGTCTCCAGTTAAAT 

GCCATCTATCCTAAGTTCGTCAACATTGAAGTAGCCACGGAACGGGCGCAGCCCTTCTCCGTTTTACTGCGCGGTACACCAAAGATTACGGAATC 

GCTGGCTCTGCCCGTGCATTTTCGCTACCATGCCCCCAGTGATAAGCGGTGTGGTGATCTTGAAGTCTGAATTGCTTTCGTCCACTAGACTAATC 

AATTAATCCTCACAGTTTGGCGGCAACGGTGGCCATACCCCTACCCGAGCTCTATCTCAACTGCCCGATGGCTGATAGTGCGCTCATCGAAAACG 

AGTTGGTTGCCCGGCCCGATAAGCTGTACTGCCTGAATGCGCCGGAAAGCCGTTTCGACGAGAACCATATCAAGGATGGCCAACCCACGACCATG 

GCTAATCTGGAGCGCTGCAACTGGAGGCGGGTGCACGTGGACTGCCAGCTAAAGATGCCGCTGCGTGCCGAGATTCCGGTAGGCCATGCTAGCGC 

CTACGGTCCAGTTCTGTGCGGCACCATCCTGCTAGGTTGGTCTCTGGCCCTGTGGACCAT 

(SEQ ID NO: 133) 

Exon: 1001.. 1342 
Exon: 1395.. 1451 
Exon: 1504.. 1739 
Exon: 1795.. 2065 
Exon: 2126.. 2290 
Start ATG: 1001 

Transcript No, : CT4974 

ATGTTACTTCCGCGGTATTTTTGCTGGAACCTCCAACGCTGCACGATGGCGTTTTACGCTGTAGCCAGTGGGCGGCGGTCTGGAGTCTACGGCTC 

TTGGGCAGAGTGCGAGGAGCAGGTCAAGGGTTTCAAGAACGCCAAATACAAGAAGTTTAAGACACGCCAGGAAGCGGATCAGTTCGTTAATGGCT 

GCAAGTCGTATGCTCCGCAGGATGTGGCCGTGCCGCTGGGCAAGGAAAAGGCGTCACTGGCCAGTTGGAAAAACAGTATCGAAGTGAATAAGAAC 

CCCAAGTACACAGATGAGTGGCCCGAAGAGGATCACGACCTAGCCGAGGATGATCTGAATGCTGCCATGAACGAGGTTGAGGGAGATCCAAAGCC 

ATCTAACAGTAGCAATTTACCAGACATTCTCAATCGCAAGCGGAAGGGCACAACCAGCGGCGATAAGCGGAACAAGATCCCACGTCATGCCTCTC 

AGGTCTCAGAAGCCACAGGACTCAAGCAAGTGGGTGCCTTCCAGTTTGAAATCGATGCAGAGGGCTATGTCATTGTGTACACAGATGGCTCCTGC 

ATAGGCAACGGACGAGCCGGAGCCTGTGCCGGCTATGGCGTTTATTTCGGCAAGAATCACCAGCTAAACGCAGCCAAGCCCGTGGAAGGACGCGT 

TACAAATAATGTAGGTGAAATACAAGCGGCCATTCATGCCATTAAAACAGCTCTTGACTTGGGAATACAGAAGCTGTGCATCAGCACAGACTCTC 

AGTTTTTGATCAACTCCATAACGCTGTGGGTTGCGGGTTGGAAAAAAAGGGATTGGAAGCTAAAGAACAATCAGCCTGTTAAAAACGTTGTTGAC 

TrCAAGGAACTGGATAAACTGCTTCAGGAGAACAACATCACCGTGAAATGGAACTATGTGGAGGCCCACAAGGGCATAGAGGGCAATGAAATGGC 

AGACAAATTGGCGCGGCAAGGATCCGCCTTGTATAAGCAGAAAAATGGTTGAAAAGCTATTTTAAATTTACCGTATTTTTAGTATGTAATTGATA 

TGGCAACGCTGAATAAATTTTAGTTA 

(SEQ ID NO: 134) 

Start ATG: 1 
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MLLPRYFCWNLQRCTMAFYAVASGRRSGVYGSWAECEEQVKGFKNAKYKKFKTRQEADQFVNGCKSYAPQDVAVPLGKEKASLASWKNSIEVNKN 
PKYTDEWPEEDHDLAEDDLNAAMNEVEGDPKPSNSSNLPDILNRKRKGTTSGDKRNKIPRHASQVSEATGLKQVGAFQFEIDAEGYVIVYTDGSC 
IGNGRAGACAGYGVYFGKNHQLNAAKPVEGRVTNNVGEIQAAIHAIKTALDLGIQKLCISTDSQFLINSITLWVAGWKKRDWKLKNNQPVKNVVD 
FKEL DKL LQENN I T VKWN Y V E AH KG I E GNEMAD KL ARQG S AL YKQKN G * 
(SEQ ID HO: 135) 

Name: RIBONUCLEASE HI 
Classification: enzyme 
Gene Symbol: rnhl 
FlyBase ID: FBgn0023171 



Celera Sequence No. : 142000013384605 

TTTGTTGATTTATTAAAACATTGTTATCTTGGTTTGTGTAATCGTATCTTGAACTAACCACTTGATGTGTTCTTTTAGTTGTCCTGCCCCATATT 
GGATCCGCCACGAAGCGCACCCGTGCCGACATGTCCACCATTGCTGCCCATAATGTGCTCCGCGGTTTGGCTGGGGAGCCCATGTTGTCGCCCGC 
CTACTAGTAAATTCAAATGGGAAATATGGCTTTCTACCGCTAACCAGTTTGGAATGTCCGCCAAACAAAGTGCGAAAGGCCGCCGATGACCTTGG 
C ATGCCAGT AGAT GTGAAAAT AAAAAAAAC AATT AAAAAACAAGT AT TTGCTGTTGCAAAT TT ATT AAAT T AAAT ATGTGGGAAT CT AATCT AT A 
CCGTAAGATTAGCTCAACTGATTAATATGGAATAGGTAAAGCAATAGAATAATTGGTGCTAGAGAATGAGAAACATAATTTAAATACTAATATAG 
TATATCGTAGGAAATAATATTGTACGTTTTAAGAAATGAGAGCAGAAAACATTGATTTAGTAATTCGCAAAGTTTAGCTCTGCACAATTGCAAAA 
CCTACCACTTTTCCCCAAAATGTATCGAGGACCTTCAAACTGCTGATTTCGCATCCAAAAGTCCCAGGAACGGCTGCGGGAGCTTGTCGGATCAG 
GATCGGAATAGGATCATCGCTTCTTAAAACTGGGTGCTTATGCCGCTTGAGAGTTCGCTCGCGAAACCCGACTTCGGTTGTAGTTTTTCAACCTG 
CTCCCACTGATTATGGCTTTGCTGATCGAATATAATAATTGAAGGCCCCTGGGCATGGCCCGTTGGTTTCAAGGTGCTGGGGGCGAATCCCAGGG 
ATTACTATGTGGGCATTTATCTGCAGAAAGAGAACTGCAACTGCAACTGCTGCAACCAGCAAGAAAGCAGACCGGCGACTGAGCAGGACAGTAGC 
CCCATAAGGCAGTGGCTCTAAATGAATACTGCCCCCAGCTGACTACGCCGATGTGCCGGAAGTCAAGAGGTGCAAAGTCCCCTGTCAACAACGAC 
GTGGTCGATTTGGCAGTGGGTTCGAGTCGTCGTTTCCACAAGTGATCCAAGAGGATGTACACCATGTTGTACAGTAAAGGATGTAGTAGTATATT 
AAATTGAACGATTAATCAGTATACCTGTTACTCCAAAAGTAAACGGTATACTTCTTTCAAAAATAAGTAACGGGAAGAAGAAAACGTTTTCATTG 
CATACAAGTATATGTCTGTCCTTTCGAACAACTCGATCTCAGGACATAAAGCATTTTTAATTTTTCAAAAATATGTATGTAACAGTTGTGGGCGT 
GGCAACGATTTGAAACAATCTTTTGCAGCGTAGACGTAATCTGTATGCCTAATCTAAAGATTACAGCTTTTTATAGTTCCTGTAGGCTATTGATA 
TCAGTCCTAGTTGCTTTTAGGGCCCCCACCTAACAAATACTCGTACTTCTGGCCCCTTATCAGTATAAAGATTTTTGTCTTATCTCTTTAGTGGT 
TGACAGACAGTGTCTAAAAGTATTTTGGCAAGATTTCATTAAGGTTTCTTGAAGGTTACAAATGAGTAGGTTAACGTTTTTGCTGTTTAGTTTTA 
TCATGTTTCTGCGTAAATGTTGATCTTTAATGTCACCTTTTTAGTTGATTTATTTAAAGATCGTTATTTTTGCATAACTTTCAGTTGTCCTGCCG 
CATATTGTGGCACCATTACTACCTATAATGTGATTCGTGGCTTGGCTGGGGATCCCATGTACCTAGTTGCCCACCTACTATTACATTTAAATGGG 
T T T AGC C GC AT C C C AG T AAATGT G AAAT AGAGTAAAAAAACAT T T TC C T GA T GC AAAT T GAT T AAT TAAT GTGG AAAT CT AAT C T AAT GGT AT AG 
TTTGACTGATTACTACGAAATATGTATGAAAATCGCTCATAGAAACGTAAATAAATAATGACTTTAATTATATAAAGCTATATATAAATAATCAA 
TTTTATAATTGTAAACTTTTCTTTTTACTGTAATACTTTAATACTATCCCTAAAAATTGTATAGGAACGCGCAGGAAATAGTAATGTGAATTTCA 
AGAGATGAGAGCAAATTAAAATGTAAAAGCTTAAGAGAACCTTTTAAGCCTAAGAGCGGGGCACAGAAAAAGCTTGAATTGTCATTTCGAAAATT 
TCCACTTAGTTCAGCCCTGCAATTCGCCGCTACTAGTTGATATTAAGTGCTATATTTTCCCCCGAAATGTCTCGTGCGACCAGGGCTTTTAAAGT 
GCTGATTTCGCATCCAAATGTCCCAGCACCGGCTCTGGAACTGCTCCGATCCCGTGGAGCGGAGACCATCATCTGCCAGAGTGTGCCGCCCTCGA 
GGGATGAGATCCTGCAGAAGGTGCCCGGCGTGGATGCCATCTATTGGGCCCATTACCAGCCCCTGAATGCCGGAATCCTGGATGCTGCTGGATCC 
CAGCTGCGTTGCGTGAGCACCATGTCCTCCGGAATCGATTTTGTGGATATTCCGGAGTTCCAGAAGAGGGGAATCCCATTGGGCCACACTCCCGG 
GGTGGTGAAAAATGCAGTGGCCGATCTCGCCATTGGCTTGATGATTGCAGCCGGTCGTCACTTTCATGCCGGTCGCACCGAAATCGAGAGGTAAG 
AAAAGCTGATGGGCGCTTTTTTCTCAGAAGCTTTTAATATTGAGAGCTTTTCGGATGAGGGTTGCCAACTAGTTGCACATGCCCTATATACGGCT 
CTACTTCTACACTCTTTTGGAATATCAGATACTTAAGTACACCTAATACTTACCAATCATTAATACGTTTCATATGAAAACATAATATAATATAA 
AAATATTTACAGGTCCCAATGGAAAATCGAGCAGATCAACTGGATGATGGGTCAAGAGATTCGCGATTCCGTCATCGGTTTCTTTGGCTTTGGCG 
GTATTAGTCAGGCCATCGCCAAGCGATTGCAGTGCTGGGATGTGGCCAAGATCATCTACCATACTCGCACCCGAAAGGAAAACGATGGCGACTTC 
AAGGCAGAGCATGTGTCGTTTGAACAACTTTTACAAGAAAGTGACTTCCTGGTGGTAGCTGCTCCACTTACAAACGAAACTAGAGAGAAATTCAA 
TGGAAAGGCATTTAATCTGATGAAGAGGAGTTCTGTGTTTGTCAATGTGGCCAGAGGAGGTAGGATTTTAACTACATAACTGTATGAACGCTATT 
GCATACCCGTGGATTTTTTCAGGTCTGGTAAATCAAACTGACCTTCATGACGCCCTGACAAATGGTACAATTTCTGCCGCAGGCTTGGATGTGAC 
CACACCAGAACCACTGCCCGCCAATAGTCCTCTTCTCAATGTGCCCAATTGTGGTAAGCTAGAACATTTATTTCTCTTCCCTGAGTGAACTGAAA 
TATTATTTATATATTTCCACAGTTATCCTTCCCCATATGGGCACTCAGACAATGAAGACTACCATTGAAATGGGATTGCTTGCTGCCAATAATAT 
TTTGAACGCCATCGAAGGGAAGCCCATGATAAGGCCAGCCTACTGAGAACATGCAAAATTTGAATGGAAGAAACTTTACCTGTATCCTACTCGGA 
TCGGAAAATCAATATTAAAAAATGATATTAAACATATAAATGTCGAACTTGTTAACCGCAACTATAATGGATTGATATTTTTTCTTGTTTAATTG 
AATAAGTGAATAATTATGAAAATTATGAACGTAACCATAAAAGCTCTTCATGCGTTTTGTTGTCAGCTTTTTCTGTGATAGGGGGATCTGGTGCT 
CTATAAAAGTCCGGCGACAAAGCCAACCGTTTGTCAGTCGTTTAAGAAACGCAGGGAATATAACGTCGGTGCTAATAGACTCCGATTTTTGAGCA 
TTTAAGAGCTTCAGAGAGGAAGAGCAATCAAAAAGCTTCTAACTGCTCACGAAAAAAGAGAATATTCGAAACTCTGCTCTGTGTCAGTGATTATC 
TATAAAGTGTGGCACTTGTAATTATTTTTCCTATCAAATAATATCTTTTCGTCACCAGAAGGACTTAAAATGTCAACAACAAAGCCGTTTAAAGT 
GCTGATAGCACATACCGATGTTCCGCCAGAGGGAATTGAAATCCTGAAGGAGCAATGCGAGATCCTGCAAGTAATAAATGAACCTCCAAAAAATC 
GGCCTGAAATTCTAGAAAAGATCAGGGGGGCTCATGCCGTTATTTGGGGTGGTCGGGACATACTCAATGCCGAGATTCTGGACGCCGCTGGTCCG 
CAATTAAAGGCGGTATCCACCATGTCGTCGGGGATAAACAATGTGGATGTGCCGGAGCTAAAGAAAAGGGGTATCCCCTTGGGAAGTACTCCTGC 
GATGCTGACTGTTGCGGTAGCCGATCTTACGGTGGGCCTGTTGATCGCCGCTGCCCGGAGGTTCCAAGAGGGTCGAAGGAAGATAGACAGGTAAA 
GCAAAACAACCTGTCATTTTTTATTAATTACTCTGATATCGCAAAGTTTCGTATCCTTGATTACAGTGATAAATGGGATAAGGACCATCTTAACT 
GGATGCTGGGTCAGGATATCCGTGATTCCACCGTGGGCTTCTATGGCTTTGGAGGCATTGGCCAGGCAGTGGCTAAGCGTTTGATGGGATTCGAT 
ATTAAGCGCATGTTATATACCACCCGGAATCGCGTTAGCCAGGATATCGAGGAGCGGTTCAATGCCAAGAAGGTGGATTTCGAAACGCTTCTGGC 
GGAAAGTGATTTCCTCATCATTGCATCCCCATTGACCAAGGAGACATTGGGATTGTTTAATGCCACCGTCTTTAATAAAATGAAAGAAACGGCAG 
TACTGGTCAACGTTGGCAGGGGCAGTGAGTATATCCGAAAAAAGCAAACTAAAATGCCTAATGCCTTTCATTCTAGAAATTGTCAACCAAGATGA 
TCTTTACGAGGCGCTTAAATCAAACCGAATTTTTGCTGCTGGACTAGATGTTATGGATCCGGAGCCCCTTCCTTCCAACGATAAATTACTTACTC 
TAGACAATGTTGGTAAGTTATTATGTATCATCTACATAGGGTCAGAAACTATTTATATTTTTATCTCATTAGCGTTTTTTGGTATTTTTCTCAAG 
AATGATAAGAACTGCCATTCAAAGAATTCTTTTGAAAACCTGACACTTTACGTTATAATCTATAGTTCCCTTACCATTAGGCACTTCATTTATCT 
CGTTTGTTTCTTTTCATGAAGTGGTTACTCCTCATGTGGGCTATGCCACCCGGAGAACCCGTGTTGATGCAGCCAATTTGGCTTCGCGTAATGTG 
CTCAAAGGATTGGCAGGAGAGCCGATGTTGTCACCCGCCTACTAAAATCTATCTAATCGGAAACAGAAATCAATGATTGAAAGTATGTGATTTAA 



FIGURE SHEET 62 



TTACAAAACTAACTTGTCGAAACGTTATTCACTTGTATTGTCAGCCTCTTCTTTAGCCACCAGGGGCAGATACAGTGGATCCCACTCCTCCCGCT 
GCAGGGCCTTTAGATACTTCCTCCTCTCCTGGAGGACTCGCGGCACTAACTCCGGCTGCTCCCCCTTGTGTTTCTGTGATATTTCCAGGGCCCGC 
CGCCAGTTGTGCATCCGCAGGGCTAGTCCGACTGCCTGTTCGATTTTCTTGCCGTGCAGCAGGATGGTCTCTGCCTCAAGCATCCTGCCCAGCAT 
TAGTGAGTTTTCGGCCATCTGCTCGGCACTCGAGGGCGTCAGGGCCTTCAGGTGCTGCAGGTAGCTCACCTTGTCGATTTGCAGTGCTGCCGAGT 
AGGCCTCCTCGCTGATCTGCAACTGGTGCTTCCTGGTCGCCACCGCCGCAAGAGTGGCCCACAAGCTGGAATGCTGGCCCATTCGGCAGATCTTC 
AACGCCTGTTGCCACTGCCCTTCCAAAAGCGCACGATGCAGGATCTCGCAGTACATGTTAACATTGACGGGGAGTAGGGCGCCGGCGGAACGGAA 
TGTGAGCACGGATTCCTCAAAGCTTTCGATGGTAATGTGTTTGCCAAATTCCCTAGGTGAAAGAGATTTTTAAGCAATTTAAAAAATAGTACAAT 
TAATCTGGATTAACAGATAGCATTAAAAGTACAATGAAGCCTTGAAGTGATTTCAATTTAAAATAAAACCCGACATTCTAAAACAAGTTGAGTTA 
GTAAGTACATCTATGTTCCTTACGTCGTATCCAATGTAATAGTGGTCAGGGCAATGATCGTAGGATCGGAGGCTCCTTCCCCGGGACAATACCAG 
ATGCTGTAGCACGAGTCGTGTACTCCCACCAGAATATTTGTTTCGCTGGCCCACATAATGGCGGTGAGTTGGGTGCCGATCTTGTAGATCTCATC 
(SEQ ID NO: 136) 

Exon: 1001.. 1086 
Exon: 2269.. 2404 
Exon: 2474.. 2655 
Exon: 2863, .3194 
Exon: 3313.. 3378 
Exon: 4049.. 4460 
Exon: 4532.. 4869 
Exon: 4922.. 5047 
Exon: 5247.. 5365 
Start ATG: 1001 

Transcript No. : CT5232 

ATGTGCCGGAAGTCAAGAGGTGCAAAGTCCCCTGTCAACAACGACGTGGTCGATTTGGCAGTGGGTTCGAGTCGTCGTTTCCACAAGGCTTTTAA 

AGTGCTGATTTCGCATCCAAATGTCCCAGCACCGGCTCTGGAACTGCTCCGATCCCGTGGAGCGGAGACCATCATCTGCCAGAGTGTGCCGCCCT 

CGAGGGATGAGATCCTGCAGAAGGTGCCCGGCCTGCGTTGCGTGAGCACCATGTCCTCCGGAATCGATTTTGTGGATATTCCGGAGTTCCAGAAG 

AGGGGAATCCCATTGGGCCACACTCCCGGGGTGGTGAAAAATGCAGTGGCCGATCTCGCCATTGGCTTGATGATTGCAGCCGGTCGTCACTTTCA 

TGCCGGTCGCACCGAAATCGAGAGGTCCCAATGGAAAATCGAGCAGATCAACTGGATGATGGGTCAAGAGATTCGCGATTCCGTCATCGGTTTCT 

TTGGCTTTGGCGGTATTAGTCAGGCCATCGCCAAGCGATTGCAGTGCTGGGATGTGGCCAAGATCATCTACCATACTCGCACCCGAAAGGAAAAC 

GATGGCGACTTCAAGGCAGAGCATGTGTCGTTTGAACAACTTTTACAAGAAAGTGACTTCCTGGTGGTAGCTGCTCCACTTACAAACGAAACTAG 

AGAGAAATTCAATGGAAAGGCATTTAATCTGATGAAGAGGAGTTCTGTGTTTGTCAATGTGGCCAGAGGAGGCTTGGATGTGACCACACCAGAAC 

CACTGCCCGCCAATAGTCCTCTTCTCAATGTGCCCAATTGTGAAGGACTTAAAATGTCAACAACAAAGCCGTTTAAAGTGCTGATAGCACATACC 

GATGTTCCGCCAGAGGGAATTGAAATCCTGAAGGAGCAATGCGAGATCCTGCAAGTAATAAATGAACCTCCAAAAAATCGGCCTGAAATTCTAGA 

AAAGATCAGGGGGGCTCATGCCGTTATTTGGGGTGGTCGGGACATACTCAATGCCGAGATTCTGGACGCCGCTGGTCCGCAATTAAAGGCGGTAT 

CCACCATGTCGTCGGGGATAAACAATGTGGATGTGCCGGAGCTAAAGAAAAGGGGTATCCCCTTGGGAAGTACTCCTGCGATGCTGACTGTTGCG 

GTAGCCGATCTTACGGTGGGCCTGTTGATCGCCGCTGCCCGGAGGTTCCAAGAGGGTCGAAGGAAGATAGACAGTGATAAATGGGATAAGGACCA 

TCTTAACTGGATGCTGGGTCAGGATATCCGTGATTCCACCGTGGGCTTCTATGGCTTTGGAGGCATTGGCCAGGCAGTGGCTAAGCGTTTGATGG 

GATTCGATATTAAGCGCATGTTATATACCACCCGGAATCGCGTTAGCCAGGATATCGAGGAGCGGTTCAATGCCAAGAAGGTGGATTTCGAAACG 

CTTCTGGCGGAAAGTGATTTCCTCATCATTGCATCCCCATTGACCAAGGAGACATTGGGATTGTTTAATGCCACCGTCTTTAATAAAATGAAAGA 

AACGGCAGTACTGGTCAACGTTGGCAGGGGCAAAATTGTCAACCAAGATGATCTTTACGAGGCGCTTAAATCAAACCGAATTTTTGCTGCTGGAC 

TAGATGTTATGGATCCGGAGCCCCTTCCTTCCAACGATAAATTACTTACTCTAGACAATGTTGTGGTTACTCCTCATGTGGGCTATGCCACCCGG 

AGAACCCGTGTTGATGCAGCCAATTTGGCTTCGCGTAATGTGCTCAAAGGATTGGCAGGAGAGCCGATGTTGTCACCCGCCTACTAA 

{SEQ ID NO: 137} 

Start ATG: 1 

MCRKSRGAKSPVNNDVVDLAVGSSRRFHKAFKVLISHPNVPAPALELLRSRGAETIICQSVPPSKDEILQKVPGLRCVSTMSSGIDFVDIPEFQK 
RGIPLGHTPGVVKNAVADLAIGLMIAAGRHFHAGRTEIERSQWKIEQINWMMGQEIRDSVIGFFGFGGISQAIAKRLQCWDVAKIIYHTRTRKEN 
DGDFKAEHVSFEQLLQESDFLVVAAPLTNETREKFNGKAFNLMKRSSVFVNVARGGLDVTTPEPLPANSPLLNVPNCEGLKMSTTKPFKVLIAHT 
DVPPEGIEILKEQCEILQVINEPPKNRPEILEKIRGAHAVIWGGRDILNAEILDAAGPQLKAVSTMSSGINNVDVPELKKRGIPLGSTPAMLTVA 
VADLTVGLLIAAARRFQEGRRKIDSDKVJDKDHLNWMLGQDIRDSTVGFYGFGGIGQAVAKRLMGFDIKRMLYTTRWRVSQDIEERFNAKKVDFET 
LLAESDFLIIASPLTKETLGLFWATVFNKMKETAVLVWGRGKIVNQDDLYEALKSNRIFAAGLDVMDPEPLPSNDKLLTLDNVVVTPHVGYATR 
RT RVDAAML AS RN VLKGL AGE PML S PAY * 
(SEQ ID NO: 138) 

Name: HYDROXYPYRUVATE REDUCTASE-LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384610 

CATTTCGATGGTAGGTCTCCCAAGGCAACAGAAGCACAATCATCACCAGAACGAAGAGTATAAGCACTGGTATGATGTAGCCGTGCTTGCAGGAA 
CAGATATTTCGCTCATCCTGGAGCCCTAGGCCGCAATTGGGACGCCGGCAACACGATGTGGTCAAAGGGGTTGGCTCTTCCGGTGCTCTGGGCAC 
TGGCTGGTACTCTCCGGGTATCTCATGCACTACGCGACGCTCGGACATGGCAAGGGGATCAGGACTAATGCTGACTGGCCATTGTCAAACATTTT 
TCACAGCCGAGCCTATTTATTATTCGCGCAGTTGTGGGAAAGTATCACCCGCCCACGCAAACAAATGGGCAGTGTTGGACAGCACAGCGCGGTAG 
TCGTCTGCGCTGGCTGCTAAAAAAAAGCTAAAGGATAAATTAATCCAAGTCCCTGCTCAGAAAAATAAGTTGTCGTCAGTGCATTATTTCAGCAT 
TTACTTCGATTTAAGTTTTAACGCCATAGAGTCCATTAATTCTCTTGAAGTTAATAAATGTTTATCGCTTTAATTCAAAACTGATAAACTTTCTA 
TGGTCCTATTTATTTAACGACCCAATTTTGGCGGGAAACAAGCCGGCGGAACATTGCTGACGCGTATGCGTGACGCCAGCTAACACTGGTCGCCG 
TGGTAAAGAATTTAGTCTGTCTTAAAATCTTCCGGCTGAGAGGGCTGAAAATTTTGACGGCTTCGAAGCATTTGCTGCAATATCAATTTCGTCGG 
GTTTATTTCGAGTATCGCTTAATCAGGACGCGACCATGGCTGCGAGATTGATGAACGCCCAGGCACAGGTAGGTTCTAGAGCGGAGCCTGTGCCT 
CCAGTCGGCCAATATATAACCACGGCTATATATACGTTTACGTATAGAATCTGTGAATGGAACCCCGAAATTCCAAGCTGCGATAAAGACAACTG 



FIGURE SHEET 63 



GACAGAGCACTGCGCGAAAGTTTACAAAAAATCATTGCCCATTTCTTACACACCCAGCAGGATGTGGGCAGCGAAAAATAGCAAATTTTTCGCAC 
AGCGTTTTCTAACCTAAAATCATAAAATTCGAAGTGGTTAGGTAACCGAGTTCGCAGTGTTTGTGATTCTACAGGCCCTGCATCCCAAACATGTA 
TACTGAATACGCAGCAGTGAACCATGCGAAATCGTTAGCTCTCAACAATTAACAGCGATTCGTGGGGGTTCGATGGTTGGCATAGTTGCATTTAC 
ATAATCGCCGGGGTCCAGTTATTTAACCAGGAGGCGAACACCCTCTAGAGCATATATCTTGTGGCGGAGGACCGTCTTATCTGTGACAGGTTTAC 
TTACACTTTGATAATGCCAGCACTAAATATAGCTTAAAAAAGTTGCGCCGTTTAACTTGAGTATCACGCGCATTGGAGGAGTTTTGCCTTTCAAG 
TGGGGTTTTTCCACGTCAGCTTCGAGAAAGGTTTCACTGTACTCTGCCCATAAGTTAACGACCTATTTGACGTAATTGTCATTGCAATTGTTTCG 
ATTTGCGTGTATGTACATACAGTCGTGTCGTCACGTGGTTCTTCTAATTGGAGTGTTCTGTAGAATATTGTAACAATTTGAAAGGCTTACAAACA 
AACTATATTTCCAGCTACTTTTTATTTTTTTATTCTGCAGTTCGAAGAACCTTAAAGCACTTTGGCTAGCACTTAAAATACTTTGATTGTTAAAT 
GTAAACTTTCAATCATTTTACTAATCTCAAAAAAATATTTGGAAAAATTTCACTCTGAAGCGGTAAATGGATCAATGGATTATTTTTGGCTATTC 
AAAAATTTCMGACGATTAAAATTTATTAGCTTTAGCAATATATTGATCTTAAAACAAGTGAAAGCGGTGATAATAATGTTTTACATTTGTTATA 
TATGTATGCAAGATTAAATGTACAGTAAAGAAGAATTTTCACAGATATGGTCCCCACATGATTGTACTCTTGTCAGAATATCTTTGGGAATTATC 
ATTTCCTGATAGAGGAATCAATTTAATTAAAGTGACGAGGACTCGTCCTCTTGACATGATAATCAAGTGTTTATTGGGCTACCAAGCGAATAAAT 
AAAATCTCACAGTGTGCGAAGTGCAAATAAACTAGAGAATCACATTGTGTTAATTACTTGATCTTGATTGCTAAATGTACCACTTCACCATATTT 
AACAGTTCGACTTCTTATCTGCTTTCCAGGTGTGTCGACTGGGCAAGCATGTTGCGTCCGAAGCCACAGTGGTGCGTCAGTTCCACGCCTCCTGC 
TACACCGCCTCCAAAGTGGCTCTGTCCAAGTTCGACTCGGACGTCTACCTGCCGTACGAGAAGCTGAACAAGCGCCTGGAAGTGGTGCGCGGTCG 
CTTGAACCGCCCACTGACTCTCTCCGAGAAGGTGCTGTACTCGCATCTGGACGATCCCGCAAACCAGGACATCGTGCGCGGTACTTCGTACCTGC 
GTCTTCGCCCCGATCGTGTGGCTATGCAGGATGCCACTGCACAGATGGCTCTGCTTCAGTTTATCTCCTCCGGACTGAAAAAGGTGGCTGTGCCC 
TCCACTGTTCACTGTGATCATTTAATCGAGGCCCAGATTGGTGGACCCAAGGATTTAGCCCGTGCCAAGGACCTGAACAAGGAAGTGTACGATTT 
CTTGGCCAGCACCTGTGCAAAGTACGGATTGGGCTTCTGGAAGCCAGGCAGCGGAATCATCCATCAGATCATTCTGGAGAACTACGCTTTCCCCG 
GCCTGCTGATGATTGGTACTGACTCACACACCCCCAACGGTGGTGGCTTGGGTGGCCTGTGCATTGGCGTTGGTGGTGCTGATGCCGTCGATGTG 
ATGGCCGACATCCCCTGGGAGCTGAAGTGCCCCAAGGTGATTGGTGTCAATTTGACTGGCAAGATCAGCGGCTGGACCTCGCCCAAGGATGTTAT 
CCTGAAGGTTGCCGATATCCTGACCGTCAAGGGAGGCACCGGTGCCATCATTGAGTATCACGGCAAGGGTGTTGACTCTATCTCTTGTACAGGCA 
TGGCCACCATCTGCAACATGGGTGCTGAGATCGGAGCTACCACCTCTCTGTTCCCCTTCAACCAGCGCATGGCTGATTACCTGAAATCGACAGGC 
CGTGCTGGCATTGCCTCCGAAGCCCAGAAGTACCAGGCCAAGATTCTGTCCGCTGACAAGAACTGCGAGTACGATGAGCTGATTGAAATCAACCT 
AGACACCCTGGAGCCCCATGTTAACGGACCATTCACTCCCGATTTGGGCCATCCCATCAGCMGCTAGGCGAGAACTCGAAGAAGAACGGGTACC 
CTATGGACATCCGTGTAGGTCTTATTGGCTCTTGCACCAACTCTTCGTATGAGGACATGGGTCGCTGCGCCAGCATCGCCAAGGATGCCATGAGC 
CATGGCCTCAAGTCGAAGATTCCTTTCAATGTGACTCCCGGATCAGAGCAGATCCGTGCCACCATCGAGCGCGACGGCATCTCTGAGGTGTTCGA 
CAAGTTCGGCGGTACCGTTCTGGCCAACGCTTGCGGTCCCTGCATTGGACAGTGGGATCGTAAGGACGTTAAGAAGGGCGACAAGAACACCATTG 
TCACCTCCTACAACCGAAACTTTACTGGCAGGAACGACGCTAACCCGGCCACCCATTGCTTTGTCACTAGTCCCGAGTTGGTCACCGCTCTGTCC 
ATCGCCGGTCGTCTGGACTTCAACCCGCTGACCGACGAGCTTACTGGTGCTGATGGCAAGAAGTTCAAGCTCAAGGCTCCCTTCGGCGACGAGCT 
GCCCGCCAAGGGTTTCGATCCCGGCCAGGACACCTACACTGCTCCCCCACCCGTAAGTACATTTAAATTCCCAAACTGATTTATATTGTTCTTAA 
GAATCAATATGGAAATCCCACAGAGCGGCGAAAATGTCAAGGTGGCTGTCGACCCCAAGTCGACGCGTCTGCAGCTGCTGGAGCCTTTTGACAAG 
TGGAACGGCCAGGATCTGACCGATCTGACTGTGCTGATCAAGGTTAAGGGAAAGTGCACCACTGACCACATCTCCGCTGCCGGACCCTGGCTGAA 
GTACCGTGGCCATT T GGAC AAC AT CTCCAAC AACATGTTTATCGGAGCCACCAAC TACGAAAACAACGAGATGAACAACATCAAGAACCAACGCA 
ACGGTAGCTGGGGAGGAGTTCCCGACGTAGCCCGCGACTACAAGGCCAACGGCATCAAGTGGGTGGCCGTCGGCGATGAGAACTACGGCGAGGGT 
TCGTCCCGCGAGCATGCCGCTTTGGAGCCCCGTCACCTTGGTGGTCGTGCCATTATCGTCAAGTCCTTCGCCCGTATCCACGAGACCAACCTGAA 
GAAACAGGGTCTTCTGCCTCTCACCTTCGCCAATCCCGCTGATTACGATAAGGTTTGCGACTCTTAGCTTCTGTGACTAAGGTCATATTAACCAT 
ATCTTTGTACTTCAACAGATCCAGCCCACGAGCAAAATCTCCCTGCTCAACCTGAAATCCCTTGCGCCTGGAAAGCCCGTAGATGCCGAAATCAA 
GAATGGCGACAAGGTCGAGAGGATCAAGCTTAACCACACCCTGAACGACCTACAGATCGGCTGGTTCAAGGCCGGCAGCGCTCTCAACCGCATGA 
AGGAGCTGGCCCAGTAAATGGTCCTGGACCTACTTAAGACACCCAGTCAAGAAGCCTGATCCCAACCTTCTTCCTGTAGGTGAACACACTCAAAT 
TCACAACGGAAGCAATCGAATTATGGTATTGTTATTCAATTCTTTTCAAATTGTTTATATTATATTTTGACTATATCGAGTGTGCAGCTTTAAAA 
GTGAATTATGTAGCCGAACGATAGCGAGAAATTGTAAGAGTTTTATATATATAATTATACACAATTTTATTTTTAATTCTGGCTATGCATGCGTT 
TCCTTCAATTTTTGTTCTTTTTGAATTTGTCCAAATGCTCTACAGATTGCCTCTACTATGAAACTGTTTTAAGTGCGAATAAAACTCAAGTAATA 
TGTATTAAACCGATTCTTTCATTTAATAAACAACAGTTAAAACAAAAAAGCGGACAATAAGTGCCGTCAATTCATCCCGTGTCCTCTATATTTTC 
TTCATTATTAACATAATTTTATCAATTATAAGACGTAAGACACAACAATTTGCACAATAAAGCATAATATCAATAATAGAATCTACTGGGGCAGA 
TTACTTAATTATTCACCCATAGCTTTTGGAAAATATGTCGAATTGTCAATCAAATATAAATTATTAGTAAATCCAATTCTAGGACTATGAAACGC 
AGAGCTCAAAGTAAAACTTCCGTTGCTTCCAGCACGGATTTCTTTTCACTCTCCGACAAGCCATCGTTGGGAAGAAGACGAGCCCTACTGGCTGC 
AAAACTTAAGCGTCAGGAACAGAGTACAACGTCCTTTGACTCCAGCACAGGCTCCTACACTTATACAGACGCCACTTCTTTGCGAAAGACCCCCA 
TCACCTTCTCACCTAGCCCGGAGTTCATGAAGACCGCCAAGGCTAAATTAAAGTTAATCGCATTGTATGAAGATCACAAATGCTTGTGGAATCAG 
C G C AAC AAAG AT T T CTT C AAT T T C G AGCT C AAAG AC AG A AT C T GGG AT G C GAT T GCC G AT G AAAT GA AAG CGG ACT C GC CAT C T GG GTT T TGG AA 
ACATATGATTCACAAGCTTCGCTATAAAGTTGAGATGGAGCGCATACAAGAGCAGGGTGCAAAGTTTTCGGGTGAATCGCCGCAGCCAAAACTAT 
TTTATTCAGATAATCTGCTGTTTCTAAATCACATGTTTGACCGAGAAGGGGGAAAACCCCCTCGAGAGATTGTATGTTTATAGTTAACGAACAAA 
TCACTTTAGTATTTATTCTACTTTTTACTCAGCCAATGCATCTTAAGCCTTCGGGACCTACTCTCGAAAAGAATCCGTCTAGAAACCTGCGACAT 
GTT C AG G AA AAAC GT AAA CGT AC GC CC AT T AG AG AAA AG AT GGC T T CGC T T GAG AAGT T G AGGA 
(SEQ ID NO: 139) 

Exon: 1001.. 1086 

Exon: 2215.. 3852 

Exon: 3919.. 4422 

Exon: 4484.. 5049 

Start ATG: 2494 

Transcript No. : CT5284 

CACCCAGCAGGATGTGGGCAGCGAAAAATAGCAAATTTTTCGCACAGCGTTTTCTAACCTAAAATCATAAAATTCGAAGTGGTTAGGTGTGTCGA 
CTGGGCAAGCATGTTGCGTCCGAAGCCACAGTGGTGCGTCAGTTCCACGCCTCCTGCTACACCGCCTCCAAAGTGGCTCTGTCCAAGTTCGACTC 
GGACGTCTACCTGCCGTACGAGAAGCTGAACAAGCGCCTGGAAGTGGTGCGCGGTCGCTTGAACCGCCCACTGACTCTCTCCGAGAAGGTGCTGT 
ACTCGCATCTGGACGATCCCGCAAACCAGGACATCGTGCGCGGTACTTCGTACCTGCGTCTTCGCCCCGATCGTGTGGCTATGCAGGATGCCACT 
GCACAGATGGCTCTGCTTCAGTTTATCTCCTCCGGACTGAAAAAGGTGGCTGTGCCCTCCACTGTTCACTGTGATCATTTAATCGAGGCCCAGAT 
TGGTGGACCCAAGGATTTAGCCCGTGCCAAGGACCTGAACAAGGAAGTGTACGATTTCTTGGCCAGCACCTGTGCAAAGTACGGATTGGGCTTCT 
GGAAGCCAGGCAGCGGAATCATCCATCAGATCATTCTGGAGAACTACGCTTTCCCCGGCCTGCTGATGATTGGTACTGACTCACACACCCCCAAC 
GGTGGTGGCTTGGGTGGCCTGTGCATTGGCGTTGGTGGTGCTGATGCCGTCGATGTGATGGCCGACATCCCCTGGGAGCTGAAGTGCCCCAAGGT 
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GATTGGTGTCAATTTGACTGGCAAGATCAGCGGCTGGACCTCGCCCAAGGATGTTATCCTGAAGGTTGCCGATATCCTGACCGTCAAGGGAGGCA 
CCGGTGCCATCATTGAGTATCACGGCAAGGGTGTTGACTCTATCTCTTGTACAGGCATGGCCACCATCTGCAACATGGGTGCTGAGATCGGAGCT 
ACCACCTCTCTGTTCCCCTTCAACCAGCGCATGGCTGATTACCTGAAATCGACAGGCCGTGCTGGCATTGCCTCCGAAGCCCAGAAGTACCAGGC 
CAAGATTCTGTCCGCTGACAAGAACTGCGAGTACGATGAGCTGATTGAAATCAACCTAGACACCCTGGAGCCCCATGTTAACGGACCATTCACTC 
CCGATTTGGGCCATCCCATCAGCAAGCTAGGCGAGAACTCGAAGAAGAACGGGTACCCTATGGACATCCGTGTAGGTCTTATTGGCTCTTGCACC 
AACTCTTCGTATGAGGACATGGGTCGCTGCGCCAGCATCGCCAAGGATGCCATGAGCCATGGCCTCAAGTCGAAGATTCCTTTCAATGTGACTCC 
CGGATCAGAGCAGATCCGTGCCACCATCGAGCGCGACGGCATCTCTGAGGTGTTCGACAAGTTCGGCGGTACCGTTCTGGCCAACGCTTGCGGTC 
CCTGCATTGGACAGTGGGATCGTAAGGACGTTAAGAAGGGCGACAAGAACACCATTGTCACCTCCTACAACCGAAACTTTACTGGCAGGAACGAC 
GCTAACCCGGCCACCCATTGCTTTGTCACTAGTCCCGAGTTGGTCACCGCTCTGTCCATCGCCGGTCGTCTGGACTTCAACCCGCTGACCGACGA 
GCTTACTGGTGCTGATGGCAAGAAGTTCAAGCTCAAGGCTCCCTTCGGCGACGAGCTGCCCGCCAAGGGTTTCGATCCCGGCCAGGACACCTACA 
CTGCTCCCCCACCCAGCGGCGAAAATGTCAAGGTGGCTGTCGACCCCAAGTCGACGCGTCTGCAGCTGCTGGAGCCTTTTGACAAGTGGAACGGC 
CAGGATCTGACCGATCTGACTGTGCTGATCAAGGTTAAGGGAAAGTGCACCACTGACCACATCTCCGCTGCCGGACCCTGGCTGAAGTACCGTGG 
CCATTTGGACAACATCTCCAACAACATGTTTATCGGAGCCACCAACTACGAAAACAACGAGATGAACAACATCAAGAACCAACGCAACGGTAGCT 
GGGGAGGAGTTCCCGACGTAGCCCGCGACTACAAGGCCAACGGCATCAAGTGGGTGGCCGTCGGCGATGAGAACTACGGCGAGGGTTCGTCCCGC 
GAGCATGCCGCTTTGGAGCCCCGTCACCTTGGTGGTCGTGCCATTATCGTCAAGTCCTTCGCCCGTATCCACGAGACCAACCTGAAGAAACAGGG 
TCTTCTGCCTCTCACCTTCGCCAATCCCGCTGATTACGATAAGATCCAGCCCACGAGCAAAATCTCCCTGCTCAACCTGAAATCCCTTGCGCCTG 
GAAAGCCCGTAGATGCCGAAATCAAGAATGGCGACAAGGTCGAGAGGATCAAGCTTAACCACACCCTGAACGACCTACAGATCGGCTGGTTCAAG 
GCCGGCAGCGCTCTCAACCGCATGAAGGAGCTGGCCCAGTAAATGGTCCTGGACCTACTTAAGACACCCAGTCAAGAAGCCTGATCCCAACCTTC 
TTCCTGTAGGTGAACACACTCAAATTCACAACGGAAGCAATCGAATTATGGTATTGTTATTCAATTCTTTTCAAATTGTTTATATTATATTTTGA 
CTATATCGAGTGTGCAGCTTTAAAAGTGAATTATGTAGCCGAACGATAGCGAGAAATTGTAAGAGTTTTATATATATAATTATACACAATTTTAT 
TTTTAATTCTGGCTATGCATGCGTTTCCTTCAATTTTTGTTCTTTTTGAATTTGTCCAAATGCTCTACAGATTGCCTCTACTATGAAACTGTTTT 
AAGTGC G AAT AAAA C T C AAGT AA T AT GT AT T AAAC CG AT 
{SEQ ID NO: 140) 

Start ATG: 366 

MQDATAQMALLQFISSGLKKVAVPSTVHCDHLIEAQIGGPKDLARAKDLNKEVYDFLASTCAKYGLGFWKPGSGIIHQIILENYAFPGLLMIGTD 

SHTPNGGGLGGLCIGVGGADAVDVMADIPWELKCPKVIGVNLTGKISGWTSPKDVILKVADILTVKGGTGAIIEYHGKGVDSISCTGMATICNMG 

AEIGATTSLFPFNQRMADYLKSTGRAGIASEAQKYQAKILSADKNCEYDELIEINLDTLEPHVNGPFTPDLGHPISKLGENSKKNGYPMDIRVGL 

IGSCTNSSYEDMGRCASIAKDAMSHGLKSKIPFNVTPGSEQIRATIERDGISEVFDKFGGTVLANACGPCIGQWDRKDVKKGDKNTIVTSYNRNF 

TGRNDANPATHCFVTSPELVTALSIAGRLDFNPLTDELTGADGKKFKLKAPFGDELPAKGFDPGQDTYTAPPPSGEKVKVAVDPKSTRLQLLEPF 

DKWNGQDLTDLTVLIKVKGKCTTDHISAAGPWLKYRGHLDNISHHMFIGATNYENNEMNNIKKQP^GSWGGVPDVARDYKANGIKWAVGDENYG 

EGSSREHAALEPRHLGGRAIIVKSFARIHETNLKKQGLLPLTFANPADYDKIQPTSKISLLNLKSLAPGKPVDAEIKNGDKVERIKLNHTLNDLQ 

I GW FK AG S ALN RMKEL AQ * 

(SEQ ID NO: 141) 

Name: aconitase 
Classification: enzyme 

Celera Sequence No. : 142000013384610 

ATGAATAAATATCAAAACATTTTTCAAAAGTTTGATTATTATCCTCGTTTTTCTCGATGTTTATGTAGGCACGCTCCATATCTGCTTTTAATCAA 
AACATTGAAC AAC AC AAAAAAT GT AAC AGT AACT AT GTT TGGGATAGAATG ATTGATAAATGAG TGCT TGC AAC AGGTCCT GAAATT TAT AGACT 
GAACGGTTACCCGATATATATATATATATGTATGTATATGTTGTTAGTTATTCATTGGTTACAAATTAATTTATTTTATTGTATTGCTTATATAA 
AGACATGAGTTTATAATTTAATGGTGGATACAGATTTTTTCTAATTTAAAGTAGCAACCGAAAGTTACAACAAAACTAGTATGTGTAATCTATAC 
ATCCCACATTAGTAGTACAGAATAATTTAGTAATAATATATATACATACTTTATAATGGTACGAATTAATCCATTTGAAAATTTGTATGTTACAA 
CTTTAGAACATTTGTTTGCCCAAAAATAAGGATTCATATTTTTAAATAAAATTCTTATAAAGGAATGTGTTTAGATGGTTAGAAAGAGCCACCAC 
TATCTTAGAATATGTTGCTAAAGTTTCAGCGTTGCAAGCTTATTTCAGAACGTTGCCTCAATCATACTAACGCCCACAACTGTCACTGCTACAGT 
TTTGGAAAAGGTTTTGATTTTATCATTTGTCTTGCACATTGTATCGATGTGCCAAAAATATGATGAGGCCAAACATATACAATGGTCATATCCAA 
TAGTCAGAAATTAATAAAAACAAACAAGTTTTTTAAAAATACCGCACATGCAATTAGTGCGCATAGTGACATCTGCAGTCGCTAAAAGAGACCCA 
AGTGTTAAAAGCTTACTCGACTAGAAGCTATAAATGGCGATTAATTCCTCTAAGGGAATACCCTCGATGTCCTTATGAAGGCAGTGGAGTAGCAC 
AGAAAGGTAAACAGTACCTACGACTGATTTATTGAGATTTGTGGAAAAAGTACTACAGCCACACTGAAAACTCAACTCTGCTGTTAAACAGCTGA 
AGACAAGTAGACAGGCAGACAAACTCTAAAAATGAACGGAAAGTATGTAGTTAGCGCCATCCGAGCTCTCCGGTTGACCAATTTCAGCCAGTGCA 
AAGGAGCCGCTAGCAGTTTCACTGTGCGCCACTATGGTAAGCCGCGATAGCCAGAATAAAAACGTGTGTTGCCATTTCATAACATCCCGCGATCG 
GCGGTTCTTCTCCTGCAGCCTCTCCTCCCAAGCGATTCTACAAGAAGACTTCCGTACTGAGCGGCGACAGCGGCTATGAAGTGGTGTTGGATCAC 
CGGAAACTGAAAACGCCCAAGGGCACACCCTTCATCGTCCGCAGCGAACCCCTGGCCATTGCTGTAGCCACTGAGTTCGACGCTCAGAAGGAAAA 
CATCGAGCGTTCCCGCATGCACTTATCCGCGCTCTGCTTCACGGCCATTGACAATCCCAATCACCTCTCCAAGTTGGACATGGTCAACTACCTTC 
TCAACTTCATCGCCACAGACACGGTGCTCTTCCAGTACGATGTGAGTGAATCAACTGAGAATTAGAGTGCACATTCCACAGACATATATCTTTTG 
CAGGACGAAAAGGACCTGCAGGACCTTCAGGTGAATGAATGGGACCCGGTGATCGCCTGGTTCAACCAGCGCTACGACACAAACCTGCAGAAGAC 
CATGAACATAACGCCGCCTCAGGTCAGCGAGCAGGATAAAATGAATGTCGCAAAGCATTTTCAGTCCTACAGTTTGGAAACTCTGCATGGTAATG 
CAGACCATAAGTCTGGAGTTCAACGTCTATTAAATATATTTTCCTCCAGGCTTCATTTTCGCTGTGGACACCTTAAAGTCGATTGTCCTTGCCTG 
CGCCGTCATCGAGCAGATGCTTACCGTGGAGAAGGCTGTGGCCCTGGCGCGTCTAGAGGAAGAATACCAACTTAAGTTCTGGGGTCGCGTGGAGT 
GGGCACATGATTTAAGCCAACAGGAGTTGCAAGCACGCCTGGCGGCTGCCGTGCTCTTCGTACACCTTAACTGTTCGGAAAACCTTGTTAAGCAA 
AAGATAATTCTTTAAAATTAAAATACCTGTTCAACATACATATGAGCTGACTACCACAGGGTTATGGTAGCACGTATGTAGGTCTCCTAGGACCT 
AGGACATAATAATAGATGTATATAAATCGTGCCTAGTTTGAAACTTATCAAGGTCAGGTCATCCTCCCTTTTTTTTTAGCTCTTACCGGTTTTTT 
TTTGTTGTAATAATACGGTTTATATTATCATTTTTTCATACTTTCTTTAAGCACGAAAGTAACATACTCTTTTGACGTCTCTTATAACACAACTG 
AATTTTAGTTCTGATTTTAATTAGTAAGCCACAAGGATTCTGGTACAGCAATACGTGGAGGCAGTGAACATAAGGAAGTACATTAAGTTTTTTGG 
AACTACACATGTACAAATGTGTAGGATATATGTAACATTTTTAAATTTGAATGAGATTACTAAATTTTAGAAATATTTAGTTTGTTTAATCCTAA 
TTTAAGGTTCGGTTTTTTTGCACCTTGACATAGAACCCTATTTTTGGAACCCATTGGCAAAACATGGACATTTATTTATCTTACTTAAGAAAAAA 
TTAAGTGCTAGTGTAATAGGCGCTTAGAGGAAGTCTCAAAAAACAAAAATAATTAAGCAATTTTTTTTCGTAGGTGTGGTATATTCTGATAAATA 
CTTGCTTGGTATTTTAAAATAGTTACGGTACAGTCCTACTGTACACTGCGGTCACACTGGTTCGGACGTGGGTGGCGCATCATCGGTGGTATTCT 
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GTTGCTAATTAATCAAGTGCAAZVACTACATTTTTCGCCAGTAAATCCGAGGAAAGCTAAGTTTGCAAGAAAACAAACGAAAGCAAAGGCAGTTGA 
TAATATTACTGGCGCGTTTGGTAAGCAGCGTCGTTCGAGTGCGTGATTGTTCCAATGTTTGTGCTGGTGTGCGTTTGTGAGTGCCTTTTCACTAA 
AAAAAAATGTAAAAAAAAGTGTTGCCAAAGCAGTGAGGCATACATTTTCCCAGTCGCTTTGTTTC 
(SEQ ID NO: 142) 

Exon: 1001.. 1176 
Exon: 1254. .1561 
Exon : 1619.. 1799 
Exon; 1855. .2105 
Start ATG: 1077 

Transcript No. : CT5298 

TACTACAGCCACACTGAAAACTCAACTCTGCTGTTAAACAGCTGAAGACAAGTAGACAGGCAGACAAACTCTAAAAATGAACGGAAAGTATGTAG 
TTAGCGCCATCCGAGCTCTCCGGTTGACCAATTTCAGCCAGTGCAAAGGAGCCGCTAGCAGTTTCACTGTGCGCCACTATGCCTCTCCTCCCAAG 
CGATTCTACAAGAAGACTTCCGTACTGAGCGGCGACAGCGGCTATGAAGTGGTGTTGGATCACCGGAAACTGAAAACGCCCAAGGGCACACCCTT 
CATCGTCCGCAGCGAACCCCTGGCCATTGCTGTAGCCACTGAGTTCGACGCTCAGAAGGAAAACATCGAGCGTTCCCGCATGCACTTATCCGCGC 
TCTGCTTCACGGCCATTGACAATCCCAATCACCTCTCCAAGTTGGACATGGTCAACTACCTTCTCAACTTCATCGCCACAGACACGGTGCTCTTC 
CAGTACGATGACGAAAAGGACCTGCAGGACCTTCAGGTGAATGAATGGGACCCGGTGATCGCCTGGTTCAACCAGCGCTACGACACAAACCTGCA 
GAAGACCATGAACATAACGCCGCCTCAGGTCAGCGAGCAGGATAAAATGAATGTCGCAAAGCATTTTCAGTCCTACAGTTTGGAAACTCTGCATG 
GCTTCATTTTCGCTGTGGACACCTTAAAGTCGATTGTCCTTGCCTGCGCCGTCATCGAGCAGATGCTTACCGTGGAGAAGGCTGTGGCCCTGGCG 
CGTCTAGAGGAAGAATACCAACTTAAGTTCTGGGGTCGCGTGGAGTGGGCACATGATTTAAGCCAACAGGAGTTGCAAGCACGCCTGGCGGCTGC 
CGTGCTCTTCGTACACCTTAACTGTTCGGAAAACCTTGTTAAGCAAAAGATAATTCTTTAA 
{ SEQ ID NO: 143) 

Start ATG: 77 

MNGKYVVSAIRALRLTNFSQCKGAASSFTVRHYASPPKRFYKKTSVLSGDSGYEWLDHRKLKTPKGTPFIVRSEPLAIAVATEFDAQKENIERS 
RMHLSALCFTAIDNPNHLSKLDMVNYLLNFIATDTVLFQYDDEKDLQDLQVNEWDPVIAWFNQRYDTNLQKTMNITPPQVSEQDKMNVAKHFQSY 
SLETLHGFIFAVDTLKSIVLACAVIEQMLTVEKAVALARLEEEYQLKFWGRVEWAHDLSQQELQARLAAAVLFVHLNCSENLVKQKIIL* 
(SEQ ID NO: 144) 



Celera Sequence No. : 142000013384171 

CCAGCTCCAGCAATTATATAGCCTCCAACGCCAGCAACTCAATCTCGACCAGCGCCAGCAACCCAAGCTCCACCGGCGCCAGCAATCCAATTACC 
ACCAGCGCCAGCAACGAATCGGCCAGCAGTCGTCGAGCCAAGTTCAGGATAAATCAGATGAGTCGCGATGTGCCCGTTGGGCTGCCGGACACTCA 
TCAAACCGTGAATCTGGAGGAGGCGGCCAACACCACCAAGGATTGCCTGCTGCATTTGTTGGAGAAATACAACGAGACCCGCATCCGCAATCCGG 
TGGGTCGTCACCAGAGCATCAGCGTGGACTGGCGGGTCAGCGACAATCTGGAGTACCGCTCCATGAGCTCCATCAACGCCTTCTTCCAGCGCCAC 
AACCCCAATGGCGGCGGCCACAACGTACGCCACATTCAAGCCCAGCTGGAGGAGAAGGCGGCGGGCAAGTAGGTGCCCATATCCACAGCCCTCTG 
TATATATGCGATATATATATTTGTATGTAGAGTACAGATCAACTAATTGGCACATAGGTAAATCTACAAAGCGTCGAATGGAACAAATTAATATT 
TATGTACTTGATGTAAGCAATAAATCAAATTAATTTATTGAATAAAACTGAAGCAATGATTCGTTTCTATTGACGCGGGATTGAATTTTGTTGGC 
AT CC C AC GAT AT T T T AT AAAAAT G AAAT T AAAAA T T AAAAA C GC GAT T T AT GT AC TAC CAACT AGAAAAT C AAAAAC AAT T T AT T GC AGAAAAAC 
AAATATTAACTTATAAAATGTACAAAATTGTTTTAGTAAACTACATTTTACTGCTACGTCTACAACCAAAGTAATGCGGATGTGTTGGTAATTTT 
GAAACAGGACAATCGCGTTGTTTTTGTTTTGTTTATTGATGTATGAGTTGAATATGGTTTAATTATACATAGAGTGTCTATACATATAAATATTA 
GGAGAATACTTTGGATTTGTTCTTAGTTAGGCCTTACAAATTAATGCTTTGTATAAGTGTTATGAAGAATAGGCTCTGCCTATGTGTATAGTATG 
TATAATAATCTAGTTTTTGTACATGATAGTTTCAGGCAAAATTGTTTTCATAATTTGCCCAGAGATCATAGCAAATCATAAAAGACATTTTTAGC 
ACTGGTTAAGAAATGAGAGCGCGACAATTTGTGTCCATTCGGCGGCATTCATATAAATAAGGCATCCAAACGTGCAGAAATTGACGAAGTTAATT 
CAAAGGCATTCCTCTTTTTTTTTTTTGTTTTCCACTGGTCGAATTGGTTACAATGTTAGTAGGTCGCAAAAACAAAGCAGCAATGAGCCGAAATG 
CGGCTTATTGGGGCATCGTTTTGGCGCCCTTAATATGCAAGACAAAACTCAATTCTACTCTCCCCTCAGTTTCAGTACTTGGATGAAGTGGCAGG 
GCAGAGAAAGGATGAGTTTTGTTCACCCGGCAAATTTCAAATGCAAAAGTTGCCGGCGAAATTTGCAAGGATTTCACAAAGTTATGTAAAGTTAG 
TATTCCATCGTTATAAAGGTTAAATTTCGGTTGGCTTCAAGATAGTTAAGACTAATTCCATTTAGCTTGCGGTCTCTCTCTTCTTGTTCAATAGT 
TGTTTGTTGTTAAGAACTAATCACTTAGTTTATGGCATTTCTCACTAGAGAAATCTAAAAAGTTTCATTTAGCTCGAGCTTACTGGTTGACTTAC 
AACTAATTACTGGTTACGAGTAGTTGAAGACTAACGATTAATATTCGCAACTGTCGCCCTTGCACTGCATAAAAAGGTTCGTAATAAAAAATTAG 
GAAATACTACACAATCGCAGTGGCTTGTAAAAAATAAAATTACAAATTGCAATTCGGTGGGTTGGCCAGCGAATCTGGAGCATTAGTTGTGGCGC 
GGCCGCTTCAACATCGCCCCAAAGTGTGCCCCCAACCCAAATCCACCGCTGGAGCTGGAGCTGGAAGTGGCGGACGAACTGGGAGAACTGGTGCG 
CATGCGCTTTGCGGGCGATAGTTTCTTGGGTGCCTGGGCCAGACGTTTGCGCTCCTTCATGAATTCTGTAAACAAAGGAAGGAAAACGAGTGTTA 
TTAATACGTGTTAGGTGGAATTGCCAAATCAAAGTCAGCTGCTGCATTTCCGGTTGCCCCTGTTCACAAGTCACTGATCCCTAAACAGCAGGCGC 
CGCCTGTTTTTCTGTTTTTGCACATTTTGATTTCGGTTTCGGGTAGATTCCCAGATTGTGGCACGTGGACGGCAGCACTTCCCTACCAGAGGGTT 
GCAGATCCGCACAGGGGTAGCAAATAAATGCCAGACGGCAGCCAGACAGTCGGTCTTTAGCCGGTTTTCAGGTAGTCCGGTGAAAGCAAGAGTGG 
TGGTGCTACATAACTGCATAAATTATGAGCTCTAGCTTACTGGCCAAAGCAGCAAAAAGTGCACTTTAAGTAAAGCGAGTTGAGGCGTTGGGGGA 
TTCAAAGGCTTCCCTACTGACCACCCGATGCGGGTGCATTTATTATGTTCCGAACAGACACAATTTTTTTATGATTATAATTACACATGCATGTG 
CTAGTGGCATTTATAAAAAACGCTCAGTACAAAGCAATCAAAGGGAGGTTTGATAAGACCATGGACTCTGGATGGTGGTGCATTGAAAGCCGATC 
CCCAGACCCCTCTCCCGTTTTCTATAATGAGGTCAGGCCATAGACAAAGCGGAGTCACAGCTGTTGGCATTTATAGATGCAGCCAGCCAGCCCGC 
ATGCTATCGATCTGGAAACCAACCCCCGCGTTGTGGAGGATCAGAAGAGAATCATTCGGTTTAGGGAGATGAGGAGCTGAAACAGCTGCAGTCAG 
CTTTGACTTACCCTTCTGGACGAAAATAAATCTCCCACAAAGCATTGGCTAATTGCTATAATTTGGGTCTAAGTTTCACGAAAAACGCCCACAAC 
TAAAACGTGAATATAAACAATAAATGCGAAGGAGTGGACTTAAAACCAATCAAACAACCCCTTATGGAGTTTAGTATTGTAAGTACAAAATCCTG 
C AC AAAA TAT T G G TT T GGC AT AAAAT C AT T GAT G AAAAT C GT TTT CAT C G AGTT T GC AAAAC AAAG AGC AC G AAT C T AC TT T TAT AT C T C GAG C A 
CTATATTCGGAAATTTTTAGATAATACTCTCTACTCTACAGATAATTTTGAAAAATAATATCGTATCAGTCCGATTACAAAACAAGCCTGATAAG 
1"TAAAAAAAAAAAAAAAACACCCTCAACTTCCTCCTGAGAACCGAAAATAAACAGGG AAT AAAA 

TATCGAAGTGAAATGAAGCCTGACCCTGGGAACCGTCAAGCTCTACTGATCAAAATGGTGTAGAACATCCCCCAAAACTCACCTGTGATTTTTGG 
CTGTCTCTTGCGCAGGACTATGGAGGATGTGCTTGTCGTGCTGGAGGAGCTAAGCCGCATTGCCAGCGACTCGTCCTGGGAGTCGTAACAAGATT 
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CATTATCCGTGTTCGACTCTAGCGAGGAATTGACCAGATTGGAGCCAAAGTTCTCCCGTTCTGAGCGCTCGTTGACCAAAATGTCCATGTCGCTA 
GGACTTGCATCCGCGGAAGGGCGTACGTGGGCGGCCCGTGTTAGGGTGTACATCTCTGGGGCGAAGCTCGACTCCTGGAAGCTGACGCGCTCCCA 
AATGTAGGGTGATTTTTCAGCTAAAGGGCAGCCGGCGCGGAAGTCAAAGCCCCATTTTTGCGTGGCCAGTTCTTGATGGCGCTCCAGTTCGGAAT 
TAAAGGGTCTAAAGAGGATATATTGTTAGCTATTGGTCAAAATCCAACTGTTATTATAGAAACTCACGTTTTGTTGGCTGTACCCTCGGCGGATT 
TGGAGCTACCGAAGAGGTCACGCTTTATGCGATTCAACTGACGACCGCATGCCAGATTGCGACTCACCGCCGGGCTGCTGCTCATCTTGCAGAAC 
TCACTGATCATCACAGGATTCAGGACTCGGGCACTGACCATGTTGAGTCAATATATATTTTTTTGGGATCCCCTAGTGTTTTCTACTCTGTTTTG 
AGACGCACTTTTCTTGCCACTTGTTTGCGAATTACACGCCAAAGCGAACTGTCAAACTTTTGGCGCCAAAACTTTGTGTTCAGACAGCCCCAAAA 
GAGAGAGAGGCAGTCTCCACACGAATACAAATCACGGTCTCCACACGCGAGCGTGTGTGTATGTGAGAGGGAGAGCGAGTGTTCTGTGTGTGCGT 
TGTGCTGCAGCTTATTTTACGTCTTGCTGCGTTCAAAAGTGAAATTTCACGCTGCAATTGTTTATAAAAATAATTATAAACAAATGCCTTGGTCC 
AATAGCACTTGGTCACTATCTTTGAATGGCGATTCGAACACACTGCACTGCTAGTCGACTCACAAAACGAAACGTTTGCGGGTCTCGTTGGAGTT 
TTTGTATGAGTTAGCTGCCAATGGCAGCACCTTTGTATTATATTTGTCTTTACAACTTGAAGTCGTTTTTACAGCGATGTTTGTTAACCTTTTCG 
GC T AT G A G C G C AC T GC G C AC TAG A T T T T T AAAT AT AGC GC T ACC GAT C G AAGC G AACC T CT C T C GAC G AACT C GG AG AAAC GAAT A AAAGAT AAA 

TAAAACACTGGGCAAAAGCAAACGTCAAACAACCACCAGAGGCGTTGCCAACTGGAAATTCAGAGCGGTTCACGCTCTCTCTGTAGCAACCCTCG 
AATTCGAAAAGATCGGAAAAGCAGCAGGAGCGAAAGAGATAGGGAAAAAGAATGCGCTTTTAATGTTAATCATATCGAAAGAGCAATGCACTTCC 
TTTGAATTGTTCTTTAAACTTATATAGCAAACGTATAAAAAATACTCAGAAAAATAAAAGTTTCCTAACAAAATTTAACTAAGGCATCTGAGTTT 
AAAAGTTTTGATATTATGAGCTTTAATATAATTATTGATTGTTGGAATCATCATATAGTACAATATATAATACGTATTCATTATTATTATTATTA 
TTATTATTATCATTATTTTCATTATTTCCTGTCAAATAATATTCACAGCTCATAAGCCCATAGTTCATTTGTAAACAGTATTCAACCAGATTTTA 
TTTTTAATGTTTTTGAAATAACCAATTTAATTATATTTTAAACTGTCCAAGGTTCCTTCATAGAGCGTATCAAGAAAGTCGTCACAACTTCCACA 
ACCTCTCTCTCTCTCACCATATGTCTCCCACTCTCTCGCACGCCATCTACCCATCCCTGCCATTTCCATCCGTGTGTTCTGTTCGCTGCCTTCTT 
GGTTGTTGTTGCATCGGTGGAGCTGTACTTTTAGTTGCGATTGCGATGGGAAGATACGAAAAAACGATGTTATGGTTTTGGTTTGTTTGGCCTCA 
AACTGGATTTTTAGTCGTCGGAGGGAAGTGGCAGGGGGCATTTGGGGCACCAATTCTTTTCACTTTCCTCGTGTTGCCAAGGGGAAAATCGCGCT 
TGCATTGTGGGGCACTGGTCCACCAAATATACATTTGTACGATGTATGTATGTATGTATGTAAAATAATGTGCACGTCTAGTTCGAATTTCCTTG 
TGGTGTTTGCATAGCACTGGGTATTTCAAATTTCCGTTAT 
(SEQ ID NO: 145) 

Exon: 4645.. 3868 
Exon: 3808.. 3408 
Exon: 2060. .1001 

Start ATG: 4031 (Reverse strand: CAT) 
Transcript No. : CT5316 

ATTCGTTTCTCCGAGTTCGTCGAGAGAGGTTCGCTTCGATCGGTAGCGCTATATTTAAAAATCTAGTGCGCAGTGCGCTCATAGCCGAAAAGGTT 
AACAAACATCGCTGTAAAAACGACTTCAAGTTGTAAAGACAAATATAATACAAAGGTGCTGCCATTGGCAGCTAACTCATACAAAAACTCCAACG 
AGACCCGCAAACGTTTCGTTTTGTGAGTCGACTAGCAGTGCAGTGTGTTCGAATCGCCATTCAAAGATAGTGACCAAGTGCTATTGGACCAAGGC 
ATTTGTTTATAATTATTTTTATAAACAATTGCAGCGTGAAATTTCACTTTTGAACGCAGCAAGACGTAAAATAAGCTGCAGCACAACGCACACAC 
AGAACACTCGCTCTCCCTCTCACATACACACACGCTCGCGTGTGGAGACCGTGATTTGTATTCGTGTGGAGACTGCCTCTCTCTCTTTTGGGGCT 
GTCTGAACACAAAGTTTTGGCGCCAAAAGTTTGACAGTTCGCTTTGGCGTGTAATTCGCAAACAAGTGGCAAGAAAAGTGCGTCTCAAAACAGAG 
TAGAAAACACTAGGGGATCCCAAAAAAATATATATTGACTCAACATGGTCAGTGCCCGAGTCCTGAATCCTGTGATGATCAGTGAGTTCTGCAAG 
ATGAGCAGCAGCCCGGCGGTGAGTCGCAATCTGGCATGCGGTCGTCAGTTGAATCGCATAAAGCGTGACCTCTTCGGTAGCTCCAAATCCGCCGA 
GGGTACAGCCAACAAAACACCCTTTAATTCCGAACTGGAGCGCCATCAAGAACTGGCCACGCAAAAATGGGGCTTTGACTTCCGCGCCGGCTGCC 
CTTTAGCTGAAAAATCACCCTACATTTGGGAGCGCGTCAGCTTCCAGGAGTCGAGCTTCGCCCCAGAGATGTACACCCTAACACGGGCCGCCCAC 
GTACGCCCTTCCGCGGATGCAAGTCCTAGCGACATGGACATTTTGGTCAACGAGCGCTCAGAACGGGAGAACTTTGGCTCCAATCTGGTCAATTC 
CTCGCTAGAGTCGAACACGGATAATGAATCTTGTTACGACTCCCAGGACGAGTCGCTGGCAATGCGGCTTAGCTCCTCCAGCACGACAAGCACAT 
CCTCCATAGTCCTGCGCAAGAGACAGCCAAAAATCACAGAATTCATGAAGGAGCGCAAACGTCTGGCCCAGGCACCCAAGAAACTATCGCCCGCA 
AAGCGCATGCGCACCAGTTCTCCCAGTTCGTCCGCCACTTCCAGCTCCAGCTCCAGCGGTGGATTTGGGTTGGGGGCACACTTTGGGGCGATGTT 
GAAGCGGCCGCGCCACAACTAATGCTCCAGATTCGCTGGCCAACCCACCGAATTGCAATTTGTAATTTTATTTTTTACAAGCCACTGCGATTGTG 
TAGTATTTCCTAATTTTTTATTACGAACCTTTTTATGCAGTGCAAGGGCGACAGTTGCGAATATTAATCGTTAGTCTTCAACTACTCGTAACCAG 
TAATTAGTTGTAAGTCAACCAGTAAGCTCGAGCTAAATGAAACTTTTTAGATTTCTCTAGTGAGAAATGCCATAAACTAAGTGATTAGTTCTTAA 
CAACAAACAACTATTGAACAAGAAGAGAGAGACCGCAAGCTAAATGGAATTAGTCTTAACTATCTTGAAGCCAACCGAAATTTAACCTTTATAAC 
GATGGAATACTAACTTTACATAACTTTGTGAAATCCTTGCAAATTTCGCCGGCAACTTTTGCATTTGAAATTTGCCGGGTGAACAAAACTCATCC 
TTTCTCTGCCCTGCCACTTCATCCAAGTACTGAAACTGAGGGGAGAGTAGAATTGAGTTTTGTCTTGCATATTAAGGGCGCCAAAACGATGCCCC 
AATAAGCCGCATTTCGGCTCATTGCTGCTTTGTTTTTGCGACCTACTAACATTGTAACCAATTCGACCAGTGGAAAACAAAAAAAAAAAAGAGGA 
ATGCCTTTGAATTAACTTCGTCAATTTCTGCACGTTTGGATGCCTTATTTATATGAATGCCGCCGAATGGACACAAATTGTCGCGCTCTCATTTC 
TTAACCAGTGCTAAAAATGTCTTTTATGATTTGCTATGATCTCTGGGCAAATTATGAAAACAATTTTGCCTGAAACTATCATGTACAAAAACTAG 
ATTATTATACATACTATACACATAGGCAGAGCCTATTCTTCATAACACTTATAC 
( SEQ ID MO: 146) 

Start ATG: 615 (Reverse strand: CAT) 

MVSARVLNPVMISEFCKMSSSPAVSRNLACGRQLNRIKRDLFGSSKSAEGTAKKTPFNSELERHQELATQKWGFDFRAGCPLAEKSPYIWERVSF 
QESSFAPEMYTLTRAAHVRPSADASPSDMDILVNERSERENFGSNLVNSSLESNTDNESCYDSQDESLAMRLSSSSTTSTSSIVLRKRQPKITEF 
MKERKRLAQAPKKLS PAKRMRTS S PS SS ATS S SS S SGGFGLGAHFGAMLKRPRHN* 
(SEQ ID NO: 147) 

Name: cyclin-dependent protein kinase inhibitor 
Classification: enzyme_inhibxtor 
Gene Symbol: dap 
FlyBase ID: FBgn0010316 



Celera Sequence No. : 142000013384447 
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GCGCAGGAATTCCCTAATGTTCCCGACCTACATGTACTTCTGATAGTTGCCGAGGTCAAATGTTGTTGTATTTGTATTATACCTCAATATTGGTA 

TATTCAATATCTAATAGTACCCAATTCAATTGCAAAGATAGTCATTAAAAAAACCTAAATCACTTGCAAATTGACTTTTCTGCCGGAAAAGCAAC 

CTTGACACACAAAGTTAATTAGTTTATCTGGAAGTCATGTGAGAAATTTGTAAATAAATAAATAGAAAATTTTTCGCAGTAATTTAAGTGGGCCT 

AATCCCTTTTAAGCATCTTGGTTTTACGATGACACCGCAATAAGGTACAACTTTATATTGTTTTTGCAATCAGCTTGAGTCTTTATTAGGCATCA 

GTCTTTCTCTCTAAGTTTCTTCGTGCAATAAATGAGGTTCCAAACTCCGTAGATTTTTCCTTCTTTGTTGAATCCAGATCCTGCAAAGAAAAAAG 

AGCAAACCCCTAGGTCTGTCCAGGAATGTATTTTCGTGTTTGTCGACGGTCTGAGCTTTGGTGAAGCGTGAAAATGTGGCAGTCACTGAGTGGAA 

TCGATGGGGGGATTGACGCGTGCCCAACTGGTGGGCCAAACACTTCGGATTGCTACGGTTCGTATTGCTATTGAGCCAAAGGGACCCGGAGCGCA 

TCACTCCGGTCCAAAATGTAGGCGGATGTGTGTCTGTTTCGGGGAAAGATGGTCTTCTGTCGGCAAGACATATACATAGGTGTTAAATATGATTG 

CATTAGTTTTATACTACGTATAAGGAAGCTTTGATTGGATCGAAGCGACTGAAGGCGACCCTCGTCCTTGAAATTCGGTAGCAGTGATTATAACT 

CATAGCCCCCATTTGGAGCTAGGCTCCGCTCAGGTCTTTGAAAGTCTTGGGGGGCACAGTGCCCCTTTGATTTTCGGTTTGCTTTTGCTAAGTCC 

AACTCTCGTTTTTGGAACTTTTGGGGTGGTGGATGGAGAAGGGGGCTTTGTGGGGTTTGGCACACACGATACACACATCACTGATTGATCACTGC 

TCATGTTAACTTAAGTTCGTTCATGGTGTTGTTGTTGGTTGTGGTTGTTGTTGTTGGGCGTCTCTTTAAAGTCTCTTTCTCTCGCTTTCTCTCGC 

TACTTTTCGCTTTGGGAGGTAAATAAAGTTCACATTTTCTCTCTTTCATTTTGTATTTTTGTGGTTTAAAATATTTAATTAGCATTAAGATCTCT 

GCTGTATTGAAATTTGCTTAAGTCGTAAGTTTGTTTTTAATCTTAGTGTGGGTGGTGGGGGTGTTTGGTGAGCGGGACAGAGAGAAAGGGGAAAG 

AGAGTTGTATCGACTTTAGACGATAGCCGGTTTTATCATATTAACTTTTTGCTTATACCGTAACGCATTCAATATTTCTCTTTAGATTTTCACAC 

GCGAGAGGATTAGGATTACACTTTCCGATTCTAGCGAGGCCAGGAGCACGATCTCGTTTTAAACCTCGATGCGGCCTAGCGCCTGTTGTCCCGAT 

AGCTTCGCTGGCCGCCGCCGCCGCCACGCTCGCGAAACGGTCGCGAATCCCGCGAAGATCCGCCGCCACCGCTGCGATTTCCGTACTCCCGTCGA 

CGACCGCCCGCGGAGCCGTTGTTGCCGTCCCGATTCCGATAGCCGCCGGGTCCACCCATACCGCCACCGGATCCTCCACCGGCTCCCATGCCACC 

GCCCGCAGCCGCCGCCTGGGCAGCAGCTAACCCGCCAGCTCCATTGCTACCGCCATTGGCTGCTCCGCCGTAGACGGGGAAGCCAGGCGGTGGCA 

CTGTGGGTAGGCCAGCATTCTTCCAGGCAGCAAACTCCTCGTAGCTCGGCGCCTTGTCGAAGCCCGGTGGTGGTGGTCGCTGGGGAGGCGGATAA 

TCCGGTAGAGGCGGGGCACCGGGTCCATCGTAGTACCGCACCGGTGGCGGCAGCTGCTCGTACATGGCAGCTGCTGCTCCCGGTGGAAGGGCACC 

TGCTGCTCCCGCTCCAGCTCCTGGAATCGGCAGCTGGGCGAACGGAGGAACGAAGGGCAACGGCGGAAGTTGGCCGTGCATGTTGCGCATGTAGT 

CCGCCACGTATGCTTCAGCAGCTGCCGCCGAGGACATGTAGCGTTCCCATGACGGCATGGCCATGTCGCCCGGTCCGCCGGGTGGCGCTCCTGCT 

GCACCTCCCATCTGGCGGTACGGGGAACCGCTTCTTTTAAAGCCACCGCCCGGCGGAAGGCCCATGCCATGATGGTGACGGTACGGCCGATGGTG 

TGGTGCTCCGTGGTAGCTCCTCTTGTGCCGCATCGGGCCCAGGTGGTCGTGATTTGATCCTCTGCCTCCGCCACGACCTCCCCCTCCATGGCCAC 

GCGAGGATAGACTCCTGGGCGGCTTGTAGATCACGTGTATACCTTTTTCTCGCATAACTCGAGCCGTCTCCTTGGATTTCTTGAGTATGGGAGTG 

AGTTCGATTTGTTCATCCTCGGGAAAGAGACGACACAGCTCGCCGCCAATCTTGGGCTCAATCTCCTGGCCATCGCGACCGATCTTCACCGGCTT 

GCCCTCGTTCCAGGTGTTGTGCAGGTGCAGGGTGGCGTTGAAAGACAATTCCTTGCGGCAGATCCAGTCAAGCTCGATGACGCCGCCCAGTGCCT 

TGGGCGAAATACTAGGCGGCAATACCCAGGCCACCTGAGGAATGTCCCGCCGTGAGGGGGCTGCCATCCGGGCGAATCCTGCAAATTTACCTAAA 

AGCAAAATCGGATTATAATAGAGGTTTTGAAGACATAGACATTGTATGTATCTTATTCTTAAGAAACTAAGAAAGTATAAGAAATGAGAAACATT 

GTTTACTCCTTGAACCTACCACTCTCGTTGACCGAGAAAATGAGCAGGACGTTTCTGGCCTCCTTGAAAGCCTGGTTAAGATTGGCGTCGTTCTG 

CGGCAGCGTGGCCCACACACTCTTGTTCTTAGACAGCTGGACGTTGTCGCTATTGTTCGACTTGATGAGGAAGAACCGTGTGTCTCGGAATAGGT 

AGTTCAGCTTGGTCATGTAGTCGTAGCCCTTCTGCGAATGTGAGTGCGTACCGCCACCGACAGAATCTTTTTCCGGCGTGGAGCTCTTGGAACTG 

GGTTTGCCCCGGCCGTTGCCGCCACCATTTCCTCCGTTTCTCTTGTGCTGAGTGCCCGAATCCGAGTCAGAGGACTCGCTGTCGGACTCCAAAGT 

TGGAAGGCTTGGTTTCTCGTCCTCAGCCTCGCTGAGATCGCTCTTGTGGCCAGCAGAATCGTTGGCATTGGAGGGTATCTTGGTGCGCGACTTCT 

TGGCGGACGGCTGTCGATCCTCGGTATCCGGGGATTTGGATTTGACCTTATCCGACGCCTCAGATGAGGTTGTTTTCTTCGTCCCGCTATCCGTG 

CGCTTCTTTTTCTTCCCACCATTTAGCTCCTCGGGAGTGGGTTCCCGTTTGGCTTTGTTCTTGGCGGAAGATTTGGAGGATTTGGTCTCGACAGC 

GGGTTGAGGGCTTTGCTTGGCAGGCTTCTTGGTTTTCCGCTTGCTACTGCCCGCCAGGGAAGATGTGGCAGTGCTGACGGAGCTGATGCTCGGCT 

CCGAGTCGGATGAGTCTCCACCATTCGATTCGGAAGCCTCACTCCTTGTGTTGAACTCAAAGTCTTGCAGCTCCTCGGCAATGTCCGCCTCGTTC 

TCGTCGAGGCCCAGGTGCACTGCATCCAAGTCCGCCATCTCGCGCATCGGCAGCGTTTGTTTACCTGCTCCGAAATCGGAAATCGGAGTACCAGA 

T G AAT GG AGC AC C C AC GC AC AC G C AC AC AC GC AAAC CAAGAAAAAT AT AT AGAG AAAAAAG C C G A T C GAT C GGC AAGGT AAG AAAAGGT C AG G G A 

AATACAGGTATTCCCAAACTCACGGGCTGCTCTTGGCATTAATTGTGTGGACACCGGTCTAGCGCCACTTTTTGTTCACTTTTAACGCAATATAT 

C C C TT T AAT AT C GC G AAAT T AC GC C AAT T T T AC AAT GC T T AC AC AC AAAT T C T G C GTC AC A GC G GC G AC G AAT T GGC G AT GGC GTC AG AAG A GT G 

CGGCTATATGTCTGCAGGCCGCGACTATCGGACCACTATCGATAGGCGGATGCAGCTGAAAAAGGTAGCTAGAAATAGCTGAAAATGCACTGCAA 

TCGGTTATGTTATCGTTCCTAACGAAGCGAGGTAAACAAAATGCACCACAATCAGTCGGATTTCCCGACTTTTCCGCTTTCCTTGCTGGACGACA 

GCGATCTAGAGCCACAGAGATGTTATATACCCGCGTGGGAGAACTCTACTGGAACTGGATGGTACCGTTGTCACCTTTGCGATGGCAGCTACGCA 

GTTTGTTGGGTAACTCCGCGTGCAGGCAGTGAAACTTGTTGCTTCCTTGATGGAATTGTAACCTCCTTGACGCGGTTCTCCCCCACCAATCAGCG 

GTTGGTGCTTCTGGAGCCACTTAAATTAGTTCCCTGCGTGAAGAGGATTTCATGCGTGATTACAGTCACCCAGGAGCTCCTTCAGCGACCTCCGG 

TTTCCCTGGAGGATCTGAAGGACGATGTGCGAATCTTCCTGCGCCACCTGAAGCTTACCAAAGGATGCTCCGTGCAGCACAGACGCCTTGTGCAG 

CTGGGCATTGCCAGTGTGGAGATATTCGATGCTCCAGGAGTTGCTGATGACGAGTGTTTTGAAATAGCCCACGATGCTGAGCTACTCCTGGCGGA 

CATACGCCTGGGAATGCCCTTTTCTTTAGATTTTATCTTCAAGTATTTGCCACACAGCTTCGAGCGACCCATGGAGCAGTTGGAGAAGCTCCTGG 

AAACCTCTAAAATCAGCTTTTCGCATAGATTTCCTACTACAGCATTGGTTGTGGGTCCGGTGGACTGTGGAAAAAGTGGTCTACTCAGTGAGTTC 

CTGCGTCGTCATAGCTGCAACTGTTTTTACATTACAGCCAGCCAGGTGCTGCGCTCATATCCCGGCGAAACCGAAGAGGAGTTGAGGAGGATCTT 

TCAGGCTGCTCAGACTTTCAAGGAGAAACTA 

(SEQ ID NO: 148) 

Exon: 4066.. 3919 
Exon: 3772.. 2870 
Exon: 2750.. 1001 

Start ATG: 3934 {Reverse strand: CAT) 
Transcript No. : CT5320 

CCAATTCGTCGCCGCTGTGACGCAGAATTTGTGTGTAAGCATTGTAAAATTGGCGTAATTTCGCGATATTAAAGGGATATATTGCGTTAAAAGTG 
AACAAAAAGTGGCGCTAGACCGGTGTCCACACAATTAATGCCAAGAGCAGCCCCAGGTAAACAAACGCTGCCGATGCGCGAGATGGCGGACTTGG 
ATGCAGTGCACCTGGGCCTCGACGAGAACGAGGCGGACATTGCCGAGGAGCTGCAAGACTTTGAGTTCAACACAAGGAGTGAGGCTTCCGAATCG 
AATGGTGGAGACTCATCCGACTCGGAGCCGAGCATCAGCTCCGTCAGCACTGCCACATCTTCCCTGGCGGGCAGTAGCAAGCGGAAAACCAAGAA 
GCCTGCCAAGCAAAGCCCTCAACCCGCTGTCGAGACCAAATCCTCCAAATCTTCCGCCAAGAACAAAGCCAAACGGGAACCCACTCCCGAGGAGC 
TAAATGGTGGGAAGAAAAAGAAGCGCACGGATAGCGGGACGAAGAAAACAACCTCATCTGAGGCGTCGGATAAGGTCAAATCCAAATCCCCGGAT 
ACCGAGGATCGACAGCCGTCCGCCAAGAAGTCGCGCACCAAGATACCCTCCAATGCCAACGATTCTGCTGGCCACAAGAGCGATCTCAGCGAGGC 
TGAGGACGAGAAACCAAGCCTTCCAACTTTGGAGTCCGACAGCGAGTCCTCTGACTCGGATTCGGGCACTCAGCACAAGAGAAACGGAGGAAATG 
GTGGCGGCAACGGCCGGGGCAAACCCAGTTCCAAGAGCTCCACGCCGGAAAAAGATTCTGTCGGTGGCGGTACGCACTCACATTCGCAGAAGGGC 
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TACGACTACATGACCAAGCTGAACTACCTATTCCGAGACACACGGTTCTTCCTCATCAAGTCGAACAATAGCGACAACGTCCAGCTGTCTAAGAA 
CAAGAGTGTGTGGGCCACGCTGCCGCAGAACGACGCCAATCTTAACCAGGCTTTCAAGGAGGCCAGAAACGTCCTGCTCATTTTCTCGGTCAACG 
AGAGTGGTAAATTTGCAGGATTCGCCCGGATGGCAGCCCCCTCACGGCGGGACATTCCTCAGGTGGCCTGGGTATTGCCGCCTAGTATTTCGCCC 
AAGGCACTGGGCGGCGTCATCGAGCTTGACTGGATCTGCCGCAAGGAATTGTCTTTCAACGCCACCCTGCACCTGCACAACACCTGGAACGAGGG 
CAAGCCGGTGAAGATCGGTCGCGATGGCCAGGAGATTGAGCCCAAGATTGGCGGCGAGCTGTGTCGTCTCTTTCCCGAGGATGAACAAATCGAAC 
TCACTCCCATACTCAAGAAATCCAAGGAGACGGCTCGAGTTATGCGAGAAAAAGGTATACACGTGATCTACAAGCCGCCCAGGAGTCTATCCTCG 
CGTGGCCATGGAGGGGGAGGTCGTGGCGGAGGCAGAGGATCAAATCACGACCACCTGGGCCCGATGCGGCACAAGAGGAGCTACCACGGAGCACC 
ACACCATCGGCCGTACCGTCACCATCATGGCATGGGCCTTCCGCCGGGCGGTGGCTTTAAAAGAAGCGGTTCCCCGTACCGCCAGATGGGAGGTG 
CAGCAGGAGCGCCACCCGGCGGACCGGGCGACATGGCCATGCCGTCATGGGAACGCTACATGTCCTCGGCGGCAGCTGCTGAAGCATACGTGGCG 
GACTACATGCGCAACATGCACGGCCAACTTCCGCCGTTGCCCTTCGTTCCTCCGTTCGCCCAGCTGCCGATTCCAGGAGCTGGAGCGGGAGCAGC 
AGGTGCCCTTCCACCGGGAGCAGCAGCTGCCATGTACGAGCAGCTGCCGCCACCGGTGCGGTACTACGATGGACCCGGTGCCCCGCCTCTACCGG 
ATTATCCGCCTCCCCAGCGACCACCACCACCGGGCTTCGACAAGGCGCCGAGCTACGAGGAGTTTGCTGCCTGGAAGAATGCTGGCCTACCCACA 
GTGCCACCGCCTGGCTTCCCCGTCTACGGCGGAGCAGCCAATGGCGGTAGCAATGGAGCTGGCGGGTTAGCTGCTGCCCAGGCGGCGGCTGCGGG 
CGGTGGCATGGGAGCCGGTGGAGGATCCGGTGGCGGTATGGGTGGACCCGGCGGCTATCGGAATCGGGACGGCAACAACGGCTCCGCGGGCGGTC 
GTCGACGGGAGTACGGAAATCGCAGCGGTGGCGGCGGATCTTCGCGGGATTCGCGACCGTTTCGCGAGCGTGGCGGCGGCGGCGGCCAGCGAAGC 
TATCGGGACAACAGGCGCTAGGCCGCATCGAGGTTTAAAACGAGATCGTGCTCCTGGCCTCGCTAGAATCGGAAAGTGTAATCCTAATCCTCTCG 
CGTGTGAAAATCTAAAGAGAAATATTGAATGCGTTACGGTATAAGCAAAAAGTTAATATGATAAAACCGGCTATCGTCTAAAGTCGATACAACTC 
T C T T T CC C C T T T C T CT CT GT C C C GC T C AC C AAAC AC C C C C AC C ACC C AC AC T AAGAT T AAAAAC AAAC T T AC G AC T T AAGC AAAT T T C AAT AC AG 
CAGAGATCTTAATGCTAATTAAATATTTTAAACCACAAAAATACAAAATGAAAGAGAGAAAATGTGAACTTTATTTACCTCCCAAAGCGAAAAGT 
AGCGAGAGAAAGCGAGAGAAAGAGACTTTAAAGAGACGCCCAACAACAACAACCACAACCAACAACAACACCATGAACGAACTTAAGTTAACATG 
AGCAGTGATCAATCAGTGATGTGTGTATCGTGTGTGCCAAACCCCA 
(SEQ ID NO: 149) 

Start ATG: 133 (Reverse strand: CAT) 

MPRAAPGKQTLPMREMADLDAVHLGLDENEADIAEELQDFEFNTRSEASESNGGDSSDSEPSISSVSTATSSLAGSSKRKTKKPAKQSPQPAVET 
KSSKSSAKNKAKREPTPEELNGGKKKKRTDSGTKKTTSSEASDKVKSKSPDTEDRQPSAKKSRTKIPSNANDSAGHKSDLSEAEDEKPSLPTLES 
DSESSDSDSGTQHKRNGGNGGGNGRGKPSSKSSTPEKDSVGGGTHSHSQKGYDYMTKLNYLFRDTRFFLIKSNNSDNVQLSKNKSVWATLPQNDA 
NLNQAFKEARNVLLIFSVNESGKFAGFAPJytAAPSRRDIPQVAWVLPPSISPKALGGVIELDWICRKELSFNATLHLHNTWNEGKPVKIGRDGQEI 
EPKIGGELCRLFPEDEQIELTPILKKSKETARVMREKGIHVIYKPPRSLSSRGHGGGGRGGGRGSNHDHLGPMRHKRSYHGAPHHRPYRHHHGMG 
LPPGGGFKRSGSPYRQMGGAAGAPPGGPGDMAMPSWERYMSSAAAAEAYVADYMRNMHGQLPPLPFVPPFAQLPIPGAGAGAAGALPPGAAAAMY 
EQLPPPVRYYDGPGAPPLPDYPPPQRPPPPGFDKAPSYEEFAAWKNAGLPTVPPPGFPVYGGAANGGSNGAGGLAAAQAAAAGGGMGAGGGSGGG 
MGGPGGYRNRDGNNGSAGGRRREYGNRSGGGGSSRDSRP FRERGGGGGQRSYRDNRR* 
{SEQ ID NO: 150) 

Classification : known_f lybase_gene 
Gene Symbol: BcDNA:GH01918 
FlyBase ID: FBgn0027616 



Celera Sequence No. : 142000013384171 

ACACAATTTTTGAATTGTTTAGAAAAATAAGAACGATCTATTAAATGGCATAATTCAAAAATATCAAGTTTAAGCCATATACATGTATAAGTTAC 
TTTTAAGTTTGACCAGTACAAAGTATATAGGAATTTATAATGGGGAAGCCAAGCTATATTATAATGAATAATTATTATTATTATAATGAATAATA 
TC AAGAT ATCAGGGTTGTGCTTGCACCACAATTATGAAAAATAAAGTAATGACCCAGAAGTCTGTTATCATCTATTTATAAGCAGTTTGAAACAA 
ATTAAAACCCACTTTAAATTTTATCAAATAAATATATTACCAATATACATTTAAATTTCGAACATTACATAGCAAAAAAATGTCAATTGAAATCA 
ACAAATTTTTGAAAATGCGAGCCATATTTTCATTAAACGAGCTTGTATTTTTTGAACCCGAACCGTTTTTTTTTTCAAGTTTAAAAAGAATTAAC 
TAACTCTAACAAAATCTAATATTTCAGTCGGTGTAGTAAAAACATAAACATAAACGATTTCAAATGTATCCTGAATACGCGGTAAACTTGTGTTT 
ACCAACACCGCCTCTTAGACTATTCAGGCCTGCCAGATCTCCAAAAAAATATTATTTAGATAAGTTAGTGACACAAAAGGTTTAAGTCTTAACTG 
CGCTTAGCTGGGCTTTAAACCCACTGCCTATTCTTTGCTAATTGGAGTATTGCAAATCAGTTTTGAATCTGATTTATTTGTCCCGCTTACCAATT 
TTAATTGTGTAAATTCATTTGGTTTTACTTGGCCAACTTACAAATAATAGAATAATAAAAATGAAAGCCGTCTGTTTAATTATTAAAATAATGTG 
AATTAATTGGATTATTAATCTGGGCGTTATTATATAGAAAAGAAACCTAACTAACCAATTGCGGAAATATCGATAGCATCGCAATAAAATCTATA 
TTGTGCCACCTCCAATTCACACAGGTCGGTCACACTGGGCAGGCTTGCCAATTTCAAGAGCCGCGGCCACACTACTGCGAATTCTATACTGCAAC 
TCGATCGTGCGTGTTCTGTATTTTTGGGAGTGCGTTGTTAGAAATTCGGCCACAAATAGAATCCGCTGTCCTGGAACGAGTCAGAAACCACGGGA 
AATTGACTTCCGGACTAAATAGCGTTCTTGTGTGTCCATCCGTGTGCGTGTGTCCGCATTCTGCAGTGTGCGTGTGTGCGAAAGAGAGAGGGCGA 
GCGGTGCAGCCGCAACTGGGAAATTCGATTGCCAATATTTCAGCAAGGAAGCTATGTCGAGCGGTCAGCAATTGATGGCCGATTCGCCAGCCTGC 
AGCTCAAGTGACCATTCGCCGCTGGCCAAAAAAAGGCGTTTAGAGTTGTTGGGTGACGGAGTGGGGGCAATGGGAATGGAAGAGGAAGAAGCCGG 
CGGAGCAACGAGCAGCAGTAGCAGCAACAACTACGAAAATCACGACAAAGCAGGCGGCGTTGACATTTCGGGCAAGTGCCAGAGCAAGACGACTA 
GCGAGCACCAGCTGGAATTGTGCGTATCGTCGTCTACCTCTGACACATGCGGCGTTAATGCTGCGGCGAAGCGAGCCGAAAAGCAGAAACTGAAA 
CAGAGCCAAGGAAGCGGCGACGACGTTAGTACCTCGTCAGCAGTGGAAAAGGAAGGAAGCAAGCGGCCGGAGAAGCGCAATTACCTGAACACCAC 
CACAATAGAATCGCAAGAAGCCTCCTCCAGTGCATCCGCCACTAGCAACAGCAACAACAACGTCAGCTCCAGCAACAATAACAGTAATAACAGCA 
CTAATAACCGACAACTGGAAGGAAGCAGCAACATGGCTGGAAACAGTGCCGCCGCCGGCGGAGATATCGATGAATCGCTGTACTCGCGCCAATTG 
TACGTCCTGGGTCACGATGCGATGCGTCGGATGGCCAATTCGGATATCCTTCTGTCGGGACTTGGAGGACTCGGCCTGGAGATAGCCAAGAATGT 
GATTCTGGGCGGCGTGAAATCGATCACCTTGCATGACACAGCAACTTGTGGGGTATGTGTGTGGGGCGAAGAAAAAATCGATAATTTTCCAGGCG 
GACGCGGGAAATGCCCGGAAAGGCTGACTTGGCATCCCTGGTCTTAGTGAATCATTCAGTTTTTATTTGATTTATTTATCATAGACTTCGCGCGA 
AATAAGGTTAGGTTAACATCTCTATTTGCTTGCAAAAAATTCGTTTTGTGTGTGTAATTCGGTTTTCCTTCTTTTTCTATTTGCCTCAAAATGTC 
TGCGCCATGCGAAAAGTGTGTGAGCGGGACAACCGTATGCGCATCGCTCGCTGCCGTTGTGTTTGCATTCCCCTTTGCTCAACGCATTCGGTTTT 
GTTTTATGCACTGAAAGATTCAACTTAAGATCTATGAGTTTCTCATAAACCGTAATATATCAATTATTTAGTTAATAGCCTTTCAATTTAAAGTT 
TTCTTGTATCATAAAAACTTTTTTGTTCTAATGACAAATATGTCTTTGCTGTACACTCGGCACAGAACCAATTTTCCTTTGTTAGTCATGGCTCC 
CGTCTGCTAATTTCATCACCGTTGATATATAATATTAATGATTATTCGTTTGTCTTGCAGCTCCATGACTTGTCGTCGCAATTCTATCTCACGGA 
AGCCGATATCGGCAAGAATCGTGCGGAGGCCTCTTGCGCCCAGTTGGCCGAGTTAAACAACTATGTGCGCACCGTTTCGCACACGGGGCCGCTTA 
CCGAGGAGTTCCTGCGCAAGTTTCGCGTTGTGGTACTGACCAACTCCGACGGGGAGGAGCAGCAGAGGATCGCCAAGTTTGCCCATGAAAATGGC 
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ATTGCTCTGATCATCGCCGAGACGCGCGGACTGTTCGCCAAGGTGTTCTGTGACTTTGGCGAGAGCTTTACCATTTACGATCAGGATGGCACACA 
GCCGATCTCCACCATGATAGCTAGTATCACGCACGACGCACAGGGAGTCGTCACCTGCCTGGACGAGACGCGCCATGGTTTTAATGACGGTGACT 
ACGTCACCTTCTCGGAGGTGCAGGGCATGCAGGAACTGAACGGTTGTCAGCCCCTTAAGATCACAGTGCTGGGACCCTACACATTTAGCATCGGT 
GATACCAGCAAGTTTGGAGAATATAAGAGCGGAGGAGTGGCCACCCAGGTGAAGATGCCCAAGACCATCAGCTTCAAGCCCCTGGCACAGGCCAC 
TGAGGAGCCGGAGTTTTTGATCTCCGATTTCGCAAAGCTGGACTCGCCGGCCACGCTGCACGTGGCCTTCAATGCCCTCTCCTGCTACCGCAAGG 
CCCATAATGGAGCCTTACCGCGTCCCTGGAACGAGGAGGACGCCAACAGCTTCCTCGAGGTAGTCCGAGCCAGCAGCAACGCGGAAGTCGACGAA 
AAACTGGTGCTGCAATTCGCCAAAATCTGCTCGGGTAATACGTGCCCGCTGGACGCGGCCGTTGGTGGAATTGTGGCCCAGGAGGTGCTTAAGGC 
CTGCAGTGGAAAGTTTACACCCATCTACCAGTGGTTGTACTTTGACGCCCTGGAGTGTCTGCCCACAGAGGGTGTCGAGGAGGCGGATGCCCAGC 
CGGTGGGCTCGCGCTATGATTCTCAGATTGCCATATTTGGCAAGAAGTTCCAGGAGAAGTTGGCTGACTCGAAATGGTTTATTGTCGGCGCCGGA 
GCCATTGGTTGTGAGCTATTGAAAAACTTTGGAATGCTGGGCCTGGGAACTGGCAATGGCCAGATTTTCGTCACAGACATGGATCTCATTGAGAA 
GTCGAATCTGAACCGGCAGTTCCTGTTCCGCCCGCATGATGTGCAGAAGCCTAAATCAATGACGGCCGCCGACGCTATCAAACGGATGAATCCGG 
AAGTGAATGTCACCGCATACGAGTTGCGCGTTGGCGCCGAAACAGAGAAAGTCTTTTCCGAAGATTTCTTTGGAAAGTTGGATGGTGTAGCCAAT 
GCGCTGGACAACGTGGATGCTCGCATATATATGGATCGCAAATGCATCTTCAACCGCATTCCGCTCGTCGAGACCGGAACTCTGGGAACTTTGGG 
CAACGTTCAGGTGATTGTGCCCTTTGCCACCGAATCTTATAGCTCCTCCCAGGATCCCCCAGAAAAGAGTATTCCCATCTGCACGCTGAAGAACT 
TCCCCAATGCCATCGAGCACACTCTGCAGTGGGCGCGTGACGCCTTTGAGGGCGTCTTCAAGCAGTCTGCAGAAAATGCTGCTCAGTACATCGCC 
GATCCTCAGTTCACCGAGCGAATCGCCAAGCTGCCTGGTATTCAGCCTCTGGAGATCCTCGATTCCATCAAGGTAAGGATAGAATTTACTTCTTT 
GCCTAATATGTTGTACTTAATGGTAATTTATTTTTACAGAAAGCTCTGATCGACGACAAACCCAAGAGTTTCGCGCACTGCGTGGAGTGGGCCCG 
TCTTTATTGGGAGGATCAATATGTTAACCAGATTAAGCAGTTGCTCTTTAACTTCCCACCTGATCAGATCACCTCCAGTGGTCAGCCGTTTTGGT 
CTGGTCCCAAGCGGTGCCCCGATCCCCTGGTCTTTGATGTCAACGATCCCATGCACCTCGACTTTATCTACGCAGCCGCCAATCTTCGAGCTGAG 
GTCTATGGCATTGAGCAGGTTCGCAATCGCGAGACGATCGCAGAGCTGGTACAGAAGGTTAAGGTTAGTTGTTCGAGATTAATACCGTCGGGGAC 
CATTTCTGATATTATTGATTTATAGGTGCCCGAGTTTAAACCCCGTTCGGGCGTGAAAATAGAAACCAATGAGGCAGCAGCCGCCGCCTCGGCCA 
ATAACTTCGATGACGGCGAGTTGGACCAGGATCGCGTTGACAAGATCATTTCAGAGCTGCTAAAGAACGCCGACAAAAGCTCCAAGATCACGCCA 
CTTGAGTTTGAAAAGGACGACGACAGTAACTTGCACATGGACTTCATTGTGGCCTGCTCAAATCTACGCGCCGCCAACTACAAGATTCCGCCAGC 
GGATCGCCACAAGTCCAAGTTGATTGCTGGCAAAATCATACCTGCTATTGCCACCACCACGTCGGTGCTCTCTGGATTGGCTGTGCTCGAGGTGA 
TTAAGCTGATCGTCGGCCACCGGGATCTCGTAAAATTCAAGAACGGCTTCGCCAACTTGGCATTGCCGTTCATGGCCTTCTCTGAGCCGCTTCCT 
GCCGCCAAGAACACATACTACGGCAAGGAGTGGACGCTGTGGGACCGTTTTGAGGTGACCGGGGAGCTGTCGCTGCAGGAATTCCTCAACTATTT 
CGAGGAGAATGAGAAGCTTAAGATCACCATGCTCTCGCAGGGCGTGTCCATGCTGTACTCGTTCTTCATGCCCAAGGCCAAGTGCTCTGAGAGAC 
TGCCGTTGCCGATGTCGGAGGTCGTGCGTCGCGTGTCCAAGCGTCGCCTGGAGCCACACGAGCGATCCCTGGTTTTCGAGATCTGCTGCAACGAT 
GTGGATGGCGAGGATGTGGAGGTGCCCTACGTCCGTTACACGCTGCCCTAAGCCCGTCTATATTCCTAATTTAACCATTAAATTATTTTACGATC 
CCGTTTTGTTGTCGGCGACGCGTTTTGGAATCGCGCAGCTCCAACGAAGGAGTTACTGTCTAGCTGTTTGCATTTTAATTTTCACAACACCATTA 
CGAGAGCAGCCCAGATCTAGAAATTATATAAAATGTGTGTGAATCCTCTAAGGAATGGCAAAATAATACGCATCATACGAAATACAAAGACTCGA 
CTAAAAGCAACACTTAAAATGCATCTGATGAAACTCTCTGCTCAGCATAGTTGTAAAGGTAACGAATTACGAAAAACCGAAGGCGGATTTTAATC 
GCATTGATTAATTGGAAATTGAAAACTGAATGAGCCAGGCGGAAAATTTCTCTTCACTCATATATTTTTTTTATATGGAAGCATAAACTAATTAC 
GTCATTGTATTAAAAGGAGTAAAGCAAATGAATAAAGCTGTATATTTCATATGGTAATCGACCTGCTAGACGACTCAGATAATTTCGTTCAGCAC 
TTGACGTCTGGTAATCTCATTTCCCCCGCCACTTCGGAAATTAAAATTGTAATGCATACTGCTCTGCCAGTGAAGTATATAGGGGATGCCATCTA 
TCCGTACGCCAGTCAGTTTCAGAAAGTTACGTGACACAATTAAATATTTTATGCCCTCTCAAGACGTTTATACCACCCCCGCCCCGCACCCAAGG 
AGGCTGCTGCCACCAAAGTGCAATTGGCAAGTGCCGTAAAGTTGCACCGCTCATGGAAAGCGTTGGGCGGAGTGGCGGCGTCCTCTTATAATTAA 
ATATTACACGCAATGCAACTGAAGTATGGCCAAAGTTATGCAGTTTCGCATCTCACTCTTGGCCTTTTGCGGCTTTTTGGCAAACTTGCTTGAAT 
TATGCAGTTAATGAAAACAAAGCGAGGCGAGGAAAGTGGCTGCGCCGATCAGCTTTGTATCCGTCGAATAAGTGTGTTCTGGGCAGGGTAATCAA 
AGTTTTGGGTTTGTAGTTCGACTCCTTAGGGCTCTGATTGTTAGCATCTAAGTATTTAAGATTTACATATAAGATCGTCTTATTACTTAGAGCTT 
ACTTTAAAAAGATGAGTGAACAAACTATTTACAAGGCAAAGTGGAAATGCTGGGTCAGAATTAAGATTAATACCTATCTAAATTTGAGGTGAAAG 
TAGGTGACTAAGTTTTCAAATAGAGCTGGGAAAATTAGAGATACATTACAGGAGTACCACAAGAGATTTAGAATATGAATATTCCTGTTCTAACA 
CCACATTACAGTACATTTTTATGTTTTTTAAGATTAGTTTCTCAAAAATGAACAGACAATTTTGTTTTATGAAAAGAGTAATCCAAATATTATTC 
CATATTATTCATTTCGTTCTATTGGTTTTGTTAAGCTGGACTTCTGAATCACTGAAATGTAAAGACTTACATGTACATTTTTCGATCGTACATTC 
ATATGTGCAT 
(SEQ ID NO: 151) 

Exon: 1001.. 2047 
Exon: 2626. .4347 
Exon: 4410. .4718 
Exon: 4776.. 6040 
Start ATG: 1838 

Transcript No. : CT5340 

ATTTCAAGAGCCGCGGCCACACTACTGCGAATTCTATACTGCAACTCGATCGTGCGTGTTCTGTATTTTTGGGAGTGCGTTGTTAGAAATTCGGC 
CACAAATAGAATCCGCTGTCCTGGAACGAGTCAGAAACCACGGGAAATTGACTTCCGGACTAAATAGCGTTCTTGTGTGTCCATCCGTGTGCGTG 
TGTCCGCATTCTGCAGTGTGCGTGTGTGCGAAAGAGAGAGGGCGAGCGGTGCAGCCGCAACTGGGAAATTCGATTGCCAATATTTCAGCAAGGAA 
GCTATGTCGAGCGGTCAGCAATTGATGGCCGATTCGCCAGCCTGCAGCTCAAGTGACCATTCGCCGCTGGCCAAAAAAAGGCGTTTAGAGTTGTT 
GGGTGACGGAGTGGGGGCAATGGGAATGGAAGAGGAAGAAGCCGGCGGAGCAACGAGCAGCAGTAGCAGCAACAACTACGAAAATCACGACAAAG 
CAGGCGGCGTTGACATTTCGGGCAAGTGCCAGAGCAAGACGACTAGCGAGCACCAGCTGGAATTGTGCGTATCGTCGTCTACCTCTGACACATGC 
GGCGTTAATGCTGCGGCGAAGCGAGCCGAAAAGCAGAAACTGAAACAGAGCCAAGGAAGCGGCGACGACGTTAGTACCTCGTCAGCAGTGGAAAA 
GGAAGGAAGCAAGCGGCCGGAGAAGCGCAATTACCTGAACACCACCACAATAGAATCGCAAGAAGCCTCCTCCAGTGCATCCGCCACTAGCAACA 
GCAACAACAACGTCAGCTCCAGCAACAATAACAGTAATAACAGCACTAATAACCGACAACTGGAAGGAAGCAGCAACATGGCTGGAAACAGTGCC 
GCCGCCGGCGGAGATATCGATGAATCGCTGTACTCGCGCCAATTGTACGTCCTGGGTCACGATGCGATGCGTCGGATGGCCAATTCGGATATCCT 
TCTGTCGGGACTTGGAGGACTCGGCCTGGAGATAGCCAAGAATGTGATTCTGGGCGGCGTGAAATCGATCACCTTGCATGACACAGCAACTTGTG 
GGCTCCATGACTTGTCGTCGCAATTCTATCTCACGGAAGCCGATATCGGCAAGAATCGTGCGGAGGCCTCTTGCGCCCAGTTGGCCGAGTTAAAC 
AACTATGTGCGCACCGTTTCGCACACGGGGCCGCTTACCGAGGAGTTCCTGCGCAAGTTTCGCGTTGTGGTACTGACCAACTCCGACGGGGAGGA 
GCAGCAGAGGATCGCCAAGTTTGCCCATGAAAATGGCATTGCTCTGATCATCGCCGAGACGCGCGGACTGTTCGCCAAGGTGTTCTGTGACTTTG 
GCGAGAGCTTTACCATTTACGATCAGGATGGCACACAGCCGATCTCCACCATGATAGCTAGTATCACGCACGACGCACAGGGAGTCGTCACCTGC 
CTGGACGAGACGCGCCATGGTTTTAATGACGGTGACTACGTCACCTTCTCGGAGGTGCAGGGCATGCAGGAACTGAACGGTTGTCAGCCCCTTAA 
GATCACAGTGCTGGGACCCTACACATTTAGCATCGGTGATACCAGCAAGTTTGGAGAATATAAGAGCGGAGGAGTGGCCACCCAGGTGAAGATGC 
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CCAAGACCATCAGCTTCAAGCCCCTGGCACAGGCCACTGAGGAGCCGGAGTTTTTGATCTCCGATTTCGCAAAGCTGGACTCGCCGGCCACGCTG 
CACGTGGCCTTCAATGCCCTCTCCTGCTACCGCAAGGCCCATAATGGAGCCTTACCGCGTCCCTGGAACGAGGAGGACGCCAACAGCTTCCTCGA 
GGTAGTCCGAGCCAGCAGCAACGCGGAAGTCGACGAAAAACTGGTGCTGCAATTCGCCAAAATCTGCTCGGGTAATACGTGCCCGCTGGACGCGG 
CCGTTGGTGGAATTGTGGCCCAGGAGGTGCTTAAGGCCTGCAGTGGAAAGTTTACACCCATCTACCAGTGGTTGTACTTTGACGCCCTGGAGTGT 
CTGCCCACAGAGGGTGTCGAGGAGGCGGATGCCCAGCCGGTGGGCTCGCGCTATGATTCTCAGATTGCCATATTTGGCAAGAAGTTCCAGGAGAA 
GTTGGCTGACTCGAAATGGTTTATTGTCGGCGCCGGAGCCATTGGTTGTGAGCTATTGAAAAACTTTGGAATGCTGGGCCTGGGAACTGGCAATG 
GCCAGATTTTCGTCACAGACATGGATCTCATTGAGAAGTCGAATCTGAACCGGCAGTTCCTGTTCCGCCCGCATGATGTGCAGAAGCCTAAATCA 
ATGACGGCCGCCGACGCTATCAAACGGATGAATCCGGAAGTGAATGTCACCGCATACGAGTTGCGCGTTGGCGCCGAAACAGAGAAAGTCTTTTC 
CGAAGATTTCTTTGGAAAGTTGGATGGTGTAGCCAATGCGCTGGACAACGTGGATGCTCGCATATATATGGATCGCAAATGCATCTTCAACCGCA 
TTCCGCTCGTCGAGACCGGAACTCTGGGAACTTTGGGCAACGTTCAGGTGATTGTGCCCTTTGCCACCGAATCTTATAGCTCCTCCCAGGATCCC 
CCAGAAAAGAGTATTCCCATCTGCACGCTGAAGAACTTCCCCAATGCCATCGAGCACACTCTGCAGTGGGCGCGTGACGCCTTTGAGGGCGTCTT 
CAAGCAGTCTGCAGAAAATGCTGCTCAGTACATCGCCGATCCTCAGTTCACCGAGCGAATCGCCAAGCTGCCTGGTATTCAGCCTCTGGAGATCC 
TCGATTCCATCAAGAAAGCTCTGATCGACGACAAACCCAAGAGTTTCGCGCACTGCGTGGAGTGGGCCCGTCTTTATTGGGAGGATCAATATGTT 
AACCAGATTAAGCAGTTGCTCTTTAACTTCCCACCTGATCAGATCACCTCCAGTGGTCAGCCGTTTTGGTCTGGTCCCAAGCGGTGCCCCGATCC 
CCTGGTCTTTGATGTCAACGATCCCATGCACCTCGACTTTATCTACGCAGCCGCCAATCTTCGAGCTGAGGTCTATGGCATTGAGCAGGTTCGCA 
ATCGCGAGACGATCGCAGAGCTGGTACAGAAGGTTAAGGTGCCCGAGTTTAAACCCCGTTCGGGCGTGAAAATAGAAACCAATGAGGCAGCAGCC 
GCCGCCTCGGCCAATAACTTCGATGACGGCGAGTTGGACCAGGATCGCGTTGACAAGATCATTTCAGAGCTGCTAAAGAACGCCGACAAAAGCTC 
CAAGATCACGCCACTTGAGTTTGAAAAGGACGACGACAGTAACTTGCACATGGACTTCATTGTGGCCTGCTCAAATCTACGCGCCGCCAACTACA 
AGATTCCGCCAGCGGATCGCCACAAGTCCAAGTTGATTGCTGGCAAAATCATACCTGCTATTGCCACCACCACGTCGGTGCTCTCTGGATTGGCT 
GTGCTCGAGGTGATTAAGCTGATCGTCGGCCACCGGGATCTCGTAAAATTCAAGAACGGCTTCGCCAACTTGGCATTGCCGTTCATGGCCTTCTC 
TGAGCCGCTTCCTGCCGCCAAGAACACATACTACGGCAAGGAGTGGACGCTGTGGGACCGTTTTGAGGTGACCGGGGAGCTGTCGCTGCAGGAAT 
TCCTCAACTATTTCGAGGAGAATGAGAAGCTTAAGATCACCATGCTCTCGCAGGGCGTGTCCATGCTGTACTCGTTCTTCATGCCCAAGGCCAAG 
TGCTCTGAGAGACTGCCGTTGCCGATGTCGGAGGTCGTGCGTCGCGTGTCCAAGCGTCGCCTGGAGCCACACGAGCGATCCCTGGTTTTCGAGAT 
CTGCTGCAACGATGTGGATGGCGAGGATGTGGAGGTGCCCTACGTCCGTTACACGCTGCCCTAAGCCCGTCTATATTCCTAATTTAACCATTAAA 
TTATTTTACGATCCCGTTTTGTTGTCGGCGACGCGTTTTGGAATCGCGCAGCTCCAACGAAGGAGTTACTGTCTAGCTGTTTGCATTTTAATTTT 
CACAACACCATTACGAGAGCAGCCCAGATCTAGAAATTATATAAAATGTGTGTGAATCCTCTAAGGAATGGCAAAATAATACGCATCATACGAAA 
TACAAAGACTCGACTAAAAGCAACACTTAAAATGCATCTGATGAAACTCTCTGCTCAGCATAGTTGTAAAGGTAACGAATTACGAAAAACCGAAG 
GCGGATTTTAATCGCATTGATTAATTGGAAATTGAAAACTGAATGAGCCAGGCGGAAAATTTCTCTTCACTCATATATTTTTTTTATATGGAAGC 
ATAAACTAATTACGTCATTGTATTAAAAGGAGTAAAGCAAATGAATAAAGCTGTATATTTCATATGGT 
(SEQ ID NO: 152) 

Start ATG: 838 

MAGNSAAAGGDIDESLYSRQLYVLGHDAMRRMANSDILLSGLGGLGLEIAKNVILGGVKSITLHDTATCGLHDLSSQFYLTEADIGKNRAEASCA 

QLAELNNYVRTVSHTGPLTEEFLRKFRVVVLTNSDGEEQQRIAKFAHENGIALIIAETRGLFAKVFCDFGESFTIYDQDGTQPISTMIASITHDA 

QGVVTCLDETRHGFNDGDYVTFSEVQGMQELNGCQPLKITVLGPYTFSIGDTSKFGEYKSGGVATQVKMPKTISFKPLAQATEEPEFLISDFAKL 

DSPATLHVAFNALSCYRKAHNGALPRPWNEEDANSFLEVVRASSNAEVDEKLVLQFAKICSGNTCPLDAAVGGIVAQEVLKACSGKFTPIYQWLY 

FDALECLPTEGVEEADAQPVGSRYDSQIAIFGKKFQEKLADSKWFIVGAGAIGCELLKNFGMLGLGTGNGQIFVTDMDLIEKSNLWRQFLFRPHD 

VQKPKSMTAADAIKRMNPEVNVTAYELRVGAETEKVFSEDFFGKLDGVANALDNVDARIYMDRKCIFNRIPLVETGTLGTLGNVQVIVPFATESY 

SSSQDPPEKSIPICTLKNFPNAIEHTLQWARDAFEGVFKQSAENAAQYIADPQFTERIAKLPGIQPLEILDSIKKALIDDKPKSFAHCVEWARLY 

WEDQYVNQIKQLLFNFPPDQITSSGQPFWSGPKRCPDPLVFDVNDPMHLDFIYAAANLRAEVYGIEQVRNRETIAELVQKVKVPEFKPRSGVKIE 

TNEAAAAASANNFDDGELDQDRVDKIISELLKNADKSSKITPLEFEKDDDSNLHMDFIVACSNLRAANYKIPPADRHKSKLIAGKIIPAIATTTS 

VLSGLAVLEVIKLIVGHRDLVKFKKGFANLALPFI^FSEPLPAAKNTYYGKEWTLWDRFEVTGELSLQEFLNYFEENEKLKITMLSQGVSMLYSF 

FMPKAKCSERLPLPMSEVVRRVSKRRLEPHERSLVFEICCNDVDGEDVEVPYVRYTLP* 

(SEQ ID NO: 153) 

Name: Ubxquitin activating enzyme 1 
Classification: enzyme 
Gene Symbol: Ubal 
FlyBase ID: FBgn0023143 



Celera Sequence No. : 142000012789666 

TATTTTCTTTTTTTTTTAGTTTTCTTCAAGAACCCACAAATTCCCGTTTGGCTAGCACTGCATTTCGCCATCTCTTTCTGATTCGCTCGCTGGAT 
CGGCGTCCCTCTCTCTCTTCCCAACGCCCACTGTGCGTCGCAACTGGTTTAACGTTCTGATTCGACTTTTATGGCTGCCGCTGCAGCACGGCCGT 
GGGTCGCCTCTGATCGGAATTTCAAAACAGTGGAGCGTGTTGTGGATGTTGGGGCAACAGTGGGTGGCATGTGTGGTGCACAGTGGTCACACACG 
GCACAATTCGTGGTTGAAATGTGTGAGGGACTTTAAAAATTTTATAAGAAACCGATTGGTTTGCAAGTGAAACAATAAGCGGTGTTTTCATTTTT 
TTGGTTAGCTTTTATTTGCGTGCAATACAAACTATGACAAAATTTGCCTAATAAATAGAGCCTAATTTTTGGGCCGCTCGATGAGCCTAAAGACT 
ACGAGTTGCGGATCAAATGGATTGATTTGGACAAGGACATGGACCGGACATAAGGCTAGACAGTACAGATTTATACTATGCGATCGGATATCGTA 
TACAATTTTGGTTCAGCTGCTCGGAAATCGCATTCGAGTTCGCGTTAAACATGTGTGTGtGTGTGTGGTGTGTGTGTGTGGTGTGTGTGTGGGTG 
GGTGTGTGTGTGCATAGAATTCAGTTATATTGCTTATATAAATAACTCTGCTGCCTTGCAAACATAAGGAATTATTTTTGGATCACAATAAAATT 
ATAAAAACATATATCTATATATATATATATGTATATACATATAGTATATAACGAAGGTAATAACGCAAAACCAATATGCAATGCACAGGTTTTGG 
AACCACCAAGCATTGGGGCACTGGGAAGGACTAGATAACAAGAGCAGCACGGTCTGCAAAATATAACTAAGCGCGGAGACTAACAGCTAACAGAC 
CACAGGAGTATTTCAAGTACAAAACATTATCGGCCGCAGCGGCGATCAGCTCAGAGCGTCTCCCTGGTGTCAAGCTCAACCAATGGCGGCACTGG 
CTCCAACTCCGCACTCTGGCTTCTGCTCCTACTTAGGCTGGGACTGGGGTTGCTCCTTGAACTTGGACTGGTGTGGTGGCTTGCGTCTGCACTGA 
TATTACTTGGCTGATTTTGTTCATGGTTGTTATGCGGCGACGAGGTTGCCGGGGTCAGAGGAGCCGGAAGTGCTGGCGTGCCGGGCGGGGAGAGT 
GCTGGCGGAGTGAGTGGGGCAACGGCCACTTCCCTTGTCTGTGGCGGCCAGTGGATGGACCTTAAAACGCTTTGGCTGCTGCTGGAGGAAGTGGA 
GGGACTTTCAGCATGGCCTGAAGCCAACGAGGATCCCGAGGTAGAGGCAATATTGTCATCGTCAGTCTTGGCCGTCTTCTTCTCCTTCTCCCTCT 
TCTCGCGCTCCTTCTCCATGTCGGCTATCTTCTTTAAGTAGTTTTTGTCCTGGGTCGGACATTCGAGGTGGACGCAGTGGAAGATCTAAGAGCAA 
GGAGAGCATTAGCTTATCGGACATGTTTTGTGGCGCTGCACTGCTACGTACCTCATTCGCTGTGTTGCGTGTGCCCACAATGCGGATATAGACCA 
CTGGACGCGGAGTAAAGTGAAAGTTCTGCCAGGAGCGGGTCCTGTCGTTGCGGCGGTCCACGACCATTTGCCATTCCTTGCGGTTCGTCGAAATT 
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TCGATGTAGAAGCTGTAGGTGCGATCATCGCAGTCCCACAGCAACAGCCTCATTGAGCCCAAATAGTAGGGTTGGCCCAAACGGACGACGATCTC 
GCCGCTGCCCAATTGGTGGCAAGTATATCCCGAGTCCCAGTCGTAGCGCACATAATCGCCGTTGATAAGCGCGTTGCGGGTGCGACTCACGCCGT 
CCGTCACAATGGCGCTCATCTCTACGGTGGCCACGTTCGTCTTGGGCGCCACAAAATGGTTAACTAAACGCGGCACCTTGGCCGTGTGCATGGCT 
TCCAGACCCACAACATGGAATACCTATATATTAATGGAAAGGGAAAAAGTTAAAGAATGGGGAAAGTTTGAAGAGATCACTGCTGACTTACCCGG 
TTTACAGTGTTCTGGGTGCCCACCAGTCGAATGAATCTGACGGGTCGTGCTTCGAAGTATAAGTATTGCCAGGAGCGGCAGTGGTAGTCGCTGTA 
GTCGACCACGCGATCCCAATGCTGCTGGTCGCCGGACACCTCAACGTAGTAGGAGTACGCCCGGCTGTCTCTGTCCCAGAGCAGCATGCGAATGT 
GATTGATCATGCAGAACGTGCCCAATTCGACCACAATGCCCGCATCTTTGCTGTCCGTAATGCAGTGGCGCGTATACCCGTTCTCCATGTCATAG 
GTGGTCACATCTCCATCCAGCAGCGCGTCGCGACACTCTCCCTGGATGCAGCGCGACAAAAACGTCTCCGCGGCCACGTTTTCCTCCGGCCAGAG 
CGCCGCCCTGTAGGGCAGCGCCTTTGAAGTGCTGCGCTCGTCGATGGCGTCGAGAATCTTGTCGGGATCCAGGATGCCCGAGGGACGCACCACCT 
GCAGCAGATGCTCGAGGTTCATCAGTGGCAGGCGGACATAACTAACCACGGACTTGAAGTCCACGTTGCTATTGAAGCGGCTCCACTTCCACACG 
G C C AAAAAG AT C T GC AC C T C G GG G GCG AAG AAGC AGT C ACG C C G T AG T AC C T C CT C C AGC G AT T C C T GT AAGC AT T AC AT ACT C T T AGT G C C AAC 
TTTACCAATTGGTTGTGAACTTACCTTGGATAGCGTATTGAAGCTGTTGTGCAGCAGCAGGTCGCCGGCATTGCGATCCATGAACATTAGGCAAA 
CCTCGGTGAGTTCCTCTAGATTGTATAGACGCGCCGCATCAAGAATCATGCAGACATTGTCCAAGGCCAGATACTGGCGTAAATAATTCGATATG 
GCCATCTCCAGATCCTGGAAACCATACTGATTGGCCATGCCCAGCACATCGATGGTGGAATCCTCGTCCAATGTGGACAGCAGCAGTGTGCCCGA 
GTAAATGTAGCGCAGCAGCACCTTGAACGCTTCCAGCGGCACCTCAAGCGGTATTTGGCGCTGTGTGGTCTCCGCCATGCCGCCGTACAGCAGCG 
CCCGGAAGTATTCGCTGCGTGCCGCGAGTATGACGCGGTGGGCGGGTATGCGCTCCTCCTCGACGATGAACTCCACGTCCGCGTACTGTTCGTTC 
ATGCAGAGGCGCGCCATGTCCGCTGAGAAGCGATCGCCCAGATCGACCACGTCGGTGTAGTCCTGCTCCATGGCGCCCTTCTTGCCGCCGCCGGA 
CATCTTATGGTGTCCCTGGCTGCTCATCCTTCTCTTTTTTGTTTTTCTTCCGTGCGATGTGTATTTATGTGTGTAAGCAGCCAGAGGGAACAGTT 
GAC AAT T AAC GTT T C G T C T C GC CT GCT GC GT C AAT C C AT C G AT G AC AGT T AGT GAT AT C G AT AAT AT C GAT GGC AC AAT AGGG AT TCTTCTATTG 

CGGCGAAAAACTGTTATATGTGCAGTGTGACCGTAGGTGTGTATACCATTTAACGCGTACAACAACCATATCGACAGCCGTCTGCTAGCGAGCAC 
ATGAACCACATGGTTGCATCCGATAACATTTAGCGAATACATGAACCAAATGGTGTCTGTCTAATATGCCACCACTATTAGCGCGTTTCGGGTTG 
GTGTCTCTGCGCGCTCACTGGTCACACACGGTCAGTCTAGCCACAGCAGTTAGCGAGTGCAGTCACCACAGTGCGAGTGCTTATCGATAAAACCA 
GCAAGTCATCGATTTTTAGACCGAAGCGCTCTGTCAACTGCTGCGCACGGTCACTCTACACAAGTATTTCGTACCGTTTGTACGAGAAAACAGAG 
AAGAGAAAAACACAAGCTCAGCAACGCATCGATATTCGGCACAGTCCCAAAAGCAAACGAAGTGATAGAAGCTGCGTTTCTTAGCCAATTACCTC 
TTCATTGCAATCATGAAGTTCCAATACAAGGAGGAGCACGCCTTCGAGAAGCGTCGCGCCGAGGGCGACAAGATCCGTCGCAAATATCCAGACCG 
TGTGCCCGTAAGTACTCAAAAATATATATATAAATGCATATAGGTATAAAGAGACGTGTGTGTGTGTGTATATATGCGTGCGAGTGTGTGTGTAT 
GTGCGAGCTTCTGTGTATGCGTTTGTTTTGCGTCGTTTTTTTTCTCGCTCTTTTTTTTTGTGTTTGCACCGAAGAAAATCAATACTGCGCATATT 
GGGTATTTCGATTGTTATTACAATATTATTGCACTGGCACATCAGGTGGCTGGGGAAATTTGTGCGATGGGTATTGCATAAAGTGCATTTTGCAT 
TACTGGTGGGGGGAACAGGCGACAAGCTCTCAAAAGAAATTCAAACGAAAATTCTGCCCCCTGCATACACACATACGCACGCATACACGCGCACA 
CAGGCACACAGATGTCGACAGGAAAAAACAAAAGCAAAAAAAGGGAGAGAACAAAGCAGC 
{SEQ ID NO: 154) 

Exon: 3525.. 2780 
Exon: 2725.. 2087 
Exon: 2018.. 1572 
Exon: 1510.. 1001 

Start ATG: 3352 (Reverse strand: CAT) 
Transcript No. : CT5456 

TTTTTCGCCGCAATAGAAGAATCCCTATTGTGCCATCGATATTATCGATATCACTAACTGTCATCGATGGATTGACGCAGCAGGCGAGACGAAAC 
GTTAATTGTCAACTGTTCCCTCTGGCTGCTTACACACATAAATACACATCGCACGGAAGAAAAACAAAAAAGAGAAGGATGAGCAGCCAGGGACA 
CCATAAGATGTCCGGCGGCGGCAAGAAGGGCGCCATGGAGCAGGACTACACCGACGTGGTCGATCTGGGCGATCGCTTCTCAGCGGACATGGCGC 
GCCTCTGCATGAACGAACAGTACGCGGACGTGGAGTTCATCGTCGAGGAGGAGCGCATACCCGCCCACCGCGTCATACTCGCGGCACGCAGCGAA 
TACTTCCGGGCGCTGCTGTACGGCGGCATGGCGGAGACCACACAGCGCCAAATACCGCTTGAGGTGCCGCTGGAAGCGTTCAAGGTGCTGCTGCG 
CTACATTTACTCGGGCACACTGCTGCTGTCCACATTGGACGAGGATTCCACCATCGATGTGCTGGGCATGGCCAATCAGTATGGTTTCCAGGATC 
TGGAGATGGCCATATCGAATTATTTACGCCAGTATCTGGCCTTGGACAATGTCTGCATGATTCTTGATGCGGCGCGTCTATACAATCTAGAGGAA 
CTCACCGAGGTTTGCCTAATGTTCATGGATCGCAATGCCGGCGACCTGCTGCTGCACAACAGCTTCAATACGCTATCCAAGGAATCGCTGGAGGA 
GGTACTACGGCGTGACTGCTTCTTCGCCCCCGAGGTGCAGATCTTTTTGGCCGTGTGGAAGTGGAGCCGCTTCAATAGCAACGTGGACTTCAAGT 
CCGTGGTTAGTTATGTCCGCCTGCCACTGATGAACCTCGAGCATCTGCTGCAGGTGGTGCGTCCCTCGGGCATCCTGGATCCCGACAAGATTCTC 
GACGCCATCGACGAGCGCAGCACTTCAAAGGCGCTGCCCTACAGGGCGGCGCTCTGGCCGGAGGAAAACGTGGCCGCGGAGACGTTTTTGTCGCG 
CTGCATCCAGGGAGAGTGTCGCGACGCGCTGCTGGATGGAGATGTGACCACCTATGACATGGAGAACGGGTATACGCGCCACTGCATTACGGACA 
GCAAAGATGCGGGCATTGTGGTCGAATTGGGCACGTTCTGCATGATCAATCACATTCGCATGCTGCTCTGGGACAGAGACAGCCGGGCGTACTCC 
TACTACGTTGAGGTGTCCGGCGACCAGCAGCATTGGGATCGCGTGGTCGACTACAGCGACTACCACTGCCGCTCCTGGCAATACTTATACTTCGA 
AGCACGACCCGTCAGATTCATTCGACTGGTGGGCACCCAGAACACTGTAAACCGGGTATTCCATGTTGTGGGTCTGGAAGCCATGCACACGGCCA 
AGGTGCCGCGTTTAGTTAACCATTTTGTGGCGCCCAAGACGAACGTGGCCACCGTAGAGATGAGCGCCATTGTGACGGACGGCGTGAGTCGCACC 
CGCAACGCGCTTATCAACGGCGATTATGTGCGCTACGACTGGGACTCGGGATATACTTGCCACCAATTGGGCAGCGGCGAGATCGTCGTCCGTTT 
GGGCCAACCCTACTATTTGGGCTCAATGAGGCTGTTGCTGTGGGACTGCGATGATCGCACCTACAGCTTCTACATCGAAATTTCGACGAACCGCA 
AGGAATGGCAAATGGTCGTGGACCGCCGCAACGACAGGACCCGCTCCTGGCAGAACTTTCACTTTACTCCGCGTCCAGTGGTCTATATCCGCATT 
GTGGGCACACGCAACACAGCGAATGAGATCTTCCACTGCGTCCACCTCGAATGTCCGACCCAGGACAAAAACTACTTAAAGAAGATAGCCGACAT 
GGAGAAGGAGCGCGAGAAGAGGGAGAAGGAGAAGAAGACGGCCAAGACTGACGATGACAATATTGCCTCTACCTCGGGATCCTCGTTGGCTTCAG 
GCCATGCTGAAAGTCCCTCCACTTCCTCCAGCAGCAGCCAAAGCGTTTTAAGGTCCATCCACTGGCCGCCACAGACAAGGGAAGTGGCCGTTGCC 
CCACTCACTCCGCCAGCACTCTCCCCGCCCGGCACGCCAGCACTTCCGGCTCCTCTGACCCCGGCAACCTCGTCGCCGCATAACAACCATGAACA 
AAATCAGCCAAGTAATATCAGTGCAGACGCAAGCCACCACACCAGTCCAAGTTCAAGGAGCAACCCCAGTCCCAGCCTAAGTAGGAGCAGAAGCC 
AGAGTGCGGAGTTGGAGCCAGTGCCGCCATTGGTTGAGCTTGACACCAGGGAGACGCTCTGA 
(SEQ ID NO: 155) 

Start ATG: 174 (Reverse strand: CAT) 

MSSQGHHKMSGGGKKGAMEQDYTDVVDLGDRFSADMARLCMNEQYADVEFIVEEERIPAHRVILAARSEYFRALLYGGMAETTQRQIPLEVPLEA 
FKVLLRYIYSGTLLLSTLDEDSTIDVLGMANQYGFQDLEMAISNYLRQYLALDNVCMILDAARLYNLEELTEVCLMFMDRNAGDLLLHNSFNTLS 
KESLEEVLRRDCFFAPEVQIFLAVWKWSRFNSNVDFKSVVSYVRLPLMNLEHLLQVVRPSGILDPDKILDAIDERSTSKALPYRAALWPEENVAA 
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ETFLSRCIQGECRDALLDGDVTTYDMENGYTRHCITDSKDAGIVVELGTFCMINHIRMLLWDRDSRAYSYYVEVSGDQQHWDRVVDYSDYHCRSW 
QYLYFEARPVRFIRLVGTQNTVNRVFHVVGLEAMHTAKVPRLVNHFVAPKTNVATVEMSAIVTDGVSRTRNALINGDYVRYDWDSGYTCHQLGSG 
EIVVRLGQPYYLGSMRLLLWDCDDRTYSFYIEISTNRKEWQMVVDRRNDRTRSWQNFHFTPRPVVYIRIVGTRNTAKEIFHCVHLECPTQDKNYL 
KKIADMEKEREKREKEKKTAKTDDDNIASTSGSSLASGHAESPSTSSSSSQSVLRSIHWPPQTREVAVAPLTPPALSPPGTPALPAPLTPATSSP 
HNKHEQNQPSNISADASHHTSPSSRSNPSPSLSRSRSQSAELEPVPPLVELDTRETL* 
(SEQ ID NO: 156) 

Classification: hypothetical 



Celera Sequence No. : 142000013385193 

ATATGTACCATGTAAATTCGTTTCTTCGATCAGAATTGATTTTGGCCCGAAAATCGTCTTCTAGCACAACACGCACACTTATACGCGTTCTCGTC 
TCTTGTTTTTACTCACCCAAGCAAGCAAATTATATTTTTAGATTTCTTACGCTCTCAGCGGGAGTGAGCGGAAAGAGAGTAATTTTGGCCGTCAC 
CAAAAAAGTGGCTGCATAGTGCCAAACCAATGTATGGCCGTTACGCATCTTGTTATTCTAGGGTCTTTGGTAAGACACAGGCGTGATGTCGCTTA 
CGAATCAAATTATATATATGTATATACGTAATGTAAGGTAACTAATGGTAACGTACACTGTTATAAATATTAATGCCAAGCGCAGAACTTTCAAA 
ACTGTTACTTGCAATATATTTACAGTTGCTTAAGCCTAGCCTAGTGGTCTAGATTATATAGCTAGATTATTTTGAACGATCCGATCCTGAACTTG 
CAGCTACTTGGGTTCGTGATGCGAAAGCACAGCAAATGTGGGGGGGTTAAGTGAGCATCGTATGATGACCACAATTCAAAGAACAGATTCCATAC 
ACTCGTCTATGTGGCTAGAGAAAGAACTATCGATCCACGGGGAATTCAATTCTTAAGGATTTATTTAATCTCCAGCAGCCTTGCTTGCTACTCCA 
ATGTCTTCCTCTTTCCGGTTTTAACAATATTTACACTTCTGAAGGCCTAAAAAGATCGCTGGCTTACAACATGTGATACATCATATACATATATA 
ATCTAGTAATTATGTAATTATCGAATATTTCTGCATCATTAGATTTAGTTGCTTAACTCTCGCATAGTTTAAACTTGCATCTATGTATCTGCTGC 
T C T C GC T GT AAAC AT AAAAGT C AAG GAA GAAT C T C T T G AAC C AC AT AAC AC AT GC AC T T GC T T T AAAAC GGG AAAC AG AGG AAGC AG AAAAAAAA 
CATTACAACAAACTCGACTCCTAAAGCTACGTGTTATCCTAAATATCCGCTTAGCCGTCGACGACCATGCGTTCCGATCCGGAGTCGCACTGCTC 
TATGACCAGGCGAGCGATGCGCTTGATCTCGCAGTAGGGACTCTCGTGCTCGATTAGCTCGTTGAGGATCTGGATGCCGTTCTCCTGCTCCACCA 

GC T T GC AAT AT TT T T C GGG AT AAAC CT GT G ACGG AAT GC AG AAGT T AG T AC AC AAT G C AC C AAC AAT GGG AAAAGG AAT TGCTCCCC C AC CT GT G 
TCAGATTGGCCAGGGCCCAAACAGCCCAATGTTGGCACTGGGGCGTCTCGTAGCAGCGCACCAGGCTGAGGATCGGCTCAAAGCTGCGGTAGTTG 
ATGTTCCGCTCGCTCTTGATGTTCCACCGCTGAATGGCGGCCACCATGCGCTCCAGCACGTGCTCCCGGCTGGGCGTCTTGATCGTCCAGGCATC 
CGCTCCGTCCGAGGCAATGTGCGCCAGCACTCCAGCCGCATTGTAGCTGACCTGGAAGCGCAGGTAGGCATGGTTAGCACTGGCCATCTCTCTCT 
CCCGCACTCTCGCCACCACACTCGCACTAGCACCACCCACCTCGATGCCATCGCTCAGGGAGTCCAGCAGACGGGCAAACACCTCAATGAACTCC 
TGGGTCATCAGCTTGGGCCGCAGCCACTTCACCTCGGCCACATTGCCCAGCAGGCCCATCATGTTGCGCAACAGCTCGTCACGATCCGGGAAGGT 
CTGTTTTGGAAAAGGACAGTTCAGCTTAAGAAGTCTGGATACAGGAAGTTAACCTATTTCTTCATGAAACCAAGGAGTATACATTTGTTCGCTTT 
CTCTATGCAGCTCTTAAATTAAATATTTTAGTTCCCAATCTT AT CTTTGTTGCGCTGCGGATAATGGC AAGT TTTTGGAGAACTTACATGT AAAC 
ACTTAAGGAAATACTCCATGCCACGTCCATCGAGGAATCTCTTACAATTAATAGCAGTCTCATCGGTTACGTTCCACATGGTGGACCAAGCCACC 
TCCATCACATCGTC G AAAAC GGAGC GAG TC AGC C GAT C CT T TAT C AAC GT G AAC AT GGT C T AAAGT C AAGAAT AT C AC C AAT AAGT AC AAT AAGT 
TCCGGCAAACAGTTGATAGGCAAGACACTTACACTAACTACGCCCAACTCGCCCAGGAAGAGCTTCTGTCGCCCATCCACCTGGCAAGCGAGTGT 
GTTCAGCAGGTAGATTGCTATGCGCTGGACAAATCCCTCTTGCTCCGTCTTAGAAACGCCGTGCAGCAGGATCTTGATGAGACGCTCGTACTCGA 
AGAGCT G AAAAC C AG AC AT T TAG C T G T AC T T AAT AT AAT AT TAG AT T AGGC GGT GCTT ACC AC AT C GAC T GGC AT GT G AAAC T GGGT G AGC GT G A 
GGTAACCATTCCGCAGCATGGTGTCGTCGGTGATGTGCATCTCCATGCCATTGAGCAGCGTGCGAATGATGTGATTGCGCAGTAGTGCTCCAAAC 
TTGGATCGATCCCTGCCCTTGACAATGTAGAACAGCGTGGCACTAATGAAGATGAAATTTAGAATTAAATACTATTTTCACTTAACTTCAAAGCT 
ATAAACCCACCTGCCGGAGATTTGCATGTGCTTGAACTTGAGATGCCGATCCATGGCGGAGAGCACAACATCCAGTGCTGTGTGTATGTCCTTGC 
AGTTCTCGAAGCGGAAGAGATGGTACAAGTCGTTGAGAACTCTTGTCAGAAGCACTGGTCTGTCGTGGTAATAACGCGCCGCAGTAAGGATTTGC 
TGCTCGTTGGCATCGCCAGCTACCTCCAAAGCTGGAATATCGTGTCTTTTGCAGGCCCAGTGGGCCGTGTGATAGAGGCCTAGGAACTGCAGCGG 
TCGCTGGGTGCGGGACGCTAGCCCCGGGATGTCTGTGAGGGCAAAGTGTTGCTCCATTTTGGGGCTCTGCTGCATGCCACTGGTAGTTGTCGACT 
CCTTGGTGGCCACACCGTTGCCGGCCAAATTCGTGCCAGAGATATCCAAATGTGTGAGATGTCGCAAGTTGTCCATCAGCATCTCCAGCGTCTGA 
TCGGGCAGGTCGTAGGTGCCGTGGCCATTTCCGGAACTGGAGATGGATATGTCCAGTGTGCACAGACGCCTCAGGCAGCAGATCGCATGCAGCTG 
GTTGGCGATGGGCCACACATTGAACAGGATGAGCGTGTGCAGATTGGGCAGCGAGCCCAATGCCTCAAGGCTGAAGTTGGCCAGCACGCAGGAGG 
TCAGGTCCAGGTGGCCCAAGTCGTGGAGGTGAGCGAACTGCAGGCGGTGGTGCATCACAACACCGTTGAGCACCAGCCGCCGCAGATGCGGACAA 
GTCAACTGGAAGTCCACTGGCTCCTTCTCGTTGGGCTCCGCGTACTGCAGCAGATGGGAACTGATGCCCAATTCCAGGGATCGCAGACTGTCGCC 
GTAGTGGGCCAGAAGGTGGTGGGAACCAACGGATATCATGTCACAGTACCACAAGGACAAAGCGAACAGTTTGTGCCGCATCAGAGTCTCCAAGC 
CTGCAAAATGTATCAAATAGTATTTCAATATGTTTTAAACTTATTCAAGTCAATGTGGTTCAGCAATAAATATGCAATAGAAGGTTGATTGGATA 
TCTATCAATTCGAAGGTAACACATTTACATAAGTGTAATTTTTCAAATGAACTTGCATCTGATTTTGTATAAACTTTTAGAGGTGCGGTTGCAAA 
TATTTAAATTTAAGTTTGTCTGTTTGAAAGTCTGACAGATTCCAGTATAAATTCCGTTCAATTACTATTCAATGGAGCTCAACAACACCGGATGT 
GGGCGGAGAACACTACGGATGGCGGGCAGAGCTCACCTATACTGCTTAGGGTGCTGTTGCGAAGATTGACGATCTTGAGCGAAGTGCGGTGTGTG 
TCCTCGAAGAGGCGGATGACGCTGTCGTCCAGCGGGCGATTGAATCGCTGATAGTTCTCGAGGAAGCCGTCGCAGATCTCGTTGGGCAGCACAAT 
GCCCGGATTGAGGCCGCGCTGGCCATCTGGCCGGTGGCTGCTAATGATGTCCAGATTGTTGCACAGCTTCTGGTAGGCGATCTCCTTGAGGGTCA 
GCGGCTCCTCGTCCAGCGAGCCGTCCTCCATCAGCCGCACCTTGCAGCTCATCTTGGGGCCGCAGTTCTTGTGCTCCGCCAGTGGGGCCTCAATA 
ATGGCCCTGTTGTCTAAAACGAAACGAGGCGATGGGAGGTATGATTAAACAAAATTTCAATAAGCGTGAAACTAGTTTTTGGGTATGTGGTCATC 
ATTTCATCATTTTGTGAAATATTGAAATTACTGTTCTTTGAAATAAAAAAAGTGTTGAAACAGAATTCAGAAACGATATCGGAGAATATACATGA 
TAATTTATTTCAAGTCATGAAATGACTACTTCACATTATGATTATTATAGAAATGAACTTATATTGCTGATAATAAAATTAAATTAAACACGGGT 
TTTCTATCACAAAATACAAAACTATTGTAGACCGGCTTTTACTTTTAATCACGAGACATGCACATGTAATGCGTTCGATTTTTCAATTAATTTAA 
ATGAATTTGATTTTGGCCCATTAATCCATCCAGGTGCTCAAACTCAATACTCTGTCGATTATACATGATTTTATAGCCCCTTATTTGAGATTACA 
AGT C AT GAT G G AAC G GAT GAG T G AT T C AAC C T T T GAAT C ACAT C AAT GAAT G AATGA AAGT GC CCTTTTCTTT T C AAG ATT AC AG AC AT GG AT T G 
CCATTGAAGTGTTTTCCATTGAGTGAATTAAATATTTTGTCACTTTCAAATGAGCCCTTAACATCAATGTTTATAAAATGAGCTTACAATGTATT 
ATTATACTAAGATAAATGGGCAATGACTTGACATTCTAGATTATATACTTTTTGATGTACGATTTCTTTAGTTCTTTTGTAAGCTAGCCAGGAAT 
TAATCAATTTAATTAACGAGTATAGTTTGTGTTTACATGTTGCACTTTTTCGACTAATTATTGTTTGGGTATAGGAAGCGGGCTTGCCCTTGCAG 
TCGAACCCTTGCAGCTCCTAACTTTTGGAACTTGAGTTGCCATGGATGACTGGCCAATCTTGCCCAGACTTTTCAACGCAGGCACGTGCCATTGT 
TTTAATCTGTTTTCGGCTCTTGGCTTCCGATAGTCATGTACCCCTGCTGTTACATAACCAAGGGTCATCTAATTGGGAAGAATGCGGTATTAATA 
CGGGGACTATGATCCTGGTTACATCCAGGCAGATGGTTATGTCCATGTCGTGGCGTGTGTGTATGTATATGCACAAGTTCCGCGGATCTCAAAGG 
TACATACATGCATGTAAAAAACAAATGGCTTGCGTGTGTCTGTGTGGGTGCGTAGTTATTGAAGAAAAAAGCACAAAAACACTAATAAAGCGTCG 
GCCTGTCTGGCTTATCTGGCCAACAAAACTAAGCGGAGTGACCAAAATAACACGGAACTGCGAGCAGCGACAGTCAGCTGTCCGGATATTTTTAC 
CCGGAGTCTCGGAGTTGCTGGCCGACTGGCCGTGGAGCCCCACGCTTTCTGTACCTTTTATTTTGCTTTAGTGTCGCCCCAAAAAAGCAGCCTCC 
AAAATCCCTCGCTGCTTTTGCTTCTGTCGTTGCGTTTGTCTTTGCTTTTGCGTTTGCTTTGCCACTGCTGCTCTGTCTTTGCGTTCGTTTTGCCA 
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ACGGATTTTCCGAACTTCGAGCGATTTTTCCGCGATTGCGCAGCGTTTTACCGTTACACGCGGCACTCGAGCGCCAGAAAATTCCAATTGTTATT 
CTCTAATTTTACGATTTGCGTATGGAAATCACTTCATTATTTTGGTTTCTATTTTCTTTCTTCTCGCCAGTGTGACCAGATGCCGAATGCGCTCC 
AACACCCAATGCTGTCTACCTAACACTACTTCGCACTAAAAAATACCGAAAGTATCCTTTCATATTCGAAAGCTAATCCGAAAATTCTGGAACTC 
CGAAATCCTGACTTTTGAAAATCTAAAGTTAGTGAATAAATGTTATGCAAAGGCTTGCAGTTAAGATATGCAATTACTTAATGTCATTAGCCCGC 
GCAAGAAAAACAGACACTTGAATTGGAAACTATAGTTTATTTTGTAAGTTAAGCGCTGAAATAAATCAATTGTAAATGTACACATTGTTAGTCGG 
CGGCGGCGTGCTTGCCTGGGTTCTAATTTGTAGTTCTCTTGCTCCGATAATGCGAAGAAAACCGACAATTGCTTCCCTCGCTGTAGGGATACTTT 
TTAAAGACTAAATCACTGCGTTGCAGAATCATCATAATCATCATAATCGTATTGTATAATCCGGATGCCGTGCCGTTGGCTTTCGGACTGGCGTA 
ATAAATAAACTTGCATCGTTTGCGTTTTAGGTGCTACGTAAAAAATGATAGAACAAATGCGGGGAGCCGCATGCAATTCGGATAGCTCCCCCTTC 
TAACCGGACAGATCTCCCTGATCGAAGATGCCCTGGTCGTAGAGCTCGGCCACCACTCTCCGACCGCCGAAGAACCGGCCGTCGAGTGCCTCCTT 
GCCCCGCATGGCCTCCGCTCCTGCCGAGAACTCCACGAAGATCTTCACGATGATCTCCGCCTCGTCGTCGTCCTCGTTCTCCGTCTGCTTCTCGT 
TGAAGATGATCACGCGACTGACGGTGCCGAATTTACTGCACTCCTCCTGGATCTCCTCCTGCAGGGTCTCGTCCACATCCTCGGGTCCGACCATA 
TTTCGCAGGATGATTACACGTGAGTCCACCGGCCGCATCAGTCGCTGCATTACCAACTGCCGCGCGCTCTGGCCCTTGATGGACATGTTCTCCTG 
CTGCTGCAGCGTCTGCACGTCGCCCTCATC 
(SEQ ID NO: 157) 



Exon: 5870.. 5565 

Exon: 4193.. 3837 

Exon: 3515.. 2576 

Exon: 2512.. 2341 

Exon: 2284.. 2123 

Exon: 2054. .1893 

Exon: 1710.. 1561 

Exon: 1476.. 1231 

Exon: 1165.. 1001 

Start ATG: 4116 (Reverse strand: CAT) 



Transcript No. : CT5498 

TCACACTGGCGAGAAGAAAGAAAATAGAAACCAAAATAATGAAGTGATTTCCATACGCAAATCGTAAAATTAGAGAATAACAATTGGAATTTTCT 
GGCGCTCGAGTGCCGCGTGTAACGGTAAAACGCTGCGCAATCGCGGAAAAATCGCTCGAAGTTCGGAAAATCCGTTGGCAAAACGAACGCAAAGA 
CAGAGCAGCAGTGGCAAAGCAAACGCAAAAGCAAAGACAAACGCAACGACAGAAGCAAAAGCAGCGAGGGATTTTGGAGGCTGCTTTTTTGGGGC 
GACACTAAAGCAAAATAAAAGACAACAGGGCCATTATTGAGGCCCCACTGGCGGAGCACAAGAACTGCGGCCCCAAGATGAGCTGCAAGGTGCGG 
CTGATGGAGGACGGCTCGCTGGACGAGGAGCCGCTGACCCTCAAGGAGATCGCCTACCAGAAGCTGTGCAACAATCTGGACATCATTAGCAGCCA 
CCGGCCAGATGGCCAGCGCGGCCTCAATCCGGGCATTGTGCTGCCCAACGAGATCTGCGACGGCTTCCTCGAGAACTATCAGCGATTCAATCGCC 
CGCTGGACGACAGCGTCATCCGCCTCTTCGAGGACACACACCGCACTTCGCTCAAGATCGTCAATCTTCGCAACAGCACCCTAAGCAGTATAGGC 
TTGGAGACTCTGATGCGGCACAAACTGTTCGCTTTGTCCTTGTGGTACTGTGACATGATATCCGTTGGTTCCCACCACCTTCTGGCCCACTACGG 
CGACAGTCTGCGATCCCTGGAATTGGGCATCAGTTCCCATCTGCTGCAGTACGCGGAGCCCAACGAGAAGGAGCCAGTGGACTTCCAGTTGACTT 
GTCCGCATCTGCGGCGGCTGGTGCTCAACGGTGTTGTGATGCACCACCGCCTGCAGTTCGCTCACCTCCACGACTTGGGCCACCTGGACCTGACC 
TCCTGCGTGCTGGCCAACTTCAGCCTTGAGGCATTGGGCTCGCTGCCCAATCTGCACACGCTCATCCTGTTCAATGTGTGGCCCATCGCCAACCA 
GCTGCATGCGATCTGCTGCCTGAGGCGTCTGTGCACACTGGACATATCCATCTCCAGTTCCGGAAATGGCCACGGCACCTACGACCTGCCCGATC 
AGACGCTGGAGATGCTGATGGACAACTTGCGACATCTCACACATTTGGATATCTCTGGCACGAATTTGGCCGGCAACGGTGTGGCCACCAAGGAG 
TCGACAACTACCAGTGGCATGCAGCAGAGCCCCAAAATGGAGCAACACTTTGCCCTCACAGACATCCCGGGGCTAGCGTCCCGCACCCAGCGACC 
GCTGCAGTTCCTAGGCCTCTATCACACGGCCCACTGGGCCTGCAAAAGACACGATATTCCAGCTTTGGAGGTAGCTGGCGATGCCAACGAGCAGC 
AAATCCTTACTGCGGCGCGTTATTACCACGACAGACCAGTGCTTCTGACAAGAGTTCTCAACGACTTGTACCATCTCTTCCGCTTCGAGAACTGC 
AAGGACATACACACAGCACTGGATGTTGTGCTCTCCGCCATGGATCGGCATCTCAAGTTCAAGCACATGCAAATCTCCGGCAGTGCCACGCTGTT 
CTACATTGTCAAGGGCAGGGATCGATCCAAGTTTGGAGCACTACTGCGCAATCACATCATTCGCACGCTGCTCAATGGCATGGAGATGCACATCA 
CCGACGACACCATGCTGCGGAATGGTTACCTCACGCTCACCCAGTTTCACATGCCAGTCGATGTGCTCTTCGAGTACGAGCGTCTCATCAAGATC 
CTGCTGCACGGCGTTTCTAAGACGGAGCAAGAGGGATTTGTCCAGCGCATAGCAATCTACCTGCTGAACACACTCGCTTGCCAGGTGGATGGGCG 
ACAGAAGCTCTTCCTGGGCGAGTTGGGCGTAGTTAGTACCATGTTCACGTTGATAAAGGATCGGCTGACTCGCTCCGTTTTCGACGATGTGATGG 
AGGTGGCTTGGTCCACCATGTGGAACGTAACCGATGAGACTGCTATTAATTGTAAGAGATTCCTCGATGGACGTGGCATGGAGTATTTCCTTAAG 
TGTTTACATACCTTCCCGGATCGTGACGAGCTGTTGCGCAACATGATGGGCCTGCTGGGCAATGTGGCCGAGGTGAAGTGGCTGCGGCCCAAGCT 
GATGACCCAGGAGTTCATTGAGGTGTTTGCCCGTCTGCTGGACTCCCTGAGCGATGGCATCGAGGTCAGCTACAATGCGGCTGGAGTGCTGGCGC 
ACATTGCCTCGGACGGAGCGGATGCCTGGACGATCAAGACGCCCAGCCGGGAGCACGTGCTGGAGCGCATGGTGGCCGCCATTCAGCGGTGGAAC 
ATCAAGAGCGAGCGGAACATCAACTACCGCAGCTTTGAGCCGATCCTCAGCCTGGTGCGCTGCTACGAGACGCCCCAGTGCCAACATTGGGCTGT 
TTGGGCCCTGGCCAATCTGACACAGGTTTATCCCGAAAAATATTGCAAGCTGGTGGAGCAGGAGAACGGCATCCAGATCCTCAACGAGCTAATCG 
AGCACGAGAGTCCCTACTGCGAGATCAAGCGCATCGCTCGCCTGGTCATAGAGCAGTGCGACTCCGGATCGGAACGCATGGTCGTCGACGGCTAA 
(SEQ ID NO: 158) 



Start ATG: 384 {Reverse strand: CAT) 



MEDGSLDEEPLTLKEIAYQKLCNNLDIISSHRPDGQRGLNPGIVLPNEICDGFLENYQRFNRPLDDSVIRLFEDTHRTSLKIVNLRNSTLSSIGL 

ETLMRHKLEALSLWYCDMISVGSHHLLAHYGDSLRSLELGISSHLLQYAEPNEKEPVDFQLTCPHLRRLVLNGVVMHHRLQFAHLHDLGHLDLTS 

CVLANFSLEALGSLPNLHTLILFNVWPIANQLHAICCLRRLCTLDISISSSGNGHGTYDLPDQTLEMLMDNLRHLTHLDISGTNLAGNGVATKES 

TTTSGMQQSPKMEQHFALTDIPGLASRTQRPLQFLGLYHTAHWACKRHDIPALEVAGDANEQQILTAARYYHDRPVLLTRVLNDLYHLFRFENCK 

DIHTALDVVLSAMDRHLKFKHMQISGSATLFYIVKGRDRSKFGALLRNHIIRTLLNGMEMHITDDTMLRNGYLTLTQFHMPVDVLFEYERLIKIL 

LHGVSKTEQEGFVQRIAIYLLNTLACQVDGRQKLFLGELGVVSTMFTLIKDRLTRSVFDDVMEVAWSTMWNVTDETAINCKRFLDGRGMEYFLKC 

LHTFPDRDELLRNMMGLLGNVAEVKWLRPKLMTQEFIEVFARLLDSLSDGIEVSYNAAGVLAHIASDGADAWTIKTPSREHVLERMVAAIQRWNI 

KSERNINYRS FEPILSLVRCYETPQCQHWAWALANLTQVYPEKYCKLVEQENGIQILNELIEHESPYCEIKRIARLVIEQCDSGSERMVVDG* 

(SEQ ID NO: 159) 
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Celera Sequence No. : 142000013384767 

ATCCTCCAGCTGGAGTTTCTCGCGCAGCTCACGCTGCAGCAAACTTGGCTTGATGATAAACGTGGAGATGCGACGCAGGATGTAGGAGAAGGTCT 
TGATCAGCAGCAGGAAGTCCTGACGCGGTACGCCCACCAATCGCTCCAGTTCCTCGAGTGTGTACTCCGGTTCGGTGTTACTTGTGGGGTTCTCC 
TCGGTGCTGCTGTCCACCCGGCCGGTGGTCGTCACACTGGTGCCCACTGTACTACTGGTGGCACTCGCCGATGCTACGCTGGGACTCATCTGCCG 
GTGGGTGTAGAACAGAACCGTCGTAAAGGTCTCGTAGGGCAAGGCATTGATGATTTTGATGCCCTCACGGGCTCTAAGATATAAATGCAAGAAAT 
TAATCTCTATAAGAACATTGATATTACATACCTTTCCGTGATCTTTATCCAGTTAATGCTCATTGTGAGATATATTTAAAAATTTTTAAGGAACA 
TAGAGTTATTAAGTCTTTACCAGCACTTCTTTTATTATGACTTTAATATTATTTGTTTATACAGACCAACAATATACAAAAAACTATCGACTTGC 
CACGGTGCGGTGCTGCCACACAGTTACTACTGCCAATAATAACGGTAAGTGCTGACTAAAATACAAAAATATTACATTTTGCGATGTGATTTTAA 
ACATATTCTTTATTAATTTTTTAGATTTAAATGTTTTATTAAATTAATTTATTTTTCTGTATTTAGGCCTGAATAATTTACCTTATAAAATTACA 
ATGTTTTAAATAATATTTGCTGTTATGTTTCCTGCAAAATAAAATTCGTGAAAAAAAGCCGCACCAGCAAATATCAATACCAGATTGTCGACGGC 
ACCGACGATCGACGATTCAATTTGTAGAAGATTATCAAGTACGAATAGTTTTCCAACTCATTAATATAATCGTCCTACCTGGCAACTCCGCTGAT 
GCCTATCGATGCTCCGCTCGATAGCGCCAACAATCGATAAGCAGTATTTGTGTATTGGAAACGCGGCTTAACTTAATGCAGATTTTTCCGCTGAT 
TCGGCTGCGAAAGATGCACTTTTAAGGCGCAGCGAGTGCACCCACGCCCCGAGTTCGAGTGCAGTTGCAGTCGGGAAAGCTTGACAAGTGCGCGG 
AGCAAGGAGAGCGACGAGTTCGTTGAGCTACAGCGAAACGGAAAAGTGTAAAAGCGGGGTAAACAGGCGCAGCGGAGCGGATGATAAACGGAACT 
CGGGATCGGAAAATTGGAAGCAAAACCAACCACCGATTATAAAAAATAGTCAGCATCTTAAAAGTAAGGATTTCAAGCTAAGGTTTCAGCGAAAG 
CCCGAGAAAAACAAAAACCCCAACTAAACTAACGGAACAGTGCGTCGGGATATATGCATGTGTATGTGGTATACGTATGTGTGTCTTTGGCTGCG 
TGCGATTGTATTGTTCCTGTGTAAATAAACAAAACCCGGCTCTTTAGTCTTCCACTAGTTCGTTTCGCGTTTCTCTGCGAGGAATTTTTGTTGAA 
AAAACTCAATTTGGCATAGCATAACTTCCAGTTTATTTCCCTATTCTGCAAACTGTGTGTATCAGTGTGCTGGTGCTTCCAGCATGCCGTAAAAA 
TCGATAGCGCGTCGTCGGTTGTTTATTTTATGCTTCGTCTGCATTTTGCAGCCCCTTTTTATCGCTCTCCCCTTTTCAGCCACCCTTGCACACAC 
TCACACTCTCTCCGGCGGAGTCGTGTGATGCATACACACACACACACACAGACACATGCATACGCCTGATACGGGAAATTAAAAATAATACTTTT 
TATCGCGGCATGCAAAAGGCAAAGCTGCAAAAACAGAAACAATTCCAATTCCAACAGCAATAATCTAGCCATGCAATCAATTTTAGCCAAACAAT 
AAAAAGAATCGTTTGGTTTCAGTTTCCCCGCTTTTCTTTTCCGTTATTTTCCTCTCTATATTCGGTTTTATCGCATTTTCCGCTCATTATGGCAG 
TTTAGCATATTTATGCTGGAAAAATATTGGGGTTTCTTCGCTTCTTACCTTTGTTTTACGTTTTGTTTTGCTGTCGGGCTGTAAAAACAGCAGTT 
AATCCAATTAAGAAAAATAAAACTCTAAATGCCATCGCCACCACGAATTCGAAATCACCTTTTATCTACTCAAAAGATATTCATCTTATTTATTT 
T C C AAG AT AAAT GT T AAATT AT AAAT AT AT T AT AGAAC C AT AC AC T TGT T G AAAT AGAAT TT C T GC T AAC AGC T GC G A GT T T T AC AG T C AT AAC T 
ATTATTATTATTTATTAACTTAGATTTCTACAGGCTGCGATTTTATAAAAATGATTCAATTTTTTATCTAGACTTTGTTATTAAAATTTCTACAT 
AAGCAGAAAATAATTGCTGCAATGACAAATATTTCATTTTCAGCTTGTGTTTATCGTAGCTTATTTTTTGTTAAATGTATATTAAAGTATTTACT 
C C C C T T C TC AT GT AAT AT AC AT AT AT T G AAAAAACAGT AC T T T T GT CT AT T AG AAT AT AT TTATACCCT C AC T T AT T G GT T C AT C TC GAAAAC T G 
GTCGAGTAAACTGAGAATCTTGAGATAGTTTCTTGAATTGGATATAATTTGCATGGTTGGCCAATCTTTTGGGTCGGCTATGCCCGCCTCTTTGC 
TTCGTAGAGGAGCTTGGGCACCGGGCACTTGCAATTGGCACCAGCTTCGGGCCAGTCAGTCCGCCAGACTTGGTCTTTGGGTCGAGATGTAGAGC 
TGGATAGTGTGCCACACTATATAGGGGATACTACTGCGATACAGACACCGCGGACACGGCTGACATGCATTAATGCTGCGAATAATGCCTATTGC 
AGACGGGGATAATGGAGTCACAGAAGCGGCCTTCGTTGGACTTACACACGGATGTGCCAGCAGGATTAGCAGCTGGAGGATCTGGCCTGGGAGCT 
GCAGCTGAGATGTCGCCCACTTCCGGTTTCCTGCCGGACATGCCGCAGTGGAAGAAGGACCTCATCCAGCGCCGGAAAACGAACGTGGCCCGCAC 
CCAGGCGGCGTCCATCACCTCGCCCACCGATGGCAGTTGTGGGGCTTTGGCAGAAGCCAACGCAGCTCCAGGTGCAATTGCAGGTGAGTGGTGGT 
TGGACAAACTGGAATACTTTCACCTGACACGCAAATTAAAGGTATTTTCAAGCCAAGCATTTCTGTCAAGGCATGTTAGGTGCAAGTATTCGTAT 
TTCTAATCGCTATTTCTTGCATTCTTGACAAACGAAATAATTGCACTTACTTTTGAGCTCCATACTTATCCCTTTTAAATGATACCGTTCGGTAC 
TTAAAATAAATTCAACGAACTATAGACTTCGGAAAGGTTTTTAAAATATTTAAAAACTGCCGTAGCAAATAGTAATCAGGGTGAAATATATTGAC 
TGGAAAAATGCGAAAGACTAATTGTGACTTACACAATCTGCAATTGCTCAGTTGGCGTTAACCAGCCATGAGTAATGGTAACAAGTGTTTTTAAT 
TCATTTTGCCAACTTTTACGATTTCCAATACCAGTATACTTTTAAATTAGCCGCCTTACATATTCGCTTATCGATACGATATTCGCTTATCGATA 
GCTCACAGAGTTGTCAGATTGCGGACGAATAAGATAAAAAAAACCAAGTTATGTTTTAATGTTCTGTCTTTATAACATTTGTATTTGCTTTCTTT 
CTTCAATTTTTTTTCTAACAGATTTCACAGAACCGGCGACAATCAGTAGCACTAGTCAAAAGAGAAACATGATCGGTTCAGAGGAAAAGTCTGAG 
AAATCTTCTATTTCCAATACCAATTCCGATTCCACTGGAGGTCATCACTCTGTTGTTGCCGTCTCCCTTTCGCCCGATGCGGCAGCAACAACAAA 
TGTAACAGTAACACCAATACCAAAGCAGCGATCGAGTTTACTCAACACAAGAAGTCAGGAGAGGGAGATGGTGCGATATATTCTAAGCGAGAGTG 
GAGAACGGGATGGAGAGCTTGAGAGCGGCGAACAGCCGGCTGGTGTGGTGAGTAACAGCCGGTGCGGTGAAGTTGAAACTGGCACAATTGGATCG 
CCGTCGTCGTCAGCAAATCAAAATCCAAACCCAAATCATTTAAAAACGAAATGCAAACCGGGACAGAGCGTTGCTGAGGGCAAGCCTTCAGCTAA 
AGAGACCATCGTCGATAACAGCAAAAGCTGCAGCAAAACCAAGAGTATTTCCGATAAATTGCAGAGCAACAAGTTTATAATTCAACAGCAGCAGC 
AACAACAACAACAACAGCAGCAACAACAGCAACTGTCGCCCACAAAGGTAACGGTAAAACCGACAATGGTCGCCATGCAAGAGATGAAGAAGACA 
ACCAAACAAAATGGCCAGCACCGACATCTAGCGGGCAAAATTGGCAGCGTAGCAGGAGGCGATGTGGATCCACAACATCCACCGACGAATCCCAT 
ACCAGATTCATTGGACACCGGTGAGGATCTGAGCTACGGACCCGGAATTGTGTCCAAGCTGCGATGCCGCTACTTGAGCCTGGCCCTTCGCGAGT 
CCCGCCAGCAGAGCAGCAAACAACGCCTCCAACGCTCCACCAGTCTGAACACCCTGCTCGACCGCGACGATGACGAGGTGGAAGTGGAGGAGCCC 
GAAATGACCAACAGCCAGGTGCGTGCCAAATCAACACCGCCACCGATATTGGGGGCAAAACCGACACCGAAACCCAGCAGCCAGCATCAGCGTCC 
CGTCAGTCTGGGCGCCAATGGAGCTGGATCAGCTGTTAACCCGCCCTCGAACTCGGATCAGGCCCAAACGGGCGCCGTTGCCAACGGAGGCGCAG 
GATCTGGCCAACGTTCGCGTCACTTTAAGCGCGGCAACGAGGTGATGAAGCGGGCACGTTCCGTGGAGGCTCTGCTCTGCGAGAAATCGCCATGG 
AACAGCCAGAGAATCAGCACTGCCGGCGCAGCTGCAGCAGCAGCACCTTCACCACCTGCTCCCAAGGCCACAGCCGCTGCTCCCTCACCTGTTAC 
ATCGCCCACCTGCGTTACCATCGAGGACAAGATCCACAATGCCCGCGAACGACTGCACAGCGGGACGGATACGGCGCCACCCAAGCGTCTGACCT 
CCATCATCGATGACACCGAGCGGCCGCCACCGGATCTGGTCAAGCAGACGCTCAAGATGTTTGAGGCGAGTGCTAATCGACGACCGCGCGCAGCC 
CATCGTTCCAATGGTGTTGGCGGAGTGGCCAGCAAGGTGGCCAACTACAAATCGATCATTAAGGATCAGAAGCAACCATCATCAGCAGCGGCGAC 
GCCACCGCCGACCGGCATGGGTTTTGCCTCCTCAACGCCGCTAAGGCATGTCCATCCGGATATTATACCCCGTCAGGTGGATTCGCCCGTATCGG 
CGTTGAGTGTAATGATGCGTCGCATGGAGCTGCAGGAGCCCGAGACGCCGGAAAGGGAGACACGGGACGCGGAGGCCACACCGCAGAGCGAGGCG 
CTAAGTGAGACTGAGCGAAATGATCGCGATGAAGGCGATGGCGAAGTCGATGACGACAACCACAACAACAACGACGACGATCACGACGACGGCGA 
CGACGGCAGCGACAACAATGAGCAACGCGATAAAATGGGCGCGGCGGCGGCAGGCGATAAGCCTAAGCCCCCAGCGGAATCGGCAGCGGGAGGCG 
CCCAACGATCATTCGCTGCCTCGACGGAGAACGCGCATGTAGCCAGCGGTAGTAGCAGCAGTACCAGTGCCAGTGTACGGAAGCTCAGCAACAAC 
AACGACTCCAGCAGCGGACCAGCGGTGACCAAACAGATCGGTGTCATCCGTCCGCTATTCAATAGCCAGGGAGCCGGGAGCACGCCGCTAACGAG 
TCGCGAGATCGAAAAGAATCGCATCAATGAGATGAAGAAGTCGACGGCCACAGATGCCGGTGGTTCGGTATCCGGATCCGGAACGGTTGCTGGTG 
CTGGATCAGGAACCTCGCCCACCACTAGTCTCGACAGCGTGATAAACACCAAGGAAGCGGCCAGCAGCGAAACGGATGCCAGTGCCTCGCCACTG 
TGGACATTGCGCAAGCTGAGGAATCAGAGCCAGACGGCCGGCGGAGGATCGGGTCCCAGTTCGAGCCTTCATTCCACAGAGAACACCTCGATGGT 
GTTCAACTTTTCCAAGAGCACCAAGGAAGTGCCCGACTACATCGAAAGCGACGTTGTGATTTACAGGCGCAAGCGGGAGCTGCCAAAGGTGAGTC 
CGGATTGGGTAAATGGTTACTACTACGAGGGACTACGATTTCCATTTCCGTTCCTTGACGCAATTCCTTCGTGCGTTTCGTTTTTTTTTTTTTTT 
TTTTTACGAAATCTTTCCCGGAAGAGTCACATTTTCGACGTCTTATCGGCGGGCAGCCTTGCGGATTATTCATGCTTAACAATTACCCCATGCAT 
TATTATTTTCGCCGATAGTAATTTGGTTTCTTTCGTCCGTCCCCAATATCTTTGTTGTTTTGGCTCATTTTCCGTTTCTTTCTCGCGTACTTTTT 
TTGTATTATTATTATTATTTTTTTTTTTTTTTTTTGTGTCATACGGCGGCGGCAGCCATGCCGTATAAACAAATATGAAACAAAGCACACAGAGA 
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ACAGCGATCATGCGCATTTCGAAAGTTCATCAAGGCAGGGCGAGAACGGAACTGAGGAACTGCGGAGAATTGGGTGGCGCAGTGACAAAGCGCCA 
CTTGGATACTCTTTCTTTCTTTCTTTCGAGGGAACTCATGTGCCATATGTATGTACATATATCTTTATATCTTCGCCTTCTGGGAAATTAGTCAT 
GTGTCGTGAATACGTCGTTAATAACGCCAATATTGCCAAGATGACTGACTAACTGCTCACATGCGATAGGGGATCTCCTCCAGCTGTTCGATCGC 
TGATCGTGTTAATACCCATTTAGGAGTTTTACAGATTAGGAAGCAGTTGTTAGTTAAGGAACTTTGTGTTTTTTAGTTGGTATTGCACCACCAGG 
AAATTATTGGACTACATTTAATATATTTGCAATATATACAAGGGATCAGTAATGTAAAGGCCCCACTCTTTCAATTATCATTGAAATTAGCAGAA 
CTACAGGTTTCCAATGACTTCTGATGGGTATGTGTGTGTGGCCATTTATTATCTGCCGGCAACCAAATCATTGTGTTGATCAATAAGTATCTGGC 
CAAGATTACACAGCACAGTCGGTTAATCACAACAATACGGCGATCTGGCAGATCAGTTGGGGGGATCTGCCACTGCCACTGCCATTGCTGGTGCT 
ATTGCTTTAGGAAGTGGCAAACCCCTAATAGAGATCGAAGCTTGCATCACAGCTGCGAATAGTTCGATGTGATCTTCTGCTCCGAGTCAATTAAA 
ATTCGCCTTGAATAGCTCGCCGATCGATAGCCCCGGATCGTGTGCGACATGAACAGATGCCCCTCCTATATGTATATGCATATACATAACTCGAA 
ATATATATATATATATGTATATATACATAACACATATATCTGCATATTCTTTCCCAAGTCTGCTTTGTTCCCAGCCAGATGCAAGTCTCATCTGC 
TGGCAGTCTTCTGCCTTCGGTCGCTTCTTCTTCGTTTGGGCTCTTCTGTGGCTGCACTTTTCGAGATCGGAAGAATGGGAAACCAGTTTTGAAGG 
CATGCATATGGACATGAGAAGATATTTCTGCGCTTCCTGCTGCTGCCAGCAAAAGTCCGAGTCTGTAGAGATATGTATGTGTACACAGTGCCCAT 
GAGATATTATGTGTCTGGTCTGTCTGTCTCTCGGTCGGCATGTCTGTGGGGAACTTTATTCTTGGGGCACAGCAGCTCCTCGGGATGCCGGTTTC 
GATGGTATAGAATGGGATGGGATGCAAGGCAAGGCTGCTCCTTACAGCTGTTCCTGTTGCCTTGCCAGACATGTTTAAATACCCTTTATTTTGTG 
ATGGCAATTGAGCTTTTTTTTTATTGTTTATACTCTGATAATCAATTCAGTAGCTCAACTTGAATGATTCTTGTTTATAGTAGATATCCAATACG 
TATGGATTGTTTTGATATGAAAGAGACACACAGAACTTGTTAATATATATTTATTAATACCAAATTAGGAGTACAAAGCTCCACAATGGTGTCAA 
ACCCAAATATTGTTCTTCGCAATTGGAAAAGCGAATGGTACAGACCATACTCTATAGAGTCTTTATAGCTCTCCCTTAGCTTTAGACTTAGCTTT 
TCGGTGCGGTTGAGTTTTCCTCTTGCCTTCTTCTCGTTGGCAAACCCGTTTTGTTTGTGCTTTGTGCCGTTCTCCTCTCCCACTCCCTCTCCCTC 
TCTCTCTCTCTCTCTCTCTCTCCCTCTCAGCTGCCATGTTTCTCTCTATGTCCATATCCATATCTCCATATCTCAATGGCTCCATGGCTCCATGG 
CTCTCTTCGCCCTCTTCCTGCCACTGAACTCAATTTTGTCAATTCTTTTTTGAGGCTCACTTCTACTTACTGTGTTTGCTTTTGCTTTTTGCCTT 
TGCCTTGTCTCTTTTATCTCGCTATGAACTTTATTATCTGCGGCATTGGTATATACATATATGGCATATATAGCATATGGTATGGCATGGTATAG 
TATATGGCTGGGGGTCGATTTATGCCACAAAAAAAAAAAAATACAAAAAAATGGGCCAAAAACAGTGGGACAGTGGGTTTTGGGGAGGGTGGGGA 
GGGAAGGCGTGGAGAGGAGAGGAGCGGTGCCGATAGAGTAGATATATGACATGGAACGGGACTGGGAACTGAAATTGGAACCGGGACCTGGACTT 
GGTACTTGGTACACTGGTCTGGACCTCTGGAACCCCCGGCGTGAGTCACAGATGACCCATAACACAACGTCGATTAAGAATGGCTTTCCATGAAT 
TTCACTTTTCGAACACATGGTGACCTGGAATTTGGACAGATTTTCTTGTGGAAGGGGTGAAGCTGATCCTCCTTTGTAAATTGTCGAGAGACAGC 
CAT T T AAATC C AAAAAAAAAAAAAAAG AAC T GAAAT G AAAT AC T TAT GGT AAAC GAG T T AT GC AT C AGC GAAAAG AGG AGGGT T G C TG GAC GT T C 
TGCAACTTCTGTAGGTAGTTAAAGCGAAATAAAGGCGTGTTTACCATTAAATGATAATATCTGAAGCAGCAGAAGAACATTGCAAATAGCTTTAA 
CAACATTCGAAAGCAATAACACATTTTTTAAGTGCCTTGTTTGCATTTTAACGAATTTCATTTCCCAGCTAACAAAACGTTTGCAATGAAGATAA 
TGAATATTTTTGTTGGGGGGGTTGTATACTAATTCAAAATCAACATCTACTGTATTTTATTTATTATTTGAACTATTTGTGAACCCCCGGGGAAT 
GCCAAGAGTCAAGCATTTATTGATTGTACTAATCGTTTTTTTTTTTTATCATCTTATCCTTTTAGCCAAATGAACCTGGCTTCGTGCTCCTGGGC 
GATCTCTCCGTGGAGACGTCGACGGACACGGACTACGACGACTACTCCATGTGCCCGCCATCGCCGTGCGATGTGGAGTTTGAGAATGCCAACAT 
TGTGATCGACGGCAAGTCCAGCATACGCCAGAAACCCAAAGAGTCTTCGGTATGTACTACACCAAGGGATATTGCCGGGATTGGATTGGAGATCG 
ACTAGATCTCAACTCAATTCCAAGCCATATGTATATGTATGCGGTCGTTACATACTCCACTACTAATGAGTGAGTCAGGTGCGGCGTTTTGTTTT 
TTTTGCCACCGAATGGTAGATAGCATTGGTTTTTTTTTTCAATTCCAGTTTGGGTTGTGTATGTGAAAAGGATAAGATAAGATTACTTGGGCCCC 
TTGGCAAGGAGGGAATGTACATACATACACATGTACACACACACGCACACATTCGAACTTTTTGACCACTAACTTCGACGGGTGGTGAGGACGTT 
GATAAGCGGGAATGTAAAGCATAAACAATTTTGCCTTATATAACATTTATATCGAGTGTGTGTGTGTGTGTGTGTCGGTCGGTCGTCGCCTGTCC 
CAGGGACTATTAATCAACCCGCATTAGGTTTGTCTTATTTCGGGATTTGTTTGCCCATTTGGAGCTGCTTGTGTTGCGTCCCCAGATCTGTTCGC 
CTGGCGCTAATTTATTTATTTATTCCTTTGCTCGGGTGCGTTCCATCAGTGTGACTGATATGGGAATCGCTACTCTACTATTGTCTTTAGCCTCA 
GCAGCCATCGGCTGTATTTCGGAATCAATTCAATCGTTTTCCGTATCGCATCGCATCGAATTGCGTCGGTCTGTTGGGGTTTGTTCTCCGATAAG 
GAAGGAGCTCGAATTCTGCAGTCATTGATTGGGGTTGACTCATTGATATGCCATTGTGATAAGGCATCAATCGATGAGTAAGTGCCGATTGGGGT 
TGGGAGTCGCTTTTGAAGCAATCTATTTGCCATTAACCCAATGTCACGGTTATATGCATTTAAGAAAAATATCAAATGCATTATGCATATACACA 
CCATATTGCCAAAAGTCCATCAATGAGTTGTATATTTTATTTATTTATAACAAGGCATTGGTCCAATGCTTTCGTCTAACCTTGAATTTTTATGG 
CTAGCAGAGAAACTACTTAGTTTCGGATATATATTCGGTGATTTCTGGACCGCTAAATGTAAATCGAACGGCATCTGGTTATGCAAATTACATAA 
AGACCAATAGTTAAACGAAACCAAAATAAAGGATAGAAAACCATAAACATATATCTTTAAGTGTCAAAATGTAAAATCCAAATAGTTATCACTAA 
TTTCAATGGCGCTAGTATTATTGTAAATTCTACATTCGCTTTGTTATCAGCAATGCCAGCTAGTCATACATTTATGCCAAATTAAAAATAGTTTC 
GTTTAAAGGGAAAGTGGTTGAGTGGCATGATTTCAGTTCACTACGGGTATTAATTATATGACTAAAACAGTTTGGCATGTAAAAATCTATAGCAA 
AAATGGCATTTGATTATTAGTTATTATTATTATTATTCAACAAATGATTAATTATAATATGAACGCTGCGACGTGTGGGGATAGCGACTTCAGAT 
TCTGCCACATTACTTGATACCTTGATTGAACCCATCGAAGTGAAAGTAATCTCAGAATCGATCACTGATCAAATTTCTTTCGATTTTCCCACAGT 
TCCGCGTGCAGTTCAACGACACGCTGACGTCGACGTTTGAATACCCCTCCGAGGCATCGATGACCATCGAGGATCCGCCGTACGCCGATCCCTTT 
GGCCATGTGAGCAAACACCATCAGATGCTGCTCGCCGAGCAGATGCATCTGTTGCAGCACCAGGAGCAGCTGGAGCTGGAGCAGATGCACCAGCT 
GGGGCTGGCGCCGGAGCAGCACCATCATGTGACCGTCGACGAGATCATCGAGCTGCCCACATCGACGGCGGGACATGGACATGGCATCGGACTTG 
GACACAGGTCGGGGGCGGCGGGGGGCGGTGGCACGATGCTGGGGAATTTACCGTTGGGTAAGTAATATCTGTTCCACCGAGTCCATCCAAGCGTG 
CATTGTATTATTATTATTATTATTATTATTATTATTATTATTGTTGTTGCTGTACGCTGTACGAGATGTATGTTGTGTGTTGTTGTATGTGGCAT 
GCAGCAGCGAATGATGATGGAATGGCTTTTTGTCTTTGTCTCTCTCTGCGCTCTCTATTATCGTCTCTATCTCTCTCTCTGTCTGCCTGATTGCC 
TGCATCCAAGCCTCTCTATCTATCCATCTCTATCTATACCCGTCTCTATCTCTGTCACTTGCCTTGTCCTTACTCTTGACTTACCATCAACCAGC 
GTTGTGTTCTTGAGTTTTGTTTGTTTGTTTACCATCATTAATACCCACTCATCCATCACACATCGTCGCGATCTCGTTTTTTTTTTTTAACACTT 
TCGTTTGCCTTGGGCGGATCGCGAATATACAATAGCAATTGAGAAACCCCAAAGACAACTCAAGACACTTGTTTAGCAATGAACGTAGAAAGGTA 
GTGCCAATGAGCAGCAGGGAAATATACATAATGTAAATCAGCTGGATAACCATCCTTACAATGCTACTTATAAAATGTGATGATGTGAAGGAATT 
TACATGCACCATCTTATGCTCATTTTAATATTTACACTGTTCCAAGTAAAATGTTTCCAAATTAAAGAAATCAATGTAAAGTTCTCGTATAAAAC 
TGATTTAAACTAGAGTTTATAGGACTACGACGAACCCAGTTCGGAACTGGGTGACCCGCAAGCCCACATCCCATGCCTGGCTAAACTCTCGGGTA 
TATAACAACGAAGGTTTCTGCACTTTACAGATGCTGATAATAGAGCCACAGAGCATCTTGTCTCCGTATCCGTTTCTGAGCCCGCTCCAAACTCC 
GCTCCAGCCTCCGTCTCCGCTTCAGTCTCCGCCTCAACTTTCGTAACCGTCTCAGATCCCAAATGTCGTAGCGCCAGGACCTCGGGCAACGGCGG 
CTGCTGGCCGTGGCCGAAGTTCCGTACGGTCCTTGGTGCTCAGTTTCGCCGGCTGACCAGGCGCATCAGGTGGGTGCAGCCATCAGCTAGCAGCA 
GCCCCTTCCAATGCCCCGTTTACCGATCCCCGATTCACCTGCTCTTCTATATACATACGCTATGTCACAGCTCTCAACGAACCAAAAGCTAAACC 
CAAGCCACCGAGTTCTCCAAATAGCAAACCATCGTGCTACGGTCTTTTCTGGCTGAAAGGCTTTCAATTTCGCCTCACGCATATTTGCAGATTAC 
CCAAGTTTAGGGAGGGGTATTATCTTGCCACAGTCGCCACAACAGCAGTCAACGCAGCAGCAGGAGCAGCCAGAACCTGCAATAGCCACATCGGC 
AACAGTGGCAGGATTAGCAGTGGCAATGGCAACCGAAGTGGAAATCGAGCCACCACTACCAGCACCACCAGTCACAGTCGCCAAGCGCCCGCAGC 
TCCCGGCGAAGCCAGCACAGCAGCAGCAACAACAGGCGGCGCGTAGGGCCAGGCCATCGGTGCTGCTTAAGCCGCAGTACGCCTGCTTCCTTCAG 
CACGACGATGATGCACCCAAATCCCAGCCAACTGCTGCTGGACGACCGCGCAAGGCAGCCTCGTTTTCATTGGGCAATCAACGGCAATCGAAGGA 
GC AGC AGCAAC AT CAT CATC ATT ACC AGC C AC AGC AT C AGC AGAGT AAAT ATGAACGCAACGATTGCAGTAGGAGTCGGGGCAGTTGCGAGAGTC 
CAGCTCCAGAACACCAGCAGCGACGTCGTCGGCCTAGTCTGCCGTCGTCGCTGGCTTTGTCCAATGCCAGCTCGGCCCTGCTCCAGGCACGCATT 
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CCAGGCACCACGCTCTACTATGTCTGATGGCCACCCGATATGAAAAGCATTAGACTCCAAACTCCAGATAACCAGATATCCAGATAACCAGATAC 
TCCAACTCCGAACCGATTTGAAACCGCCAGCGAAACGTAGGCATTGCCCAAGCTTTACTATATGTTTACAACTAGGCTAAAGAACACCCATCATC 
CCTTGGATACCTCTTGGTGACCCCGTTCCGCTCCCAAAAAAAAAAAAAAACAAAAATATATATTGGTTTGATGGCCGCTAAACTGTAAAGTGTAA 
CCCTGTGCATATCCGACTCACTCCAAAACAAATAACTCAATGCGGAAACCAGGAACACCAAACCCGAACACACCAATCACAACACACACAAGAAA 
ACAAACACTGGGGCAACCGGACACTCGAAACACACGTTTAATCGATTCATAGATGCGATTCACGATGCTTCCCGATGTCGATGCACTGCCGCCGT 
TAATCGTTTATGCGATGCTCTGGATAACAATCAACAGTAATTCGCGAATTAGCGAGCGAAGAATACGCGAAGGATGATGGATTTTAGAACTAACC 
TAAATGATGGATAATGCTGGTGTCCCCATGTTTCACTAAAACCAAACCATGTTCCTATGACTGCTGCTTGATAATGCCCTGCTCCCCCGTCTGTA 
TTCATTATCCACAACCTTAGGTATAAGATGCTCCATAGTCTTTAGCAACCATTATATGATAGTTTACCTAGTAAATCGTATGCCAATCATCAGAT 
TGAACTGTACTATGCCCACCCAGCCCATGTGGAAGTTAAAGATTATTGTAGAAAACTATTTAGGTCACCTATAAATGTTGGCAACGCCTAAGAAT 
TAAGCTAACTACTCAGCTAGAGATCGCAGAAACCGAAACAATTGCGATAATTGATAACTAAGTGTTCATGGCAAATTTTGGGGTGTAACTAAACG 
TTTAATAAACGAAAAGTAAAGCAACTAAAGCTGCAGACAAATCAAAGAACCTTTATGTAGCGGCTAATTTCGACATGTGTTCGGTGTAAACCAAA 
CAAAAACGAATCACAGAACACTAGTTAGTGACTATTTTCATTTCAGTTACGTACTTAAGTATATATATTTGTGCACCCGTGTGAGTTGGCGTTAG 
TTATGTGTTATGTGCTTATGTATGTTTGCTTTGTTTGGCTCAGTTCGGTTTGGTATGCAGTTGTGGGGCATGTATTGAATTGCAATTTGCGATTT 
GAAATTGTAGCCATGTAGTTACGCTAACCTACCTTCTCGCCCCCGCTGTTTCACGCCACCCAAATGGTTTTTCATTTCAACAACAACCTTTTGTT 
AAAGTCACCGCTGTCGCTTTGTTTGTTTGCTATTTGTTTACTTCTATTTCTATTTTTTGTTCATACCAATTTCTTTTTGCAGATATTATTTAAAT 
GT AAC AG AAT A C AT AAGC AAAGC GT T G AAAT AAT AT C T AT T GT AT AC C AAAAC C AAAAAT AT C T AT GAT AAC AAAC AAGC AAAGG C AAT T AAAC C 
GATCCGTAAACTGAATAACCGATTAATAACTGCAGCCGCTCAACCCGAACCGCAATAAATAACTATCAAGTAAAGAATAGAACAACTTAACAATG 
AAAAATGC AAT AT T C AAT ATGC GAAAT G AC AAATGC C T AAT GC CAAAT GC C AAT C C AAAT AAC C C AAT AACC GAT T C AAGT G C GAG AAC AG AAAT 
GCCACACCTCTAACCCAGTGCACTTGCTCTGTCTGCTCCGCTGGGCAATCAAACTAATCGCCAATTCATATTATTATTAAGGTAACTAACCCAGA 
TTAGAACAGACACTCGAACACACGCACACACCAGCCACCAACACCACTTGGCACATCCTAGTCATATCATCAACTTCCATCGCTCATCTCCACTT 
CCATTTCTTTATCCACTGGTCATGTTTCGTTTTCATTTGCTTGAAAAGATCCAGGCGAGATCCTCCTTTATCCACAACACCGTCACCTCATCTCG 
ATTATATATACATATAAATATATATATATATGTTCTGAAATATATATTTCCCAACTCGGAGCCACCTTATTTATCATTGTACTAACCCCAAATCC 
T T GG AAG GAT T T AAAC CC ACT T G AAAT AAAGAT AT AC AG T C G AT T AATC T C G T T C T AC AC AT AAT T G C AG GCTC C AC GGCTCTGGGCTT C T AC AC 
GCCCATGAAGGGCACGGCGATGGACAACCTATTCCAGCTGGGTGTCACCCGATACGCCCTGCCCGAGAGCAGCAGCAGTGGGAGCAGCAATGGCA 
GCCACAGTCCAGGCAGCAACGGGAGCAGTCCAGCCGGAGCTGCGGGCAGCCGGTTGATGTTCAACGGGAATGGCAGCATTATGGGCATCGGCGAG 
GGACTGATCAAGGAGGGCGACTTGGCGGCAACGGAAGCAGGACCTGGAACTGCAGATGGAGCAGGTGCCTCCAAGAAGGGTTCAGCTGTGGACGA 
GGATGATTACCATCTGACGGCGACGCCGGGCGCCGAAGTTGTGTACAGCGAGGGTACACAGAAAACGGATTTGCTATATTAGGTCGTCCTGCCAG 
CCAAAAAACCGGAGCGTCTAGCTCTTTTTCTACCGGTTTCCACCAAACTCCAAACCTTCCGAGCATAACAACCTACCACATCCTGATCGTTCCCC 
T AAT T G AAAAGC GT GC AC T T AGGT GAAAT AT C AACT TC GGT T AG AC G C G AAAAC AGT AAC GAAGGAAAG AAAAAT AAAAAC AAC AAGG C AAC GT A 
AACCAAGTATGGCAAGAAATCCATAAAGATACGTATATAAATAGATATTATTTGTAAGTTTGTAAGCCTAGAGATCCGATGTGTATGTGTACAAC 
AGCAAGCGTGTGTGTCTGTCTTCTTTATAAATATATATATATGGTTTTTTTTTTTTGATAAATTAATAATTATATTATTGTAATGTATCTACATA 
CGGCACTGAATGAACAACTTTGCTGCTGCAATTGTATTTGTATTTGTATTGTGTATTCCATTTACGGCATTTTGATAGTAAATATACAACAACAA 
CAAATCAACGTATTTCCGACTGCATCCCGAAGGTTACTGAAACCTTAATCCACGAAGTACGCGACGAGCGGTTAGTCCCGCCCGAGTATTAACTT 
AAT GGATTCT AAC GATCCGATCTTGTGACCCATCATCATCCCTTTTCGCGACCCTTCC AAAAC GAAAATGAAGAAC AT TCTAAGCTCCGCTTGCC 
CATTCAGCGGAAGAAAGCAAAGGAAAATGTAACATTTGCCTCATAATTATTATATACATTTAGCGCTTATTGAATATTTTTTACAATTGTATTCC 
GTAAATAAATATAAAATACATCAACATCTAGTCGCGTGTTTTAAGTTAAAGGAAGTTAAGGCCAAAGCAAAATGTTAGCAACAGTTTTTGTTAAA 
AGTTAACATATATAATAATGGTAATTTAAATATTCAAATCTATGAAGTGACTTATTTTATAAGTCCACTTATGTTTGTTATATTTAATTGGTATT 
TAGAAAAGGATTCATTTTAAATTAACACCGTTGAGGGTTCAATTTTGGT AGT GCGTCTTTGCT AACC ATGCCGAGTGGGATCAAGTCCTTGCTAA 
GTTAGGATGTTGTCTTAGCTAACTGTGCCATCCCTGGTCGGGGTGTGTCTTAGCTAACAATTACGTGTCGATAGATATAATGCTATCGATATGAC 
AGAGTGTCGGGAACAATCGATAGCATTACTTCGATATCGGTACTAGTGCCACCTCCAGTTTTGTGATTGTGTCAAAAATACAGGGTTTTATTGTT 
GCACTAGCAATTTAATCCGAGGAATATAAATTTTGTGTGGAGTTATGTTCGAGACGTCGTTGCCAGCGATTCAAGAGGCGAGCGGTCACACGTGT 
ATGTGTTTCACTGGCGGTGCGGGCGCCAACTCCGCCAAAAAGCGTACTACTTTTAACCTTAGTTGACATTGAAATGTACCCGCTTAGTTGGATGT 
CCACGATGGTGCGTGTGTGTGCGTATATGTGTGCTATGCGGCGTCTGTGTGCGCGAGTGTGTGTGTGGGATTGCAAAGGAGGATGGGAGAAACGA 
GAGCAGAGCAAAAACAAAAACAAAAGGAACCTGCTGCACATAGTCGGTGCATGCGAGCGTGCTTATGTGTGTGTGCGTGCGTGCTAGCAAAACAA 
CTGCAAGAAATCAGCAACAGTAGTAGAAGAAGAGAAGCTGTATCACTGATAAGGAGGCTAAACAACAACAAAAACAGCAACGATAATGTGCACCC 
TTTCGAAATGCAAACAAAACAAAGCGTTGTAGTTGTGTAATCATTCGTTGTG 
(SEQ ID NO: 160) 

Exon: 1001.. 1298 

Exon: 2853.. 3123 

Exon: 3727 . .6263 

Exon: 9471.. 9644 

Exon: 11210.. 11552 

Exon: 14808.. 15186 

Exon: 15556.. 16722 

Start ATG: 2862 

Transcript No. : CT5502 

TGTATTGGAAACGCGGCTTAACTTAATGCAGATTTTTCCGCTGATTCGGCTGCGAAAGATGCACTTTTAAGGCGCAGCGAGTGCACCCACGCCCC 
GAGTTCGAGTGCAGTTGCAGTCGGGAAAGCTTGACAAGTGCGCGGAGCAAGGAGAGCGACGAGTTCGTTGAGCTACAGCGAAACGGAAAAGTGTA 
AAAGCGGGGTAAACAGGCGCAGCGGAGCGGATGATAAACGGAACTCGGGATCGGAAAATTGGAAGCAAAACCAACCACCGATTATAAAAAATAGT 
CAGCATCTTAAAAACGGGGATAATGGAGTCACAGAAGCGGCCTTCGTTGGACTTACACACGGATGTGCCAGCAGGATTAGCAGCTGGAGGATCTG 
GCCTGGGAGCTGCAGCTGAGATGTCGCCCACTTCCGGTTTCCTGCCGGACATGCCGCAGTGGAAGAAGGACCTCATCCAGCGCCGGAAAACGAAC 
GTGGCCCGCACCCAGGCGGCGTCCATCACCTCGCCCACCGATGGCAGTTGTGGGGCTTTGGCAGAAGCCAACGCAGCTCCAGGTGCAATTGCAGA 
TTTCACAGAACCGGCGACAATCAGTAGCACTAGTCAAAAGAGAAACATGATCGGTTCAGAGGAAAAGTCTGAGAAATCTTCTATTTCCAATACCA 
ATTCCGATTCCACTGGAGGTCATCACTCTGTTGTTGCCGTCTCCCTTTCGCCCGATGCGGCAGCAACAACAAATGTAACAGTAACACCAATACCA 
AAGCAGCGATCGAGTTTACTCAACACAAGAAGTCAGGAGAGGGAGATGGTGCGATATATTCTAAGCGAGAGTGGAGAACGGGATGGAGAGCTTGA 
GAGCGGCGAACAGCCGGCTGGTGTGGTGAGTAACAGCCGGTGCGGTGAAGTTGAAACTGGCACAATTGGATCGCCGTCGTCGTCAGCAAATCAAA 
ATCCAAACCCAAATCATTTAAAAACGAAATGCAAACCGGGACAGAGCGTTGCTGAGGGCAAGCCTTCAGCTAAAGAGACCATCGTCGATAACAGC 
AAAAGCTGCAGCAAAACCAAGAGTATTTCCGATAAATTGCAGAGCAACAAGTTTATAATTCAACAGCAGCAGCAACAACAACAACAACAGCAGCA 
ACAACAGCAACTGTCGCCCACAAAGGTAACGGTAAAACCGACAATGGTCGCCATGCAAGAGATGAAGAAGACAACCAAACAAAATGGCCAGCACC 
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GACATCTAGCGGGCAAAATTGGCAGCGTAGCAGGAGGCGATGTGGATCCACAACATCCACCGACGAATCCCATACCAGATTCATTGGACACCGGT 
GAGGATCTGAGCTACGGACCCGGAATTGTGTCCAAGCTGCGATGCCGCTACTTGAGCCTGGCCCTTCGCGAGTCCCGCCAGCAGAGCAGCAAACA 
ACGCCTCCAACGCTCCACCAGTCTGAACACCCTGCTCGACCGCGACGATGACGAGGTGGAAGTGGAGGAGCCCGAAATGACCAACAGCCAGGTGC 
GTGCCAAATCAACACCGCCACCGATATTGGGGGCAAAACCGACACCGAAACCCAGCAGCCAGCATCAGCGTCCCGTCAGTCTGGGCGCCAATGGA 
GCTGGATCAGCTGTTAACCCGCCCTCGAACTCGGATCAGGCCCAAACGGGCGCCGTTGCCAACGGAGGCGCAGGATCTGGCCAACGTTCGCGTCA 
CTTTAAGCGCGGCAACGAGGTGATGAAGCGGGCACGTTCCGTGGAGGCTCTGCTCTGCGAGAAATCGCCATGGAACAGCCAGAGAATCAGCACTG 
CCGGCGCAGCTGCAGCAGCAGCACCTTCACCACCTGCTCCCAAGGCCACAGCCGCTGCTCCCTCACCTGTTACATCGCCCACCTGCGTTACCATC 
GAGGACAAGATCCACAATGCCCGCGAACGACTGCACAGCGGGACGGATACGGCGCCACCCAAGCGTCTGACCTCCATCATCGATGACACCGAGCG 
GCCGCCACCGGATCTGGTCAAGCAGACGCTCAAGATGTTTGAGGCGAGTGCTAATCGACGACCGCGCGCAGCCCATCGTTCCAATGGTGTTGGCG 
GAGTGGCCAGCAAGGTGGCCAACTACAAATCGATCATTAAGGATCAGAAGCAACCATCATCAGCAGCGGCGACGCCACCGCCGACCGGCATGGGT 
TTTGCCTCCTCAACGCCGCTAAGGCATGTCCATCCGGATATTATACCCCGTCAGGTGGATTCGCCCGTATCGGCGTTGAGTGTAATGATGCGTCG 
CATGGAGCTGCAGGAGCCCGAGACGCCGGAAAGGGAGACACGGGACGCGGAGGCCACACCGCAGAGCGAGGCGCTAAGTGAGACTGAGCGAAATG 
AT C GC G AT GAAGGC GAT G GC G AAG TC G AT G AC G AC AA C C AC AAC AAC AAC G ACGAC GAT C AC GAC G AC GGCGACGACGGCAGC G AC AAC AATG AG 

CAACGCGATAAAATGGGCGCGGCGGCGGCAGGCGATAAGCCTAAGCCCCCAGCGGAATCGGCAGCGGGAGGCGCCCAACGATCATTCGCTGCCTC 
GACGGAGAACGCGCATGTAGCCAGCGGTAGTAGCAGCAGTACCAGTGCCAGTGTACGGAAGCTCAGCAACAACAACGACTCCAGCAGCGGACCAG 
CGGTGACCAAACAGATCGGTGTCATCCGTCCGCTATTCAATAGCCAGGGAGCCGGGAGCACGCCGCTAACGAGTCGCGAGATCGAAAAGAATCGC 
ATCAATGAGATGAAGAAGTCGACGGCCACAGATGCCGGTGGTTCGGTATCCGGATCCGGAACGGTTGCTGGTGCTGGATCAGGAACCTCGCCCAC 
CACTAGTCTCGACAGCGTGATAAACACCAAGGAAGCGGCCAGCAGCGAAACGGATGCCAGTGCCTCGCCACTGTGGACATTGCGCAAGCTGAGGA 
ATCAGAGCCAGACGGCCGGCGGAGGATCGGGTCCCAGTTCGAGCCTTCATTCCACAGAGAACACCTCGATGGTGTTCAACTTTTCCAAGAGCACC 
AAGGAAGTGCCCGACTACATCGAAAGCGACGTTGTGATTTACAGGCGCAAGCGGGAGCTGCCAAAGCCAAATGAACCTGGCTTCGTGCTCCTGGG 
CGATCTCTCCGTGGAGACGTCGACGGACACGGACTACGACGACTACTCCATGTGCCCGCCATCGCCGTGCGATGTGGAGTTTGAGAATGCCAACA 
TTGTGATCGACGGCAAGTCCAGCATACGCCAGAAACCCAAAGAGTCTTCGTTCCGCGTGCAGTTCAACGACACGCTGACGTCGACGTTTGAATAC 
CCCTCCGAGGCATCGATGACCATCGAGGATCCGCCGTACGCCGATCCCTTTGGCCATGTGAGCAAACACCATCAGATGCTGCTCGCCGAGCAGAT 
G C AT CT G T T G C AG C AC C AGG AGC AGCT GG AGC T GGAGC A GAT GC AC C AGC T GGG GC T GGC GC C GG AG C AG C AC CAT C AT GT G AC C GT C GAC GAGA 

TCATCGAGCTGCCCACATCGACGGCGGGACATGGACATGGCATCGGACTTGGACACAGGTCGGGGGCGGCGGGGGGCGGTGGCACGATGCTGGGG 

AATTTACCGTTGGATATTATTTAAATGTAACAGAATACATAAGCAAAGCGTTGAAATAATATCTATTGTATACCAAAACCAAAAATATCTATGAT 

AACAAACAAGCAAAGGCAATTAAACCGATCCGTAAACTGAATAACCGATTAATAACTGCAGCCGCTCAACCCGAACCGCAATAAATAACTATCAA 

GTAAAGAATAGAACAACTTAACAATGAAAAATGCAATATTCAATATGCGAAATGACAAATGCCTAATGCCAAATGCCAATCCAAATAACC 

ACCGATTCAAGTGCGAGAACAGAAATGCCACACCTCTAACCCAGTGCACTTGCTCTGTCTGCTCCGCTGGGCAATCAAACTAATCGCCAATTCAT 

ATTATTATTAAGGCTCCACGGCTCTGGGCTTCTACACGCCCATGAAGGGCACGGCGATGGACAACCTATTCCAGCTGGGTGTCACCCGATACGCC 

C TGC C C G AGAGC AG C AG C AG T GG GAG C AG C AAT GG C AGC C AC AGT C C AGGC AGC AAC GGG AGC AGT C C AG C CGG AGC T GC GG GC AGC C GGT T GAT 

GTTCAACGGGAATGGCAGCATTATGGGCATCGGCGAGGGACTGATCAAGGAGGGCGACTTGGCGGCAACGGAAGCAGGACCTGGAACTGCAGATG 
G AGC AGG T GC C T C C AAGAAGGG T T C AGC T GT GG AC G AG GATGATT AC CAT C T G ACG GC G AC GC C GG GCGC C G AAGT T GT GT AC AGC G AGGGT AC A 

CAGAAAACGGATTTGCTATATTAGGTCGTCCTGCCAGCCAAAAAACCGGAGCGTCTAGCTCTTTTTCTACCGGTTTCCACCAAACTCCAAACCTT 

CCGAGCATAACAACCTACCACATCCTGATCGTTCCCCTAATTGAAAAGCGTGCACTTAGGTGAAATATCAACTTCGGTTAGACGCGAAAACAGTA 

ACGAAGGAAAGAAAAATAAAAACAACAAGGCAACGTAAACCAAGTATGGCAAGAAATCCATAAAGATACGTATATAAATAGATATTATTTGTAAG 

TTTGTAAGCCTAGAGATCCGATGTGTATGTGTACAACAGCAAGCGTGTGTGTCTGTCTTCTTTATAAATATATATATATGGTTTTTTTTTTTTGA 

TAAATTAATAATTATATTATTGTAATGTATCTACATACGGCACTGAATGAACAACTTTGCTGCTGCAATTGTATTTGTATTTGTATTGTGTATTC 

CATTTACGGCATTTTGATAGTAAATATACAACAACAACAAATCAACGTATTTCCGACTGCATCCCGAAGGTTACTGAAACCTTAATCCACGAAGT 

ACGCGACGAGCGGTTAGTCCCGCCCGAGTATTAACTTAATGGATTCTAACGATCCGATCTTGTGACCCATCATCATCCCTTTTCGCGACCCTTCC 

AAAACGAAAATGAAGAACATTCTAAGCTCCGCTTGCCCATTCAGCGGAAGAAAGCAAAGGAAAATGTAACATTTGCCTCATAATTATTATATACA 

TTTAGCGCTTATTGAATATTTTTTACAATTGTATTCCGT 

(SEQ ID NO: 161) 

Start ATG: 308 

MESQKRPSLDLHTDVPAGLAAGGSGLGAAAEMSPTSGFLPDMPQWKKDLIQRRKTNVARTQAASITSPTDGSCGALAEANAAPGAIADFTEPATI 

SSTSQKRNMIGSEEKSEKSSISNTNSDSTGGHHSVVAVSLSPDAAATTNVTVTPIPKQRSSLLNTRSQEREMVRYILSESGERDGELESGEQPAG 

VVSNSRCGEVETGTIGSPSSSANQNPNPNHLKTKCKPGQSVAEGKPSAKETIVDNSKSCSKTKSISDKLQSNKFIIQQQQQQQQQQQQQQQLSPT 
KVTVKPTMVAMQEMKKTTKQNGQ 

LNTLLDRDDDEVEVEEPEMTNSQVRAKSTPPPILGAKPTPKPSSQHQRPVSLGANGAGSAVNPPSNSDQAQTGAVANGGAGSGQRSRHFKRGNEV 

MKRARSVEALLCEKSPWNSQRISTAGAAAAAAPSPPAPKATAAAPSPVTSPTCVTIEDKIHNARERLHSGTDTAPPKRLTSIIDDTERPPPDLVK 

QTLKMFEASANRRPRAAHRSNGVGGVASKVANYKSIIKDQKQPSSAAATPPPTGMGFASSTPLRHVHPDIIPRQVDSPVSALSVMMRRMELQEPE 

TPERETRDAEATPQSEALSETERNDRDEGDGEVDDDNHNNNDDDHDDGDDGSDNWEQRDKMGAAAAGDKPKPPAESAAGGAQRSFAASTENAHVA 

SGSSSSTSASVRKLSNNNDSSSGPAVTKQIGVIRPLFNSQGAGSTPLTSREIEKNRINEMKKSTATDAGGSVSGSGTVAGAGSGTSPTTSLDSVI 

NTKEAASSETDASASPLWTLRKLRJSIQSQTAGGGSGPSSSLHSTEWTSMVFNFSKSTKEVPDYIESDVVI YRRKRELPKPNEPGFVLLGDLSVETS 

TDTDYDDYSMCPPSPCDVEFENANIVIDGKSSIRQKPKESSFRVQFNDTLTSTFEYPSEASMTIEDPPYADPFGHVSKHHQMLLAEQMHLLQHQE 

QLELEQMHQLGLAPEQHHHVTVDEIIELPTSTAGHGHGIGLGHRSGAAGGGGTMLGNLPLDII* 

(SEQ ID NO: 162} 

Classification: known_flybase gene 
Gene Symbol: bif — 
FlyBase ID: FBgn0014133 

Celera Sequence No. : 142000013384767 

GATATTATTTGTAAGTTTGTAAGCCTAGAGATCCGATGTGTATGTGTACAACAGCAAGCGTGTGTGTCTGTCTTCTTTATAAATATATATATATC 
GTTTTTTTTTTTTGATAAATTAATAATTATATTATTGTAATGTATCTACATACGGCACTGAATGAAC7AACTTTGCTGCTGCAATTGTATTTGTAT 
TTGTATTGTGTATTCCATTTACGGCATTTTGATAGTAAATATACAACAACAACAAATCAACGTATTTCCGACTGCATCCCGAAGGTTACTGAAAC 
CTTAATCCACGAAGTACGCGACGAGCGGTTAGTCCCGCCCGAGTATTAACTTAATGGATTCTAACGATCCGATCTTGTGACCCATCATCATCCCT 
TTTCGCGACCCTTCCA7A7AACGA7AAATGAAG7AACATTCTAAGCTCCGCTTGCCCATTCAGCGGAAGAAAGCAAAGGA7AAATGTAACATTTGCCTCA 
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TAATTATTATATACATTTAGCGCTTATTGAATATTTTTTACAATTGTATTCCGTAAATAAATATAAAATACATCAACATCTAGTCGCGTGTTTTA 
AGTTAAAGGAAGTTAAGGCCAAAGCAAAATGTTAGCAACAGTTTTTGTTAAAAGTTAACATATATAATAATGGTAATTTAAATATTCAAATCTAT 
GAAGTGACTTATTTTATAAGTCCACTTATGTTTGTTATATTTAATTGGTATTTAGAAAAGGATTCATTTTAAATTAACACCGTTGAGGGTTCAAT 
TTTGGTAGTGCGTCTTTGCTAACCATGCCGAGTGGGATCAAGTCCTTGCTAAGTTAGGATGTTGTCTTAGCTAACTGTGCCATCCCTGGTCGGGG 
TGTGTCTTAGCTAACAATTACGTGTCGATAGATATAATGCTATCGATATGACAGAGTGTCGGGAACAATCGATAGCATTACTTCGATATCGGTAC 
TAGTGCCACCTCCAGTTTTGTGATTGTGTCAAAAATACAGGGTTTTATTGTTGCACTAGCAATTTAATCCGAGGAATATAAATTTTGTGTGGAGT 
TATGTTCGAGACGTCGTTGCCAGCGATTCAAGAGGCGAGCGGTCACACGTGTATGTGTTTCACTGGCGGTGCGGGCGCCAACTCCGCCAAAAAGC 
GTACTACTTTTAACCTTAGTTGACATTGAAATGTACCCGCTTAGTTGGATGTCCACGATGGTGCGTGTGTGTGCGTATATGTGTGCTATGCGGCG 
TCTGTGTGCGCGAGTGTGTGTGTGGGATTGCAAAGGAGGATGGGAGAAACGAGAGCAGAGCAAAAACAAAAACAAAAGGAACCTGCTGCACATAG 
TCGGTGCATGCGAGCGTGCTTATGTGTGTGTGCGTGCGTGCTAGCAAAACAACTGCAAGAAATCAGCAACAGTAGTAGAAGAAGAGAAGCTGTAT 
CACTGATAAGGAGGCTAAACAACAACAAAAACAGCAACGATAATGTGCACCCTTTCGAAATGCAAACAAAACAAAGCGTTGTAGTTGTGTAATCA 
TTCGTTGTGAACACACCACACACACACACACACACAACACTTGTTTAATAATACATTTTAATTCTATTATTTTTGATTTAATTTTTGCCTGTTAT 
TCAAATCGATCTGTGTTTTTGCAGTTGGCATTCGTTATTAATTAAATCAGCCGCCAACAGCTGAGCGAGAGAGCAATCGAGAGCGACAGGAGAGG 
ATCCACTTCCAATCGAATCGGGTGGAATCGAATAGCAGGGGCTTGGTAATTAAAAGGAAGACCGAACTGAACTGAACTAAACTGAAACGGAACGA 
AGCGAAGCGAAGCGCGGCGAGGCGGCAAGAAACGAAAAGAAAAGAACGAAAGTCGGCAGAGTCGGAAAGTCGGCAAGTCTGATTGTCGGCAAAGT 
CGCAAAGTCGGAGAGTCACAGTCTCAATCGCAGTCGGCGAGAGATTCCACTCGTGTCGCACGTCCGTTCGTTTCCACATTGTCGTTATAAAAATA 
GCAACGCCTTAAACGATCGCATCAACCTCATCAACAGAACACCACCAGAAATCACCGCCCCCTCCTACGCCTCAATCGCCATCGCCACCGCATCC 
CATCCCCATCATCTTCATTAGCAGGCAGCCTGACAGAGTAACTAACTGATCTTGATTGCGACCATAATCACCGTCATCATCATGTAATCCTTCAC 
TAACCAATTGATTTCCGCTCGTCCACCAAATAGATTCCAAACTGCCGTTTCGGCAGCCGGCAAGATTTCACGAAATCTAATGGCCAAGGACGAGG 
AGGATGATCTGCTGCCCGACAAGGATGCGGCCAAGGGCTTCTATGCCAAATACGAGCCCAAGGAGATTCTGGGCAGGTAAGAATCGAACTACACT 
TGCCGTGATCTCAAACCTCTTATTAAATCCTTTTAAATTCAGTTGAAGTATTATTGATCCTGCACTTAACCATTCAAATACGTTTCTTTCTATTT 
ACTTTGCGTAGTTAATTTTGCAAATTTAAGATCATAGAAACATTGATACTTGTATCCTCAGAAATCACTAGAACGTCCAAATATTGATTGTTTCT 
GCTTTCCCAACCTCTTACTGCCAAGTCTGCAGTAATGCAGTTTGGGCTAAATGGTGCCAAGTGAGTCAGCCTCATCAAGCTTCCATGGCATGGAG 
CAATGGCATGGGCTGCTTTTCAAGCGGCGCCTGTCTGCCGGCCTACTCACGCCACAACGAAGATCAGAATTTGCCAAGCAAATAGCAGAGACTAA 
GCGAAAACAAGGGCTAATTACATATTACAATTCTAGGGATTAGAGATTAGATAAATACCCGATCATGGCAATGTCAAGTGCAGCAGAAGCAGAGC 
TCTTTATCTACATGCAATGTTTGTCTTACAGGGGCATCAGTTCGACCGTGAGGCGATGTATCGAAAAGGAGACGGGCAAGGAGTTTGCCGCCAAG 
ATCATCGACTTGGGCGCCACCACAGAGTCAGGCGAGACGAATCCATATCACATGCTGGAAGCAACCAGACAAGAAATCTCAATACTGCGCCAGGT 
CATGGGGCATCCCTACATAAGTTAGTCTCACATAGCACCCACATGTACTTAATTCCACCACTAACCATCTACATTATCGTGCACAGTTGATCTGC 
AGGATGTGTTCGAGTCAGATGCATTTGTTTTCCTCGTCTTCGAGCTGTGTCCCAAAGGTGAACTCTTCGACTATCTGACCTCTGTGGTGACGCTA 
TC C G AG AAG AAG AC GC GC AC C AT T AT GC GAC AAAT C T T T G AGGG C GT C G AGT AC AT AC AC GC GAAG AGC AT T GT C C AT C GGGAT CT T AAG C CC G A 

GAATATCTTGCTGGACGAGAACCACAACGTTAAGATCACGGATTTTGGCTTCGCCAAGCAGTTGCAAGAGGGCGAGAAACTTACAAGTAATTATA 
ACACAGCATCCTTCTGTATTCGCTAAGCTAAACTATATGTACTTTCCCCCTATTAGATCTGTGCGGAACGCCGGGTTATTTGGCTCCCGAAACGC 
TCAAGTGCAACATGTTCGAGGGATCCCCCGGCTACTCGCAGGAAGTCGACATGTAAGAGCGAATAAACTAGCCACTTATTATTACACCTTTTTTT 
GGCTCACTTAAATTATAGAAGCAATTTAAGCACAAAAAGTATCACATTTATTGTAATACGAATTGTCTGCTCAAATTACTAAATTGCTTATGTTA 
ACAGCTAATTTTATTATTAAAAATTTTTGCTTACATAAAAAAAGAAAGGTCTAAACTTTTGTATCGATATTAGACCCGTTTTCCTTTTTTTTTTG 
TTTTATAGACGCAAATTTTGCTGATACATTTCATTAACTAGTGTTCCTCGTGTCAGGCGATTTGTTTTGATCTTTAGGTGTGCATTTTAATGAAT 
GAGTCAGAAGCAGAAAGGCATAATAATCTGAAATCTGAGTAGTTCCGATACCGACGTCTATATTTCTTCGGTTCAGCAAAACGTAGGGCTTTTTG 
TATTTTTGAGTTCTTTAACGTTTCCCACTGGTATCTACGTTACTTGGTTTCAGCAGAAATTTTTAATAGGAAAAATTGACACCTTTTACCCATTT 
CTTTTTCGATTTCTTTTTGTTGTTCGCAGATGGGCCTGTGGCGTGATTATGTTTACGCTGCTCGTCGGTTGTCCACCTTTCTGGCACCGCAAGCA 
GATGGTCATGCTGCGGAATATCATGGAGGGCAAGTACAGCTTCACCTCGCCGGAGTGGGCTGATATTTCAGGTTGGTCAAATTGATATGCACATG 
CCAAACGTATGCTTATTTACTTGGTTTCGATTGATTGCAGAGGATCCCAAGGATCTGATACGCAAATGTCTAGTCGTTGATCCTTCGCAACGTAT 
AACCGTCAAGGAAGTATTAAGACATCCATTCTTCAACCAAATGGTAAGTGCCGGAATTGTTGGCAGCCGCCGGGCTCCAATTACCACGACCACCA 
CCACCACCACTACTACAACCGCCTGTACAAGAGGAAAATCAGTCAAATTAGTTCAATTAGTCGCCAAGAATGAGGTCCGCCTACGAGTCGTCTTT 
TGTGCTCGGGCAGCAGTCTATAGTTTTCGGGTTCCTGTTCTTGTACATAGATACTAGATCTCTAGTTGATGGGTTTAACCTCGGGTGGGTTAATC 
TTCTACACAACCAATCCACCAAAACAACCAAACACCTGGCCCTGCAAATGTTTCTAAACACAACGTACATCTAAGAACACACACTCTGATGGAAA 
GAACACCCACCTGTGACTCACAAACCAACCACTCACAACAGCCTATTAACAGCTTCTTCTTACTGTTACCTCCTAAAAATACGATTATGATTTAT 
AATTCGGTTAGGGTAGCAGCCTAACCCCCCGTAGTTTAGTTCAATTGTCAGATTTATAGCCTGACAGGATCAGTCATTAGTTTGGATCCCCCGCC 
CCCCTTAAAAAGAAAAAGAAAGGAAAACCCTTCGGAACCTCTTGCGTTTTGTTAACTTTTTATTTCTATGATTACTATTTATTATACCAAACATT 
TTCTTTCTTCTTTTCCTTACATATAAACTATATATCAAATCGAATCCAATTAATTTTAAACGAACCCAAACGAACGATAACGAACACACACTACC 
ATCGATAGCTGTTCGAGCAAAACATTGATGGCCTCAAGCGCAGCCTGTCGACCAAGTCCCGACGAATGAGTCGCATCACTGAGATCGCACTGGTA 
ACGAACCGTTCCCGGACTAACCCAAGCTCTCTCTATCTCTTAATCTCTCTGTGCTGTGTGTCTCCGTGTTCATATATTGATATCGGAGCCGTAAT 
CGTAATCGCTCTAATAACGATTATCCCGATAATCCTGATAATCCCGATAATAAAATCAAGCAATCGGGCCGGCAGACCCAACTCATGACCATATG 
CAACCCTGCCGGCCCAGCATTCGTGTGTCTGTTTTCCGTTGCATTCTGTTCTGTTTCGTTCTGTGCCGATCTAATATAACCCAGCTAAACTGTAC 
ATCATAGTGCATATCAATTGTATACTATACATACTTATTTATTTAAAATCCGTTCAGGTCATCTGTGCCAAGCATAATAACTTTTAACCGGCCGG 
CAATAGCCAGCGAAAAACGAAACAAAAACGAATACGTACCCGATCTGCTTCGTTTTTGTTTGCATCTGATTTGATTTGCTTGTCGACTGATCGAT 
GCACCACAATGATAACAAAAAAAGAATATATATATATACATATATATAACCCTTCTCTATAATCCCATTCCTTGTTGTACTCGATGTTTTTTTGT 
CGATTTTGTGTGTTTTTTGTACAAATCAACCGATTGCCTCGATAACTAAAAACGATAACACACGTCTGCAGCTCTGCGCTCGCCATTTGAATCAT 
AATCTGTTAAGTTAATAACTGTGTCTGCATGCCAACCAACCAAAAAAAAAGCAAAACAAAAATCCAGCAGGGTTTCCCTTATCCCAACCTCCAAA 
GAAAACCCCACAATTCCTATTTTGAGTCACTGCATGCAAGCGTGCATTAACCATTCAGTCAGCGAGAAGAAGATTTGAAGATTTGGCGATAGTAG 
ATAGTAGAATTCACAGAATAGATACAATACCCCATCAAGGTGCATATAAGTATGAAACACGAAGGTATATATTATTGGTAGTCCACTAGGATGTT 
GTACACTTCCCACAGATCAAATAAGTAGACTGTAAGTTGAAGTCATTTTAGGTATTATATTCATATACGTATAAGAACGTCCTGTGCTTCACCCA 
ACCCAATTTTTAATTTAACCTCCGTGACTGATAAACCAATTTTAATTTCTTCCGGGCCCACTCTGCTGCTGCCTCTGACCCTCCTGCCCACTGCC 
AATCTCATCCTTCCTCCGTCTCGCCTCGCCTTCGGTTTTTTGTTTATTAATCGCGGTGTCGACGGGATCAGGTGTTAATGGGTGATCGCCGGCAT 
CCGGCCCCGCCCATCGCGCCCGCCCAAACGAACAGCAGGCATCTGCTGCAACCCGAGGCGTCATCGTATCGCTTTGGTCAGCTGAACAGTAGTTG 
CGCCGGTGCCCCCAATTATCTGTACTGTGCGCCACAGAGCTCCTACTCGTCGAATCGAATGCGCGACGGGGCGCTGGACGATGCGGGTGCCAGTG 
CTGCATCCGCCACCTGCCATGTCATGTCGCCATCCAACGCCTCCAATTCCTACGCCACCAACACCACAGCCAACACCTCGAACTCGGCCTTGGCA 
TCCGCCTCCTCCCACAAAACGCTCACGGCGATGGTCTACTATCAGCAGCAACCGCAGCAGCATCAGACGCAGCAGCAGCAGCAACAGTACCAGCA 
GCAACAGACGACACAGCTGCCGGTGCAGCTGCAGTTGCAGCATCCACTATTGCAGACCGGTGGCGATGGCAAGCAATCGGTGTATGCACCTCCTA 
CAGTGCAGGCAGGTTTTCCATTGATATTCTTTGATCTTGCTTACCCAACACGAATGGCCCAGCGGGCGATGGCAGGAATCGCTAGGGAAACGTTA 
AATTGCATACCACAGAGCAGGACGAGTGTGCCCCACAATTGCATTAGAGTTGCCTGAGATTTTTCATCATTAAAAGAATACTTAAAGCTTTGCTA 
CGATCATTTTAAATGTTTATATACAAATGCCTTTTCTTGACAAATGTTTATTTGAACTATTTATTTAATTGTGTATGTTTATATTTCCTTAAATA 
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TTATATTTAAATGTTAACTTATGTTGATTACTTTTGAAACATTTAACTTTCTATATTAATTTAAATAATAATTTCGTTTAAGACTATATATCAGT 
CAGTTTTGTATGGAATTTTTCGTTTTTGCTAACGTTTTTTGACAGCCGCGTGGGTTATACCTAATATTTAGTTGCCACACGTACTATTACCGATT 
AATTTCTAGTGCTTTTAGTTGATTCGAATCAGTCCACTCCACTCCATTAATCATCAGAGACAAGCTGCACTCGATTCACCTTCATTGTATCACTC 
ATCCATTAATCCATTTCACACCATTATGTCGTTCGGCTGTCTCTTTCTAAATGATGTTCGATCCGTCGATGGATCGTTATATCTAGCGATCGATA 
TGCTTTCCACTGCTGCTTTCCATTCAAACCAAATCATTCGTCTACTATTTAGCGTAAGACTTTGAAAGTACCTCTACTGCATTATACTATTACTA 
TTACTATTACTATTATTATATTCTCTTATAGCTGACACTTAGTCATATCATAGCGAGTAAACACTATGTTTGTATTATATGCACCTTTCTTTAGT 
ACATATCTCTCGGGTGAACTGGATGATATTTACAAGAACTGTATGTAGTTGACTTGCTTCTGCCGTTTATACCTCTGTGTTGTAGCCCCTATAGT 
AAATATATAGTATATAGATCATTAATCCCGCCTATCAGTGTCTCTCATTTGTCTCATGTGTCTTGCGGAGGAAGTCAGGATATTTAGATAAGCTA 
ACTAGCATGATAGACTTAAACTTAACTATCAATAACCCAACATCCACATTGAAAAAACTATGGTGCTTTTCGATCGTAACGTTAACGTTATTAAC 
CATTGCTTCTAATTACGTTGTAATATCAACTGTAAGCTTTATTTTATTCGGATTTAATATACTTTTGGGTGTGCATATGCTGCCCATCTATAATC 
GTATCAATTATGAAATAACGCCAAAATCGATTCACTGACGGATTATTTTGTTTAATTTCTCCTCTGTTGATGATGTTATACACGTTTTGTTTGTG 
TGTCATGTCCTCTATATTGTCAGAGTCTCAACGCCCTTATAAGATGTGTGTCTATAAGTCGGTGTTGTGGAGACTCCCTTGTGAGTAAACCAAGT 
CAAACTGAGATAGATTTGCATACAGTATTATATCCATACAACCACGCCCACTCCGCATATACCTAAAATTATCGTCTATGTATATTCGTTCTATC 
TGCATAATGCTACACATATGTCTATATATATATATATCTATCTATGTATGTATGTATGTATATCGCGCCAGTTGTCAATGTTGTTGTTCACCTCA 
AAATGTTTCTCTTACTGTAATTACCCCGCTAACCCATTGTCCTAACCTCTGACCCCTGACCCCAAACTGTTGGCTAATGGCTTTCGGTTATGGCT 
TTGTTGTCTTCTGTTACGGCCTCCGCTTGACCATCTAAACTATAAAACATTCGAAACGAAAGTAACTAATGCCACCCAGAATCACTAACGAATCA 
TCTATCGAAAAAACACTGACTGACCCAGTTGTTGTACTCTTCACTAAAAAATAACAACCATAAGAAAAGGACTAACACTGGCTGTCTACGATCCG 
TGTTGTTGTTCCTCCTTTGTGCGAGGAAGTAACCATTAACCATTAGGCTATAGTTCACCAACTGCTAACCACGAGTCGAATTAACAAAAAAAAAA 
AAAACCATAAACCCCTGCTAATTTAACCCGTGGCGTGTGCACCTCTAACCCAAACACGAAGCCAAAAGAAAGAGATCGATCCGCATTGCCGAGTA 
TCCAGATGGTTTAATCCGTATCCTTTGCCCTCTCTCTCTCTCCCTCTCTCTCTCTCTGTCTCTCTCTTTCTGATGTTTAGCTGCGCAAACAAAGT 
CGCTTCAACGCGCGCAAAAAGTTCCAATTCGCCATACTCGTGATACGGGCCATCATTCGGATACGGCGCCTACGCTTCACCGCCGAGCCGCTGCA 
CGTTGAGGAGGCCATCCGGGATCCGTATCGCGTCAAGGTTCTGCGCAAAGGTGAGTTGCTACAACAATTCATATATCGAATATAGTACACTAATT 
CAACTTTTGTTGTAGGTCATCGATGGCTGTGCCTTCCGTGTCTACGGGCATTGGGTCAAGAAGGGTGAGGGCCAGAATCGGGCCGCCCTCTTCGA 
GAATACTCCCCGCACGGAGCTCCACGCGCTCTACATCAACAATCTCAGCCGATAGAAGGCGGTCGGGCATGGGGCATGGGCCATGGGCCATGGGG 
AA T GG GGC GT G GC GG AGT C GC GC T GAT CT AC AC AAGC AC AC AC AC AC AC AC AC ACACAGAAAGAAAGAAAAAAGAAAATAAGAGGAAAC AC AG T A 
AGCAAGAAACCAAGAAACAAACCAACAAACGAATTAGCAAAAGCTACTACTATGAGAAGCAGGAAATTGCGATTGCAACAGGACACACCAAACAC 
ACCAAAAAGGATTCAATCCATCCATACATTCGGATTCGCCTCGTTCTGGACGGACAGCCAACACGCAGGTGTACATATATACGTATGTATGTATG 
TATGCATGCCTAGTAACATTACCTCCCCTGCCTTCTTCGTCCTTGTGATTTTGTTTAATACTTTTTAATTCGATTTTATTGCGTCTAATGCGCGC 
CAGCCATCTAGTGTAATCAGTTAGTGTAGTGAATGTAGCGTTCAACCACGGCCATGCATACCACCCAGATGGTACATGGCTATCCAGTGGTCCAA 
CGTTATGACCCGGCGCCAGAACGAGCCGAATCGGATGACGAGATGCGATTGATTGTTGGGTTGGGGGAATGTACACACCTCATCTCACCATACTC 
CACACCCAACTAAGCCCACACAAGCAAGCGTACATGCATATGTATTTTCTGTTAATGATTCTATTAAATTATTTAATTGTATTGCTAAGCGTTTA 
CATTTTAATTGTTATGTTGTAGTATACACAATAAATAAAAACAAACAA 

CCATTGCCTCCTAGTTAGTTGAGCAACTTAAGGATTATGGCAATTAATAATGGCTTAAGATGTTAGATGTGCTTGTCGCAATTGTATTTCCGCTC 
GCAGCAAGCGGGAAATTCGCGATTGATGTTCACAAAGTCGCGCTGCTTGCAGCCCGGTGGCGGAGCGACTGCATCGCAGGTTTTGAAGGTCACCA 
AGCCATCCGC 
(SEQ ID NO: 163) 

Exon: 1001.. 1095 

Exon: 1640.. 1755 

Exon: 2219.. 2356 

Exon: 2882.. 3060 

Exon: 3127.. 3411 

Exon: 3477.. 3567 

Exon: 4115.. 4251 

Exon: 4316.. 4413 

Exon: 6437.. 6943 

Exon: 9106.. 9264 

Exon: 9326.. 9460 

Start ATG: 22 65 

Transcript No. : CT5552 

TTGCACTAGCAATTTAATCCGAGGAATATAAATTTTGTGTGGAGTTATGTTCGAGACGTCGTTGCCAGCGATTCAAGAGGCGAGCGGTCACACGT 
TTGGCATTCGTTATTAATTAAATCAGCCGCCAACAGCTGAGCGAGAGAGCAATCGAGAGCGACAGGAGAGGATCCACTTCCAATCGAATCGGGTG 
GAATCGAATAGCAGGGGCTTGATTCCAAACTGCCGTTTCGGCAGCCGGCAAGATTTCACGAAATCTAATGGCCAAGGACGAGGAGGATGATCTGC 
TGCCCGACAAGGATGCGGCCAAGGGCTTCTATGCCAAATACGAGCCCAAGGAGATTCTGGGCAGGGGCATCAGTTCGACCGTGAGGCGATGTATC 
GAAAAGGAGACGGGCAAGGAGTTTGCCGCCAAGATCATCGACTTGGGCGCCACCACAGAGTCAGGCGAGACGAATCCATATCACATGCTGGAAGC 
AACCAGACAAGAAATCTCAATACTGCGCCAGGTCATGGGGCATCCCTACATAATTGATCTGCAGGATGTGTTCGAGTCAGATGCATTTGTTTTCC 
TCGTCTTCGAGCTGTGTCCCAAAGGTGAACTCTTCGACTATCTGACCTCTGTGGTGACGCTATCCGAGAAGAAGACGCGCACCATTATGCGACAA 
ATCTTTGAGGGCGTCGAGTACATACACGCGAAGAGCATTGTCCATCGGGATCTTAAGCCCGAGAATATCTTGCTGGACGAGAACCACAACGTTAA 
GATCACGGATTTTGGCTTCGCCAAGCAGTTGCAAGAGGGCGAGAAACTTACAAATCTGTGCGGAACGCCGGGTTATTTGGCTCCCGAAACGCTCA 
AGTGCAACATGTTCGAGGGATCCCCCGGCTACTCGCAGGAAGTCGACATATGGGCCTGTGGCGTGATTATGTTTACGCTGCTCGTCGGTTGTCCA 
CCTTTCTGGCACCGCAAGCAGATGGTCATGCTGCGGAATATCATGGAGGGCAAGTACAGCTTCACCTCGCCGGAGTGGGCTGATATTTCAGAGGA 
TCCCAAGGATCTGATACGCAAATGTCTAGTCGTTGATCCTTCGCAACGTATAACCGTCAAGGAAGTATTAAGACATCCATTCTTCAACCAAATGG 
TGTTAATGGGTGATCGCCGGCATCCGGCCCCGCCCATCGCGCCCGCCCAAACGAACAGCAGGCATCTGCTGCAACCCGAGGCGTCATCGTATCGC 
TTTGGTCAGCTGAACAGTAGTTGCGCCGGTGCCCCCAATTATCTGTACTGTGCGCCACAGAGCTCCTACTCGTCGAATCGAATGCGCGACGGGGC 
GCTGGACGATGCGGGTGCCAGTGCTGCATCCGCCACCTGCCATGTCATGTCGCCATCCAACGCCTCCAATTCCTACGCCACCAACACCACAGCCA 
ACACCTCGAACTCGGCCTTGGCATCCGCCTCCTCCCACAAAACGCTCACGGCGATGGTCTACTATCAGCAGCAACCGCAGCAGCATCAGACGCAG 
CAGCAGCAGCAACAGTACCAGCAGCAACAGACGACACAGCTGCCGGTGCAGCTGCAGTTGCAGCATCCACTATTGCAGACCGGTGGCGATGGCAA 
GCAATCGGTGTATGCACCTCCTACAGTGCAGCTGCGCAAACAAAGTCGCTTCAACGCGCGCAAAAAGTTCCAATTCGCCATACTCGTGATACGGG 
CCATCATTCGGATACGGCGCCTACGCTTCACCGCCGAGCCGCTGCACGTTGAGGAGGCCATCCGGGATCCGTATCGCGTCAAGGTTCTGCGCAAA 
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GTCATCGATGGCTGTGCCTTCCGTGTCTACGGGCATTGGGTCAAGAAGGGTGAGGGCCAGAATCGGGCCGCCCTCTTCGAGAATACTCCCCGCAC 

GGAGCTCCACGCGCTCTACATCAACAATCTCAGCCGATAG 

(SEQ ID NO: 164) 

Start ATG: 258 

MAKDEEDDLLPDKDAAKGFYAKYEPKEILGRGISSTVRRCIEKETGKEFAAKIIDLGATTESGETNPYHMLEATRQEISILRQVMGHPYIIDLQD 

VFESDAFVFLVFELCPKGELFDYLTSVVTLSEKKTRTIMRQIFEGVEYIHAKSIVHRDLKPENILLDENHNVKITDFGFAKQLQEGEKLTNLCGT 

PGYLAPETLKCNMFEGSPGYSQEVDIWACGVIMFTLLVGCPPFWHRKQMVMLRNIMEGKYSFTSPEWADISEDPKDLIRKCLVVDPSQRITVKEV 

LRHPFFNQMVLMGDRRHPAPPIAPAQTKSRHLLQPEASSYRFGQLNSSCAGAPNYLYCAPQSSYSSWRMRDGALDDAGASAASATCHVMSPSNAS 

NSYATWTTANTSNSALASASSHKTLTAMVYYQQQPQQHQTQQQQQQYQQQQTTQLPVQLQLQHPLLQTGGDGKQSVYAPPTVQLRKQSRFNARKK 

FQFAILVIRAIIRIRRLRFTAEPLHVEEAIRDPYRVKVLRKVIDGCAFRVYGHWVKKGEGQNRAALFENTPRTELHALYINNLSR* 

(SEQ ID NO: 165) 

Name: Phosphorylase kinase gamma 
Classification: protein_kinase 
Gene Symbol: PhKgamraa 
FlyBase ID: FBgn0011754 



Celera Sequence No. : 142000013383929 

CTTGGCCAACTGCATCACGGCGTGCCACATGACCGGCCACCAGCGAGATCTGCTTACGTTCCTGCAGCACATCCAGCCGGGCATCGAAGAGTGCA 
AAGCCGCGATGACCACTGTTATTATTCACCGGCGAGCGTGTAATCCGTTGCCACTCCTGCACATCGGTGCGCGTTCTGTCGCCGGTGTCCACCAG 
ATGGTAACTGGTTATACCGTCGCAGATCACACAGAATCGTTCCGAAACGAAAACTATGGTGTAGTTGTAGTCACCCGACCGATGTAGTTCCCCAT 
CGCCCTGATCCCTATAGTCCACCGTGAGGACAACGCGCTGGGCACGTGCGCGTCCAGTCTGTAGATCGTAGTTGCACTGCACCAGCTGGTGGCGC 
TTGTTAAAG AAGT AGCTGCAGTGGCGCATCCATGGATCGGCTACCAGGTGATTGTGTCGGGCAAAGAGCTCCGTGTGC AGC AGCGAGAACTGATT 
GC T G C T C GG AT CCAGC AC G T AGGC AT C AC AT C C AAG T T C CT G GC G AAG AAC AGGC AC C G AAT C G AAT GC C AAT T T GT AGC C AG AAAAG T C C GAGC 
ACACCAGGTTGCGATCCACTTTCAGTTCCACCACCGGCATCTTGCTTTACTGCGCCAATGCTGAGTTGGATTATATTGGAGATTTCGAAAAAATA 
GAGATAGATATTCCAAAACAAATCAAATCAAATCGGTAGTAGACCGGATAGAGATGGCAGATGACCGTTCGGCCGTGACTCGTATCGACTTTTTA 
TCGACAAGTGGGGGATATGTACGATACCTTGGCAATTTTCAAATCTAATAAGATTCCTTTTTTTAGTTTGAAATTTAACTAAACAATTAAATAGC 
T AAAAAG AAAAC ATT AGT T AGT C AC AT AT T T T T T TAT C GAAT C G AAAGT T T AAGC AAT GT AT C GAT AT AGC C C AC AT C GGT AT C T AGCT C AT CT C 
GATAGCGCCTGCCTATCGACGGACGGCCATCGGTAGTTGCGTGCCGTGGGAAAAACATAACGGGAGCGGAGCAGCTGTTGCCTGCTAATTGAAAA 
ATTAGACACAAGAACGCAAAATTGTGAGTTTCGTGTGGAGTTTGGCCAAGTGAAAAGCGTTGGCGGGAGGCGACTACTCCACCACAACAAATCCA 
GTAGTAGCTGCAATACGACTCACCCGACTCCACAAAGTTAGTTTAGTGACACAATTCGGTACGTGCGATAGGGCGAGGGGCAACTCAAAAGTTCA 
AAAT C T C AAGT AT AT C AAGT AAT T CCGCATCGTCGTGTTCCGCC C AC C C AAAAT AAT G C AC AT T G AGG AT GC C AAGT G AC G AG AGAG TAG AAGC G 
ACGATGTCCACGGTGCGCCTGCCCATTGTGGGCATGACTTGCCAGTCGTGTGTGCGCAATATCACAGAGCACATTGGCCAGAAATCGGGAATCCT 
TGGCGTGCGCGT CAT T C T C G AGG AG AAT GC GGGC T AT T T CG AC T AC GAT C CC C GAC AG AC GG AT CC C GC C C G GAT AGC C AGC GAT AT C G AC G AC A 
TGGGCTTCGAGTGCAGCTATCCAGGGGATGCAGCAGATCCTCCCGAAACTCCCGCCAGCGCATGGACCAACATACGGGTGGTGGGCATGACCTGC 
CAGTCGTGCGTGCGCAACATCGAAGGCAATATCGGCACCAAGCCGGGCATCCATTCCATTGAGGTGCAACTGGCGGCCAAGAATGCCCGCGTTCA 
GTACGATCCCGCCCAGTATGATCCCGCCCAGATAGCCGAGCTCATCGATGACATGGGTTTTGAGGCTAGTGTGCAGGAACCTCGATCGCCATCCC 
AATCGCCGTCGCCCGCCCCCGCCTCATCACCCAAAAAGAGGGCAACTCCTACTCCTCCACCTCCTTCATATGCTCAAAATGGTTCGGCAGTGGCC 
ATACCCGTCGAACAGGAGCTACTGACCAAATGCTTTCTGCACATCCGAGGCATGACGTGCGCCAGCTGTGTGGCCGCCATCGAGAAGCACTGCAA 
GAAGATCTACGGCCTGGACAGCATCCTGGTAGCCCTGCTGGCCGCCAAGGCGGAGGTCAAGTTCAATGCCAATGTTGTGACGGCCGAGAATATAG 
CGAAATCAATCACGGAGCTGGGCTTCCCCACCGAACTCATCGATGAGCCGGACAACGGCGAGGCGGAGGTGGAGTTGGAGATTATGGGCATGACT 
TGCGCCTCCTGCGTCAACAAGATCGAGAGTCATGTGCTTAAAATAAGGGGCGTGACCACCGCCTCCGTCACACTGCTAACAAAGAGGGGTGAGTG 
TTTTTTGTGTACTTAAGCTAAGTGGCTTTCTGGAACTAGTTTATCCCGAGAGAATTTCAAATCACGTTCTGGGTCATGCATTCGAATCGAATTGC 
GAGTCTGGAAGCATTCCACAAGTGGCAGTGCCAATTGTTATGCAATATTTATAATAATAAACACCTGGAATGGGGCCATCAACCGATATGCACCT 
GTTGCGCTTTATTCTTGCCCAACGAGCACAGGTGTTAAATTCGATCATATATCTTGTATATCTTAAGGGATCGAATAATGCAATGGGAGTAATGA 
GCTGT AAC AGAAGCAAACTATTGGC AAT CGTTAATTGATGCAACTTGTGGCAAT AAC AAAT ATT AAAT AAGCCATTCAAGGGAATTTCGTAGAAT 
GCCCTTATTTGTTTTGCATTTCGCAGAAAGCGACAGCTGTTCCTTTTTATTAACCTACTAGCATCGAGTTCGAGTAGAATCGGTGCAAATGTTTT 
TTTTTTTTTTATTGTTCGCTTTTCACAATTTCCACAATTGAAAACTGGCCCAGCAATTGTTTATTGCAACTGATATACTCGATTTACTTTATGCT 
TTTTTTTCTGGGGAGGGTGGCAGGTTCGTTGATAAGAAAGTGCGAGGAGCACTATACTCATATGCATGTATGTATGTATGTACTTTATAAACCAT 
ACACTACTAACTATGCTTTCAGCAAACAATGCCAGTGGCAACTAGATTAAATAGCTGGAATTCACCACGCTCGTGTGCGTGGGCAGTAAAGCTAT 
CTGGAACACTTGGATGAGGGGAATTCAATGGCAAGCCGAGAGTCGGGATATTTAAAATATTCAAGATATTCGCTAAAAAACACATAATATAATTA 
TTGTTTTGTTGGGCGCTGGCAGCTCGAGTTTTATTTGCCAGATTATAGCGTGTTTTTTGCGAGTGTTCAGGAAGAACTTATATATACATATTTAT 
GTATGCGATTAGCTGACAAAGAACAAATGAAATGATTTGAGCTCTTTGTTTTCATAAACTTTATACTGCCAATCGCATTTACCAATTAGCATTTA 
AGCCATGAAACTGAAAGTCCGCCTGGCCAAAGAAAAATCAAAACAAAAGTGAAAGAGCTCCAATACGCCTTGAAATACAAGTACATACATATAAT 
GTATGTACATATATATGTATGAATGTATAATAATTTATATACCCGCGATTGGCGGGCAAACGAGCGAGGAACTAATCTCGCTTTTGTCTGGAAAA 
AAAGAGGGAAACCTCTCTCTTTTTCAGCTTTGCAGTCTTTCTGCTCCGCATGCGTGGCCACGAACTCCACTTCCACTTCGTACACAGCGAGAAAT 
GGGGCTAACTATGATATAAGTATTATATAGAAGAGTGCTATCTACTTCGGCATATGATCAGCATATAAAATCTACTAATATTTAAAAGCACGAAT 
CGTACAATGTTTAGCAAGTTTCTTTTATTTTATGTTAAGCACGCACTTTTTCTTCGAGTGCATTTCTAGTTTCATTCACCTTCACTTTGATTTAG 
AGCTGTTTTTACTTTACTTTTGCTTTGCCAGCTCAAAAATGTTGATTTGAAATTAGTGAACTAGTGGCGAATTATGCAATAACTTGGCTTTGATC 
CACCACATTCACATTGAATTCTGGCTGATTTACGAATATCGGTTTGCATACCCGTATTTATGTATGTACTAGATGTACGTGCGCCGTTTTGAGAA 
GCTCAATTGCAAATTCAAAAATGAAAATGCATAAATGGGGAGCCTCGTAATTATGATTGCAATGGAACTGGTTTTCGATTTTTTCGCATTCGACA 
TGGAGACATGCGCACGGTGGAGAAAAGCGAGCGAAACTGTTGCGATTAAATGACACTGAAAATAATTTCAGACATTCAAAAGTGGTATGAAAACA 
GCGAGCCCACATGAATATCAATCGCTGGCAGCTGTATGATGGTATATAAAGTACAATATCGCTTGTTTTTTTAAAGTTTATTTGTTTATTTTGAA 
TTTACATATATACTTATTAAGATACTTAATTGTATTGTAATCATTTTCGAAAAACAATTACGTACCAGTCATGCTTCTGAAAGAGTTCATAGATG 
TTTTGGGCCAGCGAAGTTGGACTTCTGCCATATGGATTGGTCCAATTCCGTGCGGCTTCTTATCTTTCAATCAATTAGCACTTAAACTTCATTTA 
ACTAGCTTAACTTGTTGTTTGCCAATTGTTACTTGGCTTGTTTGCAGTTTGTTTGCCGCAAGTCGGTAGTCGAAATTGGGAATGTAAACTGTTGA 
GCAAAACATATGTTTTCCTCCGATTCTTATCGTCCGGCGATAAGTGAACGACAGAGAACGAACCACAGGCAATCTCAATTCAAATTGAAAATGTT 
CTAAAATTGTAACAACGAATCGCGGGCGATAAGATGCAATGGGCGGGCAAATAAAAGAAAGAAAGAACAAAAAAACTGAATCGAGAGCCCTAGAA 
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AATGTATATTTCGACGACTGCAGTGGAACTTGCATAACTCATAAGGGATAGATTAAGATACCCACATATATTACTTTTGCTACTTTACTTTTGAT 
TCGTTGAAATTCGAAAATGAAGTTCGAAAAAGCAATTTCTTGTGTAAACACACGAAACGCATCAATTATCAATTGAAAAGTGGGCTAAGATCGAT 
CGGTAGCACTGATAATTAGCGACAATTGTCACATTCGGTGTGATCAACCTAATCCTTCCTCAATTTTGCACACGACCCATCGTCTGGTGTCCACT 
TGGTGGCCCTCTAATTGAACGATAAATAATGATATCATGGGTTCTGTCAGCTCCCAAGCGATTCCCTTTGCTTCAATCTATGCTATTAATTTGAA 
GGCATATCTACAGATGACTCAAACAAACCGTTGCTCTAATCTAAACGTAAATTTGCCTAGTTACAGTTTACGCGATGACGAAACGAAACAGAATG 
ATCTTTTGTTCTATTCCCACTTCTGATAAGAACGTTGAACTAACAACGATTGCAATGAGTAGGCAAAAAGTATGCAGGCTCACAAATGTTGATAG 
ACTGAAATGTTGTGCCTGTTAATAAAATAAAAGTATTACTAAGCGCAGCCGATGGCTAAAACCAGTTTTCTTGGCCTCTGCGAACGAGCATAATA 
ATTCTATAATTTCTTTAAAATTATTCTTGGCTTGAGGAATAACTCATTATAGAACTGTATGATGAGTATGGTATATGGCGATTGCTCGCTTATCA 
ATGAATCCAATTGAAGCCAGTGACAATTTCCCCTCTTCAAATAAAGATCGGAAATGTGAAAAACTAATAAGACGATGAATGGGAGAACCTACTGG 
GGGAATAACATAGCCAACAGATGGCAATAATGTCGAGAAATTCCAGTTGCGGGCCAAAGTTGTGGGCGACAACATGCTCTAATCATGTAGAGAAA 
CGGCAGCGATTCAGTTTACACATTCTACATAGTTAAAAAGAAGCTATCTTATCTACAAATTGTGCTGGCGGGGAAAGGCAATGAACCTACAGCTG 
CGATATTTTGTGACCAAGATTCCCGGAGCCTCTGAGCCCCACGGATTTTTCTATAATCCGACTGAATGATGCGATAGTTTTTGGCCACATGTCTG 
CTATATGGGCAAATAATACTGCCTTAAGTTGTGAAATGAGGCAAATGTATGTAGCTCACAAACCAGAAACTATAAGCCATTGCTTATATCAAATC 
TGTCGCCATCGAACCGGTTGTATGCTCCAATTTCAATACCAGAACAACCGGCATTATTTCCACTCAACGACGGTGTACAAACACTTTTCATTTGA 
CGCCAGGGAAAAAGGAAACCCACAAAAATTCCCTAGGTTCCTCTGTCTATGTCCAAGGAAACATATGGAAACACTAGCCCTTCTCCTATAGAGTT 
TTCGCTGGTGTCTTGTCAAAACACAGGAGTAGCCATATAAATGTGCTAACAAGCTCCAGAAACTGTGATGAAATTGATATGGCATTCAGATATGC 
GAGTGCACATTCAGATTCACTTACACATACTCATGGAAATGAAATCGAATATACCTGAAAAGTAGTAATATTTCAATTCCCTAGATATACAAAGC 
TCATGCGGCTTCATTATTTTTCTTGGCTTGTACGCTTGGCATAGCAAATATCGTTATTAAAGATCAGCAGTATATAGCTTTCACTTCAATCAACT 
GAATCTTTGATAAATATCCTATGTGAATATTCACTGGCCATTTAAAGTGGCTTTCTCTATTTTTTTTTAACGACTGGTTCTGTAAAAGTTGCACA 
TGAAACAAGTCCGGTTTTTCAGTGTACCAATTGAAACTAACTAGGCTGAAATGAGGAGGAATAATAAATACTACGTCACTATAATTAACTTATTC 
GACTGATTTTAATAGATGATATTTAAGTTCTATCAACTTATAATCTTAATCTTAATTTTTACAGGCAAATTCCGGTACATCACGGAGGAGACGGG 
TCCCCGGAGCATCTGCGAGGCGATCGAGGCGCTGGGATTCGAGGCCAAACTGATGACCGGCCGAGACAAGATGGCCCACAATTATCTGGAGCACA 
AGGAGGAGATACGGAAGTGGCGCAATGCCTTCCTCGTCTCGCTAATTTTCGGTGGACCCTGCATGGTGGCCATGATATACTTCATGCTGGAGATG 
AGCGACAAGGGACATGCCAACATGTGCTGCCTCGTGCCCGGCCTGTCGATGGAGAATCTGGTGATGTTCCTGCTATCGACACCAGTGCAGTTCTT 
CGGTGGCTTTCACTTCTATGTGCAATCCTATCGTGCGATCAAACACGGCACCACGAACATGGATGTACTGATCTCGATGGTCACGACCATCTCGT 
ATGTGTACTCAGTGGCCGTGGTGATCGCCGCCGTGCTGCTAGAGCAGAATAGCTCCCCACTGACCTTCTTCGATACGCCGCCCATGCTGTTGATC 
TTCATATCGCTGGGTCGCTGGCTGGAGCACATAGCCAAGGGAAAGACCTCGGAGGCCTTGTCCAAGCTGCTGTCGCTCAAGGCGGCCGATGCCCT 
GCTGGTCGAGATATCACCGGACTTTGATATCATCAGCGAGAAGGTGATATCTGTGGATTATGTACAGCGTGGCGATATCCTAAAGGTGATACCGG 
GAGCCAAAGTTCCAGTGGACGGCAAAGTTCTGTACGGTCACTCCAGTTGCGATGAATCACTAATCACCGGGGAGTCTATGCCGGTGGCCAAGCGA 
AAGGGATCCGTCGTAATCGGCGGTTCCATTAACCAAAACGGTGTGCTCCTCGTAGAGGCAACGCACACGGGTGAGAATACGACGCTCGCGCAGAT 
TGTGCGATTAGTGGAGGAGGCCCAGACGAGTAAGGCGCCCATCCAGCAACTGGCCGATCGGATTGCCGGCTACTTTGTGCCCTTCGTTGTTGTAG 
TGTCCAGTATAACGCTGATTGCTTGGATCATCATTGGTTTTTCCAATCCAAACTTGGTGCCCGTGGCCATGGAGCATAAGATGCACATGGACCAG 
AATACCATCATCGTTAGTTATGCATTTAAATGTGCCTTGAGCGTGCTGGCCATTGCGTGTCCATGTGCTTTGGGCTTGGCCACGCCCACCGCTGT 
AATGGTGGCCACTGGAACGGGTGCGATCAATGGCGTGCTGGTGAAAGGAGCCACTGCCCTAGAGAATGCCCACAAGGTGAGCGAGCGAATAACCA 
TGCAAAACTTTAACCCACTTATACGCTTTTTACGCTTCCCAAATATTTTAGGTAAAGACTGTGGTCTTCGACAAGACCGGCACCATAACCCATGG 
CACTCCGATGACCAGTAAGGTTACCCTCTTCGTCACAGCCCAGGTCTGCAGCCTGGCGCGTGCTCTGACGATCGTGGGAGCCGCCGAGCAAAACA 
GCGAGCATCCGATCGCCTCCGCCATTGTGCACTTCGCCAAGGATATGCTGAACGTGGGCGCCACACCTCAGGCCGGCAGCTTTGGCAAGAGCAGT 
CACTTCCAGGCCGTGCCCGGCTGTGGCATCCGCGTCACCGTCTCCAATTACGAGCAGACTCTGCGACAGGCGTGCAACGCCGATCGGATCATTAA 
CTACGAGAACCTGCATCGCACTCATCCACAGGGTTCGGTGCCGGTGGACAATGGAGCCAGTATCGAACACTTGCTGCCGCAGCGCAGCGTACGCA 
AGTCCATGGAGCTGAACAATCAGCAGTTGCTATCCGACTTGGTATTGGAACCGGAGGAAGAGCTCCTTACGGATCAGAAAATCATCGATTCACCC 
GAGATTCTGGTGCTGATCGGCAATCGCGAATGGATGGAACGCAATGCCATCGAAGTGCCGCTGGAGATTAGCGATTGTATGACGCATGAAGAACG 
AAAGGGACACACAGCTGTGCTATGCGCCCTCAACGGGCAGCTGGTGTGCATGTTTGCGGTGTCCGACATGGTCAAGCCAGAGGCCCATCTTGCCG 
TCTATACACTAAAGCGAATGGGCATCGATGTGGTGCTGCTCACGGGGGATAACAAGAACACGGCAGCCAGCATTGCCCGAGAGGTGAGTGTCCAG 
CGTGCTCATAGCAATACTGTTCCTAGGCTAATGAACTTATGTATATTCATTAGGTTGGCATCCGAACCGTGTACGCGGAGGTTTTGCCCTCGCAT 
AAAGTGGCCAAAATCCAGCGGATACAGGCCAACGGTATACGCGTGGCCATGGTGGGCGATGGAGTCAACGACAGTCCGGCGCTAGCGCAGGCAGA 
TGTGGGAATTACGATTGCAGCGGGCACGGATGTTGCCGCCGAGGCATCGGACATTGTCCTGATGCGCAACGATCTGCTGGATGTGGTGGCTTGCC 
TGGACTTGTCCCGCTGCACGGTGCGTCGCATTCGGTACAACTTCTTCTTTGCCAGCATGTACAATCTGCTGGGCATACCGCTGGCCAGTGGCTTG 
TTCGCACCGTACGGCTTCACCCTGCTGCCATGGATGGCGTCCGTGGCCATGGCCGCCAGCTCCGTGAGCGTGGTATGCTCATCGCTGCTGCTCAA 
GATGTATCGCAAGCCCACGGCGAAGACGTTGCGCACGGCAGAGTACGAGGCGCAGCTGGCGGCGGAACGCGCAAGTGGGTCCGAAGACGAGCTGG 
ACAAGCTGTCCCTCCATCGTGGCCTTGACGATTTGCCCGAAAAGGGTAGGATGCCATTCAAGCGGTCCAGCACATCACTGATTTCGCGCATTTTC 
ATGCACGACTACGGGAACATCACTTCGCCGGATGCCAAATACGGGGAGGGACTGCTGGATCCCGAGGAGCAGTATGATGGCCGCACCAAGATCGT 
GAGGAGTCGCTTCCATGCCAATGACAGCACCGAACTGCAAAAGCTGTGAGCTAAACGATGGCAACTAGCAAGCTACTAGCAACCACAATCAAACC 
GAAGACCTTAAAACGACAAATTTTCAGTGAAATTGTTAGCGGACTATAAATTGAACCAGTAAGAGAGCGAATTATAAAAACCTATGTACATATTT 
GCAAATTGTTTTGACAAAGGTTGAGCACTAGGATACTAGGGTTATCTTTGATCTACGTGCAGTCTAATATGAGTTTGTGTGACTCACATGTGGGA 
AAAGATATAGGTATCTTAAGAAAATATTCTCGGTATGTAAATATATATAGCATAATCACGAAGTCATTCACTTTCATTTTAAGTATGGTTTTTTG 
AACTACTGAGGTGCCTTTTTATGTTAAAGCCACTACATACTTGTCTTTTCTTACAAACGTTGTATTGGAGATTGTAGAAAACAAAGATGACATTT 
CGTTACCACTGATTCTAGAGATCTTTTGTTCCAAAATCAATTTGTGACCGGTTTTCCATTTCTTAGTGCCGCACCCACACCGTCCCCCTTTTCTA 
TACAATTGTTATTTTCCTATCTTTTTTATTTTATTAAGAAACCTACAACAACAAAAGTTCGATACGATAGCTTCTTCTTTTCCACTTGATTATAT 
TATTATTATTATTGTATACAAATCTTTTTTTTTTTTGTTATAATCGTAAGTGCAATATTTTTATTATTAAGTTCAAATGAACAAAGGCAAAGGAG 
TCCACAGACCAGGAGATAAAACCCAGACATTAATCATCATTGAACTATTGTTACTCATTGTTAATTCACATTATCATATATATGCACATATGCAA 
AAGGTAAAATAAATCAAAAGACGAAAATCGAACAAAAAGACCATTTGACCCCATATCGGCATTAATCCTTAATTGTGCTATTCGCGGACGACGAA 
CACTTGTAAGACAATTGGGCACAAGCATAAGGCAAATTGAAACGCAAAAGAACGATTAACATTCTTTATGTGTAACTTTATTGTTTATAAACAAA 
AAGC 

(SEQ ID NO: 166) 

Exon: 1001.. 2273 
Exon: 6715.. 7961 
Exon: 8032.. 8448 
Exon: 8554.. 8823 
Exon: 8889.. 9644 
Start ATG: 1334 
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Transcript No. : CT5824 

AAAAACATAACGGGAGCGGAGCAGCTGTTGCCTGCTAATTGAAAAATTAGACACAAGAACGCAAAATTGTGAGTTTCGTGTGGAGTTTGGCCAAG 
TGAAAAGCGTTGGCGGGAGGCGACTACTCCACCACAACAAATCCAGTAGTAGCTGCAATACGACTCACCCGACTCCACAAAGTTAGTTTAGTGAC 
ACAATTCGGTACGTGCGATAGGGCGAGGGGCAACTCAAAAGTTCAAAATCTCAAGTATATCAAGTAATTCCGCATCGTCGTGTTCCGCCCACCCA 
AAAT AAT G C AC AT T G AGG ATG C C AAGT GAC G AG AG AGT AG AAGC G AC GAT GT C C AC GGTGCGCCTGCCCATTGTGGGCATGACTTGCCAGTCGTG 
TGTGCGCAATATCACAGAGCACATTGGCCAGAAATCGGGAATCCTTGGCGTGCGCGTCATTCTCGAGGAGAATGCGGGCTATTTCGACTACGATC 
CCCGACAGACGGATCCCGCCCGGATAGCCAGCGATATCGACGACATGGGCTTCGAGTGCAGCTATCCAGGGGATGCAGCAGATCCTCCCGAAACT 
CCCGCCAGCGCATGGACCAACATACGGGTGGTGGGCATGACCTGCCAGTCGTGCGTGCGCAACATCGAAGGCAATATCGGCACCAAGCCGGGCAT 
CCATTCCATTGAGGTGCAACTGGCGGCCAAGAATGCCCGCGTTCAGTACGATCCCGCCCAGTATGATCCCGCCCAGATAGCCGAGCTCATCGATG 
ACATGGGTTTTGAGGCTAGTGTGCAGGAACCTCGATCGCCATCCCAATCGCCGTCGCCCGCCCCCGCCTCATCACCCAAAAAGAGGGCAACTCCT 
ACTCCTCCACCTCCTTCATATGCTCAAAATGGTTCGGCAGTGGCCATACCCGTCGAACAGGAGCTACTGACCAAATGCTTTCTGCACATCCGAGG 
CATGACGTGCGCCAGCTGTGTGGCCGCCATCGAGAAGCACTGCAAGAAGATCTACGGCCTGGACAGCATCCTGGTAGCCCTGCTGGCCGCCAAGG 
CGGAGGTCAAGTTCAATGCCAATGTTGTGACGGCCGAGAATATAGCGAAATCAATCACGGAGCTGGGCTTCCCCACCGAACTCATCGATGAGCCG 
GACAACGGCGAGGCGGAGGTGGAGTTGGAGATTATGGGCATGACTTGCGCCTCCTGCGTCAACAAGATCGAGAGTCATGTGCTTAAAATAAGGGG 
CGTGACCACCGCCTCCGTCACACTGCTAACAAAGAGGGGCAAATTCCGGTACATCACGGAGGAGACGGGTCCCCGGAGCATCTGCGAGGCGATCG 
AGGCGCTGGGATTCGAGGCCAAACTGATGACCGGCCGAGACAAGATGGCCCACAATTATCTGGAGCACAAGGAGGAGATACGGAAGTGGCGCAAT 
GCCTTCCTCGTCTCGCTAATTTTCGGTGGACCCTGCATGGTGGCCATGATATACTTCATGCTGGAGATGAGCGACAAGGGACATGCCAACATGTG 
CTGCCTCGTGCCCGGCCTGTCGATGGAGAATCTGGTGATGTTCCTGCTATCGACACCAGTGCAGTTCTTCGGTGGCTTTCACTTCTATGTGCAAT 
CCTATCGTGCGATCAAACACGGCACCACGAACATGGATGTACTGATCTCGATGGTCACGACCATCTCGTATGTGTACTCAGTGGCCGTGGTGATC 
GCCGCCGTGCTGCTAGAGCAGAATAGCTCCCCACTGACCTTCTTCGATACGCCGCCCATGCTGTTGATCTTCATATCGCTGGGTCGCTGGCTGGA 
GCACATAGCCAAGGGAAAGACCTCGGAGGCCTTGTCCAAGCTGCTGTCGCTCAAGGCGGCCGATGCCCTGCTGGTCGAGATATCACCGGACTTTG 
ATATCATCAGCGAGAAGGTGATATCTGTGGATTATGTACAGCGTGGCGATATCCTAAAGGTGATACCGGGAGCCAAAGTTCCAGTGGACGGCAAA 
GTTCTGTACGGTCACTCCAGTTGCGATGAATCACTAATCACCGGGGAGTCTATGCCGGTGGCCAAGCGAAAGGGATCCGTCGTAATCGGCGGTTC 
CATTAACCAAAACGGTGTGCTCCTCGTAGAGGCAACGCACACGGGTGAGAATACGACGCTCGCGCAGATTGTGCGATTAGTGGAGGAGGCCCAGA 
CGAGTAAGGCGCCCATCCAGCAACTGGCCGATCGGATTGCCGGCTACTTTGTGCCCTTCGTTGTTGTAGTGTCCAGTATAACGCTGATTGCTTGG 
AT CAT CAT TGGTTTTTC C AAT C C AAAC TTGGTGCCCGTGGCCATG GAGC AT AAGAT GC AC ATGGAC C AG AAT AC C AT CAT C GT T AG T T A T GC AT T 
TAAATGTGCCTTGAGCGTGCTGGCCATTGCGTGTCCATGTGCTTTGGGCTTGGCCACGCCCACCGCTGTAATGGTGGCCACTGGAACGGGTGCGA 
TCAATGGCGTGCTGGTGAAAGGAGCCACTGCCCTAGAGAATGCCCACAAGGTAAAGACTGTGGTCTTCGACAAGACCGGCACCATAACCCATGGC 
AC T C C GAT GAC C AGT AAGGTT AC CCTCTTCGT C AC AGC C C AG GT C T GC AGC CT GG C GC GT G CT C T GAC G AT C GT GG GAGC C GC C G AGC AAAAC AG 
CGAGCATCCGATCGCCTCCGCCATTGTGCACTTCGCCAAGGATATGCTGAACGTGGGCGCCACACCTCAGGCCGGCAGCTTTGGCAAGAGCAGTC 
ACTTCCAGGCCGTGCCCGGCTGTGGCATCCGCGTCACCGTCTCCAATTACGAGCAGACTCTGCGACAGGCGTGCAACGCCGATCGGATCATTAAC 
TACGAGAACCTGCATCGCACTCATCCACAGGGTTCGGTGCCGGTGGACAATGGAGCCAGTATCGAACACTTGCTGCCGCAGCGCAGCATTCTGGT 
GCTGATCGGCAATCGCGAATGGATGGAACGCAATGCCATCGAAGTGCCGCTGGAGATTAGCGATTGTATGACGCATGAAGAACGAAAGGGACACA 
CAGCTGTGCTATGCGCCCTCAACGGGCAGCTGGTGTGCATGTTTGCGGTGTCCGACATGGTCAAGCCAGAGGCCCATCTTGCCGTCTATACACTA 
AAGCGAATGGGCATCGATGTGGTGCTGCTCACGGGGGATAACAAGAACACGGCAGCCAGCATTGCCCGAGAGGTTGGCATCCGAACCGTGTACGC 
GGAGGTTTTGCCCTCGCATAAAGTGGCCAAAATCCAGCGGATACAGGCCAACGGTATACGCGTGGCCATGGTGGGCGATGGAGTCAACGACAGTC 
CGGCGCTAGCGCAGGCAGATGTGGGAATTACGATTGCAGCGGGCACGGATGTTGCCGCCGAGGCATCGGACATTGTCCTGATGCGCAACGATCTG 
CTGGATGTGGTGGCTTGCCTGGACTTGTCCCGCTGCACGGTGCGTCGCATTCGGTACAACTTCTTCTTTGCCAGCATGTACAATCTGCTGGGCAT 
ACCGCTGGCCAGTGGCTTGTTCGCACCGTACGGCTTCACCCTGCTGCCATGGATGGCGTCCGTGGCCATGGCCGCCAGCTCCGTGAGCGTGGTAT 
GCTCATCGCTGCTGCTCAAGATGTATCGCAAGCCCACGGCGAAGACGTTGCGCACGGCAGAGTACGAGGCGCAGCTGGCGGCGGAACGCGCAAGT 
GGGTCCGAAGACGAGCTGGACAAGCTGTCCCTCCATCGTGGCCTTGACGATTTGCCCGAAAAGGGTAGGATGCCATTCAAGCGGTCCAGCACATC 
ACTGATTTCGCGCATTTTCATGCACGACTACGGGAACATCACTTCGCCGGATGCCAAATACGGGGAGGGACTGCTGGATCCCGAGGAGCAGTATG 
ATGGCCGCACCAAGATCGTGAGGAGTCGCTTCCATGCCAATGACAGCACCGAACTGCAAAAGCTGTGA 
(SEQ ID NO: 167) 

Start ATG: 334 

MSTVRLPIVGMTCQSCVRNITEHIGQKSGILGVRVILEENAGYFDYDPRQTDPARIASDIDDMGFECSYPGDAADPPETPASAWTNIRVVGMTCQ 

SCVRNIEGNIGTKPGIHSIEVQLAAKNARVQYDPAQYDPAQIAELIDDMGFEASVQEPRSPSQSPSPAPASSPKKRATPTPPPPSYAQNGSAVAI 

PVEQELLTKCFLHIRGMTCASCVAAIEKHCKKIYGLDSILVALLAAKAEVKFNANVVTAENIAKSITELGFPTELIDEPDNGEAEVELEIMGMTC 

ASCVNKIESHVLKIRGVTTASVTLLTKRGKFRYITEETGPRSICEAIEALGFEAKLMTGRDKMAHKYLEHKEEIRKWRNAFLVSLIFGGPCMVAM 

IYFMLEMSDKGHANMCCLVPGLSMENLVMFLLSTPVQFFGGFHFYVQSYRAIKHGTTNMDVLISMVTTISYVYSVAVVIAAVLLEQNSSPLTFFD 

TPPMLLIFISLGRWLEHIAKGKTSEALSKLLSLKAADALLVEISPDFDIISEKVISVDYVQRGDILKVIPGAKVPVDGKVLYGHSSCDESLITGE 

SMPVAKRKGSVVIGGSINQNGVLLVEATHTGENTTLAQIVRLVEEAQTSKAPIQQLADRIAGYFVPFVVVVSSITLIAWIIIGFSNPNLVPVAME 

HKMHMDQNTIIVSYAFKCALSVLAIACPCALGLATPTAVMVATGTGAINGVLVKGATALENAHKVKTVVFDKTGTITHGTPMTSKVTLFVTAQVC 

SLARALTIVGAAEQNSEHPIASAIVHFAKDMLNVGATPQAGSFGKSSHFQAVPGCGIRVTVSNYEQTLRQACNADRIINYENLHRTHPQGSVPVD 

NGASIEHLLPQRSILVLIGNREWMERNAIEVPLEISDCMTHEERKGHTAVLCALNGQLVCMFAVSDMVKPEAHLAVYTLKRMGIDVVLLTGDNKN 

TAASIAREVGIRTVYAEVLPSHKVAKIQRIQANGIRVAMVGDGVNDSPALAQADVGITIAAGTDVAAEASDIVLMRNDLLDVVACLDLSRCTVRR 

IRYNFFFASMYNLLGIPLASGLFAPYGFTLLPWMASVAMAASSVSVVCSSLLLKMYRKPTAKTLRTAEYEAQLAAEPJVSGSEDELDKLSLHRGLD 

DLPEKGRMPFKRSSTSLISRIFMHDYGNITSPDAKYGEGLLDPEEQYDGRTKIVRSRFHANDSTELQKL* 

{SEQ ID NO: 168) 

Name: copper-transporting P-type ATPase-like 
Classification: transporter 



Celera Sequence No. : 142000013384811 

TCGCCATCCTCGCTGCTGCCCGTGTAGCTGGCGGAGCTGAAGGAATCCGTTGAGGAGGCACTAAGCGAGCGGCTCCTGTACCGACTTGTCTTCTT 
CAGAGCCGAGGACTGCTTGGTGGCCGGCGGAGGAACCACCACGCTGGCATCCTGCGGCGCCGGTCCAGCCTGCTGCTGCTTGTCGCCGCCTCCGA 
ATCCAGGATCAACTACCACAGTGTCTGCCAGGTTGTTCATGGTGGAAGTAGAGGGTAATCTGGGGATTTCCTTACTATCCCTTAAATTGGCTCCT 
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TGCTCTAGCTCCTAGTGTTCCACCTTCTGCGACTCCAGAGCTTGCAATCCTTTTCCGCTGCTCCTTTGAATGGGCGCCTTGGATGTTATGTAAAG 

TCTGAAATCCAAGCAGGATGCGCTGCAAATGGCTAGTGGTGTGGCCTGGGTCTATACGAACTGCACAATCACGTGATTTATCGACGTTGTGGAGA 

CCACAAATTGCTAAATTAATTGTTTTTAGTGCGAGTCTCCTTTGTTTCACCTCTCCACTATTGTTTTTGTTCTTCTTCTTCTTTTGCGAATGGGG 

GGCTACACTTTGGATCCGGGGCTTGATGGGAGTTTCCGGAGAGGGGATGATGATGCAGCCGTCGGCTTTCGGGCGACGTTGTTTGTGTGGCGACC 

AAGTGGGGGATCGTTTGGCGATCGCTTATTGAATTTGATTTTATCCGGAGAAAAGAGAACTAAACCGAACCGAGGACAGTAAGCGAGACACGCGA 

AAATACCAAAAGCAGAAGGCGTTAGTGCTATTCTGCAAGTGTGGTAAGTTCCCACTGAAGAAAGGCCAAAGTAACGACTTTTTAACTGAAATCGT 

CTAAAAATTAATTTTTATAATTTCTTACCATTAATTATGTGTTTATTTTTTAATGAGCAAAGTAAATAAAGTGTGATTTACAACTCCTCAGCTGC 

AAGTCTTTTAAATCTGGTAGCTGGATAATTTAATGAGTCAAAAAGAAATTGGCAAAGTATTCGAGGTTTATTAAAAACAAATAGTTAACACATTT 

GGCCTACATATCCTCCTGTAGGTAGCCGCACAGCTCGTAGTCGTCCGCCGTGTTGACCAAGTGCGCTTCCCTAAGAAGGCCCTCCTTGTAGGTCT 

GTCTCTTGACAATCAAACGCGACAATTCCTCCGGTGCAAGTGGTCCTTGCTTGGCGTGTTTCTGGATGAAGTCCGCCGCCAGTTGTCGATCCACG 

ACGGACATCTCGAGGCTGCGACTCCAGGCGAGCAGGGCCAGCGGTCGCTGCGGACTCAGAGCCAGCGAGGGTGACACCAGATGGCGATACAGACA 

GCCACCCACGATGTCTTGCAGCTGCTTCACTTGGCCGGGAAAAGCGCAGGGATGGTAGAGCATTACGATGGCACCTTCCGCCAGATTACGCACGT 

ATCGCTGAGGAGGCAAGTAGGAATAGGTGCCATATTCAGCGGGCATCGGGCGATAGGGGCCACTGATAATACAGATTTGAATTTGAAATTAGCAG 

GAAACTATAATATTGTTCGCCTTACTTGGTGGCAGGCAGATGGGAGTAGCTAATGGACTCATTAAGGCACTTGGCAGGCGGCAGATAGGCGCTAG 

GTAGAAAGTGCTCCGTAAGAATGGCTTCTCTGGACAAATCCGGTTGGTACAAGCTGTGATTACTACTGATGCATAGGAAATTCAGGCTGTCACGG 

ATGTCGTTTGGATCGAAGTCCACGGCAAGATTGCTCTGAAAATTGTTAATAACAATGATAGAAAAAGATGACTAATACTAAGCATTCATCGGTAT 

TTAGGGGAAGTATTAGTTATATTGCCAAACTCACCTTGCCATCATCACAGCTGCTTTGTTTAGTCTTGTGCGATTCATTGGGACCCTGCGGAAAC 

AATTTACCCTGCCAAGTATCCTGTGTGGTGGACTTCCCCGACTTGGTTTTGTTATCGACCAGCGGTTTTTTCTTCACCAAGTGCGGCTCTCCGGG 

AGCTTGTGGAAACCACTTGCCCTGCCAGCCATCCTTGGTTGTGAACTCCAGCGGAGGAACTTCCTCCGTGGTGCTCTTCATTTTTAATTCAATCG 

CTTCCTGCTGTTGTCCAGGGGCATGGGGAAACCACTTGCCCTGCCAGCCATTCTTGCCGGGCGTCGCACCTAATGGAACCTGCACACGTTGCACA 

TCATTGGAAATGTCGGGTGGGGACTTCCCAATCTTCCCTAGCTTCTCATCAGGATCAAGAGTTGGCTCCAACAGATTGGCCTGCGACTTCTTGAT 

GGATTTCGCATCGATGAGGGCGAACCCAAGGCACAAGAGGGTCAAGTAATGCCAAATAAACATAATAATTGCAAGCAAAAGAAAAACAAAAGCGC 

AAAATAAGCAAACAAAAATATAATTAAGAACTTGGCAAAACAGCTGTGGAGCTGCGTTTTTCAACACTGGTCAAAGCATGACTTTGTTGCAGGGT 

AGCAACAGGTTGTTTGGAGTTTGCATAAATCTCCGATGATAATTATGTGTTGTTTTAATAACTAAATTAAAATTATTTACGCCAAATTAAAGTAA 

AT AAAGT T AAT T T AAAAT GAC T AC T T AAAT C AAGC AG AT T T AAAAGG AAAT C AAT GT AT AT TT G AAAT AT T C AG ACT G GAT T C T AAGGT T AAAT G 

ACAAATTACTCACCAGAAATCGGCTCATAATAATAAATCATTTTATTTTCAAATGCCCCCTGCGAACGCTCCCCCGCATTGCACACACCAGGGGG 

TATTTTACGAAGTGCCCACACCTGGCCACACTGGTCGCAGCTACAAAAAGAAGAAGAACAAGGGCCAACTTGATGGTCACTTGCTACTGCGCTTT 

ATTAATTCATTATTTTCCAAAATAATGCACATATATACCCGCCCACAGCCGCTCTGCGCCCCCGCTAATCGAAAAATGTGAGTGGTAAGCGTTTT 

TACCAGAGTCTTATCAACAATTTAAGCCTCAATTCCCTTCCGTTTCCGTGATTCGCCACCCAGAAAACAATAACAAAGGGGTGTTGTTGTTGCTG 

TCGCTGCTGCTG C GC T GG AAAAAC AACT C G CAT AAAAT T T T AAAAC GC AC AAAG AAGG C G AAC GT GG AAAGT TAGGC C AAC AAC C C AGT C GC GAT 

GTCGTGGTTCAATCCCTGGGATGGCCTCAAAACGAAAATGTGTCGCTACCTGCTGCAGCGATACCTGGGCCAGTTTTTCGAGAATAACTTGAACC 

TGGAGCAGCTGAAGGTGGACCTTTACAATGGCAAGGCTGTGGTGGAGGACATTTTCCTAAAGGTCAACGCCTTCAACGATCTCTTCGAGGACCAG 

GGCTGGGCCTTCGAGGTGGTCTCGGGCCACATTGGCCGCCTGACTGTCGTGGTGCCGTGGAATGCGCTGATGACCAACGACAGTAGCCTGGAGAT 

CCACAATCTAA 

(SEQ ID NO: 169) 

Exon: 2431. .1840 
Exon: 1745. .1546 
Exon: 1487. .1001 

Start ATG: 2343 (Reverse strand: CAT) 
Transcript No. : CT5902 

CGCAGCTCCACAGCTGTTTTGCCAAGTTCTTAATTATATTTTTGTTTGCTTATTTTGCGCTTTTGTTTTTCTTTTGCTTGCAATTATTATGTTTA 
TTTGGCATTACTTGACCCTCTTGTGCCTTGGGTTCGCCCTCATCGATGCGAAATCCATCAAGAAGTCGCAGGCCAATCTGTTGGAGCCAACTCTT 
GATCCTGATGAGAAGCTAGGGAAGATTGGGAAGTCCCCACCCGACATTTCCAATGATGTGCAACGTGTGCAGGTTCCATTAGGTGCGACGCCCGG 
CAAGAATGGCTGGCAGGGCAAGTGGTTTCCCCATGCCCCTGGACAACAGCAGGAAGCGATTGAATTAAAAATGAAGAGCACCACGGAGGAAGTTC 
CTCCGCTGGAGTTCACAACCAAGGATGGCTGGCAGGGCAAGTGGTTTCCACAAGCTCCCGGAGAGCCGCACTTGGTGAAGAAAAAACCGCTGGTC 
GATAACAAAACCAAGTCGGGGAAGTCCACCACACAGGATACTTGGCAGGGTAAATTGTTTCCGCAGGGTCCCAATGAATCGCACAAGACTAAACA 
AAGCAGCTGTGATGATGGCAAGAGCAATCTTGCCGTGGACTTCGATCCAAACGACATCCGTGACAGCCTGAATTTCCTATGCATCAGTAGTAATC 
ACAGCTTGTACCAACCGGATTTGTCCAGAGAAGCCATTCTTACGGAGCACTTTCTACCTAGCGCCTATCTGCCGCCTGCCAAGTGCCTTAATGAG 
TCCATTAGCTACTCCCATCTGCCTGCCACCAATGGCCCCTATCGCCCGATGCCCGCTGAATATGGCACCTATTCCTACTTGCCTCCTCAGCGATA 
CGTGCGTAATCTGGCGGAAGGTGCCATCGTAATGCTCTACCATCCCTGCGCTTTTCCCGGCCAAGTGAAGCAGCTGCAAGACATCGTGGGTGGCT 
GTCTGTATCGCCATCTGGTGTCACCCTCGCTGGCTCTGAGTCCGCAGCGACCGCTGGCCCTGCTCGCCTGGAGTCGCAGCCTCGAGATGTCCGTC 
GTGGATCGACAACTGGCGGCGGACTTCATCCAGAAACACGCCAAGCAAGGACCACTTGCACCGGAGGAATTGTCGCGTTTGATTGTCAAGAGACA 
GACCTACAAGGAGGGCCTTCTTAGGGAAGCGCACTTGGTCAACACGGCGGACGACTACGAGCTGTGCGGCTACCTACAGGAGGATATGTAGGCCA 
AATGTGTTAACTATTTGTTTTTAATAAACCTCGAATACTTTGCC 
( SEQ ID NO: 170) 

Start ATG: 89 (Reverse strand: CAT) 

MFIWHYLTLLCLGFALIDAKSIKKSQANLLEPTLDPDEKLGKIGKSPPDISNDVQRVQVPLGATPGKNGWQGKWFPHAPGQQQEAIELKMKSTTE 

EVPPLEFTTKDGWQGKWFPQAPGEPHLVKKKPLVDNKTKSGKSTTQDTWQGKLFPQGPNESHKTKQSSCDDGKSNLAVDFDPNDIRDSLNFLCIS 

SNHSLYQPDLSREAILTEHFLPSAYLPPAKCLNESISYSHLPATNGPYRPMPAEYGTYSYLPPQRYVRNLAEGAIVMLYHPCAFPGQVKQLQDIV 

GGCLYRHLVSPSLALSPQRPLALLAWSRSLEMSVVDRQLAADFIQKHAKQGPLAPEELSRLIVKRQTYKEGLLREAHLVNTADDYELCGYLQEDM 
* 

(SEQ ID NO: 171) 



Celera Sequence No. : 142000013384570 
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CAATGTGTACCAGTCAGTAAGTTTTCGAACATCACAGGCGTAGAGGCGCAGCTCCCACATGGGCTCCATTAGCAAGGTGGCGAGGATGGCAGATA 
TGGCAACCTCAACGAAAGCGCTGGCCGTCAAGATCAGCTGCTTCCGCGAAAACCTATGTGAATTCACTAAGTAAGTTCTTCTTCCAGTTTGATTA 
GTTCACTCACTTGCGATCCTTGGCGTCCATGTCAATGAAAAAGCAGTGCATCATAAGTGGCAGCATGGTCATGAATCCCAGGTATAGCCAGGAGT 
ACGTGCTTAACTCGTCGCGGCATGGAGAGCATGCGTAACTTTGGTTCACCTGAACAATATAAAAGTAAATTATTTTCCAGGCGCCGTTCCGAAGG 
ATGCTAGAACATTAACCCGAGATCCTCTGGGACAGACCCCGCAGCTGCTCCAGCTGTTGTTTCCTAGCGGTGATCGCCCACAGTAGGCTCCTGGG 
CAGCGTTCCAACGCGTCGTACATTCTCATGAAATCAAAACAATTACTTCGTATTGTATATTACATTAAACAAAATAAAAACAATACTCCTATCGA 
TATCGGACTTATCGTTATCAGTAGGACCACTGAATAAACGAATAGGTTGTTCGTGCACTTAAATTAATACTAATTTATAAAATGATAATACATCA 
GAAGCACATGCGAAATGTATTAAAAATAAGGAAATTATTTTCTACAGATTATATAATGGGCCCTAAGACTGCTTTCTAATTTGATCAAATTAATA 
ATACTATATGTTTTTATATATTTATTTACAATTAAACATGACTAGTAGAGTGTACATTCTTACAGTGCATTTCAATGATTTGATTTTATTTGAAA 
TCCCAATTGCCAGAAATTTAAATTTAGGGTCCCTTATATGTACCACATAAACCAGTTCGATAGGCGCGCTTTTGGCATGGCCTGATAACTCTGGT 
AAAAAGCACCGCGCGAACCGGTTCAATTCATAACTAGTTCGCCCGAACTGCACAACCCTAGCAAAAGAAATAGCTTAAATATAATTTATTTAACG 
CGAATTGGACAACGAGCGGGTAGAAAAAACCCTGTGGCCGCAACCGCGACGCTCGCAGGATACGCAGCTCTTCGCCGTCGGGCAGGCAGGTAAAA 
AGCAATGGCAATTGGAGTTGACCGCAGAACAACAGCAACGGCAGACGCCATTTTGTCTATTGGCCGGCTGATTTGTAAGCTCAATATTCGCGAAA 
TAGTTACCGCGACTAGCTTCGAGTGGCGCAGCCAAGTAAGGGCTCCCCGCTGTCCAGAGCAAAGGCGAAAGGAAGAGCAATCGCCCATTTTGAGG 
CCTGCGAAAAGTTGGGACTCGGTCGTCAGTTCGCTCGATTCCATTTGCGAAAAATAAATGTACGTGCGCGTGAGGAAGTGTGTGTGCGCGTGAAA 
AAACAGTGAAAAATAATAAAAAGCCGCACTGGCCGCCGTCCTTGGGTGACTTCCGTTAATCTCTTACAACTTTTAAAGCTATAGAAAGAATGTGA 
GCAATTTACCGCCCTCGACACACCCAATGAACCGCTAATTTTGGCATCTTTCTATAACTTTATGCGTAGTTATTTTCACCACCCTTCGCCTACAC 
CCTTTCCCCGCCCCTTTTTCCACCAGTGGCTCCCACCCACCGACCGACATTCCGCTTAAGGATGATGCCGGGCGGAGAGAGATAGAGAGTGAGTG 
CAAAATCCGTAAACTTGGACGTGATGTAATTTTCGGCAGGAAAATGTTCTTTGTTTGCCCAAAAAAATAACAATAAATAAATAAGAATGAAGAGC 
GAGGCGTAAAACAAAAACAATTTTAAATTAAACTCAAACGCACACACAAAAAGAGAGGGTTCTGTGGGTGAGCGTGCGTGCGTGTGAGTGGGTGA 
GAGCGTAATTTGTGTGCTTTTGGAGCATTATTTGGCGCTTGCTGCCCCGTCTATTAAATAATATTGATTTTTCATATTCCAACCCAGCGCTCTCT 
TACACACACGCAGGACCACTCACACACCCAAAACTGAGCTTCGTACACATGTGGGCCTTTTCGGTCTCTCTCTTAGCGCTGTATTTATCCGCCTC 
TTATTGCCTCTTTCGCTTCGTTGCACTTTATAGATTTTTGCAAACTGGCCGCTGACCAGGCATTGGGAAAAAGGACAATAAGAGGGAGCCCAGAG 
CAAGAAGTCCAACTAACCGACCAACTGTGCCTCAAGAAACAGCAAATCAGCGGGGAACTCCCAAACTGGACAAGCAGAAGCCCGGATACCTCACA 
TTTGCCATAAGACACCAGTCAAAAAACGACGCGCACTGCAGAACGCATTAGGATTAGCCCGGCACTAGAGCTCGTGCATTAACATCCGCTTCCGG 
TGCTCCGCAGAACGAGTTGGCTGGAGAGGATACCCCGCCACCATGACGTTCGACACTCGTAGGCACACCACCGGACAGCCGGGAAGCACAGCGCC 
CAGCTCAAGCAGCAGCACCACCAGCACAACAACCACCACAACGAGTCCGGCGCAGAGTGCCGGATCAGGATCGGGCATAGGAACAGGCACAGGAA 
CAGTCGCCAACTCGAGTCTGCCAGGAGGAGGATCAGGTAGCTTGGACGGCAACCAGGATCAACAGCCGGCCACAGACAGTCAGAGCTCTGATGAC 
GTGGCCGCTTCCTTGTCAGCGAACAGCGTGGACAGCACCATCACAATAGTGTAAGTGGTCGTGCTGCCGTATTTGCATGACTCCGCCCAATTAGT 
CGGGTTTTGTGCGTAGCACCAGTATGACCGCCACCCCTAAAGATCCCCAACATTCGTGTGTTTACTTTTGCGTAGTAATTCTTCCACATGCATTC 
TTTGTTTGTCTATGGTTGGGCCAGCACAATGTTCGTAATTCATTTAACTATGTTTCAGATGATTTTGGCCAGGCCACTGGTTAGTGACCATAAAT 
GTGGCAATAAATGTTATCTTTCTTTTTCTTCGATAGGTGCACTTGGCTGCTGTCCAGTTTTAGCAATGAACTTACCAAGCTTTACTAATATTACC 
TTACCCGGCTATTACTATCTCATTTTGAATGCCCGTCCACGGCCTATCTTATCGCTTGTCCATCAAAATAAGCTAACACTGAAAATTACAAAAAA 
AAAAAAAAAAAAAAT GT AT GC G T T T T GG C ATT TAT T T T AAT AG AAAT AAC CAT T T AAT AAAC AG AAAT GT T T AT GG AAG GGT GTGC T T T G AT T G A 
CCCCCTAAGAACTTTTTTTTTTATGTGTATTATTTAATTTGTTATTTTTAATATTGAATAATTGTCCATTTAATTAAATAGGCCTCCAGAGAAGC 
TTATATCCTCGTTTCCCACGACGAAACTGAGATCACTAACGCAAAAGATATCCAATCCGCGTTGGGTGGTGCCGGTGCTGCCTGAACAGGAATTG 
GAGGTTCTCTTAAATGCAGCCATCGAACTGACCCAGGCTGGTAAGTGTGTCCAAACGATTTATTTGATACCCTCTAAAGTCAATTTATTTGCAGG 
TGTGGATCACGACTGCGAGCCATGTGTGGAGTTTTATCGCAATGGACTGAGCACTTCCTTTGCCAAGATCCTAACCGACGAGGCGGTGAATTCGT 
GGAAGAATAACATCCACCACTGTATCCTTGTGTCTTGCGGCAAGCTTCTACATCTAATCGCCATCCACATGCAGCGAGACAATCCCTATCTGCTT 
GACTTGCTGGCTATTGTTTTTGATCCGGAGAACAAGTTCAACACATTTAACGCAGGCCGTCAACCGGAGTGCTTCGCAGCGCCGGACTACATCTG 
GGGTCAGTTGGACAGCAACAAAATGTACGCCCGTCCGCCGCCGGAGCCCAAGAATGCGCGTGGCTGGCTTGTGGATCTTATTAACCGCTTCGGAC 
AGTTGGGAGGCTTTGACAATCTACTGGAACGGTTTAACATCGGACTGGAACTGCTGAAGCGCAACCAAAACAAGTGCACTGGCAAGAATATTAGT 
GTCGAGGGTAGGGTTGAAAATGGGGCTCAGGATAATCGCCTCACCCTTGCCCTCATACACAGCCTCTTGCGTCCATTCGGTCAATGTTATGAGCT 
TCTAATGCCTGCCACTATAGCCAAGTACTTTATGCCCACCTGGAACGTTGTGTTAGACCTTCTGGACAGCTTCACAGACGAGGAGTTAAAGCGAG 
AAGTGAAACCTGAGGGACGCAATGACTACATCAATGGGATTGTAAAGTCAGCCCGCTTACTGGCCAGCCGCTTGACTGGCCAGGAGGAGCTAATC 
C G A GAT C T T G AG AT GT TT C GC C T AAAG AT G AT CCTGCGTCT GC T GC AGG T GT C C AGCT T C AAC GG C AAG AT G AAT GC C C T T AAT G AGAT C AA C AA 
GGTGCTCTCCTCGGTCGCATATTTCTCCCATCGATCCCAACCTTTGCCCCACTGTATGCCCGAAGACGAAATGGACTGGCTGACGGCGGATCGCA 
TGGCGCAATGGATAAAGTCGTCAGATGTATTGGGTGTCGTGCTCAAGGACTCGCTACACCAACCGCAGTATGTGGAAAAATTGGAAAAGATCATC 
CGCTTTCTTATTAAAGAGCAGGCACTAACGTTGGATGATTTGGATGCAGTTTGGAGAGCGCAGGCCGGCAAACACGAGGCAATCGTGAAGAACGT 
TCACGATCTACTGGCCAAACTTGCATGGGACTTTACTCCCGAGCAACTGGATCACCTCTTCGAGGCGTTCCAGGTGAGTTTTCTGCTTTAAAAGT 
GTATTAAGTAAGTTTAAGTTTTAAATTATGATTATTACATTTTCCGCGATGGTTTGTATATAAAATCTCATGAACAATGAAAACAATATTAATTT 
TTATTTCAGGCCAGCATGACCACTGCCAATAAGCGGCAACGGGAAAGACTTCTTGAGCTAATACGCCGCTTGGCTGAGGATGACAAAAATGGTGT 
GATGGCCCAAAAGGTGCTTAAGCTGTTTTGGACTCTGGCCCACAGTCAGGAGGTGCCGCCAGAGGTGCTTGATCAGGCGCTGGGCGCACACGTTA 
AAATCTTGGACTACAGCTGCTCGCAGGAGCGGGATGCCCAAAAGACCATTTGGTTGGATAAGTGCGTTGACGAACTTAAGTCAGGCGATGGATGG 
GTTCTGCCCGCCTTGCGTCTAATCCGGGATATTTGTTGCCTATATGATACCACGACAAATCATGCCCAGCGCACTCAAACGAGTACGAATCGTCA 
GCAGGTGATTGAACGACTACAGAATGATTATTCCTTGGTCATTCTGGTCACCAATAGTTTGACTGCATACATGGAAAAGGTACGCCAAATGGTAA 
CGGACTCACCAGGCCTGGATGCAACCAGAATCCTAATTGACGGAAGATTTCCGCATCACGTGCAAATAGCGGAGCGACTGGAGTTCCTTAAGTTC 
CTGCTGAAAGATGGACAGCTATGGCTGTGCGCAGATCAGGCAAAGCAGATTTGGCATTGCTTGGCGGTCAATGCAGTTTTTCCGGCAGATCGTGA 
GGAGTGTTTTAGATGGTTTGGCAAGTTGATGGGTGAGGAGCCTGACTTGGATCCTGGCATAAACAAAGACTTCTTCGAGAACAACATCCTGCAAC 
TTGATCCGCATCTACTAACCGAAAGCGGAATCAAGTGCTTTGAGCGCTTCTTTAAGGCTGTCAATTCCAAAGAGGACAAGCTGAAGGCGATACAT 
AGAGGTTACATGCTGGACAACGAGGATCTAATAGGAAAGGACTACTTGTGGCGGGTCATTACAACTGGAGGCGAGGAAATCGCCAGCAAGGCCAT 
TGATCTACTTAAGGAAGTGTCGACGGCATTGGGCCCACGTCTGCAGGAGAATATCGCAGAGTTTCATGAAATGTTCATTGGTGAGTGCTGCTCCC 
GCCTGCGCACACACTATGGAAATATCGTTATATTGGGCAAGACGCAGCTGCAAGAGGAATTAGATGCCCCCGATCAGTCGGATAATACCAACGAC 
GAGTCTAAAGATTCAAAAATGCGTTTCATCGAGGCGGAGAAGATGTGCCGAATTTTAAAGGTGCTGCAAGAATACGTAAAGGAGTGTGATCGTTC 
TTTCAGCGGCGATCGTGTTCACCTGCCACTTAGCCGGGTTACACGGGGCAAAAACACCATCTTATATATCCGCTTCCAGAACCCTGGCAGATCTA 
TCGACGACATGGAGATCGTTACACACAGCAACGAGACGATGGCCGCTTTTAAGAGAAATCTACTTAAGCGAATCAAGGGAACTTCGACGGCCAAC 
ATAAAGGTTGATCTCTTCTATGCTAATGACGAGATGATCGGGGTTTCCGATGAAATAAACCCACTTTATCAGTACACTATCCGGGATAAGATGAA 
TCTTACAGCAAAACTCACGCCCGTGGGTACGGGTCTAGCCAGCAGTCCCGACTCCTCGAGCGACTCGAGCACAGGATCTCCTCCGCGTCCCTGCC 
CTGACATGCAGCGTGTGGAGTCAGAAAGCACGCTACCCGGTGTGATCATCTCGCAAAACTACCAGTACACAGAGTTCTTTCTAAAACTGTACCAA 
CTGGGCAGTGATCTCGAGCACGGTCGTCTTCGAGACAGTGCAAAAGTTTTACTACACCTGCTTCCCTGCGACCGCCAGACGATCCGTCAGCTAAA 
GATTATGTGCAAGGTGCCGAAGGCTGCCGTAACGGTGGCTGTGACGGGTGACAAGATCGCTAAGGATGAGGAGGAGAAACTGTACCCCACTGAGC 
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AGGCGGGGATTGAAGATGAAGAGGTAAGACTATAGTTAGTCGAGTTAGTTCATTCAGGAAAGCTATAATGGTTATTTAATTTAAATTGTTTTCTG 
TATCAATCTCTTAATATCTCTTCTTTTTAGGAACACTGCACGCCAGAGCAAATGTTTTTGCACCCCACACCTGCTCAAGTGCTATACAACCTTAG 
TGTGTTGCACGGACTGCTGATTCCCGCGCTAGATCCACTAGGAGAAAGTGCCCTACTAGTGCAGTCTGCATGGATGCATTCTGGCTGCGCTCATT 
TCGTGCTGGAACTGTTGACCAAAAACAATTTTCTGCCTAGCGCCGATATGCACACAAAGCGTGCGTCCTTCCAGTGTGTGTTGAGGTTGGCAAAG 
CTGTTCCTGTACATAGTTGGCAGTGTATTGTCTCGTGTAGGCGATGAACCCATGATCTGCGACCTTGACAACGGATCCCGTTCCCAGGTTGACAT 
CCTGAAGCAGAACTTCTCCACGATGCCCAGTAGCTCACAGGGAACATTGAGGGCGATCTCCGCGAAACTTGCTGTGATCCTTGCCCGCGAGATGC 
TTTCAGCCAGCCCAGAAGGTGATCGGTGCCGAACGCTATTTAGCTCCACACTGCAGTGGTCATGTCCCGACATTTCAACCATTAAGGCGGTAGTG 
CAACTGGCCTGGGCCTCGTCTTGCGGAAATCTTCAGGCTCTAGGCAATAGTAGCGGCGACTTTGAAGACGAGGTGATCGTGCCGGACGGGCAAGA 
CTTTAGCATGTGCAAGGAGGCGCTAGAGGTTCTCACCATTTCGTTTATTCTGAATCCGAGTGCCAACGAGGCGTTGACTAGCGATCCGAACTGGC 
CAAAGTTCATCACCTCCATTGTTCTGAAGAACCCGCTTCGCCACGTGCGTCAGGTGGCCTCTGAGCAGTTGTTTTTGGCTTCTACCTATTGCGCT 
GGGGATCGGCGACCGTTCGTATACATGGTTAACTTGCTGGTTGGCGCGTTGAAGACGCTGGTTCCCCAGTACGAGTCAACTTGCGCTGAGTTCTT 
CTCGGTGCTGTGTCGGACCCTTTCTTATGGATGCATCTACAACTGGCCGCTGCAGATTAGCGAGGGATTGTTAGGCGATGAGATTAAGTGGCTGC 
AACGCATTCGCGAAAACGTCCACGCCACCGGCGACACACAAGTGCATGAGGAGCTCCTTGAAGGCCACCTCTGCCTTGCCAAAGAGCTAATGTTT 
TTCCTTGGCGCCGATTCAAAGGCGCAGCTGAACGAGCTCATCCACGAACTGATCGACGACTTCCTGTTCACGGCCTCTCGCGAATTTCTGCATTT 
ACGGCGTCATGGCAGTCTACGGCAGGACACCGTCCCGCCACCAGTTTGTCGCAGTCCGCACACAATCGCCGCTGCTTGCGATCTCCTCATTGCCT 
TGTGCCAGCTCTGTGTTCCTAATATGAAGCTACTCACCAATACACTTATAGACTTCGTCTGCACGGGTGAGTGTTGGAATCGTAAGATCATTTCA 
TGATTAAATGTTTCCTCAATTTTTTGTAGATACTGATCCGTTGCGCGAATGGGATTACCTGCCGCCGGTGGGCGCCAGGCCAACTAAAGGTTTCT 
GCGGACTAAAGAACGCAGGTGCCACTTGCTACATGAATTCGGTGTTGCAGCAGCTTTACATGGTACCAGCGGTCCGGGTGGGTATTCTGCGAGCC 
CACGGCGCTGCCACCACCGACGGCGAGGATTTCAGTGGCGATTCCGATTTGACGGGAGGCGGTCTTGGATCGGCCCTATTCTCGGGTCCCGCTTC 
TGCCCTAGTCAGCCTCCCCTCGTCGTCGTCAACTATCGAAGACGGTTTGCACGATGTGCGAAAGAACTATCACGTTGTGATCCTGAAACATGTGC 
AGGCCATATTCGCCCACCTGGGCCACAGTGCGTTGCAGTATTACGTTCCACGTGGTCTCTGGACACATTTTAAGTACAAAATTGTTTCATTGATT 
GGCATCACAATGATATTAATATATTATAATTTTTGTTTGCTAGGCTACTGGGAGAGCCCGTGAATCTTCGCGAGCAGCAGGATGCCGTGGAGTTC 
TTCATGTCCTTGCTTGAAAGTCTCGATGAAGGACTTAAGGCGCTAGGCCAGCCACAGCTTATGAATGCCACTCTGGGCGGTTCGTTCAGCGACCA 
GAAAATCTGCCAAGAGTGTCCCCATCGCTACTCCAAAGAGGAACCATTTAGTGTATTTAGTGTCGATATTAGGAATCATAGCTCATTAACCGAAT 
CTCTGGAGCAGTATGTCAAGGGAGAGCTGCTCGAGGGAGCCGATGCATACCATTGTGATAAATGTGATAAAAAAGTAATGTTCTCATCTCTGATA 
TTCATATACAATTAATTTTTATTAACATCATTTTTGTGTCCAACAGGTAGTTACCGTTAAGCGAGTGTGCGTTAAGAAGCTGCCACCCGTGTTAG 
CCATTCAACTAAAGCGTTTCGAATACGACTACGAACGCGTCTGTGCGATTAAATTTAATGATTATTTTGAATTTCCACGTATCTTGGATATGGAA 
CCATACACAGGTGAGTTTAACCGTAATTCCATACGTATATATCTTCTTATTGGATCTTCTTCTCCATGATCAGTGTCTGGCCTAGCTAAGCTAGA 
GGGCGAGGTGGTGGAAGTGGGTGATAATTGTCAAACAAACGTTGAGACGACGAAGTACGAACTGACCGGCATTGTGGTGCACAGCGGACAGGCTT 
CCGGCGGTCACTACTTCAGTTACATACTCTCCAAGTGCGTACCATAACAATGTTACAGTTTTTTTTAAGACTAATTCAATACAACACCTGCTCCA 
GAAACCCAGCAAACGGCAAGTGTCAGTGGTATAAGTTTGACGACGGCGAAGTCACCGAGTGCAAGATGCACGAGGACGAGGAAATGAAGGCGGAG 
TGCTTTGGC G GC GAGT AC AT GGGC G AAAC C T AC G AC AAC AAT C T T AAGC G CAT G C AGT AC C GT C GAC AG AAGC GC T GGT GG AAT G C C T AC AT GT T 
GTTCTACACCCGCTGTGATCAAACTCCCGTGCAGTATGAACCCAGCGTGGAGCAGTTGTCACTCGCAGAAAGTCGAAACATGGTTTTACCATTGC 
CAAAGCCCATTGAGCGCAGCGTGCGGTGAGTGGAAATAGTGCTTGACCACTATACAGATACTTTTTAATTATTTTTTTTCGATTGCAGACACCAG 
AACATTCGGTTTCTCCACTCCCGCAGCATTTTTTCCGTGGAGTTCTTTAACTTTATCAAAAAGTTGGTCAGCTGCAATCTACTTTCTGCTCGGAG 
CAATAAGATCGTCAGTTTGTATTCAAGTGGTTTTATTTCCCGTTTAAAAAAACTATTATTTTTTTAACAGACTCCTGCTGCAGAAGAGCTTTCAC 
TGCTGGGCGTCCAATTAGCATCGCAGTTTCTTTTCCACACCGGCTTCCGCACGAAAAAGTCTCTGCGAGGCCCCGTTATGGAATGGTGAGTAGCT 
TGGGAGGTATTGACTCGGGTATAATTAGGATACATCTTCTGCTTTACAGGTATGACGCGCTCTCACACCACATACGTTCCTCGGCACTAGTTCGC 
AAGTGGTTCGCTAATCATGCGCTCCTCTCGCCGCCATCGCGTTTAGGCGAGTACATTCTGATGGCTCCGTCTCCGGATGTCCGTACTGTCTTTGT 
CAAGTTGGTGGTTTTTTTCTGCCATTTTGCCATCAACGATGAACCTCTGACTGGCTATGACGGCGCTAATCTCTGCGAGCAGGTACTCATCAGCG 
TGTTGCGCCTCTTGAAATCCGAGGCAGCCGACTATGGCAAACACCTGCCCCACTATTTTAGCCTTTTCAGTATGTACGTGGGACTAGGAACACGG 
G AAAAAC AGC AAC TGCTTAGGGT GAG AAC TAT T T AC AAT AAG AT TGCTTGCCATT C T AAAC GAT T AT GT T C GT T T GC AGC T C AAT GTGCCGCTGC 
AGTTTATTCAGGTGGCGTTGGATGATGGCCCCGGTCCGGCAATAAAATACCAGTATCCAGAGTTCAGCAAGCTACACCAAGTGGTTTCGCACCTA 
ATACGCTGCAGCGATGTAAGCGAAAAATGCCAGAGCTCCAACCAGAACGCCCGTCCACTGTCCAATCCGTTCAAAGATCCCAATGTTGCGCACGA 
GGAGCTGACGCCCCTGTCCACTGAGTGCATGGACCTGCTCTTTAACCGCACGGGGTAAGGTGCATGGCATTTTACTGATAACTGTGCTCAGTTCA 
GAGAGCATTTCAAGCCATTTCCTTATGAAAGAAACGTACAGTGAGTGCTTACGATCCGTTTTGTCATTTTTTTTTCAGCTACATCAAGAAAGTGA 
TCGAAGACACTAACGTGGGCGACGAGGGTCTGAAGCTGTTGCAATACTGCAGCTGGGAGAATCCACATTTCTCTCGCGCAGTGCTAACCGAATTG 
TTGTGGCAGTGTGGGTTCGCCTATTGTCACGACATGCGTCACCACACTGATCTGCTTCTGAACATCCTGCTGATCGACGACTCGTGGCAGCACCA 
TCGCATACACAATGCTCTCAACGGAGTGGCGGAGGAACGTGAGGGTCTGTTAGAGACGATACAGCGGGCGAAGACGCACTACCAGAAGAGGGCGT 
ACCAAATAATCAAGTGCCTGACGCAACTTTTCCATAAGTCGCCGATTGCGCTGCAGATGCTCCACACAAATTCGAACATTACTCGGCACTGGAGC 
ATTGCAGTGGAGTGGCTGCAAGGCGAACTAGATCGGCAGCGGGGCATCGGTTGCCAGTACAATTCATACTCGTGGTCGCCGCCGGCGCAGAGTAA 
CGACAACACTAATGGCTACATGCTAGAGCGATCGCAGTCTGCTAAAAACACGTGGTCCATGGCCTTCGAGCTCTGTCCAGATGAGGTCAGCGAAA 
AAACGGTATCTAAAACATAACAATTATATTAAAGAATACTTTATTCTATTGATCGTTCATTATAATAGGATGAAAATAATGAGCCGAACTTGGAG 
ACGAACATGGATGAGAACAAATCGGAACCGGTGGCTCAGCCAGGAGGAGTGCTTGAGGGATCTACCGGCGGCACAGAGCAGTTGCCCGAAAATAA 
GACGCCAACCACAAGCAGCCCGTCGACTGCCGCCTGGCCAGCGCGTGGGGATAGCAACGCTATTCCACGACTTTCGCGTCAGCTATTTGGGGCTT 
ACACTTCGACTGGCAGTGGCTCCACTAGTGGAGGTTCCGCGCCAACTTCCGCCTTGACGACAACGGCGGGTAGTGGGGCTAACAGCGAAACAGAA 
AGCAGCGCCCAGGAAACAACCGGCGAGACGACCATTAACGGACTGACAAACAGCCTGGACCAAATGGAGATAACGGCCAAAAAGGTAACTCGGGC 
GAACAATGTCTAGGACAAGTGTCCATCTATAAGGTGTGTGTCCGTTGCCGCAATTGAAGAGTTCGAGACGTCTAAAAACAGTCCCAGAAAACTAG 
TAAACACAACAAACACATTTTACACTGGAACTCCAATCTGTATCTTTCTATCTGTCTTCTTTAATACGTGGATAGAAGATATGTAGTTTAGTCAA 
ACTAAACATGTCCTCCCGACGTTCTTTTCCATTTATATTTATGTTTACTAAAAGACACTATGTACTTTAACAAGAACGTTGGTTAATATAAAAAT 
ATTGGTTCGCTTCGACTGTTTACCCTTCGGTTTTCGAAGATCCACCACCTTCTTATTTAAACTCTTCTTTAATATTTCTACCATACTACAACATC 
ACCCTCGGTAACCCTTGCTGCGTTTCGGTGTTTAATCCTTTATTGTATATGAATTGTAACCACAAACGCAGGGTGTGCGAAGTCGCTCCAAAGGA 
GGCAAGCTCCCCCCGGTGCTGTTTCACCATCACCAGTTGCCGCCAGAGCAACAACTTAAACTGAAGCAGCAACAACAGCAGCAGCAGCAACAACA 
GCATCAGCAACTACGTGGTGATATAAACAAGACTTCCTCAAGCCGCTTAATTTGATTTGACGCCAAGTTAGTTAGAGTTGTTGTTTCGTTTGAGC 
TCAAAACGCCCGCCTTACTTTGTGTTTGTTGTTGTTTTCACGATCCCGGCACAAATGAGATGCACTTGCCATCCCCGGATCTGTCTTCCCCTTGG 
CTTTTTATTTCCTTTACTGCAAGTTTTCTCTACTTATCCATGCAAAACCACGCACGCCACAATCAGTCCACACACACAAAATCACAGGGGTCCAA 
CAAATTTGAATGTCAACGAAGTCAACGAAAACTAGTTGAAATACGATGGATATGTTTCTTAATTCGTACTTTTTTAAATAAAAAAACTAAGACTG 
ATTTCTAATATACCGAATCTTAAAAATCATTGTTTTTTTGTATGTTATATTGAAAAGAGTGTTTTTTGGTTCTTCAGCGAATTATTTCAACGAAA 
ATATACAATACAAAATATCAGTATATACATTTCATATAAAAAAGATATCCTAGATTAAAGTACATAACTAATCGTATTAGACTTTGACCTATTAA 
GATACTACTTTATGATCCACTTTTGAGGCGTGAATGAACCCGAAAACACTCGAAGCAAGATTGCATTACCGCTTAGCTGGCACCCGTAGCACTGG 
GCGATAGATTGTCAACCTCCACTAACCGAACTCTTTTTGTTGGCTATTCACAGAAGTGCCGCAGGGTGATAATAAAAAAGCTAGTGGAAAGTAAG 
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GACGAAGAGGACGCAACTACTGCTACCACAGCGGCCACCACAGAGGTCACAACATCGCCAGCGACAGCTATCGCAACGGCAGCAACTTTAGAGCC 
TGCCGGAATGAGTGAGCTCACGACGATGGTAGAGAAAAATCTTATAATAAGCCAAGAAAATCCACAAGCGAAAAGCTCATTGCAATAAAATGTAT 
TCGCAACCTAGCTGGAGGAGAAGATGATGAGTATCCATAGTGTTAAATGTAACCAAAGCAGCCTGACAATGAGAACGGTGTGGACAGGTGTTTAC 
TAGTGGGGATCCCCACCGTCGAAAACGAGACTTAATGGGTAAATACGAGAAAAAGAGCAGAATCCGTAGATATTAACAAAGGATACTTGGTTGGA 

gcgatgcgagtcaaggaacgaaagtctggtagataaggagatcaaaaccagcaaaggtgtttattagctatcgaactttataattacctttaata 

ATAAACAAATACACCAATTTTTTGTTTGGTTTTCTGTTTTGTTAAGCATTAAAAGGGATATGTTTATTTCTCCTCATTGTAAATGTGTCCCTTTT 

AATGTATAAATCTAAAAAAAAAAAATGAAAAGAACACCTTAAACTAACTCCCACAACACTACACAGAAACGGACGTAGGCAGAAAGAAACACACT 

CATAATCTATAAGAAGCGTTAAGTTGCACGTTTACAGTACCATATAATTGTATTTATATAGTATATAAATGATGTATTACGTATTGAAGCCGAGC 

AACACTTACTATACTAACCGAAAAAAATCAAAATGATGGACACTTAGATACGATACTAGGAAAAACTAATCGAAACGAATCGGCTGATTGTAAGA 

ATTAGGGCAAGAAACCAAATTGCATCTTTAGTACTCAGTACTCGCCTCTGAAACTGTTTCCATAATCATATTTCAACGGCATGAACCGATTTAAT 

GGGATGCTAATTAAAACAAAACAGTATAAACTCAAGAGCAGAATGGACTAAAGCAACACGAGCATAAAGAAAACTTGATTATGGTCATAAGTACA 

TATATTTTAAATGATTAGTGTTTTGTATTAAGTGAAATTTGTTAAACATTTATTACATTTTTATGTTCTAAGTTGCGAAATAAATGCGAAGAGAA 

AGCAAAGCGCGCAATGTCATTCATGCCTTCTCTCAGTTTCCATTTCCGTTTCCGTTTACCGACAGCAATCAACTCAAAGTCTAACTTTTATGCCG 

C GAG AT T T T GT AC GG AAC T C AAT T T T AAAT T AT GT AT T C AC AAAGAT GT ATT AC AT T T GG GAAATT AAT GT T C GT T AC AT C CAT T AAAT T C AGC G 

AAAATGAAACAAATAAATCACTTTTTAAAACCATTACTTTTGGCTTCCTTACTTACCCACAAAAACAATTCATGGCTAACAGTTTTAGACGTTTA 

ATGGGTT AAAGAT AAGAAAATTCC ATT AGATGGACACTTTTTTACTTAACGAT AAT AAATGCACTTCGTGATCCCTGAACCAAAAGCAGGCTAGT 

TAATATTTAAAATATAATTTTGTTTGTAAATGCAACAACTC 

(SEQ ID NO: 172) 

Exon: 1001.. 1134 

Exon: 2124.. 2710 

Exon: 3312.. 3460 

Exon: 3515.. 4728 

Exon: 4855.. 6768 

Exon: 6871.. 8236 

Exon: 8295.. 8718 

Exon: 8784.. 9099 

Exon: 9167.. 9320 

Exon: 9384.. 9534 

Exon: 9597.. 9905 

Exon: 9969.. 10080 

Exon: 10141.. 10250 

Exon: 10310.. 10661 

Exon: 10720.. 10979 

Exon: 11099.. 11690 

Exon: 11754.. 12149 

Exon: 13449.. 14051 

Start ATG: 2418 

Transcript No. : CT6055 

CACAACCCTAGCAAAAGAAATAGCTTAAATATAATTTATTTAACGCGAATTGGACAACGAGCGGGTAGAAAAAACCCTGTGGCCGCAACCGCGAC 
GCTCGCAGGATACGCAGCTCTTCGCCGTCGGGCAGGCAGATTTTTGCAAACTGGCCGCTGACCAGGCATTGGGAAAAAGGACAATAAGAGGGAGC 
CCAGAGCAAGAAGTCCAACTAACCGACCAACTGTGCCTCAAGAAACAGCAAATCAGCGGGGAACTCCCAAACTGGACAAGCAGAAGCCCGGATAC 
CTCACATTTGCCATAAGACACCAGTCAAAAAACGACGCGCACTGCAGAACGCATTAGGATTAGCCCGGCACTAGAGCTCGTGCATTAACATCCGC 
TTCCGGTGCTCCGCAGAACGAGTTGGCTGGAGAGGATACCCCGCCACCATGACGTTCGACACTCGTAGGCACACCACCGGACAGCCGGGAAGCAC 
AGCGCCCAGCTCAAGCAGCAGCACCACCAGCACAACAACCACCACAACGAGTCCGGCGCAGAGTGCCGGATCAGGATCGGGCATAGGAACAGGCA 
CAGGAACAGTCGCCAACTCGAGTCTGCCAGGAGGAGGATCAGGTAGCTTGGACGGCAACCAGGATCAACAGCCGGCCACAGACAGTCAGAGCTCT 
GATGACGTGGCCGCTTCCTTGTCAGCGAACAGCGTGGACAGCACCATCACAATAGTGCCTCCAGAGAAGCTTATATCCTCGTTTCCCACGACGAA 
ACTGAGATCACTAACGCAAAAGATATCCAATCCGCGTTGGGTGGTGCCGGTGCTGCCTGAACAGGAATTGGAGGTTCTCTTAAATGCAGCCATCG 
AACTGACCCAGGCTGGTGTGGATCACGACTGCGAGCCATGTGTGGAGTTTTATCGCAATGGACTGAGCACTTCCTTTGCCAAGATCCTAACCGAC 
GAGGCGGTGAATTCGTGGAAGAATAACATCCACCACTGTATCCTTGTGTCTTGCGGCAAGCTTCTACATCTAATCGCCATCCACATGCAGCGAGA 
CAATCCCTATCTGCTTGACTTGCTGGCTATTGTTTTTGATCCGGAGAACAAGTTCAACACATTTAACGCAGGCCGTCAACCGGAGTGCTTCGCAG 
CGCCGGACTACATCTGGGGTCAGTTGGACAGCAACAAAATGTACGCCCGTCCGCCGCCGGAGCCCAAGAATGCGCGTGGCTGGCTTGTGGATCTT 
ATTAACCGCTTCGGACAGTTGGGAGGCTTTGACAATCTACTGGAACGGTTTAACATCGGACTGGAACTGCTGAAGCGCAACCAAAACAAGTGCAC 
TGGCAAGAATATTAGTGTCGAGGGTAGGGTTGAAAATGGGGCTCAGGATAATCGCCTCACCCTTGCCCTCATACACAGCCTCTTGCGTCCATTCG 
GTCAATGTTATGAGCTTCTAATGCCTGCCACTATAGCCAAGTACTTTATGCCCACCTGGAACGTTGTGTTAGACCTTCTGGACAGCTTCACAGAC 
GAGGAGTTAAAGCGAGAAGTGAAACCTGAGGGACGCAATGACTACATCAATGGGATTGTAAAGTCAGCCCGCTTACTGGCCAGCCGCTTGACTGG 
CCAGGAGGAGCTAATCCGAGATCTTGAGATGTTTCGCCTAAAGATGATCCTGCGTCTGCTGCAGGTGTCCAGCTTCAACGGCAAGATGAATGCCC 
TTAATGAGATCAACAAGGTGCTCTCCTCGGTCGCATATTTCTCCCATCGATCCCAACCTTTGCCCCACTGTATGCCCGAAGACGAAATGGACTGG 
CTGACGGCGGATCGCATGGCGCAATGGATAAAGTCGTCAGATGTATTGGGTGTCGTGCTCAAGGACTCGCTACACCAACCGCAGTATGTGGAAAA 
ATTGGAAAAGATCATCCGCTTTCTTATTAAAGAGCAGGCACTAACGTTGGATGATTTGGATGCAGTTTGGAGAGCGCAGGCCGGCAAACACGAGG 
CAATCGTGAAGAACGTTCACGATCTACTGGCCAAACTTGCATGGGACTTTACTCCCGAGCAACTGGATCACCTCTTCGAGGCGTTCCAGGCCAGC 
ATGACCACTGCCAATAAGCGGCAACGGGAAAGACTTCTTGAGCTAATACGCCGCTTGGCTGAGGATGACAAAAATGGTGTGATGGCCCAAAAGGT 
GCTTAAGCTGTTTTGGACTCTGGCCCACAGTCAGGAGGTGCCGCCAGAGGTGCTTGATCAGGCGCTGGGCGCACACGTTAAAATCTTGGACTACA 
GCTGCTCGCAGGAGCGGGATGCCCAAAAGACCATTTGGTTGGATAAGTGCGTTGACGAACTTAAGTCAGGCGATGGATGGGTTCTGCCCGCCTTG 
CGTCTAATCCGGGATATTTGTTGCCTATATGATACCACGACAAATCATGCCCAGCGCACTCAAACGAGTACGAATCGTCAGCAGGTGATTGAACG 
ACTACAGAATGATTATTCCTTGGTCATTCTGGTCACCAATAGTTTGACTGCATACATGGAAAAGGTACGCCAAATGGTAACGGACTCACCAGGCC 
TGGATGCAACCAGAATCCTAATTGACGGAAGATTTCCGCATCACGTGCAAATAGCGGAGCGACTGGAGTTCCTTAAGTTCCTGCTGAAAGATGGA 
CAGCTATGGCTGTGCGCAGATCAGGCAAAGCAGATTTGGCATTGCTTGGCGGTCAATGCAGTTTTTCCGGCAGATCGTGAGGAGTGTTTTAGATG 
GTTTGGCAAGTTGATGGGTGAGGAGCCTGACTTGGATCCTGGCATAAACAAAGACTTCTTCGAGAACAACATCCTGCAACTTGATCCGCATCTAC 
TAACCGAAAGCGGAATCAAGTGCTTTGAGCGCTTCTTTAAGGCTGTCAATTCCAAAGAGGACAAGCTGAAGGCGATACATAGAGGTTACATGCTG 
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GACAACGAGGATCTAATAGGAAAGGACTACTTGTGGCGGGTCATTACAACTGGAGGCGAGGAAATCGCCAGCAAGGCCATTGATCTACTTAAGGA 
AGTGTCGACGGCATTGGGCCCACGTCTGCAGGAGAATATCGCAGAGTTTCATGAAATGTTCATTGGTGAGTGCTGCTCCCGCCTGCGCACACACT 
ATGGAAATATCGTTATATTGGGCAAGACGCAGCTGCAAGAGGAATTAGATGCCCCCGATCAGTCGGATAATACCAACGACGAGTCTAAAGATTCA 
AAAATGCGTTTCATCGAGGCGGAGAAGATGTGCCGAATTTTAAAGGTGCTGCAAGAATACGTAAAGGAGTGTGATCGTTCTTTCAGCGGCGATCG 
TGTTCACCTGCCACTTAGCCGGGTTACACGGGGCAAAAACACCATCTTATATATCCGCTTCCAGAACCCTGGCAGATCTATCGACGACATGGAGA 
T C GT T AC AC AC AGC A AC G AGAC GAT GGCCGCTTT T AAGAG A AAT C T AC T T AAGCG AAT C AAGGG AAC TT C G AC GGC C AAC AT AAAG GT T G AT CT C 
TTCTATGCTAATGACGAGATGATCGGGGTTTCCGATGAAATAAACCCACTTTATCAGTACACTATCCGGGATAAGATGAATCTTACAGCAAAACT 
CACGCCCGTGGGTACGGGTCTAGCCAGCAGTCCCGACTCCTCGAGCGACTCGAGCACAGGATCTCCTCCGCGTCCCTGCCCTGACATGCAGCGTG 
TGGAGTCAGAAAGCACGCTACCCGGTGTGATCATCTCGCAAAACTACCAGTACACAGAGTTCTTTCTAAAACTGTACCAACTGGGCAGTGATCTC 
GAGCACGGTCGTCTTCGAGACAGTGCAAAAGTTTTACTACACCTGCTTCCCTGCGACCGCCAGACGATCCGTCAGCTAAAGATTATGTGCAAGGT 
GCCGAAGGCTGCCGTAACGGTGGCTGTGACGGGTGACAAGATCGCTAAGGATGAGGAGGAGAAACTGTACCCCACTGAGCAGGCGGGGATTGAAG 
ATGAAGAGGAACACTGCACGCCAGAGCAAATGTTTTTGCACCCCACACCTGCTCAAGTGCTATACAACCTTAGTGTGTTGCACGGACTGCTGATT 
CCCGCGCTAGATCCACTAGGAGAAAGTGCCCTACTAGTGCAGTCTGCATGGATGCATTCTGGCTGCGCTCATTTCGTGCTGGAACTGTTGACCAA 
AAACAATTTTCTGCCTAGCGCCGATATGCACACAAAGCGTGCGTCCTTCCAGTGTGTGTTGAGGTTGGCAAAGCTGTTCCTGTACATAGTTGGCA 
GTGTATTGTCTCGTGTAGGCGATGAACCCATGATCTGCGACCTTGACAACGGATCCCGTTCCCAGGTTGACATCCTGAAGCAGAACTTCTCCACG 
ATGCCCAGTAGCTCACAGGGAACATTGAGGGCGATCTCCGCGAAACTTGCTGTGATCCTTGCCCGCGAGATGCTTTCAGCCAGCCCAGAAGGTGA 
TCGGTGCCGAACGCTATTTAGCTCCACACTGCAGTGGTCATGTCCCGACATTTCAACCATTAAGGCGGTAGTGCAACTGGCCTGGGCCTCGTCTT 
GCGGAAATCTTCAGGCTCTAGGCAATAGTAGCGGCGACTTTGAAGACGAGGTGATCGTGCCGGACGGGCAAGACTTTAGCATGTGCAAGGAGGCG 
CTAGAGGTTCTCACCATTTCGTTTATTCTGAATCCGAGTGCCAACGAGGCGTTGACTAGCGATCCGAACTGGCCAAAGTTCATCACCTCCATTGT 
TCTGAAGAACCCGCTTCGCCACGTGCGTCAGGTGGCCTCTGAGCAGTTGTTTTTGGCTTCTACCTATTGCGCTGGGGATCGGCGACCGTTCGTAT 
ACATGGTTAACTTGCTGGTTGGCGCGTTGAAGACGCTGGTTCCCCAGTACGAGTCAACTTGCGCTGAGTTCTTCTCGGTGCTGTGTCGGACCCTT 
TCTTATGGATGCATCTACAACTGGCCGCTGCAGATTAGCGAGGGATTGTTAGGCGATGAGATTAAGTGGCTGCAACGCATTCGCGAAAACGTCCA 
CGCCACCGGCGACACACAAGTGCATGAGGAGCTCCTTGAAGGCCACCTCTGCCTTGCCAAAGAGCTAATGTTTTTCCTTGGCGCCGATTCAAAGG 
CGCAGCTGAACGAGCTCATCCACGAACTGATCGACGACTTCCTGTTCACGGCCTCTCGCGAATTTCTGCATTTACGGCGTCATGGCAGTCTACGG 
CAGGACACCGTCCCGCCACCAGTTTGTCGCAGTCCGCACACAATCGCCGCTGCTTGCGATCTCCTCATTGCCTTGTGCCAGCTCTGTGTTCCTAA 
TATGAAGCTACTCACCAATACACTTATAGACTTCGTCTGCACGGATACTGATCCGTTGCGCGAATGGGATTACCTGCCGCCGGTGGGCGCCAGGC 
CAACTAAAGGTTTCTGCGGACTAAAGAACGCAGGTGCCACTTGCTACATGAATTCGGTGTTGCAGCAGCTTTACATGGTACCAGCGGTCCGGGTG 
GGTATTCTGCGAGCCCACGGCGCTGCCACCACCGACGGCGAGGATTTCAGTGGCGATTCCGATTTGACGGGAGGCGGTCTTGGATCGGCCCTATT 
CTCGGGTCCCGCTTCTGCCCTAGTCAGCCTCCCCTCGTCGTCGTCAACTATCGAAGACGGTTTGCACGATGTGCGAAAGAACTATCACGTTGTGA 
TCCTGAAACATGTGCAGGCCATATTCGCCCACCTGGGCCACAGTGCGTTGCAGTATTACGTTCCACGTGGTCTCTGGACACATTTTAAGCTACTG 
GGAGAGCCCGTGAATCTTCGCGAGCAGCAGGATGCCGTGGAGTTCTTCATGTCCTTGCTTGAAAGTCTCGATGAAGGACTTAAGGCGCTAGGCCA 
GCCACAGCTTATGAATGCCACTCTGGGCGGTTCGTTCAGCGACCAGAAAATCTGCCAAGAGTGTCCCCATCGCTACTCCAAAGAGGAACCATTTA 
GTGTATTTAGTGTCGATATTAGGAATCATAGCTCATTAACCGAATCTCTGGAGCAGTATGTCAAGGGAGAGCTGCTCGAGGGAGCCGATGCATAC 
CATTGTGATAAATGTGATAAAAAAGTAGTTACCGTTAAGCGAGTGTGCGTTAAGAAGCTGCCACCCGTGTTAGCCATTCAACTAAAGCGTTTCGA 
ATACGACTACGAACGCGTCTGTGCGATTAAATTTAATGATTATTTTGAATTTCCACGTATCTTGGATATGGAACCATACACAGTGTCTGGCCTAG 
CTAAGCTAGAGGGCGAGGTGGTGGAAGTGGGTGATAATTGTCAAACAAACGTTGAGACGACGAAGTACGAACTGACCGGCATTGTGGTGCACAGC 
GGACAGGCTTCCGGCGGTCACTACTTCAGTTACATACTCTCCAAAAACCCAGCAAACGGCAAGTGTCAGTGGTATAAGTTTGACGACGGCGAAGT 
C ACCGAGT GC AAG ATG C AC G AGG AC G AGG AAAT G AAGGC GG AGT GC T TTGGCGGC G AGT AC AT GGG C G AAAC C T AC GAC AAC AATC T T AAG C GC A 
TGCAGTACCGTCGACAGAAGCGCTGGTGGAATGCCTACATGTTGTTCTACACCCGCTGTGATCAAACTCCCGTGCAGTATGAACCCAGCGTGGAG 
CAGTTGTCACTCGCAGAAAGTCGAAACATGGTTTTACCATTGCCAAAGCCCATTGAGCGCAGCGTGCGACACCAGAACATTCGGTTTCTCCACTC 
CCGCAGCATTTTTTCCGTGGAGTTCTTTAACTTTATCAAAAAGTTGGTCAGCTGCAATCTACTTTCTGCTCGGAGCAATAAGATCACTCCTGCTG 
CAGAAGAGCTTTCACTGCTGGGCGTCCAATTAGCATCGCAGTTTCTTTTCCACACCGGCTTCCGCACGAAAAAGTCTCTGCGAGGCCCCGTTATG 
GAATGGTATGACGCGCTCTCACACCACATACGTTCCTCGGCACTAGTTCGCAAGTGGTTCGCTAATCATGCGCTCCTCTCGCCGCCATCGCGTTT 
AGGCGAGTACATTCTGATGGCTCCGTCTCCGGATGTCCGTACTGTCTTTGTCAAGTTGGTGGTTTTTTTCTGCCATTTTGCCATCAACGATGAAC 
CTCTGACTGGCTATGACGGCGCTAATCTCTGCGAGCAGGTACTCATCAGCGTGTTGCGCCTCTTGAAATCCGAGGCAGCCGACTATGGCAAACAC 
CTGCCCCACTATTTTAGCCTTTTCAGTATGTACGTGGGACTAGGAACACGGGAAAAACAGCAACTGCTTAGGCTCAATGTGCCGCTGCAGTTTAT 
TCAGGTGGCGTTGGATGATGGCCCCGGTCCGGCAATAAAATACCAGTATCCAGAGTTCAGCAAGCTACACCAAGTGGTTTCGCACCTAATACGCT 
GCAGCGATGTAAGCGAAAAATGCCAGAGCTCCAACCAGAACGCCCGTCCACTGTCCAATCCGTTCAAAGATCCCAATGTTGCGCACGAGGAGCTG 
ACGCCCCTGTCCACTGAGTGCATGGACCTGCTCTTTAACCGCACGGGCTACATCAAGAAAGTGATCGAAGACACTAACGTGGGCGACGAGGGTCT 
GAAGCTGTTGCAATACTGCAGCTGGGAGAATCCACATTTCTCTCGCGCAGTGCTAACCGAATTGTTGTGGCAGTGTGGGTTCGCCTATTGTCACG 
ACATGCGTCACCACACTGATCTGCTTCTGAACATCCTGCTGATCGACGACTCGTGGCAGCACCATCGCATACACAATGCTCTCAACGGAGTGGCG 
GAGGAACGTGAGGGTCTGTTAGAGACGATACAGCGGGCGAAGACGCACTACCAGAAGAGGGCGTACCAAATAATCAAGTGCCTGACGCAACTTTT 
CCATAAGTCGCCGATTGCGCTGCAGATGCTCCACACAAATTCGAACATTACTCGGCACTGGAGCATTGCAGTGGAGTGGCTGCAAGGCGAACTAG 
ATCGGCAGCGGGGCATCGGTTGCCAGTACAATTCATACTCGTGGTCGCCGCCGGCGCAGAGTAACGACAACACTAATGGCTACATGCTAGAGCGA 
TCGCAGTCTGCTAAAAACACGTGGTCCATGGCCTTCGAGCTCTGTCCAGATGAGGTCAGCGAAAAAACGGATGAAAATAATGAGCCGAACTTGGA 
GACGAACATGGATGAGAACAAATCGGAACCGGTGGCTCAGCCAGGAGGAGTGCTTGAGGGATCTACCGGCGGCACAGAGCAGTTGCCCGAAAATA 
AGACGCCAACCACAAGCAGCCCGTCGACTGCCGCCTGGCCAGCGCGTGGGGATAGCAACGCTATTCCACGACTTTCGCGTCAGCTATTTGGGGCT 
TACACTTCGACTGGCAGTGGCTCCACTAGTGGAGGTTCCGCGCCAACTTCCGCCTTGACGACAACGGCGGGTAGTGGGGCTAACAGCGAAACAGA 
AAGCAGCGCCCAGGAAACAACCGGCGAGACGACCATTAACGGACTGACAAACAGCCTGGACCAAATGGAGATAACGGCCAAAAAGAAGTGCCGCA 
GGGTGATAATAAAAAAGCTAGTGGAAAGTAAGGACGAAGAGGACGCAACTACTGCTACCACAGCGGCCACCACAGAGGTCACAACATCGCCAGCG 
ACAGCTATCGCAACGGCAGCAACTTTAGAGCCTGCCGGAATGAGTGAGCTCACGACGATGGTAGAGAAAAATCTTATAATAAGCCAAGAAAATCC 
ACAAGCGAAAAGCTCATTGCAATAAAATGTATTCGCAACCTAGCTGGAGGAGAAGATGATGAGTATCCATAGTGTTAAATGTAACCAAAGCAGCC 
TGACAATGAGAACGGTGTGGACAGGTGTTTACTAGTGGGGATCCCCACCGTCGAAAACGAGACTTAATGGGTAAATACGAGAAAAAGAGCAGAAT 
CCGTAGATATTAACAAAGGATACTTGGTTGGAGCGATGCGAGTCAAGGAACGAAAGTCTGGTAGATAAGGAGATCAAAACCAGCAAAGGTGTTTA 
TTAGCTATCGAACTTTATAATTACCTTTAATAATAAACAAATACACCAATTTTTTGTTTGGTTTTCTGTTTTGTTAAGCATTAAAAGGGATATGT 
TTATTTCTCCTCATTGTAAATGT 
(SEQ ID NO: 173) 

Start ATG: 429 
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MTFDTRRHTTGQPGSTAPSSSSSTTSTTTTTTSPAQSAGSGSGIGTGTGTVANSSLPGGGSGSLDGNQDQQPATDSQSSDDVAASLSANSVDSTI 
TIVPPEKLISSFPTTKLRSLTQKISNPRWVVPVLPEQELEVLLNAAIELTQAGVDHDCEPCVEFYRNGLSTSFAKILTDEAVNSWKNNIHHCILV 
SCGKLLHLIAIHMQRDNPYLLDLLAIVFDPENKFNTFNAGRQPECFAAPDYIWGQLDSNKMYARPPPEPKNARGWLVDLINRFGQLGGFDNLLER 
FN I GLELLKRNQNKCTGKN I S VEGRVENGAQDNRLTLAL I HSLLRPFGQC YELLMPAT I AKYFMPTWNVVLDLLDS FTDEELKREVKPEGRND YI 
NGIVKSARLLASRLTGQEELIRDLEMFRLKMILRLLQVSSFNGKMNALNEINKVLSSVAYFSHRSQPLPHCMPEDEMDWLTADPJy[AQWIKSSDVL 
GVVLKDSLHQPQYVEKLEKIIRFLIKEQALTLDDLDAVWRAQAGKHEAIVKNVHDLLAKLAWDFTPEQLDHLFEAFQASMTTANKRQRERLLELI 
RRLAEDDKNGVMAQKVLKLFWTLAHSQEVPPEVLDQALGAHVKILDYSCSQERDAQKTIWLDKCVDELKSGDGWVLPALRLIRDICCLYDTTTNH 
AQRTQTSTNRQQVIERLQNDYSLVILVTNSLTAYMEKVRQMVTDSPGLDATRILIDGRFPHHVQIAERLEFLKFLLKDGQLWLCADQAKQIWHCL 
AVNAVFPADREECFRWFGKLMGEEPDLDPGINKDFFENNILQLDPHLLTESGIKCFERFFKAVNSKEDKLKAIHRGYMLDNEDLIGKDYLWRVIT 
TGGEEIASKAIDLLKEVSTALGPRLQENIAEFHEMFIGECCSRLRTHYGNIVILGKTQLQEELDAPDQSDNTNDESKDSKMRFIEAEKMCRILKV 
LQEYVKECDRSFSGDRVHLPLSRVTRGKNTILYIRFQNPGRSIDDMEIVTHSNETMAAFKRNLLKRIKGTSTANIKVDLFYANDEMIGVSDEINP 
LYQYTIRDKMNLTAKLTPVGTGLASSPDSSSDSSTGSPPRPCPDMQRVESESTLPGVIISQNYQYTEFFLKLYQLGSDLEHGRLRDSAKVLLHLL 
PCDRQTIRQLKIMCKVPKAAVTVAVTGDKIAKDEEEKLYPTEQAGIEDEEEHCTPEQMFLHPTPAQVLYNLSVLHGLLIPALDPLGESALLVQSA 
WMHSGCAHFVLELLTKNNFLPSADMHTKRASFQCVLRLAKLFLYIVGSVLSRVGDEPMICDLDNGSRSQVDILKQNFSTMPSSSQGTLRAISAKL 
AVILAREMLSASPEGDRCRTLFSSTLQWSCPDISTIKAVVQLAWASSCGNLQALGNSSGDFEDEVIVPDGQDFSMCKEALEVLTISFILNPSANE 
ALTSDPNWPKFITSIVLKNPLRHVRQVASEQLFLASTYCAGDRRPFVYMVNLLVGALKTLVPQYESTCAEFFSVLCRTLSYGCIYNWPLQISEGL 
LGDEIKWLQRIRENVHATGDTQVHEELLEGHLCLAKELMFFLGADSKAQLNELIHELIDDFLFTASREFLHLRRHGSLRQDTVPPPVCRSPHTIA 
AACDLLIALCQLCVPNMKLLTNTLIDFVCTDTDPLREWDYLPPVGARPTKGFCGLKNAGATCYMNSVLQQLYMVPAVRVGILRAHGAATTDGEDF 
SGDSDLTGGGLGSALFSGPASALVSLPSSSSTIEDGLHDVRKNYHVVILKHVQAIFAHLGHSALQYYVPRGLWTHFKLLGEPVNLREQQDAVEFF 
MSLLESLDEGLKALGQPQLMKATLGGSFSDQKICQECPHRYSKEEPFSVFSVDIRNHSSLTESLEQYVKGELLEGADAYHCDKCDKKVVTVKRVC 
VKKLPPVLAIQLKRFEYDYERVCAIKFNDYFEFPRILDMEPYTVSGLAKLEGEWEVGDNCQTNVETTKYELTGIVVHSGQASGGHYFSYILSKN 
PANGKCQWYKFDDGEVTECKMHEDEEMKAECFGGEYMGETYDNNLKRMQYRRQKRWWNAYMLFYTRCDQTPVQYEPSVEQLSLAESRNMVLPLPK 
PIERSVRHQNIRFLHSRSIFSVEFFNFIKKLVSCNLLSARSNKITPAAEELSLLGVQLASQFLFHTGFRTKKSLRGPVMEWYDALSHHIRSSALV 
RKWFANHALLSPPSRLGEYILMAPSPDVRTVFVKLVVFFCHFAINDEPLTGYDGANLCEQVLISVLRLLKSEAADYGKHLPHYFSLFSMYVGLGT 
REKQQLLRLNVPLQFIQVALDDGPGPAIKYQYPEFSKLHQVVSHLIRCSDVSEKCQSSNQNARPLSNPFKDPNVAHEELTPLSTECMDLLFNRTG 
YIKKVIEDTNVGDEGLKLLQYCSWENPHFSRAVLTELLWQCGFAYCHDMRHHTDLLLNILLIDDSWQHHRIHNALNGVAEEREGLLETIQRAKTH 
YQKRAYQIIKCLTQLFHKSPIALQMLHTNSNITRHWSIAVEWLQGELDRQRGIGCQYNSYSWSPPAQSNDNTNGYMLERSQSAKNTWSMAFELCP 
DEVSEKTDENNEPNLETNMDENKSEPVAQPGGVLEGSTGGTEQLPENKTPTTSSPSTAAWPARGDSNAIPRLSRQLFGAYTSTGSGSTSGGSAPT 
SALTTTAGSGANSETESSAQETTGETTINGLTNSLDQMEITAKKKCRRVIIKKLVESKDEEDATTATTAATTEVTTSPATAIATAATLEPAGMSE 
LTTMVEKNLIISQENPQAKSSLQ* 
(SEQ ID NO: 174) 

Name: fat facets peptidase 
Classification: endopeptidase 
Gene Symbol: faf 
FlyBase ID: FBgn0005632 



Celera Sequence No. : 142000013384567 

TTTACGTGTTCTCATAAGATATAGTTTATATTAATGTATATTCATATTTTTACCCATTCATTATCTCTTGGCGCATTTAGAATTTAAGTTTTTTT 
TTTGCATTTTCCTTTTTTATTTTACGGTTTGGTATTACTTATAAACAAATTTCATATTATAGCTATAGTTTATTAGCTAGCGTTGTGTAACTTTC 
CTCCGCAGGGCCATTACTTACATTGTTAAATCGAAACGTTAATGCAATTGTTTTGTTGTGTATGTCTAACGCAATTTATATGCTCTCTATCTTAG 
TTACATGATTTTGTTGCGTAAACAAAGGGCTGCCCCAACTCCCAGCTCCCACATCTCGCATCCCATTTGCCCACCGCTCGGGAAATGCGCGCTTA 
AGTGTTTAAAAATCGGAAAGATATATATGAACACAGGCAGCAAAGTGCAGACCACGAAAGTATATATATATATATATATATATATATATATATAT 
ATATATATATATATATATATATATATATATATATATAAATATATATATATAGATAGGGGAGTAGGCGGTACAGAAGGTTACATAATATAACATAG 
GAGGGCCTTATAAGTCAGAGCGTTGCTTAAAGCTAGAGAACTGCACGTGGTAATCGCAATCATTTTACCATCAACTTAGGTTACAATGTGGGTGC 
CTTGCTGTGTGGCATATGCTTTTGTATGGATTGGATTCGCTAGAGCTAGAGCCTATAGTCTAGGGTTAGGGACACTCGCTGAACGCGGTGTCCTT 
TGTTTCTTTTACACAAATATTACACTTACAATCGAGTTATTATGTGTTCATGATCGGTTCTGCATGTTCTGCCTGATTTGGTCTGGGACTTCAAG 
TTAACAAAGAAAAAGTTGAGCCGGCCAAGAGCAAATAGTCAAATGCAAAACAGAATTAAATGCCTGCGTGTCCTGCTAACCCTCCTCCCAAACCC 
TTCCGCGGACCCCTAATCAACCGCCGATCTATTGGCCAATCTTCAGGCAGGGATCTCAAATCAATTGAACAGTGCGGTTTGTTGCAGTTTCTTGC 
AGCCAGGTTGCAATCGTTTATGTTCTAATAGTATAAATTAAATTAATCATAGGCTAGTTAATTAATAATAAGCGCGCTCAACGTTCTAACATCTG 
GGAGCAGAAACCAAAATGTCAAAAAACAAACGAAAATGTACACACAAAAAAAGAAAAGCGCTGCGCTTTGCTATGCTCTGCTCTTCTTCTCTCAC 
TGCTCTTGTCCTCCGTTTCTGGCAGGGACCTGGTCATCCTGCCGCTTGGATACATTCAACGTGTTTTACCGCATATATAAAGCAAGTATTATATA 
TATTAGGTATATATAACTATATATACTACATGTATAGTTTGTTGAGTTCTGTTGTTGTTTGTCTACTGTGTCGTGTCCAGCGTCGCAACCTAAAC 
GATAAAATATTATTTGGCGTTCCCCACGCCACCGCCCCCTCCGCCGCTCGGCATACCGCCTACGCCCACTCCACCGCCGCCACCACCGCCTCCCT 
GCCGCATCCGTATCGAAGGCCGACGCTCCGGCGTGTCGTACGGCGGTCTCGACGAGTTGGTGGCCGCCATCGAGTCGTCCATGTTGAGTCCGTTG 
CCGCCGATCAGTTGGCTGCCGTTACCGCCTCCATTGCCGCCCAGCATCGCCAGCGTTTGGCCGCCCTGGAGTTGCTGTTGCGCTGCACTGCCTCC 
TCCGCCACCCAGCGCCGCATTGACCTTGTGCTGCTGTTGTTGATGCGAGGAGGCGGCTGCCGCTGCTGCCACCGCCGCTGCTGCGGCCACCGCAT 
CATGACCCGCCTGGCCGTCCGCTCCGTCCTGCGCCTGCTGAATCGCCTGGGGATTGCGAGGTCCGCCAAACTTAAGCAGGTTCCAGTCAAACACA 
TAGTCGTAAGTGAAGCCCAAACGGTGGAACAAGTTGCGGAAGAGTTTGCGCAGGTGGCAGTAATCGGGACGCTGGTCGAAATGCATCTGGCGACA 
GAAGTTCAGATAGTTGACGAACTCGCTGGGGAAGCCCTTGCACAGCACCACAATCGAGGTGGACAGCTTCTTCTCCGAGATCCTCTCGTACTTTT 
GCCTCTTGTTGGCTGCCTTTAAGCCCTGCCAGGGCAAGGCGCCCAGATTGAAGTACATTAGGACGTAGCCCAGGGACTCCAGGTCGTCACGACGC 
GATTGCTCAATGCCCAAATGTGTGTTGATGGAGGCATAGCGGGCAGTGCCCGTGAGGTTCTTGTTTTCCCGATAGGGAATGTGCTTCAGCGACCG 
GGCATCGCGGAATTTCTTGGCCAGGCCAAAGTCAATGATGTACACCAGGTTGCCCTTTTTGCCAAGACCCATGAGGAAGTTATCCGGCTTGATGT 
CGCGATGGATGAAGTCCCGCGAGTGTATGTAATCGATGCGGGAGATCATCTGGTCCGCCAGCAGCAGAACCGTCTTCAACGAAAAGCGGCGTGAA 
CAAAAGTTGAAGAGGTCCTCCAGCGAGGGTCCGAGTAGCTCCATCACCATCACATTGTAGTCGCCCTCGCTGCCGCACCAGATTATGCGGGGTAT 
GCCTATGCCACCCTGCATCGTCTTGTAGAACTTTGACTCGATGTGCAGCTGGGGGTGTTTGGTGCGGATGCACTCCAGCTTGATGGCCACCTCCT 
CGCCAGTGTTGATCGTGGTGCCCAGGTAGATGTCGCCGAACGATCCCGATCCTATCTTGCGGCCCAGGCGATATTTGTTACCCACGCGCAGCTCC 
ATTTTGTTACGTCTGTTTCTGTGGGTTTCTGTGTGGTAGAGTTCGGACTCCTCGGGTTCCTCGGATTTGCGTTGCCTTTCTGCGGGACCGGAGAA 
TTGAAATTTAGTTTTAATTTTACAACACATTGCCTCGAAAAGAAGGCTGTACGCTTTTCTTACTCATAATACTACACGCTACACGAATTGATCGA 
TCCCAGTTCAGATTAGCTGCGCTTAAAATATTATACAGATTTAAATATCTACGCCAAGGACTAAAACTTTTACTTTGTAATCGCGAAAGGATATT 
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GAAAGGTTGCTACGATGCAGGCAAATTGTTGAATTCGATCAGAATGTAAATATTCGTAGTATTCTTAGTTGCTTTGACTAATCATAGAAATAGCA 

GCGGCGTCACAGGTTTTTACGAGTAAATATGAATCAAGTGACTTCGAAAATTCGGCTATTGACCACTAGCAACAATGTGTGATATTATTAGAAGA 

GATTATTTGCAAGCAACAACACAATATCGAACGCCTTGTTAGCGATAGTGATAAAATATTCGTCGGATTTTGTTTGCTACAACTTTTGATTCTTC 

TGACACTATATTGTGACACGTGATGACAACCCTGCTGTTTTGTACAGTTTGCTAAAAGTCTAATTTTGAGAGCTAACGGAGCTTCGTGAGTATTT 

TCCAACACTTCTCCACTTTTCTCACCTGGCACTGTCCTTTCGTCTGAGCTCCTTTCTTCTTCTTGTTCCGATTTGTTCAAGGCAGTCGCTTATGG 

CTTTGTATTCGAAGGGTTCTGTAATAGGAGGTACTAAGTTAATTTTTTTTTAGATCCTCAACATGGTTAATCGAAATTCGACCCGTCTCGCGATC 

ACTTATATATAGAGAGTAATCCCCTGTTTTTATCGCATCTGCAGCGTAAGCCCGAAAAACGTGCAGAATTCGCTTTTGTTTGTGGTGATAAGGAT 

TGTGGCACACAGACATGAAAACAACGCACGGCAACAATCGATATCGAAAATAGAAAACCGTTAGCTAAGAATAAAACAAGAGAGAGAGAGCGGGA 

GAGAAGTAAGCTTGGCTTACGTTGACAACACAAAAATAGCAGAGAAAAGCGTACCAAAAGCAATTCGAAATGCAAATGCACAGTGGGCAAAAATG 

ATAAGAACCCGAACCTCGCACAAAGTAAAAGTTTATCTCGTATATTTATATTTAAGAGTGAGTGCGTCATTGCATTCGGCTGTGTGTATTGATTA 

CGAATGGGGAAAACAACAGATTTAATAAGTGATGAGTAACGATAATTTATATGCACAGAGCAAAATAGCACTACTTTTTGACATTCGTCACATTA 

TTGTTTTAGTCATTTTAATATTATATTGTACTATTATAAAATAAAGCAAATGTGCCGTAGTGCTTTTACATCCTTGAATAGATATAATTCTTTTT 

T T C AAG T GT AT T T T T T AC G T GAC GAC AC C AG AAAC AT G G AGG T G AAT C AGC AAT C AG GAAAAG AC AAC G AAGAG AT T AAC AAC AAAT AG AT AAAA 

GAGACGTGTGTTTGTGTATGCGTGGAGTAAACAATCGGCGACAAAAGGGAGAGAAGTAAAATAAAACGCAGCGACTGTGAAAATGCCACTTAAAT 

GACACTTTCAGTGGTGGAAAGCACAAACAAATAGCGAAGATAAAGCGCATGGAAGTCAGAAAACGAAGCAACAAGCCGAAAGTTTTCACTTGCCG 

GTCAAATATCACAAAAATTTAGTTTAATTGTTTTCGGCGGCTCTCACTCACTAACCCCTTCCCACTCCTCTTTCCCACATTCTGTGCGCTTTCTT 

TTACCGCTCACGCATTTCGATCGCTCTCTCTCTTTCTCTCTCTCTCTCTCTCCTCCTTTTCCCGACTCTTTATCACTCACTCTGCGGTTCTAACC 

TCAAAACCTGCCTGTGTTTGTTGTTGTTTTCGCCTTGGCTTTGATTTATTTAACCGAAATGTACCGTTTTTTATTTGTATCTCCCATGTTCTACA 

CAGAAAGAAATCACTGTATAAGCACTGTAAGTCAAATTGAAATGTGGTATTAATTTTAGGAGCTTTAATTTGTTTACTGGAAGGCTAGCAGGGAA 

AACGCGGTCTTTCCTATTGGTAACTAAATATTTCCCAGGGTATCCAGCAGTTGTTTCGGACCAGCGGGTAAACACGAATTGATTACAATTGCAAA 

ACAGCTGCACGTTGCGTTGGCGGCTGCCGAAAAGCTTCCGCAAAAAAACGCACTGGAGAACAAAGAAGATACAAAAAACAGTTGTTCCAGCAGGC 

GCC AC AAAAGTAAAAAGCTTAAGT AAAC AGGGAAGAGCAAATGGAAAAACCAAAAACCAGAGCAAAACCAACGGGGAAACCTCAGAAGGGGAAGG 

AAACTTGCAATTAATTGTCAAACGCTAATGGCCTCCGAAAACTAACGGCACTCCACGAATGCACTGCGAATAACGGGGCTGATAAGGAGCAGCCG 

ATAGCGGGTGGAGAAGGGAAACTTGAACGCACAGTGGAACAAATTCCGCACTTACCGTGGCTTGTTTTTCTTATATTCCTTGTTTCCTATTCCGC 

TTTGCCCACTTCCTCTGCTGGCGACGCTGCTTTTCCTTCTTCTAATTCGACTGAAATCGAAATCGAATTTTTTAATTGGATTGATTCTGCCACAA 

TAATTTTCATTTTCTTTGTTTTAGAGCCCAGACGTTGTTTTTGTTGTAGTTGCCACAGTCGCACACAAACACACGCACACAGTCGCACTTTTTGG 

GGCCACACGCAAGCGCATTTTATGCAATAATTTTAATAATACTGCATTGCTGCCATATTTCAGTTATTTTTATCGTTTATCTTCGAGTCGGTTAT 

GCTGTTCTTCTCTGATTTTATGGCCACATTGATTAGCATCTTGTTGCCAATCAACTCGAGAAAATTAAACGAAACCAAACGGAACTCGGACTCTT 

ATTAAAAAAACCAAACTTATGCGAGTCGTTTGCCCTGGCAATAGATTCGTGCACACTAGTCGTCCCGCCCGCACTCGCACGGTTTCTACCGCGAG 

CACGTTGCTATCCCTCCTGCTTCAGCATTTTCACACCAACACAAAGACGAGGGCACTCACTCACACACTCGCACTCTCAATGAGCACTGAAATGC 

TGTTGGGAAATTCGAGTTTTCCTGCGCGTGGTAACCGCCGCTTTTTCGCTCGCAAACGGGTCTGTCGATCCGAAACGCTACTGACCACTTACGTT 

TTGCATCAACGCTGCGCTTTCGTTTCAATTTTTCTGGCTATTTCACATAGATTTCGACGATTTTTTACAAATTTACGCCATCTTGATATTCGCAG 

TGTGACCGCGCCGATAATGAAGAAAATTACCATCTGGTATTTACTCCTCAGCGAATGTCTAGAAGATATGTAATTGTGAAAAACTGTGATTCATA 

T AC T AAG T T AAG AT AGT C C AAAGT T GGT T TC G AAGT AC C AC AT T AAG AAAAT AGCGC AAG AGT AT T GGAAAAT C AAAAG AAAAAC GAAAT T AT AT 

CCTTGCAGCTAGCCATAAGTTCAAAGTTCGAGTAACTGGAAAGCTGTGAAATAGCCAAGTTGACAACGCTGTTTTTGTTATATGGGCACACTAGT 

TCTTACATAGACCCAGTGCTGCCAGATGGACGAAAAATATTTTTGAGCTGCGCCACCTAGCGGATGAGTTGGCGAAAGTGGTATCCAGGTGGCGC 

CCGAAACAAAAGGTGGCGCCTTATTTCAGAAATGCAATTGAAAATATCTAAAGTTATTGAAAGTTTTTGAATTATGCATTCAAAGGCCCATCTGC 

TTAATTTTATTTTCAAAGAGAATACAATAAACAATACAAAAAAAAAAAAATCATTCTGAAATGATATTTATACTAATATACAAATTTGATAACTC 

AATAATATGTAAATAAAATTTATGTCTTAATTTGCAATGTTCTTAAGAAAAGTTGTTAACTCTTTTTACAGGCTTGTGCTCCAAATTCTCGTGTA 

AAT AG AAAT T G AAAAT GG AT TTCATTGGCATGTCTTG C AGT T AT AGGT AG C AT GC C AAT T T T GAT GT T AC AC AG AT GT AT C GAAAT AC C T G AAAC 

GTATACTTTTATTGCAGCAATACGAATAAAGCAAAACACGTGTGAATTCCATTAGTCCGCCCAACCGCTTTCCCCATTTACCCTCACATCGTGCA 

TTTTCCGGCTGTGTAACCCAAGAAACGATGACATAAAAAAAGGAAGCAGATCCAGTTATGTATGCATATATGAGAAAATGTGTTTTTTTTTTTTT 

GGGAGTATGTGCCAAGGTGTGTTTCTTTTTAAGTGGGGGTAGGGTAGAT 

(SEQ ID NO: 175) 

Exon: 6079. .5983 
Exon: 5370.. 5281 
Exon: 3533.-3446 
Exon: 2834.. 1001 

Start ATG: 2757 (Reverse strand: CAT) 
Transcript No. : CT6302 

TGCGAATATCAAGATGGCGTAAATTTGTAAAAAATCGTCGAAATCTATGTGAAATAGCCAGAAAAATTGAAACGAAAGCGCAGCGTTGATGCAAA 
ACT C G AAT T AG AAG AAG GAAAAG C AGC GT C GC C AG CAG AGG AAGT GGGCAAAGCGG AAT AGG AAAC AAGG AAT AT AAGAAAAACAAGCC AC G AAC 
CCTTCGAATACAAAGCCATAAGCGACTGCCTTGAACAAATCGGAACAAGAAGAAGAAAGGAGCTCAGACGAAAGGACAGTGCCAGAAAGGCAACG 
CAAATCCGAGGAACCCGAGGAGTCCGAACTCTACCACACAGAAACCCACAGAAACAGACGTAACAAAATGGAGCTGCGCGTGGGTAACAAATATC 
GCCTGGGCCGCAAGATAGGATCGGGATCGTTCGGCGACATCTACCTGGGCACCACGATCAACACTGGCGAGGAGGTGGCCATCAAGCTGGAGTGC 
ATCCGCACCAAACACCCCCAGCTGCACATCGAGTCAAAGTTCTACAAGACGATGCAGGGTGGCATAGGCATACCCCGCATAATCTGGTGCGGCAG 
CGAGGGCGACTACAATGTGATGGTGATGGAGCTACTCGGACCCTCGCTGGAGGACCTCTTCAACTTTTGTTCACGCCGCTTTTCGTTGAAGACGG 
TTCTGCTGCTGGCGGACCAGATGATCTCCCGCATCGATTACATACACTCGCGGGACTTCATCCATCGCGACATCAAGCCGGATAACTTCCTCATG 
GGTCTTGGCAAAAAGGGCAACCTGGTGTACATCATTGACTTTGGCCTGGCCAAGAAATTCCGCGATGCCCGGTCGCTGAAGCACATTCCCTATCG 
GGAAAACAAGAACCTCACGGGCACTGCCCGCTATGCCTCCATCAACACACATTTGGGCATTGAGCAATCGCGTCGTGACGACCTGGAGTCCCTGG 
GCTACGTCCTAATGTACTTCAATCTGGGCGCCTTGCCCTGGCAGGGCTTAAAGGCAGCCAACAAGAGGCAAAAGTACGAGAGGATCTCGGAGAAG 
AAGCTGTCCACCTCGATTGTGGTGCTGTGCAAGGGCTTCCCCAGCGAGTTCGTCAACTATCTGAACTTCTGTCGCCAGATGCATTTCGACCAGCG 
TCCCGATTACTGCCACCTGCGCAAACTCTTCCGCAACTTGTTCCACCGTTTGGGCTTCACTTACGACTATGTGTTTGACTGGAACCTGCTTAAGT 
TTGGCGGACCTCGCAATCCCCAGGCGATTCAGCAGGCGCAGGACGGAGCGGACGGCCAGGCGGGTCATGATGCGGTGGCCGCAGCAGCGGCGGTG 
GCAGCAGCGGCAGCCGCCTCCTCGCATCAACAACAGCAGCACAAGGTCAATGCGGCGCTGGGTGGCGGAGGAGGCAGTGCAGCGCAACAGCAACT 
CCAGGGCGGCCAAACGCTGGCGATGCTGGGCGGCAATGGAGGCGGTAACGGCAGCCAACTGATCGGCGGCAACGGACTCAACATGGACGACTCGA 
TGGCGGCCACCAACTCGTCGAGACCGCCGTACGACACGCCGGAGCGTCGGCCTTCGATACGGATGCGGCAGGGAGGCGGTGGTGGCGGCGGTGGA 
GTGGGCGTAGGCGGTATGCCGAGCGGCGGAGGGGGCGGTGGCGTGGGGAACGCCAAATAATATTTTATCGTTTAGGTTGCGACGCTGGACACGAC 
ACAGTAGACAAACAACAACAGAACTCAACAAACTATACATGTAGTATATATAGTTATATATACCTAATATATATAATACTTGCTTTATATATGCG 
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GTAAAACACGTTGAATGTATCCAAGCGGCAGGATGACCAGGTCCCTGCCAGAAACGGAGGACAAGAGCAGTGAGAGAAGAAGAGCAGAGCATAGC 
AAAGCGCAGCGCTTTTCTTTTTTTGTGTGTACATTTTCGTTTGTTTTTTGACATTTTGGTTTCTGCTCCCAGATGTTAGAACGTTGAGCGCGCTT 
ATTATTAATTAACTAGCCTATGATTAATTTAATTTATACTATTAGAACATAAACGATTGCAACCTGGCTGCAAGAAACTGCAACAAACCGCACTG 
TTCAATTGATTTGAGATCC 
(SEQ ID NO: 176) 

Start ATG: 353 {Reverse strand: CAT) 

MELRVGNKYRLGRKIGSGSFGDIYLGTTINTGEEVAIKLECIRTKHPQLHIESKFYKTMQGGIGIPRIIWCGSEGDYN\MVMELLGPSLEDLFNF 
CSRRFSLKTVLLLADQMISRIDYIHSRDFIHRDIKPDNFLMGLGKKGNLVYIIDFGLAKKFRDARSLKHIPYRENKNLTGTARYASINTHLGIEQ 
SRRDDLESLGYVLMYFKLGALPWQGLKAANKRQKYERISEKKLSTSIVVLCKGFPSEFVNYLNFCRQMHFDQRPDYCHLRKLFRNLFHRLGFTYD 
YVFDWNLLKFGGPRNPQAIQQAQDGADGQAGHDAVAAAAAVAAAAAASSHQQQQHKVNAALGGGGGSAAQQQLQGGQTLAMLGGNGGGKGSQLIG 
GNGLNMDDSMAATNSSRPPYDTPERRPSIRMRQGGGGGGGGVGVGGMPSGGGGGGVGNAK* 
(SEQ ID NO: 177) 

Name: discs overgrown 
Classification: protein__kinase 
Gene Symbol: dco 
FlyBase ID: FBgn0002413 



Celera Sequence No. : 142000013384566 

TAAAAGTGAATACAATTTTTTGCAGTGCAGTTAGCGTCTCGCTTTACTTCCTCTTTATGGTTGGCTTTTTGTGTTTTCCAATCAGATTAGCATGG 

TAAGGGGTTAAGAGTACGGGGTTACCACCGCTGGGAGGGGACCACATGAAAGGGTCAATAATGGGACTCGAGACGAAGATGCTGACATTTATCAA 

ACGGAATATGCGATGCGTACAACCATTAGACACTGCCCACTGATGGACGTCGGACGATGGACAGGCGAGACGTGATCCCAGATACGGCTCATCAA 

GT GC AT T GT C GT GGG AC AAT AAAGGGT T7AAT C GG C T AAC AGAG AG AT T G C AAT AT TAT C T T CT G AGC AAAT T GG AGT T GGG AAG AC AGC AAAAAT 

TAGCGAAAGTAGATCCAAATATTAAGTGTATCAATTGTAAGGCGTATCACTTGATTCCGCACTCTACTCAACTCTCCAAGGACCTCTGTAATCTG 

ATAGGCTTACT G7AAC C AAC AG AG AGT AG AAAC AAAGT T TTGCTTTATTGT GAT T T T AC G AAAT AAC C T AC AAC T T C AC CAT T CC C AAAAA C AC AA 

TTTCTTTTTCCTTAAACATTCCAGTTTTAAACACCTTTTTTACAGTTTTTTCGTTGTAAGTTGGCCAAAATATGTCACAAGGCTGAAAAACGCAC 

TTTTTTAGTCAGCTGGGGTCCTCAGCTAAAGACCCATATCAATATTTGCTTTAATTTGGAAAAATAATTTTTCGGCCAAATTTCGCATTTTTTGT 

AAGGGGTAACATCAT7AAAAAATCGAAAAAATTTTCAAACCAAAAATTTCTCTAAATGAGTTACCCGTCGTTTAGGAATATTTTTACAAGTTC7AAC 

GAGGTATTCCACTCCACAATTGGACGTGTCTTTCCCAAGATATAGCACTTTTTCTGTGAAATCCCCATCGAAAATTATCAACAAGAATAAAATCA 

CCCTCGTTTTCCATCTGAAAAAACTCCCTCAAAAACAAACGCATATACAGTTTATTTCGTTACATATATATTGTTATTTATTTAAGAACTTGCTC 

GT CT AAT T GAC AG AAAAAT AT AAAG AT AT AGG AT GT AT T GAATGGGT T GT G C T GAGT GT C T T AC T AT GC G AAT G AGC GAT T CC AC T T AAGC C T AG 

GC G7AAAT C T C GGC G AT T GC AT AAT GTG GAG CC C C C T C C C G AGGT GT AC GG AAATC AC AAGT GGT T G AT AAG7AAAGAAC AG ACG T GG AAT T T T GC T 

GTGCTGGTTTGAGGATCGCAGTCGAACGGATAATTTATTGGCCAGTCGCCGACTTGGCTGGTGATGATTTAATAGCGGACCCTCGGGAAGGCTCT 

CTGGGCTGACAGATAACAGATAACTCGACTGGCTGATAATTTTGATGCGTGACTGGGCGCTGCGCCTTCTCTATCTCCACCTCCTTACAGGGTCT 

TGTCGAACAGGAACTCGCCCAGTTCGCCGTTGGTGTCCATCATCTTCTTGAGAGTGGTCAGCTTGCCGGCGAGCTCGCGCTGTCCGTGGAGCTGC 

TCCTCCAGATAGACACCGGTCAGGTAGTCCACCAGGTGGTAGTGGTTGTAGGGCTTGTTCTCGCAGGTCTGGATCAGCTTGCGGATGGACTTGGT 

CACCTTGATCTCCAGGTCGAGGGCGTCGGACAGGGCGGCGGCACCATCGGTCCACTCCTGCTTGGCCACAGTCTGTGGGGGATTCGCCAGGATTA 

GCTGGGCTTTTGGGTTTTCTAGGCGAGTCGTCGACTTACCGGCACATTGATCAGATCGCTGACTCCCTCGGTCAGTTGACCGCGCATGGACAGGT 

ACTCCACCAGCTTGGATCCGTGCTCACGTTCCTCCTTGGCGGCCTTGAAGAAGTGCTCGGCGAATCCAGGGCGGTTGACGGTGTCGCGGGAGAAG 

TAGGCGCCCATGGCCAAGTACTGGTAGGAGGCGTTGATCTCCTCCTGGATCTGATTGCGCATGCCCTTTATGCAGGCATCCTTCATGTCCACCCA 

GTCCTTGGTAATCTCAGGAACAGCCAGGGAGCCTGCAATGGGATAAGGTTGGGTTAAGTCGGGGGACCACGATAACAATATATGGCTAGTATTCA 

GCTTTAAAAAATTTATTTTTGTAATCAGAACTTAGGTAAATTTTAATATTTTAAAAACCTTTGCACTCTTGGTCTTGACCTTCATGGCTTAGGTT 

TTGTTTTTCTTTTAGTTGATGCTCTTCTCGCTCCCTTTGAAGCTCTCTCACGAAACTCTTGCTTTCTTGCTTGGCTGAGGGGTAGGGCGTTGTTG 

GTTATTGCATTTAAGAGGGCGTAATTGCATAAACCTTTGAAATTGCGCTTGAAATGCCGCTATTCTAGGGCACAAAAACCGTGTGCAAAACTGTA 

ATGGCCTACTCTCCCACTCTCTCTCTCTGTTTTTGTAACTGCATTGGTAACGTGCGCAGCTCTCGCCGCCATAGTAACGGCATTTGTTTTCGTGT 

AGTGCCGTGTGCAAAAAGCATGGTTCACCATGCTAGCTCACTCGGCCCGTAAAAGCCGTGTGCAAAGAGCGCCTTGAAATGTAACAACAAAAAGT 

CAAGCTTATTTATACCGCAATATTCATGGAAATGAAAGCGAAATTCGCATTATTTAATTATGAAAATTTATGTATAAAAACTGGTTATCATTTCC 

CCAAAGCGCAACGGAAGTATCAATAACTTTTAAATCAACTGGTTTGGTAACTTCACTTCATTGACGAAAGAGAGTGAAAGAGCGTCAAAGAGAAA 

CGGCGCATGCTCTTTTCAACGCATTACAAATGTTTTGCTTCTCTTTGCGTATTGCGTCGAAATTTATTGTTTTTTTTCCGATGACCTGCCTTTGT 

TTTGTTTGAAATTCTGATGAATAAAACGTAACATTAGTACTTACACTTGAAATCTCCATAGGCCTGGGCCACCACGGCCAACAGGAGCAGGCTAG 

CAATTAGTTTCACCATCTTTGATCGTCGAACGTAGTCTTTACACAAAATCACAGAACAAAAATTTGTTTTTCACTTTAGTTTGATTTGCTTGAAA 

TGCACAAATGAGAATGAGGCGACTCGAGTTGAGCTACAATGTTAAATAGGCGTTGAAATTGAAAATCTAGCTGCCTTTACACACACTGGCGCAGA 

AGGCGTCGCCTGCTTCAATTTGATGGGCTTGATTGGGATTATTTTCGACTTCTCGAATGCTTTCCAGTACCTTCTGCTGACAGCTTTAATCTGAT 

TTTTCAGTGGAGCTAAGGCCTTTCTACGTTGTGTTTTAGCGAACTCGAAAGCGAACTGTCGCTCCGAACGGCGCACAAAACACTTTTAAGCGCGC 

GCGCTCTCGGTTTTTCTTCGTGTGTGTGTGAGTGATGGGAGAGTGTGCACTGTTATCTTTTCAACATGTCGCCCGTTCTTTGTTAACCGTTTAAA 

TGGCTGATAATTGAATATTCCCGATAATTGCCGCACTCAATGTGATACACACGTCCTCAATATGGGTATACTTTGTATAAAGTCGGGTTTCCTAG 

TCAGATAATCCCCACTTACTAAGAAGTAAATTTATCAAATGCAATATGGCCGCTCTTTTGAGATGCGCATCAGCTGTTCTTTGTTTTTGTTTTGG 

CCGTGGGAGAGCACCACGTCATGTGTTGCCAGGCGGTTTAATTATTGCTGACAATTTAATTCACATTTTGTCAAGTTGATTTGCACGTTCGAAGG 

GGGGATCAGTCACCCACTTTGATAACTGCTAATATGGCGACGGCTCTTTTTTACGTGTCGCTTGCAGAGCTCCAAGAATGACACTTGATAAGAAG 

GATACATGGTGGGAAAACTTACAAATTACATAATTTGTTTGTTGATTGGGGAAACCCTACATTATCTCCGAACCCAGCAATAATATGTATTGTAA 

ATAGCAACAGTCTCAAGCTCGATGCTGCTGTGAATTACCAATATATCAAAACTTAAAACATTATCTTAGGAGGTAGATGGTTTATCTGCAGTTTA 

TTTTCAATTCTCATCGTTAGCTGATAAAACTCTCAGGAAACCACTTGAAATCCAAAGTGCAGCGTCCATATTTTGCACACGCTGACAACCCTGTC 

GTGAGTCACCTTCCAGCAGGCGCAGCAACAACAGCACTGGCAGCGGCCATATTTGAATGACACCGAAATCCCGCACTGAGCGCAAAACAACACAT 

GCCCATGCACGCTCTCCAAAACCACACACACACAGCCATATCGATCGCATAGAAAGAGACGTACGCGACGCTATAAACAGCAGTTGATTTTCAGA 

GTTTCACCAAGAAGCACATAAAGCGATTGAATTATTAATAAAAGCACAGGCACTGGAAGTGAGTTTTAATATTACTAACTAACTACTTTGTGAGT 

GTTTAGTTACGAAAACTAGAAAATAATATTCTGGAAACTGTGTTTTTAAACCGACCACCATATAAATTTAAAAACCAAAACATTCCATTGGTGGT 

GGTTTTGTTGTTAAAAAAGAGCACAGAAAATTCAAGAAAACGCTACGACGCAAAACGAAAAAACTCGCCCAAAATGAAAACAAAAGTGTCT 

(SEQ ID NO: 178) 
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Exon: 3556.-2895 
Exon: 2027.. 1750 
Exon: 1687.. 1001 

Start ATG: 2961 (Reverse strand: CAT) 
Transcript No. : CT6417 

CATTTGATAAATTTACTTCTTAGTAAGTGGGGATTATCTGACTAGGAAACCCGACTTTATACAAAGTATACCCATATTGAGGACGTGTGTATCAC 
ATTGAGTGCGGCAATTATCGGGAATATTCAATTATCAGCCATTTAAACGGTTAACAAAGAACGGGCGACATGTTGAAAAGATAACAGTGCACACT 
CTCCCATCACTCACACACACACGAAGAAAAACCGAGAGCGCGCGCGCTTAAAAGTGTTTTGTGCGCCGTTCGGAGCGACAGTTCGCTTTCGAGTT 
CGCTAAAACACAACGTAGAAAGGCCTTAGCTCCACTGAAAAATCAGATTAAAGCTGTCAGCAGAAGGTACTGGAAAGCATTCGAGAAGTCGAAAA 
TAATCCCAATCAAGCCCATCAAATTGAAGCAGGCGACGCCTTCTGCGCCAGTGTGTGTAAAGGCAGCTAGATTTTCAATTTCAACGCCTATTTAA 
CATTGTAGCTCAACTCGAGTCGCCTCATTCTCATTTGTGCATTTCAAGCAAATCAAACTAAAGTGAAAAACAAATTTTTGTTCTGTGATTTTGTG 
TAAAGACTACGTTCGACGATCAAAGATGGTGAAACTAATTGCTAGCCTGCTCCTGTTGGCCGTGGTGGCCCAGGCCTATGGAGATTTCAAGTGCT 
CCCTGGCTGTTCCTGAGATTACCAAGGACTGGGTGGACATGAAGGATGCCTGCATAAAGGGCATGCGCAATCAGATCCAGGAGGAGATCAACGCC 
TCCTACCAGTACTTGGCCATGGGCGCCTACTTCTCCCGCGACACCGTCAACCGCCCTGGATTCGCCGAGCACTTCTTCAAGGCCGCCAAGGAGGA 
ACGTGAGCACGGATCCAAGCTGGTGGAGTACCTGTCCATGCGCGGTCAACTGACCGAGGGAGTCAGCGATCTGATCAATGTGCCGACTGTGGCCA 
AGCAGGAGTGGACCGATGGTGCCGCCGCCCTGTCCGACGCCCTCGACCTGGAGATCAAGGTGACCAAGTCCATCCGCAAGCTGATCCAGACCTGC 
GAGAACAAGCCCTACAACCACTACCACCTGGTGGACTACCTGACCGGTGTCTATCTGGAGGAGCAGCTCCACGGACAGCGCGAGCTCGCCGGCAA 

GCTGACCACTCT C AAG AAG AT G ATGG AC AC C AAC GGC GAAC T G GGC G AGT TCCTGTTC G AC AAG AC C C T GT AAG GAGGT G GAG AT AG AG AAGG C G 
CAGCGCCCAGTCACGCATCAAAATTATCAGCCAGTCGAGTTATCTGTTATCTGTCAGCCCAGAGAGCCTTCCCGAGGGTCCGCTATTAAATCATC 
ACCAGCCAAGTCGGCGACTGGCCAATAAATTATCCGTTCGACTGCGATCCTCAAACCAGCACAGCAAAATTCCACGTCTGTTCTTTCTTATCAAC 
CACTTGTGATTTCCGTACACCTCGGGAGGGGGCTCCACATTATGCAATCGCCGAGATTTCGCCTAGGCTTAAGTGGAATCGCTCATTCGCATAGT 
AAGACACTCAGCACAACCCATTCAATACATCCTATATCTTTATATTTTTCTGTCAATTAGACGAGCAAGTTCTTAAATAAATAACAATATATATG 

T AAC G AAAT AAA 
(SEQ ID NO: 179) 

Start ATG: 596 (Reverse strand: CAT) 

MVKLIASLLLLAVVAQAYGDFKCSLAVPEITKDWVDMKDACIKGMRNQIQEEINASYQYLAMGAYFSRDTVNRPGFAEHFFKAAKEEREHGSKLV 
EYLSMRGQLTEGVSDLINVPTVAKQEWTDGAAALSDALDLEIKVTKSIRKLIQTCENKPYNHYHLVDYLTGVYLEEQLHGQRELAGKLTTLKKMM 

DTNGELGEFLFDKTL* 
(SEQ ID NO: 180) 

Name: Ferritin 1 heavy chain homologue, FER1HCH 
Classification: ligand_binding_or_carrier 
Gene Symbol: FerlHCH 
FlyBase ID: FBgnO015222 



Celera Sequence No. : 142000013384573 

TACCCGCAATTTAACTGCTCTAAGGTAAAAGAAAAATTTGTAAAGAAAAAAGAAAAATTTGTAAATTTGTAAATTTAATGTATTGCAACATATAT 
AATAAACACGGTATGTGCCACACTGCGAAATATCAATGATTTTGACTTTGACTGACCACCGTATACGGACACACCCACAAGATCAGACCCACAAA 
GACTCACCCACAAAGGCCTTCAGCTCGTAGTCTACTCCGCAAGACTTGCCCACATCTCCAGGAGCCGGCTGCAAGGACACAGAGGCTGGGCAATA 
GGGCGGAACCTCAAAATAAAACGGATGGGCGTTCGGCCCCAGCTTTTTAATTAACCTTTCCTGAAGCCGGGTCATCGGGCGGTCCAGCTGCATGG 
GCGGATAGATCTGCTCATGCGCCAGATACAGGTCCTTACGGAACGTTAATCCCAAAACATCTAGGTCTTCACGTCCGTACCGAAAGGCGGCAAGC 
ACCTGGCCAAAAACCTTACGGTCCTTTACGTACTCTGGATCAATGAAGACAACACCGTCAATGGGATCCACGTGCGTGACATGGTCGACGAAGTC 
CCGTTTGCCAAGGTAAACTGTTATCTTGCCGTTGGAGGAGCTCTTCTTAAACACCCGAGTCGCCTGGCGTCTGGAGCTGGCATCTCCACCAGTCT 
CGTCGCCTGCGCTTCCGCCCCCAGTGCTCCCGGCAGCTGTTACATTGGTGCTGGAGCCATTGCTCACATTTGCTCCTCCACTTCCTCCACCACCA 
CCGTTCATTATGTTATTTGGCGGTTATGCTTGTTATTCTGCTTTTCTGCTGTTCCTGCTTAATTGGAACTCCGTTAAATAAATATATGTGTGCGT 
TGCGGGACTTGTGCTGCAATTTAATTTTTTATGTGTATACTTGTTCCGCCATTTACCAGCACTGCTTGACAGTCATTGGAAAAATACCAGGGAAT 
ATTACATACTAACAAATTATGTACTTCTCTGGTATTTCAATTCAGTGCGAACAAACCGATATATGTTTGGTCGTACCATAATCGCAGCCTATAAT 
TAATATTTATTTTGCATTTTTGGCACCTATTTATATTTCCTAAGTGTGCCCTGGCCCATAGTAAGTGACTTTTAATAATTTTTCTTTCAATACCA 
TTAACAATTGCTTGAATTAACATTAAGTTTCCTACTAAGATAAAATGCAACTGCAGGTTGACTTGGCTTTTTAATGAAATATAATATATATATTA 
TAAATTCCTTTAGAAACAGATGATTCATAGCTGATGATTGAGGAAACTTACCATTTGGAATGTATAAACACATGTAAAATCAGACCGTTGGTAAA 
TATCTAATCATAGATTGTATCCTAAGTCTTAAATAAATACTTTATATAAACTGTTTAATATTTATTTTATTTTTTAAAGGCTTTTATTATTTTTA 
TATGTATACATGTGCATGAACGGACTTTAAACCATATTTAGTAGTGGAATAGAGCTATAAAAATGAGATTGAAGTCTCTAAAATACTTTATTTAA 
TCAGCCTTAGCACCGATAGAGCACAACGGTCACACTGTTCTGTCCCTCTTTTCCACGTGCATTTTATTGAACGAAACCACGCGTTGTTCAGTCCA 
AACTTGACTTTGCTAATTTTGCCCAGAAAACCTCTAAAAGCTGTCAAAATGGCCCAAAAGAAAGCCGTCACCGTCAAAGGAAAAAAAGCTACGAA 
TGGAGAGGAAAAGCCTTTGGCCAAACGGGTAACCAAGTCCACAAAGGTACAGGAAGAGGAGACAGTAGTCCCCCAATCGCCTTCCAAAAAATCCA 
GAAAACAGCCTGTTAAAGAAGTGCCGCAGTTCAGTGAAGAGGACGAGTCTGATGTGGAGGAGCAAAATGACGAGCAGCCAGGGGACGACAGCGAC 
TTTGAGGTAAGCTATAAAAAAAACGTGGTCTTCTGAATGGATATGGCTAACATTACATGCATAGTATGCACGTATATATAGATCTGCATGGATAA 
GAATCCATTTTACGTTTTCTTTGCCACGTGGTTGTAAAGATTTTATAATTATAAGACGAGTGCACATAACCCACGATCCTCTGGTTTTATACCGA 
CTCGGTTCATATATCGATAAAAAAAAAATTGTCTTTATTGGTGCTCTCCCATCAGCGGTTATAAGAAAATTGTCTACTTTTCAGACCGAAGAGGC 
AGCGGGCTTAATTGACGACGAGGCTGAGGAAGATGAAGAGTACAATAGTGACGATGAGGAAGATGATGATGACGATGAGTTGGAGCCTGGTGAGG 
TTTCCAAAAGTGAAGGCGCGGATGAGGTAGATGAATCCGATGATGATGAAGAAGCGCCTGTAGAGAAGCCAGTTTCCAAGAAATCAGAAAAAGCC 
AACTCTGAAAAGTCCGAAGAAAATAGAGGAATTCCAAAAGTAAAGGTTGGCAAGATTCCGCTGGGAACGCCAAAAAACCAGATCGTTTTTGTCAC 
AAACCTACCAAACGGTAAAGTAACTGGTTTATTATTTCATTTTTCAAGTATATTTAAACTTAGTATATTTGTTCCACATAGAGTATCTCCACAAG 
GACTTGGTCGCATTGTTCGCCAAGTTCGGACGACTTTCCGCTTTGCAGCGTTTCACTAATTTGAATGGCAATAAATCAGTTCTCATTGCCTTTGA 
TACGTCAACTGGAGCAGAAGCTGTTCTGCAGGCTAAGCCTAAAGCGTTAACCCTTGGAGACAATGTTTTGTCTGTGTCCCAACCGAGAAATAAGG 
AAGAGAACAACGAACGCACGGTGGTGGTTGGCCTGATTGGACCTAATATTACCAAGGACGACTTAAAAACCTTTTTTGAGAAGGTAGCTCCCGTG 
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GAGGCGGTGACTATATCCAGTZVACCGCCTCATGCCCAGAGCTTTTGTACGTTTGGCGTCGGTCGATGACATACCGAAGGCTCTTAAACTGCATAG 

CACTGAGCTGTTTTCTCGCTTTATAACGGTTCGTCGGATCTCACAAGAATCGATTTCACGTACCAGTGAACTTACCTTGGTTGTCGAAAATGTGG 

GCAAACACGAGTCTTACAGCTCCGACGCTCTAGAAAAGATATTCAAAAAATTTGGTGATGTTGAAGAAATCGACGTTGTGTGCAGCAAGGCAGTT 

TTAGCTTTTGTCACGTTCAAGCAGTCGGATGCCGCCACGAAGGCTTTAGCACAACTCGACGGAAAGACTGTTAATAAGTTTGAGTGGAAGCTTCA 

CCGATTCGAACGCAGCACCTCGGGGAGGGCCATTCTTGTGACGAACTTGACTTCAGGTGGGTTACCCGTTTCGTAAAATTGTTTAAAATTATTAA 

TCATTATCATGTAAAATTTCGTTGTAGATGCTACTGAAGCCGACCTGCGGAAAGTGTTTAACGATAGCGGCGAAATCGAAAGCATCATAATGTTG 

GGCCAAAAAGCGGTAGTAAAGTTCAAGGACGATGAGGGTTTCTGCAAATCTTTCTTGGCTAATGAAAGCATTGTAAACAACGCCCCTATATTTAT 

TGAGCCAAACTCTCTGCTGAAGCATAGATTGTTAAAGAAACGCTTAGCTATTGGTCAGACCCGCGCCCCGCGGAAGTTCCAAAAGGACACTAAGC 

CAAACTTTGGTAAAAAACCATTTAACAAGCGCCCGGCACAAGAGAATGGTGGTAAATCGTTTGTTAAAAGGGCAAGATTTTAGAACTTGAACACT 

ACAAGAATTAGCTTTAAGTTGCGTAGTACCATAAATACGGCCACAGCCCCGATAATTCCATACAACTTGATGTTATGGATTTAGTTGATAAGAGA 

TACTCCATTAAATTTTTACCGGAGAAATGCTAGCCGTATTCAAGCAGATGTCGATTTTTATAAGCATCGGACATGGAACTAAATAAAAGTAAATA 

AAAACCCCGATAAGAATTTTGTATTCAACGTCTTGGTTTTGGGTAACTAGATACTTCAAAAAACGATTTACTGCTTATATGAACGTATACAGGTG 

CGATCTCCAGTATATTCCGTTTTTTTTTTTGTGTTTGAATTTTATCTCGCTTCGGTACTCTCGTATATAGGTTCCTATCCCTTCCAAATGGCCCA 

ATGTACAGCTGCGCTTGTTCAAAAAGCGAGAACTATATGAATTTCCTTTGTATAATTAATTTATTAGTAATACATTTTTGTAGTATTTTATAATT 

TTTTGAATTTTTTTGTTCATACTCAGAAATCTAATATGGTTGGCTAACATATGTATAAACTTGTAAATGTTTGTATGTATGTTTATATTTTTTGT 

AAATGCCATTTAGATTAGTACTTTAAATTTCAGTAATTGTTGGTTCAAAAAATTAGTTTATCCTTTTTATGTGTAAATATGTATAAAAAATCATC 

ACTTGGATAAAAGCAACGTATAGGGAGTTTAGCCAACCTAAAAAAAACCCTTTCGATTAGGAATTTCCCAATCACTCTGTTTATATATAGTTCGT 

TCAAGCTACAAAAAACTGTGCTCGTGACGCGTTTGGCTTACTGACAAGTTTTTGTGTGTTATATGTAAGTCTCTGTCTCATGTTGCTTTTGACCT 

TGTTTTATAAGAAATATAATATCGCTATCGTAATATAGCTTAGGCAGAAAATCACTAATAAGGCAAATTAAAGTGAAGGGCACTGTGTGGGACTT 

C T GAAAAT AC AT AAT GC AAAC AAGC T T AC T AT T T AAAC T T GAC AAC T TAT AT AT C T C AG AGT T C GT T T AAG C C G T GC AT AAC T C AT AT ACGT T GT 

ATAGGCAAATTATAAGCCCTAGAATATTACTTTTTTTAGATATACTTTCACGAGAGCTTTTTCTTTCATGACGAATTGAAGTAGGTGACACTATC 

TTTCACAAAGCTTATAAAATTATGCCTACACAAAGGATAAAAGAATACAAAAATAAAA 

(SEQ ID NO: 181) 

Exon: 1001.. 1105 

Exon: 1249.. 1325 

Exon: 1410.. 1906 

Exon: 2175. .2484 

Exon: 2552. .3286 

Exon: 3353.. 3903 

Start ATG: 1664 

Transcrxpt No. : CT6639 

ACAAACCGATATATGTTTGGTCGTACCATAATCGCAGCCTATAATTAATATTTATTTTGCATTTTTGGCACCTATTTATATTTCCTAAGTGTGCC 
C T GGC C C AT AAAAC AGAT GAT T C AT AGC T GAT GAT T G AGG AAACT T AC C AT T T GG AAT G TAT AAAC AC AT GT AAAAT C AG A CC G TTGGCTTTTAT 
T AT TT T T AT AT GT AT AC AT GT G CAT GAACGG ACT T T AAAC CAT AT T T AGT AGT GG AAT AG AGC T AT AAAAAT GAG AT T GAAGT C T C T AAAAT ACT 
TTATTTAATCAGCCTTAGCACCGATAGAGCACAACGGTCACACTGTTCTGTCCCTCTTTTCCACGTGCATTTTATTGAACGAAACCACGCGTTGT 
TCAGTCCAAACTTGACTTTGCTAATTTTGCCCAGAAAACCTCTAAAAGCTGTC AAAAT GGCCCAAAAGAAAGCCGTCACCGTCAAAGGAAAAAAA 
GCTACGAATGGAGAGGAAAAGCCTTTGGCCAAACGGGTAACCAAGTCCACAAAGGTACAGGAAGAGGAGACAGTAGTCCCCCAATCGCCTTCCAA 
AAAATCCAGAAAACAGCCTGTTAAAGAAGTGCCGCAGTTCAGTGAAGAGGACGAGTCTGATGTGGAGGAGCAAAATGACGAGCAGCCAGGGGACG 
ACAGCGACTTTGAGACCGAAGAGGCAGCGGGCTTAATTGACGACGAGGCTGAGGAAGATGAAGAGTACAATAGTGACGATGAGGAAGATGATGAT 
GACGATGAGTTGGAGCCTGGTGAGGTTTCCAAAAGTGAAGGCGCGGATGAGGTAGATGAATCCGATGATGATGAAGAAGCGCCTGTAGAGAAGCC 
AGTTTCCAAGAAATCAGAAAAAGCCAACTCTGAAAAGTCCGAAGAAAATAGAGGAATTCCAAAAGTAAAGGTTGGCAAGATTCCGCTGGGAACGC 
CAAAAAACCAGATCGTTTTTGTCACAAACCTACCAAACGAGTATCTCCACAAGGACTTGGTCGCATTGTTCGCCAAGTTCGGACGACTTTCCGCT 
TTGCAGCGTTTCACTAATTTGAATGGCAATAAATCAGTTCTCATTGCCTTTGATACGTCAACTGGAGCAGAAGCTGTTCTGCAGGCTAAGCCTAA 
AGCGTTAACCCTTGGAGACAATGTTTTGTCTGTGTCCCAACCGAGAAATAAGGAAGAGAACAACGAACGCACGGTGGTGGTTGGCCTGATTGGAC 
CTAATATTACCAAGGACGACTTAAAAACCTTTTTTGAGAAGGTAGCTCCCGTGGAGGCGGTGACTATATCCAGTAACCGCCTCATGCCCAGAGCT 
TTTGTACGTTTGGCGTCGGTCGATGACATACCGAAGGCTCTTAAACTGCATAGCACTGAGCTGTTTTCTCGCTTTATAACGGTTCGTCGGATCTC 
ACAAGAATCGATTTCACGTACCAGTGAACTTACCTTGGTTGTCGAAAATGTGGGCAAACACGAGTCTTACAGCTCCGACGCTCTAGAAAAGATAT 
TCAAAAAATTTGGTGATGTTGAAGAAATCGACGTTGTGTGCAGCAAGGCAGTTTTAGCTTTTGTCACGTTCAAGCAGTCGGATGCCGCCACGAAG 
GCTTTAGCACAACTCGACGGAAAGACTGTTAATAAGTTTGAGTGGAAGCTTCACCGATTCGAACGCAGCACCTCGGGGAGGGCCATTCTTGTGAC 
GAACTTGACTTCAGATGCTACTGAAGCCGACCTGCGGAAAGTGTTTAACGATAGCGGCGAAATCGAAAGCATCATAATGTTGGGCCAAAAAGCGG 
TAGTAAAGTTCAAGGACGATGAGGGTTTCTGCAAATCTTTCTTGGCTAATGAAAGCATTGTAAACAACGCCCCTATATTTATTGAGCCAAACTCT 
CTGCTGAAGCATAGATTGTTAAAGAAACGCTTAGCTATTGGTCAGACCCGCGCCCCGCGGAAGTTCCAAAAGGACACTAAGCCAAACTTTGGTAA 
AAAACCATTTAACAAGCGCCCGGCACAAGAGAATGGTGGTAAATCGTTTGTTAAAAGGGCAAGATTTTAGAACTTGAACACTACAAGAATTAGCT 
TTAAGTTGCGTAGTACCATAAATACGGCCACAGCCCCGATAATTCCATACAACTTGATGTTATGGATTTAGTTGATAAGAGATACTCCATTAAAT 
TTTTACCGGAGAAATGCTAGCCGTATTCAAGCAGATGTCGATTTTTATAAGCATCGGACATGGAACTAAATAAAAGTAAATAAAAACCCC 
(SEQ ID NO: 182) 

Start ATG: 437 

MAQKKAVTVKGKKATNGEEKPLAKRVTKSTKVQEEETVVPQSPSKKSRKQPVKEVPQFSEEDESDVEEQNDEQPGDDSDFETEEAAGLIDDEAEE 

DEEYNSDDEEDDDDDELEPGEVSKSEGADEVDESDDDEEAPVEKPVSKKSEKAWSEKSEENRGIPKVKVGKIPLGTPKNQIVFVTNLPNEYLHKD 

LVALFAKFGRLSALQRFTNLNGNKSVLIAFDTSTGAEAVLQAKPKALTLGDNVLSVSQPRNKEENNERTVVVGLIGPNITKDDLKTFFEKVAPVE 

AVTISSNRLMPRAFVRLASVDDIPKALKLHSTELFSRFITVRRISQESISRTSELTLVVENVGKHESYSSDALEKIFKKFGDVEEIDVVCSKAVL 

AFVTFKQSDAATKALAQLDGKTVNKFEWKLHRFERSTSGRAILVTNLTSDATEADLRKVFNDSGEIESIIMLGQKAVVKFKDDEGFCKSFLANES 

IVNNAPIFIEPNSLLKHRLLKKRLAIGQTRAPRKFQKDTKPNFGKKPFNKRPAQENGGKSFVKRARF* 

(SEQ ID NO: 183) 

Name: RRM-type DNA binding protein 
Classification: DNA_binding 
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Gene Symbol : mod 
FlyBase ID: FBgn0002780 



Celera Sequence No. : 142000013384692 

AATGGTGCAAGAACCTCTTGAGAAATAACTAGCAAGTTTGAGGGTCGCATTAGCTAGCCCTACAGTGAAGAAATCCAATACATATACAATGATTC 
ATTGTGAAAAACGGAAATCTGACAATCTCAAAAAATAAAATAATATTACTTCTATCCCTTTGGCGACGTTTCTTAATAAAAATCCCACCAAGCTC 
ACCATTCGTTAGGTGGTACGAATTCAAAAATATTTATAATGAGTTTTCCATAAACAATTGTTTATTGAATAGTGTTTACTCTGTACCTCTCTGCA 
CTTGCCACTTTGTTTGGGCATACATATATGCACATGTTTTGCAGAACGTAGATTTCAACAGGAATTGAAAATAAACATATTCAATCGATAGCACA 
ACTGGGTAACTTTTGTTGCCTATGGCTCATTGTGTAAATATGTTCAGGCGGTTTGCCCTTAAATACCCAATTTGTGAATGCGGGAAAATTCAATT 
GCTTTAATTTGTTGTTTTTAAGAACATAAAAATGGGTTCATATGTATCTGACAGATAAGAATGAACTTGAACTTCATGAATGAGTTTGTAGGAGA 
ATAGAATTTTTGAATTTCATTTAAGATATTGTCTGAATGGTTGGATAAATTGAGTAATTTTTGTAGCAGAACTTTTCTGCTTTGTTCTTAGGAAA 
TAGTGTTTAGTTGAGCAACTGAAGACGAATAAGAAAGAACGCGAACTCACAAGTTAAAAGAATAAAATATAAAGGCACGTGCATTAAATCAACGC 
GTTTAATTGAATTCCTTTTGATTTTGGTTGCCAACTGGCTTTCCATTCAAGCAGACTTTCGCCTATCAAACAGTTTAACTTCATTCTGATATCTA 
AATTCACCGAGAAAGTGTAGCAAAGACGATGACATTCAATTGAAAACGATCGATCGATTAGAATTCCAATTTGCATAGCAGTTACTGTCGGCTTG 
TTCGCTTGTTTCAAGTGACACCTTTGTCGACTTTTCCCGGCAAAGACTCGAGTGGCTATCAGACAGAAGCAATCGAGAACCGCTTAATGAGAAAG 
CCAAAGAAACTTGTTGAACTACAAGAAAAAGAAGCACAAGAAGGCAGAAGAACCCGGAGGGGGAGGCAAAAAACATTTTCCAGATAAGCGGCAGT 
CGGAGTCGTGTGAATAAAAGCGTGTGTATTTGTTTTGTGGGTCAGCCTGTTCCAGCTAATTACCGTGAACTTGTCAGCGCACACAAAACGGCTGT 
AAAATGACCAAAGTGAGGTAATAAACACGGCGAATTAACGAGAAACCCGAGTGAATAATGTCCAAACTGGTGAAGAGAATCCGCAGCATGATCAA 
TCCAAGCAGCCAGCGGGATCGCGAAGATACCACCTCGGTATCCGCTTCAGAGGGCTTCATTGCTCGCAGGAAGTGCCACTCGCTGCCCCGAAGGA 
GCAATAGACACTCGACCGCCCGGCGATCAAACAGTGGTCTGTGGAATCGACTGGTCACCAATGTCTTCGGACCGGAAGATTCATATGACTTTGAC 
AACGAAGGCGACAGTGGTGGTGCCGTATTTTACACCGATTCGGTGAACGGTTCAAACTATGAGATATCTGCAGAAGTGAGTTAGTCGAACAAAAC 
TGGATAACATTTTATAAGATTTTAAGACGATGTCGGCTGGAGGTTACAACTTTTTTTTTTTTTACTATAGAACCATTATATTTTCATATTTCGGA 
TACAGTTCTCTAAGTATGTTCATCTCCTTGGGTTTTAAATTCTTTGTGCCAATCATGCACTTCAAATACTGCATAGTATTTCACTATGAACTTTG 
TTTGATTCAACTGCAGTTTTTGGCTTTTAATTAAATTGACTGCAGTCTATAAAGATGTCACACTTCTCCAAGAGGTGCCGCAGTATACGTTTTTT 
ATTGTTTGTGTTTCAAAAACTTGAATCGAATTTAATTACGAATATGTGTATAACCTTTTTCTTATTCCCATTTTTTCTCACCCACAGGGCGAATA 

CAT CAAG C AT C C C GTT C TGT AC G AGC T C AGT C AC AAAT AT G GT T T C AC AG AG AAT CTG C C G GAG AGCT GT AT G T C C AT AC GGC T GG AGG AG AT C A 
AGGAGGCCATTCGGCGAGAGATCCGCAAGGAGCTGAAGATCAAAGAGGGCGCCGAGAAGCTCCGCGAGGTGGCTAAAGATCGACGATCCCTCAGC 
GATGTGGCCGTTCTTGTCAAGAAGAGCAAAAGTAAACTTGCCGAGCTGAAGTCCGAGTTGCAGGAGCTCGAGAGTCAAATCCTCCTGACATCGGC 
CAACACCGCCGTCAATAGTAATGGACAAGGTGAGCATCCATCAATTCCCATAGAGATTGGGTAACTTTTTCTACCGTCGTCTCGCTTTTCGTTTC 
GAAAACCCACTAAAAGCCAGACATAAACTGTTGCAATTATACGAGAGCAAAAGTTTCATGCAACAAGTTTTGTGGGCAAGTGTAACGAGTCAAGT 
GTATTCCCAATAAAAACGAGGGCGTGGCCAAGGGGCGGGGTCACTAACGAAGCTATAACCAAAACAAGCTTCAAGAAAAAGTTTGGACACTGCAA 
TGAAGTTGAAAGATGTTCCTTTTTGTGTAGAAGATAAAGATAAGGTTGTTGGTTAATTCAATTTCAACCTGGTTGTAAAGTGTAATTCAATCAAC 
CAAAATGTTTGGTATATCCTGGGCTAGTTCGGATTTCTAGGTCACGTGGTTTCCGGGCAAGCAATTTAAATAAAGGAATGAGTAAATTCTGTATA 
AAACTGTTTATTATAAACAAAGATTTTGTAAAGATGTGTATTCAAAACTTTGTATTACTTGTTTCCAATGGACTAAAAGTATGAAGTAGTTTCCT 
AAAATTAAGTGATATTTAGAGCAGGGACAGTTTAAAATTGCTTTATAATAAAGCTGTAATTTAGTAGCTAAAAGCTATAATTAATAATAGTTTCA 

AT TAT G AAACC AT T GAAAT T T T T T T G AC C AAAAT AAAAAGC C G AAC AAAAAT G AAAGT AC TTACTTCC G AAAG AG AGT GAC AT C AAAAC CT C AC 
GAATGCTGCAATTGTTCGGGGCTTTGGGAATGGTTAAGTCCGTCTTTTACGAGTCTTTGTCACGCATTTTCTTTTACCTTTACCACATTTTTTTT 
TGGCCCATCACCTAATGCTTCATGGTGGTGACAAGTGAACATTGTGCTGGGTGTTAATGATCTGATTATCGCTTTATATATGCCCTCGTATCTTT 
AACAAAACACTTTGTGATAGTCTCGGCGACGGACGAAATGTTTTCTTCGTTTTCTTCGGTTTTTGTGAGCCAACCTATTGACACAACTGCCGTAA 
GTGTGATACGTTTGGTTTGTTTACATTTCCGAATGGAGACAGCGTTCTGGCATTCTATATATCTGCTTCCGTGTTTGGCCAATCCCCCAACAGAA 
GCATTTGCCCCACAAGAACCCCTCCAATGTGGCATTGTGTGGCGCCAGAGTCGACTTTTCATATGTCAGACGATTGACATGTTTGGCAGCTGTCG 
AAGAATGCTGCACAAATGCCGATTCTCTTTGAATAAATAGCTGGCGCATATAGAACCAAACACACACATTTTTAGTTTGTGCCAAACAAATGGCC 
TTTCAAAAATAAACAATGGGAGAAAAAGGGCATAAACAAGTTTTGGACATTTTTGCTGAGAAAATTTGGAACACCACTGCTGATTGATTGATTTT 
TTGGTTACCGTTTATTT ATT ATT AAAAT AAT AAT ACAACAGCTGTTTCTAACTT GAC AAGCGAAATTCTTTCTTTACGGTGGGTTTAAAAGTCCT 
TATCCTAAACTGGTATTGATAGAATACTTATTGTGCTGTACTCTGTGTATCTAAATCTATGAAAGGTTTCTATCATTCAATACTTCTTACGAAAT 
TCAGAAT AAAAT AGAATCTAATCTTTCCGATATCTTTCAGAATCGATCACTGCCTGCATTGATCCCAATGGCGGCTTCTTGGTCAGCGGTGCAGT 
TGGTGGCTTGGGCGGCGGAAACACGGCTCTGGAGGGCGGCGCACCGGCCACTGCCAATGACAAAGTGCTCGCCTCGCTGGAGAAGCAGCTGCAGA 
TCGAGATGAAGGTGAAGACCGGGGCGGAAAACATGATCCAGTCGCTGGGCATCGGATGCGACAAGAAGCTGCTGGCGGAAGCCCACCAGATGCTG 
GCCGACTCGAAGGCCAAGATTGAGTTTTTGCGCCTGCGCATCATCAAGGTGAAACAGAACCGCGAGCAGGCCGATCGCCTCAAGGCCTCGCGCCA 
GATGATCGACGAGCATGGCCAGACGATCGGCGGGAACAACAGCAGCCAGCCGCAGAGCCTGGAGACGACGCTGGAGGAGCGAATCGAGGAGCTCC 
GCCATCGACTGCGAATCGAGGCAGCCGTCGTCGATGGAGCCAAGAATGTTATACGCACGCTGCAGACGGCGAATAGGGCACCGGACAAGAAGGCT 
TTGCAGGAGGTGAGTTCTATCTTTTAAATCTGGGTTGATTATCTGTAGATATTTATATAGAAGTTTCGGGGAATCAAGGGTCATGTGGTAAAAGC 
ATAAATCTACAAGAAATTTACAAACAATAATCTAGAAGCCTAGAATCTAAACTTTAGATAACTATATAATTTTGGATGGCAGTTGCTATGGTAGC 
TTCTATTATGGAAGTTCTACTGTTCTTGGTAATTCAAAAAGAATCGGTATTTATGCTAATACTAGAGTTAATTAGAGCTGGCTGGCAGCCAGGTA 
GACACATAAACCCTGCGAGTATTCCCCCGCTAAGCTGAAATCAGGTGTGCCTGCTGCCATATGGCCACACGACGTATGAATCATATTTGAATTAT 
ACGTGCGTATACGTGATTTTTTCCCAACTTGTGGCAACGTCGTGTGAAACGAAACGGGAAGCGGTAAGGAGGCAATAAGCTAATTATGCGGCTGC 
TTTAAATATTTTTCAGTTCGTCCGCTTTCGTTATAATTTCAAGTTTTATTTTTTTAGTTGTTTTTGTCGCTAACTGTCGTCTCATTAATTAAACG 
GACGTGGAACGTCGCCATTACTAGAGTCGAAGAAAAATGATTTGGCAAGTGTGCTCAGGGCCTGCCCTAATAATTTTGGCATTAGCTTTTTAGTT 
CATTTCAAATTTTTTAGTTAATGGCATTGCTGACAGAAAGATGGTCCCAAAATAAACGCAAGACGAGTATTTTTTTCAAATTTCACAATGCATTT 
AACACTGAACGTACATGTGCAGGGACATAGATAAACTTTTTCTTATTTCCCAATGTCTGCGTTTCCTGTCGCGTGGCAAGCTCGCTTTTTTCCTC 
GCCATGATTTATCTTTTCACCGATGCTCGTGCGGTTTTGTCTTTGGATTCAACATAGACGCATACGTACAGACACTTAGTTTCCGCTGAAGCTAC 
TACAACAACATGTTGTCCGGCCTGCTGTGGCCTAAATATTCTATTTGCAGGCCCCATTTTCATAATTATTTGTTCTGCGATGCGGTTGCATCTGT 
TAGATGCAGAAATAAAAAAGTCGAGAGCAGCAGTGCGTTTCATATCAATGGGAACAAGTTTTCCTTAACAAGACCTAGTGATGGAGTTCTGTATT 
TTGGGATTTTCGGTTAGCAAGAACTTTCTATAGTACTGCTTTTTGTACTTTTCTTGCTGTGCATCCGAATATCGCAGAAGTTATTATACGATCTG 
CTAAACAATAAACACAAAACTTCTGCCGCTGCCCTTTTGGCCATTTGGGAGCGTTTTGCTAAACATTTTTAGAACTATGCCGTACGCTGCCGTAC 
ATAAGCATTGGATATATAAACCCGTCTATATGTCTGTGCCTCTGCTTAATCGGCCAAAAACGAGAAACGTATAAAACCGTCATAAGAAACGGAGC 
GCAAAGTTCTCTGGCCAGCATATGGGAAAATGTGTTGTTCAATGTGCGTGTGCATGTGCGTTTCTCTAGCGAATTTTCTCAAGCAGCTTTTCCTT 
GGCGTTCAGTTTGAGTTGGATCGCCGTTGGCTACGTGCGACGCCGCCTGGTCTTTTCCGATATTTTATAATTAGTTTTTAATTGCCGCGTCTTAA 
CTTACGTTGCGTTTGTGTTCCTTTGTCTCTAGTGTTAAATAAAAAAAATCCACTGAATGTGGAATGAAATAATAATAACAATAATAATAATGAAA 
TAATCGAAGTAAGCTCGCGAAAGGCACGCACACATGCTATATAAGTGGAATTCACTGGCTGAGCTTGTTATTACCTATTTGACATTACGCATACG 
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CAGCGTGTGCTGCAAGTTCTATGCGGTTATCCACCCGAAATTGCGTAATTCTCAGCTCTGCACTCAGTTTCTCTGAATTCTTTTTGGGGTCATCT 

tgcaattttggcgtaacgaaaaataaatcttgttataaataaaat^^ 

tgttttcgttccgccccatccatttgctaacccccgcgactatagtgtgtgtgcgtttttgtcgaatgaagctaaaaatagtgaaaacaaaaaag 
ccaaattaatatagttcaaactgcctgtaacgaattttacggtttattaaaacaaaaaataaactatttaactaaaaagtggtgtacaaaataca 
ggacgataagcaaaagtaattggatttgaaaaaaaaaccgagaaattcagatcattataaacactaaaagaattgcttgaatttgcctaaattgt 
atacaattcaaaggaaaagttttaaaaattataaacaatgtatggggatgttaccacaaattgttgatataaaattacctttaaacagttggaat 
tcaacatttaagccgttaatatatagaatttattcaatgtttggtcgacaataagaactatcaaagcggcttctcgatatttttagcaggaatat 

AAATTTCGTGTGTGACGTATATATATATATAAAAATATATATGTAGTATATAGTGCGTGAGTGTATTTATGTGAATATATCGATGGCACCATAAC 

CACAGCCAAAATGTGTTTATGCCACGACCGCAAAACAAGTGTTTAGGGAGTCATGAGCACGCCCACGCCCACTTCGGGGGCGGTTCAGGGCCAGG 

AGGTTCCACTGCCCCATGAACTGACCGCCGAGTGGACCGTGCGAGCACATCTAACTTTTGCCCGAGCTGGCGAAACGGTGCAAAAGGAGTTCAGT 

GCTCCGGAGACTGTGCATCCGATGCGCGTGGTCTACAAGTGGTGTACGCCCTGCGAGGAACCAGAAACAGAACAGGATGTCGACAAGGCAAGTTG 

TGACGGGAAAATCTTCCAACCTTTATTTATATCCTTAAAATCCTCTACAGACAGCGGCTACTGCGGGAACCAGCTTCCGTTCCACAAACTCCTCG 

GCCACCACCAATGCGGACTCGGCATTTGGCAGCCCACATGCTTCGAGGGCTCTTAACAAACCACCTGAGCCCACTGATTGTGCTGCTCCTCTGGG 

AGCCACTGCCCAAGTGTGTGGAACCCGCTGCCGGAGCACCTGCAACATTATGGTGTCCGCGGTGGCAGATTTCTTGCCTCAAGAGGCGGGAACCG 

GGTCAGTGAATGAGCCCTTTGTTTATCACAGCAAAGTGCGAGCACAGCAGCTAAAGGATGCCTTCTCCCAGACAAGCGACACGGAGGAGCAGCCA 

GCCAGGAGACGTCCAGCTTACGAGCAACGAAGGGCCACCACCGAAGCACTGATGAATTTTGCCAGCAAGCGACAAGCGGAGGAGGCTAACACCTA 

TGCGGTGCGGTTTCAAATTCATAATTGATTTTTAACTTTCTAACTAATTGCAATTTTCAGCCAACCTATTCGCCCACGCCCACAACACCTTCGTC 

TACATTTAAGTCGGCATCGATGTCGAGAATTCCACCGCTAGCTGGCAATATAAGCCAAGGAACAGCTCCTAAAATACCCAATTTGGGCGAGAAGA 

AACACCGCACTGTTCACATCGATGTCTACTGCACTGGTTCAGAGGGTGATGAGGAGGAGGAAGCCGATGATGAGAGGCATGCAGATGAGGAGCCC 

TCTAGCTCCTCCGACTCCGAGCTGGCTGGTAGCAAGCGATACGAACTGGACTCGAATTCCACCCCTCAAACCGTGCTGGACAATGAGCAGATGCT 

ATTGCGCCATCAAAGAATCTCTGGGGGAGCTATGCCCAGGCGTTTGGCTCAGAATCATGGGAAAAACAAAGTGGGGAACCAGCAGCAGCAAGGCA 

ATCTAAATTTGGGGCATGCCATTACAAAGTGCAGCACAGCCGAGGAAGTGTGCGAGTCCAAGCAATTGCTTTTCCGCAAACACATTGGTGATCAG 

CGGGCGGCCAAGTTGGCCAATCTACGTCAGAAATACATGCGCCAGCCCAGCGATGAGACCATCAGCAGCACATACCCGAACTCCTCGAGGTCAAC 

GATGCCAAGGGATGCCACCTGCAGCAGCATTTCCAGTGCCGTGGGAGGAACAGATTGTGTGGATTCATCGTGGAAGGATACAGACGATCTAGATA 

CACCAATAGCATCTTTTGGCTTAACCAAATCCGATAGCTTTGACTATGAAAACTCGCTTGATCGCCTGCGGATAAGCCAGATGGAGCGCCTATGG 

TCCCGTCAGCACTCATTGGACCAGGGTCACGCACAAACCCATCTGGGGGGAGCACCATCCCCTCATGCTCTGCAGACAATCACGGAGGTGCAGTC 

GCAACGAAGTTCCTTCCAGCGAAACGATACCATACCCTCGGAGTCGGATGGCTTCTCCGATACCAATGGATTCAATACCTATCCGGGAAGGCAGT 

CGCAATTGCAGCAGATCTCGCACTTTCCTCGTAACCGGCCTGGTTTTCTGCAGTTCTTTGGACCCTCCGGTGGTCCCGGCGTGGTAGCACAACCC 

ACTCCGCCAGCCACTGCACCTGTTCAACCCGCTGGGTCCGCTACTACGGATCCGTTTCGTATGCTCCACCCGAGTTTTCGATGGAAGAGCGAAGC 

CCGCGATAATCTGAGCGTCCAGGTTGCTTCCGGATCACCCTCTTCCGAGCGCAGTTCGCCACAACTCTTGTCTGCTGCGACCACCGTGGTGCGTT 

GTGGCAGTGAGGCGCCGCCAAGCACCACCTCTGCTAGTACTTTTGGAACTACAGTGACGCCTTCGCCAATTGCAGTGCGACGTTTTGAAATCTCA 

CGCGACAGTCCAAGTCTTGCAAGTGAAGCATCCACTTCCGTTTCCGGGTACACCCACGAGCATTTGGAGAAGGCGCGTCGCTTCGGCAACGTTGT 

GGCCATGAGAAAACCCGGCCATCATGTGGGACCCACCAAGAATCCGAATTGTCAATGCGAAAGCTGTCAGCGTTGGCTGGCCGAGAGGTTTCAGC 

TGCGCGGAAGAGCCTTCTCACTAGGCGAGAAGCCCTTATTAAGACGTCCACAGCTATAAAATCATACCGGGCTTACTTTTCTTTGTCTTAGGCTT 

GCATTTAACACACTGCTTTTCAGATTGATACGCTCCCTATTAACCCAACACCCAAAACCGAACCTAAATCCAACTTTATTTCCGATTAGTTGTAA 

GCCATGTAAGCGAATGTTCCTCACGCAAAACACAATTTGTGCGCTGAATCAATTTAATGTTGTAATAAAATAAAGCAATAGTTTTAACTAGAAAT 

GTATTTATTTATTTCGGCGAATGAAGAAAGAACTTATTTTAAATTATGTATGTTTTATAGGGTAAAAAATTATTTGGTATTGTGCCAGATGAAAT 

GAAATTATTATTAATTCGTTATATTTTGAAACAGGCCCATGGACGTTTGTCGGAATCGTCGCGAAAACTAGATCTCTTGCGCTACTCCTTGGATT 

TACGACGCCAGGAGCTGCCCGCCGATTCGCCCGCCGCCCAGCAATTAAAAACGGAGCTGCAGATCGTCCAGCTATCGACCTCCCCAGCTCCGGTC 

ACCTACACGTCACTGCAGTCCGGACAAGCGGGCATACTCGGCGGAAAGCCGTACCAGTCGGTATCCTCGCTTGGACGCTGTGCCAGTGTCACCGG 

AAAACTGGAGGTTCGCTTGCTGGGCTGTCAAGATTTGCTGGAAGATGTGCCCGGTAGATCGAGGAGGGACAAGGACAACAACTCAAGTCCGGGTG 

ATTTACGAAGCTTCGTCAAGGGCGTCACGTCGCGCAGCAGTTCAAAGAGCTATTCGGTAAAGGATGAGACGTCCATTGAGATCATGGCGGTCATC 

AAGCTGGACAATATCACCGTTGGCCAGACATCGTGGAAGCAATGCTCGCAGCAGGCCTGGGATCAGCGCTTCTCCATCGATCTAGACCGGTCGCG 

CGAACTGGAGATTGGAGTTTACTGGCGCGACTGGCGATCCCTGTGCGCCGTAAAGGTGCTGCGCCTTGAAGAGTTCATCGACGATGTACGACATG 

GCATGGCATTGCAGCTGGAGCCACAAGGTCTGCTATTTGCGGAGGTCAAGTTCTTGAACCCCATGATTTCACAGAAGCCAAAGCTGAGGCGCCAG 

CGTATGATCTTCAACAGGCAGCAAGCGAAGAACATCTCGCGGGCCAAGCAGATGAACATTAATGTGGCCACATGGGGCCGCCTGCTTAAGCGGAA 

TGCTCCTAACCATGTGCACATGGGATCGGCAGGATCAGGATCTTCTCTAACCGGTAGCTCGCCTATGGTGGTTGGTGGGTCTCGCGACTCTGAGT 

CTCCGATTTCGAGGACTCCCTCATCCGACGCACTGGTGGAACCGGAGCCCTACACGCCCGGAGAGCAGGCACAGAACTTGGAATTCGATCCCGAT 

GCAGGAATTAATGAACACGTGGAGACGCCAGGTGAATACCCGGATCCGGCGGCCAGTGGTCTGAGCGGAATGCGACCCCTGTCCATGCACATGCA 

GGGAATTAGTGTGCTGCCCCCGGAATCGCCACCCGTGGCCACCGGAGCAGCTGGTCGGCCCAATACGCTCAGCTTACAGATGCCGGGTGCCAGTA 

AAGGACAGGTGATCCAAGGCGGACGGACTGCGGCGCCTACAACGGCGCCACCGCCACCACCTGTGCTCAAGGCGACATCCACCACTCCGATATTG 

GATCAGGAGGTAAGTTTGCTAATTTATTTTACAATGCGATCCGTGTACAGTTCAGTATTTGACCGACCCTGTATAAAGATCTGCGTATGTACTAC 

TTAGCTGTGTACCCGTAGCTGGTTTGATCGTTGTGCCTGCCACTGCTCCTCCGCCCACTCTCTCATGTACTTATTCACCACCCACCCACTTATGC 

ACACACAAGCACTCAATCTGTACAGTCATCACATTATGAAATTTTGGTGTTGGTTTTGCATTAAGTATCTCTCTAATCTGTACATTTGGAACGGT 

AGCCGCACTTCATACAGAATCATGAATGTTCGCATAATTGGCAGAAAGTTTTACGAACTTGAGTTTGAAATACGAGATAGTTAAAAAGTAAATCT 

AATGAAATATGAAATATTAAGAAAAAATTTCGAATAACTTTAAATCAGATAAATTATTTGACTTATCTTCTTAAATTTTCGAAATCTATGAAAAG 

TGAACTTATTTTATTTCAAAATTCAAAGTAGTTTATGCCTTCTCTGCCTTTGAGTTTTCACATAATTCATCGTCATTATCGTTCACTCTTTCTCT 

TTATCCTCCACTCGCTTTCTTCAAGAAAAGTCTTTCCCTTCCTTGCTTTGATAATCCTATCGCCTTCTTTATTCATCTGCCCATATTCTTCTTTT 

TGTGGCCCATCGTCATCATGTTCTCTATCTATATCTCTCTCTCTCTGTATCTGTGTGTTTGTGTGCCCGTCTCGGTTTCCGTTTTCCGTTCCGTA 

TCGTCATGATTTCGTTAATCGCCGACAGCTGCAGGATGCCCTGCACGAATTCGATTTCCTGTCGGACCTGGACTCACGTCCCACCACGCTGCGTC 

GCCTGCTGAAGGAGCAGAAGATGGCGCTGGATGCCGGCGCGCTGGAGAATCTGCTGCTGCAGCAGCAACAGACTGAGCAGCAGGCACTGCAGCAG 

CGCCAGCGCCAGGAGCAGCTGCGTCTGCAGCAGTTTCAGGAGGCGCAGCGCCAGGCGATCCTGGACCTGTGCGTGAAACAGAGAGAGCCCGCGGA 

GCAGACATCACCCACCCTTGCCAACCAAACGCCGACGTTCCCACCGCTTGCCAGCAATCAGATCATGCCAAGCTTAAATGCCCGCCCTTGCCAAA 

GCCAAAGCCCAAACCACAACGAATTTCAGTTGCAGCCGCAGCCACCACATCCCGCCCAGAAACCGACGAATCCCGAACCACCGTCCGCTGTGGAA 

CAGCAGCTGCACCGTCCGGTCCTAATCCTGCCGCCCGTGGTGCTCCTGGGTGGTCGGGAGGAGGACCGTTCGGTGCCACCATCGCCACTAGTCGA 

GTATCCCGAGGACGACGACGAGTATCTGTATCGTGGTAGCAACGAAAACCTGGGCACCAGGCTGCACTCCAGCCACAGTTCCAGTGCCGGCGATC 

GTTTCTGTGTGGAGGTTAGGAGGTTTTTCATTTCCTGGTTTGGTTTCCGGTTTAATTTCTGTTTCTCCAATTTCTGATTTATTTTCTTTTTATGT 

TTTAATTTCACTAACCTTATTTTCCTTTTGATTTCTTACATGTTTGTTTTGTTTTTATTACGCGTTGGGTTTTACCTGTTTCTTCTTTAAATGTG 

TGTATGTTTTTGGTTTGGCTCTGGCCAAAAAAAAAAGAAAAACATTTGAAGCCCCAAAAATTGTGTATAAACATTTGACTCATAAACACACACAT 

TCGCACTGTAATCAATGAGCGTTCACAAGCATCCCAAAAAACTAACCTAGTCGTAGTGGAACTAACTAAAAGTTTAGCTATGAATGTGACTAACC 

ATTTAATTAACAGTGTTTAATAGTTTGATGTAGTTAAACATAGTTCCTATTGAAAGGAAGAGCTTTAAAATGGCAATCTTAATGGCACAGATGTA 
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AGCTGCACAGCTTAATTTAACGTAAATATAGCGTTTTTAAATGTTTAAATAACATATTTTCCAATAATAAGCAACTTGAGAGCATAAGAATTTGT 

TTTAGACTTACAATTTCCCACTTATTTGAAAAGGAACCAATATGATCGACGCGAAATAAAAGCTAGAAGTGACAATTTTAAACTCTTTAAATAGA 

CGCGCTTCTGGCGAATGACAGCGAAAAGATCAGAGGAAAGTAAAGGGGATATGTCTTGGGAGAATTGGGTCTTTAGGAGGCACCCGAAAGTGCGC 

CACTTCACGTGCCAATGTTCGGGAAATGTTTTATTTTTTCTCACCCACGCCCACCTTTCGTTTTGCCCTCCCTCTCACGTTGCAACGTCAGTGTT 

CCCCATTTCCATTTCCATTTTCATTTTCATCGAAGATGTTTGTCCCTGTGTGTTCGCTTACCTTTGTATCTATGTGTGTCCCAGCTGTGTACTCA 

TCGTTATGTGCGCTGTTGTCACATTTTCCTTTCGGCTGGTCATTTTTAGACCCAAAACAACGACGCAAATGCTTGGCCTCTGGTCATGACTTTGG 

CACTTGTTCGCCTTCAGTTGGTCGGCGCGGTGTTGGCCATCCCGCCTAAGTGTCGTATCCGTATCGGTATCGATCGCAATGTAGAAAAGTTATGA 

TTGTGCCAACCTTCCATGCGCTCGGCCAAGTCTTGAGGCAGATGTTTGCGGTCCAACTCCAACAGCAGCAGCAGCGCACTCATAATCAGCTCCAT 

TACCTCGATTTGTCGCCCGCGCGAACCATCAACCAACTGTTAGTGTGTCCATGCGCATCCAATCCTGTACCTCCTGCTGTATGTGTGTTTCTTGG 

GTTCAATCAAATCCCCGATCAATCTACCATACTATGAAGAACCATCCTCTTGCAGGCCCGTATTAGTCTTGTACATATAACCCTCGAACCGATCA 

ATGCCAGCCGGACGACCAGTTGCCTGATCGAGGAGGTAGCCGAGCCGGATTCACAGCCGGAGATTAAGCCGGTGGCAGAAGCGCAGTCTGCCAAA 

GTATCCGAGGCTTGTGTCGAAAGTATTCTCCCCGAGACAGTTGAAAAGTTAGAAACAGCAGACCAGGTCCAGCAGGTAATCGCCAGGGATTTAAG 

TGAATGGATTCACCAAGCTCTCAAGTTGTAGCCACTAATAAATGTACACCTAACGCTACTTTCCTTGATGACTAAAGTTTATCTTTCTGTGCACT 

CTAGGTTATACCACAGTTGGGGAAGCTTTACGTGGGCAGTAGCCAGCAGCAGTATGCGCAGCAGTCATCGCCCATCATCCAGGAGCCAGCTACTC 

CTACTATTTACGGAAACAGCGCCGCAGCCGGAGCGCCGCAGTTCCCGCAGCCCGCCCAAAGGCAAGAGAAGCAGCCTCCGCAGCAGCAGCCCATC 

TACGCTAACCAGTATGAGCTGAATGTGGCCAAGGCGGCAGCTGCAGCATCTGTTTACTCACCCAGCTCCTCCACCACCAGCAACTCCAATCAACA 

GCAGCAGCAGCAGCGCCGGAATGTGGCCCGTGGACTGCAGTATCGTGAATCTGGAGGACTCGAGACTGGACGTGCTGGCAAGCAGCCACCCAATG 

CGGGCATGTTGTCAATGGACAACTTCCGTTTGCTAAGCGTTCTTGGTCGCGGCCACTTTGGCAAGGTGATTCTGTCCCAATTGCGAAGCAACAAC 

CAGTACTACGCTATTAAGGCACTGAAGAAGGGAGACATCATTGCCCGCGACGAAGTGGAGTCCCTGCTTAGCGAAAAGCGCATCTTCGAGGTGGC 

CAACGCCATGCGCCATCCGTTCTTAGTTAACTTGTATTCGTGCTTCCAGACTGAGGTAAGTAGAGATAAGCTGAGGAGTTAGAAATTGAATAATA 

ATCTGTTTACATTTTCATCGTGCTAGCAACACGTATGCTTTGTGATGGAATACGCTGCTGGCGGAGATTTGATGATGCACATCCACACGGACGTG 

TTCCTAGAGCCGAGAGCCGTTTTCTACGCCGCTTGTGTGGTTCTGGGCCTGCAGTACCTGCACGAGAACAAGATCATCTACCGGGACCTGAAGCT 

GGACAATTTGCTTTTGGACACGGAAGGATATGTGAAAATTGCGGACTTTGGTTTGTGCAAGGAGGGCATGGGCTTTGGTGATCGCACGGGCACTT 

TCTGTGGTACGCCCGAGTTTCTGGCACCGGAAGTGCTCACGGAAACTTCCTACACACGAGCTGTGGATTGGTGGGGCTTGGGTGTGTTGATCTTT 

GAGATGTTGGTTGGTGAGGTATGTCGAATGGAACAAGCTTTAAAAATGCAGTTCATAATCATAATAATTAATGTTTACAGTCCCCATTCCCTGGT 

GACGATGAGGAGGAAGTATTCGATTCAATTGTCAACGATGAGGTGCGCTATCCGCGCTTCCTATCGCTGGAGGCCATAGCCGTGATGCGTAGGGT 

AAGCTGATTGATTGCAAAAGTAACTATTCGAAAGGAGAACTAATTTACAAATTTTTTTGGCAGCTTTTGCGCAAGAATCCAGAGAGACGTCTGGG 

ATCTTCGGAACGGGATGCGGAGGATGTTAAGAAACAGGCATTCTTCCGGTCAATTGTGTGGGATGACCTGCTCCTGCGAAAGGTTAAACCACCAT 

TTGTGCCGACAATTGTAAGTTAACTTGCACACTTCATTTATAAGGTTGCATATTTTAACATTGTTTTTAACTGCCTTGCAGAACCACTTGGAGGA 

TGTGTCAAACTTTGACGAGGAGTTCACGTCGGAGAAGGCTCAGCTTACGCCACCGAAGGAGCCGCGACACTTGACCGAGGAGGAGCAGTTGCTCT 

TCCAGGACTTTTCATACACGGCCGAATGGTGTTAGAAGCTCATTCTTAGTTCAAGGATTTACTAACGGGCCTAGCCCGATCCTTATAGCCTATGG 

CCTGCTTTTGCTTTTACTTGTGTACGAAATTGGTTTCGAGTAATTTTACTTGGCTTAGTAGTCGATGTGAGTTCAATATTTATACATGAAGACAG 

CATGATAATACAGATACAGGACATATATATTTATAAACTATATCTATATAGAAAATACATCTACTAACGAGAGAACTTTTGATTGTTGCTCACTG 

GCCGAGAGACCAACACAAACCAAAATACCTAGTTGTCCATGTTGTGGAAAAGCAAGCATTTGTCAATCATCCATGCAACCGGTTTAGCATACAGT 

GT ACC C AGAAAT C AT C C C AC AT T GT C GT TGC AT T G T AC AGC AT AT AGAGT TGTTTCCTTTTC C AAAC AC G AG AACG C AT T G AAT AGT C T G AC AAA 
CCCTAGGAAAGCCTTATACGCCAAGTTGTAGTTTTGGAAACCGTTTTTTAAGACCGGATACAAATAGATTTTATAGATGAGAAGTGTATGTCTAC 
ACCCCAGAGCAGTCATTGCAACCTATTTTCATAATCAACATACAAGAACAAAGATATATTTTATACATATATTGCATGCACATCCCAGGAATCGT 
CTTGCTAAACGAGCTCATCTTCTATACGACATACAAACATACATATTTATCTAGTAAAGTGAGAGAAAGTGAAATATTTACAGTTTTAACATGTA 
AGCCCTGTAAATTGCCTTTTGCCGCGTTTTATCAACTCCCCAGCCCATTGCCCGAGTTGTGTTATAATTTAAACATATTTTTTACGTCATATACA 

C AC GC AAAC AT T T GT T ATT T AT T AT T AT T AT AAC T AT T AT AC AAGAAAAC T GT AT T GC T T T GAGC AT T T T T AT T C GAC GC T AG AT C GAT GAT T GT 
AAACAATTTGTTTTGCATGCGCGGTGTGCTTTAAACCAGATATGAGGATCAGAAAACAAATAACCGCTCGACGGGATAGAATTTGGAAATTCCGT 
TGATGGATGGCATGCAATCGGATTGTTGTACTTCTAAGATCTATGTTGTATTTAATGTAACTTTTAGTTCGATTTAAGCTTGTAATAATCGATTA 
TTGAAACGTAGTCTAAGCTATCATTATGTGAAGATAATTATTGATAATTACAATTATGATAACGATTTTGCACACCTCATGCGAATCGAAACGAG 
TATAAACAATGCAGATCTTCTATTGTCATTTTTATTATGATTATGCTCACAACTTGCGGCTATAATCATCGATTTAATTTTAAATCTGTAAAGTC 
GGATTGGTGCACATCCAATCACATTGTTTAGTTTCAAGTTGCGACTGTAATTTATATTTATGATATGATTTTAAATAAATGTATCTATCGCTATT 
ATTTAACTGCTGTTTTTGTCGACTGGTTTTCGCTGAGACCTTCGCTCATTTGGTAATACAAAATATTGGCGGTATTGTCCACAACGGCTAAAAAT 
ATGTTGCTGATAGGGAATAAAGTGGTTAAAAACATAAATATTATTTTCATAACATTTAATACGCACCGTTTTTTAAGCT 

(SEQ ID NO: 184} 

Exon: 1001. .1595 

Exon: 1988.. 2309 

Exon: 3936.. 4474 

Exon: 9820.. 11124 

Exon: 14255.. 14875 

Exon: 14942.. 15313 

Exon: 15376.. 15483 

Exon: 15549.. 15689 

Exon: 15757.. 16464 

Start ATG: 1293 

Transcript No. : CT6660 

AGTGGCTATCAGACAGAAGCAATCGAGAACCGCTTAATGAGAAAGCCAAAGAAACTTGTTGAACTACAAGAAAAAGAAGCACAAGAAGGCAGAAG 
AACCCGGAGGGGGAGGCAAAAAACATTTTCCAGATAAGCGGCAGTCGGAGTCGTGTGAATAAAAGCGTGTGTATTTGTTTTGTGGGTCAGCCTGT 
TCCAGCTAATTACCGTGAACTTGTCAGCGCACACAAAACGGCTGTAAAATGACCAAAGTGAGGTAATAAACACGGCGAATTAACGAGAAACCCGA 
GTGAATAATGTCCAAACTGGTGAAGAGAATCCGCAGCATGATCAATCCAAGCAGCCAGCGGGATCGCGAAGATACCACCTCGGTATCCGCTTCAG 
AGGGCTTCATTGCTCGCAGGAAGTGCCACTCGCTGCCCCGAAGGAGCAATAGACACTCGACCGCCCGGCGATCAAACAGTGGTCTGTGGAATCGA 
CTGGTCACCAATGTCTTCGGACCGGAAGATTCATATGACTTTGACAACGAAGGCGACAGTGGTGGTGCCGTATTTTACACCGATTCGGTGAACGG 
TTCAAACTATGAGATATCTGCAGAAGGCGAATACATCAAGCATCCCGTTCTGTACGAGCTCAGTCACAAATATGGTTTCACAGAGAATCTGCCGG 
AGAGCTGTATGTCCATACGGCTGGAGGAGATCAAGGAGGCCATTCGGCGAGAGATCCGCAAGGAGCTGAAGATCAAAGAGGGCGCCGAGAAGCTC 
CGCGAGGTGGCTAAAGATCGACGATCCCTCAGCGATGTGGCCGTTCTTGTCAAGAAGAGCAAAAGTAAACTTGCCGAGCTGAAGTCCGAGTTGCA 
GGAGCTCGAGAGTCAAATCCTCCTGACATCGGCCAACACCGCCGTCAATAGTAATGGACAAGAATCGATCACTGCCTGCATTGATCCCAATGGCG 
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GCTTCTTGGTCAGCGGTGCAGTTGGTGGCTTGGGCGGCGGAAACACGGCTCTGGAGGGCGGCGCACCGGCCACTGCCAATGACAAAGTGCTCGCC 
TCGCTGGAGAAGCAGCTGCAGATCGAGATGAAGGTGAAGACCGGGGCGGAAAACATGATCCAGTCGCTGGGCATCGGATGCGACAAGAAGCTGCT 
GGCGGAAGCCCACCAGATGCTGGCCGACTCGAAGGCCAAGATTGAGTTTTTGCGCCTGCGCATCATCAAGGTGAAACAGAACCGCGAGCAGGCCG 
ATCGCCTCAAGGCCTCGCGCCAGATGATCGACGAGCATGGCCAGACGATCGGCGGGAACAACAGCAGCCAGCCGCAGAGCCTGGAGACGACGCTG 

GAG GAG C G AAT C G AGG AGCT C C GC CAT C GAC T GC G AAT CGAGGCAGCCGTCGTC GAT GG A GC C AAG AAT GTTATACGCACGCT GC AG AC GG C G AA 
TAGGGCACCGGACAAGAAGGCTTTGCAGGAGGCCCATGGACGTTTGTCGGAATCGTCGCGAAAACTAGATCTCTTGCGCTACTCCTTGGATTTAC 
GACGCCAGGAGCTGCCCGCCGATTCGCCCGCCGCCCAGCAATTAAAAACGGAGCTGCAGATCGTCCAGCTATCGACCTCCCCAGCTCCGGTCACC 
TACACGTCACTGCAGTCCGGACAAGCGGGCATACTCGGCGGAAAGCCGTACCAGTCGGTATCCTCGCTTGGACGCTGTGCCAGTGTCACCGGAAA 
ACTGGAGGTTCGCTTGCTGGGCTGTCAAGATTTGCTGGAAGATGTGCCCGGTAGATCGAGGAGGGACAAGGACAACAACTCAAGTCCGGGTGATT 
TACGAAGCTTCGTCAAGGGCGTCACGTCGCGCAGCAGTTCAAAGAGCTATTCGGTAAAGGATGAGACGTCCATTGAGATCATGGCGGTCATCAAG 
CTGGACAATATCACCGTTGGCCAGACATCGTGGAAGCAATGCTCGCAGCAGGCCTGGGATCAGCGCTTCTCCATCGATCTAGACCGGTCGCGCGA 
ACTGGAGATTGGAGTTTACTGGCGCGACTGGCGATCCCTGTGCGCCGTAAAGGTGCTGCGCCTTGAAGAGTTCATCGACGATGTACGACATGGCA 
TGGCATTGCAGCTGGAGCCACAAGGTCTGCTATTTGCGGAGGTCAAGTTCTTGAACCCCATGATTTCACAGAAGCCAAAGCTGAGGCGCCAGCGT 
ATGATCTTCAACAGGCAGCAAGCGAAGAACATCTCGCGGGCCAAGCAGATGAACATTAATGTGGCCACATGGGGCCGCCTGCTTAAGCGGAATGC 
TCCTAACCATGTGCACATGGGATCGGCAGGATCAGGATCTTCTCTAACCGGTAGCTCGCCTATGGTGGTTGGTGGGTCTCGCGACTCTGAGTCTC 
CGATTTCGAGGACTCCCTCATCCGACGCACTGGTGGAACCGGAGCCCTACACGCCCGGAGAGCAGGCACAGAACTTGGAATTCGATCCCGATGCA 
GGAATTAATGAACACGTGGAGACGCCAGGTGAATACCCGGATCCGGCGGCCAGTGGTCTGAGCGGAATGCGACCCCTGTCCATGCACATGCAGGG 
AATTAGTGTGCTGCCCCCGGAATCGCCACCCGTGGCCACCGGAGCAGCTGGTCGGCCCAATACGCTCAGCTTACAGATGCCGGGTGCCAGTAAAG 
GACAGGTGATCCAAGGCGGACGGACTGCGGCGCCTACAACGGCGCCACCGCCACCACCTGTGCTCAAGGCGACATCCACCACTCCGATATTGGAT 
CAGGAGGTTATACCACAGTTGGGGAAGCTTTACGTGGGCAGTAGCCAGCAGCAGTATGCGCAGCAGTCATCGCCCATCATCCAGGAGCCAGCTAC 
TCCTACTATTTACGGAAACAGCGCCGCAGCCGGAGCGCCGCAGTTCCCGCAGCCCGCCCAAAGGCAAGAGAAGCAGCCTCCGCAGCAGCAGCCCA 
TCTACGCTAACCAGTATGAGCTGAATGTGGCCAAGGCGGCAGCTGCAGCATCTGTTTACTCACCCAGCTCCTCCACCACCAGCAACTCCAATCAA 
CAGCAGCAGCAGCAGCGCCGGAATGTGGCCCGTGGACTGCAGTATCGTGAATCTGGAGGACTCGAGACTGGACGTGCTGGCAAGCAGCCACCCAA 
TGCGGGCATGTTGTCAATGGACAACTTCCGTTTGCTAAGCGTTCTTGGTCGCGGCCACTTTGGCAAGGTGATTCTGTCCCAATTGCGAAGCAACA 
ACCAGTACTACGCTATTAAGGCACTGAAGAAGGGAGACATCATTGCCCGCGACGAAGTGGAGTCCCTGCTTAGCGAAAAGCGCATCTTCGAGGTG 
GCCAACGCCATGCGCCATCCGTTCTTAGTTAACTTGTATTCGTGCTTCCAGACTGAGCAACACGTATGCTTTGTGATGGAATACGCTGCTGGCGG 
AGATTTGATGATGCACATCCACACGGACGTGTTCCTAGAGCCGAGAGCCGTTTTCTACGCCGCTTGTGTGGTTCTGGGCCTGCAGTACCTGCACG 
AGAACAAGATCATCTACCGGGACCTGAAGCTGGACAATTTGCTTTTGGACACGGAAGGATATGTGAAAATTGCGGACTTTGGTTTGTGCAAGGAG 
GGCATGGGCTTTGGTGATCGCACGGGCACTTTCTGTGGTACGCCCGAGTTTCTGGCACCGGAAGTGCTCACGGAAACTTCCTACACACGAGCTGT 
GGATTGGTGGGGCTTGGGTGTGTTGATCTTTGAGATGTTGGTTGGTGAGTCCCCATTCCCTGGTGACGATGAGGAGGAAGTATTCGATTCAATTG 
TCAACGATGAGGTGCGCTATCCGCGCTTCCTATCGCTGGAGGCCATAGCCGTGATGCGTAGGCTTTTGCGCAAGAATCCAGAGAGACGTCTGGGA 
TCTTCGGAACGGGATGCGGAGGATGTTAAGAAACAGGCATTCTTCCGGTCAATTGTGTGGGATGACCTGCTCCTGCGAAAGGTTAAACCACCATT 
T G T GC CG AC AAT T AAC CACTT GGAGG AT GT GT C AAAC T T T GAC G AGG AGT T C AC GT C GG AG AAG GCT C AGC T T ACGC C AC C G AAG GAG CCG CG AC 
ACTTGACCGAGGAGGAGCAGTTGCTCTTCCAGGACTTTTCATACACGGCCGAATGGTGTTAGAAGCTCATTCTTAGTTCAAGGATTTACTAACGG 
GCCTAGCCCGATCCTTATAGCCTATGGCCTGCTTTTGCTTTTACTTGTGTACGAAATTGGTTTCGAGTAATTTTACTTGGCTTAGTAGTCGATGT 
GAGT T C AAT AT T TAT AC AT GAAG AC AGC AT GAT AAT AC AG AT AC AGGAC AT AT AT AT T TAT AAAC TAT AT C TAT AT AG AAAAT AC AT CT AC T AAC 
GAGAGAACTTTTGATTGTTGCTCACTGGCCGAGAGACCAACACAAACCAAAATACCTAGTTGTCCATGTTGTGGAAAAGCAAGCATTTGTCAATC 
ATCCATGCAACCGGTTTAGCATACAGTGTACCCAGAAATCATCCCACATTGTCGTTGCATTGTACAGCATATAGAGTTGTTTCCTTTTCCAAACA 
CGAGAACGCATTGAATAGTCTGACAAACCCTAGGAAAGCCTTATACGCCAAGTTGTAGTTTTGGAAACCGTTTTTTAAGACCGGATACAAATAGA 
T T T TAT AG AT GAG AAGT GT AT GT C T AC AC C C C AG AG C AG T C AT T G C AAC C TAT T T T 
(SEQ ID NO: 185) 

Start ATG: 293 

MSKLVKRIRSMINPSSQRDREDTTSVSASEGFIARRKCHSLPRRSNRHSTARRSNSGLWNRLVTNVFGPEDSYDFDNEGDSGGAVFYTDSVNGSN 
YEISAEGEYIKHPVLYELSHKYGFTENLPESCMSIRLEEIKEAIRREIRKELKIKEGAEKLREVAKDRRSLSDVAVLVKKSKSKLAELKSELQEL 
ESQILLTSANTAVNSNGQESITACIDPNGGFLVSGAVGGLGGGNTALEGGAPATANDKVLASLEKQLQIEMKVKTGAENMIQSLGIGCDKKLLAE 
AHQMLADSKAKIEFLRLRIIKVKQNREQADRLKASRQMIDEHGQTIGGNNSSQPQSLETTLEERIEELRHRLRIEAAVVDGAKNVIRTLQTANRA 
PDKKALQEAHGRLSESSRKLDLLRYSLDLRRQELPADSPAAQQLKTELQIVQLSTSPAPVTYTSLQSGQAGILGGKPYQSVSSLGRCASVTGKLE 
VRLLGCQDLLEDVPGRSRRDKDNNSSPGDLRSFVKGVTSRSSSKSYSVKDETSIEIMAVIKLDNITVGQTSWKQCSQQAWDQRFSIDLDRSRELE 

IGVYWRDWRSLCAVKVLRLEEFIDDVRHGMALQLEP 

HVHMGSAGSGSSLTGSSPMVVGGSRDSESPISRTPSSDALVEPEPYTPGEQAQNLEFDPDAGINEHVETPGEYPDPAASGLSGMRPLSMHMQGIS 
VLPPESPPVATGAAGRPNTLSLQMPGASKGQVIQGGRTAAPTTAPPPPPVLKATSTTPILDQEVIPQLGKLYVGSSQQQYAQQSSPIIQEPATPT 
IYGNSAAAGAPQFPQPAQRQEKQPPQQQPIYANQYELNVAKAAAAASVYSPSSSTTSNSNQQQQQQRRNVARGLQYRESGGLETGRAGKQPPNAG 
MLSMDNFRLLSVLGRGHFGKVILSQLRSNNQYYAIKALKKGDIIARDEVESLLSEKRIFEVANAMRHPFLVNLYSCFQTEQHVCFVMEYAAGGDL 
MMHIHTDVFLEPRAVFYAACVVLGLQYLHENKIIYRDLKLDNLLLDTEGYVKIADFGLCKEGMGFGDRTGTFCGTPEFLAPEVLTETSYTRAVDW 
WGLGVLIFEMLVGESPFPGDDEEEVFDSIVNDEVRYPRFLSLEAIAVMRRLLRKNPERRLGSSERDAEDVKKQAFFRSIVWDDLLLRKVKPPFVP 
TINHLEDVSNFDEEFTSEKAQLTPPKEPRHLTEEEQLLFQDFSYTAEWC* 
(SEQ ID NO: 186) 

Name: protein kinase C-related kinase 
Classification: protein_kinase 



Celera Sequence No. : 142000013385194 

CCCCTGCCACCCTACGAGCACAAGCCGGCCAAGAACCAGGCGGAGGCAGACGCCACCCTAAAGGCGCTGCGGCGCCAGCAGGAGGTGGGCCAGCG 
ACAAACTCTAGTCGCCATCAACACGGACTACATCACGGAGAGCATCGCGGTGCACAAGGGTGAGATTGTCACGCTGTGCGAGTGCCGCGAGTCCA 
AGGATCAGCGCCAGTGGTTCTACGTGCGGACGAGGGACGGACGCGAAGGCTTCATTCCGGCCGAGGTGGCCGGCCATGGCTACCTGTAGACCTCC 
ACCTTCTGGTATTGGTATGGGTCTCTAAGGGTCATATGAAAGGGATTAGAAAGCTTAATGTTAACTATGGACATCCTTTGTCATATTAACCAACC 
ATCATATAATTGCTATATCTAGAATATCTTAATCTTTAAGATTTTCTCATGATTTTGTGGTAAATTACTTAGCTTTATATTTCGACTCTTTCCAT 
TACT GAGT AAAGTATCATT CAT GAAAAAATGGAGCCC AAC CACCGTAAGTTGTCCAAAGAGCACCTTCTCCCTTTTAAGATTTCCAAAAGG AC CC 
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ACCCCAAAAACAAGTCAAGTTCTACATTTAAATGAATTAACTTAATACCCATCTTAAGCACTCATAGGCCACTAATTCCATTCGCGATCCTTAGT 

ACGCTTCTAGTTGTAAACTGTATTCATATCGGTGTCCGTAATTGAATTGAATTGTATCTGTATCTGTGTATGTTTATGTGCGTGAGTGTGGATCC 

TTCGCAGGATCTTGTATTTATCTAGCCTAGCATTACGAGTACTCTATGTAATAGAAAACTTGCTTAAATGAAATCAAAGACGAAAGCGAAGACGG 

AACCTATTTATATAATAACTGTACTTGATGCTTCTAATAAATAACAGACATTTAAATAAATATAAAAAACCATTGCGTTACTATTGACCTGGGTA 

AGTGGGTAGAAAAGCAATTGCATATTTATACATATTTGTATTATTCATTTTATTATACACGTTTCACTCAGAATTGATATGTGTATATGAAGATA 

ACATAACGATTTTCAGGGTAGTTATAAATGGGTTAACGTTCCAATATTTGAACTAATTTTTTATTTTTTTTTAATGATCTCTATAAATGTATTGT 

TTATGCACTCTAAAAATAAGTTTCTTATTAAAAATATAAATACCATATACTTAAGCTATTGATTTCAATCTGTATATGCATGGGTTCGATTTAAA 

ATTATTGCAACCTCCGATCCGAGTTTGTCTCACAGGCACTCTGCAATTTTACCAGGAAGAGGTGTTGTAGGTACTTAATTTAATATGATTTCACT 

TAGGGAGTTAGTCGCCTATATGAGCATAAATGTTAGCACAATTTGGGTCGAATGAAGGAGACATTTACAAAAGTGAGGACCGTTGCTGGGTTGTA 

AAGGGCCCCTAAATAATAATTTACAATTGGTTGAGCTGACATGGCTTTGTTGAGCAAGAACTACAAGAAAACTACAATTACATATCGAATGCAAG 

ACAAATGCTACGCTACACAGAAATACATAACAGCATGAACTGAAGTTCTAGGAAACAACTTAAACTTACGTACCAGTTGGGCAATTCTTACCCTA 

CATACATAGACAAATTAAAATATCATGGCTTAAGATCTAACCACCGGGTGAAGTGACTGTTCCGGCTGCCTACTTTAGAAATTTCGGACTGGAAC 

TAGGCGACGCCGTCGACACGTTGTCCAGGTTGGTGTTCGTGTTGTCGAAGCACTCGTGTTGTGGCTCCGGTTCCGTGGGCGGCAGCGGTTTGTTC 

AGCAGCACAGTCATGTTCTTCACGTTGCTGGGCTTCTCCTCGTCCTGGTTGATGTTGAGCATGGCGTCGTTCATGCGGTGGTAGTGCGGCTTCTG 

CGAGGTGCTGGGTCGTGAGTAGTCCAGGTGATCGTATTCATCTAGCGTATAGAATTCAGACAATTAGATCGTTGTTATCTTAAATTGGTCATAAA 

AACATTGGACATGTGTGAGGACGAACAGAGCACTACACGGAATGGTTCTTTAACACTGGCTTACCCTCGGGAAACAAAATCTTGCTATATATTTT 

CACTGATTCATGCCGTAATGTGTGCAAATGTAGAGTACATAGTTAATTGTGAAGTTACTAAGCCTGCCTGTGTAAAAGTCTAATGACATACCGGG 

ATCCTTGTAGCCCTCCTTGTGCTTGATCTCGTCGTAGACATGCTCCTCTTTGAGGGATTCGTTGTACACAATCGGATTGGTCCGGTCCGCGTTCA 

AGTTCTTCGTTAGCAGGTCGTGGTTGTAGTTGATCGAATAGCTACCCACTGTGGAGGCAATGTACTATGAGACACTGACCCAAAATATTGTGCTC 

TAAACATCTTACCCCTGCCACTGGCATTGCAGTCATCGCCGTAGTCCGTGCTCATCGTGCTCCTCTGGTCAAAGTTGTTCATCTTGGAACGCATA 

TTGTTGGGCAGGAGACGCGTTTCCGCCTGCATGCCGTACACGGGATTGTCGAAGTTGTGGTTGGGCGGCCAGCTCGGGGGATTCGTATCATGGGT 

GTAGTGTACATGGGCGATTTCGGTTTTTAGATTGGACACACGTCGGCGGTAGTAAATGAACACTGCGAAGATTATGCACGCAAACAGGGTCATCA 

GGACTAAGGTCAGAGCTACACTCGCCCGGCTCGCTGCAATTCAAAATGGATGAAATGAATTAGTAATGTTGAGGAATATATTCTGATCAATGAAT 

TATCTTACAATTTTCGCTTTGGTCTGCAATACGTTGAGAGGCGATTAGCTCGTCGCAGTTATCGCCGGTGTAGCCACTCCTGCACACACAACCGT 

GGGCCGCATGGCACTGGAAATTGGCCGACGGACAGGCACAGGTATTCATGCAGTGCTCGCCGTAGAATCCCTCCGGACAGACCTCATCGCACCGG 

GTGCCCGACCAGCCGGTTTCGCAGATGCACATACCGTCGGCCTTGCGGCAAACCTTGTGGTGCACGCAGCGGCACCGCTGGGCGCATCCTTGTCC 

GTAGGTGTCTGTGGGACAGGACTCGTTGCAGTTGGAACCTGTCCACCCTGGCAGGCACTGGCACTGGCCTGTAACGTGGTTGCACTCGCCGCCGT 

GCTCGCAATTGCACTTGAGCTTGCAGCCGGGTCCGTAGAGTCCAGCAGGACATGGGTGCTCGCAGGTGAGACCTATGTAACCCGTGCGGCAGAGA 

ATTTCACCTGTGATGTGATCGCAGCTCTTGTTGCCTGAAAAATATGTGAGTATTAGGGGGTGTTTTGCTTTCGCCTTGTAATCTTATCCAATATA 

GCTTACCATGGAGGATCTCCGGGCACCGTTCCTTGCATTCCATGCCGTAGAATCCAGGTGGGCAGGGTTCCGCACAGTCGGCGCCAGTCCAACCG 

GCTGAGCAGATGCAGTTACCGGAATCGGGGTCGCAGGAGGAGTTATTCAGGCATCTGCAAACTTTATCACAATTTTCACCATAGTAACCGCTCCG 

ACAGTTGGTCTCGCAATGAACGCCTTGGGTTGTTCGATCGTTGGGTATCAGACAAGGCCAACAAAAACCAGAAACACAAAGGTGCAGAAACAGAA 

AAGCAAATTAAATTAAGGAAATGAAACGTGGCCATGCAAAATAGGTGCGATAGATGCAAAAATGAAAGTAAGTGAAATGAGTTCAGTGCGCACTA 

ATAAGGATATAAGATGGAACTGATTAACATGTTATAGGTAGAAAATCATTCATCAGATTTACACTTTGTCTTACAAAATTGTAAATATTTAATAG 

AAAACTGCCTTCTTAATATCAACAGTTTGGTTTAAAAAAATAAACAAATTTTAAGGGGTTCCTTAATTACTGTTAAAAGAAAATCATGCAAGCTC 

TACAATTTGGATATTTTATCTAAATTATTCAAGAATACACGTAACCCCTTTCTTATTGAACTATCAATTGATGGAAATCACGCCTTTTTTCTATG 

ATTTTAGATCGGGGCAACCTATAAATCTCGCTAATGTCAACAGTGATCTATGAACTTGATGAGCTGAATTCAATGAGGTATATAGAAATCCCGCT 

AGTACCTCCCCAATCCTGCTTGCACACGCATCTTCCATTGGTGGCATCGCAGGCCAGGCTGTTGTTGAAGTCGCACTGGCACTTCTGGGCGCAGT 

CGTGTCCGAATTTGCCGGTATCGCACTTCCTTTCGCACCGCTCGCCCGTCCAGCCGGCGGCACAGGTGCAGCTTCCGTTCTGCGGATTGCAGGCG 

GCGTTATTGTGGCAGTCGCACACCTTGGCGCAGTCCTGTCCGAAGTGGTTCAGATCACACGGCCGGTCGCACTTGATGTTTTTCCATCCGGCCGT 

GCACAAGCACTGGCCCGTCTCCGGCTCGCACTTGGCTCCGTTCTGGCAGTCGCAGCGCAGGGCGCAGTCCTGACCAAAGGTGCCCGGCTCGCAAC 

TCTCCTCGCAGGTGGGTCCGCGCCAGCCCGGCGCACATAGGCACGTACCATTGACCGGCGAGCACTTGGCTCCGTTCTTGCAGTTGCAGGTCAGC 

TCGCAATTCGGGCCGTAGCGCAGGAAGGTGCAGGGTCTGGTGCACTGAGCACTGCTCCATCCGATGCTGCACTGGCAATTCCCCGTCTCCGGGTG 

ACACAGATCCGTGTGCTCCATGTCGCACTCGCAGGTGCGGTTACAGTCCAGCCCGTACTTGTTCGCCTCGCAGATCCGCTCGGCGCACTTGGCAC 

CCGTCCAGCCGGGATTGCAGATGCACGTTCCGTTCGCCCGATCGCACATGGCATCGTTCGCGCAGTCGCAGGTCATGCTGCAGTTGAATCCGTAC 

GTGTTCAGCTGACACTCGTCGAAGCAGCGTTCTCCGCGATATCCGGGCGGACACTCGCACTGGCCGGTGATGTGGTGGCAGGGCGCTCCCTTGTA 

GCACTCGCAGCTCTCTTGGCATCCAGGGCCGTAGCTGCCCACCGGACACTTGTTTGCACACACGTCGCCCGTCCAGCCTGGATTGCACATGCAGG 

CCCCGGTCTCTGGCTGACACTTTCCGTCGTTCTGGCAGGGACAGTCCTGCTGGCACTGGGCGCCATGCTTTCCATCCGGACAGCGCATATCACAT 

CTAGTTGGAGCGGAGTATTAGTATGGGTTTTGCGTTGGAGCAGGAGCAAAATGAAACGTAGAATGCAACGGCCTAAAATCCCAGCCACGAAAAAT 

AATTGATTGCATTCTGCCTTTCATTTTGCCTGATCCTCGGCCAGAAGTTTGTGCCGCCAAAGAGGCCCCTTCGTCCACTTACAGGGGACCCGTGA 

AACCGGGAGCACACTGGCATTCGCCGGACACGTGGTGACACTTGCCGCCGTTCTCGCATCGGCACTTCTCCGAGCAGTTGGCCCCGAAAAAGCCC 

TCCGGACAGATGTCCGCGCAGCGTGCTCCGGTGTAGCCCTTGGCGCACTCGCAGTCGCCTGAAAAGGGCTCACATACCGCGTTGTTCAGGCAGTC 

GCACTGCATCGAGCAGTTCCGGCCATACCAGCCGGGTGGGCAATCTACCGAAGAGTGACACAAACATGGTTAGCTGCGGTTACGGATCACAAGGT 

ACGGGTTCGACAGTTAGTTGCTTTTTTCTTGCTTTTGCCTTTCGCTTGCTAGCCACGGGAAACCCAGGAGCCGGAAGAAGTTCCGCTCCCGGAAG 

CTCCCGTTTTGGTGGTGGGGATTAGTGACCGATAGCCTGCTTGCACCTGTATAACTGAGCCGGAGTGAACTGCATGGGTGCATGAGAGAGAGAGA 

GTGGAGTGTAGGTTGTGGGTGGTCGGAGTTAGATTTCGGGTTAAGATGGCATAAACAAAAAAAAAAACAAACAAAAGCTACAAAGCAACATATAA 

CATATAGCAGAAATATAGCTACAAAGACACTACGTGCAGATCTTTGGTAGTCCAAAGACGGAATGCCCTTGCAGAGGCTATAAACTATATACATT 

TAAACCTTAATAATAATCTTGTTTTTTGATATTATTATTTTGACTAGAATAGCATATGGTTTTCAGACAAATTTGATCATTTCCTTCTTGCTGAT 

TGTAAAAACATTCAATTTTGATGAAAATTAAAAAGTCAGCGAGCAGATTAGTAATCCATTCGCCTTTCTTTCATGAAGTAGGAGTGCCTGTTTCT 

CGTCCCCGGCAGAAAAACGTGAGCTCGGAACTCGTCTATGTTGAATTCGTTGGGTTTGTAGACAAAAGGTTCCTTGATCACCGACACTGAACTGA 

CTTTTTTCACTAGCTCTTCGACCGGAGGTGCGTCCTTGGTCACTTTTTTTGTGATGCCAAGCATCTGCCAAATATCGTACTCCGACTCCTCCGCT 

ACTAGTGACTGTGCCGCGAGGCTTTGGACACTTGGCGATGGGGTCGGGCAGAACGGCAAGCACTCCTTCTTCCTCGCCATGTCGTTCTCTACCTT 

TTGCACCATCGGATGCTTCAGCGTCTTGGCAAACTCCAAAACATCGCCAATGCTGGGGAAGCGCTGTGTATCGGTGACAAAGCCGTTTTCGCCCA 

AGAAAATGTTTCGCCTGACTGCCACGTTCGCCTCGAACTCTTCTTGCTCGTCCAAGTCGCAGTCGCGACAGTCAAAATGGGTGACCTTCATGTTG 

GCGGTGAGGGCGCATATAAGTCTTGTACCTAGCTTAGCACCTAAAGGAATGGCCACCATTATCAGCCGACGAAGTGGATCGTGATGACGCAGAAG 

ATCGCCTAGTTGGTCCACAATTGTCCGAGGATTCGCCTCGATGCCGGACACATTAAGCTCCTCCACGTGCTCGGTGCCCTTGATACCGTTGCACA 

AAATGGACAGGGAGTCGCCGCTCAACTTATTGTGGGCCAATCCCAGATACTCCAGTCTGGAGAGGTTCGGTCGCTGCAGTGCCTCTCCGATAGCC 

GACATTGCCCGCGTGTCCAGTCGGTTGTACTCCAGCTCAAGGATCTTTAGCATTTGCGGCCGCTCCAGCAGAATTCCCAACTCGGGGCCACAGTC 

GTCAGTCATCTGGTTGTACCCAAAGTCCACTACCTCCAGCAAAGGCAATAGGCGCAGAGCCCGGCACACGGTGTATAGCTTAAAGCTATTAAGTC 

TGCTGTTTCGAAGCCGGAAGATCTGCAGCTGCTGCAGTGCCACTAAACCTCTAGCCAGGCGAACCATGTCCTTGATCGAAAAGAGCACGTTCCGC 

TTGTGGTACTTTCTACCCAAAGCCGGACCCAGAAACTCCAGGTTCAGCGATACCAGATTGACAAAGTGGCGCACAAAGCTAAGATCGATGTGAAC 
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GCAGTCGTCCTCCGGATATTTTAATTGTTGTAGATGCGTCAGATATCTTTGCTTGTTGCGTTTGTCCAAAACCTTGTCATCGCCATCGTCTGGTT 

CAGGCTCCACGCGGATATCAAAGGTGCCCTTTATACGCTGCCTTCGCTTCTTTTTCTTTCGTGGCTCTGGTTCCTTGGGCTTCCGCATCTCGGAC 

CGTCGTCGCCTGCGCTCCTCATGCTCCGCCTCCCTGGCAGCCTTCAGATCACGCAATCGTTGGCGGGCGACGTTGCGTTCATGTCGACGTTTGCG 

ACGCTCGTTGCCGGAGTCATCTTCATCGTCGGAAGCATTCAGCGTAAATTTTGTATTGAATCCAATTGCCCTTTTCTGGTCAGAGGCGATTGAAC 

CCGCCCCCTCCTCTTCTTCCTCCCCCTCCTCATCGACGTCTTCTGGTGTCATTGGCTCATCACTGGAGATTTCCATGCCACCAGACTCCTCGGAT 

GTTATTTCCGGTTCCGAGTCGTCTTCCTCACGTTTTGTTTTCCGAAAGGCATACTCCGACAGCGGCTGCAGGTGCCGAATGTCCATGCAGCGAAC 

ATGATCCTTTATTAGCTCGGCTAGTTCGGTCATCTCTTTCTCTGGCCAAAGGGCGGCCGGACAGGCCTCCACCAGCTCCATGTACTTCAGGCTCA 

GTCCCTTGCCGCGCCAGTCGTAATCCTGCAACCCCTTCAGTGCAAGGCATGTCTCGGAACTCTTAGCTAGGACCACCCGCCTCCAGTAGCTCGCA 

TCCTCTACGCAGTAGCACTCACGCAGTGGCAGATCCACATCGAGTGAATCGTAGTGGATGCGCATCTTTCGCGGATCTTGAAGCACCTCGTCGGG 

CACGTTGGCATTTCCACGTTTAGCCATCTGTCGGACGCAGAGATCCGCCAAAGTGGGCACTTGGTCGGCATCGTTAAACGCCTCTGCGAAGAATT 

CCCGATTACGCTCCGAGAGTAAATTCCGATTGGCGGTCAGGGGCGGCTTGTAAAGCTCGTACACGGCCAGCATGACCGGATCATTCGGAATACTC 

AAGCTATTGGTGTTCGGCAGCGGCAAGTAATCTGAGTCCTCATCCTCCATCCTATACGTTCCAAGATGTGTGTGGTGTAGAGCTATTTATGGTAT 

TTTTTATTTTTTCCTATTTTTTTTTTTTGTTTGTATGCTTTTTGTTTGAGGAGTATTTGTTTGACTTTAATGTGTTGTCATATTGGAAGATTATT 

TTCCTCTGGCTTTAGTGGGATCATTGATAACTGAATACATTTCAGGTGTCTTTTTTGGGTCTGAGAATCGACTAATTCGGTTCTGTAGTATGTAC 

A AAAT AT TTTAGCTTTGCGTTGTCT GAT AC T TT AT AT AC T T AAAAG AAC AT AAT C T AC AGT C G AG C AG C AG AAC AAT T AAAC G T AC AGT T AAT T T 

CTAATCTTTGCAAGGGCATTTAGAACTACAGAGCTTCAAAGCTGCTTAAGTGATTATACAAGCTCTTATCTACGGGGCTCGGCTCATCGCAGGCA 

ATCCGGGGATATCGTGAACTTGAAGTTTCAAAATTGAAAAACCGCCGTTTCTGGGATAAGCCGCACCCTTGGAACACCCAAGGCGATTCATTAGG 

GGCAAATTACGATTAGCTTAAAGGATCAACGAACTTATATCGCAGGCGGGTCCTCCGTAACCATGATCACACTTGCACTTCTCCGGGGAGATGCA 

TCTACCGTGCTGGCAGGGTTCCGAACAATGCGGCACGCATTCTCCTGCGCTCGCGATGTAGCCATCTAAAAGCGGTTAAAAATGGTAATATGTAA 

AGTAAAGTCAAACATCCAGACTCATCCAACTCACCACAACAGTCCCTCACAATACGGTTCTTTGCAATCGTCTTGGTTTTGTTGACCACTCGGTG 

TTTAATGCGATAGGTGGAGCACCTCGGCGGGAATGTGACGCACCAGGTGGAGCCCCGTTCCTGGAACGATTGCAGTTCTGTGTAGACAACGTCTA 

CGTTATATCTAAAGGTAAAAGTATTCATTAATTTAGTTCATTTGTCGTGATAGGGGATAAGCTTACAGTTCTCTTCTTTTGCAGATATTGGGTCC 

ATCCAGATCTTTAAGATCCGCCTGAGCCAGTACCAGCTGGGCCAGGCAGGCTATGAGGATTACCGGCAACATGGTGGCAGTGGATTCCAGGTCCT 

GGAATGAGTTACTCGTTCTGATGCAGCTTCACTGGCGGATCACATTGAACTGATAAACACAGTCAGTATAAACTAAAGGATGACTCAAGCCGAGA 

TTAAACCGGATTCAACTGCAAGAACAAAGAGAGAATGAGTTTCAATAAAAAAAAACTTTCTCCGTATGATTTTACATACTACATATTATATGTAT 

AAT AT AT AT GGGG T AT AG AT T AAT AAAT AC AT AT TT T T G AAAAG AGCT AG AT T T AAT T GTT AGAAAAC AT C CAT AGT AT TTCCCGTC GG AT G AGT 

TTATGACGTAAAGCTTTTTCCGGGTCGTAGCCACCGGTCCAGGGTAGTCCAGTCTAAAAGTAGAAAATTTCCAACACAGTCAGAAATTGAAAGCT 

GCTTAATATCCCCAGAGGTGCCAGGAATTCGATTCTTCTATAAATCAAGAGCGCAGCAGTAGAGGATGCACAACCAACGTTCACCCTTCAATGGA 

GGCCCGGGAGGATTCGGAGCCAGTGCCTTCAACTTCAATGCACCACAGTTCGGCGACGGCGCTGCGTTCTCCGAGGGTCTCAACGGGATGCCCTT 

TCTGGGCAAGATGCCCACCGAACCGATGATGCGCTCCATTAGCCGCCAACGCTCGCTGGTTGATGGGGCTGGGCTACCCAAGATGCCGGCACCTC 

TTCCCGCGATGGCCCAGCAAATCTTGAGCGGCGCCGGAGTGGGACGTAGTCAACGCTTCCCAGTGCCACCTATCTCCACTTACTATGGGCACTAC 

GAGGGCGACTTTTCGATGTCTTCAATGTCAAAGCTAAAGGACACTGCCTCACCCCCACAGACCCAATGGCAGGAGCATAAAACACCGCGTCTAGC 

CGCACTTTCAGAAACCGACTCCGAGAGCGACATATGCAGTCAGCAGAACAGAGCGAAATTGCCTGAATCCCGTTGGCGTCGTGCCATTAACTACG 

TAACGGCCGCTCTGCCGGAGCAACAGCGCCAACGCATGGAGGTGATGGTCAAGCGCTGTGGCCAGTCGGCTATCGCCAAGCATCTTCTGTTGGTG 

CTGCATCTGATCGCCCTGGTGGTGGTTGTGTGTTTCCGCCAGGTCGCGCGACTTGGATGGCTCTGCGGCGACATTCGCTCTCGCCTAGGACGCGT 

CAAGTACCAGTCGCGCAGCTACCTGCGCCACATGCTGTGGCGGCTGGCCATTGCCAAGAGTAACGACACCTTTCTCTTTCTCATCGTGGTGCTGG 

TGACGCCCTGGCTATTCCTTCTCAGCCTTGTCGGCTTCGCCATATCCTTCGTCTTCTCCATGAGGACCACGTTGGCAGAGGGAGTCCGTCAGCTG 

CGCCGTCGCGTATTTTAGACCATAAGTGGAGGGAAATGGCTTTAGGAGCAGCACCCACAAGCTCCATCGCCCCCAAGTTATCACACAAGGACACT 

AAAGCAGCGCATTCCACAAAATAACACATGACATACAGCAAGATGAAACCACTCGAATTTTAACTATATTTTGTGTTGCAGCAACGAAATAGAAC 

TCAAGTAACGTTTGTGTTTCCATGGATTCAGGCTTGGGTTTATATTCCGTAGAGCAATATTGCATCATTCCCCATTTCGAATGTTGTTTTGATTT 

TGGAAAATTGCGATCTGCTCAAAGGT GTT AAAT ATCTGGCC AT CAGCTTATGTGTGATCTGATC AAT AAAT CGC AAT CGTATTTATTTGTTTACA 

GCTTAATCCGCTCGCTGGCCACTTGTTCGCCACTCATCAAACCCATTTATTGTGTGTATTATTTTTGCCATTTATTCTGCTCGATAAATATGAAT 

GCGAACAAATCAAAACGAAAACATTTCCGCCGCTCATGGGGCAGAACCACAGATTGTGGGCCTTTGTTGTTCATGCTCTCCGTAATTTCACTTTC 

AATTGCATCCCACCAGCAAAAAGCCCCATTTGTAGTATATTATATGATTCACTGGCTACCGACAGTCACATCTGCGTTCCACGGCATATGAAAAG 

CGTTCAGTGACTCGGGAAAACCCAGGAGAACCTCCTCCAGTTTCCATTCAGTGCCCCAGGGCCCACTTCACAAATATTCAGAGGTCACATCTGGG 

ATTTCTCAAATTTCAACTCAACATGGGGCTAAGAGATATTGAGAACTTGCCATTAATTATGTCAAGTCTACGTGTGAACTGGGCCAAGTAATATC 

CGAATGTGGTTTACAATTACCAGTGACCCATCAAGAATGGATTGCATAAACGTGGATCGCTATCTGTAAAATTATCCAGGCCGAACCATTTGCAA 

TTGATGCCAGCACGGTTTTTCAGGAAATAAAGATCCACTTTTAATACCAAACATTAGGTATAATATATATGACAATCGCGAACAAAAGAGAGCTG 

TTTCTTTCAGTACTTGGTTATTTGGAACTGACATGAAGCACGGGCCAGCCATGTCTGGTCAGTTGGAAAGTACGTACAATTAACTGGGGGATTGT 

CGTGATCCACAATCTATATTTATATTGCAAAGATAATTGTTGGTTTCATTTCTTGCCAGCGAGAAATCTGATTCACTTAATCGCACGGCAGGTGC 

TCCCTTAACTCATCAGAAAGTGGTGTTTGTCAAGGCCGTC ATT AGCAAGAAATCGCGTGATGAACTGCCAACGGCGCC AAAT AAAT TTTAAACGA 

GAAATTGTTCGAAACCCAGTGTTAATAGAACGTTCACGTTTTAGAGCAGATCGTGAATGTGTTCGCCCTCTTAGATACATATTACAATCCTTAAG 

ATAGCCGAAACGATAATATACAACTGGGGCAATATAACCTAAAGTAGAAAGAGATAACCATTTGGCATACCCCTCGACTTTGCTCAAGATTTTAC 

AAGTGATAAGGGTGGATGATAAGACCGCCGCAGTCATTCTCATCCATTACAACAGCGATGAGGTTCACTGATATTTGTTCTGTTGTAGGTGACTA 

ATTCAGTCTCTCTATATATAAGCAGATCCACGGAACGGAAGGTGCGTAATCCGGTCATGGACAAAGACTAATAGAATAATACCAAATCATCCCGT 

AACATCATACAAATTTACGATGGACAACGAAATCGTGTGAACCAAACAATCCGGTCTAATTTAATTTGCAATTTGAATGCTAAATCTTTCATTTT 

AACGATTAGCAGAGTCCACGTTTTTCAAACAAAGTATAAACCTTTGGAATCAAAAATATTAACCATAAACTAGGGTGTTACTCTAACCCATTGCC 

ATTAAGTTTCATTTTGCCACTCAATCTTCCCTTCCGCATATGTAGATCATATTAAAATGCGAACTTGCACGAGACAACGAGCTTACATTTGCAAT 

TTACTTATTTGTTTACATGCCTGATTGGTGTTTTAATGAGCTAAATGAATATGTCGTGGATGAGTCACCTTAGGTTAAACAATGCTAGACTATAG 

TCTATAAACTATACGGATCTGTTAGTTGTATGTATAATAGGCGAGTGTGCGAGATTTTTCAATGGAAAACGTGTTCGGCACAGTTTGTTTTATCG 

AGTATCGAATCCATGCACCATTCATCGATACCGCATACCCAGCATCTGTTCCAATGCTGAGTCACTCGGATTTCATTCACAAAAGTCTGGCTAAG 

GCTCGAACTTCGGGTCAATCTACCGCTCAATTCGCAGGTTTGTATTTCAGACCTTTCCCAATATGTGTGTACTTTTGGCACACCTCATATTTATC 

ATACCTCGCCCTTATCTGCGCCCAGTATTGGACAAGCAGAAACAACACGATGAGGCACACTGATAGTGGTTCTTAATCGGAAGGGGTCAACGAAA 

AGT T G A GGGAG AAC C AT AAAAT GT T AG C AAA GGT AAT AG AAG AT T GAT T AG AAGT AAGT T GC GAG TCCGTTTCCGC T GGC AAT C T GT C T AT AA T G 

T AT AC AT ATGT AGGT ACTTAT AT ATGACT AATTT CCT GAT TTT C GATG AAAGCT TT CG AAAG AAAAAAAC AAAACAAAAACAAAC CAAAGAAAAT 

CTGCGAAATACTTTCGGACTAAGGAGAAAGCTTTGTTGTTAAAAAAACTCAATGGCATTCGTCTATTTCATTCAATTAGAATGATATGTATTTTA 

TAAAGTATAAGCATTGCCTATTTTCGTTCCAATTAAACCTCCATCAGAGCTAAGAGTAGGTCTTATATTCTTCAAGTTGTGTGTATATATTTTGT 

GATGTATTTTCAATTGATTCAGCACAGTGCATATGTAAACATCTGGATTTGACTTGGATTACTATAATCTTAAGCATAGGGGGAAAATGAGCCAG 

CTATTCCCCCAACTTCAAGTACTGCGCACTTGATAAAGGCTAAACAAAGATTGTCGCTTCCCGCTTTATATAAACAGAATTATTCTACAAAGGAG 

GAGCCCTCACTTGAGCTGTACTTTGGATAGCGAAATCAAAGTCGAGTTTACTGAATCCACAATACACATCTCCAATAACCGCAACGCATATGTTT 

GGCGTTCAGTGGGCAATTTAACGCCTTGGCGAGGCGACAAAGCACATTTGGCAAAAACGATTATTAAGAATAATAAACGATTTCAATGGCAAGAA 



FIGURE SHEET 99 



ATAGATTTGGTATAAAAACAAGTTGTTCTGGCCCAGGaaATGAAATTATTGAAAGAATATCAGTTTGCACGAGGTCTAGGGTCTTACATTTGTGA 
ACTACGTAAATGCTAGGCCAATCAAAACTTTCAGTGAACAGTTTAATTAAAGCAAAGAGGCAAAGGGTTCTCAAACGTTATTATTTTGGGATGGA 
TGTTAGCTGCCATTATGAAGTATCAGTGAGAAACAGTGATTTCAGCTGTTTCGTATTCATTTCACAAGTGAGCCAAACACTCAACACCTCAGTTT 
GTATTGGCCAATTAGCGGGCGAAGCTCGGATTTCTAGCCCGAATTTGGATAGAGCTAGAAGCTTTCTCGATTGATTCAGCGAAGAGAACCGATTT 
CAGCTGAAAACCRAGAAAACATGTGTATATTATTCTAGTGGAAAGGCACCCACACATTGCAATGCAGACAGACGAACAAGAATTAAGCATAATCA 
TCAGCAAAAGCGAAATGCACTTGCAACAATACGGAGAAAGCAAAGAAAAATGTTGGATAATGCAAACCCAAAGCGAAAAGATAGAGAACAGAGAA 
TGTCGTAATCATAAGGGGAAAACTAACATCCCGACTCATGGGTTGCTCTCTAACCACCGCCACATTGGACAAGAAATCCCAGGAGGTTTCATCCC 
AGCCACTCCCTGCCACTCCTCCACCCAGCATTATCAATAAAGCTGAGTAATCATGGCGACGACAAAGCCCGGCTCCATAAAGGAACCTAAACAGA 
CATTGTGACATGTACGGCATGGTATGATATGATATGATACCGAGGGTGGGGAGAGTGCACCTCTGAGTGCCATTTCAGTTTTTAATTTTGCAATG 
GCTTGTGGGACTTTTCGCTGTGCATTGCATTTCGAAGCTCGTGTCCTAAGCAGGCGTCATTGTTCATTAAATCACGTATGATAATAAAAATCAAG 
TTCCATTGCCAGTGTCAGAAATATAAGGTTAAGCTTAAAATCGCTTGGTTAAGGAAACTGTGTGAAAAATAAAAATGCTTTGTATGGGAAATATG 
ATCGCAGTTTATTCCATACTTATTTAATGAATTCTATAAAAAATAAGAACGGTTTCTACATATATCTTAATTTGTGTGCACCCCCATATTCAGAA 
GTAATATGAAATATCGCTGAACTGATACCTTAATCAGTACTCATAAAGAATTAAACGCAAAAAAATTTTTTGCGACTTGATCTAAATTATAAACG 
GCTATTATTGTTATTTATTTTCGTGACGATGCATCGACTGTATTTTAATGTGTGCGTTAGTTAGGCAATTTCGATGTGTGTGAGTTTATGCTGGC 
TGGGTTTTCATTTGATCCGTTGGCGTTGTTGTTGTTGCGGCTGTGACAGCTGGAAGCTTCAGCAGCAATCTGGAAACTGGAAAGAATCCACAAGA 
TGCGGGTTTCCAGGGAGCGAGGTGCTGGTCGCAGGGGGTCGATGAAAGAGAAGTTTGCCGATTTGCTGCCAATTCCCCGAATGACAGATATAAAG 
ATACATGCATGAGATACAAAGAAGAAGAAGTCGACGAAAGCGCAAAACTAAAGCCAAAGAAAGGCCATTAAATTGTGCGAAAGAGAAACCAGAAC 
AGGTTTAACTGTTCTTGGTGTGCGCCCGATAAAAGCGCTAATTGACACGCACACAAGTGCACTACTGCGGTCATAAACAAACCGCTCGTTGGGAA 

agaagaaaggtggaagggaaagagack;tgacaaagctaaccaggcgaaattgcacatt^^ 

ACTAAAATTAACGTTAAACTGCCACGATTGGGGTTAAAACTCACCTTGGCAAGAAGCACGGTGTAAACTGGTTTTCGGTTCTCACTCAAAACAAA 
CCAAGAATGCACTCGCAATCGAATTCCACTCAATTTACACGTTTACGAAACTTTGGCCATTAAAACCTCGAGAGGATTTGTATTTTTAAATGCTT 
ACAGCACCTTACCACATATATATATAGCTATATATATAAATTTGCTATTGTTTTTACTTTCTCACCATCCACAAAAACTTGTTTTCGTTGTCGCT 
ACTCTCTATTTTGGCACTCTACTCGCAACTGAGCTGATTTAGCAATTAAAATGCAACCTCTTTAAATAAAAACAATTACTTTTCGTGCACTCCAC 
TAACGACAACAATAAGGACAAACGGGCGCGACGCACAACAACGCAATTCTTTAATGCACTTAAGTATTTTCGTCGGTCAATTGGAGCTCGTTTGG 
TTTTCGCAAGCACCGCAATTACTATTTACTTTATTTTATATTTGCGTACGCACACCACCGCACGAAGCAGAAATTTATTTAACTTATTTAAAATT 
CAGCAAGAAACGGACAACCGACTTACTTAGCTGCACTCGCCAGTGTGACCGCTTTGCGAGTGTTGGATGAGCTAGTAGTGTGGCCCTAGCGCACT 
CCCCAGGAAATACTCACTGTGAACAGTCACTCTTTCCACTTATAACACTTTTCACTTGAAAATAAATTTGAATTCTATCTTAAAACTACATTGTT 
TATTTGCGGAGTAGATACGCCTGTACCACCTGGGATAAGCAGAATGAAAAACAGCTGGAGTAATTAAAACACTAGTCCCCGCTCATTTCAGATCT 
CCACCGTCATGGACACGATCAAACTGTGGCACAGTGGGCAAGTTCGGTGCCTGGAGATCTCGGTGGGAGCTGCGCAGTGGTGGCAGGTGGTACAG 
ATCCATATGTTGTTCGTCGGCTGGGTGATGGGTTTCCCGGAGGAGATGCACGCCGGGAAGCGAGCATTGCACTCGGGGCAGGAGGGGGAGCTGAA 
AGGATCGGCATTCCATGAGTATCATCATTATAATCCAGCTTCTGACGTCGACCCTTCACTTACCTATCTGGCACGGGCACTCCGCAGGAATAGCA 
ATCCACCCTATCGCCAGTGGTGTCCTCCGGATCGTACTTGGCGAAGATCCCAGCCGCCAACTCCTCGTACCGTTGAAGGGTAGCCTCGGGAAGAT 
GAGCCTGTTGCTCCAACTTCATGAAAGCCTTGGAGCAGGTTCCGAAGGCCCGATCCGCACAGCTGGCCAGTGCCAACAGGCTGTAGATGTGCTCT 
GGAGGCAGGACGTCCTCGTAGTCGCGCAGCCTCACCGCAGTGACCACCGCACTGTGGACGATCCC 

GAAGTGGTAGGCCTCGGCGCAGTGCCAAAGCCTCTCGATGGCGGCCGCATCCTCCAGGGCAATGGAGTCCAGTAGGGTGTTCCTGCCGCTGGCGT 
AGTCGATCTCCGTCGTGGCCACAGCCTTGAGGTGCTCCTCGGCCAGCAGAGCAGCCAGCACGTACAGCTTCTTGATTCTCAGCAGCGGAGCAC 

(SEQ ID HO: 187) 

Exon: 16478.. 15910 
Exon: 9610.. 9377 
Exon: 9318.. 9155 
Exon: 9090.. 8965 
Exon: 3481.. 3332 
Exon: 3264.. 2764 
Exon: 2 693.-2388 
Exon: 2328. .2182 
Exon: 2092.. 2060 
Exon: 1941.. 1001 

Start ATG: 9477 (Reverse strand: CAT) 
CTGGCGAGTGCAGCT^ 

AAATATAAAATAAAGTAAATAGTAATTGCGGTGCTTGCGAAAACCAAACGAGCTCCAATTGACCGACGAAAATACTTAAGTGCATTAAAGAATTG 
CGTTGTTGTGCGTCGCGCCCGTTTGTCCTTATTGTTGTCGTTAGTGGAGTGCACGAAAAGTAATTGTTTTTATTTAAAGAGGTTGCATTTTAATT 
GCTAAATCAGCTCAGTTGCGAGTAGAGTGCCAAAATAGAGAGTAGCGACAACGAAAACAAGTTTTTGTGGATGGTGAGAAAGTAAAAACAATAGC 
AAATTTATATATATAGCTATATATATATGTGGTAAGGTGCTGTAAGCATTTAAAAATACAAATCCTCTCGAGGTTTTAATGGCCAAAGTTTCGTA 
AACGTGTAAATTGAGTGGAATTCGATTGCGAGTGCATTCTTGGTTTGTTTTGAGTGAGAACCGAAAACCAGTTTACACCGTGCTTCTTGCCAAGT 
TGAATCCGGTTTAATCTCGGCTTGAGTCATCCTTTAGTTTATACTGACTGTGTTTATCAGTTCAATGTGATCCGCCAGTGAAGCTGCATCAGAAC 
GAGTAACTCATTCCAGGACCTGGAATCCACTGCCACCATGTTGCCGGTAATCCTCATAGCCTGCCTGGCCCAGCTGGTACTGGCTCAGGCGGATC 
TTAAAGATCTGGATGGACCCAATATCTGCAAAAGAAGAGAACTATATAACGTAGACGTTGTCTACACAGAACTGCAATCGTTCCAGGAACGGGGC 
TCCACCTGGTGCGTCACATTCCCGCCGAGGTGCTCCACCTATCGCATTAAACACCGAGTGGTCAACAAAACCAAGACGATTGCAAAGAACCGTAT 
TGTGAGGGACTGTTGTGATGGCTACATCGCGAGCGCAGGAGAATGCGTGCCGCATTGTTCGGAACCCTGCCAGCACGGTAGATGCATCTCCCCGG 
AGAAGTGCAAGTGTGATCATGGTTACGGAGGACCCGCCTGCGATATAATTTGCAGATGCCTGAATAACTCCTCCTGCGACCCCGATTCCGGTAAC 
TGCATCTGCTCAGCCGGTTGGACTGGCGCCGACTGTGCGGAACCCTGCCCACCTGGATTCTACGGCATGGAATGCAAGGAACGGTGCCCGGAGAT 
CCTCCATGGCAACAAGAGCTGCGATCACATCACAGGTGAAATTCTCTGCCGCACGGGTTACATAGGTCTCACCTGCGAGCACCCATGTCCTGCTG 
GACTCTACGGACCCGGCTGCAAGCTCAAGTGCAATTGCGAGCACGGCGGCGAGTGCAACCACGTTACAGGCCAGTGCCAGTGCCTGCCAGGGTGG 
ACAGGTTCCAACTGCAACGAGTCCTGTCCCACAGACACCTACGGACAAGGATGCGCCCAGCGGTGCCGCTGCGTGCACCACAAGGTTTGCCGCAA 
GGCCGACGGTATGTGCATCTGCGAAACCGGCTGGTCGGGCACCCGGTGCGATGAGGTCTGTCCGGAGGGATTCTACGGCGAGCACTGCATGAATA 
CCTGTGCCTGTCCGTCGGCCAATTTCCAGTGCCATGCGGCCCACGGTTGTGTGTGCAGGAGTGGCTACACCGGCGATAACTGCGACGAGCTAATC 
GCCTCTCAACGTATTGCAGACCAAAGCGAAAATTCGAGCCGGGCGAGTGTAGCTCTGACCTTAGTCCTGATGACCCTGTTTGCGTGCATAATCTT 
CGCAGTGTTCATTTACTACCGCCGACGTGTGTCCAATCTAAAAACCGAAATCGCCCATGTACACTACACCCATGATACGAATCCCCCGAGCTGGC 
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CGCCCAACCACAACTTCGACAATCCCGTGTACGGCATGCAGGCGGAAACGCGTCTCCTGCCCAACAATATGCGTTCCAAGATGAACAACTTTGAC 
CAGAGGAGCACGATGAGCACGGACTACGGCGATGACTGCAATGCCAGTGGCAGGGTGGGTAGCTATTCGATCAACTACAACCACGACCTGCTAAC 
GAAGAACTTGAACGCGGACCGGACCAATCCGATTGTGTACAACGAATCCCTCAAAGAGGAGCATGTCTACGACGAGATCAAGCACAAGGAGGGCT 
ACAAGGATCCCGTGAAAATATATAGCAAGATTTTGTTTCCCGAGGATGAATACGATCACCTGGACTACTCACGACCCAGCACCTCGCAGAAGCCG 
CACTACCACCGCATGAACGACGCCATGCTCAACATCAACCAGGACGAGGAGAAGCCCAGCAACGTGAAGAACATGACTGTGCTGCTGAACAAACC 
GCTGCCGCCCACGGAACCGGAGCCACAACACGAGTGCTTCGACAACACGAACACCAACCTGGACAACGTGTCGACGGCGTCGCCTAGTTCCAGTC 
CGAAATTTCTAAAGTAGGCAGCCGGAACAGTCACTTCACCCGGTGGTTAGATCTTAAGCCATGATATTTTAATTTGTCTATGTATGTAGGGTAAG 
AATTGCCCAACTGGTACGTAAGTTTAAGTTGTTTCCTAGAACTTCAGTTCATGCTGTTATGTATTTCTGTGTAGCGTAGCATTTGTCTTGCATTC 
GATATGTAATTGTAGTTTTCTTGTAGTTCTTGCTCAACAAAGCCATGTCAGCTCAACCAATTGTAAATTATTATTTAGGGGCCCTTTACAACCCA 
GCAACGGTCCTCACTTTTGTAAATGTCTCCTTCATTCGACCCAAATTGTGCTAACATTTATGCTCATATAGGCGACTAACTCCCTAAGTGAAATC 
ATATTAAATTAAGTACCTACAACACCTCTTCCTGGTAAAATTGCAGAGTGCCTGTGAGACAAACTCGGATCGGAGGTTGCAATAATTTTAAATCG 
AACCCATGCATATACAGATTGAAATCAATAGCTTAAGTATATGGTATTTATATTTTTAATAAGAAACTTATTTTTAGAGTGCATAAACAATACAT 
TTATAGAGATCATTAAAAAAAAATAAAAAATTAGTTCAAATATTGGAACGTTAACCCATTTATAACTACCCTGAAAATCGTTATGTTATCTTCAT 

ATACACATATCAATTCTGAGTGAAACGTGTATAATA 
(SEQ ID NO: 188) 

Start ATG: 703 (Reverse strand: CAT) 

MLPVILIACLAQLVLAQADLKDLDGPNICKRRELYNVDVVYTELQSFQERGSTWCVTFPPRCSTYRIKHRVVNKTKTIAKKRIVRDCCDGYIASA 
GECVPHCSEPCQHGRCISPEKCKCDHGYGGPACDIICRCLNNSSCDPDSGNCICSAGWTGADCAEPCPPGFYGMECKERCPEILHGNKSCDHITG 
EILCRTGYIGLTCEHPCPAGLYGPGCKLKCNCEHGGECNHVTGQCQCLPGWTGSNCNESCPTDTYGQGCAQRCRCVHHKVCRKADGMCICETGWS 
GTRCDEVCPEGFYGEHCMNTCACPSANFQCHAAHGCVCRSGYTGDNCDELIASQRIADQSENSSRASVALTLVLMTLFACIIFAVFIYYRRRVSN 
LKTEIAHVHYTHDTNPPSWPPNHNFDNPVYGMQAETRLLPNNMRSKMNNFDQRSTMSTDYGDDCNASGRVGSYSINYNHDLLTKNLNADRTNPIV 
YNESLKEEHVYDEIKHKEGYKDPVKIYSKILFPEDEYDHLDYSRPSTSQKPHYHRMNDAMLNINQDEEKPSNVKNMTVLLNKPLPPTEPEPQHEC 

FDNTNTNLDNVSTASPSSSPKFLK* 
(SEQ ID NO: 189) 

Name: EGF repeat molecule 
Classification: cell_adhesion 
Gene Symbol: BcDNA: GH03529 
FlyBase ID: FBgn0027594 



Celera Sequence No. : 142000013384481 m ™ m ^ mrT1 
AAAATGCAAATCCAAAATCAATCGTACTCCGCCCCTCGTCCTCCGGCAATCGATTCGTCGATATTCGGGGATTCCCTTCCTGATTTGCATAAGTT 

TCCAGATAGCGCGCGTGCGCAGTCGATCGGCGCACCCGAATCCGTAATGCGGAGCAAACCCTCCTCCCCGATGTAGTGCAGGTCCATTTGCATAA 

CACTGGCTGGAGGAGCTGGTGATCCGGATGTCGAACTTCCTACCCGATCGCCCGCTCCGCCTCGGGACGATGTGCCAAAATTATGCAAATGTTTT 

ACACCAATAATTTATGCAAATTACGATCCGCAAAGGATTTGGTGATCCTGCGCAGCGGATGTTACACGTAGATGTCGCCAGCTTCAAATCGAAAA 

TGTTTTTTAGGGGTTTGCTCAAGTTCAGAATTTCTGCAAACGTTTTTTTATATGGTAATTTTTTTTTGCAATGGGTCTTAGGAAATATTTGGCAT 

TTTTAATTGTGTTTTTTTGCACGATTGTAATGCAGATTTTTTGCTCGATTGTAATTTTAATTTGCAAATTTGGGTTTTTATTTTTTGCATTTGAA 

TGGATTTCGATTCGATTATTTCAAAAAGGACAGGCAGCTGCTCCAGCTTGATATCCTTTTTGCAACGCGCCAATTTCTTAAAAATCATTTCGATT 

CCCAGTTTTGCCGAAATTTTTGTGCACACAATTTGCATAGAGATGTGAAGGAAAAGGGAACAAAGATGCGATTCTATACTTTAAACGTTCTAAGA 

TTTGATTATATTTATGGTAATAGGCGAATAAGATTGCCGATGCCAATATTTAGTGTGAGAAAGTATTTTTGAAAATCTTTTTTGAAATCGCTTAT 

GAAAAAAATACTGTAGACGTCGAAATATTTAAAATATTCATCTTAGAATATGTATTTTTTGATAGAATAGAAAATATATAGAGATAATAACAAAA 

AATTTAAAATAATTAACTATACAGTTTCTTGTGCCCAGATAACTTATTTACCCACTTTCTGCACGTAACTTTGCCGTAATTTGCCGTAATATCTT 

TGCAAATTGCCTGCGATTTTGGATAAAATTCCTCTGCTGATCTGTGCTCTTCTTTCATTTGCTGTCCCTTTCTAACGCACGTTGGCCATGCGGGA 

TTCCTTCACAAATACACGCTCCTCTTTCGCATTTTTTTTTACCGAAAAACCTCTCTTTTCTAGATAAATTTACATTACCGCTGGCGTTTTATTGG 

TCGAGAAAAGCGAGCAAAAAAACGGAAACCAGCCGAAAACAAGCTGGGTTTTCCCAGAGCATTTTCCTGCGAAGGGTCGGTTAAAAAACTAGGGG 

ATGAAGGTCGAACCACCACCACCAACGGACTTCTCATTTTATTCGTTTTTTAAGAGAGCGAGAGCGGCGAGATCGGTTTTTTAAGAGAACCGGTG 

GCGAGAGGGTTGCACTTGGGCATGGATTTGCGCAATTTTTGCTATATTAGCCGGAGGCCGCGGAAGAGTTGAAGCAGTTTGAGCCTCGCAGCCGA 

ACTTTGAGGATCGCTGAGACGAGACGCCGTGCAAGAAAGTCGATCCAAAATCCGAAATTCGAAATCCGTGCTGTGATCTGTGCTGAATATAGTGA 

AGAAATCTTAAAGAATTTTCAAAAAGAAATCCCAGTGCTGTGCTGAATATATCCGCGAAAGTGTGCCACAAAAATGTCCAACCAAATGGAGTTTA 

TTATGCAACTCTACATGATGAACTTGATGAAGCAGCAGCAGCAAATGCAGCTCCAGCTTCCGCAGCAGCAACAGCAGCAGCAGCTGGCTGGATAC 

ACTTACACACAGAACGAGGATATCTCTAGCAGCACATCCGTGCAGCAGCAACAGCAGCAGCAGCAGGAGCAACTGCAGCAGCCACAGCCAGATCT 

CCGCAAGACAAGAACGCACAAGCGCATTAGCTCGCAAAACACCAACTGTAGTAGTAGCCGTAGTTGTAGTCCCAATAGTAATTTGATTGCTTTTC 

AACCACAACAATACCCAGGCGCTACAGCGGCAACTCCTTCCACTCCCAACAGCCACCCCAGCAACCTGGTCAACCAGATGCTGCTCCAAAGCCTG 

CCGCCGCTGACGCAGCTCATGTTGCAGCAGCAGCAGCAGCAACATTTGCTGACCACCTCCAACCTTCTACTAACACCCACCCACACCCCCAGTTC 

GCTGGGCAAGCAGGATCCACTGCAGCATCCGTTGTTGCTGGGCCAGTTTGCCGGCTCCGAACAGATGGCCACAAATAACTTCCTTCAATCATCCA 

CAGTGACCTCCACGCCCATCGAGAGAGAGAAGGCAGCCACACCAGCACCCTCCGCCGGAGCAACTGCCGGCAACCTTTCGGCGGCCCAGGTCAAG 

TTTGAGCAGGAGTCCGCCGACGACGATGAGGACGATGATAAGCCGCTGTCCAGCCTCACCAGCTGCAGCTCCTCGGGCCACACCAACGCCAGCTC 

CGAGAAGCTGCTGTTGTCGGGCGTCCATCCGCTGGAGTCAACCACCGACAGCCTAGACTCACCCAGCATGGTAAGTGATCCGGGGATCTAGAAGG 

ATCTTAACAGTTTCTTTTATCTCGTTTTATCACCATCAATTTTAAAGATACTGTCACTTGCAGTGGCAACTACGTAGTCAAAAAGCTATATATTG 

CGCATTTTTGTATTAGTACATTTGATTTAAATTCTAAATTCATCCCAACAGCTTGGTTCGCCATACTTCACAAACCTTTTCCAGTTGCAAAATAG 

ACCATTTTCCAATTTCCAATTCAGAAAGCACTATATTTGTTCCCTATTTTGCTTTGTTTTTTTTTTTTTTTGGCTATGTCCCACGCTCTGTTGCT 

GTAATTTGAGAAAAAACGATTCGTGATTTTTGAGATTTAAGTGTGAGAGTAGCCAGAAAGCTCGCGATTCCGGGGAAAAGAAACAGCTGTGTCGG 

CAACGCGTGGGTTCTCTTGGTGGCTCGATCGCTGCCCAGCTGGGCACCAGGCAGTCTGAGCTGGGGATCTCACAGATGCATCCGTGTGTTGGTAT 

TGACTCTAAAGTTCTTCCACCCTCTTTTTCCCACTGCAGTATACGCCCGTCAAGCAGCCGGCGGACTCATCGTACGGACTCATCACACCCGTCGA 

CAGTGATCTGACCCCCAACACACCCCTCCAACCGACCCAAACAATATCCCTGCTGACGCCGCCGTCGAGTGAGCAGAGCAAGAGCCTGGTGAGCC 

TCTCGGCGGCCAGCGGCCTGGATGCTCTGCTCCAGAACGAGGAGGTACTGAAGAACCTGCGGAAAGTGTCCTCCTACCTGGAGTGCGAGAACAGC 

CTGTGCAGGCAGGAGAACCTGCGGGAGCACTTCCATTGCCACGAGGAGCCCTGCCAGGGCAAGATCCTGAGCAAGAAGGACGACATCATCCGGCA 

CCTGAAGTGGCACAAGAAGCGCAAGGAAAGCCTCAAGCTGGGCTTTGCCCGCTTCTCCTCCTCGGACGACTGTGCACCAGCCTACGGAGAGGGTT 
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GCGCCTACAACTGGAAACAGACCCACTACCACTGCGTCTACGAGCACTGTCCCAAGGTGTATGTGAGCACCAGTGACGTCCAGATGCATGCCAAT 
TTCCACCGCAAGGACTCCGAGATCGTGAACGAGGGCTTCCGGCGCTTCCGGGCGCACGAGACCTGCCGCATCGAGGATTGCCCCTTTTTCGGCAA 
GAAGATCTCCCACTACCACTGCTGTCGCGAGGGATGCACCCACACCTTCAAGAACAAGGCCGATATGGGTAAGTCTACTTGGAAAGCTTATTAGC 
TGAACTTAGTTAATATATCATATATGTATTTCTCCTTGCTTAGACAAGCACAAAACTTACCATCTGAAGGACCACCAGCTGAAGATGGACGGCTT 
CAAGAAGATCCTGAAGACCGAGGTATGTCCCTTCGACGCCTGCAAGTTCTCCACCGTCTGCAACCACATCCACTGCGTCCGCGAGGGCTGCGACT 
ACATCCTGCACTCCAGCAGCCAGATGATCAGCCACAAGAGAAAGCACGATCGTCAGGACGGAGAGCAGGCGTACCAGCAGTTTAAGATCAAGCAG 
GACGTGGAGGAGTCCTCCCTGGATGCCATGCCCCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCCCACCAGCCTTAGTCAATCCCAGAGCAGCAG 
CTCTGTATGCGGCGGCAGCAACACCTCCACTCCACTCTCCTCTTTGTCCGCCGAGCACTTCCTGGCTCGGAAACGCGGCAGGCCACCCAAGAAAA 
TCGTAAGTTTGTCTGTATATTTTGGGTAATTTTCCGCGACATTAAATAGGCATATTTTCTCAATCCACCACAGCAACTGCCAGCCGACGCTCAGC 
AGTCGGAAGCCAAGCGCCTGAAAGTCGAAGACGAGTCCTCTAATCCCGCCATGCTCCTGCCCCAGTCCCAGCCAGCAGCCGCTGTGCATCCGCTG 
ACAAGTGGACTCTTTCCCGGTCTCCTGCCCGCTGCCGCCGCTCCTGGAGTGGATCCCACAGCCCCTAACTTCCAGCTCACCCACCTGATGGCCCT 
CTTCCAGTTGCAGAATCCCCTCTTCTACCAGAACCTCTATCCGGGAATGACCCAGAATTCCTCCATGCTCGGCAACCTGGCAGCACTTAGCGCCG 
CATCGGCAGCAGCGGCGGCGGCAGCGGCGGCAAATGGAGCCGGAGTCCAGCAGCCGAAGGCAGAGTTTAGCTTTAAGCCAGAGTTTAAGGAGTAG 
GAAGGATGCGTGTCCTTAGGGTATTTTGCCAGCTACTTAGTTTGACCGACGGACGTTGGACATTGTACATACGCGCTTTAAGCAGGTTTAGGTGG 
GTTTTTTTGTGGGTTCAGGCAGATCCAGGACAAGGCATCCTTCCCTCCACAAAAACCAACTCAGCCACACTTTGAGAACCTAGAAGTCCCCTATA 
CAAAGTGTAGCTGAATTGGCTTTTGCCAAACATTTTCCCCTTTAGCTTAGATTCTCTCAGTGCCACATTAATTTTTAAAATTTTCAGCCATTTTA 
AATAAAAAGGAAGAACTTTCTGAATTTAAGTTTGACTAAACTGTGATGTTTTCATTAGAGTAGCCTCTTTTAATTGATAGAGCGAGTGCAGGCTA 
TGCAGGATATGCAGGACTTAGGAGACTTGGATTGTCAATAACATAGCGTCTAAGCATTACATTGTATGTGATATACTAATAAATAATGGCTAATT 
GTTTTTGCTTTATAATTTAAGGGAATAATATTATAATTCTAATTATAAACTTTAGTAAAATCAGCATTCTTCTATAATAAATAAGTCAACATTAC 
CTGAATAAATTTTATTATGCCTCAGAGCTTATTATTTAAACACAAAATGCTTGAGTTATTGGGGGTTACTGGAAAATACAATATATAAAGTAACT 
AAC AT AT AT GTAT AC C C AACC AC AC AT AAAAC AC T T AAACAT T T G AT T AAAT AAT T T AT G ACT T TAT T T T GT T T T C AT GT AAT TGTTGGTTTTAC 
TCATATGCTAACAGAAATCAGCTACACAAACTCAGGACGCCTGATAAAAACCGTCCCGCTAAGCAATATTTAAGTACTTTACACAAAATCAATTA 
TTTTTTCCTTTGCTTGTATTTTTGTATTTTTTATAGTTCATAATCGCATATATGTATATGTAAACAATCAATACAATCCAAATACCATAATCATA 
AGCATAACAAAAGAGATTTCAGAAAGTTGTATCCTAGTCAAGAACAATGTACAAGCAAGTATCATCATTTTATACATTAGGTATTTGCTTTTAGT 
TTACATTGCAAAGTGAAACCCTGCGTGAAATGTACAAATAAAGGTAAACTAAAAATAACATTAAGAACTAATGGTCTAGAACTCAAGAAACACGA 
AAATCAACAAGTAAACTTCTAAGTAAATTCGTTAAATGTTATGTTAAAAGTAGGAGAAAACTTAAACAAACTACAGACTCAGTAATGCCTTTTGA 
TTTTGCCTAAGGTTATAAGCTTCGATTTTGATGCGGTAGCCAAATTTATAAAACCTTGAAAAAGCAGTATCCAATTTATTCCTATAAGCATCTAT 
ACTCGTCTAATTTTTTGGGTCATCAGTTTATAACGATTGCATAACTTCCGGTGCAATATCTATGGCAACATTTTGTGCGCACATCGACACATACA 
CACAATTTGAAAACGAAGTTTAAGTTTGTATCTTATAGGTTGGGTTAATCGTATCTAAGCCGCAACAGAAGAGGTTATGAGTAGCGTTGGGTTTG 
AAAGCTGCCCGTTCTGCCGGCGGGCCTTAATCGCGGTAAAATCCGCGCAGCAGTTTCCCAATAAATCAGAACCGTTAGCTGCGACCGGT 
(SEQ ID NO: 190) 

Exon: 1001.. 2540 
Exon: 3080. .3773 
Exon: 3844.. 4277 
Exon: 4349. .5359 
Start ATG: 1689 

Transcript No. : CT6850 

CCCACTTTCTGCACGTAACTTTGCCGTAATTTGCCGTAATATCTTTGCAAATTGCCTGCGATTTTGGATAAAATTCCTCTGCTGATCTGTGCTCT 
TCTTTCATTTGCTGTCCCTTTCTAACGCACGTTGGCCATGCGGGATTCCTTCACAAATACACGCTCCTCTTTCGCATTTTTTTTTACCGAAAAAC 
CTCTCTTTTCTAGATAAATTTACATTACCGCTGGCGTTTTATTGGTCGAGAAAAGCGAGCAAAAAAACGGAAACCAGCCGAAAACAAGCTGGGTT 
TTCCCAGAGCATTTTCCTGCGAAGGGTCGGTTAAAAAACTAGGGGATGAAGGTCGAACCACCACCACCAACGGACTTCTCATTTTATTCGTTTTT 
TAAGAGAGCGAGAGCGGCGAGATCGGTTTTTTAAGAGAACCGGTGGCGAGAGGGTTGCACTTGGGCATGGATTTGCGCAATTTTTGCTATATTAG 
CCGGAGGCCGCGGAAGAGTTGAAGCAGTTTGAGCCTCGCAGCCGAACTTTGAGGATCGCTGAGACGAGACGCCGTGCAAGAAAGTCGATCCAAAA 
TCCGAAATTCGAAATCCGTGCTGTGATCTGTGCTGAATATAGTGAAGAAATCTTAAAGAATTTTCAAAAAGAAATCCCAGTGCTGTGCTGAATAT 
ATCCGCGAAAGTGTGCCACAAAAATGTCCAACCAAATGGAGTTTATTATGCAACTCTACATGATGAACTTGATGAAGCAGCAGCAGCAAATGCAG 
CTCCAGCTTCCGCAGCAGCAACAGCAGCAGCAGCTGGCTGGATACACTTACACACAGAACGAGGATATCTCTAGCAGCACATCCGTGCAGCAGCA 
ACAGCAGCAGCAGCAGGAGCAACTGCAGCAGCCACAGCCAGATCTCCGCAAGACAAGAACGCACAAGCGCATTAGCTCGCAAAACACCAACTGTA 
GTAGTAGCCGTAGTTGTAGTCCCAATAGTAATTTGATTGCTTTTCAACCACAACAATACCCAGGCGCTACAGCGGCAACTCCTTCCACTCCCAAC 
AGCCACCCCAGCAACCTGGTCAACCAGATGCTGCTCCAAAGCCTGCCGCCGCTGACGCAGCTCATGTTGCAGCAGCAGCAGCAGCAACATTTGCT 
GACCACCTCCAACCTTCTACTAACACCCACCCACACCCCCAGTTCGCTGGGCAAGCAGGATCCACTGCAGCATCCGTTGTTGCTGGGCCAGTTTG 
CCGGCTCCGAACAGATGGCCACAAATAACTTCCTTCAATCATCCACAGTGACCTCCACGCCCATCGAGAGAGAGAAGGCAGCCACACCAGCACCC 
TCCGCCGGAGCAACTGCCGGCAACCTTTCGGCGGCCCAGGTCAAGTTTGAGCAGGAGTCCGCCGACGACGATGAGGACGATGATAAGCCGCTGTC 
CAGCCTCACCAGCTGCAGCTCCTCGGGCCACACCAACGCCAGCTCCGAGAAGCTGCTGTTGTCGGGCGTCCATCCGCTGGAGTCAACCACCGACA 
GCCTAGACTCACCCAGCATGTATACGCCCGTCAAGCAGCCGGCGGACTCATCGTACGGACTCATCACACCCGTCGACAGTGATCTGACCCCCAAC 
ACACCCCTCCAACCGACCCAAACAATATCCCTGCTGACGCCGCCGTCGAGTGAGCAGAGCAAGAGCCTGGTGAGCCTCTCGGCGGCCAGCGGCCT 
GGATGCTCTGCTCCAGAACGAGGAGGTACTGAAGAACCTGCGGAAAGTGTCCTCCTACCTGGAGTGCGAGAACAGCCTGTGCAGGCAGGAGAACC 
TGCGGGAGCACTTCCATTGCCACGAGGAGCCCTGCCAGGGCAAGATCCTGAGCAAGAAGGACGACATCATCCGGCACCTGAAGTGGCACAAGAAG 
CGCAAGGAAAGCCTCAAGCTGGGCTTTGCCCGCTTCTCCTCCTCGGACGACTGTGCACCAGCCTACGGAGAGGGTTGCGCCTACAACTGGAAACA 
GACCCACTACCACTGCGTCTACGAGCACTGTCCCAAGGTGTATGTGAGCACCAGTGACGTCCAGATGCATGCCAATTTCCACCGCAAGGACTCCG 
AGATCGTGAACGAGGGCTTCCGGCGCTTCCGGGCGCACGAGACCTGCCGCATCGAGGATTGCCCCTTTTTCGGCAAGAAGATCTCCCACTACCAC 
TGCTGTCGCGAGGGATGCACCCACACCTTCAAGAACAAGGCCGATATGGACAAGCACAAAACTTACCATCTGAAGGACCACCAGCTGAAGATGGA 
CGGCTTCAAGAAGATCCTGAAGACCGAGGTATGTCCCTTCGACGCCTGCAAGTTCTCCACCGTCTGCAACCACATCCACTGCGTCCGCGAGGGCT 
GCGACTACATCCTGCACTCCAGCAGCCAGATGATCAGCCACAAGAGAAAGCACGATCGTCAGGACGGAGAGCAGGCGTACCAGCAGTTTAAGATC 
AAGCAGGACGTGGAGGAGTCCTCCCTGGATGCCATGCCCCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCCCACCAGCCTTAGTCAATCCCAGAG 
CAGCAGCTCTGTATGCGGCGGCAGCAACACCTCCACTCCACTCTCCTCTTTGTCCGCCGAGCACTTCCTGGCTCGGAAACGCGGCAGGCCACCCA 
AGAAAATCCAACTGCCAGCCGACGCTCAGCAGTCGGAAGCCAAGCGCCTGAAAGTCGAAGACGAGTCCTCTAATCCCGCCATGCTCCTGCCCCAG 
TCCCAGCCAGCAGCCGCTGTGCATCCGCTGACAAGTGGACTCTTTCCCGGTCTCCTGCCCGCTGCCGCCGCTCCTGGAGTGGATCCCACAGCCCC 
TAACTTCCAGCTCACCCACCTGATGGCCCTCTTCCAGTTGCAGAATCCCCTCTTCTACCAGAACCTCTATCCGGGAATGACCCAGAATTCCTCCA 
TGCTCGGCAACCTGGCAGCACTTAGCGCCGCATCGGCAGCAGCGGCGGCGGCAGCGGCGGCAAATGGAGCCGGAGTCCAGCAGCCGAAGGCAGAG 
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TTTAGCTTTAAGCCAGAGTTTAAGGAGTAGGAAGGATGCGTGTCCTTAGGGTATTTTGCCAGCTACTTAGTTTGACCGACGGACGTTGGACATTG 
TACATACGCGCTTTAAGCAGGTTTAGGTGGGTTTTTTTGTGGGTTCAGGCAGATCCAGGACAAGGCATCCTTCCCTCCACAAAAACCAACTCAGC 
CACACTTTGAGAACCTAGAAGTCCCCTATACAAAGTGTAGCTGAATTGGCTTTTGCCAAACATTTTCCCCTTTAGCTTAGATTCTCTCAGTGCCA 
CATTAATTTTTAAAATTTTCAGCCATTTTAAATAAAAAGGAAGAACTTTCTGAATTTAAGTTTGACTAAACTGTGATGTTTTCATTAGAGTAGCC 
TCTTTTAATTGATAGAGCGAGTGCAGGCTATGCAGGATATGCAGGACTTAGGAGACTTGGATTGTCAATAACATAGCGTCTAAGCATTACATTGT 
ATGTGATATACTAATAAATAATGGCTAATTGTTTTTGCTTTATAATTTAAGGGAATAATATTATAATTCTAATTATAAACTTTAGTAAAATCAGC 
ATTCTTCTATAATAAATAAGTCAACATTACCTGAATAAATTTTATTATGCCTCAGAGCTTATTATTTAA 
(SEQ ID NO: 191) 

Start ATG: 639 

MSNQMEFIMQLYMMNLMKQQQQMQLQLPQQQQQQQLAGYTYTQNEDISSSTSVQQQQQQQQEQLQQPQPDLRKTRTHKRISSQNTNCSSSRSCSP 
NSNLIAFQPQQYPGATAATPSTPNSHPSNLVNQMLLQSLPPLTQLMLQQQQQQHLLTTSNLLLTPTHTPSSLGKQDPLQHPLLLGQFAGSEQMAT 
NNFLQSSTVTSTPIEREKAATPAPSAGATAGNLSAAQVKFEQESADDDEDDDKPLSSLTSCSSSGHTNASSEKLLLSGVHPLESTTDSLDSPSMY 
TPVKQPADSSYGLITPVDSDLTPNTPLQPTQTISLLTPPSSEQSKSLVSLSAASGLDALLQNEEVLKNLRKVSSYLECENSLCRQENLREHFHCH 
EEPCQGKILSKKDDIIRHLKWHKKRKESLKLGFARFSSSDDCAPAYGEGCAYNWKQTHYHCVYEHCPKVYVSTSDVQMHANFHRKDSEIVNEGFR 
RFRAHETCRIEDCPFFGKKISHYHCCREGCTHTFKNKADMDKHKTYHLKDHQLKMDGFKKILKTEVCPFDACKFSTVCNHIHCVREGCDYILHSS 
SQMISHKRKHDRQDGEQAYQQFKIKQDVEESSLDAMPQQQQQQQQQQPTSLSQSQSSSSVCGGSNTSTPLSSLSAEHFLARKRGRPPKKIQLPAD 
AQQSEAKRLKVEDESSNPAMLLPQSQPAAAVHPLTSGLFPGLLPAAAAPGVDPTAPNFQLTHLMALFQLQNPLFYQNLYPGMTQNSSMLGNLAAL 

S AA S AAAAAAAAAN GAG VQQP K AE FSFKPEFKE* 
(SEQ ID NO: 192) 



Celera Sequence No. : 142000013384570 

AAAGATGGCACGGTGTGCATGGAAAAGTTGCCGTTGAATGGGAAATTGCTGCGACACGTTTTTGCCTTGGTCTGCTGGGATGGGATCTGTGACCG 

GTCTGAGTCGGCGGTGGTCCGGTGTCTATGTCTCAGCCCCAGGTTTGTTTTCCGCATAAAAAGACACATTCATGCTGGCATTCTGCAGCCAACTT 

TAGCCTTTGGGTGCTAACCGGCTAACTCGGCTGGCTTATGCAAAACTCCACATCCTCAGCTATTCCGCGACGATGTACCTTGTGCAACACCCCAG 

ACACTATGGAGTCTGACCAGTTTATCCACCCGCCATGTGGTCATCTAGGTGCACCGTTCTCGAGGCGCGGTAAAAGTAAAAGTTAATGTGTGTGG 

CCGGCGCGTGAAGAATGGCAATTACAGATCGATGATCGCCGTGAACCAGCTGGCTCATTTAGCCTCCTTAATTATGCCACGATTTCGCTCATCCA 

ACTCATCTGCTTTCTGGCCATGGAGGTGCCCCATTGTGCGACTTACTTTGCTATTTTGTGTTTCTCTGGCCAAAATACGGAAAGTGGCAAGGCAG 

AAAACATAAATGATTTTACGGGGGTCTAAATGAGTTAATTAGGTTTGCAGTCGAGAGGTCTTAACCAACACGAGTTGGCCATGGCAAAGCCAATG 

AAGATATCAGAGAAGGGGTAATATGTAGGTGGATGGATACATTTGTACTACATGAACCATGCAACCCTAAAGGCTTTGGGTTTTGTTGAGTTATT 

TTGGAAATCTGTTTTTCTCCATCATTAGCTAAGATTAAGCGTTTAAAAGATAATATATCTATTAGGAATCATGATTTAATAAGAAAAGATAATAC 

ATATTTAAAAAAAAAAATTATTCCACAGACCATTGCTGAAAAATTTGAATAAAGCTTAATGTAATGCGTAGTGAATAACTCGCACCTTTATAACA 

TACAAATACGGTCTCTTGGTCATGAGACTCTGGCTGTACGTATCCCTGATTCAAAGCTCGTTATGGATATGCTTGGAAATGTAAACAGTGAGATC 

CTCAGGCAGCCCAAACATATCCAACTCCTTGGCTAATGCGAAGTACCGCCGCCTAAGAATGTGAGCCACTTCTTTGGGTTGTCGGTTCCTTGTAA 

AGACTCCCTTTTTGTTGCCGCCCACGCGGGTAATAGCTGCAGAGGATAATTTTAATTGGGTTTGACTCTTGTTCAAAAGCGTTTGAAATACAGAC 

TCTGCGCGGTCCGAAAATCGGCGAAGTTCCAAACAAACTCTCCAATAAACCATCCTCTTCCGCGTAACTCGTCGAAAGCCTTGAAATGGCGAGAG 

AAGAGCTCAACCTGGTATTCCTCCGACCAAATAAAGGCGGGAAGCTTTGGGGAAAGTCCGAATTAGAATCAATTATAGGCCTGAACAGGGTTCCT 

TACTGAGTGCATGCCCTCCATAGTGTCGCCGCCGTACTCAAATTGGATGACAGGCTTTCCGAACCTATCCCGCCAACTCTGGGCCTCTATCGTCA 

ACAGGTTAAGTATCATGTCAGTACGGCCCGAGTTCTGATACCAAGAGTTATAGCGATTGAAACCCACGATGTCCAGAAACTGCGCCAAATGGCAA 

CTGGAAGAGTTGGCATTTATAGCCGCGGTTAGAGGTCGTCCGTGAGCTATTTCTCTTACATAGTTTACCAGGAATCTAAAGGTAAGACATCCGCG 

TTAAATTTTTGATCTAAGATGTACTTTCTACTGACTCAAAGTATTTAAGGGCTCCCTGCTTGTTCGATCTCGGCTCGTTGGCTACCGACCATGCA 

ACAACACTTGGATGGTTTCTGTCCCGGTGGATCAGTTGCTCCAGCGAGGACATGTGATTCTCCAGTAGCTGCGGCTCGAAGATATCTATATTGAC 

AGCAGGGCATTCGTCAATAATCATAATACCATGCTGATCGGCAAACTGCATTGACTCTTCGGAATAAGGATAGTGAGAGGTGCGATATGCATTGG 

CTCCAGTCCACTTCAGCAGGTTAAAATCTCTAGCAAGAAGCGCATTATCCAATCCCTTCCCGCGGATCTAGAATAAAGATTTTGATAAGCTTAAT 

GACCAGATTTACCCAACAATCAAGCTCACATCGGAGTCCTCGTGCCGTCCAAATCCCCGCAGATAGAGGGGTTTTCCATTCAGCAGCAGGCTGTC 

GTTGTCCCAACTTAAGCTACGTATGCCCACTGGGAGGCGGTAGGTATCCTGCAGAGATTCCAACTCCTTCTCATTGCTGGCCACAAACAGCTCGA 

ATTGAAGATTATACAGGTAGCCGGGATCGGAATGCATTAGGTAGGGCCACCAGGGAGTTGCATTTGGCACCAGCAGAGTACCATGGTAGACCGCC 

TTGTTGATCTGCTGGGCGGCCACATGACCATCTTTGTCCCTTAGTTGCACCCGAAGTTGGATGGGCTGGATTTGGAGGCCCTCCTTGCTAGCATC 

CAACCACACGCGGTAGTCTATGCGGCCCAATCCTTCTTTAGTTAGTTGGGTGGTGACCTCCAAATCGCTTATGTGCAGCAGCGGGGTCGTGTACA 

AGTGTACACTCCGGTGGATTCCCGCGTAGTTAAAAAAATCGAAAGTGTAACTCTGGATGGGCACGGTGCCGTCATCCGTAGCCACCTTGTAAACA 

G AAC CC T GAG G AA T GG T GC GGTT GC T C AAAC GAT T GT C AC AC AT C ACC GT GAT CCTGTTATCGCCGCC A AAGCT G AGC AGT C C C GAG AT C T C T GA 
CTCAAAGGGTAGGTGGCCAATGGAGTGACTTGTGGCGTTCCTTCCGTTGATCCATACCACGGCCGAATAGTGAACGCTGCTAAATCGCAACCAAG 
TGCGTTGCATAGTCTTCCAGGAGCGCGGCACAAAAAAAGTACGCTCATACCATACGGTTCCAACGTGATCTCGTAAGTTATCTGTAGTGATGTCG 
TTATAGGAGGCAGGCACTGGCATCGAAATAATCTCCCTTCCGGTTTTTCTCAAGGCATCCATAAACCACTTTTCCCTAATTCCCTGTAAGGGGTC 
ATATGGATCACTCCTAACCAGTTGCCACATTCCATCCAAGCTCCGCACTTCACGGGTCTCCGAATCCCTGGGGTACAACAATCCCTTGGTTGGTG 
TGGGCTTTCTGACATTATCATATTCAAACAAAGGGGAATCAGTTAGTTCTTTGCGATTGGCGTTCACCAAGAAGTCGTTTTCGGTGTAAGCCAAT 
AGAACCTGGAGACTGCCAAGCAGTAAGCAGGTGACGGGCATCCAATGACTTTTCATCTGAAATCAGAATATTTCTATCGAGTTACAATTTAATGG 
TTATCAAACATTTGTGAACCATCTTCTTCTTCACCCGGCCTTTGATAAGAGCAGTGTGACCAGCGCAAATATAGTGAACAAAATCCATCTATTAT 
TTAATATGAAATTTTTTTACATAAACTTAAAATTGTAAACTTAAAAGAATTTAGGTTTAGGTTTGGCCTTCTACTCATGATAAAAAAAATTGTTT 
ACCGTGAAACATCATTGTTTGGAATCTGCGTTTGAAGGATCTACTCAAGAGGTTAAAATAATTTGCAAATATCTGTGTATATCTCCCGCTTAATG 
TTCTTTCAAACTTGGTCACAATCATTTTATTCAATTACATTTATTTGAAAATTGAAATAGTCCGTGACATGGGAATCCGTGAGGGCAATCGATGG 
TTTGGTAGAAAAAATCGGTGCTGTCATATAATATCGATAAGATAATTCACCATCTGGTAGCACGACTAAGTTCATACGTGTAATTATCGGTAGTT 
AAAAATAAAAATATTAATACGCCTTGATAGAGGCAACCCTCAGTTTCAGAGTTGAAAATCTCGATGGCAATGCAAACAACTTCGTTCGGCTAACA 
AAACCCGACTAAATTCCAACGTCAACGAGAGCATAGACGATATTTTCGGTTTCTTCCCAAGTGATACCGATTCCGAGAGCTACGGCCTTATATGT 
AGAGTGGGGGCAGGCAGAGATCCCGGCCAAAATTAACATATTCCTGGAGACTATCAGCCAGCTGGCGGCCCTGGCGACCTCGACCGGCATCATGC 
AGCTGCTCTTATCGTCCGTTTGCATGGCGCTGACCAGCGCGGTGATTGGATTTGCCTACTACCAAAAGCAGCAGTTCTATCCTGCGGTGGTCTAC 
ATCACCAAATCGAACGCCTCCATGGGGGTGAGTAAATCCTATCATACAATCCAGTCCCATGCCTAAGCCTTGTTTTCGTTTGCTTCCAGGTCATA 
TACATACAATTCTTTGTAATTGTCTTCATGTTCGGCAAGCTGCTAAGCAAGGTGACAATCAGCAAGAGTACCTTGCAC 
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{SEQ ID NO: 193) 



Exon: 3353.. 2120 
Exon: 2062.. 1746 
Exon: 1690. .1429 
Exon: 1374.. 1236 
Exon: 1176.. 1001 

Start ATG : 3286 (Reverse strand: CAT) 
Transcript No. : CT6960 

AAG AAG ATG GTT C AC AA AT GT TT GAT AAC CAT T AAAT T GT AAC T C G AT AGAAAT AT T C T GAT T T C AGAT G AAAAG T C AT T GG AT GC C C GT C AC C T 
GCTTACTGCTTGGCAGTCTCCAGGTTCTATTGGCTTACACCGAAAACGACTTCTTGGTGAACGCCAATCGCAAAGAACTAACTGATTCCCCTTTG 
TTTGAATATGATAATGTCAGAAAGCCCACACCAACCAAGGGATTGTTGTACCCCAGGGATTCGGAGACCCGTGAAGTGCGGAGCTTGGATGGAAT 

G T GG C AAC T G GT TAG GAGT G AT CCATATGACCCC T T AC AGGGAAT T AGGG AAAAGT G GT T T AT G GAT GC C T T G AG AAAAAC C GG AAGG GAG AT T A 
TTTCGATGCCAGTGCCTGCCTCCTATAACGACATCACTACAGATAACTTACGAGATCACGTTGGAACCGTATGGTATGAGCGTACTTTTTTTGTG 
CCGCGCTCCTGGAAGACTATGCAACGCACTTGGTTGCGATTTAGCAGCGTTCACTATTCGGCCGTGGTATGGATCAACGGAAGGAACGCCACAAG 
TCACTCCATTGGCCACCTACCCTTTGAGTCAGAGATCTCGGGACTGCTCAGCTTTGGCGGCGATAACAGGATCACGGTGATGTGTGACAATCGTT 
TGAGCAACCGCACCATTCCTCAGGGTTCTGTTTACAAGGTGGCTACGGATGACGGCACCGTGCCCATCCAGAGTTACACTTTCGATTTTTTTAAC 
TACGCGGGAATCCACCGGAGTGTACACTTGTACACGACCCCGCTGCTGCACATAAGCGATTTGGAGGTCACCACCCAACTAACTAAAGAAGGATT 
GGGCCGCATAGACTACCGCGTGTGGTTGGATGCTAGCAAGGAGGGCCTCCAAATCCAGCCCATCCAACTTCGGGTGCAACTAAGGGACAAAGATG 
GTCATGTGGCCGCCCAGCAGATCAACAAGGCGGTCTACCATGGTACTCTGCTGGTGCCAAATGCAACTCCCTGGTGGCCCTACCTAATGCATTCC 
GATCCCGGCTACCTGTATAATCTTCAATTCGAGCTGTTTGTGGCCAGCAATGAGAAGGAGTTGGAATCTCTGCAGGATACCTACCGCCTCCCAGT 
GGGCATACGTAGCTTAAGTTGGGACAACGACAGCCTGCTGCTGAATGGAAAACCCCTCTATCTGCGGGGATTTGGACGGCACGAGGACTCCGATA 
TCCGCGGGAAGGGATTGGATAATGCGCTTCTTGCTAGAGATTTTAACCTGCTGAAGTGGACTGGAGCCAATGCATATCGCACCTCTCACTATCCT 
TATTCCGAAGAGTCAATGCAGTTTGCCGATCAGCATGGTATTATGATTATTGACGAATGCCCTGCTGTCAATATAGATATCTTCGAGCCGCAGCT 
ACTGGAGAATCACATGTCCTCGCTGGAGCAACTGATCCACCGGGACAGAAACCATCCAAGTGTTGTTGCATGGTCGGTAGCCAACGAGCCGAGAT 
CGAACAAGCAGGGAGCCCTTAAATACTTTGAATTCCTGGTAAACTATGTAAGAGAAATAGCTCACGGACGACCTCTAACCGCGGCTATAAATGCC 
AACTCTTCCAGTTGCCATTTGGCGCAGTTTCTGGACATCGTGGGTTTCAATCGCTATAACTCTTGGTATCAGAACTCGGGCCGTACTGACATGAT 
ACTTAACCTGTTGACGATAGAGGCCCAGAGTTGGCGGGATAGGTTCGGAAAGCCTGTCATCCAATTTGAGTACGGCGGCGACACTATGGAGGGCA 
TGCACTCACTTCCCGCCTTTATTTGGTCGGAGGAATACCAGGTTGAGCTCTTCTCTCGCCATTTCAAGGCTTTCGACGAGTTACGCGGAAGAGGA 
TGGTTTATTGGAGAGTTTGTTTGGAACTTCGCCGATTTTCGGACCGCGCAGACTATTACCCGCGTGGGCGGCAACAAAAAGGGAGTCTTTACAAG 
GAACCGACAACCCAAAGAAGTGGCTCACATTCTTAGGCGGCGGTACTTCGCATTAGCCAAGGAGTTGGATATGTTTGGGCTGCCTGAGGATCTCA 
CTGTTTACATTTCCAAGCATATCCATAACGAGCTTTGA 
(SEQ ID NO: 194) 

Start ATG: 68 (Reverse strand: CAT) 
MKSHWMPVTCLLLGSLQVLLAYTENDFLVNANRKELTDSPLF 

ALRKTGREIISMPVPASYNDITTDNLRDHVGTVWYERTFFVPRSWKTMQRTWLRFSSVHYSAVVWINGRNATSHSIGHLPFESEISGLLSFGGDN 
RITVMCDNRLSNRTIPQGSVYKVATDDGTVPIQSYTFDFFNYAGIHRSVHLYTTPLLHISDLEVTTQLTKEGLGRIDYRVWLDASKEGLQIQPIQ 
LRVQLRDKDGHVAAQQINKAVYHGTLLVPNATPWWPYLMHSDPGYLYNLQFELFVASNEKELESLQDTYRLPVGIRSLSWDNDSLLLNGKPLYLR 
GFGRHEDSDIRGKGLDNALLARDFNLLKWTGANAYRTSHYPYSEESMQFADQHGIMIIDECPAVNIDIFEPQLLENHMSSLEQLIHRDRNHPSVV 
AWSVANEPRSNKQGALKYFEFLVNYVREIAHGRPLTAAINANSSSCHLAQFLDIVGFNRYNSWYQNSGRTDMILNLLTIEAQSWRDRFGKPVIQF 
EYGGDTMEGMHSLPAFIWSEEYQVELFSRHFKAFDELRGRGWFIGEFVWNFADFRTAQTITRVGGNKKGVFTRNRQPKEVAHILRRRYFALAKEL 

DMFGLPEDLTVYI SKHIHNEL* 
(SEQ ID NO: 195) 

Name: BETA- GLUCURONIDASE PRECURSOR 
Classification: enzyme 



Celera Sequence No. : 142000013384244 

TCCTGCCGGTCCAGTGTAGAATTTATGTTAAGGGCATCCGCCCCCTGAAAGAGGCTTATACCTGGCGTGTTCATATTCGGCTAAGGGGACGCCAC 
AAGCATCAAATTCAAATAAATTCCGAGGCGCAGGCGCTATTTTCGTAGGCAACTCTAATTTTCTTCCTCTTGCTTCTGTTGACACTTTGACATTA 
ACATAGTAACGCGTGCTGTTAAGTGCAGTAAGCGCTTCTGTAGCGAAAACATTGCGTTCACACTACATACAGTCTAGAAGGGAATTAGGTTTTTA 
TTCTATACTTGTACATAAACTGTAATTGTTAAATTTAAAAATAAATATTTTCCCAAAAAGTTGCCGATATGTCGAAAAGATTGTGGCCCCGACAA 
TAGACCTGCCAGGCATACACAAGCTTGAAGATGATATTGTATGTATGTATTATCATGCCAACTATGGTTCCGTCGTATCATTGATAGGGGGAGCT 
GTTGTGCATTGCAGAATGCAGTAGATTTGCTATTGGAATATACATACATATGTATGTCGTGTTAGTCCAGATGACAGGTATGTATTTTCAGACAC 
GGCCAGTAATTTTTTATAGGATCACTTTATAGAAAGAATTTCTTCTACCTTTCAAATACATTTCAACGAATCGAAAAAATATAACTTGCTTTTTA 
ATTTAATGGGAGTCCGCTCATTTCATAATATCGAACAGCTAAAAAATGTTACATTTCAAAGATATTCTAAAAATATGAATACCAACTAGACTCCA 
GCAGGATAAAAATAAATCGCTTTAAATTGTTAACCGGCAATTTGTACCAGAATTTGCACATACATACATATGCATATATGTATGTATATATATCC 
TAACACCGTTTTAGAAGAAAAACTCAAATGGATTGTTTTTAATTTTTATAATAAAACTTATTATTCAGTTAGTGTTGAGTGTCCTTGTTGCAATG 
CAAAAACAACAGCATACTTTCTTTTTCAATTTTTTTCATTTTTATTAATTTTATACAGACTGTCCATTCATAAACGCACTTATATTGGGACATTT 
ATCTAATTTCTCGGGATCTATTTGGAAAACCTTGGACAAACGTTTCCAACTTTGCGCATCAAAGTAGTAGAAGGTGCCGCGCTCTTTTGTACCAT 
TTTTTTCATATTGATCAATGCCAGGAATTCGAGTTATCCATATTTTCTCCTCCACGTGGTATCGCCACTCGCGACTATGCCTGCGATGGGAGCAG 
AATTTTAGTATATATGTATGTTCTATATATGTTCATACGAAGCTTAGATAATAGCTACGCAAGAGGAAGTACTCACAATTCAGCAGCTGCCATCA 
ATTGCATCATATCGCCGATGTTCGTGTAAAAAAGGAAAAAGAGCAGGTCTTCTTGCAGCTTTTTCAGTACCGGCGCTGTTAGTTTGTCTCTTATC 
GCAAAGTTAATCAGATATTCGGGTGGCACGTTGAACTCAACGTCTTGTGCTCTGAAAAAAGTAAATGCAAATCCAATGTTACTAATTTGGTACAC 
CTCTTACTCAATAGTACTCAATAAGCAATTAAATTATGTAAGTTTATTATGTAAACTGACCTGCAGGGCTGTGCTACGAAGGGGCCAGCGAAGGT 
TGTATGCAAGCTCTCCTGCGAGTTCAGGTTCAGGCCTAAACCCGTGAGATCCGTACCCAAGGAGAGCGTCACCAGATTGGGATCCGTTTCTGCAG 
CCCGAATGAACGTCAGCAGGCCAACCATGCCGAACTGGTTGTTCACCATGGTCGCTGGTATGTTTGTGACCTTCCCTATTTATGCAAAACGAACG 
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AAAGGTAAATTTGTTACATTTTTAAATAAATAAAATATTTCATGGTACTATTATTTCTGGATAAAATGGGTCAAGGATAGTATAGCTGTTGGGCT 
TTTGTGATGTACACCGAAATCACCGAATTATAATATGAAACAAATAATAATAGAAACCAAGAAATCCATTGATTAAATGGTATCAATCATCCATA 
GAGTGTTCAATTCAAAGCTCCAAATGATCAAAAGGTCAAAAATAGCTCTCTCACATGTTGTGAGAAGTGTTCCAGCACTCACCATCTGGCGAAGT 
TTGCACGCCGCTCTTTACAAGTTTATCGTTGCTGGAATTATCGTTCAGATGCTCGCCGCCCGCTCCGCTCCCAGACGAGCCGCTTCCACTCCCCA 
GGCCGCCGCCTACGCCCATCATGGCATTGCTGTTGGGTACGCTGTTGACGCCGCCAATGCCGCTGTTACTGCCCACCAGTCCTACATGGGACCCG 
CCGACAACGCCGCTGGCTGCATTGCCGCCGACTCCTCCGCCTCCTGAGCTACCAGCTCCTCCGCCGCCGCCTACTGCCAAGCCCCCTATACCACT 
GCCAACGGCACCGAGACCATTGCCGCCTACAACGCCGACGTTGGAACCGCTGCTGCCGCTCGCTGATCCGCTGACCACTATCGAATTCATCGGCT 
TTTCCGTGCCGTCCAAGTGGTTCTCTGTGGACGCACCTCCGGAGCCTCCTGTTCCGGCGACGCTCCCCACCGCGTTCGTTGTGCCGTCCGAGTTT 
TGGGTGCCCGGTAACGCAGGGAAGTCCTCGTTAGACATCGTGAACTCCGATTGCTCTGACGTCGGTTGCTTCACCATGCCAACTGAAATCAGAAG 
AGGAGCGAGATGGTCGCCCAGGGCGAATGAGTGAAGCGAAGAAATTAGTACTAAAGCAGTTTAAGCTCATACAATACACACGATTTTAGTTAATT 
TCTTTTTCCTCAACCAATTAAAATCAATGAGTAGGTCCAAACAGCATTTGTTCAAGGTTTTAAGACATGTTACTGTGAATTTAAAGTTATCCTTC 
ACCTATATATGAACTTATAAAAGGTTAAGTCGCTTGCTTTATGTGAACAGTTTCTTTTTATTCATTGATCATGTTTCAAAGTAAATGAGATTGCA 
TATAATAATATGTTTGGAGTATAATCCGGAAACACAATTTCATTTCAAACTCTAAATTGCAAAAGCTCTTCATGGAATCAGAAGCCCGTATTCGG 
TTGGACCAACGTAGCCAAAGTAAATGTCCTTGACTGAATCTAGTTTTTGCAGCGTTTGCCATACCCAAACCAACACCCGAGGTTAGTCCCAAATA 
CAGCGTAGCATATCTAAGTGGGCTTAATGAGCTGATCACGAGACACCACGCTTACAACCGATTCTTAATCGTAGTTTTAAAAACAAAAATAAATA 
AAAAGAAGTGGATTTTGATTTGAACTCACAAGAGGTAAAGAAATTACCATAGGGTTTGCTGCCCGGCGGCTGGAGCGGATTCGATTGGGGCAGTG 
TCTGGTCGTTCTGGCCGCGCGCATTCGTCAGCGAGGGGAACTCCGTGGGATCGAGCAGGGCAGGCGTGGAGGTGTCGCTACCACCGCCGCCGCCG 
AACACGTGGAAGTTATTCAAAGGACCGCCACCGCCTCCGCCCGTTCCGTAGCCGCCGCGACCCGTCTGCATGAAGTTGCCCATGTTCGACTGCGG 
AAAGAGATAATGATTAGCCAAGTCTTAACAATTTATTTTTAGAATTAGGACTGACATCTCCAACTCAATCTTAATAGAGGGTATATTTTTTATGT 

AAT GT G AAAGC AT T TG T AAC T T CT G AAAT T AAT GT AT AT T C CGT T GT AT C AAAAAT AT T T C C T T TAT AT AAAT CT G AT C GATC AG C GAT AC GAT T 
CTCTGATGCAATCCGAAGTATTGAAGTACTAAACCCACAGTCAATGGATATAGATATTGTTATATGTTTTACTTTGAGTATCTTGGACAAGACCG 
AGTTGGAGGAGTTAGACTTTGTTGTAATGCCGCGCTGCTTACCATGGGGCCACTGCCCAATCCTTGCATGGCTCGCCGCTCCACAAACTGACGCT 
GGCCGAAGAGGTTGGCGTTGCGGTTGCCGCTCGAGATGGCGGGTCCGCCGACCGACAGCTGGGCATTCCGGTTCGGCGACAGTTGGGGCTGCTGC 
TGCTGTTGTTGCTGCTGCTGCGGCGCCTGTCCGTAGTTCGCCGTTCCTGGATAGGAGTTGGCGAAGTCTAAGGCAACAAATGTGAATTGGTGCTC 
CGTCAAAAGCTTTGGGCGACAAGTGCTTACCGGTTTGAAACATGCCCGACGTGGGCGTCACATGGCCAGCCAGGGAGCTACCACCGAATCCGCCT 
GTTGTCCGGCCAGCTAACGCCGCGTTCGCTATACTTCTCGGGGGTTGTTGAAAATTTAAATTCGCCATAATCTGCAATGGAAGGGGTTGTCGGTT 
ATTAGGCTGTGGCCTTGCTTTGCTTATCTACCCATAAAGAAAAACCCATATCGTATCCGCGTCGACATCGAAGTATCACTTAAGCAGTAATCACT 
TAAATGTAATAGTATAGGGATTTCCTTTTGGCAAAAACAAACTAATATATATTTATTTTGGAGATTATTACGCGTTATATAACATTACGGTTAAT 
ACGAGCAATTGGGAATATGCTGGATGATTCGCAACATAAATTCTCGAAGCCATCTGCGCAGCAATCAGAGAACTGCTGTCAAATGTCTGAATCAG 

GG AAAC TC T T C GC C C AT AAACAAG GGT TT AC AAT AAAAACT GT GT G AT TC AG T GG AGG GGG ACT AT AT T GAT AAC AAAT T GGC T CC C CGC AG AT A 
ATTGTCGCACACTAAATGCGAGTATATGCGTATTGAGCAAACAAACCAACAACCTTGTGGGTTAGCAGCCTGGAGCCATATATATGAAAAGGTTA 
TCTTAACGGCAAAATGGGATCCCAAAAAAGGGCCACCATCTATAGTATATCTATAAACAAAATAGCCCGCGCCAAGTGAATCCGTTAAATGGCCG 
ACGCATCAGCTGTCGATTGGCGAAGCGAATTCGGACAGCAAACAGAGCAGAAGGAGAAAAGGAGAGCAAATTGAGAGGCAGAGCAAGGAGGTGGA 
GAACGAGAAGCATGGGCAGAAAGGCTGTGGCAATTATTGCCGTTCCTCTTTGTCGTGGGCACACTCATGCACAGACCGTGTGGATAAGATGGAGA 
CCAAGACAATGGCGATGGTGTGGGTGAGAAAACCAGAATGACTGCCACCAGGGTGCAAGGGAGTGGGTGGAGAGCGGGGCAATGCAAAAGGATAT 
AAGCCTCAACTTGTCCCTCGATCAATTATGCAGGCAGTGGTCAGTGCAGTGCAAAAGGTGTACAGTAAAAACAGTTAACATTGTAAAATACAAAG 
CTTTATACCTTATCTCCACTACTGGCCAAACCACTACCGAGTTAGTCGAATTTCGTTTACTTCACTACGAAAGCTCCAGTTAGGTTTTGATGCTG 
CTTGCTAGTACCATAATATGTAAATAACCCTTCCTATGGTAAAGACCGGCTTTATCTTATCGAAACTTCAACGTGCGCTGAACTCGCCTGGAATG 
C AAG AT AT AC T AGC AC AGG AT GC C AGT GCAT T T AAT T TG AAAGG AGC AAAT CT T CC T GG AAT AC GGG AGC T GT C G AT T AAAT T GAT G T T T T AC C T 
GCACGCATGTAACTTTATCAGCTGACCTAACAGACGCTTCCCAACACTGGCGACCGCGCTTACAACAAAATTCAATGGCTAGCGTTAACTCTGGG 
GACAAGTGTTTAACCAACCATTTAACCTCGGGCGATTCAGCAAGTACAAAATAAGAACGACGGATCCGAAAGGAGGAATGCGCGCAACTTACCTA 
AATTAACGTTAGACGATACTGTCGCTTCTTGTTCTTCCACTGTTAACGCTCGCTTTCCACTCTGTCCAGCAACAACAACATACACTGATCGAATC 
GTTCTCACCGCATTTAACATTCCAAAGCGGCCGGCGGGACGAGGACGTAGGAGTCGTTGTCCTGGCGCCGTAACCGTAACTGTGACGGATCACGG 
ATCGGCGAATCAGCAGGAATATGAAAGTCCGCTAACTCCCACTGGGCAGCTGTTGGTTAGCTGTGGGTGTCTCGGTCACTAGCACTCACTCGCTG 
GAAGCTGAAAGGATGCAACGGAGAAACGGGAGCTGGAACCGTAGCGGGTGTAGGAGCGGGAACGAGGTTAGTTAGTACTTTTCCGCGCCGAGTCG 
TCCCTAAATAAATGGCATATATACCGCTGTGCGTGTGTGCATATGTATATGTGCCTCGTCTGTATGCAGCGTGGTCTTCGTGCCGAAGAGACAGC 
GAGCGAGAGAGAGAGAGAGAGGAGAGGAGAGTCGAAGGATGTTGATGCCTGGCGGAGGCTAAAAGTTGTCCGTTTTTTCTGCGCCGCTCTTGCGA 
CCACTGTAGTTTCTATATTCACGCTTATGCAAATGTCGCGCACGTTACACTCGGCCGATTGGATTGGCCAGGGCTCTCGCTTTTTGTTAATGTTT 
CCGTTTTGTATTTTATTTGTAAGTAAATAATCGCTCGCTATTGCGCTGTTGTCTCTGCTTCTTACTACACTGCTGCACAAGCACTGTGTCTGTGA 
CCCTGTGCGTGTGTTTGTGTGTGTGTCTGTCTTTGTTTTCTCAGTTCATTGAAATTTCACCCGAAGCAGCGAATTGGGCAAATTCCCTGCTAGAT 
TTGCGCGTCTTTTACGCCGCCGACCCGCAAATGCACTCGAACTTAATGGAGCCGGGTCCTTACGATCCACTATCGAGGGGCAGTAAACAATTCTT 
TAGCGCCCGAAATGGCTTTAAATTGTACACAAAAAATGTTCCGCAAAAAACTTTGTGGCTGTCACGAATACAAACACACAAGAATTAGGGATGTA 
AAAATCTGCTATCGAGACAGCGTTGGTATCGATAGTAGTTAGTTTGCAACGAGAGGGTTTGGTGCATTGATTACTGTATATTTCAAAGAATATTC 
GAATTGTTTTCTTTAAAGACAACAATTTTATTCGCCTTTGCGATTGGTAAAATAGTTGTTTACTTTTAATCGATAGACAAAAACAGTTATTGGGC 
TGCCATCTGGGGCGTCTATGCGAAGACCTCTTTGCGTATTTCAGTTATCGATTACTGTGACAGTGCCTAGACCTCCTAAACTCAACTTTATTGCC 
CGGCCGCGAATTGTCTTGATTGCAGATTTTCCGATTTTAAACAAAATTTACTTGGCATTATGGGCAGCCAAAACTCGGACGACGACAGTGGTGAG 
TGGCTAATGGCTAAGTCCAGGCCTGAGTAACTAATATTTACACTGTTCACATATGTATGTAGGTTCATCCGGTTCGAGTCGAAGTGGAAGCCGCA 
GTGTCACGCCGCAAGGAGGTAGTGCGCCCGGGTAACTATTTGATATTCAAACACGGCGGCGGGAGAGTGCTGATTACCTTCCCCTCGTTTTCCAG 
ATCCCAACGCAGTCGCAGGTCGGGTAGTGGCAGCGATCGTTCCCGATCGGGCTCCAGGTCTTCCCGATCCCGGTCCGGCTCTGGGTCTCCGCGTA 
GTGCCAGGTCCGGGTCTGCAGAAAGCCGCCACTCGCAGCTGTCTGGCTCCGCTCGTAGTAAGAGATCAAGATCTGCTCACAGTCGCAGGTCGGGA 
TCTGCGCGTAGCCGTAAATCTGGAACTCCGGAATCTCCACAAAGTCACAGATCAGGTTCGTTGCAAAGTCGCAAGTCAGGATCGCCTCAGAGTCG 
TAGATCTGGTTCTCCCCAAAGTCGCAAATCGGGATCTACTCACAGCCGCAGATCCGGTTCGGCCCACAGTCGCAGATCGGGATCCGCTCGGAGTC 

GCAAATCTGGTTCGGCACA 
{SEQ ID NO: 196) 

Exon: 6714.. 5698 

Exon: 4248.. 4116 

Exon: 4057.. 3843 

Exon: 3509.. 3278 

Exon: 2647.. 2078 

Exon: 1785.. 1581 
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Exon: 1476.. 1312 
Exon: 1220.. 1001 

Start ATG: 4248 {Reverse strand: CAT) 
Transcript No. : CT7034 

TGACAGCCACAAAGTTTTTTGCGGAACATTTTTTGTGTACAATTTAAAGCCATTTCGGGCGCTAAAGAATTGTTTACTGCCCCTCGATAGTGGAT 
CGTAAGGACCCGGCTCCATTAAGTTCGAGTGCATTTGCGGGTCGGCGGCGTAAAAGACGCGCAAATCTAGCAGGGAATTTGCCCAATTCGCTGCT 
T C GG GT G AAAT T T C AATG AAC T G AG AAAAC AAAG AC AG AC AC AC AC AC AAA C AC AC G C AC AGG GT C AC AGAC AC AG T GC T T G T GC AGC AGT GT AG 
TAAGAAGCAGAGACAACAGCGCAATAGCGAGCGATTATTTACTTACAAATAAAATACAAAACGGAAACATTAACAAAAAGCGAGAGCCCTGGCCA 
ATCCAATCGGCCGAGTGTAACGTGCGCGACATTTGCATAAGCGTGAATATAGAAACTACAGTGGTCGCAAGAGCGGCGCAGAAAAAACGGACAAC 
TTTTAGCCTCCGCCAGGCATCAACATCCTTCGACTCTCCTCTCCTCTCTCTCTCTCTCTCGCTCGCTGTCTCTTCGGCACGAAGACCACGCTGCA 
TACAGACGAGGCACATATACATATGCACACACGCACAGCGGTATATATGCCATTTATTTAGGGACGACTCGGCGCGGAAAAGTACTAACTAACCT 
CGTTCCCGCTCCTACACCCGCTACGGTTCCAGCTCCCGTTTCTCCGTTGCATCCTTTCAGCTTCCAGCGAGTGAGTGCTAGTGACCGAGACACCC 
ACAGCTAACCAACAGCTGCCCAGTGGGAGTTAGCGGACTTTCATATTCCTGCTGATTCGCCGATCCGTGATCCGTCACAGTTACGGTTACGGCGC 
CAGGACAACGACTCCTACGTCCTCGTCCCGCCGGCCGCTTTGGAATGTTAAATGCGGTGAGAACGATTCGATCAGTGTATGTTGTTGTTGCTGGA 
CAGAGTGGAAAGCGAGCGTTAACAGTGGAAGAACAAGAAGCGACAGTATCGTCTAACGTTAATTTAGATGGCGAATTT AAAT TTTC AAC AACCCC 
CGAGAAGTATAGCGAACGCGGCGTTAGCTGGCCGGACAACAGGCGGATTCGGTGGTAGCTCCCTGGCTGGCCATGTGACGCCCACGTCGGGCATG 
TTTCAAACCGACTTCGCCAACTCCTATCCAGGAACGGCGAACTACGGACAGGCGCCGCAGCAGCAGCAACAACAGCAGCAGCAGCCCCAACTGTC 
GCCGAACCGGAATGCCCAGCTGTCGGTCGGCGGACCCGCCATCTCGAGCGGCAACCGCAACGCCAACCTCTTCGGCCAGCGTCAGTTTGTGGAGC 
GGCGAGCCATGCAAGGATTGGGCAGTGGCCCCATGTCGAACATGGGCAACTTCATGCAGACGGGTCGCGGCGGCTACGGAACGGGCGGAGGCGGT 
GGCGGTCCTTTGAATAACTTCCACGTGTTCGGCGGCGGCGGTGGTAGCGACACCTCCACGCCTGCCCTGCTCGATCCCACGGAGTTCCCCTCGCT 
GACGAATGCGCGCGGCCAGAACGACCAGACACTGCCCCAATCGAATCCGCTCCAGCCGCCGGGCAGCAAACCCTATGTTGGCATGGTGAAGCAAC 
CGACGTCAGAGCAATCGGAGTTCACGATGTCTAACGAGGACTTCCCTGCGTTACCGGGCACCCAAAACTCGGACGGCACAACGAACGCGGTGGGG 
AGCGTCGCCGGAACAGGAGGCTCCGGAGGTGCGTCCACAGAGAACCACTTGGACGGCACGGAAAAGCCGATGAATTCGATAGTGGTCAGCGGATC 
AGCGAGCGGCAGCAGCGGTTCCAACGTCGGCGTTGTAGGCGGCAATGGTCTCGGTGCCGTTGGCAGTGGTATAGGGGGCTTGGCAGTAGGCGGCG 
GCGGAGGAGCTGGTAGCTCAGGAGGCGGAGGAGTCGGCGGCAATGCAGCCAGCGGCGTTGTCGGCGGGTCCCATGTAGGACTGGTGGGCAGTAAC 
AGCGGCATTGGCGGCGTCAACAGCGTACCCAACAGCAATGCCATGATGGGCGTAGGCGGCGGCCTGGGGAGTGGAAGCGGCTCGTCTGGGAGCGG 
AGC GG GC G GC G AGC AT C T GAAC G AT AAT T C C AGC AAC G AT AAAC T T GT AAAG AGCGGC GT GC AAAC T T CGC C AG AT GG GAAGGT C AC AAAC AT AC 
CAGCGACCATGGTGAACAACCAGTTCGGCATGGTTGGCCTGCTGACGTTCATTCGGGCTGCAGAAACGGATCCCAATCTGGTGACGCTCTCCTTG 
GGTACGGATCTCACGGGTTTAGGCCTGAACCTGAACTCGCAGGAGAGCTTGCATACAACCTTCGCTGGCCCCTTCGTAGCACAGCCCTGCAGAGC 
AC AAG AC GT T G AGT T CAAC GT G C C ACC C GAAT AT CT G AT T AAC T T T GC G AT AAG AG AC AAAC T AAC AGC GC C GGT AC T GAAAAAGC T GC AAG AAG 
ACCTGCTCTTTTTCCTTTTTTACACGAACATCGGCGATATGATGCAATTGATGGCAGCTGCTGAATTGCATAGTCGCGAGTGGCGATACCACGTG 
GAGGAGAAAATATGGATAACTCGAATTCCTGGCATTGATCAATATGAAAAAAATGGTACAAAAGAGCGCGGCACCTTCTACTACTTTGATGCGCA 
AAGTTGGAAACGTTTGTCCAAGGTTTTCCAAATAGATCCCGAGAAATTAGATAAATGTCCCAATATAAGTGCGTTTATGAATGGACAGTCTGTAT 
AA 

(SEQ ID NO: 197) 

Start ATG: 1018 (Reverse strand: CAT) 

MANLNFQQPPRSIANAALAGRTTGGFGGSSLAGHVTPTSGMFQTDFANSYPGTANYGQAPQQQQQQQQQPQLSPNRNAQLSVGGPAISSGNRNAN 
LFGQRQFVERRAMQGLGSGPMSNMGNFMQTGRGGYGTGGGGGGPLNNFHVFGGGGGSDTSTPALLDPTEFPSLTNARGQNDQTLPQSNPLQPPGS 
KPYVGMVKQPTSEQSEFTMSNEDFPALPGTQNSDGTTNAVGSVAGTGGSGGASTENHLDGTEKPMNSIVVSGSASGSSGSNVGVVGGNGLGAVGS 
GIGGLAVGGGGGAGSSGGGGVGGNAASGVVGGSHVGLVGSNSGIGGVNSVPNSNAMMGVGGGLGSGSGSSGSGAGGEHLNDNSSNDKLVKSGVQT 
SPDGKVTNIPATMVNNQFGMVGLLTFIRAAETDPNLVTLSLGTDLTGLGLNLNSQESLHTTFAGPFVAQPCRAQDVEFNVPPEYLINFAIRDKLT 
APVLKKLQEDLLFFLFYTNIGDI^QLMAAAELHSREWRYHVEEKIWITRIPGIDQYEKNGTKERGTFYYFDAQSWKRLSKVFQIDPEKLDKCPNI 
SAFMNGQSV* 
{SEQ ID NO: 198) 

Name: Regena 

Classification : transcription_f actor 
Gene Symbol: Rga 
FlyBase ID: FBgn0017550 



Celera Sequence No. : 142000013384812 

ATCTTAAAGGTTGCTATATTTAAAGGCAAAATCATATTTCACATACAAGGCCAGTTGAAAAATTCAAATCACTTGAATGATTAATGGGTAAATGG 
GCAACCTTTTTCAAGTAGAGCCTAAGCCAAGTACTACCATTTAATAACAACTTGTAATTAATACGAAATGTTCTCTTTCTAGTTATTTTTGAAGT 
GTTTTCCGCGTGGGTTTACAGAATCGACATTTACGGCTCATAACCCACTGCACTGAAATGACGGAGTATTTGTGCTTCCCGATAGCCATTAGAAA 
GTGGAGCTGGTAGGCGGGGAGGGTAGTGTGGGGTGTTGCAAGTGGGTGGCTTTTGGGCGACTGGGTGGTGTGGGAGGTGTCAAGGGCGCGATAGC 
TCTTTGACAGCCGGGAAAGCGGATATGGAGTGAACGACAGGTTTTCCCTCAGCTCTTCCGCCTGGTAGTTCGTGTTTATGTAAAATTAATTTGAA 
GCTTTT AAT CAAAT ATT AGCGGCAAAAAATGCTGCGGAGCTT GGT AGGAAAAACTGAGCCACAAAGC AGT GAAATT AAAT ACT TAACCTACTTTC 
AGCAGTCACTTTATGGACAAAATAAGAACTTATTCTCACGCATTAGCATTGTTTTAAGAAAAAAAGAATTTGCCTAACTAAAACACCAATTTGCG 
TTTGTAGCAACTTATTGTTCCCTCAAAAACGTTAATATAAAACTAAAACCGCACTGTAATAATGATGATACATACTAAAAATGTGAAGGGAAGAG 
TTAGTGATTGAGAAACGATTTATTTACTTTTACCACTTGTATTATTTGCTTTTAAAAATATCTACCGCGTAAGTATCTTACATTGTAAAATAAAT 
GTAGGAAATGACTGATGTTATCGATTATTTATAAAATTCAAATGTTTAAGCTTTCGCGCGTAACATGGTATTTTTTAGGCAGTTATTCGCGTCCT 
CGGATCACAATCTGGTAACATCGAAAGTTCTAATTTTGTTTGTGTGCGTGCGACGTCGGTTTTTTCGTCGTCGCGACTTTTTTAAAATAAACTGG 
AGGCATTCGCAAATAATTTAAGCCTAATCGATCAGGAATTCGTACATAATGTCGGTTGTGATATCGAGCTCTTTGTCAAGAAGCGCTTGAATCGT 
GGGTAAGAATAAAGTCGCACCTACAAAATGAACGAATTAGAACTTTTTGCCACGAGAATAGCAAGTGCATATGTGTGGGTGTATGTTAGTGTGGG 
CATTTGTGTGTGTGTCTGTTGTGGTCTCTGAATGTGTGAGTGAAGGAGAAGGAAGTAGAGCTCCCCTTATTTCTCCAAAAACAACAAATACAGAG 
AGT AAGAAGGAAAGAGTAGGAGAGA GAAT AACTTGT AAC AAAGT ACC GTTACAACAATGGT AAT ACGGAAATGTGC AAC AGAG AAAC AAAAACAA 
CACCAACAACAACGAGAAGTAGTACGAGTTTTGTGGGCCTTTCGTGCAGATACCACCAGTTTTCCCCCACCCCTCCCCGTCTTTTTCTGCACAAT 
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TTCGAGTTTTCCCAGCTGCGCCACAGCAACAACAACCACAACAACGAGAGCAATTGTGACTTTTCCGCATTGCAGTTGCGCCAACTAACACCGAT 
TTTTTGATTTGGGGCTTCGTGCCTAAGCGCTCTTCTTGCATTCAGCCCGCGTTTAATAAACAATTAATCGTCAAGCAATAAACAACAGCAAGCCG 
CGCATAGAAATTTGAGCTTTTTCAAAATATTACACCATCCGATTACATAATATAGTTTACAAGTCGTGCTTTGCCGTTCCGTTTTGTTTCGCTTG 
TTCGCCTTCTTCTTGAGGTCGTTTCTTGTGCTTCCTCTTCCAAGTGAGAATTCCCAACAACCTGTTAGAAGATTCGGTCCCATTGGGCGGCGCTC 
TAGCTGGGTTTATGCGGGACTGTAATCAAGATCTATCAATGGAACTAAAAATTTAATTATAGTGTGTTAACAATCAGGTAGTAAACAAGTTAGCA 
TTAGAAGCACATTCTAAAAAAAAAAAATGGATTTACCTCTTCATTTTTAACACGGCTTTTATGAATATATGATTCATTTTTATCATACGAATTTA 
GTATCTAAGCTCCCAAATGTATTTCCGCCTGTTTTGCCTGTGCTTATCAGGAGTGTTTTCAATTCGCCTAATTATCGTTAATGGAGTTCCCGCAT 
AAACGACTCACTATACACGCACGCAGTCAGCGAATTGAATTGATCGATGGTTTGTTCACCTTCTCCACCGCTTTCTTCCGCAGCGTTCTGCTCTT 
TCCACCTGTGGGCAACTACCGCCGTTATCTCATCCACCAAACATGTGAGAAATACCGCCAGCAGTACGATCTGTTCACCTTCTCAGTGGGCCAAG 
GGCCGCAGCGGCGCACCGTCCTCTGCTTTCGAAACCAGCTTCTGGATCCCAGCGGCTTTGAGAAGTGAGCATCTGATATCTGACAGTAGATTATT 
GTTCATTTCAATTTGTCGTTAATGCACAATTAACAGCCGCAAGCAGCAGCAGCAGAGGGAGAAGGATAGAGAGCGAGAGGCAGCGAGGAGCTGGC 
GGAGCAACAACAATTCGCACTATCACACGAACGGCAGAAGCAGCAGCGCCAGCAGAAGAAGCGCCACGGAGCTCTGTAAGATGGGCGTTAAGAAG 
CATATCAGCGAGACTCCATGTGAGTAAATGGCCCAAAGCAAGCTGGCCGTACATATAAACAAAGGCTATCCCCTGTTTTTTAATCACATTTTGGA 
GCGATGCGTTTTTGGCTCCCCGTTTAGCAAGCAGTCATAAATTTAAGTGTTCGCAAATACGCAAAAAGAAGCCAAACAAAGAAATCGAAAATGTA 

AC AGGG AT AGC GAC G AAT AT GAG AAT AC C C AGT AAT AC T GAAAAT T AAT T C AC AT AT T T AT C T AT GT AC AT T T T T GC C AT T AAAT T GC AC AGT T G 
TGTTAGATACCCAGCAAATTGAAGCATTAATGAGCTTTCCAATCAGAAGTAATAACTAAATTCCGGCCAAGTGTATAGCAAACAGGTAGATTAAC 
TAAAAATAACCAGATGCGCCAGCTGATCGACGTCGGCAGCGCCGTTGCCGTTCATTCAAGGTCGCCCTCTTTCTCGAGCGGATTTTTTGGTGATG 
TTGCATCACGGCTCACCTCACTTCACTTCACTTGGCTTCACGCAGGTAGACGCTCAACTGGTCGACCGCTGCGCACGCTCACCAGCTCTGGACCG 
CCCCCGCAGCCCCCCCTTGCTGCCTGGGATCCACCTGTCAAGTTCGTGTTGACATTCACGAAAATCGGGCATAGAGCGGATCGTAAAAAATTGGG 
TCGACATGAGTGAATTTTGCTTTTAATTTATTTTATGCCGAATCCGTTGCTTCACCTTAGTCTCTATTTTATTCGTTTTGTTTTGAGCATTTGCC 
AACTATTGTTGCTAATCGTCTGCTTAGCTCTCCCATTTGGCGATCTATCTTCGTATCATGTGAAGTATCTCACATTCGGCATCTTGTGTCTCTTT 
CTCAATGTATCTCATGGATGGCGCCTTCTCTATCTGTATCTTGCAGTAAGCAAATCGAATTCGGTGGATTTATACAGGCCGCCAGCGTTGAGGAA 
CAATAGCAATAACAACAACAGCAGCAGCGAGGAAGCAGCTGCAGTAAACGAAGCTATAACAACACCATTAGCTGCAGGAACAGCAACCACTGCAT 
CTACAAGTCCTACCTCCAAAGAATCCCCAAACAGGTAGGTAAATCGATTTGCAACCCATCAAAGTTAGGAATAGATCATAAACTGCGTATATACC 
TCTAACGAAACTGTTGATATTCTGTCGTGAGCTAATCTCGTTACCTCGTTTCCTTCACCGTTTATTATTATTATCTCTGTTTTGCTCCGCTCTTT 
CGTAGAGAGGCGGACGCCTCAGCAGCAGCAGCGGCGGCGGCGACACGACGTGAACGCCGTCCCGATCGAGCGGTTTATGTGCCAAGGGCACGACG 
CAGTCAAACTACACCACCGACTACTACAACAACAACAACGGCGGCGGCAGCGACAGCGGTTGCAACAACATCACCACCAGCAGCAGCAGCGGTGG 
TCGCAGCAACAATTACAACAGCGCCGGATTCCCCGCCAGTTGCAGATACACAGCCGAGCAAAGCGGACGACGTCAACGGTCAGCCGCCGTCCAAG 

T C GAAG AAAAGC AAAT C G CATC GG GAG C GC AG AGAAC GCG C AAAG AAAAC G AC AAC AGC T AC AAC GGC C GAT AAC GCC AC T GT AC T GGT T AT AT C 
AGCATCTGAGAACGATACTCAATCCAACGAAGAACACCGACCGGCAAAGCCAGCGGACTGTGATAAGGAGGTGTGTGTACTAAAACAAGACAAGA 
ACGTATATAAGCGAATCTAAAGCCTATGAATTAGTTCTTATTAACAATAATTGTTCAGCGTAGCGCCATTCTCCATCAGCAGTTAGAATTTTTTA 
ATACACATTTTAATAGCTGACCTTGCCTTTCTAACATACAGATCATGAGCGGCGCCCAGCGTACAAAGCCAGGCAATGGCAACAAATCAGCCAGC 
AATGGCAAAAGCCAGGCATCGCCTCCGCTGCGCATCGAGGATGTGGTGGCGGCCAGGAGCACCAATGGCGAGGAGCCGGCCAAATGTGATAACGA 

T G T GC GC GAGC T GC AGC GAGC C T C AAAGGT AAGT T AT GC C GC AT GG AAT C AGC G AAAT GT T T T AACT AAC T T C C AAAC GAT T GC AGG AAATC AAT 
CGCAGCAATCGGCGCATCATGAAACAAACCTTCAACTCAGATGTGCTGGAGATTCCCGAGAAAATAGAGACATCGGCAACAAAGGCCATCCCCAC 
AT C C C C GC AAGC AT C AGC C AC AGCGAAAC C AC C T GT AGAC AGC AC C AC C G AAGAC GAT GAC G AGG AGG AC G AGG AC G AC TGGG AG AAG AT GT T C G 
ACGAGAGCGGCGACTGTCTGGATCCCAAGTTACTCCAAGAACTGACTGAATCCGTGGGCAAGTGCAAAATAGAGCTGCCCAAAATGGACTATACG 
GTAAGTAGATATAACTGGTTAACCAGCATCATCATCAAATTAAACACTCTGTTCCACAATTCGTACAGGTCTTTCACATTAAACAACAACTGCTC 
AACGAAGAGGAATTTCCGCACGTTCTGGAGGTGTCCAATTTCCCAGTGGAGTTCAAAACTCCCGACCTGCTCATGCTTTTCGCCCAGTACAGGGG 
CAGTGGTTTCGACATCAAGTGGGTAGACGATACCCATGCCCTGGCTGTGTTCAGCAGTTCCCGCATTGGTGAGTGCCTACAAATGCAACCGGAGA 
TTGGTTCGGTAATGAAATAGAACGTTTCCTCCGCAGCTGCTGAGGTGCTGACGATGGGGCATCCGTTTGTCAAGTTGAAGCCACTTGCGGAAGCC 
ACTCTGGAGTCACGACTGAAGGCAAAGAAGGCCGGCGCGGTGTCAATGCAGCCCTATCGCCAGCGTCCGGAAACCTGCACGGCCTTGGCCCGACG 
ACTGGTATCGGGTGCACTGGGAGTCAAGCTGCCCACGGCGCCGCAGGAACGGGAGAACGAGAGGCGGGTACTACGTGAAGCCAAAGGTGAGTGCG 
GGCCATGTAACTAAACTGGGTCAAGGATATACTTGATTTGGAATCATTGGCGCTGCCTTCCCTTGGTCTTTTATTAATCTGCCTTATCTTTTTAT 
TCAGAACGCAAACT GCT AGCTGCC AAAC AGCGGGACGAAGTCTGGGAGAGCTAGACCGCGGCCCGGAAGAAGGCCAGGTGC ATT TTTCTCACCCT 

T T T C T CAT C T G C AA T AAATG C G GC AG C AAAC AC AAC AC CAT G T GG AT C GAC AAC AAGAAAC AC C AC C AT T AAC GC C C AC AAA AAC AAT T C T C AAT 
CAAAGCTAACACAGTAAATGAGTAATTCGCATGCTAAGCAATGTACTTTTAATGTTCGCTTTTTGTGAAATTAACACAGACTTTTGCTCAATTTG 
TTGTAGTTTTTAGAGGATGCAATGCATATTATATCATCTAGGCCGTAAGAGAGAAATATCTATATTTATTGGTCGTAGCAATTAAACGTGCAGAG 
AGACGCGTGTACGTGTTGTTTTAACTCCAGCCGTAAACCTAATTTTTAAGCGCGAGCAATTTATATATTGATATACAGTTTGTCAAGAAACTGTT 
TACACACTGTGAAATAAGTTGAATTTTTGACTTTAAAGCTAAAAATAAAGGGTTTTTTGCTTAATTAAACGCAATTTTTTTATAAAATATAATTA 
AACAATATTTATTTTACTTATAAATCAAAAAACAAATTAAAAATATTAAATATACAAGAAAATAAACAACAAATT 

GACTGTCAAGAAACTCTTTACACAAAACACTAAATCGAAGTTTAATCCTAAATTTTTTAATTATTTTATTAAAATTTTAAAAGTTTTGGTATGTA 
GTATGTTCCCCATTGGCTTTAGCTACAAGTTGAAGCCTTTTTTTCATGGATTGGACAAGTTTTTGCAGGTCATATTCAAACGGGATCTTTTCACA 
CTCTTCGACTATTACAGTCATAAGCTGTTGTTGTGTTTTAGGTCCCTTTTTGCCACCTTCTTCTTTAAGTAGGTCCACAAATTTTCAATGGGGTT 
CAGATCAGGACTCTGAGGGGGCGTATCGATCACTTTACCACAGTTATTAAGGCGCCAGGTGCGTACATTGCACTCTTTATGTTTCTGATCATGTC 
CTGGTAAAACTTAAGATTTGGCTTGTTGTTGCTAACTAGCCCGAATTTTTCTGCACTGGCCTTCAAATTTGTTTTTTAACATTTCTAGATATTGC 
ACGGCAGTTATTGTGCTCTCAATTAAGGCTAGTTTTCCCACTCCACGGGCTGGTGATACAGCCCCAAACCATCAGGTGACATTTTTCCAAATTTT 
ATCGGTTGGAATGATATTTTGTTTTTCTAGCGCACTCAACGGTTTGCGCCATACTCTGCTTGGCCCGTCGTTATAAAAGAGCATCATTTTTGTTT 
CGTCACAAAATATGACAGTCATCCCAGTACTCTTCCGCATGATCCATCATGCTCACAGCGAATTAATGACCGCTTTTCCATATTGGCATCTGATA 
GCAAAGGGTTTTTCCTTGCAAATCTTGAAGAGTACCTATGGCGTAGGATGACTTGGCGCACAGTTTCGGTGTGACACAGTTAGGTGGCATTCTTG 
CCTGATATCCAGAGCAAGTGATCTGACCGAGATTGCGGGGGTTCGCAATCACTATTCGCATAATAAAGCGGTCTACACGCTTTTTAATTTTAATT 

TTCCGCCCACCACCG 
(SEQ ID NO: 199) 

Exon: 1001.. 1142 
Exon: 2269.. 2439 
Exon: 2507.. 2679 
Exon: 3562.-3739 
Exon: 3901.. 4345 
Exon: 4507.. 4683 
Exon: 4742.. 5035 
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Exon: 5104.. 5293 
Exon: 5357.. 5596 
Exon: 5705.. 6330 
Start ATG: 2646 

Transcript No. : CT7070 

CGACGTCGGTTTTTTCGTCGTCGCGACTTTTTTAAAATAAACTGGAGGCATTCGCAAATAATTTAAGCCTAATCGATCAGGAATTCGTACATAAT 
GTCGGTTGTGATATCGAGCTCTTTGTCAAGAAGCGCTTGAATCGTGGCGTTCTGCTCTTTCCACCTGTGGGCAACTACCGCCGTTATCTCATCCA 
CCAAACATGTGAGAAATACCGCCAGCAGTACGATCTGTTCACCTTCTCAGTGGGCCAAGGGCCGCAGCGGCGCACCGTCCTCTGCTTTCGAAACC 
AGCTTCTGGATCCCAGCGGCTTTGAGAACCGCAAGCAGCAGCAGCAGAGGGAGAAGGATAGAGAGCGAGAGGCAGCGAGGAGCTGGCGGAGCAAC 
AACAATTCGCACTATCACACGAACGGCAGAAGCAGCAGCGCCAGCAGAAGAAGCGCCACGGAGCTCTGTAAGATGGGCGTTAAGAAGCATATCAG 
CGAGACTCCATTAAGCAAATCGAATTCGGTGGATTTATACAGGCCGCCAGCGTTGAGGAACAATAGCAATAACAACAACAGCAGCAGCGAGGAAG 
CAGCTGCAGTAAACGAAGCTATAACAACACCATTAGCTGCAGGAACAGCAACCACTGCATCTACAAGTCCTACCTCCAAAGAATCCCCAAACAGA 
GAGGCGGACGCCTCAGCAGCAGCAGCGGCGGCGGCGACACGACGTGAACGCCGTCCCGATCGAGCGGTTTATGTGCCAAGGGCACGACGCAGTCA 
AACTACACCACCGACTACTACAACAACAACAACGGCGGCGGCAGCGACAGCGGTTGCAACAACATCACCACCAGCAGCAGCAGCGGTGGTCGCAG 
CAACAATTACAACAGCGCCGGATTCCCCGCCAGTTGCAGATACACAGCCGAGCAAAGCGGACGACGTCAACGGTCAGCCGCCGTCCAAGTCGAAG 
AAAAGCAAATCGCATCGGGAGCGCAGAGAACGCGCAAAGAAAACGACAACAGCTACAACGGCCGATAACGCCACTGTACTGGTTATATCAGCATC 
TGAGAACGATACTCAATCCAACGAAGAACACCGACCGGCAAAGCCAGCGGACTGTGATAAGGAGATCATGAGCGGCGCCCAGCGTACAAAGCCAG 
GCAATGGCAACAAATCAGCCAGCAATGGCAAAAGCCAGGCATCGCCTCCGCTGCGCATCGAGGATGTGGTGGCGGCCAGGAGCACCAATGGCGAG 
G AGC C GGCC AAAT GT GAT AAC GAT GT G C GC G AGC T GC AGC GAGC CT C AAAGG AAAT C AAT CG C AGC AAT C GG C GC AT C AT G AAAC AAACC T T C AA 
CTCAGATGTGCTGGAGATTCCCGAGAAAATAGAGACATCGGCAACAAAGGCCATCCCCACATCCCCGCAAGCATCAGCCACAGCGAAACCACCTG 
TAGACAGCACCACCGAAGACGATGACGAGGAGGACGAGGACGACTGGGAGAAGATGTTCGACGAGAGCGGCGACTGTCTGGATCCCAAGTTACTC 
CAAGAACTGACTGAATCCGTGGGCAAGTGCAAAATAGAGCTGCCCAAAATGGACTATACGGTCTTTCACATTAAACAACAACTGCTCAACGAAGA 
GGAATTTCCGCACGTTCTGGAGGTGTCCAATTTCCCAGTGGAGTTCAAAACTCCCGACCTGCTCATGCTTTTCGCCCAGTACAGGGGCAGTGGTT 
TCGACATCAAGTGGGTAGACGATACCCATGCCCTGGCTGTGTTCAGCAGTTCCCGCATTGCTGCTGAGGTGCTGACGATGGGGCATCCGTTTGTC 
AAGTTGAAGCCACTTGCGGAAGCCACTCTGGAGTCACGACTGAAGGCAAAGAAGGCCGGCGCGGTGTCAATGCAGCCCTATCGCCAGCGTCCGGA 
AACCTGCACGGCCTTGGCCCGACGACTGGTATCGGGTGCACTGGGAGTCAAGCTGCCCACGGCGCCGCAGGAACGGGAGAACGAGAGGCGGGTAC 
TACGTGAAGCCAAAGAACGCAAACTGCTAGCTGCCAAACAGCGGGACGAAGTCTGGGAGAGCTAGACCGCGGCCCGGAAGAAGGCCAGGTGCATT 
TTTCTCACCCTTTTCTCATCTGCAATAAATGCGGCAGCAAACACAACACCATGTGGATCGACAACAAGAAACACCACCATTAACGCCCACAAAAA 
CAATTCTCAATCAAAGCTAACACAGTAAATGAGTAATTCGCATGCTAAGCAATGTACTTTTAATGTTCGCTTTTTGTGAAATTAACACAGACTTT 
TGCTCAATTTGTTGTAGTTTTTAGAGGATGCAATGCATATTATATCATCTAGGCCGTAAGAGAGAAATATCTATATTTATTGGTCGTAGCAATTA 
AACGTGCAGAGAGACGCGTGTACGTGTTGTTTTAACTCCAGCCGTAAACCTAATTTTTAAGCGCGAGCAATTTATATATTGATATACAGTTTGTC 
AAGAAACTGTTTACACACTGTGAAATAAGTTGAATTTTTGACTTTAAAGCTAAAAATAAAGGGTTTTTTGCTTAATTAAACGCAATTTTTTTATA 
AAAT AT AATT AAAC AAT ATTTATTTTACTTATAAATCAAAAAACAAATTAAAAATATT AAAT ATACAAGAA 
(SEQ ID NO: 200) 

Start ATG: 453 

MGVKKHISETPLSKSNSVDLYRPPALRNNSNNNNSSSEEAAAVNEAITTPLAAGTATTASTSPTSKESPNREADASAAAAAAATRRERRPDRAVY 
VPRARRSQTTPPTTTTTTTAAAATAVATTSPPAAAAVVAATITTAPDSPPVADTQPSKADDVNGQPPSKSKKSKSHRERRERAKKTTTATTADNA 
TVLVISASENDTQSNEEHRPAKPADCDKEIMSGAQRTKPGNGNKSASNGKSQASPPLRIEDVVAARSTNGEEPAKCDNDVRELQRASKEINRSNR 
RIMKQTFNSDVLEIPEKIETSATKAIPTSPQASATAKPPVDSTTEDDDEEDEDDWEKMFDESGDCLDPKLLQELTESVGKCKIELPPCMDYTVFHI 
KQQLLNEEEFPHVLEVSNFPVEFKTPDLLMLFAQYRGSGFDIKWVDDTHALAVFSSSRIAAEVLTMGHPFVKLKPLAEATLESRLKAKKAGAVSM 
QPYRQRPETCTALARRLVSGALGVKLPTAPQERENERRVLREAKERKLLAAKQRDEVWES* 
(SEQ ID NO: 201} 

Classification: hypothetical 



Celera Sequence No. : 142000013384244 

GTAACCTATGATGTCGCGCCATATCGATTCATTGTTTTTTTTAGTGAACCACCCCTAATGAACGGAGCTTAAATCTAAAAAATAAACAATCATCT 
GTGAACTTAGTTGCGCTTGCTTGCGAAATGCAGGCGGAGATTAAGGCCGACATCATCGTGGAGGCGATGGAGGTGCTAGTGAATCACATTCTATA 
CGTGCGCGGTATATACCCCTCGCACATTTTTAAAATGAAGCGGATGTACAACTCGCCCATCTATGTGTCCATATTTCCACCGCTTAATAACTACC 
TGGCCGGCGTCCTAAAATCCGCACAAGAACTTCTTCGCCGCCGGGAGCTGCAGTGCCTGGAGCTAATAGTGTATCAGAAGGAGAATGAGAAGTTG 
GAGAGCTATAAAATGCAACTAGAAACCCAGAGGAGTGGCCTACCGGCAGAGGATCATTTGATGGAGTTCGAGCAGAACATGCGCTCTGTCATCTA 
CAAAATCTCTCAGCGCTTAAACCAGGCGCCTAAACTGCCCGCCGGAAGTTGTCAGTTTAAGGTGCATCTGCACACTACCCAGGAGGCATTCATCC 
GCTTCAGCCATGACTCCCAGTACCAGGAGTTTCCCTGGCTTCAGACCCAAAAAACTGAATCCCAAGCGACAGGGCGAACCGTTTACCTCCTTCCT 
TTGGCTAGGGTAGATGATTTGGGACTGAAAATGGATGTCCTCATCGTTAATTAATGTTATATTCCTAAAACTGTTATCCATTTTATCGAAAACTA 
CACGACGTTGGGTTTATTTTAAACTACAATTGATTCTAATCATGAATCGTTGGCTGATCCATGGTCTTTCTTTAATGAATAAAAGGACGCCCAAT 
GGTTTTATGGCTTAACATTAGATTCCTTAAATAAAACCAATTTTAGTGTCATTCCGTTTTGAATTATAAAGACTGTTGAGGTTTTAGAGGAAGCG 
TCATATAGAATTGACCGGTACTCAAGCGACCATAATCGTAGGCTTGTCCTCTAAAGTAGACCCTGGCGTTTTAGATCCTCGAACGTCTTTCGGTT 
GACAACGTTTCCCAGCGAGTCCTCGAACTCTTCCTCCTGATCGGCTACCCAGCGCTCGCTCTGCTTTTGCGACTTGAGTTTTTCCCACAACGTGA 
TTGCGTCCTCGATCTGGGTGACATTGGCGAAATGGGCGGTGTTCGGAATGCCGAGGCATCGCATGCCGTGCGCATGCCGCCACTCGGCGAAGTGT 
CGTTGAAAAGCCTTAGGACCCTTGTAGGTGAAGTTTCCACATATCTCGCAATTATAGCTAATATTCAGGCCATGCAGTTTATACAACCAGTAGGG 
GATAGGTTTACCGTCCCATCCGAGCGGCAAGTTCTTGGGATTGTACGGAACGTCGTCCGCGTCGGGATCATCTTCGTTGTCCGACTCGCTGGCCT 
CCACATCGCTATCGTCCCGCTCGCCACCAGTCCGAGCCTGTTTTCTCTGCACATTCTCCTTGGTAGCCGCTCTCTGCTCAGACAACAGATCCGCG 
TATTTGTAGAGCAGAGCCTCTAGCTGGGCAATCTCCTTGTGGCGTTCGTGCTCCCGTGACTGGGCACTAGCAGTCTTTGCGCTGGGCTTCTTGGC 
CATCAGTGCGGGATCCAAGGTGCTTTTACCTTTTGTAGAGAATAAACGCTGTGCTCGCTCCTCCAGGGTGCCGCCGCACTTAAGACCCAGAGCCA 
CCAAAGCCGACTTTAGACGATCCAGTCCTAGCGAGGCCAGCTCCTCCCAGCTGGAGAAGGCGGATAGATCAAGGTGGGCGCCGGTGTTGGCCAAC 
GCCGACTCAGTTTCCTTAATCGAAAAACCCGGAAATGTGCCCATCAGCCACTGGCGCTGGAAGTCAAGTTCCACCTTTAGCAGTTCGCCCTCCAG 
GTCGAGCAAGGGCTGGATTCGGAGGATGAAGTGGTGCAGGTAGTCGTTGAGGGTCTCTATGTATATGCGGTACTCGCGGTTCTTGCGCTCTCTCG 
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GAATATCGAAGACGTGGTCAAAGGACATAAGGTAAGTGATATAGTCCAGTTTCTCCACCGACCGCAAGTTGAGGTACAGCTCGTAGCACTCGTTG 
AGGTCCAGATAGCGACCGCCGCCCTCCTCGTCAGTGAACTCCACTAGGGCACTCATGTCGTCCGGATTGTTGTACACCCGAATCATCTCGTCGAA 
CTCGACGGACAGCGGCACGCTGACCTCCGCCGGATGGGACTTGTAGAACTGCTTGATTTGCTTGAGGCGGGCGTAGAACTCATTGAACTCGTTGG 
GCCCCGAAAGAGCGGCAATCTCCGCCTTTCGTTCATTGTCCTTGTCCTCGTAGAGATCGCGTAGCTGGGACGTAGAGTTGTGATGCAGTTCCATG 
AGGTATTTAAGCCGGTGCTCCGAATGGATGCGCTCCTTTTCGCCCGGCTTTTTGGTCGCATGCTCGTCCACCATCAGTTTGACCAGGCGCTCGCG 
CTCCTCGTGTAGGCGCCGCTGTTGCTCCAGGAGCGTCTCCATGGCTCTTGAATGCGAGGATTTAATGCGGGGAAATATCAAATGGAGGCCTTTCA 
CTTAGCGAAATCCCGAAGAAAACACAACTTTTCGTCAGCGTGTGTACATTAAAAAATACACTTACAATAAAGCAGGGATGGCTTATATTGTCACA 
TCATATTATTGTCAGGTTATTATACTCCGATCACACTGGCAATTAATGTGGATTTCGGGAATGCAGATGTATCAAATTAACGCTCGATTATTCGA 
GTAATATAGTAATGATATTTAAAGAGCCAAATTGATTTTTCATTATCATATATAGATTATATCGCATGATTTTTCTACCTAGTCGATTAAAGTGA 
AGCCTCTGTTACAAATGCGATAACAATTACATCGTCTATCGGTGGTCTGACAGCGACAACAAATACACATTTTTACCCTCGCAATCGCAGGGTCA 
CACTTTCGTGAAATCATATGATCGATTTGCAGTGAAAATTTTCAGACGTTGGGCAGAAGGCAAAAGTAACTTATCGTTTTCCACTTTCCTCGTGT 
TGGGCCGCCGTTTCCAACTCAGTTCGGCTGTGAATGTATTAGCTTAATTAAATTTCAATTATTTCCAGGCACGGTTGTTGTACGTGGGCTTCTTT 
AAAACACTTGAATTTCCTTTCGGTTTGTGCAGTGAAAAAAATCAGTCAAAATGGCACTGAGCGATGCTGATGTACAAAAGCAGGTAATTGAAAAC 
TTGGATTGGGAACGGGCAGGCGATCAAGGTCGTAGGGAAACAAGCAAAACGAGAGGCTTCGTTTGCCTTTTTGCCTTTGCAATTTGCCTTTGCAA 
TAAAGATGGCGAAGTCATGGGATCTCCCAGGTCATGTGAACTTTTCACCGCCAGTAGTCCAATTAGACTGACATCCTTCCAAATCGGCCCGGTCA 
TTTGGGAGTTGCCGGAGTTTTGACATATTTGTTGGCTAATGAAGACACATCAATTTATTTGTCCAGATAGTTTGCGTAAAAAGTGAGTAAAAATT 
CGTGCTGGTCATGTGACACGGCCCCCGCATTGGAGCAATGTGTTGGAGCGAGACGACTAGCCCTGCACCCCACACTCGTACTCTCTGTCACACGA 

CCAGCGGACCCCTTACGTTATCAAAAC 
(SEQ ID NO: 202) 

Exon: 2637. .1001 

Start ATG: 2512 {Reverse strand: CAT) 
Transcript No. : CT7078 

CTTTATTGTAAGTGTATTTTTTAATGTACACACGCTGACGAAAAGTTGTGTTTTCTTCGGGATTTCGCTAAGTGAAAGGCCTCCATTTGATATTT 
CCCCGCATTAAATCCTCGCATTCAAGAGCCATGGAGACGCTCCTGGAGCAACAGCGGCGCCTACACGAGGAGCGCGAGCGCCTGGTCAAACTGAT 
GGTGGACGAGCATGCGACCAAAAAGCCGGGCGAAAAGGAGCGCATCCATTCGGAGCACCGGCTTAAATACCTCATGGAACTGCATCACAACTCTA 
CGTCCCAGCTACGCGATCTCTACGAGGACAAGGACAATGAACGAAAGGCGGAGATTGCCGCTCTTTCGGGGCCCAACGAGTTCAATGAGTTCTAC 
GCCCGCCTCAAGCAAATCAAGCAGTTCTACAAGTCCCATCCGGCGGAGGTCAGCGTGCCGCTGTCCGTCGAGTTCGACGAGATGATTCGGGTGTA 
CAACAATCCGGACGACATGAGTGCCCTAGTGGAGTTCACTGACGAGGAGGGCGGCGGTCGCTATCTGGACCTCAACGAGTGCTACGAGCTGTACC 
TCAACTTGCGGTCGGTGGAGAAACTGGACTATATCACTTACCTTATGTCCTTTGACCACGTCTTCGATATTCCGAGAGAGCGCAAGAACCGCGAG 
TACCGCATATACATAGAGACCCTCAACGACTACCTGCACCACTTCATCCTCCGAATCCAGCCCTTGCTCGACCTGGAGGGCGAACTGCTAAAGGT 
GGAACTTGACTTCCAGCGCCAGTGGCTGATGGGCACATTTCCGGGTTTTTCGATTAAGGAAACTGAGTCGGCGTTGGCCAACACCGGCGCCCACC 
TTGATCTATCCGCCTTCTCCAGCTGGGAGGAGCTGGCCTCGCTAGGACTGGATCGTCTAAAGTCGGCTTTGGTGGCTCTGGGTCTTAAGTGCGGC 
GGCACCCTGGAGGAGCGAGCACAGCGTTTATTCTCTACAAAAGGTAAAAGCACCTTGGATCCCGCACTGATGGCCAAGAAGCCCAGCGCAAAGAC 
TGCTAGTGCCCAGTCACGGGAGCACGAACGCCACAAGGAGATTGCCCAGCTAGAGGCTCTGCTCTACAAATACGCGGATCTGTTGTCTGAGCAGA 
GAGCGGCTACCAAGGAGAATGTGCAGAGAAAACAGGCTCGGACTGGTGGCGAGCGGGACGATAGCGATGTGGAGGCCAGCGAGTCGGACAACGAA 
GATGATCCCGACGCGGACGACGTTCCGTACAATCCCAAGAACTTGCCGCTCGGATGGGACGGTAAACCTATCCCCTACTGGTTGTATAAACTGCA 
TGGCCTGAATATTAGCTATAATTGCGAGATATGTGGAAACTTCACCTACAAGGGTCCTAAGGCTTTTCAACGACACTTCGCCGAGTGGCGGCATG 
CGCACGGCATGCGATGCCTCGGCATTCCGAACACCGCCCATTTCGCCAATGTCACCCAGATCGAGGACGCAATCACGTTGTGGGAAAAACTCAAG 
TCGCAAAAGCAGAGCGAGCGCTGGGTAGCCGATCAGGAGGAAGAGTTCGAGGACTCGCTGGGAAACGTTGTCAACCGAAAGACGTTCGAGGATCT 

AAAACGCCAGGGTCTACTTTAG 
(SEQ ID NO: 203} 

Start ATG: 12 6 (Reverse strand: CAT) 

METLLEQQRRLHEERERLVKLMVDEHATKKPGEKERIHSEHRLKYLMELHHNSTSQLRDLYEDKDNERKAEIAALSGPNEFNEFYARLKQIKQFY 
KSHPAEVSVPLSVEFDEMIRVYNNPDDMSALVEFTDEEGGGRYLDLNECYELYLNLRSVEKLDYITYLMSFDHVFDIPRERKNREYRIYIETLND 
YLHHFILRIQPLLDLEGELLKVELDFQRQWLMGTFPGFSIKETESALANTGAHLDLSAFSSWEELASLGLDRLKSALVALGLKCGGTLEERAQRL 
FSTKGKSTLDPALMAKKPSAKTASAQSREHERHKEIAQLEALLYKYADLLSEQRAATKENVQRKQARTGGERDDSDVEASESDNEDDPDADDVPY 
NPKNLPLGWDGKPIPYWLYKLHGLNISYKCEICGNFTYKGPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVTQIEDAITLWEKLKSQKQSERWVA 

DQEEEFEDSLGNWNRKTFEDLKRQGLL* 
{ SEQ ID NO: 204) 

Name: SAP61/PRP9/noisette 
Classifxcation: RNA_binding 
Gene Symbol: noi 
FlyBase ID: FBgn0014366 



Celera Sequence No. : 142000013384244 

TGGACTAAACTCGCAGATTCCTGTAACTTGGAAGCACCACGAGTTTGCTTTAGAAGATGCCTTTCCGTTGAACTAAAACTAGACGAGTAAAGCGT 
ATCTCAAACATTGTATCTTCCGAACCAAAATAGTGCTGCATACTTTTCGGGATATGTCTTATTCAATGTAAGTAATACAAAAAGAAAAAAGGTAA 
CTTCTTAAAACTTGTAATATTTGTCACAAATGAATGACGACTAGATTTGGCCATGTCTGTAACAATAAACGTACAAAATATTCGAAGTTGCAGAA 
ATGGGGATCCAGCATTCGGATTTGGTTTTTTGTCAAACTTTTAAATATAGAATCGGACCACATCTTTTTAAGACCTTAAAAAACTGATTTGAAGT 
TAGACCCCTCATAATGCCAACTAAATTTGTTAAACAAAAATATGCCCCTATATGCCAATCAAATTTAAATACAATGCTTTCTCAGCGTTTCAGTT 
AAATACTTTGGGTTGTTTAATGGTTTTATACGTATACATGTGCTAAGTCCATGTATCTTATATTTCGTAAATCCATAGATTTTGTTCGACGAAAC 
TTTGCTAAATGATATTGATCTTTCAAATATAACTTGTTAATTTCCTTTCATCGCCGTGGCAAATCCTTCGAGAGTGAGTACTACTTGTTAAACGC 
ATATATGAGGAAAATTTATTGATGAATAATGCGAACATGCAAACGGAACACATCCCCGAATTCACACAAAGCATCTCTGTGGTATCTCTTTTGGT 
TTTTTATTTGTTGCATCAACTCCGCGGAACTCCAGCTACCTCACTGCAGGTCGTAGTAGTCGACGTTATTGGCGGTCAGATGTTCTCTATTAATT 
TGGAGCGTGCTGCTGTTTCCACTTCAAACTTTTATTAACATATATAGTATGGGTTTGCCGGTATATTGCAAACGGGAACTAAGTGTAGATTCGTG 
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AATGATATCAGACTGGACTGCAAACGTTTAGTTTCTAGCATGTTTTTTTTACTCTCTCCTTTATTTGTTGGAATTTTTGAGTTTTATATAGGTGG 
AGTTTTCAAAATGACATTAATTCATAGGGGTGAGTTTTATATATAGTTCGACTGGCTGTTGCAGGTGCTGGTTTCGAATTTTTGGATGGGGATGC 
TAAAGGGCGGCTCCGTTTGTGTTCATCCTCATTTAGCCATCAACCAAGTTCTTACTCGCCATTCTTTTGTTCGTCCTCAGATTCGGATTCTGAAA 
GAGAATAGAATACAAAATGTTAGTATGGGGGCTTCAATTGGTACAAACTGAAAAGATGTGCAGCTCCGGTTATGCTGTCCAAATGATTATTGGCC 
AAATTTAACTATGGAAAGTTATCTGGACACCCAACCATTCAAATGCTTTGGCACAACTCCCCGCTTGGCTCAATCGAATGAAGCAACCATTCGGA 
GCTTTTATGAGGGGATTGTGCATGCGGACTAGGACAAATGCAGCTGTCCAAACGATTATTGGCCAAATTAACCACGGCATTGGGTTATATGGGCA 
CCCAATCATTCTTTCGCTTTTGCATACTGTACATGAACCCTTTAAAACTGGTTCCTAATAATGGGAACACTTTAAAAGCTTGGTAAGAAGCATGC 
TCGGTGAAGTCAGCTATTTGTGTCCAAGTTTTCATTGGCCTTTGATGCTCGGATTTTGTGAGGCAACAAGGCAAGCAAAAACTCAAATGCTCTTG 
C AAT T T T GT AC AAT GAAAAT G T C ATC C AGG AC AAG AC CAT T C T C AAAAAGT GC AC T G T GT AAAGT T T C G AAGGAAGC AAAAT T T G AAT AT T T AT A 
TTGTCCAGGTTTTCATTGGCCTACGTTGTTCAGTGAATGGAAATCAACGAGGCAAGCAAAAAACCCAAATTTCTCTTGCAATTACGTACCAGAAT 
CAAGCTATTTAAGACAAGACCATTCTCAAAAATTGCACGCTGATAACATTTATCAAAACATACTAACCTTCTTCGGCTGACTGAAGCCATTCGAC 
AAACTTTCGCATTTGTTCAAGGAAATGCATTTTGCCCTTGTTTGAGTGGCCCTCCTTGTACCAGCGCAAGATGATCTCCTCGGACAGGACCTCGG 
TCTTGTAGAACAAC AAAAT GATTTTCTGGAAGGCCTTC AT GAAGTTCATGTTCTCGTAGCAGAATTCCTGCACCTTCAAGATTAGGGCCAATTCC 
GAACGGTCGGTCGAGGCGAAAGCCTGCAACAGTGGGCAGTAGTTCTTCAGATGACGAACAGCCTGATCGGTAACCAGCTCCTCCTTCTTGTTCCA 
T T C G C C C AAT GAC AT GAT GGT G GACC AAAT C T GGT AAAG AT CAT AC AT GAG AT T T AGT AAG GAAAT C AT C C AGT T GG AT TAT T T ACGC AT C GAC T 
TACAATAACAATTATTTCATGATCGGGAATGTTGGTGCGCTGCGAGAAGTCCTTAATGTCTGAGGTTATCTCATTATATGGCTTCTCATCGTTGA 
TATCATCGATGAGCGCCTGCTGCAACTCACGCTTCGCCTCCTGGCTAGCCTGAGCCTTATGTAGCTTGACGATCTCGTTAAGTTCTTTGTCAAGA 
AAAAC T T GT T T AAAAT AT T C CT C AGT AC GT T T G T T C GGT GGG AAAAAGT CC AT AAGT C T GGAAAAACGT G AAAAAGT GG AAC CAT TAT T AAT AAT 
TTAAGTTAACGCATTCAATAATCAACCTTAAATGACGTATTGAGGTGTACGAAATTAAATCAGCACCCAAATGAAGCGAAAGGTCATTTGCGAAT 
TGGGTAAAACTCCGTCGAACTTTCTTACTCTAGGAATATCAATTAAAGATGCACAACTGGATCATTGAGTCATCTTCCTGAGAACATTCCGGGGC 
TAAAAACAAATCACAACTAAAAACTTACTTGCTTTCCAGTCCACCCTTCTTAAGCGCCTGAATCAGATATGCAATGCCCTTCTCTTGCTTGAACG 
TCTGGAACAGCTCTAGTAGAAACTCCAATGCAATGCCATCCTT GAT TAGGTGCTCGTTGT TT AGTACCAGT AAAAC ATTTGGCGGTACAGAACCG 
TTGACTGTTGATCAAAAATAGGGTT AAAT ATAGGGTT AGT CAAATGTTTTCGCAAAACAAGTTTAGACGGCCGGCTAGCAATCCTGTCCTTGGGC 
TCGTGGCCCTTAGATGTGGTTAGTTCATCCAAGAGTCAGTATACCAAGTGAAGGATTACTTTAGCCCCTCAGCTGCAAGTTGAACAAGTGTTTAT 
CGAAACAACTAATTGAAAGACCGAATGGGCAGACGCAAGCAATGCACAGGAGGAACAGGAACATAAATGTTAAGACTTACCTAACCAGAGCGCTG 
TCATACGCGCAAGCTTGATGCGTTCACTAGGAGTAAAGCCCTTTACGAATAATAGAACCTTTCCCATCTCCTCTTCGAACATTTTCTCAAGGTAC 
TTGTACCGTCTGATGAGCTTGACAAATACCTGGAAAAGGTAAAGTGTGATTGCCCCTGTTAGCGGATGGTGAAATTTTACTGTGTTCTCACGCAG 
GAGGCTTTTGTATTTGTTGTTGC AAC ATTAGAAATCTTTAGATTGCTGGCGGTAGTAATTTT AC AAAAT AAAGACAAGT AAAAT GTGGCTTTTT A 
TTTTGCATATTTTGACACCGATTGAGTTGTTGAGATGCATAACCGGTTCTAATAATATGCACTTTGATTCAATAGCGTGACCTTCGCAGTGGTTG 
CATGCATATAGTATATATATATAGTGTTGAATAAAGAGTAGTTGTTGACACATAAGAAATCTTCAGAACGTTAGTGTTTAGGACAAACCAAAGAC 
AAGTAAATGTGTTGATTGTTATTGCATATTCTTGACACCGATTACATTTTTGAAACTTTATAGTCAGCTCTAATGATATGCAGCCACTTCACGAA 
TTTCATACTTTTAAGTGGATCCGAAGAAAATCATTTAGTAGTAAATAAACGTTTTAGCTTGGAAAACAACAAGAGCTCAACTTGTGTTGCAATCA 
AACCAATTTTCGTTTACTCATAATCATAATTTAAGTATGACCTGCTAAAAGATTACAAACCTGCTCATGGTTGCGCATGCTCTCCATGCTCTCGG 
GCGCATCGAAGATGCAGTAGCTTGTGCGCGGCTTTTCGCCATCCTGAGATATAGAACCACCAGGCACTGCAATTGAAACGAGGATGGAAAAGGGT 
CAGTGAAGAGACTAAACACATTACTTGCAGCGGAAAACTTACCTAGTAGACCTCCTGCAATCAATATGTCAAAGAGAACTTCACCGTAGCGACGA 
TAGTCGAGCTTGTTGCCCGCGCTGTCTAGATATTTAGAGATTTGTTCCAGGTCGCCTTCAGTTTTTTCGAGACCAGCAATGACCGCATCACGGAA 
TCCCGTTGGGTCATATTTTTCTCTTTCATCTCTTTTTCTGGTCTTGATGCGTTGACCCGATAGCACTGGTCTTTCAGTTTTTTGACTCATACAAT 
AAATTGCTGCAAAGGTAAATGTGGATCAGAGATGGAAGCGGTTTTGGTTTGCCCCTGTAAGTTTTTATTTCTGACCAGACTAAATTATCATGGCC 
ACTAGATTCATTTTTAGATTTATCTTAATTTTAGTGTAAGATGACTAAGGTGGTCTCCAAGCGGTGGAAAACCGTTCCCTGTATATGTATTCTCG 
TTGCGCTCATAATCAAACTACAACATTTTCACCTATGACCGGCGTTTAAGTGGAAACTCTAGTGAAAACTGAGCCTAAGAAAGCCTTAACCTAAA 
AATGCCAGCGTACATGTGCTTTAAACACACACATTCAAGAGCGGAGAACGGGCACACTTCACATCTCTTTTTAATTATCTCTCTTTTTCACACAC 
ACATACCCACTGTGCATTTTACAGTTTGATAATGTC GAC AAAA GAAAT AAAAAGAATTGAGAGGTGAGGCGACCAAGAAATCGAC AT AGAAACGC 
AGAATAAAGTGAAATTTTGACGTGGTTACTTGTGTTAGGTAGCGCGCTGGGGAAATGAGGTCATTGTGCATGTAGTGCACATAAAAACGCGCCGA 
TGACTGTGTGCTAGTTGGCCGGGAAAATTGTCATAGCAGCGAAAACGCAAAAATTAAATTATCTAGGCTATCAAGTACACATTGGTACATATAAG 
GAACGGCAACTTACGCTTTCACCCAAGGATATTAAGTCTGGCCCTGTGTTTGCCGCTTCCGCGTGTCCTTGCAATCAATTGAAATCAGTCGTGTG 
CGAGTGTAGTGTGTATGTGTGTGTGTGTGTGCCTGTTGCTTTTTCACAACTAACGAACGTGCGAGAATATGTAATTTAATTTAACCTCACGAAGC 
GCTGGCTGGTTAATGCTTCTCACTCGAGTCGACGAAAAACAAAGAGACAGATGGCAGATCCCTTCTTTTCTTACCTCTCTTTTCTTATGAGCGAG 
CTTTTGCACGGTAAAATTGACACGCACGCACACACACAACAACAAAGAAAAGGGGAGAGCAAATAGAAAATATAATTTTTTGTAACGGAATTTAC 
TTTTATGCCTTCAAAATAAACGTTAACAATTTTGCTCAGACTACGTCCGGAATTCTATTTTTAACAGCTATCACAATCGGTTACTAGGTATATGT 
GTTGTTTGCCTCGAATCCAGCGCCCCTCTGCTCGCACCACTTTGTTAAACGAAATTCGGAGAGCGAATGAGACGACACACGAGCGCACGAACGAA 
GCGGACGAGCCGAGAACGAAGACATGTGGTTCGGCGCCATTTTGAAGCGATACACTGACATGGCGAAAGCGGAATGATGGGCACAGGGAAAACGA 
AAATTAGGGGGAAACGGTGAGAAAAAATGTGGGATGGGGGTTCAAAGTGTGTATATTGCTGTCGCGCTTATTAAATAATGAGCAAGGTACAGTCG 
AACAAAGGCCGACGGCCACCAAATGATTTTTTGCATGAATATATGACTGCCCGATGACTAATCTGGCCTTAGGTACGCACGCAGTTCGTACATAA 
CCCCTTTTCCAACGAAAGCGCAGACAAAGGATTGGTCAAGGCCAGAGAAATGACCAGGACCGCGATTTTACTCGCCCTATTTGTCTTTCTGTGTA 
CTTAGGTCTACTCTAGCGCTATTTTTCACTCCTATACTTACGATTAAAACAAAAAAATCTGTTTTGCTGCGAATAAAAGTAACTTTTTGCGTAGC 
TGTTGGGCTGCTTCTTCTGGATGACTTGCGGTATGACCGTGGCCTGCTTCGTACGCAATGGTACAGAGCTCACTAAGGTGTCCAAAAGGTGGCAA 
CATTTTGGTGAAGCTGTTGGCTGGCCCCACTAACCGGACCTGAAAATGGTGCAGCGCTGCCGGACATTTAACGGAATCAAAGATTCCACAAAAGT 
CATGTTTGCATTAACAAAATTCATATTTAACCTTAACAAATTCAACAAATTAAGTTAAGTCCATTTTAGTAGCGTTGCGAGGAGTTTATTTTCGA 
AAATACGCCTTTGTATATATTATATATTTTTCCTCCACTTTAAACAAAATCGAAAGGTGTCCATTTGTCAGCTATTTAAACAATATGGCATCGCC 
GCCTTACTTGCTCAGTAAGAAAATTTAACTGTTTTTTTTTTGCGCACCTGAAAGCGGCGATAGACATTGTTTACTTTTAGTTTATATTTACATTA 
TCCAAAAGCCAAATGAACTTTGAGCAGGTCAATATACCAGAGAAACTAGTGCCGGACAACTATTTGCAACTGGGACTGGCTGCGCAACGCAGTAA 
GCAGCGCAGTTTTAAGGAGCTGCTCGAAAAGGTAATCAAACCGAAATTAAACCCAAATTAATTGATAACTCCATGGATAATTCTTCTTTTCAAGC 
GAAAGCTTCCGAAAAATGGTTGGTCCGATGAACGTATCGAGGAGCTGGTGCACATGCTGGCTTCCTTGGACAGCAATAACTATCCACATAAGGTC 
GGTTTAGGAGAGCGGGAGGCGCGTATAGCCTGCAGTAAGTAAGGGATCAGGAGGCGATATCCCCCGTAATGAATTTGTCATTCAGAACTCGTGGC 
TCGCCGTCACTACAACTTCGGTCATGGTATCGGCAGATCTGGAGACTTACTAGAGGCACAACCCAAGGCAGCTGGCTCTACGCTGCTCGCTCGTC 
TCACAAACGCCCTGATTCTGGACCTTATTCGGGGAATCGGACTGCCCAGCTGTGCTGGCTGCTTTTTGGTGCCAATGTGTACAGGCATGACATTG 
ACTTTGTG 
(SEQ ID NO: 205) 

Exon: 6228.. 6122 
Exon: 4471.. 4223 
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Exon: 4151.. 4051 
Exon: 3449.. 3311 
Exon: 3044.. 2879 
Exon: 2622.. 2379 
Exon: 2310. .1968 
Exon: 1229.. 1001 

Start ATG: 4460 (Reverse strand: CAT) 
Transcript No. : CT7116 

TACGAAGCAGGCCACGGTCATACCGCAAGTCATCCAGAAGAAGCAGCCCAACAGCTACGCAAAAAGTTACTTTTATTCGCAGCAAAACAGATTTT 
TTTGTTTTAATCCAATTTATTGTATGAGTCAAAAAACTGAAAGACCAGTGCTATCGGGTCAACGCATCAAGACCAGAAAAAGAGATGAAAGAGAA 
AAATATGACCCAACGGGATTCCGTGATGCGGTCATTGCTGGTCTCGAAAAAACTGAAGGCGACCTGGAACAAATCTCTAAATATCTAGACAGCGC 
GGGCAACAAGCTCGACTATCGTCGCTACGGTGAAGTTCTCTTTGACATATTGATTGCAGGAGGTCTACTAGTGCCTGGTGGTTCTATATCTCAGG 
ATGGCGAAAAGCCGCGCACAAGCTACTGCATCTTCGATGCGCCCGAGAGCATGGAGAGCATGCGCAACCATGAGCAGGTATTTGTCAAGCTCATC 
AGACGGTACAAGTACCTTGAGAAAATGTTCGAAGAGGAGATGGGAAAGGTTCTATTATTCGTAAAGGGCTTTACTCCTAGTGAACGCATCAAGCT 
TGCGCGTATGACAGCGCTCTGGTTAGTCAACGGTTCTGTACCGCCAAATGTTTTACTGGTACTAAACAACGAGCACCTAATCAAGGATGGCATTG 
CATTGGAGTTTCTACTAGAGCTGTTCCAGACGTTCAAGCAAGAGAAGGGCATTGCATATCTGATTCAGGCGCTTAAGAAGGGTGGACTGGAAAGC 

AAAC T T AT G GAC T T TT TC C C AC C GAAC AAAC GT ACT G AGG AAT AT T T T AAAC AAGT TT T T C T T GAC AAAGAAC T T AAC GAG AT C GT C AAGCT AC A 
TAAGGCTCAGGCTAGCCAGGAGGCGAAGCGTGAGTTGCAGCAGGCGCTCATCGATGATATCAACGATGAGAAGCCATATAATGAGATAACCTCAG 

AC ATT AAGG ACT T C T C G C AG C GC AC C AAC AT T C C C GAT CAT G AAAT AAT TGTTATTATTTGGTC C AC CAT C AT GT CAT T GGGC G AAT GG AAC AAG 
AAGGAGGAGCTGGTTACCGATCAGGCTGTTCGTCATCTGAAGAACTACTGCCCACTGTTGCAGGCTTTCGCCTCGACCGACCGTTCGGAATTGGC 
CCTAATCTTGAAGGTGCAGGAATTCTGCTACGAGAACATGAACTTCATGAAGGCCTTCCAGAAAATCATTTTGTTGTTCTACAAGACCGAGGTCC 
TGTCCGAGGAGATCATCTTGCGCTGGTACAAGGAGGGCCACTCAAACAAGGGCAAAATGCATTTCCTTGAACAAATGCGAAAGTTTGTCGAATGG 
CTTCAGTCAGCCGAAGAAGAATCCGAATCTGAGGACGAACAAAAGAATGGCGAGTAAGAACTTGGTTGATGGCTAAATGAGGATGAACACAAACG 
GAGCCGCCCTTTAGCATCCCCATCCAAAAATTCGAAACCAGCACCTGCAACAGCCAGTCGAACTATATATAAAACTCACCCCTATGAATTAATGT 
CATTTTGAAAACTCCACCTATATAAAACTCAAAAATTCCAACAAATAAAGGAGAGAGT 
(SEQ ID NO: 206) 

Start ATG: 119 (Reverse strand: CAT) 

MSQKTERPVLSGQRIKTRKRDEREKYDPTGFRDAVIAGLEKTEGDLEQISKYLDSAGNKLDYRRYGEVLFDILIAGGLLVPGGSISQDGEKPRTS 
YCIFDAPESMESMRNHEQVFVKLIRRYKYLEKMFEEEMGKVLLFVKGFTPSERIKLAP^TALWLVNGSVPPNVLLVLNNEHLIKDGIALEFLLEL 
FQTFKQEKGIAYLIQALKKGGLESKLMDFFPPNKRTEEYFKQVFLDKELNEIVKLHKAQASQEAKRELQQALIDDINDEKPYNEITSDIKDFSQR 
TNIPDHEIIVIIWSTIMSLGEWNKKEELVTDQAVRHLKNYCPLLQAFASTDRSELALILKVQEFCYENMNFMKAFQKIILLFYKTEVLSEEIILR 

WYKEGHSNKGKMHFLEQMRKFVEWLQSAEEESESEDEQKNGE* 
(SEQ ID NO: 207) 



Celera Sequence No. : 142000013384564 

GATGGCTGCAACTGCTCCGGTAATTATGCGCGTCATGGCGTCCTCGATCAGCACCGCCAAGCGGGCTGGCGGCATCATACGCGACGTGCTCAAGA 
AGGGTGACCTGGGCATAGTGGACAAGGGCAAGAACGATCCGCAGACGGAGGCAGATCGCTCTGCCCAGCGTTGCATTATCGCCTCCTTGGCCAAA 
AAGTTCCCCACCGTCAAGAT CAT TGGCGAGGAGGGTGGATCCGATCTGAATGTCTGCGACGACTGGCTGGT GAAC GAGTTGGATGAGGAATTCCT 
GCAGCACAGTTGCCCCGCTGAGTGGAAGGATGTCAAGCCCGAGGACTTTGTCATATGGGTAGATCCCCTAGACGGCACAGCGGAGTATACACAGG 
GTGCGATCTGGAGATACCTGTAGAGCTCCTTATATAAAACTAAATGTTCTCCTTTCATCCGCAGGACACGTTGAACACGTGACCGTCCTGATCGG 
CATAGCCGTGAAGGATGCTGCTGTGGGCGGCATCATTCATCAGCCATTCTACCAACAGCCCGATGGCGAAATGGGTCGCACCATTTGGGGCCTGA 
AGGGTCTGGGCACGGGAGGATTCACGGCAGTCCCCGCGCCAGCCGGCCAGTTCATCATAACCACCACTCGCTCGCATTCCAATGCCCTGCATCAG 
CAAGCCCTCAACGCATTCGCATCCACAGAAGTTCTGAAAGTCGGAGGCGCTGGGTTCAAAGTGCTCCAGCTGCTGGAGGGAAAGGCCCACGCGTA 
CGTCTTCGCCACGCCGGGCTGCAAGAAGTGGGACACCTGCGCACCCGAGGCCGTTCTAGAGGCCCAGGGCGGCTGCCTCACCAACATCAACGGCG 
AGCACTACGCCTACAATGCGGATGTGGAGCACGTGAATCGACAGGGAGTGCTGGCCAGCCTGGGACAGGACCATGCCGCGCTGGTGGAAAAGATT 
CCCGCCGAGGTGCGGGCAGCAGTGGGAGCCAAGTGAGATAGCTTTGAATTCCAGCACTTTATTAGTTTATTTGCACTAAAAGTAAACACACATAA 
ATTAAGCAATAACAGGTTAGAAATTTGAAAATTGTTAAAACTGGGCTCAGAATGCATTCTTTATTTATCGAAACTGCCGCTGTTGTTGCTCTGTG 
TGCTGCTGTTGGCCACCGAGTTGTTCAGGTAGCGATTCCGCTTGGCCTTGGTCATTTTGCAGGAGTTTACGGAGGCCGCGAAGCGCAGCGACTTG 
ACGGACTCTTGGAAACAGTCTTGGAACGGCGAGACGTTGATGAACATAAGCGTTTTCGAGTTGCCGCCCAGCGAGGGCATCAGCAGGTGCGTCAG 
CTTGGAGTTCCTGTACGGGATGTGGTCCTGCTTCTGCAGCAGCGCCAGGATTACGTTGGTGAGCTCCGATAGCGAGCGATTGATGTTCTTTGTCT 
CGGTCATCCGGGTGCTCGTCTTGGGAGACTCAGAGCCGGCCAAATCCACCAGGTTTATGGAACCCACGGAGATCTCTTGCTTTTCGGCATGGCGT 
CCGATGAGCTCAAGCTTGGTAACCGCGTGGGAACGAGAGGAGCGCTCGTTGCCAGCTGTCGAGGCGGTGGCACGGTTCATCTTGGCCGTGTGCAT 
GAGGTGGCGCAGGTGATTTGGATCCAGAACCGTCTCCTCCGTTATGTTGGACACGTAGATGTCGTTCTTGTTGTTCTTGGCCATTCGAATCTCCA 
TGTCCTTCTGCTCGTTGCTCAGCAGATCGTAGAGCACCTCGTTGTAGATCTCCAGAAAGGTGGCCTTGATCTCGTACTCCCAGCCCAAGTTGCGA 
TATCCCCGGATGGAGTCGAAGAGCAGATCCACCGTGCGCGGTATGACGCCCACACTCTCCGGCACTCCGTCCATTGTGTAGGTCTTGCCACTGCC 
CGTCTGTCCGTAGGCAAAGATGCAGATATTGTAGCCATCCAGGGCCGACTGGATGAGCGGCGAGACCATCTCGAAGATGTCCGACTGCGAGGAGA 
GCGGGTGGAAGACCTGGTCGAATGAGAAGATCTGCTGGCCCATCTTGCTTTTGGCCTGTGCGTCAATGCTCTGCAGCTCCACGGTGGACTCGTCG 
TGATAGGTCCAGGTGCAACACATACGGTTCTCCTCGGACTCCAGCGGCGGTCGTATTCGACAGAAGACCCGGATGTTGCCGCGCAGGTCCATGAC 
CGTGTTGTGCAGCTCTTTGCGCTCCATGTTCGACTGGAAGAGCTGCTCTTTGCAGGTCTCCAGCTCGGCGGCCTGCTGCTCATTGCAGCGCAGGA 
GTTCCTCGGTGCGCTGGCGCAGATGCACCACTTCTGTGCTTAGAGCAGCATGCTCCGTTTTTACTTTCTGCAACACACACATTGTTGTTACATTT 
AACCTGGGTTTCTGTTTACTTGAGTACTCACCTCATGTATTGCTTGCAGCTCGGATAGCTCCTCTTTGGTCCTGCCCAGCGTCGATGTGATTGTT 
TCAATTTTGGCGGTATGCTGCTTCACCTGCCTCTGAAGACACTCGTTGTCGCTCTGGGTGTTCTTCAGCGAGGACTCCGTCTCGATAAGCTTGTT 
CTGCGTCTCCTCCAGTTGCTGAGGCATGGACTCCAGCTCGCCCATGTCCTCTGTTTGCTTTTCGTACTTTGTCTTAAGCACCTTGTGCTTCTCTA 
GCAGATCGTGGAAGCGGGCCTTGAAGTCGTAGGGAGCGATGCGCTTGGGAGCAGCAGCCGCGGCTCCTGACGAAGCTGCTGCTCCTGTTCCTGCT 
GGTTTCTTGGCGGCTGCTCCTCCTGCAGCCCGTGGAGCAGGACGCGTTACTGGTGGTCTTTTGACAGCTGTTGCTGTTATTGACGAAGGCGCCGC 
AGGACGCACTAGTCGCTGGCTGGAAACAGTGAGTGCACCGCCCAGGCGGGATACTTTGCTGGGAATGCTGGGCAATGAAGGAGCAGCCGCAGTGC 
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GCTTGTTACCACGCAGTTCGTTGATGTCGCAAGCGCTACGGCTACGTCTCAGCTTGGCCCGGTGGCCCAACTTCATGGGCTCTGGGGAGGCGGCA 

CGTGCTATCGGAAATTAGCAAGGACTTAAGGTGAATTCTTTGGAAGGATGCTTCTTTACTCACCTCCTCCTCCGCGACGCTCCAGCACCTGGGGC 

AGATTGTTGAGGTCCCTTGAGAGGGGCGGCAATAAGTTGCCGCAGTATGTCTGGTTGGCATTGACATTGAAGGCGCCTGCTCCAGCGGCTCCACC 

TCCCAATCCTGCGCGAATTCGATCTGTGGGCAGCACGGTTTTAATCGGCATTTGGGGTTTCTTCAGGCCCGACGGTTTCGGTAGCCGGGATTCCA 

TCGCCAACTGTGTTGTGCCTAACTTAGAAGATACGTCTGCCTCCTTGCAACCGATTTTATCAATTAGTTTTGCGTTGTTGACAAGCGACGCCGTC 

GATTCCCGCCGTTATTTTTGATTGCAGGGATGCTCTGGGGCAGTGTGACCATTTGTCGGGAGAGTACGGCGTTGCCAGCCAAGACGACTTCCAGT 

TTGTCAATCAGATGTTTTCAGTGCTGCAGCTTTGGCCCTCGGGCACAAATTTTCGCCTTGACTCGTGCATCCCGCAGGTTTTCGGCGCGTGAAAC 

TCACCCAAGCGGGTGTGAAATACGGTGGCTGTTGCATTCGAACCGTTCATTTTGTGGAAAAGTCGCGGCGGACTGTTCAACAGCCTGAAGAGTGA 

AGTCTTGTGCGCAGCACTAGTGTCGTTGCCGGACGGTTGCGGCAATCCGGGGCGAACTTCTTGAAAGCGTTTTGCTTTCTTTTTGCGAGCTTGTG 

CGGTGTGAAGGAAGGCCGCGCTATTCTGAAATTTATTTGTTGCCTCGAAAAAGCTGTGGAAATTGCGTTCCAGTTGGAAAGCAGTTGCGTGATCG 

GGTCGTTGCCAGACCGGCCGCAGGTCTCACGCGACCATCTCTAGTTTGTTTTCATTCGGATTGGAAAAGGCTCGTTCCCATCGAACTGATTTTGT 

GAATTTTAAAATTGGCTTCCCCCACGTCCGCCAATCCCCCCGAGATGTCGGGGCAGGCGGAGCTCAAGGAGCTGGGCATGTTCGAGCTGCGCCCG 

CTGATCACTAACATCAGCTGCAAGCTGTCCGCGGTGACAGCGGGGCTGGACGAAAAGCGACAGTTCCTATGCCTCGTTTCTGAGGAAAACGAGAT 

CCTGCTGCGCTACATAAGCGCCGCCTCCGGCCAGGAGGTGGTCAAGATGATATCCTGGTTCCGCAACCGCGTCATCCACGACGTCTGCTTCGATC 

CGGCGGGCGTTTGGCTGCTGGTGCTATGTGAGTTTCAGCGTATGATGGCATGAGCACCACCCACTTTTTTGTGTTGGGTGGGTGGTTGCTGTGGT 

GGTGCTGCTAAAATTACTTTGCCCCCATGTGTGCGTGTATGGTTATTGATTTTCTGCTGCCTCCCCTTTG 

(SEQ ID NO: 208) 

Exon; 3440.. 3104 
Exon: 3044.. 2407 
Exon: 2347.. 1001 

Start ATG: 3326 (Reverse strand: CAT) 
Transcript No. : CT722 6 

CAAAAATAACGGCGGGAATCGACGGCGTCGCTTGTCAACAACGCAAAACTAATTGATAAAATCGGTTGCAAGGAGGCAGACGTATCTTCTAAGTT 
AGGCACAACACAGTTGGCGATGGAATCCCGGCTACCGAAACCGTCGGGCCTGAAGAAACCCCAAATGCCGATTAAAACCGTGCTGCCCACAGATC 
GAATTCGCGCAGGATTGGGAGGTGGAGCCGCTGGAGCAGGCGCCTTCAATGTCAATGCCAACCAGACATACTGCGGCAACTTATTGCCGCCCCTC 
TCAAGGGACCTCAACAATCTGCCCCAGGTGCTGGAGCGTCGCGGAGGAGGAGCACGTGCCGCCTCCCCAGAGCCCATGAAGTTGGGCCACCGGGC 
CAAGCTGAGACGTAGCCGTAGCGCTTGCGACATCAACGAACTGCGTGGTAACAAGCGCACTGCGGCTGCTCCTTCATTGCCCAGCATTCCCAGCA 
AAGTATCCCGCCTGGGCGGTGCACTCACTGTTTCCAGCCAGCGACTAGTGCGTCCTGCGGCGCCTTCGTCAATAACAGCAACAGCTGTCAAAAGA 
CCACCAGTAACGCGTCCTGCTCCACGGGCTGCAGGAGGAGCAGCCGCCAAGAAACCAGCAGGAACAGGAGCAGCAGCTTCGTCAGGAGCCGCGGC 
TGCTGCTCCCAAGCGCATCGCTCCCTACGACTTCAAGGCCCGCTTCCACGATCTGCTAGAGAAGCACAAGGTGCTTAAGACAAAGTACGAAAAGC 
AAACAGAGGACATGGGCGAGCTGGAGTCCATGCCTCAGCAACTGGAGGAGACGCAGAACAAGCTTATCGAGACGGAGTCCTCGCTGAAGAACACC 
CAGAGCGACAACGAGTGTCTTCAGAGGCAGGTGAAGCAGCATACCGCCAAAATTGAAACAATCACATCGACGCTGGGCAGGACCAAAGAGGAGCT 
ATCCGAGCTGCAAGCAATACATGAGAAAGTAAAAACGGAGCATGCTGCTCTAAGCACAGAAGTGGTGCATCTGCGCCAGCGCACCGAGGAACTCC 
TGCGCTGCAATGAGCAGCAGGCCGCCGAGCTGGAGACCTGCAAAGAGCAGCTCTTCCAGTCGAACATGGAGCGCAAAGAGCTGCACAACACGGTC 
ATGGACCTGCGCGGCAACATCCGGGTCTTCTGTCGAATACGACCGCCGCTGGAGTCCGAGGAGAACCGTATGTGTTGCACCTGGACCTATCACGA 
CGAGTCCACCGTGGAGCTGCAGAGCATTGACGCACAGGCCAAAAGCAAGATGGGCCAGCAGATCTTCTCATTCGACCAGGTCTTCCACCCGCTCT 
CCTCGCAGTCGGACATCTTCGAGATGGTCTCGCCGCTCATCCAGTCGGCCCTGGATGGCTACAATATCTGCATCTTTGCCTACGGACAGACGGGC 
AGTGGCAAGACCTACACAATGGACGGAGTGCCGGAGAGTGTGGGCGTCATACCGCGCACGGTGGATCTGCTCTTCGACTCCATCCGGGGATATCG 
C AAC T T GG GC T GGG AGT AC G AGAT C AAG GC C AC C T T T C T GG AG ATC T AC AAC G AGG T GCT C T AC GAT C T GC T G AGC AAC G AGC AG AAGGAC AT G G 
AGATTCGAATGGCCAAGAACAACAAGAACGACATCTACGTGTCCAACATAACGGAGGAGACGGTTCTGGATCCAAATCACCTGCGCCACCTCATG 
CACACGGCCAAGATGAACCGTGCCACCGCCTCGACAGCTGGCAACGAGCGCTCCTCTCGTTCCCACGCGGTTACCAAGCTTGAGCTCATCGGACG 
CCATGCCGAAAAGCAAGAGATCTCCGTGGGTTCCATAAACCTGGTGGATTTGGCCGGCTCTGAGTCTCCCAAGACGAGCACCCGGATGACCGAGA 
CAAAGAACATCAATCGCTCGCTATCGGAGCTCACCAACGTAATCCTGGCGCTGCTGCAGAAGCAGGACCACATCCCGTACAGGAACTCCAAGCTG 
ACGCACCTGCTGATGCCCTCGCTGGGCGGCAACTCGAAAACGCTTATGTTCATCAACGTCTCGCCGTTCCAAGACTGTTTCCAAGAGTCCGTCAA 
GTCGCTGCGCTTCGCGGCCTCCGTAAACTCCTGCAAAATGACCAAGGCCAAGCGGAATCGCTACCTGAACAACTCGGTGGCCAACAGCAGCACAC 
AGAGCAACAACAGCGGCAGTTTCGATAAATAAAGAATGCATTCTGAGCCCAGTTTTAACAATTTTCAAATTTCTAACCTGTTATTGCTTAATTTA 

TGTGTGTTTACTTTTAGTGCAAATAAACTAATAAAGTGCTGG 

(SEQ ID NO: 209) 

Start ATG: 115 (Reverse strand: CAT) 

MESRLPKPSGLKKPQMPIKTVLPTDRIRAGLGGGAAGAGAFNVNANQTYCGNLLPPLSRDLNNLPQVLERRGGGARAASPEPMKLGHRAKLRRSR 
SACDINELRGNKRTAAAPSLPSIPSKVSRLGGALTVSSQRLVRPAAPSSITATAVKRPPVTRPAPRAAGGAAAKKPAGTGAAASSGAAAAAPKRI 
APYDFKARFHDLLEKHKVLKTKYEKQTEDMGELESMPQQLEETQNKLIETESSLKNTQSDNECLQRQVKQHTAKIETITSTLGRTKEELSELQAI 
HEKVKTEHAALSTEVVHLRQRTEELLRCNEQQAAELETCKEQLFQSNMERKELHNTVMDLRGNIRVFCRIRPPLESEENRMCCTWTYHDESTVEL 
QS I DAQAKSKMGQQI FSFDQVFHPLSSQSDI FEMVSPLIQSALDGYNICI FAYGQTGSGKTYTMDGVPESVGVI PRTVDLLFDSI RGYRNLGWEY 
EIKATFLEIYNEVLYDLLSNEQKDMEIPJyiAKNNKNDIYVSNITEETVLDPNHLRHLMHTAKMNRATASTAGNERSSRSHAVTKLELIGRHAEKQE 
ISVGSINLVDLAGSESPKTSTRMTETKNINRSLSELTNVILALLQKQDHIPYRNSKLTHLLMPSLGGNSKTLMFINVSPFQDCFQESVKSLRFAA 
SVNSCKMTKAKRNRYLNNSVANSSTQSNNSGSFDK* 
(SEQ ID NO: 210) 

Name: KINESIN-LIKE PROTEIN 
Classification: motor jprotem 
Gene Symbol : ncd 
FlyBase ID: FBgn0002924 



Celera Sequence No. : 142000013384244 
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TGGACTAAACTCGCAGATTCCTGTAACTTGGAAGCACCACGAGTTTGCTTTAGAAGATGCCTTTCCGTTGAACTAAAACTAGACGAGTAAAGCGT 

ATCTCAAACATTGTATCTTCCGAACCAAAATAGTGCTGCATACTTTTCGGGATATGTCTTATTCAATGTAAGTAATACAAAAAGAAAAAAGGTAA 

CTTCTTAAAACTTGTAATATTTGTCACAAATGAATGACGACTAGATTTGGCCATGTCTGTAACAATAAACGTACAAAATATTCGAAGTTGCAGAA 

ATGGGGATCCAGCATTCGGATTTGGTTTTTTGTCAAACTTTTAAATATAGAATCGGACCACATCTTTTTAAGACCTTAAAAAACTGATTTGAAGT 

TAGACCCCTCATAATGCCAACTAAATTTGTTAAACAAAAATATGCCCCTATATGCCAATCAAATTTAAATACAATGCTTTCTCAGCGTTTCAGTT 

AAATACTTTGGGTTGTTTAATGGTTTTATACGTATACATGTGCTAAGTCCATGTATCTTATATTTCGTAAATCCATAGATTTTGTTCGACGAAAC 

TTTGCTAAATGATATTGATCTTTCAAATATAACTTGTTAATTTCCTTTCATCGCCGTGGCAAATCCTTCGAGAGTGAGTACTACTTGTTAAACGC 

ATATATGAGGAAAATTTATTGATGAATAATGCGAACATGCAAACGGAACACATCCCCGAATTCACACAAAGCATCTCTGTGGTATCTCTTTTGGT 

TTTTTATTTGTTGCATCAACTCCGCGGAACTCCAGCTACCTCACTGCAGGTCGTAGTAGTCGACGTTATTGGCGGTCAGATGTTCTCTATTAATT 

TGGAGCGTGCTGCTGTTTCCACTTCAAACTTTTATTAACATATATAGTATGGGTTTGCCGGTATATTGCAAACGGGAACTAAGTGTAGATTCGTG 

AATGATATCAGACTGGACTGCAAACGTTTAGTTTCTAGCATGTTTTTTTTACTCTCTCCTTTATTTGTTGGAATTTTTGAGTTTTATATAGGTGG 

AGTTTTCAAAATGACATTAATTCATAGGGGTGAGTTTTATATATAGTTCGACTGGCTGTTGCAGGTGCTGGTTTCGAATTTTTGGATGGGGATGC 

TAAAGGGCGGCTCCGTTTGTGTTCATCCTCATTTAGCCATCAACCAAGTTCTTACTCGCCATTCTTTTGTTCGTCCTCAGATTCGGATTCTGAAA 

GAGAATAGAATACAAAATGTTAGTATGGGGGCTTCAATTGGTACAAACTGAAAAGATGTGCAGCTCCGGTTATGCTGTCCAAATGATTATTGGCC 

AAAT TT AAC T AT GG AAAGTT AT C T G GAC AC C C AAC C AT T C AA AT GC TT T GG C AC AAC TCCCCGCTTGGCT C AAT CG AAT G AAGC AACC AT T C GG A 

GCTTTTATGAGGGGATTGTGCATGCGGACTAGGACAAATGCAGCTGTCCAAACGATTATTGGCCAAATTAACCACGGCATTGGGTTATATGGGCA 

CCCAATCATTCTTTCGCTTTTGCATACTGTACATGAACCCTTTAAAACTGGTTCCTAATAATGGGAACACTTTAAAAGCTTGGTAAGAAGCATGC 

TCGGTGAAGTCAGCTATTTGTGTCCAAGTTTTCATTGGCCTTTGATGCTCGGATTTTGTGAGGCAACAAGGCAAGCAAAAACTCAAATGCTCTTG 

CAATTTTGTACAATGAAAATGTCATCCAGGACAAGACCATTCTCAAAAAGTGCACTGTGTAAAGTTTCGAAGGAAGCAAAATTTGAATATTTATA 

TTGTCCAGGTTTTCATTGGCCTACGTTGTTCAGTGAATGGAAATCAACGAGGCAAGCAAAAAACCCAAATTTCTCTTGCAATTACGTACCAGAAT 

CAAGCTATTTAAGACAAGACCATTCTCAAAAATTGCACGCTGATAACATTTATCAAAACATACTAACCTTCTTCGGCTGACTGAAGCCATTCGAC 

AAACTTTCGCATTTGTTCAAGGAAATGCATTTTGCCCTTGTTTGAGTGGCCCTCCTTGTACCAGCGCAAGATGATCTCCTCGGACAGGACCTCGG 

TCTTGTAGAACAACAAAATGATTTTCTGGAAGGCCTTCATGAAGTTCATGTTCTCGTAGCAGAATTCCTGCACCTTCAAGATTAGGGCCAATTCC 

GAACGGTCGGTCGAGGCGAAAGCCTGCAACAGTGGGCAGTAGTTCTTCAGATGACGAACAGCCTGATCGGTAACCAGCTCCTCCTTCTTGTTCCA 

TTCGCCCAATGACATGATGGTGGACCAAATCTGGTAAAGATCATACATGAGATTTAGTAAGGAAATCATCCAGTTGGATTATTTACGCATCGACT 

TACAATAACAATTATTTCATGATCGGGAATGTTGGTGCGCTGCGAGAAGTCCTTAATGTCTGAGGTTATCTCATTATATGGCTTCTCATCGTTGA 

TATCATCGATGAGCGCCTGCTGCAACTCACGCTTCGCCTCCTGGCTAGCCTGAGCCTTATGTAGCTTGACGATCTCGTTAAGTTCTTTGTCAAGA 

AAAACTTGTTTAAAATATTCCTCAGTACGTTTGTTCGGTGGGAAAAAGTCCATAAGTCTGGAAAAACGTGAAAAAGTGGAACCATTATTAATAAT 

TTAAGTT AACGCATTC AAT AAT C AACCTT AAAT GACGTATTGAGGTGTACGAAATTAAATCAGCACCCAAATGAAGCGAAAGGTCATTTGCGAAT 

TGGGTAAAACTCCGTCGAACTTTCTTACTCTAGGAATATCAATTAAAGATGCACAACTGGATCATTGAGTCATCTTCCTGAGAACATTCCGGGGC 

TAAAAACAAATCACAACTAAAAACTTACTTGCTTTCCAGTCCACCCTTCTTAAGCGCCTGAATCAGATATGCAATGCCCTTCTCTTGCTTGAACG 

TCTGGAACAGCTCTAGTAGAAACTCCAATGCAATGCCATCCTTGATTAGGTGCTCGTTGTTTAGTACCAGTAAAACATTTGGCGGTACAGAACCG 

TTGACTGTTGATCAAAAATAGGGTTAAATATAGGGTTAGTCAAATGTTTTCGCAAAACAAGTTTAGACGGCCGGCTAGCAATCCTGTCCTTGGGC 

TCGTGGCCCTTAGATGTGGTTAGTTCATCCAAGAGTCAGTATACCAAGTGAAGGATTACTTTAGCCCCTCAGCTGCAAGTTGAACAAGTGTTTAT 

CGAAACAACTAATTGAAAGACCGAATGGGCAGACGCAAGCAATGCACAGGAGGAACAGGAACATAAATGTTAAGACTTACCTAACCAGAGCGCTG 

TCATACGCGCAAGCTTGATGCGTTCACTAGGAGTAAAGCCCTTTACGAATAATAGAACCTTTCCCATCTCCTCTTCGAACATTTTCTCAAGGTAC 

TTGTACCGTCTGATGAGCTTGACAAATACCTGGAAAAGGTAAAGTGTGATTGCCCCTGTTAGCGGATGGTGAAATTTTACTGTGTTCTCACGCAG 

GAGGCTTTTGTATTTGTTGTTGCAACATTAGAAATCTTTAGATTGCTGGCGGTAGTAATTTTACAAAATAAAGACAAGTAAAATGTGGCTTTTTA 

TTTTGCATATTTTGACACCGATTGAGTTGTTGAGATGCATAACCGGTTCTAATAATATGCACTTTGATTCAATAGCGTGACCTTCGCAGTGGTTG 

CATGCATATAGTATATATATATAGTGTTGAATAAAGAGTAGTTGTTGACACATAAGAAATCTTCAGAACGTTAGTGTTTAGGACAAACCAAAGAC 

AAGTAAATGTGTTGATTGTTATTGCATATTCTTGACACCGATTACATTTTTGAAACTTTATAGTCAGCTCTAATGATATGCAGCCACTTCACGAA 

TTTCATACTTTTAAGTGGATCCGAAGAAAATCATTTAGTAGTAAATAAACGTTTTAGCTTGGAAAACAACAAGAGCTCAACTTGTGTTGCAATCA 

AACCAATTTTCGTTTACTCATAATCATAATTTAAGTATGACCTGCTAAAAGATTACAAACCTGCTCATGGTTGCGCATGCTCTCCATGCTCTCGG 

GCGCATCGAAGATGCAGTAGCTTGTGCGCGGCTTTTCGCCATCCTGAGATATAGAACCACCAGGCACTGCAATTGAAACGAGGATGGAAAAGGGT 

CAGTGAAGAGACTAAACACATTACTTGCAGCGGAAAACTTACCTAGTAGACCTCCTGCAATCAATATGTCAAAGAGAACTTCACCGTAGCGACGA 

TAGTCGAGCTTGTTGCCCGCGCTGTCTAGATATTTAGAGATTTGTTCCAGGTCGCCTTCAGTTTTTTCGAGACCAGCAATGACCGCATCACGGAA 

TCCCGTTGGGTCATATTTTTCTCTTTCATCTCTTTTTCTGGTCTTGATGCGTTGACCCGATAGCACTGGTCTTTCAGTTTTTTGACTCATACAAT 

AAATTGCTGCAAAGGTAAATGTGGATCAGAGATGGAAGCGGTTTTGGTTTGCCCCTGTAAGTTTTTATTTCTGACCAGACTAAATTATCATGGCC 

ACTAGATTCATTTTTAGATTTATCTTAATTTTAGTGTAAGATGACTAAGGTGGTCTCCAAGCGGTGGAAAACCGTTCCCTGTATATGTATTCTCG 

TTGCGCTCATAATCAAACTACAACATTTTCACCTATGACCGGCGTTTAAGTGGAAACTCTAGTGAAAACTGAGCCTAAGAAAGCCTTAACCTAAA 

AATGCCAGCGTACATGTGCTTTAAACACACACATTCAAGAGCGGAGAACGGGCACACTTCACATCTCTTTTTAATTATCTCTCTTTTTCACACAC 

ACATACCCACTGTGCATTTTACAGTTTGATAATGTCGACAAAAGAAATAAAAAGAATTGAGAGGTGAGGCGACCAAGAAATCGACATAGAAACGC 

AGAATAAAGTGAAATTTTGACGTGGTTACTTGTGTTAGGTAGCGCGCTGGGGAAATGAGGTCATTGTGCATGTAGTGCACATAAAAACGCGCCGA 

TGACTGTGTGCTAGTTGGCCGGGAAAATTGTCATAGCAGCGAAAACGCAAAAATTAAATTATCTAGGCTATCAAGTACACATTGGTACATATAAG 

GAACGGCAACTTACGCTTTCACCCAAGGATATTAAGTCTGGCCCTGTGTTTGCCGCTTCCGCGTGTCCTTGCAATCAATTGAAATCAGTCGTGTG 

CGAGTGTAGTGTGTATGTGTGTGTGTGTGTGCCTGTTGCTTTTTCACAACTAACGAACGTGCGAGAATATGTAATTTAATTTAACCTCACGAAGC 

GCTGGCTGGTTAATGCTTCTCACTCGAGTCGACGAAAAACAAAGAGACAGATGGCAGATCCCTTCTTTTCTTACCTCTCTTTTCTTATGAGCGAG 

CTTTTGCACGGTAAAATTGACACGCACGCACACACACAACAACAAAGAAAAGGGGAGAGCAAATAGAAAATATAATTTTTTGTAACGGAATTTAC 

TTTTATGCCTTCAAAATAAACGTTAACAATTTTGCTCAGACTACGTCCGGAATTCTATTTTTAACAGCTATCACAATCGGTTACTAGGTATATGT 

GTTGTTTGCCT CG AAT CCAGCGCCCCTCTGCTC GC AC C ACT T T G T T AAAC GAAAT T C GG AG AGC G AAT G AG AC G AC AC AC G AGC GC AC G AAC G AA 

GCGGACGAGCCGAGAACGAAGACATGTGGTTCGGCGCCATTTTGAAGCGATACACTGACATGGCGAAAGCGGAATGATGGGCACAGGGAAAACGA 

AAATTAGGGGGAAACGGTGAGAAAAAATGTGGGATGGGGGTTCAAAGTGTGTATATTGCTGTCGCGCTTATTAAATAATGAGCAAGGTACAGTCG 

AACAAAGGCCGACGGCCACCAAATGATTTTTTGCATGAATATATGACTGCCCGATGACTAATCTGGCCTTAGGTACGCACGCAGTTCGTACATAA 

CCCCTTTTCCAACGAAAGCGCAGACAAAGGATTGGTCAAGGCCAGAGAAATGACCAGGACCGCGATTTTACTCGCCCTATTTGTCTTTCTGTGTA 

CTTAGGTCTACTCTAGCGCTATTTTTCACTCCTATACTTACGATTAAAACAAAAAAATCTGTTTTGCTGCGAATAAAAGTAACTTTTTGCGTAGC 

TGTTGGGCTGCTTCTTCTGGATGACTTGCGGTATGACCGTGGCCTGCTTCGTACGCAATGGTACAGAGCTCACTAAGGTGTCCAAAAGGTGGCAA 

CATTTTGGTGAAGCTGTTGGCTGGCCCCACTAACCGGACCTGAAAATGGTGCAGCGCTGCCGGACATTTAACGGAATCAAAGATTCCACAAAAGT 

CATGTTTGCATTAACAAAATTCATATTTAACCTTAACAAATTCAACAAATTAAGTTAAGTCCATTTTAGTAGCGTTGCGAGGAGTTTATTTTCGA 

AAATACGCCTTTGTATATATTATATATTTTTCCTCCACTTTAAACAAAATCGAAAGGTGTCCATTTGTCAGCTATTTAAACAATATGGCATCGCC 

GCCTTACTTGCTCAGTAAGAAAATTTAACTGTTTTTTTTTTGCGCACCTGAAAGCGGCGATAGACATTGTTTACTTTTAGTTTATATTTACATTA 

TCCAAAAGCCAAATGAACTTTGAGCAGGTCAATATACCAGAGAAACTAGTGCCGGACAACTATTTGCAACTGGGACTGGCTGCGCAACGCAGTAA 
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GCAGCGCAGTTTTAAGGAGCTGCTCGAAAAGGTAATCAAACCGAAATTAAACCCAAATTAATTGATAACTCCATGGATAATTCTTCTTTTCAAGC 
GAAAGCTTCCGAA 
<SEQ ID NO: 211) 

Exon: 5853 . .5395 
Exon: 5274.. 5145 
Exon: 4471.. 4223 
Exon: 4151.. 4051 
Exon: 3449.. 3311 
Exon: 3044.. 2879 
Exon: 2622.. 2379 
Exon: 2310. .1968 
Exon: 1229.. 1001 

Start ATG: 4460 (Reverse strand: CAT) 
Transcript No. : CT7240 

CAATATACACACTTTGAACCCCCATCCCACATTTTTTCTCACCGTTTCCCCCTAATTTTCGTTTTCCCTGTGCCCATCATTCCGCTTTCGCCATG 
TCAGTGTATCGCTTCAAAATGGCGCCGAACCACATGTCTTCGTTCTCGGCTCGTCCGCTTCGTTCGTGCGCTCGTGTGTCGTCTCATTCGCTCTC 
CGAATTTCGTTTAACAAAGTGGTGCGAGCAGAGGGGCGCTGGATTCGAGGCAAACAACACATATACCTAGTAACCGATTGTGATAGCTGTTAAAA 
ATAGAATTCCGGACGTAGTCTGAGCAAAATTGTTAACGTTTATTTTGAAGGCATAAAAGTAAATTCCGTTACAAAAAATTATATTTTCTATTTGC 
TCTCCCCTTTTCTTTGTTGTTGTGTGTGTGCGTGCGTGTCAATTTTACCGTGCAAAAGCTCGCTCATAAGAAAAGAGAGTTGTGAAAAAGCAACA 
GGCACACACACACACACATACACACTACACTCGCACACGACTGATTTCAATTGATTGCAAGGACACGCGGAAGCGGCAAACACAGGGCCAGACTT 
AATATCCTTGGGTGAAAGCCAATTTATTGTATGAGTCAAAAAACTGAAAGACCAGTGCTATCGGGTCAACGCATCAAGACCAGAAAAAGAGATGA 
AAGAG AAAAAT ATG AC C C AAC GGGAT TC C G T G AT GC GGT CAT T GC T GGT CT CG AAAAAACT GAAGGC GAC C T GG AAC AAAT C T C T AAAT ATC T AG 
AC AGC G C GG GC AAC AAGC T C G AC T AT CGT CGCTACGGT G AAGT T C T C T T T G AC AT AT T GAT T GC AGG AGGT C T AC T AGT GCCTGGTGGTTCTATA 
TCTCAGGATGGCGAAAAGCCGCGCACAAGCTACTGCATCTTCGATGCGCCCGAGAGCATGGAGAGCATGCGCAACCATGAGCAGGTATTTGTCAA 
GCTCATCAGACGGTACAAGTACCTTGAGAAAATGTTCGAAGAGGAGATGGGAAAGGTTCTATTATTCGTAAAGGGCTTTACTCCTAGTGAACGCA 
TCAAGCTTGCGCGTATGACAGCGCTCTGGTTAGTCAACGGTTCTGTACCGCCAAATGTTTTACTGGTACTAAACAACGAGCACCTAATCAAGGAT 
GGCATTGCATTGGAGTTTCTACTAGAGCTGTTCCAGACGTTCAAGCAAGAGAAGGGCATTGCATATCTGATTCAGGCGCTTAAGAAGGGTGGACT 
G GAAAGC AAAC T TAT GG AC T T T T T C C C AC CG AAC AAAC GT ACT G AGGAAT AT T T T AAAC AAGT T T T T C TT GAC AAAG AACT T AAC G AGAT C GT C A 
AGC T AC AT AAG GC T C AG GC T AGC C AGG AG GCG AAG C GT G AGT T GC AGC AGG C GCT C AT C GAT GAT AT C AAC GAT G AGAAGC C AT AT AAT GAG AT A 
ACCTCAGACATTAAGGACTTCTCGCAGCGCACCAACATTCCCGATCATGAAATAATTGTTATTATTTGGTCCACCATCATGTCATTGGGCGAATG 
GAACAAGAAGGAGGAGCTGGTTACCGATCAGGCTGTTCGTCATCTGAAGAACTACTGCCCACTGTTGCAGGCTTTCGCCTCGACCGACCGTTCGG 
AATTGGCCCTAATCTTGAAGGTGCAGGAATTCTGCTACGAGAACATGAACTTCATGAAGGCCTTCCAGAAAATCATTTTGTTGTTCTACAAGACC 
GAGGTCCTGTCCGAGGAGATCATCTTGCGCTGGTACAAGGAGGGCCACTCAAACAAGGGCAAAATGCATTTCCTTGAACAAATGCGAAAGTTTGT 
CGAATGGCTTCAGTCAGCCGAAGAAGAATCCGAATCTGAGGACGAACAAAAGAATGGCGAGTAAGAACTTGGTTGATGGCTAAATGAGGATGAAC 
ACAAACGGAGCCGCCCTTTAGCATCCCCATCCAAAAATTCGAAACCAGCACCTGCAACAGCCAGTCGAACTATATATAAAACTCACCCCTATGAA 
TTAATGTCATTTTGAAAACTCCACCTATATAAAACTCAAAAATTCCAACAAATAAAGGAGAGAGT 
(SEQ ID NO: 212) 

Start ATG: 601 (Reverse strand: CAT) 

MSQKTERPVLSGQRIKTRKRDEREKYDPTGFRDAVIAGLEKTEGDLEQISKYLDSAGNKLDYRRYGEVLFDILIAGGLLVPGGSISQDGEKPRTS 
YCIFDAPESMESMRNHEQVFVKLIRRYKYLEKMFEEEMGKVLLFVKGFTPSERIKLARMTALWLVNGSVPPNVLLVLNNEHLIKDGIALEFLLEL 
FQTFKQEKGIAYLIQALKKGGLESKLMDFFPPNKRTEEYFKQVFLDKELNEIVKLHKAQASQEAKRELQQALIDDINDEKPYNEITSDIKDFSQR 
TNIPDHEIIVIIWSTIMSLGEWNKKEELVTDQAVRHLKNYCPLLQAFASTDRSELALILKVQEFCYENMNFMKAFQKIILLFYKTEVLSEEIILR 
WYKEGHSNKGKMHFLEQMRKFVEWLQSAEEESESEDEQKNGE* 
(SEQ ID NO: 213) 



Celera Sequence No. : 1420000133848 92 

CATTTACATAACCGAGCCAATATTGAAGTGGTTTATAATAGATTAATAAGATTGTCCCAAAATTTCTCTATTAAATGTCAAGAGGCTGTGCACCA 

GCATATAAATGATTTGACCTATCCACCTTAATTGTAATTTTTAGATAATAGCCTTTTTTAAGTGCTTAGATTTTTTTTCAATTACATAAAATTTT 

TTTCCCAATCGGTATAATATTTTCATTATCATATTATTCATATTGATATTAATATCTAAGTAGATAAGGTTTAATTATTGGCCTGTACTCTTATT 

ATTGTTAAGGTTAAAATAAAAAAAAATTGGATGTCATTAAGCGTGAAATAAATGAAATGAAATAAAATTTTGCGGAGTTTATTCTTCATGTAATT 

TTAGCTAAAGTGTGAACTGTAGATTTGCCGTGTTTGTTTTACAGGCTTCTTAATTGACGTAAATATATGGGATATAATGTCATGAGTCAAGAGAT 

TATCAAAT AT TAAATTTTCTGATTAAAATTAAACGAAAGTCC AAAAAT AC AAAAAAAAAAACAACTTACCGTAGGCAAAAT AGT AAT 

TAACGTTCTGTGTCGAAATCATATTACCTAAAATAGTTTACTTTCCACTAATATGTACTTGGACCGCACCCTACCAAGAAAATAATAATTTCTAC 

TATTGGCGACAGCTGTTATTTGTGCGAATACAAACATGCTCAAATTAATAAAAGTTACGCTTACAGTTTGATTTTAATTTTTACGTTTCATTTTA 

TTTTTTAATAATCTCGAATCTTGATTTTGCTGAATGCTGGATTTACTGGCCATTACCTTTCCTGCCCATTAAATCGTTGTTAAACATTGAACATT 

AATGAGAGTTTCGTCGCAGACTATTTCGTAGGCTTAGACAAGTAAATAAAATGTAATTGGGAATGTGAAATAGGTGGTACCTTGATTACAAATAA 

AAAAGGTACGATTTCGAAAGCATTTTAAAAATAAACATTTCAGAGTATTGCTGTAAAAGTTGTTTTTCGTACTTTTTATTGTTTCTATGTGGTAG 

TATTTTTTTAGCCGGTTGGCATACATTTTTCTGGGGTTGTTGCTGGCAGCTTTTTGTTACATTTTACTGAATTTTTAGTTTTACTTTGTTCGATA 

T AAAT ATTC AT ACTC AAAAAT AAGATACGGTTGTAAAATT AAAAAT TGT AAAGGATTAC AT ATGTT ATT TTGCCGTAATTGTTTTGATGTTTTTC 

TTACTTAAAGATTTTGTTTGCAATTTGTTTCATTCGCTTCGTATTTTTTTTTACAATTATTAAATTTGACGCTTTTGTTTACAACTTTTATTCTT 

AAAACATAGAGGGTCTCGCATCTTAATTTCGCTTAATATTGTTCTCTTAATCCTTCGCTTTATATATATATATACTTTGCTTTAAAGTTAAGATT 

TTAAGTAAGTTTTTAAATAATAATGGATTTTTTGTTTTCGTTTGTTGCCGGTATTGTTGGTTGGTAGGTTGTTTGTTAGTAGAGAGAGAGAGAAC 

CGGTACGCTATAAAACTACGCTCCCATTGCCGGATTGTTATTGGAGAATTGCGCCCGCCACCCAAGCAGCCACCCACGTATCACCCGCTCACAAG 

AGCGGAAAATGGATACAGTCCGGGTTCCTGGCGGTAGAACCGTAATTTCTGTGATTTGCTTTTTTTGTGTTAGTAAGTATTTAATAAGTAGATTA 

CTGAGTTTGCTGCTCCGCGGGCGATTCCCTTAGGCGGCCACTTCGCTAGCCTCGGCCCCATTCTGAACCTCATCCTTTGTGCTGGCCTCATCAAG 
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CTTAGCCTTTTTCTCCGGCGTGGCGACGGCCTCATCTGCCTTGGCTGCAGCCTCATCCACCTTCCGCTTTACAGCCTCAATGGCGGGAGCATCTG 
TAAGGCGATAATAAAAATTGAACAATAAGAACATGAACGTGGAAAATTTGTTTGGCATTGATATCATTGATATCATCATACCTGTTGAGTCACCG 
TTGGTGACCTCCTCGGAGCCATTCTCGACAGCAGGTAGAACAGCTTCGCTGCTCTCTTTTTTGGTGTCCTCCCCATCGGCAGCGGAATCCTCTTT 
TTTATCCTCGTCCTTCTTCTCATCCTTATCGGCGGCAAAAGAAACAGTGGGCTCAGACGCCTTTTCACCATCGACTGCGTCGGTGGGCTCGGTCG 
CGCTGCTGTCGGCGGCTCCGTTCTCCTTGGCTACGCTCTCCTCTTCGGCGCCACCGTTTTCTGTGATGCTCGCCTTCTCAGCTGCTACCTCCTCG 
GCGGCAACTGCATCTTTCTTCACAGCATCAACTTCCTCAGCTGCGACCTTCTCGACAACAGGGGTCTCACTGTGGGTGATAATCAGAGACGAAAG 
TCAGTATGCACTTATAATTTTTTCCGAAGGTGGCCGAACTGAACTTCGTATGCATAGCATTCTACTACATACAAAACTCTAAACAGTAAATCAAT 
TAAAAATTATCAAAAACTAAAACTATTAATTATTAATTAATTAAATATTAATT^ 

AGGTAACGTTTATGTTTTCATCCTCCACTTTCTATTCATAAGAGCCCCAGGACGTTCGGAATTAAAACTAAAAAAAAAGTGCATTGTTACATCTG 
CCGGCGTTTTTCAAAAATGGGATCAACAACTCGTGCAAGCTGCGACGCAAAAGGAAGCGCGAAAAGGTGAGTGCGATTAGGCGGCACGCTCTTTT 
TCATATAAATGGCATATATTCATTTGTTATTTCAGTGCTCTCTACCAACAGATGAGAACAGGGTAGAACCAAAAACACGCTTGCAGACGTGCAAA 
ATGAAAAGCGAGACCCGTTTAAACTAAAACGAACAAAGATAGAAATAAAAACCAGTTTTAATGGGATTAATGACAATATAGTCGTTTCTCTCGCA 
CTCATCTACCAACAGATACTCGCCATCTGAGGTAGATAGAATTCTATAGGGCGAACAAAAAGAGAGTTGCAAAGTAGCAACGCTAAACGACAAAA 
ATAACGGTCCAAACGATACAAACAGTAGCATAACGTCATACAACCTGCGTCTTTCAAAGAAAACCGGAACAGCCTTAGGCATGAGTTTTGTTTTG 
TTATGTATTATAAGCTGCAACATATAATAACTACTTAAAAAAAATATATACATAAGTGTTCATCGATAGGGAAAGAGGGAGTATTTTTGGTAATG 
CCAATATAGATATGCGTGAAAATTTATACTGTTAATTTCAAGCTTCGAGCTTTAAGCTATCAAATGGGTATAAAAGGCAAGCCTCGAAATTTGAT 
GTCTGGTGAGAAACGCAGCAAAGCAACATACGAGGTAGCACAGACAGATGAAATGAAACAGCACAAGAAAAGAGCGGCAACTTGAATGCTGCACA 
CGCATACACACACACATGTCTATAGACTTTTGCGCATTTGGCCTGTCGTCCGCATACACTCAACGCATTACTCCGCCGCTACACACACACACACA 
AATAATTCATCGTTGCTGTTTGCCGGCCGGAGAAAGAGCGAGAAGGCACCATGACGCCGCGCCAAATTAAACGGCGTCGGTTTTTGCGCGCACAC 
CGTTTTGCCTTT GGC GGG AAC G AG AC AAC T AC AT T AGT GC C AAC AAGC GGT T T G C GT AT AT AG AC AC AG AAT T C AAAAG AAAAT T TC C AAAT C T T 
TTCGGCAAATAACATAGATTTTCATTAGCAAAATTCGCTACAAGTCCATTGCTTAAACAAAATCCACAAACGTCCGAAAACAAGGCAAGCAAAGT 
ACT C ACT TC T T T T GC T C AGC C AC AT C AGC C AT TT T G AAT T AC T GT TAG AT AT T AC T T T T C AAAC GC G AAG C G AGAAAAC AC G T T G T AACT G AGG G 
TTTTTGGAT C AC C AAG C ACGCCT T AAAT AT C AG C AC AAAGAT T T T AAAAG AAC G AAAT T T TAT C AAT AT AGT T TCCTCTTTAGTAGCTT T GC T AA 
GAAAGTAGCTTGGTTGCTGGAGATTTTGCCGTACAATTAAGAACGTCCACACAGTCACAAAGAAAAACGCCGAATGAGAGAGCGGTGAGCGTCGG 
ACCGGCTTTTAAAGCGCCGCCCTGTAGTGTTGGTAGACGTCGGTAGTCCGCTTGCGATAGTTCCGCCTCCGGAAAACATATCTTTGTTTGTTATC 
GTTATCACCCGTTCTCTATTTAATTCGGAGTTAGCTTATGCTGATAGATTTTTACAATTTTATTTTAATATATTAAAGAGATGAAATATAGCAGT 
GACAAATATTTCAAAATAACGGTCCCCTAAAAAGGGCTGCCACCTTTTAACTATTATTTAGCGGCTACACAAATATAGCAGGTAAAATATTCAAA 
TGCATTCCTATCCTTGTTTTTCTCATTGACCTGAGTGATTTTTCGTGTAATACACCTTCCCAATTCTAAGTTTTTTTTTTGATCCGGTACTCCTA 
AAAAT T AAAT AAAC AAAC AC ACAAAC AT GT AT GT T CAT AT AT AT AT AT AT AC AC AT AT C GTC ATT T T T AAAAAAT T C AAGC T T T T T AT T AAAT T A 
CATCAATCAATTTCCTTGTAAATTACACAGTCTACAAGATCTTAAAAATAATATCTTAAAGATATGTCATATGTATGTAATCAAACACATAAAAC 
CGACGGAAATAATCCTCACCCCTCTGAAAACATTTTCAGGTAGTCAGATATGCACGCATAAGTACGTACATACATACATATGTATAGTGGCCTTG 
TCTGCGTCCCTTGATCGGTATAATAGAAAAAAGAATTAGAGTGCGCTACCAGGCAGCTGGTAAATTTTCTGCTTCACTAAAATTTTACAATATTT 
TAGGCTATTGGCAAGGAAAACAATACACTAGGACTGAGATTCCATTGCCGAAGAACATAGACGCTATTATACATACATATAATTACAAGTTTTCA 
ATT T T C AT AAGT C AAC AT T T AAT TTTAGGTCT CAT G C T AAAC T C C AC T T AT T GAT C T AT AT T GAT T G AC GAT C AAT CCTCGGTCT CG AGC T TT T T 
TAATATGGACTTTGGGCTGTAAAATTTTAATGGGC 
(SEQ ID NO: 214) 

Exon: 4070.. 3807 
Exon: 2349.. 1982 
Exon: 1897.. 1001 

Start ATG: 3832 (Reverse strand: CAT) 
Transcript No. : CT7302 

CTCTCATTCGGCGTTTTTCTTTGTGACTGTGTGGACGTTCTTAATTGTACGGCAAAATCTCCAGCAACCAAGCTACTTTCTTAGCAAAGCTACTA 
AAGAGGAAACTATATTGATAAAATTTCGTTCTTTTAAAATCTTTGTGCTGATATTTAAGGCGTGCTTGGTGATCCAAAAACCCTCAGTTACAACG 
TGTTTTCTCGCTTCGCGTTTGAAAAGTAATATCTAACAGTAATTCAAAATGGCTGATGTGGCTGAGCAAAAGAATGAGACCCCTGTTGTCGAGAA 
GGTCGCAGCTGAGGAAGTTGATGCTGTGAAGAAAGATGCAGTTGCCGCCGAGGAGGTAGCAGCTGAGAAGGCGAGCATCACAGAAAACGGTGGCG 
CCGAAGAGGAGAGCGTAGCCAAGGAGAACGGAGCCGCCGACAGCAGCGCGACCGAGCCCACCGACGCAGTCGATGGTGAAAAGGCGTCTGAGCCC 
ACTGTTTCTTTTGCCGCCGATAAGGATGAGAAGAAGGACGAGGATAAAAAAGAGGATTCCGCTGCCGATGGGGAGGACACCAAAAAAGAGAGCAG 
CGAAGCTGTTCTACCTGCTGTCGAGAATGGCTCCGAGGAGGTCACCAACGGTGACTCAACAGATGCTCCCGCCATTGAGGCTGTAAAGCGGAAGG 
TGGATGAGGCTGCAGCCAAGGCAGATGAGGCCGTCGCCACGCCGGAGAAAAAGGCTAAGCTTGATGAGGCCAGCACAAAGGATGAGGTTCAGAAT 
GGGGCCGAGGCTAGCGAAGTGGCCGCCTAAGGGAATCGCCCGCGGAGCAGCAAACTCAGTAATCTACTTATTAAATACTTACTAACACAAAAAAA 
GCAAATCACAGAAATTACGGTTCTACCGCCAGGAACCCGGACTGTATCCATTTTCCGCTCTTGTGAGCGGGTGATACGTGGGTGGCTGCTTGGGT 
GGCGGGCGCAATTCTCCAATAACAATCCGGCAATGGGAGCGTAGTTTTATAGCGTACCGGTTCTCTCTCTCTCTACTAACAAACAACCTACCAAC 
CAACAATACCGGCAACAAACGAAAACAAAAAATCCATTATTATTTAAAAACTTACTTAAAATCTTAACTTTAAAGCAAAGTATATATATATATAA 
AGCGAAGGATTAAGAGAACAATATTAAGCGAAATTAAGATGCGAGACCCTCTATGTTTTAAGAATAAAAGTTGTAAACAAAAGCGTCZ^AATTTAA 
TAATTGTAAAAAAAAATACGAAGCGAATGAAACAAATTGCAAACAAAATCTTTAAGTAAGAAAAACATCAAAACAATTACGGCAAAATAACATAT 
GTAATCCTTTACAATTTTTAATTTTACAACCGTATCTTATTTTTGAGTATGAATATTTATATCGAACAAAGTAAAACTAAAAATTCAGTAAAATG 
TAACAAAAAGCTGCCAGCAACAACCCCAGAAAAATGTATGCCAACCGGCTAAAAAAATACTACCACATAGAAACAATAAAAAGTACGAAAAAC^ 
CTTTTACAG 
(SEQ ID NO: 215) 

Start ATG: 239 (Reverse strand: CAT) 

MADVAEQKNETPVVEKVAAEEVDAVKKDAVAAEEVAAEKASITENGGAEEESVAKENGAADSSATEPTDAVDGEKASEPTVSFAADKDEKKDEDK 
KEDSAADGEDTKKESSEAVLPAVENGSEEVTNGDSTDAPAIEAVKRKVDEAAAKADEAVATPEKKAKLDEASTKDEVQNGAEASEVAA* 
(SEQ ID NO: 216) 

Classification : known_flybase_gene 
Gene Symbol: l(2)k0581 
FlyBase ID: FBgn0022155 
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Celera Sequence No. : 142000013384170 

TGTCCTCATCGCCGATGTCTTCCTCGTCTTCCGAGTCCGGAAGCGCTTCCCTCGCAGCTTCTGCAGGATGTACGGATTAATCAGGCCGTCTTGGC 
AGGCTTATAGCTAATAAAACCTTCGCAATATTTGTTAAAGTCGCTCATTCTTGTCTATTTTTAATTTGCCGCACGTGCATTAAATGACGACAGTG 
CTGACTATTCAGCTAGATTTAAGCTAACTTGGGCTGAAGACACATTGAATACTTAATAGACGAGTTGAAAGGGATAAGAGTGCACTTCAAATGCG 
TCAGCACTCTTTCTTTTATGACGAAATCCACTTATTCGACAACTGAATTTATTTTTAACTACTTCAATAATATGATTTTGAAGCCAAGTCACACA 
CCACACCTTGCGATGCTTCTTCTTGGCTATGCAAACAAAGCCGTTCACAGTGAAGCGCCTTCAAATTACGGCGCGCATTGTGATCGTTTTAAATA 
AATTAAAAATTATGGTTCCTTTGAAAAGTTCATGGTAATGGTAAGATAAAAGCGGCTACCTATAATTAACTCAATTGTCAATCATCAACAGTTCC 
ACCATCATACTTGTACACCCGTACCACACACCCACCCATACAGCACAATAAGCAAGATTTCTTTGTTTTCGTTTAATATCGATCTGTAGTTTTAT 
TCTTTATACAAAAATGAATTTCACAAGAAGCCGTTCCGAATACGACCTACATACATGTGCTGTCGCCATAAGTCAGTTGCCTAAATATTCTTTCG 
GTTTGGAGTTGGATTTGCGTGTTCGTTGAGGTTTCTTGATTGCCTGACTTGACTATGTAACTCGTTTTGTGTTATGTATGGTTATATAAATATAT 
ATTTACATATTTATATAGGTTCTGCCATTTTATATTTAACAAAGGACTCTTATTCCTGCGCAGCATCTCTAACAAGAAGAACTCTTCAACAAGCA 
AC T ATG C AT T C AAT AAT T AG AT AT AT GC G GGC T AC AT T TAT AG AT C CT AT T C AC AT G AAGT C GT C AT AGT C T T C GGT G AAG T C AT TT CC GT AC T T 
TTGGTAACCGTCAATGTCCTGCAAAAGAGCGGATTAAATAAGTTCATTCGTTGATCTCGAACGGCGGCGATTGAAACACTCACGTCGTTTTCGGT 
GCGGAGAGTGACCTTGCCCTTGCCCTTTCCAGCGGGCTTTTTAGCATTGGCCTTTTCCAGCTTCAGCTTTTCCGAGTGCAAGATTTCCACGTTCA 
TCTTGACCTTTTTGATGTCAGCAGCGCTCACTAGATAAGGTGCGAAATCAAATCAGATAAGCCGTCTTTAATTGTCAGCCAGCTAATCACTCACG 
ATTCACGCATAGGCTGCGCACCAGATCCTCAACAAATTGGGGGAAGTGCTCCGACTCGCGGAATTGGCCCACCTTCCAGCTGAGCGTTGCGCCAA 
ATTCCTTGAACTCCTCCTTGGTCTCGGGGTTGAAGGCGTCAAGACCTCCGCACGTACTCGTCACACCGAACGCTTCCTGGGCATGCTTGAGATCA 
GAGGCCTCCTGGATTTTTTGCAATCGCAGCTTCTCAGCCAACTTTTCTGCAGGCGATAGGTTGGCCAAACGTTTTGCCTCCGCTTCCTCCTCTAG 
GAGCTGTTAATAACAGATGATAACAATATAGCCAATGATTGAGAAATGCTTTTGAGCCTTATCTCACCGCCTGCTGTTCCAGTTTGGCTTTTAGC 
GCCTTATTTGGCTTCGGTTTGGCTGGTGCTTCTGTTTTCGTAGGCTTCTCCTCGTCCTTTTTTTCCTCCTCATCCTCCCAGCTGTCCTGTGGGCA 
GCAAATAAACAAATGATTCGATCAGTCTGCTGCAAAAGTGCACACGTGTACAGCGAAACGAAAGTGAAACCGAGTGCAGAAAAACAAGTTTTGGC 
CCATGGCATTTGTTGTTGGGCACTTGGATAGTTGGTTTGAATGCGGCACTGCTTTCGATCTCACTTGGATTTCGTTAGTCGCACCTTGATGTCCT 
CATCCTCGTCCTCGCCCTCCCACTTGTTTACGCTGGCGGCGGCAGTCGGCCGGATGACCACCTCGCTGTCTGCTGCGGATTCTGCCAGGTAACAG 
AATGAGTTCCAGGAACATGGTGAATAACAATGATTTCAAACGGTTAGTTGGCACAGTGGCTAAAAATACTCATCAAGGCTGGTTGCACGGCACTC 
CAACAACAACAGCAATAGCAGCAGCAACAACAACAACAACGACGGAGACCACGTTTATTGTCCGCCGTGCAGATGACGCACCGCCCACTCTTCCG 
CCCAGGCCAATCGCAACTTACCCCAATCGTCGGCCATGTTAATGGTTGCCTATTTGCTTGCGGGATTTGTGCTTCTGGATGGGATGGTGCTGGGA 
TCGACTTGTTTCCAGCCTCGGCAGCGATTTCAGGAATTTTCCAAAACCAAATTTCTGTGGATCTGTAGTGTGCCCAAGGCGCGACTAGCGGAAAA 
TGCGTGGGTAACGTCGCCGATAGTCTGTTGCCATCGAATATTCGTAGTGCGGCCAAGGTGATATCAAGTTATCGACAATTTGAAATTTGTTACCC 
TACAATATTTCATTTTTCATTTTTTTCCTTTCTTGATTGTAAGATTGCTTATTTTATTTTCAAACACAAATTTGTCAACTTTGTTAAGAAATCTT 
TAAGTTTTTAAAGATTTGAGTTACCCTACAAGTTTTTTTCAAAAAAAATTCGTGATAAATCTCCCTCTTCGTGGTGTTCTAAGAAACTTACTCTT 
AGAGTTATAAGGAAAATTGATTCGACTCGATAAATTATATATAGTAGAGAGTATATACATACATAAGCATGTTCTTGATAAGAAGAACTAGCTGT 
GTCGTTATATACCTCGAACTCCCCAATTGCTGACTAAGGCTATCTTATAGTCGAGGAATTCTTTCGATTGAGAGGCTAAGTCATCCTTATTGGAG 
TTAGAATTGGTCTAGTTAAGATCTGGATTTGTTTGCAAAATGTGAATTATGTCAAACGTCTGCAATTCTCAATGGTTACATCTGTTTTTAACATA 
ATCATGTTAAATGTGGCTTTTTGTCCCAAGAACGGTTTTAAAGATAATTGTGTACTTTCTTTACTTGACTGTTTTGGCGATTTTGTTAACAAAGA 
ACAACATCCATTTCTGCATGCATAAAAAGTATCATTTTAATAACGTTAAATTAAATTGGTATTCTTACAGATTTGTATTTCATTTTACGTTCATC 
TTCAGCGATATCATTGATTTTCTTCTTTATCCACTCATAATATGAGGAAGTTTTCGTGTAAACCGCTGGCCCATATCCGTAACTGGAGGGTCCCC 
CTCCGTAGGACGTTATTCCGGCTAGGTAATAAAATTGATCAGCACCCCTGCCCACCGAAGCCATCAGCGGGCTACCGGAATCTCCCAGTCCGGTG 
TCGGTGCCATCCCATCCCATGGCACATATCTGGATGT 
(SEQ ID NO: 217) 

Exon: 2457.. 2302 
Exon: 2076.. 1985 
Exon: 1796. .1683 
Exon: 1618.. 1330 
Exon: 1265.. 1129 
Exon: 1063.. 1001 

Start ATG: 2317 (Reverse strand: CAT) 
Transcript No. : CT7734 

GCGCCTTGGGCACACTACAGATCCACAGAAATTTGGTTTTGGAAAATTCCTGAAATCGCTGCCGAGGCTGGAAACAAGTCGATCCCAGCACCATC 
CCATCCAGAAGCACAAATCCCGCAAGCAAATAGGCAACCATTAACATGGCCGACGATTGGGAATCCGCAGCAGACAGCGAGGTGGTCATCCGGCC 
GACTGCCGCCGCCAGCGTAAACAAGTGGGAGGGCGAGGACGAGGATGAGGACATCAAGGACAGCTGGGAGGATGAGGAGGAAAAAAAGGACGAGG 
AGAAGCCTACGAAAACAGAAGCACCAGCCAAACCGAAGCCAAATAAGGCGCTAAAAGCCAAACTGGAACAGCAGGCGCTCCTAGAGGAGGAAGCG 
GAGGCAAAACGTTTGGCCAACCTATCGCCTGCAGAAAAGTTGGCTGAGAAGCTGCGATTGCAAAAAATCCAGGAGGCCTCTGATCTCAAGCATGC 
CCAGGAAGCGTTCGGTGTGACGAGTACGTGCGGAGGTCTTGACGCCTTCAACCCCGAGACCAAGGAGGAGTTCAAGGAATTTGGCGCAACGCTCA 
GCTGGAAGGTGGGCCAATTCCGCGAGTCGGAGCACTTCCCCCAATTTGTTGAGGATCTGGTGCGCAGCCTATGCGTGAATCTGAGCGCTGCTGAC 
ATCAAAAAGGTCAAGATGAACGTGGAAATCTTGCACTCGGAAAAGCTGAAGCTGGAAAAGGCCAATGCTAAAAAGCCCGCTGGAAAGGGCAAGGG 
CAAGGTCACTCTCCGCACCGAAAACGACGACATTGACGGTTACCAAAAGTACGGAAATGACTTCACCGAAGACTATGACGACTTCATGTGA 
{SEQ ID NO: 218) 

Start ATG: 141 (Reverse strand: CAT) 

MADDWESAADSEVVIRPTAAASVNKWEGEDEDEDIKDSWEDEEEKKDEEKPTKTEAPAKPKPNKALKAKLEQQALLEEEAEAKRLANLSPAEKLA 
EKLRLQKIQEASDLKHAQEAFGVTSTCGGLDAFNPETKEEFKEFGATLSWKVGQFRESEHFPQFVEDLVRSLCVNLSAADIKKVKMNVEILHSEK 
LKLEKANAKKPAGKGKGKVTLRTENDDIDGYQKYGNDFTEDYDDFM* 
(SEQ ID NO: 219) 

Name: eIF3 p35-like 
Classification: translation factor 
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Celera Sequence No. : 142000013384170 

AAAAATAAAAGAATATTTAAAAAATCGTTGAAATCACCGATGACAAGCTCTAAATGCGCAAACTTC^ 

TAAATACAAAAATAGTAAAGCCTATTCGTATTTTAACTGTTGTGTTGTAATTATGCATAAAGCGTATTAATTATGTATGTGTAACAAAAATATAA 
GAGAAAAACAAATACAAACGAAAAGAAAAGAAAAACAAATAAAGAGTATGTATTACTAAACAAAACAAAAGCATATTTGCAGAATTGAAGGTGTA 
GACACTGAGAGAAAAGGAAGCACACATTTCATTTAAATTAAGCGCCGAAGAAGCTGCAATAACGCATTAAAAAGCCAATCAAATTCAAAGTTTCC 
TGCTTTTGCCATTAACATTATTATTAATATTATTTCCCTTATTATCATTTACCGATGCCAAAACATTGATTTAATTAACGAGCGGTTAAAAGAAT 
GGCAAAGAAAGTTTATTTATAAGACCGAAGCGATAATGATTATTTAAATATGAAAGAATTCAACTAACTTCATATTAAGATGTCCTTGTTATGTC 
GCAAAGAATTAATAAAAATAAAAGTGAAAAAACAATTGAAAACCAGAAAAAGCAAACATGACGAATTAATA 

AAATGAAATGATATTTTCAAATGGGAACAGTATGGATTTGGTTAGTACTCTTGATGACTAAATTGGACTTGCAGAAATCGAGGCACGAAGACATA 

TTTTTTCACTGCACTCATACGAGTACCGACCAATTTAATTTATTACGCATTTTATTGCACTGTGGCCAATGATCAGTTAAAATAAAGCAACGTTC 

CACTTTGCTAGTGAACCTTATCAAAAATAAATAATTGATTATGTAACTCTTGCTCTTTAATCTCAACACCCAATTACCTCGTTGCGTCTATTTTT 

C AAT AT T T CT T C AAT T T AAAG C AC AT GAT T T T G T T G AAT T T AT AAT T T T TAT T C AC T T AAAAAT GT AG CT C C AAAT AC AC T T AT AAT T T T AC G GT 

GTTTAATATATGTATTTATCTTTAAATGTATGTGTGTATGTGTATCGTTAATAATTATGTTTATAGAAACTCATATATCGGAGCCACTCTACCGA 

CAATGCACGAATGATTAGTGTGAGGCGCGGAATAGTTCGCCAATTAAAATGGTTTCGGATTAGGCGAGCCCAAATCCAAATGGGAACCTTCTCCT 

ACTGCACGTATCCGCGACCAAGAGTAAGGATGGTATCAAAGAAGTGAATATTGAATCGGGTATGTCATCTGATTTCGCCATCGGCTGGATGTCTT 

AGCCAATGGTTGCTGTAGTAAGTTACCGCGTCCAGGATGCGAATGTGATTGGTTTTGCTTGCTCGGTTTTACGGAAAAAATTCATTTAAAAAGTT 

CCATTTTGTGGTTTTGTTTTGCGGTGTAAAACTAGAAATCTAACTGTAAAGATCATCGTCGCCGTTGTCGCCGGGCGAGTTAACTGGCAAGTTGT 

TACCAGATCCTGAGGTGTTGCCGGTTTGACCGGGGAATCTGCAATAACATATACACTATCATCAGTAACCTGGATTCCAACAAGAAATGCCTTTC 

AATTCTCACCTGAAGTTCTGTCCGAATCCACGGGACTGCTGCAAAGTCTGGGCAAACATCTCGTACTTCCTGATGTCATTATCGGACACCGAACG 

GCGAGCGAACTTCATGGCCTCCTCGAAGTGGGCGCTAGTAATCTCCGGCACTGGATCGTCCTCGTCCATCTGATTATATGGGTAAATAGTCAAAC 

ATAAACATTCTTATTTTTTGCGCTAAGCTAAATGACTTACGTCCATCGCGGAGTTCTGATTTTCGGCGCGCTCCTTCTCGCGACGAATTTCAGCC 

TCGATGGCCTGTCGGATGGCCAGCTTGCAAGCCCGCTGGCAGATCTCGGTCAGATCGGCGCCAGAGAAGCCCTGCGTCACCTTGGCGATGTAGGT 

AAGGTCTACTTCCTTGGCCAGCGGCGACTTGCGCAAGTTAGCCTTCAGGATAGCCTCACGCGATTTGTCGTCGGGCAGCGGAATATAGATCAACT 

GAT C C AG AC GG CC C GG ACGG AGG AT GG C C G GAT C G AT AATGT CC GGGCGATTGGTGGCTCC GAT GAT G AAC AC AT T CTTCTTGGCTCCCATGCCA 

T C C AT T T C GGT AAGG AT C T GAT T GATC AC AC G AT C AGC GGC GCC GC C AGC AT C GCC GAC AT T AC CGCCACGGGCTTTGGC GAT C GAG T C C AGCT C 

GTCGAAGAAGAGCACACAAGGAGCCGCCGAGCGGGCCTTATCGAAGATGTCGCGCACGTTGGCCTCAGACTCGCCGAACCACATGGTCAGCAGTT 

CGGGACCCTTGACTGAGATGAAGTTCGCCTGGCACTCGTTGGCAATGGCCTTGGCCAGCAGCGTCTTACCGCAACCAGGGGGTCCGTAGAAAAGG 

ACACCGCGACTGGGCTGCATGCCAAACTTCAAGAACTTGTCCGGATGCTCCACTGGGTATTGCACCAGCTCCTGCAATTCCTTCTTGACGCTCTC 

CAGACCTCCGATGTCTGTCCAGGTGGTGTTGGGCACCTCAACAACAGTTTCACGCAGCGCCGATGGACTGGACTTGGTCATAGCGTAGCGGAAGT 

T CT CC AT T GT C ACGGC C AGC GAC GC C AGCAC CT C G GC AT CG ATC T T AT C GT C T T C C AAAT C G AT G AGGT C C AT C T T C T C AC GG AT T T G C T G AAG G 

GCAGCCTCAGAGCAGAGTGAAGCCAGATCAGCGCCCACATGTCCGTGGGATTCGGCAGCAATCTGCTCCAGGTCAACATCATCATGCAGCTTCAT 

GTTCTTGGTGTGAATGCGCAGCACCTCCAGCCGTCCAGTGGCATCTGGAATGCCGATGTCAATCTCACGATCAAAACGTCCAAAGCGACGCAGGG 

CCGGGTCAATGGAGTTGGGCCTGTTGGTAGCAGCCATAACGATCAGATGGGAGCTCTTCTTCATGCCATCCATCAGGGTCAAAAGCTGCGACACA 

ATGCGACGCTCCACCTCACCGTGGGTCTTGTCACGCTTGGGGGCAATAGCGTCGATCTCATCGATGAAGATGATAGCCGGCGAGTTTTTCTCGGC 

TTCCTCGAAGGCCTTGCGCAGATTCGATTCCGACTCTCCGGCCAGTTTGGACATGATCTCCGGTCCGTTAATCAGGAAGAAAAAGGCTCCGGTCT 

CGTTGGCCACAGCCCTGGCAATCAGGGTCTTACCGGTACCCGGGGGACCGTACATAAGAATACCACGCGGTGGCTTCACACCAATGGCCTTGAAG 

AGCGATGGATGCCGCAAAGGCAACTCAACCATCTCCTTGATCTGGGCCAGCTGCTTGCGGCAACCACCGATATCATCGTAGCCAACGGCGTTTAG 

GGACTCCTCCTCCTCCTCGCGCTTAATTGGATCACCGTCGCAGAAGATTACCGTCTCGGGAGCTACGATGCAGTAGGGCTCCGGGTCGGTCAGCA 

CCACCTTGAACTCAATAGGACGCATGGCGGCGCGCACAATAAAGTTGTCGCCCATGTGGATTGGCCGATAGGCCTCTAGGAAGTACGGCTTCAGA 

TAGATCTCGAACAGGTTGCCGGTAACGCCTTCGGTGCTTTCGTCGATGGGCAGGATGCGCACCCGTTTGCCGTACTTGACGTCCGGGCAGGACTG 

CACGGAAACCACATCTGAAAGATGGACGCACAGGTTGTTGCGCACCACGCGGTTCATGCGGATCTTCTCGTCGGGACAGGTGTCGTCCGAGAGCA 

CGATGCACACCGTCTCCTTTCGGCGCTTGCCCTTCAGAATCACGGTGTCGCCACGGAAGAGCTGCAGCTCATCCATCTTCGCCTGATATAGTTTA 

AG GGGG AAAAAG AAC AG C C AAGT C AT GT G AATT AGC C AT GC G AT G AC GTGGC C T T T GGG AG AGT G AA T CT T AC CT G C GAT AG C G AC ACC AC AG AG 

TTGTCGTCATTCTGCGCCTCTTCCACAATCAGTCGGTTGGGACGGTCCTTGCGCTTCAGGATCGCCGTAGCCAGATCTTCACTGTAGGGTAGAAG 

AAAAAAGCATCGCGTAAGACTCGTCGCTAAAGAAATATCTTATGTCTTATATATAATTTGGCCTAACCGCTTATCGAACGAGTCTTTCGGCTTCA 

TTTTCTCATCCTGCTCGCTGTGATATCCCCGCATAAAGTCCCTCGTATCCCCATCATGATCCATCTTATCGTTCTTGCCGCCAGACATTGTTTTT 

TTTGTTTTTTTTTTTGTTTTTTTGGATTGCCTTTTTTGTTAGTTTTGATAAATTTATGGAAACTACTTTCACTAGTAATACGGAATGGAACTAAG 

CATCTGGATAATTAGGCCTGCCGAATCATTTTTCGTTGCTAACGTGATGATGTCGATCAATCTTTATTGATAGATTTTTGAACATAAAAGGTTAT 

CACCCTATATCGTTCATAAAAATCTATATTAATCGGAATAATAGTACAATTGTTTATTGTCTTTAGTTATAAAAGTATGTTTGTATTTCTTGTAT 

TCGATGCCAACTTTTTTGGCGCCATTTGTTATAATAATTTCTAAACTTTGAGGTAAATTATGCATTTATTGTTACTTAAACGATATGTTTTTTTT 

TACT TCTGGCGAGTTGAGGGAGCACAAAGACTTTTCTAGAATTAATGC AAT ATC AAAAAAGGTATT ACT AAT AGGT AGCATTGAAAT AAGT GGTT 

TCGGCATATAGTTTAGGATTTTCGCGATTTATTATAGAAATAGCTGCCATATATGTATGCAGTATTGTTTTTCGTATCCAAAGTTATAACTACAA 

CAACCGGAAAATTGTATCCTCCAGCAAAGGATGACTTGTATTTTTGGGACAACAAAGGGAAATCGTAAGTTTTCAAGCGGCAACAACAACTGCAA 

GGGCAGTAACACCAACAACAACAACAACGTACTGCAAATGGCGATTCTCGCTGGCGTTTTTTTTTCTTCGGTTTTTAGGTTTGCCCTAGTTTCGC 

AACATTTTGCAACCGAGCAGTGGAAAAGTCGCGGTAAATGTCTTTGTCTCGCGGAAATCGGCGAAAGCCTTTCACAATTAAATAATAGTATGTCT 

GCGATGTGGCCCTTGCGTATTTTTTTCGGCTCAATGACTGAGCGAAATTTGTGAATTTCGACTTGGACCGACACAGTCGGCGCCTATTTCGGCGT 

CCAGTTCACTTACCCCTTGGAATCTGCCATGATAAACTAGTTGACGTTGAACTTTTTAACTTAATTGTACAATTCGGCGCGACTTGGACTATTCG 

TTGAGCTGGAAATCGGCACTTTTAGTCAAATTCAAAGATGAGGCAGCAGAAACAGATTGGTGATGTGCGTTACCGGACATTCATCGATTGGGCTT 

AGAAAATATCGATTACACGCTGCGATGGTTTGTAAAAGAAATTGAAGTGGCAACGCTAAAGGAAAGAGTGGCAATGTGTGCAGGTGCCTAACTTG 

GCTATGCTCGCTGGCAGATGATCGCTGCTATGAAGCCATGCGTGATAACGTAGCTGCTTACGTTTTTATTGAAAAAGTGCAGCATACTTTTCGGC 

TATGTCTATATCTAAAATATATGTAGGAAGTATGTAGAACTCAATCCGTCTATTTCACTTAGTTGTAGAAGTTTTGGTAGTAAGAAAAAATCTTC 

ATTCATTCATTTCCCTAGAACACTCGGCCATCAACCTGTCCCTACAATAACTGCTCAATCGCAGCTTATGCTCGTAAAAACTTTTTAGTATTCAT 

GGTTTTGGCATTAACGATGCCAATTATGGCAGCGTTTTATTTAGCTTTACTTTCAGTTTTATTTTTTAGTACTTAAGAGAGTGACAGCGACGTCG 

GCAGAGAGGAGAGCAGCGTCATTGGGCGTGGTCGCTGCCCCCAAAAGAATGGGAGCAAACTCCATAGAAACGCAGAACGAATTCAACCAATCGCC 

TGCAAGTCTCTTGTCTTCGAAGAGCGCCTGCGCAGAGAGCAAAAGAAAGTGGCGAGCGGAAACCAGTTTTTGTGACGATGAGGGTGACGATGAAG 

AGGAGTGGGGAGTCGGGGTGGCTGATAAAGATGGCCGTCGCCTTCCGCGGCGCAGGAAGCAGCGAAGAGCGGGGGGCAGGGCAGCGGAACCGCCT 

CAGACCCAAAGTCACAATTATGGGGCACTCATTCATTGCGGTGTATGATGCAAACAGCGCTCTCCTCGCTTTTCTCTCTTTTCCACCAAACTCTG 

CTGTACAGTCCGACGTCTTTAGCTCTGATTTATTCACACGATTTTTTGTTTTATTTTACTATCTTG 

(SEQ ID NO: 220) 
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Exon: 5336.-5278 
Exon: 2506. .1846 
Exon: 1779.. 1625 
Exon: 1558.. 1001 

Start ATG: 2490 (Reverse strand: CAT) 
Transcript No. : CT7776 

CCGATTTCCAGCTCAACGAATAGTCCAAGTCGCGCCGAATTGTACAATTAAGTTAAAAAGTTCTTGAAGTTTGGCATGCAGCCCAGTCGCGGTGT 
CCTTTTCTACGGACCCCCTGGTTGCGGTAAGACGCTGCTGGCCAAGGCCATTGCCAACGAGTGCCAGGCGAACTTCATCTCAGTCAAGGGTCCCG 
AACTGCTGACCATGTGGTTCGGCGAGTCTGAGGCCAACGTGCGCGACATCTTCGATAAGGCCCGCTCGGCGGCTCCTTGTGTGCTCTTCTTCGAC 
GAGCTGGACTCGATCGCCAAAGCCCGTGGCGGTAATGTCGGCGATGCTGGCGGCGCCGCTGATCGTGTGATCAATCAGATCCTTACCGAAATGGA 
TGGCATGGGAGCCAAGAAGAATGTGTTCATCATCGGAGCCACCAATCGCCCGGACATTATCGATCCGGCCATCCTCCGTCCGGGCCGTCTGGATC 
AGTTGATCTATATTCCGCTGCCCGACGACAAATCGCGTGAGGCTATCCTGAAGGCTAACTTGCGCAAGTCGCCGCTGGCCAAGGAAGTAGACCTT 
ACCTACATCGCCAAGGTGACGCAGGGCTTCTCTGGCGCCGATCTGACCGAGATCTGCCAGCGGGCTTGCAAGCTGGCCATCCGACAGGCCATCGA 
GGCTGAAATTCGTCGCGAGAAGGAGCGCGCCGAAAATCAGAACTCCGCGATGGACATGGACGAGGACGATCCAGTGCCGGAGATTACTAGCGCCC 
ACTTCGAGGAGGCCATGAAGTTCGCTCGCCGTTCGGTGTCCGATAATGACATCAGGAAGTACGAGATGTTTGCCCAGACTTTGCAGCAGTCCCGT 
GGATTCGGACAGAACTTCAGATTCCCCGGTCAAACCGGCAACACCTCAGGATCTGGTAACAACTTGCCAGTTAACTCGCCCGGCGACAACGGCGA 
CGATGATCTTTACAGTTAGATTTCTAGTTTTACACCGCAAAACAAAACCACAAAATGGAACTTTTTAAATGAATTTTTTCCGTAAAACCGAGCAA 
GCAAAACCAATCACATTCGCATCCTGGACGCGGTAACTTACTACAGCAACCATTGGCTAAGACATCCAGCCGATGGCGAAATCAGATGACATACC 
CGATTCAATATTCACTTCTTTGATACCATCCTTACTCTTGGTCGCGGATACGTGCAGTAGGAGAAGGTTCCCATTTGGATTTGGGCTCGCCTAAT 
CCGAAACCATTTTAATTGGCGAACTATTCCGCGCCTCACACTAATCATTCGTGCATTGTCGGTAGAGTGGCTCCGATATATGAGTTTCTATAAAC 
ATAATTATTAACGATACACATACACACATACATTTAAAGATAAATACATATATTAAACACCGTAAAATTATAAGTGTATTTGGAGCTACATTTTT 
AAGTGAAT 
(SEQ ID NO: 221) 

Start ATG: 7 6 (Reverse strand: CAT) 

MQPSRGVLFYGPPGCGKTLLAKAIANECQANFISVKGPELLTMWFGESEANVRDIFDKARSAAPCVLFFDELDSIAKARGGNVGDAGGAADRVIN 
QILTEMDGMGAKKNVFIIGATNRPDIIDPAILRPGRLDQLIYIPLPDDKSREAILKANLRKSPLAKEVDLTYIAKVTQGFSGADLTEICQRACKL 
AIRQAIEAEIRREKERAENQNSAMDMDEDDPVPEITSAHFEEAMKFARRSVSDNDIRKYEMFAQTLQQSRGFGQNFRFPGQTGNTSGSGNNLPVN 
S PGDNGDDDL YS * 
(SEQ ID NO: 222) 

Classification: enzyme 



Celera Sequence No. : 142000013384690 

AACCGGCTCCACCTCTCCGAGTTCAGCATTAAGTGTCTCATTAGCGATCTGTGGAGCTCGTGGTAACCACTGGTTGTGAGCCCAACAGCAGGCGT 
ATAGTTTCTCTCTCTCTTGACACAATTTTACCACATCCTCAATTTCCTCGCCAGTGCGATGGATGTACTCCTGTGCAACAATTTGATCGAAATCC 
GAATAGGTACCCAATTCCCAAGGGCTAAAAATTTTCGGAGATATTTTGGACAGATTTAACCTATCCAGCTGCACTCCCTTACTGTGGAGTAGGTG 
CTTTAAACGTGGCAATAAGACGCATAGATTTACAGGCCAATTGCCATGGTCCACGACGTGCCTAAAGCAAAGGTGTAAGTTGTCTTCGCTTTTGT 
ACAAGTCCTCGCCATCGTGAATACTTGTGTAATAGGAGAATTCTTGAAGATCTAATTGGTAGAGTATGCGCCAAATGGTTCTCATCAACCCTTCA 
CTTGTGCAGATCAAATACTTAAGATTGTGATCCTATTTGAAATGAGATTTGGGTCATTCAAGCTTTGCATTCCTTACTTATATTTAATTGCTTAC 
CTTGCTTCCTTCTGCAAAAAATATTTCCTTCGCTCTTTCGAGAAGAAATTGATTTTCTTTTTTCTCTACTGAACTGTTTTCATTTTCGGAATTTC 
CTTCGGTCGTAACCCTTGCTCCATCTTCCAGAATTTTTGTCGATCCAGCTTCAATTTCAGTGTTTGGTCTGTTCGTAGAGTTTCTTGAATTATTA 
GATGATTCAGCATGGATCATTTTAATCTTAGCCGGACTTTCCAGGTTATTATTTAAATTAACTGAACTGCTTTCTTTCGTTGGCTCTGAACTCTG 
ATCCAACGCTTTCGGAATTGTATGCATTTCAGTTGCTTGTGTGTCAGTTGCTGTTGTTTGAGCTGTGCAGACACTTTCTATGTCAACCGAATTCG 
AAGACAATTGTGGCTCATTTTCTTCAGCAGCAATGGTATTGGCAACAATATGCCCGGTAGATACGGAATACCTTTTATATTGTTCTTCTAGAAAT 
GCTTGTGTTACTGGGCAGTTGATTCCTTTAAATAATTTATCGAGAAAGTTATGGGCCGTATAATTCCACCCGATCTTCAGTTTGCTAAATATAAT 
CGTGTTGTATAAATGTGCTTGCTAATCCTTTAAACATTATTTTTTTTACCTCGATAATTCATAAAGCCTCTCAGCATTTAGTTGAATATAATCTT 
TTATGGGTTCCGTAACTTCTAGTTCATGCTGCAATTCCAGTGGCTGTGTGCCTTGGTTGGTAGTGACGTCTTCTGGGACAACTAATGCGTCTTCG 
GGAAAAGGCTCTGGTTGAAATTGAGACGTGTTTTGTGTAGTCATCGGCTTACTGCTGATCGCTTCCTTTGCGACTTTAGTCCTTTTTCTGCTGCA 
ATTATAAGGTTTATTACACATCCGGAAAAAAAGTAAAAATGTATGTGGTATACCTTTTCATTAAGATTTCTAAAGCATTTCTGCGAACAGGTAAC 
GACATTTCGCTTTTGGACTGAAGGCAACTTTCCTTGAGCGGTGTTCTAGTGGTGCACGTCGCCAGGCAATCAGTTTCCTGTAGACAATCATTCAC 
ATTTTTCCCTGGATAACTAAGCAATTGTTGTAGTATATGGCCGGTACATGGCTTGAATCGACATGCGAACTTTTCCCTCATCTCTGGTCCAGAAT 
AGAAGCCTTTGTAGTAGTCGCGAATAACCTCATCCTTGGATTTTCCTCTATACTCAACGCACTTCTGCATATTATTAGCAATATCATCTAGATTG 
CTCACATGACTTTTGAACAGCTCTAAGGATACAGGGTATGGTGGACACTCTTCTCCGGGCTTGTCGTCATTAGAACTAGGCGTCCTTGTGGATGG 
TATACTTTCATGTTCAGCTATTGGTCTGCTGGAGTCATGACTTATTACTATTATATCTGAAAACAAATATATGTTTAACTTACTTCTCTTCGGTG 
ATATTTTGCGTTTCTCGTACTACATTTCTGGATGACTGCAGACAACTTTCATCGCTTAGGTTTTCTGTGGTACAAGCCACCTGGCAATTTGGATT 
CTTTGGGGGTAGCTCCTTGTTTTTCGGTGGAAAACTTAACAATTGTCTTATTTTTTTGCTGGGACACGGTTTAAATCGGCACTCGAACTTCTCCC 
TCATGTCTGGTCCAGAATAGAAGCCCTCGTAGTAGTCCTGAATTACCTCCTCCTTGCTTTTTTCTTTGTACTCATCGCATTTTTGCATCTCATTA 
ACAATATCATCGAGATTGCTCACATGACGTTTGAACATCTCTAAAGAAATGGGATTTGGTGGACAATTTTCTGTTGTTGTTTTAGGAGTGAAAGG 
TACCTGGGACTTGGATTCGTTTACTGCTTTTGCATTATCTGTTCGATTCATTGTAAAAAGAGCCTTTTCATCAAGTGGCTTAGCTTTTTGTTTTA 
GCTTAGCCAGCATCTTAGGATTGGGGCAATTGAATTTGTACTTGAATCTTTGACGAACATCATTGCGTGCATAAAATTTATTGTAAAACTGGCGG 
AAAAATTTTTTGCATTGTGGGTGCGAAGGATCAGAAATTAATTTTGACAGTTGATGTTCGTCTTTAATGTGATTATTTACTAGAATTGCTCGGAT 
AATCTCGTCATAGTTAATATATTGCCGAAAAGTTTCAAGTGATACGGGAAAAATAAAGCGCCCATGTCTAAAAAATTTTTTTTGAAAATTGTTAC 
TTAAAAATTTAAAAAAATGGTAATTTATATTTTTCTTACTTTTTAAGGTCCACTTCAACATCCGTCACAAAGGAAGTCCTAAAATTGTGGTTTTA 
AAATTATGAAAACAATTCAGGTTACGCTAACTTACTCATATACATCGGTAGAATCTGTGATCTCTTCAGGTTTGGCTGGGGTCTCGTCCGAAGCT 
CTGGTTTTTGAAGCAGGACTCTCTGGTGTCTTGGACATGGGCACTTCGAAACCTGACGGAGGAATCTCTAACTGGGGCGCTCTATCTGATTGTTC 
TTCGGCAACCTCCTTGCTCTGATGGGGTTTAACTTGGGGACTTAAGACTTCAGATGACTTCTCAGGGGCTTTCGAACTGTGCAGATGGCGTTCTG 
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GCGAAGCTAATGCCGGCATACATCTGCTTAGTTTGTTGGCATCGTTCCTGAGATTAGCTTGGTTTTCGAAAGTTTCTAAAGTATTATGTGAACTC 
GATAGCCCTTCAGTCGTGCTAATAGCTCCCGCAGCACAATTCTCTGGCGAAAGAGTAGCTAACTGGGACAGACCTTCACTCGGTTCGTGGGCAAC 
CTCCATGCTGGGACTGGTTTGGATTTGTGAAGAGCCACTGGTATTCTGCGACCCTAATGACCCTTTGCTCTCAGTAATATATGAAGTGTTGGGCC 
TGCGCGAATCAAGTCCTGGCGAGGGATTAGCTAACTGAGGGAGGCTTTTCCTAAGCTTTTCGGCAATATCTTTGCCGAGAATGGACAGCATGCAG 
GCCTTGAAAGTCGACGTGCAAGGTTTTAATTTGTATGCGTATTTATATCGGATCCTCGGAGTAATGTAAAACGCCAGGTAATAGTCAAGGGCACA 
TTCCTGGACATCCTTGCTCGCATAATTAGCATCCAGTTGCCTCATTTGCTCCACAATGTCCGATAAGTTTGCCATATTTTTCTTAAATACCTCGA 
AGCTGACGATGGAACTGTGAAATATTTATAGTAAACAACGAGTATTTGGAAAGTGTCCCAAACTTACTTTTTCGTAAGGGATTTCATTGCAGTAA 
GCTCCGCCACAGTCTTGACCGCTTTTTTGTCCAGTGGTTTCCCAGATTCAAGCAACTTTTCCAAAGGGATTTCTGTAGTTGCATTGAAGATATTC 
GTAGGGGATCGCTTTTCTGGATACATGTAAAAGCGGTTGTAGAACCTTTGTAGTGTAAGAAGACGACGCTCATCGTCGGACAAGAGGACCAGGCA 
GTGTTCTATGCTCCTGACAATTTTTTTGAACAGTAACATCTTATTGATGCACTGTAGGAATACATCAAAGGATACCGGAAAGATGAAGGCGCCAC 
ATCTGAAATGAGTCAAAATTGGTTAATCTGCTCTGCTCGAAAATAATATTACCCACCTCTCCACATTAACAATTACGTCTGGAGATTCCGATCCT 
GATGGTTTACATTCAACTGCAACGCCTTCAACTGCATCATGGGACTTCGCGGTTTCAGCGAGCGTCGTCAACTGGCTGCCTTCAACTAAGACTTC 
GGGTTGCTCTACTTGAGATGGTCCTGCAGCTGCTACGTTTGCAACATCGTTTTGTGAAATTCCTCCTGGCTCCGGTACTTTAAGCAAGTTGTCGC 
GTGTCCGCCTGGGACACGGCGCCACCTTGACAGGGAACCTTGCCCGTATGCTGGGATTCCGGTAGAACTCTTTGTAGTACCATCTGCTCCACTCG 
ACCAACGTTCTATTGTCTCCATTGGCAGTTTTCAACTCACTTGCAATTTCAGACAAATTAACGATTAACTTTTGAAATGCCGCAAAGGACACTGG 
ATACATGTTGCTCCTGAAATCAAACAATATTTAATGTTAAGATGTAAAAGATTCACAGTAACCTTACGACTTGTTCTATGCAAGTAACGTTATTT 
GTTGTTTATCATAAACATCACATTTCAATACCGCGAAATAAACTAGGCTGAATATCACAGTGTGACTGTGCTGCAAAATGACAATATTTATCGAT 
AGCGAACGACTAAATATTTTCACTACGAGAACTCGAACTTACTTAAAATTTTACAAGCCAGTTAATACTTTTGTGCTTTATTGTTTATTGTTTAT 
TTATTTATTGTGCTGATTTAAACAGGGTTATAATTTCATAGCTAGCTAATTTTACATGTTATGATTTTCCAAATGACTCCAAATTTGTAAGGTGT 
GCTAGGTATAGTAGGTGGTATATTCCAGCGGTCGCCGCACGGTCACACTCGAGCTGCGATTCGGAGGGCTAGAAAATCATCGATGTTTACGTACA 
TCGATAGTTGGCAGGGCTTTGCTTTGTTTTGCAAATTCCAAGCGGAGTTGAGGCTGTGACAGATTGCGGGTGCGAGTTGTGAAGCAACATTGCGT 
CATCATCAGCTCCTGGACAGAAAAGAGCTCAGGGAACGAGGCAGGTCCGCGCCCGGGGACAAGGAAAGCCTTTAGCCACGAGGACGCAGAGGGAT 
CAAAGGCGGAGGAGGAGGCGGTGGCCGTGTCACGGGAGCTGGATCAAATGGCCAGCACCACTGGAAGTGCCCCCAGCCTGGGACCCCCCCAGAGC 
GGGAGTGGTGGCTCTGGCACATTGGAGGGCACGCTAAGCAAGTGGACGAATGTGATGAAGGGCTGGCAGTACCGCTTCTTCGTTCTGGACGAGAA 
TGCTGGACTGCTGTCCTATTACACGGTGAGTGGCTGCAGTGCACCTCCTGACTGAAACCCATATCGCACCCAGAACGAACTCCTCATGGCCCGCA 
GCCTTATCTTATCTGCTATAGATACATTTGGATACACATACACACTTGTTGGTACCCCTTTATTGCGTGATTTGTTGACAGGAGATTACTTTTTT 
GAGTCTGACTGAGGGGTCTGACTCTTGCGCCGCTGCGAGGCTTGATAGGCACTATTATAGAAGTCGTAAAAGCCGTAGAACATGCCAAGAATGAC 
GGTGGTGAAGTAGAGAAGCACTAAGGGCGGCATGCCGCACAATACCAGCTGGAAAGCAACCTGCGTGAGGATAAGAACGAACTGCGTCATCTGAA 
TCACTGTGATGCACTTCTTCACTGGAGTTAGGGCCTGCACCAAGGTCTTATCCGCCACTGCCGCCACAAAGTAGTACGAGTACATGATAACGTGA 
ACGATAGAATTCAAGAACACGCAGAAGTAAGCAGCAGAACCTGTACAATTGAACGCATTTATGAAAATAGTCGGTCACCCAGCAGCCGGCTAGCT 
TACCATTTTCGTTGAAGTTGATCAGTGCGTAGACCAGCGTTACCGTGGAGAAGTGATGAAAGATGTGCAGCTTGGACACCTGGTTTTGTTTCTTG 
CGAAGCACAAAGATCACGGTTTCAATCAGCTCAGAGATCTTCAGCCAGAAGAGAAAGTAGGCCAAGTTGTAGTACCGCCTCACGAGCTCTGGGCT 
GCTCCCGATGTCGCGGCACTTCCAAAAGATGTATATGGTGTTGTCCGTAATGTACAGGACCTGCGAGAGAAAAGCAATTAGCCGATGCTGCTGCT 
GCTGTACAATTTATAAATATCCACCTCTTTGATAGCATATATGCAGGAAACCACCTGGATGAAATTATGTAGCAAGATAAGTCGCTTGAGCTCGT 
AAGGTTTGCGCCATTCCATGAAACTATAAGCGAGGTATTAAAGGTCTTCTTTCTTATATAAGTAAGCCAGCGAACCCGCTGAATTGCTAATGTCA 
AGT C C ACT G AC C C AC AT G C AAAC AAGC GAC G CT G AC AT T AGC AT GT GAACT T ACT G C GG AC CCGCTTTGGT G AC G AAGT AC AG AT AG AGG AC CAG 
CACTGTGATCATGTACCAGGGATCCTTATAAATCGTAGGAACGTTCACCACCGGCTGCAAATTTATACCTACAATTTTATTGAAACCATCAGACA 
GTTTTTTGACAAATCCAAGAGGAGCTCACTTGCTGACGCCTCCATTTCGCTTCTTTGTGTTGAGCTTCCGCTGGCCTGTATCCTTTAGGTGGCGT 
GGGACTGGTGGACTGCAGCTCTACCGCCTTGACCGCTGGCGAACATTTAAGCGTGAACAGCACGCCGAACACTGTTTTTATATTCCCCAAGCATT 
GATTAGAATACATTAATTTATAGCAGGCGTAATAGCCCTTTGACAGTATGAGCTGCGATTTAATGAACTTTCCTAATTCCGACTGCATTTCTGTT 
ACAGTCGAAGGATAAGATGATAAAGGGCGTGAGGCGCGGTTGTGTTCGTTTAAAGGATGCGCTAATCGGAATCGACGACCAGGAGGACAACACAT 
TCACAATCACTGTGGACCATAAGACCTTCCACTTTCAGGTAAGTGTCTCCTTAGTAAATGGAAATAAAGATTGTCTAAGGAAAGCTTGATCAATG 
TTTTGCAGTTTCACTAATAGTTCATAATTCACCGTAATAGTTTAAGTACGACGTTGTTTGTTTTATGAACTGAAGAACCAACTCATTTTAAATTC 
CACACTTGCATTGATTTTTTATGACGTCTAGTGGTATACGTAAGCAAAAACTATGTAGATCAAGAGGGCAGCACGTTTCTTTTTGCAATTCGCGT 
TTTCGTCCCAAAAGATATAAAGCAAGCATGATGGCCTATCCGCTTTGCTTGCCAAGAGTTTTCGTCCTCTTACCGGAAAAGGGTATCCGGGCATC 

GACCCAGAAACCACTGTCTGGTCCTCACGTTCCACCTTTTGTTTGACTTTGTTGCCTTGATGTAACGATAAAACTTTTTCCAATACCCATACACA 
AGCAAAGGAGCCCACTAAACTTGCCAACTTTCCAAACGCTTGCAGGCGCGACACAACGAGGAGCGGGAGCAGTGGGTGCGCCGCCTGGAGGACAC 

CATTCGACGCCACGCGAACCGGTCGCGTCTGTGGGACAGCCAAAGCGCTTTCTACATCGCCGGTGGCTACACGAAGGAAGGGGGCGGGAGCGGAA 

CAGGCGGCAGAAGACCCAACCACCTGGAGTTGGTGGCGCGCCGTGTTTCCGAGGCAGATGCCTACCTCCAGCTGATGATTGA 

(SEQ ID NO: 223} 

Exon: 6777.. 6680 
Exon: 6623.. 6515 
Exon: 6388.. 6295 
Exon: 6235.. 5989 
Exon: 5930. .5600 
Exon: 4573. .4142 
Exon: 4087.. 3773 
Exon: 3719.. 2886 
Exon: 2833.. 2795 
Exon: 2727.. 1984 
Exon: 1925.. 1479 
Exon: 1420.. 1001 

Start ATG: 6695 (Reverse strand: CAT) 
Transcript No. : CT7814 

TCAAGGCGGTAGAGCTGCAGTCCACCAGTCCCACGCCACCTAAAGGATACAGGCCAGCGGAAGCTCAACACAAAGAAGCGAAATGGAGGCGTCAG 
CAAGTATAAATTTGCAGCCGGTGGTGAACGTTCCTACGATTTATAAGGATCCCTGGTACATGATCACAGTGCTGGTCCTCTATCTGTACTTCGTC 
ACCAAAGCGGGTCCGCATTTCATGGAATGGCGCAAACCTTACGAGCTCAAGCGACTTATCTTGCTACATAATTTCATCCAGGTGGTTTCCTGCAT 
ATATGCTATCAAAGAGGTCCTGTACATTACGGACAACACCATATACATCTTTTGGAAGTGCCGCGACATCGGGAGCAGCCCAGAGCTCGTGAGGC 
GGTACTACAACTTGGCCTACTTTCTCTTCTGGCTGAAGATCTCTGAGCTGATTGAAACCGTGATCTTTGTGCTTCGCAAGAAACAAAACCAGGTG 
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TCCAAGCTGCACATCTTTCATCACTTCTCCACGGTAACGCTGGTCTACGCACTGATCAACTTCAACGAAAATGGTTCTGCTGCTTACTTCTGCGT 
GTTCTTGAATTCTATCGTTCACGTTATCATGTACTCGTACTACTTTGTGGCGGCAGTGGCGGATAAGACCTTGGTGCAGGCCCTAACTCCAGTGA 
AGAAGTGCATCACAGTGATTCAGATGACGCAGTTCGTTCTTATCCTCACGCAGGTTGCTTTCCAGCTGGTATTGTGCGGCATGCCGCCCTTAGTG 
CTTCTCTACTTCACCACCGTCATTCTTGGCATGTTCTACGGCTTTTACGACTTCTATAATAGTGCCTATCAAGCCTCGCAGCGGCGCAAGAGTCA 
GACCCCTCAGTCAGACTCAAAAAAGAGCAACATGTATCCAGTGTCCTTTGCGGCATTTCAAAAGTTAATCGTTAATTTGTCTGAAATTGCAAGTG 
AGTTGAAAACTGCCAATGGAGACAATAGAACGTTGGTCGAGTGGAGCAGATGGTACTACAAAGAGTTCTACCGGAATCCCAGCATACGGGCAAGG 
TTCCCTGTCAAGGTGGCGCCGTGTCCCAGGCGGACACGCGACAACTTGCTTAAAGTACCGGAGCCAGGAGGAATTTCACAAAACGATGTTGCAAA 
CGTAGCAGCTGCAGGACCATCTCAAGTAGAGCAACCCGAAGTCTTAGTTGAAGGCAGCCAGTTGACGACGCTCGCTGAAACCGCGAAGTCCCATG 
ATGCAGTTGAAGGCGTTGCAGTTGAATGTAAACCATCAGGATCGGAATCTCCAGACGTAATTGTTAATGTGGAGAGATGTGGCGCCTTCATCTTT 
CCGGTATCCTTTGATGTATTCCTACAGTGCATCAATAAGATGTTACTGTTCAAAAAAATTGTCAGGAGCATAGAACACTGCCTGGTCCTCTTGTC 
C GAC G AT G AGC GT C G T C T T CT T AC AC T AC AAAG G T T CT AC AAC C GC T T T T AC AT GT AT C C AG AAAAG C G AT C C C C T AC G AAT AT C T T C AAT GC AA 
CTACAGAAATCCCTTTGGAAAAGTTGCTTGAATCTGGGAAACCACTGGACAAAAAAGCGGTCAAGACTGTGGCGGAGCTTACTGCAATGAAATCC 
CTTACGAAAAATTCCATCGTCAGCTTCGAGGTATTTAAGAAAAATATGGCAAACTTATCGGACATTGTGGAGCAAATGAGGCAACTGGATGCTAA 
TTATGCGAGCAAGGATGTCCAGGAATGTGCCCTTGACTATTACCTGGCGTTTTACATTACTCCGAGGATCCGATATAAATACGCATACAAATTAA 
AACCTTGCACGTCGACTTTCAAGGCCTGCATGCTGTCCATTCTCGGCAAAGATATTGCCGAAAAGCTTAGGAAAAGCCTCCCTCAGTTAGCTAAT 
CCCTCGCCAGGACTTGATTCGCGCAGGCCCAACACTTCATATATTACTGAGAGCAAAGGGTCATTAGGGTCGCAGAATACCAGTGGCTCTTCACA 
AATCCAAACCAGTCCCAGCATGGAGGTTGCCCACGAACCGAGTGAAGGTCTGTCCCAGTTAGCTACTCTTTCGCCAGAGAATTGTGCTGCGGGAG 
CTATTAGCACGACTGAAGGGCTATCGAGTTCACATAATACTTTAGAAACTTTCGAAAACCAAGCTAATCTCAGGAACGATGCCAACAAACTAAGC 
AG AT G T AT G C C GGC AT T AGC T T C GC C AG AAC GC C AT C T GC AC AGT T C G AAAGC CC C T G AG AAGT C AT CT G AAGT C T T AAG T C CC C AAGT T AAAC C 
CCATCAGAGCAAGGAGGTTGCCGAAGAACAATCAGATAGAGCGCCCCAGTTAGAGATTCCTCCGTCAGGTTTCGAAGTGCCCATGTCCAAGACAC 
CAGAGAGTCCTGCTTCAAAAACCAGAGCTTCGGACGAGACCCCAGCCAAACCTGAAGAGATCACAGATTCTACCGATGTATATGAGACTTCCTTT 
GTGACGGATGTTGAAGTGGACCTTAAAAAACATGGGCGCTTTATTTTTCCCGTATCACTTGAAACTTTTCGGCAATATATTAACTATGACGAGAT 
TATCCGAGCAATTCTAGTAAATMTCACATTAAAGACGAACATCAACTGTCAAAATTAATTTCTGATCCTTCGCACCCACAATGCAAAAAATTTT 
TCCGCCAGTTTTACAATAAATTTTATGCACGCAATGATGTTCGTCAAAGATTCAAGTACAAATTCAATTGCCCCAATCCTAAGATGCTGGCTAAG 
CT AAAAC AAAAAGC T AAGCC AC T T GAT G AAAAGGCT C T T TT T AC AAT GAAT C G AAC AG AT AAT GC AAAAG C AG T AAAC G AAT C C AAG T C C C AGG T 
AC C T T T C AC T C C T AAAAC AAC AAC AGAAAAT T GT C C AC C AAATC C C AT T T C T T TAG AG AT GT T C AAAC GT C AT GT G AGC AATC T C G AT GAT ATT G 
TTAATGAGATGCAAAAATGCGATGAGTACAAAGAAAAAAGCAAGGAGGAGGTAATTCAGGACTACTACGAGGGCTTCTATTCTGGACCAGACATG 
AGGGAGAAGTTCGAGTGCCGATTT AAAC CGTGTCCCAGCAAAAAAATAAGACAATTGTTAAGTTTTCCACCGAAAAACAAGGAGCTACCCCC AAA 
GAATCCAAATTGCCAGGTGGCTTGTACCACAGAAAACCTAAGCGATGAAAGTTGTCTGCAGTCATCCAGAAATGTAGTACGAGAAACGCAAAATA 
TCACCGAAGAGAAACCAATAGCTGAACATGAAAGTATACCATCCACAAGGACGCCTAGTTCTAATGACGACAAGCCCGGAGAAGAGTGTCCACCA 
TACCCTGTATCCTTAGAGCTGTTCAAAAGTCATGTGAGCAATCTAGATGATATTGCTAATAATATGCAGAAGTGCGTTGAGTATAGAGGAAAATC 
CAAGGATGAGGTTATTCGCGACTACTACAAAGGCTTCTATTCTGGACCAGAGATGAGGGAAAAGTTCGCATGTCGATTCAAGCCATGTACCGGCC 
ATATACTACAACAATTGCTTAGTTATCCAGGGAAAAATGTGAATGATTGTCTACAGGAAACTGATTGCCTGGCGACGTGCACCACTAGAACACCG 
C T C AAG GAAAGT T GC C T TC AG T C C AAAAGC G AAAT GT C GT T AC C T GT T C GC AGAAATG CT T T AG AAAT C T T AAT G AAAAGC AG AAAAAGGAC T AA 
AGT CGC AAAGG AAGC GATC AGC AGT AAGC C GAT G AC T AC AC AAAAC AC GT C T C AAT T T C AAC C AG AGC C T T T T CC C G AAG ACGC AT TAG T T GT C C 
C AG AAG AC GT C AC T AC C AAC C AAGGC AC AC AGC C AC T G G AAT T G C AG CAT G AAC TAG AAGT T AC GG AAC CC AT AAAAG AT TAT AT T C AACT AAAT 
GCTGAGAGGCTTTATGAATTATCGAGGTAAAAAAAATAATGTTTAAAGGATTAGCAAGCACATTTATACAACACGATTATATTTAGCAAACTGAA 
GATCGGGTGGAATTATACGGCCCATAACTTTCTCGATAAATTATTTAAAGGAATCAACTGCCCAGTAACACAAGCATTTCTAGAAGAACAATATA 
AAAGGTATTCCGTATCTACCGGGCA 
(SEQ ID NO: 224) 

Start ATG: 83 {Reverse strand: CAT) 

MEASASINLQPVVNVPTIYKDPWYMITVLVLYLYFVTKAGPHFMEWRKPYELKRLILLHNFIQVVSCIYAIKEVLYITDWTIYIFWKCRDIGSSP 
ELVRRYYNLAYFLFWLKISELIETVIFVLRKKQNQVSKLHIFHHFSTVTLVYALINFNEKGSAAYFCVFLNSIVHVIMYSYYFVAAVADKTLVQA 
LTPVKKCITVIQMTQFVLILTQVAFQLVLCGMPPLVLLYFTTVILGMFYGFYDFYNSAYQASQRRKSQTPQSDSKKSNMYPVSFAAFQKLIVNLS 
EIASELKTANGDNRTLVEWSRWYYKEFYRNPSIRARFPVKVAPCPRRTRDNLLKVPEPGGISQNDVANVAAAGPSQVEQPEVLVEGSQLTTLAET 
AKSHDAVEGVAVECKPSGSESPDVIVNVERCGAFIFPVSFDVFLQCINKMLLFKKIVRSIEHCLVLLSDDERRLLTLQRFYWRFYMYPEKRSPTN 
IFNATTEIPLEKLLESGKPLDKKAVKTVAELTAMKSLTKNSIVSFEVFKKNMANLSDIVEQMRQLDANYASKDVQECALDYYLAFYITPRIRYKY 
AYKLKPCTSTFKACMLSILGKDIAEKLRKSLPQLANPSPGLDSRRPNTSYITESKGSLGSQNTSGSSQIQTSPSMEVAHEPSEGLSQLATLSPEN 
CAAGAISTTEGLSSSHNTLETFENQANLRNDANKLSRCMPALASPERHLHSSKAPEKSSEVLSPQVKPHQSKEVAEEQSDRAPQLEIPPSGFEVP 
MSKTPESPASKTRASDETPAKPEEITDSTDVYETSFVTDVEVDLKKHGRFIFPVSLETFRQYINYDEIIRAILVNNHIKDEHQLSKLISDPSHPQ 
CKKFFRQFYNKFYARNDVRQRFKYKFNCPNPKMLAKLKQKAKPLDEKALFTMNRTDNAKAVNESKSQVPFTPKTTTENCPPNPISLEMFKRHVSN 
LDDIVNEMQKCDEYKEKSKEEVIQDYYEGFYSGPDMREKFECRFKPCPSKKIRQLLSFPPKNKELPPKNPNCQVACTTENLSDESCLQSSRNVVR 
ETQNITEEKPIAEHESIPSTRTPSSNDDKPGEECPPYPVSLELFKSHVSNLDDIANNMQKCVEYRGKSKDEVIRDYYKGFYSGPEMREKFACRFK 
PCTGHILQQLLSYPGKNVNDCLQETDCLATCTTRTPLKESCLQSKSEMSLPVRRNALEILMKSRKRTKVAKEAISSKPMTTQNTSQFQPEPFPED 
ALVVPEDVTTNQGTQPLELQHELEVTEPIKDYIQLNAERLYELSR* 
(SEQ ID NO: 225) 

Classification: hypothetical 



Celera Sequence No. : 142000013384693 

GCTTCAGAAGCGGCTCCTCCTTGCCTGCAAGCAGCTTGTCGGCTGAGTATATGTTGATCTGCCCCGCCTCGCAGCCGCCCACAATCAGGCCACTG 
GGGTGTGGTCCCACTGGGGACCAGATGAGTTTTTGGAACCTGCGGGAGGAAAAGCCGGTTATGTGAACGACTTGATGCACAAATTGAAAGCCATA 
ACTCACTTGTACTGGCTGGAAACGCTGGCCCTCAGCTCCAGATCGTATGTGGCATCGCTGAAGTTCGGTGAGTACAACTCCAGAGTGGAATTGGC 
ATTGGAATCGAATTGTTGGGCGGCGGTGCCGGCGGCCAAGAGAATCTGCTGCTGCTGCGCCGGCGACCAGGCGATGTTCACCGTCTTTTGCAGTT 
CCTTGATCTTCATCTTGCAATGTCCCTTCGTGCTGCTCGTGTAAATAAAGCGAAAAAACACAAATATTATGGGAGGTACAAGTTCAGAACGAAAT 
TGTGTCACGTCCAATTGGAGTAAGCCCCGTCATCACTTTTCCCATTGGCTTCCCCATTGCAGATGTGGCCCAGTAGTGCCCGCCGGCCTTTAAGT 
ATCTTTAATTATGCTAATTGCGCAACTTACACACAATTGTGAATGCAATTTGTACTCGCTTGCGCTGCAACGAGACTTTTTGATCTTGAATTTAA 
CTGGAATCGTTAATATTTTTAAGAACCGTAGGAACGGCCAGTGTGACCGTGCTCTGAGCCAACGTAATATACCACAGGGCACGCACTGAAATACC 
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AAATGCGTGCGACCAGAATATACTGCGAGTTTAACGGTATGCAATTTAAATATTCCGTTTCAATGTGCAATTTTAAAAATAGTTTTTATCAGTAT 
ACATTATTAAAAATGATAGTCGAAATTATTTAAAGATTTATTTTCCATCGCATTTTATATTCGTAAACTAGCTTCAGCACTTTTAATATTAATAT 
TTATTATGAACTCATTTTAAATTCCAGATCACTTTGCATTTATTTTTTGACTGGAACTGGACTCGAATGTAGATCTCTTTATTATAGGTATCTTT 
CATAAATATAAGCTATGTTTATTTAATACGCATGGTATGCATCCACTCATTTGGCAGACGATATGGCCGCCACACGCTGCTCCACCTTGGCCACC 
TTGCTGTTGATAGTCTCCAGATTCTGGGCGAAAGTCTCCTGTACGGAATGCAGCAGCTCCTTGTTGTTCACCAAGCTAGTGGTGATGGTTCCCTG 
CTTCTGCTCAATCTCTGCGATGATCTTGGCGAAATTGTTTGCTGCAGAGCAAGTTGATTAGTTACAGAAACAAGCACAATGCTAAAATGCTAAAT 
AGAATTGCTTACCATATTTATGGAGGGCCTCGAGGGCTTGCATGCGTTCGATGACGTGCGGCAGTATTTCCACCACTGGCTCCGTGCGCTTCGCG 
ATGTCGTATAGTTCCGTAATCTTCTGATCTCGTTTGGCGTCCTGGGCACTGCCGCTGGACTTTTCGGCGATAGCATCCATCTTGCCGGCCAGCGA 
GGTCAGGCGCTGCTCGATGGTGTCCAGTTTATCAGGCTGTATCAGGGCCGCCTTGGTGCTTAGATGACGCACTGCCTCTAGTACATTGGTGGTGT 
TGGTGGCGGCGGTAAGGCGGCTCAACTTGTCCGGCTGAGCGCCCAGCACCTTCTCCAGCTGCGAGATTCGCTGCTCTAGACTGGCTACGCGGGCC 
GTGGCCGCCAGATCGGTGCCAGGCGTGGGTATGGCTGTGAGAACGCCGGACTGCTTGAACTCCTCCACCTGGCTAATGAGTGCTTTCACCTGCTT 
ACTTCCAGGCGTCTGCTCCTTGCCCAGCACTTGCTCCAGCTTCAGCGACTCCAGCACCTTTCGGGCCGTGCTGATAACCGTGGCCACCGCATCGT 
ACGACTGCTTCTCCTCGTCGGCTACCTTGCGGTCCACCTGCAAGGCGGCCACCTCGTTCAGAAGCTCGTTCATCTCGATCTGCAGGCGCTGGCAC 
TTCTGCACCGGCGTCTCCTTCTCGCCCTGGCCAACCAGCTCGTAGTCGCTGGAGCCGCGCGTATCGTAACCCCGGCACATGCGTCGTCCAATGCG 
ATCCGTGAAGTCCACACTCCCCTCGACCGTTGCTCCGCTGAAGCGCTTGTGAGCGACGCTCGGCGAGATGTGCAGTCGCTCGATGGCTTCGTTCT 
CCGGCTCCTCTTCGTAGTAGTCGGATGTATCGAGCTCCGGGTCATCTGGAGTTTCGTACACGTCCGGCTGGTCATAAGCCTAAAAACATAAAGTA 
CATATATGTATAAATGGGCAAAAACGAAAGCTTAAACGTCACTGCCATGGTGCGGAGTTCCGACTTACTATTCCCGGTAGGTTCTGGAACTTGGG 
ATCGGCCATTATTGTGGGCACGTCGACGGCGGTGTTTGGAATCCTTTGGCAACGAAATAATTATAAACCTTCAAAAATCCTTTTTTACAACAATC 
AATTCGATCTGATGTCATAAAAAAAGTTACCGTGCGCGGTTACCAAACAAATGCCAAAGTGCAGTTGTCATTGGAGTATTTATAAACAGAGGTAT 
CT AAAAAGT CAT T T AAAAAAG AC AGT T AT T AC T T G T AAC GG C T AAT AAGC T AAG AGT AAAT AGT AT T GC AAAAT AT AT CAT CGAT AT AAAT AAAT 
TGTACTTGCACATTGTACTTTGTTAACTGGATTCATAGATGCAAACATTTATAAAAATAAATTAAAGCTTTTTGTGCTAATTTAAACCTATTTTG 
ATTTTAGGAAAAACGATATACATTTAAAATCATTCATAAGCAATAGGTAAATAATTCCATGATCGACTTCCAATTGGTAAATCAATGAAGGTACA 
CTTTTCTTCTGGTATTTTATAGACCAGTGAGGATAGAAAAAAGGCACGGTCCTACTGTACCTAGCACAGGTACTAAGTTAGTCAACACTGTAGTT 
TTCGTGTAGTTTGCGGGCGAAGCGAGTGGGCGGTGGGAGCTGCTAGTTGGTTGTACTAATCTTAATATTTTGCATCCGAAAACAATAGAGTAATG 
TCTGGCTTAGAGTACGGGTTGCATAGATACGATGATCTGCCGCTGGAGGTGAGCATATAATGTGAACGGACTGCCGCGGTCCCGTGACCAATCCA 
TCATTTCGAGCGTATCTATATCTGCATATTCATATACCTGTTAGGGCAGTCAGCTGATTTTGTTTTTTTCTGTTCCGCCCCCAGATTGTTTTGAA 
GATTTTCAGCTATCTCGGCTACTCGGACTTACAAGCGGCAGGATCCACCTGCCAGCGATGGCATGCGGCGCTGGATCAGGCGGAATTTAACCAGC 
GCACGCGAGTCTGCTTCTCCAAGGTGGTGCTATCGGATCAATTGAGTCCTGGAGTGGATCTGATGCGCTGCGAGCGGCGTTTCCAGCACTTCCTG 
T T C G AGG AC GT G AC ACT GGG AC AAGT AAAGG AAT T GAT GC G C T T C AT GGG C AGGAC CGCTCAGTCC 
(SEQ ID NO: 226) 

Exon: 2486.. 2349 
Exon: 2264.. 1343 
Exon: 1276.. 1001 

Start ATG: 238 4 {Reverse strand: CAT) 
Transcript No. : CT7980 

GACATCAGATCGAATTGATTGTTGTAAAAAAGGATTTTTGAAGGTTTATAATTATTTCGTTGCCAAAGGATTCCAAACACCGCCGTCGACGTGCC 
CACAATAATGGCCGATCCCAAGTTCCAGAACCTACCGGGAATAGCTTATGACCAGCCGGACGTGTACGAAACTCCAGATGACCCGGAGCTCGATA 
CATCCGACTACTACGAAGAGGAGCCGGAGAACGAAGCCATCGAGCGACTGCACATCTCGCCGAGCGTCGCTCACAAGCGCTTCAGCGGAGCAACG 
GTCGAGGGGAGTGTGGACTTCACGGATCGCATTGGACGACGCATGTGCCGGGGTTACGATACGCGCGGCTCCAGCGACTACGAGCTGGTTGGCCA 
GGGCGAGAAGGAGACGCCGGTGCAGAAGTGCCAGCGCCTGCAGATCGAGATGAACGAGCTTCTGAACGAGGTGGCCGCCTTGCAGGTGGACCGCA 
AGGTAGCCGACGAGGAGAAGCAGTCGTACGATGCGGTGGCCACGGTTATCAGCACGGCCCGAAAGGTGCTGGAGTCGCTGAAGCTGGAGCAAGTG 
CTGGGCAAGGAGCAGACGCCTGGAAGTAAGCAGGTGAAAGCACTCATTAGCCAGGTGGAGGAGTTCAAGCAGTCCGGCGTTCTCACAGCCATACC 
CACGCCTGGCACCGATCTGGCGGCCACGGCCCGCGTAGCCAGTCTAGAGCAGCGAATCTCGCAGCTGGAGAAGGTGCTGGGCGCTCAGCCGGACA 
AGTTGAGCCGCCTTACCGCCGCCACCAACACCACCAATGTACTAGAGGCAGTGCGTCATCTAAGCACCAAGGCGGCCCTGATACAGCCTGATAAA 
CTGGACACCATCGAGCAGCGCCTGACCTCGCTGGCCGGCAAGATGGATGCTATCGCCGAAAAGTCCAGCGGCAGTGCCCAGGACGCCAAACGAGA 
TCAGAAGATTACGGAACTATACGACATCGCGAAGCGCACGGAGCCAGTGGTGGAAATACTGCCGCACGTCATCGAACGCATGCAAGCCCTCGAGG 
CCCTCCATAAATATGCAAACAATTTCGCCAAGATCATCGCAGAGATTGAGCAGAAGCAGGGAACCATCACCACTAGCTTGGTGAACAACAAGGAG 
CTGCTGCATTCCGTACAGGAGACTTTCGCCCAGAATCTGGAGACTATCAACAGCAAGGTGGCCAAGGTGGAGCAGCGTGTGGCGGCCATATCGTC 
TGCCAAATGAGTGGATGCATACCATGCGTATTAAATAAACATAGCTTATATTTATGAAAGATACCTATAATAAAGAGATCTACATTCGAGTCCAG 

TTCCAG 

(SEQ ID NO: 227) 

Start ATG: 103 (Reverse strand: CAT) 

MADPKFQNLPGIAYDQPDVYETPDDPELDTSDYYEEEPENEAIERLHISPSVAHKRFSGATVEGSVDFTDRIGRRMCRGYDTRGSSDYELVGQGE 
KETPVQKCQRLQIEMNELLNEVAALQVDRKVADEEKQSYDAVATVISTARKVLESLKLEQVLGKEQTPGSKQVKALISQVEEFKQSGVLTAIPTP 
GTDLAATARVASLEQRISQLEKVLGAQPDKLSRLTAATNTTNVLEAVRHLSTKAALIQPDKLDTIEQRLTSLAGKMDAIAEKSSGSAQDAKRDQK 
ITELYDIAKRTEPVVEILPHVIERMQALEALHKYANNFAKIIAEIEQKQGTITTSLVNNKELLHSVQETFAQNLETINSKVAKVEQRVAAISSAK 



(SEQ ID NO: 228) 

Name: dynami tin -like 
Classification: motor_protein 

Celera Sequence No. : 142000013384768 

GAAATGAGTTTTGTTTTGTTTTGTTTCTTCCGAGTGTTTGTGTGTTTATGTGTGTGGGGAGGGGGTGTATTCTGTGTGCGGCGTGTTTTGAGTGA 
GAGAGATAGAGAGAGAGCGCGAGTTGCTGCGGGTCGACGGTCGAAGGGGGTGGTGGTGGAGTTGGTGGTGGGGGGCTTGGTTTATACGACAAGTT 
TTGATTTATTATTAAATTTAAATACCTATGTATATAAATTACGTTTCTGTTTATTATAATTATTGTTGTTTCCATCTTAATCATTTTGTTTTCTT 
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TCGCTTTTTATTTTTCATATTTTTTTTTTGTTTTTTGTATGTAGTGGGCAACTGCGGCATTTGTAAATTTCTTTATATTTAAATGTTCCTTTTGA 

CTGTGTAAAAATATCGATTTCCATCTGATCTTGTTTTGTAGTTACTATTCATGTATGCATTTGCTTTTCGTTTTAGTTAAATTACGTTAACGCTG 

CATTCTGAACTCTCTGTTGCTGCTGATTCTGCTGCTTGTTCTTCGGATTCTTGATCGGATTCTTCTGATTCTTCATCTTGGCGCGCTTTCTATGC 

TATTGAAATGTTGTTTGTGGTGCTGCTTACTGCCTTCTGGATCGGGATCCTATATTCCGCTCTGATCTGATCGGTTATGTTCATGTTATGTCGGG 

TTGTGTGTGTCTGTGTTGTGTGTATGAGAGTGGAGTGTGTTGGGTGTTCGCATCTCGTTATATGTTGTATATATATATATATATATGTGTATATA 

TATATATATATAATATATTATATGTATGTATAGTTAGTTGCTTAATAGTAGAATTTCTTGAATGTTGTTGTGGTAAAGTGTAGGTAATGTTGCTC 

TATGTCTCTATGTCGACTCAACTGCACTACATACATATGTCTTTCCACCGGATACGCGGCGAACCGAAACGTGCCGAGCCGAACCGATCCCCGTT 

TCTGACCCGCATCCGCATCCGTATCCGTATTTCGTTTCCTCACCTCCGCCTCATTCGCTGCGCAGTTTCTTTAGATCCGGTGACTTCTCCCCCAC 

GCCCACACCCACTCCGCCGGCGTTCTTCAGCTCCATATCGCAGTGCAGGGCAGTACGCTTCTGTCCAATGGCTGGTGCCGAGTCTGCACCCGCTG 

CTACCGGTGAAACCGCATCCACCGCCTTGTTCGCCGCCAGCTGCTGCAGTTTACTGTCCCTCACAATGCCGTTGCTGCTATTGCTCTCGTCAGCC 

TCTAGCTCCGCCTCATTATCCTCCTCCTCGTCCAGCTCATCTTCCTCCTCGTCCTCCGCCTCATCCTCATCCTCGTCGCCCAGGACGCCGTTCCC 

GTTACCATTTCCATTGGCTGGTGCTGCAGTCGCTTTGCTGGCTGGCTGTGGTGCTGCTGTAACCGCTTCCTCTTCTGCCAAAGGAGCAGGCTGAC 

CGTTGTTGGTGATCGAATTGTTAGTTTTGTGGGGCTGGCTCGGCGACTGCTTTTGATTCTGATTCTGGCTCGAGTGCTGTGTGGGAGTCAGGATG 

GGCAGGCTGTTGGTCTTGGGCGGCGCCTCCTGATTGCTTTCAGAGTCCGAGGCGAGCGGTGTTCCACTCGAACCGATGCCAATGCCAATGTCTAG 

GGCGATGGGCTCACCGTCCTCTGTGGAAGTGGTGGCCGTTGTCTGTGTGGTGCCATCGTTGTCATCGATGATCGGCTCCTGGGAGGTGGAATCGT 

TCGAAACAAAGTTGGAGTCACCGTCCTGCAGGGCGGCGCCAACGGCAGCTGCGTTGTTCTTGGCCTCGCCCGACTGTAGAGGCGTCAGAGCAGCC 

GGGTTGGTGTTCTCATTCTCTGTCTCCGATTCTGTAGCAGTGTTGCGACCGCCTGGGGGATAGAAAGCATTGGATTAGCCATCAATGGTCAATTT 

GTTAACTCGGCGAACTGCGGTCAGAGTAATCAAACAAAGAAACTCTGTGTTTGAAATTATTAGACATCCTAACACTTTTGAAAAGCTGCCAACAA 

GTATAAGCTACTGATAATATGTTAAAAGTTAACTATTTTACAGTAGTAACTTTACAATTCCTCCTTTTTATTTAACATGACGCCACGAGTAACGA 

AT AAAT T GT AAAT T GT AAAT T GAC C AC T AGGC T AC T AC AAAC AT AAT GT T T AATGG AAAAT T CCGG AT AGG AAAAAT ACT T GGT T T C T GG AAAAG 

ACCACTCGGTTTGAGTGTTTTGTTGCAAGGGTGTATACTCAAGCCGCAGTTATCGGATCATCTTAGCTGCTTACCTGCTCCTTGGCTTTCGTCGT 

CCAAATCAACACCCACACCATCGTTGGTATCGTCGCCATCGAGCACCTTGCTGCTGTTGCCGTTTGTGCTGAGTGAACCGGTGCCAGTGGAGGCG 

CCGGATCCAGGCGGCGGCATATCGTCGAGCATCTCGGGACTGAGATCATTGGCCACATAGTGTGACCAGAGGGCCAGCTCCACGCGATGAGGCGA 

CCAGTGCGGCGTATCCCCGCCCACCTCCGCATTGAGGCGCTCCACGGTGGCCTGAATGTGATTGACGAAGTTGAGGTACTCCTTGGTGGTGTAAT 

CGATGCCCTCGATCTCTGGTATGGCCATCAGGCACTCGTCGGCCATGAATGGTGCCGAATCGGGAGCTGCGGCTGCCAGCAGTGCACTGGCCATT 

GTGGTGCCAACGCCCTTGAGGTTCGATAAAGCTGTGATCGCCTGCTCCAGATTGGGCAGCTTGCGGAAGGCCTTCTTTGTCTCCTGGATGACGGC 

GCGCGGTGTGTTGACCTTGACCAGGTAGGATAGCTGCGGATAGAATTTGCCGCGCGACTGCTTCCACTTCATCGACTGGACGAGCTCATCGTATA 

CCATATGCGCGTCCTTGCCGCGTGCCTTAATCAATTTGGGCAGTTCATTCTGATACCACTGATCCAGGCGGATCAGCTCTTGCGGCTTCTTGCAG 

CGCTTCTCGGCCTTTAGCTTAAGAACCTGGGGATAGAGCTGGTAGCAGTACTCGAACTGTTTGGTGCTCCCGGTCTCGAAGAACGAGACCGTCGC 

CTTGCCGTTGGACATCTTCTGCGGATGCGGATGTGTCGTTGTCGCCTGCTAATGATTGGTGGTATCGCTGTATCTCCTTATCGTGGTATCTATCG 

CCTTGGGGCCCAGGACGTTGGCTGATCCGCAAGTGACCGGCTGGCTATATATGCGGCAATATATGTACTTTGGTGTAAGCTACTATACGCCAAAT 

CAATTTATTGCTTACTGTTTAGTCTCCTGCTTTTCCTGTCCGTTCCGAGCTGCTCCTGTCAACGTACTGCTATTCGTGCGTTGAGAGTTCCCGGC 

GAGAAAGAAGCGAGTACGTGAGAGACAGAGAGAATGCGAGTGCGAGAGAGGGAGCACGAAGAAGCAGCAACGCAAAGGAGAGTGGATGAAAAGAG 

AAGAAAAGAGGCACACAATTAGGTTTACAGTTACGATTACATATGCAAGGGGGATCGGTGCGGTGGCATAGGGAGGTAGTTCGATTGATTACTGT 

TAATGCTGCTGCTGCTGCGGCCGTGTCTCTGTTGTTTGTTGTATTTACTCAATCCGATAATCGCTCACTATTTGCCAAAAATCTGCTCGCTTCCG 

CCGCCGTTTGCACAGCACACAAATCGAATTTTTACAGTGCTGTCGTCGGGTTCTTTTTAGTCCGCCATGCACTCACACAGTCAAACTGACAGGCA 

CACGGATGGACACACACACACTTACTCAGCACACTCGCTCTCGCACACCCAAAGAGAGCCAGGCGCTCATACACAGGCAAAAACACGCACACAAC 

ACACACACTGACTTAAACACGCACACACTTATATACTGAGGCAACAACAGCAGCAGCAGAAAATTTCTTTTTCATCTAGGTTTTCAAAAAAAATA 

CGAACGCGAGCATTAAATAAACGAATGCTACTATTTTTCGCTGTTGTTGTTGTTTGTGCTTTTTTCGTACCTCTTGTCTTTTTGGCAAACGGTGA 

GAACAGCTTTTCACTAGCGCACTGCACGCACACAGATGAGTGGGTGAACAACAAATACCTTTTAGTTAGTCGCCGCCAGTCTTCCTTTCCGTCCA 

CATCCAAACAACTACTCCAATGCTTTGCTGTTTTTCTGTGTTGCCTCTGCTTTGCTTTGCTTCGCTTCGCCTCTGTGCTTGTGCTGTGCTGCGTG 

CTTCGTGCTGTGCCCGTGCTGTGCTGCTTCTTCTTCTTCCTCTGCTGTTTCAGTTTCTGCTTTCTGCCTGCAGGGGCGCCGTTGACAACTGACAA 

ATAAAAAGGGGTTACTCTGTTCTCGAAATGAGCACGCCACCGAAGGCACAACAAAAAATTAAAAAGGATCGGAGCGGAGCGATGCAGCTGTGCGA 

TGTGCTGCTAACTCGTGTGACGGGAATTGGGAATTGGGATTGCGAATGGGAATGGGAGGGAATACGAGAAGCGAATATGGGAATGACGAACGACG 

ACGAACCGGGAACAGGAGTTGTCGTTCTATCTCGCCCGCACTCGGTGCCCGAATGTATCTGTATCTCTCTGGCACCCTCGGCGTTCCATTTCATT 

TTGCCGGCGACGTCCGTGCTCTCTGCCGCCGCTCTGCCATCGCCTCTGCCGCTGCGTTTATGCGCAGTGCGTCATTACACGAATACCTGTTTAAA 

GCGGAGGCATAAACGCGGCTTTGTCGATCAACATCCATGTTACCGAAACCCATGTTCGAGTTATAGAAACTTCGAGTAACAAATATCTATATTCT 

AGCAATGTTTACCCTTATTAGCTTAGTTAGTGCAATTATGTTTCATGTTTTTACAAACAGCAGAATGTTTTTAACAATAGTTATCTCTATATTCA 

GGTCCTTCAATTAAATCAAAAAGGAATGGGTTTTGTGAGTGGGTTGGGCTCCAGTGAGACTTGTTAATAATTTAAGAATTCGCGATATTCTAGTG 

TAAACAGCGCGTCACTGCATGTTCGCTGCTTTTGTTGTGCCAGTTGCCAGCGACAGCGGCGTCGTGTCATTATTCATATTCATGAGGGGGTGCGT 

CAGGTTTGCTGAATTTTACTGACTTGTTTGCGCGAACCGAAATTATTTATGGTTCAGCACGAAGAGCGATAGAAATTCATGCATATGTTGTTGGT 

TGGTTAAAAAGTTGCCCTTACATGCAGTTAAGTGCATATAATTTGTCTACTTGCGCATTTATTAAAAATGCATACATATTAAATAATGTACTTAA 

TTACTCAGCGGTTTGAGGGTTGGAGCATTTACAGATGACACTGTAGTGATCAAATGGTAGCTATACATGTATAAATATCTAAAATATTACAAAAC 

TATTCCAGCGTTCAAGCATTAACTAATTCGAGAACAGCTGCTGGTTCCATAATCGAGTACACACGTGTGCTACTCAAAGTCTAAGCTGAGAGAGC 

GGTCGAGCGGCAACTTGATATTTAAAAAACAGAACCCTACTTAACAGTCGACTCAAGTTCGATCCCCCCGCAGGTACGCCCC 

(SEQ ID NO: 229) 

Exon: 4497.. 4049 
Exon: 3299.. 3246 
Exon; 3179.. 2260 
Exon: 1857.. 1001 

Start ATG: 3055 (Reverse strand: CAT) 
Transcript Wo. : CT8088 

AGATAGAACGACAACTCCTGTTCCCGGTTCGTCGTCGTTCGTCATTCCCATATTCGCTTCTCGTATTCCCTCCCATTCCCATTCGCAATCCCAAT 
TCCCAATTCCCGTCACACGAGTTAGCAGCACATCGCACAGCTGCATCGCTCCGCTCCGATCCTTTTTAATTTTTTGTTGTGCCTTCGGTGGCGTG 
CTCATTTCGAGAACAGAGTAACCCCTTTTTATTTGTCAGTTGTCAACGGCGCCCCTGCAGGCAGAAAGCAGAAACTGAAACAGCAGAGGAAGAAG 
AAGAAGCAGCACAGCACGGGCACAGCACGAAGCACGCAGCACAGCACAAGCACAGAGGCGAAGCGAAGCAAAGCAAAGCAGAGGCAACACAGAAA 
AACAGCAAAGCATTGGAGTAGTTGTTTGGATGTGGACGGAAAGGAAGACTGGCGGCGACTAACTAAAAGCAGTACGTTGACAGGAGCAGCTCGGA 
ACGGACAGGAAAAGCAGGAGACTAAACACCAGCCGGTCACTTGCGGATCAGCCAACGTCCTGGGCCCCAAGGCGATAGATACCACGATAAGGAGA 
TACAGCGATACCACCAATCATTAGCAGGCGACAACGACACATCCGCATCCGCAGAAGATGTCCAACGGCAAGGCGACGGTCTCGTTCTTCGAGAC 
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CGGGAGCACCAAACAGTTCGAGTACTGCTACCAGCTCTATCCCCAGGTTCTTAAGCTAAAGGCCGAGAAGCGCTGCAAGAAGCCGCAAGAGCTGA 
TCCGCCTGGATCAGTGGTATCAGAATGAACTGCCCAAATTGATTAAGGCACGCGGCAAGGACGCGCATATGGTATACGATGAGCTCGTCCAGTCG 
ATGAAGTGGAAGCAGTCGCGCGGCAAATTCTATCCGCAGCTATCCTACCTGGTCAAGGTCAACACACCGCGCGCCGTCATCCAGGAGACAAAGAA 
GGCCTTCCGCAAGCTGCCCAATCTGGAGCAGGCGATCACAGCTTTATCGAACCTCAAGGGCGTTGGCACCACAATGGCCAGTGCACTGCTGGCAG 
CCGCAGCTCCCGATTCGGCACCATTCATGGCCGACGAGTGCCTGATGGCCATACCAGAGATCGAGGGCATCGATTACACCACCAAGGAGTACCTC 
AACTTCGTCAATCACATTCAGGCCACCGTGGAGCGCCTCAATGCGGAGGTGGGCGGGGATACGCCGCACTGGTCGCCTCATCGCGTGGAGCTGGC 
CCTCTGGTCACACTATGTGGCCAATGATCTCAGTCCCGAGATGCTCGACGATATGCCGCCGCCTGGATCCGGCGCCTCCACTGGCACCGGTTCAC 
TCAGCACAAACGGCAACAGCAGCAAGGTGCTCGATGGCGACGATACCAACGATGGTGTGGGTGTTGATTTGGACGACGAAAGCCAAGGAGCAGGC 
GGTCGCAACACTGCTACAGAATCGGAGACAGAGAATGAGAACACCAACCCGGCTGCTCTGACGCCTCTACAGTCGGGCGAGGCCAAGAACAACGC 
AGCTGCCGTTGGCGCCGCCCTGCAGGACGGTGACTCCAACTTTGTTTCGAACGATTCCACCTCCCAGGAGCCGATCATCGATGACAACGATGGCA 
CCACACAGACAACGGCCACCACTTCCACAGAGGACGGTGAGCCCATCGCCCTAGACATTGGCATTGGCATCGGTTCGAGTGGAACACCGCTCGCC 
TCGGACTCTGAAAGCAATCAGGAGGCGCCGCCCAAGACCAACAGCCTGCCCATCCTGACTCCCACACAGCACTCGAGCCAGAATCAGAATCAAAA 
GCAGTCGCCGAGCCAGCCCCACAAAACTAACAATTCGATCACCAACAACGGTCAGCCTGCTCCTTTGGCAGAAGAGGAAGCGGTTACAGCAGCAC 
CACAGCCAGCCAGCAAAGCGACTGCAGCACCAGCCAATGGAAATGGTAACGGGAACGGCGTCCTGGGCGACGAGGATGAGGATGAGGCGGAGGAC 
GAG GAG GAAGAT GAGC T GG AC GAGGAGGAGG AT AATG A GGCGG AG CTAGAGGCTGAC GAG AGCAATAGCAGCAACGGC AT TGTGAGGGACAGTAA 
ACTGCAGCAGCTGGCGGCGAACAAGGCGGTGGATGCGGTTTCACCGGTAGCAGCGGGTGCAGACTCGGCACCAGCCATTGGACAGAAGCGTACTG 
CCCTGCACTGCGATATGGAGCTGAAGAACGCCGGCGGAGTGGGTGTGGGCGTGGGGGAGAAGTCACCGGATCTAAAGAAACTGCGCAGCGAATGA 
( SEQ ID NO: 230) 

Start ATG: 628 (Reverse strand: CAT) 
MSNGKATVSFFETGSTKQFEYCYQLYPQVLKLKAEKRC^ 

VNTPRAVIQETKKAFRKLPNLEQAITALSNLKGVGTTMASALLAAAAPDSAPFMADECLMAIPEIEGIDYTTKEYLNFVNHIQATVERLNAEVGG 

DTPHWSPHRVELALWSHYVANDLSPEMLDDMPPPGSGASTGTGSLSTNGNSSKVLDGDDTNDGVGVDLDDESQGAGGRNTATESETENENTNPAA 

LTPLQSGEAKNNAAAVGAALQDGDSNFVSNDSTSQEPIIDDNDGTTQTTATTSTEDGEPIALDIGIGIGSSGTPLASDSESNQEAPPKTNSLPIL 

TPTQHSSQNQNQKQSPSQPHKTNNSITNNGQPAPLAEEEAVTAAPQPASKATAAPANGNGNGNGVLGDEDEDEAEDEEEDELDEEEDNEAELEAD 

ESNSSNGIVRDSKLQQLAANKAVDAVSPVAAGADSAPAIGQKRTALHCDMELKNAGGVGVGVGEKSPDLKKLRSE* 

(SEQ ID NO: 231) 



Celera Sequence No. : 142000013384445 

GGAGTTGATGAAAGTTTT GGAAGT C AATT AATTTGT TT GAT TTTGGC AATTAATTCCAGT GC AGCGCAAAAT AAAAAAAAAAT AAAAAT AT T TT A 
TATAATTTATATAAATTTTATTTAAACATAGCTCAGCTTGCCGCGCTGTAGGAAAACTTCTACATACACGTTTTATGCTTTATATACACATTCTT 
GAAATGAACTATGTACGTATATGATGTCAGTTTTGTATCGAATACTAGAAGCAATTAAATAAATTTCTAACACAACGTACGCACTAAAGTATAAC 
ACATATTAAATAAACGTTGCGGTTTTTTCTAGAATTTAAACATATTATCAACACATTTTACATATGCGTTGCTTGAATTTGTTGTTAAATAATAC 
TTAATAATTACAAATGTTTTCGCCTTCTCAAATTAAATTTGGACAAAATTGAAGCTAGTTTGGAATGCATTTACAAATATTTGCCGTACAGCAAA 
TCATAGCTTGCTTAACAACACTTTAAAACGTATAATATATATTTAGTCACAAGAAATGACTAAAAATAACAAAAAAACATTGATAGACGGATTAA 
AATATTGCAAATGATTACGTTAAGGTGTGCAAGCACTTGGACTTCGTTTGAAAATATCTATTCTGAATCGGTTTCATGTTTGCTCGATCATCTGC 
TTTGAGACAGTTAACATAGTTCCACTACATAATTCAACGTGTGTGTTTTTAGTGATTGGTGAGGACACAACTACAGGTTTTATAAGGACTTTTCT 
TAAAGTTCATAGGTATATTTCACATAGGCTACCTTCCTTTAGCTATTGATGGAAATTGGTTCGTTTTTGCTTTTACTTTCCCTTCATTTGTTTTA 
GCATAGTATATAAATTGGTGTATACATATAAATAGTCAAACTTATTTGTTTTCATTTAAATAGTAATAATATAAATATATATATATATATATATA 
TGTATATATGTATGTATATAGCATTTTTCGTGTTTTGCTCAATGTTCAACATTATAATATAATTTTCTTATCTGCTTCCGTTTCATTTGTGTTTG 
CTATTGCTGCCGTTTTTTTCATAAACTTTATATATGATTTCTGTAGCATCGTGTAAACAGTTTCAAGATCTTAATATTCACTTGCACGTTTATAT 
AATATATAGAGCTATTGGTATATTTAATTTTTCATTACGCATTTGTGTCTGTGAGTGTGTTTTCTGTTTTCCGTCTATGAGGTTAGAGCGCTTAG 
ACT GAGT G A GT G AGT G AGT GGT GAG T GG GT TGC AC GT AT G AGC T TC T TCTGGACCGCT T AT AAC G C AC C AAAT GG GT C GAG T G GGAC GC C T T T AT 
TGCTGCCATTCTGCGTTGGCTGGATGGGTTGCATGATGCCCGTTGCATTACTGTGCGGCGCCGCCATCATCGTCGGTTGCACCCCAGTCAAAGGG 
GCTGCTGCCTGCCCCCCCATCATACTTGGCTGTCCCATCACCGGCATTCCCTACAAGAGTCGTTAAAAAAGGCGTTAAGAGCGAACATAAAATGT 
TTGTTC AATT GAAATTCATGC ACT GGAGGAGGGAAT ACTT ATT AACTGCTATGACAGACGCAATGCTACAAAATCAGTGGCAGGAAAAGATT AGT 
CGTTTGGGCTTAAAAGGAGGGGGCCCGCCGTGAGCTTGGAAAACAGTTGCTACATGGAATCATACGGAAGGTGGTTAGCTCTACAGACAAGTGAA 
ATAGTAACTCCATTAAGAACTGAATGTCTACTAAACTTAAATATGTAGATTTAAATAATAATTTAACAGATACGAATGCAAGATTGTAATGCACT 
ATAGTTCAGAGACGGAGACCATAGAGGGAGGAGAGGCATGTTAAACGCTTGTTACTGGTCACCAAGTGGGAATTGTTGGCATGTTTTTTTTTTTT 
TTTTTTGCAAGTTAGAAGGAAGTGGCGTAGAGGTCACATTTATACGTTATAGTGCTGTGATGATAGAAGTTCGATTGTTAAACGAAATGATAGGA 
AATAAAACTGAAAGAATTTCTTGATTTAAGGAAAACGGCAACTCAGGGCAAAACCAAAATTATATTTATTTATAATCTTTCATTCTGTTTGAGTA 
ATTGTAATGGTTTCTTGTAATGTTTCATTTCGTTCGTTTTGGCATCATCTTACCTGCATATAATTTGGCATTACAGGATAACTAGCATGTATATA 
CGGATGATTGATTGTGATTGGCGCAGGACTTACAGTCATGCCATGTGCCTACGTATTGCATATTGGTATATATATATATAGGCATATTGATAGAA 
TATATATTGATATATCACAAACACACATACACACACACAGACAGCATTGGCACAGGCGCAGCGTGAGTGGTCATACGTGATTCGTTTGTTGTTGT 
TTGTTTGATTGGATTCAATATATATTTAGTTGTGATTCAATTTGATTCGGTTCGGTTAACGATTTTCGATTTTCGATTTCGATTTCGATTTCGAT 
TTTGCGGTATGCGTTGGGATAGTTGAGATATAAAGCAAACATAAAATAAAGAAAAGAAAGAAGTTGATAGTTTATATATACTCCGTTAAAATGCA 
AATTAAAAATTAAAATAAAAGAAATTAAAACTATAAAATGGAAATATGTGTCTTGACGTATTTGGCCTGCGTGTTTTGTGTGTGTGGAGGCATAA 
AACCCACCGTCGTTGCCCTTCTCTCCAGGATGCCCTAATGCTTATGATCTAGTGCTGTTTGCTGTGATTCTGTAGTTTACGGTGGGGTGTGTGGA 
AACTAGGGCCAAGAATATGTGTGGACAAAATAAAGTCAAACTTACAACCCGCAGACTTGATTTGCGATTTGATTTTGGTCATCCAATTTACAATT 
ATGATATACGAAATTTAAAAACACTTAGAATTTTAAAGGGGATTCTCACGACGAAGACATATTAATTATAAAAATTTACTTTAAATCTTATCAAG 
ACAATTTATGCAAGCTAAAAAATAGTTCATACAGATCAAAAATGTCATGAAAACCAAGGAAACCAAGTAGCTCAAGTTGTAAATTTATACGGATT 
GTGCAATTGGTAAATAAACATTAATTTTGATAATATCGTTCACATTAAATCAGGTAAAAAAAAGTGTACTTCGCATGAAGTGTAGCCTAAAGTGT 
TGGGTGACACAAAATTGTTAGAAAATGATAGGAAAAAACAATGCATTAACAAAAATAAAGAAATGTGTCGTTTGTTTTCTTCTCTGGCATTGTTA 
TAGTTCTTATTATTTTTTGTCTCTGCTTCGGCACAAATTCTAGTGTATTCGAATCTTTTGTTTTAGTCACTTTTTAAATTTGAACAAATTCGAGG 
CAACGTAGATTCTTTCGAGTAGTATTCATTGTTTCTGAGTAGAGTAGTTCGGTAGACAGAAGAGTTAGAGTAGGCAAAACTCACCATTGGACGAT 
AGCCAGCACCAGTAGTAGCCGCCATTGGTTGGGGTGTCCAATTAGCACCTGGTTTCGCTGTATTTTTAGGTGAATTCCATTGCACGGGTCTATGA 
GAATAATGGGGTAAGAGAGTAATGATATATAGACGACGTCAATGAACACGGCATTTGAATTTTGGGTTTTTTTTTTTGGGTATCTAAATAGATTT 
CTGCGTGATTTCGAACCATTTTTTTGATGGATTTTATGTACACGTATGGAACATGAATATGGGAGGGTCAGTTCGGAATGGTATTGGATAAATGG 
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TAAGGGATCATCAAAGGATCTGTTTCTTCGGATGTGGAACATAATCTTCAGTCGATCAAAATCGACACGATTCAGCCATCTACAGCGAATGAACT 

GGAAGGGATGTGACAACATGGATTCTGATCAACATTCTGGTTTGGTTTGGCTTCGCTTTCGAACGCACGCACACACAATCGTCAGTTTACTGGCT 

CGATTTACTCACTTTGCACTTGCCGTTTTGTTAATATTTAAGTTATCAACGAGTGACATTAGTGAACTATCCAAGTCGCCGGTAATTATCTTTCC 

AGTGCCAGCCGGTGGCTGCTGCTGTTGCTGCTGGTTCAGTTGATGCTGGTGTTGCTGCTGGTGCTGCTGGTGCTGCTGCTGTTGTAACGCTAGGA 

TCGAACTGCTATTGCTGGAGTAACCTGCGGCGACAACGTTCATTTGATTGCTGTTGTTGCTGGCCGAGGCCGGCTTGAGGACGTCGCCCAAGGCA 

TCGAAGCCTTGATAGGTTTTTACAATTCAGACAACATGGTTGTTTGGGGATAGCCATAGCCAATGCACCGTTTTTAGCAGACATTTATACACAGA 

GACAAACACATTGGTATATATATATATATGGTAGATATGCATTAACAATGAAGCCGATGAGCCCAGAGGATTGGTTTTGATGTTTGTCGAGTTAG 

CGTTGAGTTCGATTGACGTAGATATTCGATATTTCGGTTTTAATTCGTTTTTAAGTTTTTTTTTTTTTCATTTTGCCAGCGTAATCATAAAACAC 

ACAAATATGTTTTAGCGTTTTGTTGAGATTTTAAGTGTTCAATATGTATTTGTGCCACCAACATTAACATGATACAATGTGAAATTGTTTTTAGT 

T T T GAT AT T T T GAT T GG AT T T T T AC GT T C AG T AC AT AAG AAC AT AAG AG AG AG AAAAAG AG AAAAAAT AAC AT TT AC AAAAT AAAT AAT AT G AAA 

TTGCATTTTATATATAAAGATTTGGGATATATCGATCTCTAATGATTTTTGATATATGTAGGATTAATAATAGGTTGTTATTTACAAACACATGA 

CAGACGTAAGATTATTTACAAAAGCTACCAAAGCAGAATAGTTTTTTGTTAATGAAATAAGTGCGTTTTTTTTTAGTGTAATTTAAGCATTATAT 

CATCGTAAATACAATACGTCCTGATAGTTATGTGAAAAGTTCAGGTTCAAATAGCAAATAGGGAATCTGACACTAGAATTAGTTGACCTGAACTC 

AAAAACGTAAATCAAAAGAATAGT T ATCACACAGTTATCAC AT GAAAC ACAC ATT AAAT ACTTTTTTCC AAAAAAT AAATTAAAACT ATT TAGAC 

AACAACACAGCTTTCTTGTCAATTACGAGACGAGTTTATAGTCGATTGCAGACAAAATAGCTACAAAACACGAGCAGACAACAGAGGGAAAAGCG 

TGTTCTGTT T GGT GT GG AT G AGC AT CT GAT GC AC TG GGC GC AAGGC T C AG GAG AT T T T G GAT GC T G AG AT AGAAAAGT C AG T GT C C G CT G C AGC A 

CTACAAGCCAGTGATGGATCAAGCTTTGTACAAAAACAAAGCACGCGGAGGAACATATGCTTGTGTTTAATGGGTTTGTTTTCGTATTCGTTTTG 

CGGCGAGGGTCTGCAAATCTTCACCTGGTGGAAACTGAGCTGCTGACATGGAAGGTGTCAATATCGCTTGCTGTTGCGGCAGTTGCGGCCGGTGT 

TGCTGCTGCTGTAGTTGCAGCTGCTGTTGCTGCTGCTGTTGCACATATAAAGCCGCTTGCTGTTGTTGCTGCAGCTGCTGATCCATGAAATTTAT 

ATTGTTGCCAAAGGGTGCAGCAACAGATTCAGCAGGGAGTGGGATTTGGGGCAAAGACATATCATCGAAAAATGGATTTGGCAACGACGTCGAAG 

CTGTGTAAAATTTAACGCAAACATTCAGAGGATAACCATTAAAATAAATACTAAAAAATAAAAAGAAATTATTAAAAGTCCGAATATGATTCAAA 

AACTAGCATTGAGGAATACAGGTAAGGGATTAGGTTCCTAACATAATCACGCATATTTCTAATCCAGTTTTAGAATTTCAAAGACCTATATATAC 

GTAAAATGGAATTTTCTTAACTTTTTGTTGAACTTGTTTAATACATGAAATCAATCAGGCATATTAAATTACATGCTACATTTATATCAGTATAA 

TTCAAACACATCAAAGTTGGATGATATTGCTAGCATTGTTATTGCCTTTGCCAATTAGTAAACTTAAACGAACCACTCCGATTTTTACTCAATTT 

GGAATAGTTAGTAAATCATAAGCATTTTGTTAGTGACTGCATATTAAGTATTCAAATATGTATTCTTAGGATTAGATTGTAGCTTGCAGTTAGGA 

ATGAATGAAGACTGTAGACCATTGAGTGACTTACCTGCCGGTTCCGTATTACCAAATACGGATGAGAAATTACTATCGGAAACGAATGCATTTGT 

AGCAGCAGCGGAGGAAACTGATGCGCCATTGAAACCTATTTTTGTTTGTTTTATTTTGAAATTTGTTGGTGGTTTTGTCTCAGTTTAGTTGGGTG 

TAGTGTGGGCGGTGGACAGTGGGCGATGGTCGTTTGGAAGATTTGCGATTGGTTTGGTTTTCGTAACCGAGGAGTTGTTTTCGGTATCGTTGTTG 

AAG AT GT T GT CATTACAGTTT C AGT T TG GAAAAAAAC AGT GAT G AAAAAAT G AAG AGT AAC AAT AC AT T AG AC AAGAAT TT AAAAAAAAATC AAA 

ACTTACAAAAATAATTATCGTAAATAATTAAATGGATACAAAATATACCATTATGATTTCAACTATGGATAATGATGCATGAAAACTTGTGCAAA 

TAGACGTCTAAGCTGAAGTGATAGGAAGACAGACAAAGGGTGAGGGGCGTCTAGAGCGCCTAAAAAGTGCGCTCCTCATTGCTCTGATTTTTAGA 

GTACGACAACATAAATACGTAAGATCACTACACTTACACAACGAATGTTACCTATGTACAGCACAAAATAAATTTAACTCATATCGTAATCGCTA 

AACAAGAACTTAGGACGAAATAATAAGAAAATTATAACATGCTACCATTATTATGCATCCACAAATTATTGGCATTGAGTGCAGCACCTGTGGCT 

CCTACAGCAGCTGCTCCCCCACCGCTGGCGTCAAACATGTCCGCGAAAGGATTGCCCAACTGCAGCAGATCGTCAGAGGCCTTGGTGGCTGCTGC 

AGCAGCAGCGGGCTGCGCCGACGCGGCACCGAACAGATCAACTATCGGCTGGCCAGCCTGCGCGGCTGGCGGCGACGATAGAAATGGATTTGTTA 

GTGCAGCGCTAGCGCCAGCAGCACCGCTGCTAGTGGGCGACGACACCTTGGACTAACAGGCAGATAAATGGACCAATTAGTTTGTGATTTATACG 

TATACTGCTCACAATGAACTCACCTTGTATTGGTTCATAGCGGCTTCCTCTTCGGCCAAAACCTGGGCCTTAAGCTGCTCGTCAATGCCGTTGGT 

GGTATCAAATTTGCTGGACGCAGCTGCAGTACCAAATGCCGAGCTGGTAGAAGACAGGGCAGATACAGCGGATTTTACATTACGCTGATTACTAA 

TGTTTCCCAGCCATCAAAAACAGGTGCAACGCAAAAACCAAGGAAAATCATAGAAAAACAAAAGAAAAGAATGTTAAAAATATATCGCAAATAAT 

AC AT T GAT AC AT T AATGAT AAC AGT T C AGT AAAT AAAGT AT T T AT AC AAG AACT AAAAT AAT T T G T AAA C T AT C CAT AAT T CAT T GC C AAAT T TT 

ACATAATTTATCTTATTATTTTATGTTGCATTATAAAATTTGAATAAAATGTACAAAATACAAGTGGTTGTTAGTAGGAACCCAATGTTTAAATT 

ATGTTAGCATATAACAAATAAAAGTAATATCTAAAAGATAGATTTTGGAGCTTAGAAATAGATTAACAAACCTTGATGACTGCGTAGGAGTATTA 

GCAGCGGATACTTTTCGACCCTCAAGCGTGGCCAAGTGCTGCTCCAAGGCATCTAACAACGAACTGGGCGCCTTGGTCAAATCGGGTATGTCACC 

CTTGTCTATGCCCACATTCTGGAAGCAGAGAATAAAGCAAATGAAAATGATTTATTACATATGATAAATAGGATAACTCTTACCTCAGCCACTTT 

TAAAAACTCTCCAACACGATCCATGCGCACTAAGAACTTTTTGTAAAGATCCAACGCATCGCGGGCATGTTTCTTGTTCATATCGAAATATTTTT 

C AAGC AAAT T T AT G AT GC CAT CAT T GT AGC AAGC AAAC AAT C G TAT GAG AT C G C GG AAC AAGAG C AT AAAG C T CAT AT T G AT G AC GC C T GAAAAA 
TATTAAAGTCGTTATAGTTTTACAATATAATTTGGAAATATATTCATACCATTAGACAAGTCGTTGGATTGGCAGTCGAACTCCAAAAGTGCATC 
CAATTGTGCCTGCAAAACTGGCAAAGTCTTTAGAAGTTTTTCCGCGTTCATGCTGCGTAGCGAGCCTTCCTCTTTGCTAAAATGATAGATTGCGA 
AAC AAAT CAATCAGATGACAATTAAATGTGAGCTGGCGTTACTTACCCCCGTTTGACTTTGCAAAAGTCAAAGGCCATGGCTCGATAGGAGAGCG 
ATTTCTCATTCAAATACTTGGCGTAGCGCCGAATAAAGGGAGACATATCGTAACCTGTTGATTTAACAAATTATAGTATTGGTCTCCAACAATAT 

AGGT C AC AC AAAT T T T GG AC AAT GC C AAAT AC GAAAC T T AAG AAG AAT GGT C T T T T G GT T AGT T AAT T T C G C T C T AAAT CAT AAG T AAAAC AAAT 
TTTAGTAACAACCAGTAAGGAATTCAACGCCTATAGCCAAATAAGGAAAGTTGTATCCAATTGCTAATAGCTTCTGTATAATACTCACCCATTCT 
GCCACCAGGAACGCCCATGCCACCATCTGTATATAAGGTTTATATTGTTTGATTTTGAATATTTAGTTTGGATAATTGCATTTTTCAGTTGTTGT 
TAGCCAGCATGTAGTGTGCGGTATGTAGGTCAGTAAGCAGCGCATTTCGGTATTAATATTTTTGTTTTAGAGATTTTTGTTAAGTAAACAAAATT 
ACAAATGATCAAGCCAAACGTTCGCGCGCATTCGCATGAGTTGAAAGAGAGAATGAGAGAGGAATGATCAAAATGTAGGTATTAGTTAATAATTT 
GAAATAGGTAAATCGATTGAATAGTCGATGGGTTTGAAACAAATTTGGGAGGCTTAAAAGGGAGGCTTCGTTATACCTTGTACAGTTCCTTTGTC 
CAGAAAGGAGCTGAGATTGAATGTAGAATTGCTAGAGGCAAGATATTGCATAAATCGCTGAAAAGATTTAGGAAAAACAATAGGAATGCTGTTAA 
ATATTATTTAGATAATTATAAATCGCAATTAAATGGAGGGAACTATAAAAGTTGATGTGTTAAAAATTTAACATTTATTTATTTATTTTTCAGAG 
AAGACAACTTTTTATGTCACCAAAAAAATTACTAAGAAATAAAACTGAACGAACAAAAACTGAAAGTAAAATACCACACAACGATATACTTGGTC 
AGAGGTCATCCCAAAAGAGTAATTCTGCAGTAATAAGTTGCCATTAAAGGTACAAGAAGGAAAATAATAAGAATCTCATGATGTACGTGACCTAT 
TCATCTGTATATTAAATCAAACAAAATAGATTACCTCATTTCGAACGCTTAAATGAAAGTAAAAATGTATTTGAAATACGCCCCTAAGCTTCGCA 
CAAACACTAGATTCCGATAAATACACAAATACAAACCTCATTGCCATATGCCATCAGATGATGTGTGGTTATCAGCGACTTGTAGACGACGACCC 
AGTTGGCATTCTGTGAACGCTCGATAAGTAAATTGGCCAGATGAGGTATCGACACATTCGGCTCGTTTGTGCAGTGCACCAAATCTGCAATTAAA 
TAAAGGTATGAAACTTATATTAACACAAAGCAACCACAAACTAATTTAACATTTTACATTGATTTAAAATAATAAATGAAAATCATTTTATTAAG 
CATAGCTTTATATTAGTTTCTTAACGCATATCCATACAGTAGATTAGAAAACAATAAAGAACGTTTTTTTTTTTTGTAATTTTCGGTTAGTGAAC 
CATTTGCTCTTTTCCTTAATTTTGTACGTCCAACTCAACTTTACTTTTGGTTGCGATTAACTTTGACTTGATATTTATTTTCCCAAGCATTTATT 
GGAAAAATCAATCAGATTCGATATAACCGCCGACAACTATTAGTTATTATCCTCTAGATTTATTCAACAGTATATTTTTTTTACGAAAACTTAGT 
TCATATGAGCAAAACAACAAATGTTTACATGTTTAAGAAGCAACATAAAACTTTAAAAGTCAATTTATCTGTTCCAGAAACAAACAACGTTTCTG 
TAAAGATTATTATTATTATAATAACCGAACCTCCCATAAATTTAATCATCTTGTTTTTTGCGTCCCGTTATCGGTCGCGACTCTTAGTCGTTGTT 
TGCTTATCTGCGCTTTAAGCTTGCAGCCGCCATTCAAGCTAAATACTGACGGAAAAGTGTCGATGATAAGAGCTGAATCCCATTGTGTCTACTAT 
CATGCGCTTCCATTTGAATCAATTCCCATTTCGGTTGGAAAGTGTTTGTCGCCGAGCAAGAAAAAATTGCATGGGATTCGAGGGGCGTTGCACAT 
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TTCACAAGCCATCGCAGATGTTCGCCGTCCTTTTGCTTCATTTTGTTTGGCTGACGGGCGCCAGGCCAGTGATTCATTTATACATAAATAAGAAC 
AAATTTCAATAAACAGTAAATGTGGAAAATTGATCGGTCCTTGTAAACGAAACTGGTGAGTTTCTTAACGAGCACAGTGGCCTGGGGGTGTCACT 
TTATGACAAAACGTTGAAATCGCTGAGTAGTTAAGACATTTTGAATGAGTTATTTAGTTAAAATTCGTATCCGCCCTTTTTGAAATCTTCACTGA 
TTCACATATATTTCGAAAATATTTACAGTGCTCGGATACCAAAAGCGATTGGCGACTTCTGTAGCGTAGTTATCGCTGGGGTCAATTCGCATGCT 
TATGCTCTTAAATACATAAATAAATAAATGTGTCGGGTATCGAGACTCCACTGTAGTGGGGGACCATTGATACTTAATTTTAATTATTATTGCCC 
ACTTGTGTAGCTGGTTGAAATATTCTGGGATCTTTTCAATCGGTGTAAAGCTCATTAAGTGCTGTTGTTTCCGTGAACACACAACTGGTTTTATC 
ATAGGACAAAAGTTGTTAACAAAAGAAAATAGAGAAAATAGGTAGAAGAAATATATTTCATGCTACCTGCTGACGTCTTGTAAAATGTACGGACT 
GACTTTTAACTCAGGTTTAATTTATTTAGACTTTTGGTAACTGAATTGAGCATAACGTTTTGGATAAAGTAATCAACTTGAATAAACAGCACATG 
TGGCCAAATTTTGGGTTATATTGGAAATTGGTTTGCGGCAACTGCAATACAATGTTTATGTGACAAGTAACGCCCCACAATATTGCCCTCTTTTA 
TTCCCATTTACTCGTAGCCATATCTGATGAGTCATCTGAAAAGTGCCAACAAGCTAAATAAAAGGGGGTCAAAATGCGAGCAAAACTAGCCAAAA 
AAACAGATTCCAACGGGAATGAATGTCGTTAGGATATGGAGCTCTTTAGAAGTATTTGCAAAATTCAAACGTGTAAGCAAATCCAGCAAATTAGG 
GAAGACTCTTGCATAAATTTAACAAGCAATGACACCGTGACAAACACAATCCCTGCTTTCGTAGGCGTTTTAAGCCATTCAAAGCCCCTACAAAC 
ACACGTGCACCCGTGGGAAAGTGACGTAACATTAGAGAGAGAGAGAGAGTGTGTTGGTAGAAAGAGAGGGGAGTCCAATCAATGACGAACGGAGT 
CCACAAACACCAGCAGCTGAGCCACAGATGCAATTTGAGGGTGGTGGTTTGTGTTTGTGGGACTTTGTGGACCGTAATCAGTGGTTTCGTTTTCA 
GAAAACTTTGTGATTTATCGCACAAAAACAAAAGGCACGTGAAAATGGAGCGAGGAAGCCAAGAAACCCGAACTGGTTGCATCTGAATGTTGTTG 
CTAAATTACTTTTGCAGCGTACTAACTGTGCCGATGGTTTTTATATTTAGGCTACATTTAAATTTTTTATTTTCTTTGCTAATTTGTTAGGCACT 
CCTTACTTTAGTATATTATTCATTCCAATTTTTATTATTATTAAAATTTAAGTTAATTTCTAAACAGATTTTATTGCGACTTATGTTTGCAAATA 
AAAATAAATTAATTTTAAAGCGTTACAAAGTAACAATAATATTTTCGTGTGTTTGATGGGTACATTTCCACTAGTAGCACAAGGCTGTGCCTCTT 
TTCTCTCTGTGTTATAACAAGGAAATGTAACGGTGATTTACGCAATGAAAGACCAAGAGAGTGGGCAAGGGAAAGAGATAGACAGGGCAACTCAC 
AGTCCAAGTGCTTCTTCTTCGGGCCAATGCATTCCTCCGTGGTGGCCTTGCAAACGGACTTGGCGAGTCCTTGGCCCGCAAGGCTGTGACGGGCG 
GCCAGCAGCCTGTCGTTGATCGTTTGCCCTGCCATGGTCATGATGAAGCTGTTGCTGTTGTTGTTTCTGCTGCCGCTCGCTGGTTTTTATTTACT 
TGATTAATTTAATTATGCACCAAAGGAGCTGACCACACCGACACACACACACACTCGAAAGCAGATACAGAGAGAGGGAGAGAGAGACACACACA 
CACACGCGTATGGGGCGAGCGAGAGACAGAGAGGTATTGATTGATTGAGAATATTTCGAACGGGAAAACTCTGCTATGCGAGCCTCACGATTAAA 
CTAACAGTAAAGATTTAGTGGTTCCTACGTCACAATCACTACACAATTTATCGCCGCGGTTTATGCGCTCTCTAACCGGTGCCTAACGCTGAAAC 
GCGATTTTTTAATTCTCTATTTTTCCTCTGCCACAAATCGATAATGGCGACTGGCAAATGGAGCGAGCGTATGAGCAATGTCCTAAAGCGTCCAC 
AAAGTGGAGCTAACGCGTGCAATCGACTTGAGTTTACGGCGTTAGGTGGGAAAATATCGATTGCCATAACGTAAGCGAATACGAAATAGCTGAAC 
TTAGCTTTTACGATGTGTTCATATAAGTTCTGAAAATAAAAAAAAATAATGATAAATAAATAAATAAAAATATAAAATATTAAGATACTATAAAT 
AAATCTTTTTTTATAACTTAGCACTTAAAATTCCCCAACAGCAGTTAAAAAGTAGCTAGATTTGTCGGAAGTGTAGCCAAACGTAACGTAAGCTA 
AGGGGCTGTCAACAGGAATTTTTAGTTTCGAGTAGGCGGTCATACTGGTCATTAAAATAAATAAAATTATACTAAAAAACAGTAGTTAGCACAAT 
CAACAGTCTCGTAATAAAATTGTAAGATGTTGAATGGACAGATTGCACACAGTTATTTGGAGAACTCGTGCACGCTGATCATACCCCGCCCGCCT 
GTCCTGAATCAGTCCTCAAAGTCGGCCCAGCCAGCAGCAGCTGCTGCTGCTTCTTCTTCCGCTGGTGAGGAGTTTGGTCAGGAAGCGGGGTCGGG 
GTTGGCGGAAGCGGAGGCGTTGCATGTGGATCGCATAAGAAACTGCGATATATTCGTTGACGCGCACGATCTTTATTTCTACGCAGATGTTAAGC 
TGTACAAGTGAGTCACATCCGGCAATCTGTCGTGCCCATGGTGGGCCATGGTGGCGAAATTCCAGCGATATCATTGCGTACTTGTACTCCCTTTT 
CGGACAGATTCCACAGCCGTCGCACCTTCGACTTGCGAGAGCTGAAGACGCTGTTGATAAAGTTCAACACGACAGAGAATCACTACCAAATTTGC 
CTGCGCTGTCTGCCCCTAACAGCGGGTGCCCAGGAAGGACCCGGTGGAGAAAGAGGCGGAGGCGGTGGTACGGGGGATGCAGCCAA 

(SEQ ID NO: 232) 

Exon: 12766.. 12256 
Exon: 9679.. 9537 
Exon: 9082.. 9007 
Exon: 8671.. 8639 
Exon: 8414.. 8312 
Exon: 8246.. 8125 
Exon: 8067. .7874 
Exon: 7808.. 7672 
Exon: 7263. .7149 
Exon: 7082. .6833 
Exon: 1475.. 1001 

Start ATG: 12391 {Reverse strand: CAT) 
Transcript No. : CT8341 

TTGTGGCAGAGGAAAAATAGAGAATTAAAAAATCGCGTTTCAGCGTTAGGCACCGGTTAGAGAGCGCATAAACCGCGGCGATAAATTGTGTAGTG 
ATTGTGACGTAGGAACCACTAAATCTTTACTGTTAGTTTAATCGTGAGGCTCGCATAGCAGAGTTTTCCCGTTCGAAATATTCTCAATCAATCAA 
TACCTCTCTGTCTCTCGCTCGCCCCATACGCGTGTGTGTGTGTGTCTCTCTCTCCCTCTCTCTGTATCTGCTTTCGAGTGTGTGTGTGTGTCGGT 
GTGGTCAGCTCCTTTGGTGCATAATTAAATTAATCAAGTAAATAAAAACCAGCGAGCGGCAGCAGAAACAACAACAGCAACAGCTTCATCATGAC 
CATGGCAGGGCAAACGATCAACGACAGGCTGCTGGCCGCCCGTCACAGCCTTGCGGGCCAAGGACTCGCCAAGTCCGTTTGCAAGGCCACCACGG 
AGGAATGCATTGGCCCGAAGAAGAAGCACTTGGACTATTTGGTGCACTGCACAAACGAGCCGAATGTGTCGATACCTCATCTGGCCAATTTACTT 
ATCGAGCGTTCACAGAATGCCAACTGGGTCGTCGTCTACAAGTCGCTGATAACCACACATCATCTGATGGCATATGGCAATGAGCGATTTATGCA 
ATATCTTGCCTCTAGCAATTCTACATTCAATCTCAGCTCCTTTCTGGACAAAGGAACTGTACAAGATGGTGGCATGGGCGTTCCTGGTGGCAGAA 
TGGGTTACGATATGTCTCCCTTTATTCGGCGCTACGCCAAGTATTTGAATGAGAAATCGCTCTCCTATCGAGCCATGGCCTTTGACTTTTGCAAA 
GTCAAACGGGGCAAAGAGGAAGGCTCGCTACGCAGCATGAACGCGGAAAAACTTCTAAAGACTTTGCCAGTTTTGCAGGCACAATTGGATGCACT 
TTTGGAGTTCGACTGCCAATCCAACGACTTGTCTAATGGCGTCATCAATATGAGCTTTATGCTCTTGTTCCGCGATCTCATACGATTGTTTGCTT 
GCTACAATGATGGCATCATAAATTTGCTTGAAAAATATTTCGATATGAACAAGAAACATGCCCGCGATGCGTTGGATCTTTACAAAAAGTTCTTA 
GTGCGCATGGATCGTGTTGGAGAGTTTTTAAAAGTGGCTGAGAATGTGGGCATAGACAAGGGTGACATACCCGATTTGACCAAGGCGCCCAGTTC 
GTTGTTAGATGCCTTGGAGCAGCACTTGGCCACGCTTGAGGGTCGAAAAGTATCCGCTGCTAATACTCCTACGCAGTCATCAAGCTCGGCATTTG 
GTACTGCAGCTGCGTCCAGCAAATTTGATACCACCAACGGCATTGACGAGCAGCTTAAGGCCCAGGTTTTGGCCGAAGAGGAAGCCGCTATGAAC 
CAATACAAGTCCAAGGTGTCGTCGCCCACTAGCAGCGGTGCTGCTGGCGCTAGCGCTGCACTAACAAATCCATTTCTATCGTCGCCGCCAGCCGC 
GCAGGCTGGCCAGCCGATAGTTGATCTGTTCGGTGCCGCGTCGGCGCAGCCCGCTGCTGCTGCAGCAGCCACCAAGGCCTCTGACGATCTGCTGC 
AGTTGGGCAATCCTTTCGCGGACATGTTTGACGCCAGCGGTGGGGGAGCAGCTGCTGTAGGAGCCACAGGGAATGCCGGTGATGGGACAGCCAAG 
TATGATGGGGGGGCAGGCAGCAGCCCCTTTGACTGGGGTGCAACCGACGATGATGGCGGCGCCGCACAGTAATGCAACGGGCATCATGCAACCCA 
TCCAGCCAACGCAGAATGGCAGCAATAAAGGCGTCCCACTCGACCCATTTGGTGCGTTATAAGCGGTCCAGAAGAAGCTCATACGTGCAACCCAC 
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TCACCACTCACTCACTCACTCAGTCTAAGCGCTCTAACCTCATAGACGGAAAACAGAAAACACACTCACAGACACAAATGCGTAATGAAAAATTA 
AATATACCAATAGCTCTATATATTATATAAACGTGCAAGTGAATATTAAGATCTTGAAACTGTTTACACGATGCTACAGAAATCATATATAAAGT 
TTATGAAAAAAACGGCAGCAATAGCAAACACAAATGAAACGGAAGCAGATAAGAAAATTATATTATAAT 
{SEQ ID NO: 233) 

Start ATG: 376 (Reverse strand: CAT) 

MTMAGQTINDRLLAARHSLAGQGLAKSVCKATTEECIGPKKKHLDYLVHCTNEPNVSIPHLANLLIERSQNANWVWYKSLITTHHLMAYG 
MQYLASSNSTFNLSSFLDKGTVQDGGMGVPGGRMGYDMSPFIRRYAKYLNEKSLSYRAMAFDFCKVKRGKEEGSLRSMNAEKLLKTLPVLQAQLD 
ALLEFDCQSNDLSNGVINMSFMLLFRDLIRLFACYNDGIINLLEKYFDMNKKHARDALDLYKKFLVRMDRVGEFLKVAENVGIDKGDIPDLTKAP 
SSLLDALEQHLATLEGRKVSAANTPTQSSSSAFGTAAASSKFDTTNGIDEQLKAQVLAEEEAAMNQYKSKVSSPTSSGAAGASAALTNPFLSSPP 
AAQAGQPIVDLFGAASAQPAAAAAATKAS DDLLQLGNPFADMFDASGGGAAAVGATGNAGDGTARYDGGAGSSPFDWGATDDDGGAAQ* 
(SEQ ID NO: 234) 

Name: synapse-enriched clathrin adaptor protein LAP 
Classification: chaperone 
Gene Symbol: lap 
FlyBase ID: FBgn0026210 

Celera Sequence No. : 142000013384695 

TTAAATGCAGGTGTGCCACAGGGTTCTGTCTTAGGACCTACATTGTTTAATATTTTTACACACGATATTCCACAAGCTACAAACTGTGTACTTTC 

AC TGTTTGCT GAC GAC GCCGCTGTC T AC AGT GCTGGTTTCTC GT AC AG C G AGAT AAAT C AAT C C AT GC AG AGT T AC T T AAAT G AAT TAG AC AT AT 

ATTACAAAAAATGGAAAATTAAAATTAACCCTAATAAAACGAATGCTATATTTTTTACAAAAAGAAGGAAACCTCGTTATCTTCCTGATAGACAG 

TTGCGAATACTAGACTCTCCAATACAATGGGTTGACAACATTCGATATTTGGGTATAATTTTTGATAAAAAACTAACATTTAAATACCACATCAA 

CAACACAATTATGAAAGTTAATAAAATTATTTGCACACTGTACCCTCTAATAAATCGTAAATCTAAATTATCAATATCCAATAAGATCATAATTT 

TTAAAACAATTTTTAATCCAATATTAATGTATGGGTCACCTGTTTGGGGTAGATGTGCTCAAACTCATATCAAAAAACTTCAAATATGCCAAAAC 

AAGCTATTAAAATTAATAATGAACTTACCTTATTACACAAACACTAAGTACCTACATATAAAAGCAGGTGTACAAAAAGTTCAACAAAAAATACA 

ATATTAACAATCTTTTGTTTCAAGTCTTAGTTTTTAAGTTAGTTGTAAGTCAATTAAGGTTTTTCTTCTCCCTTGTTTTCCTTAATAAATAATAA 

ATATTGTTATACAAAAAACTAAAAACAGGTCAACTAGACATAATTAATAAATATTTCAGATGAAAGCCTTAGCTAAACACTGTAAATACCTAATT 

ATACCATTAGCTTTAATGAAATGTACATATATATATAAGGGATAAAGCACAAATAAATAAATAAATAAAATAAATTCATTACTTACCAGGAGTTT 

CTTTTACTACCAAAGCATATAGAATTAATGACATCGTCGAGGGGGAAAGTGCATTTGAAAATAATTTAATAGAATTTATTTTTAACAAAAATAGA 

ATTACTAGGGGCTACAATTAAAAGTGTATAATTTATGCATGTACGTAGGTTTACATTTAATTTTTGCACGTTTCGATTTGATTTGCATTTCGGTC 

GGCCTCCCAACCGCACTAAAAACTAAAGCCTAGCTATTTACACAGATCGATGCAGAAGTCATAGTTGAAAGTTCATAGTTTCCCCCTGGAATCCT 

TCCAGGAGATTGTGCTGCAGCTCCAATTCTATATATCTCTATCTCTGGTTGCGGAAATGCCATGTTATCCGGAAAGTGTGGGGCTATAATGGCAT 

ATCGGCAATTGGCTAAAATACTGTCATGGCCGATACTTGAATTTCTGGGTTGGCTGACCCAATTGCCACTGCGGTTTGATATGGTTTCCCGGGCC 

ATGTAGTTCTCCATCGGATCGCCAGCCAGAATCCCCTTGCCAAGCATTTGGAAGCTGAATATCCACTCATGACAGCCTTCCAGCAATCGGGCGAT 

ATGTAAAGTTTGTGCTGCTGTGCCGAAGGAGTCCGGCGATATTGCTTGGTGGCCAACCACCACCTCAGAAGTAGAAGCTCTCGCACTGCTGCTGC 

TGCTGGTGGCGTTGTCGCTGCTGCAAGGCGATGTTGCTGCCGCTCAAGCTGCTGCTGCTGCCGCCACTGACCCCGCCGCTGACCCCGCCAGTGCC 

ACCGGCGGCAGCGGCCGCCTCCAACTAGAATCATAAGTGCTGGTACTCGCCCATGTCGTACATGGCGGTCATGGGCATGGCGCCCATCACGCTGT 

GCAGGTGCGAATAGGCCACGGCGGCGGTCAGTGGAGGCATTTGCATCTGGGCAGCGGCCGCCGCACTCACGCCTCCTCCGCCGCCGCCGCCACTG 

CTCTTGGTGGCGGCGACATTGGCGATATTGCTGGCGGGATGTGGCTGGTGCGGATGGTGCTGCAGGCCTTGCTCGGCGGCGACATGCTGCAGCTG 

CTGCTGATGCAGGCCGTTGTTGTTGCCGCTGCTGTTGCCGGGTGAATGCTGCTGCTGTTGCTGCTGCTGCTGGTGCTGCGCACTCTGGCTGGGCA 

ACATGCTGCCCTCCATCTCGGAGTCGCTGGAGGAGTCAAGGTGCTGCTTGTCCGTGGAGCCGCTGCAATGGAGATGAACAGGTTGCGTGGTTAGC 

GGAGGTTATTGATTTAGAATGCTAAAGCCAGCATAGGCTACTATACTACTATACTTCTACAACTAGCATTAATACATCATATTGAGCACTTCATG 

ATCTTTATGTTCCAGGTCTCATCTATCCCTTATGTCGAATTGAAACTATCGCCTGCTCCGCCTGCTGAAGACGTGACCCGTGGCATATGGTTTTT 

GGATGTCAGCATCGCTTACGCTAAAATGTGTCAATTGCTGCAACTAATTTAGTGCTGTCAACACACAAACGCACTCCAGTGCATAATTTATTTAT 

AAACAAATGACAAATGACAGGGGCACAGGCATTGCAATTGAAACGAATTCCGGAAGGAAATCGTCTGCCGCTCGCGTTTCTATAAATGGTATCAT 

GTTCGCCGTTGAGGTGTCGCCTCAGGGAATCCGAATGGAGTGAGGAGCATGCGAGATGAGGAAGCGAGATGGCATGCCCTCCCCGGGGCATTCGG 

ATATGAGGCATCTGATTTCAGTCCCAGGATGACTTTCCACATTTCCAGGGGTTTGCTTCAAGTGGGTGCGAGTGCTGCACACCCCCGCTTGCATC 

TTAAGTAGCTGACTTTTATTGTTTGTGGCTGAAACTCATTCGAAACTCGAACGCCACTTTGGGGTTAATGTGAGTCCTGCCCCCTGCACTAATGG 

CTAATACCCGACCATAACAAGACCGAAACTCGAGCAAATGCCAATCTCCCTCCCGCAATCTCACCCACTCCGCTGGCAATCATCGGCAAAGTGTT 

AATCTTGTCGCCAATTGCATGGCCGAGCACAATCAGTAATCGGTTATCAGCAGCCGCATCGCGGAACCAGTTTGCGAAGAAATTGATGACAGCCA 

CGAGCAACATCAGCAAACAACATGGGGCACCTCCAGGATTGGACCCACCAAACTTTGCATACCCTACTGATAACGATACTTCTTTCATATACCTA 

ATAGTAGATATTTGTTAAACTTGATATGTTACTTGGAATTCCTGATCTTAGTAGCCATTCAAAACCATTCCACTTGGACCTAATGTTTAAAAGCC 

TTTAATGATAAAGAGACACTTGGTCACTAAACACATGTCGAATTAAAGGAATTATCGACAAATCCATTTGTCCAGCGAAGCAATCTTCATTGAGG 

GTATTGTTATTTGGGGGAAGGGGGGGAACGAGGCTCGTTCGGGGCAATGCGAAACATTTAAGCGCACACTCGACAAATTTGCGATCTGGCACAAT 

GGGACAGAGACGGAGGGCAGGACAGGACCGATCTGATGGCCGGCAGATACACCACTTCCCCAGCTGCCGCGGATTCCGATTCCTCATTTCTGATT 

CTCCATTCCCGATTCCAAATCCTCGATTCCGAGCGGGGGATCTGCAGTAGGAGGAGTAGGAGGAGGAGCCCACATACGGCGCAGCTGTACGATAT 

CGGCAATCGTGTAATTGTAGATTGTATCCACCGAGTGGCAGCCACCAACAGCGATGGCATGCCATCCACTTCTTGCAGCGTTTGCATGAACTTGA 

ACTTGCTCCAGCGATCAGAGGCGCAGTGGTGGAGGAGGAGGAGGAGGATGAGGAGGAGAAGGAGGCGGCGGGCGCGAGACCATGTGACATGTGAT 

GGACACTGCTCCCTACTTCCCCAATTTTCCGAAAACGAGTTCGCGCTGTGGACATTAAAGTGACGCTCGTTCCGATCCGCTCCGTTCCGTTCCAA 

ACCGTTCGAGGCAAAAATCGATGGCCTAAGCACAGCAAACATGAGGAATGGCCCATCCCAGAGCGAAAAGTGGGGAGCAAACAAAGGGCAGCAGA 

AGCATATAAACAATGATATCGAATTACCAGAGTTTCGTAGTTCTGCCCCCCATCTCTGGCGGCTCTGCAACACTTATTAACACATACAAACGGAC 

AGCGCCCACTCTTGTCCCGTCCCATCTCCCATTCCCATCCCCATCCCCAGTCCACATCCACATCTACATCCCTGTCCGCATTTCCAGTGATTGCA 

TTATAAACACACAAAAATGCCTCGGCTCGGTCTAGTTTGGTTTTACCGGGGCACCGAACGGGAGCAGCGAGTCTCAAGGACAATTCCTGGGCCAG 

GAGTGAGTGTCGCATATAGTTTATAACAAAATGGAATGGCAACCGCTGAGGAGCAGCAGGTTTTGGGTCTCCGTCCTCGTGTGTGGTACCTTTGG 

CTGAAGGTTCCTCGGGCTTTTCTTCCCGCACATTTCTTTTCATATTTCTTTTTGGGTCCAGAGTTGTGCCCAGTGAGGGGGAATTAATTTTTACT 

TGTTTGCCCAATTGGTTTTTTGTAGCCGGGCTCTGTCTTAAGCAGATTCATTAGCTTGGTAATAATGCATGATCGGATTATGTTTAACTGGAAAC 

GGAATCGGTGAGGGGTGAATCGGTGGGTTGGGCCACAAACGAGTTTGCCACGTGAGTGAGTTGGTCTATAAAGGTATTTAACATTTTGGCAAGAC 

ATCTTTCGCATCGATCTGATGTCAATTGATTGGGAAAATAGAAGAATTGAGAATGGCCCATGACATAACCCCAACTAACTTTCACACGAATGAGT 
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GGAGAACTGGGACTGGACTGTGGCTTCCAAGGAATTCCGTTGCTAATAAAAATTATAAGCCAATAAATCCTTAAAGAAAACAAACAAACAAACAA 

GAGGCTCAGGGCCCTGCAAGATCGCTGAGATCGGTGAGATCGGGATATTTGGAGAGATACGGGCAGATCAGGAAGATCGGGAGGATCGAGAGGGC 

CAAGGAACAGGTTGAGTTATCGCATTGCGCGATCGGTACACAAATCTCGGGACGTGTGCTCAGCAGGACACCCAACTAAACGGGCAACGGCGATC 

CATAAGTGGAATCTGAAAAGGTGTTGATCATAAATTATCATTTAATTTACGCATGTGTGTGGCTGTTGTTAATGTTCGTGGCGACCCTCTGACAC 

TCGAAAACGCTCGCTCGAGGATGGGGGAACCTGGACCTGGGTCGGTCGTCTCACCTCGGCTGGGGCTTGGGGTTCTCCTGCTCGTCTAGCCGACG 

TCTGGCTGGTTATTGTTGGGCATTGTCTACAACTATGTAAGATCCCAAAGATTAACCCAAAGTCCGAGCCCAGCGCTGTCATCGTTATAAGATAT 

TTTATTATCGCTCCGGCGAAGATCCCGAAACTCGAAGGTGGAAAGCTGTACGTAGCCTGTGTGGCGTGTGGAGCATTTTATACTGGTCGAGAATT 

GTTTTGACAGTTTCTTATAATTTTGTGGCCTTTTAATTGCCCGGCCCGAGGCAAATCGGCTGATTCCTGAGGAGAGCTAATCGCGTCCCCAGGCC 

CCCCGCCGATGGCAATCCAACTTTTATGTAATCCCCAAGTCTTTTCAGCACTTTGAGGTTTACACACTTTTCGTGGGAGCACCTAGAGGAAGCCC 

CATGGAAAACAATTGCTCAGGTCAAGTCGAATGAGCAGCTAAAGAAGTTTCCATAAAGAAACAAATATTTGTTTTATATATTTTCAACTATTGTA 

ACTCAACCCATTGGATACGCAAAGATTTAAGAAAGGAAGTGGAACTGAGAGCACCTGAACTTCACTTGGAGTTTTTTTTCAGGAAATTCGAATAA 

AGTAATGTATTGTAAATTACACCACAACAGTACTAATTTCATTTTCTATACCTTTTAGTAATGATATTAATGTAAATAACCAATCATCCTGTTTG 

AGTAATGCAATAAAGCAGCATGCCCAGTTCTTCATTTTCGGGAGCTTTCCATACTCCTCGTTCGCACCAGATGAGATCTAATTATTGTCAAGAGA 

GTGTACACTTGACACCCACTCCCGGAAAAGCGCGATGAGCGGTTGTCAGTGCGGATGGCAAGCGCCTCCGGGCGATTCCGTACCACCAGCCGGTC 

CATCATCACCAACAACATCATTATCAACGCCATCGGAGCACGGATCGGGATGGAGAGGGAGCGGGCATGAACTGGGACAGCAGCGAGATTGGGAA 

CGGGAACCGAACTGGCGAACTGAGCACACAAACACAAAGGCGAAGATGTAGAGTAGAGTAGAGAAGAGAAGAGACCCTCCGACGGGCAAAAGCGC 

AAAATACATAATAATAAAAACAGAATGTACAATAAAACGAATAAAAATGAATAACGTGCTGGCGGGGATTGGAGCGAGGGACCGGCGGTGGGCAG 

CGGGGAGTGCGGGGTTGTCAGGGCGAACGAATAAAGAACACACGCACATTTTGTGGTAACCCGATACCCTGGCCAGCTACCCTCCAGCCAACCGT 

GGCATTCGACGAACGCAAAATGATGACGATGTCTGGGGGAGATGCGGATTCGATGGGCAGGGGCTGGGGTAGGGGCAGGGGTAAGGGCAAGGGAA 

GGGGAGTGGGATGGACTTGGGACTGGAACTGGAAGTGGAACTCGGACTGGGTCGCCTCGGTGGTGAATACATACATGGAGATGCAGGGGCAGGGG 

AAGCTCGTCTGCAATTGGGACTGTCTGTGTGCAAGAGAGGGAGGGAAGAACGCTTAAGGTACTGGATTCCCGGAGAAATGGCTCATAATGCCTGG 

CAGATCAATCAAATGGCCCCTGAATTCTTCTCTTTCATAACGAGGATTCCTTGAAACAGCAAAGAAGCATCTCTGTCTTGGGTTAGATATTAACT 

CATTGGTTTAAGGATCGTTATTCTGCTAGAAGGTTTGGCCATATGGATGATATCTTGAAATACTAAGTTATATAACAAAATTTAAGTTTAGGACA 

ATGTTGAAACAGAAGAACATTTTGAAAATTTTTACATCCTTAATACAGGTATATAAGATTTGTAAAATTGTATCTTTTCAAGTATCTTTTACGAA 

TGTTTGGCATTTGGCTTATGCTACCTTTCAACCTGAGGTCATTAATCGGATATTTATGTAACCGAAAGTCCCAGAATGCCAGAATGCTATTGTAA 

AGG T T G G AAAG T T GAT AG GAAAAT AT C GG AT T AC GGC T AG AT T GT T T GGG T T AGT T CT T C AAAAC T T T T C AC C C AACT T T G C AT T AAAC AAT GC G 

ACGGAAACCCGATATTGTTATTGGCAGTATTTTCGAGCTCTCAGTAAGCAGTGCCCTTCAATTCAATTCAAAATTTACAAAATTCTTCGTAAAGA 

GAATATGCACAAGACCTGGAATAAGGCGTTCCACAAGCGCAAGCTGGTGAGCTCAGAGCGTAATAATGCATCAGCATGCGAAACTATTCCCAATT 

CTTCCAGTGGAGGAGCGTGTCCAAGCCCGGAAAGCTGGTGTACTACATGCAGCCGCTAATCGAGCACCTGTTCGACACCTGGATGCAACCACTGC 

CCTTTCCGACGCTCCTCAAGTTCATCTACAGCTGGGTGCTGATCTTCTTCATCATGATTCCGATGCTCTATCCGCTGCTCGTCCTCCTCTCCTAC 

TACGGCATATTCCAGTACGCGGCGGAGGAGCACTTCGGACTGAAGACGCCCGAGAAGTGGGACCTCCTGGGCGCCGCCGCCAGGCTGTGGCACTT 

CGAGGTGACCAACAGGAAGTATCTCCTGTTCGTGTCCATGTACATTGATCGCTACCGAGTCGTTTTAACCGCCATATCTTCGACAGTCGACTATA 

TGCGCATGGCTCTGTGGTTCGTCTTCAACTGAACCAACAGCCGGAGTCCTTCGATCCCTACTTGCGAATCCCGTTTGCGGGGGAGTTGGCTTAGT 

AACTGCCTTTGGAGTGGATAATTTTAGGTCTTTGGGTGGAAGTGTTTCGTACGAAGAACTTAACCAATTAAAAGTTGAGCTGTATTCTATTCTGT 

AATTCTATGGGTATGGATATAGGATGAATCGTTTGGTTCCATCTACCTAGTCTTTGAGCACTCTGTCCTCCATTCTCTTTTGGCCCAGCGAAGGT 

AGTGCCTATGCAAACAATAACCACTTTGCCAGTTGCTGGGCCGTTCAAAGTGCCGCTTTGGATTCAGATTTGGCGCCGTTGTGGTTTGCTGGCTG 

CCACAAAGCTGCATGGAACATAGATACACACACCCTCATGCAAAAGTAAAAAAACAGAAAACGAAAAGAGCGCAGAGGAGAGGAGCAAAGCGGAG 

TGGGGAGGGGAGGGGAAACAATATGAAAAAGTTAAAGTAGCAGCACACTGAGTTGCTCATCGAGGCAACAGCCATAAACATATATCAGGTTACCT 

CCAGGAGGAGCTGGAGGTATGCAGCTACGCAGTCCATAGTACGGCAAACTATCGCAGAGAACGAACGACGTGTATCTGTGAGATACTTTCGACGA 

CTGGCCGTGCATGGGTGTGTGCGCGAGTGTGTTAGTGTGCCTTGCGGTTGCCACGGCATCCAATTTATAATATTTTTTGCATTATTTCTGTTTTA 

CCATTTTTTATTTCCATTTTTATTACGACAATTTCTCGCTTTTCGCCTGCCGTTCGTGCGGCGAAACTGGGCGTAAGTGATATACCATTTAGCAT 

AATGGAACTATCCTGGCGAAAGTTTGGGACTTTTGCCAATTTGACCCGGCTAATGATTGTCCCACCCAACTGGAATGGCTAGAAAGACAGCGGAA 

TGGCCACTATCCCATTAGAAATTTGAATTTTCACTCTCTCGGCTGTTTGCACTTCACATGACTTCCAGCCGGATCGCAGGACAAACAAAAGGATG 

CTCGTCGGCCAAAGGATGTTTTATGTTTGAGAATCTAACTCCTATCTTTCGAGGATCTACGCCATCCCTCTGCCTCAAAGTGGAAACATCCGAAA 

GGAGTGGCTCAACTGCCCGAAACTATTTGCTTAAGCGGAAATGCAGATGTTCAGATGTTCAAATCGGGTGGCGGAAGGGAAGTGGAGGGAAGTGA 

AGTGAAAGGACTGGGGTTTTTAATTAGTGACGAGGCCTGCGAGACATATGCCGGAGATATGCGTGTACCGAAGGCAAACATCAACAAATCAAATA 

AAAATACAAATAAAATAAATATATAGGATCGTTTTGGAATCTGGAACAACATGTTAATGACTTTGTCAGCGACTCTTTCTTTCTTTCCCTCCGCA 

TTTAACCACATCTCGTGTGCCAGTTGCGTGTTGTCCTTGGCACTAATTAGCCAGCCCCCCCATTGAACGCCCACCCACCCGATACCGCCCACTCG 

GCGACCTTGAAAGTAAATGCCGTCAGCTTTTGTAATTGAAATTTTAAATGCAAATTAATATATTATTTTTTGCGGTGCCTTGCACGCGCTGACAC 

TTGTCAGATGTATCGCAAATAAAGCATAAATATACTCGTATGTGCAGATGCCCCCCTCCCCCTGCCACCCGCAACTCATATGTACGTATGAAACG 

CTGATATAAGCAATCTAGATATATCTATCTATCTTTGGAGTACGCCCAAAAATATCTCAGTTGTCCCCCCCCAGTGAATCAGACGGGGCAATCAG 

CTTAGATCTATAGTAATTAGTGATTTGGTGCAACTCACTCTTTGTGTTCGGCAGCTCGATCTCTTTGTCGTCGGTTCTTGAACCAATTGGAAACC 

TGTAAAAGCACAGAAATAGCTTAGATACATTTGTAAGCATTCGCAAGATTCTGCCCCATGATTAATGGGATGCAATTAACATCCTCTGCGAGGAA 

TTTCAATTGAAATTTGTCAAAAGATGCGTTTGACATTTATATTTCACTTCAGCCGCTCGCATCTCTGCTCTTTGCATCTTGGCATCTGGGCATTG 

CCATCGACTCACCTGCGTGGTGGTCAGTCCTGTGGCCTCGGCCAGATCGCGTTTCTCCCGCGGCGATGGATACGGATTGTGCGAGTACCAGTCCC 

TCAGTACGGAGCGCGATTTTTCCTGCGGATGAAAAGGTGTCATTTTAATCAATACGTATGGAGATAAACCAATTATAAGCCCCTAAAATCAATGT 

TGTCATATATCAAATTGTAACTAATATGAATACGAGTGAAGTGAAAGCACTTAATAGTCAAGCTTTTCTATCTATTAGAAACATATCTAAGAGCG 

AACAAGAACCTACAGTTCTCTTTATTTGTGAATTATTTTGTTTTGCGAGATCTTTTAAAATTAATAATAAACCTAGATTGTATAGAAAGCAAAAG 

GGTATTCCGTTTACTACCCAACTGTGTCAATCCTTTTTTCTTTATATTCCCAAATTGCGTCAAAACTGGATGATTCGTGGGGCGTAACTAGGACT 

CAGCCCACATTTCCGCGCGCTACTTGAGGCAACTGTAAGCCCGTCGAAGAGCAAACAAAGCGAATCCCGTCCAGGATGTGCCTCCGATAAGCCCG 

AAAGGCGTTACCCCCAGCGCAATCAGGCGGTTCGCAGTCCATGCAATCAGACAGGCGAGCGGGCATTTGTACGATTTCCGCGCTTGCCACAATTA 

AAGTGGAAAGCGGCCAGGACGACGTGGAGCTGGTGGCCCGAACTCACCTTAAAACAGTAGCTCGTCTCCTCGCCATCCCAGATGGTGCGGGGCAA 

TGGAAATTTACGGCGAACACGATATTTGCCAACAGCACCCAAGGGTCTTCCGCGCAGTTTTTCGGCTTCCACATAATGCGCTACAGTCGTGGAGA 

AGAAAGGAGGGGGAAGTCAGTTTGGTTGCGTTTCTGGCCCCAAAGATTTGGGCTTGGGCAGCGCAGCGGGCTGGAGGTTTGGAGGATCGGGGGGT 

TGGACGGTTGGACGGTTGGAGGCTCACCTTTCAACCACAGGGCCTGGAGCTTGGCGTGATTCTGGGCCGAGAAGTGGTGATGCTCGAGCAGGCGG 

TACAGCTCCTTGTATTGTCCCCGGTGGAATGCGACGACCGCCTTGGCCTTCAGCACGGACTCGTTCAGCTGCAGCTTATCACATTGTGGCAGCGA 

CCAGAGGAAGCGGCCCAGTCTTTCGATGTTGCCCGCCTGCTGGAGAACCTGTAGGTGTCCGGGTAAAGATATACATTATAATGTCATCTAAATAT 

TTCACTAAATCTATTGAAGCTATCATGACTAATGGTGATTTCCTTCCATAACTTATTCTTACACATTCTACTTATAGTTTGTGATTACCTCGCAA 

ACACAGGCCACCTGCTCCTGGGTGAAGCCGAAGCTGGGCAGGTTCTCCCTGCCGCCGGAGCCTCCACCACTGACCACTCCGCCGCCGCCGCCGCC 

GTTGCTGCTGCTGTTCAGATGGAGGCCACCGCCTGCTCCGCCGCCGTTGTGCGCCATGATGTCCAGAGTGCTGTTGTTGTTATTGCTGGTGCTGC 

TGTTATTGTTGTTGTTGTTGTGCAGGGCCACCGATCCGCTGAGACCGGTGGGACTATAAGGAGGCGTGTGACGGGCGGTGAGAACGGCAGCCGCA 
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TTGGCCGCGGCCAAGTCGTAGAAATCTGTGGCGGGATGCTGTAACATTTTCTGCGGGAATCGGTTGCCTGGCTGGATGTTGGTGGGGTGGATTTC 

AAGGACACCGCTACAAGTTGCAGAAGGGATCGATCAGTTGAACTGGTGAATGGTGTAACCTAAACTAGTAGTTGCCTAAATCTTTGGTGTTTTGA 

GCGCTGCAATGATAAAAGTAATAGAAAATCGCCATAAGTTTATGTTTGCATATTGAAAATACAAATATACAATGTGTGTACAGGCTTAAAAAGCC 

ACTGTATGCTAAATAGATAATATCTGTGTTGCTTGGAGCCATAAAAACCCACGATTGCAACACACAATTTTAGGAAAAACTCGCGCCACATTCTG 

CGTTTAGTAATTAGCCCCCTAGTATTCATCTTTTTATTACCCCGCTCGATTAATATTTAAACAGCTCTCTAATTGAGACACATTAGGCTGTGAAA 

ATCACCATTGAATGGCAAGTGATTCGAACAAATGCCAAACAAAACAAGCGCGTCTGCATGAGTTTGCCATATTCATAAGCATTAGAGATCCAAAT 

TCAAATCCAAATTGAAATCGAAATCAAAACAGAAGTGTACAGTACAGTGGAGAACCTCGATCCGACCCCATGTTGTCAAACATGATTAACCGAGT 

CCGGGGGCAATTATTAATGCGGCATGTTCGATGCTCGATAATCGGAAATACGGCTGGCCATTGCCATCGTGGAGCTCTCTAACCTGTCCGCACCC 

AACCGACCAAGAGATCTCTTACGAAAATAAACTCTTTACACAATATAGTACGTGTATATGGGGCGCATGGTGTTTCGGTATTCCTCCGATTTCAC 

ATTTCACATTTCCCATTTCACATTTCTGTATTCGGTTCTAAGAATATTTCTAAAATTACCAGGCGGCAGCGACGACGACGACCATGCCTCCGCCG 

CCATCGCGAGGGAGTGAGCGAGAGGCGCGAACGTATCCCCCAGATAGAATATCTATGAGGTATAGACGATGTTTTCACATTTCAAAACAATGCAA 

TCGATCTGTCCGAATAAACTGATATATGTAGATATATCAGCCACAGATATCTGCGTATCTAGGCCCCTCCCCCCCCCCCCCCACAAAAGCTAAAG 

CTAAAGATCGGGCTATATGCACACTGATTCTCGTGTATCGGAGTCGAGATATTCTTGGATCTGCTTTCGGTTTCAGTTCTCGTGTGTGCCGGCCA 

GATTATATATACAAAAAAACAAGGAACAAGAAACAAGAAAATAGAGATTCGTTCTGGCTGCCGCTGTAACGAAAATACCAACGCCAAGTTGGCTG 

CTCTTGAGTCATCGGCCTAGGCCAGATCCCCCGCAGCCGCCGCCTCGCCGAAGATTTCCATCGATTCGATTTCTTCAGCTCGCTGCGATTCTTCT 

TTCTCCGCTGCTTGTATTTTGTATTTTCGATGGCCTGTTTCATGTTTGACGCAAGTAAACAAGCAAAATGCGAACGTGAAACGCAATACGCGATA 

AATTTTCAATCTGTTTTGGCCAACAGCAATCCAGCAGTTCAGTTCACAAGTGGATAATATTCGATGGACACAATGCATTCTGGACATATTGCCGA 

ACTATTAGAGCCGATTTACGCCTCATTCGATCGGATTCTGAGCTCTTCAGCCACATTGGACCATAGCATACTGGCATACCAAGGTTTATGGCCCT 

GTACCGTTGCACGATCCCAAGTGGAAGTGGAAGTCCCTGCCCCAAAACGAAGAAGAACACCAGCAGCTACGACAGCAGCGGCCTCGAAAGATATT 

CTATCTGTATCTTTTGGCTGCCTCTTCAGTTCGCCTTGTGGGAGATCTGTTATCGCGCGCCTGTAGCTGCCTGACTTTTGTGGCATTCGAATAAA 

ATTAATTTAATTTATTTTACAAATTCTGTAATTTATTATCTATTGGTTTCTGTTTCTGTATCGTGTTTGCCCCATTCCCCTTCCCCTGTGCTTCT 

TCCTACTTTTATAAGATTTTAATTTGAATGTCGTGGAATCCCTGGAAATCTGAAAGCGGACACCTAACTCTATTGGGCTGTTAATCAAATTGAAT 

TGTTCGAGCTGCCGTGGCTGCAAGCGTTGTTCGCCTAATTAGTTTGTAATCAAAGTCAGCTTACGATTGACTTGTTGATGGATTCGATTCAACTC 

GCAGGGCAGTTCCAAAGCCCTAAGTCCTAAGTGCACTTCGGGAACTCTGCAGGTGAAGTGTGCCCCAGCTCGAGGAGCACCCAGCTCAACTTGGA 

GC AT G GT G C C AT C GC AAT C GC AAAAAG C AC AAAC AGT GC GCC T GT T T CT C GC GGC T AAC AC CGAAAT C AT CT GCTCCGGCGTCTGTGTT TAT AC A 

AT GT AAT C GAAT GG G AAT GGG AT T GAG AC T G T G ACT GGGG AT T GGGG AT T GG GG AT T G G GG AT T G AGG ACT GGG AAG AC GT GT G C C AT GC AAC C T 

GTTGTGGGCCAGCCCCCGTTCCGGGTAATTCGAACGGCTTTCTGGCCGCTGGTCGGGATTAGTAAGCGCTGCTGCTGCCAAACGGTCGAGACAAT 

CGCTGCTATCCGCCAGATACTATTTTTAACATTTTCCGTTTAATTGCCGCGTGAGGGAGCTACGCACAAAGAAATCGAATAGAGCGCGGCGCAAT 

GAATGAAATTTGTCAATTTAATGGGTTGCACACACATGTGTCTCTGCCTCCGCAGCCGAATGTATCTGCATCTGCATGTCGGCTGCTTTTCGCCT 

TGCTGCGCCGCATTTGCCAAACTTTGAATGAAACTGAAGTGCACTGGTCGAAAGTCCCGCCGGTGGGGAGCTCGTCCCCAAATTACGACGGATTC 

TGCACGCCGGTGCGTAAAATGATCAAAGCTGCTGCGGCCAAACTTGCGTATCTGTATCTGTTTCTGCCCGAGTCCCGAATCTAGGCGAATGGAAT 

GC GC AAAT T T G T C AAT T T GG C C AC T T C C GAG AT G GAAT C C AC AT G GCT G GT AT CT T GAT GAT CC GAAAT AGGC AACC AG AACC GT AC GC AAC AT C 
TGCTCCCATCTCTGCACTCAATTAATCATTCGACTTCTGACGACAAACACGCGTTGGCCAATTCGTTTAAACGCTCCCAATCCCCCGAATATCGC 
AAATATCGAATTTAGCCGGACAAAAGTGGGTCCACGCCCACTCATCCTCACACGAAGTACAAAAGTTGTCGGCCAAACAGTCGCTATAGTTAATA 
ATTTCATAAATTTTGTGGCGAGAAACGAAATGATAAAGCCAAATGGTTTCATAAATAAACACTGTGTGCGTTGGTTTCGAGCACCAAACGGTTGG 
AAAGCCTTTGCGATGAATGAATGACAACTGGAAACTACTTTTCCTTTGGATCAAATCCATCTCATATCTTTTCCTCTAATTAATAAATACATCTG 
AG GTGGGTGGGCCGTT GAGT AGC C AAC T TG GAAAC G T GG AAAAC T T AG AAAAAAAT T G AGC GGT GAAT AAAAC AAC AG AAT TT T TCTTTTGGTGT 
GTAAACAAGTTTCCTTTCCAAATATAAGTGAACATGCAATTCCCGTGCTTTTGCCTTCGTTTCGATTCCCTCAGAATTGCTTCTCAAGTGCCTCG 
AGTTGGTTTTCCGATGGCTTTTGGCCATTTGGAATTCGCCAATGCCACGCCGCCAAAAAGCGAAGTGCACTTTGGCTCTCAGCTCTCCTCATTGA 
AATCGGCAAAACGTTGTGAATCTATGGACTTCGATGATAGATTGATCCATTAGAAAGCGGATAGAGATACGCCGTGACATTTATGGACTGCATTA 
ATGTTTTCGCTGCACACGCTTTAAAGGCTATTTAAAGCACTAAAATATATTTTTAAATGAAGTCTTCAAATAGGCAAAGTGATTGATACTTTCTC 
AGAGAACTTTAGGCTTTTCTAATCAAGAATAACGCTTTGCTGTAAGTTCCATTTTAACCGTATGGGATGTTTAATAAAAGGTCTATGTTCTATGA 
TCTTATCTATAGGTGATAAGTGATTTCTTGAACATGTGGTTAGTTCTTCTAAGGCTTAGTTTGACCTATTGAAATTATTATTTTTCTGAAACTTT 
TGTATATATTTATGGTTTTATAATTTTAAGGTTCCAATATGGAAATGTATAATCTAATATTAGCTATATTTATTCACATAATCGTCCCGAAATTT 
ATCATAGTTATTATATCTCCTTTTTCAAGTATCAATGCCAATTTTATTTTAAATTAATCCCCCATATACCATTCGTTTTTCTTATTGTGAAAGAA 
TATCCGTCCTACTTATGCATCTACATATATTTTTGCCATGGTTTCTAGTGTTCCACAGTAAACTTTTGCCGTCGGCAGCATGAAAACTGAATCTC 
GCTGCTCACCTGTATTTTGATTATTTATTGTTGATACGTATCCGAGTTTGTTAGTTTTTGTTTGTTACTCTTTGTTTGGGTTTGTGCCTTTTGCC 
TTGGCTTTCCGATTCGACTTGTGCTATATCCGCAAATGTTAGAGTTGGCCAAGAAGGCGCTGATTTCGGCTGTGCGGCATTTCCTTGATTTGCAT 
TTTGGCCAAACTCTTCAAATCGAAGCCATGTGCCGTTCGCGCACGCACACACTCTTAGTCGCGATCACACTGGGCACTAACTAACACAAGCTTGC 
CGTTGAATATTGCTCATACGCACTGTGCGAACACGTGTTGCAAATTACTTTTCGCGAAAACTTCGCTGTCCTGCGCGACTCGGACAAATTGCACC 
TGCAAATTTGGTGGGCAAAGCTGCGACGACAGCACTGAGGCAGTGCCAACAAAAAATGGCCGCCCGCGGATTGATTAGCTCAATTTTGCTCGCTT 
TTCTCTCGCTTCGTGTGCGTTTATGTGGCGCGATACTGATGCGTTCATTCAGGAAACACGCAAATTAAATATAACTTTGTGCGCGGGACGTCAAA 
AGACCGCCGGATCCGGCCTGGATTACTAAGCGAGCGACGTGGGTAGAACGGGCAGCGAACCGCACGCGACGGAATCTCGAAAACAACTGAGAACT 
CTCCGGTTCTCAAACCTAATCGAACCGAAAACGGCCGAGACAAAAGGCAGAAAATGGTTGGTGCCGCTGGCGGTGTGCGTGCGCCAGAGCGAGAT 
GGCCGACTGGTCGTGCGAGTGTGTGAGTGAGCGAACACCAAGGGGGTGAACTGCACAGTGGGATGCACCGACACTTTGTGCGTCACTTTCCAAAA 
ACGTTATAGGAAAATAATGGAATCAAGTGAAGCGAGACTTACATTCACTGACTTTTATAAAGTCACTGGGCACACACAGTTCATGATTACACATA 
TATGAACTGATGATAACTCAGAATTCACTACGTTTTAGTTGTTTTCAATATCTTGCCTTTACAAGTGTTTTTATTTTTATTCTATTTTATTTTAT 
TGTATTCTAATGTTTTCGTAGTAAGACATAGTACATTTTTCCGTTATTATATGCTATTGTATCTTCAAAACATTAGAAAATGGTTTTATATATTT 
TTAAAGTGGCTTACAATTAAGTTAAGTTAAGTTAAGTTAATTGGACAAATATTTTTATAAACGTGTAGCCATTGAATTTGACACACGGTGCCAAT 

C G AC AAC AGT T T G AC AGC C AC T G C G AGAG GGG C AG C GT G C GAAAG AG C GAG AG AGC G CAT AAAC AG GT TG AC C G AAAG AAC GCC C CCTCGGCT T A 

CCTGTTCGCACACACACACACACACACACACACACATAGATGCCCTCACGCACACCGGCGCACTGGAGGACAGCAACGTGCTGATTGGTCAGCGG 

AAACGCATGCCAGCAAAAAGGCATGCGCATTGGGCGCCATATTGAAAGGCGGAAATCGGAGGTGATTGCAACTGCATCGCTGCTCCTCCCCAGGA 

ACTCCTCATCCTGGCTGAAAGGATGCCCGGTGCAATAGCTGCCATCCTACAATGCCGTTTGAATTACTCACCGCAATAGTTCCATGCATCTCTCG 

TATAATTGCAATTACTGGCCTTTTCTCATATTCATTCACTCCACTTTTTTGGTTTAACTCAACCTTGTGGTCCTTCCACATTAGTGCGGTTTGCA 

TTGTATTATTTAATGATTCCGAAGGAGATGAAGCCGTTTGGTAATTGAAGCCAACTAATTGTGTTAATGATTTACTAATATATTTAGTTTACTAA 

ATATTTACTATATACACAGGCTCCTTTCGTCTTCAAGTCCTTGTTCATGATAACTGAAATAACTTGAACCTAATTTAATTTCAAACTATTAACAT 

GATCTAGAGACACCTGTACAAAGTGCAGCGATGCGTTCAGAATGCCTAACTTTCTTTTTGAATACAAATAGTCAAC 

(SEQ ID NO: 235) 

Exon: 16841.. 15875 
Exon: 11688.. 11204 
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Exon: 11068.. 10858 
Exon: 10720.. 10593 
Exon: 9997.. 9893 
Exon: 9689.-9634 
Exon: 2152.. 1001 

Start ATG: 11542 (Reverse strand: CAT) 
Transcript No. : CT8839 

GAATTCTGAGTTATCATCAGTTCATATATGTGTAATCATGAACTGTGTGTGCCCAGTGACTTTATAAAAGTCAGTGAATGTAAGTCTCGCTTCAC 
TTGATTCCATTATTTTCCTATAACGTTTTTGGAAAGTGACGCACAAAGTGTCGGTGCATCCCACTGTGCAGTTCACCCCCTTGGTGTTCGCTCAC 
TCACACACTCGCACGACCAGTCGGCCATCTCGCTCTGGCGCACGCACACCGCCAGCGGCACCAACCATTTTCTGCCTTTTGTCTCGGCCGTTTTC 
GGTTCGATTAGGTTTGAGAACCGGAGAGTTCTCAGTTGTTTTCGAGATTCCGTCGCGTGCGGTTCGCTGCCCGTTCTACCCACGTCGCTCGCTTA 
GTAATCCAGGCCGGATCCGGCGGTCTTTTGACGTCCCGCGCACAAAGTTATATTTAATTTGCGTGTTTCCTGAATGAACGCATCAGTATCGCGCC 
ACATAAACGCACACGAAGCGAGAGAAAAGCGAGCAAAATTGAGCTAATCAATCCGCGGGCGGCCATTTTTTGTTGGCACTGCCTCAGTGCTGTCG 
TCGCAGCTTTGCCCACCAAATTTGCAGGTGCAATTTGTCCGAGTCGCGCAGGACAGCGAAGTTTTCGCGAAAAGTAATTTGCAACACGTGTTCGC 
ACAGTGCGTATGAGCAATATTCAACGGCAAGCTTGTGTTAGTTAGTGCCCAGTGTGATCGCGACTAAGAGTGTGTGCGTGCGCGAACGGCACATG 
GCTTCGATTTGAAGAGTTTGGCCAAAATGCAAATCAAGGAAATGCCGCACAGCCGAAATCAGCGCCTTCTTGGCCAACTCTAACATTTGCGGATA 
TAGCACAAGTCGAATCGGAAAGCCAAGGCAAAAGGCACAAACCCAAACAAAGAGTAACAAACAAAAACTAACAAACTCGGATACGTATCAACAAT 
AAATAATCAAAATACAGCGCTCAAAACACCAAAGATTTAGGCAACTACTAGTTTAGGTTACACCATTCACCAGTTCAACTGATCGATCCCTTCTG 
CAACTTGTAGCGGTGTCCTTGAAATCCACCCCACCAACATCCAGCCAGGCAACCGATTCCCGCAGAAAATGTTACAGCATCCCGCCACAGATTTC 
TACGACTTGGCCGCGGCCAATGCGGCTGCCGTTCTCACCGCCCGTCACACGCCTCCTTATAGTCCCACCGGTCTCAGCGGATCGGTGGCCCTGCA 
CAACAACAACAACAATAACAGCAGCACCAGCAATAACAACAACAGCACTCTGGACATCATGGCGCACAACGGCGGCGGAGCAGGCGGTGGCCTCC 
ATCTGAACAGCAGCAGCAACGGCGGCGGCGGCGGCGGAGTGGTCAGTGGTGGAGGCTCCGGCGGCAGGGAGAACCTGCCCAGCTTCGGCTTCACC 
CAGGAGCAGGTGGCCTGTGTTTGCGAGGTTCTCCAGCAGGCGGGCAACATCGAAAGACTGGGCCGCTTCCTCTGGTCGCTGCCACAATGTGATAA 
GCTGCAGCTGAACGAGTCCGTGCTGAAGGCCAAGGCGGTCGTCGCATTCCACCGGGGACAATACAAGGAGCTGTACCGCCTGCTCGAGCATCACC 
ACTTCTCGGCCCAGAATCACGCCAAGCTCCAGGCCCTGTGGTTGAAAGCGCATTATGTGGAAGCCGAAAAACTGCGCGGAAGACCCTTGGGTGCT 
GTTGGCAAATATCGTGTTCGCCGTAAATTTCCATTGCCCCGCACCATCTGGGATGGCGAGGAGACGAGCTACTGTTTTAAGGAAAAATCGCGCTC 
CGTACTGAGGGACTGGTACTCGCACAATCCGTATCCATCGCCGCGGGAGAAACGCGATCTGGCCGAGGCCACAGGACTGACCACCACGCAGGTTT 

C C AAT TGGT T C AAG AAC C G AC G AC AAAG AG AT C GAG C T G C C G AAC AC AAAG AC GG CT C C AC GG AC AAGC AGC AC C T T G ACT C C T C C AGCGACT C C 
GAGATGGAGGGCAGCATGTTGCCCAGCCAGAGTGCGCAGCACCAGCAGCAGCAGCAACAGCAGCAGCATTCACCCGGCAACAGCAGCGGCAACAA 
CAACGGCCTGCATCAGCAGCAGCTGCAGCATGTCGCCGCCGAGCAAGGCCTGCAGCACCATCCGCACCAGCCACATCCCGCCAGCAATATCGCCA 
ATGTCGCCGCCACCAAGAGCAGTGGCGGCGGCGGCGGAGGAGGCGTGAGTGCGGCGGCCGCTGCCCAGATGCAAATGCCTCCACTGACCGCCGCC 
GTGGCCTATTCGCACCTGCACAGCGTGATGGGCGCCATGCCCATGACCGCCATGTACGACATGGGCGAGTACCAGCACTTATGATTCTAGTTGGA 
GGCGGCCGCTGCCGCCGGTGGCACTGGCGGGGTCAGCGGCGGGGTCAGTGGCGGCAGCAGCAGCAGCTTGAGCGGCAGCAACATCGCCTTGCAGC 
AGCGACAACGCCACCAGCAGCAGCAGCAGTGCGAGAGCTTCTACTTCTGAGGTGGTGGTTGGCCACCAAGCAATATCGCCGGACTCCTTCGGCAC 
AGCAGCACAAACTTTACATATCGCCCGATTGCTGGAAGGCTGTCATGAGTGGATATTCAGCTTCCAAATGCTTGGCAAGGGGATTCTGGCTGGCG 
ATCCGATGGAGAACTACATGGCCCGGGAAACCATATCAAACCGCAGTGGCAATTGGGTCAGCCAACCCAGAAATTCAAGTATCGGCCATGACAGT 
ATTTTAGCCAATTGCCGATATGCCATTATAGCCCCACACTTTCCGGATAACATGGCATTTCCGCAACCAGAGATAGAGATATATAGAATTGGAGC 
TGCAGCACAATCTCCTGGAAGGATTCCAGGGGGAAACTATGAACTTTCAACTATGACTTCTGCATCGATCTGTGTAAATAGCTAGGCTTTAGTTT 
TTAGTGCGGTTGGGAGGCCGACCGAAATGCAAATCAAATCGAAACGTGCAAAAATTAAATGTAAACCTACGTACATGCATAAATTATACACTTTT 
AATTGTAGCCCCTAGTAATTCTATTTTTGTTAAAAATAAATTCTATTAAATTATTTTCAAATGC 
{SEQ ID NO: 236) 

Start ATG: 1114 (Reverse strand: CAT) 
MLQHPATDFYDLAAANAAAVLTARHTPPYSPTGLSGSVALH^ 

ENLPSFGFTQEQVACVCEVLQQAGNIERLGRFLWSLPQCDKLQLNESVLKAKAVVAFHRGQYKELYRLLEHHHFSAQNHAKLQALWLKAHYVEAE 
KLRGRPLGAVGKYRVRRKFPLPRTIWDGEETSYCFKEKSRSVLRDWYSHNPYPSPREKRDLAEATGLTTTQVSNWFKNRRQRDRAAEHKDGSTDK 
QHLDSSSDSEMEGSMLPSQSAQHQQQQQQQQHSPGNSSGNNNGLHQQQLQHVAAEQGLQHHPHQPHPASNIANVAATKSSGGGGGGGVSAAAAAQ 
MQMPPLTAAVAYSHLHSVMGAMPMTAMYDMGEYQHL* 
(SEQ ID NO: 237) 

Name: sine oculis 

Classification : transcription_f actor 
Gene Symbol: so 
FlyBase ID: FBgn0003460 



Celera Sequence No. : 142000013384695 

ACGAAGACTTCGCGCAGAGCCGAAGAAACGAGAAACCGCAACTCGCTTGGCCGCTCTTCGGAAGAGTGCCCGCGAGTGGAAGACTACGCAGCCAG 
TCTTCAGTTCGATTCTTCAAAATAGGTTTTTGATTTGGAGCTGCAAATGCTGTATTTTGCAAAAAGTTTCTTTTACTTGGCATGCTGATGGCAAC 
TCTGTTAGGCTGTAATATCAGGTGGAAGACCAACATGGGAAAATCTAAAAGTTGGTCTTTCAAAAGTTCTAAAGAATTGCAAACTGATGGATCAC 
AAAAGTGGCATTTGATGGAATTTTAGAGCTTATTCATTTAGATGTGTTACTATCTTTCCTTACAGTTTTTGATAGAGTATCCAATGTGCCACATG 
GAATTCACATAGATTCAGCCCCAATCGACTTGGGTCCGTGGGGAAGTGGAGCTACTAACAGAGTGCACTTAAACGGAAACAGTAACAATTGAAAG 
TGGCAATTTGGGGAGCGTACGCGCCTTAATTAGATTGCGGATTAAGATGGTCCCATCGAATCATATTCATGGGTGGATCCGCGGCGACTTCCACA 
TGGAGTCGACTCTAATCAGCAGCAATTACTGTGCCAATTGCCCATGCCGATTCCGCTGATTTCGGCTTTCGATTGAAAATGAGAATGGCTCGTTG 

G AAAAG T G G GAT T AGC AGC AAT GG AAAC AT GAT G GAG GAG A AT C GAAT CGAGGCAGTCTTTTCGATCGTCTG C GC ATC AT G C C AC AA AAC C AT G C 
CCCATGACCAGTGAAATTGTGTTTGCCGGGCTTGTTTTCTTTTATTTCGTTTACTTGCCTCCTCTGCGTTTTCACACTTTTCACGAATCGACGTA 
TCCATTGTTGGGCAACTTGGCTAATGGAATCGGCAAGCAGCCACCAGCGAACCATTAATTGAGCACGCTGGCGGCTTTAATAGTTCTAAAGCAGC 
TGGATATTAGTGTCAGAGTCGCAGTGAATCGCGCAGCGGTACTCCGCTTTAATTTTCACAATTAACTTTATTTATTCGATCATAAACATGTGCAT 
TTACGGGAGAAGAGAGCAGGACTACTTGGCTAGTATTTCGGCAGCTCGGAATAGCGCGGCGGCGTGGACCACAGCACGCTGGAGTTTCGCTTCAG 
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CTGGCTGGGATCGATGTTCGGGAATATCTGCTTTAGGTCCGGTGTGGTGGGCGCCACCGAGAAAAAGCTGTCGATGATGTCGGTTGGGCAGTGGC 
TCTGCATAGCAATTGCCTCTCCCCTTACGTGGCCTGAAAATAAACACATAAGTCAGAATCTATATTTAAGAACATTTCCTTTAGTAGTCCCAATG 
GAAATCAGACTGTAATCTGGTGTATTTACTTGTATATATTTACGATGAATAAGGTATCTCGAAAATAACTCTTCGAAATGATCCACAAACTTACT 
TCGACTCCAGGTGGGCGCTGGGGGACGCTTGTGGGTGACCTTTCCCTCGTCGTCTGTCGAGTCGTCTTCGTGCAGCATCTCCCAGGTGTACTTGG 
TTTTCGGCGGCGGCGGAAGCATCTTCTTGGGCTTGGCGCCGGTTAGCTGCTGCAGTTTCTGCTGCTCCAGCTCCTTGAACTTGGCCTTCTTGAGC 
GCCTCGGCTTCCTCTTGCTGGCGCATCTTCCGCTGCAGCTCCTCCAGCTCTTGTCGTTTGCGCTCTTCAATTCGCTTTCGCTCCTCCTGCTTCTT 
GGCGCGCTCGGCGGCCAGCTTGGCCATACGCTCTGCCCGCTCCCGCTCCTTGGCTTCGCGCTGCTGGGCAGCCTGCTGGTGCTTTTGCTCACGCA 
GCTTCTTGCGCTCCTCCGCCAGATTGTGCAGGCCCTTGCGGAAGTCGTCTCCGCTGTTCTCGCGCGACAGCGTTTTTGTGGACGTGGAATTGACC 
GACCTTAGGTTCGTACCGGTGGCCGTCTTCAGCAGACTAGTGGCCTTAGACGCGCTGGCAGGTCCCTTCTTGGGCTGCGCCGAGCTTAATGGTAG 
TGTGCTTGTCGTTTGAGTCGGTGTGAAAAAGCGACCAACAGTCGGAGCAGGCAGCTTGCCAATTTTGGGAGTGTTGTTTGATCCTGGCGTGCCCT 
GCTGAAGAGTAAAAAAAAATAACAATAAATGAATCACCTATCAAACAGTTCTAGAAAGCGCCAATTGATACAGAACTGGTGAATAGTCTAGCCTA 
ATTCTGCCTGCTATCTGCCTGCTATATATGTGCAACTATACTGTACCTACCAGATCTTTTCCCCGCTTAGGGCGCATGCTGTTCTGTGCGTTTGC 
CGCATTCTCGAACGCCTCCACGCGCATTTTCACTGGACTCTGAAGCAGTGGGCTGTAGAAGTGCAAAGATTGTTTAGAAGGATAACATGCATGAA 
GAGGCAGGGTATCTACTCACTTGAAAAGCTCATGTGTACGGGTAGGCAGTTTTATAGCCTTTATGCTCGAGAGCAGCTTGGTCGTCGGCAGCTTC 
TCCTCCACCGACTCGTCCTCCGTTAAGAGGCTGTCCTGCTTACTCAGAGGTTGTTTCTTGTCGTTGGAGTGCACCTCAAAAGTGGCATCTCCAGC 
TGGTCCTATTGGACTATCGCTCGTCGGCGCAGGACCCAGTACGAGGGTGGCGTTGGAAATCGCCACGGTCTCATTGACAGCAGCCGGAGCAACTG 
CAGCCACTGATTGCTCTGCAACGGGCGGATCTTCGAGTGCATCCTTGTATATGGATCGCTCCTGAACCGTTCGCGAATCGCTGCTACACTTGCGC 
GTGCGCGACGAGACTGGCGAGCTCTTGTCCAGGTCGCTGAATCGTTCCACCTTTATGGGTCGGTGGCTTTCCACGTCCTTCTTCTTCTTGCGGAC 
CTTCTTGGAGACCTCCGTGGATGTCGTCGTGTTGCTGGCCACCGTATCGTCCGGTCGCGCTGCCTCCGTTGTTACAGAGGAGACGGTGGTCACAT 
TTGCCGTGGAGACCGCATTGGTTAACGGGGGCACTGCTTCGGTACTGAGTTTCTCGCGCTTTACCTTCACGCGGAGTGTCTTCGTGGTGTTCACG 
TCCTCTGTGACTGAGGCCTCCTCCGGAGGCTTTTGCTGCTCAGCTGTTTCTGCTGCTGGCTCTTCTAGTTCCGGCTCCGCTAGTTTATTTTCCTC 
TAACGCCTGTGCACTGGTGAAGCTGTTAAACTGGGAGACTCGCTGCTCGCTCTCAACCTTGACCTTTACGAGCTCCTCGCTGCTGGGCCGCCGCA 
TCTTACGGTTGAGTGAGGGCTCCTTGAGTAATTTCTCGGTCTTCAGCTTAGCGGCTCGCTGTGGCCTTCCGGAGCCGAGGGTGGTGTCCGCCGAG 
ACGGGCACAGGCGGAGGCGGCATCAGCGAGGCCGTGGAGTTGTGCTCGTCCTCGGCGATAGCCAGCAACTGGGAGTTGCTGACCCGAGTGCTTTG 
ACGGGCCGACGTGTTGTTGGCCGTGGCGTCTGGAGCTGCTCAAATGGAATATCACAGGGGTTGTTGGTCGTTAATGCTTTAAAATGGCCAGGATA 
CTCACCCTCGGGCTCATTCTGATCCTCGGCCAGTGAGGTGAGTCGCTTGGGGCGCTTTTTCGTTTGCTGCGGCGTCACCGAAGAGTTCTCCTGAC 
TCTTCGCTGCCGGAGCACTGTCCCTCGGTTTGGCCTCCGCCTCCGGCTGGCCTGTGCCGTAGAAGAGGTGGTCCAGCTCCTCAAAGTGGGCCTAA 
AAAACACATATGACGCCATGTTGACAGTGTCGGAATTTCGTCTCCTGATCCTGGCTTACCTTGCGCAGGACCTCGAGCTCCCTGCGCAGGTCCGA 
AAGCGAGCCCAGCACGCCCAAGATGTCCTCCATATCTAGCGACTGTATCTGACAAACTGCAAATCCGGATTCGGTTCCTTCGATTTTATCGTTTT 
TTTCAAATGGCGGCAATTGCGACGCGCGTCAGCATTTGATAACATGCCTTTAATTGTGCACTGTTGACAGTTAACAGCTCGTCGGGTGCTGTAAG 
GCGCATTTATGCCTCTGCTTTCCAACACTGCACGGGCGATAAAGTTCCCGCGCAAAATGAAAAACACAACCAAAACTTGTTTTTTGAGTGAATAT 

T GG GAG AT ATT TAT T T G AAT T C T TT AAAAT AAT AT T ATT G GGT TAT T T AAAT AGT T AC GT AC T GGAT TTCTAGCCC T GT AAAT T T T G AAAGT AT A 
ACATTAGGTGAACCAGTTCCAATACGTTCCCGCCACTACCATCTCTAGAAAGCAGTGCTGCACGGCTCCCCAAAAAACATATTGTAAACAAATTT 
GAGGGAAAATGGTAAGGGAAAGAACAAAAAGGAACATTTAAATTATAAATATAGATTACGGAAGAGAATTAATTTGCGTAAAGGACCGGAAATAT 
TTCAAACAAGTTTCAGCAAACGTCCAGTGCTGCACAAGCACTTCCGACGATAGTATCGCCATGGCGAGCTGACCAGCAAGCCCATCTCTAGCACA 
TTTGTTAAAAAATTTGCAACAAGCCAACGAAAATCGACCAGGAACGGTGGACAGCCAAAACCGCGCGCTCTGCGACGACCACCGTGGGATGCACA 
TACGTATGCGCAGCCGTGCCGCGGAGTAGACACAGTGCCCGCGATCCGCGACCAGCGAACCGCCAAACCCCGAGAAGCCAGCTGCAGCAGATCCA 
CAGATCGCGCCGCAGATCCAGATCATGGGCCTCGACTTCGACGCAGTGGAATACTGCAAAGGGGTGAAGACCCAGCAGTCCCAGCCGCCTTTCGC 
CTGTCCCGTCCGCGGCTGCGACCGCAGCTACAAGACCATCATGGGTCTGCAATACCATCTTATGAAGTACGACCACGACAACCCACAGCCGCTCA 
CGCCCGTGCTGACTCCGAGCCGCAAGAAGGCGCGCTCACGCTCCGGCGGCCACCACTCCACGCCTCGTCCGCACAAGGATCACCCCA 

(SEQ ID MO: 238) 

Exon: 3932 . . 3765 
Exon: 3701.. 3521 
Exon: 3455.. 2396 
Exon: 2332.. 2236 
Exon: 2091.. 1425 
Exon: 1268.. 1001 

Start ATG: 3833 (Reverse strand: CAT) 
Transcript No. : CT8873 

AAATGCTGACGCGCGTCGCAATTGCCGCCATTTGAAAAAAACGATAAAATCGAAGGAACCGAATCCGGATTTGCAGTTTGTCAGATACAGTCGCT 
AGATATGGAGGACATCTTGGGCGTGCTGGGCTCGCTTTCGGACCTGCGCAGGGAGCTCGAGGTCCTGCGCAAGGCCCACTTTGAGGAGCTGGACC 
ACCTCTTCTACGGCACAGGCCAGCCGGAGGCGGAGGCCAAACCGAGGGACAGTGCTCCGGCAGCGAAGAGTCAGGAGAACTCTTCGGTGACGCCG 
CAGCAAACGAAAAAGCGCCCCAAGCGACTCACCTCACTGGCCGAGGATCAGAATGAGCCCGAGGCTCCAGACGCCACGGCCAACAACACGTCGGC 
CCGTCAAAGCACTCGGGTCAGCAACTCCCAGTTGCTGGCTATCGCCGAGGACGAGCACAACTCCACGGCCTCGCTGATGCCGCCTCCGCCTGTGC 
CCGTCTCGGCGGACACCACCCTCGGCTCCGGAAGGCCACAGCGAGCCGCTAAGCTGAAGACCGAGAAATTACTCAAGGAGCCCTCACTCAACCGT 
AAGATGCGGCGGCCCAGCAGCGAGGAGCTCGTAAAGGTCAAGGTTGAGAGCGAGCAGCGAGTCTCCCAGTTTAACAGCTTCACCAGTGCACAGGC 
GTTAGAGGAAAATAAACTAGCGGAGCCGGAACTAGAAGAGCCAGCAGCAGAAACAGCTGAGCAGCAAAAGCCTCCGGAGGAGGCCTCAGTCACAG 
AGGACGTGAACACCACGAAGACACTCCGCGTGAAGGTAAAGCGCGAGAAACTCAGTACCGAAGCAGTGCCCCCGTTAACCAATGCGGTCTCCACG 
GCAAATGTGACCACCGTCTCCTCTGTAACAACGGAGGCAGCGCGACCGGACGATACGGTGGCCAGCAACACGACGACATCCACGGAGGTCTCCAA 
GAAGGTCCGCAAGAAGAAGAAGGACGTGGAAAGCCACCGACCCATAAAGGTGGAACGATTCAGCGACCTGGACAAGAGCTCGCCAGTCTCGTCGC 
GCACGCGCAAGTGTAGCAGCGATTCGCGAACGGTTCAGGAGCGATCCATATACAAGGATGCACTCGAAGATCCGCCCGTTGCAGAGCAATCAGTG 
GCTGCAGTTGCTCCGGCTGCTGTCAATGAGACCGTGGCGATTTCCAACGCCACCCTCGTACTGGGTCCTGCGCCGACGAGCGATAGTCCAATAGG 
ACCAGCTGGAGATGCCACTTTTGAGGTGCACTCCAACGACAAGAAACAACCTCTGAGTAAGCAGGACAGCCTCTTAACGGAGGACGAGTCGGTGG 
AGGAGAAGCTGCCGACGACCAAGCTGCTCTCGAGCATAAAGGCTATAAAACTGCCTACCCGTACACATGAGCTTTTCAACCCACTGCTTCAGAGT 
CCAGTGAAAATGCGCGTGGAGGCGTTCGAGAATGCGGCAAACGCACAGAACAGCATGCGCCCTAAGCGGGGAAAAGATCTGCAGGGCACGCCAGG 
ATCAAACAACACTCCCAAAATTGGCAAGCTGCCTGCTCCGACTGTTGGTCGCTTTTTCACACCGACTCAAACGACAAGCACACTACCATTAAGCT 
CGGCGCAGCCCAAGAAGGGACCTGCCAGCGCGTCTAAGGCCACTAGTCTGCTGAAGACGGCCACCGGTACGAACCTAAGGTCGGTCAATTCCACG 
TCCACAAAAACGCTGTCGCGCGAGAACAGCGGAGACGACTTCCGCAAGGGCCTGCACAATCTGGCGGAGGAGCGCAAGAAGCTGCGTGAGCAAAA 
GCACCAGCAGGCTGCCCAGCAGCGCGAAGCCAAGGAGCGGGAGCGGGCAGAGCGTATGGCCAAGCTGGCCGCCGAGCGCGCCAAGAAGCAGGAGG 
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AGCGAAAGCGAATTGAAGAGCGCAAACGACAAGAGCTGGAGGAGCTGCAGCGGAAGATGCGCCAGCAAGAGGAAGCCGAGGCGCTCAAGAAGGCC 

AAGTTCAAGGAGCTGGAGCAGCAGAAACTGCAGCAGCTAACCGGCGCCAAGCCCAAGAAGATGCTTCCGCCGCCGCCGAAAACCAAGTACACCTG 

GGAGATGCTGCACGAAGACGACTCGACAGACGACGAGGGAAAGGTCACCCACAAGCGTCCCCCAGCGCCCACCTGGAGTCGAAGCCACGTAAGGG 

GAGAGGCAATTGCTATGCAGAGCCACTGCCCAACCGACATCATCGACAGCTTTTTCTCGGTGGCGCCCACCACACCGGACCTAAAGCAGATATTC 

CCGAACATCGATCCCAGCCAGCTGAAGCGAAACTCCAGCGTGCTGTGGTCCACGCCGCCGCGCTATTCCGAGCTGCCGAAATACTAGCCAAGTAG 

TCCTGCTCTCTTCTCCCGTAAATGCACATGTTTATGATCGAATAAATAAAGTTAATTGTGAAAATT 

(SEQ ID NO: 239) 

Start ATG: 100 (Reverse strand: CAT) 

MEDILGVLGSLSDLRRELEVLRKAHFEELDHLFYGTGQPEAEAKPRDSAPAAKSQENSSVTPQQTKKRPKRLTSLAEDQNEPEAPDATANNTSAR 
QSTRVSNSQLLAIAEDEHNSTASLMPPPPVPVSADTTLGSGRPQRAAKLKTEKLLKEPSLNRKMRRPSSEELVKVKVESEQRVSQFNSFTSAQAL 
EENKLAEPELEEPAAETAEQQKPPEEASVTEDVNTTKTLRVKVKREKLSTEAVPPLTNAVSTANVTTVSSVTTEAARPDDTVASNTTTSTEVSKK 
VRKKKKDVESHRPIKVERFSDLDKSSPVSSRTRKCSSDSRTVQERSIYKDALEDPPVAEQSVAAVAPAAVNETVAISNATLVLGPAPTSDSPIGP 
AGDATFEVHSNDKKQPLSKQDSLLTEDESVEEKLPTTKLLSSIKAIKLPTRTHELFMPLLQSPVKMRVEAFENAANAQNSMRPKRGKDLQGTPGS 
NNTPKIGKLPAPTVGRFFTPTQTTSTLPLSSAQPKKGPASASKATSLLKTATGTNLRSVNSTSTKTLSRENSGDDFRKGLHNLAEERKKLREQKH 
QQAAQQREAKERERAERMAKLAAERAKKQEERKRIEERKRQELEELQRKMRQQEEAEALKKAKFKELEQQKLQQLTGAKPKKMLPPPPKTKYTWE 
MLHEDDSTDDEGKVTHKRPPAPTWSRSHVRGEAIAMQSHCPTDIIDSFFSVAPTTPDLKQIFPNIDPSQLKRNSSVLWSTPPRYSELPKY* 
(SEQ ID NO: 240) 



Celera Sequence No. : 142000013384847 

AATGGCTTGCTGAATCCCTGACCTTTCCGCGACCCCTGCTCATTGCATTTTGCGGGAGAGGAGAGCCAAAGGCGACGACCGATTTTAACCTTGAC 

AGCTCTCCCAGCTTTTTTTACCCCACCTCACCTACGCCAACTGAGGCAATCACACACACACACGCTCGCAGATTATGGCTATTTGTTATCAATGC 

GAATTACGGCACTCGGGCGTTTTAGCCAACATTTCACTTAAGACTGTTTGTGCTTATTTTAATCACGTAGGCCACCGAATTTGCCAAATAAACAA 

AAAAAAACGCACATACAACGCACACACACAGACGCACGGGCAGCTATATGTGCATTGTTGTTGGTTTTTCCTGCGAGTATTCTCTTTTGTAAGAC 

GTTTTAGGGTTGCAACTTTTAACCCTATTGACTGTTTTATACCCTGCCTTGAGCTCTATTACGCCCCTTGTTTATAAAAACCGCGTAAACTGTGC 

AAAAAAAGGGAAAAATTCTTAACTCAGCACTAAAATTTCTTTTTGATAGCGATGGGCGTCGACCGATACTTAGCGTGCAGCGATTGACATGTACT 

CGTTGTACAGCACTATTACCGGCGGGCAGTAGGTATCGATAGCTGAACGAGCCAACGTTAGTGGTGATTATTTTATACTGGCACTTATATACTAA 

CGGTAATCAAAGCTTTTTATTTCGCTATTTTTATTTACTAGCTTTAATATTTAATTGCACAAGATTCGGAAACAAAAAGCTATTGGGAAAATTAA 

CTCAAAGCGGCCATTAACCCCATAAGAATTACTTTAATACAACTTCAGTCGATCGTTTATCAATTAAGCAAGGTCTCTAAGAATGGCAAATACCC 

TTATCGTCTATTCCAATACATTTATATATATATATTTTAAAGTCGATAGTCGATAGCGCGATGTGCCATCTCCATTATCAAAACTAAACAGATGC 

TGGGCGGAAAATCAACTGGTTCCTCGTGATAACTTAATCCCACTAATAAGCGCAATGTCGTCCAGTGAACGAGTGGCCAAAGTGGTGCTGGTGGA 

CATCGAGGGCACCACCACTTCGATTAGCTTTGTGCACGATGTCCTGTTTCCCTACGCCAAGCAGAATGTGGAAAAGTTCCTGAGGGATTCGTGGG 

AAGAAGATGATATAAAGCGTATTGTCCAGGATCTACAGCAAGTGCCCCAGTATGCGGACTACAAGGCCCTGTTAAGTGGTCCACCTACGGAAGTG 

GATGTAGATCTGATTGCAGGCTTTGTGCGCTATTTAATTGATCAGGATCTGAAGGTCACGCCAATGAAAACGCTGCAGGGCTTGATTTGGGCGCA 

GGGCTACGCTAATGGAGAGCTGAAGGGCCAGTAAGTTCCCCATTCAATATTCTCCTATCATGGCCATATAAGTTTTACAATCTGAACTCTAGAAG 

GTTATCAAAGAAAAGGCTTACTTAACGTCTGTTCCCAAAAATAATTTGGGTGTCTTTCTGAGATCCAATTATACAGCTTGGTGATTTTTTAAATA 

TGTATGCTAGCTATATTCTCCGCACGGGAATGTTTAACTTTTGCTGATTTCAATAGTTATCAGATAACATTCAACATAGTAACCAAAAGTACAAC 

GCAACAACTCATTATTATTGATAATTAATTAATTATCTATAAATGTTATTATAGTGTATACGAAGACGTCCCTGCAGCTTTTGAGGCTTGGCGTG 

CTGCTGGCCTCCAGATTGCAGTCTACTCCAGCGGCAGTGTGGCTGCCCAAAAGCTGATATTCGGTCATAGTTTGGCCGGGAACCTGCAGCCGTAC 

TTGAGTGCTTATTTCGACACCCATGTGGGCCACAAACAGGAACAGCAGTCCTATAAAAACATTGCCAAGCAATTGAAGGAGGACCCGAAGCAGAT 

ACTCTTTCTCACGGATATTCCAGGAGGTGAGTGGCTGATCCACACGAATCCTCATCCGTAACATTTTTAATGTGAAATTATATTTTAGAAGCGGC 

CGCCGCTCGCTGTGCGGGCTTGCAAGCCATAATTCTGAAGCGTCCTGGAAACGCCGCCCTAGCAGACGATCAAAAAACTGGATTCGAGTTGATTC 

CGGACTTTAAACCACTTCACAACCTGAAAGTTCCGGTCAATAAATCCCAGGCTTAGGTCATTCCAAAAAAGTATACATATATTGTGACTATGTGC 

TTGGGAATTAGGTGAATCTCTAGAGGAAAGCTCGATCTGCGCTGCACAGGTCGTCACCGCTGTCGTGGCTGGGCGTCTCGGTCGTGAGCCCCTGC 

GGCCGTCGTTCGGACCGCTTGCTTCGTTGAGACTGCCTCGAACAGGACTTTTCTGAGCCATAGACCTTACGCAGTTGCTGCAGCTTAGTATCCGC 

ATCGGGAACAGGAAACTCTGACTTTTTCTTCCTACGTTGCCGGCTCCCCTCGCACGAATGACGCAGTTCCTCGGTCCGGCAGGTGTGGGTGAGCT 

TGCCCACTTCTGGACCCCAGAACGTGGGTCCTTCCAGAAGAAGGGGAGCGGATTTGACCCGTTGACGCATCATCTCCATATGCAGTTCGTACTGA 

CGCTCCTGAAAAGCCTGCTCTGCCTTGCGAGTGGCCAACCGAATGGATAGCAGTGCCTTGTGGGATTCGCTAAAGAAAGAACACTGAATGTGTGC 

GAAACTTAATATTTGGTGAACTACTCACTTCAAGGACAGTGATTTCCAAGCCGGCGTTTTCCTCACCTCCCAATCGAATTGCGGACGAGGATCAT 

GATAAGTGCAGTTGTTCTTCAGCTCCTTAACCTTCTTCCTAGATACTTCAGCCCTCAAAGTTGCCGCTAGATTAGGACGAGATACCGGATGCAAC 

GGGGACACACTGGGCGTTCTCCTCTGGACCGGTTCCTCCGGCAACTCATAGATGCTCAGCGGTGGTAGTGGCTTCTCATCCTTTACGCTAGACCC 

TGGCTTTGGTTTCCTTACTCTTCCGCGTCTTCCTCGACCTCTTGCTGCGCTGGAAGCACTACTACTTCGCTTCGACGAGGTGGTGGGTGATTGAA 

TCTGTATCTTGGGAATTTCCTCCTGCTTGCTGAGTGGGAAATAGTTGCTTCGCGCAGGCTGAGCTAGTTTGGAAGGCTTCGAATGAGCCTCTAAG 

GCAATCCCTTG 

(SEQ ID NO: 241) 

Exon: 1001.. 1355 
Exon: 1671.. 1919 
Exon: 1988.. 2146 
Start ATG: 1005 

Transcript No. : CT8961 

CGCAATGTCGTCCAGTGAACGAGTGGCCAAAGTGGTGCTGGTGGACATCGAGGGCACCACCACTTCGATTAGCTTTGTGCACGATGTCCTGTTTC 
CCTACGCCAAGCAGAATGTGGAAAAGTTCCTGAGGGATTCGTGGGAAGAAGATGATATAAAGCGTATTGTCCAGGATCTACAGCAAGTGCCCCAG 
TATGCGGACTACAAGGCCCTGTTAAGTGGTCCACCTACGGAAGTGGATGTAGATCTGATTGCAGGCTTTGTGCGCTATTTAATTGATCAGGATCT 
GAAGGTCACGCCAATGAAAACGCTGCAGGGCTTGATTTGGGCGCAGGGCTACGCTAATGGAGAGCTGAAGGTATACGAAGACGTCCCTGCAGCTT 
TTGAGGCTTGGCGTGCTGCTGGCCTCCAGATTGCAGTCTACTCCAGCGGCAGTGTGGCTGCCCAAAAGCTGATATTCGGTCATAGTTTGGCCGGG 
AACCTGCAGCCGTACTTGAGTGCTTATTTCGACACCCATGTGGGCCACAAACAGGAACAGCAGTCCTATAAAAACATTGCCAAGCAATTGAAGGA 
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rGACCCGAAGCAGATACTCTTTCTCACGGATATTGAAGCGGCCGCCGCTCGCTGTGCGGGCTTGCAAGCCATAATTCTGAAGCGTCCTGGAAACG 

CCGCCCTAGCAGACGATCAAAAAACTGGAT^ 

TAG 

(SEQ ID NO: 242) 
Start ATG: 5 

MSSSERVAKVVLVDIEGTTTSISFVHDVLFPYAKQNVEKFLRDSWEEDDIKRIVQDLQQVPQYADYKALLSGPPTEVDVDLIAGFVRYLIDQDLK 
VTPMKTLQGLIWAQGYANGELKVYEDVPAAFEAWRAAGLQIAVYSSGSVAAQKLIFGHSLAGNLQPYLSAYFDTHVGHKQEQQSYKNIAKQLKED 
PKQILFLTDIEAAAARCAGLQAIILKRPGNAALADDQKTGFELIPDFKPLHNLKVPVNKSQA* 
(SEQ ID NO: 243) 

Name: ENOLASE-PHOSPHATASE-like 
Classification : proteinjphosphatase 

rplera Seauence No. : 142000013384830 

AAAT T AGC T CAT AT T G T CAGGC T AG AGC G AATGG AGGT C AT AT GG AG AT C GT C GC GT GT T CAT TTTTTGCTT ACGT GAT G AAT G TC AT G AAAC T G 
AAAGCTTTCGATCTACTTAAAAATCAAATTTAAAAGGTAGCTACATTAAGATGCAAAAACTATTGAGTTATTAATACACAAACAAAGCAAACAAC 
AAGGTGCATCGCAATTATCCTGATGATCTTGACAACAAGCGAATTAGATTAGATAGTCAGTGCGACCACTAAGAAATTCTATGGGAATGCCGTCA 
GCTTCCAAATGTAAAGTGACTTGCATAAGTCAGCACAGTCCCCTTATGTTTGGGGTAGATTTTGTAGGTCAGGTGACAAAGGTGAACCAGGTGGC 
ACTGAATGGAGCCGAACGGCCAGCCGGGTCACGTTTCCCCCAAATCCCACAATAATGGGGTTGGCGCTGTCTAGAACCGACCTTGGCTATAGCAA 
TAGTCTACTATGCAATAAATAGTGTGCACTCGTAATGCGTTCCCAACAGGGGTTTCGGTACAGAAATCAATATCACCGCAATGGCCATCGGCAAT 
TGACAACGGACAATGATTGAATGGGGGATAATGGGGCCTAGCTGGGCAGACCCCAACATCGGAATGGCATTTGTACAACTTCTGCCATAAACTAT 
GCAAATCTGGCGTCTCCTGGGATATCCGCCGTTCGCAATAGCAAAATGTCGACTGCATTTCCGGCAGCCAATTTAATTAGTTGTTAGGCAAACAT 
TCGGGTAATCAGTGGCTAAACAGCTTATTTAGCTGCATTTGGTCATTCAAGTTGGATTATGTCTGTGCAGCGTAAATATGCACAAAATGTTGCAC 
ATTGTTTCGGCTGAAATATAATATTATCTTACAATGAGACATAGAAGCAGCGTTTAACGCGCACAAACATAATGGGCAACCGACATTTAAACAAT 
CACTATAGCACTGAAAAAGTTGACAACCTCAAACTTCATACGTTTGGCAGCTCGGCCAGCATTTCAGTTCATTGATATACAGAACATTGTTCAGT 

CT AC AT C AAT T T T GAAT T G T G AAT AT TC GGT AAAG AT T AT TGT G AT AAAAATG GAAAC C C C AAGC G AGTT AAC T GAG AAAC AG AAC C AAG AC C AA 
CCTACCTACCAGCCGCGACTGAACGAGGTGGCCCAAAAGTTTCTCGCCGATTTGGACGCGGAGCGCAAGCGATTGTCCGCGGAGTTTCCACTTTG 
CGCACTGCTAATCGACGAATGTAAAATCAATTTGGCTCTTTACAGGGTATTTTAATGGTACTTATCCCAATATTCCAGCTGTGGACCGTGTCTAC 
TCCACTGGTCGCATTCCCGGCAAGGAGCCCTTCGCAGACGTGTACCAGCAGAAGCCGATGAAGATTATCCAAAAGGTCTTCGTGCCTGTTAATAA 
GTTTCCCAAGGTGCGTTTTATACAATTAGCTGTCCCCTCCGTCCGCAAAATAAATCCGTACCCTTGCGCAGTTCAACTTCACTGGCAAGATCCTA 
GGGCCCAAGGGCAACTCGCTGCGTCGCCTTCAGGAGGAGACCCACTGCAAGATTGTCATCAAGGGTCGCAACTCGATGCGCGATCGTAACAAAGA 
GGAGGAGCTGCGCAGCTCAGGCGACCCGAGATATGCCCATCTTCACAAGGATCTCTTTTTGGAGGTCAGCGCGGTGGCCCCTCCGGCGGAGTGCT 
ATGCACGGATAGCCTACGCCCTGGCCGAGATCCGTAAGTACCTCATTCCAGACGATAACGACGATGTTTGGCACGAGCAGCAGCGCGAGCTGATG 
GAAATGAATCCCAAGTCGGCCAAGAAAAGTAACGGACTCAATATGGCGCCCTACAGGTAATGGTCATCTTATTCTCTCTGCATACCGATGCAACT 
TGACTTACTTGACAAGCAAGTTAAGCATGTGGTATCCATCTGCTTGCCATTGGGCAAAGGGTAAAAATCCATTGTTATTTTGCAGATCGAATTTC 
GACAAGGCGATTGGAGCCATCAGGAATAGGGCTCCCAAGTACAACAACCAGATTAGGCGAGTTACGGAAAATCCGCGCGAGTAAGTTACTTGTAA 
AGTATAATTTTGTATCATTTAATGACCAAACCAGGAGTTACCGCGTCCTAGTAGTTCCTTGAATCCTAGCCAACATGCGTTAAACCAGCATTTAA 
CATTTCGATCTTTATAGAGTCGCCTATATGGAGGAGGTGGAATATGATTATGATGAACATCTTATGTTCTGCTCCCCAGTCGTCCGTCTATAGGC 
TATGAGTATAGCAAACGTAAGTGTTTGGTAGTGCCGGGGTCTGTTACATAAGATAGTTTTATGAGATGCACCTTAGATGGGAACCTTGGGATTTC 
CATTAGATTATATTGTACTATCAAAACCAAAAACAGTGCATGAGTAAGATCTCAGATCAAACTCTATAATATACTAGAATGACTAATATTATACA 
ACGATTTACTTCAGTTTAGTAATTACAAATAATAATCAACCAATTTTATAGCACCTACATCAATGACAGCGACAAACGCTACGCCATTTAAACGT 
GGATATCCGTATCCAAGTATAAGCCCAATCCTGCTATGTGGACTGACTGAGTTTATACAATCCAAGGAAGCACATTAAAGAAAGCAGCATTTAAA 
ATGATTCTTTTAATTAATAAACAAACATTTATAAAAAGCGCACTTGTTAAAATTATATCCTGAGTTCGTAATTTTCTATGAATTTAAAAAACTAA 
TAAAATGAAAGGCAATGCATGAATTTTATAAATACTTTGTACTCAGATCTAACAATATCTTTTATATTCTTACTCCGTAACTTACGCCGACACGA 
AAGCTTAATTTATTCTTTAACCGAAAAGATATTCCATGCTTTCAAGCTTTTGTGTAAAATACAATGTAAAATCGAAGTATGTAACGTAAGGAAAA 
TGTAAAAAGTAAATAAAACCCAACGAAAAATGTTAAATGAATTTATATGCAAATTAATATTCAGTCATATATATATATATGTCATATATATGGCA 
TATATGTCATATATCATTATGGAACATTGACTAATATCTTTAAGATCTCTTCTGAGAAATCTTGCCAAAAGATTTACGGGGTTTCGTGTCAGTTT 
GAAATCGAAAGACACTATAGAAAAATCGAAAGTGGATCAATAAACTTTATACGGCGCCTTCGTGACTGATAATAATCAGTCCATAATCACGCGAT 
TTATATCCTTCGATCGGTGGCACACTTCAGTCACATGCCACATGGATCGTAAAACGCAATCGAGACTATCGCTATGGACTTTGGACA 

(SEQ ID NO: 244) 

Exon: 1001 .. 1255 
Exon: 1314.. 1435 
Exon: 1497.. 1861 
Exon: 1986.. 2075 
Exon: 2222. .2317 
Start ATG: 1096 

Transcript No * CT8995 

CTCGGCCAGCATTTCAGTTCATTGATATACAGAACATTGTTCAGTCTACATCAATTTTGAATTGTGAATATTCGGTAAAGATTATTGTGATAAAA 
ATGGAAACCCCAAGCGAGTTAACTGAGAAACAGAACCAAGACCAACCTACCTACCAGCCGCGACTGAACGAGGTGGCCCAAAAGTTTCTCGCCGA 
TTTGGACGCGGAGCGCAAGCGATTGTCCGCGGAGTTTCCACTTTGCGCACTGCTAATCGACGAATCTGTGGACCGTGTCTACTCCACTGGTCGCA 
TTCCCGGCAAGGAGCCCTTCGCAGACGTGTACCAGCAGAAGCCGATGAAGATTATCCAAAAGGTCTTCGTGCCTGTTAATAAGTTTCCCAAGTTC 
AACTTCACTGGCAAGATCCTAGGGCCCAAGGGCAACTCGCTGCGTCGCCTTCAGGAGGAGACCCACTGCAAGATTGTCATCAAGGGTCGCAACTC 
GATGCGCGATCGTAACAAAGAGGAGGAGCTGCGCAGCTCAGGCGACCCGAGATATGCCCATCTTCACAAGGATCTCTTTTTGGAGGTCAGCGCGG 
TGGCCCCTCCGGCGGAGTGCTATGCACGGATAGCCTACGCCCTGGCCGAGATCCGTAAGTACCTCATTCCAGACGATAACGACGATGTTTGGCAC 
GAGCAGCAGCGCGAGCTGATGGAAATGAATCCCAAGTCGGCCAAGAAAAGTAACGGACTCAATATGGCGCCCTACAGATCGAATTTCGACAAGGC 
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GATTGGAGCCATCAGGAATAGGGCTCCCAAGTACAACAACCAGATTAGGCGAGTTACGGAAAATCCGCGCGAGTGGAATATGATTATGATGAACA 
TCTTATGTTCTGCTCCCCAGTCGTCCGTCTATAGGCTATGAGTATAGCAAACGTAAGTGTTTGGTAGTGCCGG 
{SEQ ID NO: 245) 

Start ATG: 96 

METPSELTEKQNQDQPTYQPRLNEVAQKFLADLDAERKRLSAEFPLCALLIDESVDRVYSTGRIPGKEPFADVYQQKPMKIIQKVFVPVNKFPKF 
NFTGKILGPKGNSLRRLQEETHCKIVIKGRNSMRDRNKEEELRSSGDPRYAHLHKDLFLEVSAVAPPAECYARIAYALAEIRKYLIPDDNDDVWH 
EQQRELMEMNPKSAKKSNGLNMAPYRSNFDKAIGAIRNRAPKYMNQIRRVTENPREWNMIMMNILCSAPQSSVYRL* 
(SEQ ID NO: 246) 

Name: KH domain RNA binding protein 
Classification: RNA_binding 



Celera Sequence No. : 142000013384830 

TGACTCCCCACAAAAACAAAACACTCAACTGATTGCTCTCGGGAATATAATCTTAGAAGTACGTCCAGAATTTCTGTGGTCACTGGAACATGTGT 
GGCACATGAGATTGCCACCAGACCCCAATACACATGCGTTTCATATAGATCCTATTGCTTTCATTAATGTATACACAGCGATTGTGGGGTCTGTG 
AAAGGGGTACAATTGCACATATGTACGTCTTCATATATATGTACATATACGATTCCCCACATTTGAACGATATGAGATCGTCTCTTCAGTTCTTG 
GTTTCTCTCGCTCTAAACAAAAGATTGGGGGACGTGAAGTCTAAGGATTGGCAGACATATGCATTTCTAGACAATGTTGTTGCATTTGAGTGCAT 
TCGCTAATTGTAAAAGTTGCCAATTCATCAGCGTCTGAGATCTGAAAAATTATGCTATATATAATTAGTTACTAGAGGGCTGGTTTCGGGCTTTC 
ACCTCTATATACATGGGATAGTATAGTATGGTATGTATATAGTAATCAATATTTAGTCCAAGGGTTGTGAAAGGAAACTTTCTATAAAGAACAGG 
ATACTTGATCGAAAGATGAGCCAGTGGCACTTATTTAAGTATCAATCGACCCGATTAACTTCTGCAGAGACAGTCATGCACCTGTTCATTCAATC 

AT AT AC AAT T AGT T C C C AGC AGG C C C AAAC AAAC T G AAT AAAT AT AC AAAT GAAC TCGTCTGTGTGTTCT T T AAT T T GT C GC AAGT GT GC AAAT C 
AGAAAAATGTGACGAACCGAGCGTTGTTTATCAACAACAAAGAGAAAAAACAGACAGAACATGGGCAATCTAAATCGAGAACGGCACAACATACA 
AAGGCAAACAAATTGGCGAAGAAGGCACAAACCGGTTGCCACTAAGGGAAGTCAGAGCCAAAAGAAAGCGATGGAACCAAAACCGGTTCATTGAA 
CCTGCACACCACCGGCATCATGGTATAGACACACAGAACGATTCCAGCTCTTTATCACACAAAAAATCATTCTTTATTTATGCCATTTCTGTTTT 
TAGTTGAGCACAATTACTTGCTTACGGAACGAAATGTAAATAAACATGGCGAGCTATGGTAAATGCAAATGTCTGGAAAAATCGATTATGCGTTT 
ATGCTTAGTTCAAATCGGTTATGTTGCTGATGGATGACATGCTCCCCAAACACTTTAGTCGGTGGCATCGGCCTTGGCCACGGAGAAGGCCTCCG 
AAACGGGCAGGGTGCCGCTGTTGGCGATGACCACATCCTTGACGGGACGGTCGCGGGCATCAGTGGCCGAGTTCTCGATCTGGCGCACCACATTC 
ATGCCCGACAGGATCTTGCCGAAGACGACGTGGCGTCCATCCAGCCAGCTGGTCTGCTTGGTGGTGATGAAGAACTGCGATCCGTTGGTGTCCTT 
GCCAGCGTTGGCCATGCTCAGCCAGCCGGCGCCATAGTGCTTCAGCTTGAAGTTCTCATCCTCGAAGCGCTCGCCGTAGATGGAGCGACCGCCGG 
TGCCGTCGCCCTTGGTGAAGTCACCGCCCTGGATCATGAAGTCCTTGATGATGCGGTGGAACTTGCTGCCCTTGTAGCTGCGGGGTATAAGAGCG 
TTGCATGATGGTCAGTTTTGTCAGCATGGGAATTTAAATTTAAAAAAAAACTGGGTTTAGAACTCGAAACCTAACCAAAAAAGAACATGCATACT 
TATAAATTCAGAGAAGAAAAAAGAAATACCCAAGAACTGCACTTTTATAGTATGGTAGACTTAATAACAGTTGGCGTTCTTTGTTAACCCATTAG 
TGCCGCTTTGTCAACTGATGTATGCGATTGGCCGCTCAAACGCCACGTAATTGCTTCAATCTGAGAGCCCCGCTATAACCTACAAAGTGCCGCTG 
ATTAGCGGCATCTTTTGCTGGCCTAGCGGTACCTGGCACCCTACTATGTTGGGGTATTTTCGGTACTCAAGTACCCTACTCACCCCTCGCCCTGC 
GGCTTCAGCGCCAGCTCCTTGAAGTTCTCCACCGTCTTGGGCACCGTCTTGCCGAACAGACCGATCTCGATGCGACCAGCGGGCTCGCCGCCAAT 
GGTGATGTCAAAGAAAACCTACGCAATCGGAAGCGGTTTATTATTAACAAGTCATCTCACTTGATTTATGGCGCAGCAAATGTGTGTGTGTGTTT 
GTGTGTGTATGTACGTGGTGTATGCAGAATGTAAGGACGAAAGTCCAAGGAGCACGCACCTTCTCGGTCACTTTGGGACCCTTGCTATCGTCGGC 

AAC AAC GACGC C GGC C AC C AGGGCT AC C AC G AAAAC G GAT AAG AAC AGCT T C AT C G T T C T AAAT AT T C AC AGC G C T T C AG T T TAT T CCT C AAGC G 
ACGTCGCCAATGGATATTTCACGACTGAATTTTCGGTTTCTGTTAGTGATGTAAGAAAACTTTGTTGCATCGCCAATCGATGTTTTCCATCGTTT 

TTTATGTTTTTGTC G AAAT AT C GAT G CT T AAGT GT GAC C AT T C GAT G AC GAT GT CT G AT GAT C AC AC AT T C G GGC GGAAAAT T C AAAT T C AAAT A 
GTTAATACGCCGAAAGCATAAGTTAAGGTTTTCGTACGAAGGAACAAAGTAAGTTCGAAGAGTTGACTTTATTGGTATAGTTTTTAATATTTTAG 
GGATTAGAAGTCCAGTTTGCATTAGCACGTAGCATTCTCTCTTGTTTTATTGACGGTTCTGTACACAAAAAGTAAGTAAAAAAATATGCTTTTAT 
TTATTGGTGTACTGCTATTTGCGTCTGAGATTTGTCCATATCCTTGGCTGCCTTCTCTGCTCTCTTCTTCTCCCTGCTGGCCAGAACTTTCTCCC 
TGGATTTCTGGGCCCTTTCAGCGCAATCCTTGGGTCGTGTCTTCAGGGTCATATCATCCACGGCCGCCTGTTTACGGGCTGATCTCGAAATGCAT 
CGTTCCATGCAGCTGATGGGCACAAATCTTCCAATGAAGCAACCGCAGTGTGAACAATAGTGATTGATGTCCTGGCGGCCAGACCATTTTTTACT 
GAAATAAAGTGAACATTACGAAAGATTTATAAAGCTTCATGCAGATGGCCAACCAAAGTTTTGTGAATCCCAGAAATCTCTGCAGGCAAGTGACC 
AACTCCAGTTGCACCAAACTGGTTCCAGACTTCTCACACGAAGGACAACGGATCCAGGTGGGCTCCTCCTTCAGGTATCCCACTTGTGGCTTGTG 
GCTGACAATGATGGCCACAATCTGCGGTTCGTCCACGGTCATGATGCGAGTTCCAGCGAGAACGATCGGTGACTGAAAGATGGATCAAATTTGGA 

GAG ATC AT G GG AGGG AAAAT C AAA AT C AG A AT CACTCTCC CG ATT T C T C T T AAC C GAGG 
(SEQ ID NO: 247) 

Exon: 2384.. 2245 
Exon: 2108. .1984 
Exon: 1597.. 1001 

Start ATG: 2334 (Reverse strand: CAT) 
Transcript No. : CT9017 

TGGCGACGTCGCTTGAGGAATAAACTGAAGCGCTGTGAATATTTAGAACGATGAAGCTGTTCTTATCCGTTTTCGTGGTAGCCCTGGTGGCCGGC 
GTCGTTGTTGCCGACGATAGCAAGGGTCCCAAAGTGACCGAGAAGGTTTTCTTTGACATCACCATTGGCGGCGAGCCCGCTGGTCGCATCGAGAT 
CGGTCTGTTCGGCAAGACGGTGCCCAAGACGGTGGAGAACTTCAAGGAGCTGGCGCTGAAGCCGCAGGGCGAGGGCTACAAGGGCAGCAAGTTCC 
ACCGCATCATCAAGGACTTCATGATCCAGGGCGGTGACTTCACCAAGGGCGACGGCACCGGCGGTCGCTCCATCTACGGCGAGCGCTTCGAGGAT 
GAGAACTTCAAGCTGAAGCACTATGGCGCCGGCTGGCTGAGCATGGCCAACGCTGGCAAGGACACCAACGGATCGCAGTTCTTCATCACCACCAA 
GCAGACCAGCTGGCTGGATGGACGCCACGTCGTCTTCGGCAAGATCCTGTCGGGCATGAATGTGGTGCGCCAGATCGAGAACTCGGCCACTGATG 
CCCGCGACCGTCCCGTCAAGGATGTGGTCATCGCCAACAGCGGCACCCTGCCCGTTTCGGAGGCCTTCTCCGTGGCCAAGGCCGATGCCACCGAC 
TAAAGTGTTTGGGGAGCATGTCATCCATCAGCAACATAACCGATTTGAACTAAGCATAAACGCATAATCGATTTTTCCAGACATTTGCATTTACC 
AT AGCTCGCCATGTTTATTTACATTTCGTTCCGT AAGC AAGT AATTGTGCTCAACTAAAAACAGAAATGGCAT AAAT AAAGAATGATTTTTTGTG 

T GAT AAA 

(SEQ ID NO: 248) 
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Start ATG: 51 (Reverse strand: CAT) 



MKLFLSVFVVALVAGVVVADDSKGPKVTEKVFFDITIGGEPAGRIEIGLFGKTVPKTVENFKELALKPQGEGYKGSKFHRIIKDFMIQGGDFTKG 
DGTGGRSIYGERFEDENFKLKHYGAGWLSMANAGKDTNGSQFFITTKQTSWLDGRHVVFGKILSGMNVVRQIENSATDARDRPVKDVVIANSGTL 
PVSEAFSVARADATD* 
(SEQ ID NO: 249) 

Name: peptidyl -prolyl cis-trans isomerase (cyclophilin) 
Classification: chaperone 



Celera Sequence No. : 142000013384827 

CCCTGGTATCCGAGATCGTCGGTGGGATTACCAACAAGTGCCGCCAAATAATCAAGCGCTCCCGCAAGTCGGCCGCCTCCTCTTGGTCGTCGGCG 
TCAAGTGGGTCAATGCTTCGCACTAATGCCGAGCAACTTTCCCATGATAAGCGAAAGGCCAATAGACGCGAGGCTGCTGAGGTTGCTCAAAAAAT 
GAGTTTTGGAATGCGCGAGTTAAATCTTACCCGCGCAACGCTTCGGGAAATATACGGAAGCTACGGGGCACCTGAAACAGATCATGGTCAGCTAG 
ACATCGTCCACACTGAGTTTCCAAACAGTTCTGCAAAATTAAATAATATTGAAGACGAAGAATCTCGGGAAGCTCTTGAATCTCTGCAGCGTTTA 
GACGAATTTATGGACCAGATGGATAACGACGGCAATCCTTCCTCACATACATTCCGTATTGACAATTAATCACAGAAATTTTTAAATGCAAATAC 
AAAATAATCACTTTAGTGCACATGAAATTATATGTATTGAAATAAAAAGTTTGATGCAACAACAAATTTTACAGTTGTTTGGTATTTTGTATGGG 
AAAGTTTGTGGCAATATATGGTTTTACTTGAAAGCGGATTGGACATGGTCACACTGCCCACAAATCACAACATTGCGTTATAAATGAAGCACATC 
TAGTTTTGTGTTAAGAAAGTGGTTCACAATTTATATAGCAGGTTATGGTAAAACATCGATCGTTCCGAAAAATTCCGCTCGTTGGCACATTTTCT 
AAATTTTTCGTGCGACAGCGGTGATTATTATATGATATTATTATGATTATATATACAAAAATTATTATAATTATATAAAAGTTGGCGACCATATC 
AAAAAGGTTGCTATAAAAACAAATGGCAATTTATAGCGGAGTTTCACTCCCAAAGCGATGTTCTCCACCCATTCCAGTCAGTCATGGTAAATATA 
CCTTCTAAAAAATATATGATTTTTTCGATAGAGATACACGGGCACACTAGACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGCGCATATC 
GTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATACACGGTTAAGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGGTTGACAC 
GTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAACAGTGCGGGCGTTTCTCAACATAAGAATACATACAAACTTATGCGTGATGGTG 
AGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAACCGTACAAACGGAAATACATAAAGGTAAAGGGCTCAACAACAGTATTACTGA 
TTAGGCAATTGATTTATATTTCAAACTTCTTAAATAAAGTGTCCACAAATATGTATGTTCTTAGCATAGAGGCGTAGTCGGTGAGTCAGCCACGC 
CAACAGTTAACAGTCGTCATTTTGCTGTCTGTCTAACACGCAATTTGTGTGTGTATATGTGAATAAACAGATACCGGTATGCAATCGCAAAGCTG 
GTCAACTGTAAAGATCGGTATGTACTCAAAATTAGGTCATCACCTTGTTATGCAGATACATTTGGGAAACAGTTGACTTACTTGGTATTGGTTAT 
CAGCCGAAAAGCAAATGGTGATACGGTTTTGCACCTCAGCTTACTCACGGATACATACCTACGTACATAAATGTATGTATACGGTATACAGTCTT 
AGAAAAAATATTAGTACCGACCAGCTAGCGTTTGCCTAAAACGCTTTTACAAAAATAAACGTATATGTTTTAAGTTGTTTTTTGTCATAATTAAT 
ACATTTATTATTATTTATTTATGAATTTCAGAGCTTTAATAGTAACTAAATTGTTTAAACAAACTTAAAAAAAGATTTACAAATAAAAAAATATT 
TCGTGTGCGTGACGGAATTGCTAGTTCATTAAAGGGAGGTCATATGTACATACATACACATTGCAAAGAGTACAGATCGCTCATCATTTCATTTT 
TCCTAATGTACATATACATATGTACAAACATGTTTATTAAACTGCCTACATGCAAATCTTTGAAGTTTTAAGAAAATAAGCACAACTAGTTTGGA 
AAATCTTGGGAGCCAAGATCTATCTGTCCACTGATTTTCCTATGACTTCAAGAATGTAAAACCCTTTTTGTTAAACAAATTTTGATTAAAGTATA 
CCAGAATAATATTAAACGTATCGTCCGTATCTATAAAAGGTATTAATAGTATTACCTAAATTTAAATTCTCGGTTTTATTCATAAGAGTGAAAAA 
ATATACAAGTTCCTTTTTAATTTTTTACTTATTGAAAATTTGATTAATTTTTATTATTATTTTCAGTTTAGAATAAATTCGGCCACAATTAATAT 
ACTTATTTATTCATCAAAACCAAACCACCACAAGAGCATAACCAATCATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGA 
CAACGCTACATCGTAAGGTAAAACAAACAATTTTCTCTTTCCGTTCTATTTTTTTTTTTTTAAGTATTTGTAGCCGATTGGTATATTTTGTGGGG 
TCATTTATTTTGATCTTGGGTGGTAAATACATTTTTATGGAAGTTAGAAGAAAGCTTATAATTTATGAGGAATAAATAAATACCTCCTTTTGAAT 
CTTTTGTTATGTCTTCTCCTATGTGCATACTTTTGGGCATTAACATTTAGGTAAATTTTTTCCACCGCAACTACAAGTAAACCGCATACACACAC 
AGTATTTTGTAGACTCGCGCCACAAAGCACCGCATAAAAAAGGTTACGGTGCTTTTTTGCTCGTTTCTTTTGTGCACTCCTTCTATGCGGTGCCA 
AATATGAGGAGGTACCTGAAGCTTGGTTGATTTTGTGAGTGTCTTTTTCCACAGCATATTTTTGTGTATTGGCTACTTTAAAAATTCTATTATAT 
CGTTGTGCTCGTAATGTTGTCCATTCCTTTTAAATATTTTCGAATTAAATATACCATAGATGTTTCCCTATTACCATAATAGTTTAATAAATATA 
CGTATATGAAATTTTTTTTATCCTATGTTAGTATAAACATGTTATTCAACAAATAAAAGTAGGTCAGAGGTAATCGCCATCCAGCTCCAGAAAGA 
GGGGGGAGCTGATAAGATTGCGTTTATGTACCTAATGGGTTGGGTAAGTGGCTTGGTTTCACAAACACCCTTATTTCAGGTACAAAATAAGTATG 
TCATAGACGGCGTAAAATTGATAAGATTTTAAAGTCAAAAGAAAATGAGTTTATTCGCCTGTAACTGCGTTAGAAATCCAAATACACCCGCAATA 
TTATATGTATAGAAAGAAAAAGTTTGATGCAACAACAAATTTAAAGTTGTTTGGTATTTTGTAATGGAAAGTTTGTGTCACTAAATATGGTTTTA 
CTTGAACGCAGATTGGACATGGTCACACTGCCCACAAAATCACGACATTGCGTTATAAATGAAGCACATCTAGTTTTGTGTTGAAAATGTGGTTC 
ACAATTTATATAGCAGGTTATGGTAAAACATCGATCGTTCCGAAAAATTCCGCTCGTTGGCACATTTCCTAAATTTTTCGTGCGACAGCGGTGAT 
TAATTACAAAAATTATTATTATAATTATATAAAAGTTGGCGACCATATCAAAAAGGTTGCTATAAAAACAAATGACAATTTATAGCGGAGTTTCA 
CTCCCAAAGCGATGTTCTCCACCCATTCCAGTCAGTCATGGAAAATATACCTTCTAAAAAATATATGATTTTTTCGATAGAGATACACGGGCACA 
CTAGACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGCGCATATCGTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATACACG 
GTTAAGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGGTTGACACGTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAACAGT 
GCGGGCGTTTCTCAACATAAGAATACATACAAACTTATGCGTGATGGTGAGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAACCG 
TACAAACGGAAATACATAAAGGTAAAGGGCTCAACAACAGTATTACTGATTAGGCAATTGATTTATATTTCAAACTTCTTAAATAAAGTGTCCAC 
AAATATGTATGTTCTTAGCATAGAGGCGTAGTCGGTGAGTCAGCCACGCCAACAGTTAACAGTCGTCATTTTGCTGTCTGTCTAACACGCAATTT 
GTGTGTGTATATGTGAATAAACAGATACCGGTATGCAATCGCAAAGCTGGTCAACTGTAAAGATCGGTATGTACTCAAAATTAGGTCATCACCTT 
GTTATGCAGATACATTTGGGAAACAGTTGACTTACTTGGTATTGGTTATCAGCCGAAAAGCAAATGGTGATACGGTTTTGCACCTCAGCTTACTC 
ACGGATACATACCTACGTACATAAATGTATGTATACGGTATACAGTCTTAGAAAAAATATTAGTACCGACCAGCTAGCGTTTGCCTAAAACGCTT 
TTACAAAAATAAACGTATATGTTTTAAGTTGTTTTTTGTCATAATTAATACATTTATTATTATTTATTTATGAATTTCAGAGCTTTAATAGTAAC 
T AAATT GTTT AAACAAACTT AAAAAAAGATT T CCAAAT AAAAAAAT ATTTCGTGTGCGTGACGGAATTGC TAGT TCAT T AAAGGGAGGTACATAT 
GTACATACATACACATTGCAAAGAGTACAGATCGCTCATCTTGCATCTCATTTTTCCTAATGTACATATACATATGTACAAACATGTTTATTAAA 
CTGCCTACATGCAAATCTTTGAAGTTTTAAGAAAATAAGCACAACTAGTTTGGAAAATCTTGGGAGCCAAGATCTATCTGTCCACTGAATTTCCT 
ATGACCAAGAATGTAAAACCCTTTTTGTTAAACAAATTTTGATTAAAGTATACCAGAATAATATTAAACGTATCGTCCGTATCTATAAAAGGTAT 
TAATAGTATTACCTAAATTTAATTTCTCGGTTTTATTCATAAGAGTGAAAAAATATACAAGTTCCTTTTTAATTTTTTACTTATTGAAAATGTGA 
TTAATTTTTATTTATTATTTTCAGTTTAGAATAAATACGGCCACAATTAGTATACATATTTATTCATCGAAACCAACCCACCACAAGAGCATAAC 
CAATTATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGACACCGCCTACAGAAGGACCTTTTTGCTTATCGTAAGGTACAA 
CAAACAATTTTCTCTTTCCATTCTATTTTATTTTTTTTTTTTCGTATATGTAGCCGATTGGTATATTTTGTGGGGTGTCATTTATTTTGATCCTG 
GGTGGTAAATACATTTTTATGGAAGTTAGAAGAAAGCTTATAATTTATTAAGAATAAATAAATACCTCCTTTTCAATCTTTTGTTATGTCTTCTC 
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CTATGTGCATACTTTTGGGCATTAACATTTAGGTAAATTTTTTCCACCGCAACTACAAGTAAACCGCATACACACACAGTATTTTGTAGACTCGC 
GCCACAAAGCACCGCATAAAAAAGGTTACGGTGCTTTTTTGCTCGTTTCTTTTGTGCACTCCTTCTATGCGGTGCCAAATATGAGGAGGTACCTG 
AAGCTTTGTTGATTTTGTGAGTGTCTTTTTCCACCGCCTGTTTTTGGCTACTTTAAAAATTCTATTATATCGTTGTGCTCGTAATGTTGTCCATT 
CCTTTTAAATATTTTCGAATTAAATATACCATAGATGTTTCCCTATTACCATAATAGTTTAATAAATATACGTATATGAAATTTTATAAGAATTT 
TTTTTATCCCATGTTAGTATAAACATGTTATTAAACAAATAAAAGTAGGTCAGAGGTAATCGCCATCCAGCTCCAGAAAGAGGGGGGAGCTGATA 
AGATTGCTTTTATGTGCTTAATGGGTTGGGTAAATGGCTTGGTTTCACAAACACCCTTATTTCAGGCACAAAATAAGTATGTCATAGACGGCGTA 
AAATTGATAAGATTTTAAAGTCAAAAG AAAAAGAATTT ATT C GCCT GT AACTGCGTT AGAAAT CC AAAT AC AC CTGC AAT AT GAAT C AAAC AAAA 
TCAACTCATAACGAGATCTGATAGCAACTTCTTAAAACTTTTTTTGTATAAAAATACAATTTATACAATTAGATAAGATTTACCAAACCTTTTTT 
CGATCAATGTAAATATAGTATGATAGTAGGGACTGCTGTCTACAGTGATGTTTATAAGAGACATACTTAAGAGCAGCATTTAAAGCTTTGGGCGC 
TTTTTGAACTTAATTTAAATTCAAACCTCAATATTTATTGAGGTTTTTATTCCCGTTAATTGTAAAGAAAAATGCTGTATTAGATTCGGTAAAAG 
TATGTAAATGGTTTAAGAAAGCGTTCCCGACTCTATAAAGTGGCAAGTAATCTGACGATTTACCAGATCTTGGGAACTGAAAAAACTAGAATGTT 
TTGGAAAACCCTTTTACAATTAAGAAATGAACCAAACAATTTTGAATTATTAAGAATCTGTAATTGCTTTTTGAAACTTTTTAATATTTTTTTCT 
TTTAGACGGCACAGCATGGCTTTCCTCATAAGCCTTCGGCTCTTGCGTATGATCCAGTTTTGAAACTTATGGCAATAGGGACGCAAACAGGGGCT 
TTAAAAGTTTTCGGTCAACCCGGAGTTGAATTGTACGGTCAGCATACTTTGTTAAACAATTCAGCATCGGAGCTTAATGTACAATTACTTGAATG 
GGTGTATGGAACTGGTCGCATACTTTCGTTGACGGCAGCGAATCAATTAATTCTATGGGAGCCAGTTGGAGCAACGTTGCTGCCAATCAAAACAC 
TACCGTTTGACGGCAAACTTAAAAAAGTTTCATCGCTGTGCTGTTCTCTCAGTAAGGATCTGCTATGGATTGGAACAGAAGGTGGAAACATCTAT 
CAACTGGATTTACATACATTTACCATTAAGGAGCCTGTAATTTACCATGACGTTGTGCTAGAGCAGGTGCCACCAGCCTACAAGCTAAATCCTGG 
TGCAATTGAGTCAATCCGCCAACTTCCAAACTCCCCTAGCAAACTTCTAGTTGCATACAATCGCGGCCTTTGTGTTTTGTGGGATTTTGAAAGCG 
CATCTGTCCAGCGAGCATACATAGCCCCTGGACATGGACAGAGCGTTGGTCTTACAGTGAACTTCGAAGGATCTGAATTTACCTGGTAGCACGCT 
GATGGTTCATACGCCACTTGGAGCATAGATAACCCAGAACCGCCGTCGAATGTTAATTATGTGCCTTATGGACCTGATCCATGCAAAAGCATAAA 
TCGACTGTACAAAGGCAAGCGAAGGTTAATATAACTTAAACACAAAGTTTTTTAGATGTGTACTAATTTCTCGTTCTTTTAGATCCAACGATGTA 
ATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTGATCACAATTGTGTGTCCGTTCACGCCAGCGATGGACACAAAGTGTGTCTTGACTT 
TACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAATAATAGAGATGTCGCTGAAGTTCTTGTTGTACTACTTGAAGAGGAACTCTGCGCTT 
ACGATCTTACTGACCCTAATATTTGTGCTATCAAAGCGCCATATCTTCACTCTGTCCATGCATCAGCTGTCACTTGCAATTACCTTGCTTCTGAA 
GTCGTACAGTCGGTATATGAAAGTATTTTAAGAGCTGGAGATGAACAAGACATTGACTATAGCAATATTAGCTGGCCTATCACTGGCGGTACTCT 
CCCGGATAACTTAGAAGAATCTGTAGAAGAGGACGCGACTAAGCTTTATGAGATTTTGTTAACTGGTCACGAAGATGGTTCTGTTAAATTTTGGG 
ACTGCACTGGAGTGTTGCTTAAACCAATTTATAATTTTAAAACTAGCAGCATTTTTGGAAGTGAGTCAGACTTCCGAGATGACGCAGCTGCAGAT 
ATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTCGGAAATCAGGACTTTTTGATCCTTATTCAGATGACCCTCGTTTAGCAGTGAAGAA 
AATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGTGGCACAGCGGGCCAAATAGTTATAGCCGACTTCATAGACTTACCCGAAAAAGTGT 
CTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGATGGATTTGTGTGGAAGGGTCACGATCAGTTAAACGTGCGATCGAACTTATTAGAC 
GGAGAAGCAATTCCTACGACGGAACGTGGTGTAAATATATCGGGAGTACTGCAAGTTTTGCCGCCAGCCAGCATAACATGCATGGCACTCGAAGC 
AAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTAGTTCTCTTTGACTTCAAAAACTTTGTTCCAGTATTTCATCGCTGCACTTTAAACC 
CAAATGATCTTACTGGAGCAGGAGAGCAGCTGTCTCGTCGAAAGTCTTTTAAGAAATCATTGAGGGAGTCATTTAGAAAGCTTCGCAAGGGTCGA 
TCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGGTTAGTAAGATAATATATACTCAACTAGTCTTTTATATTAAATTCACAATCCTACA 
GCTGGAAGCAAGACCCGTCGAGAGGCAAATAGAGGCTCGTTGTGCAGATGACGGGCTAGGATCCATGGTGCGATGTTTACTATTTGCCAAAACTT 
ATGTTACTAATGGTAATCAACATTTTGTACTGTTATTAGTTTTGATGTAACGATAAAAAAAAAAAAAGGTAACGGAAATAGTCTCTGAACCACGG 
CCTCGCGAGAATTGAACCCAAAAGCTAAAAACATACAGTGCTTTTCATTTTGGGACGCCCAGAGCTTTAATTATATTTTTGTGCAACAAATTTTT 
ATGCATTGGATATATTGATCAAAGTTTTAAAGGTATATGTATTTATGATCAGTTTAAACAGCCGACTACATTTGACTATGTCCGTCCTTAGAAAG 
AAATTCAGTCATTACAATATTCCTTGCTAGTCTATCGATATGCTATAAGTGTTAAAAGTGCTTTTGCATTTACTCTGGTTTGACAATATATTTTA 
TTGATATTGATTTCGTTTCATTTACTAGTTGGAAAACAGTTCTTAAACTAATTAAATCATAGTCTTTTTGTTTTCGCGATTTCGATCTATATGGC 
AAATTTTGTTTTTAAAATTTCACCTATCCTCCTACTATTTCTGTAATGGGTAACTCGACTGTAGCGTCTATTCTTTTGGTGTTCTGTATATGTAC 
ATCTAATTTGTTGTATAATTTTTACATTTTCAGTCAACATAACGTCGCCAACTTTGTGGTCAGCAACAAATGCCAGTACAGTCTCGGTTTTCCTT 
CTGCATTTGCCACCAGCGCAGACCGCGGCAACTGCCGTCCCGTCGGCAAGTGGCAATGCACCACCACACATGCCCCGCCGAATTTCTGCGCAGCT 
TGCTAAAGAAATACAATTAAAACATCGTGCTCCTGTGGTGGGTATTTCTATTTTTGATCAGGCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTG 
AAAACGGGAGTCCACCGCATCGTGTACTTATTGCTTCCGAGGAACAGTTCAAGGTGTTTTCACTTCCGCAACTAAAGCCGATTAACAAATATAAG 
CTTACCGCTAACGAAGGTGCTCGGATTCGCCGCATCCATTTTGGTTCGTTTAGTTGTCGCATATCCCCGGAAACACTGCAGAGTATGCACGGTTG 
TAGCCCAACTAAGTCCACGCGTTCACATGGCGATGGAGAGGCGGATCCTAATATCAGTGGAAGCTTGGCTGTAAGTCGTGGAGATGTATATAACG 
AAACAGCATTGATATGTTTAACGAATATGGGCGATATCATGGTTTTATCAGTACCTGAATTAAAAAGACAGCTGAATGCCGCAGCAGTGCGACGG 
GAAGACATTAAGTAAGTCACGCATAGTGCTATTATTTTAATTTTTTGGACCTAATACTGTATTTTAACTTTTCAGTGGAGTTTCGTCACTTTGCT 
TTACAAACTCTGGAGAAGCACTGTATATGATGTCTTCTTCTGAACTGCAGCGTATTGCTTTAGCCACGTCCAGAGTCGTGCAACCCACTGGCGTT 
GT TCCAGT AGAACC AT T AGAAAATGAAGAGTCTGT GT TGGAAGAAAATGAT GC AGAGAATAATAAGGAAACC TACGC AT GTGAT GAAGT TGT G AA 
TACATATGAAATTAAAAATCCATCAGGCATTTCAATATGCACAAGGCCTGCAGAGGAAAACGTTGGAAGAAATAGTGTTCAGCAAGTTAATGGAG 
TCAACATTTCAAATTCACCTAATCAAGCTAACGAGACTATCAGCAGCTCTATTGGCGATATTACCGTTGACTGGGTGCGCGACCATTTAAATATG 
ACGACCACCACTTTGTGTTCTATTAATACAGAGGAAACCATTGGTGAGTAATCATTTATTTACTAACATTATTCCACCTATTTTAATTTGCTGTT 
TTGTACGTATCTACATTTAAATGTTTTCGTTTCGACATTAGTTACTTTCGGCAACATAATTCTTATAAAAATTTACTTTTTAAAAACTTGATTTT 
AAGTATACGCACCAAACAACTTTAAATATATACTTTTATCTTATTTTAAAAGCACGGGCCCACATTATATCTGTACATAACTTTTTTTTTCTGTC 
ATCTCT AT GAAAATGTAGGTCGCCTATCTGT ACT TAGCACGC AAAC CAACAAAGCCAGT ACT ACCGT AAAC ATGAGTGAAATTCC AAAT ATTAAT 
ATTTCTAATTTAGAGGACTTGGAATCGAAAAGGTGTGCTTCTACAAGAAACATTGACATAAGCTTTAACCTACTAATTTTTTTTTTTTACCTAAC 
AGAAATACGACGGAAACGAGTACTAGTTCTGTTGTAATTAAATCTATAATTACAAACATTTCTCATGAAAAAACGAACGGAGACAACAAAATAGG 
AACGCCAAAAACAGCGCCTGAAGAAAGCCAATTTTAACATTGACAGAAGCCGTAACCTACTAATTATTTTATACCTTTACAGAAATACGACGCAA 
ACGAGTACTAGTCCTGATGTAATTTAATCTATAATTACAAACATGTCTCACTAAAAATCGAACGGGGACAGCAAAATAGGAACGCAAAAAAATTA 
AATTAAATTAAACGCGATTTACATACAAACAGAAATGACAGAATGATAATATAAAATATTCATTTTTTATTTGGCTTAAGCGATGTTGTTGTTCC 
AAAACCATATAATTATTTATGATTTTATGTAATGTTTTCATGTATTTTCGCGTAGTGACTTTATACCCTTTACCCGTATGATGAAACGAAACATG 
ATATTTGTAGAAAGGTACAAAGGATAAAAGTTGACATTTAAGGCCGATTGTAAAATATTAAAACGCAACCTGGATAACATGCCGATTAATTACTT 
CTGTTCGTCCGTATAAACACAAATACCGATAAATTGGTCCGTCCCCATTTAATAAATTGTATATTAAAATGGTCTTTATTTTGGATAAATAATTC 
AAATTATATAGGAACATTTGGGTTTTAAGGGATAGAAACAGCGCTGGTGTCTTTCATTTCCGTTCGTATGACCGTTCAAAGATTATZWVAAAATT 
GTCGTGCCCACATTTTTTZy\ATTTCTTTTATTTTCGGTTAAAATATTGTGATACAAAAAAATGCCCTTACAAAATTAGGACAACAACAATTTGCT 
TTATTATGATGAACAAATTAGGAGTACAAAAAATAGCCTGATGCAAGGATGGGGAGAAAAAAGACCTTGAAGTTCAAAAGAATTTGTTGTATCAA 
ATCCTCAATAAGAAAACTACTTAAAATTAAAAGCGTGTTTCAAATTAATTGTAGAATATTTCGTAAAATGAAATTGCAAACTTTTTTAGCACGAA 
ATTACATGTATTTAATTCTTTGGCTAATACGTCAGTTAAAAAAGGCTACATTTTGATGATACTCACAAATGTAATTTTTGCATTCTACACAGGTG 
CAATTAAAAAATTATGGTTTTTATGAGTTTTAGAGGGAAGAAAATATGTCAATATGAGAAGTAAGTTAATGAGCCTTCAAGTACTTTAGTACATA 
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ATGTAATTTAATTTTAGAAATCCTACGGCTTTATATTCAATTGTAAAGTATTTTAAAATTCCTGACGATTTGTAGTAATTGTTTTGTTCTTAGCT 
TTTTTGCATTCTCCTCCGTTACTCCGTTACTCGTATAGTAAAAGGTATAGTATATACAATAGTATTGTAGATTCGTTGAGAAGTATGTTACAGGC 
AGAAGGAAGCTTTTCCGACCATATAAAGTATATAATTGAGTATTTAGCTTAGGATTATTAATAAAACATGTATTTTATTTAACAAACTGGCTTTA 
CAATATTTTGTGGAAGAACACCGATCTATTTAGTTGTAGGTCTCAAACTGAAAACTCCTAATTTTGACTTATTTAAAGATCATACCTCGGTTTTG 
AGCCTTAGAGGTTTGTATGAGTGATTTTCTAGAATGATCTTTGTACTTCGTTACAATAAATTGTTTTCTTCTACTCTAATATAGTTGGAATCGAG 
TAATCCCTAGATAAGAGTTAGGCTTAAAATAGTAAATTACAAATATTGAATTGGACATTACAATTGCATATTCTCAGCTTGCTGTATTGTAAAAA 
TTAATTATTAACTCGAAGAGATTCGGCAATTAATACGTCCCTATTCGCATTGTTTTTACTTGGGGAAATTTTATTGAAAAACTAGATTCTCTGTT 
ACAATCAGATATGAGAACAAGTTATTGCACCAGCACATCTCTCGGCCAGCTCAATTCAATTAAATAGCTAAGCACTTGTTGCGAATTCATGCTTG 
GAGTGTTTACGTAAGATTTGTAAAAATAACTAAACTGGATTCTCCACATTCTTGCGTAGCTGTGTGATTCTACTGTCACAACTCTCCGTCGACGA 
TTTCTAGAGCCATCTTCTTGCGGAAGTACAACATTTCCGACTGGGTAAAGGTAATTGGCACATCACACGTTATATACTCGGCGAACATGCAGCTG 
AAGATACCGCAATCGCTGCCATCTAACTGTCGTGGTATATTCTGCACGCTCTCAATAACAAAATCGCTGGTATCAAACTGCTTTTTGGGCTTGAA 
TATTGACTCTTCGCGTAGATATTTCTCTAGAGCGTCCAGCACTGGTCGGTTTGGCTTTCCCTTTGAGTCATAATACCGGATTGTCTTGTTCCGCA 
AGTGTATGATGGCCATGCACCAGTGGACGCCGTTGCAGTGCACTGGTACCGGGATTATGTCCTTGCTGAACAAGTCCACTTTGCGAGTCCAGCGC 
TTAATGCCTGCATGCCCAGCTTGCAGGAGGCGGGG 
(SEQ ID NO: 250) 

Exon: 1001.. 1302 
Exon: 5155.. 5314 
Exon: 6656.. 7434 
Exon: 7493.. 8684 
Exon: 8742.. 8847 
Exon: 9439.. 10081 
Exon: 10146.. 10588 
Exon: 10849.. 10957 
Exon: 11023.. 12525 
Start ATG: 5231 

Transcript No. : CT9059 

ACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGCGCATATCGTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATACACGGTTA 
AGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGGTTGACACGTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAACAGTGCGG 
GCGTTTCTCAACATAAGAATACATACAAACTTATGCGTGATGGTGAGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAACCGTACA 
AACGGAAATACATAAAGTTTAGAATAAATACGGCCACAATTAGTATACATATTTATTCATCGAAACCAACCCACCACAAGAGCATAACCAATTAT 
GTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGACACCGCCTACAGAAGGACCTTTTTGCTTATCGTAAGACGGCACAGCATG 
GCTTTCCTCATAAGCCTTCGGCTCTTGCGTATGATCCAGTTTTGAAACTTATGGCAATAGGGACGCAAACAGGGGCTTTAAAAGTTTTCGGTCAA 
CCCGGAGTTGAATTGTACGGTCAGCATACTTTGTTAAACAATTCAGCATCGGAGCTTAATGTACAATTACTTGAATGGGTGTATGGAACTGGTCG 
CATACTTTCGTTGACGGCAGCGAATCAATTAATTCTATGGGAGCCAGTTGGAGCAACGTTGCTGCCAATCAAAACACTACCGTTTGACGGCAAAC 
TTAAAAAAGTTTCATCGCTGTGCTGTTCTCTCAGTAAGGATCTGCTATGGATTGGAACAGAAGGTGGAAACATCTATCAACTGGATTTACATACA 
TTTACCATTAAGGAGCCTGTAATTTACCATGACGTTGTGCTAGAGCAGGTGCCACCAGCCTACAAGCTAAATCCTGGTGCAATTGAGTCAATCCG 
CCAACTTCCAAACTCCCCTAGCAAACTTCTAGTTGCATACAATCGCGGCCTTTGTGTTTTGTGGGATTTTGAAAGCGCATCTGTCCAGCGAGCAT 
ACATAGCCCCTGGACATGGACAGAGCGTTGGTCTTACAGTGAACTTCGAAGGATCTGAATTTACCTGGTACCACGCTGATGGTTCATACGCCACT 
TGGAGCATAGATAACCCAGAACCGCCGTCGAATGTTAATTATGTGCCTTATGGACCTGATCCATGCAAAAGCATAAATCGACTGTACAAAGGCAA 
GCGAAGATCCAACGATGTAATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTGATCACAATTGTGTGTCCGTTCACGCCAGCGATGGAC 
ACAAAGTGTGTCTTGACTTTACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAATAATAGAGATGTCGCTGAAGTTCTTGTTGTACTACTT 
GAAGAGGAACTCTGCGCTTACGATCTTACTGACCCTAATATTTGTGCTATCAAAGCGCCATATCTTCACTCTGTCCATGCATCAGCTGTCACTTG 
CAATTACCTTGCTTCTGAAGTCGTACAGTCGGTATATGAAAGTATTTTAAGAGCTGGAGATGAACAAGACATTGACTATAGCAATATTAGCTGGC 
CTATCACTGGC GGT AC T C T C C C GG AT AAC TT AG AAGAATC T GT AGAAGAG GAC GC G AC T AAGCT T T AT GAG AT T T T GT T AAC T GGT C AC G AAG AT 
GGTTCTGTTAAATTTTGGGACTGCACTGGAGTGTTGCTTAAACCAATTTATAATTTTAAAACTAGCAGCATTTTTGGAAGTGAGTCAGACTTCCG 
AGATGACGCAGCTGCAGATATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTCGGAAATCAGGACTTTTTGATCCTTATTCAGATGACC 
CTCGTTTAGCAGTGAAGAAAATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGTGGCACAGCGGGCCAAATAGTTATAGCCGACTTCATA 
GACTTACCCGAAAAAGTGTCTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGATGGATTTGTGTGGAAGGGTCACGATCAGTTAAACGT 
GCGATCGAACTTATTAGACGGAGAAGCAATTCCTACGACGGAACGTGGTGTAAATATATCGGGAGTACTGCAAGTTTTGCCGCCAGCCAGCATAA 
CATGCATGGCACTCGAAGCAAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTAGTTCTCTTTGACTTCAAAAACTTTGTTCCAGTATTT 
CATCGCTGCACTTTAAACCCAAATGATCTTACTGGAGCAGGAGAGCAGCTGTCTCGTCGAAAGTCTTTTAAGAAATCATTGAGGGAGTCATTTAG 
AAAGCTTCGCAAGGGTCGATCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGCTGGAAGCAAGACCCGTCGAGAGGCAAATAGAGGCTC 
GTTGTGCAGATGACGGGCTAGGATCCATGGTGCGATGTTTACTATTTGCCAAAACTTATGTTACTAATGTCAACATAACGTCGCCAACTTTGTGG 
TCAGCAACAAATGCCAGTACAGTCTCGGTTTTCCTTCTGCATTTGCCACCAGCGCAGACCGCGGCAACTGCCGTCCCGTCGGCAAGTGGCAATGC 
ACCACCACACATGCCCCGCCGAATTTCTGCGCAGCTTGCTAAAGAAATACAATTAAAACATCGTGCTCCTGTGGTGGGTATTTCTATTTTTGATC 
AGGCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTGAAAACGGGAGTCCACCGCATCGTGTACTTATTGCTTCCGAGGAACAGTTCAAGGTGTTT 
TCACTTCCGCAACTAAAGCCGATTAACAAATATAAGCTTACCGCTAACGAAGGTGCTCGGATTCGCCGCATCCATTTTGGTTCGTTTAGTTGTCG 
CATATCCCCGGAAACACTGCAGAGTATGCACGGTTGTAGCCCAACTAAGTCCACGCGTTCACATGGCGATGGAGAGGCGGATCCTAATATCAGTG 
GAAGCTTGGCTGTAAGTCGTGGAGATGTATATAACGAAACAGCATTGATATGTTTAACGAATATGGGCGATATCATGGTTTTATCAGTACCTGAA 
TTAAAAAGACAGCTGAATGCCGCAGCAGTGCGACGGGAAGACATTAATGGAGTTTCGTCACTTTGCTTTACAAACTCTGGAGAAGCACTGTATAT 
GATGTCTTCTTCTGAACTGCAGCGTATTGCTTTAGCCACGTCCAGAGTCGTGCAACCCACTGGCGTTGTTCCAGTAGAACCATTAGAAAATGAAG 
AGTCTGTGTTGGAAGAAAATGATGCAGAGAATAATAAGGAAACCTACGCATGTGATGAAGTTGTGAATACATATGAAATTAAAAATCCATCAGGC 
ATTTCAATATGCACAAGGCCTGCAGAGGAAAACGTTGGAAGAAATAGTGTTCAGCAAGTTAATGGAGTCAACATTTCAAATTCACCTAATCAAGC 
TAACGAGACTATCAGCAGCTCTATTGGCGATATTACCGTTGACTCGGTGCGCGACCATTTAAATATGACGACCACCACTTTGTGTTCTATTAATA 
CAGAGGAAACCATTGGTCGCCTATCTGTACTTAGCACGCAAACCAACAAAGCCAGTACTACCGTAAACATGAGTGAAATTCCAAATATTAATATT 
TCTAATTTAGAGGACTTGGAATCGAAAAGAAATACGACGGAAACGAGTACTAGTTCTGTTGTAATTAAATCTATAATTACAAACATTTCTCATGA 
AAAAACGAACGGAGACAACAAAATAGGAACGCCAAAAACAGCGCCTGAAGAAAGCCAATTTTAACATTGACAGAAGCCGTAACCTACTAATTATT 
TTATACCTTTACAGAAATACGACGCAAACGAGTACTAGTCCTGATGTAATTTAATCTATAATTACAAACATGTCTCACTAAAAATCGAACGGGGA 
CAGCAAAATAGGAACGCAAAAAAATTAAATTAAATTAAACGCGATTTACATACAAACAGAAATGACAGAATGATAATATAAAATATTCATTTTTT 
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ATTTGGCTTAAGCGATGTTGTTGTTCCAAAACCATATAATTATTTATGATTTTATGTAATGTTTTCATGTATTTTCGCGTAGTGACTTTATACCC 
TTTACCCGTATGATGAAACGAAACATGATATTTGTAGAAAGGTACAAAGGATAAAAGTTGACATTTAAGGCCGATTGTAAAATATTAAAACGCAA 
CCTGGATAACATGCCGATTAATTACTTCTGTTCGTCCGTATAAACACAAATACCGATAAATTGGTCCGTCCCCATTTAATAAATTGTATATTAAA 
ATGGTCTTTATTTTGGATAAATAATTCAAATTATATAGGAACATTTGGGTTTTAAGGGATAGAAACAGCGCTGGTGTCTTTCATTTCCGTTCGTA 
TGACCGTTCAAAGATTATAAAAAAATTGTCGTGCCCACATTTTTTAAATTTCTTTTATTTTCGGTTAAAATATTGTGATACAAAAAAATGCCCTT 
ACAAAATTAGGACAACAACAATTTGCTTTATTATGATGAACAAATTAGGAGTACAAAAAATAGCCTGATGCAAGGATGGGGAGAAAAAAGACCTT 
GAAGTTCAAAAGAATTTGTTGTATCAAATCCTCAATAAGAAAACTACTTAAAATTAAAAGCGTGTTTCAAATTAATTGTAGAATATTTCGTAAAA 
TGAAATTGCAAACTTTTTTAGCACGAAATTACATGTATTTAATTCTTTGGCTAATACGTCAGTTAAAAAAGGCTACATTTTGATGATACTCACAA 
ATGTAATTTTTGCATTCTACACAGGTGCAATTAAAAAATTATGGTTTTTATGAGTTTTAGAGGGAAGAAAATATGTCAATATGAGAAGTAAGTTA 
ATGAGCCTTCAAGTACTTTAGTACATAATGTAATTTAATTTTAGAAATCCTACGGCTTTATATTCAATTGTAAAGTATTTTAAAATTCCTGACGA 
TTTGTAGTAATTGTTTTGTTCTTAGCTTTTTTGCATTCTCCTCCGTTACTCCGTTACTCGTATAGTAAAAGGTATAGTATATACAATAGTATTGT 
AGATTCGTTGAGAAGTATGTTACAGGCAGAAGGAAGCTTTTCCGACCATATAAAGTATATAATTGAGTATTTAGCTTAGGATTATTAATAAAACA 
TGTATTTTATTT 
(SEQ ID NO: 251) 

Start ATG: 379 

MLKFIRGKGQQPSADRHRLQKDLFAYRKTAQHGFPHKPSALAYDPVLKLMAIGTQTGALKVFGQPGVELYGQHTLLNNSASELNVQLLEWVYGTG 

RILSLTAANQLILWEPVGATLLPIKTLPFDGKLKKVSSLCCSLSKDLLWIGTEGGNIYQLDLHTFTIKEPVIYHDVVLEQVPPAYKLNPGAIESI 

RQLPNSPSKLLVAYNRGLCVLWDFESASVQRAYIAPGHGQSVGLTVNFEGSEFTWYHADGSYATWSIDNPEPPSNVNYVPYGPDPCKSINRLYKG 

KRRSNDVIVFSGGMPRSAYGDHNCVSVHASDGHKVCLDFTSKVIDFFVTFENNRDVAEVLVVLLEEELCAYDLTDPNICAIKAPYLHSVHASAVT 

CNYLASEWQSVYESILRAGDEQDIDYSNISWPITGGTLPDWLEESVEEDATKLYEILLTGHEDGSVKFWDCTGVLLKPIYNFKTSSIFGSESDF 

RDDAAADMSAEQVDEGEPPFRKSGLFDPYSDDPRLAVKKIAFCPKTGQLIVGGTAGQIVIADFIDLPEKVSLKYISMNLVSDRDGFVWKGHDQLN 

VRSNLLDGEAIPTTERGVNISGVLQVLPPASITCMALEASWGLVSGGTAHGLVLFDFKNFVPVFHRCTLNPNDLTGAGEQLSRRKSFKKSLRESF 

RKLRKGRSTRTNQSNQVPTTLEARPVERQIEARCADDGLGSMVRCLLFAKTYVTNVNITSPTLWSATNASTVSVFLLHLPPAQTAATAVPSASGN 

APPHMPRRI S AQLAKEI QLKHRAPVVGI S I FDQAGS PVDQLNAGENGS PPHRVL I AS EEQFKVFSLPQLKP I MK YKLTANEGARI RRIHFGS FS C 

RISPETLQSMHGCSPTKSTRSHGDGEADPNISGSLAVSRGDVYNETALICLTNMGDIMVLSVPELKRQLNAAAVRREDINGVSSLCFTNSGEALY 

MMSSSELQRIALATSRVVQPTGVVPVEPLENEESVLEENDAENKKETYACDEVVNTYEIKNPSGISICTRPAEENVGRNSVQQVWGVNISNSPNQ 

ANETISSSIGDITVDSVRDHLNMTTTTLCSINTEETIGRLSVLSTQTNKASTTVNMSEIPNINISNLEDLESKRNTTETSTSSVVIKSIITNISH 

EKTNGDNKI GTPKTAPEESQF* 

{SEQ ID NO: 252) 

Classification: tumor _suppr ess or 
Gene Symbol: l{2)gl 
FlyBase ID: FBgn0002121 



Celera Sequence No. : 142000013384907 

ACGAATCTCCAGAGGCCGGTCCTTTCCAGCCAGATTCTCGCCTCCAGCGCTCAACGAGGTGTACCGAAATTTCTCCAGCGCATCGCTTGCATTCG 
AAATAAGCTCGCGCACAAAGACCTCGTGGTCGGAGTATAAGGAACGAGCCACAATGTCAAGTAGCTGGCGCGTCTCTGCCTGGAACTCATGCTTG 
TCCACCACTGATCCTGATGCCTGCTTGGTCTCCGTGGAGTAGCGACGGAGCGAGAGAGCTGCAAGTGAAAAGATGAGTGTTATGTAAACATAAGA 
ATCCCTCCGGGGAGCCGGTTGCTTATATTCAGTGCTTACCACCCGGCGAATGCTGCGATCCAATGGTGATATTGAACGCTGCTGGGGCGGATCGG 
TGACTCTGCCTCAACACTTGGGCACAGCGACCGAGGCGGCACGCCTGGCCCAGCACTCCCATCGCTCGTACAGACATCTTGTTGTTTTAATGTTT 
TGCCCAACTTTAATGTTACCCAACTACGAAGTTTGTTCCAGCGCACATGTGTGCACAGTGTAGCCAAATCAATCGATTGTAAGTATTCGATAATA 
GTAAACTGGTAAAACCCTACTCATAACGGTGTTTTCTTCCAAATTTCTGGGCTGTAATGAATTTTGGACTCAAATGCTTAGTTGGTTTGAATAAG 
AGTTACTACATGTAATGGAGAGTTCAGTTTCATCACATTATTCAGCTCAAAACTGAAGCAAACTGCCAGCTGACTAGTTCTCTAAAGATTTCGAT 
CAGCAAGAGAGAGAGAGAGAGATAGAGATGTTTCAACTGTGAACAACAAACCGTAAATTCTGAAGGAATTAAGAACAGATTACGTTTCATTGTTT 
AGCCAATGTAGATTGGATGCTGATTTTTGAGTTATGAAATAACAAATGTCTTCAAAGAAGACAGTAAATAAAAGAAAGTTAAATCAAATCCAATA 
TTGAAAGACATCGATACCTCATATCGATATACGGTCCAATTGCTGACAGCCCTGCTCGTGAATGCAACTCTGCTGTAGCGCGTCAGCGTCGCGTT 
GTAATTTTTGTTGAGTTTTTTAAATAATTTTAATTAGCATCTGCGTGTCTTACTATTAGAATATCATGTGTTAAGTGTTTCCCATTCACGCCACC 
ACACAAAAAGATCCAATCATGAGCGCCACCACCATCGACAACGTACGGGTGCTGCTTCAGTCGCTGCCTGCTGCCGCCGTAAACAGCGCCACTCA 
GCCGGGCGATCTACAGACGCCGGGCAAGATTGCGGCCCCAGCGACTCCCCTTCGCGCCAAGAATCCCGCCCAATTACAGATCATGCCGGAGAAGG 
TCGAGGAGATGCCGCCGTCACCTCCGGAGGAGTTCTGGAGGGACCTGCAGCAGTTTCACGAACGACGCGGGTGAGTTGCCGCGGCTCTCCTGTAT 
TTGGAGACGACCTAACCCACTTTCCCTCTGCCCAAACAGCACTCCATTGACACAGCCCGCCAGGATCAGCGGTAAGCATGTCGACCTGTACAAGC 
TGTACAACGAGGTAACCGAACGAGGCGGCTTTAATAAGGTGACCATGCGGGACGAGTGGGACGAGGTGTACAGCGCCATGGAGACATTGCGGGAG 
CGCTGCGTTAACGGAACGGCTAGTATCAAGCACATCTACAGACGCTACCTGGACAAGTACGAACGGTTGAACTTCTTTGGCGAGGATCCAGACAA 
GATGGAGGCTCTGGAGGCGGCTATTGAGATTGTAGAAATGGGCGGTGGACGCTCGCGATCTCGTTTTTTGGCTGGCGCCGGTGGTGGCCTAGGAA 
GCATTTTTGGATCAGCACACTCTACGTTGCCGGCGGTGCCAATGGCCTACAACCAGCGCCAGCACTCTGTTAATGTGGAACGACGCCGACAGTAC 
AAGATGTCCACGCAGTTGCACCGACACAGCAGCTACGAGAAGCTGCTGTTGTCTCTGCTTTCTCCGCTGCCCAATGAGCAGGACTTCGCCATAAA 
CGTGTGCACACTTATGGCCAACGAGGCGCGCCACACCCTACAGCTGTCCGAGTGCCCCAAACTGCTTGATGTGCTGCTTGCTCACCTGGGAGTCT 
ACGCTGACTTCACCATGCGCCAGCTGTTCCAGCACAGTTACACCGAGGTTCGACACCATTCGCAGCTCAGTTTCTGGCGCGACCTGCTACACGAC 
AAGCCACAGGTCCTAGAGCTCTACACGGATGAGCAGGCCTGGCTGGACAACGGTCTGATAAGTAAGGAGGACGGTGAGAACTCTGCCGCTCGAAG 
TGAACTGCTGGCGGCCTGCGACGAGTTGCACTTCCTTAATCTGCGCCGGAGCAACGGGACTGATGAGCGCATGGGCCAGCGAGTGCTTCAGATTG 
TCCAGCTAGTCCGTACACTCAGCTTTCATCAGGAGAATCATGCACTCTTGGCCTCCAACCGCACGTTGTGGCGATATTTGGTCATGGGAGCAAAC 
GTACGATGGAGTAAGTTTTTTTTATCTGCTTAGAAATTATATTTTTTGGCGCTTTGAATGTACGAAGTATCAAGGTGCTATCATAAATACACTGC 
AAGGAAATTGACTATTGCGTAGCATAGCAATCCCGAAAGAATCAATTTAAAATAAAATAACTGAAACACGTTTCGCTCGTTCTAGGCAACATTCA 
CATCCAGGCCCTCGAGACCGCTGGCAACCTGGCTCAACAATTTGAGCTGCTAGACCCCACCACTGACGAGCTATCGCGGAATCTTTTAGCTACTC 
TCTGCGAGGGTATTGACTCTAACGACCGCGCGGTGATCATGAGCTGTCTGGAGATACTCTATAAGTTGTGCGGACGAGAAGGCAATTCGCAGCAC 
ATCAATCGTTGCCTTGGCCTGGACTTTTACCAGCGAGCCATGTTGCTCCTCTCGCTTACCGACGTCATGATGATTATCTTTACGCTGGAGGCAGT 
TTACGCACTAAGTGCTCTTGGCGCTCGGCCATGTTCGATGCTGATGCAAGTACGTGGCCTAATAGACCAGCTGGTGACCCTTATCACAGTGGAGG 
CTCAGTCATATGGCACAGGCGGTTGCATATTAATGCGTGTCGTAGAGGTTTTCCCCGGAAACATGGCTGCCTCGCTGGCTCAGAACATGCCCGCC 
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ATTCTTGCTCCAGGTCCCACTTCCCTGACTACACTTCCAACATCGCTGCAAGTAGCTGTTAAACCGCCAGCAGTTTTACCCACGCCTACACCCGC 
CCAGACGCAAGTGCTGCCTCACGGAGAGCAAACACCCTTGGTTACAGCTCTCCCCGGTCTGCCAATGCCCAGTCTGCCACAGGTTCAATTGCCTG 
TACCCAGTCTGCCGCAGGTACAATTGCCACCAATCGCTGCCACCGTGACCCTTCCCAGTCTGCCCTCCGCCTCCCAGAGTTTTACACACGAGGAC 
GAACAGTATGCGCTGGCTTGGCTGGGAGCCACGTATGAACGGGCTGGGAACGGTAACGATCTCCGTGTGGAACAAGCGGAACTATACAGAATATA 
TTTATCGCATTGCCAGAAGGCGGGTAAGCTTTCAGTGGTTAATCACATGCAGTTTCCTCGTCTAGTGCGGCTTATATTTAATCAATCGGTGGGAC 
CAGTAATTGTACGGCACCTGGACGGAATAGAGCTACCTGGGACGTACTACGTAGGCATTAGGATGCGAGCTCAGCCGCTAGCCATGCAACAGAGA 
CCCACCCCCAGCATTGTTCCTGTTAAAAAGGACACGCCAATATTGCCCAAGCAACCATTTAGTGATCCTGTTCAAACGGAAGTGGCGTCTGAGGA 
AAGCGTTTCTTCTGCAGTCACGCCACCCCCAAGCTCCTCGCTCATCAAAAGCTTGTTGGCCAACAAGGTAACTGAACGCCAACAGAAGCAGAAGG 
CCCAAGCTCAGACGCCGCAGCCGCCAGCCCTTGTGGCTACCACGGCATCGGGCACAAATGCTACCCAGCCCATCAAGGTAACCTCAACAGCTATA 
AGCGCCTTCGTGAACAACCCGCTAATGCAGCACACACCGGTGAAGGTGGGTCAGACTACCATTAAGCCGTTGCACCCCCAGATGGCCATCGAAAA 
GAAGCCAATCACTGATTCGGCCCCGCCACCTCTCGCTCCATTGAGTGGTGCAAACGTAGTGACGAAAGACGCCAGCGGGCGCACTCTCATCATTG 
CATCAGCAGCTGCCAAACGAAAATTAACCATTGACGAGGAGCAGAATAAGCGTCTGGCTTTAGATGCCGCTGCCAGCAGTTCCAGCAGCTCATCT 
TCTGGCGGCAAGGAAGAAGCGCAGGTGACTCCCTCGAAGAATGCAGCCAATCTCTATGCGGATCTGGCGGCATCTATTTTGGAGGATGAGGATAT 
GGATGACGTTCCACCGCTGGCCAAACCAAGCCAGGAGCAAAGCCAGCAGCACCCGCAACAGGCACAGCAGCTTCAACTCATTCCGGCTAAAGTGC 
AACCAACCCAACGTCAACTTGTTTTCCCCGGTAGCAATGTGGTTTCTCCAGCTCCGCATTCCCAACTAAAACTTGCCACCACGGCCACCATAAAG 
ACGGATCAGGGCTTGCAAACTGTGCCAGTGATCCTGCAGCCCAAGACTGCAATGGATTCCACGCCAGCTCCGCAGACGCAATATGTTCTGGCCAC 
CAACCAGCAAGGCCAGACGTATCTTGTGGCCCAGCAGCCGACTATTCCGACATCCACGCAGTCGGCACCACCCACTCTGCTGGTGACGCAAACAC 
CACAACAACAAACGAAGACCATAATAATCCTGCAGCAGCCAGGAGGAGTCTCTGCGGCGAACGTGGCTGGCACTCAGAAAATGATAATGACAACG 
GCGCAGGGCCAGCAGGTCCTGGTTACCACCACCACAGCTCCACAACTGGCCCAACCGCGTACTTCTACCCCCCAGCAGCAAATCTTTATCGCACC 
CCAGCGACCAGTTCCGGCGTTGGGTGCTGGCCAAATGTCGCCTTCACTGCTGTCGCAGTTGAACCAGATCCCGGCTACTATAAAGCTTCATCAGC 
CCCAACCACAGCCGCAGCTCGTTTCGCCGCAATTAGCACCCCAGCAGCGATTAGTTGCCGCAAAGCCGGGCGCTTCATTGCCCATTCCTCAGCTC 
CAGCAACATCAGTCTATCATCCAGCAACACATAATCTCGGGCCCCTCCACTCCTACTTCCACAGTTGGCGGTACGGTTGGTGGGGAAAAGAGGCA 
GGTAATTCTGGGCGGAATCAAGGAGACCACGCTAATTACCCAGACGTCGGCCACACCGGCTCCCCTGCAGCAAAGCCAGCTTAACTCGCAGACGA 
TCATCACTCAGTCACCTACCAGCACGGCTATCGTAGGAGGCGTGGCTCTTCAGCAGCAACATCACATTCGATCTGTGTTGGACCAGCAGCAGCAG 
CATCATCCATCCATCATCCAACAGAAGATAACCATGCCGGCAGTAAGTTGGGAACTGGGGCCGAAATTAAAGAATTATTATTTTTTTTTAAGTAT 
ATATACAAGGGTCATTGAAAAGTATGTAAACAGCTTAAAATTTGATTTCATAATATGCCCATGCCCGTTCTAAATCCCATAAATCGCATCACATT 
CACATTCCCCATTTTTAAATGTTTCCATCGCGTTTTCACTACTTGAGTGGCGGGTTTTACAGGATCTCTTTTCTAAATAGCCAACAGGTGTATGG 
TAATAAGAATTGTGCTAAGATGACTAAAACAAATATTTTAATATATTATTTTAGATTTCCGAAGCGACCAAACCGAAAATAGCACCCGTTCCAGT 
TTCACCAACTGTAACTCTCCTAGCAAAAAAGCCGGCGCCGCCACAACCTGCTCAAGTATCTAGCCCCAAGCCATCAGTCACCCAGGCTGTTGTGG 
TAACGACGTCAGGAACTAGTGATTCAACGGAAGGCGGCAAAACAGTTGTCTTAGCAACCAGCGAGGCCAAGCCAGCGGAATCAGAGGTAGTAGCA 
CAGACGGTTGTCCCACCTGCTGTCGTAACTACAGGTCCCAGCAGCAGTTCTTTGGTTGTCAGTCAGAGCGTCTACGCTGCTTCTGGCCCGAATGC 
CAACCCCACGCCCTCTCCCATGCCCGTGGATGCAAATTGGTTGTACATCTGCGACTGGCGGAACTGTCCCCGCAAGAAATTCAAGTCTCTTAACG 
AGCTGCAGTACCACGTGTGCAGTGTCCACTGTCCAGATCACCTTGATTCTGATGCCGACATTTATTGTCAGTGGGGCAGCGGACCCAACTTCTGC 
GACAACAGAGCGCGCAAACGGTACTCATTGATGACGCATCTGATTGACCGGCACCTGACAACGGAGAACTTGCGCGCCAGTGTGCAACGTCGCCT 
GGCAACTGGGATTCACAATGTGGCGCCCACCCAGGCTCCCGTGACCATAGTGCGCAACGAGGGTCATGCCCAACGATTGGCGGGAGGCGCTACTG 
GTTCACCTTCGGGTCCTGCTTCAGCGGTTCCAGTCGTTGGCAGCGCCGCAATGCAAGCTATGAACCGACACACGACTGATTACACGAACGCTAAA 
GAGCTGATGGACGAAAATGAGGGTCCAGTCACCAAGAGCATCCGTCTGACAGCGGCGCTCATTATACGCAACCTGGTCACCTACTCGGCTACGGC 
CAAGAGAAGCCTGAAACGCTATGAGTCACGCTTGGCCAATGTCGCCTTTAGCAATGTGGAGGCTAGCGGAGTGCTCTCCCACATCATGTACGAAT 
TGAGCCAGTAAGCCGATATAGCTTACCATCAATATGATTATTTAACTGACTTGTAAGAACGACACTAAAAAAATGTGAAAAGCAAACCATACCTT 
GTTTGTCTAAATGATTCATACCATATAGCACCAAAATATGGGAGATTCGGGAGTAAGATTTTAATTTGACTTCTACAATTCTACGAGGCTTAGAA 
TTAAAGAGCTATGGTGGACATTATCAGATATACTCCTGGTTAGCAAATAAACGTTTTGAATTAATCGTTCAAAACATACTTTGATTTACATTGTT 
CTTATAAAATATTAATTAACACATCTGAGTTATAATCCGTTAAAAGAATCCGAACCGAAGAGTCCGCTAAATCTTCGTGTACTACAGTGAAACAC 
T T T T TAT C GC C T AACGT AAG AT AT T AG AAT GG G AT G AAATC AGG T T CAT CAT AC AAT AG CTTCTCCGTTCTT C AAAAAGT T T T C AT C GT C C AA AA 
TCGTATACTTGTTGCTCCTGTACGTGTCCAATTCAAAAATGTACTTCTTGCCGCAGTCCTTCATTAGCTTGTGGATAACTGGCTTGATGTTCTCG 
ATGCCCGCGATACTGGCCAGTTCATTGTAGTAGACAAACTGAAATTACAGATAAAAGATTGTTTTATATATGTTCTCTGCTTGTAAAAGTCTTTA 
ACAAATAAGTCTACATAAGCTAATATTTTTAATTTCTTATACAGGGTATTGTGAACACTCACCTGTCGCTCGCCCATCTGATGTGCTTTCTTCTG 
GTTGCGCACGCCGCAGCCCCACCGCTCTCCGATCTCCTGCTCCAGATCGGCAATCATCTGCTCCCGGGAGGGAAACTTCCGCTGACCTGTGAAGA 
ACTTCAGCGTGAAGCGCACCTGCATGTCGAAGATGTGGGTGGGAATCACGTTGAAGGGCAGGCCCACAAAGGCCATGGTCGGGTGGTTGATGTTG 
ATGCAGTGCTTCCACAGTGGCTGCACGAAGTTGTCTATCACCTGGACACCCACATCGGTGCTGAGGCAGGGGAATGTGTACTTGTAGCCGGTGCA 
GAACATCACGTGGTCAAAGCTCTCCGTGCTGCCGTCCGTGA 
(SEQ ID NO: 253) 

Exon: 1001 . . 1400 

Exon: 1465.. 2480 

Exon: 2651.. 5457 

Exon: 5755.. 6831 

Start ATG: 1159 

Transcript No. : CT9107 

CCTGCTCGTGAATGCAACTCTGCTGTAGCGCGTCAGCGTCGCGTTGTAATTTTTGTTGAGTTTTTTAAATAATTTTAATTAGCATCTGCGTGTCT 
TACTATTAGAATATCATGTGTTAAGTGTTTCCCATTCACGCCACCACACAAAAAGATCCAATCATGAGCGCCACCACCATCGACAACGTACGGGT 
GCTGCTTCAGTCGCTGCCTGCTGCCGCCGTAAACAGCGCCACTCAGCCGGGCGATCTACAGACGCCGGGCAAGATTGCGGCCCCAGCGACTCCCC 
TTCGCGCCAAGAATCCCGCCCAATTACAGATCATGCCGGAGAAGGTCGAGGAGATGCCGCCGTCACCTCCGGAGGAGTTCTGGAGGGACCTGCAG 
CAGTTTCACGAACGACGCGGCACTCCATTGACACAGCCCGCCAGGATCAGCGGTAAGCATGTCGACCTGTACAAGCTGTACAACGAGGTAACCGA 
ACGAGGCGGCTTTAATAAGGTGACCATGCGGGACGAGTGGGACGAGGTGTACAGCGCCATGGAGACATTGCGGGAGCGCTGCGTTAACGGAACGG 
CTAGTATCAAGCACATCTACAGACGCTACCTGGACAAGTACGAACGGTTGAACTTCTTTGGCGAGGATCCAGACAAGATGGAGGCTCTGGAGGCG 
GCTATTGAGATTGTAGAAATGGGCGGTGGACGCTCGCGATCTCGTTTTTTGGCTGGCGCCGGTGGTGGCCTAGGAAGCATTTTTGGATCAGCACA 
CTCTACGTTGCCGGCGGTGCCAATGGCCTACAACCAGCGCCAGCACTCTGTTAATGTGGAACGACGCCGACAGTACAAGATGTCCACGCAGTTGC 
ACCGACACAGCAGCTACGAGAAGCTGCTGTTGTCTCTGCTTTCTCCGCTGCCCAATGAGCAGGACTTCGCCATAAACGTGTGCACACTTATGGCC 
AACGAGGCGCGCCACACCCTACAGCTGTCCGAGTGCCCCAAACTGCTTGATGTGCTGCTTGCTCACCTGGGAGTCTACGCTGACTTCACCATGCG 
CCAGCTGTTCCAGCACAGTTACACCGAGGTTCGACACCATTCGCAGCTCAGTTTCTGGCGCGACCTGCTACACGACAAGCCACAGGTCCTAGAGC 
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TCTACACGGATGAGCAGGCCTGGCTGGACAACGGTCTGATAAGTAAGGAGGACGGTGAGAACTCTGCCGCTCGAAGTGAACTGCTGGCGGCCTGC 

GACGAGTTGCACTTCCTTAATCTGCGCCGGAGCAACGGGACTGATGAGCGCATGGGCCAGCGAGTGCTTCAGATTGTCCAGCTAGTCCGTACACT 

CAGCTTTCATCAGGAGAATCATGCACTCTTGGCCTCCAACCGCACGTTGTGGCGATATTTGGTCATGGGAGCAAACGTACGATGGAGCAACATTC 

ACATCCAGGCCCTCGAGACCGCTGGCAACCTGGCTCAACAATTTGAGCTGCTAGACCCCACCACTGACGAGCTATCGCGGAATCTTTTAGCTACT 

CTCTGCGAGGGTATTGACTCTAACGACCGCGCGGTGATCATGAGCTGTCTGGAGATACTCTATAAGTTGTGCGGACGAGAAGGCAATTCGCAGCA 

CATCAATCGTTGCCTTGGCCTGGACTTTTACCAGCGAGCCATGTTGCTCCTCTCGCTTACCGACGTCATGATGATTATCTTTACGCTGGAGGCAG 

TTTACGCACTAAGTGCTCTTGGCGCTCGGCCATGTTCGATGCTGATGCAAGTACGTGGCCTAATAGACCAGCTGGTGACCCTTATCACAGTGGAG 

GCTCAGTCATATGGCACAGGCGGTTGCATATTAATGCGTGTCGTAGAGGTTTTCCCCGGAAACATGGCTGCCTCGCTGGCTCAGAACATGCCCGC 

CATTCTTGCTCCAGGTCCCACTTCCCTGACTACACTTCCAACATCGCTGCAAGTAGCTGTTAAACCGCCAGCAGTTTTACCCACGCCTACACCCG 

CCCAGACGCAAGTGCTGCCTCACGGAGAGCAAACACCCTTGGTTACAGCTCTCCCCGGTCTGCCAATGCCCAGTCTGCCACAGGTTCAATTGCCT 

GTACCCAGTCTGCCGCAGGTACAATTGCCACCAATCGCTGCCACCGTGACCCTTCCCAGTCTGCCCTCCGCCTCCCAGAGTTTTACACACGAGGA 

CGAACAGTATGCGCTGGCTTGGCTGGGAGCCACGTATGAACGGGCTGGGAACGGTAACGATCTCCGTGTGGAACAAGCGGAACTATACAGAATAT 

ATTTATCGCATTGCCAGAAGGCGGGTAAGCTTTCAGTGGTTAATCACATGCAGTTTCCTCGTCTAGTGCGGCTTATATTTAATCAATCGGTGGGA 

CCAGTAATTGTACGGCACCTGGACGGAATAGAGCTACCTGGGACGTACTACGTAGGCATTAGGATGCGAGCTCAGCCGCTAGCCATGCAACAGAG 

ACCCACCCCCAGCATTGTTCCTGTTAAAAAGGACACGCCAATATTGCCCAAGCAACCATTTAGTGATCCTGTTCAAACGGAAGTGGCGTCTGAGG 

AAAGCGTTTCTTCTGCAGTCACGCCACCCCCAAGCTCCTCGCTCATCAAAAGCTTGTTGGCCAACAAGGTAACTGAACGCCAACAGAAGCAGAAG 

GCCCAAGCTCAGACGCCGCAGCCGCCAGCCCTTGTGGCTACCACGGCATCGGGCACAAATGCTACCCAGCCCATCAAGGTAACCTCAACAGCTAT 

AAGCGCCTTCGTGAACAACCCGCTAATGCAGCACACACCGGTGAAGGTGGGTCAGACTACCATTAAGCCGTTGCACCCCCAGATGGCCATCGAAA 

AGAAGCCAATCACTGATTCGGCCCCGCCACCTCTCGCTCCATTGAGTGGTGCAAACGTAGTGACGAAAGACGCCAGCGGGCGCACTCTCATCATT 

GCATCAGCAGCTGCCAAACGAAAATTAACCATTGACGAGGAGCAGAATAAGCGTCTGGCTTTAGATGCCGCTGCCAGCAGTTCCAGCAGCTCATC 

TTCTGGCGGCAAGGAAGAAGCGCAGGTGACTCCCTCGAAGAATGCAGCCAATCTCTATGCGGATCTGGCGGCATCTATTTTGGAGGATGAGGATA 

TGGATGACGTTCCACCGCTGGCCAAACCAAGCCAGGAGCAAAGCCAGCAGCACCCGCAACAGGCACAGCAGCTTCAACTCATTCCGGCTAAAGTG 

CAACCAACCCAACGTCAACTTGTTTTCCCCGGTAGCAATGTGGTTTCTCCAGCTCCGCATTCCCAACTAAAACTTGCCACCACGGCCACCATAAA 

GACGGATCAGGGCTTGCAAACTGTGCCAGTGATCCTGCAGCCCAAGACTGCAATGGATTCCACGCCAGCTCCGCAGACGCAATATGTTCTGGCCA 

CCAACCAGCAAGGCCAGACGTATCTTGTGGCCCAGCAGCCGACTATTCCGACATCCACGCAGTCGGCACCACCCACTCTGCTGGTGACGCAAACA 

C C AC AAC AAC AAAC G AAG AC C AT AAT AAT CCTGCAGCAGC C AGG AGG AGT C T CT GC G GC G AAC GT G GC T GGC AC T C AG AAAAT G AT AAT G AC AAC 

GGCGCAGGGCCAGCAGGTCCTGGTTACCACCACCACAGCTCCACAACTGGCCCAACCGCGTACTTCTACCCCCCAGCAGCAAATCTTTATCGCAC 

CCCAGCGACCAGTTCCGGCGTTGGGTGCTGGCCAAATGTCGCCTTCACTGCTGTCGCAGTTGAACCAGATCCCGGCTACTATAAAGCTTCATCAG 

CCCCAACCACAGCCGCAGCTCGTTTCGCCGCAATTAGCACCCCAGCAGCGATTAGTTGCCGCAAAGCCGGGCGCTTCATTGCCCATTCCTCAGCT 

CCAGCAACATCAGTCTATCATCCAGCAACACATAATCTCGGGCCCCTCCACTCCTACTTCCACAGTTGGCGGTACGGTTGGTGGGGAAAAGAGGC 

AG GT AAT T CT GGG CGGAAT C AAGG AG ACC AC GC T AAT T AC C C AG AC GT C GGC C AC AC CGGCTCCCCT GC AGC AAAG C C AGCT T AACT C G C AG AC G 

ATCATCACTCAGTCACCTACCAGCACGGCTATCGTAGGAGGCGTGGCTCTTCAGCAGCAACATCACATTCGATCTGTGTTGGACCAGCAGCAGCA 

GC AT C AT C C AT CC AT C AT C C AAC AGAAG AT AAC C AT GC C GGC AAT T T C C G AAGC GAC C AAAC CG AAAAT AGC AC C C GT T C C AG T T T C AC C AAC T G 

TAACTCTCCTAGCAAAAAAGCCGGCGCCGCCACAACCTGCTCAAGTATCTAGCCCCAAGCCATCAGTCACCCAGGCTGTTGTGGTAACGACGTCA 

GGAACTAGTGATTCAACGGAAGGCGGCAAAACAGTTGTCTTAGCAACCAGCGAGGCCAAGCCAGCGGAATCAGAGGTAGTAGCACAGACGGTTGT 

CCCACCTGCTGTCGTAACTACAGGTCCCAGCAGCAGTTCTTTGGTTGTCAGTCAGAGCGTCTACGCTGCTTCTGGCCCGAATGCCAACCCCACGC 

CCTCTCCCATGCCCGTGGATGCAAATTGGTTGTACATCTGCGACTGGCGGAACTGTCCCCGCAAGAAATTCAAGTCTCTTAACGAGCTGCAGTAC 

CACGTGTGCAGTGTCCACTGTCCAGATCACCTTGATTCTGATGCCGACATTTATTGTCAGTGGGGCAGCGGACCCAACTTCTGCGACAACAGAGC 

GCGCAAACGGTACTCATTGATGACGCATCTGATTGACCGGCACCTGACAACGGAGAACTTGCGCGCCAGTGTGCAACGTCGCCTGGCAACTGGGA 

TTCACAATGTGGCGCCCACCCAGGCTCCCGTGACCATAGTGCGCAACGAGGGTCATGCCCAACGATTGGCGGGAGGCGCTACTGGTTCACCTTCG 

GGTCCTGCTTCAGCGGTTC C AGT C GT T GG C AG C GC C GC AAT GC AAGC TAT GAAC C GAC AC AC GAC T GAT T AC AC G AAC GC T AAAG AGC T GAT G GA 

CGAAAATGAGGGTCCAGTCACCAAGAGCATCCGTCTGACAGCGGCGCTCATTATACGCAACCTGGTCACCTACTCGGCTACGGCCAAGAGAAGCC 

TGAAACGCTATGAGTCACGCTTGGCCAATGTCGCCTTTAGCAATGTGGAGGCTAGCGGAGTGCTCTCCCACATCATGTACGAATTGAGCCAGTAA 

G C C GAT AT AG C T T AC CAT C AAT AT GAT TAT TT AAC T G AC T T GT AAG AAC GAC AC T AAAAAAAT GT G AAAAGC AAA 

(SEQ ID NO: 254) 

Start ATG: 159 

MSATTIDKVRVLLQSLPAAAVNSATQPGDLQTPGKIAAPATPLRAKNPAQLQIMPEKVEEMPPSPPEEFWRDLQQFHERRGTPLTQPARISGKHV 
DLYKLYNEVTERGGFNKVTMRDEWDEVYSAMETLRERCVNGTASIKHIYRRYLDKYERLNFFGEDPDKMEALEAAIEIVEMGGGRSRSRFLAGAG 
GGLGSIFGSAHSTLPAVPMAYNQRQHSVWVERRRQYKMSTQLHRHSSYEKLLLSLLSPLPNEQDFAINVCTLMANEARHTLQLSECPKLLDVLLA 
HLGVYADFTMRQLFQHSYTEVRHHSQLSFWRDLLHDKPQVLELYTDEQAWLDNGLISKEDGENSAARSELLAACDELHFLNLRRSKGTDERMGQR 
VLQIVQLVRTLSFHQENHALLASNRTLWRYLVMGANVRWSNIHIQALETAGNLAQQFELLDPTTDELSRNLLATLCEGIDSNDRAVIMSCLEILY 
KLCGREGNSQH I WRCLGLDFYQRAMLLLSLTDVMMI I FTLEAVYALS ALGARPCSMLMQVRGLI DQLVTL I TVEAQS YGTGGC I LMRVVEVFPGN 
MAASLAQNMPAILAPGPTSLTTLPTSLQVAVKPPAVLPTPTPAQTQVLPHGEQTPLVTALPGLPMPSLPQVQLPVPSLPQVQLPPIAATVTLPSL 
PSASQSFTHEDEQYALAWLGATYERAGNGNDLRVEQAELYRIYLSHCQKAGKLSVVMHMQFPRLVRLIFNQSVGPVI VRHLDGIELPGTYYVGIR 
MRAQPLAMQQRPTPSIVPVKKDTPILPKQPFSDPVQTEVASEESVSSAVTPPPSSSLIKSLLANKVTERQQKQKAQAQTPQPPALVATTASGTNA 
TQPIKVTSTAISAFVNNPLMQHTPVKVGQTTIKPLHPQMAIEKKPITDSAPPPLAPLSGANVVTKDASGRTLIIASAAAKRKLTIDEEQNKRLAL 
DAAASSSSSSSSGGKEEAQVTPSKNAANLYADLAASILEDEDMDDVPPLAKPSQEQSQQHPQQAQQLQLIPAKVQPTQRQLVFPGSNVVSPAPHS 
QLKLATTATIKTDQGLQTVPVILQPKTAMDSTPAPQTQYVLATNQQGQTYLVAQQPTIPTSTQSAPPTLLVTQTPQQQTKTIIILQQPGGVSAAN 
VAGTQKMIMTTAQGQQVLVTTTTAPQLAQPRTSTPQQQIFIAPQRPVPALGAGQMSPSLLSQLNQIPATIKLHQPQPQPQLVSPQLAPQQRLVAA 
KPGASLPIPQLQQHQSIIQQHIISGPSTPTSTVGGTVGGEKRQVILGGIKETTLITQTSATPAPLQQSQLNSQTIITQSPTSTAIVGGVALQQQH 
HIRSVLDQQQQHHPSI IQQKITMPAISEATKPKIAPVPVSPTVTLLAKKPAPPQPAQVSSPKPSVTQAVVVTTSGTSDSTEGGKTVVLATSEAKP 
AESEVVAQTVVPPAVVTTGPSSSSLVVSQSVYAASGPNANPTPSPMPVDANWLYICDWRNCPRKKFKSLNELQYHVCSVHCPDHLDSDADIYCQW 
GSGPNFCDNRARKRYSLMTHLIDRHLTTENLRASVQRRLATGIHNVAPTQAPVTIVRNEGHAQRLAGGATGSPSGPASAVPVVGSAAMQAMNRHT 
TDYTNAKELMDENEGPVTKSIRLTAALI IRNLVTYSATAKRSLKRYESRLANVAFSNVEASGVLSHIMYELSQ* 
{SEQ ID WO: 255) 

Classif icatxon : DNA_binding 
Gene Symbol: BcDNA:GHl2174 
FlyBase ID: FBgn0027541 
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Celera Sequence No. : 142000013384103 

GTGGCCGCAGTGGACTTGCTTCTCCTTCAGGATCTTCGCCACCTGGCGCGTCTTCAGAGCCAACCAAGCCAATGTGCGCTCTTGGTTGTACTTGT 
AGAACTTCTGTTCGCCAGAGCTCTTCACATCTGCCACGCATTTAAGGTTCCCTGGGTCGAGGATCTCGTTAAGCAGACGACTGGTGCTGGCCTCC 
TCCACGGCAATGTTGTCCAGCGATATTGCTCCTGATTGTGGTTAAAGATTTTCAAGAATACAATAGTATAGTATACCATCGAAAGCGAATGATAA 
TATCTTACTTTGCGCACAATGTTTTCGGAGGTGATGAAGAGCCAGGAACGTGGGATCCACGGGCGCGGTCATGTAGATTCTGCCGTTGGAGCAGA 
CCTCGCTGTCCACGAACCAACTGCGTCGGGGTTCCGTGAACGCCACCAGCTCCATCATTCTGCCATCGGGATGGGTTATGAACAGTGCCTCCTTG 
CCATGTCCGGGATGGAAAAACATCTCCAGCCGAAGGTGTCCGTCGTCCGCATCTTTGGGCAGCAGTTCCTCCGACATGAAGAACACCTTCTTCAG 
TGCCGAGGCCTTGCACATCTTGGCAGATCCCTCCGCATCCGGATCAGCTTTTGGTTTTCTCGCTCGCGTTTCCTTTTTGCCCATTTTCTCGAATA 
ACTTGCCAAGCGTGAAGCAAAAGTCAATAACTGTGGACCGAACCAAAACAAACGATTTGCCGCCTTTTTGGTTATTGATAGAAGTCTATCGATTA 
ATTTTGGTATTTTATATAGTCTGCTAATTTTAACTTGTTCCTATAAACTATTTTATCTATTTTATGCCAAATAGAAGAGCTTAAAAATTAAATAA 
TCTCTGCCACTAACGTCCGCATCTTAATATTGCAACTAGTCTAAAATGGGTATTAACTTTGATTGAGAAGCGCGGTCACATTGCTTGAACAGATG 
TTCACAGTGATGAAATTGTTTACAACAGTATATTGTTTTTAAACAACAAAATGAGCGTAGATATAGCCAGCACGATCAAGTTCCGGGATATTTGC 
AGCCTGTTCGAGAAGATTAAGGCCACGCAAAAGGTGGCTAACAAGGAGGAGGTCCTCAAGAGCTACTACGAGTCCTTCTGTCGCCACCGCGAATC 
CTTTCGCCGGCAAACCGGATTAAACAACGACCAGCCGGAGGACGGTGCCAGTAGCTTCTATTCCGTGCTGCGACTGCTGCTTCCGGGTGCGGACA 
CTGGTCGGGATACCTATGGATTGCAGATCACCGCGCTGGGCAGGCTCTACATCAGGGTGCTCCAGCTACCCACGGACTGTGAGTGCAGAAACGTA 
TTGGC7VATGGCATTTAGGCTTATTTAGAATCTCTTTGTTGTCTCTACTAGCCAGCGATGCCATCAGGCTGCAGCACCGCAACGGAAACATGTACC 
GGGACTACGGTGATGTTGTTTATTCCGTGCTTAAGCCAAGGTGCTTTAATCCGCCCAGCAATTTGAGACTGAAAGAAATCCATCAGATGCTGGAC 
ACCATTGCCAACGAGGACACAGAAGGTGGGCGCTTTTAAACTGATCTGAACGACTGAACGTCTTAATGAGACCGCTCTTTGCAGTTAAGCAGCAG 
CAACTCATCCGGTTTACGGAGCAGGCCTCGCCTGAGGAACAGAAGTGGCTAATTCGGTTGCTGCTTAAGAGCCTCGGGCTGGGGATCGGTGAGCA 
AAAGATTTTCGGTGTGCTGCACCCCAAGGCGCAGGATATCTACCAGCGCTGCTCCGATCTGGGACATGTGTGCAATCTGCTGGCAGACCGAACCA 
CCGACCTAGACGCCAGCAGCAGCAAGGACAGCAAGGCGGCGGTCAAGTTTGTAAACCTGAATTCCGTCATACGGCCATTCCACCAGATACGTCCC 
ATGCTGTGCGAACGATTCCCCGGTGACATCCAGGAGCTGATGCAGTCGGATGTGCTCTATCTGGAGACCAAAATGGATGGCGAACGCTTTCAGCT 
GCACATCGATCGGGGTCGTTTCATGTACATTTCCCGCAATGGAGTGGACTACACCAGAAACTTTGGTCACAGCTACGATCACGGCACCTTAACGC 
CAAAGCTGAGGGGACTTCTGCCGCTGGGTCTGGAGTCTATCATACTCGACGGCGAGATGATGGTGTGGGATACCAACAAACTGCGCTTTCGGGAA 
AAAGGCGAGAATACGGACGTGAAGAGCCTGAAGCCGGAGGGCAGCTGGCAGCCATGTTTCGTGGTGTACGATCTGCTATATTTTAATGGCCAGAG 
TCTGCTGGACCATACCTACATTCAGCGGGCGTACAAATTGCAGAAACTCATAGTTGAACAGTCGGGTGTGCTGCAGTTGATGCGTGCACGCAAAA 
TCGGTTCCGTTCAAGAGTTCAACGAGCTGTTCCAGCAGGCACTCGACTCCCACGCCGAGGGCATTGTGCTAAAGAAGCAGGGATCAAGATATCAG 
CCTGGCGTTAGACTGGGCGGTGGGTGGTACAAGGACAAGGCTGATGTGAGTAGAGTCATTTTTAAGTAGAGTGCTTTAATATGAACTCACTTCAA 
TTTTCCAACCAGTACATCAAAGGGCTGATCACCGAGTTCGATGTGCTGATCATCGGGGCATTCTACAATCGCAAGCGCACCTTTGTGGACTCATT 
TTTACTGGGCGTTCTCCAGCCGGCGCCGCCAGGCAGCTCTAATCGTCCCGAAGTCTTCAGTATTGGAGTGGTTGCCAACAATACGAAACAGCGCG 
GAGTGCTAAATCACACACTGAAACCACATTGGCATGATGTAGTCAATGAGCCACCGCCATTGTGGTTTCACTACAAGCCCAAGGAGAGGTCCGGA 
TGCCCCGATCTGTGGATTGAACCGCAGAACTCGGTAATTTTGCAGGTGAAAGCTGCTGATCTGGCGCCCAATGGAGCATTTTTCACCCGAAAATC 
G T T G C AC TT T C C GC GC AC T GAGAT G AAAC GGG AT G AT AAG A C C T GG AGC G AGT GT AT GAC AC T T AAGG AGT T T AAC GAT C T C T GC GG AGGT C C T T 
TGGCCATCAAGAAGTTAAACAAGCGGCAACTGCGACTAGAGGATGTGACCACCAAGCGGAAACAGATGCGTATGACGCCCTCGGAGCGAAGCCGA 
CTGGGATTGGCCGTCTATGAGAAGCGCTATGACGCTAGCACCTCCGCTAGCACTTCCAAACTTTTCGACGGTCTGAGCTTTTGCATTCTGAGCGG 
ATCTGCCGGACGACACAGCAAGCATCAGCTGCAGGAGCTGGCTGTTAAaAACGGTGGCTGCATCGTGGAAAATCCCCTGCCCAATGATCCAAAGT 
GTTTTTGCATCGCAGGCGATGAGACGTTTTTAGTTAAGCGGCTCATCCTTCAACAGCCACGCACCTGTGACATTGTGCGCATGGAGTGGCTACTT 
AGGGTCTGTCAGAAGCAGGAGCTTGAACTAAAGCCTAGAGATCTTATTGCGGCCACTGAACC ACT GC AGC AGGATCTGGCTGAATGTTTT GAT CG 
GCACGGTGATAGCTACACCAAAGACATTGCCAACGTGGAGGAGCTGCAGGATCTGCTTCAAGGCATTGAGTTGACAGCTGATAATGTGGCTGGCA 
TAACCGCATCTAATCTCAATGCTCTGGAGGATCAGCTTCTGGATGGGAAAAAGAATCTGAACATGTTCCGCAACCTTAATGCGTTCTTCTACAGT 
CCGCATGGTGATGAAGTGGCTAAGCTTCTGTTTCTTCAAAACGGCGGCCGGATAGTCGACGATTCCGATCCGCAACTAAATCTGGGATTTATCTG 
TATGTCTTCTGACATCGATAACGACCACTTCGAGCACTGGCTGCACAATCATTCCAAGTTGACCACTGACAAGGTCTTAAACTCGGCGTGGATAC 
ATCAGTGCCATCGCGAAGGCATCCTACTTCCTATGCATTCTTTCGTCTAACTCAATGAGTTTTTCACGCTTTATAGTTTTGTACAAAAATTACCT 
GTTATCCTTGTTATAAAATTGTTGTTTCATTGACATTTTTCTTACCATAAAAAGCTTTAAACAGAAGTCATTCCGAATTTAATGTTATATCAAAA 
AGATTTTCATGGATAATTTGCTTCAATAAAGTGAAACATAGAATAACAATAGTAGAATGACAATAAGTACGCAAAAAAGTTATTTTTCAATGATT 
TCGTTTTTATTCACATAAAAAGAGATAACGAGCAGAATAGTATTGTTTCCGTTGGGAGGATTTTGGACAAGTTTTGGGGTAGTGTGGATATGGAT 
ATTAGGATCGTTATTTCGCCAATTGATACGTTTGACAGTTCGGTTACAAAGTTCAGGGAGTTCTCTTCGGGATTTTCATTGGTTTATCGGTTCTG 
CACTTTGGGCAATAAGAACATCAAAAATACATTTATTTTATTCATTGTAAAGTTAGATATATGTTTGTATATATATAACATAAATAATGTATATA 
TAGGTGTATAACATTTAACACTAACAATTCTTAAGCGCGAGCTCTTCGCTCTCTCTCTCTCTCTCTCTCTCTATTTCTTTCTCTGTCTCTCTCCA 
TCTCACTTGCACCTACTAACGCTATCTCTGTCGTTCATTGACTTGACTTTAACATTTAATTTAATGAGGAATATGCAATCTCTGAGATTGTTTAA 
CACATTTATCGACTCTATGAACATATAGATTACGTATGTCAATCAAATTGTTTAAAATTTACCAAGCTAATATCAACAAAAAAAAAAAAACTTTA 
TAGTTTAAGTTTCTCAGTTAGTTCGATTCATGGGAGTGGGCAAAATTCGTGGTTGGAATATTTGTTGATGTTTATACATAAATAGTTGGGCTTGG 
GCATACAAATGCAGAGTACTCTCGATCTTGATCTTTATTTGTGGGTATAACAATCGGTATACTAAACTTAGTCGTTACTTTTCAATGTTGTTTGG 
TTGTGGGTTTGTTCTCTTATATTTTCACTTTTGTGTTTCTTATTCCGGCATGGTAAATAGGTTACAACGCTACACATAACAACTATCAACATTTA 
GGGTATGCATAATAATAGCGATATCATAATAATAATAATAATAATAATAATAGTAATAACATCAGTTACAACTTTATTTGTGTGTGTTCGTCGTT 
CCTTCAATATAATTGAAAATA 
(SEQ ID NO: 256) 

Exon: 1001.. 1313 
Exon: 1381.. 1545 
Exon: 1605.. 2515 
Exon: 2578. .4151 
Start ATG: 1001 

Transcript No. : CT9161 

ATGAGCGTAGATATAGCCAGCACGATCAAGTTCCGGGATATTTGCAGCCTGTTCGAGAAGATTAAGGCCACGCAAAAGGTGGCTAACAAGGAGGA 
GGTCCTCAAGAGCTACTACGAGTCCTTCTGTCGCCACCGCGAATCCTTTCGCCGGCAAACCGGATTAAACAACGACCAGCCGGAGGACGGTGCCA 
GTAGCTTCTATTCCGTGCTGCGACTGCTGCTTCCGGGTGCGGACACTGGTCGGGATACCTATGGATTGCAGATCACCGCGCTGGGCAGGCTCTAC 
ATCAGGGTGCTCCAGCTACCCACGGACTCCAGCGATGCCATCAGGCTGCAGCACCGCAACGGAAACATGTACCGGGACTACGGTGATGTTGTTTA 
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TTCCGTGCTTAAGCCAAGGTGCTTTAATCCGCCCAGCAATTTGAGACTGAAAGAAATCCATCAGATGCTGGACACCATTGCCAACGAGGACACAG 
AAGTTAAGCAGCAGCAACTCATCCGGTTTACGGAGCAGGCCTCGCCTGAGGAACAGAAGTGGCTAATTCGGTTGCTGCTTAAGAGCCTCGGGCTG 
GGGATCGGTGAGCAAAAGATTTTCGGTGTGCTGCACCCCAAGGCGCAGGATATCTACCAGCGCTGCTCCGATCTGGGACATGTGTGCAATCTGCT 
GGCAGACCGAACCACCGACCTAGACGCCAGCAGCAGCAAGGACAGCAAGGCGGCGGTCAAGTTTGTAAACCTGAATTCCGTCATACGGCCATTCC 
ACCAGATACGTCCCATGCTGTGCGAACGATTCCCCGGTGACATCCAGGAGCTGATGCAGTCGGATGTGCTCTATCTGGAGACCAAAATGGATGGC 
GAACGCTTTCAGCTGCACATCGATCGGGGTCGTTTCATGTACATTTCCCGCAATGGAGTGGACTACACCAGAAACTTTGGTCACAGCTACGATCA 
CGGCACCTTAACGCCAAAGCTGAGGGGACTTCTGCCGCTGGGTCTGGAGTCTATCATACTCGACGGCGAGATGATGGTGTGGGATACCAACAAAC 
TGCGCTTTCGGGAAAAAGGCGAGAATACGGACGTGAAGAGCCTGAAGCCGGAGGGCAGCTGGCAGCCATGTTTCGTGGTGTACGATCTGCTATAT 
TTTAATGGCCAGAGTCTGCTGGACCATACCTACATTCAGCGGGCGTACAAATTGCAGAAACTCATAGTTGAACAGTCGGGTGTGCTGCAGTTGAT 
GCGTGCACGCAAAATCGGTTCCGTTCAAGAGTTCAACGAGCTGTTCCAGCAGGCACTCGACTCCCACGCCGAGGGCATTGTGCTAAAGAAGCAGG 
GATCAAGATATCAGCCTGGCGTTAGACTGGGCGGTGGGTGGTACAAGGACAAGGCTGATTACATCAAAGGGCTGATCACCGAGTTCGATGTGCTG 
ATCATCGGGGCATTCTACAATCGCAAGCGCACCTTTGTGGACTCATTTTTACTGGGCGTTCTCCAGCCGGCGCCGCCAGGCAGCTCTAATCGTCC 
CGAAGTCTTCAGTATTGGAGTGGTTGCCAACAATACGAAACAGCGCGGAGTGCTAAATCACACACTGAAACCACATTGGCATGATGTAGTCAATG 
AGCCACCGCCATTGTGGTTTCACTACAAGCCCAAGGAGAGGTCCGGATGCCCCGATCTGTGGATTGAACCGCAGAACTCGGTAATTTTGCAGGTG 
AAAGCTGCTGATCTGGCGCCCAATGGAGCATTTTTCACCCGAAAATCGTTGCACTTTCCGCGCACTGAGATGAAACGGGATGATAAGACCTGGAG 
CGAGTGTATGACACTTAAGGAGTTTAACGATCTCTGCGGAGGTCCTTTGGCCATCAAGAAGTTAAACAAGCGGCAACTGCGACTAGAGGATGTGA 
CCACCAAGCGGAAACAGATGCGTATGACGCCCTCGGAGCGAAGCCGACTGGGATTGGCCGTCTATGAGAAGCGCTATGACGCTAGCACCTCCGCT 
AGCACTTCCAAACTTTTCGACGGTCTGAGCTTTTGCATTCTGAGCGGATCTGCCGGACGACACAGCAAGCATCAGCTGCAGGAGCTGGCTGTTAA 
AAACGGTGGCTGCATCGTGGAAAATCCCCTGCCCAATGATCCAAAGTGTTTTTGCATCGCAGGCGATGAGACGTTTTTAGTTAAGCGGCTCATCC 
TTCAACAGCCACGCACCTGTGACATTGTGCGCATGGAGTGGCTACTTAGGGTCTGTCAGAAGCAGGAGCTTGAACTAAAGCCTAGAGATCTTATT 
GC G GC C AC T GAAC C ACT GC AGC AGGAT CT GGC T G AAT GT T T TG AT CGG C AC GGT GAT AGC T AC AC C AAAG AC AT T G C C AACG T GG AGG AGC T GC A 
GGATCTGCTTCAAGGCATTGAGTTGACAGCTGATAATGTGGCTGGCATAACCGCATCTAATCTCAATGCTCTGGAGGATCAGCTTCTGGATGGGA 
AAAAGAATCTGAACATGTTCCGCAACCTTAATGCGTTCTTCTACAGTCCGCATGGTGATGAAGTGGCTAAGCTTCTGTTTCTTCAAAACGGCGGC 
CGGATAGTCGACGATTCCGATCCGCAACTAAATCTGGGATTTATCTGTATGTCTTCTGACATCGATAACGACCACTTCGAGCACTGGCTGCACAA 
TCATTCCAAGTTGACCACTGACAAGGTCTTAAACTCGGCGTGGATACATCAGTGCCATCGCGAAGGCATCCTACTTCCTATGCATTCTTTCGTCT 
AACTCAATGAGTTTTTCACGCTTTATAGTTTTGTACAAAAATTACCTGTTATCCTTGTTATAAAATTGTTGTTTCATTGACATTTTTCTTACCAT 
AAAAA GC T T T AAACAG AAGT C AT T CC G AAT T T AAT G T T AT ATC AAAAAG AT T T T CAT G GAT AAT T T GCT TC AAT AAAGT G AAAC AT AG AAT AAC A 
ATAGTAGAATGACAATAA 
(SEQ ID NO: 257) 

Start ATG: 1 

MSVDIASTIKFRDICSLFEKIKATQKVANKEEVLKSYYESFCRHRESFRRQTGLNNDQPEDGASSFYSVLRLLLPGADTGRDTYGLQITALGRLY 
IRVLQLPTDSSDAIRLQHRNGNMYRDYGDWYSVLKPRCFNPPSNLRLKEIHQMLDTIANEDTEVKQQQLIRFTEQASPEEQKWLIRLLLKSLGL 
GIGEQKIFGVLHPKAQDIYQRCSDLGHVCNLLADRTTDLDASSSKDSKAAVKFVNLNSVIRPFHQIRPMLCERFPGDIQELMQSDVLYLETKMDG 
ERFQLHIDRGRFMYISRNGVDYTRNFGHSYDHGTLTPKLRGLLPLGLESIILDGEMMVWDTNKLRFREKGENTDVKSLKPEGSWQPCFVVYDLLY 
FNGQSLLDHTYIQRAYKLQKLIVEQSGVLQLMRARKIGSVQEFNELFQQALDSHAEGIVLKKQGSRYQPGVRLGGGWYKDKADYIKGLITEFDVL 
IIGAFYNRKRTFVDSFLLGVLQPAPPGSSNRPEVFSIGVVANNTKQRGVLNHTLKPHWHDVVKEPPPLWFHYKPKERSGCPDLWIEPQNSVILQV 
KAADLAPNGAFFTRKSLHFPRTEMKRDDKTWSECMTLKEFNDLCGGPLAIKKLNKRQLRLEDVTTKRKQMRMTPSERSRLGLAVYEKRYDASTSA 
STSKLFDGLSFCILSGSAGRHSKHQLQELAVKNGGCIVENPLPNDPKCFCIAGDETFLVKRLILQQPRTCDIVRMEWLLRVCQKQELELKPRDLI 
AATEPLQQDLAECFDRHGDSYTKDIANVEELQDLLQGIELTADNVAGITASNLNALEDQLLDGKKNLNMFRNLNAFFYSPHGDEVAKLLFLQNGG 
RIVDDSDPQLNLGFICMSSDIDNDHFEHWLHNHSKLTTDKVLNSAWIHQCHREGILLPMHSFV* 
(SEQ ID MO: 258) 

Name: DNA ligase 

Classification: DNA_repair_protein 



Celera Sequence No. : 142000013384500 

GGTATACGGCCACTCCGGCAAAGATAAAGGCCAGCAGGACCATGTCAGTCTGGCCTGATATTCTGTTGGTCTCCCTTTGCACTTTTATTCGATCG 
TATTTGGCCTGGGCGGCTTGTCTCCGGTCAAATGATTTACCATAGTGATTTCGCGACCACTCGTCGAAGTCATAGATGGGCGTACGGCCCGCAGA 
ATCAGATACACGCGATTTCTGAAATCGCGACTTGTAAAACTTAGTCTCCGCATCGTCCTCGACTACGGGTTCCGCTACATCGTGGACATCTTGGG 
CATATTGGGCGCCTGCCGTATGTACGATTCCCTTGTCGTAAAGTCGACGCAGCCGGTAATTTCCCAGAATCTCATAAGCCTGATTGATCTCCCGA 
AATTTCTTGGCTGCGTTCTCACTTCCTTGGTTCCTGTCCGGATGATAAAGCATCGAGAGCTTATAGTAAGCTGCCTTGATCTCGTTCTGCGTGCA 
CTGACGCCTGATTCCCAGTGCGTCGTAGTGGCTCATCTGGTGGCGTTGACTGAGGTAGAGGCCACGCAGCAGAAGCAGAGGTCGCCACAACATTG 
GGCTGTGCTGGTACATCTTGGATGTGTTCTTTCGCCCGCTTCACTTGTTTTGGTTTCGTTATATCACATGAAAATTAAAAACAAATCGAATCAGC 
TGTTTGAACTTGAACTTGTTCACCCTCGCCATCGTTGTTATCGATAGATCGATAGTATTCTGAACCAATGGTGCAGGTACAGCAAAAAATTAGCT 
GAAAGAGGAAAAAGAGCTTGTAATTGATTAATTTATAAAAGATAGATTTTGAAAATATTTACAGAATAAATGATTTTCAAAAAAGTTTTTACAGA 
TATATCAGAAAAATATTTGTAATCCCTGGCTTAGTTAAAAATTTAAGACTTTAGTAAAGTTGAAATTCGTCAGCTGTTTGGGCTATGATGAAAAA 
AGCAAAACGTAAGCTCCACGGTTCCAGTCAGCTGTTTGGCCGATTCGCTGCGAATGGCAACTGTTGCAACTATTTTGGTTTGAGTCTTCTCAATT 
TTTATTTATATATAGAACAGGGCAGCAGCCGGTTCAATAGTCTCACGCAGCGTGCTTGATTTTCCCGCTAAACGCCTGCAACTGTATCTGTGTGT 
ATTCGAGAGCGATCGAGAAGTTTCTGGTGGTCGAAATTTAGGCCAGAAGGCACAAAACCGAACTGAATAGTGAGTCATCCGTTAACAATAAAAAG 
TTGCAGCTACAGTTTTTGTCTTTTTATACAGAAAATTGTGTAACCATGACAAACACCATCAACAGCGAGGAAGTGTCCGACGCAGGAAGCTACAA 
AGATAAGGGCAACGAGGCGTTCAAGGCCTCCCGCTGGGAGGAGGCAGTGGAGCACTATGGCAAAGCCATTAAGGCGGGCTCTAAGCACAAGGAGC 
TGGCGGTTTTCTATAAGAATCGTGCTGCCGCTTATCTGAAGCTAGGAAAGTATGAAAATGCGGTAGAGGACTGCACTGAATCTTTGAAAGCGGCT 
CCGGGGGATCCCAAGGCACTGTTCCGTAGGGCTCAGGCGTACGAAGCTCTGGAGAAGTTCGAAGAGGCCTACAAAGACGCAACCGCTTTGTTTAA 
AGCGGATCCCGGCAACAAAACCGTACAGCCCATGCTCCAGAGGCTTCATGTCGTCGTGGAAGAGCGCTCTGCCCGTAATGCCAAGACATCCACAA 
AAGTCAAGCAGATGATGGATCTTACTTTTGATTTGGCCACGCCCATCGATAAGCGTCGCGCGGCAGCCAATAACCTAGTAGTTTTGGCCAAAGAG 
CAGACTGGCGCTGAACTGCTTTACAAGGACCACTGCATCGCCAAAGTGGCCTCACTAACAAAGGTGGAAAAGGATCAGGACATTTATGTGAATAT 
GGTGCATTTGGTGGCTGCTCTTTGCGAAAATAGCGTCGAGCGTACAAAGGGCGTTTTAACTGAACTCGGAGTACCGTGGTTCATGAGGGTTCTCG 
ACCAAAAACACGAGAACTGTGTGTCCACCGCCCAGTTTTGCCTACAAACAATATTGAACGCATTGTCTGGGCTTAAGAACAAACCCGATTCGAAG 
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CCGGAGAAGGAGCTGTGCACTAGGAACAACCGGGAAATTGACACTCTCCTAACATGCCTAGTTTATAGTATCACGGATCGCACGATTTCTGGAGC 
AGCTAGGGATGGCGTTATTGAGCTTATAACCAGGAATGTCCACTACACAGCTCTAGAATGGGCCGAACGCCTGGTTGAGATCAGGGGACTGTGCC 
GTCTGCTGGATGTGTGCTCCGAACTGGAGGATTATAAGTACGAGAGTGCAATGGATATCACAGGCTCATCATCCACAATTGCATCCGTTTGTCTG 
GCCCGGATATATGAGAACATGTACTACGATGAAGCTAAGGCACGGTTCACGGACCAGATCGACGAATACATTAAGGACAAGCTTTTGGCGCCTGA 
TATGGAATCCAAGGTGCGGGTCACCGTTGCGATTACTGCCCTGCTTAACGGTCCATTGGATGTGGGCAATCAGGTTGTGGCCAGAGAAGGTGAGT 
TCTGTTATTGGATTTAGAAAATGGTACTTAATTAATGGTGAAATTTGTAACAGGAATACTGCAGATGATTCTAGCAATGGCTACGACCGACGATG 
AGTTGCAGCAGCGAGTATCATGTGAGTGCCTAATCGCCGCTTCCTCTAAAAAGGACAAGGCCAAGGCTCTTTGTGAGCAGGGAGTAGATATTTTA 
AAGCGGCTCTACCACTCCAAGAACGATGGTATTAGAGTACGCGCCCTGGTGGGTCTTTGCAAGCTGGGTAGCTACGGCGGTCAGGACGCGGCCAT 
CCGACCATTTGGCGATGGAGCTGCCCTTAAGCTGGCGGAGGCATGCCGTCGGTTTTTGATCAAACCTGGAAAGGATAAGGATATCCGTCGTTGGG 
CTGCCGATGGTCTGGCTTATTTAACACTGGATGCTGAATGTAAGGAGAAGCTCATTGAGGATAAGGCTTCCATTCACGCTTTAATGGATTTGGCC 
CGCGGAGGAAATCAGTCCTGTCTCTATGGTGTGGTCACTACTTTTGTGAATCTGTGCAATGCGTATGAGAAGCAGGAGATGTTGCCTGAGATGAT 
AGAACTTGCCAAGTTTGCTAAGCAACACATACCGGAGGAGCACGAGCTGGACGACGTAGATTTCATCAATAAGCGTATTACTGTGCTGGCAAATG 
AGGGCATAACAACTGCTCTTTGTGCACTGGCGAAAACTGAAAGCCACAATTCACAGGAGCTGATTGCCAGGGTTTTGAATGCGGTTTGTGGACTG 
AAAGAGTTGCGAGGAAAAGTGGTTCAAGAAGGCGGCGTTAAGGCGCTGCTTCGTATGGCGCTTGAGGGAACTGAGAAAGGAAAACGCCATGCTAC 
CCAGGCGTTGGCTAGGATTGGCATAACCATTAATCCGGAAGTATCTTTCAGTGGCCAGCGATCGCTTGATGTAATCCGTCCTTTGTTGAACCTTT 
TGCAACAGGACTGCACAGCGCTGGAGAATTTTGAGTCGCTAATGGCACTCACCAATCTAGCCAGCATGAACGAGAGCGTGCGCCAGAGGATTATC 
AAGGAGCAGGGGGTCTCCAAGATAGAGTATTATCTGATGGAGGACCATCTGTACCTGACTCGTGCAGCTGCCCAGTGTCTGTGTAATCTGGTTAT 
GAGTGAGGATGTCATTAAAATGTTCGAGGGCAATAACGACCGAGTGAAGTTCTTGGCGCTTCTCTGCGAGGATGAGGACGAGGAGACCGCAACAG 
CTTGTGCCGGAGCTCTGGCCATTATTACTTCAGTGTCTGTCAAGTGCTGTGAGAAGATCTTGGCCATCGCCAGCTGGCTGGACATTCTGCACACT 
CTGATCGCCAATCCCAGTCCGGCGGTCCAGCATCGAGGCATTGTGATTATTCTAAACATGATCAATGCTGGAGAGGAGATCGCCAAAAAGCTGTT 
TGAAACGGACATTATGGAACTGCTGAGCGGTTTGGGGCAGCTGCCCGATGACACGCGTGCCAAGGCCCGAGAGGTGGCCACCCAGTGCCTGGCGG 
CGGCGGAGCGTTATAGGATCATCGAGCGGTCCGACAACGCGGAAATCCCCGATGTGTTTGCCGAAAATTCGAAAATATCTGAGATTATCGATGAT 
TAAGCGAAATTCGTTTGTAGTGCCTTTTGTTTTTCCTCCATTTCATCTTCTTTACTTTACTCACAGTGTCATCGCGAAATTTAACTTATTTTTCA 
AAATATCAAATGTACTTAACTGTTTTATAAATATTGTTCGAATTTATTTCGTTTTTAAAGGTAGTTTGTTTTATTCCTTAGTGGGTAATATAATA 
T AT T AT T T AGTT ATT T CAT GAAT ACGC G AT T T T T T AAT T T AAC T AAT T T AAT AC AG AC C AAGC C T AAT C C AAG C C T AAT T GC T GC GGC C T AAT G T 
GATTTGGATTGTAATATGGATGGCCACTAGGATGGCCTAGTGTTTTCCCACCGACTCCATTTTCCCCGGTGCACTGCCTCGGTACGTTTTCTATT 
CATTGCCACACATTAGTCGCCAGCTCGAGGTAGCTGACTTCCTCTAATGCAGAAGATGCTTCTGATGCGGCAGTGTTGCAGTGTCAGGTAGCCGG 
GAATTCCGAGCCGAGCATCTCTTCCAGCTGATTGCTCAATCTCAATGAAGTGTGATTAAATAAATAATAGTTGCTTTAATTTAGTTTGGCTCAGC 
GCAGTGGGAAATGGCCGAGGGTTTCGGTTGCTATGATTCGGTTGCCATGACTTAACGGCATTTATTCCGAAATCCATCCCGCTTTTTGTCTTTGG 
CCATTTGGCATTCCTTTTCCCGCGATTATTTCGCATTTTCCTCTTCTCCTTAAACCATATACTGTGCTCAATTTGACCAATATAAATAACGTGGT 
TAGAAGGTAAAGTAATGAGGAAGATGAATTCTTCAGGACCCTTATATGCTAAAGGGATATAAGCTGAAATAAAATATGTTGATGTAAGACCCCAG 
AAGGAAACAAACTAATGCTTGGCAATATATTTAAAAAATTGGTGTATAAAAACAATTAAGTTAGGGGAATAAACATTTTAAAAAATGGTAAAAAA 
AG AC AT T T T AAAAAAT GT AAAAAGG AT AT T TC C GT GG AAAT AT AT T AGC AAAT 
(SEQ ID NO: 259) 

Exon: 1001.. 1209 
Exon: 1267.. 2559 
Exon: 2619.. 4183 
Start ATG: 1281 

Transcript No. : CT9235 

CGAATGGCAACTGTTGCAACTATTTTGGTTTGAGTCTTCTCAATTTTTATTTATATATAGAACAGGGCAGCAGCCGGTTCAATAGTCTCACGCAG 
CGTGCTTGATTTTCCCGCTAAACGCCTGCAACTGTATCTGTGTGTATTCGAGAGCGATCGAGAAGTTTCTGGTGGTCGAAATTTAGGCCAGAAGG 
CACAAAACCGAACTGAATAAAAATTGTGTAACCATGACAAACACCATCAACAGCGAGGAAGTGTCCGACGCAGGAAGCTACAAAGATAAGGGCAA 
CGAGGCGTTCAAGGCCTCCCGCTGGGAGGAGGCAGTGGAGCACTATGGCAAAGCCATTAAGGCGGGCTCTAAGCACAAGGAGCTGGCGGTTTTCT 
ATAAGAATCGTGCTGCCGCTTATCTGAAGCTAGGAAAGTATGAAAATGCGGTAGAGGACTGCACTGAATCTTTGAAAGCGGCTCCGGGGGATCCC 
AAGGCACTGTTCCGTAGGGCTCAGGCGTACGAAGCTCTGGAGAAGTTCGAAGAGGCCTACAAAGACGCAACCGCTTTGTTTAAAGCGGATCCCGG 
CAACAAAACCGTACAGCCCATGCTCCAGAGGCTTCATGTCGTCGTGGAAGAGCGCTCTGCCCGTAATGCCAAGACATCCACAAAAGTCAAGCAGA 
TGATGGATCTTACTTTTGATTTGGCCACGCCCATCGATAAGCGTCGCGCGGCAGCCAATAACCTAGTAGTTTTGGCCAAAGAGCAGACTGGCGCT 
GAACTGCTTTACAAGGACCACTGCATCGCCAAAGTGGCCTCACTAACAAAGGTGGAAAAGGATCAGGACATTTATGTGAATATGGTGCATTTGGT 
GGCTGCTCTTTGCGAAAATAGCGTCGAGCGTACAAAGGGCGTTTTAACTGAACTCGGAGTACCGTGGTTCATGAGGGTTCTCGACCAAAAACACG 
AGAACTGTGTGTCCACCGCCCAGTTTTGCCTACAAACAATATTGAACGCATTGTCTGGGCTTAAGAACAAACCCGATTCGAAGCCGGAGAAGGAG 
CTGTGCACTAGGAACAACCGGGAAATTGACACTCTCCTAACATGCCTAGTTTATAGTATCACGGATCGCACGATTTCTGGAGCAGCTAGGGATGG 
CGTTATTGAGCTTATAACCAGGAATGTCCACTACACAGCTCTAGAATGGGCCGAACGCCTGGTTGAGATCAGGGGACTGTGCCGTCTGCTGGATG 
TGTGCTCCGAACTGGAGGATTATAAGTACGAGAGTGCAATGGATATCACAGGCTCATCATCCACAATTGCATCCGTTTGTCTGGCCCGGATATAT 
GAGAACATGTACTACGATGAAGCTAAGGCACGGTTCACGGACCAGATCGACGAATACATTAAGGACAAGCTTTTGGCGCCTGATATGGAATCCAA 
GGTGCGGGTCACCGTTGCGATTACTGCCCTGCTTAACGGTCCATTGGATGTGGGCAATCAGGTTGTGGCCAGAGAAGGAATACTGCAGATGATTC 
TAGCAATGGCTACGACCGACGATGAGTTGCAGCAGCGAGTATCATGTGAGTGCCTAATCGCCGCTTCCTCTAAAAAGGACAAGGCCAAGGCTCTT 
TGTGAGCAGGGAGTAGATATTTTAAAGCGGCTCTACCACTCCAAGAACGATGGTATTAGAGTACGCGCCCTGGTGGGTCTTTGCAAGCTGGGTAG 
CTACGGCGGTCAGGACGCGGCCATCCGACCATTTGGCGATGGAGCTGCCCTTAAGCTGGCGGAGGCATGCCGTCGGTTTTTGATCAAACCTGGAA 
AGGATAAGGATATCCGTCGTTGGGCTGCCGATGGTCTGGCTTATTTAACACTGGATGCTGAATGTAAGGAGAAGCTCATTGAGGATAAGGCTTCC 
ATTCACGCTTTAATGGATTTGGCCCGCGGAGGAAATCAGTCCTGTCTCTATGGTGTGGTCACTACTTTTGTGAATCTGTGCAATGCGTATGAGAA 
GCAGGAGATGTTGCCTGAGATGATAGAACTTGCCAAGTTTGCTAAGCAACACATACCGGAGGAGCACGAGCTGGACGACGTAGATTTCATCAATA 
AGCGTATTACTGTGCTGGCAAATGAGGGCATAACAACTGCTCTTTGTGCACTGGCGAAAACTGAAAGCCACAATTCACAGGAGCTGATTGCCAGG 
GTTTTGAATGCGGTTTGTGGACTGAAAGAGTTGCGAGGAAAAGTGGTTCAAGAAGGCGGCGTTAAGGCGCTGCTTCGTATGGCGCTTGAGGGAAC 
TGAGAAAGGAAAACGCCATGCTACCCAGGCGTTGGCTAGGATTGGCATAACCATTAATCCGGAAGTATCTTTCAGTGGCCAGCGATCGCTTGATG 
TAATCCGTCCTTTGTTGAACCTTTTGCAACAGGACTGCACAGCGCTGGAGAATTTTGAGTCGCTAATGGCACTCACCAATCTAGCCAGCATGAAC 
GAGAGCGTGCGCCAGAGGATTATCAAGGAGCAGGGGGTCTCCAAGATAGAGTATTATCTGATGGAGGACCATCTGTACCTGACTCGTGCAGCTGC 
CCAGTGTCTGTGTAATCTGGTTATGAGTGAGGATGTCATTAAAATGTTCGAGGGCi\ATAACGACCGAGTGAAGTTCTTGGCGCTTCTCTGCGAGG 
ATGAGGACGAGGAGACCGCAACAGCTTGTGCCGGAGCTCTGGCCATTATTACTTCAGTGTCTGTCAAGTGCTGTGAGAAGATCTTGGCCATCGCC 
AGCTGGCTGGACATTCTGCACACTCTGATCGCCAATCCCAGTCCGGCGGTCCAGCATCGAGGCATTGTGATTATTCTAAACATGATCAATGCTGG 
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AGAGGAGATCGCCAAAAAGCTGTTTGAAACGGACATTATGGAACTGCTGAGCGGTTTGGGGCAGCTGCCCGATGACACGCGTGCCAAGGCCCGAG 
AGGTGGCCACCCAGTGCCTGGCGGCGGCGGAGCGTTATAGGATCATCGAGCGGTCCGACAACGCGGAAATCCCCGATGTGTTTGCCGAAAATTCG 

AAAAT AT C T G AG AT T AT C GAT GAT T AA 
(SEQ ID NO: 260) 

Start ATG: 224 

MTNTINSEEVSDAGSYKDKGNEAFKASRWEEAVEHYGKAIKAGS 

ALEKFEEAYKDATALFKADPGNKTVQPMLQRLHVVVEERSARNAKTSTKVKQI^DLTFDLATPIDKRRAAANNLVVLAKEQTGAELLYKDHCIAK 
VASLTKVEKDQDIYVNMVHLVAALCENSVERTKGVLTELGVPWFMRVLDQKHENCVSTAQFCLQTILMALSGLKNKPDSKPEKELCTRNNREIDT 
LLTCLVYSITDRTISGAARDGVIELITRNVHYTALEWAERLVEIRGLCRLLDVCSELEDYKYESAMDITGSSSTIASVCLARIYENMYYDEAKAR 
FTDQIDEYIKDKLLAPDMESKVRVTVAITALLNGPLDVGNQVVAREGILQMILAMATTDDELQQRVSCECLIAASSKKDKAKALCEQGVDILKRL 
YHSKNDGIRVRALVGLCKLGSYGGQDAAIRPFGDGAALKLAEACRRFLIKPGKDKDIRRWAADGLAYLTLDAECKEKLIEDKASIHALMDLARGG 
NQSCLYGVVTTFVNLCNAYEKQEMLPEMIELAKFAKQHIPEEHELDDVDFINKRITVLANEGITTALCALAKTESHNSQELIARVLKAVCGLKEL 
RGKWQEGGVKALLRmLEGTEKGKRHATQALARIGITINPEVSFSGQRSLDVIRPLLNLLQQDCTALENFESLMALTNLASMNESVRQRIIKEQ 
GVSKIEYYLMEDHLYLTRAAAQCLCNLVMSEDVIKMFEGNNDRVKFLALLCEDEDEETATACAGALAIITSVSVKCCEKILAIASWLDILHTLIA 
NPSPAVQHRGIVIILKMINAGEEIAKKLFETDIMELLSGLGQLPDDTRAKAREVATQCLAAAERYRIIERSDNAEIPDVFAENSKISEIIDD* 

(SEQ ID NO: 261) 
Classification: chaperone 



Celera Sequence No. : 142000013383999 

CGTTGGAAAAGCAAAGAAGCAACAAAAAAAAGACCGCAATTTGCCTTTGCAATTGGAAAATTAACAAACAA 

GCCGCGTTCTCTTGCCTTTCGCGTTTATTATTTTTTAAAAATAAATACGTACTCGCGATGAGCACGAAAACAACCGGACCGCTACCGTCGACTAA 
TATTTTGTTATTAACTATTTCCGCTGCACTTTTGTTTTAAATACTTCGGCCAAACACACAAAATACAACACATAAAGCGCAAAACAACAAAAAAC 
ACACTGCCCACACGCACACAAAGAGAGCGCGAGAGCGCGCCAACCAAGTAGCGATAGATAAGAGAGAGAGAGAGAGAACTAGTTCCAGTGAAATC 
CAAGCATTTTCTAAATTAAATGTATTCTTATTATTATAGTTGTTATTTTTGATATATATAAACAACACTATTATGCCCACCATTTTTTTGAGATG 
CATCTACACAAGGAACAAACACTGGATGTCACTTTCAGTTCAAATTGTAACGCTAATCACTCCGAACAGGTCACAAAAAATTACCTTAAAAAGTC 
ATAATATTAAATTAGAATAAATATAGCTGTGAGGGAAATATATACAAATATATTGGAGCAAATAAATTGTACATACAAATATTTATTACTAATTT 
C T AT T GAG AC G AAAT GAAC C AC T C GGAACC AT T T G AGC G AACC GAAT C GC GC GGAACT AAC G AC AGT C GC T CC AAG GT C GT CG AAC AAAAG GT G A 
AT GT G T T G C GG AG AGC GGGT GGG AG AC AG C GAAAG AGC AAC T ACG AAAC GT GGT GTG GT GG AGGT G AAT T AT G AAG AGGGC G CGC GAT T T G AAAA 
GTTTGTATATAAAAAATATATCCCGGTGTTTTATGTAGCGATAAACGAGTTTTTGATGTAAGGTATGCAGGTGTGTAAGTCTTTTGGTTAGAAGA 
CAAATCCAAAGTCTACTTGTGGGGATGTTCGAAGGGGAAATACTTGTATTCTATAGGTCATATCTTGTTTTTATTGGCACAAATATAATTACATT 
AGCTTTTTGAGGGGGCAATAAACAGTAAACACGATGGTAATAATGGTAAAAAAAAAAACAAGCAGTTATTTCGGATATATGTCGGCTACTCCTTG 
C GT C GGG C CC G AAGT C TT AGAGC C AG AT AT G C GAG C AC C CG GAAGC T C AC GAT G AG AATGGCC AG ACC C AC GT AG T C C AGC GGC AG AT C G GC G GC 
GGAGAAGTTAAGCGTCTCCAGGATGACCTTGCCCGAACTGGGGCACGTGGTGTTCGACGATGTGCAGCTAATTTCGCCCGGCTCCACGTCCGCCC 

AT T GG T T AAT C AGC AG ACC CT C GT T GGC G T AAC GG AAC CAT GAG AGGT AC G AC AAC C ATT T GAG GT AT AC T GG C ACC G A GC CC G AGTT C AAGAAG 
AAGCCGCCAAAGAGCAGGAATGGTATGATAACCGGCGGACCCACAGACAGCGCCATCGAGGTCGAGGAGCTGGCGCAGGATATTAGATATCCGAA 
GGACGTTGACACATTGGCCACCAGAGTGACCAGCGCCAGGCAGTTGAAGAAGTGCAGCACTCCGGCCCGCAGTCCGATCATCGGATAGGCAATCG 
CCGTGAAGACCAGTGGCACTGTGAGAAAAAGCGGCAATTCGGCAATCGTTTTGCCCAGAAAGTATGTGTCACAGCGATAAAGTCGACTTCGGGCC 
TCCCTCATAAAAACTGGCAGCTCTGAGGTGAACACCTAAATCGAATCGATTCATTAGAAAGTTAGTAAATTATTGAAATGCAAATGTATTCTAAA 
CATGACTTACATTTATCGTGGCAAAGACGTTTTGAAAGGTCATGTTGGTCAGGAAGAGGAAGATGGCTCCGTTGATATTCATCACACCCACTTGC 

GT GAG TTGTTGGCCC AAAAAG AT G AGGC C AAT C AAG AT G GC AAC CAT C T G C AAAT T AAAAT GT T AC T C GC AT CT CAT T AAT AT T C GC GAGT T AAA 
TGAAATTTATTTATCTTCTGCAAAACTATAAACTATACATCTCATTGAAAAAAACTAAGAAGGGTGTGGAATCAGGCAATTCTATCTAAAATCTA 
GCGAATTTGTTTCCAAGAATTGTAAGCGTTATATCATTTGTTTCCACTGGAACCACTCACCGTTGTCTGAATAAGTCGCACTTTTACGAGGAGTG 
GTTCCTTGAGCACCGACAGCCAGGATCGCCACAGGACCGCCCGGAACTGCATGAACCAGGTGGCCTTGTAGGTGTACCCATTCTCCGGCTGCTCC 
AGTGGCTTCTCCAGATTTTTGGTGGCCAACAACTGCTCCATATCCCGGGCTACTTTGCTAATGGCAAAATTGTCGCATATCTTGGCGATCCGATC 
ACGGGACTCGATCTCCCGTCCGGGCACAACGGCCAACACCTGTACGTAAAAGTCCGCCGGATTGTAGTTGGTAGGACACTGGGCACCCACGCTGG 
ATAGGAGTTGAGATGTTATGTAATACTAGATACCCTTAATAAACACATCGAACTCACTAGGAAAAGAAGTCGACGGCTTCGCTGGGAGTGCCCAA 
GAAAGCTACCCTGCCCTCGGCCATCAGAAGGATCTTGTCAAAGAGCTCAAACAGCTCGGAAGACGGCTGATGAATGGTCAGGATGACGGTCTTGC 
CCTTCTGCGACAGCTTCTTCAGCACCTGGACGACGCTGTGGGCGGTAAAGGAGTCCAGTCCGGAGGTGGGCTCATCGCAGATCAGAAGCGGCGGA 
TCGGTTAGAGCCTCGGAGGCGAATGCCAGACGCTTCCTTTCTCCGCCGGACAGACCTTTCACCCTGCCGGGCACACCGATGATCGTGTGCTGACA 
TTTGCTGAGCGAAAGCTCCTGGATCACCTGATCCACGCGGGCCACTCGCTGCCGATAGGTCAGATGTCGTGGCATCCGCACCATGGCTTGGAAAA 
TCAGGTGTTCCCTGGCCGTTAGGGAGCCGATAAAGAGGTCATCCTGCTGGACATAGGCGCACCTGGCCTGCATCTCCTTGGCGTCCACAGGTTGG 
CCATTGAGCAGTCGCATCCCGGATGGCGATACTTGGATGCCCTGCGGCGATCGAAAGGCAAGGGCATTCAGCAGGGTCGTCTTTCCGGCACCGGA 
ACTGCCCATCACGGCCAAAAGTTCGCCCGGATAGGCCACGCCGCAAACTGAGTTTCAAATTGGTAATTGGACCCTTTATTAAGATTTCACACAGA 
TCAGCCGACTGCGAATAGAAACTCACCGTTCTTGAGCAAATGTTTCCTGGGCGCCGGTATGTGTCGCTCGTTGCAGAATAGTCCGCGTGTCCGGT 
TGACCAGCTGCCGCCATCCGGAGCCCGGCTGATTGACCGCCCCAAAGATGTCCATATTGTGCCAGGCATAGGTGAGGTTCTCGGCTAGTTGGCCG 
CTCCCTGAACCGGAGTCCTCCGGCGGACTGGGTGGCCGGAGCGTGCCGTAGTTTTTGGCCTGCCCGAAGCCCTGGTTAATGCAGCTCTGCGAAGC 
CGCTCCGCTGTCACCCTGCAATGATAGGGGATCTCAAATATCAACTACAAGCGTTATGCTCATCTAACCCCGAACAAAAAGTACCCCGAAGTATC 
CTACGAAGTAGGTTTATACTTTTATTTATTTTTTGTGCATCTAGAAAAGGTTTCAAGCTGAAAACTTTTCAATTTCACGGCGTTCGTCCATAAAA 
ATGAAGGCGTCTGTGCATAAATCTAGCCCAGCCTGGCCAATCCCTGCTCCGCCGTCCCTGTTTCCGGGCACTAACCCCCCCCCCCCCCCTCGGCT 
TTATAAAGAAGGAGCCCATCTGTTTAATAAAAATGCCCAAGATCCGAGTTTTCCAAGTTTCCCAAAAAGGAGATGGCGAGGCCTATTTAAAATGT 
GCAAATTAGGGCTGGCACTTGGTGGATGGCCTTGAAAGTGGGCGGTGGACGGTTGACGGGGTCACAGGTCGTTGGGCGATGGACGATGGGGCGAT 
GGGACGATGGGTGGTGGCAGTGGCAAGGTTCCTCCCACCATGTTGCCACCTCATAAGACAATTGCGAGTGTAAATCGCTGGCATCTTTGGCTTTC 
GTTTGGGGACTTTGAGCTTCGCCAGTTTTGTTTTTATTAAATATAAAACTTTAAGGATACTTTGGCTTTTTTTGGGTGGTGGGAAAACTCAATCT 
CTTGTAGATAAGTGCAATGAGCAGTATAAAGCAGTCCATTTACTTGATCCAATTATGTTGAACGCAATAGTGTTCAAACTACTACTTCTGCAATT 
GAGTTTAAGTTTATAAATCCGAAATACAAATTCTTATTTACGTTATTTTGTTTACGGGTAAAGTGATTTCCAAAACCACTGAGCTGTCAGTTTTC 
GCCCAGGTGGGGCTTCCCTTTTCGAAATGGCGGTAACCGAAGACGTTTGCGGTATGGTCAAATCAAGCCATCCTCCCCGCCCCAACTTTTCGCCC 
CATACCACTTCCGTTCGCGCATATTTTCATTTTCATAAACAATTTTCCGTTTTCCATTTGTGGGGTTTCTTGCCATCGCAGCGGCCTGCGGCAAA 
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GCATAAATAACAGGGATCCACTAATTGGCGTCGAGGCAAAAAAAAAAAAAAATGGAAAAAACAAGACAAAATGATGGC^ 

TTTATCCGCCGCTGCTCTCATAATAATTTGTATAAATAATCATAAAAAAAATTAAGCAAAAAAATGCAAGCAAATGAGAAAGGCAAAATCCGAAG 
AAGAGGGCCAAATAAAAAGCTGACATTCAATAAAAGCAAGAAAAAATAATGTCCACCTTGCACTCAGCGCCTGGCGGTCCTTGCTACAGTGAGCC 
CTGCTAAGAACTTTCATTTTTTGAACTATTTAAAATTAAATTTATTTTTCAAAACGATTGGGGGATTTGGATTTATCGTTTACATTGCTACATAT 
AAAATCTAGCTAAAAGACCAAAGCCTGAAAACTGTTGCATTTCCTCCTTAGCGAGGCTTCACTGCTCATCTTCTGCTGCGATGCTTTGACAGCTT 
ATTCTTCTAGCCACTCAGTGTCCAGTTTTTGGCCTGTTACCAAAAAAAAAGCACAATTGGTGACGTTACGAGAGGTAGACGCGAAAAAATGTAAT 
ATCCGTTTTCAGGCACTCCTCGAGCAGAGCGAGATGGCCATATGGCTGGGAAAGAGATAGAGGGACATGGCGAAATTGTTTTCACAACAAATGAG 
GCCGAAAAATGTTATTAAAAAGTTTTCTTTTCTTTTCTGTTTTCGCCGTCCCTTTCCCACTGGGTTTTCTTATCTCCCTGCCTCCTCTGGTTTAC 
TCTCCCACCTTTTCCCTATTTTCGTATTTTGTTGAAATAAGTAGCGAAAGATGCTACATTTCATATTTATTTAATTTTTTGGCTTACTTTTTTTT 
TTATTTACTTTGCGTGCTATTTTTGAATTACTTACGAAATACGAAGGAAACTCGAGGAGTGGGCGCTAGATGGCGCTGTGCAGGGAGGCGAAGGA 
GTATCGTGATGAACTTGACAAATTGAAATGACTGAATAAAGAAGCGACTCAATTTGTTGTTCCTCCCGTTTTGTCGATTGTGCGAGTGGCTAATG 
AGTGCACTCAAAAGTCATTCACTTTGATGAGAACGAAATACTGAAGGAGGCGCAATTATTCAGCGGAAAAGCTGACGACCACTCGATATTATTAA 
GGAAATATCAGAGCGTAAACGGCCCAATATGTACATATATGCAACGAACATATAATTAAAGATGGCTTCAAATATTCCATACATATCATATTTTT 
CAATACTTTGTTACCTTTACTGCGTTACTTACTTCCCAAAAGAATTAAAGTTCTTATCTCCTTGTATAACTTTTTGAGACTTTTCAAAATTGTTT 
CAAAGAGCCCGGGGCTAGATATGCAAATCGTACTCACTTATTTATTTTGTTATACAAGCCGAGGTGCTAAGAATTTAAAGCAAGATACGCACGAA 
GATACCTAGATATCAGATGCTGAGCCCAAAGCATTCGTCTAGCCAAATAATGACCCAGATAAGACTAGATTGTACATTTTTCGTCGAGCGAGACG 

T GC AAAC AAC GAG GT AT T AAGAT C AG GT AG AAG AT G C G AGT AC AC AAC C AG AAAT GG AT AC GG AT AC AGT AT C T AGT AT C T T GGC AAGC CT AGAG 
TTATAGACACCCAGCAATGTTGTTGGCCAACTTAAGCCGACTCCTCCCACGCCACCACTGGTTATGCGTCCCTAATTACGATCTACTACATCCAA 
CTATTTGGGGTCGATGTATAATGTAGTGTAGTGTAGTGTAGTGTAGCAAGAAAATATTTGAGAATGAAAGGTGGAAGAAATGAGGACTTCATGGT 

AAGC T T AAAAT AT CT GGT T GT T AT AT T TT T T GT AAAAAAG AAT AT AGT C G AAAAT GAAT GC C T T T AGAT GT C T T GAT CAT GAT AT GAT CT C AAAA 
ATTGTCTTATATAGCGAGAACAGCTACCAGAATAATCTGTTTCGTGTCACTATTTGTTTGTGCAATTGCGGTTTGGGATTTTTGTGGGTCGCAGT 
TCTCACGCCGCAGACAATTTGATGTTGCAATCGCAGTTCCTATAGATCAAGTGAACTTAAGATGTATGCACATGTACTACTCACATTGTTCAGAT 
GCTCGGCAGATGGGTGTTTGCTGCCTCCGCGAATTAATAGCTCCTGATCCTCTTGGCCCATTGCCGGGATTTTTCACACTTTCCCCTGCTTACCC 

ACC C AAAAC C AAT C AC C AC CC C AAT C AC T CAAAAAAC AAAC AAAAAT AAG AAGC GAG AGG AGT T T T GG C AC AGC AC T T T GT G T TT AAT T GAT GGC 
GTAAACCGCTTGGAGCTTCGTCACGAAACCGCTGACAAAATGCAACTGAAGGCGGACATTGACGCTACGTAACGCTACAAACGGTGGCGAAAGAG 
ATAGCGGACGCAGCGGCGAAAGAGACGGCGATATTTCTGTGGACAGAGAAGGAGGCAAACAGCGCTGACTTTGAGTGGAATGTCATTTTGAGTGA 
GAGGT AAT CGAAAGAACCTGGTACATCAAATACCCTTGGAT CGAAGTAAATTTAAAACTGATC AGAT AAGTTCAATGAT AT CCAGTGC AGT AAAA 
AAAAAAAATGTTTTTTTTATCTACTTTCCGCAAAAATGGGTTTTATTAACTTACATACATACTAGTGACAACATGTTTATGTTTAGCTAATGTGT 
TCCGTTCAAAATAATTAAAAATTCGACCACCTTTGCACCAATATGATAGTACCCCACCTGGGCAGTACTCATAAGCTGAGTGGCTGTCTTTGAAT 
ACTGATTAGGTTGAGTGGGAAGCGTCTTCTTATCGTGAATAAGAGAGTTCTTGTTAATCCGCCTGTTGTCGTCCCGTTTCGGGGGTTTGCACTGT 
ACTTTGCATGCATGATTTATCAAACGGCGACTGTTGATGATGCGCATTCCGCACTCCCCGTTTCCCGTACGCCTATCCGCACCCGTACTTCGGAC 
ACTCCGGTCGTTGTCGTCATCGTTAGCACCTCATCTCAGGCCCCGGAATCATGAACTGCCGCCATCAGCTGTCCGGGCATTATCAATCTTCATCA 
TCGGGGCCAGTTGGTCCAGGATATGGATCTCGACTGGGACGGGTTTTCCGGGAGGCTATCAATACCATTTTCACGGACGATGTGGTAACAGGTTA 
GCAGTTGCAGTTGCCGAGGAATCTGCAGGAATTAATTTGCTGACGACGATTAAGTGGTCTTCAAATGAATATGGTTTTCTTTTGAGAGTTTAGCT 
CAGAATAATAGCTTTTGATTCGATTTGCAAGTTAATTATGCATATGCATGTGATATTTGTAGAAGTCTCACTCAGCACT 

(SEQ ID NO: 262) 

Exon: 6869.. 6640 

Exon: 3530. .3257 

Exon: 3182.. 2528 

Exon: 2466.. 2151 

Exon: 1947.. 1816 

Exon: 1745.. 1001 

Start ATG: 6711 (Reverse strand: CAT) 



Transcript No. : CT9359 

GTTTCGTGACGAAGCTCCAAGCGGTTTACGCCATCAATTAAACACAAAGTGCTGTGCCAAAACTCCTCTCGCTTCTTATTTTTGTTTGTTTTTTG 
AGT GAT TGGGGTGGTGATTGGTTTTGGGTGGGT AAGC AGGGGAAAGTGTGAAAAATCCCGGCAATGGGCCAAGAGGATCAGGAGCT ATT AAT TCG 
CGGAGGCAGCAAACACCCATCTGCCGAGCATCTGAACAATGGTGACAGCGGAGCGGCTTCGCAGAGCTGCATTAACCAGGGCTTCGGGCAGGCCA 
AAAACTACGGCACGCTCCGGCCACCCAGTCCGCCGGAGGACTCCGGTTCAGGGAGCGGCCAACTAGCCGAGAACCTCACCTATGCCTGGCACAAT 
ATGGACATCTTTGGGGCGGTCAATCAGCCGGGCTCCGGATGGCGGCAGCTGGTCAACCGGACACGCGGACTATTCTGCAACGAGCGACACATACC 
GGCGCCCAGGAAACATTTGCTCAAGAACGTTTGCGGCGTGGCCTATCCGGGCGAACTTTTGGCCGTGATGGGCAGTTCCGGTGCCGGAAAGACGA 
CCCTGCTGAATGCCCTTGCCTTTCGATCGCCGCAGGGCATCCAAGTATCGCCATCCGGGATGCGACTGCTCAATGGCCAACCTGTGGACGCCAAG 
GAGATGCAGGCCAGGTGCGCCTATGTCCAGCAGGATGACCTCTTTATCGGCTCCCTAACGGCCAGGGAACACCTGATTTTCCAAGCCATGGTGCG 
GATGCCACGACATCTGACCTATCGGCAGCGAGTGGCCCGCGTGGATCAGGTGATCCAGGAGCTTTCGCTCAGCAAATGTCAGCACACGATCATCG 
GTGTGCCCGGCAGGGTGAAAGGTCTGTCCGGCGGAGAAAGGAAGCGTCTGGCATTCGCCTCCGAGGCTCTAACCGATCCGCCGCTTCTGATCTGC 
GATGAGCCCACCTCCGGACTGGACTCCTTTACCGCCCACAGCGTCGTCCAGGTGCTGAAGAAGCTGTCGCAGAAGGGCAAGACCGTCATCCTGAC 
CATTCATCAGCCGTCTTCCGAGCTGTTTGAGCTCTTTGACAAGATCCTTCTGATGGCCGAGGGCAGGGTAGCTTTCTTGGGCACTCCCAGCGAAG 
CCGTCGACTTCTTTTCCTACGTGGGTGCCCAGTGTCCTACCAACTACAATCCGGCGGACTTTTACGTACAGGTGTTGGCCGTTGTGCCCGGACGG 
GAGATCGAGTCCCGTGATCGGATCGCCAAGATATGCGACAATTTTGCCATTAGCAAAGTAGCCCGGGATATGGAGCAGTTGTTGGCCACCAAAAA 
TCTGGAGAAGCCACTGGAGCAGCCGGAGAATGGGTACACCTACAAGGCCACCTGGTTCATGCAGTTCCGGGCGGTCCTGTGGCGATCCTGGCTGT 
CGGTGCTCAAGGAACCACTCCTCGTAAAAGTGCGACTTATTCAGACAACGATGGTTGCCATCTTGATTGGCCTCATCTTTTTGGGCCAACAACTC 
ACGCAAGTGGGTGTGATGAATATCAACGGAGCCATCTTCCTCTTCCTGACCAACATGACCTTTCAAAACGTCTTTGCCACGATAAATGTGTTCAC 
CTCAGAGCTGCCAGTTTTTATGAGGGAGGCCCGAAGTCGACTTTATCGCTGTGACACATACTTTCTGGGCAAAACGATTGCCGAATTGCCGCTTT 
TTCTCACAGTGCCACTGGTCTTCACGGCGATTGCCTATCCGATGATCGGACTGCGGGCCGGAGTGCTGCACTTCTTCAACTGCCTGGCGCTGGTC 
ACTCTGGTGGCCAATGTGTCAACGTCCTTCGGATATCTAATATCCTGCGCCAGCTCCTCGACCTCGATGGCGCTGTCTGTGGGTCCGCCGGTTAT 
CATACCATTCCTGCTCTTTGGCGGCTTCTTCTTGAACTCGGGCTCGGTGCCAGTATACCTCAAATGGTTGTCGTACCTCTCATGGTTCCGTTACG 
CCAACGAGGGTCTGCTGATTAACCAATGGGCGGACGTGGAGCCGGGCGAAATTAGCTGCACATCGTCGAACACCACGTGCCCCAGTTCGGGCAAG 
GTCATCCTGGAGACGCTTAACTTCTCCGCCGCCGATCTGCCGCTGGACTACGTGGGTCTGGCCATTCTCATCGTGAGCTTCCGGGTGCTCGCATA 
TCTGGCTCTAAGACTTCGGGCCCGACGCAAGGAGTAGCCGACATATATCCGAAATAACTGCTTGTTTTTTTTTTTACCATTATTACCATCGTGTT 
TACTGTTTATTGCCCCCTCAAAAAGCTAATGTAATTATATTTGTGCCAATAAAAACAAGATATGACCTATAG 
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(SEQ ID NO: 263) 

Start ATG: 159 (Reverse strand: CAT) 
MGQEDQELLIRGGSKHPSAEHLNNGDSGAASQSCI^ 

RGLFCNERHIPAPRKHLLKNVCGVAYPGELLAVMGSSGAGKTTLLWALAFRSPQGIQVSPSGMRLLNGQPVDAKEMQARCAYVQQDDLFIGSLTA 
REHLIFQAMVRMPRHLTYRQRVARVDQVIQELSLSKCQHTIIGVPGRVKGLSGGERKRLAFASEALTDPPLLICDEPTSGLDSFTAHSVVQVLKK 
LSOKGKTVILTIHQPSSELFELFDKILLMAEGRVAFLGTPSEAVDFFSYVGAQCPTNYNPADFYVQVLAVVPGREIESRDRIAKICDNFAISKVA 

RDMEQLIATKNLE^ 

ONVFATINVFTSELPVFMREARSRLYRCDTYFLGKTIAELPLFLTVPLVFTAIAYPMIGLRAGVLHFFNCLALVTLVANVSTSFGYLISCASSST 
SMALSVGPPVIIPFLLFGGFFLNSGSVPVYLKWLSYLSWFRYANEGLLINQWADVEPGEISCTSSNTTCPSSGKVILETLNFSAADLPLDYVGLA 

ILIVS FRVL A YL ALRLRARRKE * 
(SEQ ID NO: 264) 

Name: white 

Classification: transporter 

Gene Symbol: w 

FlyBase ID: FBgn0003996 

Celera Sequence No. : 142000013384832 

CGTTGCCAAGCCCCAAAAAGGGGGCTCACCAGAACGAGCACAGTAAAAGAGCAAGCTGAATATATTTTCTAAGTAAAATTTATTTATTATGTATA 

AATAATTCTCTTTTAGCTCTCGCACAAGCGCTATTAACGTTAGTAATTTACAGTTCGCTCTGCAGTCCTCGTTGCCGAACCAGGGACTCACAGTC 

GGGTATCCAGGTCACCTTTGGTGGGATTTAGGAGATCCCCCCTGCTGTAACCACTGGGTGTCCCTGGTCGGCTGCGAAGTGGAAATCAAACGAAA 

CATAACGGCACAAATAGTTGGCTTAAACAATTACAATAACAATTGTGGAATATATAAATTTGCATACAGGCTAAAAGTGAAGTGGTCCCGTCTCC 

ACAAGCATAGCTCCCACTTCACTTACGTTCGTATTCGTAATTTAGTTAGAAATTAGATTTCTTGTACTTGTTGAACATCGTTTGCTTTAGTGTTG 

TTTTGCAGTTCCCATAGCTATCTATCCAATATATAAATACGAAAATCAATTGTAAAGAAGAGGGATATTCGGGCCGGTGCCACCGCTGCGCCATG 

GTGGTTATTTGTTGACTTGGGCGCAATGGAGCCCCAACGGGTACAGAAGGAATGATTGGGATAGATGCGACTCGCTTCTGTGATTTTTGACTTTG 

GCTACCCTTAAACCCATAAGAGAGATGCAACGCAGAGGTTACAATTTTTACAATAGAACGTTTGTTAATTTGTTGCTTAGGCGCTGCTTTATCCA 

CTACTATATCTACCAGTTATTTATCATCCTCAGTTATACATTTATACAATCCTATTGCACACATACGATTCTCATATGTCTTGCTTAGCACCCCC 

TCAAGATACATATACATAATCATATACATACATAGATATAGATATTTTGATACAGTTATAGTTAAAGTTAAAAAAATACCTCACACTTGATAATA 

ATGAGTGTAAAGACCCGCCCGCGACTATGGTATGAAGTTAACCCCGACCAGCTCGAAAGAAAGTGATAGTGTAGGAGGCATAGGAAATAGAGGGT 

CGAGAAGGCCTTGAGTTGGCGGTCCTAGACGGGCCCAATATCACTTTGGTATCTGGCGGTCAGGTAGTTGGCCGCCTCTGTCATTTGTTCCAGGC 

ACTGGTACAGACCGTTAAGATGCCTCCGGCAGCGCTTGGCCAGCTCCGTGGCCTTGTCCCCCGAGCCCGGCTTCGATTTGGCGACGTCTACAATG 

TCGCTGCCGTCGCTGCTTTGGGCGTTGTCGCTCTTGTTGCCATCGGACTCTAGCCCCGAGTCGTTTTCCTGCGATATGTCCGAGTCAGAGTCACT 

CATGTTGCTCACCGAGTTGGTGGACATCATGGATGCGGTAGAGGTACGGCGCACATGACCCTTGACACTTCCGGATGCGGTTGTGCTTGGACTGC 

CGCTGTTGAATTCAGTCTTGCTCTCGCTGGGATTGCAGCTCGAAGTGGAACTGGTACTGGCGTTGAGCTGCTGTTCGGAGTGCTGCTGGTCGTCG 

TCCTCCTGGTTGGCGGTCCAAAGCAGATCTTTTTTCAGGGAGTTCAGCATGTTGTACAGATTGAACAGCTCGGAGGCGCTGAGAAAGTCTTGGAC 

ATTTTTTGAATGCGCCGCTCCCCGCTTGCCGTGGGCACTGGAGGTCTGTGTCAGCTGGTTGGCGGTCGAGTCCTTGCCGGTTGCTGGGATGATGT 

CAGTGCTGTCGCCGATCTGGGGATAAATTATGTAAAGTGGGAGCCATAAGTTGTTGGTTCCGGATCCGGATGGCTGGTCAAATGGGGGGCTGGGG 

TTCTTGGGTTATGCGGTACCAACCTGCCGATCCATGAGCCTGCATGGCACCAGTATGGTGTCGTCCATAATGTTGACAGTCTTTACAAACTTTTC 

CATCACATTGACAATGCTGTCTTTGGAGAACTGCTGCTCATCGTGACGGGCGATGCGGCGCAGGCAGTTTCTGAAAAACAGAACAATCGAAAGAT 

GAGCTTGGGCCAGAGCACAATAAACCAGTAGAGCGTGTGCAGGGGAACCAGTTTGGATTGGACAATAGTCTGCGTGATTAGTGACTATGCCACAT 

CACGCCAGACTGGACCAAGCGCTGGCCGTTCTAAAGCGTCGTTAAGTCTGCGGCGATCAGACTTCCGCCACTCGAAGCCATCTTTCTTTGATGCT 

TTCGACTGGAGATGGGTGCGCCTCCTGCTGACCAACGTGGCATATGATATATATGAATTGGTTTCTATGCAAATCAGCGACTCGCTGGCGTTGTC 

GCAAATTCAATTTGTATAAAATAGAAAACCAGTTGGTGCTTTATACCTGACATTGCTCGTGTGGCTCTTAAAACAAAGCACAGACGGACGGACAG 

AAAGATTTCTATTTTGGTTTCCAAAGCCAATTTAACGCGTCCACGTGCTGTCGGTGGCGGGGTGCGAGGGTAAAGCCCGACTTTGCTCGCAAGCT 

TCAATGTAATGGTTATACAATCCTGGCTGGTCGCATTAACTTTAATTTCACGAAAAGATCGCGTAACATCTCCAGGTCAAGCCCCATAAACTTGC 

CTGAGCTCCCTCCATTAACCCGCCGGGAATATATGTTAGTTGCTGATATCTGGAGCTGGAGCGGCCAAAGTCTAGAGGCGGCTTGGCTTATCGTC 

GCTAGACAGTTCCCAGACAGAGCTGAAGTTGCGTATCTCCCGCACCTCCTATTTAATGCGGCCAGATTGCTCGGGGCGGATCGGCATGTGGGCAG 

CCAGTTCGGGGCCGCGTGATCGCAGCTAGTTGACAGAGCGAGCGCCGAGCGCACTGATAAATTTAGTTTCAGTGACTGCGCCATTTAAAGACCTC 

TACCCTACAGTTCTATCCCTGGCCAGCGCCCGCCCCGATGCCATGGCAATCCAACTTAGAAAATCTAGATATCGATGACTGAAGTGTAACGCGCG 

GCAGGCCGGGGTGGGGTGGCCACGCGAGTACTGCGTGCTCCGAACTCTGCTGACCGACATCGTCTCACTGTGGTCTAGAACCACAATAAGATCTA 

GATCTAGAGAAGGCTGGCGCGGGTGTCAATGTTCTTTCCACTGGCACGGCCAGATGTCGTGGACGTCGACGGAGGATTCCCAGCCACTGAACAGA 

GTGACTAATTAAGCTAATTGCAGCGGGTTCCAATCATGGAAATCCGCGTAGGAATGCGAAAACAATTACCATCTTATCAACGCTTTTCGCTTAGA 

CACTCGACCATAATTGACTGCCATCTCGTGGAAACCTGAGTAATCGCCTCTATTTATTTTAAATTATAGATATTTTCTAGCCCTGACCACGACCC 

TTGCTTCGCTTAGTCACTTAAGCTGGCCACACGATAGTTCAACTTATCACTGCATTTTGAATCCTTTCCTTGCGAATCCTCTGAACAGATCCCGA 

AAGCCCAAAAACGCCGGCTTGGGCAACACGCCAGATGCTACCAGCCAATTGACCAGGCAATAGGTACACCTGAAGTGTTATGGAAATTTGGCTAA 

GCCCCACAAGCGTCACGCGACATGCTCGCCAAGAAGGGCGGGGTAAGCAGCGGGAGAGGGAGGGTACCGCATACAACGCCGAAATCAGGCGACTT 

TTTCGCTCAGCGGTAGCTAAGGGCGGTGAAGGAGTGCTGAAGGGGTGGTGGTTTGGTGGGTGGCTGATGACGAGCTGGCTTTATGTGGCAAGCGA 

ATCAAACGCTTCTCTTCGCACTCTCTCTTCGGTCCGTTTCGCCGTTTTCTCTTTGCCGCCGAGTGGCGTGATGTTGCCCAGAAACTGTACAGCGG 

CCAGAGTCGCCTCCATCAGCTGAGCAAGATGAATGAATCTCTAGCGATGCTCTGCCTTCCCCTCACCTTCCTACCATCGCTCATGCAACCATGTT 

TAGTGGTATTTCTCGTATTTGCAATGTTGAGCAATGGCATATACATAGCCATACGAACAGAGTTGACCAATATTTGGTTTACTTTGGCTTTGGCT 

GCGCCTCGACCTCCAGCATAATCGATTTTTGATACGCCCGCAAATTAGATATTCTACCTACTCGGTGGCCGGGTTGGCCGGATTGGCTCGGCCAT 

CGGGCATAGAGCATTGACGACCGTAAGTTGGCGCTCAAGGCGACACCTGCCTCAAAAAGTGACACACTGCCAGCGCAACGAGGTGAATCAGAGAA 

TTTTCTGCGATTTAAGCGCCCCAAGGAGGCGGGCAGCACCAGAAGCGATGAGTAATGGCTATGGCCGATAGCTGGGCTGGCGGACTGGTGTCAGT 

GGACCGGAGGTGCGCACAGATAAGGCATCTTATCGCTTATCCCAAAGCTGTGATTTGCCTCCGCTGGCTCACTCGAAACATTTGGGCAAACAAAC 

GGCAAAAAAAACACCAACATACTTCTGTATAGGTGTACAATATATAGCCGATTGTGTCATATTAGCGCTGGCTTGATTTTTTTATTTTATTTTAT 

TTTTTTTGTATTTGTTGTTGTATTTTTCTTTCGAGTTTGAATGGCCGTCCGTCCCCCTGTCCATCGAAAATCGTACGATAAATTTCTTGTATCTC 

GTTTTTCGTTCGGCGGTGGTGGTGCAAACGAAAGAGAAAACGCTCGCGGGGTGCGGTTTGGATTCAATTTGAGAGGCGAGTAAATGGACGGAAGA 

GAGCGATTCGGATGGGTTAGAGGAGAGGCAGTGCGGGGCTTGGGTGATGGATCGGGAACGGGGGGGGGGGATCAATTTAGCATCACTTGGGCAAA 
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TTGAACTTAAACGCCTTAACTAGCGAATTCGGATTGCGACTTACCGACTTGTCTCCAGCTTGGTGAGGTCCGTGTATCCACTCATGATGACTGTG 
TATTCCTGATGACTATGTACTGCTGCTGCAGGAAATATGTGGTTCGAGAAATTTTCTTTGGCCGCACCAACACTCGCGCACTTTCACTCGCGTTT 
GGAATCGGAGTATTCCTGACAGGGATTACGAAATCGGCACAGGTGACTGGGCTGGCTTCAGCTCGGTTGCAGTGCAGGTCTGGCCTGAATTGGTT 
TGGATTCGGAATAGGAGTAGCTGCTGCCCCGGCGGAACTGCTGTGTCTAGTGCGCTGCCTCCAATAAGTCGCGGCTGTTTTATGCCAAAAATTTA 
CATGTACTCGATTAGACCCCGACTTTGGTTTGGCTTTGGCCGTCGATCTAGTCCGAACCAATTAAATATGGGAATTCTAAGTAGTCTGCACACGC 
ACCGAGTTTAATTTAATTTCAGTTTCTGTTTCGACACGAGCAGTCAGCGAGAAGAATTTTGAAAAAATTACAGAGAGGGAAGCATTGCCAGTTGC 
TTTTATAGTACGGGCCGAGTCAGGGCTGCGGAGGCTCCGATAGAACTATCGATTACTCAGCTCGGGGCAATCGATGTAAACATCAGCTGTCATGT 
TCAATGTGGCAGCACCGGCCAACGCCGTTAATTAATTAGCTAATTCGAAAATGCTTAAATTTAAAAATAAATTAGGATAAGTGGCAAAATATTCA 
ACATTGTTTTATTTTTTTTTAGTTTTGCACCAAATTATCAATATAGATTGTTGTAACATAACTTGGGCAAAAAATAGAGCGTGCCATTTAATTTT 
CCTCTTATTACTACTAAAGTGCATTCAACAATTGCTGGAGAGTACATATGTAAATTGCACTGAGATTAGAGAACTTCCATTATTGCGAATCAATG 
CTCTTCCCAGAACCACCTTTGCAACCACCCGGAAGCGCACGTCTGTTAGGACCGCTCGTGTCGCAGGACACTATCCACTTACTCCACAATCCCTT 
CTGACCACGCTGCGTCCTTTACTGTGTCCCAAGTCGACACCGTTTCCGATGGGCCTCTGTACGCATGGCCGGGTCGAGTCCAAAGTCCAGCAGTC 
CAGGTACCGTTAGTTCGAGCATTTCTTAGCTTGCAAATTGAAGTGGTCTTCAGTCCCTTAATAACATTTCGTTTTCGCCTAGAGCTGCACTTTAG 
ATTTACATTTCTTTCCTTTTTGTGACCTAAAAAACAAAGAGGCGTTAAAGACAGTTGGAAGTACATTTTCTATAAAAGCTTACTGAATTCCCGAA 
GTACGAAAAAACTCCATTTATTCCGGCTGATCTCTGCGCCCCTCAGGTCTCTTGGAATCCTTTGCACGCACAGGTACATTTCCCAAACTTGATCC 
CGGGTCAATGCGTCGATCATGTTCAGCTCGGGGTCCCAAAAAAAGGTAATAACTCCCCGCCTTAAGTTCTCCGCCAAGTCGTCGAACATCTGTTA 
AATTTAAATTAAAATTATTTTTGCATTCGTCGACAT 
(SEQ ID NO: 265) 

Exon: 5306. .4795 
Exon: 1970.. 1829 
Exon: 1726., 1001 

Start ATG: 4835 (Reverse strand; CAT) 
Transcript No. : CT9411 

TTCCCTCTCTGTAATTTTTTCAAAATTCTTCTCGCTGACTGCTCGTGTCGAAACAGAAACTGAAATTAAATTAAACTCGGTGCGTGTGCAGACTA 

CTTAGAATTCCCATATTTAATTGGTTCGGACTAGATCGACGGCCAAAGCCAAACCAAAGTCGGGGTCTAATCGAGTACATGTAAATTTTTGGCAT 

AAAACAGCCGCGACTTATTGGAGGCAGCGCACTAGACACAGCAGTTCCGCCGGGGCAGCAGCTACTCCTATTCCGAATCCAAACCAATTCAGGCC 

AGACC T GC AC T GC AAC C GAG CTGAAGC C AG C CC AGT C AC CT G T GC CG ATT T C GT AAT CC C T GT C AGG AAT AC T C C GAT T C C AAAC G CG AGT G AAA 

GTGCGCGAGTGTTGGTGCGGCCAAAGAAAATTTCTCGAACCACATATTTCCTGCAGCAGCAGTACATAGTCATCAGGAATACACAGTCATCATGA 

GTGGATACACGGACCTCACCAAGCT GG AGACAAGTCGAAACTGCCTGCGCCGCATCGCCCGTCAC GAT GAGCAGCAGTTCTCCAAAG AC AGC ATT 

GT C AAT G T GAT GGAAAAGT T T G T AAAG ACT GT C AAC AT T AT GGAC GAC AC CAT ACT G GT G CC ATGC AGGCT C AT GG AT CGG C AG AT C GGC G AC AG 

CACTGACATCATCCCAGCAACCGGCAAGGACTCGACCGCCAACCAGCTGACACAGACCTCCAGTGCCCACGGCAAGCGGGGAGCGGCGCATTCAA 

AAAATGTCCAAGACTTTCTCAGCGCCTCCGAGCTGTTCAATCTGTACAACATGCTGAACTCCCTGAAAAAAGATCTGCTTTGGACCGCCAACCAG 

GAGGACGACGACCAGCAGCACTCCGAACAGCAGCTCAACGCCAGTACCAGTTCCACTTCGAGCTGCAATCCCAGCGAGAGCAAGACTGAATTCAA 

CAGCGGCAGTCCAAGCACAACCGCATCCGGAAGTGTCAAGGGTCATGTGCGCCGTACCTCTACCGCATCCATGATGTCCACCAACTCGGTGAGCA 

ACATGAGTGACTCTGACTCGGACATATCGCAGGAAAACGACTCGGGGCTAGAGTCCGATGGCAACAAGAGCGACAACGCCCAAAGCAGCGACGGC 

AGCGACATTGTAGACGTCGCCAAATCGAAGCCGGGCTCGGGGGACAAGGCCACGGAGCTGGCCAAGCGCTGCCGGAGGCATCTTAACGGTCTGTA 

CCAGTGCCTGGAACAAATGACAGAGGCGGCCAACTACCTGACCGCCAGATACCAAAGTGATATTGGGCCCGTCTAGGACCGCCAACTCAAGGCCT 

TCTCGACCCTCTATTTCCTATGCCTCCTACACTATCACTTTCTTTCGAGC 

(SEQ ID NO: 266) 

Start ATG: 472 (Reverse strand: CAT) 

MSGYTDLTKLETSRNCLRRIARHDEQQFSKDSIVNVMEKFVKTV1IIMDDTILVPCRLMDRQIGDSTDIIPATGKDSTANQLTQTSSAHGKRGAAH 
SKNVQDFLSASELFNLYNMLNSLKKDLLWTANQEDDDQQHSEQQLNASTSSTSSCNPSESKTEFNSGSPSTTASGSVKGHVRRTSTASMMSTNSV 
SNMSDSDSDISQENDSGLESDGNKSDNAQSSDGSDIVDVAKSKPGSGDKATELAKRCRRHLNGLYQCLEQMTEAANYLTARYQSDIGPV* 
(SEQ ID NO: 267) 



Celera Sequence No. : 142000013384832 

CTGGTCTGGCCCCGCGCTGCTGGTGGTGGCGAGTGGTGCGACGGTGGTGGTGGGTGACGGCAAAGTGGGTGTGGGATGTGCGTGCCCAGACAAAG 
TAAGAGCGCTTAGAGTCTATTATCTATCCGGCAAGTGCTGACGCGTTGCGATCGCCCGTCGTCGATCGCTAAATCTGTGTACTCCGAAAAATCTG 
GTGCATCAAAATGTGCTACGTTATTATCACCATCGCCAGTCTGGTGTGGTTCCTCTGTTCGCTGGTGGCCGACATGCTCTTTGCTGTGGCCCTGG 
TCACGCCCAGGTGGCTGGTGGGCCCCGCCCAAGGAACAGACTCCACCGCCAGCTCTCACCACCAATCCTCGGTGGGCATCTACACGCGCTGCAAG 
GTCATGCAGGAGGGCGGATTCCAATGCGGCCGCTTCGACTTGGACGGACTGGCCACGGACAGTAGCGTTTACCCCAACGAATGGAAAGCGGCAAT 
GTTCTTCGTCATGCTGGGATTTAGTCTACTCTCTGTAACGGTGATCCTGACCCTGATCACCTGCTGCCGCCAGTCGGCCTGTGGCAAGAGCATAC 
ACAACATGACGGCCTGCGCCCAGGTCGTAGCAGGTAAGCAACCATTACTGTGATATCCTCGCAGCATTTAAGTTTGCATTGTTTATAGGAATATG 
TATGATGTTGGGTCTATTTCTGCATCCGATGGGATGGCGAGCGAACCGGATTCAACGTCTATGCGGCATGGATGCCGAACCCTTTTACCCCGCCG 
ACTGTAGCATAGGTAATTCTTTGAAACTTTTCTTAGAAACTTACTAAACTTAATCCTTACAGGAGTTTCATTCTACTGCGGAATCATTGGGGTAC 
TCCTAACATTTACTGCCGCCGGTATAAGCCTAAAGGCAGAGTCTTCCAATATGCGAACTCGAGTTCGCAGACGCGTGGAGGCGGGCAGTAAACTG 
GTCTGTATTCCGTAGCTGCCTTTGAATACAATAAGAACACTATCTGACTTGGAGCGAGCATACTTAACTTCATTCTCGAATTAAATTTATCATTA 
GCATAAATAAACTTTTAACATTCTACAGTATAGATACAAGGTGTGGGTGGATATCTATACGACCCTACTTGTTGCGCTTGCCCTTAGCTTTGCCA 
GGAGGCTTGCCCTCAACATTAGGCTTTGCCTTAGGTATGTAGACACCATGGTTCGTCTTCTTTAGGTGGGCAAACAGCTTGTTCTTGGAGTCAAA 
GACAAGCCTACAGGTAACACAAGTGTGTTGCGACTTGGTCGGGTCGAGGTCCTCATCTCCGCTTTTGGGAGCCACTGTTGCCTCCTTTGTATTCG 
GTTCCGAAACAGTTTGCTCTAATAGCTTAACCTTCCTAGTCGTCGTCTTTTTGCTTTTACTCTTTTTGCTCGCTTTTTTGCCTTTACTCCAGTCT 
TCATCCTCTGACTCGCTCTTGATTGCCTTCTTTTCTATTGGCTCATCTGGTTCGTCACTGCCTTCCGCCTGCGCTTGCTTGACAGCTGACTTCCT 
TGCCTTTTTGTTTTTTTTGGTTCGTTTGGCTAGCGAAGATTCTTCTTCGCTTGGAACCCCGTCGAAAGATATCTGATCCTCAGAGACTTGCAATT 
CTTCCAGCGATTCCTGTACGCCCATTAAACTGTCTTCATGGGGTTCATTGTGGAAAGCGTCCTCTTCTTCCTCCATTTCCTGACACAGCCGGTCG 
ACATTTTCATTATGCTTCTTGCTTTCTTCGTGATTAGCACGCGCCTTGGCGTTTTTAAATGTCTTGTTGCATGCCACGCAGTAAAGATCGTCTAC 



FIGURE SHEET 146 



ATACTCCACTTCGTACTCCTCGGCTTCCTGGCCGCCTTCATGGTCAAAGTCCTCGCCATCGTCGTCGTTCTCGTCTTCATCCGTGTAATCTTCTG 

ACTTGCTATCGTACTGCTGCTCTAACTGCTTTAGCTGCTCCTCGTAGCCCTCGTTAAAGACATTATTTTTCCTGACGGCTGCCAGTTCTTCCTGG 

CGCTTCCTTAACTGCTCTTTGCGCTTCTCCTCCTGCTTAAGTCGATTGGCCTCGACGCGTTCCTCAAGCATGCGCCTGTAGGCCTGCACTCTGGG 

ATCCCTTTTTCGCACAAAGTTGACTAAGTTGCGAACCTAAGTAAATGAGGTAAGTAAAATGTGTCTCGGAAGAAAAAAAAGGGAATTCATCGCCC 

ACCTCTTCGTTGCGTTCCTTGCGGGCAGCCTGAACGATCTTTTTCATCTCCTTCTCCACCTTGCGTAGGATAAAGCGCTCTTTTATTTCGCGAAC 

ATCGTAGGGACACAGCCAGTCGTAGGTCTTCCGGGTACTGTATGCCTGCCAAAAGGCGTAGAACGGACCCACCACATCTTCGTAACTACTGTTTG 

AATGACCAAAGTCAGGCGCCATCCCCAACCGGTCGTCCTTGTCCATGAACTCTAAGTCCTCCGAGGCGATTTGGACGAAGACATCGGTGTACACG 

CGGTAAAAGCCGTGCTCATTGTCGCCATAGCCCTTGTAACAGGAGCTCGTGAAGAACTGGAACACGTCCAGGCAGTTCTCCGCATAATCGGAGTT 

CTTGCCGCGCAAGATCTGCTCGCGATGGTTGTCGTACCAAGACCGTTCCTGAGGATCAGACAGAACTTCATACGCTTGCTGGATCAGCTGGAAGC 

GCTCTTTGGCCTCCGCCAGTCGGTCGGGATTCTTGTCCGGGTGCCAGCGCAGGGCCATCTTACGGTACGCACTCTTGATGTCCCCATCGTTGGCG 

TTTCTTTGGAGCTCGAGCTCCTCGTAATAGCAACGCATGTTTCTTAGAACAACAAATAATTTTGATAGAGTTGCCAGATGGGCGGAAAGACAGCT 

GGCGCTCACAGATGCGTTGTCAGTGCTGCAAAGCGCTGGAAGGGGAACGTAAATATCTTGCAAAGTAGCTTACTTTTGGGCTGAGAGCGTTTTGC 

GTTTATTGTAATAGCTGAAAAAAACGTTAGGAAAACAATAGAAGCCTATTCTTACAAGGTCTTGAAATTACAGTTACCCAGTTAAGTGGCTAAAC 

TGCAAGTGGCATCTCCTAGCAGTCATACGTAACTGCGCATCCCGTCATTAGCTTTTTTACTTTCGACGGCGACGGACGTTTCTTTTCATCCGTGT 

CCGCGCATCCTTAGTTGACTAAATTCTGCCGCATGCTTTTCTCACTACCACCGAAGCTCACACATCGCATACCTATACACAGGCAGAGAAAAAAA 

ATAAAATCGGAAAGGTTCCTGTGAAAAGCAGCAGTTAAACACATCAGAAGTTGTAGTTTTTCCCAGGACGTAACGGTGCTTTTTTCCCTTAGTTT 

CCGAAGCATTTATACATAATTTGCATAAACTAAATAATAATCGACTGGAATCGAAGCTCGACTATTGCCAATCACCCATAACGATACAAGCAAAG 

TGCCAAAATGTCGTTTAGCATTTGAGTCTGCATACGAAAAGGAAAATAGCTTGACATAAAATTTACAGTTACGTGGGATTTACAACCGTGGGCTG 

TAACACACAGACAGGCCTTCACATACACACACATACTTACGTCGAGTGAGTGCGGGAGTGGAGTCCTTGAAGCATCTCTGCTTACGAACAACCCC 

CACAACATTGGGTGCGTCGGTACTCGGCGCGAAGTAATGAATAACGGCCAAAAATAACCGTCAACAAAAAACAAACAAACACAAGTATAGAAAGC 

CGACGAAGTCCGTGTGTGTTCTGGTGTAGATTTACCATTGCCCTTTCATCTCTTGTGCGGGTGTGTGTGTGGGCGCCAAAATAAAAGAAGGC 

(SEQ ID NO: 268) 

Exon: 2797.. 2188 
EKon: 2126. .1001 

Start ATG: 27 93 (Reverse strand: CAT) 
Transcript No. : CT9445 

AAACATGCGTTGCTATTACGAGGAGCTCGAGCTCCAAAGAAACGCCAACGATGGGGACATCAAGAGTGCGTACCGTAAGATGGCCCTGCGCTGGC 
ACCCGGACAAGAATCCCGACCGACTGGCGGAGGCCAAAGAGCGCTTCCAGCTGATCCAGCAAGCGTATGAAGTTCTGTCTGATCCTCAGGAACGG 
TCTTGGTACGACAACCATCGCGAGCAGATCTTGCGCGGCAAGAACTCCGATTATGCGGAGAACTGCCTGGACGTGTTCCAGTTCTTCACGAGCTC 
CTGTTACAAGGGCTATGGCGACAATGAGCACGGCTTTTACCGCGTGTACACCGATGTCTTCGTCCAAATCGCCTCGGAGGACTTAGAGTTCATGG 
ACAAGGACGACCGGTTGGGGATGGCGCCTGACTTTGGTCATTCAAACAGTAGTTACGAAGATGTGGTGGGTCCGTTCTACGCCTTTTGGCAGGCA 
TACAGTACCCGGAAGACCTACGACTGGCTGTGTCCCTACGATGTTCGCGAAATAAAAGAGCGCTTTATCCTACGCAAGGTGGAGAAGGAGATGAA 
AAAGATCGTTCAGGCTGCCCGCAAGGAACGCAACGAAGAGGTTCGCAACTTAGTCAACTTTGTGCGAAAAAGGGATCCCAGAGTGCAGGCCTACA 
GGCGCATGCTTGAGGAACGCGTCGAGGCCAATCGACTTAAGCAGGAGGAGAAGCGCAAAGAGCAGTTAAGGAAGCGCCAGGAAGAACTGGCAGCC 
GTCAGGAAAAATAATGTCTTTAACGAGGGCTACGAGGAGCAGCTAAAGCAGTTAGAGCAGCAGTACGATAGCAAGTCAGAAGATTACACGGATGA 
AGACGAGAACGACGACGATGGCGAGGACTTTGACCATGAAGGCGGCCAGGAAGCCGAGGAGTACGAAGTGGAGTATGTAGACGATCTTTACTGCG 
TGGCATGCAACAAGACATTTAAAAACGCCAAGGCGCGTGCTAATCACGAAGAAAGCAAGAAGCATAATGAAAATGTCGACCGGCTGTGTCAGGAA 
ATGGAGGAAGAAGAGGACGCTTTCCACAATGAACCCCATGAAGACAGTTTAATGGGCGTACAGGAATCGCTGGAAGAATTGCAAGTCTCTGAGGA 
TCAGATATCTTTCGACGGGGTTCCAAGCGAAGAAGAATCTTCGCTAGCCAAACGAACCAAAAAAAACAAAAAGGCAAGGAAGTCAGCTGTCAAGC 
AAGCGCAGGCGGAAGGCAGTGACGAACCAGATGAGCCAATAGAAAAGAAGGCAATCAAGAGCGAGTCAGAGGATGAAGACTGGAGTAAAGGCAAA 
AAAGCGAGCAAAAAGAGTAAAAGCAAAAAGACGACGACTAGGAAGGTTAAGCTATTAGAGCAAACTGTTTCGGAACCGAATACAAAGGAGGCAAC 
AGTGGCTCCCAAAAGCGGAGATGAGGACCTCGACCCGACCAAGTCGCAACACACTTGTGTTACCTGTAGGCTTGTCTTTGACTCCAAGAACAAGC 
TGTTTGCCCACCTAAAGAAGACGAACCATGGTGTCTACATACCTAAGGCAAAGCCTAATGTTGAGGGCAAGCCTCCTGGCAAAGCTAAGGGCAAG 
CGCAACAAGTAGGGTCGTATAGATATCCACCCACACCTTGTATCTATACTGTAGAATGTTAAAAGTTTATTTATGCTAATGATAAATTTAATTCG 
AGAATGAAGTTAAGTATGCTCGCTCC 
(SEQ ID NO: 269) 

Start ATG: 5 {Reverse strand: CAT) 

MRCYYEELELQRNANDGDIKSAYRKMALRWHPDKNPDRLAEAKERFQLIQQAYEVLSDPQERSWYDNHREQILRGKNSDYAENCLDVFQFFTSSC 
YKGYGDNEHGFYRVYTDVFVQIASEDLEFMDKDDRLGMAPDFGHSNSSYEDVVGPFYAFWQAYSTRKTYDWLCPYDVREIKERFILRKVEKEMKK 
IVQAARKERWEEVRNLVNFVRKRBPRVQAYRRMLEERVEANRLKQEEKRKEQLRKRQEELAAVRKNNVFNEGYEEQLKQLEQQYDSKSEDYTDED 
ENDDDGEDFDHEGGQEAEEYEVEYVDDLYCVACNKTFKNAKARANHEESKKHNENVDRLCQEMEEEEDAFHNEPHEDSLMGVQESLEELQVSEDQ 
ISFDGVPSEEESSLAKRTKKNKKARKSAVKQAQAEGSDEPDEPIEKKAIKSESEDEDWSKGKKASKKSKSKKTTTRKVKLLEQTVSEPNTKEATV 
APKSGDEDLDPTKSQHTCVTCRLVFDSKNKLFAHLKKTNHGVYIPKAKPNVEGKPPGKAKGKRNK* 
(SEQ ID NO: 270) 

Classification: chaperone 
Gene Symbol: BcDNA:GH03108 
FlyBase ID: FBgn0027599 



Celera Sequence No. : 142000013384832 

GTAAGAAAGATCATCTCTGGCTGAAACAGTGCGTACTGACTTTTCATTGTACGGCCTCTCCGGCGTGTTGTGGTACGAGATCAGCAGCTCCTTGG 
CCAGCATCTTGTCTGGGAAGTTGCGAATCAGGTACAGCCAGCACTGAATGGTTCTAGACATAGGGGCAATAGTGAGTTATTACATCTGGCCATCA 
CCCCACACCCTCATACTCATTCTGTTCCATTAATATGGTTTGTTGCTCGCGATCATAGTAGTCCGGGTAGTCTTCTAGTATGTCCAGTTTGGAGA 
GCATGGTCTCGTCCACTTCGTAGACTTCGCCTTCTATGTGGTTCCCCTCACCCGGGCGGGCGAGCAGAAATGGGATGTTGTAGCGGGTTCCGACA 
ACCAGGGGAAACTTCGTTTCCGTTTTCCCCCTTCCAAGGAATCGGGCTTGGCCGTTCTCCTTTTTCGTCAGCCAGTGGTGATTGGGCTCACCGCG 
CTTCAACGTGCCATACACAAAGACCCTTGCCGCCATTCTCAGCTTTTCAGCCATTGCGCTTCCCGCCGCCAGGCTTATCAGTTTCAAGCGGTATC 
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GTGCTATCAAATCCGACCTGTAGATTGCGCAGTCGACATAACCCGTCCGCTCTTCTTCTTGCTGCCATTCGCACCGAATCTAGCCTCCGTACACC 
GCGAGAAACTGAAGCCCTCGGCACTGGGCTGTGGAAAATTACCAGGGACTGCTGTCCAGCATCTTTACTTCTATAATAGAACGAGGGGCATTTTA 
TTTTATTTTAAATTGCTTCAGTTTATTTGCTTTGTTTGTCGAAAACTATCAAATTACAGACTACGTTTGCTAAAACCTAAATAATGCGTATGGTA 
CATATGTATCAGTTTATATCGTTTGCCCATATATACGGGTATATATCGATCAGTACATCTGCGTCTGCAAGTACGGTCTTAAAATGCAACTACGG 
TAACATAAGATTAGTGGCCAGGCTATCCAGAGAACATCCAAACAAACGGGCGACACCACAACCCACGTTATTAGTTCTGGGCCTCGTAGGTGAGG 
TCGTCCTGGGCCGGATGCTTGTGGGTGCGGCGATGGCGCATAATGTATGGATGGGTGGTTGAGTTCTCGTAGCTGGACAAGTAGCGCAGACTAAG 
TAGGTTCTCCGGGTACTTCTGCAGCAGGTACACCCAGCATGGAACGGTTCTGGAAAGGAMTTGTGACCCATCACACCACATTGCTGAAATGACG 
TCAGATCCGACTAATGCTCACCCTTCTCCAACGCCAATGTTCATATCGTGCATCTCGCGAGTGTAGATCTCCTCGCAATCCTCCAGGTTGTCCAG 
GGAATTCAGCATTCGATCGTCCACCTCGTAGATCTCCCCTGTTACGTAATATCCCACTCCGGGCTTGTTCAGCAGGAAGGGAATGTTGTAGCGGG 
TGGCGATCACTAGCGGCAGCTTTTGGGTGGTGGTCGCCTTGCACCAGAACTTAGCGAAGCCGTTGCCAGAGCTGGCCAGGATTGAGTTGCTGGGC 
TGTCCGTACTTGAGGGCGCCGTACACGAAGAGCTTGCTCAAGGCGGAGGAGACTTTCTGTGGAGTGCGTGAGAGGTGGGTGTGGCGTAAAGACAG 
ATAAGTAGGGTTAGGTTAGAGATCTGATAAACTGGTGGTATCTTGTGAGAGTTTGCTCTACTGGTATCAATCGATCGCCGTGGCTTATCAGGGAT 
CAGGGATCCGAACACTTTCCACTGGCTAGCCCCCGCAAGTTATTGATAAGGTTCACGAAACGTCTCGGTGTTTGGCGAGATTCTCAGCTGAACCG 
CTTTTTGGGCGAGAATGGGATGGCTCAACCGGTTTCGTGCATTCACACGTTTATTCTTAAAGTCTGTTATTGTTGCTAATATATGTACGTAGCTG 
TTTCGTATTTTTGGGGCTGAGTGCGTGCCTCACAAACTGCGCAGCAGTTACGCCTCCACAGGTACATATGTATGGAGACACTCGCTATCTGTACA 
CATGTAATTGGCACATTAATAATTGCTTAAATCGGTTTAATCATTGCAGTAAATGGATAACACACGCAAGCGGCCTTAAATCAATGTCATCAAAC 
GGTTGCATCAAAGCATCTTGGCGAATCCGCTTACATAACCCAGAACAGGCATCAGCTCACACTCATACATATGTAGATATCTTAAAAATTCCAAT 
AAACTATCTGATAACCCTCCGATGGATCGGATAACTTAAAATGCCTTAAATTTAGTTCCCAAGCTCTCAGGCTTTTACACGCTTTCCATTTGGGG 
CTGCCCACTTGCCAAGGACGAAGGATAAGGGATGCGTATACACAAGCTAAAAGTGTGCATGTGTGCGTGGGCTTCACATTGTGCTGCACTCTTAA 
TCAGCTTACCTTTGCTTGGCCCATTTTCTTGTCGAACCGCCTGCAGATAATTAAAATTGCAAATTGCTGCACTAAGCAAAGGCGAGAGAGCTCTT 
CGGGATTGGATAGGATAGTGTGCACGAGCTCCAGCAGAGATACAACCGCGCCGATGGCCAGCAAAACTCCAACTAAATCGTGATGGAGAACAACG 
TCGGCGGACACGGTGCTGCCGCCTGCCACTGTTGTTGTTGTTATTGCTTCTGGCACAGTGGAGCAAAGTCACAACGTGTTATTGGTTCAACTTAT 
ATAAATATATTGCATATGAAAGTATTAGTCGACACTACGTGGAAAAGCAATTCTTAAATAATAATTATGGTATATATCCAATTTCATCAAAATTG 
GAAAAAGGAAGAAAATATTATTTGCCCTGTTTACTGACTCTGCCCAGTGTGCCTTGTCGTCAAGACCAAGAGGCCCGCCTCTAGTGGAACAACAA 
CAAGTAAAACAATAACACGCCACACGCGCAAAGCTAAAAATGACGGTTCTGCCGACTGGTCTTTATCTAAGAAATATCCTCTGGGGGGGGGTGTG 
GGTTAGTTGAAATACCCGAAATATTATAGGCCCCGCAAGCACAAACACTCTAAAATTCGGGCATTATCAATGGCCTAATAACATCTGAGCACAGA 
AGTCCCTCTTGAGATCCACGCTCCCTCTATGATACCCCCCCCTAGCAGCATTTTAGTTCGCTGGCGACGCCTCTGGAGAACACGTACGCATGTCT 
TCTTCAGTTTCTCTGCTGATCGGTGGGGATCGGAAATCGTATACTGTATATTGTAATATTCTGCACTTCCATGTGTCCGATTTGTTATAAACAAT 
AACAAAGTCTCTTTGTAATTGATTGGTTGATAGGTGGGGGAGGGGGTGCGGCAACAACAACACCAAAACCAAACCAGTTGACATGCGCTCTTTAA 
AATCAAAATCTCACTGCGCTCTCTGAATTGTGCCCTGCGAAAGGTAAGGCATTTTAGTTTTCGGACTCAATACGAACTATCAAGCGTAACTACAT 
ATTGTATATTGTTGCCCATTCATTATGAAAGGGTAAGCACTCAATAAAGTTAGTGCTTCTCATATTTCCTCAGGTTAGATAAATCGATATAATAC 
ATATGCATAATGCGTTTTTGAAGTTGGTATTTTAATATGGACAAATTATAGGTTACTTTATGAGGCGATTAAAGCTGTTTTTATATTTTATGTTA 
TTCC 

(SEQ ID NO: 271) 

Exon: 2614.. 2385 
Exon: 1576.. 1257 
Exon: 1189.. 1001 

Start ATG: 2399 (Reverse strand: CAT) 
Transcript No. : CT9529 

AAGCAATAACAACAACAACAGTGGCAGGCGGCAGCACCGTGTCCGCCGACGTTGTTCTCCATCACGATTTAGTTGGAGTTTTGCTGGCCATCGGC 

GCGGTTGTATCTCTGCTGGAGCTCGTGCACACTATCCTATCCAATCCCGAAGAGCTCTCTCGCCTTTGCTTAGTGCAGCAATTTGCAATTTTAAT 

TATCTGCAGGCGGTTCGACAAGAAAATGGGCCAAGCAAAGAAAGTCTCCTCCGCCTTGAGCAAGCTCTTCGTGTACGGCGCCCTCAAGTACGGAC 

AGCCCAGCAACTCAATCCTGGCCAGCTCTGGCAACGGCTTCGCTAAGTTCTGGTGCAAGGCGACCACCACCCAAAAGCTGCCGCTAGTGATCGCC 

ACCCGCTACAACATTCCCTTCCTGCTGAACAAGCCCGGAGTGGGATATTACGTAACAGGGGAGATCTACGAGGTGGACGATCGAATGCTGAATTC 

CCTGGACAACCTGGAGGATTGCGAGGAGATCTACACTCGCGAGATGCACGATATGAACATTGGCGTTGGAGAAGGAACCGTTCCATGCTGGGTGT 

ACCTGCTGCAGAAGTACCCGGAGAACCTACTTAGTCTGCGCTACTTGTCCAGCTACGAGAACTCAACCACCCATCCATACATTATGCGCCATCGC 

CGCACCCACAAGCATCCGGCCCAGGACGACCTCACCTACGAGGCCCAGAACTAATAACGTGGGTTGTGGTGTCG 

(SEQ ID MO: 272) 

Start ATG: 216 (Reverse strand: CAT) 

MGQAKKVSSALSKLFVYGALKYGQPSNSILASSGNGFAKFWCKATTTQKLPLVIATRYNIPFLLNKPGVGYYVTGEIYEVDDRMLNSLDNLEDCE 

EIYTREMHDMNIGVGEGTVPCWVYLLQKYPENLLSLRYLSSYENSTTHPYIMRHRRTHKHPAQDDLTYEAQN* 

(SEQ ID NO: 273) 

Name: Calcium binding related protein 
Classification: ligand_binding_or_carrier 



Celera Sequence No. : 142000013384694 

GCACGACACGCCGGAGCGAATGCTAGAGAAGGAGTGTATCAGTGAGATTGTGCCTTGGCGCGATTCCCGCCGCTGGCTGTACTGGCGTCTGCGAC 
GTCTCCTGTTGGAGGACGCATATATTAAGAAGATCCTGCGCGCTCAGGACAACCTCTCCGTGGGTCAGGCCAAGCAGATGCTGCGTCGATGGCTG 
GTAGAGGAGAAGGGTGCCACAGAGGTGAGACATTTTGAGCTTCCACTCCAGGAGACCACTAAATTATTCTCTTATGTATAATTAGGCTTATCTGT 
GGGACAAAAACGAGGAGATGGTGTCTTGGTATGAGGAGCAGATCAATGCCGAATCTATTGTTTCCCGCAACGTGAACTCCGTGAGACGGGATGCC 
ATTATTTCTACCATTTCGAAAATGCTCGAGGTAAGAGCAGCCCAAATCTTAATACCTATAAATTGGGAACTAACTCCTCTATTTATATATCTTTT 
CAGGACTGTCCCGACGTAGCGCTGGACGCTGTTGTGGGTCTTTGCCAAGGTCTGACGCCAGTGAATCGAGGCGTGGTCGTACGCACATTAGCCCA 
GATGCAGCTGAATGAGGAGACCTCTAACAGCAACCAGGGATGATTCTACGCTATGTTCTAAGTTCTTTTTAGGCCCTCTATTTCATCCTTCCCGT 
TCCCGCCCTTCACCACATTTCTCAGTCCCGCTTTGCGCTTGTAAGATATCGCAAGGCGTGCATGATGTGCATTTTGTTTAGTTAAATTTGTTTAA 
AAGTAATTAGCGTAAATCTAGTTATGTATTGGAGTTGTAGGCATGGGTTAATTTGTTGCCTAAGTGGCCCGATAAACTAGTTGTATCGTGTACAT 



FIGURE SHEET 148 



ATTTGATTATGTATATCCCCTGAATACAAATCATTCCATTAGTTGTATAAGAACTAAAGTCCTTTGTACACAGAGCCTTTTGAATAATACGAGTT 

ACTGAAAAAATAAATGTATGGTACACTACATTAACAAAACTCGCTGGATTGCAAAAGGGGTTTTTCGTCTTGTTCTGTTTTTCTTTTCTTTTGGT 

TCCACAGGATCGATAATGTACATAATTTCATTTATTTTTATAATTTTAAAATGAAATCTATCGCAGTCGGGGGTTATATCCTCCGATTCACTGCG 

TTCCTATATGCCGTTTTTTAATTAAGATGAGTGGGGGGAGATGGGATGGCGAGAGCCTGCTTCAGTTCAGGTTATCCATCCAATCGGGCTCGGCG 

ACACTTATCTTCTCTATAATGGAATTAACCTGATAGCATAGCGACTGGATCTGCCTATCCCACTGAGGCAGTAGTTCCCGATTCTCGAAGTGCAC 

AATGGCCGAGATCTGGTCGATGTGTCCGTTCATGCGACCTTCGGTAATCATCTGAGAGGCAATCTTCTCTGCTTTCACCGCGGGTATGTCCAGCA 

GAGCGCCTAGCTCCTCAAAAGTTATGTTGTTGTACAGCTTACTAGCCGACAACAGATTGTGCTCGAAAACAGCCCGATCCAAGATCGAGGATCCA 

TCGGAGGTGGCAGCCTTTTGGTGGTCCTGCAGCAGGGCCTCGAACTCCTGTAGCTCGGAGCGCCGAATGATGCGCTCCAAGTACATCTTTTCCAA 

AATTCCATAGGCCGGCAAGTGCTGACAGCGTTCATCCTTGAACAGAGTGGCCAGCATGCGGGAACGCTGCTGTCCGGCAGAGGCGAGGACTGTGC 

AGATGAGCGCTTTCTTCAGTGCCGTCATCCTCTCGCCTTGGTCCACGATCTTGCGGTAGGAGAGCTCGTTATATCGCTGGGCGGCCTCGATAAAC 

TTGCGCCTGTAATCCAGGACACGAGCATAACAGACTTTATATAGAACCTAAAAGGTGCGATTTTTAACAGATTACCCATCTAAAATAGCGCTAGG 

AAGCCATTTACCTGCAACTCCTCGGAATTAGTTTCGGCCTGCAGCAATGATGCCCGGTTGATGAACAGCTCCGCCTGCACCGAATCGTTGTCCTC 

CAGATAGAGTCGAGCGATCTTTAAGTAGGTGCCCAACTTGCACTCCACGGAGTACTGTTTCTGACCCGTCTCCAGAGGAATGCCAACTAGCACTG 

TGGCCGCATCGCGCCACTGCTGGTTGCGCTCATAGATGTTGGCCAGGTGGAAACGTATGCCAGCCACTTGCTCTTCGAACGAGATGACGCGCGGA 

TTGACCTTTTCCAGGGTGAAGTGAGAAAGCATCTTGGACAGATCGTCAGGCAACTTGCTCAGCTCGCTGCCCACGTCGTTAAGGATCTGCCGCGA 

GATCACCAGGCTGACGTGCTCGTTTACAATGGCCTCCACAAAGAGCCGGAGCCCATCGATCAGCTCTTGGCCGGTGTTCGTCAGCACAGTCTTCA 

GCAGCTGGCGGTACTTGTCCGCCTGGTCCTTGTGCGTGCCAGTGAAGTTGATCAGGCCCATCAGCTGGCTACGCAGAGCTGCCGTCGAGATGCCG 

TAGTTTGCGGCCATATTCACTTCTATACCCAATTACTTCGCACTAGCTAAAAACAACAACGCAAGCTCATCCCTGTTATCGATAGCTCGCAAATC 

CGGCCGGGACAGTGCAGGACGCCTAGCTTTTATCTAGCTGGTGTTGGAAAATGTTCGATGGTGATACTCAGCTTACAAATTTAGTTCGCAATTTA 

TATTTTATTTATTATTACCTATCTATCTATGACTTCTTTCCATAGATAGGTTTTGAGTAAAGACCAAACCACATGAAAATGTTGACATAAAAAAA 

TGTATATCTGAGGATAAAATTTAAATATTACATAACATAATATAACATATTTAAAATACTATCTGCAAAGCGCAAGACTCTTACCAAGGCTTACA 

TTTGTTCTACTCGTATTATCTTCCAATAAAATATTTATGGAATAAAATATGCCAATAAAATATTGGCAGTATTTCCTTTCAGCATTCGATTTTTA 

AAGGAAGGCCTATAATGATTCGGTAAGGTTTAAAATTCACTTAATGCTTACACTTGTACATACTTTCCCAATAAATTATTTAAAATTGGCGCTTC 

AATCCMGCTGCAGTCTATCGGTGACTGCATTCTTGAAATATCGATACAAAAATGACAGTCGGTGCGCTGCAGTAAACAGTGAAAAAATATACTT 

GT GT C G AGTC C AAT T GG AC T G CG AAAAC T GC AT T AAAC AAAGAG GTC AAAG AC AAT C TG C AAGT GGTAGT AG CT AG AG AAT T ACT AGAAACT C GA 

GAT AT T C AGT GC AAAG AC C AATC C AT TCGGATCCTT GT GG AAAAGGG T GG AGG C C AGGCC CAC AC ATAC AT AC AC TT T C AG C AGC AG C GC AGC T C 

GCCGAGTGAAGTGGAAAATGTATCCAAAGGAGTCCAATGGCCGTTTTTGCGAGCAGATATGAACGGAAATTGCCAATTGACGACACCCAGGCGCT 

GAGCTGCGAGATAACCGCCGTCCAGGAGGTCGCCGGACATACGGTAGCTAGATCGCGAGGGGGCAGAGCTGCTGACCCCAGCGAAAAATCAATAA 

TTGTTGATGAT 

{SEQ ID NO: 274) 

Exon: 2526. .1912 
Exon: 1852.. 1001 

Start ATG: 2484 (Reverse strand: CAT) 
Transcript No. : CT9624 

TTGTTTTTAGCTAGTGCGAAGTAATTGGGTATAGAAGTGAATATGGCCGCAAACTACGGCATCTCGACGGCAGCTCTGCGTAGCCAGCTGATGGG 
CCTGATCAACTTCACTGGCACGCACAAGGACCAGGCGGACAAGTACCGCCAGCTGCTGAAGACTGTGCTGACGAACACCGGCCAAGAGCTGATCG 
ATGGGCTCCGGCTCTTTGTGGAGGCCATTGTAAACGAGCACGTCAGCCTGGTGATCTCGCGGCAGATCCTTAACGACGTGGGCAGCGAGCTGAGC 
AAGTTGCCTGACGATCTGTCCAAGATGCTTTCTCACTTCACCCTGGAAAAGGTCAATCCGCGCGTCATCTCGTTCGAAGAGCAAGTGGCTGGCAT 
ACGTTTCCACCTGGCCAACATCTATGAGCGCAACCAGCAGTGGCGCGATGCGGCCACAGTGCTAGTTGGCATTCCTCTGGAGACGGGTCAGAAAC 
AGTACTCCGTGGAGTGCAAGTTGGGCACCTACTTAAAGATCGCTCGACTCTATCTGGAGGACAACGATTCGGTGCAGGCGGAGCTGTTCATCAAC 
CGGGCATCATTGCTGCAGGCCGAAACTAATTCCGAGGAGTTGCAGGTTCTATATAAAGTCTGTTATGCTCGTGTCCTGGATTACAGGCGCAAGTT 
TATCGAGGCCGCCCAGCGATATAACGAGCTCTCCTACCGCAAGATCGTGGACCAAGGCGAGAGGATGACGGCACTGAAGAAAGCGCTCATCTGCA 
CAGTCCTCGCCTCTGCCGGACAGCAGCGTTCCCGCATGCTGGCCACTCTGTTCAAGGATGAACGCTGTCAGCACTTGCCGGCCTATGGAATTTTG 
GAAAAGATGTACTTGGAGCGCATCATTCGGCGCTCCGAGCTACAGGAGTTCGAGGCCCTGCTGCAGGACCACCAAAAGGCTGCCACCTCCGATGG 
ATCCTCGATCTTGGATCGGGCTGTTTTCGAGCACAATCTGTTGTCGGCTAGTAAGCTGTACAACAACATAACTTTTGAGGAGCTAGGCGCTCTGC 
TGGACATACCCGCGGTGAAAGCAGAGAAGATTGCCTCTCAGATGATTACCGAAGGTCGCATGAACGGACACATCGACCAGATCTCGGCCATTGTG 
CACTTCGAGAATCGGGAACTACTGCCTCAGTGGGATAGGCAGATCCAGTCGCTATGCTATCAGGTTAATTCCATTATAGAGAAGATAAGTGTCGC 
CGAGCCCGATTGGATGGATAACCTGAACTGAAGCAGGCTCTCGCCATCCCATCTCCCCCCACTCATCTTAATTAAAAAACGGCATATAGGAACGC 
AGTGAATCGGAGGATATAACCCCCGACTGCGATAGATTTCATTTTAAAATTATAAAAATAAATGAAATTATGTACATTATCGATCCTGTGGAACC 
AAAAGAAAAGAAAAAC AGAACAAGACGAAAAACCCCT TT T GC 
(SEQ ID NO: 275) 

Start ATG: 43 (Reverse strand: CAT) 

MAANYGISTAALRSQLMGLINFTGTHKDQADKYRQLLKTVLTNTGQELIDGLRLFVEAIVNEHVSLVISRQILNDVGSELSKLPDDLSKMLSHFT 

LEKVNPRVISFEEQVAGIRFHLANIYERNQQWRDAATVLVGIPLETGQKQYSVECKLGTYLKIARLYLEDNDSVQAELFINRASLLQAETNSEEL 

QVLYKVCYARVLDYRRKFIEAAQRYNELSYRKIVDQGERMTALKKALICTVLASAGQQRSRMLATLFKDERCQHLPAYGILEKMYLERIIRRSEL 
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TTCCCAATTCCTTCGAGGTGATGGCACCTTCGTTATCCTTATCCAATAAGCTATAGATATTCTTGAGCAGGTCCTGCTCTTCAACAGACAATTCA 
TCCATTATGCTTTATATTATGGGAAGGAGAATATTATTATTATGCTAGTAGATATACTTTTTCGATAGGATTGATACTCAGTTGAGGTCTGATAA 
GCTTTGAAGGAGTTCTACGATTGAAAATTGTTATTCAAAACATTCAATATAAATATTTTTGAAAAAAGCCAACTAAGAGTTAGCCTTCGGTCTTT 
AACAGATATTAAACAATCTGCGTGCTCCATGAAGTTCATAAACTTTTACTGAAGATAAAGGCATTAAAAGAGCATTCTCACGCACAGTTTGGCAA 
ACAAATCACGTGCTTTCCTGGACCCAAATGATATCATCTGGCGGAGAAACTCTTTGGCTTTACGCGTATTATTTTATAGTTGAACCAGAAATTGA 
TGTCATCGGTGGCGGTAAAAAATGGGCGCCCAATCCGCCCAATGAGGAAATTCCAGGAAGCCGGGCAACTCCAACTGTCGGTGATGACAATTTGC 
GTGGCATTTTAATTGAACTTTCTACGTGTTCGAGTGGTGTACTTTGATTTGATCCCCATCCACCGATTTTTCGTCGATTTGAGTCAATGTTAAAG 
GCGTTGCCACAAACCGCAAACTGTCGTGTCTAGGAGGTTGGGGAGGGGCTTGGTGGCTGGGGCCTCGAAACGTGCCTGAAGTATTATTACCGCCG 
TCTGTGATTGATGGCCTCATGCGAGTGAGTCAGTGGATTTTTATAACTCTACGCAGATCCCTTCCTTTCAATCAAGCGGCAAAAAATCTGGACAT 
CGATCTCGCCGATATTAGCTAACTTCATTTCAACTTAACTGCCCGGAGTGCTTTGAATAAAGTGGATTTGAGGCGATGCCTTGGGATTCTTGGCC 
AGATTAAGTCGATTTTTTTTTTTTACTTCAATGAAAAAAATGGCACGCAATTGTTTTTCATTATTAATGCATACTTTATTTGAAATCAACTAACT 
CGC T ATGGTT T GAAT AAAAT C AT T TAAAATT G AAGGAAG AC C AT GG AAT TCTATGGGTTGCT T AT AT T AACT AC T G C AAAAC T CT T AAAATT T AA 
GCCTAATTACATAAGGTTCTACTGGAATGTTGCTTGTAGTTGGATTTTTGTTTATGCTCTGCTTTACACTAACTTTTACTTTCAACCATTTTTTT 
CCCTACAGCAAAATCGATTAACGATTTTGTTTACTCATTAAGATTAGCTTACCGGATAAGGAAACATAAATAAAAAATTGGAATTCACTCAGCAA 
ACAGCAACACTTAGTGTTGGGCCACACATTAAAAACTAATCGATAGACTCTCGTACTAAAGGCTTAAACATTTAAATCGGTAGATTTAAGCTAAA 
ATCCTAACGCATAAGAGCTCTATAAAATGCAACAACGGAGTTTGTTTCGCTTACAATTATCGCTTAACTGATTTTATTACTTAGTTTAGGTCTAT 
TGCCTTGCCTAGACATTGTACTAAAAAGTATTAAATTGTAGATCTAAATTTATTTTAATGTACTTGTGGGTCTAAAACGATAAATCCTCGTTTAA 
CCATTGGGAACAATAACTGTTTGTCATTATGATATTCAACTGTTCATAAGACCTTCTAAATGAAATACCAGAAAAAACAACCGAGATATATATTT 
ATTAGGAATACTACTACTTTTTGATATTTCAAATTCGTAAGAGCTTTGATAAATAGTTTTCCAAATTGTGTTGTTTTTCTATTCCGATTTTACCT 
CCATTAATGGATTGTTTTCTAGTTCTTTCTCGGTTTACTTCTAACGCCTCATTATGTCCTCAATGCTGAACCCTGTTCTTCTCGGTGGTGCAGGC 
GGTGGTGCTATGGGCGGTGCTGCCACTGGTGGTCCCTTTTCGGTGGTGGCTCCACTTGTTTCCGTACTCGGTGGTTTTCCCAGAAGCTGTCTGGG 
GGGATGTCGCGACATATTCAAAGCATATATGCTGGGACTATCCACAAAGGTCCTTTGACCCAATCCAGGATGGTGGTGCGGATCCATCAAGTCCA 
AGGTCGGTGCCGGAAGAATCCTCTTTCGCATGGGTATCGATCCCAGGAAGCTGGGCATAGCTTCATGCGGTGAATGCAAAGGTTGATTGGGTGGT 
TTACCCGACTCCGGCTGGGTTATTATAGTGGGTGGTGTATGTGGTGGCGGTGGAACACCAGCCTTCGAAAGGTTCAAGCTCTTGTGGTGCAAGTC 
GCGTCGAACGTGCAGAATGGGCAAATAGGGATCCCCGTTGGGTGGGAGTAAGTGTGGATGTCCTGGGTGCTGGTGGGTTGGCGGAGGAGCTCCAG 
GTGGTGCAGGCGGCGGTCCAGCTGCCCCAGGTGGTACAGAATATCGCACAAAGGAAGGTGGAACGGGTAGGCTGGCAATGCCAGAGTTCCCGGGT 
TCCTCCTGATGTTTCTCGCCATGTGGCCGCATTGTGTAGGTTTCACCACCCTCATGATGACAGTGGGTGACTCCCTGGCGTTCCGGCTTGGTGCC 
CACTGACGACGATGATTGCGATGTGGCGGCAGCCTTACTCGCCTGGAAACGCGCCACTAGTGCCACTTCTGCCTGCTCCTCGTCCTCGGGATCAA 
AGTCGTCGTCCTCTTCCCTGGGCAAATCGTGTACTCTTGGCAGATCCTGCAGTTCCTCTTCCTCGTCGAAGTCATCGAACTCCTCTTCGCTTTCC 
TCATGATCGATGGCTGCCTTGCGCTTGAGTCTCATTCTTTCGGACTCAGATTTCTCACCCGATCCTCCAGACTCGCCCAAATTGTGAACTGGCGA 
CTGGCGAGGCGAGTCCACTTCCAGCAACGAATCCTCCTCGTCGCCACTTAAATTTCTGGCCTCATCCTCTTCGCCCACATCCACATAGTCCCCGG 
AGTTGACCTCACCCACTGCATGGGTCAAGGCATGGCGTCGTAGATCGCAGTTACGGCGGAAAACTTTGCCGCATGAAGAGCACTCGTAGGGCTTG 
TGATCCGTGTGGGTGAGCAGATGGGTCTTCAGGTTGGAGCGCTGATTGAATGATCGACTGCAGACGGGGCACTTGTGGGGTGATTCCTCCATGTG 
CAGGATCTTGTGGACAGCCAAGGTTCTCGATTGGCAGAATCCCTTGCCGCATTCCGTGCATTTGAAGGGTTTCTCCTTGGAGTGAATATACCTGG 
GCGAATACAAGTGGAGAAACAATTAGACTTTATTGTTATAGCGTGGGCGTTTTCCTGGTAGCTGCAATTGGTAGCCCAGGTTGACAGTTATCTCT 
GCTCAAATGCAGCCTTAACTTGAACCGATAAAGCTCACTGATATACAGCTTTACAATACTCTAACTAAGGATATAACTGGCTAGAGCAGACAATA 
GAATAAATAATATAATACTATAGCATATAATACCTATCTTTCTCTTTCAAAATGAAACCTAACCTGGATTTGTTTTAACCTTAAACCCAATTCTT 
TAAAGTTTTTCATTAAAGAAATCCTATGATTTTGTATCCTCTTAACTGATAGGGTATTTAACTTTCGTGTATGCGATCTGGCTGCATTAATTTTT 
TATGTTTATGGCGGCCATCTTGGCAATGGCGTCCACTTGCTGTTGGATAGCAGCAGCCACAGTTAAAAGTTAAAGTCCCCATCTTCGACGGAGCC 
ACAGATTCAGCATCGCATCGGAGGCAGCAGCTGCTGGTGGAAAAAAAAAAATAAATGTATAAAAACCCAAAACAGGTTCTGCCTCCGAAGCACCG 
AGATTCTTCTCCCATGCTCCGTGCGTTTCTGGTAAAAAAAAAATACAAAAAACTTAAGAAGAATTAGCCCCGCGGCAAAGCAAACAAACGACAAA 
TGGACCGATACCGAACGATCAAAAGACAGAACGCTCGGACAGGCCGACACATTGAAGCCGGGATGCTGGACCTGCTGCCTGGCATCGGCATGGGC 
ATCGGCACCGGGATACACAGACCCGATATATAGCCCCCAACCCCTTGGCCATCCCTCAATTTTAATGCTGGTGGAGGAGCCCCAAAGGAAAACCA 
ACAGAGTTAATTAACTTTGTGTTGTACTCTGCGGCCGAGTTGAAGTTTTACTGGCCGTCAGAGGAGCTCTGATTTTGGGTCCGCTTTATGGGGAC 
TTACTTATTTGAGTTGAGGTAGGAAAATTCGTCGTAAAGTTTAAACATTTCCTGGAAAATTCCTTATGGTAAAAGGGGTCTTTTGAAGCGATTGT 
CTGATATATCTAAAATCAATAAAACTCCATCTACGAGTAGTGGTCGCCTTTCCCTTTTGGCCGAGATTACTGTCACACCTGTTCAGAATGTGTGT 
ATAAACTGCCTAGTAAGCAATAAATTACATTGATCTAATTATCCTTTGCCTTTACAGGGTATTCCATTATTCGCCATTCGTTCACACTTTGAGCA 
CAAAGAATAACTCCAGGCCTGATGGCAATTACCCGAGCAGACACTGCCCCCTTTTGTGATTACTTGTACCCCAACCTCGACTCACCTGTGATCCC 
TCAGGTGATCCTGTCGCCGGAACGCCTTGCCGCAGATATCACAGGAATAGGGCCGCTCGTCCGTGTGTGTCCTCTCGTGGATGAGCAGATTGTAC 
GACTTGGTGAATTGTCGATTGCAGAACTTGCAGATGAACTGTTTTTTAGGCCGCGATGCCCGACCAGGAGCACGTAGCAGGCGCCTATCGAGTCC 
GGCATGCATTCCAGGACCCAGGCCAGGTGGAAATAGAGCTGCGGCGGCGGCTCCTGGTGGAAATGGAAAACTTCCGGCTCCCGTGGGCGATGGAC 
CCGGTCCGGTGAGATATGGCGGAGGTGGCACGCCAGCTGGTCCGTACAACACTCCAGGCGGTGGCACACCAGGACCGCCGGGTCCTCCAGGACCT 
CGTCCTAACATTGCCGCCGCCGCAGCCGCCTCCCTTAGGGCCAACTCCTTTCGCTTGTCCGCCATCATGGCAATGATTTCGTAATTGGAATTACT 
ACCCGGGCGCACCTTTCCGGTGGGCAGAACCGGCACAAAGGCAGAGATCTCGCGATGATGCGATGGTGGTGGCGGTGGCGGCGGACCATGCAGTC 
CGGCCAAATGCTGCTGATAGGCCAGCTGTGTTTCATACGCTGAGATCCGGCTGGCCGATGAGTTGCGCGAATCGCCGTTGCCATTCTGGGAGCCA 
CCATCCGATCCTCCGGATCCGCCGTTCCTATCCGCCGCACCAGGAAGACCGCCACCACCCACCGCCGCCGCTGCCGCCGCCATCGAGTTCATGAA 
CTGATCCATCCGGGAAGGTCGCAGCATGGGTATCGCACCGCCTCCTCCGCCGGAGATCTCGCTGGATGAGCTTTCCGTGGGCATTATATTGGAGG 
GGCTCTTCACATTCCGTTGGAACTGCCGGAGATAGGGATAGAACAGAAAAAGCCGAAAATAAAATAAGGGTAGTGGTAAATAACAATTAGTACAC 
GATTGTGCTGGGTTAATTGGGCTCTCGTTTTTTTTTCGAGGCTTATCTTAATTGCGAACCGAGTGCAATAAACAGATATTTCCACGTTGTATTCT 
TTATTGCTCCTAGGGTAGTACTGGGTATGATAAAAACTGACGCAAACTGGTTTAAAATAGATATATTACTATTAACTTAGTTTCTTCTTGGTGGG 
AAGTTTCCTTGTAATTGAAATATTCATGATCTTGGTTTGGCAAGGCACAGGAATATATACAATATATACTATAGAATATTTACTCTTTTTATCTC 
AACCGGTTGTAACTCACTTTTTAGATGTGAAATTCTTCAAATTTGCTTTTTGCACTTCTTATCACAAAATTAAACAGCTTTATGGGCAATTAACT 
TGCAAATAGCCAAGGATTATCACATTAGCTGATCTTTCAGAATTTATCCACGTGAACGTGATTTAGTAGTCACAAAAAAAAAAATAATGAAGAAA 
ATATTTTTCTCGAAAATAACCCATACGATTCTGAGAACGTTTTGGTATCACAGTACGTATATTTTCACAGGGTAACTAAGCAGTTTATCACATTA 
TGGCCCTCGTCTATGGGTTTGTTGCTGTGTTTTTTCCATAGGAATTTTATTATTATTGTTGGTTTATTATTATCAGCGGCGGACTGCCGACGACG 
GCGAGTGAAACTAAACTGAAAACGAGTCGCAGTTTGGCACCGCGATCGGTTGCTTCCAGTGCGCAGGTTAGATAAGATTGTGCGTCTAAAAAAAC 
CTGCCTCCGAACCTCTGAGACTGAGTAAGCAGCACAGTAGTAACGATATAAACACACACACACACACACAGTTATATTATATATGTGCCTGTGAT 
ATATGGCATATCGGCAGATACCCCGGCACACTCAGATACAGATACAAAGATACAGATACAGAGGCAGGCAGCCAAGTTGCCGTCGGTTTTGGTAA 
ATGACTCGAACCGAACTCGACCGACTGCTTACTGGATTTACGCAACAGGAGCCTCGACTTCTGAGTGCCTCGAAAGAAGAAGGGGCAGAGGCAGG 
GGCAGGGGCAGGGGATCGGGTCCCGGAACGTGATCATCACGATAATAACAAAATAATGCACATAATAATGATCACAATGATGGCCGGACCATAAA 
GTTTGTGGCCCCATATTGCAAACGCTTCTGGCTGCCATCTTCCCAACGCCTTGCACAGCAAAAAAAATGGCACCAATTTTGTACCCAATTCTATG 



FIGURE SHEET 150 



GAATTCTATTTTAAGATCTATCTTTTACGATAAGATTATATTTGTTATAATATTAAAGAATTTATAGTAATATGCTTTAAAATTTGTGGCTCGCA 
AAGAGTGCGCCCCAGAGAGCAAAGTTTCCTCAGTAAGTCAACCTTTTTTTTCGGTGTACTATCCTCATTTAAATGTACAACCAGTTGAAAAAAAA 
AATGCTTTAAAAATAACCCAGCTGTCGGTGCAATGACAGTTTTGCTTATCATTTTGCTAGAATCGTTCTCGCTGTCTCGGTTGATTTTAATGCAA 
AATGCTGCAACGTCATTAAGACCAACATTAACATTCGGCTAACAACAAGTGAAGTGATCGTCATAGTTGGTTAAAAATGAGCTTAGCCAAGTTTT 
TTGGAGAGGCGCCCATTGTTGAAGTTACCGTTCTTTCCGTACGGATGACTAGTCATTGTGGTGGTCTCTTATCTATGGAAATTATTTTGGCATTG 
TCTGGCTGGAGAGTGCGCTGTCAGATTAGCTCTATAAAGTAGCTGAAATCATGACTTGATCATGAGTGTATCATTGTTATGGGGGAATGACTCTA 
CGGACCACGTATAGTTCAAGTGGATGAGATATAAGCAGGTCCCAATTATTAATTCAAATAGCTGTTAGATGTGGGTAGGTCTTATCGTGCTATCA 
TATCAGCGAGTCTTTAGTAGACAACGGAGCACGCAGTTAAGATATTAGGCAAATAGATGATCTAATTGTAATATTTACACTAGAAATGCTTACGA 
AAGCCATATCGAAGCGTTTTTGTTTCCCTAATACGACTAGCACATGCCAAACTAATCTCCACGTTTATTAGATAGTAATTGGATAATTCTTTCAG 
TTCAATACCTTTTAACTCTCACCCATGCGAGTTCTCGTGCCCTCACTGATAATCTGTCGCAGTCTATCTATATTCGATTCAATAAAGAACTTATC 
TGATCTAATAAGTCCCCCAACTTAGGGAACCTTTGCTGTAGATGAGCGATGGCCCCGCCATGCCTCATTCGATTACGATCCCGATAATCCCATTC 
CAATTTGGGCTCACGTATTTCTGGTAGCCTCGAAAAAAAAAAAAAACTGAAGTGTMCCCGTTTTGGGTATTTATATATATTTTTTTCGATTTCA 
GTACTTTTTCTAGGGATTGTTTCCACACTTTTGCATTTACGACCGCTTTGGACCCACGTTTCACCCCATACCCTGTGCACATACAATGTACAATT 
CAGCGT TC ATTATCTTTTTTGAAACATCGGCACTTTTGAAAACGTCAAGCCAAATAAGAAAAACAAATAAAATTCATGTAAATAATGGACATCAT 
CAGCAAAAGCGACGGCACGTAGACTGCCGTAGAAATATTGAGAGAGAGTTGTGCGCACAGATACAGATACAGATGCGGAGATACAAAGATACTCG 
TACTCAAAGATCCACAGATACGTTTGCGCAAACGGTTGCCCAGTAAATGAAGCCAAAGGCTTAAAGTTTCCAATATGTGTGCGTCTGAAGGTGCA 
CT GGAAAAATT ATT GTGC ACT CTTGTGAAT T AT TTAATATT AAAAAAAAGAGT ATATCAAAT AATTCTG AAT AT ATAGGGAAAT G AAC AAAAAAT 
AATTAGTACGTACTTTTTTATGTAATATTGAAATGTGCATATTTAATTAGAGAGGTAATATTAATATCTTTTCTTTCTCTGTATCGTACATATAA 
TAAAAAGAGCCAGAAGACCAACTACTGCTGGGAGGAAAATAAAAGATATCGAGGAGCAATTTGGCTGCCACTTTTTTCCCTCCTCTTCTTTTTGA 
GCCGTCTTGCTTCTTTTCCTTGGATTTCTGGGTCCGGGATCGGTTCGTTTGCTGGGTTTTTGTGTGTAAACAACTGAGGCCAGCAGAAACCCAGG 
GCCCAGCAGAGTGAAAAGGTTTTAGTTGATAAAAAAAAAAAACAGAAAAATATACAAACACACACAGACACACACTGAGCAAACGGATTTAGTAT 
TTGTATAGAAAACGCAGAGAGCTGAGGAGGCGAGTTTGTCCCTCTGATGATGGCCTACGATGGCTGGAAATGTCCATGCGTCTTACTGCCCAGTG 
ATTATGTATTGAAAGGGGTATATCAGGATTTCCCGACCAAGCCATTATATAACCTTTAATATCTTAGAAGGATCTGTGTATTTACAGAAAACAGA 
ATGTTGCTCAATGACAAGAAAAAAATAATTTTGAATTAAATTATCATTTAACGTTATATGAATCAGTAAGATCAATCCCATGTTACCTACTAACC 
GGAGCTCAACTATTTTAAAGGTTCTTAGCAATGGTTTTTGAAATAGGGACCGTTAGCTAGAAATACCTACCACAATTTTAAGGCCGAGGTACACC 
CGCGGTCTATCAGTAAACCTTTTATTTCGGACCTTATCTTCTGGGCCGAGTGCTTACCGGGCCCCCATATGCACTTCAGTTTGCTTACAAAACAC 
CGGCCAACAACAACAAACAAGCCAGAAGATGACGACGAAGAAAAGTGGAGTGGTGGGAGGAGTTCCCCGGAGGAGGAGAATGGGTATGGGTATGG 
GTATGGGTATTACGGGTGAAGGAATGGAAAAAGGGTCGCCGAGATCCGCTGTGCAGTGCGGAATGCTCTTGGCCAGGCTGTGTGTTTTAATCCCT 
T C GGGAT AC T T T T T CT T C C ACCG AGT GT GC AT AT GT T T G T GT GAAGGT AGC T AAT AATC AAG AC AGCG ATGG AG AGG AGG AAAAAC CG ACGC AG A 
CTTATAATTATAGTCGAAGAGGAGAACCAAAGAGACCTGCACTTCACATGCGTTAATCTGAAAACCCTTTCGGGCCCAAGCAGCATCTGAGATTA 
TAGTCAAAGCTATTAAAAATCTAGCATAATCATTGCCCTCGGTGGAAAGCAAGTTGCGTTGTTGTTTTTCTCTTTTTTTTTTTTTTTTGCTCCCG 
AGGTAGGCAGGCGATCTTCCTGCGCAGCCTTGTCTATAATTTTCTTGATTAATATCCATGAGATTTACACATTTGGAGTGCTGGCCAGAACCCTG 
CACCTCAACCCTCCGCCACCCCCCGCGTTCCTGTGTGTGCTGGCCATTAATTTAAAAAGCTTTGCTAATTTCGCGGGGAAATCTGGAAACGAACA 
ACTGCTAATGAAGTTTTATGGGTTCGGGAATGCAGCTGCAGCAGGATTGATAATGTCGTTAGCATAAATTGAAAATGCTTTCAAGTTACGACCAA 
AGATTTGTTATGATTAAAGTATGGAGCACGCGGAGTGAGCAAGAATATCCACCAATGGATTTTTATTAAATGATAAATGGCGAGCTGCAGATAAC 
CAGATTCCTCTGGTTGCTGAAATAAATATTTTTTTTTGAATTGAAACCTTCTTAGGATTAACTGAAGTTCAGACAGTTAGTTCCTAAAATTGAAT 
TATTTCGGCTTAACTTTGACTCTTTCTAATCCCTCTTGAAGTAATCCACTTTGAATGCTAAGCCGATCTTCGTATAAATTCTATTGTCATATTGT 
GCTGCTCCTTTCATTCGAACTAAATCCATTCAAAAATTTCACCCTTGAATAGAACTCTTAAGATCTTTTGTTGGTCGGATTGCTACTTTTTCAGG 
CAATCTATTGTGTTATAATTTCAATGGGTCAATTCTTATTTATTGGAAGAGTTTATGCTAATAAGTGTAGAAGCACTAAGACTATGATTTATTAA 
TTCAAATGCAATTGGATTATGCTAATTGCACACAAATCTTGGTACAAATTATTTATATATTTTTTTCTTCGACAACTTTTGGAACCCAGCCCAAA 
ACTAAACATGTAAACTGATAATCACCTTTCCGCTGTCTGATTTCATTGATAGCATTAAAAATTCATATTCCCTAGACAACATCTAATTTCCCCAT 
C AAAAT T GAT G AGT T C CCGT T C C AG AAC AAC AAT AAAT GAAC T G GT C T AT C AGGT G AAT T AT AC AC AC G T GC C AAAAC GAGGC GTGT CC AT AT AG 
CCATTTCAATTTAAAACCCGAATATGGCGATCCCTAACGAGCCATTGTTCTGCGGTTTTATGGTAAATCTAAATTCATCAAGTATTTTGTCACGA 
TTTTAATTAATTTCGATTTCAATTTTCCTGCCACAAGGCAATTACCAGCAACATTTTCCCCTGCTGCTGGTGCAACCGAAAAATGATAGTTGGTG 
GCCCAAGATGGAGAGGAAAACCCACCCAAAGAGGTAAAGCGTAATAAACTTTTATGGTACTCCCTTTCTTATGCTAAAACTCTTTTTGGACACAA 
AACCAAATGGCCCATAAAGCAAGACAATGAATATCCAAAATAGATCTCCACTGAGGCGAAACAAAAGCCAATAACTTTATAGAACCGCAACCAAA 
AAATACAAAAAAAAAATAATAATAACTCAGACTGTGAGAGCGCGGGAGGCGTTCTACAAAAAAAAAGCTTAAAAATGAAGGAAAAAGTAAAGCAT 
T T AAAAG AAAT C AAG AG AT GT AG AT G AC GAT GAT GAT G AT GG T C T GAG GT T G AGG C T G AAT C CG AAT C C G AATC GGT ATC T G AACC C AA C AGAAG 
CTTGCGGCAATTGCGTTTCCTCTTTGGCCACACGTTCTGGACAATTAAAGCCCCAGCCGAGCTCCGAATCTGAATCGAGCCAAGTTAAGACAGCG 
ACCACGACCGAAAACCGAAAACAGAAAACCAAAAAAACCGGATTCCAAGCCGAGAAGGAGCGAAAAAGACGGCGAGAGCGACCACTGGCCACTGG 
C ACT GGC T AC AAGT G AACT GGT T T AC AAAGAAAAG AA AC AG AAAT G AAAG AAAC AT G T G AAAAAT C AC GAT C T AT AAAGC C GATGC ACT T AAT AC 
CCTAGGCAAAATAAGTAAATAATACTATAGATCAAACTAGAGTAGACTTGTTGAAAAAAAAACACTGGAATAAATAAGAAGCTTATAAACTATCA 
GCTTATTAAACTAAGTTTGCCTAAATTTCGTAAAATATATTTTTAAAACCTAACGCGATGCTTTGAAATTCTGTAAAATTAAAGTTTTTCAGAAA 
TGTAACGTTATTAACACCGATTAATATTTTTAAATAATATTTTACAAATTGTGTACACCATTTTCAAACGAAGTACCCTTACCAAGAGCACCCAT 
AAGAGACCCCCAGAATTCCAAGCGAAGGTTTTTCGAGTCCGGCTCGAACCGACCAACTCGAACTGAGCGCCACAGCAACAACAATGGGGCAAGCG 
GGTGGTTCAGGATGGTGGGGAAGTTTGCGGGGGGATGGGGGACGTACTCCACCGTGGACTGTGCGGCACCACACTGAGTTCAGTGGCAATGACAG 
TGGCACATGCAGTTGTAGTGGAAGCCGCTTGTCGGTCGACAGTCCGCCAACCGCCAACGTTGTTGCGCTCTCTTTCCTGTTTTTTCGCCCCGTCT 
CATTCTATTTGTGGAGCAGGTTTTGAGATATATCTTGGGTCTTTCTCTCTCTCTCCTTCTCTGTCTGCGTACTCGTTATATTTTTGTACTCGCTG 
CCTTTTTGGCCAACGC AAAAAAT ACCAAAATAAAAAAAT AAT AAAC GTAATTAGCTAGCC 

AAAAAATAAAACAAGT AAAAAT AAGTGGTGTGGAATTTTTGTGTTCATCTGGGAAGGCAGGGCCTGTTCTCGGCGGAATCTTACCTGTGGGTTAA 
ACAGAAGAAAGCCCCAGGCTCTTTGGGCTTTATCTTTGCGTCCTAAAGTATCCAAGTAGTTGTTTCAATATGATTTTGCTGGTTTTTTGTCAGTT 
TGTAAGCTTTGAAATGCTTGGCATTGTTTTTGGTTTTTAAAGTAGTAATTTATATCATTTAAGTGGCACTTGAATTGTATGTTGTTTTTTCTTGT 
TGTATTTGTTTGAAGACTTCTGTTAATACGATTCTGTGAATTTAAAAGTAAATAATTTTATTAAAATTTTCAAAATATTTGTCGGAAATTTTTTG 
TTATATTTTCAGGTTTGATTTTAATTTTAACAAGTATTTCACCGAATCATTCTCAAGTAAATGTATTTAAAAAAACACGCACACTAAGGGACAAT 
TTTGAGATTCAAATCACGGCACGATCGCCCCAGGGGGTGGGGCTGAGGGGGCGGCAGGGGGGCGGTTACGGGAGCAGCAGCAGCATCAACGGCAG 
TCAGCTACCACAAACCACGTAGTTTTTTTCCTTCCAAAATCCACGTACAACTCGCGCACAAACACACGATGTATTTGTATCTCACAGATATGAAA 
ACCGAACGAAGAAAAACCCGAACCGAAACCTCAACCTCAACGAACTGAAAACCGAACGAGAACGAGAGATACACGGCAGCTGAAATGTGTTTTTC 
TACATTTTGTTTGAAATAGGAGCTGAAGCCAGTGCAGCTACCAATTCGCTCCTCAAGTAGTCCCAAACTCTTCCCACACACAGAGTCAGTTGCAG 
TTGAAAAGCGCTTTTCTCGAGGCAACTGGCGATGCAACAAGTTCACTCACACACACACACACACTAGCGCTCTGGGCCCGCTCAATAAGATACAT 
CGGCGGCGAAAGATACAGATAGATATAGAGCGAGAGGCACTCACAATGCAGTTCACTTTTCCAACAGCTATTACCAGCGATCCGCGCGCAGGTGC 
GTTGGCCACGATCAACTCTCGAAGACTGAGCCACGAACCCAAGAAACAGGGCCCGAATGGAAAAAGAGAGAATCGAGAGAATCGCGGGCTGAGAA 
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ATGCGTAGAAAGAGACAAGCGACGAGTAGCGAGCAGTGGCACTAAAACCAGCTTAGTGCACTGTGGAAAAAGTTTTAACAATTCTTAAATATCTG 
AAGAGTAAGGCTCTAATTTTCTGTAAATAACAACAGTATAAATCTATCTGCTTAAATATACTAGATAAATAATGGATGCATTTACATATTTTTCT 
GAATTAACTAGTAACTTATATTCAGTTTGAATATAGTACGACATTTCTTTCAGTGGAGGAGAGGAATTAAACCGAACTCAACCCAAACCCAACCG 
GCCGCCAGTCGCACGCTGCTAAATGGACGGATGGGCCCGTGGACGGACTTATATGGAGACTGGCACTGGCGGAGCGTGGAACGTGCATTCGTACG 
ACGAGTTATTGTCAGTTAGAGCGCTACCTGTTTACCGACCGACTACCGACTACCACCGACTGCTTTTTTTTTTTTGGCCCAGTCGAGAGGGTAGG 
GTACAATTGGCCCGGGTGTTGCCAGATTGCCGTTGCTGCTGCTGCCCATGGATCGTTATGGTGGCTTCACCCGAAAAGGAGGAGGAGGGTGTAAC 
TCTGGTAGTTCCCTGTTTATTGTTATGCCTTTTTAGTGGGGGTGTATTTTGTTCTTCGGCTTTTGTTGCGGTTTCATTTAGGGGAATGACTAATC 
AGAAAGCATTTTTAGGGAGGCAAGGTGATAAGGTTTCTTCGAAGAAGGGGTTGCCAAGTCGAAAGTTGGGGTGAAATAAAGTGCATTACTCACAC 
GACTATATATCTATATTATGACAAGTTCCAATGTTCTACTGATAAGTAAATATTTATTTGGAATAATTAGTTGGTAGCrTTAGGGAAAAATATTT 
ATACATATGTATTTAAATATACACTGATGATTAATCGGCAGCCGAGTTATACAATTCATTTATGGAAATAGGTCGTA 
{SEQ ID NO: 277) 

Exon: 13422.. 13345 
Exon: 12668. .12435 
Exon: 5437 ..4551 
Exon: 3226.. 1001 

Start ATG: 5404 (Reverse strand: CAT) 
Transcript No. : CT9648 

AGTCTTCGAGAGTTGATCGTGGCCAACGCACCTGCGCGCGGATCGCTGGTAATAGCTGTTGGAAAAGTGAACTGCATTAATCGTATTAACAGAAG 
TCTTCAAACAAATACAACAAGAAAAAACAACATACAATTCAAGTGCCACTTAAATGATATAAATTACTACTTTAAAAACCAAAAACAATGCCAAG 
CATTTCAAAGCTTACAAACTGACAAAAAACCAGCAAAATCATATTGAAACAACTACTTGGATACTTTAGGACGCAAAGATAAAGCCCAAAGAGCC 
TGGGGCTTTCTTCTGTTTAACCCACAGTTCCAACGGAATGTGAAGAGCCCCTCCAATATAATGCCCACGGAAAGCTCATCCAGCGAGATCTCCGG 
CGGAGGAGGCGGTGCGATACCCATGCTGCGACCTTCCCGGATGGATCAGTTCATGAACTCGATGGCGGCGGCAGCGGCGGCGGTGGGTGGTGGCG 
GTCTTCCTGGTGCGGCGGATAGGAACGGCGGATCCGGAGGATCGGATGGTGGCTCCCAGAATGGCAACGGCGATTCGCGCAACTCATCGGCCAGC 
CGGATCTCAGCGTATGAAACACAGCTGGCCTATCAGCAGCATTTGGCCGGACTGCATGGTCCGCCGCCACCGCCACCACCATCGCATCATCGCGA 
GATCTCTGCCTTTGTGCCGGTTCTGCCCACCGGAAAGGTGCGCCCGGGTAGTAATTCCAATTACGAAATCATTGCCATGATGGCGGACAAGCGAA 
AGGAGTTGGCCCTAAGGGAGGCGGCTGCGGCGGCGGCAATGTTAGGACGAGGTCCTGGAGGACCCGGCGGTCCTGGTGTGCCACCGCCTGGAGTG 
TTGTACGGACCAGCTGGCGTGCCACCTCCGCCATATCTCACCGGACCGGGTCCATCGCCCACGGGAGCCGGAAGTTTTCCATTTCCACCAGGAGC 
CGCCGCCGCAGCTCTATTTCCACCTGGCCTGGGTCCTGGAATGCATGCCGGACTCGATAGGCGCCTGCTACGTGCTCCTGGTCGGGCATCGCGGC 
CTAAAAAACAGTTCATCTGCAAGTTCTGCAATCGACAATTCACCAAGTCGTACAATCTGCTCATCCACGAGAGGACACACACGGACGAGCGGCCC 
TATTCCTGTGATATCTGCGGCAAGGCGTTCCGGCGACAGGATCACCTGAGGGATCACAGGTATATTCACTCCAAGGAGAAACCCTTCAAATGCAC 
GGAATGCGGCAAGGGATTCTGCCAATCGAGAACCTTGGCTGTCCACAAGATCCTGCACATGGAGGAATCACCCCACAAGTGCCCCGTCTGCAGTC 
GATCATTCAATCAGCGCTCCAACCTGAAGACCCATCTGCTCACCCACACGGATCACAAGCCCTACGAGTGCTCTTCATGCGGCAAAGTTTTCCGC 
CGTAACTGCGATCTACGACGCCATGCCTTGACCCATGCAGTGGGTGAGGTCAACTCCGGGGACTATGTGGATGTGGGCGAAGAGGATGAGGCCAG 
AAATTTAAGTGGCGACGAGGAGGATTCGTTGCTGGAAGTGGACTCGCCTCGCCAGTCGCCAGTTCACAATTTGGGCGAGTCTGGAGGATCGGGTG 
AGAAATCTGAGTCCGAAAGAATGAGACTCAAGCGCAAGGCAGCCATCGATCATGAGGAAAGCGAAGAGGAGTTCGATGACTTCGACGAGGAAGAG 
G AACT GC AGGAT C T GC C AAG AG T AC ACG AT T T GC C C AG GG AAG AGG AC G ACG AC T T T GAT C C C G AGG AC G AGG AGC AGGC AGAAGT GGC ACT AGT 
GGCGCGTTTCCAGGCGAGTAAGGCTGCCGCCACATCGCAATCATCGTCGTCAGTGGGCACCAAGCCGGAACGCCAGGGAGTCACCCACTGTCATC 
ATGAGGGTGGTGAAACCTACACAATGCGGCCACATGGCGAGAAACATCAGGAGGAACCCGGGAACTCTGGCATTGCCAGCCTACCCGTTCCACCT 
TCCTTTGTGCGATATTCTGTACCACCTGGGGCAGCTGGACCGCCGCCTGCACCACCTGGAGCTCCTCCGCCAACCCACCAGCACCCAGGACATCC 
ACACTTACTCCCACCCAACGGGGATCCCTATTTGCCCATTCTGCACGTTCGACGCGACTTGCACCACAAGAGCTTGAACCTTTCGAAGGCTGGTG 
TTCCACCGCCACCACATACACCACCCACTATAATAACCCAGCCGGAGTCGGGTAAACCACCCAATCAACCTTTGCATTCACCGCATGAAGCTATG 
CCCAGCTTCCTGGGATCGATACCCATGCGAAAGAGGATTCTTCCGGCACCGACCTTGGACTTGATGGATCCGCACCACCATCCTGGATTGGGTCA 
AAGGACCTTTGTGGATAGTCCCAGCATATATGCTTTGAATATGTCGCGACATCCCCCCAGACAGCTTCTGGGAAAACCACCGAGTACGGAAACAA 
GTGGAGCCACCACCGAAAAGGGACCACCAGTGGCAGCACCGCCCATAGCACCACCGCCTGCACCACCGAGAAGAACAGGGTTCAGCATTGAGGAC 
ATAATGAGGCGTTAGAAGTAAACCGAGAAAGAACTAGAAAACAATCCATTAATGGAGGTAAAATCGGAATAGAAAAACAACACAATTTGGAAAAC 
TATTTATCAAAGCTCTTACGAATTTGAAATATCAAAAAGTAGTAGTATTCCTAATAAATATATATCTCGGTTGTTTTTTCTGGTATTTCATTTAG 
AAGGTCTTATGAACAGTTGAATATCATAATGACAAACAGTTATTGTTCCCAATGGTTAAACGAGGATTTATCGTTTTAGACCCACAAGTACATTA 
AAATAAATTTAGATCTACAATTTAATACTTTTTAGTACAATGTCTAGGCAAGGCAATAGACCTAAACTAAGTAATAAAATCAGTTAAGCGATAAT 
TGTAAGCGAAACAAACTCCGTTGTTGCATTTTATAGAGCTCTTATGCGTTAGGATTTTAGCTTAAATCTACCGATTTAAATGTTTAAGCCTTTAG 
TACGAGAGTCTATCGATTAGTTTTTAATGTGTGGCCCAACACTAAGTGTTGCTGTTTGCTGAGTGAATTCCAATTTTTTATTTATGTTTCCTTAT 
CCGGTAAGCTAATCTTAATGAGTAAACAAAATCGTTAATCGATTTTGCTGTAGGGAAAAAAATGGTTGAAAGTAAAAGTTAGTGTAAAGCAGAGC 
ATAAACAAAAATCCAACTACAAGCAACATTCCAGTAGAACCTTATGTAATTAGGCTTAAATTTTAAGAGTTTTGCAGTAGTTAATATAAGCAACC 
C AT AG AAT T C CAT GGTCTTCCTT C AAT T T T AAAT G AT T T TAT T C AAACC AT AGCG AGT T AGT T GAT T T C AAAT AAAGT AT GC AT T AAT AATG AAA 
AACAA 

{SEQ ID NO: 278) 

Start ATG: 34 6 (Reverse strand: CAT) 

MPTESSSSEISGGGGGAIPMLRPSRMDQFMNSMAAAAAAVGGGGLPGAADRNGGSGGSDGGSQNGNGDSRNSSASRISAYETQLAYQQHLAGLHG 
PPPPPPPSHHREISAFVPVLPTGKVRPGSNSWYEIIAM^DKRKELALREAAAAAAMLGRGPGGPGGPGVPPPGVLYGPAGVPPPPYLTGPGPSP 
TGAGSFPFPPGAAAAALFPPGLGPGMHAGLDRRLLRAPGRASRPKKQFICKFCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHR 
YIHSKEKPFKCTECGKGFCQSRTLAVHKILHMEESPHKCPVCSRSFNQRSNLKTHLLTHTDHKPYECSSCGKVFRRNCDLRRHALTHAVGEVNSG 
DYVDVGEEDEARNLSGDEEDSLLEVDSPRQSPVHNLGESGGSGEKSESERMRLKRKAAIDHEESEEEFDDFDEEEELQDLPRVHDLPREEDDDFD 
PEDEEQAEVALVARFQASKAAATSQSSSSVGTKPERQGVTHCHHEGGETYTMRPHGEKHQEEPGNSGIASLPVPPSFVRYSVPPGAAGPPPAPPG 
APPPTHQHPGHPHLLPPNGDPYLPILHVRRDLHHKSLNLSKAGVPPPPHTPPTIITQPESGKPPNQPLHSPHEAMPSFLGSIPMRKRILPAPTLD 
LMDPHHHPGLGQRTFVDSPSIYALNMSRHPPRQLLGKPPSTETSGATTEKGPPVAAPPIAPPPAPPRRTGFSIEDIMRR* 
(SEQ ID NO: 279) 

Name: brother of odd with entrails limited 
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Classification: trans cription__f actor 
Gene Symbol: bowl 
FlyBase ID: FBgn0004893 



Celera Sequence No. : 142000013384719 

GCAATTGCTGCCGTAGCTCTTCATTAGATTGGATTGTATTGGATTGATTGAATTGGATTGGATTGGCCGGTTGTGAAAGAGGTCCTGACCCGGTC 
CTGGGCCTCATTACTTCAAATAGAGCGCTGACCAAACGATTTAATTATACCGCCGACGCCCGACAGTTGGCCGGCACTCAGGTGGAGACCGGGTT 
CGGTTATATTTCAAAACCTGACAACACACCGACCAGAGCGCTCAGTCGGCACGCATCCGCCGTGTGAGCCGCATCAGGATCAGCATTCCTTCGAT 
AGCCTTCGAGCACAATTCGAGCACAAGAACTAACCAATATGACCAGAATGATAATACAAAACTCGGGCAGCTGGACACTTTGCGGAGCTGTACTC 
CTGTTCGTCCTGCCGCTGATACCCACGCCGGAGGCACTGCAGCACACCGAGGAGGGTCTCGAGATGCTTTTCCGCGAGCGGTGAGTTGTCAACTC 
CATTGACATCGAACCGTTCCGATGTCTATGTCTCCATCTTTTAGTTCCCAATCGGACTGGGAGAACGTGTGGCACCAGGAGACGCACTCCCGCTG 
CCGTGACAAGCTGGTCCGCCAGCTATATTGGGCCTGCGAGAAGGACATTTACCGACTGACACGGCGCAACAAAAAGAGGACGGGCAATGATGAGG 
CTTGGATCAAGAAGACAACCACGGAACGTAAGTTGTTGAATCGAAATCCATTCCAACCAAAGACGATTGGAATTTCAAGCGATGTACACATGTAC 
ATCTATGCTATAGATCCATTGATATCGATATGATACCACCTGCTGTCTTTCCAGCTGATGGCTCCACCTGGCTGCACGTGAACTATGCCAACATG 
TTTCTGCGAAGTCGTCGATCGGATGGAAACACTCCATCGATATCGAACGAATGCTGCACCAAGGCGGGCTGCACCTGGGAGGAATACGCCGAGTA 
TTGTCCATCCAACAAGCGACGCAATCACTACTGATTGGCCATCGACGTTGGATCTGCTTAGATATATATATATATATACGAACAAACAAATATAT 
ATTCGCTTTTTTTTTCGTGGTTGTTGTTGTTGATGTTCTTGTTGTGGTCAAATAAAATAAACAAAAGGTTTCGGTTAAATAAAAAGAAACTTTAC 
TGGCTTTAACTACAAATTTCGCATAATAAAAGCTAGATTTGCATGATATAACAAATTATATTAATCAATCAATGTTATACAAATTTAACAGTAAA 
ATTGTCTTTGCAGATTAGAATAGTATMTTAAAAAAAAAAAAAAAATTTCACTTTATTCGACTTACAATGATCCTACTTTAAAATGCCCTAAATA 
TGTCCTTAAAGTTATCTACAATACATTTTATACCTCTTCAATTTTGGGGGCAGTAACAATCCTGTAGTGCAATATCACCTAGTCAACAATATCGT 
CCTTTTCGTCCTTGTCGGTCACATCGATCTCCTTACGACTCTTTGCTGAACCCTCAGCCTTGCTGCAGTTGGGTGATTTTCGTGGCCTCTTCGAG 
GCCGAACCCTTTTCATAATGGGTGTCCAGCATCTCGAGCAGGCTCAACTGGCGGCCACCTGAACCCATTGATTTCGTGGCTGTACTGCTCGTTTT 
GGACGTTGTGCTGCTGTTGCTTATTTCTACGCTGCTTTTAGCCTCCAGACTAGCGGCCTCCATCATGGCATTGGCCTCCTCATCGTCCGACAGCA 
TAAGAAGGGCCTGATCTTCCGAAGAATTTGAGTTATCTTGCGATAATTGCGCGGGCAATAGATCGGTACTTTGTTGCGTTATGGTGATTTGCGGT 
CGGGCGGGTTGCTTGTTGAGTTCTGGCGAGGATTTCGCTGATGTTTCTGATTTTCCTTCGCCTAATCCAGCTACTCTAGAGCTTCCAGCTTCGGC 
GGATGCTCCCTCTCCGGGCACATCTCTGACCTTCTTAAGTTCCTCTTTAATAAAATCATATAGCTTCTTTTTGGATGCCTTATTCTCACGCGGCT 
CCTTGGAGCTCTTCTCCTTAATAAGCCTACTGTACGACCTTAAAATACTCCAAAGCTGCTGCACCGTAGTCCCGTCATTTCGAAACAACAGCCAC 
ATTTCATGAAAATCATCCCTAAACTCCACATTTCCAAAGGTATAGTAGGCCAAAGGTCTGCCCAACTGGGCGCAGACCATAAGTTGCAGCAGGGC 
TTTCAGATAGGAGTCACCGCCGAATGCGCCGCAACCCCAGTTACCAGTTGCCACACCCGGTGGCGGCGTCACCATCCAGTGAACAAATCCAATGT 
ACGCCTTGTTCAGCTCCCTTTCCATGAGATCCTCGCGATATTGATGATGTGACTGGGCAAAATGTAGGGCATCGATTGCCACAATGGCCGTTTGT 
CGACGACCTGAGCTATCTCTTGGCGTTGAATCCTCAAAGTTGCCGGACCACTCGAAGCTTCCGGCATATCCCGTATAGTTACTATACCTTTCGGC 
GCCCAACATCACCAGGGCCTCGAATGGTCGCAGACACTCCGTAAAGAGTTTACCCACCAATAGCTCCGGACAGATAACAAAGCGTATCTCCTCCT 
GAACGCAGCCATGTCCCAAGACACCGCCACCCAAATATTTGTTGGCAAAGTCTACTTGCAGCAGTCCAATGCCCTCATCCTCGATTGTTCCCTCG 
GCATCCACGTGCAATGGCACATCACCCAACGGCGCCGCACTTTGGCTCCAGTCGATCAGATGTTCCGGCAATCCGCTCCGACGAACAAAGGTTAC 
C AC AC CG GT G GGC AC AT T GC T GG CAT CCCGCTCTGTGG GAC AC ACGC G AC G AAAAT AGT GC AT AAT GC AT T T AAGC T T C T C C AG AAC T GC CGG T C 
CCGTCGATTGGTAAAGCCTTTTGATGGAATCGAATATATTTAACATACTTCGACTGAAGATATACGAATCTGGTGGACGCTCACCTGTTAAAGTT 
GATGTCTGGAAAAGTGCTGTACTCGGACTTCCTCTTGAGGGTGTTTCTCCGGGGAAAGGTGCACAGGAAGGCATTGGCCAACAGGCAGGAGATCT 
GCTGTTGGCTCAGGCTCAATGAGGCGTTCTTGTGGTGCTTGAGCAGCGGAACTGGCGATTGAATCAAGTCCGGTAGCCGCAATGCCAATCGGATA 
ATGCGCGGCAATAGATCCTCGAAGAAAACCCGTGTTTCGCTCTCGTCCAGTTCCTCGTCGAGAAGTTGATGAAGGGCACGAAAGTGCCACTGATC 
CCTATAGGTGGTATTATATGATATTATCGCCGCCTGCAGTTCCTCACACGTCTTTATGGGCTGCAGAAGGGCTCGTTCGATCATTTCCCAGCGAA 
AATCGATGGTGGTGCTGCCATCCGGATTCTCCCTCGGATATTTCGACTCGGGCGCACAGGGCAGACGCACATGCTCGGAGTCCCACTTTCCCGGT 
GATTTGTAGGGGCGTGGCGGTGTTTCACGAATTGGCAGCTGGTACATAACCCGGTGGAGATTGCCGGCAGTCACTGGTGGCAAATTCTCCAATTC 
GAAAGGCTGCCGATTACGATGTATAGCCTCCATGGAAACTCCACGCCACGAATCATCTAGGGAGTTCGCCAGATTTTCCGGCTCCTCCTCCGTTT 
CTATTTCGGAAATCCCGCCATCCGGCGACTTCGACATCCTGGCGCAGTTTAAGCACAAAGCCTTGCCGAGTCGATTGGTCAGCACGGATGCACTT 
GCTCTGCGGCGATTTGCCGTAGATTGGTAAACCTTTTGGAATATCGGAAAAATCAAGTGTGACCTGAATTCTTGCATACCGGTGTTTTGGGGATG 
AGCGATAGCTGTGATGAATATCGCACTTTATAAATACCACACACTTCGATTGAAGCGTAGTGAATTGAAGCGTTTATATATCGAAATATATTCAA 
ATATATGTTGGTATTCGTTTGAGAATGAATCAATAACAAATTCAAAATTGCATTGCCTTGAATTCGGCGGGCGAAAGTGTGACCATTACATGTGG 
TATTTTACCTCTCATTGTCGCTAGTATTTAAATACCAGCACGAGAAATACCAACACTGTGTCCGTTGCAATGCGAATTGCAAATATAACCGCCGA 
AGGAAC AAC AAC AAAC AAAAAG AAAAAAAAAAC AAATCT T CCAGCAATTTGTC AACTATTT TCT CTAATT T TT T TTT AAAGT GT AAAACT ATCAT 
AACGAGCGCCAAGTTTTACGTTAAGTGAAAGGATTTTTTCCTGTCCTTCGTGCGGTAAAAATAATAGTGCATATTACCAAGGAAAATCAAGTTAA 
AGGCGTCTGCGAAACTTTTATAAATGTATTTAATACTCAAAAAGGTAAGAAGCGGCGGAAAGTAATTGCTGTTTTATGTTTTTATAATCACGTTT 
GGCAAGAAAATGTTATTTCGTTGACGTCAGTGGTCAAATCAAATTTTTTCAGTTTTTAACCCTTAATGTGGCACTGTCACTCGCACCTATGTATA 
TATTTATGTGATTGTGGTATTTTGCTCAAGCTTACGTGTTGAAAAGAGGAAGAGAAACCGCTGCATACACGTGCAAAGACTCTGAGAATGGACAC 
AAGAGAGCGAGAGAGGGAGAGAGAGAGAGCGCAGTTGTTTGTTTTGAAAGCGTCGCAAAAACAATACAAAAAAGGTCATACATAATTTGCACACA 
CACTTGCATCTGAGGAAAACGCAAAAGCAAATTCACAGCTAGCTAAAAACCAAAATAAATATAATTTATGAAATGTACTTTTGCTAAAGTTCAAC 
ACACCTTTATCTCTCCGCGAAATTTGTTTGCACCATCTCTTTCTCTCCTTTTTGCGTAAACAAACGAAAAAGCTTTGCAAGGAGAAGCAAAAAAA 
AAAA 

(SEQ ID NO: 280) 

Exon: 3849. .2935 
Exon: 2867. .1001 

Start ATG: 3782 (Reverse strand: CAT) 
Transcript No. : CT9808 

TCGAAGTGTGTGGTATTTATAAAGTGCGATATTCATCACAGCTATCGCTCATCCCCAAAACACCGGTATGCAAGAATTCAGGTCACACTTGATTT 
TTCCGATATTCCAAAAGGTTTACCAATCTACGGCAAATCGCCGCAGAGCAAGTGCATCCGTGCTGACCAATCGACTCGGCAAGGCTTTGTGCTTA 
AACTGCGCCAGGATGTCGAAGTCGCCGGATGGCGGGATTTCCGAAATAGAAACGGAGGAGGAGCCGGAAAATCTGGCGAACTCCCTAGATGATTC 
GTGGCGTGGAGTTTCCATGGAGGCTATACATCGTAATCGGCAGCCTTTCGAATTGGAGAATTTGCCACCAGTGACTGCCGGCAATCTCCACCGGG 
TTATGTACCAGCTGCCAATTCGTGAAACACCGCCACGCCCCTACAAATCACCGGGAAAGTGGGACTCCGAGCATGTGCGTCTGCCCTGTGCGCCC 
GAGTCGAAATATCCGAGGGAGAATCCGGATGGCAGCACCACCATCGATTTTCGCTGGGAAATGATCGAACGAGCCCTTCTGCAGCCCATAAAGAC 
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GTGTGAGGAACTGCAGGCGGCGATAATATCATATMTACCACCTATAGGGATCAGTGGCACTTTCGTGCCCTTCATCAACTTCTCGACGAGGAAC 
TGGACGAGAGCGAAACACGGGTTTTCTTCGAGGATCTATTGCCGCGCATTATCCGATTGGCATTGCGGCTACCGGACTTGATTCAATCGCCAGTT 
CCGCTGCTCAAGCACCACAAGAACGCCTCATTGAGCCTGAGCCAACAGCAGATCTCCTGCCTGTTGGCCAATGCCTTCCTGTGCACCTTTCCCCG 
GAGAAACACCCTCAAGAGGAAGTCCGAGTACAGCACTTTTCCAGACATCAACTTTAACAGGCTTTACCAATCGACGGGACCGGCAGTTCTGGAGA 
AGCTTAAATGCATTATGCACTATTTTCGTCGCGTGTGTCCCACAGAGCGGGATGCCAGCAATGTGCCCACCGGTGTGGTAACCTTTGTTCGTCGG 
AGCGGATTGCCGGAACATCTGATCGACTGGAGCCAAAGTGCGGCGCCGTTGGGTGATGTGCCATTGCACGTGGATGCCGAGGGAACAATCGAGGA 
TGAGGGCATTGGACTGCTGCAAGTAGACTTTGCCAACAAATATTTGGGTGGCGGTGTCTTGGGACATGGCTGCGTTCAGGAGGAGATACGCTTTG 
TTATCTGTCCGGAGCTATTGGTGGGTAAACTCTTTACGGAGTGTCTGCGACCATTCGAGGCCCTGGTGATGTTGGGCGCCGAAAGGTATAGTAAC 
TATACGGGATATGCCGGAAGCTTCGAGTGGTCCGGCAACTTTGAGGATTCAACGCCAAGAGATAGCTCAGGTCGTCGACAAACGGCCATTGTGGC 
AATCGATGCCCTACATTTTGCCCAGTCACATCATCAATATCGCGAGGATCTCATGGAAAGGGAGCTGAACAAGGCGTACATTGGATTTGTTCACT 
GGATGGTGACGCCGCCACCGGGTGTGGCAACTGGTAACTGGGGTTGCGGCGCATTCGGCGGTGACTCCTATCTGAAAGCCCTGCTGCAACTTATG 
GTCTGCGCCCAGTTGGGCAGACCTTTGGCCTACTATACCTTTGGAAATGTGGAGTTTAGGGATGATTTTCATGAAATGTGGCTGTTGTTTCGAAA 
TGACGGGACTACGGTGCAGCAGCTTTGGAGTATTTTAAGGTCGTACAGTAGGCTTATTAAGGAGAAGAGCTCCAAGGAGCCGCGTGAGAATAAGG 
CATCCAAAAAGAAGCTATATGATTTTATTAAAGAGGAACTTAAGAAGGTCAGAGATGTGCCCGGAGAGGGAGCATCCGCCGAAGCTGGAAGCTCT 
AGAGTAGCTGGATTAGGCGAAGGAAAATCAGAAACATCAGCGAAATCCTCGCCAGAACTCAACAAGCAACCCGCCCGACCGCAAATCACCATAAC 
GCAACAAAGTACCGATCTATTGCCCGCGCAATTATCGCAAGATAACTCAAATTCTTCGGAAGATCAGGCCCTTCTTATGCTGTCGGACGATGAGG 
AGGCCAATGCCATGATGGAGGCCGCTAGTCTGGAGGCTAAAAGCAGCGTAGAAATAAGCAACAGCAGCACAACGTCCAAAACGAGCAGTACAGCC 
ACGAAATCAATGGGTTCAGGTGGCCGCCAGTTGAGCCTGCTCGAGATGCTGGACACCCATTATGAAAAGGGTTCGGCCTCGAAGAGGCCACGAAA 
ATCACCCAACTGCAGCAAGGCTGAGGGTTCAGCAAAGAGTCGTAAGGAGATCGATGTGACCGACAAGGACGAAAAGGACGATATTGTTGACTAGG 
TGATATTGCACTACAGGATTGTTACTGCCCCCAAAATTGAAGAGGTATAAAATGTATTGTAGATAACTTTAAGGACATATTTAGGGCATTTTAAA 
GTAGGATCATTGTAAGTCGAATAAAGTGAAATTTTTTTTTTTTTTTTAATTATACTATTCTAATCTGCAAAGACAATTTTACTGTTAAATTTGTA 
TAACATTGATTGATTAATATAATTTGTTATATCATGCAAATCTAGCTTTTATTATGCGAAATTTGTAGTTAAAGCCAGTAAAGTTTCTTTTTATT 
TAACCGAAACCTTTTGTTTATTTTATTTGACCACAACAAGAACATCAACAACAACAACCACGAAAAAAAAAGCGAATATATATTTGTTTGTTCGT 
ATATATATATATATATCTAAGCAGATC 
(SEQ ID NO: 281) 

Start ATG: 68 (Reverse strand: CAT) 

MQEFRSHLIFPIFQKVYQSTANRRRASASVLTNRLGKALCLNCARMSKSPDGGISEIETEEEPENLAWSLDDSWRGVSMEAIHRNRQPFELEKLP 
PVTAGNLHRVMYQLPIRETPPRPYKSPGKWDSEHVRLPCAPESKYPRENPDGSTTIDFRWEMIERALLQPIKTCEELQAAIISYNTTYRDQWHFR 
ALHQLLDEELDESETRVFFEDLLPRIIRLALRLPDLIQSPVPLLKHHKNASLSLSQQQISCLLANAFLCTFPRRNTLKRKSEYSTFPDINFNRLY 
QSTGPAVLEKLKCIMHYFRRVCPTERDASNVPTGVVTFVRRSGLPEHLIDWSQSAAPLGDVPLHVDAEGTIEDEGIGLLQVDFANKYLGGGVLGH 
GC VQE EIRFVICPELLVGKLFTECLRP FEAL VMLG AERY S N YT G Y AG S FEWS GHFEDSTPRDSS GRRQT A I V A I D ALH F AQ S HHQ YRE DLMEREL 
KKAYIGFVHWMVTPPPGVATGNWGCGAFGGDSYLKALLQLMVCAQLGRPLAYYTFGNVEFRDDFHEl^WLLFRNDGTTVQQLWSILRSYSRLIKEK 
SSKEPRENKASKKKLYDFIKEELKKVRDVPGEGASAEAGSSRVAGLGEGKSETSAKSSPELNKQPARPQITITQQSTDLLPAQLSQDNSNSSEDQ 
ALLMLSDDEEANAMMEAASLEAKSSVEISNSSTTSKTSSTATKSMGSGGRQLSLLEMLDTHYEKGSASKRPRKSPNCSKAEGSAKSRKEIDVTDK 
DEKDDIVD* 
(SEQ ID NO: 282) 

Name: PARG PROTEIN 
Classification: enzyme 
Gene Symbol: Parg 
FlyBase ID: FBgn0023216 



Celera Sequence No. ; 142000013384721 

TTGTATTGCTGCCAAAATACTGCCAATCATTTCGGTCTTAAGTTATGTACATACGTATGAACTTTGTTTGCCCACGGAAAAGCGTTGAAAAAGAA 
AGTCATGTAGCTTGTTAATATGCAAGTCATTAGTATTAAACACTGGACGCGTTGCTCTAAAAATTGTTTGGGTACTTTCTTAAAAATTAAATGGC 
TGATTAGTTACTTTTCGACCAATATCAGCAAGACATTTGGCTATTTAATAATGATTAATTGCCCGATTAACACACAGTTCAAGCCATTGAAGATT 
CTGACTCGATCTCCTTACCCTACAAATTTCCTTGGCCATCATTATCTCTTCTATTTTGGCGTTGATTTTTTTTTATCAGCGTATAGACGTAGTAG 
CTCTTTAAATAATTATCTTTAGACCATATACTAAACTACTTTTCGATTCATGACTTACGGTGGGTAAAGTTGGGAAGCTCCCATCCCTCAGACGT 
GCAAGTAATGGCCAGAGATAACCGTTTGGTGCGATGAAAATATTTAGTTAATGATGGTGCGCAATCTGCATCGCCCACTATACACAACACAGCAA 
CCACCCACCCACTGACTGATGACAGACGGGGCGTGGCAGGTGTGATAAGCGGAGCAATTGTGATGGACAGCGATCTGTAAATTGACAGGCGGCCA 
AGGGCGGGGAAGGGTGAAATGTCAAGCAGATATTCCGGTGGGTAGTTGTGGGCCATAAATAACAATCGCGAGAGCTGCGAGCTGCGGCTCCATTC 
ATTGTGGGTAAGCGAACACCGGGCCGTAAATACTTCCACTTGCCTGTCGATACGATAACATTGCGATTAGGCTTATCACCACTACTCGCTCGTTC 
TAACGCCTTTCGCCATCCTTCGACGTCGAGCAAGCGCGCAGTCGCAGCGGGCAGAGCAGTGCGGCAAAGCGACAAAGCGGCGAGCGGCGAAACCG 
TCAACGGCTTCTTAGCTAAGTTCGTTCACAGTCGACTTCAGTCTGAGGCTGTTGATCGCCGCTTGCGGACGTGGTCGCTCAATCCTGCTCGATTC 
ATAACCGATTCAAAACCGATCCCATCCGAAACGACGTGCGTTCCAATTTTTTTTTTTTTGCAGTTGTTAAATGTTTTAATGTGAGTTACGAGTGC 
G CG AAT G C AAT G G AAC AAT AAAAGC AAT AAC GAT C C G AAAGCT T C GC C AG AG AC AAT AAAAGC AA TAT C AA AAC AC T GAT GT C T AT AT TAT G AAA 
AGATCTTACGAAACAATAAAATCCTTTTAAAGGCCAATGCAGCAAAATAGGATAAAGTTACCAGAGAAAGAGCTTCCTTTTTTTATACAGTTTGA 
AAATCAATTTGCAACACAGAAAATCAATTAAAACTATTAATAATCTGAATGAGATATATTTGCGATATTGTTGGTTTAATTAATTATTTCGCAGT 
GGCAATTGGAAATTTCCCTTAAATATAGCCAATTTGTGGGCTTATTGAACTCCAGACGAGGATCAGTACTTTTTTGCTTTTATTGTCTCTTTTGT 
CTCTTATTGGAAAAAAATAAACAAAATTTACACGTAATATACATATATGTGCATATATTTTTCGTTAAATTAAAAGAAAGTTGAGTTTATGCATC 
TTATACATATGCACATTCCTACAACGTAATGATTTTTAAAGTAAATTTAATATTTTTCTTCGATTTCAAGTGACGCGTAACGGATGTGAAGCGAT 
CTCCGGGTTCCCAACATGACGGATAAAAAAAATGGTATTGTTAATTACTAAAAATAACCACAGATGCATAATTATAGTTTTTACAGAGGGGTGGC 
GATTATACAGACTACTGAACCGAGATTTATGGCCCATATTTTTGTGAGGCTCGAGTTTTACTCATTGTTTCGATTTCTAGACATGCTATATAAGC 
CACACAAAATACACCTCACCATAGATTGCAATAGTGTTCTTATTTAGTTTATTGCACCTATTTACTTCGATACTCCTCACCCTTCAGTTTTAATC 
AGTGGCCGTTGGGTAACGTGCAAGTTATAAGAATTGGCGGAATACATCTCCACTTGCCAGTTATGACTCAACGGACGAACGATTTTCAGAACTCC 
AGGTTCGCAACTACGATAATCGGTTCTCTTGGCCGCGTGTGCAGTTTGCAAATCTAATCGCTCTGATAAGGGACTTTGAGTGACTGGCCCCGGGT 
TTTAGCCACTTGAGCGGCTCCAACACAAGCAACGAGCAGGTCCAAATGTGGATTTACTTTCGACGGTCCCCAAAAAAGAGCTGCCAAACTTGGTC 
AATTGGCACGGACCAACCTTGCCTATATACTTGCCTATTCAAATCTCAAAAAATAAATAACTCGACTTGCCTACACCGATGCAAATTTTCCAATT 
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TGTACAACTGTGTGCGCCTGCATTTCAATGTCAGACGCCGGCTTCATTTGCATATTCGTAGGCAAACTGCCAATGCGGGTGTTTAAAACAAATGC 
GGCTAAAATTGCGATTACGATTTCGTTTGGTATACCAAACACACACACACACACACCACTTGCTGACAAATCGCTGGGAGAGCCTGTAGTAAATG 
CAACCGCTTGCAATTAGTGAGTGAGTAGTTCGCCTGGAATTTGCCATTTCGTATTTGGTTGCTCCATGCGGAGCAGAACCCGGTCGAATTGGATG 
GGGCGGTGCCGATATGGGGTCGTAATTGCCGCTTCGAACGGAACAAGACAATCAAAGTTTTGCTCACTACCGAAAACATCGAACTTATCTATCTG 
ACAACTTCCATTGAATAATTCTGTTTCTTTGCCGCCGTAAAAAGCAATTGTAATTAAAACTGGGCCAAAGCGCCAGCTCCTGCGGCGCTTATCAT 
TTCGCTGGACCAATAAACATCGCGCTCGTTCTAATCGCTTGTAAATCAGTTGGAATGCTTTTAGACCCTCTTCATGGCGATTATGTATGTTATTG 
GGT AG GT GT GA A AT T T AC A AC C TAT T T G AT GG CC AATC G AT G AAC AAACC G AC GATT ATG CG AT ATC AATGC C G AT ATGC AA T GT C AT T AAA GAA 
TCACTCACTACTATATATAGCCAATCCAATTATGTTCATCCGTTCTCGAATTCCAGAAGAATTTCTCTGCATATCACAGTTAGCGATAGTTCAAC 
GCACATCCTAACCACCTATCAATCAATCAATCAATCAATCAATCTATAGGTCCCAGGGCTCTTAGAGCTTAAGTAAAGAAATCTTGAGAAAGTGA 
AATTGAGAACACTTTCACAGCACTTTCCCTCCGAAACTTTACAACTGCACTCAGGGCATACCAGGCTTATCTAATTATTTATAGGTAAAGCCCTC 
AGTGTACTCATTTTTCAATTAGCTTGTTGAAAATTACGGAGTATCAGCGACTCCGATTGGTCTACGGCTTTTATGGAGCCCAATCGAGTGGATTG 
AGTGGAGATCCGTCTGGAGATCAATTTTCCGTCCTTCGGTGACACTTGGCCAGTTCCCTGGCACTAAACACTACGCTATTAGCGGCTGCAAAATG 
AATGAATGAATGAGTGGTTTTGAGTGTGAATGTGAGTGGCAGGCCTGGTCTACCAACTTCAAACGGCCACCGAAGCGGGTCAATGCTGTACACGT 
CTGGTACTTGTATATTTTGTATATCTCTGTTTCCCTGCGATAAGGCGCTTTATGTTAATGGATGTTTGATGTGAAGTGTAGTGCGTGCAATAGCA 
TTGTTTCCACGGTTTATGGAGCCATGTGATGGCTGCTATACATTACTGCCTCAAATAGCGGTCATAAACGCTGATAAAGCATCGATGGATTAGCT 
GCGAACCGATTCTGGATCGCAGTTAGATAAGGCCCATTCATTGGTCGCCATTCGGGACTTTCGCAGTTCGCAGTTCGCTTCCTGAATCATCAACC 
ATCGAACAGATACAGATCCAGTTGTCCGAACATGGCCATGACCATGACTAGCTGCACTAGGCACCAGCTGCAATTACCATTCGATCCACATGGAC 
ATGACCATTAAAGCTCCAGCAGACATTTAGACGCAGCATTGTCAGTAATCAGCATGTTGTTTATGGCGTGGAAAGCCATACTCCCAGCCAGTTTG 
TCAAAGTGTGGAGGCGATCGCCGAGAGAGCTATCTTCCCTAGCTGGGTTATCTGACGCCCGCCACTGGCGTAAACCTCGTAAAAGCTCGGGGCCT 
AAGGCTGGATAATTTATGCCGGGCCACCGAATTCACACGCCCGCCGTTGAAGTAAAGGCCCGTTCAAATATAGAACAGTCCTTATCTGTGGCGGA 
GGTGTCAAGCACAAGGCCACAAGTCTCGGTTTCGGTTTCTGATGAAATGTCGCCGATCCTCAGATCACTAACGAATCTCAGTGGCCATATGAAAT 
GGAGCTGCCGGCGGAGCAAGTACCCCTGCTGGGTGGGCTCTCCAAGTTTTCGATTATTGCATTTCGAATTACTAACCTGGATCCTTCTTTCGTCC 
GCAGAAATACAGCCAGCGGCTGAGCCTGTGGCTGAACCAGAGCCTTCTCCCGCGGCCGAAATAGCTCCTGCAAGTCCCGATGGCTCCGGTCCAAG 
TGCTGCCGCTGCAGAAGAAGTCACCCCGAGTGTTGCGACCGTTGAAGACACTACCCCGATTGCAGCCCCTGCAGAGAGTCAACCGTCCGGCGAGA 
GCAACGGCACGGCGGAGGCCAAGAAGCTGCCCGAGATCGCTAAAAAGAATGGCAAGGAGCCCGACCAGAATGACAAGGAGCCCGACCAGAATGGC 
AAGGAACTGGCAGAGACCAATGACTTCAAGAACGTCTCCGAGACTGAGGGTCAGCAGAAGGTAATCGTCATTGGTGCTCTTTAACAAGTATTGAA 
GCAAACACTGTACTTACCTTTTCATTGACACCCACAGAAACTGCCCTACCCCAAGTCCGTGTTCTTCATCATCAGTAACGAATTCTGCGAACGTT 
TCAACTATTACGGCATGAGGAGTAAGTTAATCGCAAAGTATCTTACAATGAAATCTCTGCTAATCGCTCGTACTCCATTCAGCTGTTTTGGTTCT 
GTATCTGAGTCGAGTTCTGGGCTATTCTGATGATACCGCCACCGTGGTCTTCCATATCTTCACCATGTTCGTGTACTTCCTGTGCGTTTTTGGAG 
CCATCATTTCGGACTCTTGGCTGGGAAAATTCAAGACTATCCTGTATTTATCGCTGGTTTATATATGCGGATCGGTGCTGCTGACCCTAGGTGCC 
ATTGGACCACTTAACCTGCCCATGGAAACGTTTACGATGTTGGGCCTGGCGTTAATTGCCCTTGGTTCAGGCGGCATTAAGCCGTGTGTGTCGGC 
ATTCGGTGGCGATCAGTTTAAGGTGCCGGAGCAGGTGAAGCAGATCACTTCGTTCTTCTCGCTCTTTTACTTTTCGATCAATGCCGGATCGCTGA 
TCTCCACCACGGTGACTCCAATATTGCGCGAGGACGTGTCCTGCTTTGACGACATTAACTGCTATCCATTGGCCTTTGGAGTACCGGCCGTACTT 
ATGATTGTGTCGGTGATCATCTTCGTTTTGGGACGATCCCTGTACAAGATGAAGCCACCGGCCGGAAATATGGTGGTGCTTGTGAGCAGTACCAT 
CTGGACAGCCCTGACCACCAAATGTAAGGAGAAAAAGACCAATCCGCGGGAGCACTGGCTGGACTATGCCGATAAAAAGTACGACCGCCAGTTGA 
TTGACGATGTAAAGGTCCTTATGCGAGTCCTCTTCCTCTACCTGCCACTTCCGGTCTTCTGGGCACTGTTTGATCAGCAGGGCTCCCGTTGGACC 
TTCCAGGCCACCCGAATGGACGGCGACATGGGCTCCTGGGACATCAAGCCCGACCAGTTGCAAGTGCTCAATCCTCTGCTCATCCTAATCTTTAT 
ACCGTTGTACGATGTGGCCCTCTATCCGGCCCTGAAACTAGTGGGTATCCGACGACCGCTGCAGAAACTCACCATGGGCGGCATCCTTGCGGGCA 
TTGCTTTCATCATTTCCGGAGTGGTGGAGCTTAGCTTGGAGGTGAGTAATTGTTCGAAAATGCTTGTTGGCTTGGGACTTAAAGCTCTGAAAGTA 
TTTAATTGGGAAGTTTTAAAGTATTTTTAACTGCTGTGCTTATATATATTGTATTGTACTGTACCTGCACCTGCACTTTCCGTTTTTCTTTCGTT 
GCACGCCTTGCTGTCGTTAGAAAACCTATCCCGACCTGCCCTACAGCCAGAACATCCAACTGCGCATCTTCAACGCGGACAACTGCGACTACTTC 
TTCACTTCGAACATTCCGGGAGCGGAAAATGTGACGGTGACCTCGTTGAATGCCTATACCAACAAGGATATTTACGCACCCGGTTCGCTGGACGT 
GGAGTTCTCGCTCAGCAGTGCAAGTGAAGCTTGTACCATTAGCGCTGACAAAATACCTAAAGTCCTTACTGATAATACGGCTTGGTCTCTGTTCC 
TCAATCCGCTAAATGGTAGTGGATACTACTGGGAGGAGGATTTTGTGGACAAGCCGAGCCAGAGCCATCCGCTGATCCGAAACGTCGCTAATGTG 
CCGCAGGAGCTGTCAATTCGTTGGGAAAAGGCTAGCACGGGAGAGGTGGTCTATGACGACCTGGCTAGCAAAACGTTGCTCACGCGTGTGAATTC 
GGCGGAGTATGTAGTCATGGTGGGCGAAAGGAACGTCGGAACATTCGATGTTCACACCGGCGGCGTGTACACACTCCTGATTAAAGATAACAACA 
ACATTTACGAGGCCAATTTGGAGGAAGTGACGAGCCCCAACTCGGTAAAGATCCTTTGGCTGGTGCCGCAGTACGTGGTGATGACCCTCGGCGAG 
GTGATGTTCTCGGTGACGGGACTGGAGTTCTCATACGCCCAGGCACCACCCAGCATGAAGAGTGTCCTGCAGGCCTGTTGGCTGCTAACGGTGGC 
CTTCGGCAACGTCATTGTGGTCGTCATCGCCGAGGCCGCCTTATTCGAATCGCAGGCCAGCGAGTTCTTCCTGTTCGCCGGCCTCATGTTCGCCG 
ACATGTTGATCTTCATGGTGATGGCCTACTACTATGTGCCCAACGATCCGAACAAGGTGGAGGAGGCCCAGCCCCTGACGGCGGGTGATGCCAAA 
ACAGAGATTCCCGCTCTCGAAGGCGGAGTCGAGAACAAGGGCAAGGACATCAACGAGTAGACCTCAAACCGATTCGTCCTGCTTGCTCCCGCGTT 
GTGTATTATATATATATATATTTGTACTTGTAGATGTGTATATGTATGTAAATCCTTAGTGGCCTTTGTCCATTTCCAAGCTAAGTGTAAGTGAA 
GCTGCGACCGGGCATTGTTGCATATGAAAAGACATCCAACTAGTTAATCTAATAAAAGAAAATTAATAATATAATCCGTCGTTGTTACTATTCTT 
GTTGTCTCTCTATCTGCTTTAATACTCTTAATTGTTTCTTTTCACCTATGACCTGGATTTTACTCAGTATGATAGTCTTATACGATATCATTCCG 
GAGATTTCTGGTATTGCTCCGATCTAATATATTAGTACCAGCTTGCACGTATATCTTCCACCCTTGCAGCACACTTACCCATTGATCCCGAGCAG 
TGGTAATACCCAGCTACGCGTCTTCAGCGGTATTCCCGACTGCGATTATCGCTTTAGTACGAACCTCGGAGCCCGGGAGAGCTTCACCTTAAGGG 
GGCTGGACTTCTACTATTCGGGTAGCGTGGCCACGAACACCGACGGCATCTTCCAGTTAACATACACCGTGGAAAGTTTAACTGAAGGTATGTTA 
CCACGTTATCGCGATCCTCCTATTCAAGAGAACTTGTTGATGTAAGAACTCGTGAATATGTTTAAAGCCCACCATCCTCTTTGGAGCGTTAAGCT 
GTCAACACAAATAGAATCACTTCAGTGGTCACAAGGTTTTCGGATGTTTGAGAGTATCTTAGAATGGCTGATAAGCCATTAGTAGTAATTACTAG 
TAACTTTTTTGGGTAGTATTTTGAATTGGGTATGGGTAGTATAGTTGAAATGGGTATTCTACCACAATTTGTACTTTATTCTGAATGGTGTACCC 
CTTAGGTTGCGCAATCCTGGAGGGCGGAACACAGCAGTTGTATGAGGCCACCGCCCACTCACTGTTCCTGCGACCTTCGCACGCCATCACTAAGT 
TTTGGTACGAGGACGAGGTGCAGAAGTCGAATCGCTCGTGGGCTTTTGTGCGAAACTTGGCGAATCTCTTGGCCACCACGCGTGTCGTTTGGCGG 
GAGGATTCAGGGGATACGGTGGCCCTGGATTTGCCCGCCCGCAATCATGATCTCTACGAGCTAAGTACGGGCGGTTACAACGTTCAGGTGGCGGA 
TCACAATCTTACCAGCATAGAGTTACGCGCTGGGGGCGTTTACGCCCTGCTGATCGGCCAGAGTCACGGAACATTCGTATCGCGAATCATAGAGG 
TAACCAGCCCCAATTCGATGCACATGCTCTGGCTGGTGCCGCAGTACGTGGTAATGACCCTCGGTGAGGTGATGTTCTCGGTGACGGGACTGGAG 
TTCTCCTATTCCCAGTCGCCGCCCAGCATGAAAAGTGTCCTGCAGGCCTGCTGGCTGCTAACAGTGGCCTTCGGCAACGTTATCGTCGTGGTTGT 
GGCGGAACTCAAGTTCTTCGACTCGCAAGCCAGCGAGTTCTTCCTGTTCGCTGGCCTCATGTTTGTCGACATGCTGGTCTTCATGTTCGTGGCCT 
ACTACTATAAGCCATATGATGAGGTGGCTGCTCTGGCGGAGAGGGTAAGGGCCAGACGGGAACGGAAATGGCAGGATCTCGACCGATTCAAGAGG 
GACTAGGAATACGGATGCCTGATTGTCATTTGAATCTGTCCGCTTATTATATCCTTATGTGCAGTTCTTTTACGATTCGAAATAAATATCCCCCA 
AATATGGCAGAACAAAGACTTTTTAATATCCGCCATCATCCTTTGAATTGTTTTAATACGCTAAGTTGAGTGTTAAATAAGTTATATTAAATGTT 
ACTGACCATTGCCTTTACATTTATCCAGCGCTCAACTTGTTTATTACCGCCAGTGGACTCTGCACTTAGTTTATCATTAAAATATAAGTCCACTG 
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CACTTAGTATATCAATAAAATATMGTCTACTGCACTTAGTATATCATTAAAATATAAAATGTAGGTTTATATTAGTATAACTGTAACGAATATT 
CAAGCTCGCGCGGAGTTGCCAGATCACAACTTAACAGCGTGCTGTTATGCGCATGTTGTCACCTGACATTCCGCCTTTTTTAGTACCTTTTGCGC 
TGACATTTGGTCGAGAAAAATATAAGAAGAGGAAGAGTTGAAAATCATATGACTTGTCGATCGATAGAGAAAAAAGGTTGAATTTTCACCGTCGC 
TCCGTATCGAACTGCACGCCCTTCGCCACACAATTCGCGCAACTTTCGGTCGAGCGAGAGCAGTCCCATAGAAAGGAATCCAGCGCAATTCGCCA 
CCTCTTCAACTTCCAGTCAGCTCCTCTGCGTGTGTGTGTGTGCGTGTGTCATCAAGATGAACAGCTCGGGCTTTATGTTTAACATCGATAATGGA 
TATCTGGAGGGTCTGTGCCGCGGCTTCAAGTGCGGCATCCTGAAGCAGGCTGACTATCTCAACCTGGTGCAGTGCGAGACCTTGGAGGGTAAGTA 
GTAGAGCCATCGCAAGCGTAAACCGAAAACCCCATTTTCCCACACGCCCCTGGGTCACATGACCAGCGATAACAGAGTGGTGCGTGTGTGTATAT 
ATATACATGTACGGCTGTTAATGTATCTTTGCGCCGGTGTGTGTAAGCTTGTCTTGTGATTTGTTGACGTTTTTCAATT 
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GTTGATCGCCGCTTGCGGACGTGGTCGCTCAATCCTGCTCGATTCATAACCGATTCAAAACCGATCCCATCCGAAACGACGTGCGTTCCAATTTT 
TTTTTTTTTGCAGTTGTTAAATGTTTTAATTGACGCGTAACGGATGTGAAGCGATCTCCGGGTTCCCAACATGACGGATAAAAAAAATGAAATAC 
AGCCAGCGGCTGAGCCTGTGGCTGAACCAGAGCCTTCTCCCGCGGCCGAAATAGCTCCTGCAAGTCCCGATGGCTCCGGTCCAAGTGCTGCCGCT 
GCAGAAGAAGTCACCCCGAGTGTTGCGACCGTTGAAGACACTACCCCGATTGCAGCCCCTGCAGAGAGTCAACCGTCCGGCGAGAGCAACGGCAC 
GGCGGAGGCCAAGAAGCTGCCCGAGATCGCTAAAAAGAATGGCAAGGAGCCCGACCAGAATGACAAGGAGCCCGACCAGAATGGCAAGGAACTGG 
CAGAGACCAATGACTTCAAGAACGTCTCCGAGACTGAGGGTCAGCAGAAGAAACTGCCCTACCCCAAGTCCGTGTTCTTCATCATCAGTAACGAA 
TTCTGCGAACGTTTCAACTATTACGGCATGAGGACTGTTTTGGTTCTGTATCTGAGTCGAGTTCTGGGCTATTCTGATGATACCGCCACCGTGGT 
CTTCCATATCTTCACCATGTTCGTGTACTTCCTGTGCGTTTTTGGAGCCATCATTTCGGACTCTTGGCTGGGAAAATTCAAGACTATCCTGTATT 
TATCGCTGGTTTATATATGCGGATCGGTGCTGCTGACCCTAGGTGCCATTGGACCACTTAACCTGCCCATGGAAACGTTTACGATGTTGGGCCTG 
GCGTTAATTGCCCTTGGTTCAGGCGGCATTAAGCCGTGTGTGTCGGCATTCGGTGGCGATCAGTTTAAGGTGCCGGAGCAGGTGAAGCAGATCAC 
TTCGTTCTTCTCGCTCTTTTACTTTTCGATCAATGCCGGATCGCTGATCTCCACCACGGTGACTCCAATATTGCGCGAGGACGTGTCCTGCTTTG 
ACGACATTAACTGCTATCCATTGGCCTTTGGAGTACCGGCCGTACTTATGATTGTGTCGGTGATCATCTTCGTTTTGGGACGATCCCTGTACAAG 
ATGAAGCCACCGGCCGGAAATATGGTGGTGCTTGTGAGCAGTACCATCTGGACAGCCCTGACCACCAAATGTAAGGAGAAAAAGACCAATCCGCG 
GGAGCACTGGCTGGACTATGCCGATAAAAAGTACGACCGCCAGTTGATTGACGATGTAAAGGTCCTTATGCGAGTCCTCTTCCTCTACCTGCCAC 
TTCCGGTCTTCTGGGCACTGTTTGATCAGCAGGGCTCCCGTTGGACCTTCCAGGCCACCCGAATGGACGGCGACATGGGCTCCTGGGACATCAAG 
CCCGACCAGTTGCAAGTGCTCAATCCTCTGCTCATCCTAATCTTTATACCGTTGTACGATGTGGCCCTCTATCCGGCCCTGAAACTAGTGGGTAT 
CCGACGACCGCTGCAGAAACTCACCATGGGCGGCATCCTTGCGGGCATTGCTTTCATCATTTCCGGAGTGGTGGAGCTTAGCTTGGAGAAAACCT 
ATCCCGACCTGCCCTACAGCCAGAACATCCAACTGCGCATCTTCAACGCGGACAACTGCGACTACTTCTTCACTTCGAACATTCCGGGAGCGGAA 
AATGTGACGGTGACCTCGTTGAATGCCTATACCAACAAGGATATTTACGCACCCGGTTCGCTGGACGTGGAGTTCTCGCTCAGCAGTGCAAGTGA 
AGCTTGTACCATTAGCGCTGACAAAATACCTAAAGTCCTTACTGATAATACGGCTTGGTCTCTGTTCCTCAATCCGCTAAATGGTAGTGGATACT 
ACTGGGAGGAGGATTTTGTGGACAAGCCGAGCCAGAGCCATCCGCTGATCCGAAACGTCGCTAATGTGCCGCAGGAGCTGTCAATTCGTTGGGAA 
AAGGCTAGCACGGGAGAGGTGGTCTATGACGACCTGGCTAGCAAAACGTTGCTCACGCGTGTGAATTCGGCGGAGTATGTAGTCATGGTGGGCGA 
AAGGAACGTCGGAACATTCGATGTTCACACCGGCGGCGTGTACACACTCCTGATTAAAGATAACAACAACATTTACGAGGCCAATTTGGAGGAAG 
TGACGAGCCCCAACTCGGTAAAGATCCTTTGGCTGGTGCCGCAGTACGTGGTGATGACCCTCGGCGAGGTGATGTTCTCGGTGACGGGACTGGAG 
TTCTCATACGCCCAGGCACCACCCAGCATGAAGAGTGTCCTGCAGGCCTGTTGGCTGCTAACGGTGGCCTTCGGCAACGTCATTGTGGTCGTCAT 
CGCCGAGGCCGCCTTATTCGAATCGCAGGCCAGCGAGTTCTTCCTGTTCGCCGGCCTCATGTTCGCCGACATGTTGATCTTCATGGTGATGGCCT 
ACTACTATGTGCCCAACGATCCGAACAAGGTGGAGGAGGCCCAGCCCCTGACGGCGGGTGATGCCAAAACAGAGATTCCCGCTCTCGAAGGCGGA 
GTCGAGAACAAGGGCAAGGACATCAACGATGGTAATACCCAGCTACGCGTCTTCAGCGGTATTCCCGACTGCGATTATCGCTTTAGTACGAACCT 
CGGAGCCCGGGAGAGCTTCACCTTAAGGGGGCTGGACTTCTACTATTCGGGTAGCGTGGCCACGAACACCGACGGCATCTTCCAGTTAACATACA 
CCGTGGAAAGTTTAACTGAAGGTTGCGCAATCCTGGAGGGCGGAACACAGCAGTTGTATGAGGCCACCGCCCACTCACTGTTCCTGCGACCTTCG 
CACGCCATCACTAAGTTTTGGTACGAGGACGAGGTGCAGAAGTCGAATCGCTCGTGGGCTTTTGTGCGAAACTTGGCGAATCTCTTGGCCACCAC 
GCGTGTCGTTTGGCGGGAGGATTCAGGGGATACGGTGGCCCTGGATTTGCCCGCCCGCAATCATGATCTCTACGAGCTAAGTACGGGCGGTTACA 
ACGTTCAGGTGGCGGATCACAATCTTACCAGCATAGAGTTACGCGCTGGGGGCGTTTACGCCCTGCTGATCGGCCAGAGTCACGGAACATTCGTA 
TCGCGAATCATAGAGGTAACCAGCCCCAATTCGATGCACATGCTCTGGCTGGTGCCGCAGTACGTGGTAATGACCCTCGGTGAGGTGATGTTCTC 
GGTGACGGGACTGGAGTTCTCCTATTCCCAGTCGCCGCCCAGCATGAAAAGTGTCCTGCAGGCCTGCTGGCTGCTAACAGTGGCCTTCGGCAACG 
TTATCGTCGTGGTTGTGGCGGAACTCAAGTTCTTCGACTCGCAAGCCAGCGAGTTCTTCCTGTTCGCTGGCCTCATGTTTGTCGACATGCTGGTC 
TTCATGTTCGTGGCCTACTACTATAAGCCATATGATGAGGTGGCTGCTCTGGCGGAGAGGGTAAGGGCCAGACGGGAACGGAAATGGCAGGATCT 
CGACCGATTCAAGAGGGACTAGGAATACGGATGCCTGATTGTCAT 
(SEQ ID NO: 284} 

Start ATG: 166 

MTDKKNEIQPAAEPVAEPEPSPAAEIAPASPDGSGPSAAAAEEVTPSVATVEDTTPIAAPAESQPSGESNGTAEAKKLPEIAKKNGKEPDQNDKE 
PDQNGKELAETNDFKNVSETEGQQKKLP YPKS VFFI I SNEFCERFN YYGMRT VLVLYLSRVLGYSDDT AT VVFHI FTMFVYFLCVFGAI I SDSWL 
GKFKTILYLSLVYICGSVLLTLGAIGPLNLPMETFTMLGLALIALGSGGIKPCVSAFGGDQFKVPEQVKQITSFFSLFYFSINAGSLISTTVTPI 
LREDVSCFDDINCYPLAFGVPAVLMIVSVIIFVLGRSLYKMKPPAGNMVVLVSSTIWTALTTKCKEKKTNPREHWLDYADKKYDRQLIDDVKVLM 
RVLFLYLPLPVFWALFDQQGSRWTFQATRMDGDMGSWDIKPDQLQVLNPLLILIFIPLYDVALYPALKLVGIRRPLQKLTMGGILAGIAFIISGV 
VELSLEKTYPDLPYSQNIQLRIFNADNCDYFFTSNIPGAENVTVTSLNAYTNKDIYAPGSLDVEFSLSSASEACTISADKIPKVLTDNTAWSLFL 
NPLNGSGYYWEEDFVDKPSQSHPLIRNVANVPQELSIRWEKASTGEVVYDDLASKTLLTRVNSAEYVVMVGERNVGTFDVHTGGVYTLLIKDWNN 
IYEANLEEVTSPNSVKILWLVPQYVVMTLGEVMFSVTGLEFSYAQAPPSMKSVLQACWLLTVAFGNVIVVVIAEAALFESQASEFFLFAGLMFAD 
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MLIFMVMAYYYVPNDPMKVEEAQPLTAGDAKTEIPALEGGVENKGKDINDGNTQLRVFSGIPDCDYRFSTNLGARESFTLRGLDFYYSGSVATNT 
DGIFQLTYTVESLTEGCAILEGGTQQLYEATAHSLFLRPSHAITKFWYEDEVQKSNRSWAFVRNLANLLATTRVVWREDSGDTVALDLPARNHDL 
YELSTGGYWVQVADHNLTSIELRAGGVYALLIGQSHGTFVSRIIEVTSPKSMHMLWLVPQYVVMTLGEVMFSVTGLEFSYSQSPPSMKSVLQACW 
LLTVAFGHVIVVVVAELKFFDSQASEFFLFAGLMFVDMLVFMFVAYYYKPYDEVAALAERVRARRERKWQDLDRFKRD* 
(SEQ ID NO: 285) 

Name: oligopeptide transporter 
Classification: transporter 
Gene Symbol: BcDNA:GH06717 
FlyBase ID: FBgn0028491 



Celera Sequence No. : 142000013384832 

CCAAGATAACGAATGACTCTTCGTCCGACATGTCTCCGTTGTCGCGGTAGCAATGGCGGGTATTAATAATCAGATTTTACGACTGTGAACGCAGG 
TGAGGGAATTCCCCTCTTTCGAAAACTAGACGGCAAAGTATTAGCCAAATCGATTAATTTGATATCGGCAATAGTAAGGATAAATGTATAATAAA 
TGAGTTTTAGGGCAATGATTCATTAGGCGTGGGAATTACATCGACGCGTCACAATGGAATAAGCAATTGTCATGTTTATTGCATATATACCTTAT 
ATACTACTGCGTTCCTTATCTCGTTTTGCTTTGCTGACAAGCTTTAAGGAAATAAAATTGTTTTCGGAGTAATATTTACTTTTAACTAATTTACC 
AAACAACTTAGACCAGTGTGACCGCAATCGTTTCAAATGTGCCTTTTTTTGTGTCATTGATTGTCATATTGTCATAATCGGAAATGTTATCGACA 
ATCGGCGAGCTCGTCTATCTAAATTTAAAAGCAGCTAATTTCAATAAAACTGATTACACGCCCAAAACTCGTGCATATGAAGTGCAGTTTGCACT 
GCAAGGGCGGTGGAAAGTCCCGGCATCATTTATTCAAGCCCAGTGACGTGCATCCCAATTTACCTTCCTTTGTTGCTCAGCTGAGATTAATGGGC 
GCGTCCTAAAGGCCTGAGCCTGACAAACCAGTTGCCAGCGAAATTACACCGAAATGTAATTGCCAAAACTCGCAAATTCTAATGGACGGCGCAGT 
GCGTTCTGAACCCGCCAAACCGGATTCAGTACCTCCATTGTTCGGCTCAACTCTGTAGATGTGGGTTAATAAGGCTTTCGCGTGTTAAATATAGG 
TGTGAGTGGAGCTCGCGACAAGGAACTGGTGGCTGGCCGAGATGCGACTTTCGTATTACCATAACCAACCGGTTCCTGGCTATGTGCACAGACCG 
CATCCCCATCCTCATTTCCATCCCAATCCACACATTTTAGAATTAATACGTTCGTAACAATGATAATCATCATGCGGTTTATTACTCAAGAATCT 
CTATGTGATCTTACGACTAATGCACGCGAATAAATAAAATTAAAAAAACTTAAACGAATACGATACGAAATTCCATTAAAGCTCAAGTGCATTCA 
AGAAAACTCCTCAAACGGCTACAACTAGAAGACTCACACGTTCACTCTCGCACTCTATGATGCAGGATCTCCTCATCTGCTCTATCACATTCACA 
CAAGGACTTAGGTCTAGGCCTGCGGATCCTCTGACCGCGATCTTACGCCTGTTGCTGGCGGAACAGACGCCTCTGCGTCTGGTGGGGTGGCTGGA 
GGTCGCTCTGCAGGATAGAGGCGAAGTGCTGCTGCTTTAAGGAGCTGATTGTGGCGATCTTGCGGTGCCTGTTGGGCTGAGCGGCGGCCAGACCA 
AGATGAAAGTAGGGCAGGTCTTCGTCCCGCGGCAGTGGCATCATCTTGTCCTCCTCCACCGGGGAGCCATCGGGTCGCACCCGGATCACCAAAGG 
TCCGGTGAGGAATCTTTTGGACGGTTCTGAAGTCAGTGCGGCGGCCATTGGGAGCTCACTGCTGGACTGCGGCCCGTAGCCGTAGGCCTTGTCCA 
TGTCCATGGCGTATTTGTGCTTCTGCTCGCCGCTCTCGAAGATTGGATTGGTGGCGGAGCCGGAGCTCTGGATGGGGTGGGCATGGGATGCTCCG 
TGCGGAAGTTGCTGGCGCGTTAGGATGATCAGAGAGGACGCAGGAGGAGTTGGCAGGGGCTGTTGGAGCGGCTGCCTCGTTGTCACCAGGCTGGC 
TCCGCTCGGATGGTCCCCCATAATCGATGACCAGACTGCGGAAGAAAAGAGCGATTAGATCGACACCGAGATTTAGAGCTCACGTCGAGCACGTC 
GCGACTGCGCACTGCGGAAGTTCTGACGCGGAAATCAATCATTCTCCCACGCAGATAGAGGCACTTACTCAGCTCGTTGGCATCGCGCTGATACG 
GATCTAGTGCCTCCAGCTGTACGTCGGAGAGCGTCTGCAGCTTGGCGGACCCGTAACCGAGCAGGTTGTGCTTGCACTGGTTCACTGGGAGGCCT 
CCGACGGCGGCTGTTGACTTGCAGTGCACGCTTACCAATTGAGCGAATGTTGAGTAGAGGTGGGACATCTGCAAAGTAACAAAATTGGTGGTTAT 
TTTAATGGCGTCTTGTGTTTAAGAATCAAGTTATAGAAATTAGAAAGCAGATCATCGCTTCTCTGTTCACCAGCCGAATATGGTATTGATTTGTC 
AT AAAC GAAAAT C GG AG AAC AT AT G T C G GT G AT AT T AG AAAAAAAAAG AAAAAAG AAAG T T CGAAGC AAT CAT ACT T C G ACC AG GTC T C TTGGGT 
AACGCAGGGCAGAGGTGGAGCTGAAGCCTGGTGCTCTCAATGGGGGGATTTCGCAAGGTGCGTGTGTGTCAGCCGGGAGATGATTGCACTTTCGT 
TGCACAAACGAAATGCATTTCGACAGCGCGGCCCGCGGGTTGAGCAACTGCCTCCCAGGGCGAGATTGGATGGCTGCCATCTGGTTCATGTGCCC 
CAGTTCGGATGGGCTGGGTATGAGTATGGGTTTACGGTTCAGCGGCCAAAGCAACAGCTCATTGTCTATTCTCCGTGGCCAGCGAAGCGTCGAAT 
GGACTTTTAATTAGCGCCGAATGTTCTACTGCGGTGACACATCTGCATCTGAGATTTGTGGGATTTCTCTGGTCGTCGGTGACGTCAAGTGGATC 
GCTTCAGCTCCGATGCTAATTTAGTGGTGGTGATACAGAGCAGAAGCTCCCTGAAATTTCCGAATCCGGCATTGGTCACGACTTAGCACCTCTCC 
CGACAGACAGCCAAACAATCGAAGAGTCGACGAGCAAACAGCAGAACAAAGAAATAAAAAGAAATGGGGAACTGGGTTACTTCACTAGGGGATCC 
AGTATCGAGTCTGTAGCCTCCTTCTAGATATATGTATATATATATATATATGTATTTGTATACACCCTTGATCTGCTTCAAAATCTAGCAAGAGC 
TTCATTTGCATTGGCTCTGCTTTGGGTTTTACAATTTTACGCCGTTTGCATAATGCATTGGATATATCGGATCGGCGGACTCGGAGGGGCCAAAG 
TTGGAGAGTTGGGTTGTTGGCGGGTTGGCTCACATTACCAATTACCGTTACCCACTCAGTGGGGCCACACAAACGTGTGCAAACTGCGAATTGCA 
GTGGATACCATCGCACCACCTCAGCACCCCCCTGGGTGTGCATTTGCTCAGATGTGTACTTGAGTGGGGCAGCGATTAATTGAACGAATCGGTTC 
CTTTGTGCCCACCCCTCCGCCAGTGGAAGCCACTCTGCGTCAGGAGCACGTTATCTGATTCGAATAACTCGGTCGGGATTTTATGGTCCAGGCCC 
TGGCACAATTAAATATATAACCCAGTCAAAATTGCAAATTCAGCAAAACAGCAATTTATTTATGGCTTAGTGCGCAAATATTCCAAAAAGCCTTT 
GCGGGAGTTGTCAAAGTCACGTACTTAATTCGCTTATATAATATACGCTCGTTTTTATAAAAATATGTTTACTCGAGATCGTCGCGGACAGGTTG 
TAACTCATTAGCCTTCCAGCGAAGTGAGCTAAGCTAAGTTGAGCTGAGCTGGCATCGCCTGGTTTGTCCACAAGTCTAGCCTATAGACAGACTGA 
TCTTGGCCCGAGCCATCTGTATCTGTATCTCAAACTTCCTTGTGGCTATGCGAGATACTCAGTCGACGCACAACCCCCGAGAAGCGAGACCCAGA 
GATGGCCAAAAGCGGTCTCGTGGGTCTGCAGACGCGACTGCTTCCCGGGCCGCAAGTGGTGGTGGTAAACCCCTGGCCCATACCACATGCCATCC 
CCAGCGGATTGGCGGTCAATAACCTGTTGATATGCGCAGGTTACATTATGAAACATCGAAATGGCCAACCACAGAGACCAACGCACAGACAGCCA 
CGAAAATGAAAAATTTCATAACAGACAAAATAAAAAGCTAAAAGACGAAGTAATGATCGAAGTTCGATCGGGTATCGGATGGGAAAAGGGTTCCA 
TTACTACACACATGCTTGTGTATCCCAGCATATTTTACGTAAAAACAAAACGGTAATGCGAACATAACTTATTTATTGGGGCCCGGACCGCAAAC 
CGGCCAAACGCGTTTGCACCCATAAAAACATAAGGGCAACAAAAAAATTGTTAAGCTGTTGTTTATTTTTGCAATCGAAAATCGAAAGCGCTCAA 
ATAGCTGCGATCACTCGGGAGCAGGGTAAAGTCGCCTCGAAACAGGAAGCTGAAGCATCTTCTATAAATACACTCAAAGCGATCATTCCGAGGCG 
AGTCTGGTTAGAAATTTACATGGACTGCAAAAAGGTATAGCCCCACAAACTGCGTCTCCACATCGCTGCGTTTCGGCAGCTAATTGCCTTTTAGA 
AATTATTTTCCCATTTCGAGAAACTCGTGTGGGATGCCGGATGCGGCTTTCAATCACTTCTGGCCCGGGATCGGATTGGGTCACATTGTCTGCGG 
GCTCTATTGTCTCGATCCGCGGATTTAAGGCGCAGTTCGCGTGCTTAGCGGTCAGAAAGGCAGAGATTCGGTTCGGATTGATGCGCTGGCAGCAG 
GGCACAAAGATCTAATGACTGGCAAATCGCTACAAATAAATTAAAGTCCGGCGGCTAATTAATGAGCGGACTGAAGCCACTTTGGCGGAGTTCAT 
TAACCAAAAAACAGCAGATAAACAAAAACGGCAAAGAAAATTGCCACAGAGTTGTCACGCTTTGTTGCACAAACATTTGTGCAGAAAAGTGAAAA 
GCTTTTAGCCATTATTAAGTTTTTCCTCAGCTCGCTGGCAGCACTTGCGAATGTAAATTGAAACTGATGTTCCTCATAAATGAAAATTAATGTTT 
GCTCTACGCTCCACCGAACTCGCTTGTTTGGGGGATTGGCTGGCTAATCGCGGCTAGATCCCAGGCGGTATAACCTTTTCGCTTCATCAGTTGTG 
AAACCAGATGGCTGGTGTTTTGGCAACGACAGCCAGCGGACTCCCCTCGAACGCTCTCGAAATCAAGTGGCTTTCCAGCCGGCCCGCTGGGCCGC 
TCGCCCACTGGACCGGTATTCCCAGGCCAGGCCACACTGTACCGCACCGCATAATCCTCGCCAGACTCGGCGCTGATAAGGCCCAATGTCAAGAC 
CGCACTCCGCAGGCGTCTATTTATGCCAAGGACCGTTCTTCTTCAGCTTTCGGCTCGAGTATTTGTTGTGCCATGTTGGTTACGATGCCAATCGC 
GGTACAGTTATGCAAATGAGCAGCGAATACCGCTCACTGACAATGAACGGCGTCTTGTCAACCATTCATATTCATGCTGACATTCATATTCATTC 
CTTTGGTTTTTTGTCTTCGACGGACTGAAAAGTGCGGAGAGAAACCCAAAAACAGAAGCGCGCAAAGCGCCGTTAATATGCGAACTCAGCGAACT 
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CATTGAAGTTATCACAACACCATATCCATAACCATACCCATATCCATATCAATATCAATATCGCTATTATTAACGATCATGCTCTGCTGATCAAG 

TATTCAGCGCTGCGCTAGATTCGACAGATTGAATCGAGCTCAATAGACTCAACAGACTCCACTCGACAGATGCGCAATGCCAAGGACAATTGCCG 

TGTGTAAGTGGAGTAAACGAGGCGTATGCGCAACCTGCACCTGGCGGACGCGGCGTATGCGCAATGTGCAATTCGCTTACCTTCTCGTTGCGGGT 

CAGGAACTCCCAGATGGGAATGGCCGATGACGAGCTGATCTGAATGTGGAAGGCGCCCAGCAGGCAGCAGGCCAAGATTACTTTCGCCGCAGTCG 

TCATGGTGTCGTTGCTGCTTTTATGTTGCGTACTCCGCACTACACGGAGAGTTCAGGGGATTCGTGCTCCGTGATCTGTGATCCGTGTTCCGTGG 

GTCAATTGCACGGTTCGGTTGTGTAACCTTCGTGTGGCGACTTTCTTTTTTTTTAGGGCCCAATAAAAGCGCTTTTGTGGCGGCTTGATAGATTA 

TCACTTGGTTTCGGTGGCTAGCCAAGTGGCTTTCTTCTGTCCGACGCACTTAATTGAATTAACCAAACAACGAGCGTGGCCAATTCGTATTATCG 

CTGTTTTCGGGTGTACGTGTGTCTCAGCTTGAAACGCAAAAGCTTGTTTCACACATCGGTTTCTCGGCAAGATGGGGGAGTCAGTCGGTCTAGGG 

AGAGGGGCGCCCACCAGTCGATCACGAAAACGGCGAATTCCAAGCGAAACGGAAACGGAGCGAGCACTATAATCTTGCAGTACTATGTCGAACAA 

CCGATCGCGGCGATGTCAGTGAGTCGTCTTCGGACAGCGCTGGCGCTCCACACGTATTTAAGCTCTGAGATCGGCTTTGGGAGAGCGCAGAGAGC 

GCCATCGCACGGCAGAGCGAAAGCGGCAGTGAGCGAAAGCGAGAGGGAGAGCGGCAGCGGGTGGGGGATCGGGAGCCCCCCGAAAAAAACAGAGG 

CGCACGTCGATGCCATCGGGGAATTGGAACCTCAATGTGTGGGAATGTTTAAATATTCTGTGTTAGGTAGTGTAGTTTCATAGACTATAGATTCT 

CATACAGATTGTATCTGAGAGTCCTTCGAGCCGATTATACACGACAGCAAAATATTTCAGTCGCGCTTGGGCAAAAGGCTTAAGCACGACTCCCA 

GTCCCCCCTTACATTTGTCTTCCTAAGCCCCTGGAGCCACTATCAAACTTGTTCTACGCTTGCACTGAAAATAGAAATTAAAAACCAAAGTAAAC 

AATCAAAAAGACCAAAAACAATAACAACCAGCACCGAGTCGAACATCAGTGAGGCATTGCAAAAATTTCAAAGTCAAGTTTGCGTCGTCATCGCG 

TCTGAGTCCGATCAAGCCGGGCTTGTAATTGAAGTTGTTGATGAGCCGCGGGATTACTGGATTGTGGCGAATTCTGGTCAGCATACTTAACAGCA 

GCCCGCTAATTAAGCAAAATAAACATATCAAATTCCAGAATGCGACGGCGCCATCATCCTGTTTGGGAATTCAATTCGCGGGCAGATCGTTTAAT 

TCAATTAAAAGGTAGAAGTCGGTAAAAGGGAGCAGAAGAATGCGATCGCTGGAATTTCCTAACATCACGGACCCCATAAATTTGATAAGCCCGAG 

CTCGCTGCGTTGAGTCAGCCACCCCACATCCCCAAATCCCCGCCAAAAGAAGACAGCTGGGTTGTTGACTCGCCAGATTGAGGTGCGGATTGCAG 

TGGAG T GGAC C T GG T C AAAG AAGC ACC GT T AAT GT GCT G AT T C CAT T CG ATT C CATC C G GG AAT G C G AT AAAGAAAG GCTC 

(SEQ ID NO: 286) 

Exon: 6396.. 5781 
Exon: 2158.. 1969 
Exon: 1840.. 1001 

Start ATG: 5894 (Reverse strand: CAT) 
Transcript No. : CT9939 

GAAGACGACTCACTGACATCGCCGCGATCGGTTGTTCGACATAGTACTGCAAGATTATAGTGCTCGCTCCGTTTCCGTTTCGCTTGGAATTCGCC 
GTTTTCGTGATCGACTGGTGGGCGCCCCTCTCCCTAGACCGACTGACTCCCCCATCTTGCCGAGAAACCGATGTGTGAAACAAGCTTTTGCGTTT 
CAAGCTGAGACACACGTACACCCGAAAACAGCGATAATACGAATTGGCCACGCTCGTTGTTTGGTTAATTCAATTAAGTGCGTCGGACAGAAGAA 
AGCCACTTGGCTAGCCACCGAAACCAAGTGATAATCTATCAAGCCGCCACAAAAGCGCTTTTATTGGGCCCTAAAAAAAAAGAAAGTCGCCACAC 
GAAGGTTACACAACCGAACCGTGCAATTGACCCACGGAACACGGATCACAGATCACGGAGCACGAATCCCCTGAACTCTCCGTGTAGTGCGGAGT 
ACGCAACATAAAAGCAGCAACGACACCATGACGACTGCGGCGAAAGTAATCTTGGCCTGCTGCCTGCTGGGCGCCTTCCACATTCAGATCAGCTC 
GTCATCGGCCATTCCCATCTGGGAGTTCCTGACCCGCAACGAGAAGATGTCCCACCTCTACTCAACATTCGCTCAATTGGTAAGCGTGCACTGCA 
AGTCAACAGCCGCCGTCGGAGGCCTCCCAGTGAACCAGTGCAAGCACAACCTGCTCGGTTACGGGTCCGCCAAGCTGCAGACGCTCTCCGACGTA 
CAGCTGGAGGCACTAGATCCGTATCAGCGCGATGCCAACGAGCTGATCTGGTCATCGATTATGGGGGACCATCCGAGCGGAGCCAGCCTGGTGAC 
AACGAGGCAGCCGCTCCAACAGCCCCTGCCAACTCCTCCTGCGTCCTCTCTGATCATCCTAACGCGCCAGCAACTTCCGCACGGAGCATCCCATG 
CCCACCCCATCCAGAGCTCCGGCTCCGCCACCAATCCAATCTTCGAGAGCGGCGAGCAGAAGCACAAATACGCCATGGACATGGACAAGGCCTAC 
GGCTACGGGCCGCAGTCCAGCAGTGAGCTCCCAATGGCCGCCGCACTGACTTCAGAACCGTCCAAAAGATTCCTCACCGGACCTTTGGTGATCCG 
GGTGCGACCCGATGGCTCCCCGGTGGAGGAGGACAAGATGATGCCACTGCCGCGGGACGAAGACCTGCCCTACTTTCATCTTGGTCTGGCCGCCG 
CTCAGCCCAACAGGCACCGCAAGATCGCCACAATCAGCTCCTTAAAGCAGCAGCACTTCGCCTCTATCCTGCAGAGCGACCTCCAGCCACCCCAC 
CAGACGCAGAGGCGTCTGTTCCGCCAGCAACAGGCGTAAGATCGCGGTCAGAGGATCCGCAGGCCTAGACCTAAGTCCTTGTGTGAATGTGATAG 
AGCAGATGAGGAGATCCTGCATCATAGAGTGCGAGAGTGAACGTGTGAGTCTTCTAGTTGTAGCCGTTTGAGGAGTTTTCTTGAATGCACTTGAG 
CTTTAATGGAATTTCGTATCGTATTCGTTTAAGTTTTTTTAATTTTATTTATTCGCGTGCATTAGTCGTAAGATCACATAGAGATTCTTGAGTAA 
TAAACCGCATGATGATTATCATTGTTACGAA 
(SEQ ID NO: 287) 

Start ATG: 503 (Reverse strand: CAT) 

MTTAAKVILACCLLGAFHIQISSSSAIPIWEFLTRNEKMSHLYSTFAQLVSVHCKSTAAVGGLPVNQCKHNLLGYGSAKLQTLSDVQLEALDPYQ 
RDAKELIWSSIMGDHPSGASLVTTRQPLQQPLPTPPASSLIILTRQQLPHGASHAHPIQSSGSATNPIFESGEQKHKYAMDMDKAYGYGPQSSSE 
LPMAAALTSEPSKRFLTGPLVIRVRPDGSPVEEDKMMPLPRDEDLPYFHLGLAAAQPNRHRKIATISSLKQQHFASILQSDLQPPHQTQRRLFRQ 
QQA* 

(SEQ ID NO: 288) 

Classification: known_flybase_gene 
Gene Symbol : Reg-5 
FlyBase ID: FBgn0015801 



Celera Sequence No. : 142000013385196 

GGAAAATACGGCGAAAAGTAAGGGGATTATTATGCTTTTGTTTGGGGTGCACATGAATGTACATACGTACATGCATATGTACATTAGTACGTAAT 
GCTCTATAAGGTAAGGAACTACTCACCATGTACTGTTTGCGAAAATATGCTAGTTACGAAGTCTGCCTAATACTGTCCAAGTTGCTGTGGCGTCC 
GTGTGGAAATGGTTATGCTATTGCATCACATTTATATATTCTTGTAACTGAGCAAGTTGCTTAACAATTTGTTGCCTCTTTGTTGTTCTTTGTCA 
CATATTTTAAAGGACGCGCAGTTTCCGTTTCCAATTGGATTCCGTTTTCGTTATGTTGTTGTGGTACCAACCGACTGCTTGTACCCTAAACTACT 
ACAACTACAACAGCACCGTCACATTTGGCCCCCAGGAATGGGTAAATGGTTGTTATGCCTCGCAAATTTAATATATATATAACTTTTTTTCGTTT 
CGCGACGCGATGAAAATCACTCGAGCTCTGAACTTGCAGTGTGACCGCGTTTGCTGAACTAGAGATGGCCAAGCATCGATTGCACTATCGATTAC 
CTATGCATTCTCAAAAATTCGCATCCACAAAGTAGGAACTATCTGTTTCGACTAACGGTCATACTATGCAATAAATTCAAAATTTGTTTATACTT 
TTTGCGGACGAAGAGTTATATATTTCATATTCTGTTTTGACAAAGGATTGCTGTTTGTTTATAATCGTAATACAAAGCATATGCTATTTTGGATA 
TAGAACACTAGAATTCCCATAGGCGAACTTATTCAAATATCAAATTAATTGGTTCCTGAAAGCACCCGTAAATTCTTAGTGGCTGAAAAGTCTTA 
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TTGGAAGTCAGAATAATTACTCCACACACTAAGCATCTTAGTTATGCTTTGCTTTGAGCTAATCCTGGTTCTAACTGCCACCGAAAAATGATTCC 

ATTGTTTATGCCAGAAGTAAATAAAACAATTTCGAACGTTATACGTTAAATTACAAAGGGAAATCGCTTTATTACCAAATCAAAATCTAGCTATG 

GGTACGCCACGCTTAGAAGTTCCTGCAAGGCGAAAAATGCGCGATTAGTGGATGTCCATCCCGGGGATTCTAGCACAAACTCACTTGGTAATGAT 

GCCGTCCTTCTTGGCCAGACGCACGATGCAGTCGTCGGACTCGCCCATGCGACGGATCTCGCCGCAGATGGCGTAGGTCTTGGAACCGTCGGTCT 

GGCGACCGGTCTCGGGGTCCACATCCACGATGCTCAGCTGCACGGAGGCGTGATCCTTGGCGTGGATGATCCTGTTGGACGCCGAGCTGCAGGGA 

AGGGTATCGGCTGGTCAGTAATCCATTCGGCAAATTGTGCGGGGAACACGTGTTTCACTTACCATTTGCGGGGCACGTACAGATCAACATTCTCA 

CCGGCGTCGTTCTCCATTTTATTGGGCTGAAAGGTAAGGGACACAATTAGCTTGCTTTTTATTCAGTTTTAGTGGCGGCCACTGCTGGTTTACTC 

ACTTTTTTGTAGAAATTTACAAATCTAGCGACCGCAACTCGTCGGTGAAGGCGAAAAAGAGAAAGAAATGACAGCTGGACGTGAAGTTGGCTGCG 

CTGCGCGTGGCGCCCCCTGTCGGAAGTTTTGCGTACCAGCTGGTTTTGGGGTGACCAGCTGGCAAAGGTTTTGGTATTTATTTAATTGTACATTT 

TTGCAAGATATTTCCATTCCCCTTTAATGAATAGGCATGTTTCAATTTTGGTTATGCACCAAAATGTATTTTAAAATTAAATGATAATTAAAAAA 

AAATTTCACTTGGTTTTGTAAGTAAATATCAATAGATTTAAATATATATTTTCCCGCATTAGTGGAAAATACCAAAATCTAATGAATGACATATT 

AGATACATGTTCCAGCAGCGACTGGTCACACTTTCGAAGGCCTGAGCGAAGGCTGCAGATTTCGCGTCGGGAAAATTCTATTGCATATTTAGGGG 

CGCATCAAGTAAATGGCTCCCATTAGAGTGGCTCAATATATATTATTCGCAAGCTCCCAACTGCACGGACTTACTTATTTAGCATGTGAGTATTG 

ATTTTGCAACAGCCAGTCAGTCAGTCTGTGATTTTGTTTTCGCTGGATGGGCGTTCAGCGCGACTGCCCTTTGACCTCGCTCGATGTCAGCAGCG 

ATTTTCGTCAATGGATGCGGACCCCCACTTTTTATTGCCACTTCGCGTGTGTTTGTGCGGCTGTTGGGGCGCCCCAAGTGGGCGGCATTGTTTTC 

ATCGCGCGCCATGTCGCGGAATTCGCAGTCCACGTTGCCTATTTATAGGCGTCGCTCACTCAATGAACTCTCTGATCGCATTTCATTGTGAGTTC 

GGACTTAGCTTTTTCAGTTGGACGCCAACTTTGGAAGCAATCGGTTCGCTTCGCGTCGCTCGATTTCAAAAATCCCAAACGGCGTGACTTTTGCC 

TTATTGTTGTTAGTTGTGTTTTTTCGACTGAAACTTCTTGAGGTGAGATTTCACAAGTGGTTATGCTGCTTGTTGACCTTTTGTGTGTGCAAAT 

(SEQ ID NO: 289) 

Exon: 1564. .1523 
Exon: 1451.. 1393 
Exon: 1321.. 1130 
Exon: 1067.. 1001 

Start ATG: 1442 (Reverse strand: CAT) 
Transcript No. : CT10101 

CGACGAGTTGCGGTCGCTAGATTTGTAAATTTCTACAAAAAACCCAATAAAATGGAGAACGACGCCGGTGAGAATGTTGATCTGTACGTGCCCCG 
CAAATGCTCGGCGTCCAACAGGATCATCCACGCCAAGGATCACGCCTCCGTGCAGCTGAGCATCGTGGATGTGGACCCCGAGACCGGTCGCCAGA 
CCGACGGTTCCAAGACCTACGCCATCTGCGGCGAGATCCGTCGCATGGGCGAGTCCGACGACTGCATCGTGCGTCTGGCCAAGAAGGACGGCATC 
ATTACCAAGAACTTCTAAGCGTGGCGTACCCATAGCTAGATTTTGATTTGGTAATAAAGCGATTTCCCTTTGTAA 
{SEQ ID NO: 290) 

Start ATG: 52 (Reverse strand: CAT) 

MENDAGENVDLYVPRKCSASNRIIHAKDHASVQLSIVDVDPETGRQTDGSKTYAICGEIRRMGESDDCIVRLAKKDGIITKNF* 
(SEQ ID NO: 291) 

Name: ribosoinal protein S21 
Classification: ribosomal_protein 
Gene Symbol: oho23B 
FlyBase ID: FBgn0015521 



Celera Sequence No. r 142000013384819 

TGAAGCAGCTGCAGTTGCTCTACAAAATGGCCACCAGTTCGAGCACCACAACCACCACAACAGTGCCTCCTGTCTTTACAGGAGGATCCCTTAAT 
TTGGGTGGTCCAAGTGGCTTTAAGCTGCCACCTCCTGCTCTTAGCCAGGATACCAATACGCTGAGCAATCCTGATCCTCTCAAGAGCAAATCTGA 
GATCACCAAGGAGCTTATGGATGTCAGTCTGGAGACGGACACTTAGGTTGTGCGCTTATCTAGTTACTAATTAACCTCTATATACAAATACTTGT 
GTATTTGGGTGCTTAGCAAATCTGTGTGTAGTATGGCTTAGCTATCTTTTGTAAATAATATTATAAATAAAACTTAATTTAGTGAATCGCTTTAT 
TGAACAGCCTGAATGTAGGCAATAGAAATTTTATGTCAAATCGAGGTCATGTAAAACTCATGTAGAATTGTATAAATGTAAACTAAAATTTACAT 
TGGTTAAATATATTAAAAACTGGTTATAAATTAAACATTTAAAGTTTCTGACGTCATTCGTATTTTTTTATTTACAAATATAAACAACTTTATTC 
TTTCTTTCTGAGTTACGCTTTATTATGAACTTAGAAATGTTTAAATTAATTACATAAATAGTCAAGTATATTACAAAGTTACATGCTGTATCAAC 
GTAGCTACATTATCGGCATATCTTCATGTATTTCCCTACTTGCATAAGTATTTTGTTCATCTAAGCTAGACAGTAGTTATGGGCTATATTATAAA 
TTAAACGAAACATATGGGACACAGCAGCTGATCGTGGATCGAAAATAGGATGAGGTCCTTCCACCAAGCCAACTGTGCGCTCTGTGGACATACAG 
AATACTATAAATCATTTAGGCTACATACATATCGCCTACAGTTACATATGTAGGCGTACTAAAGTAAATATTACTTATAAATATAGTCAATTTCC 
ATTTAGCAGACGAGTGCATCGGTGTACAGTGGTTTTTGGGTTTTTTTTTTATGGGTATGAGATGTTTGTGTGTGTTGTGTTTGTGCGAAAAAAAA 
CCATGCAGTGTCTATAAAAATAATGTCTTTCATATCATGTTTTTCTCTGTGTACTGAGTGTCCGGTGCCTTTCGGTCTGGAGTTCTCTGATTAAG 
GGATTGCCAAATGAAAGCCCCGTAATCTCAATGTGGGAGTAACAACTACTTTAACTTGTGCGGATGATGATACCCGGTGGGTTGATATCTTCCTC 
TTGGCTCTCATTTCCTTCCGCTCATCACGAACAGCAGTGGCACCGAATGAGTCTCACTGGCGATGTGCAATCACTTCTCGTTCTTGAGGGGCACA 
CCCTCCTCCTGGGTGACGTCGCTGATTCTGACACGTTCTGGAGGATTGTTCCACGGCTGCAGTTCACCGCTGCCAAGGATGGCGAACACCACAGC 
TGCCGAGATATAGGTGGCGGCCAGGATCCAGAACACAATTTGCCACGAATGGAAGGATTGCTATTGGTTTTAAAATAAAATATATAAGTAATATA 
GTCTCTCTGATATTTTCAGCTAGATTTACTCACATCCTTGTAGGTGAGGGCTCCCACCATCGACGTGGAAAGGAAGCCACCAAAGGAAGACAGCG 
TGTTGGCCAATCCGAAAATGGTTCCTCCAAAGTTCGGTGCAATATCCAAGCCATTGCCGAGATAACCAGCGGTGACTGCACCATGGGCAAACAGG 
GCCAGCGAGAAGATCGTGACAGACCAGGTGGCATCGTAGCCCAGAAACACTTGCACTATCATCAAAAGACCGGGAATCACAAGAGCTGAAAATAA 
GAGAAGATTGTTTATGTTATTCCTGCAATACCTTGAAATGATATTTTTGAATTTACTTACCAAAAGTAGTGAATAGTTTCCTGGTGGCCGTCGTG 
CTCAAAGTACCCTTCTTGCGCAGGTAATCAGCCAGGTAGGATGAAGCCACAGCCATAACGTATTTACCCAGATAAGGCAGCGACGAGAACAAACC 
ATTCTGTAGAAGAACAAATCTATTAGTAAATATAAGATACCCAGATAAGCCGTGTCCAACAAACCCACCTGTTTGATATCGAAGTGCAGAATCTT 
GGACATGAAAGTGGGCAACTGATTGACCACAGTGAAGAATCCAAAGACGGCCAGTCCGTGACAGATGATGATTGCCCACACAGCTGGAGAGCAGA 
GCAGCTGACCCCAGGGCACATGGCTCGGACGCTTCTTAGAGGTAGTGGTGCCAATGGCCTCCTCAATCTCCCGCCGCTCCTCGGCTGAAATCCTG 
GGATGAGTGGCGGGTGTCTCATAGACAAAAGTGAACCAGGCCAAGGACCACAGCAGACCCACGGCTCCCGTCAAATAGAAAACACTGGCCCAGCC 
AGCGACGGAGATCAGGTAGCCGCAGATGGGCATGGTGATTGCAGCACCAAGCGAAGAAGCTTTCGGGTCAAGAAAAAATATACGTTAGGGATAGG 
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ACATTTTTATAATACAGGTATGCAGCCTCTATTAGGCTTTGTTCCTACGGACAAAGAACTGCTTGACTTACCCATCATGTTGGACATAAACTTGG 
AGCGCTCCATGGGCGGAATCCAGACAGCGGCCACCGGGTGAATGGCTGGCCAGGATGCGCCCAGCATGAAGCCAAGAACAACTCGCACCACGATG 
AGCACCACGTAGTTGATATGCGCCGCCAGCGGCGTGATCAGGGTGAGCAGACTGGCCCACAGCATGGAATGTCCAAAGACGCGGCGTCCACCGAT 
CAGCTCGGCAAGACGACCGCCCGGCAGTTCGGTGAGGATGTAGCCCCAGAAGAAGCAGCCCAGCACGAAGTTGGTCTGGTAGCTGTCCCAGGGAA 
AGCGCTCCTCGTACACATCCACTCCATCAGGCGATGCGGTGCTATTGGCCGCCGTACTATTCCCCACTAGAGTGGCATTCACCGCTGATGTGACA 
TTGGGTCGGACCATGTCCACGATGGCAATGGTGAGGTTCACTCGCAGGGCGTAGTTCAGCATGAAGCCGAGCATGGTCAGCAGGTTCAGAACTTG 
GCGGCATGACAGAAAGTCTGGAAGGGAAGGGAAGAGCGGTGTCCAATTAATAATAGAATCCAAGGCATTGGATTGGATTGGATTCATTGATACTG 
TAGTACATGGCGAATAAGTCGGTGCTTCGTCGTCTGGGTCGTCGAAAGGTTAGGCACTTAAAATTCAGCTCTGGAATATTCCCAAGCTGCCGAAA 
ATCAGGCGACCAGCGGCCAAAGCAAAAACCCACCCAAGAAGCGCCCCAATTTGTGATAAGAAAACTATACTACATTTAAATTGAACAGAATAACC 
TTTTAATCGAAATTTAAAAAAAAAAATCCACCTGGATTATGATCGCCTGTAATTGCATTCAAAGCTAATACACGTTTTGTTTACAATCATTTTAA 
CTGATTTGTTATCAACTCACTACATCCACTACGCCCACACTACACACTTATTAACTGATGTAATCTGTTGTTTTCATTACACGATGCAGCCCAGA 
TAAAAGCATTAATTACCCATATTAATGAGCCAATAAAATCGCAATCGCACAAAACTTAACAGTCGTTGTCAGGTGACCTCAATAGCATATCGAAT 
GGGCCATGAATTTGATTTGAGGATCGCCGGTTTGTGGGTGCAGCTGGATTTCAAAAGGAGAAGAAAACCAATCAAACGGAATCAGCTTCATTCGA 
AGCCAAAGACATGAAGGCGAAGATCATTAAGTGGTGTCCACAATCAACACAAACTGGGCCTGTCACTTTGTTTTTTTTTTTTTTTTTTTGGGACG 
CTTATTAAGTGGCAAAGTGCATTGTACGGATTGGGTTGCATTTTTCGCAAATCACATTAATAAATTTCAAGTGGCATTGAACCGCAGTTATAACC 
CGCTCTTCAATTTCAGGGAAATGCTCACAAGTGTCGGTGTAAATTTATAAGCCACCAACCCGATTGTAATAAATAATAAACTGGCTTCCAGCGAC 
TTTCCCATGCAAAAAAAAACAAAAAAAAAAACAACGCTATCAGCTGGGCGAAGCGACTATCGGGGCCAGTCTATCGCCTGTCTGGCCGGGACACG 
CCCACTTCGCTGATTATTGCCCCAATATCATGTTAATGTCATGCAAAAAAAATACGAAAATAACGCCACTCGGCCGCTTGTCTAGACATCGGTGA 
GC ATC AT CAAAC GAGCGCGAAGCAACAATAAAACGAGACACTCAACTTGT AT TCAAATT AGC ACT CGAAAATAACCAAAATTCGAAACATCAAAA 
AAGCGTTAAAAAAAAAATATACAAAATCTCTAGTCATGTTTGCTATAAATTCTTGATTCCATATAAAACCGATAACCAGAAAAACGTGCCCAAAA 
TACACAAAATTATAAGTAATTCCACAGAATATCGGCAAGGTTTCGACTTAGACCAGTTTCTTCGCTCAAATCGCATAGAATTGGATCGAAGCAGA 
ACTATATCCAAATTTTTGGGCGATCTATTGTGAACTCAGGAGGTGTTTGGAAAGTTTGGATCATTAACAAAGTAGCGCTCTGTTAATTTGTGGGC 
CATGTCTGGACACCTGACACCTCATCATCAGCTCAAATCAATGCTAAATAAGCGTGTGCCGCAAATTACCGCCACTCCATATCGATAAGGATTGA 
T T T AT AC GACTT C C AC TT GC AC GGGGGAAAT AAATT ATAAT AAT AAAC T AAT GT C AT AAA C CGGC AA GGAC T AT G AACT T AAAGT GGT AAAT T CT 
CTTATCTCAAATAGTAGCTAAAAATAATTCTCAATGTGTGTGGGAATTTCTAACTATAAATCAGGTTCGAAAACTTGCACTTAGATTGGTAAACA 
AGTTTTAAGGGCCTCTATTTTCTATAATGAATCAATCCAGTTATTCAACTTATAAAATTCTATATCCATTAAGCACATATTTCGGATGATAATTA 
AATTGAATAGAATTCGAGACAGAGCTCTAATTTGTCGGAATGAACTCATCAAGGGTTTTCTCTAACCGATCTTACTCATCAAACGAATTCTGAAG 
TCTGTGACTCCTTCCGAGAATCTTAATGATTGAAACTCAACTCAGACTTCTTGATTAATTGTCGCCTCGAACGGTTCATTTGTTTAAAGCTTTGT 
TTACGGACAAACTATTGCTTCGCATTTTCTTTTGTTGTTATCTTTTTGGCAATAAATTAAGTGAAGGCCAGACTTGATGTTTACAACCGAGAGGC 
AGAAAAAACCGGCTAAAATCGTTCGAAATTATTTGGACCGACTGGATTTGGACAAACTTTTCTCAAAATTCTCAGAATTTCGCCTGTCTCGTGCA 
TAAACTTATTTGATTTGGCTGCGTTTCATCTCGTTTCCTTATTTTATTTAATTTTATTTCGCTTTTTGGCAGCAATCTTTAGCCTTTATTGGCTT 
TCGTTGATGCTCTACTTGAGTTTAAACAAGTTCGCCGCAGTCTGCGCCAACATAAAAATCCCATCAGAGAAAGGCAAAAACTTTATAAATAGCCG 
GCAAGTAAATAAAACGAAATGAATACGGGCAAAACGTGATACTTAAATTACCGCTTTTTACGATCGAGCAACAAGTGAATCTGAGTTCGTTACTG 
GGTTCATTGATTTCTCGGAAAATTCTTAAAATTTATTCGCCAAAACTGCGAATCAATTAAGTGGCACTTGAAGTTCAGATCAGCACGAAATCGAT 
TATTTGCTTTGCCTTCAAATAATCATTTTTAATATTTTCGAAAAGACGCATATCCGGCACTTTTTGGGGGGTTGACGCAAAAGTATTTTCGAAAA 
TGTCTTGTACGATTCTAAGTACATTCCGTCTGTTATTCTTTTTAATCTATTTCACTGGTTTATGTATATTTTGTTTTCCTTTACACTTTCGAATC 
TATTTATACAGACGCTCTATCGTTCGAATGCGGGCACTGGGTACACACCCACCGATGATTATCTTCTATGTCCGTCCACTCTCGATTCTCTCTGG 
CCATTCCTGACCTTCAAAATCGGAATGGGCTACATAGACTATACATTGTGTTGGTGAAAGTTAGCCACCGATATCGTCATCAGCTTCGATCAACT 
GATATATATTTTCTCCTCGAATGGATGGAACGCATGTGATCGGCTTTCGGATTTTCTGGTGGAATGTCCTCAGCATTTGCGTGGGCACTATTGGT 
CTGGCCCGAAAATTTTCCATTCATTCGACATACGAAAGAAAGGCCCGCAACCGACGTCACACAAATGCGCACTTTTCCGTCAAGGTGCCAAGTCA 
TTTTCCTGTCACGGGATCGAAGCGGCGGGCGCAACGTGGCGTATGAGTTATGGCGAAGGCGCGCCAGCCACGAGCCCACTTTTCAGATTCGATTT 
GCAGCGATTACAACAACTCCCACACAGCCCAAGATTTTCAATCGTATGTAGTCACAAACGATTTGTGCCCATGCCGGGGGAAATGCAAAACACTG 
TTTTGCATTTTGAGATCAATCGATTGTTTCCGAACTTTTCCATGTGATCATTTCTACAGTAGATTGTCCTCGGCATTGTGTATTTGTGGGAATTA 
ATCAATCAGAACTTCATAAGCATAACTTTACACTTTCAAATCACTGAAATAAATCATTTGCATATCTGTCGGTAATATGTTTCATTGCTTTCAAA 
CGCCAAAAACGCACTGTATCGAATATTGTATCTGTATCTGTCGGATGGCATGACATTGCCGCGTCGCCTTCTAGCGGCAATTATTGGCTTTGTTG 
TGCTGGCCAACTGCATTTCCGTTTGTAACTGATTTACGGAACTGGCCTCGTTGAAAGCGAGTGAATTGTAATGCCACGACGCCGACGACGACATT 
GTTTGTTTTCGAGTTTCTGTTTCTGTGGAACTGGTTCTACTTCAACACAATAAGCAATAAGCTAATTAATTGCTGAATTATTGTGCCAATTTGCT 
GTTTGTTGAGGGGCACACACAGCAGCCTCACTTATCACTTTTGTTGGACAATCGACCATTGTGATTAACTGGCCATCTCTCGAGGAACTCTCCCA 
TCGCCATTGTCCTAGATTCGTGCCTTTGATTCGCACTGGGAAATCTTCCATGGAAATTTACAATTGGAACCCTGCACTTGTGGCGTTTGCCGTTC 
AAAGTGCAATCGGTCTTAAGAAGGAGCAACTTGTATACATCGATTTACAGCACCAGCCCCCAGACTGATAAGTTTATTGTACGGAATGTCTGATG 
TCTGAAGCTCAGGAAAGTCAAAACTTTTATTGACTTGGCTGCATTTGGCCATAAAGTGGCCCACGAAGTGGCCAGTTCGCGAATCACTTTAATCC 
AGCATAAGATATGACCGGTATGAGCTGTTCTACACATGAATGTATCTTCAAGTTGTCATTTCCTTCGACCGTATCTCATAGATGCACGTGCCGGC 
TAATGAAGAGCTACAGGTTGTAAACAACTTGTCAGGGGTCTCAGGCCCCGTGCTATGTAAACATTTATAGCTGGCAATAAACAATACACATATAC 
GTATATATGTATAAAAAGGGGGCTCGTCGCCAAGTGTCAATTATTTTCTGGACCCATCAAAATTATCAGCAACTATTTTGCAAATAGTCGAAAAC 
TACCGACGAACTTTGGCACTTGATAGCGCGACACGCGTTGCCTTGACTATGGAGAGTCTTATAAATATATATGTAGATTGGAATGGAAATACATA 
CATGCTAATTTATAAATATGATTTCAGAACCATCAATGGAGGCTTGCCTCAGCACATTTGAATGATTAAAGGGTCTTGCGGGTCAAAGCCAGCAT 
TCTTTTTTTTTTTTGTACTGTCTGGGACTTGTAGATTTGATTAGCGTTGAAACTTTTAATTGTAGAAATACTTGAAATTTCTGGTTATAAATTCA 
ATAATAATTAAGTGTGCCATGTGCAACGGAAGTACAACGCAAGACATGAGATTATAATCAAATACGGCGTAAACCACTTTGAAAATTCCAAGAAA 
TGGTTTCAGCTCAATTGAAAGCAATCTCGTTATTACACAAAAACCAGAAAACTAGACTTGACTTAATGCATATTAAGCCGACGGCTAGAACAATT 
GAACCGTGTTGCGGAGCTTCAACAATGGAGTGCCAATTATGAGAATCATTAATAAAAGAACTAAAGGATCGAGTAAACACAAATCGATGTAAAGT 
GACAAAAGTTTAATAACAATTTATCGTAAGCGAAATGAGCCGAACACATGCTCATTATTCATTACTTGGCTGTTTGATTAATTCCAGACGACGAC 
CCACAGTCAGTCTGCCAAGGGTGAATACCCGACGGATTCAAATGTGGATTCAAATGAGAATCGAAAGCGGTTGCTCTGGGGAATTTACAAACGCA 
AATGAATGTGATTGGCCAAGCTAAATACCAATTAAATTGAAAGCTTTTTTTAGAAGTAGTTTATAATAAATTATGGGTTTTATGTCAAATTACAA 
TTGAAGTGTAATTTCTCTCTGAAAGTAACACTCATTTATAAGCCAATTTAATATTTAATATTTGACTATATACGCGTTCATCAATTTGTAAATTC 
CCACTTTTTGTGATACCTTTAAATAGCTTAAGCACCTTGAATTTTGTTTATACAAAGCACAATGTCAAAAAACAATAAAAACCTTTAAAAACCAG 
ACATGAATCACACCTTCCTGTATTATTTGCAATTGACTGAGTGACTGAGTGGGTGTGAATTACACGAGCTTGTAATGAATAACCATACTATAAGC 
TGTCTAATCGATATGAATTTCAGTTTGTTTTAGTGTTAATTGAATTTGTAATTGTCTCGTGCGACAACGTATCTAATTAGCACACTCAAAGTGAG 
TTTTAATACCAACTCCCCCTAATTAGAGGCTATAAACGAGTGCATTTCGGAATCTTGGAATCTTGAGACGTATTTCTTACAATTTTTTAATATTT 
TCCTGCTAATTGAACAATTTGCTCTTTGGTTAGAGCTCGCCGCTTCACTATCATTCATGATTGCTCACGGGATCGACGAAATGTGCACTCAAACA 
TGCGATCAACCCACCCGCCACCACTTCATCCTCACCTCACCTGTCTGATTCCACGATTGTCTGGCTCTAAAGTGCTAAACATTTTCAAAAGTACG 
AATTAAATAGTAAACAGTCGAATGACCGTCGTTGGTGGGCTATTAAAAATTCACAACACAACAACCACACACAAAAAGTGGTTCAATTAATTGCG 
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AAATTTTACTCGAAATTTAGTCCATGAACTAAGCTTTCCATACGCAAAAACGGAACTTGAATTGTTCAAAGACTTCAGGGGGGGGGGGATGTGTG 
GTAGACTAGACCAACTAACGTGCTTAGTTAGCGTGACTGTGCTAATTTGTTTGCATAGTCTAATGCGTGGTACTACGTGGAAATGTCATAACCAT 
GGCGTCATGAGGTCGCCCAACTGCGCACTCCAGCTGTCCCACGGCGTGGGCGTGTCCTTGTCCGTATCGATGACATCGCGAGGATTCGAGGAGTG 
CCTGTCCTATGGTATAAGTTCCACCAGGTGTCAATGATCGAGAAGTTGTCAAGGTGTCGCGAGGTTGATACAATTAAATATTTATCACATTTTCA 
TTGAAACAACTGGGTCAGGTTGTCATAGCGAAAAAATAAAACCCATTAGATACTTTCGATTATGTTATCTGTTATCAATATCTACATATTAGGAT 
TTCTCTAACAAAATTGGTATATGGCCATTGAGAATAATCTGTAAATCGAACAGCTGACGAACCAAATGAATATTATAAGCTGATATATAAACTGA 
AACAGTTTTTGATAATCTTGCTGAGCTTTAAGCATCTTATATCATATATCTTATATATTTTTTACTGCACACAAAAATCTCGAATATTCGAATTG 
AAAATTGACATTAGAACGATGAACATTTGCAATTGACCGATGGAGGAACCGGAAAGTCAATCAGCATTCAATTGAATCACAGTGCCCAATTGATT 
TGTAACCTCAATACAAGTTGTCTCGTTGACACCAATCAAAGTGGAGGCAGGAGGAGTCAAATCGATTTGTTTACGTTAACCGAGTTGCAATTCAA 
TTGGCCATCAAACATGGGCAATTCTGCTCGACTTTCGAGCCCACTGACCCTGGCGCTCCCTCATTTTCCAGTGGCGTCATGCGGTTTCTGGCCGC 
CTTATCGAGGAAAAAGAGGGGCTCGGCTGTTAAAAAAAAAAGGCGCTGCGGTTCACTGGGGGAGGCACATTGTTCTGGCATTTTCGACTGGCCAA 
ACAGCAGGTGCAAACTGGAAACTCAATACAGTAACCATAGATGATCTCTAGCTATTTCGTATAAATATTAAATTTCAACTCGAAAGTTCTCAAAT 
CATTATTTACTAATATTCAAATAATCAGTGTGCTGGCTATGATATAAGCAGGATGTTGGGTGGGCACCTATCATATCACATATGATATCATATCA 
CGTTATCTACTTACTTCGCACTCTTTGCAGACCTCCTTTCTCCGTGTTGGACATGACGCCTCTAAGTGCCGGCTCTGCAAGTGGAAATATACCAG 
AGGTTAGACGAATTTTATAAATTGGAAACGAAATATGGCAGGCGATGTGATAGACATCTAGAAAATGCGATATAAATAGTCTGCAGAGTTAATGA 
AACTTTTTACGAAGTAACCAAAATCATGGTTATACCTTCTTTTTTTTTTTTTGGCTATCTTATCTTATCGCCGCCGGTGTCTCGTGTTTGTACCT 
GAACGGTTAAACCGATATGCATATATATCATTTTCAGACGTTCTTTGATAACCACAATTGCACTGGCCTGTCATTATCTTTCTTTTGTTCAGTTG 
TGCTGGCGCCGCTATAAATTATTGAAACACTGTTGCAGCAATGGTGGTGCAAAGGGAAATTCGTTGACTATATATGCGATTTGTTGCATGCTAAT 
TGCCGCTAGACACGTGCATGTGCGGCACAGACAGGATCTTGGCCCTGGCAACATGTGTAACTTGGCGCAATCAATTAACTGCGAGTGCTTAGATG 
GCGTTTGGTCAATTTGGACTGCAAATATTTGCTGGCGGCGTCCTGGCAACAACAAAAATGGCAACAAAACTATGCCCGCATGTGTGAGGAAAGGA 
TAT AT GT AT AAAT AT AC AT AT AT AC AG AC G AAAT T AT TGTCCTTGCCTGTGTGGCATGTT GC C TT GT T AT T AGGC CC G AT GAAAAAGCGGC ATC T 
GCCAGCCAGTCGTGTGGGCATTGACTATATACGACTTCTATATGGTCACATGTCGTCATACGTCATCTTCCTACAAGTCCGTATATGCGGGAAAA 
ATAACTCAATTCGCCACTTTTATCGCGGGAAAGCAATTATGTTTACAGCATTAGTCATCAATAAGCCCAAATATATAAAGTACTTAAATATACTT 
AAGTGGCTTGCATTGATTTCATATAAAATGCTCAAAATGTTTTGCTCATGAATGAAACTTTTAATTACAGTAATTATAAATGGCAAAATTGGTCA 
GATATCAGTTTACAATTTTAGCCGCAGAATTAATTTGTATATCACGTTTATCTTAACTAACAAAGCACGATTAAATCACAATTTAATTTATGTCA 
ACATGGCTATAGTTAGCCCTCGTTTAATTAGTAGTTTAAACGTTTTGCGCCTTGAAAAACCAATTAATCACATTAATAGCTGTAGGAAACTAAGC 
AATTATGACACCACTAGCAGGTAAACGATCTATTTTAATTAGTTACTTACTGACCTAAAAAAACGGAAATAACACCTTTAATAAATATGCATCTT 
AAAACTAAACTTTAAGTAATTGTAATAGAATCAGATTGTAAGCAACATTTGCATTGCACGAACTGGAAACCCACCTTGCACTCGACAATAACTAA 
CAAATGTTAGTTGTTTATGGCAAATTATATAATTTGTTTGGCACCAACCGATTTGCAAACGGTTTTCGTATGATTAACTTTAGCTCCGATTTCCA 
TAAAACAAAACACAGTGAACGAACGAACGAAAACCGCCAAGAAACCGCACGTGTTTTGTGTTTATTTTTATCTGTTCTTCTCGTTCTGTGTAATA 
TCTCAGATATTGAAGTATTGAGATATTGAAAGATCGGTAGATTGGGGGGTGAAACACACTCGCACTCCCGAACAAGAAACGGTTTCAACAATGCC 
GGATGGGACAGCTGTTGATTATTTTTGAAAACGTCAAGCGACGAGTTTTCTAGCGCAAGCGCAGGCAATTCTAGAACTGTTTCGAAATCGAAACT 
CAAAACGCTGTTGCTGCCGTGGACAGTTGCTGCTGTTGTTGCTCTGCTGTTGCTGCGTGCTCGTTGGAAGAGCGTTGAAACTGAACTGAAAATGC 
GGCCGCTCGTCTGCGGCTTTTTCTATACTCCGCCGCAGTCGCAGTCGCCGTCACCGTCGATGTTGCTGTTGCTGTTGTCGCAGCGACTGCTGTTG 
TTGCTGCTGTTATTAATGTTGTTGCTGCTGCCGTCGTTCTCGTTGCTGCTGCTGCTTCTGTTATGCTGATATAAGTGTTGTTGCTGTTATTAGCG 
TTGTTGTCGCTGCTGCTGCTTCTGTTATTGCTGCTGTGCTCGTGTTGTTGCTGTCGCTGCTGCTGCTGTTATTTCTGTTGTTATTTCTTTTGCTG 
CTTCTGCTGACGTCTGTTTTGCTCAGCAGCATTACCATCTTTGATGAGAATAGTGTTAGTTCAAAATTGAGTTAAATTCGAATAACAATCTTACT 
CCATTCGCCGACCTTCATTCAGATTCAGATGACAGGAGCTTATTTCGGTGATGAAGATTCGCGTGCAGCGCGGATTGGATAAACCCAACCCAAAG 
GACACAGGATTCCTGAAGATGATGCGATCTGTATTCAAGTGCTGGTCATCTAAAGATCGTCACTCCGTGGATTTGCCCGCTTCCATTTCCAGCTG 
GGATCGCGTCGAATCGCAGCGGATAGTTGATACCCATTTGCCATGCGTAGCCAAAAGTGCTTATCAGCACACGATTCGGCACAAAAAACTGCTGC 
CGATACAGCAAAACGGTACCAAGTGCTCAATTTACGCGGCAATTTTGAAACATCAGTGGTTTTTTGTTTTGACAGATTTTTTTTTTTATTAATAG 
CTTATTTGGACGAGAAGCTAAGATGCCAAGAGCAGATCAGTCTGCAAGCATGATGAAATTCCACAAATTTTCGCATTCAGAACTTGTCTTTGTGA 
TATAAACTGTGATCTTTACATAGCTTAAATATAGAATACCATATCTATGAAATAAAAAGTACCGCATCAAATGGAGAATCGCGTCCAGCGCTCCA 
CGAGACTGGGTCGCGGCGAGGCCCTGT 
(SEQ ID NO: 292) 

Exon: 12327.. 11855 
Exon: 10524.. 10465 
Exon: 3057.. 2542 
Exon: 2434 . .2064 
Exon: 1998.. 1866 
Exon: 1795.. 1554 
Exon: 1485.. 1001 

Start ATG: 10504 (Reverse strand: CAT) 
Transcript No. : CT10168 

CTCTTCCAACGAGCACGCAGCAACAGCAGAGCAACAACAGCAGCAACTGTCCACGGCAGCAACAGCGTTTTGAGTTTCGATTTCGAAACAGTTCT 
AGAATTGCCTGCGCTTGCGCTAGAAAACTCGTCGCTTGACGTTTTCAAAAATAATCAACAGCTGTCCCATCCGGCATTGTTGAAACCGTTTCTTG 
TTCGGGAGTGCGAGTGTGTTTCACCCCCCAATCTACCGATCTTTCAATATCTCAATACTTCAATATCTGAGATATTACACAGAACGAGAAGAACA 
GATAAAAATAAACACAAAACACGTGCGGTTTCTTGGCGGTTTTCGTTCGTTCGTTCACTGTGTTTTGTTTTATGGAAATCGGAGCTAAAGTTAAT 
CATACGAAAACCGTTTGCAAATCGGTTGGTGCCAAACAAATTATATAATTTGCCATAAACAACTAACATTTGTTAGTTATTGTCGAGTGCAAGAG 
CCGGCACTTAGAGGCGTCATGTCCAACACGGAGAAAGGAGGTCTGCAAAGAGTGCGAAACTTTCTGTCATGCCGCCAAGTTCTGAACCTGCTGAC 
CATGCTCGGCTTCATGCTGAACTACGCCCTGCGAGTGAACCTCACCATTGCCATCGTGGACATGGTCCGACCCAATGTCACATCAGCGGTGAATG 
CCACTCTAGTGGGGAATAGTACGGCGGCCAATAGCACCGCATCGCCTGATGGAGTGGATGTGTACGAGGAGCGCTTTCCCTGGGACAGCTACCAG 
ACCAACTTCGTGCTGGGCTGCTTCTTCTGGGGCTACATCCTCACCGAACTGCCGGGCGGTCGTCTTGCCGAGCTGATCGGTGGACGCCGCGTCTT 
TGGACATTCCATGCTGTGGGCCAGTCTGCTCACCCTGATCACGCCGCTGGCGGCGCATATCAACTACGTGGTGCTCATCGTGGTGCGAGTTGTTC 
TTGGCTTCATGCTGGGCGCATCCTGGCCAGCCATTCACCCGGTGGCCGCTGTCTGGATTCCGCCCATGGAGCGCTCCAAGTTTATGTCCAACATG 
ATGGCTTCTTCGCTTGGTGCTGCAATCACCATGCCCATCTGCGGCTACCTGATCTCCGTCGCTGGCTGGGCCAGTGTTTTCTATTTGACGGGAGC 
CGTGGGTCTGCTGTGGTCCTTGGCCTGGTTCACTTTTGTCTATGAGACACCCGCCACTCATCCCAGGATTTCAGCCGAGGAGCGGCGGGAGATTG 
AGGAGGCCATTGGCACCACTACCTCTAAGAAGCGTCCGAGCCATGTGCCCTGGGGTCAGCTGCTCTGCTCTCCAGCTGTGTGGGCAATCATCATC 
TGTCACGGACTGGCCGTCTTTGGATTCTTCACTGTGGTCAATCAGTTGCCCACTTTCATGTCCAAGATTCTGCACTTCGATATCAAACAGAATGG 
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TTTGTTCTCGTCGCTGCCTTATCTGGGTAAATACGTTATGGCTGTGGCTTCATCCTACCTGGCTGATTACGTGCGCAAGAAGGGTACTTTGAGCA 
CGACGGCCACCAGGAAACTATTCACTACTTTTGCTCTTGTGATTCCCGGTCTTTTGATGATAGTGCAAGTGTTTCTGGGCTACGATGCCACCTGG 
TCTGTCACGATCTTCTCGCTGGCCCTGTTTGCCCATGGTGCAGTCACCGCTGGTTATCTCGGCAATGGCTTGGATATTGCACCGAACTTTGGAGG 
AACCATTTTCGGATTGGCCAACACGCTGTCTTCCTTTGGTGGCTTCCTTTCCACGTCGATGGTGGGAGCCCTCACCTACAAGGATCAATCCTTCC 
ATTCGTGGCAAATTGTGTTCTGGATCCTGGCCGCCACCTATATCTCGGCAGCTGTGGTGTTCGCCATCCTTGGCAGCGGTGAACTGCAGCCGTGG 
AACAATCCTCCAGAACGTGTCAGAATCAGCGACGTCACCCAGGAGGAGGGTGTGCCCCTCAAGAACGAGAAGTGATTGCACATCGCCAGTGAGAC 
TCATrCGGTGCCACTGCTGTTCGTGATGAGCGGAAGGAAATGAGAGCCAAGAGGAAGATATCAACCCACCGGGTATCATCATCCGCACAAGTTAA 
AGTAGTTGTTACTCCCACATTGAGATTACGGGGCTTTCATTTGGCAATCCCTTAATCAGAGAACTCCAGACCGAAAGGCACCGGACACTCAGTAC 
ACAGAGAAAAACATGATATGAAAGACATTATTTTTATAGACACTGCATGGTTTTTTTTCGCACAAACACAACACACACAAACATCTCATACCCAT 
(SEQ ID NO: 293) 

Start ATG: 494 (Reverse strand: CAT) 

MSMTEKGGLQRVRNFLSCRQVLNLLTMLGFMLNYALRVNLTIAIVDMVRPNVTSAVNATLVGNSTAANSTASPDGVDVYEERFPWDSYQTNFVLG 
CFFWGYILTELPGGRLAELIGGRRVFGHSMLWASLLTLITPLAAHINYVVLIVVRVVLGFMLGASWPAIHPVAAVWIPPMERSKFMSNMMASSLG 
AAITMPICGYLISVAGWASVFYLTGAVGLLWSLAWFTFVYETPATHPRISAEERREIEEAIGTTTSKKRPSHVPWGQLLCSPAVWAIIICHGLAV 
FGFFTVVHQLPTFMSKILHFDIKQNGLFSSLPYLGKYVMAVASSYLADYLRKKGTLSTTATRKLFTTFALVIPGLLMIVQVFLGYDATWSVTIFS 
LALFAHGAVTAGYLGNGLDIAPNFGGTIFGLANTLSSFGGFLSTSMVGALTYKDQSFHSWQIVFWILAATYISAAVVFAILGSGELQPWNNPPER 
VRISDVTOEEGVPLKNEK* 
(SEQ ID NO: 294) 

Name: Na+-dependent inorganic phosphate cotransporter 
Classification: transporter 



Celera Sequence No. : 142000013384407 

AATTATTAAATTTTTGAAATTTTTTGAAAAATATAATCAGTTTTCTTTTGGATAGCAACAAAGAGAGTTGAACAATATTAGAATGCCATGCCCCA 
CTAAACTCAAAATTAAATTTCCCATTTAACTATATTTAAACATGGAAACGGTAATTTTATCGAAAATTTCGATGACTTACACCCTTATCGAAAAT 
GCGAACATTTTTATTTCGAATTTAAATAGTCATATGTGTTTATTGGCCGTACAATAAGGAAAAATCCTATTTCGACAATTGTTTTTATTTCAATG 
ACTTCTGGAAAGCAACGTAGCCCATGTTCCTTTTCCCCAATAAAAATGCGTTTTGCAGATAAAGGAAAAATACATTGATTGTGGTAGCTATTTTC 
ATTCTTATGAATACACACATAACATTTAAATACAAATACATTTACAATTGAAAATTTTCTCTACATATACAAAATATTTAATTTAAAAATAGGTG 
AAAAAAAAAAACAAGAATAAATTGTTGTCCCTAGGTTCGTTAGCTTAGTAAACACAATTGATGCGCCAGCGGCAGCGGAAGCAGCAGCAAAAAGA 
AGATCGAATTCACTCAACACAAATTAAAAAAAAAAATTGACTTTTCTTTTTGTTTATTAAAATGGCTATGGTGATCGGGAATTAAATTCTTCAAG 
TAAGCAGCCTTTTTTTCTGTTTCCCATTTCTAAATAAGAAGACATTATTCTCAGACGAGACAGAGTATTGCTTAGTAAAAACAACTTGAGGTCGA 
TTTCATTCAGTTTTAACTCTGCTTTCAGATTTCTTCTCATGATTAATATTTTTACACGCAATATCAATAAATTTATTCCTCACAGAATTGAAAGG 
AGAAGTCAAAGCGTTATTTAGAACTATTATTACATGCCCATAAATATTTTTAATGTTAGTCGTACATTTAAAAGCAAATTGAATTTAGTTAATCT 
CTTCCACAGGTTCCTCGCCTTTTTTGTTTTTTTTTTTTTTTTTGGATTTTTTGTATTTTGTTTTGTTTTGTAACTTTAAAAGTTTCTTCAATTAA 
TTGAGTCGCAAAATGAATTTTTGATTGATTTCATTGATTGATTGATTTGATCGATAGATACCGCCTTCTGTTTTCTGTTTCGTGCACTCTGAAAT 
CTCTGATTTTGATTTTGAGTTCAATTTCAATTTGAATAACTTCTAACACACATACACACACACGGGTATTTAATATTGGTTGTTTGGAGAGAGAA 
TTGCATATAACGCGCTTTGTGCTAGGGTTCGCTTGGTGTTCTGCTCGTGGATTGCATCGATTTATAGTCCGAGATATCGCTACATATATCGTCGA 
ATATAGTTTGTCGGTTGGCCTCTACGTTCACGTAGCTTGCTTTCAAAATTCGTTGTGTTCTTGTGTAAAAAAGAGATCTCAAAACAAAAGCGAAA 
ATTGCTGATTTCTTGATTGGTTGTCTCTGGTGGTAAGTAAGTTAGTTAGTTAGTTTGGTGTTCTAGTGCTGGCAAAAGTCCGGGAAAAAGTTTTT 
GGGACTCGCAGACAGGGATTCGGATTCGAATATTGGTGGGGAGTTAGCTGAGGTTAGGTCCATCGACCCAATGACTCATAGAGCTTAATAGTTAA 
ACTACAGATCGGCTCGTTATTTGTACGCGGCTACATGTATCCGTATCCGTATCTGAATCGCTACATCTTAAGTAAGTATGTATATGTATTTCCGA 
TTCGGCATCGGCTGGTAATTACAGCTAAATATCTTGTATAATTGTATATATATAGACAGAGAATAGCGAACATTATTGCTCAGACCCATTAGAAT 
AGTTGTTGGCTGCTCCCGATGCTGATCCTCCTCCTAGACGTGGCAACCGTTTTGAACTGTTTAATTAATACGGTGTATATACTTTATCATATATA 
ATATTGGTGTTTTGCTGTTGCTGCTTGAAGACTAATAGTTAACGGGGCGGCGTCCACGCGGCTTGATCTTCTCCGATATCTGCCACATCTCCTCC 
TCGCCCATAAAGTCCTTGAAGTTATCAAAGAAGCGTATAATGCGCTCGTTGCGTCGAAATGGATAGGCACTGCAGCAGAAAAAGAAAATTGCGTG 
AAGTGAAGTGAAAGAAGTGATTATAATTGAATGAAAATTGATAATTACCATTTCTTAAAACGTTTCATGTTGTCAATTATGTTGTACTGCTGCCA 
GCGCTTGGAGAAGTTTATGACGCCCTTGGACAAGTAGTCCTGGTTGCCCACATGCACAAACGTTAGATCCTGCAGAATTAGGCCGCTGGAACGAA 
CGAATTGATAATTAGCTTTGGCTTTCGTGATTCTAACAAAAAAAGAAACATCAGATCAGATCCTTACATGTAGGGTATGCAGGGCGGATTAGTTT 
CGGCCAGCGCCTGTCGATAGGCGCGAAAACTGGAGCTGGAATCGATGAGGGCGCAGAATGATCGCACCTCCTCGGTGATGCCTTTTTGCCACTCC 
AATCTGCAGGACATGGTATATCAACATATCAATGGTATATCAATAGCTTCAGTGGGCCAGTAGAACCTACCTTCTTATGGGACCCGAATCGAGGG 
CCGACAACAACGCCAGATACGAGTTGTAGTTGTTCATCTTGCGTAGATGCTTCATGATTTTGATAAACTTGTTCACATGCTTCTCCCGCTCCTTG 
GCATCCTGCAGGCGCAGGATTTTGGAGCGCGCCCAGTAGGACATCTTGTTGAAGTGCTCGGTGAACTTGTTTAGGTTGGGCGACTTCTCCTCGCA 
CTGATCTTTGGCAAATAGTAATACTTCGGGTATCTCGATCTTCGTGAACAGCTCCGCATCCAGCAGCGTCATCTGTTCGGCAATCTCCAGCGACT 
TGAGGTCGAGCAGACTAGGCTGGTTTCCACCACCAGCTGTACCACTGCATCCACCACTGCCGGCGATTCCTGCTCCGCCGACACTACCGGCTCCG 
CCCAACTCCCCAACAAAGCCGTACACCTTGGGCTCCTTGTACAGCGTCACCTTCTCCACGAACTTGTTGCGCAGCAACTTGGCCAAGTACAATTG 
CCCAGAGCAAACCAACTGATAGACGAACTCCACCAGCAGGCTCAACAGCTGGCTGGTTAGATCCGTCGACCTGCATGGTGAACAATGAATGATAA 
GCGGGTATGTACCAACTTAGTTTATTCCAAAAATGGTGGTTAGGCAACTCACGTTAGATCGTTGACGACTCGGACCAGCAGCGCAAAGGTCTCCT 
TAGCGGCCTTCTGCTTGTTGTCCTGCACTTGACAGAAGAAGTATGTGTAGCGATGGGTCAGCTTCTCGATCACGTCGATCGGCTGGATGAAGGTG 
CGAAAGGTGGTGATGAAGGCCTCGCAGAATGCTGCACGAACGGAATGAAAATGGACAGAGACAACAGAGGAGTTGGGTAAATGACCATGCGAACG 
ATATTTGGTATGTTTTTGCTGCTTCTGCTTGCGCTGCGAATGCCTACCATTGTCGGCGACCTTTTGGACACGGCTGGCGTGCACGATGAGGGCAT 
CGATGTAGCCGCCCTTCACCTCGGGCCCGTCCTCCTCTCTCTTCTTGAGTATCAGGTACCGTGTGATGTTGACCTCTTCCAGCATGTTGATCAGC 
ATATCCGCTCGCTCCGCCTCCTCATCCTCGTCGACGTTCGCTGGCTTAACCGACTCGTGCACAGCGACCAGTGGTCGGGACTCCGCCGTTGGCTC 
CTCCTGCTCCTCGAGCACTTGGGGCGTGGTTATTGGCTGGTTGTCGGCACTGCTCGCATCCTCGCTCATCTCCATTTCGGCGGCGGGCAACTGAT 
GTGAGTGACAGTAGAAAGTCTGTCCATCATCGACGGCGCTCACATTCTCATTGCTGTCCAGCACGGGCGAACGGGGCTCCGGCATCGCATCCTCG 
CTGGCCACTGCATCGTTCAGCTCCTCGCATACGGTGGCCATCCGGCTGTCCGGCCTGCCCGTCGACCCATTCTCGCCCAGCGTCGGCTTCGGACT 
CGGCGGTGGCGTGGTCACAATGACGGGCTGCAGCGTGGGTGATGCTACCATGCTTGTCGCCTTGCGCTGCTTGTAGTTCTTTGGCGGCAGGGCAG 
GTGGCATCTCGAGCTCCGGCGACAGTTCCTTGCTCATGGGCATAATGCTGTAAAGGTAACAAAAAAATAAGGTTAGTACCGCCTAGTAACTACTA 
GCTACACACAACTTTACTCACTTCATGGTCTGGCTGTGCGAGATGTTGCGACTTATGTTGCAGGCGTGCATTGTGTGACGATGCTGCTCAATGGA 
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TCGCGTGGACGCCGAACAGATTTCCATGTACGCCAGZVACTGTGTGCGGTAGGGTAGGGAAGATGAAATCGTTAGCAACTCTTACGCTTCAGTGGC 
GGGGGACAGAAACGAAAACTAAGCCGATCGATCGATCCTTGGACCCACATCCCGAAGGTCTTTGTCTGTTCACCAGATCTACAGGAATCGCAACA 
TACAACACAGCACAACGAGTAACAGGACTGGATAGTAACACAGAGCAGGAGCTTTCGGGGCTTTGGACTGGAGATACTAGCTCTGGACGGACACA 
GCGGGTTAGGCGGAGCGAAATGGAAGCCACAGAACTGTTGCTCAGTCCATGTACTTGGGACTAGAAACGGAAACTGCAAACAGGTGGCTGTTCGG 
GCGTCAAGTCAATACGAATGCGGCGCAGCAGCAAAGTTATTAGCAGAGCAAGAAGCCAATAGCGAACTCGAAATGCTCTTTCCATCATGGTTATT 
AACTGGATAGCAAAGGGTATTCACCTTATGTAAAGTATTATAAAAACGAAAGAGGATAGTTTAATAGCTATCTTTGTGTTTACACACTAGATTCC 
CATTCGGTTGGACCATGGAAATCTCTGCTCAATGGTGAAATGAAGCCAAATTTAATCGAAAGGAGTGTCATTCAGTACTGAGGTAGGGAATAGAG 
CATTGCGAGCTTTCCAAGGGCGCCGTGCAAAAAAAGTTTGTTTCGTGGGTGATTTTGTTTTATTTCGTTTTTGTTTGGCATTTGGCAGCAAAAGT 
TGGCTTAGTGTTTTTGTTTCGTGTTGGTCCTCTAGTAACACACACAGACTTACCCGAAAAGGCCACACTTTGAAACACTAGATGTTGGTTGGAAC 
GATAATTTAGATGGGTTAGACTCTTGCGTACAACAATTTGGCAAACTTTCTTGTCGCCAGCATTTCTAAGGGGTTTGCGATAATGCGGGTATGTG 
CTATAAATAAAATGGTCTTAAAGAAACAAACAAATCTCGAAAAACGGACTGCTCTGCCTGGCTACCCATATAAAAGGTGATGACTGATTGACTGA 
TTGCCTGGTTGACTGATTGATTTGTTTAACTAACTAATTGGAGGTCTGATTGTTGTGGTGTTTCTGTTTCGTCTGATTTTTGTTTGGGTTTAACA 
ATGGAGTAGCCTATCAAATGGATCTGATGGAATGGATACTGCTGCAACAGGAGGACAGTGATGCAGGAAACCCTGAAAATGCGCTCAACAAAACA 
CCCATTGACTCACCGCTCGGCCTGGCCTGGGCGGTTCCGAAAATGACAAAAAAAAAAAAAATGATATAAAAGTAATTTACACAATATACACAACA 
ACAAAAAACACACACACGGTACGAGATGCAGAACCAAACCAAATCGAAAAAAAAACGGAAAAAAATAAGAAAGAAAAATAGGAAGAATCGATGTA 
AATAAAGTGTGCATCGTGGGCAAACTGACTTACTGTGCTTCTTTTTTATTGGCAGCGGCGGTGGCTCCTGATCCAAGTGTCGCTGATCGAACGCA 
CTGCAGCTGCCACTCCCCGCCAGCTGGAGGCGGGGTCCCTCGATCAGGCTGTGGTGCTTGGAGTGCCACTGCTGCAGTTGACCGGCGGACGCAGT 
ATGCGGATGGTGATTGAGGTGATGGAGCGGCGACAGCTGACGCAGCTCGGCCACCTCCGACTGGCTGCTCCGATGGCTGGCCCAGCCGGAGGAGG 
ACTGCACCTCGTCCAGCTCATCGCCAACTCCGGGCGAGTCGTGGCGGGTTAGGCTGCTCCGTTGGATGCTCTTTGGCGGCAAGGCGGGCGCCAGA 
TCGGCCGGCGTCGTCGTCGTCTGCTGCTGCTGCTGCTGCTGCAGCTGCTCCACTTCGCTGCTGCCATAGCCGGTGGTTGTGCTGCTGCTGGTGGT 
CGTGGTGGTGCAGCTGCTACTGCTCGAGGATATATGACGCTGGCTGTGCGACACCAACTGGCTAATCTGCTGGTACTCGCTACTGCTGCCGACCG 
CCGTCGACTCACTGATGCTGAACGCAATCTCCGAATTGCTGCTGGACGACTTCGTGGAGGACTGGACACTGTAGTCCGTCGTCTGTGTGGAAGTG 
C G AAAC T C ACGC AT C G AC ACG AAAC CT G GC G AT GG AAGG AGAGT GG T GT T AG AAAT GGT C AAAAAT GC AGT TCTCGTTGT G GGGAAC GGG T AG CC 
AAAACCAAGCGGAAATTATATAATAAGCCGGAGGAAATCTGTGGAAAGGATCATTACGGAGGCGGGAGAATAAAGCCAACTATTTACAGCATCTA 
GGACAAACATGTATACAATACGCCTAACGAAAAATTTGGAAGGGAATTCATTGGGCGCGTGCAACAGATACAAAATAATAAGCGGGATTTTAAGT 
ACGATGATAGATGTTCTGGGTACCCGATTCGTTTGAGTGGCGATTGCTGTTAGTTTCCCCGTCACCAGACGCTGCAGCAGCTACACCTTCGCCTG 
CTCCTCCAGTTCCACCAGCAACACCAGCTATACCACCACCAGTCGAAAGTGGTGTCTGCGTTTTGTCGTCGATTGCGGCGCCTGCGTAACCGGTG 
GATTCTGTACGGGATTCGTTGATTGGGGGGGCCACCACTCGAGTTGAGCTGAGTGAGGGGAGTGGCGAAGCCTTGCGGCAATACTCACTGTAGCT 
GACCATCTTGTCCATGTCTTCGTCCATCATCGCAGCCAACTTTGGAAATGACCTCAGATGCTGATGTTGCTGTTGCTGGTCCTCCTCCTCGCGTG 
AGTGATTCAACGCCGAGTCAAAGGAGCACTGACTATTCTCATCCGGTGACCGTGAACGCACGGATAGGCTGTGGAATGTGGGTGCACCAATTATT 
GTAGTGATTGCTAGTGATTGCTTTAATCTGCAAGCCACTTACCGATCCACGCCATAATTCAGCAGCGAGATATTGGAACTGCAGTCCAGGTCCGA 
GTTCAGACTAATATTATGGGACTGGGCGTAGGGCAGTGGACTGGCCTGATTGGAGGTGGATGTCGACGATGAGCAGCCTACGCCCACTGTACCAG 
CTGATGCGCTCGGCTGGCTGCGTCGCTTTGGTGGCAACGGCGGCGGATTACTGGCCCTATTGGGTAGCGGTGGCTTCGGCGGTGGACTCGTGTCG 
CGCAGGATACTTTCGGTGCTGTGGGAGCCGCGCATCGGGTTCACATTGTGCTGCTCCAGTATGTCGCGCTCCTTGGGCGTGAGCGCTATGTCCGG 
CAACGAAGTACGCTGGGCAGCTGATTCGGCGGCCGAAACGCGCATGACTCCAGTACCGGGAACACTCGCTCCCGGCGAGGCGGTGACCGCACCCA 
TGATCTCCGCCGCCGCTCCAATGCCGCCCAAGCCAGATCCACTGTAGCGAAAGGCGCACTGATCCTGCTCCTTCAGCTTGCCCTGCGCCAGCGTA 
ACGAGATTCTGTGCACGATCATGAAACATCATCGTATTTATTATAGCAATACACTGTGTACTAGGTTAGTGTTAATTGGACTTACCCGCACAGCA 
TCCTCGAGAAGATCAATGACTTCCCGCACATTCTCGTTGCTCAGGGAGGCGCACTCGCCGCTGTCCTCGGAGAGCATCACCTCGTCGCAGAGCTT 
GATGAGCTTGCCCAGGCTCTGGTAAACCTGCAGCGTGGCGCTGCTCATGATGGCACTGTTTTCATTCAGGGTATAGGTTTGGATCACTGCGGCAG 
CAAAGGAAAGCAACTGGATTAATTACAAAACGAGGAAATTGGGGTAGTAGATACTCACCGGAGTACATGCTGGCTATGGTTTCCAGAATGACCGT 
TCCGTTGCCCGGCAACACCTCCAGCTTCTTCTTGAGTATAACATCCCGGAAGTGCTTCAGGGCATTCGAAATGTCCTTGACGTGCGTCTCCAGGG 
TTTGGGTGGACCGTAGCACCTCCTCGGTGGTGGGCGGTGCCTTTGTGCCGTGCTTGGTGCGCGGACTGTGCGGCGAGTGGGAGCGACCCAGTGTA 
TTGTTGGTGGTTCGCATCAGGGAAATCTTCTCGATGAGGTCGTCCTTAAAGGAACGCGCCCGGCGCGCTAGCTTGGTGCCCTTGAGACTGTTCTT 
GTGGCCCGAGGATCCGGCACTGGGTGGTGCCGGTGGTACACCACCACCTCCGCCTGCTCCTGCTCCCCCTACCCGACTAGATCCTCCCGGCGTGC 
CCTGCAGCTGCTGGTGCTGCGAGTGATGCTGATGCGTGGGTGGTGGCGGAGTGCAGGCGGTGGAGTAGCTGCCGCTGTTGATGCTATTGTTGCTG 
CTGCTGCTGCAGCCGCCACCGCCCACTCCGATGCCACTGGAACAGGTGCCGGGCGTGGATGGAGAACTGATGCTGCCGTCTGCAAAAAGAGCAAG 
AAATCGGGGATTATTAATCTAAGGTAGGTACACTTAATCAATTCATAGGCATTAGATAACAATATACGTTTTTGGGTAACAATAATAAGAGACTA 
AGACTGGAATCTAGATCCTATATATATACACATATCATGGGCTTAGAGTCTTTCCTGTTCGAGCCGCATTCTTGAATGCCAATTTGAAAATCAAA 
AGTTGCTCCATATAAAACCACAATCTTTTGCAATATATGGCAATGTTGGTATCGTTTCGTTTCTTACTTTTCTTCATTTGGTTAGGTAAGATGAA 
TATCTTAAGGTCATCGTATACATCATTCTAAAGTGGCTAAGCCAGCAGCGAAAATGAAAATCTTTCAGAGTTAGCCAAACAGCTTGAGGAATCTT 
GAAAAGATTCATCATTACCCCTTCACAACACAATATATGTACATATGTACTCATTCGAATCTTGAAACGTGTCTACTTAAATGTGAAATAATTGC 
AAT T GAAAT C AAC AT A T GT AAT AC GG AAT T CC T AG C AT T C C C C AT T T CGGG AAT C GT T C AT GCG AGCC C AAAC AT T T T AGT T T CG G AA C T TT T T T 
AAAG AT CC AAAAC AGC GAG AAAC C GAT GAG C AC C ACC C AC C AAG C G AAAAG AC C GGC G AGT AGC T T TC AC T T AC GG C AC AC C GCATTCTTGAGCG 
GCACAGGTGTGTCCATCTCCATATCGAACTCCCTATCCACGGTCACATTCACATCGACATCCACATCCATCTGCAGTGGAATCGCTGGCGTTGAT 
AGCGCCAATCGTGCCAGCTTTTTGTATTTGCGACGTCTGTGGAATTTGATGCTATGCAACGTGTTGCCGGTGGCACTGATGCTGTTGCAGGTGTT 
ACTGCTGTTGCAGTTGCAGTTGTTGCTTGTGAGGCTATGATTGTTGTTGGTGTCGCGCGGCGACCACGACTGCCGCTGAGTCTTCCAAAGACGAT 
GATGATTGTGATGGTGCTGTTGATGATGATGATGATGGAGTTGGTGCTGATGGTCATCGTGGCTGGCACCGCTGGGATACGGATTCCGATTCCGA 
TTCGGTTTCGGACTCGGATGTGGCGGCAATTGCTTGACCGTGTACGAATAGAAATGCCAGCGGTCGGCGAGCGCACAGTCGCTCAAAAAAGATTC 
ATCAAACTGCGGCATAATGTTGCATAATGACAGCAGCAGCAGCAGCAGCAATAACAACAGCAGCAACAGCGACAGCAACACGCATAACAACAATT 
TGTTGTGGCTTAAATATGCTTGGCCAAAAGACACACACGTTGGCCTAACTGATTTCTCCAAATAGCGATAAGCGATAGCAGCAACATCGTTTTCG 
TATAAGATTGTTGATTTAAGCGATTGCCGTCGCGATTACAACATCAATTCATTTTTCTGTTTCGGCATGTTTTACATTGATTATTTCACTTTCAG 
TTTTCGGTTGTTTCAGTGTTTTCGGAAAAAGTTTTAAGCACTGATAAAAGCTATATCCGATGTGCTCTGCCAGCAGGAAGTAGATTTCAATTGCT 
TTCTCATCATGTTCGTTCACTTTTGCAATTCGTTCGATAGTCCGATAGGCAATTCAAGCGATTACCGATTCAAGCTTCCTGTGGCAACCGCAAGA 
ATTGTTCTGGACACTCTTTGTTTGACTGAAATACAAGTATGCCAGCTTAGAGATCACTTCAATTGGCCATAGTGATTTCCGTATATTATTTTCTT 
TAATGTTTAAAAGCCATCAAGCCCTAATTTCCGTTTCTTACTGCCTGCCCGATTGTGGTAAATTGAAGGTAATCAACTCCAAAGCCACTTGTCTT 
TTGCTTTACTTATATTTCAAGTGCGATTTCCGCAATTTTCCCCAATAGCTTTCCATTCGCCGCACAGTCACTTTTGCATTTCTGAACTTGAAATT 
TCAACAGGCTTCGGCTGATTGATTCAATCGCCGATAAGCGATAAGCTAACACTTTTGGCCTGCCACTCTCAGTGCAATTTCTTTCACTTTTCAAG 
TGCAATTTGGCGCTCTTTTCTTTGATTTTTCATCATCCATCCATCCATCCATCTCGTTGGCGCACCGCCCCCGCAGTGGGCTATCTATCTGGTTG 
GCCTCTGGGTGAGGAGATGTTGATTAAATAAATCACAAATAAAGATGATTATATTTAGACACTGGCGCTCTGCTGCTGCTGCTGCGGCGGCTGTC 
GTCGTAGTCCGTTCGCGATGTTGTCGTTGTCGACTGGAAAGCGCACTGAACTATGAAAATCCGCAAAACGGGCAAAGCCAAAGACGACCGACCGA 
CTGAAAGACCGACCAGGCGGCAAAAGCGAGTGTCCGAGTCTTGGAGACCCGCTAACCGGCCAACGGCAATGGCAACACATTAGCAATATAGCAAT 
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ACAGCAATAGCAGCAACAACAGACGCAGCAGCAGCAGCAGCAGCAACACGAGAGCCGCCGAGGGGGCAAAAAAAAAAAAGAGCCCCAAATGGCAC 
TTGCCCACCTGGCAATCGCCCTCTCACCATCTCTCCTGATCTCTCCTGCTGCTCTCGCCACATCGCTCTTTCTCTTTCCGAGATTCATCAAACAA 
ATCATACATGTTCACATCACTGATGGACCAAACGGGTGCTACAAACGTATCCATTCAATGTGGCTCGGAAATCGTGCAAATATTCAGTATATATT 
TACGATAATATTTCTATTTTAAAATGACAAGTTACTTTACTGTTTATACTTGCAGGTACACCCCCCTTTCTCGCAGTGCAGTCCGCGGTTTGCGG 
CTCTTTTCACCCGGGAGGCTGCTCTCCAACCGTCGGCCTCTCCGCCCAATTATCAATCAGCTGGCGCACCTGAGAAAGAGCCCGTGGCGAAGGGT 
GAGGGGGCGGTAATTGGGGGCGACAAGGGAGTGGTAAAGGGGGGGGGGTGAGGTGGATGCGGGGCAAGAGACAAGAAGACGCCGGGCGGATTTCT 
TCAATTTGCAAGTGCCGCTCGTG 
(SEQ ID NO: 295) 

Exon: 10613.. 9290 
Exon: 8629. .8039 
Exon: 7971.. 7781 
Exon: 7703.. 7168 
Exon: 7098.. 6929 
Exon: 6390. .5734 
Exon: 4313.. 4202 
Exon: 4 132.. 34 68 
Exon: 3356.. 3188 
Exon: 3110.. 2541 
Exon: 2473.. 2348 
Exon: 2270.. 2139 
Exon: 2064.. 1001 

Start ATG: 9380 (Reverse strand: CAT) 
Transcript No. : CT10408 

GAGTGGCAGGCCAAAAGTGTTAGCTTATCGCTTATCGGCGATTGAATCAATCAGCCGAAGCCTGTTGAAATTTCAAGTTCAGAAATGCAAAAGTG 
ACTGTGCGGCGAATGGAAAGCTATTGGGGAAAATTGCGGAAATCGCACTTGAAATATAAGTAAAGCAAAAGACAAGTGGCTTTGGAGTTGATTAC 
CTTCAATTTACCACAATCGGGCAGGCAGTAAGAAACGGAAATTAGGGCTTGATGGCTTTTAAACATTAAAGAAAATAATATACGGAAATCACTAT 
GGCCAATTGAAGTGATCTCTAAGCTGGCATACTTGTATTTCAGTCAAACAAAGAGTGTCCAGAACAATTCTTGCGGTTGCCACAGGAAGCTTGAA 
TCGGTAATCGCTTGAATTGCCTATCGGACTATCGAACGAATTGCAAAAGTGAACGAACATGATGAGAAAGCAATTGAAATCTACTTCCTGCTGGC 
AGAGCACATCGGATATAGCTTTTATCAGTGCTTAAAACTTTTTCCGAAAACACTGAAACAACCGAAAACTGAAAGTGAAATAATCAATGTAAAAC 
ATGCCGAAACAGAAAAATGAATTGATGTTGTAATCGCGACGGCAATCGCTTAAATCAACAATCTTATACGAAAACGATGTTGCTGCTATCGCTTA 
TCGCTATTTGGAGAAATCAGTTAGGCCAACGTGTGTGTCTTTTGGCCAAGCATATTTAAGCCACAACAAATTGTTGTTATGCGTGTTGCTGTCGC 
TGTTGCTGCTGTTGTTATTGCTGCTGCTGCTGCTGCTGTCATTATGCAACATTATGCCGCAGTTTGATGAATCTTTTTTGAGCGACTGTGCGCTC 
GCCGACCGCTGGCATTTCTATTCGTACACGGTCAAGCAATTGCCGCCACATCCGAGTCCGAAACCGAATCGGAATCGGAATCCGTATCCCAGCGG 
TGCCAGCCACGATGACCATCAGCACCAACTCCATCATCATCATCATCAACAGCACCATCACAATCATCATCGTCTTTGGAAGACTCAGCGGCAGT 
CGTGGTCGCCGCGCGACACCAACAACAATCATAGCCTCACAAGCAACAACTGCAACTGCAACAGCAGTAACACCTGCAACAGCATCAGTGCCACC 
GGC AAC ACGT T GC AT AGC ATC AAAT T CC AC AGACGT C GC AAAT AC AAAAAGC T GGC AC GAT T GGCGC T AT C AAC GC C AGCGAT T C C AC T GC AG AT 
GGATGTGGATGTCGATGTGAATGTGACCGTGGATAGGGAGTTCGATATGGAGATGGACACACCTGTGCCGCTCAAGAATGCGGTGTGCCACGGCA 
GCATCAGTTCTCCATCCACGCCCGGCACCTGTTCCAGTGGCATCGGAGTGGGCGGTGGCGGCTGCAGCAGCAGCAGCAACAATAGCATCAACAGC 
GGCAGCTACTCCACCGCCTGCACTCCGCCACCACCCACGCATCAGCATCACTCGCAGCACCAGCAGCTGCAGGGCACGCCGGGAGGATCTAGTCG 
GGTAGGGGGAGCAGGAGCAGGCGGAGGTGGTGGTGTACCACCGGCACCACCCAGTGCCGGATCCTCGGGCCACAAGAACAGTCTCAAGGGCACCA 
AGCTAGCGCGCCGGGCGCGTTCCTTTAAGGACGACCTCATCGAGAAGATTTCCCTGATGCGAACCACCAACAATACACTGGGTCGCTCCCACTCG 
CCGCACAGTCCGCGCACCAAGCACGGCACAAAGGCACCGCCCACCACCGAGGAGGTGCTACGGTCCACCCAAACCCTGGAGACGCACGTCAAGGA 
CATTTCGAATGCCCTGAAGCACTTCCGGGATGTTATACTCAAGAAGAAGCTGGAGGTGTTGCCGGGCAACGGAACGGTCATTCTGGAAACCATAG 
CCAGCATGTACTCCGTGATCCAAACCTATACCCTGAATGAAAACAGTGCCATCATGAGCAGCGCCACGCTGCAGGTTTACCAGAGCCTGGGCAAG 
CTCATCAAGCTCTGCGACGAGGTGATGCTCTCCGAGGACAGCGGCGAGTGCGCCTCCCTGAGCAACGAGAATGTGCGGGAAGTCATTGATCTTCT 
CGAGGATGCTGTGCGGAATCTCGTTACGCTGGCGCAGGGCAAGCTGAAGGAGCAGGATCAGTGCGCCTTTCGCTACAGTGGATCTGGCTTGGGCG 
GCATTGGAGCGGCGGCGGAGATCATGGGTGCGGTCACCGCCTCGCCGGGAGCGAGTGTTCCCGGTACTGGAGTCATGCGCGTTTCGGCCGCCGAA 
T C AGCT GC C C AGC GT AC TTCGTTGCC GG AC AT A GC G C T C AC G C C C AAG G AGC GC G AC AT AC T GG AGC AGC AC AATGT G AAC CC G AT GCGCGGCTC 
CCACAGCACCGAAAGTATCCTGCGCGACACGAGTCCACCGCCGAAGCCACCGCTACCCAATAGGGCCAGTAATCCGCCGCCGTTGCCACCAAAGC 
GACGCAGCCAGCCGAGCGCATCAGCTGGTACAGTGGGCGTAGGCTGCTCATCGTCGACATCCACCTCCAATCAGGCCAGTCCACTGCCCTACGCC 
CAGTCCCATAATATTAGTCTGAACTCGGACCTGGACTGCAGTTCCAATATCTCGCTGCTGAATTATGGCGTGGATCGCCTATCCGTGCGTTCACG 
GTCACCGGATGAGAATAGTCAGTGCTCCTTTGACTCGGCGTTGAATCACTCACGCGAGGAGGAGGACCAGCAACAGCAACATCAGCATCTGAGGT 
CATTTCCAAAGTTGGCTGCGATGATGGACGAAGACATGGACAAGATGGTCAGCTACAGTTTCGTGTCGATGCGTGAGTTTCGCACTTCCACACAG 
ACGACGGACTACAGTGTCCAGTCCTCCACGAAGTCGTCCAGCAGCAATTCGGAGATTGCGTTCAGCATCAGTGAGTCGACGGCGGTCGGCAGCAG 
TAGCGAGTACCAGCAGATTAGCCAGTTGGTGTCGCACAGCCAGCGTCATATATCCTCGAGCAGTAGCAGCTGCACCACCACGACCACCAGCAGCA 
GCACAACCACCGGCTATGGCAGCAGCGAAGTGGAGCAGCTGCAGCAGCAGCAGCAGCAGCAGACGACGACGACGCCGGCCGATCTGGCGCCCGCC 
TTGCCGCCAAAGAGCATCCAACGGAGCAGCCTAACCCGCCACGACTCGCCCGGAGTTGGCGATGAGCTGGACGAGGTGCAGTCCTCCTCCGGCTG 
GGCCAGCCATCGGAGCAGCCAGTCGGAGGTGGCCGAGCTGCGTCAGCTGTCGCCGCTCCATCACCTCAATCACCATCCGCATACTGCGTCCGCCG 
GTCAACTGCAGCAGTGGCACTCCAAGCACCACAGCCTGATCGAGGGACCCCGCCTCCAGCTGGCGGGGAGTGGCAGCTGCAGTGCGTTCGATCAG 
CGACACTTGGATCAGGAGCCACCGCCGCTGCCAATAAAAAAGAAGCACATTCTGGCGTACATGGAAATCTGTTCGGCGTCCACGCGATCCATTGA 
GCAGCATCGTCACACAATGCACGCCTGCAACATAAGTCGCAACATCTCGCACAGCCAGACCATGAACATTATGCCCATGAGCAAGGAACTGTCGC 
CGGAGCTCGAGATGCCACCTGCCCTGCCGCCAAAGAACTACAAGCAGCGCAAGGCGACAAGCATGGTAGCATCACCCACGCTGCAGCCCGTCATT 
GTGACCACGCCACCGCCGAGTCCGAAGCCGACGCTGGGCGAGAATGGGTCGACGGGCAGGCCGGACAGCCGGATGGCCACCGTATGCGAGGAGCT 
GAACGATGCAGTGGCCAGCGAGGATGCGATGCCGGAGCCCCGTTCGCCCGTGCTGGACAGCAATGAGAATGTGAGCGCCGTCGATGATGGACAGA 
CTTTCTACTGTCACTCACATCAGTTGCCCGCCGCCGAAATGGAGATGAGCGAGGATGCGAGCAGTGCCGACAACCAGCCAATAACCACGCCCCAA 
GTGCTCGAGGAGCAGGAGGAGCCAACGGCGGAGTCCCGACCACTGGTCGCTGTGCACGAGTCGGTTAAGCCAGCGAACGTCGACGAGGATGAGGA 
GGCGGAGCGAGCGGATATGCTGATCAACATGCTGGAAGAGGTCAACATCACACGGTACCTGATACTCAAGAAGAGAGAGGAGGACGGGCCCGAGG 
TGAAGGGCGGCTACATCGATGCCCTCATCGTGCACGCCAGCCGTGTCCAAAAGGTCGCCGACAATGCATTCTGCGAGGCCTTCATCACCACCTTT 
CGCACCTTCATCCAGCCGATCGACGTGATCGAGAAGCTGACCCATCGCTACACATACTTCTTCTGTCAAGTGCAGGACAACAAGCAGAAGGCCGC 
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TAAGGAGACCTTTGCGCTGCTGGTCCGAGTCGTCAACGATCTAACGTCGACGGATCTAACCAGCCAGCTGTTGAGCCTGCTGGTGGAGTTCGTCT 

ATCAGTTGGTTTGCTCTGGGCAATTGTACTTGGCCAAGTTGCTGCGCAACAAGTTCGTGGAGAAGGTGACGCTGTACAAGGAGCCCAAGGTGTAC 

GGCTTTGTTGGGGAGTTGGGCGGAGCCGGTAGTGTCGGCGGAGCAGGAATCGCCGGCAGTGGTGGATGCAGTGGTACAGCTGGTGGTGGAAACCA 

GCCTAGTCTGCTCGACCTCAAGTCGCTGGAGATTGCCGAACAGATGACGCTGCTGGATGCGGAGCTGTTCACGAAGATCGAGATACCCGAAGTAT 

TACTATTTGCCAAAGATCAGTGCGAGGAGAAGTCGCCCAACCTAAACAAGTTCACCGAGCACTTCAACAAGATGTCCTACTGGGCGCGCTCCAAA 

ATCCTGCGCCTGCAGGATGCCAAGGAGCGGGAGAAGCATGTGAACAAGTTTATCAAAATCATGAAGCATCTACGCAAGATGAACAACTACAACTC 

GTATCTGGCGTTGTTGTCGGCCCTCGATTCGGGTCCCATAAGAAGATTGGAGTGGCAAAAAGGCATCACCGAGGAGGTGCGATCATTCTGCGCCC 

TCATCGATTCCAGCTCCAGTTTTCGCGCCTATCGACAGGCGCTGGCCGAAACTAATCCGCCCTGCATACCCTACATCGGCCTAATTCTGCAGGAT 

CTAACGTTTGTGCATGTGGGCAACCAGGACTACTTGTCCAAGGGCGTCATAAACTTCTCCAAGCGCTGGCAGCAGTACAACATAATTGACAACAT 

GAAACGTTTTAAGAAATGTGCCTATCCATTTCGACGCAACGAGCGCATTATACGCTTCTTTGATAACTTCAAGGACTTTATGGGCGAGGAGGAGA 

TGTGGCAGATATCGGAGAAGATCAAGCCGCGTGGACGCCGCCCCGTTAACTATTAGTCTTCAAGCAGCAACAGCAAAACACCAATATTATATATG 

ATAAAGTATATACACCGTATTAATTAAACAGTTCAAAACGGTTGCCACGTCTAGGAGGAGGATCAGCATCGGGAGCAGCCAACAACTATTCTAAT 

GGGTCTGAGCAATAATGTTCGCTATTCTCTGTCTATATATATACAATTATACAAGATATTTAGCTGTAATTACCAGCCGATGCCGAATCGGAAAT 

ACATATACATACTTACTTAAGATGTAGCGATTCAGATACGGATACGGATACATGTAGCCGCGTACAAATAACGAGCCGATCTGTAGTTTAACTAT 

TAAGCTCTATGAGTCATTGGGTCGATGGACCTAACCTCAGCTAACTCCCCACCAATATTCGAATCCGAATCCCTGTCTGCGAGTCCCAAAAACTT 

TTTCCCGGACTTTTGCCAGCACTAGAACACCAAACTAACTAACTAACTTACTTACCACCAGAGACAACCAATCAAGAAATCAGCAATTTTCGCTT 

TTGTTTTGAGATCTCTTTTTTACACAAGAACACAACGAATTTTGAAAGCAAGCTACGTGAACGTAGAGGCCAACCGACAAACTATATTCGACGAT 

ATATGTAGCGATATCTCGGACTATAAATCGATGCAATCCACGAGCAGAACACCAAGCGAACCCTAGCACAAAGCGCGTTATATGCAATTCTCTCT 

CCAAACAACCAATATTAAATACCCGTGTGTGTGTATGTGTGTTAGAAGTTATTCAAATTGAAATTGAACTCAAAATCAAAATCAGAGATTTCAGA 

GTGCACGAAACAGAAAACAGAAGGCGGTATCTATCGATCAAATCAATCAATCAATGAAATCAATCAAAAATTCATTTTGCGACTCAATTAATTGA 

AGAAACTTTTAAAGTTACAAAACAAAACAAAATACAA 

(SEQ ID NO: 296) 

Start ATG: 1234 (Reverse strand: CAT) 

MDVDVDVNVTVDREFDMEMDTPVPLKNAVCHGSISSPSTPGTCSSGIGVGGGGCSSSSNNSINSGSYSTACTPPPPTHQHHSQHQOLQGTPGGSS 
RVGGAGAGGGGGVPPAPPSAGSSGHKNSLKGTKLARRARSFKDDLIEKISLMRTTNNTLGRSHSPHSPRTKHGTKAPPTTEEVLRSTQTLETHVK 
DISNALKHFRDVILKKKLEVLPGNGTVILETIASMYSVIQTYTLNENSAIMSSATLQVYQSLGKLIKLCDEVMLSEDSGECASLSNENVREVIDL 
LEDAVRNLVTLAQGKLKEQDQCAFRYSGSGLGGIGAAAEIMGAVTASPGASVPGTGVMRVSAAESAAQRTSLPDIALTPKERDILEQHNVNPMRG 
SHSTESILRDTSPPPKPPLPNRASNPPPLPPKRRSQPSASAGTVGVGCSSSTSTSNQASPLPYAQSHNISLNSDLDCSSNISLLNYGVDRLSVRS 
RSPDENSQCSFDSALNHSREEEDQQQQHQHLRSFPKLAAMMDEDMDKMVSYSFVSMREFRTSTQTTDYSVQSSTKSSSSNSEIAFSISESTAVGS 
SSEYQQISQLVSHSQRHISSSSSSCTTTTTSSSTTTGYGSSEVEQLQQQQQQQTTTTPADLAPALPPKSIQRSSLTRHDSPGVGDELDEVQSSSG 
WASHRSSQSEVAELRQLSPLHHLNHHPHTASAGQLQQWHSKHHSLIEGPRLQLAGSGSCSAFDQRHLDQEPPPLPIKKKHILAYMEICSASTRSI 
EQHRHTMHACNISRNISHSQTMNIMPMSKELSPELEMPPALPPKNYKQRKATSMVASPTLQPVIVTTPPPSPKPTLGENGSTGRPDSRMATVCEE 
LNDAVASEDAMPEPRSPVLDSNENVSAVDDGQTFYCHSHQLPAAEMEMSEDASSADWQPITTPQVLEEQEEPTAESRPLVAVHESVKPANVDEDE 
EAERADMLINMLEEVNITRYLILKKREEDGPEVKGGYIDALIVHASRVQKVADNAFCEAFITTFRTFIQPIDVIEKLTHRYTYFFCQVQDNKQKA 
AKETFALLVRVVNDLTSTDLTSQLLSLLVEFVYQLVCSGQLYLAKLLRNKFVEKVTLYKEPKVYGFVGELGGAGSVGGAGIAGSGGCSGTAGGGN 
QPSLLDLKSLEIAEQMTLLDAELFTKIEIPEVLLFAKDQCEEKSPNLNKFTEHFNKMSYWARSKILRLQDAKEREKHVKKFIKIMKHLRKMNNYN 
SYLALLSALDSGPIRRLEWQKGITEEVRSFCALIDSSSSFRAYRQALAETNPPCIPYIGLILQDLTFVHVGNQDYLSKGVIWFSKRWQQYNIIDN 
MKRFKKCAYPFRRNERIIRFFDNFKDFMGEEEMWQISEKIKPRGRRPVNY* 
(SEQ ID NO: 297) 

Name: GUANINE NUCLEOTIDE EXCHANGE FACTOR DC3G 
Classification: signal_transduction 
Gene Symbol: C3G 
FlyBase ID: FBgn0026145 



Celera Sequence No. : 142000013384832 

CAGCGACAGATCAGGCGGTTCCGGCGGACGGTCATATACACCATATACGACGTGAAGGTCTGCTCCAGCATCATACAAGACATAGCCGCCAAGTT 
GTCCCCGTACTTCAGCTCAACCCCGAACTCGTACTACAGCGACTCCCTCTTTTTCCTGTCCCTCGGCCTTCACAGGATCGCCGACAGGAGCCTAA 
ACAGAGCCATTCTGCTAGATTGTGATATAGTCTTCCGATCCGATGTCCGACTGCTGTTCAATGAGTTCGACAACTTTCTGCCGCACCAGCTGTAC 
GGACTGGCACCGGAACTGACCCCCGTCTACCGCCACATCCTCTACCGCTATCGCGTGCGCTATCCGAAGACCAGTTTCGGGAATCCCTACTATCC 
GATCAACAACGAAGGGGGCAACCAACATAGCCGTGTCCACCACGGCTATCCAGGCCTGAACTCCGGCGTGGTCCTGCTGCTCCTAAACCGCATCC 
GTAACTCAAAGTCCTACCTGGAGAAGCTGACCCACTCGGAGGTGCACACCCTGGTCGCCAAGTACTCCTTCAAGGGCCACCTCGGCGACCAGGAC 
TTCTTCACCCTGCTGGGATACGAGTACCCGAACTTAATCTACAGACTGGACTGCATTTGGAACCGCCAGCTGTGCACCTGGTGGAAGGATCACGG 
ATACTCACAGATATTCGACGCTTATTTCCGGTGCGAGGGCAATATTAAAATGTACCATGGCAACTGTAATACGCGAATTCCAGAATAAATAAAAT 
TGTTATAGGAATATTAAATTAATTAAATGCAGGGTATTTTTATTTGGTATAACTATTGTAATGGAACGTTATGTAATATACGGTGAATTTAATCT 
TAGTTTATAGACCATTATAAATGGTGATATATAAAAATGAAATTTAAAGATAAAAGACAGCCCACACCTATCACCATTTAAAATATCAAAACGAT 
GTAAGAATGTTATCGATAGGCGAGAAGCAACAGAGTGGGAATATGCCGCACAGGGTGTTTCGTGTGAATGATTTCGCCTCTTGTTTCTTATCTTT 
AACCTTCATTTCTGGCTCCGTTTGGCTATGAATTAGGAGCAGTCCCGAATAATCAGTCGAGCATACGTAGTTAACGATGGCGCCGAATGCAGCAG 
TACCAGTGGAGCGATATGAGCCTGAGAACATCGGGATCAACGCCATTCTCCTGGGGCCGCCGGGTTCCGGCAAAGGAACGCAGGTGGGTCAAAAT 
TACATAATGATGTGTTTTTTGTGACACGGACGCCATGTAGTCTATAAATAAATACAACGTGTTCAAGTGCGCGGTTTTTGTTTGCAAATAATCGG 
TGAAGCTCGTTAATTCTCTTGGAAGAGAATTAAAACTGGTGAAGTTATATAAGACGATGTGTACATGGAAATGTATATGTATGTATCTACGTGCC 
AAGTATGTACATATGCGTCAAGCAGTGTCTTTCTCTTGACCCTGAATCAAATCAAAGCCTTGGCGAGATGTGGCGAATTAAGTGGAACACCTTTA 
GGCAATCGATTGGGTATTATGCAGACCATCTATTAAAAGGCTACCAGAACATTCCTGGCCTCAAGAAATGCATCTTATCTAGCCGATTGGTTACC 
TTGCTTACCTTCCGATATGGTGCTAAATTGATTAATTGATTAGCTCTAACTGAAACTTATCAATGTTTTTAATTATGCCCATTTCAGGCTCCCCT 
GCTGAAGGAGAAGTTCTGCGTGTGCCACCTGTCGACCGGCGACATGCTGCGGGCGGAGATTTCATCCGGTTCGAAGCTGGGAGCCGAGTTGAAGA 
AGGTCATGGATGCCGGCAAACTGGTCTCCGACGACCTGGTCGTGGACATGATCGACTCCAACCTGGACAAGCCCGAGTGCAAGAATGGATTCCTC 
CTGGACGGATTCCCCAGGACTGTCGTTCAAGCTGAGAAGGTAGCCACACGTGTGAAGCACTTTGGACTTTAGGTTATAATCCGTTCGAAACGCAC 
TTTGCAGCTCGACACATTGCTGGACAAGAGGAAAACCAACTTGGATGCTGTCATCGAGTTTGCCATCGACGACAGTCTGTTGGTGAGGCGCATCA 
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CTGGCCGCTTGATCCACCAGGCCAGCGGGCGTTCGTATCACGAAGAGTTTGCCCCTCCAAAGAAGCCAATGACGGACGATGTAAGTATCTCTATA 
GGACGCTATCCAAGAACTCTAAATGAAAGCAGACCTTGTGTACTTGCAGGTCACTGGAGAGCCCCTAATCCGTCGCAGCGATGACAACGCCGAGG 
CTCTGAAGAAGCGACTGGAAGCATATCACAAGCAGACAAAGCCGCTGGTCGACTACTACGGCCTGAGGGGTCTGCACTTCAAGGTGGACGCCGCC 
AAGAAGTCCAGCGACGTCTTCTCCACCATCGACTCTATATTCCAACGCAAGCGTCCTGCCCAGATCCAGTTATGATTTGTTCCTGCGCAATGATC 
AACTACTGATAACTGCCTCCTGACAATTTATTTCGAGACACACACAACGTTTAAGCCTAAAGTTATAGTTGCATCCGGAAATCCTGAACAAACGA 
AATTACCTTTGCAGGTCGACTACTTTCCTTGTACAGATTTCTAAAGTGTATGTGTTTGTTACTTTTTACATTTCACTATTTGTAACTACAATACA 
ATTTAATAAAAAAATCGAAGGAAACAAGTACATTTTACGAGACATGACTGCTTTAAACTGGGCTGCGTTCGAATGAGGGATGGACGACGCCAAGT 
TCAAACTTCCACACAATTGTTTTCCGGAAAGCTTCTGGAATATGATGCGGCTGACTCAGTTTCGAATCGAGGTTTTTAGCGTTTGATGGATAGCT 
ACTAATTGTTCCGAGAACTATTTATTATTGCAATCAACTTTAAAGCGACTATTAAAGGGTCATGTTGCACGGGTCTCAAGTGTAACAAATGAGAA 
ATTGCTAATGGAAATCGCTAATGAGATCGTTAAATCTGGTAAACAAATAACATATCGCCAAGTTAGCCCGTCCATCGTGGCTGCGAATCTCTCGC 
AGAGCGACTTTCGATCGCCGTTTAGTACTCAGACGTCACCTGCCTAATGTTAATGTCAATTTAAGTGTGTATAACTATCGCACATATATTATATA 
AATTTATAACGACGATTGACTAATCACTGGGTTGACTTTTCAGCAATAGCGATTTTCGCTTCAAAGTCTAAAGTGTGTGCAATCCCGTATGGATA 
CGAGCCGAGCTCGAACACAGGCTCACAGCGCGCGGTTTGTGTTGTGTATTACACTTGAAGTTCAATCCCAGGTAAAAAGTGGGCCCAGGTATAAA 
AGCCCACTTCATAGAAACATTTCTCAGTTAGTTCCAGCAGCTGAATTGCTTCACATCTATTCGCTCCGAGTCGCCCCACAAGTCGAGCCCCATTG 
TAAGACCACCGCACCGTTTATAGTTTTTGGAGCCGATCGGCTCGGCTGGTCGACTGGTGGCTGGTAGTAGTCCACACACTCATGGACGCAGCCAG 
ACGGGCAACCAGTTTCATTGTGTATAAACAGTGACATCTTTTCAGACAGCGAATGTGTTTCAGATACGCATACAAAGTGGCCTGCCTGGCTGTGC 
ATTAATTTGCATACAAATCCCATAAGTCCACAGATACATTTTGCGAATGCAGTCGCGCTCGATGGCTAGTAAAAATA 
{SEQ ID NO: 298) 

Exon: 1001.. 1223 
Exon: 1724.. 1951 
Exon: 2003,, 2170 
Exon: 2235. .2687 
Start ATG: 1122 

Transcript No. : CT10496 

CAGGGTGTTTCGTGTGAATGATTTCGCCTCTTGTTTCTTATCTTTAACCTTCATTTCTGGCTCCGTTTGGCTATGAATTAGGAGCAGTCCCGAAT 
AATCAGTCGAGCATACGTAGTTAACGATGGCGCCGAATGCAGCAGTACCAGTGGAGCGATATGAGCCTGAGAACATCGGGATCAACGCCATTCTC 
CTGGGGCCGCCGGGTTCCGGCAAAGGAACGCAGTTCTGCGTGTGCCACCTGTCGACCGGCGACATGCTGCGGGCGGAGATTTCATCCGGTTCGAA 
GCTGGGAGCCGAGTTGAAGAAGGTCATGGATGCCGGCAAACTGGTCTCCGACGACCTGGTCGTGGACATGATCGACTCCAACCTGGACAAGCCCG 
AGTGCAAGAATGGATTCCTCCTGGACGGATTCCCCAGGACTGTCGTTCAAGCTGAGAAGGTAGCCACACGTCTCGACACATTGCTGGACAAGAGG 
AAAACCAACTTGGATGCTGTCATCGAGTTTGCCATCGACGACAGTCTGTTGGTGAGGCGCATCACTGGCCGCTTGATCCACCAGGCCAGCGGGCG 
TTCGTATCACGAAGAGTTTGCCCCTCCAAAGAAGCCAATGACGGACGATGTCACTGGAGAGCCCCTAATCCGTCGCAGCGATGACAACGCCGAGG 
CTCTGAAGAAGCGACTGGAAGCATATCACAAGCAGACAAAGCCGCTGGTCGACTACTACGGCCTGAGGGGTCTGCACTTCAAGGTGGACGCCGCC 
AAGAAGTCCAGCGACGTCTTCTCCACCATCGACTCTATATTCCAACGCAAGCGTCCTGCCCAGATCCAGTTATGATTTGTTCCTGCGCAATGATC 
AACTACTGATAACTGCCTCCTGACAATTTATTTCGAGACACACACAACGTTTAAGCCTAAAGTTATAGTTGCATCCGGAAATCCTGAACAAACGA 
AATTACCTTTGCAGGTCGACTACTTTCCTTGTACAGATTTCTAAAGTGTATGTGTTTGTTACTTTTTACATTTCACTATTTGTAACTACAATACA 
AT T T AAT AAAAAAAT C G AAGG AAAC AA 
(SEQ ID NO: 299) 

Start ATG: 122 

MAPNAAVPVERYEPENIGINAILLGPPGSGKGTQFCVCHLSTGDMLRAEISSGSKLGAELKKVMDAGKLVSDDLVVDMIDSNLDKPECKNGFLLD 
GFPRTVVQAEKVATRLDTLLDKRKTWLDAVIEFAIDDSLLVRRITGRLIHQASGRSYHEEFAPPKKPMTDDVTGEPLIRRSDDNAEALKKRLEAY 
HKQTKPLVDYYGLRGLH FK VD AAKKS SDVFSTIDSI FQ RKRP AQ I QL * 
(SEQ ID NO: 300) 

Name: Mitochondrial adenylate kinase isoenzyme 2, 
Classification: enzyme 
Gene Symbol: anon-Dak2 
FlyBase ID: FBgn0024906 



Celera Sequence No. : 142000013384074 

TGCAAGTTCAAAGTCAGAAAATGACAAATAACCGAAAGGCACACAAAAAACAATCCTAAACATGTTTGCTTGCATGCCGAGACACGCACATACAT 
CAATATTCATGTATAAACAAGCTTACAGTGTAGGAAATATAAAAAATTACAAAACTAGAACTGTGATTACCCACACACATGCACAACAAGATCTG 
CCTACTACCTTAAATTTACTCGCTGCGATATGTGTAGCACATGAGCACACAACATATATGTTTTATAGCACTTTTAGGCGCTTTTTATTCGTTTA 
TGCGTAAATAGAAAACGTGAACGACGTGACCAAGTGTGACCGTTGCTTATTTTCGTCGCAAATCTCGCGTGTGGTCAATTTGAACCGTTGTCAGA 
AACTATAGGGTCTCGAAACATGTGCAAATAATTCATTTATATCATTCAACTTATGCTGCAAATAATTGCTATTATTATTATTTTAATTAATTATC 
CACACTGCTTATGTTCTTCGCTCAAACAGGAGGTAATTAAATCATTGAAGGGATTTCAAAAATCTATGTATGCATTCACTACTAACTAGCTTGGT 
CTTTTTTTTATTTTGCGCCGCTTTATAAATTTAAAAAAATATCATTTGGAATTACACACATCACTGTGTTAAATTAAAAAGAAGATTTTTAGTGA 
TTGTTTTCTTTGAGATATTGAAGTCGCTATCTGCTCCTATATAAATATTATTTCGAAATTATCGATTTATAGTGCGCTATACTGACAAAAAATAG 
GCTGAAATTCGCATGAACAAGCATATTGTGGTAAGACAAAATGAAATCAGAACTTCCCAAGAACTACGTTGGAAAATTACAAATGCCCAAAAGTA 
TTGCTCCACAAAATATCGTAACGAAATATCGCGAAATTTGCAACACTTTTGTGCACATTCGGTGGCAACGCCGCCTAAGAAAATACCTATTGCCC 
CAATCGATACCTATCGATGAGCGTGGGCAGCCCTGTTTGTGCATGTGCTGCCATTCGATTTGATAACCAGCACTCCGTGCACACGTTTTTCGGCT 
TTCTAGAAATAATTGGGCTGCGATTCAAACTGTGAAGTGATTTTTTCTATATTTTTTTTTTGTGTGCACAAAGGCTCTTGTAATTCGAGGATTTG 
GCCATCGCGAGTGCGGGTGATTAAGCAGGCAGCTCCGATTGCCAGAGTTTTGCGGATGCATGGCAACACTGCCGCACGCCGGCCATGTTATTGTG 
ATTTGTAAAAGGTGGAACGTAGACACGTGTTTAATTCGTGCAGCACTTTAGCGTTGAGAAAACGACAGTAAATCGCCAGCAAATAAGCAAAAACA 
ATACACAACTCGCGTGTGCAGTAGATAGCTTTGAAAGCACCCAAAGATAGTGAGCAAAATAAGTTACCCAAGCGGTTCACAAAGGTAAGAAATCG 
CAGCACGCGGAACATAACCTAGTACAAAAAAAGTGCGTGTGTGTGTTTGCCAAAACAGAGAGATGTGACGGAAAATGGTGTCATGTTCTGTTCTG 
TGTGGATTTGTTCCAATGTCCGTGTCTTGAGCTCTATTCGCACCTCTTTCCCGGTACCATAGCCAACGTTTTAACTACCTTTTTTTTGTCAACTT 
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TTTGTTAAGTGTATGACTCTGCCTTGCCGCTGTTGTTGTTGTTCCGGCAAAGTGACAGTAAGTGGTTATGGAAAAAATTTGGAGTTCAATTCGGC 
GATTACACAATGACAGCACTCACACACACACACACCCACAATCGCACGCATACACATGCCGAGCATAAGTGCGTCTGTGTGTGTTACAGCTTTCA 
CCGTTAATGTTCGGTTTTTTTACGCCTGAAAGAGAACAAAGGCGAAAAGGAGCTGCCACTCCCACAAAGAAAACATTGCGTCTATGCATACATAC 
ATATGTATACACATATAAAGTAGTATATCCCCCGATTATACAATCCCCGCCCATTTTTGGCCACCATATGATCGGAAACCACGTTGCCAAGATTA 
TAACAATGTGGACTTTTTCGGCTAGCTGGCAATGTTGTTGCACCTGATTTGTTGTTGTTTTGGTGCTCTCTTCTGGCTTGCCTCTCTGTGTCGCT 
CATTCTCGCTTGTTCGGGGGAATTTCAAATTTTTAGGCTGTTGCTTCACCTCCCCGTTCGACGGCTTCCTCTTCTTTTGGCCGCACTGGTATTCG 
AGTTTGGAGCAACAACAATGTGCAAGTTCAAAGTCAGAAAATAACAAATAACCGAAAGGCACACAAAAAACATGCCTAAACATGTTTGTTTGCAT 
GCCGAGACACGCAGACACATCAATATTCATGTATAAACAAGCTTACAGTGTAGGAAATACAAAAAAAAAATAACAAAAACTAGAAGACCATTGCG 
AAGTTGTTGTGCCCTTATCTTACATTTTGTCGTTTTAATTCAAGAGTTGATACGAATACGTGTAAAACTACACTTCTCCCTGCTCTTATGCATAT 
CCATTTATTAAGAATTTTAGGCAAAATGTGGCAAGCTTTGTCAACAAGTTTTAATACTCTAAGCAAAGGGTATAAAACGTAATCTGAAAAATTCC 
CGTCACCCGGTTTCGACGATTAGGAAATGCCACAGCTGTCTGGCCAGTTGCCCTTCCATGGCCATTTTATAGCACCCCACTAGGCCCCTTATCCC 
CCACCCCCTGCATGCTCCCATCAACCGTTGATCTGTCGCTATATGTACATTTATACATATACATATGCATATACATATACATATGTACATACATA 
CAATGTATGTTTGTGTTTCAGTTGCTTGTCGATTGAGCTTATCATCTGACCACTGTGTAACAGGAGGACCTTTGGGCCCTAACAAAGTGACGATA 
TTGTGGATGGGGGGAAGCGGGAGTATAGTGGGTAAATCGGTTCTATAGTGAGTAACTTAAAAGTACTAAAGATCTCATGGCTTTAAACTACTGTT 
CGGCATGACTTGCACTTTTCGGGCCCCGGAAATTTTGACCAAATTCGAAGGAGCGTCTTACAATCCATTTGAATGGATTTTGATTAACGGTTTTC 
GGATTTCTTTGTTCTAATGAGCTCGCCACTCTCTTTTCTTCTCTCCGGCTTTCTGTCTCGTTTCCGCTTGGTGGGATGAGTGTGTGTGTGGGTGG 
AAAGGGGCGCTAAATAAAAACTTGCCACCACGCATCCTCTCTTTTTTCGCGGGTTGGTGGGGGTATGCCGATGCGGAAGAGCGGATGGAGTGAAG 
AGCGGAGAATCGGTGCCGAAATAAAAAAAATAATACAAATAAAAAGCCAGAGGAATTGGCAATTAGAAGGCGTGCAAAGTAGTTTCCCTTCATAT 
ATCGAGCAGTCGATATCTCGAATCTCTCGAACAATTTCGCCGGCATATCGAAACCCAACAAAAAAAGAAAGAAAATAAACAAAAGCATAGAAGAA 
ACTCCAAAAAATAAGCCGAAGCGAATTGATGCCATAAAACGACATAAAATACAAACGAAACCGGAAGAAGCAACGAATTCAGTTTTATCCTAGTG 
GT AAT AC AAAAAC AC AC ATT T AT T T T GT G AAT AGGT G AT G AAT T TC T GGCAAAGC TGTTGTTTT C GT AAC T T T T AGT T T T GT TT T GT AATG ATT T 
CGTAACATTTGCCATTGCCATACGTAAACGTAACGGTTTCTTCTTGCGCCTCTCTTCCGTTCTTCGCCGACGCTCTTTTGCGTTCTTGCGCACGC 
GCTCTTTGCATGCGATCTATCGCTCCATCTCTCTCGCTCTCTCCCGTTGCGTTTTTCCCTTCCGCGTGGCATTCAAAAATCGCTTTGCACCTTTT 
TCGTCGTTCTCTCTCGTTGTCTCTCTCTCTCTCGGCATTTGTGTTGTTGTCTATATTTACGTATTTATTTGTTGCTATTAATACGCAAAAGTTCA 
GAAAGGAAAAAACCAAAATCAAACACACTGGCAAACACACACACACACACGCACACTTATGCAGACGAGCACATGCAGGCGGGTCTCGTGACTTG 
CAAAACCAATTCGAAACCTTTAAATAGGTTTTTTCAAATATAACAACATGCATATGTCGTTATTTAAAAAACAACGTTCATGTGGATGGTGTTGA 
CAAATTAGCCCTTTAATCTCACACTTTTCTTCAAAATTGATTTCATATTTGAAATTCTCGCTTAAATTTAAACGCAAATTCAAAGCTCTTTACTT 
TGTTTTCAAAGTTATTCTACACAAAATTGCATATTGTGTACCGTTTCTTATCTTGAGCCATGACAAGCTTTTTAAAGTAATTATATACGACGCTA 
CTTGATTTGTAAATTGTTGCCCTGATAAGAGGTGTTCACACATTATTTCCAATTGTAAATTAAACAACAAACCATAACTTTTCCAAATATTGCAC 
AAACAAAAGCTACTTCATTTTTATTTCAGTATTAGTGTGATCAAAATAATCCTATTTGTGTGACAGTACACAAGTATATTTTCAAATCGCACAGA 
CTAATATAACTTCTTCTCCCTTTCAGATCTTTCAATATTCAGCGAAAATGGTATCCGGTCGTCCTCTACTCTACCTCTCACTGCCCGGTGTAGCA 
TTTATACTCGGCGTCTTTTGGTTTCGGCGTAGATATAAAAATTGTTTAGACAAGCCCGACGACGAGGACTCATCGGCCATCAATGACTCGTCGAT 
TGAACCAACTGTGCAGGCGCGCAAGGCCAACGGAGTCCTGCAGAATGGCAAGCTGCCACAGCAGTCGGCCAGCAAGTCGATGAACATCAACGGAA 
CTTTAGTTAACGGTAGCGGAAGCGGTAGCGGAAGTAGCAGTGATGAGAAGGACAGCCCCACTACCATGTTGTATGGTAAATCAGCACCAATCAAA 
AT C CAGAGC AAT GG ACGC AC TT C AAAT GGC AAGC AT C AGCAGC AG AT C G AT TC C G AAAT AC T G AAGT C AAAG AT AC AG GAT GC CGAAC AC AAG AC 
ACTCTGCTCCATAGACGAGGATTTCGAGAATCTATCGTCGCCACGTGATTTACCCGATTCGGTAAATACACGCGTATCCTTCTACAATCGTAAGG 
CTACCCAAAAGACGGTGGAACCGGTGGTGATTAAGGCCACGCGCACACCCAAAATATCGCCAGAGAACTCCTTCCTGGATACCAACTACACCAAC 
AAGGAGTGCGAGCAGAACAACAACTGTGAGCCCAAGGAGGAGCCCTCCAAGAAGGAGGCCGATCAGGAGGAGCTCGAGCAGGATCAGGAGCAGAC 
GGTGCTGAAGGAGGAAGTGGTGGATAGCAATGGTAATCAGAAGCGCAATGTGGATGCGGCCAGTCCATCGCTGAGCATTTGCAGTGTGCAATCGG 
GCGATTCTGGCAAGGGATCCAGCTTGCCGCGCTCGGAGGCGACACGCGTGAAAACCACCTACGAGTTCCTCTTCCCGATCAGCTTGATTGGTCAC 
CTGTACGGCCGCAAGCGGGCGTTTATTAACCAAATCAAGGCCAAGACCCTGGCCAGCGTGTCTGTAGGCAAGAATCCCTACTCGGGCAAGGTGCG 
CATCTGTACCATTGAGGGAACGGAGAGCGAGATTGATGCCGCCCTGGCCATGATCCGTCAACGTTTGCCGGCCAAGCGCTATCCGAACTTCACCA 
TGCAACGCATCCACTTTGCGCTTCCCCAGACCATAGTTCCTCTTTCGACCGAAAGCCTCTACAATCTACAAGTGAGTGGTGATGGTGTGACGATG 
ACAATGGTAACGATGCCGCCGACACGACATGGCTATAGCTGATTAGCTGGCTAATTAGCTGACTTGATTTGATGATTTTATGGTAGTATTCCTCC 
CGTTGAATACGCAGAGACTATTTCAAATTCTCTAAGTGGTTAACACTGCTGTTCCTGTCGTGTCGGCGCAATCGGAAAGGCTTCGATTTCGAGTC 
CCCATCC ATCCATTC AT CCATATAT CTCT AAT GCCTTGGCT AACTCCTCCTTCGCAGCTGAAACTGATTGAGGGCATCAAC AAC GAT GTGGTAGT 
GAGTGCGGTGCTATCTGGGTCGCATATTTTCATCCAGCACCCGCTGCATCCCTCGCATCCATCGTTGCCTTTGCTGCAGAAGCAGCTGTACGATA 
GCTACTCCACCATGGAGGCGCCATTGCTGCCCAGTTTGGAGCTTAGTGCCGTGTGCGTGATACCCATCAATGATGTGTGGTATCGCGTCCAGATT 
GTCGACACAGATCCCGAGGACGAGGAGCGTTGCGTAATCAAGTTTCTTGACTTTGGTGGCTACATGAATGTCGGCTTCAACACACTGCGACAGAT 
TCGCACCGATTTCATGAATGTGCCCTTCCAGTCCACCGAATGCATCCTGTCCAACATTGAGCCCATCGGTGAGTGTGTGTGTGTGTGTGCGCGCA 
TTATGCCCTTATACTTATTCCTTGATTCTCACTTTTTCCACGCAGGTGGCACCTGGTCCATTGAGGCGGCCGAAATCCTCAACAAACTTACCAAA 
GGCATTGTCTTGCAAGCACAAGTCGCCGGCTATAATAGTCACAACCTACCAGAAATCTTTTTATTCGCATCCCTAGGTCCCAATGTGAGTATAAA 
T T T C C GAT T AGAG AG AG AC GG AGT G AT T ATG AAG AGC G AG AGAG C AG AC T AC AGT GC GTG AC AT AAT AGC AA AAC GGTT AC G AG C AG AAAC T TGC 
ATCTTACTTTAAACAAGCTAACTAAGCCATTTGATTTGTTATGCAACTAAACTTAAACTTGTACACACATATGTTACCTTTTTAAAAACAAGCGG 
ATGCGCTATCGTCTGTTTTTAAAAAGGGGACAAACCCTTGTTGGAACAGAAAGTTTCTAACTCTTGTTTCTAATTGCAGAATGTAATCTTTATCA 
ATAAGGAGCTGGTCGGCAGAAAGCTCGCCAAATGGGTGGAGATGAGTGACTAACAGCCTTAGTGAACAACTCAACTAGCAACCAAGAACTATATG 
AAAACTT AAAAGC AAACTT AAACAAC AAT CACTC AAACC C AAATC AAGC AACT TGGCGGT GC AAC AAAAAAGGAATGCT TGAGT ATT GCGCAACA 
GGCTCCAAGATGCGGATTCGGATTAAGGATTTCGTTAGCAGCTTTTGCAGTTAGCATGTAGCATGTAGCACACAGCAAATAGCAATTGCCCGATT 
CCAAAAGAACTGGCAAGTGCAGAGGCCTCAAAACAAAGAAAATGAAACGTTTGTTACGTTGTTTAGATTGCGACTGAGCCCAGAGAGAGCCCCTT 
AATCATTACATAACAAACAACACACACTCGCCAAATACACACACACACACACACACACACACACGCATATATGAGTAAGAGCACCTAACACCACA 
CTAACCCATACACATTACACAACAAGACAAACGTTTGTTTGTTGACTGTCCAGAATTGGATACCAAAATGGAGGGGGAAGTTAGGGATCAGCTGG 
GTTATGCGTTTTGTGACCACATGGCAACACATGGGCGGGCGGGCATGCAACGGATCGACTGGAGCTGCAGATATTACTATACTCAGTTTATAAGT 
AGGCATTACTAAATAATGTACATAGATTTTATAACGAGAACGAGATGCGAAAAAAAAACAAAGAACATACTAAATAGACCCAACTAAAATACCAT 
GCGTTCATCAGATATACAGAAAAACAGAAACACAGAAACATAGAAACTCTGGATAAGAATGGAGAAAAACCTGACAAAACTTTAGTTTTATGCTG 
CTATTTGCTGAGATCTGTAAATATTATATTAGGCCTTTTTAAACAACTGATAATATCTAATCGACGAGTTCATACATTTACATGCATTATACATT 
TTTTTTTTTTTGTTAATTTGTATTCAAACTGTAATCAGAAAATGTATTCAAAATTCCCAACCCAAAAATAACTCACACCCGCACTAAAGCCACAG 
AATGCCTAACAAAGAAAGAAACCATTTAAAAGCAGAGACAGAGTGACGTTACGAATTCTTCATAATTATATATGCGTGTGTGCCATTGTTAGACA 
ACAAACCAACATTAAACATAACCAACATAACAAGCAAAACAACAAACACAATAATTAACAAAAAAAAAAGAAAAAAAAACAGAAATGGCTACTTT 
TT CCCCAAAAC AAAC CC ATT GT AAAT AAAAT CCATTTC G AC AGACAAAGAAAT GC AAAAAC AAAT GAG AAAC AATG ATT GTT GCT AC CAAAACT A 
GCCTATTGAAAAGAAGGAGTGAAAACCTCAAAATGCCTTTCTTTCAAACTTCTTTTGTATATGGAAGAGGAAACTTCTTGAAGCGACAAGAAGAA 
GCAAGCAAGCGAATTGTTGCAAACTTATCGCTTAAGCTACCAATTCTGGGTGTCTAGCCTAGCAGCAATATTAGCTAGAATTGTTCTAAGTGTAG 
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CACAGCCACAACAGCATCAACAACACATCGATAACACAATATAACACACAAACACACACTAATAAATCAACAAAGAAATTAATCATATTTATTTT 
GTTGAGATTGAAAATAGAGTTTTATTTCACTGTATCATTCATTGTATTTTACATTGACATTTTACATTGATCGGCAGCAACGGAATTACTATATA 
AACATTTATAAATCGTTTCAAAAAAAGATTTCAGGCTCAATCACAATTCAGCTTAATAAGTCGTCGTCATCGTTCTCATGTCGTCATCGTCATAA 
TTAAATTACTTAGATAATATAATGTACAAGTGATGATCCACACAATATCCATCTATTTGGTCAACATGAAAGACTTTGGAGGTGTTTTTTCTTCT 
TCAACGGTAGTCTACAAATATTCTAGTTAACGCTATCATTATACCGTTATTATTATTATTATTATAACTATTATTATTATTACCTCTTTTTTTGC 
TGTTTGTTTTTAGATCCTTCTTGATAATTAAAGCAAATTCATTCTAATTGTTATCCTAAACTAAAACAACTTTTGGAGGGGCTGCCCTCCACACA 
CTGTAGTGCAAATTCGTTTAATGGTTCCTCGTTTTAATAATGTCGTTTAATGCCTTTGGAAATTCGAACTACATTTTAGTCTATGGAACACTAAA 
TACATATTATTGGTTACTTTTAAATGAAGCATTCAACTAGATGAAGGGAGTAGGATTTCAATTTTTGCATGCACCAGAGTTGTCAACGTTCTCGG 
TGTTCTATCGAATTCATTGTAGATTTGTCACTTGTACAATTGTTGATAAACTGATTAACTGATAACTGATTAACTGACATAGAGATAATGCAAAC 
ACGACAGACATGATGTAGAGCACATATTGAAACTAATCTTAGTCGATAGCGAGAAGAGAAGAAGAGACGTGAAGTGAAGAGTCCTCCCTCAAAGA 
TCATCGTAAGTGTGTGGTACTTCTATTTTTTTTTCTTATCTAGTAATGCTGAGAGACGAGAGTTACGAATAAGATTTAATTTATAGACGCATTCA 
AGTGAGAATAATTTACAAATCTAGTTATTAACTAAATATAATGCAAGTACATTAAAATAAAATATTATTGAAAAGTAAAAAAAACGGTGTTTCAT 
CTGGATCGTTGTGGGGGGGGGGGGGTGGTTCGTTATCCCAGGAAGTGATATCGCCACTTTTATGGATCCGCCCGGCTAAACGTGCCAACTAATTT 
TATAATTCATGTACTTACCCGTGTCAGGGTAAGGTTCTTATCCGAAAAGCTGAATTGACGTGATGGCCGCGATAAGGAAGTTGCTAGCTTCATAT 
TGTTTGCGATCCCCCCAATAATGTCCAATATCCGATTCCCATGATTCCCTCCATGCAATCGATCAAAGAAAGCCTTGATTGCCGCACTAATGTAC 
TATTCGATGATCGTCTGGTTGGCTCGTTGGCTGCACGACGATCTCTTGGATATAAAACGATTGCGATATCGAACTTGCAGCTGGAGGTGGAGAAC 
TCTGCGTCAGAGTTGGTGGCCAGAGATTTGCTTGTGAAACTGTGAAACTGGTCAGAAGCGGTCTTTGGATCGTAAAACTTTCATTGACCATTGAG 
TGGAATTCTTCCGAATCTATATGTACCAGCCAGCCTAATTACCCCAGCGATTATCGACTCCTAATTGGTTTTGTCTACGGTTCTGTTTTCGTTTC 
CTTTTCAATTGATTGGTCACTTCGTGGTGGTTTCACCTAATCAGTGTGGAAAATGTCCAGTCACGGCCGGTTCGGATAAGTTCATAAAGTGCTTT 
TACTGTGTCTGGGTCTGGCGGGTCCGTCTATCTGAAGTTATTTACTACTGCTGACCATTTTGGCCTGCCGGTTTTTATCGAAGTACCAAGTGTTT 
CACTCACCGCGTCGTACAGTGGACCCCGCCTATGTTGGCCGCAATGATCGATTTGGGTTGATAAGATCCTGGCATCCCCATATATAGTTATATCT 
ATGTTCCCCTATCGCCGTATCTGCTCTCTATGTTCTATCAACGATCTTCGTGAGTTAATCAGCAAAAAAGAGCTCACGTTCGGGATTTCCTTACC 
TACACGCAGAGAAAAAGATCACTTCATAAATTTATCCAGCGGCAGCTCATCAAGTAACGTGGAATTCCCAGCTTATCGCATTGGGTAATTTGAAT 
GGAGCGCTGGTAGATAATTCACAAATGGAAAACATTTACCCAATGACTTATTTACTCGTTTTATAACATCTTGCAGCATTTTCTCTTCGAGTGCA 
GACTATATTTTTGGGTAAACGCCGTGCTGATGTCTTCAGCTTCTCGGCGGATACTTCCTTATCCGAGCGCTGGAATCTCCGATATCATCAACGAT 
CGCCCAACGAAGAGCAACCGAAGTTTGTTTTCCCCATTCCTAATCGAGCACTGGGGGGCTGGGGATTGGGGATTGGGTACTGCGTACTGGGTACT 
GGGTACTGGGTACTAGGTACTGGGCACTTAGGTACCAGCCAGTCAGGTACTACCGCTGATTTTCTCTACAAAGCACGGCTATACGGCTATATGTG 
TGTGAATAATTTCACGCTTATTACCTGCTGAGCGCTGGCAAAACGCTGCTTGACTTTGGCGGTTCTTAAAGCGGCCGGCACTCGCAATATTCTCA 
GTTCCAGCCGCGGCATTCACGGCACCACAAGCGCGCCTCCAGCACCGCCAAGTAGCCAAAATCCACAGCCCAAATTACGAAAATATTACAGAAAA 
CTAAACCCTGAAACTGCAGTTGGACGCTAACCCAACCAACCGTAACCAAGGGCTGGCCAAATGCAATGTGCTAATTACTATTAACAAAAAAAAAA 
AAAATAAATAATTAGTAATTAAATAAGCAAAACGCGAGTAGCCGTCAACTGTTTTCATTAACCGGGAAAATTAACAAGTGGCTAGAAAAATGGAA 
TTGAAAGGAGTTCAGCCATCGAATGGCAGCTCAAATGGAAGTGGAAATGGAGCCACCAATGCTGCATCAACGGAAGTAAGTGTTAGACCATGAAA 
TCAGGCAATTAATAGTATCCCTAATGGAATTGCTTTCGAGTGGCACTTGAAGAACTCAAGTGGCCAGCGATTCAGATTAGAAGAAGGCAGTGTTC 
AACAGTTGGAACATTAAGTATACGCCGTGTGGTGCATGAAAGTTTCACAATGTACACTGTGAAGAAACATTTCCCATGCAATAAACTTTATAAAA 
CTTCAATGGACTTTTACCCCGAAATTATGCTTTGATTGGCCCTTCTTCACAACCGTCCTTGCAATTCGCTCAGTGTTTCTGGCTTTACAATCGCC 
AAATGACTTTATGATCATTAAGGCTGTTAGACCTTAATTGCAGGCCAATCGACCGACAAACGAATATGCGATTTACGGTCCACCCCAATGTGGGG 
TGAAAGACTTTGTCAGCCGGTTACTCCATGCATATCTTATACGCTTTACGATACCCAGCGACACATAAATGTATCTTAGAGCTATTTTCAGCCGA 
AGAATGAACCGAAAACGTTAGTTATCTGGCCATAACTTAGCGTACGGCTCGAGATCAGCGAAAGCTATACTATAATAGGCT 
(SEQ ID NO: 301) 

Exon: 1001.. 1414 

Exon: 4 492.. 5 67 6 

Exon: 5948.. 6338 

Exon: 6411. .6608 

Exon: 10622.. 10956 

Start ATG: 4513 

Transcript No. : CT10921 

CCATTCGATTTGATAACCAGCACTCCGTGCACACGTTTTTCGGCTTTCTAGAAATAATTGGGCTGCGATTCAAACTGTGAAGTGATTTTTTCTAT 
ATTTTTTTTTTGTGTGCACAAAGGCTCTTGTAATTCGAGGATTTGGCCATCGCGAGTGCGGGTGATTAAGCAGGCAGCTCCGATTGCCAGAGTTT 
TGCGGATGCATGGCAACACTGCCGCACGCCGGCCATGTTATTGTGATTTGTAAAAGGTGGAACGTAGACACGTGTTTAATTCGTGCAGCACTTTA 
GCGTTGAGAAAACGACAGTAAATCGCCAGCAAATAAGCAAAAACAATACACAACTCGCGTGTGCAGTAGATAGCTTTGAAAGCACCCAAAGATAG 
TGAGCAAAATAAGTTACCCAAGCGGTTCACAAAGATCTTTCAATATTCAGCGAAAATGGTATCCGGTCGTCCTCTACTCTACCTCTCACTGCCCG 
GTGTAGCATTTATACTCGGCGTCTTTTGGTTTCGGCGTAGATATAAAAATTGTTTAGACAAGCCCGACGACGAGGACTCATCGGCCATCAATGAC 
TCGTCGATTGAACCAACTGTGCAGGCGCGCAAGGCCAACGGAGTCCTGCAGAATGGCAAGCTGCCACAGCAGTCGGCCAGCAAGTCGATGAACAT 
CAACGGAACTTTAGTTAACGGTAGCGGAAGCGGTAGCGGAAGTAGCAGTGATGAGAAGGACAGCCCCACTACCATGTTGTATGGTAAATCAGCAC 
CAATCAAAATCCAGAGCAATGGACGCACTTCAAATGGCAAGCATCAGCAGCAGATCGATTCCGAAATACTGAAGTCAAAGATACAGGATGCCGAA 
CACAAGACACTCTGCTCCATAGACGAGGATTTCGAGAATCTATCGTCGCCACGTGATTTACCCGATTCGGTAAATACACGCGTATCCTTCTACAA 
TCGTAAGGCTACCCAAAAGACGGTGGAACCGGTGGTGATTAAGGCCACGCGCACACCCAAAATATCGCCAGAGAACTCCTTCCTGGATACCAACT 
ACACCAACAAGGAGTGCGAGCAGAACAACAACTGTGAGCCCAAGGAGGAGCCCTCCAAGAAGGAGGCCGATCAGGAGGAGCTCGAGCAGGATCAG 
GAGCAGACGGTGCTGAAGGAGGAAGTGGTGGATAGCAATGGTAATCAGAAGCGCAATGTGGATGCGGCCAGTCCATCGCTGAGCATTTGCAGTGT 
GCAATCGGGCGATTCTGGCAAGGGATCCAGCTTGCCGCGCTCGGAGGCGACACGCGTGAAAACCACCTACGAGTTCCTCTTCCCGATCAGCTTGA 
TTGGTCACCTGTACGGCCGCAAGCGGGCGTTTATTAACCAAATCAAGGCCAAGACCCTGGCCAGCGTGTCTGTAGGCAAGAATCCCTACTCGGGC 
AAGGTGCGCATCTGTACCATTGAGGGAACGGAGAGCGAGATTGATGCCGCCCTGGCCATGATCCGTCAACGTTTGCCGGCCAAGCGCTATCCGAA 
CTTCACCATGCAACGCATCCACTTTGCGCTTCCCCAGACCATAGTTCCTCTTTCGACCGAAAGCCTCTACAATCTACAACTGAAACTGATTGAGG 
GCATCAACAACGATGTGGTAGTGAGTGCGGTGCTATCTGGGTCGCATATTTTCATCCAGCACCCGCTGCATCCCTCGCATCCATCGTTGCCTTTG 
CTGCAGAAGCAGCTGTACGATAGCTACTCCACCATGGAGGCGCCATTGCTGCCCAGTTTGGAGCTTAGTGCCGTGTGCGTGATACCCATCAATGA 
TGTGTGGTATCGCGTCCAGATTGTCGACACAGATCCCGAGGACGAGGAGCGTTGCGTAATCAAGTTTCTTGACTTTGGTGGCTACATGAATGTCG 
GCTTCAACACACTGCGACAGATTCGCACCGATTTCATGAATGTGCCCTTCCAGTCCACCGAATGCATCCTGTCCAACATTGAGCCCATCGGTGGC 
ACCTGGTCCATTGAGGCGGCCGAAATCCTCAACAAACTTACCAAAGGCATTGTCTTGCAAGCACAAGTCGCCGGCTATAATAGTCACAACCTACC 
AGAAATCTTTTTATTCGCATCCCTAGGTCCCAATGTGAGTATAAATTTCCGATTAGAGAGAGACGGAGTGATTATGAAGAGCGAGAGAGCAGACT 
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ACACATTTTCTCTTCGAGTGCAGACTATATTTTTGGGTAAACGCCGTGCTGATGTCTTCAGCTTCTCGGCGGATACTTCCTTATCCGAGCGCTGG 
AATCTCCGATATCATCAACGATCGCCCAACGAAGAGCAACCGAAGTTTGTTTTCCCCATTCCTAATCGAGCACTGGGGGGCTGGGGATTGGGGAT 
TGGGTACTGCGTACTGGGTACTGGGTACTGGGTACTAGGTACTGGGCACTTAGGTACCAGCCAGTCAGGTACTACCGCTGATTTTCTCTACAAAG 
CACGGCTATACGGCTATATGTGTGTGAATAATTTCACGCTTATTACCTGCTGA 
(SEQ ID NO: 302) 

Start ATG: 436 

MVSGRPLLYLSLPGVAFILGVE^FRRRYKNCLDKPDDEDSSAIWDSSIEPTVQARKANGVLQNGKLPQQSASKSMNINGTLVNGSGSGSGSSSDE 
KDSPTTMLYGKSAPIKIQSNGRTSNGKHQQQIDSEILKSKIQDAEHKTLCSIDEDFENLSSPRDLPDSVKTRVSFYNRKATQKTVEPVVIKATRT 
PKISPEKSFLDTNYTNKECEQNNNCEPKEEPSKKEADQEELEQDQEQTVLKEEVVDSNGNQKRNVDAASPSLSICSVQSGDSGKGSSLPRSEATR 
VKTTYEFLFPISLIGHLYGRKRAFINQIKAKTLASVSVGKNPYSGKVRICTIEGTESEIDAALAMIRQRLPAKRYPNFTMQRIHFALPQTIVPLS 
TESLYNLQLKLIEGINKDVVVSAVLSGSHIFIQHPLHPSHPSLPLLQKQLYDSYSTMEAPLLPSLELSAVCVIPINDVWYRVQIVDTDPEDEERC 
VIKFLDFGGYMNVGFNTLRQIRTDFMNVPFQSTECILSNIEPIGGTWSIEAAEILNKLTKGIVLQAQVAGYNSHNLPEIFLFASLGPNVSINFRL 
ERDGVIMKSERADYTFSLRVQTIFLGKRRADVFSFSADTSLSERWNLRYHQRSPNEEQPKFVFPIPKRALGGWGLGIGYCVLGTGYWVLGTGHLG 

TSQSGTTADFLYKARLYGYMCVNNFTLITC* 
(SEQ ID NO: 303) 

Name: A-kinase anchor protein 
Classification: enzyme_activator 



Celera Sequence No. : 142000013384832 

CCAGTGGTCCGCTGGTGGCTCGGCTCTCATTTTTGCATTTTTCTTCGGAAAAGTCCGCGCGCTGGAAAAGTCAACACAAGTTCACCTCTCCAAAT 
TCGTGCGTGGGTGTGGGCGGAAGGTGGTGGTAGGTGGATGGTGGGCGGGAGTGGGTGGTTGGCGATTCGAGAGGGGGCGGTGCGTGGGTGGGTTT 
ATTTTTAGCGCTGCTGCGACGGAAACGCGACAAGTCGGAGATTCTGCTTCCGCAATGTTGTTGTTTTTGCAGCCGTTATTATTAACAAATATTAC 
GCTATCCACGGGCTCTTCTTCTTGCTTGTTTGCGTTGACAAATCTCGTTGCGCTTTCACAATCACATCAAATCGTGGGAAAACTGGTTTTCCCCC 
CGTTTCTGAGCGGTTCACTCTTCGGTTCGTGATCGTTTGTGTCCGCGGGCAATGGAAAGTCTTTGTTTTTCACACGCGTTTATCACAACAGTAAA 

T AAAAT AT C ACT T T AGCC AGT T T AAT C AAC TC GGAG AGC AG AAC TT T T AACC GT G AG AAAT AAC C GC AT ACT G AAGT GGC C AAAAT CT G GG AAAT 
CGCCATGGAGGAACTCGCGGAAAATACCAAATCCATGGCATGCAAGCTTTATTATCAGTGGCGTAGAAGAGGGGCCTTATCTGATTAATCCTCGC 
TCTATAATTTTGAACCGCTTGCCATTTTTAAGGGCTCAGGATAATGAGATTTGTTATTCTTCCAGATTAATGTCATTAAATTAAAATCTTATGTT 
TCCCCCTTATGTTTATAAGAGTAATCGCAAATATTCAATGAAGAAAAGAATATTTTATTTTATTGAACTGCCGAAATCAACAAATTAAACGAATG 
TAATTAATTTAAATTTTAAATTATAACACCCAAAAAGTGTGATTCCCCGAAAATATCAGTTATTATACAACGTTATTTCGGAATTTGTAGAGGTA 
TTTTCTGGCGCTCCCGCTGCGGTCACACTGAACACGAGTTTCGTTTCGGAGATTGAAAGAAATGGAGATTGACTTGGGATTTGCGGAAAAGAGCG 
ACAATGGCGCCTGGCTGAGCAACCGCTACTTTCCCAGCAAGCTGGGAGGCCAGCCCGCATGGCTGGAACTGGAGGCCTTGCCGCCCACGTCGCAG 
CTGCAGTGCAGCAAGTGCAGGGCACCCAAATCCTTCCTGGCTCAGCTATACGCTCCCTTCGAAGACGAGTACAACTTTCATCGGTCCATCTATGT 
GTTCCTGTGCCGGAATTCCGACTGCCAGGAGGCCCAAAATGCAAGGTAATCCCTTCTAGCTCTGAAAACCAACTTGCAACCTAATCTTACCCATC 
TTTCCAGCAATTTCACAGTCCTCAGGTCACAGTTGCCGCGAAAAAATAAGTTCTTTTCGGAGGAGGAGCCGAGCGACGTGGGTCAACCCCTGGTA 
GGAACAGCAGCCATATTTTCAGAACATACTCATCATAATATGTTTTCTTAGCCCGCCGTTCCCTGCCTAAAGAAACTATGCGCCGCCTGCGGTTG 
CCATGCTCCTCACGCCTGCAGCAAATGCAAGGCAATCCACTACTGCTCACCAGAGCATCAAAGGGCCCACTGGCCACAACACAAGCCAAACTGCG 
GAGCACCAGAAGTAGCCACTGAGAAGCCCTTAACACAAATCGTTTTCCCAGAATTTGAGATTGTAATGGACAGCAACCCCGTGGAGTCTGGCGAG 
GAGGACAAGGACGATGAGGCTCGCTTGGCGGAGTTCCAGGAGCTGGAGTCCAGCGGGAAGACGGGCGACTTGAGCAATGTTTCCGAGGCGGAGAT 
GGACAAGTACTTCGGAAACTCGGCGGCCGCCGATGACAAGACCTTTCGTCAGTTCAAAAAGCAAACAGCAGCCGAACCCGATCAGATCGTGCGCT 
ACAAACGTGGAGGCCAGCCCCTCTGGATCACGAATACCGTCAAAACGGTGGAGGACCAACTTAACAAGCTGCCCAATTGCATTGCTTGCGGAGGA 
GAGCGTCAGTTCGAATTTCAGATTATGCCGCAAGCGCTGACTCTTCTGGAGGACGAAAACCTCGACTGGGGCGTCCTGGCCGTCTACACTTGCGC 
CAAAAGCTGCCCCATCGACGGCTACGTGGAGGAGCTTCTCATCAAGCAGGACATTGTGGCGGAGGATCAGAGTTGACATTAAAATGCGTACATTT 
AAATTTTATTCTTGTCAATTACGAAACACGTGAACATTTACTTCAGCTTGATGATGGGGAAGTAGCTGTGTTTGGAGAAGTCGAACTCTGGCTGC 
CGCTTGACTTCCTTCTGGTGACCGCGAGCCAGCACATTCATCACCACAAAGTTAGTCCGTGCGATTTGCAGGAGGTCCAGCATCTGAAATTAAAC 
GAAACGCATTTTAATCCAGGAAAACCAATTCAGAACTACCATTGACTTACCTCCTGATCTGGGTTTTGAATGCTTTCGAGAACATTGCGGGCCTG 
GTCAAAGTGCTTGGCCGCATAGGTATATAGATCCTCGACGCTATGTCGCATCATGTGCTCCCGGACGTTTTTGAACTCTGCGTACGAAACTGGCG 
GAGGACTGGTGAGTGTGGCGAAGGGCAGGAAGCGTCGGTTGTAGCGGATCTCCTCGTTGTCGAACTTGGACAGCGGCAGTCGCACGCGTCCGTCT 
TTGGTCAGGGCGCCCATTGCTTTATACATTCCGCCGCAGAGGCTGAGTAGCGCGTGGTAGAAGACGATCTCGGCGCGGTACGGCCTTTGAGTCTT 
TTTGTTCTTTTTCGGCTTGGCCGCCTTGCGGTTTTTAGCCGCTCCGCTGCCGCCCGACTGGGTCTTACTTTGGTGCTCGGCGTACTCCTCCTGGG 
CCAGGAGGATGTTCTCCGTGCGCGTCAGCGCGGAGACCAGGAAGCCAATGAGGAACTCGTATGGATACCAGTAGATGTAGAGGAACTCATGCACA 
GCATATAGTTCCAGCTCAAAGCCGGACATCAGGAAGATCAGCATCGCACGGAAACAGTTGTACAGCACCCATGTGGAGAAGTGGGTGCTGTGCTT 
CAGAGCGGTGGCCATGGGCTCGTTGCCCTCCATGGCTCGTTCGTTGGCCAGTTGGTTCATCATAGAGTCCAATCGCGCTGCATCCACTTGGATGG 
TGTCGAAGTTCTCTATCAGACGCGCCAGCTTATCCCGCTGTCTGGCGCGATTGAAGCCGCAGATGCGAATGAACTGCGTGAAGGTGTTCATGTTA 
ATGCAGTAACGGAAGAAGTTCTCCAGGTGCTGCTGCACCTTGGGATCGGCAGCCACCGGATGCTTGGCATTCAATACGGGTGGCGAGTTGAAGAC 
CTGAACCGAGTGGCGCAGGAACTGCTTCATGGGAAGCTTTCCGTGCGCCATACGCATGTTGGCGCTGAACAGGGTT 
(SEQ ID NO: 304) 

Exon: 1001.. 1280 
Exon: 1338.. 1422 
Exon: 1477.. 2401 
Start ATG: 1012 

Transcript No. : CT10985 

GATTGAAAGAAATGGAGATTGACTTGGGATTTGCGGAAAAGAGCGACAATGGCGCCTGGCTGAGCAACCGCTACTTTCCCAGCAAGCTGGGAGGC 
CAGCCCGCATGGCTGGAACTGGAGGCCTTGCCGCCCACGTCGCAGCTGCAGTGCAGCAAGTGCAGGGCACCCAAATCCTTCCTGGCTCAGCTATA 
CGCTCCCTTCGAAGACGAGTACAACTTTCATCGGTCCATCTATGTGTTCCTGTGCCGGAATTCCGACTGCCAGGAGGCCCAAAATGCAAGCAATT 
TCACAGTCCTCAGGTCACAGTTGCCGCGAAAAAATAAGTTCTTTTCGGAGGAGGAGCCGAGCGACGTGGGTCAACCCCTGCCCGCCGTTCCCTGC 
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CTAAAGAAACTATGCGCCGCCTGCGGTTGCCATGCTCCTCACGCCTGCAGCAAATGCAAGGCAATCCACTACTGCTCACCAGAGCATCAAAGGGC 

CCACTGGCCACAACACAAGCCAAACTGCGGAGCACCAGAAGTAGCCACTGAGAAGCGCTTAACACAAATCGTTTTCCCAGAATTTGAGATTGTAA 

TGGACAGCAACCCCGTGGAGTCTGGCGAGGAGGACAAGGACGATGAGGCTCGCTTGGCGGAGTTCCAGGAGCTGGAGTCCAGCGGGAAGACGGGC 

GACTTGAGCAATGTTTCCGAGGCGGAGATGGACAAGTACTTCGGAAACTCGGCGGCCGCCGATGACAAGACCTTTCGTCAGTTCAAAAAGCAAAC 

AGCAGCCGAACCCGATCAGATCGTGCGCTACAAACGTGGAGGCCAGCCCCTCTGGATCACGZ^ATACCGTCAAAACGGTGGAGGACCAACTTAACA 

AGCTGCCCAATTGCATTGCTTGCGGAGGAGAGCGTCAGTTCGAATTTCAGATTATGCCGCAAGCGCTGACTCTTCTGGAGGACGAAAACCTCGAC 

TGGGGCGTCCTGGCCGTCTACACTTGCGCCAAAAGCTGCCCCATCGACGGCTACGTGGAGGAGCTTCTCATCAAGCAGGACATTGTGGCGGAGGA 

TCAGAGTTGACATTAAAATGCGTACATTTAAATTTTATTCTTGTCAATTACGAAACACGTGAACATTTACTTCAGCTTGATGATGGGGAAGTAGC 

TGTGTTTGGAGAAGTCGAACTCTGGCTGCCGCTTGACTTCCTTCTGGTGACCGCGAGCCAGCACATTCATCACCACAAAGTTAGTCCGTGCGATT 

T GCAGGAGGTCC AGCAT CT GAAAT T AAACGAAACGCATT T T AATCC AGGAAAACC 

(SEQ ID NO: 305) 

Start ATG: 12 

MEIDLGFAEKSDNGAWLSNRYFPSKLGGQPAWLELEALPPTSQLQCSKCRAPKSFLAQLYAPFEDEYNFHRSIYVFLCRNSDCQEAQNASNFTVL 

RSQLPRKNKFFSEEEPSDVGQPLPAVPCLKKLCAACGCHAPHACSKCKAIHYCSPEHQRAHWPQHKPNCGAPEVATEKPLTQIVFPEFEIVMDSN 

PVESGEEDKDDEARLAEFQELESSGKTGDLSNVSEAEMDKYFGNSAAADDKTFRQFKKQTAAEPDQIVRYKRGGQPLWITNTVKTVEDQLNKLPN 

CIACGGERQFEFQIMPQALTLLEDENLDWGVLAVYTCAKSCPIDGYVEELLIKQDIVAEDQS* 

(SEQ ID NO: 306) 

Classification: hypothetical 



Celera Sequence No. : 142000013384832 

TGTATGTTTCCCCGTGGGTGAAGTCCTTGGTTCCCTCCAGGCGACCGCGGGGCGGTGCCTTCTCGTAGTCGAAGGCCTTCTCTGACTTGCCGAAC 
AGCTGGTTGGCCATCTCGATGTCCTCGCGGTAATTGCCCACGACCACCTTCTTGCCAGTGAGCGGGTCCGTTATGTCACCCAGCGGATAGACCAC 
TTTTTCCACGTTGTGCGTGATGAGGCGGGTGAGGCGCTCTGGCAGTTCCCGTATCAGGACCACGTCCCCGGTCTTGCACACCTTCTGCGGATCGT 
GGGCGAAGTAGAACTCGTCCTTCTTAAAGTACTGCGAGCAGAAGGGAAATTGTGTGAAATCTCCTGAGGCGAGATCTAATGTATATTATGCACAC 
CATATTCAGGTTCTTGTCCAGCTCCATTCGGCGAATGCGGATCTTGGAGGCGTTCTGCTTGATGCAGGGCATGCACTGGCCCATTAGTAGGAGTC 
CTCGCCTCGACATTACTTTAATTTAATTTAAGAGTTTAGAAAAATAAACAACCGGTTGGGAAACACAGCTGGTAGATCACTGGCGATAGTATCGA 
TTGGGCCTGGAACTATCAATATTTCGATATTTTTTACTCAGCAGTTTCTGATTTTCAAAATCCCGCCAAATGCAATCAGCTTGCACGCGTCAGTT 
AAAAAGTAGGGTAGTATGACTTTTCATCGCAAAAACAAGTGGATTCTAAGGACATAACATTTTCACCGTACTATAAACAGTGTATCAGTTTTACA 
TAACATATCTTTTCAATTAAATAATGCTTTAATGGATTAAAAATGTTTACGTCAAAGTAAATACCTTTAAATTCATTTTGTTTATGAATAATACA 
TATTTTTAATTCAAAATTGTAATAAGGAATTCGGCTCACGATGACATATCGCAAGATTTCCATTTATGTATCGATGTCTTCATGAAATACATTTC 
TGTGTAGTCACACTTATCGATAGACCGCGCACTCCACGTGTTTTTCGTCCATTTTCTCGCAGCGCGCAACACGTGCGGAGGTTTCTCTCCGTCTA 
TTAGTTGATTTAGCAGAAAACGTAGATAAAATGGCCGACGTGGGTAAGTCTGGCAAGTGATGAATTAAAAGAGTGCCCAAATAAACGGACTATGC 
GCCCGCAGAAGTACGCAAGGAGAAGAAAAAGAAGAAGATCT^AGGAGGAGCCCTTGGACGGCGATGACATTGGCACGCTGCAGAAGCAGGGCAACT 
TCCAAATCAAGCCCTCCTCCAAGATCGCCGAGCTGGACACCTCGCAATGGCCGCTGCTCCTGAAGAACTTCGACAAGTTGAACATCCGTTCCAAC 
CACTACACTCCGCTGGCGCACGGATCGTCGCCCCTGAACCGCGACATCAAAGAGTACATGAAGACTGGCTTCATCAACCTGGACAAGCCGTCGAA 
CCCCAGCTCCCACGAGGTGGTGGCCTGGATCAAGAAAATCCTCAAGGTGGAGAAGACAGGCCACTCCGGCACGCTGGATCCCAAAGTCACGGGTA 
AGTGAAGCGCGTGCTCGCCTACCGCCATACAAACCTAACCTCATTGTGTGCCGCGTGTTTTTTTTTCCTACAGGTTGCCTGATTGTTTGTATTGA 
CAGGGCCACCCGTCTGGTGAAGTCCCAGCAGAGCGCCGGCAAGGAGTACGTGGCCATCTTCAAGCTGCACGGAGCCGTGGAGTCGGTGGCCAAGG 
TGCGTCAGGGCCTGGAGAAGCTGCGTGGTGCCCTCTTCCAGAGACCTCCACTCATCTCCGCCGTAAAGCGTCAGTTGCGCGTTCGTACTGTCTAC 
GACAGCAAGCTGTTGGACTACGATGAAACCCGAAATATGGGTAAGTCAGAAGTTTTGTTTATCCATTCTCGCTCAGAGTTGGTGCGTCGCTGCTG 
GATTAGTTTATCCTGCAGAGGTGCAGCCTGCTTCAGCCAGTGAAACACCAGTATTTGCCTAACACAGCAAATGCACAATGTAATTGTTAAAACCT 
CGTCTTTTTCTCTCCTCCAATTCTGCCCACTCACGTGCTCAGGTGTTTTTTGGGTTAGTTGCGAGGCCGGCTCCTACATCCGTACCATGTGCGTC 
CATCTTGGTCTCGTACTGGGTGTCGGTGGCCAGATGCTGGAGCTCCGCCGTGTCCGCTCGGGCATTCAGTCTGAGCGCGATGGTATGGTGACCAT 
GCACGATGTTTTGGACGCCATGTGGCTGTACGAAAACCACAAGGACGAGTCTATGCTGCGACGTGTGATCAAGCCGCTAGAAGGTCTGCTGGTCA 
ACCACAAGCGCATTATTATGAAGGACAGTTCGGTAAGTTTGACATCCAGTTGCCGTTTGCGAAGTAGTACCCCATGATATTAAGCCGGCAGGTGC 
CTGCAATCCCCACGGGCACCTGTGACTATGATTGGGAACAGCAAAGCTGCTGATGTGTGTATGTTGGGCAGCGCTCTTATCCTGCGCGCTCGACA 
AAGAATATTCGGTCGACATTTCCTCTAGCAAAGGACCCCCTTATGATATCAACTAACAACCATCAATCTTTCTCCAGGTCAACGCCGTTTGCTAT 
GGTGCCAAGATTACATTGCCCGGTGTCCTGCGCTACGAGGATGGCATTGAAATCGATCAGGAGATCGTCATTTGCACCACCAAGGGTGAGGCCAT 
TTGCCTGGCCATCGCCCTCATGACCACAGCCACTATGGCCTCTTGTGACCATGGTGTGGTGGCCAAGATCAAGCGAGTGATCATGGAGCGAGACA 
CATACCCGCGCAAGTGGGGATTGGGACCAAAGGCGTCGGCCAAGAAGGCGCTCATTGCCGCCGGCAAGCTAGACAAGTTCGGAAGGCCCAATGAA 
AACACACCTAAGGAGTGGCTGACCGGTTATGTCGACTACAATGCCAAGAAGCCAGCAGCTCAAGAAGTCTCCCCCACAAATGGCTCCAGTGAACC 
CAGCAAAGTATGTTATTGAAAATGAGTTTAACATGGTATTCACTTGCTAAATAATTGTGTTCCTTTAGCGCAAGTTAAGCACCTCCAGCGTTGAG 
GAAACTGCTGCGGCGGCGGTATCCGAGGAGACTCCTTCCAAGGACAAAAAGAAGAAGAAAAAGAAGCACAAGGGCGACGAGGAAGCCCCAGAAGC 
CGCTGAGGAGGAGGCAGAGCCAGTAGAAAAGGAGAAGAAAAAGAAGAAGAAGAAGGACAAGGATAGGGACAGAGACGAAGCTCAGGAATAGGAAT 
CAGAAGTTTCTATAGGATTAGGTTTTAAAGAGAATTCTACGCTTTTTTGTAGAGGCACCTACCGATTGCTAGCTGTTAGGTTTAAGGAGTGTTTT 
TATGTGTAGAAGTGCATACAAATTACAAATGCATTTAAGTGACATTCGCAATGCAATAAATTGAACATTTTATTGTGGAACTCTATTGTAATGAT 
GAACTCCAATTAAGAACACAAGTCCGTTTTAAGTTGAAGGCAGCCGTCGGGCTGACATATATTTATTCAATTTCTTCTTTTGATTTTAGAGTAGG 
CAGAAATGGTTCTGTCAGCTCAACGTCTACTGAGTCTCTCTGATTTTACAGTTTTTTATATATTTGGACTTAGGCTAATCCGCGTAGCTGCGCTG 
CCGGGTACGCCAGCAGCGGAGCGGCGTTCTTATGTATATCTTATATATCTAGGCTTATCGGGAGAGCGAGATATGTATGTACGTATGTAATATGT 
ATTTGGTCGGCGTGTGAATGCTGACCATGCACTTATACTTTTTGCCTTCGAGTGGGTGATCATGTTACATCCGCCCTGGCCTAACTGCGTGCATA 
AACATAAACATAAACATAGCAACAAAGTGCTGCCATAAGTTAATATGCTATTTTATTTAATTGTTCTTAATATTGCATAGGCACATACTACATCT 
CCCTCCTTTTTGAAAAATAACGTAACCTAGTGAAGTTATTTTGATTATTAAATGCAAATTATTTTATTTTTATTTTTATTTTGTTTGTGTTTTTG 
TTTGAACAACAATAGTTGATTTTATAATGCAAAGATAAAAATATACGTAGTGTATGGAAAATTTGTATAAATGATTAAGGAAAATATAAGTTTAA 
ATTCAATCATGAATGAAATTTCTTTAATCTATCTTTATGAACTATTTGTTTTTTGTTTTTATTAGTAAGTAGCGTAATATTGTTATTATCTCCTA 
TGCTTTCTATCTTATAGGGCCCCGTATATTTAAAGTCTAATTTATGACCTACCTCATTTCTTAGTAAAACTTTATCTCCTACTTCTAATTCTATG 
TCTTTTATTTTTAAGTCATAATTTTCTTTATTTTTTTCTTTGTGTGCTTCAAGAAGTTTTCTTGCTCGAGCATATGCTACCTCTAACCTATATTT 
ACTCTCCTTAGCGTAATCATCTATGTTATATATTGGTTCTATGCTATGTAGTTTATTAAAATGTTTTGGTAAATTACTTGTTCTACCGAAAACTA 
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ATTCATATGGACAATAATTATGTACCATAGATTGGGTCGTGTTGAAGCAGTATACGAAATATTGAAGCCATACGTCCCAATCGGTTTTGTCCGTC 

GATATGTAGGATCGTATATACTCGTTTAAAGTTCTATGACTTCTTTCTACTACTCCAACTGTCTGGTGGTGATGAGCTGTTGATGTTATATTTTT 

TATTTTCAAATATTTACACAGGTCAGTAATTATTGAATTCTTATACTCTGTTCCCATGTCCGTTATGAACGTCTTCATTGGACCGTACTTTAGAA 

TAAAAGATTCAAATATAGCTTCTGCGACTGTTTTTGCGCTTTTATTTGCTATTGGTATGGCAACTAAGTACTTGGTTAAATCACATATGAGAGTG 

ACTGCGTACTCGTTACCATTTTCTGACTTGGGTAGTGGACCAATTGTGTCCACAAACAACTCTATCGAAAGCATGTTCTGGTGTTTCAGTTATCG 

GTCATTGGAGTCTTTGTGTGCTTTGTTGTTTTTGCTTTTTGGCATTTTTGACATTTCCTTACGGTACTCTTTTATGTATTTACTCATATTTTTCC 

AGTAATAATGTCTTTTGACCTTGGCCAAGGTTTTTGTAATGCCTGTATGCCCTCCTTGTATTGGATCATCATGTAATGTAGACAATATAGCTTCT 

TTTTCTTTTTCATTATTTATTTGGGTCACCGGGTTAAGTAGCGCTACTTTTAAATTCTTTAATATTTTATTGCCCATATTTTTAAATTTATCTAT 

TGAAACGTGTTCAAAGATTTTTTTCCACGGTGCCATTTTGATTTGGCTGATATCATATATACCGGCCTGCAATTCAAGCCTTTGGAGAAATTGAT 

CTAAATCAAGAATTCCATTAGTATIiAAGATCACCAACATCATATCTTGCAATAATTTTCTTTCCATGTTTAAATAAACATATCGAGTCATTCAAT 

TGCAATGTCACTACTTTTCGTACCTCGTCATTTGTTATGACTTCGTATACCGTTGGGCTCTGAAGCTATTTCTTTGGTTTGCTTTTGCAAATCCA 

ATTGTTCTTTTCCTGCGCAGGATTTTTGTCTACTTTGAAATCTTGTAGTGACTTTTAATATATTTCCAGTTATATCTTTTAGCTCTTTGATGGTT 

ATTCTTGATAACGCATCTGCTACATGATTGTCCTTGCCCTTTAGATACTCTACTGTAAAATTATATTCCTCTAGTTCAAGCCTTATTCTAGTTAA 

TTTAGAGCTGGGGTTCACCATCGAGAATAAATATGTCAATGGTCTATGGGTCTGTTTTCACAGTGAAATGTTTTCCGTAAATGTATGGTCTGAAA 

TGTATTATTGCCCAATGAATTGCTGCTAACTCTTGGTCTGTTGTACTCTTATTGCTTTCACCTTTCGTAAAAGCTCTGGATGCATAAGCAACTGG 

GGAGTTGGTGGCCATTATGGTTTTGAGTTAAAACTGCGCCACACGCTTGCTTGCTTGCATCTGTTGTTATGCAAAATTCTTTGCTGAAGTCTGGG 

TACTGCAAGAGTGTTGGGTTAATTAGCTGAGATTTTAAATGTATGAATGCTTTTTGACATTCATCTGGCCACTCGAATGGAACATTCTTTTTACA 

TAATCTTGTTATGTGCCGCGAATAGTCGGCGAAATTTTTGATAAAACGTCTGTAGTAATTGCAAAATGCTACAAAACGTCTAGCGCTGTCCGCAT 

CATGTGGAACTGGGTAGTTCTGAATGACATCATATTTTTTGGCATCCGGCAAAATTCCTTTGTCTGTGCATTTGTGTCCCAAAAATGTGACTTCA 

TGCATGAAAAATGAACATTTTTCAGGATGTAACTTTAGGTTGTATTCCCTGCATTTACCAAAAACTTCAGTGAGGTTTTTAAGCATATGTTTTTC 

GGAACAACCTATGACTATTAAGTCATCCATATAAAGGAATGCTTGAGACGGTTCTATTCCGGAGAATGCTATAGTCATCATTCTTTGGAATGAAT 

TAGGCGCTATTTTTAAGCCAAATGGCAATCGCGTGAAACGATATGAGCCATTGCTGGTTGAGAAAGATGTTATATCTCTCGAGCCTTCATCCAGT 

TCGATTTGATGAAAACCTGACATTAAATCAAGGCAGGAGAAATATTTTGCTCGACCCAAGTTGGTCCAAAATATCATCTATTCTCGGTAGTGGAA 

ATTTGTCAGCTAAAAGTTTCTTATTAATTTGGCGATAGTCTATTACTAATCTCCATTTCTTTTTATCAGAATTCGGGCTTGACTTTTTGGGTACT 

AATAGCAAAGGGCTATTGTACTGTGAAACTGATGGTTCAACTATTTTATCTTTTATTAATTTCTGAACTTGGGCTTGTATTTCTTCCACTTGACT 

ATGAGGACTTCTATAATTTTTCGTGTATACTGGCTCATCATCTTTTAATCTCAACTGTTGGTTATAAAAATTATTAACTGGTATAGGTTCTGATT 

CTAATGCAAATATATCTATATATTCGCTGCATATATTTTCTAATTGTGATTTAAACAATTCGGGGAAATTTTTCTTTAATTGAGATAAGACAGTT 

TTATTTCTGTGTTCACTATTTGCCTGAACTACATTGTAGTTCGAAAGTGGCTCATATTTTAGAGTGTCCATATTGACTAATTGGTCGGAATCGGT 

TGTATTTAAAATTCGGACAAATGTATTACTTGATGTTGCGATTGGATTTGCAACATAAATACCAGTTTGAATTTCCTGGTTTGGAATTAAAATGT 

TATCATCTTTTGATGATACTATTAATCTTCGGACAACTTGGGATCTTGCTGGTAATAACCGTTGTGTTAATACCAGAGCTGTATGCTATGGGAAT 

ATATATTGGAAATTTCAAATTGTTTGGTCTAATTATAAACCAATCTTCTTCTTGGGTTTAAATCGATTTGGCAATTATATTTTTTTATGAAATCT 

ATTCCGATTATTCCATCACACGGTATTGGAAAGTTTTTATCTACTAAATGAAAATCGTGTGGGATAACGTATTTACCTGTCTGTATCTCAATAAA 

AGTCTGTCCTCGAGACTGAATTTTCTGTTGGCCTATGCCCTTGAATGTTTATTTTATTGGTTATTTGAATATTAGAAAATTTGTCAGAGTTCTCT 

TTGAGAATAGAGATATCTGCACCTGTATCAAGTAGAAAAACTAGTTTTACGCCTGTTTTGGCATGAATGAATGTAGAAAATATGTTGAGATTATA 

ATTGATGGTGTATACTCTACGATCTTCTTCTACTGAGTACCTAAAGGCTGTTGCGAGTTTCCCTGTTCTTGGATTACTCGTACATTGGCATTGTT 

TCTGTTGTAGTTGTTCTGACTGTTGTTACGGCCTCTGTTTCCATTATAACGGCCTCTGTTATAGCCATTATAACGGTTATTATTTCCATTATAGC 

GGTTTTGATAACCGTTATTCTGGTAACCGTTATTTTGGTAACCGTTATTTTGGTAACCGTTATGGTAGTTGCCTCTAATACTATTGTTATTAAAG 

CCTCTACCGCGGTTACTACCTCTATTGGCATTTCCGTATCCTCTATTGGGTCTGTTACCTCTAAAGTACATCACTTTATTGATGTCACCGGTAAC 

TCTTGTACTCGTTCTTATGTATGCGCTTATCGCTGCATCCATAGTGGTGCAAGTCCCTGCTTCCAGTACTGTTTTGATACTTTCGTGCTCAGCAC 

GATGTATCATGGTTTCAATTGCCTCTTTGGTGCTTAGACCAGTGGCATGTTCTAGAGGTATACCCTCATCGATATAGGAAGCTTCTAAAAGTTTT 

CTCAGGCTATCAACTTCAGCGGTAAATTGCGTTGCAGTTTTACCTCTCTGCTGAACTGTTGCTAGCTTTGCTTTGACATTACTGGATGTCTCACC 

GACAACTACTTTCTGCAATTTGTTTATAATTGCTGGAATTGTCGTTTCATTGTCAACTGAGTTCAAAATTGTGCCAACTATTTTTGACTTAATGA 

CCTCAACTGCAATGTCTTCAAATGGTCCCTTAGCTAGGTCTACCAATTTGATTGCCTTAATGAATCTTTGTAAATGGATCTTTTGCCCATCAAAT 

TCTGGCACTGTGCAGGCGACTTCCCTAATGTAAGACCTAATGGCAGCTGCTTCTTCTGCCATGGTTAAGTTATTTTCGGAGTCGGGCTGCGTTAT 

TTGTTTTATCGTCGTTCTTCAGCCATTAATTTTGCTGGTATAGTTAGATCGGGAAGATCTTCCTCTTTAATGTCTTCCTTGACTATAGTCCTATC 

TTCCTCGTCAGAGTTAGTGCTGTCTTCATTCAAATTTATTCTTAGGGGGGTGTTTACTATGGGTGGAATTTCTATCTGGAGACTGAATTTATTTT 

TTATAAATATTAATTTTTCTCTTAATGTCACCAGTAGGTTCAAAGCCTTAGCACACTGTTTGCGTTCCAAATTCTTTCTATTTTGTTGGATCAAC 

TCTCTGATTTCATTATAACTATTTACCAAGACTTGGGCATTCCTATTGATTGGTACAGCTTGGGTTGATCTGTTTTGCGTTAAGCACTTATAGGA 

TTTTCTGAAATTTTCATTTTGATCTTTTATCTGATCAACAAGCTTTTCCCAGTCCATGCTGTGGTATACTCATAACTTCAAATTTATTCATTTGT 

ATATATTTTTTTTTTAATGTAGATAAATTCAATATTACGGTTTTATATTTTTTTTTTTATTTATTATTTTTTTTTTCATATAATTTTTCTCATAT 

TTTCATTCATAATTGACTTTATTAAAGATCTTCCTTAAATATTATCTATACCGTTAGCAGTGCTACGGTATCGTTTCTTAAGACACTTGTTATGC 

AGTTTGTATATTTTTATTAATGCATTGGATACCATAATTAATGCAATTACAATTAGTATTATGGTTATCCAGAACAAGTCTGCATTTTGGACTTC 

TACTTTATTTATTACATTGGCTGGAGAGTCCACAGTTTTAGTGTCTATTAGACCCATTTTGGGTATACTAATTTTATATTTTGGAGAATAGAATA 

GAAAATTTCCTTTGGATATACTTTTATTTTCAGGTTTACTTATTTCATTTGTCTCTATAGACATTTATGTTTGGAATTATCTATTTATAAAATCT 

TTCAATGTTCGCAAGAGTAAAAAAAAATTGTATTTTACATTGTTGCCTGCCTGATTCTAGCTATGGCTTCACGGGCCATATTGGGTTAACAGGGC 

ATAATAGTATATGCTCCGTTTTACAATATTTCTTACAATTTTTCTTATCTATATATATTTTATATTTACTTTATATTTATATATATTTATTATTA 

TATTCTGCTGTTGATGATGCTCATCAGGTGTCAGCTGGGCGTCGCCGGGGGTGGCACGCTGACACTCCCTCGCTGCTGTCTATTTCTTCGCCGCT 

GAAGACGTCGATGGTCGCTCACAAGTCCCCAGGCAGTGGGGACTCCAGATTCCTTCGCAGCCAGGGGGGCAGGACTTCTTTGGTTTTGTTAATGC 

TGGTGCTGGTGCTGGACATGCCTGGTATTTTGGAACTTCATTAATTGTGGGTATATTTTTAATATTATCTGAGTGAGTTTTCTCGAGCTTATGGG 

TCTTTAATTCTTGCTGATATTGCAACAAGTGGTCTAGCTCTGCATGTAGCTTTAGAGCTTTCGACCAAACCGGTGGCGTTTGCTGACCGTATATT 

AGCCACCTTACGTGGGCTATACGTTTATTCAGCTTAGTTTTCAAACCGCCACGGTGTTTGCCCATACTCAATAGAACTCTACTCACTCAGATTTT 

TCCTTGGTTTTTGCAGTTCCATTTAATTAGTCCAGTCTTCTTCCAGTCCAGTGTTCCTTTGTTCCTTTGTTTTCACCAGCTTTGGTCAATTCGTC 

CAGCGTTGGACGACTGTCACGGTCGCCATGTAATGATGAACTCCAATTAAGAACACAAGTCCGTTTTAAGTTGAAGGCAGCCGTCGGGCTGACAT 

ATATTTATTCAATTTCTTCTTTTGATTTTAGAGTAGGCAGAAATGGTTCTGTCAGCTCAACGTCTACTGAGTCTCTCTGATTTTACAGTTTTTTA 

ATATATTTGGACTTAGGCTAATCCGCGTAGCTGCGCTGCCGGGTACGCCAGCAGCGGAGCGGCGTTCTTATGTATATCTTATATATCTAGGCTTA 

TCGGGAGAGCGAGATATGTATGTACGTATGTAATATGTATTTGGTCGGCGTGTGAATGCTGACCATGCACTTATACTTTTTGCCTTCGAGTGGGT 

GATCATGTTACATCCGCCCTGGCCTAACTGCGTGCATAAACATAAACATAAACATAGCAACAAAGTGCTGCCATAAGTTAATATGCTATTTTATT 

TAATTGTTCTTAATATTGCATAGGCACATACTACACTATCGCTAGTTTCTTAGGTCTTAGCTTTTCCTTAGGTTTTTCAGATAGAAAGGCAGGGA 

ATACAGACAGCAAAGGTGAGTTAACATATTACTACGTTCCTTCAAGAGTGGCTTTCGTTTGCGTTATCTGATGAGCAATAACTTTCGCCCCTAGC 

TGAAAAATGATACTTCTCATGTTGAATGCGAGATTGGTCTTACTGAAAATAACGGAAAATAGAAAAGCTCATTTGAGTTTCTAAAAACGTTGACC 

TGGCCTCTTATTAACTTAATTTATATTTCTATTGTAGCTTGGAATACTGCTCAGAGGACAAAGCGGGAGTGACGGAAGATACGGTTAGTATCTAT 
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TTGATGACCATCTTTACGTTATATTAGTTCTGATGAGCATTAACTTTCGCCCCTATCTGAAATTGAAACTTCTCTTGTTGAATGCGAGATTGGTC 
TTACTGAAAAACTAAAAAACTATAAGCACATGTGGGCTATCTATATCCAATATATTTTAAAACCCAATATATATTTACAGATCGGCGGAACGAAA 
ACTAAAAACGTGCTTCGAATAAACATAGGAATTAAGGTAAGAATGCCTTAATTTATAGTTGAATTTCTTATTATCATGATGAGCATTAACTTTCG 
CCCCTATCTGAAATTGAAACTTCTCATGTTGAATGCGAGATTGGTCTTACTGAAAAGAATTTCGATTTAAAAACACGAAAACAATTTTCTAAGCA 
CTTTCTAACCAAATTTTGCCCATTCTCTTTACAGATTTGTTGATTAGGTTATGCAATATATGGACTATAACTTTAATAAAATAGGTAAATATTGA 
ATAACTAAAAGTTATTAAAATGTTTTTAAGTGATGAGCTTTAACTTTCGCCCCTATCTGAAATTGAAACTTCTCCTGTTGAATGCGAGATTGGTC 
TTACTGAAATGACTCACAATTAAAAACAAGCCTCAATCTTTTCGATTGCCTTTCTAATAAAATGCTACTTTTTCTTTACAGATAAGTTCAAAAAT 
TTTTTTGAAAATCATTGAATGCCACAGGAATCTTAAAAGGTAAATGTATTCAAAATGAATTATTTCTTTGCGAGAATAACTAACTTCTTTGAAAA 
TTAACTGATGTAAAAAAAAAACTGAATTATGTGGCATGATTTTCACATTAACCGCAACAAATCCTACTCCAATAGGTCCTACTGACAATTCTCAC 
AAAAATAATGCATTTAGGCCAATACAATAACAACGATGCCTAACAATTGACTCTCCTTTGCCTTTTCAGTTCGCATGACACTTTATTGCGTTGAG 
TAAACAAGGTAAAAATAAATTTAAAAAAACTCGTTTGCCCAGTTTCTAGTGAAAATGTTGAAATAAAAATACTGATTTATGTGGCATGATTTTCA 
CATTAACCGCAACAAATCCTACTCCAATAGGTCCTACTGAATATATTCACAAGAAACTTAAGGTGTGTTTTAACCAGATGTGGTTCTAAGTGAAT 
TTTTTCTTATATTTGCAGATGTGCTTTTGGCGACTTTCCACTGACTGTTCCAATTGAGTTCCATTTTATGCCCTTTTTTAGGTACAACACTGCCC 
AAGACATCATATAATACTTTATAAGCAAATAAAAGAAACATACACAAAAAGCACAACTTCTTTTTAATTAGACAAGGCCATGTGTGCTCTTAACC 
TAAAACACAGCTGAAATCTATAGACCGCTTGTTTTGGGCGGGCAATCGTGGGTCAGGGGTGCAAAAAAGCGCGAAAGAAGAGTGCCGACCTTCGG 

ATGGATAACTTGTGCGTGTGCAAAATGAAGCT 
{SEQ ID NO: 307) 

Exon: 1001.. 1088 

Exon: 1149.. 1517 

Exon: 1594.. 1845 

Exon: 2038.. 2312 

Exon: 2548.-2952 

Exon: 3014.. 3407 

Exon: 10866.. 10940 

Exon: 11153.. 11198 

Exon: 11386.. 11436 

Exon: 11625.. 11667 

Start ATG: 1076 

Transcript No. : CT11195 

ATTTTCTCGCAGCGCGCAACACGTGCGGAGGTTTCTCTCCGTCTATTAGTTGATTTAGCAGAAAACGTAGATAAAATGGCCGACGTGGAAGTACG 
C AAGG AGAAG AAAAAG AAG AAG AT C AAGG AGGAGC C CT T GG AC GGCG AT G AC ATT GG C AC GCT GC AG AAG C AG GGC AACT T C C AAAT C AAGC C C T 
CCTCCAAGATCGCCGAGCTGGACACCTCGCAATGGCCGCTGCTCCTGAAGAACTTCGACAAGTTGAACATCCGTTCCAACCACTACACTCCGCTG 
GCGCACGGATCGTCGCCCCTGAACCGCGACATCAAAGAGTACATGAAGACTGGCTTCATCAACCTGGACAAGCCGTCGAACCCCAGCTCCCACGA 
GGTGGTGGCCTGGATCAAGAAAATCCTCAAGGTGGAGAAGACAGGCCACTCCGGCACGCTGGATCCCAAAGTCACGGGTTGCCTGATTGTTTGTA 
TTGACAGGGCCACCCGTCTGGTGAAGTCCCAGCAGAGCGCCGGCAAGGAGTACGTGGCCATCTTCAAGCTGCACGGAGCCGTGGAGTCGGTGGCC 
AAGGTGCGTCAGGGCCTGGAGAAGCTGCGTGGTGCCCTCTTCCAGAGACCTCCACTCATCTCCGCCGTAAAGCGTCAGTTGCGCGTTCGTACTGT 
CTACGACAGCAAGCTGTTGGACTACGATGAAACCCGAAATATGGGTGTTTTTTGGGTTAGTTGCGAGGCCGGCTCCTACATCCGTACCATGTGCG 
TCCATCTTGGTCTCGTACTGGGTGTCGGTGGCCAGATGCTGGAGCTCCGCCGTGTCCGCTCGGGCATTCAGTCTGAGCGCGATGGTATGGTGACC 
ATGCACGATGTTTTGGACGCCATGTGGCTGTACGAAAACCACAAGGACGAGTCTATGCTGCGACGTGTGATCAAGCCGCTAGAAGGTCTGCTGGT 
CAACCACAAGCGCATTATTATGAAGGACAGTTCGGTCAACGCCGTTTGCTATGGTGCCAAGATTACATTGCCCGGTGTCCTGCGCTACGAGGATG 
GCATTGAAATCGATCAGGAGATCGTCATTTGCACCACCAAGGGTGAGGCCATTTGCCTGGCCATCGCCCTCATGACCACAGCCACTATGGCCTCT 
TGTGACCATGGTGTGGTGGCCAAGATCAAGCGAGTGATCATGGAGCGAGACACATACCCGCGCAAGTGGGGATTGGGACCAAAGGCGTCGGCCAA 
GAAGGCGCTCATTGCCGCCGGCAAGCTAGACAAGTTCGGAAGGCCCAATGAAAACACACCTAAGGAGTGGCTGACCGGTTATGTCGACTACAATG 
CCAAGAAGCCAGCAGCTCAAGAAGTCTCCCCCACAAATGGCTCCAGTGAACCCAGCAAACGCAAGTTAAGCACCTCCAGCGTTGAGGAAACTGCT 
GCGGCGGCGGTATCCGAGGAGACTCCTTCCAAGGACAAAAAGAAGAAGAAAAAGAAGCACAAGGGCGACGAGGAAGCCCCAGAAGCCGCTGAGGA 
GGAGGCAGAGCCAGTAGAAAAGGAGAAGAAAAAGAAGAAGAAGAAGGACAAGGATAGGGACAGAGACGAAGCTCAGGAATAGGAATCAGAAGTTT 
CTATAGGATTAGGTTTTAAAGAGAATTCTACGCTTTTTTGTAGAGGCACCTACCGATTGCTAGCTGTTAGGTTTAAGGAGTGTTTTTATGTGTAG 
AAGTGCATACAAATTACAAATGCATTTAAGTGACATTCGCAATGCAATAAATTGAACATTTTATTGTGGAACTCTATCGCTAGTTTCTTAGGTCT 
TAGCTTTTCCTTAGGTTTTTCAGATAGAAAGGCAGGGAATACAGACAGCAAAGCTTGGAATACTGCTCAGAGGACAAAGCGGGAGTGACGGAAGA 
TACGATCGGCG G AAC G AAAACT AAAAAC GT GCT T CG AAT AAAC AT AGG AAT T AAG AT T T GT T GAT T AG GT TAT GC AAT AT AT G G ACT AT AAC T T T 
AAT 

{ SEQ ID NO: 308) 
Start ATG: 76 

MADVEVRKEKKKKKIKEEPLDGDDIGTLQKQGNFQIKPSSKIAELDTSQWPLLLKNFDKLNIRSMHYTPLAHGSSPLNRDIKEYMKTGFINLDKP 
SNPSSHEVVAWIKKILKVEKTGHSGTLDPKVTGCLIVCIDRATRLVKSQQSAGKEYVAIFKLHGAVESVAKVRQGLEKLRGALFQRPPLISAVKR 
QLRVRTVYDSKLLDYDETRNMGVFWVSCEAGSYIRTMCVHLGLVLGVGGQMLELRRVRSGIQSERDGMVTMHDVLDAMWLYENHKDESMLRRVIK 
PLEGLLVNHKRIIMKDSSVNAVCYGAKITLPGVLRYEDGIEIDQEIVICTTKGEAICLAIALMTTATMASCDHGVVAKIKRVIMERDTYPRKWGL 
GPKASAKKALIAAGKLDKFGRPNENTPKEWLTGYVDYNAKKPAAQEVSPTNGSSEPSKRKLSTSSVEETAAAAVSEETPSKDKKKKKKKHKGDEE 
APEAAEEEAEPVEKEKKKKKKKDKDRDRDEAQE* 
(SEQ ID NO: 309) 

Classification : known_f lybase_gene 
Gene Symbol: Nop60B 
FlyBase ID: FBgn0023184 



Celera Sequence No. : 142000013384809 
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TGTATAGAAGGTGAACATAT^ATAGTGACTGCTACTCACGCTAAATTCTTGCCCAGTTCACTGCACCCTTGCAGTTTTGATAGTTTTGGCAGATT 
AGGAGAGACGGCAAATGTGCAATCTAATATCTATTGACAATTGCTGCATCACACATATTGAAAATATTATCGATAAGCCTTATTAGTTTTTGCAG 
TTTTGCAAGCAATACAGTGCGGATCCTGAGAGGTCTCACTGTGTCTTCAAGGTTCGCCTTTAGAAGTCAAATGGCTCGAAGAAGTCTAAGCGGAA 
TGTAGAATGTAGCCAAATATAGAACACCTACCACAGATCGCATTAGTTTCTTTCTTTCCTACGATATTTACTCCTTCAAAATCGATGCGCAAATG 
GTTTTTAAGGACCAGCGCCTGCGCAGCACTTGACCAAATTCAGGTAACGGCGAATCTTATTTTATCCACTTCAACCACATACGATAGAAAATATT 
AGGGGAGAGATGTGTGCGTGACAATTAGATAGTGGATTGTACATCCGATAGCTCAGGTAAACAACAACCAAACATACCTAAAACTTAAACAACTC 
TTTAATTTAACGTAAAAATAATCGATGACATTTTATGAGCGAGTTCCGAGAAAACGTTTAATAAATTATTACCCCATTGGGTGTTCGTGTTCGCA 
CAACATAAGACTCTCATCTCTACATTTTATGAGTGAAACATTCGTTTGAGTGAAAGTCGATTTCCAACCCTGTTATGATTTGGCGGGAAACTTAA 
ATACTTGGATGAAATTTTTTGTATAAAGAGACATTTCATATTTTGTATTTGTAGTACTTAAATTCTAGTTTTCTAGTTTGCAATGCCTCGTTCAT 
CGTTATTATGGCCTGAGTGATTTCGAACTTTCTTTAGAGCCGTATGCTTAGTTCTGTTTTTTTGAAGACTTCAATGAGATGTTCATTAAATCGTA 
GAGTGACAAATGCAGGAATATCAAATGACGTAATAGTAGAAGTCAAGGGXTTAGAAATGATTTGCCACGGTGCTCTGTTGTTCGCCGCGGTTTAC 
TCGCAGGTTGCCCATGCATCCCTCAAACAGGATGTCTGCGATGGCCTTTCGTGCCAGAATCCGTTGCGTGGGGGACATTTCCTCGTACTGGATAG 
CCCAGGATCTGGCCAGAACTTCGGAGGACTCGCTGAGTTGCATGGGTGCAGTTCCTCGTTCGTTCTCGGGATCTGTTTGAATCGAGGATGTGGGT 
TCTTTCGTTTCCAATGAATTTCCCAAACTATTTTCCTCTTTGACTTCACACTGCCCTTCTTCCGGCTCTATGATCTCCAATTTTGGACTGAGCAG 
GGAACCCTCTTCGGATTGCGGGGCGGTCGGTTCGGACTGGTAGTTGAAAGCGTTATCATCCTTGATCGGCCAATCACAAACGCTTTCCGTGTTGA 
GATCTGGATTTTTCAAGGGGTATTTGTCCTATGAGGTACACAATTTATTTATTTTTAAGTATCATTCAATCATTGAGAATGTAATCTTACATCCT 
TTTCCCTGCGAAAATCTTGCTTCTGTCTGCCGCCCCGACCTTTTCTGGTCCTGTGCTGGAAGGTCTCCTGGGTGAGGAGAAATCCCATGTAGTCG 
TAGAACCAGAGCGTCGGCTTACAGAGATCGGAGTGCAGCCCCAGCTCCTTCTGCTTGAGGATTCGCATCTGCTCCCGCCTGTAATTGGTGCGAAA 
TATGTTGATGCGCCTGCCGACTTCTGTGCGCGTAGCGTTCGGCTGTATCTCCCGTAGTTTCCTCTCCAGCAACTCATACGCCCGGTTTCGCAGCT 
CCTTGTTCCGGTAATTGAGGTGGTGGACGTCCCACAGCTCCGGCATTCCCTGATAAAGGGCCAGAAACTCGCGCCAGAACTGCCGTTTGCCCGCA 
CTAAACCGATCTGGCAGTGCAGCTGCCATCTGCCAGCACAGCATTCAGAAAGATACATTTAATTACCATATCCATTGGCATATAAACATAAATAT 
TTACTTTTTCTGTTGTTCAAGCCAATCAGCTGTTGTTGATTACAGTTGGCCAGAGTTGCCACTTCTAGCGTCTAGTGTTGGTAAACACCTCACAT 
ACTCCCACTGATTGTTGTAGTTGCTCTGGATCTGGATGCTCATATATGTATAAACTTATTTCCACCACCTGACGACCGGCGTTATGCTCTCCTCA 
AGCGACAATCACAAACTGAATTGATGTAGAGCCGAAACGGGCGGGTGTGTGTGCATGTGTCTCGATTGAGTGATTGATTAGCCGACTGACCCGAT 
GGGCATGTGCATGTGTGCGTGTGGAAATGCACAATAAATCATTTGATTCCCACATACTGCTTGAAAACAAAAACGTAGCTGAATCCGATTTTCTT 
ATTGTATTAGTTTTAATTATATGGATTAAACAATGACTTACAGCTCCCCCGCAATTTAGGTGCACGTGTGTGCGGGAGAGGAAAGCAGCAAATAT 
TCAGACTTTAACTAAGGGCAGACCTGCGCACTTACCCATCTTAGCCCCAATCAAGATTGTCGGCATAAGGAAATCATCAGCAAAGTATGAATTTA 
TTTTAACTCGCTCTCCAAAATTTCACACTGATCAGGGAAAATTTTAAAATACACTCTGAATCAGGGAAATTCTCTCTTTAACAAGATAGAATTCA 
ATGATTGACCTCTTTGTTCCGTTATTTTCATTTACATGCGCTTTGTCTCTTTCTCTCTTCTGGCTCGCCTGTGTGTGTGTGTGTGGGTGACCTCC 
CGCCCGGCAAAAAAAGAAGAAGAAGCCCAACTAAGGGCAGACATGCGCACTTCGAAAlAAAAAACAATTAGCGCGTATATTTAGCCGACTTTTTGT 

tctggcgagaatgtgtggaaggaaatgtagctgttataacatacaacacaggtcaaaataatagggttacagtcgggagcatagtataaaatagt 
tccaacagaccaattgacaattgacattagctcattcatcacaatattttataaccacatcaaatttactcactagaaacttggtttgattgcgt 

TTATTTGGTTCGAATACAAATTATGTTCACCTTAGATCTAATGATTGATTGCATCCGACCGATGTGCATCTTCTAGATGCATCAGTTTGCAAAGA 
TTACACTCTTTGGCTGCAGCTATTCCTCCTCCTCTTCGTTCTCCTT 
(SEQ ID WO: 310) 

Exon: 2181,. 2000 
Exon: 1929,. 1516 
Exon: 1453.. 1001 

Start ATG: 1929 (Reverse strand: CAT) 
Transcript No. : CT1134 9 

GAGAGCATAACGCCGGTCGTCAGGTGGTGGAAATAAGTTTATACATATATGAGCATCCAGATCCAGAGCAACTACAACAATCAGTGGGAGTATGT 
GAGGTGTTTACCAACACTAGACGCTAGAAGTGGCAACTCTGGCCAACTGTAATCAACAACAGCTGATTGGCTTGAACAACAGAAAAAATGGCAGC 
TGCACTGCCAGATCGGTTTAGTGCGGGCAAACGGCAGTTCTGGCGCGAGTTTCTGGCCCTTTATCAGGGAATGCCGGAGCTGTGGGACGTCCACC 
ACCTCAATTACCGGAACAAGGAGCTGCGAAACCGGGCGTATGAGTTGCTGGAGAGGAAACTACGGGAGATACAGCCGAACGCTACGCGCACAGAA 
GTCGGCAGGCGCATCAACATATTTCGCACCAATTACAGGCGGGAGCAGATGCGAATCCTCAAGCAGAAGGAGCTGGGGCTGCACTCCGATCTCTG 
TAAGCCGACGCTCTGGTTCTACGACTACATGGGATTTCTCCTCACCCAGGAGACCTTCCAGCACAGGACCAGAAAAGGTCGGGGCGGCAGACAGA 
AGCAAGATTTTCGCAGGGAAAAGGATGACAAATACCCCTTGAAAAATCCAGATCTCAACACGGAAAGCGTTTGTGATTGGCCGATCAAGGATGAT 
AACGCTTTCAACTACCAGTCCGAACCGACCGCCCCGCAATCCGAAGAGGGTTCCCTGCTCAGTCCAAAATTGGAGATCATAGAGCCGGAAGAAGG 
GCAGTGTGAAGTCAAAGAGGAAAATAGTTTGGGAAATTCATTGGAAACGAAAGAACCCACATCCTCGATTCAAACAGATCCCGAGAACGAACGAG 
GAACTGCACCCATGCAACTCAGCGAGTCCTCCGAAGTTCTGGCCAGATCCTGGGCTATCCAGTACGAGGAAATGTCCCCCACGCAACGGATTCTG 
GCACGAAAGGCCATCGCAGACATCCTGTTTGAGGGATGCATGGGCAACCTGCGAGTAAACCGCGGCGAACAACAGAGCACCGTGGCAAATCATTT 
CTAA 

(SEQ ID NO: 311) 

Start ATG: 183 (Reverse strand: CAT) 
MAAALPDRFSAGKRQFWREFLALYQGMPELWDVHH^ 

DLCKPTLWFYDYMGFLLTQETFQHRTRKGRGGRQKQDFRREKDDKYPLKNPDLNTESVCDWPIKDDNAFNYQSEPTAPQSEEGSLLSPKLEIIEP 
EEGQCEVKEENSLGNSLETKEPTSSIQTDPENERGTAPMQLSESSEVLARSWAIQYEEMSPTQRILARKAIADILFEGCMGNLRVNRGEQQSTVA 
NHF* 

(SEQ ID NO: 312) 



Celera Sequence No. : 142000013384832 

CAGTTTGCGTGTCTGCGAGCCTGCATTTGAAGGCTGTGCAGCAAAGGCATCTGACATGGAGGCACGAAGGAAGCCCTGTGTGTGTGTGGGTGTGT 
GTTTTCGGCGGCGCCATTGTCGATAATAACGATGGACTGAGCCCCGGGTCGCAGGCTCTTTTGTGCGCGTAACATGTGTCAATTACAGGCGCCCA 
CACCCACACACTCGCACACTGGCACACACGCACACACACAGAGAGCGAGCGGGAGGAACAGGAGCAGCCGGCACGTCCAGCACACACCCACACAC 
GCATATGCGCTCACTTAGTTTCGCTCTCACACCGTCGATAACTCAAAAAGCTGCCAAAAAGTGTGCGCACATTTTTTATATGCGCCGTCCGAAAG 
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TATGCACATATTTTTTATATTTGTGCTGCCGGCATTTGACAGCTAAGCGACGCTGCCCTATGTTAATTTATTTTCACACAGCATTTGCTTTTTGT 
TTGCATGTGCGTCGGGAAGGGGGGAAATTACTTTCCGCCTCTGTTTTTGTTGGCGAATAAAAAATCTTAATTAAAAAATTTCGAATCGCTGTAAC 
AGGTGGAAAGGCCAGCTGCTAAGTAGAACGGCCAAACTTTGCCTCTCCGAGAGATTTGGATTGCCGGGTGCTATAAATTAGCAGTTTCCACTTGT 
TACGTAGAAAGCTATTATCAACTTATTATTTGAATTACTTCTTTCTGTATATAGTTGCGATGAAATGGGGTTTGTCATTAATTATTCCATCAGAT 
TGATTGTGAAAGCTAACAGCGATCTGTAAACTTAACAGCCGTTAAGCGATCTCCTATAATGATCATCATCATCATCATCATCAAGTTCTTCACTG 
AGTTTTGGAGGCTGTTCGTACATAGCTCTCCTCTCATTTTCCTCACTCTTTTGTGGTTCTATTGCTCTCTGCCGCTCTCCAACGCGAATGTCTGT 
GGCTATAAAAACCGGGTCCCCACCGCAGCAGCAGTTATTTCTCCGATCCGACGTCGGGTGAACAGATCAGTCGTGCCGCAACTGCAGCTATATCT 
TTACCTATGTGTTCTAATTTTCCGTGCCAGTAACCGTTCTGATCGTGCGAATAAAAAGTATGTCACTTGGGGGCGCTGCACTGGCATTGAAAAGT 
GAATTACATTGATCGTGACATGGGAATGGAAAATGTCGCCGAGCCATGTAAATCGTTTGGCCTACCCGTCTACCAGCCAACCAACCAATCAACCA 
ACCAACCAAAACCTGTGCCAAGAACTGCGTGTGCGCAGAAGCGTTTGGGAAAACTCTGTGACTGGCATACGAAATAAGTCAAGGTCTTCAGCATA 
AGAAGACGCCAGAACATAAGTCATACTGGCGACTTATAGAGATAGCGCCCATTTCCCATCCATAATCGCGGGTAGCAATGAACTTTGAAGCCGGC 
ACGTCACTTGATAGCTTCCTTAACCCATGTGGGCCCCATGGGCTCCGATTACATTACCTCGCCAATTATTCCCCTGTGCAAAACGTAACTGCATT 
ATTAACAAGTGTTCCGACCACCTTTCCATTGTTGGCGCTTCGCGATAAGCCGGCACAACAAAGAGCAACAGACGCCGCGATATCAGTTGAGTTCA 
GTTCTGCTGCTCGCACCAGGGCAGGTTCAATGTCTGGCGCGGTTTTATCGACCGATTGCCCACTCAGAGGGGTCCACTTAGCGAGCCGACCAATC 
TATGAAATCGCACAGTTTAATGAAAGATTATTCACCTAGAGACTCGAAAAATTTCCATGCGCAAAGGTTTAAGGCTGCACTTTCTGTTGATATAT 
ATAATGTCGCCGTAATAAAAACATTCACTTGGGTCGTAGATGTTTGTTGAAATAATTAGCCGTTTTCCCACATAAAACGCTTTAATAATGATGTA 
AAAGTAATTAGCCGTTTTTCCCTAACGAGATCACTAGATAATATTTTTTATGCCCCTTTTTATATTGGCTTAAATAAGCTGTAGTCATCATCATA 
GACAGAGGGTAGTGTAATGACTACAATAGGTGGGGACATCTGGCTTTCCATGCCAGTTTTTAAAATAAAATTTAGACCCCGTACTCAATTGCCAC 
TGCTCCACTTGGGTCCGAATAAATGTTTTGAGTGCATGAAGGGGATTTTCACAGCCTAAGCACTTGTCACTCCCCCACAGCCCCTCCTAAACCAC 
ACCATCCAACTGCTCATGGCGGCTGATAAGAACTTGCGGCCCACAAGCCATGTAGTAAATCGGTATCAGACAGCACTATGGTGCGCTTATCACTC 
AGTTACTCGCAGGCGGTTGCCCTAGGCACAAGTGGAACTGAAATCGGAAGGGTGGCATCCACAAAGTGCTCTATTTGGCCATTCCCTAGCTGTCA 
CTAATCATATCCGTACGATTGAGCAATATCTGACTAACTGTTTCGATCATGTTTGCTTTCACCCCCTGCCCTTTGAAGTGAGCGAGAACGAGAAG 
CGTCCCGATCTGGAGGATCAGGATCAGGATCAGAATCAGGAGCAGGCTGGCGGCAGGAAAATGGAGATCCTGCTGGAACTAAACCTAAAGCAGAA 
GCTGGCCACCCGACTAGGCACCATCCTGCTGGCCACAGTGATCTTCGGCGCCTTTTGCTGGGACAACGCGTTGGCCAGTCTAACACTGGGAGCAG 
TGGTGGCTCTCCTGCTCAGAAATCCCACCTTCGTCTTTGCCTTGGTAATGACCGCGTCCAGGGACCTAAAGTGGGTATTTGAGTGGCTAATTTCG 
CGGAGAAACCCCCCAACTTCCGTTCTCTTTTAGGGCTTTCCAGCGATTCGTGGCCCTCAACATCTACCTGCTAAGAAAGGATCGCGGCGGATTCA 
CTGTGGCCCGCTGTTTCCAGGACCAGGCCAGACGCCGCCCCAAAAAGACATGCTTCGTGATGGACGATCGCCATTTAAGCTTTGCAGAGGCTCTG 
GAGTTTAGCCAAAAGATAGCTGGATATTTTAGCGATCGCGGCCTGGAGAGGGGCGACTGTGTGGCCCTACTAATGGAAACGCGTCTGGAGTATCC 
CTGCATTTGGTTGGGTCTCTCCCAGCTCGGGGTAATCACAGCCCTTATTAACTCCAATCTGCGCGGGGAGTCTCTGCTGCACTCCATCAAAGTGG 
CCAATGCCAAGGCCCTAATTGTGGGCAGTGAACTCCTGGATGTGCTGGTATCACTTAGGGAGAAGGAGCAGTTGGACGAGGTGCCCATCTATCAG 
TACACAGATGACGAGGTTCGCGGGGTGGCTGGACATGATCTTTTGCCAGGAGCTGTGGACTTGGTTACTGCTCTAAAGACCCAAAAGAAACTGGA 
GCTACCGAGTGCTGTCTGTCCTGGAGAAGCTCGTTCCAAATTGCTCTACGTTTACACCTCTGGAACTACGGGCTTGCCCAAAGCAGCTGTGATCA 
CCAACCTGCGGTTCCTCTTCATGTCTGCAGGATCCTACTACATGCTCAAGATGTCCAGCGACGATGTGGTGTACGATCCCCTTCCCCTCTACCAC 
ACCGCTGGTGGAATTGTGGGAGTGGGCAATGCCATCCTCAATGGATCCACGGTGGTGCTGCGCAAAAAGTTCTCGGCCAGGAACTTCTGGCTTGA 
CTGCGATCGCCACAACTGCACTGTGGCCCAGTATATCGGGGAGCTGTGTCGCTATCTTCTGGCCACCTCCTACTCCCCCGATCAGCAGAAGCACA 
ACCTGCGTCTGATGTACGGAAATGGACTGAGGCCACAGATCTGGTCGCAGTTCGTCCGTCGCTTCGGTATTCCGCACATAGGAGAAATTTACGGC 
GCCACCGAGGGAAACTCCAACCTGATCAACATCACCAACCGAGTGGGGGCCATCGGATTTGTTCCCGTCTACGGATCGAGCCTCTACCCTGTTCA 
AGTCCTGCGGTGCGACGAGTACACGGGTGAGCTGCTCAAGGACTCCAAGGGTCACTGCATCAGGTGTCAGCCTGGGCAAGCGGGTCTCCTGGTGG 
GCAAGGTGGACGCTCGGCGGGCCGTGAGTGCCTTCCATGGATACGCCGATAAGGGAGCCTCGGAGCAGAAGCTCCTTCGAAATGTCTTCACATCC 
GGAGACGTCTTCTTTAACTCGGGCGACATGGTGGTGCGAGACATACTCGGTTACTTTTACTTCAAGGATCGCACGGGAGACACCTTCCGCTGGCG 
GGGTGAAAACGTCGCCACCCAGGAAGTGGAGGCCATTATCACGAACTGTGTTGGCTTGGAGGACTGCGTGGTCTATGGCGTGCAGGTGAGTTTCA 
GATATTTCCGATTTCTTTTTATTTAAGTTTGCATCATAAATAATTGATACCTTATCATCAAATGGTGGTCAGCAAGAAAGAATGGAATCTGGCTT 
TAATTACCGTTTAATAGCAATCTGAAATGTGTTTATAGATCGCATATATAACTATTGTTTCAGTGTAGCTGAGGGAACTACCCATGTCGCTCTTA 
GTCACTGGGTTTCTGGGTTGTGCCATGTAGATAACAAACCACCTGAAACCACAAGCCGCAAACCTTGATTTTCACTATCGGCCCGCATATCTTTG 
CAGATCCCCCATGTGGAGGGCAAGGCGGGAATGGCGGCGATTGTGGATCCGGAGCGCAAGGTGGACATGGATTACCTGTCCGTGGTGCTGCGGGG 
CAGTTTGCCTCCGTACGCTCGCCCCCTGTTCATCCGACTGCTGGACGAGATTCCGCGCACCGCCACCTTCAAGCTGAAGAAACGGGAGTTGGCCA 
AAGAGGCGTATGACATCGGCCAGCTCTCAGATCCCATATACTACCTGAATCGGGACGGTATCTACAGACCACTCAGTCAGGAGCAGCACGAGTTG 
CTGCGATCGGGAAAGGCGGGACTCTGAGGATAGAGGTCCCTTTTGCGCCCACAATTGTATTCCAGCCAATCCTCTGTGTATATTGCAAAACGCTT 
GTCTGTCCCAATGTATTTTTAACGCTTAAACAACGACTGATTCCGAAAGACCAAAATACCTAAATGCCGAAATAGTGATAAAGTTCTTTCGCTGC 
TGGCCCAAAATGCCGGGCGCGTTTCTTGGCCATAAAATGGCAGCTTAACCGATAATGGGAGCCAATACGTTTTACGGCCAGAAGATGCTGCTACC 
GCCACAAGATATATCGAAAGTGTTAAATCCCCAAGATTAAGTTCTCGCTCATTGGCAGCTGCTGCAGTTGCTGCTGCTGTTGCTGCTGCTACTTT 
TACAGTTGCTGCTGTGCTGTTGCAACATTAACGCCATTTCCGCTCGCTTGCGGCAGCTGCGACTTCGTTGGCCGTCGCCTAAGATAAACGATGTG 
CAAAATCCATACAAAACAATGGCAGGAGCAGGAAGGCAAGGTCTTGCAACCGCAGCAGCGCAACATCGTCGACGGCCACCAATATCGATGTCGAT 
GTCGATGGCGAAAAAGTTGTGGCAGGCAATCATAAAAAACGAAGAAATGGCAAACAAAGCTGGGCGCCATTTTTAAGGGAGCGAAAATGGCGATG 
GCGCGCTGCTTCTGGGATGCCAAAGGTTCTTGGATGGCTAAAAGTTGTTGCATTCAGTTCAAAGTGAGTCCTTTTCGATTTCGCTGGATCGTAAT 
TGGTATGATATTTCACTTGGAATATATGCAAGCTTTTTAGGGTCCTGGAATGTGCTTTTAAGATAATGCTATAATGCCTGATTAAGATTACTTGG 
GCCGCCATACAAGGCAAACGACAAAGGTTTAGAAAGACACAAGCACTTGTTTCGGCACTGGGGGAAACTTCAGTTTTGCAGGGATCCCGGGATTC 
CAATACACAATGTAAAAGCCGGATGCCCAAAATACACATAAAGTTGCAGCTGCAGCAGCTGCCGAACTCCCCGGCCA 
(SEQ ID WO: 313) 

Exon: 1001.. 1102 
Exon: 2454.. 2730 
Exon: 2789.. 4265 
Exon: 4564.. 4872 
Start ATG: 2531 

Transcript No. : CTH417 

ACGTCGGGTGAACAGATCAGTCGTGCCGCAACTGCAGCTATATCTTTACCTATGTGTTCTAATTTTCCGTGCCAGTAACCGTTCTGATCGTGCGA 
ATAAAAATGAGCGAGAACGAGAAGCGTCCCGATCTGGAGGATCAGGATCAGGATCAGAATCAGGAGCAGGCTGGCGGCAGGAAAATGGAGATCCT 
GCTGGAACTAAACCTAAAGCAGAAGCTGGCCACCCGACTAGGCACCATCCTGCTGGCCACAGTGATCTTCGGCGCCTTTTGCTGGGACAACGCGT 
TGGCCAGTCTAACACTGGGAGCAGTGGTGGCTCTCCTGCTCAGAAATCCCACCTTCGTCTTTGCCTTGGTAATGACCGCGTCCAGGGACCTAAAG 
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GCTTTCCAGCGATTCGTGGCCCTCAACATCTACCTGCTAAGAAAGGATCGCGGCGGATTCACTGTGGCCCGCTGTTTCCAGGACCAGGCCAGACG 
CCGCCCCAAAAAGACATGCTTCGTGATGGACGATCGCCATTTAAGCTTTGCAGAGGCTCTGGAGTTTAGCCAAAAGATAGCTGGATATTTTAGCG 
ATCGCGGCCTGGAGAGGGGCGACTGTGTGGCCCTACTAATGGAAACGCGTCTGGAGTATCCCTGCATTTGGTTGGGTCTCTCCCAGCTCGGGGTA 
ATCACAGCCCTTATTAACTCCAATCTGCGCGGGGAGTCTCTGCTGCACTCCATCAAAGTGGCCAATGCCAAGGCCCTAATTGTGGGCAGTGAACT 
CCTGGATGTGCTGGTATCACTTAGGGAGAAGGAGCAGTTGGACGAGGTGCCCATCTATCAGTACACAGATGACGAGGTTCGCGGGGTGGCTGGAC 
ATGATCTTTTGCCAGGAGCTGTGGACTTGGTTACTGCTCTAAAGACCCAAAAGAAACTGGAGCTACCGAGTGCTGTCTGTCCTGGAGAAGCTCGT 
TCCAAATTGCTCTACGTTTACACCTCTGGAACTACGGGCTTGCCCAAAGCAGCTGTGATCACCAACCTGCGGTTCCTCTTCATGTCTGCAGGATC 
CTACTACATGCTCAAGATGTCCAGCGACGATGTGGTGTACGATCCCCTTCCCCTCTACCACACCGCTGGTGGAATTGTGGGAGTGGGCAATGCCA 
TCCTCAATGGATCCACGGTGGTGCTGCGCAAAAAGTTCTCGGCCAGGAACTTCTGGCTTGACTGCGATCGCCACAACTGCACTGTGGCCCAGTAT 
ATCGGGGAGCTGTGTCGCTATCTTCTGGCCACCTCCTACTCCCCCGATCAGCAGAAGCACAACCTGCGTCTGATGTACGGAAATGGACTGAGGCC 
ACAGATCTGGTCGCAGTTCGTCCGTCGCTTCGGTATTCCGCACATAGGAGAAATTTACGGCGCCACCGAGGGAAACTCCAACCTGATCAACATCA 
CCAACCGAGTGGGGGCCATCGGATTTGTTCCCGTCTACGGATCGAGCCTCTACCCTGTTCAAGTCCTGCGGTGCGACGAGTACACGGGTGAGCTG 
CTCAAGGACTCCAAGGGTCACTGCATCAGGTGTCAGCCTGGGCAAGCGGGTCTCCTGGTGGGCAAGGTGGACGCTCGGCGGGCCGTGAGTGCCTT 
CCATGGATACGCCGATAAGGGAGCCTCGGAGCAGAAGCTCCTTCGAAATGTCTTCACATCCGGAGACGTCTTCTTTAACTCGGGCGACATGGTGG 
TGCGAGACATACTCGGTTACTTTTACTTCAAGGATCGCACGGGAGACACCTTCCGCTGGCGGGGTGAAAACGTCGCCACCCAGGAAGTGGAGGCC 
ATTATCACGAACTGTGTTGGCTTGGAGGACTGCGTGGTCTATGGCGTGCAGATCCCCCATGTGGAGGGCAAGGCGGGAATGGCGGCGATTGTGGA 
TCCGGAGCGCAAGGTGGACATGGATTACCTGTCCGTGGTGCTGCGGGGCAGTTTGCCTCCGTACGCTCGCCCCCTGTTCATCCGACTGCTGGACG 
AGATTCCGCGCACCGCCACCTTCAAGCTGAAGAAACGGGAGTTGGCCAAAGAGGCGTATGACATCGGCCAGCTCTCAGATCCCATATACTACCTG 
AATCGGGACGGTATCTACAGACCACTCAGTCAGGAGCAGCACGAGTTGCTGCGATCGGGAAAGGCGGGACTCTGA 
(SEQ ID NO; 314) 

Start ATG: 180 

MEILLELNLKQKLATRLGTILLATVIFGAFCWDNALASLTLGAVVALLLRNPTFVFALVMTASRDLKAFQRFVALNIYLLRKDRGGFTVARCFQD 
QARRRPKKTCFVMDDRHLSFAEALEFSQKIAGYFSDRGLERGDCVALLMETRLEYPCIWLGLSQLGVITALINSNLRGESLLHSIKVANAKALIV 
GSELLDVLVSLREKEQLDEVPIYQYTDDEVRGVAGHDLLPGAVDLVTALKTQKKLELPSAVCPGEARSKLLYVYTSGTTGLPKAAVITNLRFLFM 
SAGSYYMLKMSSDDWYDPLPLYHTAGGIVGVGNAILNGSTVVLRKKFSARNFWLDCDRHNCTVAQYIGELCRYLLATSYSPDQQKHNLRLMYGN 
GLRPQIWSQFVRRFGIPHIGEIYGATEGNSNLINITNRVGAIGFVPVYGSSLYPVQVLRCDEYTGELLKDSKGHCIRCQPGQAGLLVGKVDARRA 
VSAFHGYADKGASEQKLLRWVFTSGDVFFNSGDMVVRDILGYFYFKDRTGDTFRWRGENVATQEVEAIITNCVGLEDCVVYGVQIPHVEGKAGMA 
AIVDPERKVDMDYLSVVLRGSLPPYARPLFIRLLDEIPRTATFKLKKRELAKEAYDIGQLSDPIYYLNRDGIYRPLSQEQHELLRSGKAGL* 
(SEQ ID NO: 315) 

Name: fatty acid transporter-like 
Classification : transporter 



Celera Sequence No. : 142000013384832 

GAAATTGCAGCGAAACCCGGGATTTACGGATGCCAAAGCTGACTAATTCGTAGAGGCGCCTTCAATTCGCACTGCTCAGCTCGTAATTCGTGAAA 
TTTGTAAACAACACGCGCACTTTTAACTAAATTCAACATCCCATTGATGATTACGGAGCAAGTATAGCGGCGAATGGGCGTGAAACTTGAGGACG 
CACGCAAACGGAATGGGGTATTACCTCATTCGTACACAGTCACTTTCACGGTTTTCGGCTTTATCTATCACTGCACTATCGCTATTTTGACTGTT 
TTTGCCGCACACTAAGCCATTTGAAGCGTATTTTACCCCGAAAATGCTTGGAAATTTCTCTCGCGCAAACGCTTCGTGTTGCCACTGAGCTGCAA 
TTGCACCCTGTTTGGCGATCTGGCAGCGCCGGCGAAAAATGGCCGTTTAGGTCATCTGTGCCAGAAGAGAGCAGCACTATTTGCCTTCGCCAAGG 
GCAAAAACAGAAACAACACACACTTATGCAAACACACACGCAGAGGGGGAAATAATAAGTTAAGTCGGCTGCTGCTTAGTTACTTTGCTTTACAG 
CTGGTTTTCGGCAGGTGCAGCCTCAGCAAACGTTCTTTTCCGCACTTCCGCTGTCCGATTGCGCACTTTCTAACTGTTTGCACGGCGCTTTTCCT 
CTCATGATTTTTCCACCAGCATTGTGTGGCCGTTCAACAAAACTCCAAAAATACCAAGGATGGGTGATTTTATACACCAGCAATCCGGTCCTCCT 
ATACCCTTCCCTTTATAACATATAGAACTATTAGGTTTTCTTAACAGCTGTAGAAAATCGATATCAATTCACTACTTTCCAGAACACGACAACTT 
TATTGTTCTAAAATGTAAGTGTAGGAAAATAATGAATATGTTATATTTAATATGCTGTATTCCGAGTCCGAGTGTGACCAGTTAATCATTTTTAG 
AAAAGCATGCTTTTGCTTGCCTTGAAATAAATTCAGAAAGCGAGGCGTCGGCCACACTGTAAATCGTTTCTGCTTGTCATTTTGGCCTGAGTTTT 
TTGCCGCAAATTGCAGGTAATTTAGCGAAAAACAGGAGTCGCAGGAAGCCAAACCCGTCGAAAACAAATCAGTCCGGTACCTATCAATCGGCGCA 
GAGGTCATAGTAAGCCCCCGAACCGAAAACCCAGAAAAAGCACAAACATGCCGTCGCAGGAGGTGAGCGTAAACAAAGTGATAGTGCATCCATTG 
GTTCTGCTGTCCGTGGTGGATCACTTCAACCGGATGGGAAAGATTGGCAACCAGAAGCGCGTAGTGGGAGTCCTTCTGGGCTGCTGGCGATCCAA 
GGGAGTGCTCGATGTGTCCAACAGCTTCGCAGGTTAGTACTGCCTTTGAATATCCAAAGTCGTGGCCATAAAACAAAAGAATTCCCTCCCGACTT 
CCAGTGCCCTTCGACGAGGATGACAAGGACAAGTCGGTGTGGTTCCTGGACCACGACTACCTGGAGAATATGTACGGCATGTTCAAAAAGGTGAA 
CGCCAGGGAACGGGTTGTGGGATGGTATCACACAGGTCCCAAGCTCCACCAAAACGACATAGCCATCAACGAGCTGGTCCGGCGCTATTGTCCCA 
ACTCCGTGCTGGTCATCATCGACGCCAAGCCCAAGGATTTGGGCCTGCCCACAGAGGCGTACATATCGGTGGAGGAAGTCCATGACGACGGCTCT 
CCGACCAGCAAAACTTTCGAGCATGTGCCCAGCGAGATTGGCGCCGAGGAGGCGGAGGAGGTGGGCGTGGAGCATCTGCTGCGTGACATCAAGGA 
CACAACCGTGGGCAGCCTGTCGCAAAAGATCACCAACCAGCTCATGGGTCTGAAAGGCCTGAATGCCCAGCTGCGGGACATAAAGCAGTATCTGC 
AGCGCGTCGGCGACAGCAAGATGCCAATCAACCACCAGATTGTCTACCAGCTGCAGGACATCTTCAACCTGCTGCCCGATATAACCAATGACCAG 
TTCACGGGCACAATGTATGTGAAGACCAACGACCAAATGCTAGTCGTCTACCTGGCCTCCATGGTGCGCTCGATCATTGCCCTGCACAACCTGAT 
CAACAACAAGCTGGCCAACCGCGACGCCGAGGAGGGCAAGAGCGACAGCAAGGAGGCCAAGGAGAAGAATAAGGATAGCAAGGATAAGGACAACA 
AGGAGACCAAGGACAAGGACGGCAAGAAGGCGGAGGAGAAGGCCGACAAGGGCAAGGACGAAGGCGGCAAGGGTTCGCGCAAATAGGAGCGGAAG 
AAAAAGCGCAAACCCCGTCGCAAACGAAAGAAGCAGCGGGCGCAGGACTAGCTGCTGCTTCTTCTGTTCTCCAGGAAGTGGAACTCATCTGCAGA 
CCATACGGCGAGGGTGTGCAGTGGCCTCAGAAATCAAGCATTGCTACCATTCATCGTTCATTTTCCACACGAAATTAAGCTCCCTCTTATTATTT 
CGCGTAACTAACTAATGGCAAGTTTATGTAATTAAAAATTAAATAATTTAAATACTCCTATTGTTTGCTTTCCTCTAATATTGCTAGAAGCAGCA 
TACCTAGTATTAAAACCGCTCAGGCTTGTGTTTTTGTATTTCGTAAAACCGAAATTATATTTTAAAACAAACTAAACTCGCTGGAGAGGCAACGT 
AAATGGCTGACTTACGATTGCCTTTCCCCGAGCACAAAAACGGTTGAGTAATGCATTGTACAGGGCGTTCTCTATCTTAGAGCAGTTATTACTTA 
GAAAGCATTTTACCAAATTTCACTTGACTTACAGCTGAGCTAGTTAACGCTTACGAACTAAGGCTAAGGGTTCCATTGCTAACACCATCGATGAC 
CCAACAACTAGAATACTTAGGTTCACTATGTGCTTGCAAAGCAATCAGAAACTAGATCAAACATTGAGTTCCTGGCACCTCCTTTCCGAATTGTA 
GCACTATGCACGGGAGTCCTGCAAAATGGCTGATTTGCAGGACTCCGGTGTAGACTGGAGGCTCACTTAGACCAGCCACTCGCTGGCCAGCGGGA 
ACTTAAGCGGCTGGCAGCTGCTGAGTCGCAGCGAGCAGTTATTGTCCTCGGCAGCAAGTGGGATCCTGGTGGTCGCCATGGGATTGCTCTGCACC 
GAGCAGAGGCTTAAACTATGGCTGTGCGACTGCCTAAGACTAGAGCTAATGATATCGCTGCTGATCCGATGCTGATGCTGATGGTGATGATGATG 
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ATGATGCTGTTGCTGTTGCTGTTGGTGGTTCCGGCTCCGATACTGATCTTCCGACGGGCAGGAATGGTTGCAGTTGAGATTAGTTGAGTAGCAGC 
TCCTTCCCTCGGCCCCTGAGGTTGTTGGTGACAAGCTGTTGCTGTTGCTGCTGCAGTTGAGGTTGTAGATCTTGATGCTCGTGCAGCTGGTGCGG 
ATGTTGACGTGAACCTGGGCGTGGTCATGTCACACCGAGGTGCCCTGTTGGCCGGGCACCACCGATCCGCAGTTGTCCT 
(SEQ ID NO: 316) 

Exon: 1001.. 1362 
Exon: 1430.. 2499 
Start ATG: 1188 

Transcript No. : CT11503 

GCCACACTGTAAATCGTTTCTGCTTGTCATTTTGGCCTGAGTTTTTTGCCGCAAATTGCAGGTAATTTAGCGAAAAACAGGAGTCGCAGGAAGCC 
AAACCCGTCGAAAACAAATCAGTCCGGTACCTATCAATCGGCGCAGAGGTCATAGTAAGCCCCCGAACCGAAAACCCAGAAAAAGCACAAACATG 
CCGTCGCAGGAGGTGAGCGTAAACAAAGTGATAGTGCATCCATTGGTTCTGCTGTCCGTGGTGGATCACTTCAACCGGATGGGAAAGATTGGCAA 
CCAGAAGCGCGTAGTGGGAGTCCTTCTGGGCTGCTGGCGATCCAAGGGAGTGCTCGATGTGTCCAACAGCTTCGCAGTGCCCTTCGACGAGGATG 
ACAAGGACAAGTCGGTGTGGTTCCTGGACCACGACTACCTGGAGAATATGTACGGCATGTTCAAAAAGGTGAACGCCAGGGAACGGGTTGTGGGA 
TGGTATCACACAGGTCCCAAGCTCCACCAAAACGACATAGCCATCAACGAGCTGGTCCGGCGCTATTGTCCCAACTCCGTGCTGGTCATCATCGA 
CGCCAAGCCCAAGGATTTGGGCCTGCCCACAGAGGCGTACATATCGGTGGAGGAAGTCCATGACGACGGCTCTCCGACCAGCAAAACTTTCGAGC 
ATGTGCCCAGCGAGATTGGCGCCGAGGAGGCGGAGGAGGTGGGCGTGGAGCATCTGCTGCGTGACATCAAGGACACAACCGTGGGCAGCCTGTCG 
CAAAAGATCACCAACCAGCTCATGGGTCTGAAAGGCCTGAATGCCCAGCTGCGGGACATAAAGCAGTATCTGCAGCGCGTCGGCGACAGCAAGAT 
GCC AAT C AAC C AC C AG AT TGT C T AC C AGCT GC AGG AC AT C T T C AACCT GC T GC C C G A T AT AAC C AAT G AC C AGT T C AC G GGC AC AAT GT AT GT G A 
AGACCAACGACCAAATGCTAGTCGTCTACCTGGCCTCCATGGTGCGCTCGATCATTGCCCTGCACAACCTGATCAACAACAAGCTGGCCAACCGC 
GACGCCGAGGAGGGCAAGAGCGACAGCAAGGAGGCCAAGGAGAAGAATAAGGATAGCAAGGATAAGGACAACAAGGAGACCAAGGACAAGGACGG 
CAAGAAGGCGGAGGAGAAGGCCGACAAGGGCAAGGACGAAGGCGGCAAGGGTTCGCGCAAATAGGAGCGGAAGAAAAAGCGCAAACCCCGTCGCA 
AACGAAAGAAGCAGCGGGCGCAGGACTAGCTGCTGCTTCTTCTGTTCTCCAGGAAGTGGAACTCATCTGCAGACCATACGGCGAGGGTGTGCAGT 
GGCCTCAGAAATCAAGCATTGCTACCATTCATCGTTCATTTTCCACACGAAATTAAGCTCCCTCTTATTATTTCGCGTAACTAACTAATGGCAAG 

TTTATGT 

(SEQ ID NO: 317) 
Start ATG: 188 

MPSQEVSVNKVIVHPLVLLSVVDHFNRMGKIGNQKRVVGVLLGCWRSKGVLDVSNSFAVPFDEDDKDKSVWFLDHDYLENMYGMFKKVNARERVV 
GWYHTGPKLHQNDIAINELVRRYCPNSVLVIIDAKPKDLGLPTEAYISVEEVHDDGSPTSKTFEHVPSEIGAEEAEEVGVEHLLRDIKDTTVGSL 
SQKITNQLMGLKGLNAQLRDIKQYLQRVGDSKMPINHQIVYQLQDIFNLLPDITNDQFTGTMYVKTNDQMLVVYLASMVRSIIALHNLINNKLAN 
RDAEEGKSDSKEAKEKNKDSKDKDNKETKDKDGKKAEEKADKGKDEGGKGSRK* 
(SEQ ID NO: 318) 

Name: 26S proteasome regulatory subunit S12 
Classification: endopeptidase 
Gene Symbol: Mov34 
FlyBase ID: FBgnO002787 



Celera Sequence No. : 142000013384496 

AAGCCGTTCCGGAATTTGAGGGCCGTGACCGATGGAAACTCCTTGACGCCCGGCAACTTCATGGCTACATCCAAAGTTGAACATCGTTGCTTGGT 
ACGTGGATCCCAGGCTTCTACGGTGCCTTCTTTGGTACCGGCTACCAGAAGATGGTGCTCTGGATTCACTTCACAGGCGTTCAGGCAGCTGCCAT 
CCGTTTCAAAGGGTTGCAGGAACTGACCGCGCTCCAAATTGAGTCGGAAAATGTCCTGAAATCAAATCTTTAGAATAAACTAGACATTAGATAGA 
CCTCAGTTCTTACCCTGCCCACTCCCACGATGAACATGTCGCAGCTGGGCTTGTGGTACTTCATGTCGCGTCCAAAGCGCGGTATCCTTAGTCTG 
TAGTGCCGCCCGTGGGCCGCATGGATCTCCACATAGCGATCGCATTGCAGGAACACCATCTTGCTGTAGTCGTCGCTAATCACCTCAAAGGTGGT 
CACCTCGGCGTCGAAGCAGCGCTCGAATTTGATAGACAGGTTGGACACCTCGAAGCACTTGACGCGCGGCTTGTAAGTGCCGGTGGCCAGGATGT 
ACTGTTGGTCGGGACTCATGCGGATGCTGGTGCAGACGCCGGGCATATCGAAGTCCTGGATGAGCTCGATCTGTCTCCGGGAGTCCACTTTTTTC 
ATGAGCTGTGACCGCTTTCGCCGATCCGTCAGCCACTGTGAAGGAGATACTCATGAGTTAACTGCTTGGCAACGGGTTTATCTTGCTACTCACAT 
CTGGCACCGACTTGCCGGCGCTCAGGTTGTAGATCTTCACATCGTTAACTTCGTTTACGAACATCTTGAGGGACGATTTTTGCTCAAAATTAAAA 
CATAAACAAACGAAGTCCGCAACACGTGCGCCCTCTATCGATAGACACGATTGTCGATAGCATCTGGCTGCGATAAGCACCTATCGCGGCAGTGT 
GACCCTTGGCTTGCCAATTGCTCAGTGTGACCGCAGGAAACCAAGCAAAACAAGAGAACGCAAAAGCAGTGAAAAAAGACTACTGATTTGCATCG 
ACCTACCAATTGAAAATAATCGCAATAATTGCCTCTCGGTGCAGCCCCGCAACCCCGCGAAGCCAGTGTGCGTGCTAGCACGAAAGTGAACTGCA 
TTTTGCATCGCCAAATTGCCCAAAAATTGCAATCGAATTCCGCACAGCCAAACAACAATACGCACACATACACACAGTACCCCCCGCTCTGCGTA 
ACCGCCCTACGTTACACATTCCGTTCGCTCCCTAAGGTGGACTAAAACTAAAGTGATCTAAAACGGTGAGTAATATTCGTTGTTTTTCCTTAAGC 
CAATTAAACACTGATTAAATTATGCGAAATGGAAAACCGTCTGCGAAGGGGTGACGCAGTGCAGCTGTGGGTGCTCCGCAACTTTCCCTTGTGCT 
CTCGCTCTCTCTTGTTTTTGTCGTTAGGCAGCAGCACTCCATTGTTGTTGTATACACCCCCTGATTGCTTTGATACCCTGCAATGGGGAGTTACT 
GTTTTGTGACCAAGTTAAGAAAATAAATATGATTTACAAAAATCGATTAATTCTGGGGATGTTTCAGGTTTTTATGGTGTATAATTTATTTTTTG 
AATTTTTAGGGGATGGTGGGGCAAAAACGGCCGCCACTCTACTTAAAAATCGGTAGGTGAGGAGTTTGTTTAGTATACGCTATAAATAAATACTC 
TAAATTAGTTTTTATAAGATGGATTATTATATGAGTGTATGAGTTATCATAGAAATATTTACCTGTGCTTATATATTAATTTATTATATGAATTA 
TGCAAACAAGGTTTTTTCAAAATGTAGGCGAGCATGAATAAAAAATTATAATAAATCAGAAGTTTTCTTAGTAGTTGTTATCAGTTATTAATATA 
TTTCTTTTTCATTTTGTTGTGTCTTATTGCGTTGTTGTCTTCAATGCGGGCTTATTTTATTAGTTTGAATGGATGTGCGCTTATTTGTAGATACT 
CAAATATCAAATCGTATTCAGTTATATCTTAGAAATGGAATATAAAGCGCACATTTATAAAAGTATATTTATATTTAAGATTTTTTGCCAAAGGG 
TATACATACTTCGTCTGTGTATAAATATATATTTCTCATACTCCTTTCGTTATTTTCCCTGTTATGGGTGCGCCGCTGTGTTCCTTTTATGAGGG 
GCAATGATGATCCACTTCCATTCAACACATTTTGCAAATAAGTACATATGTACTTCGCCTCCATTGATTGCTCCACTGATGTTTGGCCTTTGGCG 
CTTGACTGATAAGACTCGAGACCGAGACCTACTACTTAGTTAGTCTAAACTGCCCGTCATAGAGCCTATTAAACTGATATGCACTTGTCACTTTG 
ACAAGCAATGATAAGCGATTATAACTGTATTTGATGATTTAAGTATGAGCAATTAAGAATCCATAATTGATACTCATCTTTTCACACTATATGCT 
GCTAAGTCACATTCTTTTGTATGAAATTCATTCATATGAAGTAAGGCAGGATGATTACCCTAAATACAAACTAACTTGATGAATAACTTCGCGCG 
TACTGCTCTTAATAATTGAATTCCCACGATTGCTTGCTCGGCGACCTTGATCCAAATCGCTTTCAGCTTGGGCGAAGGGTTCAATTCACTCTCTT 
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CAGCGCCCCTTAACTGCGGACAAATCAAAGGCTTTCGAACGACACGGATACTGCGAATGGACAAAGTCAAGTGGCATGAAGTGGATATACCCGCC 
GCTCTTGAAGGTTGCCTCTTTCTTATTTTTTTTTTTTTTTGGCTTCAGTTGGCAGGCAAACAAACCCGGTCGGCTGTTCAAACACAAGCAGGCCT 
TAAAAAGAAATATCGGGGCCAGTTCGAGAAAGGGGAACAAAATGTGGTGTCGACGCAGAACAAATTAATGCAGCGCCTCGGTTTGTTGATAAGCA 
GCGACTACGGTTGGACATTCAAATGAGGAGCTGATAGTAGTAGTTAACACTGTTTGATTGGGTCAGAGTTCATTTCTGCCGGTGTCAAGATGAGC 
AAAGTGCGCAGCGTGGTGAACAAACTGCACGAACGCATAGACTACTTTATTGAAAAATGTGTGAAATATTCTATATACTCAATGACTCTGCAATG 
TAATAATAATAATAATTTCGCTTGCTTTTTAGGTCCCAACATGACTGCAGCGCCATACAACTACAACTATATCTTTAAATACATCATCATTGGTG 
ACATGGGCGTGGGCAAGTCCTGCCTGCTCCACCAGTTCACCGAGAAGAAATGTGAGTAGACTGTGCTGAGTAATAATGTATGTAATAAGAACTAA 
TCTCTACTTGCCGTCCTGCAGTCATGGCCAATTGTCCTCACACCATTGGCGTGGAGTTCGGCACACGCATCATTGAGGTGGACGACAAAAAGATC 
AAGCTACAGATCTGGGACACAGCGGGTCAGGAGCGATTCAGGGCAGTGACACGCTCCTATTACCGTGGAGCAGCTGGTGCGCTGATGGTCTACGA 
TATTACCAGGCGGTAAGCGAAGCAAGCTACTATTTGAGAAATTGCTTTACAACTTGACCATTCCAGCTCCACGTACAATCACCTGAGCAGCTGGC 
TTACCGACACTCGCAATCTCACCAATCCCAGCACTGTGATCTTTCTCATTGGCAACAAATCGGATCTGGAGAGCACTCGGGAGGTTACCTACGAG 
GAGGCCAAGGAGTTTGCCGACGAGAACGGCCTAATGTTTCTCGAAGCGAGCGCTATGACGTGAGTTTCAACTGGAAGCATAGTGTTTGAGCATAT 
ACTGATTATCTTTTTTTTTATCGCCCAAACAGTGGCCAGAATGTGGAGGAGGCTTTTCTGGAGACCGCACGCAAGATTTACCAGAACATCCAGGA 
GGGTCGGCTCGATCTGAACGCCTCCGAGTCCGGAGTTCAGCACAGGCCATCGCAGCCGTCGCGAACTTCGCTGAGTAGCGAGGCTACGGGCGCCA 
AGGATCAGTGCTCGTGCTAAATGAATGATGGAACCTAACTTTACGCAAAACCTAGTTATACAAATCAACTTTTATTTTTAGCCGACTCGTAATCT 
ACCGATGCCCAAGCAAAAACCAATGTATGTAGTTAAATAATTCGATTAAGCATCAATGTGTGAGCCAACMCAGCGACAGCMCCACAAAAAAAA 
AAAAAAAC AGAAAAAGAAAAT ATAAAGTCCTT TCGC TTAATAAGACGCAAGCAT TGCAACATTTAT T GT ATCGGTTAAGCAT AAAT ACT AGAAAA 
TTAAACATGTTTGTTGGCGCAAACTAAAGTTTTGACCATGAAATTTCGAAAAGTTTATCGCACTAACACCATCCTTCGGTCTCAATTTATAGATG 
AATTCGTTTGAGATATAGATCATGTTTTTGTTGGCTGATGAATCCCTGTATATAATATATATATGTATGAATTTTTGTATCTCCCTAATTTTTTG 
TTGAGTATTTGTTGTTGTCGCGTGCAAGAATTGGAGGAAACGATCAATGTTATATGAATATTTACGATGTATTAATGGCATAATGCAATAGGAAA 
TAAATAAAATGGTATTTATAAAACCCCTACCAATAGAAATGAATTTCTCTTTCAAATAATTTAAATTTTAATTTGCATATTTAACCATTCGTTTA 
TATTTAATTCTACACCAAATCGCTGGATAATTTTATAATTTACCGGATTTCCATTTAATAGATTTAAATTTTATAAATATTTGGGTACTTACGAT 
TTACTTAAATATTTTGTAAAAAGCTTTTTTAATCCATTTAGTGGAATACCCTCAACTTTGACTTGCGCATAACTTCATTCTTGTTTTATTTAGTG 
ATGCTAACGTTGGGTTGGAAATCAGCTGATTATGACTATGAACAATAAAGTTTGTAAGTCAAACAACGATTTTTCGTATACTGTTCTTTTGTTTG 
TTGCTGGTTGTCTAATTAATCATTTTAACTGGTTATCAGTCCAATTTTTGTTTTCTACACTACACTATTCTACACTATTTTCGATTTCCTTGAAA 
TTTAGCGGGAAAAACTCTGAGCTAGCATCTTCGGTGGCATCGCTGTGTGCGCGCTTTTGTGCGAATGCCATGGTATGCAAATTAAATTTAATTTA 
TTCCGATCATCATGAAAGTGACAAAGTAGCTGCTTAAGTAGCCAGGGAAGCGCACAACGTAATCTAATCTCGATCCATGATGCAAATCAAGCGCC 
TCCTGTGCAAATCCTGCGGCCTGGGCACCCTTCTGGTGGCGGTCGTCTGGCTGCTGGCGCTGCTTTTCTACTCGCACAGCCTGCGCAGCTCCATC 
CGCTCGGCAGGATGGAGGATCGACGAAGGGAACGCAACGCCACGGGCCGAGCTGAGCTACCAGGCCAGGGTCACCGTCGGATGCACTCCAAATGC 
CAGCATTACCACTGGCGAATCGCCGGCGGCACCCAAGCCGCCCAGTGATCCGGAGCAATTGGAACTGTTGGGTGTGGTGCGCAACAAGCAGGACA 
AGTACATACGCGACATTGGCTACAAGCATCACGCCTTCAACGCGCTGGTGTCCAATAACATAGGACTCTTCCGGGCGATCCCAGACACTCGGC 
(SEQ ID NO: 319) 

Exon: 1001.. 1300 
Exon: 3168.. 3281 
Exon: 3347.. 3527 
Exon: 3582.. 3764 
Exon: 3833.. 4603 
Start ATG: 3176 

Transcript No. : CT11521 

caagagaacgcaaaagcagtgaaaaaagactactgatttgcatcgacctaccaattgaaaataatcgcaataattgcctctcggtgcagccccgc 

aaccccgcgaagccagtgtgcgtgctagcacgaaagtgaactgcattttgcatcgccaaattgcccaaaaattgcaatcgaattccgcacagcca 

aacaacaatacgcacacatacacacagtaccccccgctctgcgtaaccgccctacgttacacattccgttcgctccctaaggtggactaaaacta 

aagtgatctaaaacggtcccaacatgactgcagcgccatacaactacaactatatctttaaatacatcatcattggtgacatgggcgtgggcaag 

tcctgcctgctccaccagttcaccgagaagaaattcatggccaattgtcctcacaccattggcgtggagttcggcacacgcatcattgaggtgga 

cgacaaaaagatcaagctacagatctgggacacagcgggtcaggagcgattcagggcagtgacacgctcctattaccgtggagcagctggtgcgc 

tgatggtctacgatattaccaggcgctccacgtacaatcacctgagcagctggcttaccgacactcgcaatctcaccaatcccagcactgtgatc 

tttctcattggcaacaaatcggatctggagagcactcgggaggttacctacgaggaggccaaggagtttgccgacgagaacggcctaatgtttct 

cgaagcgagcgctatgactggccagaatgtggaggaggcttttctggagaccgcacgcaagatttaccagaacatccaggagggtcggctcgatc 

tgaacgcctccgagtccggagttcagcacaggccatcgcagccgtcgcgaacttcgctgagtagcgaggctacgggcgccaaggatcagtgctcg 

tgctaaatgaatgatggaacctaactttacgcaaaacctagttatacaaatcaacttttatttttagccgactcgtaatctaccgatgcccaagc 

aaaaaccaatgtatgtagttaaataattcgattaagcatcaatgtgtgagccaacaacagcgacagcaaccacaaaaa^ 

agaaaatataaagtcctttcgcttaataagacgcaagcattgcaacatttattgtatcggttaagcataaatactagaaaattaaacatgtttgt 

tggcgcaaactaaagttttgaccatgaaatttcgaaaagtttatcgcactaacaccatccttcggtctcaatttatagatgaattcgtttgagat 

atagatcatgtttttgttggctgatgaatccctgtatataatatatatatgtatgaatttttgtatctccctaattttttgttgagtatttgttg 

ttgtcgcgtgcaagaattggaggaaacgatcaatgttatatgaatatttacgatgtattaatggcataatgcaataggaaataaataaaatggta 

tttataaaacccctaccaatagaaatgaa 

(SEQ ID NO; 320) 

Start ATG: 309 

MTAAPYNYNYIFKYIIIGDMGVGKSCLLHQFTEKKFMANCPHTIGVEFGTRIIEVDDKKIKLQIWDTAGQERFRAVTRSYYRGAAGALMVYDITR 
RSTYNHLSSWLTDTRNLTNPSTVIFLIGNKSDLESTREVTYEEAKEFADENGLMFLEASAMTGQNVEEAFLETARKXYQNIQEGRLDLWASESGV 
QHRPSQPSRTSLSSEATGAKDQCSC* 
(SEQ ID NO: 321) 

Classification: enzyme 
Gene Symbol: Rabl4 
FlyBase ID: FBgnO015791 
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Celera Sequence No. : 142000013383801 

TAGTCCTACCGTATTGGTTGAATCCGGAGAGATTGTAGGCCCATCTGCCCAGATTTGCTGTGGGGAAAAATTCGCATTACATATTTACGTAATCA 
TAATATTCCGGCGTACTGCACTACGAAATTTGTTGCATGCAACGTGAGTCCGATTGTTGCCCGTTCGTATTACTTACAGAGAACTGCGGGTCCCT 
TTCTGGCAATATAGTTCGACATGGTTTACGTAAAATCAGAAGTCGGGAAAGAGATATTCCTCCAGCCTTTTTATTTTAGGTATGAGAACGTTGGA 
GATGGGTCTCTCGATAGCTATCGCACCCCGTTATCGATCAACGAGCACTACGGTTGCGTTTGTACAAATCGATTTCCGAAATGTAATACAGACAC 
ACTGCACCTATCGATATATCGCACTCAAGAAAATTCGTTATTCTTTAGTTTTAGAGCTATTAAATATATTATCTTTTAACATTTTACCCTTTTGT 
ATTTGTATTTAGATTGTTCAAAAACTTGGTGGAGACGGCAACCGAACTACAGTATAATTACATCTCCAAAGTTGGTCATTTTGCAATAACTTTTT 
GT AT AAAGTAAAT AAAATC AAT T C AAAT T CTT GT AAACCC ATTTAAGCCC AT GTAAAAATCCGCCTT C ATAAAC AAAATACT TGTAGTTTATCAA 
ACCGAATGATTGTGGCTTTCTCCTTTTGTAATTTGTTATGGGAAACACTCAAAAAAGGTTTTTAATTTGGGTTAAGTTATAATTTATGTTAACCA 
ATAACCTCGATTAAATTAATTATAAAAATGATGCCTGAAGATATACAACTTTCACTTAACTTGCAACACGGGTATTGTATATTAAACAGCAAACA 
AAAAAGGCAAAAATGTTGGTAATTAGCGGAGTGCTTCGACAAGCGGAAATCCCTACTTCTCATTTAAATAATATTTCCCCAATTTTTTGCGATAA 
GCCCCCGCCAACGGTAAACCCTTCATTAAAATGATTGATAGCGTTTTTATTTTAAAGTACATACCTTCGCTTTATTATGACTACAAAATATCAAT 
TTATACAAATTAAAGAACATATACAAATTCTTTATATATACAGTGCCCTCCCTTAAGAGTGAACACTCGAATATGTGAACTTTTTAGTAAAAGTG 
AACGATTAATTTATGAAATTTTAGAAACTTTCGCTTAACTTAGTAAAAAAATTAAATTTTTTCAAACGTGGTGTTTTTGAAAAATATATTAAATT 
GTTTATTTTTTCTGTGATTCGTTATAGTGAACAAATTCGATTCAGTGCACACCCTTGGTTCTCAATAGTCCACTGTTTCGAGTGAGCACTGTACA 
TACACTAAGGCTGGCACGAACACGACACTAATTTAAGTTTGTTTCGGTTTTTGTTTCTGATTTCTGAACACACGTTATATATATACATATGTATA 
ACATTGAGACCCGTTCCAGCCTTAGTTTATACATATATCACTGGGCACATAATGTAGTAACTCCCAATGCGCTGAGTGGCCGGGAGAGAGAAATC 
ACTTATTGTTCTTAGCTGCAGTCAATGACAAAAACTCCACCTTTCGAGCCAAGTACTCATTATAACGAAGAAATGCATAGCTAAATGGGGCGAAT 
AAAAAAT T GGT CATTAGTATCGATTGGTTGTAAGGGTCGTAATTGGGGGAAAAAAGGAATT GCAGGGT GCACAATAATAT AAT AAATCC ATT ATA 
ATGGCCGTCTAATGGTTGTTGATGGATCTTATCAAACAAAACTGATATATATACTGATGATATATATCTATATATCTAAAGGATTTATCATCTAA 
TTGCATCGTGCATTCCCAACTCAAAGTTACAGTCCATATACTATGCAACACGCACCCTACGTAGTCGAAATCGATAGTGGAATGCTCGGCTTGCA 
ACAGGGACCACACCGTCCAAAAGATATGCGATGCCAGCGCAAACTGATTGACCTGGACATAGAGCAGTTCGACTTCGTCGTTTTGAATGTTGCTG 
CGCTGTAGGTACTCCTCAAGGTAGACCCTCAGCCACTGGAGCTGGAACTCGCGCTTCGGATAGCGCGAATAGTCGACCTCATCCACGCCGCACAT 
CTCCGCAAAGTGGTTGCCAATGTCGAAGGCCTGGAAATTGTAATCGGCATACTCGTAGTCGATGAAGTTCACCGTATTCAGGCTCTGCGTATAGA 
TCACATTTCCCAGCAGAAGATCGTTGTGGGAGAAGACAATCGGACTGTCCAGAGCCTCGAGGTATTCATATAGCTTGTTGAACTCCTCGCGCAGG 
CGGCCGATAGGTAGAAACGTTTCTTTCACTCTGCAAATTGCAAATCAAACAATGAGCGAATGAACAAAGGACCGAAGATGCAAAGGATGTCTGGT 
GTCGTTACCTTTTGTGTTTTTCAGCATCACTAAAACGTTCAGGTACTAAATCAAGAAAGCTCTGCGTCTTCTTCCAAATCATCGGCATGGGTTTG 
GTCGCCGAACTATCCCCGTGCTTTCTCACCTTGCGATGCATCTCGGCCATGCGACGGGCGACCAAGGGCCAAATTTCTGGACAGAGCACACTGTC 
CGTATTCAGGGTGGTTCCAGGTACGTATTCGTAGACGAGACCGTTCTTGAACGTGGCATACAGCGATGGCGCTAGCCCATACGTATGCAGTAGAA 
GAAAGTTTTGCGTCTCCGCCTTGCGATCTATCAGTAGGTCCGTTTTGTTACCGTATATCCTGACGAGCACTACGTTATCGGAGTACTGTACAGGT 
GACCCGTCGTCCGCTGCCCGATCGTCTGTGAACTCATCATCGTCCTCCTTCTCAATGATGACTGGATCCTCGGACTGAACGGGCGACAAACCCTG 
CGTCTTGATAGGTAAATACGATCCGCCGTTCTCATCGTTCAGTTTGGAGATCTCCTTATGAAAACATCCGACCAGTTTGTTTGTGATGCCATCGG 
TAAAACTCTAAAATAAGCACACAGAGAACAGCACCATTAAATCGTATACATAAAAACATAATACTCATCGATGTAGATCCAGTGAATGGGTTAGG 
ATGGATTAATTAGTAGAGTAGCTACATGATTTGTGTGCTTTGTTTATTGTAAGTGAAGTTATTGCATCATCACCTAATACCAATTGCTATACGGT 
AAGGTTCATTAGTAGCATTCAATGGAACGGTTTCGGGAATTCGCCTTTTAAGCCACAGCTCAAGAGCTACAGTAAAGGGCTTATGGCCACTTCCT 
TATCGACTCCACATGCTTCTTATGTACATATGTATGTACATATATAAAAGAGTGAAGCACATGGGTGATAAACTTTGTGACACCAGCTATTTGCG 
TGCTAAGCGTCTCAGAAAGACCACCTTCGTCTTGAATTATATACGCATACGGTGCTTTTTATAATGAAAACACAGAAACAGATAAGATAATGAAT 
T GC T AAAC AAT T T AAAT TC AT AT AATTC AAAGCG AAAAAG C CC AGAATG T GTGAAAGGT GAA TGAAAT AAAAGT GC GAAAAT AGT AAT T AGT T C A 
TTTCAATCACTTCGATCTGCCCTGGTTTTATTCATTCACTGAGAAAAGTTTATTTATTGCATTTAATCAATATTTCCCAGTTATTTTCCCTTTGC 
TCAGTGTTCAATGCACTTAAGATTAAAGTAGTGCTGTACAAACACGTGTGTTTCAGATCGCTGGCGATATAGTAAATTTTCCACTTCATATATAG 
AACGACGTCTCTAGCGGCTATAAAAATGTCAATAGTTAACCCATTACTCGCATGACAGTTGTGAGGCATAAATAACCGTGAAACTTGAAGTAAAA 
CTTAAACATTTGGTTTATGGTTGAGAAACGCTGCCTATGACGCCATGGTCCATGAATCAACTCAGATTTGTGCAAGTATTCATAGTATTCATAGT 
AGTCATATTGTAGCATTTATTATCACAATTTCGCCTGACTGGCGCCAAAGATTTGTGACTCCCCATAAGGATGCCTCCCCTCCACTTGAATCCTG 
AATCACGCGACAAATTATCTATAGCTTAGAAAACGGACATGACACGATTTAGACTGTGTACCATGTACCCATGTACTATGTACACCACTCGCTGA 
T C AAGGG AT G AACC AC GT T T T AT TG C C C AC C AGC G AT T TGG C AAGAC TCAGCACTT GGC AAT T C C G AC TGG AGC T GCC AT AAAGC T G AT AAGT G A 
GTTACGGACAAGCTGTGTACACCAATTAGACGTCATGCCTATGGTGCGGCATATACAGATAGCGAGGTCAAGGCGTACAAGTGGCTTCCGATAAG 
GCAAAAACGCCCCATTTCAAAAAAACCGGAGTGATTAAAGGCAGCCTCAGGCATTTGTACATACATACATCAATCATACTCCTCTAACTCGACAG 
CGTGAAATTATCAAATCGGGCCAAACGAAATCAAGCCCCTAAAAAGCGAGGGAAAAAACGTTTTTTTTTCGGATCATCCGTAAATTACGGACGGA 
CGGACGGATGGACGGTGTGTCGGTAATGGTTATCTGTAAAAACACTCGAGTCAATGCACCTGCTGTCAACACCTTCTATGTCAGGCATTTAAAGT 
ATTGTGGTAATTAGAGAACGTGCGATTTCATTAGAACTAAGGCGACTAATTGGCACGAAGGCCTATTTCTTGGCTCACACCCGTTTCCAACCCGC 
TCCGTTTGGTAGGGTGTGTGTGTGTGTGGGGGGGGGGGGGCACTTATTATTATTTTCCCCCCCGGAGTCGAGGGCTAGATTAGCAGCAAACTGGT 
TTCTACTGCGACGCGGGGTCTGAAGTCTCCATAGTTCGCTTGGCGACTCGGAAACTCAGAAACGCTGTCCTAATACACCAACACCAGCGTAGTGA 
ATAATGTTAATTTCGAATAACAATCCTATGCGTCTCTCTCTTCATTGATATAACAAAGTCTATTTACAATTATCATGGCGGCGCAGCAGCGAATA 
TATATATATAATTACTATTTGAAGGTTGGCTGTTGTGCAATAACCCTGTTTCAAACAATGTTAGCAACTACTACGACTCTATATCTCTATATATA 
TGTATATGAATGTATATAGTAAAGTATATATCCCACGTTCCGCACACCAGTTAGTGATCGTAATAAATGATAAGGAAACATACAGCGTCATCATC 
ACCATCGCATTTTGGGCCGGATAACGATCTTCGATCTGCGGAACTACCCTGATCTGATGGAGAATGTAGAGAATATGACATTGATGGTTTCGTTT 
TTTTGCATTGTATAGTGTGTTGTTTGGCTGGTGATTTAATTTGATTTGCTTTGATTTGATTGTTGGTATAATGATATTGTAGGGGGGTTATAATT 
TTGAATGTTTTGATGAGGTTTACGTTACCTTAAACTCGACGTGGCTGAGGTCCCAGGTCGGTCGTATAACCTTCAACAGTTCCTTGGCCCCTTGA 
ATTACGTCGGCCTCCTCAACAAATATGGGTACAAATGGTACAATTGCCTCTTTTCTGGATTTATCCTCCGGTTTGGCTCGAATATCGCGCGAATT 
TTGTTCGTTTTCGTTTTTGTTTTCGCTCCCGCTTGGATTTGGATACGAATTGGAATTCGAATGTGAATTCAATTGGTTCTGGACCTGGTTCGATT 
GCGATAGCGATAGCGTTTGCTGATTGACCGTCTGACGAACTAAGGATAAAGAATCTTTCATCACTTTGGGGTTGCCGCCACTTGTTGAAATCTGT 
CCTGTGTAACTGTTGCTCTTGGTTTCTGTGCCCATTCTATTGTATTATATATGTGAAATACGAAAGTGTCAGGTCGGGTCATACATATAGAACGA 
CAATAACATCAGCATGTAAAATGAGTAGGTAGGTAGGCGGTAGTTAAAAGCTAATCATCCGAATGCAAAACCAACCGAGAACCCAACCAAACCAG 
CAAAGATATCCTCTCACCCCCCCACGCAACGAACCGCAGCCAAATGAAAAATTAAAAAACCCAGTAAGTATACTACGTCCATGTCCTCTCACTCA 
TGCGCACCGAAACCAGAGTTTTGGATGGAGATGACCGCACCCAGTTAGTAGGATCTGCGTACCATGTGGGAAATAGCCCAAACACATCCACTTCA 
CCACCACACGAATGAAACATTATTGTATCGTTCATACTAAATTTAATCTCAGTTTAAGGTTTATTTGTTTATATATCAGATATAGAAGATTGTGC 
CAAAGTAAAGCAAAATCTACCTCCTAAACAAAAACTTATCTACATAATGATCAACACTTTTAGAGTCCCAAATGTGTAATTAACAATTATACGAG 
AAAATATTCGCTAATTTGACAAGCAAATAATATATATGTACGTATATTCCATAAAAAATGATTACGGAAAATAAACCTTGAAATTGTGATAGTCA 
ATTCGTTAACTGGTTTTCGCTTGATACAGGAAGTTCCTAATTTCAGTTATAGATAAACATGCTCTTTTACCCTCATTAAACACCAAAATTCGCAA 
TATAGATAAAAGCTGCCTCAATGATACTTATTAATCATTAATCACGATGTAATGAATGATACAATTTGCAAACTAACAACTTGTTAATAGTTAAT 
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AATAATACATAATGCAGCTCTAACTACACAAATTCATGCATGTGTGTGTATGTATGATGGACGCTTTTTGATTACCATTCTTCTTAGACCTTATC 
GTTGACATTATCTACTGACTAGATCCATCATCTTGTCTTCATTTTCGACTATATCACGAAAAGCTGATTTAAAAATAAACTTTTTATTCGTTTAT 
GATAAGCAAACATCTCGAATATTATAGAATCAAGTTGGTAATACTGACAACGGCTATTAGGTGCTATTATGTTACCCCGTGCTGTATTATGTCTT 
CGAATGCGGCACAGCCGGGCGGAAAGTACACGTAGCCGTGTGTACTCGAATTATACGAAGGTGCGTAAGGCGATGACTATAAATCGGAGAAGGTA 
ACGAGACAACATACGTATATGTATATGCATGGGATAAGGTAAGGGGGTCAAAATAAGTCAATAAGATCGCCGTGTTAAGCCAATGTCTTAAAATA 
CAGGGCCGGAGCGAGAGGGTGGCTAAGTTTTTAATTTGATCAAAGGCAATTTTAAATCGGAATTATGCTGTATCCACTAAATTTTATTACAATTG 
AAACTGTTTTGGGGGGTGACGCCCCAAAATGACCCCGCCGACTCCGCTCCTGGAGCCAGGGAATGCACCCACACTCGCGCACAAATAAATAGGGC 
GTAAAATGCTAATCTTGTGAATACGTGAATGTACATACATGTGTATGTACATACTCCACGCAAATCAAGTGTACATTATGTACATGCGAACGTAT 
GTAAGATCGGAAATCAGGTGAAAAGCCCAGGGCCCAATAGCACAGGTGGGGGCGGGCGCGTTAGCACGGGACATTTGCTTTAATATTAACATGTT 
TTCTGTTTCGCACTTTCTTCATTTTTTCATCTTTATTTTGGTTTTTTAGCCAGGCCAAAAGTTTACGCATACAGTGGCAATGTGTGTAGTACGTG 
CAAGGGTACAACTGCATGTGTGTGCATAGTAGGTGGGATTAGGGCTCATCTGCACGGGCTAAGTTTCCCTTATATCCCGTTATACTACTCGAATT 
CTGTAAACTTAAATGTTGTGAATGAAACTAAAATATAAATAAGCTACATTACTTTGCCAGTCACATTATAGACCTCTTTTAAGCCTCAAAAGCGG 
CGATTCCATGCGGAATTTAGGTCGTGCGAAGGGGCCTTAAGCGCGTTCTGGGTTAAGAACCCCCGCCCACGACCCCCTTACCACAACGACCCATC 
CCACAATCCGAGCATCCTTTTTCCTGAACCCGGTGCGATATTTGTTGGTTGATTCGAAGGATTCCCATACCAGATTTTGATTCCGATTCGGTGGT 
TCTGAATTCAGATAGTGATTCGATTGGATTCGATTTCATTCGACTGGATTTGATTCGTGTTTCGCATGCAATTTGAGTTTTGAAAACGACAAACA 
AGGTAAAATAATAATAAAATGTGTACCAATATGTCAAGCGTCAACGAATGCAAACGCACACTGAGGTGAATTGTTAGTTGTCTACACACGTTACC 
TTCGGGTATTGATTGTTTTTTAAAAGAAAACACAAAAAAAAAGCCGCGAAAATCACACAACCAGATTATCGATTTTGGCCCCCGCTGAAATTGGC 
GCACACAACACACAAATTTGACTGGGGAACGAACAGCGGGCGATGCAAACGATTCGGTTCCGACTTGGAATATATAGCTGCCGAACTATCGGTGC 
AACAATCGGATCGCTCAACCGGTCGCTCCATCTGCAGGGAATACCAAAACGCCTCAGTTAAGTGGCGCGCGAATGAAGGTACTTGCGCTTAGCCA 
CCAGCTGGGCACACTGCACGAGCCTGCATAGAGTTTTTTACGAAATTATCACAATTTAGGAGTTTTCCAATTAGAATTGGAATAAGTTCGTACTC 
TTAACTTGCTAACAAGTTGATTTTTAATGCAAAACTTTTCGCAGACAACTTTTCGACAAGAACAGTGCTGCCAGACTGTCAGACCGGCACCTTAC 
TGGCGTGTTTTTATCACATTTTCTAATATTTCCAAGCTATTGAAATATTCCAAAGCTTTCTATCTGCACATTCTTTTTATATAGGTGTTCCCAGA 
AAATCGCCTTAGCATGGAGAGTCGTGCTAATACGGCGTTGCCAGACTGTCATTGCATTTCTTGTTGCTCTTTGCAAGAGCCCGCCAAATGAAATT 
TTTATTTGAATACAATTTATTTTTTTCTTCCCCCGTTCTTTTGCTTGTATAATATCCCAACTGGAAAATGTGTACTTTTCAATTACGAAACATGT 
ACAATATATAGCAATAAAGTCCAGTAAAATCGTGGTATGTTTACAATTACTTTTTTGGGCCTGATGATACGTTGCCATTAAAGTGCAATGTCATA 
C C ACT AAT GT AAGT T T T T T AAC C CT T T AT AC C CC C C AAT AG C T T GC C C AT AAT CCTCCCGT G AAC AAT C T T G AAT C CC AAAT GC AAG AAC AAAGT 
T C G AAT AT T C AAC AGGAGC T C CGC C T AAT GT AT GAT G ACGC AT GT AT T TT AT ACAG AT C AAAT AT TT T C AT AC C G T TT T AC AGGGT T C C GAAG AA 
TAGAAAGTGGTAATTAAAAAAAGTATGCATTCGGTTCACCGAATTCAATTTGAATCGTTTTCCAAGTCAAATCAGGTTGGCTACAATGAGCAGAC 
AGAATGTGGGCGGCGACACCAGATAGTGTCGCCGAATGCATGGAAAGCGATTCAATGCAACGGCCTTCCATCGAAGAAGT 
(SEQ ID NO: 322} 

Exon: 8295.. 5349 
Exon: 5183.. 5119 
Exon: 2952.. 2384 
Exon: 2310.. 1861 
Exon: 1600.. 1001 

Start ATG: 5735 (Reverse strand: CAT) 
Transcrxpt No. : CT11537 

TGGAGCGACCGGTTGAGCGATCCGATTGTTGCACCGATAGTTCGGCAGCTATATATTCCAAGTCGGAACCGAATCGTTTGCATCGCCCGCTGTTC 
GTTCCCCAGTCAAATTTGTGTGTTGTGTGCGCCAATTTCAGCGGGGGCCAAAATCGATAATCTGGTTGTGTGATTTTCGCGGCTTTTTTTTTGTG 
TTTTCTTTTAAAAAACAATCAATACCCGAAGGTAACGTGTGTAGACAACTAACAATTCACCTCAGTGTGCGTTTGCATTCGTTGACGCTTGACAT 
ATTGGTACACATTTTATTATTATTTTACCTTGTTTGTCGTTTTCAAAACTC7AAATTGCATGCGAAACACG7AATC7AAATCCAGTCGAATGAAATCG 
AAT C C AATC G AAT C AC T ATC T G AAT T C AG AAC C AC C G AAT C GG AAT C AAAAT C T GGT AT G GG AAT C C T T C G AAT C AAC C AAC AAAT AT C GC ACC G 
GGTTCAGGAAAAAGGATGCTCGGATTGTGGGATGGGTCGTTGTGGTAAGGGGGTCGTGGGCGGGGGTTCTTAACCCAGAACGCGCTTAAGGCCCC 
TTCGCACGACCTAAATTCCGCATGGAATCGCCGCTTTTGAGGCTTAAAAGAGGTCTATAATGTGACTGGCAAAGTAATGTAGCTTATTTATATTT 
TAGTTTCATTCACAACATTTAAGTTTACAGAATTCGAGTAGTATAACGGGATATAAGGGAAACTTAGCCCGTGCAGATGAGCCCTAATCCCACCT 
ACTATGCACACACATGCAGTTGTACCCTTGCACGTACTACACACATTGCCACTGTATGCGTAAACTTTTGGCCTGGCTAAAAAACCAAAATAAAG 
ATGAAAAAATGAAGAAAGTGCGAAACAGAAAACATGTTAATATTAAAGCAAATGTCCCGTGCTAACGCGCCCGCCCCCACCTGTGCTATTGGGCC 
CTGGGCTTTTCACCTGATTTCCGATCTTACATACGTTCGCATGTACATAATGTACACTTGATTTGCGTGGAGTATGTACATACACATGTATGTAC 
ATTCACGTATTCACAAGATTAGCATTTTACGCCCTATTTATTTGTGCGCGAGTGTGGGTGCATTCCCTGGCTCCAGGAGCGGAGTCGGCGGGGTC 
ATTTTGGGGCGTCACCCCCCAAAACAGTTTCAATTGTAATAAAATTTAGTGGATACAGCATAATTCCGATTTAAAATTGCCTTTGATCAAATTAA 
AAACTTAGCCACCCTCTCGCTCCGGCCCTGTATTTTAAGACATTGGCTTAACACGGCGATCTTATTGACTTATTTTGACCCCCTTACCTTATCCC 
ATGCATATACATATACGTATGTTGTCTCGTTACCTTCTCCGATTTATAGTCATCGCCTTACGCACCTTCGTATAATTCGAGTACACACGGCTACG 
TGTACTTTCCGCCCGGCTGTGCCGCATTCGAAGACATAATACAGCACGGGGTAACATAATAGCACCTAATAGCCGTTGTCAGTATTACCAACTTG 
ATTCTATAATATTCGAGATGTTTGCTTATCATAAACGAATAAAAAGTTTATTTTTAAATCAGCTTTTCGTGATATAGTCGAAAATGAAGACAAGA 
TGATGGATCTAGTCAGTAGATAATGTCAACGATAAGGTCTAAGAAGAATGGTAATCAAAAAGCGTCCATCATACATACACACACATGCATGAATT 
TGTGTAGTTAGAGCTGCATTATGTATTATTATTAACTATTAACAAGTTGTTAGTTTGCAAATTGTATCATTCATTACATCGTGATTAATGATTAA 
TAAGTATCATTGAGGCAGCTTTTATCTATATTGCGAATTTTGGTGTTTAATGAGGGTAAAAGAGCATGTTTATCTATAACTGAAATTAGGAACTT 
CCTGTATCAAGCGAAAACCAGTTAACGAATTGACTATCACAATTTCAAGGTTTATTTTCCGTAATCATTTTTTATGGAATATACGTACATATATA 
TTATTTGCTTGTCAAATTAGCGAATATTTTCTCGTATAATTGTTAATTACACATTTGGGACTCTAAAAGTGTTGATCATTATGTAGATAAGTTTT 
TGTTTAGGAGGTAGATTTTGCTTTACTTTGGCACAATCTTCTATATCTGATATATAAACAAATAAACCTTAAACTGAGATTAAATTTAGTATGAA 
CGATACAATAATGTTTCATTCGTGTGGTGGTGAAGTGGATGTGTTTGGGCTATTTCCCACATGGTACGCAGATCCTACTAACTGGGTGCGGTCAT 
CTCCATCCAAAACTCTGGTTTCGGTGCGCATGAGTGAGAGGACATGGACGTAGTATACTTACTGGGTTTTTTAATTTTTCATTTGGCTGCGGTTC 
GTTGCGTGGGGGGGTGAGAGGATATCTTTGCTGGTTTGGTTGGGTTCTCGGTTGGTTTTGCATTCGGATGATTAGCTTTTAACTACCGCCTACCT 
ACCTACTCATTTTACATGCTGATGTTATTGTCGTTCTATATGTATGACCCGACCTGACACTTTCGTATTTCACATATATAATACAATAGAATGGG 
CACAGAAACCAAGAGCAACAGTTACACAGGACAGATTTCAACAAGTGGCGGCAACCCCAAAGTGATGAAAGATTCTTTATCCTTAGTTCGTCAGA 
CGGTCAATCAGCAAACGCTATCGCTATCGCAATCGAACCAGGTCCAGAACCAATTGAATTCACATTCGAATTCCAATTCGTATCCAAATCCAAGC 
GGGAGCGAAAACAAAAACGAAAACGAACAAAATTCGCGCGATATTCGAGCCAAACCGGAGGATAAATCCAGAAAAGAGGCAATTGTACCATTTGT 
ACCCATATTTGTTGAGGAGGCCGACGTAATTCAAGGGGCCAAGGAACTGTTGAAGGTTATACGACCGACCTGGGACCTCAGCCACGTCGAGTTTA 
AGATCAGGGTAGTTCCGCAGATCGAAGATCGTTATCCGGCCCAAAATGCGATGGTGATGATGACGCTAGTTTTACCGATGGCATCACAAACAAAC 
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TGGTCGGATGTTTTCATAAGGAGATCTCCAAACTGAACGATGAGAACGGCGGATCGTATTTACCTATCAAGACGCAGGGTTTGTCGCCCGTTCAG 
TCCGAGGATCCAGTCATCATTGAGAAGGAGGACGATGATGAGTTCACAGACGATCGGGCAGCGGACGACGGGTCACCTGTACAGTACTCCGATAA 
CGTAGTGCTCGTCAGGATATACGGTAACAAAACGGACCTACTGATAGATCGCAAGGCGGAGACGCAAAACTTTCTTCTACTGCATACGTATGGGC 
TAGCGCCATCGCTGTATGCCACGTTCAAGAACGGTCTCGTCTACGAATACGTACCTGGAACCACCCTGAATACGGACAGTGTGCTCTGTCCAGAA 
ATTTGGCCCTTGGTCGCCCGTCGCATGGCCGAGATGCATCGCAAGGTGAGAAAGCACGGGGATAGTTCGGCGACCAAACCCATGCCGATGATTTG 

GAAG AAG AC G C AG AGC T T T C T T G AT T T AGT AC C T G AACGT T T T AG T G AT GC TG AAAAA C AC AAAAGAGT G AAAG AAAC GTT T CT AC CTATCGGCC 

GCCTGCGCGAGGAGTTCAACAAGCTATATGAATACCTCGAGGCTCTGGACAGTCCGATTGTCTTCTCCCACAACGATCTTCTGCTGGGAAATGTG 

ATCTATACGCAGAGCCTGAATACGGTGAACTTCATCGACTACGAGTATGCCGATTACAATTTCCAGGCCTTCGACATTGGCAACCACTTTGCGGA 

GATGTGCGGCGTGGATGAGGTCGACTATTCGCGCTATCCGAAGCGCGAGTTCCAGCTCCAGTGGCTGAGGGTCTACCTTGAGGAGTACCTACAGC 

GCAGCAACATTCAAAACGACGAAGTCGAACTGCTCTATGTCCAGGTCAATCAGTTTGCGCTGGCATCGCATATCTTTTGGACGGTGTGGTCCCTG 

TTGCAAGCCGAGCATTCCACTATCGATTTCGACTACGTAGGCTATGCATTTCTTCGTTATAATGAGTACTTGGCTCGAAAGGTGGAGTTTTTGTC 

ATTGACTGCAGCTAAGAACAATAAGTGATTTCTCTCTCCCGGCCACTCAGCGCATTGGGAGTTACTACATTATGTGCCCAGTGATATATGTATAA 

ACTAAGGCTGGAACGGGTCTCAATGTTATACATATGTATATATATAACGTGTGTTCAGAAATCAGAAACAAAAACCGAAACAAACTTAAATTAGT 

GTCGTGTTCGTGCCAGCCTTAGTGTATGTACAGTGCTCACTCGAAACAGTGGACTATTGAGAACCAAGGGTGTGCACTGAATCGAATTTGTTCAC 

TATAACGAATCACAGAAAAAATAAACAATTTAATATATTTTTCAAAAACACCACGTTTGAAAAAATTTAATTTTTTTACTAAGTTAAGCGAAAGT 

TTCTAAAATTTCATAAATTAATCGTTCACTTTTACTAAAAAGTTCACATATTCGAGTGTTCACTCTTAAGGGAGGGCACTGTATATATAAAGAAT 

TTGTATATGTTCTTTAATTTGTATAAATTGATATTTTGTAGTCATAATAAAGCGAAGGTATGTACTTTAAA 

(SEQ ID NO: 323) 

Start ATG: 2561 (Reverse strand: CAT) 

MGTETKSNSYTGQISTSGGNPKVMKDSLSLVRQTVNQQTLSLSQSNQVQNQLKSHSNSNSYPNPSGSENKNENEQNSRDIRAKPEDKSRKEAIVP 
F V P I F VE E AD V I QGAKELLK V IRPTWDLSHVEFKI RV V P Q I E DRY P AQNAM VMMT L VL PMAS QTNW S D V F I RRS PN * 
(SEQ ID NO: 324) 

Classification: enzyme 
Gene Symbol: eas 
FlyBase ID: FBgn0000536 

Celera Sequence No. : 142000013384645 

AT AT T C G AT C GAT T GAG GGC GGT G CGT CGG GAG AT C GT G AT C C AG AT GT AC GAT G C C AGT C AG AAG AT T T GC C T GT TAG AGC C CAT T GT GATGT T 

T C T GGC AT AT AGT C GCT AT C GCTTGTGTGAGGAGCC C ATC G AGAAGT T C GAT CT C AAG AT AT GC AAT C AGC AT T T GC AAG AGT GC CT C AC GGG AG 

T GC T T T GC T GTT AC G AAG AAC T GG AGG AC T T GG AAT C ATC T AG AG AACC C AC CGT T C GT G AGCTT GAG CGG C GC T GC T T C AT AGAG AGC C T GT AT 

CAATTGTTTAATCTGGGTTCTCCCGAGTCCTTTACACGCGCCCTCACCTTACCGGACTATGTGCGTCGGGATGCGACGTTTAAGCTTTGTTTTGG 

GATCTGTTTGGCCTTCCAACAGGGCAATTTGTATAGAGTGCTCATGGGTGTGCCACAATTGCCGCATATTCTGTGCGCGGTAGCTGCCGCCAAGT 

TGCAGGTGATAAGGAGGTACAGTATAACTTTTTCTATTGGATCTCTTATTGACATGATGCATTTTTAATGTAAATACATATGTATGTATGTATAT 

TTGCAGGAGTTTGTTGCAGATTTTCACGCATGCCTACAACAACAAACAGCTTACGGTGCCGGTTCCTTACTTGTTGCGTTTACTGTTGTTCGATG 

GGCCAACAGGATTGCAGGACCAATGTCGGCATTATAATATATCCTTAACAGCGGATAGGAAGGCGGTGCAATTTAATAAGACTGATTTTAACCAC 

AATGCAGAGGTATTTACACCCCAACACGAGCGCTTTGTGGAGTCGAAGCTGAAGCGTATTTATATTCCAGAAGTTCTATTGCTAAAGAAATTTAA 

CTAGTTATTTATGAGCAACAATTGTAAATAGCTCGAAATAATAGTAAAGCCATGGAGATTAATTTATTGTTATTGCGCATTTGTTTTCTTGTCGG 

GTGGCAACCCTGGTGGCAGAACAACAAACCTATCGCTTATCGAAAGACAGAACGCCGCGGTATTTACAGCTAATAAATGGTTTGCCAGTTGGAGG 

TTAATAAAATGCAATTTTTGAATAGTAGTTAAGATTACATTTGTGCTGTTTTGCGAAGTAATGCGAAGAGAATAACCAATCTCCGCTGAATAAAC 

CGCTTAAGATATACACAAAGCGCGCGCAGGGTTGCATTGTGGCGCCTGTCGCTGCACACTACACACACACACACCCCACACACGCAACGCGAGAG 

AAAGTACCGAGCCGTTTCTGGAAAATACAATTCCGCCATATTAACCGGCAAAATTCAAAATATTTACACTGCCCGAGGTGAGCGGGATCGAGGTG 

GTCATCCAAAGCGAGAATTAACCCGCCATTGGGCAGCAGGAGAGTGGCGGGCGGAGCGGAGCGAGCGGTACGCATCATTCGCCGTCTGCCGCCGC 

CGTCGCACAATAAACGATAGGCATTGGCCACAAAAAAGTGGAAAAAGAGGTGGAGGGAAAGGCCAGTTCAGTGCAGCAATACCGACGAAAACACG 

TACGTTTGTCCACTTTGGAAGTCTGGCGAATATATAGAGATATATACATAATACGGATTTTTGGCCAGATACTAATTGCCCACTTTTTCCGCAGC 

GCGCTCGCACACACAACACTAACACTCACGCACGAAGCAAGAGCGCCCACAAATACAAGCATACAGATACAGGCACACAGACAGGCAGACGGAGC 

AGCCGCGATGACAAGTCATCATCATCACGTCGGCGTCGGAGGGAACTTCCAGCCTCAACTGCATCAGATACGAGCGCCCATCGTGAAGAATCTAT 

CGCTGGCCGCTGCGGCGGTACCGTTGCCCCTGCACTCCGCCCCGTCGTCGGCATACTTCACTCATGCACACCAGCCTACGGCGGCTACCATAGTC 

GAGCAAGATCCGTACACCGTTTTGGCGGGCGATCCGATGGGCGAGGAAAGCGAGAGCTTCCTGATCGAGGAGATCATCGACGACTATGATGATGA 

GGCTAATATCGTCGTAGACTCAGGCGAATTCTTTGAGCACTATGAATACTATCCGGTGGATCGGCTGGACAGACTTACCTCGGTCACGGTGGCTA 

CGGCAGATGCACTGCCGCCGCCGATCATCATGCAGCAGACCAGCGACGATGAGGAATTCATCGAGGAGGATGGACAGCAGATGACATCCTCCTGC 

GATGGCGATGGCGATGAGGATGATGAGGCTACCGCGCCGGCTACGACAAATGCCTTCGAGATTGCCTCCGAGATGCTAACCACCATCGAGGCCAC 

AAACATCAAGGAGGAGCAGCTCGAGAGCGACTTCTATATGAAGGCGACCGAGGATTCAAACAGTTCGAATGGCCAGCTGCAGGATTTTAAGCTAT 

TTGGAAGCAGTCGCGTCGAGAGGCCCATGGGAAATTCAGCGGCCAAGCGCTGGAAGCAAACAAAGGTACCCATACGCATAACGCAGGATGAGTTC 

AATGTCACCCTTTGGTCCTCGCTCAACTCGGAGGATGAGGACGAGGAGGAGGAGTTGGAGGAACCACCTGGTTGTGCTGGCACCACATCCGTGAC 

GTCCTCCTCCGCCGCCCAGAATGAAGGATCACCCAGCATGCCAAAGCTTATTATTGACGAGCACATGATCAACCACGATGTTTTAAGCGATGGCA 

TCGGCAACGAGTACGTCATCCTACCCGATCCATTCAGCGCCTCTGCGGTCACGGATGTGGAGGAGGTGGTTAATGCATTTATGGGCGATGTCAAA 

GGCGAGGAGGACAGCCAGCCGCTAAGTGAACAGTTGATCTATGCGGAAAACCAGTTGGAAGAAGCTTATGGTAACATCGAGCTTATCGAGAACAT 

GCCGAATAATGTGGAGCTGATACAGACACCGGGACCAGGTCCAGCCACTGTGTCGGTAGCGCTATCGAAGGTCAACGGATGTGTACCGAATCCAC 

AGCCGGCTCTCCCAAAACTAGTGCTCCAGAACAGCAGCACCAATGTGCGACTGAAAAGTGCGGGAAATCAGTCAAAGGTGACAAAGAGGCAACTC 

ACGGCGGCGGCTCAGGTGGCGACACCAAATCCCCCACCGCCACTAGTGGCTTCCAGCAATCCAGCCCGACCCCAACGACCCACAATTGTAACCAA 

ATTACCGACTGCAGTGGCCAATCAGACGGCTGGTAATGCCATGGGAGCAGCTGGCGCGGCCGGCGAAGAGGATGAGGAGCCAAAGTTTCGCTGCA 

CCCATCGCGGCTGCAACAAGGAGTTTCGCAACCACTCGGCCATGCGCAAGCACATGCACACGCACGGACCGCGCGGTCATGTGTGCAACGTCTGC 

GGCAAGAGCTTTGTAGAGAGCTCCAAGCTAAAGCGCCACCAGCTGGTGCACACCGGCGAGAAGCCCTTTGAGTGCACCTTTGAGGGGTGCGGCAA 

GCGCTTCTCGCTGGACTTCAATCTGCGCACCCACGTACGGATACACACTGGCGATAGGCCCTACCACTGTCCCATTGATGGCTGCTCCAAGTGCT 

TCGCCCAGTCAACGAACCTCAAGTCCCACATGCTGACGCACACGAAGCCCAAAAGGAAGTGGCCGCGCGCACCGCAAGTGAGCAACAACAAGACA 

CCGCTGGTGGCCAGATACGGGCGACTGGAGTTTGGCGAGGATCCTCGTCTAGTGTACGTAGAACAGAACGAGGAGGTGGCGTCGGTGCTGCTGGA 

TGGCACGTCCCTAACATCCTAACTAGTAGTGCGCTTTAATGTGTGGCAGGTATTGGCATGAGATGGTTAAAATATTAAAGCTCCTTTGGCTCCGG 
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AAGGCATTATTGTACAGTTAGTTGTAGCCAGTTTTTATCTGTATCTGTAACCTGTAACGAGCGGAGCGGCCGGACAAGATTGTCACCTGGTGAAC 
ACATCCGCCCGCTGCCCGCCCTTTAAGTAAGTAGTGTAAGTTCATTCTCTAACAAAATAGGATTGCAGTGCAAAAGGAAGTTAGATGGTAGCCAT 
TAATTAATTTCCTTCACGGATGAAGAGCAGTTCTAAGAGTTTAGCTTGTACCGTAGGCTAAGTGAGTCCCCCCGCAAAACCAAAACTGAACCGTA 
ATCCCCGAAGAACTCAGCCAAACTGTCACCTTAAAAAGCCCCAATTTGATCTGATACTAATCGTGTGCGATCTTCGCTCGAATAACAAAACGCGC 
ACTTTCTTTTCTACGAAAACAAAACAAACAAAAAAATAAGAAAAATACACAAAACTAGTTTTACCATATTCAATACGCATACACCATAGGCTATC 
TATATATTTTTGTAATGAATTATAACACTTAGAGCATTGAAGTACGGATTTAGCATGTTCAACGTCTGTTGGCGCTAACAATAATTCACAAATGG 
AAGCACATCAGTTAAGTCGGAATAAACAAGGGAAATACCTCTTCAAAATTAAACAAAAATAAACCAGGAAAACAATGCAACAACACTTGCGTATA 
TCATTTATCATTTCTTTTTTTAACTATTTCACAGTGCGATTGTGTAAGCGTTAGATTTATTAACTGACTTTAAACTTGTAACTACTGCTAGATAA 
CTACATTAAACGGTGAGGCAGCTATTGATGGACAGTCGCCGGGTGTTGTCCACTTCAGATTCGGATCAGCAAGCGATTAGATCGCCTCTTTGGAC 
ATGTGTCGCCTACAGCTACGTTCCATTCATTGATTGCGGTGTATAAACGCGTGCGGATTAGATAAAGCATCGCGGCGAGGTACAGATACGCGGCT 
ATTTTGGATATTGGAACGGAGAACAGTACACACGATTGGTCTGGGCATGGTCTGATCGGTATTTTAGGTGACCTTGAGCCCATGTCCTTGAACTC 
CGGGAATAATATCCGCGATGGAGGCACTTGTTGGGCCAATCAATCCGGCCAGCGAGAGCAGCACGGAGTAGTTGAAGTACCAGAAAACGCAGAAG 

AGT AC CAT GAAAAT GAC GT C T GC AAT G AAC C GAG AGC AAT AAT AAC C C T G AT AAGC GT AT AAATT C T C C AG AAGC AT GAC G AAT C C GT CG G AT AT 
TCCCAACTAGTTCGAGGGTGTGCTATTTTATTAACAGCAATTAACACTCACGTATCGTTATCCGCTCCCACTTTTCAACCATGTACAGCTCGGTG 
ACCTGCAAGTTAATGGGGTTCATTTAAATCAAAATCCCGTGGAGCGCAGCTTCTCAGGTGGAATATGTGCGTAATTTTA 

(SEQ ID NO: 325) 

Exon: 1001.. 1519 
Exon: 1615.. 3411 
Exon: 3478. .4209 
Start ATG: 1718 

Transcript No. : CT11601 

AACGCCGCGGTATTTACAGCTAATAAATGGTTTGCCAGTTGGAGGTTAATAAAATGCAATTTTTGAATAGTAGTTAAGATTACATTTGTGCTGTT 
TTGCGAAGTAATGCGAAGAGAATAACCAATCTCCGCTGAATAAACCGCTTAAGATATACACAAAGCGCGCGCAGGGTTGCATTGTGGCGCCTGTC 
GCTGCACACTACACACACACACACCCCACACACGCAACGCGAGAGAAAGTACCGAGCCGTTTCTGGAAAATACAATTCCGCCATATTAACCGGCA 
AAATTCAAAATATTTACACTGCCCGAGGTGAGCGGGATCGAGGTGGTCATCCAAAGCGAGAATTAACCCGCCATTGGGCAGCAGGAGAGTGGCGG 
GCGGAGCGGAGCGAGCGGTACGCATCATTCGCCGTCTGCCGCCGCCGTCGCACAATAAACGATAGGCATTGGCCACAAAAAAGTGGAAAAAGAGG 
TGGAGGGAAAGGCCAGTTCAGTGCAGCAATACCGACGAAAACACCGCGCTCGCACACACAACACTAACACTCACGCACGAAGCAAGAGCGCCCAC 
AAATACAAGCATACAGATACAGGCACACAGACAGGCAGACGGAGCAGCCGCGATGACAAGTCATCATCATCACGTCGGCGTCGGAGGGAACTTCC 
AGCCTCAACTGCATCAGATACGAGCGCCCATCGTGAAGAATCTATCGCTGGCCGCTGCGGCGGTACCGTTGCCCCTGCACTCCGCCCCGTCGTCG 
GCATACTTCACTCATGCACACCAGCCTACGGCGGCTACCATAGTCGAGCAAGATCCGTACACCGTTTTGGCGGGCGATCCGATGGGCGAGGAAAG 
CGAGAGCTTCCTGATCGAGGAGATCATCGACGACTATGATGATGAGGCTAATATCGTCGTAGACTCAGGCGAATTCTTTGAGCACTATGAATACT 
ATCCGGTGGATCGGCTGGACAGACTTACCTCGGTCACGGTGGCTACGGCAGATGCACTGCCGCCGCCGATCATCATGCAGCAGACCAGCGACGAT 
GAGGAATTCATCGAGGAGGATGGACAGCAGATGACATCCTCCTGCGATGGCGATGGCGATGAGGATGATGAGGCTACCGCGCCGGCTACGACAAA 
TGCCTTCGAGATTGCCTCCGAGATGCTAACCACCATCGAGGCCACAAACATCAAGGAGGAGCAGCTCGAGAGCGACTTCTATATGAAGGCGACCG 
AGGATTCAAACAGTTCGAATGGCCAGCTGCAGGATTTTAAGCTATTTGGAAGCAGTCGCGTCGAGAGGCCCATGGGAAATTCAGCGGCCAAGCGC 
TGGAAGCAAACAAAGGTACCCATACGCATAACGCAGGATGAGTTCAATGTCACCCTTTGGTCCTCGCTCAACTCGGAGGATGAGGACGAGGAGGA 

GG AGT T GG AGG AAC C ACC T GGT T GT GCT GGC AC C AC AT C C GT G AC GT C CTCCTCCGCCGCC C AG AAT G AAG GAT C AC C C AG CAT GC C AAAGC T T A 
TTATTGACGAGCACATGATCAACCACGATGTTTTAAGCGATGGCATCGGCAACGAGTACGTCATCCTACCCGATCCATTCAGCGCCTCTGCGGTC 
ACGGATGTGGAGGAGGTGGTTAATGCATTTATGGGCGATGTCAAAGGCGAGGAGGACAGCCAGCCGCTAAGTGAACAGTTGATCTATGCGGAAAA 
CCAGTTGGAAGAAGCTTATGGTAACATCGAGCTTATCGAGAACATGCCGAATAATGTGGAGCTGATACAGACACCGGGACCAGGTCCAGCCACTG 
TGTCGGTAGCGCTATCGAAGGTCAACGGATGTGTACCGAATCCACAGCCGGCTCTCCCAAAACTAGTGCTCCAGAACAGCAGCACCAATGTGCGA 
CTGAAAAGTGCGGGAAATCAGTCAAAGGTGACAAAGAGGCAACTCACGGCGGCGGCTCAGGTGGCGACACCAAATCCCCCACCGCCACTAGTGGC 
TTCCAGCAATCCAGCCCGACCCCAACGACCCACAATTGTAACCAAATTACCGACTGCAGTGGCCAATCAGACGGCTGGTAATGCCATGGGAGCAG 
CTGGCGCGGCCGGCGAAGAGGATGAGGAGCCAAAGTTTCGCTGCACCCATCGCGGCTGCAACAAGGAGTTTCGCAACCACTCGGCCATGCGCAAG 
CACATGCACACGCACGGACCGCGCGGTCATGTGTGCAACGTCTGCGGCAAGAGCTTTGTAGAGAGCTCCAAGCTAAAGCGCCACCAGCTGGTGCA 
CACCGGCGAGAAGCCCTTTGAGTGCACCTTTGAGGGGCCCTACCACTGTCCCATTGATGGCTGCTCCAAGTGCTTCGCCCAGTCAACGAACCTCA 
AGTCCCACATGCTGACGCACACGAAGCCCAAAAGGAAGTGGCCGCGCGCACCGCAAGTGAGCAACAACAAGACACCGCTGGTGGCCAGATACGGG 
CGACTGGAGTTTGGCGAGGATCCTCGTCTAGTGTACGTAGAACAGAACGAGGAGGTGGCGTCGGTGCTGCTGGATGGCACGTCCCTAACATCCTA 
ACTAGTAGTGCGCTTTAATGTGTGGCAGGTATTGGCATGAGATGGTTAAAATATTAAAGCTCCTTTGGCTCCGGAAGGCATTATTGTACAGTTAG 
TTGTAGCCAGTTTTTATCTGTATCTGTAACCTGTAACGAGCGGAGCGGCCGGACAAGATTGTCACCTGGTGAACACATCCGCCCGCTGCCCGCCC 
TTTAAGTAAGTAGTGTAAGTTCATTCTCTAACAAAATAGGATTGCAGTGCAAAAGGAAGTTAGATGGTAGCCATTAATTAATTTCCTTCACGGAT 
GAAGAGCAGTTCTAAGAGTTTAGCTTGTACCGTAGGCTAAGTGAGTCCCCCCGCAAAACCAAAACTGAACCGTAATCCCCGAAGAACTCAGCCAA 
ACTGTCACCTTAAAAAGCCCCAATTTGATCTGATACTAATCGTGTGCGATCTTCGCTCGAATAACAAAACGCGCACTTTCTTTTCTACGAAAACA 

AAACAAAC 
{SEQ ID NO: 326) 

Start ATG: 623 

MTSHHHHVGVGGNFQPQLHQIRAPIVKNLSLAAAAVPLPLHSAPSSAYFTHAHQPTAATIVEQDPYTVLAGDPMGEESESFLIEEIIDDYDDEAN 

IVVDSGEFFEHYEYYPVDRLDRLTSVTVATADALPPPIIMQQTSDDEEFIEEDGQQMTSSCDGDGDEDDEATAPATTNAFEIASEMLTTIEATNI 

KEEQLESDFYMKATEDSNSSNGQLQDFKLFGSSRVERPMGNSAAKRWKQTKVPIRITQDEFNVTLWSSLNSEDEDEEEELEEPPGCAGTTSVTSS 

SAAQNEGSPSMPKLIIDEHMINHDVLSDGIGNEYVILPDPFSASAVTDVEEVVNAFMGDVKGEEDSQPLSEQLIYAENQLEEAYGNIELIENMPN 

NVELIQTPGPGPATVSVALSKVNGCVPNPQPALPKLVLQNSSTNVRLKSAGNQSKVTKRQLTAAAQVATPNPPPPLVASSNPARPQRPTIVTKLP 

TAVANQTAGNAMGAAGAAGEEDEEPKFRCTHRGCNKEFRNHSAMRKHMHTHGPRGHVCNVCGKSFVESSKLKRHQLVHTGEKPFECTFEGPYHCP 

IDGCSKCFAQSTNLKSHMLTHTKPKRKWPRAPQVSNNKTPLVARYGRLEFGEDPRLVYVEQNEEVASVLLDGTSLTS* 

(SEQ ID NO: 327) 

Name: zinc finger protein 
Classification : transcription_f actor 
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Celera Sequence No. : 142000013384689 

ATCGTTAATTGACTTGAATTGCGGCACCACATTCAGATAGCGGAAAGTACTGCCCATCTGGTCCGTGAACAGGAAGTCCGCGGAAGCCACCATAT 
GTCCTCGATTAATCACCAAGTAGTTGGCATTTTGCAAGTAGGACTGGTCATCACCGTAAATGCATTTGAACGCAAAATAGATGTTGGACTTCATG 
TAGGCAGCAGCCTCCTGCGGACTGAACATCTCATCCGCCACAAACTCCACAAATGGCCGCTTCGGAACTATAACTCCTTACACTATACTAGGGGT 
TCAAAAGGCCAAACTTTGTGATTTACTCACAAAACGTTTTGTAGTAGACATCGCTCTGCGAGTACACGAGTCTTCCCTGCCCGCAATCGAAACAT 
GTGCGATACAGCTCCAGATCCTTGCCATCTATGGTATATCCCACGGCATACAGGTTCCCCGCACAATCCCCATCGCGAATATGCTTGGTCACCGG 
TTGCATTGGACTTAAACAGCGCATGGGCAGTGGGACCGAGAACTGTCCATTGTCCTGGCAAACAGTTTCTTTGAAAACCGAGGGACTGCAGTACA 
CGAGTAGAGTGACACCGGATGGAACTGTTTCCAAGCGCTGCAGCTGGATAGATCCGCTCGCATCGCGATAGGTGAAAATTCCATTGGATTGCTCT 
ACCATATATTGTAAAATTTGACAATCGCCGTGCACTTCCCAAGCTGTCGATCATGTGCCAAATCATGTGCCAAATTCACAATCTTAAAAAATAAT 
TTTTCAACTCACCCAGAAAGCATATGAAATATATGTAAATAAATAATGTGATATAGGAGCAATCCATAGGAGCAATTTTCATATACTATGAAAAT 
TTTGAAAAAATGCCAAATAAAGAAGTTAAATGTACTAAATTCCTAAATCGGAAGCTTCTATCGATACATTCTAGTTAGCAGTTAGTTCAACATAC 
TATTTTATAGAAGCCATTTCGGAAGCAAATTTCGGAACCTATTAAATAAGTAACCACATTTTTTTTGACAGTCCACTCGTGGATAACACTTTATT 
GCTTCTATCACATACAAAAGTTGTTGTTTGGTTGCTAGTTGAACTCTGCTTCGAAACAATTGATCAATTCTTGTATTCGTGGTACTGTACTTCAA 
GATAATAATCAGTATTTAAATTTTTTTTTGTTTTTTGGGTTTTTTCTTTCTACTTCAAACAATTGATATTGATGTGGTTCTACAAACAAATATAT 
AATAGTTCTTTATGTGATAAGCATACGCCATAACGATAAAAGAAATTAAAAGTAGGAAAAGAACGAATCGAAATCCAATCGCATCGGAGAATTAA 
ATGTGCATATGTAGTCGTATATCGTAGGTGGTATAAAATACATATAAAATAAACAAAACTTAAAATGAGAACTGAAAACGAAACTTAAATATTTT 
GCACTAGTTCTTTAACAAAGATTTTAGGGAACAGGGTTAGAGGCTGCTTGCTTTTTACTTGTTCAAAATTGCTGCGCGAAATCTCATACAAATTA 
AAACCTATATAGAAAACGGTTTCGATGTGCCATCTTACTGCGTGTAAATGCTATAAACTATACTTGATTGCTCGGGTGGTCTCTACTGCAGTGCA 
TACGCGTTGCCAGAACATCGCTTAGTTGTTGTCCTGCTTGTTGATCAGCTTAAGGATCACCTTGCCGGCCTGCACCTCCAGCGCGTAGAGGCTGC 
CGTCCTTGATCTCGTTGTTCAGCACCTCGTCGGTGATGGTCACAAGGATGCCCGATGGCCCCGATATGTACACGCAGCTCTTGTTGGCGTCGTTG 
AAGATATTGGAGCCGCTGCCCGAATACTTGGAGTGCATCCCCCAATTGTTGAAGAGTGCTGCAGCAGCTGCAGTGGCTGCTCCACCGGCTCCGTT 
CTCCTGGGCACTGGCCTTGGCCATGAACTCGTAGAAGCCGGGCATCTTGAAAAGCTTCTGCTGCAGCTCCTTGGCCGTGTTCGACAACAAGTAGA 
TGGCATTGTAGCTGCATCCATCCACGCTGGCGTACAACGTTAGCCTGGGAGCAATAGTTCTGTTGGAAGGATCACAATTAGAACAAGTATTTGAG 
TTTTTCAATATGGTTACACTCACTTGGCGCGCAAAATGTTGAACATGCGAATGCCATCGGCCAGGCCGCAAATTTGAATCAAATCCTCCTTGGAC 
AT GC T AAAAAT AAT T C G AAATT AT AAAT AAGTC GGAAG AAC AAAC AAAT T T C AAAT GT GTG GT G T GGC AC T AGTT AAAG AT AG AAC T AT T AC AT A 
CATAAACCTATAAAATGCATACATAAAGTAATAACGATCAAGTATCTGACGGAGTAGTCCAGCTATCTAATCAAGGTATTTAATAATCAAAAACT 
TTATGGTTTGATTGGTCGTAATCAATTGGATGTGAAATCATATTGTTCCTTAGCTGTTTTCTTTTAGTTCCGCTATCTGTTATATGTCTGTCAAC 
AAGATAACACAACTAACTGATTAAAAAATTATATGGTTTATCGCTTGATTGTAAGAAATGGTAATCAAAAACACCTAAGTACTCGTAGCTTAGCA 
GACAAATAGGGGCGTGACGATAATTAAAACGCTTTTATATATAAGTCAGTGAGTTGGTTGTAACTTCAATCGAACGATGCTTTTAATTAGTTTTA 
ATTAGTTTTTTAATATTAGTTTGATAAAAGAAAACTACTCGACTCTCCTACGATTTGGTTGGCATAAAATTGGGTCACTTAGAAGCAGAAAAACT 
ATAACATTAATTCGGTTCGAATTCCAGATCACATACCGCATGATATCCGCTCCCGAGAACTGTGCAAACGTTGAAAGATAGGCCGTGAGGCGATG 
ATTGGTCAGCCACTGCGTCACTTGCGAGGGCGTCGATTCCGGCATTATGTTCTGGCTCTGAAATGGACAATAGTAAGTCATTTATTTAGTAAGTC 
TGCTTAGGGAGCAGTGGGCAATGGCACCTACATAATCGTCCATGTCAGCGGGCACATGCTGCGGTGAGACCATGTTTGTGGCGTCCATCAGCTTG 
AGGGTGGGCGAGTTGGTCGATGTCACCGAGTTGATGGCTATGGGGCTGCTGCTGGTAGCCGGCGATGCTGCACTTGGCGCAAACGGAAGCATTCC 
ATCGTACTTGGGTATGTGCACGGGCGAGTTTGGCCAAAGTTTCATACTGCAAGGAAGAGGATACAGGAGGAGATTAGTACAAGCCAGCTTTAAGT 
GAG T T C T T GC AG AT C C T AC AAAT C C AAT AG AGC TCGTTCCGTAT CT G ACG T T G GC A GG AGC G AC GAT C C T T C AG AT GCAAG C GC AT G TT C AC G AA 
TGCGAGGAACTCCAGCGCGGAATTCACAATTCGAATTTCCCATGGTCCCGCTGCTCCCCGGGTAATCTGGTTTGCTCTTCGCCAAGCCTTGCTTA 
GAAGGCACCCGATACGCCCAGTTGCGGCTGATTGGCCAGACAAACCCGTTTTCCTTTCCACCGGGGAAATTTTACAGATTTGCGTCTTTCGCCGC 
CCCCCATGATTTCCGGTTTTTGTGGATGGATAATACATAAGCGTAGGCTTTTCTGCACCTGATCGATATAATCGATCTCAACAATACAAGACCTT 
AT AC AAT C C CC AT AT CAT T C GC CAT CAT TAT C T AT AGT GC T C GG AAT GC AAAC AAAC G AAAT CAT T G AAC AG AG ACC AAAAC C AAAAAC CGGT T A 
AAAGCAACTTTAAACTAAAAATAACTTGGAGCAGACGCAATAACAACAAAGACCGCAGCGCAAAGTGCTCTTAACTGATCTCTTAGCTTTCCACT 
ATCCTTCGGATTCTCTCTATAAATCTCTCTGGATCTCTTTCTCCCCATTTCTCCGGATCTCTCACCGATCGAGGTGTTCTCTCTTTATATGAAAC 
ACGTATACACCCGCTCTCAATTCGATTGATCTTTATGATTTTGTGATCATGCAAATGACAAACTGAATGATTTTCATTTGCTCTCGTTATGGAAA 
ATTTCCAACCAAATATCTGTAAATAGCGTTTTTCTATGCACACTAGATTCTAAATAAAATACATATTTAATAGAACCACAAAAAACAACGTGTAA 
TCGCTTGGTTTGTTTGTTTATGTTTTAAATTCGC AAAT AAAC AGAAAGCTTTTGATC 
{SEQ ID NO: 328} 



Exon: 3142.. 2977 
Exon: 2907. .2792 
Exon: 2188. .2114 
Exon: 2054. .1001 

Start ATG: 3033 (Reverse strand: CAT) 



Transcript No. : CT11653 

GTACGATGGAATGCTTCCGTTTGCGCCAAGTGCAGCATCGCCGGCTACCAGCAGCAGCCCCATAGCCATCAACTCGGTGACATCGACCAACTCGC 
CCACCCTCAAGCTGATGGACGCCACAAACATGGTCTCACCGCAGCATGTGCCCGCTGACATGGACGATTATAGCCAGAACATAATGCCGGAATCG 
ACGCCCTCGCAAGTGACGCAGTGGCTGACCAATCATCGCCTCACGGCCTATCTTTCAACGTTTGCACAGTTCTCGGGAGCGGATATCATGCGCAT 
GTCCAAGGAGGATTTGATTCAAATTTGCGGCCTGGCCGATGGCATTCGCATGTTCAACATTTTGCGCGCCAAAACTATTGCTCCCAGGCTAACGT 
TGTACGCCAGCGTGGATGGATGCAGCTACAATGCCATCTACTTGTTGTCGAACACGGCCAAGGAGCTGCAGCAGAAGCTTTTCAAGATGCCCGGC 
TTCTACGAGTTCATGGCCAAGGCCAGTGCCCAGGAGAACGGAGCCGGTGGAGCAGCCACTGCAGCTGCTGCAGCACTCTTCAACAATTGGGGGAT 
GCACTCCAAGTATTCGGGCAGCGGCTCCAATATCTTCAACGACGCCAACAAGAGCTGCGTGTACATATCGGGGCCATCGGGCATCCTTGTGACCA 
TCACCGACGAGGTGCTGAACAACGAGATCAAGGACGGCAGCCTCTACGCGCTGGAGGTGCAGGCCGGCAAGGTGATCCTTAAGCTGATCAACAAG 
CAGGACAACAACTAAGCGATGTTCTGGCAACGCGTATGCACTGCAGTAGAGACCACCCGAGCAATCAAGTATAGTTTATAGCATTTACACGCAGT 
AAGATGGCACATCGAAACCGTTTTCTATATAGGTTTTAATTTGTATGAGATTTCGCGCAGCAATTTTGAACAAGTAAAAAGCAAGCAGCCTCTAA 
CCCTGTTCCCTAAAATCTTTGTTAAAGAACTAGTGCAAAATATTTAAGTTTCGTTTTCAGTTCTCATTTTAAGTTTTGTTTATTTTATATGTATT 
TTATACCACCTACGATATACGACTACATATGCACATTTAATTCTCCGATGCGATTGGATTTCGATTCGTTCTTTTCCTACTTTTAATTTCTTTTA 
TCGTTATGGCGTATGCTTATCACATAAAGAACTATTATATATTTGTTTGTAGAACCACATCAATATCAATTGTTTGAAGTAGAAAGAAAAAACCC 
AAAAAACAAAAAAAAATTTAAATACTGATTATTATCTTGAAGTACAGTACCACGAATACAAGAATTGATCAATTGTTTCGAAGCAGAGTTCAACT 
AGCAACCAAACAACAACTTTTGTATGTGATAGAAGCAATAAAGTGTTATCCACGAGTGGACTGTCAAAAAAAATGTGGTTA 
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(SEQ ID NO: 329) 

Start ATG: 110 (Reverse strand: CAT) 



MDATNMVSPQHVPADMDDYSQNIMPESTPSQVTQW^ 
DGCSYNAI^ 

LNNEIKDGSLYALEVQAGKVILKLINKQDNN* 
(SEQ ID NO: 330) 



rYLLSNTAKELQQKLFKMPGFYEFMAKASAQENGAGGAATAAAAALFNNWGMHSKYSGSGSNIFNDANKSCVYISGPSGILVTITDEV 



Classification: transcription__f actor 
Celera Sequence No. : 142000013384479 

AACAATGGTACAACCTTATCGGCTTTTAAATTTAATTCAGTCTTCACAACTTCAATGCTTTAAGACTGCTAGTTTAATTTGGGTAATAAACTTCT 
AACAGGGTAAAAGTGGTGTGACACTAGTTTTTGAATTTGATATGCTAATATTAAAATACCATCGTGGGGTACAAGGTACATTTTACTAAATAAAG 
ATCGCTTGGAACATAAACCTTTCTGTTCTAAGCACAACACTGGATAACTATGCAGCCGCAATAAGTTAGAATAGTTATATATCTTTTGGATAGCG 
GAGTGTTTGGCCAGGGTGCGAGGATCTAGATACAAATTGAAGCCATAATAGTTCCATTATGGGGCGCACCACATTGCTGTCCGGACACAAATGCA 
CCGAGCCCCTCGTGATGTGAGTTTAGGAAGGGCAATTGGTAATTACCAGCATGAGGGCCACCTCCGCCATGGGGGTGACATACAGGCACTGGTGG 
AAAAGCGGAATCGCATCCACGATTTCCACTTGGGAGCCCTTCGATGTCTTCTCGGCCAGCAAGAGTCCGTTGACCGCCTGGTGTGGATACTTGGC 
CGCGTGGAATATCAACTTGGCGTACGCCCGCTCCGAGACCTTGTAGTCGCACATTTTCTCAGAATATTATCTCAATTTATTCGGCAAAAAATAAA 
ACATGAATTGGAAATTCGACAGCTAGAAAGTGTGACCGCGACGCAGCTGTTCAAATATACCGCAAACGTCAAATATACAATGAAAAATACTAAAA 

T GG AT G AGC AT GG AAT G GAT T C C AC AAAAAAT GT G AT C AAAAC T AT AT T T GT T AAT T AT G TCT T T AAC AT T G AAAAT AT AT T T AT AACT T AAGT T 
CTAGAAAATGGTTTCTAACATAGGTTGCAGTAGTCTGGTCACATCGATATGGGCGCGCAAACAATATGTGGATTCCACTTCAAATTAATTTATTA 

CT C AAA C T AAT AT T T AAT TCT AAC GT G AAT GAT GG AAAAC AT AAC T G AAT AT GT G T AAC ATT T T AT T T C T T AAC TGTCCGTCTGCGTCTCGCTCT 
TGCTTGGTTTTTTTCTCCGGTTTTTTATATATGTATATTGTAACTAAAAATATTTAGCAGTTCAGCTGAATGTCCGGTGGCAGACCAACCCGGCC 
TTCCAATTGGATCCTTGAACACCATCAAACCACACCGATGGGGGAACTACGTCTGGCTTTCATTCAGTATTGCCTTGATGTCGTCGGCGTCAAGA 
GTCTCGTACTTGAGAAGCGCCTCCGCAAGCGCTTTGTGCTCCCTGGTGTGTTTTCGCAGGATAGCCTTCGCCCGCTCGTAGCTGTCGCTGAGGAT 
GCGCTTGATCTCCGCGTCCACGGCTTCAATGGTGTTGGGACCAAGGGTGTCACCGGTGCCCAGACCCTTTGACGCCTCGATGGTGCGAAGACCGA 
CCTTGTCCGACATGCCCCAGTCCCTGACCATATGCGTCGCGATCGAGGTTGCCTGCTTGAGATCACTGCTGGCTCCCGACGTAATCTTGTCCGTG 
CCGAAGACCAGTTCCTCAGCAGCACGACCGCCCATCATGGTGTCCATCATGGCCAGCAGCTGTGCCTTTGTGACATGGTAACGCTCCTTCTCCGG 
TATGTAGGCGGTGTGTCCAAGGGAGGGTCCGCGGGGCATGATGGTCACCTTGTGCAGCGGATGCGACTCCTTGGTGTAGAAGGCCACAATCGCGT 
GGCCGCCCTCGTGGTAAGCGGTTATGGTGTTGGCCTCCTCGTCCGGTAGACGAGCCTTGCGCTCGGGTCCCATAAGTACTTTGTCTCGTGCCGTT 
TCCAAATGCTTCATGCTGACCGTTTCGGCTCCATCGATGGCAGCTCTGCCAAAATGTTAGAATTATTAGTACCGCTCTTCAAAATTGTGATTGAA 
TAATACAACACAAGTTATTCCTTACCTTAACGCCGCTTGGTTAATCATGTTTTCCAGATCCGCTCCAGTAAAACCAGAGGTGCCGCGCGCCAGCA 
TGTCCAAGTCTATTTCATCGTGCAAAATCTTCGTCAGATACAGAGACAATATCTCCTTACGTCCAGTGAAATCGGGCGTCGACACCATGACTTCT 

AC AT C AAAT C GG C CG GG AC GC AGC AGAGC CT G AT C C AAAT C GT CGC G AC GAT TGGTGGCTC C C AAAAC AAT GAC GCCAGCGTTCT GAT GG AA GC C 

ATCCATCTCGGAGAGCAGCTGATTGATGGTCTGATTGGCGTAAGGATGCAGCACAGAGTTGGTTCGCTTGGCGCCCACCGAGTCGATTTCGTCAA 

TGAAGATAACACAAGGGGCACGGGCCTTTGCTGCCTCTGCAAGATGATGTTCTTGTTTAGTTGAGCACTTTGGATAATTGAAATCTACTTACTGA 

ACAGATCACGCACTCGTCTGGCTCCTTGGCCTACGAGCACCTCGTCGAACTCCGGTCCGGCAGCATGGAAGAAAGGAACTTTGGCTTCACCGGCG 

ACGGCACGGGCCAGCAGTGTTTTGCCAGTACCCGGCGGGCCAACCAGGAGCACTCCCTTTGGAAGCTTTCCGCCCAAATTTGAGAACTTTTCCGG 

ACTCTTCAGGAATTCAACGACCTCCTTGAGCTCCTGCTTGGCTTCGTCGCAACCCTTGACATCCTCAAAAGTGACGTTGATCTCCTCAGGGTCCA 

CTTCAACCTGGTTGCCCAGTTGTATGCTGCTGAAAGAAAGGCATATCAGATGGGTAAAGCAAAACTAATAACGTGGAGGCTCACCGAAATACGGA 

TCCATTGGAGGTTGTAAAGAAGCTCAGGAAAATGCCCAAGAACACGACGATAACGACGATAGTCTGCAAAGTCTGCGTGACAATTACTAGAATTT 

ATTTATAACACTAAAGAAGACCAAGGCCTTCTGTGGTTTTCAAAGTAATCATGGTACACTTTGAATGAATTTAAACTCGTACCTTCAGATATTTC 

ATAGTTTTTCCTGATTTTGGAGAGTCCTCCGAGTTAGCGGCGGCCAAGTAGCCCTCGGCAAAGGCAATTTTCAGGCTCTCCTGGAATACAAAACA 

TTAGTACGGTTAACAATTGAGAGACCCTGTCGTCCTACGGCTTTATTAAATCCATGCTCTTCGGATTTGGCCAGCAGCCGTCTCAGTTTTTCAGC 

CTGAAGGGGTGTGTCCCCATCCAGACGCTGGGGCGAGTTGGCCAGAGCATTCTTTAGTCGGGAGGTCATTGTGGGATTGCGCTTTTGTTCCGCCT 

CAATGGAACGATCCGTCTTAAAGCCACGTATCTGATTTCTATAAAGAGCTAAATTTGAGATATAGTTCGTAGATACTTTCCGTATAATTACAGCT 

GTTGTTGTGAGAAGAGCGGACGAAGAAGACCTGGTGTAATAGTTGGAATCACCTTGCTGAAAACATAAACCAGGATTAAGCGGAGCCTCGAAAGG 

ATAATTTGCAATACGTACACATTGGCGCCTGTGGGCACGGAACCTGCTGCACCGGTTAGCGAAACTTCGAACTGATAATTTTGCAGCTGGACACG 

ACCGATCTGTGTGGCCGACTCCTCCAACGTGACTCGGACATCGCGCATGGAGGGCAGCATGCTCGTCACCTTGGCCAGCACATCATCGACCAGCC 

GCCGGTCGAGAATGCCATTCAGTCGAGCCGCCCTCTCCACCATGCCCTGTATACTGGCGCTCGAACGGCTCAGCAGGTTCCGCAGGTCGAGCACT 

AGGTCATGACTCCGGCTGGACGTGGACGTGGACTTGGACAACCGGGCAGCTCCCGCGCTCCCGTGCAGCTTGGTTCGATTAACGGAGTAGTAGTG 

CGGCTTTCTGCTGAAGTTGCCTAGGTACAGGTAGGGCTGTCACATCACATCACACGCAGTTCCCCGATTCCAAGTGTTGGGTGGTAGAAATGGAG 

CAGTCAGGTGGGCGAATGAATGTATTGTCATCGGGCAGCACTTACCACGGAGTGTGTGGTGGTGGAAAACATTGCCAAAACTACAAATTTCAATA 

AATTACAAGGGCAGGCTCAGGATCGCGGCCAGCAGACGACACAGCTGTAACTTTACTTGACAGCGAAAATGTCATAATCTCGTCGCGTCGTGCGC 

GTTGCATCCCTGGTTCTGGTGGTGCGCGACTTTCCACCTCTTATCAAAGCACGCGTTCTCCAATTACTGGCAATTTTCAAAAGTGTATTCGTTTG 

TCGACTGGTGGTTTTGATTTTATTCGATTTTTAATTTGGTTTATATATCTAAACTGCCTCTCCGTACAGTCAACGCTGCTGATAGTGTTGCTAAA 

CTTACATAAATATATAAAATTTTGTATAAATAAAATGAATGATAATCAGCGCAAACTTTGCATTCTATAAAATGTGATTTGTTTACTACTAAGAT 

C AAAT GAAAGTCTATTCCAAATTTTACTT AT ATTGACAACAAATTCCATTGT AT CAACTGT AAAT GAATGTTGTCTTTGTTATTTAGTTTT AACT 

TCTTAAAGAAAAAGGTTTTTAACCAGTTTAGGATAGACAAAGAGTCCCTATCGCAACCATCGTGCATAGATCGACCATCACTAGTTCTGGAGATG 

TGCAGAAGTCACTGGCCCGTTTCGCCGGCGACCAGGGTTGGCAAATCGGCTGCGCCGGTAGTTGGTCCGTTATACATTTCACAAATTCCACATTA 

AATTGTATCAGCATGGACTCCCCGCTGAACGATGGCTCTCACCATCCACCGCCGCACGCTCCCCACCCCCTGGTGAGTGCTCTACGGTCCTTAAA 

CTGCCACTACCCAAACGTTTGCATCTATTTCAGGCGGACTATCAGTTTTCTGCGGAGGAGGTGAAGGCGTTACGTGAGTGCAACACTGAGTCTTT 

CTTCCAGCGCAGCCTGCCTTTCGGCACTGGACTGGGCCTGCTGGCCTACTTCGGCGTCAAGAACGGCTATTTGCAGGGACACGTCAAGTACGGCG 

CCGTGCCCAAGGTCGTGATGGGCGTCATTCTGGGCTACTTTGTGGGCAAGTTCAGTTACCAGCAGAAGTGTGCCGAGAAGATCATGCGCCTACCC 

AACTCCCATC 
(SEQ ID NO: 331) 



Exon: 4045. .3941 
Exon: 3836. .3439 
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Exon: 3381.. 3322 
Exon: 3268.. 3079 
Exon: 3025. .2933 
Exon: 2818. .2745 
Exon: 2686. .2373 
Exon: 2316.. 1926 
Exon: 1850. .1001 

Start ATG: 3967 (Reverse strand: CAT) 
Transcript No. : CT11787 

TAAAGTTACAGCTGTGTCGTCTGCTGGCCGCGATCCTGAGCCTGCCCTTGTAATTTATTGAAATTTGTAGTTTTGGCAATGTTTTCCACCACCAC 
ACACTCCGTGCCCTACCTGTACCTAGGCAACTTCAGCAGAAAGCCGCACTACTACTCCGTTAATCGAACCAAGCTGCACGGGAGCGCGGGAGCTG 
CCCGGTTGTCCAAGTCCACGTCCACGTCCAGCCGGAGTCATGACCTAGTGCTCGACCTGCGGAACCTGCTGAGCCGTTCGAGCGCCAGTATACAG 
GGCATGGTGGAGAGGGCGGCTCGACTGAATGGCATTCTCGACCGGCGGCTGGTCGATGATGTGCTGGCCAAGGTGACGAGCATGCTGCCCTCCAT 
GCGCGATGTCCGAGTCACGTTGGAGGAGTCGGCCACACAGATCGGTCGTGTCCAGCTGCAAAATTATCAGTTCGAAGTTTCGCTAACCGGTGCAG 
CAGGTTCCGTGCCCACAGGCGCCAATGTCAAGGTGATTCCAACTATTACACCAGGTCTTCTTCGTCCGCTCTTCTCACAACAACAGCTAAATCAG 
ATACGTGGCTTTAAGACGGATCGTTCCATTGAGGCGGAACAAAAGCGCAATCCCACAATGACCTCCCGACTAAAGAATGCTCTGGCCAACTCGCC 
CCAGCGTCTGGATGGGGACACACCCCTTCAGGCTGAAAAACTGAGACGGCTGCTGGCCAAATCCGAAGAGCATGGATTTAATAAAGCCGAGAGCC 
TGAAAATTGCCTTTGCCGAGGGCTACTTGGCCGCCGCTAACTCGGAGGACTCTCCAAAATCAGGAAAAACTATGAAATATCTGAAGACTATCGTC 
GTTATCGTCGTGTTCTTGGGCATTTTCCTGAGCTTCTTTACAACCTCCAATGGATCCGTATTTCGCATACAACTGGGCAACCAGGTTGAAGTGGA 
CCCTGAGGAGATCAACGTCACTTTTGAGGATGTCAAGGGTTGCGACGAAGCCAAGCAGGAGCTCAAGGAGGTCGTTGAATTCCTGAAGAGTCCGG 
AAAAGTTCTCAAATTTGGGCGGAAAGCTTCCAAAGGGAGTGCTCCTGGTTGGCCCGCCGGGTACTGGCAAAACACTGCTGGCCCGTGCCGTCGCC 
GGTGAAGCCAAAGTTCCTTTCTTCCATGCTGCCGGACCGGAGTTCGACGAGGTGCTCGTAGGCCAAGGAGCCAGACGAGTGCGTGATCTGTTCAA 
GGCAGCAAAGGCCCGTGCCCCTTGTGTTATCTTCATTGACGAAATCGACTCGGTGGGCGCCAAGCGAACCAACTCTGTGCTGCATCCTTACGCCA 
ATCAGACCATCAATCAGCTGCTCTCCGAGATGGATGGCTTCCATCAGAACGCTGGCGTCATTGTTTTGGGAGCCACCAATCGTCGCGACGATTTG 
GATCAGGCTCTGCTGCGTCCCGGCCGATTTGATGTAGAAGTCATGGTGTCGACGCCCGATTTCACTGGACGTAAGGAGATATTGTCTCTGTATCT 
GACGAAGATTTTGCACGATGAAATAGACTTGGACATGCTGGCGCGCGGCACCTCTGGTTTTACTGGAGCGGATCTGGAAAACATGATTAACCAAG 
CGGCGTTAAGAGCTGCCATCGATGGAGCCGAAACGGTCAGCATGAAGCATTTGGAAACGGCACGAGACAAAGTACTTATGGGACCCGAGCGCAAG 
GCTCGTCTACCGGACGAGGAGGCCAACACCATAACCGCTTACCACGAGGGCGGCCACGCGATTGTGGCCTTCTACACCAAGGAGTCGCATCCGCT 
GCACAAGGTGACCATCATGCCCCGCGGACCCTCCCTTGGACACACCGCCTACATACCGGAGAAGGAGCGTTACCATGTCACAAAGGCACAGCTGC 
TGGCCATGATGGACACCATGATGGGCGGTCGTGCTGCTGAGGAACTGGTCTTCGGCACGGACAAGATTACGTCGGGAGCCAGCAGTGATCTCAAG 
CAGGCAACCTCGATCGCGACGCATATGGTCAGGGACTGGGGCATGTCGGACAAGGTCGGTCTTCGCACCATCGAGGCGTCAAAGGGTCTGGGCAC 
CGGTGACACCCTTGGTCCCAACACCATTGAAGCCGTGGACGCGGAGATCAAGCGCATCCTCAGCGACAGCTACGAGCGGGCGAAGGCTATCCTGC 
GAAAACACACCAGGGAGCACAAAGCGCTTGCGGAGGCGCTTCTCAAGTACGAGACTCTTGACGCCGACGACATCAAGGCAATACTGAATGAAAGC 
CAGACGTAGTTCCCCCATCGGTGTGGTTTGATGGTGTTCAAGGATCCAATTGGAAGGCCGGGTTGGTCTGCCACCGGACATTCAGCTGAACTGCT 
AAATATTTTTAGTTACAATATACATATATAAAAAACCGGAGAAAAAAACCAAGCAAGAGCGAGACGCAGACGGACAGTTAAGAAATAAAATGTTA 
CACAT 

{ SEQ ID NO: 332) 

Start ATG: 79 (Reverse strand: CAT) 

MFSTTTHSVPYLYLGNFSRKPHYYSVNRTKLHGSAGAARLSKSTSTSSRSHDLVLDLRNLLSRSSASIQGMVERAARLNGILDRRLVDDVLAKVT 
SMLPSMRDVRVTLEESATQIGRVQLQNYQFEVSLTGAAGSVPTGANVKVIPTITPGLLRPLFSQQQLNQIRGFKTDRSIEAEQKRNPTMTSRLKN 
ALANSPQRLDGDTPLQAEKLRRLLAKSEEHGFNKAESLKIAFAEGYLAAAWSEDSPKSGKTMKYLKTIVVIVVFLGIFLSFFTTSNGSVFRIQLG 
NQVEVDPEEINVTFEDVKGCDEAKQELKEVVEFLKSPEKFSNLGGKLPKGVLLVGPPGTGKTLLARAVAGEAKVPFFHAAGPEFDEVLVGQGARR 
VRDLFKAAKARAPCVIFIDEIDSVGAKRTNSVLHPYANQTINQLLSEMDGFHQNAGVIVLGATNRRDDLDQALLRPGRFDVEVMVSTPDFTGRKE 
ILSLYLTKILHDEIDLDMLARGTSGFTGADLENMINQAALRAAIDGAETVSMKHLETARDKVLMGPERKARLPDEEANTITAYHEGGHAIVAFYT 
KESHPLHKVTIMPRGPSLGHTAYIPEKERYHVTKAQLLAMMDTMMGGRAAEELVFGTDKITSGASSDLKQATSIATHMVRDWGMSDKVGLRTIEA 
SKGLGTGDTLGPHTIEAVDAEIKRILSDSYERAKAILRKHTREHKALAEALLKYETLDADDIKAILNESQT* 
(SEQ ID NO: 333) 

Name: paraplegin-like protein 
Classification: endopeptidase 



Celera Sequence No. : 142000013385217 

GCACTCGATCACAATCAGGTCGAAACTGGAGTTGGCATAGGAAGCTGTAATGCATAAACATCACTGATTTTAATCTGCTGACTCACTGGCTAGTC 
AAAAACAGCACACACACACCGCACCGCACGCAACTTACAAAACGAAAGGCGGTGCACATTTTCCAAAGCCACATCGCCGCCAGTGGCCGCCTTTA 
GTTGCTCCACGCGCTTGTCCAAGTCGGCGCTGTCCGTCCATATGTATAGGGATTTTTGTAGACCCTTAAAGTTCTCCATTGTGGATGTGTTTCTA 
GAATATCTCAAAAGTTCAAGCACCGAACGGAACCGATTTTTGATTTCACAAGTTAGATATGGCAATGCACCGAAAATCGAGCGGTGATATCGATT 
GTACCTGCCTATCGGATTGCAAATGATATTCAGTGATTGTAGATTTTTCCACTTTACACTGTGACTGCTAAAATATATGAATAACTAAGTGTACT 
CCACCAGATTCGTTTTTTAATGAGTTACAGCTTACATTCTGATACATAATTAACAGCTTTTAGTTGCATATTGTGTGTTTCTGCCACTAAACCTA 
TTTAAACGGTTACATTTAAGCCTAAAGTCGCGAATTATCTTTTAAATTCACCCATGTGTACTTTCCCACGGCCCGTTTTAAAAATGCATGGACAG 
TTAGACAAGAAGGTTGGTAGTTTGTACCTTGAATTTTAAATGTGGGCGCTCAAAATCGAAGTGTTCAATGAAAATCGATACCTTTCGCATCAGTT 
AAAAGTATTTATAGATATAAGAAATAATATCTTTAACAGTGATGTGATGGGAATAAATATACAAATGTATTCAGAAACTGCAAAGTTAAAACAAA 
ATATATTAGCTACTCCAAATTGGCTATAJ^AATAACCAAATGCCCACTGCCATCCAAATCCAACCAACAGTGCTGTGAAGCGACAAACATATCGCC 
GGTAGGCAGCGACTATCATAAATAAGCGCCTATCGGACTATCGATACAACATCAACATCCTCAGTCGCTTTTTTAGTTGTAGCATATTATAAATA 
TAACTCGAGCGCTAGTTGCAGCCTTAAACAGAAGCCAGCCGCCACTACCGCTAGTCGCTAGTCAAGATCTGCTCGAAACGCTGAAATATGTGTGA 
TTAGTCGTGAATCGCAGCTAGCCGCAAATCCAGCTTGAGCTACAACAACAAAGCCAATCCGCGGTTAAGAGGAAGAGAGTGCGTGAGAGAAACGA 
CAGCTGCTCTGCGTGTGTGTTTGTGCAGCAGGATAACAAAATAGAAAACATACAGCAGCAACAACAACAACAAAAACAGCACCAGCGAACAGTGT 
ATAAAGTGCCAAACGAATTTGCCAAATAGCCAAAAACGAAATAATAGAAAAGATCCAAACAAAATTCCAATAAGTTCAATGCAAGGAGACATGTG 
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CATAAATGAAGAGCTACAGCCAATTTAATTTAAACGCCGCCGCCCCGCCCGCCATTGCCTACGAAACTACGGTTGTAAATCCCAACGGCTCACCC 
CTCGATCCTCACCAGCAGCAGCAACAACAGAGCCAGGATATGCCGCACTTTGGACTGCCCGGTCCGCAGCCGCCGTCGTCGCAGCAGCAGCAACA 
ACAATTGCAAGTGCACCACCAACAGCAGCAGCAGCAGCAACAACAACAACAACAACAGCAGCATCAACAACAAATGCAGATGTCCTTGCTGCCGG 
GTCCTTACCGGCCACACATCGAGGAGAAGAAGCTCACGCGGGACGCCATGGAGAAGTACATGCGCGAGCGCAACGACATGGTCATCGTTATCCTG 
CACGCCAAGGTACGAAAGGATTTCCACTTTTAAGCCGGTATTATCAGGTGCTTGGTCTGGATTTAGTATTCAAGATATTCTAAAACATACAAGGG 
AACTCCTGAGTCTTAAGTCTTAATTAAATTATTTGAAGCTTGAAATACATTATTTCAATATTACTAAAAATATATTAACCCAAACTGGAGAAACG 
TAGGATATCCTAACTTTCATTTAAATTAGAAACTTGGTCAGTTTTGTACGTCGATTTCCTAAATTATTCGATTTTTTAGATTAAATAGATTAAAT 
TCAGATTTTCTTTCTATTTCAAGTTTAACGATGTATCCTTTCTTTGAAAATTACTTGTATGCTTTACAAGATATTTATTTTTTTTTTATAATTTA 
GTTTTAATCTAATCGGTCAGTCCGCAACGCCAAAACCACACCCACTATTTAAAGGTGCTAGTGTTTCCATTGTCCACTTTAAAGTGTTACATTTT 
TGCGGTTTTTGGCGAACGCAGACATGAAAATGGAATAAAATTGTCGCGGTTGCTTTTAGCGGTGGCGATCTTCAAGCGACGACTGCCGCCAAATG 
GGAATCCTTCACACTCGACTGCCCGCAAATTGAGCAACGCGAGCGCGGTCCAGTCCGCAATGAAAATAATTGAAATTAATTACAGAAAATTAACA 
AAGCTGCATTGCTTGCGGTTTTGATATGATTTTCCGATAATCTGGACGGCAATTTGTACTAATTGAATTTTCCCTGCACCATCGTGCGCCTCTGT 
CTGTCTGTCTGTGTGAATTCCAGGTGGCCCAAAAGTCCTATGGCAATGAGAAGCGATTCTTTTGCCCACCGCCGTGCATTTATCTGTTCGGAAGT 
GGCTGGCGCCGGCGGTACGAGGAGATGTTGCAGCAGGGCGAGGGCGAACAGGGAGCACAACTATGCGCGTTCATCGGTATCGGGAGCAGTGACCA 
GGATATGCAGCAGCTGGATCTCAATGGCAAGCAGTACTGTGCGGCCAAGACGCTCTTCATCTCGGACTCGGACAAGCGAAAACACTTTATGCTGT 
CGGTGAAGATGTTTTACGGAAACGGTCATGACATTGGCGTTTTTAACTCGAAACGGATCAAGGTTATATCCAAGCCGTCCAAAAAGAAACAGTCG 
CTAAAGAATGCCGATCTGTGCATAGCCAGCGGCACCAATGTAGCCCTGTTCAATCGCCTGCGTTCCCAGACTGTCTCCACCCGTTACCTTCACGT 
GGAGAACGGACACTTTCATGCGTCGTCAACACAATGGGGAGCTTTTACGATACACCTGCTGGATGACAATGAGTCCGAGTCGGAGGAGTTTCAGG 
TGCGCGATGGTTACATTCATTACGGAGCCACAGTCAAACTGGTGTGCTCCGTAACGGGCATGGCCCTGCCACGTCTGATCATCCGGAAGGTAGAC 
AAGCAGATGGCCCTGCTGGAAGCCGACGATCCCGTTTCTCAGCTGCACAAATGCGCCTTCTACATGAAGGATACGGATCGGATGTATCTCTGTTT 
GTCGCAGGAGAAAATCATACAGTTCCAGGCCACGCCGTGCCCCAAGGAGCCGAACAAGGAGATGATCAATGACGGCGCCTGCTGGACCATCATAT 
CCACCGACAAGGCTGAGTACCAGTTCTACGAGGGCATGGGTCCTGTGGCGTAAGTATAGCTTACTTACAACTCAATGGATTAGTTTTCATTTCGT 
GTTTTCTTCCAGTTCTCCAGTCACTCCAGTGCCAATCGTTAATTCCCTTAATCTCAATGGCGGCGGGGATGTGGCCATGCTAGAGCTGAGTGGCG 
ACAACTTTACGCCGCATCTGCAGGTGTGGTTTGGCGATGTGGAGGCGGAAACCATGTACCGCTGCACAGAGACATTGCTATGCGTGGTGCCGGAG 
ATTTCACAATTTCGAGGCGAATGGCTTTGGGTAGGTTTTCCAATGTAAATCTGTTTAATACGCGCTATAAATTTTACTTTTAATGGTCCTTGCAG 
GTACGTCAGCCCACGCAGGTGCCCATTTCGCTGGTGCGCAACGATGGAATAATTTATGCCACCGGTCTGACCTTCACGTATACACCTGAACCAGG 
TCCTCGGCCGCACTGCAATACACAGGCCGAGGATGTGATGCGAGCGCGACAGAACAATAACAATAATAACATCACTAGCATTAGCAACAATAATA 
ACAGCAACAATGCGGGATCGCCGGCAGCCGGCGGTGGCTTGCAACAACAACAGCAGCAGCATCAGGCTCTGCCCTCCATCTCAGAAGTGCAATGG 
AATAGTCATGGTAGCGGCTTATCCTGAGTGCAGGATTATTGCATCGCACACATATGTATCCTAAGTTATTTATAGTTTTTATCGAAGACCTTAGT 
TTGTAGATTGACAATTTTTCTCATACTAATTTTCAACCCCTCGTGACGTAAACCAAAATGTGTCCCCTTCTCTCTAAAGTGAATCATACTTTACA 
CATGTATCTAGTGCTAGTTCGGAACCAATAGAATTGTAAATGACACATAAAACTCTGTACATAACTCGAAAAGCTCACTGACATACATACATACA 
TTTGTATGTGCATGAATATACCTTTACACAAAACATGTAATTCTAGTTAAATATACAATTGATGTAAGTTTATAGACGATTTTTAGATGCGGTTG 
AAAT AT T T AT GAT T T T C TT T GC T AT G AAT T T C C AAAG AT CT AAAT G AAAAAAAAAAAT AC TGC G AAATGC ATCGGAAAT TT T TAT TAT AC AC AAG 
CCAGTAGCCACGCTACAAAAACATTCCATTTCCCTCTTTAGCTTTCTAAAAAGTCGAAGACAAAGTCAAAAAGGGAAAAATTTGTTTCTGTACAT 
ACAGTTTTGTTGCATTTTTTTATTAACTTTGGAAATTCGTAGCATTAAAAATCCCAATAAAATATTTTGTAAAATAAAATCGCAACAAGACAAAA 
CAGCGATTTATTTAAGTGTAAGCGAAATGCCTAACATAAATAAAAAAAAGAAATCCCAAATGATGACAAAGAATAATCAAATCATGTCAGCTTAT 
GTTTTTGGTTTAGCGTGATTCTTCTGGTTCACCATTTTTCCTGTGGCCTAAGGTGGAGCGCTGGGGAGGATTGGTTGCCTTAAATTGCTGTGGGA 
ATATTACCCAGTATCGTGTGAATATTTTTAAACAAAACAAGTGCAGCTCTTCCCACGATGGTTTGCCTCAACCCCCTCGGACGGGCCGACTTTAA 
TTATAGTTTGGTCTGGAATTGCTGAACGCGAAGCTAAGAAGCAAATATACATAGATATATAACACTATTGTTTCCGTTTTGGCCAAGTTAAGGAG 
CAGAATATTCTCACAGCCCAAAAAGCCGCTGCCGAGCTGCTGTGAGAATTTTTACTGCCTTTGTTGAATATTTATGCATTCGTTGCTGCTGCTGT 
CTTTACAGATTTACAAATTAAGATTCGCAGTTTTGAATTAATTATCGCTCACTTCCGCTCGACGGAATCAGAAGGTGAGCGCTGGCCAAGCATTT 
AATCTCATCTTGCACTTGGGAGACATATTTCTTATCAGAGTCAGCCAATAGGAACTTGGCTGCAACCCCTAAGATTTTACTGCACCGGGCGATGG 
GAACGAAAGACTTAAACAACGAACTTTGGTTGCGTGAGGAATCAAACTACTGGCGGCCGCCTGTGTTCATTATGATGGAAAAACCACCACCAAAA 
GTTCTGGCTCT G GC AAT T GG AACT G C AACC T C T T AAGGT C GTC GC C GTCT T AT AAG AAGC TG AAT AT GAAAT T A GT T T GG AAT T T T T AAT GAAC A 
ATTCCTTCACATTTCCCCAGGCCCTCTTTCTCCACCACTGTTTGGATAGGCGAGTGCGATGATTGTGGTCGTTTTTAGAAAGGGGGAGACAGCAA 
TTTCTAAAAGTTCTCAGGACTGCTACACAACCTCATACCGTCTACTAAAAAAGTTCATTATCTTGAAAAATCTAAGTCTACCTGCCGTCGTCTTT 
AGTCTTTAGCTTGCCGTGCCGACTGCTGCCGTTT 
(SEQ ID NO: 334) 

Exon: 1001. .1814 
Exon: 2589.. 3469 
Exon: 3528.-3735 
Exon: 3801.. 4829 
Start ATG: 1431 

Transcript No. : CT11789 

ATCAACATCCTCAGTCGCTTTTTTAGTTGTAGCATATTATAAATATAACTCGAGCGCTAGTTGCAGCCTTAAACAGAAGCCAGCCGCCACTACCG 
CTAGTCGCTAGTCAAGATCTGCTCGAAACGCTGAAATATGTGTGATTAGTCGTGAATCGCAGCTAGCCGCAAATCCAGCTTGAGCTACAACAACA 
AAGCCAATCCGCGGTTAAGAGGAAGAGAGTGCGTGAGAGAAACGACAGCTGCTCTGCGTGTGTGTTTGTGCAGCAGGATAACAAAATAGAAAACA 
TACAGCAGCAACAACAACAACAAAAACAGCACCAGCGAACAGTGTATAAAGTGCCAAACGAATTTGCCAAATAGCCAAAAACGAAATAATAGAAA 
AGATCCAAACAAAATTCCAATAAGTTCAATGCAAGGAGACATGTGCATAAATGAAGAGCTACAGCCAATTTAATTTAAACGCCGCCGCCCCGCCC 
GCCATTGCCTACGAAACTACGGTTGTAAATCCCAACGGCTCACCCCTCGATCCTCACCAGCAGCAGCAACAACAGAGCCAGGATATGCCGCACTT 
TGGACTGCCCGGTCCGCAGCCGCCGTCGTCGCAGCAGCAGCAACAACAATTGCAAGTGCACCACCAACAGCAGCAGCAGCAGCAACAACAACAAC 
AACAACAGCAGCATCAACAACAAATGCAGATGTCCTTGCTGCCGGGTCCTTACCGGCCACACATCGAGGAGAAGAAGCTCACGCGGGACGCCATG 
GAGAAGTACATGCGCGAGCGCAACGACATGGTCATCGTTATCCTGCACGCCAAGGTGGCCCAAAAGTCCTATGGCAATGAGAAGCGATTCTTTTG 
CCCACCGCCGTGCATTTATCTGTTCGGAAGTGGCTGGCGCCGGCGGTACGAGGAGATGTTGCAGCAGGGCGAGGGCGAACAGGGAGCACAACTAT 
GCGCGTTCATCGGTATCGGGAGCAGTGACCAGGATATGCAGCAGCTGGATCTCAATGGCAAGCAGTACTGTGCGGCCAAGACGCTCTTCATCTCG 
GACTCGGACAAGCGAAAACACTTTATGCTGTCGGTGAAGATGTTTTACGGAAACGGTCATGACATTGGCGTTTTTAACTCGAAACGGATCAAGGT 
TATATCCAAGCCGTCCAAAAAGAAACAGTCGCTAAAGAATGCCGATCTGTGCATAGCCAGCGGCACCAATGTAGCCCTGTTCAATCGCCTGCGTT 
CCCAGACTGTCTCCACCCGTTACCTTCACGTGGAGAACGGACACTTTCATGCGTCGTCAACACAATGGGGAGCTTTTACGATACACCTGCTGGAT 
GACAATGAGTCCGAGTCGGAGGAGTTTCAGGTGCGCGATGGTTACATTCATTACGGAGCCACAGTCAAACTGGTGTGCTCCGTAACGGGCATGGC 
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CCTGCCACGTCTGATCATCCGGAAGGTAGACAAGCAGATGGCCCTGCTGGAAGCCGACGATCCCGTTTCTCAGCTGCACAAATGCGCCTTCTACA 
TGAAGGATACGGATCGGATGTATCTCTGTTTGTCGCAGGAGAAAATCATACAGTTCCAGGCCACGCCGTGCCCCAAGGAGCCGAACAAGGAGATG 
ATCAATGACGGCGCCTGCTGGACCATCATATCCACCGACAAGGCTGAGTACCAGTTCTACGAGGGCATGGGTCCTGTGGCTTCTCCAGTCACTCC 
AGTGCCAATCGTTAATTCCCTTAATCTCAATGGCGGCGGGGATGTGGCCATGCTAGAGCTGAGTGGCGACAACTTTACGCCGCATCTGCAGGTGT 
GGTTTGGCGATGTGGAGGCGGAAACCATGTACCGCTGCACAGAGACATTGCTATGCGTGGTGCCGGAGATTTCACAATTTCGAGGCGAATGGCTT 
TGGGTACGTCAGCCCACGCAGGTGCCCATTTCGCTGGTGCGCAACGATGGAATAATTTATGCCACCGGTCTGACCTTCACGTATACACCTGAACC 
AGGTCCTCGGCCGCACTGCAATACACAGGCCGAGGATGTGATGCGAGCGCGACAGAACAATAACAATAATAACATCACTAGCATTAGCAACAATA 
ATAACAGCAACAATGCGGGATCGCCGGCAGCCGGCGGTGGCTTGCAACAACAACAGCAGCAGCATCAGGCTCTGCCCTCCATCTCAGAAGTGCAA 
TGGAATAGTCATGGTAGCGGCTTATCCTGAGTGCAGGATTATTGCATCGCACACATATGTATCCTAAGTTATTTATAGTTTTTATCGAAGACCTT 
AGTTTGTAGATTGACAATTTTTCTCATACTAATTTTCAACCCCTCGTGACGTAAACCAAAATGTGTCCCCTTCTCTCTAAAGTGAATCATACTTT 
ACACATGTATCTAGTGCTAGTTCGGAACCAATAGAATTGTAAATGACACATAAAACTCTGTACATAACTCGAAAAGCTCACTGACATACATACAT 
ACATTTGTATGTGCATGAATATACCTTTACACAAAACATGTAATTCTAGTTAAATATACAATTGATGTAAGTTTATAGACGATTTTTAGATGCGG 
TTGAAATATTTATGATTTTCTTTGCTATGAATTTCCAAAGATCTAAATGAAAAAAAAAAATACTGCGAAATGCATCGGAAATTTTTATTATACAC 
AAGCCAGTAGCCACGCTACAAAAACATTCCATTTCCCTCTTTAGCTTTCTAAAAAGTCGAAGACAAAGTCAAAAAGGGAAAAATTTGTTTCTGTA 
CATACAGTTTTGTTGCATTTTTTTATTAACTTTGGAAATTCGTAGCATTAAAAATCCCAATAAAATATTTTGTAAAATAAAATCGCAACAAGACA 
AAACAGCGATTTATTTAAGTGTAAGCGAAATGCCTAACATAAATAAAAAAAAGAAATCCCAAATGATGACAAAGAATAATCA 
(SEQ ID NO: 335) 

Start ATG: 431 

MKSYSQFNLNAAAPPAIAYETTVVNPNGSPLDPHQQQQQQSQDMPHFGLPGPQPPSSQQQQQQLQVHHQQQQQQQQQQQQQQHQQQMQMSLLPGP 

YRPHIEEKKLTRDAMEKYMRERNDMVIVILHAKVAQKSYGNEKRFFCPPPCIYLFGSGWRHRYEEMLQQGEGEQGAQLCAFIGIGSSDQDMQQLD 

LNGK Q YC AAKT L E I S D S D KRKH FML S VKMFYGNGH D I G V FN S KRI KVISKPSKKKQS LKN ADL C IAS GTN V ALFN RL RS QT VS T RY L H VEN GH FH 

ASSTQWGAFTIHLLDDNESESEEFQVRDGYIHYGATVKLVCSVTGMALPRLIIRKVDKQi^IALLEADDPVSQLHKCAFYMKDTDRMYLCLSQEKII 

QFQATPCPKEPNKEMINDGACWTIISTDKAEYQFYEGMGPVASPVTPVPIVNSLNLNGGGDVAMLELSGDNFTPHLQVWFGDVEAETMYRCTETL 

LCVVPEISQFRGEWLWVRQPTQVPISLVRNDGIIYATGLTFTYTPEPGPRPHCNTQAEDVMRARQNNMNNNITSISNNNNSNNAGSPAAGGGLQQ 

QQQQHQALPSISEVQWNSHGSGLS* 

(SEQ ID NO: 336) 

Name: J KAPPA- RECOMBINATION SIGNAL BINDING PR 
Classification: DNA_binding 
Gene Symbol: Su(H) 
FlyBase ID: FBgn0004837 



Celera Sequence No. : 142000013384832 

TGAACGGAAGCATACAGGATCAGAGTTCAGCAAAGTGCAATGCCAAAGCGGACTACAACAATGTCCACGAGGTCGGGCCATCGACCAGTGCGGAG 

GGGGCGCTGGCGCTCAATGCCGCCTGCAACTAGGGGCGTGGCTGTTGGCCCATATGCATATACAATACGAAGAGAGCCAGCAATTAGGGTGTAAG 

GC AT T T AG AGG AAAC ATC AAC AGT ACT TT AAAAT T TAGC T TG AAA TATGT T T AGC GAAAT GT AGC T CT T CC GGT C AGC C AG AGT AAC CGG AAAC G 

GAAACGTAAACGCCGGACGCGCATACAATACTCATCGACACACTCAAGTTGCCACTCGGACGAATGGACTGTTGTTTATTGTGCAATTCGCACGA 

GACGAACTC AC AGGCATTGCATTTTCAAGAAATTAGTATTACATAATTGTATCCGTATCTGTTAAGTACTCCGGGCGAAGA GAT CAATTC AAAAT 

GCGGTTGTGTCGTCGAAAAAAAAACAAACTCGAAAATACTCATGCACAGACTCTGAATAAGTTCCCCTTTGCAGAACGCAATTTACTCATCAGTC 

AGTATTCTCTATGGCCACTGTATTATTTTTATACCTTCATTTGTTAGCTTTAAGTTTTTTTTACACAACTACTACTTATACTTGTGCTGCCACCA 

GTTGGGCTGCCGTAGTTGCAGATCAATGTATCCATTGTCCATACATTGCGAGAATCTTTTATAATTCGTCTAAGCATAAGTTACTGTACGTCGTG 

TTGTATAGCGCTGTAATATCCTGTCACCTGCTTCTGCTTAATTAATACCCTATATCGGGCGAGAGCAAGGTACGATTAAACCATTAGATAATGAA 

ATACAAATACATAGTGTATAAACAGATGAACGACGTGCGTTTAAAGGGGGGTTCTCCGATCTCTCAGTCCCCGAAACAGGAATCAAAGTGGGCAA 

CTCTGCAAATGCTTAGTATTTAAAAATCAAGGGTTCTATCAATATATTTCATCCACTTGAACTTTTAGAAAGCTGTGATAAATCGAAATGAAGGA 

CCTGGACGGCTCCTTGGACACTCTGGAGAATGCCCGCTTCAACTACGTGTATATGAAGGACATTGCTCGCCTGGCAAAGGACTCGATCTTCTCGC 

ATAACGAGCTGATTAGCATTGTAATGCTCTACCATAAGTTTGTGCTGGTCAATGGGCCGAGAGCAAAGTACATGACCATTCAGCAACTCTCTGCG 

CTGATGGAGCTCTTGTTTGAGATCGTGGATCGCGATCTCATTGCGACCATTGTGTATAGAATAGCCCATACACCAGGTTCCAGGCCTCCTGACTT 

CTTTTCCGACAAGCATATACACTTGGAGTCCTTTGTGCGGCTTTTCACCGTATACTTCACCAAAGATCTTCAGCTGAAAATGGAATTCGCATTCT 

CGGTGGGCAATTCATAGTATTCATATGAAATTAGCCTAGACCATTTGCTTACATTGCAGGTCTACGATAAAAGCGATTCCAAGCAGTTGAATGGC 

GAGCAAGTTGGGTTCTTCGTCGGCAAGTTCTTTGAGAGCGAGGATGAAGACGAATCCATTGAGCTGCGCTTGGTGAGGCACCATCCCCTACACTT 

GTTCCATTGAAACACTGACACATTTCGCATTCCTTCAGGACATGAAGGAGATGCTGTTCCTCAAATTCGACTTGGACAAGGATACCAACATTGGG 

GTTGATGAGTACTACGAGGTGGTCCGCCGACAGCCCATGCTGCTGGAGTGCTTTGGTCGCGTGTTTCCCCCGAATCCCCAGATGGAGGTCCTTGC 

GCTGTGCGCCAATGTAATGTCTTGGTTTGACGATTCGCCCAATCCCAGGATTATGATAAAACCAGACGGCGGCAAGGCCAGCTAGCTAACAGGAC 

GTCCAGGAACGCATTTTTATCTAACCAGATACTTAAATTGTGACACGTCTAGAAATTGAAAAGTGCAAATAATTTAGTGCATCATAAAGGTTTAC 

ACTTTTATTAACTAGTTTTGTACGTTACTTTTGAATATTGGTTCTGAACGGTAAACTCGCTTCATTGAGGCTGCTTCGATAACGATACCGATTAG 

TCAGTGTGTCCGGATTGTCTCATCCCTGCTGCACGCGACCCACCATTTTAGTAACACCGAAGAAACGGTAGCGAAATGGCCACGGAGGAGCACCA 

GCGCCTGGCCAGCATCGTGAAGAGCTGCCACGAAAGCCTGCGCCAGTTGACTAAGGAGTATGGCGCCACAGCGGCCTGGCAGGAGCACACGAGTC 

CCAGGAACGCCAAGCAACTGGCGGAGTACGCCAAGGCGATGAAGCAACTGGCGGCCATATGGGAGACCAACGATGGGAAGGTGGAGCTGCAGGCA 

CGCAGCCGCATCAAGTGGGCCATCGACTACATCACCAAGTACTTCTTTACCGAGGGCATATATTTGCAGAAGCGCCAGCGGGAGCAGCGACTCCT 

GGAATCCTACAGAGCCGAGGGAAAACTGGGCGAGGTGCAGTGCAGGCTGATGGAGGAGCCGCCGGACAGGCTGCACGTCCTGGATGTGGGCAGCT 

GCTTCAATCCCTTTTCCAGTGCTCCGCACTTGGAGGTAACCGCCCTGGACTTGTGTCCCGCCACCGAAGATGTGCTGCAGGCGGATTTCCTCAAA 

GTGGAGGTGGTACCCGGAATACGAGAACCTGAGCTGGAGGAGGGCAGCGTAAGAAGGCTACCAGCAAGCCACTACGAGTGCGTCATTTTCAGTTT 

GCTACTGGAATACATGCCCAGCGCCGAGCAGAGACTACAGTGCTGCCTTCAGGCCTACGACCTGCTGCTGCCCGAGGGCATCCTTGTGCTCATCA 

CACCGGACTCCCAACATGTTGGCAAAAACGCCCACCTCAT 

{ SEQ ID NO: 337) 

Exon: 1001.. 1427 
Exon: 1485.-1592 
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Exon: 1654.. 1890 
Start ATG: 1038 

Transcript No. : CT11950 

ATCCACTTGAACTTTTAGAAAGCTGTGATAAATCGAAATGAAGGACCTGGACGGCTCCTTGGACACTCTGGAGAATGCCCGCTTCAACTACGTGT 
ATATGAAGGACATTGCTCGCCTGGCAAAGGACTCGATCTTCTCGCATAACGAGCTGATTAGCATTGTAATGCTCTACCATAAGTTTGTGCTGGTC 
AATGGGCCGAGAGCAAAGTACATGACCATTCAGCAACTCTCTGCGCTGATGGAGCTCTTGTTTGAGATCGTGGATCGCGATCTCATTGCGACCAT 
TGTGTATAGAATAGCCCATACACCAGGTTCCAGGCCTCCTGACTTCTTTTCCGACAAGCATATACACTTGGAGTCCTTTGTGCGGCTTTTCACCG 
TATACTTCACCAAAGATCTTCAGCTGAAAATGGAATTCGCATTCTCGGTCTACGATAAAAGCGATTCCAAGCAGTTGAATGGCGAGCAAGTTGGG 
TTCTTCGTCGGCAAGTTCTTTGAGAGCGAGGATGAAGACGAATCCATTGAGCTGCGCTTGGACATGAAGGAGATGCTGTTCCTCAAATTCGACTT 
GGACAAGGATACCAACATTGGGGTTGATGAGTACTACGAGGTGGTCCGCCGACAGCCCATGCTGCTGGAGTGCTTTGGTCGCGTGTTTCCCCCGA 
ATCCCCAGATGGAGGTCCTTGCGCTGTGCGCCAATGTAATGTCTTGGTTTGACGATTCGCCCAATCCCAGGATTATGATAAAACCAGACGGCGGC 
AAGGCCAGCTAG 
(SEQ ID NO: 338) 

Start ATG: 38 

MKDLDGSLDTLENARFNYVYMKDIARLAKDSIFSHNELISIVMLYHKFVLVNGPRAKYMTIQQLSALMELLFEIVDRDLIATIVYRIAHTPGSRP 
PDFFSDKHIHLESFVRLFTVYFTKDLQLKMEFAFSVYDKSDSKQLNGEQVGFFVGKFFESEDEDESIELRLDMKEMLFLKFDLDKDTNIGVDEYY 
EWRRQPMLLECFGRVFPPNPQMEVLALCANVMSWFDDSPNPRIMIRPDGGKAS* 
(SEQ ID NO: 339) 



Celera Sequence No. : 14200001338482 6 

TAACTGAGCTGTGTTTGAACTTACCCTCGTTGTACGCTTCGAAGGCCTTCATAAAGTCATTCAGACCCGCAGCGGCAGCTCCTTTGCGTGAGTAA 
CCCTTAGGCCAGGTGGGATTAAGCTGAATGGTCTTCTCCGCATCTTCTAGCGCCTCCTGGAACTTTCCAGCCTTGGCGAATGCCGCCGAACGGTT 
GCTGTAGAGCACATGGTTCTGGTCGTCCAGAGCGATGGCCTCTGTGTACGCCGCCACCGCCTCGTCGAACTTCTCGGCACTGAGCGCCTGGTTAC 
CCTTTTCCTTGAGTTCGTTCACCTGCGAAAACCGCAAGGGTTTTTGATTGGAAATGCCAACGCCTTTCCTTTGTCTCGGGCACTTACCTTGTCCA 
TTTCTGATAAGGGAGAGGATAGCGCTGTCGAATTACAGTTTTTCCTGTATTTTTGTACGACTTTTTAATAAAACACGCTGCGAAATGTGCCGTGC 
CAGGTGAAAATGCGAAAAATGCCGACTTTTCTAGAAGCAATCGATAGGGTTGTCAATGCTTCTGCGACTGAATTTAATCTTTTTCCAGACTGCGG 
TTTAGAACTTAGCTCGTTCTCAAGCGGGGAACAAAACCAGCAAGCCTTCTTAAATAATTTTTTCCATTGCTTATATAAATTAAAAAAATTGTTGA 
TCAAATATAAATAATACCTTGAGCTTTATAGAATTTAAATGTTATTTTATTGTATTGTTTATTTATGTTTATATTATATTATTATTTGCGTAAAT 
AATTTTCTCAATTTCTTATTTTGAATTGTAAAATTTTTCTAGTCAAATGAAAAATAAGTCCTCAGGCTTTTAAGTGCATTAATATTTAAATATTT 
GAAATTAATATTTAAAATGTTAGCATATTTTAAAACATTTAAATTAAGAACAGAGGTATCGAAAAACGATGCTGCCAAAAACACTAAATATCGAT 
AACTAATCATCGATTTTATTTGCAACTCTAGATTCGAGGCTGGCAACGCCGTGCAAAATACGACCAGACGCTGCTTTTTTCGGTTTCCGAAAATA 
ATTGAAAATTATCACTAAATAAACAGGCGGCTAACATAATTGCAGTTTGACCGCGTTGCAGAGACAAGCGGGTAGATAAGTTACACATTTTGGCC 
ACAGCAACACCACAGTAAAGCCGCCAGCATGGCCGAGTACTTGGCATCCATTTTTGGCACGGAAAAGGACAAGTAAGTCGCATGGCACATTTATT 
TTAACGTTGCCGGGATTAATAACTAAATGGCGTTGCATTTCCCAGGGTGAACTGTTCGTTCTACTTCAAGATCGGCGCCTGCCGCCACGGCGACC 
GGTGCTCTCGCATCCACAACAAACCCACTTTCTCGCAGACGGTGCTTCTCCAAAATCTATACGTGAACCCCCAAAACTCCGCCAAATCCGCGGAT 
GGCTCCCATCTGGTGGCCAACGTCTCCGACGAGGAGATGCAAGAACACTACGACAATTTTTTCGAGGACGTGTTCGTAGAGTGCGAGGACAAGTA 
CGGGGAAATCGAGGAGATGAACGTGTGCGACAACCTAGGCGACCATCTGGTCGGCAATGTGTACATCAAATTCCGTAACGAGGCTGATGCGGAAA 
AGGCGGCAAACGATTTGAACAACCGGTGGTTCGGTGGTCGACCGGTGTACTCGGAACTATCGCCGGTGACCGACTTCCGCGAGGCTTGCTGTCGG 
CAGTACGAGATGGGCGAATGTACCCGCTCCGGCTTCTGCAACTTCATGCACTTGAAGCCCATCTCGCGTGAGCTGCGAAGGTACCTCTACTCCCG 
CCGCCGTCGTGCCCGCTCCCGTTCCCGATCCCCTGGACGCCGTCGCGGCTCCCGCAGCAGGTCCCGATCCCCGGGTCGAAGAGGAGGCGGCAGAG 
GCGACGGTGTCGGCGGAGGAAACTACTTGAACAACGAGCGGGACAACATGCGTGGCAATGATCGCGGAAACGATCGCGATCGCCGCAAAGGTGGT 
GGCGGAGGAGGCGGTGGTGGTGGTGGACGGTATTAAATATTCTGGACATTTTTTCTGTACGGTGGCAATTGAGAGAGAAAGCAACGGACAAGCAG 
TAACGCAAAGCATCGTGTATTTTAGTCTCTGCGTTGGTTTTCCACGTTTTTATATCACTTTTGAGTAAACGAGATGGAAGGAGGAGGAGGAAAAT 
GACGATGGGGAGTTCCACCACCTATGTACAAACCCCCCGCGAAATATTTTACCGAGCGCCCATTAGAATTGTAATTTACTATACATACATTCATA 
ATAATAAAATTGAGTAATATTATACTGGAAGCCATAGTAAGAAATGGGTCTAATTGTCAAAAAGTCCAGGCACTCAAAATAAAAGACTCCTCACA 
GGTATGGATAACATCCAGAAGACACCACCCCTTGTTAGGCCTGCAGTCTTATTTCGCTTTGTAAGATTGAATCTTACCACGTTGTTTTAATTATA 
TTTCCATGCGCAGATGGCTTCACCAACTGAAAACGATCAACAGCGTAAATGATGCTGTCAACTGTCTTAAGACGAAATCTTAAAATCTTCAAAAA 
TGCGTACATTTGAACAACGAAATTCTTGTTTTCCCCGACTTAGTCTTAGATTTTAATAGGTGCATATTTAGGGGCATATTTGTGATCAAGCTATG 
TCAGTTGGCGGGCTTCCATCGTGATTAGTCGTTCCACCCTGAGGAACACCTCTGAAAGCCTCGCCTTCATTTTGCACTGCTGGCTTCTCCGCTTG 
AAGATTACTCTGTGCGCGATGCGTCGATGGAGGTACGGGCGTGTCAGAGGAGTTGCTAGACGCCTCCCTGGCTTTCAGTTGCTTGGTCATGTCTA 
TTTGCTGCTGCAGCGTGTAGGTATTCTCGTGTTTGCGCCGTTGTTGCTGCTCAACGTCCCTCAGGACACCATCGTGCAGTCTCAGGCTGGAAGTT 
AGAGATCGGAGGCTATAACTTTATTCCGGAGTATTTGGCTTACTTACCGGTCTGCCGATTGTTTATAGTGCACATAACCTATGATCGCTGTGGAA 
ATGGAGACCGCCAGTCCCAAAGTCACTCGCGAAGCGGTCGACATGACTGGGCACCGGTCTGCCGCTGGAGATTTCCGCCTAAAATGCCTGCACTA 
GGAATTTGGCTGTCGGGTTGTGTTGTTTTGAAGTCGGTTCGGGTTACGACATGACTTTAAATTTGAAAGAAAAACAACAGCACCACACAGCTGAC 
AAGAGAGTGGCCAGTTTACGGAAGCAAAGTTTTCCCTATCCAGAGTTAGTGCGCGGGAAAATCAAATTAATTCCCTTC 
(SEQ ID NO: 340) 

Exon: 1001.. 1212 
Exon: 1281.. 2308 
Start ATG: 1169 

Transcript No. : CT11994 

GTGCAAAATACGACCAGACGCTGCTTTTTTCGGTTTCCGAAAATAATTGAAAATTATCACTAAATAAACAGGCGGCTAACATAATTGCAGTTTGA 
CCGCGTTGCAGAGACAAGCGGGTAGATAAGTTACACATTTTGGCCACAGCAACACCACAGTAAAGCCGCCAGCATGGCCGAGTACTTGGCATCCA 
TTTTTGGCACGGAAAAGGACAAGGTGAACTGTTCGTTCTACTTCAAGATCGGCGCCTGCCGCCACGGCGACCGGTGCTCTCGCATCCACAACAAA 
CCCACTTTCTCGCAGACGGTGCTTCTCCAAAATCTATACGTGAACCCCCAAAACTCCGCCAAATCCGCGGATGGCTCCCATCTGGTGGCCAACGT 
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CTCCGACGAGGAGATGCAAGAACACTACGACAATTTTTTCGAGGACGTGTTCGTAGAGTGCGAGGACAAGTACGGGGAAATCGAGGAGATGAACG 
TGTGCGACAACCTAGGCGACCATCTGGTCGGCAATGTGrACATCAAATTCCGTAACGAGGCTGATGCGGAAAAGGCGGCAAACGATTTGAACAAC 
CGGTGGTTCGGTGGTCGACCGGTGTACTCGGAACTATCGCCGGTGACCGACTTCCGCGAGGCTTGCTGTCGGCAGTACGAGATGGGCGAATGTAC 
CCGCTCCGGCTTCTGCAACTTCATGCACTTGAAGCCCATCTCGCGTGAGCTGCGAAGGTACCTCTACTCCCGCCGCCGTCGTGCCCGCTCCCGTT 
CCCGATCCCCTGGACGCCGTCGCGGCTCCCGCAGCAGGTCCCGATCCCCGGGTCGAAGAGGAGGCGGCAGAGGCGACGGTGTCGGCGGAGGAAAC 
TACTTGAACAACGAGCGGGACAACATGCGTGGCAATGATCGCGGAAACGATCGCGATCGCCGCAAAGGTGGTGGCGGAGGAGGCGGTGGTGGTGG 
TGGACGGTATTAAATATTCTGGACATTTTTTCTGTACGGTGGCAATTGAGAGAGAAAGCAACGGACAAGCAGTAACGCAAAGCATCGTGTATTTT 
AGTCTCTGCGTTGGTTTTCCACGTTTTTATATCACTTTTGAGTAAACGAGATGGAAGGAGGAGGAGGAAAATGACGATGGGGAGTTCCACCACCT 
ATGTACAAACCCCCCGCGAAATATTTTACCGAGCGCCCATTAGAATTGTAATTTACTATACATACATTCATAATAATAAAATTGAGTAATATTAT 
ACTGG 

(SEQ ID NO: 341) 
Start ATG: 169 

MAEYLASIFGTEKDKVMCSFYFKIGACRHGDRCSRIHNKPTFSQTVLLQWLYVNPQNSAKSADGSHLVANVSDEEMQEHYDNFFEDVFVECEDKY 
GEIEEMNVCDNLGDHLVGNVYIKFRNEADAEKAANDLNMRWFGGRPVYSELSPVTDFREACCRQYEMGECTRSGFCNFMHLKPISRELRRYLYSR 
RRRARSRSRSPGRRRGSRSRSRSPGRRGGGRGDGVGGGNYLNNERDNMRGNDRGNDRDRRKGGGGGGGGGGGRY* 
{SEQ ID NO: 342) 

Name: RRM-type RNA binding protein 
Classification: RNA_binding 
Gene Symbol: U2af38 
FlyBase ID: FBgn0017457 



Celera Sequence No. : 142000013384832 

TCGATCTTCTCGCATAACGAGCTGATTAGCATTGTAATGCTCTACCATAAGTTTGTGCTGGTCAATGGGCCGAGAGCAAAGTACATGACCATTCA 
GCAACTCTCTGCGCTGATGGAGCTCTTGTTTGAGATCGTGGATCGCGATCTCATTGCGACCATTGTGTATAGAATAGCCCATACACCAGGTTCCA 
GGCCTCCTGACTTCTTTTCCGACAAGCATATACACTTGGAGTCCTTTGTGCGGCTTTTCACCGTATACTTCACCAAAGATCTTCAGCTGAAAATG 
GAATTCGCATTCTCGGTGGGCAATTCATAGTATTCATATGAAATTAGCCTAGACCATTTGCTTACATTGCAGGTCTACGATAAAAGCGATTCCAA 
GCAGTTGAATGGCGAGCAAGTTGGGTTCTTCGTCGGCAAGTTCTTTGAGAGCGAGGATGAAGACGAATCCATTGAGCTGCGCTTGGTGAGGCACC 
ATCCCCTACACTTGTTCCATTGAAACACTGACACATTTCGCATTCCTTCAGGACATGAAGGAGATGCTGTTCCTCAAATTCGACTTGGACAAGGA 
TACCAACATTGGGGTTGATGAGTACTACGAGGTGGTCCGCCGACAGCCCATGCTGCTGGAGTGCTTTGGTCGCGTGTTTCCCCCGAATCCCCAGA 
TGGAGGTCCTTGCGCTGTGCGCCAATGTAATGTCTTGGTTTGACGATTCGCCCAATCCCAGGATTATGATAAAACCAGACGGCGGCAAGGCCAGC 
TAGCTAACAGGACGTCCAGGAACGCATTTTTATCTAACCAGATACTTAAATTGTGACACGTCTAGAAATTGAAAAGTGCAAATAATTTAGTGCAT 
CATAAAGGTTTACACTTTTATTAACTAGTTTTGTACGTTACTTTTGAATATTGGTTCTGAACGGTAAACTCGCTTCATTGAGGCTGCTTCGATAA 
CGATACCGATTAGTCAGTGTGTCCGGATTGTCTCATCCCTGCTGCACGCGACCCACCATTTTAGTAACACCGAAGAAACGGTAGCGAAATGGCCA 
CGGAGGAGCACCAGCGCCTGGCCAGCATCGTGAAGAGCTGCCACGAAAGCCTGCGCCAGTTGACTAAGGAGTATGGCGCCACAGCGGCCTGGCAG 
GAGCACACGAGTCCCAGGAACGCCAAGCAACTGGCGGAGTACGCCAAGGCGATGAAGCAACTGGCGGCCATATGGGAGACCAACGATGGGAAGGT 
GGAGCTGCAGGCACGCAGCCGCATCAAGTGGGCCATCGACTACATCACCAAGTACTTCTTTACCGAGGGCATATATTTGCAGAAGCGCCAGCGGG 
AGCAGCGACTCCTGGAATCCTACAGAGCCGAGGGAAAACTGGGCGAGGTGCAGTGCAGGCTGATGGAGGAGCCGCCGGACAGGCTGCACGTCCTG 
GATGTGGGCAGCTGCTTCAATCCCTTTTCCAGTGCTCCGCACTTGGAGGTAACCGCCCTGGACTTGTGTCCCGCCACCGAAGATGTGCTGCAGGC 
GGATTTCCTCAAAGTGGAGGTGGTACCCGGAATACGAGAACCTGAGCTGGAGGAGGGCAGCGTAAGAAGGCTACCAGCAAGCCACTACGAGTGCG 
TCATTTTCAGTTTGCTACTGGAATACATGCCCAGCGCCGAGCAGAGACTACAGTGCTGCCTTCAGGCCTACGACCTGCTGCTGCCCGAGGGCATC 
CTTGTGCTCArCACACCGGACTCCCAACATGTTGGCAAAAACGCCCACCTCATGAAAAACTGGCGCTACTCCCTGGCCAGGATTGGACTGCTACG 
GGTACGCTTCGAGAAGTTGCCGCACATATCGTGCATGGTGTTCCGCAAGGCCATCAGCCGGGAGCTCTCACAGCACTGGGCGAGCATTCACCGGG 
AGGAGGGGATGTGCGAGGAAATTCGGATACCGCAGGACGATAGCTAGATAAGATTCAGCCCCCATAAATCGATGTAAATATGTCCACGTCTACTT 
ATTTATGGCTTAACTTATCTGTGTTAAGTGCGAGTTAATAATAATGGTTGACGATTTATCTACTATCTATGGATGTCGTAAATGGACAAATGCTA 
TTGCTTCTGTTCCGACTGTGGGAACAGCGCGCCGCTGGTCTGGCGCTTCTTCATCGCTGTGGTGGCGCCCTGCGAAATCTTCATGATGTTCGGCG 
GGAGCGGTTTCTTGCGATCCGCCTCCCACTCGGCTCGGTGGAGCCGCTGACGGTGCTTGTACATGTTGGCGTTGGAAAAGAACGTCATGGGGCAG 
TTAGGACAGGTGTACAGCACCTCTCCGGTGTGCGTTGATGTGTGCTCCTGTAAATTAGGTGTCGAATACTTAGATTTCTCTGTAAGTGAGCCCCA 
CACTTACCCTCAGATCCTGTGGACGCTTAAATGCCTTTTCGCACATCCCGCACTTGAACTTTCGCTCACAGAGGTGGTTGTGGTAGATGTGGCGC 
TTCAGGGCTCGGGAATTGGTGGATGTCTTGTTGCAGATATGACAGCGGTGCTCGTCGCCCGGCGGCTCGTGTACCGTTTTCATGTGCTGCTTTAG 
TCCGGACAAGTGGCGCAGCCAGGTGCCGCAGATCTCGCACTGCACGGGATCGGGTTTTGGACGCGGTTTCTCGGAATGTTCAAGCTCGTGATGCT 
TTTTGAGATTCGTTTGTGAACGTAGCGTCTTGCCACACTTATCGCAGATGATGGCCGCTTCCGGGTCGTGCATGTAACGCAAGTGCTCTTTCAGT 
TTGCGTTCCGTAGGAATCCTGAAAGCATTAAGGGATTACTTGGGGGGTTGCTCATATGCCTCATACCTGATGGACAATTACTTCTTGTTGCACTC 
CGGACAAGAGTATTTGAACTCGGATTTCGGGACATGCTTGGAAACGTCATGATACTCCATGGCCGCCTTGGTTGTGAAGCTCTTGGGGCACTTCT 
CG 

{ SEQ ID NO: 343) 

Exon: 1001.. 1947 
Start ATG: 1039 

Transcript No. : CT12002 

ACCCACCATTTTAGTAACACCGAAGAAACGGTAGCGAAATGGCCACGGAGGAGCACCAGCGCCTGGCCAGCATCGTGAAGAGCTGCCACGAAAGC 
CTGCGCCAGTTGACTAAGGAGTATGGCGCCACAGCGGCCTGGCAGGAGCACACGAGTCCCAGGAACGCCAAGCAACTGGCGGAGTACGCCAAGGC 
GATGAAGCAACTGGCGGCCATATGGGAGACCAACGATGGGAAGGTGGAGCTGCAGGCACGCAGCCGCATCAAGTGGGCCATCGACTACATCACCA 
AGTACTTCTTTACCGAGGGCATATATTTGCAGAAGCGCCAGCGGGAGCAGCGACTCCTGGAATCCTACAGAGCCGAGGGAAAACTGGGCGAGGTG 
CAGTGCAGGCTGATGGAGGAGCCGCCGGACAGGCTGCACGTCCTGGATGTGGGCAGCTGCTTCAATCCCTTTTCCAGTGCTCCGCACTTGGAGGT 
AACCGCCCTGGACTTGTGTCCCGCCACCGAAGATGTGCTGCAGGCGGATTTCCTCAAAGTGGAGGTGGTACCCGGAATACGAGAACCTGAGCTGG 
AGGAGGGCAGCGTAAGAAGGCTACCAGCAAGCCACTACGAGTGCGTCATTTTCAGTTTGCTACTGGAATACATGCCCAGCGCCGAGCAGAGACTA 
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CAGTGCTGCCTTCAGGCCTACGACCTGCTGCTGCCCGAGGGCATCCTTGTGCTCATCACACCGGACTCCCAACATGTTGGCAAAAACGCCCACCT 
CATGAAAAACTGGCGCTACTCCCTGGCCAGGATTGGACTGCTACGGGTACGCTTCGAGAAGTTGCCGCACATATCGTGCATGGTGTTCCGCAAGG 
CCATCAGCCGGGAGCTCTCACAGCACTGGGCGAGCATTCACCGGGAGGAGGGGATGTGCGAGGAAATTCGGATACCGCAGGACGATAGCTAG 
(SEQ ID NO: 344) 

Start ATG: 39 

MATEEHQRLASIVKSCHESLRQLTKEYGATAAWQEHTSPRNAKQLAEYAKAMKQLAAIWETNDGKVELQARSRIKWAIDYITKYFFTEGIYLQKR 
QREQRLLESYRAEGKLGEVQCRLMEEPPDRLHVLDVGSCFNPFSSAPHLEVTALDLCPATEDVLQADFLKVEVVPGIRSPELEEGSVRRLPASHY 
ECVIFSLLLEYMPSAEQRLQCCLQAYDLLLPEGILVLITPDSQHVGKNAHLMKNWRYSLARIGLLRVRFEKLPHISCMVFRKAISRELSQHWASI 
HREEGMCEEIRI PQDDS * 
{SEQ ID NO: 345) 



Celera Sequence No. ; 142000013384831 

CTTTCGGGGGAGAGGTGGTGAAAATCCAGTGGGTTTATCCTCACACAAATATCCTCACATTAATGGCACATGCAATTTTCCCCTACATTTGGCAC 
CTAGCACCCAGCTTTCACCCGCAGTACCGTTATACCGTTCCGTTTTACCGTTTTACCGCCGCGTTGTGTTCACAGGACCGCACAAGCTCAGACTG 
AAGCGACGAAGTGAAGCTGCGTGAGCGAGCGGCTGGTTGCTTTTACGCTTTAATACTAAAAAAAAAGGAAAACGGTCCTTTTTGCTTTCGTTCGC 
TCGCGCGCACAGCTTCCTTTGCTTCCTCTTTGTCGTCGGAGTGTAGACTTTCTCGCCGCTCTCTTTGCGCAACAATGAAACTCTTGAAAGTCTCC 
AAAATGAACAGACCGTTTTAATATCTTGGCCATATTTCGT^ACGCATTCCGATAAACATTTACTCCAAAGTCAAGTATGTGAATCTGTTAAGGTAA 
ATAGGGATGTAATCTGCCCATCCAATTACAATAGCATGCTTTTAGGCTGCTTATAAATTATCCCAGAGTTCTTCGAACCATATTTATGGCGGCGT 
CCACATATAAATATAGTTAAGTTATCGCAACAGTCTTCAACGATCTTTGGTATTATCAGATGACCTGATTCCCTTGGTATTTTAAAATAAGTGTG 
AAT G AAT C AC AT G AAT G AAT T C AAAAT T TT CT AAT AAG AT TTC T AC AAG TTT AAG AAC ATC AC C AACC AT CGT A T GT T T AAT GT C AC C ACC AAGG 
GTGCCCGTATTACGACATAACGTGACCACATATTTTAGGGAATTTTTAAAAAATAAGCATAGAAATTATCTCTTATAACTAAAGTCATGTCCACT 
TTGCTTTGAGTATTAATTTAATTTGAGAAATTATATTGCGTCGCGTAAGCTGCCACCACTGAGGGTGACCGTACTTGCGACCGGTGTGCCCGCAC 
CGCGCAATCACCAGCACTTCGGCCGTTTCCCAGCACTGCCTCACCGAACGTCTGGCCAGCGACCATCTTGTGAAAAGTACTACGAGTGTGGAGAG 
GAGTTGGCTTAGGAATTGTACAATTACTTCTGCGGCGAACAGTTAAAAAGCATTAAAAATGTCGATTTTTTCCGCCCGCCTGGCGTCCTCGGTCG 
CCCGTAACCTGCCCAAGGCTGCCAACCAGGTACCGCACCGAATTCCAGAAATTAAATCGCCCGACGAAGAGTTGGCGAAAAAGTGTAATACATAA 
CCGAATCGATGGGGTACTTTGCAGAAAATTAGTGCATCCGGGTGGTTCAAAGATCGATCGAACTGTGGATAAAGACTTTCTGGATGCGTAGCAAG 
TATTCGGGTGGGATTTTGTACCATTTTCCAACAAGAATTTAGTGTTACGCAATGCGGCGAATTGCAAGGCGGCCGCTGAAATTGTGGGTTACATA 
ATCAGTGCCAACCGAGAATTTGCTAAGCAAAACAGAGTTTTTGGTTATTTTTTCAGTAAAAGTGTGCGAAGAACGCTGCAATGTCTTTGTTTTTG 
TCCTCACATTACATAATTGTGATGTGGGTGAGATGCGGTGCCCGAAAAGTTTGAGGTTAGGTGCGGCTGTCAAGCAAGTGGTGAAGGTGGAAAGA 
ATACAAAGTGGGGAAGTGCCGACATCCAGATACCCTGTAATCAGTTTTTTTTTTTATTAACTAGCTAGAAAGTACTGGAGAGAAGTTATTGAACG 
ATCGTACCCTGCGGTTATCCAACGACACGGACCTACACCTTACATAATCTCGCATAATCATATCGATTAACCTACTCTCTTACAGGTCGCCTGCA 
AAGCCGCTTATCCCGCCGCCAGTCTTGCTGCCCGCAAGCTCCATGTGGCCAGCACGCAGCGTAGCGCTGAGATCTCCAACATCCTGGAGGAGCGC 
ATCCTGGGCGTCGCCCCCAAGGCTGATCTGGAGGAAACCGGCCGTGTGCTGAGCATCGGTGACGGTATCGCCCGTGTCTACGGTCTGAACAACAT 
CCAGGCCGATGAGATGGTGGAGTTCTCCTCCGGACTGAAGGGCATGGCCCTTAACTTGGAGCCCGACAACGTCGGTGTTGTGGTCTTCGGTAACG 
ATAAGCTGATCAAGCAGGGCGATATCGTCAAGCGTACCGGTGCCATCGTGGATGTGCCCGTCGGTGATGAGCTGCTGGGTCGCGTCGTCGATGCC 
CTGGGAAATGCCATCGACGGCAAGGGTGCCATCAACACCAAGGACCGTTTCCGTGTGGGAATCAAGGCCCCCGGCATCATCCCCCGTGTGTCCGT 
GCGCGAGCCCATGCAGACCGGTATCAAGGCCGTCGACTCCCTGGTGCCCATCGGTCGTGGTCAGCGTGAGCTGATCATTGGCGATCGTCAGACTG 
GGTAAGATTCTGGAGATAACCAAACCATGACACACCATTTCTCAATTGCTAGCAAGCAAAATTTAACCAAACAATTTTGTATTTAATCTGCAGTA 
AGACCGCTCTGGCCATCGACACCATCATCAACCAGAAGCGCTTCAACGAGGCCCAGGACGAGTCCAAGAAGCTGTACTGCATCTACGTCGCCATT 
GGCCAGAAGCGTTCCACCGTCGCCCAGATCGTGAAGCGTCTGACCGACTCCGGCGCCATGGGCTACTCCGTGATCGTGTCGGCCACCGCCTCCGA 
CGCTGCTCCTCTGCAGTACTTGGCCCCCTACTCCGGATGCGCCATGGGAGAGTACTTCCGCGACAAGGGCAAGCACGCCCTGATCATCTACGATG 
ATTTGTCCAAGCAGGCTGTGGCCTACCGTCAGATGTCCCTGTTGCTGCGTCGTCCCCCAGGTCGTGAGGCCTACCCCGGTGATGTGTTCTACCTG 
CATTCGCGTCTGCTGGAGCGTGCCGCCAAGATGTCCCCCGCCATGGGAGGCGGTTCCCTGACTGCTCTGCCCGTGATCGAGACCCAGGCTGGCGA 
TGTGTCCGCCTACATTCCAACCAACGTCATCTCGATTACCGACGGACAGATCTTCTTGGAGACCGAGTTGTTCTACAAGGGTATCCGCCCTGCCA 
TTAACGTCGGTCTGTCCGTGTCCCGTGTGGGTTCCGCTGCCCAGACCAAGGCCATGAAGCAGGTGGCCGGTTCCATGAAGCTGGAGTTGGCCCAG 
TACCGTGAGGTCGCTGCCTTCGCCCAGTTCGGTTCCGATCTGGATGCCGCCACCCAGCAGCTGCTGAACCGTGGTGTGCGCCTCACTGAGCTGCT 
CAAGCAGGGTCAGTACGTGCCCATGGCCATTGAGGATCAGGTGTGTATGCCAGGACTTACCTAGCAACTAACTTATATCCGTAGTGACAACGCGG 
ATATCGCGATTCCCGTGTAAAACCCAACCTCTAATCACCCAACATCAAATCTTCCAGGTCGCCGTTATCTACTGCGGTGTGCGCGGTCATCTGGA 
CAAGATGGATCCCGCCAAGATCACCAAGTTCGAGAAGGAGTTCTTGCAGCACATCAAGACCTCCGAGCAGGCTCTGCTCGACACCATCGCCAAGG 
ACGGTGCTATCTCGGAGGCGTCCGATGCCAAGCTGAAGGACATTGTTGCCAAGTTCATGTCCACCTTCCAGGGTTAAGCATCAGCAGCAAACGAA 
GAAT AGG A GT AGC GGT GCG T GC AAT T AAC AT CAT AG AT GGC T C G AAT AAT GC T GC C AT CC C T GC G C GC C ATC AAC AAC AAC AAAT AAC AAC CT AC 
AACTTAAGATAAACAGAAATTATAAAACAAAAACAAGAGAAAACTCCTTTGCACATTCATGTTTTCCTGCGCATCCTGACTGGATCGATCGCCTT 
CAATCCGTGGTTCGGTGCCATTATGCCGCAAAATCTTGGGAGAGAGAGAGCCGGCCCTGGCTACGGAGTGGACGGCGAGCCCCAGCCGGATGCAC 
TTTCGAGTTTCAGCGAAAAATCCTTGCGGTTCCTCCAACCTCCAAAACGATAGATAATCCCCCGTTGTTCTTCTACCTAAAATATGTAATCAATA 
T T TG AAA TAT T GT AAT C GA GAA TAG T C GAAAT AAT GT C T G AGG ATG ATT G T G AAAC GAT GAT G AAT T AAAAC GAT ACT AAAT TAT AC T CT AAAT G 
AGTCCTGCTCACACTTCTTTGCTGGGATTATTTCCTTTTGCTTTCTTTTTTAATTCCCGATTTTGGTAATATAATTTAAAGAATTCTTGACAAAT 
TGTTATGTTCTAAGCCATTTGGAGATGTCTTGCCAGGAATCTTTGAGCATATTTTGTTAATAGGGGTTTTAAGCTTGGTTTCTTGAGGTTCTGTT 
GCCGAAATGGATTTCCCTAACTAATCCTGGTCCTTTCGGCTCAGCTGATCCGCCAGACGACGAGCAGATGTCCTTTTGCGTAACCGTAACCTCTT 
GACCCCCGTGCGAATTAGCCACCCCTTGAACTACCTGAACGACCGCCTGTCTGCTCCAAGGCGCAGGATTCAGAAAGGAACTCGAACCCGGTGGT 
TCGGAGTGGTTCATGCTGTGCATATTTCGGTAACAGCAAGCCAGAAGCGAGAGCAAGAAGTTTCCAAAAAGAGAAGAAGAAACAGAGGGTACCAA 
GACCCAAAAACTGTAAGCCCGATTTATACGGCATGGAGTTTTTATGACGCCGATTGCCACTGGCAGCCAACATAAAATATGCCAAAAGGGAGGAG 
CAGCGGAGGTGCCGAGAAAAATGACTTTCTTTTTACTACATTTGTTTGACCGTAAACAAGAATTAATGAGAGTCATAAAACCGTCAAACTTAAAT 
TTCGAACAGGCGCACAGCAAAGTCAGCAGGCCGTGTACAGTGGGCGCTCTATAAGGCGAACAAATGTTGAAACTCAAACTTTTTACCTAATGCAT 
TAATTGCTACTAACGAAAACTATTTGAAATATTCGTTCGATAATTCGCTTATATGTTTCTTAAGAACTTAACTATAAGGTTGCCTTTATTTGGCT 
TTGTATCGAGTAACTACTGTACATGAGAAAAGTGAGATAGACACCCACACACACACACGCACAACCATGTGCGAGCAATGTGCTCGGCAATTACC 
GTTGCCCATGGAACACGGTACCAAAAAAAGGGAGAGAAAAGCGTTAAA 
(SEQ ID NO: 346) 
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Exon: 1001.. 1169 
Exon: 1796.. 2376 
Exon: 2469.. 3270 
Exon: 3383. .4083 
Start ATG: 1104 

Transcript No. : CT12071 

TCTGGCCAGCGACCATCTTGTGAAAAGTACTACGAGTGTGGAGAGGAGTTGGCTTAGGMTTGTACAATTACTTCTGCGGCGAACAGTTTUW^G 
CATTAAAAATGTCGATTTTTTCCGCCCGCCTGGCGTCCTCGGTCGCCCGTAACCTGCCCAAGGCTGCCAACCAGGTCGCCTGCAAAGCCGCTTAT 
CCCGCCGCCAGTCTTGCTGCCCGCAAGCTCCATGTGGCCAGCACGCAGCGTAGCGCTGAGATCTCCAACATCCTGGAGGAGCGCATCCTGGGCGT 
CGCCCCCAAGGCTGATCTGGAGGAAACCGGCCGTGTGCTGAGCATCGGTGACGGTATCGCCCGTGTCTACGGTCTGAACAACATCCAGGCCGATG 
AGATGGTGGAGTTCTCCTCCGGACTGAAGGGCATGGCCCTTAACTTGGAGCCCGACAACGTCGGTGTTGTGGTCTTCGGTAACGATAAGCTGATC 
AAGCAGGGCGATATCGTCAAGCGTACCGGTGCCATCGTGGATGTGCCCGTCGGTGATGAGCTGCTGGGTCGCGTCGTCGATGCCCTGGGAAATGC 
CATCGACGGCAAGGGTGCCATCAACACCAAGGACCGTTTCCGTGTGGGAATCAAGGCCCCCGGCATCATCCCCCGTGTGTCCGTGCGCGAGCCCA 
TGCAGACCGGTATCAAGGCCGTCGACTCCCTGGTGCCCATCGGTCGTGGTCAGCGTGAGCTGATCATTGGCGATCGTCAGACTGGTAAGACCGCT 
CTGGCCATCGACACCATCATCAACCAGAAGCGCTTCAACGAGGCCCAGGACGAGTCCAAGAAGCTGTACTGCATCTACGTCGCCATTGGCCAGAA 
GCGTTCCACCGTCGCCCAGATCGTGAAGCGTCTGACCGACTCCGGCGCCATGGGCTACTCCGTGATCGTGTCGGCCACCGCCTCCGACGCTGCTC 
CTCTGCAGTACTTGGCCCCCTACTCCGGATGCGCCATGGGAGAGTACTTCCGCGACAAGGGCAAGCACGCCCTGATCATCTACGATGATTTGTCC 
AAGCAGGCTGTGGCCTACCGTCAGATGTCCCTGTTGCTGCGTCGTCCCCCAGGTCGTGAGGCCTACCCCGGTGATGTGTTCTACCTGCATTCGCG 
TCTGCTGGAGCGTGCCGCCAAGATGTCCCCCGCCATGGGAGGCGGTTCCCTGACTGCTCTGCCCGTGATCGAGACCCAGGCTGGCGATGTGTCCG 
CCTACATTCCAACCAACGTCATCTCGATTACCGACGGACAGATCTTCTTGGAGACCGAGTTGTTCTACAAGGGTATCCGCCCTGCCATTAACGTC 
GGTCTGTCCGTGTCCCGTGTGGGTTCCGCTGCCCAGACCAAGGCCATGAAGCAGGTGGCCGGTTCCATGAAGCTGGAGTTGGCCCAGTACCGTGA 
GGTCGCTGCCTTCGCCCAGTTCGGTTCCGATCTGGATGCCGCCACCCAGCAGCTGCTGAACCGTGGTGTGCGCCTCACTGAGCTGCTCAAGCAGG 
GTCAGTACGTGCCCATGGCCATTGAGGATCAGGTCGCCGTTATCTACTGCGGTGTGCGCGGTCATCTGGACAAGATGGATCCCGCCAAGATCACC 
AAGTTCGAGAAGGAGTTCTTGCAGCACATCAAGACCTCCGAGCAGGCTCTGCTCGACACCATCGCCAAGGACGGTGCTATCTCGGAGGCGTCCGA 
TGCCAAGCTGAAGGACATTGTTGCCAAGTTCATGTCCACCTTCCAGGGTTAAGCATCAGCAGCAAACGAAGAATAGGAGTAGCGGTGCGTGCAAT 
TAACATCATAGATGGCTCGAATAATGCTGCCATCCCTGCGCGCCATCAACAACAACAAATAACAACCTACAACTTAAGATAAACAGAAATTATAA 
AACAAAAACAAGAGAAAACTCCTTTGCACATTCATGTTTTCCTGCGCATCCTGACTGGATCGATCGCCTTCAATCCGTGGTTCGGTGCCATTATG 
CCGCAAAATCTTGGGAGAGAGAGAGCCGGCCCTGGCTACGGAGTGGACGGCGAGCCCCAGCCGGATGCACTTTCGAGTTTCAGCGAAAAATCCTT 
GCGGTTCCTCCAACCTCCAAAACGATAGATAATCCCCCGTTGTTCTTCTACCTAAAATATGTAATCAATATTTGAAATATTGTAATCGAGAATAG 
TCGAAAT AATGTCTG AGGATGATT GTGAAACGAT GAT GAATT AAAACGATACTAAAT T AT ACTCT AAA 
(SEQ ID NO: 347) 

Start ATG: 104 

MSIFSARLASSVARNLPKAANQVACKAAYPAASLAARKLHVASTQRSAEISNILEERILGVAPKADLEETGRVLSIGDGIARVYGLNNIQADEMV 
EFSSGLKGMALNLEPDNVGVVVFGNDKLIKQGDIVKRTGAIVDVPVGDELLGRVVDALGNAIDGKGAINTKDRFRVGIKAPGIIPRVSVREPMQT 
GIKAVDSLVPIGRGQRELIIGDRQTGKTALAIDTIINQKRFNEAQDESKKLYCIYVAIGQKRSTVAQIVKRLTDSGAMGYSVIVSATASDAAPLQ 
YLAPYSGCAMGEYFRDKGKHALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSPAMGGGSLTALPVIETQAGDVSAYI 
PTNVISITDGQIFLETELFYKGIRPAINVGLSVSRVGSAAQTKAMKQVAGSMKLELAQYREVAAFAQFGSDLDAATQQLLNRGVRLTELLKQGQY 
VPMAIEDQVAVIYCGVRGHLDKMDPAKITKFEKEFLQHIKTSEQALLDTIAKDGAISEASDAKLKDIVAKFMSTFQG* 
( SEQ ID NO: 348) 

Classification: en2yme 
Gene Symbol: blw 
FlyBase ID: FBgn0011211 



Celera Sequence No. : 142000013384093 

ACTTCTCGAGTAATACTTAGTGATAACTGCAACTTTATTTAACATGCTAAGCGCCAAAGCTTTATGAGTACATAATACCCATAGCTCGATAAAGC 
AATCATAGAAGACTTACCACAGTCAAGCTTATAATCTATAGGACCCCATCGTTGATGGGTAAATGCTTTTTATGTACAATCGTAGTTCCCCTAAA 
GACATGCCTTACAAATGCTAAGGGTTGCGTAGAGCCCTACTAGGTACGTGTTAGTAGTGTTAAGCGTGTGATTTAGTGTTAATATCACAAGGGTG 
TGTGGGGGAGTGTGGCCTGATTGAGTCACCGGGATGATTCAGTTAGGCACTTTCTCCGAGACGGTATCAAGACCTTTCACCGTGTTAGTAGCTGA 
AT C GC CT ACAA C T AAGG T T AC AT C ACT GT G AAAAAT C G AAAG AAG AG AGC AC AGC AAAAT ACGG GT T GAG GGT TTAGTTTTAGTT C AG AG AT GAT 
AACCAAATGTAGATTAGTAAATAGAAAGTAAGTACAGATGAGATAAGATTAGAATCATTAGGAAGTTAGGCCGATTTGGTTGAAAGCAGAGCGGG 
AAAAAACTTAATCACAAGCAATGGGTTAGCAGAGAATGCTTTTAAGGAAGAAGATGCAAATCGAGAGAGTTTGTGCATTCGAGTGTGTTCTGGAT 
GGTTATATGGTTGAATGGTTGGAATCAAAAATCGGGAACATACCTGGCGTACGTCGAGTTGGCGTTCTTTATCGAAAAGCTGTTCTGGCTTGGCA 
ACACAAAACGCTGCACCCACTGTAAACAGAGAATACCTTGAGTACCAGTTGATGAAGTAGGCATAAGCCCTCAAAGCCCCCAAGAACCATATGTG 
GTCAGTCACGTAATTTGATAAAGTCAGTCCGTTCAGTGCGGTATCTGTGATGCTTTAAAATGTGTCAAAAAGTTCGCCTAATTTGTGATTCTCTA 
AGCCCATTTGATAAGTGACCCAACTAGTTGCGGCCAAGTACTTACCCGGCTGTTCATCAAACGCTGATTGCCCATGCAGGCAGCCATGTGTATCA 
CAAGGTAGAGGTTGAACAGTATATACAAAATGGCAAAAATACCTGCAATTGGCAAAAGGAAAAATACATTAGATTCGGTTTAAATTGTGGCAATT 
GACTACTCACAGACTAGTGCCAGCAAGCCGCCTTTAATTTCCATATTCTCTACTTGCACCACCACAAAGTACAAATTAACGCCAATCACTACAAT 
GGTGAGCAGTATGGATACGATTTTGTTGCCCCTGTGAATACAAAAATTTAAAGAATGCACGATAATCAATTAAAAGTGTCTGTATCACTTACAGT 
CCATTAACAAACTCGCCCATTATGGCGGCGCACGAGGTGAAGGCGATGGTGGGTATGGCAGCGAATGGCAGCTGGAGCGACATCACCGCATTCAG 
GATATCGTTCATGCTAGTCAAATCTTCCATCTTGCTGAACATGGCCAGGCAGAAGGTGGGAATAATGGCAATGCAGCGCGTGACCAGCACGCGAC 
ACCAGCGCGGCCACTGCAGATTCAAGAAGCCTTCCATTGAGAACTGACCGGCATAGGTGCCCGTCATGGTGGAACTCTGGCCAGCAGCTAGAATG 
CCCACACCCCAAATGTACATGGCGACAGCTCCAAATGTGCAGCCAAGGAAAAGTCCGCCCTTGTATAGATCTGCATCGATGATCGCGGTTCCATT 
TACATTGTCCACAAAAGACATTTTGGCATCTTCGTACATCGACTTGTCTTTACACACTTCAACCTATTGGCCCACAAAGAGTTAGCAATCATTTT 
ACACATAAATTCAAAAGTCTACTTACCACATCATTATTAGTTTTGCCATACATGCCATGGGCAAACACAGCCACCACAAACAGGTTAATGATGAA 
GGACACAAACAAGGCTACCGACGCCTCGATGAAGAAGTAAAAATTCGCCTCACTAACTTTCTTTGTCTGGCGACGGTCTATATCGCGGGACTGTT 
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AACAGAAAATTGTTATGTATTGGTATCTAATTACGAAGTCACAAAAAGCTAGTTGTGCTATACCTTAACCAAGGCGGAATGCAGGTAGAGATTGT 
GCGGCATAATAACAGCACCCACGACACCCACGGCCTGCAGCAACACATTCGAATTGCAGTTAGAGCACCAGGGCACAAACATTCCCTCGAGAACT 
TCTCCTTGATTAGGGGCTGATACAATGTACTGAAAGATATGAAAAATAAATTAATTATGACTTAAAACCATTTAAAAAATGATTATGGGGAGAAA 
ACAAATTGTTCCTAGCTTATATTGAGATTTGGTTAGAAATAACAAAATACTTGTTACTCGTAGAAAAATCCCCTCGAGGCACCCCCACTTTTTTC 
CATTGTAATTATCTCCAAAAACCACCATCGGTAATTGTACAATTCTGCATTGTATAATCTAATTGATAAGACTGGCATTACACATCCCATCTGGA 
ATTGTTAGTCATGTCCGGCAGCATCATGTGCCTGGCAAACAAAAGATGAAGTATCCAGCACCCACGCTCAGAACCCCATTTAGTGTGCGAGTGAA 
GCTTATCTAATCATCTCCGTCTGGGGCAGAAAGCTCAAGTACAATTGTTTATAATCTGGCCCGTTAACATGATGCGGTCGTAAACTTAAGAGTAA 
GAATATGGGGCTAGTTAACGGGCTAACTTCATCCAGAAATCGACAAGTCAAATGTAAATTTACTACTAGCGATCATTAAAATAAACCTAAACAGG 
CATCTCACAAATTATTTGCCATTGCGGTCGTTGCAATTGTTTAAACAATTCGTATTTGCACTCACGTCTGGTAATTTATTCTCTTGACTACTTTA 
TTGTCATTTGGCAAGGGGATTTTTTGCTGATTTTGGCGATTACTTTTTATAGTTAAGTTATTTTTTTCAACTTTGCATTTTAAACTTACCTCATA 
GCCGAATGAGACAGCCATAATGGTGATGAGGGTGCCGAATAGGAACTCCAGCTTGCGTAGACCGTACTTGTCCAGAAACAGGAACGTGAACGTGT 
CAACAATGGTGATCAACACTCCGCCCCACAAAGGCACTCTACAAAATGGGGAAAGCACCGCATGCATGAGTACTGGGTACTGCGATAATCCGCTC 
ATTAGATTTGCTTACACTTTGTTGGACAGCAGGTATATGGCAATGGCCGTGCCAATGACCTCCTGCATGTCGGATCCAATAATGGCTATCTCTAT 
CATTATCCACAGTATCCAACGTGGCAGACGCTTGTACTGTCTGTAGCACATCTCGGCCAGATGCAGGCCTGTAACCACGCCCAGTCTGCCAGGAA 
GATAATAAACGCTTATATCATCATCTTTTTTGCAAGGTGTGGAGACCCAATTACCTTGCAGCCAAGCGTTGCATGAGCAGACCTAGGACTGTGGC 
CCACAGCAAGACCCACAAGATCTTGTACTTGGCCGCCGCACCAGACTGCATATCGGATTCGATGTTGCCGGGGTCCAGATACGCAATGGACATAA 
GAAAACCGGGTCCCGTGAAGGCCCACAGCTTGCGAAAGCTGAAACCCACCTGCAGATCACCAGATATTAGATAAAGTTTGAGTAGCATTGCTATT 
CAAGCGGGTCTTACGTTTGTGCTATCATCGTCGGGTATGAGGACCTTCTCGTCGCTAAAGTAGGCCTGCTTGGCGGCGGGTTTCAGGTAGGTCGT 
CTCGTTGAGTATCTGCTGGTGGTGCAGCTGGTTGCTCCGCTGACTTCCTCCGCCGGATGCGCCAGACGATCCGCCCGGTCCATCTCCACCGGCAC 
CCGGCTCGTGGTAGGCCTCATTCGAAGACATGCTGCACAGAAGAGGCGAGGTTAGACTGGACCTAATTCGAGTTTCCACATGCTGCTATATTCCC 
GTCCACTGCCCATTGTCGACTACTCTAATTTGTATTTTCTATTTGATTGTTTCACCTGGCATCAGCTTATGATTCCCACCATTTTCATTTAATTG 
AGCATCTGTGTGTGATTCTAGCTCAATTAGCATTATCATATTGCACACAACAAACAAAATAATTTTGCGCTATAATTAAGTTTTTTGCACCATAT 
AGTTTTATTTGGGGATTTCACCATAATCAAGAGCAAAGTCACAGTAACACGTTTTATTTTTTCCGAAACGCGCGTGGCAAAATATGAAAACAATA 
AATAGTTCGAGAGGCATTAGCTCGGGGGTAACTGATAACGCGTCCATAATATTACTGAAATTTGCTTATATTATGCGGGTATATAAGGAAATAAA 
ACGAGCTCAGTGATTCATGAAGTCCAGTAATTGAAATGCCTCTGCCCAATTTACCCAGAGAACTTGCAGATATTTCGCATTTGCAGTCGTATAGA 
TATGGATGGTCGAAAACAAGACTACTTGAACATAAACACGAGTCCATTGACCCGTGTATCGATGTGAAATTTTGGACTGTTTATTTGGTAAAAGC 
GATTACAAGTGCATTTCGTATACAAGATGCCTGAGCAGTCTTGGCTGCTCTTCAACGCAATACATTGAATGTAGAGCCAAGTGGCTCACCTTGAC 
AAACGCGACTGCCCCACTGATGCTACATATCTGCTATAAGGCTTTCCACATCGCTTCATTGATGATTGAATCGGAGCAAGCAAAGACAAAGAGCT 
CGAGAGCATCGTGCAATGTCTACTTCACATTATATGTGGCACTTGGACGGGTACATTTGGCATATACTATAAGTAACCGAGTTGGCCATATCAAG 
CCGCCATTAATGGAGCTTAAATCGCGCTGCAGTGGCTCGAAGGAAATGTGTAACAACGAAATGTCGGAGGAGGGTGTTTAGTTCATTGGAACACT 
TTTGTGTTATATTGTGTGGCGCAGACTGCTCTGTTTACCCAGTTGTATGGCAATACCCACGCCCCCAAAAGCGGAATTCTAGCCGATAAGTGTAT 
AATCAGCCACATATTTATGTACGTGTTCCTTTGCCACTTTAGTGCCCAGTTGGACGCTGTAAATTATGACAATTGGTAATCAAAAAAAAAAAAAA 
CGAACCATTTCCATTAAGAGTCCAAATTAAGTGGCACCAAAAAATATCATGAGTTTATTTAGAGAAAATATATATTTTAAACTGCGTACCATTTT 
GCGCAGTCAGCGGGCAAAAGCAATTTATATTGTATATTGAAGCAGGCGAAACAAAACACTTGACAACAAAAAATTGACAAATGCAGATAACAGTA 
ATTTGAAAATAGAAGAGCAATACTTATCAGCCCACATTGTTAAGTCGCTTGTTAAAAAAAAAAAAACTAAAACAGTAATTATACGATTTCCGCTG 
CCTAAAATAGTTATTTCTATGACAGTTCTCGCTGGCAATAAAAGTAGTGAATTGTGTCTGGAAAACTATCCATTGTTAGGGGATTTTAAAGTGCG 
AATAGCTTGTCAACAATTGTTTTCTGTTTACGGTTATTTACGGTTAAGAATTATACATTACTCTCTTTGCCGGTTAGAGGGAGAGCGGACGGAGA 
GAGGGTGCACCACGTTGTTTTTGTTATTAGCGCACACCTGCGTGCGCACTCACACACACACACGAAATATTGATGGAGATGTAAATTATTGGGCA 
ACTCAGGTTAAACAGCACTTCCTACCAGTTAATGGCTTACTTGCAACTGGGGGTTAACTAATTGGAGCCCTCGCCTTTATTATATCACAATCTCG 
GCACTTGTGGCAGCTTATAAATGCATTTTCCTGCTTTTCTCGCGGAGCAATTTACGAAATTTACGAATTCCGTGACGTTGTACGGCCGAAACAAG 
AGTGGTGATGGTCGCAGTAAACGAAGCAGAGAAATGACGAAGAAGCGGACGGTTTGGCTGCCAATTGCTGACTTAGTTAACGACAATTGACGAGC 
GTGCAGGCAATTTGTGCTCAGCCGTGAGCGCAGCTCTTTGTTTATTAATTTTCCTCACAATATTTGAGATTCTGCCACCAGCTGCTTTCTTTTGC 
ACGCACACTCACTCGCCTCTCACATGCACACTGTCTTTTCGTCTGTGCCCAATGTGGCAACCTGCGGCCACTCTCTCTTCTTTGACTTGCCGAAA 
TCTGGCAACACTCGACAAGAGTACTAACCTTATCGGCCAAACTGAAGTTTTAACTCTGAGCCCACGCGATCGAGTGACTGACTATAGCATTTTGC 
GGTAGCCCCTATTTATTCAAAATATCTCTCTCTCGCGTCGGCAAGTGAGAAAAAGAAAGACAGAAAGAGAGAGAGAGAGAGTGTGTGCGTGGGAG 
AAAGACCACTGCTTTTTTTTGAGGGGAGATTTTTTGATAAGGCCTGAGATTTGGATTTTTGAAGGCCCCAGAAATACCGCCTAAGATTTGAACAT 
GCATTTGAAATACCGGTTTGTACATCGACCGCAGGTTACCGTTAGAAATGTTATGCGGGATACATAGTTAAGTTGCATACCCTTTGAGTTACAAT 
CACTAGTTAATAATATCTACGTTATTACCAACACGCACACTTTATCGTAATACCTCCTTGAAGTTTAATTTATACATCAACTTTATCAGTCAAAA 
CTTTGATTTCGTCTGACACTTTTTTCGATTACGATCCGTCGCCAATAATTGCGATAAATCTTATCAAGTCTTTTTGGGATTGGCGCTCAAATTTA 
CAATATGGCCGTACATCCTACTTATGTATGTTTTTTAACTAATTAATCACCACAATGCAAAGTACTCTTTCTTTGTTGAGCCCATATGCACTCAC 
ATTTGCACCATGAATCATGTCAGTAGCTCGTTTCATGTAACAATTTCTACTTTGCCAGATTACGATGCGTTCGGAACAGGCAGATAAGAATTCGG 
CCCATCCAAGAAAGGCCTTGACAGTTCTACCCCAAAATAGAGATATCCTCGTGATATTAGAAGGAACCCAACAATATGCTCGTTCTTATCTTCTT 
ATAGAAATTTGTGAATTCCCGTATCCAATGAAATCATTTTACTTAGTAAAATGATTTGTTAGGCCTTAAAAAAAAACAAAAAC 
(SEQ ID NO: 349) 

Exon: 6018.. 5646 
Exon: 3832. .3625 
Exon: 3564.. 3380 
Exon: 3315.. 3151 
Exon: 3078.. 2940 
Exon: 2214. .2059 
Exon: 1990.. 1832 
Exon: 1773.. 1001 

Start ATG: 3831 (Reverse strand: CAT) 
Transcript No. : CT12249 

CAGTGTGCATGTGAGAGGCGAGTGAGTGTGCGTGCAAAAGAAAGCAGCTGGTGGCAGAATCTCAAATATTGTGAGGAAAATTAATAAACAAAGAG 
CTGCGCTCACGGCTGAGCACAAATTGCCTGCACGCTCGTCAATTGTCGTTAACTAAGTCAGCAATTGGCAGCCAAACCGTCCGCTTCTTCGTCAT 
TTCTCTGCTTCGTTTACTGCGACCATCACCACTCTTGTTTCGGCCGTACAACGTCACGGAATTCGTAAATTTCGTAAATTGCTCCGCGAGAAAAG 
CAGGAAAATGCATTTATAAGCTGCCACAAGTGCCGAGATTGTGATATAATAAAGGCGAGGGCTCCAATTAGTT7AACCCCCAGTTGCAACATGTCT 
TCGAATGAGGCCTACCACGAGCCGGGTGCCGGTGGAGATGGACCGGGCGGATCGTCTGGCGCATCCGGCGGAGGAAGTCAGCGGAGCAACCAGCT 
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GCACCACCAGCAGATACTCAACGAGACGACCTACCTGAAACCCGCCGCCAAGCAGGCCTACTTTAGCGACGAGAAGGTCCTCATACCCGACGATG 

ATAGCACAAACGTGGGTTTCAGCTTTCGCAAGCTGTGGGCCTTCACGGGACCCGGTTTTCTTATGTCCATTGCGTATCTGGACCCCGGCAACATC 

GAATCCGATATGCAGTCTGGTGCGGCGGCCAAGTACAAGATCTTGTGGGTCTTGCTGTGGGCCACAGTCCTAGGTCTGCTCATGCAACGCTTGGC 

TGCAAGACTGGGCGTGGTTACAGGCCTGCATCTGGCCGAGATGTGCTACAGACAGTACAAGCGTCTGCCACGTTGGATACTGTGGATAATGATAG 

AGATAGCCATTATTGGATCCGACATGCAGGAGGTCATTGGCACGGCCATTGCCATATACCTGCTGTCCAACAAAGTAGTGCCTTTGTGGGGCGGA 

GTGTTGATCACCATTGTTGACACGTTCACGTTCCTGTTTCTGGACAAGTACGGTCTACGCAAGCTGGAGTTCCTATTCGGCACCCTCATCACCAT 

TATGGCTGTCTCATTCGGCTATGAGTACATTGTATCAGCCCCTAATCAAGGAGAAGTTCTCGAGGGAATGTTTGTGCCCTGGTGCTCTAACTGCA 

ATTCGAATGTGTTGCTGCAGGCCGTGGGTGTCGTGGGTGCTGTTATTATGCCGCACAATCTCTACCTGCATTCCGCCTTGGTTAAGTCCCGCGAT 

ATAGACCGTCGCCAGACAAAGAAAGTTAGTGAGGCGAATTTTTACTTCTTCATCGAGGCGTCGGTAGCCTTGTTTGTGTCCTTCATCATTAACCT 

GTTTGTGGTGGCTGTGTTTGCCCATGGCATGTATGGCAAAACTAATAATGATGTGGTTGAAGTGTGTAAAGACAAGTCGATGTACGAAGATGCCA 

AAATGTCTTTTGTGGACAATGTAAATGGAACCGCGATCATCGATGCAGATCTATACAAGGGCGGACTTTTCCTTGGCTGCACATTTGGAGCTGTC 

GCCATGTACATTTGGGGTGTGGGCATTCTAGCTGCTGGCCAGAGTTCCACCATGACGGGCACCTATGCCGGTCAGTTCTCAATGGAAGGCTTCTT 

GAATCTGCAGTGGCCGCGCTGGTGTCGCGTGCTGGTCACGCGCTGCATTGCCATTATTCCCACCTTCTGCCTGGCCATGTTCAGCAAGATGGAAG 

ATTTGACTAGCATGAACGATATCCTGAATGCGGTGATGTCGCTCCAGCTGCCATTCGCTGCCATACCCACCATCGCCTTCACCTCGTGCGCCGCC 

ATAATGGGCGAGTTTGTTAATGGACTGTAAGTGATACAGACACTTTTAATTGATTATCGTGCATTCTTTAAATTTTTGTATTCACAGGGGCAACA 

AAATCGTATCCATACTGCTCACCATTGTAGTGATTGGCGTTAATTTGTACTTTGTGGTGGTGCAAGTAGAGAATATGGAAATTAAAGGCGGCTTG 

CTGGCACTAGTCTGTGAGTAGTCAATTGCCACAATTTAAACCGAATCTAATGTATTTTTCCTTTTGCCAATTGCAGGTATTTTTGCCATTTTGTA 

TATACTGTTCAACCTCTACCTTGTGATACACATGGCTGCCTGCATGGGCAATCAGCGTTTGATGAACA 

(SEQ ID NO: 350) 

Start ATG; 375 (Reverse strand: CAT) 

MSSNEAYHEPGAGGDGPGGSSGASGGGSQRSNQLHHQQILNETTYLKPAAKQAYFSDEKVLIPDDDSTNVGFSFRKLWAFTGPGFLMSIAYLDPG 
KIESDMQSGAAAKYKILWVLLWATVLGLLMQRLAARLGVVTGLHLAEMCYRQYKRLPRWILWIMIEIAIIGSDMQEVIGTAIAIYLLSNKVVPLW 
GGVLITIVDTFTFLFLDKYGLRKLEFLFGTLITIMAVSFGYEYIVSAPNQGEVLEGMFVPWCSNCNSNVLLQAVGVVGAVIMPHNLYLHSALVKS 
RDIDRRQTKKVSEANFYFFIEASVALFVSFIINLFVVAVFAHGMYGKTNWDVVEVCKDKSMYEDAKMSFVDNVNGTAIIDADLYKGGLFLGCTFG 
AVAMYIWGVGILAAGQSSTMTGTYAGQFSMEGFLNLQWPRWCRVLVTRCIAIIPTFCLAMFSKMEDLTSMNDILNAVMSLQLPFAAIPTIAFTSC 
AAIMGEFVNGL* 
(SEQ ID NO: 351) 

Classification: known_f lybase_gene 
Gene Symbol : Mvl 
FlyBase ID: FBgn0011672 



Celera Sequence No. : 142000013383801 

CCAACCGTCGCAGCCAATGGATGGGTGATCTCCATCTATAACCATCAAAGGCCGGAGGAACGTCTCCCTTTACATCTGCATAAGCACTTTCTCTT 
TCTCTTTCTGTTTCTGTGCCTGGAACATTTTGGGCAATACTAGTCCTCAAAAAACTAGTCTACAATATATTCAAGGTTTCTACGCAATTTACATT 
ACTTTGCAGGCCCGCTGAAATCTACTCTAATATTAGGCTTGTATGATGTCGTTATTTGCTCTCGATTCTTGTTAATTTCTACGTAGTCTCATCTA 
AATAATACTTCAAATAGTATATACATGGCCAGCCAAGTCAACAACAATTGTTGCAATGTCCGACGACAGCGTTGCGGTTGCTTCCCACCTCATCA 
CGAAGCGCACCGAACCAACCAAGACTAACTTTCCGTATCTTATGATCGTGGCTATTATATATAGTTGATGTGTGTATTTACAAATGAGGACAAAC 
CCAATTATCTCCAAGGATCTGGGGGATTCACGAATCGCAAACTGAATTAATGTAAATGTAAGTGTAAGTTGAATGAAATGCATTTTGTTTTTTTG 
CGGGGGGATATTATATATGGATATATGGTACTGTGTGTGTGAGCGGCAAAAAATAAATAGATCGAGATATCACCCGCCACTGAGTTAAGTATTTT 
TCACATTATCACGTTTCATTGAGTTTGTTTCGTGTGTGGGAGGGAACATATAAACTCGAACGCATTAATTGTAGTTTCATTATGGTGAATCTAAC 
TTTAATAAATGGGAAATATGTACCAATATTTATAAATGTAGTGGTAACCTTCATAGTTGCTGAGAACGAATGACATCATGATGAATTTCTTCAAT 
TTATAAAATAGTTCTTTCTTATTACTTTAGGGAGAAACTTAGCACTGAAGCACAAATAAGGTTCGCAGACAGTTGAGTGTCATTTTAATCTAACC 
ATACTTAATCGTATTCATCTTAACATAGAACTTATAAAAGGTTTCTTTGTGATTTATCTTCATAATTCATGGAAATTTGATTACTTATGTTAAAA 
AAAAAAATTATATATTATTTATATATATGCATTTACATATACATAATAGGTGATTTTCTTAGAGATTCCACTTCCAGGTAATCATCAAATTGTCA 
CAGAAAAAAAAATAAATCAAAACAATTTATGCTTTGGTTTTTCCAATCACAGAAATCTTCTGCAGACATTTTGCCCACGGCTGGCATGAAAATCG 
CACGAAAAAATCCTCAAATTTCACAGAATCTCCACAAGATTTCGTTTTTTTTTTTTCGAAATGCACTAGGAACAATGGTTTCTCTTCTCTTCGAT 
TAACTATGAAAACCTCTACTGAAAGTTCAAGAGATTTCTGTGATAGTTGGATTTTTAGTTGGACATTGATTGGTACACAAATATTTCTAATACAA 
AAG AGT T AC AC C GCTC AT A C GG AG A TGC T AAAT AT GAAT GC AAAAAAAAAAAGG AAAAAAG AG AT T C AAAAG AGC TC T AT GC G C ACT GCT T AAAA 
AGAATACAAAAAAAAAAAAAAAAAAAGATTAACCATTCCACCATGATTTTTAACTTATTCTTTGATCAAAATACGATTGATATTAACTTTTCCAG 
ACAAATTTCTGAACAAATTGTGACGGTGTTCTGTTACCGGAAAGCGAAACCTCTAAAGCGGGCACACCCTTATATATATAAATTTATATTCAACA 
CACATTGGCCAACCTGATAATAGATCAACCAACGTTATCATGTTTCTTTCCTCCATCGCGACAACGATCGTGTGTATAAATAGTGAAGAGTATTT 
ACTGCTAAACCTATTGGTTCTCGTCATCATCTCCATCGCCGTCGTCGTCGTCGTGCTGCTCTTCCAGCTGCTGGTGCTGCTGCTGGGCAGCGGCG 
GCAAGTAACGCGGCTTTCTGTGCCTGCGCCTGCATCGACGCCTGCATCGACGGTGCGGACATGGCTATTTTCCCGCTCGTCTTGTACTCGGTCAT 
CTCCTATTAAAGGCAAAAGTGCACATGTCAATCGGGTCAATCCAAGCGGTCATTTATATATCTAAAACATACGCGTTCATATCGAGCCTTGTCCC 
GCTCGGCCATCGACTCGTACTTCTGCTTGACCTCGGGATCCACATCGGACCACTTTCGTCCCAGCTCTTTGGCGATGTCGCCCACTCCGAACTCG 
GGATTAAGTGCCTTCACCTTATTTCTCTCGTCATTGCAGAACCAGAAGAACGCAGACCTAAACAGCGGCAGAATAGAGCATAGTAAATCATTATT 
ATTGATCTGATTCGCTTCATTATCTCTTGCAGTCACATTGATTTTTATGCACTATAGCGTATAATATGATAATTGACTTTGCTACGAACTCACAA 
TGAACGTTTTGGAGCATTGGGGTCCTTGATCTGTTTCCTCTTCTTGCCGCGTCCCACAACCGCTCCCTTGGGCGGCACATAGTTCTGCATTTCCG 
CCTCATATCGCTGCTTGTCCTTCTCGGCCATTTCGTGGAAGCGCTTCTTCTCCTTGTCCACCATTGTCTAGACGAACGGGATAGATGGCAATCCA 
CTTTAAAACAAGGAGTCCTTACGTCATTTGATCATTTCATTTACTTACCTTCCACCGTTCGGCGCACTTCCGCGAGAATTCGGCGAATATCACCG 
TCTCATCGGGATGTTTCTTTTTGTGCTCCTCTCTGCAAGTTTGTACAAAGTATGCGTAGGCGGTCATTCGGCCTCTGGGCTTGGCATCGGCCTTG 
ACTCTGCTCATGGGGCTCTGTTGGTTTTTGGCCACGAAAACGAGAACGAGAAAGAGATCGAGAACGGAAACGGGATTAGCTGCTGACTAAGGGCT 
AATGTAAGCATGTGGCATATTCAACTTACCGCCGAATTGATGACATTCTGTTGCTGCTGCTGCTGTTGCATCTGTTGTTGCTGCTGGTGCTGTTG 
CTGCTGCTGCTGTTGTTGCTGTTGCTGTTGCTGCTGCTGGTGCTGCAGTTGCGCCGCTGTATTGGCATCGACCTGTCCCTGATTGGCGGCCGAAT 
ACCACCACTGGGAGCCGGGTACGGCTGCTGCTGCGGCTGCCTGTGCCATATTTAGACGATAATCATAGCCCGTGGTGCCGGCTGCGTTCGATGTG 
GCCACCACTTGGGCCATCGTCGTGGCCGGATTCGGAAAACTGCTGGCCATTTTGTATGTGAACAGCTGTTGGGTGCCCACGCCGGCTGTGGTATT 
GGAGCTGTTCTGGTGCGCCTGCTGCTGATTCTGCTGGACGACCACCTGGTGTTGTTGCAACTGTTGCTGTTGTACTTGTTGTTGTTGCTGAGCAG 
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CCAATTGCTGCTGATAGTGTTGAATTGTTTGCTGATCAATAGTTGGTTGATGGATGTTCATTAGCATTATTAGTGCGATGCCAGTCGATTGGGCT 

ACAACAATACACTACAATCATTTGGCTTGACTCTCGATGTCTGGCATCGCACTAACCACTAAGATGGTCCTTAATACTAACGAAAGTGAAAACTG 

TGCATTTTTTGTTTGTGAGTTATGCGGTTTAATGAGTGCCTGATGTTTTTGTTGACTACTTGCTCACAAAATGACTAAATAAATAAATACCGAAT 

GCAGCGATTTTCTCTACATTGCGTTTACACATTGCTGATTTGTTAAAATATTTAACAGTTTTTACACTGGTAAAACTGAATTTTAGTAAGCTAAG 

CTAAGCTTCTTTAAATTATTTATTAAGTGTGAAAAATATTGTTGCCTAAAGCAAAAAGCTTAAATAGCATTCTAAAATCCTCCATTCATCATTAA 

TACATGCTAAAAAAAAAAAAGTATCTGTCGGAACTGGGGCGTTATCCTTATTATTTACTTTCCATGTAAATAAATATGTAGATACTCGTTATCTA 

CAGATACCGATAAATTGATGTCAATACCACACAATATAATACTAAATCCTGTCCAAAGTGAATTCTCTTGTTTCCCATTGAACGCGGAATTTCAA 

TGTCAATAATCTAGGTTGTTTTGTTTACAAAGTAATAGTAATAATAGCACCAGCTGTTAACCCATAGATAAAAAGCTTTTACATGAATCGGGCAA 

GAAAGTGGAACTATTGTTGGAGAAAGAAGAAGCAGAGCGAGTAGCACCGGTTCCGTTTTGTGAATCGGATTGGATTGTCTTCATTTCGATTCCGA 

TTCCAGGGCTTAAAGTGCAAAACCAGGGCGCCACAGTTACAAAACATGAATGAACCCATTCTTTTTGCCGAAAATAGCCCCATCCTGCTTGTCAA 

TTATGAAAAAGCAAAATACACTTTCCTACAACAGCCAAAAAAAAAAATAAATAAATATTAGAGCCATTTTACGTGGGGATACATATGTATGTAAC 

CACAGAAATATACATGTGCGCATAAATATTTGCATAGCCAAGTGTTTGCACAGCCATTCATTTGTATTTTATGCACACATGGATAAAAATAAGCC 

AATTTCGGATTACGCAAGCCTGTGATTTGGGTCTTTTCATATACAGGTTTTTATATGTGTATGTATTTTTTTTATGTACATCATATTTTCCTTAT 

TTTTGCACATTGTATCTATTTACGTACATCAATAGGCATGCAAAAAAACGTGCATCGCATTTTGCCAGCATATTTCATAAGAAACGAAAAGCAAT 

GCGGTTTACCGTTTACATCAAGGCATTCACGATTCTTGGTATAAATGTACATATTTGTATATTTATACAAACTAATAAGAAAAATGGCGGACTTG 

GATGCGCACGCAAAATACAATTTTGATTGTACAGCGTCGCTTTTTTGTTTGTATGCATACTTGCACATACATAGGCGTACACTTTTTGCATTTAG 

TAACATGCCGATTTACTTAATAACTAAAAACAAAAGCGAGACGCAATGGATGTACAAGCGCTTGCATGTACATACGTATGTAAGTATGTATGCAG 

CATATATATTCACAAAAAGTTTCCACTGCGGCTGAGAAAAGGCAATCAGGTGCCGAAATATCCAAATAAATAATAAATAAATAGGTGCCTGGCTT 

TTTTGTTAATTATACATTGTCGTTTGGCGGGAAAATGATTAAAACGCGGCAACCTATACATAAACGCGCCCATGATTGTATGTGTGTTGTGCCAT 

GAGTGTATATAAATGTGTGTTTGTGTGTGCGTGCGTGTGTGTTTTAAATAAAACGCCGTATACCTGCACACAAGCTTACCTGTATTTGATGAAAG 

TGTTCCATAACTGTGAGATTTCGGATCTAGTCCGAATGTTTCAATCGTGGATTGGCTTTATACTGGGATTTCCGTTCTATATCTTTCACTTTTAT 

TCCTAAATGGTTAAAGCATATATATTTAGTTTTAGCACAAGTACGTATGTATGTAATACGCACACCAACGCAAATTGGAGTCGAGTCTTAAGCCA 

GTTTTTTTGTCTGTTTTCCTTTTTTTAATACTGTTCTACGCAGTTTGTTGTACGTCGTATTGTATTGATTATTGCATGTATGTATGCATGTACGG 

CCACAAACGCGAATGTATGArGCACGTTACAAGGAAAAAAAGGAACACGGCACAAGCCAAAACTTCGAAATGAAATTCGAAAATTTAAGAATTTC 

ACAATGCACGTTGTTTTTTTGTACTTCTTTTTTTTTGTCTAGGCCTTTTTCGCTTATTTGTTGTTTGATTTTATTTGTTTTTTTGGTTGGGAAAA 

CGCGAAACGCGGGCGGCTTTGCGGATTTGTTTTGTCGTTTGTATTTAGCTTGTAAAAAATAAACAAACAGCGCTGCATACGCTGAGTTTATTGTA 

TCGTATCGTATCGCTTGTGTTTTGTTGGTGTGGTATGGTATATATGCTATACAGTACACAGTACACAGCACACAGTATGGTATCGTATCAATGCG 

AATGCGGAAAATACAATATATGAGGTTGGTTCGCGCGCCGCTCGTACCAACAACGAATGAATTGAATTGAATTGCCGTCTCTGCCTTACCTTCTC 

ACTGTGCTTCCAAATCCAAACGGGCGGATAAATGTGTATGCGTTGCGTTGCGTTGGACTGTGTGCGCAATTCTTTGCTGGTGTGCGTGTGTGCGG 

GAGAGCGAGAGCGCGAAGAGAGTCGTTATAGTCATTTTGTTGGTAGAATTAGTGTTGCTGAGAGAGCGCTCAGTGTCATATACATACATATGTAC 

ATACACGTGGTGGAGGGAAAAGGCCTTATAAAACTTTTTGGCCTGTTACTTTCATACAAACATACGATCACGGGTATGTACATAAACGCTGGCGT 

GCCAAACATACAAACTATGGAGTGTGTGGAGTGTGGGAAAAATACAGAACTGTATGTATGCATATGTACATTACACAGTTCTTAGGGCCAAAATT 

TATTTTAAACATACATATCTTATTTTTAACTGCCAGCTACATTCGTTAATGCGACGAATTCTACAAAACCGATATGCATCAATGCAAACACATTG 

ATACATAGCCAAGCATATACAACTGTGCTCATGCTAATAGTAGTGCATATGACCGCCACTTTCTAAATTGATGATTTTATATGAATGCTATGTTC 

AATTTGTCAAGCGGTTTTTCAGAATGAATAAGTTCATATTAATTACTGAAATTATATGTTTTGTTAAAGAGCAGTGCAGGTACAAAAAGCCATTT 

GTAACTGCTTTGCTTGTTTTGCGACACAGAAGACGAAGCGAAAAAAAAAATTAATTAACAAAAGGGAATACGGAAAAAAACCGTTTGATGCCTTT 

CAGTGCTATTTCCACGAACGCAGCTGTATGTACATACGTGTGGAACACACAAACATATGTACATCGCAGCGACCCGCTACATACACATACACATT 

TTCATTTTGTGTGTTCCCGTCTGACTGTACTTTCAATTCGAATTCGAAAGAGAAAAAAATGTTCTCTTATGTTTTAAAACAAATTCTCCCTCAAT 

TCTCTCTGTCATACA 

(SEQ ID NO: 352) 

Exon; 5950.. 5860 
Exon: 5322.. 5210 
Exon: 3356.. 2880 
Exon: 2772. .2614 
Exon: 2537., 2374 
Exon: 2242.. 2068 
Exon: 1998.. 1001 

Start ATG: 3185 (Reverse strand: CAT) 
Transcript Wo. : CT12419 

TCATTCGTTGTTGGTACGAGCGGCGCGCGAACCAACCTCATATATTGTATTTTCCGCATTCGCATTGATACGATACCATACTGTGTGCTGTGAAT 
AAAAGTGAAAGATATAGAACGGAAATCCCAGTATAAAGCCAATCCACGATTGAAACATTCGGACTAGATCCGAAATCTCACAGTTATGGAACACT 
TTCATCAAATACAGCAAACAATTCAACACTATCAGCAGCAATTGGCTGCTCAGCAACAACAACAAGTACAACAGCAACAGTTGCAACAACACCAG 
GTGGTCGTCCAGCAGAATCAGCAGCAGGCGCACCAGAACAGCTCCAATACCACAGCCGGCGTGGGCACCCAACAGCTGTTCACATACAAAATGGC 
CAGCAGTTTTCCGAATCCGGCCACGACGATGGCCCAAGTGGTGGCCACATCGAACGCAGCCGGCACCACGGGCTATGATTATCGTCTAAATATGG 
CACAGGCAGCCGCAGCAGCAGCCGTACCCGGCTCCCAGTGGTGGTATTCGGCCGCCAATCAGGGACAGGTCGATGCCAATACAGCGGCGCAACTG 
CAGCACCAGCAGCAGCAACAGCAACAGCAACAACAGCAGCAGCAGCAACAGCACCAGCAGCAACAACAGATGCAACAGCAGCAGCAGCAACAGAA 
TGTCATCAATTCGGCGAGCCCCATGAGCAGAGTCAAGGCCGATGCCAAGCCCAGAGGCCGAATGACCGCCTACGCATACTTTGTACAAACTTGCA 
GAGAGGAGCACAAAAAGAAACATCCCGATGAGACGGTGATATTCGCCGAATTCTCGCGGAAGTGCGCCGAACGGTGGAAGACAATGGTGGACAAG 
GAGAAGAAGCGCTTCCACGAAATGGCCGAGAAGGACAAGCAGCGATATGAGGCGGAAATGCAGAACTATGTGCCGCCCAAGGGAGCGGTTGTGGG 
ACGCGGCAAGAAGAGGAAACAGATCAAGGACCCCAATGCTCCAAAACGTTCATTGTCTGCGTTCTTCTGGTTCTGCAATGACGAGAGAAATAAGG 
TGAAGGCACTTAATCCCGAGTTCGGAGTGGGCGACATCGCCAAAGAGCTGGGACGAAAGTGGTCCGATGTGGATCCCGAGGTCAAGCAGAAGTAC 
GAGTCGATGGCCGAGCGGGACAAGGCTGGATATGAACGCGAGATGACCGAGTACAAGACGAGCGGGAAAATAGCCATGTCCGCACCGTCGATGCA 
GGCGTCGATGCAGGCGCAGGCACAGAAAGCCGCGTTACTTGCCGCCGCTGCCCAGCAGCAGCACCAGCAGCTGGAAGAGCAGCACGACGACGACG 
ACGGCGATGGAGATGATGACGAGAACCAATAGGTTTAGCAGTAAATACTCTTCACTATTTATACACACGATCGTTGTCGCGATGGAGGAAAGAAA 
CATGATAACGTTGGTTGATCTATTATCAGGTTGGCCAATGTGTGTTGAATATAAATTTATATATATAAGGGTGTGCCCGCTTTAGAGGTTTCGCT 
TTCCGGTAACAGAACACCGTCACAATTTGTTCAGAAATTTGTCTGGAAAAGTTAATATCAATCGTATTTTGATCAAAGAATAAGTTAAAAATCAT 
GGTGGAATGGTTAATCTTTTTTTTTTTTTTTTTTTGTATTCTTTTTAAGCAGTGCGCATAGAGCTCTTTTGAATCTCTTTTTTCCTTTTTTTTTT 
TGCATTCATATTTAGCATCTCCGTATGAGCGGTGTAACTCTTTTGTATTAGAAATATTTGTGTACCAATCAATGTCCAACTAAAAATCCAACTAT 
CACAGAAATCTCTTGAACTTTCAGTAGAGGTTTTCATAGTTAATCGAAGAGAAGAGAAACCATTGTTCCTAGTGCATTTCGAAAAAAAAAAAACG 
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AAATCTTGTGGAGATTCTGTGAAATTTGAGGATTTTTTCGTGCGATTTTCATGCCAGCCGTGGGCAAAATGTCTGCAGAAGATTTCTGTGATTGG 
AAAAACCAAAGCATAAATTGTTTTGATTTATTTTTTTTTCTGTGACAATTTGATGATTACCTGGAAGTGGAATCTCTAAGAAAATCACCTATTAT 
GTATATGTAAATGCATATATATAAATAATATATAATTTTTTTTTTTAACATAAGTAATCAAATTTCCATGAATTATGAAGATAAATC 
(SEQ ID NO: 353) 

Start ATG: 376 (Reverse strand: CAT) 

MASSFPNPATTMAQVVATSNAAGTTGYDYRLNMAQAAAAAAVPGSQWWYSAANQGQVDANTAAQLQHQQQQQQQQQQQQQQQHQQQQQMQQQQQQ 
QHVINSASPMSRVKADAKPRGPJXtTAYAYFVQTCREEHKKKHPDETVIFAEFSRKCAERWKTMVDKEKKRFHEmEKDKQRYEAEMQNYVPPKGAV 
VGRGKKRKQIKDPNAPKRSLSAFFWFCNDERNKVKALNPEFGVGDIAKELGRKWSDVDPEVKQKYESMAERDKARYEREMTEYKTSGKIAMSAPS 
MQASMQAQAQKAALLAAAAQQQHQQLEEQHDDDDGDGDDDENQ* 
(SEQ ID NO: 354) 

Name: HIGH MOBILITY GROUP PROTEIN BSS (BANG SENSELESS) 
Classification: DNA__binding 
Gene Symbol: bss 
FlyBase ID: FBgn0000231 



Celera Sequence No. : 142000013385217 

AAAAGTAGTATATATGATGCATAGTCCTCTGTTTAATTATTAAATTTAAATGTTTAATCATAGAATTTGCATCCAAGAAACACAGTTACAAAGGT 
TTTAACACTTACTATTCTTTCGGTTTTGGGTTTCTTTCACGATCCACTGATTTTTATATTTCTCCCTTTTGTTTTTCTTTATTTTAAATAAACGT 
TAACATAGTAAAAATAATAAAGCGTACATTTCGCTACGAAAAGTTGATTCGTAATTGAACAAAAAAATTTATGAGTAATGAAAATGCTTTTACTG 
TTCTCTCGATGAAAACAAACTATTGTAATAGTGTATTTTTCTTCTTTACAAAGAAGAATTTATCAGGCCGTGATGTTGAATATTATATATTTTAT 
CTTTAGTTTATTTTATTCGCCTTGATCACTGGTTAGCGACTGTAATTTGTTTTTATAAGGTAGGATGATTTTTTGATTAGGTGAGTGGTAATAAC 
ATAAAAATAAACAACTTCACATCAGGTTTTGGTTGATGCAATAAAACAATTTCCATGACCGCCCCTGCCGTTTTAGAAAAAATCTCGAAACAAAT 
CTCGTATTGTGAACAGCTACTAGATATAAAGTCCATAATGAGGATAACTTAGCCTGACACTGACTAGTTTACACGAGGTCACTTGCTTGATCGCG 
ATTTTATCGTGTAAAATCTGATTTAGTTGGCCACGTATAAACTAAAATTGCTTTTCGTGTGTTCTCTTTTCATATATCTTTAAACATATGTATAA 
TTTATTATCCGTTTTTTTTTACTTAGCTTGTTGGTTGGTATTTTTGTTGCTTTTGGCTTATAATGGCGGCTACAATTGTTTTCGTAATTTCCCCA 
TTTTTCTTTTAGTTTTTAAAAAGTTTGTTGTCTGTGATATACGTAATTTATAAGTTTTCATTTTGGTAACTTGCTGCTTTCATTCACCATCGTCA 
GTAACCAAAACTGCAATTTGAGGAACCTACTCCTGCCCAAAACGATCGATTTAAGCCAGAGTGGGAAACTGACGCTCATCGTTGACCTTGGGAGC 
GGTGTTCTGTTGTCGCTCAAAACGCTTATTCGGCCCTGGACCGCTGTTCTGATTGTTGTTCTGGTTGTTTCCCGGACCGCGACGATCGATGGGTG 
GCCTTTGATTTTGAGCGGATGAACCGCCGCCCTCGTTATTATGCTGCTGGCCATCCCGTGGGCCACGGTTTCCACCCTCACGATTATCTCGGCCG 
CCCTCACGATTGCCGCCATCGCGGTTGCCACCCTCGCTGCGTGGTCCGCCTCCAAATCCACCGGGTCTGGGGCCTCCACGGCCACCGCGTCCACC 
GAAACCGCCAGGTCCGCCACGACGACCATCGTTGAAATTGAACTGAATGTCCAGGACGCGCTGTTGCCTGCCCACACGCTGTGGATAAAGGGCAG 
GGTCGTACTCGAGCTCCTCTTCGCTGTCGTTCTCCTTCTTCTTGTTGCTAGTTAGGACGACCATTTTCTTCCATTGTGTGGTGTCCTCACCTGCG 
GGAAATATTGATTGTAGATTAATTTCTAGATGATAGGCCATCATTTAGAGTTAAGCTAAGATCGGTTTGATTTTCAATTTTATACTTCTTTTATA 
CGGATTTTAGCTCAATATGATTTGTAACTTTGGTTGTTGGCTTGGCGAAAAGGTTTCTTAGAATAACGGAAGTTAAACTATCATTCCGTGGTATC 
GAATTCAAATTATTAACCATAAAAAAATTATGCATTGTTCACGCACCTTCTCCAGCCTTGCGGATATTAAAGGTTGGCTTGATGCGCTGTCCTTG 
CTGTGCCTTCCACTCGTCCAAGGTCAGCTCTTTGGCTTCCTCCTCGGCGGTAGCCTGCTCGTTTTGTGGTTCAGTGCCAGATTCATCGGCCTTGG 
CGCCGCCCTCGGCATTGGTCACATTGGTTTCGCTTTCGTTCTTGTTCACATCATCAATAGCCTCCTTGACCGATCCCCAGTTGTGGGAGCCAGCT 
CCATCGCGTTTGTCAATTGACTTCACACCTGTGCATAAAGAAAGTATCAAATATTAATAATGATTTATGTCTGCTTAAATTCAGAAACGTCAAGA 
AGCGGAAAATCTGAGCTTCTATATTTGCAAATTTTAGCCACAGCTATAGTGCAACTTGCCTGCAAGCCCTAAAGTAAGTTTTTCTGCTTAATTTA 
TTAACTGAATACTTTAATTATTAGCCAAATAATGGTCTATTTTATCAATACCCCACTAATTAGGTCAATTGAGAAGTAACCACGATAGGGGGACT 
ATATTCACTGCCCTCGCTCGCATAGTCACAGATAAGCCGACAGTAGCAGCCCCAGTTCACTGTAATCAGATTCAGTGGGGGAACTATTCCAACCC 
CTGTTTCCCCCCAATGATTATGCCTGCAAAGGCCTCCTACAAGCACTTGCGTTAGACGGGAATGCCGCCAAGAAGATGGGCGAACTATTTTTAAC 
ACATTTTCTCTTTCGATTTGTGCCGCAGTGCTTCTTGTTTCTGTTGCTGCCGGCCGCTGTGATATTATAATTTTGGATATTCCAAAAACGAAGCG 
GTCGGTCTAAGCAGGCAGTGGAAAACGGGGGAAAACGCGGCGAAATAGATTTCATTCAATGGAAAGAAGTGTGATGTAGCTTGTTGAATGTTCTG 
TCCGACCATTAGTATTGATTTGACTCACTTATCCCCCACGTGAGTGGGAGATTTGCCTGTGTTGTTTGACCATTGCCCATAAGTTCATCGGGATT 
TTGTGTGCCACAAATTGGTTTACACCGTTCTCCCGCACTTTTCGTGAAATTGCTTGGCATAAACAATATATATTTAAACTGATTTATAATGCAAC 
TGTTTAAATATTACTATCCATTAAATAAGATTGGCAATCTGCTCGGAGCAGGCTAATAAATATTTTATACTTTGAAATACATTAAATTCATGATT 
TAGATTAGCTATAGAAATCTTAAATCAATTGATGGAAATTTCACATAGTTTATTAAAAGGAATCGCATTTCAAACACCGAACGAATTCCAGTTTC 
TTTTAAATATTTGTTGTGTACACAGAAGATTGGAATCATGTTGCTTGAAGGCGACTAATAATTTCTCATGACTATAAACATGGTAATTGGTTACG 
CGTAAACTGTGGATGGAATTTGTTTTTCCTGCGTTGCCTGCTGAGTTTTATTTTCTGGTTGTCATGGAGTGTGGCTTTTGGTTTATTTGTTAATT 
TTTTCGTATGGGCGCGCTCGTTTTATGGTTATGTCTACCCCTTGGTGGTTCAACTTTAGTTTCGATCTTATCAACAGAAACATTAACTATTCATA 
TGCGAAAAAATGGTCATTAATCTAAACGATTTACTCCCTAAATATCCCATCTTCTCTATCCTTTTTACTAAGTATACTTCTCTGTAAAGACTTTA 
ATGTGATCTACACTGTATCTACTTGACTTGACCCCTAACCCATTTGGTACCTTAATGCCTTAATGAAATGACCACACACTCACCAGTTCTATCGG 
AACCCGACTGGCGGTCGAACTCGCGCTTCCTGTTCTGGCCATCGTAGTTGCGGTTCTGACGCTGGGGACGATCTCCGCCAGCACCTGGGCCTCCA 
CCTGGTCCGCGGTAAGGTCGCGACTGCTGGCCATCGGGTGCTCCGACGTTTTCGCGGACATTGCGGCGATTGTTGCGCTGCTCACGAGTCTCGGC 
ATTGCCATTCTGTTGACGGAAGTTTACGCTGCGCTGCTGCTGGCCACCTTGTCCTTGACCCTGTCCTTGACCGCTACCAAATTCACGTGTTCGGT 
TGGCGCCACCTCCTTGCCGTGGAGCACCCTGTTGTTTGTAATTGCTGCTGGCGTTATTATTGAACTGCTGCTGTTTGTTGTCGGCCGATGGAGTG 
AACTTCTTGCCATCGGTTTTGTTTAACGCATTATTGGGCTTGTTCTCCTTTTCGGCCTGATTTGGTTTCTTGGCATTGGGTCCACCAATATTGGA 
TCCTGCGTTCGCCTTGTTATTGCTGTTGGCCACCTTGTTGGCGGTCGTTTTGGTGGCCGCTGCAGCAGCAGCTGACGGTTGCTTCTTCTTGCCAG 
CTGCTGCTGCGGCAGCAGCGGCCGTCGGCGCCACAAGATTATCTAATGGATCGCTGACATCGTCGTCCATGAACAAAAGTTCATAACGATTTTTA 
CCGGCGCTGTCCATGATTTTGTGGGTGGGTGTGTTTTTCGTTGCTAGCTGGCTGCCGCTTTAATTGCCTTTTTGCTCTGTAAAAAGAAATAAATA 
CACACAGTTTTAGTGGGAATTGATTGATTTTCCTTTTTCGCCCGCCGCCTCTCACACCCTGTGAGATTCGATAGGATTTCTTTTTCGTGTCACAA 
ACTATGGTCTAATATGGCATATATTCGCTGGATTTATGGTTTACGACATTAACACGTGGACTAACGTGAATTTACTCAATTTATTTAGGCCCTTT 
GGGTGGATTTTTCTTACTTTTCTGCTGGCTGTTTGGTGGCTGCTGCTCTCTAACCTTAATTTTCAGTGTTTCTTCTTCGCCTCGCTGCTAGTATT 
ACCTACACACACTGCCTCACGCGATTCCACACGCACATGGGCCAAATCACGCACACCACAGTCAGCACGAACAGAGCGCCCACATGTGTTCTTGG 
TATACATAGAAAAACAGAAACTCACGTTTTTCAACGCGTTTCGCGGCGAATGTCGCAAATCGATTCGAATGGCAGCTGTGTTTAGCGGGCTCTCG 
ATTTGGCGGGCACTTTTTAGTTTGAAGTGCAAGCCAATTTCATTAGTTGGAGAATATGTGAAAAATTGAGCAAATTATATAAATTGGTACAAAAT 
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GTTGGAGCATATATTATAAGAATTCATAATTCAATACTTTAGAAATGAACGTATTAAGTTATTGCTTTTATACTTATATAGAATTAATTTTTTTT 
TCTTTCAGCC AAAT T AAAAAAAAAAT AT C AAT AT GT CT G ACG AC T G GG AT GAT GAG GTTCGTATTGCTTC GT AC AAAG AAT AT AT GC T T AC T AAG 
CCTATCTATTTTAGCCCATTGTTGATACTCGCGGCGCCCGCGGTGGAGATTGGAGCGATGATGAGGACACGGCCAAGAGCTTCAGCGGCGAAGCT 
GAAGGCGATGGTGTTGGAGGGAGCGGTGGTGAAGGCGGCGGCTACCAAGGAGGAAATCGAGATGTGTTCGGAAGGATCGGCGGAGGCAGAGGAGG 
AGGAGCTGGAGGTTATCGAGGAGGAAATCGAGATGGAGGGGGCTTCCACGGTGGACGTCGCGAGGGAGAAAGGGACTTCCGCGGTGGAGAAGGCG 
GCTTCCGCGGTGGACAAGGCGGCTCCCGCGGTGGACAAGGCGGCTCCCGCGGGGGACAAGGCGGCTTCCGTGGTGGAGAAGGCGGCTTCCGCGGT 
CGGCTGTACGAAAACGAGGATGGTGAGTAAACCTTTTGGACGACATGTTAAATTTTGGGTTATGGTCATTTGAGTCATTCCTGCTGAATTCCATG 
TGGT 

(SEQ ID NO: 355) 

Exon: 4514. .4483 
Exon: 4256. .3504 
Exon: 2023. .1757 
Exon: 1515.. 1001 

Start ATG: 4194 (Reverse strand: CAT) 
Transcript No. : CT12457 

AGAGCAGCAGCCACCAAACAGCCAGCAGAAAAAGCAAAAAGGCAATTAAAGCGGCAGCCAGCTAGCAACGAAAAACACACCCACCCACAAAATCA 

TGGACAGCGCCGGTAAAAATCGTTATGAACTTTTGTTCATGGACGACGATGTCAGCGATCCATTAGATAATCTTGTGGCGCCGACGGCCGCTGCT 

GCCGCAGCAGCAGCTGGCAAGAAGAAGCAACCGTCAGCTGCTGCTGCAGCGGCCACCAAAACGACCGCCAACAAGGTGGCCAACAGCAATAACAA 

GGCGAACGCAGGATCCAATATTGGTGGACCCAATGCCAAGAAACCAAATCAGGCCGAAAAGGAGAACAAGCCCAATAATGCGTTAAACAAAACCG 

ATGGCAAGAAGTTCACTCCATCGGCCGACAACAAACAGCAGCAGTTCAATAATAACGCCAGCAGCAATTACAAACAACAGGGTGCTCCACGGCAA 

GGAGGTGGCGCCAACCGAACACGTGAATTTGGTAGCGGTCAAGGACAGGGTCAAGGACAAGGTGGCCAGCAGCAGCGCAGCGTAAACTTCCGTCA 

ACAGAATGGCAATGCCGAGACTCGTGAGCAGCGCAACAATCGCCGCAATGTCCGCGAAAACGTCGGAGCACCCGATGGCCAGCAGTCGCGACCTT 

ACCGCGGACCAGGTGGAGGCCCAGGTGCTGGCGGAGATCGTCCCCAGCGTCAGAACCGCAACTACGATGGCCAGAACAGGAAGCGCGAGTTCGAC 

CGCCAGTCGGGTTCCGATAGAACTGGTGTGAAGTCAATTGACAAACGCGATGGAGCTGGCTCCCACAACTGGGGATCGGTCAAGGAGGCTATTGA 

TGATGTGAACAAGAACGAAAGCGAAACCAATGTGACCAATGCCGAGGGCGGCGCCAAGGCCGATGAATCTGGCACTGAACCACAAAACGAGCAGG 

CTACCGCCGAGGAGGAAGCCAAAGAGCTGACCTTGGACGAGTGGAAGGCACAGCAAGGACAGCGCATCAAGCCAACCTTTAATATCCGCAAGGCT 

GGAGAAGGTGAGGACACCACACAATGGAAGAAAATGGTCGTCCTAACTAGCAACAAGAAGAAGGAGAACGACAGCGAAGAGGAGCTCGAGTACGA 

CCCTGCCCTTTATCCACAGCGTGTGGGCAGGCAACAGCGCGTCCTGGACATTCAGTTCAATTTCAACGATGGTCGTCGTGGCGGACCTGGCGGTT 

TCGGTGGACGCGGTGGCCGTGGAGGCCCCAGACCCGGTGGATTTGGAGGCGGACCACGCAGCGAGGGTGGCAACCGCGATGGCGGCAATCGTGAG 

GGCGGCCGAGATAATCGTGAGGGTGGAAACCGTGGCCCACGGGATGGCCAGCAGCAT AAT AACGAGGGCGGCGGTTCATCCGCTCAAAATC AAAG 

GCCACCCATCGATCGTCGCGGTCCGGGAAACAACCAGAACAACAATCAGAACAGCGGTCCAGGGCCGAATAAGCGTTTTGAGCGACAACAGAACA 

CCGCTCCCAAGGTCAACGATGAGCGTCAGTTTCCCACTCTGGCTTAA 

(SEQ ID NO: 356) 

Start ATG: 95 (Reverse strand: CAT) 

MDSAGKWRYELLFMDDDVSDPLDNLVAPTAAAAAAAAGKKKQPSAAAAAATKTTANKVANSNNKANAGSNIGGPNAKKPNQAEKENKPNNALNKT 

DGKKFTPSADNKQQQFNNNASSNYKQQGAPRQGGGANRTREFGSGQGQGQGQGGQQQRSVNFRQQNGNAETREQRNNRRNVRENVGAPDGQQSRP 

YRGPGGGPGAGGDRPQRQNRNYDGQNRKREFDRQSGSDRTGVKSIDKRDGAGSHNWGSVKEAIDDVKKNESETNVTNAEGGAKADESGTEPQNEQ 

ATAEEEAKELTLDEWKAQQGQRIKPTFNIRKAGEGEDTTQWKKMVVLTSNKKKENDSEEELEYDPALYPQRVGRQQRVLDIQFNFNDGRRGGPGG 

FGGRGGRGGPRPGGFGGGPRSEGGNRDGGNREGGRDNREGGNRGPRDGQQHNNEGGGSSAQNQRPPIDRRGPGNNQNNNQNSGPGPNKRFERQQN 

TAPKVNDERQFPTLA* 

(SEQ ID NO: 357) 

Gene Symbol: vig 
FlyBase ID: FBgn0024183 



Celera Sequence No. : 142000013384828 

TTCCTCAAACGGTTATCTCGATCGTTAAATGATCAACACAGAAATGTATTCGATTTTATTATATTTTTTATTTTAACACAATTGTTGTGTGTTTG 
TGTGGCTCTTTCGTTTGTTGTAAATTATACAATTAATTAATATGTACTATATATAAATAAAGTATCTCAACTTTTCTATGATTGGTTCGCCTTGG 
CATGTAAAAAATGTCCCCTTACGGCTTAATCGAGTCTTAGTGTCTACGGGAAAAGACTTACAAAAATACATATCGCATTGATTTATTGAACATTT 
TCACTTATACATATGTCTCTATATTTATATTTATATATATTATATACCCTCTAATATAACATGGATATAGATGCACACGTGTTTCGTTCGCAATT 
ACGATTCGATAAGTTTAAATTAGGTTAAACTAGTTTATACATGTACTAGAAGACAACCCTAAGAACTTATACAAATATTGATAGGGTTTGAAGCT 
CTATATAAATTAAGATCGAAGAGTTAGATTAGAAATACGGACTTGTCTTATCAAATGTAGTGCCCCTTTGATTTTGTGTTTATTTCTTTTGGGTC 
GGAAGTTTTAACGAATGCTAATTGAAGATCTGTCTTGTTGGGGCCCTAATGCGATATGATTCATCATATTATGTTATGTAATGCATTTTCAACTA 
ACATATGTAGGTTATAAAATAAGCTAATAACTAGTGTTGGTATTATGAAGACTTATGCAACGAGAAATGCATACTGATTTCGGGATTCGCTTCTA 
GGAATATTTCGTTTTATGGACATTTTAATTTATATTTCGATTTTATTTCTACCCCATTTGTGTTTCCCTAAATGTGTGGAATCGAAGCATTAACA 
GTTCTGCACAAATTTGTGGTTTGTTCGCAGTTGCTTGGCTACATTAGTGTTTGCTTTAGTTTAAATAGTTCGTTTACGTAAGATCAAGATCACTT 
TTTGCATTTACACAACAAACAACAAAAACGTCTTCCCCCGCGGCCCTTGCAATATTCCTCCATAAAAATATATATGTATATATATAATATATAGT 
TTACCCATATATATATGGGTATATATCTCTGAATCGACTTTGGACGCCGGCTAGGCCGGCGCCTTGTCCCGACACTATCAACATCCGTACAACTC 
TTAGTATATAAAAAGTCTTTTCAATTTCTACGTCATCATACCGCGGCTGGCGAAGATTCCTAGATGCGCCAGCCCTGGTCATCGTCCACGTTGGA 
GTCTGTGTCTGAGGGCTCCTCGCCGTACATGTCGGCCAATTTGCGGAAGCGCGGTCCAAAGTTGGACAGGTAGTCGAAGTTGAGATCGCCGTCGT 
CGGTGCAGGACGCCAGACTGGAGAGGCTGCCATCGGAGTTGCCGTCACCTTCGTACGCGTAATGCCGCACATCGTCCACGGTTGTGGCGCCCACA 
TCCCGGTCGCAGTTCTCCTTCTTGTCGCCCAGGAAGTCGGCGATGTCGGGTATGTCGTTGGGGTCGCGCATGTTCCCCTGTAACGCATGAGGATC 
TTTGTACAGCTTCTCCTCGTAGAAGGGCTGGGTGCGCAGGACATTCAGATCATAGTCCGTGTCCCGCTCGCCGCCACCCTCGTCCTCGTAATTAA 
TGATCGTCTCGCGAATGTCGTCGATGTCCTTTTCGTGCCAGCCATTCTTCTGCTTTTTCTGCACCACCACTGCCAACAGGATGATCAGCAGTAGC 
GCGAGGCATACGATGATCGCAATGATGAAGTTTCTATCAATTCCAAAACTGAACGCAACGGCCACCGACTGCTGGCAACCGCTGTCGATGTTTCG 
GAACACAGAGGGCATTCCCAAGTTGTAAGTCTGCTCGTTAATTGTCAGGTTGCGAATGCAACCGAAGAAACCCTTCTGGTTGGGAACGTGCGTCC 
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AATTGAGCTGCCGTCCCAGCTGTTCCAGATCCACTGGAGTGCCGCCCAACTGCAGCGGTGCGTTTACATTGAGGAACTCATTGGGGCCAATTGGG 
GCGCCCAGTGTCTGGCAGGTGGATAGCCGGCAATTGTCCACAGTCATCTCAATGCTGGTACGCTGCAGGATAATGTCCAGTTGGTAGGTGCGGTC 
TGCCACCATCTTGATGTGCTGATGGCGGATGCGAATGGCACCTGATCCGTAGTCCACAGTCAGCACCGGATAGCCATTGTCCAACTCCAGAGCTA 
GGAAATCAGAAATGGCCAGCAGTGGATTGAAGTTAAGTGGTCCCAAGTACATGATCAGGCCCTGATCGATCTGGGGCGTGATCTCCAAGCTAATC 
TTGGTGTTGTTGCAGGCGGCAATGGGCTCGTAGAAGGCATACCCCGAGCCATAGAATGCCACGGAGACGAGTTCGCAGTGGGGTCCGGTGAATCC 
GTCTATGCAGTCGCACACGTCGTTCTCCCCGTACCGCGGAGATCCTCCGTTTAGACAACGCCTCATTAAAGGAGCCTCGCATACGCATTGCGCTT 
GAACAAAGGCATTCACGCCCACGAACGATGTTGTATTGGAGTAGATCATGTATGGCACCGAGCTCTTGTGCAGCTCATTGGTGCACGATTCCTCG 
CACTTGAACTTCTCAATCAGACACTCGTCGATGTTCACCATCAGCATTTGCAGGTCTAGTTCGTTTTCCAAACGCTGCTGATTCTGGGCCACTAT 
TCCATTTAATTTCTCCGGAGCATAGTAGGGTGAGCCGTGGGCAGAGAAGCGTACATCGAGCGTATGATTCTCGTTTTGGAGCACAGTGAACACGT 
CCACGTTGGATACCGACGTATTGAAAAGCTTGGCCAGCGAGAGCTGCAAGCGATCCTTTAGGGATAGCGCATCCGGTGACTGGAAGTCGCGTGGC 
ACACTGATAAACTCCTCCTTGGTTACGTTGATGAATCGGATGCTGCCGCTCTTATCCACCGCCTCCTCGGGAAGCTCGCGGACCACCACGGTGAC 
ATAAGCATCTACGGAGTGACGGGGCACGAACATGGAGTCCTCGGTGACTACGAACGACAGGTCGTACTCTCCCTCGGCGGTGTGAACCAGCATGG 
TGATCATACCCGTGCTGGGATTAAGCCTAAACTGATCGTGCGGAAGATCTTTCCATTCAAAGTACTTGTCCTCCAAGTCCCAGTCATCGAGATCA 
TCTACGAACACGCGACCAATATCGGTTTCAGGCGCTTCTCCCTTGTAATTGTAAATGAAGATGCGCGACGATCCCTCGCTCATGGCATTGTCGTT 
GACATCGCCGATCACCAGGTGCAGAATGCTTACGGCACTCTGTCGCGGAACGCCAGAATCACTGATGCGAATGGGGATAAAGTACTCCTTTTGAG 
CCTCGCGATCAAACTGGACGTTGGCGTGCAAATAGTCTCCATCCATGCTGAACTTGGTCTTGATGTCCGGTGTGGCTTCGCTGTCGATGCCGAAG 
GTGAAGTTACCTGCCCCCGGAGTGTCGTCGTAGTCGTTGGCCTGCAGTTGCACAACATGTCCGGGATTGCGGTTCTCCTGCCAGTAGACAGGCAT 
TTCGTTGATCAGGAACGGAGCATTGTCGTTGATGTCCTTCAACGTGACGGTGACCTCTTTCACTGTGCTGACCGTGGTTCCTACTCCATCCTCAT 
CATTAGCCTTGATCAAAACCTGCCAGTTCTTGTGACCATTGGGCGGATCGCGGTCCAGCGTCTTGACCAAACGCAAGCAGCCACTGTTGTCAATG 
GTCCATCGGTTTCCATCGTTCTGGTGAATCGAATAGACGATATGCTGATCTGCGTTGCGATCCTTGATATCCGGATCGTAGGCTTCGACAGTGAG 
TATGCAGTCGTCATATGTGGTCTCCTCCAGAATGGTGACGCTGTAGTCCTGCTTAAACACCGGCGGGTTGTCGTTCACGTCTTCAATTTTGATAA 
CCACCGTTGTGTAGTCGTCGTAGATTCCATCGAACGCACGCACCTTCAGCTCGTACTCCGTGATCGTTTCGTAGTCCAACTTCTGGTTGACCGTG 
ATTTTGCCGGTCTTAAGACCAATCTTAAAGGCATCACCCACGTTACCGCTCTCAATGCTGTATAGGATCTGGGATGCGTTATCCTCGTCTTCGGC 
CTTTACCTCAATGACCTCGGTATTCGTATTGGCATCCTCGAGCAGTCTTTCTGCCAGATATTTGTCCTGCTGGAAGTGCGGCTTGTGGTCGTTCT 
TGTCTCCAATGGAGATGCGGAACACTTGGTGACCACGGTTCGGCTCGCCATTATCAAAGAGACTCGATGGCGAATTGTCGCTGGCAATAACCTTT 
ACGTTATAGAAATCGCGTTCCTCGCGATCGAATGTCGTCAGGGCGGTTATGTTACCGGTATTGGGGTCGATGGTGAAGTACTCCCGATTGTCGGC 
CAGCTCGAATGACACAATATTATTGGCTGAGGTGCCGTCCATGTCGAAGGCACGCACCTGCATAACAGGCGTGCCGGGTGGTTCGTTCTCCAGGA 
TAGTGCCCGATTTGACCTCCGTGTAGTAGGGGATGTTATCGTTGACATCCTCCACCTGAATCTTGATCTGATGCTCTGTGCCCAGCTCGTGTGTA 
TTCCGCACAATCATGGTGAGGGTGTAGTCTGTGATGGCCTCGTAGTCCAGAGTCTTGCCCAGGCTAATGGTCACCTCGTTGCCGATCTGATTGAA 
CACGAATGTGTTCTTGCTGTTTGTCTGCTCCGTGCGGCCGGTGTTCAGTTCGAAGATTACCTCCGGCTTGTCCGGCATATTGGAAACGGCTCGTA 
ACGTGGCGATCGGATGTGTAAAGTTCTTGAGATTCTCCTTCAGGTAAATGGGCGTATCGATCGGATTCACAAATGATGGCGGCTTAATTGAGGAC 
TCCACCACCCGGATGCGCACTTCGATTTGTGCATCCTGTGGCGGATCTGGCACTACGTTATATGCCCGCACGATTATCGCATACGATTGGCCGGG 
TCTCTTGTCAATCGGTCGTTTCAGGTAGATAATACCCGACTCCTTGTCAATCTTGAAGTACTGAAAGTCGCGCTCCCGCAGAATCTCGTATTCGA 
CCAAACTGTTGTTGCCATCGTCGAAGTCACTGGCGCTGATAGTCATCACCACCGCGTCGGGCTGCAAGTAGAAGGTAAAGATAAAGCCACAGATA 
TTTGGGTGTTTCTGGTTTCCTTCCTTCCACACAACTTTCCTGCAAGTTTGCCCAAACCAGGAGCAAGCACATGACCCCGCTCTAGTTTCCTGAAT 
CTCCTTTTACTCAATGCTACCTGGGCGAAGGCCTAATTAGAACTGAAACCTTTTCGCAGAAAAGCTTCTCTGCTAACAAAGCTTTGCTAAAAATA 
ACAGCGTTTGTTGTGTCGAGGAAAATAACACATAGAAGTTAGATAACTATCTGCTGTACTATCCCTTAACTACAAAGTAAAAAGACTTTCCGATA 
TTGATTGCAAATCAAAGATGCTCCTGAAGATCCTCAGAATATCAGATCTAATCTAAGAAACCTTAATAAAATTATAATATATTTCTAACTAATCG 
AAGTAATTACCGAAAAGGGCTCTCTATTGCGGTTTTGGTACAAAAACTGCACGCTATTACAAGTACTTAGCTAAATTTAGTTTTCTACCACTGTT 
TTGTAGTTCGACAAGTTTT TCTGGTCGC AAACATGGAAATGGCGTAAAAGAAAAGGCGTTGGAAAGGGTTTCAAGGTTTTCTGTATCGAACCGGG 
CGAGCAATGAACCCAAAACCCGTCTCCAAGTGGTCGCCCTCCTACGCCAAGCGAAAACTAGTTCCGCCCAAGAAGATGTGCGCCCATGAACTGGC 
CAAGTGTTTAATTAACACCGTAGGACGTGCGACAGTGGGAGCCTCAGTCTATTGGAATGACCAGGTGGGCAGAGGGCAAATCTTATGGTGCCAGT 
GATCCATAAATATCAGTAGTCAAAAACATGATAGTTTTAGTAACAGCATAAATAAGCAAAAATCAGCTAACAAATATTTTCTTTTAATTTTTAAG 
CAAGATTTTATCTTAATATATTTTTGGTTAAAAGACATTTCTACTACTACACTCCAGTGAAAAGAGAAACTTGAAGCAACAATTGAGTTAGGTAC 
ATGACATCACAAGAGGGGGCAAGACAGAAAGAATGACATCACTCTAGTACCAATTCACCAGCTTGGCCAACCCGCAGTAATCCGGGGTCCGCGAG 
TTTGGCAAGCGGTGGTGGATAGAGTGGGCCAACTCGAACTCGACTCAGTGGGCCAGGAAACTCACCTGGGCATTTTCCGACATGGATTCGTCGTA 
GCGCGCCTTGTTGAAGGCGGGTGCATTGTCGTTGATGTCCTCGATGGTAACGTTGAAGGTGCAAACGTCGTCCAGTGGGGGCAGACCATTATCCG 
TGGCCTGGACGGTGACGAAGACGAACTTCTCGTGGATCGGCTCGTCACGATCGAATGTGTGGGTGGTGAAGATTACGCCCGTGCTGGGATTGATA 
AAGAACTTGGGTCGCTCGAACGGCGACTGGACGATGCTGTATCGAATGACCTGGTCGGGATCAGGGTCGACGGCCTCTACGGTGAGAACGTAGGT 
GTTTTCGGGCTGCTCCTCTTTGACCTTAGGAGCGTATCCGGCGCAGTTTTTGAAGGCGGGCTTACGATTGTCGCCGGAAAAGTTTTTGTCCGGCG 
GCGAGTGAAGTGCGAGGGCGGGTGACAGAAGGCTGATGGCCACGATAACGCCCAGGAGGAGTCTCGCGGGGATTAGTCGCCTGCCAAGGTCGCGG 
CATTTTCGCTTGTGCTGCTGGTGCGTCTGTTGCTGGGCGGGATTTAGGTGGCCGGCCTGCGGGGTCGCCGACATATTGATGCAGTGGTAGCTTCT 
GGACATTCGCTGGACACTGGTGGACATACTCTCTAGCGGCGATCCGCCATCGCAGACTCTCTTTTTTTCAACCGCAACACTTTTGAGCACTTCTT 
CGTGAATCACCCAGTATTATTTCACTCACACACCTTTTACTTTTGTTTGTGTTGCCCACTTCTACTTCCACTTCTTTTTTATTCTATTCTATTTG 
CACTCTTCTTCTTGTTTCGTTTCTCTCAGTGTACCGTTTGTTTGGACTGCATTTTCATGGCAACCGCTCAATAATTATCAATCGAATCGAACTCG 
TACAGTGCGCGCCCTGATTGTTAACTGACTTATTCCGATTTCGGTGCAGCTGCTCTGGCTGATATTTTCGTTCACTTTTATCTTTCACTCGTCCG 
CTTGGGCACTCTCTTTCTCCGTTTCTGTTGCCTCCTCTCTTTCGCACCGCAAACAGGTAAGACCAGGTAAAAAAGTGGTTACACGCACACCGCCA 
AGCGCGCGCATTTGTGTGCGTGTGGTTCGGTTCGCTGCGTTGGTATGTGTGTTACCAACGACTTTTGGTAAAAAGGCACTTTTCGATGGTATTTC 
GTTGTGAACTTGTACTTTCTGGCGAATTATTTTAAATAAAAAGATCGCAAAACGTATTTTCTTTCACTTCGATCGCTTTTATTGTCGACTTACAA 
CGAAGCTGCGGAGCCACACCGATTTTGAAACTGCGATCGACGCGCGAGACAACCGCTCAATTCAAAAGCAAAAACACAATGCAACAAAAACAACG 
GGCGGCGTTAGAGATGTGAGACACCGTCGATTCCTCGGTCATCGATAATTAACGGTTAATAATCTCGATGTTTTCGTTTTATCGATACTATATAC 
TGTCTACACTTCTCGATAAATACGTTGATGTGAATCAATTACACTGGATTTAATATAACAAATATTTCTTTAAAAATGGAAGCTGGAAGATAAAA 
ATGGCCCAGCTTGGGCTTTCAAATAGAATTATTAACAACTATTTTTCACAAACAAACAGGTTAATTTTAAACGAGTTGATCAATACTTTAATACT 
TATATAACTGAAAAATATTTAATAACCAAAACTAGCTACCATGTACAACAATAACATAATTTGCTTTAAAAAAGTTCTGATTGAGTTTGGTTTTG 
AACAGGCAATGATATTTATTGGTCTCGAGGAACTATTTCTATAATGTATTACATTCTACATCGTTAAAAGTTTTTTTATTGCAATTCAAGTAACG 
CGCCAAGACTTTTGCCATTCACCCGTCGGATTATGGAGCAGTGTAGACTGCCCGTAAAATACGGTGAATCGCATCCGGTACAGGTACACAAATAT 
TTCCCATCCCTAGCAAAGAGTGGAAAAAAAAGTGAATCGAATCGTCATCGTCTTGGCCCATTAATTGCGGAGAAAAAAGATTGCAGCGCCGAAAA 
AACGGTCAAGGAAGAGTTCCAGCGACACCGCAACTTCCAACCACTCCACAAGAAGATCCCGTCGAAAGGTAGAACCCTAGGAACCGGAGGGGAAA 
CCAGACCAAACCAAACAACCGCCCAGCCCCGATTTCGTGAATCACCAGTGGAACTATGGAGCAGACACCGATCACCGATGCGGAGAAGGTGCGCA 
TCGTATCGGACTTCATCCTACATGCGCCGCCCGGCGAGTTCAACGAAGTGTTCAACGATGTGCGGGAGCTCCTCAAGAATGACACTCTGCTAAAG 
GATGGGGC 
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{SEQ ID NO: 358) 



Exon: 7558.. 6241 
Exon: 5096.. 1001 

Start ATG: 6867 (Reverse strand: CAT) 
Transcript No. : CT12481 

GTTGTCTCGCGCGTCGATCGCAGTTTCAAAATCGGTGTGGCTCCGCAGCTTCGTTGTAAGTCGACAATAAAAGCGATCGAAGTGAAAGAAAATAC 
GTTTTGCGATCTTTTTATTTAAAATAATTCGCCAGAAAGTACAAGTTCACAACGAAATACCATCGAAAAGTGCCTTTTTACCAAAAGTCGTTGGT 
AACACACATACCAACGCAGCGAACCGAACCACACGCACACAAATGCGCGCGCTTGGCGGTGTGCGTGTAACCACTTTTTTACCTGGTCTTACCTG 
TTTGCGGTGCGAAAGAGAGGAGGCAACAGAAACGGAGAAAGAGAGTGCCCAAGCGGACGAGTGAAAGATAAAAGTGAACGAAAATATCAGCCAGA 
GCAGCTGCACCGAAATCGGAATAAGTCAGTTAACAATCAGGGCGCGCACTGTACGAGTTCGATTCGATTGATAATTATTGAGCGGTTGCCATGAA 
AATGCAGTCCAAACAAACGGTACACTGAGAGAAACGAAACAAGAAGAAGAGTGCAAATAGAATAGAATAAAAAAGAAGTGGAAGTAGAAGTGGGC 
AACACAAACAAAAGTAAAAGGTGTGTGAGTGAAATAATACTGGGTGATTCACGAAGAAGTGCTCAAAAGTGTTGCGGTTGAAAAAAAGAGAGTCT 
GCGATGGCGGATCGCCGCTAGAGAGTATGTCCACCAGTGTCCAGCGAATGTCCAGAAGCTACCACTGCATCAATATGTCGGCGACCCCGCAGGCC 
GGCCACCTAAATCCCGCCCAGCAACAGACGCACCAGCAGCACAAGCGAAAATGCCGCGACCTTGGCAGGCGACTAATCCCCGCGAGACTCCTCCT 
GGGCGTTATCGTGGCCATCAGCCTTCTGTCACCCGCCCTCGCACTTCACTCGCCGCCGGACAAAAACTTTTCCGGCGACAATCGTAAGCCCGCCT 
TCAAAAACTGCGCCGGATACGCTCCTAAGGTCAAAGAGGAGCAGCCCGAAAACACCTACGTTCTCACCGTAGAGGCCGTCGACCCTGATCCCGAC 
CAGGTCATTCGATACAGCATCGTCCAGTCGCCGTTCGAGCGACCCAAGTTCTTTATCAATCCCAGCACGGGCGTAATCTTCACCACCCACACATT 
CGATCGTGACGAGCCGATCCACGAGAAGTTCGTCTTCGTCACCGTCCAGGCCACGGATAATGGTCTGCCCCCACTGGACGACGTTTGCACCTTCA 
ACGTTACCATCGAGGACATCAACGACAATGCACCCGCCTTCAACAAGGCGCGCTACGACGAATCCATGTCGGAAAATGCCCAGCCCGACGCGGTG 
GTGATGACTATCAGCGCCAGTGACTTCGACGATGGCAACAACAGTTTGGTCGAATACGAGATTCTGCGGGAGCGCGACTTTCAGTACTTCAAGAT 
TGACAAGGAGTCGGGTATTATCTACCTGAAACGACCGATTGACAAGAGACCCGGCCAATCGTATGCGATAATCGTGCGGGCATATAACGTAGTGC 
CAGATCCGCCACAGGATGCACAAATCGAAGTGCGCATCCGGGTGGTGGAGTCCTCAATTAAGCCGCCATCATTTGTGAATCCGATCGATACGCCC 
ATTTACCTGAAGGAGAATCTCAAGAACTTTACACATCCGATCGCCACGTTACGAGCCGTTTCCAATATGCCGGACAAGCCGGAGGTAATCTTCGA 
ACTGAACACCGGCCGCACGGAGCAGACAAACAGCAAGAACACATTCGTGTTCAATCAGATCGGCAACGAGGTGACCATTAGCCTGGGCAAGACTC 
TGGACTACGAGGCCATCACAGACTACACCCTCACCATGATTGTGCGGAATACACACGAGCTGGGCACAGAGCATCAGATCAAGATTCAGGTGGAG 
GAT GTC AACG A T AAC AT C C C C T AC T AC A C GGAGGT C AAAT C GGGC ACT AT C CT GG AG AAC G AACC AC C CGGC AC GC CT G T T AT GC AGGT GC GT GC 
CTTCGACATGGACGGCACCTCAGCCAATAATATTGTGTCATTCGAGCTGGCCGACAATCGGGAGTACTTCACCATCGACCCCAATACCGGTAACA 
TAACCGCCCTGACGACATTCGATCGCGAGGAACGCGATTTCTATAACGTAAAGGTTATTGCCAGCGACAATTCGCCATCGAGTCTCTTTGATAAT 
GGCGAGCCGAACCGTGGTCACCAAGTGTTCCGCATCTCCATTGGAGACAAGAACGACCACAAGCCGCACTTCCAGCAGGACAAATATCTGGCAGA 
AAGACTGCTCGAGGATGCCAATACGAATACCGAGGTCATTGAGGTAAAGGCCGAAGACGAGGATAACGCATCCCAGATCCTATACAGCATTGAGA 
GCGGTAACGTGGGTGATGCCTTTAAGATTGGTCTTAAGACCGGCAAAATCACGGTCAACCAGAAGTTGGACTACGAAACGATCACGGAGTACGAG 
CTGAAGGTGCGTGCGTTCGATGGAATCTACGACGACTACACAACGGTGGTTATCAAAATTGAAGACGTGAACGACAACCCGCCGGTGTTTAAGCA 
GGACTACAGCGTCACCATTCTGGAGGAGACCACATATGACGACTGCATACTCACTGTCGAAGCCTACGATCCGGATATCAAGGATCGCAACGCAG 
ATCAGCATATCGTCTATTCGATTCACCAGAACGATGGAAACCGATGGACCATTGACAACAGTGGCTGCTTGCGTTTGGTCAAGACGCTGGACCGC 
GATCCGCCCAATGGTCACAAGAACTGGCAGGTTTTGATCAAGGCTAATGATGAGGATGGAGTAGGAACCACGGTCAGCACAGTGAAAGAGGTCAC 
C GT C AC GT TG AAG GAC ATC AAC G AC AATGC TCCGTTCCT GA T C AAC GAAA T GC C T GT CT AC T GGC AGG AG AACC G C AAT CC C GG AC AT GTT GT G C 
AACTGCAGGCCAACGACTACGACGACACTCCGGGGGCAGGTAACTTCACCTTCGGCATCGACAGCGAAGCCACACCGGACATCAAGACCAAGTTC 
AGCATGGATGGAGACTATTTGCACGCCAACGTCCAGTTTGATCGCGAGGCTCAAAAGGAGTACTTTATCCCCATTCGCATCAGTGATTCTGGCGT 
TCCGCGACAGAGTGCCGTAAGCATTCTGCACCTGGTGATCGGCGATGTCAACGACAATGCCATGAGCGAGGGATCGTCGCGCATCTTCATTTACA 
ATTACAAGGGAGAAGCGCCTGAAACCGATATTGGTCGCGTGTTCGTAGATGATCTCGATGACTGGGACTTGGAGGACAAGTACTTTGAATGGAAA 
GATCTTCCGCACGATCAGTTTAGGCTTAATCCCAGCACGGGTATGATCACCATGCTGGTTCACACCGCCGAGGGAGAGTACGACCTGTCGTTCGT 
AGTCACCGAGGACTCCATGTTCGTGCCCCGTCACTCCGTAGATGCTTATGTCACCGTGGTGGTCCGCGAGCTTCCCGAGGAGGCGGTGGATAAGA 
GCGGCAGCATCCGATTCATCAACGTAACCAAGGAGGAGTTTATCAGTGTGCCACGCGACTTCCAGTCACCGGATGCGCTATCCCTAAAGGATCGC 
TTGCAGCTCTCGCTGGCCAAGCTTTTCAATACGTCGGTATCCAACGTGGACGTGTTCACTGTGCTCCAAAACGAGAATCATACGCTCGATGTACG 
CTTCTCTGCCCACGGCTCACCCTACTATGCTCCGGAGAAATTAAATGGAATAGTGGCCCAGAATCAGCAGCGTTTGGAAAACGAACTAGACCTGC 
AAATGCTGATGGTGAACATCGACGAGTGTCTGATTGAGAAGTTCAAGTGCGAGGAATCGTGCACCAATGAGCTGCACAAGAGCTCGGTGCCATAC 
ATGATCTACTCCAATACAACATCGTTCGTGGGCGTGAATGCCTTTGTTCAAGCGCAATGCGTATGCGAGGCTCCTTTAATGAGGCGTTGTCTAAA 
CGGAGGATCTCCGCGGTACGGGGAGAACGACGTGTGCGACTGCATAGACGGATTCACCGGACCCCACTGCGAACTCGTCTCCGTGGCATTCTATG 
GCTCGGGGTATGCCTTCTACGAGCCCATTGCCGCCTGCAACAACACCAAGATTAGCTTGGAGATCACGCCCCAGATCGATCAGGGCCTGATCATG 
TACTTGGGACCACTTAACTTCAATCCACTGCTGGCCATTTCTGATTTCCTAGCTCTGGAGTTGGACAATGGCTATCCGGTGCTGACTGTGGACTA 
CGGATCAGGTGCCATTCGCATCCGCCATCAGCACATCAAGATGGTGGCAGACCGCACCTACCAACTGGACATTATCCTGCAGCGTACCAGCATTG 
AGATGACTGTGGACAATTGCCGGCTATCCACCTGCCAGACACTGGGCGCCCCAATTGGCCCCAATGAGTTCCTCAATGTAAACGCACCGCTGCAG 
TTGGGCGGCACTCCAGTGGATCTGGAACAGCTGGGACGGCAGCTCAATTGGACGCACGTTCCCAACCAGAAGGGTTTCTTCGGTTGCATTCGCAA 
CCTGACAATTAAC GAGCAGACTTACAACTTGGGAATGCCCTCTGTGTTCCGAAACATCGACAGCGGTTGCCAGCAGTCGGTGGCCGTTGCGTTCA 
GTTTTGGAATTGATAGAAACTTCATCATTGCGATCATCGTATGCCTCGCGCTACTGCTGATCATCCTGTTGGCAGTGGTGGTGCAGAAAAAGCAG 
AAGAATGGCTGGCACGAAAAGGACATCGACGACATTCGCGAGACGATCATTAATTACGAGGACGAGGGTGGCGGCGAGCGGGACACGGACTATGA 
TCTGAATGTCCTGCGCACCCAGCCCTTCTACGAGGAGAAGCTGTACAAAGATCCTCATGCGTTACAGGGGAACATGCGCGACCCCAACGACATAC 
CCGACATCGCCGACTTCCTGGGCGACAAGAAGGAGAACTGCGACCGGGATGTGGGCGCCACAACCGTGGACGATGTGCGGCATTACGCGTACGAA 
GGTGACGGCAACTCCGATGGCAGCCTCTCCAGTCTGGCGTCCTGCACCGACGACGGCGATCTCAACTTCGACTACCTGTCCAACTTTGGACCGCG 
CTTCCGCAAATTGGCCGACATGTACGGCGAGGAGCCCTCAGACACAGACTCCAACGTGGACGATGACCAGGGCTGGCGCATCTAGGAATCTTCGC 
CAGCCGCGGTATGATGACGTAGAAATTGAAAAGACTTTTTATATACTAAGAGTTGTACGGATGTTGATAGTGTCGGGACAAGGCGCCGGCCTAGC 
CGGCGTCCAAAGTCGATTCAGAGATATATACCCATATATATATGGGTAAACTATATATTATATATATACATATATATTTTTATGGAGGAATATT 
(SEQ ID NO: 359) 

Start ATG: 692 (Reverse strand: CAT) 

MSTSVQRMSRSYHCINMSATPQAGHLNPAQQQTHQQHKRKCRDLGRRLIPARLLLGVIVAISLLSPALALHSPPDKNFSGDNRKPAFKNCAGYAP 
KVKEEQPENTYVLTVEAVDPDPDQVIRYSIVQSPFERPKFFINPSTGVIFTTHTFDRDEPIHEKFVFVTVQATDNGLPPLDDVCTFNVTIEDIND 
NAPAFNKARYDESMSENAQPDAVVMTISASDFDDGNNSLVEYEILRERDFQYFKIDKESGIIYLKRPIDKRPGQSYAIIVRAYNVVPDPPQDAQI 
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EVRIRVVESSIKPPSFVNPIDTPIYLKENLKNFTHPIATLRAVSNMPDKPEVIFELNTGRTEQTNSKNTFVFNQIGNEVTISLGKTLDYEAITDY 

TLTMIVRNTHELGTEHQIKIQVEDVNDNIPYYTEVKSGTILENEPPGTPVMQVRAFDMDGTSANNIVSFELADNREYFTIDPNTGNITALTTFDR 

EERDFYNVKVIASDWSFSSLFDNGEPNRGHQVFRISIGDKNDHKPHFQQDKYLAERLLEDANTNTEVIEVKAEDEDNASQILYSIESGNVGDAFK 

IGLKTGKITVNQKLDYETITEYELKVRAFDGIYDDYTTVVIKIEDVNDNPPVFKQDYSVTILEETTYDDCILTVEAYDPDIKDRNADQHIVYSIH 

QNDGNRWTIDNSGCLRLVKTLDRDPPNGHKNWQVLIKANDEDGVGTTVSTVKEVTVTLKDINDNAPFLINEMPVYWQENRNPGHVVQLQANDYDD 

TPGAGNFTFGIDSEATPDIKTKFSMDGDYLHANVQFDREAQKEYFIPIRISDSGVPRQSAVSILHLVIGDVNDNAMSEGSSRIFIYNYKGEAPET 

DIGRVFVDDLDDWDLEDKYFEWKDLPHDQFRLNPSTGMITMLVHTAEGEYDLSFVVTEDSMFVPRHSVDAYVTVVVRELPEEAVDKSGSIRFINV 

TKEEFISVPRDFQSPDALSLKDRLQLSLAKLFNTSVSWVDVFTVLQNENHTLDVRFSAHGSPYYAPEKLNGIVAQNQQRLENELDLQMLMVNIDE 

CLIEKFKCEESCTNELHKSSVPYMIYSNTTSFVGVNAFVQAQCVCEAPLMRRCLWGGSPRYGENDVCDCIDGFTGPHCELVSVAFYGSGYAFYEP 

IAACKNTKISLEITPQIDQGLIMYLGPLNFWPLLAISDFLALELDNGYPVLTVDYGSGAIRIRHQHIKMVADRTYQLDIILQRTSIEMTVDNCRL 

STCQTLGAPIGPNEFLNVNAPLQLGGTPVDLEQLGRQLNWTHVPNQKGFFGCIRHLTINEQTYNLGMPSVFRNIDSGCQQSVAVAFSFGIDRNFI 

IAIIVCLALLLIILLAVVVQKKQKNGWHEKDIDDIRETIINYEDEGGGERDTDYDLNVLRTQPFYEEKLYKDPHALQGNMRDPNDIPDIADFLGD 

KKEKCDRDVGATTVDDVRHYAYEGDGNSDGSLSSLASCTDDGDLNFDYLSNFGPRFRKLADMYGEEPSDTDSNVDDDQGWRI* 

(SEQ ID NO: 360) 

Name: DE-cadherin 
Classification: cell_adhesion 
Gene Symbol : shg 
FlyBase ID: FBgn0003391 



Celera Sequence No. : 142000013384832 

ACTTGCAATATTGGTATCAAACTCTCGCACACGCGAATTAATGCGAGACTATATCGGTTCGGAGCGTACTAGTTCGGATCGAATCGAGTAGTGGA 
GGGAGCGGTGTGGACGCTGCGAAGCTCGAAAGAAGACTGAACGCGACCAGCCCGGTGGTTGTCAATATGAATACGTTGTGGAGTGGACTAGCAGT 
GCTTTGGGATGTTGCAAACCATGCCGAAACGAAGCAAAACCACTGGCCTGGCTGCACGCCGTACTGTGGTAGGTATTCGGCAGGTGTTGGTACAG 
ACATAAATATCTATTAGCTCGGCTAATCTAGGCGGAACTGGCCGTCTGGCCTAAGAGGCTTAGGCTGGGTCCAAGTGATGGGCCAAGTAGTTGAC 
TTTTGACTACATGGGCGGGACTGCTGGGTACCACAAGATGCAAGTCCCAGGCCCAGGAACTCTTTGGCATATTTTGGGAAACATTAACTTTGCCA 
TTTGGACGAGTTTTATTGACTTTGCTGCTTTTCAGCGAATAAGATATATTGGCATCATTATGGATGCAAAATTAGCAACGGATCGAATTGGTTTC 
CATTACACAATGGTCCAGCTTTGCATTCGCAGCCACCGCGGCGGTGGTTTCCACTTTGTTTGCCTTAATTAGGTGGTTATCAATCTCCTATGCAA 
ATCTATTGTTGCCCAATGTCGCGAATATTGGTAATTAGTGCAGTCTTTCCTGTGATTTTTGCACACGCTAAATTATGCATGTTACCAATTTTGTT 
GCAAAAAAATAATATGTTTGTGTCTTCCAGTCCCAACTGATGGTTTCAAAAAGGCCGCCAAAGTTGAAAATATCGATATCATGCAGATGATATAT 
GGATACTTGTCCTGCTAAGGTAGTGCTGTCAGAGTTGCTTAAACTGATTAGTTATAATAAACCATCAGTTAATTCCAGGTATGAAAAAAAGCGTG 
CATTATGTAAATTGCATAGTGAAATTAAATCGACAATTTGTAATCTCGCTGCGAATTTCCAGGAATCTCCAGCCCTGGCCGGCTTATCGATACAT 
CTGGCACAAAAGGCGTGCAGAGTTGTTCTCCAAATTTCCTGTCGCGAATTTTATTCATTTATAATTAATTACAAGCCATGGCCGAACTGGACGAT 
TTCTTCGCGAAAAAGGACAAGAAGAAGTCCAAGAACAAGACTAAGTTCGTGACCGCCGACGAGATGGTGAAGAACCTGGAGGACGGCACCAAGCG 
CGAGGTGGTCAAGCCCAAAAAACCTGAGGTAGCCGCTGGCGGCGTGGCAGTAGTAGGCGAAAACGAGAATAGCGGCACCAAGGTGCCAGAGTCCG 
CCCCGGTGAGTCCGAATCTCAGTCTGGTGAAATCAGCGCCCGTTGGCCACCTAGGGCATGTTTATAAGTTCATTGTTTTGCCGCTCCGTAGCCCG 
TCGAGGAGGAATGGAAGGAGTTCGAGGAGGAGCAGCGCAAGGACTACAGTGGCTTAAAGATCGGCCAGCTGAGCACCATGTGAGTTTTCGAACGT 
TACGCCGACCGCCTAATCCATGAGCAGGAGAAAAGGCCACTTCCATTGATTTACTTTCCATTTGACCCCGCATATTGCGAAAACTCGCTTTCTAT 
AGCTCTTCGGCCCGTTCCAGAACTGCCCAGGAAAGTTCCGAGTCGCAAGCTGCTCGAGTGCCCTCTGCCCCGGACGGCGGCAACTACAATGAGGA 
CGATGAGGACAGCAATGGGTATGACAACGCGGACGTAAACAAGGAGCGCGTCGGCCACGGACCCTGGAAGAAAGTTGTCCCAGCCGAGGAGGTAA 
TGCAGATTCCGGTGCCCGTGGAGGTGGAAAAGCACTCGTCCAAGACCTACGTTTCACCCGCGCTACGATACAGCGTAAGTAAGAAGAGCGAGACG 
ATCCAGATCGAGTTTAATCAACTTCTCTGCAGCAGCAGGCCGGAAGTGGACTGGGAGGCGGCCCTACTGGGGGGGCATTGCGTCCACGACGCGCC 
GCCCCCGACATCACCAATACCGAGTTTTTCCCAACCTTAAGTGCGGCGCGTCCGGAGGAACAGCGCAAGAAGAAAAACGAACCCGCCTTCGAAGA 
AGTCCGTCACGGAAGCCGTTTCCAGCGTGTTCAGGAGTCCACGGCTGCCCCGGTGGCGGCCTCCAACCGATTCCAGTCGCTTGACGACGAAGCAA 
GCTAGGTCAGCCCACGCCAACGGTGCTGTTGCTGGCCACTTGAACCGCTATCTACCACAGTGTCCGCTGCCGAGCTCAAGCCATCCAAAGGTTCT 
ATGCCAGCACATGCAGATTTTCTAGGCTTTTTGATTCATATAAACCACCACTGCTACTCTTACCCTAAACCACAAACTACTATTGTTCATTTCGT 
TTTGCATTTTGAAAAATGGATTAATGTAATACAATACGAATAATACCCATGCGTTGAAATCCGTTTATAAACGTTACTTTTTTATACGTTTCCGT 
TTCTCACTTTTCCATGGCGAGGTCTTCCGGGCAGCTGTCCAAGTAGCTGCAAAATCAGTAAGAAAGATCATCTCTGGCTGAAACAGTGCGTACTG 
ACTTTTCATTGTACGGCCTCTCCGGCGTGTTGTGGTACGAGATCAGCAGCTCCTTGGCCAGCATCTTGTCTGGGAAGTTGCGAATCAGGTACAGC 
CAGCACTGAATGGTTCTAGACATAGGGGCAATAGTGAGTTATTACATCTGGCCATCACCCCACACCCTCATACTCATTCTGTTCCATTAATATGG 
TTTGTTGCTCGCGATCATAGTAGTCCGGGTAGTCTTCTAGTATGTCCAGTTTGGAGAGCATGGTCTCGTCCACTTCGTAGACTTCGCCTTCTATG 
TGGTTCCCCTCACCCGGGCGGGCGAGCAGAAATGGGATGTTGTAGCGGGTTCCGACAACCAGGGGAAACTTCGTTTCCGTTTTCCCCCTTCCAAG 
GAATCGGGCTTGGCCGTTCTCCTTTTTCGTCAGCCAGTGGTGATTGGGCTCACCGCGCTTCAACGTGCCATACACAAAGACCCTTGCCGCCATTC 
TCAGCTTTTCAGCCATTGCGCTTCCCGCCGCCAGGCTTATCAGTTTCAAGCGGTATCGTGCTATCAAATCCGACCTGTAGATTGCGCAGTCGACA 
TAACCCGTCCGCTCTTCTTCTTGCTGCCATTCGCACCGAATCTAGCCTCCGTACACCGCGAGAAACTGAAGCCCTCGGCACTGGGCTGTGGAAAA 
TTACCAGGGACTGCTGTCCAGCATCTTTACTTCTATAATAGAACGAGGGGCATTTTATTTTATTTTAAATTGCTTCAGTTTATTTGCTTTGTTTG 
TCGAAAACTATCAAATTACAGACTACGTTTGCTAAAACCTAAATAATGCGTATGGTACATATGTATCAGTTTATATCGTTTGCCCATATATACGG 
GTATATATCGATCA 
(SEQ ID NO: 361) 

Exon: 1001.. 1335 
Exon: 1422.. 1504 
Exon: 1636.. 1879 
Exon: 1933.. 2434 
Start ATG: 1123 

Transcript No. : CT12563 

GCGAATTTCCAGGAATCTCCAGCCCTGGCCGGCTTATCGATACATCTGGCACAAAAGGCGTGCAGAGTTGTTCTCCAAATTTCCTGTCGCGAATT 
TTATTCATTTATAATTAATTACAAGCCATGGCCGAACTGGACGATTTCTTCGCGAAAAAGGACAAGAAGAAGTCCAAGAACAAGACTAAGTTCGT 
GACCGCCGACGAGATGGTGAAGAACCTGGAGGACGGCACCAAGCGCGAGGTGGTCAAGCCCAAAAAACCTGAGGTAGCCGCTGGCGGCGTGGCAG 
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TAGTAGGCGAAAACGAGAATAGCGGCACCAAGGTGCCAGAGTCCGCCCCGCCCGTCGAGGAGGAATGGAAGGAGTTCGAGGAGGAGCAGCGCAAG 
GACTACAGTGGCTTAAAGATCGGCCAGCTGAGCACCATAACTGCCCAGGAAAGTTCCGAGTCGCAAGCTGCTCGAGTGCCCTCTGCCCCGGACGG 
CGGCAACTACAATGAGGACGATGAGGACAGCAATGGGTATGACAACGCGGACGTAAACAAGGAGCGCGTCGGCCACGGACCCTGGAAGAAAGTTG 
TCCCAGCCGAGGAGGTAATGCAGATTCCGGTGCCCGTGGAGGTGGAAAAGCACTCGTCCAAGACCTACGTTTCACCCGCGCTACGATACAGCCAG 
CAGGCCGGAAGTGGACTGGGAGGCGGCCCTACTGGGGGGGCATTGCGTCCACGACGCGCCGCCCCCGACATCACCAATACCGAGTTTTTCCCAAC 
CTTAAGTGCGGCGCGTCCGGAGGAACAGCGCAAGAAGAAAAACGAACCCGCCTTCGAAGAAGTCCGTCACGGAAGCCGTTTCCAGCGTGTTCAGG 
AGTCCACGGCTGCCCCGGTGGCGGCCTCCAACCGATTCCAGTCGCTTGACGACGAAGCAAGCTAGGTCAGCCCACGCCAACGGTGCTGTTGCTGG 
CCACTTGAACCGCTATCTACCACAGTGTCCGCTGCCGAGCTCAAGCCATCCAAAGGTTCTATGCCAGCACATGCAGATTTTCTAGGCTTTTTGAT 
TCATATAAACCACCACTGCTACTCTTACCCTAAACCACAAACTACTATTGTTCATTTCGTTTTGCATTTTGAAAAATGGATTAATGTAATACAAT 
ACGAATAATACCCATGCGTTGAAA 
{SEQ ID NO: 362) 

Start ATG: 123 

MAELDDFFAKKDKKKSKNKTKFVTADEMVKNLEDGTKREVVKPKKPEVAAGGVAVVGEWENSGTKVPESAPPVEEEWKEFEEEQRKDYSGLKIGQ 
LSTITAQESSESQAARVPSAPDGGNYNEDDEDSNGYDNADVNKERVGHGPWKKVVPAEEVMQIPVPVEVEKHSSKTYVSPALRYSQQAGSGLGGG 
PTGGALRPRRAAPDITNTEFFPTLSAARPEEQRKKKNEPAFEEVRHGSRFQRVQESTAAPVAASNRFQSLDDEAS* 
(SEQ ID NO: 363) 

Gene Symbol: anon-EST : f e2C9 
FlyBase ID: FBgn0022343 



Celera Sequence No. : 142000013385215 

AATTAGCACATTCGTTTTCCTCTTTAAATAAGGAGAATCCACTTAAAGAAATCTGATAATCGCTGCATATTGTGCTATTTATTTTATGTTAAAAA 
AGTATATTCGCAAGAGCATAAAAACAATCTGATCGGGTTAATCTTAACTTTTTCGCTGCTCCATTCTCATTGATTACTACAGCACAAAAAAAAAG 
TGGGGAAAGCGTGGAATCCTAACTAGAAGAACGGAAGATAGAATAATGGATAAAAAACAAAGGGGCAGGGCAGCAAAAACTTAAATCATTGGCAA 
ATACCTTTTTACGATGCGGATTTTAATCAAAGTTTTTGCCACCGCTGTCAGTTTCTTTTTTTTACTTAAAATGGTAATAAAAAAAAAAGGGGAAA 
CGACGATAAATAAACATTTTAAAATAATTGCTAACGGCGTGTTGTTGCGCTTTAGTGAACTTATTAGTCTACCCTCGCACCGGCCAATTTCCACA 
ATTGATTAGTTTAAAATTGCATTTTTTGGACATTTAAATGGGAATCTAATTGATTGCCATTTTTATTGAGAGCACTTAACACTATACAATGGCAG 
GTTTTGTATTTAAGCTATATATTCCTCGTTTATACAGTGTCAAATTAATTATTGATTTTATAAGCAGCTAAGAAAATGATTGTTTAATTTGGACC 
GATTGATCATCGAGGACTTAAAATGGACTTAAAATAATTGGATATTAGCAACTTTGAAAAGATCCCGTTTACTCCCAAAACTTCAGAATCAACCC 
ACGAACTTCTTTAAGCCAGTGTGACCCCACCCGCTCTTTAGCTCTCTTTCGCTCCATCTCGCTTCCGCACATCGATTGGCGAAAGCTTCGCTCGC 
ACTCGATATCAGCTCACAACTTCGCTTGCAAGCTCGCCGCGGAAAGCTCGCGATCACACGGGTCGAAATCGAAATCGAATCGCGGTAGAGAGCTT 
TTCGAAGTGCGAGCGTCGACGTTTTATTTCTGCGGCTCAGTCGGTTTTAGTTCGTTCTGTTGGAGAAAAGCAGCAATCACACGTTCGCAAGGTGA 
ACGCGAAGACACAGCAAAGTAAGCCCTTCCCCCCCACCAACACACACACCCACCCAAAGCAAATAAGTAACAATAAATAATGGAATGGCTGGAAG 
ACGGTTCTGGGCGATTTAAACAATTAGCGAAAGAAAGCGGCATTGAAATCCGTCTTGAATTCGCCCCGAAAAAGTGACGAAGCAGCGATCAAAGC 
GCAGAGCGAAAACACGCACACAGACTGCAAGTGTGTTAGATAATAAGTGCAGCACAAGTCCACACTTGAGTAAAATAATCCCTAAAAAAGCCGAA 
TATCAATTAGTTTTCCAAGGAGCTTGAAAAAGTGCGGTATGAAAACGTGAAAATTTGCGCGTGGAAAATTATTTTGCCTTGTCAGCTGACCCCCT 
TCCCCGTGTTCGCTCCATCCCTGTCGCACCGCGGGTCTTGTGATCGCCGCCGCTCTTGCGCTCGCTTGCTCTCCCATTTCGAAACTCGAAACAGA 
AGTGGGAGTTATTCGTATTCCGATAATGAAAAACCAATATGGAGAACGAGCGACGTAAAAAAGGCGGCCCAAAGATTTTTACCATTTCCCTTACC 
CACTTTTTTTTCATTTGTCAGCTGACGGCAATGACAGTAGTCTTGTGATCAACGTCAAAAGCAATTGTCAAATATTCGAACTCGAATGGAGAGCG 
AGAGAGCCAGAGCGGAGAGTTGCTCTCCCACTCCACCCTCTCTTGTTTTTCTTTGCTGATAATTATGAAAACCCGCATATTTTGAAAAACATGCA 
TTTCAGTTACATTCCTCCGTTGAATTTGTCAACCTGTGGTTGTTTTTTCACAGCTCTTATTTTATTTATTTAGCGATTAGTTTGACAAATTGCTC 
TCTTCGAACTTTCAAAGCTCTGTCACGTGAAACGAAAGCTCTGCTTTTAAAGTTTTACGCAGCATAATCAAAGAGGGGGAGTTAAAAAAAAAATA 
ATTAAATCAATCGAAATTATTAGCTGCTAACCTACAACTTTATAACCTATAATCGAAAAATTTGGGAGCTGTGGGCTCTACAAAAACTTAACCTG 
TAAATGTAGCAGATACACCTGCCCCTTGTCAGCTGACAGAGGGCTGAGCAAGAAATTAGTGATAAGAAAATGTTCACCTTTATCTTCGCCCTTTT 
GCAGCCAGCATTTAACAATTTTCCTCTTCTATTTTCCCTCCATTGCAGTCGAAAAAACAGAATAAAGCAAAATGTCCAACCTTAAGCGTTTCGAT 
GATGAGGAGCGTGAGTCCAAATATGGACGTGTCTTCGCTGTCTCCGGTCCTGGTAAGCACCTAACTATACTGACTAACCATAACTCATGCTATCT 
AAAAGTTAATAAAAATAAGTTAATAATACCTGTGAACTGGAAACCTTACTTTGATGATTAGTCTAGAACTTACACTTCTGTGTGAAATAATGGCA 
ACTTTAGAAATGTGTCCACCTATTTGTGATTAATATTCAAACAACTCAAACATTTTGGTTTCATTATTCAAAATTAAATGTGAATAATTTTAATA 
ATTAATTAATTGTTTCTTTAAACTTTTTTCTATAATTCTAACAAAAACATCATCAAGTATCATTAATAATAAAAAATTTTAAAAGAAAATGTTCA 
ATGGAACCTATCTTCGTTTGCTGAGTTATAAAAACTTCTTGAATGAAATGTAGCCCCCCTAACCCGACCAACCGCTTCATTCCAGTCGTCACCGC 
CGAGGCCATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCGGCTACTACGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCA 
CCATCCAGGTGTACGAGGAGACCTCTGGCGTAACTGTCGGAGATCCGGTGCTGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATC 
ATGGGCAGCATCTTTGACGGTATCCAGCGTCCCCTGAAGGACATTAACGAGCTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAG 
TTTGTCCCGCGTGGCCAGCTGGGAGTTCAACCCCCTGAACGTCAAGGTCGGCTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGA 
ACACTCTGGTCAAGCACAAGATGATTGTGAACCCCCGCGCCAAGGGAACAGTGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTC 
GTCCTGGAGACCGAGTTCGATGGAGAGATCACCAAGCACACCATGTTGCAGGTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCC 
CGCCAACCACCCCCTGCTCACCGGACAGCGTGTGCTCGACTCGCTCTTCCCCTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTT 
GCGGCAAGACTGTGATCTCGCAGGTGAGAGTCCCACAAATTGAGAATTTAAGGAGCGATGCCTCGTGTAGCCTCCATACACTCAAGTTTCATAAA 
AATACAATCCCTAATAAATCATTTACTTGCTTGCAGGCTCTGTCCAAGTACTCCAACTCCGATGTCATCATCTACGTCGGTTGCGGTGAGCGTGG 
TAACGAGATGTCTGAGGTACTGCGTGACTTCCCCGAGCTGTCCGTGGAGATCGACGGTGTCACCGAGTCCATCATGAAGCGTACCGCCCTTGTGG 
CCAACACCTCCAACATGCCTGTGGCTGCTCGTGAGGCCTCCATCTACACTGGTATCACCTTGTCCGAATACTTCCGTGATATGGGTTACAACGTG 
TCCATGATGGCTGATTCCACCTCCCGTTGGGCTGAGGCTCTTCGTGAAATTTCTGGTCGTCTCGCTGAGATGCCTGCCGATTCCGGCTACCCAGC 
CTACTTGGGAGCCCGTCTGGCCTCCTTCTACGAGCGTGCCGGTCGCGTTAAGTGCTTGGGTAACCCCGAGCGCGAGGGATCCGTGTCCATTGTCG 
GAGCTGTGTCTCCTCCTGGTGGTGACTTCTCCGATCCCGTGACCTCCGCCACTCTGGGTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAGTTG 
GCCCAGCGCAAGCACTTCCCCTCGATCAACTGGCTCATCTCCTACTCGAAGTACATGCGTGCTCTGGATGACTTCTATGACAAGAACTTCCCCGA 
ATTCGTGCCGCTGCGTACCAAGGTCAAGGAGATCCTGCAGGAGGAGGAGGATCTGTCTGAGATCGTGCAACTGGTCGGCAAGGCCTCTCTGGCCG 
AAACCGACAAGATCACGCTGGAGGTGGCCAAGCTGCTGAAGGACGATTTCCTGCAGCAGAACTCCTACTCCTCGTACGATCGCTTCTGCCCCTTC 
TACAAGACCGTGGGCATGTTGAGGAACATCATCGACTTCTACGACATGGCCCGTCACTCCGTGGAGTCTACGGCTCAGTCTGAGAACAAGATCAC 
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CTGGAACGTGATTCGTGAGGCAATGGGCAACATTATGTACCAGCTGTCATCCATGAAGTTCAAGGTGGGTTAACACGCAAATTTATCCATTGCCT 

AGACACTGGGTGACCACATTTTTCAATCCATTTCAGGACCCCGTTAAGGATGGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGA 

CCTGCAGCAGGCCTTCAGAAATCTGGAGGACTAGAGACCGCGCTGGCCCTACTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAA 

AAATGAAAGCAGTCAAAAACCATCCGAAAAAGCCTAATCAAACACCAACAATTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCA 

CAACTTCTTGGTGCCTGCGAGAGATGTAAACATTCCGGCCTGCGGTTAATACTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCT 

AGGCAACAGCAACTACAACGTCCTGCTATGTACTTCCATTTACAACAACAACACCAACATACACTTGAATAAAAGTACACGGACACTGGCGCACA 

CACAACACATACATAAAAGACACAAATACAAATGCATGCATAAATAGTATTATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTC 

ATGTTTTCTCCCTGTTTGTTTGTTAAATTTATGTAAATATTTAAAGTATGAAATATTAAATGTACGAATAAAGTGCAACAACAAATACATTTAAT 

GTAATTGAAAGTGAATTTCACTGGCAGCAGAATGGATATTAAAAATGTGTCAACTCGATAAAAAGATAATAAGTTAAAATATTTTTTTGAATTTT 

AAACCTTCATTATATAAACATACTTGACTATATGAAAGCTAAGAAAATGGGAATATATTGAAAAGCATAATCTAAAATCCTATATCATTCTCAGT 

C AT C AC CT T AGT C T T T CC AT AAAC TTCTGCCCCATTTATT T T AT X TAT T T G AC AAT AT T AGT AAAAAT T GT ACC AAGT GAAAAAAT T GT AT AT AT 

TTAAACCAAATGGAAAAGAAGTCCTGGGTTGCACCGCTTACCTCGTTCACCAACAGTAACGCAAGTAACGAAGACAGAGCCCGCAACTCGGGATC 

GTTAAACCAAGATGGGGAGCCAAGGTCCGTGCCTCACCCAACGGTTCCGGCTAAGAAGTGCACATGTCCAACCTGCGCTTGCCAACGTAATGACC 

ATAAATCCAATCCGGTACCCCCTCCCCCTCAATCACGCAAGCCTGTAGCCGAGCCGGAAAGTCCCCATGATACAGTCAATGACGAGGACAGTTTG 

AAGGACTTGAGACGTTCGACGGACCAATCCCACAAGAGCGCTCACATCGCCTTGGAGAAGAATGAGGACTCGGGTTTTGTGATCGAGCAGGTGGT 

TGATACGCACAAATATTCGTCGGATGAAGAAGAGGAGGAGGCGACGATGGGTCGCATTTTCGGAGTCTCCGGTCCGGTGGTCAATGCCGAGGAGA 

TGGCCGGCGCAGCCATGTACGAGCTGGTTCGCGTTGGACACTCCCAGCTTGTTGGTGAGATCATTCGACTGGAGGGTGATATGGCCACCATTCAG 

GTTTACGAGGATACTTCGGGTGTGAGCGTGGGTGATCCCGTCTACCAGACGGGAAAGCCGCTCTCCGTGGAATTGGGACCCGGCATCATGGGCAG 

CATTTTCGATGGTATCCAGCGACCATTGAGGTCCATCAGTGAACTAACCAACTC 

(SEQ ID NO: 364) 

Exon: 1001.. 1063 
Exon: 2234.. 2332 
Exon: 2746.-3443 
Exon: 3552.. 4529 
Exon: 4597.. 5229 
Start ATG: 2257 

Transcript No. : CT12570 

T T C GT TCT GT T GGAGAAAAGC AGC AAT C AC AC GT T C GC AAG GT G AAC GCG AAG AC AC AG C AAAT C G AAAAAAC AG AAT AAAGC AAAAT GTC C AAC 
CTTAAGCGTTTCGATGATGAGGAGCGTGAGTCCAAATATGGACGTGTCTTCGCTGTCTCCGGTCCTGTCGTCACCGCCGAGGCCATGTCTGGATC 
AGCTATGTACGAGTTGGTCCGCGTCGGCTACTACGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCACCATCCAGGTGTACGAGG 
AGACCTCTGGCGTAACTGTCGGAGATCCGGTGCTGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATCATGGGCAGCATCTTTGAC 
GGTATCCAGCGTCCCCTGAAGGACATTAACGAGCTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAGTTTGTCCCGCGTGGCCAG 
CTGGGAGTTCAACCCCCTGAACGTCAAGGTCGGCTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGAACACTCTGGTCAAGCACA 
AGATGATTGTGAACCCCCGCGCCAAGGGAACAGTGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTCGTCCTGGAGACCGAGTTC 
GATGGAGAGATCACCAAGCACACCATGTTGCAGGTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCCCGCCAACCACCCCCTGCT 
CACCGGACAGCGTGTGCTCGACTCGCTCTTCCCCTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTTGCGGCAAGACTGTGATCT 
CGCAGGCTCTGTCCAAGTACTCCAACTCCGATGTCATCATCTACGTCGGTTGCGGTGAGCGTGGTAACGAGATGTCTGAGGTACTGCGTGACTTC 
CCCGAGCTGTCCGTGGAGATCGACGGTGTCACCGAGTCCATCATGAAGCGTACCGCCCTTGTGGCCAACACCTCCAACATGCCTGTGGCTGCTCG 
TGAGGCCTCCATCTACACTGGTATCACCTTGTCCGAATACTTCCGTGATATGGGTTACAACGTGTCCATGATGGCTGATTCCACCTCCCGTTGGG 
CTGAGGCTCTTCGTGAAATTTCTGGTCGTCTCGCTGAGATGCCTGCCGATTCCGGCTACCCAGCCTACTTGGGAGCCCGTCTGGCCTCCTTCTAC 
GAGCGTGCCGGTCGCGTTAAGTGCTTGGGTAACCCCGAGCGCGAGGGATCCGTGTCCATTGTCGGAGCTGTGTCTCCTCCTGGTGGTGACTTCTC 
CGATCCCGTGACCTCCGCCACTCTGGGTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAGTTGGCCCAGCGCAAGCACTTCCCCTCGATCAACT 
GGCTCATCTCCTACTCGAAGTACATGCGTGCTCTGGATGACTTCTATGACAAGAACTTCCCCGAATTCGTGCCGCTGCGTACCAAGGTCAAGGAG 
ATCCTGCAGGAGGAGGAGGATCTGTCTGAGATCGTGCAACTGGTCGGCAAGGCCTCTCTGGCCGAAACCGACAAGATCACGCTGGAGGTGGCCAA 
GCTGCTGAAGGACGATTTCCTGCAGCAGAACTCCTACTCCTCGTACGATCGCTTCTGCCCCTTCTACAAGACCGTGGGCATGTTGAGGAACATCA 
TCGACTTCTACGACATGGCCCGTCACTCCGTGGAGTCTACGGCTCAGTCTGAGAACAAGATCACCTGGAACGTGATTCGTGAGGCAATGGGCAAC 
ATTATGTACCAGCTGTCATCCATGAAGTTCAAGGACCCCGTTAAGGATGGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGACCT 
GCAGCAGGCCTTCAGAAATCTGGAGGACTAGAGACCGCGCTGGCCCTACTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAAAAA 
TGAAAGCAGTCAAAAACCATCCGAAAAAGCCTAATCAAACACCAACAATTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCACAA 
CTTCTTGGTGCCTGCGAGAGATGTAAACATTCCGGCCTGCGGTTAATACTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCTAGG 
CAACAGCAACTACAACGTCCTGCTATGTACTTCCATTTACAACAACAACACCAACATACACTTGAATAAAAGTACACGGACACTGGCGCACACAC 
AACACATACATAAAAGACACAAATACAAATGCATGCATAAATAGTATTATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTCATG 
TTTTCTCCCTGTTTGTTTGTTAAATTTATGTAAATATTTAAAGTATGAAATATTAAATGTACGAATAAAGTGCAACAACAAATACATTTAATGTA 
A 

{SEQ ID NO: 365) 
Start ATG; 87 

MSNLKRFDDEERESKYGRVFAVSGPWTAEAMSGSAMYELVRVGyYELVGEIIRLEGDMATIQVYEETSGVTVGDPVLRTGKPLSVELGPGIMGS 

IFDGIQRPLKDINELTESIYIPKGVNVPSLSRVASWEFNPLNVKVGSHITGGDLYGLVHENTLVKHKMIVNPRAKGTVRYIAPSGNYKVDDVVLE 

TEFDGEITKHTMLQWJPVRQPRPVTEKLPANHPLLTGQRVLDSLFPCVQGGTTAIPGAFGCGKTVISQALSKYSNSDVIIYVGCGERGNEMSEVL 

RDFPELSVEIDGVTESIMKRTALVANTSNMPVAAREASIYTGITLSEYFRDMGYNVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLA 

SFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSINWLISYSKYMRALDDFYDKNFPEFVPLRTK 

VKEILQEEEDLSEIVQLVGKASLAETDKITLSVAKLLKDDFLQQNSYSSYDRFCPFYKTVGMLRNIIDFYDMARHSVESTAQSENKITWNVIREA 

MGNIMYQLSSMKFKDPVKDGEAKIKADFEQLHEDLQQAFRNLED* 

(SEQ ID NO: 366) 

Name : vacuolar ATPase subunit 
Classification: enzyme 
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Celera Sequence No. : 142000013384592 

TGTTGGCATTTTGCAGAGGCTACATCTGCCACAAAAGCTTCCAACTCAATTAGAAGCCTTAGCGTCGTTTTCATGTTGTTTACCCACAGTTGGGA 
TGCGGCGCATACTGAAAACGAAACTGAAATGCGAGTTCAATGTTTTGATTTTGAAATGGGCTCTGAAAATCCTTGAAAAGTGTCATCGGCCGGCC 
ACAGCAGATTAATTTACAAAATAGTCGAAATTCTTCCGCAATAAAATGTGAAGAAAAACAAAAGCCTGCGGTATGGAGATGATCTATTTATGAGG 
CGAAATCCACTGCCCGAGACGTGTCTCCTTGCGGCAAAGAGTCAACGCACTTCGCAGCTCATGAATTTTACTACAATTGCCGCTTTACGCACTTT 
TATCTGAATATCTGTTGCCTGAAATTCTGTTTTCCTCCTTTTCTTTGTGCATTTTAATCTAATTTATTATTGCCGGCGAGGCGGAATTTTTCTTT 
GCGTAAGTTTTGGACCGATTTTAATGCTCAAGGGGAAAATGCTTGACGTGTTAAGTCTGAAAACTTGTGGTTTCTAATATGTAATGGAATATAAA 
ACATTATTTTTTTTAGCTTATCTCCCACTTAAATGATACATTGTTGAACCCTCGTATCATATATAAATTTATATATTTTCATTAAATTGCTGTCA 
AATAATTTAAGTTGAACCCCTATGTGACTAAAATAGAAACCATTTTAGTTTAAATAAATTTCTTTGAAAATCTAATTAATACAAAAATCTACTTC 
TCTTTTGGGACATCTGGCGCTCAGAAAGTATACTAAACAGCACTGTGGCTTGAACTGCTTGCATGGTGGAGGTATTCGCTAGGAGCTGGCAGTAC 
TAAAAGCAGAGAACACAACCGGAAGCGTTATTCCACAAAATGTGGCCTTTTAAATTATATGTGAAAATACGATAAGTACCTTAAAATACCGAAGG 
CGCGCAGTTTTCAGTATAACAGCTGGCCGAGTAACTGGCCACACCATTCCAATTAGAAAGAAACCTATTTAAGCAAAACGCATCGATCTCTCCGT 
AATTTTTTGTCCGCTGAGTAATCGGAAATCGGCAACGGAATAATGGGCAATTTCGACGGCGATTCTGGAACGGTTTTCGAGCAGAGCTATGCGAC 
GACCAACGGAAACGGAAAGGTGCCGAATGGCAATGGGGCGGCAGATCTCAGTGCCCCGGAAAAACAGGAACTTCTCCGGAAGTTCTCGGCACCCG 
CGTACACTGAAATTCGCACGCGACGCTGCCAGCCGAAGCCCGCAAACGACGCCCAGGTGAGCCAGGCATTTTCGCTCAGTGTGCGGGGTGAAACG 
CGAGTTAAACACTTGGACGAAACGAAAATAACGAGCTAAAAGGCCATAAAAATTGGCCAAAAATGTGGTTCCATAGCTGGTTCCATTCACCCTCT 
CGCTCGCTCTTCCAGCCGGCGAATTATTGATTTTGTGTGAGAGCGAGGGAGCGCGCATAAGAGAGAGGGAGAGAGAGTGAGAGTGAAGGCGTGCG 
TCAGCTGGAGAATTTCGGCCCAATTTTTGCTGCCCTTTTTGGCGATTTTCCTCCGCTTTTCCCGCTTCTGGGTCATCGGGCAGCCACAGCTGTTT 
TCCCTCCACTGCAGTGTTTTCCTCGCGTGTTTTCCTTTTCGCCGCGAGTGCTGACCATTTGGCGTATTTTGGCTACTTCCGGATGATAGAAACTA 
TTAGAACTCTGCGGAAATATCCTATCATCATATACGATCAGAGAGTTATTTCATAAATTGTGATCATGAAAGGCAATTTGTAGTGTTGGAAATTA 
AAGTATTGGGCAATATTATTTCACTTTCATTATTATAATTTTAAGTAGTCTATACATTCCTCTGAGATAGCTATCTTACAATTGACTAAGCCCTA 
AGATAAGAACATCAGCATGTTAAAGTAACTTAGATTTCAGAAATTTCATAATACCTATTTTAGATGTGCGTTAAACTGTTTGGTACGCTAGTCCA 
TTTTACCACATGCACTCTCTAATCTAGCCATTGTCTCTTCTTCGGCACTCCTGCACTCATTCGAAATTTTCCTAGTGACTGGAGTACGACCCCAA 
AGCAAACACACACATAAGTACGTAGTTTGGGTGCACAATGTGTGTGGGGAGAGCGGCACAATGCTGGTCTGGTCATAATATTGTCATGCCACGCC 
ATGCCCGGCTCGCCTTATCAGGCCCATTCCACCAGTCTTTTACGTAAAGCCTCGCAGAATGCGATTCGCTGGTGAATCACAATGCTCCGCAATCT 
CCGAAGGCAGCGGTTTTGGGTGGGCGTTGATTTGTCGGTGATAAGAGTTTTGTTTTAAAATTCATGGACTCTCCCTGCGTCTGGGCTGCCAAAGA 
ATAACTGTTGGCGGGTATACTTAACCGATTATCATATTTTTTATTATTCATAACTTTTTAAAAAGATAAGATAAACATATTATAACATTACATTC 
ATTGGGTAATTCCGATACAATTCGATCTTTGGGGAGTGAGCAATTGTTAGTTTTCTTTTAAGTACCTAAAGATTTCATTATTGGTAGCTTTGTAT 
TTCTTTTTTTCGGAAAGTAATACAATGGTTTACATAAAATTTATTTAATGAAATAGTTCCGTACCATGGCGACAACTTATTGAAATCATTCAGCT 
TGATTTAATGGCAACTAAAGACCTTTTCAAATTTTTAATCACACGAAAATAATGTTAGCGCGTGCAAGTCTCATATCTCCAATATCAACGGATAG 
AAAACTTTTATCAGCTCGGCAAGATTACAGTATGCATGATTTCAAAGGTAAAGGCTTAAATAAAAAAGAAATACTTAAGAATATGAAATTGCCAA 
AGCAATTATCTACTGACTCACAGCTAATACTTCTTGGAATCCCCACAAAATCGGTTAGAACTTGGCGTGCGCTGGTCGATTGTCAGTCGTTATCT 
TGGACTCCATGGATCCCCAACATATGGAAGTCATTATCTGGATCCCACACCATGCTCCCTACTTAGCTGTCCTTCTGTACGCGACCCATAGCGTA 
ATTAGTATTACACTTTTTCTATTTATTTATTTATGTCGGTTTCGATTTTGATCTTTCTACGGTAGCTTCGAGCTCGTCTATATACCCTAACACTG 
GGACTCCATTAAATAACTAGGCATCGCGTAACCCACTTGACTTTTCGCTCCCCGCACTCCGTCTCATCGTATTTAGTAAACAAGCCCGGTGGCAA 
ATGATAATAAACACGAAATATTATTGTCGCCGACTCGTTATTGTTATACGATTAAGCTGACGTTCAACTATATATATTTTTCGGAATATATATCT 
GCAGTTTGTGTTGCTCAGTTAATGAGATATATATATGTCTCGTATCGGATCCTTTGGCCAGCAAAATCAATAAGCAAATGATATATACACTTAAA 
TAAGTACGTTAATTATAACGCTCTATTTGTTAGCTGGTGAGGTTGAAGTGTGCCAAGTTGGGCACACAAATACTATATACATTTATGGCTTCCCC 
TGTATGATTTTTGTGTCTGTCTCTTTTTGTGTGACTTCATGTCGAACTCGGCTCATGTGAGCGGTGGAGCAAATTGTTCAGCAGTTATCGTGTGA 
TTGTATTTATATATGTACATACGTACCTTGCATATATGGATCTTTTAGTTAAATGTTAATTGACTGATGAATTGACTGAACGCATAGAAAGGGCA 
AGCAAGTGAGAATGTAGAATTTTTCATTTTTTTATAAAAGCGTTTACGCATGAGAACTATAAGTTTTGATATACATACTCGTATAAATGGGTATT 
ACTGTGCTGATCCCTTTGAAGCGAACATGTTTACAATCTGTCTTACAATTTCCCATCTAGCCCGTCTAACAATATTGATTATTTGCTCTTTGATG 
AGTGATTTTCAATGGAAATTATGAAGATTCACAGTGATACCAATGATAGCGATAACCAGCCAATAATCGTTCCTCTCCCCTCTTACAGAGTCCCA 
GCTCGATCAACTACCATCCGGAAAAGAAGTCGATCCCCAATGGGCATGCCTCCATTCTGGATCAGGAGTCCAAGCCCAACCAGGCAGAGGGCAAA 
CTTTCGCCGGAGCTGGAGCACTTCCAGAAAACCTCCTTTGAGGAGGTGCCTTTGCACACGGCCTGCCTTACGTATTTGGGATTCTATCTACTGAT 
GATCCTGGGCTACATCAACCAGTTGCTCTTCGTGCCCAAAGTGGCCACCGAGAAGGGTCGGGATGGCTATGTAGCCCTCTACGATGCCTTCGAAA 
GCTTCTACTCGAGGTACGTTTACAGGAGAATCAAGGATTGCTGGAACCGACCCATCTGCAGTGTTCCGGGAGACGAACTCACTCTAAAGGATCGC 
GTAACCGATGATTATGGATGGAGTTTCAAGTTCACTGGAACGGAGACACGTTGCTTAAATCTGGGATCCTACAATTACCTGGGATTTGCCGCCGC 
TACTGGACAGTGTGCTGATGATTCAGAGGAAAGTGCTCGGAGCTCTGGACTGGCATACTGCAGCTCCCGTTGTGAGCTTGGTGACAACGAGCAGC 
TGCAAGAGCTGGAGGCTCTAACTGCCAGATACTTTGGCGTGGAGGATGCCATTGTCTTTGGAATGGGTTTCGCCACAAATGCGTTGAACCTTCCC 
TCACTGCTGGGACCCAACAGTCTTGTGATTAGTGACGAAAAGAATCACGCCTCCATCATTTTGGGACTGCGTTTGTCCGGGGCCACCACCAAGGT 
CTTTAAGCACAACAACATGCGCGATCTGGAGCGCGTCCTCCGTCAGGGTGTGTGCTATGGAAATCCCAAAAAAGGTGGTCAGCCATGGGACAAGG 
TTATGATCTTGGTCGAGGGCATCTTCAGCATGGAGGGGTCCATAGTCCGTTTGCCCGAGGTTATTGCTCTGAAAAAGAAATACAAAGCATACTTG 
TATTTGGATGAAGCTCACAGCGTGGGAGCCATGGGTTCCCGTGGACGAGGAGTTACAGATTATTTTAATGTGGATCCGAAGGAGGTGGACATTCT 
GATGGGCACCTTCACCAAAAGTTTTGGCAGTGCTGGAGGCTATCTCGCTGGATCCAAGGTATGCTTGAGTGGCTACACTTTCGCATATTCCTCAC 
TTATCAATAATCTTTTATCCTCCCCAGAAACTGATTGACTTTCTCCGCACCAACAGCCACGCCCATTGTTATGCTGCTTCGATCTCGCCACCCAT 
CGCACAGCAAATCCTTACCTCCATGAAGACCATTATGGGTGAAGATGGCACTGACATTGGACGCAAGAAGATCCACCAGCTGGCCAGGAACACAC 
GTTACTTCCGCCGACGTCTGGCTCAGCTGGGAGTGATCACATACGGTCACGAGGACTCCCCAGTCGTGCCCATGCTGGTGTACCTGTTTTCCAAG 
ATTGGGTAAGAATGGGGTCATAAAACCAAAATAACTTTTGTCATTAGATTGTGCGTGGAGGAGCATTTCATTTTTTAAAAGATCAAACGGGAAAA 
AACTTCAATTATCCTGTGGAGTCTCTTTTTACTAAACTGCATTTTCTCTTTGTAGCGCTGTGGTTCGCACCTTGACGACTCGCCACATTGCCGCG 
GTGGGAGCTGGATTCCCGGCCACACCCATCATGGAAGGACGCATCCGATTCTGCCTTTCAGCCGCCCACACCAAGGAGCAGCTGGACTTTGCCCT 
GGAGGCGATAGACGAGATTGCCGATGATCTGGGCCTAAAGTATTCTCGCAAGCCACGCGATCCGAATCCCGTCATTTACTAGAATCACACGCACA 
CACATAGAAGTTAAGCTTGGGGAAATGGTAACCTCCCCGATTGGTAAACCAAAACTACAACACAGTCCAGATAGTGCTAAATCAATCACGAACGG 
TGGCTTACATTGACCAAGTTTCAGTTGCACTGGGCTTCTATTTATGACTAATTCTCTGGTTATATGCCTTAAAAGGGAAGATCACTACAGCAACA 
GAAACAAAACACTACAGCACATAGCTTAAATGATAAAATATTATTAATTATTTAGCATTTACCTCTACGTAGTTGTTATTGTATATGTAAAATCC 
AATCGTCTTCTTTGGTCGGCTGCTGCATTTTAAGTATTTATAGAAGAGAAAATGACAGCGCAATGTATTGGGCAAATCAGGGTTTGGTTACTTTT 
ATAGCTAAGCTAAGGGAAGCCCCTTTTTCGTGGGAAAGCCTCGAGCACCTCGCTTCTACCAATTAAGCATACTAAGCTGAGCCAAATGTAACGTC 
TAAAGAGTCAGCGATACAATTGTAAATCACTATTTGGGTAAGTCTTCGGGGGACCCAAAAATTGGGTCAACTTTAGGGCATTGTCAGGAAATATA 
TATGTATAATAATTATGTATTATTGATCTTATTTTTCCACCAAAAAAATAAAGAAATTGTTGCTTTATCTAATAAACTATATTGGAATAGCTTAC 



FIGURE SHEET 201 



TGTAAAAGTATTCACTGTACGGGATTCTTAAGTCTACTAGTGTAAATATTTATTTAAAATATATAAAACTTGGCGGGCAGATGAATCTTAAAGTC 
GAAAAATGTTTTGCCTTGTATTTTACATAAATTTTGACACAACTTATGCGACCTCTACAGTTAATTTACAGAAGTTGGATACTCGACGCAAATAT 
AAAACACATATGTAGAGAATCCGCTAAGTCTAGGGCCTTAGTTCTTATTTTTGAGAAAATAAGAAGTACAGTGGCCATTAATTAAATAAAGCTAC 
TTCTTGAACCATTTTTAATTAAGTAAATGCCTGACTTTTAAGCGATAATTAGAGATTAGCGATATCAATTAAATCGATATTGATAATATGTTATT 
TTAGACGATTTGGAAGTAAAATAAATAAAATTTGTTTCAGTATCATTTTGGTTGCACTGCCTCCTTTCTTTAATTTAAAATACATAAACTCCATT 
TGCAAGGGGTCAAGTTAATTTAAGTTTATGTAATAGGAATAGCAATCAATTGACAGTTCGACTTTCAAATCGTTCCTACCCAACACGATTTCACA 
CTTAAAGCGTCGAAGCCGAACTACCAATCCTTTGAATAACAATTTTCTAATTAACAGTAACTCATGACATCTCTACATCGCTCCCTGTTTTCCTT 
TTCCGGTGCTGACCACAATGGCAAATAAATTCCAATTGTAATTGTCTCTGCCTTTTAAGAAATTAAGACAAAAGGCGAACAACGCTCAGATCGAA 
GGACAAGAAGGAGTTTCGAGAGCGGGTGGCTTGTAAAGAGCCGTGTTGGTTTATGGGCTTAGGCAATACTTTCAATTTTATATTCTGGACAATAT 
TTArTGAGTGGTTCTTGTTGAATCTGTGGGGAGTACATGTAAACTGAATTTAAATATTTTGCATGAAAGAGAACATTCGCTCGTTAAAAGTGCAT 
TTGAATTGTTTTTAAAGCAACACGTTACACAGG 
(SEQ ID NO: 367) 

Exon: 1001.. 1291 
Exon: 3 984.. 5093 
Exon: 5158.. 5420 
Exon: 5566. . 6443 
Start ATG: 1088 

Transcript No. : CT12 625 

AATTAGAAAGAAACCTATTTAAGCAAAACGCATCGATCTCTCCGTAATTTTTTGTCCGCTGAGTAATCGGAAATCGGCAACGGAATAATGGGCAA 
TTTCGACGGCGATTCTGGAACGGTTTTCGAGCAGAGCTATGCGACGACCAACGGAAACGGAAAGGTGCCGAATGGCAATGGGGCGGCAGATCTCA 
GTGCCCCGGAAAAACAGGAACTTCTCCGGAAGTTCTCGGCACCCGCGTACACTGAAATTCGCACGCGACGCTGCCAGCCGAAGCCCGCAAACGAC 
GCCCAGAGTCCCAGCTCGATCAACTACCATCCGGAAAAGAAGTCGATCCCCAATGGGCATGCCTCCATTCTGGATCAGGAGTCCAAGCCCAACCA 
GGCAGAGGGCAAACTTTCGCCGGAGCTGGAGCACTTCCAGAAAACCTCCTTTGAGGAGGTGCCTTTGCACACGGCCTGCCTTACGTATTTGGGAT 
TCTATCTACTGATGATCCTGGGCTACATCAACCAGTTGCTCTTCGTGCCCAAAGTGGCCACCGAGAAGGGTCGGGATGGCTATGTAGCCCTCTAC 
GATGCCTTCGAAAGCTTCTACTCGAGGTACGTTTACAGGAGAATCAAGGATTGCTGGAACCGACCCATCTGCAGTGTTCCGGGAGACGAACTCAC 
TCTAAAGGATCGCGTAACCGATGATTATGGATGGAGTTTCAAGTTCACTGGAACGGAGACACGTTGCTTAAATCTGGGATCCTACAATTACCTGG 
GATTTGCCGCCGCTACTGGACAGTGTGCTGATGATTCAGAGGAAAGTGCTCGGAGCTCTGGACTGGCATACTGCAGCTCCCGTTGTGAGCTTGGT 
GACAACGAGCAGCTGCAAGAGCTGGAGGCTCTAACTGCCAGATACTTTGGCGTGGAGGATGCCATTGTCTTTGGAATGGGTTTCGCCACAAATGC 
GTTGAACCTTCCCTCACTGCTGGGACCCAACAGTCTTGTGATTAGTGACGAAAAGAATCACGCCTCCATCATTTTGGGACTGCGTTTGTCCGGGG 
CCACCACCAAGGTCTTTAAGCACAACAACATGCGCGATCTGGAGCGCGTCCTCCGTCAGGGTGTGTGCTATGGAAATCCCAAAAAAGGTGGTCAG 
CCATGGGACAAGGTTATGATCTTGGTCGAGGGCATCTTCAGCATGGAGGGGTCCATAGTCCGTTTGCCCGAGGTTATTGCTCTGAAAAAGAAATA 
CAAAGCATACTTGTATTTGGATGAAGCTCACAGCGTGGGAGCCATGGGTTCCCGTGGACGAGGAGTTACAGATTATTTTAATGTGGATCCGAAGG 
AGGTGGACATTCTGATGGGCACCTTCACCAAAAGTTTTGGCAGTGCTGGAGGCTATCTCGCTGGATCCAAGAAACTGATTGACTTTCTCCGCACC 
AACAGCCACGCCCATTGTTATGCTGCTTCGATCTCGCCACCCATCGCACAGCAAATCCTTACCTCCATGAAGACCATTATGGGTGAAGArGGCAC 
TGACATTGGACGCAAGAAGATCCACCAGCTGGCCAGGAACACACGTTACTTCCGCCGACGTCTGGCTCAGCTGGGAGTGATCACATACGGTCACG 
AGGACTCCCCAGTCGTGCCCATGCTGGTGTACCTGTTTTCCAAGATTGGCGCTGTGGTTCGCACCTTGACGACTCGCCACATTGCCGCGGTGGGA 
GCTGGATTCCCGGCCACACCCATCATGGAAGGACGCATCCGATTCTGCCTTTCAGCCGCCCACACCAAGGAGCAGCTGGACTTTGCCCTGGAGGC 
GATAGACGAGATTGCCGATGATCTGGGCCTAAAGTATTCTCGCAAGCCACGCGATCCGAATCCCGTCATTTACTAGAATCACACGCACACACATA 
GAAGTTAAGCTTGGGGAAATGGTAACCTCCCCGATTGGTAAACCAAAACTACAACACAGTCCAGATAGTGCTAAATCAATCACGAACGGTGGCTT 
ACATTGACCAAGTTTCAGTTGCACTGGGCTTCTATTTATGACTAATTCTCTGGTTATATGCCTTAAAAGGGAAGATCACTACAGCAACAGAAACA 
AAACACTACAGCACATAGCTTAAATGATAAAATATTATTAATTATTTAGCATTTACCTCTACGTAGTTGTTATTGTATATGTAAAATCCAATCGT 
CTTCTTTGGTCGGCTGCTGCATTTTAAGTATTTATAGAAGAGAAAATGACAGCGCAATGTATTGGGCAAATCAGGGTTTGGTTACTTTTATAGCT 
AAGCTAAGGGAAGCCCCTTTTTCGTGGGAAAGCCTCGAGCACCTCGCTTCTACCAATTAAGCATACTAAGCTGAGCCAAATGTAACGTCTAAAGA 
GTCAGCGATACAATTGTAAATCACTATTTGGGTAAGTCTTCGGGGGACCCAAAAATTGGGTCAACTTTAGGGCATTGTCAGGAAATATATATGTA 
TAATAATTATGTATTATTGATCTTATTTTTCCACCAAAAAAATAAAGAAATTGTTGCTTTATCTAATAAACT 
(SEQ ID NO: 368) 

Start ATG: 83 

MGNFDGDSGTVFEQSYATTNGNGKVPNGNGAADLSAPEKQELLRKFSAPAYTEIRTRRCQPKPANDAQSPSSINYHPEKKSIPNGHASILDQESK 
PNQAEGKLSPELEHFQKTSFEEVPLHTACLTYLGFYLLMILGYINQLLFVPKVATEKGRDGYVALYDAFESFYSRYVYRRIKDCWNRPICSVPGD 
ELTLKDRVTDDYGWSFKFTGTETRCLNLGSYNYLGFAAATGQCADDSEESARSSGLAYCSSRCELGDNEQLQELEALTARYFGVEDAIVFGMGFA 
TNALNLPSLLGPNSLVISDEKNHASIILGLRLSGATTKVFKHNNMRDLERVLRQGVCYGNPKKGGQPWDKVMILVEGIFSMEGSIVRLPEVIALK 
KKYKAYLYLDEAHSVGAMGSRGRGVTDYFNVDPKEVDILMGTFTKSFGSAGGYLAGSKKLIDFLRTNSHAHCYAASISPPIAQQILTSMKTIMGE 
DGTDIGRKKIHQLARNTRYFRRRLAQLGVITYGHEDSPVVPMLVYLFSKIGAVVRTLTTRHIAAVGAGFPATPIMEGRIRFCLSAAHTKEQLDFA 
LEAIDEIADDLGLKYSRKPRDPNPVIY* 
(SEQ ID WO: 369) 

Classification: enzyme 
Gene Symbol: lace 
FlyBase ID: FBgn0002524 



Celera Sequence No. : 142000013384832 

TGTACAGGTCGTGGTACAGGTGCCTGTCAATCTTCTTGCTGTCGCGGTACTTCTTCAACAGGCGGCGCAGAACGCGCTGGCGCTGCATCCACAGC 
AGCTTGGTAGGCATGCGGGCGTTCGCAGTACCCTTACGCTTTCCGAATCCGCAGTGACGTCCCTTGCGGCGGGCCTCGGTGTTTTTGCGCACACG 
GTAACGGGAGTGGACCACGACGGGCTTCTTGATGATCAGACCATCCTTGATAAGCTTGCGAATGTTCTGACCTGGAATGGGTTAACAACACAGTT 
AGCTGGGCTGTCTATTCGACCATTTAGACCATTTCTACTCACGCGAGTTTGTGTTAGCGATCTCGTTGATTTCATTGGGATCCAACCAGACCTTC 
TTCTTGCCGCATCGCAGCACGGAGGCTGCGAGCCTCTTCTGGAGCTTTAGAGAACTGCAAAGGAGGGTAACTTGTTAGTCCATCATTTCGAAAGT 
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AGCGTGCTGGGATGCGCCTATCAGCTTGTGTGTGTGTGTGCATGAGCGTCGCATGTTCGGTTATTTCACAACTGCCGGGT^AACAGAATGGCACTT 
GTCCGCCCGGCAGCTGCATCATACGGCACGCGCTCGCCCGGCACACTTTGCAAACATTTAGACCCCGCGAGCGCAACTTTTCGCTCATATTTGCG 
AACTCAAATCTACCTCATGCTGTCGACCTCGTCGTGGCTAGGAAAGAAAGAAAAATTTGACAGCCGGAAGTCAGCACCATTTTCTGAAGGGTGCT 
ATCGATACATCGATAACGTACTCGGGGGTCTGGCAACGCCTCGGGGATGTACTAGTTTAAGCGGAGTGGCAAGATTTATTTAAAAAAAATATTTA 
ATAACAACTACTTTGTGCGGTTTCCAGCAGCTTGGATGTGCAAGATCCGGCCATATTTATATTGTTCGAATTAAGTTAGAGCAATTAAGTAACGA 
AGCTTGGTATATTTTATTTATCTACTTGCGGTCACACTAAATTGGCTAAGAAAAACAAAAACAACTTTTTAGCCCAGTTGCAGCTGTTCAGAGTG 
CAATTTCTGCAATGCAGCACACGCTTATACGCTGCTTGGGCATGGCCCGGATATCCCTGATGCGTTTGCAGCCCAGACCCACAGTGGCAGCCTCC 
GGGGGACAGGAAGCTGGGTCCATCTCGAAGCCAACACAACCGGTCAGTCGGTCTTTCGCCTCGCTGCCGCAGGAGCAAGACAAGAAGGAGCAGAA 
TGCCAGAGAGAGCCTTAACCGCCTGCCTCGCCTAATGGACTTTCCAGAGATCGTGTGGCCATCGGCGCTTAACTCGTTAAAAAATTGGATCACCA 
TACAGTTCATAATTCGACCCTACTTCGACAGCGAGTTTCAACTCAAGGACTTCATATACGGCGCCAAGCAAGCGCTGCAGGTGCGTCATGGTTTG 
TCTATCATGAAGTGTAATGATATTAATTCCCTCATAGGTAGTATCCTCAAAACTGATGGGCGGCGACTTAGACTCCCTGGACAATCTTGTCTCGC 
CCGAGGCGATCGCAGAGCTTAGACCGGTAATCCAAAAACTGTCAATGACGCAACGGAGGCAGCTAGAAATCAAGGAGAGCGACATATACCTCAGC 
TTTCCCTATCAGGTAGGTATTATGTTTGACGATGCAAACGACAAGCTGCAGAAGCGTTTCGTAGAGATCACCATGGTGTTTCACGTAATGCGTGG 
TCTATCCGAGATGCGGGAACGCGGCGAGGAAATACCCTGGAACATGGGGTAAGTTTGGACTGTCACTGCCCGAAAGCTGAGCTCCTCATACCCAT 
TTGTTGCAGGACCCTTCCCGAGTACCAGGATAAGGTTTTCATCTGCAACTACCGTTTTGTAAAGGAGTTCACCGCTGGACACCAATCGGACTGGA 
CCGTAAATGTGGCCAACCAGTTTAGGGCCATCGACCTTATTAACGAGACCATATAACATCGGCCCATGGCTACCATCATCCGCGCTGTTCGCCGA 
CTCATAGCATATATTGGCGTAAGCCACCGCCTTCGTTGGAATAAAAGTTTCGTTTTCCGCATTTAGGTATAAAAAACTAAAATCTTTACTGTGGA 
TCTCTGGTTTAAATTAGGTTCATGTGAAATTAGTGCATTTGGTGGACAAGCAAAGCGCTCAGTGTCCCTCGTGCTTTGCTCTGTAGATGCCTTGG 
GGATCGTACTTGTTCAGGAGGAGCTTCCATTCGCCGGCTTTGTTGGCGCGAAGGTAGTTGATTACCTTTTGTGAGTTTTTGCGTTGGACTTCCGT 
GCACTTTGAGCAGTCGGAATGAAGGGCGTCGGGTAAAAGGCGCTTAAGTTCACGACCCTCTGCTGTGCATGGGCCCTTGTCCATCAGGCACTTGA 
GATAATTACCCAACACTCGGTTGTTGCCCAGAACTTCATCCACGTTGACACTGTCGTACTTGTTGGTGTAGGTTTTTTCAGGAGCAGCTGCCGCA 
AGACCGACACAAACACAGAACACTAGAGCTAGCGAGGCTTTCATCTTGGAGGATTATCTGGAAAGGCAGGTGGGAAATTCAAATTATTAAAGGTA 
AGCCACGAAACACATTTCCTAAAATGAGGGATTTTTGGTTGCCGGGTAAGCGGATAATAAGAAGGCGTCAGGCAGTCTCGGCTTTTCTGTGTTTT 
CGACGCATGTCAAAGAATCATCCAAAGCGGTTAGTGGTCTGCCCACCAGCTCATGCATTATTTATGGCCCTTGGCATATATATCGCATATGCATA 
CTAATTGGCAATCGTTTGGATTTAATATTTTTGAAAAGATTTCACAATTTCACCGACACCACGTAGTTTTAGCAGCTTTCGACTTATTTGCGTCC 
AATATCCGAATCCATGCTAGCCCGTGGCCAGCTGTTTGGTTTTCGCAATCCTTGATGGGTGCTGCACCTCAATTCCTCGGGTGCTCGGCCTAACC 
AGGGCAAACGAACCACGGCTTGAACAATGGCTTACTCACTTCTCTTCCGGTGATTTTCGGTGGCCCGGGTGCAGTTGTGGTACGATCAAAGGCGC 
AACTCAAATTGCGAGCTGTTAGCTCGCTGG 
(SEQ ID NO: 370) 

Exon: 1001.. 1410 
Exon: 1463. .1758 
Exon: 1815.. 2070 
Start ATG: 1057 

Transcript No. : CT12627 

AAAAACAAAAACAACTTTTTAGCCCAGrTGCAGCTGTTCAGAGTGCAATTTCTGCAATGCAGCACACGCTTATACGCTGCTTGGGCATGGCCCGG 
ATATCCCTGATGCGTTTGCAGCCCAGACCCACAGTGGCAGCCTCCGGGGGACAGGAAGCTGGGTCCATCTCGAAGCCAACACAACCGGTCAGTCG 
GTCTTTCGCCTCGCTGCCGCAGGAGCAAGACAAGAAGGAGCAGAATGCCAGAGAGAGCCTTAACCGCCTGCCTCGCCTAATGGACTTTCCAGAGA 
TCGTGTGGCCATCGGCGCTTAACTCGTTAAAAAATTGGATCACCATACAGTTCATAATTCGACCCTACTTCGACAGCGAGTTTCAACTCAAGGAC 
TTCATATACGGCGCCAAGCAAGCGCTGCAGGTAGTATCCTCAAAACTGATGGGCGGCGACTTAGACTCCCTGGACAATCTTGTCTCGCCCGAGGC 
GATCGCAGAGCTTAGACCGGTAATCCAAAAACTGTCAATGACGCAACGGAGGCAGCTAGAAATCAAGGAGAGCGACATATACCTCAGCTTTCCCT 
ATCAGGTAGGTATTATGTTTGACGATGCAAACGACAAGCTGCAGAAGCGTTTCGTAGAGATCACCATGGTGTTTCACGTAATGCGTGGTCTATCC 
GAGATGCGGGAACGCGGCGAGGAAATACCCTGGAACATGGGGACCCTTCCCGAGTACCAGGATAAGGTTTTCATCTGCAACTACCGTTTTGTAAA 
GGAGTTCACCGCTGGACACCAATCGGACTGGACCGTAAATGTGGCCAACCAGTTTAGGGCCATCGACCTTATTAACGAGACCATATAACATCGGC 
CCATGGCTACCATCATCCGCGCTGTTCGCCGACTCATAGCATATATTGGCGTAAGCCACCGCCTTCGTTGGAATAAAAGTTTCGTTTTCCGCATT 
TAGGTATAAAAA 
(SEQ ID NO: 371) 

Start ATG: 57 

MQHTLIRCLGIMRISLMRLQPRPTVAASGGQEAGSISKPTQPVSRSFASLPQEQDKKEQNARESLNRLPRLMDFPEIVWPSALNSLKNWITIQFI 
IRPYFDSEFQLKDFIYGAKQALQVVSSKLMGGDLDSLDNLVSPEAIAELRPVIQKLSMTQRRQLEIKESDIYLSFPYQVGIMFDDANDKLQKRFV 
EITMVFHVMRGLSEMRERGEEIPWNMGTLPEYQDKVFICWYRFVKEFTAGHQSDWTVNVANQFRAIDLINETI* 
(SEQ ID NO: 372) 



Celera Sequence No. : 142000013384479 

TCGTAATATAAGGTATTTTCATTGTTACCTTTATTACAAAAGTGGACTTAAAGGACATAAGTTTCCAAAATCATTTAGTATGCCTAAATTTGAGT 
TACATTTGTATCTTCACCCGTACATATGGTCACCAAACATTTGCAGATGAGTGTAAGCTGCGCAGTACTTTAATAGCGCCCCTCGTAGGCCTAAC 
TAAATGCGGCGATTACTTGGACGACCCGCGATCGCACCGCACTTAAGTATGGAGAAGCGCAAAAAGTGACCAAACCGGCTTGCCAACCGCTCTCT 
GCTTCTTGCCCATCCGCATGCGGCTCCAGAAACTCGGAGCGCAGCTTTCGCGAAGCCACCGTCGCGTCACGGCCGCCGTCGTGGTCGTCACACAT 
GCGCAGCAGCTGATGCGCGAGCGCTGACCTTGACCCTCAATTGGAGTCGCAGTCGCTTGCAGACTTGTTGCCAAGTGCAGCCGGCAAAGTTGAAG 
TTGCCACCCCACCCCGGGACCGCTCTCTCTTCCTCTTGCTCTCTGCCTGCTTTTTCGCACAGCCAGTTTTCACCATTAGCGGCGACTGTACCTCG 
CAGCACGCTGTACGTCCATACATACGTATGTATGCATTCACTTAGCGCGCGCGTATGTACGCATGCATGTGTGTGTGTGTTGTGTGTGCTGGTAC 
GCGAACTGCACACAGAAAAGAACAGGGCAGGAGAGCAAAGAAAGCGGCGTGAGCACATAGCGAAAAGAGAGCCCCCAGCCCCATATTCCATCCAG 
CCATCGCCATCAGCTGCCTGCGCCATGTGTTCCCGCAGCTGCAGTTGTGAAAATCGCCGGTAAACTGGTTGCGCGAGCGAGATGGAGAGAGTGCG 
AAAGCGGGCGATCGCTCTCTCCCGCTCAGGGGCGGAGATGCCTCAAATTGGCAGGGGGCCTAATGAAAGATGTTACCCTGGAAGACGCTGCAAGA 
ATTACCCCAAAGACATAACACATGGTGCGGGATTATTTTTTTTTTTGTCTCAAATCAGCAGCTCCAGCATTTTAATTTACAATTAAAAAACAATC 
ACAGTGATATTTGAATTTTCATTTCCTTCATCGTACGTAGGTTTCGGTCTCGCATCTTTTGTGTCCCGTCCGTCCAACTGAACTGAACTGAAGGT 
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GTGTGTGTGCCGTTGTCTTCTTCATCTGCCCACTGAGTGTTCTGAGTGCCGCTTGCTTGTCTGCTTCTTTAACCGCTTAGCTCATCCTCTCCAGC 
GTTGCGTTCCGTTTTCAAAGTCGTCGTGTATGTCTACCATTGTTGATGTTCTCTTCTTGTTATTTGTCTGCTTCTCTGTTCTTGTTTGCTGCCCC 
CTTTGGCTCCACAGCTATGAGCTACATAGCATGCTACCCCGCTTGAGAGATCTTGCCAGGCGTCTTGGCTGATGGGTGTGTGGTATGGTATGGAT 
GTGGATGGAGTTAATGGTATAGGGGTGGGATACTGGTTACTGTCTACAGATCCTGTCTTACAGTCTTAGATCCTGGATAATTCCCAAGGCTCGCC 
GGCAGCAGCTGGATGGAGTGGAGTTGAGTGGAGTCGAGTGGTAACGTGGCCTGATCTTCTAGCCGCCTGATTCTTCGCTTTTTCATTGGTTTTTA 
TAATTTTTATTTACGTTTTTCTTTCGGTTGTTTCTGGCTGTGGAACTCCTAATAGGATCATTTGCTTTAAATAGTATTCTTTAAGGTATTTTTCT 
TTATATAATCTTATATAGTTTCATTCCATTCGTTATAATGTAATTAGTAAATTCAAACACTTGCAGCATAATCATATAATCATATAGGTATAATT 
ATAGTAGAGTGGTTTGCTTGGTGATTTCTTTTGTTGTTGCTGTTGTGGTTTAGTCTCATCTTACAAATTGGGTCGTCGTGTCTTCTTCCTCATCT 
TCTACCTCATCCTCTTCCTCGTCGTCGGTGTCATCTGCTCTCTCGGCTGCACAGCTGCCGTTGGCTATCGTCTTCTGGTTCTGCTCCTCCGACCC 
CCGCTCGAGGAGCCTTTGCCCTCTTCCTCCTCCGTGATTCCTTTTCTGGCGGCTTTCCCTCGGAAAGCCAATACTTCTTGTCTTCTTTCAGTTCT 
CGGGACGTTGGCACCGCGCTGCAGGCTCGGCTTTTGGAATCAGCGATGGTTACTGGATGATGTTCAGGATTATGGATTCATATGCTGCATGTGTT 
AATGGAGGGATGTGGATGTGGTTGTGGGTTCTTTATTTTGAAGGGGTTGATCTTGACTTGGGTGCTGCTGCTATCGCTTCTAGATGTTGTTGCTG 
TTGGCTGGCTTTGTGAATGTTGTAGCAAATTGATTTTTCGTTTTTTTTTCATTTTTTTTCGTTCTCATCTTTTCTTCTGGTTGTTCTTCGTGTGT 
TGAATTATTACTCATATTCCTCCTTCGATGATTACATTTATGAGGTGCCACGGGTGGGTGGTGGCGCCTAGTTCCGTCCGCCCTTCTCGTCGCAC 
TCGCGCCTCAGATGTCCGCTCTTGCCGCAGCCGTAGCAAGTCTTCGAGGTCTCCGGGCAGTTCTTGGAGATGTGTCCGGTGCGGTTGCACTTGTA 
GCACGACACATTGGTCGGTCCACGTTCGTTGACGGCCTCTGGGCAGTTGCGCACCCAGTGTCCGGTCTTGTTGCAACGGTAGCAGGTCGGGTTGT 
CCGCCTGGGTGCAGTCCTTCGAGATGTGCCCGATGCCGTTGCAGCGGTAGCAGCGCTCGGCCTCCTCAGGGCAGGCACGTGCGAAGTGCCCAAAT 
TGGTTGCATTTGTAGCACTTCTCGCGGTTGCGACGCATGCCGCCGCCATCGTTGCCGCGCATACCGCCGCCGCCGCCACCACCGCCTCCTCCAAC 
GCCGCCCGGACCTCCGCCGCCGCCGAGACTGCAGTCCCGGGCAAAGTGGCCCGGCCGGTTGCACTTGTAGCACGTGGCAGACATCGACATCTTTG 
CTTCGGTTTTGGTTAGACTACTGATCCGCAAATCCGACTCCGCTTCTTAAACTCAACTTCTCAACTTCCTCACACACTCACTGGTAACCACGGGC 
CCTCGAGCACTACACACTGCTTGCGCTGGATGCGAAAAAATCACACAGACCAACAGGATCTCCGGCTTCCTCAGATTTCCTCGCCGTATTTTCGA 
CGCCAAAAGTTAAAATGGCTGCCGCTGAGCGAATGAGTACCGATTTGCTGTGGTATCGACTACTCGCCAGAGTTGCTTCGGCTACGCTTGAATAC 
AACACTGACCCTCCACGCGGTGCACATTGGTTCGAGGATATGCGATGGTGAGGCGAAATGTCAGTCTGGAATTGTGTAGTGTGCTGGTGTCGTAC 
TACTTCCACTCGGCGTGCTTTGTCTCACAGAATCTACATTTCCATTTTGCTTTCTCACTTGGCGTGTTTAGCTTTTGTCTCGTGTCCTCCATATT 
TAGGAGGCACTACATAACCCAATTTTGCGAACTCGATTTTGCCACTTTATAACCACACGTAACCAGTGTGTCCGTGGTGATGGACTGCAACTTAT 
C GGCT C AGT CGT T AGGAG T AC CC C T GGC AAAGGGC TAT C G AAAT AT AT AT CT TT AAAT T T AT AAG GAAGT C C ATT C AAAT AATT GTT T T A T T T T A 
ATTGTTTGGACACAACATACGTTCACCTTTTTGAAATTGGCTATTTGCAGAAATTGATATCGATTGACACATTAGGGGTGCTTAGCAAAGAATAG 
CTAACTGAAGACTTCATAAGAGAAAACGTAATACTCTAATAATAGAAAAGCACTCAGATTACGTAACAACTAATAAAATATTTTATGTGCATATA 
TATAAAACGATTTATCAAAAGAACCTGGCTTTGTCAAAAAATCGATATTTCATAGTGGTGTACATTTTTAAGGACTTTCACAAATAATACGCTGA 
GTACACTTATATTATTGTTATATTATCCCAAATATTGGGCAGCACGATTTTTTGAAATCTGGGTGACATCGATTACTATATGAAAAGGGTATTCC 
TGCTTATCAGTATTTGCAGAGAAAAGGAATTATGAATAATATTTACCTCAAAAGCTATAAATTAGATAAAGTCATAATAATTTACTTTACTATTT 
ATTAACTATATTAATT 
(SEQ ID NO: 373) 

Exon: 3101.. 1001 

Start ATG: 2817 (Reverse strand: CAT) 
Transcript No. : CT12687 

AGATCCTGTTGGTCTGTGTGATTTTTTCGCATCCAGCGCAAGCAGTGTGTAGTGCTCGAGGGCCCGTGGTTACCAGTGAGTGTGTGAGGAAGTTG 

AGAAGTTGAGTTTAAGAAGCGGAGTCGGATTTGCGGATCAGTAGTCTAACCAAAACCGAAGCAAAGATGTCGATGTCTGCCACGTGCTACAAGTG 

CAACCGGCCGGGCCACTTTGCCCGGGACTGCAGTCTCGGCGGCGGCGGAGGTCCGGGCGGCGTTGGAGGAGGCGGTGGTGGCGGCGGCGGCGGTA 

TGCGCGGCAACGATGGCGGCGGCATGCGTCGCAACCGCGAGAAGTGCTACAAATGCAACCAATTTGGGCACTTCGCACGTGCCTGCCCTGAGGAG 

GCCGAGCGCTGCTACCGCTGCAACGGCATCGGGCACATCTCGAAGGACTGCACCCAGGCGGACAACCCGACCTGCTACCGTTGCAACAAGACCGG 

ACACTGGGTGCGCAACTGCCCAGAGGCCGTCAACGAACGTGGACCGACCAATGTGTCGTGCTACAAGTGCAACCGCACCGGACACATCTCCAAGA 

ACTGCCCGGAGACCTCGAAGACTTGCTACGGCTGCGGCAAGAGCGGACATCTGAGGCGCGAGTGCGACGAGAAGGGCGGACGGAACTAGGCGCCA 

CCACCCACCCGT GGCACCT C ATAAAT GT AAT C ATCG AAGGAGGAAT ATGAGT AAT AAT T C AAC ACACGAAGAACAACCAG AAG AAAAG AT G AGAA 

CGAAAAAAAATGAAAAAAAAACGAAAAATCAATTTGCTACAACATTCACAAAGCCAGCCAACAGCAACAACATCTAGAAGCGATAGCAGCAGCAC 

CCAAGTCAAGATCAACCCCTTCAAAATAAAGAACCCACAACCACATCCACATCCCTCCATTAACACATGCAGCATATGAATCCATAATCCTGAAC 

ATCATCCAGTAACCATCGCTGATTCCAAAAGCCGAGCCTGCAGCGCGGTGCCAACGTCCCGAGAACTGAAAGAAGACAAGAAGTATTGGCTTTCC 

GAGGGAAAGCCGCCAGAAAAGGAATCACGGAGGAGGAAGAGGGCAAAGGCTCCTCGAGCGGGGGTCGGAGGAGCAGAACCAGAAGACGATAGCCA 

ACGGCAGCTGTGCAGCCGAGAGAGCAGATGACACCGACGACGAGGAAGAGGATGAGGTAGAAGATGAGGAAGAAGACACGACGACCCAATTTGTA 

AGATGAGACTAAACCACAACAGCAACAACAAAAGAAATCACCAAGCAAACCACTCTACTATAATTATACCTATATGATTATATGATTATGCTGCA 

AGTGTTTGAATTTACTAATTACATTATAACGAATGGAATGAAACTATATAAGATTATATAAAGAAAAATACCTTAAAGAATACTATTTAAAGCAA 

ATGATCCTATTAGGAGTTCCACAGCCAGAAACAACCGAAAGAAAAACGTAAATAAAAATTATAAAAACCAATGAAAAAGCGAAGAATCAGGCGGC 

TAGAAGATCAGGCCACGTTACCACTCGACTCCACTCAACTCCACTCCATCCAGCTGCTGCCGGCGAGCCTTGGGAATTATCCAGGATCTAAGACT 

GTAAGACAGGATCTGTAGACAGTAACCAGTATCCCACCCCTATACCATTAACTCCATCCACATCCATACCATACCACACACCCATCAGCCAAGAC 

GCCTGGCAAGATCTCTCAAGCGGGGTAGCATGCTATGTAGCTCATAGCTGTGGAGCCAAAGGGGGCAGCAAACAAGAACAGAGAAGCAGACAAAT 

AACAAGAAGAGAACATCAACAATGGTAGACATACACGACGACTTTGAAAACGGAACGCAACGCTGGAGAGGATGAGCTAAGCGGTTAAAGAAGCA 

GACAAGCAAGCGGCACTCAGAACACTCAGTGGGCAGATGAAGAAGACAACGGCACACACACACCTTCAGTTCAGTTCAGTTGGACGGACGGGACA 

CAAAAGATGCGAGACCGAAACCTACGTACGATGAAGGAAATGAAAATTCAAATATCACTGTGATTGTTTTTTAATTGTAAATTAAAATGCTGGAG 

CTGCTGATTTG 

(SEQ ID NO: 374) 

Start ATG: 285 (Reverse strand: CAT) 

MRGNDGGGMRRNREKCYKCNQFGHFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGPTNVSCYKCNRTGHISK 
NCPETSKTCYGCGKSGHLRRECDEKGGRN* 
(SEQ ID NO: 375) 

Classification: hypothetical 
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Celera Sequence No. : 142000013384774 

TTTCCTTTCTTTTTCGAAATTTAGCCTGCTCATTGAAATGAATTATTAACTTGCTTATTTTTGGCAAGCTAAATTATTGATCAGCTGGCCTTTGC 
AGCTATTATTTGTGTTTTACTTTTTTTTTTGCTAGCAAAACTATTGATTGAAAAACACAAGAGAGATATATGTTGCTTGTAGTTAACAAATGCGT 
TTACATTAAGACCGCTGAATAATTGATTTTGAAACTCAGGAGATTCATTGAGTTTTTTTTTCTCATTTAAATCCATAACAAATTGACTTTAAGTG 
CTGACATTAAGGCAACTGCATTATTGATTTTAAATAACTGGGGATCTCTTTGGACGAATATACACTGTAATAGGCAGATTTATTGCCGACTCACC 
GGCGCCTTAGAAATTGGAGTGTTTGCAACCTTCGATATATGCAAATTTCGCAGGAGAAAAAAAAAGGGAAAATATTCAAGCGAAGGCAGGCTGAC 
ATTGTTATGGGAGCATTTCTGACAGTTTCATTTCCTCTTTCTTTATTTTTTTTTTAATTTCTATGTAGCTTCCTACAACAAAATTCATTTTGTGC 
AAAACGAAAAGCGGAACGCAAATGTTATTCCTGGATAATCGCGTCATTAGAGGGCATTAAAACTGATTCGACGAAGCGGTGCTAACGAAGAAATG 
CAAATGTGTTCATTAGTTTCTTCTGTCGTGGGCATATGTGTCGGATACGTGATATAGCCAAATATATCACTCATACGCACTGGTGACAGCGCGTT 
GTTGATACGCAGTTGCAGTGTTGCAGTGACTACAACCCCTTAAATACTCAAAGATAATGGACAAATATCTGGCAAGTGTATTGCGGATCCATCAT 
GATTATATCCCTAATTTATAGTGCTATAAATTAAATGCTAATTACCTGTAAAGGGGCTGAGGCAGCCCAGCAGGCATAGGGCCAAAATCAGCTGG 
GCGAGATAGTCAGTTGGCAGCCAAAGCAACGACGTCATTGGTGGCGGCCACGCCCCCAGTCGACCGGCCGCCTTAAGGCGTTGCAAGTACGATTG 
TTGCTGTTGCTTTTGCTTTTGCTGCTGCATGTTTCGTATTGCTGCTGTTGTTGTTGTTGTGCAGCAGCCTCGTTGACGTCGCCATGTCGTTTGGG 
GCCCATTGATTGTTATTGCTTTGGTTGCTGCTGCTGTTTTGATTGGTGTTGCTGCCGCTGTTGTTGTTGCTGTTGCTGTTGCTGCTGCTGCTGCT 
GCCTTTGTCATTGTCAGGCGTTGATTTTGAACTTGGCGGCGGCGATGGCTGTGGGAGCTGCACTTGCAGCAAATGTTGCTGTTGCAGTTGCCTGG 
CTTACTTCTTATGTTTGCTGTTGCAATTATCCTTTTTGTACTGCTGCTTCTTCTTGGCCTGTCCTTGCACTCTGTGTGCGTGTGTGTTTCGGTGT 
GTGGAGAAAGGTGATATATTATTGGAAACTTCAATCAATTTCATGTTGTAAAACAGTGTCTTAAATGTACTAGTTACAATTCAGTTTTATATTAT 
TTGAATATGGTTGAAAAAAATATCCAAAACAAAGTGTTTGAATGAATGCAATTCCTTTTAGACTGATTTATTGATAGATCGCATAGTTGGCTGCA 
TTTATTAATTGTGTGCCTCAGAGATTTTTGATTGAATTTAGTTGATTTTGTTCGATATAAAATAAAATACCATGCTCTAGTGTTTACATTATATC 
AGCAAATTCTGCCCACTGTTCTGTTGGCCAAAAAGTTTCTCTGCATTGTGTCGACAGACAGTTTTTGGCAGCAATTTCACCTCTGCCTCTTTGTG 
TGTGTGTGTGTGTGTTTTTGCGTTTGATGGTGGGCATTTTGCAGGGGGTGGTTGCAGGGGGCGGCGGATCAGCTGTTCGGCAGTTCCGTGTTTGG 
CAGCCGAATTGCTTGTTTACAAAGCTTCTTGGGGCATTTTCATTTTCATTTTTTTTTGCCATATTTTTTGGCTCTTTGCGGCTGCCAAAAACAGA 
AAAGTGCACACACACACACACACCCACATACATCCAGTGCTTAATTCGAATCGTGTAATTTATTATTTGCATGTCCTGTGCGCCAAAAGAGAGCG 
ACAAGTTCGAGTTTCACTTCACGCCGTTTTATGGATTTTCTTACAAACACACACTCATACACACACACACACGCACACACAGGGGCACACGCGTA 
CCCGCATTGATTTTCCCTTTTCTATTTCGTTTTTTTTTTGTTTTTTTTTTTTTGCTATTTTTGCTCTTTTCAAGGCTCTGCrACTTACATCTCAC 
GTAAATT GTTT AACTTTGACACC AAC AAAA TTTCGC AC GC AAGCAC TAAAAATACTTGAAAAAAAAAAAT GAAGGGAAAT GTGAAAACAAAAC AC 
AGCACGAATTTCCCAGCCAGCGAGCAGCAGTGGCAAGCAGCTAGCCTACACACCAGCCCACCGCGCCCCCTTCCCACGATTTTCCCATTTTCCGA 
GTCCTTTTTACAACCGACTGTTTGCTGATTGCTTTTTGCCCACCGTCTTTTGTGTTGCCTTTTGTTGATTTTGTTGATTTTTGCTGCTGCCACTA 
CGCTGCTGCTGCTGCTGCTGCTGCTGCTGCAACTGCAACTGCAAGTGTGACGAACCACTCTCTCACTCCCAGTACGCTCTATGCAATCCGGGCAA 
GATTCCGATTCCGATTCCGATTCGGATTGCGATTCCGATACCGATTCCGATTCCGGTTCTGATTCCGGCCAGAGCCACCGACTGCAGCGTGGGAG 
AGTCAAGAGCCAACTGACGGCGCGTCGACGAGGCTGCGCCCCAACAAATGTTGCTGGCGTTGGGCGGTAGCAACATCGGCTCGCCACTATAGCAA 
CATTGACGTTCCAACAGCAACATTTCCGTGGACAACACAACGTGTTGTAAGCTTTTGCTCGATTCGAAAGGGTTCATAGCTTTTGGGAAACATTC 
TTAGTGTTCTTTCGAACCAGCCAAGAGCTTTTTAGTTCTGAATTAGACACATGATTTGATTCCGGTTTTAAGGAGGGCTTGAGCTTTTGCAAAAG 
CTTAATAGCGTAGCTCAGAAACACAGCACCGGAAATAACGTTAGCTTTCAAAGAGAAAAAATCCCGCCAAGGATCACATTTAAATTAAATAAAAA 
AGTGAATCTTATGGCTGACTTATGAGTAATAATTCAACAATACAATTTATGTACATGTAATGAAGAATTTTTTATATAAATGGAATTATTTCAAC 
TATAAATTATAACACTAAAAACTGAAAAATGAAGGGAATTCCGAAATTTAAAAATTATTTTGCAAACTTATAAAGATTAATATTTTCAAAATATT 
CAGCTTTACTCAGCTCTTTGTCCCAATAAAAGCGTACAGAACATTTTTTAATAAGTAGTTTGGAAATGATTTTATTTCAATATATTTCATATTCA 
TTCATTAGTTCATATTTTCTGTTTTATACTTTTACACAAAAATACTTGTGTGTGTGTGTGTGTGTGAGTGTGTCCGTGTGTTCGTGTTTGTCTCG 
TAGGTTCTTCTATTGGTATTGGTGTGTCCTTGAGTAGTCCTTTGCTATCTCGGTCTGTCTGTCTCTGTGTGTTTATTTTTTTTCTGTATGTATTG 
TAGTTGTCTTATATGCCATTAATCTGAGTTGGACATAACTAAGTGTTAAATACAAGTTGAGTTATCTCATTTAAATATTTGTTTACCATTTAAAT 
GCATTTTCGGAAAAGTTTCATTGAATTTCTCGCTTCTTAGCTGATTTATTAGTTGTATACCCATGTTGACTTTTCTGTGTGGGTGACTCAGCGAT 
ATTTGATTCATTTATTTAGGGGGTGCCAAGAGCTTTTTACATCTAAATCTTTTTAGGCTTGTATATAGGTGTGTATGTATATATATAT 
(SEQ ID NO: 376) 

Exon: 2927.. 2621 
Exon: 2334. .2274 
Exon: 1401.. 1001 

Start ATG: 2927 (Reverse strand: CAT) 
Transcript No. : CT12689 

ATGAACCCTTTCGAATCGAGCAAAAGCTTACAACACGTTGTGTTGTCCACGGAAATGTTGCTGTTGGAACGTCAATGTTGCTATAGTGGCGAGCC 

GATGTTGCTACCGCCCAACGCCAGCAACATTTGTTGGGGCGCAGCCTCGTCGACGCGCCGTCAGTTGGCTCTTGACTCTCCCACGCTGCAGTCGG 

TGGCTCTGGCCGGAATCAGAACCGGAATCGGAATCGGTATCGGAATCGCAATCCGAATCGGAATCGGAATCGGAATCTTGCCCGGATTGCATAGA 

GCGTACTGGGAGTGAGAGAGTGTATTTTTAGTGCTTGCGTGCGAAATTTTGTTGGTGTCAAAGTTAAACAATTTACGTGAGATAGTGCAAGGACA 

GGCCAAGAAGAAGCAGCAGTACAAAAAGGATAATTGCAACAGCAAACATAAGAAGTAAGCCAGGCAACTGCAACAGCAACATTTGCTGCAAGTGC 

AGCTCCCACAGCCATCGCCGCCGCCAAGTTCAAAATCAACGCCTGACAATGACAAAGGCAGCAGCAGCAGCAGCAACAGCAACAGCAACAACAAC 

AGCGGCAGCAACACCAATCAAAACAGCAGCAGCAACCAAAGCAATAACAATCAATGGGCCCCAAACGACATGGCGACGTCAACGAGGCTGCTGCA 

CAACAACAACAACAGCAGCAATACGAAACATGCAGCAGCAAAAGCAAAAGCAACAGCAACAATCGTACTTGCAACGCCTTAAGGCGGCCGGTCGA 

CTGGGGGCG 

(SEQ ID NO: 377) 

Start ATG: 1 (Reverse strand: CAT) 

MNPFESSKSLQHVVLSTEMLLLERQCCYSGEPMLLPPNASNICWGAASSTRRQLALDSPTLQSVALAGIRTGIGIGIGIAIRIGIGIGILPGLHR 
AYWE* 

(SEQ ID NO: 378) 



Celera Sequence No. : 142000013384832 
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CCGACATCAGCCCCATCAACGACCGCGAGGAGCTGACCGAGGAGGTGATGCGCTACCTGCCCTACCTGGAGGTGAATCCATCGAGCGACGGGCTC 
ACCCTTAAGGTGGAGTCGTCCAGCCTGCTGGGAAAACCGCTAAACGAGCCCGTCTTCGACTCCGAAGACAACATTGTGAACGACGCCAACTTGCA 
CTCGGCCAGCCATCAAATACCTCCATATGTGCCTGACAGCCACGACTGCTTCGCGGAGGACTGCGGCGGCGACAGCTCCTCGCACCAGGTGGAAT 
TCGAGGTGGTGCGGCCCCAGACGGTGACCATGACCATGACCTGCACGCTGCCCTACGGCGGACCGGACGCTGGGCACACCACATTCCAGGCCGAC 
GACTTCAACGCGATCCCGTCGGCCGCCGAGGACAGCGAATGCAGCATTCTGACCACCTCCAACTCGCCGCAGATTGGCTTCAACGGGAGCAGCTT 
CGTGGAGGCGGACGCCATCGGGAGCACTTGTACATACGCGCAACAAGACTACACCGGAAGTGTAATTGAAACACACAATGATCTCAACTACGCCG 
CACACGATAACAACGGAGCTCTGCTAGCCTACACATTTGAGGACTTGCCGCCTCAGCCAACCGGTAGTCATTTAGAGTTTAACACTAACAAATAC 
GAGTTTGCAAGTTATTATAAAATGTGAGAGATATTTAAGTTTGTGTAATTTATAACTAATCTAATTGTACGTTTATAATTTTACACCAAATTATA 
TATTAAATATGTATATGTATGCAGATAATACCATATATTCCATAAATTCTATATACATACAAAAAACGCGAGAATAAArGCTTTGCATTGAAAAT 
ATTTACGTCGGAGCAGTCTCATTTACTGCGTAGTGGAATGATCGCTGATCGCTGGTTCCTGCTATGCGCGACGAATGGGAAGGAGATTAACCTTG 
CGGCGATCAGCTAAGTGCTCCCAAATCCATTCAATAGGATCTCCAGCGGCTCTCGCGAATTATCTACATATGTTTATTTACACGTTACACAAATA 
TGTTATATAAATCATTGGTGGCCAACAAAAGGGGATACAGAATGAGGCTAAACCATCATTAGTTGTTGCACACGATAAAATAATCCAGACAGTAA 
GGTAGTATACATTCGTTTCTCTTTCCGCGGCCGATTTCCTGGCTCTCGACGAGCGCTGATTACAAGGTGCTCGCCTCGATATGTGGGATATTGTG 
TAGGCCGGAGTCGGGATTGGACCAGAAGTAGTAGGGGGCGCCCTGGCTGAAGCAGAACTTCATGTTTAGTGTCTCCCCGTGATCCACGTAGTACA 
TGAGATAGAAGTTGCACATCTCGTCCTCGTTCGTGGGACTGTCATCAATAAATAGTTGGATTAGGAGTGGTGCTAGAGCTGCGTTTCAAATCACT 
GCGTACTTACCCTATTTTGGTAGTCCGATGGCGAGTGCTCTGCATAGTACACCTCACGGCGATCTTATCTCCCTCGATTATGGGGTCTGTGTTGC 
TGGTGTTATAGAACATCTGGGGCGTGAGGGGATCTCTCTTGCCCAGCTGCAGCCACTCCTGTTCGCCGTCGCTGTTTGTCCTCACCCGGTAGCCG 
GAAACGACCTTTCCCAGGCCGTGGGTGTGCACCCGGTACGCAAAAGGATGCAGCACCTTCTGCTCGTTCACCTCGCAGGCCGTTTCCAGGTGCTC 
CGTCTTCATCGCCGGAATCTGTCCGTCAGTGCCCAGCAGCAGAGTGCCAGCCAGCTTTTTCCGACTGCCAGGATGGTACTGATTATTTTGAAAAT 
CACGCATATTAAACGAAAGGAACTTACGGCTCTTCTGTGTAATCCAAAAACACACCAGAATCATCAGTGGAGCCATCTGAAGAAAAACACATTTA 
AATTTCCTTATATTGAACGCAATCTATTACTTACCTTTGAACTTATCAATGTGCGCGTAGTGAACTTGCAGCACAAGGTACTTGATTGGCGAGTT 
CTTGCCCACCTTGAAACCCACTCCCTCGGGCAGATTTAACTTTTGGGCGTCTCTGGCCCAAGCGTATACGATCTGTTGAGAGGTAAGTGTTAGTC 
CAGTAGGTGAAATTGTCTATTAGCACATTTGGCAGTTAGTTAAAGGAAAAACTCGATAAACAGGTTCAAAACTCTTATTAAACTGTGCTGACTGT 
CAAGGATTGACCTATTAAAACTTAGCCAGTTAACTCCTAATTATCCGCGCATTTCGATCTTTATTCATATGCCACATCAATCGTTTTGAAATGTT 
ACTATATATGAGTTCGATTTTTGTGTCAAGAACAAGACAAAATTTGACTTTAAAACTATCTTTCCGATTTTTCCTTAAAATTTATAATTTGTCTA 
TCAGAAAGCTGCCTAGTATGTCCGCTTCAATTTGCTGCAGATGCAACGAGAAGATTTGGCCACGGGCAGTTTGCAGCTTGGGTAAGACGTACCAT 
CCGCACCACTTCACCTGCAAGGAGTGCGGCCTGGTGGTTGATCCTAAACTGTTTTTCGCGGTGGACGACGACGTGGTCTGCAGCGAGTGCTATCT 
GGACAAGCATGCCGCTCGGTGCTCCGCCTGCCGGACTCCGATCCTAGAGCGCGGCGTGGCTGCTGCGGAGCGCAAGTGGCACGAAAAGTGTTTCC 
GGTGCGTGAGCTGCAGCAAGTCGCTGGTCTCGGCGAGCTTCTTTGAAGTCAACGGATACCTGTTCTGCAAGGCGCACTTCCGGGAACTGTTTTCG 
TCCCGCTGCGCAGGCTGTGAGAAGCCAATCGATCGTCGAGCCGTGGTCGCGCTGAGCACCAAGTGGCATGCCAAGTGCTTTAAGTGCCACCACTG 
CCGCAAAAGGATCAGCGCACGGGAATTCTGGATCGAGAACGGTCAACCCATTTGCGCAGCCTGCCAAACTGTAGTCCCGAGTCCCCGAAATTTGT 
CGGCCCAATAAATGTGTGGTGTCTATCTGGGGGCCTATTTGGACATTTCTTTTTCTTTTAATACCTGGGAATTGGAGTGGGGTCCGCAAGGACTG 
GC AGACTCTTCTTGGGAAGCTCGGTTCATCTCGCCACAGTTCCTGTGAGCAAAAATGTGCGTGTTATTT ACT AGTCTGAAAACATCACTTGCAAC 
TCACCAGGTGGTCTTCGAGGTTCCGGGCTCTCCGCATCCGTAGAGCAGCATATGGTGGGCCGTGTTCATCGTGGCATTAGGATTGAAGCCAACTA 
TTTAAGATAGAAAGGTTAGTTGAAGTATTTAAGGAATGAATCTCATTCGAAAGGACGTACCAATATAGTAGGTGGTAGTTGGGTCGACCTTGATG 
GGCGTGCACAAGTACAGATCGGGCTATATATGTACACATATATATATTTGTTAGTTTTGTTAGAGAGAACAGGCATATTGGATAAGTCAACAAGT 
ATTATTCAAATGGGAGTATGGCCTTCAGTGGGCCGTTTGCCCTTTTTCGCCACCCTCGTAAAAACAAATTCGTATCTACGGCGCAGGTTCGAGAA 
GTGCAATCTACAGGTTTCGGTTTCAGGGCCTTACATGTGGGTGGCGAATTGTGTGTATTCGGGTACAACTGAACATATGGAAAAATTTGCACAGT 
ACTCTTATGGGTTTTCCCCGCGCGGAAGATACGGGAAAAACCAGAGCCTATATCGAGGACTTTTCTAATCTGATTGGAATTGTATGCGCACAGCT 
GCAGCTGGGCGAATCTGTTCATCCATATGGAGCGAACTCCATGATGTATATGTATGGCGTTTTTCAGGAACCTGGACTTACGGTCTGGGGCGAAA 
CGTTGGGCATCAGGAATGGAAACGAAGCCGTTGCGCCTGTTGCGGAGTTCGACTCGAGATTCTGTTGATAAAGGGAGTTTTGGTAATCCCCCTCT 
TTCACAAGGCCATCCACACTGATCACTCCGATAAGCAGGAGCAGCCCCACGGAAGCGGCTATTTCGGATATGCGTGGCATTTTCACCGTATTCGA 
ACTATATTCGAGTGTGCACTGCACTGAATACTTTGTGGAAAACGAAAAAAAACCTTCCTGTTTGGAAGAACAATACGTTTTGTATTGCTCTCTCC 
GATATTTTTCCACACGTAGCTAGGCCCAGTTTATATGTAAGTAAGTGCGTTCTAGGCAGCAATCGATACCCAGAGCCATTATATCGATGACGGCG 
CCAGGGCTGCAGGCTAAGGGTTAGCGGGACATTCAAATATATATAACTTGTTCGCATTAAAGTACATAAAACCAAGTGTAATTTTCAACCTTTCG 
AATACAGTGATTTAAAGACACCTCAACGCAAAAGGAAACCCCACAAAAGCCAGCCCAATTCAACGTGTCTGCGTTCCGTTCGACCCACTGGGGAA 
TTCCCCTTTTCGCATGCGCCATTCAATTGATTTTCTACGAGCAGAGTTCCCAATGTTTACGGTGCGCTCATTTGGTTGTAGCTGTAGGCCTCGAG 
TTGGAACCGCTGGGACTTGGAAAACATACGACGCTTAAATGGAGGATCAGGGCGACGTGACAGACCAAGCCGCTGGTAAGCTGAGTAGTTCGAAT 
GCCCAGTGCTCCAGAAACCGATGTGCTAATGGTAAAGCTATATAGATGTACACTCAAGAGCGGTACTAAATTATTTACAAAAAATCGTGTAATTC 
CCAGATGACCCCATTATGGGCAGCCACAGTCAGAGTCGCCAGAACTCGTCCGGCATGCTGAACATGATGGACGCCAGCTCCTTCAGCATGCCCCC 
CCCCAATCTGCACTCCTCAAAGGTGATCAACCCGAACAAGGCGATATTCACAATCGACGCGAACACAGGGCAAATTTTTATAGTGAACAACAAGG 
CCTGCCAGCTGCTGGGCTACACGTCGCAGGAGTTGAGGAACAAGGGATTCTTCGACCTGCTTAACGGCAAGACAGAGAGCCACATCTCCTCCCTG 
GCGGAGATGCAGATCGAAGGCGATGAGGGCCGGGTCGTCCTGCTAAGTGGAAAGGTCATCGAGATGAAGACCAAGTCGGGCGGCAAGATCCTGGT 
CTCTCTCTGGATCCGGCAGATCAGCAGCGATGGCCGACACATAGCCGTCGCGGAGCCTGTGGAGAGGCATATTTG 
(SEQ ID NO: 379) 



Exon: 


4110 


.3787 


Exon: 


3348 


.3291 


Exon : 


3227 


.3140 


Exon: 


3082. 


.3010 


Exon: 


2067 


.1935 


Exon: 


1881 


.1833 


Exon: 


1774 


.1436 


Exon: 


1368 


.1001 


Start 


ATG: 


3975 (Reverse strand: CAT) 



Transcript No. : CT12811 

GCCTAGCTACGTGTGGAAAAATATCGGAGAGAGCAATACAAAACGTATTGTTCTTCCAAACAGGAAGGTTTTTTTTCGTTTTCCACAAAGTATTC 
AGTGCAGTGCACACTCGAATATAGTTCGAATACGGTGAAAATGCCACGCATATCCGAAATAGCCGCTTCCGTGGGGCTGCTCCTGCTTATCGGAG 
TGATCAGTGTGGATGGCCTTGTGAAAGAGGGGGATTACCAAAACTCCCTTTATCAACAGAATCTCGAGTCGAACTCCGCAACAGGCGCAACGGCT 
TCGTTTCCATTCCTGATGCCCAACGTTTCGCCCCAGACCCCCGATCTGTACTTGTGCACGCCCATCAAGGTCGACCCAACTACCACCTACTATAT 
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TGTTGGCTTCAATCCTAATGCCACGATGAACACGGCCCACCATATGCTGCTCTACGGATGCGGAGAGCCCGGAACCTCGAAGACCACCTGGAACT 
GTGGCGAGATGAACCGAGCTTCCCAAGAAGAGTCTGCCAGTCCTTGCGGACCCCACTCCAATTCCCAGATCGTATACGCTTGGGCCAGAGACGCC 
CAAAAGTTAAATCTGCCCGAGGGAGTGGGTTTCAAGGTGGGCAAGAACTCGCCAATCAAGTACCTTGTGCTGCAAGTTCACTACGCGCACATTGA 
TAAGTTCAAAGATGGCTCCACTGATGATTCTGGTGTGTTTTTGGATTACACAGAAGAGCCTCGGAAAAAGCTGGCTGGCACTCTGCTGCTGGGCA 
CTGACGGACAGATTCCGGCGATGAAGACGGAGCACCTGGAAACGGCCTGCGAGGTGAACGAGCAGAAGGTGCTGCATCCTTTTGCGTACCGGGTG 
CACACCCACGGCCTGGGAAAGGTCGTTTCCGGCTACCGGGTGAGGACAAACAGCGACGGCGAACAGGAGTGGCTGCAGCTGGGCAAGAGAGATCC 
CCTCACGCCCCAGATGTTCTATAACACCAGCAACACAGACCCCATAATCGAGGGAGATAAGATCGCCGTGAGGTGTACTATGCAGAGCACTCGCC 
ATCGGACTACCAAAATAGGTCCCACGAACGAGGACGAGATGTGCAACTTCTATCTCATGTACTACGTGGATCACGGGGAGACACTAAACATGAAG 
TTCTGCTTCAGCCAGGGCGCCCCCTACTACTTCTGGTCCAATCCCGACTCCGGCCTACACAATATCCCACATATCGAGGCGAGCACCTTGTAATC 
AGCGCTCGTCGAGAGCCAGGAAATCGGCCGCGGAAAGAGAAACGAATGTATACTACCTTACTGTCTGGATTATTTTATCGTGTGCAACAACTAAT 
GATGGTTTAGCCTCATTCTGTATCCCCTTTTGTTGGCCACCAATGATTTATATAACATATTTGTGTAACGTGTAAATAAACATATGTAGATAATT 

CGCGAGA 

(SEQ ID NO: 380) 

Start ATG: 136 (Reverse strand: CAT) 

MPRISEIAASVGLLLLIGVISVDGLVKEGDYQNSLYQQNLESNSATGATASFPFLMPNVSPQTPDLYLCTPIKVDPTTTYYIVGFNPNATMNTAH 
HMLLYGCGEPGTSKTTWNCGEMNRASQEESASPCGPHSNSQIVYAWARDAQKLNLPEGVGFKVGKNSPIKYLVLQVHYAHIDKFKDGSTDDSGVF 
LDYTEEPRKKLAGTLLLGTDGQIPAMKTEHLETACEVNEQKVLHPFAYRVHTHGLGKWSGYRVRTNSDGEQEWLQLGKRDPLTPQMFYNTSNTD 
PIIEGDKIAVRCTMQSTRHRTTKIGPTNEDEMCNFYLMYYVDHGETLNMKFCFSQGAPYYFWSNPDSGLHNIPHIEASTL* 

(SEQ ID NO: 381) 

Classification: enzyme 
Gene Symbol: Phm 
FlyBase ID: FBgn0019948 



Celera Sequence No. : 142000013384822 

ATCGCTTTGAAGCGGCTAATTGAATTCCAGATTAAAGTGTAAAGCAAAATGGGGGGGAATAAAAAAAACTGATTTGTATTGCAAATTAAAATTTA 

AGTGAAAACGAATCAGGCGCAAACACAAACAGGGACAATAGATTACAATATCCTGCGGCGCAGACAGCACGACATTAAAAAGGCACGAGGAGCAG 

CGAGAGCTGGCGCGGAATGGGTATGGACATGGCAATGGAGGTGGAAGGGACGCGCCAGGACGACACAGGACCTTCGTGCAGCTGCTGCTCCTGCT 

GATGGCACATGCGCCTGGGAGGGCTGCGTTTCGGGATTGGGACTGGGATGCGGGCTGGACTCCAGCGATGGCTGCCTGCCATCAGCGGCAGAAGC 

AGCAGCAGCAGCAGCAGCTACAGCAGCAGCAGCAGCAGCAGCAGCAACGGCAACGGCAGCAGCAGCAACTACGGCAGCAACATCGGCAGCAGCAG 

CAACACCAACAATATCCCAGGCTTGCGACCGTCAGGCCAACGGTTTCCGCTTGCTTACTGCTGGAACGGAAAAAGCCAAGGCACATATATCCATG 

CACATCCACAGGCACATTCGGTCGGAGCCACATCCATGGGATAGTAGATACCGAACGCTGAGGGCCAATCCGAGAGCAGCAGGTAGGCGTGGACG 

TATCTACCATTTCCAAACAACTTCAACCCCGCCACAGGCAGCTGCCGTTGGTCGGGCCGCTCGCCACCCTCGTCGCCCTCCTCCGCACTTCTCGG 

CCGGTTCAAAAACGGGTCGCCCTGCCAGCAGTTGCAGTCAGTTATTAACGAGCGCTACTCCCAACAGCAACATCGCTCAGCGGCAGCAGCAGCAG 

CAGCAAAAGCAAAAGCAAAAGCAACACTAACAACAGCAACACGAGTCACTCAAACGTGGAAGCGAGACAGGAATATCAGCACTCCACTCGCCTCG 

TTTCACTTCGATTCTCAAGCGTTTGAGTCGGAGAGCGGCGAACGGCCAACGAATTCAGTTTGTGTCGAGACGTTGTCGGTGGCACGTCGCATCCA 

AAAGAACGCTCAGTCAGCTGGGAAATATACTGAGAAAAGATACATTCGTAGATACAAGTGCTAAGCCAAAGTGAACCGTGCAGTTCGCGCAACTA 

AACAATTTTAAGCCAAATAAAACTACACAAGGCCAACAAAGACAGTATAATGTCTTCCACATCGGCCTCACCCATCAGCAACATAACCGTGGATG 

ACGAGCTCAACTTAAGCAGAGAACAAGGTGGGTTCAAGTTCAAAACTGGCTCAGAACAACTTGGCAAATGCCGACGGCAAAGCGCATTACTCTCT 

AGCACTTTTTGACTAGTTTCAAAAAATGTTCCATTAATATTTGAAAGTAGTCTCACATAGCATTGCATTTATATGAATTGAAAACTAACCGTGGG 

CATTTCTATTCCTTCCTTTTTAGACTTTGCTGAAGAGGATTTCATAGTGATCAAGGAGGAGCGCGAGACAAGTCTCTCCCCCATGCTGACGCCCC 

CGCACACGCCCACCGAGGAGCCGCTGAGGAGAGTGCATCCGGCGATAAGCGAGGAGGCGGTGGCCACCCAGCTGCACATGCGACACATGGCCCAC 

TACCAGCAGCAGCAACAACAGCAGCAACAGCAGCAGCAGCACCGCCTGTGGCTGCAGATGCAGCAGCAGCAGCAACAACATCAGGCTCCACAGCA 

GTATCCAGTTTATCCCACAGCCAGCGCCGATCCCGTGGCCGTGCACCAGCAGTTGATGAACCACTGGATCCGCAACGCAGCCATCTACCAGCAGC 

AACAGCAGCAGCAGCAACATCCGCACCATCATCACCACCACGGCCATCCGCACCACCCTCACCCACATCCGCATCATGTGCGTCCCTATCCCGCT 

GGCCTCCATAGTCTGCATGCCGCGGTCATGGGTCGCCACTTCGGAGCCATGCCCACCCTGAAACTGGGTGGTGCCGGTGGAGCGAGTGGTGTACC 

CAGCGGCGCAACTGGCAGCAGTCGGCCAAAGAAGCAGTTCATCTGCAAGTACTGCAACCGGCAGTTCACCAAGTCGTACAATCTGCTCATCCATG 

AGAGAACCCACACGGACGAGAGGCCTTACTCCTGCGACATCTGCGGCAAAGCCTTCCGGCGACAGGATCATCTGCGGGACCACCGGTACATCCAC 

TCCAAGGACAAGCCCTTCAAGTGCAGCGATTGCGGCAAGGGTTTCTGCCAGTCGCGCACCCTGGCCGTGCACAAGGTCACCCATCTGGAGGAGGG 

TCCGCACAAGTGTCCCATCTGCCAGCGGAGCTTCAACCAGCGGGCCAACCTCAAGAGTCACCTCCAGAGCCACAGCGAGCAGAGCACCAAGGAGG 

TGGTGGTGACCACCTCACCAGCCACTTCACATTCGGTGCCAAACCAGGCATTGAGTTCGCCTCAACCTGAGAATTTGGCACAGCATCTGCCTGTC 

CTGGATCTATCCTCGTCATCCTCGAGCTCAGAGAAGCCCAAGCGGATGCTGGGCTTCACCATCGATGAGATCATGAGCAGATAGATTGAAGGACC 

AGCGAAACCCGGTATCCGGTTTCTTTTGAGACATTCTAAAAGAGATTCAGAGAGCTGGACGTCCGCCAGGCATGGGAAATGCCTGGAATGGAAAA 

CCAAGCGAGTCCAGCCCGCATTCTGGCCAAAGGTGCAAGTGGAAAATCCTCGCCAGCAACTCGTAATCGCAACCCAGTGAGCAGAAGAGTCATCC 

TCCGTGCAATCGGTTCTATGTAGCAGATCCCAATTGCGTTAAACAAATGCCAAAGTCAAATAGCAGAGGAAAACTGATGAAAAATCCAATGAAAA 

GTATTGCTAGCACAAGCCTCCTACAAATCGATTCCAAAACACACACACACACACACAACTGTAAACTATAAACTTTAATAAATAGCACATAATTT 

ATATTGTAATCTAACTATTATTGTATGTACCTTATAGTCGTAGTAGCCCATTCAGTTCGTAGTGCTTATGGAGTAAACCAAATGTTATGTTACAA 

TAGCTCAATCAAACAAACCTCAAACCCGCTTTGGTTATTGAAAGACTGTTTTAATTATTATATTTAAATAGTACTTGAATAAATCAATCGATCGA 

TTGTGTAATGTTAAACCCCGACGAGTGTCATGATTACTTACTGTTTAGACTTAACAACTTAATTGTCGCTATAGCAATTGTGACTATGCACAACG 

ATCTCATTGTCGTCTATATGTATGTCTGAAACATTCGTATTGATATGAAGTTATCACAAAAATACGTAAATATAAACGCAAAGTAATTTTCAACT 

GAAATTTAATAAGACTTTTTATGAATACGGTGGATTTTGTGTTTCCAGGTATTTGGCAATGGCAATTTCGCCTTGATTTATCATTACTTCTACTG 

GCAACTTGATAATCTTTATGGTGTACTAAGTTAAAGTTATCTGAAAAGATTTAAGCAGCAATTTTTTGATTACCTTCTACCACCTACTGACTGAT 

TCAAGTTCCAAGCGGACTGAATGATTTTGATCTACTTTTAATCACTTTGAAATCTTTCTTATCCGCTGACTAAATCCCACGCCATTCCGATTTGG 

GAATTTTCTCATGCTGCGAATATAACTCAGTAGACTGAACTCGTAGGACCATATAAGTTGAACCATCAAACAGCAGCTGTCAAAATTTATTAACC 

ATCCGAATATAACTCAGTTTTTATGCAGATCTCGATTTGCGCATTTTTAAGCAGCTCTATTATTTTTCAGGTTGCCGATTAAGTAAGCAGAAGTC 

TCGCCACCAAAAATATTCAAACTGTCGCAGAAGTTAATTTATGAAATCGAATCGAATTGAATTTGATGTCAGTTTTGCTATGATTTTGTCATGAA 

TCGGTTTCGATTTCGGTAACACTATGGTTATTGTTTAACTTTGAGGTGTGGTCCAAATATTTATGGCCTGTGTTTCAGTTTTGGGTTTAGTTCAG 
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TTTTCATTGTGGCTTCTCTTGAAAATCGGGAACGTGCCAAATGCCTTGCAAAAGGTATGTTTGATTTCCTTTAATGGTGGTTTTCTTTTCTAGTT 
TCTTCCATTGGCAATTGCATTTCTCTTAGCACGCTTCTTACTCAGAACTGTCAATTTCAGAT 
(SEQ ID NO: 382) 

Exon: 1001.. 1262 
Exon: 1449.. 3147 
Start ATG: 1190 

Transcript No. : CT128 67 

GAATTCAGTTTGTGTCGAGACGTTGTCGGTGGCACGTCGCATCCAAAAGAACGCTCAGTCAGCTGGGAAATATACTGAGAAAAGATACATTCGTA 

GATACAAGTGCTAAGCCAAAGTGAACCGTGCAGTTCGCGCAACTAAAC7AATTTTAAGCCAAATAAAACTACACAAGGCCAACAAAGACAGTATAA 

TGTCTTCCACATCGGCCTCACCCATCAGCAACATAACCGTGGATGACGAGCTCAACTTAAGCAGAGAACAAGACTTTGCTGAAGAGGATTTCATA 

GTGATCAAGGAGGAGCGCGAGACAAGTCTCTCCCCCATGCTGACGCCCCCGCACACGCCCACCGAGGAGCCGCTGAGGAGAGTGCATCCGGCGAT 

AAGCGAGGAGGCGGTGGCCACCCAGCTGCACATGCGACACATGGCCCACTACCAGCAGCAGCAACAACAGCAGCAACAGCAGCAGCAGCACCGCC 

TGTGGCTGCAGATGCAGCAGCAGCAGCAAC7AACATCAGGCTCCACAGCAGTATCCAGTTTATCCCACAGCCAGCGCCGATCCCGTGGCCGTGCAC 

CAGCAGTTGATGAACCACTGGATCCGCAACGCAGCCATCTACCAGCAGCAACAGCAGCAGCAGCAACATCCGCACCATCATCACCACCACGGCCA 

TCCGCACCACCCTCACCCACATCCGCATCATGTGCGTCCCTATCCCGCTGGCCTCCATAGTCTGCATGCCGCGGTCATGGGTCGCCACTTCGGAG 

CCATGCCCACCCTGAAACTGGGTGGTGCCGGTGGAGCGAGTGGTGTACCCAGCGGCGCAACTGGCAGCAGTCGGCCAAAGAAGCAGTTCATCTGC 

AAGTACTGCAACCGGCAGTTCACCAAGTCGTACAATCTGCTCATCCATGAGAGAACCCACACGGACGAGAGGCCTTACTCCTGCGACATCTGCGG 

CAAAGCCTTCCGGCGACAGGATCATCTGCGGGACCACCGGTACATCCACTCCAAGGACAAGCCCTTCAAGTGCAGCGATTGCGGCAAGGGTTTCT 

GCCAGTCGCGCACCCTGGCCGTGCACAAGGTCACCCATCTGGAGGAGGGTCCGCACAAGTGTCCCATCTGCCAGCGGAGCTTCAACCAGCGGGCC 

7AACCTC7AAGAGTCACCTCCAGAGCCACAGCGAGCAGAGCACCAAGGAGGTGGTGGTGACCACCTCACCAGCCACTTCACATTCGGTGCCAAACCA 

GGCATTGAGTTCGCCTCAACCTGAGAATTTGGCACAGCATCTGCCTGTCCTGGATCTATCCTCGTCATCCTCGAGCTCAGAGAAGCCCAAGCGGA 

TGCTGGGCTTCACCATCGATGAGATCATGAGCAGATAGATTGAAGGACCAGCGAAACCCGGTATCCGGTTTCTTTTGAGACATTCTAAAAGAGAT 

TCAGAGAGCTGGACGTCCGCCAGGCATGGGAAATGCCTGGAATGGAAAACCAAGCGAGTCCAGCCCGCATTCTGGCCAAAGGTGCAAGTGGAAAA 

TCCTCGCCAGCAACTCGTAATCGCAACCCAGTGAGCAGAAGAGTCATCCTCCGTGCAATCGGTTCTATGTAGCAGATCCCAATTGCGTTAAACAA 

ATGCC AAAGT C AAATAGCAGAGGAAAACT GAT GAAAAATCCAATGAAAAGT ATTGCTAGCAC AAGCCTCCTAC AAAT CGATTCCAAAAC AC ACAC 

ACACACACACAACTGTAAACTATAAACTTTAATAAATAGCACATAATTTATATTGTAATCTAACTATTATTGTATGTACCTTATAGTCGTAGTAG 

CCCATTCAGTTCGTAGTGCTTATGGAGTAAACCAAATGTTATGTTACAATAGCTCAATCAAACAAACCTCAAACCCGCTTTGGTTATTGAAAGAC 

TGTTTTAATTATTATATTTAAATAGTACTTGAATAAATCAATCGATCGATTGTGTAATGTT 

{SEQ ID NO: 383) 

Start ATG; 190 

MSSTSASPISNITVDDELNLSREQDFAEEDFIVIKEERETSLSPMLTPPHTPTEEPLRRVHPAISEEAVATQLHMRHMAHYQQQQQQQQQQQQHR 
LWLQMQQQQQQHQAPQQYPVYPTASADPVAVHQQLMNHWIRNAAIYQQQQQQQQHPHHHHHHGHPHHPHPHPHHVRPYPAGLHSLHAAVMGRHFG 
AMPTLKLGGAGGASGVPSGATGSSRPKKQFICKYCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHRYIHSKDKPFKCSDCGKGF 
CQSRTLAVHKVTHLEEGPHKCPICQRSFNQRANLKSHLQSHSEQSTKEVVVTTSPATSHSVPNQALSSPQPENLAQHLPVLDLSSSSSSSEKPKR 
MLGFTIDEIMSR* 
(SEQ ID NO: 384) 

Name: odd skipped 

Classification : transcript! on_f actor 
Gene Symbol: odd 
FlyBase ID: FBgn0002985 



Celera Sequence No. : 142000013384 622 

TGATAAATAGCTTTTTATAAAAAGAACTATAACACACCTTTAG7AAACTCTGGGGCTAGGTTCAAAATGGTTTTCGGTTTTAAGGTCCCAAATATC 
TGTTTCACCAATTCCAGGCGGTCGTCTATGTAATGGGACATGGGATGCAGCTTTCGTTCCTCCTCGATCTTCTCCTTCTTAATGCTAATGGGCTT 
GGGGATTTGGTCGCCCTTGCTGGATTTTTTCGCCTTCAAGCCACTCATTTTATCCAATTTGTTGTCAAAATAAAAACCAAGTAATTTCGGTCTTT 
CGGGCAGCGTTGCCACTACTATTTATACCTTGTGGACAGTGGGACGAAGGCAACAATTTTGTGGGATTAGTGTGAATGGCTCTATTTTAAAATTC 
AAAACGC AAAGAAAAACGAGC AGT ATT ACGAAAATGT AT T T AAT AAAAATT ATT T CGT AT ACT AGGGAATGGC ACAT AGAAT C AAT AAT TAAAAA 
TGTACAATGGTGTATATTGCCTTTAGGACTGTATTATCCACTGTGCGCTGTTGTTAACAGTGTTATTGTTTTTTGTACAGAACAGCTGATTATCA 
AGCGGTGTAGGTTGAATTAAATTGTGGGAGTGTTAGTTGTCTTAAAATTGTTAAGGTAAGTTGATGTATTAATTGAACTTGTAATCCATCAGTGT 
AAAAGTGTTTACGGAAAGCGGTGATAGCAATTTTATATCAGCAAACAGTTTCTTGTCGTAGATATCATGTGAAATCCGTTAGTAAGGTGTTACGT 
GAGAAATAGTACTTGTGGCAGAATTTTAGAGTTTGGCTTTTATTAATGGGGATCTGTTACTAGTGTAAGTTACATTTAAACACATTAGTCCAATA 
TATATTTAGCATTTTCTATATTTTTGTTTTTTTGACATGCCGCTAGAAAAGTAACAGAAACTGTTAAGTAACAGTGATGGCACATTTAGGTTATT 
TCATTGGGGAGCGTGTTTTGACAACTCTGCTGCGGCCACACTGAACATATGTTCCGGGGAGAATGGCTGCGATTTCGCGTCGGTAAAAATAGCAA 
ATACTCGTTAATGTGCTGTGGGAACGCTTCCTCCCCGGCCCCAAAGTGGCCCCGAAGAAAGTGAGCAAATGTGCGCGCCGCAAGATAGTCGCCGC 
CGAACAAACGATAGTGACGAAAGTGATTTAATTCAACTACCAGCACTCCCGCAAATACGATGAGTATGTCGCGCGGCGGCAACACAACTCTGGAC 
TTGCAGCCGCTCCTGGCGGAGAGCGATGTCGGAAACAGGGAGCTGGAGGAGAAGATGGGCGGATCGGCGGATCGGTCATCGCTGCTCGATGGATC 
CGGTTCGAAGGAGCTGAGTCACCGGGAACGCGAGGACTCGGCGTTGTTCGTCAAGAAGATCGGGAGCGCCTTGTTCTATGGCTTGTCCTCCTTCA 
TGATTACGGTGGTAAACAAGACGGTGCTTACCTCCTACCACTTCCCCTCGTTCCTGTTCCTCAGCCTCGGGCAACTTACTGCTAGCATTGTGGTC 
CTGGGCATGGGCAAGCGCCTGAAATTGGTGAACTTTCCCCCTCTGCAGAGGAATACCTTCGCCAAGATCTTTCCGCTGCCACTGATATTTCTGGG 
AAACATGATGTTTGGACTGGGTGGCACAAAAACCTTGAGTCTGCCCATGTTCGCAGCCCTACGACGCTTCTCTATCCTGATGACCATGCTGCTGG 
AGCTCAAGATCCTGGGACTGCGACCTTCGAATGCGGTTCAGGTCAGCGTATACGCAATGATCGGTGGAGCGCTGCTGGCCGCCTCTGATGATCTG 
TCCTTCAACATGAGGGGCTACATCTATGTGATGATCACTAACGCCTTGACCGCCTCGAATGGCGTATATGTGAAGAAAAAACTCGACACCTCGGA 
GATCGGAAAGTACGGCCTAATGTACTACAACTCGCTGTTTATGTTTCTGCCTGCCCTGGCCCTCAACTATGTTACAGGGAATCTAGATCAGGCGC 
TGAACTTTGAACAATGGAATGACTCAGTGTTTGTGGTGCAGTTCCTGCTCAGTTGCGTTATGGGTTTCATCCTATCGTACAGCACCATCCTGTGC 
ACGCAATTCAACTCGGCGCTGACCACCACCATTGTGGGATGCCTGAAAAACATCTGCGTAACATATCTGGGCATGTTCATTGGAGGCGACTACGT 
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CTTCTCGTGGCTCAACTGTATTGGGATCAACATCAGCGTGCTGGCTAGTCTGCTCTACACGTACGTCACTTTTCGGCGGAAGCGGGCTCCCGATA 

AGCAGGACCACTTGCCCAGCACCCGCGGCGAGAATGTCTAGCTTCATCTTAGCTCAAACTCAAGCCAAAGCCATACGATATGTTAAGTGTATTCC 

GTACTGTGTACTTCAATTGCGATCTGCGATTCGCGACAACGTTCGTATTTGATTTTCCTACAACCACACAGCGCATCGTCTCTGTACATATATCT 

AGCTATATTTCTATAAAAATGTAATTTAGCAATTATGTAATTTTAATCACTTTGTACATTTGTCTGCATGCAACTGTTTCCTGCATCGAAAGTGT 

CATAAGAAATAGAAATACACAGACTTCCACTCCTCCGCCTACGGAATGGGTGGGCCAGATAAACAATTGTGTTCCGTCTTCTGTTCCATTTTGGG 

TCATAATTGATAAGATTGCCTGCTCCGATAAGGCATACAACTCGGACGTTGCAAATTAAGCAGACATCTTGCTCTCTCTCACGTGCTTTTCTAAT 

TACTTTTTTGTGTAAAATAAGTAATGTAGTGGAACTGCGTTGTGGGATTTGCCAATTACTTGTGCTTAATAGGTTGTTGTTTATCTATTCTGAGC 

AAACACGTTGGCGCCTTGAGATTAATACAAAACAAAGTACTGCATTGCCGGACAATCCAGCAGCGTCATCACTCAACATCACTATGTAAAAAGCT 

TTGGATAGGTTGCTCGATTACTGATTACTCTATATCAAAATTTTGGGTTTTTATGTTGAATAAAAAAAAAACAATATTATCAGCCAAGAACGTGA 

TTGCGAGTGGTGATAGTATTTTTTAGAATAGTACGCCTTCCGATATTTTGTGTTATTTTCCGTTTCATTTGGTAATTGCCTCAAAAGGCTTTTGT 

TTAGATTGTAGTGCTGAAGAATCTTTTTCTTTGTCTTAAAAGTAAATAAATTTGCGCAGCTGAAACTGTTTTGTTTTAGCGAAATAAGTATGTAT 

ACTTTAAGATGGCTATCTATTTTCTTAGAATTAGATTTTACTTGGTAACTGCCTAGTTCCTTATCAAAAGATTAAGATTTAAATATCCCACATAT 

ATGTTGCTAAACTTATCTTTTAAATGGTGACTACTCATTTATCCAAAGAAACAACAATGACGACATTGAATCGGGCATAAAGCGGCAGAATGAGG 

TGCTAGTGCCCCGGCAGCACTTTGTTTGGGTTATTTTGGGAGCCCGCAATATATGATGGGCGGTGGTGAAGTGAGGGATTCGGAGTGGGGACAGG 

TAATGAGGAAGCCGAAACAGGTTTGCACAGACTTAGCCGTGCGGTGAGGTAAATTGTTAGTCAGCATCAGCCCCGACCG 

(SEQ ID NO: 385) 

Exon: 1001.. 2594 
Start ATG: 1200 

Transcript No. : CT12909 

GTTCCGGGGAGAATGGCTGCGATTTCGCGTCGGTAAAAATAGCAAATACTCGTTAATGTGCTGTGGGAACGCTTCCTCCCCGGCCCCAAAGTGGC 

CCCGAAGAAAGTGAGCAAATGTGCGCGCCGCAAGATAGTCGCCGCCGAACAAACGATAGTGACGAAAGTGATTTAATTCAACTACCAGCACTCCC 

GCAAATACGATGAGTATGTCGCGCGGCGGCAACACAACTCTGGACTTGCAGCCGCTCCTGGCGGAGAGCGATGTCGGAAACAGGGAGCTGGAGGA 

GAAGATGGGCGGATCGGCGGATCGGTCATCGCTGCTCGATGGATCCGGTTCGAAGGAGCTGAGTCACCGGGAACGCGAGGACTCGGCGTTGTTCG 

TCAAGAAGATCGGGAGCGCCTTGTTCTATGGCTTGTCCTCCTTCATGATTACGGTGGTAAACAAGACGGTGCTTACCTCCTACCACTTCCCCTCG 

TTCCTGTTCCTCAGCCTCGGGCAACTTACTGCTAGCATTGTGGTCCTGGGCATGGGCAAGCGCCTGAAATTGGTGAACTTTCCCCCTCTGCAGAG 

GAATACCTTCGCCAAGATCTTTCCGCTGCCACTGATATTTCTGGGAAACATGATGTTTGGACTGGGTGGCACAAAAACCTTGAGTCTGCCCATGT 

TCGCAGCCCTACGACGCTTCTCTATCCTGATGACCATGCTGCTGGAGCTCAAGATCCTGGGACTGCGACCTTCGAATGCGGTTCAGGTCAGCGTA 

TACGCAATGATCGGTGGAGCGCTGCTGGCCGCCTCTGATGATCTGTCCTTCAACATGAGGGGCTACATCTATGTGATGATCACTAACGCCTTGAC 

CGCCTCGAATGGCGTATATGTGAAGAAAAAACTCGACACCTCGGAGATCGGAAAGTACGGCCTAATGTACTACAACTCGCTGTTTATGTTTCTGC 

CTGCCCTGGCCCTCAACTATGTTACAGGGAATCTAGATCAGGCGCTGAACTTTGAACAATGGAATGACTCAGTGTTTGTGGTGCAGTTCCTGCTC 

AGTTGCGTTATGGGTTTCATCCTATCGTACAGCACCATCCTGTGCACGCAATTCAACTCGGCGCTGACCACCACCATTGTGGGATGCCTGAAAAA 

CATCTGCGTAACATATCTGGGCATGTTCATTGGAGGCGACTACGTCTTCTCGTGGCTCAACTGTATTGGGATCAACATCAGCGTGCTGGCTAGTC 

TGCTCTACACGTACGTCACTTTTCGGCGGAAGCGGGCTCCCGATAAGCAGGACCACTTGCCCAGCACCCGCGGCGAGAATGTCTAGCTTCATCTT 

AGCTCAAACTCAAGCCAAAGCCATACGATATGTTAAGTGTATTCCGTACTGTGTACTTCAATTGCGATCTGCGATTCGCGACAACGTTCGTATTT 

GATTTTCCTACAACCACACAGCGCATCGTCTCTGTACATATATCTAGCTATATTTCTATAAAAATGTAATTTAGCAATTATGTAATTTTAATCAC 

TTTGTACATTTGTCTGCATGCAACTGTTTCCTGCATCGAAAGTGTCATAAGAAATAGAAATACACAGACTTCCA 

(SEQ ID NO: 386) 

Start ATG: 200 

MSMSRGGNTTLDLQPLLAESDVGNRELEEKMGGSADRSSLLDGSGSKELSHREREDSALFVKKIGSALFYGLSSFMITVVNKTVLTSYHFPSFLF 
LSLGQLTASIVVLGMGKRLKLTOFPPLQRNTFAKIFPLPLIFLGK^FGLGGTKTLSLPMFAALRRFSILMTMLLELKILGLRPSNAVQVSVYAM 
IGGALLAASDDLSFNMRGYIYVMITNALTASNGVYVKKKLDTSEIGKYGLMYYNSLFMFLPALALNYVTGKLDQALNFEQWNDSVFVVQFLLSCV 
MGFILSYSTILCTQFNSALTTTIVGCLKNICVTYLGMFIGGDYVFSWLNCIGINISVLASLLYTYVTFRRKRAPDKQDHLPSTRGENV* 
(SEQ ID NO: 387) 

Name: nucleotide-sugar transporter, putative 
Classification: transporter 



Celera Sequence No. : 142000013384645 

GAATGATATATGTTCTGCAGCCATGCAATATATATAAGTACAATTTATTAAATTTTTTAATCTAACATTGCTGTGGTGAGTATGGGGGTTGGGTC 
ATAGCGTTTAGTATAAGTATTTTAAGTTAAGTACTTTTCTTGAGTGACATGGACTGGTAACATTAACATATGGCACAATATTTTAATTATTTTTA 
CAAATAGCATTTAATTTTAAACTAGCATTAATTTCTTTTTATATTTTTGGATACCCCGTCAGAAGGACGAAAGCACCGGAAGGACAGAAGGACAG 
GAGTCATCGCCGGGGATAAATTGGAGTTTTGCAGCATTTAAATATAGCCGACTGTCCCACCCACTGTCCCTCCCACTGAAATCTTAATGTTGCAG 
ATCTATTGCAGAAATCTTATTGTTAATAAAGCGGTTCATGACATGGATTGGTTGTGGTATTGGTACTAAAGAGGGCGGATCTTAAACTCCTCCCA 
CTGTTATGTTATTATCTCAAAAACAGTGGAAAATTTTTAATGATTTTTTAATTTAATGAAATATTCCTGAAATGTCCTTAAGGTCTGAGGAGTTA 
AGGTTTGAGGAGGTCTAGCGTGCAAAATGTTGGGGAGCCGATTATTCTATCGCATTATTTTTTTACTTTCTTGAGCCCTATAAGTAGTACATAGA 
TAGTTAGATCTGCCATTTTGAAACATTTTTGCATTTTTATTCTTGGGAATTTACTCCCAAATAACGTTAGCGCAAATTTATTAATTAATGTATTT 
ATTAAATAAATTTATTATACTGATAACATATTAATATAGCAAAAAAAAATACGTTTTTGAATATATGTGCATTCTTTTTAAACTGCTGCACACAT 
TTATTATTTTTTTGGACTCTTACAATACTTTTTCAAAAATTTTAAATATAACTGCGACTTTGAGCGATACTTTCACTTTTCCCGTGACGCCAACC 
GATATGCACAGCGCTTGCAGCCCTGGCCAATTTTGATGTGTTTCAGCATTGTACCCTGACAACCCTGCTCTGTAAAACTAGGAAAAACAGAAAAT 
TTTATTTTCAATTGTTAAATATCCAAACGTAAACAGCCGGAAAATGTCACGACAAGCCTCTCGTTTGGACGCCAAGAAAGTGGTAAGTTGGCCAA 
AGAAGAGAGAACAGCGCAAGATCGGTATCGTTAGTCATTTGGAGGCATAGATCTAGTTGTAAGGTTAGATTTGTGCCATAGTGCCTAAGAACGAA 
TCTTTGACGTGCCATCCACCCTCAAAATCGGCTTCATTCACTTTTCCAGTAAGTGTGTGTGTTGGGGGCTTGAGGTTGAGACACGGACACACAGA 
CAGGCACAAGCCGACGTGCAATCAGGCAATGGAAAACCATTTCAGCGCCACCAGTCGCCCCTCAGCCACGCCCAGAATGTGCACCACCGCCTCTG 
CCGTCGGTGCTGCCGCCAGCGCCGCCGCCGCCGCCGGCAACAACAATAACAACAGCAATAGCAGCAGCAACAACAACGCAACAACCGGCAACAAT 
TTAAGTGTTAGCACTGCAGCCGGAGGAGCCACAAATGCCGCAGCAACGGCACAACCCATCCAAGTGATACCAATGCCCATGCTGCCCACCGGAGC 
GGCGCAAATCATAATTGGCCAGCAGCCACAGGGTCAGGCGACGGCGACGGGTCTGCAGCCACAATTGATACCGCTGCAGGCCAACCAGATTATGC 
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TGCAGGCTGCGCAGCAGCAGCCACAGATGCAGGTGATGCAGCTGCCCGATGGCCAGACTATATTCTATCAGACGCCCACCATTGCCGCCTTGGAT 
CCGAATGCAGCGGCCAACGCCGCAGCTGCCATGGCAGCCCAGCCGACGCCGCACTACCTCAATATCAATGGGCAGCTGGTGCAGATCAATCCAGC 
ACCGAGCGCTAATCAGGCGGCACCCACAGCTGGGCAACAGATTATAATGGTGCCGCAGACAGCGATGGCAGCGGTGAATGCAGCGGCCGCCAATG 
CCGGAGTAGGCGCCGGAGTGGGCACAGTGGTTACCCAACAACAACAGCAGCAGCATCAACAAGTACAATCCCAGACCCAAAACCAGCAGCAGCAG 
CAACAGCAGCAGACGGTGGCGGCGGTGGCTGCCAGTGCTGCAGTCAATAATATAAGCGCCGATGTCAGCACAAGTACAACTGGAACGAATACGAA 
CAGCGAGGACGAGAGCTCCAAGGGCGAGGCGGACGAGGAGCCGCTCTATGTGAATGCCAAGCAGTACAAACGTATTCTCATTCGGCGGCAGGCCA 
GGGCCAAGCTAGAGTCGCGCATACCCAAGGAACGGTGCAAGTATCTGCACGAATCTCGTCATCGCCACGCCATGAATCGGGCTCGTGGCGAGGGT 
GGTCGCTTCCATTCGGCACAGGAGAAGGGCGACCAAGATTCGTCCGGCCCGGAAGGCGGCAGCATGCCAATGGCGTCCAGTGGCGGCGTCACCCT 
CAGCCGTGGCACGGCACGTGCTCCGCCCAAACTGATCGCGCCACATCAAACGCCCAGCATTACCATAACGGCGATCAAATCGGAATAGTCGCCTC 
CTAGCTGTAGGATACAMCTAGATAAGATTGTAGTCATAGCGTTTAGTCCTAAGCCAGTCGCTCTGTGTCCATTTGTCCTCAATCACACCACTTT 
AGTCGGCTCACATAGTTTTCATTTTTTTTTTTTTTCAAAATTAAGTCTTCATATATTATTTTCAAAGGGAATTTTTGGGTTTTTCTATTTATTTT 
CATTCACTTAGATTGGAAAAGTGATAAGTTCAAGAAAGAGATAATTGTTTTACTTATTATATTCATTTTCTAAATCGTTTCTAATTTCTAATATT 
TATTTTCTGTTAGCCAGCTGTTAAAAATTGTTTGTAATTATTAAATAGAGAATTTATTTCATTTACTAGACATTTTCAAAATTTACACAAGTAAT 
GCAArTATTAATTTAGGGTTTATAGTTCATTTCAAAATGGAATGGGGAAAGGGCTGCCAATAGTCACCGCAAAACAACAGAGCTCAACAATATAA 
GATTCAACAAGGGTGAATGATATAGTCGCCTTCCCCGACTATCAGATACCCGTTACTCAGCTTTTGGAAATGCGAACTCGAAATTTCAAAAGTTC 
ATAAATTTCAAACAAATTTTTTGGCAAATCGATAGAAGACTAATAAAATTATGAAAAAATATCCAAAAAATTTTCAAAAGTGTGGGAGTGGCAAT 
TTTGTGCGGCTTGTGGGCGCTGCGTCGTGGTCTCTATCATTAACCTCAATATCAACCCTTTAGCGTTTATATTTCCTGAGATCTCTTTTAAACGG 
ACAGACGGACATGGTTACATCGACTCGGCTACTGATTCCGATCAAAAATATATATACTTTATATGGTCGGAAACGCTTCCTTCTGCCTGTTACAT 
ACTTTTCAACGAATCTAGTATACTCTTTGAATCTACGAGTAACGGGTAAAGCACCATTGAGGACCATTTGCCGCATAATAAATGCAACACAGTTT 
ATAAAATTAAAAAAAAAAAGCATATACATTTACTGCGGAACTC 
(SEQ ID NO: 388) 

Exon: 1001.. 1127 
Exon: 1285.. 2558 
Start ATG; 1359 

Transcript No. : CT12939 

GTACCCTGACAACCCTGCTCTGTAAAACTAGGAAAAACAGAAAATTTTATTTTCAATTGTTAAATATCCAAACGTAAACAGCCGGAAAATGTCAC 
GACAAGCCTCTCGTTTGGACGCCAAGAAAGTGTAAGTGTGTGTGTTGGGGGCTTGAGGTTGAGACACGGACACACAGACAGGCACAAGCCGACGT 
GCAATCAGGCAATGGAAAACCATTTCAGCGCCACCAGTCGCCCCTCAGCCACGCCCAGAATGTGCACCACCGCCTCTGCCGTCGGTGCTGCCGCC 
AGCGCCGCCGCCGCCGCCGGCAACAACAATAACAACAGCAATAGCAGCAGCAACAACAACGCAACAACCGGCAACAATTTAAGTGTTAGCACTGC 
AGCCGGAGGAGCCACAAATGCCGCAGCAACGGCACAACCCATCCAAGTGATACCAATGCCCATGCTGCCCACCGGAGCGGCGCAAATCATAATTG 
GCCAGCAGCCACAGGGTCAGGCGACGGCGACGGGTCTGCAGCCACAATTGATACCGCTGCAGGCCAACCAGATTATGCTGCAGGCTGCGCAGCAG 
CAGCCACAGATGCAGGTGATGCAGCTGCCCGATGGCCAGACTATATTCTATCAGACGCCCACCATTGCCGCCTTGGATCCGAATGCAGCGGCCAA 
CGCCGCAGCTGCCATGGCAGCCCAGCCGACGCCGCACTACCTCAATATCAATGGGCAGCTGGTGCAGATCAATCCAGCACCGAGCGCTAATCAGG 
CGGCACCCACAGCTGGGCAACAGATTATAATGGTGCCGCAGACAGCGATGGCAGCGGTGAATGCAGCGGCCGCCAATGCCGGAGTAGGCGCCGGA 
GTGGGCACAGTGGTTACCCAACAACAACAGCAGCAGCATCAACAAGTACAATCCCAGACCCAAAACCAGCAGCAGCAGCAACAGCAGCAGACGGT 
GGCGGCGGTGGCTGCCAGTGCTGCAGTCAATAATATAAGCGCCGATGTCAGCACAAGTACAACTGGAACGAATACGAACAGCGAGGACGAGAGCT 
CCAAGGGCGAGGCGGACGAGGAGCCGCTCTATGTGAATGCCAAGCAGTACAAACGTATTCTCATTCGGCGGCAGGCCAGGGCCAAGCTAGAGTCG 
CGCATACCCAAGGAACGGTGCAAGTATCTGCACGAATCTCGTCATCGCCACGCCATGAATCGGGCTCGTGGCGAGGGTGGTCGCTTCCATTCGGC 
ACAGGAGAAGGGCGACCAAGATTCGTCCGGCCCGGAAGGCGGCAGCATGCCAATGGCGTCCAGTGGCGGCGTCACCCTCAGCCGTGGCACGGCAC 
GTGCTCCGCCCAAACTGATCGCGCCACATCAAACGCCCAGCATTACCATAACGGCGATCAAATCGGAATAG 
{SEQ ID NO: 389) 

Start ATG: 202 

MENHFSATSRPSATPRMCTTASAVGAAASAAAAAGNNNNNSNSSSNKNATTGNNLSVSTAAGGATNAAATAQPIQVI PMPMLPTGAAQI IIGQQP 

QGQATATGLQPQLIPLQANQIMLQAAQQQPQMQVMQLPDGQTIFYQTPTIAALDPNAAANAAAAMAAQPTPHYLNINGQLVQINPAPSANQAAPT 

AGQQIIMVPQTAMAAVNAAAANAGVGAGVGTVVTQQQQQQHQQVQSQTQNQQQQQQQQTVAAVAASAAVNNISADVSTSTTGTNTNSEDESSKGE 

ADEEPLYVNAKQYKRILIRRQARAKLESRIPKERCKYLHESF^RHAMNRARGEGGRFHSAQEKGDQDSSGPEGGSMPMASSGGVTLSRGTARAPP 

KLIAPHQTPSITITAIKSE* 

(SEQ ID NO: 390) 

Classification: transcription_f actor 



Celera Sequence No. : 142000013384615 

TTGGCGGGCTCTCCGAAATTCACAATAATTGCACGAACGAGTCGTGCGTTGTTTTCTCTGCTCGATTTCATTGTTTATGGGGCATAATATGTGGG 
GATGCCGGGCCAAAAACGAAATTAATCACGAAAACGCACTCACCGTCGAAAAATCAACGAAAAGAAAGAAATTTGACAGATGGGTCGCGGTTAGG 
GATGGCAAAGAGGCAGTGCTGGAGGTGGTATTTCGTAACGATAGTTAACGATGTTATCGGTTGTACTTGTCAGGGGTGAAGAGCATTGGATTCTA 
CGTTTATAAATGGTTTTGATAGCTCATTTTCCTTCTTCTACGCTTATTTTTTTCCTATGAAATTGTTTAAATATTTACTGTTTTGTTTATGTCTT 
CCTGTAACTGTTTAATTTAATTCAGCTAGTATCGATAACACTGTCATCTCTAAGTCGTGAACTGTCAAATTCAAAGCATGTGCTTTGCAACACTT 
CAAGGGAGTGTGTCTTTGCTAAAATCGTGGTCTTGTTTTCACTGTTACAATTCAATATTTTCCAATTGGAAACCAGTTTAAATTTTTCCCAACTC 
GTGAAAAATAAAAATAATTTTAAATTTTCGTTAAACGTTTTTATTACGTTTTAAAAGCATATTTTTCACATGTCAACATACTGATAGTTATAATT 
TAATAACGGTTCTTAATACCTTTGTAAAAAATGAAATACAGATCAAGGATCTAACGAAAGGCCATTAATAAAGTAGATATGTAAGTAATATAAAA 
AATATAGACAAATAGTTTTAATAGAATCAATAGCAATTCAGTGATTGAATTCATATTTTTACCAGCGTTGAAGAAAGTTCAGTAACATCTGTTGA 
TCATCCGCACTAACAGAATGTTACCGAAAAGCTCAACGAAATACAGCCCGCTGTTAACTAACATGGCGCTTCGCCAACAGCAACAGGTTTCCTAT 
GGTCCACGATCCCCACATATGTGGAGAATGGAAATTCAGTCGCGTGTGAGCAGCCCACTTTTAACGTTGTGGTGTCTCTGGGGATCAGATCCAAA 
ATTCAAAAAAGTCCAGTTTGGCCTCTTGATTAAATCAATTCTTCCACTTCATTCAAGATGATGAACAGCCTAAAGAAATTGCCCGTGCGCATGGT 
GAGTTGGCCACGCCCCCCTAACCGACACCTTTATACATACATATATGCCTACAACTAATTCCTTTTTCTTTATTAATTAACTTTTTAGCTGGCTA 
ACGCTGCACGTCAGCAAAGTGCTGCGATGTCCTCGGCAACTGGTCTCCCCCCACCCCTCACCTTTCTCACCGACGATGAGAAAATGATGAAGGAG 
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ACTGGTGAGTATACCCTTTGGATCTAAATAGATCATAGTGTTAATGGTACTGTACACGGAGTTGTAGTCCCCATTTGATAGGCGTGTTTTTTCTC 
CATGTCGAATTTTTAGATATTTTTATGGTGTTTAGAACTTGCTTGGTGTTACGTAACACCAAGCATCGCATTTTAATGATGCTCTTGTTATTTAT 
TAATTTTCCATTGTGCTAACCAGCTGAAGTCTTGCTGCACTACTATGTGTGTATTCGTATTTTTCGAGTTTTGTTCTTTTTTTTTTCCTTTTTTG 
CTGATAAAGTTTATTCAGTTGAGGGAAACGCGGAGGAGGCCGCTTGCCAGATAAGCACATTTTTATTTTGATAGTAGAAGTAAAATGTCACTAAG 
TATTAGTGGGTAGGTTTGTCCATTTTCTTTATGACGTTTGTATGTTTAAATATTACAAGATATACAATTTCCGATTTGATCCTTATTGTAAAATG 
ACGATCAACGATTATTACCTTATAATTAATTCCTTATTAAATTAATTTTTCTGGCGATTATTATATAATTTGTAATGATTAATCCCTTAAATTTT 
TTGTAGTGTAATAAACCTAATATTTTTAAGGTAAATGAGGTCCTTGAATGTGTTTCTATGTTTATAATTTATTCCCTTTGGTGAAAAATATTTTT 
ATCTATATAAAATAATTTTTAATAAAACTATTTCAGTATTTAATAATAGTATGCTTAAAAGTTACAAAATATTAATGATTCATCCCACTGAGGTC 
CTTAGGTGACAATTTTGTTATGTACTGCCATAATAAGGTTAGCTCTTAAAAATCATACGATTTCTTAAAGCTGGTTACATGCATGTTTAATGAAT 
ATAATTAAATTTAGAAAGTTACAAATAATATAATAATATAATATATATAATAATATTTCGAATATCGTGTTGTGTCAAATGTGTTTAAAAGTGTT 
C A C T T AAAC T AAAT G A AT AT AT GT AT T C T T AAAAAGT GC AC AAT T AAT G GC C GAAGAG C T AAAG C T G C C AGC T T GGC AAGTT CAAGGTC T AAC C T 
TATAACAAGCTTTTTTCCCGATGGAAAAATTTTCCTCCCACCTGCTTAATTCTTTAAAAAATTATAATTAATTGATAATAATAATAATTTACCAG 
TGAGAAAACTTACCCAGGTGGGCTGAAGGAAAAAGGGGTGGGCCCCGCACACACGCCCACCCTGCTCACACACACACACACGCGTGTTCTAAAAA 
TAGCTTTTGCCCACCGCCAACGCTGAACATCGAGTTTCGTCGCTGATATTTCTCGGTTGGCCGTGGTGTTCCTCGTTGAGCGGCCGGCACATAAA 
GAAAGCAATTTACCAAAGCGCGAGTTCGAGCCCTCAAATTATAAGCCTCTCTCATCCAAGATCCTCTCATTTCCCCTCCCACAGAGTACTGTTAT 
TGCGTGCGCGGAATGTTTCTTATTAAACTTATATAGTGTCCTTAAGGATATCCTTTAACCACGAGTTTTCACAGGTCAGTCAGCTGTACGGTTCT 
GGTTTTAAACGTCAACCGAATAGCGTTACACCAGTGATAGAGCAATAAATGCGGCTGTGCCATTCCCCCTGTCCACCTGTGGCCACTGTTCTAGT 
GGCCACGCCGGGCAATCCGGGGCGTGGCACTTGGCCGGGCGCCCTTCCCTAGAGAGTTGAATTACCTGAGTGGGGGGAGGACGACGACTTGTTGT 
TGTTGTCCCCCTGCAAACCATCTAATTCCTAGTTGGGAAGGTTACATTTCCTTTTTATCACCGATCACCTTGTTTTTCCTCCATAATAATTAATA 
ATCCTCGTCTCGCTTATCAGTCCCTCAACTGGACTCAATCTATATGGCTTCCCATATATATACATATGTACATATGTATGTGTGTTACATACAAA 
TAATTCTAAGTGGGAAAGTATGTGTCTATATGGCCTAACTGTTGCGTAATCTGAGCTGAATGCAAAGTGGCTAAGTGCATGAAGTCAAACTGGTT 
TTTTTTTATATTTTTCAATCATATATCATAGGCTTTGGCACAGTTAACACATTTTAAATTAAATTATTACATTAAATTTTCTGTTTTAATATGTA 
TTTCAAAGTGCTTGATTTTTTAAGTAAGCTTTAAGTACGTAAAAGAATAGCAAACACACATTTACTCGAACGATTATTACCTTCAAGAACCAAGT 
AC AGTT GC CC AAAAAAAAAGC AAAA T T TC AAT T T C AGT C TAT AT C A GAC TAAAG AAAT G AC CT GAT T T TAT T G AATT T C AT AT T T C C ACT AT AAG 
CCAAACTCCAAAAAGGACATGTAAAATTTACCATCAAGCCCATTGGGACACGACACATCTGTTCGAAATTTTCCGACTGACCCCTGTCCATATGT 
TGGAAAACTAAATGAATATATAAAGTTAGTTAATTATTTAAGCAATTCCCATGAAAAGTGAGCGAAAGAAAGTATTTCCAAATGTGTGCAAAACG 
GCAGGAAAGTGATTGAAGAAGGGGCGACGGAGGTGTGGGAGCTTCTGTGGGTGTGTACCAGTTGGATTTCCAAGCAGAAAAACAGAAACACAAAG 
TCAAGAAAAAAGAAAAAAATGCTTAAAAAAGATCAATAATAAATACGCTGAGAGAATAATAGAACTGCCGAAAGAAGACAGCTGCGAGAGCCCCA 
CGCGATCGTAGAATCTAGTGATAAAAAACGAACACACGAAGACTTTTTATCTGAGAGACAAACGCGTGTAGAAAAATACATACCTCAACCTACAC 
AAGCAAGAGATCAACAGCAAACATCGCGCCCCTTAGAAGAAAAAGAACTCCAAAAAAAATAAAAATCAAAAAAGCTTAAGTTTTTACCTTTTTTC 
AAAAACGTGCAGTTAAAAGCTTATCGAAGCTTTACAAAAACTTTGACATTAAGCTCTTTCACCGATCGTTACATTCTATATCTACACATATACAT 
ATACACATATGAGTTATACTTAAGGTCTAGGAATTCTTATGTGTTATAATATATTTAAAAAATAAAAAACCTATTTTAAATGTATCTAAATGTAT 
GCAAGCATATTTGACGTATATGAGTTATTTCACATGTTTCATAAACCATGGAAACTTTCCTTTAAACTTTCAGTTGCCAAACTGGCTCAGGAGCA 
GATCCAGCCCTTGGTGAAGAAAATGGACTTTGAGCACAAATTCGATCCCTCCGTGGTGAAGGCGGTTTTCGAGAACGGCCTGATGGGCATTGAGA 
TCGACACCGAGCTGGGAGGCAGTGGCTGCAACTTCATGACCAACATCGTCGTGGTGGAGGAGCTGTCCAAAATCGATCCTGCCGTCGCCGCTTTT 
GT GG AC AT T C AC AAC AC AC T G GT CAAC T C G CT GAT GATC AAGTT C G GC AAT G C C G AGC AG AAGG CG AAGT AT CTGCCTAAGCTGGCC C AGG AGT A 
CGCCGGTAGCTTCGCCCTGACCGAACCCGGTGCAGGATCCGATGCCTTCTCCCTGAAGACCGTCGCCAAGAAGGACGGCTCCCACTACGTAATCA 
ACGGCTCCAAGATGTGGATCTCCAACTCTGATGTGGCGGGAGTCTTCCTGATCTTTGCCAATGCCAAGCCAGAGGATGTAAGTATATATTGTTTT 
AAAGCCATTTAAATTTAGAAATAAATCATATTTTTATTAGTTTTTTTTTCGATTATACCCCCTAGCTTCGTTTTCACTTTTTCGCAGACCACTGA 
ATGATTTCAAAATATTTTCCGTTGCTAATCTTTATGTTATTCCAACTGATTGCCAACTAATCGCTCTCAGATCAAAGTTCACTTGATTGCAGTTC 
GTTAAATATTAAAATATGCAAATGAGTTTTCTTTTCAAATATAACGTTTGAGGAAACAAATTTCATATTCCAATTATTCGACTGCATAATATAAA 
CTTTCTTCAATTATATTTTACTGAACAAAAAATCTGATTGCGTAATTCTTTCAAACTAAAACCTTATCGTTATCTTGTTTTTTGCAGGGCTACCG 
TGGCATTACCACCTTCATTGTGGACCGTGAGACTCCCGGTCTGATCGTGAACAAGCCGGAGGACAAGCTGGGCATCCGTGCCTCTGGAACCTGCC 
AGCTCACCTTCGACAACGTGCGTGTGCCCGAGGAGAACATCCTGGGAACCTTCGGCCATGGCTACAAGTATGCCGCCGGCTTCCTGAACGAGGGT 
CGCATCGGTATTGCTGCCCAGATGGTGGGCCTGGCCCAAGGAACCTTCGACGCGACCATTCCGTACCTGCTGGAGCGCAAGCAGTTCGGTGATGC 
CATCTATAACTTCCAGTCGATGCAGCATCAGATCGCCACCGTGGCCACGGAGATCGAGGCCGCCCGCCTGATGACCTACAATGCTGCACGTCTGC 
AGGAGCAGGGTGTGCCCTTCCAGAAGGAGGCGGCC AT GGCC AAGT ACT ACGCCTCCGAGGTGGCCCAGAGGGCGGCGATCAAGTGCGTC GAC TGG 
ATGGGAGGCGTTGGCTTCACCCGTGATTTCCCGCAGGAGAAGTACTACCGTGACGTGAAGATCGGAGCCATTTACGAGGGCACCACCAACATGCA 
GCTGAGCACCATTGCCAAGTGCATCAAGAAGGATTATGCGGCCTAAAGACTACCCCGATACTCCTCCTTCCCCCAACATATCCCCCACACAAGCT 
GAGATATCGTACGTCAGATACGTCAAGTGCATTTATGCGTGGGAACCGCTGGAACAATTTTGACGGCCATCCGTTATCGGTTCTCACCATGTTAA 
CGTAGCATTTACCACTTAGTTGATTCTGTTATGCATATTGTATAAAATACAATTAAAATATATACACATATAAAATTTAAATAAAACATATAGTT 
GTATAGTATGGTTAATTCACTAATGACTGCCTCCTCCATCCGTTGTGGTAAATGCGGCCAGCAGAATGTTGATCACTGTGATCATGAAAACTCCC 
ATTACGAGCAGATCGTTGGTTCTTTCGTAACTTCGGTTCCTAAATCGCTTGAGACGTCGCTGCGGAGATTTTTTATTTAGTGAATGGACTAATTG 
AAGCTCTACTCACATACTTTAAAGATCAGCATTATGCCAACTAAAAGCTGCAGCACCAGGGAAAGTATCACCATAATCATGCTGTATATGTACGT 
GGAAGCCTTGTCGTTATAGGTGATCAGGAATCGCAACTGATTCGCATTCGCCGAAAGCAAAGCTATATCCATTAGGCCTACAAAAAGTAGTTTGA 
ATATCAGAGTTACAAGTTGTGGTTAGCGTTCTTTAAACTTTAAGTATTGTAACTTTCGAATAGTGATTTGCATTACGTTCTTACCTTCAGCTACA 
TTTTTGTTGGCAGCATAGCTATTCTCTGTCGAAAGATCGCTGGAACATTTCTTGTTCGATTTTTTGGATTCTTGTAATTCACTGGTGGCCCTGGA 
TGGGAACTGGTCGTCTTTTATATGGCTTTCATGCACCTGAATATCCAAATCCCTGCCAGAACCGCAACATGGTCCACTGGTTGTACTGGCGAAAC 
TTTCCCCGGAGCTGGGGGAATCCTCCAAAGATATTTTCACCTCGCCGCTGTCCATTGTTCGAGTTCCAACCGATTAATAAGCATTTAAATGCTCT 
TCTTTTAAAAAAATGCGAAAAGCTTTTCTTGTTTTTGAGCATGAAAGCCGCAACTGCGGTTTAGTAACTGATAAAAGGGAATGACACTGACCGAA 
CTACCCTAAAAAAAAAAAAAAAAAACTGGGCCATAAGCCGACCTGAAAAGTATCTGATGGGTTGAGATACGCACTAAGCATGTTTGCCTTAAAGT 
ATGCTACACAATCTTATTTAAAATGAAGGTAAA 
(SEQ ID NO; 391) 

Exon: 1001.. 1138 
Exon: 1229.. 1334 
Exon: 4444.-4922 
Exon: 5313.. 6158 
Start ATG: 1103 

Transcript No. : CT12987 
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CAGCCCACTTTTAACGTTGTGGTGTCTCTGGGGATCAGATCCAAAATTCAAAAAAGTCCAGTTTGGCCTCTTGATTAAATCAATTCTTCCACTTC 

ATTCAAGATGATGAACAGCCTAAAGAAATTGCCCGTGCGCATGCTGGCTAACGCTGCACGTCAGCAAAGTGCTGCGATGTCCTCGGCAACTGGTC 

TCCCCCCACCCCTCACCTTTCTCACCGACGATGAGAAAATGATGAAGGAGACTGTTGCCAAACTGGCTCAGGAGCAGATCCAGCCCTTGGTGAAG 

AAAATGGACTTTGAGCACAAATTCGATCCCTCCGTGGTGAAGGCGGTTTTCGAGAACGGCCTGATGGGCATTGAGATCGACACCGAGCTGGGAGG 

CAGTGGCTGCAACTTCATGACCAACATCGTCGTGGTGGAGGAGCTGTCCAAAATCGATCCTGCCGTCGCCGCTTTTGTGGACATTCACAACACAC 

TGGTCAACTCGCTGATGATCAAGTTCGGCAATGCCGAGCAGAAGGCGAAGTATCTGCCTAAGCTGGCCCAGGAGTACGCCGGTAGCTTCGCCCTG 

ACCGAACCCGGTGCAGGATCCGATGCCTTCTCCCTGAAGACCGTCGCCAAGAAGGACGGCTCCCACTACGTAATCAACGGCTCCAAGATGTGGAT 

CTCCAACTCTGATGTGGCGGGAGTCTTCCTGATCTTTGCCAATGCCAAGCCAGAGGATGGCTACCGTGGCATTACCACCTTCATTGTGGACCGTG 

AGACTCCCGGTCTGATCGTGAACAAGCCGGAGGACAAGCTGGGCATCCGTGCCTCTGGAACCTGCCAGCTCACCTTCGACAACGTGCGTGTGCCC 

GAGGAGAACATCCTGGGAACCTTCGGCCATGGCTACAAGTATGCCGCCGGCTTCCTGAACGAGGGTCGCATCGGTATTGCTGCCCAGATGGTGGG 

CCTGGCCCAAGGAACCTTCGACGCGACCATTCCGTACCTGCTGGAGCGCAAGCAGTTCGGTGATGCCATCTATAACTTCCAGTCGATGCAGCATC 

AGATCGCCACCGTGGCCACGGAGATCGAGGCCGCCCGCCTGATGACCTACAATGCTGCACGTCTGCAGGAGCAGGGTGTGCCCTTCCAGAAGGAG 

GCGGCCATGGCCAAGTACTACGCCTCCGAGGTGGCCCAGAGGGCGGCGATCAAGTGCGTCGACTGGATGGGAGGCGTTGGCTTCACCCGTGATTT 

C C C GC AGG AG AAGT AC T AC CGT GAC GTG AAG AT CGG AGC C AT T T ACGAGGGC ACC ACC AAC A T GC AGC T GAGC AC C AT T G C C AA GT GC AT C AAG A 

AGGATTATGCGGCCTAAAGACTACCCCGATACTCCTCCTTCCCCCAACATATCCCCCACACAAGCTGAGATATCGTACGTCAGATACGTCAAGTG 

CATTTATGCGTGGGAACCGCTGGAACAATTTTGACGGCCATCCGTTATCGGTTCTCACCATGTTAACGTAGCATTTACCACTTAGTTGATTCTGT 

T ATGC AT A T TGT AT AAAAT AC AAT T AAAAT AT AT AC AC AT AT AAAAT T T 

(SEQ ID NO: 392) 

Start ATG: 103 

MMNSLKKLPVRHLANAARQQSAAMSSATGL^ 

CNFMTNIVVVEELSKIDPAVAAFVDIHNTLVNSLMIKFGNAEQKAKYLPKLAQEYAGSFALTEPGAGSDAFSLKTVAKKDGSHYVIKGSKMWISN 
SDVAGVFLIFANAKPEDGYRGITTFIVDRETPGLIVNKPEDKLGIRASGTCQLTFDNVRVPEENILGTFGHGYKYAAGFLNEGRIGIAAQMVGLA 
QGTFDATIPYLLERKQFGDAIYNFQSMQHQIATVATEIEAARLMTYNAARLQEQGVPFQKEAAMAKYYASEVAQRAAIKCVDWMGGVGFTRDFPQ 
EKYYRDVKIGAIYEGTTNMQLSTIAKCIKKDYAA* 
(SEQ ID NO: 393) 

Classification: enzyme 



Celera Sequence No. : 142000013384678 

CAGTTGGCTGGTTTTCCTCACACACACACACACAGCATGCATACATACATACATATGTATGTATGTATATGGCGAAAAAAGAAGAAACTCATAAG 
CTGACGCTTACTGAAGCTTCCTTCGTTTCGCCTTCGTCCTTGTGCCATCGTCCTCGTGTATCCTTGGCTTCGGCGGCAGTCGCCCTTGCAGAAGG 
TTACGCCGCTGGGCTCGAGCGGTTGGATGAGGGGGGCGTTGGTTCTGTGGGTGGTGCTCTGGCACGTGTCCCGTTATGTGGTAGCAACTGCGGCA 
GGACTCTGCGGCTTTTGTGGGGCAGTCACAGTGAGAACACTCGTCACAAAACTGAACTATTTTGTTGTCATTGCTGCGAAATGCAATCGAATAAT 
GATCGTAGATAACTCTTTTTTTATAGTATCTTAGAATTGTGTATCTTCTATCTAAGAATATATGAAAGTACTTCAGTACAGTAAGGATCTCGGTT 
TGCTAAAAATCCTGTTCAAAACTTGTAGTCCTTGAGTTCTTCACTTGGAACCCCTAACTAGTTAGCCCCAATTGTTTACAGTGCAGCCGACACTG 
GCGGCGACAAAGGGCTTTGGTTTAGCGTGCTGCCATATGTTTCCGGCAAGTGCTCGCCCACCGCCACCCACTACTCCTTTATGGGCCATGAAAAG 
CGTTGAAATCCGCAATGCCAACAAGTAAACTGCTGGCCAAGCGGGGTTGCCAAGGCATCGGTGATTTGTGGACCCCAAGGCCAAGAATGGAATGA 
GGATGGCACCATGGAACGCTGAAACGATGGCTCTGTAATTTGAGCCGACTTCCTTTCGAGCGGGGGCGACACCTTCTGTCGACGCCGAGCGCCCT 
GATTTATCCGCAGCCCATAAAAGAATATATAAAGCCACACTCGCATTTGTTATACACTGTGTTCCATTTTCATGTTTTTATTGTTTTCGAAAATC 
AGCTAATTTAACTTAGTGTTTGTGTGGGGGTGTGTGTGTGTGTGCGTGTGTTTTTTTTTTTTGTTTTTTGGATTACATTTCGAGGGTTTTCTAAC 
AGTATTACATACAGATGTGCATACAATCAGATTTCTAAACATATATATATATGCGTTATATTTAAGTATTTATAGGTTTAATTAGTTAAGGTATC 
CGAACAGTACAATATCAGCTTAAATAAGTCGAGCCTTTTCTCGCATATACAACTAGATATATGTATATAGATAATGTATATATTTCAATAATAGG 
AATAAATAAATATACAACAATCAGGAGTCGGAGCCAAAGTTATTATTGGAATAGCTGGAGGTCGATCCGAGGCCTTCAATTTGTTTGATTGCCTG 
GAAATTATGTTAATTTTTCGCCGATCATGAGGGGGATGCTGCCAGCAGACAGGGGTTGCAAAAATAAGATACAGATACGAATAAACAATGGGGTG 
GATCTGCCACCATGTACCAATTCTAAATTCCAGCTATTCCTCTAAGAAACTCATATAACTTAGGGTTTTTCTTTTTGCTCCCCTCATAAACTTTA 
AAGTTTTTTCTTTGCTTAAAATATGTACAGACCATATACAAGATAACAAAACAAAAAATATCAGCTAAGAGCAAGTTGAGAATAAATGTACGAAA 
ATCTAAAGCCTTTTTCGAGTGTACGAATATCGTAAATATAAGAAGAAGTTTATCAATATTTGCATAATGCGTGTGTGTTCGTGTTTATGTGTATA 
GTATTTCATTTGGTCGAAGCTCTAAAAAATTAAAATTCATTTCCAAGTTGCTGGGTGCTGTGTGCTGGGTTGGCATTTTGTTTTCGCCTGGCTTC 
GATTTTTAAATTTATGTACAAAATTCGTGCTGCAATTAAACGTAACTCGCTTAAACAACTAATGCGACAAGGTATCTACAAGATATATATATATA 
TATATTCGTATATATAAATATGTGTTTATATTTATTTGGGTATATGTATAGAAATATACGCATTGTGCCAGTGCGAGCTTGTTATTTTCATTATT 
TTCATTCCGCTTTGTTTGCTTTTGTTTATATATGCCCAAGTATTGTTTTGTATCTTTATGCCTGACACGCTCGCTCAAAGCGGCGCAAAATCGTT 
AAAATTTTATCGCCCCGATGAGGAACGTGAGTGTGTATCGGGTATCGGGTATCGGGCTGTATCTTTCGCTATACGCGAGTGTGTTCCTTTCGAAA 
AATCTAGCAGCTAGCCCAACCACCACCAACGCTACCACCGAATTTTCACACTTCGTGGCAGTTTTTCCTCAGGGTGTGGGCCAAACACTTTCGGC 
ACTGAGTGTGTGTTGTTGTAATCAAATATTCCTGATGCGATTTCCGTTTTCGTATCGGTTTGTGGTTTTTCCTCTTCTGCGTTTGTTTTTTCGAA 
ACAAATTGGTTTTTACTATCGCGCATTTGAAATTAATTTTGCTATTTATTAGTAGAGATCTTTTTCTCTTGTTTACGCTTGGACTTTCCCCAAAA 
AGCAAAAGGCCAACCGAAGAAGATCTATCTACTAAAGGCATGTGTTTGTGTTCGTCTGACCTTATCGCGGTTTAACACATATTTTGCTGTTGATA 
TTTTATAATTAATTTTGTTGTTGCTTTTTTGCCTGCTTTCGAAAATGCCAGCGAACGCCAGCGCTTTTGATTAGTATTATTATTCTCCTCCTCGC 
TTCTTCTCTCTTTCTCTCACAGTTGGGTTATTTATTAATGAATATTGTTGCTGGCATTTTTGAAATTTGCTTGGATTATTTATTTAGTCGCGTAC 
TAATTCGATTGCAAATGTTTGCATATATGTATGCAAATGATGCGGTTTTCGCATCCAACAGATACATTCCGCTGTGTTGTAGATATTGCTCAGTG 
TTGTTCGGGTTAGTTCTAAGGCTTACGATCCATTCTAATGGTTGATTAAATTGCACTTCATCTTCGGGAGCGAGGTTTGGAAAACTTGCGCTATA 
GGAGTTGTTTATACGGTTGATCATCACCTCAGTTGCTGTCGGTATTTCGCAGATTCTCCAGCGGCGCCACAAGATACATTTGCGTTCCGCACAGC 
CAAAGAAACGCTGCAAAAGAAACGAAAAGAGGAAGAGAACGAATGAAAATGTTATTCGCCTTTCTATGGGTGAAAATACAAAATGATGTAATGAC 
TCT ACT CGGCCGATGT TCATT AAAAGGCC AAGCAAAAC AGAAAAAAAAAAAAC AAC AAC TGAACCAAAACAAAAAAC GAAT AT C AAAAC AAGAAT 
GTTTGTCAAAGTGCAATGAAACGACTACAAGATACTCAGTGTTAGCGATCCAATGGATTTGTAAAAAACACATTCCACATTTCTTTGGAGTAATT 
TGATTGATATTATTGAAAATTATGCTATATATGCTATATATTCATTTGAAATTTGTAAAAGCGACAGTTTTCGTAATGCAAATTAGCTTCTGCTT 
TTCTACACTTAAAGCCGGAATAGCTAATAAAATCCAGATGCTGAAACGGACAGACAGACTTTCATATCTGTCATCATCTCAAAACGCCGAGTTAG 
ACGAGTTCAAAATACATTTAGTGCTCTACGACGACAAAAGGGGTATACAAAAAAACACAGCGAAAAAGTCAAAACTCAGTGTAGTAACGTCTAAA 
ATAAAAGCAAAACGTCTGAAACGTATGAATGACCGACCAAGACCGCTGCCGAAGTGACGAACGACAAACAGAGCGAGATGGGTGGCTGTTTGGGG 
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GCAGAGGGAAAGAGAGAGAGAGAGGGCGAGAGAGTACGTGGAAGAAGATACATTATTTGTGTATAATGACTACTTGAGAAAAGCCAAAGCAGAAA 

AG AAAC AAACAACAC AAACAACACAACAT CGGCAAGAAAAT CGAGGC AAGAAGAAAACGC AAAAC ACGAAAT GT AATT TGT AT AAT TTGAGTCTC 

AATTCCATCAAGTGTGGGAAATGCGCCAAAAAGAAAAATTATATAGCCAACTCATAAGGGAAGTGGAAAGCACCGAAGAACTAGGGTTCAAGACA 

ACTGGCTCCAACAGAAGAGTTTGTTTTCCTAATGTAGCGTAGAGTTTAAAAGGTCACAATGAATCGCTAAAAAGAAAAGCAATCAAATGAGGAAA 

CTTGAACATCAGATCACCAGAATCAGGTGGTAACCAAATCATCTCACCTACTGGCCATCGAAGAGATCACATTCCAACAACAAAAAAAACCATTA 

AAAAACTAGGAATATAATATGTAGATACATATAATCTCAGAGCTAAAAAACAATATTGAGGAACGAAGAAAGTTCCAAAAACGGGAAGTTCGAGT 

TTTGTTAGTTTAACTTAAGTTAAGTTACTCAACAACAGAGGATTCTAGTTATAGATTCTAGTTATAGATTCTAGTTATGAATGACTCCACATATA 

TTTCATAATTGTACGCTTATCAGATACCCTTAACCAATAGAAATCAGTGCAACACACTCACCTTGTGACTAAGCGGAACTCGCCACCGGAGCATT 

GGCAATCTGTACGGGTGGCTTGGCCATGGTTTCCGGGACTGCGGACACACTGGAAACCGCGGATGGGATGGTCAGCAGAGTACTGGCCGCCGAAA 

CGGTAATGGGCGTCTCCTTGGGCTCTGGCCTCAACGCAACCGCTGTGGTCGTTTTGGCCAACTCCGGGGATTTGGCCGCATTGGAACTGAGAACG 

GGTGGTGCCTCCTCGGCGAGAGCGGCGGCCTTGGCCTCGGCGCGAATATTCAAATTGGCCGCCACTTGATCGATGGTCGTGGGTTTCTTGGTCAT 

TGCCCGCGGCGAGGTGGCTGCACTGCATCCAGAGCTGCCGCTTGAGCTGCTGCTGCTGCTGCTGCTTGTGTTGGTGGGGCTCAATGATCCCGGAT 

TGCTGGTGGAAACCTTCGGCTTTTCCTCAGCACGAACTGTCAACGGTTTGCTAATGCTGGCTGCTGTGCTGGCAGCTGCTGGCTGATTGCGGAAT 

GCCAAAGCCGATGGATTGGGTATGCTCCGAACTGAGGCATTCTGTGGTCTTACCAAAGCATTGGCATTCTTGGGTTTGGCGGAACTACTACTGGT 

GCTACCACTGCCGTTTCCACTGCTGGCGGCCTTTATGGAGATCTTCTCCCTGGCCGTTGCCGATTCCGAGAGCATATGCAGATTCTGTAGATCAT 

CGCCGTTGGTTGTACCATTCGAGTTACTGCCAGAAGTGGTCGATGGTCCACCATTGCTGTTTGCCGATATATTCTGTTTGATTTTGTTTAAGGCT 

GCCTCCAGGCTGTTGCTGGTGCTGTTGTTATTGTTATTATTATTATTATTGTTGTTATTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGCCAGC 

GGGCGGTGAGTTGGGCGCCAATTTGGCGTCATTGTTGCTCCTACCCGGGTTAGCTGATAAATCAAGTATATCGGCATCCTTGGCTAATGCTGTTC 

CTGTTCCTGTTCCTGCTCCTGCTCCTGCTCCCGCTGCTGTGGTTACCGATGTGGTTGCTGTTGCTGTTACAGCCGACTGCACTGCTGGCCGAAAC 

TCGCGCAAACTCGTTTTGGACTTGTCCAATGCGGCAGTGCCATTGGTTTTGCCTTGGCAGCCAGCTGCTGGCGGGGATTTCTTTGAATTGGCTTC 

CACCTTTGGGCTCGACTTTGATGGAAACGAGTGCTGTGGAGCGGCTGTGTTGCGCAGATTTATTAATGCAGGCGGTGATGTGCGATAAGCATTTG 

AGTTCGAGGAGTTGCCCTTGTTGTTCCTCTGGCCACTGAGCGCAACTGACAGTGGCGGCGACTTGACAATCGCCGGAATCGTGGTCATCGGCATG 

GGCAGCGGCAGCGGCAACGGCATTGGCATTGTCGGCATTTGGCAGCTCCGCTTGGCCGCACCATGTCCATGTCCTGCATGTTTGGGCAGAGCCGC 

TGGTGGTCCCATCAAGCGGGCACTGGACGTTGCCGGCGGGCTAAGAGGTGGCATCGTTAGACCCTTGCCTCCTGCACTCTGCTTATTCACCGGCA 

ATCGCACTATTTCCAGTGCAGGCACATGGATATTTGGCTCCGAGAAGGAATAAATCGACGGCGTAGCTCTGGGCCTTGGCGTAATCGTCACTTGG 

CTCGAACTGCTGGCCGGGCACAACATTGGCAACGTTGGCTTGCTGCCCAGGTCGGGTAGACCATAGGATCGCATCTGGGGATTGGGCAGAAAGGC 

GGCCGCTTGACTTTTGGCCTGGTTGCTAGTCTGCATATGCAACTTGGCCGCATAACTGCTGGCCATCGCCTGAGCCTGGCTCAATCTCGCCTGCT 

TGGCCGGATGCGATTCGTCCAGGGACATGGATCGCTTTGGCATCTTGTTAGCCGCACTGGCAATCGAACTGGGTGTGGACTGGTAGCGTTGCATG 

TGATACATATTCCGGTGGGTTGGAGCCTGTGGCACCGCAAAGTGTCCACTGTTGGGCTTACGATTATTGTTGCTACTCAGCGGCGGTTTGGCGGG 

ATTCGTTTTAGCCAGACTTTGCTGTAGTACAGCTTGAGGTTGCTGTGGCATTTGCTGTTGCTGCTGCTTGGGTTTTCCGCCGGCCACCGTATGTG 

GAGCAGTTGGCGTCGTGCTGGGTGTATAGGCTTTGAATAAACTGGGCGGCAGAGGTGGCGATGTGGGCAAGGTTAGGTCCTTTATTTTTGGCAGC 

ACCACAAACTGTTGCTGTTGCTGCTGCTGCTGCTGTTGTTGCTGCTGTTGTTGTGGTTGTTGTTGTTGCTGCTGCTGCTGCTGCTGCTGTTCTGG 

TTCGGCGGTGGGCGTGGGCGGCGGTGTGGGTGCTCCACCCACCACCAGGCTGTTGATACTCCGCGCCTCCTCGCCCAATGTGCGATTCAGTGCCT 

CGGAAGTGCTAGTACTGTCCAGTTTCAACTTCTGCAGATCCAACTCTCCACTGGCCAGTGAATGCGACCTCTTGAAATCCAACTTATGATTGGGT 

TTTGCCGTTATCTTAACCTTCATTTTCCCATCGGTTTGCGAACTGATCTCCGCGTGCAACTTCTTCCGCTTGCCATTCGCATCCTTTGGCTCCTT 

GGAGTGCTTCTTCTTGCGCTTGCGATGGGAACTGGACGTGCTGGTCGATGTACTGGCATCGGCTGTGCCCAGACTGCTGCTGGATCCATTGGTCG 

AACTGGAGCACGAAGACATGGGCGAGGCATTGGGACTGTATTTGCTATCGGGATTGGATGCACTTTGCTGCAAAGGCTGCTCAATGGGCTTCAAG 

CCAATATTCTTGGCGTAGGTCTCCAGATCGAGTTGTTGGGCTGAGCAAGATAAGGAGCAGGAGGTATCCTCCAGTCGCGGACCAGCCGATAATGG 

CATACTCAGCTTGCTCTTCATGCTGTTCAGTTCGATCTTCAGTTTGGGCAAATGGCCAACTCCCTTTCGATTCGCTGTCTTCAGCTCAAAATCCT 

CGGCATCAACGTAGGACTTTCGGAGTTGCGAGGGTGGAATGCTGACCACTATATTAGTATTGGCCTCCAGCGGCTGTTGTTCCCGTTCTCTTGGC 

AGAAGGAACTGTTCTTGTTCCGGTTCACTTTTAACTTTGGACACGGCGTAATCACTGTACCGCATGTCATTGCTTCGCAGACTTTTGAAGTTGCT 

CACCGGACTTTCAGGTTCCGCTTCCCGTTTACTCGACGTCAATCGTTTGTTTTTCGACGAGGGAGAAACTTTTGGCGGAGTCTTACACCGTACTT 

TGGAACGCGGAGAACCGGAGTCTATTTCCGATTCCAGGTCCTCGGCAAAGGTCACCGATTTGGCCGACTTGGAGCGACGTACCGAATGCTCTGGT 

TCAATGTCCTGGTTAATCCTGGATAGGTTGTTCTCATCGTTTTTTGTTTGCTCGTTGTCGTAAAGCAGAATCCGGTAGCAGAAGGCCATGGGCGA 

TTCCTAAATGGATTCGGAGAGGGGAGAAATAGATGGAAAAAAGGGCAGGTTGACAAGGGTGGAAAATTGAAAGTGGACATTGAACGCTAAACTGC 

CGGCACAATTGGATTGGGTTTCCCCTTGCGAAAATCTATCGGGTTATTGGGATAAACCGCAAAAAGGGACAATATACCGCCCGCCTGAAGAGAAG 

AAAAAAGGGCGTATATAACACTTAAAACTGACCGATAATGATAAGATGGATATGTTTTCAAGGTCTAATTTGGCAGTCGGATGACCGCAGAATCA 

AGTGAGAAGTTTCAACCCTGTTTTCTCTCCATATTGTTACGTAGTTTTTTTGTTTTACGGAATCAAAAAGTTACCCTTATGAAGAATACATTTCA 

ATACACTAATTGAGAACTAAAATACTAGCATAAGTACTTTTTAAAGTTAATTTAAAGATTTTACAATTGCATTGTCCCTTCATTGCAGTTAAATC 

CCTTTGCATTTTTAACCCATTGTCATTATAAGAATGCACTAATATTTCTTATTAAGTAAAACAAACTTACCCGGCTCCAGTTGTAACAGTAGGCA 

ACGTCCATCAGGGTGAAATTGGTAGGCAGTACCTCATCCTTGTAGGTCACCTCCACTTCCACCAGACCGTTGTCCGTCTCTATGTTGTACTTGGA 

GCAGAGGAACCTTTTCAGTAACTCCACAGGTAATCCGGCGGCACATTGCAGGTAAATAGTGGGAGGCACCTCACCAGGACCACACTGATGCAGCA 

TGGCGGGATGGTATTCCAGGGATAAGCTAATTGATATAAACGAAAAGTAAAGCAAAAAGTTTATTAGTAAAATTAGTTTTACACGTTTAAGCATT 

GTTAGCATTTAGCTGAAACCTACCTTATAAGTTCCGTGGTTGTGAAGAACTCCGGTTCATCAGTCGTTTGTTCATCCACCAAATCGTGCTGCTCC 

TTGAAATCTGCCAGCTCCTTGCACTCCCTTTGATAGAGTCCCGGCACTAGCTTATAGATTAGCGATCGCAACGTATCATCCGATTTCAGATTCAA 

TTCATTGATTTCCTTGCCTCCGCTGGCTTTACATTCCGGACAATATACCGCTCGCAGCAGGTGCTTTAGTATGCAACTCCGGCAATCTGTTGAGA 

AATGATAGATCAAACCGAATAACAATATGTTTAATGATCAAGGAAAATGTGTATTTTAGCTTAAGAACTACCAGAAGCTGTGATTACATTGTTTA 

GTAATGATCTTAAAGTTTAAGATCTAACTTGGGCAGTCAATATTGTAAATCATGACAACCTTAAATTAACTTTATAAATGAACTTTAAAATGGTA 

AAAGTGATCAAACAAATAATGATCAAATTACATGCAATTAATACAAGATATTGGTTTTATGCGTATATTAAAGGTACTCTTTGGAATCTAAGACT 

ACTTACAGGTATGATAACAGTAATCCACAGTGGTTGGATCGATCATATAACCGCGACAAAGTCGGCACGTTATCAGATCATGAAACTGCCGGACA 

TCCCGAGAAGCTGGGGATGCGGCATCCATAGCGGCGTTTGATTCCATTGTGTTGTGCGTGTTTTGCAGGTGCATCTTTCTGCCTATTCTATCTGC 

CTCAATACTTACTCCTAGCAGCCTCTTTCAATGGGGTCCTGGAAGAGGAACAAGGATGGGGGTTAGGACTCGGCATATCCATGTTTGGCATGTGC 

TCGCTCAACAAGTGACACTAGTGTGACAACTGATGATGGCTAATCAACTGTTAGGGTCTTTCCATAATTATGACGAGGGCACAAAGGAAACGGGG 

TAACAGGGGCCAAGTAAAACCTAGGAAAAAGACACACAACTGTGAGAACCAACAGCCAGCACACAATCCCATCAACAAGCACCATCACCATCATC 

GTCATCATCATTGTTGTCGACATTTGTGGTTGTCGCTCGTCAACAAAAAAGCCACAGAGCCAGATATATCAACCGTGTATCTGTATCTTGTCCCA 

AAAGATATGGCAACGTCAGCACACTCGCACAATACAGAAAACTTTAGACTAGTAAGGCAGCCATCCATATCCATGAATCGGAACAATGTGCTTGC 

AGCTCACAAAAATATGAATAAAAAAATAGAATAAAAAACAAAACAGAACACAAAAGTAGCAACTTTGTGGGCGGGTGGAAA 

GGGCACTGGGAGGTGGGGTCTTTGGTCTTTCGTACGCCCTTCGTGCCACAGAGGGCACCTACCGAGAGAAGCCGAGAAGAGCCGAGAGAAGCGCA 

CAGAAAGCGATGTGCACTCGACGGTGTACTTGAAGCTAGTTTTAGCTTGGCGCCCCCCTTTTCTGCCCCATCGCTTCAGACCCCTCTTTATCTAC 

CATAAAACTACGAAAATTACGCTTTTGTGCCCTGCACACCAGTGGAAACTGAATTTTCATCATTATGGCGATGCCCGGACGATGATTGCGAGTTG 

ATGGCAGGCAGCCGAGAGTTCAGGGCAAAGTTCAGGGCATAGTGGATCAGCAGGTTGTTCGGATAGCTCCTAGGAAACCAAGGAAAACTTCAAGT 
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TACTCTTGGGACTATTGTTGAAGTACTTGTGTGTAACAATTAGTGTACCTAAGATTTTGAAATGTAAGTCAATATTCGAAAACCCCTTTTAAAAA 
GTTCCATAAAAGCGTATAACTCAATACAATATTCACTACATATATATTTCTATATATTTTTGACATTATATTAATTACCAACATAAATAATAAAA 
AGAAATGCCTTTAATCGGATCAACAAGATCTGCATTACAAAGTGACTACTTGGGGTACAACTGTTCAACTGTAGAAAGTACAGTAAAACCATCTA 
ATTATCATCAGTATGATAAATAATTGTGCGCAAAGCAATTTCTTTTAGCTGAATGCTTTTTCTTCATTTTCATTAAAGATTCCCGCATTTGCATA 
ATTAGAAACAAAATTGCGTGTAAATGATGATTTCTTCTGGAATAGGCCATCTTTATTCCCCTCAATTCAGTCAGTCGGCAAGTTGAAGTTTGTTG 
AATTGTGACTGAATTTGGCACTGCATTATTCATTATTTAACTAAGTATTCTTCAAATTTAAGCACTTTTAATCCACAGCCACCCCATCTAAACCG 
CCCATCGCATAGCGCAACCAAAAGTCGCATCGCTTCGCCCATTTTTTGTCTCCGTTGCCTTTCGATTATTTTCGTTTGCCGCTCACGCGAAAATG 
GCACGCAAAGAGGAGGGTGGCATAGTTGGGTAAAGGGGAGGCAGAAGAACGGAGGCTGTGCTAGCAAAACAAAAAGCGAACAAGACAAAAACAAA 
AGACCGAGCGGCTTGAACTAAAGTTGGGCAGAAAAGTGGGTGGTTGGCGTTCGTGTGCCTACGCCCACTCAGCTGGAAAATGACAATCCCACGTT 
TGATGACGCTCTTAAATTGATGACGTCGCTGGCGAGCGCTTTTCTTGATTTTCCTTAGCCCGCTTTCTGTTTATACGCCGCAATTTCGTAAAAGG 
CGCGCACTTGCACACCAACCCACACACACACACATGGTCACAGCTTGAAAGTGTCGGCAATCCAATAACAACAATAACAGTTCGACCGGCACGGG 
CGCCCGCAGGGTGGTTGAACACACACACACTCGCCACTTCTCTCCCACTCTCTATCGCTCACACACACACACACGCAACAAGCTGGCTGTCCGAA 
AAATGAACTCACAAAAAAGACGAAAACATGGACGAAAATCTACGTAAAATCGCAACAACTCTGGAGGTACGATTTTTCCGCGGCCGCGTCCAGAG 
TTCAACTGAAATCCAATTCGAGAAATCCCAACTGCCACTTGCGCGCTCAATTCGCCGTTAGGCGAGAGCGCACGACACACACAAAATTGTATCTA 
TTCGCCGCGGCGAGAGAGACGACAAACAAACTTTCTACTTTTGCGCGGGCGCATTTTCGCCGACTGTCGACGTTTTCGCCGCACGACGAAAATTC 
CAATACGCGGCGAGAATAACACGCACGCACACCAGGGCATTTGTGTTTACAAAATTTTCACCCATACATGTGAGAGGGAAAATTTCATGTTCCCT 
TTCCTTTTTGGTTTTTATTTCCCGCATACGTACTCAAAGCAATTGCGTTGTTACTTGAAGAGTTTTCCTCTGTAGCATATTCCCTTTGGATTTTC 
CATTAAAACAAATCCATTCGCCGACACCGAACACACACGCTTTCGATTACTCCACTGCCTTTTCGTCTTCTTCTTCTGTCACTTCAGCTGCTGTT 
ATCGATTTCTCGTACACGCACAGATTTTCGCTGCTTCTCTGTCACTCACTCTCGCGCTCTCTCCCTCTCGCACTCGTTTGAGCTCGCGCCACGAC 
AAAAATATATTCTCCGTTCCGTTTCACAAAGTGCACGGCACGAAAAGTTTCCACAGACTGATTCAAATTAGACCTGCTGGCGCTGTCGCTGCCCA 
AAATCGCTCGGCTCGCATGAGAGAGCCGCACGAGTGAAACACGAGAGAACCACCACTAACCTATCACATCGCTGCACGAGCGAGATGACAAGAGG 
GGGGCGCTCGGCATCAGCAGGCTGTCTGCGTCGCACTCTCACGAATTGAGGTTACTCTGTAACTGTGTGGGTGGCGTCGTTTCTAATTGAATCCA 
ACTGTCAAATGAGAGAGATTAACGCACATATTCGTATGTATGTGCGTGCAGTGTGCCCGTGTATTGGCCTTGGCGTTTATAATTAACACAAAAAT 
GCCCGTTTTCGCAGATTTTTTCTCCTTTTTTTGTTGGTCTGCCGTTGGCATTTCATT 
(SEQ ID NO; 394) 

Ex on: 11692., 11603 
Exon: 9538.. 9317 
Exon: 9016.. 8764 
Exon: 8671.. 8431 
Exon: 7888.. 4432 
Exon: 3050. .1001 

Start ATG: 9479 (Reverse strand: CAT) 
Transcript No. : CT12997 

AGTTGAACTCTGGACGCGGCCGCGGAAAAATCGTACCTCCAGAGTTGTTGCGATTTTACGTAGATTTTCGTCCATGTTTTCGTCTTTTTTGACCC 
CATTGAAAGAGGCTGCTAGGAGTAAGTATTGAGGCAGATAGAATAGGCAGAAAGATGCACCTGCAAAACACGCACAACACAATGGAATCAAACGC 
CGCTATGGATGCCGCATCCCCAGCTTCTCGGGATGTCCGGCAGTTTCATGATCTGATAACGTGCCGACTTTGTCGCGGTTATATGATCGATCCAA 
CCACTGTGGATTACTGTTATCATACCTATTGCCGGAGTTGCATACTAAAGCACCTGCTGCGAGCGGTATATTGTCCGGAATGTAAAGCCAGCGGA 
GGCAAGGAAATCAATGAATTGAATCTGAAATCGGATGATACGTTGCGATCGCTAATCTATAAGCTAGTGCCGGGACTCTATCAAAGGGAGTGCAA 
GGAGCTGGCAGATTTCAAGGAGCAGCACGATTTGGTGGATGAACAAACGACTGATGAACCGGAGTTCTTCACAACCACGGAACTTATAAGCTTAT 
CCCTGGAATACCATCCCGCCATGCTGCATCAGTGTGGTCCTGGTGAGGTGCCTCCCACTATTTACCTGCAATGTGCCGCCGGATTACCTGTGGAG 
TTACTGAAAAGGTTCCTCTGCTCCAAGTACAACATAGAGACGGACAACGGTCTGGTGGAAGTGGAGGTGACCTACAAGGATGAGGTACTGCCTAC 
CAATTTCACCCTGATGGACGTTGCCTACTGTTACAACTGGAGCCGGGAATCGCCCATGGCCTTCTGCTACCGGATTCTGCTTTACGACAACGAGC 
AAACAAAAAACGATGAGAACAACCTATCCAGGATTAACCAGGACATTGAACCAGAGCATTCGGTACGTCGCTCCAAGTCGGCCAAATCGGTGACC 
TTTGCCGAGGACCTGGAATCGGAAATAGACTCCGGTTCTCCGCGTTCCAAAGTACGGTGTAAGACTCCGCCAAAAGTTTCTCCCTCGTCGAAAAA 
CAAACGATTGACGTCGAGTAAACGGGAAGCGGAACCTGAAAGTCCGGTGAGCAACTTCAAAAGTCTGCGAAGCAATGACATGCGGTACAGTGATT 
ACGCCGTGTCCAAAGTTAAAAGTGAACCGGAACAAGAACAGTTCCTTCTGCCAAGAGAACGGGAACAACAGCCGCTGGAGGCCAATACTAATATA 
GTGGTCAGCATTCCACCCTCGCAACTCCGAAAGTCCTACGTTGATGCCGAGGATTTTGAGCTGAAGACAGCGAATCGAAAGGGAGTTGGCCATTT 
GCCCAAACTGAAGATCGAACTGAACAGCATGAAGAGCAAGCTGAGTATGCCATTATCGGCTGGTCCGCGACTGGAGGATACCTCCTGCTCCTTAT 
CTTGCTCAGCCCAACAACTCGATCTGGAGACCTACGCCAAGAATATTGGCTTGAAGCCCATTGAGCAGCCTTTGCAGCAAAGTGCATCCAATCCC 
GATAGCAAATACAGTCCCAATGCCTCGCCCATGTCTTCGTGCTCCAGTTCGACCAATGGATCCAGCAGCAGTCTGGGCACAGCCGATGCCAGTAC 
ATCGACCAGCACGTCCAGTTCCCATCGCAAGCGCAAGAAGAAGCACTCCAAGGAGCCAAAGGATGCGAATGGCAAGCGGAAGAAGTTGCACGCGG 
AGATCAGTTCGCAAACCGATGGGAAAATGAAGGTTAAGATAACGGCAAAACCCAATCATAAGTTGGATTTCAAGAGGTCGCATTCACTGGCCAGT 
GGAGAGTTGGATCTGCAGAAGTTGAAACTGGACAGTACTAGCACTTCCGAGGCACTGAATCGCACATTGGGCGAGGAGGCGCGGAGTATCAACAG 
CCTGGTGGTGGGTGGAGCACCCACACCGCCGCCCACGCCCACCGCCGAACCAGAACAGCAGCAGCAGCAGCAGCAGCAACAACAACAACCACAAC 
AACAGCAGCAACAACAGCAGCAGCAGCAGCAACAGCAACAGTTTGTGGTGCTGCCAAAAATAAAGGACCTAACCTTGCCCACATCGCCACCTCTG 
CCGCCCAGTTTATTCAAAGCCTATACACCCAGCACGACGCCAACTGCTCCACATACGGTGGCCGGCGGAAAACCCAAGCAGCAGCAACAGCAAAT 
GCCACAGCAACCTCAAGCTGTACTACAGCAAAGTCTGGCTAAAACGAATCCCGCCAAACCGCCGCTGAGTAGCAACAATAATCGTAAGCCCAACA 
GTGGACACTTTGCGGTGCCACAGGCTCCAACCCACCGGAATATGTATCACATGCAACGCTACCAGTCCACACCCAGTTCGATTGCCAGTGCGGCT 
AACAAGATGCCAAAGCGATCCATGTCCCTGGACGAATCGCATCCGGCCAAGCAGGCGAGATTGAGCCAGGCTCAGGCGATGGCCAGCAGTTATGC 
GGCCAAGTTGCATATGCAGACTAGCAACCAGGCCAAAAGTCAAGCGGCCGCCTTTCTGCCCAATCCCCAGATGCGATCCTATGGTCTACCCGACC 
TGGGCAGCAAGCCAACGTTGCCAATGTTGTGCCCGGCCAGCAGTTCGAGCCAAGTGACGATTACGCCAAGGCCCAGAGCTACGCCGTCGATTTAT 
TCCTTCTCGGAGCCAAATATCCATGTGCCTGCACTGGAAATAGTGCGATTGCCGGTGAATAAGCAGAGTGCAGGAGGCAAGGGTCTAACGATGCC 
ACCTCTTAGCCCGCCGGCAACGTCCAGTGCCCGCTTGATGGGACCACCAGCGGCTCTGCCCAAACATGCAGGACATGGACATGGTGCGGCCAAGC 
GGAGCTGCCAAATGCCGACAATGCCAATGCCGTTGCCGCTGCCGCTGCCCATGCCGATGACCACGATTCCGGCGATTGTCAAGTCGCCGCCACTG 
TCAGTTGCGCTCAGTGGCCAGAGGAACAACAAGGGCAACTCCTCGAACTCAAATGCTTATCGCACATCACCGCCTGCATTAATAAATCTGCGCAA 
CACAGCCGCTCCACAGCACTCGTTTCCATCAAAGTCGAGCCCAAAGGTGGAAGCCAATTCAAAGAAATCCCCGCCAGCAGCTGGCTGCCAAGGCA 
AAACCAATGGCACTGCCGCATTGGACAAGTCCAAAACGAGTTTGCGCGAGTTTCGGCCAGCAGTGCAGTCGGCTGTAACAGCAACAGCAACCACA 
TCGGTAACCACAGCAGCGGGAGCAGGAGCAGGAGCAGGAACAGGAACAGGAACAGCATTAGCCAAGGATGCCGATATACTTGATTTATCAGCTAA 
CCCGGGTAGGAGCAACAATGACGCCAAATTGGCGCCCAACTCACCGCCCGCTGGCAACAACAACAACAACAACAACAACAACAACAATAACAACA 
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ATAATAATAATAATAACAATAACAACAGCACCAGCAACAGCCTGGAGGCAGCCTTAAACAAAATCAAACAGAATATATCGGCAAACAGCAATGGT 

GGACCATCGACCACTTCTGGCAGTAACTCGAATGGTACAACCAACGGCGATGATCTACAGAATCTGCATATGCTCTCGGAATCGGCAACGGCCAG 

GGAGAAGATCTCCATAAAGGCCGCCAGCAGTGGAAACGGCAGTGGTAGCACCAGTAGTAGTTCCGCCAAACCCAAGAATGCCAATGCTTTGGTAA 

GACCACAGAATGCCTCAGTTCGGAGCATACCCAATCCATCGGCTTTGGCATTCCGCAATCAGCCAGCAGCTGCCAGCACAGCAGCCAGCATTAGC 

AAACCGTTGACAGTTCGTGCTGAGGAAAAGCCGAAGGTTTCCACCAGCAATCCGGGATCATTGAGCCCCACCAACACAAGCAGCAGCAGCAGCAG 

CAGCTCAAGCGGCAGCTCTGGATGCAGTGCAGCCACCTCGCCGCGGGCAATGACCAAGAAACCCACGACCATCGATCAAGTGGCGGCCAATTTGA 

ATATTCGCGCCGAGGCCAAGGCCGCCGCTCTCGCCGAGGAGGCACCACCCGTTCTCAGTTCCAATGCGGCCAAATCCCCGGAGTTGGCCAAAACG 

ACCACAGCGGTTGCGTTGAGGCCAGAGCCCAAGGAGACGCCCATTACCGTTTCGGCGGCCAGTACTCTGCTGACCATCCCATCCGCGGTTTCCAG 

TGTGTCCGCAGTCCCGGAAACCATGGCCAAGCCACCCGTACAGATTGCCAATGCTCCGGTGGCGAGTTCCGCTTAGTCACAAGCGTTTCTTTGGC 

TGTGCGGAACGCAAATGTATCTTGTGGCGCCGCTGGAGAATCTGCGAAATACCGACAGCAACTGAGGTGATGATCAACCGTATAAACAACTCCTA 

TAGCGCAAGTTTTCCAAACCTCGCTCCCGAAGATGAAGTGCAATTTAATCAACCATTAGAATGGATCGTAAGCCTTAGAACTAACCCGAACAACA 

CTGAGCAATATCTACAACACAGCGGAATGTATCTGTTGGATGCGAAAACCGCATCATTTGCATACATATATGCAAACATTTGCAATCGAATTAGT 

ACGCGACTAAATAAATAATCCAAGCAAATTTCAAAAATGCCAGCAACAATATTCATTAATAAATAACCCAACTGTGAGAGAAAGAGAGAAGAAGC 

GAGGAGGAGAATAATAATACTAATCAAAAGCGCTGGCGTTCGCTGGCATTTTCGAAAGCAGGCAAAAAAGCAACAACAAAATTAATTATAAAATA 

TCAACAGCAAAATATGTGTTAAACCGCGATAAGGTCAGACGAACACAAACACATGCCTTTAGTAGATAGATCTTCTTCGGTTGGCCTTTTGCTTT 

TTGGGGAAAGTCCAAGCGTAAACAAGAGAAAAAGATCTCTACTAATAAATAGCAAAATTAATTTCAAATGCGCGATAGTAAAAACCAATTTGTTT 

CGAAAAAACAAACGCAGAAGAGGAAAAACCACAAACCGATACGAAAACGGAAATCGCATCAGGAATATTTGATTACAACAACACACACTCAGTGC 

CGAAAGTGTTTGGCCCACACCCTGAGGAAAAACTGCCACGAAGTGTGAAAATTCGGTGGTAGCGTTGGTGGTGGTTGGGCTAGCTGCTAGATTTT 

TCGAAAGGAACACACTCGCGTATAGCGAAAGATACAGCCCGATACCCGATACCCGATACACACTCACGTTCCTCATCGGGGCGATAAAATTTTAA 

CGATTTTGCGCCGCTTTGAGCGAGCGTGTCAGGCATAAAGATACAAAACAATACTTGGGCATATATAAACAAAAGCAAACAAAGCGGAATGAAAA 

TAATGAAAATAACAAGCTCGCACTGGCACAATGCGTATATTTCTATACATATACCCAAATAAATATAAACACATATTTATATATACGAATATATA 

TATATATATATCTTGTAGATACCTTGTCGCATTAGTTGTTTAAGCGAGTTACGTTTAATTGCAGCACGAATTTTGTACATAAATTTAAAAATCGA 

AGCCAGGCGAAAACAAAATGCCAACCCAGCACACAGCACCCAGCAACTTGGAAATGAATTTTAATTTTTTAGAGCTTCGACCAAATGAAATACTA 

TACACATAAACACGAACACACACGCATTATGCAAATATTGATAAACTTCTTCTTATATTTACGATATTCGTACACTCGAAAAAGGCTTTAGATTT 

TCGTACATTTATTCTCAACTTGCTCTTAGCTGATATTTTTTGTTTTGTTATCTTGTATATGGTCTGTACATATTTTAAGCAAAGAAAAAACTTTA 

AAGTTTATGAGGGGAGCAAAAAGAAAAACCCTAAGTTATATGAGTTTCTTAGAGGAATAGCTGGAATTTAGAATTGGTACATGGTGGCAGATCCA 

CCCCATTGTTTATTCGTATCTGTATCTTATTTTTGCAACCCCTGTCTGCTGGCAGCATCCCCCTCATGATCGGCGAAAAATTAACATAATTTCCA 

GGCAATCAAACAAATTGAAGGCCTCGGATCGACCTCCAGCTATTCCAATAATAACTTTGGCTCCGACTCCTGATTGTTGTATATTTATTTATTCC 

TATTATTGAAATATATACATTATCTATATACATATATCTAGTTGTATATGCGAGAAAAGGCTCGACTTATTTAAGCTGATATTGTACTGTTCGGA 

TACCTTAACTAATTAAACCTATAAATACTTAAATATAACGCATATATATATATGTTTAGAAATCTGATTGTATGCACATCTGTATGTAATACTGT 

TAGAAAACCCTCGAAATGTAATCCAAAAAACAAAAAAAAAAAA 

(SEQ ID NO: 395) 

Start ATG: 150 (Reverse strand: CAT) 

MHLQNTHNTMESNAAMDAASPASRDVRQFHDLITCRLCRGYMIDPTTVDYCYHTYCRSCILKHLLRAVYCPECKASGGKEINELNLKSDDTLRSL 
IYKLVPGLYQRECKELADFKEQHDLVDEQTTDEPEFFTTTELISLSLEYHPAMLHQCGPGEVPPTIYLQCAAGLPVELLKRFLCSKYNIETDNGL 
VEVEVTYKDEVLPTNFTLMDVAYCYNWSRESPMAFCYRILLYDNEQTKNDENKLSRINQDIEPEHSVRRSKSAKSVTFAEDLESEIDSGSPRSKV 
RCKTPPKVSPSSKNKRLTSSKREAEPESPVSNFKSLRSNDMRYSDYAVSKVKSEPEQEQFLLPREREQQPLEANTNIVVSIPPSQLRKSYVDAED 
FELKTANRKGVGHLPKLKIELNSMKSKLSMPLSAGPRLEDTSCSLSCSAQQLDLETYAKNIGLKPIEQPLQQSASNPDSKYSPNASPMSSCSSST 
NGSSSSLGTADASTSTSTSSSHRKRKKKHSKEPKDANGKRKKLHAEISSQTDGKMKVKITAKPNHKLDFKRSHSLASGELDLQKLKLDSTSTSEA 
LNRTLGEEARSINSLVVGGAPTPPPTPTAEPEQQQQQQQQQQQPQQQQQQQQQQQQQQQFVVLPKIKDLTLPTSPPLPPSLFKAYTPSTTPTAPH 
TVAGGKPKQQQQQMPQQPQAVLQQSLAKTNPAKPPLSSNNNRKPNSGHFAVPQAPTHRNMYHMQRYQSTPSSIASAANKMPKRSMSLDESHPAKQ 
ARLSQAQAMASSYAAKLHMQTSNQAKSQAAAFLPNPQMRSYGLPDLGSKPTLPMLCPASSSSQVTITPRPRATPSIYSFSEPNIHVPALEIVRLP 
VNKQSAGGKGLTiyrPPLSPPATSSARLMGPPAALPKHAGHGHGAAKRSCQMPTMPMPLPLPLPMPMTTIPAIVKSPPLSVALSGQRNNKGMSSNSN 
AYRTSPPALINLRNTAAPQHSFPSKSSPKVEANSKKSPPAAGCQGKTNGTAALDKSKTSLREFRPAVQSAVTATATTSVTTAAGAGAGAGTGTGT 
ALAKDADILDLSAKPGRSNNDAKLAPMSPPAGKNNNNNNNNNNNMNNNNNNNNWSTSNSLEAALNKIKQNrSANSNGGPSTTSGSNSWGTTNGDD 
LQNLHMLSESATAREKISIKAASSGNGSGSTSS3SAKPKNANALVRPQNASVRSIPNPSALAFRNQPAAASTAASISKPLTVRAEEKPKVSTSNP 
GSLSPTNTSSSSSSSSSGSSGCSAATSPRAMTKKPTTIDQVAANLNIRAEAKAAALAEEAPPVLSSNAAKSPELAKTTTAVALRPEPKETPITVS 
AASTLLTIPSAVSSVSAVPETMAKPPVQIANAPVASSA* 
(SEQ ID NO: 396) 

Name: SUPPRESSOR 2 OF ZESTE PROTEIN 
Classification : transcriptional" actor 
Gene Symbol: Su(z)2 
FlyBase ID: FBgnO008654 



Celera Sequence No. : 142000013384592 

CTTGACCAAAAACAAAGTGGCCAAGGTGGCTTGCTTCGCTGAGTGGGTGGATGGTGCAGCAGGGCGCAGATTTGCCGACACTTTTTACTACGCGA 
CTTTCGAACTGGAGAAAGCGGCGCGACTCGACGCGAGAAAACGAGAAGCGCGAACGGAGAGACGAGACGAGAGATTTATTTGGCTGCGCGGCTTC 
GTCTCTTCGGCTTTTTGGCGCTCACAATTATTGCACCACCACAGGCACTCGCTCACGTACATACATTATACATATACACAGATATCTGTATTGGA 
TTTTTTTTCCAAAGTTTGGATTTTGATGATATCCTGCGGGGTATACAGCGATTTTCGAACTGGCTTTGAAACTTGAAACTAAACTTTATGGCCAG 
CCAAAAGCACTTTAAGCCTTTTAAGTGTGAAAATTAATATTGCACTTTGAGTTGGTACTGGTTTTCCCTTATGTGTTTTATCGGTTTAGCGGATT 
GCATTGGGCAGAATAGAATTTTCGTTTGGACGCGATTTCTGCTCGGAACGCATCCACACTCGCCAGATTCTCGCCACGAACTGAGTGCGCTCTGT 
GGGGCCAGATAGCTGCTCTTGGGGCACGATTCCTTGATACCTCGCCATTCGAACTTGCTTCCTCTCAATTCGAATTTGGCCAAAACCGTTGGGGT 
TGCTTTGAGCCAGTGATGCCAGATGGGTTCGTTACCCTCTGTTAACCGCTTCCACAACAACGCTAGCGGTTCAGAATGTTGGCAGTTTGCAGAGC 
TATCGATATAAAGGTCAAAGTGCAACCGTTCTGAATTGCCGCCCTGTTTTCAAATATAAAGCGGTTTATTTGAATCCTAATGCATTCAACAAGGT 
AGTGTCTAAAATGTCCTACAAATAGTATGTTTTCTGGTTATCTTCAGTCGCATATTTTTGTATACTTGTTTCAAGACATGCAATATTTTATGACC 
ACATTTACCACGTTGCGCACCTAATGATGAATCCCATTGAAAATGAAACATTTGAATTGATTTCAGTTTTATTTACAAAAGTATATATAGTAGCA 
TATGTGCCATAGTATAAGACCACATATACTTACACACACACACACATACATAACCACATAGGAACAGCCATACACAAAATCATAGAATGCCCTTA 
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GGATATAAGTCATAAAATCATGTATCCAATAAGAACCCAAATTCTGGTGACAATAACAAATCCAGCCTACCCTAATGTAAACATTAACATCCGCT 
GCCGAATCAGTTATTCCCACCAAAGGGTCAAAACCGCTTACGCAGCCCAAGTTCTCGGCTGATCAGTACCCATGAGTGATCCACTCAAGAAGGCA 
CCCAATCCAACAGACATAAACATCCGCAGCCGCATTCGCGACTCCCACCAGAGGGTCGAAACTGCTAACACAGCTTTAAGGGTCAAAACTTTCCA 
CGTAGCTCAAACTCGGCTGATCTATATCCATGCGCAATCCACTCAAGAGAGCGCCTAATTAACGTAAACATAAATGAGAGCCTATGTACGCTCAA 
TATGAAGAAGTAAATAAGCTCAGCCAAGGGTTTGCTAAGCGTTCGCTTTGCAAATTAGATTTAGATTTTATTCATACTTCAATTGTGTAAGACGT 
GAATTTGTCTCCGACATTCGCTTAAGTTTTTGATCAATAAAAATACTTTTTTTTATAAACCAACAAACCGCGAAGTTGTAATTATATTTATTTAC 
TCATTCAATTCAACTTGATTTGTTTAGTTTACGATTTTGCGATCGTTTATTTATTAACATCAAATCATGGATATAAGCGTAGTTCTGAAATTATT 
TAAAATTAAATTCATAATTTTAAAATGATTATTGCTTTTAAAACTACAAATAAAACATAAATAGATTAGTTTTTTTTTTTGTTTGCTTTTCTTTT 
AGTTGTTTTTTGTTTGTTTTTGTTTTCAATCCAGCCTTAATCGTATGAACACTTACATGTTATTTATGCCTAAGCTAACGAATAATAATATTTTT 
CTACCTTGCGTGGTTGCCTTTTGAATGTACATAAAATATTTTAAGCCATATAAAGAATTGTACACTTATATAATAATATATTATATGCGTTTCAG 
TTACGGTTAATTCTTGTTTTTGTGCGTTTTGTATTGTTTTGTTTTATGTACAGAATTTATTTGCTTAGCAAATTATGGCAAATTAGTCTAATTAA 
AAAAATTTTACAATGATGTATATATGAATACATATAGTCTGTTCTGGGGTTTCAATTCATGAATATTTAATTCTTTACTTCTTCAATTGCCATTT 
TCTTTTTGTCACTCCATGTCAGCTGTGGACTTGTCTTGTTTGTTGATTCATTGTTAGTAGGATTAATGGTCGAATGAAATGCATTAGTAGAGTTA 
GATAGATTCCCTCGACTTATATGCTAATTTTAGTTGATTAGTCAAGTACTGTGCTCTGGGGCGCTACTGCCCCCTCTTCTGTTTGTCAGTTTTAG 
TTTTGTTAGTTGCTGCTTGGTGATTGGTGATATCAAATGATTGACTGACTGGTTGAAAGCTAAACATGTATGATTGCGAACATATTTTGTTGTTT 
CATTTTTGAGGCACCCAACTTGGTTTGTGAGAGCGTCACACCACTGAATTCTGGATAGTGTTGTGCTCCTTTTCAGCAGGTATTAGTCTAATGGT 
TAATTCAGTTTTGTCTGGTCTGCTATACTATAAGTGTATGTGTATATATATATATATAAGCTTGAGCGCTTTATAGATAGTGTACATTTAAAATT 
ATATTTCTGTTTTTTCTATAAATACACAAAGTTAAATATTAAGCTGAGCGAGGAAAATGATTATATCGCGAGTTGAGAATCAAGAGCGCGTAGAT 
AATTTCAAATTAAATATGTTTGTAGCATAGGAATTAGGCAACTAATAACTTTTCACTTAACAAAAATAATTTAAATATAAGAAACACTTAACGTA 
GACT GT T T G GGC GGGT GT GGT G AGT GT GAG AGG AT GAT GGC C AAG AGC AGT GAGGCG AAT CTCTCAGGGATTGCTTCTACT GC T GC AGCTGGXC A 
CCGAATTGTTGCTGCTGCTCTTGCCGCCCTTGTTACTGGCGGCAGTGTTGCAGCATGTGGTTGTGGAGGAGATGCCGCTCCTGGCCCCCGTCTCA 
TCGCCCTCATCGCCCAGGTACTCGTCCGGCTTGTGACTAGAGGCGGGAGGCGTCAGAAGACTTTCGGGAGCGCGCAGCGCGGTGGCCGGGGAAGC 
TTGAATCCTAGCGCGCATGGCATGCGCCGCATCCTGGGCCATAAGCACTTCGTCCTGAAAGATAGAAGTTTCAATAAGTAACTGACGGAATAAAG 
T AC GC T T TAT AT T C AG AGT G AAT CG AAT AC ATATT TGGACG T ATGC AC ACT TC TGT AT AC AC T TAG AT A T AC T T ACAT AC AT AT C CAT GGT AGT T 
GTGTGCGTTTGGATAATGTGCGAGTCGTCGGTGACAATATTGGGGCAGATTAGTCCCAGGCCAAATTTGTTGCTGACTTGCTCATTCTGCTCGTT 
CAGCTGCAGGTAGGGGGCCTTCTGGGAGATCACTCGGAAGAATGGCTCCATCCAGCGAGCACAAGGCTGAATCACCTGCCAATCCAGACCAGAAC 
AGCGTAAAGCCATCTCCCTAAAAACGAAAAATATTCGGTAAGATAACCATGTCGTATGGGATGCAACTTCCATTCCAACTTACCGACTAAATGTA 
TGACTGATGGCGGCCGCTGCCAGCACCGAGTAGGAATAGTTGGCCATGCCCACGTCCAGGGTGCACAGATCTAGCAGCTGTGAGGTTTGCACAAA 
CTCAAAGCCAGAGAATTGCGGGTAGATAAAGGCATCGTCCGCCTCCGCGCTCTTTTGCCTGCCAATCTGGGAGAACGAGGCCGGAGTGCGGTTGT 
TCACATTCAGTTGCATATAGACGCCCAGCCAGCCGGTAATGGTGATGGGGCTGATGTCCCAGTCGAGCGCCTGCAGCAGAATCTTCTCGTGGTTA 
AGGATGTCCCGCTCTGTGCAGGCGCCATCCGTCACATAAGCAAACTCCCCGATCTTTGGCGGATAAATCTCCTCTACCTTGGCGGCCACAAACAG 
GCAGGTGATGCCGATCAACTGCAAGTGCGTCTTCTGCACCTTATGCGCCACATGCAAATAGCGATCCAGGTAGTCGACGGCCAAGTAGAAGGTCT 
CCCGATGCAGCTTGTAAACTTCACAGACCTCGATCAACCAGTCCAAGAGAATGGCACGCATACGTGGTTGCAGTCCGGGATGCTGTTCCAGCATC 
GAAATGCTGCGCAGACGCGAGTCCTGCTCATCCCGGTGACACATCAGGCGCCAAACATCGGCGGCGTTAGCCCAGGCGAGCGCAGGAAGTGGGCA 
CTGGCGCAGACCGTTCTCCACAGCCGGCGACATGGGCGTCCGCATGTTAATCATCAGGTCACTGACCGGATGATGGATCTGCTCCTGATGTTTTG 
GCAGGCCGGGAGTACGCTCTCCATTCACCGCCTGCTGCTGGCTGCAGCCGGAGGAGGCTGGAGAAATTGTGGAGGAGTAGATCTCCACGTCGTCG 
TCCTCCTCCTCAACGTCATCCTCATCATCCTCGTCGTACGAGTAGTCCTCGCAGCTGTCGTCCAAAAGGTCATCATCCTCCGTCTTCTGCGTGAC 
CACCGTACTGGTCACCACCTGCTCCGCAGCGTAGCGCACCACCACTGGCGTCTGCTTGCTTCCACGATCCGGACTCGGCGGACTGTCCGGGCTGT 
CCGGCAGCGGACTGTGTGGCGCCTCCGACAGATCCTCGGCCGGTGAGCTACGTATGCTGAGCAGCTCCTGGGTACTCTGGCCATCAACCGAGACA 
ACGGGCGAGGTGTAGACCGAGGAGGCAACCGATGACAGATTGCCCTGCTCGCTGCCGTACAAAGCGGGCAGTCGCTGCTGGCGCTTAGCTGATGG 
AGGCTCAAAACCGAGTTCAGGGTCCTGAAAATGAAGGACAGATGTTTGAATAAGTAAATCATACAAGTGAATGCTTTAAATTGAAATGAACTAAG 
ATAGTAGACAATCAAATATAATTACTAAGCTATATTCAGATATTACTTTCTTAAATAGTAGTTCATATTTCTCAAAGGTGTCAGCAATTATTACC 
ATTTCAATAAATTTGCGCTTCTGTTCCAACTTCATAATAACATGAACAAATGAGTGGAGAGGATACTCAAGACAATAACAGGATCACACGACTGG 
CGGAATTAGAAACGTGCTCCGAATTTAATTAGCGAAGGAGGTTTGGGGAAATTTATGTGGGCACGCCTGTCAAAATTGGTAATCCTTGATAGATG 
GACTGCTGGCTGGACCGAATGCACCCGCACAAAATGAGGAGCCAGGACTTCACACAAAAGATCAAGGACTATGGAATGGCATGGAGGTAAGACAA 
TAGCACTTGGTTGGATTTTCTCGTGGGAACGAACGGGTTTCGTTTTGCTCTTTGTTTCTTTTGAATATTTTTGGTATGATCAACATATAGACACG 
ATATCGTTATTGCACTTTTAACTTTAATGTTATAAATTTAGTAGCAGTCTTTGGAAATAATTTTGTTGCGTCGTTTCTCTTTGGTATGATTAAAT 
AATTGTGCAAACAGATTTTTTGGTTATAGCATTTAAACAATTTTGGCTTCGTTTTGTTTTTTTGGTAGTTGTTGTTGTTGTTGGTGGGTGATCTT 
GGCAAATCCGGTTCTTAAATTAGTTCTTTGCACTCGTTCTTGGTGGTTTATATAAATATATTTATCAGTCTTAACATTTGCTACTTGGCAACGAA 
TAATTTTGGAATTTCGAATTTCGTTTCAGTTTGTCTGGGTATGTATTATTATTTGGCTAACTCCGCTCTTGGCTTGATCGAGTACCAATCGCGTT 
GACGTGCGCTGTGCAACTGAGCTCGACGTTCTCCGTTCCAGATCGCCGATCCCCGAATTCCCCGATTACCGATCGTCGTTATCCTGCCCAACGAA 
TGCAACGGTCTCGAAAATAGGGGTTGCTTTTCTCTGCAACAACAACAGCGGTAAAAACAGGATCGCCAAACTAACGTGCCGTCCCTTTCCAACGG 
GGGTTGCTTGCCTAAGATATCCCCGTATATCTTGAGACGACAACTTGTGTTGTTTTTGTCTTTGCAGCAGCATTGCATTTTTCGCCTCTGGGTGT 
TTATCGGCCCAACAACAAATCCCACTGACGGCTGTGGCTCCTGGATCCTTTTTACAATTATTTTTGTCAACATTTTCGTTCAGGTAGACGCAAGA 
CGTTTGGAAAATCACTACCTACATCATAATCCTCAAAATAGCGACACATAATATAGTACATAGGAAGTATATAAATTACGATTTGCTAGCTTTTG 
GCTCGAGCTGCCAGCGGCTGCATTAAGAATCTCCTGCATCTTAAAGGCTACTCAAATAATAAATTATCTTTTGACATTGATCAGAGGGATGGAGG 
C GGT GC C AAAGC C AAAAGT AG AGC G AT C C T AAT G G AAC C T CAT AAG AT T AC AGC AC AAAAAT G AC AAAA AC AAAAAAAC G AAAAT GG AAC C G AAA 
ATCGAAAATCGAAAACAATTGAGATCCTTGACCATTGTTCTTCTTGCAGACAGATGATTGAAAACAGAAGCCAAAATCAGAAAGTCCATAAAGTA 
CAATAAAATGTAGTCAGCATGGGACACAGATGTCTGAGATTGCTGATACCATAAAAAATACATAAATGAACGTATTTCGCTTAAAAAACTATTGG 
GAATGTTTTGAAAGTTATCTCAAGAAACCCATAATTTCTTGGCTTTCTCTTTTTTTTTTAATGCATAGTAAAAGATATTTATTTATAAGATACTG 
AAATATATCAAAAATAGCATTTAAAGCTATTCTGAAAGTTTTGGAAGCTTGGATTTCCATGTTTTAAATGTTTGGCACAATTATTTGCATTTATA 
TATACTTTGAATGAAAAATATACATCAAAAATATTTGAAATTGCCCAATATCTGCACATAAAACTAAGCAATACAAAAGATATAAAATCCACATA 
ATATTCGCATCAGTTTCTCTACAATTTCCTTTGGTTTACGATTGCAATTGATCTCTTCTTTGATTGAAAAAGGTGTGCAGTGCACACCATATAAT 
TTATCAAATACTGACTGCATTTCGTCAACTTTCTTAACAGTCAATCTCACACATCGTAAAAGCTTAACAATATCAACCAACCCTGCTGGCCATTA 
AAGATTTTCACATAATTTCTTGGGCGAAAAATGGGGGGAAATATTGCGATATTTATGTGACAATTTTAATTGTCTGAGTAGGTTTGCCTTCTCTC 
GACAAACATACCGAGCAGAACTCCCCTCCTGAAGTCGAGAATAAATACTTTTTTTTTTTTTTTGTTCAGGCAAGACAGGAAGGTCCAAGACGCAT 
GACAAATTTAATTTCTCATTCTTCTGGCCAACAAACTGCAACAACCCTCCGGAGGAGAAGATGATGTAGGTATAACCGGGAACGGCAATCGTAGA 
ATGAAAATGAAAATGAAAATGGACATGGATATGGACATGGACACATGTCAAGGCACCCAGATGCAATGTAATTAAAATCAAAGCGTGCAATGGCC 
TAAGAAGCCATCTCCCTGGACAATGGAAATTCCGTTCCTCTAGCCTTATTATTTCACTTCATTTTTGCTTTTGGTGCGTGGACATTGCATCTGCC 
GTTTGGATTTGGTCAAGCGTTATGGAATCACAAAGATTATGCACGAAAATTCTCAGTTTACTTAACGACTTGACGGATTCGAAGAGAAAGATCGT 
CGGAGTGAGTGTGAGAAATTAGTGAACTGTAAAGTGGGCTGGAATCTGGACCAATATGGACCCGAAGGGTTGCATTCCAAGTTTAACAAGGGTTA 
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AACTGGGTGGTTGCCCTTAAGCCACCGCCGTCGCCGCTGCCTTCAAGGATTTAAGTCTCAACTCCAAGCAGGCAGCTGCTGCAATCGTCCTTCGC 
ATCTCGATCAGCTCCTTCAAAATCGAAGCCCACAAGGAAGGCACCCAAAACATCTCTCGAAATCTCACTGGGAGGGAGACCATGGGTCGCCGGCG 
ACAGATGATGCAGTTCCTCAAGTTATGGGTACATACAGAAAGCAGACAAGAAGCAATTCGATTGTTAGGCGACAACCTCTGCTGCGTCGCCGACC 
TCAATACTCGATGTCCTGGTGCCCCATTGTGTCGCTCTCCCTCTCCTTTTCGCCTACCCTATAAAGTCCTGCGCTTATGTGTCGTAATCTTTCGA 
CGGCCGTCTGCTCGAAACCGAAACACCTTTAGTTTTCAGACCTTGGCGTGGGTCCAGGTGTAACTTTTTCACCGCTGTTTTCGTGTAATATGCTG 
CATTGTTGTTTCCATTTATTTTTCTTTTCTTACAGTTTTTTTCTTTTTTTTTCAATTCTCCCCCAAGTCCTTTGATAACATTTAATTTCATACTA 
CAATGTGCGGTAGTTGTTAATATTTTTACAGTTTGAAAAGATTATGATTTTCTTGAAATTCAGGATCAAGCTAGAGGTTTTTTCTTCAGATATTT 
CATTTTTCATGAAGGACTCATAAAAGTCTGATGATCGTTCAAAATTAGCTAAAGTCCTGTCTACTTTTTGGGACATCGCTGCTGCTCTGCTTTTA 
ATAAATGATATGCAAACAACATGCCAGTGGACAAATCCCCGAAGGACCATAACAAACACTCATATTTCAAACCAATTCAACTGACCCCTGGTCAG 
TCCAGAAAGGGTTAATCCAACACAACACTTTCGCGGTGACATTGCTGGTTTTATCATCGCGGAAAATGGAAAATTCGGTGGGTCAGGGGGACGTT 
GAAATTGCTTGGACATGTAATTAATCATGCAATTGGTTAGATCCATGACAAAACGAATTAAAGCATATCACATTGGTGAAACGGGAGGAGGGTAA 
GTGGACAAAGGACAAACCGCGAATGAATTTTGAATGCGAAAAATGAATTTTGATTAATTGTCAGGAAAGACAATTTTCTTTAGGGGATATGCCAA 
CTTCTTAGGTGTAAGCCTTTAATAATTGAATATCTCTTGATATATTTACTGCAATTACGTTTTCCCCTGTTTTATCTAATAATTTACCTTTTAAA 
ATTATCTGCTCAAAAACTAGTTTCTTATCAGTCGAATCTTAATGTAATTAATCTGAAAACAGATCGCTGCCCAAGTTTACCATTATCTCTTGTTT 
ACCTCCAGCTAAATAAAATCCTCTGTAAAATGAAAAAAGAAAACTTTGAATTACAAAGCAGGCGACGCATGATCCCAAATCGGGATGCAATCTGG 
GAAAGTGAAACTTGAATTCAGTTTCCCACAATATCCCGATACTGACTTGACAGAAATCAAGACCCACACTTGATTGCATCGAAAGGAAACCCGAG 
CGAAAGCAAACCCAACTCCCCTTTTCGGGACAAAAGAAACAAAGAAGTTTCCCAAAAAAAAAAAATAACAAGGAACTTTCGGCGATTTCATGCCT 
GCCATGTAAGGAGTAAGAAAAAGGGGTTTACATTACACTGACAAATTGGTAAAATAATACCATAATTTTGATCCTGTGCCAGCAGCAAAACAAAT 
ATTGTATAATATTTCAAGACCGCATTCCGATGCAGACGCGTTCTCCGTGTCCCGAAAAGCAGATGACACAAGAGTTTTAAGAAGGAAAAGCCTGA 
TGATTTGCATAAAAGTTTAACGGAAATCTAATGAGCAGACGTCGCAGGATCGGCCAAGGCATTTGCATAAGGATAACATGTCCCGAAACGGGGTA 
GCAAAATGCAACCCAGTTGGAAAGAATTTTTTTTGTTATGCTCCGAATCTTCTGCAGAAGCAAATGTCGTCTCGATGAATATTTAATTAGAGACG 
AAAAGAAAAGGGAAAAACTCGGCCCAGTGATCCTTGAAAAGGATAAGGCGrTGCTCAACCCCTCCGTCGGTTCGTTCGTGTGTTGCGGTTTCAGC 
TCTGTCCCATTCCGGTAGTCACCACGATCCCTTACCCATCACCGTCCCCCCATCCCGATGCCAATCCCTTAAACTGTCTTGATCTAGTTCTTCTG 
CCTCAAGCTAACCTATAACTATCGCAATTTCAAATTCTTACCACGGCCCTAACGGACATTTGCCGACGCCTTGACTTTTGGAAAAACCCACCCCT 
GGAGAACTCGGACTTGGACTCAAGGTGTGCGCGCATGCGCAACGACGCCTTCTGTCCTTGTTCTGCGTCTCCTGCTCCTGCGCCTGCTTCTGGTC 
CGGCTCCTTTGTCCTGGCATGCATATGCTCGGCATACTGCCAAAGACCCCTTCGGGGAATTCCCTTACTCAACAAAGTTCGCCTGAGAGGGGGCC 
AGACATTTCCTTGCCATCCACTTTTTTCGTACCAGAATGACATTTGTTTGGCACGGAAGCAGGCGCAAAAACCATTGCAAAAACAAAAAACTCAA 
AAAAAAAACAGCAAAAAAAATAACCAAGGCAGATATCATTTGTTTGGCAACAGACTGTTTGTATACCCTGGTCACTACATTTTAATTTTACGATA 
TATTCTCAGTACAAACCAAGGCACCTTATAAATAATATAAATTTCTATATTTCCACTTTATCAATAAGATATGCTGAGTTCTAAAACAGTCCTTT 
TAACATGCCTAAGATTTAATATATGAAAACTTATAGATATTCATAATCTTGCAAAGATAAAGATACCCATCTAGAGGGTACCAAACTATCAAACA 
AGAAAACAATTTCTGCAAGTTCATTGCACGTATAGGAGAATATTCTCTCCCCTACATACATATTTCATATGGCACTGGAGAAAAAAATCCTTTGC 
ACCAACACAGCTGCCCATCGTGAGGAGCCGCCAGGAGATGAGAAGGAAGCCATTGTTCTCCGCCTTGATTGCAGCTACTTGAGAGGAGATCTGAC 
TTCAAGGCTTCCAAGGATGCTGCTAAATGAGCCCTTTTCAGTAAGTAGCGTTTATTTTTGGGCTCATCTTTGCATTGTGCCCGCGCAAAAGTGCC 
AAGAAGTAGCTGTCTACGTGCTCAACGGCTTATTTGGCCAAAACTCAAGCAGCGAAAGATCAAATGAAGATCGGCAGCCAAACGAGTGGAAGCCA 
CCAAAGGGCTGTCCATTCAAATGGAAGTACAAATAAAAGATCGTTTGCCAAACATAATAATAAAAATGGAGAATTGTTTAGCGATCGCTGATCAA 
T GAAAAC AAT T GG GAAACTAT AATC AAAT AG AT AAAT AT AT AAAA T GGC T AT T GCT C AAAT T CAAAAAG AAAT ATC ACC T GC ACG GTT C AAAT AT 
GTATGTACATTGTTTAGTTATAATCTCAAATTTACATTATTATTATTAACAGAGTTCACAAGCAATCAGATTATTCAGCTTGGAAATAAATCTTT 
GAGC T T AT AAAC AGC C T AAC C T T CAATT AAT AG ATT AAAT T AAT AG AT T C T T AT TT AGGC T AAAAT GAAAGT GGC AAT C AC T GAT C GAC GAG AAT 
TTAGAGAACAGAGAATTCCCAGAGTTCAAGAGTTCGTTAAATAGAAACATTTAATTAATTTCAAATGCATAGGCACTTTTTGAAACACACTAGAG 
CTAATCATCGGCTTCCTAGAATGCCAGAATGCCTCTTGCCATTTCCCTGATAAGATTGCAAAAATCAACGAGGTCACATGACATGATAAAGCACT 
TCATTCATAAAACTTATCTGCGCTGTGTCTGGCTTTATGTTTAAATTACGATTCGTTTTGGATTACGTACCTCGTTTGAATCGCTGCTTAAACGC 
CGTTTACGTTTATTGCCATTGGAAGTCGCGGCTGTGGATGCTGCAAGATTCTGGCTATCACAAGAGCACATCTCGTCGGTTCTGGAGGCTGCTGA 
GGTTCCTGGTAGCTCCTCTTCACTTCGTTGCGACGCTGAAGAGAACGATGTTGATGGGATTGGATCTGGATGTGGAAGGATAGCGATTGGGCTAT 
CGCTGCTGCTACTGATACTACTGCTGCTGACGACGACGACTATGCTGCTGCTCACTTCTCCGTTGCTGGGGGCGGTTGTGACTATGCTGCCATTT 
GGCTCAGTGCTGCTGGTAGAACAAACACTGCAAAAGAAAGTGAGTGGAATTAGCAGTGGGGTACTAAATAATAATACTAAATAATTTAAATTATA 
TAGTTGAATGTCCCCAAAGAGCAGCTATTTGATAGAATACTTCAACATAGCTGCCTATATTTTTGTAGATTTAAAAGACATGATTGTATCAAAAA 
CCCTTCCATTTAAAGAATCCCCCATATTTTTGTAAACCCAAACCCAAACTCTGGAGCGCAGTTATTTCCAATTTCTCCACTTTTTTTTTTTTTTT 
GCAAAATAGAGAAATTCAATATGCCACGGAGGCAGGGCAGCAAATCACATAGGGGCGTTCTAAAATAGATACAAACGAAACGAACATAAATTCAT 
TGAATTTTGACACGGCATTCGAATTTCGAATCGCGATGCAAGACTCGAGATTAAGGCGCCCAACGGATATATTAATATTTAACATCCGGGAAATT 
GAAATCGAGATGACTGATCGGGGGGGGAAATGGAAATGGAAATGGGATCGTTGTGAGGGCCCCGGGGAAAGCCTTTCCTCCTCGAGATTGAGCAA 
TTTTGGCAAATTTGCATGCACATTAGCATAATGCAGGAGGCTGGCAAACGGGAAGAATACAATTTTCCATACTTTCCAAAGGACAGATGTTATAT 
CCTTGCGTCGCAACTCTCAAGTGTTGCGATCCAAGCAACTGCACTTTAAGCAATTCGATGCGCGAATTTTTGCAATTTGAGCGAGGAGCTCAGGA 
GAACGAGAGAAAAAAAAACGAAATCTAACCCTTGGGACCAAAAACCCTTCTTAAGG 

GGGCTGCTGCCCATACATAAATTCAAATGCTTTAGCCATTTCTTTCTTGGCCAGAGGCTCGGCATTGCAGCCTCCTGTTTTGATCCTTGCCCTGC 
ACTATCGATCGGGTTTAATGGTTTAATGAACAAGAAAAAGTAATCACGTAGTAAATTACGAGGAAATGAAATCGCTTTAAAAATGAGTTATGGCG 
AGAAAGGACTAATGGAGCGTACGTGCACGCGGCCTGCAACTTTTTCCCGCTTTCTTTCATTGTACTCAGGGTTTCGGGGTCAGGGGTTTCCAGTC 
ATCTCCGCGGCTTAAGGTTCAAACTGAAATTTGTCATTTCGAGGTGAAAGTTTCCTTTCCTTTGCGGCAAATGCAGATACTTTGTTTCGGTTTCG 
TTTCCCCTTTTCTATTCCGTTTCAAATCCGAATCCCTGTAGGTAGCAAAAGGCTAGGGTGACTATCTCGGAGATGATATGATGAAAAAATTCGAA 
GAAGTATTTCCTGGAATCTGGTCCAGTGGATAATAAGTGGACTATCGAAGGATCGATCTTTCTCACATGATAAATTATAAAGCAACGGCGACTTT 
TCGATTAGAAATTAGTTATTTATGCGTTTAACTTGGTATCAACGAAACAGTTATGTTTCGGAATTTATGTGTTTGATCCTAAACTCTTAAGGTTA 
CAAGAATTTTGTCATTTTTTGAACATTTTTATGTGAGTTCCTTGTACTACTTGTACTACCATGCCCTCAAAAGTATGCAATGGAATCTGGAAAAC 
CTGAACTTCCGTGACATTTTTTTGATAAATTCCGTTATAGATGGCAGGGAAATAGCTTCGCAACACATACCAATAATTATGTCGCACACAAGCTG 
AC C AC GA AAT A T GT C CAT AAAAT GG ACT AT C AC T T AAGT GAT C GC AGTG AT C AGT GGGT GAT GAT GAT GAT GAT GAT GAT GAT GT T G AATGC C AA 
AACGGGGACGCAACTGACAAAACCAAATGACTAGGCAATTATGCAATTATGGTCATGCGCAGGCAGCTACTACAAGCAAAAAGAATGAATATATA 
GTGCAGGGTGAAAACATATTAAACTCAATTAATTTTTGTCCAGGACATGACTGCAAAAGCCACAAAAAGTGTGTGTCTGCAGTGACAGCATTATC 
AAGAGCCGACCAAACAAACCCCCTTCATAACCCAACCCCCTGAATGACCAAACTGACCTGAACCGTCCCACAACTAAACACTTTTCTAATGACAT 
TTTCGCGCGCAATTATATACTAATCACACGCAGGACCTCGGAAAGGTCCTGTAAAGGTATGCAAGAGTGTTGAGGAATGCGGCACTTGGTTATCA 
ACGCTGCCTCGACTTTAACACCTACAAACAGATGGCTTTTTTTTTTTTGTTTTGAAAATTGACAGTGAAAATGCATTTTAGATTATCGCACTTGA 
GTCAGAAGGTTGGAACAGTAGGATCCCGAGTGGTTATAGCTCCTGCGGGTCCAACTTTCCCGCGGCGTATTCGGGCATAACCAGTTGGGCTCTGT 
CGCTTTATAATTACGAAATAGCTATTAGATAAACATCACTTTTGGTTAAGAGAATTAAAGGGAGACGCGCTGCCTACCAGGATAGTGTTGTTAGT 
GGTTAAAAAGGAAGTCGATCGGATGTCGCCACAGGAAGTCGCCGAGCGTTAGAAACAAATGGTAGCTCAGTCCGTTGGGGAGTCAGTCGCCTCTT 
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GGCTTTCATGTGGGGTGCTAATTAGTTTGGTCAGGGCTGCGGAGGAAGTGGAAGTCACGAAAAGTGTTTGATGGGAAAAGGAAATGTCTAAGATC 
ATAAGAGATGTCAAGAAAAACATGCAGGTTCGCTTGAAACTAAGATTTTTACAAACTCAAATCGAACTTGGGGTTTGCAAGAATAATTTACCTCT 
T AAAAC AT CG AAAATT AAATTAAATT ATTT AAAAGT AAGCCAGT AG AATTTAAAATGAATAAAG T AC TCATAAGC TT AC CTCT TGGCAT TTAAAC 
CCATAATTGTACGGCCTTGAGAAGCCGTCAAGGTTCTTTATCCTGAATCTTGTTTTTAAGCTGCTCGTCAAAAGCAGAAATGTTTTGCATGAAAT 
GTACAATTTAAAGAGGACTGCAAGTAAATTAAACAAAATATAAACTTAATAAAATATAATAAACAACATTTAAATAATATTGTAAAGCAATCATT 
AAAAACTAAAAATGGCATATAAACTAATTGGATAAGAGTTTCTAGGATATGGTCATCACATGCGACGCGTTTTCGAGTTTTTACGACTTCACAAA 
ACCTATCAACAGTTAGTAAAACATTATCTGTAGATCCTTGTGGAAAAAACAACATATGATCCTTGAGGGAAACTCACACACAGCATGTGGCCGAA 
AAGTCTCCTTGCATCGAAAACCCCAAACAAATTTGTTTACGTGAGTTTTTCTCGAAAAACATATATATACACCAACTTTTTGCTTTTAGCAAAAC 
GAAGAAAGAAAGTAAATATTTTTCTTTGGGGTGCAAAGTGTGTCTAAGATATTCAAATTACTTAAATAGAAAGATGGGTGGGAAGACCATTTTGG 
AGTTGGGTCATCTTAACAGGTAGTCACGGAATGGTCATCTTTTTATCATAATAGAATTTCCACGAGTCTCGATTGATGCCTTCTGCTGCAGTTTC 
AGCTTCAGCTGCTCCCCTTGAATTTCCTTTACACTTGAAAAACATAACTATACAATTCAAACAACTGAAAGTGCTATTAAATTATAGAAATCAAT 
AGAGTTCTGTTGTTGCAAGTACGATGTATTGTTGATAAATACTTTTTTAACTCCGTAATGCTTGACTTATGAGATAATTTAATTTTTTGAGTGTA 
CCGATGTCAGGTGCATCGCATTCGGCGCATGCGCAACGTCCGCATTTTCTAGTGGTTTACAACGCTTAATTATATCGAGGAGCCGAGTCCTTAGT 
GCCCCTGTTGCACCTGTTGTGGCTAGAGTTGATCGCTGACCTTCTGTTAGAACTGTAAGTATGGGTTCGATGCGATGCGGGGCAGGTGAGTAGAG 
TCGATGGGATTGGAGAAGGGGAGTGGAGTGGCAGCTGTTGAAGATCCTACCCCAACTAAAAGGATTCTGGCAATGTCTCTGCTGTTGCTTTTGTA 
ACTTCAATTGACTTCCCCCTCGCTAATTGTATCCTGTCGCGTTTCCTTTTCCCCTAGCCCTACCCCTCCCTCCCCTCCTCGCTTCCTTTCTTCCT 
TCATTTATTTTCCTTTTTCCGTGTGACATATGGGCCTGGCGTTATATTCGCCACGATTTATGGACCTCCCTTCGGCCTACCATTCAGTTCGACAC 
CTTTTTTTCCTACCCCATATGCGAATGCTTTTTAGCCAACGACTTTCTTTCCCCTACTTTCTTGTTCTTACGCAACGGATTGTCTATTTTCCTCA 
CTTTTTGCCTCGCTTTTCTTTTGCGTTTTCCGGCCTTTTCTCATGCACAATATTTTGCTGTTTCCAACCATCATTTTCACACTAATTGCGGATAT 
GCCTTCTAATTAATTGGGTTCCCTCTGCTTCCCCGAAGTCCTTTTAGAGCATTTTTGCTTTCTTAGTTTTCAACCCCCTAACTATTCTACTTACA 
ACAAAGAAATTTCCTTACACAGAAAAATAAGTCTCTTTGGTCATrTAAGAATATGTTAAAAATCTTAAAACAATTAATAAGATCATCATTTAAAT 
CGTAATCCCAATATAAAGATCTGGGTTTTGTAAATATTCGAGCTCAAAAAACATTATGAAACAATTTGTATCTTTAAGATACAGCAGAAACCTAA 
AAACGCAATAGAATAGACATTTATCTCTCAGTTGTAATCATGTTACACATCTAACATCATTTATAGTCGGATTAATGTAATCTTTTTTGTAAATC 
ACAAAACGTCCTAAAATTAATATTCGTTAATAAGGATGCGATAATATCCTTTAGTTTTACAAAACAGTTGACAGCGGCTCTTGGCAAACAAATGA 
CTGTATTATCGCTAGAGCAATCCAATTCTTGTCCAAATACCAGTGACTAAGATAGATTCCTACCTAATTGCAGGAACTGCCCACACGCAGGCTGC 
GGCTGCAACTTCCTCTAATGAAGATTAAGTGAAGAAGGCTGCGAGTTCGGGCGAAATCCGCGGCGACAGGTATACAAATATGAATAATGCCGCGG 
ATCGAGCGGATAAACTCGGGGGTTTCCTTTGCGCACAGCGCCTAATTAAAGGTGTACGATGCATGAGTGAATCTGTTTACCTGGACGGGGTCCTT 
CAAGGTCATTTAACTGGAAAACCGAGTGTCGAAAATAAAGTCGACCGTATTTTTTTCCCTCATCTTCGGAATATTTGCTTTGTGCGGAAGATTTC 
CCACAAGGAAATGGGACCAAGTCTCGAGATACAATATGAAATGCAACTTGAGGCAATGCAAAGAAGAATGCGCAAAATGCGCAAAGAAGAGCGCA 
AAAAAAATTATATACAAATATGTATGTATCTCGAGTATATGCCAGACTATATACATATGTACATATGTAGAGGGAAAACAGGTCTCGAGTTATCC 
GGCGTCTTCAACTGTCTGGGCCTGCAATTTTTGTGCGTCCTTTTTGGGCTGCATTTTGGGCTCCTCTGCACATGCGGCGAGATACATTCCTGAGA 
TACTCGAGATAAAAATGCACATGCATCTGGGCATTTGTGGCGTTGTCCATGCCAATGCCGCCTCCTCTCCATTGTTGTGGTTTGGCCACCACTCT 
TGCAATTAGTTGGAACTCGGTGGACCGAAAGTCTGCAGGAGTTGTGTAACAACATAGCTACATAATGCTGATAGGGTAAGAGGGTGAAGTTCATT 
AGGAAGATATTTTCTCAAATCGAAAGCCTAATCATATTTGCATAATATGAAACTATAAATAATACTACTAAAAGTTTAAACCAATACACGAAGAT 
TGCTTTATAAGCATCATTGAATTGAACCCGATCTTTGATGCTTCCTTTTCATACTCATGAATGCGTTTAAATTCTAATGGTCAATTGGCCCAAGT 
TTGCTGTGTTGCTTCAGTCGCATATGCCCACCCTACCCAGAACTACAGGGTACTGCAAACGAGAACAGGAAAGTCGATCCTCGCCTGCGCGCATC 
AGTGTATCTGTGAGTCCGAGTATCTGAGTATCTGTGTGCCGGGTATCTAAAGAGCGCCGCCTGGGCAAAACGGCGTGTGAAAGTAAAGTCGGCCC 
AGCCGATGGAGGAGAACAAATCAAAGTTCTTGAATGGAGCGGCAGCGGCGGCGATGGCGACGGCGGCGTTCCAGTTCCAAGAATCGATGGCCTTG 
AAGTAGCCGTTAGGACATCGGGACGCGCTCAATTGACTATTTAGTTTATATTTAGCAGCGTCTAAAAGGATGCCCACAGCGAGTCAGCTAGCTCA 
AGTGGCCTGCGTCCTGTCTCTTCCCCGCTCCACAAATCGCCAGTCGGACGGACTATCCTTTGTTTGTCTGGGAATAACTGGAAACTGTTTATCAT 
TGTTGTTTCTGCTCCTCATACATACGGCTTCACCTCATTCCGATTCCGAATCGGATTCTTCTGCCGCAGCGACAAAAATGTGTATGGACTATAAA 
AAAAAAATGCCCACATATACATACATATGTACATATGAACCATTTATTTAAGGAGTTACTATTCACAACTCATCCTTTTGGAGAACATGTAATAT 
TTGTAAAACATTCCGCACTTGTAATTGAGAAATATTGTTGGTATTTTACTAGGAATTCGGGTTAATATGGCCAGTTGATATGTGTGGATTCCTAT 
AACCATTTACGTATGTATGTACATATGATGAGAAAATATTCGAGATCAGTTTGGGTTCCCATCTCGTCTAGAAAATCCTTTGATGACATACCGCT 
TTCGAAGGTGGCAAGCACTTGGAGGATCATGAGCAGAATTGAAAGGTGTTGACTAGGATTTGAATAGATCATATAAGGAGTGTTTAGGTGGGTCA 
AAC AAAAG AAAAT GT T GG GAAAAT AT AC AAAAT T C AC G AC ACTG AATC AT C AAAAACT T GAT GC T C T T AC T I" T T T T C AAAATC AC TTCT AAGAAA 
AACAATTTATTTCCCCATGGTTTAAACTGGCCAAATCTAAACATAAGCGGCTGATTTTCGGCACGATCATGACATAATCGAAAAATTAAGTATAC 
AAAAATCCACCCCATCAACAATACACCACCCACCCCCCACGCACACCCCCGCCGCGATCGCTCCAAAAGTTTAACGGCGCAAAAAGCGAACACGC 
AAACTAAAAAACGTCAAAAGAATGCAAAAAATAAAGAAGAAAGCAAAAATTGCGAAGAAAACAACAACAAAATGAATGTGGCAGGTAAATGGAAG 
CAGAGGAGAAGAGGAAAGAACGAAGATGGCGCGAAGCAAGGTGGAGACGAAGGGGAAGCGAAAGCAAAACGCCGAGAAATATCAATGAACTCCAT 
TCTCCACTTGGCAGTCTATCTCGCTCACTCATTGGTTGGGAGCACGTACATAGAGTAACTACTCACAAAAATAAATATAACCACGATTTTACAAT 
AATTAAAATACTATTATTAATTCTACATATGTAAACATACATACATATGTATGTACGTAGATTGTGGGTAAGGGCTAAATGCCACTACCAATCTT 
TTTCTTCGTGTACCAAAGATATTTGCGCCACTTTTGACATAATTCATTATTTTGTTCTTCACTCGTTTACTCCATTTTGCTTTGAACCTCTGCTT 
CGTGTTCGTCAAGTTTTCGAAAGGTCCGTCGACTGATCGGCGATTTTCCGAATTTTTTCGACGTCGCAGTCAAATGCTCTCTTTTTTTCCGTTTT 
AGAGCGCAGTACTAAAACGTTTTTGTTTGTTGTTGGTGTTGGGAAGGGGAAAATAACTTGACTCGCTGTTCTGATTGGAATAAAACGGAAGAGAG 
CGAGAGAAAGAGAGCTTACTCTCTCTCTTAGCGAGCGCATTTCTTCCAAGTTTGTGCTTGTGTGTGTGTCATTGGAGGAAGCGTATAAAAATGAA 
ATGAGCATGGGAAAAGAATGGAGAAACGCAAAGTGGCGCGAATTAGGCGTTTGGGCTGGGGCAGTTTTGTTTTACCGCCCAAAATCTATGCGAAT 
GGCGGGAAAAAGTGCAATTCGGGCGGGAGAAAGATTAGAAAGTTGGCCCAAACAGCAGGGTAAACCATAAGCTGCATCTCAAGCCTTCTATAGTA 
AGAAAAGCTGTAATAGTTCACTGAACTCCCGTTACGCATTAGAAACTCAGTGCACTTTTAGACGTGATTAGAAGCTTTCATTTATTTCATAGGAT 
TTTCCAATTTGTTCCCATTGGATAACGGTTTAATTCTAATAGGACACTTTTCCTTATAGGGGGAACACTAAAACACTGTGTGGGGGAGACAATTT 
GTAGTTGGACACCCCGTCTCTTGCCAATCAGAAAGCAACGGTTAAATCCTACAGCGTGTCAATTTTCTTATAAGAGGATGTTGTATTTGTAGCAA 
AAAATGCAAAGACACGACGGCAAACGCAGCATTCTCTCCCGCTCTTTCCTTCACTGTTCCACACACACTTGTCACCGCACACGCGTCATAAATTT 
TATTCCACTCTGAATAACGTATTGCCTTTTATCCACTAAAACCATTGTGCAATTGGTTTTACCTTCTAGTAAGCGGAAAAATGCTTAACATAATA 
TCCCCGGTGTTTTCACTTGTTTTTCTTAAAGGGAATTGAACAACGTTTTTCGAGAGTTTTTCGTCGTCGTCGTCGCCGGCCACGAAAAAACTTCA 
CGAGCAGCGGAGCGGCGGCAGCAAAGTGATCGCTTTGGAACCGAAAAAAAAATTGGTGGAACTGGGTATCAAAAGCAGCGACGTCGACTGCGGCG 
AGTGTGCATGAGGAACGCGCCAAAACAGGCGGATCCGAAGCGCCAAAAAATGCAACAAACACAGGGGGCAACTTTGTTGTACTCTAGGGGAGGGA 
CGGAGATGGGGCGAGCATCTCCAGCGCAAAGAACAGAGCGTGAGAAAGCGAGACGGGGCGATGAAGCCGCTGGTGGCAAGCTTGGCATTTTATTG 
AATGGAGTCTGCAATTGGAGTCCCCAGTCCCCAACAGGCTAGTCAGACTACAATTGGAGGTTCGACGTCCAGTCAGCTGGGTGGTCGAAAAACAG 
CTGACGAGGATGCCAGAAGGGCGGATTTAGTCGGATATATCGTGGTTATATATAAATACCTAAGATTTATTTCTATGTCATATGCCAATTTTAAT 
TCAAATATGACCTTCAGAAAATAAATATTTAAATTTTAATATGAAACTATTTTTTTTAATATATTTTTAAAGCGCTCCAAAATCAATTCAAGGTT 
AAAATAAATTTATCTTGAAGGGTATTCATAAATCTTTAAAATTTTAATTATTTATTTATTCTTCAGAAGCCTGAAATTACATGGTTTAATATGCT 
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TTTCTAGTAGTTTTTATGAACTGCAGCCAAATAACATATGACAAGATGACTGAGGTCGAACTAGGTGGTAACGCTGCTTTGAAACCAAACAAACC 
ACTGCCACTTTGTTGTTTTCATCGATTAACATTCGAGCATTTTTGGCACCAACTTCGAGCGCTTATAAATCAGGTAAGCTAAATGAAAAACCATG 
CGTGCCTATAACCAATTGAAAGATCAAAACTCGCAGAATCTGATTTTTCACTAAAATCGAAAATCCCGTTCAATTCGGCCAAATTAGAATCGATT 
GAACTTGATGATTTCCCGCGCGCGGGAACCCGACGTCGGCGCGCGCTGCTGACAGCGATGACAGCTGCGCGCGCAAAGAGAAGGCGCGGAAGAGA 
GCGCGAGAGGGGCAGGGGAACAGAGAGAGAAAGAGGCAAAAACGTCGACGTCGCTGCTCACGTTTTTTTTCCATTACTTTTCGACTGCTACAATA 
TCGATGGAGAAAACGGGGTCTAAAGGAAATGGCTTAAAAAGCAAGTTCACGAACTTTTGAACCCTGCACCTGCTGCAAATCGAATGGGCCAATTG 
GCCGGCATCACTCACCTGCTGTTGGGCAGGATTCGGGCAAATTCTCAGCGGGATTGCACACTGAAAAACACACGCGGACTGGAATGGTGATGAAA 
AAAATTGCACACACCGCCGCACACATACACACGCACGCCAGCCAGCAAGACAGTATAGCCTCCTTTTTGTCACCTTTTTCGACAGATTTATCGTG 
GTTCTGCGGTCTTTTTCACGGCTAGCTCAACCCCACGGCAGTCAAGGCCAATTCGATGCGATCCCCGCGATTTTTCAGCGACTCCAAAGTAACAC 
AATGATCGCAAGTGGCACTGTGACGAGATCGAAATCGAGAGAGCGATGAAAGAGCAAAAAAAAAGGCCATAATCATCTGGCCACACGGCGCAAAG 
AGAGAGTCGGAGTTTAGTGGCAGCGGCGTTGCCACTACGGCCGATCGCGTTGCGGGATTTTTATTGGCGCGTTATTCAAATTTTGAAAAATAACT 
TCTATGTTAGCCTGACTTTTACTGCTCTGCATATTTAGTTCACTATTTTTCCGGGCTGATCATATCTGGAAATCGGGTATTTAAAATAATAAAAG 
ATGTGCAACGTGCTTAATGTTAAATGGGGTTTTAGAAAAGAAATTGTATTATATAAATTTTGCTTTGATTGTTCGATTAACGGATATTAAATAAT 
TGGATTTTCCGTTGGGAGCGTAATTTGAAACGCATATTTGATACGCTTATATTGTTTTTGTTTTATTGTTATATTCTTGTTTTTATCATGCGACT 
ATCTAGTAATTAAATTCAAACAGAATACTCAAGTAAGAATTTTGTATTTTAGGAAATACTTAAAGTATTTTCACTCGATTTAATTTTTGGCGCAC 
TTTTCAAACAATAACGGATGTCACTCAAATGATTTAATGGTTTTCAACTTTAATGCTCTTTTTTGTTGCTTAACTCACATATTGATTTTGTTTTA 
TTATAGCAAATTTCGTATAATAAATGTTAGAATTACACGAACACTTTACTTTTCGATTTGAATTTGTTTGATTTTAATCCACCTCACTGAATTGC 
AGATTTTTTCGGCGTGCTGTTATTTATCCACAAAATAATAATTTTTAGGCAGTAATAACTAACAATTAGCGAATATTTGTCTTTAGGGAACTTTG 
CTAATTTTCATGTTCATGCTAATGTTAGCTATTGGTTTTGAATTTTATATTTAATATTAAAAAGTTAAATAATTAACTTCTGATAAATATTTTTT 
AATATAGCTTGTATTTTTAGAACTCTTATTAATTCACTCAATCACTTTTGATAATAAATTTCCTCACACAAAC 
(SEQ ID NO: 397} 

Exon: 22253. .21961 
Exon: 15027.. 14900 
Exon: 12092.. 11756 
Exon: 4964.. 3694 
Exon: 3627. .3402 
Exon: 3284.. 1701 
Exon: 1036.. 1001 

Start ATG: 14919 (Reverse strand: CAT) 
Transcript No. : CT13013 

TCACAGTGCCACTTGCGATCATTGTGTTACTTTGGAGTCGCTGAAAAATCGCGGGGATCGCATCGAATTGGCCTTGACTGCCGTGGGGTTGAGCT 
AGCCGTGAAAAAGACCGCAGAACCACGATAAATCTGTCGAAAAAGGTGACAAAAAGGAGGCTATACTGTCTTGCTGGCTGGCGTGCGTGTGTATG 
TGTGCGGCGGTGTGTGCAATTTTTTTCATCACCATTCCAGTCCGCGTGTGTTTTTCAGTGTGCAATCCCGCTGAGAATTTGCCCGAATCCTGCCC 
AACAGCAGTCCTCTTTAAATTGTACATTTCATGCAAAACATTTCTGCTTTTGACGAGCAGCTTAAAAACAAGATTCAGGATAAAGAACCTTGACG 
GCTTCTCAAGGCCGTACAATTATGGGTTTAAATGCCAAGAGTGTTTGTTCTACCAGCAGCACTGAGCCAAATGGCAGCATAGTCACAACCGCCCC 
CAGCAACGGAGAAGTGAGCAGCAGCATAGTCGTCGTCGTCAGCAGCAGTAGTATCAGTAGCAGCAGCGATAGCCCAATCGCTATCCTTCCACATC 
CAGATCCAATCCCATCAACATCGTTCTCTTCAGCGTCGCAACGAAGTGAAGAGGAGCTACCAGGAACCTCAGCAGCCTCCAGAACCGACGAGATG 
TGCTCTTGTGATAGCCAGAATCTTGCAGCATCCACAGCCGCGACTTCCAATGGCAATAAACGTAAACGGCGTTTAAGCAGCGATTCAAACGAGGA 
CCCTGAACTCGGTTTTGAGCCTCCATCAGCTAAGCGCCAGCAGCGACTGCCCGCTTTGTACGGCAGCGAGCAGGGCAATCTGTCATCGGTTGCCT 
CCTCGGTCTACACCTCGCCCGTTGTCTCGGTTGATGGCCAGAGTACCCAGGAGCTGCTCAGCATACGTAGCTCACCGGCCGAGGATCTGTCGGAG 
GCGCCACACAGTCCGCTGCCGGACAGCCCGGACAGTCCGCCGAGTCCGGATCGTGGAAGCAAGCAGACGCCAGTGGTGGTGCGCTACGCTGCGGA 
GCAGGTGGTGACCAGTACGGTGGTCACGCAGAAGACGGAGGATGATGACCTTTTGGACGACAGCTGCGAGGACTACTCGTACGACGAGGATGATG 
AGGATGACGTTGAGGAGGAGGACGACGACGTGGAGATCTACTCCTCCACAATTTCTCCAGCCTCCTCCGGCTGCAGCCAGCAGCAGGCGGTGAAT 
GGAGAGCGTACTCCCGGCCTGCCAAAACATCAGGAGCAGATCCATCATCCGGTCAGTGACCTGATGATTAACATGCGGACGCCCATGTCGCCGGC 
TGTGGAGAACGGTCTGCGCCAGTGCCCACTTCCTGCGCTCGCCTGGGCTAACGCCGCCGATGTTTGGCGCCTGATGTGTCACCGGGATGAGCAGG 
ACTCGCGTCTGCGCAGCATTTCGATGCTGGAACAGCATCCCGGACTGCAACCACGTATGCGTGCCATTCTCTTGGACTGGTTGATCGAGGTCTGT 
GAAGTTTACAAGCTGCATCGGGAGACCTTCTACTTGGCCGTCGACTACCTGGATCGCTATTTGCATGTGGCGCATAAGGTGCAGAAGACGCACTT 
GCAGTTGATCGGCATCACCTGCCTGTTTGTGGCCGCCAAGGTAGAGGAGATTTATCCGCCAAAGATCGGGGAGTTTGCTTATGTGACGGATGGCG 
CCTGCACAGAGCGGGACATCCTTAACCACGAGAAGATTCTGCTGCAGGCGCTCGACTGGGACATCAGCCCCATCACCATTACCGGCTGGCTGGGC 
GTCTATATGCAACTGAATGTGAACAACCGCACTCCGGCCTCGTTCTCCCAGATTGGCAGGCAAAAGAGCGCGGAGGCGGACGATGCCTTTATCTA 
CCCGCAATTCTCTGGCTTTGAGTTTGTGCAAACCTCACAGCTGCTAGATCTGTGCACCCTGGACGTGGGCATGGCCAACTATTCCTACTCGGTGC 
TGGCAGCGGCCGCCATCAGTCATACATTTAGTCGGGAGATGGCTTTACGCTGTTCTGGTCTGGATTGGCAGGTGATTCAGCCTTGTGCTCGCTGG 
ATGGAGCCATTCTTCCGAGTGATCTCCCAGAAGGCCCCCTACCTGCAGCTGAACGAGCAGAATGAGCAAGTCAGCAACAAATTTGGCCTGGGACT 
AATCTGCCCCAATATTGTCACCGACGACTCGCACATTATCCAAACGCACACAACTACCATGGATATGTATGACGAAGTGCTTATGGCCCAGGATG 
CGGCGCATGCCATGCGCGCTAGGATTCAAGCTTCCCCGGCCACCGCGCTGCGCGCTCCCGAAAGTCTTCTGACGCCTCCCGCCTCTAGTCACAAG 
CCGGACGAGTACCTGGGCGATGAGGGCGATGAGACGGGGGCCAGGAGCGGCATCTCCTCCACAACCACATGCTGCAACACTGCCGCCAGTAACAA 
GGGCGGCAAGAGCAGCAGCAACAATTCGGTGACCAGCTGCAGCAGTAGAAGCAATCCCTGAGAGATTCGCCTCACTGCTCTTGGCCATCATCCTC 
TCACACTCACCACACCCGCCCAAACAGTCTACGTTAAGTGTTTCTTATATTTAAATTATTTTTGTTAAGTGAAAAGTTATTAGTTGCCTAATTCC 
TATGCTACAAACATATTTAATTTGAAATTATCTACGCGCTCTTGATTCTCAACTCGCGATATAATCATTTTCCTCGCTCAGCTTAATATTTAACT 
TTGTGTATTTATAGAAAAAACAGAAATATAATTTTAAATGTACACTATCTATAAAGCGCTCAAGCTTATATATATATATATACACATACACTTAT 
AGTATAGCAGACCAGACAAAACTGAATTAACCATTAGACTAATACCTGCTGAAAAGGAGCACAACACTATCCAGAATTCAGTGGTGTGACGCTCT 
CACAAACCAAGTTGGGTGCCTCAAAAATGAAACAACAAAATATGTTCGCAATCATACATGTTTAGCTTTCAACCAGTCAGTCAATCATTTGATAT 
CACCAATCACCAAGCAGCAACTAACAAAACTAAAACTGACAAACAGAAGAGGGGGCAGTAGCGGCCCAGAGCACAGTACTTGACTAATCAACTAA 
AATTAGCATATAAGTCGAGGGAATCTATCTAACTCTACTAATGCATTTCATTCGACCATTAATCCTACTAACAATGAATCAACAAACAAGACAAG 
TCCACAGCTGACATGGAGTGACAAAAAGAAAATGGCAATTGAAGAAGTAAAGAATTAAATATTCATGAATTGAAACCCCAGAACAGACTATATGT 
ATTCATATATACATCATTGTAAAATTTTTTTAATTAGACTAATTTGCCATAATTTGCTAAGCAAATAAATTCTGTACATAAAACAAAACAATACA 
AAACGCACAAAAACAAGAATTAACCGTAACTGAAACGCATATAATATATTATTATATAAGTGTACAATTCTTTATATGGCTTAAAATATTTTATG 
TACATTCAAAAGGCAACCACGCAAGGTAGAAAAATATTATTATTCGTTAGCTTAGGCATAAATAACATGTAAGTGTTCATACGATTAAGGCTGGA 
TTG AAAACAAAAAC AAAC AAAAAAC AACT AAAAG AAAAGC AAAC AAAAAAAAAAACT AATC TAT TT ATGT TTTAT TTGTAGTTT TAAAAGCAATA 



FIGURE SHEET 219 



ATCATTTTAAAATTATGAATTTAATTTTAAATAATTTCAGAACTACGCTTATATCCATGATTTGATGTTAATAAATAAACGATCGCAAAATCGTA 
AACTAAACAAATCAAGTTGAATTGAATGAGTAAATAAATATATACTTTTGTAAATAAAACTGAAATCAATTCAAA 

{SEQ ID NO: 398) 

Start ATG: 402 (Reverse strand: CAT) 

MGLNAKSVCSTSSTEPNGSIVTTAPSNGEVSSSIVVVVSSSSISSSSDSPIAILPHPDPIPSTSFSSASQRSEEELPGTSAASRTDEMCSCDSQN 
LAASTAATSNGKKRKRRLSSDSNEDPELGFEPPSAKRQQRLPALYGSEQGNLSSVASSVYTSPVVSVDGQSTQELLSIRSSPAEDLSEAPHSPLP 
DSPDSPPSPDRGSKQTPVVVRYAAEQVVTSTVVTQKTEDDDLLDDSCEDYSYDEDDEDDVEEEDDDVEIYSSTISPASSGCSQQQAVNGERTPGL 

PKHOEQIHHPVSDLMINMRTPMSPAVENGLRQCPLPALA 

ETFYLAVDYLDRYLHVAHKVQKTHLQLIGITCLFVAAKVEEIYPPKIGEFAYVTDGACTERDILNHEKILLQALDWDISPITITGWLGVYMQLNV 
NNRTPASFSQIGRQKSAEADDAFIYPQFSGFEFVQTSQLLDLCTLDVGMANYSYSVLAAAAISHTFSREMALRCSGLDWQVIQPCARWMEPFFRV 
ISQKAPYLQLWEQNEQVSNKFGLGLICPNIVTDDSHIIQTHTTTMDMYDEVLMAQDAAHAMRARIQASPATALRAPESLLTPPASSHKPDEYLGD 

EGDETGARSGISSTTTCCNTAASNKGGKSSSNNSVTSCSSRSNP* 
(SEQ ID NO: 399) 

Name: cyclin E type II 
Classification : cell_cycle_regulator 
Gene Symbol: CycE 
FlyBase ID: FBgn0010382 

Celera Seauence No. : 142000013384592 

cttgaccaaLacaaagtggccaaggtggcttgcttcgctgagtgggtggatggtgcagcagggcgcagatttgccgacactttttactacgcga 

CTTTCGAACTGGAGAAAGCGGCGCGACTCGACGCGAGAAAACGAGAAGCGCGAACGGAGAGACGAGACGAGAGATTTATTTGGCTGCGCGGCTTC 
GTCTCTTCGGCTTTTTGGCGCTCACAATTATTGCACCACCACAGGCACTCGCTCACGTACATACATTATACATATACACAGATATCTGTATTGGA 
TTTTTTTTCCAAAGTTTGGATTTTGATGATATCCTGCGGGGTATACAGCGATTTTCGAACTGGCTTTGAAACTTGAAACTAAACTTTATGGCCAG 
CCAAAAGCACTTTAAGCCTTTTAAGTGTGAAAATTAATATTGCACTTTGAGTTGGTACTGGTTTTCCCTTATGTGTTTTATCGGTTTAGCGGATT 
GCATTGGGCAGAATAGAATTTTCGTTTGGACGCGATTTCTGCTCGGAACGCATCCACACTCGCCAGATTCTCGCCACGAACTGAGTGCGCTCTGT 
GGGGCCAGATAGCTGCTCTTGGGGCACGATTCCTTGATACCTCGCCATTCGAACTTGCTTCCTCTCAATTCGAATTTGGCCAAAACCGTTGGGGT 
TGCTTTGAGCCAGTGATGCCAGATGGGTTCGTTACCCTCTGTTAACCGCTTCCACAACAACGCTAGCGGTTCAGAATGTTGGCAGTTTGCAGAGC 
TATCGATATAAAGGTCAAAGTGCAACCGTTCTGAATTGCCGCCCTGTTTTCAAATATAAAGCGGTTTATTTGAATCCTAATGCATTCAACAAGGT 
AGTGTCTAAAATGTCCTACAAATAGTATGTTTTCTGGTTATCTTCAGTCGCATATTTTTGTATACTTGTTTCAAGACATGCAATATTTTATGACC 

AC AT T T AC C ACG TT GC GC ACC T AAT GAT GAAT C C C AT TG AAAAT G AAAC AT TT G AAT T GAT T T C AGT T T T AT T T AC AAAAGT AT AT AT AGTAGC A 
TAT GT GC C AT AGT AT AAG AC C AC AT AT ACT T AC AC AC AC AC AC AC AT AC AT AAC C AC AT AGG AAC AGC C AT AC AC AAAATC AT AG AAT GC C C T T A 

GGAT AT AAGT CAT AAAAT CAT GT AT C CAAT AAG AACC C AAAT T C T GG T G AC AAT AAC AAAT CC AGCCT AC C C T AAT GT AAAC AT T AAC AT C C GCT 
GCCGAATCAGTTATTCCCACCAAAGGGTCAAAACCGCTTACGCAGCCCAAGTTCTCGGCTGATCAGTACCCATGAGTGATCCACTCAAGAAGGCA 
CCCAATCCAACAGACATAAACATCCGCAGCCGCATTCGCGACTCCCACCAGAGGGTCGAAACTGCTAACACAGCTTTAAGGGTCAAAACTTTCCA 
CGTAGCTCAAACTCGGCTGATCTATATCCATGCGCAATCCACTCAAGAGAGCGCCTAATTAACGTAAACATAAATGAGAGCCTATGTACGCTCAA 

TAT G AAG AAGT AAAT AAG C T C AGC C AAGG GT T T GC T AAGC GT T CGC T T T GC AAATT AG AT T T AGAT T T TAT T C AT AC T T C AATT GT GT AAG AC GT 

GAATTTGTCTCCGACATTCGCTTAAGTTTTTGATCAATAAAAATACTTTTTTTTATAAACCAACAAACCGCGAAGTTGTAATTATATTTATTTAC 

TCATTCAATTCAACTTGATTTGTTTAGTTTACGATTTTGCGATCGTTTATTTATTAACATCAAATCATGGATATAAGCGTAGTTCTGAAATTATT 

TAAAATT AAATT C AT AATTTTAAAATGATT AT TGCTTTTAAAACT AC AAAT AAAACAT AAAT AGAT T AGT TTTTTTTTTTGTTTGCTTTTCTTTT 

AGTTGTTTTTTGTTTGTTTTTGTTTTCAATCCAGCCTTAATCGTATGAACACTTACATGTTATTTATGCCTAAGCTAACGAATAATAATATTTTT 

CTACCTTGCGTGGTTGCCTTTTGAATGTACATAAAATATTTTAAGCCATATAAAGAATTGTACACTTATATAATAATATATTATATGCGTTTCAG 

TTACGGTTAATTCTTGTTTTTGTGCGTTTTGTATTGTTTTGTTTTATGTACAGAATTTATTTGCTTAGCAAATTATGGCAAATTAGTCTAATTAA 

AAAAATTTTACAATGATGTATATATGAATACATATAGTCTGTTCTGGGGTTTCAATTCATGAATATTTAATTCTTTACTTCTTCAATTGCCATTT 

TCTTTTTGTCACTCCATGTCAGCTGTGGACTTGTCTTGTTTGTTGATTCATTGTTAGTAGGATTAATGGTCGAATGAAATGCATTAGTAGAGTTA 

GATAGATTCCCTCGACTTATATGCTAATTTTAGTTGATTAGTCAAGTACTGTGCTCTGGGCCGCTACTGCCCCCTCTTCTGTTTGTCAGTTTTAG 

TTTTGTTAGTTGCTGCTTGGTGATTGGTGATATCAAATGATTGACTGACTGGTTGAAAGCTAAACATGTATGATTGCGAACATATTTTGTTGTTT 

CATTTTTGAGGCACCCAACTTGGTTTGTGAGAGCGTCACACCACTGAATTCTGGATAGTGTTGTGCTCCTTTTCAGCAGGTATTAGTCTAATGGT 

TAATTCAGTTTTGTCTGGTCTGCTATACTATAAGTGTATGTGTATATATATATATATAAGCTTGAGCGCTTTATAGATAGTGTACATTTAAAATT 

ATATTTCTGTTTTTTCTATAAATACACAAAGTTAAATATTAAGCTGAGCGAGGAAAATGATTATATCGCGAGTTGAGAATCAAGAGCGCGTAGAT 

AATTTCAAATTAAATATGTTTGTAGCATAGGAATTAGGCAACTAATAACTTTTCACTTAACAAAAATAATTTAAATATAAGAAACACTTAACGTA 

GACTGTTTGGGCGGGTGTGGTGAGTGTGAGAGGATGATGGCCAAGAGCAGTGAGGCGAATCTCTCAGGGATTGCTTCTACTGCTGCAGCTGGTCA 

CCGAATTGTTGCTGCTGCTCTTGCCGCCCTTGTTACTGGCGGCAGTGTTGCAGCATGTGGTTGTGGAGGAGATGCCGCTCCTGGCCCCCGTCTCA 

TCGCCCTCATCGCCCAGGTACTCGTCCGGCTTGTGACTAGAGGCGGGAGGCGTCAGAAGACTTTCGGGAGCGCGCAGCGCGGTGGCCGGGGAAGC 

TTGAATCCTAGCGCGCATGGCATGCGCCGCATCCTGGGCCATAAGCACTTCGTCCTGAAAGATAGAAGTTTCAATAAGTAACTGACGGAATAAAG 

TACGCTTTATATTCAGAGTGAATCGAATACATATTTGGACGTATGCACACTTCTGTATACACTTAGATATACTTACATACATATCCATGGTAGTT 

GTGTGCGTTTGGATAATGTGCGAGTCGTCGGTGACAATATTGGGGCAGATTAGTCCCAGGCCAAATTTGTTGCTGACTTGCTCATTCTGCTCGTT 

CAGCTGCAGGTAGGGGGCCTTCTGGGAGATCACTCGGAAGAATGGCTCCATCCAGCGAGCACAAGGCTGAATCACCTGCCAATCCAGACCAGAAC 

AGCGTAAAGCCATCTCCCTAAAAACGAAAAATATTCGGTAAGATAACCATGTCGTATGGGATGCAACTTCCATTCCAACTTACCGACTAAATGTA 

TGACTGATGGCGGCCGCTGCCAGCACCGAGTAGGAATAGTTGGCCATGCCCACGTCCAGGGTGCACAGATCTAGCAGCTGTGAGGTTTGCACAAA 

CTCAAAGCCAGAGAATTGCGGGTAGATAAAGGCATCGTCCGCCTCCGCGCTCTTTTGCCTGCCAATCTGGGAGAACGAGGCCGGAGTGCGGTTGT 

TCACATTCAGTTGCATATAGACGCCCAGCCAGCCGGTAATGGTGATGGGGCTGATGTCCCAGTCGAGCGCCTGCAGCAGAATCTTCTCGTGGTTA 

AGGATGTCCCGCTCTGTGCAGGCGCCATCCGTCACATAAGCAAACTCCCCGATCTTTGGCGGATAAATCTCCTCTACCTTGGCGGCCACAAACAG 

GCAGGTGATGCCGATCAACTGCAAGTGCGTCTTCTGCACCTTATGCGCCACATGCAAATAGCGATCCAGGTAGTCGACGGCCAAGTAGAAGGTCT 

CCCGATGCAGCTTGTAAACTTCACAGACCTCGATCAACCAGTCCAAGAGAATGGCACGCATACGTGGTTGCAGTCCGGGATGCTGTTCCAGCATC 

GAAATGCTGCGCAGACGCGAGTCCTGCTCATCCCGGTGACACATCAGGCGCCAAACATCGGCGGCGTTAGCCCAGGCGAGCGCAGGAAGTGGGCA 

CTGGCGCAGACCGTTCTCCACAGCCGGCGACATGGGCGTCCGCATGTTAATCATCAGGTCACTGACCGGATGATGGATCTGCTCCTGATGTTTTG 

GCAGGCCGGGAGTACGCTCTCCATTCACCGCCTGCTGCTGGCTGCAGCCGGAGGAGGCTGGAGAAATTGTGGAGGAGTAGATCTCCACGTCGTCG 
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TCCTCCTCCTCAACGTCATCCTCATCATCCTCGTCGTACGAGTAGTCCTCGCAGCTGTCGTCCAAAAGGTCATCATCCTCCGTCTTCTGCGTGAC 
CACCGTACTGGTCACCACCTGCTCCGCAGCGTAGCGCACCACCACTGGCGTCTGCTTGCTTCCACGATCCGGACTCGGCGGACTGTCCGGGCTGT 
CCGGCAGCGGACTGTGTGGCGCCTCCGACAGATCCTCGGCCGGTGAGCTACGTATGCTGAGCAGCTCCTGGGTACTCTGGCCATCAACCGAGACA 
ACGGGCGAGGTGTAGACCGAGGAGGCAACCGATGACAGATTGCCCTGCTCGCTGCCGTACAAAGCGGGCAGTCGCTGCTGGCGCTTAGCTGATGG 
AGGCTCAAAACCGAGTTCAGGGTCCTGAAAATGAAGGACAGATGTTTGAATAAGTAAATCATACAAGTGAATGCTTTAAATTGAAATGAACTAAG 
ATAGTAGACAATCAAATATAATTACTAAGCTATArTCAGATATTACTTTCTTAAATAGTAGTTCATATTTCTCAAAGGTGTCAGCAATTATTACC 
ATTTCAATAAATTTGCGCTTCTGTTCCAACTTCATAATAACATGAACAAATGAGTGGAGAGGATACTCAAGACAATAACAGGATCACACGACTGG 
CGGAATTAGAAACGTGCTCCGAATTTAATTAGCGAAGGAGGTTTGGGGAAATTTATGTGGGCACGCCTGTCAAAATTGGTAATCCTTGATAGATG 
GACTGCTGGCTGGACCGAATGCACCCGCACAAAATGAGGAGCCAGGACTTCACACAAAAGATCAAGGACTATGGAATGGCATGGAGGTAAGACAA 
TAGCACTTGGTTGGATTTTCTCGTGGGAACGAACGGGTTTCGTTTTGCTCTTTGTTTCTTTTGAATATTTTTGGTATGATCAACATATAGACACG 
ATATCGTTATTGCACTTTTAACTTTAATGTTATAAATTTAGTAGCAGTCTTTGGAAATAATTTTGTTGCGTCGTTTCTCTTTGGTATGATTAAAT 
AATTGTGCAAACAGATTTTTTGGTTATAGCATTTAAACAATTXTGGCTTCGTTTTGTTTTTTTGGTAGTTGTTGTTGTTGTTGGTGGGTGATCTT 
GGCAAATCCGGTTCTTAAATTAGTTCTTTGCACTCGTTCTTGGTGGTTTATATAAATATATTTATCAGTCTTAACATTTGCTACTTGGCAACGAA 
TAATTTTGGAATTTCGAATTTCGTTTCAGTTTGTCTGGGTATGTATTATTATTTGGCTAACTCCGCTCTTGGCTTGATCGAGTACCAATCGCGTT 
GACGTGCGCTGTGCAACTGAGCTCGACGTTCTCCGTTCCAGATCGCCGATCCCCGAATTCCCCGATTACCGATCGTCGTTATCCTGCCCAACGAA 
TGCAACGGTCTCGAAAATAGGGGTTGCTTTTCTCTGCAACAACAACAGCGGTAAAAACAGGATCGCCAAACTAACGTGCCGTCCCTTTCCAACGG 
GGGTTGCTTGCCTAAGATATCCCCGTATATCTTGAGACGACAACTTGTGTTGTTTTTGTCTTTGCAGCAGCATTGCATTTTTCGCCTCTGGGTGT 
TTATCGGCCCAACAACAAATCCCACTGACGGCTGTGGCTCCTGGATCCTTTTTACAATTATTTTTGTCAACATTTTCGTTCAGGTAGACGCAAGA 
CGTTTGGAAAATCACTACCTACATCATAATCCTCAAAATAGCGACACATAATATAGTACATAGGAAGTATATAAATTACGATTTGCTAGCTTTTG 
GCTCGAGCTGCCAGCGGCTGCATTAAGAATCTCCTGCATCTTAAAGGCTACTCAAATAATAAATTATCTTTTGACATTGATCAGAGGGATGGAGG 
CGGTGCCAAAGCCAAAAGTAGAGCGATCCTAATGGAACCTCATAAGATTACAGCACAAAAATGACAAAAACAAAAAAACGAAAATGGAACCGAAA 
ATCGAAAATCGAAAACAATTGAGATCCTTGACCATTGTTCTTCTTGCAGACAGATGATTGAAAACAGAAGCCAAAATCAGAAAGTCCATAAAGTA 
C AAT AAAAT GT AGT C AGC AT GGG AC AC AG AT GT CT GAG AT T GCT G AT AC C AT AAAAAAT AC AT AAATG AAC GT AT T T C GC TT AAAAAAC T AT T GG 
GAATGTTTTGAAAGTTATCTCAAGAAACCCATAATTTCTTGGCTTTCTCTTTTTTTTTTAATGCATAGTAAAAGATATTTATTTATAAGATACTG 
AAATATATCAAAAATAGCATTTAAAGCTATTCTGAAAGTTTTGGAAGCTTGGATTTCCATGTTTTAAATGTTTGGCACAATTATTTGCATTTATA 
TATACTTT G AAT G AAAAAT AT AC AT C AAAAAT ATT TG AAATT GCC C AAT AT CT GC AC AT AAAAC T AAGC AAT ACAAAAG AT A T AAAATC C AC AT A 
ATATTCGCATCAGTTTCTCTACAATTTCCTTTGGTTTACGATTGCAATTGATCTCTTCTTTGATTGAAAAAGGTGTGCAGTGCACACCATATAAT 
TTATCAAATACTGACTGCATTTCGTCAACTTTCTTAACAGTCAATCTCACACATCGTAAAAGCTTAACAATATCAACCAACCCTGCTGGCCATTA 
AAGATTTTCACATAATTTCTTGGGCGAAAAATGGGGGGAAATATTGCGATATTTATGTGACAATTTTAATTGTCTGAGTAGGTTTGCCTTCTCTC 
GACAAACATACCGAGCAGAACTCCCCTCCTGAAGTCGAGAATAAATACTTTTTTTTTTTTTTTGTTCAGGCAAGACAGGAAGGTCCAAGACGCAT 
GACAAATTTAATTTCTCATTCTTCTGGCCAACAAACTGCAACAACCCTCCGGAGGAGAAGATGATGTAGGTATAACCGGGAACGGCAATCGTAGA 
ATGAAAATGAAAATGAAAATGGACATGGATATGGACATGGACACATGTCAAGGCACCCAGATGCAATGTAATTAAAATCAAAGCGTGCAATGGCC 
TAAGAAGCCATCTCCCTGGACAATGGAAATTCCGTTCCTCTAGCCTTATTATTTCACTTCATTTTTGCTTTTGGTGCGTGGACATTGCATCTGCC 
GTTTGGATTTGGTCAAGCGTTATGGAATCACAAAGATTATGCACGAAAATTCTCAGTTTACTTAACGACTTGACGGATTCGAAGAGAAAGATCGT 
CGGAGTGAGTGTGAGAAATTAGTGAACTGTAAAGTGGGCTGGAATCTGGACCAATATGGACCCGAAGGGTTGCATTCCAAGTTTAACAAGGGTTA 
AACTGGGTGGTTGCCCTTAAGCCACCGCCGTCGCCGCTGCCTTCAAGGATTTAAGTCTCAACTCCAAGCAGGCAGCTGCTGCAATCGTCCTTCGC 
ATCTCGATCAGCTCCTTCAAAATCGAAGCCCACAAGGAAGGCACCCAAAACATCTCTCGAAATCTCACTGGGAGGGAGACCATGGGTCGCCGGCG 
ACAGATGATGCAGTTCCTCAAGTTATGGGTACATACAGAAAGCAGACAAGAAGCAATTCGATTGTTAGGCGACAACCTCTGCTGCGTCGCCGACC 
TCAATACTCGATGTCCTGGTGCCCCATTGTGTCGCTCTCCCTCTCCTTTTCGCCTACCCTATAAAGTCCTGCGCTTATGTGTCGTAATCTTTCGA 
CGGCCGTCTGCTCGAAACCGAAACACCTTTAGTTTTCAGACCTTGGCGTGGGTCCAGGTGTAACTTTTTCACCGCTGTTTTCGTGTAATATGCTG 
CATTGTTGTTTCCATTTATTTTTCTTTTCTTACAGTTTTTTTCTTTTTTTTTCAATTCTCCCCCAAGTCCTTTGATAACATTTAATTTCATACTA 
CAATGTGCGGTAGTTGTTAATATTTTTACAGTTTGAAAAGATTATGATTTTCTTGAAATTCAGGATCAAGCTAGAGGTTTTTTCTTCAGATATTT 
CATTTTTCATGAAGGACTCATAAAAGTCTGATGATCGTTCAAAATTAGCTAAAGTCCTGTCTACTTTTTGGGACATCGCTGCTGCTCTGCTTTTA 
ATAAATGATATGCAAACAACATGCCAGTGGACAAATCCCCGAAGGACCATAACAAACACTCATATTTCAAACCAATTCAACTGACCCCTGGTCAG 
TCCAGAAAGGGTTAATCCAACACAACACTTTCGCGGTGACATTGCTGGTTTTATCATCGCGGAAAATGGAAAATTCGGTGGGTCAGGGGGACGTT 
GAAATTGCTTGGACATGTAATTAATCATGCAATTGGTTAGATCCATGACAAAACGAATTAAAGCATATCACATTGGTGAAACGGGAGGAGGGTAA 
GTGGACAAAGGACAAACCGCGAATGAATTTTGAATGCGAAAAATGAATTTTGATTAATTGTCAGGAAAGACAATTTTCTTTAGGGGATATGCCAA 
CTTCTTAGGTGTAAGCCTTTAATAATTGAATATCTCTTGATATATTTACTGCAATTACGTTTTCCCCTGTTTTATCTAATAATTTACCTTTTAAA 
ATTATCTGCTCAAAAACTAGTTTCTTATCAGTCGAATCTTAATGTAATTAATCTGAAAACAGATCGCTGCCCAAGTTTACCATTATCTCTTGTTT 
ACCTCCAGCTAAATAAAATCCTCTGTAAAATGAAAAAAGAAAACTTTGAATTACAAAGCAGGCGACGCATGATCCCAAATCGGGATGCAATCTGG 
GAAAGTGAAACTTGAATTCAGTTTCCCACAATATCCCGATACTGACTTGACAGAAATCAAGACCCACACTTGATTGCATCGAAAGGAAACCCGAG 
CGAAAGCAAACCCAACTCCCCTTTTCGGGACAAAAGAAACAAAGAAGTTTCCCAAAAAAAAAAAATAACAAGGAACTTTCGGCGATTTCATGCCT 
GCC AT GTAAGGAGTAAGAAAAAGGGGTTTACATT AC ACTGACAAATTGGT AAAAT AATACC AT AATTTTGATCCTGTGCC AGC AGC AAAAC AAAT 
ATTGTATAATATTTCAAGACCGCATTCCGATGCAGACGCGTTCTCCGTGTCCCGAAAAGCAGATGACACAAGAGTTTTAAGAAGGAAAAGCCTGA 
TGATTTGCATAAAAGTTTAACGGAAATCTAATGAGCAGACGTCGCAGGATCGGCCAAGGCATTTGCATAAGGATAACATGTCCCGAAACGGGGTA 
GCAAAATGCAACCCAGTTGGAAAGAATTTTTTTTGTTATGCTCCGAATCTTCTGCAGAAGCAAATGTCGTCTCGATGAATATTTAATTAGAGACG 
AAAAGAAAAGGGAAAAACTCGGCCCAGTGATCCTTGAAAAGGATAAGGCGTTGCTCAACCCCTCCGTCGGTTCGTTCGTGTGTTGCGGTTTCAGC 
TCTGTCCCATTCCGGTAGTCACCACGATCCCTTACCCATCACCGTCCCCCCATCCCGATGCCAATCCCTTAAACTGTCTTGATCTAGTTCTTCTG 
CCTCAAGCTAACCTATAACTATCGCAATTTCAAATTCTTACCACGGCCCTAACGGACATTTGCCGACGCCTTGACTTTTGGAAAAACCCACCCCT 
GGAGAACTCGGACTTGGACTCAAGGTGTGCGCGCATGCGCAACGACGCCTTCTGTCCTTGTTCTGCGTCTCCTGCTCCTGCGCCTGCTTCTGGTC 
CGGCTCCTTTGTCCTGGCATGCATATGCTCGGCATACTGCCAAAGACCCCTTCGGGGAATTCCCTTACTCAACAAAGTTCGCCTGAGAGGGGGCC 
AGACATTTCCTTGCCATCCACTTTTTTCGTACCAGAATGACATTTGTTTGGCACGGAAGCAGGCGCAAAAACCATTGCAAAAACAAAAAACTCAA 
AAAAAAAACAGCAAAAAAAATAACCAAGGCAGATATCATTTGTTTGGCAACAGACTGTTTGTATACCCTGGTCACTACATTTTAATTTTACGATA 
TATTCTCAGTACAAACCAAGGCACCTTATAAATAATATAAATTTCTATATTTCCACTTTATCAATAAGATATGCTGAGTTCTAAAACAGTCCTTT 
TAACATGCCTAAGATTTAATATATGAAAACTTATAGATATTCATAATCTTGCAAAGATAAAGATACCCATCTAGAGGGTACCAAACTATCAAACA 
AGAAAACAATTTCTGCAAGTTCATTGCACGTATAGGAGAATATTCTCTCCCCTACATACATATTTCATATGGCACTGGAGAAAAAAATCCTTTGC 
ACCAACACAGCTGCCCATCGTGAGGAGCCGCCAGGAGATGAGAAGGAAGCCATTGTTCTCCGCCTTGATTGCAGCTACTTGAGAGGAGATCTGAC 
TTCAAGGCTTCCAAGGATGCTGCTAAATGAGCCCTTTTCAGTAAGTAGCGTTTATTTTTGGGCTCATCTTTGCATTGTGCCCGCGCAAAAGTGCC 
AAGAAGTAGCTGTCTACGTGCTCAACGGCTTATTTGGCCAAAACTCAAGCAGCGAAAGATCAAATGAAGATCGGCAGCCAAACGAGTGGAAGCCA 
CCAAAGGGCTGTCCATTCAAATGGAAGTACAAATAAAAGATCGTTTGCCAAACATAATAATAAAAATGGAGAATTGTTTAGCGATCGCTGATCAA 
TGAAAACAATTGGGAAACTATAATCAAATAGATAAATATATAAAATGGCTATTGCTCAAATTCAAAAAGAAATATCACCTGCACGGTTCAAATAT 
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GTATGTACATTGTTTAGTTATAATCTCAAATTTACATTATTATTATTAACAGAGTTCACAAGCAATCAGATTATTCAGCTTGGAAATAAATCTTT 
GAGCTTATAAACAGCCTAACCTTCAATTAATAGATTAAATTAATAGATTCTTATTTAGGCTAAAATGAAAGTGGCAATCACTGATCGACGAGAAT 
TTAGAGAACAGAGAATTCCCAGAGTTCAAGAGTTCGTTAAATAGAAACATTTAATTAATTTCAAATGCATAGGCACTTTTTGAAACACACTAGAG 
CTAATCATCGGCTTCCTAGAATGCCAGAATGCCTCTTGCCATTTCCCTGATAAGATTGCAAAAATCAACGAGGTCACATGACATGATAAAGCACT 
TCATTCATAAAACTTATCTGCGCTGTGTCTGGCTTTATGTTTAAATTACGATTCGTTTTGGATTACGTACCTCGTTTGAATCGCTGCTTAAACGC 
CGTTTACGTTTATTGCCATTGGAAGTCGCGGCTGTGGATGCTGCAAGATTCTGGCTATCACAAGAGCACATCTCGTCGGTTCTGGAGGCTGCTGA 
GGTTCCTGGTAGCTCCTCTTCACTTCGTTGCGACGCTGAAGAGAACGATGTTGATGGGATTGGATCTGGATGTGGAAGGATAGCGATTGGGCTAT 
CGCTGCTGCTACTGATACTACTGCTGCTGACGACGACGACTATGCTGCTGCTCACTTCTCCGTTGCTGGGGGCGGTTGTGACTATGCTGCCATTT 
GGCTCAGTGCTGCTGGTAGAACAAACACTGCAAAAGAAAGTGAGTGGAATTAGCAGTGGGGTACTAAATAATAATACTAAATAATTTAAATTATA 
TAGTTGAATGTCCCCAAAGAGCAGCTATTTGATAGAATACTTCAACATAGCTGCCTATATTTTTGTAGATTTAAAAGACATGATTGTATCAAAAA 
CCCTTCCATTTAAAGAATCCCCCATATTTTTGTAAACCCAAACCCAAACTCTGGAGCGCAGTTATTTCCAATTTCTCCACTTTTTTTTTTTTTTT 
GCAAAATAGAGAAATTCAATATGCCACGGAGGCAGGGCAGCAAATCACATAGGGGCGTTCTAAAATAGATACAAACGAAACGAACATAAATTCAT 
TGAATTTTGACACGGCATTCGAATTTCGAATCGCGATGCAAGACTCGAGATTAAGGCGCCCAACGGATATATTAATATTTAACATCCGGGAAATT 
GAAATCGAGATGACTGATCGGGGGGGGAAATGGAAATGGAAATGGGATCGTTGTGAGGGCCCCGGGGAAAGCCTTTCCTCCTCGAGATTGAGCAA 
TTTTGGCAAATTTGCATGCACATTAGCATAATGCAGGAGGCTGGCAAACGGGAAGAATACAATTTTCCATACTTTCCAAAGGACAGATGTTATAT 
CCTTGCGTCGCAACTCTCAAGTGTTGCGATCCAAGCAACTGCACTTTAAGCAATTCGATGCGCGAATTTTTGCAATTTGAGCGAGGAGCTCAGGA 
GAACGAGAGAAAAAAAAACGAAATCTAACCCTTGGGACCAAAAACCCTTCTTAAGGAACTCGTGAAAAGGGATAATA 

GGGCTGCTGCCCATACATAAATTCAAATGCTTTAGCCATTTCrTTCTTGGCCAGAGGCTCGGCATTGCAGCCTCCTGTTTTGATCCTTGCCCTGC 
ACTATCGATCGGGTTTAATGGTTTAATGAACAAGAAAAAGTAATCACGTAGTAAATTACGAGGAAATGAAATCGCTTTAAAAATGAGTTATGGCG 
AGAAAGGACTAATGGAGCGTACGTGCACGCGGCCTGCAACTTTTTCCCGCTTTCTTTCATTGTACTCAGGGTTTCGGGGTCAGGGGTTTCCAGTC 
ATCTCCGCGGCTTAAGGTTCAAACTGAAATTTGTCATTTCGAGGTGAAAGTTTCCTTTCCTTTGCGGCAAATGCAGATACTTTGTTTCGGTTTCG 
TTTCCCCTTTTCTATTCCGTTTCAAATCCGAATCCCTGTAGGTAGCAAAAGGCTAGGGTGACTATCTCGGAGATGATATGATGAAAAAATTCGAA 
GAAGTATTTCCTGGAATCTGGTCCAGTGGATAATAAGTGGACTATCGAAGGATCGATCTTTCTCACATGATAAATTATAAAGCAACGGCGACTTT 
TCGATTAGAAATTAGTTATTTATGCGTTTAACTTGGTATCAACGAAACAGTTATGTTTCGGAATTTATGTGTTTGATCCTAAACTCTTAAGGTTA 
CAAGAATTTTGTCATTTTTTGAACATTTTTATGTGAGTTCCTTGTACTACTTGTACTACCATGCCCTCAAAAGTATGCAATGGAATCTGGAAAAC 
CTGAACTTCCGTGACATTTTTTTGATAAATTCCGTTATAGATGGCAGGGAAATAGCTTCGCAACACATACCAATAATTATGTCGCACACAAGCTG 
ACCACGAAATATGTCCATAAAATGGACTATCACTTAAGTGATCGCAGTGATCAGTGGGTGATGATGATGATGATGATGATGATGTTGAATGCCAA 
AAC G GGG AC GC AAC T G AC AAAAC CAAAT G ACT AGG C AAT TAT GC AAT T AT GGT C AT GCGC AGGC AGCT ACT AC AAGC AAAAAG AAT GAAT AT AT A 
GTGCAGGGTGAAAACATATTAAACTCAATTAATTTTTGTCCAGGACATGACTGCAAAAGCCACAAAAAGTGTGTGTCTGCAGTGACAGCATTATC 
AAGAGCCGACCAAACAAACCCCCTTCATAACCCAACCCCCTGAATGACCAAACTGACCTGAACCGTCCCACAACTAAACACTTTTCTAATGACAT 
TTTCGCGCGCAATTATATACTAATCACACGCAGGACCTCGGAAAGGTCCTGTAAAGGTATGCAAGAGTGTTGAGGAATGCGGCACTTGGTTATCA 
ACGCTGCCTCGACTTTAACACCTACAAACAGATGGCTTTTTTTTTTTTGTTTTGAAAATTGACAGTGAAAATGCATTTTAGATTATCGCACTTGA 
GTCAGAAGGTTGGAACAGTAGGATCCCGAGTGGTTATAGCTCCTGCGGGTCCAACTTTCCCGCGGCGTATTCGGGCATAACCAGTTGGGCTCTGT 
CGCTTTATAATTACGAAATAGCTATTAGATAAACATCACTTTTGGTTAAGAGAATTAAAGGGAGACGCGCTGCCTACCAGGATAGTGTTGTTAGT 
GGTTAAAAAGGAAGTCGATCGGATGTCGCCACAGGAAGTCGCCGAGCGTTAGAAACAAATGGTAGCTCAGTCCGTTGGGGAGTCAGTCGCCTCTT 
GGCTTTCATGTGGGGTGCTAATTAGTTTGGTCAGGGCTGCGGAGGAAGTGGAAGTCACGAAAAGTGTTTGATGGGAAAAGGAAATGTCTAAGATC 
ATAAGAGATGTCAAGAAAAACATGCAGGTTCGCTTGAAACTAAGATTTTTACAAACTCAAATCGAACTTGGGGTTTGCAAGAATAATTTACCTCT 
TAAAACATCGAAAATTAAATTAAATTATTTAAAAGTAAGCCAGTAGAATTTAAAATGAATAAAGTACTCATAAGCTTACCTCTTGGCATTTAAAC 
CCATAATTGTACGGCCTTGAGAAGCCGTCAAGGTTCTTTATCCTGAATCTTGTTTTTAAGCTGCTCGTCAAAAGCAGAAATGTTTTGCATGAAAT 
GTACAATTTAAAGAGGACTGCAAGTAAATTAAACAAAATATAAACTTAATAAAATATAATAAACAACATTTAAATAATATTGTAAAGCAATCATT 
AAAAACTAAAAATGGCATATAAACTAATTGGATAAGAGTTTCTAGGATATGGTCATCACATGCGACGCGTTTTCGAGTTTTTACGACTTCACAAA 
ACCTATCAACAGTTAGTAAAACATTATCTGTAGATCCTTGTGGAAAAAACAACATATGATCCTTGAGGGAAACTCACACACAGCATGTGGCCGAA 
AAGTCTCCTTGCATCGAAAACCCCAAACAAATTTGTTTACGTGAGTTTTTCTCGAAAAACATATATATACACCAACTTTTTGCTTTTAGC AAAAC 
GAAGAAAGAAAGTAAATATTTTTCTTTGGGGTGCAAAGTGTGTCTAAGATATTCAAATTACTTAAATAGAAAGATGGGTGGGAAGACCATTTTGG 
AGTTGGGTCATCTTAACAGGTAGTCACGGAATGGTCATCTTTTTATCATAATAGAATTTCCACGAGTCTCGATTGATGCCTTCTGCTGCAGTTTC 
AGCTTCAGCTGCTCCCCTTGAATTTCCTTTACACTTGAAAAACATAACTATACAATTCAAACAACTGAAAGTGCTATTAAATTATAGAAATCAAT 
AGAGTTCTGTTGTTGCAAGTACGATGTATTGTTGATAAATACTTTTTTAACTCCGTAATGCTTGACTTATGAGATAATTTAATTTTTTGAGTGTA 
CCGATGTCAGGTGCATCGCATTCGGCGCATGCGCAACGTCCGCATTTTCTAGTGGTTTACAACGCTTAATTATATCGAGGAGCCGAGTCCTTAGT 
GCCCCTGTTGCACCTGTTGTGGCTAGAGTTGATCGCTGACCTTCTGTTAGAACTGTAAGTATGGGTTCGATGCGATGCGGGGCAGGTGAGTAGAG 
TCGATGGGATTGGAGAAGGGGAGTGGAGTGGCAGCTGTTGAAGATCCTACCCCAACTAAAAGGATTCTGGCAATGTCTCTGCTGTTGCTTTTGTA 
ACTTCAATTGACTTCCCCCTCGCTAATTGTATCCTGTCGCGTTTCCTTTTCCCCTAGCCCTACCCCTCCCTCCCCTCCTCGCTTCCTTTCTTCCT 
TCATTTATTTTCCTTTTTCCGTGTGACATATGGGCCTGGCGTTATATTCGCCACGATTTATGGACCTCCCTTCGGCCTACCATTCAGTTCGACAC 
CTTTTTTTCCTACCCCATATGCGAATGCTTTTTAGCCAACGACTTTCTTTCCCCTACTTTCTTGTTCTTACGCAACGGATTGTCTATTTTCCTCA 
CTTTTTGCCTCGCTTTTCTTTTGCGTTTTCCGGCCTTTTCTCATGCACAATATTTTGCTGTTTCCAACCATCATTTTCACACTAATTGCGGATAT 
GCCTTCTAATTAATTGGGTTCCCTCTGCTTCCCCGAAGTCCTTTTAGAGCATTTTTGCTTTCTTAGTTTTCAACCCCCTAACTATTCTACTTACA 
AC AAAGAAAT TT C CTTAC A C AG AAAAAT AAG TCTCTTTGGTCATT T AAG AAT AT GT T AAAAAT CT T AAAAC AAT T AAT AAG AT CAT CAT T T AA AT 
CGTAATCCCAATATAAAGATCTGGGTTTTGTAAATATTCGAGCTCAAAAAACATTATGAAACAATTTGTATCTTTAAGATACAGCAGAAACCTAA 
AAACGCAATAGAATAGACATTTATCTCTCAGTTGTAATCATGTTACACATCTAACATCATTTATAGTCGGATTAATGTAATCTTTTTTGTAAATC 
ACAAAACGTCCTAAAATTAATATTCGTTAATAAGGATGCGATAATATCCTTTAGTTTTACAAAACAGTTGACAGCGGCTCTTGGCAAACAAATGA 
CTGTATTATCGCTAGAGCAATCCAATTCTTGTCCAAATACCAGTGACTAAGATAGATTCCTACCTAATTGCAGGAACTGCCCACACGCAGGCTGC 
GGCTGCAACTTCCTCTAATGAAGATTAAGTGAAGAAGGCTGCGAGTTCGGGCGAAATCCGCGGCGACAGGTATACAAATATGAATAATGCCGCGG 
ATCGAGCGGATAAACTCGGGGGTTTCCTTTGCGCACAGCGCCTAATTAAAGGTGTACGATGCATGAGTGAATCTGTTTACCTGGACGGGGTCCTT 
CAAGGTCATTTAACTGGAAAACCGAGTGTCGAAAATAAAGTCGACCGTATTTTTTTCCCTCATCTTCGGAATATTTGCTTTGTGCGGAAGATTTC 
CCACAAGGAAATGGGACCAAGTCTCGAGATACAATATGAAATGCAACTTGAGGCAATGCAAAGAAGAATGCGCAAAATGCGCAAAGAAGAGCGCA 
AAAAAAATTATATACAAATATGTATGTATCTCGAGTATATGCCAGACTATATACATATGTACATATGTAGAGGGAAAACAGGTCTCGAGTTATCC 
GGCGTCTTCAACTGTCTGGGCCTGCAATTTTTGTGCGTCCTTTTTGGGCTGCATTTTGGGCTCCTCTGCACATGCGGCGAGATACATTCCTGAGA 
TACTCGAGATAAAAATGCACATGCATCTGGGCATTTGTGGCGTTGTCCATGCCAATGCCGCCTCCTCTCCATTGTTGTGGTTTGGCCACCACTCT 
TGCAATTAGTTGGAACTCGGTGGACCGAAAGTCTGCAGGAGTTGTGTAACAACATAGCTACATAATGCTGATAGGGTAAGAGGGTGAAGTTCATT 
AGGAAGATATTTTCTCAAATCGAAAGCCTAATCATATTTGCATAATATGAAACTATAAATAATACTACTAAAAGTTTAAACCAATACACGAAGAT 
TGCTTTATAAGCATCATTGAATTGAACCCGATCTTTGATGCTTCCTTTTCATACTCATGAATGCGTTTAAATTCTAATGGTCAATTGGCCCAAGT 
TTGCTGTGTTGCTTCAGTCGCATATGCCCACCCTACCCAGAACTACAGGGTACTGCAAACGAGAACAGGAAAGTCGATCCTCGCCTGCGCGCATC 
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AGTGTATCTGTGAGTCCGAGTATCTGAGTATCTGTGTGCCGGGTATCTAAAGRGCGCCGCCTGGGCAAAACGGCGTGTGAAAGTftAAGTCGGCCC 
AGCCGATGGAGGAGAACAAATCAAAGTTCTTGAATGGAGCGGCAGCGGCGGCGATGGCGACGGCGGCGTTCCAGTTCCAAGAATCGATGGCCTTG 
AAGTAGCCGTTAGGACATCGGGACGCGCTCAATTGACTATTTAGTTTATATTTAGCAGCGTCTAAAAGGATGCCCACAGCGAGTCAGCTAGCTCA 
AGTGGCCTGCGTCCTGTCTCTTCCCCGCTCCACAAATCGCCAGTCGGACGGACTATCCTTTGTTTGTCTGGGAATAACTGGAAACTGTTTATCAT 
TGTTGTTTCTGCTCCTCATACATACGGCTTCACCTCATTCCGATTCCGAATCGGATTCTTCTGCCGCAGCGACAAAAATGTGTATGGACTATAAA 
AAAAAAATGCCCACATATACATACATATGTACATATGAACCATTTATTTAAGGAGTTACTATTCACAACTCATCCTTTTGGAGAACATGTAATAT 
TTGTAAAACATTCCGCACTTGTAATTGAGAAATATTGTTGGTATTTTACTAGGAATTCGGGTTAATATGGCCAGTTGATATGTGTGGATTCCTAT 
AACCATTTACGTATGTATGTACATATGATGAGAAAATATTCGAGATCAGTTTGGGTTCCCATCTCGTCTAGAAAATCCTTTGATGACATACCGCT 
TTCGAAGGTGGCAAGCACTTGGAGGATCATGAGCAGAATTGAAAGGTGTTGACTAGGATTTGAATAGATCATATAAGGAGTGTTTAGGTGGGTCA 
AACAAAAGAAAATGTTGGGAAAATATACAAAATTCACGACACTGAATCATCAAAAACTTGATGCTCTTACTTTTTTCAAAATCACTTCTAAGAAA 
AACAATTTATTTCCCCATGGTTTAAACTGGCCAAATCTAAACATAAGCGGCTGATTTTCGGCACGATCATGACATAATCGAAAAATTAAGTATAC 
AAAAATCCACCCCATCAACAATACACCACCCACCCCCCACGCACACCCCCGCCGCGATCGCTCCAAAAGTTTAACGG 

AAACTAAAAAACGTCAAAAGAATGCAAAAiyiTAAAGAAGAAAGCAAAAATTGCGAAGAAAACAACAACAAAATGAATGTGGCAGGTAAATGGAAG 
^AGgI^AGGAAAGAACGAAGATGGCGCGAAGCAAGGTGGAG 

TCTCCACTTGGCAGTCTATCTCGCTCACTCATTGGTTGGGAGCACGTACATAGAGTAACTACTCACAAAAATAAATATAACCACGATTTTACAAT 

AATTAAAATACTATTATTAATTCTACATATGTAAACATACATACATATGTATGTACGTAGATTGTGGGTAAGGGCTAAATGCCACTACCAATCTT 

TTTCTTCGTGTACCAAAGATATTTGCGCCACTTTTGACATAATTCATTATTTTGTTCTTCACTCGTTTACTCCATTTTGCTTTGAACCTCTGCTT 

CGTGTTCGTCAAGTT^CGAAAGGTCCGTCGACTGATCGGCGATTTTCCGAATTTTTTCGACGTCGCAGTCAAATGCTCTCTTTTTTT 

AGAGCGCAGTACTAAAACGTTTTTGTTTGTTGTTGGTGTTGGGAAGGGGAAAATAACTTGACTCGCTGTTCTGATTGGAATAAAACGGAAGAGAG 

CGAGAGAAAGAGAGCTTACTCTCTCTCTTAGCGAGCGCATTTCTTCCAAGTTTGTGCTTGTGTGTGTGTCATTGGAGGAAGCGTATAAAAATGAA 

ATGAGCATGGGAAAAGAATGGAGAAACGCAAAGTGGCGCGAATTAGGCGTTTGGGCTGGGGCAGTTTTGTTTTACCGCCCAAAATCTATGCG2\AT 

ggcgggaSgtgcaattcgggcgg^gaaagattagaaagttggcccaaacagcaggg^^^ 

AGAAAAGCTGTAATAGTTCACTGAACTCCCGTTACGCATTAGAAACTCAGTGCACTTTTAGACGTGATTAGAAGCTTTCATTTATTTCATAGGAT 
TTTCCAATTTGTTCCCATTGGATAACGGTTTAATTCTAATAGGACACTTTTCCTTATAGGGGGAACACTAAAACACTGTGTGGGGGAGACAATTT 
GTAGTTGGACACCCCGTCTCTTGCCAATCAGAAAGCAACGGTTAAATCCTACAGCGTGTCAATTTTCTTATAAGAGGATGTTGTATTTGTAGCAA 
AAAATGCAAAGACACGACGGCAAACGCAGCATTCTCTCCCGCTCTTTCCTTCACTGTTCCACACACACTTGTCACCGCACACGCGTCATAAATTT 
TATTCCACTCTGAATAACGTATTGCCTTTTATCCACTAAAACCATTGTGCAATTGGTTTTACCTTCTAGTAAGCGGAAAAATGCTTAACATAATA 
TCCCCGGTGTTTTCACTTGTTTTTCTTAAAGGGAATTGAACAACGTTTTTCGAGAGTTTTTCGTCGTCGTCGTCGCCGGCCACGAAAAAACTTCA 
CGAGCAGCGGAGCGGCGGCAGCAAAGTGATCGCTTTGGAACCGAAAAAAAAATTGGTGGAACTGGGTATCAAAAGCAGCGACGTCGACTGCGGCG 
AGTGTGCATGAGGAACGCGCCAAAACAGGCGGATCCGAAGCGCCAAAAAATGCAACAAACACAGGGGGCAACTTTGTTGTACTCTAGGGGAGGGA 
CGGAGATGGGGCGAGCATCTCCAGCGCAAAGAACAGAGCGTGAGAAAGCGAGACGGGGCGATGAAGCCGCTGGTGGCAAGCTTGGCATTTTATTG 
AATGGAGTCTGCAATTGGAGTCCCCAGTCCCCAACAGGCTAGTCAGACTACAATTGGAGGTTCGACGTCCAGTCAGCTGGGTGGTCGAAAAACAG 
CTGACGAGGATGCCAGAAGGGCGGATTTAGTCGGATATATCGTGGTTATATATAAATACCTAAGATTTATTTCTATGTCATATGCCAATTTTAAT 
TCAAATATGACCTTCAGAAAATAAATATTTAAATTTTAATATGAAACTATTTTTTTTAATATATTTTTAAAGCGCTCCAAAATCAATTCAAGGTT 
AAAATAAATTTATCTTGAAGGGTATTCATAAATCTTTAAAATTTTAATTATTTATTTATTCTTCAGAAGCCTGAAATTACATGGTTTAATATGCT 
TTTCTAGTAGTTTTTATGAACTGCAGCCAAATAACATATGACAAGATGACTGAGGTCGAACTAGGTGGTAACGCTGCTTTGAAACCAAACAAACC 
ACTGCCACTTTGTTGTTTTCATCGATTAACATTCGAGCATTTTTGGCACCAACTTCGAGCGCTTATAAATCAGGTAAGCTAAATGAAAAACCATG 
CGTGCCTATAACCAATTGAAAGATCAAAACTCGCAGAATCTGATTTTTCACTAAAATCGAAAATCCCGTTCAATTCGGCCAAATTAGAATCGATT 
GAACTTGATGATTTCCCGCGCGCGGGAACCCGACGTCGGCGCGCGCTGCTGACAGCGATGACAGCTGCGCGCGCAAA 

(SEQ ID NO: 400) 

Exon: 20737. .20583 
Exon: 15027.. 14900 
Exon: 12092.. 11756 
Exon: 4964.-3694 
Exon: 3627.. 3402 
Exon: 3284.. 1701 
Exon: 1036.. 1001 

Start ATG: 14919 (Reverse strand: CAT) 
ACTTTGCTGCCGCCGCTCCGCTGCTCGTGA 

AAGAAAAACAAGTGAAAACACCGGGGATATTATGTTAAGCATTTTTCCGCTTACTAGAAGTCCTCTTTAAATTGTACATTTCATGCAAAACATTT 
CTGCTTTTGACGAGCAGCTTAAAAACAAGATTCAGGATAAAGAACCTTGACGGCTTCTCAAGGCCGTACAATTATGGGTTTAAATGCCAAGAGTG 
TTTGTTCTACCAGCAGCACTGAGCCAAATGGCAGCATAGTCACAACCGCCCCCAGCAACGGAGAAGTGAGCAGCAGCATAGTCGTCGTCGTCAGC 
AGCAGTAGTATCAGTAGCAGCAGCGATAGCCCAATCGCTATCCTTCCACATCCAGATCCAATCCCATCAACATCGTTCTCTTCAGCGTCGCAACG 
AAGTGAAGAGGAGCTACCAGGAACCTCAGCAGCCTCCAGAACCGACGAGATGTGCTCTTGTGATAGCCAGAATCTTGCAGCATCCACAGCCGCGA 
CTTCCAATGGCAATAAACGTAAACGGCGTTTAAGCAGCGATTCAAACGAGGACCCTGAACTCGGTTTTGAGCCTCCATCAGCTAAGCGCCAGCAG 
CGACTGCCCGCTTTGTACGGCAGCGAGCAGGGCAATCTGTCATCGGTTGCCTCCTCGGTCTACACCTCGCCCGTTGTCTCGGTTGATGGCCAGAG 
TACCCAGGAGCTGCTCAGCATACGTAGCTCACCGGCCGAGGATCTGTCGGAGGCGCCACACAGTCCGCTGCCGGACAGCCCGGACAGTCCGCCGA 
GTCCGGATCGTGGAAGCAAGCAGACGCCAGTGGTGGTGCGCTACGCTGCGGAGCAGGTGGTGACCAGTACGGTGGTCACGCAGAAGACGGAGGAT 
GATGACCTTTTGGACGACAGCTGCGAGGACTACTCGTACGACGAGGATGATGAGGATGACGTTGAGGAGGAGGACGACGACGTGGAGATCTACTC 
CTCCACAATTTCTCCAGCCTCCTCCGGCTGCAGCCAGCAGCAGGCGGTGAATGGAGAGCGTACTCCCGGCCTGCCAAAACATCAGGAGCAGATCC 
ATCATCCGGTCAGTGACCTGATGATTAACATGCGGACGCCCATGTCGCCGGCTGTGGAGAACGGTCTGCGCCAGTGCCCACTTCCTGCGCTCGCC 
TGGGCTAACGCCGCCGATGTTTGGCGCCTGATGTGTCACCGGGATGAGCAGGACTCGCGTCTGCGCAGCATTTCGATGCTGGAACAGCATCCCGG 
ACTGCAACCACGTATGCGTGCCATTCTCTTGGACTGGTTGATCGAGGTCTGTGAAGTTTACAAGCTGCATCGGGAGACCTTCTACTTGGCCGTCG 
ACTACCTGGATCGCTATTTGCATGTGGCGCATAAGGTGCAGAAGACGCACTTGCAGTTGATCGGCATCACCTGCCTGTTTGTGGCCGCCAAGGTA 
GAGGAGATTTATCCGCCAAAGATCGGGGAGTTTGCTTATGTGACGGATGGCGCCTGCACAGAGCGGGACATCCTTAACCACGAGAAGATTCTGCT 
GCAGGCGCTCGACTGGGACATCAGCCCCATCACCATTACCGGCTGGCTGGGCGTCTATATGCAACTGAATGTGAACAACCGCACTCCGGCCTCGT 
TCTCCCAGATTGGCAGGCAAAAGAGCGCGGAGGCGGACGATGCCTTTATCTACCCGCAATTCTCTGGCTTTGAGTTTGTGCAAACCTCACAGCTG 
CTAGATCTGTGCACCCTGGACGTGGGCATGGCCAACTATTCCTACTCGGTGCTGGCAGCGGCCGCCATCAGTCATACATTTAGTCGGGAGATGGC 
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TTTACGCTGTTCTGGTCTGGATTGGCAGGTGATTCAGCCTTGTGCTCGCTGGATGGAGCCATTCTTCCGAGTGATCTCCCAGAAGGCCCCCTACC 
TGCAGCTGAACGAGCAGAATGAGCAAGTCAGCAACAAATTTGGCCTGGGACTAATCTGCCCCAATATTGTCACCGACGACTCGCACATTATCCAA 
ACGCACACAACTACCATGGATATGTATGACGAAGTGCTTATGGCCCAGGATGCGGCGCATGCCATGCGCGCTAGGATTCAAGCTTCCCCGGCCAC 
CGCGCTGCGCGCTCCCGAAAGTCTTCTGACGCCTCCCGCCTCTAGTCACAAGCCGGACGAGTACCTGGGCGATGAGGGCGATGAGACGGGGGCCA 
GGAGCGGCATCTCCTCCACAACCACATGCTGCAACACTGCCGCCAGTAACAAGGGCGGCAAGAGCAGCAGCAACAATTCGGTGACCAGCTGCAGC 
AGTAGAAGCAATCCCTGAGAGATTCGCCTCACTGCTCTTGGCCATCATCCTCTCACACTCACCACACCCGCCCAAACAGTCTACGTTAAGTGTTT 
CTTATATTTAAATTATTTTTGTTAAGTGAAAAGTTATTAGTTGCCTAATTCCTATGCTACAAACATATTTAATTTGAAATTATCTACGCGCTCTT 
GATTCTCAACTCGCGATATAATCATTTTCCTCGCTCAGCTTAATATTTAACTTTGTGTATTTATAGAAAAAACAGAAATATAATTTTAAATGTAC 
ACTATCTATAAAGCGCTCAAGCTTATATATATATATATACACATACACTTATAGTATAGCAGACCAGACAAAACTGAATTAACCATTAGACTAAT 
ACCTGCTGAAAAGGAGCACAACACTATCCAGAATTCAGTGGTGTGACGCTCTCACAAACCAAGTTGGGTGCCTCAAAAATGAAACAACAAAATAT 
GTTCGCAATCATACATGTTTAGCTTTCAACCAGTCAGTCAATCATTTGATATCACCAATCACCAAGCAGCAACTAACAAAACTAAAACTGACAAA 
CAGAAGAGGGGGCAGTAGCGGCCCAGAGCACAGTACTTGACTAATCAACTAAAATTAGCATATAAGTCGAGGGAATCTATCTAACTCTACTAATG 
CATTTCATTCGACCATTAATCCTACTAACAATGAATCAACAAACAAGACAAGTCCACAGCTGACATGGAGTGACAAAAAGAAAATGGCAATTGAA 
GAAGTAAAGAATTAAATATTCATGAATTGAAACCCCAGAACAGACTATATGTATTCATATATACATCATTGTAAAATTTTTTTAATTAGACTAAT 
TTGCCATAATTTGCTAAGCAAATAAATTCTGTACATAAAACAAAACAATACAAAACGCACAAAAACAAGAATTAACCGTAACTGAAACGCATATA 
ATATATTATTATATAAGTGTACAATTCTTTATATGGCTTAAAATATTTTATGTACATTCAAAAGGCAACCACGCAAGGTAGAAAAATATTATTAT 
TCGTTAGCTTAGGCATAAATAACATGTAAGTGTTCATACGATTAAGGCTGGATTGAAAACAAAAACAAACAAAAAACAACTAAAAGAAAAGCAAA 
CAAAAAAAAAAACTAATCTATTTATGTTTTATTTGTAGTTTTAAAAGCAATAATCATTTTAAAATTATGAATTTAATTTTAAATAATTTCAGAAC 
TACGCTTATATCCATGATTTGATGTTAATAAATAAACGATCGCAAAATCGTAAACTAAACAAATCAAGTTGAATTGAATGAGTAAATAAATATAT 

ACTTTTGTAAATAAAACTGAAATCAATTCAAA 
(SEQ ID NO: 401) 

Start ATG: 2 64 (Reverse strand: CAT) 

MGLNAKSVCSTSSTEPNGSIVTTAPSNGEVSSSIVVVVSSSSISSSSDSPIAILPHPDPIPSTSFSSASQRSEEELPGTSAASRTDEMCSCDSQN 
LAASTAATSNGNKRKRRLSSDSNEDPELGFEPPSAKRQQRLPALYGSEQGNLSSVASSVYTSPVVSVDGQSTQELLSIRSSPAEDLSEAPHSPLP 
DSPDSPPSPDRGSKQTPWVRYAAEQWTSTVVTQKTEDDDLLDDSCEDYSYDEDDEDDVEEEDDDVEIYSSTISPASSGCSQQQAVNGERTPGL 
PKHOEOIHHPVSDLMINMRTPMSPAVENGLRQCPLPALAWANAADVWRLMCHRDEQDSRLRSISMLEQHPGLQPRMRAILLDWLIEVCEVYKLHR 
ETFYLAVDYLDRYLHVAHKVQKTHLQLIGITCLFVAAKVEEIYPPKIGEFAYVTDGACTERDILNHEKILLQALDWDISPITITGWLGVYMQLNV 
NNRTPASFSQIGRQKSAEADDAFIYPQFSGFEFVQTSQLLDLCTLDVGMANYSYSVLAAAAISHTFSREMALRCSGLDWQVIQPCARWMEPFFRV 
ISQKAPYLQLNEQNEQVSNKFGLGLICPNIVTDDSHIIQTHTTTMDMYDEVLMAQDAAHAMRARIQASPATALRAPESLLTPPASSHKPDEYLGD 

EGDETGARSGISSTTTCCNTAASNKGGKSSSNNSVTSCSSRSNP* 
(SEQ ID NO: 402) 

Name: cyclin E type II 
Classification : cell__cycle_regulator 
Gene Symbol: CycE 
FlyBase ID: FBgn0010382 

Celera Sequence No. : 142000013384450 

TTATACGCTTGTCCTTGATGATGTTGCTATTGCCATAGAGCGGCTTCTTGTAGACTATCTTGGATGCCTCCAAATGGATGGCTAGCAGAAGTGTC 
AGCAGCAGCAGGGAATTTTTAAAGGCGCCTGATATATGAAATATATACGATATGAAATGTGTTCTTTAGCTAATATTACTTTCAAAGCTTACCCA 
TCTTGGTATCTACGGAAACACTGAATACTTCTGCCTTTGCGCCATCCATTTTATATAAGCCATTTTCTTACTTGGCTTGCAGTTTTTCAAGATTT 
CGGTCTTCCATTTTAGTCGGCCCCCAACGAAGTTCAAGTTGTGAGAACCGGCAAAATTGTGTCAGGGTTAAGTCTGATTAAGCAGAATTTTCACA 
CAAAGCGAGTGGAATTCATTCATTTGGAGGAAAGTGAGAAGAAAGAAAAAGAATTACTTCCGAAAATTATGCGTAATCAATAAACCCAAAAGTTG 

TGTCTTGAGCTGCACAAAGTTTACCTTACCGTTCTAAA^ 

ATAATAACATAATGGATAACATAAATAAACGGCCGATTAAACAGCAATTATTATCCAAATTATACAAGTTATAATATGGTAATATTAACAAACAT 

TTCAACATAAGCAAAGCTTTGGTTTTTAATCTAATTATAGATTTAATTTAATTTTCATTTTTTGGTTTGTTTTTCGTTAAGTTAGTATTTTTTGC 

GCTGTTTATGGAACTTAACAGAAATACCAATCAATATTGGGCGGCCAACATTTGAAATATAAATACAGATTATCGATATGTATCTTGAACGAAGT 

TTTAAATGTTTGTTATTTGTTAAACTCAACAATCCTATAGATTTATGCCATAAAAATGTCAAAATTATCGTTGCTTCAAATGATTTAAGTACAAA 

ATTCGAGTTTCGATAAACGGTTAACAGTGCCGTACTCACCACGGTGCGCCCAACAGCACGCACACACAGCCGGCTGTGGTGGGTTCGCTGGACAG 

GCAGTGTGGTGAATATACTAGTTGCTGGAGCTTGCCGCAAAAAAAAAAAAAAAGAAAACAAAACAGATCGAAAAACACAGCAAACAACAAAATTA 

TACGCCATTAATTACCCGATAAACTTACCGATGGCCGATAAGCAACGGCTAAGTGATCCGTAACACCCAGCGATAAGATCCAGATAAGATTGGAA 

TTAAATTTTGGGTTTGCCATCCGCATCGAAATCGCCGCAATCGGAATTTACTTGTACACCAGCCAACTCGCCGTTCAAGACGTCCAAGTTTTTTT 

TAATCCAGAAATCCCATACACTGCCAAAATGCCCGAATATGTGCCGGCACGGGGCTTCAAGGAGATGGAAAATCTGATCAAGCTGTACGAGAACC 

AGCGCAGCCCCATCTACATCTACTTCTACGGCGAGAAAGACAAGGACGGACGCAGCTGGTGTCCGGACTGCGTGGCGGGTAGGTGACGAATTCTC 

CTCGCCCAGCGGGGCGTTACTAGTCCAGTTGGGCACTCTACATTCTGCTTATCACGCTTCTCATTTGTGGGTTTCAGTTAGTTACTTTCTACCCA 

GCTAACTTGAGTTACTTTTGATTTGATTTTCGCTCTCCATTGAGAGTTCCATCTACATTGTACATTGTCCTCATCATTTCGATTCATTTCATCGC 

AGAGTTATAACATAACATAAAGAAGTGACATACATTAAGTTAATCCCATTATCGCTGATATTTTCCAGCCGAGGAAACTATCATGAGTGCCTTTC 

GCAACCACGCGCCGGCGGATTGCATGATCCTGGTGGTGGACGTGGGTAGCCGGGAATCCTGGATAGGCAAGGACAACATGTTCCGAAAGCCGCCA 

TACTCGGTGGAAGGCATACCGACCCTTATACGCTGGAAGGGCGTGGAGCGCCTGGATGGGGATCAGCTGCTCAAGTCGAGTCTTCTGGAGCTCTT 

TTTCGAGGAGACCGATCCGAAGAAGTCCAATTTTGTCGCCCAATAAAACGCGATTTGGCTTTGGCTTAAACTCACTCCCACTCTCAGTCTCTCTC 

TCTATCACTCTCTCTCTCTCTCTCTCGCTGCGGCTCTCTTTCTCTTTGCTGCTTTGCGCTCAACTCGGTGGAAAGAACGAGTGAAATTGTCGTTG 

GGAGGGGTGGTGCTAAAAAAGGGGGGAATGGAGAAATGGGGGAGAACAGAATGAGTGGAGCATTTCTGGCTGGCAGCGCGTCGCTCGGCATTTAC 

TCGTTTATTTGCACGCATGTCGTCGACCAGGACGAATCGTTTGGCAGAGCGCGGCTGAAAAAGGAATCGAAAACGGGGCTATAACGGTAATAGCG 

GCACACGCGTCCAACGGTAAACACGCGTGCCAACGCGCCATCTCGCTCGCACGCGCTAATGCCGCGATTTAAGTTTTGTTTTATTTTTTGCAAGT 

GTTCGAATCAAAAAGCCTAACAAAACACAAAACCCGAAACAGTTCTCAAGAAATACATAGGTAAATGCAAAGTAAGATATGTATATATGTAGCTA 

AACATAGCGGAAAAGCGCAGAGTATCATTGAATTTGAAGCACACGACCGCACAGTTTTGCAAGCTGATTAACTGTAAACATGCTGAACATAAACA 

ATGGATAAAAAAACATGATAGATCCTGCCTAAAAGTGAATGGTTATAAATTAAGAATGGGCTTGCAACGGTAAATCCTGTTGCGTTTTGCTTTAA 

CTGTTTGAAAATAAAAAAAAATATATATTTAATAATAGAATAATAAAAGGTTTCGTAAGTAAAGTAACATCAAAACTAAACTAAAGGTAAATGTA 
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TAAATTGATCAAATGTTTTAAAAAGTAATACGTTCAATATTTAGCCAATGCAAAACCAGGCAAACGCAATTTACCGTAACTTCTTAGCCAAAAAC 
CCI^TTGCTTCGGCCGCGTTGCATATTAATTGTGTGGCAAGACTGTTGCTTCTTCTCGTTCGATTTGTTTGGTAACTTTTACGTTAAATCATTGA 

C 

(SEQ ID NO: 403) 

Exon: 1001. .1503 
Exon: 1779.. 2041 
Start ATG: 1395 

CT^CAGC^ 

AAAGAAAACAAAACAGATCGAAAAACAC 

AAG T GATC CGT AAC AC CC AGC G AT AAG AT C C AG AT AAG ATT GG AAT T AAAT T T T GG GT T T GC C AT C C GC AT C GAAAT C G C C GC AAT C GG AAT TT A 
CTTGTACACCAGCCAACTCGCCGTTCAAGACGTCCAAGTTTTTTTTAATCCAGAAATCCCATACACTGCCAAAATGCCCGAATATGTGCCGGCAC 
GGGGCTTCAAGGAGATGGAAAATCTGATCAAGCTGTACGAGAACCAGCGCAGCCCCATCTACATCTACTTCTACGGCGAGAAAGACAAGGACGGA 

CGCAGCTGOTGTCCGGACTGCGTGGCGGCCGAGGA 

CGTGGGTAGCCGGGAATCCTGGATAGGCAAGGACAACATGTTCCGAAAGCCGCCATACTCGGTGGAAGGCATACCGACCCTTATACGCTGGAAGG 
GCGTGGAGCGCCTGGATGGGGATCAGCTGCTCAAGTCGAGTCTTCTGGAGCTCTTTTTCGAGGAGACCGATCCGAAGAAGTCCAATTTTGTCGCC 

CAATAA 

{SEQ ID NO: 404) 
Start ATG: 395 

MENLIKLYENQRSPIYIYFYGEKDKDGRSWCPDCVAAEETIMSAFRNHAPADCM 
DGDQLLKS SLLELFFEETDPKKSNFVAQ* 
(SEQ ID NO: 405) 



rp l Pra Seauence No. : 142000013384645 

taaLtcaaataacaatatttaactatggcgccacgtttgacttacctcggatagcaatctcttgaggccctcgacctcatcctcgccggcgttt 

AGCTCGCCACCGGGCAATTTGAAGAACGTGGTGCCCAGCTGCAGGAGCAAGACATGTGGCAGTCCGTGCTCGTGGACCAGAAGCACTCCCTCGAC 

GGAGCGCCGCATTCCGATCCGGTCGAATTCCTCGCGCATCCGCTGGAATCGGGATGGCACCGACGGATCCTTCTCAAAGAGCGGCTCCTTGGTGC 

CGAATGTGTAGTTCGTAAGGGGGTATCTGAAACAATCGATCGATCTATGATGTGCGTCTATGAATGAAATGTGTTGTACCCACAAGTTGATGGTG 

CGATTGATGGTTAGCGCCTGGTTGGTGTACTTCTGGGTACCGTTGTTATTGCTGCTGGCTGCATCCGCTTGGCCTTGGCTTCCGCGGCGCGGCCA 

ACCGGAGCCCGATTTGTTTGAGACTTGCGAGGACGCCATTTTGTTATACACACACCCCTGTGGGAAAATTTGCGGGACAAGCTGGCCGACTTTCT 

ACCAAATCCTTTGGTTGTGCGTTGGAAACTTGTATTCCAATAATATTATAATCCTTAACTGTTTATTTACAAGTTATACTATAAATGTAAACCCG 

TTTTAAGCCGCTTAATTACACTTTTCTGTCCAGCGCTTGTGCAAAAAAATTAACCGGCTGGACGAGGGCTGCACGTGAATTGATTGATGATGAGC 

AGAACTGGTTCGTTGAACTAGATTATGGAATTAATCGATTTCGTACTTTGTGTGAAATAAACTTGTAATGACCTTTTGCTTAATATTTATTAAAG 

ATTTATTCAATTTTTTGTTTTATTTTTAAATGCAGTTTTAAATTATTGTTTGTTTACATATGTAACGACAGCCCTGGTGTTTCTTGTCTAATGGC 

AACGCTCTGAAATTGCGCAGCAACCCCATCTGGCCACACTGACCCATTAGTTTTTTGTTTATGTTGGGTTGTCGGAAAAATCGGCTGTTTTCCGT 

GTGTCCGTCTGCCATGAAAAGCTGCTAAAAAGCTAAATATAAAAATCAGCGCAGCACACACGTTCCGCTGCCTGCATTGTTTGCCCATTCTAATG 

GGAAATTATGTGAGTGCGAGTCAGGAAAACGCATCGTGTGTGTATATATACATATATACATAAGTATGTAACTGCGCCCGTTTGTTTGTGTGTGT 

TTTTGTGTGCATGGTGCAATGGAAAAGTGATTATTATAAATGAAAATCAGTGAAATGTGTAAGTGTGGCACCGCAAACGCTTTGAACTTTTCCAT 

TTTCATGTTTTCACCTTGTCTTCTCTTCTCTTCTTCACCCACCTCCCCCGTAGTCAACCAACTCCCACTTGTGCGTGTGTAGGAAGGCAGAAGGG 

TGTTTTGCATTCGTGTGCGTGCCAGATAAAAAGACTTTGAATTTCTGGAGGATATTAGTGTCAACTGTGTGGGGTATAATACGCCGTAGGCCCTA 

ATATGCAGGCAAGTCCATTAATTACGCCAAAGGTCACAGGTTAGGCACGCAGCCAGGATCCAAGGAGCAATAATAATGTTGTATGAAGGCGACCT 

GTAAGCGAGAGCGAGAGAGCTGGTGTTTTGGTGTGAGTGCGTGGTAGAACAAAGAGGACGAAAAACACGGCTAACGAATTCGCCGCACGACACAC 

TCGCCGCATGGCGAATTCAAACGTATCCCAGTGAACGAATGAGAAACAAAAGGATATCTTATTGAAACTGATTAAACTAAAGACTGCCGAATAGT 

TATATAAAATCCAAAACCATGTCTGCGGCCGAACTCGAGGCGGAGGCTTCAGCCCAGGAGCAACAATCCCAGCAACAACAACAAAATCAATCACA 

AAGCGAGGAGATAAATACAAACACTGCCATCAATGCAAATGCAACCACAACAAATAGCTACAACAATAATTTAGATACGGAAACCCAGCTTCAGA 

TGCACCAGGATCCGCAGCAGCAGCAGGAACTCGAGGGCAATAGCAACAATGGCATCACAGGACTCGGCCTCAGTCTGATGAACTCCTCACCGCCG 

CCGCACAGCTACCGACATTCGCGGGCCTATCGCCACTTCAAGAATCCGCCACAGCCCCACATGTGCATCCGCACCACCACGGAGGCGGGCGAGGA 

GCTCTTCATCAATGTCCTCAGCTGGACGCGCATTGTGATACCGCAGGAGCCCAGCGATCCCATTCCCCTTTACGGAGGCATGCGGGTAAGATTCC 

TTAGATTAAGTGTCCTCAAACATTCACGTCGGTGTTTCCTTTATTAGTTATTTATGTTAATGTTCTTATTCATTAAGGTACCGCCTGGCAGCCCA 

CGCAGTCCTCCCATTGTCTTTGCCGTAATGGCCAATCCGGAGGTGCTCAAAGACTCCGGACGTCACAGCAAAGATCCTGAGGAGCGACGAGCTAT 

GGTGGAGCTGATGTGCGACTTTGTGGAGGCTATGAATCCAGGCGTGAAATTAGTCAGGTATATAATAACACTATTAGTTTACTGCCCGATTAAAA 

ACAAGAATAAAAATATATGTAATACATCTCCAGAAACGCCGTAATACTCAAGGATCGCGATATCTCCGGCGAGCTGAAGGATGTCTGGAATGCCG 

TTCAGGCGCAGCGCGATCGCGAGAGGGAGGAGCAGATGATGCAGCAGCGCCAGCAGCAGCACTACCAGAACATCACCACGCAACAGATGTTTCCC 

AAATCGCCGGATGCATCGCGGGCAGCTAGTGCTGGTGCTGCAGTGAACGAGGCCAGCTCGCCGCCGGCTCACTTGTCTGAGCCGCTGCTGGTAGA 

GCAGGGAAATGGCAATGGCAATGGCATCACCCTGGCCGTGTTTGCCCAGCAGCTGGACAACAAAGTGGCCAGCGAGCAGGGTGAGCAGCAGGAGC 

AATCGCCAGTTTCGGAGGATGCTTGCGTGGATCAAACGGATGCAGGCAGCTCGCTGAATGGATCAACAACTGTGCTGGACAATCAGCCGGTGGCA 

GTGGAGAACGAACCAGAAGTAAAGGTTGCACCACCGGCAGCCACAAATGGTGCTAGCACTCCCACACCCAGCCACGCAACACCCACAGTAACAGC 

CACAGCCACATCATCGCCCAGTCCAACTCCAACTCCCGTTGTTGCTCCCGTCGTTACAGCAACTCCTCCACCTGCTGCTCCCGCAAAGAAGGAGA 

AACTGGGCGGCTTCCTGCCCAACGGCTGCATCTTCCCGCGCTTCAAGAACAACAAGCACAAGGACAAGGACAATAGTCACAAGGAGGGCAAGAGC 

AAAGAGAAGACCCTGCTGAATGCGCTGAAAAAGAGCAAAGAGAAGAAGGTGGCGCCCAGTGAACAGCCGACAGAAAAAGCCGCCGCCTCGGACAA 

CGGAACAACGCAGTACGAGAAGAACTGCATCAACAATCTGGAGTGCGAGGTGCAGAAGCTGGACTTGAACAGCGGCACCAATGGCGACAATAGCA 

TGTTCATCAAGCTGAAAGTTTCGCCGGCCAATGCCACAGCAGCGGCAGCGGCGGCCAAGTAGAGATAATCCTGAATTTACACACAAAAAAAAAAT 

AAATAAAATCTCGTAGGTAAGTGGGAAGTTAGGACGATAGAGAGCCAGAGGAGGCGGCACTCTACCAGTGAACAGTGTCACTGTGTCAGCACTTA 

GGGCTTAGAGTCGTGGATAGAGTCGTGTAGCATATACAAGATTTATATATACAGATATATATACAAGCATTCACACACCCGTTTTTACACCGTTG 

GTCAATTTTTTGGATAGCCACTCAGAACCAGGACTAGATGAACTAACAGTTGTGCCATGCTCCAGCTCATAGACATTTATAGACGACAGCGCTTC 
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TCCCAGGATCTTCCCCGGCATTTAACTGACAGAATGATAAACGTATTTACATAGTTCAATATATGTATGATTCGCATACAGTTTGCCTTATATAC 
AACAAGCCCGAGAGAGCGACAAGAACCTAGCAAGAACCAGTTGATAAACACGAGAATAAATGTATTTAAAAACATAATCGTCGAATGAAACTCAT 
GCGCCAAATGAACTTACCAGACTAACTACTAGCTCCATTCCCAAATTGTCACATCATCTCATCGTGAACCATACACCCACCAGGCATTTGCCCAT 
CGATTCATCCATCCATCCATCCATCCATCCATCCATCCATCCATATCCATATCCATATCCGTGCAAAAGTATTAAGAAACTTCGCAGATCCAAGA 
ACCCTAACCCTGGATCGATTGATCTCCACGCTTTGAAGTTGAGGGGCAGTGAAAACAGCGGAGGGAGTTTTAAAGCCGGCTCTTTAGTGCCACGA 
CTAACTTATGTACCTTAGTATAGCCGAATGCGAAAACAGTATACAGATCCGCAATGTCGCTATTATGATTTATATTTTAAAGTTTAACTTTTAGT 
TTCCCCTTTTGGATACCGAAACGGGGAAACTACATTTCATTCTATTTATTTTCCGTTCGTGCTTGTTGTGTTGTAATTGTGTAATACTACTGTTG 
GAATTTTTAACTTTTAC 
(SEQ ID NO: 406) 

Exon: 1001.. 1293 
Exon: 1384.. 2270 
Exon: 2358.. 2527 
Exon: 2599. .3577 
Start ATG: 1824 

Transcript No. : CT13259 

TTTTTTGTTTATGTTGGGTTGTCGGAAAAATCGGCTGTTTTCCGTGTGTCCGTCTGCCATGAAAAGCTGCTAAAAAGCTAAATATAAAAATCAGC 

GCAGCACACACGTTCCGCTGCCTGCATTGTTTGCCCATTCTAATGGGAAATTATGTGAGTGCGAGTCAGGAAAACGCATCGTGTGTGTATATATA 

CATATATACATAAGTATGTAACTGCGCCCGTTTGTTTGTGTGTGTTTTTGTGTGCATGGTGCAATGGAAAAGTGATTATTATAAATGAAAATCAG 

TGAAATGTTCAACCAACTCCCACTTGTGCGTGTGTAGGAAGGCAGAAGGGTGTTTTGCATTCGTGTGCGTGCCAGATAAAAAGACTTTGAATTTC 

TGGAGGATATTAGTGTCAACTGTGTGGGGTATAATACGCCGTAGGCCCTAATATGCAGGCAAGTCCATTAATTACGCCAAAGGTCACAGGTTAGG 

CACGCAGCCAGGATCCAAGGAGCAATAATAATGTTGTATGAAGGCGACCTGTAAGCGAGAGCGAGAGAGCTGGTGTTTTGGTGTGAGTGCGTGGT 

AGAACAAAGAGGACGAAAAACACGGCTAACGAATTCGCCGCACGACACACTCGCCGCATGGCGAATTCAAACGTATCCCAGTGAACGAATGAGAA 

ACAAAAGGATATCTTATTGAAACTGATTAAACTAAAGACTGCCGAATAGTTATATAAAATCCAAAACCATGTCTGCGGCCGAACTCGAGGCGGAG 

GC T TC AGC CC AGG AGC AAC AAT C C C AG C AAC AAC AAC AAAAT C AAT C AC AAAGC G AGG AG AT AAAT AC AAAC ACTGC C AT C AAT GC AAAT GC AAC 

CACAACAAATAGCTACAACAATAATTTAGATACGGAAACCCAGCTTCAGATGCACCAGGATCCGCAGCAGCAGCAGGAACTCGAGGGCAATAGCA 

ACAATGGCATCACAGGACTCGGCCTCAGTCTGATGAACTCCTCACCGCCGCCGCACAGCTACCGACATTCGCGGGCCTATCGCCACTTCAAGAAT 

CCGCCACAGCCCCACATGTGCATCCGCACCACCACGGAGGCGGGCGAGGAGCTCTTCATCAATGTCCTCAGCTGGACGCGCATTGTGATACCGCA 

GGAGCCCAGCGATCCCATTCCCCTTTACGGAGGCATGCGGGTACCGCCTGGCAGCCCACGCAGTCCTCCCATTGTCTTTGCCGTAATGGCCAATC 

CGGAGGTGCTCAAAGACTCCGGACGTCACAGCAAAGATCCTGAGGAGCGACGAGCTATGGTGGAGCTGATGTGCGACTTTGTGGAGGCTATGAAT 

CCAGGCGTGAAATTAGTCAGAAACGCCGTAATACTCAAGGATCGCGATATCTCCGGCGAGCTGAAGGATGTCTGGAATGCCGTTCAGGCGCAGCG 

CGATCGCGAGAGGGAGGAGCAGATGATGCAGCAGCGCCAGCAGCAGCACTACCAGAACATCACCACGCAACAGATGTTTCCCAAATCGCCGGATG 

CATCGCGGGCAGCTAGTGCTGGTGCTGCAGTGAACGAGGCCAGCTCGCCGCCGGCTCACTTGTCTGAGCCGCTGCTGGTAGAGCAGGGAAATGGC 

AATGGCAATGGCATCACCCTGGCCGTGTTTGCCCAGCAGCTGGACAACAAAGTGGCCAGCGAGCAGGGTGAGCAGCAGGAGCAATCGCCAGTTTC 

GGAGGATGCTTGCGTGGATCAAACGGATGCAGGCAGCTCGCTGAATGGATCAACAACTGTGCTGGACAATCAGCCGGTGGCAGTGGAGAACGAAC 

CAGAAGTAAAGGTTGCACCACCGGCAGCCACAAATGGTGCTAGCACTCCCACACCCAGCCACGCAACACCCACAGTAACAGCCACAGCCACATCA 

TCGCCCAGTCCAACTCCAACTCCCGTTGTTGCTCCCGTCGTTACAGCAACTCCTCCACCTGCTGCTCCCGCAAAGAAGGAGAAACTGGGCGGCTT 

CCTGCCCAACGGCTGCATCTTCCCGCGCTTCAAGAACAACAAGCACAAGGACAAGGACAATAGTCACAAGGAGGGCAAGAGCAAAGAGAAGACCC 

TGCTGAATGCGCTGAAAAAGAGCAAAGAGAAGAAGGTGGCGCCCAGTGAACAGCCGACAGAAAAAGCCGCCGCCTCGGACAACGGAACAACGCAG 

TACGAGAAGAACTGCATCAACAATCTGGAGTGCGAGGTGCAGAAGCTGGACTTGAACAGCGGCACCAATGGCGACAATAGCATGTTCATCAAGCT 

GAAAGTTTCGCCGGCCAATGCCACAGCAGCGGCAGCGGCGGCCAAGTAG 

(SEQ ID NO: 407) 

Start ATG: 734 

MSAAELEAEASAQEQQSQQQQQNQSQSEEINTNTAINANATTTNSYNNNLDTETQLQMHQDPQQQQELEGNSNNGITGLGLSLMNSSPPPHSYRH 
SPAYRHFKNPPQPHMCIRTTTEAGEELFINVLSWTRIVIPQEPSDPIPLYGGMRVPPGSPRSPPIVFAVMANPEVLKDSGRHSKDPEERRAMVEL 
MCDFVEAMNPGVKLVRNAVILKDRDISGELKDVWNAVQAQRDREREEQMMQQRQQQHYQNITTQQMFPKSPDASRAASAGAAVNEASSPPAHLSE 
PLLVEQGNGNGNGITLAVFAQQLDNKVASEQGEQQEQSPVSEDACVDQTDAGSSLNGSTTVLDNQPVAVENEPEVKVAPPAATNGASTPTPSHAT 
PTVTATATSSPSPTPTPVVAPVVTATPPPAAPAKKEKLGGFLPNGCIFPRFKNNKHKDKDNSHKEGKSKEKTLLNALKKSKEKKVAPSEQPTEKA 
AASDNGTTQYEKNCINNLECEVQKLDLNSGTNGDNSMFIKLKVSPANATAAAAAAK* 
(SEQ ID NO: 408) 



Celera Sequence No. : 142000013384676 

CTTTTCACCGAATACCACACAAAGCTGACGACGGGATGTTTCGCCACCCGTACTTCTCCCTTAGGGGTGGTTTCCTGTCACGAGCACAGGGTTGC 
TAAATGCGCGGCAAAGGCGGCCGAAAGTCCTGTGCTCGGCTTGAGAGCTCCACTGGCGAAATGTTTGTTTTTATTGATATTAATTTTTCTTCTGC 
TTCTCCTCCTCCTTCAGCTGTTTCTCCCTGAGCTTTTCCTCCTTCTTCAGCTGTTTCTCCCTTAGCTTCCGCTCTTTCTCCCGCTTCTTCACGAT 
TGCGGGATCATCGCTGTCCTCATTGGGGTCCTTCTCCTCCTTCTTCACCTCTTCATCGGGAATTTGTTCAAGACCCTATAGAGAAAGAAGCTTTA 
CACCTGAGGCAGCAAAAAAAAATCTGTAAGGAACTACAGGAAAAACCAAAAACGACCATTTCAACTATCACTTCCGAACTGCACGTTTCGTTTAT 
TCACTTTTGCACTTAAGCTTAAACGGAAAACTTTACTTAATTAAAATTAGTTCGTTTTCTCATCCGGTATGGCATCCAGACCCTGCAAGAAGATT 
GGTTCGGTTTTTTGTAAATGATTTAAGCTTATAACTCTTCGAATAAGCGTAAAAAAATGGATGGTATGAATTCGTAGGAAATAAAGTAACATTAA 
CT ACCGC C ACC AAT GAT CAAT AAACAAT AAAAACGAAATT AACC AAGAAAACT AAGAT TT TAAAT GC AAT T ACCTTATAATATAGAAAACTTTAA 
AACTAATATTTTATTAGTGAAGAAAGCTAGTTCATAACACCAAAAACAATCTATTTAACATACTTCAGAGAAAAACATAAACAACGAAACCAA^ 
TGAGCGTGCATAATAATGAAGTGCGTGCTGTGAGATAACAAAACTGCTGCCAAAAATAGGGTAAAAACGTGATGGTTGGCCTGCCGCACAAAAAT 
CAGTTGTAGGCAGGCCAGACTATAGCACAACACTTACGTTCATCTTGTAGGGATTGTCGGTCAGTCGGAAATCGCGCTGGTACATAACTCGGCTG 
AAACGCTCGTCCATGTTGCGCCGGAAAGCCTGTTGAGCAAACGATTAGCAAACAATTAGTAACTCCGTAGTGTTCCGTCGAACTCGTTGCGAAAA 
GCAGGCGGTTAAATAATGTTTAACAGATTCCGATTGGTGCACAAACAGCTGCGGTTATATAAGAATTTCGGGCTGCTGGGCCAGAAGGCCAGCGT 
GGGACTCACATTGCCGATCATCAGCTTGTCCAGGCCGTACATGGCGTATATGGGCACCGAGAGGAGCGTGGTGATGATCACCGCACCAGGTAGGA 
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TTTCGAACCACATATTGTCAATGGAAATAGATTGTTATGATATTTACTGCAAGACGACACCCAGTGTGACCGTGGATAGTCCGTTCCAATATACA 

GGTAGCCTTACAAAATATACCAAAAACTTGAATGTTGAACTAGAGGTGGGCCAGTAACGATAAGAAAGTAACGGTGACTGAATTCTATTCTGCTA 

CTGCGTTTAGAAAAACGTCAATGTTGATTTGAAAATAGTTTTTGACCAATAGTTGGCTTAAATAATTTGTAAATAGATCTGATATATCTAGTAGA 

GCTGATAAATGTACGCAATGTTTATAATCAATTTCAAAGCTCGAAAGTATATAGAAAACAAATTCCAGTGTAAAGCTGCGTGACCGTTTTACGCT 

CTATTCGGGTTGGTATTTTTCCCAATGCACTTTTCGTACATCCAGCTGGACCATTTTTTTGCACTCAGATCTCGGTCTTACCAATCCGATTCTTC 

ATTGAACAGCTACGAAGGAATTCGCAGAGAGTTCTGAGCGCAGCAATAGCAGCAAGAAGACTACCAACAAAGAGCAGAGCGACAAATCAGCGGAG 

AGCAGGATGGCCACCAGCGGTATTGTCGAGTCATTTCCCACGGGCGCACTGGTGCCCAAGGCGGAGACGGGAGTCCTGAACTTTCTGCAGAAGTA 

CCCGGAGTACGATGGACGCGACGTCACCATAGCCATCTTCGATTCCGGCGTCGATCCCCGGGCAACGGGACTGGAGGTGAGTTGGCACTTTTTCG 

CACTCTCTATACCGGTTACAAAGAACTTGTACAGTTATTCTTCCAATATTTACAATATTTCGTCTTACCTATTCTATTACGATTTAATCCTTCTA 

CTGTGTCACGACTGCTTCTTACTCCCGTTTCTCATTATAACCGATCCAATTTCAATCTGCTCCTACGCAACAGACGCTGTGCGATGGAAAGACCG 

TTAAAGTAATAGAGCGGTACGACTGTTCCGGATGCGGCGACGTGGACATGAAGAAGAAGGTGACGCCGGACGAGAACGGCAACATTAAGGGCCTG 

TCCGGAAACTCGCTCAAGCTGAGTCCGGAGCTGATGGCTCTGAACACAGATCCGGAGAAGGCGGTGCGGGTAGGCCTCAAGAGCTTCAGCGATTT 

GTTGCCCTCCAAGGTGCGGAACAACATTGTGGCCCAGGCCAAGCTGAAGCACTGGGACAAGCCGCACAAGACGGCCACTGCAAACGCCAGTCGCA 

AGATTGTTGAATTTGAGTCACAAAATCCAGGTGAGCTAAGTGATTGATTACTTGAGGCAAAATTACCAACAACATATGTATCTTTGCAGGAGAAG 

CCTCAAAACTGCCCTGGGACAAGAAGATATTGAAAGAGAATCTAGACTTTGAGCTAGAGATGTTGAATAGCTACGAGAAGGTGTACGGCGACATT 

AAGACCTCCTACGATTGCATCCTTTTTCCCACGGCCGACGGATGGCTGACAATCGTCGACACCACGGAGCAGGGCGATCTGGATCAGGCTCTGCG 

CATTGGCGAGTATTCCCGCACGCACGAGACCCGCAATGTGGACGACTTTCTTTCCATATCGGTAAACGTCCACGACGAGGGCAACGTACTGGAGG 

TGGTCGGCATGAGCTCACCCCACGGCACCCACGTATCGTCTATTGCCAGCGGCAACCATAGCTCCCGGGACGTGGATGGCGTGGCGCCGAATGCT 

AAGATCGTTTCTATGACCATCGGTGATGGTCGACTTGGGTCTATGGAGACCGGCACGGCACTAGTGCGTGCCATGACTAAGGTGATGGAACTATG 

TCGCGATGGCAGACGCATCGATGTGATCAACATGAGCTATGGCGAGCATGCCAATTGGTCAAATTCCGGGTAAGCAAGTGCGAAAGGCCGGCGGA 

ACTAATCCCAAAGTTGGTGATGTCTTTTAGTGTGTCTGGGTCAGGGTCTGGGTCGGATGAAATGCGTCGCAGAAGAAAGATGTGTGCGTCTAGGT 

AAATTTCATGAGAAATCCAAATATTCACACAGTCCAATCCTCTTGTCTTCGACACGTTCCAGCCGACACACACCTAACTGATAACCATCAAAATT 

CATGAAAAAATACCTAATCTATATCTCTAATCTCTTGCAGCCGCATTGGGGAGCTCATGAACGAGGTTGTCAACAAGTATGGCGTGGTGTGGGTG 

GCATCGGCCGGCAACCATGGTCCGGCACTTTGTACTGTGGGAACTCCGCCGGACATCAGCCAGCCCAGTTTGATCGGCGTGGGCGCGTACGTATC 

ACCCCAAATGATGGAGGCCGAGTATGCGATGCGAGAAAAGCTGCCCGGGAACGTGTACACCTGGACATCGCGAGATCCCTGCATCGACGGAGGTC 

AAGGCGTTACCGTATGCGCTCCGGGCGGAGCCATTGCGTCCGTGCCACAGTTTACTATGAGCAAGTCCCAGCTGATGAACGGTACCAGCATGGCG 

GCACCTCACGTCGCCGGCGCAGTGGCGCTGCTCATCTCCGGTCTGAAGCAACAGAACATCGAGTATTCGCCGTACAGTATTAAGCGAGCGATCAG 

CGTCACTGCCACCAAACTGGGCTATGTAGATCCCTTTGCTCAGGGCCATGGCTTGCTCAATGTCGAGAAGGCATTCGAGCATTTGACGGAGCACC 

GCCAGTCCAAGGATAATATGCTCAGGTGAGTGCTGTGGAACGCCAGATCTCAGGATCTCTAAACAAAAAACGATTAAATTTCGCGTTCGCATTCA 

CACTAGCTGAGTAAGGGCTAACTTGTTTTATAATTTAGTAGGTATATCGTTAAGATGACCGAATAGTAAACAATTGTCAAAGCAATCTTGTAAAA 

TTCCAACCAAATAATAAATTTCTTTCGTTTTAATCTTATAGGTTCTCCGTGCGCGTGGGCAACAACGCAGACAAGGGCATTCATTTGCGTCAGGG 

CGTGCAGCGTAACTCCATCGATTACAACGTTTATATAGAGCCCATCTTCTACAATGACAAGGAGGCGGGTGAGTAGAGACTATAGAGACCCAATT 

CCAGCTCTTACCGTTCCAATTTTACAGATCCCAAGGACAAGTTTAACTTCAATGTACGGCTAAATCTGATTGCCTCGCAGCCGTGGGTGCAGTGT 

GGAGCTTTCTTGGATCTCAGCTATGGCACCCGCTCCATTGCCGTGCGCGTCGATCCCACTGGACTCCAGCCAGGCGTTCACAGCGCTGTGTAAGG 

GACTCTTGAGCCAAAAACGCTTCCCCTTCTAATCGCATTTATAAAAATTTATCCAGGATTCGGGCTTATGACACTGACTGCGTACAGAAGGGTTC 

TCTCTTTGAGATTCCTGTCACGGTGGTGCAGCCCCATGTGCTGGAGTCGGATCAGAACACGCCCGTCTTCGAACCCGCCTCTTCCAAGGGAGACA 

ACAGCGTGGAGTTTCAGCCAAACACCATTCAAAGAGACTTCATCCTGGTGCCAGAACGTGCTACTTGGGCGGGTGAGTGTTATTCCCTGAAATAT 

GTTCACAAAGTGAGTCACACTTCATCCTTATGCAGAGTTGCGTATGCGTATAACCGATCCCAATCGTGGCGAGGACATTGGAAAGTTCTTTGTAC 

ACACGAACCAACTTCTTCCCAAGCAATCCTGTCGTAAGCTTGAGACCATGAAGATCGTATCGGTTGGCTCGGAAAACGAATCTATAATGGCTTTT 

AAAGTTAAGGTTGGTAAATCGATTGAAACCTTTCCAAGTCATTATACTGATTAAAACCCACATTTGTATGACAGTCTGGCAGGATTCTGGAGCTA 

TGCATTGCCAAGTACTGGTCCAACTACGGCCAGAGTCACCTGAAGTACAGCTTGCGTTTCCGTGGCGTGGAAGCGCACAATCCCAATGCCTGTGA 

GTCATTTGTAAAATGGTAATAGCTTTTGTTGAACTAAAGAAACCATTGTTTTGTAGACGTCATGCATGCGGGCAGGGGAATTCATAAGCTGGAGA 

TTGAGGCCCTGGTTGCCGAGGATGTGCAGCCCCAGCTACAGCTAAAGAATGCCGAAGTGGTGCTAAAACCGACCGAGGCCAAGATCTCGCCGCTA 

AGCGCCACACGAGACGTCATCCCAGATGGACGTCAGGTGTATCAGAACCTGCTGGCCTTTAACTTGAACGTGGCCAAGGCCGCAGATGTGTCGAT 

ATACGCACCGATCTTTAACGACTTGTTGTATGAAGCGGAGTTTGAGTCACAGATGTGGATGCTTTTTGATGCAAACAAGGCCCTGGTGGCCACCG 

GCGATGCTCACTCCCACACTTCCTTCACAAAGCTCGATAAGGGCGAATACACAATCAGACTGCAGGTGCGTCACGAGAAGCGCGACCTGCTGGAG 

AAGATCTCGGAGGCAAATCTGGTGGCCTCATTCAAGCTGACTAGCCCCCTCACCCTTGATTTCTATGAGAACTACAATCAGTGCATAGTGGGAGG 

TCGTAAATACGTCTCGAGTCCGCTAAGGCTGTCCACTCGGGTGCTTTATATCGCTCCTATTACCCAAGAGCGACTTACCAAGGCCAATCTGCCCG 

CTCAATGCGCCTGGCTGAGCGGCAATCTGGTATTCCCGCAAGATGAGGTCGGACGGCGAGTGGCTCAGCATCCATTTACTTACATCCTCAATCCC 

GCTGAGAAAAAGTCACACACGAATGGCTCGAGCAACGGTTCCAGCGCTGCAGGATCCACAGCTACGGCAGCTGCTGTCACCACTGCTAATGGCGC 

AAAACCGAAAGCTCCGGCGACTCCACAAGCGGCCACCTCAGTGACCAATCCTGCGGCCGGCGATGGAATTTCTGTTCAGAACGACCCGCCTGTGG 

ACAGCAGTGGGAGTCCCGCCTCACCCAAAAAGGGCAAGGCCAATGCCGACGATTATGCCGAAAGTTTTCGCGACTTTCAATGTTCGCAGATTGTC 

AAGTGTGGTAGGATTTTAAAAAATAGAGCACTATTTTCTGATTAGTTTAACATACGATCCTCTTACAGAACTGGAAATGGCAGAGAAAATCTATA 

ATGATGTAGTCGCTGCCCATCCCAAGCATTTGCAGGCAAACCTGCTACTTATCCAGAACATCGAGTCCAATCAGCTGAAGTCGCAACTGCCGCTG 

ACTTTCGTCAATGCTCAAAAGACATCGCCACCAGAGGCAGGCGAGAGCGCCGACAAGCAAAAGGAGGATCAAAAGAAAGTACGAAGCGCCCTGGA 

GCGAATTGTTAAGCTGGCCGACAAGGTGATCCAGGAGACCGATTCCGAGGCACTGCTCTCCTACTACGGTCTGAAGAACGACACTCGTGCCGATG 

CAGCCAAGATAAAGACGTATGTTGTTTAATTGGGTTAGGCATTCCAGTCAGAAATGAATTCTTTTCTTCTTTAGCAACATGGACAAACAGAAAAA 

CACTCTCATTGAGGCGCTAAGCAAGAAGGGCATTGCTGTGGCAAAGCTGGCTGTATTAGACGACTGTATCAAGGATAGCCTGGCTGAGATCAACG 

AGCTGTATACCGAGATTATCAAGTTCGTGGACGCCAACGACTCCAAGGCCATCCAGTTCGCTCTGTGGCACGCCTATGCCCATGGCCACTACGGT 

CGCATGTACAAGTATGTGGTTAAGCTAATCGAGGAGAAGCGAACCCGCGATCACTTCGTGGAGCTGGCCGCCATCAACGGCGCCCTGGGTCACGA 

GCACATCCGTACTGTCATCAACCGTATGATGATCACTGCCTTTCCCAGCAGCTTCCGTTTGTTCTGAGCACTTTCCGACAGAATACGCAACAAAT 

ATTCACGCTTCGAAGCGATCATTGTTTTGTATAACTAATGTAATCCATATGTATATGAATAAATGCTATTAACTGAAATTCAATATTGTTAGTCA 

T T T C ATT AAAAC T AAATT AT T AAAT T T AAC T AAG AT T AAT GAT C G AT T G AT T T T C C GGT AAG AAAAAAGG AAAT G AG AAT T G AAC AAT T T T AT T A 
AGGAGTGAATTTTAAAATCCCGCCAAACCGTTACGATGTTGAACATAGCATTTTCCAGCACTGGTGCCTGGTAATTGAGCCGACTCCGCGTGACG 
GTCATACTTATTGCAGCAAAAAAACAAAAAACAATGCAGACGCACACGAGAGTGTAAAATGGGTAAGTTTGACAGAAAAAGAGCTGTTTAAAACT 
GGAAAATTGTCTTATTGTGCGGCTAATAATATGAACTCGTGGGTGACATGTGAAAAGTGCAAGAATAACGACTTCCAACTGCAGTTTCCTGCTAC 
GTAGTTGCGATGAGGCTTCCGCCTGCGTGTGCAAGTGTGTGTGTGCGTGTATGTATTGGCCGATCGATTTTATTTTGCATATTCAAGGACAGCGA 
AGTCCTGCGGGAAAGTGCGGCCGGGATTAGAGGAAACGGGAATCGGGGCGGCGCCGGAAACTAGAGCGAGCTCCCGTTCCCAAGCCGAATGACTC 
AGTAATCGCCCATGAAATCCCCGGAACGATATCAGCTTACAATCTGTGTACATGCATTTGTTGCTTAAGTTCCACCCCGCATCCCAGTCCCGCTT 
TCACTCCCACTCCAACCTACCCCTTGCCCACCCACTGTGTCAACTCCGGAGAGGACAAACAATGGCCCCTTGCAGCCGGATGACAAATGCGGGTG 
CAGCGACATCAAGAAAGAACCTTCGAATTTGGTGACGCGACTATCAGATACTCTTACTTTCTTCGTTTGCCTATGAAGTTCATTATCATTTTTGT 
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GTCATTAAAGCGATTTATAATTAGATCAATCAGACTAATTCAACGTATTGTGACTTATAAATCAGTATAACCTATGAAAGTTTTTTTAACCACCC 
ATTAATTTCTGGTTAGGATTCAATTGAAGT 
(SEQ ID NO: 409) 

Exon: 1001. .1324 

Exon: 1815.. 2071 

Exon; 2259.. 2595 

Exon: 2655.. 3204 

Exon: 3461.. 4015 

Exon: 4222. .4343 

Exon: 4398.. 4554 

Exon: 4617.. 4822 

Exon: 4881.. 5044 

Exon: 5110. .5221 

Exon: 5282.. 6277 

Exon: 6339.. 6666 

Exon: 6725.. 7200 

Start ATG: 1907 

Transcript No. : CT13277 

GGATTGTCGGTCAGTCGGAAATCGCGCTGGTACATAACTCGGCTGAAACGCTCGTCCATGTTGCGCCGGAAAGCCTGTTGAGCAAACGATTAGCA 

AACAATTAGTAACTCCGTAGTGTTCCGTCGAACTCGTTGCGAAAAGCAGGCGGTTAAATAATGTTTAACAGATTCCGATTGGTGCACAAACAGCT 

GCGGTTATATAAGAATTTCGGGCTGCTGGGCCAGAAGGCCAGCGTGGGACTCACATTGCCGATCATCAGCTTGTCCAGGCCGTACATGGCGTATA 

TGGGCACCGAGAGGAGCGTGGTGATGATCACCGCACCAGCTACGAAGGAATTCGCAGAGAGTTCTGAGCGCAGCAATAGCAGCAAGAAGACTACC 

AACAAAGAGCAGAGCGACAAATCAGCGGAGAGCAGGATGGCCACCAGCGGTATTGTCGAGTCATTTCCCACGGGCGCACTGGTGCCCAAGGCGGA 

GACGGGAGTCCTGAACTTTCTGCAGAAGTACCCGGAGTACGATGGACGCGACGTCACCATAGCCATCTTCGATTCCGGCGTCGATCCCCGGGCAA 

CGGGACTGGAGACGCTGTGCGATGGAAAGACCGTTAAAGTAATAGAGCGGTACGACTGTTCCGGATGCGGCGACGTGGACATGAAGAAGAAGGTG 

ACGCCGGACGAGAACGGCAACATTAAGGGCCTGTCCGGAAACTCGCTCAAGCTGAGTCCGGAGCTGATGGCTCTGAACACAGATCCGGAGAAGGC 

GGTGCGGGTAGGCCTCAAGAGCTTCAGCGATTTGTTGCCCTCCAAGGTGCGGAACAACATTGTGGCCCAGGCCAAGCTGAAGCACTGGGACAAGC 

CGCACAAGACGGCCACTGCAAACGCCAGTCGCAAGATTGTTGAATTTGAGTCACAAAATCCAGGAGAAGCCTCAAAACTGCCCTGGGACAAGAAG 

ATATTGAAAGAGAATCTAGACTTTGAGCTAGAGATGTTGAATAGCTACGAGAAGGTGTACGGCGACATTAAGACCTCCTACGATTGCATCCTTTT 

TCCCACGGCCGACGGATGGCTGACAATCGTCGACACCACGGAGCAGGGCGATCTGGATCAGGCTCTGCGCATTGGCGAGTATTCCCGCACGCACG 

AGACCCGCAATGTGGACGACTTTCTTTCCATATCGGTAAACGTCCACGACGAGGGCAACGTACTGGAGGTGGTCGGCATGAGCTCACCCCACGGC 

ACCCACGTATCGTCTATTGCCAGCGGCAACCATAGCTCCCGGGACGTGGATGGCGTGGCGCCGAATGCTAAGATCGTTTCTATGACCATCGGTGA 

TGGTCGACTTGGGTCTATGGAGACCGGCACGGCACTAGTGCGTGCCATGACTAAGGTGATGGAACTATGTCGCGATGGCAGACGCATCGATGTGA 

TCAACATGAGCTATGGCGAGCATGCCAATTGGTCAAATTCCGGCCGCATTGGGGAGCTCATGAACGAGGTTGTCAACAAGTATGGCGTGGTGTGG 

GTGGCATCGGCCGGCAACCATGGTCCGGCACTTTGTACTGTGGGAACTCCGCCGGACATCAGCCAGCCCAGTTTGATCGGCGTGGGCGCGTACGT 

ATCACCCCAAATGATGGAGGCCGAGTATGCGATGCGAGAAAAGCTGCCCGGGAACGTGTACACCTGGACATCGCGAGATCCCTGCATCGACGGAG 

GTCAAGGCGTTACCGTATGCGCTCCGGGCGGAGCCATTGCGTCCGTGCCACAGTTTACTATGAGCAAGTCCCAGCTGATGAACGGTACCAGCATG 

GCGGCACCTCACGTCGCCGGCGCAGTGGCGCTGCTCATCTCCGGTCTGAAGCAACAGAACATCGAGTATTCGCCGTACAGTATTAAGCGAGCGAT 

CAGCGTCACTGCCACCAAACTGGGCTATGTAGATCCCTTTGCTCAGGGCCATGGCTTGCTCAATGTCGAGAAGGCATTCGAGCATTTGACGGAGC 

ACCGCCAGTCCAAGGATAATATGCTCAGGTTCTCCGTGCGCGTGGGCAACAACGCAGACAAGGGCATTCATTTGCGTCAGGGCGTGCAGCGTAAC 

TCCATCGATTACAACGTTTATATAGAGCCCATCTTCTACAATGACAAGGAGGCGGATCCCAAGGACAAGTTTAACTTCAATGTACGGCTAAATCT 

GATTGCCTCGCAGCCGTGGGTGCAGTGTGGAGCTTTCTTGGATCTCAGCTATGGCACCCGCTCCATTGCCGTGCGCGTCGATCCCACTGGACTCC 

AGCCAGGCGTTCACAGCGCTGTGATTCGGGCTTATGACACTGACTGCGTACAGAAGGGTTCTCTCTTTGAGATTCCTGTCACGGTGGTGCAGCCC 

CATGTGCTGGAGTCGGATCAGAACACGCCCGTCTTCGAACCCGCCTCTTCCAAGGGAGACAACAGCGTGGAGTTTCAGCCAAACACCATTCAAAG 

AGACTTCATCCTGGTGCCAGAACGTGCTACTTGGGCGGAGTTGCGTATGCGTATAACCGATCCCAATCGTGGCGAGGACATTGGAAAGTTCTTTG 

TACACACGAACCAACTTCTTCCCAAGCAATCCTGTCGTAAGCTTGAGACCATGAAGATCGTATCGGTTGGCTCGGAAAACGAATCTATAATGGCT 

TTTAAAGTTAAGTCTGGCAGGATTCTGGAGCTATGCATTGCCAAGTACTGGTCCAACTACGGCCAGAGTCACCTGAAGTACAGCTTGCGTTTCCG 

TGGCGTGGAAGCGCACAATCCCAATGCCTACGTCATGCATGCGGGCAGGGGAATTCATAAGCTGGAGATTGAGGCCCTGGTTGCCGAGGATGTGC 

AGCCCCAGCTACAGCTAAAGAATGCCGAAGTGGTGCTAAAACCGACCGAGGCCAAGATCTCGCCGCTAAGCGCCACACGAGACGTCATCCCAGAT 

GGACGTCAGGTGTATCAGAACCTGCTGGCCTTTAACTTGAACGTGGCCAAGGCCGCAGATGTGTCGATATACGCACCGATCTTTAACGACTTGTT 

GTATGAAGCGGAGTTTGAGTCACAGATGTGGATGCTTTTTGATGCAAACAAGGCCCTGGTGGCCACCGGCGATGCTCACTCCCACACTTCCTTCA 

CAAAGCTCGATAAGGGCGAATACACAATCAGACTGCAGGTGCGTCACGAGAAGCGCGACCTGCTGGAGAAGATCTCGGAGGCAAATCTGGTGGCC 

TCATTCAAGCTGACTAGCCCCCTCACCCTTGATTTCTATGAGAACTACAATCAGTGCATAGTGGGAGGTCGTAAATACGTCTCGAGTCCGCTAAG 

GCTGTCCACTCGGGTGCTTTATATCGCTCCTATTACCCAAGAGCGACTTACCAAGGCCAATCTGCCCGCTCAATGCGCCTGGCTGAGCGGCAATC 

TGGTATTCCCGCAAGATGAGGTCGGACGGCGAGTGGCTCAGCATCCATTTACTTACATCCTCAATCCCGCTGAGAAAAAGTCACACACGAATGGC 

TCGAGCAACGGTTCCAGCGCTGCAGGATCCACAGCTACGGCAGCTGCTGTCACCACTGCTAATGGCGCAAAACCGAAAGCTCCGGCGACTCCACA 

AGCGGCCACCTCAGTGACCAATCCTGCGGCCGGCGATGGAATTTCTGTTCAGAACGACCCGCCTGTGGACAGCAGTGGGAGTCCCGCCTCACCCA 

AAAAGGGCAAGGCCAATGCCGACGATTATGCCGAAAGTTTTCGCGACTTTCAATGTTCGCAGATTGTCAAGTGTGAACTGGAAATGGCAGAGAAA 

ATCTATAATGATGTAGTCGCTGCCCATCCCAAGCATTTGCAGGCAAACCTGCTACTTATCCAGAACATCGAGTCCAATCAGCTGAAGTCGCAACT 

GCCGCTGACTTTCGTCAATGCTCAAAAGACATCGCCACCAGAGGCAGGCGAGAGCGCCGACAAGCAAAAGGAGGATCAAAAGAAAGTACGAAGCG 

CCCTGGAGCGAATTGTTAAGCTGGCCGACAAGGTGATCCAGGAGACCGATTCCGAGGCACTGCTCTCCTACTACGGTCTGAAGAACGACACTCGT 

GCCGATGCAGCCAAGATAAAGACCAACATGGACAAACAGAAAAACACTCTCATTGAGGCGCTAAGCAAGAAGGGCATTGCTGTGGCAAAGCTGGC 

TGTATTAGACGACTGTATCAAGGATAGCCTGGCTGAGATCAACGAGCTGTATACCGAGATTATCAAGTTCGTGGACGCCAACGACTCCAAGGCCA 

TCCAGTTCGCTCTGTGGCACGCCTATGCCCATGGCCACTACGGTCGCATGTACAAGTATGTGGTTAAGCTAATCGAGGAGAAGCGAACCCGCGAT 

CACTTCGTGGAGCTGGCCGCCATCAACGGCGCCCTGGGTCACGAGCACATCCGTACTGTCATCAACCGTATGATGATCACTGCCTTTCCCAGCAG 

CTTCCGTTTGTTCTGAGCACTTTCCGACAGAATACGCAACAAATATTCACGCTTCGAAGCGATCATTGTTTTGTATAACTAATGTAATCCATATG 

TATATGAATAAATGCTATTAACTG 

(SEQ ID NO: 410) 
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Start ATG: 417 

MATSGIVESFPTGALVPKAETGVLNFLQKYPEYDGRDVTIAIFDSGVDPRATGLETLCDGKTVKVIERYDCSGCGDVDMKKKVTPDENGNIKGLS 
GNSLKLSPELMALNTDPEKAVRVGLKSFSDLLPSKVRNNIVAQAKLKHWDKPHKTATANASRKIVEFESQNPGEASKLPWDKKILKENLDFELEM 
LNSYEKVYGDIKTSYDCILFPTADGWLTIVDTTEQGDLDQALRIGEYSRTHETRNVDDFLSISVNVHDEGNVLEVVGMSSPHGTHVSSIASGNHS 
SRDVDGVAPNAKIVSMTIGDGRLGSMETGTALVRAMTKVMELCRDGRRIDVINMSYGEHANWSNSGRIGELMNEVVNKYGVVWVASAGNHGPALC 
TVGTPPDISQPSLIGVGAYVSPQMMEAEYAMREKLPGNVYTWTSRDPCIDGGQGVTVCAPGGAIASVPQFTMSKSQLMNGTSMAAPHVAGAVALL 
ISGLKQQNIEYSPYSIKFJ^ISVTATKLGYVDPFAQGHGLLNVEKAFEHLTEHRQSKDNHLRFSVRVGNNADKGIHLRQGVQRNSIDYNVYIEPIF 

YNDKeSpKDKFNFNVRLNLIASQPWQCGAFLDLSYG 
E^ASSKGDNSVEFG 

IAKYWSNYGQSHLKYSLRFRGVEAHNPNAYVMHAGRGIHKLEIEALVAEDVQPQLQLKNAEWLKPTEAKISPLSATRDVIPDGRQVYQNLLAFN 

LN V AKAAD V S I YA PIFNDLLYEAEFES QMWML F DAN KAL VAT GD AH S HT S FT KLD KGE Y T I RL Q VRH EKRDL L E K I S E AN L V A SFKLTSPLTLDF 
YENYNQCIVGGRKYVSSPLRLSTRVLYIAPITQERLTKANLPAQCAWLSGNLVFPQDEVGRRVAQHPFTYILNPAEKKSHTNGSSNGSSAAGSTA 

TAAAVT^ANGAKPKAPATP 

LQANLLLIQNIESNQLKSQLPLTFVNAQKTSPPEAGESADKQKEDQKKVRSALERIVKLADKVIQETDSEALLSYYGLKNDTRADAAKIKTNMDK 
QKNTLIEALSKKGIAVAKLAVLDDCIKDSLAEINELYTEIIKFVDANDSKAIQFALWHAYAHGHYG^ 
GHEHI RTVINRMMI T AFP SS FRLF* 
(SEQ ID NO: 411) 

Name: tripeptidyl-peptidase II 
Classification: peptidase 
Gene Symbol: TppII 
FlyBase ID: FBgn0020370 

CPlera Seauence No. : 142000013385212 

ACAAACTATATTTCTTAATGTCTCGACTCGCTAGTTCAGTCTGAAATAGGGCTGCTGGTATCTGCCTATGGGCATCCGATCCTTGGTCCTCAGTC 
GTCGCACTGCCTCTCGTATGAATTTGGGCTGCAGGGCACCGGATTCACCTTGGGCCTCCATCACGTCGAGGGCTTCCTCCACAACCTCGCCGACG 
AAGACCTTCGCAATGCCGGACATGGCTATCACAACATTCTGGGACACGGAACAGCCGGTGATAGTTTGCATTAGACGCTTGACGGCGGCCTTGGG 
AAAGGCAGAGCGACGATACATTTCGTAGCGATCCAGCTGTTCTTCCGTAAAGTTGGAAACGAGAACCCTGGAGTGGAATAAAATAGTTTTTATAG 

G^TACCCCTT^ 

GCTCGCCAGAGTCACCATCTCCGTCCGTGTTCTTGTTGTCGTTGTCCCCATCCGCATCCTTGCCATCACGATCCGCATCGTTTCCCGGATCCGAG 
GTGTCGCTGTCCTTCCGCTCGCCGGAGGCCGACTGGAAGAATTTCAGGTCGACATCGTCGCCGTCGCTTAGGGAGTTGCTTTTTTGCTGCGTGGG 
AAAGAGGATTTCGTCCATTTAAATGAAATTTGAAAAGCACCAGCGAATAATAAACAAAATCAACCAGGGATGGAGTTGCACCATATTCGCTTTTT 
GCAACACTCAAAAGTTCCGCACCACATAATAGATGAATATAATTCAAAATATCGGATTAAATATATTAAAGGTATATCATAGTTGTTTATAAGGT 
GGCTTTGAGATTTTAATAAATAATAAATATATAAATATACAATATTCGATAAACTTTTATTTTTGTAGTGTTTTTTTTTTCTTCAAATTAGTGCT 
CAAAGTCAAATTATGTAAATTTCAAATTGAACATTCACTGGTTCCGATAGTCCACAGCACGTTTTACAGCACTGGCCTCTGACACCTCGTGTTTT 
TGTTCTTTTCTCCGCATCGCCATCGCCAGGCAAGAGTGCGCGCTACATTTTCGTTTATTTCATCGTTCAGCCAGCAACCAAGGCACTGCGAAATG 
TCTGCAACCGAGACTCTGAATGTGGAGGCGGCAACGCCCGAGATCGGCGAGGAGGTCAAGAAACAAAAGAAGCCCAAGCCGAACAACAAGAAGCT 
CCGCCAAGGTGAGTGACCTCATGCGAGCAGTGCACTGTGGTAATTGCATTTGCACGACCCTCTCCGCCTCAACCACCCACCACCCACTGAACTAA 
CCACTTGCCCACAGAAGCGGCAGCCAAAATAGCCTCCGGAGAAGGCGGCGACGAGGAGCTGACCACAAACGGAGATGCCAAACCGGCAACACCGG 
CCGCTCAGCCGGCAAAGAAGAAGGGGAACAAGGGCAAGAAAAGCGGTCAGACTGATCCGCCCACCATACCCATTGCTAAGCTTTATCCGGACGGC 

AAC T T C C C C G AGG GCG AG AT C GT GG AGC AC C C C AC GC C C AAGG AT ATGC C CG AC GAT C GT AC G GC C AAG GAC C GAT T C AC AT C GG AGG AG AAG C G 

TGCTCTGGACCGTATCAACACGGACATCTACCAGGAATTGCGCCAGGCGGCTGAGGCCCATCGTCAGACGCGTCAGTATATGCAGCGCTACATTA 

AGCCGGGCATGACCATGATCCAAATCTGCGAGGAGCTGGAGAACACTGCTCGGCGCCTGATCGGCGAAAATGGTCTGGAGGCTGGTTTGGCCTTC 

CCGACTGGTTGCTCTCTTAACCATTGCGCCGCCCACTACACGCCCAATGCCGGAGACCCCACTGTACTGCAGTACGACGATGTGTGCAAGATTGA 

TTTTGGTACCCATATCAAGGGGCGGATCATTGATTGCGCCTTCACCCTGACCTTCAACAACAAGTACGACAAGTTGCTGCAGGCCGTCAAGGAGG 

CCACCAACACGGGCATCAGAGAGGCGGGCATCGATGTGCGCCTATGCGACATCGGCGCCGCTATCCAGGAGGTCATGGAGTCGTACGAGATCGAG 

TTGGACGGCAAAACATATCCCATCAAGGCGATTCGCAACTTAAATGGACACTCCATCAGCCCGTACCGTAAGCAAAAATCTATTTATTATAAATA 

TTTTAAATATGCTTATGCTTTCCTTAATATTTTTTGTTTAAATTTATTGTGATTTGTGCAAACACTTAATGAAGTCTAAAATTAAATTCTCAGGC 

ATTCATGCTGGAAAAACGGTGCCCATTGTGAAAGGCGGAGAGTCCACTCGCATGGAGGAGGATGAATTCTACGCCATTGAGACGTTCGGCTCGAC 

GGGTCGCGGTCTGGTCCATGACGATATGGACTGCTCGCATTACATGAAGAACTTTGATCTGCCGTTTGTGCCGCTGCGCCTGCAGTCATCCAAGC 

AGCTGCTTGGCACCATCAACAAAAACTTCGGCACCCTGGCCTTCTGCAAGCGTTGGCTGGACCGCGCCGGTGCCACGAAGTACCAGATGGCTCTC 

AAGGATCTGTGCGACAAGGGCATTGTGGAGGCTTATCCGCCGTTGTGCGACATTAAGGGCTGCTACACGGCTCAGTACGAACACACAATTATGCT 

GCGACCCACCTGCAAAGAAGTCGTTTCCCGTGGCGACGACTACTAGTGGGATGAGGAACTGGACAGTGACCGATTTTCGGCGATGTATGGCTTGC 

ATTATATACGGTTTTTGATATAAGTTCATTCTATTGTTGATTTTTTGTGAGCATGCGCCAACCGAATTATATATACAATGTATTATGCTCAGAAT 

ATAAGCGGAGTGAGATTATTTTCGAAGTTGTTCTTGTCTTGGCAGATTGCTACGGAGCTCACGCCAGTAAAGTGAATAGTTGTGATTGATAGGCT 

CCTTATCTAATAAGAATGACCTGTATATGTAGACAAATCCTTTGCCTGTAAGATAGTACAGGATCTCTATGTAACTGACTATGCTGAAACCGTAC 

ATAAGCGAGAGAATGCCTCCAAAGCGATCTAAAGGGTTAGTTAATATTTTTTAACTACTCGTCTTAGTTATTTACTGTACTCACTTAGCAGAACA 

ACCCAACTGCTAACCATTTGAGATTTTGTGGCTTGGGCAAAGCGCAGTTTAAAGAAAAGCTTTATAATGGCTAATGGTCGCTCGTTGGTCAATCC 

CTTTCTAAAATATTCGATATTCATTATTTATCAAAGAAATTGTGGTACATCACTTACAGCAACCCGGTAAAGTAACCACTGCTATTATAAGTCCG 

TCTCAAAGGAGCCACGTTGGTGGTGATACTCATGCTAACTCCGTTGCAGGTGGGCAAACAGTGCTCGCACTGCCTCTGGACCAAGTTCTCTTCCT 

TGTTCTGCAGGTAGGCGAAGTCACTGTAGCCATACCAAATGCTCTTCCATCTCATTAGGCATGGAAGATCGGGAAGCATACAATAGTCGAGATCC 

ATGGAACTGCCGGTCAAGGGCGGAGATACACAGCCGCAATGATCGATCATGCTCTTCATGCGGCATACCAGAAGGCACTCGTCCAGGGAGTAGTA 

CGATCTGTAGGAACGATGTTCACCAGGTATTCCGCGGATTGGTTTCTAATTACTTACTGGTTCAGCAACCGCATGCCTTCTTCGGGATAGTAACA 

ACGTCGTATATGTGGCGGGACTCCGCGAACCCCATCGCCTGAACTAAAAAACATCGGCTGGATGGGAATCTCCACTATGCTGTAGTGATCCACCA 

GCACCTCTCCCAAAGTCGTAGACTGAATGGTGGCGTAGTCCTCTTGGGGGAAAATTATGAGCTAAGG 

(SEQ ID NO: 412) 



Exon: 1001.. 1243 
Exon: 1345.. 2157 
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Exon: 2279. .2867 
Start ATG: 1138 



Transcript No. : CT13300 

TCCACAGCACGTTTTACAGCACTGGCCTCTGACACCTCGTGTTTTTGTTCTTTTCTCCGCATCGCCATCGCCAGGCAAGAGTGCGCGCTACATTT 
TCGTTTATTTCATCGTTCAGCCAGCAACCAAGGCACTGCGAAATGTCTGCAACCGAGACTCTGAATGTGGAGGCGGCAACGCCCGAGATCGGCGA 
GGAGGTCAAGAAACAAAAGAAGCCCAAGCCGAACAACAAGAAGCTCCGCCAAGAAGCGGCAGCCAAAATAGCCTCCGGAGAAGGCGGCGACGAGG 
AGCTGACCACAAACGGAGATGCCAAACCGGCAACACCGGCCGCTCAGCCGGCAAAGAAGAAGGGGAACAAGGGCAAGAAAAGCGGTCAGACTGAT 
CCGCCCACCATACCCATTGCTAAGCTTTATCCGGACGGCAACTTCCCCGAGGGCGAGATCGTGGAGCACCCCACGCCCAAGGATATGCCCGACGA 
TCGTACGGCCAAGGACCGATTCACATCGGAGGAGAAGCGTGCTCTGGACCGTATCAACACGGACATCTACCAGGAATTGCGCCAGGCGGCTGAGG 
CCCATCGTCAGACGCGTCAGTATATGCAGCGCTACATTAAGCCGGGCATGACCATGATCCAAATCTGCGAGGAGCTGGAGAACACTGCTCGGCGC 
CTGATCGGCGAAAATGGTCTGGAGGCTGGTTTGGCCTTCCCGACTGGTTGCTCTCTTAACCATTGCGCCGCCCACTACACGCCCAATGCCGGAGA 
CCCCACTGTACTGCAGTACGACGATGTGTGCAAGATTGATTTTGGTACCCATATCAAGGGGCGGATCATTGATTGCGCCTTCACCCTGACCTTCA 
ACAACAAGTACGACAAGTTGCTGCAGGCCGTCAAGGAGGCCACCAACACGGGCATCAGAGAGGCGGGCATCGATGTGCGCCTATGCGACATCGGC 
GCCGCTATCCAGGAGGTCATGGAGTCGTACGAGATCGAGTTGGACGGCAAAACATATCCCATCAAGGCGATTCGCAACTTAAATGGACACTCCAT 
CAGCCCGTACCGCATTCATGCTGGAAAAACGGTGCCCATTGTGAAAGGCGGAGAGTCCACTCGCATGGAGGAGGATGAATTCTACGCCATTGAGA 
CGTTCGGCTCGACGGGTCGCGGTCTGGTCCATGACGATATGGACTGCTCGCATTACATGAAGAACTTTGATCTGCCGTTTGTGCCGCTGCGCCTG 
CAGTCATCCAAGCAGCTGCTTGGCACCATCAACAAAAACTTCGGCACCCTGGCCTTCTGCAAGCGTTGGCTGGACCGCGCCGGTGCCACGAAGTA 
CCAGATGGCTCTCAAGGATCTGTGCGACAAGGGCATTGTGGAGGCTTATCCGCCGTTGTGCGACATTAAGGGCTGCTACACGGCTCAGTACGAAC 
ACACAATTATGCTGCGACCCACCTGCAAAGAAGTCGTTTCCCGTGGCGACGACTACTAGTGGGATGAGGAACTGGACAGTGACCGATTTTCGGCG 
ATGTATGGCTTGCATTATATACGGTTTTTGATATAAGTTCATTCTATTGTTGATTTTTTGTGAGCATGCGCCAACCGAATTATATATACAATGTA 

TTATGCTCAGAATATAAGCGGAGTGAGATT 
(SEQ ID NO: 413) 

Start ATG: 138 

MSATETLNVEAATPEIGEEVKKQKKPKPHNKKLRQEAAAKIASGEGGDEELTTNGDAKPATPAAQPAKKKGNKGKKSGQTDPPTIPIAKLYPDGN 

FPEGEIVEHPTPKDMPDDRTAKDRFTSEEKRALDRINTDIYQELRQAAEAHRQTRQYMQRYIKPGMTMIQICEELENTARRLIGENGLEAGLAFP 

TGCSLNHCAAHYTPNAGDPTVLQYDDVCKIDFGTHIKGRIIDCAFTLTFNNKYDKLLQAVKEATNTGIREAGIDVRLCDIGAAIQEVMESYEIEL 

DGKTYPIKAIRNLNGHSISPYRIHAGKTVPIVKGGESTRMEEDEFYAIETFGSTGRGLVHDDMDCSHYMKNFDLPFVPLRLQSSKQLLGTINKNF 

GTLAFCKRWLDRAGATKYQMALKDLCDKGIVEAYPPLCDIKGCYTAQYEHTIMLRPTCKEVVSRGDDY* 

(SEQ ID NO: 414) 

Name: methionine aminopeptidase 
Classification: peptidase 
Gene Symbol: und 
FlyBase ID: FBgn0025117 

Celera Sequence No. : 142000013384676 

TGTCCTTAGCCTCGGCCATAGCGGAGGAGAATGAATCCCCGAGAGGTGTCTCGGTTGAACGGTGGGAACTGAAGAAGGCGGCGTTGGCGAAGACG 
CTGCTGATTTGTTTGCCTGTTGACCGACGAGCGACAACTGTGCTCATTTATCAGCTGCCGGCCAACTCAAAAGCTCCGCTGCGCACGGAAAGCTT 
TCGCTTGCGATCAGCTGTTTTTGCAGCTTTGCCTTTTGCGCCCGCCATTCGCGTATGTAAACAACATAGCGAAAATTGCTGTAATTGCATGTTTT 
TGTCGCCAAATGTTGCGTATTCGCACTGATAGCACCGCACCCAAGTTCATAGTACGTATACGCTCAATTGGCGCTGATTGCCGGCTGGCATATCG 
GCATCGAAATCAGAGCGCTATGTGCACAACATCATGATTTGGGGTCGTCGTTGTCAAGTTCATCAGTGAATCGAATGCACAAACATGGGTTTGAA 
AGTCGAAGGGTTATGATTATTGCTCACAGATTAGGTCGCCACTCGGAAACAGCTCTGCTCCACTGGCAATCAAATCGTTTCAAAGCATTTCCATT 
CCATCTGGCTTATTCCACATATTAGTTATTATTAAGCGTGTGATGTGTATCTTAATTGTCATGCAAGAAAAACATAAATAAATAATGAATGTTTC 
TTTGTGGAAGTTGTTCTTGTAAATTTTACAACAAATGCCACATAATATTACCCTTATCTTTATCGGGATATTCAAAAACGTTTGCGATTGCGCAG 

GCGATAAGAAACACCAATAAAAGGGCACGTAACTACCAAATAAACTT^ 

AAGTTTTCCTAGACTTAAAATTTTTCATTAAAATGCAATGCAACAAAGTGCAAAGAGCAAGTGCAAGGCTAGCTATTAACTTAAACAATTCATAC 
TGGACTGGTCGTGAAGAAGTGATGAAAAAATGATGGGGCCAGCAGGCCTTAAAACTTAACAGCAAGGCCATTGTCGAAAGAAGATCATTCCTTTC 
CAACAACAAGGCTTATAGGACGGAGCTGGAAACACTCTCGATGACCTCAAACGCATTGTCTATTTCCGAACTCTCCAGCAAACGGTTGACAGCAA 
TGCGAATGCTGGGACGGACTGGTTGGCGTTCCCTGTTCTGCAGGTAGGCAGCCTGTACCACAGCCACTCCACGAGCAATGCACTGGGAAAAGTGA 
GTAGAAATGTATAGTTTTCATAATTGCATGGGGAAATAAAATGCCTACCTTATCAGCCAGCTCTGTTAGAAGTTTCAGTTCCTTGTCAAAGTTCT 
CGGCGGGCTGGGCAAGATACAAGTGCTTCACTGGTGACACCTCATCGCCGCGCAGGGTTAGCTTGCTGAATCGCAAAAACTTCTGGTGCAACGTC 
TTAGACTTTGCCTGCAGCTGCTCGAAGATTTGTGGTTCTCGTTCGAAACGATCCAACGCTGAAATAGCCGCCTGGGTGAGCATGGGCGGCAGCGA 
GGCCGAGAAGATGTAGCCCAAGCCAGAGAGGCGCTGGTGCTCGGCTATGAAGTGCGAGCCAACACAGAAGCCACCGACTGTTGCCATCGAACCCT 
CCATGCCGGCCGATATCAGATCGACTTCATCACGCTAAAAGAAAAATACCAGTTATAATTTTAGTTTTTCTTTCCGGTTTACCACCTACATCGAC 
ATTAAAGTGCTCCGTAACTCCGTGACCACCCTGGCCCAATGTGCCAAAGGAGATGCTTTCGTCGATGAACAAACGCAGCTTGTACTTCTGACGCA 
AGGCCACCAGGTCGGGAAGTGGGCAAATCTCGCCCGTATTCATATAGATACCCTCGGCGACGAGGAAGCGACGTGTCTTAGCCGCCTTTTTCGGA 
TTTTTCTGGTCTCGTTTTTCTTGCTCAATCAGCAGACGTTCTAGATCCTCAACGTCATTGTGCTTAAAGAAAACGATTGTGCTTCGTGAAGCATC 
CAGACCCTTCTGAATGGCAAAGTTGACAGCCTCATCCCTACAAATAACAAGTAATTAGTCGCCAATCTGAGTAGGTGAAGTAGACAATCCTACAC 
AAAGATAAGGTCGCCACGTTTGGCATACGCCGGAATGGCACTGGCCACGGTCGAGAAGCCGTAGGAATAGACAATGGCCTCCTCCAGGCCCATGA 
ACTTGGCAATACGATCCTCCAGGTCCAGATGCACGTCCATAGTGCCGTAGAAGCCCCGAGGTCCGCAAGATCCAACTCCGTACTTGCGCAGCGAC 
TTGCAGGCCTCCTCCAGTATCTCCTGGTCCTCGAGGAAGCCAAGATAGTTGTGAGAGCCAAGATTTAGGCAGTCGTGTCCATCGACCTGGATGCG 
CTTGCCCACCCTGGACTGCACAACGCGGGTGTGCAGCAGGGGATGATTGGGATCGGTGTCGGCCACCAAGGGCTCTGGTTCATAGTCGGCGATTA 
TTCGGTCTTCCTCCTCCTTGGTCAGCTGTCGGCGACGACCTCCTCCTCGTCTGTGGAGCAGCAACCAAATCACGGTTATGAGCAGTAGGGTCTCC 
AAGACGAGAGCAAACGTGGGGGTCTGTGGAAGGAGTAGAAACTGGAGTAAGCTAAATGTGGTGAAAAAGGCGAATGCAAACTTACGTTTCGGAAA 
ATACTGCCGATTTCGTTGAACAATTGGATGGCCACCATTTTGACAGTCTGGTTCCGTGCCCTTTCTGTGAAAAATCAATTGAAACCGCCTTAATT 
CCCTGAAGTTATGTGTCATCTAGCCGACGCTTTCTTATCGCGGGTATGCCCCATACATATGTAGAATTGAAGCATTCGACCATACGGTACGGAAA 
TCTTTTCAATTGGGTGTGTATGCTGACATAGGCTAATTGCCAGTGGGCGAAAAATGTGCAACCACGCCCTTCTCCAACATTCTCTTACATTAAAT 
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ATCCATTTAAATCGAGCACTCACCACAAAACTCAGTTAACTATAGGCAAAATGCGTGAAATATCCGCCGAAACAGATGGATGAACGAATGTACTA 
AAAAATACCAATACGCGGATATCTTTAAGATGTGCGCCAAATTGGCGGCGATGGAGGAACAGTGGAATATGGAACTAGGGGACGTTTTTATTTAT 
TTTTTGTTATTCAAAAACCTGGTCCTGTCGACCTGTCCCGCCCACCTGCCCCACCGACCTGTCCCGCCGATCTGACCCACCGACCTGTTCCGCTG 
ACGTGTCCCACTGACCTGTCCCACCGAGCTGACTCGCTAACAACAAAATACAACAAGTTACAACAGATTTTAAATTTTTATTAAAAATTACCTAG 
ATTTTATCAATATTTTTATGATACTGAACTCAAATAAACTTGCTACGTAACGGTTCTTTTCTAGTTATCATTTCAAATATATGATCGATTACTAT 
TTTATATCGATAGGCAGAGCAGCGATCGCAGTGTGACCGTGCGGTGGGAAATAACGAAATCGAATTCAATTCAAACCGCCGATCCATTCAGTTTC 
CATTGAACTGCTGCTGCGATCGGTCAACCGATGTGAATGAGCCGCGTTACTGGCGCCATCTGAAATTCGAGTTCGCGAGATAAAAACCGCGAGAA 

GAC AAAG AAC AC C GC AAAAAGC C G AT GT AAAC AAAGAG AT AT AGTT AGT T C T GC ATT T T G C AAG AAAAC TT T AT C AC AT T AC ACGC G AAT C G AAA 
CGGATACTTGTATGTGCACAAAGTGAAATCAGCTGGACAACAAGAGTATTTTGCAATTGCCTTGGTTTATTTTCAGCCACTTGCGGCTGTCGTTT 
TCTGTGAAATTAACACTTTTTTCTATTGGCAGTGATGGGAAGTGTATATTTGGGTGTACCTGGCTGGTAGTCTAGATACAACAATTTACGCTATG 
TGTCGCGATCTAGTCTAGGTACAAGTATTCACTCTTCCTTGCAAATGTCCTTGTTTTAATATATAGTAAATTCTATTGCATTTCATATAATTATT 

TACTATTTACCTATATCACTATTCAAT 
(SEQ ID NO: 415) 

Exon: 3017.. 2969 

Exon: 2724.. 2651 

Exon: 2588. .2089 

Exon: 2032.. 1705 

Exon: 1649.. 1284 

Exon: 1222.. 1001 

Start ATG: 2698 {Reverse strand: CAT) 



Transcript No. : CT13338 

TTTCGGCGGATATTTCACGCATTTTGCCTATAGTTAACTGAGTTTTGTGAAAGGGCACGGAACCAGACTGTCAAAATGGTGGCCATCCAATTGTT 
CAACGAAATCGGCAGTATTTTCCGAAACACCCCCACGTTTGCTCTCGTCTTGGAGACCCTACTGCTCATAACCGTGATTTGGTTGCTGCTCCACA 
GACGAGGAGGAGGTCGTCGCCGACAGCTGACCAAGGAGGAGGAAGACCGAATAATCGCCGACTATGAACCAGAGCCCTTGGTGGCCGACACCGAT 

C C C AAT CATC C CC T GC T GC AC AC C C G C GTT GT GC AGT C C AGG GT GGGC AAG C GC AT CC AG GT C GAT G GAC AC GAC T GC CT AAAT CTTGGCTCTCA 
CAACTATCTTGGCTTCCTCGAGGACCAGGAGATACTGGAGGAGGCCTGCAAGTCGCTGCGCAAGTACGGAGTTGGATCTTGCGGACCTCGGGGCT 
TCTACGGCACTATGGACGTGCATCTGGACCTGGAGGATCGTATTGCCAAGTTCATGGGCCTGGAGGAGGCCATTGTCTATTCCTACGGCTTCTCG 
ACCGTGGCCAGTGCCATTCCGGCGTATGCCAAACGTGGCGACCTTATCTTTGTGGATGAGGCTGTCAACTTTGCCATTCAGAAGGGTCTGGATGC 
TTCACGAAGCACAATCGTTTTCTTTAAGCACAATGACGTTGAGGATCTAGAACGTCTGCTGATTGAGCAAGAAAAACGAGACCAGAAAAATCCGA 
AAAAGGCGGCTAAGACACGTCGCTTCCTCGTCGCCGAGGGTATCTATATGAATACGGGCGAGATTTGCCCACTTCCCGACCTGGTGGCCTTGCGT 
CAGAAGTACAAGCTGCGTTTGTTCATCGACGAAAGCATCTCCTTTGGCACATTGGGCCAGGGTGGTCACGGAGTTACGGAGCACTTTAATGTCGA 
TCGTGATGAAGTCGATCTGATATCGGCCGGCATGGAGGGTTCGATGGCAACAGTCGGTGGCTTCTGTGTTGGCTCGCACTTCATAGCCGAGCACC 
AGCGCCTCTCTGGCTTGGGCTACATCTTCTCGGCCTCGCTGCCGCCCATGCTCACCCAGGCGGCTATTTCAGCGTTGGATCGTTTCGAACGAGAA 
CCACAAATCTTCGAGCAGCTGCAGGCAAAGTCTAAGACGTTGCACCAGAAGTTTTTGCGATTCAGCAAGCTAACCCTGCGCGGCGATGAGGTGTC 

AC C AGT G AAGC AC T T GT AT C TT GC C C AG C C C GCC GAG AAC T T T G AC AAGG AACT G AAAC TT C T AAC AG AGC T GGC T G AT AAGT GC AT TGCTCGTG 
GAGTGGCTGTGGTACAGGCTGCCTACCTGCAGAACAGGGAACGCCAACCAGTCCGTCCCAGCATTCGCATTGCTGTCAACCGTTTGCTGGAGAGT 
TCGGAAATAGACAATGCGTTTGAGGTCATCGAGAGTGTTTCCAGCTCCGTCCTATAAGCCTTGTTGTTGGAAAGGAATGATCTTCTTTCGACAAT 

GGCCTTGCTGTTAAGTTTT 
(SEQ ID NO: 416) 

Start ATG: 7 6 (Reverse strand: CAT) 

MVAIQLFNEIGSIFRNTPTFALVLETLLLITVIWLLLHRRGGGRRRQLTKEEEDRIIADYEPEPLVADTDPNHPLLHTRVVQSRVGKRIQVDGHD 
CLNLGSHNYLGFLEDQEILEEACKSLRKYGVGSCGPRGFYGTMDVHLDLEDRIAKFMGLEEAIVYSYGFSTVASAIPAYAKRGDLIFVDEAVNFA 
IQKGLDASRSTIVFFKHNDVEDLERLLIEQEKRDQKNPKKAAKTRRFLVAEGIYMNTGEICPLPDLVALRQKYKLRLFIDESISFGTLGQGGHGV 
TEHFNVDRDEVDLISAGMEGSMATVGGFCVGSHFIAEHQRLSGLGYIFSASLPPiXILTQAAISALDRFEREPQIFEQLQAKSKTLHQKFLRFSKLT 
LRGDEVSPVKHLYLAQPAENFDKELKLLTELADKCIARGVAVVQAAYLQNRERQPVRPSIRIAVNRLLESSEIDNAFEVIESVSSSVL* 

(SEQ ID NO: 417) 
Classif xcation: enzyme 



Celera Sequence No. : 142000013384826 

TCAATATAAAACAAAAGAGAAAAGCTTGCAAGGGAATTATTAAGATGCCTGCAAATGACAAGTATGCAAATCGTTATCTAAATATTTATACATAT 
ATTGATTATTGCAATTGCATCTAAACGAAATTCATGTTTAATGTTTTTGATTGTAAAATCCAATGGAGGACCGCAAGAGCAGAAATCTCATTAAA 
TAATAGACAAATAATTAGACAGATCCACAAAATCTCCTTGATTAATCACTGTTGAGCAACTTAAAATTTATTAACGATAGGAAAAATTTCCCTAT 
TGAAGAAAACGAAGCAAAAATTTCAGAATATTATTATACTTATAGCAGAATTTTTTGAATGATTCTGGGTAATATTACGTTTCGTTTGTGTACCA 
TAGATAAACTGAGAGGGGCT ACT TCATGTTGTGCCGCTTCCTGAAGAGAAAGCG AT CTTTGT AGC ACCAAGT ATT AACT AATCTG AAGC AGC AGC 
AGC AGT AGCCGCCAGCC AT AAAT CTGCTAATGACT CTCTTC AAAT GCGAGACGGCTGCCGATAAAGAACAAAGAATTGTAAACTAAAATATACAC 
TATATAGATACTGATACTGAAAACCGATAATGGAAACTGACATCAAATACTTATTACCATTAAACTCTCGATCCTTTCGTAAAAACCAATGTGCA 
TGACAGCTGGAAATTTAGGCATTTCCAAAGTGCAATAGTTAAATGTTACAAAATTCCACACAAATTCCATTTAACACTCACAGACAAAACAAAAG 
CAACTGCAAGAGCAAGAAAACCAAGCCACAACTACATATAGCAAGTGCAGGAATTTAACTGTAAACGTAAACCAATATGTATTTAAATAAATATA 
AATTGAAACCATTTGAAAGGTGTTTAAAACGTTTCCACGATCTGGATATAAACACTTGGCAAGCACAGTTGTATTTTACGTATTTATCTTTATTA 
CGATTAGCGCACGGGTTAACATAGGACCTAACAATAATTTTAGATTTAAATCAAATTTCATTTAACTTAAATCTCGCATTAATGTTGACCGAACT 
CAATTCGGTAACCAGATACTTAAAAGAGAATGGTATCTCGATCATCGATACAGAACTGTTATCCCCGCACAAGCGACACGTATCTGGCTGAGAAG 
AGAGTCCTCCCGTCTCATTTCGCTTAACAATGCGTTGCAGAGGCGCCAGAATGGAGCCGCACTTGTGGCACACCAGGGTGTGCGTCTTATCGGAG 
TTGTGGAACAAGCGATCCTGCAGCAGGAAAGCAGCACCGTGAGAGATCAAAGCATCCCGTTCCATCTCACCGAATCTGACGCCTCCACCGCGTTT 
TCTGCCCTTAATTGGCTGATGGGTCCGAGCTTCCACCGCTCCCGTAGATCTTACTTGCCACTTGTCAAACACCATGTGCCGAAGACGCTGGTAAT 
GCACCACTCCAAAAAATATGTCAGCAGTCATCTCGCGGCCATCTACGCCGGAATACAGCCTCTCCGTTCCGTAGTAATTGTAACCCCCAGCCTCT 
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AACATTTTGCCAAAATAGTCAATAGCAGTATTCTCTTCGGAGAAGCGAAAAGGCGTGGCATCGTAAACATTTCCGTGGATTGCAGCCCCCTTTCC 

CGCCATCGTTTCTATCATCATGGCGATTGTCATTCGGGAGGGGAAGCCGTGGGGATTGAACACAATGTCTGGAATCAAACCAGATTCCGTAAATG 

GCAAGTCCTCTGCAGGATATTTTTGTGAGCAAATTCCTTTCTGACCAGCTCTTGATGCAAACTTGTCTCCAATAGTGGCAGGTCGTGGAACCCTC 

AGTGTGATGGCCACCATTTTTTTTGGCGACAAATCGAAGCTTCCCAACTGTCGGATGCTCTCTACTATGCAATCCTCCTTTTCATCCATTTTCAC 

AACTTTATAAGTGGCGACCTCTCCGTCAAAGTAACAGTACAGGGGCGAACCATAGCTCAATTTGGAGCCCGGATGTGGAAGACCATCCGTGTCAA 

GATGCTTTATTAACTCCGGCATGTGTGGATGCCTTGCGAAGTAACTGCTCTTCTTGTCCAACGTCAAAAACTTTGTTTTGTAGATACTTCCATAG 

GCGAATCCTCGTTCATAAGCAGCCTTGTTGATGATCATAGCATCTTCCATGTCGTAGCCCGTGTAGGAAATGACAGCTACAATAGCATTTGTGCC 

CATTGCAAAGTCATCCAGCTGGATGTTGTCGTAGTGTACTGGTCGGAAGAGAGGAGTCCCTGGCGTCTGAAGGCGGTATAACTTGTTGGCCGCTT 

GCTTAGGCCAATTCAAGCATGGCGTTCCCATCGTCTGTTTTCCCATCTGACACTGGTACATGTTACGTGGCGATTGATTGTAGTCCGGCATGGGT 

ATCAAGTTGGCCAAGTTGCTCATGAAGTGGGTCTTAGCCAACTCCAAATGAGTGGTGAAGTCAGGATACATTTCCTTTGCATCGATGGCGATCTC 

CATATAAAGCTGTTCCAGTGTTCCGATGTACTCCACCCTTTTCCACTTTAGGTTCCAAACGGGCCTCATCATCCTGGCAGGACCCGTTGCAATGT 

AAAGCCCTGGAAACTGACCGTTCTTCTTGAAGGGTATAAAGCCGATTTCCATCATCTGGGGAAGGGTGCCGAATATCTTCCCGTAGCGCAATTCG 

TCAACTATTTTTTCGGCTTCTGATTGATGAATGTGACCCAAGTGTTTGCCATCTAGGAAGACGACGTAAAGTTTTTCGCCCAAATATCTTCGGTT 

TGACAAGGGCATCATGCCCATATCTATAAGATGCTTGGGAATGGCCTGAAAAAAAAAGTAATCGGTTAAAATTTCTAATAAAGTCTACCATTAGC 

TGTACCCACCTTCACCAACTTGGGATCGGGACGCATCGAAATCTCGCAAGTTAGAGTCAAATGATTTAACAGTCCGCACGGTGTGCCATCAGGAG 

TGTGCACTGGACAAATAAAACCCCAGGCATCCGGGAGCAGCTGCCGCGCCTCAGTGGTTCTCATCGTTGTAAAATAGGAACCACGGTGGATGGCG 

CGAAAGTGGGACATGTAACGCATCCTGTTAATGTTCTCCGCCATAATGACTAATCCGCTATTTTGCATTAATCCCAGGCCAGTGCGACTCGCAAT 

ATTTCCAGTTGCTAAAAACGACTCTATGGCCCGGCCAACTCCTCCTGCTTGTCGCATGCACTGCGTCATCACAGCACTGGTAACTAGGGCATCAG 

GTGAAGTCAGTTTTTTTTGAAGGCAACGTCGGACTTGCGAGACCCAGGATTCGACTCGCTCACTTAGGTATTTCTGATAGAGATGGCCTGGCAGA 

AGCACTTCCTGCATCATAGCTGAGTCGACGTTCTCCACTTTGTACTTGCCCTGTGCGCATTGGAAAAGCTTCTGAATCATAAACACAATCAACTG 

GAACTTATCCTCATAGGTATCCAGGTGAATCATTACTCGCTCCCTAAGAATAAAGTCTGTGACATCATCATCCGGTTGCCATTCCGGAACTTCTG 

GGAAGCGGGCACGGAACAGATTGCCAATGAAGCTCTTGCACTGGGCGTGCGTATAGACGTTTTCATTTTGAACTTCCCTCAACATGGCTTGCACG 

CAGGACACATAGTACTGGTCGGACTCGTAGCCTTGCACCAATCTGTTGTATATTTCCTCATCTGTATAATCCATTAGGCACTTGAGGATCAGGCA 

AACTGGTACGTATGATAGTCTCTTGACGTGAGAAAACATAAACTTTGCAGTTCCATTGTTTAAATAATGTACTACATTGCTCAGGGATGATTCGT 

CCTCACGGACCGTTTGCACTAACATACCCAAATCGCTAAAATTCTGTCCACGATCCTTCCAGGATGAACGTTTTACACATATGGGATGGTTCCTC 

CGGGTCATGATCAACATACGTACAATCTTCTCATTTCCTCGGATCACAAAAATGCCCCCCCACTCGCTGTCGTGCTCTCCATGTTTTACCATTTC 

TTCCGGAGTAGCCTGTCCCAGATTGCAGGCTTTGGAGCGCAACATGATGGGCACTTCTCCTAGGTCCATGTTTATGGGAGTCTTTTGCACTCCAT 

TGACACTCCACCCCAATCGGACCGAGCACATTCCCGAGTAGGAGACATGGAGCTGCCTGGAGTCCGTGGGGTAAATCTCTCGGGTGCGCACATCA 

ATCACATCTTGCGGCACCTTTGGCTTCGCTATCCATATTGATTCTACTTTCATTGAAATCTTTTCGCCGGCAGGACTCAGCCAGTGGTTCGGGAT 

C AT GT GT T T G GC CGAG T T AT C C AGT C C C AC GGT C AG C AT CT C GT C G AAGG AAT C C AC AT GT GG AC C T C C C AAAT T GGC CAAAT GCT AGC CAT AG A 
AGTAAATCTATTAGCACGAAACCTTGCTATAGGTACAAGATTGATATTTACCCTGCTGAGTTTTTTAGGTATCTGCTTAAACTCCGGGCGGGAAT 
TGGTCAGCACTGGAATGGTCTTCATTTGTTGCATCTCCTCCAGCATGTTGTAAGTATTATTCTTTCAGAATGCCGACCAAAAATATTAAGAAACC 
TATTGCCTTCCTTCGCACGTGTTGTTTGGAATCGAACCGTTTTGAATGCGAACCAGTCAGTGGTGAGGCAGTCGCAAAGAACTAGTTCATATATT 

AT AT T GTT C AT AT AACT GAATG AAAT AAGAAT AT AG AT GAT T G GG AT AAAAAT T T T C GGTAAAT AAAG AAC CATC AC T AAAAATT T G T AT T C AGA 
TTAATAGAACACACTAAAATAACCTTAAAGTTCGGTGAATTGTCGTATGGTTCCGATTCGTGTTCAGCTCATCTTTAAATAACTGGGCTGCCAAC 
CGAATGCAAGGAACAGCTGATATGTGCGCGCTTGATAAGCCTGATAACCCAATTTTGAATTGTATCCAAGTTGTATAAAAGCAAATATAATTTAT 
TTGCAACGTCTGACATATTGTGCACATACGGCAATAACCATTTAGCATTTGCAAAGGCATCCGGTATGTAGAACACAACTCCTGCGGAAATGAGA 
CTTTGCAACAGGATGTGCCAACGTGTTGCTCTGGCAATGAAGTTGTTGCATTGTTGTGAGCCACATAGGTGGCGTATGTATGCACATATGTATGT 
ATGTGCATATTAATCTGAAGCACCCTCCCCAAAATACGCCATCTAAAGCTTCTAAGAAGCAGTTTTGTCGAATACTTGGGCTTAAACCCCACTGA 
CTGGCACATTAATGTCTAATGACTGCGAGAATATGAAAATTTTACTAACATTATATTTCGTCTTAAGATAGGTGTTAATCAAGGCTGATGCCAGA 
TTTCTTAATTTTTCTTGTATTGGAATTGCCAAATTACTTTGTTGGGTTTT^CGTGAAAACATGAAGTGGTTGTGGGAACATCAAGAAAATATCA 
TCAATTTGAGATGGTTCAGGGGCTAATGGAAAAGGAAAGAACCCCAATCTGAACGCAGTCACTCCGGTTTATTATTCCATTATAATGAACACAAT 
GGATTTCTTGCAACTTATTTTGGGCAGAGCGTCCTTGATGCCTGGTTCTTGTATCCTTGCTACTTTCAACTTTCGCTGCA 

(SEQ ID NO: 418) 

Exon: 4590 ..4422 
Exon: 4359.. 2860 
Exon: 2800.. 1001 

Start ATG: 4511 (Reverse strand: CAT) 



Transcript No. : CT1335S 

TCCAAACAACACGTGCGAAGGAAGGCAATAGGTTTCTTAATATTTTTGGTCGGCATTCTGAAAGAATAATACTTACAACATGCTGGAGGAGATGC 
AAC AAAT GAAGACCATTCC AGT GCT GACC AAT TCCCGCCCGGAGTTTAAGCAGATACCTAAAAAACTCAGCAGGC ATT TGGCCAATTTGGGAGGT 
CCACATGTGGATTCCTTCGACGAGATGCTGACCGTGGGACTGGATAACTCGGCCAAACACATGATCCCGAACCACTGGCTGAGTCCTGCCGGCGA 
AAAGATTTCAATGAAAGTAGAATCAATATGGATAGCGAAGCCAAAGGTGCCGCAAGATGTGATTGATGTGCGCACCCGAGAGATTTACCCCACGG 
ACTCCAGGCAGCTCCATGTCTCCTACTCGGGAATGTGCTCGGTCCGATTGGGGTGGAGTGTCAATGGAGTGCAAAAGACTCCCATAAACATGGAC 
CTAGGAGAAGTGCCCATCATGTTGCGCTCCAAAGCCTGCAATCTGGGACAGGCTACTCCGGAAGAAATGGTAAAACATGGAGAGCACGACAGCGA 
GTGGGGGGGCATTTTTGTGATCCGAGGAAATGAGAAGATTGTACGTATGTTGATCATGACCCGGAGGAACCATCCCATATGTGTAAAACGTTCAT 
CCTGGAAGGATCGTGGACAGAATTTTAGCGATTTGGGTATGTTAGTGCAAACGGTCCGTGAGGACGAATCATCCCTGAGCAATGTAGTACATTAT 
TTAAACAATGGAACTGCAAAGTTTATGTTTTCTCACGTCAAGAGACTATCATACGTACCAGTTTGCCTGATCCTCAAGTGCCTAATGGATTATAC 
AGATGAGGAAATATACAACAGATTGGTGCAAGGCTACGAGTCCGACCAGTACTATGTGTCCTGCGTGCAAGCCATGTTGAGGGAAGTTCAAAATG 
AAAACGTCTATACGCACGCCCAGTGCAAGAGCTTCATTGGCAATCTGTTCCGTGCCCGCTTCCCAGAAGTTCCGGAATGGCAACCGGATGATGAT 
GTCACAGACTTTATTCTTAGGGAGCGAGTAATGATTCACCTGGATACCTATGAGGATAAGTTCCAGTTGATTGTGTTTATGATTCAGAAGCTTTT 
CCAATGCGCACAGGGCAAGTACAAAGTGGAGAACGTCGACTCAGCTATGATGCAGGAAGTGCTTCTGCCAGGCCATCTCTATCAGAAATACCTAA 
GTGAGCGAGTCGAATCCTGGGTCTCGCAAGTCCGACGTTGCCTTCAAAAAAAACTGACTTCACCTGATGCCCTAGTTACCAGTGCTGTGATGACG 
CAGTGCATGCGACAAGCAGGAGGAGTTGGCCGGGCCATAGAGTCGTTTTTAGCAACTGGAAATATTGCGAGTCGCACTGGCCTGGGATTAATGCA 
AAATAGCGGATTAGTCATTATGGCGGAGAACATTAACAGGATGCGTTACATGTCCCACTTTCGCGCCATCCACCGTGGTTCCTATTTTACAACGA 
TGAGAACCACTGAGGCGCGGCAGCTGCTCCCGGATGCCTGGGGTTTTATTTGTCCAGTGCACACTCCTGATGGCACACCGTGCGGACTGTTAAAT 
CATTTGACTCTAACTTGCGAGATTTCGATGCGTCCCGATCCCAAGTTGGTGAAGGCCATTCCCAAGCATCTTATAGATATGGGCATGATGCCCTT 
GTCAAACCGAAGATATTTGGGCGAAAAACTTTACGTCGTCTTCCTAGATGGCAAACACTTGGGTCACATTCATCAATCAGAAGCCGAAAAAATAG 
TTGACGAATTGCGCTACGGGAAGATATTCGGCACCCTTCCCCAGATGATGGAAATCGGCTTTATACCCTTCAAGAAGAACGGTCAGTTTCCAGGG 
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CTTTACATTGCAACGGGTCCTGCCAGGATGATGAGGCCCGTTTGGAACCTAAAGTGGAAAAGGGTGGAGTACATCGGAACACTGGAACAGCTTTA 

TATGGAGATCGCCATCGATGCAAAGGAAATGTATCCTGACTTCACCACTCATTTGGAGTTGGCTAAGACCCACTTCATGAGCAACTTGGCCAACT 

TGATACCCATGCCGGACTACAATCAATCGCCACGTAACATGTACCAGTGTCAGATGGGAAAACAGACGATGGGAACGCCATGCTTGAATTGGCCT 

AAGCAAGCGGCCAACAAGTTATACCGCCTTCAGACGCCAGGGACTCCTCTCTTCCGACCAGTACACTACGACAACATCCAGCTGGATGACTTTGC 

AATGGGCACAAATGCTATTGTAGCTGTCATTTCCTACACGGGCTACGACATGGAAGATGCTATGATCATCAACAAGGCTGCTTATGAACGAGGAT 

TCGCCTATGGAAGTATCTACAAAACAAAGTTTTTGACGTTGGACAAGAAGAGCAGTTACTTCGCAAGGCATCCACACATGCCGGAGTTAATAAAG 

CATCTTGACACGGATGGTCTTCCACATCCGGGCTCCAAATTGAGCTATGGTTCGCCCCTGTACTGTTACTTTGACGGAGAGGTCGCCACTTATAA 

AGTTGTGAAAATGGATGAAAAGGAGGATTGCATAGTAGAGAGCATCCGACAGTTGGGAAGCTTCGATTTGTCGCCAAAAAAAA 

CACTGAGGGTTCCACGACCTGCCACTATTGGAGACAAGTTTGCATCAAGAGCTGGTCAGAAAGGAATTTGCTCACAAAAATATCCTGCAGAGGAC 

T T GC C AT T T AC GG AATC T GG T T T GAT T C C AG AC AT T GT GT T C AAT C C C C AC G GCT T CCCCTCCC G AAT G AC AAT C GC CAT GATGAT AG AAAC GAT 
GGCGGGAAAGGGGGCTGCAATCCACGGAAATGTTTACGATGCCACGCCTTTTCGCTTCTCCGAAGAGAATACTGCTATTGACTATTTTGGCAAAA 
TGTTAGAGGCTGGGGGTTACAATTACTACGGAACGGAGAGGCTGTATTCCGGCGTAGATGGCCGCGAGATGACTGCTGACATATTTTTTGGAGTG 
GTGCATTACCAGCGTCTTCGGCACATGGTGTTTGACAAGTGGCAAGTAAGATCTACGGGAGCGGTGGAAGCTCGGACCCATCAGCCAATTAAGGG 

cagLaacgcggtggaggcgtcagattcggtgagatggaacgggatgctttgatctctcacggtgctgctttcctgctgcaggatcgcttgttcc 

ACAACTCCGATAAGACGCACACCCTGGTGTGCCACAAGTGCGGCTCCATTCTGGCGCCTCTGCAACGCATTGTTAAGCGAAATGAGACGGGAGGA 

ctctcttctcagccagatacgtgtcgcttgtgcggggataacagttctgtatcgatgat 

AT T G AGT TC GGT C AAC AT T AATGC GAG AT T T AAG T T AAAT GAAAT T TG A 
(SEQ ID NO: 419) 

Start ATG: 80 (Reverse strand: CAT) 
MLEEMQQMKTIPVLTNSRPEFKQIPKKLSR^ 

EIYPTDSRQLHVSYSGMCSVRLGWSVNGVQKTPINMDLGEVPIMLRSKACNLGQATPEEMVKHGEHDSEWGGIFVIRGNEKIVRMLIMTRRNHPI 

CVKRSSWKDRGQNFSDLGMLVQTVREDESSLSNVVHYLNNGTAKFMFSHVKRLSYVPVCLILKCLMDYTDEEIYNRLVQGYESDQYYVSCVQ^L 

REVQNENVYTHAQCKSFIGNLFPARFPEVPEWQPDDDVTDFILRERVMIHLDTYEDKFQLIVFMIQKLFQCAQGKYKVENVDSAMMQEVLLPGHL 

YQKYLSERVESWVSQVRRCLQKKLTSPDALVTSAVMTQCMRQAGGVGRAIESFLATGNIASRTGLGLMQNSGLVIMAENINRMRYMSHFRAIHRG 

SYFTTMRTTEARQLLPDAWGFICPVHTPDGTPCGLLNHLTLTCEISMRPDPKLVKAIPKHLIDMGMMPLSNRRYLGEKLYVVFLDGKHLGHIHQS 

EAEKIVDELRYGKIFGTLPQMMEIGFIPFKKNGQFPGLYIATGPARMMRPVWNLKWKRVEYIGTLEQLYMEIAIDAKEMYPDFTTHL^ 

SNLANLIPMPDYNQSPRNMYQCQMGKQTMGTPCLNWPKQAANKLYRLQTPGTPLFRPVHYDNIQLDDFAHGTNAIVAVISYTGYDMEDAMIINKA 

AYERGFAYGSIYKTKFLTLDKKSSYFARHPHMPELIKHLDTDGLPHPGSKLSYGSPLYCYFDGEVATYKVVKMDEKEDCIVESIRQLGSFDLSPK 

KMVAITLRVPRPATIGDKFASRAGQKGICSQKYPAEDLPFTESGLIPDIVFNPHGFPSRMTIAMMIETMAGKGAAIHGNVYDATPFR 

DYFGKMLEAGGYNYYGTERLYSGVDGREMTADIFFGWHYQRLRHMVFDKWQVRSTGAVEARTHQPIKGRKRGGGVRFGEMERDALISHGAAFLL 

QDRLFHNSDKTHTLVCHKCGSILAPLQRIVKRNETGGLSSQPDTCRLCGDNSSVSMIEIPFSFKYLVTELSSVNINARFKLNEI* 

{ SEQ ID NO: 420) 

Classification: enzyme 
Gene Symbol: RpI135 
FlyBase ID: FBgn0003278 

Celera Sequence No. : 142000013384281 

ATGTGGAATAAATTAATTATATTAATACGCTGGAACCACACAACATGCGCCCAAAACGATGATGATAATGATGATGGTGTGCAGGTGGTTCCGCT 
CTTTTCATCTTGTAGTTGGTGTTGTTGTTGATTTTGTTGTGCTGCACTCCAAACTTCCACCACTCGCACTTGCACTTTCGCTGCTGCTTCCTCGA 
CTTCTCCTCCTCCTCCTCCAGCAGAATCAAGACCATCCCGATCCTGATCCTCTTGCCCAGACAAGCGATCCTTCTTAGAAGCACTTGCGGTGCAC 
AATGGAGGGGCCGGACTCGTCGTACTCCTGCTTGGAGATCCACATCTGCTGGAAGGTGGACAGCGAAGCCAGGATGGAGCCACCGATCCAGACAG 
AGTACTTGCGCTCTGGCGGGGCAATGATCTTGATCTTCATGGTCGACGGTGCCAGGGCGGTGATCTCCTTCTGCATACGGTCGGCGATGCCAGGG 
TACATGGTGGTGCCACCGGACAGCACGGTGTTGGCATACAGATCCTTACGGATATCCACATCACACTTCATGATGGAGTTGTAGGTGGTCTCGTG 
GATGCCGCAAGCCTCCATTCCCAAGAACGAGGGCTGGAACAGGGCCTCGGGGCAGCGGAAACGCTCGTTGCCGATGGTGATCACCTGTCCGTCGG 
GCAGCTCGTAGGACTTCTCCAACGAGGAGCTGCTGGCAGCGGTGGCCATCTCCTGCTCAAAGTCGAGGGCAACATAGCACAGCTTCTCCTTGATG 
TCACGGACGATTTCACGCTCAGCGGTGGTGGTGAAAGAGTAACCGCGCTCGGTCAGGATCTTCATCAGGTAGTCGGTCAAATCGCGACCAGCCAG 
ATCCAGACGCAGGATGGCATGGGGAAGGGCATAACCCTCGTAGATGGGCACGGTGTGGGAGACACCATCGCCGGAGTCCAGAACGATACCGGTGG 
TACGACCGGAGGCGTACAGCGAGAGCACAGCCTGGATGGCCACATACATGGCGGGTGTGTTGAAGGTCTCGAACATGATCTGGGTCATCTTCTCA 
CGGTTGGCCTTGGGGTTCAGCGGGGCCTCGGTCAGCAGCACGGGGTGCTCCTCGGGTGCCACACGCAGCTCATTGTAGAAGGTGTGGTGCCAGAT 
CTTCTCCATATCGTCCCAGTTGGTCACGATACCGTGCTCAATGGGGTACTTCAGGGTGAGGATACCACGCTTGCTCTGCGCCTCATCACCCACGT 
ACGAGTCCTTCTGGCCCATGCCGACCATCACACCCTGGTGACGGGGACGTCCCACAATCGATGGGAAGACGGCGCGGGGAGCATCGTCTCCGGCA 
AATCCGGCCTTGCACATGCCAGAGCCGTTGTCGACAACCAGAGCAGCAACTTCTTCGTCACACATTTTGTAAGCTGCAATGGAAAGAATGCGGAT 
TTCCCATTAGTTTTTGGTTACCTTATTTGGTAATGTAGAAATGGCAAAAAAAAAACTGAGAATTTGCGTGGTTTCCTTGGACTGCGCTGGTCTAC 
CAAAGTACAATTGTCATAACAAACTAACTTGTTTTTCGTGTACAATTAAAAATCACTCAATGCAATGTACGTTTTTTTAGTACAAAATTTTAATC 
TTGGACAAAAAAAATTTCATAAAAAAATACTTTTCCGATTGAAAACTGATTTTCACTTGCGATCG 

CAAAATAGAAGGCCCCGCCCCGAAAAATTAAAACCACAATACGAATATGCCGAAAAGGGAGGGGAGAAGCTTCTAGACGACACTTAACGAATGCC 
GGTGCAAAAGAAAGCCGCAAAGAGAGCACAGTTGCCTGTGTGTATGCGCCCTTAAAATGGCGGCTGCCTAAAACTCGAATTGGAACCCGAAATTA 
ACATTACAAAATAAAAAAAAAAAAAAACACTAAACGCACTCTAGAAAATTTAACAATTAAGAGCCAC 

CCTTTTTGTTCGATTTAACAAATTCAAGGCGTGAAAACTACTGTAAACGCAAGTGGCGAGGCTGAGCTAAGCGACCGCACGGTTTGAAAGGAATG 
ACTGGAATAAACCGACTGAAAGTGGCTGGTGAATGTTGAATGCCAGTTAACTGCGTTTTTCAATCGCTGCCTGCGTCGACGTCGCCATAGGCGTC 
GCTCGCTCTCGCCATCTCGCTCGCTCCGCGCCATGGTCGCTATGGGTGCGAAGGAGAGCGCTGGTGTGTTGAATTGCGGCGGCCAAGTTTTGTGA 
CCATATATGGTCATGTTTCGAGGCGAAAAGCAAACCGTCGTCTGGCTTTCGCTCTCTCACACACACATACACTCTCTCTCGTTATTTTGCGCGTG 
AGTGTGTGTATATGAACGAATGCTCGAAGGTTTTCTCCATTTTTGCAACTACAACTTTGAAAAACCGGCAAAAATTTCTGCTTCAGCCAAAACGG 
CTGCCTACGTCACTATTTTCAAGAACCTGATCTAGTTTTACAGATTATAATATGCATATACTGTATGTATGTACATTTATGGATGATCTGGTGTT 
TTAGGGCGTGGGAAAAAAAGAAAGTGTTGGCGAATTGATTTGCAATTCGTAATTTGGCATATCTTTGGGTGAAATCGTGACACGCCCACATCAGC 
GAGCTTGGCTTTAAACACTATATTAGATCCTAACATTTGAGTTGGGGCTATTGAATTGCTTTGGTTTTTATAATTTCGTATAATTTGTATGTATA 
TATCCTGAAATCTGCAGTTGTAATCCTGATTCTTGAGTTATGGTATTTGAAACCCTGGTTTTGTTAATTTTGATGTCTTTCGACCTATTGGTATT 
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TTGATTGACAATTTAGGGCCAAACTTGGGTATTTTTGGAGTTTTGGTCAATACAATAACTCTTTAGCTCGTGGCTAGTATTTTTTGATCACTTTC 
AGCACTGGCACATAAAAAAATGTTTTGTTTACAAAAAAAATCACACATTTTCCGCGGCTTTTCTCGATTATTGCTTTTTCTTGGTGGGAGTGGTT 
CGATGGTGTTTTGTTGTTGGGAAGGGGAAAGCTACTTACATTTGGTGTGTTTTATTAAAATTAATTAAATAAATATAAATTAAGTCTTTCGGTTT 
GGTGTCTCTGGATTAGACGACTGCTGGCTGATGGAGCGGCTTTGTGTCGGGAGGAGTATCCACACAGCACAAGAACTCAAACGGTAGTGATATGA 
AAACTGGTCCCGCAGCCCGTTTTATAAAGCCCGCTTAGCGCTAAAACCTTATCAGCCGCAATGCTCTCCGCTCTCAAGTCGCGTTCAAAACTTTT 
ACCATATATGGGTATGAATTGTTCGCCGATTGGGTTTTCATTTACGGAGAATTTCCTCCGCAACTGGGTGTTCGTGCTTGTGTGTGTCTTGTATA 
TCGCCATGAAGAGTGTACTTCCATGGGTCCTATCAGTTGCGGCATGTGATTCAATTTGGGGGTTACTCATATTCGATACTTCAGCTGGTTATCTT 
TATGCTGTGATGGTCACGGTGCTTTTACACATTTTTTCCCCAACTACTCATTGTATGCCAAAACATACATATGTACATGTGTTTATAGTGGGTTT 
ATTAGTAATGTCCAGCGATATTAAAACATTACAGTTAAATAATATTCATATTAGCTTACGTTTACTGCTATCTGCACTTCAAATTGGTTATTCCA 
GGGACATTTTGTGCAACACACATCTTCTAGTAGGGCATTCACTAGTCGTAGAAAACACACCTAACTTAAGCCGGTTTTAACAATAACGGCGAAAC 
AGCTGCCAGACGAGAGAAAAGTACACAGCAAGAAGCATTTCTATAATGATGCCGTCCTAAATATGCAGCATTTTTGCTCAAATTGATCATTTCGT 
TGCATCGCTCAGGTGGTATAGTATTTGGCTATTGAATGTGATTGCTATCTTAATAGTTTTGAATAGTAATAGTGCAAATATTCATTCATAAAAAA 
TAGGTAATCGATTTTTTCAGTGCAGTCTATGCGTTTGCGCAACAGAGACGAAGAGAGAGACGCCTCACACACAAGCACATGCGGTCTAAGCTGCT 
CTCCCTCTTTTTCTCTCTCTCTCTGGCTTGCGGCAGTTTTCCAGAGGCTGTGACTCAGTAAGCATTTCTTGTTGTTACCCTAACGATAAGCGTCC 

GATTCCTAATTCTCTTTCAAATAATTTGTT 
(SEQ ID NO: 421) 

Exon: 3210.. 3080 
Exon: 2113.. 1997 
Exon: 1395.. 265 

Start ATG: 1395 (Reverse strand: CAT) 



TTCTTGTGCTGTGTGGATACTCCTCCCGACACAAAGCCGCTCCATCAGCCAGCAGTCGTCTAATCCAGAGACACCAAACCGAAAGACTTAATTTA 
TATTTATTTAATTAATTTTAATAAAACACACCAAATACTTTCAGTCGGTTTATTCCAGTCATTCCTTTCAAACCGTGCGGTCGCTTAGCTCAGCC 
T CGC C AC T T GC GTTT AC AGT AGT T T T CAC GC CT T G AAT T T GT T AAAT C GAAC AAAAAGAT GT G T G AC G AAGAAGT T GC T GC T CT G GT T GT CGAC A 
ACGGCTCTGGCATGTGCAAGGCCGGATTTGCCGGAGACGATGCTCCCCGCGCCGTCTTCCCATCGATTGTGGGACGTCCCCGTCACCAGGGTGTG 
ATGGTCGGCATGGGCCAGAAGGACTCGTACGTGGGTGATGAGGCGCAGAGCAAGCGTGGTATCCTCACCCTGAAGTACCCCATTGAGCACGGTAT 
CGT G AC C AACT GGG AC G AT AT GG AG AAGATCT G GC AC C AC AC CT T C T AC AAT G AGC T G C GT GT G GC AC CC G AGG AGC AC C CC GT GC T GC T G A C CG 
AGGCCCCGCT GAAC C CC AAGGC C AAC C GT G AG AAG AT G AC CC AGAT C AT G T T C GAG AC C T T C AAC AC ACCC GC C AT GT AT GT GG C C AT C C AG GCT 
GTGCTCTCGCTGTACGCCTCCGGTCGTACCACCGGTATCGTTCTGGACTCCGGCGATGGTGTCTCCCACACCGTGCCCATCTACGAGGGTTATGC 
CCTTCCCCATGCCATCCTGCGTCTGGATCTGGCTGGTCGCGATTTGACCGACTACCTGATGAAGATCCTGACCGAGCGCGGTTACTCTTTCACCA 

C C AC CGCT G AGCGT G AAAT C GT C C GT GAC AT C AAG GAG AAG C T GT GC TAT GT T G C C CTC G AC T T T G AGC AGG AG AT GGC CAC C GC T GC C AGC AGC 

TCCTCGTTGGAGAAGTCCTACGAGCTGCCCGACGGACAGGTGATCACCATCGGCAACGAGCGTTTCCGCTGCCCCGAGGCCCTGTTCCAGCCCTC 

GTTCTTGGGAATGGAGGCTTGCGGCATCCACGAGACCACCTACAACTCCATCATGAAGTGTGATGTGGATATCCGTAAGGATCTGTATGCCAACA 

CCGTGCTGTCCGGTGGCACCACCATGTACCCTGGCATCGCCGACCGTATGCAGAAGGAGATCACCGCCCTGGCACCGTCGACCATGAAGATCAAG 

ATCATTGCCCCGCCAGAGCGCAAGTACTCTGTCTGGATCGGTGGCTCCATCCTGGCTTCGCTGTCCACCTTCCAGCAGATGTGGATCTCCAAGCA 

GGAGTACGACGAGTCCGGCCCCTCCATTGTGCACCGCAAGTGCTTCTAA 

(SEQ ID NO: 422) 

Start ATG: 249 (Reverse strand: CAT) 

MCDEEVAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNE^ 
RVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVLDSGDGVSHTVPIYEGYALPHAILRLDLAGRDLTDYL 
MKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMEACGIHETTYNSIMK 
CDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF* 

(SEQ ID NO: 423) 
Name: ACT IN 

Classification: cytoskeletal_structural_protein 
Gene Symbol: Act5C 
FlyBase ID: FBgn0000042 



Celera Sequence No. : 142000013384645 

GGAAAGAATTTTGGTTGGATTCAAATACGAAGCAGCTTTGGTAACAACTAAGGTTCTGGGTTAGTCATGATTCACCAACAGGTTAGACTTAGTTT 
TGTGGCAAAATTTAACCAATGACGTCCAAGTTCTAACTAAAGTAATATCGTTACTTTGTTATTTTATTTTTGGATATTCTCGAAAGTTTCTGGAA 
TTCTCAAGAAAACTTAAACAATATAGAGTCGCATTATATGGAAAACTCATTGATTCATTCTTATGTTTGTTATTTTGTTATTAAATTCCAATTGA 
TGATGCCAGTGTCTAAATAAAAACGAAGACCTAATTTTCAACCAAAAATCAGGTTTTCTGGGCTTGTCCGTATTAAACATTTTTGATAATTATTT- 
ATAATATATTGTTATTTTTTGCTGGCTTCAAAACATCAACAAATTGTGTATGATAAATGACGATGACGTTGTTGTTGATGGCGATCGCGCTGTGG 
GAATCAGATGAAATGTCAATTAAGTTTGCATTTATATCTTTGATTGAAGCTCTAAGATTTGCTTTGCCATTTTATGCGGAAGTTAAGCTCGGTCA 
GTTCTTGAAAAAGTTCCCATGGGAGCAAGTTATTTGTTTTTTGACAATAAGAAGCGTTTATTTAAAATGGCTTATTATGGCCTCAATGATTCCTA 
ATAGGCGTAACTTACTGGTTATAAACTTGTTTAATATCGTTTCATTTTAAACCTGGATGACAGAAAATGAATTTCGGTTCAAATACAAATATTAC 
AATAGAAAGTATGATATTTAGCTCCGATAAACATTTAACAATTTTAGTATTTTAATGGTGAAATGGTAAATTCCGCGCATGAAAAGAGATAGAAG 
AGATGACATATATGGAAACCTTAAAAACGTAACGGTTGCTTGGGTTTTATAACAATCAGTCAGTGACAGGCATTTCCAGAGTTGCCCTGTTCAAC 
AATCGATAGCTGCCTTCGGTCGCTCGTCACTAACTTTCCTTTAGGCAACGGCCACACTGTCTGGCCACCAAAATCCCAAACTTAATTAAAGAATT 
AAATAATTCGAATAATAATTAAGCCCAGTAACCTACGCAGCTTGAGTGCGTAACCGATATCTAGTATACATTTCGATACATCGAAATCATGGTAG 
TGTTGGAGACGGAGAAGGTAAGGCGAGAAACGGCGAGCCGCATGGTTTCGATTTCGGCTGAGCCGTGGCAGGAACAACAAAAACAGGGTTGTTGC 
ACAAGAGGGGAGGCGATAGTCGAGCGGAAAAGTGTGCAGTTGGCGTGGCTACATCATCATTGTGTTCACCGATTATTTTTTGCACAATTGCTTAA 
TATTAATTGTACTTGCACGCTATTGTCTACGTCATAGCTATCGCTATCTCTGTCTGTCTCTATCAAGCTATCTCTCTTTCGCGGTCACTCGTTCT 
CTTTTTCTCTCCTTTCGCATTTGCATACGCATACCACACGTTTTCAGTGTTCTCGCTCTCTCTCTCTTGTCAAGACATCGCGCGCGTGTGTGTGG 
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GTGTGTTCTTCTTTAGCTACATATACATAAATAGGAGAGCGGAGAGACAAATATGGAAAGAATGGZyVAAAGAGTGAATTACTGCAATTAACCAGT 

CGCGAACAGTTAAATCATATTTTTGTCGGCCATTGCAGTAAATAAACCGTTGGCTTTCCCTCTTCACTTTCCACCTCCTTTCTTTGACGTTAATT 

TTTTTCAGTTAATCGCGCCGCTGCTTTGAACTCGAACACGAATTTTAGCCGCAACATAAAATAAAATCAAGTAACTCTTTTAACTCAAATATAAA 

ACAACAATTCCAATTCTTTCAACAGGCAATCTGTGTTTTTATGTCAGATACGTGCGCGTGTGTGTGTGTGCTGTAATTCCATCGCCCCTTTCGAT 

TCCGAGTTCGTTAGGAACAGCATTAGTTCGCCTATTTTAGTAGTAGCCTAGTCCGATTTTAAGTGAAACAGGATACTCCAACACCATATACTCAA 

TAATTAGTTACAACACCCACTCAACCATACAGCAACAACAAGTTTAACGAGTTTTTTGTATTATCATTACTTAGTTTTTTGGTTAATAATACACA 

AGTGAAGAGCGAACTGCAGGGGAGCGAGATATCACGAAACAATCCAAAATCCACACACACTCAAACAGAAATCAAAAGCTTCGCTCTCTCGCACA 

C AC AC GC AC C AAC C A AC TAT C AAC T AT C AC AAAC AC C GC GAC AGAG A G AG AGC GGC AAG T G AAT C AC GGC G AAT C GAAAC C GAT C C G AAC C C AC T 

CC GG AGC C G AAAAAG AAC T G AT C C T AC C AT C AAAC G C AT CC AAT AAAC AC GGC CGC C AAC AT G C AG AGC G AC T T T C AC A G AAT G AAG AAC T T T GC 

CAATCCCAAGTCCATGTTCAAAGTAATACTCTCAGTGCGCCTGTCGCTAAGCCAAGCTAATCTAATCTTCTGATTCCCCTTCCCATCCATTGCCA 

TCTTCTCCCGCAGACCAGCGCCCCCAGCACCGAGCAGGGTCGTCCGGAACCACCAACTTCGGCTGCAGCGCCCGCCGAGGCTAAGGATGTCAAGC 

CCAAGGAGGACCCACAGGAGACTGGTGAACCAGCAGGCAACACTGCAACCACTACTGCTCCTGCCGGCGACGATGCTGTGCGCACCGAGCATTTA 

TACAAACACCCGCTCATGAATGTCTGGACGCTGTGGTACCTTGAAAACGATCGGTCCAAGTCCTGGGAGGACATGCAAAACGAGATCACCAGCTT 

CGATACCGTCGAGGACTTCTGGAGCCTATACAACCACATCAAGCCCCCATCAGAGATCAAGCTGGGTAGTGACTACTCGCTATTCAAGAAGAACA 

TTCGGTGGGTTTGCTGTTTATTGCAATTCTGCCAAGATAACCTTTACTAACTGATATCTCATTTGCAGTCCCATGTGGGAGGATGCAGCCAACAA 

ACAGGGCGGTCGTTGGGTCATTACCCTTAACAAAAGCTCCAAGACCGATCTGGATAACCTATGGCTCGATGTGGTAAGTGCACAAAGAAACGAGT 

GGTGAGCGGATGGTCTATTTATAGTGAATGTACATTCTTGAAATGCAAAAATATAGAAATAGGTGTATGATTTTGCAATATAAATTATAACTTAT 

AGAAAATATCAGCTAAAAATACGCTAGTGCTAGCTTTTGTCTTAGGAACATTCAATAGTGAGCTTATATCATAAATATTTTTCGCATATGAGTAA 

CTACAACTGTTTTGCCTTCCAGCTGCTCTGCCTGATTGGTGAGGCCTTCGATCACTCTGATCAGATCTGCGGCGCTGTTATAAACATTCGCGGCA 

AGAGCAACAAGATATGTAAGTTTTCACGCACACCCAACTTCAGCGGAATTCCTTTGTTTAACGATTAATGCTTTCCAGCCATCTGGACTGCCGAC 

GGAAACAACGAGGAAGCTGCCCTTGAGATTGGTCACAAGCTGCGCGATGCCCTGCGTCTGGGACGCAACAACTCGCTGCAGTATCAGTTGCACAA 

GGACACGATGGTCAAGCAGGGCTCCAACGTGAAATCGATCTACACTTTGTAGGCGGCTAATAACTGGCCGCTCCTTATTCGGTCCGATCCCACAC 

TGATTATTTTGTCTTTCATTTATTTATCGTTATAAGCAACAGTAGCGATTAATCGTGACTATTGTCTAAGACCCGCGTAACGAAACCGAAACGGA 

ACCCCCTTTGTTATCAAAAATCGGCATAATATAAAATCTATCCGCTTTTTGTAGTCACTGTCAATAATGGATTAGACGGAAAAGTATATTAATAA 

AAACCTACATTAAAACCGGATACTTTTGATCTTTGATTTTGGAGATATTTTAAAATAGAAAAAAGTTAACCTATATTGTTTTTGGGATAATTACT 

TGTTGTATTACTAAAACGAGCTTGTATCCTAATATGGGCACCTAAAGCAAAACTTCCAAATTTGGAATTCAATTTGTTGCTAAAAATACGTTAGC 

AATTTCTAGGGTGGCCAAAAAAAAATGTGTATCGATATATCGATAGGTTTCCTAAATAGACATTAAAATAATTTTAGCTTTTTACTTATATTGTT 

AGAGTAAATACGTTTGTATTTAAAAAATAAGTCAGTATTTTTTTTTTATTAACCTATTATTGAATTTTATATGAGTGAGAGCTTTTCCATTCAGC 

CAAGAACGGTCACACTGCATGTGACATGTTTCAATTGTGTAGGCGCAGTCACACTGGTCGCGTCGCTCGTAGATATCGTGTAAGGATATGTCTCC 

GTGTTTAGCTAAATAAAATACATAAAATATATGTGCAATTCTAACAAGCATAGCCATTTAGCGTAGAGTAGTGGAACGAACAGCTCGTACCGGGC 

CAGTAACAACAAATTAACAAGCAAACGCACTAAAGTTTGTCGCGGAATAAAGATATAATAATAGACCAGCTAACCAACACAAACAAACACACACA 

CGGCCAGAAGTTCGTTGGCGTTGAAAGGGAAAAATGAAACGAAATGAAAGGAGAATAACAGTGCAATAGATTCGGTTTACATTGTGCAAAAAATC 

GAAGTAGCCCCTGCCAACAGGGTCGCTAAAAATAAAGAAGGCGACAGCCTCTGTTTATCGTTGGGAATAAACAAATATATGGCCAGCGGAGGAGA 

ATCATAACAAAGGCGCAAGACTAGCCAGCATAAAAGACAAAGGAGCTCTCCTCCCCCGCGTCCGCCACCCCCCTCCCTTTTTGGGGTTGAAAACC 

CCCCACGCGTTGCGGTGCAGCATCGCCCCGACTATGCGATACCCTGCAATCGAGTTTATAGTGCTATAG 

(SEQ ID NO: 424) 

Exon: 1001.. 1157 

Exon: 2070.. 2397 

Exon: 2484.. 2854 

Exon: 2919.. 3018 

Exon: 3253.. 3340 

Exon: 3404.. 3819 

Start ATG: 2341 

Transcript Wo. : CT13384 

GCCACACTGTCTGGCCACCAAAATCCCAAACTTAATTAAAGAATTAAATAATTCGAATAATAATTAAGCCCAGTAACCTACGCAGCTTGAGTGCG 
TAACCGATATCTAGTATACATTTCGATACATCGAAATCATGGTAGTGTTGGAGACGGAGAAGTTTTTTGGTTAATAATACACAAGTGAAGAGCGA 
ACTGCAGGGGAGCGAGATATCACGAAACAATCCAAAATCCACACACACTCAAACAGAAATCAAAAGCTTCGCTCTCTCGCACACACACGCACCAA 

C C AAC T AT C AAC T AT C AC AAAC AC CGC GAC AGAG AG AG AGC GGC AAG T G AAT C AC G GC G AAT C GAAAC C G AT C C G AACCC AC T C C GG AGC C G AAA 
AAGAACTGATCCTACCATCAAACGCATCCAATAAACACGGCCGCCAACATGCAGAGCGACTTTCACAGAATGAAGAACTTTGCCAATCCCAAGTC 
CATGTTCAAAACCAGCGCCCCCAGCACCGAGCAGGGTCGTCCGGAACCACCAACTTCGGCTGCAGCGCCCGCCGAGGCTAAGGATGTCAAGCCCA 
AGGAGGACCCACAGGAGACTGGTGAACCAGCAGGCAACACTGCAACCACTACTGCTCCTGCCGGCGACGATGCTGTGCGCACCGAGCATTTATAC 
AAACACCCGCTCATGAATGTCTGGACGCTGTGGTACCTTGAAAACGATCGGTCCAAGTCCTGGGAGGACATGCAAAACGAGATCACCAGCTTCGA 
TACCGTCGAGGACTTCTGGAGCCTATACAACCACATCAAGCCCCCATCAGAGATCAAGCTGGGTAGTGACTACTCGCTATTCAAGAAGAACATTC 
GTCCCATGTGGGAGGATGCAGCCAACAAACAGGGCGGTCGTTGGGTCATTACCCTTAACAAAAGCTCCAAGACCGATCTGGATAACCTATGGCTC 
GATGTGCTGCTCTGCCTGATTGGTGAGGCCTTCGATCACTCTGATCAGATCTGCGGCGCTGTTATAAACATTCGCGGCAAGAGCAACAAGATATC 
CATCTGGACTGCCGACGGAAACAACGAGGAAGCTGCCCTTGAGATTGGTCACAAGCTGCGCGATGCCCTGCGTCTGGGACGCAACAACTCGCTGC 
AGTATCAGTTGCACAAGGACACGATGGTCAAGCAGGGCTCCAACGTGAAATCGATCTACACTTTGTAGGCGGCTAATAACTGGCCGCTCCTTATT 
CGGTCCGATCCCACACTGATTATTTTGTCTTTCATTTATTTATCGTTATAAGCAACAGTAGCGATTAATCGTGACTATTGTCTAAGACCCGCGTA 
ACGAAACCGAAACGGAACCCCCTTTGTTATCAAAAATCGGCATAATATAAAATCTATCCGCTTTTTGTAGTCACTGTCAATAATGGATTAGACGG 
AAAAGTATATTAATAAAAACCTACATTAAAACCGG 
{SEQ ID NO: 425) 

Start ATG: 429 

MQSDFHRMKNFANPKSMFKTSAPSTEQGRPEPPTSAAAPAEAKDVKPKEDPQETGEPAGNTATTTAPAGDDAVRTEHLYKHPLMNVWTLWYLEND 
RSKSWEDMQNEITSFDTVEDFWSLYNHIKPPSEIKLGSDYSLFKKNIRPMWEDAANKQGGRWVITLNKSSKTDLDNLWLDVLLCLIGEAFDHSDQ 
ICGAVIMIRGKSNKISIWTADGNNEEAALEIGHKLRDALRLGRNNSLQYQLHKDTMVKQGSNVKSIYTL* 
(SEQ ID NO: 426) 
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Name: eIF4E 

Classification : translation_f actor 
Gene Symbol: eIF-4E 
FlyBase ID: FBgn0015218 



Celera Sequence No. : 142000013384826 

CCCGATGATGTAGATTTTAAGGACAGCGGAAAAATAAGCTCCTTGAAGGCCCTGGGTCCATTTATAGCCGGCAAGGAGAATAGCCGCAACACGGT 
TTACCTCAGCGATGTTCCGGATATTTGCAAGGATAAGCCGTGCATGCTGGGCGTGGACGAAGCAGGGCGTGGTCCCGTCCTGGGACCTATGGTAT 
ATGGGATCTCCTACTGTCCACTGGAAAGTAACAAGGCTTTAGAGGACCTGGGATGCGCCGATTCCAAACAACTAACCGAGGGGAAACGTGATATC 
ATATTTAACGATATCAACACCAAGGAATATGCGACCAGTTGCGTTGGCTGGGCCGTGGAGATCATATCGCCGAACACGATCAGCACCAGCATGTA 
CCGGCGGTCGAAGTGCTCCCTGAACGAAGTTTCCATGGACTCGGCCATGGGCCTTATCCAACAGGCGATCGATGCGGGCGTCAATATAGCCGAGG 
TTTATGTGGATACGGTGGGTCCGCCGGAGAAGTACCAGGAGAAGTTGCTCAAACGTTTCCCCAGCTTTAAAATCACCGTGGCGAAAAAAGCTGAC 
TCCACCTATCCAATAGTCTCTGCTGCTAGTATATGCGCCAAAGTTACCCGTGATCATGCCCTCAAGGTGTGGAGCTTCCCCGAGGGCTTGGTCAT 

T AAG G AC AAC GAG T T C GGC AGT GG AT AT C C GG GAG AT C C GGT C AC AAGAAG AT T T C T G AC C GAGT AC AT C GAT C T AGT GT T T GG AT TT C C AC G AC 
TTGTGCGCTTCAGTTGGTCCACCGCGGAAAACGCACTGGCGGACAAGGCCTATGACATGGAGTTCGACGAGCCGGATTCCGAGAAGCCCAAGTAC 
GCCGGCACCAAGCTCACAAAGTTCTTTAAGGGCACAACAAAATCCGGCGAAATAATTCGCGAGGAGTGCCGCTTCTTTAAGCAACGCCATCTGGA 
AAGTGTCATGGAGTTTTAGCAACAAATGAAAATAAAATTAAGTTCAGGAGTATCTATATTTACAAAGAATAAAGTTTAATTTAAGTACTTTAGTA 

T T AAAT T T T AAAAT G CT AAAAT AC AAAAAAC AC C C AGG C C C AAAAC G AAAAC AG AT T AAT T T T T AT C T T T AAT GT AT T T T T GC T GAT GT T C T T GT 
GTTTCCGCCAATCTCGTGTTCTTAATAATTACGTTCGATTACTGACTACGTGGGACTGCAGTTACACTGTTCTGAATGTATGCGTGAATCGGTCT 
GATCTGATCTCTTGGCATCCCTTTTTTGGAGGACAACTGTGGAATTGGGTAAAGGACTTTTGGTGGGGAGGGCGGGGAGTTGGCTGGCATCTTTG 
ATCGATACAATAACAAAATGCAAAGTTATACTAAACGATTTTCGTATAAAAAGGGACGACCTTAGAAACATGGAGCTTATGATTAAGATTAAAAT 
GAGGCGAGTCGCATCCCGCTGATCTGAAGAGAACAACAACCATGTTGAAGCCCCCAAAAATCCCAGGTGCATATCGATATTTTGTGACTCGAAAC 

T AAGC GT AC G AAAT AG AC GT GG AC AAAAGC GGG AGGAT C TC ATC T C AAG AGGT AT AC CT T AAAG AACT AAT T AG AAT T T AAAGC AAT T AC AAAT A 
CGATTGCAGGGCGGGATAGCTAAGAAAGGAGTTACCTAAGGTCCGGACTCCAGATCCAAATCGACTTATGTTGGTAGAATTGTAACTGATCATAT 
TTCACTTCTTTAAAAAATATAGTTAAATGCAATTTTTGAATCAAAAGTAAAAACTTCTCGTTCCGTAAGAAAGTGAAAAACTACCGATTTCATTT 

TT ACC AAAT AAT G AGCT GGAAAAGG AAT CT T G AC T AT T T T AAGC GT T GGC T C AAT AC AC GAAAT C C AGT T GT GGC T GCTC C GGC AAT C AG AACT T 

TCGCAGGTCCAACACGAAAACGCTACCGTCGGAGGCACTGGCACCCACCTTAGTGCCCTTGGAGTTCCAGCACACCTCAAAGATTCCGCCCGTGC 

CCTTGTAACTGTGCACTAGCTGTCCCGTTTGCGTGCTCCAAATGTGTACGCACTTGTCGAAGCTGCCGGAGGCCAAATGCTTTCCATCCGGACTG 

AAGGCCACCGAATAGACTGGCTCTGTGTGCTTGGTGAGCGTGTGGATACAGCTGCCCCTCTCCACGTCCCACAGTCTTACCGTGGAATCGAACGA 

AGCGGATGCGAGGATCAGGTTTGTGTTTGGATTATTGGTTCCCGGCCCCGTGGGCGACCATTTTATCGTATAGATCTCCTTTGAGTGAGCCTGCA 

GATCGTGGCAGCAACGATCTCGGTTCATGCTCCAGATCTTGAGGGTCATGTCATCAGAGCAGGACGCCAGCAGTTGGCCCTGCGGACACCACTTG 

ATTGCATTCACCTCGTTTGTGTGTCCTTTAAAGGTCTTGATGGGTTCATTTACACCCAACCGGCACACATGTATCCGCTGATCCGTACTGCAAGA 

GGCAAAGGCCTGGTTTGTCTGCCAGTCCACATCCAAGGCTGGAGCACTGTGAAAGGCAAATTGCTGGGTGCATTGGCCCGTGGATGCGTCCCAGA 

TGATCGTCGTCTTGTCCACGCCAGCCGAGAGGATGTAGTTGCCGCACTTGTTCCACTTCAGAGCAAAGATCGGACCCTTGTGTTGTCCCAAAGTA 

GAAGCAAGTCGACCGTCGGTTTTCCAAATTCGGGCGTAACCATCGTAGCTGCCGGTGGCGAGCAAGGAACCATCGCACTGCAAAAACGAGAATAG 

ATTAAGTAAGGATCAGAAATCAAACTTTGCAAGGCTTAAGTGCCAAGATCTTCGACTTACATTCCAGTCCAGAGAGGTGACGTCCTTGTTGCTAG 

GCACCTCAGCGCCGCCTTTCTGGATGCAGTGGCGCAGCACCAACTGGTTGGAGTTAGTGTTTGCATCGGACATGTCCCATATCCTAGCAGTGCTG 

TCTCCCGAACCGCTTGCCAGTAAATCCCGGCTAGGATTCCATGCGCAGATAAACACCTCGCTTTCGTGGCCGCGTAATACACGAGCCTTGGATTC 

GGGTATTTCGATGTTCTCGTCAATGTCCATGGGAACAAGAGCTGCGGGAGTCCCAGCACCTCCGCTGGTAGACGAGGAGATGGTTCCACTGGGAG 

CAGAGCTTTGAGCATTGGACGAACCAGACTGCACGTTGTTTCCTTGGGCTCCTACGCGAATCGTAACCGCTCCACTTGGCGCCTTTGATCCTGTT 

GCAGTCGAAGCTCCGCCCGATGCTGCTGCGTCCGGATTGGAGGTTGATACAGTTCCACCAGCGGGCGTCAGCCCGCTTGTAGGTTGTTCTACGCT 

GCTGGAAGTGCTGCTATTGCCATTTGTGGATGTTGAGGAGGCTGCATCATCGGTACTGGTAGCATTTGCATTGCCTGCATTGCCACTAGATGAGC 

TTCCCGCTTCATTCGAGTTCTGGGATTTCTTTTGACTTGCCCCAGGTGCTGCCAAATCTCCACCAGCTGGTGTGCTTGTGCTATTAGATCCTCCA 

GTACTGGCCTGGTTCCCTCCAGCTCCATTGTTTCCGGCGTATGTGCCGCCAGCATTATTTGCCGCATTTCCACTGGAGTCTACCTCGCTTGCCGA 

CTGATCGGTTGGTGTGCTTGTGCCCGTGGTGCTGCCAGCAATTTTGTTTCCACCTGCTGAGCCCGCCACTCCGGTACCCGGTTCTATTTTTATCT 

CAGGCTTGGCATTGTTGTTCTGGTTGCCACCGGCAGGGGCAGAAGAGTCGACTGCCCCGGGTTTTCCCGGCTCTGTTTTCACAATGGGCTTGAGT 

GGCTTCACCTCCGGCATGACAGCGTCTATTAGAGAGAGTCCTTCAATGGGTCGCGCCACCTCGCCGTCTTCGCCTACGCTCCACTCCACTTCAGT 

GTACAGAAGACCCTTCTGCAGGATGGTGAGCAAAGCGGCAGGTGGAACCAATGCGCCATTGATATTACTTTGCGAGATGTGTGACTCAATGCCAA 

ACACGTAGGCCGAGTGCAGAAAGCCTAGAAAACGGGATTGCATGAGATTAGGGCGGGCTGTTATCTTGCGATTGGAGGGTTTTACTCACCCGACT 

CCTGCAGGTATCGGTAAACCAGAAAGTTCACCTCGTCGCTGGAAAAACTCATGACGGAACGCGGTTGGCTCCTTCTCCTACTACTAGCGCACTCC 

AAAATCCACTCCTATCTCCAATGCCAATTATGTTTTTACGGTGCGAAAATCGGAACTGTGCTGCGTAACACATGCGTCTGCATTCGGCACGCTTA 

TCACTTCGGCGCAAGCCACGGTATCCGGTGCTACTCCTGGTACTTGGATCGATCTGGTCGGTCCTGTGTTTTCAATTCTGCCAGTTTGCTGCTTG 

CCGCAAGAGCACTTCACATATTTTTTCCACAAATAAAACCGAGGGATCGAAAACGCGTTTCTGAAACGCGCAAAAAAATGGCGGAGTAATCTCAA 

GGTGTGCGCTGCGAGAGGTTGCGCAGCCTTGTGTTTTGTGACGTCACAAGTTTTGACAACAACTTAATCAAATTTGAGTCATATTTTTGTATTGC 

ACCAAGCGTTACCTTAAAGGATAGAACGGAAATTTATTAGCCATAAAAACAATTAAGAAGATATTTTTGGATTTCTGCTTGGAGCAAAAGGTAAA 

AAATTCTTAAAGCGAATTGTCGTCTCTCTTCGTCCATCCCTGTGTACCCATCCCTCGAAGTAATCATTGTATTTCAACCCAACGCTCTACTGGTC 

ACACCAATCGTTCCACCCGCGCCCGCTAGATGGCTCCACTCTTAGGCAGCGAGAGCGAAGAAAAATCAATAGAAATTTTCATTTTAGTGAATTGC 

CCTGCACTTGGTAATAGTTAACAAACAAATTAGCGGCAAATCAGCGAGGCGAAGCTCCAGCGAGTTGGAGTTCCATAAAGTGCAACACGCAGGGT 

CAAACATTCCGCAGCCAACACCCCGGATTCCGATTCCAAGCGAAGCCAACTTCAGAAGTCAGAGATTACTCTCTCTGCTCCGCCACAGAAATTCC 

GCAACGCTAACCGATTTTGTCCGCTGGCGAGTAAAAAAAATCGGAAAAGTAAGCTCAAAAAAAAAGAACAGCTAACATACGCCCATTAATGTTTT 

TGGAGGCTTCTTCGGTGCTTTTTTTTTTGGGCAAAACGCCCTCGTCATCCAGCGGTGAAAAAAGTGACGTGACATGAAAGCAAAGAAAGAAGAGA 

ACCGAAGAGCGCAGTGAAAAACAGAAAATAAAACGAAACGGAAAGCAAAGTAAAAAGTATACTACAGAAACAAAAGCCGGTGGTCGCGCAAATCC 

AAATAATAATGCCCAAGTGCACATCGACTTCGCCAGGCGATAAAAAGTGGGCGAGCGAGGTCACAGAATCTTTATCAACGGGCAAAACGCTGCCA 

AGTTGAAAATAAATCGGTTGCCAAAAATGCGGGGGTACAC 
(SEQ ID NO: 427) 

Exon: 4455.. 4080 
Exon: 4014.. 2816 
Exon: 2737.. 1001 

Start ATG: 4137 (Reverse strand: CAT) 
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Transcript No. : CT13462 

TCCGCCATTTTTTTGCGCGTTTCAGAAACGCGTTTTCGATCCCTCGGTTTTATTTGTGGAAAAAATATGTGAAGTGCTCTTGCGGCAAGCAGCAA 

ACTGGCAGAATTGAAAACACAGGACCGACCAGATCGATCCAAGTACCAGGAGTAGCACCGGATACCGTGGCTTGCGCCGAAGTGATAAGCGTGCC 

GAATGCAGACGCATGTGTTACGCAGCACAGTTCCGATTTTCGCACCGTAAAAACATAATTGGCATTGGAGATAGGAGTGGATTTTGGAGTGCGCT 

AGTAGTAGGAGAAGGAGCCAACCGCGTTCCGTCATGAGTTTTTCCAGCGACGAGGTGAACTTTCTGGTTTACCGATACCTGCAGGAGTCGGGCTT 

TCTGCACTCGGCCTACGTGTTTGGCATTGAGTCACACATCTCGCAAAGTMTATCAATGGCGCATTGGTTCCACCTGCCGCTTTGCTCACCATCC 

TGCAGAAGGGTCTTCTGTACACTGAAGTGGAGTGGAGCGTAGGCGAAGACGGCGAGGTGGCGCGACCCATTGAAGGACTCTCTCTAATAGACGCT 

GTCATGCCGGAGGTGAAGCCACTCAAGCCCATTGTGAAAACAGAGCCGGGAAAACCCGGGGCAGTCGACTCTTCTGCCCCTGCCGGTGGCAACCA 

GAACAACAATGCCAAGCCTGAGATAAAAATAGAACCGGGTACCGGAGTGGCGGGCTCAGCAGGTGGAAACAAAATTGCTGGCAGCACCACGGGCA 

CAAGCACACCAACCGATCAGTCGGCAAGCGAGGTAGACTCCAGTGGAAATGCGGCAAATAATGCTGGCGGCACATACGCCGGAAACAATGGAGCT 

GGAGGGAACCAGGCCAGTACTGGAGGATCTAATAGCACAAGCACACCAGCTGGTGGAGATTTGGCAGCACCTGGGGCAAGTCAAAAGAAATCCCA 

GAACTCGAATGAAGCGGGAAGCTCATCTAGTGGCAATGCAGGCAATGCAAATGCTACCAGTACCGATGATGCAGCCTCCTCAACATCCACAAATG 

GCAATAGCAGCACTTCCAGCAGCGTAGAACAACCTACAAGCGGGCTGACGCCCGCTGGTGGAACTGTATCAACCTCCAATCCGGACGCAGCAGCA 

TCGGGCGGAGCTTCGACTGCAACAGGATCAAAGGCGCCAAGTGGAGCGGTTACGATTCGCGTAGGAGCCCAAGGAAACAACGTGCAGTCTGGTTC 

GTCCAATGCTCAAAGCTCTGCTCCCAGTGGAACCATCTCCTCGTCTACCAGCGGAGGTGCTGGGACTCCCGCAGCTCTTGTTCCCATGGACATTG 

ACGAGAACATCGAAATACCCGAATCCAAGGCTCGTGTATTACGCGGCCACGAAAGCGAGGTGTTTATCTGCGCATGGAATCCTAGCCGGGATTTA 

CTGGCAAGCGGTTCGGGAGACAGCACTGCTAGGATATGGGACATGTCCGATGCAAACACTAACTCCAACCAGTTGGTGCTGCGCCACTGCATCCA 

GAAAGGCGGCGCTGAGGTGCCTAGCAACAAGGACGTCACCTCTCTGGACTGGAATTGCGATGGTTCCTTGCTCGCCACCGGCAGCTACGATGGTT 

ACGCCCGAATTTGGAAAACCGACGGTCGACTTGCTTCTACTTTGGGACAACACAAGGGTCCGATCTTTGCTCTGAAGTGGAACAAGTGCGGCAAC 

TACATCCTCTCGGCTGGCGTGGACAAGACGACGATCATCTGGGACGCATCCACGGGCCAATGCACCCAGCAATTTGCCTTTCACAGTGCTCCAGC 

CTTGGATGTGGACTGGCAGACAAACCAGGCCTTTGCCTCTTGCAGTACGGATCAGCGGATACATGTGTGCCGGTTGGGTGTAAATGAACCCATCA 

AGACCTTTAAAGGACACACAAACGAGGTGAATGCAATCAAGTGGTGTCCGCAGGGCCAACTGCTGGCGTCCTGCTCTGATGACATGACCCTCAAG 

ATCTGGAGCATGAACCGAGATCGTTGCTGCCACGATCTGCAGGCTCACTCAAAGGAGATCTATACGATAAAATGGTCGCCCACGGGGCCGGGAAC 

CAATAATCCAAACACAAACCTGATCCTCGCATCCGCTTCGTTCGATTCCACGGTAAGACTGTGGGACGTGGAGAGGGGCAGCTGTATCCACACGC 

TCACCAAGCACACAGAGCCAGTCTATTCGGTGGCCTTCAGTCCGGATGGAAAGCATTTGGCCTCCGGCAGCTTCGACAAGTGCGTACACATTTGG 

AGCACGCAAACGGGACAGCTAGTGCACAGTTACAAGGGCACGGGCGGAATCTTTGAGGTGTGCTGGAACTCCAAGGGCACTAAGGTGGGTGCCAG 

TGCCTCCGACGGTAGCGTTTTCGTGTTGGACCTGCGAAAGTTCTGATTGCCGGAGCAGCCACAACTGGATTTCGTGTATTGAGCCAACGCTTAAA 

ATAGTCAAGATTCCTTTTCCAGCTCATTATTTGGTAAAAATGAAATCGGTAGTTTTTCACTTTCTTACGGAACGAGAAGTTTTTACTTTTGATTC 

AAAAATTGCATTTAACTATATTTTTTAAAGAAGTGAAATATGATCAGTTACAATTCTACCAACATAAGTCGATTTGGATCTGGAGTCCGGACCTT 

AGGTAACTCCTTTCTTAGCTATCCCGCCCTGCAATCGTATTTGTAATTGCTTTAAATTCTAATTAGTTCTTTAAGGTATACCTCTTGAGATGAGA 

TCCTCCCGCTTTTGTCCACGTCTATTTCGTACGCTTAGTTTCGAGTCACAAAATATCGATATGCACCTGGGATTTTTGGGGGCTTCAACATGGTT 

GTTGTTCTCTTCAGATCAGCGGGATGCGACTCGCCTCATTTTAATCTTAATCATAAGCTCCATGTTTCTAAGGTCGTCCCTTTTTATACGAAAAT 

CGT T TAGT AT AAC TTTGCATTTTGTTAT T GT ATC G AT C AAAG ATG C C AGC C AAC TCCCCGCCCTCCC C AC C AAAAGT CC T T T AC C C AAT T CC AC A 
GTTGTCCTCCAAAAAAGGGATGCCAAGAGATCAGATCAGACCGATTCACGCATACATTCAGAACAGTGTAACTGCAGTCCCACGTAGTCAGTAAT 
CGAACGTAATTATTAAGAACACGAGATTGGCGGAAACACAAGAACATCAGCAAAAATACATTAAAGATAAAAATTAATCTGTTTTCGTTTTGGGC 
CTGGGTGTTTTTTGTATTTTAGCATTTTAAAATTTAATACTAAAGTACTTAAATTAAACTTTATTCTTTGTAAATATAGATA 

(SEQ ID NO: 428) 

Start ATG: 319 (Reverse strand: CAT) 

MSFSSDEVNFLVYRYLQESGFLHSAYVFGIESHISQSNINGALVPPAALLTILQKGLLYTEVEWSVGEDGEVARPIEGLSLIDAVMPEVKPLKPI 
VKTEPGKPGAVDSSAPAGGNQNNNAKPEIKIEPGTGVAGSAGGNKIAGSTTGTSTPTDQSASEVDSSGNAANNAGGTYAGNNGAGGNQASTGGSN 
STSTPAGGDLAAPGASQKKSQNSNEAGSSSSGNAGNANATSTDDAASSTSTNGNSSTSSSVEQPTSGLTPAGGTVSTSNPDAAASGGASTATGSK 
APSGAVTIRVGAQGNNVQSGSSNAQSSAPSGTISSSTSGGAGTPAALVPMDIDENIEIPESKARVLRGHESEVFICAWNPSRDLLASGSGDSTAR 
IWDMSDANTNSNQLVLRHCIQKGGAEVPSNKDVTSLDWNCDGSLLATGSYDGYARIWKTDGRLASTLGQHKGPIFALKWNKCGNYILSAGVDKTT 
IIWDASTGQCTQQFAFHSAPALDVDWQTNQAFASCSTDQRIHVCRLGVNEPIKTFKGHTWEVNAIKWCPQGQLLASCSDDMTLKIWSMNRDRCCH 
DLQAHSKEIYTIKWSPTGPGTNNPNTNLILASASFDSTVRLWDVERGSCIHTLTKHTEPVYSVAFSPDGKHLASGSFDKCVHIWSTQTGQLVHSY 

KGTGGIFEVCWNSKGTKVGASASDGSVFVLDLRKF* 
{SEQ ID NO: 429) 

Name: transducin (beta) like 1 protein 
Classif icatxon : signal_transduction 
Gene Symbol : ebi 
FlyBase ID: FBgn0023444 



Celera Sequence No. : 142000013384830 

ACCTTCTCCCAGATTTTTGTTGGAACTTATAGCCGTTTCCAGGAAAAGACTCGAGCAGGGAGATTAAGTGCATGTCGCAATCGAAGAGAACGTTG 
GCAGATTTGACCATTGTTATTTGAGGCTTCATCGACACTTTGATTAACTTTAGTTATTTTCTATGAACTGTATCTGTGTTGTGGGTTGCTAATTG 
TTTGAAACTTCTTTCTTGCAGACGTCGATGAGGCCTCCAAGAAGGAGATCAAGGACATTCTGGTGCAGTACGACAGAACCCTGCTGGTCGGCGAT 
CCGCGTCGCTGCGAGCCCAAGAAGTTCGGCGGTCCAGGTGCCCGTGCTCGCTACCAGAAGTCGTACCGTTAAACTACTCTGTCTGGTTTTCAATT 
TGGATTAGAGCTATTTTTTACAATTTGAAAATGGTTTGCACGCACGTGCGAATAAAACGGAATAATTAACTGTACAAACGAGTAGTTTTGTTCTA 
TAAAGTGGGTGAAATATGTATAACAAAACCTTTTTTGGAACTAAGTGACGGTTTCCAGAAGAATTGAGTTTTCGAAAAATTAATTTTTGAAGCAG 
ATGATATTCAGTTTGAGAACATAATTTTATAAAGAGGTGTTTATATCTTGGAAGGAGTAATATTTGAAAATTCTTGAACAATGTTCGGTAGGTTT 
GCGCCTCTGCTAAGAACAATCACTGAGTATGGTATCTTTTTGGTATTTAATTAGATGGACTTATTTCTGAACTCCGCCAACGGCCACTCTATAAT 
GCAGCCATCGTTTGTTTTTCAAAACGTATAGCTAAATTTCGGGTTAATAATATCTATCAAATGTTTATTTGAAAATTTAAAATTCTACAAGTAAT 
TTTTTCCCCTATAAGCACTTATATATATATTTATTTTTATATTCATATTTTATTCCTTTGCAAATTTTTTCGACTTAAGATTTCCTCGAATTTCC 
CACTGCAGCTACAAAGTAGCTAAGTTGCCTACACAGCCACGTAGTTAGTCATGGGAGCTTTGTTTGGAAAAACCAGCAAAAAGACGGCTCCTAGT 
CGGATCACCGACCAGGACAAGGCGGTTCTGGTAAGTTGGTGGTAACTGAGTAGCCAGAAAGTTTGCCCAAACACTTAGCCACTTCCCCATTGCAG 
CAATTGAAGCAACAGAGGGATCGTCTGAAGCAGTACCAAAAACGCATCGAGACGCAGTTGGAGAATGATCGCCTCCTGGCAAGGAAGTGCCTACA 
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GCAAGGTCGCAAGGAGTATGTGTCATACCCATATGTGATCTAAACATGTTCAACTAAACCGATTGCTTTCATTTTAGCCGGGCCAAGCTGCTGCT 
GCGCAAGAAGAAGTACCAGGAGAGTCTGCTGACCAATGCCGACAAACAGCTGGAAAACCTGGAGAAGCTGGCTGCGGACATTGAGTTCGCCCAGG 
TGGAGATGAAGGTGCTGGACGGCCTCAAAGCGGGCAATGCTGCTCTGAAGAAGGTGCACGAAATGCTTGACATCGACGAAGTTGAGCGAATCATG 
GACGAGACACGCGAGGGCATCGAAAAGCAGCAGGAAATAGACGCCATACTGACGGATGTGCTGACCACGCAGGACGAGGAGGACGTTTTGGCCGA 
GCTGGATGCCCTGGAGGCGGAGGAAGAGCAGCAGAAGGGTGCACAGCTGCCGGATGTGCCCACCGAAGATCTGCCCATTCCCGCTGAGATCGAGT 
CCGTCGAGGAGCCGGCAAAGACCAAAGCAACCAAGAAAGTCCTGGTGGAGGCATAGAACCTATAGTAAAACGAAACCAATGCCATTTTGTATGTA 
TATATGTCGTTCAACTAAATTTCTCGATAAGTCCAATGTGAAATACATATTTTAATAGATTATGCTTGACTAAAGAACACATAGACTCGCATGTC 
TACTTGTGTATTGTCTTTGTTGATCGACGCAGGTCCTCCTTCACCTTGCTGCGCAGCATCTTATTGGTCTCGTTCAGCTTCTCGATCTCCTCAAT 
CAGGATCACGTTCTCCTTGAACAGCTTGTCGTACTTCTTCTGCACGGATTTGTCCTCGGCCACCGACTTGTATCGGTCCAGAACATTCTCGATCT 
GCCTGCGCTGCCGCATAAACTCGTCCCTGACCTCCGCGTCCAGAGTGACGAATCGCTTTAGCTCGTCGTCGGAGGCGTGTTTGCGGAACAGTTCT 
TTCACCGCCTTCTTGAGGGCCTCGGCCGAGTTGATCTCCCCGGCCACGTAGTAAATATCCGAGCAGATGGAGTGCAGGCACTCCCGAGAAGCCTT 
TGCCCGATGTCGCTCCGTCCGAAGTTCGGCCACATTACTGAGGTATTTATCCCGCATCTCCTTTAACTGCAGCTCCAACTGCACGTTGTTCTGAT 
TGAGACCCTCCAGCTCCGCTTCCATTTCGATGATGTGCTTGCGCTTGTCGTTGATCTGAAACTCACGCGGCTCAATCTGCGCCTTGAGCTCGGCA 
ATCTTGTGGCCCAGCACCTGCTTGTACTTGTCCAGCTCCTGGTTCTTGTGCAGCAGGTCTTGGATGCGTTTTTCCTTGCCGTTGATGGCGTAGTC 
GCGATCGGCAATGTCCTTCTGTAGATCCTCGATGTTGCGCATCTGTTTCTGGATGTTGCGCTGCGACTTGTGGTGCTCCTCCTTTAGGATCTCAA 
CTTCCTCTAGGAGATTGTCAATCTCTCGGCTCTGCGACTCGAATTTCTTCTGCAAGACACCCGCTTTACCTCGCCACATTTGGGTTTCATTTCGC 

TCAGTGGTCAA 
(SEQ ID NO: 430) 

Exon: 1001.. 1075 
Exon: 1141.. 1250 
Exon: 1313.. 1766 
Start ATG: 1001 

Transcript No. : CT13482 

ATGGGAGCTTTGTTTGGAAAAACCAGCAAAAAGACGGCTCCTAGTCGGATCACCGACCAGGACAAGGCGGTTCTGCAATTGAAGCAACAGAGGGA 

TCGTCTGAAGCAGTACCAAAAACGCATCGAGACGCAGTTGGAGAATGATCGCCTCCTGGCAAGGAAGTGCCTACAGCAAGGTCGCAAGGACCGGG 

CCAAGCTGCTGCTGCGCAAGAAGAAGTACCAGGAGAGTCTGCTGACCAATGCCGACAAACAGCTGGAAAACCTGGAGAAGCTGGCTGCGGACATT 

GAGTTCGCCCAGGTGGAGATGAAGGTGCTGGACGGCCTCAAAGCGGGCAATGCTGCTCTGAAGAAGGTGCACGAAATGCTTGACATCGACGAAGT 

TGAGCGAATCATGGACGAGACACGCGAGGGCATCGAAAAGCAGCAGGAAATAGACGCCATACTGACGGATGTGCTGACCACGCAGGACGAGGAGG 

ACGTTTTGGCCGAGCTGGATGCCCTGGAGGCGGAGGAAGAGCAGCAGAAGGGTGCACAGCTGCCGGATGTGCCCACCGAAGATCTGCCCATTCCC 

GCTGAGATCGAGTCCGTCGAGGAGCCGGCAAAGACCAAAGCAACCAAGAAAGTCCTGGTGGAGGCATAG 

(SEQ ID NO: 431) 

Start ATG: 1 

MGALFGKTSKKTAPSRITDQDKAVLQLKQQRDRLKQYQKRIETQLENDRLLARKCLQQGRKDRAKLLLRKKKYQESLLTNADKQLENLEKLAADI 
EFAQVEMKVLDGLKAGNAALKKVHEMLDIDEVERIMDETREGIEKQQEIDAILTDVLTTQDEEDVLAELDALEAEEEQQKGAQLPDVPTEDLPIP 

AEIESVEEPAKTKATKKVLVEA* 
(SEQ ID NO: 432) 

Classification: hypothetical 



Celera Sequence No. : 142000013384676 

TACGCCTTGCAGTGGACGAAGTGGTCAAGGCCACCATGGCGACAATTACAACGACATCGCCCAATGAAACCCGCCAGGCGACTGACCATTTGACT 
CCGGTGGCCTCGCGTCAAGAATTGAAAGATCAGCTCCTACAGCGTAGCAAGGAGGACAAGATGCGGCATCAGCTGAACACGCCCACCACCCAATT 
CGGACGAGCTCTACAAAAGACTCTGCAGCTTATCGAAACGCAGATTGTACAGGATCATCTCCGTGCGCTGCAGGATGCACCACCTATTCACGAAC 
TTTTTGTGTTCAGCGACATTGCTACCGTGCGGCGAAACATAATTGGTGCTCCGCGGGCTGCACTGCACACTGCCCTGAATAATCCACATTTTTAC 
ATGCAGTGCAAGTGCTGCGAGCTACAAGATCAATCACTGCTGGTTGGCACACTGCCCGATCTCTCCGTGGTTTACAAGCTGCACCTGGAGTGCGG 
TCGCATGATCAACCTTTTCGACTGGCTGCAGGCCTTTCGATCTGTTGTGAGCGACAGCGACCACGAGGAAGTGGCCCAGGAACAAATCGATCCCC 
AGATCCAGTGAGTAAAAAAATGGATTTCTCTCTGTAATATCATCAATTTCATTTGGTTTTCCTTTCAACCAGGGCCCGATTCACGCGTGCCGTAG 
CCGAGTTGCAGTTTCTGGGGTACATTAAGATGTCGAAGCGCAAGACGGATCATGCCACTCGATTAACCTGGTAGCTCGCTCGGGCTATGAATAAC 
CAAAATCCCTTAGGTATTTTTATCTTTTCTATTTATTTATATTGGGGCAAATTTCATGTGTGGGACGTTATGAATAAATAAAACATCAATCGATT 
TGTCTACTCTGTACGAAATGCATTAACTTTGTTTGCGAGTGAAGCGCGTTCTTGTATTTTTTAAGTAAAGCCCAGCAGCTGGCAAGGCAAGCGTA 
CTAAATATCAATTACCAGTCTTCCCGACTAGCGGTTACCACGTTCCCTGTTCACATTAGTTTCAGAGTCTCGATGGCGGCAGCGATGCGTTCGAT 
CTCTGCCCGCGACTCGCAGAGTTTTGTCTCGTTGGCCTCCTGCACCTCGGCGGGAACCTTAGTTGCGTAGTCGGCGGCCTGGATGGCCTGAGTTA 
GCTTGCCCACCGTCTGCACCAGCTGGTCGCTCTTCTTCTGCAGCTTGGCTATCTCCTTGTCCGCCTCCACGAGTCCCTTGAGCAGCAGGTGCACC 
TCGCACTGGCCCGTCACGGTGAGGATCGCACAGCCCTGCGGTGCGGGACTGTCAAAGACCACATTGGAGCAGTAGGAGATTGTGGCCAGATCGCT 
GGCGTAGCGTTTGAGTATTTCACTGGGCACCGAATCCGTGCATACTATATACACCTCGGTTTTAACCTTGTTGGGCAGATTATAGTCGGAACGGG 
CAGATCGGATGATTCGCGCCGCCTTCTGCACAAATTCAACGTCCGATTCTATCTTTGTGCTGCGCCAGGATGTATTGCCTAGTTATGAGGAAGTT 
AAATTTATTTGAATTCTATCATCATGAAGAAATCAAATGCGACTTACTGGGATAGCTAGCCACGCAAATGCTAGGAGCGGGATTTGCCCTCGGAA 
GCCGCTGGTAAAGTTCCTCCGTGATGAAAGGCATGAACGGGGAGAGTAAGCGCAGTCCATAATCTAGACATACGTAGAGCGTGCGCCTTGCGGCG 
GTCTGCTGCTCCTCGCTGCCGCTCTGGAAGATCGGCTTCAGACACTCCAAATACACGTCGCACAGATCGTAGAGCCAGAATGCGTAGCAGGCGCT 
GGTCGCTGCAGCAAAGTCGTACGACTCAAAGCCGGTGTTGCAAGCTTCTATGGCAGCTGCCAAGCGTGACAGGATCCAGGCGTCCATCTGGTTGA 
TCGCTGCAGACGCACTTAGCTCCGTGTCAAACTTCTCAGAGCCGGTGAAGTACAGCAGGGCGAACTTGGTGGCGTTCCACAGCTTGTTGCAGAAG 
AAACGATATCCCAGAACGCGATTGATGTCCAGGTTGATGTCGCGCGCTTGCGTGATATAGGCGCACAGCGCGAATCGAAGTGCATCCGAGCCGCA 
CTCGGGAATTCCCTGCGGATAATCCTGTTTCTGTCCAGCCTTAGCCTTTTCGATCTCCCGAGGATCCAGGTTGGAGCCCACAAGTTGAGCATGCA 
GTCCCTCGAGCGTAATGCCACGAATAACATCCATAGGATCGATAACATTTCCCAGGGACTTGGACATTTTGCGTCCGTGCGCATCCCTCACCATG 
GGGTGAAGATAGACCTCCTTAAACGGCAGTTTGCCCAGCAGCTTCTGTCCAAAGAATACCATGCGAGCCACCCAGAAGAAGAGAATGTCGTGACC 
CGTCTCCAACAGCGATGTGGGATAGAACGTTTGCAGATCCTTAGTTTGGTCTGGCCAGCCAAAAACGGAGAAGGGAAAGATTCCCGAACTGAACC 
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ATGTGTCCAGGACATCTTCGTCTTGTTTCAGTACTATCTTGCTGGCGTCCACTCCAAAACGTTCGGCTGCCTTGGTCAGAGCCTCCGCTTCGCTG 
CGTGCTACAATCCAGTACTGCTCGTCGTCGTTCTAAAAATGAAACAAATAGGAAAATTTTAATTTGTCTAATTTAAAAGTGTTGTCTATAATTTA 
TAGAGTTATGTTTTGGTTTTTTACGCACCACACTTCCATGAGCAGACTAAGATATCCAACGAAATATATTACCCATACAAATTATTATTATCCAA 
AGTAGATTTACTCACCGATCCAGTTTGAAGAGAAGGGTCGGTGAAGCTGACATGGTATGCCGGAATCCGGTGACCCCACCACAGCTGCCTCGACA 
CGCACCAGTCGCGAATACCGTCCATCCAGTGGTACCACGTCTTGGTGTGATGCTCCGGAATGATCTTTAGTTCTCCGGAGCGCACTGCCTCAGTG 
GCAGAGGCTGCCATGTCCGAGCAACTAACATACCACTGCGGCTTAATCAGCGGCTCCACCACGTCCTTGGAGCGACTGCAGATGGGCACCACCAT 
CGGGTTGTTCAATGTCTCGCGGTATAGATTCAGGGCCTTAAGCTTCTCCAGGATTTTCTTGCGGCACTCGAACCGCTTCATTCCAGTAAACTCTC 
CGTAGTCGCCGATTATGTAGCCGTCGTCGTTGAAGATTGTAATGAAGGGCAAATTGCAGCGCTTGCCCACCTCGTAATCGTTGGGATCGTGGGCT 
GGGGTGATCTTCACAGCACCCGTACCGAAAGCCATGTCCACGAACTCGTCACAGACGATTGGCAGGCGGCGCGTGGAGAACGGATGGACTACGAA 
CTTGCCATGCAGGTGCTTGTACCGATCGTCTTGGGGATGCACAGCCACAGCAGTATCGCCCAGCATAGTCTCGATACGGGTGGTGGCTACAATGA 
TCTCTTCGTCACTGCCTTCCACCTTGTAGGCGAACTTTATTAGAACACCAAATTCCACTTTGTCCTCGTATCCGGGAATGGAGAGGAATGTGCGA 
CCGGGAATCTCCACCTTGTCCACTTCAATGTCCGAAATAGCCGATCGTAGGGTGCACGACCAATTGACCAGTCGGGAGCTACGGTAGATGCTGCC 
CTCCTCGTGCAATCGGACGAAAGCCTCGGTAACGGCGCGACAGAGCTTGGGGTCCATGGTGAAGGCCACTCGGGTCCAGTCATAGGAGGAGCCCA 
GGGACTTCAGCTGCTCATAGATGCGACCGCCCTTTTCCCGGCGCCAGTCCCAGATGCGCTCAATGAACTTTTCGCGACCTAGGTCGTGCCGCGAT 
AGCTTCTCGTCGCGCCACAATAGCTTCTCTACCACCACCTGGGTGGCGATGCCGGCGTGGTCGCAACCAGGCACCCATAGAGTGGTTCGTCCCTT 
CATGCGGTGGTAGCGCGTGATGGCGTCCTCGATGGCGTTGGTCAGGGCGTGACCCAAATGCAGAGAGCCCGTCACGTTTGGTGGTGGAATGATCA 
TCACAAACTTGCCATTGGGATTGGGGGCGTCTATGGATGCGCGCTAAATGTTCAAAGAGTATTAGAAACCAGCATTTAACCAAAACCAAGGATAA 
ACTCACTCCGTACTCGGGCGTGAAGAAGCCCTCCTTCTCCCACCAGCTGTACCACTGTGCTTCCACATACCGTGGACTGTAGGCATCCGGCAAGG 
CTCCACTCAGATCCTTCTTCTCACCGGGCGCGGTTTGGGCCGTGTACACGGCAGCCTCTTTCACCTCCTTGGTGCGCTTCTGCAAAGTAAAGTAA 
TTCGAGCATTACATATAAGTAGCACTCGCTTTCACCAGTAACAATGGGCAGCAATTACTGCCTCTCACTTGGCCACATAACCTCGATTTGAGATC 
ACAAAATAGGAGGCACTTTTGGGATGGATTTTTCACTAACCTCGGGCTTCTCCTTCTTCTCACCCGCTGCTGGAGCAGCCGCAGCCTTCTTGTCC 
AGCTTCGCCTGCAGCTTGGCCAGCTTCTCCGCCTTCAGACGCTCCTTTTCCAGTTGCTTGGCCGTCTTGGGAGGATCACCCCCGGCTCCGGCTGG 
TGCGTTGGCATCGTTCTGCTGTTCAGCCTCGGGCATTTTTCAGTTGATCTCAGGATTAAATCTGGATAGTGCAACTGTAAGTTAAATGACGAAAT 
GG AC G AAGT AC AT T ACT C ACAT GC AGT T AGACT GG AGT C GC AG AAAAAC AT AT G AAAACG T GT GCAGTTTGGG AC CAGGGTTGCCAAC AG AT ATT 
ATTATTTTGAATTCCATTTGGCACATAACATTTAATCATTTTTAAAAGGTATAAAAACTGAATTTATGATTTATATTAAGTGTATTTTAAAATTC 
TTATGTAAGATTAATATTTTGTATACATAAATTCTTTAATATTCATTTATTTTTAAACGGAAAAACAATAGAATTTTATCACCTTTTAAAATATA 

AG GT G C T T C AAAT AT T T AG AAATC AAT GAT AAT AT ATT T T AC G AT AACT AAAT AC GAC TAT AT T T T AAC AT GT T GT AT AAAT AT T T T AAAAGT T T 
TTAAATTTTTCCAGTATGAAAAAGATTTAAATTATATAAAACAATCTGAAATTGCTTACTCCGGAATCAAAACTTAAATGGTCAAGTTATTCCAA 
AATATATTTTTTATATCTTAAATGCTTTATTTTGTTAAAAGCTTTAACGAAAATGTAGTTTAACAGCACTGTCGGAATTCGATAGGCCTGCTCAA 
AATTTAGATTGCGTTATCGCTGCAATCTGGTCACACTTACAGTCGTGTTTTTTGGTGTTTTTGTATGGGAAGATTTTTGCGAAAAATAAATTCAG 
AAGCTAGTGTATTGTGTGTTGTTCCTTTGTGAAATGTAAAGTGGGACGAACATCAGAGATCGGTAAATACTATCACAGTGTACCACCCCATAAAA 

TG AC AAAC T AT T C C GT T TAG T GT AAGT GC T C C CAT T CG AGT G T GT GT C C ATT GG AC T G C AG AGG AAG CGT CT G AT AAGC AGC CAT C T T CC T AT AA 
GAATGGCAAGGAAATTCCTGCCAAATTTATATAAATTCTAAGAACTACTGTCGTGCAGCGATAGCATGGCCGTTCTAGCCGCTCACGAGATCGAA 

AAACAATAGGTTTAGCAACACAACAACAACAACACACTCGCAGACATGTGCAACCGAAAGCTAAT 
(SEQ ID NO: 433) 

Exon: 4670.. 4411 
Exon: 4259.. 4092 
Exon: 4033.. 2771 
Exon: 2597. .1568 
Exon: 1503.. 1001 

Start ATG: 4596 (Reverse strand: CAT) 



Transcript Wo. : CTlJbU4 

TAATGTACTTCGTCCATTTCGTCATTTAACTTACAGTTGCACTATCCAGATTTAATCCTGAGATCAACTGAAAAATGCCCGAGGCTGAACAGCAG 
AACGATGCCAACGCACCAGCCGGAGCCGGGGGTGATCCTCCCAAGACGGCC7AAGCAACTGGAAAAGGAGCGTCTGAAGGCGGAGAAGCTGGCCAA 
GCTGCAGGCGAAGCTGGACAAGAAGGCTGCGGCTGCTCCAGCAGCGGGTGAGAAGAAGGAGAAGCCCGAGAAGCGCACCAAGGAGGTGAAAGAGG 
CTGCCGTGTACACGGCCCAAACCGCGCCCGGTGAGAAGAAGGATCTGAGTGGAGCCTTGCCGGATGCCTACAGTCCACGGTATGTGGAAGCACAG 
TGGTACAGCTGGTGGGAGAAGGAGGGCTTCTTCACGCCCGAGTACGGACGCGCATCCATAGACGCCCCCAATCCCAATGGCAAGTTTGTGATGAT 
CATTCCACCACCAAACGTGACGGGCTCTCTGCATTTGGGTCACGCCCTGACCAACGCCATCGAGGACGCCATCACGCGCTACCACCGCATGAAGG 
GACGAACCACTCTATGGGTGCCTGGTTGCGACCACGCCGGCATCGCCACCCAGGTGGTGGTAGAGAAGCTATTGTGGCGCGACGAGAAGCTATCG 
CGGCACGACCTAGGTCGCGAAAAGTTCATTGAGCGCATCTGGGACTGGCGCCGGGAAAAGGGCGGTCGCATCTATGAGCAGCTGAAGTCCCTGGG 
CTCCTCCTATGACTGGACCCGAGTGGCCTTCACCATGGACCCCAAGCTCTGTCGCGCCGTTACCGAGGCTTTCGTCCGATTGCACGAGGAGGGCA 
GCATCTACCGTAGCTCCCGACTGGTCAATTGGTCGTGCACCCTACGATCGGCTATTTCGGACATTGAAGTGGACAAGGTGGAGATTCCCGGTCGC 
ACATTCCTCTCCATTCCCGGATACGAGGACAAAGTGGAATTTGGTGTTCTAATAAAGTTCGCCTACAAGGTGGAAGGCAGTGACGAAGAGATCAT 
TGTAGCCACCACCCGTATCGAGACTATGCTGGGCGATACTGCTGTGGCTGTGCATCCCCAAGACGATCGGTACAAGCACCTGCATGGCAAGTTCG 
TAGTCCATCCGTTCTCCACGCGCCGCCTGCCAATCGTCTGTGACGAGTTCGTGGACATGGCTTTCGGTACGGGTGCTGTGAAGATCACCCCAGCC 
CACGATCCCAACGATTACGAGGTGGGCAAGCGCTGCAATTTGCCCTTCATTACAATCTTCAACGACGACGGCTACATAATCGGCGACTACGGAGA 
GTTTACTGGAATGAAGCGGTTCGAGTGCCGCAAGAAAATCCTGGAGAAGCTTAAGGCCCTGAATCTATACCGCGAGACATTGAACAACCCGATGG 
TGGTGCCCATCTGCAGTCGCTCCAAGGACGTGGTGGAGCCGCTGATTAAGCCGCAGTGGTATGTTAGTTGCTCGGACATGGCAGCCTCTGCCACT 
GAGGCAGTGCGCTCCGGAGAACTAAAGATCATTCCGGAGCATCACACCAAGACGTGGTACCACTGGATGGACGGTATTCGCGACTGGTGCGTGTC 
GAGGCAGCTGTGGTGGGGTCACCGGATTCCGGCATACCATGTCAGCTTCACCGACCCTTCTCTTCAAACTGGATCGAACGACGACGAGCAGTACT 
GGATTGTAGCACGCAGCGAAGCGGAGGCTCTGACCAAGGCAGCCGAACGTTTTGGAGTGGACGCCAGCAAGATAGTACTGAAACAAGACGAAGAT 
GTCCTGGACACATGGTTCAGTTCGGGAATCTTTCCCTTCTCCGTTTTTGGCTGGCCAGACCAAACTAAGGATCTGCAAACGTTCTATCCCACATC 
GCTGTTGGAGACGGGTCACGACATTCTCTTCTTCTGGGTGGCTCGCATGGTATTCTTTGGACAGAAGCTGCTGGGCAAACTGCCGTTTAAGGAGG 
TCTATCTTCACCCCATGGTGAGGGATGCGCACGGACGCAAAATGTCCAAGTCCCTGGGAAATGTTATCGATCCTATGGATGTTATTCGTGGCATT 
ACGCTCGAGGGACTGCATGCTCAACTTGTGGGCTCCAACCTGGATCCTCGGGAGATCGAAAAGGCTAAGGCTGGACAGAAACAGGATTATCCGCA 
GGGAATTCCCGAGTGCGGCTCGGATGCACTTCGATTCGCGCTGTGCGCCTATATCACGCAAGCGCGCGACATCAACCTGGACATCAATCGCGTTC 
TGGGATATCGTTTCTTCTGCAACAAGCTGTGGAACGCCACCAAGTTCGCCCTGCTGTACTTCACCGGCTCTGAGAAGTTTGACACGGAGCTAAGT 
GCGTCTGCAGCGATCAACCAGATGGACGCCTGGATCCTGTCACGCTTGGCAGCTGCCATAGAAGCTTGCAACACCGGCTTTGAGTCGTACGACTT 
TGCTGCAGCGACCAGCGCCTGCTACGCATTCTGGCTCTACGATCTGTGCGACGTGTATTTGGAGTGTCTGAAGCCGATCTTCCAGAGCGGCAGCG 
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AGGAGCAGCAGACCGCCGCAAGGCGCACGCTCTACGTATGTCTAGATTATGGACTGCGCTTACTCTCCCCGTTCATGCCTTTCATCACGGAGGAA 
CTTTACCAGCGGCTTCCGAGGGCAAATCCCGCTCCTAGCATTTGCGTGGCTAGCTATCCCAGCAATACATCCTGGCGCAGCACAAAGATAGAATC 
GGACGTTGAATTTGTGCAGAAGGCGGCGCGAATCATCCGATCTGCCCGTTCCGACTATAATCTGCCCAACAAGGTTAAAACCGAGGTGTATATAG 
TATGCACGGATTCGGTGCCCAGTGAAATACTCAAACGCTACGCCAGCGATCTGGCCACAATCTCCTACTGCTCCAATGTGGTCTTTGACAGTCCC 
GCACCGCAGGGCTGTGCGATCCTCACCGTGACGGGCCAGTGCGAGGTGCACCTGCTGCTCAAGGGACTCGTGGAGGCGGACAAGGAGATAGCCAA 
GCTGCAGAAGAAGAGCGACCAGCTGGTGCAGACGGTGGGCAAGCTAACTCAGGCCATCCAGGCCGCCGACTACGCAACTAAGGTTCCCGCCGAGG 
TGCAGGAGGCCAACGAGACAAAACTCTGCGAGTCGCGGGCAGAGATCGAACGCATCGCTGCCGCCATCGAGACTCTGAAACTAATGTGA 

(SEQ ID NO: 434) 

Start ATG: 75 (Reverse strand: CAT) 

MPEAEQQNDANAPAGAGGDPPKTAKQLEKERLKAEKLAKLQAKLDKKAAAAPAAGEKKEKPEKRTKEVKEAAVYTAQTAPGEKKDLSGALPDAYS 
PRYVEAQWYSWWEKEGFFTPEYGP^ASIDAPNPNGKFVMIIPPP 

WRDEKLSRHDLGREKFIERIWDWRREKGGRIYEQLKSLGSSYDWTRVAFTMDPKLCRAVTEAFVRLHEEGSIYRSSRLVNWSCTLRSAISDIEVD 
KVEIPGRTFLSIPGYEDKVEFGVLIKFAYKVEGSDEEIIVATTRIETMLGDTAVAVHPQDDRYKHLHGKFVVHPFSTRRLPIVCDEFVDMAFGTG 
AVKITPAHDPNDYEVGKRCNLPFITIFNDDGYIIGDYGEFTGMKRFECRKKILEKLKALNLYRETLNNPMVVPICSRSKDVVEPLIKPQWYVSCS 
DMAASATEAVRSGELKIIPEHHTKTWYHWMDGIRDWCVSRQLWWGHRIPAYHVSFTDPSLQTGSNDDEQYWIVARSEAEALTKAAERFGVDASKI 
VLKODEDVLDTWFSSGIFPFSVFGWPDQTKDLQTFYPTSLLETGHDILFFWVARMVFFGQKLLGKLPFKEVYLHPMVRDAHGRKMSKSLGNVIDP 
MDVIRGITLEGLHAQLVGSNLDPREIEKAKAGQKQDYPQGIPECGSDALRFALCAYITQARDINLDINRVLGYRFFCNKLWNATKFALLYFTGSE 
KFDTELSASAAINQMDAWILSRLAAAIEACNTGFESYDFAAATSACYAFWLYDLCDVYLECLKPIFQSGSEEQQTAARRTLYVCLDYGLRLLSPF 
MPFITEELYQRLPRANPAPSICVASYPSNTSWRSTKIESDVEFVQKAARIIRSARSDYNLPNKVKTEVYIVCTDSVPSEILKRYASDLATISYCS 
NVVFDSPAPQGCAILTVTGQCEVHLLLKGLVEADKEIAKLQKKSDQLVQTVGKLTQAIQAADYATKVPAEVQEANETKLCESRAEIERIAAAIET 

LKLM* 

(SEQ ID NO: 435) 

Name: valyl-tRNA synthetase 
Classification: enzyme 

Celera Sequence No. : 142000013384624 

TAGTGGCAAAGACTATATTGATTTATGTTTAATCTATGTTCGAGAACGAAGCGCATCGAAATGCTTTTAAGACGTATTACACAACCAACTGTAAT 

TAT T AT C T GAT AT T TT C AAAG AT T T T AAAC AAC T T T T AT AC GT AAAC AAAAAT CAC AT AAAAAT T AAAAAAAAAC AAC AAAAAC GAAACG AAAAA 
CATTTATTTCAAAATAAACGAATCGAGACAGTGAGGCGTGGGGCAGTCTGCGATAATATGCAGGGCGGTCAAACCTACAGATATAATTACAAATA 
TTGATGTGCATACAATAAACTTATGCCTTTCCCTACAACCCGAAAAGTCCTCTTCCTTGGCCCGGAATCGAAAAAAATGTCAAAAAAGAAACAAA 
AATGGGCCAACACAGCTGGCGAGGCGAAAGCGAAAAGTTGCCAGCATATACACAAATATATTTCTGTAACTTCTATAATACACCAAGCGCCTGAT 
AAGCGCTAAAGAATAATATCTAAAACGAAAACGGATCAAAAACAAAGTACTTCAATGAGAAATAAAACGATATTTAAAGAATTCAAACTACCATG 
TTCTAACGGGATGAATTGGGCGTTGGGTATTTATATTTTGAAACCTATAAACTTATATTCGTATTAAATACCAATGTATTATTATTTGTTATTTC 
GCCTTTCTAGGGCGAAATAACAAAGACAATGCTTTTTTCCCTTTTCAGATTACCCTTTTATAGAAAAGCAACAAAAAATTAATCTGTAATTAATT 
TTATATAATATTGATTTTACGGCTATAAACAAAAGTTTGTCAACAAGTTTACAATTTTCGAATCTTTAATGTTTATATTAATATTATAAAGCCAT 
AAAAGGATTGGTAATTTTTTAATTTAAAATTCAATTTTAGTGTTGTGCTTAGCTAACGAGCCTTTACAACACTGCCCGCCAGCAATGCGATACTT 

GT AT C GAG CCGTCGTCTATC GAAAC GT C GGC C AATC G AAAT T GT T C AT CC C T GG AAG AAG AC AAG AAAAAAAT AAT AT AT AAT AAT AG AAT AT T C 

CACCGTGATAATATTGCCAAGCATTGTTTTCCGCGTCTTTGCCGTCGGTAATTGGGCAACAGTCAGAACGACCAAAAAGCCGAAGGGCCCCGAAC 

AGCAGGATAATCGGCAGAAAGTACGAGGCATCATCGACAGGCCAACTAAACCACAACGACAACCACATCCGCAGTCAAATCCCAATCCAAATCCA 

CGTCCAAATCCCATTTCGAGTGAAACAGCGAGTGAGACAGCGAAATGCCACTGTTCGGGAAGTCACAGAAGTCGCCAGTGGAGCTGGTCAAGTCG 

CTGAAGGAGGCGATCAACGCCCTGGAGGCGGGCGACCGCAAGGTGGAGAAGGCGCAGGAGGACGTCAGCAAGAACCTGGTCTCGATCAAGAACAT 

GCTGTACGGTAGCAGCGATGCCGAGCCGCCGGCGGACTACGTGGTGGCCCAGCTGTCCCAGGAGCTGTACAACAGCAACCTGCTGCTGCTGCTCA 

TCCAGAACCTGCACCGCATCGATTTCGAGGGCAAGAAGCATGTGGCGCTCATTTTCAATAATGTGCTGCGCCGCCAGATTGGCACCCGTTCGCCC 

ACCGTCGAGTATATATGCACGAAGCCGGAGATCCTGTTCACCCTGATGGCCGGCTACGAAGATGCGCATCCGGAGATCGCACTGAACTCCGGTAC 

CATGCTGAGGGAGTGCGCCCGGTACGAGGCGCTGGCCAAGATCATGCTGCACTCGGATGAGTTCTTCAAGTTCTTCCGCTACGTGGAGGTTTCCA 

CCTTTGACATTGCCAGCGATGCCTTTTCTACGTTCAAGGAGCTGCTCACGCGCCATAAGCTGCTGTGCGCGGAGTTTCTGGACGCCAACTACGAC 

AAGTTCTTCTCGCAGCACTACCAGCGCCTGCTCAACTCGGAGAACTATGTGACGCGGCGTCAGAGCTTGAAGCTGCTTGGGGAACTGCTGCTGGA 

CCGGCACAATTTCACCGTGATGACGCGCTACATATCCGAGCCGGAGAACCTCAAGCTAATGATGAACATGCTCAAGGAAAAGTCGCGCAACATTC 

AGTTCGAGGCGTTTCACGTCTTCAAGGTGTTCGTGGCCAATCCCAACAAGCCGAAGCCCATTCTGGACATCCTGCTGCGCAACCAGACGAAGCTG 

GTCGACTTTCTGACCAACTTCCATACGGATCGCTCCGAGGACGAGCAGTTCAACGACGAGAAGGCCTATCTGATCAAGCAGATAAAGGAGCTGAA 

GCCGCTGCCCGAGGCTTAGTTGGACCTCCACCTCGGCTCCAACTTCAGCTCCAACTCCAGCTCCAGCGTCCCGGGGGCAAAAGGACCGTAGCCAG 

AAGCCAGAAGCGATCGATACGGAAGCAGAGAGAGCGATGGTGGGTTAGCAGGATGTGAACATGTTTTGTGTGTGGGTTAAAGCAAATATATATTA 

TATACTATATTTTGTGACGCTGATGACTCTACACTTGTGAGTAACTAAACAATAATTTAGACAACAACAAAAACAACACTGATGATGATAATGAT 

AAAAAT AAC G AT AAAGCGAATT AG AAAACAAGAAAAAGTACGAGT AAAT AGCGTAACGCCGAAAAATAAAACTGTACCGAGAGTTTCGAATTTAA 

CATAGCCACATTTAGACATTCTGGGCCTAGAGATCCGCTATCCTATTATACAGATATGTTTAACCATTCGTCGATCCTCCGCGAAGTAATCCCCC 

TTCCTTTCCAGACCCTTTTTTTCGATATCCATTTAACTAGTTGTTGTATTTAAAATTTGATCGTTTTTGAAACGAGATATCCCTTCACCGCCTTG 

ACCTGCCACGCCCCTTTTGCGCCCCAAAAAAAAAAAACTGAAGATT CAC AATGAGTTTTGACTT AAAAAT GTAATTCC AC GCAAAATTCATGAAA 

AGAACGAAAACTAAATACTGCATAAAAATAATAATAATTCATAGTGATAATTGTCTGTGCATCCTTCGGCATGGACTCTACTATATATACACTGA 

TATATATATATAGAAAGAGCAAGAGAAAGAAAAAGAGAGAGAAAGCATTCAACACACACACACACTACACACATGCATACGTACATAGTTAAAAG 

GGAAGTAAAACAAGAAACAACAACATTGCAAGCAACATAATAACTATACGATTATTATAATTTATATTATTCCCAAATTAAAGATAACAGCCAAA 

CAAATCAGCAGTTTTTCCACTGTGGGCGGTGGTTTTCTCAGGGGGCGTGCAGCTCCCCCCTGAGTCATGTGCAAAAAATCGTGCCAGGGCGGCAA 

CCTCACCCAAAAAACGTCGATCAAGGGCTGGGGAGATGATTCACACGCTGGGAATGCCACCGAAACCAAAAGAGATTTCAAATCACGTTTCAAGG 

TGTGCCTAAATGAGTAAATGAGTCCATTCCAGGACTCAGCCTGCTGTAGGAATCCCATGAAGGAGTTTTCAAATGTGTGCAGGCTTCTACTTTAA 

TCGTTGTCGACTAGGTTTTATTACAATTATCTTAATAAATGAATTCGAGAGTTCTTAGGCTAGCTACATATGTATATATTAAGGATGGCTACATC 

TATCGTCGCTTCTTCTTCTCCAGCTTACGAATCTTGTGCAGGCGCGTCCGGACCGGCTCGCCCTCCCGCTCCGTCACTTCGCCGGACTCCGGATT 

ACTTTGGCCATCCGGGTTATTCTCCATCAATACTGCGGCCGAAATGCAAACGAACTGGAAGAGAGCACCCACATAAAGCCCGTATCGGATTAGCA 

AGCTGAAGATGTCCTCGTCGCCGTACTTGTCCAGGGCTGCGGCGGCGCCCAGACTGTCCGCGGAAGCCGACATCGCTGCGCTGGTATTGTTTATA 
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AACAAAACCCGCGAGGATTAGACAAAAAAAAATAACAACTACAAAAAGTTTGTTTTTTGTATTGGGTAGAGATGGTACCCTGAACCAACAGTTCA 
GGTGGAAACTATCGATAACTTTTTTAGGTCAAAGTTTTAAGGTTGTTAGCTCATTAACGGCAAGTATTTAAAGGCTTGCACAAAATCTTCGCTTG 
ATTTTATTTAAATTCAATTACCTGGATTATTAAATACATTTTATAAAGTTTAACACTAGTGAAATAAATATCCCATCCCCAACACATTCTTCAGG 
GTTACCGTTTAAGGTATGTTCAAAACTTTATGGTATATTCCGGAGCCCTAC 
(SEQ ID NO: 436) 

Exon: 1001.. 3231 
Start ATG: 1280 



CTGGAAGAAGACAAGAAAAAAATAATATATAATAATAGAATATTCCACCGTGATAATATTGCCAAGCATTGTTTTCCGCGTCTTTGCCGTCGGTA 
ATTGGGCAACAGTCAGAACGACCAAAAAGCCGAAGGGCCCCGAACAGCAGGATAATCGGCAGAAAGTACGAGGCATCATCGACAGGCCAACTAAA 

C C AC AAC G AC AAC C AC AT C CGC AGTC AAAT C C C AAT C C AAAT C C AC GT C C AAAT C C CAT T T C G AGT GAAAC AGC G AGT GAG AC AG C G AAAT GC C A 
CTGTTCGGGAAGTCACAGAAGTCGCCAGTGGAGCTGGTCAAGTCGCTGAAGGAGGCGATCAACGCCCTGGAGGCGGGCGACCGCAAGGTGGAGAA 
GGCGCAGGAGGACGTCAGCAAGAACCTGGTCTCGATCAAGAACATGCTGTACGGTAGCAGCGATGCCGAGCCGCCGGCGGACTACGTGGTGGCCC 
AGCTGTCCCAGGAGCTGTACAACAGCAACCTGCTGCTGCTGCTCATCCAGAACCTGCACCGCATCGATTTCGAGGGCAAGAAGCATGTGGCGCTC 
ATTTTCAATAATGTGCTGCGCCGCCAGATTGGCACCCGTTCGCCCACCGTCGAGTATATATGCACGAAGCCGGAGATCCTGTTCACCCTGATGGC 
CGGCTACGAAGATGCGCATCCGGAGATCGCACTGAACTCCGGTACCATGCTGAGGGAGTGCGCCCGGTACGAGGCGCTGGCCAAGATCATGCTGC 
ACTCGGATGAGTTCTTCAAGTTCTTCCGCTACGTGGAGGTTTCCACCTTTGACATTGCCAGCGATGCCTTTTCTACGTTCAAGGAGCTGCTCACG 
CGCCATAAGCTGCTGTGCGCGGAGTTTCTGGACGCCAACTACGACAAGTTCTTCTCGCAGCACTACCAGCGCCTGCTCAACTCGGAGAACTATGT 

G AC GCGGC GT C AG AGC T T G AAGCT GC T T G GGG AAC T GC T GC T GG AC C GGC AC AAT TT C AC C GT GAT GAC G C GCT AC AT AT C C G AGC C GG AG AAC C 
TCAAGCTAATGATGAACATGCTCAAGGAAAAGTCGCGCAACATTCAGTTCGAGGCGTTTCACGTCTTCAAGGTGTTCGTGGCCAATCCCAACAAG 
CCGAAGCCCATTCTGGACATCCTGCTGCGCAACCAGACG AAGCT GGTCGACTTTCTGACCAACTTCC AT ACGGATCGCTCCGAGGACGAGCAGTT 
CAACGACGAGAAGGCCTATCTGATCAAGCAGATAAAGGAGCTGAAGCCGCTGCCCGAGGCTTAGTTGGACCTCCACCTCGGCTCCAACTTCAGCT 
CCAACTCCAGCTCCAGCGTCCCGGGGGCAAAAGGACCGTAGCCAGAAGCCAGAAGCGATCGATACGGAAGCAGAGAGAGCGATGGTGGGTTAGCA 
GGATGTGAACATGTTTTGTGTGTGGGTTAAAGCAAATATATATTATATACTATATTTTGTGACGCTGATGACTCTACACTTGTGAGTAACTAAAC 
AATAATTTAGACAACAACAAAAACAACACTGATGATGATAATGATAAAAATAACGATAAAGCGAATTAGAAAACAAGAAAAAGTACGAGTAAATA 
GCGTAACGCCGAAAAATAAAACTGTACCGAGAGTTTCGAATTTAACATAGCCACATTTAGACATTCTGGGCCTAGAGATCCGCTATCCTATTATA 
CAGATATGTTTAACCATTCGTCGATCCTCCGCGAAGTAATCCCCCTTCCTTTCCAGACCCTTTTTTTCGATATCCATTTAACTAGTTGTTGTATT 
TAAAATTTGATCGTTTTTGAAACGAGATATCCCTTCACCGCCTTGACCTGCCACGCCCCTTTTGCGCCCCAAAAAAAAAAAACTGAAGATTCACA 
AT G AGT T T T GACT T AAAAAT GT AAT T C C AC GC AAAATT C AT G AAAAG AAC GAAAACT AAAT ACT GC AT AAAAAT AAT AAT AAT T C AT AGTG AT AA 
TTGTCTGTGCATCCTTCGGCATGGACTCTACTATATATACACTGATATATATATATAGAAAGAGCAAGAGAAAGAAAAAGAGAGAGAAAGCATTC 
AACACACACACACACTACACACATGCATACGTACATAGTTAAAAGGGAAGTAAAACAAGAAACAACAACATTGCAAGCAACATAATAACTATACG 
ATTATTATAATTTATATTATTCCCAAATTAAAGATAACAGCCAAAC 
(SEQ ID NO: 437) 

Start ATG: 280 

MPLFGKSQKSPVELVKSLKEAINALEAGDRKVEKAQEDVSKNLVSIKNMLYGSSDAEPPADYVVAQLSQELYNSNLLLLLIQNLHRIDFEGKKHV 
ALIFNNVLRRQIGTRSPTVEYICTKPEILFTLMAGYEDAHPEIALNSGTMLRECARYEALAKIMLHSDEFFKFFRYVEVSTFDIASDAFSTFKEL 
LTRHKLLCAEFLDANYDKFFSQHYQRLLNSENYVTRRQSLKLLGELLLDRHNFTVMTRYISEPENLKLMMNMLKEKSRNIQFEAFHVFKVFVANP 
NKPKPILDILLRNQTKLVDFLTNFHTDRSEDEQFNDEKAYLIKQIKELKPLPEA* 
(SEQ ID NO: 438) 

Classification: known_f lybase__gene 
Gene Symbol: Mo25 
FlyBase ID: FBgn0017572 



Celera Sequence No. : 142000013384676 

ATTAACCTCCATGGTTTTTATCTTCTTGCCAAGTGCCTTATTCTCAACCTTTAGCGCTTCAATTTCGGCCAGTGCCTTTTCGTATTTGGCTTCCC 
ATGCCTGTAGCTCCTTGGTATTCTGTGAGATGGTAAATGGGTAACTCTGCTGAAAGGACGTATTCCTGCACAACTTACACGATCTTCGGCCTTCT 
GAAAGTCGTCCAAATCATCGTAGATGTCCAGTTCAAGGGATCTGTCCATTTGGTCAGGCTTCTTTGAACCTGCAGGCTCTATATCGTACAAATCG 
TGATAGATATCCAAATCCATGGATTTATCCTTAACGTTGGACTTCATCCCGCCCGCGTCCTGGTCTTCCAAATCCATCAAGTCCTCCAGAGCTTC 
GGGCAGCACCAGTTCTCCGCCAGATTTCGTGGCATATGCAGGCGTTTCCATACCAATCGCTTACGTTTCTTCTTGGCTGCCCTTTAAACAAGGAT 
ACATAAAATGAATTGTTAGAGTTACTGTTTTTTAATGGAGCTACTTTATTGATTACCTGTTCAGTCATAAATTGATCAAAATTCGAAACACAAAA 
GAATTTCAATGTTCTTGATTTTTGTTTCAGGGAATAAAAGTTCCACTAAAGCTCAATGGTCACTGTCCTGAGCATTTCCTCCAAACTGGCGTGCA 
ATTTGTTTCCCTGCCGTGTATTTGTATAAATGTAATTATTATTATTATTCTATTTAATATTGTTAGAAAATCCCGTGTTGCCGCCAATCGATAAC 
AGAAGATGCAGTTGATTATCGGAAGCAACAGCTGACCGGCTGAAAATGTGGCAACCCTGATAGTGGATATCGATATCCAGCAGCAACATTTGGCG 
CCAAAACGAATATTTTAAAAATTTTATTATGCACTTACGTGTTTTCCCAAGAATATAATTTTAAAATCAATCTAGTTCCCATTTCGTATTACTGT 
ACACTTAAATTCGTTAAAAAAAAACATCACAATACAAACAATAATACTCGAGAAACACTACACAATGGCAGACATTCCGGAAAAAAGTTCTTACT 
ACAAGAAGGAAAGAAGAAAGTTCGCTCTGATCACTTACCTACTAACCGCGATATTCCTGATCCTTGCGCTGTTGCAATGGGCCACTTTCCGCTTT 
GTGTACGTATGTATTTTCACATTACTTCATCCTCACTGCACTTTTTCTTAATTTTCAGAAATTTTCTAAGGGAATTTTTTACTTCGTACCATTGG 
TTGAGCTGCCTTTTTTTTGGAATAGGCCTGATACTCCTCGTACTTTTCATATTTTTCGAGGTACTGCGCTTTAATAAAATGGTAAACTGGTTGTT 
TGCCTTTCTAATTGTAAGTCTACAATGTGAAACATAACCTGGGTGGCAGTATGAACTTATTCTTTCCAGTTTGAATGCATTGTACTGGGGATTGC 
ACCGCTGGTTGCCCGCCACTACAAGTATCAGTTTCTCTTCAGCTTCCTTATATGGACCGTAGCATTGGCGCTTTTTATTGTCTGTGGTTCCTTTC 
TTCCGGTAAGAAGAGTCCACATGAAAGCCATTTGATTCAAATCTGAAGCTTCACATTTTTATTGTTATCCCCTTAAGCTGGATCTTACTTTGGAT 
GTTGTGGTGCTATTTGTGCTTGCAGTGGTTTCCATTATCGGCGCCATATACTTCGTGATGCTCTACATCGTGGCTAATGTCGCATACTCGTTTAT 
AATCGCCCGTTGCTTTATAGTGATCAGCATCTTGATGGTTAGTACTGAGTCATACATTTTTGAATTAGTTTCTGTCATTTAGTTTCCTCCTTACA 
GTTTGTCATGTATCACGCGCAGATTATAAACGGTGGAAGGTTCGCCGAGATGCGCACGAAGGACTACTTCCTGGCAGCCATCATACTGTTCCTCG 
ATTTCCTGCTCCTCTATCTTTTCTCATTCCAAGTGGCCCCAAAATGGTCGGATCGTTGCGATGGCGAACGTTCAAAAAACTTGGTGCTGTTCACA 
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GAAACTACGACGAATTCGTATCCACCAAAATGGTGGCGTACGAGCACGAAGGGTATGATTTCTTATTATTAGTTCATGTTAGATGAAATATTCAA 
TTAATTTATACTTTTCACACTTGTTTTCAAAGATACCCCCTZVAAGTCGTTAGCCAACACTGCCGCCCGTTTGGCGCCAAACACAACAGCCAGCTG 
ATTTCATTTCGCGTTCCTACCGGTGAAGAAACTTTAATTGGCACAGAGCTGACCTAAACAAATAAATAAATAAACGCAACCCGGCGACTTAAAGA 
CTCCACGACATCGGGCCGCAGTGCAAAGCCAGCGATGGATCCCACCATTTCAGTGTCCAAGGTGAGTGGCTTCGGCTTCCAAACTGGACATAGAT 
GTTTAACTACTTATTTTTGTTGTCTGCAGGGCTGTTTTGTCTACAAAAATGGCGCTACGAGAGCAGGGAAAAAGGCGGCCAGCAAACGGAAGCGT 
CCTGCCGCAGAATCCAGTAGTCTTCTTGGCAAAGAGGTAGTGCAGCAGCCCTTCTATGAGGAATATCGCAAGGCCTGGAATCAGATAAACGATCA 
CATTGCCGACCTGCAGCACCGCAGCTATGCCCGCACTCTGGAGCAACTGGTGGACTTTGTGGTGGGGCAGGCAGAGCGCGACACCCCGGACGAGG 
^TGCTGCCCACCGCCGCCTTGCTAACGGGCATTAATCAGCCGGATCATCTCAGCCAGTTCACGGCACTTACCCAGCGACTCCATGCCCAACGTGCT 
GCAATGGTATGTGTGCTGCAGTCCAGGGATTGCGCCACTCTCAAGGCGGCCGTTGAGACATTGGTTTTCGGGCTCGTAGAAGATAATGCAGAAGT 
GGAGCAAATGGAAGACGAGGACGAGGACGAGGATGGTGCCGAACGGGATCGCAAGCGGCTACGTCGCTCGCAGTGCACCATGAAGCAGTTGAAGT 
CTTGGTACACGAACAATTTTGACTCGGAACAGAAGCGCCGCCAGCTGGTCGTCATCCTTCCCGACTTTGAGTGCTTTAACGCAAGCGTGCTGCAG 
GATCTTATTCTAATTTTGAGCGCTCACTGCGGCTCGCTTCCATTCGTCCTGGTCCTGGGAGTGGCCACGGCCATGACAGCTGTCCATGGGACACT 
ACCATACCACGTCAGCAGCAAGATTCGTCTAAGGGTGTTCCAGACTCAAGCTGCTCCCACGGGCCTTAATGAGGTAATATTCCTTAACTCTGTGC 
TAGTTTATGTACTAATCGGTTTTCTAACAACCTTTAGGTGCTGGATAAAGTTCTGCTGTCGCCCAAGTACGCTTTCCACCTGTCGGGAAAGACAT 
TCAAGTTCCTGACGCACATCTTCCTATACTACGACTTCTCCATCCACGGCTTTATTCAGGGCTTCAAGTACTGCCTGATGGAACACTTCTTTGGT 
GGGAATGCCTTTGCTCTTTGCACAGACTACAGCAAAGCATTGGGACGCATAAAGCAGCTGACCCACGAGGATATGGAAACCATCAGAAGGCTCCC 
ATCGTTTCGTCCGTACGTCGAGCAGATCAATGATTGCAAACGCATCATAGCCGTGCTCACGGACGATGACTACCTAAAGAAGAAACTGCCGCAGT 
TGCTGCGCGACTGCCTGCTCCACTTCCTGCTTTTCCGCTGCTCTCTGGAGTTCCTTACAGAGCTGGTTGGTGATTTGCCGCGCTGTCCGCTGGGA 
AAGCTACGCCGGGAACTGTACGTCAACTGTCTTAATCGTGCGATCATTTCGACGCCGGAGTACAAGGAGTGCCTGCAGATGCTAAGCTTTCTGTC 
CAAGGACGAATTTGTTGCTAAAGTGAATCGAGCCCTGGAAAGAACCGAACAGTTCCTGGTTGAAGAAATTGCTCCGCTGGAATTGGGCGAGGCCT 
GCACTGCTGTGCTGAGACCTAAGTTAGAAGCTATACGCCTTGCAGTGGACGAAGTGGTCAAGGCCACCATGGCGACAATTACAACGACATCGCCC 
AATGAAACCCGCCAGGCGACTGACCATTTGACTCCGGTGGCCTCGCGTCAAGAATTGAAAGATCAGCTCCTACAGCGTAGCAAGGAGGACAAGAT 
GCGGCATCAGCTGAACACGCCCACCACCCAATTCGGACGAGCTCTACAAAAGACTCTGCAGCTTATCGAAACGCAGATTGTACAGGATCATCTCC 
GTGCGCTGCAGGATGCACCACCTATTCACGAACTTTTTGTGTTCAGCGACATTGCTACCGTGCGGCGAAACATAATTGGTGCTCCGCGGGCTGCA 
CTGCACACTGCCCTGAATAATCCACATTTTTACATGCAGTGCAAGTGCTGCGAGCTACAAGATCAATCACTGCTGGTTGGCACACTGCCCGATCT 
CTCCGTGGTTTACAAGCTGCACCTGGAGTGCGGTCGCATGATCAACCTTTTCGACTGGCTGCAGGCCTTTCGATCTGTTGTGAGCGACAGCGACC 
ACGAGGAAGTGGCCCAGGAACAAATCGATCCCCAGATCCAGTGAGTAAAAAAATGGATTTCTCTCTGTAATATCATCAATTTCATTTGGTTTTCC 
TTTCAACCAGGGCCCGATTCACGCGTGCCGTAGCCGAGTTGCAGTTTCTGGGGTACATTAAGATGTCGAAGCGCAAGACGGATCATGCCACTCGA 
TTAACCTGGTAGCTCGCTCGGGCTATGAATAACCAAAATCCCTTAGGTATTTTTATCTTTTCTATTTATTTATATTGGGGCAAATTTCATGTGTG 
GGACGTTATGAATAAATAAAACATCAATCGATTTGTCTACTCTGTACGAAATGCATTAACTTTGTTTGCGAGTGAAGCGCGTTCTTGTATTTTTT 
AAGTAAAGCCCAGCAGCTGGCAAGGCAAGCGTACTAAATATCAATTACCAGTCTTCCCGACTAGCGGTTACCACGTTCCCTGTTCACATTAGTTT 
CAGAGTCTCGATGGCGGCAGCGATGCGTTCGATCTCTGCCCGCGACTCGCAGAGTTTTGTCTCGTTGGCCTCCTGCACCTCGGCGGGAACCTTAG 
TTGCGTAGTCGGCGGCCTGGATGGCCTGAGTTAGCTTGCCCACCGTCTGCACCAGCTGGTCGCTCTTCTTCTGCAGCTTGGCTATCTCCTTGTCC 
GCCTCCACGAGTCCCTTGAGCAGCAGGTGCACCTCGCACTGGCCCGTCACGGTGAGGATCGCACAGCCCTGCGGTGCGGGACTGTCAAAGACCAC 
ATTGGAGCAGTAGGAGATTGTGGCCAGATCGCTGGCGTAGCGTTTGAGTATTTCACTGGGCACCGAATCCGTGCATACTATATACACCTCGGTTT 
TAACCTTGTTGGGCAGATTATAGTCGGAACGGGCAGATCGGATGATTCGCGCCGCCTTCTGCACAAATTCAACGTCCGATTCTATCTTTGTGCTG 
CGCCAGGATGTATTGCCTAGTTATGAGGAAGTTAAATTTATTTGAATTCTATCATCATGAAGAAATCAAATGCGACTTACTGGGATAGCTAGCCA 
CGCAAATGCTAGGAGCGGGATTTGCCCTCGGAAGCCGCTGGTAAAGTTCCTCCGTGATGAAAGGCATGAACGGGGAGAGTAAGCGCAGTCCATAA 
TCTAGACATACGTAGAGCGTGCGCCTTGCGGCGGTCTGCTGCTCCTCGCTGCCGCTCTGGAAGATCGGCTTCAGACACTCCAAATACACGTCGCA 
CAGATCGTAGAGCCAGAATGCGTAGCAGGCGCTGGTCGCTGCAGCAAAGTCGTACGACTCAAAGCCGGTGTTGCAAGCTTCTATGGCAGCTGCCA 
AGCGTGACAGGATCCAGGCGTCCATCTGGTTGATCGCTGCAGACGCACTTAGCTCCGTGTCAAACTTCTCAGAGCCGGTGAAGTACAGCAGGGCG 
AACTTGGTGGCGTTCCACAGCTTGTTGCAGAAGAAACGATATCCCAGAACGCGATTGATGTCCAGGTTGATGTCGCGCGCTTGCGTGATATAGGC 
GCACAGCGCGAATCGAAGTGCATCCGAGCCGCACTCGGGAATTCCCTGCGGATAATCCTGTTTCTGTCCAGCCTTAGCCTTTTCGATCTCCCGAG 
GATCCAGGTTGGAGCCCACAAGTTGAGCATGCAGTCCCTCGAGCGTAATGCCACGAATAACATCCATAGGATCGATAACATTTCCCAGGGACTTG 
GACATTTTGCGTCCGTGCGCATCCCTCACCATGGGGTGAAGATAGACCTCCTTAAACGGCA 
(SEQ ID NO: 439) 

Exon: 1001.. 1142 
Exon: 1199.. 1343 
Exon: 1400.. 1525 
Exon: 1598.. 1747 
Exon: 1807.. 2047 
Exon: 2123. .2341 
Exon: 2405.. 3208 
Exon: 3268.. 4505 
Exon: 4571.. 5236 
Start ATG: 2315 

Transcript No. : CT13534 

AGAAACACTACACAATGGCAGACATTCCGGAAAAAAGTTCTTACTACAAGAAGGAAAGAAGAAAGTTCGCTCTGATCACTTACCTACTAACCGCG 
ATATTCCTGATCCTTGCGCTGTTGCAATGGGCCACTTTCCGCTTTGTAAATTTTCTAAGGGAATTTTTTACTTCGTACCATTGGTTGAGCTGCCT 
TTTTTTTGGAATAGGCCTGATACTCCTCGTACTTTTCATATTTTTCGAGGTACTGCGCTTTAATAAAATGGTAAACTGGTTGTTTGCCTTTCTAA 
TTTTTGAATGCATTGTACTGGGGATTGCACCGCTGGTTGCCCGCCACTACAAGTATCAGTTTCTCTTCAGCTTCCTTATATGGACCGTAGCATTG 
GCGCTTTTTATTGTCTGTGGTTCCTTTCTTCCGCTGGATCTTACTTTGGATGTTGTGGTGCTATTTGTGCTTGCAGTGGTTTCCATTATCGGCGC 
CATATACTTCGTGATGCTCTACATCGTGGCTAATGTCGCATACTCGTTTATAATCGCCCGTTGCTTTATAGTGATCAGCATCTTGATGTTTGTCA 
TGTATCACGCGCAGATTATAAACGGTGGAAGGTTCGCCGAGATGCGCACGAAGGACTACTTCCTGGCAGCCATCATACTGTTCCTCGATTTCCTG 
CTCCTCTATCTTTTCTCATTCCAAGTGGCCCCAAAATGGTCGGATCGTTGCGATGGCGAACGTTCAAAAAACTTGGTGCTGTTCACAGAAACTAC 
GACGAATTCGTATCCACCAAAATGGTGGCGTACGAGCACGAAGGATACCCCCTAAAGTCGTTAGCCAACACTGCCGCCCGTTTGGCGCCAAACAC 
AACAGCCAGCTGATTTCATTTCGCGTTCCTACCGGTGAAGAAACTTTAATTGGCACAGAGCTGACCTAAACAAATAAATAAATAAACGCAACCCG 
GCGACTTAAAGACTCCACGACATCGGGCCGCAGTGCAAAGCCAGCGATGGATCCCACCATTTCAGTGTCCAAGGGCTGTTTTGTCTACAAAAATG 
GCGCTACGAGAGCAGGGAAAAAGGCGGCCAGCAAACGGAAGCGTCCTGCCGCAGAATCCAGTAGTCTTCTTGGCAAAGAGGTAGTGCAGCAGCCC 
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TTCTATGAGGAATATCGCAAGGCCTGGAATCAGATAAACGATCACATTGCCGACCTGCAGCACCGCAGCTATGCCCGCACTCTGGAGCAACTGGT 
GGACTTTGTGGTGGGGCAGGCAGAGCGCGACACCCCGGACGAGGTGCTGCCCACCGCCGCCTTGCTAACGGGCATTAATCAGCCGGATCATCTCA 
GCCAGTTCACGGCACTTACCCAGCGACTCCATGCCCAACGTGCTGCAATGGTATGTGTGCTGCAGTCCAGGGATTGCGCCACTCTCAAGGCGGCC 
GTTGAGACATTGGTTTTCGGGCTCGTAGAAGATAATGCAGAAGTGGAGCAAATGGAAGACGAGGACGAGGACGAGGATGGTGCCGAACGGGATCG 
CAAGCGGCTACGTCGCTCGCAGTGCACCATGAAGCAGTTGAAGTCTTGGTACACGAACAATTTTGACTCGGAACAGAAGCGCCGCCAGCTGGTCG 
TCATCCTTCCCGACTTTGAGTGCTTTAACGCAAGCGTGCTGCAGGATCTTATTCTAATTTTGAGCGCTCACTGCGGCTCGCTTCCATTCGTCCTG 
GTCCTGGGAGTGGCCACGGCCATGACAGCTGTCCATGGGACACTACCATACCACGTCAGCAGCAAGATTCGTCTAAGGGTGTTCCAGACTCAAGC 
TGCTCCCACGGGCCTTAATGAGGTGCTGGATAAAGTTCTGCTGTCGCCCAAGTACGCTTTCCACCTGTCGGGAAAGACATTCAAGTTCCTGACGC 
ACATCTTCCTATACTACGACTTCTCCATCCACGGCTTTATTCAGGGCTTCAAGTACTGCCTGATGGAACACTTCTTTGGTGGGAATGCCTTTGCT 
CTTTGCACAGACTACAGCAAAGCATTGGGACGCATAAAGCAGCTGACCCACGAGGATATGGAAACCATCAGAAGGCTCCCATCGTTTCGTCCGTA 
CGTCGAGCAGATCAATGATTGCAAACGCATCATAGCCGTGCTCACGGACGATGACTACCTAAAGAAGAAACTGCCGCAGTTGCTGCGCGACTGCC 
TGCTCCACTTCCTGCTTTTCCGCTGCTCTCTGGAGTTCCTTACAGAGCTGGTTGGTGATTTGCCGCGCTGTCCGCTGGGAAAGCTACGCCGGGAA 
CTGTACGTCAACTGTCTTAATCGTGCGATCATTTCGACGCCGGAGTACAAGGAGTGCCTGCAGATGCTAAGCTTTCTGTCCAAGGACGAATTTGT 
TGCTAAAGTGAATCGAGCCCTGGAAAGAACCGAACAGTTCCTGGTTGAAGAAATTGCTCCGCTGGAATTGGGCGAGGCCTGCACTGCTGTGCTGA 
GACCTAAGTTAGAAGCTATACGCCTTGCAGTGGACGAAGTGGTCAAGGCCACCATGGCGACAATTACAACGACATCGCCCAATGAAACCCGCCAG 
GCGACTGACCATTTGACTCCGGTGGCCTCGCGTCAAGAATTGAAAGATCAGCTCCTACAGCGTAGCAAGGAGGACAAGATGCGGCATCAGCTGAA 

C AC G C C C AC C AC C C AAT T C GG ACG AG CT CT AC AAAAG ACT C T GC AGC T T ATCG AAAC GC AG ATT GT AC AG GAT C AT CT C C GT GC G CT G C AGGAT G 
CACCACCTATTCACGAACTTTTTGTGTTCAGCGACATTGCTACCGTGCGGCGAAACATAATTGGTGCTCCGCGGGCTGCACTGCACACTGCCCTG 
AATAATCCACATTTTTACATGCAGTGCAAGTGCTGCGAGCTACAAGATCAATCACTGCTGGTTGGCACACTGCCCGATCTCTCCGTGGTTTACAA 
GCTGCACCTGGAGTGCGGTCGCATGATCAACCTTTTCGACTGGCTGCAGGCCTTTCGATCTGTTGTGAGCGACAGCGACCACGAGGAAGTGGCCC 
AGGAACAAATCGATCCCCAGATCCAGGCCCGATTCACGCGTGCCGTAGCCGAGTTGCAGTTTCTGGGGTACATTAAGATGTCGAAGCGCAAGACG 
GATCATGCCACTCGATTAACCTGGTAGCTCGCTCGGGCTATGAATAACCAAAATCCCTTAGGTATTTTTATCTTTTCTATTTATTTATATTGGGG 
CAAATTTCATGTGTGGGACGTTATGAATAAATAAAACATCAATCGATTTGTCTACTCTGTACGAAATGCATTAACTTTGTTTGCGAGTGAAGCGC 
GTTCTTGTATTTTTTAAGTAAAGCCCAGCAGCTGGCAAGGCAAGCGTACTAAATATCAATTACCAGTCTTCCCGACTAGCGGTTACCACGTTCCC 
TGTTCACATTAGTTTCAGAGTCTCGATGGCGGCAGCGATGCGTTCGATCTCTGCCCGCGACTCGCAGAGTTTTGTCTCGTTGGCCTCCTGCACCT 
CGGCGGGAACCTTAGTTGCGTAGTCGGCGGCCTGGATGGCCTGAGTTAGCTTGCCCACCGTCTGCACCAGCTGGTCGCTCTTCTTCTGCAGCTTG 
GCTATCTCCTTGTCCGCCTCCACGAGTCCCTTGAGCAGCAGGTGCACCTCGCACTGGCCCGTCACGGTGAGGATCGCACAGCCCTGCGGTGCGGG 

ACTGTCAAAGACCACATTGGAGCAGT 
(SEQ ID NO: 440) 

Start ATG: 997 

MDPTISVSKGCFVYKNGATRAGKKAASKRKRPAAESSSLLGKEVVQQPFYEEYRKAWNQINDHIADLQHRSYARTLEQLVDFVVGQAERDTPDE 

LPTAALLTGINQPDHLSQFTALTQRLHAQRAAMVCVLQSRDCATLKAAVETLVFGLVEDNAEVEQMEDEDEDEDGAERDRKRLRRSQCTMKQLKS 

WYTNNFDSEQKRRQLVVILPDFECFNASVLQDLILILSAHCGSLPFVLVLGVATAMTAVHGTLPYHVSSKIRLRVFQTQAAPTGLNEVLDKVLLS 

PKYAFHLSGKTFKFLTHIFLYYDFSIHGFIQGFKYCLMEHFFGGNAFALCTDYSKALGRIKQLTHEDMETIRRLPSFRPYVEQINDCKRIIAVLT 

DDDYLKKKLPQLLRDCLLHFLLFRCSLEFLTELVGDLPRCPLGKLRRELYVNCLNRAIISTPEYKECLQMLSFLSKDEFVAKVNRALERTEQFLV 

EEIAPLELGEACTAVLRPKLEAIRLAVDEVVKATMATITTTSPNETRQATDHLTPVASRQELKDQLLQRSKEDKMRHQLNTPTTQFGRALQKTLQ 

LIETQIVQDHLRALQDAPPIHELFVFSDIATVRRNIIGAPRAALHTALNNPHFYMQCKCCELQDQSLLVGTLPDLSVVYKLHLECGRMINLFDWL 

QAFRSVVSDSDHE E V AQE Q I D PQ I QARFT RAV AEL QFLG Y I KM S K RKT DH AT RL TW * 
(SEQ ID NO: 441) 

Name : Origin recognition complex subunit 3 
Classification: DNA_replication_f actor 
Gene Symbol: lat 
FlyBase ID: FBgn0005654 



Celera Sequence No. : 142000013384826 

ATTAATTTGTCCAGTTACGGAATCATATTCGGCTATTCCATGATAGCTTTTGCCAGTGTCTTTCTCATCACCGGCATCACGCTGACTGTGACGAA 
AAAGTGGGTGCGGCGAACTCCCGAGGATGAGGACATACTTACCAACTAGAGCAAAATAGTTTTTAAGCTTTGCCAAAGTGCCAGGAATTGTTAAC 
TCTCCTAGTTCATTCTGATAGTCGCCCGTCAAATGCTGTTCTTTGTCAGTTATGCGTTATGTAATATGTAAGTGATACGAATCGACGTTCGTCTT 
GCAGTTTGTGATAGCAAAACCTACTCTAAAAAAAAAACTAAAAACTACTCTAACTCTAAAAAAAAAACCTTACTCTATCTGCATATTACACGAAT 
CATTCACAAGTTGAAACTTTCTGTTTGAGACTTAAGTTATCGTACTTGATAAACAATACTTCATACAACTAAAACACTCCGTGCATTTTATTTTA 
ACGTTTTTATTTGAATTTTCATTTATTTTGAAAAGATAGTTTTTTACAATTTCTGTATGCTTTTCTGCAGGACTTTTACTTATCGTATACGATTG 
TTGGATGCTTTAACGATTTGCTTTATAAATGCATTATTCAAATGTTGTTTTGTAAATTAATAAATAAAAATAATTGTATACGTATATATATTTAT 
TTATATATATGTATATATATTGGTGTATATGTAAGATATATGGCAAAAATTTTACATTTTCTCAAAGATCACATTCCTCAATTTAGCCATAGACT 
TTTAGTCGAGAAAAATCTACGCAAATATTTGTGACATCCATCAAAAACTCAAATTAATTTTTGCACGCACACTAACACATCCAAAAAAACAGTGG 
GGGATTATTAATCGAACAAAAAATAATAAACAAAGTGGATACAAAAAATAAATGATAATCATCTCTCGCTTATTACTATGGTGGGGATGGAGGGC 
AAGTCTAGGCACTCTTGAAATTCTTTAGTTCCCTTCGTCCAGCGATCGGAGCAAGCAAACCCATTTCAAACCATTTCATAGCCGATTTGAATGCT 
TTTCACTTGATTCTACCGCTGGCCCGATTCGTTGTGAGTATGTGTGTGTGTGTATAAAGAGCGACGTTGCTACGTACTAAAAAATTTGCATAGTA 
TATGTTGCGTTGTTCTTGGTGACTAAAATAGTTATAAATTTAGTATAAGTTTAAATAAAAATCAATCACAGTTGTTTGGGCGGAAGTAGTTAGTT 
TTAAGTATTCGCATTTCTGGCCTAACTAAGTATTTGCAGCTAAGACTAAAGATCTACGATGTAGTTTAGTAAATTAGGTAGTGGTCGTATTTTAT 
ACTTGAGTATTGTTGCTTCCCATTTTGATAAGCTTCCTCTCTATTTCCGATGTTGTGTCATGTTGTGGTCTGTGCTGTGGTTACGTTACCAATTC 
CCTTAAGTGACTAGACTGGGTTCTGATTATTTGAATGTGTATTTTGAGGTTACGTTGCTCACGTCCTGTAGGCTTAAGTATTGACTCTGTGATAT 
TCAGTTGTTTCAGTGGCTCGTGTATTTTACCATCTGAAGTTGGTGGACCTGCTAGTTTGTGTTTTTCTAAAAAACTGTCTTCGCTAGTTAAATGC 
ATGTTAGTTCCTACTAAATAGCTATTGTTTCTGCTGTTCTGACAAATGGTTTCTCACGGTGGGTTTTCGTTTACATTTACCCTTTTGTGCCCGGC 
GAATCTTTACCCTTCGAAACACTGTGATCGAATTTATATAACGCTACACCACCCGATCTATGCAAATTTCCGAACGCAGAATTAGCTCTTCACCT 
GGGCACACTCATACATCAGCACTTCGAACAGCAATCGAACTAGAGAGAAGAGTGTCGTATTGGACCCCCAATGTCCACTTCCGTTCCGGTTTCCA 
ATTAGCTTGTCGAGTCTAGCTGGTCTTGTTGTGCTGCTGCTGCTGCTGCTGTTGCTTCATCATGACAGCGGGATCGATCACGTAGTTGTTGGCCT 
CCGCCAATTGTTGGACAGGTAAAGGCGGCAGTTTGTCCTGCGGATGACGGTGGCTTACGGTAGATCCTTTTCTGGCCTTGCTTAGGGTTCCTGCT 
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CCTGCAGAGGAGTTGGCAACTCCATTTGTGGTGGCACCTGAAGGTTGCGCACTGGAAGTGGGCATCTCCTGGCCAGTCTTCGGATTTATAACAAC 

CTTAGTCTGTTTCAGCAAGGAGCGATCGTTGCAGAGCGCCGAGATCAGCGGCAGATCTAGATGCTTGCTCTTCTCCCGAAGAAGCGCAATGTCTA 

GCGCTTGGGCCTGTGCCTGGGCATGAGACTGGGCTACCATTGAAGCCTGCAGCTGGGCCAAAATGCTGTTGTTTCCCCGGTTCAATCCCAGACCA 

AGACCCAAGTCAGGTCTCGGACGCTCACCTGCCCACTGGGCGCCGCTGGAGGTCTTTGTTAGCGGTGAGTTCGCATGATGCGGATTCAACGTTGG 

GGGTTTTCGTGGTGGAATCGGGCTCATCCTGCTAGGAATTGGCTTAGAGGGTACTGAAAATCGGAGAGATTAATAAGATATCGAAATGAATCTAA 

ATGTAAAGTTACTCACTTGGAGGTGGCTTTTCACTGGGCGCCGGCATGGGTGGTTTCTTTAGCCGTCTGTTTACATACGGTGGTGGTGGCGGCGG 

TGGCAGTCGTTTTCCCTTGGGCAGCAAATCTAGTGAGTTCAGAATATTGTCATCTGATCGCGATCGCGCAAACATCGAGGAAGACGAGGAGCCAG 

CCCCCGCCGCAGCAGCTGCTGCTAGAGCAGCCGGGTGAAGGGGAGGAGGCGGTGGAGGCTCATACGAGGAGCATAGGCTGGCCATCGAAGGAGAG 

TACAGGGGTGAGTATCCGGGTCCTGGAAGCTGCACATATGTATCCGATAGATTTTGGGTGGATGCATATCGTGAGTGGGTCGACAGAGTCAGTGG 

TAGGTAGGGGGTGCTTCTGTAAATCGCCGGCGACACCTGCATGGCCAGATATGGGTGGTGCGGTGGGGGTGGTGCTGCTTGGGCCATTGCCGCCA 

GTAAGGATCCCTGTTTGGCATCCAGATACTCTTGCTGGCTTACAGCTGGATCCTGGATGGGGTAAATTACGTAATCCACATCGCTGTAAAGCTGC 

TGCTGGTAGAGCTCCAATTGTTTTCTAGCCATTTCCCTCTCGTAAAGGTGACTCGCCAAATGGGGGTGATTCGGTGGGGGCGGTGGTGGAGGCTG 

CCGCGGCAAACTGGGCGGTGGTGGTGCCAGGAACTGCCGAGGTTTAATAACCGGTAGTAGTACACATGACTGCTGATGATGCAGTGATGCCGTGC 

TCTTGTGCAGACCGTAAGGAACTATTCCAATGGCTGCCGATCCAGAATGTCGGTGGTGTGACGGCACGTGATGATGGTGCAGTGAAGCCTGGGAT 

CCGCCGGCATAGTTGGAGTTATGCAGCGATTGCTGACTCAACTGATGGCTGCAAACCGATCCATTGGAGGAGGCAACTGATGAACAATGGTGCGG 

TGCCGCGTAACTTGGCTTGGCCCCATCGGGAATCAGGCTGGTGTGGGCTTTCAGAATACTAATCTGTGGCCTGGTGGTGATAAATCTGGCTGTAA 

CCTCCTCGGACTTTAAAGAACTCATCGAGCCGGCAGCACTGGGCGCAGGATGTGCCGACGAACTGACAGATATCAGGGATGTGGGTCTAGGCTTG 

CTAGAGATGGAATATGGCGGTGGATCTCTTCGTCGAATAGCCGGCGGTGGGGCTGGAATTGGTGCTGGCGGTACCGGTGGACTACATGGCGGTGA 

ATCTGGTTTGGGAGCTACCACAGCTATAGGTTTAGGGATAGGTTTGGCGATAGGTGGGCCACAGAGACCAGTTTTCTCATGCCTAGCACTGTAGG 

GTGGTGGTGGTGCTGGTGGTTGGTCTACCTCCTCTTCTCCTTGATCTCCTAGTGGTAACGTCACGTAATCACAGTCCGAGTGAAGGCTACTAGAT 

ACTGTTTTCCGACTGGGGTAGGTCCCCCTGCCCACAATCAAATTGGTTAACTCTTCTGCGGAGAGCAAGGAATGTTTGTTGTCTGTTGGATCGCT 

GCCATCGCCTCTAAAGGAACTCCCTGTGCTGACGTTAGAATCCGATCGCGGTCTGAAGTCTTCTTTCTTTTTTCCATCTACGGAATCAGAATGTT 

GCACGTGGGTTTCTTCTAGTTTTTGGGGCTGGAACATCCCGTAGTGCGAACTCTTTTCCGACTCGGCGATCTCCGATGATTGTCCAGTTAGCTCA 

CTGCTGTGCATGGTGTAAACGCCTGAGGTTTCCGTGGGTGGAGCTCCGTGCGCCAAAGTATAAACCCCAGATGCGCCCGATACACTCTCCGTTTC 

TTCTCTTGCCGATGTGGAGAGAAAATCGTCGGGACTCGTTTCGCTCAAGGCCGAGTTTTGAGCGGTGTGACTAAAGCCAAGCTCCAAACTAGATG 

AAGTGGAGTGCACAGGTATGGGAGCACCTGAAGAGGAAGCGCCTGCTCCATTTACAGGAGATGTTACCACCACAGTGCTACAAGTGGATGAGCAC 

T GAG AGC CG AT GT T AT GC T GC G AAC T GGGT G AATC AGT GGC GGT AG ATGAAT CATTGGCAGTTGTTCC C AT GG AATT T C T C T G AAC AG AGG AGT T 

CGTGACGATCGTGGCCCCGTTAGAACTCGCGTCTTGTGGCCTCACACTCAAGGCAGCCAGTTGTTCTTCCAAATCCTTCAGAGACATCTGACCCA 

C GCT C GAAGT TTGTCTGCT C AC AC T GGGT C T GT CC AAAAG AT G AGCC AAGGCC AAAC T T T C CGT TG ACG GAG C AG CTAAGGGT GCT GG T GG T GT A 

TTGATCATGATCTCCAGCTCGTCTTCGGAGTGCACTCGATCACTTACTATCCCAGAGGTTGTGTTGGAGCTGGTACTCGAGATAACCGATATTCG 

CTGCTCGTTTTTACTTTTGTAGGGCAAGAGCAGACTTCTGGAACAAGCGTAGCTGGCATGTAGCCTTTGTGATTCGGCAGCCTCCTCCTCCTCGC 

GTTTGGATACCTCCTTTAGCCTGGCAGCCAGTTTCATGCTCCACTGATGTGTATCCTTGCAGAGGGTCAATAGGAGTTTATTTTTCTCATCGGAT 

GCGGCGTACAGGGTGATTCGGCTCTCTCCAGATCGTATTTCAAACTTTTTCCGCTCGAAGGAGAGCTTCGTGATATTCGGCCACAGAAAGGTTGT 

GGTTTCGGGATTGGAAGAGCTGTTGGTAGACTCTCCTCCCAGAATCTTGATTCCCTTGGCATAGACCACAAACCAGGCACTACCAGATCCCTGCT 

CGCTTTTGGCCAATCGCATCCTGTAGGTATGGGCATTGATGGTTTCATGAAGACTGCATGCCTGTTGAATATATAGCAGCTCCGCCTCTGCCGTG 

GCCATGCCCAAATGCTCCCGATGACAGGCTCTCATTGCACTTCTCGCCCAGGGAGTGTGCAGTCCACTGGGTAGGTAATCCTCGATCCGGTAGTA 

ATCTGCTCCATTCGGTGGCAGGCTGGGTGGTAATGGTATTGTTTCTCTCTTTGATGTGGACGCCACATATGTCGAGAGTCTAAGCATCCTCTCAT 

TAGCACGTTTGCTGATCTTGGGTAATGTAGTGGTGGCACTTAGACCTCTTCCTCCGTTAGAAGGAGATGCCGATGTCTCCTCTCCTGAATCATCC 

TTGGAATTACTCGTGCCCGGAGGAGCATCACCCAGGTCCGCCTGAAGAGCAAGTCCAGCAAGGAAGACCAGTGCCTGCTCCGCCTGTTCTCTCGG 

CAAATCGCGCTGCAAGATGTTGTGGCGCAGCTGCAGGTAGTAATTATGGCGGGACGTCTCATCTTTCAGCATGAACGGGCTCTCGATGTAGAACT 

GCACGCGAAAGTGTAGCTCCAGGAGCGGTCGTCCGTTGGCATCCAGCCCCTAAATAAAGACATGTCGCATTACTTAACACATCATGATAATTACT 

TTTTTAATAAATATTTACGTGGGTGTGCGATGATCGCCAGCTCTTTGGGCCATATTTTGAGAGCTTGCTCTCGGGATCTGCGAACATGTACTCGC 

CATCTGCAACATACGAAACAAGAGATTGGTTATAAAAATGGTTTGATTGATCAGATTTTGGGTCTGCTATTCCATCGAACATTCTTTTTTTAATT 

TCAATTTTTTTAAAAGTTAAAACGGAGGTGCATTCATTAAGGATATATAAATAACCAAACAGAGACCTACAACTGCTTTCATTGAAATATCAAAT 

TACTTGTCAAGTCGGATGCCATGCCTTGGCTGAATTTCCGGTTGGGCTAAGGAACTTTTTGGCGCGCAGCCGCCAAGGTCGCTAATCAAAATTTT 

AAATGAACTTTCGGCTATGCAAGAGGGAGCGCTCCTCTTGCGTCCCAAATCGGGCTCAAATCGGATCGAGACCCCGACCAGACTTCTGTCAGTTT 

GT C AGG CAAT AAAC AAG C AAC C C AT T G AC AT C AAC AAG AGT C AG AGAC T C C AGC AG AT G AAG AC G AT GAT GAT GAC GAC AC GT C AG GT T GT C AG A 

GGCCCATTGCCTGATGCTCGAAGACGAGACCGGTACCGGACCGCGCTACATAGCTCCGACCCGAGCCAAGCAGAAGGGCCCAAGGGCATGTCGGC 

GGTGCTCGAGAAGGCAGAGAAAAAACAACAGCGAAATCAACACACACTTCATCGAACGAAAAATTGGAATTCCTGCATACAAAATGAAGAGCAGC 

AAGATGAAAAATAATAATTATGAAAAAAGCAGGCAGGCAGGTTGAAGGAGGCCATGGAATAGAGGCACCTCAGCATCGAATCCACAACACAGCAG 

CAGCAGCTACAATATCATAATAATCATCAGCGGCCGAATAAGTGCCTATCAATGACGACGTCGTCGACGCAAAGCTGGAATGGAAGTGGCAACAA 

GTGGATGGCCAGGGGAGTAGAGCTCTTCGAGTTTGGTTCACTCAAAAAAATGATTAGGGAATACTAGCAAAATATTTTGTTGCCAAACCTAATCA 

CCCCCCACAAGTTTGTTCTGGGAAAGTTTTGCCGTAGCATAGAGCGTTGTTCAGTTTATTTGGATTTACGAGAAAACCCATTTCTTTTGCACAGT 

GTAGAAACGGAGTCGAAGCGAGAGCGAGTGTTGCATACCAAGCCCAAAACGCAGCATCGAACAAAACCCGCAGAGAAAGAGGAAAAAACAGGTAG 

AAAGACAGGTAAGCAGCGCTGGAAAAAAGAGGCAGCTGAAACCCAAGTCGTTCGGTTTTTAGCCGCAGTTTTTAGACTCCAGCCTCAAAACTCAA 

AGCGATTTCTTTTCGCATTCAAATTTCACTCCACTCCAGAAGTTTTCGGCCAAAGGCTCTCCAATGGAATTCGTTCTTGTGCACGTATCGGAATC 

CTGGTATTCTACGAGCTGCGGGCAGGTGTTAGGGCCAAAACAGTCGGATCTCGGGAAGCCACTCGTCTACGGCGGCCTGATGTGATGGATCCTCA 

CGCAATCCCAGCGTTCCATTCTACCTTGCCCGCTCCCAACGGCAAAGACCTAATAATTAACATATTTTCCATCGCATTCCTCAACGCTCACAATT 

AACACGCACGCACTCGGTTTTTGGAGAAGGGTTTCGAGGGGCAGCGAAATAAATCTTCAATTTGAAACACATGCAACACAAATTACAGCCTCAAA 

AGCAAAGGCAGCTGGCCAGGCAGAGGCAGAGGCGGGGGCAGGGGCAGAGGCAAGGCTCATCAAGCAGAGGAATTCCTGAGGTAATGGCAGGAGAA 

GGAGGAGGAGGTGGGGGGCAGGAGATGGAGTCTTAAGTTGAAGATGGAGAAAGAGAATGAGACGCAGTCAGGGACAGCTGCGCCCAATCCCATGC 

CATGGCTTGGAATTCTTCAGATTCAGATTTATTTTCAAGAAGGTCGCAGTCCGAGCTGAAGTGGCCATAATAACCAAAAAAAAAAAAAAAAATTT 

CGCAGGCGAAACTAAGCGAAAAACTACTCCTGGCTTTAACGTAAGACTAAGACTTGTGAAATATTAATATATTAATTAATAACAATAGAATGCTA 

GAGGACAGACCACCATTAGAGAATATTAACTTCAATATATGTTAAAAGCCCACAAAGTCCAATAAGTTTATAAAAACAAAATATAAAATATTTTT 

AAATTGCCAACTTCGGATATACCCGGTAAGTTGGATGTGGAGGCTACAGGGTATCCTAAGAATGGGCAGCAACATAAATTTATGTGTGTGTCTGT 

GCATCGAAACTGGCTAAGGAAAAGAAGAACGAAAGACGAAAAAAAAAAACGTAAAAACCAAAAGAATTTTAATTTCCGATGTTATCAGATGCACG 

CAGCCCAAAGAGGAGAGGCGGCAGCTCCAGGGGATCAGGGGATGGACAGCCGATGCAAGGCCGAGTGTTCACGTCCAAAGATTGAAGACGAAGAC 

GAAACAGCCAACGCCGTATAATAATGATTATGAGCCATTAACATTAAATGGAAAAGAAAATGCTGGTCTACAGATTTCAGCTACAAAGGGGGTGG 

GGCTCCACAAAAAAGGGTGGGGAAAATCATGAGCCCGGAGCCAAGGGGGAGCGAAAAATGTGCTTTTCTGGCCCGGAATGCAGCTGAGGAGTGGA 

ATTCAAGGAATGCGACAGCCTAAAAATAACACCAAAAGGGGAGAATGTCGCTTGGTAGATGACCACTGGATGACAATGTTGCCAAATGGCCATCG 

GCCCAGTGCCACTTAAAGTGCTGCGATCTCAAGGAGCAGCTTAAATTTGCATGAAAATGCAATTTATTGACACTGGGCGGCCATAAAATGTAAAT 
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TCATTTCCATCCCCGTTAAAAGCAAAAGAGCAGACCCAACTTAATTGTGACCCTCTTTTCGTTCATTTTGAGGGAAATATCGAAAGACCAAAACA 
ACGAACTCGGAAACATGGAAGGACCAAAAGTCTCAGATTAGCTATCCGAAGTATTCCAGGATAAGTCAGCAAGCATAAATCTGGCAAATGCAATA 
GCAAATTGAATTGCCAATAAGAATTTCTTCCTTTAGTTTAAATACTTTTTGATTGATTTTTCGACTAAATAATATACTTAAAGTTTGCTAAATAT 
ATGTACTCTGGATGAGGGATTGGGTCTCGGAAACATACCAAGTCCAGGCCAGAAGAGCAACAACAATATATAGCCGCATACCAATCACAGATACA 
CCGTCCACCGACATTGGCAGGAATCCTTGGCCACAAACCCGAGCACCAGCGCACACACGAAAGCAGAATGCAAACTGCCAATTTCATAGTTGCGC 
CTGAGGAATCCTCTGCGAGGCGAGGACGTTGGCATGCCCAGCCCGATTCTCATATTCAGTTAAAGCTGCATCTACAGCTTCGGCAGCGACTCCGG 
CTTCAAGCTGCATACCAAATTCTTAGCTGAAGATGTTGTGGCAACTTCAACGCTCCCCTGCCTCCTGCCGAGAAATTGATGCCAAGCCAACCAGC 
TTCGAAAATTTACGTGTTGTGTTCTTGGGCTTTTGCTCTAACATCGGATCACAAAATAAAGGGTATACCGGAATAGTCATTTGGGTTAAATAACA 
ACAATCTCCCTAGGCGATTCGATGTTTTAATGTACATTACTTAAACAGGCGACATTTGGCGTGGCGTTTGAAATAAACGATAAGAATCTTTCCCA 
TACAATTTCTTTTGTACAATAATTAATATACTTTTCCCTTTAAGAGTAAGAAAACTTCGATTACCAGAAAATCAAATTCCTCTCTCCTTTCTTGT 
TTGCTTCTCCCCGGACCCCTCGTTTCCCCGTTTTTGGTTGCTTGTGGGTCTTTGGAGAACACATATATCTGTGGCCAATGCCAAAGACGTTGTCT 
GCTTGTTCCCCAGGTTTCAGCTCAGACTCCGGAGCTCTGGGCTCCTCCATCCTCAATCCTCCATCTTCCATCCTCGAAACTCCATCCTCCGTGGC 
TGTCGTTCGATTCGTTTGTGTTAATGCAACGCGCGACGGCTACCTGAATTACTGAACACGCGTCTTGTGTCTGCTCCCCTGCATCTGCAGCTCAA 
CGGCTGCTCCTTTGCTGAGCCCCTCTTCTGTATTCCTGAAATTGTTTGTTCGCCATGAAGGCGACGCGTTCCCTGCCTTTTTGTTCGTTTTATGT 
GCCATTTTTTGTGTTGCTCCGCTGTGTTGGTATTTTTATGTGTTTTGGCAGATCGCTTTGGCCAGAAAGGAGGATGGAGGCTGAGTCCTCTGCCG 
CTGCTGCTGCTGCTGCTTCTGCTGCTGATGATGATGAGCAGCTTTTAGAGCGGACTGGTTTATGTTTATTAGTTGTATTTTTCAGGCCTTATCGC 
TAGCATTTGGCTACTCCACAACTCGGAAACAAACTGGACGCAACTGACTTGAAATCCCCCCCCAGAACCGAAATGGACTTACCTATGAGCACAGC 
CAGACCGAAGAGTTCGGTGTCCAGCATCCCTTGCGTGGCAAAGTGCAGACATGTCTGCGTGTAAACCTCTCGAACTCGACTCTTGGCATCGACTA 
GGAAGTACAAGGTTTTCGGCTGCTGCTGCCCCAGAAGTCTGCAAATTCCACACAAAATTAAGCAACACCATTTTTGGGCGGCCTTTCCACCTTAA 
ACACTCACCTCAGGGCCAGGAATCGCGAGCCTGGTGATAGTTGCTCGGCTGAGGACGCGCATACTTCCAAAGGCGCGCTGACGGTGCAAAATGCT 
CGCATGCTGCATCATTTTCCGTATACAGCGCGCGCGTTCGAAATACGTTGCTATTTAAAGTAAAAGAATAACAAAGATTAATATCTGTATAGATG 
AATAAAGAAACTAAACTACATAGCAGGATTCGATTTTGTAAATTTTGCAAGTGAGTGTGTGGTGGTAGGCACATTATACAAATTGAAACGCATTC 
AATTACGTTTTATGTACGTATTTACGTATTACGTATATGTAAGAAAACTCATTTTAAAAGATTTTTTTTTACAATTCTTGCAAATAGAAATTGTC 
GCATTCAACGCGTCATTTCTAACCGCATTTCTGGCTCATCAGATGCGATAAAGCTGCCGCTCGATCGGATCGGTTAGGCACCATTCTTTCAGGCA 
AATTGATGCCGCTGGAGCCGCCCCAGACGAAGTTGGAGACGAAAGGCGGAAGATGGCAATGGCTATGGCGATGGCGATGGCGCCCGCCTCTTTTG 

GGC T C AC C GAC T GC C GAAC AAT T TCACC AATC AAT AACG AAAG AC T C AGG CG AC AAACT T G AT T AGC GGGC AGGG GAC T TG GAC T GGC AG AAAAA 
CTGAATAGTGAATAGTGAATACCTGGGGACTCTTAGCGGTGCGTGCGTGGCACGCAAGCTCCCTCCCCCTCTTGATGCGTTGCGCATAAAATTAT 
TAAAATTCAGTTAAAGAAAGTTCCAAATGCCTGAGCTAAGAGGCAGACAGACACCG AAACT GGACTCGTGATGAAAGAAAAGT ATT TTGTGC ATA 

AAT T AAAAT C AAAT C G AAT C AAAT C AAC T T T T T T G AAT AGAG CGC C GGC AGC GGCG AC AAC AGT T TC T AC C AAC C C GGC GG AG AC GC AT AAAT TT 
CAATATTGGCGCATTTTCTGTGTCTAATAAATTTGAACGTTCAATAAATTTATTCAGAGTCAAATCGAGAGTCGAAGTCCAACTCGAAAGCCGAT 
AGAGAGGGTTAGTGGCATCTGCATGCCACACCATAGATAGACCGAGAGACGGATGCTGAAAGAACCGCCCTGACTTATAAATATACGATCGGACC 

GAT C GG AGGT AT GCC TC C GC C AC C AC C G CCT C C C AAC CG AAAAGC GAT CAT C AATC AAAGAG C AC AT T T GAC AT AAAT TAG AAAC C AAAG AAT AT 
CTGGGAACACCATAGCTTCCTCCCCTTGAAAAGACCGCTCCAAAAAAGCTTTTCAAAAGCAATTATATTTAAGCTGATTTCCATGCGCACCTTGG 
TCTCGGCTTAGGTGATCTGGAGAACAGGCTCAGACACATACTTTATGCCCGATCACAGACGATTACGAAACGAGAATCGAAATTCCAGGTCTCTA 
ATGTGCTGCGATTGACTTATTACCGAGGAATTTATACGGATAAACATAAAAACTACATTGTTGAGCTCTCAAGAGGAACTGGATTAAGCGATCAC 
TCACATATCTCACATGTTGAGTGACTGGCCAATAAAGAGTTGAAGTATTTTTTGAGGGCTTGGTCACATTTCTTGCACAATATTAGTAAGTGCTT 
ACTATAAATATTAAATCAAATGATTTTTACCATTAGCAAGGAAATTATCCTTGTCAATAAGGCATAATGATATGTAATTCAGTGAGACAATAAGA 
GGAAACCGTAAATTTTCTTTGACAAACCCTCTTATCGATTCCGATAACAATAAAAGGAAGCTAGTCGGCAGATTGCACGAAATGAATAGAAGCTC 
AGAAGGAAACGAACGCGGCTTTAATGCAAAAATTGCCATTAATGGCAAGAAGGAGAACGCTGTCTGATTGGCGAGATTTCTGCGGAAGTAATACC 

T C T GT C AT AAAGT TAT TAG AC GAT AAAC C T T C AC T AT AGAC T AC T T CG C AG AAAAC T AC T TGC AAT C AC T T C T T TT C AAT T C C AAGC C CC C AC AA 
TGCCGGTCATTATGTAATCGACAAATCGAATATCCAAAAATACCGAAAACATATGTGCGATCGCGATTTCTTTGAGGATGAGTCAACGAAAGCGT 
TTTAATTAAAAATTCAAACAAATTCAAGGTGTTTAAAAATTTCAATCATTGAAAATCATCCGCATTCCATTGGCT AAC AAGAGCGGAAAGC AAAG 
AAAGCCGTGGCAAAGCTAATTATTATAGCCAACCTACGAGTTGGGGGTACAAGGAGAGAGAGAAATATTGTCTTCTCAAAAATAAAAGAAGTGTA 
AGATTTATAATATTTCGGTATGCAACGAGTATCCAGGATTTCACCGTGCACTGCTGGCAATTAAAGTCGAGGCAAAGAAGCGAGCCAAGTGGAGC 
TGAGCGACCGCAAGAGACACGCTAATC AGC AATT AGC AAAT GGCAAT AAAT AT TTTGTTTATGATTAAGCGAGGGGCGAAGGTGTCCAGTGGCGG 
GAGGGGGGTTGGCTGTTAGGAGTTTCGCGAGTGGGTCGGCTTTCCGAGAGAGCTTGGAATGCAGACACTGATGGAGCACAAGAAGGCAGAGGGGG 
CAGATGGGGGCTGGAGCAATAAAATCTCACGGGCTCTGCCGCGATCAAAGTCAATTAAATGGCGCAAGCGCCAGCAGTGAATGCTTTTCTGCTGG 

CT AAG C G AAT GCC AAAGG GGT AAAGAG AGC AAAAAC C AGAG AAC TAAGT T GGCG AC T G AGC AAC AG AG AC GT T C C G CAT AC GC C AC GT T GT GC T G 
CGACAGCAACTAAGCACAGAAAGAAAGCACTGTAAAAAGTTTAGCATTTAATATACGAATTTTATTTCATTTAAAAATGTATTATATAAAAAAAA 
AACCCATTCCTAATTATACAGTCATCTAAACTCGTTGTTGTTGTTGATAAGTTCCATAAGAAAATTAAGAACTTTAATATTTATTTCGGTTTTCT 
TTTCTCCCTGTGAACGACCGACTACTATGTCAACTGCACATGCCAATGCACTGAGCTCATTCGGGGCGTGTGTTTGTCAAGCGAGTTCCAACAAT 
CGCCAAAGTCGAACGATCGATTCGATCGATCAGACCCGAGACCCCAGACCTCACAGAGGTCCAACTACGAACTACGTATCGTGGGGCTGGCGGGG 
CAGTCGGAGGTGCCGGAACCAAGACCTATAGAGTCGAGACTACGATAAACGTTTCAACCGCATGTGCAGGCCCATTGAAATGCTGAACGAATCAG 
AAATTGAAAATATCCATATGGATTTCAGCGCGGAATGCGAGTGCGCATTGAGCAGATCAATAAACATGGCAGCGCAAACGGTCGTAAGTTCGAGC 
AGCGAGGAATGCGAATCAAATTGGCTCTAAAGAGGTCTCAGAGATCACCAAGCTGGATAGCTAACTCGGGGATACCACATGTAACTGTCTTTAAT 
CACCTAAAAAAATATATTAAAGAATTTAATATATGTGTCCATTTATCCACTTAGGCTCTGATGGGAAGTAGCCAAAATATTTTATGTGGATCTCA 
TCATATCAAGCATAACATCGAAAAGCCCTCAGTTCTAAACTTGAATCTGGATTTAGCATGTAGCCCTCACTATCTATCTACCTCTTTATCTTTCT 

T T AGC G AAAG C GT AT G AG AAGC C C GT T T AAAC C C AC AC AC T C C ACGT AAAG AT C C C T C AAC AT T C T T C G C AG T C T T G AAT T C T TT GAG AAAAT GG 
CAAGGCGATCTTTTGAGTCTGCGGTAAGTGCTCGCGTACCGCGCGGCTCTCAAGCTCGATTCGCATTTTATTGAAAGTGTTTCGAGCGCTCTCAG 
CTCGCCAACTAATGAATGACGATCGGATCGGATCGGCGAGCGGTAACTTTTGACGTGGAATATTTTGCTCATGAGTTGGACGGCCGAGACCGGAA 
TGGACTTAAAGCCGGAGCGGGTTCTACGTGAGTCTGGGCCTTTTAATTACCGATAGAACGGCAACTGGCGTCAATGGGACGGCCAGAAATAGTTC 
AGTTCGAATCTGTTTGCTGGCATTTTCACACGCTGGCCAACGCGGCGGATGCGCGTCATTCAAAACGGGGCCCACAAGTGGTATTCCGAAAGGGG 
TGTGGCTCCCGGGTGTTGTCGCGCAAATGAGTTTAGCCTCAGCCTCTGCCTCTGGCTCAGCCGATTTCTGGCTGATCAGGGGTAAAAAGTCCCCG 
AAATGCCCATTTTCGGAACGATTGTTGAAATTGCCTAATGACGCAAATTATGTAAAGAGCCATCAATTTGGGTGTACCTCATAAAGCACACGGTG 
TTTTAAATTCGCGAACTCCAGACACCTTTAAAGCATTATGTGGATAATATATATTGTTTTGAATTGAAATCATCGCAAGTAATTATGAAAGCAGT 
TTTCTGTATGCCGGGCTTAGTGTGCCATCAAATCGCCAAATAACTCGTCGTCTCTTTGCCAAACTGCGAATATCGAATAGGAGAAAACAGAAATC 
AGGGAACAAGCGACT GAG AAAT AAAAC AGC AAAGCGGCAAATGCCG AAT TCCGAGCTGCGGAATTCCACTTGTTGTGCCAGTTATGGCCAAACAA 
TTTGATGTCTTTGAAATGCGTGAAATGTCCCCGCCGACCGCCGATGATAATAATGAAAATGGTCGCTTGCTCGGTTGGCTGGATAGCTGTGAGGA 
GCAGCCAGGGCGGAGTGGGCCAAAAGAAAGCAGCCCACACGTCGTCGCTCGGCTTCAGTTGGCTAGGCACCACAAAATCGCACCACATCGCACCA 
CTCTCCTCCACTGCGCACCACACCCCGCAGACGCGAAATGCCCAACTGGGCGAACGAAGGTGGTAACGGTGGCGGAGGCGATGGCCTCCACAATG 
AGAGCAGCAAACAGCAAGATACAAAAACAAAAAAGAAGCCGCAACGTGTGCCGCATGCCAAATCGGAGGCGTGCGAAACTATGCTGCCATAATAC 
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CGTAGAAAAATTAGCAGTCCATACAGGGTATCTTGGGTTAAATGAATGCGCTATAATACGAGTAACCAATAATTCTAAAGTTTAAGGTATTTTGA 

TTATGGCTAGAATAATTCTTTGACCAATTCTAAGTAAAGAGGCTTTTAAGTTTTTGCCTATCTGGCAATATGTTGTTGAAACAATATTTTTTTAC 

GAGCATAGAGATAAGTACATTTTTTTAATCCTCGAAGGTCGTTCATTAGTGTATCCTCTATATCTCTGAGCCAGCCTCGACTCCGATTGTCATTC 

GAAATTTGTTTAATTTGTTGTACATTTTAATTGCCTGAATTGTTCGAGGCAATGAGCAACCAGAGTTTTCAATGTTCGCACACGCATATCAACTG 

CGTGCAGGAATTTCAGTTTACGAAATTAATATTTTGAGTTTGTATGATCATTGCTGTTGCTGTTGTCAGCTTCAATGCAAAAGCCCATTGCCTGA 

GGTCAAGTGTGATGTGGAGATCAAGAGATACGCACGGAGATGGAGATGTAGACGTCAGTGCTCAATCGAAAAAACCACAAGCGAACTAAAACTAC 

GATCCGGGGAATACATGTGTACTTGCCGAGATATAGATACGTCTGTCCGCAGATACAGTTACCCATATGTATATGATTGCCTGGTGGCAAAATCG 

TTTAACGATTAGCGTTTAACCGCTTAAAGCTTATCGACTGATACCCAAAGCGTGTGTTTTCACCCGTTTGTTTATCTTTCGCTGATTTACATTCA 

AGATGTCTGATTGCCAAGCGGTTTCTGATTTCAGCAATTGTTCCAGGGATCCCGATGGACGGTTGAAATTCCAAACCCGGGAATAACAATAGAAC 

TCTGTGGAGCTTCCATTTTCACTTAATTGCATCGCCGTCGATCGTTCTGACAACCGTTTAAGTCGCCTATTAGCCATAGAACCTATCAAATTGAC 

TAGCTCACTTGGAAATTGGATCCCGAGATAAGTGAAACCCTGTGCAAAGATACTTTAGGGCCGCATAGAAAATGTCAAAAAAAATTAATTAAATG 

TGACCGGTATTTAAAAAGTCGGTCAGGCCTACTATATAACTACATATGTATGCACATGTGAATGTACATATATATTTGACACAATAGTACTCTAA 

ACAGCTTCCCCATTTCAATATACCCTCCGTAGAGTGTCAAGTTTGACTGACATTTGATATTCCATGGCCCGAAATGGGTTAATTGCACAATTCGC 

TGAATTTTTGACTCTTAGCGGGACGCAAAAAAAAAAATTTAAAGAAAAAAGCTGGCGACAACAACGAAATGTTAAACAATCGTTGGACGTTGGCG 

ATTTTATGGTTGACAGATAAGAAATTTGCTGGCATTTGTTTTGTTTGCCGTTTTCAGGGAGTCATTAAAATTCCGGGGAAATGGCCCCATATGGG 

TACACCACTCTATATCTGCACGTGTATCTGTCCTTTTCTTACCTTAAGCCGCTGCAGAAGTCGGCATTGTTAATGCTGTATCTGTATCTGCTTCT 

GTTTCGGTTTCGCGTAATCCACAATAACTGGCTTTTTAATTTTGGTTTGGTAGTTTTGTTTAACTAAAGCTCACTCTTCGTGCACTCTTAAAACC 

GATCTATTGATTTGGCTATCGATAAGTGCCTAACAGTAAAATTTTCGTTTCGTTGTTGTTACTTTCTGACCTCTTCCACTTCCACGACGCGCGCT 

GCGACTGCGCAGTCAACAGCGATCGGGTCGCATTCCAGCGTCGCACATACTTTAATCAAAACTCACTAACACCAGAACAATTGCACACGCGGGCG 

AGCGCAACTGAGAGTATCTGTGTGCGTGGAGTCGTTGTTGTTGCGCGAAAGAATGCCGTTTGCTGGGTTGTTGTTTTTGCTGTTGCTGACTGCTT 

TATTTGTTGTAGTTGTAGTTTCCAGTGGGCAGTTGTTGTTGCGGCTGAATTTTTTCTTTTTGACCAGCACCAACTTTTCGGAGTTGACTTCAGCT 

TTTTTACACTACTAGTCGTAGATATTTTCGTGGTTTTAAGGCGCGTTTTCCCAAAATCAGAACTAGTTGCACCAACACCAACAAAAATCACTTTT 

GTAATTGAGGCGAGTGCGAAAACGAAGCGCGCTGTTTTTGTTTCAGTGAAAACCGAACTTCACTTTAAGCAACTGAATCTGTGTAGCTCTGTGTG 

TGTCTGGGCGATCTGGTGTGCGTGCGGTGTGCATTTAAGATCTGAAGAACAACTTTTCCGAATGTTAGGTCTCGAATAAATTCATTAAACCGAAT 

TTATATTTCCCAGTGGAAAGCACATGCTTTGCTGTCCCGACGATCTGTTCGGTTACCGTTGACGTTTAGCGATATACTGATTTTGCCCCTGCTCA 

GCGAAACCGAACACAACGAAGATCGAGTCGAGATTTCGTTAAAACTTAACGAAGCCGCGTTTAACGAAAAGAGAGCCCAAGAGAGAATTCGAATT 

CGAAAGAAGCGCACACAGCGATACACAAACCCGATGGCAACAACAACAACGGCTACGAAAATGTAAATAGCAATAAGCAATAAAAGAAAAACAAA 

CTTCAAAGGAGAATCCCAGTAGCCAACTACATAGTGTGTGCGTCCGTGTGAGTAGTGGAATTCCAATGGCTTATCTCGCGAGACGACGATTCCGA 

TTCTCAAGGGCTTAAAGGATACCCTGTTAGGATAGCAGATTTAAGTAAAATGCATATGAGAATTTTAACAAAGGTACCTTTCTTTCTAATTCTGC 

TTTTTATCACATGCTATCTTGTAGTTTACATTGTATATTACAATTACATTATGAGTTCTATCTTTTGGCATTCACTTTTTGCCAAACTATGGCGG 

AAGCCTATTGGATTCAAATCTAGCTTTCAGTATCCCTGCGCTAGCCATCGATGCGACTATCTTATAAATAAACTCTCTGGGCAAAACTCCAGATA 

CAGATGCACTGGACGAGGGAGAATCGGAATCGGAACTTGAGGGTCTGAAGACTCTGCCTCTATCGGCGTCCTCTGCTGCTACGATATCGTCACTT 

GGGCGCATTCCTCAGCATAATGTGAGCTTGATTTGCTGGATTCTTTTTGCTCCCTCTTCCCTCTTTTTCTCTTCTTTAGATTTCCTCCTTTTCAT 

TTCCCCTCCTCCACCTCTTCAACCATCATTTGCTTCAAATATGCGGCATACACACGCCCCACGAAAGGGCGGGGATCAGTGTAAGCCAAGTAAGG 

TTTTAACCAGTTCGTCCAAGGGGCTACTTAGAAAAATCCACAGAAAACAAAATTTAGATTATATGGAAAAAAGTCTAAATGATATAAAAATCGTA 

ACTA 

(SEQ ID NO: 442) 

Exon: 17909.. 17048 
Exon: 10785.. 10649 
Exon: 10583. .10438 
Exon: 6083.. 6004 
Exon: 5939.. 2582 
Exon: 2523.. 1001 

Start ATG: 10740 (Reverse strand: CAT) 
Transcript No. : CT13612 

AACAGATCGTCGGGACAGCAAAGCATGTGCTTTCCACTGGGAAATATAAATTCGGTTTAATGAATTTATTCGAGACCTAACATTCGGAAAAGTTG 
TTCTTCAGATCTTAAATGCACACCGCACGCACACCAGATCGCCCAGACACACACAGAGCTACACAGATTCAGTTGCTTAAAGTGAAGTTCGGTTT 
TCACTGAAACAAAAACAGCGCGCTTCGTTTTCGCACTCGCCTCAATTACAAAAGTGATTTTTGTTGGTGTTGGTGCAACTAGTTCTGATTTTGGG 
AAAACGCGCCTTAAAACCACGAAAATATCTACGACTAGTAGTGTAAAAAAGCTGAAGTCAACTCCGAAAAGTTGGTGCTGGTCAAAAAGAAAA^ 
TTCAGCCGCAACAACAACTGCCCACTGGAAACTACAACTACAACAAATAAAGCAGTCAGCAACAGCAAAAACAACAACCCAGCAAACGGCATTCT 
TTCGCGCAACAACAACGACTCCACGCACACAGATACTCTCAGTTGCGCTCGCCCGCGTGTGCAATTGTTCTGGTGTTAGTGAGTTTTGATTAAAG 
TATGTGCGACGCTGGAATGCGACCCGATCGCTGTTGACTGCGCAGTCGCAGCGCGCGTCGTGGAAGTGGAAGAGGTCAGAAAGTAACAACAACGA 
AACGAAAATTTTACTGTTAGGCACTTATCGATAGCCAAATCAATAGATCGGTTTTAAGAGTGCACGAAGAGTGAGCTTTAGTTAAACAAAACTAC 
CAAACCAAAATTAAAAAGCCAGTTATTGTGGATTACGCGAAACCGAAACAGAAGCAGATACAGATACAGCATTAACAATGCCGACTTCTGCAGCG 
GCTTAAGCAACGTATTTCGAACGCGCGCGCTGTATACGGAAAATGATGCAGCATGCGAGCATTTTGCACCGTCAGCGCGCCTTTGGAAGTATGCG 
CGTCCTCAGCCGAGCAACTATCACCAGGCTCGCGATTCCTGGCCCTGAGACTTCTGGGGCAGCAGCAGCCGAAAACCTTGTACTTCCTAGTCGAT 
GCCAAGAGTCGAGTTCGAGAGGTTTACACGCAGACATGTCTGCACTTTGCCACGCAAGGGATGCTGGACACCGAACTCTTCGGTCTGGCTGTGCT 
CATAGATGGCGAGTACATGTTCGCAGATCCCGAGAGCAAGCTCTCAAAATATGGCCCAAAGAGCTGGCGATCATCGCACACCCACGGGCTGGATG 
CCAACGGACGACCGCTCCTGGAGCTACACTTTCGCGTGCAGTTCTACATCGAGAGCCCGTTCATGCTGAAAGATGAGACGTCCCGCCATAATTAC 
TACCTGCAGCTGCGCCACAACATCTTGCAGCGCGATTTGCCGAGAGAACAGGCGGAGCAGGCACTGGTCTTCCTTGCTGGACTTGCTCTTCAGGC 
GGACCTGGGTGATGCTCCTCCGGGCACGAGTAATTCCAAGGATGATTCAGGAGAGGAGACATCGGCATCTCCTTCTAACGGAGGAAGAGGTCTAA 
GTGCCACCACTACATTACCCAAGATCAGCAAACGTGCTAATGAGAGGATGCTTAGACTCTCGACATATGTGGCGTCCACATCAAAGAGAGAAACA 
ATACCATTACCACCCAGCCTGCCACCGAATGGAGCAGATTACTACCGGATCGAGGATTACCTACCCAGTGGACTGCACACTCCCTGGGCGAGAAG 
TGCAATGAGAGCCTGTCATCGGGAGCATTTGGGCATGGCCACGGCAGAGGCGGAGCTGCTATATATTCAACAGGCATGCAGTCTTCATGAAACCA 
TCAATGCCCATACCTACAGGATGCGATTGGCCAAAAGCGAGCAGGGATCTGGTAGTGCCTGGTTTGTGGTCTATGCCAAGGGAATCAAGATTCTG 

G G AG GAG AGT C T AC C AAC AGC T C T T C C A AT CC C G AAAC C AC AAC CT TTCTGTGGCC G AAT AT C AC G AAG CT C T C C T T CG AGCGG AAAA AG T T T G A 
AATACGATCTGGAGAGAGCCGAATCACCCTGTACGCCGCATCCGATGAGAAAAATAAACTCCTATTGACCCTCTGCAAGGATACACATCAGTGGA 
GCATGAAACTGGCTGCCAGGCTAAAGGAGGTATCCAAACGCGAGGAGGAGGAGGCTGCCGAATCACAAAGGCTACATGCCAGCTACGCTTGTTCC 
AGAAGTCTGCTCTTGCCCTACAAAAGTAAAAACGAGCAGCGAATATCGGTTATCTCGAGTACCAGCTCCAACACAACCTCTGGGATAGTAAGTGA 
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TCGAGTGCACTCCGAAGACGAGCTGGAGATCATGATCAATACACCACCAGCACCCTTAGCTGCTCCGTCAACGGAAAGTTTGGCCTTGGCTCATC 
TTTTGGACAGACCCAGTGTGAGCAGACAAACTTCGAGCGTGGGTCAGATGTCTCTGAAGGATTTGGAAGAACAACTGGCTGCCTTGAGTGTGAGG 
CCACAAGACGCGAGTTCTAACGGGGCCACGATCGTCACGAACTCCTCTGTTCAGAGAAATTCCATGGGAACAACTGCCAATGATTCATCTACCGC 
CACTGATTCACCCAGTTCGCAGCATAACATCGGCTCTCAGTGCTCATCCACTTGTAGCACTGTGGTGGTAACATCTCCTGTAAATGGAGCAGGCG 
CTTCCTCTTCAGGTGCTCCCATACCTGTGCACTCCACTTCATCTAGTTTGGAGCTTGGCTTTAGTCACACCGCTCAAAACTCGGCCTTGAGCGAA 
ACGAGTCCCGACGATTTTCTCTCCACATCGGCAAGAGAAGAAACGGAGAGTGTATCGGGCGCATCTGGGGTTTATACTTTGGCGCACGGAGCTCC 
ACCCACGGAAACCTCAGGCGTTTACACCATGCACAGCAGTGAGCTAACTGGACAATCATCGGAGATCGCCGAGTCGGAAAAGAGTTCGCACTACG 
GGATGTTCCAGCCCCAAAAACTAGAAGAAACCCACGTGCAACATTCTGATTCCGTAGATGGAAAAAAGAAAGAAGACTTCAGACCGCGATCGGAT 
TCTAACGTCAGCACAGGGAGTTCCTTTAGAGGCGATGGCAGCGATCCAACAGACAACAAACATTCCTTGCTCTCCGCAGAAGAGTTAACCAATTT 
GATTGTGGGCAGGGGGACCTACCCCAGTCGGAAAACAGTATCTAGTAGCCTTCACTCGGACTGTGATTACGTGACGTTACCACTAGGAGATCAAG 
GAGAAGAGGAGGTAGACCAACCACCAGCACCACCACCACCCTACAGTGCTAGGCATGAGAAAACTGGTCTCTGTGGCCCACCTATCGCCAAACCT 
ATCCCTAAACCTATAGCTGTGGTAGCTCCCAAACCAGATTCACCGCCATGTAGTCCACCGGTACCGCCAGCACCAATTCCAGCCCCACCGCCGGC 
TATTCGACGAAGAGATCCACCGCCATATTCCATCTCTAGCAAGCCTAGACCCACATCCCTGATATCTGTCAGTTCGTCGGCACATCCTGCGCCCA 
GTGCTGCCGGCTCGATGAGTTCTTTAAAGTCCGAGGAGGTTACAGCCAGATTTATCACCACCAGGCCACAGATTAGTATTCTGAAAGCCCACACC 
AGCCTGATTCCCGATGGGGCCAAGCCAAGTTACGCGGCACCGCACCATTGTTCATCAGTTGCCTCCTCCAATGGATCGGTTTGCAGCCATCAGTT 
GAGTCAGCAATCGCTGCATAACTCCAACTATGCCGGCGGATCCCAGGCTTCACTGCACCATCATCACGTGCCGTCACACCACCGACATTCTGGAT 
CGGCAGCCATTGGAATAGTTCCTTACGGTCTGCACAAGAGCACGGCATCACTGCATCATCAGCAGTCATGTGTACTACTACCGGTTATTAAACCT 
CGGCAGTTCCTGGCACCACCACCGCCCAGTTTGCCGCGGCAGCCTCCACCACCGCCCCCACCGAATCACCCCCATTTGGCGAGTCACCTTTACGA 
GAGGGAAATGGCTAGAAAACAATTGGAGCTCTACCAGCAGCAGCTTTACAGCGATGTGGATTACGTAATTTACCCCATCCAGGATCCAGCTGTAA 
GCCAGCAAGAGTATCTGGATGCCAAACAGGGATCCTTACTGGCGGCAATGGCCCAAGCAGCACCACCCCCACCGCACCACCCATATCTGGCCATG 

C AGGT GT C GCC GGCGAT T T AC AG AAGC AC CCCCTACCTAC C AC T G AC T C T GT CG ACC C ACT C ACG AT AT GC AT C C AC C C AAAAT C T AT C GG AT AC 
ATATGTGCAGCTTCCAGGACCCGGATACTCACCCCTGTACTCTCCTTCGATGGCCAGCCTATGCTCCTCGTATGAGCCTCCACCGCCTCCTCCCC 
TTCACCCGGCTGCTCTAGCAGCAGCTGCTGCGGCGGGGGCTGGCTCCTCGTCTTCCTCGATGTTTGCGCGATCGCGATCAGATGACAATATTCTG 
AACTCACTAGATTTGCTGCCCAAGGGAAAACGACTGCCACCGCCGCCACCACCACCGTATGTAAACAGACGGCTAAAGAAACCACCCATGCCGGC 
GCCCAGTGAAAAGCCACCTCCAATACCCTCTAAGCCAATTCCTAGCAGGATGAGCCCGATTCCACCACGAAAACCCCCAACGTTGAATCCGCATC 
ATGCGAACTCACCGCTAACAAAGACCTCCAGCGGCGCCCAGTGGGCAGGTGAGCGTCCGAGACCTGACTTGGGTCTTGGTCTGGGATTGAACCGG 
GGAAACAACAGCATTTTGGCCCAGCTGCAGGCTTCAATGGTAGCCCAGTCTCATGCCCAGGCACAGGCCCAAGCGCTAGACATTGCGCTTCTTCG 
GGAGAAGAGCAAGCATCTAGATCTGCCGCTGATCTCGGCGCTCTGCAACGATCGCTCCTTGCTGAAACAGACTAAGGTTGTTATAAATCCGAAGA 
CTGGCCAGGAGATGCCCACTTCCAGTGCGCAACCTTCAGGTGCCACCACAAATGGAGTTGCCAACTCCTCTGCAGGAGCAGGAACCCTAAGCAAG 
GCCAGAAAAGGATCTACCGTAAGCCACCGTCATCCGCAGGACAAACTGCCGCCTTTACCTGTCCAACAATTGGCGGAGGCCAACAACTACGTGAT 
CGATCCCGCTGTCAT G AT G AAGC AAC AGC AGC AGC AGC AG C AG C AC AAC AAG AC C AGC T AGAC T C G AC AAG C T AAT T GG AAAC CGG AAC GG AAGT 
GGACATTGGGGGTCCAATACGACACTCTTCTCTCTAGTTCGATTGCTGTTCGAAGTGCTGATGTATGAGTGTGCCCAGGTGAAGAGCTAATTCTG 
CGTTCGGAAATTTGCATAGATCGGGTGGTGTAGCGTTATATAAATTCGATCACAGTGTTTCGAAGGGTAAAGATTCGCCGGGCACAAAAGGGTAA 
ATGTAAACGAAAACCCACCGTGAGAAACCATTTGTCAGAACAGCAGAAACAATAGCTATTTAGTAGGAACTAACATGCATTTAACTAGCGAAGAC 
AGT T T TT T AGAAAAAC AC AAAC T AGC AG GT CC AC C AACT T C AG ATGG T AAAAT AC AC G AGC C AC T G AAAC AAC T G AAT AT C AC AGAG T C AAT AC T 
TAAGCCTACAGGACGTGAGCAACGTAACCTCAAAATACACATTCAAATAATCAGAACCCAGTCTAGTCACTTAAGGGAATTGGTAACGTAACCAC 
AGCACAGACCACAACATGACACAACATCGGAAATAGAGAGGAAGCTTATCAAAATGGGAAGCAACAATACTCAAGTATAAAATACGACCACTACC 
T AAT T T AC T AAAC T AC AT C GT AGAT CT T T AGT C T T AGC T GC AAAT AC T T AGT T AGGC C AG AAAT GC GAAT AC TT AAAAC T AAC T AC T T C C GC C C A 
AACAACTGTGATTGATTTTTATTTAAACTTATACTAAATTTATAACTATTTTAGTCACCAAGAACAACGCAACATATACTATGCAAATTTTTTAG 
TACGTAGCAACGTCGCTCTTTATACACACACACACATACTCACAACGAATCGGGCCAGCGGTAGAATCAAGTGAAAAGCATTCAAATCGGCTATG 

AAATGGTTTGAAATGGGTTTGCTTGC 
(SEQ ID NO: 443) 

Start ATG: 908 {Reverse strand: CAT) 

MflAFCTVSAPLEVCASSAEQLSPGSRFLALRLLGQQQPKTLYFLVDAKSRVREVYTQTCLHFATQGMLDTELFGLAVLIDGEYMFADPESKLSKY 
GPKSWRSSHTHGLDANGRPLLELHFRVQFYIESPFMLKDETSRHNYYLQLRHNILQRDLPREQAEQALVFLAGLALQADLGDAPPGTSNSKDDSG 
EETSASPSNGGRGLSATTTLPKISKRANERMLRLSTYVASTSKRETIPLPPSLPPNGADYYRIEDYLPSGLHTPWARSAMRACHREHLGMATAEA 
ELLYIQQACSLHETINAHTYRMRLAKSEQGSGSAWFVVYAKGIKILGGESTNSSSNPETTTFLWPNITKLSFERKKFEIRSGESRITLYAASDEK 
NKLLLTLCKDTHQWSMKLAARLKEVSKREEEEAAESQRLHASYACSRSLLLPYKSKNEQRISVISSTSSNTTSGIVSDRVHSEDELEIMINTPPA 
PLAAPSTESLALAHLLDRPSVSRQTSSVGQMSLKDLEEQLAALSVRPQDASSNGATIVTNSSVQRNSMGTTANDSSTATDSPSSQHKIGSQCSST 
CSTVVVTSPVNGAGASSSGAPIPVHSTSSSLELGFSHTAQNSALSETSPDDFLSTSAREETESVSGASGVYTLAHGAPPTETSGVYTMHSSELTG 
QSSEIAESEKSSHYGMFQPQKLEETHVQHSDSVDGKKKEDFRPRSDSNVSTGSSFRGDGSDPTDNKHSLLSAEELTNLIVGRGTYPSRKTVSSSL 
HSDCDYVTLPLGDQGEEEVDQPPAPPPPYSARHEKTGLCGPPIAKPIPKPIAVVAPKPDSPPCSPPVPPAPIPAPPPAIRRRDPPPYSISSKPRP 
TSLISVSSSAHPAPSAAGSMSSLKSEEVTARFITTRPQISILKAHTSLIPDGAKPSYAAPHHCSSVASSNGSVCSHQLSQQSLHNSNYAGGSQAS 
LHHHHVPSHHRHSGSAAIGIVPYGLHKSTASLHHQQSCVLLPVIKPRQFLAPPPPSLPRQPPPPPPPNHPHLASHLYEREMARKQLELYQQQLYS 
DVDYVIYPIQDPAVSQQEYLDAKQGSLLAAMAQAAPPPPHHPYLAMQVSPAIYRSTPYLPLTLSTHSRYASTQNLSDTYVQLPGPGYSPLYSPSM 
ASLCSSYEPPPPPPLHPAALAAAAAAGAGSSSSSMFARSRSDDNILNSLDLLPKGKRLPPPPPPPYVNRRLKKPPMPAPSEKPPPIPSKPIPSRM 
SPIPPRKPPTLNPHHANSPLTKTSSGAQWAGERPRPDLGLGLGLNRGKNSILAQLQASMVAQSHAQAQAQALDIALLREKSKHLDLPLISALCND 
RSLLKQTKVVINPKTGQEMPTSSAQPSGATTNGVAKSSAGAGTLSKARKGSTVSHRHPQDKLPPLPVQQLAEANNYVIDPAVMMKQQQQQQQHNK 
TS* 

{SEQ ID NO: 444) 
Name: expanded 

Classification: known_f lybase_gene 
Gene Symbol : ex 
FlyBase ID: FBgn0004583 



Celera Sequence No. : 142000013383801 
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TAGTCCTACCGTATTGGTTGAATCCGGAGAGATTGTAGGCCCATCTGCCCAGATTTGCTGTGGGGAAAAATTCGCATTACATATTTACGTAATCA 

TAATATTCCGGCGTACTGCACTACGAAATTTGTTGCATGCAACGTGAGTCCGATTGTTGCCCGTTCGTATTACTTACAGAGAACTGCGGGTCCCT 

TTCTGGCAATATAGTTCGACATGGTTTACGTAAAATCAGAAGTCGGGAAAGAGATATTCCTCCAGCCTTTTTATTTTAGGTATGAGAACGTTGGA 

GATGGGTCTCTCGATAGCTATCGCACCCCGTTATCGATCAACGAGCACTACGGTTGCGTTTGTACAAATCGATTTCCGAAATGTAATACAGACAC 

ACTGCACCTATCGATATATCGCACTCAAGAAAATTCGTTATTCTTTAGTTTTAGAGCTATTAAATATATTATCTTTTAACATTTTACCCTTTTGT 

ATTTGTATTTAGATTGTTCAAAAACTTGGTGGAGACGGCAACCGAACTACAGTATAATTACATCTCCAAAGTTGGTCATTTTGCAATAACTTTTT 

GTATAAAGTAAATAAAATCAATTCAAATTCTTGTAAACCCATTTAAGCCCATGTAAAAATCCGCCTTCATAAACAAAATACTTGTAGTTTATCAA 

ACCGAATGATTGTGGCTTTCTCCTTTTGTAATTTGTTATGGGAAACACTCAAAAAAGGTTTTTAATTTGGGTTAAGTTATAATTTATGTTAACCA 

ATAACCTCGATTAAATTAATTATAAAAATGATGCCTGAAGATATACAACTTTCACTTAACTTGCAACACGGGTATTGTATATTAAACAGCAAACA 

AAAAAGGCAAAAATGTTGGTAATTAGCGGAGTGCTTCGACAAGCGGAAATCCCTACTTCTCATTTAAATAATATTTCCCCAATTTTTTGCGATAA 

GCCCCCGCCAACGGTAAACCCTTCATTAAAATGATTGATAGCGTTTTTATTTTAAAGTACATACCTTCGCTTTATTATGACTACAAAATATCAAT 

TTATACAAATTAAAGAACATATACAAATTCTTTATATATACAGTGCCCTCCCTTAAGAGTGAACACTCGAATATGTGAACTTTTTAGTAAAAGTG 

AACGATTAATTTATGAAATTTTAGAAACTTTCGCTTAACTTAGTAAAAAAATTAAATTTTTTCAAACGTGGTGTTTTTGAAAAATATATTAAATT 

GT T TAT T T T T TC T GT G AT T C GTT AT AGT G AAC AAAT T C GAT T C AG T GC AC AC C CT T G GT T CT C AAT AGT C C AC T GT T T C GAG T G AGC AC T GT AC A 

TACACTAAGGCTGGCACGAACACGACACTAATTTAAGTTTGTTTCGGTTTTTGTTTCTGATTTCTGAACACACGTTATATATATACATATGTATA 

ACATTGAGACCCGTTCCAGCCTTAGTTTATACATATATCACTGGGCACATAATGTAGTAACTCCCAATGCGCTGAGTGGCCGGGAGAGAGAAATC 

ACTTATTGTTCTTAGCTGCAGTCAATGACAAAAACTCCACCTTTCGAGCCAAGTACTCATTATAACGAAGAAATGCATAGCTAAATGGGGCGAAT 

AAAAAATTGGTCATTAGTATCGATTGGTTGTAAGGGTCGTAATTGGGGGAAAAAAGGAATTGCAGGGTGCACAATAATATAATAAATCCATTATA 

AT GGC C GT C T AAT GGT T GT T GAT GG AT CT T AT C AAAC AAAACT G AT AT AT AT AC T GAT GAT AT AT AT C TAT AT AT C T AAAGG AT T T AT CAT C T AA 

TTGCATCGTGCATTCCCAACTCAAAGTTACAGTCCATATACTATGCAACACGCACCCTACGTAGTCGAAATCGATAGTGGAATGCTCGGCTTGCA 

ACAGGGACCACACCGTCCAAAAGATATGCGATGCCAGCGCAAACTGATTGACCTGGACATAGAGCAGTTCGACTTCGTCGTTTTGAATGTTGCTG 

CGCTGTAGGTACTCCTCAAGGTAGACCCTCAGCCACTGGAGCTGGAACTCGCGCTTCGGATAGCGCGAATAGTCGACCTCATCCACGCCGCACAT 

CTCCGCAAAGTGGTTGCCAATGTCGAAGGCCTGGAAATTGTAATCGGCATACTCGTAGTCGATGAAGTTCACCGTATTCAGGCTCTGCGTATAGA 

TCACATTTCCCAGCAGAAGATCGTTGTGGGAGAAGACAATCGGACTGTCCAGAGCCTCGAGGTATTCATATAGCTTGTTGAACTCCTCGCGCAGG 

C GGC CG AT AGG T AG AAAC GT T T C T T T C AC T CT GC AAAT T G CAAATC AAAC AATG AGC G AAT GAAC AAAGG AC C G AAG AT GC AAAGG AT G T CT GGT 

GTCGTTACCTTTTGTGTTTTTCAGCATCACTAAAACGTTCAGGTACTAAATCAAGAAAGCTCTGCGTCTTCTTCCAAATCATCGGCATGGGTTTG 

GTCGCCGAACTATCCCCGTGCTTTCTCACCTTGCGATGCATCTCGGCCATGCGACGGGCGACCAAGGGCCAAATTTCTGGACAGAGCACACTGTC 

CGTATTCAGGGTGGTTCCAGGTACGTATTCGTAGACGAGACCGTTCTTGAACGTGGCATACAGCGATGGCGCTAGCCCATACGTATGCAGTAGAA 

GAAAGTTTTGCGTCTCCGCCTTGCGATCTATCAGTAGGTCCGTTTTGTTACCGTATATCCTGACGAGCACTACGTTATCGGAGTACTGTACAGGT 

GACCCGTCGTCCGCTGCCCGATCGTCTGTGAACTCATCATCGTCCTCCTTCTCAATGATGACTGGATCCTCGGACTGAACGGGCGACAAACCCTG 

CGTCTTGATAGGTAAATACGATCCGCCGTTCTCATCGTTCAGTTTGGAGATCTCCTTATGAAAACATCCGACCAGTTTGTTTGTGATGCCATCGG 

TAAAACTCTAAAATAAGCACACAGAGAACAGCACCATTAAATCGTATACATAAAAACATAATACTCATCGATGTAGATCCAGTGAATGGGTTAGG 

ATGGATTAATTAGTAGAGTAGCTACATGATTTGTGTGCTTTGTTTATTGTAAGTGAAGTTATTGCATCATCACCTAATACCAATTGCTATACGGT 

AAGGTTCATTAGTAGCATTCAATGGAACGGTTTCGGGAATTCGCCTTTTAAGCCACAGCTCAAGAGCTACAGTAAAGGGCTTATGGCCACTTCCT 

TATCGACTCCACATGCTTCTTATGTACATATGTATGTACATATATAAAAGAGTGAAGCACATGGGTGATAAACTTTGTGACACCAGCTATTTGCG 

TGCTAAGCGTCTCAGAAAGACCACCTTCGTCTTGAATTATATACGCATACGGTGCTTTTTATAATGAAAACACAGAAACAGATAAGATAATGAAT 

TGCTAAACAATTTAAATTCATATAATTCAAAGCGAAAAAGCCCAGAATGTGTGAAAGGTGAATGAAATAAAAGTGCGAAAATAGTAATTAGTTCA 

TTTCAATCACTTCGATCTGCCCTGGTTTTATTCATTCACTGAGAAAAGTTTATTTATTGCATTTAATCAATATTTCCCAGTTATTTTCCCTTTGC 

TCAGTGTTCAATGCACTTAAGATTAAAGTAGTGCTGTACAAACACGTGTGTTTCAGATCGCTGGCGATATAGTAAATTTTCCACTTCATATATAG 

AACGACGTCTCTAGCGGCTATAAAAATGTCAATAGTTAACCCATTACTCGCATGACAGTTGTGAGGCATAAATAACCGTGAAACTTGAAGTAAAA 

CTTAAACATTTGGTTTATGGTTGAGAAACGCTGCCTATGACGCCATGGTCCATGAATCAACTCAGATTTGTGCAAGTATTCATAGTATTCATAGT 

AG T CAT AT T GT AGC AT T TAT TAT C AC AAT T T C G C C T G AC T GGC GC C AAAG AT TTGTGACTCC C CAT AAG GAT GCCTCCCCTC C AC T T G AAT C C T G 

AATCACGCGACAAATTATCTATAGCTTAGAAAACGGACATGACACGATTTAGACTGTGTACCATGTACCCATGTACTATGTACACCACTCGCTGA 

TCAAGGGATGAACCACGTTTTATTGCCCACCAGCGATTTGGCAAGACTCAGCACTTGGCAATTCCGACTGGAGCTGCCATAAAGCTGATAAGTGA 

GTTACGGACAAGCTGTGTACACCAATTAGACGTCATGCCTATGGTGCGGCATATACAGATAGCGAGGTCAAGGCGTACAAGTGGCTTCCGATAAG 

GCAAAAACGCCCCATTTCAAAAAAACCGGAGTGATTAAAGGCAGCCTCAGGCATTTGTACATACATACATCAATCATACTCCTCTAACTCGACAG 

CGTGAAATTATCAAATCGGGCCAAACGAAATCAAGCCCCTAAAAAGCGAGGGAAAAAACGTTTTTTTTTCGGATCATCCGTAAATTACGGACGGA 

CGGACGGATGGACGGTGTGTCGGTAATGGTTATCTGTAAAAACACTCGAGTCAATGCACCTGCTGTCAACACCTTCTATGTCAGGCATTTAAAGT 

ATTGTGGTAATTAGAGAACGTGCGATTTCATTAGAACTAAGGCGACTAATTGGCACGAAGGCCTATTTCTTGGCTCACACCCGTTTCCAACCCGC 

TCCGTTTGGTAGGGTGTGTGTGTGTGTGGGGGGGGGGGGGCACTTATTATTATTTTCCCCCCCGGAGTCGAGGGCTAGATTAGCAGCAAACTGGT 

TTCTACTGCGACGCGGGGTCTGAAGTCTCCATAGTTCGCTTGGCGACTCGGAAACTCAGAAACGCTGTCCTAATACACCAACACCAGCGTAGTGA 

ATAATGTTAATTTCGAATAACAATCCTATGCGTCTCTCTCTTCATTGATATAACAAAGTCTATTTACAATTATCATGGCGGCGCAGCAGCGAATA 

TATATATATAATTACTATTTGAAGGTTGGCTGTTGTGCAATAACCCTGTTTCAAACAATGTTAGCAACTACTACGACTCTATATCTCTATATATA 

TGTATATGAATGTATATAGTAAAGTATATATCCCACGTTCCGCACACCAGTTAGTGATCGTAATAAATGATAAGGAAACATACAGCGTCATCATC 

ACCATCGCATTTTGGGCCGGATAACGATCTTCGATCTGCGGAACTACCCTGATCTGATGGAGAATGTAGAGAATATGACATTGATGGTTTCGTTT 

TTTTGCATTGTATAGTGTGTTGTTTGGCTGGTGATTTAATTTGATTTGCTTTGATTTGATTGTTGGTATAATGATATTGTAGGGGGGTTATAATT 

TTGAATGTTTTGATGAGGTTTACGTTACCTTAAACTCGACGTGGCTGAGGTCCCAGGTCGGTCGTATAACCTTCAACAGTTCCTTGGCCCCTTGA 

ATTACGTCGGCCTCCTCAACAAATATGGGTACAAATGGTACAATTGCCTCTTTTCTGGATTTATCCTCCGGTTTGGCTCGAATATCGCGCGAATT 

TTGTTCGTTTTCGTTTTTGTTTTCGCTCCCGCTTGGATTTGGATACGAATTGGAATTCGAATGTGAATTCAATTGGTTCTGGACCTGGTTCGATT 

GCGATAGCGATAGCGTTTGCTGATTGACCGTCTGACGAACTAAGGATAAAGAATCTTTCATCACTTTGGGGTTGCCGCCACTTGTTGAAATCTGT 

CCTGTGTAACTGTTGCTCTTGGTTTCTGTGCCCATTCTATTGTATTATATATGTGAAATACGAAAGTGTCAGGTCGGGTCATACATATAGAACGA 

CAATAACATCAGCATGTAAAATGAGTAGGTAGGTAGGCGGTAGTTAAAAGCTAATCATCCGAATGCAAAACCAACCGAGAACCCAACCAAACCAG 

CAAAGATATCCTCTCACCCCCCCACGCAACGAACCGCAGCCAAATGAAAAATTAAAAAACCCAGTAAGTATACTACGTCCATGTCCTCTCACTCA 

TGCGCACCGAAACCAGAGTTTTGGATGGAGATGACCGCACCCAGTTAGTAGGATCTGCGTACCATGTGGGAAATAGCCCAAACACATCCACTTCA 

CCACCACACGAATGAAACATTATTGTATCGTTCATACTAAATTTAATCTCAGTTTAAGGTTTATTTGTTTATATATCAGATATAGAAGATTGTGC 

CAAAGTAAAGCAAAATCTACCTCCTAAACAAAAACTTATCTACATAATGATCAACACTTTTAGAGTCCCAAATGTGTAATTAACAATTATACGAG 

AAAATATTCGCTAATTTGACAAGCAAATAATATATATGTACGTATATTCCATAAAAAATGATTACGGAAAATAAACCTTGAAATTGTGATAGTCA 

ATTCGTTAACTGGTTTTCGCTTGATACAGGAAGTTCCTAATTTCAGTTATAGATAAACATGCTCTTTTACCCTCATTAAACACCAAAATTCGCAA 

TATAGATAAAAGCTGCCTCAATGATACTTATTAATCATTAATCACGATGTAATGAATGATACAATTTGCAAACTAACAACTTGTTAATAGTTAAT 

AATAATACATAATGCAGCTCTAACTACACAAATTCATGCATGTGTGTGTATGTATGATGGACGCTTTTTGATTACCATTCTTCTTAGACCTTATC 

GTTGACATTATCTACTGACTAGATCCATCATCTTGTCTTCATTTTCGACTATATCACGAAAAGCTGATTTAAAAATAAACTTTTTATTCGTTTAT 
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GATAAGCAAACATCTCGAATATTATAGAATCAAGTTGGTAATACTGACAACGGCTATTAGGTGCTATTATGTTACCCCGTGCTGTATTATGTCTT 
CGAATGCGGCACAGCCGGGCGGAAAGTACACGTAGCCGTGTGTACTCGAATTATACGAAGGTGCGTAAGGCGATGACTATAAATCGGAGAAGGTA 
ACGAGACAACATACGTATATGTATATGCATGGGATAAGGTAAGGGGGTCAAAATAAGTCAATAAGATCGCCGTGTTAAGCCAATGTCTTAAAATA 
CAGGGCCGGAGCGAGAGGGTGGCTAAGTTTTTAATTTGATCAAAGGCAATTTTAAATCGGAATTATGCTGTATCCACTAAATTTTATTACAATTG 
AAACTGTTTTGGGGGGTGACGCCCCAAAATGACCCCGCCGACTCCGCTCCTGGAGCCAGGGAATGCACCCACACTCGCGCACAAATAAATAGGGC 
GTAAAATGCTAATCTTGTGAATACGTGAATGTACATACATGTGTATGTACATACTCCACGCAAATCAAGTGTACATTATGTACATGCGAACGTAT 
GTAAGATCGGAAATCAGGTGAAAAGCCCAGGGCCCAATAGCACAGGTGGGGGCGGGCGCGTTAGCACGGGACATTTGCTTTAATATTAACATGTT 
TTCTGTTTCGCACTTTCTTCATTTTTTCATCTTTATTTTGGTTTTTTAGCCAGGCCAAAAGTTTACGCATACAGTGGCAATGTGTGTAGTACGTG 
CAAGGGTACAACTGCATGTGTGTGCATAGTAGGTGGGATTAGGGCTCATCTGCACGGGCTAAGTTTCCCTTATATCCCGTTATACTACTCGAATT 
CTGTAAACTTAAATGTTGTGAATGAAACTAAAATATAAATAAGCTACATTACTTTGCCAGTCACATTATAGACCTCTTTTAAGCCTCAAAAGCGG 
CGATTCCATGCGGAATTTAGGTCGTGCGAAGGGGCCTTAAGCGCGTTCTGGGTTAAGAACCCCCGCCCACGACCCCCTTACCACAACGACCCATC 
CCACAATCCGAGCATCCTTTTTCCTGAACCCGGTGCGATATTTGTTGGTTGATTCGAAGGATTCCCATACCAGATTTTGATTCCGATTCGGTGGT 
TCTGAATTCAGATAGTGATTCGATTGGATTCGATTTCATTCGACTGGATTTGATTCGTGTTTCGCATGCAATTTGAGTTTTGAAAACGACAAACA 
AGGTAAAATAATAATAAAATGTGTACCAATATGTCAAGCGTCAACGAATGCAAACGCACACTGAGGTGAATTGTTAGTTGTCTACACACGTTACC 
TTCGGGTATTGATTGTTTTTTAAAAGAAAACACAAAAAAAAAGCCGCGAAAATCACACAACCAGATTATCGATTTTGGCCCCCGCTGAAATTGGC 
GCACACAACACACAAATTTGACTGGGGAACGAACAGCGGGCGATGCAAACGATTCGGTTCCGACTTGGAATATATAGCTGCCGAACTATCGGTGC 
AACAATCGGATCGCTCAACCGGTCGCTCCATCTGCAGGGAATACCAAAACGCCTCAGTTAAGTGGCGCGCGAATGAAGGTACTTGCGCTTAGCCA 
CCAGCTGGGCACACTGCACGAGCCTGCATAGAGTTTTTTACGAAATTATCACAATTTAGGAGTTTTCCAATTAGAATTGGAATAAGTTCGTACTC 
TTAACTTGCTAACAAGTTGATTTTTAATGCAAAACTTTTCGCAGACAACTTTTCGACAAGAACAGTGCTGCCAGACTGTCAGACCGGCACCTTAC 
TGGCGTGTTTTTATCACATTTTCTAATATTTCCAAGCTATTGAAATATTCCAAAGCTTTCTATCTGCACATTCTTTTTATATAGGTGTTCCCAGA 
AAATCGCCTTAGCATGGAGAGTCGTGCTAATACGGCGTTGCCAGACTGTCATTGCATTTCTTGTTGCTCTTTGCAAGAGCCCGCCAAATGAAATT 
TTTATTTGAATACAATTTATTTTTTTCTTCCCCCGTTCTTTTGCTTGTATAATATCCCAACTGGAAAATGTGTACTTTTCAATTACGAAACATGT 
ACAATATATAGCAATAAAGTCCAGTAAAATCGTGGTATGTTTACAATTACTTTTTTGGGCCTGATGATACGTTGCCATTAAAGTGCAATGTCATA 
CCACTAATGTAAGTTTTTTAACCCTTTATACCCCCCAATAGCTTGCCCATAATCCTCCCGTGAACAATCTTGAATCCCAAATGCAAGAACAAAGT 
TCGAATATTCAACAGGAGCTCCGCCTAATGTATGATGACGCATGTATTTTATACAGATCAAATATTTTCATACCGTTTTACAGGGTTCCGAAGAA 
TAGAAAGTGGTAATTAAAAAAAGTATGCATTCGGTTCACCGAATTCAATTTGAATCGTTTTCCAAGTCAAATCAGGTTGGCTACAATGAGCAGAC 
AGAATGTGGGCGGCGACACCAGATAGTGTCGCCGAATGCATGGAAAGCGATTCAATGCAACGGCCTTCCATCGAAGAAGT 

(SEQ ID NO: 445) 

Exon: 8295.. 8075 
Exon: 5736.-5349 
Exon: 2952.. 2384 
Exon: 2310.. 1861 
Exon: 1600.. 1001 

Start ATG: 5735 (Reverse strand: CAT) 
Transcript No. : CT13 670 

TGGAGCGACCGGTTGAGCGATCCGATTGTTGCACCGATAGTTCGGCAGCTATATATTCCAAGTCGGAACCGAATCGTTTGCATCGCCCGCTGTTC 
GTTCCCCAGTCAAATTTGTGTGTTGTGTGCGCCAATTTCAGCGGGGGCCAAAATCGATAATCTGGTTGTGTGATTTTCGCGGCTTTTTTTTTGTG 
TTTTCTTTTAAAAAACAATCAATACCCGAAGAATGGGCACAGAAACCAAGAGCAACAGTTACACAGGACAGATTTCAACAAGTGGCGGCAACCCC 
AAAGT GAT G AAAG AT T C T T T AT C C TT AGT T C GT C AGAC GGT C AAT C AG C AAACG C T AT C GCT ATC GC AAT C G AAC C AGGT C C AG AAC C AAT TG AA 
T T C AC AT T C G AAT T C C AAT T C GT ATC C AAAT C C AAG C GGGAGC G AAAAC AAAAAC G AAAAC GAAC AAAAT T C G C GC GAT AT T C G AGCC AAACC G G 
AGGATAAATCCAGAAAAGAGGCAATTGTACCATTTGTACCCATATTTGTTGAGGAGGCCGACGTAATTCAAGGGGCCAAGGAACTGTTGAAGGTT 
ATACGACCGACCTGGGACCTCAGCCACGTCGAGTTTAAGAGTTTTACCGATGGCATCACAAACAAACTGGTCGGATGTTTTCATAAGGAGATCTC 
CAAACTGAACGATGAGAACGGCGGATCGTATTTACCTATCAAGACGCAGGGTTTGTCGCCCGTTCAGTCCGAGGATCCAGTCATCATTGAGAAGG 
AGGACGATGATGAGTTCACAGACGATCGGGCAGCGGACGACGGGTCACCTGTACAGTACTCCGATAACGTAGTGCTCGTCAGGATATACGGTAAC 
AAAACGGACCTACTGATAGATCGCAAGGCGGAGACGCAAAACTTTCTTCTACTGCATACGTATGGGCTAGCGCCATCGCTGTATGCCACGTTCAA 
GAACGGTCTCGTCTACGAATACGTACCTGGAACCACCCTGAATACGGACAGTGTGCTCTGTCCAGAAATTTGGCCCTTGGTCGCCCGTCGCATGG 
CCGAGATGCATCGCAAGGTGAGAAAGCACGGGGATAGTTCGGCGACCAAACCCATGCCGATGATTTGGAAGAAGACGCAGAGCTTTCTTGATTTA 
GTACCTGAACGTTTTAGTGATGCTGAAAAACACAAAAGAGTGAAAGAAACGTTTCTACCTATCGGCCGCCTGCGCGAGGAGTTCAACAAGCTATA 
TGAATACCTCGAGGCTCTGGACAGTCCGATTGTCTTCTCCCACAACGATCTTCTGCTGGGAAATGTGATCTATACGCAGAGCCTGAATACGGTGA 
ACTTCATCGACTACGAGTATGCCGATTACAATTTCCAGGCCTTCGACATTGGCAACCACTTTGCGGAGATGTGCGGCGTGGATGAGGTCGACTAT 
TCGCGCTATCCGAAGCGCGAGTTCCAGCTCCAGTGGCTGAGGGTCTACCTTGAGGAGTACCTACAGCGCAGCAACATTCAAAACGACGAAGTCGA 
ACTGCTCTATGTCCAGGTCAATCAGTTTGCGCTGGCATCGCATATCTTTTGGACGGTGTGGTCCCTGTTGCAAGCCGAGCATTCCACTATCGATT 
TCGACTACGTAGGCTATGCATTTCTTCGTTATAATGAGTACTTGGCTCGAAAGGTGGAGTTTTTGTCATTGACTGCAGCTAAGAACAATAAGTGA 
TTTCTCTCTCCCGGCCACTCAGCGCATTGGGAGTTACTACATTATGTGCCCAGTGATATATGTATAAACTAAGGCTGGAACGGGTCTCAATGTTA 
TACATATGTATATATATAACGTGTGTTCAGAAATCAGAAACAAAAACCGAAACAAACTTAAATTAGTGTCGTGTTCGTGCCAGCCTTAGTGTATG 
TACAGTGCTCACTCGAAACAGTGGACTATTGAGAACCAAGGGTGTGCACTGAATCGAATTTGTTCACTATAACGAATCACAGAAAAAATAAACAA 
TTTAATATATTTTTCAAAAACACCACGTTTGAAAAAATTTAATTTTTTTACTAAGTTAAGCGAAAGTTTCTAAAATTTCATAAATTAATCGTTCA 
CTTTTACTAAAAAGTTCACATATTCGAGTGTTCACTCTTAAGGGAGGGCACTGTATATATAAAGAATTTGTATATGTTCTTTAATTTGTATAAAT 
TGATATTTTGTAGTCATAATAAAGCGAAGGTATGTACTTTAAA 
(SEQ ID NO: 446) 

Start ATG: 223 (Reverse strand: CAT} 

MGTETKSNSYTGQISTSGGNPKVMKDSLSLVRQTVNQQTLSLSQSNQVQWQLNSHSNSNSYPNPSGSENKNENEQNSRDIRAKPEDKSRKEAIVP 
FVPIFVEEADVIQGAKELLKVIRPTWDLSHVEFKSFTDGITNKLVGCFHKEISKLNDENGGSYLPIKTQGLSPVQSEDPVIIEKEDDDEFTDDRA 
ADDGSPVQYSDNVVLVRIYGNKTDLLIDRKAETQNFLLLHTYGLAPSLYATFKNGLVYEYVPGTTLNTDSVLCPEIWPLVARRMAEMHRKVRKHG 
DSSATKPMPMIWKKTQSFLDLVPERFSDAEKHKRVKETFLPIGRLREEFNKLYEYLEALDSPIVFSHNDLLLGNVIYTQSLNTVNFIDYEYADYN 
FQAFDIGNHFAEMCGVDEVDYSRYPKREFQLQWLRVYLEEYLQRSNIQNDEVELLYVQVNQFALASHIFWTVWSLLQAEHSTIDFDYVGYAFLRY 

NEYLARKVEFLSLTAAKNNK* 
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{SEQ ID NO: 447) 



Classification: enzyme 
Gene Symbol: eas 
FlyBase ID: FBgn0000536 



Celera Sequence No. : 142000013384541 

AGTCCTTGTTATAACTATACCATCACCCGTAGTGACCTCCATCCGTCCGGAGTTTGCTGTGCTAGCCTCGGGCTTCTTGTCTGGCAGTCTGGTGC 
ACATGAGCGCCTACGAAGAGGCCATCAGCATACCCACGTTGCACATCTACGGACAGACGGATGAGATCATTCCCAAGGAGATGAGCGAGTCCCTG 
GCTGCGCGCTTCAAGAACGCCGAGGTGCTGGAGCACAGCGGCGGACACTACTTCCCGGCCACGGCGCAGCAGAAGCAGACATTCATCAACTTCTT 
CCAGGATCGGCTGCAGGAGTACCTGGAGCACGAGGAGTTGCAGCAGAGCGGCAATGCCTCATTCGTGGATAGCGGAGCGGAGGACGACAACGATG 
CCGAGGTGGCGGCCATGACGGCCGAGCTGGACGAGAGTGATTAGCCGAGGGACGGGGGTTTTTTCGTTCGTTTTTAATAATAATTAAATCTTATT 
ATTACAAAAATAATGATTTGATTATAAATTATAATATAATTAGATCCCAATATTAGTTAAATATTCACTCTGCTCACGAAGGATCAAGCGGATCG 
GGTATTGAAATGTCTGCACGTTCGCTCAGTTCGCACATATTGTTTTTATGACTTTGTGTATTTTCTCAACAAATTCGTATAAATTTCATTAAAAA 
GAAACAATCAAGGGGTTAGGGTAAAATTAAAGCCGAATATTATCAATCCCATTCCAAAGTTCAATTTTGTGTCGGAACCATAGTAAATTAATTGT 
TCCTTGCTATTAACAACGAAAAATGCATATTTAGCTATTGCAGTTGAGACGGCAGCTTTTGCTTCTTCACCACGCTGGGAAGTTGAGAATCGCAG 
ACAAATAGATCTTCCTCCTCCTTCGTCCGGTCCGACCATCAACTTCGATTTCAATTTCATACATTTCGTTTGCGTGGGACAAGCGAGCGACAGAG 
TTAGCGGATTTATTTTGTCTCGATTTGCTGCTGCTGTTGATTTTGATGATGTGTTTGCTGCTGTTTGTTGTTCTCGTAGGGGTGATTGACTGACT 
GACTGCTGTGGCTGCATCCTTATGCCACCTGCTCCTGTCCGTTCGAGGCCTCTTGGTTTTTCATGACTTCGGGTAAGTCTGGTGGTGCCGAGTAG 
GGTGTCATGTCCAGTGTCTCAAAGTCGCCCACCTCGTTCCTTAAAGACAGATAGCTATGTTGTACTACTACGCTGAGCTGTAAGCTTGTAAGCGG 
CACACGTGCCGCACGCCCGCCTTGTCCGCTAGATAACCCGCAACCGGCTTATCAGCGACTCTCACCTGAATGTCAAGCCCAGCTGTGCCGGAGCC 
TGCGCCTTGGCCGTGGACACCGTCACGCCGTAGTCCTGCAACGTGCTGTCGTCCTCCATGACATCGTTGTCCTGATTGTATAACCGCTGGTCCAC 
GGGCTGCACCTTTAGTATGCCTGCGATCGATGGGGTTGTATGATCAGTCATGGAGTTACCCAATAATGGCGCTGCTCTATAAATAGAAGCCAACA 
CTTACCCTCAATCATTCGCTTCAGCTCGGCCACCGTTGTGTTCTCCTTGGCGTCCGTGAAGATGGTGGTCTTTTGCCGCCTGATCATAAGGAACA 
CGTCCTGCAAAAGAGAGGGTTAGCCATTAGATTTGATTCGATGCGAAGTGATTCGTTTCGGTCAAAGTTCAGTGGCGCTTTTTCAGTGTGTCATG 
CTTTCTGATCTTGCCATCACACAGGCGACATTTGTGCACATGCATTAAGAGCGCGACGGTGTCTATGGCAGCAGCATCTGCGATGCACTGCATCC 
TTGCTAGGCTGCACGACACCGGCACGAGCACTCACCATATTGAGGAGTTTGTGGCGCTGGCTGCGACTGAGGCTGCTACGCGGATGTGGGTTAAC 
CAGGTGCGTGGGCTAGTGCGCTGTGTTCGGCAACTGACGGCGTGGGTGGCGCTTCCTTTGGCGGTCGCGGGCGGACGTTTCGATCGGCTGGAATT 
TTCCTACTTCGCTTGCACCACCGCTGCTGGGCGCTTCTATTTTTTGCTCTGTGCCGCTCGACCGGAGTTGGGGGCGGGCGCAGGTGGGCGTGGCG 
TGCGGTTCGCGGTACGATGAGCGTGCGTGAGTGTGTATGTTTGTGCAACAGCTTCCTCGGTTGCCCCCCACTTGTAGGTTTTCGTAGTTTTACGA 
TTTTTTCTAGCCTTTTGCTGCTGCTTTCTCAACTAATTAGAGCTGTCAAACCATCGATGGTCGCATCGATAGTGAGCTATCGATAGTTCAGGGTC 
AGCGGCCTTCGTTGTCTAATGCCATCGCTAATCCGATATTATTGCAGGGCTTTTAGCAAGGGATTGTCTAAAGCCTTGGTGAATTCTATTAATGT 
GTATTAACAAATACTTCACAATTCCGTTTAATGTATATTTTCCATTAGTTTATAAATAAAATAATTCGCATAAGTAATGAGCTTTTGCAGCCCTG 
GTTATTGCCAAGAAAATGTACTAGAAAATACCGCCATAGCTGTCAGTGTCACTTTTCCCGTTTCTCCTGTTTTATTTGCCACGAAATGGTTTTGC 
TAGACAATTCGAACGTAAATAAATGAAATACCTTTAAAAGAAGCCGATAAGTAACCGTATCTCCTTACAGTTTATCCTGCGCCTGGAGAAGATCG 
CAAATGCGGCCAAGAAGGACTCCTCCTTCACGCTGACGTTCAAGCGATGTGAGTGTTCGACACCGAAAAACATGGTACACTAGGAACTAATTTAC 
AATTTATTTAATGTGCCGCAGATGATGGCAACGATAAGCCCGTGCCGAGGGAAGGACGGCCACCGCTGCCCAAGCCGGAGACCTACATGTGCCTG 
ATGCGCGCCCAGTCCAAGTCCCAAAAAGTAAGCCCGATTGAAAGCTCCACGTTTCGACTCCTTACCAAATAAACCTATTTTTCCGGCAGATTTCC 
ACCGTTGTCCGGCAGGAGGATGTGCCCGCCATGATGAGCATGTACTCGCAGTTCATGAAGAGCAAGATGGACGGGCTAAAGCGGGTTAAGAAAGT 
CAAAAGCAAGGCCAAGGCGACAAAGGGTTAATCGGGGCATAGTCTTAGTTAGATTTTATTTGACTGTTTACTGGTCGTTTGTTTTAAACCTTAAA 
TGCATCATTTTGTACGGAAAGCACGCTATACTTCTTATTTCTATCATCAATCTGACGCAGATCGAGCAGCCACTTCG 
(SEQ ID NO: 448) 

Exon: 2212.. 1841 
Exon: 1619. .1526 
Exon: 1445.. 1301 
Exon: 1179.. 1001 

Start ATG: 1843 (Reverse strand: CAT) 
Transcript No. : CT13840 

AAAGCAGCAGCAAAAGGCTAGAAAAAATCGTAAAACTACGAAAACCTACAAGTGGGGGGCAACCGAGGAAGCTGTTGCACAAACATACACACTCA 

CGCACGCTCATCGTACCGCGAACCGCACGCCACGCCCACCTGCGCCCGCCCCCAACTCCGGTCGAGCGGCACAGAGCAAAAAATAGAAGCGCCCA 

GCAGCGGTGGTGCAAGCGAAGTAGGAAAATTCCAGCCGATCGAAACGTCCGCCCGCGACCGCCAAAGGAAGCGCCACCCACGCCGTCAGTTGCCG 

AACACAGCGCACTAGCCCACGCACCTGGTTAACCCACATCCGCGTAGCAGCCTCAGTCGCAGCCAGCGCCACAAACTCCTCAATATGGACGTGTT 

CCTTATGATCAGGCGGCAAAAGACCACCATCTTCACGGACGCCAAGGAGAACACAACGGTGGCCGAGCTGAAGCGAATGATTGAGGGCATACTAA 

AGGTGCAGCCCGTGGACCAGCGGTTATACAATCAGGACAACGATGTCATGGAGGACGACAGCACGTTGCAGGACTACGGCGTGACGGTGTCCACG 

GCCAAGGCGCAGGCTCCGGCACAGCTGGGCTTGACATTCAGGAACGAGGTGGGCGACTTTGAGACACTGGACATGACACCCTACTCGGCACCACC 

AGACTTACCCGAAGTCATGAAAAACCAAGAGGCCTCGAACGGACAGGAGCAGGTGGCATAAGGATGCAGCCACAGCAGTCAGTCAGTCAATCACC 

CCTACGAGAACAACAAACAGCAGCAAACAC 

(SEQ ID NO: 449) 

Start ATG: 370 (Reverse strand: CAT) 

MDVFLMIRRQKTTIFTDAKENTTVAELKRMIEGILKVQPVDQRLYNQDNDVTyfEDDSTLQDYGVTVSTAKAQAPAQLGLTFRNEVGDFETLDMTPY 

SAPPDLPEVMKNQEASNGQEQVA* 
( SEQ ID NO: 450) 

Name: Elongin B 
Classification: enzyme 
FlyBase ID: FBgn0023212 

FIGURE SHEET 250 



Celera Sequence Wo. : 142000013384593 

CTGGCTATCCCTGGGTGTGTAGAATATGAAGCACATTGGGTACGAAATGCGCTGATCGTCATGGACCATTTTATACGTATAGATGACATATCGGG 

GCTGGTGTCCTGGCAATGTATCCTGCAGCTCGTCCACCGAAATATCGTCGATGAACTCGTCCAGCACCACCGTTTGCTTCTCCCGGTCGACTTTC 

ACTGCAAGGGACAATGGAGTGGTCATAAAAACTACACACATCTGTTACCATGACAATCAAGGCTTACATATCAAAGCCGCATTATTTTTGCTCTT 

GCTGAATCGAAACTTCTTCAATTCCTCCAGAACTTCGTTGCTTATGTCGCATATCTGGTTATCACTCTGTATGAAATCGATATAATTGATTAATG 

AATTGGAATCACCAAAGTTATCTGCTAACAACAATCGTCGATTGAAAAACAAAAACAAAAACACGCCCACACTCTATTAGTCATGTTTTCATTTT 

GGTAATTGATTTCCGACCCTATATACACTTCCTACCATAACTTTAGCTATGAAGTTAGGGATGCGATCTGTAAAGTGTGCTGCACTACCGAATTT 

TTGGCAGTTAATTGTAGGAATGCAATATCACTTGTCCTTAAGGAATCGTTTAAATACTGTCGTTTATTCTGAGTTTTGTTTTGTTGTTCCTGCAG 

TGAGACCGTTCACTGAACGCCAAATAATATTCTGTGCTGCATCAAAAATATGTGAAGTGTTGTTAAACGACGTTTCTGATAAATTAAAAAATAAT 

AACAATATTTGCTAAAGGTTTTTTTTTCATTAAGCTTTTTTCTTTACGGCAACATTGGTATTATTTATGGAATTTATTAAAAATAATTGAAATGT 

ATTTTTTAGATTGAAATAGGAACGCTCTTTTTTAAAAATGTTTTGTAGATATACTTAAAACTTATCATTATTTTATAGTAGGTAAAAATCGTTCG 

TATACGATATTTGGATCATGTTTGTCCTCTCTAGTGGCTGATGGACTGTCAAAAAAGTACCGTTACTTTAGGGTTGTTACAGAGAGCACTAACTT 

ATAAATAGTTAATACTATAATCTATGTGCTTAATGTCGTAGTTACAATGAGCTTGAATTTATATGAGAGCAACATCCTTAATTCCTGCTGGAAAG 

CGGGAGGGCGCAAACCGAGGAAATCGGCATTTGGCCAACTTAATCGAAATGAGATCGCTGATGTGGATGTAGTCGCCTGCTGGTAATTAAATATT 

AAATATAATGAATTTTTTTAACATTTCATTATATTCAATTGTTTTCAACATTTATCTACATTTCAGCAAGCAAATCACCGAATTAATTGAGGAAA 

ATGCATTAAGAAAACGCCAGAAACGATCCAATGTTTTGGCCGCAAGAAACCGGCAAAATGTTATTTTCAAGGACATTTCCCGCTTGGTTTTTGGA 

GTTGCGGATATTTTTAGATGCCAAGTGGATCTACTTTTGGGTAAGTATTAGTCCGAATATTACGTAACATTGATATATAATCACGAATTCTTACA 

GGGGACACGAAGGTATTACTTGACCAATGTACACGTACCAACTTGGATTACGTTCTCACCACAACAAAATCCGTAGTGGTCAACAAATCAGAGAT 

TCGAATCCAACGCAAAGGAAAGCGGGTGATCACCAAGAAATCTAAAGTAACAGTATCAAAACGTCCCAGACTGCAGGAATCGGATCTATTGGATG 

AAT T CT C AC AT GAAT AC T AC G AG AAGAT GC T ATC C G AAT GC C AG AT GT GG C AAAC G GAG T GC AC AC AAC AGG T GGT GG AGG AC C T G AAC GAGGT T 

AGTCGTCAATATACTGTTGGACTGCTTCCTTATATGATTTCATTTTTCGCAAAAAGTCTATTGAACAACCCCGGAGTTGCACTCAGTCAAAGTCG 

TATCATTCCATTACAATCACCGAAGAGATTGAACTTCCCGAAGATCATAGCTTAATTATGCCATCAAATGGTTTTGGCGAAGCCGAGGGCGCGGA 

CTTAACCATTTTTCAAGAGCTTTACCCCAAAGATGGCACACGGAACTCACGTAAGAATTCTCAATTCAATTGAAAATGTAATAAACTATTTTTTT 

GTATTATAGTAAAGCGACACTCAATAGCTCAGGATCCAACTGACATCTTACCGGATAAAATGCCCAGATTGGATGATTGCGATGTTTTCCAAGCT 

GACAATGCTATCATGACCCAAATCTTTCCGCATAACATTGAACCAGCTGAAGTAACTCAGATAGTTTGTGAACCGTCGTTGCCATCCGTTTTAAA 

TTCTTTTGGAGAAATAACATTTAGCCACCCGAAAAATCGGAAAAGAAAGCTAATAGTAGACAAACGAATTGAATACACCCGGGAGCAGCTGGTGA 

AGCATAGACAGAAATATATGGAAGAATACCTTTCAAGGAATGTGATAGTGCCAAAATCCTCGGATTTACGGAAACCCCCTAAGGAACTTCTCTGT 

AAACTGTATAATAAGTAGGAGCCTTATACATTTTGCTATTTAAAACTTTTTAATTTTTTGATGTAGTGTAAGTTTTTTAGCTCTTCATAACCACT 

CCGGACCTAAGCTCAGTAATGAAGAAAAGGAATTCGAGGCAGAAAACACATTGAGAACCATATTTGGCTGCGAGTTTACCGAGAATCTTTCTAAA 

GAGATATTTGTGCGACCATCGCAGGCTACAAAATGTAGGGATAAAGCGGCACATATCTATCAGCCCGAACCGCTTGAAGTGGAAGACTGTGTGCA 

ACCTCAACTTCAGCCACGTGATGATGTAAATCAGAATTATCATAATGAAATGAATGAGCACGGCATATACCGTGAGGATAATCATGATGGTGAGT 

TCAGCCAATGTTTACAACATGCTATCGTAAATAAAATATAATCTCATTGCAGATACCTACAGCGTAATGATGAGTCTCTTAAGCATTTGGCGTAA 

TAATCCGAAAATCACGGGAATCGATGCCATTGATTTTATAAAAACGTTTGATAGTCGCATCAAGGCTTCGTTAGCCTTTCTCCATTTGTTGTGTG 

AGTTCGAAAAAGAACAAAATGTGTGTTGGAAATAACGTATAGCTTAAATTTCAGATCTTGTTCGAGACCATTTTATAGAGATATCAAAGCGAGCT 

AATTCGCTGGAAATGTACCAAATTACTCTTGGCAAAGAGTCCGCTAAGCTAATAGACAATCTTATGCTGAGTGAGACTCTGTGAATATTTTTAGA 

TAATTTTTTTCAAGATTACCAAACAACTTTGTAGTTGACAATCTTCCACATTTTTGTTGTTGTTTTGTTTTCTGTTATTGCTATTGTGTTATAAC 

ATATAAGTTGAGCCAATCTCTAACCTAGATGGCTTTAATCTACTCTTTTTCCATACAATCTTATGGTACAATTGAGCAACGATGTCCTGGATAAT 

TCGTTTCATTTCAAGAATTGAAAAATTCTATAGATGTTACTTGTTGCTGTCTATAAAAATAGTTTTTTTTTCATTATCATATTAGTTTCGTGTAC 

CCATTAAAAGTTGTTTGTTATCTTTGTTACTATTACGTTATCGTAGCTATAATTCATTAACATTTGGAGTAATTCTCTAAACCGTAATTGTTGTC 

TTTGTTACAAAATCTCAATTTCTCTGACAACACCAAAAAATAAACAAGTTACAAAAACAAAGGAATACAGAGTGTTTGGCACAATTGCTTAAATT 

CCGATCTGCTGGGCCAGACCCGAATAGATGTGCAGCGTGCAGAATCCAATGACGGCCGCCACCAGCGCTTTAACAGCCACCGTTAGCCAAGGTCC 

AATGGTCAGAATGTGGAGCATGCCCTCCAGCCAGACGCCTTTGAAGACAGTCACCGCCAGAAGTAGACTAACCAGCGGCTTTAATGCCTTGTTCA 

AATCGTGACGGGTGAAGAGCCAGATCAGGGTGGCCGTAGTGATGTGCTGCACCAGAAGCACGTTCGATTCCAGACACTTGAGGATGTAGATCCAA 

CTGAACTCGGTGCCTCGGGCACCCACCCACAGCATGATACCTCGGGAAAGAATCACTTCGGCGGTGGCCCCTGCGGAAGTATACG 

(SEQ ID NO: 451) 



Exon: 1001.. 1222 

Exon: 1302.. 1465 

Exon: 1522.. 1802 

Exon: 1862.. 2045 

Exon: 2100 . .2484 

Exon: 2537. .2844 

Exon: 2903.. 3075 

Start ATG: 1092 



Transcript No. : CT13940 

AAAAAAGTACCGTTACTTTAGGGTTGTTACAGAGAGCACTAACTTATAAATAGTTAATACTATAATCTATGTGCTTAATGTCGTAGTTACAATGA 
GCTTGAATTTATATGAGAGCAACATCCTTAATTCCTGCTGGAAAGCGGGAGGGCGCAAACCGAGGAAATCGGCATTTGGCCAACTTAATCGAAAT 
GAGATCGCTGATGTGGATGTAGTCGCCTGCTGCAAGCAAATCACCGAATTAATTGAGGAAAATGCATTAAGAAAACGCCAGAAACGATCCAATGT 
TTTGGCCGCAAGAAACCGGCAAAATGTTATTTTCAAGGACATTTCCCGCTTGGTTTTTGGAGTTGCGGATATTTTTAGATGCCAAGTGGATCTAC 
TTTTGGGGGACACGAAGGTATTACTTGACCAATGTACACGTACCAACTTGGATTACGTTCTCACCACAACAAAATCCGTAGTGGTCAACAAATCA 
GAGATTCGAATCCAACGCAAAGGAAAGCGGGTGATCACCAAGAAATCTAAAGTAACAGTATCAAAACGTCCCAGACTGCAGGAATCGGATCTATT 
GGATGAATTCTCACATGAATACTACGAGAAGATGCTATCCGAATGCCAGATGTGGCAAACGGAGTGCACACAACAGGTGGTGGAGGACCTGAACG 
AGTCTATTGAACAACCCCGGAGTTGCACTCAGTCAAAGTCGTATCATTCCATTACAATCACCGAAGAGATTGAACTTCCCGAAGATCATAGCTTA 
ATTATGCCATCAAATGGTTTTGGCGAAGCCGAGGGCGCGGACTTAACCATTTTTCAAGAGCTTTACCCCAAAGATGGCACACGGAACTCACTAAA 
GCGACACTCAATAGCTCAGGATCCAACTGACATCTTACCGGATAAAATGCCCAGATTGGATGATTGCGATGTTTTCCAAGCTGACAATGCTATCA 
TGACCCAAATCTTTCCGCATAACATTGAACCAGCTGAAGTAACTCAGATAGTTTGTGAACCGTCGTTGCCATCCGTTTTAAATTCTTTTGGAGAA 
ATAACATTTAGCCACCCGAAAAATCGGAAAAGAAAGCTAATAGTAGACAAACGAATTGAATACACCCGGGAGCAGCTGGTGAAGCATAGACAGAA 
ATATATGGAAGAATACCTTTCAAGGAATGTGATAGTGCCAAAATCCTCGGATTTACGGAAACCCCCTAAGGAACTTCTCTGTAAACTGTATAATA 
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ATGTAAGTTTTTTAGCTCTTCATAACCACTCCGGACCTAAGCTCAGTAATGAAGAAAAGGAATTCGAGGCAGAAAACACATTGAGAACCATATTT 
GGCTGCGAGTTTACCGAGAATCTTTCTAAAGAGATATTTGTGCGACCATCGCAGGCTACAAAATGTAGGGATAAAGCGGCACATATCTATCAGCC 
CGAACCGCTTGAAGTGGAAGACTGTGTGCAACCTCAACTTCAGCCACGTGATGATGTAAATCAGAATTATCATAATGAAATGAATGAGCACGGCA 
TATACCGTGAGGATAATCATGATGATACCTACAGCGTAATGATGAGTCTCTTAAGCATTTGGCGTAATAATCCGAAAATCACGGGAATCGATGCC 
ATTGATTTTATAAAAACGTTTGATAGTCGCATCAAGGCTTCGTTAGCCTTTCTCCATTTGTTGTGTGAGTTCGAAAAAGAACAAAATGTGTGTTG 

GAAATAA 

(SEQ ID NO: 452) 
Start ATG: 92 

MSLKLYESNILNSCWKAGGRKPRKSAFGQLNRNEIADVDVVACCKQITELIEENALRKRQKRSNVLAARNRQNVIFKDISRLVFGVADIFRCQVD 
LLLGDTKVLLDQCTRTNLDYVLTTTKSVVVNKSEIRIQRKGKRVITKKSKVTVSKRPRLQESDLLDEFSHEYYEKMLSECQMWQTECTQQVVEDL 
NESIEQPRSCTQSKSYHSITITEEIELPEDHSLIMPSNGFGEAEGADLTIFQELYPKDGTRNSLKRHSIAQDPTDILPDKMPRLDDCDVFQADNA 
IMTQIFPHNIEPAEVTQIVCEPSLPSVLNSFGEITFSHPKNRKRKLIVDKRIEYTREQLVKHRQKYMEEYLSRNVIVPKSSDLRKPPKELLCKLY 
NNVSFLALHNHSGPKLSNEEKEFEAENTLRTIFGCEFTENLSKEIFVRPSQATKCRDKAAHIYQPEPLEVEDCVQPQLQPRDDVNQNYHNEMNEH 
GIYREDNHDDTYSVMMSLLSIWRNNPKITGIDAIDFIKTFDSRIKASLAFLHLLCEFEKEQNVCWK* 
{SEQ ID NO: 453) 

Classification : known_f lybase_gene 
Gene Symbol: BG: DS02740 . 10 
FlyBase ID: FBgn0028525 



Celera Sequence No. : 142000013384828 

AAG AC T C C G AAAAT CCCTTTTGATCTCT G AC AAC T AAAAAAT GT AAAAC T AT AT CC AC AAAT T GGTG T TT T AAG CC T AAC AAAT AC T AAAAAATC 

TATTTTGCATCGGCCGGGAATCGAACCCGGGCCGCCCGCGTGGCAGGCGAGCATTCTACCACTGAACCACCGATGCTTATGACTCTTTTGAAACA 

ATTCTGCTGAAGATTCTACAGAAGTCTCGTTCTTATGATCATTTGGGAACTAAGAGTATCTGATATCCCTCCCCTCATCTGACTCCGTGGCGCAA 

CGGTAGCGCGTCCGACTCCAGATCGGAAGGTTGCGTGTTCAAATCACGTCGGGGTCAATGCATTCGTTTTTTTGTCCACATTTCCGATGGAAACC 

T AAC GT T G AAT G GT T TG ACCT T GACAAC AAGT G GGT CT AC AAAAAAT AAAT GTAGT T AAG AT AT G C AT GC GAGAG AT GT T T T TC AT TTTTTATTT 

T AT AGT AT C C G AT AAT AT AGAT AT ATT T GAAT T GT AC AT T AAT AT GC GT GT AC C T AGAAT C T GCGC AT T AG AAAC AAT T T G AGT GT T AAAT AT AC 

TCTTAAAACTAAATACATAACAAAATAATAAACATTTCAAGTTTTGCGATTGCTGATTTGTTTATGATTGGGCTTCTACTCCCCCACAAGCTCCC 

CTGTCAATTTCCTGTCCATACAATCCGTCTCCTTTGCTAATTATTTGCTATAACTGTGTGTATCTGTTGCTCCACGTTTTCGTTGGTAGTTGTGG 

TTCCTATTGTACGACAATTTAAATCACTCGAATTTAACTGAATCTCTGGTTTTAAAAGCGTCCGGTTCAACCCCCACCCCCCTCGTCAAAAATGC 

TACGTAATCTCAATTTAAAAACGGTAAAATGTTCGCATGTATGGAGTTACACTAAAGTTACTTGATAAGGAAAATCATAAATTACAACTCATCTT 

GAATTTGCTCAATAATAATAAGTTTGTTTCTCATTTGATCTGTCTGTCGCCTAAGGATCAGCTTCGGCTACGACATCGGCGGAAGCAGCAGGGGC 

TATGGGTTCCTCCGCGGATGGAGCCTCCGTTTGGAGTTCAGCCGGCGGCGCTGGAACCGATTCCTCATGGGGCGCAGCCTCGGTTTCAATGACTG 

GCGGCGCCGGCCGGGGCGGAGAACCTTGAGGTTCTTTCGGGGGCTGAGTTGGAGTCTGTGGAACTGCTGGCGGTGGCAGCTCGTCATAAAGGGGC 

GTGCAGGCGGCTGAAGTTTCTGTTTGTGATTTCGCAACCAAAGGATCGCCGGTCACGTCGGCTGGCTTTGCGAAGTTTTCATTCTCCGCCGGCAG 

CTTTTCTTGCGGAGGACTGTTAGCAGACTGAGATGCTCGTTCGCCTGGCTTCTGGGCCTCCATTCTCGCCTCATAGTCTTGAAGTTCCTGATCCC 

AATCCTCCTCGGTGTCCACTCTAGTGGCCGCAGCCGGCGCACCGGTGTTGGTGCTACCTAATTCGGTTTCAACGTTTTTGGGCTTGGGTTCCGTC 

TCCGAAGCCGCCTTGTTGCCTTCATCCGTCGAACTTATTGAGTGGCGGTTGCTGTCATCTCGCGATCCCCGCCTGTTTCCGAATCCCGGACGTTC 

ACGATCCTCTCGCGGTCCTCCGCGTCCGCCCCGTGAATTGTTGCCACGACGGTCGTCCATCATTTCGCCTCCACGCTCGCCACGGAATCTATTAC 

CGCCTGGCCCACCGGGACCACCACGCCGCTTGAAGTTGTTTCCGCCCTCGTCCTCATCGTCGCTCCAACGTCCGCGACCTCCAGGACCGCCACCT 

CCTCCTCCGCCCACGCCGCGAACACCGCCTCCACTTTGACCTCCACGTCCTCGTTGGTTGTCATTGAAGGGATTCCGAGGATTAAAGAAACCTGA 

AACGAACGTTAGCATAAATTGCACAAATTAGTTTACATAAAATGTTCCTTATATAGTTGCAATAAAAAAAAAATTAATAAAGTTAAATTAATTAT 

TATTTATATACACCTGGTAAACATAATATTTACACGCATAAAAAAGATGCCACCAAATTCTATTATCGTTTTTTCTTCTTCTTTCGTCTTTGCTT 

CAAATTCCCACGATTCTCACCTGGTCCCTGCTGACGTCCGCCCGGACCACCGCCGCCGGTTCCAGCACCGCCGGGTCCACCACCGCTGGGTCCAG 

CACCACCTGGCCCACCTCCTTGGTTGCGCATAAACATGGGCCCAACGCCGTTGCCACCGTTACCAGCTGGTCCACCCATGCGCTGGTTGAAAGAA 

GCCGGCGGCGGGATGCGCATCTGCATCAAGCTGGTGGGCTGATTGAACCGATTGTGATTCATGTTGGGCGGGAAAAAGTCGCCTCCGGGGCCTCC 

ACCGTTATGGTTTCCACGCGGTCCCATTCCTCGTGGTCCACCCGGTCCACCGCGATTATCAAAGTCTGCAATGCGTTTTAGTTTAGTGAGCTGCG 

AAGATTCGTACCATTTATGTCGACTCACCCATAAAGTCCGGTCTTGGCATACCGTGCTCAATGGGATTAAGGTTCAGACGCGCTCGTGCCTCGTT 

AAAGGCTGCGTTCGGTCCGAGTCCAGATCCCGGACCACCACCACCTCCTCCATTTTCGGCGCGCCAGCGTTCTGCCGCTTCAACGACATCATCTC 

GTCCGCCCCAGCGCGAGTTGCCCGGTCCCCTGGAGCGATCCCTATTATCCCGATCGCGCTCCCTGTCCCGCTGCTGGAACATCTCGTTGGCCATG 

GCCGCCGGCGATAGGTCCACATTGCTGGGCGACGGATTAAAGTTGGCTTGCTGTTGCGGCGGCAGAGGTGGGGCGTCTTCCAGGTCCATTTCAAT 

GTCCATTTGATCATCGCTCACGTTGCCACTATTGCTAATCTGCCCACCGCTCGGTATGGGTGGCGGCACAGCTCCAGGCGGAGGAAATGGAGCTC 

CCGGTGGCAAGGCCATAGGATGTGGTGGTCCCCCAATGCCTGCAGGAAAAAAAATTAAAATTAAAGCTTGAAGTTCAATTTTACGAGATTATTAT 

CTACCTGGAAAGCCAGGAGGCATCATTGTCGGTGGCATCATCATTGGAGGAGGCATGTTGGTCGTTGGCATCATCATCGGTGGCGGCATATTTGT 

TGGCGGCATCATTATGGGCGGTGGCATATTAATGCCCATGGGTGGGGCCATCGGAAATTGTCCCCTTACCATACCCATGAGCCCCGGAGGTGGGG 

CTCCCGGACCCGGTGGAGGACCCACTGGGCCGACTGGACCAACCGGTGGTGGTTGGGTTGTATCGATACCAAATATCAGATTTGGCGGTGGAACA 

CCAACACTTGGCGCACCAGCTGCCTCCGCAATGGATGCACCCTTGTGCTCCTTGGCAGCATTCTTGGCCTGGTTAATCTTCTGCTTCATTTGGAT 

GGGCATAGTATCTTCGTCGAACATACCACCCTCCTCCAGCGCGTCGAAGTCCGTATCCGAACTGAGTTTGGACCAAGGAATGTAAGTAACACCCA 

ATTCGAGGTCCCAGTAGTCCTTCCACTCCTTGCTTTTCACACCCTTGCCGGCGGCCCATGAAATGGTGATGGCTCGTCCCTGCAGCTTGTGGTTC 

TTCAGAGCTTGCATCGCCTTGTGGGCGTCTTGGCGTCGGTTCATTACAATAAAAGCGCATCCACGCGGCACAATCTGATCGATGCTCACTATATC 

GCCATATTCGCCAAAGGTATCGGACAGCTCCTCCTGGTAAACAAGCTTAGACAGGTGGCCCACCCACAGCGTAGTGCTGCATACTGAAAGCAAAA 

GCATTGGAATGCCGCATTAGTTTTTGGTGTCTGAACATGTTGTCAATATGCCCATTTGCCTTGTTGAGGTGTGAGTGCCACTGTTGGGATAGTGT 

TTAGAATTCCTTATCCTTAAGTATATTTACCAAGATTTAAACTATGATCTATGTACTTCATTAAGAAGCCAGTTCTTGGTAATTAAGCTTAAGTA 

TATATTAGTGTAGAACACAACGTTTGTTTGCATCCTTATGTATATGTATGTAAATCGGGTCTCAAAAAATATTGCTCGAAAACGCATATTATATA 

GGGGGATATTCTAAGATCAAGAACCACTAAAGCTAAACACACACACGTCAACCGAGCCAAAGGAGCTGCAGAGAAAAGGAAGTAAATCTTTAGGA 

TTATCAAAAACAAAAACAAAAAAAAAAAAGAAGGAAAAACAGGAAAGCACACAATGCAAATGGATCTTAAGAAGCACCATCT 

ATCCATTTATCATTCCCGTCATAGTTACCACTGAGATGCTCCTTTTTGATGTCGGGCAATCCCTTCTTGCGCCGCTCACGCTCGTGTTCCCGATC 

GCGTTCCTTGTCTCGATTGCTACGATCCATGCGATCCCGGTCCCTTGATCCGCGTCGCCGGCTGGACCGTGGTGATCGGCTGCGCGATCGGGAAC 
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GAGTGCCACGCCGACGTCGATCGTTGCCCGTGCCTCCTCCGGCGCCACGTCCTCGCGGAGATCTCGTCCGGCTGCGACTGTTCCGGCGACTGCGC 
TTGTAGCGATCACGGTCGGGCGTCGGGCTGCGGGAATCGCCACCATCCAGATTGATCACCTATCGGAGATGGGAGTGTTTGCAAGAAATTTGAGA 
GTGATACGTGATAAAAATTTTGTTAAGAATAAAAAAAAAAAGTTCAGTGGGATAGAGAGGATTCGGGATTAGTGGCTGGCCACAGGGCGAATACG 
TTATTATCGACTCAATAAACTGCATTCTTTGGCGTGTGTTGACGTGTGGTTCTGTTTCGCTTTCGACTTTCATTCATCTAGTTATTTCGGACTAA 
TTGGCAATTTGCGTTTTCGAGCAACATTTTTATTTACATATTAGGAACTTACCAAACGTTGTTCTAGCATCGGGTAACGAGTTGTTCCGCCGGAT 
ATTATGTGGATTTTAAAACCTTACCTTTGGCTTCATTTGCATATTTTTTAAATATTTTACTATTGGTTCGACTTATTCAAATATTGGCAATTGTT 
TTATAGTTCAAATATATGTTGGGGACATATAGATCGACTTATGAAACTGATTTAGGGTTTGCCCAAAAGCATCATCTTTTGATTTTAAACATTTT 
TTTTACAACAAAATAAAATTTAATTTTAAATGGATTGGGAATTGAATTGTTTGCATATTCAAGATATATAGTATTAAAGTTATTGATTAAATAGT 
GCAAGACCACCCCGTTACAACTAGTTGAATCAATTTGGATTCGAACACGCTAATATCCCATCAGCTATAACAACATATTGAACACATTTCCAGGA 
TTAACTGCACGTCAGCACAAGGCAAATAGTATCACCCGTTTTACATCTAATATGTTTATACTTTAGACCAAAATTAATGGCGAAGTCTTGGTGAA 
AAGCATACACAAGTCGGACCCCAGGAACTAAGATAGATTGGAAAAGTAGAAGGGTATACCTCTATGGTCTGTTGCTCCGATATGAACTCGACATC 
CTTGTTAAGGTCCATGCTGTCGTTCATGTTCATGCCCATGCCCATGCCCGGCGGCATACCAGCATTTCCCTGAAAGCAAATCATAATGAAAATAT 
GTACATCGTCATGAAATCATAAGATGTACCTTCATTACGTTCTGGAAATGCTGCTGCAGTGCCTCGTCCTGATAGCGATGCTTCTGGTTCTCCTC 
CTGCGGCTTCAAATTTTGGAAGTTCTGCAGCGTATGGAGCTGCCGCAGCACATTCGGATTGTTTAGCAGCTGCTTGATCTTCTTCTCCTGAAAGA 
GT T AT T T AAAAGGT T AT T C AAAAT GGG AT C G AT C T AAT GTG AGC AAC AAAAGC AAC AAAT AG AAAC T T AAG AT AAAC AAAACT AT CT AAAT G AAG 
AGATGTGTTAATGGGGTTCGCCATACGTCTAACAGGTTTGAAGAAGTGGGCGGTGAGTCGTCGCCCATACAGTGGTGATCATCTGCCATCAGCAT 
CTGCAGGCCTTCATCTTCTTCCTCGATGATCTCCCGCACCTGGTAGTCAATGTCGTGCGAGCTCTTGACATATTCCCGTTTGTGATGATGCGACT 
TGCTGGTCTTCGACGAGCACGAGACCACCGACTGTCGTTTGGAATAGTGTTGCTCCGAGTAGTGACGCTTGGAGCTGCTGCTGCCACCGCCGAAC 
TTCTCATGGCCCAGCTAGCGATTGGGCGAAGGGCAAAATGGGCGGTTAGTGGGCAAGACATGAGAGACAGAGAAATGTAAATGCAGCAGAATTGT 
AGACGATGTTAGTTAAGCCATTATCCAATACTAGACACCTGTAGAAGCCTCAACCGAAAGCATCACTTACCGACAAGTCGGGCATGGCACCGCCC 
ATCTTATCGTCGCCCCCGGAGCTGTTCATGCTTATCTCCATTCCGGAGCTGTTTAGACCGTTCGGACCGCTCGATATGTCCGCCAGGCTAAGAGC 
ACCGCTGCTGCTGGGTCCTGGACCCACGCCGCCGCCCTGTCCTCCACTGCCACCCACCGGAGGCATCTGGTGGTAGATTGGGTGGTTTGGGTCGG 
CCAGGTCGAAGATCGGCTGAATGACCTCGGACTTGAAGACGTTGTTTTTCTGCCACAGGTTCAGTACCCGGATGATGCGACTCTTGTCCTCCGGA 
GCGCACCGGAACAAGTTGGCAAAGGTCTCCGTGAGATTACGCTGGAATCGGGGCGCGAACAGATCCTTGTCCATACCGTACTGGTGGCGCGACTG 
GCGCACTATCGAGTCGATTACATACAGCCCGGGTACTTTGTACTCCGGCTTGCACTTCAAGATGAACTTCTCCACACTCTGGACGACGTGCTTAT 
ACAGCTTGATGGCCCGCATCGCCGACTTGGTGATGGCGGCCATCTTCGCTTTGGAAATGGGCGGTCGGCTGTCATAGAGTCCGGAGAGCTAAGGA 
GAAATATAGATGTTACAGTACGGCTATAGTTTACGGATATCTGGGACATGGAGTTATCGTTATCGGGTGCTGGCCAAAAAGCAACCCTGAGCGGT 

GC T GT T CG GT G C T CT GC CT GCGC C T C C AGC AGG AT C T GT AT ACC T CC C GC AC AT T GG AC AAT CC GAAT AC C G ACT T C AAGC GG AC AT TG T GC AAC 
CACGCCGGCTGCTCCATAACGAACCTCGTTTCTAATGCATCTATAGCTTACCTCATTGTTGAAGGCAACCACGGTTTCCATGTTCAATTGCTTGG 
TTCACTCACGGCACGCACACACAATATTTAGTTTTGCATACTTTGGTTGGTCAACACAATCGTTTAAAACTGGCGAAAAACAATTTTCTCGACCT 
TTCTGCGGTTTCCACGGCTTTTAAAAAAACTTTTTACAAAGCGTGAAAAAAATGGCTGCCCTAAAGCCTGAAGCGTTGCCAGACCTGTAAAGTTG 
ACATTTTCAAGGCATTTCTGGAAATTGTGCTGTTAATGGCAGTGTGACCGTATGTGGTGCATAACGTTTCGGTATTAAAAAATTTTCCAAATATT 
T AT AAAAAATAAAACGAAATAGAATTTTCATTATGGAATCCAAAAGTTTC AT ATAAAAAAAT AAT GAC AAC AAAAT AAAAATGACCTT AAAT AAA 
TTTAAATGAAATCAAATCTTCTTAAATGTACTACCTTTTAAAAATAATCGCTATCGGCGGTGTCGATCTTTTTAGCATTGTGTCGATGACAATAC 
ATCTGGCAACACTGGTCAATATTAAACATATATTAAACAAATTAAAATGAGCTCAAACCACCGTCCTCTGGACGATTCGGATATCCTCCTAATAC 
AAACCATCCGCGAGACGCCGTCGCTGTACGATCCCCAGCTGCCCTCCTTTCGGCTGTCGCAGCGCAAAGAGGAGGACTGGGCAAAAGTGGCGGAT 
TTGCTAAACATCTCCATCTCAGACGCTCGGCGACGGTGGACGTGTCTGCGGGATCGCTATTCGCGGGAGCTTAAGCAGAAGCGCCTGCATCCGTC 
CGGCGAGTTTGGTCACAATGACTTCTTTCGGAAGATGGACTTCCTGCGGGACTTTGTGCGCAAGAGACGCGAGCGCAGAGGACGCGAACGGGATC 
GGGAACAAAAGCCAACCGGGTGGATGAAGGTTGATTTACAGCGACGGCGTCGTACCCGCCTGCCTATTGACACAGAGACCTTGATCGAGGAGCAA 
GGTTCGCACGCCTACGATGAGGGCGAGGAGCAACACGACTACGACGCTAAGCTGGAGTCTCATACCACACAATCGGAGACGTATTCGGTGGTCGT 
GGAGGCGGACGATGGGCAGGAGCCCGAACAGGAGAGCTTCGATGAGTTCCTGGGGGATGCGGAGTGCGAGCAAAAGGTCAAGGTGGTCACAATTC 

ATCCGGAAATCGCTGCTCCGA 
(SEQ ID NO: 454) 

Exon: 7381.. 7177 
Exon: 6928.. 6341 
Exon: 5787. .5635 
Exon: 5579.-5475 
Exon: 4619.. 4304 
Exon: 3788.. 3045 
Exon: 2984. .2499 
Exon: 2440.. 2111 
Exon: 1896.. 1001 

Start ATG: 6883 (Reverse strand: CAT) 
Transcript No. : CT13962 

CTTTAGGGCAGCCATTTTTTTCACGCTTTGTAAAAAGTTTTTTTAAAAGCCGTGGAAACCGCAGAAAGGTCGAGAAAATTGTTTTTCGCCAGTTT 
TAAACGATTGTGTTGACCAACCAAAGTATGCAAAACTAAATATTGTGTGTGCGTGCCGTGAGTGAACCAAGCAATTGAACATGGAAACCGTGGTT 
GCCTTCAACAATGAGCTCTCCGGACTCTATGACAGCCGACCGCCCATTTCCAAAGCGAAGATGGCCGCCATCACCAAGTCGGCGATGCGGGCCAT 
CAAGCTGTATAAGCACGTCGTCCAGAGTGTGGAGAAGTTCATCTTGAAGTGCAAGCCGGAGTACAAAGTACCCGGGCTGTATGTAATCGACTCGA 
TAGTGCGCCAGTCGCGCCACCAGTACGGTATGGACAAGGATCTGTTCGCGCCCCGATTCCAGCGTAATCTCACGGAGACCTTTGCCAACTTGTTC 
CGGTGCGCTCCGGAGGACAAGAGTCGCATCATCCGGGTACTGAACCTGTGGCAGAAAAACAACGTCTTCAAGTCCGAGGTCATTCAGCCGATCTT 
CGACCTGGCCGACCCAAACCACCCAATCTACCACCAGATGCCTCCGGTGGGTGGCAGTGGAGGACAGGGCGGCGGCGTGGGTCCAGGACCCAGCA 
GCAGCGGTGCTCTTAGCCTGGCGGACATATCGAGCGGTCCGAACGGTCTAAACAGCTCCGGAATGGAGATAAGCATGAACAGCTCCGGGGGCGAC 
G AT AAGAT GG GC G GT GCC AT GC C C G AC T T GT C G G AG AAGAAG AT C AAGC AGC TGC T AAAC AAT CC GAAT GT G C T G C GGC AG C T C C AT ACG C T GC A 
GAACTTCCAAAATTTGAAGCCGCAGGAGGAGAACCAGAAGCATCGCTATCAGGACGAGGCACTGCAGCAGCATTTCCAGAACGTAATGAAGGGAA 
ATGCTGGTATGCCGCCGGGCATGGGCATGGGCATGAACATGAACGACAGCATGGACCTTAACAAGGATGTCGAGTTCATATCGGAGCAACAGACC 
ATAGAGGTGATCAATCTGGATGGTGGCGATTCCCGCAGCCCGACGCCCGACCGTGATCGCTACAAGCGCAGTCGCCGGAACAGTCGCAGCCGGAC 
GAGATCTCCGCGAGGACGTGGCGCCGGAGGAGGCACGGGCAACGATCGACGTCGGCGTGGCACTCGTTCCCGATCGCGCAGCCGATCACCACGGT 
CCAGCCGGCGACGCGGATCAAGGGACCGGGATCGCATGGATCGTAGCAATCGAGACAAGGAACGCGATCGGGAACACGAGCGTGAGCGGCGCAAG 
AAGGGATTGCCCGACATCAAAAAGGAGCATCTCAGTGTATGCAGCACTACGCTGTGGGTGGGCCACCTGTCTAAGCTTGTTTACCAGGAGGAGCT 
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GTCCGATACCTTTGGCGAATATGGCGATATAGTGAGCATCGATCAGATTGTGCCGCGTGGATGCGCTTTTATTGTAATGAACCGACGCCAAGACG 

CCCACAAGGCGATGCAAGCTCTGAAGAACCACAAGCTGCAGGGACGAGCCATCACCATTTCATGGGCCGCCGGCAAGGGTGTGAAAAGCAAGGAG 

TGGAAGGACTACTGGGACCTCGAATTGGGTGTTACTTACATTCCTTGGTCCAAACTCAGTTCGGATACGGACTTCGACGCGCTGGAGGAGGGTGG 

TATGTTCGACGAAGATACTATGCCCATCCAAATGAAGCAGAAGATTAACCAGGCCAAGAATGCTGCCAAGGAGCACAAGGGTGCATCCATTGCGG 

AGGCAGCTGGTGCGCCAAGTGTTGGTGTTCCACCGCCAAATCTGATATTTGGTATCGATACAACCCAACCACCACCGGTTGGTCCAGTCGGCCCA 

GTGGGTCCTCCACCGGGTCCGGGAGCCCCACCTCCGGGGCTCATGGGTATGGTAAGGGGACAATTTCCGATGGCCCCACCCATGGGCATTAATAT 

GCCACCGCCCATAATGATGCCGCCAACAAATATGCCGCCACCGATGATGATGCCAACGACCAACATGCCTCCTCCAATGATGATGCCACCGACAA 

TGATGCCTCCTGGCTTTCCAGGCATTGGGGGACCACCACATCCTATGGCCTTGCCACCGGGAGCTCCATTTCCTCCGCCTGGAGCTGTGCCGCCA 

CCCATACCGAGCGGTGGGCAGATTAGCAATAGTGGCAACGTGAGCGATGATCAAATGGACATTGAAATGGACCTGGAAGACGCCCCACCTCTGCC 

GCCGCAACAGCAAGCCAACTTTAATCCGTCGCCCAGCAATGTGGACCTATCGCCGGCGGCCATGGCCAACGAGATGTTCCAGCAGCGGGACAGGG 

AGCGCGATCGGGATAATAGGGATCGCTCCAGGGGACCGGGCAACTCGCGCTGGGGCGGACGAGATGATGTCGTTGAAGCGGCAGAACGCTGGCGC 

GCCGAAAATGGAGGAGGTGGTGGTGGTCCGGGATCTGGACTCGGACCGAACGCAGCCTTTAACGAGGCACGAGCGCGTCTGAACCTTAATCCCAT 

TGAGCACGGTATGCCAAGACCGGACTTTATGGACTTTGATAATCGCGGTGGACCGGGTGGACCACGAGGAATGGGACCGCGTGGAAACCATAACG 

GT G G AG GC C C C G GAG GC G ACT TTTTCCCGCC C AAC AT G AAT C AC AATC GG T T C AAT C AGC CC ACC AG CT T GAT GC AG AT GC GC AT CC C GC C GC C G 

GCTTCTTTCAACCAGCGCATGGGTGGACCAGCTGGTAACGGTGGCAACGGCGTTGGGCCCATGTTTATGCGCAACCAAGGAGGTGGGCCAGGTGG 

TGCTGGACCCAGCGGTGGTGGACCCGGCGGTGCTGGAACCGGCGGCGGTGGTCCGGGCGGACGTCAGCAGGGACCAGGTTTCTTTAATCCTCGGA 

ATCCCTTCAATGACAACCAACGAGGACGTGGAGGTCAAAGTGGAGGCGGTGTTCGCGGCGTGGGCGGAGGAGGAGGTGGCGGTCCTGGAGGTCGC 

GGACGTTGGAGCGACGATGAGGACGAGGGCGGAAACAACTTCAAGCGGCGTGGTGGTCCCGGTGGGCCAGGCGGTAATAGATTCCGTGGCGAGCG 

TGGAGGCGAAATGATGGACGACCGTCGTGGCAACAATTCACGGGGCGGACGCGGAGGACCGCGAGAGGATCGTGAACGTCCGGGATTCGGAAACA 

GGCGGGGATCGCGAGATGACAGCAACCGCCACTCAATAAGTTCGACGGATGAAGGCAACAAGGCGGCTTCGGAGACGGAACCCAAGCCCAAAAAC 

GTTGAAACCGAATTAGGTAGCACCAACACCGGTGCGCCGGCTGCGGCCACTAGAGTGGACACCGAGGAGGATTGGGATCAGGAACTTCAAGACTA 

TGAGGCGAGAATGGAGGCCCAGAAGCCAGGCGAACGAGCATCTCAGTCTGCTAACAGTCCTCCGCAAGAAAAGCTGCCGGCGGAGAATGAAAACT 

TCGCAAAGCCAGCCGACGTGACCGGCGATCCTTTGGTTGCGAAATCACAAACAGAAACTTCAGCCGCCTGCACGCCCCTTTATGACGAGCTGCCA 

CCGCCAGCAGTTCCACAGACTCCAACTCAGCCCCCGAAAGAACCTCAAGGTTCTCCGCCCCGGCCGGCGCCGCCAGTCATTGAAACCGAGGCTGC 

GCCCCATGAGGAATCGGTTCCAGCGCCGCCGGCTGAACTCCAAACGGAGGCTCCATCCGCGGAGGAACCCATAGCCCCTGCTGCTTCCGCCGATG 

TCGTAGCCGAAGCTGATCCTTAG 

(SEQ ID NO: 455) 

Start ATG: 251 {Reverse strand: CAT) 

MAAI TKSAMRAI KLYKHVVQS VEKFI LKCKPE YKVPGLYVI DS I VRQSRHQ YGMDKDLFAPRFQRWLTETFANLFRCAPEDKSRI I RVLNLWQKN 
NVFKSEVIQPIFDLADPNHPIYHQMPPVGGSGGQGGGVGPGPSSSGALSLADISSGPNGLNSSGMEISMNSSGGDDKMGGAMPDLSEKKIKQLLN 
NPNVLRQLHTLQNFQNLKPQEEKQKHRYQDEALQQHFQNVMKGNAGMPPGMGMGMNMNDSMDLNKDVEFISEQQTIEVINLDGGDSRSPTPDRDR 
YKRSRRNSRSRTRSPRGRGAGGGTGNDRRRRGTRSRSRSRSPRSSRRRGSRDRDRMDRSNRDKERDREHERERRKKGLPDIKKEHLSVCSTTLWV 
GHLSKLVYQEELSDTFGEYGDIVSIDQIVPRGCAFIWNRRQDAHKAMQALKNHKLQGRAITISWAAGKGVKSKEWKDYWDLELGVTYIPWSKLS 
SDTDFDALEEGGMFDEDTMPIQMKQKINQAKNAAKEHKGASIAEAAGAPSVGVPPPNLIFGIDTTQPPPVGPVGPVGPPPGPGAPPPGLMGMVRG 
QFPMAPPMGINMPPPIMMPPTNMPPPMMMPTTNMPPPMMMP 

IEMDLEDAPPLPPQQQANFNPSPSNVDLSPAAMANEMFQQRDRERDRDNRDRSRGPGNSRWGGRDDVVEAAERWRAENGGGGGGPGSGLGPNAAF 
NEARARLNLNPIEHGMPRPDFMDFDNRGGPGGPRGMGPRGNHNGGGPGGDFFPPNMNHNRFNQPTSLMQMRIPPPASFNQRMGGPAGNGGNGVGP 
MFMRNQGGGPGGAGPSGGGPGGAGTGGGGPGGRQQGPGFFNPRNPFNDNQRGRGGQSGGGVRGVGGGGGGGPGGRGRWSDDEDEGGNNFKRRGGP 
GGPGGNRFRGERGGEMMDDRRGNKSRGGRGGPREDRERPGFGNRRGSRDDSNRHSISSTDEGNKAASETEPKPKKVETELGSTNTGAPAAATRVD 
TEEDWDQELQDYEARMEAQKPGERASQSANSPPQEKLPAENENFAKPADVTGDPLVAKSQTETSAACTPLYDELPPPAVPQTPTQPPKEPQGSPP 
RPAPPVIETEAAPHEESVPAPPAELQTEAPSAEEPIAPAASADVVAEADP* 
(SEQ ID WO: 456) 

Classification: RNA_binding 



Celera Sequence No. : 142000013384826 

GTTTTCCCGGCTCTGTTTTCACAATGGGCTTGAGTGGCTTCACCTCCGGCATGACAGCGTCTATTAGAGAGAGTCCTTCAATGGGTCGCGCCACC 
TCGCCGTCTTCGCCTACGCTCCACTCCACTTCAGTGTACAGAAGACCCTTCTGCAGGATGGTGAGCAAAGCGGCAGGTGGAACCAATGCGCCATT 
GATATTACTTTGCGAGATGTGTGACTCAATGCCAAACACGTAGGCCGAGTGCAGAAAGCCTAGAAAACGGGATTGCATGAGATTAGGGCGGGCTG 
TTATCTTGCGATTGGAGGGTTTTACTCACCCGACTCCTGCAGGTATCGGTAAACCAGAAAGTTCACCTCGTCGCTGGAAAAACTCATGACGGAAC 
GCGGTTGGCTCCTTCTCCTACTACTAGCGCACTCCAAAATCCACTCCTATCTCCAATGCCAATTATGTTTTTACGGTGCGAAAATCGGAACTGTG 
CTGCGTAACACATGCGTCTGCATTCGGCACGCTTATCACTTCGGCGCAAGCCACGGTATCCGGTGCTACTCCTGGTACTTGGATCGATCTGGTCG 
GTCCTGTGTTTTCAATTCTGCCAGTTTGCTGCTTGCCGCAAGAGCACTTCACATATTTTTTCCACAAATAAAACCGAGGGATCGAAAACGCGTTT 
CTGAAACGCGCAAAAAAATGGCGGAGTAATCTCAAGGTGTGCGCTGCGAGAGGTTGCGCAGCCTTGTGTTTTGTGACGTCACAAGTTTTGACAAC 
AACTTAATCAAATTTGAGTCATATTTTTGTATTGCACCAAGCGTTACCTTAAAGGATAGAACGGAAATTTATTAGCCATAAAAACAATTAAGAAG 
ATATTTTTGGATTTCTGCTTGGAGCAAAAGGTAAAAAATTCTTAAAGCGAATTGTCGTCTCTCTTCGTCCATCCCTGTGTACCCATCCCTCGAAG 
TAATCATTGTATTTCAACCCAACGCTCTACTGGTCACACCAATCGTTCCACCCGCGCCCGCTAGATGGCTCCACTCTTAGGCAGCGAGAGCGAAG 
AAAAATCAATAGAAATTTTCATTTTAGTGAATTGCCCTGCACTTGGTAATAGTTAACAAACAAATTAGCGGCAAATCAGCGAGGCGAAGCTCCAG 
CGAGTTGGAGTTCCATAAAGTGCAACACGCAGGGTCAAACATTCCGCAGCCAACACCCCGGATTCCGATTCCAAGCGAAGCCAACTTCAGAAGTC 
AGAGATTACTCTCTCTGCTCCGCCACAGAAATTCCGCAACGCTAACCGATTTTGTCCGCTGGCGAGTAAAAAAAATCGGAAAAGTAAGCTCAAAA 
AAAAAGAACAGCTAACATACGCCCATTAATGTTTTTGGAGGCTTCTTCGGTGCTTTTTTTTTTGGGCAAAACGCCCTCGTCATCCAGCGGTGAAA 
AAAGTGACGTGACATGAAAGCAAAGAAAGAAGAGAACCGAAGAGCGCAGTGAAAAACAGAAAATAAAACGAAACGGAAAG 

ACTACAGAAACAAAAGCCGGTGGTCGCGCAAATCCAAATAATAATGCCCAAGTGCACATCGACTTCGCCAGGCGATAAAAAGTGGGCGAGCGAGG 
TCACAGAATCTTTATCAACGGGCAAAACGCTGCC7VAGTTGAAAATAAATCGGTTGCCAAAAATGCGGGGGTACACTCTACTTCCACCAGAAAATG 
TCAAAGTTCGCCAGCCATTGGGTGCACTGGCCGGAAAATCAATCACGTAGTCCACTATTTACCGTTTAAAAGTTCTGAGTTTATTAAACATTGGA 
TTTTTTATACATATAAATCGTTATCAGCGCTTGATGCTCAGTGAAATGTGTGGTGCATACATATTTTGTGGCGTTTTCGTTATCAAAAGTAAAAC 
TGCAATACGGACTTTCAAAATAGAAGCTGTATCAATGTGTTGGCATGCACAATAAGCTACTCCCTGGCTAAATACAAATTGTTATCCAAAATACC 
TTTTGTTTGAATAATATACGTTTGTTAACACAAATACTAAAACGTAATCATAATACTAAGTTTTCCCATTCCCACAACCCACAATAAATAGAGTA 
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TCGTTTCCCCTTTCAGGTCCCAAGGAACGGCCGAGATCCAATCGAAGTAAAAAGAGCTTTGCAGAACCAGAGCGGAGAGCAACTATATATCTTAG 
GGTCCGACGTCGAGTTCGAGCGATAAGAACGCCTCGAGACTCGGCCGCCATCCATTGACGATTTTCTTGCGCCCCCAGTTTGCATAAAACCAGCG 
AAAATTAACAAGCTAACTACCATGGCGCCGGTGCGGGGGGATGGCATGAGAGGCCTGGCCGTCTTTATATCGGATATAAGGAACTGTAAGTGCAG 
ATCTCTTGGGTAACCAGCATTAAATCCAATTGAGCGGGTCGCAACAAGAGATTTTCCATAGGGGCGGAATCAGCGCCATTACTTGATCAATTACC 
AGGAGCAGCGCACTGACAAAAACGTGGGGCGGTACAAAAACGCAGGAACAGTAAGCTCTCTGCCAGATGAATTACAAAATATTGGCTACTTTTCT 
TCTCCATTCCTATGTAAATGTTTTCGTAGTCGTTACATTAGAAAGATATGAGAAAGTTATATATATAGTTATATATATGGGTACAACACAGCAAA 
TAGTTATATCAATGTGAATCGGTTGCCTCTCAGTATTAATAAATACTTACAATTTTGATAGTTTGATGACTTACTGTAGTACTTCGAAGCGTGCA 
TATGTGTGTGCGCAAGAGTCCGACGCTTCACTGTATGGTATTATTATCACAGCTCGTTGTTTGCTGTATGCTCGGTTACTAGTTTCTGGTTTGTG 
GACAACATATAGGAGAGCAGAGGTTCCGCTGCGGATCGGTGACTGGTACTAATTGACGCCTATGTGAATCGGAGAAAATCCGCGACATGGAGAGA 
GCTGAAGGGTGTGAAGTCGTCTCTCCGATCGATCAATCGCGGGGTGGAAAGGAAAGCGAAGCTTCCGCAGACGAATCACTGTTGTTTTACTGTGA 
ATCTCCCTTATACCCGCATAAGACACAAATTTCTATACTAATATTCACCCATTTGCCCACAGGCAAAAGCAAAGAGGCCGAAGTCAAACGCATAA 
ACAAGGAGTTGGCCAATATACGAAGCAAATTTAAAGGAGACAAAACCCTCGACGGCTATCAGAAAAAGAAATATGTCTGTAAGCTGTTATTCATA 

TTTCTCCTCGGC C ATG AC AT C G AC T T CG GGC AC AT GGAGGCGGT C AAT TTGCTGTCCTC C AAC AAAT AC T C AGAAAAGC AG AT T GT G AGT AC C C C 
AAAAAATTGCTTCAATAATCATTTGTTTATTTAGCGAGTCGAAATAATTGAACTGAAAGCAGACGACTCTGCTTGTGGGAAAAGGGAATTTCATT 
CAATTAAATTTAACTTGTCGTTGTTTGAAGTTTGATAAAAATAATTAAAAAATTGGTTTTTTAAATTTAATGTTACTGATAAAATCTGTTTTTTG 
AACTTTAGGGTTACCTTTTCATCTCAGTGTTGGTGAACACCAACAGCGACCTTATTCGGCTGATCATCCAATCTATAAAGAATGATTTGCAGTCG 
CGTAATCCTGTGCACGTTAATCTGGCACTACAATGCATTGCCAACATTGGAAGTCGGGATATGGCCGAGTCCTTTTCCAACGAAATACCCAAGCT 
ACTCGTCTCGGGCGACACCATGGACGTGGTGAAGCAGTCGGCGGCGCTATGCCTGCTCCGCCTGTTCCGCTCCTCGCCGGACATCATTCCCGGTG 
GTGAGTGGACTTCGCGCATTATCCATTTGCTCAATGATCAGCACATGGGAGTGGTCACAGCGGCCACCTCGCTGATCGACGCCTTGGTGAAGCGC 
AATCCGGACGAGTACAAGGGTTGTGTTAATTTGGCTGTTTCACGCTTATCCCGGATCGTAACAGCCAGCTACACGGATCTTCAGGTACGATTTAA 
TGCATTTTTTAAGGTAAGAAACTTCTAATTACTAACTAAATTCGCATTGCAGGATTACACATATTACTTTGTGCCGGCTCCATGGCTATCGGTGA 
AACTTCTGCGATTGCTGCAAAACTACAATCCAGTGACCGAAGAGGCCGGTGTGCGGGCCCGCTTGAATGAAACTTTGGAGACGATTTTGAACAAG 
GCGCAAGAGCCGCCAAAGAGTAAGAAGGTGCAGCACTCTAACGCAAAGAACGCCGTGCTCTTCGAGGCCATCAACCTGATCATTCACAGCGACAG 
CGAGCCGAACCTTCTTGTCCGCGCTTGCAACCAGTTGGGACAGTTCCTTAGCAATCGGGAGACCAACTTGCGCTATCTGGCTCTGGAGTCCATGT 
GCCACCTGGCCACATCGGAGTTCTCGCACGAGGAGGTGAAGAAGCACCAGGAGGTGGTCATTTTGTCAATGAAAATGGAGAAGGACGTGTCCGTG 
CGCCAAATGGCGGTGGATCTTTTGTATGCCATGTGTGATCGTGGTAATGCCGAGGAAATTGTTCAGGAGATGCTCAACTATCTGGAGACGGCCGA 
CTACTCTATTCGTGAGGAAATGGTGCTCAAGGTGGCCATTCTTGCCGAGAAGTACGCCACGGATTATACCTGGTAGGTGTTCTCTTTTTTAGCTT 
GTGCTTAAAAGTCTATGTGCTTTATGACAACCGCTATATTACCAGGTATGTGGACGTTATCCTCAATCTAATTCGCATTGCTGGCGACTACGTGT 
CTGAGGAAGTATGGTACCGCGTCATTCAGATCGTGATCAATCGCGAGGAGGTTCAAGGCTATGCCGCAAAGACTGTTTTTGAGGCGCTACAGGCC 
CCTGCCTGCCACGAAAACATGGTTAAAGTGGGCGGCTACATCTTGGGCGAGTTTGGCAACTTAATCGCCGGAGATTCCCGCTCAGCACCACTGGT 
ACAATTCAAGCTACTGCACTCCAAGTACCACTTGTGCTCGCCGATGACCCGGGCCCTTCTTCTGTCCACGTACATTAAATTTATCAACCTCTTTC 
CGGAAATTCGTACCAATATCCAGGATGTGTTCCGCCAGCACAGCAATCTTCGATCCGCAGATGCGGAACTGCAGCAGCGAGCTAGCGAATATCTC 
CAGCTTAGCATAGTGGCATCGACAGACGTTTTGGCCACCGTCCTGGAGGAGATGCCATCGTTCCCGGAACGCGAGAGCTCAATTTTAGCAGTGCT 
AAAGAAGAAGAAGCCCGGCAGGGTGCCGGAGAACGAAATTCGCGAGTCGAAGAGCCCGGCACCCCTGACTTCTGCAGCGCAGAACAACGCCCTTG 
TGAACAATTCACATAGCAAGCTGAATAACAGCAACGCAAATACGGATCTGCTTGGATTAAGCACTCCGCCATCAAATAATATTGGCAGTGGTAGC 
AATAGCAACAGTACGCTCATTGATGTTTTGGGCGACATGTACGGCAGCAATAGTAACAACAACTCAAGTGCCGTGTACAACACGAAGAAGTTTTT 

GT T C AAG AAC AAT GGTGT TT T GT T T G AGAAC G AAAT GC T GC AG AT C GG T GT G AAG AGCG AGT T T C GC C AG AAT T T AGG AC G AT T AGGC CT T T T CT 
ACGGCAACAAGACACAGGTTCCCCTGACAGTAAGTAAATTAGTTTCTGAAACATTCCAGGCATTACATTGCTCTCTTTTCAGAACTTCAATCCTG 
TGCTGCAATGGTCTGCCGAGGATGCGCTTAAGCTGAACGTGCAGATGAAGGTGGTGGAGCCAACCCTGGAGGCCGGCGCCCAGATCCAACAACTG 
TTAACCGCGGAATGCATCGAGGACTACGCCGATGCGCCAACAATCGAGATCAGTTTCCGCTACAACGGCACCCAGCAGAAGTTCAGCATTAAACT 
GCCATTGAGCGTTAACAAATTCTTTGAGCCTACCGAAATGAATGCGGAATCATTCTTTGCACGATGGAAGAACCTTAGCGGGTATTTAATAAGGT 
GTCTTTTGATAATATTCAATATTAATTGTGCATTTGTAGTGAACAACAACGGTCACAGAAAGTGTTCAAGGCTGCACAGCCACTGGATTTGCCCG 
GAGCCCGCAACAAGCTTATGGGCTTTGGAATGCAACTGCTGGACCAAGTGGATCCCAATCCAGACAACATGGTCTGCGCGGGCATCATTCATACG 
CAATCGCAACAGGTGGGCTGCCTTATGCGATTGGAGCCGAACAAGCAGGCTCAGGTGAGTAATTTATTGTTAGATACTTGGAACATAATGCCTAT 
TGCCTATTTCTCAAATTATCAATAGATGTTCCGACTGACAGTTCGGGCAAGCAAGGAGACCGTAACTCGGGAAATCTGCGATCTGTTGACGGATC 
AATTCTAAGCCACAACAATCATGGACGGGGTGATGGAGACGGAGCTGCGAAGCCATGGAGCAACGGCAACCACTTTACAATGCACAATCCCAATT 
ACCTAGAGAAATATCAAAGCATACAAGGCCCTAGAACTGTACGCGTTTAGTTATGCCAGGCACACTCCTACACTCTTCTTAAGTACCTTTCATCT 
TCTCCAACAGCGAGTTGATTTACCCACTGTCCTGTTTATTTGAATATTATGTTTGTAAACCTACTCTTACTTTAACCATTTTCAATTTCATGTTC 
CCAAACTTTTTTGTAAATGTTAAGCTCAAAAAGTTCATAGTAGAGAAACGAGAGAACATAATTAGATTATGTTTATATGTATGACAAATTATATA 
TTGTTGCATCATTTCGAGGCAGTAGCAATTGGATATTATATACCTATATACAGATCTACATACATTCATACAAACCACATACATATACCTAGTAT 

AT AC GT C G T T G T A AT A AAAAC AG AAAT C AAT AT AAAAC AAAAG AG AAAAGC T T GC AAGGG A AT T AT T AAG AT GC C T GC AAAT G AC AAGT AT GC AA 
ATCGTTATCTAAATATTTATACATATATTGATTATTGCAATTGCATCTAAACGAAATTCATGTTTAATGTTTTTGATTGTAAAATCCAATGGAGG 
ACCGCAAGAGCAGAAATCTCATTAAATAATAGACAAATAATTAGACAGATCCACAAAATCTCCTTGATTAATCACTGTTGAGCAACTTAAAATTT 

AT T AAC G AT AGG AAAAAT TTCCCTATT G AAG AAAAC G AAG C AAAAAT T T C AG AAT AT T AT T A T AC T T A T AGC AG AAT T T T T T G AAT GAT T C TGGG 
TAATATTACGTTTCGTTTGTGTACCATAGATAAACTGAGAGGGGCTACTTCATGTTGTGCCGCTTCCTGAAGAGAAAGCGATCTTTGTAGCACCA 
AGTATTAACTAATCTGAAGCAGCAGCAGCAGTAGCCGCCAGCCATAAATCTGCTAATGACTCTCTTCAAATGCGAGACGGCTGCCGATAAAGAAC 
AAAGAATTGTAAACTAAAATATACACTATATAGATACTGATACTGAAAACCGATAATGGAAACTGACATCAAATACTTATTACCATTAAACTCTC 
GATCCTTTCGTAAAAACCAATGTGCATGACAGCTGGAAATTTAGGCATTTCCAAAGTGCAATAGTTAAATGTTACAAAATTCCACACAAATTCCA 
TTTAACACTCACAGACAAAACAAAAGCAACTGCAAGAGCAAGAAAACCAAGCCACAACTACATATAGCAAGTGCAGGAATTTAACTGTAAACGTA 
AACCAATATGTATTTAAATAAATATAAATTGAAACCATTTGAAAGGTGTTTAAAACGTTTCCACGATCTGGATATAAACACTTGGCAAGCACAGT 
TGTATTTTACGTATTTATCTTTATTACGATTAGCGCACGGGTTAACATAGGACCTAACAATAATTTTAGATTTAAATCAAATT 

(SEQ ID NO: 457) 

Exon: 1001., 1318 

Exon: 2107.. 2365 

Exon: 3103. .3314 

Exon : 3524. .3979 

Exon: 4043.. 4632 

Exon: 4701.. 5634 

Exon: 5688. .5971 

Exon: 6025.. 6229 
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Exon: 6296.. 6873 
Start ATG: 2326 

Transcript No. : CT13966 

CCCGCGCCCGCTAGATGGCTCCACTCTTAGGCAGCGAGAGCGAAGAAAAATCAATAGAAATTTTCATTTTAGTGAATTGCCCTGCACTTGGTAAT 
AGTTAACAAACAAATTAGCGGCAAATCAGCGAGGCGAAGCTCCAGCGAGTTGGAGTTCCATAAAGTGCAACACGCAGGGTCAAACATTCCGCAGC 
CAACACCCCGGATTCCGATTCCAAGCGAAGCCAACTTCAGAAGTCAGAGATTACTCTCTCTGCTCCGCCACAGAAATTCCGCAACGCTAACCGAT 
TTTGTCCGCTGGCGAGTAAAAAAAATCGGAAAAGTCCCAAGGAACGGCCGAGATCCAATCGAAGTAAAAAGAGCTTTGCAGAACCAGAGCGGAGA 
GCAACTATATATCTTAGGGTCCGACGTCGAGTTCGAGCGATAAGAACGCCTCGAGACTCGGCCGCCATCCATTGACGATTTTCTTGCGCCCCCAG 
TTTGCATAAAACCAGCGAAAATTAACAAGCTAACTACCATGGCGCCGGTGCGGGGGGATGGCATGAGAGGCCTGGCCGTCTTTATATCGGATATA 
AGG AAC T GC AAAAGC AAAG AGGC C G AAGT C AAAC GC AT AAAC AAG G AGT TGGCCAATATAC G AAGC AAAT T T AAAG G AG AC AAAAC C C T C G AC GG 
CTATCAGAAAAAGAAATATGTCTGTAAGCTGTTATTCATATTTCTCCTCGGCCATGACATCGACTTCGGGCACATGGAGGCGGTCAATTTGCTGT 
CCTCCAACAAATACTCAGAAAAGCAGATTGGTTACCTTTTCATCTCAGTGTTGGTGAACACCAACAGCGACCTTATTCGGCTGATCATCCAATCT 
ATAAAGAATGATTTGCAGTCGCGTAATCCTGTGCACGTTAATCTGGCACTACAATGCATTGCCAACATTGGAAGTCGGGATATGGCCGAGTCCTT 
TTCCAACGAAATACCCAAGCTACTCGTCTCGGGCGACACCATGGACGTGGTGAAGCAGTCGGCGGCGCTATGCCTGCTCCGCCTGTTCCGCTCCT 
CGCCGGACATCATTCCCGGTGGTGAGTGGACTTCGCGCATTATCCATTTGCTCAATGATCAGCACATGGGAGTGGTCACAGCGGCCACCTCGCTG 
ATCGACGCCTTGGTGAAGCGCAATCCGGACGAGTACAAGGGTTGTGTTAATTTGGCTGTTTCACGCTTATCCCGGATCGTAACAGCCAGCTACAC 
GGATCTTCAGGATTACACATATTACTTTGTGCCGGCTCCATGGCTATCGGTGAAACTTCTGCGATTGCTGCAAAACTACAATCCAGTGACCGAAG 
AGGCCGGTGTGCGGGCCCGCTTGAATGAAACTTTGGAGACGATTTTGAACAAGGCGCAAGAGCCGCCAAAGAGTAAGAAGGTGCAGCACTCTAAC 
GCAAAGAACGCCGTGCTCTTCGAGGCCATCAACCTGATCATTCACAGCGACAGCGAGCCGAACCTTCTTGTCCGCGCTTGCAACCAGTTGGGACA 
GTTCCTTAGCAATCGGGAGACCAACTTGCGCTATCTGGCTCTGGAGTCCATGTGCCACCTGGCCACATCGGAGTTCTCGCACGAGGAGGTGAAGA 
AGCACCAGGAGGTGGTCATTTTGTCAATGAAAATGGAGAAGGACGTGTCCGTGCGCCAAATGGCGGTGGATCTTTTGTATGCCATGTGTGATCGT 
GGTAATGCCGAGGAAATTGTTCAGGAGATGCTCAACTATCTGGAGACGGCCGACTACTCTATTCGTGAGGAAATGGTGCTCAAGGTGGCCATTCT 
TGCCGAGAAGTACGCCACGGATTATACCTGGTATGTGGACGTTATCCTCAATCTAATTCGCATTGCTGGCGACTACGTGTCTGAGGAAGTATGGT 
ACCGCGTCATTCAGATCGTGATCAATCGCGAGGAGGTTCAAGGCTATGCCGCAAAGACTGTTTTTGAGGCGCTACAGGCCCCTGCCTGCCACGAA 
AACATGGTTAAAGTGGGCGGCTACATCTTGGGCGAGTTTGGCAACTTAATCGCCGGAGATTCCCGCTCAGCACCACTGGTACAATTCAAGCTACT 
GCACTCCAAGTACCACTTGTGCTCGCCGATGACCCGGGCCCTTCTTCTGTCCACGTACATTAAATTTATCAACCTCTTTCCGGAAATTCGTACCA 
ATATCCAGGATGTGTTCCGCCAGCACAGCAATCTTCGATCCGCAGATGCGGAACTGCAGCAGCGAGCTAGCGAATATCTCCAGCTTAGCATAGTG 
GCATCGACAGACGTTTTGGCCACCGTCCTGGAGGAGATGCCATCGTTCCCGGAACGCGAGAGCTCAATTTTAGCAGTGCTAAAGAAGAAGAAGCC 
CGGCAGGGTGCCGGAGAACGAAATTCGCGAGTCGAAGAGCCCGGCACCCCTGACTTCTGCAGCGCAGAACAACGCCCTTGTGAACAATTCACATA 
GCAAGCTGAATAACAGCAACGCAAATACGGATCTGCTTGGATTAAGCACTCCGCCATCAAATAATATTGGCAGTGGTAGCAATAGCAACAGTACG 
C T CAT T G AT GT T T T GG G C G AC AT GT AC GGC AGC AAT AGT AAC AAC AAC TCAAGTGCCGT GTAC AAC A C G AAG AAGT T T T T GT T C AAG AAC AAT G G 
TGTTTTGTTTGAGAACGAAATGCTGCAGATCGGTGTGAAGAGCGAGTTTCGCCAGAATTTAGGACGATTAGGCCTTTTCTACGGCAACAAGACAC 
AGGTTCCCCTGACAAACTTCAATCCTGTGCTGCAATGGTCTGCCGAGGATGCGCTTAAGCTGAACGTGCAGATGAAGGTGGTGGAGCCAACCCTG 
GAGGCCGGCGCCCAGATCCAACAACTGTTAACCGCGGAATGCATCGAGGACTACGCCGATGCGCCAACAATCGAGATCAGTTTCCGCTACAACGG 
CACCCAGCAGAAGTTCAGCATTAAACTGCCATTGAGCGTTAACAAATTCTTTGAGCCTACCGAAATGAATGCGGAATCATTCTTTGCACGATGGA 
AGAACCTTAGCGGTGAACAACAACGGTCACAGAAAGTGTTCAAGGCTGCACAGCCACTGGATTTGCCCGGAGCCCGCAACAAGCTTATGGGCTTT 
GGAATGCAACTGCTGGACCAAGTGGATCCCAATCCAGACAACATGGTCTGCGCGGGCATCATTCATACGCAATCGCAACAGGTGGGCTGCCTTAT 
GCGATTGGAGCCGAACAAGCAGGCTCAGATGTTCCGACTGACAGTTCGGGCAAGCAAGGAGACCGTAACTCGGGAAATCTGCGATCTGTTGACGG 
ATCAATTCTAAGCCACAACAATCATGGACGGGGTGATGGAGACGGAGCTGCGAAGCCATGGAGCAACGGCAACCACTTTACAATGCACAATCCCA 
ATTACCTAGAGAAATATCAAAGCATACAAGGCCCTAGAACTGTACGCGTTTAGTTATGCCAGGCACACTCCTACACTCTTCTTAAGTACCTTTCA 
TCTTCTCCAACAGCGAGTTGATTTACCCACTGTCCTGTTTATTTGAATATTATGTTTGTAAACCTACTCTTACTTTAACCATTTTCAATTTCATG 
TTCCCAAACTTTTTTGTAAATGTTAAGCTCAAAAAGTTCATAGTAGAGAAACGAGAGAACATAATTAGATTATGTTTATATGTATGACAAATTAT 
ATATTGTTGCATCATTTCGAGGCAGTAGCAATTGGATATTATATACCTATATACAGATCTACATACATTCATACAAACCACATACATATACCTAG 
T ATAT ACGTC GT TGT AAT AAAAAC AGAAATCAAT AT 
(SEQ ID NO: 458) 

Start ATG: 538 

MRGLAVFISDIRNCKSKEAEVKRINKELANIRSKFKGDKTLDGYQKKKYVCKLLFIFLLGHDIDFGHMEAVNLLSSNKYSEKQIGYLFISVLVNT 
NSDLIRLIIQSIKNDLQSRNPVHVNLALQCIANIGSRDMAESFSNEIPKLLVSGDTMDVVKQSAALCLLRLFRSSPDIIPGGEWTSRIIHLLNDQ 
HMGVVTAATSLIDALVKRNPDEYKGCVNLAVSRLSRIVTASYTDLQDYTYYFVPAPWLSVKLLRLLQNYNPVTEEAGVRARLNETLETILNKAQE 
PPKSKKVQHSNAKNAVLFEAINLIIHSDSEPNLLVRACNQLGQFLSNRETKLRYLALESMCHLATSEFSHEEVKKHQEVVILSMKMEKDVSVRQM 
AVDLLYAMCDRGNAEEIVQEMLNYLETADYSIREEMVLKVAILAEKYATDYTWYVDVILKLIRIAGDYVSEEVWYRVIQIVINREEVQGYAAKTV 
FEALQAPACHENMVKVGGYILGEFGNLIAGDSRSAPLVQFKLLHSKYHLCSPMTRALLLSTYIKFINLFPEIRTNIQDVFRQHSNLRSADAELQQ 
RASEYLQLSIVASTDVLATVLEEMPSFPERESSILAVLKKKKPGRVPENEIRESKSPAPLTSAAQNNALVWNSHSKLNNSNANTDLLGLSTPPSN 
NIGSGSNSNSTLIDVLGDMYGSNSNNNSSAVYNTKKFLFKNNGVLFENEMLQIGVKSEFRQNLGRLGLFYGNKTQVPLTNFNPVLQWSAEDALKL 
NVQMKVVEPTLEAGAQIQQLLTAECIEDYADAPTIEISFRYNGTQQKFSIKLPLSVNKFFEPTEMNAESFFARWKNLSGEQQRSQKVFKAAQPLD 
LPGARHKLMGFGMQLLDQVDPNPDNMVCAGIIHTQSQQVGCLMRLEPNKQAQMFRLTVRASKETVTREICDLLTDQF* 
(SEQ ID NO: 459) 

Classif icatxon : known_f lybase_gene 
Gene Symbol: alpha-Adaptm 
FlyBase ID: FBgn0015567 



Celera Sequence No. : 142000013384593 

ACAATGAATCTCAATTAGACTTACCCGTATATGTAGTAGAAGAAGCTAATCAGATAGACGGCCAGTTTAATCCAACCCTCACGCATGTTCCGGTA 
CAAAGTATCGGTTTTCAGGACCGTGGTGGGATCATAAAGTCCGGGTCCCGACATCACAGGGCGGTTTTTGTAACTGAAAGCGATAGGGGTATTAC 
AAAGCATGAGTGCTTAAGGGTGCTTATGTTGGGCTTGGGTGAATCTTGATCTCCTCGGATCTCAAATCACATTATATCACACGGTCAAATGAGTC 
ACTTACCGCCAAATATGATAGGCTATGAGGGGTATGTTGATGCATAGCGAAAACCATTCGCCGCAGAACAAGAAGAGCAAGTTCAGAAAGATGTG 
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CAGAAGGTATTCGGGCAAAACCAGCTGAAATGCAAACACATACATTACAGGGCGCACTGCGGGTGTAAGCAGTAAGTTTACTGACCGGATTGAGG 
CTATTGCACTGATCAATTGGGTTCTTATAATCCGTTTTAAGTTCGTCGAATGCAATGACATGAAATATGGCGAAAAATATCAAGAATGCATCCCC 
GATCAGGGCCACTATGTACGTGAAGGCGGTGAAGTTGAAGGCCATCTTGTAAAATCGCTGGAGCACGTCTTCCAGGTGCTAATCAAAGTGACTTG 
CTAATTATTATCGTAAAATTCGGCTTTGCAATTGTAGCGCTGGGACTCACTTGTTTTTTTTTTACAAGGGCGGCGATGATGACAGCTGTGCGCCA 
GGGATGCACTAATCTAAACGATCAGGCAGCACAGATGTACGGTCACATTGCAAATGAGGCTAATAAATAATCAGCTATCCATTATAAATCAAATA 
TAATTTCATTTAAAGAAGCAATTTTAATTTCTAACGAATTATAATGTTTTGTATTAATTTTAATTCTCTGGTAATTTTCTCTACGCACAAAAAAT 
ACCACTAACAGTTAGCCGATGCGGTCACACTACATTTCCAAATCGAACCGCCATTTGTGTTGCATCTCGAAATCTGGAAAAAATTCGATCCAAGT 
GTTCTATGAGCTTCTGTTTTTTAATAATAATAAATAAATAAATTCGATAATCAACTGACACCAGAAAAGAGCGGAACACAATGTCGCAGTTCAAT 
TTTGTGAGCGATTTGCAGAATGCTCTCATCATGGACGGCGAGACGCGCGGACCTGCGCCCAGGTGGAAGAAGAAGCTGGAGGCGTCTCTAAATGG 
AAGTGTGAATACCACTCGGTCGGTGCTATCCGTCTCGTACAACACCAGTTTCTCGGGTGTCCAGGCGCCCACGAAAACTCCGGGCAAGAGCAGCG 
AGGGCAAGACCAAGAAGTCCAACACCACGCCCTCTAAGACGCCAGGAGGCGGAGATCGCTTTATTCCGAATCGGGCGGCTACCAACTTTGAGTTA 
GCACACTTTCTGGTAAGGATTCCACTCATCCCTCCATCCCGCCCTGTAAGGGCATCCTTATTCCAGACCCCTTAAGAATTTATTGACCACCTGAC 
CATCTATTAGGTGAACAAAGACTCCGGCGATAAGTCCGATGAGGAGAACGACAAGGCCACCTCGAGCAACAGCAACGAGAGCAATGTCCAGGCTT 
CGGCTCACAAGGGCGACCGGCAGAAACTCATCTCTGAAGTGGCCCAGGTCGGTGACTCCAAGGGCGGGCGCATTTTGTGCTACCAAAACAAGGCT 
CCCGCTGCTCCAGAAACACACAACAATCCCCTGAAGGTCGTGTACTCCATTAAGACACCCATATCCACAAAGAGTGGCTCACGCTATATACCCAC 
CACATCCGAGAGGATTCTGGATGCACCTGATTTTATTAACGATTACTGTAGGTCGAAGAAAACTTTTCTTTTCTTTTTTTCTACTATTGATCTCT 
TCCAGATTTAAATCTTATGGATTGGAGTGCCGACAATATAGTGGCTGTGGCCTTGGGCAGTTGCGTCTATTTGTGGAACGCACAGACCGGAAATA 
TCGAGCAGCTTACGGAGTTTGAGGAGGGCGACTACGCAGGCTCGCTATCGTGGATCCAGGAGGGGCAGATACTTGCCATCGGCAACAGCACCGGT 
GCCGTGGAGCTGTGGGACTGCTCCAAAGTGAAGCGTCTGCGAGTGATGGATGGACACAGTGCCCGAGTGGGATCCTTGGCCTGGAACTCATTCCT 
GGTTTCCTCTGGCAGCCGGGATGGCACCATTGTCCACCACGATGTGCGTGCACGTGAGCACAAGCTTTCCACATTGTCCGGACACACGCAGGAGG 
TTTGCGGCCTAAAGTGGTCCACGGATTTCAAGTATTTGGCTAGCGGAGGCAACGACAATCTGGTGAATGTTTGGTCGGCGGCCAGCGGTGGCGTG 
GGAACTGCCACCGATCCCTTGCACAAATTCAACGACCATCAAGCTGCAGTGCGTGCCTTGGCCTGGTGTCCCTGGCAACCAAGTACTCTAGCCTC 
TGGAGGCGGCACCGCCGATCGCTGCATCAAGTTCTGGAATGTGAACAATGGCACTTTAATGAAATCCGTGGACTCCAAGTCGCAGGTCTGTTCTC 
TGCTCTTTTCTCGCCACTACAAGGAGCTGATCTCTGCGCATGGTTTTGCTAACAACCAACTGACCATTTGGAAATACCCAACAATGGTGAAGCAA 

GC CG AT TT G AC T GG AC AC AC GT C AC GAGT T C T C C AG AT GGC C AT GT CT CC GG ACGG C AG C AC AGT GAT C AGCGCC GGAGCT GAT GAAAC C C T G C G 
TCTTTGGAACTGCTTCGCTCCCGATCCGTTGGCGTCCAAGAAGGCAGTTTCGACCAGCAAGGGCAAACAGAGCGTGTTCCGACAGAGCATCCGTT 
GATATGCTCAGACCTTTAGAACTGTTTTTACCCCCTTGATTGCTAAGTTTAAGCTTCAATACTTACTACTGGTATGTTTTCCAGACTAGACAATT 
ATTTCTTGAATGCATATCCAATTTTTATGTTCTCGTTTAATGTTTCGTACTTTGTAATTACGGATAAATCTCTGATTATGTGCCCCCGCCGACGA 
TATATGAATAATTAAATTTTGTCAGTTACTTTTATTGCTTAGAGTTAAGTTTGTGTGCGTATAAATAAAATGCTTGACATTCTGCATCCTTGTAT 
GCTTTATTTCGCTCTAAACATTTTGCGTTAACACAAGGAATTCTATTCGTTTCGGATTCCCACGTGCCGGGACAGTAAGTACATAAGGTTCCATT 

T C CT GG AT G CAT TT T GAG AAC GT GT GTGT C T AGG AAT AAT AT T GC AT AC AATT C T G GGGC AG AAAAGG CGTGCTTCTG GGG AC AT C G AT AT AAC C 
ACATCGGGAATTTAAGTTGTGTTAATCATAGGTATGTTTATGTTTAGTCTAAATCGTATGTACAGATTGGATGGGCATTCAGGCAACTGTCATGG 

GAT T GC C AC C GT TGT AGC T G AC AAAAT AGC G AGG AT C ACC AAAGCG AT C GT AC ATC T GC AAG AAC AAG AGGT T C GC AAG AAAAAAGC G GGTT AAA 
TTAAAAATATTACAAAATTGTTAGGTACAAGAACAAAAGCCTCATTAAGCGATAAAATAAGCAAACAACATAGCAAGAACAAGTAAACAATAAAG 
GGGGG CAT AC T T T C AAGT C T AAAT AG AGT T AC AAACC T T C TT T T C AAT C C AT T ACAAC AT C GAAT T C AAC AT T T AAT TTTTGTTTGCT AAT T T TT 
T TT T GT AG AAT AAAAT AT T T AT T T T GC AAC AT T AT T T CAT T C AGT C AG AAC AT G AAAAC AT C AAT TAG AT T TAG C T AGC T T AGT AC T AC GGT T T A 
TGATTCAATTTTGTATAATAGATTAGTTGGTCGATTCACGCACACGCGCACG 
(SEQ ID NO: 460) 

Exon: 1001.. 1437 
Exon: 1531.. 1852 
Exon: 1906.. 2852 
Start ATG: 1126 

Tirans cr ipt No * CT13999 

CCATTTGTGTTGCATCTCGAAATCTGGAAAAAATTCGATCCAAGTGTTCTATGAGCTTCTGTTTTTTAATAATAATAAATAAATAAATTCGATAA 

T C AAC T G AC ACC AG AAAAGAG C GG AAC AC AAT GT C GC AG T T C AAT T T T GT G AGC GAT T T GC AG AAT G C T C T CAT CAT GG AC GG C G A G ACGC GC G G 
ACCTGCGCCCAGGTGGAAGAAGAAGCTGGAGGCGTCTCTAAATGGAAGTGTGAATACCACTCGGTCGGTGCTATCCGTCTCGTACAACACCAGTT 
TCTCGGGTGTCCAGGCGCCCACGAAAACTCCGGGCAAGAGCAGCGAGGGCAAGACCAAGAAGTCCAACACCACGCCCTCTAAGACGCCAGGAGGC 
GGAGATCGCTTTATTCCGAATCGGGCGGCTACCAACTTTGAGTTAGCACACTTTCTGGTGAACAAAGACTCCGGCGATAAGTCCGATGAGGAGAA 
CGACAAGGCCACCTCGAGCAACAGCAACGAGAGCAATGTCCAGGCTTCGGCTCACAAGGGCGACCGGCAGAAACTCATCTCTGAAGTGGCCCAGG 
TCGGTGACTCCAAGGGCGGGCGCATTTTGTGCTACCAAAACAAGGCTCCCGCTGCTCCAGAAACACACAACAATCCCCTGAAGGTCGTGTACTCC 
ATTAAGACACCCATATCCACAAAGAGTGGCTCACGCTATATACCCACCACATCCGAGAGGATTCTGGATGCACCTGATTTTATTAACGATTACTA 
TTTAAATCTTATGGATTGGAGTGCCGACAATATAGTGGCTGTGGCCTTGGGCAGTTGCGTCTATTTGTGGAACGCACAGACCGGAAATATCGAGC 
AGCTTACGGAGTTTGAGGAGGGCGACTACGCAGGCTCGCTATCGTGGATCCAGGAGGGGCAGATACTTGCCATCGGCAACAGCACCGGTGCCGTG 
GAGCTGTGGGACTGCTCCAAAGTGAAGCGTCTGCGAGTGATGGATGGACACAGTGCCCGAGTGGGATCCTTGGCCTGGAACTCATTCCTGGTTTC 
CTCTGGCAGCCGGGATGGCACCATTGTCCACCACGATGTGCGTGCACGTGAGCACAAGCTTTCCACATTGTCCGGACACACGCAGGAGGTTTGCG 
GCCTAAAGTGGTCCACGGATTTCAAGTATTTGGCTAGCGGAGGCAACGACAATCTGGTGAATGTTTGGTCGGCGGCCAGCGGTGGCGTGGGAACT 
GCC ACC GATCCCTTGCAC AAAT TCAACGACC AT CAAGCTGCAGTGCGTGCCTTGGCCTGGTGTCCCT GGC AACC AAGT ACTCT AGC CTCTGGAGG 
CGGCACCGCCGATCGCTGCATCAAGTTCTGGAATGTGAACAATGGCACTTTAATGAAATCCGTGGACTCCAAGTCGCAGGTCTGTTCTCTGCTCT 
TTTCTCGCCACTACAAGGAGCTGATCTCTGCGCATGGTTTTGCTAACAACCAACTGACCATTTGGAAATACCCAACAATGGTGAAGCAAGCCGAT 
TTGACTGGACACACGTCACGAGTTCTCCAGATGGCCATGTCTCCGGACGGCAGCACAGTGATCAGCGCCGGAGCTGATGAAACCCTGCGTCTTTG 
GAACTGCTTCGCTCCCGATCCGTTGGCGTCCAAGAAGGCAGTTTCGACCAGCAAGGGCAAACAGAGCGTGTTCCGACAGAGCATCCGTTGA 

(SEQ ID WO: 461) 
Start ATG: 12 6 

MSQFNFVSDLQNALIMDGETRGPAPRWKKKLEASLNGSVNTTRSVLSVSYWTSFSGVQAPTKTPGKSSEGKTKKSNTTPSKTPGGGDRFIPNRAA 
TNFELAHFLVNKDSGDKSDEENDKATSSNSNESNVQASAHKGDRQKLISEVAQVGDSKGGRILCYQNKAPAAPETHNNPLKVVYSIKTPISTKSG 
SRYIPTTSERILDAPDFINDYYLNLMDWSADKIVAVALGSCVYLWNAQTGNIEQLTEFEEGDYAGSLSWIQEGQILAIGNSTGAVELWDCSKVKR 
LRVMDGHSARVGSLAWNSFLVSSGSRDGTIVHHDVRAREHKLSTLSGHTQEVCGLKWSTDFKYLASGGKDNLVNVWSAASGGVGTATDPLHKFND 
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HQAAVRALAWCPWQPSTLASGGGTADRCIKFWNVWNGTLMKSVDSKSQVCSLLFSRHYKELISAHGFANNQLTIWKYPTMVKQADLTGHTSRVLQ 
MAMSPDGSTVISAGADETLRLWNCFAPDPLASKKAVSTSKGKQSVFRQSIR* 
(SEQ ID NO: 462) 

Name: fizzy 

Classification : cell_cycle_regulator 
Gene Symbol: fzy 
FlyBase ID: FBgn0001086 



Celera Sequence No. : 142000013384668 

GGGCTATCCGGAGTTCGGACGCCTGCTCAATGAAACTGGTCGACCGATGGTATACTCCTGCAGTTGGCCCGCGTATCAGGAGGATGCAGGAGAAA 
TGCCCGATTATGAGTCACTCAAGCAGCACTGTAATCTGTGGCGCAACTGGGACGACATCGAAGACTCGCTCGAGTCCCTTATGCAGATCATTGAC 
TACTTTGCCAAGAATCAGGACAGGATTCAGCCGCATGGCGGACCAGGACATTGGAACGATCCAGATATGCTGCTGCTGGGAAACTACGGCTTGAG 
CTACGATCAAAGCAAGCTGCAGATGGCGATTTGGGCCATTATGGCAGCTCCTCTAATTATGTCCAATGATCTGGCTGCAGTGCGTCCCGAGATCA 
AGGCTATACTCCAGAATCGGTTGGTTAAGCTTCGATATCTTATCTTCTTGGACTTATTGGTAAACTTTTGTTATTTGCAGTGCGGTTATTGCTGT 
GGACCAGGATGAGCTGGGCATCCAGGGGCGCCGTGTTTTGTCCCGCAACCAAATCGAAGTCTGGAAGCGTCCCATTACGCCAGTAACCAAGAGTG 
GACATCACTCCTATGCTGTCGCCTTTGTCAGTCGCCGAGATGATGGCGCTCCCTACAGGATCCCCTTCACGGTCAAGGAGCTCGGATTGACGAAT 
CCCAAGGGTTATAATGTGCAGGATCTGTACGATGCCAGCAGCAAGTTGGGCGTCTTCCAGTCTGAGAGTCAGTTCATCACACGCGTCAATCCCAA 
TGGTAAAATCCGTTGCAAAACACTAACTAGCCCTGACTTACTTCATTTTCCCCTCCCCCAGGCGTAACTTTCTACAAGTTTACAGCGTTGTAAGC 
CCAAGATATGCAAATCAGCCGCTGCCATCAAATCGAATATGCTCTTATATACACACTATTTCCACATTGTCCATTGACGTTGCATCGATGATCTG 
TGTAATAAGTTAATTCCGACAATAAATACCAGGTAATCACTGAAGAGTTTGACTTTGTTGATTTTTATTGTTTATTAAACAATTGATTGTGTGCA 
TTACACACAGGTATAAGTATTGTACAATTATAGGCATAGATTGCAGTACCGACGCTAGTTAAGACTTGTTCCATGATATAAATAAGGTCTGACTG 
GATGAAAGAGTTAATTCGCAAGCTTGTATAAATAAGAAGTGTAGTACTATATACGCGACAAGGACCCGAATAAGAGCAAAAAGGATAGCGACCCC 
TCGGGATCATACCAGTTATAAATAGATACCATCTACAGCTACTGCTTACATGGGTTTGGTCGAGAGAGTTAGAGTTATTGTCAAGTTGTAGGCGA 
AACCGAAACCAAACCGAAGGAACCATAGCCCGGGGTTAGTGCACCCCATTGCTGATGGTCGGTGTGCGCTCGTAGGTGCGTCTATGCAGCGCATC 
CACGTCCTTCAGGTAGTAGGTGCCGGGAAACAGTGCGGAGATGCTGCCCGTTGGCGTGTAGGGCGCCGCATGGTTGTTCTTCTCCCGCACCTCCA 
TCAGCGCGGAGAACTGCTCGGGCGCAACCTTCTCCCGTGAATTCAGCAACGGCTGCACGTAGTCCAGCTGCGAGACGAACTTTTCGAAAGCAGCT 
GCATCCTGGGTCACGCTGATCGAGTACATGGAAGCAGCCAGTCCGGATCCGTAGGAGAACAAACCAATGCGCTTTCCCACCAGCTCCTGGGCTGG 
TCCGCTAATCAACAGAGACACCAAACCGGAGTACACCGAGGGTGTGTACATATTGCCCACTTGGTTGGCCAGCAGCAGCGATTTCTTGGTCTTGC 
TGGCAAAGATGTTGGCGGACTGCGTCATGAAGGCCTTCTCCACATCGCGATCAAAGTAGGTGCTCTCCAGGGTAGCGGTATTAAAACGTTCCAGG 
TCGGGAAACTGCTTCGTTCGCTCCTCCTCACTGCTCAGCAGGAAATCGTTGAAGCTCAGGCGACCCACAGATTTTTGCACCAACTTGCAGAACGG 
TGTGTGGAATAGGATAGCATCGAAGGTGCTCAGGCTGGCCGGCTGCTTGGAAGTATCCTTCTGCTGCTGGTCGAACTTCTTGCGATACAGCCTGT 
AGCAGGTGTCCAGAGCAGAAAGGTAGCACTGAATAGAGAGCTTTCCATCTACCGTTGGGTATTCGGAGCTGAGGTCCGGCTTGTAAAAGTCGTAG 
GCGTGCTCCATGTGCGTTGCGCGTAGTCCACGATCGAGGATAAGCGGCGCATTTGGACCCACCAGCATAGCCACAGCACCCGCACCACCAGTTGG 
GCGCGCTGCACCCTTGGCGTACACGGCGATATCAGCACAAACGGCCAAAGCCAGGCGGCCGTCCCAGCTAGAAGATTCCACCCAATTGACGGCAT 
TGAAGAGCGCCGCCGTGCCGCCGTAACAGGCGTTCGTGGTGTCGATGCCCTCGATGTCCGTGTTCCCGCTCTCGGCGAACAGCTGCATCAGCACG 
GACTTCACCGACTTCGACTTGTCCACAATGGTCTCGGTGCCCACCTCCAGTCGTCCGATCTCCGAGTGCTTCACATGTTGCCGCTCCAGCAGGCG 
GCTCACAACCGTCAGGCAGAGAGAGTTGACATCCTCGCGGTCCGAGCAGAAGCCCATCTTGGCCTGGCCCAGGCCAATTGTGTACTTGCCCGCGG 
AGGCACCATCGAAGGTCTCCAGCTCCGTTTGGTCCACGTACTGGGAGGGGAAGAGAATCTCGATGGCGCGTATGCCAACGTTCTCGGGCCAGTGG 
GATGCCATGATTGTATCCGGTTATCTATTTAATCCTGAGCTCTGTAAGAGAGACAATTTGTATACTTTTAGTGGGAGCTTTGGCATGAGAGTAAA 
TATTTAATTCAATATCTTTGTTATTAACAACTATAACAATCACTATTTGCGTTTCGAAATTGCTTCATGATGTTGGGAGCTAAGGCCTTGTTTAA 
TTTAATTTAAGCAGAGTTCTCGATGTGGCCCGAATTCCACTGCCAATTGGCATTGGGAAACTTTCTCCCCGCGCAATCACGGCGTATTTATAATG 
CTTACAAATTGGACGAGGGCAGAGACAACCTGTTGCACGCTCGAGCTGCGTTGGACGTGTGTAAGCCAGTGACTTTGGCTGCGGCTGAAGAAGTC 
ATCAAAGTCTGCCAGCCAGAGGCCCCCGTCTTGGGGGCCAACTGATTTCCCTGGCTGAGAAACTGAGCGTGTAGAGAGAGAGAGAGAGAGGGAGA 
GAGAGCCGCCGGAGCGGAGAAAGTTTCCCCCGCTCAGGCGAGCTCCACAAAAGAGAACACTAAACCGCTGTGAAGTAAGCCAAAACACACGCATC 
GCAATAAATGTAAGCCAACAAGACAAGCCCAAAGAGCCAAAGGGACAGGGGCGTGGATGCTGTGGCTACACGCAGAGAATTTATGATTACATTTA 
TTTACAATAGATAATCATTTCTTTTAAACTTTTTTTGTCAGAGAACAAGCTACAAAAATGATTACTTTAACACAATAACTCTTTTTTCAAAACCG 
CTTCCCCTGAAACTTCTCTACATTTTCACACATTTTAATTTCAGTTAGGCGCAAATTATAATTAAATCAAAATGATGATCGAAATATTATGATTT 
CCTATCGAAATAGAATGGAATTTTGCTCGGTGGAGCATTGTGGGACAGGCTGGGAGCGTCTTCACCCATAAGTGGGTCATGCCCACCAAGCACTA 
AGCACCAACCACCACTCTCCCTCAATCACACGCGTCCATATTGTATAACCGATAAGAGGTGCGAGTGTGGCCCAAACCCGAAAAGCGGTTGACAT 
ATTTACCGTTACGTTTAGCACCTGCGACTACTGGCCTATTGACTATAGATTTTCACTTTTCGGCCCGGAACTTTGTGGATTTGTGAAAAAAGGCT 
GGGTAGCGCGAAGCTCACTCGATCGGATCTCTCCGCGGACTGACGGCACTGGAGGCGCTGCACTGGCATATGAATATGACCATGACGACGGAATT 
GGCAGTGGAGAAGCAGACACCTAGCCAACGAAGCCACCTAGCCACCGAGCAAACACAGAGGCGAGAATAGGCGCTGTAAGTCAATGCCAATTGCT 
GAAAGCGCTCGGCGAACAAATAGCGATCGTGCATAATTACGGGAAATATTTATCGCTTAGATACGGCCAGCGAGCACGAAACACAATCGTCGGGG 
ACTAAGAAGCCGCCAGACTATATAGATATGCACATCGATAGTCGTGGCCTTTTGCGATGGGGTAAAGATCAAAGGCTGTCCATCGCTGATAAGGC 
CGAAACAAAGCGGCTTCCTTAAAAATGCCACCGAATAAGATGTGACACTCTGTCCATTTCAGTGACAAAAACTGGGAAGTGACTTCCAATAGCTT 
TTCTCCGATTAATTACTTAAGACGTGCCCTTAGGAATTATTAATAGAAAATTATTTATAATTTGAGTTCTACTAATTTTCCTACTGGTATAACTT 
AAATTGGATTATCATCTTTTCTGAACACTATCTTGAAACGTGATTTCAAATCACCTGAACACAATTGGTAATTGATAAAATATTCTACAGAAATC 
ACTTAAAGTAAGATAATAACAAGACTACTTTAAGTAGTTGTGATATTACTTGAGTCGTACTTAAGTTAATTAATACAGTTTAGTTAAAGTAAGTA 
AAATTAACAAATAAGAGAATACTTTGTCCTATTGTGCAAAAGAGTATGAATCATTATCCAAAGTCATGAAACTCTAATTTCAAGCAATTTAAGTG 
ATTTAAGTCATTAATTCCTCGAGTGCCTGCGTGGAATACAGCTCACCCGGTGTTCCTAAGCAGCCCGTGGATTTAAGAATGGAAATTGATGCGCT 
GTTTATGCGGAAAAGGAGTCATCGAAGGGAAGGGAACGGGGCACTGTTGTGGTTTTGTGGGCTTATTTTTAGCACT 
(SEQ ID NO: 463) 

Exon: 3921.. 3822 
Exon: 2796.. 1001 

Start ATG: 27 63 {Reverse strand: CAT) 
Transcript No. : CT14107 
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CCGATCGAGTGAGCTTCGCGCTACCCAGCCTTTTTTCACAAATCCACAAAGTTCCGGGCCGAAKAGTGAAAATCTATAGTCAATAGGCCAGTAGT 
CGCAGAGCTCAGGATTAAATAGATAACCGGATACAATCATGGCATCCCACTGGCCCGAGAACGTTGGCATACGCGCCATCGAGATTCTCTTCCCC 
TCCCAGTACGTGGACCAAACGGAGCTGGAGACCTTCGATGGTGCCTCCGCGGGCAAGTACACAATTGGCCTGGGCCAGGCCAAGATGGGCTTCTG 
CTCGGACCGCGAGGATGTCAACTCTCTCTGCCTGACGGTTGTGAGCCGCCTGCTGGAGCGGCAACATGTGAAGCACTCGGAGATCGGACGACTGG 
AGGTGGGCACCGAGACCATTGTGGACAAGTCGAAGTCGGTGAAGTCCGTGCTGATGCAGCTGTTCGCCGAGAGCGGGAACACGGACATCGAGGGC 
ATCGACACCACGAACGCCTGTTACGGCGGCACGGCGGCGCTCTTCAATGCCGTCAATTGGGTGGAATCTTCTAGCTGGGACGGCCGCCTGGCTTT 
GGCCGTTTGTGCTGATATCGCCGTGTACGCCAAGGGTGCAGCGCGCCCAACTGGTGGTGCGGGTGCTGTGGCTATGCTGGTGGGTCCAAATGCGC 
CGCTTATCCTCGATCGTGGACTACGCGCAACGCACATGGAGCACGCCTACGACTTTTACAAGCCGGACCTCAGCTCCGAATACCCAACGGTAGAT 
GGAAAGCTCTCTATTCAGTGCTACCTTTCTGCTCTGGACACCTGCTACAGGCTGTATCGCAAGAAGTTCGACCAGCAGCAGAAGGATACTTCCAA 
GCAGCCGGCCAGCCTGAGCACCTTCGATGCTATCCTATTCCACACACCGTTCTGCAAGTTGGTGCAAAAATCTGTGGGTCGCCTGAGCTTCAACG 
ATTTCCTGCTGAGCAGTGAGGAGGAGCGAACGAAGCAGTTTCCCGACCTGGAACGTTTTT^ATACCGCTACCCTGGAGAGCACCTACTTTGATCGC 
GATGTGGAGAAGGCCTTCATGACGCAGTCCGCCAACATCTTTGCCAGCAAGACCAAGAAATCGCTGCTGCTGGCCAACCAAGTGGGCAATATGTA 
CACACCCTCGGTGTACTCCGGTTTGGTGTCTCTGTTGATTAGCGGACCAGCCCAGGAGCTGGTGGGAAAGCGCATTGGTTTGTTCTCCTACGGAT 
CCGGACTGGCTGCTTCCATGTACTCGATCAGCGTGACCCAGGATGCAGCTGCTTTCGAAAAGTTCGTCTCGCAGCTGGACTACGTGCAGCCGTTG 
CTGAATTCACGGGAGAAGGTTGCGCCCGAGCAGTTCTCCGCGCTGATGGAGGTGCGGGAGAAGAACAACCATGCGGCGCCCTACACGCCAACGGG 
CAGCATCTCCGCACTGTTTCCCGGCACCTACTACCTGAAGGACGTGGATGCGCTGCATAGACGCACCTACGAGCGCACACCGACCATCAGCAATG 
GGGTGCACTAACCCCGGGCTATGGTTCCTTCGGTTTGGTTTCGGTTTCGCCTACAACTTGACAATAACTCTAACTCTCTCGACCAAACCCATGTA 
AGCAGTAGCTGTAGATGGTATCTATTTATAACTGGTATGATCCCGAGGGGTCGCTATCCTTTTTGCTCTTATTCGGGTCCTTGTCGCGTATATAG 
TACTACACTTCTTATTTATACAAGCTTGCGAATTAACTCTTTCATCCAGTCAGACCTTATTTATATCATGGAACAAGTCTTAACTAGCGTCGGTA 
CTGCAATCTATGCCTATAATTGTACAATACTTATACCTGTGTGTAATGCACACAATCAATTGTTTAATAAACAATAAAAATCAACAAAGTC 
(SEQ ID NO: 4 64) 

Start ATG: 134 {Reverse strand: CAT) 

MASHWPENVGIRAIEILFPSQYVDQTELETFDGASAGKYTIGLGQAKMGFCSDREDVNSLCLTVVSRLLERQHVKHSEIGRLEVGTETIVDKSKS 
VKSVLMQLFAESGNTDIEGIDTTNACYGGTAALFNAVNWVESSSWDGRLALAVCADIAVYAKGAARPTGGAGAVAMLVGPNAPLILDRGLRATHM 
EHAYDFYKPDLSSEYPTVDGKLSIQCYLSALDTCYRLYRKKFDQQQKDTSKQPASLSTFDAILFHTPFCKLVQKSVGRLSFNDFLLSSEEERTKQ 
FPDLERFNTATLESTYFDRDVEKAFMTQSAWIFASKTKKSLLLANQVGNMYTPSVYSGLVSLLISGPAQELVGKRIGLFSYGSGLAASMYSISVT 
QDAAAFEKFVSQLDYVQPLLNSREKVAPEQFSALMEVREKNHHAAPYTPTGSISALFPGTYYLKDVDALHRRTYERTPTISNGVH* 
(SEQ ID NO: 465) 

Classification: enzyme 



Celera Sequence No. : 142000013384045 

ATCAGACACACACACACACATTCACACGTAACTCTAAATACTATATGTATAAATGAAAGTTCTTTTTTTGTCGGCTTCCCACATTCCCCGATTGT 

TCAGATATTGGCCATAGGCGGGGCCTTTCCTGGGGTTTTTCTTCCACACCCAGATAACGGCTATCTGGTACTGGTTATAGTGACTGCGTTTGGGG 

CGCTAAAGGCTATAAATCGTCTAGATTTATTTACAATTAATTTTATTTGATTTGTCATACGTTTCTCAGGCCGAATTTCATTTGCAAAGTTCTAA 

TTTCTGCTGTTTAATATAATGCACTTGCGTTTTAAGGCAATTTATGGATTACTATTTGCGAAAATCAAAATTTCTTTTTTTGCAATTAACATTAT 

ATAAAAAATGTTTTTTTAATTAGTCGAAATCATTTTTAAAATGTTAATTCTTGTATGTCATAAGCGCAAAAGTATGAGTTGTACTTAATCTAAAT 

TCTAAAACAAGTTAGTCATTAAAAATACTACGAAACAATAACTCAGAACAGAAAAATTTAAACATGTTATCTAGAGCAATATAGAAATTAATTTG 

AGCAATATTACATCAAATATCTCAAGTTTTAGAAGTTCTTCCACATTTATTTTACTGTGTACGCACAATATCATACTTATATGCTTATGTGTATA 

TCTTTCTATCCGTGATCTGCCCCCCAGAGACCCCTGGCCCCGCCCCCACCCTGTGTAGCCCCCTTGCTGATAGCTGTGTTGTTGTTTTTGTGGCT 

GCTGCTCCCCCACATAAAGCTGACAATAAAAACAAAACTACGCTCACTCATTCACACACTCATGCCGTCACACTCGCAACCCAGAGACGCGCTCA 

CACTTGCACTCACATCACACACACACACAATACACACACTCACACACTATCTGCTGTAACTAAAGACCCAACAAAGCGAATATGAATATTTTGCA 

CAGAATTTTGCGCTTTGTTCTCTTCTCTTCTCGTTTTTTTTTTTGTTTTTGTTGAAGTTCTTCGCGTTTTTGTTTTGGGTGGGTTCAAAGTTCTT 

AGAGCAGGGCGCACTTGTGCCGCTTGGGTCCTCGAACGACAGGACATAGCACGGATCGTATGGCCTCGTCGAAGACCGTCTTCAGACCCTTTTGG 

GTCAGGGCCGAGCACTCCAGATACTTGACCGCAGCTATTTCCTTGGCCATCGCCAGTCCTTGGGGATAGGTGATCGGTGTTAGCTTCTTGTCCTT 

CAGCTTCTCGATCGTCTGCTTATCGTCGCGCAGATCCAGTTTGGTGCCGACCAGGATTATCGGCACACTCGGGCAATGATGACGCACCTCGGGAA 

ACCATTTGGCTCGCACATTCTCAAACGATGCCGGATTCACCAGTGAGAAlACAGATGAGAAAGACATCCGTTTGCGGATAGGATAGCGGCCTCAGG 

CGATCGTAGTCCTCCTGTCCAGCCGTATCCCAGAGGCCCAGATTGATGGGCTTGGCATCCACCATCACATTCGCCGAATAGTTGTCGAACACCGT 

GGGTATGTACTCGCCGGGGAAGGCGTTGGTCGTATAGCTGATCAGCAGACAGGTCTTTCCCACCGCTCCGTCGCCCACAACCACACACTTGATGG 

CCTGCATGATGAAGGCTACTGTACTGGTGGTACTACTGGGCAATCGATGGGTTCAGCAGCTCGATGGGTTGCTTGTGCGCTGGGGGCAGAGGGTT 

TTTCAGTGGATCAGCTGGTTTCGTGCTCGATTTCCAGTGGCTCAACTGGTTTTCTTTGTGGCAGCTTAACTTATAATTACTTTGGGGTAATTCAT 

TCAATTCAGTAGGTCAGACAGTTTTTGAATGCTTCGATATTTGTACTTTTGAACTATTTTGTACTTTTCTTTTTTCTGTTCAACAGACAAATTTT 

AGTCACACAATTTAGTTTTGCCTTGTATCGCAGCTCAGCTAGTTTTTCTAATTGTATAGCTTTTGCTTTTCGCGTTGGCGTTTTGCTTTTATTGT 

TTTTTTTTTTTGGCGATTATACAATAATTTGCTCACGCACTAATACAAATTCTGCCCACACACACACACACATTCAAGCAATCGCGTACAGAGAG 

AGATGCGAATTTGAAGTGGAGTTTGATTTATCGCGAAAGTTGTCAAATCAATATTCTTAATTGAAAGCAGAACCGTGACGGCGTTCTTTCTGTTC 

GGTCGGTATATTATTTATGAATTAATTGGGCCCGCATATGTGGGGATTTGCTGATTTTCTGTGGCAGAAACGGAAAAAACACGCCCAAACGCTGT 

GAAATCTTCGCGGCGATTGAAGAAAATTTCTGCCTCCGCTGGCAGTTGTTGTACTTGTTTTTTCACTGTAGTTTTCGCTCTTTCAACTGGCTTGT 

ACGGCTGTCTCGCTCGTTCGCGTGCGTTAGTCGCTTAAACGAAGCGCTGCGCGCATACGCTCTCCCACACAAAGAGCGAGCGAGCGAACGTGAGC 

GAGAGCGGCACTGGCACAGTGCAGTCCCATAAAATTAGTCACAGTGAGTCATTCGCTCGGTGACGTTAGTCGTGGCAATCCACAGAGAGCAGAGC 

GAGACAGAGATAGTGAGTCTCTTTACTTGACCTTGCTTTGGTCGTTTTTGTAGAGTTTTCTATATTCGTTCGACGGTTTTCCACGAATTTCCCGA 

CCGTTGCTGTTGTTTTGAGCCAATTTCCACGCGACTAGAGGCTATCAGAGATGTGCGGTACCCAACTCAGGTACAAATCGCCCGTAGCTTATCGC 

GGTACAATTACAGTGCCACACCAAAAACAAGCAAAACGTTGATATCGCGCGATATAAATATTAATTGAACAATATTTTTTTTATTGATGAATCAA 

TTATGTGACCTAAATATGTGTCACAGTCTTTGGATGAATATACTTAAATGGAAGTATTCATATTTTAAAATTGGAAACTCAAATTTAGATTCTGG 

AATTACTTTTAATTCTCCTTAATTCGGGCCATATTAATTTACGACCGTTACATCCCATCTAATTCTTAAACGAATTATTAAATTTTTCATGACCA 

GCGATATTTGCTGCCGTTGACTGTTTTAATGGACCACAAAATTGCGTGTGTCGGCAAAAACCAAAGCCCAAGGAATAACAAACAAATAACATTGG 

CAAACAGTGCAAAATATCGACAGTAAAAACCGAAAAGCAAAAACAGAACCCAAACCGAATCCACCGAATCCGCCGCATCCGCATTCAGCCAAAAG 

TGATAGCCGAAATATTTCGTGTTTGTGTTTTGGCTTCCGCCCCACCAACCAGAAGCCCAAA 

(SEQ ID NO: 466) 
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Exon: 2291.. 1001 

Start ATG: 1622 (Reverse strand: CAT) 
Transcript No. : CT14366 

GCGAAGATTTCACAGCGTTTGGGCGTGTTTTTTCCGTTTCTGCCACAGAAAATCAGCAAATCCCCACATATGCGGGCCCAATTAATTCATAAATA 
ATATACCGACCGAACAGAAAGAACGCCGTCACGGTTCTGCTTTCAATTAAGAATATTGATTTGACAACTTTCGCGATAAATCAAACTCCACTTCA 
AATTCGCATCTCTCTCTGTACGCGATTGCTTGAATGTGTGTGTGTGTGTGGGCAGAATTTGTATTAGTGCGTGAGCAAATTATTGTATAATCGCC 
AAAAAAAAAAAACAATAAAAGCAAAACGCCAACGCGAAAAGCAAAAGCTATACMTTAGAAAAA 

ATTGTGTGACTAAAATTTGTCTGTTGAACAGAAAAAAGAAAAGTACAAAATAGTTCAAAAGTACAAATATCGAAGCATTCAAAAACTGTCTGACC 

TACTGAATTGAATGAATTACCCCAAAGTAATTATAAGTTAAGCTGCCACAAAGAAAACCAGTTGAGCCACTGGAAATCGAGCACGAAACCAGCTG 

ATCCACTGAAAAACCCTCTGCCCCCAGCGCACAAGCAACCCATCGAGCTGCTGAACCCATCGATTGCCCAGTAGTACCACCAGTACAGTAGCCTT 

CATCATGCAGGCCATCAAGTGTGTGGTTGTGGGCGACGGAGCGGTGGGAAAGACCTGTCTGCTGATCAGCTATACGACCAACGCCTTCCCCGGCG 

AGTACATACCCACGGTGTTCGACAACTATTCGGCGAATGTGATGGTGGATGCCAAGCCCATCAATCTGGGCCTCTGGGATACGGCTGGACAGGAG 

GACTACGATCGCCTGAGGCCGCTATCCTATCCGCAAACGGATGTCTTTCTCATCTGTTTCTCACTGGTGAATCCGGCATCGTTTGAGAATGTGCG 

AGCCAAATGGTTTCCCGAGGTGCGTCATCATTGCCCGAGTGTGCCGATAATCCTGGTCGGCACCAAACTGGATCTGCGCGACGATAAGCAGACGA 

TCGAGAAGCTGAAGGACAAGAAGCTAACACCGATCACCTATCCCCAAGGACTGGCGATGGCCAAGGAAATAGCTGCGGTCAAGTATCTGGAGTGC 

TCGGCCCTGACCCAAAAGGGTCTGAAGACGGTCTTCGACGAGGCCATACGATCCGTGCTATGTCCTGTCGTTCGAGGACCCAAGCGGCACAAGTG 

CGCCCTGCTCTAAGAACTTTGAACCCACCCAAAACAAAAACGCGAAGAACTTCAAC 

{SEQ ID NO: 467) 

Start ATG: 670 (Reverse strand: CAT) 

MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDAKPIMLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVNPASFENVRA 
KWFPEVRHHCPSVPIILVGTKLDLRDDKQTIEKLKDKKLTPITYPQGLAMAKEIAAVKYLECSALTQKGLKTVFDEAIRSVLCPVVRGPKRHKCA 
LL* 

{SEQ ID NO: 468) 

Classification: enzyme 
Gene Symbol : Rac2 
FlyBase ID: FBgn0014011 



Celera Sequence No. : 142000012746884 

TCGTTTATCTTrGGCAGCGCAGCCGTTCTTGTAAACATCCTAAAGGCTGACCTAAGCAGATTTGACTGCCCTCTTTCAACGCTACCTAATCTTAA 
GAACCCAAGAGCGAGGCTCTCCCGAAAATACAAATATTGTTCAAATACTGAGGCTTCTCCTCAATCCAATTTGCATTTGATTTTTAGTCTTAAGC 
TGAGATCCAAAGAATAAAGTCGTGAAACTATTTCTCCTAAAAACTATTTTTTATTTCTTGGCGTTGTCCTTAGTCAACTGACGGGACATTAGTTC 
GACTCATAAATAAAACAACAATTTTACTATATATATACTACATAATAGTCATACCCATGGGGAAACCTCTTACTTCTCGGCTATCGCAAGGAAAG 
TATATGCCTGAACTGCGCTATATAGAGAAAAGCGAAAAGCTCTCGAGAATTGGCACACGACCGATAAGCCCGCCGAAGCAATCAATGGACAATTG 
ACCTTCGTAATTGAAGGGGGGCAGGTTGTGGACCACTCACCTAAGCTCATGGCGGTCTTGAAGTATTCCGGCGGAACATAGCGTCCGCCCATGCG 
CAAATAGGTATCGCAATCGGACACATAGCTGGGGCATTCGGCCTGCGGACAGGGGCACGTGGATGAGGACGGTGGACCGCAGCCGGTGCTGGCTA 
GTGGCATGTATACCGAGTTCCCGTTCCCATGGCACTCGATGCCCAGTATTACGGATTCCGCAGAGCTGTTCTTCACCAGTAGCTGCAGCTTGAGC 
ACGGCTATCCAGAATTTCTTGAGCTCGCCGAAGGTGAGTTTGTAGCCCAGTTTGTAGCGCATATCACGTACATCCTCCTCGGTGAGGGAGAGCAG 
CACCTCGCCGTCGATGGCCTCCTGGCGCAGGCAGTCGAGCAGTGTTCTCGAAAAGTGCTCCATGCAGGTGGCCCAACTGGTCACATTCTCGATGG 
ACCACTGGGCCACGAGTAGCGGATCCACGGGCTCTGGATCTGGCGATCCATTGGTCAGGTGCTTATCTGTCGCCGCGTCGAGGTAGTGCGTCCGG 
GTCTCCTGATCCATTGTGACCAATCCACCGCCCTCGGAACGTGGCTGAGTCACCGGATCGCCAATTTCACCGTCGCACATCGCGCTTTCTTTTTT 
TTCCGTTCGATGTTCTTCTGGCTGTTTCTTATTCGTTCCAATTTTGCGATTTATCCAAAATCCAAAACAATTGTGCGGCCGAACTCAGATGGAGT 
GCAACATTTTTTCACTGCTGGCTGCTGTTGCTGCTGCGGCGCATAATTAACACCTCACTTATTGCGCGTCCAGCGTTTGTTGATGTAAAAGCACA 
TTTAATTTTTGGTTTGGTTACTTGTTAAATTCAGACTACTCAGTTGTTCCGTTCAGAGAGCAAGCGAGAGAGCAGAGCGGAGTCAGCAACTATGC 
TGGGGGCGGGAGCGAGATAACAAGGCCGAACAGCTGACTTCGGCCATCGATAGTTTTTGGACAACGCCAACGCTATCGATGAACAGTTACTTCGG 
TCTTAACTGCGTGTTCACACCGAAAAATTTTCGAGAATTTCGCTCTCGTAAAAGTCAAAAGGTTTAAAACTCTTATCGTTAGCTTAATATTATTT 
TAAATAAAACAATTTGAGGTGTCTGGCGAGTAGAAATTATTTAAATATTAACGCTTGACTGATATAAATCTCGTTCGATATATACGAAACCATGG 
TGAAAATGTGGAAGAGTGGAAACGGCCTCCAAAGCGAAGATATCGAAGCCACAGCTTCGCTTCGTTAGCATTTTGCGAGCGGCAAAGTGATTTAA 
TTGTCCTCGTGCGAATGTCTAGACAGAAGTAATCCTCGAATAATATGAAGAAAGCCAAGCCGCTGCTGCACGACCATCTAATAACGATCCGAGTA 
AAGAAGGTGTGTCCAAATCCGTTGTTTATATCCACAAACTAGCCAGCTGCCCGATAGCCGGTAAATTCCCATGAAAGCAGCACAAAAAAAAAAAA 
AAAAACACACAAAAACCCTTTGCAGGGCGGCGATTTTGGGCAAGGGAAAACAAGGTTAGGTCGCAGGCTACCCGCATCGTGTGCTAATTGCGTTC 
TACTTTCGATTTCAGTATCTGGAGGAGCACATCGGGGAAACCTACCTGGATGTGAAACAGATGACCAGGGAGTTGATGCAAAAGTATCCGGAATA 
TTCACGACGCAAATTCGGACCATTCAGACAGCTGGTGCACCAAGGTAAACATGGAACATTGTGGTTATTTGTTTTTAACAAATATTTCTATTACA 
GCATTCTCGATCATTTCGGAAAGCTATAACTTGGACAAGGTAAGCAGTTCCGAGGAAGATTGTGTTAGCGAGGACTCCGAGCCACCACCCACGAA 
TAGTGTGATGAACAATATGATGAACAGCCTGTATAGT 
{ SEQ ID NO: 469) 

Exon: 1412.. 516 
Exon: 3 65.. 355 

Start ATG: 1125 (Reverse strand: CAT) 
Transcript No. : CT14406 

CTCCGCTCTGCTCTCTCGCTTGCTCTCTGAACGGAACAACTGAGTAGTCTGAATTTAACAAGTAACCAAACCAAAAATTAAATGTGCTTTTACAT 
CAACAAACGCTGGACGCGCAATAAGTGAGGTGTTAATTATGCGCCGCAGCAGCAACAGCAGCCAGCAGTGAAAAAATGTTGCACTCCATCTGAGT 
TCGGCCGCACAATTGTTTTGGATTTTGGATAAATCGCAAAATTGGAACGAATAAGAAACAGCCAGAAGAACATCGAACGGAAAAAAAAGAAAGCG 
CGATGTGCGACGGTGAAATTGGCGATCCGGTGACTCAGCCACGTTCCGAGGGCGGTGGATTGGTCACAATGGATCAGGAGACCCGGACGCACTAC 
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CTCGACGCGGCGACAGATAAGCACCTGACCAATGGATCGCCAGATCCAGAGCCCGTGGATCCGCTACTCGTGGCCCAGTGGTCCATCGAGAATGT 

GACCAGTTGGGCCACCTGCATGGAGCACTTTTCGAGAACACTGCTCGACTGCCTGCGCCAGGAGGCCATCGACGGCGAGGTGCTGCTCTCCCTCA 

CCGAGGAGGATGTACGTGATATGCGCTACAAACTGGGCTACAAACTCACCTTCGGCGAGCTCAAGAAATTCTGGATAGCCGTGCTCAAGCTGCAG 

CTACTGGTGAAGAACAGCTCTGCGGAATCCGTAATACTGGGCATCGAGTGCCATGGGAACGGGAACTCGGTATACATGCCACTAGCCAGCACCGG 

CTGCGGTCCACCGTCCTCATCCACGTGCCCCTGTCCGCAGGCCGAATGCCCCAGCTATGTGTCCGATTGCGATACCTATTTGCGCATGGGCGGAC 

GCTATGTTCCGCCGGAATACTTCAAGACCGCCATGAGCTTAGCCGAGAAGTAA 

(SEQ ID NO; 470) 

Start ATG: 288 {Reverse strand: CAT) 

MCDGEIGDPVTQPRSEGGGLVTMDQETRTHYLDAATDKHLTNGSPDPEPVDPLLVAQWSIENVTSWATCMEHFSRTLLDCLRQEAIDGEVLLSLT 
EEDVRDMRYKLGYKLTFGELKKFWIAVLKLQLLVKNSSAESVILGIECHGNGNSVYMPLASTGCGPPSSSTCPCPQAECPSYVSDCDTYLRMGGR 
YVPPEYFKTAMSLAEK* 
(SEQ ID NO: 471) 



Celera Sequence No. : 142000013384043 

TGGCCGACTATGCCCGAACAAAACAGCCGGGAGCCGTTCATCCTTTCGCCCAGTGGTGGCGAAATGTTCCAGGTGCTGGTCACCCTGCTACACAA 
TCTGGAACGGGAACTCTCCGCCAATCTGTTCACCCAAACGCTCCGTTTGATAGCCCATCAAATCGATGACTTCATGCTCGAGAGCATGGTCATGA 
ACACGAAGTTCTCAGCTGCGGGAGCAGCGCAATTCAATTATGACATGACACGCAATTTATTCGCGCTCTTTGGACAGTATACACGTCGTCCCGAG 
CTGCTCTTCAAGCGGTGAGTCATCTCTACATTTGCTATGGTGCACAATAAACATAAATCTGACACCTTTCATTTCCATTCGATTCGCCAGAACAC 
ACGATGCCTGCAAGCTGCTGGCGGCGGCACGAGGAACAGCTTTGTTGCTGCTGGAAACGCTGCGTGGTAATCAATCGGTGGAGGAGAAGACGAAG 
CCGCTGAGGGAACTGCATGTGCTCAGCATGGACAGCAAACAGTGCATTGAGGTTCTGGAGCGGAGGATGGACATAAAGATGTTTTAGGTGATGGA 
AATTAATTTAAGCTAGTTAATTGGGCCCTAGGAACTTTTGAAATGATATACAACGTTCGGACGCCGTTTTTTTGCGAAATAAAATCATTAACACC 
CAAAATGAAAGTTAAAAGATGTTTATCGTAATTTACATTCATTTTATTTGAAGAAAGGCATGTGTATAAACATAGTTATCATGACGGAGATTTAA 
CTAATTTGTATTTAACCAGGACACAGTTTCGAACTCGTGCGAATGCTAGTAACAAACGACCAAGTTGTATGAATGAGATATAATTTTGAGTACAT 
ATTTACAATTAAATAGCAATTCGTTGCTCTAGGCGCACAAGTCTTGAATGTTTTTTGTGATAATTAAACATTTAGTTGGAATTGTGGTCTCAATG 
TTTGGTTTAATTTGTAAAACGTTGACACAAAAACATGCGGTAACTTTAACTTTAATAATAAATTGATCTAAAAATACATAGCGTAAAATAGCATA 
TAAACCGGCCTGATCGGGCTCGAATCGCATATCAATACTGATTCATGGCTGATCAGGGCGTATTGTATATCCCTATCGTCGGATGACAAACAATT 
GCAGTTAGTAAATACCAATTTGCGACTCGAGACGCACATCACACATCATCATCACATCGTGGAACGGACCAAGGCGCATTGCCCGCAGAGAGTAA 
CTTTTATGTCTACACTAAATAACCGATAATAATCCGATCTGCGGGCGCCGTTCGCCTTTGTATCCCCCCCTTTCTATTCAAAGTTCTGATAATTC 
TCGGCGCTGACGAACTGTGCCCGTTGCTGGGGTGTCATGTTGAGATTACCGCGGTTCTTGTTGTTGTAGGCCCAGATGACGCCGGTGGAGATCAG 
GGCAATGGTCAGCCAGCCCATGAAGAACAACTGCAGCCACATGTGCTCGCCGCCGCTGTGATCCACGATAATTCCAGCCACGATAGTGATGACCG 
CCAAGCCAAGATTCTGTATCGATTGGCAGAAGCCGTAGGCCGTGCCCAATTGGTATTCCGGAATGATGAGAGCCACCAGAGGCCACAGGCTTGCG 
GCCAGCATGGAATACGACAGTCCCATGATGATCATGCCCACATAGGGTGTCAGCTGGGTGAACGTAAGCAGGGCATGGGCTCCAATGGTGGTCAG 
TGTCGCTGTGAACACCCAGGTCACATTCCTACCCAGTTTGTCGATGATGAAACCAAAGACCGGAGACGATACGGCTGCAATTAGATACACCAGCG 
AGTCCACGGTGTTGGCCTCAGCAGGCGAAAGCCCGAAGCGATCAACAAAGAAATTCTGTCCGAGGGCAATGAATGGGAAGATGGCCACATAGTAG 
GCGACGCAAATGATGGATACCATCCAAAAGGGTGGTTTGAACGAAAAGACATCGGTGAGTTTCGGTATCTGTCCAGCCGGGTTGTTGTTACGCTG 
CAAAATCCGCTCGGCCCTCTTGTCCATCCAGCCGAGGATGAGGGCACAGGTCATCGACATCACACAGGTGAGGGTAGCCAGCAATAAGACGACGC 
CTAGAGCGGTATGTCCCTTGTAAAAATTGCTCACATACTCATAGATGGGCTGCATCACCCAGAAGTTGACGGTGCTGCCGAATCGGGCAACCGAC 
AGCTGCAGGCCGAATACCATGTTCAGTTCCTTGCCCTTAAACCAGAGCACCGCATAGCTGTTCTGGGCCACCGCCAGTGATTCGGCGCCAATGCC 
GAAGATAAACCGTCCCAGGATCATCATCCAGAAGGCGTCCAGAATGCCGCCGCAGGCAAAGATCAACTGACCCACCAGCAGGATCATCATGTAGA 
TAATCGTGCCCAGTCGAATGCCAAACAGTCGATCGATCAGGAAACCTCCCACGAAGCACAGGACGACATTGGGCCACGAGTAAATCGAGTAGATG 
AGTGTGAACTGGGCGGAGGTCAGATTAAGATCTTTCTTGAAATAGTTCTGCAGGGCGCCGGGTGCATCGTAACAGAAATAGGAGCCGAATCCCAG 
CAGGCACATGAACACCAGAGCCATGAACCGGTGGCTCGTGCTCGAGGGCATGCAGCAGCCGCCGGACGGAAGGCTCAGTTCATTGTCACGCCGAC 
CGGCCCGACTGACCACTTGGTCCTCCTCCGTCGGTCGCGACACCTCCTCATTGTCCACGATGCGTTCCTCGTCCTCGCGCGCCATCTCCACTCAC 
CGCTGGCACAATCCGCACTGACCAGTTGGATGTGGAGCCGAATCCCGCAGAATGGGCGTATGCAAAACTATGTGATATGATTAATCGGCTTTCTG 
ATTTCCTGCCGCAGTTGTTTTTTCTTTTGCTTTCGTTTTTCTCTTTTCGTATTTGCCGTTTCTGCAAAATATTTGTATCAATTTCTGCGAACGAG 
TGAGCGAGTTGAACTCTTCTGCTTTGTCTTGGCTTATCTCGCTGCTTTTGTTTTTGTTTACTGTACTAGTACCCTTTGTCTAACAACTATTGTTG 
TTGTTGGCTGTGTGGGCTAGTTAGTTGTCTCTGGGTTCACTGGTTATCAGCAGTTTTTCCACTCCACTTGCAGCTGAAGTTCGCAGTGCGGTGTT 
GGGCAACAAATGTTCGGTGACGTGTGCAACCGCACTTTATTGCTGATCGAATAAAACTAAATTCTTTCCTCTGCCTAATTAACGTTAATTACGCA 
TATACAATGGCGGGGCTGCCAACTCGCTGTAGAGCAGGGTTGTCGACGCAGCTACAGGTGCATTCGAACAGCTGGCAGGCGATAACGATCACAAA 
TATGAACGGTATTCCTGTTTTGCTCTCCATTATAATCGAAAGTTTACTCTAATTGAAATGCAAATTTTAGTAAGCCATGATTATTTATATAATAG 
AAT T AT AT AAAT AAAAAT AAAAAC GCT T GAT TAT AT T AAT AAAAC AAG AT AAT C AT T AAT T GAG AAA C TAT TT AAGT AC AAG AGC T C AACGACGC 
AGAAATCTCTTGCATCTTGTTTTGATTTAATTTAGTATAGCCTGACAACCCTAATCCAGCAACGCCGAGTTGAAGCATGTTTTCATACTTTTTAC 
CGGGTAAAATGTATTTAATTATATAATTCAGCGGCCCTTGCAGCAATGGATCCCATCGACACCACCAAGCGAAAGCCCAGACGCACCTACGGCAC 
GCCATCCTACACGTATCGCAATCGATTTGCCTATGCCCTCTTGGCCGCCGGAACTGTGCTCTTCGGCATTTGGAGGTGAGCAGATCTTCGAACTT 
AACGTATAAATGTGGTTTTAATGCTTGCCACCGCCTCGTTTTCAGCCTGACTCCCATCCAGCGCATTGCCAACGAGAAGCTGTCCCAGGCGGTGG 
CCCAAACCGAACAGGAGCGCGATAGGAAAGGACTCTTTGAGTTCGGCGCACCAAGGACTTCGCAGTTCATCAGGGACGCCATCAAGGAGAGCGAG 
GAGCAGAGGAAC AAAT AAT GGATCGCAAGACGCGGCGAGCCAACTTCAAAAACCTGCAGCCTCGGTAATTTTAATTGAAATAAAGGCC AAT AAAA 
TGAATGTCATTATCTAACCGAACACTTTGTTTTCCTTTCATCCGAAGTTTGACGCGAAGCACCAAAGCAGCCGCAGATACATTATCGTTATCCAA 
CTTCTTTGCCACACCCGAAAATGCTGAAGAAGTTCCGTCATGTCTCGCCGACATTCCTGTTCAGCCCGATCCTCCTGCGCCAAAAATCAAACCAA 
(SEQ ID NO: 472) 

Exon: 3275.. 1001 

Start ATG: 2745 (Reverse strand: CAT) 
Transcript No. : CT14536 

CGACAACCCTGCTCTACAGCGAGTTGGCAGCCCCGCCATTGTATATGCGTAATTAACGTTAATTAGGCAGAGGAAAGAATTTAGTTTTATTCGAT 
CAGCAATAAAGTGCGGTTGCACACGTCACCGAACATTTGTTGCCCAACACCGCACTGCGAACTTCAGCTGCAAGTGGAGTGGAAAAACTGCTGAT 
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AACCAGTGAACCCAGAGACAACTAACTAGCCCACACAGCCAACAACAACAATAGTTGTTAGACAAAGGGTACTAGTACAGTAAACAAAAACAAAA 
GCAGCGAGATAAGCCZ^GACAAAGCAGAAGAGTTCAACTCGCTCACTCGTTCGCAGAAATTGATACAAATATTTTGCAGAAACGGCAAATACGAA 
AAGAGAAAAACGAMGCAAAAGAAAAAACAACTGCGGCAGGAAATCAGAAAGCCGATTAATCATATCACATAGTTTTGCATACGCCCATTCTGCG 
GGATTCGGCTCCACATCCAACTGGTCAGTGCGGATTGTGCCAGCGGTGAGTGGAGATGGCGCGCGAGGACGAGGAACGCATCGTGGACAATGAGG 
AGGTGTCGCGACCGACGGAGGAGGACCAAGTGGTCAGTCGGGCCGGTCGGCGTGACAATGAACTGAGCCTTCCGTCCGGCGGCTGCTGCATGCCC 
TCGAGCACGAGCCACCGGTTCATGGCTCTGGTGTTCATGTGCCTGCTGGGATTCGGCTCCTATTTCTGTTACGATGCACCCGGCGCCCTGCAGAA 
CTATTTCAAGAAAGATCTTAATCTGACCTCCGCCCAGTTCACACTCATCTACTCGATTTACTCGTGGCCCAATGTCGTCCTGTGCTTCGTGGGAG 
GTTTCCTGATCGATCGACTGTTTGGCATTCGACTGGGCACGATTATCTACATGATGATCCTGCTGGTGGGTCAGTTGATCTTTGCCTGCGGCGGC 
ATTCTGGACGCCTTCTGGATGATGATCCTGGGACGGTTTATCTTCGGCATTGGCGCCGAATCACTGGCGGTGGCCCAGAACAGCTATGCGGTGCT 
CTGGTTTAAGGGCAAGGAACTGAACATGGTATTCGGCCTGCAGCTGTCGGTTGCCCGATTCGGCAGCACCGTCAACTTCTGGGTGATGCAGCCCA 
TCTATGAGTATGTGAGCAATTTTTACAAGGGACATACCGCTCTAGGCGTCGTCTTATTGCTGGCTACCCTCACCTGTGTGATGTCGATGACCTGT 
GCCCTCATCCTCGGCTGGATGGACAAGAGGGCCGAGCGGATTTTGCAGCGTAACAACAACCCGGCTGGACAGATACCGAAACTCACCGATGTCTT 
TTCGTTCAAACCACCCTTTTGGATGGTATCCATCATTTGCGTCGCCTACTATGTGGCCATCTTCCCATTCATTGCCCTCGGACAGAATTTCTTTG 
TTGATCGCTTCGGGCTTTCGCCTGCTGAGGCCAACACCGTGGACTCGCTGGTGTATCTAATTGCAGCCGTATCGTCTCCGGTCTTTGGTTTCATC 
ATCGACAAACTGGGTAGGAATGTGACCTGGGTGTTCACAGCGACACTGACCACCATTGGAGCCCATGCCCTGCTTACGTTCACCCAGCTGACACC 
CTATGTGGGCATGATCATCATGGGACTGTCGTATTCCATGCTGGCCGCAAGCCTGTGGCCTCTGGTGGCTCTCATCATTCCGGAATACCAATTGG 
GCACGGCCTACGGCTTCTGCCAATCGATACAGAATCTTGGCTTGGCGGTCATCACTATCGTGGCTGGAATTATCGTGGATCACAGCGGCGGCGAG 
CACATGTGGCTGCAGTTGTTCTTCATGGGCTGGCTGACCATTGCCCTGATCTCCACCGGCGTCATCTGGGCCTACAACAACAAGAACCGCGGTAA 
TCTCAACATGACACCCCAGCAACGGGCACAGTTCGTCAGCGCCGAGAATTATCAGAACTTTGAATAGAAAGGGGGGGATACAAAGGCGAACGGCG 
CCCGCAGATCGGATTATTATCGGTTATTTAGTGTAGACATAAAAGTTACTCTCTGCGGGCAATGCGCCTTGGTCCGTTCCACGATGTGATGATGA 
TGTGTGATGTGCGTCTCGAGTCGCAAATTGGTATTTACTAACTGCAATTGTTTGTCATCCGACGATAGGGATATACAATACGCCCTGATCAGCCA 
TGAATCAGTATTGATATGCGATTCGAGCCCGATCAGGCCGGTTTATATGCTATTTTACGCTATGTATTTTTAGATCAATTTATTATTAAA 
(SEQ ID NO: 473) 

Start ATG: 531 (Reverse strand: CAT) 
MAREDEERIVDNEEVSRPTEEDQVVSRAGRRDNELS 

IYSWPNVVLCFVGGFLIDRLFGIRLGTIIYI^ILLVGQLIFACGGILDAFWmiLGRFIFGIGAESLAVAQRSYAVLWFKGKELNMVFGLQLSVA 
RFGSTVNFWVMQPIYEYVSNFYKGHTALGVVLLLATLTCVMSMTC 

AIFPFIALGQNFFVDRFGLSPAEANTVDSLVYLIAAVSSPVFGFIIDKLGRNVTWVFTATLTTIGAHALLTFTQLTPYVGMIIMGLSYSMLAASL 
W PL VAL 1 1 PE Y QL G T A Y GFCQS I QNL GL AV I T I VAGI I VDH S GGE HMWL QLF FMGW LT I AL 1ST G V I WA YNN KNRGN LNMT P QQ RAQ FV S AEN YQ 
NFE* 

{SEQ ID NO: 474) 

Name: permease, putative 
Classification: transporter 



Celera Sequence No. : 142000013384043 

AG AAG AAG AAT AAT AT T CAG AAAGAAGT T C AT C AAAGG AAT T C AC AAAGGAG AAGC AT T C AAT G C AC T C C C AAAAAAC GT GGT C G AAAAC C AAAG 
CAACCGGCTAAGAAACTGCAGTCAAGAATTTCCACCGATCAGCTAGGAAGCACACCATCGCCATCCAAGCTGCCTGCCAAATATCCCGCCCATTT 
TGTTCATCAACTTCCGCCCGCTGAGGATTCCAGCTGTGCCACTCGAAGACGATCCTTATCTGCTCCAGCTATGCAACTTAGCGATCTAAGCCTCA 
CACCCGATGAGATCCAAACTCCGGAGCGGCGCGGTGAAATCGTAAACAAATCGCCATCGGATCTCGACTCCGATGTCTCCTTGAACTTCTCGCTG 
TCCGATTCCATTGGC GAG AT CT T CGGC AC C AAGG AC AT CAG CAG CAT T C T GAC C AT GC AGC GGC CG C GT C AG T AC AT T T T GC T GG AGG AG C AT CT 
GCCCACACTGGCCACCATGTTGAACGTTGACTTGGAGCGACTGCGCTGCGTGCTGGATATCACTCAGGGCTTGAGCCACGAGCAAATCCTGCGCT 
T T C C GAT AAAAC AT GAAG AGG AT G AATT GG AAGT ACC T TAG AT ATG AT GGGG AAC T T AT AGC AGT AGT AT AAT GT T TTCACTTT GAAT AGT T T T A 
ATGTCCAAGTCCAACATATTTAATTCCCATAAGATCCTTGTTGCTTTAACTTAACATTTAAAATGTATATCTCCCTTTTTTTTTGTATATTTTTA 
AGTGGTATTTAATTGTTGTTGTTTTTTGTTGTTGTTGTTTCTTTTTCGTTATTTTGCCTTAGACTTTCAGAGTGCACAATTTTTGTTTAAACCAT 
TTACTTTTTTGTATATTTTAAATTACTAATGCTAAACGTTATCTAGACTTTATCATACTTCATAATTGGTTTCCTTTCCTCCACGACAATACAGC 
AGCAATCGATCGGGGTTTTTTTTTGTATCTGTTACTAAGTTTTCTTTTTTGTTTACTGTGTTTCTCATTTAACGTATATACAACCGCGTGAATTT 
GGAGTTGCAGCTCCCTAGTAGTCGCTGTCGTAGCTTCCGTCCTGCTGCGCCTGCGTCGAGTCATCGGAACGCTCCGAAGTGGGCTCCAGATCAAC 
GTCCAGGGCCGTAGTGTTGGCCGGACAGCAGCTGGGATGAGCCATTGAGTCGAAGGCATCTTTGATTTGATTGTTGGCACCTTTGGGATCTAGGT 
CGGTGGCCCAGCTGAAGTGCATCAGCGCCAGATCCATGTTGCCGAGCGTCTTGTGGATTTTGCCGATCAGATAGAACACCACCGACTCCTTGGGC 
ACGACTTCCTTGAGCTCCTCCAGTTCGCGCAGCGCCTCCTGGTACTTGCCCAGCGAGAAGTAGATGGAGCCGCGATGGAATCGCGTCAGCGGGTT 
CTTGGGGTCCAGCGTGGCGGCCGTGTTTAGCGTCTGCAGCGACAGATCCTTCTTCTTCATGTAGAATTGCATGGCGCCGATGTGCACGAGAATAA 
CCGAGTTCTGTGGATTGATTTTCAGCGCCTTCACGTAGTGAATCTCAGCCAGCTCGTATTTCTCCTGCTTGGAGTAGATGGTGCCGATGCCGTAC 
CAAGCATTGTAATGCCGCGGATCTCTAACCACTGCTGCACGGAAGTAGTCCATGGCCTTATCAAATTCTTCGGTGAGTACGAGTTCGTGGCCCAA 
CAGTGTGTAGCTGTAGACGAAGTCTGGATCCACTTGGACAGCCCGCTTGAAGAACTTGATCGCGGTTTCGTGCTCCTTTTGCAGCGAAAAGCAAT 
TGCCGGACACACACCAGGTAACCGGGCTGGTTTTGTCCTGGTTAATCAGATCCTGGGCGAGAGCTGACAGCTCAACCTCGCGCTGCAAATGCCAC 
AGGGATGACGAGTATATCTCCATGTAGTCCAAGCGACATGGCTCTGTCTTGTGGATGGTCTCAAATATCGCCACCGCCGCCTCATACTCACGCAT 
CTCGTACCTGGCCAGTCCGATGAGCGACTGCACCCAGCTGGAGTTCAGGTGGTGCTTTGGTATCGTCGTCTCCAGCTGCTTTATGGCCGCCTTGC 
ACTGGAAGTTCGAGAGCAGCTGATAGGCTTCAGCTAAACCCCGCAGCAGGGCCATCAGCCCATCGGCCGATTGTTTCTTGAGTCCCATCAGCTGG 
TGGGCCATCGTCTGGGCATTGTTTAGGCTATTGTTCAGCAGTACTTTGGCCTCCTCGGCGGCACTGCGTCCACCGCTGTTGTTATTCGCTCCCGA 
GGAAGTAATAGTCTCCACCTTCTCCTTGCGCTTCTCGGAGAGGTGGTGGGACTTATCCTCGATCAGTTCGTTATTTAGGCAGATCTTGGTCATCC 
GTGACTTGGCTTTGCGGGGCGGCGAACGTGGCTGCACGAACTTGTTGGCTATGTTGGGCGATTTATTGTTCTCCTTCACCGAATACGAGTTGCTG 
AACAGCCGCGAACTGCGACGCACTGCAGCATTCGGATTGGGCGGCACATTCATATTGCCATTGTTGCCCACATTATTGTTGTTCGGAGTGCGGGG 
CGTTATGTTTCCGGCTTGCGTAAAGACGGCGGGTTTATTAAGGCTGCCCTCTTTGCGATTGATGAGTCCGCCCACATGTGTCTTTAGCTTTTTGC 
CCATCATCTTCGGCTCCTGGTTGACCTCGACCAACATCTGTGGCATCGGCGAATAGCTTATATTCATAACGGGCGTGTGGTTGCCAATGAAGGAT 
CCATCGTTGCCGGTGCACGGACTAGTTAAAGGCATGATGCCGAAGCTCGGTGTTGGCGGTGAGATGGCCGATAGGTATTTGAACTGTTTGCGAAA 
CGGAGTACCGCTGCTCATGTCGTAGAGCTGCTGGTTCTGCTGATACTGGCCGGCCGGGTCCTGCGGAGCTCCCATAGGTGTGTCCTCCAGCATCG 
CCAGCATGCTGGAGTTTTGTACCAAACCCCCGCGCAGCATGCTGATCGAACTATTGAGATTGTTGTTATTATTATTATTGTTGATGGGCGTCACC 
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ATGTTGCTGTTGTGGTGCTGGTTCTGATTCTGGTTGATCTGTTGTTGCTGCTGATCCACGGGCGTGGTCAGAATGTAGTTGGAGACATTGCTTAG 
GTTGGTGATCAGCGACTGGCGCTCCTGATGCTGTTGACCCTGCTGCTCGGCTCCGAACAGCACCATTGCATTGGCATTTACGCTGCTGCCCTGGC 
AGGTATTGAACACATCTGTGCTGTGGATTTGGAATATGGCCGCCGCATCTGTATCCTGGCCCAGCAGGCACAGATCCGCGAATGCGTGCCACATG 
AAGGGGTTCAGTTTGAGGGCGCGCCTTAAAGCACTAACGGCCAGTTTATTGCGCTCGGTCCTCATGCAGATCTGGGCCATTAGCTGATAGGCGAA 
GCAGGCCAGATCCCCGAAGTCCCGCTGCAGCTCATCGCAGTTCTTGGCGTCCGCGAATCCCGTGGAGATCAGGGCACTCTCCGCCTCAGCGTATT 
TTTTCAGCTCGTAGGCACATTTGGCCTGCAGAAAGCGGCACTGCGGCGAGCGGCGTGCCTTCTCCTTGAGCAGCCAATATGCCTGGTGCACCTGA 
TTGGAGCGGAAGTAGCTGGTGGCCAGCAGGAATATCGTCTCGTCGCTCTCCACTGCAAGGACACGGTGAAAGAGAAAATCAGTGATATACCCGGT 
CAGACAAACGAGACCTTGACCTATACCTTCTGAGCACAGGCGTTCCGACAGGAAAACGGCATCCTTGAAGTCGTAGTAGTTCAGGCAATGCCATA 
TGGCGGCCTTTGGGAGCGAAAAAAAAGAAGGGAGAAAATTTGGTTTTTTATAGAGGCCAGGTGGACCGAGGAGCTATTGATTTACCTGCACGGGC 
TCTTGAATCATCATGTCTGGGCGGTATGTGCGACTTTCTAGGGGCGCTCTATGAGATGTACGTTCACTGTACTTCAGGGCTTTACAATAATTCAA 
ACATGCAGTGTGTAGAACTCAAACAAACTATCATGTCGCCCCCCACTCGTACACTCTTCTGTTTGTATTTGTATTCGAATTGTTTTCTTTTGGCC 
AAATTTTTTCCAACAATATCGAATGGGAAAAAGTAAAAGTTTGGCCTGTGTGCTGGCTGGCTGTGTGGTATGTGAGTGTCGTATGTAACCCGATT 
GCGGATGCGGCGATGCAAGTGGCGATGGCGGTGTCTCTCGCTCCCTGGCCTCCACTCCACTCCACCTTCCACTTCCACTTTTTCTTTATATTTGA 
TATTGCAATTGGTTTGAGTCGAGCCTTCGCTTCGCTTCGCCCTCTGTACACACACACTGCGAACGGGGGGTCGGTTTTCTACTATCGTTTTCGGG 
GCTTATCTCTCGTTAAAGACGCACACGGGGCTGATCTGCGATTGGCGGCGACACTGGACTACACCTTGGCCACATTTTAGCTACTTGAAACACGC 
TGCATACGCGTTGCATTCCATTTTATTTGAAAAACCAAAGCTAAAATAATTGCGAGCTGTCAAAGTATCGATTGCACGCGACCCTGCCAATCGAT 
TGTTTATATGGTTGAGGAGGACCGTATCGCACTAGCGCAAAAGTACCAAGCTCTTGACACGAAGCTATTATAAGCTGTTTCCACTGATACATAAC 
TTTGCATTTTTTTAAATGCCAATTTAAATTGTAAATTAATATCATTAAAGAAGGTCATAAAATTAGATATGTATCCCAGCTACTCAAATATATAC 
ATAACTAGACTTCCTTACAGCGCGCTTTTATTAAATACCTAATACTACTAGCTTAAACGAGATTTAGTTATAGTTTTTTATCCAATTCTACTGCG 
AATCGCTGGAACTGGAGCTTGAGCTATCCGTGGACATGACCTCCACATCGTCCTGGGCGCGCGTCTCCAGGGCCACTGTGTTCCCAGAGGCGCCC 
ATGGTCATGTGCAGCTCTCCGAGATTCCAGTTAAACTGGTTCTGGAAGGCATTGGGCACCTCGCCGCCGGCGCCACTGCCACTCCCACTTCCTGC 
GCTACCGGACAGCGTTCGCTGCTGCTCCGAGGGCATGCCGCTGTTCTGGTGGGTCACTGTTCCGCCGTTGATGTTCTGCTCGGATTTACCCAGCA 
GCCCGTTGCGCATCTCAATGTCCGTGGGATAGGGACGCCGCAGGTCACCTATGCACCAGGTGAGCGGAGCACTGACCGCATTGGCGGAGCTAATC 
CGATGTGCGTATTTGATGAGCTCCTCGGATGAAACCGGTCGTTTGTTGGCCTGATTGATGCTGGCCAGTTTCTGTCGCGCTTGGAAGATGGCCGT 
GGACAGTATGAGCTCCGCATCCTTTAGCGACTTTTGCAGCTTCTGGATCTCACGGTCCTAAGCGGGAAAATCCAGATTATACCTTATTTTTGGGG 
AATGCAAAACTTCTACTTACGTGCACCTCGACCTTAGCGCGCAGTTGGTCCATCGCCTCCTCCACTTTCGCCTGCTCCTCCGCCAGTTCCAGCAT 
TTTGCGAAATTCCTCATCTTTGGCCACCAGCAAATCCA 
(SEQ ID NO: 475) 

Exon: 4548. .3886 
Exon: 3807.. 3732 
Exon: 3662, .1001 

Start ATG: 3909 (Reverse strand: CAT) 
Transcript No. : CT14558 

GGTCGCGTGCAATCGATACTTTGACAGCTCGCAATTATTTTAGCTTTGGTTTTTCAAATAAAATGGAATGCAACGCGTATGCAGCGTGTTTCAAG 
TAGCTAAAATGTGGCCAAGGTGTAGTCCAGTGTCGCCGCCAATCGCAGATCAGCCCCGTGTGCGTCTTTAACGAGAGATAAGCCCCGAAAACGAT 
AGTAGAAAACCGACCCCCCGTTCGCAGTGTGTGTGTACAGAGGGCGAAGCGAAGCGAAGGCTCGACTCAAACCAATTGCAATATCAAATATAAAG 
AAAAAGTGGAAGTGGAAGGTGGAGTGGAGTGGAGGCCAGGGAGCGAGAGACACCGCCATCGCCACTTGCATCGCCGCATCCGCAATCGGGTTACA 
TACGACACTCACATACCACACAGCCAGCCAGCACACAGGCCAAACTTTTACTTTTTCCCATTCGATATTGTTGGAAAAAATTTGGCCAAAAGAAA 
ACAATTCGAATACAAATACAAACAGAAGAGTGTACGAGTGGGGGGCGACATGATAGTTTGTTTGAGTTCTACACACTGCATGTTTGAATTATTGT 
AAAGCCCTGAAGTACAGTGAACGTACATCTCATAGAGCGCCCCTAGAAAGTCGCACATACCGCCCAGACATGATGATTCAAGAGCCCGTGCAGGC 
CGCCATATGGCATTGCCTGAACTACTACGACTTCAAGGATGCCGTTTTCCTGTCGGAACGCCTGTGCTCAGAAGTGGAGAGCGACGAGACGATAT 
TCCTGCTGGCCACCAGCTACTTCCGCTCCAATCAGGTGCACCAGGCATATTGGCTGCTCAAGGAGAAGGCACGCCGCTCGCCGCAGTGCCGCTTT 
CTGCAGGCCAAATGTGCCTACGAGCTGAAAAAATACGCTGAGGCGGAGAGTGCCCTGATCTCCACGGGATTCGCGGACGCCAAGAACTGCGATGA 
GCTGCAGCGGGACTTCGGGGATCTGGCCTGCTTCGCCTATCAGCTAATGGCCCAGATCTGCATGAGGACCGAGCGCAATAAACTGGCCGTTAGTG 
CTTTAAGGCGCGCCCTCAAACTGAACCCCTTCATGTGGCACGCATTCGCGGATCTGTGCCTGCTGGGCCAGGATACAGATGCGGCGGCCATATTC 
CAAATCCACAGCACAGATGTGTTCAATACCTGCCAGGGCAGCAGCGTAAATGCCAATGCAATGGTGCTGTTCGGAGCCGAGCAGCAGGGTCAACA 
GCATCAGGAGCGCCAGTCGCTGATCACCAACCTAAGCAATGTCTCCAACTACATTCTGACCACGCCCGTGGATCAGCAGCAACAACAGATCAACC 
AGAATCAGAACCAGCACCACAACAGCAACATGGTGACGCCCATCAACAATAATAATAATAACAACAATCTCAATAGTTCGATCAGCATGCTGCGC 
GGGGGTTTGGTACAAAACTCCAGCATGCTGGCGATGCTGGAGGACACACCTATGGGAGCTCCGCAGGACCCGGCCGGCCAGTATCAGCAGAACCA 
GCAGCTCTACGACATGAGCAGCGGTACTCCGTTTCGCAAACAGTTCAAATACCTATCGGCCATCTCACCGCCAACACCGAGCTTCGGCATCATGC 
CTTTAACTAGTCCGTGCACCGGCAACGATGGATCCTTCATTGGCAACCACACGCCCGTTATGAATATAAGCTATTCGCCGATGCCACAGATGTTG 
GTCGAGGTCAACCAGGAGCCGAAGATGATGGGCAAAAAGCTAAAGACACATGTGGGCGGACTCATCAATCGCAAAGAGGGCAGCCTTAATAAACC 
CGCCGTCTTTACGCAAGCCGGAAACATAACGCCCCGCACTCCGAACAACAATAATGTGGGCAACAATGGCAATATGAATGTGCCGCCCAATCCGA 
ATGCTGCAGTGCGTCGCAGTTCGCGGCTGTTCAGCAACTCGTATTCGGTGAAGGAGAACAATAAATCGCCCAACATAGCCAACAAGTTCGTGCAG 
CCACGTTCGCCGCCCCGCAAAGCCAAGTCACGGATGACCAAGATCTGCCTAAATAACGAACTGATCGAGGATAAGTCCCACCACCTCTCCGAGAA 
GCGCAAGGAGAAGGTGGAGACTATTACTTCCTCGGGAGCGAATAACAACAGCGGTGGACGCAGTGCCGCCGAGGAGGCCAAAGTACTGCTGAACA 
ATAGCCTAAACAATGCCCAGACGATGGCCCACCAGCTGATGGGACTCAAGAAACAATCGGCCGATGGGCTGATGGCCCTGCTGCGGGGTTTAGCT 
GAAGCCTATCAGCTGCTCTCGAACTTCCAGTGCAAGGCGGCCATAAAGCAGCTGGAGACGACGATACCAAAGCACCACCTGAACTCCAGCTGGGT 
GCAGTCGCTCATCGGACTGGCCAGGTACGAGATGCGTGAGTATGAGGCGGCGGTGGCGATATTTGAGACCATCCACAAGACAGAGCCATGTCGCT 
TGGACTACATGGAGATATACTCGTCATCCCTGTGGCATTTGCAGCGCGAGGTTGAGCTGTCAGCTCTCGCCCAGGATCTGATTAACCAGGACAAA 
ACCAGCCCGGTTACCTGGTGTGTGTCCGGCAATTGCTTTTCGCTGCAAAAGGAGCACGAAACCGCGATCAAGTTCTTCAAGCGGGCTGTCCAAGT 
GGATCCAGACTTCGTCTACAGCTACACACTGTTGGGCCACGAACTCGTACTCACCGAAGAATTTGATAAGGCCATGGACTACTTCCGTGCAGCAG 
TGGTTAGAGATCCGCGGCATTACAATGCTTGGTACGGCATCGGCACCATCTACTCCAAGCAGGAGAAATACGAGCTGGCTGAGATTCACTACGTG 
AAGGCGCTGAAAATCAATCCACAGAACTCGGTTATTCTCGTGCACATCGGCGCCATGCAATTCTACATGAAGAAGAAGGATCTGTCGCTGCAGAC 
GCTAAACACGGCCGCCACGCTGGACCCCAAGAACCCGCTGACGCGATTCCATCGCGGCTCCATCTACTTCTCGCTGGGCAAGTACCAGGAGGCGC 
TGCGCGAACTGGAGGAGCTCAAGGAAGTCGTGCCCAAGGAGTCGGTGGTGTTCTATCTGATCGGCAAAATCCACAAGACGCTCGGCAACATGGAT 
CTGGCGCTGATGCACTTCAGCTGGGCCACCGACCTAGATCCCAAAGGTGCCAACAATCAAATCAAAGATGCCTTCGACTCAATGGCTCATCCCAG 
CTGCTGTCCGGCCAACACTACGGCCCTGGACGTTGATCTGGAGCCCACTTCGGAGCGTTCCGATGACTCGACGCAGGCGCAGCAGGACGGAAGCT 
ACGACAGCGACTACTAGGGAGCTGCAACTCCAAATTCACGCGGTTGTATATACGTTAAATGAGAAACACAGTAAAC 
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(SEQ ID NO: 476) 



Start ATG: 640 (Reverse strand: CAT) 

MMIQEPVQAAIWHCLNYYDFKDAVFLSERLCSEVESDETIFLLATSYFRSNQVHQAYWLLKEKARRSPQCRFLQAKCAYELKKYAEAESALISTG 
FADAKNCDELQRDFGDLACFAYQLMAQICMRTERNKLAVSALRRALKLNPFiyWHAFADLCLLGQDTDAAAIFQIHSTDVFNTCQGSSVNANAMVL 
FGAEQQGQQHQERQSLITNLSNVSNYILTTPVDQQQ 

PAGQYQQNQQLYDMSSGTPFRKQFKYLSAISPPTPSFGIMPLTSPCTGNDGSFIGNHTPVMKflSYSPMPQMLVEVNQEPKMMGKKLKTHVGGLIN 
RKEGSLNKPAVFTQAGNITPRTPNNNNVGNNGNMNVPPKPNAAVRRSSRLFSNSYSVKENNKSPHIANKFVQPRSPPRKAKSRMTKICLNNELIE 
DKSHHLSEKRKEKVETITSSGANNNSGGRSAAEEAKVLLNNSLNmQTMAHQLMGLKKQSADGLMALLRGLAEAYQLLSNFQCKAAIKQLETTI^ 
KHHLNSSWVQSLIGLARYEMREYEAAVAIFETIHKTEPCRLDYMEIYSSSLWHLQREVELSALAQDLINQDKTSPVTWCVSGNCFSLQKEHETAI 
KFFKRAVQVDPDFVYSYTLLGHELVLTEEFDKAMDYFRAAVVRDPRHYNAWYGIGTIYSKQEKYELAEIHYVKALKINPQNSVILVHIGAMQFYM 
KKKDLSLQTLNTAATLDPKNPLTRFHRGSIYFSLGKYQEALRELEELKEVVPKESVVFYLIGKIHKTLGNMDLALMHFSWATDLDPKGAWNQIKD 
AFDSMAHPSCCPANTTALDVDLEPTSERSDDSTQAQQDGSYDSDY* 
(SEQ ID NO: 477) 

Classification: known_flybase_gene 
Gene Symbol: Cdc27 
FlyBase ID: FBgnO012058 



Celera Sequence No. : 142000013384416 

CCCGCTCCTGCTCGCCCATTTGCAGAAGTCGCTGCGATTTCCGCGACAGCTTGGAGTGGTCTATGGTTTTTAGGGAGTCAAGCAGGGCGGGATGG 
GCACTTGGCTTCACAACCGGCAGTTTTCGCCATACCTCCAGCTCCAGATCCTCGGAGTCCTGTGCGGGAATCGTGTACTTGAAGTATGGCTTTAG 
GTGACGATAGAAGAGCACTGACTCAGCGATGGCCACGGCAATTACTGGAAAATTGTATTCATTATGGAATCTAATTTAAAAATATTAAAATGCAA 
CGTTATTTCTACCCGGCGTCTTGGGCTCCGTTTCGTCGATGTAGAGACTTTCAATGGCCGCTGGAATCCCCGGCAGTGGCTGCTCGTGCTTGAGT 
TTAAGACCACGGAAAACCTTCAGCGTGGCAGTGGGCTCCTCGGCGGCATTGTCCAGTGAAATGTCCGCCGCCAGCAGCCGGACATATCCGTCGCA 
CTGCACGTCGCTCAGGGCCATGCAACTGGTGAGGGTGGTGAGATGCGGTGGAGGAGCCTCCTCCGCCGTGTCCACGTGAAGCCAGTTGGGAGCAC 
CGGCCATGGCAGGATCTTGGTGAAAAATTTGCAGCACAGTGTGAAAACGAACAGGTGATTCAGGACATGCTTTTGTTTTGGTTTTTCTTGTCCTT 
AGCAACCGGGATGACAGCACATTTTTTGTTACTTATTTGGGTTTTTTCTTAAAAACACAAAATGATCGTCAGCATTATTCTTAAATGCTAACAGG 
GT AAAGC T ACGTATAACTA7AAAAAT CATTTGGC AAAAGGT AAAACT GT GATC AAATT ACTGAT GAT AGAAAATTTTATTT AAAT T GGCCGCC7AAA 
AATGCATTTTGTCTGGTCACACTTAAGGCCGGCCTGCTAAAGACGTGCAAGCATCGACAGTATGTGTTATCGATAAGTGCGAGGGAATATTCGAT 
ACCCACGTCAATTGCTTATCGCAAAAAACATTCGTTATAAATATCATCTCAGCGACGCCAGCAGTTGAATTCAATATTTAACATTTTGTATACAT 
AAGTGACTGACTACGGCCCTATTTAACTGCAAGTAATGACGGAGCTAAGGGACGAGACACCGCTCTTTCACAAGGGCGAGATAGTCCTTTGCTAC 
GAGCCGGATAAATCGAAGGCCCGTGTTCTTTACACCAGCAAGGTAAATACACATGCACACACACACACACGCGCGCGCATTGAAATGTCAAGCAA 
ACGTCGTAGGCGAAGTTTTCACAAATGATTTTCCGTTTCCCGCAATTTTCCGTACATGCCGCAGGTGCTAAACGTCTTTGAGCGCCGGAACGAGC 
ATGGCCTGAGGTTCTACGAGTACAAGATTCACTTCCAGGGCTGGCGTCCATCCTACGATCGCTGTGTGCGCGCCACCGTTCTATTGAAAGACACC 
GAGGAGAATCGCCAGTTGCAAAGGGAGCTGGCCGAAGCTGCCAAGCTGCAAATGTAAGATAACCCCCTAGACATGCGTGGAATACCTCTCTAACG 
AGTTCCCCCTTCTTTTTCGCACAGTCGTGGTGACTACTCCTACAAGGGCACACCCGATAAGCCATCGGCCAAGAAGAAACGCGGCGGAAAAGCGG 
CCCACGTTGAGGAGCCAATAGTGGTTCCAATGGACACGGGACATCTTGAGGCGGAGCACGAAATGGCGCCCACTCCACGAGCAGCGGGCAACCGG 
ACGCGGGACAACAGCGGTGGCAAGCGAAAGGAAAAACCCCCGTCCGGAGATGGCCGATTAAAGGGAAACCGGGGCAGACAAACAGAGACCTTTTA 
TAACAATGCCATAAATGATGTATCGGTCTATAACCATGTGCCACAGGAGGATCGCATTATGATGCGAGTCAGCGAGCGATTGCGAGAGCTTATAG 
AGTACGATCGGAATATGATCAAGGTACTAGGCAAACAGCACGCACTGCCCGCCCGCGTGCCCATCGTGACCATCATGGAAAACTTCGTCAAGCAG 
CAGGCCGTGGAGCTAGCCATTAGCATTAAACAGGACAGCTCGAGGGCCAGGAACACGCAGAGTCGCAATGCCAGAATGGAGCGGGAATACGATCG 
CGTTATGTCCACGTTGGTTACCACTCAATATCAGACTTAACAACCCACTTATACTCTTACTTATTAACAGTGTCTGCATGCTGAAGGAGGTGGTA 
GACGGATTGCGTATCTACTTTGAGTTCCACGTGGATGACCACTTGCTGTATACGGAGGAGAAGGAGTATGTCCACAACTACCTCACAGACGACAA 
CATGCGCAACTGCAGCCTAATATTGAACAAGTCGTACGAGTATATCAATCCCAGCGGTGACACGGAACTGATCGGTCTGGACGGCACGCCAGTGG 
TAGAAGGGTCGGGCGACACCAATGGACAGATCGGAGTTATTAATATCGGCGGTCCGGAGTACGAGAAGCAGTTGCAAAAATGCCTGCTCTACATT 
GTTACCGCCAGCGGGAAAAACACTGCGCAGGCCTATGAGAGGACTTCCCCATATACGGCTGCCTACAAACTGCCCGTGGAGATGCGTGGATTTCT 
GAACGAAACATTCAAGTGGCGGCTTTTGTCAGCGGAATCGCCGCCAGAAAAGTCCATGGTCTTTGGCGCTCCCCACCTGGTTAGGCTAATGAGTA 
TGTACAATCTTTCTTGCATTCTATAACACACATTAATTCATGGTACTTGCAGTTAAAATGCCCATGTTCTTAAACGCATCGCCGATCAGCAACAA 
GAAGCTAGAAGATCTACTTCCCCACCTCGACGCCTTCATCAAGTAAGTCTAAGACCCCTCCCTAGTCCATCACGAAGCTAATCTGTTGGTTTTTC 
AGCTACTTGGAGAACCACAGGGAGTGGTTCGACAGGGAAAACTTTGTTAACTCCACTGCCCTGCCCCAGGAAGATCTGCAGCGGGAGCTGTTGGA 
CTCCCTGGATGGGATTGCTGCTTAGCTTTAGTTGATCGTAGTTATCTAGTTTATTTATTATTACGATTAAAGCTTATGCTCTTTTATTTAAAGAT 
TTGTAATTGTCGTTTTGGTTTCTTCAACTACACACTTTGTAAACAACGCACTTCTATAAAAAAGTTTCCTAATGTGATCGTATAATCGAATAATG 
CCTATAACTATATATGTTAAATATTGAAAATACGATTTTAACATTTTTTTTTGTTTTTTTTTTTTTTTTTGCTTAACCTAAGTTTTAGTTGAGTA 
GTACATGCTAGTATTTTTGTCATGCCTTGTTTTTTGTGCTCAAATATATTTAAGAGTTGGAAAGTTGTTGTCTTTTGTATTAAGGATTTGCGGAC 
CTTGAGTTGGAATGTGCAGGGAATTCTTAAGATTTGTATAATTTCTCAAAATATTGCAACTGTAACAACTATGTGTGTGCTGATTTGTGTTTCAG 
ATCCTGCTTAGAAATTCGAAATTGTCGCTTGGACGGCTTGGAGTTGGATGCATTCGCGTTTGTAACATGTGTAAACTTGGGACGTTAACTTGGAC 
GATGACTGAACTGGACTGGACTGCCGGCCCTCACTTCTGTCGCTTGGTGGCACTGCTGGCCGTGGGTCGCATGAAGCTGCTCACGGATCGACTGC 
TGCTAGTGCTCCTCTGCTGCGGACTGGGGTTCGCACTCATCCTGGCCTGCTGCTGCTGGTTCAGGGCATTTTTCACCAGCACCACGCTGCGAGCG 
GCGCTCATTCGGGACTGTGACTGGGGTTGGGACTGGGACTGGTTTTCCTTAAAACTGGTGCTACCATGGATACTCGATTTCGGAGCCGTGTTGGA 
CATCTTGCTGCGCACCAAGATCACACTGGGTCCCACACTCGACCCGCTACTGCTGCTCCTGCTGGCGGGGACTCCACGTCCCTGGGTAGGTCGCA 
GATTCTGGGCGGCCAGTGGTCTTTTGCTGGGCGAGGAGTCAACGGACACAGTGGCTCCCGCTCGCTGAACGGCGG 
(SEQ ID WO: 478) 

Exon: 1001.. 1182 
Exon: 1300.. 1478 
Exon: 1545.. 2102 
Exon: 2161. .2657 
Exon: 2713. .2797 
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Exon: 2853.. 2970 
Start ATG: 1081 

Transcript No * CT14 600 

AGCGACGCCAGCAGTTGAATTCAATATTTAACATTTTGTATACATAAGTGACTGACTACGGCCCTATTTAACTGCAAGTAATGACGGAGCTAAGG 
GACGAGACACCGCTCTTTCACAAGGGCGAGATAGTCCTTTGCTACGAGCCGGATAAATCGAAGGCCCGTGTTCTTTACACCAGCAAGGTGCTAAA 
CGTCTTTGAGCGCCGGAACGAGCATGGCCTGAGGTTCTACGAGTACAAGATTCACTTCCAGGGCTGG^ 

CCACCGTTCTATTGAAAGACACCGAGGAGAATCGCCAGTTGCAAAGGGAGCTGGCCGAAGCTGCCAAGCTGCAAATTCGTGGTGACTACTCCTAC 
AAGGGCACACCCGATAAGCCATCGGCCAAGAAGAAACGCGGCGGAAAAGCGGCCCACGTTGAGGAGCCAATAGTGGTTCCAATGGACACGGGACA 
TCTTGAGGCGGAGCACGAAATGGCGCCCACTCCACGAGCAGCGGGCAACCGGACGCGGGACAACAGCGGTGGCAAGCGAAAGGAAAAACCCCCGT 
CCGGAGATGGCCGATTAAAGGGAAACCGGGGCAGACAAACAGAGACCTTTTATAACAATGCCATAAATGATGTATCGGTCTATAACCATGTGCCA 

CAGGAGGATCGCATTATGATGCGAGTCAGCGAGCGATTGC 

ACTGCCCGCCCGCGTGCCCATCGTGACCATCATGGAAAACTTCGTCAAGCAGCAGGCCGTGGAGCTAGCCATTAGCATTAAACAGGACAGCTCGA 
GGGCCAGGAACAC 

TTGCGTATCTACTTTGAGTTCCACGTGGATGACCACTTGCTGTATACGGAGGAGAAGGAGTATGTCCACAACTACCTCACAGACGACAACATGCG 
CAACTGCAGCCTAATATTGAACAAGTCGTACGAGTATATCAATCCCAGCGGTGACACGGAACTGATCGGTCTGGACGGCACGCCAGTGGTAGAAG 

GGTCGGGCGACACCAATGGACAGATCGGAGTTATTAATATCGGCGGTCCGGAGT^ 

GCCAGCGGGAAAAACACTGCGCAGGCCTATGAGAGGACTTCCCCATATACGGCTGCCTACAAACTGCCCGTGGAGATGCGTGGATTTCTGAACGA 
AACATTCAAGTGGCGGCTTTTGTCAGCGGAATCGCCGCCAGAAAAGTCCATGGTCTTTGGCGCTCCCCACCTGGTTAGGCTAATGATTAAAATGC 

CC AT GT T CT T AAAC GC ATC GCC G AT C AG C AAC AAG AAGCT AG AAG AT CTACTTCCC C AC CT C G AC GC C T T C AT C AAC T AC T T GG AG AAC C AC AGG 
GAGTGGTTCGACAGGGAAAACTTTGTTAACTCCACTGCCCTGCCCCAGGAAGATCTGCAGCGGGAGCTGTTGGACTCCCTGGATGGGATTGCTGC 

TTAG 

(SEQ ID NO: 479) 
Start ATG: 81 

MTELRDETPLFHKGEIVLCYEPDKSKARVLYTSKVLNVFERRNEHGLRFYEYKIHFQGWRPSYDRCVRATVLLKDTEENRQLQRELAEAAKLQIR 
GDYSYKGTPDKPSAKKKRGGKAAHVEEPIVVPMDTGHLEAEHEMAPTPPJ^GKRTRDNSGGKP^EKPPSGDGRLKGNRGRQTETFYNNAINDVSV 
YNHVPQEDRIMKRVSERLRELIEYDRNMIKVLGKQHALPARVPIVTIMENFVKQQAVELAISIKQDSSP^ 

EVVDGLRIYFEFHVDDHLLYTEEKEYVHNYLTDDNMRNCSLILNKSYEYINPSGDTELIGLDGTPWEGSGDTNGQIGVINIGGPEYEKQLQKCL 
LYIVTASGKNTAQAYERTSPYTAAYKLPVEMRGFLNETFKWRLLSAESPPEKSMVFGAPHLVRLMIKMPMFLNASPISNKKLEDLLPHLDAFINY 

LENHREWFDRENFVNSTALPQEDLQRELLDSLDGIAA* 
(SEQ ID NO: 480) 

Name : LETHAL- 3 

Classification: tumor__suppressor 
Gene Symbol: msl-3 
FlyBase ID: FBgn0002775 

Celera Sequence No. : 142000013384580 

GTTGTAATGAGAGCAAATCATCGGTCAGCATTATACACAAAAGTTAAAAGTATCGAGTTCTAATTAAAATGCTTCATTTTAAATGTTAAAGGTGC 
GTTCGTCACTAAGCTAGTGCTTAGAGTATCAAAACGGAAGTGGCTTGCAACATGTTGATGCAAACAGTATTATTAGCACTGGTGGCGAAAAGCGA 
TAAATTAGTAGCGCTCAATAAATACGTCAGCCACTCAGACAATTGGGGCTTTTGCCACATTAATCAAAACCTCAGGTGGATGCAGTCTCACCTGC 
AACAAGCCTCCCATTTTCTTCCGCCTCCTATTCTGGTTTTACCCAATTTCGTTGTGCTTCGGTTGCATTTGGAGTTTCAATTGGTGCACGCAGTG 

GT T AAAT GC AACG AAAAT C T T T AT TTCATCGTGCATGTGTGC AAT T AAGCT GC C T GAT T GAT AC AT T T T GAT T G AT T AAT GGG AT T C AAAGT AC G 
TGCGTGCTGTCTAAGCAATCAGCCGACACTTCGCCCGGTAAAATACGGAGCTGCGATTCTTTAAGGTAAACAATTCGTAGGAGGCGGTAATGTTT 
GCAAATTTTGTAGCGTTTTTCAACAGATTTCATGATGATATTTGTTCGTTTACTACAAGAAATGTGCAATATGCTTTACTCAGTTTATCATATTA 
AT AC AAAT AAAAAATTGT AAAC AAAT AAGAAT AAC AAACCTTAACTAGC AAC AGCTGATTAACGT 

CAAAAATACCATAAATTCGGTATTTTCCGGTTCGTACACCATTCAACTCTTGCTCTCTAACCAAGTAGATGTAACATAATTTATTTAAATATTTG 
TATTTTATACAAACGATTTAATTGAATGTGAACAGAGTGCAGTGTAAAATGTTCACAAGCACCCGAGATCAAAAGTAAACACAACACCGCAAGTT 
GGTATTATGCAGACATACGAATGCGGCAACACTAACGTATTTGCCAAAAAATGCGCATAAATATCGAAGTGAATTTGATAATATAAAAAAACCAA 
GTTAACTACGCTATCGAAAATGTCTGTGCTGCGATTCCTGGTGACGCGCCAGGCCTTGGCTGCCTTAACCAGACCAAGGACCCTCAACATTATCC 
AAAATCCAGCGCAAATCGCCTACGCCTCCACTTTGTGCAATCAAAATAGCAATCATAATGCCAAGGATCTGACCAAATCGAGTGCCAATTTAAGT 
CTGATGCAAACTAGGGGCCATAAACGTTTTGGTCATCAGGAGGAGAAGACGCCCAGTGTGACAAAGTACTTCCACATGTTCATTCTCAGTTTATT 
CCTCATTTCGGTAATGGACTGGGGCAAGTGAGTAAACTAACCAATAAAGTATTTTCAACCAAGTCCAAACTACTGAATTGGAATCTATTTTTAGA 
GTGAAACGCATGTTGACGCCAAAAGTGGACGCAGATGCTGGACAACGTCCGTCATCGGCGGCCGACGTTAATGGCGAGGATAAGTCATCCGAATC 
CGAATCGGAAGATAGCGAGGACGAGGAGGCAGGATCCGATCTGCACCTGCACGAGGGCAAGAAGATCCGCGAGAAAGTCGGTTTCCGCGAGAGGA 
AGGTATAAAGCTGCTTGACGATGCTATAACTCTGTAAAAAGTATCATTATATTCTGCTTAAATTCTGGGTCTCACAATAGTTCCAATTTTTGGTT 
ATGTTACTAATTGTGAGGACCGGAAAAATCGTAACTTTTCTTTTAATATATTTTAAACTTTCGTATATTATGTTAAACCAAAATGTTTGATTTTT 
ACTTTTTCTTTTATGTTTGTGACGTCTTGTTTTTGACCCACCATCAAACACATTAAACACACACACACCGCGAAGATCATTGAGTATGAGAACCG 
AATCAGGCAGTTCTCCACCCCCGACAAGATTTTTCGTTATTTTGCCACCGTAAGACTGCAGGATGCGACCCAAACGATTGTCTGCATGACTCCCG 
AGGATTTTCTTCGCTCCATTTATCCGGGAATCAAGCAGCCAGATGGTAAATCACTCATTCAAATGAGCATATGTAATTATATCTTTAGTTATTGG 
ATTAACTTGAGTAAGGATATTTTTTTTTATTGTATTAGTTGTTTTGGTTGGCATAACTTACTTTTCATAATTTGAATAATGCTCATAATTGATGC 
ATTTTCGTAAGCTAAACTCACGTTGTTTGCCTTAGATAATTGAGTATGAGAACCGCATACGGCAGTTCTCCACGCCGGACAAAGTGTTTCGCTAC 
TTTGCCACCATACAGGTGCCCGTGGCGGATGATCGTCATGAGGTCTACATGACTCCGACCGACTTTCTAACAAGCATGACGCCCGGCATGAAGCA 
GCCAGATGGTATGTGCAAGAAAAAACCCAGGGAAAAACTTAAGAAAGCCAGTAGTTGTAGCACTTACTAACTGACATTATGATTAGAGAGCTCCT 
GCAATAATAGCTATATATATTTTAGGATTGGGCCTGGACCAATATCGTCGCTATGATCCCAAGGTGAGTCTAAAATGCATTTGTAATACAAGTTA 
ATAATCCCTATTTTTCCCTACTAGTCTGTTGGCGAGCAGCTCAATTTGCATCTGGAGAAGAATAGCATTTTTTACAAACTAGGCTCTTACGGACT 
GATCACATTCTCAGACTACATCTTCCTGCTCACAGTCCTCTCGAGTAAGTGATAAGTATTCCCATTAGGATACATTCCAATAACCACGTTTTCTT 
TATAGTTTCACGACGCCACTTCGAGATTGCCTTCCGCATGTTTGACTTGAATGGGGATGGTGACGTGGACTGCGAGGAATTCGAGATGGTGGCCA 
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CCTTGGTGAGGCAGCAAACCAGCATGGGAACCCGACACCGCGATCACGCCAATACAGGGAACACTTTTAAGGTAATGAAACTCATTTGCCCTAGA 

TTTTAATCAATAAGAGTACCTCGAGCCGTACTAAATACGATTCCTTGTGTATTTTTGTATATTCTCTGTCTTTTCGCTTTGACCCTATATATTGA 

ACGCGTGTCTAAACTGTGTACATATCAAATCTTAACTCCCCTCTGTTTTTGACGTACTTACAGTCCTTGAAGGTACTTTGTGTGATAATGCTTAT 

TGACTTATTTTGGCTTTTCAAATTTTCTTTCGAAAACACCCGAACACAATCTCACTCAAGTTAAACTAAATCATATTTCACGTAGTTAATTCCAC 

GATGCACCTGCTCCCATGCACACAAATTACTCTTTTTTATATCCAACTTGTCGTTTCTATTTTTGGAAATATTAATTGCTTAATTTTAAATTGTT 

TTCTTAGGGTGTGAACTCGGCTTTGATCACATATTTCTTTGGACCCAATATGGATGAAAAGTTGACAATAGAAAAGTTCCTGGATTTCCAGGAGC 

AACT GC AAAG G G AGAT C T T GT C AC T C G AAT T C GAAAG AA AAG AACC G AATG ATG AGGGG AA T AT AAC AG AAGCGG A T T T T GCAG AA CTCCTACTG 

GCCTACGCTGGATATCCGTTAAAAAAGAAACAAAAGAAGTTGAAGAGGGTTAAGAGGAGGTTTAGGGACCATGGCAAAGGCATCTCCAAACAGGA 

CTATCTGGATTTCTTTCATTTTCTGAACAACATCAATGACGTGGATACTGCCCTTACTTTCTACCACATTGCGGGTGCCTCGATTGACCAACAAA 

CATTGCAACATGTGGCCAAAACGGTGGCCATGGTCAATCTTTCGGATCATGTGGTGGATGTGGTCTTTACCATCTTTGATGAGAATAGTAAGTTC 

GTAATTGAGTGTAATTAGAATAACAATCTTAATTCATTCGGCAAACCTCTTTCAGATGACAACCAACTCAGCAACAAGGAATTTATTTCGGTAAT 

GAAGAATCGCGTGCAGCGCGGATTGGAGAAACCCAAAGACACAGGATTCCTAAAGATGATGCGATCGGTGTTCAAGTGCGCCAAGGAAACCAAGC 

CTGTGCTGCTGGACATCTAAAGATCGTCACTATGTGGATTCTCCCGCTTCAAGCGCCCCAATCTTAGTCACTTTTTCCTACGCTTACGCTTTTTA 

CTATTTATTTTTGTAGTTAACGTTTACATCAATCGATTTTAGCTCAATCTTTGTAATTGTTCGACCCCCGGCAGGTTGACGACTGTCTTCTGTAT 

ATTTCACCGTCTCTTGTGCACTTAACTCGTTTATTTTTTAATAAAAATATTATATTTATAAATAAGTACATCTTTAGGAATTTTTGCTGTAGCTC 

ATCATTCAGTTCTTGGATTATGGCGCACGTTACATTTTACATTTCAAAAGTGTATTATAAGTGCTTAAAAAAGAATATGAATGAATAAATTAATA 

GTGATATAAAACACTTTAAAAAGATATGAATATATTAATAAGGAATGTTTTAATGATCCTTAATAGCTAATTTATATGAAATATGGGATTTTCAA 

TCCTCTTTATGCATTAAAAAGAATGACCAAGAGTTTACAACTCTCGCGCAGTTAATATTAATTGTGGGCTCAAAACATTTTCTATGCCACATCTA 

CGTCATCCTGCTCCTCGTCCGCCACATCATCAATGCTCATCCCGTACAACTCCAAGTCGTGCTTTTTAAGATGGGCCGTATAAGCATCCAACTCG 

GTGAATCGTTGCTGACACTCGCAGCACTCTAGGCAATCTTCCACAGAACTGTCGCCATCGTCGTCACTCTCGTAAATGGACGACTTGCGGCTACT 

CAGCTGATCCTGATGATTGTTCTTGTGATCCGAGTACTCCAGATAGTCCCTGAATTTGGAGGGACACCTCAGGCACTTGTACCAGCGATCCCTGA 

CATGGGTGACCATGTGTTTCTTGAGCGTGGAGCACTCCCTGAAGGGTTTCCTGCATTTTGAGCAGCTAAATGGTTTCTCGCCAGTATGGATCATA 

T AGT G GCGTT T AAG AT T AT G AC G AGT T CC GAAAGG AC GAT G AC AC AAAGG AC AC GG AT AAG AT C T GC G AG 

(SEQ ID NO; 481) 

Exon: 1001.. 1357 
Exon: 1425.. 1617 
Exon: 1881.. 2040 
Exon: 2496.. 2533 
Exon: 2590.. 2704 
Exon: 2761. .2921 
Exon: 3333.. 3792 
Exon: 3856. .4010 
Start ATG: 1065 

Transcript No. : CT14629 

ATGCGCATAAATATCGAAGTGAATTTGATAATATAAAAAAACCAAGTTAACTACGCTATCGAAAATGTCTGTGCTGCGATTCCTGGTGACGCGCC 
AGGCCTTGGCTGCCTTAACCAGACCAAGGACCCTCAACATTATCCAAAATCCAGCGCAAATCGCCTACGCCTCCACTTTGTGCAATCAAAATAGC 
AATCATAATGCCAAGGATCTGACCAAATCGAGTGCCAATTTAAGTCTGATGCAAACTAGGGGCCATAAACGTTTTGGTCATCAGGAGGAGAAGAC 
GCCCAGTGTGACAAAGTACTTCCACATGTTCATTCTCAGTTTATTCCTCATTTCGGTAATGGACTGGGGCAAAGTGAAACGCATGTTGACGCCAA 
AAGTGGACGCAGATGCTGGACAACGTCCGTCATCGGCGGCCGACGTTAATGGCGAGGATAAGTCATCCGAATCCGAATCGGAAGATAGCGAGGAC 
GAGGAGGCAGGATCCGATCTGCACCTGCACGAGGGCAAGAAGATCCGCGAGAAAGTCGGTTTCCGCGAGAGGAAGATCATTGAGTATGAGAACCG 
AATCAGGCAGTTCTCCACCCCCGACAAGATTTTTCGTTATTTTGCCACCGTAAGACTGCAGGATGCGACCCAAACGATTGTCTGCATGACTCCCG 
AGGATTTTCTTCGCTCCATTTATCCGGGAATCAAGCAGCC AGAT GGATTGGGCCTGGACCAATATCGTCGCTAT GAT CCCAAGTCTGTTGGCGAG 
CAGCTCAATTTGCATCTGGAGAAGAATAGCATTTTTTACAAACTAGGCTCTTACGGACTGATCACATTCTCAGACTACATCTTCCTGCTCACAGT 
CCTCTCGATTTCACGACGCCACTTCGAGATTGCCTTCCGCATGTTTGACTTGAATGGGGATGGTGACGTGGACTGCGAGGAATTCGAGATGGTGG 
CCACCTTGGTGAGGCAGCAAACCAGCATGGGAACCCGACACCGCGATCACGCCAATACAGGGAACACTTTTAAGGGTGTGAACTCGGCTTTGATC 
ACATATTTCTTTGGACCCAATATGGATGAAAAGTTGACAATAGAAAAGTTCCTGGATTTCCAGGAGCAACTGCAAAGGGAGATCTTGTCACTCGA 
ATTCGAAAGAAAAGAACCGAATGATGAGGGGAATATAACAGAAGCGGATTTTGCAGAACTCCTACTGGCCTACGCTGGATATCCGTTAAAAAAGA 
AACAAAAGAAGTTGAAGAGGGTTAAGAGGAGGTTTAGGGACCATGGCAAAGGCATCTCCAAACAGGACTATCTGGATTTCTTTCATTTTCTGAAC 
AACATCAATGACGTGGATACTGCCCTTACTTTCTACCACATTGCGGGTGCCTCGATTGACCAACAAACATTGCAACATGTGGCCAAAACGGTGGC 
CATGGTCAATCTTTCGGATCATGTGGTGGATGTGGTCTTTACCATCTTTGATGAGAATAATGACAACCAACTCAGCAACAAGGAATTTATTTCGG 
T AAT GAAGA ATC GC GT GC AGC GC G GAT T GG AG AAAC C C AAAG AC AC AGG ATT C CT AAAG AT GAT GCGATCGGTGTT C AAGT G C GC C AAGG AAAC C 
AAGCCTGTGCTGCTGGACATCTAA 
(SEQ ID NO: 482) 

Start ATG: 65 

MSVLRFLVTRQALAALTRPRTLNIIQNPAQIAYASTLCNQNSNHNAKDLTKSSANLSLMQTRGHKRFGHQEEKTPSVTKYFHMFILSLFLISVMD 
WGKVKRMLTPKVDADAGQRPSSAADVNGEDKSSESESEDSEDEEAGSDLHLHEGKKIREKVGFRERKIIEYENRIRQFSTPDKIFRYFATVRLQD 
ATQTIVCMTPEDFLRSIYPGIKQPDGLGLDQYRRYDPKSVGEQLNLHLEKNSIFYKLGSYGLITFSDYIFLLTVLSISRRHFEIAFRMFDLNGDG 
DVDCEEFEMVATLVRQQTSMGTRHRDHANTGNTFKGVNSALITYFFGPNMDEKLTIEKFLDFQEQLQREILSLEFERKEPNDEGNITEADFAELL 
LAYAGYPLKKKQKKLKRVKRRFRDHGKGISKQDYLDFFHFLNNINDVDTALTFYHIAGASIDQQTLQHVAKTVAMVNLSDHVVDVVFTIFDENND 
NQLSNKEFISVMKNRVQRGLEKPKDTGFLKMMRSVFKCAKETKPVLLDI * 
(SEQ ID NO: 483) 



Celera Sequence No. : 142000013384580 

AAAAGAAGTTGAAGAGGGTTAAGAGGAGGTTTAGGGACCATGGCAAAGGCATCTCCAAACAGGACTATCTGGATTTCTTTCATTTTCTGAACAAC 
ATCAATGACGTGGATACTGCCCTTACTTTCTACCACATTGCGGGTGCCTCGATTGACCAACAAACATTGCAACATGTGGCCAAAACGGTGGCCAT 
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GGTCAATCTTTCGGATCATGTGGTGGATGTGGTCTTTACCATCTTTGATGAGAATAGTAAGTTCGTAATTGAGTGTAATTAGAATAACAATCTTA 
ATTCATTCGGCAAACCTCTTTCAGATGACAACCAACTCAGCAACAAGGAATTTATTTCGGTAATGAAGAATCGCGTGCAGCGCGGATTGGAGAAA 
CCCAAAGACACAGGATTCCTAAAGATGATGCGATCGGTGTTCAAGTGCGCCAAGGAAACCAAGCCTGTGCTGCTGGACATCTAAAGATCGTCACT 
ATGTGGATTCTCCCGCTTCAAGCGCCCCAATCTTAGTCACTTTTTCCTACGCTTACGCTTTTTACTATTTATTTTTGTAGTTAACGTTTACATCA 
ATCGATTTTAGCTCAATCTTTGTAATTGTTCGACCCCCGGCAGGTTGACGACTGTCTTCTGTATATTTCACCGTCTCTTGTGCACTTAACTCGTT 
TATTTTTTAATAAAAATATTATATTTATAAATAAGTACATCTTTAGGAATTTTTGCTGTAGCTCATCATTCAGTTCTTGGATTATGGCGCACGTT 
ACATTTTACATTTCAAAAGTGTATTATAAGTGCTTAAAAAAGAATATGAATGAATAAATTAATAGTGATATAAAACACTTTAAAAAGATATGAAT 
ATATTAATAAGGAATGTTTTAATGATCCTTAATAGCTAATTTATATGAAATATGGGATTTTCAATCCTCTTTATGCATTAAAAAGAATGACCAAG 
AGTTTACAACTCTCGCGCAGTTAATATTAATTGTGGGCTCAAAACATTTTCTATGCCACATCTACGTCATCCTGCTCCTCGTCCGCCACATCATC 
AATGCTCATCCCGTACAACTCCAAGTCGTGCTTTTTAAGATGGGCCGTATAAGCATCCAACTCGGTGAATCGTTGCTGACACTCGCAGCACTCTA 
GGCAATCTTCCACAGAACTGTCGCCATCGTCGTCACTCTCGTAAATGGACGACTTGCGGCTACTCAGCTGATCCTGATGATTGTTCTTGTGATCC 
GAGTACTCCAGATAGTCCCTGAATTTGGAGGGACACCTCAGGCACTTGTACCAGCGATCCCTGACATGGGTGACCATGTGTTTCTTGAGCGTGGA 
GCACTCCCTGAAGGGTTTCCTGCATTTTGAGCAGCTAAATGGTTTCTCGCCAGTATGGATCATATAGTGGCGTTTAAGATTATGACGAGTTCCGA 
AAGGACGATGACACAAAGGACACGGATAAGATCTGCGAGTGCGACTCATGGACTTTCCAGCTATATTTCCATCGTTGCTCAAGGAAGATGAGGAG 
TTGGCACTGCTGCAGCTGCGAGGAAGTGGCGGGCGACTTGGCTGCGTGTGTGAAGGAACTTTTTCCGGTGGAGATTGAATAGGTGGTATTAAAGT 
AACCATTAAACTGGGTTGTGTTGCCACTTGTGGCTGCTTCTCATTTTCTAGAGCCGCTTCCTCTATTTCGTCATCATCCCCTTCCATCTCATCCG 
AATGTTCCTCCGTTGTTTTTTCGCCCAGTTCCTCTGGTTGAGGTTTTACCATAAGATTCTTGCTTTCACAATCCGCACACCTTGTAGAGTCCTGC 
TCCAAAGGTGCTCCCAAGTGACAGATCATGCAAATGAATTGTGGATCGTGTGAGTGCCTTCGACGATGCTTTTTAAGATCGCTGAATTCCCGAAA 
ACGTCGCTCGCACAGATCACAAGGGAATGGTCTTTCACCTGGGGAAAATTTAGTAGAAAGTGATAAGTGGAGGTATTTGAATATTTTATGCGAAG 
TTTCGTACCTGTGTGGATCATCATATGACGTCTTAGATTGTAATTAGATTGCACTTTCTTCCCGCAGTCAGGACATTCGTACATATGCCTTCCCC 
TTCTGTTGGCACCTGGACGCCTCTTTACTAATCTCCTCGACCCAGAAGATTTCCTCTTGATGACATTGTCTGTCTTCTGTTCGTCCTCCTGGTCG 
GAGGGCACGATGATCACCAGTTCCTGTGGATTCTGGCTTCCATCCTGTCTCTGCTCATTCAGGACGCAGCTTACAGCGTTTGATGGAAGACTTAC 
ATCCTCATCTACTTCACTATCTTCATCGTGATCACTATCACTCAGTTCGATTTCTGCGACAACCCTGTGCACCCGTCTGGGAATCAGTTCGATAT 
CTGGTGTTACCCTCCTAATGGATGAATTCCAATGGGCTAGACTCCTGGCTGCGCTGATTATGAGACCCCGAGGAGATTGAGCCTCCTCCTCGTCC 
AATTCGATTACCTCTTGATCTCTTCGCTTTTCGGCCTCCCGCCTTTCCTTCTCTCTTTTCTCCCTTTTCTCTTGTTCCATGATCCTTTTGATTTC 
CAACTCCTTTTCCCTCTGCTGCTGGCCGATTTGCCACAGGAAACTCTTGATTTTCACATCATCAATGGATATGAAGTTCTCCTCGGGCTCCTGGT 
ACTCATATCTGACTGGATCGGGCGGTATTAACCCGCAGGCGGCGTCGCAGCTAGCGTCGCAGCAGGCGGAGGCAGAACAGCAGGGCAGCAGAGTG 
CAGTCCTTGCAGCAGTCCACGCAGGATGATGGTTTATTCAGAGCCAGCTCCATCAAATCTGCAATGTTACAATAAACAATTTAGAAAAGTTTAAG 
AAAGGAACGTAAGAATTGCACTATCATTGTACTATCAGAAAAAAACATTGCATACATGTAGGCGTGTTTTATATAAATCACATTCTACTATGGAT 
AAATCGGCACTTATCAGTATTTCCAAGAAGCGTACTTTCTAGTATCGAATAATCACAATAAAACCTCTCAAACCATAAAAAAAGAATATATTATA 
GGGCACTACATTTCCATCGATCACATTGAATAGCCTGTATTGGTTTATATTAATATCCTGGACAAGTCGCATACGAACCATTGATAACCCAGGAG 
ATTCCGATCGAATGAATTCTGCGATTGCACACACACCCCTCTCTCACACTTTACAAAGCTTCGCTCCTTTCTTTCGTTTTTTATCCGCAATTTGT 
GCGCTCGGC GCrGCGTCTGCCGTGCGAAGAAGTGAAGAAGAAACTGGATTCGCGATTCGCGATTTGCGGCTTGCCGAGCAGCAACCTCTCGGCAA 
AAACAACAAAACGCCGCAAAAGTCGAGCTTCGAGTTGACGCCGACTGCGCTGCCCGGCTTGTGAATCAGCGAAGCGAGCAGAGCAGCAGCAGCAA 
CAACAACAACAACCATGTGGGCAGCGCTTAAGGCTTTTTTTTTGTGTGCACGAACGTGAGCCTTTAGCCCCCCTTTCCCACCACTGCCCATTGGC 
GTGTTAACTTGATTTTTCTTGGGGCCACGCAAGAGCGAAACACGAAAAACAAAGAAAATGCCCCACAACAACAACAAGGGCAACAACAAAAACAA 
T GC AAT T AC AG C CGGAGC AAC AGGC GC AGC AGC AAC AAC AAC T C AC GAT T T T C AAGAT T GC T C ACAAG AGC AGGGGC GT AGGC AGG AC AT G AG AA 
GGGGTGAGGTTACCAGGAGTGAAAGGAATTCCTTACTATATATACTTGTATATAACAACGATCTAATCTTCAATGAAATGCCAAAAAAAAGATAA 
TTTGCTTT 
(SEQ ID NO: 484) 

Exon: 2808 . .2004 
Exon: 1938.. 1001 

Start ATG: 2808 (Reverse strand: CAT) 
Transcript No. : CT14 635 

ATGGAGCTGGCTCTGAATAAACCATCATCCTGCGTGGACTGCTGCAAGGACTGCACTCTGCTGCCCTGCTGTTCTGCCTCCGCCTGCTGCGACGC 

TAGCTGCGACGCCGCCTGCGGGTTAATACCGCCCGATCCAGTCAGATATGAGTACCAGGAGCCCGAGGAGAACTTCATATCCATTGATGATGTGA 

AAATCAAGAGTTTCCTGTGGCAAATCGGCCAGCAGCAGAGGGAAAAGGAGTTGGAAATCAAAAGGATCATGGAACAAGAGAAAAGGGAGAAAAGA 

GAGAAGGAAAGGCGGGAGGCCGAAAAGCGAAGAGATCAAGAGGTAATCGAATTGGACGAGGAGGAGGCTCAATCTCCTCGGGGTCTCATAATCAG 

CGCAGCCAGGAGTCTAGCCCATTGGAATTCATCCATTAGGAGGGTAACACCAGATATCGAACTGATTCCCAGACGGGTGCACAGGGTTGTCGCAG 

AAATCGAACTGAGTGATAGTGATCACGATGAAGATAGTGAAGTAGATGAGGATGTAAGTCTTCCATCAAACGCTGTAAGCTGCGTCCTGAATGAG 

C AG AGAC AG G AT GG AAGC C AG AAT C C AC AGG AAC TGGTGATCATCGTGCCCTCC G AC C AGG AGG ACG AAC AGAAGAC AG ACAAT GT C AT C AAG AG 

GAAATCTTCTGGGTCGAGGAGATTAGTAAAGAGGCGTCCAGGTGCCAACAGAAGGGGAAGGCATATGTACGAATGTCCTGACTGCGGGAAGAAAG 

TGCAATCTAATTACAATCTAAGACGTCATATGATGATCCACACAGGTGAAAGACCATTCCCTTGTGATCTGTGCGAGCGACGTTTTCGGGAATTC 

AGCGATCTTAAAAAGCATCGTCGAAGGCACTCACACGATCCACAATTCATTTGCATGATCTGTCACTTGGGAGCACCTTTGGAGCAGGACTCTAC 

AAGGTGTGCGGATTGTGAAAGCAAGAATCTTATGGTAAAACCTCAACCAGAGGAACTGGGCGAAAAAACAACGGAGGAACATTCGGATGAGATGG 

AAGGGGATGATGACGAAATAGAGGAAGCGGCTCTAGAAAATGAGAAGCAGCCACAAGTGGCAACACAACCCAGTTTAATGGTTACTTTAATACCA 

CCTATTCAATCTCCACCGGAAAAAGTTCCTTCACACACGCAGCCAAGTCGCCCGCCACTTCCTCGCAGCTGCAGCAGTGCCAACTCCTCATCTTC 

CTTGAGCAACGATGGAAATATAGCTGGAAAGTCCATGAGTCGCACTCGCAGATCTTATCCGTGTCCTTTGTGTCATCGTCCTTTCGGAACTCGTC 

ATAATCTTAAACGCCACTATATGATCCATACTGGCGAGAAACCATTTAGCTGCTCAAAATGCAGGAAACCCTTCAGGGAGTGCTCCACGCTCAAG 

AAACACATGGTCACCCATGTCAGGGATCGCTGGTACAAGTGCCTGAGGTGTCCCTCCAAATTCAGGGACTATCTGGAGTACTCGGATCACAAGAA 

CAATCATCAGGATCAGCTGAGTAGCCGCAAGTCGTCCATTTACGAGAGTGACGACGATGGCGACAGTTCTGTGGAAGATTGCCTAGAGTGCTGCG 

AGTGTCAGCAACGATTCACCGAGTTGGATGCTTATACGGCCCATCTTAAAAAGCACGACTTGGAGTTGTACGGGATGAGCATTGATGATGTGGCG 

GAC GAGGAGC AGG AT GACGTAGAT GTGGCAT AG 

(SEQ ID NO: 485) 

Start ATG: 1 (Reverse strand: CAT) 

MELALNKPSSCVDCCKDCTLLPCCSASACCDASCDAACGLIPPDPVRYEYQEPEENFISIDDVKIKSFLWQIGQQQREKELEIKRIMEQEKREKR 
EKERREAEKRRDQEVIELDEEEAQSPRGLIISAARSLAHWNSSIRRVTPDIELIPRRVHRVVAEIELSDSDHDEDSEVDEDVSLPSNAVSCVLNE 
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QRQDGSQNPQELVIIVPSDQEDEQKTDNVIKRKSSGSRRLVKRRPGANRRGRHMYECPDCGKKVQSNYKLRRHMMIHTGERPFPCDLCERRFREF 
SDLKKHRRRHSHDPQFICMICHLGAPLEQDSTRCADCESKNLMVKPQPEELGEKTTEEHSDEMEGDDDEIEEAALENEKQPQVATQPSLMVTLIP 
PIQSPPEKVPSHTQPSRPPLPRSCSSANSSSSLSNDGNIAGKSMSRTRRSYPCPLCHRPFGTRHNLKRHYMIHTGEKPFSCSKCRKPFRECSTLK 
KHMVTHVRDRWYKCLRCPSKFRDYLEYSDHKNNHQDQLSSRKSSIYESDDDGDSSVEDCLECCECQQRFTELDAYTAHLKKHDLELYGMSIDDVA 

DEEQDDVDVA* 
{SEQ ID NO: 486) 

Name: zinc finger protein 
Classification : transcription_f actor 



Celera Sequence No. : 142000013385212 

AATGTTAGTGTAAATAAATTACCTTTATCGGTAAATCCATGTGCTCGCTGCTGCAGCTGGACCCATACATAACCTCAAACCAGAATGTAAATTCT 
GGAAATAAAAGAAAATGGCATCAATACAGTGTGTACAAGGAGTACTTCCAAATATTTTATATATTTGATTAGGTTTCAGTTTACAATTTTCTGCA 
CGGTTATGAGCTCAGTACAGGTCTGTGTAAAAGATCATCAGAGAAGCATAGACCAACCAAAAGTATTAATATCTTACCTCCAATGTGACAATGGC 
GTCTCCAGTGGGCTCAGTGCCCAAGGTGTCTCAACGGGCTCACATGTTGTCCCGAGGTGTCTCACATGCACCATGCACGTGGAGTGCCCGGATTT 
CGATTTATCCAAATGCTTAGGGTTTCAATATTGTTCACTGCAAGAAATACAGAACTTTTAGAACATATTCTTGCCAAACCCAAAGAAAGTTCTTG 

AAGT T AT AT G CC AT C AGAT C GC C GAAT CAT C T T AAAT G AAC AAAT AT AT AACGAG GC AC T G AAAAAT AC AGC G C C AAGT C GAG AAT TTGTTCCTT 
CAGTCTCGTCATCTGTTTCATCATTTGAGGCATAACACTCTTCCATTTAAGCACTTTAATTCACTTGATATAAAACTATTTACGAAGAGCACACA 
ACTTACCATTTCGAGTCTTGAAATAAAATGAACGAGTTTCGTCATCGCACGTGGAAGGAAAAATTCTGTGAAAAGATGGCGGCCAACTATCGATG 

TC T CT GAAT GC AAC CAT G GT AGT AT CGT AT C GG AAAAAT T GT TT GG TTTTTGCCT GAT AT AT AT T GT AT AAGAAT GT GC AAT AAT T AACT T AACC 
GAATAAATGATATTCCATTTAATTTTCATTAAATCATTCATAAAATACTTAATATATGAAAAACTTTCATATTTATATGTTAATTCGATGTATCG 
ATTGCCTATCGCTGTCTTCGGTGATCTAGCCCAAACAGCTGATTTTACCAAAATACTTACCACACGTGTGCCACTTTTTAAGTTATATTGTTTAC 
AGTTTTTGGGAATTACAGCTTACAATGGTATATATAAATCTTTAATATTTAATTAAACCCGACCTAACATCTTCAGTTCTTTTGCAGGGAAAACA 
CAAGAAAACGCAGAAGGTTAAGAAACAACGTAATGCACAGCTTAAACGCATCATAAAACCCACAGACGCGCGGCTTAAGGATCAGATTCGAGTGA 

AAAG AAAG AAG GC C GAG GAT CC GC AC C AG AT C AAGGT GC AC G AGG CC AC C C AGC AG AGCT CCGCTCTGTTCTT C C AGT AC AAC ACC C AGC T G GG A 
CCGCCGTACCACATCGTTCTGGACACGAACTTCATTAACTTTAGCATTAAGAACAAACTGGATATTGTGCAAGGTATGATGGACTGTCTGTATGC 
CAAGTGCATACCCTATATCTCCGACTGCGTGCGTGCCGAGCTGGAGAAGCTGGGCAACAAGTACAAGCTGGCCCTGCGCATCATCTCCGATCCCA 
GATTCGAGCGACTGCCGTGCCTGCACAAGGGCACCTATGCGGACGATTGCCTCGTGGAGCGGGTGCGCCAGCACAAATGCTACATTGTGGCCACC 
AACGACAAGGACCTTAAGAATCGCATCCGGAAGATTCCCGGCGTGCCCATCATGTACGTGGCTGCGCATAAGTACGCGATTGAGCGTATGCCAGA 
AGCGTATGGAGTCAAGGCGTAGACCTTTTTTACTTCTTATCCTAAGTGTACATTTTTAAATATATTCTTACATGTAAGCAGCAAGCAATGCGTTT 
CGTATTTCCTGTTGTTTTTTAGTAGAATGCTTTTACTGCATTGTCTGGATCAACAAAGGAGACATAAGTTCTATTAAGTTACAAATAGTTTAACT 
ACAACAGAATCTCCAAATCGACCAATGTCCACAACAATAACAAACACGAAGCGATTACATTTAAAAGTTTATGACTTTATTCAAAAACTCTGGAT 
TGATTGGCCTCGAAAGTGTTCACTTAGGCATCGGCGTGCAGACGAGGAGTGGGCTTGGACAGGAAGTCCGAGTATGCCTGGTAAGGAGTGGAGCC 
CAGAGGCATCTCCTTCCACAGATCGGGGGTCAAGTACGCGTACGTCTTGGCGATGGCGGCATATGTAGCCTTGGCGAAGTTGCCGAGGGTTCCAG 
TGGAGCCACGGGCCGAGGTGTAGCAATCCTCAATACCGGCCATGGTCAGCAGCTTCTTGGGCACGGGGGCCGAGACAATGCCAGTACCACGGGGA 
GCGGGGATGAGGCGCACGGAGACGGAACCGCACTTGCCGGTGACCTTGCAGGGCACGGTGTGGGGCTTGCCGATCTTGTTGCCCCAGTAGCCACG 
GCGCACGGGCACCACGGAGAGCTTGGCCAGAATGATGGCACCACGGATGGCGGTGGCCACTTCCTTGCTGCACTTAACGCCCAGACCAATGTGGC 
CATTGTTGTCGCCGATGGCAACGAAGGCCTTGAAACGGGTACGCTGACCAGCACGGGTCTGCTTCTGGACGGGCATGATCTTCAGCACCTCATCC 
TTCAGCGAGGATCCCAGGAAGAAGTCGATGATCTCGAACTCTTTGATGGGAAGCGAGTACAGGTAGATCTCCTCCAAAGACTTGATCTTGCCCTC 
GCGCACCAGGCGTCCCAGCTTGGTCACTGGCACCCACTCCTTGGAGTCCTCCTTTCCACGTCCACGGGCCCA 
{SEQ ID NO: 487) 

Exon: 1001.. 1072 
Exon: 1133.. 1732 
Start ATG: 1409 

Transcript No. : CT14682 ™™ 
AAATACTTACCACACGTGTGCCACTTTTTAAGTTATATTGTTTACAGTTTTTGGGAATTACAGCTTACAATGGGAAAACACAAGAAAACGCAGAA 

GGTTAAGAAACAACGTAATGCACAGCTTAAACGCATCATAAAACCCACAGACGCGCGGCTTAAGGATCAGATTCGAGTGAAAAGAAAGAAGGCCG 

AGGATCCGCACCAGATCAAGGTGCACGAGGCCACCCAGCAGAGCTCCGCTCTGTTCTTCCAGTACAACACCCAGCTGGGACCGCCGTACCACATC 

GTTCTGGACACGAACTTCATTAACTTTAGCATTAAGAACAAACTGGATATTGTGCAAGGTATGATGGACTGTCTGTATGCCAAGTGCATACCCTA 

TATCTCCGACTGCGTGCGTGCCGAGCTGGAGAAGCTGGGCAACAAGTACAAGCTGGCCCTGCGCATCATCTCCGATCCCAGATTCGAGCGACTGC 

CGTGCCTGCACAAGGGCACCTATGCGGACGATTGCCTCGTGGAGCGGGTGCGCCAGCACAAATGCTACATTGTGGCCACCAACGACAAGGACCTT 

AAGAATCGCATCCGGAAGATTCCCGGCGTGCCCATCATGTACGTGGCTGCGCATAAGTACGCGATTGAGCGTATGCCAGAAGCGTATGGAGTCAA 

GGCGTAG 

(SEQ ID NO: 488) 
Start ATG: 349 

MDCLYAKCIPYISDCVRAELEKLGNKYKLALRIISDPRFERLPCLHKGTYADDCLVERVRQHKCYIVATNDKDLKNRIRKIPGVPIMYVAAHKYA 

I ERMPE A YGVKA* 
(SEQ ID NO: 489} 

Classification: hypothetical 



Celera Sequence No. : 142000013383795 

GCCGTCCACTAGCTGCGGACAGGAATATAATCGGATTTGGTACAGAGTTCTTGTGAGGATGTAGTTGATATACGTACGCCATGCTTGTTGAGAAA 
GGCACAGTAGGCCTCCGCCTCGCCGGGACCTTGGACGCACTGGATTCCCATGGAGAGCAGCAGCGTTTCGCACTGCTTGAGAACATGGTTGAACC 
GGCTGCGTCCCTTGTCGCCCTTGGACGGCTGACTTTGGGTGCACTCAGGCGAATTCTTGGGCTTCACTCCCCGAAACTGGAGCTCATTCCTTTTG 

FIGURE SHEET 268 



GCTATCACCTGGCTCTTGAGTTTTGGAGCAACTCCCTCCAAAACGAAAACCGGAGTAACTTGCTCCCAAATCAAGTAGCAAGTGCGAAAGAAGAG 
GTTCCTGTTGGTAAAGCGTAAATATTAATATTTTAAAAGAATATAAGGTATTAAGAATTTTTTAAACCTACTTTAAATGATGCCTTGGATGCACA 
AAGTAGTCCACAACATTAAGAGATTCGCACACCCAGCCGGCCAGGTCGATGGCCACTTTCTTTCCCCGCAGTTCATTAATGGGTTTACGTTCACA 
ATGGGGCGTTAAAACACCCCATAATTCCTTGACGCCCATTTAAATAAGCTAAACAAAAACCCCGCGAGAAATTCGCAACAAATTTAAACAATCGC 
TTCTTCGTCAGAGTTGCCCAATCAGTTACTTTCTAGCTACTTGAAGCTATATTCAAAAGCAGCCAAGTGCAAAGTTGTGACAATAATTGGCAATG 
ATAACTTTTTTAACAATAATACATGATGCTATTTTCAATAACAAATATTATACATATAAATGTCATCTTATTCGATTTTAAGAACACATTTCGTT 
TTTAGCTTTTTTTGAGCTTGTCTCTAGCTACATTTGACAGCACTGCGACTTCCGCCAGTCACTGCGAAGCATGCTCGATAACCGATAGCGGGCTT 
TGTTTTGACAGCTGCACAAGCGACCGCTGCGAATGGCCAGTGGTCAGCAGTCTCTTCTTCTTCCACGTCGACGTGTCGTGTCAGCCAAAAGAAAG 
TCGAAATAAAATATTGAATTGAAGCTGCCCCTTTGCGAGTAGGCCCGACAAAATTCAGAAGCTGCCAGTCGTCCGTACGCCAAACCACACACAAC 
CACCACCGGCAAGATGGTTCTCGCGGATTTGGGTCGCAAGATAACAACGGCGCTGCACTCGCTCAGCAAGGCGACCGTAATCAATGAGGAGGCAC 
TGAATTCCATGCTGAAGGAGATCTGCGCAGCGCTCCTGGAGGCTGATGTGAATATCCGACTGGTTAAGCAGCTGCGCGAGAATGTGCGTGCCGTT 
ATT GAT T TC G ATGAGATGGCC GGT G GC CT G AAC AAG AGGC GAAT GAT C C AAT C GGC T GT GT T C AAGG AGCT C GT C AAACT G GT CGAT C C C GGCGT 
GAAACCCTACCAACCCATCAAGGGCAAGGCCAATGTGGTCATGTTTGTGGGTCTGCAGGGCTCCGGTAAGACCACCACCTGTACCAAGCTGGCCT 
ATCACTACCAGAAGCGCAACTGGAAGTCGTGTCTCGTGTGCGCGGATACCTTCCGTGCTGGTGCCTACGATCAGGTCAAGCAGAACGCCACCAAG 
GCGCGCATTCCGTTCTACGGCAGCTACACGGAAATCGATCCCGTGGTCATTGCCCAGGACGGTGTGGATATGTTCAAACGCGAGGGTTTCGAGAT 
GATCATTGTGGACACCTCGGGTCGGCACAAGCAGGAGGAATCACTGTTCGAGGAGATGTTGGCAGTGTCCAATGCCGTCAGTCCCGACAACATAA 
TCTTCGTGATGGACGCCACCATCGGACAGGCTTGCGAGGCTCAAGCGAAAGCCTTCAAAGACAAGGTGGACATCGGTTCGGTGATCATAACCAAG 
TTGGATGGCCACGCCAAGGGAGGCGGAGCTCTGTCCGCAGTGGCAGCCACACAGTCGCCCATCATCTTCATCGGTACGGGCGAGCACATCGACGA 
TCTGGAGCCCTTCAAGACGAAACCCTTCGTGAGCAAACTGCTTGGAATGGGTGACATCGAAGGACTGATAGATAAGGTGAGACATGCCACACGAG 
CTTAAAATTATACCGCACTACAATACAATTGAAAACACGTACTGAATATAAACTCATTTTGCACAGGTAAACGAGCTGAAGCTAGACGGAAACGA 
TGAGCTGTTAGAAAAGATCAAGCACGGTCATTTCACCATCCGCGACATGTACGAGCAGTTCCAGAACATCATGAAGATGGGCCCCTTCTCCCAGT 
TCATGAACATGATACCCGGTTTCTCGCAGGACTTTATGACCAAGGGCGGCGAACAGGAGTCGATGGCGCGCGTTAAGCGTATGATGACGATGATG 
GACAGTATGTCGGATAATGAACTGGACAATCGCGACGGAGCAAAGCTCTTTAGCAAGCAGCCAAACCGCTGCGTTCGTGTGGCCCAGGGAGCTGG 
TGTCATGGAGCGCGAGGTCAAGGAACTGATTGCACACTATACGAAATTTTCGGCGGTAGTAAAGAAGATGGGCGGCGTTAAGGGACTGTTCAAGC 
AGGGCGACATGACCAAGAATGTGAATCCAACACAGATGGCCAAGCTCAACCAGCAGATAGCGAAGATGATCGATCCCCGGATGCTGCAACAGATG 
GGCGGCGTTGGTGGTATCCAAAATATGATGAGGCAACTGCAGCAAGGAGCTGCGGGTGGTTTGGGTGGCCTGGGGAACCTGATGAGCGGTTTTGG 
TGGCAAGTGAGCAGGTGGCGTGTAAATTTAACGATTTTTATATGATTTTGTTTTCGCGTAGTTCGTAATTTGTTAGCCATTTGTTTCCATTTGTC 
CAAGGCCACAAAAATTGATACGCGTTGTGGAAGCGAATATAGCGTATATAGCCTATAAAGCCCATATAGATGTGCAAACAGTTAACAAATTCGGT 
TTTTGTACGGCTTGGCAGCCACAGTGTCAGTTTTTATAGTTTAAACAGTTTTCTGTACACGAACTAATGGAATTAGCATGCCTATCAGAATACCA 
GTAAGGCAATTGAACTAGATATAAACAATAAATAATTAATATATGAACTTGGCTTTAAATTGGTTGTGTTTATTGGGTTTAATCTCGGGCAACGA 
AAAGCAGAAATCC AGTCAC CC TGTT GAC CTAACACATACAACTAAAAC AAAAAACAAGTTGCGAAC AGGTGCCC AGAAAGATCGAAAGGAT GCC A 
ACGCAGCAAGGAAACCAAAATAAACAAACAGGCTGGCGGGCAAATTGTTAATGCTGTTTTTGGGGGAGGGTCTGTACACATACTCAAATGTATGC 
CTAGTGGGGTTGTACACTCAAAAAAAAAAACCTGACAGAAATTTATTTTTATTTGTCTGAATCATAATGGGTTATCATAGTGGATTATTTTGCAA 
ATCTTTGGAAATTTATATTCAATTTCTCTTTTTAAGTTTTGACAACACTTTTTTCTGCGTGTGGTGTTTGGTGGCTCAGCTTTTCTGGTGGGGCT 
GACAGCATCCCTCGATTCTGTCCCGGAAGTTGTGCTTCGTCTGGGATTTTGGGGTTGTCACTTTCTCTCTCCCTTGCGTGCTGAAAACTCCATAA 
TCCCAGCCACTCTGCCTTTTGCAATAACAATGAAATGGCGATTGCCTGCCATAAAAATGTCTATTAAGCTGTCAGGTTCTGACACTCATAAAACC 
ATCAAACAAGTCAAACGAACAGGATAGCAGCAAAATCAGCAGGACGAACCGCTGTCAAAGTTTTCGGCCTCATCCTGGCGGCTTCCCCACGCACA 
CATAGGCTCACATTCTCGCTAGGAGATTATACATGTGTATGTGTGCTGCGTCCTTTCATATTTCTTGCGCCCCCCACAAAAGGACTTTATTTATG 
GGTGGGTGGGATGAAAAGATACAGATACACGCCCCCACACAGAGTCCAAATTATTTTATGAGCGCAGAGGCAGAATGCCTTTGGCATCCTTGGCA 
GGATAACCCTTCTCCAGAAGGATATTTCCTTAGATTTCCAACCCCAGCGTAAGATTTGCTGAAAAAAGCGAATCGAATATTCTACATACTAAATT 
CC 

(SEQ ID NO: 490) 

Exon: 1001.. 2071 
Exon: 2157.. 3087 
Start ATG: 1154 

Transcript No. : CT14984 

TCTCTTCTTCTTCCACGTCGACGTGTCGTGTCAGCCAAAAGAAAGTCGAAATAAAATATTGAATTGAAGCTGCCCCTTTGCGAGTAGGCCCGACA 
AAATTCAGAAGCTGCCAGTCGTCCGTACGCCAAACCACACACAACCACCACCGGCAAGATGGTTCTCGCGGATTTGGGTCGCAAGATAACAACGG 
CGCTGCACTCGCTCAGCAAGGCGACCGTAATCAATGAGGAGGCACTGAATTCCATGCTGAAGGAGATCTGCGCAGCGCTCCTGGAGGCTGATGTG 
AATATCCGACTGGTTAAGCAGCTGCGCGAGAATGTGCGTGCCGTTATTGATTTCGATGAGATGGCCGGTGGCCTGAACAAGAGGCGAATGATCCA 
ATCGGCTGTGTTCAAGGAGCTCGTCAAACTGGTCGATCCCGGCGTGAAACCCTACCAACCCATCAAGGGCAAGGCCAATGTGGTCATGTTTGTGG 
GTCTGCAGGGCTCCGGTAAGACCACCACCTGTACCAAGCTGGCCTATCACTACCAGAAGCGCAACTGGAAGTCGTGTCTCGTGTGCGCGGATACC 
TTCCGTGCTGGTGCCTACGATCAGGTCAAGCAGAACGCCACCAAGGCGCGCATTCCGTTCTACGGCAGCTACACGGAAATCGATCCCGTGGTCAT 
TGCCCAGGACGGTGTGGATATGTTCAAACGCGAGGGTTTCGAGATGATCATTGTGGACACCTCGGGTCGGCACAAGCAGGAGGAATCACTGTTCG 
AGGAGATGTTGGCAGTGTCCAATGCCGTCAGTCCCGACAACATAATCTTCGTGAT GGACGCCACCATCGGACAGGCTTGCGAGGCTCAAGCGAAA 
GCCTTCAAAGACAAGGTGGACATCGGTTCGGTGATCATAACCAAGTTGGATGGCCACGCCAAGGGAGGCGGAGCTCTGTCCGCAGTGGCAGCCAC 
ACAGTCGCCCATCATCTTCATCGGTACGGGCGAGCACATCGACGATCTGGAGCCCTTCAAGACGAAACCCTTCGTGAGCAAACTGCTTGGAATGG 
GTGACATCGAAGGACTGATAGATAAGGTAAACGAGCTGAAGCTAGACGGAAACGATGAGCTGTTAGAAAAGATCAAGCACGGTCATTTCACCATC 
CGCGACATGTACGAGCAGTTCCAGAACATCATGAAGATGGGCCCCTTCTCCCAGTTCATGAACATGATACCCGGTTTCTCGCAGGACTTTATGAC 
CAAGGGCGGCGAACAGGAGTCGATGGCGCGCGTTAAGCGTATGATGACGATGATGGACAGTATGTCGGATAATGAACTGGACAATCGCGACGGAG 
CAAAGCTCTTTAGCAAGCAGCCAAACCGCTGCGTTCGTGTGGCCCAGGGAGCTGGTGTCATGGAGCGCGAGGTCAAGGAACTGATTGCACACTAT 
ACGAAATTTTCGGCGGTAGTAAAGAAGATGGGCGGCGTTAAGGGACTGTTCAAGCAGGGCGACATGACCAAGAATGTGAATCCAACACAGATGGC 
CAAGCTCAACCAGCAGATAGCGAAGATGATCGATCCCCGGATGCTGCAACAGATGGGCGGCGTTGGTGGTATCCAAAATATGATGAGGCAACTGC 
AGCAAGGAGCTGCGGGTGGTTTGGGTGGCCTGGGGAACCTGATGAGCGGTTTTGGTGGCAAGTGAGCAGGTGGCGTGTAAATTTAACGATTTTTA 
TATGATTTTGTTTTCGCGTAGTTCGTAATTTGTTAGCCATTTGTTTCCATTTGTCCAAGGCCACAAAAATTGATACGCGTTGTGGAAGCGAATAT 
AGCGTATATAGCCTATAAAGCCCATATAGATGTGCAAACAGTTAACAAATTCGGTTTTTGTACGGCTTGGCAGCCACAGTGTCAGTTTTTATAGT 
TTAAACAGTTTTCTGTACACGAACTAATGGAATTAGCATGCCTATCAGAATACCAGTAAGGCAATTGAACTAGATATAAACAATAAATAATTAAT 
ATATGAA 

(SEQ ID NO: 491) 
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Start ATG: 154 



MVLADLGRKITTALHSLSKATVINEEALNSMLKEICAALLEADVNIRLVKQLRENVRAVIDFDEMAGGLNKRRMIQSAVFKELVKLVDPGVKPYQ 
PIKGKANVVMFVGLQGSGKTTTCTKLAYHYQKRNWKSCLVCADTFRAGAYDQVKQNATKARIPFYGSYTEIDPVVIAQDGVDMFKREGFEMIIVD 
TSGRHKQEESLFEEMLAVSNAVSPDNIIFVMDATIGQACEAQAKAFKDKVDIGSVIITKLDGHAKGGGALSAVAATQSPIIFIGTGEHIDDLEPF 
KTKPFVSKLLGMGDIEGLIDKVNELKLDGNDELLEKIKH^ 

SMSDNELDNRDGAKLFSKQPNRCVRVAQGAGVMEREVKELIAHYTKFSAVVKKMGGVKGLFKQGDMTKNVNPTQMAKLNQQIAKMIDPRMLQQMG 

GVGGIQNMMRQLQQGAAGGLGGLGNLMSGFGGK* 

(SEQ ID NO: 492) 

Classification: known_f lybase_gene 
Gene Symbol: Srp54k 
FlyBase ID: FBgn0010747 



Celera Sequence No. : 142000013384547 

AGCATAAAGCTGCATACGACCTCTCCTCCAGTGACGGAAAGAAGGCTCTGGACATTGCAGTGGACCAGGAAAACGCAGACATTGTAACGCTGTGA 
GTAAGAAAGCAGAGCTTCAATCCGGCAGATACATAACCATTCTTGTTTCAGACTCCGATTGACACAACTGAATGACGAGATTGGACCCAACGACG 
AGT AC AAT GG C G AAG AT G AG AC GT AC AAG AAC GT AATG AAGG AC TTCTCCAAGT TAC CAGC C AGC C AGAC GC GA GTT C TGC GC C AGC GGC AC G AC 
TCAAATACACTCGAATCGCAGCGCAGCTCGCTTACAAACTCCGACTAGCAGTGGCAGAAGGGCCGCAAGAGTTAGGACCTTACTGGATATTATGG 
AATAATTTGTTCCAATTATTAGGATAGCGTGATATTGGATTGTTATTGGCTTTATAACGCATTAACCAGGAATGTTAAACGGTTTCGACCTCTAA 
GGATGTCGTTGCGGGCTTGTTTCCCTTGGTAATCTCATTTGAAAAGCGCTTCCGATTATACAAGAAAATATTATTCATTGCATTTGGCATTCGAA 
TGGCAGGCACGATCGTTCGTTTTCTCTTAGGCGCATTTTGAATCCGTTTTTGGCGATTAGCTTAAGTTTAAAGTATTCGTAGAGCAGTTGAGTTT 
ATTAAAAATACGTATATTTGCCTATAAAATTGTACATCGTCTTATGTGGTATTTATTAAACGATATTGTAAAATGTACGTCTAGCTATATGTATC 
TTGTTACAGAGCTGTTCCAAATGTTCCGATTTGTATTTGTTTGCTAAACTGTTTAATGCTGCATGTTTCTAAATGTTTTAACGCTATCATGTTGA 
TGATTTTTGGTGCCAACTGTTGTGAGCACTCTAGGTTCGCGAATAAAGTTGTAAAATATTCAGATCAATCAAAGTTGGTCTCTTCAGGTGTAAAA 
CATGGTGGTCCAATCTGCGGTGTATTGGTAAAACATGTCCAGGGACTATTCTAGGCCACAGCTTGAAACTCGTTCGTCTCGGATATGTAGTCTGT 
GAGCTTGTTCTTCACCTTGTGCATTGTGGTTACATGTCTCTGGACCGCACTGCGATCGTTAAACTGACGGCCGCACTGTTTGCAGGAGAACATGA 
CGCCAGCGTGGGTGACCAGATGGCGACTGAGGCCCGAAACATGATTGAAGCTACGATGACACAGATGACACTTGTGCGTCCGAGAGTTGCCCATA 
TGCACGCGGCCAATGTGCTGCTGCAGGTTGGACTTCTGGGCAAAGCAGCTCGAGCATTGGGGGCACTGATGGCGCTTTACTCCCGTATGGGTGTC 
CATGTGACGCTTGAGCACTTCGCGCTGGGAGAAGGACAGTTCGCACTCGCTGCACTTGAAGGGTTTCAGGCCCGTGTGGATAATCATGTGGCGTT 
TGAGCGAGCTGGCGCACGACAGTTGCAGTGCGCAGCTGGGACACTTGAGCGTGCCGCCCTTTCCACTTTTCAGTCCCTTAATGTGGGTGTTGTGG 
TGGCGCGTCAGAAGTTTCTGCGTAGGAAATGCCACCGGGCACAAATGGCACTGGAACGACTTCACCGTGTCCGTAATTCGACCGGTTTTTGGTTG 
AGGACAACTGGACTCCTGCTTGAACACCAACTCACTCGATCTCTTTCGTTTGAGGGGCGTCTGCTCATCTCTATCGTTCTGTTCTTTTTCCTACA 
GATAATAAGATCCAAGTTGAATTCGAGTTCAATTTAAAATCACACGGTTAGTTACCACTATAATGACATTTTCATGGAAGCTGAAGACCTGTTGC 
TCCTGAGATATCTCCTCGCCCAGATTAAACACGACCACGTCTTCGATTTCCTCTGGCTTTTCGCCACTGAACTGAATTTCGCCGTCTTCTTTGAC 
T AAG C AAAG AATT T AAT G AAAT GAAAAT TTTTTCACATTTTTTTT GC C AAAT T AT AC C C ACT T AC T GT T AT AGG AGGT GAT TCCGTTGTCCTGGT 
CAAATTCGGGAGTGGAGTCCTGGTTTTGATCCTCTGAGCTGGCAGAGTTTATGATGTAAATCTCTCCATTGTCTGTGTGCACTACATCTGATCCA 
ACCATGTTGATTTCCCTGAAGTCTTCACCGCTTTCCGCGGGCACTTCCTCGTCCAGTTCGAGGCCATCTATGTTTACATCGCTATTATTTGCGGG 
GTCGGTATTGCAGCCCAATATGAATTCCTCTTCTTTTGCATCCTTCCTGGCCTCTGTCGCCTTTAGATTGGCGTCGACCAAGCTGGGTCGTCGCT 
TTGGAAGAGCTTTAGTTTTTGACCTAGAAGGGGAGGCATCTTCCTTGACTTGCGGCTTGGTCTCTGGTTCGGACCCATCCAAAGACTCCCACCGA 
CCGGCCTTTTTCTCGGTAAGATACACCTCCAATTCGCCCAACAGCATGGCCAGCTGCTCTTTTCTTATGTCCTCGCTCACCTCGTTTCGCAGGTC 
ATCTCTAACCTGGCTGCGCACCTCCTCGGCCAGCTCCTTGCGAACCTCACTGCGCACCTCCTCCGCTAGTTCCTTGCGCAGTTCGGTGGTCAGTT 
TCGCTCGAAGTTCTAGACGAGATTTGACCAGCCGTTTGTGCATTTCCGCCGCCTGTTCTTCCTTAAAATCTTCTCGCCACTTTTCGTCCTCAAGC 
TTGCGCACTTTGTCCTGCACCTCGATGAAGGCAATAGAGTCCTCGAACTCCAGCTCGTCCTCGTCATAATCGTCCGGATCCTTGGAGAAGACGCG 
GCTTCGCTTCCCGAGACCCAGAAGTCGGATATGCTTGCGAAGCTTCTTCTCCGCCCATTGGCAGCGCTGCCGGAACAGGAATGACTTCTCCAGCT 
GATAGCGGCACTCCTGGCAGATCCTCTTCGGCAGAGTATCCGTCGCCTTGGCGTCCAGTGCGACGCAGGCCATCAGGCGCACGGGTATCGAGAGC 
TGGGTGTCGTCCTCGTCGAAGATCGCGTACGCATCCTGGTGCTCCTTGAGGCAAAGGCGGCACAGGGTCGCCAAATCGGGATACGACGGCTGTGC 
ATCCATGAAGGCGTCTATTTTTGGTAAATCAACGTATTTACTCGGTGGCGTCTCTGCATCGCAAGCAATGTGACCGCAGACAGGTTCGCTAAAAA 
TACCTCCTCTCGAAACGAAAGCAATACAGTCACACTGTTCGTAAATAAACAATTTGATATATACGGCCCATAAACGTATCATAACTTAATTTATT 
AATGTAAGGTTTAAAGGTATTGATTTTCATACAAATCCAATGCATACCGAATGTGCATATTTAAAGTTTAATAGAATACAATGACAATTAAAATA 
GGTGATGGCTTAGTCAAATATGCGTGCACAAAAAAAGGTACTATTCTTATAATATAGAGGATTAAACTAAATTCGTTGTTACGTGGCTACGTCAC 
GGTGTTATTTAGCTTATCCGCTCCTCTTAGACGACAGCGCACACAACTTTTGAAATCGGCGTACGAACTGCTGCATTTTGTCTTGAAGGAACTGT 
GCTTTCGGCACAGGTTATCGGTCGTCTGCTTGGGGCTCATAAAGCGCACTTTCTTTCGACACGGCGATGGGGTTTTTACATCCTCTAAGGCAGTT 
GTGTTGGCACCCAGAAAGGTGTCCTTGCTATAGTCGTTGCGCAGATGACTCAGCTGCGGGTCGTTCATGTTGATCTCAACCATGGGCACAGCAAC 
TAGTGGCGTTGCAGCTGGCTGTCTGCTGGCTTCTGGTTCATCGTCTATTACATTTGAGAAATATATAAGTCGTGCAATTGCAAAGTAAGTGGGCT 
GGTCTTACCCAGTAACTCCACCACGCTGTTATCGCTCTGCTCCCTTAGCTCCTGGCCCATGTCTGTTTCGTAGAAGCCAGCGGTCTGTTCGTCAG 
GCAGCGGACTGCTTAAGTTCATCTTCATCTCATAGGTCGCCTCATTGCGCTTGCGCACAAAGTACCGTTCCAGTTCGACAAACACGTCGTCCAGA 
ACGCTCAGTCGCAACTGCTGCTTGGCTTCGGGTATGATGACCTCTTCGAGTTCCGCCCGCAGCTCCAGCTCGATTTCCGCCCGCACAGCTTGCTT 
GCAAGTGCTGAGCAGCTCCTTTTTATACGCGTCGATCTCGCTGCTAATGAGCTGGGCGGCCTGCTGGCGCCACTGGGCATTGTTCTCGGCGCAGC 
CAGTTTGCGTGCATCCTTCCCCGGAGCCCCGCTCCTCCTCCACTTCACTCAGCTGCGACTTGACACCCTGCCGCTGCAACGCCTTGTGGCGCCTC 
AATTTGCGATCGGCTGCGAGGCAGCGTCGTCGGAAGCAGTGCATCTCCTCCAGTTGCAGCCGACACATGGTGCATATGTGCTGTGGCAGACCGTC 
GCCGGGCTCGATCTCCAGCGATGTGCAGACCATTATCTGCAGAGCAGTCGATCCATTTGCACCGAAAACGGGGAAATTGGGTTCCTTCGACAAGC 
ACAGGCGACAAACATTCATGTTGAATTTCAAGGCAATTCGGAGCGTGGCAGCCGGACACCTGATTAGGCACGATTCGGAATGATGGACAATCTAG 
CAATCTTTCCAAGCAACCAGTTTAGAATATGAAACAAAAAAACATTTGAAGACAAATAACTAAAATTGGAAGCGGAGGAATTTCCTACTAGGAAA 
CATCAACAAGTCTAGCCGATTTGTTATCGATATTTTCAGTGTGGCCATGCGGTTGAGAGAATGCATTTTAGTATTTCAAGGCTGGCTGCTCTGCC 
ATTGATGTGCCCGTATTTCAGCAAGGCGCTTGGCGGGAGAATTTCAAATAGTCAAAATTTTACAACGATACTATAGCGCTCATTTGAATAACCTG 
AAATGCAGCAATAACACACATTCATTTATCAAACAATTTCTTTATATTTCATTAAATATTAAGTATAAATACGCAGTTAACTATTAAGTATTATA 
TACACAATTACAATAAGATATTCAGTCACTATAATGCTTCTTCTATATACAAAGTACGATATATATTATATCAAAGGGCTTGAACTCGCCGCCGC 
ACCTCAATAAATTATGCAATTCAATTCCATGATAATATACAGTAACAGTCGTCTGTGTTCTCCGGGAGTTTGCCGATTGATTTTCTCAGGATGAT 
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TTAGGATCTGCGCCGAGGATTGCCTACTTACAGCTAAAATGTTTGCAAAAACAATAAGTCAAGGACATTTAACAGTGTTTTCATTTCGTTTCGCA 

TCGTTTTGATTTCATACTCATTTGCTCTTTGCTTTATCGCTTGGGTGGGGTGTTTTGTGTGTGGGTTTTTGATTTGTGGTGGTAAATAAATAGTA 

TCGATCGATTATTAACGATATTCAACATTGAAATACATCAATCATTAGGCATATACATAACTACAGGATATAATACAATACAGGATCGTTGGGAT 

CAGGATCGCTCTTCCAGGATTCCATCTCAATCGTGGCGCCAGCTCTGCGGCAGAACGTAGTCCTTGACCTTGTAGAGCGCATTGGCATACCAATG 

GATGCCATTCTGCTGCTGCGCCGAGCTCACCACCTTCGGCGAACTGTGCMCTGCTCCTTGGCGGGCAGCCACGCCTCCAGCGTGGACAGCAGGC 

GC AT GGGAGCC AGT CCC C AGTG 

{SEQ ID NO: 493) 

Exon: 4627. .3809 
Exon: 3749. .3421 
Exon: 3169.. 1966 
Exon: 1899. .1766 
Exon: 1705.. 1001 

Start ATG: 4484 (Reverse strand: CAT) 
Transcript Wo. : CT14997 

AATTTTAGTTATTTGTCTTCAAATGTTTTTTTGTTTCATATTCTAAACTGGTTGCTTGGAAAGATTGCTAGATTGTCCATCATTCCGAATCGTGC 
CTAATCAGGTGTCCGGCTGCCACGCTCCGAATTGCCTTGAAATTCAACATGAATGTTTGTCGCCTGTGCTTGTCGAAGGAACCCAATTTCCCCGT 
TTTCGGTGCAAATGGATCGACTGCTCTGCAGATAATGGTCTGCACATCGCTGGAGATCGAGCCCGGCGACGGTCTGCCACAGCACATATGCACCA 
TGTGTCGGCTGCAACTGGAGGAGATGCACTGCTTCCGACGACGCTGCCTCGCAGCCGATCGCAAATTGAGGCGCCACAAGGCGTTGCAGCGGCAG 
GGTGTCAAGTCGCAGCTGAGTGAAGTGGAGGAGGAGCGGGGCTCCGGGGAAGGATGCACGCAAACTGGCTGCGCCGAGAACAATGCCCAGTGGCG 
CCAGCAGGCCGCCCAGCTCATTAGCAGCGAGATCGACGCGTATAAAAAGGAGCTGCTCAGCACTTGCAAGCAAGCTGTGCGGGCGGAAATCGAGC 
T GG AGC T GC GGG CGG AACT CG AAG AGGT CAT C AT ACC C GAAGC C AAGC AGC AGTT G C G ACT G AGC GT T CT GG AC G AC GT GT T T GT C G AAC T GG AA 
CGGTACTTTGTGCGCAAGCGCAATGAGGCGACCTATGAGATGAAGATGAACTTAAGCAGTCCGCTGCCTGACGAACAGACCGCTGGCTTCTACGA 
AACAGACATGGGCCAGGAGCTAAGGGAGCAGAGCGATAACAGCGTGGTGGAGTTACTGGACGATGAACCAGAAGCCAGCAGACAGCCAGCTGCAA 
CGCCACTAGTTGCTGTGCCCATGGTTGAGATCAACATGAACGACCCGCAGCTGAGTCATCTGCGCAACGACTATAGCAAGGACACCTTTCTGGGT 
GCCAACACAACTGCCTTAGAGGATGTAAAAACCCCATCGCCGTGTCGAAAGAAAGTGCGCTTTATGAGCCCCAAGCAGACGACCGATAACCTGTG 
CCGAAAGCACAGTTCCTTCAAGACAAAATGCAGCAGTTCGTACGCCGATTTCAAAAGTTGTGTGCGCTGTCGTCTAAGAGGAGCGGATAAGCTAA 
ATAACACCTGTGACTGTATTGCTTTCGTTTCGAGAGGAGGTATTTTTAGCGAACCTGTCTGCGGTCACATTGCTTGCGATGCAGAGACGCCACCG 
AGTAAATACGTTGATTTACCAAAAATAGACGCCTTCATGGATGCACAGCCGTCGTATCCCGATTTGGCGACCCTGTGCCGCCTTTGCCTCAAGGA 
GCACCAGGATGCGTACGCGATCTTCGACGAGGACGACACCCAGCTCTCGATACCCGTGCGCCTGATGGCCTGCGTCGCACTGGACGCCAAGGCGA 
CGGATACTCTGCCGAAGAGGATCTGCCAGGAGTGCCGCTATCAGCTGGAGAAGTCATTCCTGTTCCGGCAGCGCTGCCAATGGGCGGAGAAGAAG 
CTTCGCAAGCATATCCGACTTCTGGGTCTCGGGAAGCGAAGCCGCGTCTTCTCCAAGGATCCGGACGATTATGACGAGGACGAGCTGGAGTTCGA 
GGACTCTATTGCCTTCATCGAGGTGCAGGACAAAGTGCGCAAGCTTGAGGACGAAAAGTGGCGAGAAGATTTTAAGGAAGAACAGGCGGCGGAAA 
TGCACAAACGGCTGGTCAAATCTCGTCTAGAACTTCGAGCGAAACTGACCACCGAACTGCGCAAGGAACTAGCGGAGGAGGTGCGCAGTGAGGTT 
CGCAAGGAGCTGGCCGAGGAGGTGCGCAGCCAGGTTAGAGATGACCTGCGAAACGAGGTGAGCGAGGACATAAGAAAAGAGCAGCTGGCCATGCT 
GTTGGGCGAATTGGAGGTGTATCTTACCGAGAAAAAGGCCGGTCGGTGGGAGTCTTTGGATGGGTCCGAACCAGAGACCAAGCCGCAAGTCAAGG 
AAGATGCCTCCCCTTCTAGGTCAAAAACTAAAGCTCTTCCAAAGCGACGACCCAGCTTGGTCGACGCCAATCTAAAGGCGACAGAGGCCAGGAAG 
GATGCAAAAGAAGAGGAATTCATATTGGGCTGCAATACCGACCCCGCAAATAATAGCGATGTAAACATAGATGGCCTCGAACTGGACGAGGAAGT 
GCCCGCGGAAAGCGGTGAAGACTTCAGGGAAATCAACATGGTTGGATCAGATGTAGTGCACACAGACAATGGAGAGATTTACATCATAAACTCTG 
CCAGCTCAGAGGATCAAAACCAGGACTCCACTCCCGAATTTGACCAGGACAACGGAATCACCTCCTATAACATCAAAGAAGACGGCGAAATTCAG 
TTCAGTGGCGAAAAGCCAGAGGAAATCGAAGACGTGGTCGTGTTTAATCTGGGCGAGGAGATATCTCAGGAGCAACAGGTCTTCAGCTTCCATGA 
AAATGTCATTATAGTGGAAAAAGAACAGAACGATAGAGATGAGCAGACGCCCCTCAAACGAAAGAGATCGAGTGAGTTGGTGTTCAAGCAGGAGT 
CCAGTTGTCCTCAACCAAAAACCGGTCGAATTACGGACACGGTGAAGTCGTTCCAGTGCCATTTGTGCCCGGTGGCATTTCCTACGCAGAAACTT 
CTGACGCGCCACCACAACACCCACATTAAGGGACTGAAAAGTGGAAAGGGCGGCACGCTCAAGTGTCCCAGCTGCGCACTGCAACTGTCGTGCGC 
CAGCTCGCTCAAACGCCACATGATTATCCACACGGGCCTGAAACCCTTCAAGTGCAGCGAGTGCGAACTGTCCTTCTCCCAGCGCGAAGTGCTCA 
AGCGTCACATGGACACCCATACGGGAGTAAAGCGCCATCAGTGCCCCCAATGCTCGAGCTGCTTTGCCCAGAAGTCCAACCTGCAGCAGCACATT 
GGCCGCGTGCATATGGGCAACTCTCGGACGCACAAGTGTCATCTGTGTCATCGTAGCTTCAATCATGTTTCGGGCCTCAGTCGCCATCTGGTCAC 
CCACGCTGGCGTCATGTTCTCCTGCAAACAGTGCGGCCGTCAGTTTAACGATCGCAGTGCGGTCCAGAGACATGTAACCACAATGCACAAGGTGA 
AGAACAAGCTCACAGACTACATATCCGAGACGAACGAGTTTCAAGCTGTGGCCTAG 
(SEQ ID NO: 494) 

Start ATG: 144 (Reverse strand: CAT) 

MNVCRLCLSKEPNFPVFGANGSTALQIMVCTSLEIEPGDGLPQHICTMCRLQLEEMHCFRRRCLAADRKLRRHKALQRQGVKSQLSEVEEERGSG 

EGCTQTGCAEWNAQWRQQAAQLISSEIDAYKKELLSTCKQAVRAEIELELRAELEEVIIPEAKQQLRLSVLDDVFVELERYFVRKRNEATYEMKM 

NLSSPLPDEQTAGFYETDMGQELREQSDNSVVELLDDEPEASRQPAATPLVAVPMVEINMNDPQLSHLRNDYSKDTFLGANTTALEDVKTPSPCR 

KKVRFMSPKQTTDNLCRKHSSFKTKCSSSYADFKSCVRCRLRGADKLNNTCDCIAFVSRGGIFSEPVCGHIACDAETPPSKYVDLPKIDAFMDAQ 

PSYPDLATLCRLCLKEHQDAYAIFDEDDTQLSIPVRLMACVALDAKATDTLPKRICQECRYQLEKSFLFRQRCQWAEKKLRKHIRLLGLGKRSRV 

FSKDPDDYDEDELEFEDSIAFIEVQDKVRKLEDEKWREDFKEEQAAEMHKRLVKSRLELRAKLTTELRKELAEEVRSEVRKELAEEVRSQVRDDL 

RNEVSEDIRKEQLAMLLGELEVYLTEKKAGRWESLDGSEPETKPQVKEDASPSRSKTKALPKRRPSLVDANLKATEARKDAKEEEFILGCNTDPA 

NNSDVNIDGLELDEEVPAESGEDFREINHVGSDVVHTDNGEIYriNSASSEDQNQDSTPEFDQDMGITSYNIKEDGEIQFSGEKPEEIEDVVVFN 

LGEEISQEQQVFSFHENVIIVEKEQNDRDEQTPLKRKRSSELVFKQESSCPQPKTGRITDTVKSFQCHLCPVAFPTQKLLTRHHNTHIKGLKSGK 

GGTLKCPSCALQLSCASSLKRHMIIHTGLKPFKCSECELSFSQREVLKRHMDTHTGVKRHQCPQCSSCFAQKSNLQQHIGRVHMGNSRTHKCHLC 

HRSFNHVSGLSRHLVTHAGVMFSCKQCGRQFNDRSAVQRHVTTMHKVKNKLTDYISETNEFQAVA* 

(SEQ ID NO: 495) 

Name: zinc finger protein 
Classification : transcription_f actor 
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Celera Sequence No. : 142000013384288 

GCTGGCAAGAAGACACGCATCATGTGCCTGCTACGGGAGATGTACGGATCCGGCGTTGAGCGGTTGAGGAGCGAGACCATGACATTCACCACGCC 
ATCCAATCGGAAGGTAGAGGTGATGACGGTCAGCAGCAACTACCACTTGGAGGTGAATCCCTCCGACGCTGGCATGTACGACCGCACGGTGGTGA 
TCGATCTCATCAAGCAGGTGGCCCAGACGCATCAAATTGAGATTAGTGGCCAACGGGAATTCAAGGTGATTGTAATTTCGGAGGGCGACGAACTC 
ACCAAGGATGCCCAGCACGCCCTGCGCCGTACGATGGAGAAATATGTGGCCACGTGCCGGATAATCATATCGGTGAACTCCACATCTCGGATCAT 
TCCGGCCATACGGTCGCGTTGCCTGGGCATACGGGTGGCGGCTCCCAACGAAACGGAGATCGTGTCCATCTTGCAGAACACTTGCAAGCGGGAGG 
GACTCGCACTGCCCGTGGAGCTGGCCAAACGGGTTGTGGACAAGTCGGAGCGCAATCTCAGGCGAGCACTTTTGATGCTGGAGGCCGCCAAGGTG 
GCCAAGGCGCCGTTTACCGCCAATCAGGAGATTCCCGACCTGGACTGGCAGGTGTTCCTGCGTGAGACAGCATCTCAGATTATCAGTGAGCAAAC 
GCCTGCCAAACTGGAGAAGATCCGCGAACGCCTGTACGAGCTACTCACGCAAGGCGTTCCACCTAATCTCATATTCCGCGGCCTGGTCGAACAGC 
TGGTTAACAATTGTGATATGTCCATCAAGGCAAAGACGCTCGAGTTCGCCACCGAGTACGAGCACCGGATGCAGTCCGGCGCCAAGCACATTTTT 
CATCTTGAAGCGTTCGTCGCTCAGTTTATGAACATCTATAAAAAGTTCCTTTCTGAGCTGGACATGACGGATGATTTTTAAACGATTTACACGCT 
TTAGCATTTAATTCAAAAAACTTATTAAATAAACTGATTTAAACGAACTATTAACGTTCTTCCTGAAGATCCGTTCGCATCTGCAGAACGGCCAT 
GCTCATCTTATCATCGGTGGTGCGCTTTATGGTCTCAATATATCTTTGAACTTCTTCTTCGAGTTTAAAGTAAGCCTAAAATAGGAATAGATTTG 
GAATATGATTGGAATGTATTTTGAGTAGAACGAAATTTATATGTATGACTTACCATTTCATAATCAAGATCATCCTTGCCATTAATCTTTAAACC 
GTGCAGCAGCAAATCGATCGTGCTGATGGGGAAGCAGAGCTGTGGGTTTGCTATGTTCACCGCGTGTTTGAATTCGAAATACTCGCTACCTGCTC 
CCAGCATGCCGGCGATTATTTCACATCCCAGATCTGCGGCTTCCAGCAGACGATTGTGTTTCACAAGTAAGTGGAGCAGTTCCCGGGAGTGGGAT 
AACTTGTAGGAGTTTATCAGCCACTGGGGCAAGAAGGCGTTTAACCCGAGAAGCCTTTGAACCACGCTCTTTCTTATCAAGGTGGAGTCTTTTGC 
C TC GTTATC C AC AACC AG T TT T TGAAG T AAAGT CC AGGC CAT AT C GGC AGC AT T AC T T C G AT G AGG T AGAT C T GT T AT AAG AC AT AT A GT AAGC C 
TCTTTTTTTCATACACGTTAAAAAATCCCTACCTGCCATGTCGTTGTTTTGCAGCCAGTTCCAGGCGTCCGACGACTTGTCCTCAGTGGCAGCAA 
CACAAGCGGATGTCAGACTCTCGAATATGGGGAGCACAGAGAACGAGTGTCCCCGCGAAAGTTTCAATGCCGCGGTGTACAGGCCGGAACTAGCC 
AGCAGGTATGATAGCTCCTCGGGTGTGGCCCGCTCGTAGGCAGCTGTATCTTTGCGGTAGAAGGAAAGCTCACGTAAAGCTTCTGCGTGCACCAG 
TTCCCTTCGGATATCAGCCAGCTCTAGAACAACCACCTCCTGGCCTCTCTTGTCCTCGTCACATTTGGGCTCACCATCGTCGTTATCCTGATCTA 
TGGTAATCACCTGCTCATCACCAAGAACAGGCTTGGCAATCCATCTGTAGCGGCTGTCAACTAGGTGCAAACAGTTTAGGCAAATTAACAGCGAG 
GAGCAACGTTTCTCTAGCGCGTTTGGTGCATCGCTGTCTACTTGAAACCGCATGGCCTGCTCATACATAACCGTAGAAGCTGTTCGGAAACAAAT 
TATTAATGCAAGAAGAAATGGAAGAAAATAAAGTAATTTTTATGCAAACCTTTTCTCATGTTACCCTTGTTCGTATGGAATGCATACAAAAAGTT 
GTAGACTTCGTTCTGATCGATGCTCATTGATCGTGCCCGGGATTCAACAATGCTCTCGAACTCGTCTTGCAACCCGATGTAGGAGAACTGCATAA 
GCAGTGGCAGACATTTATTCTGGAACAGCGTTATGACCAACTGGCGAAGGCAATCCTTACGCCTTGAAATATCTGCATTGTTGACTAAGGCGGTG 
TACGCCTCCACGTAGTGTCCCAACTGCAGATGATTATTGAACACTATTGACTGGAACATGGGCAACTGAGGATCATCTGGCTGCAGCACCCTAAT 
GGCCATGTCAGCCAGTTGAATAATGTAGTCCAATGCGGAATGCTGTTCAAAGAGCTGGATAACCTTAAGATAGTAGTGAACAATTGCCAGTTTTG 
TGTCTTCGGGGGAAATTGTATCGCCACGGCTGACGCTATTCTGTAATTTGCCGTATAGAGGGGTATTTTTCAGTACATGCTCAAAGAGGAAGTCA 
TCCTCCACAATGCCAGACTCCGCCTCGTGAAAGAGATGCACGGCCTTGTGGGCCTCACCGCAATCCAATAGAGATACGGCTAGCATAAAGCGTCC 
TGCGGAAAAAAAGGATTAATTTCAAAGCTGGGAAAACTAGCAAATAATGCATACTGGCATGTTTTTGTACGGAACACCAGTTTGACAAAATGCGC 
ACGTAATCCTGAACAATGATGTGATGACAGGTCCCAAATAACCATTCCGGAAATACGTAAATCGGAGATCCCGGCCAACTAAGAGATGTTTGAAA 
CTTAAAACATGATCAATTTTTCTATATATTATACGCACAGCATCTTATTGATGTATCCAACAAGTTGGAGCAAAGACTGGCGTAGATTCAATTGC 
TCCGAGTCCAGCGACAAACTGTCATGCTTCAGCAGCATAGTCAAGGCGCTGAACAGACCTTTCGATTCCAGGAACAGTCGGAGCAGAGTGGTCTG 
ATCGTTCCCATTATGTTTCAGATGGCTGGAGTATGGTCGATTATAGCCGCTGAAAAGTTGGGCGCGGCTCAATCGTTGAATAGATGCCTCGCTAA 
AATTAAAATAGAAGTTAGAACTGTGCAATACAATCAATTATATATCCTTACAATCCAGCTGGGGTACTCGATGAAATGGGTGTCTCGGCAATCCA 
AACCAGTGTATAATATGAGTTCAAATAATTTAAATTTGTTAAAACATTTTCGCTTTCCATTTCGTTCTGACCATAAGCCATGTACTGCAATACCA 
AAAGGTTTCGGCAAACAGAAAACCTAGGGACACATGATTTAATAAATGATAAACCAAGTTCAAAGTTTTTTAAACTATTACGAAAGCTCACCGTA 
TCATTGCCATTTGCTTCACTGTTTCGGACAAAATAGAAAGTCCATATTCGCTACCCATTAGAGCCCCAGACGACTGCAAGAAGCGCGTAGAAAGG 
GAGTAATCTATAAAGTAGAAATGAAATATATGAATCATTTTCAAACATTATATATATTTATAAGTAATTTACCATGCGGCGGTTCATCTGGATCA 
ATCATACACAGCACATCGAGCAGCATTTCCAAAGCGGGTTCAAGATTTGGTATATTCTGCAGCTTCTGTTGGATTTGCCTCACACAGTTTGATGG 
CAATATTGGACCATTATCATCGATCATACTGATTCTAGCCACCAGCTTAGATATTACTTCAACCGGTGACTCCCGCTGGTAAAGCTGTTTATCTA 
GCTCGATTTTAATGTCCTCCGAAATTAGTTTATCAAGCAGTGTCACCACATTCATGAGCTCCACAAAACCCTTGGCCATTTCGGGATCGTTGCGA 
AATAGTGGCGCCACATATGTCGCCACTTCATCGTTATGCTCGCCGATGAGCAACAGGTGCTCGAGCACTTCGCAGGGGCGCAGCAATGCGAATGA 
TTGCCTTCTAACCAAACATACTGCATCCATTCCACCAAGAACGGCGAGTCCTGTCGGCTCGGAAAGCTTAATGTGATACTGTTCGCAGCAGGAGT 
AAAAGCGATCCCACAGCCGAGTGGATATCTCCAAGTACTCCTCATCGGTGACCACAAAGTCTTTCAGTTCGTTTTGGATTTCATCTTCCACGGCC 
TGGCATACCTGCTCCTTGAGCACTGACATGGAAAGTTGCTTTACATCGAATTGCAGGTTAACGCGTCTGAACATCTACGATACAAAAAATTCAAA 
TTACTAAACTATTCCAATAATTCATAATAAAGAATGTTTTGGTTATACTCACATATAAAGCCTTGGCAATTACATTTCGGTCAAATCGACCAGGA 
TGAAATATATAGCTGCAGTAAGTTTCGCGTGGATCCACTCCCTGCTCCATGGTGAGACAATACCGATCCGGTGGCGGTTCTAGAGCAGCCGAAAC 
CCATTTGATTGCGTTGTTGGAGGCAAAATATGCTGCCGAGACATGGAAATCGCCTTCAGCGTTGCTCCAAAGGGCCCAGATGTGCGAGGATGTGG 
CATCGAAGTCGACCAAATCCGTTTGCGGCGCCGGCACCATGTTCTGCACCAAATTGATAACGCTGGCATCGTCGGCATCGGGCATTATGCTAACG 
CATATAAACTCAGCTCGGGAATTGTGCGAGAGAAATAGACAGAAGTTTTGATCGCTTATTTTTCGGAGCTGGCTGTTTTGTGCTGTTAAAAAAAA 
GAATATAATCCCATTATATCTAGATCATAAATCTAAATCGTAATAATATTTTATACTCACGTCCTTGAGCTGCTGAGTCCTGTCCCAGTTCGGTG 
GAGCAGTTAATGCTGGCTACAGTCTGCAAGTTATCCACTGACCATAGACGCAGCTCATTATTGCGATACAGAACCAGAATG.AAAATTTCCCCCCC 
GATTTCACTGAAAGCCATGGACGTGGCTGCCTCCAACGTTTCAGAGCGGCCGCTACAATGATTAATGAAACATTAGGGATTTGCCATTTATCGAG 
GTTTATCCACTTACGTGAGGGCACCCTTGAGATTCGACAGAAACCATGGCATTAGCTTCGGCTCCTTGATTTCGTGGGTGATAGTATGGCCGGTG 
GAGCATTTCATAACGTGAAGCAGTAGTTTGCTCTGATACGCCACCGCAAAGTAGGCTGACTGGGAGTCTTGAGTGAGGTACGATACAGCCGCGTT 
GGGCATAGTTCCAAAGCCATCCGTGACATAGTAAGTGCTGGGATCATTGATCTTGTCATTGACGTCGTAGAAAATGGACTGGGATAGCAGATCCT 
CTGGAGAAGCTGCGCCACCTTCCTGCCCAGCGACCTTCAAGGGAAACACATAGCGATGCACACTGCTGACCGTGACCACCAGCAGGGTGACGGAT 
GTACCTTGCTCCGTGAGTGATACATTCAGCACAGCGGAGTCGGTGAATCGCAGACGCAGGTGATTCCTCTGCAGGGAGATGTCCAAACTAACCTC 
TGATAGTTCCAGGACATCCTGGTATGTCCGCCTGCAAATCAGTGGCCATGTGAAACTAATTTAACGATTCTGATTTGCACTTACCAATATATGAA 
ACGGTTTCTGGTCTGAACATTTTTCGTATTCTTGTAGCAATAGCCGCCGGAGGTTTCGAAGGTTTTTAAATCCTGCAAAGTGCTCTGGGTTCCTA 
CATTTTGCGGAGGAGCATTTGGTTACTCAGAATTTCTCTGGCAATCTTGCACGAAACTCACCCGTGTTAATTTTCACCTCGATCCACTCCGCCGG 
ACTGAGGTTCCGCGGTATGACCTCGCGGTAGCTCATGTTGGCTTGCAGTTTGGATGTGGGCATCTTGCTAGCACAACAAATTAATATAAAAACCT 
CAACGAAAACGGAAAATTATTGGCGGCACCGGCGTAAACACGCTCAGTAGTAGTGTTGGAAAACGCTTACGCCAGCGGTGCACACCAGGCAAAAT 
AGTCGACATAGCTTTTTATACTCACATACAAAAGTTAATTACAGAAGTTAGTTAAAAATTGTACATTTTCCTAAATGTAATTTCATGACAATAAA 
GTTTATTATAGCAAAGCAGGAATGATATAAGATGCCTTTAAATCTTGTGATATTAAGAAAGTTGTCGCTAGCATTTTAACAATTGCCACATATTG 
GTCACACTGCATTGCACGTGTGTTTCCATTTGTTGATGCGGCTTTATTATTTAGTTTATTTTTTAAAATAAAAACTGTCTCTGGCCTAAAAATGA 
GTGAACAGCAGGATGAGACTGTAGTCTGCCTGCCCGGGGAGCGACTCTGCAGAACCGAGGACAGCATAGTCCTGGGCATAGGCACCTACGAGCAG 
AATGGCTACATTTACGCCTCCAAGTCTGGAATTGTCAACATCGAAGATTCGGGTGATAAAGTAAGTAATCCTGGCATTTCGTTGACTCTGGGAAC 
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CAATTAATCTCCATTTCAGTGCCAGGTGGTTAGTGTGCATAAGCCCGGATTTCATCTAACCATCCCAGCCACCGGAGACGTGGTCACAGCCAGAG 
TTCTCGTCACAACTCCAAAATTCGCCAAGTGCGCCATTTTCTGCGTACGAAATGTGCTGCTGGAAAGCAGCTACCGCGGACTTCTGCGCAAGGAG 
GATGTCCGGGAGACCGAAAAAGATCGCGTCGATATCTACAAGTCCTTCCGGCCGGGCGATGTCATCCTGGCGAGGGTCATTAACCAACTGGAGCA 

GTCCTTCCTCC^ 

GC T G G AGC GAG AT GCAGTGCCC C C AGAC C AC GAT C AAG GAG C C AC GG AAAG T GGC T AAAG T T C T GC CT G AG AGT T C AAT AA 
(SEQ ID NO: 496) 

Exon: 6111.. 5952 
Exon: 5887.. 5785 
Exon: 5731. .5240 
Exon: 5182.. 5001 
Exon: 4927.. 4518 
Exon: 4444.-3778 
Exon: 3712.. 3607 
Exon: 3538. .3377 
Exon: 3321.. 3079 
Exon: 3023.. 2905 
Exon: 2849.-2235 
Exon: 2169.. 1648 
Exon: 1591.-1194 
Exon: 1120.. 1001 

Start ATG: 6048 (Reverse strand: CAT) 
CGGTGCCGCCAATAATTTT^ 

CTACCGCGAGGTCATACCGCGGAACCTCAGTCCGGCGGAGTGGATCGAGGTGAAAATTAACACGGGAACCCAGAGCACTTTGCAGGATTTAAAAA 

CCTTCGAAACCTCCGGCGGCTATTGCTACAAGAATACGAAAAATGTTCAGACCAGAAACCGTTTCATATATTGGCGGACATACCAGGATGTCCTG 

GAACTATCAGAGGTTAGTTTGGACATCTCCCTGCAGAGGAATCACCTGCGTCTGCGATTCACCGACTCCGCTGTGCTGAATGTATCACTCACGGA 

GCAAGGTACATCCGTCACCCTGCTGGTGGTCACGGTCAGCAGTGTGCATCGCTATGTGTTTCCCTTGAAGGTCGCTGGGCAGGAAGGTGGCGCAG 

CTTCTCCAGAGGATCTGCTATCCCAGTCCATTTTCTACGACGTCAATGACAAGATCAATGATCCCAGCACTTACTATGTCACGGATGGCTTTGGA 

ACTATGCCCAACGCGGCTGTATCGTACCTCACTCAAGACTCCCAGTCAGCCTACTTTGCGGTGGCGTATCAGAGCAAACTACTGCTTCACGTTAT 

GAAATGCTCCACCGGCCATACTATCACCCACGAAATCAAGGAGCCGAAGCTAATGCCATGGTTTCTGTCGAATCTCAAGGGTGCCCTCACCGGCC 

GCTCTGAAACGTTGGAGGCAGCCACGTCCATGGCTTTCAGTGAAATCGGGGGGGAAATTTTCATTCTGGTTCTGTATCGCAATAATGAGCTGCGT 

CTATGGTCAGTGGATAACTTGCAGACTGTAGCCAGCATTAACTGCTCCACCGAACTGGGACAGGACTCAGCAGCTCAAGGACCACAAAACAGCCA 

GCTCCGAAAAATAAGCGATCAAAACTTCTGTCTATTTCTCTCGCACAATTCCCGAGCTGAGTTTATATGCGTTAGCATAATGCCCGATGCCGACG 

ATGCCAGCGTTATCAATTTGGTGCAGAACATGGTGCCGGCGCCGCAAACGGATTTGGTCGACTTCGATGCCACATCCTCGCACATCTGGGCCCTT 

TGGAGCAACGCTGAAGGCGATTTCCATGTCTCGGCAGCATATTTTGCCTCCAACAACGCAATCAAATGGGTTTCGGCTGCTCTAGAACCGCCACC 

GGATCGGTATTGTCTCACCATGGAGCAGGGAGTGGATCCACGCGAAACTTACTGCAGCTATATATTTCATCCTGGTCGATTTGACCGAAATGTAA 

TTGCCAAGGCTTTATATATGTTCAGACGCGTTAACCTGCAATTCGATGTAAAGCAACTTTCCATGTCAGTGCTCAAGGAGCAGGTATGCCAGGCC 

GTGGAAGATGAAATCCAAAACGAACTGAAAGACTTTGTGGTCACCGATGAGGAGTACTTGGAGATATCCACTCGGCTGTGGGATCGCTTTTACTC 

CTGCTGCGAACAGTATCACATTAAGCTTTCCGAGCCGACAGGACTCGCCGTTCTTGGTGGAATGGATGCAGTATGTTTGGTTAGAAGGCAATCAT 

TCGCATTGCTGCGCCCCTGCGAAGTGCTCGAGCACCTGTTGCTCATCGGCGAGCATAACGATGAAGTGGCGACATATGTGGCGCCACTATTTCGC 

AACGATCCCGAAATGGCCAAGGGTTTTGTGGAGCTCATGAATGTGGTGACACTGCTTGATAAACTAATTTCGGAGGACATTAAAATCGAGCTAGA 

TAAACAGCTTTACCAGCGGGAGTCACCGGTTGAAGTAATATCTAAGCTGGTGGCTAGAATCAGTATGATCGATGATAATGGTCCAATATTGCCAT 

CAAACTGTGTGAGGCAAATCCAACAGAAGCTGCAGAATATACCAAATCTTGAACCCGCTTTGGAAATGCTGCTCGATGTGCTGTGTATGATTGAT 

CCAGATGAACCGCCGCATGATTACTCCCTTTCTACGCGCTTCTTGCAGTCGTCTGGGGCTCTAATGGGTAGCGAATATGGACTTTCTATTTTGTC 

CGAAACAGTGAAGCAAATGGCAATGATACGGTTTTCTGTTTGCCGAAACCTTTTGGTATTGCAGTACATGGCTTATGGTCAGAACGAAATGGAAA 

GCGAAAATGTTTTAACAAATTTAAATTATTTGAACTCATATTATACACTGGTTTGGATTGCCGAGACACCCATTTCATCGAGTACCCCAGCTGGA 

TTCGAGGCATCTATTCAACGATTGAGCCGCGCCCAACTTTTCAGCGGCTATAATCGACCATACTCCAGCCATCTGAAACATAATGGGAACGATCA 

GACCACTCTGCTCCGACTGTTCCTGGAATCGAAAGGTCTGTTCAGCGCCTTGACTATGCTGCTGAAGCATGACAGTTTGTCGCTGGACTCGGAGC 

AATTGAATCTACGCCAGTCTTTGCTCCAACTTGTTGGATACATCAATAAGATGCTTTGGCCGGGATCTCCGATTTACGTATTTCCGGAATGGTTA 

TTTGGGACCTGTCATCACATCATTGTTCAGGATTACGTGCGCATTTTGTCAAACTGGTGTTCCGTACAAAAACATGCCAGACGCTTTATGCTAGC 

CGTATCTCTATTGGATTGCGGTGAGGCCCACAAGGCCGTGCATCTCTTTCACGAGGCGGAGTCTGGCATTGTGGAGGATGACTTCCTCTTTGAGC 

ATGTACTGAAAAATACCCCTCTATACGGCAAATTACAGAATAGCGTCAGCCGTGGCGATACAATTTCCCCCGAAGACACAAAACTGGCAATTGTT 

CACTACTATCTTAAGGTTATCCAGCTCTTTGAACAGCATTCCGCATTGGACTACATTATTCAACTGGCTGACATGGCCATTAGGGTGCTGCAGCC 

AGATGATCCTCAGTTGCCCATGTTCCAGTCAATAGTGTTCAATAATCATCTGCAGTTGGGACACTACGTGGAGGCGTACACCGCCTTAGTCAACA 

ATGCAGATATTTCAAGGCGTAAGGATTGCCTTCGCCAGTTGGTCATAACGCTGTTCCAGAATAAATGTCTGCCACTGCTTATGCAGTTCTCCTAC 

ATCGGGTTGCAAGACGAGTTCGAGAGCATTGTTGAATCCCGGGCACGATCAATGAGCATCGATCAGAACGAAGTCTACAACTTTTTGTATGCATT 

CCATACGAACAAGGGTAACATGAGAAAAGCTTCTACGGTTATGTATGAGCAGGCCATGCGGTTTCAAGTAGACAGCGATGCACCAAACGCGCTAG 

AGAAACGTTGCTCCTCGCTGTTAATTTGCCTAAACTGTTTGCACCTAGTTGACAGCCGCTACAGATGGATTGCCAAGCCTGTTCTTGGTGATGAG 

CAGGTGATTACCATAGATCAGGATAACGACGATGGTGAGCCCAAATGTGACGAGGACAAGAGAGGCCAGGAGGTGGTTGTTCTAGAGCTGGCTGA 

TATCCGAAGGGAACTGGTGCACGCAGAAGCTTTACGTGAGCTTTCCTTCTACCGCAAAGATACAGCTGCCTACGAGCGGGCCACACCCGAGGAGC 

TATCATACCTGCTGGCTAGTTCCGGCCTGTACACCGCGGCATTGAAACTTTCGCGGGGACACTCGTTCTCTGTGCTCCCCATATTCGAGAGTCTG 

ACATCCGCTTGTGTTGCTGCCACTGAGGACAAGTCGTCGGACGCCTGGAACTGGCTGCAAAACAACGACATGGCAGATCTACCTCATCGAAGTAA 

TGCTGCCGATATGGCCTGGACTTTACTTCAAAAACTGGTTGTGGATAACGAGGCAAAAGACTCCACCTTGATAAGAAAGAGCGTGGTTCAAAGGC 

TTCTCGGGTTAAACGCCTTCTTGCCCCAGTGGCTGATAAACTCCTACAAGTTATCCCACTCCCGGGAACTGCTCCACTTACTTGTGAAACACAAT 

CGTCTGCTGGAAGCCGCAGATCTGGGATGTGAAATAATCGCCGGCATGCTGGGAGCAGGTAGCGAGTATTTCGAATTCAAACACGCGGTGAACAT 

AGCAAACCCACAGCTCTGCTTCCCCATCAGCACGATCGATTTGCTGCTGCACGGTTTAAAGATTAATGGCAAGGATGATCTTGATTATGAAATGG 

CTTACTTTAAACTCGAAGAAGAAGTTCAAAGATATATTGAGACCATAAAGCGCACCACCGATGATAAGATGAGCATGGCCGTTCTGCAGATGCGA 

ACGGATCTTCAGGAAGAACGTTAA 
(SEQ ID NO: 497) 
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Start ATG: 64 (Reverse strand: CAT) 



MPTSKLQANMSYREVIPRNLSPAEWIEVKINTGTQSTLQDLKTFETSGGYCYKNTKKVQTRNRFIYWRTYQDVLELSEVSLDISLQRNHLRLRFT 
DSAVLNVSLTEQGTSVTLLVVTVSSVHRYVFPLKVAGQEGGAASPEDLLSQSIFYDVNDKINDPSTYYVTDGFGTMPNAAVSYLTQDSQSAYFAV 
AYQSKLLLHVMKCSTGHTITHEIKEPKLMPW 

DSAAQGPQNSQLRKISDQNFCLFLSHNSRAEFICVSIMPDADDASVINLVQNMVPAPQTDLVDFDATSSHIWALWSNAEGDFHVSAAYFASNNAI 
KWVSAALEPPPDRYCLTMEQGVDPRETYCSYIFHPGRFDRNVIAKALYMFRRVNLQFDVKQLSMSVLKEQVCQAVEDEIQNELKDFVVTDEEYLE 
ISTRLWDRFYSCCEQYHIKLSEPTGLAVLGGMDAVCLVRRQSFALLRPCEVLEHLLLIGEHNDEVATYVAPLFRNDPEMAKGFVELMNVVTLLDK 
LI SEDIKIELDKQLYQRESPVEVI SKLVARI SMI DDNGPILPSNCVRQI QQKLQNI PNLEPALEMLLDVLCMI DPDEPPHDYSLSTRFLQSSGAL 
MGSEYGLSILSETVKQMAMIRFSVCRHLLVLQYMAYGQNEMESENVLTNLNYLNSYYTLWIAETPISSSTPAGFEASIQRLSRAQLFSGYNRPY 
SSHLKHNGNDQTTLLRLFLESKGLFSALTMLLKHDSLSLDSEQLNLRQSLLQLVGYINKMLWPGSPIYVFPEWLFGTCHHIIVQDYVRILSNWCS 
VQKHARRFMLAVSLLDCGEAHKAVHLFHEAESGIVEDDFLFEHVLKNTPLYGKLQNSVSRGDTISPEDTKLAIVHYYLKVIQLFEQHSALDYIIQ 
LADI^IRVLQPDDPQLPMFQSIVFNNHLQL^^ 

QNEVYNFLYAFHTNKGNMRKASTVMYEQAMRFQVDSDAPNALEKRCSSLLICLHCLHLVDSRYRWIAKPVLGDEQVITIDQDNDDGEPKCDEDKR 
GQEVVVLELADIRRELVHAEALRELSFYRKDTAAYERATPEELSYLLASSGLYTAALKLSRGHSFSVLPIFESLTSACVAATEDKSSDAWNWLQN 
NDMADLPHRSNAADMAWTLLQKLVVDNEAKDSTLIRKSVVQRLLGLNAFLPQWLINSYKLSHSRELLHLLVKHNRLLEAADLGCEIIAGMLGAGS 
EYFEFKHAVNIANPQLCFPISTIDLLLHGLKINGKDDLDYEMAYFKLEEEVQRYIETIKRTTDDKMSMAVLQMRTDLQEER* 
(SEQ ID NO: 498) 

Name: KIAA0197-like 
Classification: hypothetical 



Celera Sequence No. : 142000013385213 

CACAAAAAGGCTTCATCTTGTAATCTATAATGGATGTGTTTTAATTTATTATCTTATTATCGATATGTTCGGGAATAAAATATGTATAATTTTAT 
ATATACTAACTATTATAGCAACTATATCATAACTATAATTTTCCCTTGGCGAATTCCAGTGTGATGGTCCTGGGATCGAAATCAGTTTTATTTTA 
ATACACCGAGTTAACCACAAAAGTGGCATGCCCAACTCATCTAATGAAAGTTTTTGCCCATTCGGGCAGATTATTTTACATTTCATTAATGTGTC 
GAAAAAGGGAAATGCTTTCAATGAAAGCTGCTCCCAGGCCCAGCTGCCTGTGTTCCCCATCATCATCATCATCATCCTCATTCTCATCGTCGCAA 
CCATCATCATCATCATCATCTTTGGGTCCCGGGAATCTGGAATCTGTGTTATTGATCAGTGCTGCGCAATTAACTTAGCAACTTATGATTATGAT 
TATGCTGCCGAATGGGTGGTGAGTACGAGGCATTACTCATGTATGGCGGTAAAACCAACAGGTTGACAGGACCTCTGTTTCGTTTCCATTCCGTT 
CCAATGCACTGAGCATATTTTGGAAACTGTGAAATCAATATATTTGATATTAAATATTATCTTATGAGGTTATTTAATGTATAAAATGTGGCTTT 
TACTTAGAAGTGTGCAACTAAAAAATTATTTATATAAGTAAGTATAAGTTGTTGGAAATTACTTATACGTTTATATAAGTATATCGTTCAAAATA 
TATATAGGAAAGATAATGATATTAAAACGTTGTTTTTGCTTGCAGATTTCCATCGTTTGCTGCGTGACAAATGCACCTTATTTATTTCTCAGTGC 
AGGGGTTGCTTATCCACCACTGCGTCGGTGGAGTGGGCAGCTGTTTCGCTGCCGCCGCCGAGCGATAGTGGTGGTGGTGTTGGTGGTGAGTTGTG 
TTGCAACTTCAACTTGTTCCTGAGGGCCGTGGACGTCGATGTCGGCGAGCAGAAGCAACCTGACTCTGACTCGACTCGGTTATTCGCTCACCACT 
CGCGCTTGAATACGGACGTGTTTTCGGTTCGAAGTCAAAGTCAAAGTAGAAGGCGAATCACAACGGATCGGTCGAAAGTTCAGAAACTCGTCAAT 
TGACATAAATTGAGTGCAATTTGCAGTGTGCGAAATGTGTGTCTAAGTGAAGCCAATCGACCAGCCAACAATTCGCCAGCTTCGCCGCCTCTTGT 
CGCGTGTTTCGCTCAATTCAACTACTTAATTAGTAGACATAAACAAATCGTGCTTTGAGTCCGACATGGCCGACGAAAGTCTGCACACCGTGCCC 
CTGGAGCACAACATAGACTACCACATCGTGACGCTATTCGAGCGCCTGGAGGCGATGCGTAAGGACTCGCACGGCGGTGGCCACGGGGTCAACAA 
TCGCCTGTCCAGCACCCTGCAGGCTCCCAAGCGGAGCATGCAGGCTGAGATTCGCACGCTGGAGTTTTGGAGGTAAGTCAAGCCTGCCCTAGGAC 
AAATACTTATATAGATATACGGGATTAACTATGTTATTTAATGACTGAAATTCGGAAAAACGTCTTTTGCCAAAAACTTGATATTTACAAGCGGC 
ATTTTTGTCATAACTTCGCTAAAACTTCTCATATTGATGAAAGAATAACTGTTTTGAGCAGCTATTTACCAGAGCTATCGAACCCTATCACTTTT 
TGAAGGATTTAGGAAAATTAATTTTTGGGTGAATTTTCACATTTTTTGTAAGGGGTAACATCATCAAAATTTGAAAAAAAATGGCAAAAAAATTG 
AATTGTCGTTTTTAAATACAGTTCGATTAGGAATTTAATTGCGAGTTTAACGAGGTATAACATTCAATATTCAGACAACTATTTTAAATGTTCTG 
ATAAAAAAACTGATAATTTTTGTGCAGAAAATTAGAGTAATGATTTGGGAAAAATGTCATTTTTACTAAGATTTTTTTAAGATTTTAAATAAAAC 
CACAATCTGTTGTGTATGATCAAATACTTACAATTAGAGAGGGTAAAACATGATTGTATCTCGTATGCATATTAACTAACCCTTTTTAATCTAAA 
CTAAATCTTCAATCTAAATGGCGGGGTTTTTTTCGCGTGTATATCATCTTCGAACCTTTTGTTTTTGTAGTCATTATCTGCACTTGCTTTCAATT 
TGCTCAACTTTAGATGGCAATTGTTGGATGTCAGCTGGCTTTGTCTGAACGCTAAGTAATTGAAATCCAAGTGTTCTCCAGACGCACATCATCAA 
GATAATCGCCAAACTAAACACGGCCAAAGTTTCGAGCACTTTGTGGCCCATACAAGGACCTTTTGCTCAGGGAACTGGCAAATAACAAACATTTC 
ATCACGAGCTGATGGAGTAATTAAATCTTAGCCAGCAGCATGTCGAGCGCTTAAATTCCGGCTAAACTGCGGGACCCTTTTGTCTCAAGTCCTCC 
GACTGACCATTTAAAATGAAAAACTTGAACTTAGTGGGGAGCAGAGAACTTAGGCAGAAGTTGAGATACATATTTAGTTCCAATTAAAGTGAAGT 
GCCGCTGCCAAATTAAACAGCTGAATACATGATTTTATGGAAATTCATAAAGCCCTCGATTTGTTTGCATTTTCGTTATCGTGCAGCGGACTCTG 
CGCTTTTTAGCTTAATGTGAATCAACTGAATTCGGCAGCATTGCAGCTGCTCATCAGAACGCTTCCCCTAGAGCTCCCCCATCCGCTCCAGAACC 
CACTCGCAGTCCCTCCAGCTCCTCCTCAGCCTGCCCACCCGCCCCCTGCTTAATTGTCACTCCACGACTTGAGGGCAACAAAAAAGAGCTTTGTG 
TGCAACTTGATTTGTGCCGTACAGTAGATACCAGCTAACTTGGAAAAGTCTAGGGCTGCACGACAAAAACTTTGAAACGGATTTCATTTGATTTC 
AGTATGAATCCTTTGGCTCAACACGCTTAGCCTTACTATACTTAAGCGCTGAAATATATGTTTATGCAACCAATATTCACTGTACTGTACACATT 
CCACATGCAGTCGGTGGAGCCACATGCAAACCAAACCCAGTTGCGGAACCATGCTAACCCCGGGCTCGTTCGCCTTCTTCCATTCCTCTTCCATT 
CCAGATCCATCATCAGCGAGTGTCTGGCCTCCTTCATGTACGTGTTCATCGTCTGCGGTGCCGCCGCTGGCGTCGGAGTGGGGGCCAGTGTGTCG 
TCCGTGCTTTTGGCCACGGCTCTGGCCTCCGGCCTGGCGATGGCTACACTGACGCAGTGCTTCCTCCACATCTCGGGTAAGTCGAAGGAGACAAT 
AAACAAGCGCAAAAACATCAACAGAATGGGGCGACAAGTGTCTCTTTTCGGGTTGCCCATTGGAATTGCGTGGGAAACTGCCGTTTTCAAGGGAG 
CCCCATGACCATAACCAGCCAGCACACGGAACCCCCTTTTTTTTTGTTTGCTATACTTTCGAGAAGGCAATGCACTCGGTGCGGCTCCACCTACT 
CCAGCTCCTTGGAATCTTATCGCCCCCGGGCCAAGTTCCACCACTTCTTTCTTTTTCCCTTATTCCCTTGCATTTATTCTGGTATTTATGTATTT 
GTTCATTTGCGTTGACATTCAAGGCAATTAAATGTCTTTACAAGTAAAAAAAGAAAAAGAGAATGAAAACAGACCAAAGACCAAAGTAATGTAGT 
TGGGACCCCATCGTGAAAAGCGTCTTGAAACATGTGTTCTTGATTTTCCTGCAACTCCCCCATTTTGTGTTGGCTTGCAATTCCCCGCCTTTTCA 
CAGCCAATTGGTTGTGCATTAAGTTATGATGTTGTTCTATACGAAATGCCTTCCGAAGAAGTGTCAAATTGATATTAAATGTTGCTCCCCTTTGG 
GTGGCCACCACCAGAGATTGACGTCAGTATCTAAAGATATGCTTTCCCCTTCATAGGCGCCCACATCAATCCCGCCGTAACCCTGGCTCTATGCG 
TAGTGCGATCCATATCTCCGATCCGGGCTGCCATGTACATAACCGCACAATGTGGCGGAGGAATCGCCGGAGCTGCTCTGCTTTATGGGTAAGTT 
TTTTCGTATTTATTAAGTTACACATATCTTAAGTAGACAAATCATTGGTTTGAAAACAAGTAAGCCAAAATCATAAATAAACATTGCCCAATAAA 
TGGAGTTTAAAGCGATATCCTTTTGAAAAGCCTCACATGAATAATTTAACCAATTGATTGAGCAAAGCCATTGATAGTCGAGGCTATCTATTATG 
CACATCAAATCCCCGCATTAAAAGCCTTATGTTGTCGCATACAGCACGCCAAACTCAATTGAACAGGAATTCATTTTAAATTTTCGGTGTGCAAG 
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AGTGTATTTAAATCTGTGTATATTTGCACGCCTGACCACAGCAATTGCTGCATTATGCGAAGGACATCGTGCATCCTGCGTGTGGAATTGCCCAA 
ZUVGCAAACTGAAGAGTCTCAAATTAAATCCAACAGGCTGTCCAAAGAACCTAGTCGTAAATCAAAAGATTGTGGGACAGGCACGGGGAGAGAGGA 
CAGAGGGATACTAGGATAAATCGGATAGAGACAGATGAAACGCACTTTTGGGCGATGACTGTACAACAAGGACTCACGACAGGAAGCGTGAAATG 
CGTAGAAATGGGGAATCCTACAAAAGAAATGTGCAGGTAACCTAGGACGAACACACACATACACATGCATATGCACAATCACCTGCAGGAAAAAG 
GACGCAGGTAGGCTAGGATTCTCTATCTGGCGAAGAAACATCTGCCCCTATGCTCAGATGTTTGCGTAGTGATTTCCCGCTTCGAAAATGAAATT 
TCTTTGTGATGCCTAGATAAAGGACCCTCTAGGCCAAGGATCTACACGCACACCAGCTAGCCAAAAATCTGTCTGCCTGTCACGGTTTTTCCTTT 
GTGTAACCAAAAATCGCAGGACGAAGGACGAACGAAACGGGAAAGAGGAGTATATATGTATATATATCTGCGCATGCGCACCATTTTTCAAATGT 
CGCTCCAGTTTTTTGTTGTGCGTTGAACTCCTTTTGGCAGCAATGTTAATGTTAATGCCTCTTCCCGATTGTCTGCTCGACATCTGTTTCGACCC 
TCGAAGAGAAGCCTCTTTTTAAGTGCGTTGGCATCTTTACTTTCTACAGATCCGATCCAAGAATCAGGAGGCGGAGAAAGGGACGAGGAGAAACA 
TGTTTCAGCTGCTGCGATCAGGTTCGGGGTTTTTTTTCTTTCTTTCTACTCCTCCGATATGGATTTCTTAGGGTCCAACGAGTTCATCGAGTGTT 
TTAGTTGGAACATATGTTTACCTAAAAAAAAGGAGATTGGGGGAACCGCTCAGCTCCTTCCCCACAGCTGCCTGTGTGTGCTGATTAATTGGTTT 
AGATTCAGCCCATATAGCATATATATGTAGGTTATAGCTGCGCCTGGCGATTTCTCCGAAATCGTCGTGATTTTATGAAGGGTGTTAAACTTAAT 
TTATCCATGATTTTCTTAAGCTTTGAGCCAGTAACCCTTACACATAACCATATAATAAGCAACTTTAGTTGGGATTAGCAAGGGATTTATAGCTT 
ATATAGTAAAGATAAAAGATTCTATGTAGAATGTTTGGAATGCAGGAAAACGATTAGTTCCCTCGCTGCCAACAATGTGTGACAAAGAGCATTTT 
TAAAAAATCTAGGGGCAAGACACCCACTCTCCAAGGACCTGCCACTTCAGCTGCTCCTCCACTGGCTTCTCCACTTTTCCGCTGTCACATGTCAC 
TTGGCACTTACCCTAATTGCGATGCAAATGCCACATGTGGCCAAGTGGCTAAAAAAAAGTACCACAGAGCTAAAACCGAGTATGAACTTCATTAT 
CTTATGTTAAACAAAGAGCGATTGAAGAAACATTTGTGTACTTGGAATGGGTAATATGCGGGGGCGGGGGAAACGAACCTGAGTATCGATATCCG 
GCTCAGGAATATTTGAATAGTCTTTAGCTGCTCCTCGAATTTCAATGATTTCTTCCCGTTCTCAATTCATATTATTATTATTATTATGGCCTTTT 
TTTGGTGGTACGAAGAACGCACGCCGAGCACGTTCCAAAAATGCGAGTAGCCTTTGAAAATGTGGAAAGATAAAAATCAGGTAGAGGCCGCCGCC 
AAAGGCAGCGGGGCAGCCAAGCAATGCAAAGAAACCAGAAAGCAGAAGAACATAATTAACATTTCCCTTTCGGATGGGCCTTTGTGCCCAGGTTC 
TCCTGGCACTGCCGGCTACTCCAGGCTACTGCGCCTGCGCAGGATGCGTGGTGCGGAAACTTGAAATCAAACACGTGTCCGCTTCTGCGATCTGG 
TTGGTCATTCGAGGAGCAGCTGGAAAAAAAAAGTGATCGGTGAACACCTGCTGCCCGAAAACAGAAAATTACTGTTGCCGAGGATGCACGACGAA 
T GG AAC GGAG AAC AGG AT TT C AAAACG AAC AAAT T T AAAT T T C AAGC AGCT AAC T GC AAC AAAGCG AACT GG AC AACT C AAC AAAAAAAG AG AGT 
GAGATCTGTACCTGCACTGAGAGAAAATCAATGGTCATTACTTAGGGTCAATGATTTGCCATCATATTATTTAAAGTCGCTCTAGGATTAATCAC 
ATTACATTTATGATGGCTGATTTATATTCTATAATAATATCTGAATAAGATTTTTTTATTTTCTCAAGGCAAAATCAGAAACCCTAACTTTCTAT 
TCTTCTTGTGTGCCTCGTCGATTTGCATCTAGCATGGAAATGCACTTCCTGCCGTTGAAGGGATGGTTTTTAGAAATCGAAATCGTAATCAATTC 
GTGTCACATGGGCAATTCAATTACTAAATCCGTTTGCCAGTGGATTCCGTTCCCCTTTCTTTATTTGCTACTACTCACTACTACGGCAGGCTTAT 
CGCACTTGGCTTGGAAATAGCATCCTGCCAGTGTTTCCGGTTGCCTGCACTTGGTGGCCCCCGCATCCTCACCACCATCCTCATCAACAATGCTG 
CACAGGTGGTCCAGTTATAGAGTTATGTTGCCGGCATTCAGAAAAAACACCTGGAAAGCGGGGTCAAAAGTAGTGCATTTAAAATGAACTTTTTA 
TATGGGATGAACTCAAGGAAGCAGGATCAATCCTCATTGTTTCTACAAATCGAAAGTTATAGGTATGATGAGTTCTGATTATTGTTTAATTTCAG 
TGGTTCTAAAAATACATCTAAAGCATGAAGAGTCTCTAGTTAACTTAGATTTAAGACAAGTGCGAAGCTACTACATGTAGTAACTTCATTTGGCG 
GTCGCATCGGGAATTCAAGTGCTCCATTCCCAGTCCCTGGCAACCCAATGAGCTTGGAAGCCAGCCACAAGCCAAAAAGACCGAAAAGCAAACGC 
AACCAAACCAGGAGCATTGCAAATGGCACATGGCAATCCGCAGCTCACAGTTTGCAGTTCGCAATCCGCAATTCGCCTTTCACCCGGATACGCGG 
TGGCCACCAGTGCGAGAAAATGGGCAATGAGTCGACTTCGCTTGGGGATAGGGATGAGGATAACGATTCCGGTTCACAGTTCAAGTTTGCAGTTC 
GGCTGCCACTTGTTTCTTTTGGCACTTTTTTTTTGTATTCCGCAAACAATTTGCGCAGGAAGAGCAGCTGAAAGCCGCCCAACATTCTGCCATTT 
CCGGCTACTTAGTTGCTTCCTCCCCAGCGCTAACAGCTGAGTGTTTGTTTATCCCCGTGGATGAGTGCCTGTTGGCCACTCGTGCCATGTGAAAT 
TAGCACTCTATGGATTTATGAGCAAACAAGGGTCACTTTCTATTCCAAACCAACCGAACATATCGTTTCGAAGCGATCAAATTTTAATCAAGACT 
TGGGAGCAACTGTGCCCAGGAATTTCGTAAGGCAGTCGAGGATTAAGTCCTACTAATCCGTTGCTGCCACAGAGCTGGTGAACCTTACGCCCACC 
ACAACTCACCTCAATCCGCAGTGCAGCCGGAACTTGCAGCAATTTCCTATTTGCTGAAGTACTTTTAAGTGCATTCCGAAAGCACTTATTTCCGG 
ATTTTCAAAGTGGGCGGATTCGGTGTGGTAAAGGATTGCGGTATGTCTATATAAATACCCACTCTGCTCCGCATTGCAAAATGTTATCTCACGGA 
GATTCCTGTGACAACAATCAACTTCTGGAAAGCCATAATATATGGTGCCCTCGAGGAACTTTGCATCTGCATTCGGGAAACAAATTATCCGGCAG 
TTTCAAATCGCAAGGATTCAAACTCATCTGGCCCAAAAGGTGCTCAACAACTTTGGTAATTGAAACTTTGACAAGCTAAATGATGTTGAGGCAGT 
TGCCGGGTAGCGCAGATTAATGGAATTGACTTTGGCGGGGTGTGGATGGTTTGGAGGGTTGATTGCCGAACTGTCAGGACGATAAATAATAAATT 
AAACAAATTGTCATCGGCGTTTAGCTACGAAAACTTTGTGCTCCGATATCGAAATGGTGCGGTTTTTGTACAAGCAACTTGCAACTATAAATATT 
TATATACTCTATATACTATATACTTATAACTCCTTTCCTTTCATTTATTTTACAGCGTAACTGTGCCTGGTTACCAGGGGAATCTGCAGGCCGCC 
ATCTCACATAGCGCTGCTCTGGCCGCCTGGGAAAGATTCGGTGTGGAATTCATCCTCACCTTTCTGGTGGTTCTGTGCTATTTCGTATCCACGGA 
TCCCATGAAGAAGTTCATGGGCAATTCGGCCGCCTCGATTGGATGTGCTTACAGCGCCTGCTGCTTTGTGTCGGTAAGTTGACTTCCCCCCTCTC 
CCCAACTCCAATCTCGCAACCCAGGAATGCCGGTCAGAAATCAGAAATCAGCTAAATGAAAACCAGAACCCGCCAAAACCGAAAACGAGCCGAAA 
ACAAACAACTGAAACGCACCTGCCGACTGTGACGCTGTCATTTTGATGTTGGTGCAAAATCGCAGCATGACAACGGACGAGCTGGTATCAACTGG 
GTAAACTCAGGTAGAGACTATCCCACAATCGTGGGACGCTGGAAGGTCGAGCTGCTGGGTTGCAGGAGATAATTAATTTAGCAAAGGAACTCCGT 
GGACGGAGCTGCATTGCTGGCATAACGGACGGACAGACAGACGATGGCGACATCCAAAGAGGATGCCATCACACGGCCAAGTTTTCGAATGGGGA 
ACCGATTAAAGTCAAGTTAATATCGACGCAGGATTGAATTACGATTAAGGAAAATCATGGATAAAGCGGAGTTTAATGTCCTTAGGGTTCCACGA 
GTATGCTATACAATGCCCACTTTAAATTCCTTTGAACTCCTCAAGGATCATTTAAATTAAGTTATTTCATACTTAACTCTTCGACTTAGAAACTG 
CTGCGAACTTCTTAAACTCTTCAAATTGTTTAAACAGTTTACTCCCCGCTTTGTTTGCCAACTTCTTTCACAAGTATCCTTTGATGAAGAATATT 
CCCAGCTAGAAGTTGACACAAACGACATTGACTGGCTGACTTGTGCAGGCCGTGAAGTACTTTCCTTGCTTGGCAGATGTCTAACCAAATCCTTC 
CTCCTCCCTCTTCTCCTTGCAGATGCCGTACCTGAATCCAGCCCGCTCCCTGGGTCCTTCGTTTGTGCTTAACAAATGGGACAGCCATTGGGTGT 
ACTGGTTCGGACCACTGGTGGGCGGCATGGCCTCTGGCCTGGTGTACGAGTACATCTTCAACTCGCGCAACCGCAACCTGCGCCACAACAAGGGC 
AGCATCGACAACGATTCCAGTTCGATTCACTCGGAGGACGAACTGAACTACGACATGGATATGGAGAAGCCCAACAAGTATCAGCAGTCGCAGGG 
CACCTACCCGCGTGGCCAGTCCAATGGCAATGGCGGAGGACAGGCGGCCGGAAATGGTCAGCACCAAGCTGCCAACATGGGCCAGATGCCGGGCG 
TAGTGGCCAATGCCGGTCAGGGCAATTACTGCCAGAATCTGTACACTGCTCCGCCGCTCTCCTCGAAGTACGATCAGCAGCAGGAACCTCTGTAC 
GGTGGAACCCGCTCACTGTACTGCCGTTCGCCCACTCTGACCAGAAGCAACCTGAATCGCTCGCAATCTGTGTACGCCAAGAGCAACACGGCCAT 
CAATCGAGACATTGTGCCGCGTCCAGGTCCTCTGGTGCCCGCCCAGAGTCTTTATCCCATGCGCACCCAGCAGCAGCAGCAACAGCAACAACAGC 
AACAGCAACAAGTGGCACCCGCACCTCAATCTTCCCATTTACAGAACCAAAATGTCCAGAATCAGATGCAGCAGCGCAGCGAGAGCATCTACGGA 
ATGAGGGGCTCCATGCGCGGACAGCAGCAGCCAATCCAGCAGCAACAACAACAGCAGCAGCAGCAGCAGCTTCAGCAACAACAGCCCAACATGGG 
AGTGCAGCAGCAACAGATGCAGCCTCCGCCACAGATGATGTCCGATCCTCAGCAGCAGCCGCAGGGCTTCCAGCCAGTCTACGGCACACGCACAA 
ATCCGACGCCGATGGACGGAAATCACAAGTATGACCGACGTGACCCGCAGCAAATGTACGGCGTGACGGGACCAAGGAATCGCGGACAGTCGGCG 
CAGTCGGACGACAGCTCCTATGGCTCATATCACGGCTCGGCTGTCACGCCGCCAGCTCGTCATCCCAGCGTGGAGCCATCACCTCCGCCACCGCC 
CATGCTGATGTATGCCCCGCCCCCACAGCCGAATGCAGCCCACCCGCAACCCATTCGCACGCAGTCGGAGCGAAAAGTTAGTGCTCCAGTTGTGG 
TATCCCAGCCGGCAGCGTGCGCCGTGACCTACACAACCTCGCAAGGATCAGCGGTGACAGCCCAACAGCAGCAACAACAACAGCAGCAGCAGCAG 
CAACAACAGCAGCAGCAGCAGCAACAGATGATGATGCAACAACAGCAGCAACATTATGGAATGCTGCCGCTGAGGCCCAACTGAAAGCGGCTGTC 
ATGGGGTCCGGTGACAACGCCGCCGCCAGGGATCCACGCCCTCGCCCTCCAGCCCGGAGGCACCATGACCCTGCACCGCTGCAGCCTAGCCAAGC 
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AGAGCAGTTTGTTTTAGTCGTAGTCGTACGTAGTAGCATTCGGGCATCGCATCCAAAGGAAATGTACACAAGTCTCAGCCCCATATAGTCTAGCA 

TAGTCCTAGTTTAGATAGACGCAGCATAGCCACATTGTATTAATTGAGAGTCAACGCGACGAGGAACTCACACCTAGCAGCAGACAACGACGATG 

CGATCTAGCAAAGATCATGTACGCTTAAGGAACTAATAAGGGTTATCATTCACCGAAGGCTAGACTAAACGAGTGATCTAGATGATCTAGACTGA 

AAGACAGGTGCATTTAACTAAGCTACTTCCCTTAAGTTTTGTTCAACTCTTCCAATATTAATTTTGTCAAAAGAAACTTACCTTTTTTTAAAACA 

GCCTTTTGTTGCAACTCTTATTTGTTTTGGCAAATCATTTGGTTATTCATGTTTCTTGTAAATGCAAAACTAGACTGAACTTACCTAMGCATTT 

TGTACGTTATATTTCAAAGTTCATACATATATGTATATGTACTCTTCCCATTTTCCGCGGACATTCAAAGATTTTCGTTTACTTTTCGATTTTAA 

TTTGTATATAGTCCATATAGACGTAAGCTATTCATGTAAATCTTACAACCCTAGGATCGGAAGTCGATACAAATATATGTATATGCTTACTTATT 

AAAATAGTTGTATTTTCTTAGTGATAAACCCGTTTGCTAATAGTTGTTAGAGCCTAAGAGAACAGCGCAAATATATTGTGTATCAAAATAAAGAC 

AAGTTCATCCAAACAAAGAAAGAGTTACGTTCTTATATTATTTTACCAAAAAACTGTAAAATGTGTATATTTTATTATACAGAGGTGGTCAAAAG 

TATTTACACAACATGCTTTCTTTTCATTTGTCAACTAAATATGCGTTTTTGGTTGGCATCAACAGTTGATATAATTGTTATATTATCATTTAATT 

ATGAATACTGAGAATCCTACCTACTATTCAAGGCCTTTTTTATGAAGTTGATAGTCTTATTTCAATATGACCTTGAATACATTTAAATACCTATT 

ATTTTGTGTGGTTTTTTTGAACACGTGAATTTTTAGAAAAATACAAATAACACTTAAAGCATTTTACGTTGAATAGATAGTGATTAATATCAATC 

GTTGGTCAATGAATAGCTTGAATACTATCATAATCCAAATAACCATGCGCTTGGAAGATACAGTCCATGTGGGATTAATTCACGAATGCCGATTC 

AAAGCATCATCTATGGATGGCCTGTTGGTCATCTGTCTCCAACGCCGAGGACATGAAGGAGAAGTACAGTCGAAGCTCGGCCAGAGGAAAGTGGG 

GTAGCTCCTTTCGTACCTTTGCGAAGTCGGTGCCGTATTGCTCGAAGGCCTTGGTAAGACGGATGAGATCCTCGGCCTTCCAGCGATGGTAGCGA 

CATCCTCGAGCTGTCTCAGTCTTCTCCAGACGTCGATGTGCCAGGGGCAGGTCGTATCCGTGACGCCGCAGGATGT^ACAGCGCCTGCTCCTCCTC 

GATATCATAGAGACTGGCCGCATAGTCAAGGTATTCGCGAACAGCCTTGTCATCTTGGTCCTGCAGTTGCGGCGACCACAGGATCGAGGCTCGAT 

GTTGCAGCTGCTGCAAATCCTGTTCCCGCTTCCACGGCGGCACAAAATCCGGCAAGGTGGCTGCCTGTTTTCGCCAGTAATGTATGTCTCGTTTC 

TCCTTGCGT GC CG AC AT C CTTC GT T C AAAAC AAAAAG AAAAAAAT AAAT AATC T AC ATG AAG AT AGC ACC G 

{SEQ ID NO: 499) 

Exon: 1001.. 1497 
Exon: 3140.. 3306 
Exon: 3952.. 4078 
Exon: 8606.. 8813 
Exon : 9618.. 11896 
Start ATG: 1301 

Transcript No. : CT15229 

AGAAGCAACCTGACTCTGACTCGACTCGGTTATTCGCTCACCACTCGCGCTTGAATACGGACGTGTTTTCGGTTCGAAGTCAAAGTCAAAGTAGA 
AGGCGAATCACAACGGATCGGTCGAAAGTTCAGAAACTCGTCAATTGACATAAATTGAGTGCAATTTGCAGTGTGCGAAATGTGTGTCTAAGTGA 
AGCCAATCGACCAGCCAACAATTCGCCAGCTTCGCCGCCTCTTGTCGCGTGTTTCGCTCAATTCAACTACTTAATTAGTAGACATAAACAAATCG 
TGCTTTGAGTCCGACATGGCCGACGAAAGTCTGCACACCGTGCCCCTGGAGCACAACATAGACTACCACATCGTGACGCTATTCGAGCGCCTGGA 
GGCGATGCGTAAGGACTCGCACGGCGGTGGCCACGGGGTCAACAATCGCCTGTCCAGCACCCTGCAGGCTCCCAAGCGGAGCATGCAGGCTGAGA 
TTCGCACGCTGGAGTTTTGGAGATCCATCATCAGCGAGTGTCTGGCCTCCTTCATGTACGTGTTCATCGTCTGCGGTGCCGCCGCTGGCGTCGGA 
GTGGGGGCCAGTGTGTCGTCCGTGCTTTTGGCCACGGCTCTGGCCTCCGGCCTGGCGATGGCTACACTGACGCAGTGCTTCCTCCACATCTCGGG 
CGCCCACATCAATCCCGCCGTAACCCTGGCTCTATGCGTAGTGCGATCCATATCTCCGATCCGGGCTGCCATGTACATAACCGCACAATGTGGCG 
GAGGAATCGCCGGAGCTGCTCTGCTTTATGGCGTAACTGTGCCTGGTTACCAGGGGAATCTGCAGGCCGCCATCTCACATAGCGCTGCTCTGGCC 
GCCTGGGAAAGATTCGGTGTGGAATTCATCCTCACCTTTCTGGTGGTTCTGTGCTATTTCGTATCCACGGATCCCATGAAGAAGTTCATGGGCAA 
TTCGGCCGCCTCGATTGGATGTGCTTACAGCGCCTGCTGCTTTGTGTCGATGCCGTACCTGAATCCAGCCCGCTCCCTGGGTCCTTCGTTTGTGC 
TTAACAAATGGGACAGCCATTGGGTGTACTGGTTCGGACCACTGGTGGGCGGCATGGCCTCTGGCCTGGTGTACGAGTACATCTTCAACTCGCGC 
AACCGCAACCTGCGCCACAACAAGGGCAGCATCGACAACGATTCCAGTTCGATTCACTCGGAGGACGAACTGAACTACGACATGGATATGGAGAA 
GCCCAACAAGTATCAGCAGTCGCAGGGCACCTACCCGCGTGGCCAGTCCAATGGCAATGGCGGAGGACAGGCGGCCGGAAATGGTCAGCACCAAG 
CTGCCAACATGGGCCAGATGCCGGGCGTAGTGGCCAATGCCGGTCAGGGCAATTACTGCCAGAATCTGTACACTGCTCCGCCGCTCTCCTCGAAG 
TACGATCAGCAGCAGGAACCTCTGTACGGTGGAACCCGCTCACTGTACTGCCGTTCGCCCACTCTGACCAGAAGCAACCTGAATCGCTCGCAATC 
TGTGTACGCCAAGAGCAACACGGCCATCAATCGAGACATTGTGCCGCGTCCAGGTCCTCTGGTGCCCGCCCAGAGTCTTTATCCCATGCGCACCC 
AGCAGCAGCAGCAACAGCAACAACAGCAACAGCAACAAGTGGCACCCGCACCTCAATCTTCCCATTTACAGAACCAAAATGTCCAGAATCAGATG 
CAGCAGCGCAGCGAGAGCATCTACGGAATGAGGGGCTCCATGCGCGGACAGCAGCAGCCAATCCAGCAGCAACAACAACAGCAGCAGCAGCAGCA 
GCTTCAGCAACAACAGCCCAACATGGGAGTGCAGCAGCAACAGATGCAGCCTCCGCCACAGATGATGTCCGATCCTCAGCAGCAGCCGCAGGGCT 
TCCAGCCAGTCTACGGCACACGCACAAATCCGACGCCGATGGACGGAAATCACAAGTATGACCGACGTGACCCGCAGCAAATGTACGGCGTGACG 
GGACCAAGGAATCGCGGACAGTCGGCGCAGTCGGACGACAGCTCCTATGGCTCATATCACGGCTCGGCTGTCACGCCGCCAGCTCGTCATCCCAG 
CGTGGAGCCATCACCTCCGCCACCGCCCATGCTGATGTATGCCCCGCCCCCACAGCCGAATGCAGCCCACCCGCAACCCATTCGCACGCAGTCGG 
AGCGAAAAGTTAGTGCTCCAGTTGTGGTATCCCAGCCGGCAGCGTGCGCCGTGACCTACACAACCTCGCAAGGATCAGCGGTGACAGCCCAACAG 
CAGCAACAACAACAGCAGCAGCAGCAGCAACAACAGCAGCAGCAGCAGCAACAGATGATGATGCAACAACAGCAGCAACATTATGGAATGCTGCC 
GCTGAGGCCCAACTGAAAGCGGCTGTCATGGGGTCCGGTGACAACGCCGCCGCCAGGGATCCACGCCCTCGCCCTCCAGCCCGGAGGCACCATGA 
CCCTGCACCGCTGCAGCCTAGCCAAGCAGAGCAGTTTGTTTTAGTCGTAGTCGTACGTAGTAGCATTCGGGCATCGCATCCAAAGGAAATGTACA 
CAAGTCTCAGCCCCATATAGTCTAGCATAGTCCTAGTTTAGATAGACGCAGCATAGCCACATTGTATTAATTGAGAGTCAACGCGACGAGGAACT 
CACACCTAGCAGCAGACAACGACGATGCGATCTAGCAAAGATCATGTACGCTTAAGGAACTAATAAGGGTTATCATTCACCGAAGGCTAGACTAA 
ACGAGTGATCTAGATGATCTAGACTGAAAGACAGGTGCATTTAACTAAGCTACTTCCCTTAAGTTTTGTTCAACTCTTCCAATATTAATTTTGTC 
AAAAGAAACTTACCTTTTTTTAAAACAGCCTTTTGTTGCAACTCTTATTTGTTTTGGCAAATCATTTGGTTATTCATGTTTCTTGTAAATGCAAA 
ACTAGACTGAACTTACCTAAAGCATTTTGTACGTTATATTTCAAAGTTCATACATATATGTATATGTACTCTTCCCATTTTCCGCGGACATTCAA 
AGATTTTCGTTTACTTTTCGATTTTAATTTGTATATAGTCCATATAGACGTAAGCTATTCATGTAAATCTTACAACCCTAGGATCGGAAGTCGAT 
ACAAATATATGTATATGCTTACTTATTAAAATAGTTGTATTTTCTTAGTGATAAACCCGTTTGCTAATAGTTGTTAGAGCCTAAGAGAACAGCGC 
AAATATATTGTGTATCAAAATAAAGACAAGTTCATCCAAACAAAGAAA 
(SEQ ID NO: 500) 

Start ATG: 301 

MADESLHTVPLEHNIDYHIVTLFERLEAMRKDSHGGGHGVNNRLSSTLQAPKRSMQAEIRTLEFWRSIISECLASFMYVFIVCGAAAGVGVGASV 
SSVLLATALASGLAMATLTQCFLHISGAHINPAVTLALCVVRSISPIRAAMYITAQCGGGIAGAALLYGVTVPGYQGNLQAAISHSAALAAWERF 
GVEFILTFLVVLCYFVSTDPMKKFMGNSAASIGCAYSACCFVSMPYLNPARSLGPSFVLNKWDSHWVYWFGPLVGGMASGLVYEYIFNSRNRNLR 
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HNKGSIDNDSSSIHSEDELNYDMDMEKPNKYQQSQGTYPRGQSNGNGGGQAAGNGQHQAAKMGQMPGVVANAGQGNYCQNLYTAPPLSSKYDQQQ 
EPLYGGTRSLYCRSPTLTRSNLNRSQSVYAKSNTAINRDIVPRPGPLVPAQSLYPMRTQQQQQQQQQQQQQVAPAPQSSHLQNQNVQNQMQQRSE 
S I YGMRGSMRGQQQPI QQQQQQQQQQQLQQQQPNMGVQQQQMQPPPQMMSDPQQQPQGFQPVYGTRTN PT PMDGNHKYDRRDPQQMYGVTGPRNR 
GQSAQSDDSSYGSYHGSAVTPPARHPSVEPSPPPPPMLMYAPPPQPNAAHPQPIRTQSERKVSAPVVVSQPAACAVTYTTSQGSAVTAQQQQQQQ 
QQQQQQQQQQQQQMMMQQQQQH YGML PLRPN * 
(SEQ ID NO: 501) 

Classification: known_f lybase_gene 
Gene Symbol: bib 
FlyBase ID: FBgn0000180 



Celera Sequence No. : 142000013385213 

TTTATC GGAAT GGTCAGAT CGGAATTTAAAAAAAAAAACT TTAATTTCT CCCC AACTCAAATTGAAAC AC ACAAAC AAAT GAAT GTACTT AAACC 
CTTTTTTTTTGCTAGCGAGTATGCGATCAAAACTAGAGAAACATATGTACGTTTTTTCCCTTATAATTGATGGCAATCTGGTCCCAATAAATCCT 
TCTCACCCATTTCGCGCCTGCTAATTGCCAAACAAAATGCAATAATTCAACCCAAAAGCACGCTTTCTTTTCCTAGCACAAATACAATACCCCTT 
TAACGGCCGCAATTTCCCTTTTTCACCCTTTTGTTTGCCAGGTCACCCGAAAAGTTTTACCGAGCAGGGTTGTCGCATGACCCCCCGCAACCCAC 
AACCACAGCAGCAATCGCTTTTTATTCTCTTTTTATATGATTGTTTCATATTTATGAGGATTATATGGCTGATGCTGAGTGCTGAGAATCAGAGA 
AAAATTGAGTTGGACGTGTGGTAATTTATTTTTCGGCTAAGGGGACGGACTTGTAGTTAACTGTTAGTTTTCCGAGCTCAAGATCGCACAGATTA 
TAAAGAATTTACTGTGCAATACTAATTAAATTTATCTTTATCATTATTTTACTTAAACGGAATTTGTATAAGCCAAAAAACAAAATTTATTAATA 
ATCTCCAACTAAAAATGTGTATCAATGAAATTATTTTCTTATGGAATTAAATTTCAATAGTTTATGCCCGCCTTTGTATTTAACATCTAAACATT 
AGAATTAAATAAAGTATTTAAAAAAAAATTTAGCGTATAAAAAGCATGGATTGATTCCAAGTGCATTTTACGAATTTGCATT GAAT CAC GAT AAT 
AATATTCACTTGTAATACTTAATTAAATAATTTAATTCGGAACCCATATTAATTTACCGAGTTCGAGGCGGGTTGGCAGTCCGATTGCATGACGT 
AGCAACGCCGCCACTTTCCAGCACTGGCCGATTGCCAGCCAAAATGGCGCCCGCCTGCTATCGCATTGCTCGAGCAGATATCGGTTCGGCAGCGC 
ATCACACACACAACGCGAAAAAAAAAAGAAAAATACGGGTGGCCTGCAAGCGAATATCTCCGAAATTCATTGAATCCCGACGCCGAGATTTGTGC 
TCCGATCCGAGAAACCCATTAGCTTCCGTGCACCGTCCGCCCAAGCCAGCAAAATAACCAAGTTCAACATACGCCTGGACAACATAATGTCGAAG 
AATAAAAAACTGTCGCTGTCGGGCGGCGATACGACGGAGAAATTTATTTACAAACCCAAGGATCTGATATGGTAAGCCGGGACCGTGGACTCATC 
AATATATTCGCTGTATTCTCAGCTGCGCTGCGCCGTGAAGAAACAAGAAAAAAAAAAAAACGAGAGATGAAAATCAAAGCAAACACCTAACCTCA 
ATTGCGCACCGTTGGCGCTCGGAAAAAAAAATATGAAGAAAAAAAGAGCCAGTGAAGTGTGCATAAAAAAGCAATTAGCGCTGCCTTTTTGACAG 
CCACGCAGCGCAGTATCCATGTAGCTCCATCCGAATGACGACCATAATTCCGGAGCTACGTGCGTGGATTCCGGGCGCGCGACGTCGACTTCGCT 
GCGTTGGCGCTTTTTTCGCGCGCGCGCCCTTTTTTCTTGCTTTTCCTCCTTGCGGCCAATATGAGTTTTCCCTGTTCTGATAGCGCATTTTCTTT 
GCAGGGCCAAAATGAAGGGCTTCACTCCGTGGCCAGGAATGGTAAGCATTGCAGTTGGCGATCGGAATGTTTACGCCGCGTCGACGTCGCGTCGC 
TGCCGCCTTCTCTGCTGCCATGTTGTTGTGTTTGTTTTCGACTGTACTTTTCGATTCAACGCTGCCGGCTTCGCTGCCGCCGCTCTGCCCGCAGC 
CACGCACTTGCACACATTGCCCCATTGCCGTTGTGGTTGTTGTTGTTGCCGCTGTGCTGTTGTCGCTGCCGGCGGAGGAAATGGTCTCTTCCATG 
GAGCCGCTATCGGGGGCCACGAGCAGCAGCTTTTGATCGTGTGTACAGCCGCTGGAAAGCAGCATGTGCACTCGGAGGAGGGGGAGTGACGCGCG 
GCAGGCGTGGAGTTTGAACACAGACGTTGCCAAGGATATCAGTTCCCAGTTTCGTTATTCTATGCCGTTCAAAAGGAGTTTTGTTAATATATTAA 
CTTTTATTTGACATTAACATTCTCATCGGTAATCTTGTGATATACCAATCGGTTCTTTAGAGTTTTGTGAAATGTGCTAATGATAGTGAATCACC 
TCAGCTAAGGTTTTATCAAATCAAATTGGTTTTGTGCAAGTTGCTAGAAAAATGAGGATATATTTTTAAATCCTTATAATTCCTAGCTCAAATGT 
AAAGTTAAAGAGTGGTTTTCGATTGAAATTGGAAGTGAAAAAGCAACATTTGCATATGTTTTGTTTCGTGCGTGATAAAGTCTTGTAAATGGGTT 
ATCACAGAATAGATTAGTTCGAGGGAATTCATAGGCACATCCTTATAGAGTAAAATGCATTGTCTGCTTAAAATGTTTCTGAAGACTTTAAAGAT 
ATGATGTGGTGGAAGCTAATGTTGATTTCTCTCCTTAGATTGTCGATCCTCCACTTGATCTGCTCAGCCAGCAGCGTCGGGCGAATACCAAATGC 
GTGTTTTTCTTTGGATCTAGGAATTTGTAAGTAAAATGCACGCCAACCCCAAAAAGCGCCTGTAATAACATGGGTAATATTTAACAGTGCCTGGA 
TTGAGGAGAACAACATCAAGCCCTTCGAAGGACCGTGGAAGGAGGAAC TGGCC AAGGTGAGCAAGCCCGCTGCTTTCCGACACGCC AT GACGGAT 
ATCGAAAAGTACATCGACGACCCCGCCGAGGTGGACGAGCAGGTGAACAAAAGCTGCGGTGCGCCGAATCACGCAACCGAGGCTGATTTTGACAA 
GATCCGGGATGGCCTGGACAGCGAGGAGATTGTGGGAGAGGAAGCAACTGCCGATGGCAACAATGGAGTGGTGGCCCATGTGGTGGGCAGCCCCG 
ATGAGGGCGATGGCTTGGATGTGGAAATCAATGCAGATTCATCAGCGTCACCAGTCACGTCGCCGGCGGTCACAACGAAGGCAGCTGGCAAGCGG 
ACGCCGAAAGCAAAGTCCGTGGCCGCTACATCGGTCAAGTCGACAAAGGGATCGGCAAAATCAGCACAGAAACGGCGTACCTCTGCCCAACAGTC 
GCCCAGTGGGCCATCGAATGCCAAGCGAGGCAAGCGTGATGTTTCCGGGGAGGCATTGCAGGACGCGGATGAAGCCAGCTCCACGCCCACTGGCA 
GACGTCGGGTCGAGACTGACGCACTCTTGGCCTCAATCGCTGCCAAGCGTGCACCGAATGCAATTGCTCTGCTTGACCGACCGGTTGTCACGCGG 
CCAGAGGCCCAAGTCATCGACATGAGCTCGCGCTCCAACACACTGGCCGATCGCGATATCGTGCCTTCAGAGCAGACCTTCGGCTTCCTGGGTCT 
TGGAATGATGGGATCCACCATAGTCAAGGATTTGATCTACACGGGTCACAAGGTTGTGGTCTGGAACCGCACCATTGACAAGGTAAAGTTTTACG 
ATAGCAAGTTCACACATGAATCCTACAACAATCTCGTTTTGTAGTGCCAACCGTTTGCCGAAGCCGGCGCTGAGGTTAAGGACACTCCCATGGAT 
GTTGTGGAAGCTGCCGACGTTATCTTTTGCTGTGTGTCCGATCCAAAGGGCGCCAAGGATGTAAGTCATAAAAGGTTTCATTTATATACATATTT 
AATTTGGGAAATGCCTAAAACACAACGTGGTTAACAAAACTTAAATTTTGGCCTTGGGGTTCCTATAAAATTCAAATGACTCATCCGCGCTTGTT 
AACTTAAAAAGTACATATGAATGAAAGCACGCAGTTTAAAATGACGATTTTCTGACGCTCTCTTTGCAAAGCGATCTTCCTTACACTGTATAAAA 
TTAAAATATATTAGAATACAAAATAATTGTGCAACTGGGAGGGATGGCGTAATTTAACATTCTTCTTATTTATGCATTCGTTTTATTCGATGTTT 
AGCTCGTATTTGGCAACTGCGGCGTTCTGCAGTTGAAGGACTTGAACAATAAGGCCTACGTTGAAATGTCTACAATTGATCCGGACACATCGTTG 
GACATTGGCGAGGGCATCAAGCAGTGCAACGGTCGCTATCTTGAGGCACAAATTCACGGCTCTCGGCAGGAGGCTGCCGAAGGCATGCTCATCAT 
TCTCGCCGGTGGAGATCGTTCAGTGTTCGAGGAGTGCCACTCGTGCTTCAAGACCATTGCCAAGAACACCTTCTTCTTGGGGAGTAAGTACAGCC 
TGCACTGCTGACCATTTCCTTTGAAACAATTTATTAAAATATGTCGTTTGTTCACATCTAGCAGATATCGGTAACGCCTGCAAAGTAAACCTAAT 
TTTGCAAACCATCTTGGGAGTGAGTCTGGTCGGGTTGGCTGAGGCGTTAGCACTTGGTAGGTTTTATTGTATATTGTTAGCAGTAAGCAGCCTTA 
ACTAATTGATCTTTTTTTTTATTGCTTTTGCAGCTGATCGGTTCTCGATCTCTCTGAACGACATCATAGACATTTTCGATTTGACTTCAATGAAA 
TCACCCATGCTGCTGGCTAAGGGCAAAGGTATGGATAGCACCGTGGCTATTTCCTAGTGTTTATCTTATCCTATTTTGAATTCCTTTGCAGAAAT 
GGCCAAGGGTGACTTCAATCCCCAGCAGCCTTTGAGCCACATGCAACGCGACTTGCGCTTGGTGTTAAACATGGCTGAGAACCTGGACCAGTCCA 
TGCCCGTCACCAGCATCACCAACGAGGTGTTCAAGCACACCAAGCGCTTGGGCTACAGTGAACACGACTCGAGCGCCGTATTCGTAAGATCCAGA 
TTTTAACATTTAGTTAACACGACTAACGTTTAAACTATGTTTTATCAATTAGTTTAAAGCGTACACCGAACAGAACGACAGAACCGACAAACGAC 
TGACTTAAAAAAAAAAAACTAAACAGGACAGTAATAGAAACGCATAACCCTCACTTATTCACCCACCAATCCAAACTGACAGAGGCGTTCGTTCA 
CAAGGAGTTCTCCTGCCCTTGTGATTGGCAGCGCCACCACCCCAACCTTCTGCATCTTCCCCAATCGCCCCATATGCAAAGCGTTTAATGCAATA 
TGTCTACGATTCGCCGGGAATCGCAGAACCGTGTCGTCCTGCTTTCAACGCAGGAGCCACTGAAGAGCGCTTGGGTGAGTGTGTTCAACAGCTGG 
CGGATGAAGAGCACAATCTTTCCGGGATCACTTCAATCAAGTTTAGAGTTTTTTTGTATTTTATTCTTTGACGGGTGAAACGCAATCCCCAGAGA 
GGAGGGAGGAGGCCAGGGCGCCGTGTTTCCGATTTTATATGTGTAAGGTTTTGCTTTCCAAATATCATTTTTACGCATGTCCATACATAACTTGT 
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AAGTGTATAGAATACTGCATCCGAATAAAATGTACAAAGCATACTTGTGAAAGACAACGCCGATTTTGAGAAACCGAGCGAAGTATGGAACCCTC 

CAATGTGTGTCGATCCAACCTCCCAGCCTCCCACTTTTTGTGCATTTACTCCAGTCCTAAAATTGTATACATTTAAAATCCCCTTTTTTTCTATG 

AGTAAATAGTAGGCTAATCGATCATACTGCGTAGAATGGCAAACATATTTGGAGTGACGGTAACGATAGATTGGTTTAGTTTTAGCCATCTCTCC 

CCATCATCTGTCACTGCAACAAACGAGACACAACCTAAAACGGAAAAAACGGTACAGAACGCACCACAACCGATACGCAAACGATACGAAAACGT 

AAACTACACAACAACCGGAAGAAAACAAAATAGAGTAAACTTTGATCGAACCCTTTGCGTGGGTCGCTATTAATAATAATAATAAGAATATATAA 

TAAATAATATATTTTACTTATTATTTAGTCTGTATTTTGTTGTAAAACAAAGATTA 

(SEQ ID NO: 502) 

Exon: 1001. .1306 
Exon; 1722.. 1751 
Exon: 2604.. 2686 
Exon: 2748. .3597 
Exon: 3655.. 3765 
Exon: 4088. .4358 
Exon: 4435.. 4521 
Exon: 4594.. 4683 
Exon: 4747 . . 5041 
Start ATG: 1304 

Transcript No. : CT15275 

CCGCCTGCTATCGCATTGCTCGAGCAGATATCGGTTCGGCAGCGCATCACACACACAACGCGAAAAAAAAAAGAAAAATACGGGTGGCCTGCAAG 
CGAATATCTCCGAAATTCATTGAATCCCGACGCCGAGATTTGTGCTCCGATCCGAGAAACCCATTAGCTTCCGTGCACCGTCCGCCCAAGCCAGC 
AAAATAACCAAGTTCAACATACGCCTGGACAACATAATGTCGAAGAATAAAAAACTGTCGCTGTCGGGCGGCGATACGACGGAGAAATTTATTTA 
CAAACCCAAGGATCTGATATGATGAAGGGCTTCACTCCGTGGCCAGGAATGATTGTCGATCCTCCACTTGATCTGCTCAGCCAGCAGCGTCGGGC 
GAATACCAAATGCGTGTTTTTCTTTGGATCTAGGAATTTTGCCTGGATTGAGGAGAACAACATCAAGCCCTTCGAAGGACCGTGGAAGGAGGAAC 
TGGCCAAGGTGAGCAAGCCCGCTGCTTTCCGACACGCCATGACGGATATCGAAAAGTACATCGACGACCCCGCCGAGGTGGACGAGCAGGTGAAC 
AAAAGCTGCGGTGCGCCGAATCACGCAACCGAGGCTGATTTTGACAAGATCCGGGATGGCCTGGACAGCGAGGAGATTGTGGGAGAGGAAGCAAC 
TGCCGATGGCAACAATGGAGTGGTGGCCCATGTGGTGGGCAGCCCCGATGAGGGCGATGGCTTGGATGTGGAAATCAATGCAGATTCATCAGCGT 
CACCAGTCACGTCGCCGGCGGTCACAACGAAGGCAGCTGGCAAGCGGACGCCGAAAGCAAAGTCCGTGGCCGCTACATCGGTCAAGTCGACAAAG 
GGATCGGCAAAATCAGCACAGAAACGGCGTACCTCTGCCCAACAGTCGCCCAGTGGGCCATCGAATGCCAAGCGAGGCAAGCGTGATGTTTCCGG 
GGAGGCATTGCAGGACGCGGATGAAGCCAGCTCCACGCCCACTGGCAGACGTCGGGTCGAGACTGACGCACTCTTGGCCTCAATCGCTGCCAAGC 
GTGCACCGAATGCAATTGCTCTGCTTGACCGACCGGTTGTCACGCGGCCAGAGGCCCAAGTCATCGACATGAGCTCGCGCTCCAACACACTGGCC 
GATCGCGATATCGTGCCTTCAGAGCAGACCTTCGGCTTCCTGGGTCTTGGAATGATGGGATCCACCATAGTCAAGGATTTGATCTACACGGGTCA 
CAAGGTTGTGGTCTGGAACCGCACCATTGACAAGTGCCAACCGTTTGCCGAAGCCGGCGCTGAGGTTAAGGACACTCCCATGGATGTTGTGGAAG 
CTGCCGACGTTATCTTTTGCTGTGTGTCCGATCCAAAGGGCGCCAAGGATCTCGTATTTGGCAACTGCGGCGTTCTGCAGTTGAAGGACTTGAAC 
AATAAGGCCTACGTTGAAATGTCTACAATTGATCCGGACACATCGTTGGACATTGGCGAGGGCATCAAGCAGTGCAACGGTCGCTATCTTGAGGC 
ACAAATTCACGGCTCTCGGCAGGAGGCTGCCGAAGGCATGCTCATCATTCTCGCCGGTGGAGATCGTTCAGTGTTCGAGGAGTGCCACTCGTGCT 
TCAAGACCATTGCCAAGAACACCTTCTTCTTGGGGAATATCGGTAACGCCTGCAAAGTAAACCTAATTTTGCAAACCATCTTGGGAGTGAGTCTG 
GTCGGGTTGGCTGAGGCGTTAGCACTTGCTGATCGGTTCTCGATCTCTCTGAACGACATCATAGACATTTTCGATTTGACTTCAATGAAATCACC 
CATGCTGCTGGCTAAGGGCAAAGAAATGGCCAAGGGTGACTTCAATCCCCAGCAGCCTTTGAGCCACATGCAACGCGACTTGCGCTTGGTGTTAA 
ACATGGCTGAGAACCTGGACCAGTCCATGCCCGTCACCAGCATCACCAACGAGGTGTTCAAGCACACCAAGCGCTTGGGCTACAGTGAACACGAC 
TCGAGCGCCGTATTCGTAAGATCCAGATTTTAACATTTAGTTAACACGACTAACGTTTAAACTATGTTTTATCAATTAGTTTAAAGCGTACACCG 
AAC AG AAC GAC AG AAC C GAC AAACG ACT G AC T T 
{ SEQ ID NO: 503) 

Start ATG: 304 

MMKGFTPWPGMIVDPPLDLLSQQRRANTKCVFFFGSRNFAWIEENNIKPFEGPWKEELAKVSKPAAFKHAMTDIEKYIDDPAEVDEQVNKSCGAP 
NHATEADFDKIRDGLDSEEIVGEEATADGNNGVVAHVVGSPDEGDGLDVEINADSSASPVTSPAVTTKAAGKRTPKAKSVAATSVKSTKGSAKSA 
QKRRTSAQQSPSGPSNAKRGKRDVSGEALQDADEASSTPTGRRRVET DALLAS IAAKRAPNAIALLDRPVVTRPEAQVIDMSSRSNTLADRDI VP 
SEQTFGFLGLGMMGSTIVKDLIYTGHKVVVWNRTIDKCQPFAEAGAEVKDTPMDVVEAADVIFCCVSDPKGAKDLVFGNCGVLQLKDLNNKAYVE 
MSTIDPDTSLDIGEGIKQCNGRYLEAQIHGSRQEAAEGMLIILAGGDRSVFEECHSCFKTIAKNTFFLGNIGNACKVNLILQTILGVSLVGLAEA 
LALADRFSISLNDIIDIFDLTSMKSPMLLAKGKEMAKGDFNPQQPLSHMQRDLRLVLNMAENLDQSMPVTSITNEVFKHTKRLGYSEHDSSAVFV 
RSRF* 

(SEQ ID NO: 504) 
Classification: hypothetical 



Celera Sequence No. : 142000013384043 

GATCGGGTCCTTGGCATTACGGAGAATCTGGTGGACTTCCTCCACTTTGGCGCGCTCTTCACGCTTGGTGGTGATCAGGTTCTTGAGCCGTTCGT 
TGCTCAGCGAGTTGAGAATGTTGTCCATATCGGTGGCTGGCAGTCGGTTGGCAAAACCGTGACGCTTGCGTATTTTGTGGTGAACTCCCTTATTA 
ATGCCCAATAGTGGACGGGAGGTTGGCGAGATTTTCGTTGGTTGTTTAAGCGAATTCTTGGATATGGACGGCCGCCTTCTTGACTTTGTGACCGC 
TTCTCCACTGTGGGAATAGAAGGCCATGGTGGGAATGCTGCTGCTCAGCGTCACCTTGCGCATCTTGCTCTGCGGCGAACCAGAGTCGGCGTCCG 
CTGAAGGAGCAACCGGGGTGGACGTTTCTGGGTGCTGTCGCTTTTGAGGCTTATGATTGGATGCTTCTTCGTCCGATTCGGGCACTTCGACATCA 
ATCTTCATACTACTTCCTGGGCTTTCTGGCCTTTTTGCTGTGGCAACTTCGGAAGATTTCGCTGTTTTATTGTTGGTGTTTCCTGTGTTTCGTTT 
TACTGCATCGTCATTTGACTTGAACACTTCTACCGGGGACTTGGATCTGCTTCTGCTGTTTTCTAGCTTAACTGCTGCTGTTTTTTCTTCATCAT 
ACGACCTCATGTTCTCATGTAGATGGGCAGCAGCCTTCAGCGTCTTGCAGTTTATATCCGCCGACGTGAAGGGTCTTCGCCTATTCCTGGTTTTT 
GGACTGATGATTTGCGAGCTCCTTGTTTCACGAACTTCTGTATTCTTCCTGGAGTTCTTACTGCTAGAAGGTTTCTCGTCGTCCGACTGGGACTC 
TTCAGCTGGGCTCGTTTTCTTCGGTTTAGTTGTATGCACTTGGGCAGTGGTCTTTAGGCTGTGGTTGTGTTTCTTCTTTGATTTTTCTAACATTG 
ATCTTGAATCCTGACCCTTCATTAGCATAGCTGCCACCTTGCTTAGCCTAGTCTGAAAAGATTCTGGCGAGGAGAACGCATCTGGATCCTTTTGC 
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GCACTGAAGGTTCGTGGACTGCGTCTGATGCTATTTCTGGCTCTGGAAGAAGGAGTTCCTTCAGAGGAAGAGGTGGGCGACGACATTGAAGAATC 
CGCTCGCTTTCTTTGGACCCTCGTCTGACGGGGGCTGCTGTTGCGACTATTGCTGCTGCTGTTGCTATTTAGCCGCATGGTGGCTGTAAACAATT 
GGGGCAGCTGCTCCGCCGTGGCAGTCATGACTCCGCCACGGGTTTTTTTGTTCTCCTTGTTGGTTTCCGGGCTGCTGTTCGCACTGCTGCTTGAC 
TTGCCGGGGACTGCGGCGAAGAGACTCCTGCCCTTCCTGTTGGCCGCCACATGGGTCTTTAATGGCAAGACACCCAGACTGTCTACGTCCGCATC 
ATCCTCGTCGTCGTTGATTTGGCGCAGCCTGGATCTGGGACTGCCAAACAGTTGCCGCTTGCGCTCCGAAAGGCGCGGAGTGGCCATGCGACTGG 
GACGCCCACTACCCGTCGGGGTCTCCATGATCGCGCGCAAATCTTTTATTTCGCATTAACTTTGTCCAAATTGTGACTCCTTCGCGCTTTTTTGT 
AGCTGCCTAGTGTTGGGGGTGGTTGGATCGTTCAGGGTCTCTTCGGTCGTTTGCCAACACATGCCAGATCGTCTATGTTTTTACTGCAATGTTTA 
TATATATTAAATGCATTACATTGTTTTTAAATTTTTTTTACATCAATGTTTATAAAAATCAAATACATTGCAATGTAAATATTTTCTGTTAGTTA 
TAATTTTTTTACCACATTTTACATTTAACATTTTTCTTCGAAGCGATCAAGTCAGCTTGCAGTTTTTCGTAAATTATTTGGACTTATTCTACGAT 
TTCTGTAAAATAAATAAGTAATGTTAATTATATTCTTGATCCAAGACAAAGTATTCAATATAGTACCACTTAACGAGTTACGATCATGATCGCTA 
GACATTCTATGTTTTCTAATCTTCATATATTTTAAAGATTTCCGTATTATTTTAACTATCCGGTATTTTTACAGAGCTAAGGTGATCATATAACG 
GGAAAACTCTATCTGGTATTTCCACCCAAGTCAGACTCAGTCACACTCTTCCAGACGTCACACGGTCATATTGCGGACGAAGTGCGAATCGATTT 
GGAAAAATCTGCTGCTCGGCCTGGAAAGTAGAAAGTATAAAGCTCTGCGTTTATAATTTAATTTGCAGCCAAAGCCCGCTGCCGATCAGGTAAGT 
TAGCTCCCCCACATTATCCGTACCGCCATTTGAAAGCCACGCCAACGAGAACCGTGAGAAAATGTGAACTCTAGCCGCCGCCATCTTTTTTTATG 
TATGCGTTAGGCCACAACGCCAATAATGCGATTTAAAACCGAGTTTATTAAGATTACTCGCGCTCCTTGTGCTGCTGCCTTAATGTTTATCCTTT 
GCTCGTATTGGTTAAAATAGAAAACAATTGCACTGTCTTCTCTCTTCTCTTGAGGTATCGCGTTTGTTCGTCGCCGTTGTCGTCCTCCAACCGCG 
GTATATACATACAAACATATAGGCAAATATATATAAATATATTTTCGACGCAGCCGCCGCCGCCGCCGCCGCGTAGATATCCATTTTCCGGGATT 
ACCCCACCCAGAGTGATATAAACGATATAACCGTGGACGACCACGCAGCGGAACCCGCTGCCGCCATGCCCGGAGCCGGCACGTTCAAGTTATTC 
ATCGGGAATCTTGACGAGAAGACACAGGCCACCGAGCTGCGTGCACTGTTCGAGAAGTACGGTACCGTCGTCGAGTGCGACGTGGTGAAGAACTA 
TGGCTTCGTTCACATGGAGACGGAGCAGCAGGGTCGCGATGCCATACAGAATCTGAACGGTTATACGCTGAACGAGTTCGCCATCAAGGTGGAGG 
CGGCCAAGAGCCGGCGGGCGCCCAATACGCCGACCACGAAAATCTTCGTGGGAAATCTAACGGACAAGACGCGGGCGCCGGAGGTGCGGGAACTG 
TTTCAGAAATACGGCACCGTCGTCGAATGCGACATTGTGCGCAACTATGGTTTTGTCCATCTGGACTGTGTCGGTGATGTGCAGGACGCCATTAA 
GGAGCTGAACGGTCGCGTTGTCGATGGCCAGCCGCTCAAGGTTCAAGTGTCGACCAGTCGGGTCCGCCCCAAGCCGGGCATGGGCGATCCGGAGC 
AGTGCTATCGGTGCGGAAGATCCGGGCATTGGTCGAAAGAGTGTCCGCGACTGTACGGCAGTGCCGGAGGCGGACGCGAGCCGCCATCGCCCCTC 
AGCGCCGGCGGCTACAGAGATCGCATGTACGGTCGTGACCCGTATCCGCCGCCACCACCGCCGCCGCCGTTCCTGCGTGACCGCATCATGGATGG 
CTTTAGGGTAAATTCAGCATCCATTTGCCCCCCCGATCAAATCATCCATTGTGCCAAGTATTATTCACATTCTGTGCTGCTACTAATCTAAATCT 
TGTTTTCCCGCCCGCAGGACTATGACTACTATGACCGCCGCTTCGAGGACTCACGCGATCTGTATGAGCGTCGCTATCAGACCTCGCGAATGCGC 
GACTTCCCACCGCCGCCCATCTCTCGCCGCGAACCCATGCCTCTACCGCCCACCTTAAGCGGCAGCCTGCGCTCCTGCAGCGTTTCTCGTGGATA 
CGACACTATGTTCAGCCGGCGCTCGCCTCCGCCGCCGCGTAGCAGCAATGGAATGAGTCGCTACGGGTAGGTAACCCTGCAAGTGGATGAAATGT 
CGCACTTTCTTTTCCAAATATGTGTGTACTAAAGTTAATTCTTATCAGATGACTACGTTTTCCATACTTACTTACATATTAAGACCTAGATAGTG 
CTTTGCTTATATGATGAAATTATAAAATGAAACGATGAAAGATAACATTCGTATATCTCCATCGAACCTATTTTGAAATCAGTTTTTAATGAATT 
ATTTTCTATCTCCTCCCAGCTCGCCCACTCCACACGGTTACGAGGACTTTAGTCGCGATGCATTCGACGAACGCATGATTTCGTCGCGCGGAATG 
CGCGGTCCATCGCCGCCGGGTCGCAGATATGCGCCCTACTGAGATGAGTTTTACTATCACGGCAGCAGCTGAGTAGCGGTTGCACTACCCATGCC 
CCTGCCAATGGGTCACACATTCTACCACACGACCAGCAATCGAAGAACGGCGCCTGTGGACTGCTGGCTGCTCTGCTGGTCGCCAGCGGCGGCCT 
ACTACCATCACCACCAAGCACACCAGCAGCACATGCTGATGCCGACGACAATTTCACAACTGTGGCTGCCGCCTTGGTAAACCACATACATGTTA 
CTAAACGCAAACTAACCATTACTCGTACACTTAAACAAGAAGATGGAAATACGTGTTTAAGCATCCTCCCTATATGCAAAACACGCCAAAAAAAA 
AAAAGAACTAGACAACGACGTGCAACCTCTACACATTCAACCGTAGTACCAATAAACGCATAATCAGTCGAAACTGTGCAAAAATAAACACTGTA 
TTTTAAAGCTAGTTTAAGGGGCAGTATTGCAACGATGTGTCTTAGATCTACGGATGCTCCATAGCCAGCCGTCGTTACGCATGTGAAATCGAGTG 
CTGGCTGCCGTGCTTGCCCAAAATGGGCTGATCAGGAGACAGATCTTCCGGTGGGCTACCATTGTGCGTGTCCGGTGAGTGGTCGTCCAGCAGCT 
TGCTACCATTGTGCTTGTTCAGTTCTTTCAGCTCGACAGGTGTGGTCTTTTCGAATGTTGGCGGGATTAGTAATCGACTGAAAAGATAACATGTT 
ATTTATTGTGGGGAACCGCCTTAAAAACGAATAGTATTTCAGCTTACTTGCTGCACTGTAGCCAAAACATGGAGACCAGCATCATGGCCGAAGTA 
ACGCCGAAGGTCCAGAACGTACCTAGGCGCGTGTAGATGGATCCCACGAAGACTGGACCAAGGACGCGGGACAGACAACCCGAACCGGTCATCCA 
GCCCATCCAGACGCCCTGCGGCCGAGGTCCCAGAACTTTGGAGAATATGGTCTGGATGAGAGTCACCCCAATGGGATATCCTATCGATGTGAAGG 
CAAATCCAATGATGAACTGCGTTAAGGTCAAGGCCGGCAGTTCGCCGCAGCATTTCTGCGTTGTGGGGCAGCCCAAGAAGATGGGATCGCTTTCG 
CTTAGGTTCCAGCTGGCATTGAATGGCTGCGCCAACTTTGGCGGATCCGGACCCCAAGGCACGAAGAGCACGCGACCGAGAAACATCAGCGAGAA 
TCCGCCCCAGATGAGCACATAACGCTCGGCTATAATCTTGCACATGGGCTCAATCAAAACGAACGTGACCAGGGAGACAATAGCTGCGGTGGTCA 
TCATGATGCCCATGTACCATAGGGCCTCGTCGTTGGACCAAGCGAACATATCCATTGTCAGGGATGTGCCCAGACTGCAAGGCGAGGAAGCAATA 
GT T T AAAA AAAAAAAAAC C C AC T C T G AGGAT T ACTC A CGT CT C C AACAGC 
(SEQ ID WO: 505) 

Exon: 1001.. 2274 

Exon: 2525. .3427 

Exon: 3533.. 3771 

Exon: 4010.. 4560 

Start ATG: 2726 

Transcript No. : CT15293 

GTCTGAAAAGATTCTGGCGAGGAGAACGCATCTGGATCCTTTTGCGCACTGAAGGTTCGTGGACTGCGTCTGATGCTATTTCTGGCTCTGGAAGA 
AGGAGTTCCTTCAGAGGAAGAGGTGGGCGACGACATTGAAGAATCCGCTCGCTTTCTTTGGACCCTCGTCTGACGGGGGCTGCTGTTGCGACTAT 
TGCTGCTGCTGTTGCTATTTAGCCGCATGGTGGCTGTAAACAATTGGGGCAGCTGCTCCGCCGTGGCAGTCATGACTCCGCCACGGGTTTTTTTG 
TTCTCCTTGTTGGTTTCCGGGCTGCTGTTCGCACTGCTGCTTGACTTGCCGGGGACTGCGGCGAAGAGACTCCTGCCCTTCCTGTTGGCCGCCAC 
ATGGGTCTTTAATGGCAAGACACCCAGACTGTCTACGTCCGCATCATCCrCGTCGTCGTTGATTTGGCGCAGCCTGGATCTGGGACTGCCAAACA 
GTTGCCGCTTGCGCTCCGAAAGGCGCGGAGTGGCCATGCGACTGGGACGCCCACTACCCGTCGGGGTCTCCATGATCGCGCGCAAATCTTTTATT 
TCGCATTAACTTTGTCCAAATTGTGACTCCTTCGCGCTTTTTTGTAGCTGCCTAGTGTTGGGGGTGGTTGGATCGTTCAGGGTCTCTTCGGTCGT 
TTGCCAACACATGCCAGATCGTCTATGTTTTTACTGCAATGTTTATATATATTAAATGCATTACATTGTTTTTAAATTTTTTTTACATCAATGTT 
TATAAAAATCAAATACATTGCAATGTAAATATTTTCTGTTAGTTATAATTTTTTTACCACATTTTACATTTAACATTTTTCTTCGAAGCGATCAA 
GTCAGCTTGCAGTTTTTCGTAAATTATTTGGACTTATTCTACGATTTCTGTAAAATAAATAAGTAATGTTAATTATATTCTTGATCCAAGACAAA 
GTATTCAATATAGTACCACTTAACGAGTTACGATCATGATCGCTAGACATTCTATGTTTTCTAATCTTCATATATTTTAAAGATTTCCGTATTAT 
TTTAACTATCCGGTATTTTTACAGAGCTAAGGTGATCATATAACGGGAAAACTCTATCTGGTATTTCCACCCAAGTCAGACTCAGTCACACTCTT 
CCAGACGTCACACGGTCATATTGCGGACGAAGTGCGAATCGATTTGGAAAAATCTGCTGCTCGGCCTGGAAAGTAGAAAGTATAAAGCTCTGCGT 
TTATAATTTAATTTGCAGCCAAAGCCCGCTGCCGATCAGGTATCGCGTTTGTTCGTCGCCGTTGTCGTCCTCCAACCGCGGTATATACATACAAA 
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CATATAGGCAAATATATATAAATATATTTTCGACGCAGCCGCCGCCGCCGCCGCCGCGTAGATATCCATTTTCCGGGATTACCCCACCCAGAGTG 

ATATAAACGATATAACCGTGGACGACCACGCAGCGGAACCCGCTGCCGCCATGCCCGGAGCCGGCACGTTCAAGTTATTCATCGGGAATCTTGAC 

GAGAAGACACAGGCCACCGAGCTGCGTGCACTGTTCGAGAAGTACGGTACCGTCGTCGAGTGCGACGTGGTGAAGAACTATGGCTTCGTTCACAT 

GGAGACGGAGCAGCAGGGTCGCGATGCCATACAGAATCTGAACGGTTATACGCTGAACGAGTTCGCCATCAAGGTGGAGGCGGCCAAGAGCCGGC 

GGGCGCCCAATACGCCGACCACGAAAATCTTCGTGGGAAATCTAACGGACAAGACGCGGGCGCCGGAGGTGCGGGAACTGTTTCAGAAATACGGC 

ACCGTCGTCGAATGCGACATTGTGCGCAACTATGGTTTTGTCCATCTGGACTGTGTCGGTGATGTGCAGGACGCCATTAAGGAGCTGAACGGTCG 

CGTTGTCGATGGCCAGCCGCTCAAGGTTCAAGTGTCGACCAGTCGGGTCCGCCCCAAGCCGGGCATGGGCGATCCGGAGCAGTGCTATCGGTGCG 

GAAGATCCGGGCATTGGTCGAAAGAGTGTCCGCGACTGTACGGCAGTGCCGGAGGCGGACGCGAGCCGCCATCGCCCCTCAGCGCCGGCGGCTAC 

AGAGATCGCATGTACGGTCGTGACCCGTATCCGCCGCCACCACCGCCGCCGCCGTTCCTGCGTGACCGCATCATGGATGGCTTTAGGGACTATGA 

CTACTATGACCGCCGCTTCGAGGACTCACGCGATCTGTATGAGCGTCGCTATCAGACCTCGCGAATGCGCGACTTCCCACCGCCGCCCATCTCTC 

GCCGCGAACCCATGCCTCTACCGCCCACCTTAAGCGGCAGCCTGCGCTCCTGCAGCGTTTCTCGTGGATACGACACTATGTTCAGCCGGCGCTCG 

CCTCCGCCGCCGCGTAGCAGCAATGGAATGAGTCGGTACGGCTCGCCCACTCCACACGGTTACGAGGACTTTAGTCGCGATGCATTCGACGAACG 

CATGATTTCGTCGCGCGGAATGCGCGGTCCATCGCCGCCGGGTCGCAGATATGCGCCCTACTGAGATGAGTTTTACTATCACGGCAGCAGCTGAG 

TAGCGGTTGCACTACCCATGCCCCTGCCAATGGGTCACACATTCTACCACACGACCAGCAATCGAAGAACGGCGCCTGTGGACTGCTGGCTGCTC 

TGCTGGTCGCCAGCGGCGGCCTACTACCATCACCACCAAGCACACCAGCAGCACATGCTGATGCCGACGACAATTTCACAACTGTGGCTGCCGCC 

TTGGTAAACCACATACATGTTACTAAACCCAAACTAACCATTACTCGTACACTTAAACAAGAAGATGGAAATACGTGTTTAAGCATCCTCCCTAT 

ATGC AAAAC ACGCC AAAAAAAAAAAAGAAC TAGACAACGACGTGCAAC CTC TAC ACATTCAACCGT AGTACC AAT AAACGCAT AATC AGT CGAAA 

CTGTGCAAAAATAAACACTGTA 

(SEQ ID NO: 506) 

Start ATG: 1476 

MPGAGTFKLFIGNLDEKTQATELRALFEKYGTVVECDVVKNYGFVHMETEQQGRDAIQNLNGYTLNEFAIKVEAAKSRRAPNTPTTKIFVGNLTD 

KTRAPEVRELFQKYGTVVECDIVRNYGFVHLDCVGDVQDAIKELNGRVVDGQPLKVQVSTSRVRPKPGMGDPEQCYRCGRSGHWSKECPRLYGSA 

GGGREPPSPLSAGGYRDRMYGRDPYPPPPPPFPFLRDRIMDGFRDYDYYDRRFEDSRDLYERRYQTSRMRDFPPPPISRREPMPLPPTLSGSLRS 

CSVSRGYDTMFSRRSPPPPRSSNGMSRYGSPTPHGYEDFSRDAFDERMISSRGMRGPSPPGRRYAPY* 

{SEQ ID NO: 507) 

Name: RRM-type RNA binding protein 
Classification: RNA_binding 
Gene Symbol : lark 
FlyBase ID: FBgn0011640 



Celera Sequence No. : 142000013384256 

AAATGCATCTGAATATTCTCGAAAATATCTTCAAATGGCTTTCTCGAATCATTCTTTTAGCAAAACATTGGCGCCTGGCGCATCATTTTCGGAGT 
GACCATCGTGCTGTTCGCCTTGGAATTCCTGGTATTCGTCTTCTTGGGATCGGGCAGCGAGCAGCCGTGGAACAAGGCCGGAACTCCCAAGGATC 
CCGAGGCCAAGGACGAAAAGACTCCGTTGAAGGAGCTGCCAACTAAGCCCTAAGCCAACAATCTCGCCCCGAATCTTCTGCCTCAATTACTCGAC 
GAACTGCTGCTTATTCGGATAACTCTTGGATAAAGCCAGTTCAAAGTGGAATGGTCAATGCTGGACTGTAAATACCAATTACAAGTGTACAATGC 
CCACCCTGAATAGCGAATTCATTGAACATACGTTCGCTATTTAGGGCGTGCACTGTAGTTTTTAGATCGCTGTAAACCAATAGTTAGTAGCCGTA 
TCCATTACGGATAAACACTTCAGCCCGCCTAATGTACTTAACACTCTTGATTTCAAAAGTTGTCTACAAATAAAGCGACGTAATAGTGGATGCAC 
GGGGGTTCCTAAACAAATTCTGGTGGGGAAGGGAGAATGGGTCTCAGCCTCAATTTCGATGGAAATTCTGGGTGAATGACCCCATTTTCCTGTTT 
ATATTTAGCGCTAGCAATAAGCCATTCCACAGAATCTCAGAGGATTTAGTTACAACTCTGTACGTATAAAAAAATATATAAACAGGGGATCTGTT 
CTATCTGTTTGGTTGAAGATATGAACAATTGCATGATGTAACTAAATGGCAAAAGATGTAAGGCTAGAGGATGATCCTAACTCAACAATGCAAAG 
ATTGGGTTGTGAAATTGATTGTAACATCTAAGAACCAACTTTCTTTGCATCCTTATCACAATTTAAACTCCGATAACCTGCGGCGCCAAAAGACA 
GAGTCCGTTATCGATATTCCGGCAGAAACATATGATTTCATGTAATCAGTTTGCCTTTGCCGGTTTCAACGGATTGCGGAAAATGTTGAAAGCAC 
GGAACGTTTGTTTGCTGTCCAGGAGGTGGAAGAGTGGCGCCAAGAAGCGATGGAATGTCCTGCAGAACAAGAAGAACAACTGTGACCGAGCGCTG 
GAGAACTTCGATGACTTCTACGGAAGTGTCTACGGCACCCGGTGGAAGAACATGCGGGCGGCGCTTCTCACGAGGCACAAGTATGTCGCCCTGGT 
CAACAATTTCGGGGACACGGAGCAGACGTGCGCCATGCTGGAGTCAGATGGAGCCATCAACATGAAGTCGCTTATTAACCTGGCGCAGGATCGGG 
TAAAGGACAGCACGGAGACGGTGCCGGAACAAGGCAAATCACGTCATGAAATTGAAAGCAAACTGGATGCACTGCTGCGGAAACAGCAGGAGCGC 
GAGGTGGCCTCCATATACCCGAGCTCGCCAGAAGATGGTTCATCGGCGCCACTGCCGCTGGAACTGAAGTTTGACAACGTAGAGGAAGATCAGCC 
GGAGCAAGTGAATAATCCCTTCAAGCAGAGCCTGACGAAGGCTTTGGAGGAGGATGTGAAGCTGGACGAGCATCGGTTGGTGGATCCCCAATTTG 
GCACAGGCGGACTGTATGAATACATGCCTGCCCACAGCATCAAGGGCATGGAGGACTGGGTGGCAGAGTCGGAGCACTACAAGTACTACCAAACC 
AGCGCCGACTTCCCGCTCGCCATTGAACCGGAGCCTTCCTTTCACTACCCCGAGCATCTGTCCCTGTACACATATGAAATGGGCAACTGCTCGGA 
CTTCAAGGGACCCAAGAAGTGCATGACCGGTGTGCTCTCCCACTTTATGATGGATGGTGCCTCGACTTTGCCGCCTCTCTTCCTGCAGGTGAAGC 
CTGGCGAGCGTGTGCTCGATGCCTGTGCCTCTCCCGGCGGCAAATCGCTGCTTATGCTGCAGACGCTGCACTTGGACCACCTGGTCTGCAATGAC 
ATCCAAGAATCCCGCCTAAACAAGCTGCGCAAAGTGATGCAGGAGTACCTGTTCGACTACAAGGAGCGCTGGGCTGGAAAGCGCCTCATCCTCAG 
CCAGAGTGACGCCCGCAATCTGGACCAGTACGAGCAGTTCGACAAGATCCTGGTCGACGTGCCCTGCACCACGGATCGCCATGTGCTCAACGAGC 
AGGACAACAATATTTTCAAGCCGACGCGCATAAAGGAGCGCCTGCGGATACCAGAGCTCCAGGCAGGCATCCTGGCCAACTGCTTGCGTCTCCTG 
CGACCAGGCGGCAGTTTGGTCTACTCCACCTGCTCGCTGTCGCCCATCCAGAACGATGGTGTCGTCCACATGGCTCTGCAAAAGGTCTTCACGGA 
GTACGGCATCACAACTACCATCAAGGATCTGAGTCGCCACACGGCGCTCTTCAGCGATGTCTTCAAGTTCGAGCACCCAAAGGGACTCAAGTATG 
GACAAATGGTGGTGCCCTACCTGCCGGCCAACTTTGGACCCATGTACTTTAGTAAAATAACCAGAAATGTGTGATTAAATGAAAGTTGTTGGCTG 
ATTAACAAGACTATATAACTGTTGCCCCAATAAATAATAAGAGTTGTAAACTAAAAAGGTACCTTTATATTTAACATACTGCTTCCTTTGGTAAC 
CAGTATTTTCTACTGCTGGAATTACATTTGCTTGTCGGTCTTCTTTTTCTTTTTGCCCTTGATGGGCAGAGGACCGGGATTCTTGAAAATGGTGC 
TGGAAAGTATAGTCAGGCTCCTTAAGTTAAGTTCTAGTTAATTGAAGTTTAAATTGTACTCACTGGCAAGTGGTGCATATGGGACTGTATTTGCA 
GAAGACCGAGAGGCAAACCGAGCAGACGTAGCCAATGTCGATCAGCTCACGATGGCAGAAGCAGGAGGCGCGGTAGTCCACTTTGGGCGGTGGCG 
GCAGGACCAATTTGTGACGGATCTGCGGGGCGGGCAGGAAGACCCACAGGAGGTACTGCAGCAGGCCGTCCAGCTGGGTGACCTTGAGGAACTGT 
CCGGAGGTAATGTCACAGCCTTGCTGTAGCAAACTGAGGGTCTTGTCCAGGGCGCAGGTATCGATCGTGATGCCCAGTTTCTGGGCGGTGAAGAA 
GACGTTCATGAACGTCATGTACTGCGAGGCGCACTCGTTGCTGCCTGTGACGACCAGGATGCGGGAATGCATCTTGACGCCAGGAGCCAGATTCC 
TCTGGAGCTGTACGGGAGTATTTTGTTTTAGATTTGGTTTCCCAAGGCGATTCCTAGCTATTCTCTACCCTTGATATGTAGCACAGCGCCATGGA 
CATGCTGCCGGCCAGGAGACTCTCACAAGGAGCGCTGAGCCGTGGTGCGTTCATCAGGATGCTGCCCAGCTGCTGCTTCACCGTCTTCTCCACCA 
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GATTAAACGCCTCGTACTGCCCATCCACTTGGCGCAGTTCCACTTGTCTTCTGGGCAGCGGATACAGGAAGTTTCTGTAATGGAAGAGCCATTAG 

GTGGCGTACATGTGTGGCAGCCTCTACTC 

(SEQ ID NO: 508) 

Exon: 1001.. 2544 
Start ATG: 1033 

Transcript No. : CT15305 

TTGCCTTTGCCGGTTTCAACGGATTGCGGAAAATGTTGAAAGCACGGAACGTTTGTTTGCTGTCCAGGAGGTGGAAGAGTGGCGCCAAGAAGCGA 
TGGAATGTCCTGCAGAACAAGAAGAACAACTGTGACCGAGCGCTGGAGAACTTCGATGACTTCTACGGAAGTGTCTACGGCACCCGGTGGAAGAA 
CATGCGGGCGGCGCTTCTCACGAGGCACAAGTATGTCGCCCTGGTCAACAATTTCGGGGACACGGAGCAGACGTGCGCCATGCTGGAGTCAGATG 
GAGCCATCAACATGAAGTCGCTTATTAACCTGGCGCAGGATCGGGTAAAGGACAGCACGGAGACGGTGCCGGAACAAGGCAAATCACGTCATGAA 
ATTGAAAGCAAACTGGATGCACTGCTGCGGAAACAGCAGGAGCGCGAGGTGGCCTCCATATACCCGAGCTCGCCAGAAGATGGTTCATCGGCGCC 
ACTGCCGCTGGAACTGAAGTTTGACAACGTAGAGGAAGATCAGCCGGAGCAAGTGAATAATCCCTTCAAGCAGAGCCTGACGAAGGCTTTGGAGG 
AGGATGTGAAGCTGGACGAGCATCGGTTGGTGGATCCCCAATTTGGCACAGGCGGACTGTATGAATACATGCCTGCCCACAGCATCAAGGGCATG 
GAGGACTGGGTGGCAGAGTCGGAGCACTACAAGTACTACCAAACCAGCGCCGACTTCCCGCTCGCCATTGAACCGGAGCCTTCCTTTCACTACCC 
CGAGCATCTGTCCCTGTACACATATGAAATGGGCAACTGCTCGGACTTCAAGGGACCCAAGAAGTGCATGACCGGTGTGCTCTCCCACTTTATGA 
TGGATGGTGCCTCGACTTTGCCGCCTCTCTTCCTGCAGGTGAAGCCTGGCGAGCGTGTGCTCGATGCCTGTGCCTCTCCCGGCGGCAAATCGCTG 
CTTATGCTGCAGACGCTGCACTTGGACCACCTGGTCTGCAATGACATCCAAGAATCCCGCCTAAACAAGCTGCGCAAAGTGATGCAGGAGTACCT 
GTTCGACTACAAGGAGCGCTGGGCTGGAAAGCGCCTCATCCTCAGCCAGAGTGACGCCCGCAATCTGGACCAGTACGAGCAGTTCGACAAGATCC 
TGGTCGACGTGCCCTGCACCACGGATCGCCATGTGCTCAACGAGCAGGACAACAATATTTTCAAGCCGACGCGCATAAAGGAGCGCCTGCGGATA 
CCAGAGCTCCAGGCAGGCATCCTGGCCAACTGCTTGCGTCTCCTGCGACCAGGCGGCAGTTTGGTCTACTCCACCTGCTCGCTGTCGCCCATCCA 
GAACGAT GGT GTCGTCCACATGGCTCT GC AAAAGGT CT T C AC GG AGT ACGG CAT C ACAACT ACC ATC AAGGAT CT G AGT C GC C AC AC G GC GC T C T 
TCAGCGATGTCTTCAAGTTCGAGCACCCAAAGGGACTCAAGTATGGACAAATGGTGGTGCCCTACCTGCCGGCCAACTTTGGACCCATGTACTTT 
AGT AAAAT AACCAGAAAT GTGTGA 
{SEQ ID NO; 509) 

Start ATG: 33 

MLKARNVCLLSRRWKSGAKKRWNVLQNKKNNCDFJUjENFDDFYGSVYGTRW 

AQDRVKDSTETVPEQGKSRHEIESKLDALLRKQQEREVASIYPSSPEDGSSAPLPLELKFDNVEEDQPEQVNNPFKQSLTKALEEDVKLDEHRLV 
DPQFGTGGLYEYMPAHSIKGMEDWVAESEHYKYYQTSADFPLAIEPEPSFHYPEHLSLYTYEMGNCSDFKGPKKCMTGVLSHFMMDGASTLPPLF 
LQVKPGERVLDACASPGGKSLLMLQTLHLDHLVCNDIQESRLNKLRKVMQEYLFDYKERWAGKRLILSQSDARNLDQYEQFDKILVDVPCTTDRH 
VLNEQDNNIFKPTRIKERLRIPELQAGILANCLRLLRPGGSLVYSTCSLSPIQNDGVVHMALQKVFTEYGITTTIKDLSRHTALFSDVFKFEHPK 
GLK YGQMVV P YL PAN FGPMYFSKITRNV* 
(SEQ ID NO: 510) 

Classification: hypothetical 



Celera Sequence No. : 142000013384256 

ATCACACAAAAATAGCCATCACACAGCTGTCTCTTGTCTTTTTGGCTCTCTTCGCTGTTGTTGTTGTTGTGTGTGTACAAACACCCACAGCCTGC 
ATCACACACACACGCTCAAATTCGTGACAGCACCACCCACCCCTCACCAATAACCGTTAATTCCGCAGACACTTTCAGCCGGCGAACGGGAGAAG 
AGAATTTGCGAATAATAGCAAAGCCCCGTTTAAATGAACGTTTTTAGCAACAATCAGGCGTTACTCACCTTCTTTTGCTGCTGATGAATTTGCAA 
GGATCTGCAGTTGTATTCCAATTAGAACTGACGAGAAGCTGCGTAGCGAACACCGGCGAAATGGCGAAAATAATAACTTGGCGTGGCCTTTGGTG 
GGACCAGGCGTACATTCGTGGAAATAGTCAAGCGAGTTAACCCTTATGTGGGTGGCAGAATACTAAAAATATATGATACAACAAATTTTAGACTT 
AAGAAAACGGATGTTTAAATGTATAAAATATGATGGAAGTGCAATAGCCAAGCTTATAAACTAATTCTTAAATTATTTGCAATGCATTTAGGACT 
TTTCGATATACACGTAAATAAACTATATTTTGATTGAACACGGTACATTTCTACTAGAGTTCACCTTCAGATATTGACGCTATTTCTTGTCCTGT 
AAATTATTTTGTTAAATCGTTCACATTGTATGTCTAATAAAGATGTTTTCCAATAAAATGTTATTCCCTTTTCACTTAGTCTAACATATTCATTA 
AATAATCAATGTTTTATTTTTTATTTAGTTATTTTAATTAGTTCTTAATTATAAAACTGCTCTAGATTCTCGATATTTTGCGGAATCCGGCGCAT 
GGTCACACTGACGAACACGTATTTTCATTATTCTTGCGAAGCGCATCGCTTAAAACTTGATTACGCTCGGATAACAAGATGCACCCGCGTCTCAA 
GTGACTGAACAGGTAGCGAAGTGACCCTGGATGAGCGTGGGTAGGCAGCCGGCACCTCTGGAAAGCCCTAATCCCAGGTCCGCCAGCCGTTTGCA 
GGTTATGTCCGCTTGTGTACACTGTGACGGGGAACCGAAGAAGCAACAGGAGCGGATTCCTGGCAGAAGAGACAACCGCTGTACGAGATACCGCC 
CCTGACCTGCGCAACATGATCGAGGTGCTGTGCCTGCTGCTGGGCAGGATCCTGCTGCTCCTGGTCGGCGGCTACGAACTGATGTGGCGGCTCCG 
AGAGCGACTGAATGCACTGGCCGTCCGGTCATATGACCTTTGGCGCAGCAAAGCGGCACGTGAGGCGCACGAGCGGCGCGTGCTCGCTGACTGCC 
GCTCCCAGTTGACGAAGACGCCGCAGCATCTGGTCCTGGTTATCTCCCCCGTAGATGCCGGCGTGGATGCGGTGCTCCTCAGCAGGATCTTCGAC 
TTTGCCCTGGACGTGGGCATCAAACACGTCAGCCTGTACGACAGGCGGACGAAAGGCAGGGGATACGTGGACATGGCCGATCTCTGTCGATCCAC 
CAACGCGGACACGGGCAGCTGCTTAAAGTGGCCACCCGTAGCCAGTCCCAGCAAACTGGAGAACCAGCCCAAAAACGGACAAAAGACAAATGGTT 
ATGTGAACGGTTCACATTCCCCTCAACTGCAGGTGAGTGGCGGCGTCTGATAAGCCGTGATTAGTAATGGACTTTAGAACGATGTCGGAAAATGT 
CTTACAAGTATTACACTTTTTGTTTCAATTGTTAAATAAAAGAAGTAATGAGCTGGAGATTTAATTTAGTTCATCATTCACATGAGATACAAAAA 
ATTCTTCGTGCTGAGAACCAGTTTTACCTTTGTATCCATTAAAACCTATGTATAGCAACGTAATCGATTCCCGGAACTAGTCGTTGCCTTATCAG 
TTTCTAATTGACTTTTCGGCTTCGGCGTCTTTCAGCTCCATCAAATCAGCGCCTCCGACGGCCATGCCCTGATCGCCGACGTCTGCCGCGAGTTG 
TACGAAGACAGTAAGACGGAGTTGGTGCAGAGCCTACTTAAGCAGAAGCGCGAAGCGCTGACGGAGCAGATCAGCGACATGCTGAGCAAGCGACT 
CGGGTTCGAGGCCCCGGAGCCGGAGCTGGGCATCGTGTTCGCCCGGCAGACGTGCACCTACGGCCTGCTTCCCTGGCACGCGCGCTTCACCGAGT 
TCCACACGCATCCCAGCGGACGCCACTTCGACGTGGAGACGTTCGCCAGCATCCTGTGCAAGTACTCGCGGTGCGAGCAGCGGTGGGGCACGTAG 
GCACATCGTTCGCGTGGGTAGCGACACCCTTATATTAACTAGTTACCTTAACATTATGTAAATTATCGCCAGGAATTCCTTTTAGTTCGTAAGAG 
TAAACTGCAAAAGTGTATTGTTCACATGAAAGTGTCTTTTGCTGGAGAGGATCGCGCGTTTCTAATGGCGGTGCAACGCAGCAGCTCGTATCTTG 
TACATAATATTCGATCTTTATTGAGTGACGTTTTGCTGTGCTTAAGCTTAAGGCGCAACTAGATGAGTATATGTAAGTGGGCAGACTTAAGCGCT 
ATGGTTAAAAGTATTGTATTGCTTTTCGTTCAAGTGTCATCTCCGGCGCTCAGCCGCTCTACTCCCGACCGGGCTCAGCGTGGCGCTTGGACAGG 
AGATTGACGTCGGTCAGATACGTGTTCTTGTCGATGTTGTGCGGACCGGGACCCGTGGGCATCCGCTCGTGGAGGCCGTGGAAGGAGTTGCGCAG 
ATCCTCGAGGGAGTCGGCCAGGCCCTTCTTGATGCTGGCGAACTCGGTGGACATGAGCGACAGCTTGCCCAGCAGATAGTTGAGGTTCTCGTCCA 
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TGTCGATCATGTGTTTCTTGGTCTCCTCCACGCGGCTGCCGAGACCCATCATGGTGATGAGAGTGCTGTTGTTGTAGCCATCGGAAGCGGACTGG 
ATATCTCCGAGCATCAGCCGGCTCTGGCTGATCTCGCCGGCCATGATCTCGATGCGGTGCCACACCTGGTTCAGCTCCGTTTCCAGCAGTCCGCT 
GATGTTCTTGTTCGCCTCCTGCTGGATGCTCACCACCGCTCCGTCCAGCGACGAGAAGCGCTCCTTCAGCAGTCCGCTCAGCGTGGCCATTTCCT 
CCGCCAGCAGCTTGCCAACCCGCTGCACCACGTTCAGTGTGCCGAAGTCCACCTTGCGCTGCGTGTCCAGCACGACCTCCGAGTTGGTCATCATG 
ATGTTGTTGAACTTACCGAGCGCTTCGAGTCCCTCGTTCAGCTTGGCATT 
(SEQ ID NO: 511) 

Exon: 1001.. 1647 
Exon: 1936.. 2280 
Start ATG: 1156 

Transcript No. : CT15363 

GGCACCTCTGGAAAGCCCTAATCCCAGGTCCGCCAGCCGTTTGCAGGTTATGTCCGCTTGTGTACACTGTGACGGGGAACCGAAGAAGCAACAGG 

AGCGGATTCCTGGCAGAAGAGACAACCGCTGTACGAGATACCGCCCCTGACCTGCGCAACATGATCGAGGTGCTGTGCCTGCTGCTGGGCAGGAT 

CCTGCTGCTCCTGGTCGGCGGCTACGAACTGATGTGGCGGCTCCGAGAGCGACTGAATGCACTGGCCGTCCGGTCATATGACCTTTGGCGCAGCA 

AAGCGGCACGTGAGGCGCACGAGCGGCGCGTGCTCGCTGACTGCCGCTCCCAGTTGACGAAGACGCCGCAGCATCTGGTCCTGGTTATCTCCCCC 

GTAGATGCCGGCGTGGATGCGGTGCTCCTCAGCAGGATCTTCGACTTTGCCCTGGACGTGGGCATCAAACACGTCAGCCTGTACGACAGGCGGAC 

GAAAGGCAGGGGATACGTGGACATGGCCGATCTCTGTCGATCCACCAACGCGGACACGGGCAGCTGCTTAAAGTGGCCACCCGTAGCCAGTCCCA 

GCAAACTGGAGAACCAGCCCAAAAACGGACAAAAGACAAATGGTTATGTGAACGGTTCACATTCCCCTCAACTGCAGCTCCATCAAATCAGCGCC 

TCCGACGGCCATGCCCTGATCGCCGACGTCTGCCGCGAGTTGTACGAAGACAGTAAGACGGAGTTGGTGCAGAGCCTACTTAAGCAGAAGCGCGA 

AGCGCTGACGGAGCAGATCAGCGACATGCTGAGCAAGCGACTCGGGTTCGAGGCCCCGGAGCCGGAGCTGGGCATCGTGTTCGCCCGGCAGACGT 

GCACCTACGGCCTGCTTCCCTGGCACGCGCGCTTCACCGAGTTCCACACGCATCCCAGCGGACGCCACTTCGACGTGGAGACGTTCGCCAGCATC 

CTGTGCAAGTACTCGCGGTGCGAGCAGCGGTGGGGCACGTAG 

(SEQ ID NO: 512) 

Start ATG: 156 

MIEVLCLLLGRILLLLVGGYELMWRLRERLNALAVRSYDLWRSKAAREAHERRVLADCRSQLTKTPQHLVLVISPVDAGVDAVLLSRIFDFALDV 
GIKHVSLYDRRTKGRGYVDMADLCRSTNADTGSCLKWPPVASPSKLENQPKNGQKTNGyVNGSHSPQLQLHQISASDGHALIADVCRELYEDSKT 
ELVQSLLKQKREALTEQISDMLSKRLGFEAPEPELGIVFARQTCTYGLLPWHARFTEFHTHPSGRHFDVETFASILCKYSRCEQRWGT* 
{SEQ ID NO: 513) 

Classification: hypothetical 



Celera Sequence No. : 142000013384044 

ACATGACACTATTTCGGTTTTAAGCCCGCCACGCCCACAAACTCACCTCCTTATTGCCGGACATGCCGCTGACGTAGACTTTCAAGACCATTTTC 
GCAGATTACTGTGTCCTCGCACCGCAAATGAAATAATATTTTGCAACTAAAATCTGCTGGCTACTTTCACGGCGATTAAAAATATTTATAGATCT 
TTTCTTTTCCTGCTTTCTTCCGTCCTGCGCCCCTACGTAATGTTTTTCCACTGGTGACGAAGGCACACGCACTGGGCAAGGTCGTTGGTTTTATT 
GGCTGTAAAAACGGCAAAGACGCGGATAAAAAAATTGCCTGGCGACGCTTATAAAAGTGCAAAAATATTTGATATCAGGAGTGGAAAATATTTCC 
TACTTTGTCGTTGAATTGAATGGTGGGACCGCGCATTGACGGCTATAGAAAACCTTCCGGCTTATCAAATATACTAAATATGATGCTTGATTAAA 
ATACTAAAAACTTTGAACTTAAGTGATATTTCAAAATTAAATATTTTTAAGACAACTTAAATTTTATTATGCTCCCATGTAAGGTTTTCATTTTT 
TAAACGTTCTTCGATTAATAATCGTAATAATAAAATAATAATTTATAAAAAGCATAAGATAGGCTTTTTAAGGTTTTTTAAAAGATAGAAGAAAA 
GGATTCTTATTATAGAACAAATTATTTTTAATTTGGCGTCGATGAGTTAGCGGCTTCAAGGAACGGTGGAATTTAGAACATGAGGGTGTTTTTTA 
TTTATTAATTTTTTGTTACTTGGAGTTGAATTCCATAAAAATAAAAATGCAATAATGCTAACTTTCTACAAACGACTTAAGGTTATAGGACTCAA 
GAAAGTACATAGAATATGCAATTGAAAAATGTTCTCCCGAGACTTTTCGGTTATTTGAAGTTTTTGCTATATTTGGTAGAACCATAAAGTATTTT 
TTGGTATAGATCCGTTAGCAGGGCAACCAATATTAGCGGTCGCCATGCGGTCACTCTGTTCACGTCCCGTCACGCAAGCATCGTCGTCCATTTTT 
GCTAAGAAGCGAGACGAAAATTTGTAAATTATATCTAGCGAGTCGCCGAGGACGAACAACTATTGACAGTGCATTTCAGTGGCCTTTGGAATACT 
GCTCAAACAACCAGCAAGCAATAACATTACTTATATCTAACCGAAAACCTAAATTAATCACCCAAACTGAAGTACGGACGTGAATGTAAGTGGGT 
ATGTGCCAGCGGACTAGTGGCACAATAAAAAAGTAAATAAAGCGCGTGTGAAGTGGTGCAGAAAGCAAAAAAGCAGTCGGAAGGAAAAATCAAAA 
TACGCATTAATGTAAGTGCGATTGTTGGAAAAGCTGTTTAACTTTTTGTTCCTCTGAAAAGCTGATAATGATATGCGATGCGACCATTTTTAAAA 
TTTCATGAAATCAACATTCCAACTCACAACGGTGTGCGTGTGCGTTTCTTTGTGTGCGTGCGTGTGCAGGCTAGAGTGGGACAACCAGTGTGATC 
GCTTTGGCTGCTGCCATTTCGTTTGCTTTTTTTTAAACTAATCTCTGGCATTTGTCACGTAAAAGAGTATAACGTTAAACCCAAAAACGTTTTGT 
ATATGGCTTTAAATATTTAATAATTGTGGGCAGAGACAGCAACACCAGCAGCAGACGCATACACACGTGTTGCGGTTTATCGAGAGGGCAGTGTG 
CGTGTGTTAGTGCGGAAAACAGCTGGTACAGCTGCCGCTGCAGCGACGAAGAAGAAGAAAAGAAAGAAAGGGGAAGAGTGAGAAGAACACGCATT 
ATTATCCTGCGTAGTTTTCTTTTATCGGTGGTTGTTCTGCTATTGGGTTACAGTTTACACTTTTCTCTGTTTTCTTTATTTTCCCTCTGCATTGC 
TATGCATTTTGCCTTTGCGTGGTGGTTTTTTTTATTTTCATTGTCGGTCGTCCTTTAATTGCGCTGCGCCACTTCGTTTCGTCGCTTTCTTTGGC 
TGGTTTCTCCTTCTCTTCGGAGTTTCTGGATTCGTATTCCGATTCGGTATCGCCCTTTTCCTGCGACAACAGCAGCTGGGTGCACGGTATCGGAG 
CTGTCGGCGATAGCCGAAAGAAGCTAGAAACCCCAAGGAAGCCAACATCCTTTTAATTGTCGTGGAAACTATAATAGTCTCATAAAAATTAGGTT 
TTTAAAATGGTTTTGTCATTCTATTCTAGAAAAAGCTATGAATATAAGAATATAACAATATGTTTAATTGCTTACTAATTTTATTGAACAATATA 
ATTTAATCGTGACGAAATGGGGGAAAAAAAACATTTTTTTGTTGTCTTGAATTTTTTTTCCTCTCTTGTCATGATTATTTAGGTAAAACGGACGT 
CATCAAAGTCATTAAACTTACGCGAACTTCGCATTTAAAAGTTTTTTACTGGCATTTCTATTTTTAATAGCTTTCTGGGGAAAATATCGGATAAT 
AGCTTTTCAAATTATGCATTGCTAGCAACTTTGACTAGCGAAAAGTATTTACTAAAAATTTTTGTACCTTCCTATTTTCGGCGCTCTTTTTAAGC 
AAAGGCCCTTGTTTTGTAGCGTTGAAGCCTCAAATTATTTCCAAAATCACGCCCCATAAGCGATAAGATAAGAGAGCGTACAATTGTTGCCCACG 
ATACATACAAGGCGAGAAAAAGTCCAAATTGCCAGCATTGGCGGCCATAAATCGCTGCACCATTTTCTTTTATCGGCTGCGTGTGCGTGAGTGCA 
CGCCTGTGAATCCTGTTTTTTCCTTGAAACCAGTTTGCGTCCGTCTTTTCTTTGCCAGCAAGGTGTTCACTTCACCTTACTCATTCATTGTTTGT 
TTGCCCATTGCAGCCGACGCTTGATCTTCCATCTGCAGCATCCATCATTTGGAATCGAGGCAATAATCAGCATCTACCGACAATCAAGCAAGCCA 
TCTATCTATCAGAAGACCACCGGGTAGTAGAACCTCAATCATAGCGATGGCCTGTGCAACCCTGAAACGAGCCCTAGACTGGGAGTCGATGAACC 
AGCGGCCTCCGAAGCGCCGGCGTTGCAATCCCTTTGGCCAGGCTGGGAGCAATGCAGGTCCAGCGTCGCCATCCCGCGACGGTCCCAGCACCTCG 
GCTGGTCTGCCCCACACGCCCAGCAACCGATTCGCCAAGGATAGCACCGAACCTAGTCCGTTCAGTGAGTCGTCGCTGGCCAAAATGTCACCAGG 
TTAGTAAATCAGGTCGATGCTGGAAAAGGGACGTCGGATTCCTTTTATGCATATTACCTTAATACACCTTTCTTCTTTTCGAAACACAGACAAAA 
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TGGCCGAGAGCTTGTGCAATGAGATCAAGAGACTGCACAAGCGCAAACAGCTGCCGATCACTTCGTCGGCCTTGGAACGCATGCAGGATTCGGAG 
TCCAGCGGATCGGAGATGGGTCCAGAGAGTCCGCGCCGCCCGGACAGTCCACAGAACCTGATGCGCCACGGCGAAAAGGCCCTGTTCACGTTCAA 
GCAGGTGCAGCTCATTTGCGAGAGCATGATCAAGGAGCGCGAGAATCAGCTAAGGGAGCGCTACGAGTCCGTGCTGACCACCAAGCTGGCCGAGC 
AGTACGATGCCTTTGTCAAGTTCACATATGATCAGATACAGCGTCGCTACGAGGCAGCGCCTAGCTGTGAGTTCAAGTTACCATCGGTCATGGTT 
ATAGTGCTAATTACATTATATTTTTATTTCAGACCTGTCGTAAAGCTGTGTACACAGCTGTAGCCTCAGAAAGAAGAAATTAATAATTGCACTAC 
GATCTTATGTGACGGTAGATCTACCGCGACAACAACAGCCCTCATAAACTATAACAAACAACAAATAAGGAAACTAAAGCAAACGCAAAACAAAC 
CAACAAAAAAAAAAAACCTAAAAAGCAAAAAACCTAAAAACAA^ 

ATACACGCGAAGAAAACGGTTATAGTTTATGCCTAGATTTACATATATATATTGTATCGGTTAGCAGCAAAGGGATGAATTATTTAAAAGCAAGC 
AAAAACCACAAAAGAGAGAAACCAACAAACCAACTTGTTAAATTTATGTTTTTCACATAACGCAAATTCTAAATAAAATGACAAATATCAAATTA 
TGAACTATATGAACTATAAGCACACCGCCACTGCAGACATATACATATGGGTGCATCCGTAGAGAGAACGATTCAATGAAACCTCAAACGTAGCT 
TCGTACATAACAGACCGACAGAATTGGCAAATATATAATGTTAGGCACATGATGTAGCATATAAGGTATGATGTAAAACCTATCAAACTACATAT 
ATTGTTTTTTTGGAAATTGTAATTTATGAAGCGAAGCAACAAAGATAACTCACAACTCATGAAGAACACTTTTCATAGTTTATTTTTGCATATTA 
GTTGTAACACTAAGGCGAGAAACGAACGCTTGAATTTGCCTATTTTCTCGAGTATCGTTTGTTAATTGTTTCTTCCTTTGATGCGAATTATTGCA 
AAGTAATGTAAACACACAAAACAATTGAAATGTTTCGATTTGCTTTACACCATACAAAAATGAATAACGAAAAGAAAAGGCAAACAACAAAAAAG 
GTATAAAAGTTATTCATTAAGTTAAATTGAACGCTTTCTTATAAAATACGCTTGACAGAAGATGTTTCATGTTCATTGCAATATTTT 
(SEQ ID NO: 514) 

Exon: 1001. .1341 
Exon: 2864.. 3229 
Exon: 3320.. 3676 
Exon: 3738.. 3748 
Start ATG; 2992 

Transcript No. : CT15373 

TCACTCTGTTCACGTCCCGTCACGCAAGCATCGTCGTCCATTTTTGCTAAGAAGCGAGACGAAAATTTGTAAATTATATCTAGCGAGTCGCCGAG 

GACGAACAACTATTGACAGTGCATTTCAGTGGCCTTTGGAATACTGCTCAAACAACCAGCAAGCAATAACATTACTTATATCTAACCGAAAACCT 

AAATTAATCACCCAAACTGAAGTACGGACGTGAATGTAAGTGGGTATGTGCCAGCGGACTAGTGGCACAATAAAAAAGTAAATAAAGCGCGTGTG 

AAGTGGTGCAGAAAGCAAAAAAGCAGTCGGAAGGAAAAATCAAAATACGCATTAATCCGACGCTTGATCTTCCATCTGCAGCATCCATCATTTGG 

AATCGAGGCAATAATCAGCATCTACCGACAATCAAGCAAGCCATCTATCTATCAGAAGACCACCGGGTAGTAGAACCTCAATCATAGCGATGGCC 

TGTGCAACCCTGAAACGAGCCCTAGACTGGGAGTCGATGAACCAGCGGCCTCCGAAGCGCCGGCGTTGCAATCCCTTTGGCCAGGCTGGGAGCAA 

TGCAGGTCCAGCGTCGCCATCCCGCGACGGTCCCAGCACCTCGGCTGGTCTGCCCCACACGCCCAGCAACCGATTCGCCAAGGATAGCACCGAAC 

CTAGTCCGTTCAGTGAGTCGTCGCTGGCCAAAATGTCACCAGACAAAATGGCCGAGAGCTTGTGCAATGAGATCAAGAGACTGCACAAGCGCAAA 

CAGCTGCCGATCACTTCGTCGGCCTTGGAACGCATGCAGGATTCGGAGTCCAGCGGATCGGAGATGGGTCCAGAGAGTCCGCGCCGCCCGGACAG 

TCCACAGAACCTGATGCGCCACGGCGAAAAGGCCCTGTTCACGTTCAAGCAGGTGCAGCTCATTTGCGAGAGCATGATCAAGGAGCGCGAGAATC 

AGCTAAGGGAGCGCTACGAGTCCGTGCTGACCACCAAGCTGGCCGAGCAGTACGATGCCTTTGTCAAGTTCACATATGATCAGATACAGCGTCGC 

TACGAGGCAGCGCCTAGCTACCTGTCGTAA 

(SEQ ID NO: 515) 

Start ATG: 470 

MACATLKRALDWESMNQRPPKRRR^ 

RKQLPITSSALERMQDSESSGSEMGPESPRRPDSPQNLMRHGEKALFTFKQVQLICESMIKERENQLRERYESVLTTKLAEQYDAFVKFTYDQIQ 

RRYEAAPSYLS* 

(SEQ ID NO: 516} 



Celera Sequence No. : 142000012746884 

CTATCCAGAATTTCTTGAGCTCGCCGAAGGTGAGTTTGTAGCCCAGTTTGTAGCGCATATCACGTACATCCTCCTCGGTGAGGGAGAGCAGCACC 
TCGCCGTCGATGGCCTCCTGGCGCAGGCAGTCGAGCAGTGTTCTCGAAAAGTGCTCCATGCAGGTGGCCCAACTGGTCACATTCTCGATGGACCA 
CTGGGCCACGAGTAGCGGATCCACGGGCTCTGGATCTGGCGATCCATTGGTCAGGTGCTTATCTGTCGCCGCGTCGAGGTAGTGCGTCCGGGTCT 
CCTGATCCATTGTGACCAATCCACCGCCCTCGGAACGTGGCTGAGTCACCGGATCGCCAATTTCACCGTCGCACATCGCGCTTTCTTTTTTTTCC 
GTTCGATGTTCTTCTGGCTGTTTCTTATTCGTTCCAATTTTGCGATTTATCCAAAATCCAAAACAATTGTGCGGCCGAACTCAGATGGAGTGCAA 
CATTTTTTCACTGCTGGCTGCTGTTGCTGCTGCGGCGCATAATTAACACCTCACTTATTGCGCGTCCAGCGTTTGTTGATGTAAAAGCACATTTA 
ATTTTTGGTTTGGTTACTTGTTAAATTCAGACTACTCAGTTGTTCCGTTCAGAGAGCAAGCGAGAGAGCAGAGCGGAGTCAGCAACTATGCTGGG 
GGCGGGAGCGAGATAACAAGGCCGAACAGCTGACTTCGGCCATCGATAGTTTTTGGACAACGCCAACGCTATCGATGAACAGTTACTTCGGTCTT 
AACTGCGTGTTCACACCGAAAAATTTTCGAGAATTTCGCTCTCGTAAAAGTCAAAAGGTTTAAAACTCTTATCGTTAGCTTAATATTATTTTAAA 
TAAAACAATTTGAGGTGTCTGGCGAGTAGAAATTATTTAAATATTAACGCTTGACTGATATAAATCTCGTTCGATATATACGAAACCATGGTGAA 
AATGTGGAAGAGTGGAAACGGCCTCCAAAGCGAAGATATCGAAGCCACAGCTTCGCTTCGTTAGCATTTTGCGAGCGGCAAAGTGATTTAATTGT 
CCTCGTGCGAATGTCTAGACAGAAGTAATCCTCGAATAATATGAAGAAAGCCAAGCCGCTGCTGCACGACCATCTAATAACGATCCGAGTAAAGA 
AGGTGTGTCCAAATCCGTTGTTTATATCCACAAACTAGCCAGCTGCCCGATAGCCGGTAAATTCCCATGAAAGCAGCACAAAAAAAAAAAAAAAA 
ACACACAAAAACCCTTTGCAGGGCGGCGATTTTGGGCAAGGGAAAACAAGGTTAGGTCGCAGGCTACCCGCATCGTGTGCTAATTGCGTTCTACT 
TTCGATTTCAGTATCTGGAGGAGCACATCGGGGAAACCTACCTGGATGTGAAACAGATGACCAGGGAGTTGATGCAAAAGTATCCGGAATATTCA 
CGACGCAAATTCGGACCATTCAGACAGCTGGTGCACCAAGGTAAACATGGAACATTGTGGTTATTTGTTTTTAACAAATATTTCTATTACAGCAT 
TCTCGATCATTTCGGAAAGCTATAACTTGGACAAGGTAAGCAGTTCCGAGGAAGATTGTGTTAGCGAGGACTCCGAGCCACCACCCACGAATAGT 
GTGATGAACAATATGATGAACAGCCTGTATAGTCAACCAAGAAAGCCGCTGGCGCCCAAGCCAATCAGCGAGCCCATCGATATATCCAGTGGAGA 
CGAGAACGAAGATGACTCCAATACAAAAACCACCAACGGAGATGGGGTAGCTGCTGCTGCTGCTCCTCCTCCTCCTACTCCAGCAGTGCAGGGCA 
GTGCTTTGAAACGCCTGATGGAGGAAGTGCCGGAGATCGCAGTGGCTGCAAAGAAAGGTGGGTTATCAATCCATTAGTCTATGACTTGGTTAAAT 
GACATTCTTCATTAGTAGCTAAACCGAATACTATTCACGTTAGCAGTTCGGAGGCCATTCAAAAGCGTGAGTTTTTTTTTAAATACTTTTGTAAA 
ATCAATGAAACCTAAACTTTTTGTTTTAGTGCACCAAGTTGTGGGCAACAGAGCGAAAAATCTTTCAGAGGACGCCGTTCCGCGTTCCAAAGATC 
ACCGTAATGTGCCTGGCTTATATCAGCAGTTGCACCAAAACCAAAGTCGCGATCGCCTTCGGAAGTTCAAAAGGGATCTGGAGGTGCAGCACCCA 
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ACGGAAAGTTTCCGGGACATTGGTGGCATGGATAGCACTCTTAAGGAGTTGTGTGAAATGCTGATCCATATTAAGTCGCCGGAGTTCTACTTTCA 
ATTGGGTCTATTGCCTTCGAGAGGTTTATTGCTTCATGGGCCGCCAGGCTGTGGCAAAACCTTTCTTGCTCGTGCCATTAGCGGGGTGAGTTTTA 
TAAATGAACATTGAATCGTTAATTCATATATTAATCATCTCTGCTTTACGACACAGCAACTAAAAATGCCTCTAATGGAAATTCCGGCCACAGAG 
CTAATTGGTGGCATCTCTGGTGAGTCGGAAGAACGGATTCGCGAGGTCTTTGACCAGGCCATTGGCTACTCGCCTTGTGTGCTTTTCATCGATGA 
AATCGACGCCATCGGAGGAAATCGTCAGTGGGCGTCCAAGGATATGGAACGTCGCATTGTCTCGCAGCTGATAAGCAGTCTGGACAACCTGAAGG 
CCAACGAGTTTGGCCAATCAGTAGTCGTCATAGCGGCCACCACGCGTCCCGATGTCCTCGACCCTGGACTACGTCGCATCGGTCGCTTCGATCAC 
GAAATTGCCATACACATCCCATCGCGGAAAGAGCGGCGTGAAATCTTGCGCATCCAGTGCGAGGGCTTGTCCGTCGATCCCAAGCTAAATTATGA 
TAAGATTGCAGAGCTAACACCGGGATATGTCGGTGCTGATCTAATGGCTCTCGTAAGTCGTGCAGCTTCTGTGGCCGTAAAGCGTAGGTCCATGA 
AGAAGTTCCGTGAACTGCACGCTGCTAGTGAAAAGAATATGACAACGGTCACTTTGGATGACGACGAGCCCAGCGAGGACGCAGGTGAAACACCT 
GTGCCTGATTCAAAAGGTGAAGAAACTGCAAAAGATGCAGAGGCAGAACAAAAGGTAGATGGTGACAAGGAAACATCTGCTAAGGATAAATCAGA 
AGGCGACTCGCCAAATATTGAAACCCCCAAGAAAGCGACCAACGGCAACTCATCGATAAAGTCGCCGCAGAAAACACCGAAGAAATCAGCGGAGA 
AACCAACTGACGCAGCGATGGATGTAGACAATGTTGCTCCCGAGGAGCCGAAAAAAGCCGTTGAGCAGGAGGTGGATTCCTCCTCCTCCAACGAC 
GAGTACTACGAACCCACACTGGCCGAACTGACCAATTTTCTTGACAACCCCCCTGAAGAGTTCGCCGACCCAAACTTTTGCCTCACTCTCATTGA 
TTTCGTAGATGCCATTAAGGTGATGCAGCCTTCGGCCAAGCGCGAGGGCTTTATAACAGTACCAGATACCACTTGGGATGACATTGGTGCCCTAG 
AGAAAATTCGCGAAGAGCTTAAGTTGGCTGTCTTGGCACCAGTTAAGTATCCAGAGATGCTGGAACGTCTTGGGCTGACAGCTCCGTCAGGCGTT 
CTGCTATGCGGTCCGCCGGGTTGCGGCAAAACTCTTTTGGCCAAGGCTATTGCCAATGAGGCTGGCATCAATTTCATATCGGTGAAGGGCCCCGA 
ACTTATGAACATGGTAAGATAATTGAAATGTTGTATATTTCTTGCATTGCTCATCATGTCCGTATATTGTATAGTATGTCGGCGAGAGCGAACGA 
GCTGTGCGTGCCTGTTTCCAACGTGCCCGGAACTCTGCACCCTGCGTCATCTTCTTCGACGAGTTCGACTCCTTGTGCCCCAAGCGATCGGATGG 
TGGCGATGGTAATAATTCGGGCACTCGTATCGTTAACCAACTGCTGACCGAAATGGATGGCGTCGAGGAGCGCAAGGGTGTATACATATTGGCGG 
CCACCAACAGACCGGATATCATTGATCCGGCCATCCTGCGACCGGGTCGCCTGGACACCATTTTGTACGTGGGTTTCCCCGAGCAGAGCGAACGT 
ACCGAGATTTTGAAGGCCACCACAAAGGTAGAATATTGTGGTTACATATTTTATTCGGTAAATTGAAATCTATTTTTTGTTCTTTTCCATCGCAG 
AACGGAAAGCGTCCGGTTCTGGCCGATGATGTAGATCTCGATGAAATTGCTGCCCAAACCGAGGGCTATACTGGCGCAGATTTGGCAGGATTGGT 
CAAGCAGGCCTCCATGTTCTCCCTGCGACAATCTCTAAACAACGGCGACACAAATCTCGACGATTTGTGCGTACGCAGCCAGCATTTCCAGGAAG 
CTTTGCAGCAACTTCGGCCCTCAGTTAACGAACAGGTAAGAATGCCCCAGTACATTGTTAATTATAATTGTTTAACATGTATTTATTTAATAAAC 
AGGACCGCAAAATATACGATAAACTGCGCCTGAAATATGCTGCACCAAGGGTACCCACTCTAAATGATAAGTAAACCGATCAACAATAATGAGTT 
TGTTGATTTAGCACCATCGCGAATCAATTTAATTGTAAGTTTCGAAAATTAACTTTTAATTTTCGATACAAAATGCGATATGTTACCCAAAAAAA 
CAAAACCATTTAATCTATCTTGTAAAATGCGTAGAAGAGATATGTATTTAGTTTTTGGATTGTTTTAATTTTTTGTAGTATCTTACAAAATAAAA 
GTTAAACTATTTGGGTTCATCGCAGTTTAGTTTGACTTACACATTTAATTTCACGTTGCCACACTAATCACAAAACAAAGTTCTAAGCTTATAGG 
AGCACCAGCGACACTGGCGAAGAAATGGTAGGTTTGCCACGGCTGGTCAGAATCGCGCCATCTCCACCGCCCTGTCCATCACGACTAAGTCCGGA 
TTGTTTTATCGGTGGATACTGAGGCTCCATTTGTCGCGGGAAGAGTCCGGGCGGCAGGGCTTGAATGCCCTCGGGCAGATCTAGGGCTTCACGTG 
CCTTTAAACTCCTTTGCAGCCGTAAGATGTTCTCGTGCTGCGCGCGAATCGTTGTGCGCTTCGATCGCTTCTTCTTGGACGTGGTTTTGGCAAAA 
TCGCCACCAGCCGTATTGGTCTCTTCCGTGGTCGTTGACGTATTCTCGATTGCCACGCCCTCATTACTGCTTTCGTGACGAGACGGCCACGGCAG 
TGGTGGCTCCTTTTCACGGACAATCTGATGTCGCAGATCAAAGGAGCGCTTGCACATCTCTCGTATACCGTACCAACTTTCATGGCCAATTTGAC 
TGATCATCTCCACTGGCGGCTGAAAGCCGAGCTCTCGAGATGGCAGCTCCATTACCAATGCCATCGGTACTTTCAACACTCCATATACGTAGTCC 
ACAACGGAACCGGCAATGGTTCGCTTGGTCAGACAGCTGATGCTACCCGTTCGGTACTCCCGGCCATTGTAGGACTTGATTGCACTCTTGCCGGA 
ATTCGCCAGGGAGCTCAATTCACGCCAATAAATCGGATTGTCACGGCAGTAGCCCCAAGGGTACATGATACTTTGGCCATACGAGTGCAGGGACA 
AAAAGAACAGGAGGTTCGAACTCATCCTGTCGAGAATGCATCTCATGGCCCGCGTTTCCGGCTCGGAAAACGGACTTTCACCCTTGTATGTATTA 
CGGTTGATCTTGCTGGGTCCGCTGTTCCAGAAGATGTCGTAATTTCGGTTGCAATCAGTGCCCACAAAT 
(SEQ ID NO: 517) 

Exon: 1001.. 1142 

Exon: 1342.. 1465 

Exon: 1518.. 1862 

Exon: 1916.. 1966 

Exon: 2025. .2365 

Exon: 2432.. 3718 

Exon: 3780.. 4112 

Exon: 4181.. 4405 

Exon: 4468.. 4769 

Start ATG: 1086 

Transcript No. : CT15413 

CTTCGCTTCGTTAGCATTTTGCGAGCGGCAAAGTGATTTAATTGTCCTCGTGCGAATGTCTAGACAGAAGTAATCCTCGAATAATATGAAGAAAG 
CCAAGCCGCTGCTGCACGACCATCTAATAACGATCCGAGTAAAGAAGTATCTGGAGGAGCACATCGGGGAAACCTACCTGGATGTGAAACAGATG 
ACCAGGGAGTTGATGCAAAAGTATCCGGAATATTCACGACGCAAATTCGGACCATTCAGACAGCTGGTGCACCAAGCATTCTCGATCATTTCGGA 
AAGCTATAACTTGGACAAGGTAAGCAGTTCCGAGGAAGATTGTGTTAGCGAGGACTCCGAGCCACCACCCACGAATAGTGTGATGAACAATATGA 
TGAACAGCCTGTATAGTCAACCAAGAAAGCCGCTGGCGCCCAAGCCAATCAGCGAGCCCATCGATATATCCAGTGGAGACGAGAACGAAGATGAC 
TCCAATACAAAAACCACCAACGGAGATGGGGTAGCTGCTGCTGCTGCTCCTCCTCCTCCTACTCCAGCAGTGCAGGGCAGTGCTTTGAAACGCCT 
GATGGAGGAAGTGCCGGAGATCGCAGTGGCTGCAAAGAAAGTAGCTAAACCGAATACTATTCACGTTAGCAGTTCGGAGGCCATTCAAAAGCTGC 
ACCAAGTTGTGGGCAACAGAGCGAAAAATCTTTCAGAGGACGCCGTTCCGCGTTCCAAAGATCACCGTAATGTGCCTGGCTTATATCAGCAGTTG 
CACCAAAACCAAAGTCGCGATCGCCTTCGGAAGTTCAAAAGGGATCTGGAGGTGCAGCACCCAACGGAAAGTTTCCGGGACATTGGTGGCATGGA 
TAGCACTCTTAAGGAGTTGTGTGAAATGCTGATCCATATTAAGTCGCCGGAGTTCTACTTTCAATTGGGTCTATTGCCTTCGAGAGGTTTATTGC 
TTCATGGGCCGCCAGGCTGTGGCAAAACCTTTCTTGCTCGTGCCATTAGCGGGCAACTAAAAATGCCTCTAATGGAAATTCCGGCCACAGAGCTA 
ATTGGTGGCATCTCTGGTGAGTCGGAAGAACGGATTCGCGAGGTCTTTGACCAGGCCATTGGCTACTCGCCTTGTGTGCTTTTCATCGATGAAAT 
CGACGCCATCGGAGGAAATCGTCAGTGGGCGTCCAAGGATATGGAACGTCGCATTGTCTCGCAGCTGATAAGCAGTCTGGACAACCTGAAGGCCA 
ACGAGTTTGGCCAATCAGTAGTCGTCATAGCGGCCACCACGCGTCCCGATGTCCTCGACCCTGGACTACGTCGCATCGGTCGCTTCGATCACGAA 
ATTGCCATACACATCCCATCGCGGAAAGAGCGGCGTGAAATCTTGCGCATCCAGTGCGAGGGCTTGTCCGTCGATCCCAAGCTAAATTATGATAA 
GATTGCAGAGCTAACACCGGGATATGTCGGTGCTGATCTAATGGCTCTCGTAAGTCGTGCAGCTTCTGTGGCCGTAAAGCGTAGGTCCATGAAGA 
AGTTCCGTGAACTGCACGCTGCTAGTGAAAAGAATATGACAACGGTCACTTTGGATGACGACGAGCCCAGCGAGGACGCAGGTGAAACACCTGTG 
CCTGATTCAAAAGGTGAAGAAACTGCAAAAGATGCAGAGGCAGAACAAAAGGTAGATGGTGACAAGGAAACATCTGCTAAGGATAAATCAGAAGG 
CGACTCGCCAAATATTGAAACCCCCAAGAAAGCGACCAACGGCAACTCATCGATAAAGTCGCCGCAGAAAACACCGAAGAAATCAGCGGAGAAAC 
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CAACTGACGCAGCGATGGATGTAGACAATGTTGCTCCCGAGGAGCCGAAAAAAGCCGTTGAGCAGGAGGTGGATTCCTCCTCCTCCAACGACGAG 
TACTACGAACCCACACTGGCCGAACTGACCAATTTTCTTGACAACCCCCCTGAAGAGTTCGCCGACCCAAACTTTTGCCTCACTCTCATTGATTT 
CGTAGATGCCATTAAGGTGATGCAGCCTTCGGCCAAGCGCGAGGGCTTTATAACAGTACCAGATACCACTTGGGATGACATTGGTGCCCTAGAGA 
AAATTCGCGAAGAGCTTAAGTTGGCTGTCTTGGCACCAGTTAAGTATCCAGAGATGCTGGAACGTCTTGGGCTGACAGCTCCGTCAGGCGTTCTG 
CTATGCGGTCCGCCGGGTTGCGGCAAAACTCTTTTGGCCAAGGCTATTGCCAATGAGGCTGGCATCAATTTCATATCGGTGAAGGGCCCCGAACT 
TATGAACATGTATGTCGGCGAGAGCGAACGAGCTGTGCGTGCCTGTTTCCAACGTGCCCGGAACTCTGCACCCTGCGTCATCTTCTTCGACGAGT 
TCGACTCCTTGTGCCCCAAGCGATCGGATGGTGGCGATGGTAATAATTCGGGCACTCGTATCGTTAACCAACTGCTGACCGAAATGGATGGCGTC 
GAGGAGCGCAAGGGTGTATACATATTGGCGGCCACCAACAGACCGGATATCATTGATCCGGCCATCCTGCGACCGGGTCGCCTGGACACCATTTT 
GTACGTGGGTTTCCCCGAGCAGAGCGAACGTACCGAGATTTTGAAGGCCACCACAAAGAACGGAAAGCGTCCGGTTCTGGCCGATGATGTAGATC 
TCGATGAAATTGCTGCCCAAACCGAGGGCTATACTGGCGCAGATTTGGCAGGATTGGTCAAGCAGGCCTCCATGTTCTCCCTGCGACAATCTCTA 
AACAACGGCGACACAAATCTCGACGATTTGTGCGTACGCAGCCAGCATTTCCAGGAAGCTTTGCAGCAACTTCGGCCCTCAGTTAACGAACAGGA 
^GCAAAATATACGATAAACTGCGCCTGAAATATGCTGCACCAAGGGTACCCACTCTAAATGATAAGTAAACCGATCAACAATAATGAGTTTGTT 
GATTTAGCACCATCGCGAATCAATTTAATTGTAAGTTTCGAAAATTAACTTTTAATTTTCGATACAAAATGCGATATGTTACCCAAAAAAACAAA 
ACCATTTAATCTATCTTGTAAAATGCGTAGAAGAGATATGTATTTAGTTTTTGGATTGTTTTAATTTTTTGTAGTATCTTACAAAATAAAAGTTA 

AACTATTTGGGTTCA 
(SEQ ID NO: 518) 

Start ATG: 8 6 

MKKAKPLLHDHLITIRVKKYLEEHIGETYLDVKQMTRELMQKYPEYSRRKFGPFRQLVHQAFSIISESYNLDKVSSSEEDCVSEDSEPPPTNSVM 
NNMMNSLYSQPRKPLAPKPISEPIDISSGDENEDDSNTKTTNGDGVAAAAAPPPPTPAVQGSALKRLMEEVPEIAVAAKKVAKPNTIHVSSSEAI 
QKLHQWGNRAKNLSEDAVPRSKDHRNVPGLYQQLHQNQSRDRLRKFKRDLEVQHPTESFRDIGGMDSTLKELCEMLIHIKSPEFYFQLGLLPSR 
GLLLHGPPGCGKTFLARAISGQLKMPLMEIPATELIGGISGESEERIREVFDQAIGYSPCVLFIDEIDAIGGNRQWASKDMERRIVSQLISSLDN 
LKANEFGQSWVIAATTRPDVLDPGLRRIGRFDHEIAIHIPSRKERREILRIQCEGLSVDPKLNYDKIAELTPGYVGADLMALVSRAASVAVKRR 
SMKKFRELHAASEKNMTTVTLDDDEPSEDAGETPVPDSKGEETAKDAEAEQKVDGDKETSAKDKSEGDSPNIETPKKATNGNSSIKSPQKTPKKS 
AEKPTDAAMDVDWAPEEPKKAVEQEVDSSSSNDEYYEPTLAELTNFLDNPPEEFADPNFCLTLIDFVDAIKVMQPSAKREGFITVPDTTWDDIG 
ALEKIREELKLAVLAPVKYPEMLERLGLTAPSGVLLCGPPGCGKTLLAKAIANEAGINFISVKGPELMNMYVGESERAVRACFQRARNSAPCVIF 
FDEFDSLCPKRSDGGDGNNSGTRIVNQLLTEMDGVEERKGVYILAATNRPDIIDPAILRPGRLDTILYVGFPEQSERTEILKATTKNGKRPVLAD 
DVDLDEIAAQTEGYTGADLAGLVKQASMFSLRQSLNNGDTNLDDLCVRSQHFQEALQQLRPSVNEQDRKIYDKLRLKYAAPRVPTLNDK* 

{SEQ ID NO: 519) 

Classification: enzyme 
Gene Symbol: smid 
FlyBase ID: FBgn0016983 

Celera Sequence No. : 142000013385213 

GTCTACTAATTCAAATTAAATGTAGTTTAGTTACATTTATAAGCTGATAATTCAAAAGCTTAAATGCGGCGATTTTAATAATTAATTTACCACAT 
TAATTTAGCGGATTACTTCTCAAAGAATCGAAAAACAAACAATTGAAAAAGGGACTAAATCGATTGGCAGCAATCATCTCTAAAAAAAATTTTGT 

T T AT T T CT G AC AAC C GC AT AT A AAC AAAAT G AAAAG AGC AT T AAC C G GAAAT AT CT GC GC AT T T AT T GC AT T AT T AC T G AT AAT C G AGAG ATT AA 

ATGGTATTAGTGCAATGATAACATATTTGCAAGCAAAAAAAAAGAGACAATCAAATTTTATATTTAAAGGTAGACAGATAAAATCAGTACCTTTT 

ATTATCCAATTAAATATTGATTTAGGTGGCTATTTTTTGTCTTATTAAATATGGATAGAGCTTTACATTTTAACACATATTCGTTGAATAAATTA 

ACAACAAAGATTGAACATAAATATTTATGGAAATTGTAATTGAAGCTTTTAAACGTATAGCCTTAATATTAAACATTATTAATATTATATTATTT 

TAAGATGAAGGTTTTCCTTACAAAAGTAAAGACGAGAAAGTAAAACATAAACATTTGTCATAAACATTTAAATGAGCCCGCGTGGTGTGCAGATT 

CAGCCGACTTCTATTCAGCTGTTGGCTGAATTTGGCTTGCATGCGCACACGTTCCACGTTCCACAACAACATATTGAAATCCAAAAGCTGCTTCA 

CCACTTGGCAAGCGATCGTTAACAAGTTACACCAGAAATTTATATATACTAAAAAAATAATAAGTTACTTAAAAATTTAGTAATAATTATGTGGA 

TATAAAAAATGTACATTATTTTTCTCTACGCGCCGTTGTCCGAAGTTAACAGCAAGATAACATGCGATATTAGGCACCTACGTTCACACTGGCAT 

CAAATTTCGATAACTGACAAAGATCGATAGCCATCTGCGTTTCGTTCGCCACTAAGGCTTGCTTTTCAAATCAAACGGTAAATTCTGGCGGCGCC 

AAGCAGGCATTGACTCCAATTTCGCATCGATTACGAGAGTACTGTCTACGTTTGTCTACAGTTTCACCGACACATTCAGCGTGCTAACACGCTCG 

ACAAATTCTCGTGCTCGCGTTTCGTTAGGGAACGATAAAAATTGTGTAAAAATTCAGTAAAGTGAATTTCCATTCGAAAAAGCCAGCACACGCAT 

TCAAAAGGGCAAAGGTACAAAGGTAAGTGCGTGTGTACTCTTTTCTTCCTTTTCGCGGTTTTTCCCACTGCTTTTTACTAGCGTTCATCAATTTG 

ACCGCCTCTTCGTGATCTCCTCTTCTCGTTCCCCTCTTTCCGCGAATCGGCAAATGGCGGCTGCTGCAGCTGTTGCCAACCAAAGGCGTAGTTCT 

TAACTCGCGGGGGCTTGAGCGATTGGGTGCCCCGTTAAAAATAATCTTGTAAATTAGTTGGCTTATGTTGGCTAACACATAGGAATAACATATCG 

AGTTGCTGGCCACAAAGAAAACCCCTTTTTTGAACTCATGGTTACGCCCCTGTTGCACGCCGAAAGGAATATAAAAGACAGAGATAAAAAAAAAG 

AACCCAAGCAGCCCTTGATTCTGCTTCTTCCTGACATTGACTCACATTTTCCTTTTCAGTTTCACACATTTCATCGCAGCAGCAAACATCTAAAC 

TTTTAAAGAAAAAACACAAAACTCAAAAAAAAAAAAAAAATGAGTAAGGCGGATTCTAACTCACGACAGGGCTCCTACAAGGCCAACAGCATTAA 

CACGCAGGTAATGTACACAAGATCCTCCGCAGAACTGCGTTCGGAACTAACCTCACTTTTTGCCCTAGGACTCACGCATGCGCCGCCATGAGGTG 

ACCATCGAGCTGCGCAAGTCCAAAAAGGAGGACCAGATGTTCAAGCGGCGCAACATCAACGACGAGGATCTAACGTCGCCGCTCAAAGAGCTCAA 

TGGCCAGTCGCCGGTGCAGCTGTCCGTGGACGAGATAGTGGCGGCCATGAACAGCGAGGATCAGGAGCGCCAGTTTCTGGGCATGCAGTCTGCCC 

GCAAGATGCTCAGTCGGGAGCGCAATCCACCCATCGACCTGATGATCGGCCATGGTATTGTGCCCATTTGCATACGCTTCCTGCAGAATACCAAC 

AAGTGAGTAGTGGCTTTAGAATGTTGATGAGAGCATTATTGAATATCCCCTGATTCGCCGCTACAGCTCAATGCTGCAGTTCGAGGCCGCTTGGG 

CGCTTACCAACATCGCCTCTGGCACATCCGACCAAACGCGCTGCGTTATCGAACACAATGCTGTGCCGCATTTCGTGGCTCTGCTCCAGTCCAAG 

TCCATGAACCTGGCCGAGCAGGCAGTCTGGGCTCTGGGCAACATTGCCGGCGACGGAGCCGCCGCCCGCGACATTGTCATCCACCACAACGTAAT 

TGACGGAATCTTGCCACTGATCAACAATGAGACACCGCTCTCTTTTCTGCGCAACATCGTCTGGCTGATGTCCAACCTGTGCCGGAACAAGAATC 

CATCGCCGCCATTCGATCAGGTGAAGCGGCTGTTGCCCGTCCTGTCGCAGCTTCTGCTTAGTCAGGACATCCAAGTGCTGGCCGACGCCTGCTGG 

GCTTTGTCCTACGTCACGGACGACGATAACACCAAGATCCAGGCTGTGGTCGACTCGGACGCAGTGCCGCGCCTGGTCAAACTGCTGCAAATGGA 

CGAGCCGAGCATTATTGTGCCCGCCCTGCGCAGCGTTGGCAACATTGTGACTGGCACAGATCAACAGGTAGGAAAACAAATGATTTTAATCCTTA 

TTGTTATAACATGCCTGACCGACCTGTTTTTTTTTTTTTTTTTTATTAGACTGACGTTGTAATTGCATCTGGAGGTTTACCAAGGCTGGGACTCC 

TTCTACAGCACAACAAAAGCAACATTGTGAAGGAGGCTGCCTGGACGGTCAGCAACATCACAGCAGGTAACCAGAAGCAGATCCAGGCTGTGATT 

CAGGCCGGCATCTTCCAGCAGCTGCGCACCGTGCTGGAGAAGGGTGATTTCAAGGCTCAAAAAGAGGCTGCCTGGGCGGTGACAAACACCACGAC 

ATCTGGCACTCCCGAACAGATCGTCGATCTAATTGAGAAGTACAAAATATTGAAGCCTTTTATCGATTTGCTGGACACAAAGGATCCGCGTACCA 
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TCAAGGTGGTGCAGACGGGCCTATCCAATCTGTTTGCCCTGGCGGAGAAACTTGGTGGCACCGAGAACCTATGCTTGATGGTCGAGGAGATGGGC 
GGTCTAGACAAGCTGGAAACTCTGCAGCAGCACGAGAACGAGGAGGTCTACAAGAAGGCCTACGCCATCATTGACACATACTTCAGCAACGGCGA 
CGACGAGGCCGAGCAAGAGCTCGCACCTCAGGAGGTCAACGGAGCCCTCGAGTTCAATGCCACCCAGCCCAAGGCTCCCGAAGGTGGCTACACGT 
TCTAATCGCCCACCCCACACATTCCAAACTCCGCTCACACGCCTTACAAACAACTACACCTTCGACCGCGCTCACACACTATGCCATTGTCAAAC 
ATACGCATACTCATCATCACATCTTACAAACATTCCCAAACACTTTGGAGGTATGCATGGAAACACAGGCGAATGCATTAGCTTAAAGTCATAAT 
TGAATCGGGTTGGCAAAATCCACTCGCATGAATCCCCATTTAACATACCCGTATTCCTCGCGTTAGACTAGACTTCACAGATTGCAGCATGGATC 
TTTCTGGCAGTGTTCCTCGTTCACGATGCAACCGCTCGGGCAGCTGCGCTGGCTGGGATCCGTGGATCATAGTATTTCCAGAATCCGTCTGTACA 
GGGGGCACCTTATGTTTGCGTGTACTAGGCTTTTTATTGGATACGTTTCGAGCTCATTTCGGATATATTTTAGGCTGTTTAGTATTTATTTATGT 
ATACAGGAAACGCTTTTAATCCTCCACCTTCGCGCAAGCATTACAACAACAACAACAACAACAACAACTCATTCCCTTCACACCAGTGTCCTCAT 
TCCCGAATCGCATGTTTATATTCAAAGTTGTCAATAAATAATGAATTGTTTGAATGATTAATTAAATTTCATGTGTATTTAAGCTGCATTTGAGC 
ATTGCGAAAATTGGAGCCTATTTTTGGGCCTTGGCATTTTTTTTTGTATATTATTTGGAATGCTATTGGGATATACACCGATGCAGGCAATCGAA 
TTTAAAATGTTATCCCTTCGATGCGCTGGCGCCTTTTGTCCGTAAATCCGCGATAAAAAGTCATGCAGTCAATACATGTTTTTTTTTTTGTCTTG 
TACATTTAAATCGCATTTGGGCCGTAAGCGGATAGCCGGAAGATTAAGAGCTATCTACCGCAGCCACGCCCACTCGAGCAGCGGTGGACAGCGAC 
CCCAAAAAAGTTCCAGATTAGATATAATGATTAAACAACCGACGAGTGAGTAGTGAGTAGTAGTCTACTACTGGCACAGAACGCGCACCACGCAC 
AACTCAATCAAGATGAATCGCAGCGTTCGTGGTTATTGCAGTGATCCAGAGGGTAAGCTTAATCAGTGGAGCGACGACGTCACATCAACAACCAT 
GTCCACGTACGATCTTGTTCCCATCGGATGCAGTGCGTCGAATTCGGCGAATCGTGTGTGTGTGTGGCAAAACAAATTATAAAAATAGATTAAAA 
ATTCAGCATGGGCAAAGTAGTTGCCGAAAATTCGAACTGAATTGTGGAAGCATTGGCCGCCATCCGGAGCTTTTGCCTCCGCGTTGCCCAAATTG 
TCGTCACCGATGGATCGGATCTATGTATTCAATTGGTTCTTTCTCGAGCTATTTTATTATTTCCGAGCGCAAGCGTTTAGTCGACTTTCTGCCGA 
AGCTGCGTGCGGGCGTGAATCTCTAAGGGCAACTAATTTATATAATTGCAAAACGATCTGAACTGACATTACGCATTTGAAACTTAAAAATTTCT 
AACGATTTTTAAAAAAGGTAATAGAGAATAGTGCAAGTGATGTGAAACCAATCGGTAGTTTCGTAAATTGCCTTGTGTGTAAATCAATTCGCATT 

TTCACTGAATGTGT 
{SEQ ID NO: 520) 

Exon: 1001.. 1257 

Exon: 1675.. 1812 

Exon: 1874. .2187 

Exon: 2252.. 2822 

Exon: 2900.. 4144 

Start ATG: 1750 

ACTAAGGCTTGCTTTTCAAATC 

TTTGTCTACAGTTTCACCGACACATTCAGCGTGCTAACACGCTCGACAAATTCTCGTGCTCGCGTTTCGTTAGGGAACGATAAAAATTGTGTAAA 
AATTCAGTAAAGTGAATTTCCATTCGAAAAAGCCAGCACACGCATTCAAAAGGGCAAAGGTACAAAGTTTCACACATTTCATCGCAGCAGCAAAC 
ATCTAAACTTTTAAAGAAAAAACACAAAACTCAAAAAAAAAAAAAAAATGAGTAAGGCGGATTCTAACTCACGACAGGGCTCCTACAAGGCCAAC 
AGCATTAACACGCAGGACTCACGCATGCGCCGCCATGAGGTGACCATCGAGCTGCGCAAGTCCAAAAAGGAGGACCAGATGTTCAAGCGGCGCAA 
CATCAACGACGAGGATCTAACGTCGCCGCTCAAAGAGCTCAATGGCCAGTCGCCGGTGCAGCTGTCCGTGGACGAGATAGTGGCGGCCATGAACA 
GCGAGGATCAGGAGCGCCAGTTTCTGGGCATGCAGTCTGCCCGCAAGATGCTCAGTCGGGAGCGCAATCCACCCATCGACCTGATGATCGGCCAT 
GGTATTGTGCCCATTTGCATACGCTTCCTGCAGAATACCAACAACTCAATGCTGCAGTTCGAGGCCGCTTGGGCGCTTACCAACATCGCCTCTGG 
CACATCCGACCAAACGCGCTGCGTTATCGAACACAATGCTGTGCCGCATTTCGTGGCTCTGCTCCAGTCCAAGTCCATGAACCTGGCCGAGCAGG 
CAGTCTGGGCTCTGGGCAACATTGCCGGCGACGGAGCCGCCGCCCGCGACATTGTCATCCACCACAACGTAATTGACGGAATCTTGCCACTGATC 
AACAATGAGACACCGCTCTCTTTTCTGCGCAACATCGTCTGGCTGATGTCCAACCTGTGCCGGAACAAGAATCCATCGCCGCCATTCGATCAGGT 
GAAGCGGCTGTTGCCCGTCCTGTCGCAGCTTCTGCTTAGTCAGGACATCCAAGTGCTGGCCGACGCCTGCTGGGCTTTGTCCTACGTCACGGACG 
ACGATAACACCAAGATCCAGGCTGTGGTCGACTCGGACGCAGTGCCGCGCCTGGTCAAACTGCTGCAAATGGACGAGCCGAGCATTATTGTGCCC 
GCCCTGCGCAGCGTTGGCAACATTGTGACTGGCACAGATCAACAGACTGACGTTGTAATTGCATCTGGAGGTTTACCAAGGCTGGGACTCCTTCT 
ACAGCACAACAAAAGCAACATTGTGAAGGAGGCTGCCTGGACGGTCAGCAACATCACAGCAGGTAACCAGAAGCAGATCCAGGCTGTGATTCAGG 
CCGGCATCTTCCAGCAGCTGCGCACCGTGCTGGAGAAGGGTGATTTCAAGGCTCAAAAAGAGGCTGCCTGGGCGGTGACAAACACCACGACATCT 
GGCACTCCCGAACAGATCGTCGATCTAATTGAGAAGTACAAAATATTGAAGCCTTTTATCGATTTGCTGGACACAAAGGATCCGCGTACCATCAA 
GGTGGTGCAGACGGGCCTATCCAATCTGTTTGCCCTGGCGGAGAAACTTGGTGGCACCGAGAACCTATGCTTGATGGTCGAGGAGATGGGCGGTC 
TAGACAAGCTGGAAACTCTGCAGCAGCACGAGAACGAGGAGGTCTACAAGAAGGCCTACGCCATCATTGACACATACTTCAGCAACGGCGACGAC 
GAGGCCGAGCAAGAGCTCGCACCTCAGGAGGTCAACGGAGCCCTCGAGTTCAATGCCACCCAGCCCAAGGCTCCCGAAGGTGGCTACACGTTCTA 
ATCGCCCACCCCACACATTCCAAACTCCGCTCACACGCCTTACAAACAACTACACCTTCGACCGCGCTCACACACTATGCCATTGTCAAACATAC 
GCATACTCATCATCACATCTTACAAACATTCCCAAACACTTTGGAGGTATGCATGGAAACACAGGCGAATGCATTAGCTTAAAGTCATAATTGAA 
TCGGGTTGGCAAAATCCACTCGCATGAATCCCCATTTAACATACCCGTATTCCTCGCGTTAGACTAGACTTCACAGATTGCAGCATGGATCTTTC 
TGGCAGTGTTCCTCGTTCACGATGCAACCGCTCGGGCAGCTGCGCTGGCTGGGATCCGTGGATCATAGTATTTCCAGAATCCGTCTGTACAGGGG 
GCACCTTATGTTTGCGTGTACTAGGCTTTTTATTGGATACGTTTCGAGCTCATTTCGGATATATTTTAGGCTGTTTAGTATTTATTTATGTATAC 
AGGAAACGCTTTTAATCCTCCACCTTCGCGCAAGCATTACAACAACAACAACAACAACAACAACTCATTCCCTTCACACCAGTGTCCTCATTCCC 
GAATCGCATGTTTATATTCAAAGTTGTCAATAAATAATGAATTGTTTGAATGATT 
(SEQ ID NO: 521) 

Start ATG: 333 

MSKADSNSRQGSYKANSINTQDSRMRRHEVTIELR^ 

MLSRERNPPIDLMIGHGIVPICIRFLQNTNNSMLQFEAAWALTNIASGTSDQTRCVIEHNAVPHFVALLQSKSMKLAEQAVWALGNIAGDGAAAR 
DIVIHHNVIDGILPLINNETPLSFLRNIVWLMSNLCRNKNPSPPFDQVKRLLPVLSQLLLSQDIQVLADACWALSYVTDDDNTKIQAVVDSDAVP 
RLVKLLQMDEPSI IVPALRSVGNIVTGTDQQTDVVIASGGLPRLGLLLQHNKSNIVKEAAWTVSNITAGNQKQIQAVIQAGIFQQLRTVLEKGDF 
KAQKEAAWAVTNTTTSGTPEQIVDLIEKYKILKPFIDLLDTKDPRTIKVVQTGLSNLFALAEKLGGTENLCLMVEEMGGLDKLETLQQHENEEVY 
KKAYAIIDTYFSNGDDEAEQELAPQEVNGALEFNATQPKAPEGGYTF* 
(SEQ ID NO: 522) 

Name: Pendulin 
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Classification : known__f lybase_gene 
Gene Symbol: Pen 
FlyBase ID: FBgn0011823 



Celera Sequence No. : 142000013384539 

AACATTGCAGTAAAAAAAAACAGAAGCAAAGGGGAGCAACAAAAACAGAGAGAGGGCAAAATAAGAGAGAGGACAACCAGACAGAGCGAGATGCC 
AAACAACAGCTGTTCTTCTTGCCAGATGCGAAATGGGCAATTTAATGAAATTTATCTAGATTTCACTCACTTTTCACTTGCCAAACGTCGTGTTT 
GTGCACTTTTCACAATTGTCGCGGCGTCACACGTTTATTTATTTACATTTCTCTGCGTTTCAGGCGCTCGCACGCTCAATTAGCGCGCTTTCCGC 
TTTGTGCCACACATTCTGTGCCAGCCACAAAAAAGCCCGCAATTCGCACTTTCATGTGTTTTCGGCTATTTCTGCGATCGTTTTTCCTGCACTTG 
CACTGCTCAATAGCTAATGTTTAGTTAATTGACACTTTCTAAGCGTTTTGCGTATATTTTTTACGAATTATTTTGGCAAAACGACGCGGTACCAG 
CGTCAACGTCCGCATTCTTCAAAATCGGCGAATGAACTGAACAAAACGAACTGCGCAAGTTGGGCAATAGGAACAGCTGACCGCGCTGCCAGACT 
GCCGAACGGAAACTCGCTGCGAGAGTCTGGCAGCACTTGCAACCATCGATATAAACTATTGGTAACGAATATTTAAAGAATTAACTTTATAGTTC 
CATTGAAAATACATACATAAAACAACTATAATTATGATACCTAACATAAAACATCGTAATATCTTATTTATAGTTATCTTTTGTATGCAATATTC 

AAT T T T AAGTT AAT T AG AT AAAC AC C AAAC GT T TTTTTATTTGT G AAAAT AG AAAAAG AAT T C AAT AG AAAC AAGT TT T C AAG GC AATC T GC AT A 
TTCTTAAATCCAACTTTATGGGCACTTTTGAATATTCATTTATTTTGCAGCCCTAGCAATCGCTGAGCGATAACCATAAGTTGACTATCGATAGC 
AGCAGAGCGATATGCAAAACAAACGAAAAAAGTAAAAGAGCGGTGTAAAGCAGACAACATAGTTTACATTTTTTGCAACAAGTTTTTCAACTTAA 
AGCCAACTCCCATCGTCCAATGGAGGTGGACCAGATTCCCCTGGCCGACGTGGTCTTCGTCATCGAGGGCAGCGCCATCAACGGAGCCTACATCA 
ACGAACTAAAGACCAACTACATCCTGCCGACTTTGGAGCACTTCACCACTGGCTCCATCGACGAGCGGGAATACCTGATTGCGGAGCGCTTTGCC 
ACTCTGTACGGTATTGTGGTCTACCGTACAGCGGCCAATCTGCTGGAGCCAGTTTGTTCAACCTATGGTCCCTTTCTGCAGCCCCAAAAAGTGAT 
GGAGACGATCGAAAGATTGCCACTAGTGGGAGGCGGAATGGAGTCCTGTGCCCACATGGCCGAAGGATTTGCTGCGGCCCACGGCTGTTTTGATG 
ACATCAGCGAAAGGCGCCAGTTGCTCGATCAGACCAGCGTCCAGAGGCACTGCATTCTCATCTGCAACAGTCCGCCGTACCAGATGCCCACAACG 
GAGTCGTGGAAGTATCCGGGCAAATCATGTGAGCAGCTGGCGGCGCTCTTCAACGAGCGCAAAATCAATCTCTCCATTATTGCGCCACGCAAGAT 

GCC GGT GT T GT T C AAACT T T T CAT G AAG GC C GAC GG AG AT C AGCCC AT T AC G AGC AAAAACT AT GC C AAG AAC AT AAG AC AC CT AGT GC T T C T AA 
AGGGATACAGCCTCAAGGAGAGAGCACCTAGTCCCAACAGCATGGCGGCCCAAATGGCTGCACCCAATGCGGCACAGGCTACAGTGCAGCAGCAA 
CAACAACAGCAACAAAATCAAGCTGGTCAGCAACAACAAGGGCAGGGCATGCCCATGGACACCACTCCAGCCCAGCAGCAGCAACAGCAACAACA 
ACAGC AGC AGC AAC AGC AAC AAC AAG GC AAT C CT C AGC AGC AG GT C AT G AAC AT G AAC AC GAT G C AGC AAC AGC AGC CCG GT CC AAAT CC C CC AG 
CGGGCTTACT C AAT C C AC AGC AGC AGC AGCAAC T AT T GC AGC AGC AGC AGC AGAAT CAATTCGTTT C C AAT C AAAT GC AG AATC AG AAC T T C C AG 
CAAAACGTGGGACCCGGCCAAAACCGCTGGATGTATCCCAACCAGCCTGGTCAGGCGCGTCCGCCCTTCATGCAGGGAGCAGGCAATGTGGGCGG 
TGTGGGCCAAGGCGGCGGCATGC AAC AGAAT CCAAATTCTGCTCTGATATCGCGC AT TAATGCGCCGCCGCCGAACC AAACT GTGACCTCGCTGC 
AGCAGCAGCAACAGCAGCAGGCGCAGCAGCAGCAGCAGCAGCAACAACAGGCCCAACAGCAGCAGCAACAAAGGATGCAGATGCTGAGCCAGCAG 
CAGATGCTAAATCACCAGCAGTTGCAGCAGCAACAGCAACTGGCCCAGCAACAGCAACAGCAGCAGCAAGGACAGCAACAGCAGCAGGTAAATCC 
CAATGCAGGAAACAATATGATGCCGGCTAGCAATGCCGGCAACATGTCTAATCCGCAGCAGCAGCAACAACAAGTAGGACAACAGTCGAATCCGC 
AGCAGCAGGGCAATCCGCAGCAGCAACAGCAGGGAAATTCGCAGCAGGAGCAGGCCTCCTTGAGGGAAAAGATATGGACCGGAGTGCTGGAGTGG 
TCAGAGAAGCCAAAGTCCGATCAACAGAAGATTCCCCATACACTCCAGTGTACCGTGTGCACCAACATCAAAGATGGCGAGCCCGAGATCAAGGC 
TGAGAACTGGCCGCCCAAGCTGTTGATGCAGCTAATGCCCAAGCACCTAGTTGGCAATATTGGTGGGCAGTTTCTCAAGGACTCGAAAATGGTTG 
TTTTTCGACCCACTCCAGGAGAGGCACTCGATTCGCTGGCCAAGATGATGACCTCCGGCTATGCTGGCTGCGTCCACTTCTCCTCCATCCCCAAC 
TCACCAGCCTGTGACCTTAAGGTTCTCATCCTACTCTACACGCCCGATCGCAACGCTTTTCTGGGCTTCATTCCCAATAATCAGGCAATGTTTGT 
GGAACGTCTGCGCAAAGTCATACAGCAAAAGCAGCACGGCAACATGCAACAGCAGCAGCAACAGCAGCAAATGATGCAGCAGCAAGGCAAGTCGC 
CCATGGAGCT GCAGC AGC AAC AGCAACAAC AAC AAC AGCAGC AAC AGATGC AAC AGGATAACTCGCAGC AGCAAC ACTACAACC AGT TCCAACTC 

AAT AT G C AG AT GG GT G GT G GAG GTCCTGGCGGCG GAC CTGGTCCCGGTCCCGGCGGCATGCC CAT G C AAC AG AAT C AAAT GC AG AT GAAC AT GAT 
GCAGCAACAGCGAATGCCACTGGGCGTAGGTGTTGGCGTGGGTGTTGGTCCCGGTGGCGTGCCCAATCCAAACCTCCAGCAGCAGTTGCAACAAG 
TGGCGCCAAACGTCGCCGCAATGCAGCAGCAGCAGGCGCAACAGCAGCAGCAGCGCATGGTGCGTCCCATGATGAGCAACAACAACCCGGGGCTG 
CGCCAGCTCTTGCAGCACCAGACCACGCCAGGCAATCAGTTCCGGCCGCAAATGGGTGGCCAGAATCCCAATCAAATGGGAGCAGGCGGTCCAAT 
GGTCGGCAATCGCAACTTTGACGACGGAAACTATGAGTTCATGTAGGCTACTTAAGAGTATCGTATAAAAAACCAGAATTAAGACAGTAAAATAT 
TTT AT AAAAT CCAACTGC AAGT CTTTTTTGGGGTAGTGGC AAAAT CTATTTCCCAAAATCGAAATATTAATTTCTCCATTTTC AT AAC AT TTCGT 
TTATTTAATTAGTAAGAATCAAGGGAAATCCGAATTTGCACAAGACTTATATTAATCACAGAACATAATCTCCGAGAAAGTCCACTAGTTTGCTC 
CACCGCCGCCGCCCCCCGAGGATGCGGAACTTGCACTTCCAGCGCTAGCGGCAAACACCTTGCGAAGATTCTGCAAGATGCTGAACAGAACCGTT 
CCCGCGCTGCCCAACGTTTCCAGAACATCGAATCCGGTGCTGTCGATGGCTGTCTCCCGATCCTCCTCCGTTTCGATGGCCGGGGAGCTGATGTT 
CCCGCCATTGTGGTGAGTTTCGTGGACGCCATGGTGTCCAATCAGCCGATGAGTATTGCTCTTCACATGGGAGTCGTTCACGCCGCCACCGCCAA 
CTGTGTTTATCAGGTGCTTCACCGGCTCCAGGACCGGAATATTTAGCTTAATGCGATGCCATGCATCCGCAGTTTCGGTGAAGCCCTCATGCTGC 
TCGCCAGTGGCCTTCATCTCATCATCCAGCTGGTTTCGCTCGCTGACCGGATCGTCGTCCGTCTCCAGGGACAGCTCCAGACTCATGTTTCCCTT 
ACCAGGCAGTTGCAACAAACTGCCGAAGAGCTGTTGCAGATGCGCCGAGTTGGTGGCCTCCGCCTCATTCTGCTCGAAATCGAACTCTTCACGTA 
CTCCCGACTTGGGCTTCTGGTTCACAAAGCTGGTCACATGCAGCATTCCATTCTTGGTGCCAGAATCACTGGAGTTGGTGGGCTTGGCGGACGTC 
GTCGTCGTCGTCGTCGTCAGCAGTAGCAGCCACAAAAACTGCAGATACATTCGGAGAAATGGAATCTATAATTGGCGCTTAATGACCGGCTGCAC 

G 

(SEQ ID NO: 523) 

Exon: 1001.. 3656 
Start ATG: 1065 

Transcript No. : CT15479 

CAGACAACATAGTTTACATTTTTTGCAACAAGTTTTTCAACTTAAAGCCAACTCCCATCGTCCAATGGAGGTGGACCAGATTCCCCTGGCCGACG 
TGGTCTTCGTCATCGAGGGCAGCGCCATCAACGGAGCCTACATCAACGAACTAAAGACCAACTACATCCTGCCGACTTTGGAGCACTTCACCACT 
GGCTCCATCGACGAGCGGGAATACCTGATTGCGGAGCGCTTTGCCACTCTGTACGGTATTGTGGTCTACCGTACAGCGGCCAATCTGCTGGAGCC 
AGTTTGTTCAACCTATGGTCCCTTTCTGCAGCCCCAAAAAGTGATGGAGACGATCGAAAGATTGCCACTAGTGGGAGGCGGAATGGAGTCCTGTG 
CCCACATGGCCGAAGGATTTGCTGCGGCCCACGGCTGTTTTGATGACATCAGCGAAAGGCGCCAGTTGCTCGATCAGACCAGCGTCCAGAGGCAC 
TGCATTCTCATCTGCAACAGTCCGCCGTACCAGATGCCCACAACGGAGTCGTGGAAGTATCCGGGCAAATCATGTGAGCAGCTGGCGGCGCTCTT 
CAACGAGCGCAAAATCAATCTCTCCATTATTGCGCCACGCAAGATGCCGGTGTTGTTCAAACTTTTCATGAAGGCCGACGGAGATCAGCCCATTA 
CGAGCAAAAACTATGCCAAGAACATAAGACACCTAGTGCTTCTAAAGGGATACAGCCTCAAGGAGAGAGCACCTAGTCCCAACAGCATGGCGGCC 
C AAAT GGCTGC AC CCAATGC GGC ACAGGCT AC AGT GC AGC AGCAACAAC AAC AGC AAC AAAAT CAAGCTGGTC AGC AAC AAC AAGGGCAGGGC AT 
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GCCCATGGACACCACTCCAGCCCAGCAGCAGCAACAGCAACAACAACAGCAGCAGCAACAGCRACAACAAGGCAATCCTCAGCAGCAGGTCATGA 

CAGAATCAATTCGTTTCCAATCAAATGCAGAATCAGAACTTCCAGCAAAACGTGGGACCCGGCCAAAACCGCTGGATGTATCCCAACCAGCCTGG 
TCAGGCGCGTCCGCCCTTCATGCAGGGAGCAGGCAATGTGGGCGGTGTGGGCCAAGGCGGCGGCATGCAACAGAATCCAAATTCTGCTCTGATAT 
CGCGCATTAATGCGCCGCCGCCGAACCAAACTGTGACCTCGCTGCAGCAGCAGCAACAGCAGCAGGCGCAGCAGCAGCAGCAGCAGCAACAACAG 

gcccaKcagcagcagcaacaaaggatgcagatgctgagccagcagcagatgctaaatcaccagcagttgcagcagcaacagcaactggcccagca 

ACAGctACAGCAGCA^AAGGACAGCAACAGCAGCAGGTAM 
ATCCGraGCAGCAGCAACAACAAGTAGGACAACAGTCGM^ 

CAGGCCTCCTTGAGGGAAAAGATATGGACCGGAGTGCTGGAGTGGTCAGAGAAGCCAAAGTCCGATCAACAGAAGATTCCCCATACACTCCAGTG 
TACCGTGTGCACCAACATCAAAGATGGCGAGCCCGAGATCAAGGCTGAGAACTGGCCGCCCAAGCTGTTGATGCAGCTAATGCCCAAGCACCTAG 
TTGGCAATATTGGTGGGCAGTTTCTCAAGGACTCGAAAATGGTTGTTTTTCGACCCACTCCAGGAGAGGCACTCGATTCGCTGGCCAAGATGATG 
ACCTCCGGCTATGCTGGCTGCGTCCACTTCTCCTCCATCCCCAACTCACCAGCCTGTGACCTTAAGGTTCTCATCCTACTCTACACGCCCGATCG 
CAACGCTTTTCTGGGCTTCATTCCCAATAATCAGGCAATGTTTGTGGAACGTCTGCGCAAAGTCATACAGCAAAAGCAGCACGGCAACATGCAAC 
AGCAGCAGCAACAGCAGCAAATGATGCAGCAGCAAGGCAAGTCGCCCATGGAGCTGCAGCAGCAACAGCAACAACAACAACAGCAGCAACAGATG 
CAACAGGATAACTCGCAGCAGCAACACTACAACCAGTTCCAACTCAATATGCAGATGGGTGGTGGAGGTCCTGGCGGCGGACCTGGTCCCGGTCC 

CG^CGGCATGCCCATGCAACAGAATCAAATGCAGATGA^ 

CCGGTGGCGTGCCCAATCCAAACCTCCAGCAGCAGTTGCAACAAGTGGCGCCAAACGTCGCCGCAATGCAGCAGCAGCAGGCGCAACAGCAGCAG 
CAGCGCATGGTGCGTCCCATGATGAGCAACAACAACCCGGGGCTGCGCCAGCTCTTGCAGCACCAGACCACGCCAGGCAATCAGTTCCGGCCGCA 
AATGGGTGGCCAGAATCCCAATCAAATGGGAGCAGGCGGTCCAATGGTCGGCAATCGCAACTTTGACGACGGAAACTATGAGTTCATGTAG 

(SEQ ID NO: 524) 
Start ATG: 65 

MEVDQIPLADVVFVIEGSAINGAYIHELKTNYILPTLEHFTTGSIDEREYLIAERFATLYGIVVYRTAANLLEPVCSTYGPFLQPQKVMETIERL 
PLVGGGMESCAHlffiEGFAAAHGCFDDISERRQLLDQTSVQP>HCILICNSPPYQMPTTESWKYPGKSCEQLAALFNERKINLSIIAP^MPVLFKL 
FMKADGDQPITSKNYAKNIRHLVLLKGYSLKERAPSPNSMAAQMAAPNAAQATVQQQQQQQQNQAGQQQQGQGMPMDTTPAQQQQQQQQQQQQQQ 
QQGNPQQQVMHMNTMQQQQPGPNPPAGLLNPQQQQQLLQQQQQNQFVSNQMQNQNFQQNVGPGQNRWMYPNQPGQARPPFMQGAGNVGGVGQGGG 
MQQNPNSALISRINAPPPNQTVTSLQQQQQQQAQQQQQQQQQAQQQQQQRMQMLSQQQMLNHQQLQQQQQLAQQQQQQQQGQQQQQVNPNAGNNM 
MPASNAGNMSNPQQQQQQVGQQSNPQQQGNPQQQQQGKSQQEQASLREKIWTGVLEWSEKPKSDQQKIPHTLQCTVCTNIKDGEPEIKAENWPPK 

LLMOLMPKHLVGNI GGQFLKDSKMWFRPTPGEALDSLAKMMTSG YAGCVHFSS I PKS PACDLKVL I LL YTPDRNAFLGFI PNNQAMFVERLRKV 
IQQKQHGNMQQQQQQQQMMQQQGKSPMELQQQQQQQQQQQQMQQDNSQQQHYNQFQLNMQMGGGGPGGGPGPGPGGMPMQQNQMQ^QQQRMP 
LGVGVGVGVGPGGVPNPNLQQQLQQVAPNVAAMQQMAQQQQQRMVRPMMSNNNPGLRQLLQHQTTPGNQFRPQMG 

DDGNYEFM* 

(SEQ ID NO: 525) 



Celera Sequence No. : 142000013384273 

CTATAACATTTTAAACAGGCGTGTTTCGATAGTGCGACGTAAGATAGTGTTAGCCTTGGTAATGGCGAGCAATTAGATAAAGATCACTCAGCTCA 

AGGTCTCAGAGAAATCTCGAGCTAATCAAACATCTCGATGGAATCCCGCTTTCTGGACGAAAAATTGTTTTCCCACCCGTAGCAATAAGAAAAGG 

CGAAACAAAACTGTCAAGCAGAGTGGACACATGTAGCACGATCACTTTTTTTGTTCCGGCGATAGTGTGTTTAGCTAGCCTGGGTCGCCAGCTTC 

GGCTCCAGCTCTAGTACCAGGTTCATCTTCATCTGCAGATCAGTAGCTGAAATCAGGATCAATGTGAGCCACATCCAGGCTTAGGTTTATCTCTA 

CCGCCTGCTCTCGTCTAGATCTGTCGGATGTCAGGTTGCATAGTTGGGCCGGCAATTGCATGTCGCGATCGTCCAGCGCCGAAGATCTGCCTCAA 

CCTCAAGCTGAACCTCATCCGCGTCCACATCCACATCCTCAATCCTGAGTCGCTAAAACTTGAAAGCCGCATTCGATGGCTGGCCGCTGACAGCG 

CGGAATTGTCGTTCGCATTGTCAGCTGGCATGGGCGTCGAAGGAGGGGCGGTGCTGAGCCAGGGGGGTTTATCCGCCGACTTTTGCCGACGATAT 

ACAAAAGATGCAACTGTCGCTGCTTGTTGCAGCAGTTGTTTCTGCTTTCTGTAGCTGCCCAGCTTGACAGTTGCTGCAAGAGCAAAAAAATGCGC 

TTGGGAAACTGCCACTGGCGATGCCGCTGCTGCTGCCACCGCGAGCTGGCTTAAATCAAATATTGAAATGCCTCAGAGCAGATCGGCGACCCTGA 

CCCGTGCCCGTGCCCCTGTCCCTGCCCCCCCACTCCACCTCCCCGGATACCAGCTAATTGCCCCACACATGGTCCAAATAATTGCCCCAGCAACA 

ACAGCCCGTACCAATGAATTCCACTTGGCTTTTATTTTTTGTAATTTTTTACTCATTGTCGTTTTGTGTTTTTTTTTTATTGATTTGTTTTCGAT 

TGCATTCGCATTTTTCTGCTCGCTTTTTTGTTTCTTGTTTTCCTTCGCATTAATACTTTACAAAATTGAATGTCTATGAGCTAGTATAAATTAAT 

TAAATATTATATTCATATCGAGCTATTTATATACTTAGTTTAACTAGCCAGCTAAGTGCATTTAATTAGTTAGTAATACCTACATGGATGGGGTG 

GTTTAAGTTAAATGTGTTTACAAAAAAAAATATTTCGTGTGCAAAATTTCATAAATTCCAAGCAACTAAAATCTTTTTCTGTACTTCGTGGGGTT 

GGTATGAATTTAATCTAAAACTAAAAGTTAATTTAAGCTTATTTATGTACAAAAAAGAATTTGTTGTTTTGTATGACCATGGAAATGGAAAAACA 

ATAGTGGAATTTTTACTGTTGCATTCCAAATGTTTTCCAGCCCTGGTTACCGATTTTCTCGAACTATTATACTGTCCACTTCGCATACGTTTTGG 

TCCTGCCCAAAGCAGTATGCAAGCAGTTCAAGCAGTGCTCGCTAGAGAGAAGGAAGGAGCTGCGCGCTTTTTGAATTTTCAGTCTTGTTTTCAAT 

GATTTAAAAAGATTTTAAACGACAGCATGAGGAGGGATTCAGCAGAAAGGGGGCGCGCTTCGAAGACTTTAAGCTACACTAAAGTATTTTAAATC 

GATTACTATTATCTAGGGGTACAAAAAATGGGGGCTTTGTAATTTACAAAATATTTTGCTTCGATTGGGAGCTGGCTTACATGACTAAAAACTGG 

GCAAACACATGTTGTATTCCTCACTTCTTCTTGAAATCTATTCGTTAATATTCCTTGTGTAATACTGCTAAAAAAAAAGTGAGAGCTTAAAGGGT 

TTTGCATGGTACACTTCAGGGGCGGTTAGTCGTATCTTTAACTAGGCTATAGTCTAGGTTCTTAAGGTGCTAATAACAAAGGCTAAGCGTAGACA 

AAATACATAACACATACAAAGAACGAAGAGGGCGGCTTCAGAGGACGGGACTGGGCTGGAGGCGGGCAGTGTCCTTACAGACACGTGTAGATGAC 

CTTTTTGGTCCGACACAGCTTGCACTTCACCTCGCAGCACCAGTGGAAGGTGCAGGCGCACCGCTCCACAACGACGACCTCGTCTCGCCGATAGC 

CACGCCCACAGCACATCAGCCCACAGCCGTCGACGCCCAGCGAGGTCTCATTGCACTGGCGGCCATGGGTTCCCAGGATGCCCTGCCGCAGGTTC 

TTCTCGCAGAAGCTCGGCGAAGGCTCCAGATAGACAAGGTCCTTCGAGCCGGGTGGCTTGTGCTCGGGATTGTGCGGGTTCAGTTGGAAGTGATA 

TCTGCAAGGAACAAAACAAGAAGGTAATGAGTAAACATGCTATTTTGTGAGATTACTATAATTTTAAGGCTGTGTTTTTTTTAAGAAATGAAGTT 

ATGATTGTTAACAAAAAATATCGTGGTGATTCTTTACCTATTATGCTTGCGTCCCTGACGACGTCCATTTCGTCCGCCCCTTTGACCAGCCTGGG 

GCAAACTGTTGGGCGACGGCATGTTCGGGTGATGGATCTTGCTGATCGGGTGGCTGTTCTCTAGCAGGATGTCCGGCATATGGTCGTTCAGCATA 

CGCTCCTCCTCCTCGCCGTAGACCAGACCAGACTGCGGAATAATCAGGCCGTTGGAGCCCACGGAATTCGAGCCGGCTGCATTCGGACTAACTGG 

GGCCAGAGCGTTGGTGGCCCGGAGACTGTTGGTCACTTGCACGCGGGTGGCTCCATCGAAGCGGGCCTTCAGATTGTCGCCAATCACACGGAAGT 

TGGCCAGTCGCATCCAGCAGGTCTTCACTGTACACGATCCGGACATGCCATGGCATTTGCACTCCTGTCGCATCTCCGCTTGGACGTGCTGTAGA 

TGAGAAAAGAGCGGGGAGAGATTGTTCATTAGTATGACTTTATTAGGCTTCTTGCGTATCGACTCGCAATATGTTTTTCACTTAATTATTTTGCT 

GTTTTTTCTTGGCCGGTTGTTTTTGCCGTTTTGCTGGCTGAATGGCTGGATGGGATGGCTGGCTGGCTGGGAGATTGGCGGCCCGCCTGCCACGT 
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TGTTGCAGTTATTAAAAATCAATAAATGTTGTTGCAACTTGCTGTGCGCTTGTTGCATGTCCCCCCCGAAGTTGCTGTTTGTAATTGTTGTGCGC 
GGACCAAAAGTAAGAAATCAACGCAGTTGGGGGCCTTCTCAGTTAGTTCGGTTAGCAGCGGCCGCAGCAGCCCGATGATCATCATCATGCTGATC 
GCAATGATTACGGAGCTGCGGCCCATAACAAGTAATAAAATAATAACACAGGCGGTGGGCTCTGCTCCCTCCCCAAAAACGTCGGAGGACTAGGC 
CCGGATCCGGAGGGATCGGTCGACTCTAGCTGGATACAGTGCATGTAGTGGATGTTGGCGCAGTGTGCTACTTTCTACGCCCCACTAAAATGGAT 
AGCAGGAGTGACGGTGCACAGTGGTTTGGTCAGGCCAAAAAAGTCCATTGCAATATGTGAAAAACTAACTATTATCCCACCATTAAAACTCATTT 
ATCGGTAATTAAGAAAACAGCTCTTTTTACATTTCTGTATAATCACAATTTTCTCAACATTCGACCACTGTACCCCTAAAACTAGAGTCCTACTA 
TTTTCTTGCATGTGGAGGGGAACTGGAAGCTGTCAACGAGTTAATATACGAGCGACACAAGCGCAAACGAGTGCAGAAATCCCCCCACATTCTCA 
TTCGGATGAAAGAGACAGCTGGAGGGGGAGCGAAAGAGAGAGCGCTAATCTCTACAAGCTAGGCCAATTGGCGTTCGGCTCCAATTAATTGTGGG 
GACGGAAATTGTCGTCGCTGGTTTTTATTAGCGCTCCAGTCGTAAATCGCCCAGCCAGGCTTAAAACCGAACATTTGGACTACTTATTACGAGGT 
ACACTGCGAAAAGAGCATATGAAATAAAGCCAAATCATACCTCATTTCTTATAGTATTGTATAGTAGTATTTTGGCTCTATTTGTATTCGTTTCT 
CAGTGTATTAATGCTGGGTATCTAAACTTACCGCTCGACCCGCCTCGTTGTTGTGCAGATTCATCTTCTCGCGCAGATTGCGACCCCTCTCGCCG 
GTATCGACGAATTCCCGGGAGAACTTGAACCCGAATCCGATGTTGTCGGAGCAGCCGCCCCACTCCCAATCCCGCACGCCGGCCACACTGCCCGC 
CTGGTGGTTCGCTTGTGGAGATCTCGACTGGTGGCTGTAGTCGCAGGTGCAGGACTCTATCGTTCCTTCACTGCAGGCCCTGGCAATCGAGTGGG 
TCACCGCCGCGCTGGTGATTGCGTAAATGAAGCTCGTCTCTCGGCAGCCTAGAATGATTACAAATAAAATATTTTAGTTAGTCAAATGTAACATA 
TTTTTAGAAAATTAAAATACAGAAATATTTTTTAAGCGAGTGGAATGTCCACATGGCTGATTTCATGAGTTTTCTTCTCTGTGCAGTCATTATTT 
TCGCCTTCTGGGTGTTCCACTTGGCACCGAAGACCAATACCGCGACCAAGACAAGATCTCAATCTCGACCAAGTGAAGTGATGGCCGGAGACGTC 
GTGGGCAAGAGTTGAAATTTTCGTTATGAGTAATGGCACTTGTCTTCCAAAGCCTGGTCAATTATGACACTTCAAGTTATCAATGCCACGATGGC 
ACGGCGAAAAAGAGAGATGAACGCACCAAAACCTGACACAGATTAAACGCTCAACTATAAATTCCAACTTAACACTTTGGGCCCATCTCGGCACA 
ATCCATTTTGAGTGGACAGTTTCCCCTTGTGGTCTTTACATTTATTTTTTATTTTTAGTAACTGCCTCCGTTTCTGTTTCGGTTTCTGCTCTGAG 
AGCGAGACCATCACAGATAAAGAACAATGAAAATTGAATTTACATGTTGCAGATAATTTTTCAAGTGGTTTCGCCTCGGTTCATAGCTCAAGATA 
TGATATATATATAGGGATGGTCCCGCCGGTCAGGTAGCTGAAAATTCCTGGGTAGCGCGTCTTGTCAACCTTGTTTGGAATTTCCAGTTGCGTTG 
CGAAGAGCGTTTCAAGTGTAGTATTTATTATTAAATGAATAACTCTTTCCCTATTTGAAATGAGTAGTAACATTTAGAATTTGAAAGGAGCAAGC 
CCAACAAAAATCACTGCCTAACTTATTTTCGTTGAGGTGTTGATTGACGCATCTGTGATTGAGTAGTCTGCACTTGAAGATTGACACATTCTAGC 
GCTCTCCGTATCACGGAAAACAACTTCCGCTTGGTGTTAGGAATTTTGCCGCTTTTGATTTCGATTTCCTGGAATTCGACCACGAAATAATTTGG 
TTTATTTTTTCGAAACTATACCGAAGTGAGCCTCCAATTCCCGGTTAATGTTTGTTTGTTTGTCGTTGGTGGTTTCGTTTTTCCTTAAATAATTT 
CACGTATGTAACGAAATTGAAGGCGCACACAGGTAAATTGTGTGTAAATATGTGCAAGATGTTTGGGCCGCGATAAGACGTCGGCGCATGCGCAA 
ACGGTATGGGAATCACAGAGGCGGGGGTGCGAGAAGGGATCGGAACGGGGGGCGGAGGAGCGGACTCAGGGGCGTGACACAGTCATTGGTGACTT 
TGTTTTTTTTTTTTCTGATGATTAGCTTAGCAAGAATTCGCGTTACCTTTTGGAAGCGAAATGTTGTTTGCATTTCGTACTCTCCTTTTGTGGCT 
TTGCTTTCCTCGTTTTATGTGTAAATAGCGAACAAAAGATAATAAATACGAAATATAAAATAAAAAGCGGGGGTGGTGCACCCTGAGCAATTAAG 
CAAAGAAAGGGTGTGGCCAAAATGGAGCAACCAAAAAATAAATTTCCGCATGTAATGCGATATGTTTCTGTCTTTCTGTTTTATATAATTGCCAA 
AACGGATTTCCGCTGCGAGTGGGCGTGGCTCGAGCTGATTGAAATTGAGAGCAGGACGAAAATCAGAAGCAGGGTGCCTTGCTGATCCTCAGCCC 
AGC ACC AAAAACC GGG AGT AC AGC T C AT CT C GG GGT T AAGC T GC T T AGT G AC AG C AGC GT C AGC AGT T C C C GG AAT AGT T T AGAC CT C T TG AT AT 
AAAACCCACAGGAATATAGGCATAGGAATATGCGAGACACTGGGACTCAGGGACACCCGGGGCGGCTTCGTCACGAACTGATGATTGCCAAAAAG 
GGATCTGTACGAGGAACCCAACGAAAATTTACTAATAGAGACAGCCTGTTAAAGCGTGTGGCGTGGCAAATTGACAAGGTGTCACCAATTAGAAA 
ACAAGTTAGGCCAGCCCTAGGACTAAAAACGACAAAAAAAAAAAAGAGAACCCAGTCAAGAGACAGGAATTTCGGTATGCCCCGCTGGCAGGGTT 
CCTACTTCTTTCAGTTTCCAGCAGTTTGGAAGCCACTGACTCAACTCCGAACCCCAAGTGTTAATGGTGCGCTCTAGGTGTTAATTAGGGCCAAG 
GGCACGGCACGGCAAATACATTAAAAATTTGATTGTGGAAAGCGATGAAAGTGCTGCGATAAAAAGCTGTCACCAGGGCAATAATTTCCCTTTTC 
T TT T AAAGT ACT C C AT AAAAT GTC G AG AGC T C AT AAAAAGG AT GT T GG C GGAG AGT C AAAAT AAC T T G AT AAGGT C AAG AGC C TGG CT ATAAAAG 
TCAGCTGCCGTTGACAGACGGACGCAGCGTAATCGGAATATCGAAGCCACGCGACGCTTGCCACGCCCACTATGAACATATATATAGGGACATCA 
GGTAGTTTCAGTTTCAGTTGCAGGATCTCAGGAAACAGGACAAAACAGTAGCTAATAAAACAGCAGGCGAGGGCGGGAGTCGTATGGATTTCCAG 
CTAAAAACGCGCTAGGACAAAAACTGCGCATGCGCGACTTTTGTTTACATTGATATAAGAAAAACTGGCCGTTCTCCTGGGATTTTGCTGGCGCA 
GCAGGCGAACTAAACTCCCAGAAGACAAGCACGTTTGATTGATTTGAATGACTGAGTGAGTGACAACAGTGGTTCAGTGGTTCCAGTGGTGCTGC 
CAACTTGTGCTCATTAGCATAAACACGCTGAAACCGAAACGCAAGTGGAGCAAGTGCAGCGAGCACAGAGCCAACTCTTTGTTCCTCCGACCCGA 
TCCGATCCGATTGTCCGTCCAGGGGAAAGTGGCCCCTTCAAAAGGGTAGTCAACCCATATACAAAATATATACATGCTGCCAAGTGCTCAGATCG 
TATTACAATAACAAACAAGCGCCACATACATATACCGAAGAGGTCCAGGGGAGTGACATTTCATAACATAAATAAAGGGCCAGTCAAAGGGAAAA 
ACTGGATTCGAGCTGGGTTATGCAAGCAATTTGTGTGGTAGCCTTGTTTTCTAGTAGCTAGCTAAGGCAAAAAAGAAGGCGTTGCCAAGAATAGA 
TCAGTGGATCATATTTCCCATGGTAAAAGGAGTCTTAAGTGAGTTGCAAGTACTGCGGCAGTTATTAAATGAAGTGTTATTTTTAATAAAGAAAA 
AATCATATAGTTAGCTCACCTCGATCAACGATTTTGCCGAATAGATTTTTGCCCCTCGAGAAGTTTCTCGTCGAGCAGTTCCAGCGGCGATTTCT 
GAACTGGTGTTGGCACTCGCTAATGGCCAAGTTGGCGCCCTTGACCAGGGCTCCCAGTACACCGGGATTGTCCCTGACCAGGCGTCGCTGTTTCC 
TTCTCAACGTAGAGTGGATCGCTGGGTCCATGTACATGATGGGCGTAATGTTGTTGGGTTCGCCGACCTTGGCAATGCCCCTGAAAGAAAATGAG 
CCAGAAAACCGAAAGTTAATAGAGCAATCTTCGGAAAGAGGTTGTCAAAACTATTAAATGCTTGGCTGAGTTTTTCGCCTAATTATGAGAACAAG 
AATAAAGGGGACGAGGATTTCCACCAGCTGAACGATATTTTTTTTGTCATATTTTCCAGGGGCTTATGTGTCTTTCCACTTGACTTGAGATCGAG 
CTGTTTTTCTTTTGGCAGTGCGTATCTTTTTTTTCGTTTTTTCATTTGCGGCTTTTCTGTGTCTTTTCTATTTCGAATATTTGTTGTTTCTGGGC 
GGTATACGTCAATTGCCGGCGTTACTCAAGCATAAACCACACAGAAAACGCATATATAAATTAGGTACACTCACACGGGGATGGACAAATATTTG 
CAGACACCTGTCGGCTAACGCTTTTCTTCTCTGTGACCTGTCATAATTTATTTTCTTGTTTCAATTGTTGATCTTTATAATATTTTTTCTGCGCT 
TCATTAGCAACAACTCGTATATATACATTTATTTTTTCACATAACAAACAATAAACAAGTATAAACAAATGTTTATTGACAGGCAGCGGAGAATG 
CAAATTAATTGCCAATTCAATTGAATGCTGATTGCGGTGGGAACCGAACGTTGCGCAATGCAACGAAAATTAATAAAAGTACTTAAATAAAATAA 
AAACTTGATCGAAAGTGTAACATCTGTGGGTATCCGCAGGTCAGCGATTGCCAATCGATTTCAATTCCCAGGCGGATTAAATCGATGGCCCGAAC 
TGAGATTTATGCGACGGCAATCCGAAGTCGTAGCCGCAATTTGTGTCGTTGTGTTTTTATGTATTTGTGTTGATTTTTATTGTTCAGGCCGAAAT 
TTATGCCGTTATAAAAAGTTATTAAGGCTTAGCTGAAGTTTTGCACATGATAAAAGTCAAACAAGCAACTCATCAAAGCCGGGGAATCTCTGGGA 
GCTGGGTAATAATATCAATGACAAATATCAGTTAGTCCCAAAATGGTTAAAAAGCCTTAGTAAATAATTTGTATTTAACGAAACAAACCATTTGT 
TGTTTAATTTTAATCGTATTCGTTCACAATACTTTAGATCAATTTACACTTTCAATAAATTTGTAAAATGCATATCAACAATTTTTCCGCATGCC 
AAACGAAAGACTTAGGCAACGAACTTGATTTACTTCATCAATAACTGTTGTTGCATCTCTGCGGTAGATCCCCAAGTCTTGACTTCCTTATCGAG 
TCTTTATTGCTCCCCTGCCATTCACCCAAAACCCCATTCATATCTAAAACTGTGTCACACACACACAAAGTTTATCCCAAGTAAAAGTTGAATGC 
GCGCTTAGAATTTTTAATTGCTTTCATCAGGCGAGAAGATGAACGAAGTTGAGAAGAAGTCGAAGCTTTATTTCAGTTTTAAGTGTGTGTGCATT 
TTGTGGCTTCCGCATTTGGTGCGTGACGCACAAGACCTTTAAACTTTTTTTTTCTTTTGATTTTGTTTGCTTTGTAGGTTTGTTGCTGTGGGAAA 
TGTTTCTCGAAATGTTTCGGCGCGTGCCTTGTTGTTGTTCGTTGACTTCTGCTTTTCAATTTTTCCCCGGCTGGTGTTTGCAGATATTATTTTCG 
CCCGCCCGCTAATTGCCCAATTTAATGGAAAATATTCAATGAACTTACCACCACATGGAGCCCCGGCCCCTTCCGGATTTCTGTTTGCCCTCGAC 
TTGGCTGAGACTGCTGCCGCCGCTGCACAGGGCCATCAGGCAGATGACGAAGATATAGCTGATATCCATTATTGCTGATCGGGTTTATCTGTTCG 
ACGGCACACACACTCTCACACTGACACACGGGGTATGATAGATACTTCCCTTCGAGTTGTTGTTTGTTTGCGAATGGTTTGGTTATATTGAATGT 
ATCGAGCAATTTCGCGTTCAATTAGCGCACTATAAATGAGGCATAATCATCATTAGCATGCGTATGCCGATATTGCTGCTCATATTTGTGCGATT 
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AATTCCGCTGGATCACACGGCAATCATGGGACACAACAAGAACTTGTAAGTTTTCCACGCATATACAAATATATATTGTATCGTAAATTTGGCTG 

TTTCGATAGAATACACTCGGCTCGCTCTAGTTATAGATGGCTAGTTATAGATCGGTTCGATCGCGGCGTTTCGAGCGGAGGAGTGAAGATTGTTG 

CCCGGTTCAGTGCCAACGCTTAACTGAAACACACGGTGCGTTCTTTCGTCCGACTCTGCGTTTGTCAGCAGCGACCCGAGATCCCCGATCCGATC 

GCATCGTCGGTCTTCGTCTTCGGATCGCATACACGACCGCCCGATGGTGTGTGTGTGCGAGTGGGGTGGAGCTATACCCTGACTCCTGCATTGCA 

GCTGCGTGCGCCAGAAGAGCACCGCTATATGGCGCGGTGATCGTTCGTTCTCCCTGTCTCTTCGTAAACGTTTCTTGTGTGTTGC^ 

GTGCGTGCGTGTGTGTGTGTGCGGTATATAGCCAGCATGGCCCAATAAGAAATGGCGGAAATGGAAAAACTCTGCCCGGCATTTTCGGTGTAATC 

GATTGTATTGCCCGATTATTTGCGCAACCTCATGAATTTCGTCATAGCTTTGACTACATTTAATTATACATATTTGTCGCAATATTTTCGCCTAT 

TTCTGTAAGATATTTATGGGATTGGTTTTTTATGGTGATTTTGTTACGCTTACTGTTCGGCTAATAAAGGCTTTGAACTATTATTTTCGACTGTT 

GAATTTGGGTTTTATCTTTAGCTTATGGGTGAAAAAGTGGTATTACTACTTTTAACTTCTAGCTTTTGGCTATGAAAGTATTCTTAAATCGATAG 

CAACTAAATTTAGCTACTCTATAAATATAAATATTTAGAAACAATTAACAAGAAGTTTCTTTAATATGCCTGTTTCCTTAAACCTTTCCATACAT 

TTGCCGTGTCAGAACAAGTTTTCTGCTGCTTTTCGGAGTAAATTTCGGAGTAGTCTTATATACTGCACTTGGATGCAGCCAGCTCCCTCACTCTC 

AGGGGATTGTTACCACACACTTTGGCATTCCGAACCGAGTTTATGTCTTACCTTTTCCTCATCCATTGCTTGGCCGCTTCCACTAATTGGACAGT 

T T GTG C AG C C C AAG AAG AGC T GGT AT C C C ACCC C AC C T T T AGG GGCT AT CT T C C C C AC C T A 
(SEQ ID NO: 526) 

Exon: 10081.. 9549 
Exon: 7870.. 7620 
Exon: 4418.. 4117 
Exon: 2938. .2508 
Exon: 2376.. 1001 

Start ATG: 9664 (Reverse strand: CAT) 
Transcript No. : CT15493 

ACTGAACCGGGCAACAATCTTCACTCCTCCGCTCGAAACGCCGCGATCGAACCGATCTATAACTAGCCATCTATAACTAGAGCGAGCCGAGTGTA 
TTCTATCGAAACAGCCAAATTTACGATACAATATATATTTGTATATGCGTGGAAAACTTACAAGTTCTTGTTGTGTCCCATGATTGCCGTGTGAT 
CCAGCGGAATTAATCGCACAAATATGAGCAGCAATATCGGCATACGCATGCTAATGATGATTATGCCTCATTTATAGTGCGCTAATTGAACGCGA 

AAT T GC T C G AT ACAT TC AAT AT AAC C AAA CC AT T C GC AAAC AAAC AAC AACT C G AAG GG AAGT AT C T ATC AT AC C C C G T GT GT C AGT GT G AG AGT 
GTGTGTGCCGTCGAACAGATAAACCCGATCAGCAATAATGGATATCAGCTATATCTTCGTCATCTGCCTGATGGCCCTGTGCAGCGGCGGCAGCA 
GTCTCAGCCAAGTCGAGGGCAAACAGAAATCCGGAAGGGGCCGGGGCTCCATGTGGTGGGGCATTGCCAAGGTCGGCGAACCCAACAACATTACG 
CCCATCATGTACATGGACCCAGCGATCCACTCTACGTTGAGAAGGAAACAGCGACGCCTGGTCAGGGACAATCCCGGTGTACTGGGAGCCCTGGT 
CAAGGGCGCCAACTTGGCCATTAGCGAGTGCCAACACCAGTTCAGAAATCGCCGCTGGAACTGCTCGACGAGAAACTTCTCGAGGGGCAAAAATC 
TATTCGGCAAAATCGTTGATCGAGGCTGCCGAGAGACGAGCTTCATTTACGCAATCACCAGCGCGGCGGTGACCCACTCGATTGCCAGGGCCTGC 
AGTGAAGGAACGATAGAGTCCTGCACCTGCGACTACAGCCACCAGTCGAGATCTCCACAAGCGAACCACCAGGCGGGCAGTGTGGCCGGCGTGCG 
GGATTGGGAGTGGGGCGGCTGCTCCGACAACATCGGATTCGGGTTCAAGTTCTCCCGGGAATTCGTCGATACCGGCGAGAGGGGTCGCAATCTGC 
GCGAGAAGATGAATCTGCACAACAACGAGGCGGGTCGAGCGCACGTCCAAGCGGAGATGCGACAGGAGTGCAAATGCCATGGCATGTCCGGATCG 
TGTACAGTGAAGACCTGCTGGATGCGACTGGCCAACTTCCGTGTGATTGGCGACAATCTGAAGGCCCGCTTCGATGGAGCCACCCGCGTGCAAGT 
GACCAACAGTCTCCGGGCCACCAACGCTCTGGCCCCAGTTAGTCCGAATGCAGCCGGCTCGAATTCCGTGGGCTCCAACGGCCTGATTATTCCGC 
AGTCTGGTCTGGTCTACGGCGAGGAGGAGGAGCGTATGCTGAACGACCATATGCCGGACATCCTGCTAGAGAACAGCCACCCGATCAGCAAGATC 
CATCACCCGAACATGCCGTCGCCCAACAGTTTGCCCCAGGCTGGTCAAAGGGGCGGACGAAATGGACGTCGTCAGGGACGCAAGCATAATAGATA 
TCACTTCCAACTGAACCCGCACAATCCCGAGCACAAGCCACCCGGCTCGAAGGACCTTGTCTATCTGGAGCCTTCGCCGAGCTTCTGCGAGAAGA 
ACCTGCGGCAGGGCATCCTGGGAACCCATGGCCGCCAGTGCAATGAGACCTCGCTGGGCGTCGACGGCTGTGGGCTGATGTGCTGTGGGCGTGGC 
TATCGGCGAGACGAGGTCGTCGTTGTGGAGCGGTGCGCCTGCACCTTCCACTGGTGCTGCGAGGTGAAGTGCAAGCTGTGTCGGACCAAAAAGGT 
CATCTACACGTGTCTGTAAGGACACTGCCCGCCTCCAGCCCAGTCCCGTCCTCTGAAGCCGCCCTCTTCGTTCTTTGTATGTGTTATGTATTTTG 
TCTACGCTTAGCCTTTGTTATTAGCACCTTAAGAACCTAGACTATAGCCTAGTTAAAGATACGACTAACCGCCCCTGAAGTGTACCATGCAAAAC 
CCTTTAAGCTCTCACTTTTTTTTTAGCAGTATTACACAAGGAATATTAACGAATAGATTTCAAGAAGAAGTGAGGAATACAACATGTGTTTGCCC 
AGTTTTTAGTCATGTAAGCCAGCTCCCAATCGAAGCAAAATATTTTGTAAATTACAAAGCCCCCATTTTTTGTACCCCTAGATAATAGTAATCGA 
TTTAAAATACTTTAGTGTAGCTTAAAGTCTTCGAAGCGCGCCCCCTTTCTGCTGAATCCCTCCTCATGCTGTCGTTTAAAATCTTTTTAAATCAT 
TGAAAACAAGACTGAAAATTCAAAAAGCGCGCAGCTCCTTCCTTCTCTCTAGCGAGCACTGCTTGAACTGCTTGCATACTGCTTTGGGCAGGACC 

aaaacgtatgcgaagtggacagtataatagttcgagaaaatcggtaaccagggctggaaaacatttggaatgcaacagtaaaaattccactattg 

TTTTTCCATTTCCATGGTCATACAAAACAACAAATTCTTTTTTGTACATAAATAAGCTTAAATTAACTTTTAGTTTTAGATTAAATTCATACCAA 
CCCCACGAAGTACAGAAAAAGATTTTAGTTGCTTGGAAT^ 

CCCCATCCATGTAGGTATTACTAACTAATTAAATGCACTTAGCTGGCTAGTTAAACTAAGTATATAAATAGCTCGATATGAATATAATATTTAAT 
TAATTTATACTAGCTCATAGACATTCAATTTTGTAAAGTATTAATGCGAAGGAAAACAAGAAACAAAAAAGCGAGCAGAAAAATGCGAATGCAAT 

CGAAAACAAATCAATAAAAAAAAAACACAAAACGACAATGAGT 
(SEQ ID NO: 527) 

Start ATG: 418 (Reverse strand: CAT) 

MDISYIFVICLMALCSGGSSLSQVEGKQKSGRGRGSMWWGIAKVGEPNNITPIMYMDPAIHSTLRRKQRRLVRDNPGVLGALVKGANLAISECQH 
QFRNRRWNCSTRNFSRGKNLFGKIVDRGCRETSFIYAITSAAVTHSIARACSEGTIESCTCDYSHQSRSPQANHQAGSVAGVRDWEWGGCSDNIG 
FGFKFSREFVDTGERGRNLREKMNLHNNEAGRAHVQAEMRQECKCHGMSGSCTVKTCWMRLANFRVIGDNLKARFDGATRVQVTNSLRATNALAP 
VSPNAAGSNSVGSNGLIIPQSGLVYGEEEERMLNDHMPDILLENSHPISKIHHPNMPSPNSLPQAGQRGGRNGRRQGRKHNRYHFQLNPHMPEHK 
PPGSKDLVYLEPSPSFCEKNLRQGILGTHGRQCNETSLGVDGCGLMCCGRGYRRDEVVVVERCACTFHWCCEVKCKLCRTKKVIYTCL* 

(SEQ ID NO: 528) 
Name: wingless 

Classification: signal_transduction 
Gene Symbol: wg 
FlyBase ID: FBgn0OO4O09 
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Celera Sequence No. : 142000013384554 

TTATTAATTTTGTTTTTACCTTTGACATTCGTTGGCTTAAATCCAAAGCGTTTTAACCCTCCGCTTTTAGCCAAGTTGAAGGTTTCGTTGAAGAT 

TAAAAACAAACTTGCTTGGAGTCCAACCATTTGTTATTCCCTTACTGCGAACATAACAAAAACTATTCATATTTCGATATGATGGACTAAAATAT 

TCAGATTAAAAGCGACTAACAATGGTGAAATGGTCGATACTATGCCCTGATATTTAATCAGGGTCTATTGTATTTTTGGTTCTACACTGTACTTT 

TAATTTGCGCAAATAATTGATTATTCAGCAATTTTCTTACCAATTTGAATGTTTGAATGTTTTGAATGTTCTTTGAATGTTTTACGTAGGCCTTT 

ATGTAAACTGCCTGCTTATTCAATGTCATTTACAAACAGCCGCGTTTGAAGCTAAAATTACTTCGTTAGTCAATCTTCTGTTTATGTTTTTTTAA 

AGCTAAGCCATATTTTTTAAAGTGCTTGAATTACTTCTGGGTGGGGGATAATCAAAATAGATATCGAGATCAGCTTGGTAACAACAAATGTTTAT 

AAATAAGTCGAGCTACTTCCAGTTTATGTTGGAGTAGAATGTACATATATCTGCCACGTGCTCGGATCGATTTACAAAAAAACGTTGACCAATGT 

GCAAGGCCAAGAAACTCCAAGAATCGGCCACTTGTTGCTCTTTTTGTTCTCTGCTCCCCCGTCTACCACGGTTTCTCTCCTCTTCTCTCTTTCTC 

GGTATTCTCTCTGTACATTTCACAAGAGCCGCCGGAGTTTAAGTCTTCAAGTGGCGCTCCATTCGCTTCATGTAGCGATGCCGGCCGGTACAAAA 

AAT AAGC AT AAAAT GG AAT AAT T AT AAGGGG AAG C AG AAC AAGC AAAG AGT GT AC AC C G AAAG AAAATG C AT AAATTC T GAAC TTT AAAG AAC CC 

TTTTTTTTAATGTCATATATAGCACGCCAAAAAGTATGCCACACTCGATTTTATCCCGGTACTTTAGTCACTTTAAGCATGAGTTTTAGTGGAAC 

TTTTATTACAGTGTAAGCAGAGGCGAGTTGTAGAGGAAAGTAAAAGGTTTCTGCCGACAAAAAAGAGCACGGAGCATGGGGAGAAAAGGGAAAGA 

GCTCTGCAGTCTGGAAATCGTATGACGATGTGGAATGTTGGGACATGGAAATGGAAGTGCAGTTTACTGGCATTGCCAGACGCTTTGGCTTTCGG 

CT GGC AAC AAAAT T C AC AT AAAT AAAAC T TG AGC C G AGT C AGT C GG AGT C GG AGC C T T AGCT AAAAT C GG AGT C G GAC T GG AAAT C G GGG TAT GC 

ATACGTATGTACACATGGCTGGTGGCATGATCGCCAGAAATGAGAGAGTTGGAGTTGGAGCCACCAACTATAAACACCGCATCCCCTTACACTCG 

GAGAAATCGAAGAAGATCCGACAAAGTCGAACCATGATTACGAGCCTTAAGTGTTTACAACTCTCAATTAAGGCTGCAAAGTATATTGTGATTTA 

AATTATTTTTCAGTTTAAATTTTGATTATTCTGCACTTTAGTTTTGAATTTTATATTCATAGCTTTTTAATAGATATTTAAAATACGGCAACTCT 

TACAAAAATGATTAGTTTTGATATGATGGAAAGCTTAATTTGTGACGTGATGTTAATAGTTGACTAATTCTAATAATTCGCCGGCAATATGTATG 

AATGTTTAAGCCTTTCATGAATCATAAAAATAAATACAAGTTGATAAAAAGAGTGACTAAAAGATTTATAACTACAAATACCTCTAGCATTTAGA 

ACTTCTAGGAATAAACCATTTTTATGGGCTTGTAAAAACTTACATATGTCCAAAGGAGTGCCGATTTTTCCTGACACATTCAAGAACTTTAAAGG 

ATTTCCGTTTAGCACCGAATTAATGGAACTGCCAAAGTGTCCCATCCCGATTCCCATGCCAGATCCTGTACCAGGACTAATGCTGTTGGCTGCCG 

CAGGAGCCAGATGCTTCAGCTGATCGAGGGTGGGTCCCAGAGTACTGGCTTTCAGACGCTCTAGCAGCGCCGATCCGCTCAGATTGTGCATTATC 

TCGCTGGCAGTGGGCGTGGCAGCGGTCACATGGGCGGCATTTACCGCCGCCGCATTGCAGAATTTGCTGATGCTCAGTATGCAGGTGATGAAGAG 

CAGGTGGAACGATGAGGTGAAGCTGAAGCTGATTAGCTGCTGCACTTTGCTCTGGTAATGTGTGCTGGCGCTGTAGAAGAAGCCACTGGTTTTGG 

ACGGTTCTCTACAGCCACTGCTGCTGCTGCTGCTTCTGCTGTTGGCACTTTCATTTGTTGTTAACTCTGACTGGGTGGTGGTTTGAAAATATTCC 

GTGAAATGCGGCATGCAGGTTGTGGCCATCTTTTGCAGCTGCTAGAATATATATACGGTAGAAATTGCACTTTAGGCTCAGTGGGTCACACACGC 

T T T T C AC T CT GGT TCG ATT T GC C AAC TT ATT T AT AC AC GT C CAT T AAAT AAT AAG TTT AGG C G AAT AC AC GC AACGT T TGCC AAAAAAAC G T T C A 

CAAAAGTGAGAGAATGAATCGACTGTATCGGGATGAGAGTGTAAGGCATAGGTACGATAAACATGCGGAAAAGGCGATGAAGGGAAAAAGAAACA 

TACGATGTTGCGAAAAAACAGCATAAGAGCACAGAAAACACGAAATACACACATAAAACTGCAAGCTAAGGGCGACGCCTTTGGTCAAAGCGAGG 

GTTAGCCCCTTATGCTGTCCCTCTCACTCATTTTAAAACATTAAAATAATTTTTTATTTGGATATATCTGCTATCAGCGGTATAACGGTTGATAA 

GCCAGGTAAGCCTGGTAAGTCATGCGCAGCCACTACCCATTGACACCATTATTCCAGCTCGGAATGGGTTCTGGTTGTTCAAATCTTGTATTTAC 

TTTATTGAATGGCAAAACACATGTTGGACTAAAATTTATGCTGTTTTTTCACATGACATATTGACGTAACGTTCGGCCAAAATAGATTCATACAT 

TTTTTTTTAATTAACACTTATTTCCGCTCGACAAATGCGATGAGCTGGCCTTAAAGCCCCGGCTTGAACTTTTTTTTATTTGCTTTGAACTTTTG 

CTTTCCGAAACTTTTTTCGCATTCCGAAATTTCTCGGTTTAACGGTGAAGTTATATCCAAAGCATGGGGCGTATGAGCGATAAGCGACCAGCGAC 

GACCTAGCGACACGCACTGCGAGTACAAACTGCCGGAATATCGCTTCCGAAGTGAGCGGACCAATGGAACCGTCTTAGAAACTTCGGCAGTGCTC 

GTCTTTCGTTCGGCAACGAACCGACGATCGCCACGCGTACCATATCACACCTGCGAAGAGAAAAGAGAGTGTCGACGTCATCATCGGGGGGAGCC 

CCAAAGCGTTGAAAATCAAAAGCCTCGATCTCCAACACAAAGAGAAGACAAAGCTACCAGTGCGCAAATAAGCTGACATTGAATCAGAGCTAACA 

GGTGGTTCTCTCTTCGGCCGCTCTCTTAGAACGTGTATCAATCTATTTTGGAGGTGTGTTCCACTATTTGTCGTTTCGGCATCCAATTAGTGAGG 

TAGCAATTTCT AAAAT TACAGGTCT AAAT TGTTTGTTGGCCCGGGTCAATCAGTCAAGGCTTCGATTGAAACT AGT AATGGGCTTTGT AGC AT AG 

ACACACGTAAATCACAATTGGTCAATAAATGTTAAACCATTTACTGTAGCGGAAATTGATCTATTGACAGCACCTGTACCTCTGCATTTCAATTC 

ATAACATGGTTTCCATTTGAATAACAAAGATGCTTGAGAAACACGATGGGCTTTTAGCAAGTCTGGATATCATCAAAGTGTAAAATCATCAAGCT 

TTGATGAATTATTCCCCTTCGGTACATTTCCAATTGAGATTTCTTCAAGGAACAGAAATATATGTGCATTTTTAACAGCTGTGCATCTGAAGATG 

AAACGAAATGCTACAAATTCAGCAAAAAATTGTAAATTGGACAATGTTGGAAATAAAAACTGGAAAAATAGCTGGCAACTTACCACTCACACTTT 

G AAAT C AAT C AAAT T AAAAT C C CAT C C AC AAAGT CAT GTAATT AT AT CAC AGG AGT AC GC T AC AC AT AAG T ATG T T C TGAGT T TA T T AT T C A T CA 

TATTTTCTTTTTAGTGGGGAAACATATACATATGTATGTACACTTAACAAATACAGT 

(SEQ ID NO: 529) 

Exon: 3237,. 3039 
Exon: 2404.. 1849 
Exon: 1020.. 1001 

Start ATG: 2404 {Reverse strand: CAT) 
Transcript No. : CT15529 

CTAGGTCGTCGCTGGTCGCTTATCGCTCATACGCCCCATGCTTTGGATATAACTTCACCGTTAAACCGAGAAATTTCGGAATGCGAAAAAAGTTT 
CGGAAAGCAAAAGTTCAAAGCAAATAAAAAAAAGTTCAAGCCGGGGCTTTAAGGCCAGCTCATCGCATTTGTCGAGCGGAAATAAGTGTTAATTA 
AAAAAAAATATGGCCACAACCTGCATGCCGCATTTCACGGAATATTTTCAAACCACCACCCAGTCAGAGTTAACAACAAATGAAAGTGCCAACAG 
CAGAAGCAGCAGCAGCAGCAGTGGCTGTAGAGAACCGTCCAAAACCAGTGGCTTCTTCTACAGCGCCAGCACACATTACCAGAGCAAAGTGCAGC 
AGCTAATCAGCTTCAGCTTCACCTCATCGTTCCACCTGCTCTTCATCACCTGCATACTGAGCATCAGCAAATTCTGCAATGCGGCGGCGGTAAAT 
GCCGCCCATGTGACCGCTGCCACGCCCACTGCCAGCGAGATAATGCACAATCTGAGCGGATCGGCGCTGCTAGAGCGTCTGAAAGCCAGTACTCT 
GGGACCCACCCTCGATCAGCTGAAGCATCTGGCTCCTGCGGCAGCCAACAGCATTAGTCCTGGTACAGGATCTGGCATGGGAATCGGGATGGGAC 
ACTTTGGCAGTTCCATTAATTCGGTGCTAAACGGAAATCCTTTAAAGTTCTTGAATGTGTCAGGAAAAATCGGCACTCCTTTGGACATATTGACT 
AAAGTACCGGGATAA 
(SEQ ID NO: 530) 

Start ATG: 200 (Reverse strand: CAT) 

MATTCMPHFTEYFQTTTQSELTTNESANSRSSSSSSGCREPSKTSGFFYSASTHYQSKVQQLISFSFTSSFHLLFITCILSISKFCNAAAVNAAH 
VTAATPTASEIMHNLSGSALLERLKASTLGPTLDQLKHLAPAAANSISPGTGSGMGIGMGHFGSSINSVLNGNPLKFLNVSGKIGTPLDILTKVP 
G* 

(SEQ ID NO: 531) 
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Celera Sequence No. : 142000013384153 

CATCAGTACGCACCAAGAATTCGCAATGATTAACAAAACGGTAGACACCGAGATCTACAATCACAAATATGATAACATAGTTACAGTACACAACA 
CTTTCTTCGTAAAAAAACCATTTGTTGTAGTTGCTATCGAATAGTCGAAGTGAGAACTGACTGCTAAATGGCTGAAATAATTGTGACAAATACTA 
CCCAGTCTTCAAATTGAAACGCGGAAATGGAACAAAATGTGATGTTGCACTTTTACTTTGCTCTTAAAAAAGTTTTTTACTGTAAATAAAATACA 
TGTACTGGAATTTTTGTGTAAAATCGACATCGAATTAATTCGATTCCATCAAACACAGCTGACGCAAACATCTGTTAATCTCTTATACAAAGTAA 
AGGAGAGTTGTATACCTTTAAAAAACTTTAAAACATTTTTTTTATTATGGTATCTTTTTCTAGCTTTTTAGTAGCACACAAGGACGAGGTCCGTG 
ATATAAAAATTTTGTTAAAGAGAAGAAGAAGCATTAAATAGTTAAGTGGAAGATAATATATAAAAAAAGTATCCCTCTGTATCCCTCTTTCATCA 
AACCTCTTCACGCAACGACGACAATTGTGGTGAACATTTCGTTGCAACCCATCACAAGCAATAAATCAACTGATGACAAGTATACACACATGTTG 
ATCCGCAGGCCTGTGCAAAAGTTCGTCAAACATCGATGAGGTGCCACGTGGAACTTACATTTTGCTACGTTGAGATCAAACTTTTTTTTTATTTC 
TTTCCGGGCACTAATAAGTGTTTCAATTTATGGACGTTTTGTAAAAATTGCAATAAATTTGTTTATTTACTCGCACATCAAAAAGCAGTTCGTAT 
GAGAATGAGAACGTGTTAAAAATTTGGTTGAAATAAAAATGTCGTGTTTCATTTCCATCGTTTCAGTAGCAATAAAAAACGAATCCGATGGCGTA 
GATATCGATATACCTGCATTCCCTTTCAAACGTGTTATCGTATTGTTCAGCCAATTGTGCAACGTGTTTGAGTAAATCTGCGGGCGAAAGACTAA 
AAATGAAAGTAGAGGCATTGGACAAATTCTGGGCTGACAAGAGCCGCTACGATCTTGCCGAGAAGCGATTCTACGAGGGTCCCCAGAAGGTAAAA 
TCTCTAAGCAATCACAGCCATAAGTACTTAAGTTGTTAGCCAGTTTCACGTGGCCGGTAAATCAGTTCAAATGGTTTCGGCAATGTTTATTGCAA 
ATCTGTTTGTTTTAACAGCAAGGACGCCGAAACACACATGTTCTAAACTTCAATACATACGTAGAATGATCTATTAAAAGATTTCTATATTTATC 
TGCCTTTGATTTTCAATTGCTGTGTTCATATTGCACATTACATTGGTATATTCTTGTAGCTGTATGTAACAGGTGAACGTTTAACACTTTTAAAA 
AATATAACTATGTGCGTAAAGTGTATACAAAACAGAGTGTGAGAGAAAACTAACTCTGGCAAGCCAAATTTTGAATACACTTTCCGTTTGGCAAT 
TGGTATAAATACAAAATATAGTATGCAAAACATAAGTACTTAACATGACACCTACACAACAAAATTCGTTTATTCAATGGAAATGTGCCAAAAAC 

AAAT C G AT GT AT AC AAAG T T AAAAC C T AT T G AAATT C AGG GAAAT ACT AT AGT T AAGT AAAGGTT T AAAC AC GT T GT AT T C TAT C T AC AC AAAT A 
TACTACACAAAATGCATAGCACTGCATTGTATAATATTTAAACTACGTACAATTTTTGGGCTGATTAGAGTGAGGCTTATCAGTATTTCAGCATG 

CT T G C AT GAC AAC AG AC T T GAAGT AGAT GC C C GT AC AT AAAAC AT C T CAT T T T G C T T AAAAAT T ATT AT CC C T AG AAT C CT T T GT C AAC C AT T CG 
ATACACCAAATGTCAGTTCACACAAGTAATCAATGGTGTTGTCTTATTTTTAAAGGTAACTGATCGCAGCCACTACAGCCCATTAGTAAGCGAAA 
TCGCAAAGGCTAGGGAACACATACAGAACTCGCTGGAAAAGGTAACTATTGTGATAAATCTTTGTATTTCAACATTTGATTTGTTCACATTCCTA 

GGTTTCTTTCGGTTTCT T T AAAAC T T TGT AGAG AAT T GT AT AAT TAT T AT T T AC AAT AT AT T T AG TC T GT ACT TAT GAT C AAC AT AT C C C AAT G A 
GGACTTGAATTTGAATT ATT AATCAAGCCTTTTGTGTGTGTTCCAGATCGATGGCGTT ACT CTGGACGATGGCTT AAAT TCCGAGCTGGCCAAGC 
GTCTGGCCCAATTGGAGGGTGAACACAAGGAGTTGAAGACCCAGGTTTCCTTGTTGAACGAACTTTTGACTGCCACAGTGAAGCGCTTGGAAACT 
C AGC T GAAAT T G AC G AAT GG AGT GT CGAAAG AGCC C G AGGTC G AAGC G AAAAAACC AGAAGC G AAC GACG AC GAT GAC G AT G T GG AT C T AT T T GG 
ATC GG AC AGC G AGG AGG AGG AT G GAG AGGCC G CC C GC AT AAG GG AGG AAAGG C T G GC AGC CT AT GC T GCC AAG AAGGC C AAG AAAGT T C AAAT T A 
TTGCCAAGTCGAACATCATTCTGGATGTTAAGCCGTGGGACGATGAGACCGATCTGAAGGTTATGGAGACGGAAATCCGCAAAATCACCCAGGAC 
GGTCTACTGTGGGGAGCTTCC AAATT CGTGCCAGTTGCCTTTGGC ATT CAGAAACTGAGCATTTCATGCGTGGTTGAGGATGATAAGGTTTCCAT 
CGATTGGCTGACTGAGGAGATCGAGAAACTGGAAGACTTTGTCCAAAGTGTTGACATCGCTGCGTTCAACAAAATCTAAGGGGTAACATATAATT 
TTTCAATTCACGATCGGATTTGCCGCATTAC AAAT GTGTTTGGGAGTCGTCT AAAT AAAT ATT TTGATAAAAACCAACTTAAAAAACTGCATTT A 
AACTGGGAGTAATTGGTAAATAGTTTACTTTAGGATATTTTTTATGCAAACGATATGTTTTGAATTCGCTAAAAGGCCATATTATTTCAAAATTT 
ATAAGAAAAGAAGATATATATTAGTTTAAGTCATGTATTCGCCATTCTTTGATTAATTAAATTCATTTACGCACGAGTATAAAAACAATGCAAAT 
TACCTATTAAATAAATATTCCAGGTCACAGCTAAATTATGCACATATTAATAATGATTAAACTGAGGGACTAGTTTCATAAGGCTCTACAAATTA 
TAAATAGTTAATCATAAGTTATTTAACCATCCTAAAGCATAACTTCGTTTTCGGGTCTCGTCGACGTTTGGCGCTTAAACTCACAGCTGCCAGGA 
GAATGTCGAAGAGTGGTCATCGCGGTGCTGGTTGGAGCATAGGATTTGGTCACGCCAGCGCGGGTTCTAGTCGATGCAGTGCTCGGATTTGATTG 

C AG AC C T T AAAG AT T AAAT AAAT AC AAAT T AC AT G TAG TAT AT T C AACT AAC T G C C AGT T AAT T G AT GC C AC T G AAACT T AC C AAAC T T AAAC AT 
AATTTCGTTATAAATTCTGCGCTTAAACACAAATATAATGAAGACAAAGATTCCAAGTAGCACATTAAACGAATCGCTGATGGTCCAAAAGAACG 
TTTTCTTGTGATCATAAAACAAAATGCTAACTATTTCCAACAGCCATGTGATGCCCATTACGAAACACAGTTTCGTGTTCATAACGAATCTGGTT 
TTGTCCTGAAAGAATCGCCTTTTCAAAACTGGCTTGTCGCTGTTCAAGCTCTGCATTTTGTATATTTCATTTTTTACTTTGTTGCAATACTTCAT 
T GT G AGC AC G AAT AG AAC T AAG T T TAT AAT G AAAAGT AT GC C AAC T GGAC C CGAAAAG AAC AAT AAGC T GGC T G AAC C AAAAG AAT CAT C T GG AA 
TTAATACAAAAATATACAAATACTTTAATATTTAAATCGTAATTTAATTAGATTTTTAACTTACA 

(SEQ ID NO: 532) 

Exon: 1001.. 1134 

Exon: 1956.. 2036 

Exon: 2232.. 2960 

Start ATG: 1048 

Transcript No * CT15788 

CCAATTGTGCAACGTGTTTGAGTAAATCTGCGGGCGAAAGACTAAAAATGAAAGTAGAGGCATTGGACAAATTCTGGGCTGACAAGAGCCGCTAC 
GATCTTGCCGAGAAGCGATTCTACGAGGGTCCCCAGAAGGTAACTGATCGCAGCCACTACAGCCCATTAGTAAGCGAAATCGCAAAGGCTAGGGA 
ACACATACAGAACTCGCTGGAAAAGATCGATGGCGTTACTCTGGACGATGGCTTAAATTCCGAGCTGGCCAAGCGTCTGGCCCAATTGGAGGGTG 
AACACAAGGAGTTGAAGACCCAGGTTTCCTTGTTGAACGAACTTTTGACTGCCACAGTGAAGCGCTTGGAAACTCAGCTGAAATTGACGAATGGA 
GTGTCGAAAGAGCCCGAGGTCGAAGCGAAAAAACCAGAAGCGAACGACGACGATGACGATGTGGATCTATTTGGATCGGACAGCGAGGAGGAGGA 
TGGAGAGGCCGCCCGCATAAGGGAGGAAAGGCTGGCAGCCTATGCTGCCAAGAAGGCCAAGAAAGTTCAAATTATTGCCAAGTCGAACATCATTC 
TGGATGTTAAGCCGTGGGACGATGAGACCGATCTGAAGGTTATGGAGACGGAAATCCGCAAAATCACCCAGGACGGTCTACTGTGGGGAGCTTCC 
AAATTCGTGCCAGTTGCCTTTGGCATTCAGAAACTGAGCATTTCATGCGTGGTTGAGGATGATAAGGTTTCCATCGATTGGCTGACTGAGGAGAT 
CGAGAAACTGGAAGACTTTGTCCAAAGTGTTGACATCGCTGCGTTCAACAAAATCTAAGGGGTAACATATAATTTTTCAATTCACGATCGGATTT 
GCCGCATTACAAATGTGTTTGGGAGTCGTCTAAATAAATATTTTGATAAAAACCAACTTAAAAAACTGCATTTAAACTGGGAGTAATTG 

{SEQ ID NO: 533) 
Start ATG: 48 

MKVEALDKFWADKSRYDLAEKRFYEGPQKVTDRSHYSPLVSEIAKAREHIQNSLEKIDGVTLDDGLNSELAKRLAQLEGEHKELKTQVSLLNELL 
TATVKRLETQLKLTNGVSKEPEVEAKKPEANDDDDDVDLFGSDSEEEDGEAARIREERLAAYAAKKAKKVQIIAKSNIILDVKPWDDETDLKVME 
TEIRKITQDGLLWGASKFVPVAFGIQKLSISCVVEDDKVSIDWLTEEIEKLEDFVQSVDIAAFNKI* 
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(SEQ ID NO: 534} 

Classification: translation factor 



Celera Sequence No. : 142000013384153 

GGTGAGGTAAGCAATGGATCCTTTAATCGTCTAAGTACTCTACGCACCATAAATCACCTGGAAAATGTCAACGATACGCAAGTATTTATCTTTTC 
AATTAACTTAAGTCTAGTTATTAGCTGACTTTTTTCCATATATTCTTTCATGACACAAACTAAAATCGCAAACATAAATGTCATTTTTATAATAT 
ACAAATCTCCTCTATCGCAAGTACTATTTATTTATTCTAATACAAATATATTAATTTTGTCATTCTCCAATCGGAATAAGTATAAATAAAGTATA 
AATCCATTGGTTAAGAAAAGAAAAATCAAAAAAATAATTTTTATTCATTTTAATGTTATTAGTTAGTTTCTAATGTGATCTTTCATTAAATATTT 
AAT AC TT AT AAAT T AT AAAT G AAAAG AT AT T C AT AAAAAT C GC AG AC AAT C AAAT T AAAG C TT AAC T C AGAT G T T C T TC T T TC A T AC AC G T T C GA 
AATCTTCCTGTCGTTCCTAAGTCCCGTTCACACAGGCAGCTGCTTCATCTGAAATGCAGTTCATATATTGTTTTTGTTATAATTAAACGGTGATT 
TTAACCCGGCCGGAGTGGAGTTGCTCATTGGAAATGCTTTTCT AAAT TAATT TAAATT TAAAATAACGGC AAC AGCAGCTAACACAACAAAAATG 
TGAAATATTTGGGTATACATATACACCTACAGGATTTATAAGAGGAAAATTTAAATGCCGATTTTTTTTAAATAAAATAATTCTTGCCTAAATAT 
GTGGACTAATTTTAAAACGTTCATTTGTTTTTGTTAAGCCATGTATAAATTTCTACAATATTATTGTTCGTCATTTTGAACTTGCAAAATTTGTT 
TCATTCCAACGTTCCAAATCAGGGCAATACATATTTATTATCTACACCGAGTGAAAATATTTTCCGAGTCATCAGAAATTTGTATTATTGATAGC 
ATGTTGTAAGGCGACTCGACTATTTTGAAGCTAAGTGAAATTTTATATTGAAACGCAAAGTTGAATACGTGAGCAGTATATGAAAATCGGTGAAT 
ACAGTGACTTCATTAGTCAGCCGAAAGCAAAAGAGAGCGAAATACATTTTTTAAACAAACTAAACTAAAATTTCACTGTGAATAATGATGACTGA 
AAAGTTAAATTCTGGGCACACTAATTTAACAAGCAAGGGGTTTGTATATTTTATATTTATTTATATTTTTTTTTTCACAAGAGGACCCAAGCTTG 
GTGTCCCAACAAACATAATATGCATGCGTATTCATCAGCCCATATTTATGATTAGATGCCGAAAAGACAAGCCGTTTTATAGTTTAGAACGTCAA 
TTTTTGCAAATGATTGAGCATGTAAAAGCCGAAGATCTGTATTTTAGCAACATCATGTAGTTAGGCTGTTTTCACTCAACAACTAAACCTGCAGA 
ACATGTTATCTAATGAAAAAAACCAAACAATGTGATTCCCGTATTTATATTACTTTATTGTATGTAGTAGCGGAACATGTGTACATATGTATAAA 
AGAGTTATCTACATATGTAGTTGCACTCGTGGACTTATTTACCGCTAAAAATGTTGCTAAATATAATTGCACTGACTTAAATCATTTAGAAACTA 
ACACTGTATTTAGGTAAAACTTCTATTGGGTGATTCTCAACTTTTGAGATCAGAACAAAACAGGTATTTTTTTTTTGCATTTCAGCGCCATGAAC 
AGTTATTCGTGGGCTGCCATTTTAAAATTTGGAGTAGCGTTTCTTTTTCTCTCCTCCATAAATGTGCGAAAGAGAAAAAATGTTGAGTTCAATCG 
CTTTTAGATTGTTCAGTTCTTAAATATTCAAGACATTATCCATCTTTGTTTACGATTTAAAATCAATATTAACATTATTATTAATATACATTAAT 
ACATGGCTTAGACTTTAGCTTAAAATGCACTTTACATAAAAGAACTCATTTACAAAATTTTAAAAATCAAAAATTTTTAATATATATATATGCTT 
AT AT AT GC CAT CC AT AT AT AT AT GG AT GT GAAT G ACC T AC AT AT AT C GT AAT ACT TT AAT T AC C ATT T T ATTTTT T T T C T T T TC ATCC AGGAT T A 
T AAAT GAT CT T C AAAT T GCT GG AAAC AC AAGT G AC GAT AT GGG CTG GAAATC AAAGC T G AAGC T GC C G C C AAAG GAC AACC G AT TC AAAACAACT 
GT GAGTT T GC C G AC AAAT C AAT C GG AAT T T AT AC AAT T T AAAGC AC AAAC AAAC T AAC AAAT AAAT GTAT TGC C AT TGC C TT A C TC AGG ATG TG A 
CCGATACGCGAGGCAATGAATTCGAGGAGTTTTGCCTTAAAAGAGAACTGCTTATGGGTATATTCGAGAAGGGATGGGAGCGCCCTTCGCCAATT 
CAGGAAGCAGCTATTCCTATAGCATTAAGTGGAAAAGATGTGCTGGCTCGGGCCAAGAACGGCACAGGCAAGACAGGAGCCTACTGCATCCCAGT 
TTTAGAACAAATCGACCCAACCAAGGACTACATTCAGGCACTTGTGATGGTCCCCACCCGAGAGTTGGCGCTACAAACCTCACAGATATGTATTG 
AGCTGGCGAAACATCTTGATATCCGGGTAATGGTCACAACGGGAGGTACCATTCTAAAGGATGACATTCTTCGTATTTATCAAAAAGGTGGGTTT 
TGATGATAGTTAAATAACCTAATATCGGTTTAATTGCAACTTTCTAAATTTTCCCTACGCCTGCAGTACAATTAATTATTGCCACTCCCGGACGA 
ATATTAGATCTGATGGATAAGAAAGTTGCCGATATGTCGCATTGCAGAATATTAGTTTTGGATGAGGCCGATAAGCTATTGTCGCTTGATTTTCA 
AGGCATGCTGGATCATGTTATACTAAAGTTACCAAAGGATCCACAGATCTTGCTATTTTCCGCAACATTTCCACTAACAGTCAAGAATTTCATGG 
AGAAGCATTTACGCGAGCCTTACGAGATCAACCTGATGGAGGAGCTAACACTGAAGGGTGTCACTCAGTACTATGCATTCGTACAGGAGCGCCAG 
AAAGTACACTGCTTGAATACGCTCTTTTCGAAGCTGCAGATCAATCAGTCTATCATATTCTGCAATTCCACGCAACGTGTCGAGCTATTGGCCAA 
AAAGATCACAGAGCTTGGATATTGCTGTTACTATATACACGCTAAGATGGCCCAGGCACACAGAAATCGAGTATTCCACGATTTCCGCCAAGGAC 
TCTGTCGCAATCTGGTGTGCTCCGACCTGTTCACCCGTGGTATCGACGTGCAGGCCGTGAATGTAGTAATCAATTTCGATTTCCCACGAATGGCA 
GAGACATACTTGCATCGAATTGGTCGCTCAGGACGTTTCGGTCATTTGGGTGAGTTTAATTGCATTCTCATATTATAGTTTAGCCTATTTACAAT 
CTTCGTTTCTCATTTTAAGGTATTGCTATTAATCTGATAACCTACGAGGATCGGTTTGATCTGCATCGGATTGAAAAAGAACTTGGAACCGAAAT 
AAAGCCTATTCCGAAGGTCATAGACCCCGCCCTTTATGTCGCAAACGTTGGCGCATCCGTTGGCGATACTTGCAACAACAGTGATCTGAATAATT 
CTGCGAACGAGGAGGGCAACGTTAGCAAATAAAACGGATATGCCCTGTGTAACGCTAGTATCTGAAATAGAGTCTAAATCGTGGAATCAATAAAA 
ACAGATGAACTATTTCAGAAATCAAAATTGTATTAAATAAATGGAAAGCAATTCAATTTTTAAATGCGCTCTTAAGATAACTGAAATGACTGGTC 
TATAAAAAAAAGGCTCGTTCACTTTGTGGGTTTTGGTGCTGAGAAGTATTAACATTAACTATTGGGGATTTGGTAAGGAAAGTTAGTCGAATGCT 
ATTAAAAGGCAGGCTAAGGTCGTTTAATATCAATCCAGTTTTTAGTAAAGGGTTTAAATCAAATAAAAGGTCTAAAAGTAATTACAGGATTTTCG 
GACAAGAGAAGTACACTTGGATCAAGCTAAATCCTAGATTTACTTACATGGAAGAATTACCATTGGCAATTACCCGATTTACTAGATTCCAAAAA 
AAAAGTACAAAGGGATCATTATAATTTAAATAAAATTTATTTATTAGCATCCAAAATTGTAAGCTTTGCTTGTCAGAATATTTACAATAAATGTA 
CAATATTAAACAAATCATATATATAGGATAGGGTCATAAATTTCAAAACTTTACGTATTAAATATGTTCGATGAGGAGGAAGAGGAGTCGGATTG 
GTCAAAGTCGATTTTCGCATAGGTATTCCCATCCTCCGGGCATGCGGACACCGGCGGCGATTCGCGTGATCCTCTCTTCAGGTATTTAGCTGGAT 
TTTGGCCACTGCAATGGGGAAGATCTAGGCTGGCGTAGTGCAGCTCCGGCTGGGATATGCGAGACGACGACCTTGAGGGGAAATTCAAATCTTTA 
ATCTGTAGAATGTTCTGTGGTGTACTTGGATCTGTGCCAGTGCTACGCGAATCGGCGGCGGCGGCTGGCTTTGCTGAATTTATATTGAATTTGGT 
GAGTGAGCTGCTGCTCACCGCGGTGGCGGTGGCATTATCGGGAATCGTATTCACACTCTCTGTAAGTCTCTGCGAACGCTCCAAGTTATCCTTGT 
GTATACCCTTTTGGTATAGCTTCGATGAGATGAGTGAGCTGTCGCTTTTAATTTGCAGAATTTTGTAGCCGGCTTCGTTAAATTTTGTGAAATCA 
TTCGATTTGTCCTTGATTT 
(SEQ ID NO: 535) 

Exon: 1001.. 1179 
Exon: 2086.. 2185 
Exon: 2274.. 2652 
Exon: 2727.. 3374 
Exon: 3440.. 3769 
Start ATG: 1133 

Transcript No. : CT15802 

A7AACGCAAAGTTGAATACGT GAGCAGTATATGAAAATCGGT G7AATACAGTGACTTCATTAGTCAGCCGAAAGCAAAAGAGAGCGAAATAC ATT TT 
TTAAACAAACTAAACTAAAATTTCACTGTGAATAATGATGACTGAAAAGTTAAATTCTGGGCACACTAATTTAACAAGCAAGGGGATTATAAATG 
ATCTTCAAATTGCTGGAAACACAAGTGACGATATGGGCTGGAAATCAAAGCTGAAGCTGCCGCCAAAGGACAACCGATTCAAAACAACTGATGTG 
ACCGATACGCGAGGCAATGAATTCGAGGAGTTTTGCCTTAAAAGAGAACTGCTTATGGGTATATTCGAGAAGGGATGGGAGCGCCCTTCGCCAAT 
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TCAGGAAGCAGCTATTCCTATAGCATTAAGTGGAAAAGATGTGCTGGCTCGGGCCAAGAACGGCACAGGCAAGACAGGAGCCTACTGCATCCCAG 
TTTTAGAACAAATCGACCCAACCAAGGACTACATTCAGGCACTTGTGATGGTCCCCACCCGAGAGTTGGCGCTACAAACCTCACAGATATGTATT 
GAGCTGGCGAAACATCTTGATATCCGGGTAATGGTCACAACGGGAGGTACCATTCTAAAGGATGACATTCTTCGTATTTATCAAAAAGTACAATT 
AATTATTGCCACTCCCGGACGAATATTAGATCTGATGGATAAGAAAGTTGCCGATATGTCGCATTGCAGAATATTAGTTTTGGATGAGGCCGATA 
AGCTATTGTCGCTTGATTTTCAAGGCATGCTGGATCATGTTATACTAAAGTTACCAAAGGATCCACAGATCTTGCTATTTTCCGCAACATTTCCA 
C T AAC AGT C AAG AAT T T C AT GGAG AAGC AT T T AC GC G AGC C T T AC G AG AT C AAC C T G AT GG AGG AG C T AA C AC T G AAGG GT G T C AC T CAGT AC T A 
TGCATTCGTACAGGAGCGCCAGAAAGTACACTGCTTGAATACGCTCTTTTCGAAGCTGCAGATCAATCAGTCTATCATATTCTGCAATTCCACGC 
AACGTGTCGAGCTATTGGCCAAAAAGATCACAGAGCTTGGATATTGCTGTTACTATATACACGCTAAGATGGCCCAGGCACACAGAAATCGAGTA 
TTCCACGATTTCCGCCAAGGACTCTGTCGCAATCTGGTGTGCTCCGACCTGTTCACCCGTGGTATCGACGTGCAGGCCGTGAATGTAGTAATCAA 
TTTCGATTTCCCACGAATGGCAGAGACATACTTGCATCGAATTGGTCGCTCAGGACGTTTCGGTCATTTGGGTATTGCTATTAATCTGATAACCT 
ACGAGGATCGGTTTGATCTGCATCGGATTGAAAAAGAACTTGGAACCGAAATAAAGCCTATTCCGAAGGTCATAGACCCCGCCCTTTATGTCGCA 
AACGTTGGCGCATCCGTTGGCGATACTTGCAACAACAGTGATCTGAATAATTCTGCGAACGAGGAGGGCAACGTTAGCAAATAAAACGGATATGC 
CCTGT GTAACGCT AGT ATCTGAAAT AGAGTCT AAATCGTGGAATC AAT AAAAAC AGATGAACT AT T TCAGAAAT CAAAAT T GTATTAAAT AAATG 
GAAAGCAATTCAATTTTTAAA 
{SEQ ID NO: 536) 

Start ATG: 133 

MTEKLNSGHTNLTSKGIINDLQIAGNTSDDMGWKSKLKLPPKDNRFKTTDVTDTRGNEFEEFCLKRELLMGIFEKGWERPSPIQEAAIPIALSGK 

DVLARAKNGTGKTGAYCIPVLEQIDPTKDYIQALVMVPTRELALQTSQICIELAKHLDIRVMVTTGGTILKDDILRIYQKVQLIIATPGRILDLM 

DKKVADMSHCRILVLDEADKLLSLDFQGMLDHVILKLPKDPQILLFSATFPLTVKNFMEKHLREPYEINLMEELTLKGVTQYYAFVQERQKVHCL 

NTLFSKLQINQSIIFCNSTQRVELLAKKITELGYCCYYIHAKMAQAHRNRVFHDFRQGLCRNLVCSDLFTRGIDVQAVNWINFDFPRMAETYLH 

RIGRSGRFGHLGIAINLITYEDRFDLHRIEKELGTEIKPIPKVIDPALYVANVGASVGDTCNNSDLNNSANEEGNVSK* 

{ SEQ ID NO: 537) 

Name : DEAD-box helicase 
Classification: RNA_binding 
Gene Symbol: me31B 
FlyBase ID: FBgn0004419 



Celera Sequence No. : 142000013384326 

ATCACCAAGAGCAGTGCCTACGGAGCCTGCTACCTGGGTGCGGATAGTGCGGACTTCGATCTTCCGAGAAACTATGCCGACAACGTCACAGTACT 
GTACACGTACACCGATCCTCGAAAGAAAGCCAAAGCCACCAACGGCTGTAACTGTTAGATAAATTGTGTTTGACAACCCATTCCGCAATACACTT 
CCCAATTGCCAAATTCGCAATCAATATTCCAGTTAGTGGTACGGATAGGTAGTATTATAGCTTATCAAAAAATAAATAATTTTATCCTTTGACGA 
AATCCGCTGGAAAGTAAAGAAGTTTCCCTTTCAATCTTGCATCAAACTCACCTACATATACTTTCTAAACTTTTTTTGTTGGGTTTGTTAGAAAT 
ACGTATGAAAATTTTATTCGATAATGTTCATTGTATTAATTGTATGTGTCTCAATTGTTGGCTGTCGTGGCTTCTTTTCATGTATGTGGGCATAC 
ATGTGCGGCATCTTCTTTGTAATACTTTGATTGTGTTTAAAGCCAATTTGTTTTTATTTTCGCAGTGCTGTCAATTCAATTCTTTGGCTAATTTG 
ATTCTCGAATCGGGAAGCAGAACATAAATGTATCTGTTTGTAGCGAAGGAATACATAGGCCAAAAACATAATATCTTAGGGAGCTTAGAATAGAG 
TATTTGGGATAATGTACGAGTAGCTGAATCGAATGCCGTAGCAAGTGAATCAAAATTACATCTACATGTATATATATATAATTGCTTGGATAAAA 
CATTTAACGTACTGTAAGATTGAATGCGACATACCAAAAGAGATrTTCAATAAAATCGAATTCAAATAGAACCAAGACCGCCACTTTGACTAAAA 
AAACTGTCAAAGTTTCGGCAGCATAGCCAAGAAAACTTGGCTTTTCGGCTTCTACGTCGCTGTTTTCCTGGGGACTAATTCATCTGCGATTAAGT 
GTAACCTATTTGTATGCGATTAGCTTAAATGCATCCCTTTCCGGGCGACTTCAATTGTTTATGCTCTGCCGCAGCTTCTGGGGATCCTGGTAGTG 
CGACGAGGACGATATGACCGAGGGATGCTGAGGCGGTGGAAAACACCGCACGTGCTCCACATTTGAGCCCTCGCCATCGCCGCCCTTTAGATACT 
TGTTGAGAATGGAGAAGATTTGTGAGTTTAATACTTGGAACCGACGAATCCGATCCACCATTCGCTTCAATTGCTGTTAAGGAATATTCAAATTG 
TACTTTGGGCACTTTGGTAGTTATGTTAGAACTTACAATGCCCTTTACGTCCTCATCCTTGCCATCGACGCGCTGTACCCGCAGAATGTGGTAGC 
AAAAGTCCAGAGCCTCGAAGCGACGCTGCTGTCCCAGCAGTACGATCATGACGCAGCCAGCCCAGTTCAGACCTTCGCCGAAGAGCTCCTCGATG 
GTGTACTCCGTTCCCCTGACTGGAATGCAATAAACGAACTGCAGAGCCGACCAAAGGCGATGGAACTCCGAGCACTCATCCACGTGGATAATCCC 
GTTGGCCGGTGGAGGACCACACCACACAGGGTCGTCCAAATAGCTCTTCACGCGGTTGAGAATAACTTCAAAAATGCTGAGGCCACAGCAAAGAC 
GCTCACGAGTCAGCAAATCTCCCTCTCGTGCTATCATTGCTTGCTGTTGGTAAAGATTATTATTATAAAGTTAATCAATTAAGTTTGGGGGCTCA 
CCTTTGCAGTGCCAATCTTTTCCACGTTTGAAACTATCTGCAGATTGGCAAACTGTGCTTCAAGACGCTTTTGCTTAGCCTCCGGTTTTTCATTC 
TCTGAAATGGGTAGAGATGTAAGTAAGAAAAACTACCCATGAAAGCATCCACTTGTACATACCTTTGCAAAAAGGTCTGGGAAAGATGTTCTGGA 
AGAGGGCGGCGTGCAGTAGATCGCACACCTCCTCCTGCGAGAGCGCCTGTTCGATCAGCAGGCAAAAGATAATGCTGTTGCCAAACTCCCGGAAA 
GACTGAAATAGCTCTGTCTTCGCGTCCGGATATTGCACGATGTCTGTAAGATGAGCCTGATAGTAGCTGAGCACTCCAGGCGAACCATATTCGCA 
ACGGGGTAGTTTGCATGACTTGGGCATGGCAATCATCAGCGTCTTGGTGAACTGCAGCAGCGAGCCCTGGATCAGTGGCTTAACAATGTCCTTCA 
GTATAATGTCCATGACGACTGCTATGCCCTGATAGCCCAGCAGGCGGCACATGGCATGGAAATGCGGTGAACCTACAAAGCCAGTGTACTGGCCA 
TACTGTGTGGAATAGGCAGCATTCAGCTGCTTGGAACCCCACAGATAGTAATGGGACATTTGTGGCGGCTTCTCACGCTGAATAGCCTGTGACGA 
CGAAAGATTCACTTTGGTTCGAATGAAACTGCAAGAGATAATAGTAATAATAAGTATTTTAAACAATAGAAAATTGGCGGCACACCCACCGATTC 
GTGGCTGCATTATAGCAGTAATTGACCAGAAAGTCGTAGTTCAATTCCACAAAGACGTGCAGTGTTATTCTGCCGTAAGGCGCCAGTACATTGTG 
GTTAGCCTCCTTTACCATGCCGTCAAAGTTATCCAGGGCCAAGTACTTGCTTAGCAACTTGTGGCAAATTCGATTCGCCTCTAGCAGGCCTTCGA 
GTTCCTGAAATTAAGCGAATTAGAGACCGATTCTAGACATCTGTAGGATTCCACTTACCACAATGCCGGTTATATCGTTGCCCTCGAAACGACTG 
ATGGCCAACTCAATGCTCTTGTGCATGTTGGCGTTTATGCGTTGTGTGATCAGTTTGTTCAAGTCAATAGAACGGCCTTAAACGAATGGAGAAAC 
CATTAACCAAAAGATCATGATGCAATGGGTGTTTTGCTTACCCAAAAGTTGGACATGTCGTTGTTTTAACAAGGTCTCATAACGATTATTGCGGG 
GATACGACTGGAAGTTGAAACCGAGTACCTCGCACTCCAAACGGAATCGCTTGTCCAAGAAGATGCTGCCCGCCAATTGCTTGTAATGGGCAAAG 
ATCTGCTCGCTCAGCTTGTAGACAAACTGATCGAAGCACAGATTGACTTCGGCCTCCACTTCGTCGTACAGGAACTGCTTTCGGAAGACAGTGAG 
TGCGTAGTACGCCGAGTCGTTGTACAAGTCCAGGGGATACAAGACAAATCTGTGTGAGGCACTTAGTTAGTTCAGATCATTTCACCTATTCGCCA 
TTTACTTACTCCATCATTGAGGGCTCCTTCGTTTGCAGAATGTGATCGGTTAGGATCCAGGGCATGGACATTTCAATCGGAAACTGAATGCGCTT 
CTCCATGGTGATCAGATCCTTGCACTCCTCGTTGTGCTGATGCCGCACCAAGCACTTGTTAACCTTACGACCCATGGTCArCTCCAAGTAGAACT 
CGCGATACCACAGCTGCGAGAGATCGCAGCACTTCTGAAGCGTGTCGCTGAAGTTGAGCAAGTAGCTCCAGTAGAACGATGTCTTGTGGAACGTG 
TCGATCTGCAGCAAACAGTTTCCGTCGATGTCCTTGCGCAAAGTGCGCTTGCCGCCGCTTTTGTCGGCAATCAGAGACTCCAGCATGGTGCGCAC 
CATGTATAGCTGAGTGGAGGAAGGTCCCACATTGAGGCGAGGCACCTGGATGCGGAAGCCGCCGTCGGGATCCTTCTTGCCCTTGGCCACTGGAT 
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CGTCGGTGGGCTCATAGCCCTTTTGCCAGTCCGCCGATGTCTCTCGCACCGACATGATGATGCTCCGGATAAGATCCTTCTTATTTTTTACGGCC 
TTTCGCAGCGGTTCACGAAGCGAAAGCTGCACGAAGTCCTGCAGCTCGGAATAGATGTTACGCCGGATAGCCTCGCACAATACGGTTTCAATGCG 
CGCCATTAGCACCTGCAGTCCCTTTATCATGGCGATGACCTCGATCAGCGCGAACTTCTCTTCCGAGGTGTAATTGTAGCGGGTGGCCCGCTCGT 
ACTCCTCAGCCTCCACGGGGCACTCCTTGTTCTGGTGATGATCAGTCGGATGCAGGAGCTTCCAGGAGTACAACTCGGTGACCACACTGGTCCAC 
TCGGACAGCAATTGAAGGCCGCGCAGAGCCAGGTCGGCCGTAATCCGGTTCTCGGCATCAGAGGGATTCTCCTTGACGGTGGTGGTCACCTCGTT 
TGTGTATCGCGCCAGCTCCGAAATGTACTTGACGTGATCCTCGCGTATCTGCGGCAAATGCACCATTAGATCCGCCTGTGGGCTGATGGCGTTTG 
AGCTGGACAGCGGCCACTTGCTGGAATCGAAGTGCTTGCTGCGCTTTATGTAGTTGAAGGGAGCAATCTGCATGTCGCCGAAGAGGGGCACCACT 
TCCAGGTTCTTAAAGATGCGATCGATGCGATCCAGCCGTATCTTCTTCTTTTGATCAAGTTTGTTGATATTGCACGCATCGCTGTCCATAAGGAA 
CAGGCCAAAGCCCATAACTTTTACCAGCATGTGCTTCTCCTCAGGGGTCAAGTACATCTTTGTCTCGAACATATGCACGCAAATGTTGACCACAT 
CTGAGAGCAGATCCTCGTAGCCCACGATCTTCTCCAGCGTGTCCTTGACCGTGTCGCGAATCTTGTTCTGCGTGGCCAGGAACATGGACAGATTC 
TGCGACTCCTGCAAGGTGTGCGAGTCCGACATCACTTTGAGGAACTGCGCAGCTCGCCTGTAGGTGGAGTAATCGTTCTTCACGCTGGACTTCAT 
GTTCTTCAGCTCGTCCAGCACGGCAAACATGTTGATAAACTTGCCTAGCGTCAATAGGTACGCCTCGGACACGAAGTCCTTGCGCTTCTCGGCGT 
GGCACAAGCGCTTCACCTCGCCGGAGAAGGCCTCGATGGCCTTGCGTTGGAAGTACATGAAGTTGAGCAGCTTGTTCACCTCCGGGGCTAGCACC 
TCGACCGTCTTCTCGTAGATCTCCACGCGATTCGGCTGCTCATTGGACTTGGGCTGCGGAATGGCGCGCGAGCAGCAGCGCCAGGTGTAGAGCAT 
TACTGCGTGTTTCTGCCCCTCGTCCAGGAGCACATTCTGCGGGCGGAGAGAAGGGGTTCAGAGCTGTGAGTAATGCTAATGCGATAAGCCGGATA 
CGCTAACTCACCAAGTTGGCATGGGTGGTGGCCTCCTCGATGTACTTGGCGATACCCGTGACAAATCCATTGCGATCCTCGAAATTCGTATCGAA 
GTTTGCCTTGTAGATAATCGAGCAGGGCTGCGCCTCGATGCAGGGCTGCTCGTCCGGCAGGGATAGCTCGTCCAACACCTCCACGTTGGACAGCG 
CGTCGGCTAGCGTAATCTTCTCCGTCATGCTGGGCTGCTTATCATGCTCCAGCAGCGCGAAATGGTTAAATTCTAACAAGGCGGGGCCCCAATCC 
AATCGACACAAATTTTTTTCGCCACAAATACAACACGACTTTTACCATTCGCCTGGGTGCGTGGACACGTGGCCATAATTAGCACATTTTTGCTG 
GCGGTTGGCGTCTACGATGTGCTTATGCTTATATGGCTGCGCACCGCTAATTTCGATAAGAGAAAGATAATCGATAAATACGGCGCATTGGCGGC 
ACATCACATCACTAGCGTCTGTGGTTCCGTTTCGTTTTAGCGAACTCAATTTCTCCTATTTTATATTGCATTTTCCGTTAAGAGTAGCCTACAAT 
TCTGCAAGATGTCGATGTTTTGGGGTAAGTCGAACTGATCTCACTTGCCCCCTCCGGGCTGGTTGTGCCAACGAGCTCTTGGGTGCTCCGACCGC 
GCTTTTATTGCCACCACGTGCTTGCCGCTCGATTTACCGGCAGCGTGGAGCGGCGCCATTTGCACGCCGATGTAAAGTAAACCATGTTTTTGACC 
AACTGTGTCCTCTTCCAGGTTTGAACATGAAGCCTGAGCGCAAGTACTCGCAGACGATCATCAAGTCGTTCCACATCTCGGGCGTGGCACTCGAC 
AAGGGCCAGGAGGCCAAGTTGTACCTCGCTGCCGAGAAGCAGGAGTACATCGTGGCCACCGTGACCAAGGCCATTCCACAGGTTGCGCTCGACCT 
GAACTTCAGCAAGGGCGATCGCATCATGTTCTACACCGCCGGTAAGACCTTTTATGTCCACTATGTTCGCATGTTTATCCGTTTCTGACGCTTTC 
GTTCGTTTCACAGGCGACGCCAGCGTTTCTCTGTTGGGCTACTTGCACGACATTGATTCCGAAGATGATGAGGATGACGATCAAATGACGATCGA 
GAATTTGCTGAACTCCAAGGCGATCAAGAATAGCAAAAAGTCTGAAGATGATGAGGATGAGAATGAGAGCGGCGAGGAGGACGAGGAGGATACTG 
ATGACGATAGCCAGATAATCGAGGAATACGAGTCCTTCCTCGAAAATGGTGAGGAGGAAGATGACGACGATGTTGATGAGGATAACGAGGAGA 
(SEQ ID NO: 538) 

Exon: 5458.. 5142 
Exon: 5071.. 3240 
Exon: 3184.. 2892 
Exon: 2831. .2719 
Exon: 2664.. 2465 
Exon: 2403.. 1868 
Exon: 1806.. 1712 
Exon: 1658.. 1272 
Exon: 1213.. 1001 

Start ATG: 5348 (Reverse strand: CAT) 
Transcript No. : CT15834 

AAAAGTCGTGTTGTATTTGTGGCGAAAAAAATTTGTGTCGATTGGATTGGGGCCCCGCCTTGTTAGAATTTAACCATTTCGCGCTGCTGGAGCAT 
GATAAGCAGCCCAGCATGACGGAGAAGATTACGCTAGCCGACGCGCTGTCCAACGTGGAGGTGTTGGACGAGCTATCCCTGCCGGACGAGCAGCC 
CTGCATCGAGGCGCAGCCCTGCTCGATTATCTACAAGGCAAACTTCGATACGAATTTCGAGGATCGCAATGGATTTGTCACGGGTATCGCCAAGT 
ACATCGAGGAGGCCACCACCCATGCCAACTTGAATGTGCTCCTGGACGAGGGGCAGAAACACGCAGTAATGCTCTACACCTGGCGCTGCTGCTCG 
CGCGCCATTCCGCAGCCCAAGTCCAATGAGCAGCCGAATCGCGTGGAGATCTACGAGAAGACGGTCGAGGTGCTAGCCCCGGAGGTGAACAAGCT 
GCTCAACTTCATGTACTTCCAACGCAAGGCCATCGAGGCCTTCTCCGGCGAGGTGAAGCGCTTGTGCCACGCCGAGAAGCGCAAGGACTTCGTGT 
C C G AGG C GT ACC T AT T G AC GC T AGG C AAGT T TAT C AA C AT GTTTGCCGTGCT GG A C G AGC T G AAG AAC AT GAAGTC C AG C GT G AAG AAC GAT T AC 
TCCACCTACAGGCGAGCTGCGCAGTTCCTCAAAGTGATGTCGGACTCGCACACCTTGCAGGAGTCGCAGAATCTGTCCATGTTCCTGGCCACGCA 
GAACAAGATTCGCGACACGGTCAAGGACACGCTGGAGAAGATCGTGGGCTACGAGGATCTGCTCTCAGATGTGGTCAACATTTGCGTGCATATGT 
TCGAGACAAAGATGTACTTGACCCCTGAGGAGAAGCACATGCTGGTAAAAGTTATGGGCTTTGGCCTGTTCCTTATGGACAGCGATGCGTGCAAT 
ATCAACAAACTTGATCAAAAGAAGAAGATACGGCTGGATCGCATCGATCGCATCTTTAAGAACCTGGAAGTGGTGCCCCTCTTCGGCGACATGCA 
GATTGCTCCCTTCAACTACATAAAGCGCAGCAAGCACTTCGATTCCAGCAAGTGGCCGCTGTCCAGCTCAAACGCCATCAGCCCACAGGCGGATC 
TAATGGTGCATTTGCCGCAGATACGCGAGGATCACGTCAAGTACATTTCGGAGCTGGCGCGATACACAAACGAGGTGACCACCACCGTCAAGGAG 
AATCCCTCTGATGCCGAGAACCGGATTACGGCCGACCTGGCTCTGCGCGGCCTTCAATTGCTGTCCGAGTGGACCAGTGTGGTCACCGAGTTGTA 
CTCCTGGAAGCTCCTGCATCCGACTGATCATCACCAGAACAAGGAGTGCCCCGTGGAGGCTGAGGAGTACGAGCGGGCCACCCGCTACAATTACA 
CCTCGGAAGAGAAGTTCGCGCTGATCGAGGTCATCGCCATGATAAAGGGACTGCAGGTGCTAATGGCGCGCATTGAAACCGTATTGTGCGAGGCT 
ATCCGGCGTAACATCTATTCCGAGCTGCAGGACTTCGTGCAGCTTTCGCTTCGTGAACCGCTGCGAAAGGCCGTAAAAAATAAGAAGGATCTTAT 
CCGGAGCATCATCATGTCGGTGCGAGAGACATCGGCGGACTGGCAAAAGGGCTATGAGCCCACCGACGATCCAGTGGCCAAGGGCAAGAAGGATC 
CCGACGGCGGCTTCCGCATCCAGGTGCCTCGCCTCAATGTGGGACCTTCCTCCACTCAGCTATACATGGTGCGCACCATGCTGGAGTCTCTGATT 
GCCGACAAAAGCGGCGGCAAGCGCACTTTGCGCAAGGACATCGACGGAAACTGTTTGCTGCAGATCGACACGTTCCACAAGACATCGTTCTACTG 
GAGCTACTTGCTCAACTTCAGCGACACGCTTCAGAAGTGCTGCGATCTCTCGCAGCTGTGGTATCGCGAGTTCTACTTGGAGATGACCATGGGTC 
GTAAGGTTAACAAGTGCTTGGTGCGGCATCAGCACAACGAGGAGTGCAAGGATCTGATCACCATGGAGAAGCGCATTCAGTTTCCGATTGAAATG 
TCCATGCCCTGGATCCTAACCGATCACATTCTGCAAACGAAGGAGCCCTCAATGATGGAATTTGTCTTGTATCCCCTGGACTTGTACAACGACTC 
GGCGTACTACGCACTCACTGTCTTCCGAAAGCAGTTCCTGTACGACGAAGTGGAGGCCGAAGTCAATCTGTGCTTCGATCAGTTTGTCTACAAGC 
TGAGCGAGCAGATCTTTGCCCATTACAAGCAATTGGCGGGCAGCATCTTCTTGGACAAGCGATTCCGTTTGGAGTGCGAGGTACTCGGTTTCAAC 
TTCCAGTCGTATCCCCGCAATAATCGTTATGAGACCTTGTTAAAACAACGACATGTCCAACTTTTGGGCCGTTCTATTGACTTGAACAAACTGAT 
CACACAACGCATAAACGCCAACATGCACAAGAGCATTGAGTTGGCCATCAGTCGTTTCGAGGGCAACGATATAACCGGCATTGTGGAACTCGAAG 
GCCTGCTAGAGGCGAATCGAATTTGCCACAAGTTGCTAAGCAAGTACTTGGCCCTGGATAACTTTGACGGCATGGTAAAGGAGGCTAACCACAAT 
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GTACTGGCGCCTTACGGCAGAATAACACTGCACGTCTTTGTGGAATTGAACTACGACTTTCTGGTCAATTACTGCTATAATGCAGCCACGAATCG 
TTTCATTCGAACCAAAGTGAATCTTTCGTCGTCACAGGCTATTCAGCGTGAGAAGCCGCCACAAATGTCCCATTACTATCTGTGGGGTTCCAAGC 
AGCTGAATGCTGCCTATTCCACACAGTATGGCCAGTACACTGGCTTTGTAGGTTCACCGCATTTCCATGCCATGTGCCGCCTGCTGGGCTATCAG 
GGCATAGCAGTCGTCATGGACATTATACTGAAGGACATTGTTAAGCCACTGATCCAGGGCTCGCTGCTGCAGTTCACCAAGACGCTGATGATTGC 
CATGCCCAAGTCATGCAAACTACCCCGTTGCGAATATGGTTCGCCTGGAGTGCTCAGCTACTATCAGGCTCATCTTACAGACATCGTGCAATATC 
CGGACGCGAAGACAGAGCTATTTCAGTCTTTCCGGGAGTTTGGCAACAGCATTATCTTTTGCCTGCTGATCGAACAGGCGCTCTCGCAGGAGGAG 
GTGTGCGATCTACTGCACGCCGCCCTCTTCCAGAACATCTTTCCCAGACCTTTTTGCAAAGAGAATGAAAAACCGGAGGCTAAGCAAAAGCGTCT 
TGAAGCACAGTTTGCCAATCTGCAGATAGTTTCAAACGTGGAAAAGATTGGCACTGCAAAGCAAGCAATGATAGCACGAGAGGGAGATTTGCTGA 
CTCGTGAGCGTCTTTGCTGTGGCCTCAGCATTTTTGAAGTTATTCTCAACCGCGTGAAGAGCTATTTGGACGACCCTGTGTGGTGTGGTCCTCCA 
CCGGCCAACGGGATTATCCACGTGGATGAGTGCTCGGAGTTCCATCGCCTTTGGTCGGCTCTGCAGTTCGTTTATTGCATTCCAGTCAGGGGAAC 
GGAGTACACCATCGAGGAGCTCTTCGGCGAAGGTCTGAACTGGGCTGGCTGCGTCATGATCGTACTGCTGGGACAGCAGCGTCGCTTCGAGGCTC 
TGGACTTTTGCTACCACATTCTGCGGGTACAGCGCGTCGATGGCAAGGATGAGGACGTAAAGGGCATTCAATTGAAGCGAATGGTGGATCGGATT 
CGTCGGTTCCAAGTATTAAACTCACAAATCTTCTCCATTCTCAACAAGTATCTAAAGGGCGGCGATGGCGAGGGCTCAAATGTGGAGCACGTGCG 
GTGTTTTCCACCGCCTCAGCATCCCTCGGTCATATCGTCCTCGTCGCACTACCAGGATCCCCAGAAGCTGCGGCAGAGCATAAACAATTGA 
(SEQ ID NO: 539) 

Start ATG: 111 (Reverse strand: CAT) 

MTEKITLADALSNVEVLDELSLPDEQPCIEAQPCSIIYKANFDTNFEDRKGFVTGIAKYIEEATTHANLNVLLDEGQKHAVMLYTWRCCSRAIPQ 

PKSNEQPNRVEIYEKTVEVLAPEVNKLLNFMYFQRKAIEAFSGEVKRLCHAEKRKDFVSEAYLLTLGKFINMFAVLDELKNMKSSVKNDYSTYRP 

AAQFLKVMSDSHTLQESQNLSMFLATQNKIRDTVKDTLEKIVGYEDLLSDVVNICVHMFETKMYLTPEEKHMLVKVMGFGLFLMDSDACNINKLD 

QKKKIRLDRIDRIFKNLEVVPLFGDMQIAPFNYIKRSKHFDSSKWPLSSSNAISPQADLMVHLPQIREDHVKYISELARYTNEVTTTVKENPSDA 

ENRITADLALRGLQLLSEWTSVVTELYSWKLLHPTDHHQNKECPVEAEEYERATRYNYTSEEKFALIEVIAMIKGLQVLMARIETVLCEAIRRNI 

YSELQDFVQLSLREPLRKAVKNKKDLIRSIIMSVRETSADWQKGYEPTDDPVAKGKKDPDGGFRIQVPRLNVGPSSTQLYMVRTMLESLIADKSG 

GKRTLRKDIDGKCLLQIDTFHKTSFYWSYLLNFSDTLQKCCDLSQLWYREFYLEMTMGRKVNKCLVRHQHNEECKDLITMEKRIQFPIEMSMPWI 

LTDHILQTKEPSMMEFVLYPLDLYNDSAYYALTVFRKQFLYDEVEAEVNLCFDQFVYKLSEQIFAHYKQLAGSIFLDKRFRLECEVLGFNFQSYP 

RNNRYETLLKQRHVQLLGRSIDLNKLITQRINANMHKSIELAISRFEGNDITGIVELEGLLEANRICHKLLSKYLALDNFDGMVKEANHNVLAPY 

GRITLHVFVELNYDFLVNYCYNAATNRFIRTKVNLSSSQAIQREKPPQMSHYYLWGSKQLKAAYSTQYGQYTGFVGSPHFHAMCRLLGYOGIAW 

MDIILKDIVKPLIQGSLLQFTKTLMIAMPKSCKLPRCEYGSPGVLSYYQAHLTDIVQYPDAKTELFQSFREFGNSIIFCLLIEQALSQEEVCDLL 

HAALFQNIFPRPFCKENEKPEAKQKRLEAQFANLQIVSNVEKIGTAKQAMIAREGDLLTRERLCCGLSIFEVILNRVKSYLDDPVWCGPPPANGI 

IHVDECSEFHRLWSALQFVYCIPVRGTEYTIEELFGEGLNWAGCVMIVLLGQQRRFEALDFCYHILRVQRVDGKDEDVKGIQLKRMVDRIRRFQV 

LNSQIFSILWKYLKGGDGEGSNVEHVRCFPPPQHPSVISSSSHYQDPQKLRQSINN* 

(SEQ ID NO; 540) 

Name: SHYOlike 



Celera Sequence No. : 142000013384668 

CAGCGGCTGCGACGGCACCACCGCCGCCCACCGGCATCGAGGAGAGCTTCTCGCGACCCTCCTTGATCAGGTCGTCGATGCTCTTGCCAGCCAGC 
TCCTTGATGACCTTGGTCAGACGCTCGGCGTCGACCTCAACGCCCACAGAGCTGAGGATCTTCTCCAGATCGCTGTTGGCGGGCGAGTCCTTGCC 
ACCGAGGACGGCCAGAAGGTAAGCAGCCACGTAACGCATGTTTAAGTCTTGGGTTCTTGACACGAAACGTGAATTGGCAAAATGTACGAAAAACA 
CAACTTTCCTTTGCAAGTTGTCGTCGAAAAAGAGGCGGAAGTCAGTGTGGCCGCGCGGTCGAGGCTCGAAAATACCAGCTCATACTTATATATAC 
CAAAGTATACCACATCCTTATCTAGCAATACCCCTAATGAAACGATAAATTCAAATATTATTTTCAATTTTGCTTTTAAATAAAAATTAGAGCAA 
TATTATTGATTTTTGAACAATTTGGTGTGTTTTTTTTATATTTACAACATCAAAAAGTATTTATATTAGCTTAATAATATGACCTTTCTTACAGC 
ATGGAAATGTTATCAGTTGCAACACTGGGTTTCAAACTTGGCTATCGTAATAGTGGGAAGCACTTTGCTCTTGCAGTAGGCGGTCACACCATTTG 
AAAC GGT T C T GT T AGG C G AGGT C T AA C AT AC T GT T AAG T GC T GG AAAG AG AGC AAAAT G AGAGT GAT C GT AAGC T AAG AG AAGG AAC T AGGG AT G 
C GT AAAAGT T AAC AGC T GGC T AT C GAT G AT T AA C AT GAC CAT C GT T T T AACAGC AG AAC TT AAC T GT TAT T TG T AC AC A T AAAT A T AAAAC TGT A 
ATAAAACTCGAAAAAGCTGTATAAAAAATAAATTTTTTTCGAAAATATGTTTAATATGAGATATTTATTATTCTTCAAAAACACTTTAAATTTAA 
GTTGCACCCCCACTTGAAATAAATTGCAGAAAAGACATGAGACCCATCTTATTGTATGCTGTATATATAATTGCATATATAAATGATTGTTGTAC 
AAAATTTTGAAATTGTATAATGCATTCCTGAACTTTGTGTATCACATTTCGATTTGGAGGGCCTTCAAACGCCCTCGAGAATCGTTTCACATGTA 
ATGGTCATCAGATAGTTGATCGGCAGTAGTATTCACACTTAACTTACGACTACAAACAAATTAAGGTATTCTTTATTCCGGATCCGTATCCCCGG 
ACACAATAACTGGACTGTCGCGAAAAAATAAACAAAATGCTTCACTACACTTAAAGGAACATTGGTAACAAGAACGAAGCGCGGCCGCAATGATC 
AGTCCACTGCCACTTTCACTTCCACATCCATAATCCATAATCCATAATCCATAATCCATTATCCATTATCCTTCTGGGCATCGAGACTATAGGGG 
TTCCTGCTGAAAGTAATCGTGCTCCAAACAGGCCAAGGCGGATGGTCGCAGGTGGAGGTCGTAGGAAAGCATTTTCTGCGGGCAAACATAGGGTA 
TGCGTTAGTCTAGACCATGCAACAAGATATGTATCTTCATTTTCATACTCACATTCAACAGATCATCGGCGTACTTGCACAGGTGTGGGCAAAAG 
TCTTTTGGTCTCTTTGGATGCCGCTGGGGAAAGTGTTCCAGGGCCACGGAAATGGTCTGTGGCCACTGTTGCTCGGTGGGTCTACCGGTCAGTCT 
GCGTGAAATCGCATAAATGTAATTAAAAATCACCGATTTCAATGGCAGTATTTGTGCACTCACTCGAAGATGCGGTCCAGCTGATTCTTTTCGGA 
GGTGCCCGGGAATAGGGCGCGCCGATTGAACATCTCGAAGATGATGCAGGCGGCACTCCAGATGTCCACGGTGCTGTTGTAGGGCTGGGCGAGCA 
GAACCTCGGGTGCGCGGTACCACAAGGTGACCACCACGGAGGTGAGCTTCATCTCCGAACCGTAGGTCTTGGCCAAGCCAAAGTCGGCGATCTTG 
AGGTGGCCCTGCGAGGAGACC AGC AGGT TTTGCGGCTTAAGATCCCGATGGATGATACGGTGTGAGTGTAAGAAGTCCACACCGGTCAGTAGTTC 
CCTTGACAAGCGCTGCAATAGGAAATAATTTCATATACCATTTTCTGAAATTTTGATATCATTTTGAGACCCACCTGAATTGTGGGCGGCGACAT 
GCCCGATTTGGGAAGTCTGTCTATAAGATCCGAAAGATCCTGCTCCACATGTTCGAAAACCAACAGAATCAGCAATTGGCCGTCGCGCTCCAGAA 
ACTGGCATACTTCGTACAGTCTGAAAGAGAGGGGAAGTGCAATCTTTGTATGTACAAATTGTTTGATTTTGTGTGCGACAGAGAGCGATGTATGG 
GGGAAATGTTGAGAGATATAAAAAAAATGTGTCATAAAATGCTGCTGGTTGTGGTGCGCCAATACGCCATTGGCCATACTTTGTTCGTTTCGGGA 
ACTACGCGAAGCACTCGAGACAAACACATGCGCACATGGCGATCAAAGAGCCAAAGAGAGCGTCCAGTAAGAGAGGCTACAGATCGTTTTCGGAC 
AGAGGCAAAACGTAAACGTTAGCAGAGCAGATTAGCATTATGTTATGTTAGCATTATGTTAGAATCAGTTCGGGCAGTTATTTTAACATTGCCTT 
TGAATTACTTAAAAGGCACAGTTTTATTGAAACCTTCTACTGTTACACTTGCTTTAGCAACTGGCACATTGGGTTTTTAAAAATACAGATAAATA 
CAAGAAACTTAATTTGCCATTGATTTATTTCTTGTACCCCTCCAACACTTTTGGCTTCAAACAAAATAACAGCTGCACTTTGTAGCCATTTGCAA 
CTTAACTTAACATTCTGTTGTGCGAGATCTCTCTCTCTCTCTCTCTCTCTGTCGATCTCTTTGTACTGTATTCGCGTCGCGCTCTCACGCTGCTA 
AAGTGCAAGCGAGTGCGCACATCGATAACGCGCGACAGCAACATGTGGTTATCGAGCTGCTCTGTTAATTCAAATTCCTAACGGCCGCCGCAAAC 
AAACATAATGCCAATACAAAATCGCCCACATTTGCGAACTGCAGTGATCAGTTATCAGCGGGAGCGGATCGACTTGCTTCAGACTTACTTGACTT 
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GACTGGCATATGTCAGAAGCGAGGGCGTTCGCCGGGGGCAAATCGTCGGAATATTACGGCTATAGCATGCACAGATAAACACAAGCGCGCTGACA 
TTGCGATGCGGCTCTTTACAAATGAGTTTGGCAAATAGCTTTGGCGCCGCCACTTGGCCCCAATGGCGGGTGATAACACCCCTCATATACAGCTA 
GCTAGCTATAGTTCTACATAAAGTACTCAACTCCAAGGCCATTTAGAAGCCCACTGCCCTTGTGGCAGCGTGTAAATCACCCATACTCACTTGAC 
AATATTCGCATGATTGCTGGCATTCAGCTGCTTCAGCAGCGAAATCTCCCGCAGCGTGGACATTGGCACACCGTTCTCGTTCAGCGATATCCTAA 
CCTTCTTCAGCGCCACTATGTTGCCGGTGATCACATCGCGGGCTCTGTAAACCGTTCCATAGGCACCTGAAAAGTGGATCGAAATTGTACATCAA 
AATGCTTACAACGGGAAATAAATGCGAAAGACAGCCGAAAATAGAATAAAATAAAGTCAGCTGTGCCGATAAGATAAATGGCAGAAATCAGAGAG 
GCGAGAGAAAGAGAGAAGGTGCCACTTAGCTTCTTCATATCATTATGGTGTATTTATACTTTTCATATTCAACATCGTATTTATAGCCCTTACGA 
TTGAAGTGACTTGATTTATTTGCCTTGTTCGATGAACTAACTGTTTATAATGATTGAATGAATAGACAATGAATGCATTTAGCCCAAAAGAATCG 
TATTGCAATGCATGCAATAATTGCATATATTAGAGACTATATTATATGCACTCAAAATTAGTCAAAATATTGCACGAAATTCAGACTAATTTCCT 
CATGAAACTGTTGCTGTCTCAGGTACAACGTGATCTTAATTTCTACATGTTCACGGGCAAAGAAAAACGACAAGAGACGAGCAAAATTCCCAATC 
AGAATCAAATTCTGGCATGCCAGAGAAATGTCACCGAAAATTTATCACCCGTACGGACAAATGGACAAATGGACAAACGGACGGACGGACGGACA 
TGCGTGGGGGGTAGAAAAGCTAGAAGGCACCGGCTATTTCGGTTGACACTGACATACAGAAACAGCAGATATGTACTGATACACCCGAAATAAAC 
GCAACTTGAGGGGGGGGGGGGGTGAATTTGGTTTTCCGTCCCTTTGTCACCGTCTCTAGACCACCTGGCTTATGAACTGTGCCGCTGTTGCTACA 
TTCAAGTACAAGGCCGTGAGCTCGACTCGTTTTTGAACGTCGGCAACTGGTCTGTCCAATTGGAATTGTCAAAAAAATATACAAATATACAAGCA 
AAATAAACTCGAAAAATTGCGCAAAGAAAACAACAAAGCAAAGCACTATGGGAAATTTACGCACTTTATCTGGCCGAAAACCGATTAATATATGT 
AATATTAAATGCATGCATTTAGTTACAACGGTGACTACTGTTAGGCAATGCAAAAAAGTTAATTGAATGACTATCTGCAAACTAAAAAATATATA 
TTATATATCATTAAAAATATGGGCTAGTGTAAGGCAATGCAAAAAAAGTTTGGAGTTAGTTTGGCCACAAAAAGTGGTCAAATTTTCCATAGTAC 
AAAGTAGCAAAAATGAGACTGAGCGAAAGCGAATAATAATTGAATGAATGTGCACAGAAAGGCAGACAGCCGAAAGTTTGAGTGTATGTGTGTGT 
GTGTGCGTTGTATAACAGATGAAATTGTGCGGACCCATGACTCGCACAACACCCAGTAACAATATCCATCATTAGCATTCGCCTGCCCATGATGA 
TCGGCATGATCGGTGACGTAATTGATTTGGGCTTTATTTACAGCAAACAAGCGATCGTAATTACTGACCACAAAAAAAAAAGACCATGAGCCACT 
TGTCAGATCATCATGGATGTGAGATCGGCAGAAACACGTAGTCCGCCGAGAGTGGACCTAAAAGTATGCAGGTAGAAGGTGTAGGATCCAATACA 
GGGGAGTTTTCCATCCAATATTTCCCCTTTTCGCGGAATTTCCACGCTTAATTAGTAAAGGGGTAAAAAAACAACAACATTCCTCCAAGTGCACT 
TGTCATTTCAGCTTCATAACAATGTCGGAGCTCCCGATTGATCGTAAACAGTAGTGAAACCAAATAGCCCAGACTTTGTAAGCAAATTAAAGTAA 
TCAAACGTTAACAACAAGCGACCAGCAAACGAAAGCCTCTCGTGCGAAGAGAGCCGGACCTATCCCAACAACAAAGCAACCGCGTCCCATTATTA 
AATCCCACCCAAGTCGCTCACCTTCGCCGATGATGTTGAGCTCCTGGTAGTTGAACGGATCGCCATCGCCGAACTTCTTCGCCTGGGACATTTTC 
TGCCGCTTCAGCTGGCGTACATATGACATTTAGCTGGATTTTACGGTTATTTGCGTATTCCTCTAATATTCCGGGTAATCGAAACCGTTAGTTGC 
ACCAGCTGGAAGTGAAGGGGTGTTTTTTTTTTGGTTATCGATCGTCTGCATGTCACGTATCGATATGTCCTTGTGTCCCCACGCGATAGTTCACA 
CAGATCGTAGCGTGAGAGGGAGAGTGCGAGTGCGTGAGAGAGAGAGGGAGGGCCTTACCTGATTTCATACCTGTTGCCTTTTCTAGATAACCCCA 
AAACTGGCGATAAAGACTGGTGTGACAGGTGCGCTAATGGAAGAAGTCTGGCTATATGCTCCGCTATGCTTTTCTATCTCCGAGAACCGAAAACC 
GAAACATATACATGGACAACTACATATGTATGTATGTATGTAGATGGCGGCGGTCGTTGAACCGCGTTTAATGTTTTCGTACATACTCCGTTTTT 
ATGTCTAACCAATCGCAGAACGGCAAAGGCCAACACTAGTTATGACTTATTCATGTTATTCAGTGACGCTGCTCGCCGTCTCCGTTTTGCGGCTT 
CGTTCGTGCATCCTGTTGCTCTTATTCGAGCTTGACTGTGAACGCTGTACTGGTTATCTACTGTCGCGTTAGTTGGCGTGTGTCCACTATTCTCT 
CCTATCTAAAGTGCTACCACTTTTCTGTCAGTGCATTTTTTTTGCCCGAAAAATATACATATACCGCAAAAAAGAACCCGAATACAATGTATTTT 
GTTGCTGGCATTTGTCAATTAATTCTAATCGCGAGACAAGCGAAACAACAATAATAAGCGACACAGTCACAAAACTGGGTAGAAAGAGAGTGTCC 
AGAATTGTGAAAGGTArTACGAGGTATTACGAGCTGTATATTTGGGTCTAAAATACATAGTTATTGTTGCACCAGTTTTCGCCCCCTCTCTCTCC 
CTCGTCATCCCTCTCTCGCTCACCATTTGCGCTCTACTGATAAGGAAATTTCGGCTTCACTTTACTTTTTTTTCGGTGAACTTCAATGCCAATGT 
TGTCGTTTTTCTTTCAACCGTTGTGCGAAATATTAGTTAAATATTAAGAAAATGTGGTTTTTATTTGCGAAGACTCGAACACACACACAACGCCG 
C T AAGT GAT T T C GC T T GAG AG AT C GGT C T T AC AC AC AC AC AC AT AC AC AAAC ACAAAC G C AC AC ACC G AGT GGT AC AC AC AC AC AGGC G C AG GT A 
TTATGTATACATATATGATTTTTATTCGCTTTGACGAAATTTTGGCATTTGTTGATAATCCAGCTAGAACCGCACGATAGGAGCAACGCGACGCG 
AATTTTCGGTGCGAACACGGTTCAGTTGCCGGCAGTCGCACTTTAATTAAGTTTTTAAAACGTCTTATAAATTTTTACCAATATTCGTAAGAGTT 
GAGTGATTTTGTGTTTTGTTGTTGGCACAGCTCTGGAAACTGTTTTAGATGGTCTACTTAACAGTGGCTTCTCTTCGCCTGTTATCGCCCGCTGT 
TAGCAAACAGACGGCCGCTAACATTGTGCTGTATACGTCCAATCGGTTAAACGGTTAATTTCGAAATTGCTGCCAGCGATCGTTATTGAATGTTA 
TTCCAATTCGATAACGTAAGTGTTATTGTGTTATACTAATGTTATTGATTTCTGGGAATTTGATATTTTCAACTTGTAATAATGCCCAAATTTAT 
T TTTTTTGCAATGT GAATT TC AGGC ATTT ATAT AT GT AACAT AT AT AAATT AC AAATAAAT AAAT AAT AAT AAACAT AAT GAAAAT AAT AT TT GA 
AAAAAATTTTTA 
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Transcript No. : CT15896 

TTATTATTGTTGTTTCGCTTGTCTCGCGATTAGAATTAATTGACAAATGCCAGCAACAAAATACATTGTATTCGGGTTCTTTTTTGCGGTATATG 
TATATTTTTCGGGCAAAAAAAATGCACTGACAGAAAAGTGGTAGCACTTTAGATAGGAGAGAATAGTGGACACACGCCAACTAACGCGACAGTAG 
ATAACCAGTACAGCGTTCACAGTCAAGCTCGAATAAGAGCAACAGGATGCACGAACGAAGCCGCAAAACGGAGACGGCGAGCAGCGTCACTGAAT 
AACATGAATAAGTCATAACTAGTGTTGGCCTTTGCCGTTCTGCGATTGGTTAGACATAAAAACGGAGTATGTACGAAAACATTAAACGCGGTTCA 
ACGACCGCCGCCATCTACATACATACATACATATGTAGTTGTCCATGTATATGTTTCGGTTTTCGGTTCTCGGAGATAGAAAAGCATAGCGGAGC 
ATATAGCCAGACTTCTTCCATTAGCGCACCTGTCACACCAGTCTTTATCGCCAGTTTTGGGGTTATCTAGAAAAGGCAACAGGTATGAAATCAGC 
TGGTGCAACTAACGGTTTCGATTACCCGGAATATTAGAGGAATACGCAAATAACCGTAAAATCCAGCTAAATGTCATATGTACGCCAGCTGAAGC 
GGCAGAAAATGTCCCAGGCGAAGAAGTTCGGCGATGGCGATCCGTTCAACTACCAGGAGCTCAACATCATCGGCGAAGGTGCCTATGGAACGGTT 
TACAGAGCCCGCGATGTGATCACCGGCAACATAGTGGCGCTGAAGAAGGTTAGGATATCGCTGAACGAGAACGGTGTGCCAATGTCCACGCTGCG 
GGAGATTTCGCTGCTGAAGCAGCTGAATGCCAGCAATCATGCGAATATTGTCAAACTGTACGAAGTATGCCAGTTTCTGGAGCGCGACGGCCAAT 
TGCTGATTCTGTTGGTTTTCGAACATGTGGAGCAGGATCTTTCGGATCTTATAGACAGACTTCCCAAATCGGGCATGTCGCCGCCCACAATTCAG 
CGCTTGTCAAGGGAACTACTGACCGGTGTGGACTTCTTACACTCACACCGTATCATCCATCGGGATCTTAAGCCGCAAAACCTGCTGGTCTCCTC 
GCAGGGCCACCTCAAGATCGCCGACTTTGGCTTGGCCAAGACCTACGGTTCGGAGATGAAGCTCACCTCCGTGGTGGTCACCTTGTGGTACCGCG 
CACCCGAGGTTCTGCTCGCCCAGCCCTACAACAGCACCGTGGACATCTGGAGTGCCGCCTGCATCATCTTCGAGATGTTCAATCGGCGCGCCCTA 
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TTCCCGGGCACCTCCGAAAAGAATCAGCTGGACCGCATCTTCGAACTGACCGGTAGACCCACCGAGCAACAGTGGCCACAGACCATTTCCGTGGC 
CCTGGAACACTTTCCCCAGCGGCATCCAAAGAGACCAAAAGACTTTTGCCCACACCTGTGCAAGTACGCCGATGATCTGTTGAATAAAATGCTTT 
CCTACGACCTCCACCTGCGACCATCCGCCTTGGCCTGTTTGGAGCACGATTACTTTCAGCAGGAACCCCTATAGTCTCGATGCCCAGAAGGATAA 
TGGATAATGGATTATGGATTATGGATTATGGATTATGGATGTGGAAGTGAAAGTGGCAGTGGACTGATCATTGCGGCCGCGCTTCGTTCTTGTTA 
rCAATGTTCCTTTAAGTGTAGTGAAGCATTTTGTTTATTTTTTCGCGACAGTCCAGTTATTGTGTCCGGGGATACGGATCCGGAATAAAGAATAC 
CTTAATTTGTTTGTAGTCGTAAGTTAAGTGTGAATACTACTGCCGATCAACTATCTGATGACCATTACATGTGAAACGATTCTCGAGGGCGTTTG 
AAGGCCCTCCAAATCGAAATGTGATACACAAAGTTCAGGAATGCATTATACAATTTCAAAATTTTGTACAACAATCATTTATATATGCAATTATA 

TATACAGCATACAAT 
(SEQ ID NO: 542) 

Start ATG: 641 (Reverse strand: CAT) 

MSYVROLKRQKMSQAKKFGDGDPFNYQELNIIGEGAYGTVYRARDVITGNIVALKKVRISLNENGVPMSTLREISLLKQLNASNHANIVKLYEVC 
QFLERDGQLLILLVFEHVEQDLSDLIDRLPKSGMSPPTIQRLSRELLTGVDFLHSHRIIHRDLKPQNLLVSSQGHLKIADFGLAKTYGSEMKLTS 
VVVTLWYRAPEVLLAQPYNSTVDIWSAACIIFEMFNRRALFPGTSEKNQLDRIFELTGRPTEQQWPQTISVALEHFPQRHPKRPKDFCPHLCKYA 

DDLLNKMLSYDLHLRPSALACLEHDYFQQEPL* 
(SEQ ID NO: 543) 

Name: Cyclin-dependent kinase 4/6 
Classification: protein_kinase 
Gene Symbol: Cdk4/6 
FlyBase ID: FBgn0016131 

Celera Sequence No. : 142000013384668 

CAGCGGCTGCGACGGCACCACCGCCGCCCACCGGCATCGAGGAGAGCTTCTCGCGACCCTCCTTGATCAGGTCGTCGATGCTCTTGCCAGCCAGC 
TCCTTGATGACCTTGGTCAGACGCTCGGCGTCGACCTCAACGCCCACAGAGCTGAGGATCTTCTCCAGATCGCTGTTGGCGGGCGAGTCCTTGCC 
ACCGAGGACGGCCAGAAGGTAAGCAGCCACGTAACGCATGTTTAAGTCTTGGGTTCTTGACACGAAACGTGAATTGGCAAAATGTACGAAAAACA 

C AAC TT T CC T T T GC AAGT TGT C GT C G AAAAAG AG GC G GAAGT C AGT GT GG C CGCGCGGTCGAGGCT CG AAAAT AC C AG C T CAT AC T TAT AT AT AC 
CAAAGTATACCACATCCTTATCTAGCAATACCCCTAATGAAACGATAAATTCAAATATTATTTTCAATTTTGCTTTTAAATAAAAATTAGAGCAA 
TATTATTGATTTTTGAACAATTTGGTGTGTTTTTTTTATATTTACAACATCAAAAAGTATTTATATTAGCTTAATAATATGACCTTTCTTACAGC 
ATGGAAATGTTATCAGTTGCAACACTGGGTTTCAAACTTGGCTATCGTAATAGTGGGAAGCACTTTGCTCTTGCAGTAGGCGGTCACACCATTTG 
AAACGGTTCTGTTAGGCGAGGTCTAACATACTGTTAAGTGCTGGAAAGAGAGCAAAATGAGAGTGATCGTAAGCTAAGAGAAGGAACTAGGGATG 

C GT AAAAGT T AAC AGCT GGC T AT C GAT GAT T AAC ATG AC C AT CGT T TT AAC AGC AGAAC T T AAC T GT T AT T T GT AC AC AT AAAT AT AAAAC T GT A 
ATAAAACTCGAAAAAGCTGTATAAAAAATAAATTTTTTTCGAAAATATGTTTAATATGAGATATTTATTATTCTTCAAAAACACTTTAAATTTAA 
GTTGCACCCCCACTTGAAATAAATTGCAGAAAAGACATGAGACCCATCTTATTGTATGCTGTATATATAATTGCATATATAAATGATTGTTGTAC 
AAAATTTTGAAATTGTATAATGCATTCCTGAACTTTGTGTATCACATTTCGATTTGGAGGGCCTTCAAACGCCCTCGAGAATCGTTTCACATGTA 
ATGGTCATCAGATAGTTGATCGGCAGTAGTATTCACACTTAACTTACGACTACAAACAAATTAAGGTATTCTTTATTCCGGATCCGTATCCCCGG 
ACACAATAACTGGACTGTCGCGAAAAAATAAACAAAATGCTTCACTACACTTAAAGGAACATTGGTAACAAGAACGAAGCGCGGCCGCAATGATC 
AGTCCACTGCCACTTTCACTTCCACATCCATAATCCATAATCCATAATCCATAATCCATTATCCATTATCCTTCTGGGCATCGAGACTATAGGGG 
TTCCTGCTGAAAGTAATCGTGCTCCAAACAGGCCAAGGCGGATGGTCGCAGGTGGAGGTCGTAGGAAAGCATTTTCTGCGGGCAAACATAGGGTA 
TGCGTTAGTCTAGACCATGCAACAAGATATGTATCTTCATTTTCATACTCACATTCAACAGATCATCGGCGTACTTGCACAGGTGTGGGCAAAAG 
TCTTTTGGTCTCTTTGGATGCCGCTGGGGAAAGTGTTCCAGGGCCACGGAAATGGTCTGTGGCCACTGTTGCTCGGTGGGTCTACCGGTCAGTCT 
GCGTGAAATCGCATAAATGTAATTAAAAATCACCGATTTCAATGGCAGTATTTGTGCACTCACTCGAAGATGCGGTCCAGCTGATTCTTTTCGGA 
GGTGCCCGGGAATAGGGCGCGCCGATTGAACATCTCGAAGATGATGCAGGCGGCACTCCAGATGTCCACGGTGCTGTTGTAGGGCTGGGCGAGCA 
GAACCTCGGGTGCGCGGTACCACAAGGTGACCACCACGGAGGTGAGCTTCATCTCCGAACCGTAGGTCTTGGCCAAGCCAAAGTCGGCGATCTTG 
AGGTGGCCCTGCGAGGAGACCAGCAGGTTTTGCGGCTTAAGATCCCGATGGATGATACGGTGTGAGTGTAAGAAGTCCACACCGGTCAGTAGTTC 
CCTTGACAAGCGCTGCAATAGGAAATAATTTCATATACCATTTTCTGAAATTTTGATATCATTTTGAGACCCACCTGAATTGTGGGCGGCGACAT 

GC C C GAT T T GG GAAGT CT GTCT AT AAGAT C CG AAAGAT C C T G C T CC AC AT G T T C G AAAAC C AAC AG AAT C AGC AAT T GG C CGT CG C GCT C C AG AA 
ACTGGCATACTTCGTACAGTCTGAAAGAGAGGGGAAGTGCAATCTTTGTATGTACAAATTGTTTGATTTTGTGTGCGACAGAGAGCGATGTATGG 
GGGAAATGTTGAGAGATATAAAAAAAATGTGTCATAAAATGCTGCTGGTTGTGGTGCGCCAATACGCCATTGGCCATACTTTGTTCGTTTCGGGA 
ACTACGCGAAGCACTCGAGACAAACACATGCGCACATGGCGATCAAAGAGCCAAAGAGAGCGTCCAGTAAGAGAGGCTACAGATCGTTTTCGGAC 
AGAGGCAAAACGTAAACGTTAGCAGAGCAGATTAGCATTATGTTATGTTAGCATTATGTTAGAATCAGTTCGGGCAGTTATTTTAACATTGCCTT 
TGAATTACTTAAAAGGCACAGTTTTATTGAAACCTTCTACTGTTACACTTGCTTTAGCAACTGGCACATTGGGTTTTTAAAAATACAGATAAATA 
CAAGAAACTTAATTTGCCATTGATTTATTTCTTGTACCCCTCCAACACTTTTGGCTTCAAACAAAATAACAGCTGCACTTTGTAGCCATTTGCAA 
CTTAACTTAACATTCTGTTGTGCGAGATCTCTCTCTCTCTCTCTCTCTCTGTCGATCTCTTTGTACTGTATTCGCGTCGCGCTCTCACGCTGCTA 
AAGTGCAAGCGAGTGCGCACATCGATAACGCGCGACAGCAACATGTGGTTATCGAGCTGCTCTGTTAATTCAAATTCCTAACGGCCGCCGCAAAC 
AAACATAATGCCAATACAAAATCGCCCACATTTGCGAACTGCAGTGATCAGTTATCAGCGGGAGCGGATCGACTTGCTTCAGACTTACTTGACTT 
GACTGGCATATGTCAGAAGCGAGGGCGTTCGCCGGGGGCAAATCGTCGGAATATTACGGCTATAGCATGCACAGATAAACACAAGCGCGCTGACA 
TTGCGATGCGGCTCTTTACAAATGAGTTTGGCAAATAGCTTTGGCGCCGCCACTTGGCCCCAATGGCGGGTGATAACACCCCTCATATACAGCTA 
GCTAGCTATAGTTCTACATAAAGTACTCAACTCCAAGGCCATTTAGAAGCCCACTGCCCTTGTGGCAGCGTGTAAATCACCCATACTCACTTGAC 
AATATTCGCATGATTGCTGGCATTCAGCTGCTTCAGCAGCGAAATCTCCCGCAGCGTGGACATTGGCACACCGTTCTCGTTCAGCGATATCCTAA 
CCTTCTTCAGCGCCACTATGTTGCCGGTGATCACATCGCGGGCTCTGTAAACCGTTCCATAGGCACCTGAAAAGTGGATCGAAATTGTACATCAA 
AATGCTTACAACGGGAAATAAATGCGAAAGACAGCCGAAAATAGAATAAAATAAAGTCAGCTGTGCCGATAAGATAAATGGCAGAAATCAGAGAG 
GCGAGAGAAAGAGAGAAGGTGCCACTTAGCTTCTTCATATCATTATGGTGTATTTATACTTTTCATATTCAACATCGTATTTATAGCCCTTACGA 
TTGAAGTGACTTGATTTATTTGCCTTGTTCGATGAACTAACTGTTTATAATGATTGAATGAATAGACAATGAATGCATTTAGCCCAAAAGAATCG 
TATTGCAATGCATGCAATAATTGCATATATTAGAGACTATATTATATGCACTCAAAATTAGTCAAAATATTGCACGAAATTCAGACTAATTTCCT 
CATGAAACTGTTGCTGTCTCAGGTACAACGTGATCTTAATTTCTACATGTTCACGGGCAAAGAAAAACGACAAGAGACGAGCAAAATTCCCAATC 
AGAATCAAATTCTGGCATGCCAGAGAAATGTCACCGAAAATTTATCACCCGTACGGACAAATGGACAAATGGACAAACGGACGGACGGACGGACA 
TGCGTGGGGGGTAGAAAAGCTAGAAGGCACCGGCTATTTCGGTTGACACTGACATACAGAAACAGCAGATATGTACTGATACACCCGAAATAAAC 
GCAACTTGAGGGGGGGGGGGGGTGAATTTGGTTTTCCGTCCCTTTGTCACCGTCTCTAGACCACCTGGCTTATGAACTGTGCCGCTGTTGCTACA 
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TTCAAGTACAAGGCCGTGAGCTCGACTCGTTTTTGAACGTCGGCAACTGGTCTGTCCAATTGGAATTGTCAAAAAAATATACAAATATACAAGCA 
AAATAAACTCGAAAAATTGCGCAAAGAAAACAACAAAGCAAAGCACTATGGGAAATTTACGCACTTTATCTGGCCGAAAACCGATTAATATATGT 
AATATTAAATGCATGCATTTAGTTACAACGGTGACTACTGTTAGGCAATGCAAAAAAGTTAATTGAATGACTATCTGCAAACTAAAAAATATATA 
TTATATATCATTAAAAATATGGGCTAGTGTAAGGCAATGCAAAAAAAGTTTGGAGTTAGTTTGGCCACAAAAAGTGGTCAAATTTTCCATAGTAC 
AAAGTAGCAAAAATGAGACTGAGCGAAAGCGAATAATAATTGAATGAATGTGCACAGAAAGGCAGACAGCCGAAAGTTTGAGTGTATGTGTGTGT 
GTGTGCGTTGTATAACAGATGAAATTGTGCGGACCCATGACTCGCACAACACCCAGTAACAATATCCATCATTAGCATTCGCCTGCCCATGATGA 
TCGGCATGATCGGTGACGTAATTGATTTGGGCTTTATTTACAGCAAACAAGCGATCGTAATTACTGACCACAAAAAAAAAAGACCATGAGCCACT 
TGTCAGATCATCATGGATGTGAGATCGGCAGAAACACGTAGTCCGCCGAGAGTGGACCTAAAAGTATGCAGGTAGAAGGTGTAGGATCCAATACA 
GGGGAGTTTTCCATCCAATATTTCCCCTTTTCGCGGAATTTCCACGCTTAATTAGTAAAGGGGTAAAAAAACAACAACATTCCTCCAAGTGCACT 
TGTCATTTCAGCTTCATAACAATGTCGGAGCTCCCGATTGATCGTAAACAGTAGTGAAACCAAATAGCCCAGACTTTGTAAGCAAATTAAAGTAA 
TCAAACGTTAACAACAAGCGACCAGCAAACGAAAGCCTCTCGTGCGAAGAGAGCCGGACCTATCCCAACAACAAAGCAACCGCGTCCCATTATTA 
AATCCCACCCAAGTCGCTCACCTTCGCCGATGATGTTGAGCTCCTGGTAGTTGAACGGATCGCCATCGCCGAACTTCTTCGCCTGGGACATTTTC 
TGCCGCTTCAGCTGGCGTACATATGACATTTAGCTGGATTTTACGGTTATTTGCGTATTCCTCTAATATTCCGGGTAATCGAAACCGTTAGTTGC 
ACCAGCTGGAAGTGAAGGGGTGTTTTTTTTTTGGTTATCGATCGTCTGCATGTCACGTATCGATATGTCCTTGTGTCCCCACGCGATAGTTCACA 
CAGATCGTAGCGTGAGAGGGAGAGTGCGAGTGCGTGAGAGAGAGAGGGAGGGCCTTACCTGATTTCATACCTGTTGCCTTTTCTAGATAACCCCA 
AAACTGGCGATAAAGACTGGTGTGACAGGTGCGCTAATGGAAGAAGTCTGGCTATATGCTCCGCTATGCTTTTCTATCTCCGAGAACCGAAAACC 
GAAACATATACATGGACAACTACATATGTATGTATGTATGTAGATGGCGGCGGTCGTTGAACCGCGTTTAATGTTTTCGTACATACTCCGTTTTT 
ATGTCTAACCAATCGCAGAACGGCAAAGGCCAACACTAGTTATGACTTATTCATGTTATTCAGTGACGCTGCTCGCCGTCTCCGTTTTGCGGCTT 
CGTTCGTGCATCCTGTTGCTCTTATTCGAGCTTGACTGTGAACGCTGTACTGGTTATCTACTGTCGCGTTAGTTGGCGTGTGTCCACTATTCTCT 
CCTATCTAAAGTGCTACCACTTTTCTGTCAGTGCATTTTTTTTGCCCGAAAAATATACATATACCGCAAAAAAGAACCCGAATACAATGTATTTT 
GTTGCTGGCATTTGTCAATTAATTCTAATCGCGAGACAAGCGAAACAACAATAATAAGCGACACAGTCACAAAACTGGGTAGAAAGAGAGTGTCC 
AGAATTGTGAAAGGTATTACGAGGTATTACGAGCTGTATATTTGGGTCTAAAATACATAGTTATTGTTGCACCAGTTTTCGCCCCCTCTCTCTCC 
CTCGTCATCCCTCTCTCGCTCACCATTTGCGCTCTACTGATAAGGAAATTTCGGCTTCACTTTACTTTTTTTTCGGTGAACTTCAATGCCAATGT 
TGTCGTTTTTCTTTCAACCGTTGTGCGAAATATTAGTTAAATATTAAGAAAATGTGGTTTTTATTTGCGAAGACTCGAACACACACACAACGCCG 
CTAAGTGATTTCGCTTGAGAGATCGGTCTTACACACACACACATACACAAACACAAACGCACACACCGAGTGGTACACACACACAGGCGCAGGTA 
TTATGTATACATATATGATTTTTATTCGCTTTGACGAAATTTTGGCATTTGTTGATAATCCAGCTAGAACCGCACGATAGGAGCAACGCGACGCG 
AATTTTCGGTGCGAACACGGTTCAGTTGCCGGCAGTCGCACTTTAATTAAGTTTTTAAAACGTCTTATAAATTTTTACCAATATTCGTAAGAGTT 
GAGTGATTTTGTGTTTTGTTGTTGGCACAGCTCTGGAAACTGTTTTAGATGGTCTACTTAACAGTGGCTTCTCTTCGCCTGTTATCGCCCGCTGT 
TAGCAAACAGACGGCCGCTAACATTGTGCTGTATACGTCCAATCGGTTAAACGGTTAATTTCGAAATTGCTGCCAGCGATCGTTATTGAATGTTA 

T TC C AAT T C G AT AAC GT AAGT GT T AT T GT GT TAT AC T AAT GT TAT T GAT T T C T G GG AAT T T GAT AT T T T C AACT T GT AAT AATGC C C AAAT T T AT 
TTTTTTTGCAATGTGAATTTCAGGCATTTATATATGTAACATATATAAATTACAAATAAATAAATAATAATAAACATAATGAAAATAATATTTGA 
AAAAAATTTTTATTACATACGATAAAATGTGAAAATTTTAAAACTGTCGTGGTCAACTACAATTGGTCGGCCATAATTAGTTAACTCCAAACATG 

T GT GTC AT T G AC T T T T AGC AT T T T T AGG ATT T GGT AAACT T AAAT T TT T AT AAC C CT GC AAAAT AAT AC AAAAT T C T GGAAT AAAAT GT CT T AAT 
GATCTAGATTGGCGGCATAATTCAATTTGTAACTCAATCCGGCGATAAAGTCACTCGTTGAATGTATAATTTATTGACATGAGTTGAATGCCAAA 
AGTCGCTCAACATTTTTTTGAGTGAAAGCTTGCTTACCGATTTCCAACGATCTGGCCGCACTGCCAATTTCGAAATCGAATCAAGGCATTCGCAG 
TGGGGCGCGTAAATTTAAAAACAATGTTGCAAATACATTGTGAAACCAATAAACGCATTTAACAAACCGCCTATCACAAATGTTACCCATTGGAT 
TGGCGAAAATTTCCATGTTTTTTTTTGGGGAAAAAGCGCGATTAATTAGTAACGAAATCAAATTAGAGCACAGTGACGAAGTCAGCCGAGTGCTA 
ATTACCACCAGCGGGTATCAGGCACCTAAATTATGCATGACGTACGCC 
{ SEQ ID NO: 544) 

Exon: 6933.. 6526 
Exon: 5610.. 5437 
Exon: 3581.. 3416 
Exon: 2300.. 2165 
Exon: 2102.. 1774 
Exon: 1708.. 1573 
Exon: 1500.. 1001 

Start ATG: 5539 (Reverse strand: CAT) 
Transcript No. : CT16072 

CTCTTACGAATATTGGTAAAAATTTATAAGACGTTTTAAAAACTTAATTAAAGTGCGACTGCCGGCAACTGAACCGTGTTCGCACCGAAAATTCG 
CGTCGCGTTGCTCCTATCGTGCGGTTCTAGCTGGATTATCAACAAATGCCAAAATTTCGTCAAAGCGAATAAAAATCATATATGTATACATAATA 
CCTGCGCCTGTGTGTGTGTACCACTCGGTGTGTGCGTTTGTGTTTGTGTATGTGTGTGTGTGTAAGACCGATCTCTCAAGCGAAATCACTTAGCG 
GCGTTGTGTGTGTGTTCGAGTCTTCGCAAATAAAAACCACATTTTCTTAATATTTAACTAATATTTCGCACAACGGTTGAAAGAAAAACGACAAC 

attggcattgaagttcaccgaaaaaaaactggtgcaactaacggtttcgattacccggaatattagaggaatacgcaaataaccgtaaaatccag 

CTAAATGTCATATGTACGCCAGCTGAAGCGGCAGAAAATGTCCCAGGCGAAGAAGTTCGGCGATGGCGATCCGTTCAACTACCAGGAGCTCAACA 
TCATCGGCGAAGGTGCCTATGGAACGGTTTACAGAGCCCGCGATGTGATCACCGGCAACATAGTGGCGCTGAAGAAGGTTAGGATATCGCTGAAC 
GAGAACGGTGTGCCAATGTCCACGCTGCGGGAGATTTCGCTGCTGAAGCAGCTGAATGCCAGCAATCATGCGAAT AT TGTC AAACT GTACGAAGT 
ATGCCAGTTTCTGGAGCGCGACGGCCAATTGCTGATTCTGTTGGTTTTCGAACATGTGGAGCAGGATCTTTCGGATCTTATAGACAGACTTCCCA 
AATCGGGCATGTCGCCGCCCACAATTCAGCGCTTGTCAAGGGAACTACTGACCGGTGTGGACTTCTTACACTCACACCGTATCATCCATCGGGAT 
CTTAAGCCGCAAAACCTGCTGGTCTCCTCGCAGGGCCACCTCAAGATCGCCGACTTTGGCTTGGCCAAGACCTACGGTTCGGAGATGAAGCTCAC 
CTCCGTGGTGGTCACCTTGTGGTACCGCGCACCCGAGGTTCTGCTCGCCCAGCCCTACAACAGCACCGTGGACATCTGGAGTGCCGCCTGCATCA 
TCTTCGAGATGTTCAATCGGCGCGCCCTATTCCCGGGCACCTCCGAAAAGAATCAGCTGGACCGCATCTTCGAACTGACCGGTAGACCCACCGAG 
CAACAGTGGCCACAGACCATTTCCGTGGCCCTGGAACACTTTCCCCAGCGGCATCCAAAGAGACCAAAAGACTTTTGCCCACACCTGTGCAAGTA 
CGCCGATGATCTGTTGAATAAAATGCTTTCCTACGACCTCCACCTGCGACCATCCGCCTTGGCCTGTTTGGAGCACGATTACTTTCAGCAGGAAC 
CCCTATAGTCTCGATGCCCAGAAGGATAATGGATAATGGATTATGGATTATGGATTATGGATTATGGATGTGGAAGTGAAAGTGGCAGTGGACTG 
ATCATTGCGGCCGCGCTTCGTTCTTGTTACCAATGTTCCTTTAAGTGTAGTGAAGCATTTTGTTTATTTTTTCGCGACAGTCCAGTTATTGTGTC 
CGGGGATACGGATCCGGAATAAAGAATACCTTAATTTGTTTGTAGTCGTAAGTTAAGTGTGAATACTACTGCCGATCAACTATCTGATGACCATT 
ACATGTGAAACGATTCTCGAGGGCGTTTGAAGGCCCTCCAAATCGAAATGTGATACACAAAGTTCAGGAATGCATTATACAATTTCAAAATTTTG 

T AC AAC AAT C AT T TAT AT AT GC AAT TAT AT AT AC AGC AT AC AAT 
(SEQ ID NO: 545) 
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Start ATG: 480 (Reverse strand: CAT) 



MSYVRQLKRQKMSQAKKFGDGDPFNYQELNIIGEGAYGTVYE^ARDVITGNIVALKKVRISLNENGVPMSTLREISLLKQLNASNHANIVKLYEVC 
QFLERDGQLLILLVFEHVEQDLSDLIDRLPKSGMSPPTIQRLSRELLTGVDFLHSHRIIHRDLKPQNLLVSSQGHLKIADFGLAKTYGSEMKLTS 
VVVTLWYPAPEVLLAQPYNSTVDIWSAACIIFEMFNRRALFPGTSEKNQLDRIFELTGRPTEQQWPQTISVALEHFPQRHPKRPKDFCPHLCKYA 
DDLLNKMLSYDLHLRPSALACLEHDYFQQEPL* 
(SEQ ID NO; 546) 

Name: cyclin-dependent kinase 4/6 
Classification: protein_kinase 
Gene Symbol; Cdk4/6 
FlyBase ID: FBgn0016131 



Celera Sequence No. : 142000013384 627 

TGGGGATGCTTATAAGGAACTTGGACCAAATATGGTATATTCTCACCTTGAACTCCTTCGCCGCCGCCGGATACTTGGTACAATCTAGGCGACCC 
ACTCGAACGTTGGTGGCATGCAAAGCCTGAGCCACCAGAGCAAAAATGGGCTCCGTTTTCTTGCAGTAGCCGCACCACGGGGCATAGAACATGAC 
CAACCATTGGCCCTCGTGTCGCACATCGATGAACCGGTCACTGAGCTCCAAAACCTTCGAGGATAGCCCCGTCGAACCCAGAGTCAGAAGCAGAG 
CTGTTGGTTTTGTAAATCATTTAGCATCAGCTTGGATTTTTTCTATTGTTCGTAATTTATTTACCACTTATGAGGCCGAAAATCCACATAGAATT 
TGGCGACATCCTTGCAGCAGGTTTAATTTGTACTTCCAAGTGTAATTTACGTGCCCCTTATTTGCTATATCATGCTTTCCGTTATTTTAATAGCT 
TCTTTACTTAATTTAATTAGTTGATTGGTTTAATTTTCAGTTCCAAATTATTTTCCTTCGGACCACAGCAACAACAAAAGAAAACATTTGCAAAA 
ACCACCCCCTTTAGACTGACCAGATGTGGAAAATATCCAAAATACTTAAGGAGATACTCTTTAGTCCAATGCGTAAATTGAACAAGTTTATTTAT 
TAATTTTGTAGTTGTTTGGATTATGTTTTTAAAAGATTATTGAACTCAAGATTTTCATTCTACTATAACCTTAAAACTAAAATGAATTCGAAATA 
AAAACTAATTAAATTTTTTTATATGCATCTCTAAGAATTTATCTTTAATAATGACATAGTTAATAATTACATTTATAAAAGAATTTTTAATTTTG 
AAAATAAGTTTTTGAGCGGCCATAGAGCGGCTGCCTAAGGGTTAATCTGGTATATTTTTAGTCAGATGGCAACTCTTCAGCTTTGTTTACCTCAC 
AAAATAGACTATCTGGCAACTCTTAGGCACACACGTGCATCTCCAGAGGAATTTTGTTTCAGTTGTTGTTCCTTATTTCTTCTTAACTAAATAAT 
T GT AT T AAC T AAAT AAAAT G AC G AAT AAT GCG AAG AAGG AAC C C CAGCAAAGGGGC AAGCAT C AGGT GC AT AAT GT GCGC TTC T AC AC T AT AAAG 
CCGCGTGAAATTGTCAGCTTGGCGTATAGTAAATCGAGCAAGTGCCTCGCTCTGAGTCGGTAAATTATTGAAAATTAAAAAAAACCTAATAATTA 
CTGTGCAGTAAACTAATAAGTACACTCTTTGCAGTGCAACACCCGTCATTGAGCTATGGAACCTGGAGCACGCCCCGTATTTGGATCGTGTAATC 
CACCTGCCCACCGATTCTCATGTGGAATCTATAGCCTGGGCGGGAAACCGCTTATTTTCCGTGGATTTAAGTGGTAAACTGATCGAGTGGGATGT 
GATTAAGCTAAAACAACGATATGAGCACTCGCCCACTGGCAATGCTCTCTGGAGTATCGATGTGAATCCCGCGGAAACGGATATTGCCATTGGCT 
CAGAGGAAGGGCACATCAACATCCTGAGTATTGAGAATGACGAAATCACCTATAAATCCCTATTTAACAAGCAAAAAGGTCGCGTTCTCTGCATC 
AAGTTTGATAAGACTGGAACTAAACTGGTGACCGGCACTGAGGGATTTGTTCGCATTTGGAACGTCCTCAAGGGAACCACTCTGCACACGATGAC 
CCTATCCGAAAAGAACGTAATAGTTTGGTCCCTGCAGGTGCTATCCGATAATACGATTATTGCCGGTGATTCAGCGGGATTTGTTACCGTTTGGG 
ATGCGGATAACGCGACACAGATAGATTCCACACGAGTTCTGGATAAAAACGTATTTGCCCTGGCTGTGAATAATAAGGAGGATCGATTAGTTTGC 
AGTGGAATGCAGCCGCCGCTCATCCGGATTTTCAGCAAAACCAAGATAAAGCGGGAAGAGTCCACCAGCGAGCGTTGGATTAAATTCCTACAGCG 
CGATGCCCACAAACACTATGTGAAATCCCTGTTGGTTATCGACGATCAAATTTATTCGGGAGGCAGGGATGGCATATTGACCATTACGTCCAGTG 
AACGAATGCAGGCACATTTGTCGCAGCATGCTCCCTTTCTGAAAGGTTCCGTAGCTTCCATGGCAATCAGTAAAAAGCTTTTGCTTTTGCGATAT 
CCCAACAGCGTGCACCTGTGGCGCTTGGGTTCTGTGGCCCCGCAAGGCGAGGACAAGAAGAAACCCTGGGCACTGCCCGTCGGCCACACGGAGGA 
ACAACTTCTGGAGCTGCCACCCCAGAAACTTCTCCAACTAAATGTCAAAGAGCACAACTTTATACAGTCTGCTGCCATTTCATCAGATGCAAACT 
GGATTTGCTACTCTACCTTAAAGGAACTCCGAATCAGTCGTTTGAAAAGTGATCCCCTGCAGGTGGAGCGTCTGGTAGACGATCTTCCCGAGGAA 
CTTCAACCAGCCAGTCATATAATTTTCACAAAGCAGGGAGATCTAGTGCTGCTCAACCCGCAGAACAATCATCTT^AGTTGGTTTACTTTAGAGGA 
AGATCTGGTTACATTCAAATATACCATCGATTTGAGTGAGAAGTGTAAGAATACGATTAGCCACGTGGTTATCTCTTCGCATGGCGAATACATTG 
TGGCTGCGAGCTCAGACCACATAATATCAGTTTGGAAACTACATGGAAAACAATACAAGCACCTGCTGAACCTACCAAGACACCGAGCAGGTACC 
ACCGCCCTATCCATGCACGAGGATTACCCACGCGTGGTGGTCGCCTACGCCAATGGCCAGCTGGTGGAGTACGATTTGGTCAATCGGATATTCAC 
CTGTGAAACTAATGAGTACCTGATACCCGAGACCAGGCGCCATTGCATCAATGGAATCAGCCTGGATCCACAAAATCGTAATATATTTATTGTTC 
ACACCGAGGGCAATTTGTATGTGTTGGAGAGGGATCAGCATCTGGATCCCAAAGAGCTTGTGAGCAACAGCAAGTCTAAAAAGTTATCAAATGGA 
AACCGCTCTTCAGTTGATGGCGGCTCAAAAGGCCTGAGCCTCAAGACGACTCTGTCACGACAGGTAAGATTGGTAGGTCTGTTTACGCACAAGAC 
TTTCATTTTGTAACACTTTGGCTCACAAGGTAATCTTAATTCATCCATGAAATAAGAATGAGAGGATTAAAATCTCCTTTTAGGCTTACCGACTA 
CCCACTTTAACGTTTTCCGTCCTTTCTATAGCTGCCACAAATCCCAAACCCGCCCCTTATTACTTGCAAACACTTTCACACCTCAATTTTTTTCC 
TTGCAGCACCTGGTGCACGTGTCTCGCCTGAGCCCTAACGAACTTGTCAACGTCAGCATTTCGACAAACAATCTGTTGGCTCCGCTGCCGCCGCC 
GTACCAGCGGAAGAAGTTCGGTGCCTCGTAAATGGAGCGCCGCAGGATGAAGTTGAAGTGGAGTGGAGTGGCGAAGCTCCAGCATCACTTTGGCT 
TACCCAGTAGTTTCCACTTAGTTACCAAATAAAATCAAAAGTATAAAACAATGGTGGAGTGGATTTTTCGCAGGCTTGTGCGGCTCGCACTCTGC 
ATTCGCACGGAAATTTGTCAAATGTATTGCCAAAAATTTGTCTCCGGCCGTCTCGTCGACGACTACAATGTCTCTCGTTCCCATTCCCATTCCTG 
TGATTTCTTGCAGGCGCTTCTACGTGGATTGTGCAACATTTCGCTTCGTTTTGTGTCGCCGGCATTTACTGTAATTGTTGTTGCTGTTATTTTTT 
GGTTTTACACTGTCTAGCCAATGTCTAGACAAGTGCATACCACACAGACAGTCGAAAACACACGCACACAGTGGCGGCAAGCGGCGGTCGAGGGG 
GTCTTTTGGTGGAGTCCTGGGGGGACGTCGATGCACAATTCAACTTTTAGCTCAAAAACTCATTGGCCGGCTGGGTCCGCTACCCCGCCGCTCCA 
CCATTTCCACCATTGTGGCCGAATGGTTTGGGTGGGCGTTCTGTTTCTGATTCTCGGCAAAAAAGTCCGAGTATCCCGCAGCCATATTCGAATGC 
CAGCAGAGGCCTGGACTTGGGCTGGGTCATGACCACTTTGGGATCAAAAAGGTGATTAAAACTTTACAATCGCAGCTCGGGGTGGTGATTCGTAT 
CAGACGGCTTGCTGCATCGCCGGAATTAAAATGGATGGAAAGTATTAAACAAAACTAATTTAGAAATCGCTTCGCTTATAAACTAACTTTATAAG 
C C AAGG C T G C AG GAC T TT C CT C C AAT T GC AT GC AT AT C GC AC AT AC T G AAAG GG AT AT AAC CAT T T AC AT A T T A TAAGT TAT T AGGT T C CT C AAA 
TATTGTGTTCAAATTTGATTTTAAGGTATTCAAAATTCGGTTTGGCCTTGTCAGTCGTTTTCCATCTTGTTATGTGCATTG 
(SEQ ID NO: 547) 

Exon: 1001.. 1199 
Exon: 1270. .3103 
Exon: 3332.. 3451 
Start ATG: 1063 

Transcript No. : CT16106 
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ATTTTGTTTCAGTTGTTGTTCCTTATTTCTTCTTAACTAAATAATTGTATTAACTAAATAAAATGACGAATAATGCGAAGAAGGAACCCCAGCAA 
AGGGGCAAGCATCAGGTGCATAATGTGCGCTTCTACACTATAAAGCCGCGTGAAATTGTCAGCTTGGCGTATAGTAAATCGAGCAAGTGCCTCGC 
TCTGAGTCGTGCAACACCCGTCATTGAGCTATGGAACCTGGAGCACGCCCCGTATTTGGATCGTGTAATCCACCTGCCCACCGATTCTCATGTGG 
AATCTATAGCCTGGGCGGGAAACCGCTTATTTTCCGTGGATTTAAGTGGTAAACTGATCGAGTGGGATGTGATTAAGCTAAAACAACGATATGAG 
CACTCGCCCACTGGCAATGCTCTCTGGAGTATCGATGTGAATCCCGCGGAAACGGATATTGCCATTGGCTCAGAGGAAGGGCACATCMCATCCT 
GAGTATTGAGAATGACGAAATCACCTATAAATCCCTATTTAACAAGCAAAAAGGTCGCGTTCTCTGCATCAAGTTTGATAAGACTGGAACTAAAC 
TGGTGACCGGCACTGAGGGATTTGTTCGCATTTGGAACGTCCTCAAGGGAACCACTCTGCACACGATGACCCTATCCGAAAAGAACGTAATAGTT 
TGGTCCCTGCAGGTGCTATCCGATAATACGATTATTGCCGGTGATTCAGCGGGATTTGTTACCGTTTGGGATGCGGATAACGCGACACAGATAGA 
TTCCACACGAGTTCTGGATAAAAACGTATTTGCCCTGGCTGTGAATAATAAGGAGGATCGATTAGTTTGCAGTGGAATGCAGCCGCCGCTCATCC 
GGATTTTCAGCAAAACCAAGATAAAGCGGGAAGAGTCCACCAGCGAGCGTTGGATTAAATTCCTACAGCGCGATGCCCACAAACACTATGTGAAA 
TCCCTGTTGGTTATCGACGATCAAATTTATTCGGGAGGCAGGGATGGCATATTGACCATTACGTCCAGTGAACGAATGCAGGCACATTTGTCGCA 
GCATGCTCCCTTTCTGAAAGGTTCCGTAGCTTCCATGGCAATCAGTAAAAAGCTTTTGCTTTTGCGATATCCCAACAGCGTGCACCTGTGGCGCT 
TGGGTTCTGTGGCCCCGCAAGGCGAGGACAAGAAGAAACCCTGGGCACTGCCCGTCGGCCACACGGAGGAACAACTTCTGGAGCTGCCACCCCAG 
AAACTTCTCCAACTAAATGTCAAAGAGCACAACTTTATACAGTCTGCTGCCATTTCATCAGATGCAAACTGGATTTGCTACTCTACCTTAAAGGA 
ACTCCGAATCAGTCGTTTGAAAAGTGATCCCCTGCAGGTGGAGCGTCTGGTAGACGATCTTCCCGAGGAACTTCAACCAGCCAGTCATATAATTT 
TCACAAAGCAGGGAGATCTAGTGCTGCTCAACCCGCAGAACAATCATCTAAGTTGGTTTACTTTAGAGGAAGATCTGGTTACATTCAAATATACC 
ATCGATTTGAGTGAGAAGTGTAAGAATACGATTAGCCACGTGGTTATCTCTTCGCATGGCGAATACATTGTGGCTGCGAGCTCAGACCACATAAT 
ATCAGTTTGGAAACTACATGGAAAACAATACAAGCACCTGCTGAACCTACCAAGACACCGAGCAGGTACCACCGCCCTATCCATGCACGAGGATT 
ACCCACGCGTGGTGGTCGCCTACGCCAATGGCCAGCTGGTGGAGTACGATTTGGTCAATCGGATATTCACCTGTGAAACTAATGAGTACCTGATA 
CCCGAGACCAGGCGCCATTGCATCAATGGAATCAGCCTGGATCCACAAAATCGTAATATATTTATTGTTCACACCGAGGGCAATTTGTATGTGTT 
GGAGAGGGATCAGCATCTGGATCCCAAAGAGCTTGTGAGCAACAGCAAGTCTAAAAAGTTATCAAATGGAAACCGCTCTTCAGTTGATGGCGGCT 
CAAAAGGCCTGAGCCTCAAGACGACTCTGTCACGACAGCACCTGGTGCACGTGTCTCGCCTGAGCCCTAACGAACTTGTCAACGTCAGCATTTCG 
ACAAACAATCTGTTGGCTCCGCTGCCGCCGCCGTACCAGCGGAAGAAGTTCGGTGCCTCGTAA 
{SEQ ID NO: 548) 

Start ATG: 63 

MTNHAKKEPQQRGKHQVHNVRFYTIKPREIVSLAYSKSSKCLALSRATPVIELWNLEHAPYLDRVIHLPTDSHVESIAWAGNRLFSVDLSGKLIE 

WDVIKLKQRYEHSPTGNALWSIDVNPAETDIAIGSEEGHINILSIENDEITYKSLFNKQKGRVLCIKFDKTGTKLVTGTEGFVRIWNVLKGTTLH 

TMTLSEKNVIVWSLQVLSDNTIIAGDSAGFVTVWDADNATQIDSTRVLDKNVFALAVNNKEDRLVCSGMQPPLIRIFSKTKIKREESTSERWIKF 

LQRDAHKHYVKSLLVIDDQIYSGGRDGILTITSSERMQAHLSQHAPFLKGSVASMAISKKLLLLRYPNSVHLWRLGSVAPQGEDKKKPWALPVGH 

TEEQLLELPPQKLLQLNVKEHNFIQSAAISSDANWICYSTLKELRISRLKSDPLQVERLVDDLPEELQPASHIIFTKQGDLVLLKPQNNHLSWFT 

LEEDLVTFKYTIDLSEKCKNTISHWISSHGEYIVAASSDHIISVWKLHGKQYKHLLNLPRHRAGTTALSMHEDYPRVVVAYANGQLVEYDLVNR 

IFTCETNEYLIPETRRHCINGISLDPQNRNIFIVHTEGNLYVLERDQHLDPKELVSNSKSKKLSNGNRSSVDGGSKGLSLKTTLSRQHLVHVSRL 

S PNELVKVS I S TNNLLAPLPPP YQRKKFGAS * 

(SEQ ID NO; 549) 

Name: WD-40 repeats containing protein 



Celera Sequence No. : 142000013384256 

GAACCAATCGATATTTGCAGTAGGGTAGCGCAATAAACTTACGTTTTATTTATTGAAAACAAATCGGTGTCACCACCAGCCGCCTTCGATAGAGC 
TGTTATCGTTATCGCAGCCACCTATCGTTTAATTCGCTGGGCGTTTCACAATTTTTTTGGGAAACACAACAAAGCTTCACAAAGGACACGATGCT 
CGTTCTGGTACTCGGCGACCTGCACATCCCGCACCGGTGCAGCAGCCTGCCGGCTAAATTTAAGAAGCTGCTGGTGCCGGGCCGCATACATCACA 
TCCTGGCCACCGGAAACATCTGCACCAAGGAGTCCTACGACTACCTGAAGTCCCTGGCCAATGATGTGCACATAGTGCGCGGCGACTTCGACGAG 
AACCTGACGTATCCGGAGCAGAAGGTGGTCACGGTAGGCCAGTTCCGGATCGGTCTGTGCCACGGCCACCAGGTGGTTCCCCGCGGAGACCCGGA 
GGCGCTGGCCCTCATCCAGCGGCAACTGGACGTGGACATCCTGATCACGGGGCACACGTACAAGTTCGAGGCCTACGAGCACGGCAACAAATTCT 
ACATCAATCCGGGATCGGCCACGGGTGCCTTCAACCCACTGGACACCAATGTGGTGCCTTCGTTCGTGCTGATGGACATTCAGAGCACCACGGTG 
GTCACGTACGTGTACCAACTGATCGGCGACGAGGTCAAAGTGGAGCGCATCGAGTACAAGAAGATCTAGGGTCCTAGTATTAGCCACCAGCCATT 
CCCCTCCAATCCAACCCTTGACCCCTGCATTCTGAGCTCGGCTTCATTAAAGAAATTGTAAATGTACGTAGCAATAAATCAAGTTCTCATTTTTT 
TATCTGATAAAAAATGCTAGCACCTTGTTTAAATAGAGAATAAAAGTACGAACGGACGAAAGTGAAGTTTTAGATTTAATGCAATAATAAACGAT 
T AC AAAGTAAGGAAAAAC ACCCCCAGAAAT T GTGAT ACT CCC AC AAT AATCT GAT AAAAAAGTTT AAATT TTAT T AAAT AGAAT AT AAAAAGGAG 
TTACATCTGATGATCGAGTAAAGAGTCTGCTCAAAAGTTCGGATTTCATGCTCAGTTTCTCATTGCAACACACTCTCGATATAAATACATTGGGT 
TCGGTTTGTGGCCATCGCTTTTCCATCGAAATAAATTATATAAATACTTTCCACGAGCCAGGACAGCAATGTTTGGCAACATCATCAATACGAAA 
T GT T T G AT C T AAAAAT C T T T G AAT T AC G AAAAC T TT GC AC T TAG AT T A AAT AC AAT TAT GAT T GC T GC TG ACG AC AG T T G G AAAAGC AACGAAC A 
GGATCCCCGAGGAGGAGAACCGCCTTAGCCGAGGCTGTCCACGCACACAGTCTTCAGGGTCTTGCCGGCAATCGTGGTCTTATACTGCTCCGGTA 
AAGCCAACTCGGTCTGGAAATCACAAAAAAAATAAATAAATGACTGAAGACAAGAGACAAATTTTTGAACTCACATAATCGCCGCTGTCGTCGGG 
CAACAGGATGTTCATCTCCGAGGACTTGGAGTTGACGATCTCCACGCCCAGCGAGTCCTTGGACAGATACATCTGGCAGCCGTCGGTCTTGTCGA 
TCGATACCGTCGGCACCGAGCCGAGCACCTGCATCTGCACGCTCTGGCAGTTGACGAACTCCACGGAGGCCACCACGGAATCGAACAGCAGGGAG 
CACTTCTTGCAGGAGTCGAAGACGATGTTGTTCACCTTGCCCTTGACGGTCAGAGTGGATCCCTCACACTTGAACACATACACCACGTTGTTCAT 
CTCGGCGTTCTCCACCAACAGCCCGGTGTTGTTCTTCTGGTACTCGATGATCCACTTCTTTCCGTCGCGCTCGAAAACGGGCGCCTTGGCTTGCG 
CAGCTGATGGAGCAGCAACGGCCTTGCTGCCGCCCGACTGGGCTGGCGATTTGAACGGAGCCGGTCCAGTGCGTAGAGATGGATTTTTATGGGTC 
TGCATGTCGCCAGTGACCTTCTTCAAGCCTGCGGGTTTATCAAAAGATACATTTTCTTACTATAGGAGCACTTAACCATTTTTGCAGTTGCTGCT 
CACCCTTGGTGATGTCGGCACCCTGGTTGATCTGGGCGAAGAGGGCGCTGCGATCATCTCCGGCGCTATCCAGCTTGAGGGCACTCAGATCCAGC 
ATGGGCGGCGGTGGGGGCAGACCACCGGGTGGCGGTGGTGGCGGGGCACCTCCGGCGGGGGCCGCTCCCTTGCCGGACCACACCAAGCCCGTGGT 
GTGGTACTGCCGGATGTAGGCCTGCAGTTCGGTGAGCGTCTGCACCCAGGCGCGGGCCCACTCCACATGCGTAACATCCTTCTCCTTCCACTCCT 
TGAGCACGCGGTTCGTGTAGAACTGGCCGGCGTCGTTCATCTCCTTGACATAGGGACCCGGGGTCTTCTCCACGCACACCCAGCCCAAGGCGGGA 
ATGCTTTCGCTGATGGCCGACAGGTGGTTGAAGAAGGGCGAGGAGCGGTGCTTCTCGCGGAAGTCCTGGATGGCGCTGATCTGCGTCGAGGTCGG 
CTTCAGAAGCTCCGCCTGCTTGGGCTGCGCCGGCTGGGCGATCTGGGTGGCCAGCGTCACGTACTGCAGTTGGGAGCTGTTGCGGAATTGAACGA 
TGGCATTATTACACTTTTACCATAGATGACTCATCCAGACTGACTCACCCAAAGGCGCTCTTCACGAGCTCCGCATGCTGCGCCACATCGCCGCC 
AATTTTGGCGGAGAGAGTTAGGTATTGGCTCAGCGGACCCGCCACAATGTCCTCGAATCCGGCGACACTCATGTTGCTGCTGGGCGGCGAGGGAG 
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GTGGAGGTGGTGTCTCTGCTTTTTCAAAAACTGGAAGAGCTTCTTCTTCTTCGGCTGGCGGAGGCGGGAGTGTGGGCGTGGGCAGTTCTATCGGC 
TGTGGCGCTGTTAGAGTCCGTTCCAATCGGTCGACGAGGGTCTCCAGTCGCTCGCAAATGCTCTCCAAGTGGTCGACGGCGGCCGAGGGCTTGGC 
TTCCGGCTCGTTATCTGAACCAAAATGACGCGGAGGAGAACGGCAAATGTTTAGTTCGTTTTGGCAATTATCTCGAATTATTCAAATAACACTGA 
GTTTTATAAAGAAAATAACTTGAGTGAGATGATGCTTCAGCGTGATAAACCCCAGTGAATCGTGCACTCACAATACCTGACAAAAAAGGCGATAT 
AAACTAAGGCGTCTCGAGAGGAATACCTCATCACGAGCTTGTGTATGATGATGAACTGGTGCGCTTAATACTTATAAAGTTCAATATGCAAAAAG 
AGCTAATCCCAACGAGTGAAAGTAATAATCCACATGCAAAAACGGCACAAGACAACGAAAAATCATAAGAGAGAAGAACTAAGTTGAACCTTTAA 
ACGGAAGCACTTGACTTGCGAACTTATTGGCCATCAGGAAGTTACGTGAATCGCTAGACTCTCAAAACTTCTTCACAGCGAAAACGAACACATAA 
AGAACACAGAAAGTCAAGCATCCGCTTAAATAACTGCTGAGATCGAAAAAAAAAAGCAAAAGAATCGGACTCTATCGAAGCTCGCATACCCCGCA 
CGGCCACGCTATGCTCCACAGCCGCATGGGCGGCCCGCAGGAGCTCCGTGTAGAGACGCTGGCGCTTCAGCTCCAGCTCGTCGCGTCCCACCGGC 
GTGACCGGGGGCTCGATGCAGCCGCTGGACATGGAACGAGGACAGAGGAGACGATGTACTGGCAATTCAAAGCGGGGCTGGGAATGCAGGGGTCG 
ACCTGTTCTGGTTTCAGCTTAACTATATGGATATTTTGGGCGTGAAACCAAAACAGGATCTTGGGTAAGCATACATGTCGTTCAGGCGCTCGTCG 
TCGAACCACGAGGGTAATCCGTGGGCAAAGAAGAGTGTTTCCGAGACGACATCCCCCTCGATGTAGGGTGCACACATGCTCTCCCGCTTGGGTGG 
CGGCAGTGAGTCGGAGAACACCTCGTCGTCGTCCAGCAAGGAGCCCTGATTCTGGCTGGCCAGCAGGAGCAGTGGCAGTGGATCCGGCTCGATGA 
TCTCCGCCCGGTTGCTGATATTCACCTTCTTCTTGATCGACTGATGCGTTGAGTTGTGCCTGGACAGGCAGCTCTTGGTGGGCGGCGGCTGAGGA 
GGCGGTGTGGTCGGCGTGACCGTCGCCACCACCACCACTTCGTCCGCCTCCTCCGCGTCGCCGCTGGTCATCGAAGCAGGTGGTGCCGTGGATGT 
GGAAGAGTGCGTTGAGCGGGCCAGTGGCTCCGGTTCTTTGGGTGCTAGTGCTGGATTTGCTGGTGATGATTTGCTAGCCGCTTCAGCCTGAGTTG 
GTTGGGCTGTGGAAGTGTTGCTTAAGATTACTCTACATGAAGCGACGAATTACCAAGACGATCATCAAGGAAAGCGCCGATACTTAAGTTGATTA 
CTTCACTTCACTCTCTGCTCACCTCGCTTCCTGTCTTCATTATGTTTATTTATCGCATTTAAAAGCATTTACCATGGTCACTGGCAGATTGAGTA 

GT T AGT T T T TAC T GC T C AC C T T AT T AT T AT AG AAT C T AT AG AAT C T T G AAT T T T C T GT C T AAAT TAT T G ACCT T C ATT T TAG T T T C T AT ATT CGT 
CACTACTATATACTATACGTTCACTTTGATGGATGAGCACGATCTAAGTGCTCAACAAAAAGTGTGTATGCAATGTCATTAGCTATTCCTCATTA 
ATCAACTGAGAATCGGCGCAATTCCTTGATATTCATATGTATAAAATTCACAGAAATAAACCGAAGAGAAAGCACAGTGAAGACAACTCATAAGT 
AAATAGAAAAGGACACTCAAGGCAGCTTGAAGTGCCCGGATTGAAATGAAAGGAAAGATCAGAAGTGGTGGTCAGCTGCAGTGGCATTGCAGTGA 
GTGAGAGCCGGCCGGATAGGGGCCTTCGTTCCGGAGAGCAGACAAAAGCCGAGACAGCTGTTGAGCTCGAAGGCCGGAGAACGCGAATAGATGGG 
GTGGTGGGTAAATACTTAGCGTTGCGTCAGTACCAGTGTTTGGCTGGGTGCTTGTGGGCACATCCTCGGGATTATGCGCTTCCAAGAGCTCCTTT 
TCTGGTTTCACAGGCTCTGGCTCCTCCTTCGGAGTTCCTGGCTTGCCGTTCAGCTGGGGTTCCTCCTTTTCGGTGACCTGTCCGCCGTTCTCCTC 
CTCTGCGTTCCGCTTTTCCTTCTCCTTCTTGCCCGCCTTGGGGTTCGATCGGCAGCAGGAGAACATTTTGCAGGCGTGTGGCTGGGCTTTTCCGG 

T G C T C AAC AC T C T AGGT AC G AAT C AAG C ACAT T AT GT AAGT T CCGTGGCTC G AC T GC GGAT GC C C GT G AAG C T C AAGC GGC GGT C AC AAC AC T G A 
CGGCACACCAAACCAAATCGAATCAGTAGAAGCAGTAACAGCAGCAGCCATATAAGACACTCAACGGGTTTCCAACGAACGTGGCCACTCCTTGC 
TCACTGTGTACATATATTCAGTGTATGGCCCAAATTTAGTGGCACATTAATAAGGTCCTCCGAGTTGTCCGCGCCTTCAACTGGGTCAAGTGACT 
GCCGACCAAGCACAGAACCAAATGGCACACAAATGCGATTACACACGGAGAACAGAACACACCCTAAACGAAGCGCACGCCCACGCACTCGGGGA 
TTCGATTCGATTTCGAACTCGAACTCGAAGGTCTCAACAGGACTCTGTTAGGAGAGCTCCGCGCTGTTAACTGCTGAGTTTCGCCGCCTTGTTGG 
GCTACTCCGCGCTGCTTTTGACCATTTGGGGCCAGCCACCGACTGGCAATGAAATGTGTTTCATTAATAACCGCTTTTGGCCTGACATGCAGGAT 
TTTGTGCAAGCTCCGGCGATGTCTTGCACTTTTGCAGCAGAGAAAGCTTACGGTCCAGGCGGAAAATGGAAAACTCTCCGAGAGAGTTGCGCAGA 
CACCACTGTGATTTCGTGATACCCTTCTCGCCACGGGTGGCGCGACTTAAGTGAGGGAAAACCACGGCGATCCCATAGTTTTATCAGCCCTGTAA 
GGCAGGGTTCGCCTATCAATTAGTTGTGACAACACCGAGTTATCTTTACTATTTGGAAAGTTTCCAGGAATGATCGGTAATGAAACTATGATTCC 

AT T T T G AGC AT T AC C AT AAC GTT T GT AAG CC AT AG AG AGT AGAG AG AT GC AAT GAT C AT ATC T T AGC C GGC T AT AAAAAC AGT T T T AAAAT GT T C 
AGCTTTTTGTGAAGTAATCTGCCGCAGAAGAGTATAAAAACGTCGACGTAACTGAAACTCCCATATCGTTTGCTTGCCAGTTTTTGTTTTGGCCT 
TAGTGCAATGGGTTTATTGCACCGGCAACGTTGTTTTTGTTTTTAAGTAGCTTTAAACAAATGGCTGGCAATTAGAGTGGGAAATGCATCAGTAG 
CCGTTGGATTGGAGGGAGGGGCTAAGCCGCCGAGGACAAGTAAGCCGAGATTCAACTTTCGCCTGGCGATACTAGTTCTCCTTGCAGTCTTGATG 

C GT T T T C T GC GGC AG AT GCGTTTGTTGCGTG C AG AAAGC AAAG AAG AAAT GGGG AAG AAAAAC TAG AAAC C AAG AT T AAAT GT CT G C AGCT G AGG 
AACCGTTATTCCCGCTGCGAGAGTGTGAGTGAGCTCGAATTACCAGGGGTGGGGGGTGTTCGTAATGGTAGAAGAAAAATCGCCCACCTTATAAG 
GCGAGGTGTTATTTTCTTCCACCGAAATCGAGTGAGTGGTAATATCTGCCGATTTTCAATCGCGATTCGCTGATAAAAGCGTTTTGGCCAACTTT 
TTGACTCATTCGCGCACGAATACAAGCAAACGTGTTTGCTCTTGCGTAAACTACATACATGCATACATACATATGCATATCAACTATGTGTCTGT 
GAGTCAGCGAAACAAAAACAAAAAATGGCCACAACAGCTGAGTAGTTACTTTTTTTTATCGGGGGTTAACAATCGCAATGGGAGCTGCTAAGGGT 
TAATTTGCATACACTTGCTGGCAATGAATTAGTTGTGTCTTTATGCCATGGAATCGGATATGAAATCATCGTTAAAAATACACCATGTAGATTTA 
ATTTTACAAGCAACATGAATCACACAAAAATAGCCATCACACAGCTGTCTCTTGTCTTTTTGGCTCTCTTCGCTGTTGTTGTTGTTGTGTGTGTA 
CAAACACCCACAGCCTGCATCACACACACACGCTCAAATTCGTGACAGCACCACCCACCCCTCACCAATAACCGTTAATTCCGCAGACACTTTCA 
GCCGGCGAACGGGAGAAGAGAATTTGCGAATAATAGCAAAGCCCCGTTTAAATGAACGTTTTTAGCAACAATCAGGCGTTACTCACCTTCTTTTG 
CTGCTGATGAATTTGCAAGGATCTGCAGTTGTATTCCAATTAGAACTGACGAGAAGCTGCGTAGCGAACACCGGCGAAATGGCGAAAATAATAAC 
TTGGCGTGGCCTTTGGTGGGACCAGGCGTACATTCGTGGAAATAGTCAAGCGAGTTAACCCTTATGTGGGTGGCAGAATACTAAAAATATATGAT 

AC AAC AAAT T T TAG ACTT AAG AAAACGGAT GT T T AAAT GT AT AAAAT AT GAT GG AAGT GC AAT AGC C AAGC T TAT AAACT AAT TC T T AAAT TAT T 
TGCAATGCATTTAGGACTTTTCGATATACACGTAAATAAACTATATTTTGATTGAACACGGTACATTTCTACTAGAGTTCACCTTCAGATATTGA 
CGCTATTTCTTGTCCTGTAAATTATTTTGTTAAATCGTTCACATTGTATGTCTAATAAAGATGTTTTCCAATAAAATGTTATTCCCTTTTCACTT 
AGTCTAACATATTCATTAAATAATCAATGTTTTATTTTTTATTTAGTTATTTTAATTAGTTCTTAATTATAAAACTGCTCTAGATTCTCGATATT 
TTGCGGAATCCGGCGCATGGTCACACTGACGAACACGTATTTTCATTATTCTTGCGAAGCGCATCGCTTAAAACTTGATTACGCTCGGATAACAA 
GATGCACCCGCGTCTCAAGTGACTGAACAGGTAGCGAAGTGACCCTGGATGAGCGTGGGTAGGCAGCCGGCACCTCTGGAAAGCCCTAATCCCAG 
GTCCGCCAGCCGTTTGCAGGTTATGTCCGCTTGTGTACACTGTGACGGGGAACCGAAGAAGCAACAGGAGCGGATTCCTGGCAGAAGAGACAACC 
GCTGTACGAGATACCGCCCCTGACCTGCGCAACATGATCGAGGTGCTGTGCCTGCTGCTGGGCAGGATCCTGCTGCTCCTGGTCGGCGGCTACGA 
ACTGATGTGGCGGCTCCGAGAGCGACTGAATGCACTGGCCGTCCGGTCATATGACCTTTGGCGCAGCAAAGCGGCACGTGAGGCGCACGAGCGGC 
GCGTGCTCGCTGACTGCCGCTCCCAGTTGACGAAGACGCCGCAGCATCTGGTCCTGGTTATCTCCCCCGTAGATGCCGGCGTG 

(SEQ ID NO: 550) 

Exon: 7538. .7506 
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ATGTACGCCTGGTCCCACCAAAGGCCACGCCAAATCCTTGCAAATTCATCAGCAGCAAAAGAAGATAACGAGCCGGAAGCCAAGCCCTCGGCCGC 
CGTCGACCACTTGGAGAGCATTTGCGAGCGACTGGAGACCCTCGTCGACCGATTGGAACGGACTCTAACAGCGCCACAGCCGATAGAACTGCCCA 
CGCCCACACTCCCGCCTCCGCCAGCCGAAGAAGAAGAAGCTCTTCCAGTTTTTGAAAAAGCAGAGACACCACCTCCACCTCCCTCGCCGCCCAGC 

AGCAACATGAGT^ 

GCATGCGGAGCTCGTGAAGAGCGCCTTTGGCTCCCAACTGCAGTACGTGACGCTGGCCACCCAGATCGCCCAGCCGGCGCAGCCCAAGCAGGCGG 
AGCTTCTGAAGCCGACCTCGACG 

AGCGAAAGCATTCCCGCCTTGGGCTGGGTGTGCGTGGAGAAGACCCCGGGTCCCTATGTCAAGGAGATGAACGACGCCGGCCAGTTCTACACGAA 
CCGCGTGCTCAAGGAGTGGAAGGAGAAGGATGTTACGCATGTGGAGTGGGCCCGCGCCTGGGTGCAGACGCTCACCGAACTGCAGGCCTACATCC 
GGCAGTACCACACCACGGGCTTGGTGTGGTCCGGCAAGGGAGCGGCCCCCGCCGGAGGTGCCCCGCCACCACCGCCACCCGGTGGTCTGCCCCCA 
CCGCCGCCCATGCTGGATCTGAGTGCCCTCAAGCTGGATAGCGCCGGAGATGATCGCAGCGCCCTCTTCGCCCAGATCAACCAGGGTGCCGACAT 
CACCAAGGGCTTGAAGAAGGTCACTGGCGACATGCAGACCCATAAAAATCCATCTCTACGCACTGGACCGGCTCCGTTCAAATCGCCAGCCCAGT 
CGGGCGGCAGCAAGGCCGTTGCTGCTCCATCAGCTGCGCAAGCCAAGGCGCCCGTTTTCGAGCGCGACGGAAAGAAGTGGATCATCGAGTACCAG 
AAGAACAACACCGGGCTGTTGGTGGAGAACGCCGAGATGAACAACGTGGTGTATGTGTTCAAGTGTGAGGGATCCACTCTGACCGTCAAGGGCAA 
GGTGAACAACATCGTCTTCGACTCCTGCAAGAAGTGCTCCCTGCTGTTCGATTCCGTGGTGGCCTCCGTGGAGTTCGTCAACTGCCAGAGCGTGC 
AGATGCAGGTGCTCGGCTCGGTGCCGACGGTATCGATCGACAAGACCGACGGCTGCCAGATGTATCTGTCCAAGGACTCGCTGGGCGTGGAGATC 
GTCAACTCCAAGTCCTCGGAGATGAACATCCTGTTGCCCGACGACAGCGGCGATTATACCGAGTTGGCTTTACCGGAGCAGTATAAGACCACGAT 
TGCCGGCAAGACCCTGAAGACTGTGTGCGTGGACAGCCTCGGCTAAGGCGGTTCTCCTCCTCGGGGATCCTGTTCGTTGCTTTTCCAACTGTCGT 
CAGCAGCAATCATAATTGTATTTAATCTAAGTGCAAAGTTTTCGTAATTCAAAGATTTTTAGATCAAACATTTCGTATTGATGATGTTGCCAAAC 
ATTGCTGTCCTGGCTCGTGGAAAGTATTTATATAATTTATTTCGATGGAAAAGCGATGGCCACAAACCGAACCCAATGTATTTATATCGAGAGTG 
TGT T G C AAT G AGAAAC T G AGC AT G AAAT C C GAAC T T T T G AGC AGAC T CT T T AC T CG AT CAT C AG AT GT AAC T C C T T T T T AT AT T CT AT T T AAT AA 
AATTTAAACTTTTTTATCAG 
(SEQ ID NO: 551) 

Start ATG: 1 (Reverse strand: CAT) 
MYAWSHQRPRQILANSSAAKEDNEPEAKPSAAVD 

SNMSVAGFEDIVAGPLSQYLTLSAKIGGDVAQHAELVKSAFGSQLQYVTLATQIAQPAQPKQAELLKPTSTQISAIQDFREKHRSSPFFNHLSAI 
SESIPALGWVCVEKTPGPYVKEMNDAGQFYTNRVLKEWKEKDVTHVEWARAWVQTLTELQAYIRQYHTTGLVWSGKGAAPAGGAPPPPPPGGLPP 
PPPMLDLSALKLDSAGDDRSALFAQINQGADITKGLKKVTGDMQTHKNPSLRTGPAPFKSPAQSGGSKAVAAPSAAQAKAPVFERDGKKWIIEYQ 
KNNTGLLVENAEMNNVVYVFKCEGSTLTVKGKVNNIVFDSCKKCSLLFDSVVASVEFVNCQSVQMQVLGSVPTVSIDKTDGCQMYLSKDSLGVEI 

VNSKSSEMNILLPDDSGDYTELALPEQYKTTIAGKTLKTVCVDSLG* 
(SEQ ID NO: 552) 

Name: Adenylyl cyclase-associated protein 
Classification: enzyme 
Gene Symbol: BcDNA:LD24380 
FlyBase ID: FBgn0027062 

Celera Seauence No. : 142000013384665 

ATCTCATCTCATATGAAAGCAATATCATATATGTATGTTAATATTAAATAGATATTAAACATAAAAATCCATCCCACCCACATTCCGCTGAAAAA 

TGAAAACACTCGCTTTTGCCGAAGTGGAATTTTATTGTAATTAAATTTGTTGGGTGTACAAAGGTATGTTTTAAGGACTTTCTCTCTGCATTTAT 

TACAATTTCTTCAAATGGTTTTAATTGCTGGGCTTTTGCTTCGAAGACGGGCAATCAAGTTTCAAGAATCCTATTTTCTTTAATCATACAAATTA 

TGTTTTATTTCGCTTTAATATATAAGGATACATATATATATCTGCTATATATAGAAGTGTGTTGACTTGCGTATATTTCGTGAAAAGTAAAAGTT 

TTCTTCTTCGCAATTTCCCTTGAACTAACATTCCAATTCGGGTTTTGGTTATGTTTTTTTTTTCTGTTTTACTCTCATTATTTTGTTTGTTTGTT 

TTTTTTAAATAATTTTGCATTTTCTGCATACATGTGGTTTTGTGTTTTGCAATTTTTTCAATGATTTTTCCTTGGGTTTTTATTTGTTTTTAGTT 

TTCGATTTGTTTAATAAAAAAATATGTATTTCACTGCCCAGAGCGAAAGCCATTTCTTACTTCCGCTTGCGTTAATATATCATTTTATGTATGTA 

TGGTGTAAAAACATCATTTACTGGTAAAACTCTCGATATCCTTTGTAACAGAAATCGGAACGCAAATGAAATCGAGGCGACTTACCAGCTCCTTC 

TCATTAAATACATATATGTTCTTGCAAATGTTTATAAATGTATGCATGTTTTTCCTTCTCTCGGATAATAGTGTAATCATAAAAATCCTGTCTGC 

TTTTTTGTTGTTTATTAATATGCTTCATAAGTTGTTTACGCTTTAGCGCTTTTCATGTGTTTCTTGTATTTCTCATATATGTTGTGGTTGTTTAA 

TATGTGTGTTGCTGTGTGTGTGTTTTTGTGTTTATATATATTATTTTTTTTTTGCTGTTTTTACATAGGTTTTGCCATTTTTTTGTAGTTTTTTC 

GATTTATGTTTTGTTTTCATTTGTCAACATATTTTTTCAGCTGTTGGTATTCTGTATTTTCTCTCTTTTTTCAAAGTAGCAAAAACTTGACTCTC 

ATATTCTTGCGGTTTTCTTTTTCTTTCTTTCTTTCTATTCTTCTGTTCTCTGCACAGTTTGCTTTCCATTTTTTCGTTTTTATTTTAACTCTTTT 

GCAATTTTTTTGTTTTTTCTTTGCTTTTTTAAAAGTGTATGGTTGTCGCGTTTCTTTTTCTAAACAAAGCTTTCTTAAAGTTTGAATTTGAGAAA 

AAAGAATTGAAAAAAATTACATATCGCTTTCGTGTTTGTTGTTGTTTGAAAGTGTGGATGGAAAAAGTGTACGAAAAACTCAATATGTATGTAAC 

ACTAGAAACTTCTTTAAAATTTTTAAAACACAATTTCAATACTTGCAAGAAGCGGTTTACATGTATGCTTTGCTTTAATCTGTTTATTTTGCAAT 

GGTTTTTTGGTGTTTTCATTATACGCATACAAAATAATATCGATCCAGTTAAAGCAATAACGAAATTTTTTTCATTTCACATTTTTCCGTTTTTT 

TTTTTAATTAACATGTTTATGTAAAAAAAAAGAATTTATAAGTGGACTGCAGAATTTGATTGCGTGACTTTCTTCTTTTGCTTCGTGTACTTGCG 

TAAATGCGTTCGAAATTAAATGATTGAAATTGCTTTTGTTGATGTTTCTGGGATTAGAAAAGAAAAAGCCGTAGATTTTTCGCTTTGTTTTTCTG 

TGTTAATTGTGTGTCCTTGGCAATCCTCCCCTGCGCAAAGCATCAGTGAATGCGAATTTTTGTTGTTCATGTATTGAAAGATTAGTGGAAAACTG 

CAAGGACAAAAGAGAGGCGAGTTTAGACAGCTTGAACTTTTCGGAAAATTTGTAGACTACTTACTTTTTGGGTGTAAAGTGTTCTTGTAAAATGT 

TCATACGCTTGAGCTGTTCGAGCTTAGTTGGCGGGCTCGGTGACGCGGTGCACCTGAAGCACGGCCACAGCCTCCTCCACCTTGGCCTTAAGGGC 

CTCCTGATCCTCGATCATGTGCAAGAGCTCAGAGTTTTCGATCTCCAACAACATGCCGGTGATCTTACCAGCCAAATTGGCGTGCATATGCTCAA 

TCATCGGATACAGACGCTCGCCCAGGATCTGTTTCTGCTCCTGCGGCTTGGCGTTGGCCAGCAACGAGGCAATGAGCTTCTCAGAATCTGCAAGA 

AAAATAGCAATGGTGCCATTAGTGACAAATTGTGCATATTTTCAGTCGACCTAAAACTTACTCTTTCCTTGCAATTGCTGTGGAATTGGCTGGGT 

CTGATGCAACTGTGGCACTGGAGGATTGCGCATGTTTGAGGTGTACTTATAGTTGGAAGTACGCTGCTGGGCACCGCCAGCGATTTGGGCTCCGG 

GGATCTGCATGTTGGGTGCTGCAGTTTGCTGGCCAGTAATAGCGCGTGCTCCAGTGTTGCGCATGTTGTTAGCCGCAGCAGCGGCGGCATGTGTG 

CCCTGCACCTGAGGCTGTGCCCCGCGAGCGCCAGCAGCAGCCGAACGGAACTGAGTGGGCACCGCACCGGCTGTGCCCTGGAAGCCACCAGCAGC 

GGCAGCTCCGGCCTGGACACCCTGTATGGCTGCTGGGGGACGCACCTGGGGCACCCAGCGGGGTGTGTTTCTCATCTGAGTGGCCACCTGGGAAC 

CGAAGAAACGCTGATTTGACGGAAGGGTCGGCACAAAGAAGCCGCTAGCCGCGTTGGGCTGGTAGATCTGTCCCAGCTGCTGCATACGCATGCCG 

GTCATGTGACGCATGTACTGCGAGGCGAGATGAGCCTTGCGCTCCTCCTTCCTTTGGGCCAAAGCAACGTACAACGGCTTACTGCCGACGACGCG 
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ACCGTTCAGCTCGGTGACGGCGCAGGTAGCCTCGCTTGCGGCATTGAAGCACACGAAGCCGAATCCCTTGGAGCGACCCTCCTCATCGGTCATGA 
CCTTGGCCGATGTGATGTTGCCGTATGGAGAGAACGCGATGCGCAGACGATCGTCATCGATTGTGTCGTCCAGGTTCTTAACGTACAAGTTAACG 
CCGAACACAGACTCGTGACGCTTCTGCTTCAGTTCCTCGAACTTGCGCTTCAGCTCCTGCTGGCGTTCGGCCTTCTTCTGGGCACGAGCCACGTA 
CAGGGACTTGCCCTCTCCCATGTCCTTGCCATTGAGAGCCTGAACGGCCGCCTCAGCAGCCTCTGTTGTCTCGAATGCAACGAATCCGAAGCCCT 
TGCTCTTGCCATCCTCCTTGGACATGACCTTGTAGCTGGTTATCTTGCCGTAGGGCTCAAAGAATTCCTTCAGTTTTTCATCGTCGAAATCCTCA 
GTGAAGTTCTTCACATATACATTGGTGAAGAGCTTAGCCTTCTCGCCCAGCTCCTTCTCGCGCTCCTTGCGCGGGATGAACTTACCCACGTAGAC 
CTTCTTGCCGTTGAGCAACATGCCATTGACTTTGTCGATGGACGTGTTTGCGGCCTCCTCGGTCTCGAAGTGGACGAATCCATAGCCCTTTGAGT 
TGCCCTTTTCATCGGTGGCTACCTTGCAGCTTAAAATGTTGCCGAAGGCGGAGAAAGTGTCGTAGATTGCCTTGTTGTCGATGGCCCTATCTAAG 
TTCTTGATGAACACGTTGCCCACACCTGAGCGACGAAGAGAAGGATCACGCTGAGACCACATAATGCGAATGGGCTTGTTGCGAACCAGGTCAAA 
GTTCATGGTGTCCAAAGCACGCTCAGCTGTTGATGAAAATCGGGAAAACATTTGGTTACTACCGACACGATTTGGGTATATCAAGATCATTATAA 
GTTGAAGGGGCTTAAATAACAATGTGACTCTTCAACTTCTGCACTGGCCATGAGCAAAAGCAGCGCAGTGCATGGTAAGTCAAGTCTGATCTTTT 
ATCTGGCAACTATGAATCTTAGTCGTGTCAAAAAGGGTTGAATCTGACCAACCAATGGGTAGGAGTTGCTAGATGTCTAAATACAAGGACAATCT 
TATCAGTCTAATCATGTGATACATTGAGTGTTTGTTATGCAGATGGTATGCAAAACCCATCAGCATGGTAAGTCGCATTGCTTTCTCTGGCAACT 

AT G AAT CT T AGT CGT G T C AAAAT GGGT T AAAT C T GAC C AAC C AAT GGGT AG AAGT T GCT AG AAG C T T AG AT AAG AAAGGG C AT AAT C CT T C G AGT 
CTAATCATATGATGTATTATTTAGTGTTTGTTATGCTTCTACTTCTGTGGATGGCATAGATCAGCCATCTACATGGTAAGTTGCATTGCTTTCTC 
TGGCAACATTGAATCTTAGTCGTGTCAAAAAGAGATAAATCTGACCAACCAATGGGTAGGAGTTGCGAGATATGAATCGAAAAAAGAAAAACATT 
TAGGGTGTTACTAGTCTAGAACGAGATGGTATCATTTTTGAAAGGTCTGTTTTTCCAACCTAACCTTAACAGCTAGAGTTCAAGAAACTTGCTGG 
AAT GCGACAGACTTTCACACGCAGCGTAATCATCACCAACCGCCAGCTTTGTGCCATTGTGCGCTT AAT TGAATAAAC AAAT GTAAGCGAAC AAC 
GGGAATCGAAGAAGCACCAACTATGCGCAGTGAAAGTTTTGTTTTCGTTTTTTGTAAAAACAAGATTTTGATGGTTAAAAGAGAGGCCGAAAAGC 
AGTTCACTGACATGTCAAGAATTTGGAAGTGTTGATGGATGGAATGCGGAGAGTATGGAAGATATGAGAAAGTGAAAGCAAAACAAAAGTTGGAG 
GCAGTACGAAAACTAAAGAGCGAACGTGCGGAAACTCAGTTAATGGAATATAACCTAAAATAGATGAAGGCCAAAAAGAAATGGTCCAAACAACA 
ATGAACACATGTTTTTGCGCGTCGGCGTGTGCGTGCGCGCAAGTTTGCCGCTTGGAAACAACACACATCACACGAATAAAACGAGAAGAACGGGC 
AAGGAAACAATGAAACCACCAATGGGCGGGGCTGCGATAAGCAGTTTGGGCAGTATGGAACCATATTATTCCATCATAATTGCAATTTTGTGCGA 

AAT T G AGAC GG AAAT T T C T AG GC G AAC AGT T GAT GAG T GC C AC GAGCC AG CTGTTTAGCCCC CAT CAT CC C C C C G C AC AAAT C G T AT AC AT C C AG 
AATAACTGGAACTGATATAATCATGTGTGTGTGTGTACACACTGGCGGGAATTGAGTGCGTGTGTGTGTGCGCGACGAGATTAACGTAAGCAGTG 

C AC T GT G GAAC AAGT TAT G AAAC AC T T GGAAGC C GT T TATCTGGTCTCCTAT G AAAG AC AT C T T T C AAT AAAGT C C AGT TT G G ACT T GC GT T T AG 
CCCCACTGTATGACATGCGCTAGTGTGCGTGCGTGTGTGAGTAAGCGAGTGCAAAACTGCACACGTGTTTTCTTTGTTTCGCGCGGAAGGGGGCT 
ACTTTTGCGGGCAGTAAGACTAAAAAAATTACACTACTTTTGCGGGGATTACTCACCATCGGCTGGCTGCTGGAAGTTGACGTAGGCATAGCCCA 
ACGAGCGACGGGTAATCACATCGCGGCAGACACGAATGGACAGCACTGGACCAGCAGACGAGAACTTGTCAAAAAGTCCCGATTCGTTGACGTCC 

T GT GGG AG AT C AC C G AC GT AT AG AGAAGCCAT AT T T GGT GAT C AGGT CT AC C T GC AATT T T T AAC AAT T AATT T C C ACT T T T TT G AC AC T GAAT T 
TTGGGTTTTTTCTTAATCTTTACGGACCGGATGGATTCGACCAGGCAGGCGGTGAAATAGAAAGAAAAAAAGCGGCGTGGCTTTCGGTCTTCTAT 
TCGTCTCTCTGTCTGTGTGTCTGGGTGGCAACACTGGTACATGTGACAGTCGGTAGACATGGCAACGTTGGCATTTGCTTTGTTGTTGCTCTTTG 
CTCTCCCCTTTTATTTCTCTTCGTGTTGGAAAATTTCCACACAGACACAAAGATAGACACGCGTTTGCCTGATCGAATTTTGGGGAATTCTTTGA 
TTTTCTCGAAAATATGCGACAACTAGTTCCTAGACCACTATTTTGCGGGCAATCTGGCGTTTATAGACGCCATCTTTGAGCGAAGAACGACCGCA 
AACAAAGCAAATTTCGATGGGAAAGGAAAATTTCCCTATTCGCTTTGTCGTGCGTTGCGCCCTCTCACTCTCTACACCATTGTAACCAATAGCTT 
TACACTGTACTTACTTCTCAAACACGCTCGATTTTTTTTGCTACTTTTTACAATTTTTTTTGTTACGTTTTTGTGTTTTAATATTTTTTTTTATT 
TTTTATATGTTTTTTCTTCACAAAAAATATGTGATAAGCCGTGTGCTGAAAGCGAATCCACGACCGCTCACGTAAACGGACTCTACACGACTCAT 
AGAGAGGAGATTCGTTAGCGATGACCCCACAAAGTATCGATTAAATCCCACTGCACAATGCTGCCACATAGGGTTAGAGACAAATAAATGTCACT 
AAAAAATCCGTGAGGACAATTTACTAATTACGAAGCATTATTTTGAATTTCAAAAAAGCCAGCCATCACACATAAAAAATTTTTATATTTAGCTA 
ATTCCCGAAACATGAAATACTTGTGCATTTTCAGGGGTTCTTTGGTAACTCTGCTTCATTTATGTTCTGATATTTTCGCCAATGTCCTTGAATAC 
TTTGTCTCATAAAACACAATCCAAAATTAGGAATGCAGTTTTTTTTTCTTTTAAAAATGAGCGTAGACTTTTAGAAAATGATTTTTTGGCACTGC 
ACAGATATTAGCGTATGCTCCGAAAACCGAATCCCATCTTTGTAATTCCCCACTGTGCGAACTTACCTAAATCAGAGAATAAAGCGATGAGGCTG 
CAAGTCGACACTCCGAGAAGTGCTTGAGTACTTCTTTTCGTTCTCGATGGAAATCGGCACTACAGAGATTGAAGAATTTTTTAAGTTTTTTTTTT 
GTGTTTTTGTTTTTACGAAATAGACCACATTCAAGGATGGCTGAGGATAATTTTTTTTTTCTTTTTCCCGTTTTGAACGTCTGGTGCATGTGCCC 
TATCTGGCGATCGAGTACCTAATGAAGCGGTGTGAGGCGTCGGTGCCTGACAAGCTTCAAAGCGAAGTGTCACTTTGGGCGCCTACTGGCGTGTC 
CTGAGTAAAATCTTAAAAGATTTGACATTTCGGAAACATTTATTTGGGCATTATATACTTCATTAAACATATTTCGATCTGGTTTGGCCAAACAT 
TTACTTGGCTTGAGAATTAGTTTGCTTTCAACTCAATTTTGATTTTTCGTAGCATTTAATAGTTAAACATAAAACGTACTGGCTTAATTCATTAT 

TGTGTAGTTTTG 
{ SEQ ID NO: 553) 

Exon: 6422.. 5823 
Exon: 5751.. 5567 
Exon: 3826.. 2342 
Exon: 2272.. 1965 
Exon: 1897.. 1001 

Start ATG: 5732 (Reverse strand: CAT) 



Transcript No. : CT16411 ^ 
ATTCGCTTTCAGCACACGGCTTATCACATATTTTTTGTGAAGAAAAAACATATAAAAAATAAAAAAAAATATTAAAACACAAAAACGTAACAAAA 

AAAATTGTAAAAAGTAGCAAAAAAAATCGAGCGTGTTTGAGAAGTAAGTACAGTGTAAAGCTATTGGTTACAATGGTGTAGAGAGTGAGAGGGCG 

CAACGCACGACAAAGCGAATAGGGAAATTTTCCTTTCCCATCGAAATTTGCTTTGTTTGCGGTCGTTCTTCGCTCAAAGATGGCGTCTATAAACG 

CCAGATTGCCCGCAAAATAGTGGTCTAGGAACTAGTTGTCGCATATTTTCGAGAAAATCAAAGAATTCCCCAAAATTCGATCAGGCAAACGCGTG 

TCTATCTTTGTGTCTGTGTGGAAATTTTCC AAC ACGAAGAGAAATAAAAGGGGAGAGC AAAG AGC AAC AAC AAAGCAAATGCCAACGTTGCC ATG 

TCTACCGACTGTCACATGTACCAGTGTTGCCACCCAGACACACAGACAGAGAGACGAATAGAAGACCGAAAGCCACGCCGCTTTTTTTCTTTCTA 

TTTCACCGCCTGCCTGGTCGAATCCATCCGGTAGACCTGATCACCAAATATGGCTTCTCTATACGTCGGTGATCTCCCACAGGACGTCAACGAAT 

CGGGACTTTTTGACAAGTTCTCGTCTGCTGGTCCAGTGCTGTCCATTCGTGTCTGCCGCGATGTGATTACCCGTCGCTCGTTGGGCTATGCCTAC 

GTCAACTTCCAGCAGCCAGCCGATGCTGAGCGTGCTTTGGACACCATGAACTTTGACCTGGTTCGCAACAAGCCCATTCGCATTATGTGGTCTCA 

GCGTGATCCTTCTCTTCGTCGCTCAGGTGTGGGCAACGTGTTCATCAAGAACTTAGATAGGGCCATCGACAACAAGGCAATCTACGACACTTTCT 

CCGCCTTCGGCAACATTTTAAGCTGCAAGGTAGCCACCGATGAAAAGGGCAACTCAAAGGGCTATGGATTCGTCCACTTCGAGACCGAGGAGGCC 

GCAAACACGTCCATCGACAAAGTCAATGGCATGTTGCTCAACGGCAAGAAGGTCTACGTGGGTAAGTTCATCCCGCGCAAGGAGCGCGAGAAGGA 

GCTGGGCGAGAAGGCTAAGCTCTTCACCAATGTATATGTGAAGAACTTCACTGAGGATTTCGACGATGAAAAACTGAAGGAATTCTTTGAGCCCT 
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ACGGCAAGATAACCAGCTACAAGGTCATGTCCAAGGAGGATGGCAAGAGCAAGGGCTTCGGATTCGTTGCATTCGAGACAACAGAGGCTGCTGAG 
GCGGCCGTTCAGGCTCTCAATGGCAAGGACATGGGAGAGGGCAAGTCCCTGTACGTGGCTCGTGCCCAGAAGAAGGCCGAACGCCAGCAGGAGCT 
GAAGCGCAAGTTCGAGGAACTGAAGCAGAAGCGTCACGAGTCTGTGTTCGGCGTTAACTTGTACGTTAAGAACCTGGACGACACAATCGATGACG 
ATCGTCTGCGCATCGCGTTCTCTCCATACGGCAACATCACATCGGCCAAGGTCATGACCGATGAGGAGGGTCGCTCCAAGGGATTCGGCTTCGTG 
TGCTTCAATGCCGCAAGCGAGGCTACCTGCGCCGTCACCGAGCTGAACGGTCGCGTCGTCGGCAGTAAGCCGTTGTACGTTGCTTTGGCCCAAAG 
GAAGGAGGAGCGCAAGGCTCATCTCGCCTCGCAGTACATGCGTCACATGACCGGCATGCGTATGCAGCAGCTGGGACAGATCTACCAGCCCAACG 
CGGCTAGCGGCTTCTTTGTGCCGACCCTTCCGTCAAATCAGCGTTTCTTCGGTTCCCAGGTGGCCACTCAGATGAGAAACACACCCCGCTGGGTG 
CCCCAGGTGCGTCCCCCAGCAGCCATACAGGGTGTCCAGGCCGGAGCTGCCGCTGCTGGTGGCTTCCAGGGCACAGCCGGTGCGGTGCCCACTCA 
GTTCCGTTCGGCTGCTGCTGGCGCTCGCGGGGCACAGCCTCAGGTGCAGGGCACACATGCCGCCGCTGCTGCGGCTAACAACATGCGCAACACTG 
GAGCACGCGCTATTACTGGCCAGCAAACTGCAGCACCCAACATGCAGATCCCCGGAGCCCAAATCGCTGGCGGTGCCCAGCAGCGTACTTCCAAC 
TATAAGTACACCTCAAACATGCGCAATCCTCCAGTGCCACAGTTGCATCAGACCCAGCCAATTCCACAGCAATTGCAAGGAAAGAATTCTGAGAA 
GCTCATTGCCTCGTTGCTGGCCAACGCCAAGCCGCAGGAGCAGAAACAGATCCTGGGCGAGCGTCTGTATCCGATGATTGAGCATATGCACGCCA 
ATTTGGCTGGTAAGATCACCGGCATGTTGTTGGAGATCGAAAACTCTGAGCTCTTGCACATGATCGAGGATCAGGAGGCCCTTAAGGCCAAGGTG 
GAGGAGGCTGTGGCCGTGCTTCAGGTGCACCGCGTCACCGAGCCCGCCAACTAAGCTCGAACAGCTCAAGCGTATGAACATTTTACAAGAACACT 
TTACACCCAAAAATTTTCCACTAATCTTTCAATACATGAACAACAAAAATTCGCATTCACTGATGCTTTGCGCAGGGGAGGATTGCCAAGGACAC 
AC AATT AAC AC AG AAAAAC AAAGCG AAAAATC T AC GGC T TTTTCTTTTC T AAT CCC AG AAAC AT C AAC AAAAGC AAT TT C AAT C AT TT AAT T TC G 
AACGCATTTACGCAAGTACACGAAGCAAAAGAAGAAAGTCACGCAATCAAATTCTGCAGTCCACTTATAAATTCTTTTTTTTTACATAAACATGT 
TAATTAAAAAAAAAAACGGAAAAATGTGAAATGAAAAAAATTTCGTTATTGCTTTAACTGGATCGATATTATTTTGTATGCGTATAATGAAAACA 
C C AAAAAAC C AT T GCAAAAT AAAC AG AT T AAAG C AAAGC AT AC AT GT AAACC GCT T CT T GC AAGT AT T G AAAT T GT GTT T T AAAAAT T T T AAAGA 
AGTTTCTAGTGTTACATACATATTGAGTTTTTCGTACACTTTTTCCATCCACACTTTCAAACAACAACAAACACGAAAGCGATATGTAATTTTTT 
TCAATTCTTTTTTCTCAAATTCAAACTTTAAGAAAGCTTTGTTTAGAAAAAGAAACGCGACAACCATACACTTTTAAAAAAGCAAAGAAAAAACA 
AAAAAATTGCAAAAGAGTTAAAATAAAAACGAAAAAATGGAAAGCAAACTGTGCAGAGA 

G C AAGAAT AT G AG AGT C AAGT T TTTGCTACTTT G AAAAAAGAG AG AAAAT AC AGAATACC AAC AGCT G AAAAAAT AT GT T GAC AAAT G AAAA C AA 
AAC AT AAAT CGAAAAAACT AC AAAAAAATGGCAAAACCT AT GT AAAAAC AGC AAA 
(SEQ ID NO: 554) 

Start ATG: 620 {Reverse strand: CAT) 

I^SLYVGDLPQDVNESGLFDKFSSAGPVLSIRVCRDVITRRSLGYAYVNFQQP 

NLDRAIDNKAIYDTFSAFGNILSCKVATDEKGKSKGYGFVHFETEEAANTSIDKVNGMLLNGKKVYVGKFIPRKEREKELGEKAKLFTNVYVKNF 

TEDFDDEKLKEFFEPYGKITSYKVMSKEDGKSKGFGFVAFETTEAAEAAVQALNGKDMGEGKSLYVARAQKKAERQQELKRKFEELKQKRHESVF 

GVNLYVKNLDDTIDDDRLRIAFSPYGNITSAKVMTDEEGRSKGFGFVCFNAASEATCAVTELNGRVVGSKPLYVALAQRKEERKAHLASQYMRHM 

TGMRMQQLGQIYQPNAASGFFVPTLPSHQRFFGSQVATQMRNTPRWVPQVRPPAAIQGVQAGAAAAGGFQGTAGAVPTQFRSAAAGARGAQPQVQ 

GTHAAAAAANNMRNTGARAITGQQTAAPNMQIPGAQIAGGAQQRTSNYKYTSNMRNPPVPQLHQTQPIPQQLQGKNSEKLIASLLANAKPQEQKQ 

ILGERLYPMIEHMHANLAGKITGMLLEIENSELLHMIEDQEALKAKVEEAVAVLQVHRVTEPAK* 

(SEQ ID NO: 555) 

Classification: RNA_binding 
Gene Symbol: pAbp 
FlyBase ID: FBgn0003031 



Celera Sequence Mo. : 142000013384618 

ATCCTTTGTTAATCATGTTGCTATTTTTCAAATTTATATTCAAAAAAAAAGTATTGAACATTATAAAATTGTGAACTTCCCTTCTCAAAATATTT 
TAAAATGTGAGAAAGTAATCAAGCTTTGTCTAAAAACTCCCTTTTTTCTACTACCTTTTTAACTCATTTAATCCGCTCATAGAATATACTCCCAT 
TTATCATACTCTTTCTACTCCGGTTTTTTGGCCTCTTAAGAACGCTTGCTCTCCCTCCGCTCTCTTGTTTAAACAAATACAAATCAAAACCGAAA 
GCAAGAGCAAAGTGCAATAACGAAAACCACAAGCAAAGAGTGTTGTACAATTGCAGTTTTTGTTTTGCTTTAGCGTTCGCTAGGCAGCGCAGTCG 
GATATACTGCCTCTTCGCTACTTCTGATGAGTTTTCTTCTTCTTCTGAATTTACCTGCTCGACAAGATCTTTGTGTACCCACACATACACGCGTA 
CACAGCCTGCCCACACACTCTTAGTTGAAACGAGAAAGAACGTTTTGTAAGAGTACCAACTGAGACATACCCTTCACATTGTTTGTGCATCTAAA 
TATGTACTTTATGTTGTAAGTAAATATGAGTTATAAAATAGTGTTACTTCAGCAATTAGTGCGCTGGTACAATATTTCAGTTTTTTCAGTATGCG 
AAAGCCAATCAATTAAATTTTTACAACCTTAAAATCTAGATATGTTAACTATGCTCCTTGATGCATTTATGTAGACGCAATTCATGGTCCGAGTA 
TACTCTTCCTCACTCAGCTATTACAGGGTAGGTAAAGAGCGGGATTTTTTAAAATATTTTTTTTCTCTCAAGTGTGCCTCCTTCATGCACGTGCA 
TGTGCAGGAGAGGGAGCGAGAGAGATTGCGATTGCACTGGCTGCGCGCCTACGCTGCCGCTCTGCCGACGTCGCGGTAGAGGCTCTGTTGCCGTA 
GTTTGAGGCCCCAGTGCGAACAGTTCATTTTTAGCCGCGGAGCCAGTAAGACGTGTTTCCTGCCCTCTTTCTTTGAGTCTGCGACACGTTTTAAG 
TGCTCTTCCATAATTGAC AAC AGCAAAAGCAAAGAATAAAAAAAT AAC AAAAAAT AAAAAACGAAATCC AT CGTGAACAGTTTTGTGTTTTT AAA 
TCAGTTCTAAACACGAAAAGGGTTGATGAAAAACGCAGAAGAATCCGAAAAACTAACTAACCGAGCAAAAACTTGACTTGAGTGTTGTTTGACAA 
ATCAGGAAAGATAAAAAACAAATCATAAGAAAAAACTGCACGAAAAATGAAAAAGTTTCTAATATTCAAAATCTTGCACAAGAAATACAAAATCA 
ATTAAAGTGAACTCTAACCAAAAGTTGTACACAAAATAAAAAGCAAAACAAAGCAGCGAAGAACAATCACAAGAAGAGCAAAGTGCCAACAAAGT 
GCAGGAAGGAAGGAAGCGGATAAGGACAAAAAGGAAGCCAGCACACACACACACCCACACAATGGCCGTGCCCTTTTATTTGCCCGAGGGCGGCG 
CCGATGACGTAGCGTCGAGTTCATCGGGAGCCTCGGGCAACTCCTCCCCCCACAACCACCCACTTCCCTCGAGCGCATCCTCGTCCGTCTCCTCC 
TCGGGCGTGTCCTCGGCCTCCGCCTCCTCGGCCTCATCTTCGTCCTCCGCATCGTCGGACGGCGCCAGCAGCGCCGCCTCGCAATCGCCGAACAC 
CACCACCTCGTCGGCCACGCAGACGCCGATGCAGTCTCCACTGCCCACCGACCAAGTGCTATACGCCCTCTACGAGTGGGTCAGGATGTACCAGA 
GCCAGCAGAGTGGTAAGTCTACAAAGATCTCAATTCTCCACTCTTAAGAACTTTGAAATTGTGTGGGTTAATCAGGATATCCATTTAGTTTACCT 
CAAATACATTTGCAGATACAAAAATAAGTTTCGATTCATATACGGTTATTAATTGCGAAATGTTTAAGAAGTTCCCACACAGAATAACTGCTTAC 
GCATGTTGGTAATATATTTTCTAAAATGCTTATAACCAACACTTTATTTAAGCCCACTTGTTTTGCTATTAACAAGAAGGATTTTCATAGGATCT 
CCTTGTAAAAGT AAAC AAAAAATGTT ATT AACTTGAAATTGCTTAAAGAATAGTAAATTTATTTCTTTCGAAAAAACAGCATTTTAAATGTTTGA 
AGATATTTTCAAACTAATAGATTTGATCAATAAAAAATGGCGAACAATTTTTTTCTATACAAGCGACTTTAAAGGTTTCATATTTTTGATTACCG 
CTTTTAATATCATTTAGTTTAGTTTAAGAAAGTTTCTGGACAGCCGGAGGGTTTCCCTCATTTTAAAATGGGTCAGCAATTAGCCAAGTTGTGAC 
GTTATTTGTTAACTGCTTCAGTTCCCCTTGTGCAAGTTTCACTTTTCAAGGTTGTTGATTCATCCAGTTTCGAGTAACTGCGTCAACTGCATCCA 
GGTTGTTTTTGTTTACTTTTACCCAGTTTTTTTTTTATTTTATATATGGTCAATTCAGCACTTTGTCAGCTTATCAGCAAAAAAAAAAGTGTAAA 
AGGCAGAGCGGAAGGCAAATTTCTCGGCCACATTCGAAAGGAAGCCGAAAAAACGGTAATTCAATAAAAATTAATCGAACAAGTTCAAAAAGGGG 
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AAAATTGAGAAGTAATTGAGAAGTTTCGATTGGAGGAGAGAGCAAACTTAATAAACGAAGTCACAAAACTGCAAAAACTCAAAATCATGTGGGGA 
ATTACTGGATCAAATTCCCAAAGTGGGAATCTTACAAATTAAAAAATTTTGTATTGAGACAAAAATCTACTTATGTTATAAATAAATGAAAATAA 
AGTAGAAATGGTGATAACTAGCTCGATAAATCACATTAAGATCAATTTCCTTCGGTCATGATTATCTAAACTAGGCTGATAAAACTTTTAGTGTG 
CTATGGCTTTAAGAAAGAAACGCCGAGACACATGCAAGACAACTGCCGAGTTGGCTGTCATGCGATATTGTTCGTAAATATGGTCAAAAAATTGG 
GCGTGTGCTGGACCAGCAAGAAAATAAAGACCTGGCTGAAACCAGGGAAACATAACCCAGCGGCAATAAAACAGAGCTGGGAGCACAGAGCTAAG 
CACGTTTAATTGGCACGCGCCAAACTCGAGTCGAATGTCTTACTTTTGAGACTTTTGAGTGAGTGTGGTGTGCCTGTGTGTCTGGGTATCTGCAA 
GATATCTATCTATCTCGAGCGTACTGCTCGTATATCTTACCACTCGACTAACCCAAATTATTAAACGCCATTTGACCTTAGGCAACTAGTTTTCC 
GATTTTCCAAAAGGGGGCCAAACGGGTTGAGGGTGTGGGCCATTTGCAAAGTGTTTGACTTAACGGTTGTTTAAGTAGCTTTTGTGCTCGGCCAC 
GTTGAGTGCGGTCAGTTTTTATTGTTTTTCTTTTTTTTGCGCGCGCGCTTTTCATGTTTGCTTTGCGGTTTTTCATTCGCGTTCCTTAATTTTGT 
GTTGCCTTTGCATAATAGATTTTGTCGCTGTCGATGATTGCCAGCTGCGCTCTTTGGTTCCACCGAACAACCGAATTGTTACTTTATATTATTTT 
GCTGTTTTATTTGGTTTGTTACCTTTTTTGCTGTCAGCAACAGTTGGTCAGACGAAAAAAAAGCGTTCCCTTCTCGAAATTACAGTGAGTCACTT 
TTGGGCGCTTCAATCTGTTTTCGTCTATAATATCATAATTTATACAGAGATACTTTTTTTTCGCTTACGAGAGCTGATGCCCAGTGAAAATTCGA 
TTGCGGGGCATTTATTTTGAAAAACGAGCTAAAAAGACTTGACGTTATCGTGAAAGTTTCGATTTAAAAGTAATTTTCCCTTATCTCATTTTCCC 
GAACCACATCTAAGCCCCTACTCAACGCACGCCGATCATGTACTTTAGTTAGCCACTCAAGAGGATAATTTATCTAATCGTGACACATCCCGAAA 
CTAGTGAACAATGCTTGAAGCCCTTGAAACAAGAAAGGGATAAACTGTAGTTAACCAAGAAGCGAGAACCCTTTTTCCTGGTTCCCCCAGCTATG 
CCATACATACACACATACATGCAAGCCAGATTCACGGAATACTTTTAGCCAGAAACGCGCTTAGTCATCAGCCAGCTATCAACCCTAAAAGCCTA 
AAAATGCGAGGCGCAAACAACCCCGACAAAAATTCCTAAATACCAACAGAGCTTTTCCCTACCAAAAGCAAAAAAAAAAATGCGGCAGGGAGCCT 
TTTGTTTTGGGAGAGGATTATGCATACAGCATCCAGAGCCCACCACGTAACCCATTCCCAATGGCGACCAACAAAAGGTGCAATGACAGCAGCGG 
TTCCCTAATCAGTGATGGGAACAGGTACATTTTATCTGTTGGCTAATTGGATGCAGATTAAAAGATGCGTAAACCTAGCGGTTATTATGTAAACT 

AT T AT GG AC T AT T AC T TTTTTGGTT C AAC AT T T AGT T AT T AAC AT AG AT T T TT GT T AT CAT AG AT AAT AAT AT GT AC C T AT GT AGG T AC AGT GC A 

GAGTACAAAAATGTTCCCCACCGTACTCAACTCCAAGTTTTGGTCGCTCTGCTTTGGCTTGCAAATTCGCATCATTCGCTTTGCTGTTGCCAACA 

GCTGTTCTGCTGCTAGGCGTTGTTTTTCGCTGCCGGCGTCTCTGCCGCACTTGACATTTTCAACCATTTTGTTTTGCTCGGCAACTTCGTGTAGG 

TGGACTTTAGTAGAAAAGAGGAGCCATAATGGCGGCAAAAATGGCGACTCCTGCCGCATCCTGCCTGGCAACAAAACACAGCTGACTGCTGCGCA 

GATAGAGCGAGATGAAGCCTACGCCTGGAGAAAGAGAGGGAGCGCAAAAACTCCGATTAACCGCAAGTTTTGAACTCCTCTTTTTTCTGCACAGC 

ATCACAAAGGGGGTGGGAAAATAGGAGGAGGGTTGGATGGGAGGAGGTTTGCCTCCGCTTTTCATCATCGTCATTTCCATCTCGATTTGCTTGGC 

CTGTTGCCAAGCTGATTTATACGTACAGGCGGCTTTTTATATTTTACATACTTTGCATTGCTTTTTTCTGTTTCTCTCGTTGTCTGCTTTATTTT 

TTCGCTTTTCGTTGGTATTACGGCTTCGCCTTTACATTGACCGTGATTTTTGGTTTCCTTCTGGTTTTGTTTTCGATTTTTTTATTTTTCCCTTG 

CAAATACATATGCAGCTTTTTTTTGTTGAGTTGCGTCCTCTGTAATTGTTACATTTCATTTAAGTTTTATATTGTCTTACAGCTCAGTTTTTTAT 

GACTTTACTGCTTTCATAAATTTGGCATAAACATTCTTGACGATCTGCCCGAAACAGCTGCGACAAATCCAATCGGAAGTCTTTACTCGAAAATG 

ATTCGCTCATAAAAGTTGAGAGTGGGTTATGAACAAAAAACTTGAAAACCAGCTGTCAAACACAATTTCTATCTGAGCCCAAAACTTTTGAAGAA 

ATAGTTTCACTAAAAATACACTTCTAAAAAGGTATAAAATTTATTTTATTAGTATTTTGGATTGCCAAATATTTCCACCCATTTATATACATACA 

TTTATATACATTATATACATATGTTCAGCATATTGAAAATTAAATAATCCAGTTTGTTTACATTTCACACCACTCAGCGCAATGAACTTTACACA 

TTTTTCAAGTTTAAATGCCCGCCAGGCACTTTGCCTAATAAGAAAAAATATTACCTGAATGCGAATATAAACTTGACACTTTTGAGCCACTATAA 

GCGACGGGACACGGCTGAATCGGGCTCAGACGGTAGTCAGCCAGGATGCCTAAGTTGGCTTTTGATTAAGTCTCCGCTGAGGCCGTGCTAAATTG 

TTTGCACTGAGGCGTTCGGGTTAAGTTCAGGGGGCAGTGGGGGGTGTTAGGGATAGAACGTGCTCAAGTTTTTGTTATGAAATTGAGCCTAGACA 

AGCTAGGCAAAACCCCATCTGCGCTGACAACTCGACTCCTTTTCAGGAACTAAGGACAATGCAACTTGTGCGACACGCTCAAGGCAGAAGTGGAA 

AAGGATATGTATCAAAAGGCGGAAAACTAAGAACAGGATATACTCCTCGGCACAAAACAGACGGCAGACATCATACCTGGCTACTTGTTATAATT 

GGCAAGCTGGCAAACGCTCCAGACACCAGTTAACTTTCAAATTTTCCTCGCGATGAGCTCTCAAAGGAACATTCGTTGAAGGTGTTCTCCGATTC 

CCAAAGGGAAAATGTTCCTTCGCGATTTACCTTCCACATTTTAAAGAGCGCTGGGTGAGATCAAGTTTAAAGGGGCTTAAATGAGGAAGGATAGT 

GGAATGGCATCTGATTTATTTGAAAGAGTTGAAAAACAAAAACACCTTTTGTTGTATATTATTTTTAACAAAACCCATTTCATTTGGCAATTCCA 

ATTTTAATGTTCAACATTAAGTAAACTACTGAAATATGAACTCTGTTTAGAACTCCCCCTCCTATTTATCTACTGCTCGGGTTGAGTATCAATAT 

CAACTTATTTGGTAAACAAATCAAATGCGCTTACTAAATGTGCAAAGTGCCAACGACAATTAAATCAACAAATAAGCTCGTGTAATTGTGTACGC 

CAAAGGCAGTCGGTGGGTGGGTGTTTTCAGTGGTCTCGGTGGGTTGGTGGTTTGGTGGGTTGGTGGGTTGGTGGGGCACATCCGATCCACTGGAT 

CGAGTATAGATAACCCTACGGGCACGCAAGCTCCGAGTGGTTGTGGATTGGATGACGCAGCGAAGACAGCCAGTCTGGCCCAGTCGCAAATGCAA 

TTCCCATTCATTTATTAATAACTTTTTGCCGCGGCTGTTGAGACGTGCAGTTTGCCGTTTTGCGGCTGCAGTTTTGCTTTTTTTCTCTCTTTTTA 

ATTATTTTAATTGCGTTGCATCTGATCGAATGCTGGCGTACAGTTTTATTTATTGTTTAATTGTAATTTTCGCATTTACGTTGTTAATTGCAGCG 

TGGTACTGGGCGGAGATTCGGTTTAATCAACTACTAAATGCTTTTGGCTCGTTGCAAAGATTTTGAATTTGGTTAAGTGGCGATGGCATTTATGG 

CCATACTTCAGTGCGGAAGTATGGAAAATTCGTGATTTTCATAAATAAGAACTAATGAGCTTTAGCAATTAACAAGAAATTAATAGAAATGTTTA 

GTTAATCTGTAATATTTTACTAAATCTCCCAACTACACAGTTTTCAAATTACAGTGTAGAACATTTTCTTTAAGACTTCTTTATATTTGTTTGAT 

CCTTTTTACACGCTGATTCAGCCAGTTATATTTTACTGTCTTCCCAAACCATTTTCAAATGTTCCTCTTGCTCACAAAACACATGCTCGCCATTT 

TCCACTCTTCCGGAAAATCCTTTGGATCCCCGGAAAATATGCAAACAACTGCAAAGACCAAAGACTTGTGTTTCGCTCAGTTTCCTGTCAATCGA 

TGTGCATCGTATAAGTTCCTTGGGAACGTCGTCTCATGCCTGATTTTCCTTTTGGCTTTTTATAGCCTTTTAAGCTGTTTTCCTTTTGTTGTGTG 

GAGTGAAGAGAGGAAATTGAAAAATTGCATTTGTCTGTCGCTGATAAAATATTGAGCAGCTTGCTGCACATTCTCCCTGGTTTTAGCTGGGCTGT 

CGTGATAAGCGTTTGAGCTATTTCCAGAAAAACGCTCTCATGGAGATTGGCAAAGAATGAAAATGCGATGTGGGTGGGAGAATTTCGGAATTCCA 

TTCAGCTGTCCCTCAAAGGGTTTCCTCATTCTTTAAAGCCGTTTCCAACGGCTCTTAATTCCTGAAATATTTAATTAAATTCTTGCATATTAGCT 

TTATGTGGGCTTTAATAATGGCTCGCGTGTTTGCTGGCCCCGTTTTTGTTTATATTAGTTTATAACCTTGCTGCGGGAAGATTAAAATATTTTAA 

ACGCATCACGATGCGAACATAACTGGACTGGGTTAGATCATCAAAGCTTAGATTTAGAATTGCCGACGATATGAGCTTTAATATCTTCATACACT 

TGACCTCTGTTTTTATTATCGAGAATTTCTGCTGACTGAGTGAGTTTAGTGATAATTGTATTGTTTGTATTCTTGGAGACCCACAAGATTGATGC 

GTGAATTTGATTCAATCGGGATTTGCCAAATCATCAAACAAAGAAACTTTCTCCTTGCCATCGAGACATTACGCAAGTTAATTAAATTGCCATGC 

TAAGAATCATCCTTATAACATCTCACGAGAACTTTCTCTTTCTGTTTACATGAATCACTTGTGACATTATCAAATTTGTCATTAGCTTCCGCATA 

TTTTTCACCACTCACTTTTGGATATTTCCTTAATTCACAAACTTCTCGCATTCCACAAATGGCTAATTAGCGGCTTACCGAAATTGGCCAATAAC 

AGCGGGATCTGATCTATAGATTGAGACTCGTACATAGATCGACACCTGATTGACATGCAGTTCGAGACAAAAACTGATCGAATTGTGCAAAACCC 

CAAAAAAGGCATTTGTAAAACAGTCTTTTGTCATATATAGAAAAAGCCAGAACACAGAGATTCTTAATCAGGAGTCGTCGGCATTGTTTTCGGAA 

TTCTCTCCCAATTTCCTGAAGATCAATTAATTAGTCACTTGCCTTGGCCATGAGCAATTTTACCGTTCGACTGTGCGGGGAGAAAATCGGCAGCT 

GTTGTTGCCTAATTAAATCGAGGTGACAGGCCAGACAGACGGACGATCAAGTGAAACTCGAGTTGACCAACCAAATTAGTAATCTCGCATCGTGA 

CAGACGACAAATCGCCTTAAATGAACGACTTCAAATGTAATTTTTGTGGGATTTTTATTTACTGTTTTTGGTTGTGGTTACAGCTGCTTATTCTG 

CTTCCTCTTCGTGTGTGGGCGTTATGTGGTTGGTAACTGTAAAATGAGTTTATACACATTTTTTTTTTTTTGGCCAAGTTATTCGGCATTATTCC 

GACGTGCATCATATCATGTTGTAATCTTGGACTTTGCTTAACCCCAGTTTTGAGTTCGGTATGGAGTGTTTCTTTGTTCTGTCATCTTGAGCTTG 

TTTCGGTTCTGTTTTTCATGTTTTCATTTTTAAATTAAATATTCATCAATAAATGTCAATGTCAAGTGAAGTTGCCGCTGGCTTTCGAGAGGTTT 

CGAGATTTTGGGAGAAGAGATTGCAATTTGCTGAAAGACTTGGAGTTGTGATGGCACAAATGAAACATGCGGTTATTCTGTGAATCCATGACGGA 

TTTATACATCACGTATGCTATATATTTATTTTTGATTTCAAAGATTCCAGTTTCAACTGGAATTAGCTTCTTTATTTTCTTAAAGTTCGTGGCTA 
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GC C TAT G AAT AAAT AT GT T G AAT AT CAT T T T AT T AT T AAC AA AAAC T C C GT T C CGC AGG T C AG GT T GTT AGC C AC C AC T T C CT T CC T A T AT C T C A 
GCTGGTCAATAGCTTCCTCCCTCGATATCTGGCCGATTCATATGGCATTTCCCATTCCGTTCCACTGACCAAGTTATAGAACTCGCACTTTGCAA 
ACTCGTTTAGCCCCCGATACCGATTGCGATCCCAAGTGCGACGCAATTGTTCGAGTTGGGCCGTTGGAGATACGAGCCACTCGAGTACGGGCTCG 
TATTTATTCATACTTACCCGTGCAGTGCCGGTGGTGACTCTAGTATATTTTTATGTGTAGTTTTCTTGTGCATTTTTTCTTTTTTTTTTTTTTTT 
GCATTTAGTCAGCACTTCAAGGTTATAATGAGCAACATATGCTGCGGGTACGCCAGCCAGCAAACTGCAACTGCAAATGCAATCAACTGCAAAGG 
CACAACTAAAAATGCATTTCAACGTTGCCGACGACGATTTGTTCTTTTGTGTGGCAACTAAAGCGAGGCATTTTTCACGTTTTTCCGCTGCCTCC 
GAGCTGTTCTTGAGCTTTTTCTGAGGCGTTCTGCTTAGTTAAACTTTTGCCTGATTGATGTTTGTCAGCTGCTGCCCAGCGGCCTTAGAAACGGT 
TGCAACATCGCCTAGAACCGATGGAGAATTAAAGTGAATTATACACATAATTGCAATTTTTTTTATTTCAGTCGCCGTTGTTATGTCCAGATGCG 
ACTGGCACTCGTAATTCAATTAAACATTTTGCCCATATTGTCGCCCGTGGAAATATAAGCGAATGCAGCAATTCGAATAAAACAATACTCCTAAC 
AATTGGTCACGTTTCGTCACCAACAACTGCAGTGCTAACAACTACTGCATCGCCCGCATTGAGTTGTGATCATAAAATTCTTCCACCTTCCCACC 
TTATAGAGATGGCGAGGAAAACTCGAAAATATATCACAGAAATCACAAAACCACATTTAAATGAAAATAAATTTTCCCTTGCAATGCGTGACTCA 
ATAGACGTGTTTAAAGAAATTAACACTTTTTTCTCTAAGGGTACACCCCTTGTACATTTACGAATTGATTTAAATATCTCATCATCACCTATTTA 
ACGTGTGAATTATATAACGCATATAAGCATTAGGTGCTATAATTAAGCGTTTTATAAATCAAGATAAATCAAACATTTTCTTGCCTCACAAATTG 
GAACAAAAAATCGTTTTAAATCATTTAATCACGCCTTTTTAAGTTTAAACAATTGTTTTATGCGAAAAAAAGAAACGTACACTATTGTTAAGCCA 
GGTTCCCGTTTTTATAGAGAATTTCAAGCACGCTCACCTAAACAAAGATTAAATCACCATAGATAGTAGTTTGTAAACGTTTCGTTCTAATCGTA 
GATTAAATCGTAATCTATGGCAAAATGCGCTATCTATCTTAGCTTATTTTAGCTCACGCAGTCACCACCAAATAAAAATATATGTTAAATATGAG 
TAAGCGATTACTTACTACAGTTACAGTTACAGTTTGTAACTGTATAATTTCATAATAGATATTGTATACGATGAGAAAAGCCTATACAAATTAAA 

AC AAAT GT T CT T T AAG AC AAAT T TGG AAAGT T TAT AAT AC AT T T T AAC AT G TAG AAC C AAAT G AAGAT G AGC AT AC AT C T GT AG AT T T G ATT AAA 
ATTAATATAGAGAAAGAATAAGACATCAATTCTTTCTCCACTCATGGCTCGAAAAATTGTATCACCTTAAGTGGTGATGGTATTTTTCTGTGTGT 
ACCAGCATCGACATAGCATGGCTAGTTATATAAGTTGGCCAGGCAGTAATGATCCCTGCCGTTGGCTTCGCTTCGTTTCTGAAGCTGATGCTGAA 
GCTGAGGCAAAACCTGGCAGATTTTCAAGGTTAACAGCGACTTTACGATGGCGCGCGGTGAGCGCACGTGAGTTGCCTCAGCTGCTGGCCAGCAG 
CAGCAAACTAGTTTCAAATGCCTGGCACAGTATCTTTCAGATACAGATGGTAAGGCTGCGGCAAGGAGTGGCTCTTAACTCGCTTCGCCTCGGTT 
CGCTTCGGTTTCGTTTGCTTTGGTTCGGATCTTGGCCCGCAATTAAATACCGATTTTTTGCGCTCTTGCAACTGCAACTGCGAATGCAACAGCTG 
TGGGCTCGTTTCGACTCGATTCGAGTTCAAGGCTATTGGGTCAAATTGCAAATTAGTCGCCTGCAGGTTGACAGTGAGTGACATTATTGTGTATC 
TGCACACTGCACACTGACCACCGACAAAGCGACAGAAAGACAGACAGACATGGATGTATGTATTGGGTAGATTGTCGGGCATTATCTGCGGTCAA 
TTATCGTTCTTGCTGGCTGCTAATCGGTAAGATACACGTACTTCTGCCGCCGGCTGCTGTCGATTTGTATAAATTTTGCATTACAATCAATCATT 
TGTCGCTGGGCAATGGCTTTTTAAGTAAATAAACATTAAGAAATCATACAGATGAAAATGTACACCCAACTAATGGGGAGCTCACAATTTATGCG 
CATACCAAGTGGCCAGCTCGGCAGCGCATTTACCTATCAACGCTTTTGTTGTCGATTTGCATGAAATTGCAAGTAAAAAAAAAAAACAAAAAAAA 
AACAGGGAGGAGAATACAAAAATTGAAATACAACTGGAAGTCAAATCGCAGATACAGATACGCTTCTCGTTACATGCATAAATGATGCTAAGCTG 

G AAAG AT TT C GAAG ACAG AT AC AG AT ACT T T T CAAGT T G GGC C AC TC AAGT AG AC C C T CT T GGC AAT GT T T GT T GC T AAAG AT AC AAAAC AAAAA 

AAAAAAAATAAAATACTTTGCCAAGTGAGCACGAGAATTGCCAATGACGTGTAAACAACTTGGATACAAATGCGCCTGCAGGCGGCTTGT AAGAT 

ACTTAAGGCTGCGAATAACTTGAATATTAGTGGCCATTGTTCATGGATATGTATCAGAAGTGCAATATGTCGATAAAGTGGTGTATTTAGGAAGT 

TTAATAGTATTTGTTGGAAGCCATATCAATTTATGATGAAAGCCCTAAATGGATAGTTATATTGTTAAATAAGCCATATTACTTAATTAAGGCCC 

TTGGAGCAAAGGGAGCCGAAAGCCATTAAAAATTGCAACGTTCCAGTTTTAGTCGTTTATTTACATTTACATTTATCTTTAAAACAGCATTATGT 

GGCTTCCTGCTGCACCGGCCGAAGATATTTTATTTTATTGTGCACTGCAAGCCTGTAACTGGTAGGGGCTGCCTGATCGTTTTATGGGTGGTTTT 

ATGATCAGAGAGGACTTGCCATAATTGCGGCGGGTTTTACAGTCAGTCGGCTAAGTGGCAGTTGGCAAGACAATGGGATCGCCAGCGCCCCGACC 

TTAGACACTTCCCTACATTGTCACTGACTCAATTTCAGACCCGGCCAATGCCGCCTTTGTTTACCTTATCCATTGTTGGGTGATCTTTATTACGG 

GTATATGTAAGCGAACTGTACATCATCAAAATATTAAATATTTTAGACTTTTCAATAATATTAGTAAGTTCGTTTGATTTTCTTATTATGTGCCA 

GCTGTATTTAAATTATTATTCGCTTAACTTTCGTTTCAGCCCCGCAAATCTTCCAGTATCCGCCGCCAAGCCCCTCTTGCAATTTCACGGGCGGC 

GATGTGTTCTTTCCGCACGGCCATCCGAATCCGAACTCGAATCCCCATCCGCGCACCCCCCGAACCAGCGTGAGCTTCTCCTCCGGCGAGGAGTA 

CAACTTCTTCCGGCAGCAGCAGCCGCAACCACATCCGTCATATCCGGCGCCATCAACACCGCAGCCAATGCCACCGCAGTCAGCGCCGCCGATGC 

ACTGCAGCCACAGCTACCCGCAGCAGTCGGCGCACATGATGCCACACCATTCCGCTCCCTTCGGAATGGGCGGTACCTACTACGCGGGCTACACG 

CCGCCACCCACTCCGAACACGGCCAGTGCGGGCACCTCCAGCTCATCGGCGGCCTTCGGCTGGCACGGCCACCCCCACAGCCCCTTCACGTCGAC 

CTCCACGCCGTTATCGGCGCCAGTGGCGCCCAAGATGCGCCTGCAGCGCAGCCAGTCGGATGCGGCCAGACGGTGAGTAGCCAGCGATGCAGGGT 

GCCAAAAGATACACTGCCTGGGTGGTGCAAAACCAATCAAACTGGAATTTAGATTCAGATCGATGAGCATACAGAATAAGAGGGAAAGTTCCAAA 

CTATGACATGATAGGATGCAATTTAGACCAACTAAAATATACAAAGCTATACAGCGTGTATTCAAATTAGAGATTAATATAAACTATGCTAGGCG 

TTGAATATAATTGAAGAATAATATGAACTTTAGTTAAGTGTAATACTTAAATTCATTTTCATTGATGTTTATAATTATTATATAATATTTATTAT 

AATAATTAATAATCCCACTGAGAGAAATATAGTTGTAAAGGAACGAAGACTAGAAATCGTTACACACACGCTGTGAAATTGATTCGTACTGAAGT 

ACCAAGCAAAATAGCTAGATTATTCATCGCAATCTCGACACCTTTCGCCCCACCCGACCCCCTTTTTCCAATTCCCATGACACAACGGCAAGTTT 

TTAATTAATGAAATCGCAAAGTTGTCGCTGGATTATGTGCACACACAAGTGGCATATAGTGCTATACAACCCGAATCCGAATCCGAACCGACTGA 

TAATTGCTCATGACGTTCCCAAGTCATCTGTCTATATGTGCAGCGATATTTATAGTGCCCATTATGCGTCTCTTCCCACAGCAAGCGATTGACCT 

CGACGGGCGAGGATGAGCGCGAGTACCAGAGCGATCATGAGGCCACTTGGGACGAGTTTGGCGATCGCTACGACAACTTTACGGCCGGCCGGGAG 

CGTCTGCAGGAGTTCAATGGACGCATCCCGCCCCGGAAGAAGAAGAGCTCCAATAGCCACTCGAGCAGCAGCAATAATCCAGTCTGCCATACCGA 

TAGCCAGCCCGGTGGTACATCCCAAGCGGAGAGCGGTGCCATCCATGGCCACATCAGTCAGCAGCGACAGGTGGAGCGAGAACGACAAAAGGCGA 

AGGCCGAGAAGAAGGTAAGAAATGGCCACCAATCTTGGAATGCACAACGCATACAGAGAAAGGGTATTCTCGTTTCGGTGAATCAGCATCACTAC 

CCAAATCTAGTTAGCATTCATTGCCTGCCAACTTAATCTAATATACTTTAGGTTTTAAAAAATGTTCAAAAAACTTGAAAAGAACTATACAATCT 

TGCTGCAAACATACATAGCAAAATATTTCCAACAGGCAGAGGCCCCTTCTTTGCTTAAGTCGCCCAAAACTGTCCAACAATTAGAATACAAAAGA 

GGATATGAGTTGGGGAGTTCATTAACTCGGGACTTCACACATCTGCTAGCAGTTCGTGCCTCCTAAGTCCGTGGCACGTGGCAAACATTTTTCAA 

AGGCTCAACTAGAGAGAAGTGCTAAAAATACGCGGCAAGGAATGACAAATTGCCGTACGCGAAAGGAGTCGCAGATTCGGATGCAGATACGAATA 

CGATACGCTAGCAAAATGTATCTGTATCTGTATTTAGTGGAGTGCGGCCCGGGCATGTATAACACTTTTGCACATGCTAAATGTGCGATAAAAAT 

TAATTGCGCAGTGCGAAGTGAAAGAGTCGCTCTTGTCACTTGTCTCCGCTTTATGTTAATCAAGGCAGGCGCCTCTATAGAGCCACATGTGCCCC 

CTCCCCCCCCCCCCCCCCTTAAGTCGGGCCTTTGTGTATCGGCCCTCATTGAGCTCACTTTTTTACGCGTCGCGTTTTTTTTTTAGTTTTAGTTT 

CTTGCGCGTTTGCCTGCGCAGTTGAAAGAAATGCAAACTTGAAGAAATTCAATCGCAATCGCATGACACACACATTGACTCAGACCTGATACCAT 

TTCCAGTCCATTTCAGTCCTACATTATGTTATTCCTATTCCACTGCGATTTTGTTTTTATTGTAATTCATATGGGCTCCGGGCTGGCCGTTCTTA 

TCGCTCATCGCATTGTGGGGGCTCATCCTTGGGGAAGCTGCCGACTGATCGGGAAGCTATATAGCGGCTCCCTCGGCTCCCTCGGCTCGCTGGGA 

TCTGACCTTATCCGAGACTCACTCAGCTGCTAAATGAGAGCGTGAGAAATTCCGTAATCTGAAAGGCAACAGGCATGCAAAAGCTAGTGGTTCTG 

CAGATGTAAATACATTTTAAAATATTCTAATTAACTATAGCTTAGTCAAGGATTAGAGAGTGGAAAACTTATAAGTAAGGAAAGGGAATGAATGC 

TATACACATTAAACATAATTGCCAAACATAAATTCTAGCATGTGACAAACTGTACCTACACATCTAGTATTAAAATGTGTTACTTAATTCATAGA 

TACTAAAGCACATTTTATTCGCATAATGGTGTGTATTTCACCATAATTTATTCTCCTTTCAATGCCAAAAATGAAATCAAAATATGGCCACATGT 

TCGATGGCCTTTCATACCATTAACCGCATAGAAGAAGCCACAATAGTCCTATCAAAATATAGACTAAAATTCGAGCGAAAACAGCAGAAAAGCAA 

ATTTGCATAATAAATGTGGGGTCAGTTGGATTTTTGGTGGTTGCAGTTCCACCACCTCATCATTCACACGTGCCGCACTGCAACCGTTGAAGCGT 
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GGACTATTGGACAAATAAATGGTGTAAATTATAAAACCATTGATGCGGGTTGGGGGGGAGTCAAGTGCGCTGTATAGAAAAATAATTACACCAGT 
GACCGGACAAAAGTTGACCGCCCATTTGGGTCAGGCCACATGAAGTTCATCCGTCGGGCAGTCAATAGTTTCGCTTGATTTACGGTTCAAAAGAG 
GGGTGGTTCGGTCACGTGTGTGGTTTTAGATATTTAATGCCGCACAGGGGAGCTATTGGTTGAACTGAATGATTTCTTGAAATGCGTTATATTAA 
AGCAACTTGTTCTTTATGTTTAAGATTTGCCTTCACGTGCACCTGAATATAACTAAATGCTATTGTTTTATTCTCCTTTCAGAAACCACAGAGCT 
TCACTTGGCCAACTGTTGTGACCGTTTTCGTTTTGGCCATGGGCTGTGGCTTCTTTGCGGCGCGATGAAAGCGCAGGAGACGTGTAATCGAATGA 
TCTATAGTGAAATCAGCTAGCCCTTAAGATATATGCCGATCTAAACATAGTTGTAGTTAAACCGTACATAAGTGCAACGAATTTATTGAACTGCA 
GGAGCGAAAGCAGAAAGTCATTAATTCGTAAACGGATTGTTAGATACACAAACAGCCAACATACACGAAGAGTGTGCCTAAGATTAAGAAGGTTG 
ACGGGACACAAGAACAATATATTCTATCTGTCTATGGTAACTGCATTTGTATTTCTAAAACGAAACGAAAGATAACAATCTTAACTGCTCAAAGT 
AATGAAAACTCTTAGACTGGCAAGAGACTCAAATCACACTTATTTTTTTGCTGATCCATATTTTTGTACAACCTTTTGAGCGATATTTACAAATT 
ATACTAGTACAAAAAAAAGAGAGAGAGAGATAAGCAAAAGAAAACTGCCACTTTTGAGATACTTTTGATAATCTTTGATTTGCATTTAATCATTT 
CCACACTTGCATTTTTTATAAACAACAAACAAAATTACTTCCATTGTAGAACAAAGTAAACTGCAATTTCAATGTCTTCGCATTTGTAATTCCGA 
ATTGCAAGAAAAACAAAAATATTTTAAATATGTTTAACTAGTAGAATTTTTTAAACGTAAGTCCACAAAAACAAGCACATCTAGCTTTAATTGTT 

G AAAC AAAAGC AG AAAAAAC GC AAC AAAAAAAT G A AT G AAAAT C AT T AAAT T AAT T T T GT AT AT AGT TTTTATGCCATTTT T GT G AT G T T TT GT G 
TCTACGGTTTATGTCATGTTATTTTAGTTAAATTTCTTATGATTTATGTTTATTTGTAATATTTTTTGTCATTGTTTGTTCATCATCATATTCAA 
ATTGGTCTCACAATATAATAGTTTTAAGCTCCACGCCCGGGAGATTGATGGCAAAACGATTGAAATTTGGCCAGAAGAGAGATAGTTTTCCCCAT 
TCGTACACAGTCTTTTTTGGAATGCACATTAATGATCTCTCACAATGGAAATTAATGAAAATTGATCTCCGCAGCTAGCCAAAGTTAAAAAAGAA 
ATGAAGAGGAAAACATATTCTATAGGCAATTTTCACTATATGCTAGAATTTCCCGGGCGTTTCAATGCTAATCGAATACAGTGACATGAAAGCAA 
ACATAGCGAAAATATTAAGAAAATCAATCAAAAAGAAAGAAAAACCAATTCCCAAAAATCGCATTGATCTCATGGATTTATACAATACAATTACA 
TCAACCGTTTTTTTACAATGAGAAATGTTATAAAAAGCAGAAAGTGAAACACAGAAACATAAACAAAAATTAACGAAAAGCTTAGATATAAGTTC 
GCCAAGCGTTTTAGTTCTATTTTCTAGAATGTCTAAGTTGGTTTAGTGAGTTTATTAAGCTGTCTTCGGACACAAGTTTATTTGTATATAAGCAA 
TATTATTTGTGTAGCCTAAGTGACAGTCCCAATCAAATCCAATCCAATATCACCCAGTCCCGGACATTTCCCAGCAAAACAATAGACTATTCTCG 

CGT T C AC AT GT ATC AAT CT T AAT T T G AAT T AC C AC AAAAT G AAAT GAAAT AC T AAAACC AT AC AC AAATG AAAAAT T AT TT T T GT AAAT T GT T TG 
CATCAAGTGAGCAAGGGGATTAGATTAAGGAATCATCCTTGCTTTATCCCCTGCTTATTGCTAATTAGTTTTCACAATGATCTCGGTAAAGTTTT 
GTGGCCTTGCGCCCAAAAGTCGTACAGATTTTTGGTTTGCCATAAATACTCGAACAAAAAGTTAATGAAAAACGAAGCAAATGGAAAAAAAATCA 
GAATGAAACACAAGAAATTTATATTTTTGACCCAATGCTACTTAATCCGTTTTTGTAATTTAAGTATCTTTACTCGACCTTGTATATAGCGCAGT 
T C G AAT C AC AG AAT C AAATGC C AT T T T T GT AT AGAAT TTTATTTGGT GC C AAAAC AGT G AC AG AT AAT T AAAT GT C T AT G AACC C GT GT AT T T CG 
CAT ATT AT AC AT T T AT AC AT AT AT C GT AAC T T C AAT G AT AAGT T T GAT T C T GAAAT T TT G T C AAC T C AAT T T AAGAAAC AT T TC T GT T GT AGT T T 
AGT GAT T GC T AGC AGAAAGC AC T T T GT T T AAT TGT AC AT T T TAT AT TAT GC T GT AAT AT T TT AAT AT AC AT AAAT AT CAT TAT T GAT C T C AT G AA 
T AT GT T CAT AAGAC AAC AAAAAT TAT AT AT AT GAAT AC AT C T AT GT GT AT GT GT AAAT GAAAT AAAAGC G CAT AAT TAT TAT GT AAG AT T T AT GG 
AACAAATGAAAATAAAAGTGAAAAAAATATTTGAAAAAATTAAAAGTTTGTATATGATTTGAATGGGTTGTCATGGCATACAAGCGTTTTTTTTA 
TAAGGTTATTACTCTTCATAAATTCATCAATCTCAAACATTATTATTATTTAATTATGTTGAATAACCAAAGAAAAGTAAGTAAAAGTATTTATT 
TAGTATATTTTACATCGCAAGACATTTTACATATTTATAAAACTATCATTTTTCATTACCAATATTTTATTAAGATGGACGCAAATAAATGTAAA 
GGAATATTTATATATTTATTTATTTGAAATATTTTACATGGCAACTTATTTTACATGGATTATGGCAAGCTGATCACATGTAAAATACATTTTAC 

AAGG CC T AC T T T C AGG C AG GT AT G AG AAAAG AT C ACT G AAG CAT T GC GT GT GAG AG AGAG AG AGT TTTCCTCCTACCCT G AAAAAC G AAC CT T T T 
GTTATTGTTCAGGGTTGAAGTGTGCGTGAGGAAATATTTTTCTTCGATTTAAATGGCCACTTGGACGGCATATAACCAGGCGAAAGCTTAAACCA 
ATTTGTAGTTTCAATTCCTTTCGTGGCTACGGATTAAGTGGCCATAATTGCTGCTCCTGCAGCCGCGCCACGTGACTTGGTAACGGGATACTGTG 
CCGGCCACGTTGCTCGTGTCACAAAAATGATGTGTTATTAGCGGCTTAAGCAGGCGATGGCGATTATAATGAAACGCGGCACCAGGCTGGGCAAT 
TATGTTAACCTGGCCACCGAAAAACCTCAGGCTCAGCGGTCATGACATCCGTTTCCGCCTTAAATCACGCGACACATGCAAGTGTCAATCAAAAC 
GGTATATTTAATCAGAAACTGATATTTTCATTGAATGCAGGCGCGTGTGCGAATCAGTTGAACGGACATCAAAAAGCAACTGCTAATCGGAGATT 

GGAAATCTTACA 
(SEQ ID NO: 556) 

Exon: 1001.. 1817 
Exon: 13055.. 13562 
Exon: 14237.. 14549 
Exon: 16518.. 18867 
Start ATG: 1487 

Transcript No. : CT16413 

ACGTGTTTCCTGCCCTCTTTCTTTGAGTCTGCGACACGTTTTAAGTGCTCTTCCATAATTGACAACAGCAAAAGCAAAGAATAAAAAAATAACAA 
AAAAT AAAAAACGAAATCCATCGTGAACAGTTTTGTGTTTTTAAATCAGTTCT AAAC AC GAAAAGGGTTG AT GAAAAACGCAG AAG AATCCGAAA 
AAC T AAC T AAC C GAGC AAAAAC T T G AC T T G AGT GT T GT T T G AC AAAT C AG GAAAG AT AAAAAAC AAAT CAT AAG AAAAAAC T GC AC G AAAAAT GA 
AAAAGT T T CT AAT AT T C AAAAT CT T G C AC AAG AAAT AC AAAAT C AAT T AAAGT G AAC T CT AAC C AAAAG T T G T AC AC AAAAT AAAAAGC AAAAC A 
AAGCAGCGAAGAACAATCACAAGAAGAGCAAAGTGCCAACAAAGTGCAGGAAGGAAGGAAGCGGATAAGGACAAAAAGGAAGCCAGCACACACAC 
ACACCCACACAATGGCCGTGCCCTTTTATTTGCCCGAGGGCGGCGCCGATGACGTAGCGTCGAGTTCATCGGGAGCCTCGGGCAACTCCTCCCCC 
CACAACCACCCACTTCCCTCGAGCGCATCCTCGTCCGTCTCCTCCTCGGGCGTGTCCTCGGCCTCCGCCTCCTCGGCCTCATCTTCGTCCTCCGC 
ATCGTCGGACGGCGCCAGCAGCGCCGCCTCGCAATCGCCGAACACCACCACCTCGTCGGCCACGCAGACGCCGATGCAGTCTCCACTGCCCACCG 
ACCAAGTGCTATACGCCCTCTACGAGTGGGTCAGGATGTACCAGAGCCAGCAGAGTGCCCCGCAAATCTTCCAGTATCCGCCGCCAAGCCCCTCT 
TGCAATTTCACGGGCGGCGATGTGTTCTTTCCGCACGGCCATCCGAATCCGAACTCGAATCCCCATCCGCGCACCCCCCGAACCAGCGTGAGCTT 
CTCCTCCGGCGAGGAGTACAACTTCTTCCGGCAGCAGCAGCCGCAACCACATCCGTCATATCCGGCGCCATCAACACCGCAGCCAATGCCACCGC 
AGTCAGCGCCGCCGATGCACTGCAGCCACAGCTACCCGCAGCAGTCGGCGCACATGATGCCACACCATTCCGCTCCCTTCGGAATGGGCGGTACC 
TACTACGCGGGCTACACGCCGCCACCCACTCCGAACACGGCCAGTGCGGGCACCTCCAGCTCATCGGCGGCCTTCGGCTGGCACGGCCACCCCCA 
CAGCCCCTTCACGTCGACCTCCACGCCGTTATCGGCGCCAGTGGCGCCCAAGATGCGCCTGCAGCGCAGCCAGTCGGATGCGGCCAGACGCAAGC 
GATTGACCTCGACGGGCGAGGATGAGCGCGAGTACCAGAGCGATCATGAGGCCACTTGGGACGAGTTTGGCGATCGCTACGACAACTTTACGGCC 
GGCCGGGAGCGTCTGCAGGAGTTCAATGGACGCATCCCGCCCCGGAAGAAGAAGAGCTCCAATAGCCACTCGAGCAGCAGCAATAATCCAGTCTG 
CCATACCGATAGCCAGCCCGGTGGTACATCCCAAGCGGAGAGCGGTGCCATCCATGGCCACATCAGTCAGCAGCGACAGGTGGAGCGAGAACGAC 
AAAAGGCGAAGGCCGAGAAGAAGAAACCACAGAGCTTCACTTGGCCAACTGTTGTGACCGTTTTCGTTTTGGCCATGGGCTGTGGCTTCTTTGCG 
GCGCGATGAAAGCGCAGGAGACGTGTAATCGAATGATCTATAGTGAAATCAGCTAGCCCTTAAGATATATGCCGATCTAAACATAGTTGTAGTTA 
AACCGTACATAAGTGCAACGAATTTATTGAACTGCAGGAGCGAAAGCAGAAAGTCATTAATTCGTAAACGGATTGTTAGATACACAAACAGCCAA 
CATACACGAAGAGTGTGCCTAAGATTAAGAAGGTTGACGGGACACAAGAACAATATATTCTATCTGTCTATGGTAACTGCATTTGTATTTCTAAA 
AC GAAACGAAAGAT AAC AAT CTTAACTGCTC AAAGT AAT GAAAACTCTTAGACTGGCAAGAGACTC AAAT CACACTTATTTTTTTGCTGATCCAT 
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ATTTTTGTACAACCTTTTGAGCGATATTTACAAATTATACTAGTACAAAAAAAAGAGAGAGAGAGATAAGCAAAAGAAAACTGCCACTTTTGAGA 
TACTTTTGATAATCTTTGATTTGCATTTAATCATTTCCACACTTGCATTTTTTATAAACAACAAACAAAATTACTTCCATTGTAGAACAAAGTAA 
ACTGCAATTTCAATGTCTTCGCATTTGTAATTCCGAATTGCAAGAAAAACAAAAATATTTTAAATATGTTTAACTAGTAGAATTTTTTAAACGTA 

AGT C C AC AAAAAC AAG C AC AT C T AGC T T T AAT T GTT G AAAC AAAAGC AGAA AAAAC GC AAC AAAAAAAT G AAT G AAAAT CAT T AAAT T AAT T T T G 
TATATAGTTTTTATGCCATTTTTGTGATGTTTTGTGTCTACGGTTTATGTCATGTTATTTTAGTTAAATTTCTTATGATTTATGTTTATTTGTAA 
TATTTTTTGTCATTGTTTGTTCATCATCATATTCAAATTGGTCTCACAATATAATAGTTTTAAGCTCCACGCCCGGGAGATTGATGGCAAAACGA 
TTGAAATTTGGCCAGAAGAGAGATAGTTTTCCCCATTCGTACACAGTCTTTTTTGGAATGCACATTAATGATCTCTCACAATGGAAATTAATGAA 
AATTGATCTCCGCAGCTAGCCAAAGTTAAAAAAGAAATGAAGAGGAAAACATATTCTATAGGCAATTTTCACTATATGCTAGAATTTCCCGGGCG 
TTTCAATGCT AAT CGAAT AC AGTGACATGAAAGCAAACATAGCG AAAAT ATT AAGAAAATCAATCAAAAAGAAAGAAAAACCAATTCCCAAAAAT 

C GC AT T GAT CTCATGGATTT AT AC AAT AC AAT T AC AT C AAC C GT T T T T T T AC AAT GAG AAAT GT T AT AAAAAGC AG AAAGT G AAAC AC AG AAAC A 
TAAACAAAAATTAACGAAAAGCTTAGATATAAGTTCGCCAAGCGTTTTAGTTCTATTTTCTAGAATGTCTAAGTTGGTTTAGTGAGTTTATTAAG 
CTGTCTTCGGACACAAGTTTATTTGTATATAAGCAATATTATTTGTGTAGCCTAAGTGACAGTCCCAATCAAATCCAATCCAATATCACCCAGTC 
CCGGACATTTC CC AGC AAAAC AAT AG AC TATTCTCGCGT T C AC AT GT AT C AAT C T T AAT T T G AAT T AC C AC AAAAT G AAAT G AAAT AC T AAAACC 
AT AC AC AAAT G AAA AAT TAT T T T T G T AAAT T GT T T GC AT C AAGT G AG C AAGGGG AT T AG AT T AAG GAATC AT C CT T GC T T T ATC C C C T GC T T ATT 
GCTAATTAGTTTTCACAATGATCTCGGTAAAGTTTTGTGGCCTTGCGCCCAAAAGTCGTACAGATTTTTGGTTTGCCATAAATACTCGAACAAAA 
AGT T AAT G AAAAAC GAAGC AAAT GGAAAAAAAAT C AG AAT GAAAC AC AAG AAAT TT AT AT TTTTGACC C AAT GCT AC T T AAT CC GT T T T T G T AAT 
TTAAGTATCTTTACTCGACCTTGTATATAGCGCAGTTCGAATCACAGAATCAAATGCCATTTTTGTATAGAATTTTATTTGGTGCCAAAACAGTG 

ACAG AT AAT T AAAT GTCT AT GAACC C GT GT AT T T C GC AT ATT AT AC AT T TAT AC AT AT AT C GT AAC T T C AAT GAT AAGT T T GAT T C T G AAAT T TT 
GTCAACTCAATTTAAGAAACATTTCTGTTGTAGTTTAGTGATTGCTAGCAGAAAGCACTTTGTTTAATTGTACATTTTATATTATGCTGTAATAT 
TTTAATATACATAAATATCATTATTGATCTCATGAATATGTTCATAAGACAACAAAAATTATATATATGAATACATCTATGTGTATGTGTAAA 

(SEQ ID NO: 557) 
Start ATG: 487 

MAVPFYLPEGGADDVASSSSGASGNSSPHNHPLPSSASSSVSSSGVSSASASSASSSSSASSDGASSAASQSPNTTTSSATQTPMQSPLPTDQVL 
YALYEWVRMYQSQQSAPQIFQYPPPSPSCNFTGGDVFFPHGHPNPNSNPHPRTPRTSVSFSSGEEYNFFRQQQPQPHPSYPAPSTPQPMPPQSAP 
PMHCSHSYPQQSAHMMPHHSAPFGMGGTYYAGYTPPPTPNTASAGTSSSSAAFGWHGHPHSPFTSTSTPLSAPVAPKMRLQRSQSDAARRKRLTS 
TGEDEREYQSDHEATWDEFGDRYDNFTAGRERLQEFKGRIPPRKKKSSNSHSSSSNNPVCHTDSQPGGTSQAESGAIHGHISQQRQVERERQKAK 

AEKKKPQSFTWPTVVTVFVLAMGCGFFAAR* 
(SEQ ID NO: 558) 

Gene Symbol : W 

FlyBase ID: FBgn0003997 



Celera Sequence No. : 142000013384553 

ATAATATGGAAAATCTCAATTTAAATTTGTTGTTGGCTGTTGCTTTGCTTGTTGTTGCTGCCGCTGCTGTTGTTGTTGTTGTTGAATTTCACGCG 
AAAGCAGGCAAGACAACAACATAAAATCACAAACACTTGCACGCACATACGTCACACACACCCGCACACACACTTGGACAGCTGAGACATAAACA 

C ACGG AG AGC AAATC GC AGC G AG AAT GGG AGAG AG AAAG AG AC AG AG AGC G AT AACAAAT TTGTTTTCTGGTTTTTCT T C AC G AT TTTATTACAC 
GCTTTTCGAATGGCTAACCATATTTTGATTCACATTTAATTCGCTCCAATCACAGTGTGTTCCGTGCGCTTGCTAATTTTCGATTTTCCACCGCA 
GAACTCCGCCCGTTTTGCTGCTGTTTTGGCAATCGTCTCTCTCCTGGCT AAAAAC AT ATGATTTCGGTTGACGG AAT CACGAT AAAAAC AC AT AA 
ATGCGAACAGGGCGCGCGCAACAACTTCCCACGACAAACTCTCGCTTCGGAATTTGGAACGAAACTGCCACTTTTGCAGCGAATCGCGCTTAAAT 
ACCCGTCTAAAGATGGGAACGCATGGCGAACCGGTTCGATAGTTCGTGAGCGAACAGCCTTCGCACAGTGGTATGCAATCAAACTGAGCAGTAGA 
TTAAGTACTTTTGTTTAACAAACCGGATAGCTCGAACTTTGAAAATAAGTTTTGGATCGGCAGGAATTAAGACTGTGCACAATTAATGCTTTTAA 
TGGTGACTCAACACTTTGTGACGTCAGTAGTAGCATAAGTTCAAAATAATTAATTTGAGTATAGCTTTTGTTTAATGTATTGTATAGAATAACTA 
GTAATACTGCTTTTTTGACTGCGATAAACATTTACGTAATAAAAACTGCATATATTTGCTTTGTGACAACCCTATTTTATTTTGCGTGCGTCACC 
TTACATTTTGCTACCTTGCTGCATCAGCTGTTTGTGTTGTGCGGCGAAGAATGGATTTGACGCTGGACAAAAAGCTGCAGGGATTCCAGTTGGTG 
GCACAAGAAAAGGTATAATAATGAATCGGAAGAGTGAAACTATTAACTACCTGCAATTCGACTGCAGCTTTGTTCCATACTATGTAGCATACCTG 
GAAAGAAGGAGTTAATTTTGGAGCCCGACTTGATAAAGCCACTGGAACATGTTGTCACTGCCTCCTGGCTGAAGTAGGTGTTTAATCCATATTTT 
ATGGATATTTACGTCTTATATATACCCCTTAACAGGTTGAAGGGCATACAACGGATCTACAAACATGATGCGGCACAATCTTTGCCCCGATCCGC 
CGATCAGGTGCACATCTACATGATCCGCAGTGTTTTGGGTACATTTCAGACACTTCTCAAGCAATTGCAACCCGTGGCCTCAGAGGAAATGCCCG 
ACATCTCCATGAAAATGTATCACATAGTCTGTGTGCCCAGTTGTTACTCCTATTTTCAAACCCTGCTGGAACAGGCTGGTTTGTACGGACTGGTG 

C AAC T T CAT C ACT T T AAC T GGG ACT T CATCTATTTT GAT C AGGG AGT T CT C AGC T T GG AGC T GC C C AAT GT GAGT AG AT GT AC AT T G AACAT T C A 
GAAAAGAGTCTTTTATACTGGATATTCTTATTATTCAGCTTTACGAGTGTCTCTACCTGCAGAAGAACACATCTCCATTGCCTGCGGTGGCCCAG 
AGCCTGCGATTGCTTCAGATGATTTGCGGACAACCGTCTTTGATTCTGAGCTTTGGCCACCACTCGTCGCAGCTAATGCAAATGGCGAAGACTCT 
TGGCAAACTTCCGGCGCCCACAAATCCTCCAGACTATGGCGGCTGGCTGGTCATCGATCGGGATAAGGATTACCCCGCCAGCTTGCTCACTCCGG 
CCATCTATGCAGGTCTTCTCTTGGAAGTTTTCGAGCATAGTTCTGGCGAGATTCTGGTAGACAATTCCAAGAACAAGATCCGCAGCCAGCGTGTG 
GAGTTACTCCAGGGAAAGAAATCCAAGATAGGCGTAAACTCAGCCAGCAAGCCGTGTTCCATTCGTCTTAACTCCACATCCGATGAGATCTACGG 
CGATAATCGGTACAAACGATTTGCTCAGGTCAGCAGTTTGATTCACGCACAAGTAAAAGCCCTGGGCTTGGAACTGCAGAAACTCAATGACATGC 
AGCTGGACGAGATGCATGACTATGTGGCCAGGAAGCTACCCAAACTAACCGAGCTTAAGAGCAAAGTCCTTAGGCATCTGAATGCCAGCGAGATT 
GTGATCCAAATGATGGGTAACTTCAGGAAGCTCCAAACGCTGGAGGAGGACATACTAAACAATGATTCAAGAAAGCGATTGCTAAGCGAGATCGA 
CGAGCTGTTGACCACAGATGGTCAGAGATTTAATACACTGCGACTGCTGTGCCTCTTGCATCATTGCGTTGGCGTGGCGCCCGAGGAGCTGCAGA 
TATTTGCGCGGAACTACTGCAATCTGTTTGGACACCAGGAATTGGGCGTGTTCCAGCAGTTGTCGCAGGCGGGCTTGTTGCCACCACTTGTGGTG 
GAGAAAAAGGCGCC AAC GAAACT GCT CTC AAAT CTACCTCTGCCCAAGTTCC AGC AAACGGAGTTTCAGGCC AAT GCTAATCGGCTAAAGCT GTT 
GACCTCCAGTGGAGATGGTCAGGATGGAGGTTCCTCCTCCAAGAGCCAATCCAGTGGCTTGCAGACCTGTCCCAGTTTTGTGTTCAATGGCACCT 
ACATACCATTGGTGGCTCAGCTCTGCTCCATTTTGCTGAAAATCAATAGTGCTGAGGAGTTGTCCTCGAAGCTGGGCATGATTGAGGGGCTGCAT 
CTACACCTGGACAGCGGAAAAATCACGCCCAAGGCCTATGCCAGCCAAGTGAAGCTAAACGGAGTTGCAGATCAGGATATGTTCCCCTTGCGTCT 
CAGGAATCTCTTTGTATTCATTGTGGGTGGTGCCAGTTATGCGGAAATCGCCGCCTGCGATTTTGTGGCAAAGCTAACAGGCGCCCAAATAACCG 
TGGCATCCGACTCTCTAATGGCGGGCAGTGATTTGATCGCCACCGCCTTCAACCGCTGACCATGAAGTAACTAAGAATCTGTGTATCAAATGTTT 
TATATATTTTTATACAAAACAATGATGACAAAGCTTGTGATTAACAATTTTCGAAATTTACAATTTAAACAATGCTCTTTTCGAATAAATATTAC 
CTAAAAGTGGCATTTGCTATTTATCACCACTGGCTCTGGTCATTCCAGGCACTCGGTAGAGCAGCTCTCCATCGTCATCGTAATAGGCATCCTCC 
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ACCCGGTAAGTGGCTCCTTGGCGATGCAGTTTCCGGGGAATAACAATGCGTGTTTTCACCTGGGTTATTTCATACTTGCGCTGATCCGCGGCACT 
GATCACAATCTCGCTGCTCGCAAACTTGGGCACGAAACTGGGCAGGAATCCGGTGCCACTGGGATTCTCGATGGGCCGCTTCATTTCCTCTGGAA 
CTGCGGGCGCAGACTCAACGAGCTGCAATTCCTTAAGCTCCTGCAGCCGCATCTTCTTTTTATTACGGATTTTGTCCAACAAATAGGTTTCATCT 
TCGGCAGCAGAGGGTGGAGCTTCATAGCCGGATGGGATTTGTGGTGTAGGCATCTCCTCTGTTGTTTTACTACTGCTGCCCAGCAGTTTGCGGAA 
TTCGTTGCCTTGCGGAGTCGGTAGTTGTGATTTTGTGGGCCAATCCTGGATTTTAAGCTCCCGCAGATCCTGCCACCTTCGCTCGGCGAATTTCT 
GCAGATGCTGGAGAAGAGACGGTGGGATACTGAAGTCACCCTTTCCCGTCCCTAAAAGCTGCCAGTTTTTAATGACGGTTTCATCGTGGGAGCGT 
ATCCTTTGCACACTTCTCGGTCGCCACTTGGCCCGCAGAATTTTCTGCACAATGTCCGGAGTGGCGGGAAAACTTTCGGCCAATCGCTCAACGCT 
CCATTCCTCGGGATCTCGCTCGTGCAGCAGGCGCATCTGCTCCTTTTCGGCATGAAGCAAGAGATTGGGCAACTTGGCGTCTCGGAAATACTTGT 

GCTTTATCATAAAC 
(SEQ ID NO: 559) 

Exon: 1001.. 1057 
Exon: 1113.. 1213 
Exon: 1271.. 1589 
Exon: 1654.. 3099 
Start ATG: 1001 

Transcript No. : CT16445 

ATGGATTTGACGCTGGACAAAAAGCTGCAGGGATTCCAGTTGGTGGCACAAGAAAAGCTTTGTTCCATACTATGTAGCATACCTGGAAAGAAGGA 

GTT AAT T T T G GAG C CCGAC T T G AT AAAGC C AC T GG AAC AT GT T GT C AC TGCCTCCT GGC T G AAGT T G AAGGGC AT AC AAC GG AT C T AC AAAC ATG 
ATGCGGCACAATCTTTGCCCCGATCCGCCGATCAGGTGCACATCTACATGATCCGCAGTGTTTTGGGTACATTTCAGACACTTCTCAAGCAATTG 
CAACCCGTGGCCTCAGAGGAAATGCCCGACATCTCCATGAAAATGTATCACATAGTCTGTGTGCCCAGTTGTTACTCCTATTTTCAAACCCTGCT 
GGAACAGGCTGGTTTGTACGGACTGGTGCAACTTCATCACTTTAACTGGGACTTCATCTATTTTGATCAGGGAGTTCTCAGCTTGGAGCTGCCCA 

AT C T TT AC G AGT GT C T CT AC C T GC AG AAG AAC AC AT CTCCATTGCCTGCGGTGGCC C AG AGC C TGCGATTGCTT C AG AT G AT TT GC GG AC AAC C G 
TCTTTGATTCTGAGCTTTGGCCACCACTCGTCGCAGCTAATGCAAATGGCGAAGACTCTTGGCAAACTTCCGGCGCCCACAAATCCTCCAGACTA 
TGGCGGCTGGCTGGTCATCGATCGGGATAAGGATTACCCCGCCAGCTTGCTCACTCCGGCCATCTATGCAGGTCTTCTCTTGGAAGTTTTCGAGC 
ATAGTTCTGGCGAGATTCTGGTAGACAATTCCAAGAACAAGATCCGCAGCCAGCGTGTGGAGTTACTCCAGGGAAAGAAATCCAAGATAGGCGTA 
AACT C AGCC AG C AAGC C GTG T T C CAT T C GT C T T AACT C C AC AT C C GAT GAG AT C T AC G GC GAT AAT C GGT AC AAAC GAT T T G C T C AGG T C AGC AG 
T T T GAT T C AC GC AC AAGT AAAAGC C C T GGG CT TGG AAC T GC AG AAACT C AAT GAC AT G C AGC T GGAC GAG AT GC AT G AC T AT GT GGC C AGG AAGC 
TACCCAAACTAACCGAGCTTAAGAGCAAAGTCCTTAGGCATCTGAATGCCAGCGAGATTGTGATCCAAATGATGGGTAACTTCAGGAAGCTCCAA 
ACGCTGGAGGAGGACATACTAAACAATGATTCAAGAAAGCGATTGCTAAGCGAGATCGACGAGCTGTTGACCACAGATGGTCAGAGATTTAATAC 
ACTGCGACTGCTGTGCCTCTTGCATCATTGCGTTGGCGTGGCGCCCGAGGAGCTGCAGATATTTGCGCGGAACTACTGCAATCTGTTTGGACACC 
AGGAATTGGGCGTGTTCCAGCAGTTGTCGCAGGCGGGCTTGTTGCCACCACTTGTGGTGGAGAAAAAGGCGCCAACGAAACTGCTCTCAAATCTA 
CCTCTGCCCAAGTTCCAGCAAACGGAGTTTCAGGCCAATGCTAATCGGCTAAAGCTGTTGACCTCCAGTGGAGATGGTCAGGATGGAGGTTCCTC 
CTCCAAGAGCCAATCCAGTGGCTTGCAGACCTGTCCCAGTTTTGTGTTCAATGGCACCTACATACCATTGGTGGCTCAGCTCTGCTCCATTTTGC 
TGAAAATCAATAGTGCTGAGGAGTTGTCCTCGAAGCTGGGCATGATTGAGGGGCTGCATCTACACCTGGACAGCGGAAAAATCACGCCCAAGGCC 
TATGCCAGCCAAGTGAAGCTAAACGGAGTTGCAGATCAGGATATGTTCCCCTTGCGTCTCAGGAATCTCTTTGTATTCATTGTGGGTGGTGCCAG 
TTATGCGGAAATCGCCGCCTGCGATTTTGTGGCAAAGCTAACAGGCGCCCAAATAACCGTGGCATCCGACTCTCTAATGGCGGGCAGTGATTTGA 

TCGCCACCGCCTTCAACCGCTGA 
(SEQ ID NO: 560) 

Start ATG: 1 

MDLTLDKKLQGFQLVAQEKLCSILCSIPGKKELILEPDLIKPLEHVVTASWLKLKGIQRIYKHDAAQSLPRSADQVHIYMIRSVLGTFQTLLKQL 
QPVASEEMPDISMKMYHIVCVPSCYSYFQTLLEQAGLYGLVQLHHFNWDFIYFDQGVLSLELPNLYECLYLQKNTSPLPAVAQSLRLLQMICGQP 
SLILSFGHHSSQLMQMAKTLGKLPAPTNPPDYGGWLVIDRDKDYPASLLTPAIYAGLLLEVFEHSSGEILVDNSKNKIRSQRVELLQGKKSKIGV 
NSASKPCSIRLNSTSDEIYGDNRYKRFAQVSSLIHAQVKALGLELQKLNDMQLDEMHDYVARKLPKLTELKSKVLRHLNASEIVIQMMGNFRKLQ 
TLEEDILNNDSRKRLLSEIDELLTTDGQRFNTLRLLCLLHHCVGVAPEELQIFARNYCNLFGHQELGVFQQLSQAGLLPPLVVEKKAPTKLLSNL 
PLPKFQQTEFQANANRLKLLTSSGDGQDGGSSSKSQSSGLQTCPSFVFNGTYIPLVAQLCSILLKINSAEELSSKLGMIEGLHLHLDSGKITPKA 
YASQVKLNGVADQDMFPLRLRNLFVFIVGGASYAEIAACDFVAKLTGAQITVASDSLMAGSDLIATAFNR* 
(SEQ ID NO: 561) 

Name: VACUOLAR PROTEIN SORTING HOMOLOG -like 
Classification: transporter 



Celera Sequence No. : 142000013384253 

GGTCAACAAAACGGCACGGACCGGCGGACTGGACTCAGTGCCATATAGAATCAGGTTAGCCATTGTGGTTGCTTAACTAATTGCCTTCTAGCTGG 
TTGAATTGAGATTCGTAGCAGACAGTGTGCCATCCACAACTGACTCAAGGCAGTCTAAATAATTTAAATTTCTTTCAAACGCGATAAGATTAGAT 
ATTAAACTTAGTTTCTTGATAATTACCACAAAGACGTCATAATGCTGCAAGTTTAAGTTTGCCTGATTGTTTATTAATGTCCGGGATAAGTCGGT 
CTGCTATCTATTAGCTGTGTCAATATTTGATCAATGTCCCCGTGATTTATTTGATCATAGTTGGAGCACCAAAGGGGCTACTTATATACACTGCT 
AGAAACGTCCAAACGAAACCTAAAGTAACCAGACTCATTAGCAGTGTTTCTATGCAAACATTTTCATATAAATATAATTATAAAATTATGAGTTT 
AATATTATTACTTCTGTGTATCAGATCATACAATTTTATCCAAAATAACGATAAGCTTTATGCCAGCTTAACTTAAGCTTTAAGCTTTTAGAAAC 
TTTCAAGCTTCGTTAGGAAGACTTGAATTGTAAATTTACATGGGTTCTACATGATTCAAATAGATGTATATTACAATAAATCATAAAACAAGTAA 
CTGTATTAAATTAATTTCTTCTGACAACCAATGTTCCTGAGTTCACAGAGACCTACTTTGTTTTTGATGAGTATATGTTTTTTGTAGTTCACCAC 
TGCATATGGTGTTTTAAAATTAGTATCATATAGATGATTCCAGAACTATCAGTTGGGGGTAATTAATGTTTGTTATGATGAAATAGGCATACATA 
TTATATATATGGTTTTTTTTTAACTGTTTAACACATGGTTTCGAGCTTTTATCTTAGATTTACGAAATGCTCTCTCATTTAGCAAAAAGCTTGAG 
GCAATAAGGTGTGTATATATGTACAGTAAATAGAGAGATCAGTCGAAAGAAAAACACTGACGGGTAAACAACACGTATCATGTCGAGCTCTGGAA 
TTGTACTCTATGGGACAGATCTCAGTCCCTGTGTGAGGACCGTCAAACTTACCCTAAAGGTCCTGAATCTGGACTACGAGTACAAGGAGGTGAAT 
CTTCAGGCGGGCGAGCACCTGAGCGAGGAATATGTGAAGAAGAATCCCCAACACACGGTACCGATGCTCGATGATAATGGCACTTTCATCTGGGA 
CTCCCATGCCATTGCCGCCTACTTGGTGGACAAGTATGCCAAGTCGGATGAGCTGTATCCCAAGGATCTGGCCAAGCGTGCGATCGTCAATCAGC 
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GTCTCTTCTTCGATGCCAGTGTAATCTATGCCAGTATAGCCAATGTCAGCCGCCCGTTTTGGATAAACGGTGTTACCGAAGTGCCCCAGGAAAAA 
CTGGACGCCGTACACCAGGGTCTGAAGCTGCTGGAGACGTTCCTGGGCAACAGCCCCTACCTGGCCGGCGATTCGCTAACCCTAGCCGATCTGTC 
CACCGGACCCACTGTAAGCGCCGTGCCCGCTGCCGTGGACATAGATCCTGCTACATATCCCAAGGTCACCGCCTGGTTGGATCGCCTCAATAAGC 
TGCCCTACTACAAGGAGATCAACGAAGCTCCGGCCCAGAGCTACGTCGCCTTCCTGCGCAGCAAGTGGACCAAGCTGGGCGACAAGTGAAACTAA 
ACTTATATTTAAATATGACATTTCACAAGTTTGAATTCCATCAATTAACAACATACACATCCATTTCTAAAACGTAAACTTCTTCAATAAAATAT 
AACATATATATAAAATATTTAATTTATTTTTTTATTTAATTTATATTTATTTAAATATTTGAATTTATAATAAATACTGAAATTTATAAACTTTA 
ATATGGGACTTGAGTCAGTAATTATTATCTCACAGTTGGGGAACCCGACCGTAATATTTTTATCGCTCATTTATGATATCGTCAGTGTACAATTA 
CAAAAGTTTATTTAAGATACAAATGGGGCAAGCTCAGCGCACCCAATTGGGTTGTTTAGTTAAGCTCGTTTAGATTTCTGTGTGTGGCAAATTTA 
GTGGAACTTGGGTCTCTCTGGTGATCGCATCATGTCGGATAAATTGGTTTTGTATGGCATGGATATTAGTCCTCCTGTTCGCGCTTGCAAGCTGA 
CCTTGCGGGCCTTAAACTTGGACTACGAATACAAGGAAATGGATCTACTGGCAGGAGATCACTTTAAGGATGCGTTCCTCAAAAAGAACCCGCAG 
CACACCGTACCACTCCTCGAAGATAATGGTGCCCTTATCTGGGATTCACATGCTATTGTCTGCTACCTGGTGGACAAGTATGCCAATTCGGATGA 
GCTATATCCCAGGGATCTGGTGTTGCGCGCCCAGGTGGATCAGCGTTTGTTCTTTGATGCCAGCATTCTGTTTATGTCGCTGCGAAATGTCAGTA 
TACCCTATTTTCTTCGCCAAGTAAGCCTGGTACCCAAGGAGAAGGTGGACAACATTAAAGATGCATATGGCCATTTGGAGAACTTTCTAGGGGAT 
AATCCCTATTTGACCGGGTCGCAACTGACCATAGCTGATTTATGCTGCGGAGCTACTGCATCCTCGCTGGCCGCTGTTCTTGATCTGGATGAGTT 
AAAGTATCCAAAGGTGGCTGCTTGGTTCGAACGACTCTCTAAGTTGCCCCACTATGAGGAAGACAATCTGCGGGGCTTGAAGAAGTATATCAATT 
TATTGAAACCCGTATTAAATCTGGAGCAATAGTAATTTGTATTAAAAAAAAAATACCATCAAATGC 
(SEQ ID NO: 562) 

Exon: 1001. .1821 
Start ATG: 1030 

Transcript No. : CT16545 

AAAACACTGACGGGTAAACAACACGTATCATGTCGAGCTCTGGAATTGTACTCTATGGGACAGATCTCAGTCCCTGTGTGAGGACCGTCAAACTT 

ACC C T AAAGGT C C T G AAT C T GG AC TACGAGT AC AAGG AG GTG AAT C T T C AG GC GGGC G AG C AC C T G AGCG AGG AAT AT GT G AAG AAG AAT CCCC A 
ACACACGGTACCGATGCTCGATGATAATGGCACTTTCATCTGGGACTCCCATGCCATTGCCGCCTACTTGGTGGACAAGTATGCCAAGTCGGATG 
AGCTGTATCCCAAGGATCTGGCCAAGCGTGCGATCGTCAATCAGCGTCTCTTCTTCGATGCCAGTGTAATCTATGCCAGTATAGCCAATGTCAGC 
CGCCCGTTTTGGATAAACGGTGTTACCGAAGTGCCCCAGGAAAAACTGGACGCCGTACACCAGGGTCTGAAGCTGCTGGAGACGTTCCTGGGCAA 
CAGCCCCTACCTGGCCGGCGATTCGCTAACCCTAGCCGATCTGTCCACCGGACCCACTGTAAGCGCCGTGCCCGCTGCCGTGGACATAGATCCTG 
CTACATATCCCAAGGTCACCGCCTGGTTGGATCGCCTCAATAAGCTGCCCTACTACAAGGAGATCAACGAAGCTCCGGCCCAGAGCTACGTCGCC 
TTCCTGCGCAGCAAGTGGACCAAGCTGGGCGACAAGTGAAACTAAACTTATATTTAAATATGACATTTCACAAGTTTGAATTCCATCAATTAACA 

AC AT AC AC AT C CAT T T C T AAAAC GT AAAC T T C TT C AAT AAAAT AT AAC AT AT AT AT AAAAT 
(SEQ ID NO: 563) 

Start ATG: 30 

MSSSGIVLYGTDLSPCVRTVKLTLKVLNLDYEYKEVWLQAGEHLSEEYVKKNPQHTVPMLDDNGTFIWDSHAIAAYLVDKYAKSDELYPKDLAKR 
AIVNQRLFFDASVIYASIANVSRPFWINGVTEVPQEKLDAVHQGLKLLETFLGNSPYLAGDSLTLADLSTGPTVSAVPAAVDIDPATYPKVTAWL 

DRLNKLPYYKEINEAPAQSYVAFLRSKWTKLGDK* 
(SEQ ID NO: 564) 

Name: glutathione S-transf erase (GST) 
Classification: enzyme 

Celera Sequence No. : 142000013384541 

TTGGGCAAAACGAGAACAACAAATTATTTTGAACACACACACGTATACATATGCGTACATATGTACGATTTCATGTGTGTTTGGCCCTACACGCA 
CCTTTGTTTTTGTTAGCACAGATTCTTTTTTTTCGGCCTCCCTTTTCGCTCCTTCGGCCTTGGCAACAATAACAACAACACACAGCAGTGGTAGG 
CGTTTCGCACTGATTTTCGATTCCAAAAACACAGGTGCTGTATTCTCTTTAACCGCGAGCTAATTCTTTGTCTTTATTTCGGATTTTTCTATTTG 
TTAATGCACATTTTCCGCGCGATTCTCGGACTCCGCCTGCGAAGCTCCAAAGAGCAATTGACAAAAACATCGAGAACTGCGCTCGGCGTTGCCAT 
GCTGACTGCGCTGCCAGACGGCGTGGGTGGAGTCCACGCTGGAGGTAAGCCCAGACCGCAACCCTCTAACTGTGGCACAGTGGTGCAAATTAAGA 
GAGGACTGCTGTCAAAAATTGTTTATAAAATTTGTTCCAGAGCCGTTATGTCATTGTTTCGTCATGTCAATTGTAAAAGAAAGCACCCTTTCAAA 
ACGTTGTAAATAATTTCAACAAGTGCTACAGATATTTGTGTTATTCACTGGCATACAAGTTCGTTTAAACAAAATATTAAACATAAATAAACATT 
AAAATATATTATTAAACAAAATATATTCTTGACTTGCAATAGATATGCCGTATCACTCATAAATGCAATACAAAGGGTAAATCATTATTTCCTGG 

C T T AAAAT T GT AAT G AAAC G GAT AAT G AAT C AC AT AGC T AT T C AG T T TT AGAG AAT T T GT ATG C AC T T T G T C AT AT T T AAC GT T AAAGT C GAT C A 
CTTTTTATTTAAGGGGCTAAAGTTGTTTTACTTTATTTGCCTAATTTAAATTTTTGCTCCTTATATTCAAAAATGCAGGCTCGATACCATCTCGA 
CCGCGATGATTTGTTTGGGTTGGCAACGCCCCGACGTTTGTGGCGCAGCCAACTTCACGCGAGACGCAGCGAAAACAAAAAATACTAAGCTGAAG 
TTAATTGAGAAAACCCCAAATAAAACGCGAGTGAAAGGGACCATGAACTTAGACCTGCGTTCGTACTCACGCCACTGGCTCACGGAGTTTATCGA 
GCAGTACCAAGAGGAGGAGTGCCTCTGGCAGCCCAAGCACAACGACTACAGCAATCACACAGCCCGTAACAAGTCCTACGATCGCCTGGTGGAGA 
AGCTAAAAGAAGTGGAGCCCAATCCGGACAGGGCGATGGTAGTAAGGTGAGAATTTCCATGTCAAATGTGCCACAGTTCAGTAAACACAATTCAC 
GCTTTACAGGAAAATTAACTCACTGCGGTCCGCTTTTCGGCGGGAATTCCGCAAGACGAGTACCAAAGGCGACTACGCAACGCGTTTGTGGTACT 
ACGACAAGCTGCTTTTCATCGCTGACCACAAGCCCAAGCGCCACGAACTCGGCTCCAAGCCCAAGAGAGAACTCCATATCAGCTTCGACGACGAG 
GAGTCAATGGAGTTCGAGGACGACTCACATCACACGGGCACTCAGTCTCAGCACATGGAGTCCATAATACCCACGTCCCCGGACGATGTGGAAGA 
AGTCGCGGCGACGGCCAACAATGTGGTCGTCAGCAGTCAGGGCGCCACTCTGAGCACCATTTCGGTGACGCCCGCGGAATGTGTGACCCTTGTCA 
AGAGCGAGGAGCACCAGGCGGCCGAAGCAGCGGCAGCCGCAGCCCAAGCACACCAGCAAATGGTAGCCCATGCAGCAGCCCAGACCTCCATTGCG 
GCGGCGGCGGCTCAGGGACATGCCGTGAAGGTCTTAGAGATCACCTCCCTAGACTCCAACTCCCAGCGCGAGATACAACAAGTGAGTAGCTCCAA 
AGCAGTGAAGAGTTCCTTAGGGTCTTCAATCACCTCCTCTCCCCACGCAGGCGGTCAATCATCTGGAGCACCACCAGCAGCAGCTCCACCTGCAG 
CAGACGAATGGCCAACATCAGGGCGTTCCCACCATCCAGATAGGCCGCGATCACTATCAGCCATTGTTTGGCAATGCCGGCACCACTGCCTATAC 
CACCACAGCGGCTACGAGCACATCGCACCGGCAAGACGACGAGTACGATGCCATTGGCGTGAATGTGGCGAGCAAACTGCGCTCCATCAACCCGA 
CGCAGCGGATCGTGGCCGAGAAACTGATCAGCGACGTTCTATTCAACGCACAGCTGGGCAACCTCACCGTTCACTCGGCCCTCACGCAGTAATCT 
CTCTTGTCCCCCTAGACTAAGTATGTGTAGTCTCTGGCTTTATATCTCCATTTTGTCTGTGTCCCAACCTGCAGCCCGGCCAACATGTGTCGCCC 
GGGCAGCAGCCGATTGTACCATACAGTTTGCATGTAGTTTTAATTATTCTATACAATAAATAGATTGTATGTCTTAAGCTCATCACCAGAGGGTT 
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TGTCATTCCGTCGAACTTCCTGCGCCCGTTAGCTTGTTCCACACAAACCTGGCAAACTGGACAAAAAGAGGAGCTGCTGTCTTGAGGTGATCGCC 
TTAAGAATTAGGACTAACATACCGTGACCGGCGGCATCTCGATCTGTTCCGCCTCCGTGACCAGTTTTCGGTTGCTCTGCATGTAGCTCCAGCCC 
AGATGGATCCCCACCAGACTGGGGATCATCACGACGGCCACCAGGTTGCGCGACACGAACGAGGGCTGCTTGCTCATCTTTACAAAATTCGTAAA 
ACGGAATTATTATTATTTAACAGCTATCACAAATCAACAATGACGATGTGACCGCATTGCGATGTTTGTTTAATCTTATCAACGAAAAATACCAA 
CCTATAACGCTGCCCCAAAACGGGCGGCTAGTTTTGAATAAAGTTTCTATCTCCCCCAACTTGGTCCAAAGTCCAACGTTTTCGGGGATAATAAT 
GGGAAACTAAACGTCTCGCAATTCTTTGTCTGGGACTTCGATTGCTTGGGCAGATAAGCATCTGTCATTAGTTTGTTGCGGTGGCTGGTTTTAAT 
AACAAATGCGGTGGAGGTGGTTAGAGTAGATCAATGGCATTGTAAAGGGGAACTTCACTTCTCGTAGGGTATATTAGTTGCTGCGAAATTCTTTA 
AAGACTTTTTGTGATTATAGAAGGCCTGAAGTAACTGGTAAGGTCCTTCGCTTTTATAACTAACATAAGATTATAATAGTTAAAAAACAAAAATG 
TGTAATTATACTACTTTACGTACAAGTTAAAATTGTCTAAGCTTTGAATTCAAATAACAGAAAGATGAAAAATAATTTTAGGTGCTTAAAGTTCT 
TAGTGGACAAACTAATTCGGAAAAATTAGAAAATATGTATGAAATAATTATAAGCTGAAATCTGAAATTTTAACTTAATCTGTTTTAAAAACCGT 

AAAATAAGCTATAAATTTATCGTCACAATAAAACCA 
( SEQ ID NO: 565) 

Exon: 1001.. 1281 
Exon: 1340.. 1886 
Exon: 1951.. 2456 
Start ATG: 1088 

Transcript No. : CT16581 

AAC T T C AC G C G AGAC GC AGC G AAAAC AAAAAAT AC T AAGC T G AAGT T AAT T GAGAAAAC C C C AAAT AAAAC GC G AGT G AAAGGG ACC AT G AAC T T 
AGACCTGCGTTCGTACTCACGCCACTGGCTCACGGAGTTTATCGAGCAGTACCAAGAGGAGGAGTGCCTCTGGCAGCCCAAGCACAACGACTACA 
GCAATCACACAGCCCGTAACAAGTCCTACGATCGCCTGGTGGAGAAGCTAAAAGAAGTGGAGCCCAATCCGGACAGGGCGATGGTAGTAAGGAAA 
ATTAACTCACTGCGGTCCGCTTTTCGGCGGGAATTCCGCAAGACGAGTACCAAAGGCGACTACGCAACGCGTTTGTGGTACTACGACAAGCTGCT 
TTTCATCGCTGACCACAAGCCCAAGCGCCACGAACTCGGCTCCAAGCCCAAGAGAGAACTCCATATCAGCTTCGACGACGAGGAGTCAATGGAGT 
TCGAGGACGACTCACATCACACGGGCACTCAGTCTCAGCACATGGAGTCCATAATACCCACGTCCCCGGACGATGTGGAAGAAGTCGCGGCGACG 
GCCAACAATGTGGTCGTCAGCAGTCAGGGCGCCACTCTGAGCACCATTTCGGTGACGCCCGCGGAATGTGTGACCCTTGTCAAGAGCGAGGAGCA 
CCAGGCGGCCGAAGCAGCGGCAGCCGCAGCCCAAGCACACCAGCAAATGGTAGCCCATGCAGCAGCCCAGACCTCCATTGCGGCGGCGGCGGCTC 

AGGG AC ATGC C GT G AAGGT CT TAG AG AT C AC C T C C CT AGAC T CC AAC T CCC AGCGC G AG AT AC AAC AAGC GGT C AAT C AT C T GG AGC ACC AC C AG 
CAGCAGCTCCACCTGCAGCAGACGAATGGCCAACATCAGGGCGTTCCCACCATCCAGATAGGCCGCGATCACTATCAGCCATTGTTTGGCAATGC 
CGGCACCACTGCCTATACCACCACAGCGGCTACGAGCACATCGCACCGGCAAGACGACGAGTACGATGCCATTGGCGTGAATGTGGCGAGCAAAC 
TGCGCTCCATCAACCCGACGCAGCGGATCGTGGCCGAGAAACTGATCAGCGACGTTCTATTCAACGCACAGCTGGGCAACCTCACCGTTCACTCG 
GCCCTCACGCAGTAATCTCTCTTGTCCCCCTAGACTAAGTATGTGTAGTCTCTGGCTTTATATCTCCATTTTGTCTGTGTCCCAACCTGCAGCCC 
GGCCAACATGTGTCGCCCGGGCAGCAGCCGATTGTACCATACAGTTTGCATGTAGTTTTAATTATTCTATACAATAAATAGATTGTATGTCTTAA 

GCTC 

(SEQ ID NO: 566) 
Start ATG: 88 

MNLDLRSYSRHWLTEFIEQYQEEECLWQPKHNDYSNHTARNKSYDRLVEKLK^ 

KLLFIADHKPKRHELGSKPKRELHISFDDEESMEFEDDSHHTGTQSQHMESIIPTSPDDVEEVAATANNVVVSSQGATLSTISVTPAECVTLVKS 
EEHQAAEAAAAAAQAHQQMVAHAAAQTSIAAAAAQGHAVKVLEITSLDSNSQREIQQAVWHLEHHQQQLHLQQTNGQHQGVPTIQIGRDHYQPLF 
GNAGTTAYTTTAATSTSHRQDDEYDAIGVNVASKLRSINPTQRIVAEKLISDVLFNAQLGNLTVHSALTQ* 
{ SEQ ID NO: 567) 



Celera Sequence No. : 142000013384633 

CAGTTCGGTGGCTTGTTAGCCGTATATACATACATATTATATATGATATTATTGTGATAAATATTTATCATTGACTCATTTAAAGAATTTTGAGA 
ATTCACGCGTTTGTAGCACATTCAAAATTTTTAAAAATTATTTTATTCATTTTAGACTGGGTTATTTGATATGGTTTTCTTATAATTAGTTCTTG 
AAGAATTCTTTGTAGTATTTTAATATTTGTAGATACAAACAGTAATTTTTTTTAACCATAATATTTTTTTTTTATTTTACTTTGCTTTTATTTAT 
TGAATCTTCTTGTATAAGAACTAACAATAAGTTTAAAAATCTTAACTATAACAAGTATTTTTTGAACAGTTGTATTATTTTTCCAACTGTTCTAT 
GTTTAAACGGCCCTAATAATTTATTCGCTGGGTTGGTAGGTCCTTTCGCCGGAGACGGGAACGGCTGCTGAGCTGGATTAGACCTCGCGCTAGGT 
GGTTGGGGTGCGTGTGAAGCGTGTGAACCATAATATGATATGAAAAGATATGATATTTACTATACCAATATTTTCGAAATATAAATAGAATAAAT 
AAATAAATACATATATACTACAAAGGTATATATTAGGTATATTTGAAACAAAAATCATATTTTAACGATAACAGATGCAGAAATTGTATGAAAGA 
CAAAAAATAATATTTGGTGAATATTATATTAGCCTGTGGTCAGGCTAAATATAAATGTAAATAAAATTCTTTATTGTTCTTACGGAAGTATTTCC 
AACTGCCT ACC AGCAGCATTTTTGGATTC AAAT GGT CTCC AAGC AGGAATGAATGC AGT C ACT AATTGGAATGTGTTC ACC AGC AT AGCGAGTCC 
AATTTAAAATCCATTCTTCAAATCAGAATTGTACAGAGTAGCCGAATTTAACTATACTTACAATGGTCATTCAATTTGGGATTTTTCTTTTCCAA 
CCTTATTCATATTATTTCATATCAGTTTTCAGAATAAAGAGACATTAGAATTTCCTTTTTTCGATTTATTTATTTCTTATATGATATTAGTGTTT 
ATAGCTAAAGTTTTACATGGAAGACAATGATTCTTAGCCTAATTCTTATACGAAGACCCTAACAAACAAACATGGAAGACAATGATTCGGCTAAG 
ACTAACTTAAGGCTAATAAGACACGATAAATCACAATGAGTTAACGATAGTAACGCTTGATTGTGATAATTGAATATCAATTAGAGTTGTGTTTC 
CCTTGGGACGCGAGCCCAAAGAATCCAATCTATGCCTAATATCACCCTGAGATTACGAACATGCGAAGCTGAACCCGCATCCATCCGACGACCAA 
TGGACGGAGACACTTGCACTTAAAGCTGCCAATTGGAGTTTTTCCCGGTGCATCGAGAAGTGGAACTGTGCATCCTGGCGATGTTTAGGCCTGAA 
CCCAACGGTCGAACTGCTGGCGCAACTGCTGTTCGTGGTGCTGGCGAGCTCCCTCGGCTGAGGTCCAGAAGAAGGTGCCGCTGGTGGTGCGGGCG 
AAGTGGACCGGGATGAAGGTCTGCTCCGGATGGACGGGCACGATGCTCTGTTGCTGACCCTGCTGCACGATCATGGCCACGTTGGTGGGGCACTG 
GCGGAGATCGGTGTACAGTCGCTGCTCCAGCTCCTCGTTGGCCATGTTATCGCAACTGGCCGCCGGATCTTTGCGCATGTCCTCCAGCGAGGCGT 
TGACTGCATTTTGGTTCTGGATCTCGGCCAGAGTCTTCCGGAGTAGCTGCTTCTTCTTCACCAGTCGCAGAAGGGGTTTGACCAGTTTCCTCAGA 
CTCCGCATGGCCATCGTGTGCTGATCTGATTTCGGCTCCATCACCTTGCTGCTCTCAAACTTGTATTCCCGTGTGATGAACGACATTGTGTTGAG 
GATTGTTGAGATTTATAATAAAACTTGGGATCTTAGTTGGTTGCACTCGGTTGAGAGTTGGTAGTATTCTGGTTTTCCTGCTGCACGACCGACTC 
CTTATATACGATTTGCTCTCTTTGGTTTCTCTCGGCGGGGGAGAATTCTCGCACAGAGCGGCGGTTCTCTCGATCGATGGACCCAGCCAATGGGA 
AGGCAGACTGCCTGCAGGTGGTTGCACAAACAGTGAGAGCGGGAGACAAAAGAGAGCCAAGAGAAAGCGGTCTCGGAATCAGGAGCATGTCCTTA 
CCCTTGAGAAAGCAGGGTATGATGAATTTTTGAGTCTGTGCATAAAAAATAGATTTTACGAAAAATAAGGTTAATATGTTTTTATTGTGTGTATA 
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TCTTTTAAGTGTATTTTTTCTAAAAAAAATTATTTATAATAGTAATATTGAAAGGGGCCATATAAGGTTGAATTTAATTGAATTGAATTCATTGA 
ATTTATAATAACTATAGTAATAAGATCATAAACTTGATGAAAAATATCAATATTTAAGGACTCTATTAAAATTAAACTTAAAAGACTAGCAAGAA 
TAAAATGTATCCTAAAATATTTGGTATACAACTAAATTAAAAACTATTTAAAATTAAATAATTAAGATCGGCACAGCCTTACAAAAAAAGTTTTC 
CTTAACAGCACACTGCTAAATTAAGGTATGTAGGAAGTACATCGGTAAAATTACAATAAATACAAAGAGATCGTTGTATATTTAACAATTATTTA 
GTATTTTACTAATGGTATTTATTTTTTGGAACATAGCATACCATACTAAGTCATTTACTTTAAAATTTCATATGTTTCCCACGTGATTCAAATGC 
TGGGTATTACCCACTCGAGTTGCTCATTCACTGCAGGTTTTTAACGGATTTTTTGACAGTCGCATAGATCTCGGTCGGTTCTCGATCTTTCGCCC 
CCTCTGAGAGTACCATGCCTCACAACCTCTCTTTACCTGCTCGACTCTGTCGCAAGAAAAGTAGAAGTCCCTAAATCCCAAACGTAAAATTCCAG 

GAGAGTACCGA 
(SEQ ID NO: 568) 

Exon: 1956.. 1001 

Start ATG: 1891 (Reverse strand: CAT) 



TCTCAACCGAGTGCAACCAACTAAGATCCCAAGTTTTATTATAAATCTCAACAATCCTCAACACAATGTCGTTCATCACACGGGAATACAAGTTT 
GAGAGCAGCAAGGTGATGGAGCCGAAATCAGATCAGCACACGATGGCCATGCGGAGTCTGAGGAAACTGGTCAAACCCCTTCTGCGACTGGTGAA 
GAAGAAGCAGCTACTCCGGAAGACTCTGGCCGAGATCCAGAACCAAAATGCAGTCAACGCCTCGCTGGAGGACATGCGCAAAGATCCGGCGGCCA 
GTTGCGATAACATGGCCAACGAGGAGCTGGAGCAGCGACTGTACACCGATCTCCGCCAGTGCCCCACCAACGTGGCCATGATCGTGCAGCAGGGT 
CAGCAACAGAGCATCGTGCCCGTCCATCCGGAGCAGACCTTCATCCCGGTCCACTTCGCCCGCACCACCAGCGGCACCTTCTTCTGGACCTCAGC 
CGAGGGAGCTCGCCAGCACCACGAACAGCAGTTGCGCCAGCAGTTCGACCGTTGGGTTCAGGCCTAAACATCGCCAGGATGCACAGTTCCACTTC 
TCGATGCACCGGGAAAAACTCCAATTGGCAGCTTTAAGTGCAAGTGTCTCCGTCCATTGGTCGTCGGATGGATGCGGGTTCAGCTTCGCATGTTC 
GTAATCTCAGGGTGATATTAGGCATAGATTGGATTCTTTGGGCTCGCGTCCCAAGGGAAACACAACTCTAATTGATATTCAATTATCACAATCAA 
GCGTTACTATCGTTAACTCATTGTGATTTATCGTGTCTTATTAGCCTTAAGTTAGTCTTAGCCGAATCATTGTCTTCCATGTTTGTTTGTTAGGG 
T C T T C GT AT AAG AAT T AGGC T AAG AAT C AT T G T C T TC C AT GT AAAACT T T AGC T AT AAAC AC T AAT AT C AT AT AAG AAAT AAAT AAAT CG AAAAA 
AGGAAA 

(SEQ ID NO: 569) 

Start ATG: 66 (Reverse strand: CAT) 
MSFITREYKFESSKVMEPKSDQHTMAMRSLRKL^ 

TNVAHI VQQGQQQS I VP VHPEQT FI PVHFARTT SGT FFWTSAEGARQHHEQQLRQQFDRWVQA* 
(SEQ ID NO: 570) 

Gene Symbol: Tom 
FlyBase ID: FBgn0026320 



Celera Sequence No. : 142000013384113 

CCCCCCCCCGGCACAATTGTTTATAGAAATCAAATGATTAATATATTATTATAGAAGAAAAACCGTTTAATAGCACAAGCGTAAACACCAATTGG 
AAGCAAATCAATAAATTTGAAGGAAAGCTAAAAAAATCCATAAATAAAATATCATTGAAATGGACAACTAATGTGTTTTTATCAGACTGCTTACT 
AAACTAAATATCATCTTTTTCAAAACCCCCCCAAAAAAATAAGGTGTTTTAGTTTCGGTGTGTTCTGTGAAATGGTATTTTTTATTTATTATACT 
CTCGGAAATTATATCAATTTAAAAAAATTTACTTTTGAATCAGCGAACATATTTATAGGAAAAGGTTTGAAGTGTTTGTATGTGATAAATACAAA 
TTAAACTACAATTATTGTATATGTGTTTGCAATCTTAGACAGCTATCGTATTTTGACTCTTGTTTGCTAAAAAAAAGCATAGTTTTAAAGTATTT 

gcaagacgtttgcagtcgagtagaatttttttcagatacaaaacatgaatttcaataaaaaaaatgaattaaacaacattgaatgatggtgtaga 
tacaattaaaaacgaaataaaattaaaatgctccacaacggatttttacaacaaatatctatccgttgcgttacagaagttttgtgaaaattgct 
gttttctaatttcaaatttgtatttgtgtgttaaactaattctttccgatcatcggaacttcgtttatgtttaaaagtcaaaatgtatcacaatt 

TTGATCATTTTGTTATAATAAATACGGCTGCACCTGTCACAATGCTAATTATTTAGTACACATTTATGCTGAATGTCTCGACTATATTTAATGGG 
TCTTGCACAAATTTGATTCAATGAAACCCAAAAGGTAGTATAGTAAAATTTCATTAAACCGATATTTACAAATACCAGAAAATAAGCAGATACTT 
AAGGATGCACATTTAAATGAGTATGCATGGATCGCGCATTAAATCGGTGCTTTTAAATTTTAAAACGTATTTATGATATTCCTTTTACAATCGTG 
ATTACAAGATATAGGAGAGCCCAGTTGTTAATTCGATGACAAAAAAAAAAAAAGAATAATTTGTGGGGTGAATTATTCTTTATAAATCTCTACAG 
AAAATGCCGGAAAAGTACTCCTTATCGAATATAAAATGATATAAGTATGTACAATGAGCAAAACCTACCATTCGATATTATATTTATAAAACATG 
ATTTCCTCTCTCTACCTTTCTTGTACAGTGCACACAATTTTGGATAAAAGAACGTTTCTAACAAACTATTATCAAAATTTATTGACAGTCGAGTA 
CTTTCTCAATTTCAGTATGAGATTTTTATACCCTGTATCAATATTTTTGTGTAGTATGCGTAAATGTAAAAACGATCGTACATATGTAAGTTTTG 
GATACATATTCTATACAAACATATGCGTAATTGAGTAAAGCAAAGTGAACTCGGCTATATAATTTCACAATTATTTGAAACGCTTCAATCAAGTT 
AAATGAAAATCCGTCAAAGCAACCTAATCAATTTGATGCCTGGCTTCAGTTATTAATGCTAAGAGTGCATCCAGTTTAAGATCTTTAAACCGTTC 
AAAGTGGTCCATAAGTGGCTGAATTGCATCCGACTCAATGGCTGCCGAGATGGTCTGGTTTTGAATAGTGCGATTGAGGCAGGATGCAACCTGTG 
CCAGAAATTCTCGCCACTTTTGCCTGTTCTCCTCGCTAAGAGTTGCGTCAGTTCGCGAGACGTAGATTTCCTCGGCGCGAAGTATCAGGGCAACC 
AGCGAAAGGATTCCAAACTGCCAATTTAATAATTAGTACGCGTATCTATTACTTAATACTGAGGATAACTTACGCGACTGCAAAAAATTCCAGCA 
AGTTTCTTGTCATTCAGCGTAATAGCAGATGAAATGCTCACTAGGAGTTCGAAAGTAGCTTTGTTAACGTGTTTTTTAAACTCTGGGTAAAGAGC 
GTACAAGACGCCTTCAACGTCGTCCTTGTCCTTTTTCACCACTGTTTGTAGGGAACAGAAGACGCCGCTCCACACGACCCAGCGAATGCTTTGAT 
CCGCAATAAAGGGCATTATGCGTCTAATAAGTATCTGCAAAGGAAAGATTGATTTAAATAAAATAGTGAGTTTCATGCGTATAGCACTAGAACAT 
AAATCGAATTCTATTTGGTTGCACTTGTTAAAATTAGTATAAGAAAAAGTACACACCTTTCCCTTGCGCACATTCATCATGGCGATAACCTTGTC 
GAAGGCCAGACCTGCCTTTAATTTATTGACCAACGCAGCTTTCGTCTCTACCTCATAGTTGAACTTGTTCTTTAGTTGCGGATTCATGGTTTGCG 
GATTGGCGGCTTCAGCAGCACGTTCTTCGGGCGTTTTATTGGCCATTTCCAGCTGCTCAAGGGCTGCGATCCGTTCGGCTTCCTTTTTCTTTTTC 
AGCTCAATGGTGGCCATCACTTCTGGGCTTTCCAAATCCTCCAACTTCAGCAGATAGCGATACAGTGTTTCAATCAGAAGCAAAATGTGTCGCGA 
CTTCCGCATGCTACTGGGATTTACATCATGGCTCGCGGAGGTGGGCAGCGGAGTCGAGCTTTTTGATGAAGAAGTTGCGTTGGCTTCAGCAGCGG 
CATTTGATTCACCACTCATAATGTCAGCATCGATAATTTTTCGCGGAGCTGTCACACTTCCGTACTGTAGCTTTCCCAGTGAGTTCTCGAACTTC 
AGTGGTGAGAATATATAGCCTGTCAGGTTGTTGTTGTTACCACCTGATCCATTGTTCGTGTTGCCTGACGTGCTGTTGGTGTTCTCCGAGTTGCG 
TCGCTCACGTCCGTGTCCATTGCGAGTACCATTCTTGTTGCCCAACTGTACGAGAATAAGCTGCGCATGGCCGCGTGGCTGAGTAAGCGGATGAT 
TCAACTGGTTGTCCTTGTGCGCCTGACTGTTGCGCATTTGCCCGTTCTGTTGGGCCTTGCGCTCCCTGAACACCGTGTAGTAATAGTCGTCGATA 
TACGGCGTGTCCGTATTCAGCTGCGAAAGCTGGATTCCGATTAGCCAGTGCTTGTCGCGCGTGCTCATTAGGTTGGCATATTCATCGTACTCTTC 
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CAGCGGGTAATTTCCTGTACCATCTGGACGTCGTCCTTGTTGCTGCTGTTGCGCCTGCGATCTCTGACGATCGCTGGCCACGGATCCTGCTCTGC 
TCTGTTGCATTTGGCGTACAACAGCAGCCTGTTGAAGCAGCGGATGGTTTTGCTGGATCTCTTGCACTAGTCGCATGTTGAACATGTTATACATG 
GAATTCGGTGGCTGCTGACCTAATAGACTTGTGGCAGCTTGCAGCGGGTGCGGCGGCATTCGCGGTGGCAGCATCTGGTGCGGATGGGGAGGTTG 
CATAAAAGCCCCTGGATGGATTCCAGCGGCGCGCATCGCATTGAAGCTTGGATGCATCGCGAAATTGTTCAGGGCCGTGGGCAGAGCTTGTGGAT 
TGGGTAGAGGTCGCTGGTTGCCGCCGGGCAGTGGAAAGTTGTGCGGCATGTGCTGCTGCATTAGAGGATGCTGGGGCAATTGCGGAATATTCCCG 
GGCAGACCTTGCGGGTAAATAAATCCGGGCGGCACTCGGTTGCCACCAATACCGTTTAGTTGTTGCTGCGACGATGGCGTGGGCAGTAGCTCGGG 
TCCGGGGAAACCCAAATTTGGCCTCGGTGGAGAGGTGTGCGGCGTGCCCAAGAAGCCAGGGGGTACTGCAATGAATTGTTAAATGGTGTTATTTT 
CATATGTCGCAGAAATCTATCAGGGACTTGAAGTTTGTTCCTTAAATGGTTAAATCTGTCTTAAAAATTGTCCCCAAAAATAGATTCGTAAGAAA 
TAAATATTTTTAGGAATTATACTTTTTGTTGGACATGTTTTCTTTTTAAATACTGAGGCCTGTTTTTGTATGCATCAATACTTTTTAGTTTTAAT 
ATATAATTACCCTTGTGCTGCGCTTGTTGTGGCGGCTGCTGCTGCGACTGTTGTTTCTGATGATGCTGCTGCTGCAACATCTGTTGCTGCAACAT 
GGTTGGTGTAGGCACCTGCTGCCTGTTACCCAGACTAAAATCATCAAACATTTTGCGTTCCGCCTGTTGCTGGCTCTGCTGCTGTTGTTTCGGAA 
TGGCCTGCTGAATGATCATATTGCGCTCTATATCCTCGAGAGTTGTAATTTTTGGAAAAGGCTGTGGATTCAGTGCAAAGTTGGCATTCGCTTCC 
TGCGGTGGCTTCATTTGTTGTGGTTGGTGCTGCTGATGTTGCTGCTGCTGTGGCTTGAAGGCGCTGCGCATAGGCTCGCGGAACAAATGTTGGTT 
GTTGGGAAACGGCTGTGATCCCCAGACCCCGGAGTCCAGCTTAATGCGGTTGCTCAGCCCACCGGCTTCGCTATCAGTGGTGAAAGAGGACAGGT 
CCATGTCGTCCAACTTCATGGAGGAAATGTTTAGCTCCAAGTCCGAGTCCCCTATGTGACGAGACTGTCGGAAGGGGGCATTCAAAGGGGCCGGA 
GTGGAGCCGCTTGAAGAGGGCGCCGATGATAGCGGATTAGCGTGTCGCAGGAAGGCACCATTTCCCTGATCGGCAAAATCCGTATTTCCACTAGA 
TTCCGCGGGTCGCTTCCGCACCCTTTCGCCATTCCCGCCCAGGCGTACCATCGTCTCATGGGCTTCCTCCCAGTCGCCATTTATAGCGGATCCAA 
AGGTTTCGTCATTGAGGGCATCGTATTCCGCCTCGACGATGCGTCCATCGCCACCATCTTCGTCCTGGAGAAACAATTGAAGCAAGTAAATTAGA 
CATTCCTACTTTTTCTGCCCCCGGTGGCCTTCGATGCCCCCTTATTTGCCGAGTGAAAAGTGTGCGTTGGGCATTTGCCGACCCCTGTGCGGCCC 
GCTATCCCGGATATTGCAGCCGCAATGTCGGGACCGCTGCTTACCGGTAGATTCGTGTCAAAGCCGAAAAACGAGTCATCCATGGCGATAAATCT 
ATTGCTTTACAAAAAATTATCATGACTTGTCAGTGACAAGTTGTCACCTCAGCAGTATAGCTGTCAAAATATCGATGGTACTATCGATAGTCTGA 
CGTCGATGGTTTTCTGACTATCGCCACAATACATCAATTTCGCTCGCATATAAATATTATCATATTACAAATGTTGGAATTAACATGCAAATTTA 
AAATATATATTTTTTTCATACGTTTTTTTTCAATAGGTGAACGACTGAAGATTTGTCTAAATTTATAAGTTTGCATTTTGCTACAGTGATGTTAG 
AAAAAAATGATGATTTAACCACTGCCAAATTTCGTATGTTGCATATTTATTTATTCAGTTTGGGACTTACCACGACTGGTTAATAAGTCAACAAC 

AT T AG AT TT T AC T GAT C AAT G AAAAAT C T G C C CT AAT T AAGGT GC ATT T GCACTCCTT C AAT GT AT T CG AT AT T TT G AACT AGT T CGAAAC T T G A 
AATCAGCGCCCGCAAGCTCATCCCTAAATTACTAGTTCTTGAACTATTGCTTTGGACGGAACTACGAGCCTGTTTCCTTTTCCAAAACAACTTAG 
TCGAAATGAAGCGGAAAACGAGCGAGATCTGGTGCTTTTTCCGGGCGGTGGACGACACGTTTGCGGTGTGTAATATCTGCAAGGCGAAGCTCTCT 
TACAAGACGACCACCACGAATCTAAGCAAGCACATGAACAGGATGCACCCCACATCGGGACTCAACCGGAACTCCAAGTACAAATTCCAGCCGCA 
GACCACCATCGCCATCGACCGGAACAAAGCCAAGCCGCGCAACTCAACGCAAGAGCTGATATTGTTAGAATTCGTGAAGAAACATCCGCGTCTGC 
TTCAGAATCCGACGTGGGAAACACGGGGCAATCTGGAGTCGCTGTGGGAGGAACTGTCATCCGAACTTAATAGCCACGGTCCGCCGCGAAAAGAT 
ATAGCCACCTGGAGAAGGGTATTCTAACTTACTTTTTTTCTAACCGCTTGCATTATAAGTTGATTTGTTTGTAGG 
(SEQ ID NO: 571) 

Exon: 5060.. 4985 
Exon: 4814.. 4001 
Exon: 3770.. 2242 
Exon: 2124.. 1879 
Exon: 1822.. 1001 

Start ATG: 5023 {Reverse strand: CAT) 
Transcript No. : CT16663 

TCATGATAATTTTTTGTAAAGCAATAGATTTATCGCCATGGATGACTCGTTTTTCGGCTTTGACACGAATCTACCGGACGAAGATGGTGGCGATG 
GACGCATCGTCGAGGCGGAATACGATGCCCTCAATGACGAAACCTTTGGATCCGCTATAAATGGCGACTGGGAGGAAGCCCATGAGACGATGGTA 
CGCCTGGGCGGGAATGGCGAAAGGGTGCGGAAGCGACCCGCGGAATCTAGTGGAAATACGGATTTTGCCGATCAGGGAAATGGTGCCTTCCTGCG 
ACACGCTAATCCGCTATCATCGGCGCCCTCTTCAAGCGGCTCCACTCCGGCCCCTTTGAATGCCCCCTTCCGACAGTCTCGTCACATAGGGGACT 
CGGACTTGGAGCTAAACATTTCCTCCATGAAGTTGGACGACATGGACCTGTCCTCTTTCACCACTGATAGCGAAGCCGGTGGGCTGAGCAACCGC 
ATTAAGCTGGACTCCGGGGTCTGGGGATCACAGCCGTTTCCCAACAACCAACATTTGTTCCGCGAGCCTATGCGCAGCGCCTTCAAGCCACAGCA 
GCAGCAACATCAGCAGCACCAACCACAACAAATGAAGCCACCGCAGGAAGCGAATGCCAACTTTGCACTGAATCCACAGCCTTTTCCAAAAATTA 
CAACTCTCGAGGATATAGAGCGCAATATGATCATTCAGCAGGCCATTCCGAAACAACAGCAGCAGAGCCAGCAACAGGCGGAACGCAAAATGTTT 
GATGATTTTAGTCTGGGTAACAGGCAGCAGGTGCCTACACCAACCATGTTGCAGCAACAGATGTTGCAGCAGCAGCATCATCAGAAACAACAGTC 
GCAGCAGCAGCCGCCACAACAAGCGCAGCACAAGGTACCCCCTGGCTTCTTGGGCACGCCGCACACCTCTCCACCGAGGCCAAATTTGGGTTTCC 
CCGGACCCGAGCTACTGCCCACGCCATCGTCGCAGCAACAACTAAACGGTATTGGTGGCAACCGAGTGCCGCCCGGATTTATTTACCCGCAAGGT 
CTGCCCGGGAATATTCCGCAATTGCCCCAGCATCCTCTAATGCAGCAGCACATGCCGCACAACTTTCCACTGCCCGGCGGCAACCAGCGACCTCT 
ACCCAATCCACAAGCTCTGCCCACGGCCCTGAACAATTTCGCGATGCATCCAAGCTTCAATGCGATGCGCGCCGCTGGAATCCATCCAGGGGCTT 
TTATGCAACCTCCCCATCCGCACCAGATGCTGCCACCGCGAATGCCGCCGCACCCGCTGCAAGCTGCCACAAGTCTATTAGGTCAGCAGCCACCG 
AATTCCATGTATAACATGTTCAACATGCGACTAGTGCAAGAGATCCAGCAAAACCATCCGCTGCTTCAACAGGCTGCTGTTGTACGCCAAATGCA 
AC AGAG C AGAG C AGG AT C C GT GGC C AGC G AT C GT C AG AG AT C G C AG GC G C AAC AGC AGC AAC AAGG AC G AC GT C C AG AT G GT A C AG GAAAT T AC C 
C GC T G G AAG AGT AC G AT G AAT AT GC C AA C C T AAT GAGC AC GC G C G AC AAG C AC T GG C T AAT C GG AAT C C AG C T T T CG C AG C T G AAT AC GGAC AC G 
CCGTATATCGACGACTATTACTACACGGTGTTCAGGGAGCGCAAGGCCCAACAGAACGGGCAAATGCGCAACAGTCAGGCGCACAAGGACAACCA 
GTTGAATCATCCGCTTACTCAGCCACGCGGCCATGCGCAGCTTATTCTCGTACAGTTGGGCAACAAGAATGGTACTCGCAATGGACACGGACGTG 
AGCGACGCAACTCGGAGAACACCAACAGCACGTCAGGCAACACGAACAATGGATCAGGTGGTAACAACAACAACCTGACAGGCTATATATTCTCA 
CCACTGAAGTTCGAGAACTCACTGGGAAAGCTACAGTACGGAAGTGTGACAGCTCCGCGAAAAATTATCGATGCTGACATTATGAGTGGTGAATC 
AAATGCCGCTGCTGAAGCCAACGCAACTTCTTCATCAAAAAGCTCGACTCCGCTGCCCACCTCCGCGAGCCATGATGTAAATCCCAGTAGCATGC 
GGAAGTCGCGACACATTTTGCTTCTGATTGAAACACTGTATCGCTATCTGCTGAAGTTGGAGGATTTGGAAAGCCCAGAAGTGATGGCCACCATT 
GAGCTGAAAAAGAAAAAGGAAGCCGAACGGATCGCAGCCCTTGAGCAGCTGGAAATGGCCAATAAAACGCCCGAAGAACGTGCTGCTGAAGCCGC 
CAATCCGCAAACCATGAATCCGCAACTAAAGAACAAGTTCAACTATGAGGTAGAGACGAAAGCTGCGTTGGTCAATAAATTAAAGGCAGGTCTGG 
CCTTCGACAAGGTTATCGCCATGATGAATGTGCGCAAGGGAAAGATACTTATTAGACGCATAATGCCCTTTATTGCGGATCAAAGCATTCGCTGG 
GTCGTGTGGAGCGGCGTCTTCTGTTCCCTACAAACAGTGGTGAAAAAGGACAAGGACGACGTTGAAGGCGTCTTGTACGCTCTTTACCCAGAGTT 
TAAAAAACACGTTAACAAAGCTACTTTCGAACTCCTAGTGAGCATTTCATCTGCTATTACGCTGAATGACAAGAAACTTGCTGGAATTTTTTGCA 
GTCGCTTTGGAATCCTTTCGCTGGTTGCCCTGATACTTCGCGCCGAGGAAATCTACGTCTCGCGAACTGACGCAACTCTTAGCGAGGAGAACAGG 
CAAAAGTGGCGAGAATTTCTGGCACAGGTTGCATCCTGCCTCAATCGCACTATTCAAAACCAGACCATCTCGGCAGCCATTGAGTCGGATGCAAT 
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TCAGCCACTTATGGACCACTTTGAACGGTTTAAAGATCTTAAACTGGATGCACTCTTAGCATTAATAACTGAAGCCAGGCATCAAATTGATTAGG 
TTGCTTTGACGGATTTTCATTTAACTTGATTGAAGCGTTTCAAATAATTGTGAAATTATATAGCCGAGTTCACTTTGCTTTACTCAATTACGCAT 
ATGTTTGTATAGAATATGTATCCAAAACTTACATATGTACGATCGTTTTTACATTTACGCATACTACACAAAAATATTGATACAGGGTATAAAAA 
TCTCATACTGAAATTGAGAAAGTACTCGACTGTCAATAAATTTTGATAATAGTTTGTTAGAAACGTTCTTTTATCCAAAATTGTGTGCACTGTAC 
AAGAAAGGTAGAGAGAGGAAATCATGTTTTATAAATATAATATCGAATGGTAGGTTTTGCTCATTGTACATACTTATATCATTTTATATTCGATA 
AGGAGTACTTTTCCGGCATTTTCTGTAGAGATTTATAAAGAATAATTCACCCCACAAATTATTCTTTTTTTTTTTTTGTCATCGAATTAACAACT 
GGGCTCTCCTATATCTTGTAATCACGATTGTAAAAGGAATATCATAAATACGTTTTAAAATTTAAAA 
(SEQ ID NO: 572) 

Start ATG: 38 (Reverse strand: CAT) 

MDDSFFGFDTNLPDEDGGDGRIVEAEYDALNDETFGSAINGDWEEAHETMVRLGGNGERVRKRPAESSGNTDFADQGNGAFLRHANPLSSAPSSS 
GSTPAPLNAPFRQSRHIGDSDLELNISSMKLDDMDLSSFTTDSEAGGLSNRIKLDSGVWGSQPFPNNQHLFREPMRSAFKPQQQQHQQHQPQQMK 
PPQEANANFALNPQPFPKITTLEDIERNMIIQQAIPKQQQQSQQQAERKMFDDFSLGWRQQVPTPTMLQQQMLQQQHHQKQQSQQQPPQQAQHKV 
PPGFLGTPHTSPPRPNLGFPGPELLPTPSSQQQLNGIGGNRVPPGFIYPQGLPGNIPQLPQHPLMQQHMPHNFPLPGGNQRPLPNPQALPTALNN 
FAMHPSFNAMRAAGIHPGAFMQPPHPHQMLPPRMPPHPLQAATSLLGQQPPNSMYNM 

RSQAQQQQQGRRPDGTGNYPLEEYDEYANLMSTRDKHWLIGIQLSQLNTDTPYIDDYYYTVFRERKAQQNGQMRNSQAHKDNQLNHPLTQPRGHA 
QLILVQLGNKNGTRNGHGRERRNSENTNSTSGNTNNGSGGNNNNLTGYIFSPLKFENSLGKLQYGSVTAPRKIIDADIMSGESNAAAEANATSSS 
KSSTPLPTSASHDVNPSSMRKSRHILLLIETLYRYLLKLEDLESPEVMATIELKKKKEAERIAALEQLEMANKTPEERAAEAANPQTMNPQLKNK 
FNYEVETKAALVNKLKAGLAFDKVIAI#dNVP^GKILIRRIMPFIADQSIRWVVWSGVFCSLQTVVKKDKDDVEGVLYALYPEFKKHVNKATFELL 
VSISSAITLNDKKLAGIFCSRFGILSLVALILRAEEIYVSRTDATLSEENRQKWREFLAQVASCLWRTIQNQTISAAIESDAIQPLMDHFERFKD 

LKLDALLALITEARHQID* 
{SEQ ID NO: 573) 

Name: BCDNA. LD27979 
Classification: hypothetical 
Gene Symbol: BcDNA:LD27979 
FlyBase ID: FBgn0028470 



Celera Sequence No. : 142000013384285 

CTGAAAATTTCCGACCTTATTGCACAGCTTAAGTGCAAGGTTAATCCGGGCAGAGCCAGATCCCACAAGACCAACAAATCACCGTTTTTATAGTG 

CGCGGATTTTAGTTTTTTCAAAGCAAAACGAAAGTTATCACCGATAGAACACACACACACACTTTTATACGTGTACACTACTGCGTATGCATATA 

TAATATATATATATATATATATAGATATATAGATATTGAAGTATTTTGTTAGCCCTGGAAATCGAAACCCAAACGTTGCCTAGAATGGATCAACC 

ATCCATGATACAATTGTATAACTAACTCAAGTCTCACAGCGCGTTATTTTATACCGATTAAATTGTAAACCGTAGACCTTAGGCGTACGCTCATG 

TTATATCAGTCAGAGTTCTATATCAACAATTGTATAAAGATGATTGAATTGATGACAGTGACTTTTTTTAAATCACGCATATTTTTATAAAAATA 

AGAGTAATAACAATGCATGCAACACATGTATTCACATGAATATATGGGGAGATTGCATAGCATGGGCTTGTACAGTGTACATAAAATAAGCCGTC 

GCCGATTATATAGGACCCAAGAATATCATATTTTATATAGAAGCAAGCGCAAGCAAATAACCCAGGCAGCATGCCCTCAACTTTCAGTCGTAAAC 

GGAGCGATTGGAAGTCGCTCCTAGTCTGTAAGATCCACGATTAACTAGCCGCTCTCTCCTTCGGTTAACTTCGTTCCGGTGTCACACATCTCTGC 

AAGGAGGGCATTATTGGTGTAAGGCAAACTTTTGAAGAAGGCAACCTTAATGATAAAATATGTATTTAGATATCCCTAAGGTACCAGCTAATGTC 

AAGATACTTTAATAAAACCGAATTCAAAATTGCAAGTATTGAAATATCGCCGACTAACTTTTATATGTGTTGGTTTTCTTTGGGTTTATATGTGC 

TCTACAGTCAAAACTCATTTTCATTTCTGTTTTTTTTTTTTTTAGATTGGTTTATATTCTGGATATTAATACACTTTTAAGACATATAGACTAAA 

TTCTTACTGCTATGTTTACACTGCGTTTGGTGGCTATATATTTATGCTAGGTTTCGAATAAACATTTCAAACACGCACACCAAATTTCCAACTCT 

TATGAATATTTTGAAATCCATGCGCTGTTACATTTTGATGTGGGTGCGAGTGTATAAATGTTAAAAAATAGTTGCCATAACCGATAGAAATATAA 

AATGGTAAGGAGATAACACAAAAAAGTAGAAAAAGAATCATGTACAATACAAATTAGGGGCGAGTGCTTATATACAGAGTGTTTGTGTCTAGTTT 

TTAACTAAAACTAACGTATTTTTCCGTTTGCATTTGCATAGTTTTGAATATGTTACGCTTAGAACTAGATTTAAAGTTGAGGCGCTGCTTCTCTT 

TGAATACAAAGAAATCCTGCTTTCTAAAGGATTTTCAACTTTCCGCTTATCTCTTTAGTTTGTCTAAAAAGGCCTCAATAACAACTTAATATAAT 

AAAATACTAACATTTACAGCGTTTCGTTTTGAAAAAGAGAGATAATAAATAAACTAAGAGTTTTTCAAAAATATCAAAACGGGATTGGGCATAAG 

GGGTTGGTGGTAAAGCAACGAGGGTCAGATCTCTAAAGCCTCGAATTGAGTAAACTAGTGGATCCGGTTTCATTTCTTGAAAACTTCGTGATACC 

GGGAATTGAAGTGTTCTCATATTCTAAATAAATAACAAGTAAACGGTTTACCGTTTCCGATTCCAGTACCTTTCCGGGGAATGAAATTGGTAACA 

TGGCCCTCGGCTACGATTTCGCAGCTTCTCTGTTTACAAGTTGGCTTGAATTGCATTTAAAATAAATACTACACTACTAGCAGGGGCTAAGATTA 

CGTTTACACTGCTGCTAACTGTCCTGGAGGCGGACTCTCCTCAGATCTTTCTATTTGGGTCTGCTGCTTATTTTCCTCACTTGGTCGCCGCTTCT 

TTAGTGGGGGAATTTCCGTTTCAATAGCGGGCGTTGGCTGCTGTTCTATAATGCAGGCGGTTCTTTTCTGGCCGGGACTGCTGTTGGGCAGCGGT 

TCGTTGGATCCATTCACTATATCGGTGACTATGTGAGCCTCCAGTGAAGGCGGTAGCAATGGTGACAATGGAACGGCGTTCAGATCACATGCATC 

CTCATCATCATCCTTATAATTGTGCCCCACATTGCCAAAGAGATCACTGTGTCTATTGTGTGATCTTTCCTCTTCTTCGTCATCATCATCGCTGT 

AATCGTTGTCCCAGAATACCTGGGCTCCTGGAAACACATACGAATAGTTGTTGGTGTGCGAATAGAATGTTCCAGGCTGCAGACGCTCGGCTGCT 

GTCTGTCGCTTTCCCCTCTGTTGTGGAATGGGCGTCAGGCTGGGCGCCACTGTCTTGGTCTCCTGAACAAGATTGCTGGGTCGCACGTAGCTAGG 

CGTGGCTTCGTTGTCGCACTGCTCGATGCCGCTATCCTTGGAGGAATCAATGGAAAGATGGCGATAGCTCTCCACCGTTGTCTGAGGGCCTTCCA 

GGCCCAGACGATCGCAACTTAGGGCTGTGGTCTCGTTCAGCTCAACCAGTATGTCTGTTTTAATACGTTCGATGGCTGCCGCTTCGGCGGCAGTG 

GAACGCTTTCCACAGCTAAAAGTAGATGACTCAAATCCACTATCCGAGCAGGTGCCAGCTGATCCTAGTATGTAGTCCGCCGAACTATTTGATTT 

AATAGACTGCGTATCCGTGTCCAGTTGCGACTGTCGCTCGCTCTCTGAACTGCAGTGGCGGTGGTGAAGTAGATGATGATGGAGGTGGTGATTAG 

AGTGCTCCGGAGGCATTAGCTCCTTGCTTTCGGTAAGCACTGACTCATCGCAGCACAGCTGCCGCCAGCAATCATCGTTGTCCGACAACCGGTGG 

CAAATCTGGTTGATGATCACATCGGAGTCGCCCAGCAGCTCCACATCGAACTTAAGGTGATGCAGCTGCTCGCGATTGATAAGAATCTGCGGCAC 

CGTGGCCGGTATGCTGCTGGGAATGTGGGCCACAGGTCGGACCTTCAGCGAGGAGCCGATCACGATCAGTAGATCGCACACGTCCTTGTCGGTGG 

CCATGACCGTGTGGTACTCATCCGGCAGTCCTGTTGGCAATATGAAAAGCAAACACAATATGATATATGGTAAATAGCGTTACTTTGTATTTTCA 

ATAGTTTATCTCTTAAATAAACACTTCTTTTGATCGTCGAATTTGAAATTCTTGCATTTATTTTACCAAGGCTATTGATAATATATAATAAATCT 

GTATAAAAATGGCACCTACCCTCGCCGAAAAAGACGATATCCGGCTTCATGATGCCGTTCTCGACCAGTTGGCGCAGCTCCTCCTCAGTAACGGC 

CACCGAGGCATCCACGCTCTGCTCCTTATTGGGCTGGCACTGCGGGCACACCGGAATTCGCTGGGCAAATATGTCCGCCCGCAGGGCGTCAGCGT 

TGCACTTGAAACGACACTTGGTGCACGAGGCCGTTGAAAAGGAGCCGTGACACTCGATTACTCGCTGAATGCCTGCCACCCGCTCGAGGGTGTCG 

ATGTTCTGTGTGTAGTTGCGCAACAGTTTGCCCTTGGTCTCCAGCATTTTGATGAAACGATGGCAGGGTGAGGGCTGAAACTCGCCGGGATATAT 

CTCGCGGGCAAACTTGTAGAACGGTCGTGGATCCCTCTTGAAGTAGTTGATATCAAACATGGCCTGCGGATCGGGCAGATCGGGAAAATCATGGG 
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CCAATCGCGCATATATGCCATTGGTGGACCGGAAGTCCGGAATGCCGCAGGAGACGGATACTCCGGCTCCCGTTAGCACAATGATCTTCTGTGAT 
TTCTTGACCAAACTGATGACATCGTCGAAGGTGTTCACTGAGGCCAGCTTGTTGCGCCGCTTCGGCTCGTTCAACAGGTGGGCCAAATAGTCCCA 
CAGCACGGAGTCGTCGGTGTCGCCCGCCAGGCCGGTGGCGAAATGCGGCATAATGCTGGCAATAACCTGGCGCGGCACACGACCTGTGTAAAATT 
CTCGTTGCAACCAGCGCAGCTTCCAGTCGGGTTCCACGGATGAGGAGCAATTTGAGCTGGAGTCCTCATCCTCGTTGCTCGTTCCGGTGATTCCC 
TCCTCGTCGTCCTCCTCCTCCTCCTCATCGTCCTCTTCCTCCTCCTCCTCGTCATCTTCATCCTCGCCCATTGACTTTGTGGGATTTTTGGTTTT 
ATGCTCCAAATTGGCGCCAATCTCTTGTTCTTCCCTGGCAGCCAATGTTTTTGTTTTGATTTCACCATTTGCTTTGCCAGCAAGTTCGCTTGTTG 
CTGGTTCTGTGGTTGTTGCTGTTGCTTCTGCCTCTGCCTCTGCCTCTGTTGACGTTGAGGCCAGAATTTCCGCGCCAAAATCAAACTTAGTTGGC 
GGATAGAATTGCGTAGTGTCTGGCACCTGGTTGCCCAGATCTTTAGACCTAATGTGGCCCAGGCGAATTTCCTCGTAATTTTCCATCATTTTTTC 
CGAACCCAAGCGCGCGCACACGCGCACACACACAGACTCACGGAGAGAAAACTCTCGTAGTTGGTGCTGTTGCTGGCTCTTCTTCTTCTTGCGCG 
AGTGTTTCCTCGGTCTCTCCGCTTTGCGCTCTCGCCTCGTCCGTTTTCTCCGCTCTTTCGTTGGTACACACAAAAATTTTTACGTCGTCAGCTTC 
GCACCCATTTGGCTTGATTGAAAAAATTCTCCAGCGATCGTTTGGCCGCGATTTTTTCAACCAACCCGACTTTATTTCACGTATATGCACGAAAC 
TTAAATTTTAAAAAAATTTCGACACCATTTGGTTGCGATTTTTTCGTAATTTTCGAGCGTAGTTTCTGGAAAATTTTCCGATGAAAAAAGCAAGA 
TGGCACACGTAGCTGCGCCTGTGTGTGTATGTGGGCTGGCAGACGTGATTCGGGTGGGACCCTGACACGTCGTGTGGAAAGTGCTAACGCGTTTC 
TGCATGGAAAGTGCCGCCAGCATGACTCATCACGGTATGTTTTTAGTATTTTTTGGGATTTTTGAGGGTGATTACGTGTTGCTACAGTGGCCACT 
GCCTTATGTCATATCTAATTTAATCCATTTCTTTAAAAACTGTTTAAAATATATCAATAATTTGTCAAAAAGTTAGTAAAATTAATTTTAAATTA 
TGCTTCGAACTTTAATTTTAGCACTCTTTTACTTATCAAATTTACTCGATATTTAAACAATTAAGTAAAAATAATTTTTTTGTCCAATATTTACA 
CAT T T CAT AG TAG T AT T AT AAAT T G AT GT AT T GAT AAT AT GC AGT T C AT AT GT C C AAAAACT T C C AT C AAG CT G TT AAAT AC AAAAAT GT GT AAA 
CAGATTTCCAGAACGTTCCAGCAAGTGTCTACTGTTCCCCAAAAAAGCTGCGATTTTAAGTATTTATTTGGTACTTTCCCCAAAATTATAGGTTT 
TTGGTAGAACTTGGGCATACTTGTTAGCCAATATGACTTGAAAATATTGTCATAAATACATTGTTAAATAATTTAATAAGTCGTACACTTACAAT 
GTACAAAATATTTTAGTTTGTAGGGGGACTGGGGATGCAATAGGAAATGGACAACCTAAATTTATACGACGTTCTTAAAGTGGCTCCGGATGCAA 
CTGATGAGGAAATAAAAAAGGTATGTAACAGTTCCAATTCGATATGTTGGTAAATAGTTTGTATTCACCTTTTCACCAATAGAACTACCGAAAAT 
TGGCAAAAGAGTTCCATCCAGACAAGAATCCCGATGCGGGCGACAAGTTTAAAGAAATATCCTTCGCCTACGAAGTGCTGTCCGATCCCGAGAAG 
CGGCGCATCTACGACCGATATGGATTGAAGGGCCTGCAAGAGGGCGCCGAGGGATTCTCTGATGCCTCCGAATTCTTCGCCCAGTGGTTCCCCTT 

C GAT C GAG 

(SEQ ID NO: 574) 

Exon: 4993.. 3345 
Exon: 3165.. 1001 

Start ATG: 4554 (Reverse strand: CAT) 
Transcript No. : CT16687 

CCGAATCACGTCTGCCAGCCCACATACACACACAGGCGCAGCTACGTGTGCCATCTTGCTTTTTTCATCGGAAAATTTTCCAGAAACTACGCTCG 
AAAATTACGAAAAAATCGCAACCAAATGGTGTCGAAATTTTTTTAAAATTTAAGTTTCGTGCATATACGTGAAATAAAGTCGGGTTGGTTGAAAA 
AATCGCGGCCAAACGATCGCTGGAGAATT TTTTCAATCAAGCCAAATGGGTGCGAAGCTGACGACGT AAAAAT TTTTGTGTGTACCAACGAAAGA 
GCGGAGAAAACGGACGAGGCGAGAGCGCAAAGCGGAGAGACCGAGGAAACACTCGCGCAAGAAGAAGAAGAGCCAGCAACAGCACCAACTACGAG 
AGTTTTCTCTCCGTGAGTCTGTGTGTGTGCGCGTGTGCGCGCGCTTGGGTTCGGAAAAAATGATGGAAAATTACGAGGAAATTCGCCTGGGCCAC 
ATTAGGTCTAAAGATCTGGGCAACCAGGTGCCAGACACTACGCAATTCTATCCGCCAACTAAGTTTGATTTTGGCGCGGAAATTCTGGCCTCAAC 
GTCAACAGAGGCAGAGGCAGAGGCAGAAGCAACAGCAACAACCACAGAACCAGCAACAAGCGAACTTGCTGGCAAAGCAAATGGTGAAATCAAAA 
CAAAAACATTGGCTGCCAGGGAAGAACAAGAGATTGGCGCCAATTTGGAGCATAAAACCAAAAATCCCACAAAGTCAATGGGCGAGGATGAAGAT 
GACGAGGAGGAGGAGGAAGAGGACGATGAGGAGGAGGAGGAGGACGACGAGGAGGGAATCACCGGAACGAGCAACGAGGATGAGGACTCCAGCTC 
AAATTGCTCCTCATCCGTGGAACCCGACTGGAAGCTGCGCTGGTTGCAACGAGAATTTTACACAGGTCGTGTGCCGCGCCAGGTTATTGCCAGCA 
TTATGCCGCATTTCGCCACCGGCCTGGCGGGCGACACCGACGACTCCGTGCTGTGGGACTATTTGGCCCACCTGTTGAACGAGCCGAAGCGGCGC 
AACAAGCTGGCCTCAGTGAACACCTTCGACGATGTCATCAGTTTGGTCAAGAAATCACAGAAGATCATTGTGCTAACGGGAGCCGGAGTATCCGT 
CTCCTGCGGCATTCCGGACTTCCGGTCCACCAATGGCATATATGCGCGATTGGCCCATGATTTTCCCGATCTGCCCGATCCGCAGGCCATGTTTG 
ATATCAACTACTTCAAGAGGGATCCACGACCGTTCTACAAGTTTGCCCGCGAGATATATCCCGGCGAGTTTCAGCCCTCACCCTGCCATCGTTTC 
ATCAAAATGCTGGAGACCAAGGGCAAACTGTTGCGCAACTACACACAGAACATCGACACCCTCGAGCGGGTGGCAGGCATTCAGCGAGTAATCGA 
GTGTCACGGCTCCTTTTCAACGGCCTCGTGCACCAAGTGTCGTTTCAAGTGCAACGCTGACGCCCTGCGGGCGGACATATTTGCCCAGCGAATTC 
CGGTGTGCCCGCAGTGCCAGCCCAATAAGGAGCAGAGCGTGGATGCCTCGGTGGCCGTTACTGAGGAGGAGCTGCGCCAACTGGTCGAGAACGGC 
ATCATGAAGCCGGATATCGTCTTTTTCGGCGAGGGACTGCCGGATGAGTACCACACGGTCATGGCCACCGACAAGGACGTGTGCGATCTACTGAT 
CGTGATCGGCTCCTCGCTGAAGGTCCGACCTGTGGCCCACATTCCCAGCAGCATACCGGCCACGGTGCCGCAGATTCTTATCAATCGCGAGCAGC 
TGCATCACCTTAAGTTCGATGTGGAGCTGCTGGGCGACTCCGATGTGATCATCAACCAGATTTGCCACCGGTTGTCGGACAACGATGATTGCTGG 
CGGCAGCTGTGCTGCGATGAGTCAGTGCTTACCGAAAGCAAGGAGCTAATGCCTCCGGAGCACTCTAATCACCACCTCCATCATCATCTACTTCA 
CCACCGCCACTGCAGTTCAGAGAGCGAGCGACAGTCGCAACTGGACACGGATACGCAGTCTATTAAATCAAATAGTTCGGCGGACTACATACTAG 
GATCAGCTGGCACCTGCTCGGATAGTGGATTTGAGTCATCTACTTTTAGCTGTGGAAAGCGTTCCACTGCCGCCGAAGCGGCAGCCATCGAACGT 
ATTAAAACAGACATACTGGTTGAGCTGAACGAGACCACAGCCCTAAGTTGCGATCGTCTGGGCCTGGAAGGCCCTCAGACAACGGTGGAGAGCTA 
TCGCCATCTTTCCATTGATTCCTCCAAGGATAGCGGCATCGAGCAGTGCGACAACGAAGCCACGCCTAGCTACGTGCGACCCAGCAATCTTGTTC 
AGGAGACCAAGACAGTGGCGCCCAGCCTGACGCCCATTCCACAACAGAGGGGAAAGCGACAGACAGCAGCCGAGCGTCTGCAGCCTGGAACATTC 
TATTCGCACACCAACAACTATTCGTATGTGTTTCCAGGAGCCCAGGTATTCTGGGACAACGATTACAGCGATGATGATGACGAAGAAGAGGAAAG 
ATCACACAATAGACACAGTGATCTCTTTGGCAATGTGGGGCACAATTATAAGGATGATGATGAGGATGCATGTGATCTGAACGCCGTTCCATTGT 
CACCATTGCTACCGCCTTCACTGGAGGCTCACATAGTCACCGATATAGTGAATGGATCCAACGAACCGCTGCCCAACAGCAGTCCCGGCCAGAAA 
AGAACCGCCTGCATTATAGAACAGCAGCCAACGCCCGCTATTGAAACGGAAATTCCCCCACTAAAGAAGCGGCGACCAAGTGAGGAAAATAAGCA 
GCAGACCCAAATAGAAAGATCTGAGGAGAGTCCGCCTCCAGGACAGTTAGCAGCAGTGTAAACGTAATCTTAGCCCCTGCTAGTAGTGTAGTATT 
TATTTTAAATGCAATTCAAGCCAACTTGTAAACAGAGAAGCTGCGAAATCGTAGCCGAGGGCCATGTTACCAATTTCATTCCCCGGAAAGGTACT 
GGAATCGGAAACGGTAAACCGTTTACTTGTTATTTATTTAGAATATGAGAACACTTCAATTCCCGGTATCACGAAGTTTTCAAGAAATGAAACCG 
GATCCACTAGTTTACTCAATTCGAGGCTTTAGAGATCTGACCCTCGTTGCTTTACCACCAACCCCTTATGCCCAATCCCGTTTTGATATTTTTGA 
AAAACTCTTAGTTTATTTATTATCTCTCTTTTTCAAAACGAAACGCTGTAAATGTTAGTATTTTATTATATTAAGTTGTTATTGAGGCCTTTTTA 
GACAAACTAAAGAGATAAGCGGAAAGTTGAAAATCCTTTAGAAAGCAGGATTTCTTTGTATTCAAAGAGAAGCAGCGCCTCAACTTTAAATCTAG 
TTCTAAGCGTAACATATTCAAAACTATGCAAATGCAAACGGAAAAATACGTTAGTTTTAGTTAAAAACTAGACACAAACACTCTGTATATAAGCA 
CTCGCCCCTAATTTGTATTGTACATGATTCTTTTTCTACTTTTTTGTGTTATCTCCTTACCATTTTATATTTCTATCGGTTATGGCAACTATTTT 
TTAACATTTATACACTCGCACCCACATCAAAATGTAACAGCGCATGGATTTCAAAATATTCATAAGAGTTGGAAATTTGGTGTGCGTGTTTGAAA 
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TGTTTATTCGAAACCTAGCATAAATATATAGCCACCAAACGCAGTGTAAACATAGCAGTAAGAATTTAGTCTATATGTCTTAAAAGTGTATTAAT 
ATCCAGAATATAAA 
(SEQ ID NO: 575) 

Start ATG: 440 {Reverse strand: CAT) 

MMENYEEIRLGHIRSKDLGNQVPDTTQFYPPTKFDFGAEILASTSTEAEAEAEATATTTEPATSELAGKANGEIKTKTLAAREEQEIGANLEHKT 
KKPTKSMGEDEDDEEEEEEDDEEEEEDDEEGITGTSNEDEDSSSKCSSSVEPDWKLRWLQREFYTGRVPRQVIASIMPHFATGLAGDTDDSVLWD 
YLAHLLNEPKRRNKLASVNTFDDVISLVKKSQKIIVLTGAGVSVSCGIPDFRSTNGIYARLAHDFPDLPDPQAMFDINYFKRDPRPFYKFAREIY 
PGEFQPSPCHRFIKMLETKGKLLRNYTQNIDTLERVAGIQRVIECHGSFSTASCTKCRFKCNADALRADIFAQRIPVCPQCQPNKEQSVDASVAV 
TEEELRQLVENGIMKPDIVFFGEGLPDEYHTVMATDKDVCDLLIVIGSSLKVRPVAHIPSSIPATVPQILINREQLHHLKFDVELLGDSDVIINQ 
ICHRLSDNDDCWRQLCCDESVLTESKELMPPEHSNHHLHHHLLHHRHCSSESERQSQLDTDTQSIKSNSSADYILGSAGTCSDSGFESSTFSCGK 
RSTAAEAAAIERIKTDILVELNETTALSCDRLGLEGPQTTVESYRHLSIDSSKDSGIEQCDNEATPSYVRPSNLVQETKTVAPSLTPIPQQRGKR 
QTAAERLQPGTFYSHTNNYSYVFPGAQVFWDNDYSDDDDEEEERSHNRHSDLFGNVGHNYKDDDEDACDLNAVPLSPLLPPSLEAHIVTDIVNGS 
NEPLPNSSPGQKRTACIIEQQPTPAIETEIPPLKKRRPSEENKQQTQIERSEESPPPGQLAAV* 
(SEQ ID NO: 576) 

Name: silent information regulator 2 
Classification: known_f lybase_gene 
Gene Symbol: Sir2 
FlyBase ID: FBgn0024291 



Celera Sequence No. : 142000013384004 

AACGTATTGAACAATTATAATTAAATTATTTTTTTAATAAAGATAAGCTTAATTCATTCACGTAACTGGAAATCTTAATTGGAGATAATCTAGCA 
CTGATAATATTCTTCGCTTTATAAATAAGTTCTAATTGAAATTAATTCAAATATATATTTATATATATGATATATAATTTTTAAGGAATTACAAG 
AAGTCGTTCCACTTAGAGCAAATTTTCTTAGTAACTGTTGGCATATTTTCAGCGGAACGTTTAGAGGATCGTGGCCCTGTGTTCCTGGTTTTCTT 
AAAGTTCATAGAGATTTCGTCGGAAAGTTCATCGGTTTTCGTAGGTTTTGCTGCAGGCTTTTGAGTAGTTTCAATGGTATTATCCACATAGTTTG 
CCCATTTATTTGTTTTCTTCTGAGGTGTGGTGTTTTCCTCTTTTTCTTGGCAGTCTTTCTCCTCTTCGAGCTGTTGCTTTGCCGTGAGGAATAAG 
CCCGTGTTCAGACATTCCTCCTGCATTCCGCGCTCTAAGTTGAGATGCTGCACCTTAACCCGGCATTCCGCTGCCGATCCTCGAAAGAATTCCTT 
GAGCAAACTTTGCTTTTGGCGGCAAATTTTACACTGCCATGTGCTTGCCTTTTTCACGAAGTCCACCTTGTTTAAATTAGGTATATATATATAAG 
TAGTTATTGAGTTCTAAGATATATTTCAGTTTGTTTGCACTCACCTGGTACATTTTGCATTGAATGCATTGTACCACGCGAATTTGCTGGGACAT 
TATTCAAATTTGCGAAAAACCAAATAAAGAGACCAGGGATGGAAACCAACAGCACTGCAACAAGTGGAGTGAAAAAAATGTGGTTGCACAAATGT 
TAAAGCGTGATGACTACAGTTTAACAAGTAGTAAAACATTTACCAAAATTTGTAATTAACTAATTTTGGTACCTATGGTTAATGATTTAGCCGGG 
TGCTGATTCCGGTTCATAAGTGAACCAAATCGTTCCCCATCCCTGGTCCAAATATAAACAAACTTGCATCGTCTAATCGGCACTGTCGGTTTAAT 
AAAATGCGATTGGTAAGTCATCCTGAGAAGTAAATAACAACTTAATTAGCGGAAGTGACTATCCACTTTGCAGTATTGTCTCAGCGGGGACCTGG 
CCAAGCCATGTTATATCATTACCTTCAAGGGCCTGCGGATTATGCTGGACTGCGGTCTCACGGAGCAGACAGTCCTGAATTTCCTTCCGCTGCCC 
TTCGTTCAGTCACTGAAGTGGTCCAATCTGCCCAACTTCGTGCCCAGTCGGGATCATGATCCCCAAATGGATGGCGAGTTGAAGGACTGCTGTGG 
CCGGGTTTTCGTGGACTCGACGCCCGAGTTTAATCTACCCATGGACAAAATGCTGGATTTCAGCGAAGTGGATGTGATACTTATCTCGAATTATC 
TTAATATGCTGGCCTTGCCGTATATCACGGAAAACACGGGGTTCAAGGGCAAAGTCTATGCCACGGAGCCAACGCTGCAGATCGGCCGATTTTTC 
CTGGAGGAGCTGGTGGACTATATCGAAGTGTCGCCCAAGGCGTGCACCGCGCGCTTGTGGAAGGAGAAGCTTCATCTGCTGCCCAGTCCCTTGAG 
CGAAGCCTTTCGCGCCAAGAAGTGGCGCACTATCTTTAGTCTTAAGGATGTCCAAGGCAGTCTGTCCAAGGTGACCATCATGGGCTACGACGAGA 
AGCTGGACATCCTCGGAGCTTTTATCGCCACTCCTGTCAGCTCGGGCTACTGCCTGGGCTCGAGCAACTGGGTACTGAGCACGGCGCATGAAAAG 
ATTTGCTATGTCAGTGGTTCCTCCACGTTGACCACACATCCGCGGCCCATCAATCAGTCGGCGCTGAAGCACGCCGATGTACTCATTATGACCGG 
ACTGACGCAGGCGCCGACGGTAAATCCGGACACGAAGCTGGGCGAGCTCTGCATGAATGTGGCCTTGACGATCCGCAACAATGGGTCTGCCTTGA 
TTCCCTGTTATCCTTCGGGCGTTGTCTATGACCTCTTTGAGTGCCTCACCCAGAATTTGGAAAACGCTGGCCTAAACAATGTTCCCATGTTCTTT 
ATTTCCCCTGTGGCAGACAGCTCCTTGGCGTATTCTAATATCCTGGCCGAGTGGCTTAGTTCCGCCAAGCAGAACAAAGTGTATCTTCCAGACGA 
TCCTTTCCCACATGCCTTTTACCTTCGCAACAACAAGCTGAAGCACTATAACCATGTGTTCTCCGAGGGCTTTAGCAAGGACTTCCGGCAGGTTT 
GTCTAACATAATTGTGAAAACTTCATACTACAAATTCTCTGTCTTCCTTACAGCCCTGCGTGGTCTTTTGTGGCCATCCTAGCTTGCGTTTCGGC 
GACGCTGTTCACTTTATCGAGATGTGGGGCAATAACCCTAACAACTCCATCATATTCACGGAGCCGGACTTTCCGTATCTGCAAGTACTGGCGCC 
CTTCCAACCACTGGCCATGAAGGCTTTTTACTGTCCCATCGACACCTCTTTGAATTATCAGCAGGCAAATAAGCTGATCAAGGAGTTGAAACCAA 
ATGTGCTTGTCATCCCAGAGGCATACACAAAACCACATCCCTCGGCCCCAAATTTGTTCATTGAACAGCCGGATAAGAAGATAATAACGTTCAAG 
TGCGGCGAGATCATACGTCTGCCATTAAAACGGAAGCTGGATCGAATTTACATTACCTCGGAACTGGCCCAAAAGATATCGCCAAAGGAGGTGGC 
GGCCGGCGTGACCTTCTCCACATTGACAGGAGTTTTGCAAGTTAAGGACAAGGTCCACTGCATCCAGCCATGTGCGGATAGCGTTAAAGACGAGA 
CCATCTCGAGCAACAGCGCTCCGACAAAGGAGGATGTGCTGAAGAACGTCAAATACGAGTATGGCAGCATTGATGTAGATGCAGTGATGAAGAAG 
CTGGCACAGGATGGCTTCTCCAACATCAAGCTAGACCGCACTGGCGGCGCTCTGACTCTGAACCTTGTCAATGAGGACACAGTCATTAAGTTCGA 
GGACAACGAGACGCATATTATCTGCGGCGGAAAGCCAACAACCCGGCTCAAGCTGCGAGACACCATCATGAAATGCTTACAGAGTTTTTAAAAAC 
ATAAATGTATTCCTTACACCTAAATAACAACACATCATTACGCTTAGTTCTTTGCCTCCATAACGTCAGAGCAACGTTGACACGTTTTCTGCTCA 
TCCTCCAGACCATAGCGACGACAACGTGGGCACAGTGATATCTGAAGTTTACTTAAAGCAATATCCAAGTCGCTATTATGGGACTGTAAGAGCGT 
AACTGACCCCACTTGCAATATCTCGCATAATTCAGAATTGCTCACAGACTCTCCAAGAGGGCTGTGAAGTTCTCGCAGCAAATCCAGCTGGCTTC 
TTTCCTTTCCTTTGATGGTCACGGCCAAGTGCCAGGTGTTAACATCTCCAGCCTGTTGGTTAATAAGTCGTTTAACATCGAACGCTGCATTAATC 
ACTTCAGTGGCTTTGGTGTCCTGCCATTCAGGATTAGTCTGAACAATTTGCTCGTGGAATGCACCACCTGTGGTGTCATAGTAGCTCCAGCTTTC 
CTCCACCAGGAAGGGAACAATGGGCCATAAGCTCTTGCAGAGTTGTTGATAGCAGTGGGTAAGAGTTTGACGAATGGCCAGTAGCTCGTGATCAT 
CACCACAATAAAGACGATCCTTGATGAGATGCACATAGACGGCGGACACCTGATTTGCAATGAAGTTTTGGACACAGGCGACAACACGATTGTAC 
TCATAAGCTTGGTATAGCTTCTCCACCTTAAAAGCGATTGACATGATTAGTTGAAAAACTAAGATTAACAAAATTTCCTACCTCAGATTCAAATT 
CAACCAGCTGGCTTAACAAATAACGATTAAGATAGCTCTTATCCGAAGTTCGTATAAGCTTCTCCCCCGATTGTTTCTCTCCAATTACGCCTTTT 
AAATATCGCATAGTTCCGCGGATCTTGCATAGATTTTCTGCCGCTTGCTGCAGTAGCTTATCGCTCACTGTAATGGACATGTGCTGCGTTCCATG 
GGAGGCAACCCACCAACGCAGTGCATCCGTACCATATTTCTTGGTT 
(SEQ ID NO: 577) 

Exon: 1001.. 1057 
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Exon: 1182.. 2276 
Exon: 2334.. 3131 
Start ATG: 1182 

Transcript No. : CT16697 

AATATAAACAAACTTGCATCGTCTAATCGGCACTGTCGGTTTAATAAAATGCGATTGATGCTGGACTGCGGTCTCACGGAGCAGACAGTCCTGAA 

TTTCCTTCCGCTGCCCTTCGTTCAGTCACTGAAGTGGTCCAATCTGCCCAACTTCGTGCCCAGTCGGGATCATGATCCCCAAATGGATGGCGAGT 

TGAAGGACTGCTGTGGCCGGGTTTTCGTGGACTCGACGCCCGAGTTTAATCTACCCATGGACAAAATGCTGGATTTCAGCGAAGTGGATGTGATA 

CTTATCTCGAATTATCTTAATATGCTGGCCTTGCCGTATATCACGGAAAACACGGGGTTCAAGGGCAAAGTCTATGCCACGGAGCCAACGCTGCA 

GATCGGCCGATTTTTCCTGGAGGAGCTGGTGGACTATATCGAAGTGTCGCCCAAGGCGTGCACCGCGCGCTTGTGGAAGGAGAAGCTTCATCTGC 

TGCCCAGTCCCTTGAGCGAAGCCTTTCGCGCCAAGAAGTGGCGCACTATCTTTAGTCTTAAGGATGTCCAAGGCAGTCTGTCCAAGGTGACCATC 

ATGGGCTACGACGAGAAGCTGGACATCCTCGGAGCTTTTATCGCCACTCCTGTCAGCTCGGGCTACTGCCTGGGCTCGAGCAACTGGGTACTGAG 

CACGGCGCATGAAAAGATTTGCTATGTCAGTGGTTCCTCCACGTTGACCACACATCCGCGGCCCATCAATCAGTCGGCGCTGAAGCACGCCGATG 

TACTCATTATGACCGGACTGACGCAGGCGCCGACGGTAAATCCGGACACGAAGCTGGGCGAGCTCTGCATGAATGTGGCCTTGACGATCCGCAAC 

AATGGGTCTGCCTTGATTCCCTGTTATCCTTCGGGCGTTGTCTATGACCTCTTTGAGTGCCTCACCCAGAATTTGGAAAACGCTGGCCTAAACAA 

TGTTCCCATGTTCTTTATTTCCCCTGTGGCAGACAGCTCCTTGGCGTATTCTAATATCCTGGCCGAGTGGCTTAGTTCCGCCAAGCAGAACAAAG 

TGTATCTTCCAGACGATCCTTTCCCACATGCCTTTTACCTTCGCAACAACAAGCTGAAGCACTATAACCATGTGTTCTCCGAGGGCTTTAGCAAG 

GACTTCCGGCAGCCCTGCGTGGTCTTTTGTGGCCATCCTAGCTTGCGTTTCGGCGACGCTGTTCACTTTATCGAGATGTGGGGCAATAACCCTAA 

C AAC T C CAT CAT AT T C AC G GAG C C GG AC T T T C C GT AT C T GCAAGT AC TGGCGCCCTT CC AAC C ACT GGCC AT G AAG GC T TT T T ACT GT C C C AT CG 

ACACCTCTTTGAATTATCAGCAGGCAAATAAGCTGATCAAGGAGTTGAAACCAAATGTGCTTGTCATCCCAGAGGCATACACAAAACCACATCCC 

TCGGCCCCAAATTTGTTCATTGAACAGCCGGATAAGAAGATAATAACGTTCAAGTGCGGCGAGATCATACGTCTGCCATTAAAACGGAAGCTGGA 

TCGAATTTACATTACCTCGGAACTGGCCCAAAAGATATCGCCAAAGGAGGTGGCGGCCGGCGTGACCTTCTCCACATTGACAGGAGTTTTGCAAG 

T T AAGGAC AAGGT C C ACT GC AT C C AGC C ATGT G C GG AT AGC GT T AAAG AC G AG AC C AT CT C GAGC AAC AGC GC T C C G AC AAAGG AGG AT GT GCTG 

AAGAACGTCAAATACGAGTATGGCAGCATTGATGTAGATGCAGTGATGAAGAAGCTGGCACAGGATGGCTTCTCCAACATCAAGCTAGACCGCAC 

TGGCGGCGCTCTGACTCTGAACCTTGTCAATGAGGACACAGTCATTAAGTTCGAGGACAACGAGACGCATATTATCTGCGGCGGAAAGCCAACAA 

CCCGGCTCAAGCTGCGAGACACCATCATGAAATGCTTACAGAGTTTTTAA 

(SEQ ID NO: 578) 

Start ATG: 58 

MLDCGLTEQTVLNFLPLPFVQSLKWSNLPNFVPSRDHDPQMDGELKDCCGRVFVDSTPEFNLPMDKMLDFSEVDVILISNYLNMLALPYITENTG 
FKGKVYATEPTLQIGRFFLEELVDYIEVSPKACTARLWKEKLHLLPSPLSEAFRAKKWRTIFSLKDVQGSLSKVTIMGYDEKLDILGAFIATPVS 
SGYCLGSSNWVLSTAHEKICYVSGSSTLTTHPRPINQSALKHADVLIMTGLTQAPTVNPDTKLGELCMNVALTIRNNGSALIPCYPSGVVYDLFE 
CLTQKLENAGLNNVPMFFISPVADSSLAYSNILAEWLSSAKQNKVYLPDDPFPHAFYLRNNKLKHYNHVFSEGFSKDFRQPCWFCGHPSLRFGD 
AVHFIEMWGNNPNNSIIFTEPDFPYLQVLAPFQPLAMKAFYCPIDTSLNYQQANKLIKELKPNVLVIPEAYTKPHPSAPNLFIEQPDKKIITFKC 
GE 1 1 RLPLKRKLDRI YI TSELAQKI S PKEVAAGVT FSTLTGVLQVKDKVHC I QPCADSVKDET I SSNSAPTKEDVLKNVKYEYGS I DVDAVMKKL 
AQDGFSNIKLDRTGGALTLNLVNEDTVIKFEDNETHIICGGKPTTRLKLRDTIMKCLQSF* 
(SEQ ID NO: 579) 

Classification: hypothetical 



Celera Sequence No. : 142000013384546 

GCAGCGGGCACCAGTGGTTCTGTGACCATATCGCTGCATCCGCTGGTCATAATGAACATCTCCGAGCACTGGACACGCTTTCGTGCACAGCACGG 
TGAGCCGCGACAGGTGTACGGAGCACTGATTGGCAAGCAGAAGGGCCGCAACATTGAGATTATGAACTCGTTCGAACTGAAGACGGACGTTATTG 
GCGATGAAACAGTGATCAACAAGGACTACTACAACAAGAAGGAACAACAGTACAAGCAGGTGGGTGTCCAAGCCTCTACATACGCTGCTTTACCA 
GCTGTTACTCTTTTTTCCCCAGGTGTTCAGTGATCTCGACTTCATAGGCTGGTACACAACCGGCGACAATCCCACTGCCGATGACATTAAGATCC 
AAAGACAGATAGCCGCCATCAACGAGTGCCCCATCATGCTACAGCTGAATCCGCTCTCCCGCAGCGTCGACCACCTTCCCCTGAAGCTATTCGAG 
TCCCTAATCGATCTGGTGGACGGCGAGGCCACCATGCTGTTCGTGCCCCTCACCTACACACTGGCCACCGAGGAGGCGGAGCGCATTGGTGTGGA 
TCATGTGGCGCGCATGACCTCGAACGAGTCCGGCGAGAAGAGCGTGGTGGCGGAGCATTTGGTGGCCCAGGACAGTGCCATTAAGATGCTAAATA 
CACGCATCAAGATCGTGCTGCAGTACATACGCGATGTGGAGGCCGGAAAGCTGCGGGCCAACCAAGAGATTCTGCGAGAAGCCTACGCTCTCTGC 
CACCGACTGCCCGTCATGCAGGTGCCCGCCTTCCAGGAGGAATTCTACACGCAGTGCAACGATGTGGGACTGATTAGCTACCTGGGCACGCTGAC 
CAAGGGCTGCAATGATATGCACCACTTTGTGAACAAGTTTAATATGCTGTACGACCGCCAGGGATCCGCGCGCCGCATGCGAGGTCTCTACTATT 
GATATATGCAAACAATCTATTAAAGCTTCGGAGATAAATACACATAATCGTACGATGGGTGAATGCGGTTTTATTTCAGCTGCCGCTGGGCAGGC 
TACATACGTATACGGGTGGCATCAAGCGATATCACTTTCTTCCTGAGCTTCCAGCTCCTCTTGCTTCGTAAATTCAATTCAGATGCACAGACAAC 
ATTTCCTAGACACTAAATTAGGTAATTTCGCAGCGGAATTAGTGGTTGTAGTCGTCCAGCTTGATGCTGGGGGCATACTGCTGCAAGAACTCCAC 
CTCCAGCGGATGCATCTCCTTGGCGGGAATGATGAGCGAGTGCAGCGGTCCGCCCAAATCGGTCGAGCGCATCTCAAGCAGCGTGCCAACGGCAA 
TCTGCTGCGACTCCTGGCCAACACGGGCCAGTCCAACGCACAGTGACTGCTCGTTGAGCACTGTGTTCTTCTCCAGCGAGTCCTTCTTCTCCACA 
ATGGACAGCAACTGATGAGCCGCCTCCGCCACGGTCATGAAACGCGGTGGCATGTACTCCTTGCGTTTGCGCATCAAGGACTCCGGCGTGGGCTC 
CTTCACCTTGATGTCCAGCAGGCACAGCGTGTGCATGTTGTGCAGGCGATTCAGCTTGATCTTGTCGTAGAAGCTGTCCGGCTTCCATGTCTCGT 
CCCAGTACGGAATCGAAACGGTCTCGCCGAATTTGTACAATTGCAGGCCGCAGCAGCCGACGGCGTTCATGATCGATGCATTGTGGATGACCTTG 
TAGGGGATGTTCTTCTCCTTGGCGCGCAGTATGAAGTCTGTGTGGGTGGTGGCGCCAAAGGGGTCGCCCACCACGAGCAGCGCAACATCGGATTC 
GCCCGCACCCGCCAGGATCTCATCCGCACCCTGCTCCACCAGATCGCGGTCGGCTAGCAGCAGTGGTCGTCCATAGAACTCTTGCTGCGAACAAA 
GTGGAGATGCAATTAGCATTGGGGGTGCTCCCAGAACGAAGGCCATTCCAGATACCTACCATATCCTCAAGGCTGCAGCCCAGAATCGAGGTGTA 
CATCTCCAGATAGACGCGACTGCATTGCTTCACTATCTCCAGACCCTTGACCGTGATGTCCTTCAGGTCGCCAAGGCCGAGGCCGATTAGGTAGA 
ACATGCTGGGCGACTGGTGCGACTGCTCGGCCACTGCAACTCCTGCCAAATGTTATTAAAATCGCTTTTTAAATGCAGTCAACACGTGCCGTCGC 
GCGTTGTGTGACCGCCGCCAGAGCGCCAAGAAATACTCACATTGCGACAATGGGCTAAAAAAGCCTGTGCAGCAATGGCGCGTATTTTGAATTTG 
AAGGCCATTTCATTTAATTGGGAAATAAACACTAAAAACTTTGGTCCACTACTTAAGAAATGATATATTTGGTTTGCTTTTTTGATCTCAGCTGT 
ATGATACCTATGTTACTCTTAGACTGTAATAATAAGTAAACTACTTAACATAACTACACTCCAGCAATTATTGGAACTCTGAATTTGTTTGCTCG 
GAAATCTTGAAGCTGCAAAGTTGGGGTCTGTGGATGAAAAATATCATTTAATTAACGTACTTACGTACTTACAACAATATTACGCCATTTCAACT 
TACGTTGTTGAGGCGCCCCGAACCACGCAATAAGCTATGAATCGTTCAATCTGCAAAGCAATCTTTATCTTTACCGATTCCCTTATTATAAATGC 
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ATAGGCACTCACCATCATTTCAAAGTCGCCATAGGCGGCGGAGTGATCCATGCGCACCTGCAGGAAGTACTTGTGTATCTCGGCACCCAGATTGT 

CCTTCCAATCAGGACACTCCTCGCGTATACGTCTGCAATTGATGATCATGCTGAAAGTGCGGATGAGCATAAACAAATCTGCCTGGCGTCCACTG 

TTTGCCAACTGGCGCCAGTTGACCGTTTTGGCCATTTCAAAAGCCTTGAGAGCCTGCATATATATGGGTTTAGATTTTAAAGTGCTGATGGAATC 

AGTTTTAGGTTCATACATATTCCTTCTGGTCCAGTTCAACTGGTTCTCCCGTGTACCGTCCCATTATGGTGAGTAGCAGGGCCATAGCTACATCG 

TTGCCAAGATGGCAGGATTCTACCATGGCGGGAATCGCGGCCTGCAGGTCCAGCACTCTTTCCAGGCGCTCCATACCCACGCCAGTGATAAGGTT 

TTCGTACAACCCACTGGCCATGTAGAACATCTAAAGGGAATAAGATGTAATTGATCGGAAATGTTAGGATAGCAGAAAGGGGAACTACCTCCTCC 

GAGTCCAGGGACTCTGCCAGGTGACACACCACATCCACCATCATGTAGCGCTCCTTCATCAGCAGAACGTTGTCCCTTTGGTACGTCATCATCAT 

AATGGAGAGGGTAAACAGATCGAGCAGCTGCTGCAAATTGAGTTCCAGCTTCCCGGCTAAGGTAGCCATCATGCGATTCAGCAACTAAGTTAATA 

TAAACAAATTAGTAATAGTTATTAGGAAATAAACAGAATAATAACCTGTTTGCCGTCTGTGGTTTCTGCAGCGTAAGGAAACAAATGCTTGATGG 

ACGACTTCCGGAGAAAGGACATGATGGATTCGTGGTCATAGCCTAGGCGCATTTCATCCACTTCCACGAGGGAATTATTCAACGATGAATCAGAT 

AGATTCATTTTGTATTGGCGGTAAAAAATGCGCAAAGCAAATAAATAGAAATAATAAACTGAAAGAGTGACCAGGCACCAAGGGGTGGGTCAAAT 

AGCCTAACGCCTAGTTAAGCGCTGTATCGCATGAACGTAAACGGTCATACCATTATTGTTGCTCCACGTGCGGCGATTTTTATTCATTTACTACT 

TTAATAATCAATTTTAGTGAATTGACACTAATTAAAATGCCGCCTGTTAAGAAGCCCAAGGCGCAGAAGATCGAGAAGCAAAGCACGCCCAAAAA 

GCCCATCGATGCAGCCAAGAATGTGGTTTCCGGACAGGTGAGCGGCTACCAAAAACAAATGCTCCTCCGTTGGGTTGGATAACCGTAAACTGCGT 

CCATTTTAGCTAAAGAAAAAGGTGACCAAGGCCAGGCCGCAGAAGCCAAAGGGTCCAGAACGCGGCGTGGTCATTGTGAAGCACCTGCCTCACGG 

ATTCTTCGAGCAGCAGCTGCGCCAGTACTTCCGACAGTTCGGTCGGGTGCTGCGCGTCCGTTTGGCCCGTTCGCTGCGCACCGGTAACAGCAAGG 

GCTACGCCTTCGTGGAGTTCGAGTACCCGGAGGTGGCCAAGGTGGCTGCCGACACCATGGACAACTACCTGATGTTCCAGAAGGTGGTGAAGGCC 

ACCTACATTCCACCAGAGAAGCAGACACTCAACTTCTTCAGGACTTCCTTGAAGAAGGTTGTCAATAAGGTGCGTTCAAAGGGAAATACATCTAA 

TGTCCGTTTGTTATTCATAGTTGTACTTCACTTGCAGGCCGGCAAGGAAATCTATGTCTCGGATTTAACCAAAGCCACACAGCGCAGCGTGAAAA 

AGCAGAATAACTGGGATGAGTCCGCCTGCCAGAAGCGCACCGTGGCCAATCTCAACAAGATCAAGAAGCTGCAGGAGAAGTACAAGGATCTGGGC 

ATCGACTTCTCCAACTTACTGGTGGAACCAGTAAAGAAAGCAGCAGGCTCCGCTGAGGCATCTACAAGCCAGGCAGCCGCCAAGAAAGCCGGTAA 

GAAACAAAAAACGGCCGCGAAGAAAGAACCAACGCTGGAGGATTTGCTGGGCAACACCATCAACGAGGACTCCGATGATGAGGACTACGTTGATG 

CATCCGATGATGAGTCCGATCTGGAGGCTGAGG 

(SEQ ID NO: 580) 

Exon: 3783. .3466 
Exon: 3397.. 3224 
Exon: 3165.. 2962 
Exon: 2903.. 2673 
Exon: 2599.. 2569 
Exon: 2132.. 1960 
Exon: 1889.. 1001 

Start ATG: 3618 (Reverse strand: CAT) 
Transcript No. : CT16835 

AATCGCCGCACGTGGAGCAACAATAATGGTATGACCGTTTACGTTCATGCGATACAGCGCTTAACTAGGCGTTAGGCTATTTGACCCACCCCTTG 
GTGCCTGGTCACTCTTTCAGTTTATTATTTCTATTTATTTGCTTTGCGCATTTTTTACCGCCAATACAAAATGAATCTATCTGATTCATCGTTGA 
ATAATTCCCTCGTGGAAGTGGATGAAATGCGCCTAGGCTATGACCACGAATCCATCATGTCCTTTCTCCGGAAGTCGTCCATCAAGCATTTGTTT 
CCTTACGCTGCAGAAACCACAGACGGCAAACAGATGATGGCTACCTTAGCCGGGAAGCTGGAACTCAATTTGCAGCAGCTGCTCGATCTGTTTAC 
CCTCTCCATTATGATGATGACGTACCAAAGGGACAACGTTCTGCTGATGAAGGAGCGCTACATGATGGTGGATGTGGTGTGTCACCTGGCAGAGT 
CCCTGGACTCGGAGGAGATGTTCTACATGGCCAGTGGGTTGTACGAAAACCTTATCACTGGCGTGGGTATGGAGCGCCTGGAAAGAGTGCTGGAC 
CTGCAGGCCGCGATTCCCGCCATGGTAGAATCCTGCCATCTTGGCAACGATGTAGCTATGGCCCTGCTACTCACCATAATGGGACGGTACACGGG 
AGAACCAGTTGAACTGGACCAGAAGGAATATGCTCTCAAGGCTTTTGAAATGGCCAAAACGGTCAACTGGCGCCAGTTGGCAAACAGTGGACGCC 
AGGCAGATTTGTTTATGCTCATCCGCACTTTCAGCATGATCATCAATTGCAGACGTATACGCGAGGAGTGTCCTGATTGGAAGGACAATCTGGGT 
GCCGAGATACACAAGTACTTCCTGCAGGTGCGCATGGATCACTCCGCCGCCTATGGCGACTTTGAAATGATGCTTATTGCGTGGTTCGGGGCGCC 
TCAACAACGAGTTGCAGTGGCCGAGCAGTCGCACCAGTCGCCCAGCATGTTCTACCTAATCGGCCTCGGCCTTGGCGACCTGAAGGACATCACGG 
TCAAGGGTCTGGAGATAGTGAAGCAATGCAGTCGCGTCTATCTGGAGATGTACACCTCGATTCTGGGCTGCAGCCTTGAGGATATGCAAGAGTTC 
TATGGACGACCACTGCTGCTAGCCGACCGCGATCTGGTGGAGCAGGGTGCGGATGAGATCCTGGCGGGTGCGGGCGAATCCGATGTTGCGCTGCT 
CGTGGTGGGCGACCCCTTTGGCGCCACCACCCACACAGACTTCATACTGCGCGCCAAGGAGAAGAACATCCCCTACAAGGTCATCCACAATGCAT 
CGATCATGAACGCCGTCGGCTGCTGCGGCCTGCAATTGTACAAATTCGGCGAGACCGTTTCGATTCCGTACTGGGACGAGACATGGAAGCCGGAC 
AGCTTCTACGACAAGATCAAGCTGAATCGCCTGCACAACATGCACACGCTGTGCCTGCTGGACATCAAGGTGAAGGAGCCCACGCCGGAGTCCTT 
GATGCGCAAACGCAAGGAGTACATGCCACCGCGTTTCATGACCGTGGCGGAGGCGGCTCATCAGTTGCTGTCCATTGTGGAGAAGAAGGACTCGC 
TGGAGAAGAACACAGTGCTCAACGAGCAGTCACTGTGCGTTGGACTGGCCCGTGTTGGCCAGGAGTCGCAGCAGATTGCCGTTGGCACGCTGCTT 
GAGATGCGCTCGACCGATTTGGGCGGACCGCTGCACTCGCTCATCATTCCCGCCAAGGAGATGCATCCGCTGGAGGTGGAGTTCTTGCAGCAGTA 
TGCCCCCAGCATCAAGCTGGACGACTACAACCACTAATTCCGCTGCGAAATTACCTAATTTAGTGTCTAGGAAATGTTGTCTGTGCATCTGAATT 
GAATTTACGAAGCAAGAGGAGCTGGAAGCTCAGGAAGAAAGTGATATCGCTTGATGCCACCCGTATACGTATGTAGCCTGCCCAGCGGCAGCTGA 
AATAAAACCGCATTCACCCATCGTA 
(SEQ ID NO: 581) 

Start ATG: 166 (Reverse strand: CAT) 
MNLSDSSLNNSLVEVDEMRLGYDHESIMSFLRKSSIKHLFPYA^ 

DVVCHLAESLDSEEMFYMASGLYENLITGVGMERLERVLDLQAAIPAMVESCHLGNDVAMALLLTIMGRYTGEPVELDQKEYALKAFEMAKTVNW 

RQLANSGRQADLmLIRTFSMIINCRRIREECPDWKDNLGAEIHKYFLQVRMDHSAAYGDFEMMLIAWFGAPQQRVAVAEQSHQSPSMFYLIGLG 

LGDLKDITVKGLEIVKQCSRVYLEMYTSILGCSLEDMQEFYGRPLLLADRDLVEQGADEILAGAGESDVALLVVGDPFGATTHTDFILRAKEKNI 

PYKVIHNASIMNAVGCCGLQLYKFGETVSIPYWDETWKPDSFYDKIKLNRLHNMHTLCLLDIKVKEPTPESLMRKRKEYMPPRFMTVAEAAHQLL 

SIVEKKDSLEKNTVLNEQSLCVGLARVGQESQQIAVGTLLEMRSTDLGGPLHSLIIPAKEMHPLEVEFLQQYAPSIKLDDYNH* 

(SEQ ID NO: 582) 

Classification: enzyme 
Gene Symbol : Dph5 



FIGURE SHEET 319 



FlyBase ID: FBgn0024558 



Celera Sequence No. : 142000013384285 

TCGTTGCTGATCGCCTTCATGTTGTAACTGGCGGGCAGGAAATAGCCCAGCAGAAGGTATCGATTGTTGAAGCTAAACACGGCCGGTCCACTCGG 
GTACAGGTTATCCCAAATTCCGTGAAGTATTGTGCACTTGTCCTCATTGCCGATATACAGCTCGAAGGGACCCGTTCGCATGTTGCTGTTCCAGG 
TGGCCTTCATTCTGACGGACAGGCAGCAGCCATCCTTTCCGCTGTTAAAGTGATAGATGCCCACCCCATGACGTTGTCCCTTGAACCAGTCACCC 
GAATAGTTGTCCCCATTCACATACTTGTACCGGCCCTTGCCGTGCTTCAAGTTCTTTCGCCAGTTGCCCTCGTACACGGATCCGTCCGGATAGAT 
GAAGATGCCGCGCCCACAACGCTTCCCGCAGCGATACTGTCCGTAGTATCGTGAACCATCCTTGAACACGTACACCCCGATGCCGTGACGGCGAC 
CCTTCCTGTAGTTCCCATCATACTGATCTCCGTTGGGCAGGATGGCCCAACCTCTACCTTGTCTCTGTCCGGCAGCATTTCGACCGCCTATGTAG 
AGCTGGGATTGAGGATTATTATAGATATTTGTTTTAATTGAGCTCCTTGCTTACTAAAAACTGTTACTACTAGAGAATGTTCGTATTAAAGTTAC 
TCCACTAAAAATTTATTTCTTTTGAGGAAGGAAGATGCACTACTAAGAAACTCATCCATTTTGGAACTTACGCCTATATTGGGTCCCAAATCCTC 
CTCTTCCTCGGGCGCCGACAGATCCTCCTCGGACATTTCGGTCATGGCCATGGTGATGATTTTGAGTTAACTAACTCCTTTCGGTTCTTTTTATT 
GCCGAGTGAAGTAGATTTTTACTTACGAATTACGTTCGGGAAATATTTTGTGTAGGCATTAAATAAATGTATTCTGTTTGTATAGTTAACTTAGC 
AAAGGCTTACTACAATAAATATCAGAATCGAAGAATGATGAAATCGAGGGCTACGTTTTCGCATAGAAGTAGTTGGCCAGTGGATATCGTATGGT 
CAGGGCAATCAGTGTACACAGCACCTGGAATCCCATCAGCATCTGGGTGACCACTCGCTCGTGGACAGGTCCGAAAACGACTAGGACAAAGTTAA 
TGAGCGTAAAGTTGTTGGTCCTAACAACTCCGTCCGCTTTGGTATGCCAGCTTATTAGTCGCAAGTTCCGCAGTACTGTGACCATCAGGCGACCC 
GGAGCGTTGAGATCCTCCAGGCGAAACTCCGTGGTGCTGATGTGGAGCAGATCCGTCTTGCTGTCGTACTTGGGCAGCCGGTGACGCGGACATGG 
CACAAAGTGGAACAGCTGCGGCGTGGAGTACAGGAAATTCAGGATCTGCGGCAGGAAGAAGAGCAGCAGCGTCTTGCTGAAGTGGCCCAGGATTC 
CAACCACGGCGAAAGTCATGCCGGCAAAGTAGCAGTAGGTGTCCCCAACAAACACCTGCGATGGATATCTGCGAAATAAGTAAGATAGAAAATAT 
ATTACAAACGAAAAGTTAGTTAGAAGTAAAAAAAATATTAGGAATTAACAACAAAATTACGCATGACAGAAAAATTAACATTATTATCCGAACTT 
TATTACATAAAAGTTTTATTTTTTGTGAATATGATAATTATTAAATTTAAGCTGACGAAAAGAGATTATAACGTATTTTTAAGTTTCAAATAACA 
TCCTTAATAATTCGATAACAGTAACAACCAGTGTTGTAAGGCCCATCGATGCGAAACAGCCAGATTTACAACTGCAGCGGCGAACAGCTGACTGA 
GGACATTTGCGTCAGCTGTGCGGGGGAGTTGAGGGGTGCCGTAAAATTTTTGGGCGTGTTTGCCAGCAGAAGAAAAGCGTGAAAAGTACGAAAAA 
TCGAAACGGAATGAGCAGGATGAACTGCAGAAAGTGAACCAGTGACCATGGTGAGAAGCACATTCCTGGCCACCTACAACCAGACACCGGTAGCC 
GGCGTTGGACGGCTGGCCATCACCAAGAAGCTGGTGCTCTGGTTGGTGCTGGTGCTGCAGGTGGGCTTCATCGGCTGCATATGGCTGCTGCAGAT 
GAGCCGTGGCCAGGAGTCGCAGCAGCGCAGTGCAATGGACCAACTTGGCGGACGGGTCAACTTCGTGGCCAGCGGTGAAGCCCCTATGGAGAAGA 
TGGCACTTGTGGGCGGAACCCACTGTCCGCATCTGCAGGAGAATGTCAACCGCTACGACAAGGACTTTTATCAAATGAAACCGGAACGACTGAAC 
GAATCCCACCTGCCCGACTACAATGTGCCCGCCTATGTGGACGCGGAGATGGGTCTAACCCCAAACCTCTGGTGCTATCGCGAGGGGACCATCAA 
CGAGAGCCAGCGGCTCAACGATGTGGACTATCTGATGGCGCCGCCTCAATGCAGATGTGAAAGTGGCTGGCACGGCAGAGATTGCGGCCAGCCGG 
AGATCATATGGCGAGCCCTGATGACGTCCAATCGGGCCAGTAAACGAGGCGGTAGCACACCGCTTCAACTGGTCGAGGCCAGTTCTTCTTCACTG 
CACCGCCTGTTCTATATGCTTGAATTGGGCGCCTGGGACCACCTAAGTCTGGAGCTCTTGGAGCTGCAAATTCGAGCGCTCATCGAGGTTGTGGA 
CTACTTTCTCATATACTATGTGAGCAACGGCAGCAAGGAGCGGAGTCTGGAGAGCATGCTGGGCAGCCAAACGAGCTATACGCTACTACGCTGCT 
CCTCCGAGAGCAACTGCACCAGCTCAATGGCCTACAGTCACTTTAGGCGGCAGCTGTGGCAGCAGTGTGGTGTCCAAATGCAGGCACAGGACCTG 
CTGCTGCACGGGGACAGTGGAACGGTCTACGCGCCAGCAGCACTCAAGTTTCTGAAGTACTATGCCAAGGATGTGCTGCCGCTGAAATTTCGTTT 
GAAGTACAATGTATATGGATTCTACTGGCAGCATCCGAAGAAGACCCTCCTAAATGGGGTGATAAGTTCCTTGGGGCACCTGCACAGTGCCCAAC 
TGGATGCTCACCGCCTGCACCGGCTGGCGAGCAGCACACTGGGTGATCTCAACCACTACGGCGGCTGGAACTGCGAGCTGTGCCTGCCTCCCGAG 
CAGATTGTGCTCCTGCTGCAGAGCTCCAGTCCCCGCAAGCTGCCAGTAAAACTACCCAACGATACCCGCAATGCCCACATCGATGCCAACTACAT 
GCAGCAACTGATAGCCAATGGAGTTCACATCGATGGCACCACGCAGCTGCACCGCTTGCGGGAGCAGAGCGAGAAGTATTTCGCTCCGGAGGAGG 
CGCTCCAGCACAGCAGCCAGTACGGCCAGCTGCTGGTGAACCTATACGATGTGGACGTCCTGGAAGATCTGCAGGACGAAGACTAACTACGCCTG 
CCGAAGTTTGACTTGGGCATGGCAGCCTATCGAACCAGCATAAACCTACGAATATATATATTTATATGTGCCTGCTTTTGAATCCCCATCAAAGT 
CTAACAATCTCATAATCATCATATCATACTCTGCCTAATCCACTCATGCGAATGCCCTTCTGTGTTTTTGGCCAGGTGAATTCCAAATTTAGTAT 
C T C AGC AG AC AG GAC AG AC AG AC C C GC C AT CT GT GAT AAG AAAAAC C AG AAC T AAAC AAT G AAT T TAT GAG AGG AAGT C G AT GC AAC C AA C T GAG 
AGAAGTCTCAAATTTGCTGGAAAAAGTTGTGAAATGGGTTGATCGCAATTTAGCATTATTTTTAAGACATAACACACACGATCTGTATGAAAAAG 
CTTTTTAGACTTACTTTAAATATTAATACCAATTATCAGAAGGCAAGCAAATTTAAGCTTAGTTTTGGTTACAATTTTGAAGCAATTTTGAATTT 
GTGTTATTACTTTATTACTAACGTACCCTAATTATAATTAGACGAAAATTTGTTGACCTTCCAGATCAATTCAGTATTTATAAAAATAAAAATCA 
ATTGGATTACTCACTTGTTGAACTTCCATAGCGCTAGGGTGGTGGCCAGGAAAGGCAGCATGAAGTAAATGGAGAATATATGCGAATCCACCTGG 
TGACCGAGCAACAGTTCAATGGCATTGAAGACCAGAATGGAGCCTGCAATAATAAAGGATTGTCCCACCTCCAGGCCATTGATGCCCGCCAGGAT 
GTTGATGGCATTTGTGCAGAATACCGCCAACATGCCCATGAAGACGTAGTACAAGGCACCTTGAAGAAATCAAAATAGGTTTCATCTCTTGATTT 
TCATTTCTGGAAAACCTTAGCTACCTATATTCAAGGAGGTCCCAATCAGATTCCTTGCAAAGTTGGGCATAATGACCGTCGTGGAGTTATAGTTT 
ACGTAGTACACCATTAGCAGCGGCAACGTGGCGATGGTGGGCAACAGGAGCTTGTGGCGCCATCGCAGGTCGAGAACGTCATCGGCGAAGCCCAA 
GAAGATCATGCAGCAAATGGAAAGAAGGGCGGCAATCAATTCCACGAACTGTGTGGGGAATAACAGCCGATAAGGTGGAGCACAAAGACATCTAT 
CAATTAGATAGCTACCTTATCATGTGGAAAGGTATCTGGTTTGCCTCCAGTAATCGCATCCGTGGCAGCTGCCTCGTCGAAGGCGAAGGGAATCG 
GT AT G AAC AGAAAC AAGG A C AC C AG GAAC AC AC AGC C AAT AAG C AC A CC AAAG GAT T C AGGT C T AG AGT T T AGG AAG C GT T T T AGT T AG AAAAC C 
T AC AAAAC AAAG GAG AGC G AAC TT AC AC CTGCGGCTTGTCCTTCTT G C AC AG AT C G T T GC C AAAG AG AT T G GCC T T GAT G AAC AT C T CG C GG AAG 
CGGGGTATCATCCGGACTGTCATGCAGTAGGCGGCGCCCGAAATGGCCGCATTGATGGCTATGGCGACTCCCGACATGGTCGGCCTTGAAAATCC 
ACCTTCCGCTGCACTAATTCAATTAGCTGGTGCAGCTCTGTGCGCAGCAAATTTGCCGGAGTCACGCGTACGGAGTGCGCTGTTTCAATTGGGAT 
AGTGGGAGGCCGCGGAAAACCAGCCGATCACTCGATCATGGTGCAGAAATCAATTACGCCGACAAAAAAAAGTACACGCAATTGAAACTCAAATT 
GCTGATAACAACAAACTCAGCTGTTCGGCGGCTAGTGTGCCCGTACTTTCGACTGTTAGACAAACCAAACAGCTGATTTTAGTTTACTAATTTAA 
ATTATTCATATATAAAATACCACAAATGAATTTAATTTTATTTTGACCTACCTTAGTGAAAATAGTTTTTAATTTGTATAGTCGGCAATAATCTT 
AATTTAATTTCGCTTACATATCAAAGTTAGAAAATGTTATAGTATTTCTAAAATCAAGAGATCGTTAAAAATTTAAAGACTGAAAGTAAAAACAC 
TTTACTAGATGAAAAAGGCTTAACTTTATTTAATATCAATGATTACAACACTTGGACCAACTCCGTTATTACACATATAGATAAAGATATCTACA 
ACGAAATAGATAAAATATGTTGCGAATACTTGTTCTTTTCGAGCTTCCTTTGTCAATTTGGATTCGGTAAAGATTGCTACCTATCCCTGAAGAAG 
GGGAAGACCTTTTGCAGCTCCGCCTTCGACTTGCAGGCCTCCACCTGCGGCAGGGTCTCCACCAGATTGAGCAGATAAATCTGCAGCTGCTGGCG 
GCGTTCCTCCACGAAGACCAGGTTCATGTTGCCGAAATGCTTCTTTGGTGGAAACTCCACGGCACTGACCACCGGATGGGTCTTAAGCAGTGACT 
TGTGCAGCTTGTAGAACTCGCTGTAGCGTCTGAAAAACGTCCAGTGCTCCAAACGCTGCCTCATCGTGATGTGAACTTCGTAAGTGTAGTGGGAA 
CCAGATCGTTGTGTCTTTGCCAGCTTTACATGCGGAATGGTTATCAGGAAATTGGAGCACAGCGATGAGCTGGGGTCGATGCCGGTCTCTTCAAA 
CTGTTTGGTTAGCCTTCGGATAAGCAGGCGATTTTGAAGACTAAGCTCCGCCACACGGGTTTCCAGCAGAGCGCATCTTTCGTTTAGCTGGTTGA 
CATGCTCTTCCAGCGCAGCTTGCTGTTGGCTGTTAGCCAAGCAATTGGCAGAGGAGGAGGACTTGCTCCACTGCG 
(SEQ ID NO: 583) 
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Exon: 5155.. 4777 
Exon: 4717.. 4576 
Exon: 4513.. 4300 
Exon: 4239.. 4082 
Exon: 1480.. 1001 

Start ATG: 4922 (Reverse strand: CAT) 
Transcript No. : CT16853 

AACAGCTGAGTTTGTTGTTATCAGCAATTTGAGTTTCAATTGCGTGTACTTTTTTTTGTCGGCGTAATTGATTTCTGCACCATGATCGAGTGATC 

GGCTGGTTTTCCGCGGCCTCCCACTATCCCAATTGAAACAGCGCACTCCGTACGCGTGACTCCGGCAAATTTGCTGCGCACAGAGCTGCACCAGC 

TAATTGAATTAGTGCAGCGGAAGGTGGATTTTCAAGGCCGACCATGTCGGGAGTCGCCATAGCCATCAATGCGGCCATTTCGGGCGCCGCCTACT 

GCATGACAGTCCGGATGATACCCCGCTTCCGCGAGATGTTCATCAAGGCCAATCTCTTTGGCAACGATCTGTGCAAGAAGGACAAGCCGCAGGTA 

CCTGAATCCTTTGGTGTGCTTATTGGCTGTGTGTTCCTGGTGTCCTTGTTTCTGTTCATACCGATTCCCTTCGCCTTCGACGAGGCAGCTGCCAC 

GGATGCGATTACTGGAGGCAAACCAGATACCTTTCCACATGATAAGTTCGTGGAATTGATTGCCGCCCTTCTTTCCATTTGCTGCATGATCTTCT 

TGGGCTTCGCCGATGACGTTCTCGACCTGCGATGGCGCCACAAGCTCCTGTTGCCCACCATCGCCACGTTGCCGCTGCTAATGGTGTACTACGTA 

AACTATAACTCCACGACGGTCATTATGCCCAACTTTGCAAGGAATCTGATTGGGACCTCCTTGAATATAGGTGCCTTGTACTACGTCTTCATGGG 

CATGTTGGCGGTATTCTGCACAAATGCCATCAACATCCTGGCGGGCATCAATGGCCTGGAGGTGGGACAATCCTTTATTATTGCAGGCTCCATTC 

TGGTCTTCAATGCCATTGAACTGTTGCTCGGTCACCAGGTGTTTGTTGGGGACACCTACTGCTACTTTGCCGGCATGACTTTCGCCGTGGTTGGA 

ATCCTGGGCCACTTCAGCAAGACGCTGCTGCTCTTCTTCCTGCCGCAGATCCTGAATTTCCTGTACTCCACGCCGCAGCTGTTCCACTTTGTGCC 

ATGTCCGCGTCACCGGCTGCCCAAGTACGACAGCAAGACGGATCTGCTCCACATCAGCACCACGGAGTTTCGCCTGGAGGATCTCAACGCTCCGG 

GTCGCCTGATGGTCACAGTACTGCGGAACTTGCGACTAATAAGCTGGCATACCAAAGCGGACGGAGTTGTTAGGACCAACAACTTTACGCTCATT 

AACTTTGTCCTAGTCGTTTTCGGACCTGTCCACGAGCGAGTGGTCACCCAGATGCTGATGGGATTCCAGGTGCTGTGTACACTGATTGCCCTGAC 

CATACGATATCCACTGGCCAACTACTTCTATGCGAAAACGTAG 

<SEQ ID NO: 584) 

Start ATG: 234 (Reverse strand: CAT) 

MSGVAIAINAAISGAAYCMTVRMIPRFREMFIKANLFGNDLCKKDKPQVPESFGVLIGCVFLVSLFLFIPIPFAFDEAAATDAITGGKPDTFPHD 
KFVELIAALLSICCMIFLGFADDVLDLRWRHKLLLPTIATLPLLMVYYVNYNSTTVIMPNFARNLIGTSLNIGALYYVFMGMLAVFCTKAINILA 
GINGLEVGQSFIIAGSILVFNAIELLLGHQVFVGDTYCYFAGMTFAVVGILGHFSKTLLLFFLPQILNFLYSTPQLFHFVPCPRHRLPKYDSKTD 
LLHISTTEFRLEDLNAPGRLMVTVLRNLRLISWHTKADGVVRTNNFTLINFVLVVFGPVHERVVTQMLMGFQVLCTLIALTIRYPLANYFYAKT* 

(SEQ ID NO: 585) 

Name : N-acetylglucosaminephosphotransf erase 
Classification: enzyme 

Celera Sequence No. : 142000013384548 

CATACAAATATTATGTGGATAAATGGTCAGATGCTGATGAATCGGGAGAATCCAAGGATCAGGAGTCCTTTGCTCATTACCAACGCAACCAGCAG 
CGCTCTAAGGCAAAGGCCACCAAGGAATCTATTAGTGACATGAGAGAGGTGAGGTTCCCCTTACATCCAGGACTTCTTTTGGTCTTTACTTGCCA 
GTTCTCTCCTACAGATCATCCGACAAGCCTTCAGGAGCTCTGAAGTGAAGAGCCTGCGATTTCTTTTTGGCAACAATATGTTCATGCCCTCGGAG 
GCCTATACTCTGCACATACCACATGATTCCATATCCAGAGATCACTATTGCGAGCACCATGCTCTGCCCGAAGGTCGCATCAACCAGGCGCTGCT 
CCGCCTGCTCACCTGCGAGGAGCTGTACAGGCTATTCTCCACCGAACTAAAAGTCACCAACGTGTTTCTAGAAATGGAGCTCCTTACAGACTCCG 
ATCGACCCCAAGGATCCCATTGTGACTCCTTTAATCTAATCCCAAAGCATATATTAAGCCAACTTCCGCGCAGCTGCAAGAACATACACCTGCAT 
CTACTGCATTGCAGTGAAAACACACCGAATGAATTACGCTGCTGCAAGGAGATGGATATTTACCACGATCTTGGCGTGCGGAATCTGGATAAATC 
TGGGGAAGACGTTAGCCAGACCAACGAAGAGCATGACGTTTTGAAAGCAGCAAATGAAACCAGCGGTTGGTGGCAGGCCGAGGTCATAGTTCGCG 
GTTTCAGAGCTCCTGGCAACCAAAAATCGGGAGATTTGTGGTCTAGCTAACATTTTATAACTTGGATTCTTGCTATTCGGCGTGTTTGTGTCAAA 
AATAAAGTTTCGAATCGTTTATATTTCAAGTATTAAGTTATTTTATTATGTATATAATTGTTTAATTTTATTTTTGGTTTCGATTTATTGAGAAT 
GTTTTGAAAACCCAAGGCACTCAAAGATGATCCATCGTGGTATTGGTATTATCATCTTAGGTAGTGATAATCATGGAAACTGGTGCACTGCGCTC 
TCGCTGCTTGGCCACAATCTTGTTGGCTATGCGATCCAAGTTACGTTGTCCTTGCGGAGTCAGCTTACGACCACCATCCGGGTGCCTCTCCACCA 
T GT T AGC AGC T T C C AAG GC T T G AAG AGC C T G C T CAT G T GT T AG AT AC AT GC T AAGT AAT AT T AAAT AT AAC C T T AAAG T AC AAC C AT T T C C C AC T 
CACCTTGCGAATACAGCCATCCGATGAGCGACAGTGCTTGGACGGACGAACACCGTTGCGCTTGCGCCCGCTATAGACCTTGGTGAAGGCCCCAA 
CGCCGGCGGGACTGCGCAGGTACAGGTGTCTCATAATAGAGGCGCAGCGAGTGTAGAACCAGTCGTCGTCGGTAGGAGCAGTCTCCTTGAACTTG 
CCAGTCTTCATATAGACGGCCTGTTCGGGCACAAAAATTTTTCCGGATTTTTTGAGAAAAGCAGCCATGTTTTTGGTCAGAACGTGCTGATCGAT 
TTCCTTTACTGTGACTCCAGGCATGTTGCTCTCTACGTTCTTGAAAGCACGTTTTTAAGGCTCATGGGTCCGATATATGTATGTATGTATTTAAA 
AAAGTCTTACCTTTTGAGAAGAAAAAGGTTTGCTTCCGGTTAGTTTGCTGACTTTAGTCAAAATGGACGATTGGCAAGTGACGTTTGAAAGAGGG 
ACCAGGTCATTGACCCCAAAGGATTACTTTCCGATAGTTTTGTCGTGCAGTATTGGTGAACTTGGCAATTCTTTTCGAAAACTTAACCTATAAAC 
TTGGAAAGGAACGCAATGTAGCAAAACTACTGTTCTTGGTGACAGGGGGTTTAAGGTAGACTAACAAGGACAATTTTATGACACTGAAGCCCTAT 
GGAGTAAGAATCAAAGAACTGCTGTATTTTGGTTTGTATAAATGAATAAAACGTTCTACGCTAATTGAAGAGCATTCGAAGAGGTTTGAATACAG 
CGCCATAGGGTGACCAGCTTGTGGAGCATTGAAGGTATTTCTTGTTTTAAGAATGATCACGGGATGGTCACACTAGAAATACAGCCAAACAAAAC 
AACAAAAAGCATTTCGAGCGCTAACAAATATATATCTTTCGACTTGACTCATTCGCATTCCGGTTGACCGTGTCGCGCCTGCAGCATGTCTGAAA 
AGCCGACTGTTCTGATTTTGGGTGGTAAGTGGTCCCGCTGCATCCTCCTCATCGTCATCCTGCCCCCGCTCCCCACTCCACACTTCCGCTTATCA 
GCATCGCTGACTGCATCGCTGCTCTTCCTCTTCCCGCAGGCTGCGGCTTCATTGGACGCAACTTGGCCACTTATCTGCTGGACAATGAACTGGCG 
CAGGAAATACGACTGGCCGATAAGACCCCTCCGCAGATGGCATGGCTAAACGAGGAGCAGACGCGGGTCTTCGAGAGCGATCGGGTGGAGTTCTG 
CAGCGCGAACCTAATAAACGCCGGTAGGTAATCACTACATTAGGGTGGCTCGTTCCACCAGGATGCGGATCACA 
(SEQ ID NO: 586) 

Exon: 1544.. 1239 
Exon: 1168.. 1001 

Start ATG: 1544 (Reverse strand: CAT) 



FIGURE 




Transcript No. : CTI6966 

ATGCCTGGAGTCACAGTAAAGGAAATCGATCAGCACGTTCTGACCAAAAACATGGCTGCTTTTCTCAAAAAATCCGGAAAAATTTTTGTGCCCGA 
ACAGGCCGTCTATATGAAGACTGGCAAGTTCAAGGAGACTGCTCCTACCGACGACGACTGGTTCTACACTCGCTGCGCCTCTATTATGAGACACC 
TGTACCTGCGCAGTCCCGCCGGCGTTGGGGCCTTCACCAAGGTCTATAGCGGGCGCAAGCGCAACGGTGTTCGTCCGTCCAAGCACTGTCGCTCA 
TCGGATGGCTGTATTCGCAAGGCTCTTCAAGCCTTGGAAGCTGCTAACATGGTGGAGAGGCACCCGGATGGTGGTCGTAAGCTGACTCCGCAAGG 
ACAACGTAACTTGGATCGCATAGCCAACAAGATTGTGGCCAAGCAGCGAGAGCGCAGTGCACCAGTTTCCATGATTATCACTACCTAAGATGAT 

(SEQ ID NO: 587) 

Start ATG: 1 (Reverse strand: CAT) 

MPGVTVKEIDQHVLTKNMAAFLKKSGKIFVPEQAVYMKTGKFKETAPTDDDWFYTRCASIMRHLYLRSPAGVGAFTKVYSGRKRNGVRPSKHCRS 

SDGCIRKALQALEAANMVERHPDGGRKLTPQGQRNLDRIANKIVAKQRERSAPVSMIITT* 

(SEQ ID NO: 588) 

Name: ribosomal protein SI 9 like 
Classification: ribosomal_protein 

Celera Sequence No. : 142000013384589 

TCCACGAGATTCTCCACCTCGCTGACCTCTCCGTTCTTGATAATATCGAAGATGTCGTCCAAACTCACGGTGGTGGCACCATCGAGCCCCTCAAA 

GAGGAGCACATTTTCGTGGTCCTCGACGCCGCTACTGCCGTGTCCACCGTCCGTGCCGCGACCCGTTAATGGCGGCGAATGCGGTCGATGATCCA 

AATGGATGTCGTCGGCGTGGGCATGGCCATCCTCCTCCGCGACGACCATGTTGCAGGAGGAGGAGGCGGTTAGGGGCAACGCCCACACACTGCGC 

GCCGCCCC C GG AC T T GG AT G C AC C TT C GAGGC C AT G T AG AACT CG AAGT C C AC GC CC AGAC GGT CCGCCGTTC C AC C GT T GC CT T G AT GC AG GGG 

GTTTTCCAGTAGGTGTGCAATCTTGTCGCCTTTTTCCGTTACGCTCCGACCGAAATTTGTACTCTGTTTGGCCAATCTTAGGCGTGGAATGCTAG 

GTAAACACGACTGCACTCGATGGCGAGGAAATTATGTCAGCACTGTGAGAAAATCAATATTTGTCCCGAATCGCAGGACTCGAATTGGGGCCACT 

TTCGAACTCGACTTGGCGCCAAACAAATCAACAGCTGACCGTTATTTTCGCAGTTGGGCGCGCAGTGTTGCCAACACATACTGGTAATTAAGACT 

TTTCTCAGCGAGGGGTGATTTAAGATTTAAGTCAAGAAGTATTAAAGAAAAAATATATATATTTAATTTGTTGGAAGGTCAATTGTTGAAACCGA 

GGTATCATTTAGCTTAACTTAAATATACTAGAAATCTCGTGGCATAAGCAAACTAAAAAAAAAAGGTTTATCAACTGACTTTTATATTAGTTTTA 

ACATTATTGGGAAACATTTGTTTTCCGCCGCTAGAAACCTTTAAAATGTATACTACTATATTGCTTGGTACGAATGTTGCATTTTGAACTGATGC 

TAGCATTGCATTTTGTAAGGTTCCCATGTCTCTGTTATTTTGCCACCTGGTAACACTGTGCCCCTTTTTGATAAATTTGTTTCGTAAAAAATGGG 

C GAC AAC GT GGT G AC G GAACT AAAT AAC T T GG AT T T G AAC AAT C AAAAGGC AT GT T AAAC T CT T T T C GAACT GC AT T C G AGT GT AT GAG AAGT G C 

TAGTTAGCCGAATTATGTTGAAGTTGGCGCCGCAGGGCGGCAGATATTATAGCCAAATAAGCCGCATTTGCCAGTGGATTTTGCCGAAAATTCAG 

AC GC AGT C GAAGC C AC CAAAAC AAC AAC AAC AAC AG C AACC AC AAT T G AAAG AAC AAC AACAAC AC AGCT AC GC AAC TTTGGTTGGTGTTT T AAG 

C T AAAAAG AAT T G AC CT AAAC T T C AAT T T AAAC GC C T GT T ACT GCT GGC AC G T GT AAATGT T AT GCT T GT C T GT GT G AGT GT ATGC G AT T C T C C A 

ATGAATCAGTGCCGCCGGCTGTGACGTCACTAACAACCAACCAACTTCATGAGTGTCTGTTTTTTTTTGCTACTCTATTGCATCAGCCATAAAAC 

GACGAAGATACGCTCAAAGACGGCTCGATAAAACAGAGATAGGAGCACCGAAAACATAGCACCGAAAAGTGCTCATTGCCATGTCCCCAATTGTT 

T AAAC AAT AAG A AAAC AT C AGT C G AAAGT AG AT AC C G AAAAAG AGC ATT T GT C T AAT T GC AT ACGT AGC AC A CT T GT G GC C AC T T G AAT AT C T CC 

GGTTTATATTGATTTTTAACCTATTTCCGATGGATTTGCAGGCTGCCAAGATGAAGAAGGAGAAAAAGGAGAAGCCATCCGGCGGTGGAGATACT 

CGCAAGGGTAAGTAATTCTAACTAAGAATTAAAAACACAAAACGCACTTCCGTCTGTAAGATATGTAGCATATCTTTAAGATTAGCTGCCTTCTT 

CTCTCTAATACCTGAAATTCCACTGATCTCTGCTATATTGATTTCAGAACTCAGTCCCTTGCCCAAGTACATCGAGGAGCGCAATGTCTTCTGGG 

AGAAGTGCAAGGCGGAATACGAGGCCGAACTGGCGGCCAAGAAGCGTGAGCCCATCAAGGTGACACTGCCCGATGGCAAGCAGGTGGACGCCACC 

TCCTGGGAGACCACTCCGTACGAAGTGGCCCGCGGCATTAGTCAGGGTCTCGCTGACAACACCGTCATCTCCAAGGTGAACGGCGAGGTGTGGGA 

TTTGGACCGTGTGCTCGAGGGAAACTGCACCCTGCAGCTTCTCAAGTTCGACGACCCTGAGGCGCAGGCCGTCTTCTGGCACAGCTCTGCCCACA 

TCATGGGTGAGGCCATGGAGCGCATCTATGGTGGACACCTGTGCTACGGTCCGCCCATTGAAAACGGCTTCTACTACGACATGCATCTGGAGGGC 

GAGGGTGTAAGTAATTTGTGATGTTCAGTAAGAATTAAATTAAATTGAACGCCCAATTAGATCTCCACCAACGACTACGGCGCCATGGAGGGTCT 

GGTTAAGCAAATTGTCAAGGAGAAGCAGAACTTCGAACGTCTGGAGATGAAGAAGTCCGACCTGCTGGAGATGTTCAAATACAATGAGTTCAAGG 

TGCGCATTCTCAACGAGAAGGTGACCACCGACCGCACCACTGTCTACAAGTGCGGTTCTTTGATCGATCTCTGCCGCGGACCTCATGTTCGTCAC 

ACCGGCAAGGTGAAGGCCCTGAAGATCACCAAGAACTCCTCTACCTACTGGGAGGGTAAGGCTGACGCTGAGACACTGCAACGCGTCTACGGCAT 

CTCGTTCCCCGACCCGAAGCAGCTCAAGGAGTGGGAGAAACTTCAGGAGGAGGCGGCTAAGCGTGATCACCGCAAAATCGGTCGCGAGCAGGAGC 

TGTTCTTCTTCCATGAACTTTCGCCTGGCTCATGCTTTTTCCAGCCGCGAGGAGCGCACATCTACAACACCCTGATGGGCTTCATCAAGGCGGAA 

T AT AG AAAGC G TGG T T T C C AAG AAGT CATTTCCCC C AAC AT C T AC AAC G C TAAGT T GTGGATGACCTCCGGCCACTGGCAGCAC T AT GC CGAG AA 

CATGTTCTCCTTCGAGGCCGAGAAGGAGAAGTTCGCCCTCAAGCCCATGAATTGTCCAGGACATTGTCTCATCTTCGATAACCGCAATCGATCCT 

GGCGAGAGCTGCCACTGCGAATGGCCGACTTCGGTGTGCTTCATCGCAACGAGCTCTCCGGTGCTTTAACCGGATTAACTCGCGTGCGTCGCTTC 

CAGCAGGACGATGCGCACATCTTCTGTGCCCCCGAGCAGATTAAAAGCGAAATGAAGGGCTGCCTGGAGTTCCTGAAGTATGTGTATACCATTTT 

CGGATTCTCCTTCCAGCTGGTGCTGTCCACACGCCCGGATAATTACCTTGGCGAGTTGGAACAGTGGAATGATGCGGAAAAGGCGCTGGCTGAAT 

CACTCAACGAGTTTGGAATGCCGTGGAAGGAGAACCCCGGCGATGGCGCCTTCTACGGACCGAAGATTGATATTACCATTATGGATGCACTGAAG 

CGTGCCCATCAGTGCGCCACCATCCAGTTGGACTTCCAGCTGCCAATTCGTTTTAATCTCAGCTACATCGCCGACGATGGCGAGAAGAAGAGGCC 

GGTGATCATCCATCGCGCTATTCTTGGCTCAGTGGAGCGGATGATTGCCATTCTCACGGAGAACTTTGCTGGCAAGTGGCCCTTCTGGCTATCAC 

CGCGCCAGGTGATGGTCGTTCCAGTGGGTCCTGCCTATGATCAGTACGCCCAGTCAGTGCGGGATCAGTTGCACGATGCTGGATTCATGAGCGAG 

GCGGATTGCGATGCCGGCGACACGATGAACAAGAAGATACGCAATGCTCAGTTGGCCCAGTTCAACTTCATTCTGGTGGTGGGCGACAAGGAGCG 

CTCCTCGAATACCGTTAACGTGCGCACTCGGGACAACAAGGTGCACGGCGAGGTCTCCGTGGCGGAACTCATCACAAAGCTGCAAAAGATTCGCG 

ACGAATTCATCGCCAACGAGGACAGCTTCTAAGACTCTATTTCGAACTATTTATCAAAAGTATACTTAACATTTGCATAGCAAAGCAAGTACGCA 

TTTTTGTTCTTTCTATGTATTTACGCTTACGAAACCGGAATTCACAAATAAAAAACCAAATCCAAACACATTTTAAGGTGTTTCAAATAGAAGAG 

AAGAGCATGCAATCAAATAAAAGCAATTCGATTCAATATACGATTATTAAGCTTCCCCAACTAAAAATCGTATTTTTCTGATCCACCTTAAGCCG 

CCTATTTCACTCCAGTTTCTTGACGAATCAGTGGAATTTGTCAGCAATTGAGGACTAGTCATGCCGACACAGAGGTGCGCTGTAGTCCATTCAGG 

AACGATTTGGAATGATAAGATAAGTCGGCCCACATTTGGCGGAAAACTTTTTGGGAAAGGTGTGCGTGCACCCACAACGTATGAGTCATGTTTTG 

AATTTATTTTTTTGTATTAATTTATTGCTAGTATCTTGTATACTTTTCATTTTCATAATTTGATTTATTAAAAATTGCATCAATAGTTTATGTTT 

TTATTCAATTATCAATTTTGTAAGCTTTTGCTTTTCGCGTAAAAAATGCACAAAGTTCTCTCGCTAATCGAAGCAAAAATCGTTAATAATATTGT 

TAAGTGTTTGCTCTGCTCGTGCTCTGCTTATCTGTTTCAGTTTTAGTCATAAAAATACATATTTTATATTTGTGTATATTTATTCATAAGATGCA 

CGTGGCCAAGTCCTATGCCATCCACAGTTCCAGTTAGCAGTCGCGTTCGCTGTGCTAAAAAAAAAACCGAAATCCTAACCAAAACTGGCGAAACA 
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AAAGCCGCAAACAGAACAGAACGGGGGCTAAAATTGGAAACAAAATGCCTGGCTGGTGGAATCATTGAACACAGACTGGCTGCGTGGTATCTACG 
ACTGCTGGAGATGCCGAGAGCTGGAATGTCGGAGGATCCTGGGACCGCCTGGCCTGACATAAGTACATGGTCTTGGGTTTTT 

(SEQ ID WO: 589) 

Exon: 1001.. 1094 
Exon: 1752.. 1812 
Exon: 1948., 2381 
Exon: 2436. .4022 
Start ATG: 1761 

Transcript No. : CT16980 

TAACACTGTGCCCCTTTTTGATAAATTTGTTTCGTAAAAAATGGGCGACAACGTGGTGACGGAACTAAATAACTTGGATTTGAACAATCAAAAGG 
CTGCCAAGATGAAGAAGGAGAAAAAGGAGAAGCCATCCGGCGGTGGAGATACTCGCAAGGAACTCAGTCCCTTGCCCAAGTACATCGAGGAGCGC 
AATGTCTTCTGGGAGAAGTGCAAGGCGGAATACGAGGCCGAACTGGCGGCCAAGAAGCGTGAGCCCATCAAGGTGACACTGCCCGATGGCAAGCA 
GGTGGACGCCACCTCCTGGGAGACCACTCCGTACGAAGTGGCCCGCGGCATTAGTCAGGGTCTCGCTGACAACACCGTCATCTCCAAGGTGAACG 
GCGAGGTGTGGGATTTGGACCGTGTGCTCGAGGGAAACTGCACCCTGCAGCTTCTCAAGTTCGACGACCCTGAGGCGCAGGCCGTCTTCTGGCAC 
AGCTCTGCCCACATCATGGGTGAGGCCATGGAGCGCATCTATGGTGGACACCTGTGCTACGGTCCGCCCATTGAAAACGGCTTCTACTACGACAT 
GCATCTGGAGGGCGAGGGTATCTCCACCAACGACTACGGCGCCATGGAGGGTCTGGTTAAGCAAATTGTCAAGGAGAAGCAGAACTTCGAACGTC 

T G GAG AT G AAG AAGT CC G AC C T GC T GG AG AT GT T C AAAT AC AAT GAGT T C AAGGT GC GC AT T C T C AAC G AG AAGGT G AC C AC CG AC CGCACCACT 
GTCTACAAGTGCGGTTCTTTGATCGATCTCTGCCGCGGACCTCATGTTCGTCACACCGGCAAGGTGAAGGCCCTGAAGATCACCAAGAACTCCTC 
TACCTACTGGGAGGGTAAGGCTGACGCTGAGACACTGCAACGCGTCTACGGCATCTCGTTCCCCGACCCGAAGCAGCTCAAGGAGTGGGAGAAAC 
TTCAGGAGGAGGCGGCTAAGCGTGATCACCGCAAAATCGGTCGCGAGCAGGAGCTGTTCTTCTTCCATGAACTTTCGCCTGGCTCATGCTTTTTC 
CAGCCGCGAGGAGCGCACATCTACAACACCCTGATGGGCTTCATCAAGGCGGAATATAGAAAGCGTGGTTTCCAAGAAGTCATTTCCCCCAACAT 
CTACAACGCTAAGTTGTGGATGACCTCCGGCCACTGGCAGCACTATGCCGAGAACATGTTCTCCTTCGAGGCCGAGAAGGAGAAGTTCGCCCTCA 
AGCCCATGAATTGTCCAGGACATTGTCTCATCTTCGATAACCGCAATCGATCCTGGCGAGAGCTGCCACTGCGAATGGCCGACTTCGGTGTGCTT 
CATCGCAACGAGCTCTCCGGTGCTTTAACCGGATTAACTCGCGTGCGTCGCTTCCAGCAGGACGATGCGCACATCTTCTGTGCCCCCGAGCAGAT 
TAAAAGCGAAATGAAGGGCTGCCTGGAGTTCCTGAAGTATGTGTATACCATTTTCGGATTCTCCTTCCAGCTGGTGCTGTCCACACGCCCGGATA 
ATTACCTTGGCGAGTTGGAACAGTGGAATGATGCGGAAAAGGCGCTGGCTGAATCACTCAACGAGTTTGGAATGCCGTGGAAGGAGAACCCCGGC 
GATGGCGCCTTCTACGGACCGAAGATTGATATTACCATTATGGATGCACTGAAGCGTGCCCATCAGTGCGCCACCATCCAGTTGGACTTCCAGCT 
GCCAATTCGTTTTAATCTCAGCTACATCGCCGACGATGGCGAGAAGAAGAGGCCGGTGATCATCCATCGCGCTATTCTTGGCTCAGTGGAGCGGA 
TGATTGCCATTCTCACGGAGAACTTTGCTGGCAAGTGGCCCTTCTGGCTATCACCGCGCCAGGTGATGGTCGTTCCAGTGGGTCCTGCCTATGAT 
CAGTACGCCCAGTCAGTGCGGGATCAGTTGCACGATGCTGGATTCATGAGCGAGGCGGATTGCGATGCCGGCGACACGATGAACAAGAAGATACG 
CAATGCTCAGTTGGCCCAGTTCAACTTCATTCTGGTGGTGGGCGACAAGGAGCGCTCCTCGAATACCGTTAACGTGCGCACTCGGGACAACAAGG 
TGCACGGCGAGGTCTCCGTGGCGGAACTCATCACAAAGCTGCAAAAGATTCGCGACGAATTCATCGCCAACGAGGACAGCTTCTAA 

(SEQ ID NO: 590) 
Start ATG: 104 

MKKEKKEKPSGGGDTRKELSPLPKYIEERNVFWEKCKAEYEAELAAKKREPIKVTLPDGKQVDATSWETTPYEVARGISQGLADNTVISKVNGEV 
WDLDRVLEGNCTLQLLKFDDPEAQAVFWHSSAHIMGEAMERIYGGHLCYGPPIENGFYYDMHLEGEGISTNDYGAMEGLVKQIVKEKQNFERLEM 
KKSDLLEMFKYNEFKVRILNEKVTTDRTTVYKCGSLIDLCRGPHVRHTGKVKALKITKNSSTYWEGKADAETLQRVYGISFPDPKQLKEWEKLQE 
EAAKRDHRKIGREQELFFFHELSPGSCFFQPRGAHIYNTLMGFIKAEYRKRGFQEVISPNIYNAKLWMTSGHWQHYAENMFSFEAEKEKFALKPM 
NCPGHCLIFDNRNRSWRELPLRMADFGVLHRNELSGALTGLTRVRRFQQDDAHIFCAPEQIKSEMKGCLEFLKYVYTIFGFSFQLVLSTRPDNYL 
GELEQWNDAEKALAESLNEFGMPWKENPGDGAFYGPKIDITIMDALKRAHQCATIQLDFQLPIRFNLSYIADDGEKKRPVIIHRAILGSVERMIA 
ILTEKFAGKWPFWLSPRQVMVVPVGPAYDQYAQSVRDQLHDAGFMSEADCDAGDTMNKKIRNAQLAQFNFILVVGDKERSSNTVNVRTRDNKVHG 

EVSVAELITKLQKIRDEFIANEDSF* 
{SEQ ID NO: 591) 

Name: threonyl-tRNA synthetase 
Classification: enzyme 



Celera Sequence No. : 142000013384663 

ATAGCAGTCGAATGGAGACGGCACAATGCCCACTGCGTTCTGCAGGAACTTCAAGCTTTTGTAGAGCTTCTCCGAGTTCATGGTGATGCCACCCA 
T G AC C AC AT C C GT AT G AC C GT T C AT GT AC T T G GT C AGGG AGT AG C AG AC C AG AT C AGC GC C C AGCT C C AAGG GT C GC T GG AAG TAG GAG GT C AGG 
AAGGTGTTGTCGACGGCCAGGACGATATCCTCGCGTACTCCATGGACCAGCTGCGCAATAGCCTCGATGTCGGCTACCTTTACCAATGGATTAGT 
TGGTGACTCGATCCACACCAACTTGGTCTCCGGCTTGATGGAACTTTTAATTAGATCCAACTTCGTGGGATCCACAAAGGTGGCTGAGATTCCCA 
GACGGGTGGCAACCTGTCGGATCAAACGGTTGGTGCCTCCGTAAACATCGTCGCCCATGATGATGTGATCGCCGCTGCTCAGCATAGTTAGCACA 
GCGGTGGTGGCTCCCAATCCCGATGAGAACGTGAGGCCGTATTTGGCATTATCCAAGGCGGCGAAGCACGTCTCCAGCACGTTCCTAGTGGGATT 
TCCACTGCGGGAGTACTCATAGCCCCTGTGCTCGCCCGGTGCATCTTGCTTGAAGGTGGTGCTCAGCGATATCGGCGGAATCACGGCAGCACTCT 
TCCACTGGTCGGGGCTCTGGCCCGAGTGGATGGACTTGGTGGCGAAGCCGCTCGGCTGGACTCTGTAGCTCATCTTGTCTGGGTAGGAGGATCGT 
TGGGAACTGACTGACACGTGGAGAGAGACCGTCGCGAATATATATACTCCTGCCAGATGTTATCGCCTAGCCGGTAATTAAATATTTCGACCGTT 
ACAACACTACATTCGCCGACAGTGATGGAGCTCTAGCTATTAAACGGCTAGAAAACGGAACCCTTCAAAAGTAACGATACATATATGCATTGAAA 
CGATGGATATTCTTACTTTCCTTAGAACTTCTTCGATTATTTATACAATAATTTATTTGTTGATAAAGCACAATAATATTACAATCGCAGCTTAA 
GAGTAAATACGATACTGAACAGAAAGGGACATGCGGAATGGGTTGGGAACTATACTCCAAGGTCTACCACAGTTTGTCGCTGAACACGATCTCCT 
TGAAGATGCAATCCTGCTTGGCGATGCTGACCATTGCCTTCTTCGACTTCATGGCATCCTGCACCTTCTCGCGCACCTCGGCCGCCTTCACATCG 
TTGCGAATGTAAAAGAGCCGTACCAGGTGGTCCTCGTTCAGAGACGGATAGCTGCCCAGCAAGGTGTGCAGGTCCAAGACGAGCAGTTCGATATC 
CGAGCTAAGCAGTGCCGATAGCGTGGATATAAGATCCAATGGAGAGTCGCTGAGGGAAATCTTCGGCGCGATTTTCTCAAAGAAGAGCTTAAACC 
GCTTGGCTTCGGTCTTGATTTTTCGCGTAATGGCATCGCACTCGGCCCGTGGCTTAGACAGTCGCTTGGATAGCAGTGCCCTAATGTAGCGCTTG 
GCCAGCAGCTTCTGGGCCTCGTTGATGACCATCTCGAAGTTATTGGGTCGCAGGTGATTATAGTCCTGGAAGTAATCGTCCAGCGTTACACAAAT 
GGTGTCCACGGCAATGTTACTGGCCAGCCACTTGGCAGTGAATAGGTCGTTAAAGTGACACTCCATGTCCAGGAAGGCTTCCTCC AGC AAGT AAC 
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TTGCGGCATGAGCCCGGATCCGCTGGAATGTGGCAAGCAAACGTTCGAAATCCTCATAATGCTCAGTACGTGACTTGGGCCAGTAGAGTTGTTTC 
ATCTGCTGCGCCAGTTCAACCATTTGCTGACTGTTATTCACGATGGTGATGATGTAGTGAGTGAAGTATTTGATCTGGTCCCGATTGCGAAAATG 
GTGTTCCTTCAGCTCGATTACGTTCTTAAGATATGTCTGGCCAAAGATCTCCACCTGCTGAATGCTCATTACCAACGCCTTGAACGTCAGTTCCT 
GATGAATTGTGTTGGTCACCTGCAGATGCTGGTCGATCATCTGGAATATGATGACAGGCGCTGAAGTGTGGTAGTACTCCTCATCCTGATCGGGC 
ACCGTCTCAGTAAACCACTCCTGCTTCTCCGTTTCTGCGGCCTTGGTCATCCATTCCTGGAAGTTTCTCTGCATATTTTGCAGATATTCATCTTC 
CAGCGCTTTGAGATGTTCCGGACGCAGCAGAGTACCTATCTGTCGGTGTACATCCACATTAAGATCCGGATGGGACATCAGCTCGACGCCAGGAT 
AGGTGTGTGTAACCCAAGCCATCATTGATACATATTCATTGCCCTCCAAGCCCGAGCGCACAATGTTGTCAAGCTGGAATTAGTAAATATTAGTG 
GTATAAAAACATTAATCAATTTAAATGATTGACGATTACTCACATAGCTAGAAAGTCCTTCGTGATAGAACTTGACATATTCGCCGAAAATGTCG 
TAGTGCGGCGGGAAGCAGGGCACACAAAGGGACTTCACAACGCGCAGATCCTCGAGAATGATTTGACGCAATATCTCCAGGTCTCGTACTAACCA 
CATCTTGTTATCGGCCCGCTCCTCTAGCTTGGATCCCTCTATGCGCGTTATCACCGACTGTTGTAGCACGTCCATGATCATGCGGCGCCAAGCCT 
TTGGTCTTCCAGGCGGCAGGAATCCTGTGACCTTTTGCTGCTGCAACGCGAACTGGTCGTTTTTCTCTTCCCGCTCTATGATGCGTAGGGCGGTA 
ACGATGATGGTGGGCTTCTTGCGCAGAGTGTTCAGCGTGCGGCTAAGAATGAGTCGCAGCTTTTTCTCCAGCTCTTGGGATACCGTGTCAACCTT 
CTCGAAGTGACGCTTTAGGGTGATCTTGTCGGACGCATGCTGCTTGGGCTGCTTGTGCAGTTCGTAGAGAAGATCATCCCGCGAATTCTCCAGAT 
CCGCCAGGCACTGATGGGCATTAAGCAGCTTATCATCCTCGATCAGCGCCATCGTCTTTTGTACGCTGGCATCTACGTTGAAGATGTGCTTCAGA 
TTCTCCATGGCGGTGGCGTACTGGGAGTGCTTTGTGTTTTCCTCCCGCACAACTTCCAAAGCGTCGTAGACTTCGGGAACGCCCCTCAGCAGGCG 
CTCCACCTCGTCCATTCGACGCCTTACCTCGCGCACGTCCTGCATGCAGGTCTCCAACTGCTTAAGTCCCACCCTGACCCCGTCCAGCTGTCCTT 
GCATTCCGGTCTTAAGGAGCGCCTCAACGGATGCCTTCTTCCGGGCGATTCGGTGGCGATATTGTTCCACCTTCTCCAGTTGACCTGGTCGCTGC 
AACATGTTCTGTATGTCCTTGAGCGCTGCCTGGCGAGCCTGCTCCTCCAGCTGCTGTAAATCCATCTTCAATTTATTTGGATCACTTTCAGTCTC 
TGTTTTGGTCAATCTTCGTCAGCAGTGGAAGTGCTGGTGTGACCATATAGCAGTGCGGCTGACAATCTCACTTCCAAGAACTACAGTGTAACTTG 
ACAATAAGCTATATTCTACTCTTAAAATTATTATAGTTTTATTGTGTCTTAAAATTTTTATTTTATTCTTCAAGCTTAAAAAATTATGAAGTTAA 
AGTATATCTTTTAGTTTGCCAAAGCAACAGAGGATTCGACCTAAGTATGGAAAACTGTACTTATCGATAAGTTATCTTCTCATTGGTCTATCGCC 
ATCGAGAGCTTTGTTTGTTTATAATTTGCAGCCAAACGTTTGAACGTAAACAACGCCCTAAGCAAAACATGAGCACGGAAAAAGAGATAATTTTG 
CAATTTGCTCCGTGGGAATCTTTTGTGTCGCCCACTTTCTGGCACAAGCTAGCGGAGCTTAAGCTCGACCACGATCGCCTGTCCGACTCGAAACG 
CTCCATTACTGGACACTACACAAATCGTAATGCAAGTGGATGCCTTTTGGAAGTAGACTACACGGCCTACAACAGGTGAGCTTGAAAACTACGAG 
TTTGATGAAACCAGCGCTTGACCACCCACATTCCAGAATGGCAAAGCCTCCAAAATTCAGCCATTCCGCTATAGGCACCATCTACAATAAGAACA 
CAATCGAAGAGTTCAAGGCCCTGGACAAATTACAACTGCTGGCCGATGAGGGCAAGGAACTGCTGGCTGATATGTGCAGTGGTGGCGCCTTGAGA 
GATCCCAGTCTATTGACCCGGTTCTTTGTTCTCTCCTTTGCCGATTTAAAGTGTCACAGCTACTACTATTGGTTCGCCTTTCCGTGCCCGCTGAC 
GCCCACCTTAAAGCTTCAGGGAGCCGTCCAAAAACTGCGGGACTTGCCCAATAGTAGTAGCTATATAATGGCTCTAAAGGCCTTACCCACTGAGT 
C AC AG AACT T CTTCATTCTGT AT GCT AAT GT GG AG AAG AAC AT T T T C GAAGC CCGT AGT T T AAGT T CC C T T G AC GAT AAAAAT GT 
(SEQ ID NO: 592) 

Exon: 3455.. 3163 

Exon: 3060.. 2692 

Exon: 2622.. 2419 

Exon: 2353.. 1001 

Start ATG: 3390 (Reverse strand: CAT) 



Transcript No. : CT16986 

AGCACTTCCACTGCTGACGAAGATTGACCAAAACAGAGACTGAAAGTGATCCAAATAAATTGAAGATGGATTTACAGCAGCTGGAGGAGCAGGCT 
CGCCAGGCAGCGCTCAAGGACATACAGAACATGTTGCAGCGACCAGGTCAACTGGAGAAGGTGGAACAATATCGCCACCGAATCGCCCGGAAGAA 
GGCATCCGTTGAGGCGCTCCTTAAGACCGGAATGCAAGGACAGCTGGACGGGGTCAGGGTGGGACTTAAGCAGTTGGAGACCTGCATGCAGGACG 
TGCGCGAGTACGCCACCGCCATGGAGAATCTGAAGCACATCTTCAACGTAGATGCCAGCGTACAAAAGACGATGGCGCTGATCGAGGATGATAAG 
CTGCTTAATGCCCATCAGTGCCTGGCGGATCTGGAGAATTCGCGGGATGATCTTCTCTACGAACTGCACAAGCAGCCCAAGCAGCATGCGTCCGA 
CAAGATCACCCTAAAGCGTCACTTCGAGAAGGTTGACACGGTATCCCAAGAGCTGGAGAAAAAGCTGCGACTCATTCTTAGCCGCACGCTGAACA 
CTCTGCGCAAGAAGCCCACCATCATCGTTACCGCCCTACGCATCATAGAGCGGGAAGAGAAAAACGACCAGTTCGCGTTGCAGCAGCAAAAGTCG 
GTGATAACGCGCATAGAGGGATCCAAGCTAGAGGAGCGGGCCGATAACAAGATGTGGTTAGTACGAGACCTGGAGATATTGCGTCAAATCATTCT 
CGAGGATCTGCGCGTTGTGAAGTCCCTTTGTGTGCCCTGCTTCCCGCCGCACTACGACATTTTCGGCGAATATGTCAAGTTCTATCACGAAGGAC 
TTTCTAGCTATCTTGACAACATTGTGCGCTCGGGCTTGGAGGGCAATGAATATGTATCAATGATGGCTTGGGTTACACACACCTATCCTGGCGTC 
GAGCTGATGTCCCATCCGGATCTTAATGTGGATGTACACCGACAGATAGGTACTCTGCTGCGTCCGGAACATCTCAAAGCGCTGGAAGATGAATA 

TCT GC AAAAT AT G C AG AG AAACT T C C A GG AAT G GAT GAC C A AG GC CGC AG AAACGG AG AAG C AG GAG TGGTTTACT GAG AC G GT G C C CG AT C AGG 
ATGAGGAGTACTACCACACTTCAGCGCCTGTCATCATATTCCAGATGATCGACCAGCATCTGCAGGTGACCAACACAATTCATCAGGAACTGACG 
TTCAAGGCGTTGGTAATGAGCATTCAGCAGGTGGAGATCTTTGGCCAGACATATCTTAAGAACGTAATCGAGCTGAAGGAACACCATTTTCGCAA 
TCGGGACCAGATCAAATACTTCACTCACTACATCATCACCATCGTGAATAACAGTCAGCAAATGGTTGAACTGGCGCAGCAGATGAAACAACTCT 
ACTGGCCCAAGTCACGTACTGAGCATTATGAGGATTTCGAACGTTTGCTTGCCACATTCCAGCGGATCCGGGCTCATGCCGCAAGTTACTTGCTG 
GAGGAAGCCTTCCTGGACATGGAGTGTCACTTTAACGACCTATTCACTGCCAAGTGGCTGGCCAGTAACATTGCCGTGGACACCATTTGTGTAAC 
GCTGGACGATTACTTCCAGGACTATAATCACCTGCGACCCAATAACTTCGAGATGGTCATCAACGAGGCCCAGAAGCTGCTGGCCAAGCGCTACA 
TTAGGGCACTGCTATCCAAGCGACTGTCTAAGCCACGGGCCGAGTGCGATGCCATTACGCGAAAAATCAAGACCGAAGCCAAGCGGTTTAAGCTC 
TTCTTTGAGAAAATCGCGCCGAAGATTTCCCTCAGCGACTCTCCATTGGATCTTATATCCACGCTATCGGCACTGCTTAGCTCGGATATCGAACT 
GCTCGTCTTGGACCTGCACACCTTGCTGGGCAGCTATCCGTCTCTGAACGAGGACCACCTGGTACGGCTCTTTTACATTCGCAACGATGTGAAGG 
CGGCCGAGGTGCGCGAGAAGGTGCAGGATGCCATGAAGTCGAAGAAGGCAATGGTCAGCATCGCCAAGCAGGATTGCATCTTCAAGGAGATCGTG 
TTCAGCGACAAACTGTGGTAGACCTTGGAGTATAGTTCCCAACCCATTCCGCATGTCCCTTTCTGTTCAGTATCGTATTTACTCTTAAGCTGCGA 

TTGTAATATTATTGTGCTTTATCAACAAATAAAT 
(SEQ ID NO: 593) 

Start ATG: 66 {Reverse strand: CAT) 

MDLQQLEEQARQAALKDIQNIXILQRPGQLEKVEQYRHRIARKKASVEALLKTGMQGQLDGVRVGLKQLETCMQDVREYATAMENLKHIFNVDASVQ 
KTMALIEDDKLLNAHQCLADLENSRDDLLYELHKQPKQHASDKITLKRHFEKVDTVSQELEKKLRLILSRTLNTLRKKPTIIVTALRIIEREEKN 
DQFALQQQKSVITRIEGSKLEERADNKMWLVRDLEILRQIILEDLRVVKSLCVPCFPPHYDIFGEYVKFYHEGLSSYLDNIVRSGLEGNEYVSMM 
AWVTHTYPGVELMSHPDLNVDVHRQIGTLLRPEHLKALEDEYLQNMQRNFQEWMTKAAETEKQEWFTETVPDQDEEYYHTSAPVIIFQMIDQHLQ 
VTNTIHQELTFKALVMSIQQVEIFGQTYLKNVIELKEHHFRNRDQIKYFTHYIITIVNNSQQMVELAQQMKQLYWPKSRTEHYEDFERLLATFQR 
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IRAHAASYLLEEAFLDMECHFNDLFTAKWLASNIAVDTICVTLDDYFQDYNHLRPNNFEMVINEAQKLLAKRYIRALLSKRLSKPRAECDAITRK 
IKTEAKRFKLFFEKIAPKISLSDSPLDLISTLSALLSSDIELLVLDLHTLLGSYPSLNEDHLVRLFYIRWDVKAAEVREKVQDAMKSKKAMVSIA 

KQDCIFKEIVFSDKLW* 
{ SEQ ID NO: 594) 

Name: sec6 -like 
Classification: transporter 



Celera Sequence No. : 142000013384663 

TTAATTCTCATTTTGTCATTCAGCTGAAGCTGCTTGGCTATCTAATGGGTACGATATTAATTGGAGTGGCTGGCGTCGCTATTTATCGTTACAAG 
TACTAAGGACAAGCATTGCACACGAACTACTGGTTATTTTAAGCAATTGAAATGCATTTTTTAAGTATATATAGAAGGCAAAATTATGAATGTTG 
GAAAGTCGCAGTTGCTTATTATGATGATATGGCTCTCGTTGTACATTTAATGCTATGCACACTTCACAATAGTATGTATCTTTCAAATAGTATGG 
TTTTGGATATCGTACATATAATGTTAAAGCTTAAGTTTAATCAACTACATTCTGTCAGACAAGTAACGGGAATTGTTAAAATAAACACAATATTC 
T CAT CAT TAT GT T T T AT AC AAT T T T C AAACGAAAT AAAGC AAC AC AAT C G AAAT C AG AC CAT AT T T AT GAAC T T GC AAT AAGGT T C GAAAG AAAG 
CATATGAAATACCATATGAATGTAATGATACTGATAAGAGCAGCACTGATAAGAACTCCTAACTTGATTTTTTTTGTTCGTTACTTAAATAATGC 
ATGATTTAACTAAACTCTGCTCTAATCTAAAAGGCAAACGAACTTTTGTAAAAACAAATCGGAACATCCGTTCAAGAGATATAATTTTCTTCTAT 
CGCGACTTAGAATGATTAAATGATGGTGCGAATTTAAACATGCTGATTCATTGTTTGCGTATGGTAATCGACATGACAATGGGTATATTTGGGTC 

T C C C AAGGC T GT T T C TAT GGG GGGT C GT G GAG GT AAT GC AGC T G C GAT C G T AT AT AAAC C TAT C AAG AC AT C AAC AAT T T T C AGT AT CT G AT C A C 
TATATCAACAATAATTGTGAAAAAAAAACCAGCAATAACACAAAGCTTTATTTACCACATCTTATCAGTTCGCCACAATCTGTAATATCTATTCG 
ATCTATCCAGAAAGTCTATATGTTTACAAACGTATGTTTATAAGCAAGTGTTTATAAATATCTTTCATTTGAGACAAAACACTTTTATTGGGGTG 
AT T AAT AT AC AAAT T C AT AAGC T T T C GGT T AAAC AAAT T C CT GC G AAAAAT AAAAAAAAT AC AT GT G AT GC G AGT AAC T T AGGC C TT C GAT G C AA 
TTTCAAGGGCCTGCTCCAGATCCTTGATCAAATCATCGGCATCCTCCAGTCCGACGGACAAGCGAACTAGACCGTCAGTGATGCCCAAAGTTTTC 
CTATCCTCCGCTGGAACCGAGGCATGGGTCATTATTGATGGAAGCTCGGCCAGACTCTCGTAGCCACCCAGACTCTCTGCCAGGGTGAACACCTT 
GAGGGCCTTGAGGAAAGCCGAGGAGTGCTTCAGCTCGCCCTTAATGTAGAAGGAGAACACGCCGCTGTATCCGTATGCCTGCTTCAGAGCGATCT 
TGTGCTGCGGATGGGAGGGCAAAGAGGGATGCAACACCTTCTCCACGAACGGATTCGTCTCCAGGTATTTGGCAACCTTCAGAGCATTCTTCTGG 
TGCTGTTCCATGCGCAGCGAGAGCGTCTTAAGACTTCTGTTCACCTGATAGCAGTCGAATGGAGACGGCACAATGCCCACTGCGTTCTGCAGGAA 
CTTCAAGCTTTTGTAGAGCTTCTCCGAGTTCATGGTGATGCCACCCATGACCACATCCGTATGACCGTTCATGTACTTGGTCAGGGAGTAGCAGA 
CCAGATCAGCGCCCAGCTCCAAGGGTCGCTGGAAGTAGGAGGTCAGGAAGGTGTTGTCGACGGCCAGGACGATATCCTCGCGTACTCCATGGACC 
AGCTGCGCAATAGCCTCGATGTCGGCTACCTTTACCAATGGATTAGTTGGTGACTCGATCCACACCAACTTGGTCTCCGGCTTGATGGAACTTTT 
AATTAGATCCAACTTCGTGGGATCCACAAAGGTGGCTGAGATTCCCAGACGGGTGGCAACCTGTCGGATCAAACGGTTGGTGCCTCCGTAAACAT 

C GT C GC C C AT GAT G AT G T GAT C GC CGC T GCT C AG CAT AGT TAG C AC AGCGGT GGT G GC T C CC AAT C CC GAT G AGAAC GT G AGG C C GT AT T T GG C A 
TTATCCAAGGCGGCGAAGCACGTCTCCAGCACGTTCCTAGTGGGATTTCCACTGCGGGAGTACTCATAGCCCCTGTGCTCGCCCGGTGCATCTTG 
CTTGAAGGTGGTGCTCAGCGATATCGGCGGAATCACGGCAGCACTCTTCCACTGGTCGGGGCTCTGGCCCGAGTGGATGGACTTGGTGGCGAAGC 
CGCTCGGCTGGACTCTGTAGCTCATCTTGTCTGGGTAGGAGGATCGTTGGGAACTGACTGACACGTGGAGAGAGACCGTCGCGAATATATATACT 
CCTGCCAGATGTTATCGCCTAGCCGGTAATTAAATATTTCGACCGTTACAACACTACATTCGCCGACAGTGATGGAGCTCTAGCTATTAAACGGC 

T AGAAAAC GG AAC CC T T C AAAAG T AAC GAT AC AT AT AT GC ATT G AAAC GAT GG AT AT TCTTACTTTCCT TAG AAC T T C T T CG AT TAT T T AT AC AA 
TAATTTATTTGTTGATAAAGCACAATAATATTACAATCGCAGCTTAAGAGTAAATACGATACTGAACAGAAAGGGACATGCGGAATGGGTTGGGA 
ACTATACTCCAAGGTCTACCACAGTTTGTCGCTGAACACGATCTCCTTGAAGATGCAATCCTGCTTGGCGATGCTGACCATTGCCTTCTTCGACT 
TCATGGCATCCTGCACCTTCTCGCGCACCTCGGCCGCCTTCACATCGTTGCGAATGTAAAAGAGCCGTACCAGGTGGTCCTCGTTCAGAGACGGA 
TAGCTGCCCAGCAAGGTGTGCAGGTCCAAGACGAGCAGTTCGATATCCGAGCTAAGCAGTGCCGATAGCGTGGATATAAGATCCAATGGAGAGTC 
GCTGAGGGAAATCTTCGGCGCGATTTTCTCAAAGAAGAGCTTAAACCGCTTGGCTTCGGTCTTGATTTTTCGCGTAATGGCATCGCACTCGGCCC 
GTGGCTTAGACAGTCGCTTGGATAGCAGTGCCCTAATGTAGCGCTTGGCCAGCAGCTTCTGGGCCTCGTTGATGACCATCTCGAAGTTATTGGGT 
CGCAGGTGATTATAGTCCTGGAAGTAATCGTCCAGCGTTACACAAATGGTGTCCACGGCAATGTTACTGGCCAGCCACTTGGCAGTGAATAGGTC 
GTTAAAGTGACACTCCATGTCCAGGAAGGCTTCCTCCAGCAAGTAACTTGCGGCATGAGCCCGGATCCGCTGGAATGTGGCAAGCAAACGTTCGA 

AATCCTCATAATGCTCAGTACGTGACTTG 
(SEQ ID NO: 595) 

Exon: 2354.. 1001 

Start ATG: 2305 {Reverse strand: CAT) 
Transcript No. : CT17004 

CTCTCTCCACGTGTCAGTCAGTTCCCAACGATCCTCCTACCCAGACAAGATGAGCTACAGAGTCCAGCCGAGCGGCTTCGCCACCAAGTCCATCC 
ACTCGGGCCAGAGCCCCGACCAGTGGAAGAGTGCTGCCGTGATTCCGCCGATATCGCTGAGCACCACCTTCAAGCAAGATGCACCGGGCGAGCAC 
AGGGGCTATGAGTACTCCCGCAGTGGAAATCCCACTAGGAACGTGCTGGAGACGTGCTTCGCCGCCTTGGATAATGCCAAATACGGCCTCACGTT 
CTCATCGGGATTGGGAGCCACCACCGCTGTGCTAACTATGCTGAGCAGCGGCGATCACATCATCATGGGCGACGATGTTTACGGAGGCACCAACC 
GTTTGATCCGACAGGTTGCCACCCGTCTGGGAATCTCAGCCACCTTTGTGGATCCCACGAAGTTGGATCTAATTAAAAGTTCCATCAAGCCGGAG 
ACCAAGTTGGTGTGGATCGAGTCACCAACTAATCCATTGGTAAAGGTAGCCGACATCGAGGCTATTGCGCAGCTGGTCCATGGAGTACGCGAGGA 
TATCGTCCTGGCCGTCGACAACACCTTCCTGACCTCCTACTTCCAGCGACCCTTGGAGCTGGGCGCTGATCTGGTCTGCTACTCCCTGACCAAGT 
ACATGAACGGTCATACGGATGTGGTCATGGGTGGCATCACCATGAACTCGGAGAAGCTCTACAAAAGCTTGAAGTTCCTGCAGAACGCAGTGGGC 
ATTGTGCCGTCTCCATTCGACTGCTATCAGGTGAACAGAAGTCTTAAGACGCTCTCGCTGCGCATGGAACAGCACCAGAAGAATGCTCTGAAGGT 
TGCCAAATACCTGGAGACGAATCCGTTCGTGGAGAAGGTGTTGCATCCCTCTTTGCCCTCCCATCCGCAGCACAAGATCGCTCTGAAGCAGGCAT 
ACGGATACAGCGGCGTGTTCTCCTTCTACATTAAGGGCGAGCTGAAGCACTCCTCGGCTTTCCTCAAGGCCCTCAAGGTGTTCACCCTGGCAGAG 
AGTCTGGGTGGCTACGAGAGTCTGGCCGAGCTTCCATCAATAATGACCCATGCCTCGGTTCCAGCGGAGGATAGGAAAACTTTGGGCATCACTGA 
CGGTCTAGTTCGCTTGTCCGTCGGACTGGAGGATGCCGATGATTTGATCAAGGATCTGGAGCAGGCCCTTGAAATTGCATCGAAGGCCTAAGTTA 
CTCGCATCACATGTATTTTTTTTATTTTTCGCAGGAATTTGTTTAACCGAAAGCTTATGAATTTGTATATTAATCACCCCAATAAAAGTGTTTTG 

T C T C AAAT GAAAG AT AT T T A T AAA 
(SEQ ID NO: 596) 

Start ATG: 50 (Reverse strand: CAT) 
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MSYRVQPSGFATKSIHSGQSPDQWKSAAVIPPISLSTTFKQDAPGEHRGYEYSRSGNPTRNVLETCFAALDNAKYGLTFSSGLGATTAVLTMLSS 
GDHIIMGDDVYGGTNRLIRQVATRLGISATFVDPTKLDLIKSSIKPETKLVWIESPTNPLVKVADIEAIAQLVHGVREDIVLAVDNTFLTSYFQR 
PLELGADLVCYSLTKYMNGHTDVVMGGITMNSEKLYKSLKFLQNAVGIVPSPFDCYQVNRSLKTLSLRMEQHQKNALKVAKYLETNPFVEKVLHP 
SLPSHPQHKIALKQAYGYSGVFSFYIKGELKHSSAFLKALKVFTLAESLGGYESLAELPSIMTHASVPAEDRKTLGITDGLVRLSVGLEDADDLI 

KDLEQALEIASKA* 
(SEQ ID NO: 597} 

Name: ECDYSTEROID- INDUCIBLE POLYPEPTIDE EIP40 
Classification: enzyme 
Gene Symbol: Eip55E 
FlyBase ID: FBgn0000566 



Celera Sequence No. : 142000013384546 

CGCGCTTAGTACTCGATAACAAATCGTCGGCGGGACTGGACTACCGACTATTTCAACGAATCATTCGATTCCCGATTGCTTCCGTGAAATCGATA 
TTTTCTGACCCAGCTTCATGACATCCCTGTTGAGTTGAATACGCAAAGGTGGCAACCTTATACGGCGACTGAATAAAATAAAAGTGGAATTGTAA 
ATTATAAGTGTAAGCACTAGCAATGGCTGCCACCCCAAGCGCACCCGCCAAGTCCGAGATGATCGACCTGACCAAGTTGAGTCCGGAACAGCTGA 
TTCAGATCAAGCAGGAGTTCGAGCAGGTGCGTTCCGCATGTGGGTGTGTGATGAGTCAAGTGTGCTCGCGGAGGGGGTTTCTGGAGGAGGGGCGC 
CCCAATTGGCACATGCTTCTAAAATCGGTACATTTTTGTGATCCTTAGGAGATTACCAACGTACAGGACTCGCTGTCGACGCTTCACGGCTGTCA 
AGCCAAGTACGCCGGATCCAAGGAGGCGCTGGGAACCTTTCAACCCAATTGGGAGAACCGTCAGATCCTGGTGCCCCTGACCAGCAGTATGTATG 
TGCCAGGACGGGTCAAGGATCTCAACAGGTTCGTCATAGACATCGGCACCGGCTACTACATTGAAAAGGTGATTTACATTTCCAATCGAACCTAG 
AATATTTATTGATTGCCATCTGTTTCTCAAACAGGATCTGGAAGGCTCCAAAGACTACTTCAAGCGACGCGTGGAGTACGTGCAGGAGCAGATCG 
AGAAAATCGAAAAGATCCACCTGCAAAAGACGCGCTTCTATAACTCAGTGATGAGTGTGCTGGAGATGAAGCAGGCTGCTGCAGCGAAATTGCAG 

T C GC AGC AGC AAT CGC AGC C GGC GGT T AC C C AGAGCT C CT AG AC T AC GG AG AAC GAC AG AG AC AC GAG AC AC G GAAAGC ATT AAAT T AT T AAACT 
TATTAACGAGATGTGTGCATTTAATTAACGTAGTTTATTAAATTGTTCAGCTATTTCTTGGCTTCTGCCAAAGCCTTGGTGAGAACCTTGACCTT 

CT GC T C GT AC GC GGACT T AAG AT C CT G T AG GG AAAGAAG GT AAT AG T AT AT C AAG T AT AT C AT T GG AC T CGAGT T AT AAG GC AC C T T G AAT AT C T 
GTGACTGGGTGTCGTACTCCGAACGCAGCTTGTCATACACTCCTTTGGAGATGGCTAGTTTCTTCTCCGTGAGCACACACTTGTCTTGCTCCGCC 
TTGATCCACTGCTGGCTCTTCTCCAGACGAGCCTGCGTGTGATCACATTCGGCGGTCTTTAGCTGCAACATGTCCTTGGTCACCTGCAGCACATC 
CTTAATCGAGTCTACTTGTGTCCTAAGCACTGATATCTGCTGCTCCTGCTGTTTGTTCACCTTGGTCTTCTCGAACAGCAGGGTTTTGGCCTTTA 
GAAGCAGCTCCTCCTGCTCTTGTCGTGATTTGGGAGCATCTGTGCCGCCCAGCATGACGGACTGCTTGAGCGACTGCTCCAACTGGACCACTCTC 
CGCTTTAAACGCTCATTTTCCGCCTTGGCTTCCTGCAATCAGGGTCAGTAAGTGGGTGCAGCCCGTGGAGACGAGTAACTAACCTCAAAGTCTTT 
GCGGAGCGCTCCGATCCGTTTGCGAGACTCCTCTAGTAGTGCCTGTTTTTCGCTTTCCAACGTGAAAACCATGTCCAGCAGTTCGTTCTGCGGCT 
GCTCCGGCTGACGCACTTCCTCCTGCCCATTTGCTGTTATATTGTTGTTTAGGTCACCGCCAACAGCTTCATCTTCGTTATGATCGGCCATTTTA 
TGAAGAATTCTTTTTAAATGATAAAAATTCGTAGAGCAATGTGCGTTTTCCGTGCTTTTTGGGATGTGTTTTCTTTTTGTATTAATAGACCAAAT 
AGATGCACGCTCTATTTATAGGACATCCCACTCTCCGGATCGCTCTCTGCCTGCTCTTGGCCAGATCGCTCTCCCGAAATAACAAACTGTTTCCC 
T C C AATT T AGT T T AC GC C T CAG CAT AT T CT G AAG AT C AGT T T AT TAT AC AT T G AAG C ATGG AT G AG AAAGGATT GCG GAT GT G C GG T GT T AAT AC 
AGATCGCGAAGGGCAACACAATTAAATTGTTGTTACAATCAATGGGGCGACAATTGAAAGTGAGTGATTTGATTAAAATTAAATGCCTAAACAAG 
ATAGAAGAAATGTGCTTCGCTCGCGGTGAAGAATGTCTAACCGTCCAGCTCGCCGGCAGCGCATGCATCGGCGTAACTCTTGCCGCTCTTGTCCT 
CCGGATTCAGCTTGATCATGTCGTCAATACGCAAAATGGTAATGGCGGCCTCCGTGGCGAACTTCAGCGACTTGATCTTGGACATGGCGGGCTCC 
AAAACTCCAGCCTTCTTGTTGTCCCTTACCACTCCCTCGATCAGATCCAGACCGGTCCACTTAAGATCCGAACGCTCGGGCTTCGTCTGGCTGGA 
GTTGTGGTAGGAGCGGAGCTTGGCCACTAGATCAGTGGCATCCTTGGCCGCGTTCACGGACAGAGTCTTGGGAATGACCAGCAGCGACTTTGCGA 
ATTCGGCAATGGCCAGCTGTTCGCGGGAAGCCAGCGAGGTGGCAAAGTTCTCCAGGTAGATCGACAAAGCAGCCTCCACACAGCCGCCGCCGGCT 
ACAACCTTCTTGCTCTCCAGCACACGCTTCACCACGCACAGAGCGTCGTGGACGGAGCGCTCCATCTCATCACAGTAGAAGTCATTGGGACCACG 
CAAGATAATGGATGCGGCAGCACTAAAAAGGCATAGGGATACGATAAAGCAAACTATGTTACTGCATGATATAATCTGCACTTACCGTGCCTTAG 

TGCCCTTGATGAGGATAAGTTCAT 
(SEQ ID NO: 598) 

Exon: 1874.. 1604 
Exon: 1552.. 1130 
Exon: 1069. .1001 

Start ATG: 1801 (Reverse strand: CAT) 
Transcript No. : CT17018 

ACACATCCCAAAAAGCACGGAAAACGCACATTGCTCTACGAATTTTTATCATTTAAAAAGAATTCTTCATAAAATGGCCGATCATAACGAAGATG 
AAGCTGTTGGCGGTGACCTAAACAACAATATAACAGCAAATGGGCAGGAGGAAGTGCGTCAGCCGGAGCAGCCGCAGAACGAACTGCTGGACATG 
GTTTTCACGTTGGAAAGCGAAAAACAGGCACTACTAGAGGAGTCTCGCAAACGGATCGGAGCGCTCCGCAAAGACTTTGAGGAAGCCAAGGCGGA 
AAATGAGCGTTTAAAGCGGAGAGTGGTCCAGTTGGAGCAGTCGCTCAAGCAGTCCGTCATGCTGGGCGGCACAGATGCTCCCAAATCACGACAAG 
AGCAGGAGGAGCTGCTTCTAAAGGCCAAAACCCTGCTGTTCGAGAAGACCAAGGTGAACAAACAGCAGGAGCAGCAGATATCAGTGCTTAGGACA 
CAAGTAGACTCGATTAAGGATGTGCTGCAGGTGACCAAGGACATGTTGCAGCTAAAGACCGCCGAATGTGATCACACGCAGGCTCGTCTGGAGAA 

G AGCC AGC AGT G GAT C AAG GC G GAG C AAGAC AAGT GT GT G C T CACGG AG AAG AAACT AGC CAT C T C C AAAGG AGT GT AT G AC AAGC T G C GT T CG G 
AGTACGACACCCAGTCACAGATATTCAAGGATCTTAAGTCCGCGTACGAGCAGAAGGTCAAGGTTCTCACCAAGGCTTTGGCAGAAGCCAAGAAA 

TAG 

(SEQ ID NO: 599) 

Start ATG: 74 {Reverse strand: CAT) 

MADHNEDEAVGGDLNNNITANGQEEVRQPEQPQNELLDMVFTLESEKQALLEESRKRIGALRKDFEEAKAENERLKRRVVQLEQSLKQSVMLGGT 
DAPKSRQEQEELLLKAKTLLFEKTKVNKQQEQQISVLRTQVDSIKDVLQVTKDMLQLKTAECDHTQARLEKSQQWIKAEQDKCVLTEKKLAISKG 
VYDKLRSEYDTQSQIFKDLKSAYEQKVKVLTKALAEAKK* 
{SEQ ID NO: 600) 
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Celera Sequence No. : 142000013384832 

ATGCAATATATTCAACTACTATCATTATTTTGCGGATCGGCGATTCTTAAGGTTGCGTTCCAGTTTGTGATGAGGCTTGCCTTCGTTCTTTCCTG 
TCTTCTGAGTGATTCTTTGAGCATCCCGACTGCGAGTTTGGCGCACGGATTGCTGGGCATCCTTGGGAGGATTGGTTTTACGCTGCTTTGGAGGC 
GGCGGCGTGAAGGATTCGTCGCTATCGTCTAGGTGGTCCTCATAGAGAGCATCGTCTTCTAAGTATTCCGTCACGAATTCTTCCTGCTTCACATT 
GTCCATGTCTTCAAAATCGTCAAGATGTTCCTCCTTCTCATCGACGCCGCCTTCTCCTTCGCCGTCGTCTTCGAACTCGTTCTGGTACGGCTCCA 
TTTCCACCACATGATCGTCCTCGATGTGGCGCTGCAGGCATTTCTGCGTCAAGAAGATGTGGTCACAGAACTCGCAGGCATGGCGCTCGCTCTGC 
TTGTGCTGGTCCATGTGGCTCTCGACACTGTCGATATCTTCGATCTGCTTGCGACACATGGGACATTTCAGGCTGGCGGTGGCATGAATGAGCGC 
ATGATCCAGTAGGTCGTTATAGTTGCTGTACGACACCTTGCACTCACTGCACACGAACGAGGCGTCGGAAGTCTGCGTGTGCGTGCGCAGGTGGC 
GCGAGAGATGATGCGACAGCATGTAGGACTTGTGGCAGTACTTGCAGGGATAGGCTCTCTTCCCGGCGTGGGCAACCAGGTGGCGGGCCAGCTTG 
GAAGCGGTGGAAAAGCTTCTCTCGCAATGCTGACAGGGGAAGGGAAGATTGGTGGAGTGGACTGTCTGAAAGTGGACATAATCGTTAGCCGGGGC 
AATCCCAAATATGCTTGTCAACCCACCTCATGACGCTCCAATCCCTGCTTGAAGGCAAACGACTTGGTGCAGACACCGCATTGGAACGGACGCTT 
CTTCTGATGTCGCTCGGCGTGCTTCTCCATCTCCTGGCGCGAAAGGAAGGGCTTCTCGCAGTACACGCAGATGAACTTGGGTACGTCCGTGTGGG 
TGCGCTCGTGGCGATGGAGCAATGCCTTCCGTGTGAAGGCTTTCGAGCAGATGGCGCACTTGAATGGCCGCTCGCCGGTGTGCACAAAGTTGTGC 
TTGGCCAGGTCGTACTTGGAGAAGAAGCTCTTCTCGCACAGGAGACACTGGAAGCTGCGCGAGCGAGTGTGATTCAGCCGATGGATCTCGAGCAG 
CTGCTGCAGCGGAAAGGAGCGTTCGCAATCGGGACAGGGGAACACATCGGTGGCCACCTTCTCGGCGTTCTTGGTCGGATCATCTTGATCGTCCA 
ATGGCAGATCCGAGTCCAGAACCTCTGTCACTATGGCCACATTGCCACTATCATCTCGCTTGACTTTGGGTGTGGCCAGGCGCGGTTCCACCGGA 
GCGGCGGGTCCCTTGTTCAAGATCCTAATCGAGGACTTGTTCAGCACCTTCGGCTTGACGGGTGATGCCTTCTGGGGGATATCGGGGAACTCCTT 
CTCTAGCTCACTGGCCATATCCTCCAGCATGCTCTGCACATCGTCCATGGACAGCTCCTGGTTGTTATCCACCTTCAGCTCATAGGCGTGCGAAC 
GACGTGGCACTGGAGAGGAACCGGCTACCGTCCTGGGGGTCACCATGGCGCTCTCCAGCACCTGGACGTCCTCGATGATCACCTGACTACTGGAT 
TTGGCCATTGTGTTCAGCAGCTTTTTGGGCGTTTCGCTTGGCTCGGCTGTCTTTGCTGGCACGACCAACAACTTCTTGGAAGCCACCATCGTCAT 
GGGCGCACGGGGCTTCTCCAGCGGCGATGGCCAGTTACCCGTGAGCGGATACTGGCGCAGGAGGGTCTCCGCCCGCTTGCACATCTGCTTGAAGC 
GGTAGCAGTGCTCGAAGAGGAGGCGGCACTCCAGACAGATGTGCCCCGGCATTCCGTCGCCGGCGTAGACCTGTTTTTAATGTTAGATTAGATAA 
ATGTGTGAGTGACTAGGAAGCTACATTTATTCGGGGAAGTCCGTGCTTAAGCCAAATAATAAGGTAATATTCGTGGAACACCCCTATATGTTTTT 
CGCTACATAATAAATTAAACGCCATAAATGTCTCTTTCCCTTAAAATGAACAATATAATGTTAGAGTTTAAATTTAAACAAAGGCGCCGAGGACC 
AGAACACCCTCTACAAATTAATCGATGGCCACCAATCGATGATTAGTGCTATCGACGAGCGGCCATCGCCGCTTGGCGGCAAGTGCATACACACG 
ACTACATGCAATCGCAGTTTGTTTTCGCCGCTCAGAGCACGCTTAACCTCCGTTAATTGATTGATTAAACACGTCCGACTGTGAGCCATAAAAGG 
ACATTCGCTCGATCGAAAGCCGACAAAAGCTGGCTGAGAAGCGCAAAACAGCTGCGCTGCGCAAAGTTTGGCCAACGCAGAGCAATAGAAAACAG 
AAGC AGT G T C AGAAT AAAAC AAG AAT C AG AC C AAAAAGGGC G C AAC AAC T GC GGAG AG AG AAAAGG AAGG AAAG AGC AGC GC GT G AGCG AGGG AG 
GAGCAGGAGGGAGCGAACAAAAGGCAAGAGGAAACACAAGGAGAAACGCAAACAATCAAAACAAGAACTTAAGCAAGAAAAGGGGCGCAAGATGA 
CCGACGAGTGCGTAACCAGAAACTACGGAGTAGGCATTCGCAGCCCAAATGGATCGGAAAATCGGGGCAGCTTTATAATGGCTGATAATACCGAT 
GCCSAGGGCTGTACGCCGGAGTCTCTTGTTGTTGGTGGTGCAACTGCAGCGAGGTGAGTCCATGTGCCTCGCTCTCTCCCTCTCCCATCTCTCTC 
TTTATGTTTCTGTGTTTTGTTATCAATGCAAGTACTGGCATAACTGCTATATCCTAGTTTCAAATGCAGCATATAAAGCTCTTTTTTTTTTATTT 
ATTCCATTTTCATTCCACAATCTATTATTTTTATATTATTTATACTGTCTATTGCCAACATTTAAAATCAATGTGTCAATCTGTCTTTATAATGT 
TGTTAAAATGGCTGAAGTCACAAAGTTTGAAAGCACTTGACTTTAATTCTATTGTTCTATTTAGTCTTATTCACAAATATATTTGAAAATAATTA 
CATAAATAAATACCCAATCATCGATTATTTCGCCGATTACCTTATCTTATCATGCTGCTTCCCAAATGGCTATTATAATTCTGATTTCTTTTTCC 
CCAGCCCCTTGCCGGCCAACAAGTTTGTGGCTCGAATGCCCGTGGAACGATATGCAAGCGAATACAACATGAGTCACAAGCACCGCGGAGTTGCG 
CTGATCTTCAACCACGAATTCTTCGACATACCCTCGCTGAAGAGCCGCACCGGAACAAATGTCGATGCCCAAGAGCTGAAAAAGGCCTTTGAAAA 
CCTGGGATTCGCGGTGTCCGTGCACAAGGACTGCAAGTTGAGGGACATCCTGAAGCACGTGGGGAAGGCCGCCGAGTTGGACCACACGGACAACG 
ACTGCCTTGCGGTGGCCATACTCTCGCACGGGGAGCACGGCTACCTGTACGCCAAGGATACGCAGTACAAGCTGGACAACATCTGGCACTACTTC 
ACTGCCACCTTCTGCCCCTCGCTGGCGGGCAAGCCGAAGCTGTTCTTCATCCAAGCCTGCCAGGGCGATCGCTTGGACGGAGGCATCACTCTGGA 
GAAGGGCGTTACCGAGACGGACGGGGAGTCCTCGACGAGCTACAAGATACCCATACACGCCGACTTTCTCTTCTCCTACTCGACCATTCCGGGTA 
AGGAATATGATTAGGTAACTATTTTAGTGAATTTCACTAGCAATCTCGTCCTGTTAGGCTACTTCTCCTGGCGCAACATCAACAATGGCTCCTGG 
TACATGCAATCGCTGATCCGCGAGCTGAACGCCAATGGCAAAAAGTACGACCTGCTCACCCTGCTTACATTCGTTAACCAGCGCGTAGCCCTAGA 
CTTTGAGTCGAACGTGCCCGCCACACCGATGATGGATCGCCAGAAGCAAATACCGTGCCTCACCTCCATGCTGACGCGCATACTGCGCTTCGGCG 
ACAAGCCGAACGGCAATAAGGCTGGCTAGGAAGAGATCTCCCTTCGAAGAAGAACTTTCAACAGAGGTTGTACAACCATCGCCCGCAATCATTCA 
CGTTGGATTCCAATTCACTTCAAGTTTAGTTTTAGCTGACGCGTGCGATTATCCCGATGGATATTGAAGCACCCGACTCCCTTTCATTCATTCCC 
CTGCCTTTCCCATGCCCATTTCGGCCTTATCATTGACGAAATTCGTTACCCATTTACCCACTAGATGACTTCCGTATGCATAAGCGAGACCTTGC 
CAATAGAAAACCCACAAATTTGAGATAAGATAATACAATGAAGATTTATACATACATACATACTTTTATGTACACGCAAACATTTATATTTGATA 
TGAATAAAAATGAATTAAAAACATAGCAAGGTATCTCCCCAATTGAACGTAGATGGGTAATTTCGATCTAAGCAACTTTGTTTGAAGACAGGAGC 
TGTCTGGGCCAGATTGAGTACGTTGATATGGGAAATAAAAGATAACAATGAAGACTTGGTTGGCTGAATCAGTGAAATTGTACGAAACTTTAGTG 
GAATTCAAATTAGCATTATTAAGAGTGATTAATTAAGATTTTTCTCACTCTAATTTTATTAATTTCCTAAATTTTATATTTATTTCCTAATTTTA 
TAATTTAGGAAATTAATAAAATTAACCAAATGATGATTTTTGCGTAAATATAAGTATCGAAGGAACTATTAATGCATTTAAATATGATCTCGATT 
TGAATTTAGCGCCAAAAACAACTGACAGCTGTCAGGGCGATCAGCTGTTGACGCACTAGTGTGCGTGTGGCGCAGTACAGTTGCCCCAAATCGCA 
GAAATTTTCATTTTTCTTTGCACAAGAGGAAAAAGGGAAAGCCCAATGATTAGCCAACTAATTGGGCTGTAACCTGGTCGCACTTTTCAAGGACA 
TTCGGCTGGGAAAGGGAAGCGGGGCAGTTTGGAGCGGGGAAACTTAGCCAAGACGCCTCGCAGTCGATGCGGAAACCCGCGATGCGGCTTCTTAG 
CCAAGTTATGCGTGGCGCTGGTTAGTGCAAGCCAGAACCCATGGATGGAGCCGCTTACCTCGATCGCCGTGATGGCCATAATCTGCAGCGGCAGA 
TTGGCGGTTGTTTTCACCCGCGGATTCTCCTCGAAAATCGAGGCCAGCTTCTGCTCGGTCAGACAGAAGCGGCAGACCCTCTTCTCGGTCAGCAT 
GGCCTCGCGCACCTCCAGTTTCGCGGCCATCGGCAATTGGCGGTAATTCGTTCCGGGCGGCGGTAGAAACGCGGAATTTCACTTGCAAAGTGCGT 
ACAGATTACAAATGAGAGGAAAACAATCGCTTGGCGTCGCTTAGAGATGTGCACTATGCAGATATCGATGTTTCTGTGT 
(SEQ ID NO: 601) 

Exon: 1001.. 2808 

Exon: 3235.-3797 

Exon: 3858. .4494 

Start ATG: 2657 

Transcript No. : CT17064 

GCTTCTCGCAGTACACGCAGATGAACTTGGGTACGTCCGTGTGGGTGCGCTCGTGGCGATGGAGCAATGCCTTCCGTGTGAAGGCTTTCGAGCAG 
ATGGCGCACTTGAATGGCCGCTCGCCGGTGTGCACAAAGTTGTGCTTGGCCAGGTCGTACTTGGAGAAGAAGCTCTTCTCGCACAGGAGACACTG 
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GAAGCTGCGCGAGCGAGTGTGATTCAGCCGATGGATCTCGAGCAGCTGCTGCAGCGGAAAGGAGCGTTCGCAATCGGGACAGGGGAACACATCGG 
TGGCCACCTTCTCGGCGTTCTTGGTCGGATCATCTTGATCGTCCAATGGCAGATCCGAGTCCAGAACCTCTGTCACTATGGCCACATTGCCACTA 
TCATCTCGCTTGACTTTGGGTGTGGCCAGGCGCGGTTCCACCGGAGCGGCGGGTCCCTTGTTCAAGATCCTAATCGAGGACTTGTTCAGCACCTT 
CGGCTTGACGGGTGATGCCTTCTGGGGGATATCGGGGAACTCCTTCTCTAGCTCACTGGCCATATCCTCCAGCATGCTCTGCACATCGTCCATGG 
ACAGCTCCTGGTTGTTATCCACCTTCAGCTCATAGGCGTGCGAACGACGTGGCACTGGAGAGGAACCGGCTACCGTCCTGGGGGTCACCATGGCG 
CTCTCCAGCACCTGGACGTCCTCGATGATCACCTGACTACTGGATTTGGCCATTGTGTTCAGCAGCTTTTTGGGCGTTTCGCTTGGCTCGGCTGT 
CTTTGCTGGCACGACCAACAACTTCTTGGAAGCCACCATCGTCATGGGCGCACGGGGCTTCTCCAGCGGCGATGGCCAGTTACCCGTGAGCGGAT 
ACTGGCGCAGGAGGGTCTCCGCCCGCTTGCACATCTGCTTGAAGCGGTAGCAGTGCTCGAAGAGGAGGCGGCACTCCAGACAGATGTGCCCCGGC 
ATTCCGTCGCCGGCGTAGACCTGTTTTTAATGTTAGATTAGATAAATGTGTGAGTGACTAGGAAGCTACATTTATTCGGGGAAGTCCGTGCTTAA 
GCCAAATAATAAGGTAATATTCGTGGAACACCCCTATATGTTTTTCGCTACATAATAAATTAAACGCCATAAATGTCTCTTTCCCTTAAAATGAA 
CAATATAATGTTAGAGTTTAAATTTAAACAAAGGCGCCGAGGACCAGAACACCCTCTACAAATTAATCGATGGCCACCAATCGATGATTAGTGCT 
ATCGACGAGCGGCCATCGCCGCTTGGCGGCAAGTGCATACACACGACTACATGCAATCGCAGTTTGTTTTCGCCGCTCAGAGCACGCTTAACCTC 
CGTTAATTGATTGATTAAACACGTCCGACTGTGAGCCATAAAAGGACATTCGCTCGATCGAAAGCCGACAAAAGCTGGCTGAGAAGCGCAAAACA 
GCTGCGCTGCGCAAAGTTTGGCCAACGCAGAGCAATAGAAAACAGAAGCAGTGTCAGAATAAAACAAGAATCAGACCAAAAAGGGCGCAACAACT 
GCGGAGAGAGAAAAGGAAGGAAAGAGCAGCGCGTGAGCGAGGGAGGAGCAGGAGGGAGCGAACAAAAGGCAAGAGGAAACACAAGGAGAAACGCA 
AACAATCAAAACAAGAACTTAAGCAAGAAAAGGGGCGCAAGATGACCGACGAGTGCGTAACCAGAAACTACGGAGTAGGCATTCGCAGCCCAAAT 
GGATCGGAAAATCGGGGCAGCTTTATAATGGCTGATAATACCGATGCCAAGGGCTGTACGCCGGAGTCTCTTGTTGTTGGTGGTGCAACTGCAGC 
GAGCCCCTTGCCGGCCAACAAGTTTGTGGCTCGAATGCCCGTGGAACGATATGCAAGCGAATACAACATGAGTCACAAGCACCGCGGAGTTGCGC 
TGATCTTCAACCACGAATTCTTCGACATACCCTCGCTGAAGAGCCGCACCGGAACAAATGTCGATGCCCAAGAGCTGAAAAAGGCCTTTGAAAAC 
CTGGGATTCGCGGTGTCCGTGCACAAGGACTGCAAGTTGAGGGACATCCTGAAGCACGTGGGGAAGGCCGCCGAGTTGGACCACACGGACAACGA 
CTGCCTTGCGGTGGCCATACTCTCGCACGGGGAGCACGGCTACCTGTACGCCAAGGATACGCAGTACAAGCTGGACAACATCTGGCACTACTTCA 
CTGCCACCTTCTGCCCCTCGCTGGCGGGCAAGCCGAAGCTGTTCTTCATCCAAGCCTGCCAGGGCGATCGCTTGGACGGAGGCATCACTCTGGAG 
AAGGGCGTTACCGAGACGGACGGGGAGTCCTCGACGAGCTACAAGATACCCATACACGCCGACTTTCTCTTCTCCTACTCGACCATTCCGGGCTA 
CTTCTCCTGGCGCAACATCAACAATGGCTCCTGGTACATGCAATCGCTGATCCGCGAGCTGAACGCCAATGGCAAAAAGTACGACCTGCTCACCC 

T G C T T AC AT T CG T T AAC C AGC G C G T AGC C C TAG ACT T T GAGT C GAAC GT GC C CGC C AC AC C G ATGAT GGAT CGC C AG AAGC AAAT AC C GT GCC T C 
ACCTCCATGCTGACGCGCATACTGCGCTTCGGCGACAAGCCGAACGGCAATAAGGCTGGCTAGGAAGAGATCTCCCTTCGAAGAAGAACTTTCAA 
CAGAGGTTGTACAACCATCGCCCGCAATCATTCACGTTGGATTCCAATTCACTTCAAGTTTAGTTTTAGCTGACGCGTGCGATTATCCCGATGGA 
TATTGAAGCACCCGACTCCCTTTCATTCATTCCCCTGCCTTTCCCATGCCCATTTCGGCCTTATCATTGACGAAATTCGTTACCCATTTACCCAC 
TAGATGACTTCCGTATGCATAAGCGAGACCTTGCCAATAGAAAACCCACAAATTTGAGATAAGATAATACAATGAAGATTTATACATACATACAT 
AC TT T T AT GT AC ACGC AAAC AT T TAT ATT T GAT AT G AAT AAAAAT G AAT T AAAAAC AT AGC AA 
(SEQ ID NO: 602) 

Start ATG: 1657 

MTDECVTRNYGVGIRSPNGSENRGSFIMADNTDAKGCTPESLVVGGATAASPLPANKFVARMPVERYASEYNMSHKHRGVALIFHHEFFDIPSLK 
SRTGTNVDAQELKKAFENLGFAVSVHKDCKLRDILKHVGKAAELDHTDNDCLAVAILSHGEHGYLYAKDTQYKLDNIWHYFTATFCPSLAGKPKL 
FFIQACQGDRLDGGITLEKGVTETDGESSTSYKIPIHADFLFSYSTIPGYFSWRNINNGSWYMQSLIRELNANGKKYDLLTLLTFVWQRVALDFE 

SNVPATPMMDRQKQIPCLTSMLTRILRFGDKPNGNKAG* 
(SEQ ID NO: 603) 

Name: caspase-1 precursor 
Classification: endopeptidase 
Gene Symbol: Dcp-1 
FlyBase ID: FBgn0010501 



Celera Sequence No. : 142000013384832 

TGGGGGTGGGGTAGTTGAGGGGCACGGAAGGAACTGAACAAAATTGAAATAAAATAAAGCACCATAAAACGTACTTAAGATTGACGATTGTGTTT 
TTTGGGGGAGCGATCAATGAAAGAGTTTCGATGAAGACAATATATGGCAAGTACAGTTGAGCTCTTTTTTGGATTTGGGCATTCGTGTTGTCTGC 
AAAGGGTTATTGATAATTCTATGCTCTTCCCAATTTAGTTAATATTCTTTTATTACTAAATTATGAGGGAATTTTTGTGTCTGATTTCGAGACGG 
GGGAACAATCGAAGGAGCCAGAAAGCGACCGAAGAACGCGTTGAGTAAATGCACATTCTGGAATATTTATATAGTACATTTGAATATATATGTAC 
TTGATTAGATATAATTGGCTGTCAGTATGTCTCTTGATCGCTCCACTCAATCGCACCAATACTCGGATTCGGATACCGAGCAACCGCACAGATAC 
ATTTTCGAAACAAACAAAATGTCGGCAGCTGCTTGTTGTGTTAGCTTATAAATTAAAACGCGTTTCATGGCATTATTATTTGATGGTTTTTTACT 
TTTTTTTTTTTCGTCTTATGGCTGCTCAATTCATTCGATTGTTCGTATATTCACATTCTTTTTAGCCCATTCGTACCCATATACCCCCGATATCG 
CTCACACCTCCTTCCCCCCCTTCTCCGCTCGGTTCATCTGTCCGCCTGTCTGCTAATTGAAAATCAATTAATGTGACTGCATCCTATAGCTGCTC 
ATGTCAAAGTCAGATATCATCCAGGGTAGGGT AAAT TCGATGGAATAAGGGAATGAAAACAAGAGAAGG AAT TCAAGACTC AAC GACTCAGATTC 
AGAATCCGAATCAGAATCCTTTTGTTTTCACTGACCTTACTGTCTGAAAACAAAAACCCAATTGATGTGCCCCAAGTAACACTCTGATAATTACC 
GAAAAACTCCAAATTTTTCGATTGCTTTCGACTTTGGCAGTATTCATCGCCAAGTTCATTGGGTTTTCGAGTGCGTCCCGCGAATCGCTTTCGTT 
TGGTGCGGAATTTGGCACAAGTTTTGTATGCTCAACTACTGGGAAGGCCAATCTGGGGCTTCTAATTCCACTGTGCGGGTAATTTGGGGACTCTT 
GCCAGTTACTAGTTACCTCCCAATACGAATCACTGATTTCTCACCTCCTGCCACTCGCTTCTTGCTTCCGTTGCAGGTCAACAAACAGTATTCGG 
CCACCGATCTGGAGGCTTTTATGAAGATTGCCGCCAACTGGCAAAATTCCAATCACAACTTGCTGGAGGGCGTCGACATAAAGACGGGTAAGTTA 
AACAAAACGAAGTGACCGTTGAAGGAACTTAATTCAATCATCGAATGCTTGCAGAAATGACACAGTACATGAACAGTCCTTCCATGAACATGTAC 
AAGAAGAGGGAACGCACCCAGAACTGGAACCACCAGGAGAAGAAATACCTACTCGATCTGTGTCGCAAGGACATGCGCATTATCGAGAACAAGAG 
ATTGGATGCCGGTCTGACCGCCGTGAAGAACAAGGCATGGAAGATTATCCACCAGAAGTTCTCCAATCAGTTTGGCACCGATCGCACCTGCAATC 
GCCTGAAGGAACAGTGGCGTCGCATGAAGGGTAAGTCTTTATAAAATACATATATTTAATGGCAAAGCTAACCTTTTCGATCTCATCCTTTAGCA 
TGTACTCGCAACGAGATCCTGGACTACAATAATCGCCTGGCCAGGTTTGGAGCAGAGGTGGCCGATCGCAAGAAGCCATCTCCGTTCACCTTCGA 
GGTGTGGGATTTCATGCAGGAGGCGAAGAAGGCATGCAAATCGGAGGCCCTGGATGGCATTGATTACTCTAAGATTCCACTGGCCTTGGAGGAGG 
GTTTTGAGTATCGTGAGGATTACAAATTCAATCCCGAAGACCACGACATGGATGAGTAAGTTGATCGCTATGTGGTAGTCTGAATAGATCTTTAT 
TGAGCATTCTCTATTTGAGCAGCAGCCGCACACCGCCGCAGGAACTGTGCGATGTGGACATCAAGGAGGAGGAGAACAACGAGACCTTCAACGAG 
ACCTTCAACCATCACCACCTCAACCAGAGCGACATCCTTGGCAGCGGCGAGCGTCTCAGTGTGTCGCCTAATCACAGCATCAGTCGCAACGGGAT 
CCACGAGGCCGAGAGTCCCGCCACGATTTCCACATCTAATGTCCTGGACTCGAGCGGTGCCTATATGGCCGGAGGAGGTGACATGTCCGGCTTCC 
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AGGCCAACTTCAACATGAACAACATATCCGCCACGCTGGAGGCACTGAATGCCCTCCGTTCTGGCCAATTTCCCAGTGCAGCAGCAGCGGCAGCG 
GCCGCCATTCACCAGCACCAGGCACAGGCCCAGGTCGCCATGCATCACCAGCACAATAACAACAACAACAATAACAACAACAACGAGGACGACGA 
GGACAGCATGAAGCCGCAGTCTAAGAGACAGCGCACCAGCAGTGGTAGCATGTTGGGCAGCGAGGATCAGGCACCAGTCCTGGCCCCATCAGTGC 
CAGCATCCTCGGACTCGCAGGCCCTAGAGCGGTCCGGCAGCAGCAATGGATTGACCATGCCGGGAACGGCAGTCAGCACCAATGCCAACAGCAGC 
AGCAGCAACAACAACAACTTTGAGCTGCGCATGTTCATGGAGATGCAGAGCAAGGAGCACATGATGCGCATGAAGATCCTGGAGGTGCAGTTGCA 
GGCGGCAAAGCACAGTCGCGACCTAGTGGAGATCAACAAGACACTGGCGCTGCAGAAGCTGCAGTTGGAGCTGGCCAGCAAGCGACTGCCCAGTT 
AGGAGGCAGTGCAGCAGCAAAAGGAGCAGCAGCAGCAACAGCAGCAGCAGCCCATCGCCGTTTCATGTTACATTAGTTTATAGAAATAAAGCCTG 
TCAACTCGGTCACGTTCGCAGGCAGAGAGCGTGATTGGGAATCGACTAGAGTTAGCACTAGAGTTAGTCTTAAGTTGAAAGCAGGATCCACAGCT 
ACACATCCAGACCCGACACACAGAGAGACATTCCTGGCATTGTTGATCAGTATTTGAAAGCTAGAAGCGAAACCCGACACACAGAACCGAACCCA 
AGAACCCCAGCGAGGATCAGGAGGCGCTCGGAAGTATGCCATAATCTTATGATGATGATGACGATGATGATGATGAATCGCCTACCTCACAAGCA 
CACACACACACACACACACAATATGCACACACTCAGCTAGCAGACACACACACACAGACACACTCTCTTCTGGCCTAACCAGCACTTGACCAAGT 
CAAGTCAAGTTTAGTTAGTTTCTAGTTTTAGTTCGTATGGGAACCACTCTCTCCAAGTTGCTCAACTACCACAAATGTATTGTACTTACGTTCTT 
GACCTCTGCTACCCCACAACTAACGATTTGAACGAACATGAAATGCACACAAGATTATTTTTCATAGTCCTAAGAGAAGAACCTTATATATGTCA 
ATGCATCATACACAATCGGCAGTTGTCTCTGATTTTGATTCTTACTTGAACTTGTATTTATTTGTATCTTTGTATAGACTATGTTGATATATATA 
TGACCCTATATATGTTTGTTTCAATGCCATATTACATGAAAGTTCTTTTTAATTCATATTTTCCGTTTGCGCTGCAAAACAAATTGTTACCATAT 
ATAATATATGATATGTATGTACCCTACGTATAAGCAGGCTTGAAGAAGGCCGCCAACTCGGCTTCAGTTGTCGTAGAAGACTTCTTTGTAAAGCG 
TGTAAATGAGAGATACAAATTATTTTAAATAAAATACAAAACAAAACAAGCAAATGTTTCAGCTACACTTTATTAATTTTGTTCATGGTTCATCT 
GTGAGCCTTCGGTCTTGGTTTTTTCCAACGATTGTATCTGCTTATCCACCTTGATGATCTTTGTTGTGACTTCGTTCTTCTGGCTGAGAATCTTT 
TCTCGCAACGACTTCAGCTGTTCCCGTTGCTGGGTAAATTTGCAATTAAATTTAAATAGATTACTTTCGTTTTTTGAGATCACAAACCAACCAGT 

T T AAGG AAAT GT AT GT TCTCCTCCGCCCTGGCTTTTTC GAAC AAAT GCT T GG AT AT GT T C CT GG AT AC AG AG AT GC AT C T AAT AAGC G AAG C C AT 
AAGTGCTCAGAACACGGAACTCACTTGTTGGATTGTGAAAACTGACACATCCTGGGTCGCAACAGTTGGAAAAGCTTTTCGACCAACTTCAGCAT 

T GT AT C C T T AC GG AGGAAG AT AAT T T GG AAT T T AC T AT AGT AACT G AT G AGGT ATT T C AAG GT GGAAT AT T G AC AAAC T T AAAAGT G AT AGC C T A 
CCGTTTGATAGGTCATTTTTCGAGACTCTTTTTGAATTGCTCCAACAAATCCTTCAGAGCTTTCAATTCCTTTAGCAGATTCTCCTTGTGTTTCC 
TTGATTTGTGGCTAGTTTTGGATTCCAAAATCTTGATCTCCTCTTTCAATTTGACGATATCATTGGCGATTTCTTTAACCCTGTTCTTTTTTATC 

T T CAT AAAT T GC T CT GAGGC C TAT GGAAT ACCC AAT T CT AC AAG GT T AT T C AC AAC ATT AAAT AGT T T C G AT AC AT AC AAGT TT GAG AAC AT AGT 
TAAGTTCCTCGCCTCTGCCCTTCTCGGAAAATTTGTCAGTGAATTGACGTCTATCATATGTAAAAAAAAAATTATTTTTACGGGTCATTTGAATA 
ATAAAATACTTACTTAGCAGGAAATTCGTTTAAATTATTATTCGACCTTGGGTGGAAAATGTTTCTACCGTGGTAAAAGTTTCTATGCCTTGGTA 
AATTTCTGTGCAACTCATTGGAAGGCCAACTAATATTTAGAAGATTTACAAATAAAGTTTGGATAAATTTCCAGTACTACATACCAAACTCTTTC 
TTGACATAGCACCTGAGGTTGAAATAGCCTGTTGCACGATGCCGAGCCTAAGATTCGTGCAACAGGAAACATGGCTTCTTAAGTT 

(SEQ ID NO: 604) 

Exon: 1001.. 1322 
Exon: 1385.. 1645 
Exon: 1709. .1955 
Exon: 2018.. 4025 
Start ATG: 1073 

Transcript No. : CT17108 

CAAGTTCATTGGGTTTTCGAGTGCGTCCCGCGAATCGCTTTCGTTTGGTGCGGAATTTGGCACAAGTTTTGTATGCTCAACTACTGGGAAGGCCA 
ATCTGGGGCTTCTAATTCCACTGTGCGGGTAATTTGGGGACTCTTGCCAGTTACTAGTTACCTCCCAATACGAATCACTGATTTCTCACCTCCTG 
CCACTCGCTTCTTGCTTCCGTTGCAGGTCAACAAACAGTATTCGGCCACCGATCTGGAGGCTTTTATGAAGATTGCCGCCAACTGGCAAAATTCC 
AATCACAACTTGCTGGAGGGCGTCGACATAAAGACGGAAATGACACAGTACATGAACAGTCCTTCCATGAACATGTACAAGAAGAGGGAACGCAC 

C C AG AAC T GG AAC C AC C AG GAG AAG AAAT ACC T AC T C GAT C TGT GT C GC AAGGAC AT GCGCATTATC G AGAAC AAGAG AT T G GAT GCCGGTCTGA 
CCGCCGTGAAGAACAAGGCATGGAAGATTATCCACCAGAAGTTCTCCAATCAGTTTGGCACCGATCGCACCTGCAATCGCCTGAAGGAACAGTGG 
CGTCGCATGAAGGCATGTACTCGCAACGAGATCCTGGACTACAATAATCGCCTGGCCAGGTTTGGAGCAGAGGTGGCCGATCGCAAGAAGCCATC 
TCCGTTCACCTTCGAGGTGTGGGATTTCATGCAGGAGGCGAAGAAGGCATGCAAATCGGAGGCCCTGGATGGCATTGATTACTCTAAGATTCCAC 
TGGCCTTGGAGGAGGGTTTTGAGTATCGTGAGGATTACAAATTCAATCCCGAAGACCACGACATGGATGACAGCCGCACACCGCCGCAGGAACTG 
TGCGATGTGGACATCAAGGAGGAGGAGAACAACGAGACCTTCAACGAGACCTTCAACCATCACCACCTCAACCAGAGCGACATCCTTGGCAGCGG 

CG AG C GT C TC AGT GT G T C G CC T AAT C AC AGC AT C AGT C GC AAC G GG ATC C AC GAGGC C GAG AGT C C C GC C AC GAT T T C C AC AT C T AAT G T C C T GG 
ACTCGAGCGGTGCCTATATGGCCGGAGGAGGTGACATGTCCGGCTTCCAGGCCAACTTCAACATGAACAACATATCCGCCACGCTGGAGGCACTG 
AATGCCCTCCGTTCTGGCCAATTTCCCAGTGCAGCAGCAGCGGCAGCGGCCGCCATTCACCAGCACCAGGCACAGGCCCAGGTCGCCATGCATCA 
CCAGCACAATAACAACAACAACAATAACAACAACAACGAGGACGACGAGGACAGCATGAAGCCGCAGTCTAAGAGACAGCGCACCAGCAGTGGTA 
GCATGTTGGGCAGCGAGGATCAGGCACCAGTCCTGGCCCCATCAGTGCCAGCATCCTCGGACTCGCAGGCCCTAGAGCGGTCCGGCAGCAGCAAT 
GGATTGACCATGCCGGGAACGGCAGTCAGCACCAATGCCAACAGCAGCAGCAGCAACAACAACAACTTTGAGCTGCGCATGTTCATGGAGATGCA 
GAGCAAGGAGCACATGATGCGCATGAAGATCCTGGAGGTGCAGTTGCAGGCGGCAAAGCACAGTCGCGACCTAGTGGAGATCAACAAGACACTGG 
CGCTGCAGAAGCTGCAGTTGGAGCTGGCCAGCAAGCGACTGCCCAGTTAGGAGGCAGTGCAGCAGCAAAAGGAGCAGCAGCAGCAACAGCAGCAG 
CAGCCCATCGCCGTTTCATGTTACATTAGTTTATAGAAATAAAGCCTGTCAACTCGGTCACGTTCGCAGGCAGAGAGCGTGATTGGGAATCGACT 
AGAGTTAGCACTAGAGTTAGTCTTAAGTTGAAAGCAGGATCCACAGCTACACATCCAGACCCGACACACAGAGAGACATTCCTGGCATTGTTGAT 
CAGTATTTGAAAGCTAGAAGCGAAACCCGACACACAGAACCGAACCCAAGAACCCCAGCGAGGATCAGGAGGCGCTCGGAAGTATGCCATAATCT 
TATGATGATGATGACGATGATGATGATGAATCGCCTACCTCACAAGCACACACACACACACACACACAATATGCACACACTCAGCTAGCAGACAC 
ACACACACAGACACACTCTCTTCTGGCCTAACCAGCACTTGACCAAGTCAAGTCAAGTTTAGTTAGTTTCTAGTTTTAGTTCGTATGGGAACCAC 
TCTCTCCAAGTTGCTCAACTACCACAAATGTATTGTACTTACGTTCTTGACCTCTGCTACCCCACAACTAACGATTTGAACGAACATGAAATGCA 
CACAAGATTATTTTTCATAGTCCTAAGAGAAGAACCTTATATATGTCAATGCATCATACACAATCGGCAGTTGTCTCTGATTTTGATTCTTACTT 
GAACTTGTATTTATTTGTATCTTTGTATAGACTATGTTGATATATATATGACCCTATATATGTTTGTTTCAATGCCATATTACATGAAAGTTCTT 
TTTAATTCATATTTTCCGTTTGCGCTGCAAAACAAATTGTTACCATATATAATATATGATATGTATGTACCCTACGTATAAGCAGGCTTGAAGAA 
GGCCGCCAACTCGGCTTCAGTTGTCGTAGAAGACTTCTTTGTAAAGCGTGTAAATGAGAGATACAAATTATTTTAAATAAAATACAAAACAAAAC 
AAGCAAATGTTTCAGCTACACTTTATTAATTTTGTTCATGGTTCATCTGTGAGCCTTCGGTCTTGGTTTTTTCCAACGATTGTATCTGCTTATCC 
ACCTTGATGATCTTTGTTGTGACTTCGTTCTTCTGGCTGAGAATCTTTTCTCGCAACGACTTCAGCTGTTCCCGTTGCTGGGT 

(SEQ ID NO: 605) 
Start ATG: 73 
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MLNYWEGQSGASNSTVRVIWGLLPVTSYLPIRITDFSPPATRFLLPLQVNKQYSATDLEAFMKIAANWQNSNHWLLEGVDIKTEMTQYMNSPSMN 
MYKKRERTQNWNHQEKKYLLDLCRKDMR^ 

VADRKKPSPFTFEVWDFMQEAKKACKSEALDGIDYSKIPLALEEGFEYREDYKFNPEDHDMDDSRTPPQELCDVDIKEEENNETFNETFNHHHLN 
OSDILGSGERLSVSPNHSISRNGIHEAESPATISTSNVLDSSGAYMAGGGDMSGFQANFNMNNISATLEALNALRSGQFPSAAAAAAAAIHQHQA 
QAQVAMHHQHNNNNKNNNNNEDDEDSMKPQSKRQRTSSGSMLGSEDQAPVLAPSVPASSDSQALERSGSSNGLTMPGTAVSTNANSSSSWNNNFE 

LRMFMEMQSKEHMMRMKILEVQLQAAKHSRDLVEINKTLALQKLQLELASKRLPS* 

(SEQ ID NO: 606) 

Name: RNA binding protein 
Classification: RNAjoinding 
Gene Symbol: apt 
FlyBase ID: FBgn0015903 

Celera Sequence No. : 142000013384541 

ACCTGAATGTCAAGCCCAGCTGTGCCGGAGCCTGCGCCTTGGCCGTGGACACCGTCACGCCGTAGTCCTGCAACGTGCTGTCGTCCTCCATGACA 
TCGTTGTCCTGATTGTATAACCGCTGGTCCACGGGCTGCACCTTTAGTATGCCTGCGATCGATGGGGTTGTATGATCAGTCATGGAGTTACCCAA 
TAATGGCGCTGCTCTATAAATAGAAGCCAACACTTACCCTCAATCATTCGCTTCAGCTCGGCCACCGTTGTGTTCTCCTTGGCGTCCGTGAAGAT 

GGTGGTCTTTTGCCGCCT GAT C AT AAGGAAC AC GT C CT GC AAAAG AG AGGGT T AG C CAT TAG AT T T G ATT C GAT GC G AAGT G AT TCGTTTCGGTC 
AAAGTTCAGTGGCGCTTTTTCAGTGTGTCATGCTTTCTGATCTTGCCATCACACAGGCGACATTTGTGCACATGCATTAAGAGCGCGACGGTGTC 
TATGGCAGCAGCATCTGCGATGCACTGCATCCTTGCTAGGCTGCACGACACCGGCACGAGCACTCACCATATTGAGGAGTTTGTGGCGCTGGCTG 
CGACTGAGGCTGCTACGCGGATGTGGGTTAACCAGGTGCGTGGGCTAGTGCGCTGTGTTCGGCAACTGACGGCGTGGGTGGCGCTTCCTTTGGCG 
GTCGCGGGCGGACGTTTCGATCGGCTGGAATTTTCCTACTTCGCTTGCACCACCGCTGCTGGGCGCTTCTATTTTTTGCTCTGTGCCGCTCGACC 
GGAGTTGGGGGCGGGCGCAGGTGGGCGTGGCGTGCGGTTCGCGGTACGATGAGCGTGCGTGAGTGTGTATGTTTGTGCAACAGCTTCCTCGGTTG 
CCCCCCACTTGTAGGTTTTCGTAGTTTTACGATTTTTTCTAGCCTTTTGCTGCTGCTTTCTCAACTAATTAGAGCTGTCAAACCATCGATGGTCG 
CATCGATAGTGAGCTATCGATAGTTCAGGGTCAGCGGCCTTCGTTGTCTAATGCCATCGCTAATCCGATATTATTGCAGGGCTTTTAGCAAGGGA 
TTGTCTAAAGCCTTGGTGAATTCTATTAATGTGTATTAACAAATACTTCACAATTCCGTTTAATGTATATTTTCCATTAGTTTATAAATAAAATA 
ATTCGCATAAGTAATGAGCTTTTGCAGCCCTGGTTATTGCCAAGAAAATGTACTAGAAAATACCGCCATAGCTGTCAGTGTCACTTTTCCCGTTT 

CT C CT GT T T TAT T T GC C AC GAAAT GG T T T T GC T AG AC AATT C G AAC GT AAAT AAAT G AAAT AC CT T T AAAAG AAGC C G AT AAGT AAC CGT AT C T C 
CTTACAGTTTATCCTGCGCCTGGAGAAGATCGCAAATGCGGCCAAGAAGGACTCCTCCTTCACGCTGACGTTCAAGCGATGTGAGTGTTCGACAC 
CGAAAAACATGGTACACTAGGAACTAATTTACAATTTATTTAATGTGCCGCAGATGATGGCAACGATAAGCCCGTGCCGAGGGAAGGACGGCCAC 
CGCTGCCCAAGCCGGAGACCTACATGTGCCTGATGCGCGCCCAGTCCAAGTCCCAAAAAGTAAGCCCGATTGAAAGCTCCACGTTTCGACTCCTT 
ACCAAATAAACCTATTTTTCCGGCAGATTTCCACCGTTGTCCGGCAGGAGGATGTGCCCGCCATGATGAGCATGTACTCGCAGTTCATGAAGAGC 
AAGATGGACGGGCTAAAGCGGGTTAAGAAAGTCAAAAGCAAGGCCAAGGCGACAAAGGGTTAATCGGGGCATAGTCTTAGTTAGATTTTATTTGA 
CTGTTTACTGGTCGTTTGTTTTAAACCTTAAATGCATCATTTTGTACGGAAAGCACGCTATACTTCTTATTTCTATCATCAATCTGACGCAGATC 
GAGCAGCCACTTCGGCAGTTCCCACTGCCGCAACCCGTATATGAACTTTGGTCTTATCAGGTCAACCAAACTGTAAAGATCCTCGTTTATTTGGT 
TCTCAGCTAATAGCTCTTCGTCCACGTACAGTACATCGATTCCCTGACAGAGGCCCGTGTGAAGGGAGCCCGAAGCCTTTGGCAGAAAGTCCAGA 
TACCCAGATAGGCAGTCCACAATTAAATATCGTGCATGGCCGTCTGCAAAGAGATAAGTGTTCTCAGACTACCAACAGGAACCAATGGCTCAGTA 
CTCAGGTAGTACTCACTTTCTATGTAGAATAGGTTGCACTCCAAGCCGTTTTCCTTTCTTCCTGCTGGTATCAATGTGATTGTGATCTCGTTTTC 
ATCGTACACTAGTCGGGTGGGCTTATTCGGCAGTAGTATTTCAAAAGTGCAGCTTTCTGAAACATCTGCATGTGATTGCAGCCGTGAGATGTCCA 
GCTGCGAACCCACAGCTAACTGGCACCAATTTTGTGGTGCTATTAGATCTCCTAAGGCGGCTGACGTGAAGAGCAAGGGTCCACGGTGCTGGAGG 
GCCAAATTGTATTGGAAGTGAATTTATATAAAACTAGGTGGTGTACGCACCTCCTCGTAGGAACTGTAGACTTTTCCTTTTTCCAGTATTAAGGG 
CTGGCCTATATCGTTTAAAAACATTGTTAATTATGCGCCAATTCCTCAGGAAATTTGTTTTTCTATGTGTTACCAGATCGCATTGACAAGAAAAT 
CCCAATTCTTCAAGGACTATTTTCTAAGTTCAAAACTACAGCGCGAGATTTTAAAGTAGAATTCTGGATAATTTTGGAAATATTTGCAGAAAATC 
CCCTCATTGAAAGAAACTTTTTAAGTATTTAAATATTACCATCTGCACTATGAAATCATCCAACCTTCCCTTTTGTAGCGCACTTCAATAAAAAA 

C AC AG AT T GT AG 
(SEQ ID NO: 607) 

Exon: 1001.. 1060 
Exon: 1219.. 1281 
Exon: 1338.. 1410 
Exon: 1479. .1579 
Exon: 1642. .1862 
Start ATG: 1001 

ATGCCATCGCTAATCCGATM 

CGAAATGGTTTTGCTAGACAATTCGAACTTTATCCTGCGCCTGGAGAAGATCGCAAATGCGGCCAAGAAGGACTCCTCCTTCACGCTGACGTTCA 
AGCGATATGATGGCAACGATAAGCCCGTGCCGAGGGAAGGACGGCCACCGCTGCCCAAGCCGGAGACCTACATGTGCCTGATGCGCGCCCAGTCC 
AAGTCCCAAAAAATTTCCACCGTTGTCCGGCAGGAGGATGTGCCCGCCATGATGAGCATGTACTCGCAGTTCATGAAGAGCAAGATGGACGGGCT 
AAAGCGGGTTAAGAAAGTCAAAAGCAAGGCCAAGGCGACAAAGGGTTAATCGGGGCATAGTCTTAGTTAGATTTTATTTGACTGTTTACTGGTCG 
TTTGTTTTAAACCTTAAATGCATCATTTTGTACGGAAAGCACG 
(SEQ ID NO: 608) 

Start ATG: 1 

MPSLIRYYCRAFSKGLSKALCHFSRFSCFICHEMVLLDNSNFILRLEKIANAAKKDSSFTLTFKRYDGNDKPVPREGRPPLPKPETYMCLMRAQS 

KSQKISTVVRQEDVPAMMSMYSQFMKSKMDGLKRVKKVKSKAKATKG* 

(SEQ ID NO: 609) 
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Celera Sequence No. : 142000013384548 

CACACTCACCGCGGGCGCAAATGCAGCACAAAGCAGACTAGAAATAACAATCGTTATCGAAAAGGTCGTCGAAATGGGCATAAATATTTTTATAA 
ACAGTAAATATATAATTTCTTGTGGAGATCACGTTAAATGTGATGAGCTTTACAGTCGGATTGCGGAGAAAACAGTGAGTATGTTATGTATAATT 
TTCATAAGATTTTGCAATGTGGCCCTTTTTTTTACAGAACCTGCAACCAGGGGAATACTACTTAGTCAGCAATGGCAAGCGCTTGGAGGGAGAAA 
TTTCATCTGGAGATGTTCACTGCGTTCTCCGCCAGCTCGGTGGCAAAGGAGGATTTGGTTCCATGCTTCGAGCCATTGGTGCGCAAATTGAAAAG 
ACAACCAATCGCGAGGCTTGCCGCGATCTAAGTGGTCGCCGTTTGCGGGATATCAACGAGGAGAAGCGGGTGCGCGCCTGGCTGGAGAAGCAGGG 
CGAACGGGAACGGGAGGCCGAGGAGCGTAAAAAGCGAAAAATCGAGAAACTGCTGGCCGTGCCCAAGCACGACTTTAAGGACGATAAGTACGAGG 
AAGCCAGGGCTAATCTGACCGAGAAGGTAAACGACGCCTTCGAGGAGGGACTCAAGCAGGCCGAGGAGAACAAGGAGAAAGGCGTCGAGGAAGCA 
ACTTCCAGTGGCACAAAGAGGAAATCCCCCGCCGTAGACAAGACCAAGGCAAAAAAGAAGAAAAAGGGCACACTCTGGATAGACGATGACATCTC 
TGGATCCGACTCTGACTCTGACGACAGCGAGGAGGAGCCAAAAACACAGAAAAAAGCTATACAAAACTAGATAAATAACTTTTGAATTAAGTATG 
TACCAAAGACGTTCTTATTTCAAAATTTACCTGAAATAAACATAACAATATAAATACTTCTCATTTAAAATTTGAAAGATTTAGATTTAATACTA 
TGGGGATAATTAAGTAATTTATAAATATACATTTAAAAGAGAAAGAAAAATCACGGTGACACAAACTGTGAAGAAACCCAGGCCAATCCCCATTT 
CAGGTTGGTTAGCATGAATTTCAGTGCCTTGGTCCACTGCTCCTCCGAGTTAAACTGGATTCTGTAAGGAAACCATCATTATTATAGGTAAAATC 
TTTACCATTTCTAAACCTACTTAATAGAATAGGAATTTCCCGTGGAGGGATCGATGATCTTGCCCTTCTCCATCTTGTAGGGCAGCAGGAACTCG 
GTGTCGCGCTTCTCGACCTCCTTCTGGAACTGGGTAAGACAGTCGAGAAAAGCTACCATGGCAGCATCAAACTTGGTGTCCCAGAAAAACTTAAA 
TCCGCCGCTGCCATAAAGCGGGAGCTCTCGATTCTCGCCGAGCACCTCAACATACGAATGATTTCCAAAGGGTACTACACGATACCGCTCGAAGG 
TGAGCCCTATCTTACGAGCAAGCGCAGAGAGCAGAAGCACCGTCTGGCCCCAAGCGGCGTTGATCTCTGACCAGTCCACGGATACAGAGGGCAAT 
CGACCCAGCCTAAAGTTATTAATGGTACCGAAATGCCCGGCATGCCAGATGTGAAAGGTGATGTTGAATATGTTGGTGTCGCGCAGTTTGTCCAG 
CTGCTGTTTCGAGTAGGCGATCTGACACTCCAGACTTCGTTTGTCATCCTCGGTGAGCATTAGCTCACGCCTGTGCTTGGTATACTCGCGCCAGT 
AGCTTTCCTCCTGCTCGTGCAGCTCCTCTCGCTCCTGTTCCTCCTCGGCAATGGCATCATTTAGCGATTGTTCCTCCTTTTTGAGCTCTTTTAGC 
TCCGACAGAAGCTGTTGCTCGCTGCGCTTTAGTTCGTCGAGCTCCTTGTCCAGGGCCTCAACGTTGGGTGCTACACGCTGTTGCTCTAGTTCATC 
CAAATAAGCCTTGTACACATCCCACTCGTCCTCAGCGATGCGCAGTTCCCTGTCCATGATTTCCAGCATGGAGTCGGCACACTCTTCGCACAGCG 
GATGGTCAATCTCAGAGTTGGAGGAGAGGCAGTCAAACAGCTCCGCTTTCAGCTTAAAAGCAGCACTCATTTTCTTGTTGTCCCTGCCATCGGAA 
ACCAGCATAAAACCAGTGCCATTTATAGAGTCCGTAAGCCTGTAGGGCGGCACAAAGTGGTCGAAGCTGCTGGCGTCCTGCGGGTCCAATGTATT 
GCCATTGTCTCCGTAGATGGGCACTGAAAAAGTGATTTATTAGACTAATTTCCAGAGTGTATCACCCATCATCACCTACAAGATAACTCCGCCAT 
TGCGTGCACGCTAATCTTCTCCAGCTGCTCGTCCAGGACGATGGGCTGCAGGCAGCGCTGACAGGCGAAGGACACCGCCTGCTTTTCCGCCTCAC 
TCATTGTCGTTCCTGTTTGTTGTCCTTCGATTTTTGCATACAATTAAATCTTTCACCGAGTGTAAAAACACAAATAATCCATCAGCTGTTTGCAA 
CTGCTACCAGAGTTGCCACCCCCTCCGAAAAAGAGTGTGGAAAAATGGCGCAGTTAATCGGACAGTAAATCGTCAACCAACAACAACCTGAGACA 
GTAAAAAAGAGAGAAATAACAGAAAACCAGTTTTAAACAATAACATTGCCTAATTGTGTCCGCCGGAGATGCTTAAGCGTGCCGCGAAGCTGCAG 
GCCTTGACCACAATTCCCGCTGATAAGAGAGAACTCAAGGGGTTGGCTTTTGGTTTGTTTTGGTAAGCGACAGAACATTAGACGGAGTGCCGTCG 
CCAGTGCTGCCAACCTATTTTAGCTATAGCAAATATTTCATAGTAGCAAAGAGGTAGAACCCAAATACAATTTTTATAAATAATAGTTTATAGTT 
AGCGAACGGACTAAAAGAGAAACCTGATATCGAACGACTTATTATATTTTTTGTTAACTCAATTTTAAACTTACTCCAGCGTAAACCATACTTTA 
TAAATAGTTTACCAAAAATGATAGCACAAGTGCCAGAAAGCTATTCACTTCATTTTTAATTTGATTAGGTAAATTGTGCGAAGGAAGTATTTAAA 
AAAATGTGAAATGACGTACTAAGCACCCCACATATAGACTAAGAATTAAAGGTTAAGGACATTGACTTTCGCAATCTTTGTATTGTAGTTAAGCC 
ACAAGCATTGATAAGC AAAGAAAT TGTAAGAGTGAAATAAAATCATAATAAGATG AATTAATT AAAGT AGAGAAACT GATAAAGACTT GCACT TG 
AAGCTGACACATGTGTTTAATAACTATATAATTGCTAAGGGAAATTTTGTTTTTGAAAAGTTTCCGTTATCAAGCTCGAAATTTCTAGCAATGGG 
GCAGCACGAAATTTGCCGCTGTAAATCTAGCTTTCAAAACCTGGCAGCACTGTCGGTGGTCACATTGTAGCCTCTTCGCATTTGACGACCCATGG 
AAAACTTTCAGTCTCTTGCAGGCAGCACAGCGATACGGAAGCAACAATTGAAACGTCCAGAACTCCTCCTCCCACGAGTATTTAAATTCTGACAC 
AAGAGTGCCTGAACCAAAATCCCACTCACCGAATGCTGGTGCTTTGATTTGCAACTGTTCCTAGGTTCCGGCTGAATCCCTAGATTTGCGGATCT 
ACGCAACTCATTCACTCAAAAGCCAGCCG7^ATTTATCGCGCCAGATTAAACAAACAGAAACAAAAAACAGAAGCCGAGACCAGTTCCACATCGTG 
CCATTTTTATCTGGGCATATAGGCAGACGTGATTACGTGATTAGTGCGGC 
(SEQ ID NO: 610) 

Exon: 2755. .2265 
Exon: 2208.. 1161 
Exon: 1106.. 1001 

Start ATG: 2379 {Reverse strand: CAT) 
Transcript No. : CT17226 

CGACGGCACTCCGTCTAATGTTCTGTCGCTTACCAAAACAAACCAAAAGCCAACCCCTTGAGTTCTCTCTTATCAGCGGGAATTGTGGTCAAGGC 

CTGCAGCTTCGCGGCACGCTTAAGCATCTCCGGCGGACACAATTAGGCAATGTTATTGTTTAAAACTGGTTTTCTGTTATTTCTCTCTTTTTTAC 

TGTCTCAGGTTGTTGTTGGTTGACGATTTACTGTCCGATTAACTGCGCCATTTTTCCACACTCTTTTTCGGAGGGGGTGGCAACTCTGGTAGCAG 

TTGCAAACAGCTGATGGATTATTTGTGTTTTTACACTCGGTGAAAGATTTAATTGTATGCAAAAATCGAAGGACAACAAACAGGAACGACAATGA 

GTGAGGCGGAAAAGCAGGCGGTGTCCTTCGCCTGTCAGCGCTGCCTGCAGCCCATCGTCCTGGACGAGCAGCTGGAGAAGATTAGCGTGCACGCA 

ATGGCGGAGTTATCTTTGCCCATCTACGGAGACAATGGCAATACATTGGACCCGCAGGACGCCAGCAGCTTCGACCACTTTGTGCCGCCCTACAG 

GCTTACGGACTCTATAAATGGCACTGGTTTTATGCTGGTTTCCGATGGCAGGGACAACAAGAAAATGAGTGCTGCTTTTAAGCTGAAAGCGGAGC 

TGTTTGACTGCCTCTCCTCCAACTCTGAGATTGACCATCCGCTGTGCGAAGAGTGTGCCGACTCCATGCTGGAAATCATGGACAGGGAACTGCGC 

ATCGCTGAGGACGAGTGGGATGTGTACAAGGCTTATTTGGATGAACTAGAGCAACAGCGTGTAGCACCCAACGTTGAGGCCCTGGACAAGGAGCT 

CGACGAACTAAAGCGCAGCGAGCAACAGCTTCTGTCGGAGCTAAAAGAGCTCAAAAAGGAGGAACAATCGCTAAATGATGCCATTGCCGAGGAGG 

AACAGGAGCGAGAGGAGCTGCACGAGCAGGAGGAAAGCTACTGGCGCGAGTATACCAAGCACAGGCGTGAGCTAATGCTCACCGAGGATGACAAA 

CGAAGTCTGGAGTGTCAGATCGCCTACTCGAAACAGCAGCTGGACAAACTGCGCGACACCAACATATTCAACATCACCTTTCACATCTGGCATGC 

CGGGCATTTCGGTACCATTAATAACTTTAGGCTGGGTCGATTGCCCTCTGTATCCGTGGACTGGTCAGAGATCAACGCCGCTTGGGGCCAGACGG 

TGCTTCTGCTCTCTGCGCTTGCTCGTAAGATAGGGCTCACCTTCGAGCGGTATCGTGTAGTACCCTTTGGAAATCATTCGTATGTTGAGGTGCTC 

GGCGAGAATCGAGAGCTCCCGCTTTATGGCAGCGGCGGATTTAAGTTTTTCTGGGACACCAAGTTTGATGCTGCCATGGTAGCTTTTCTCGACTG 

TCTTACCCAGTTCCAGAAGGAGGTCGAGAAGCGCGACACCGAGTTCCTGCTGCCCTACAAGATGGAGAAGGGCAAGATCATCGATCCCTCCACGG 

GAAATTCCTATTCTATTAAAATCCAGTTTAACTCGGAGGAGCAGTGGACCAAGGCACTGAAATTCATGCTAACCAACCTGAAATGGGGATTGGCC 

TGGGTTTCTTCACAGTTTGTGTCACCGTGA 

(SEQ ID NO: 611) 
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Start ATG: 377 (Reverse strand: CAT) 



MSEAEKQAVSFACQRCLQPIVLDEQLEKISVHAMAELSLPIYGDNGNTLDPQDASSFDHFVPPYRLTDSINGTGFMLVSDGRDNKKMSAAFKLKA 
ELFDCLSSNSEIDHPLCEECADSMLEIMDRELRIAEDEWDVYKAYLDELEQQRVAPNVEALDKELDELKRSEQQLLSELKELKKEEQSLNDAIAE 
EEQEREELHEQEESYWREYTKHRRELMLTEDDKRSLECQIAYSKQQLDKLRDTNIFNITFHIWHAGHFGTimJFRLGRLPSVSVDWSEINAAWGQ 
TVLLLSALARKIGLTFERYRVVPFGNHSYVEVLGEWRELPLYGSGGFKFFWDTKFDAAMVAFLDCLTQFQKEVEKRDTEFLLPYKMEKGKIIDPS 
TGNSYSIKIQFNSEEQWTKALKFMLTNLKWGLAWVSSQFVSP* 
(SEQ ID NO: 612) 

Classification: hypothetical 



Celera Sequence No. : 142000013384004 

AATAATTTGTTTGTAAATTTCTATTTTTACTTTGTGAAGTAATGATTTTATATTAATAATATAATTACCAAGACTAAACGGAATAATTATAAGGT 
TGTGTATTAGTAGAGAACAAAAACAGAATGAAATACATTTGTAGGAATCACCTAATATTTAATTATATTTTATATTATATTAATTATAATTATAT 
TTGTTCTTTATACAATTTTATACATTGTATTTTATAGTTTTATTAAATATTCCCGCTTAAAAAATAAAATGTTGTTTCTATTTAAATTACCGCAG 
CTGTTGGTGGTAATTTAGATGACCTGATTCGATTTGGTATTATGGGAACTTTTCCAGATTTGGTCACACTGCCCTTTACACTGTTTTGGTTTTCG 
TGAAAATGTCCACATACTACTTTGTTATTGTGGGTCAAAATGATAATCCCATTTACGAGAAGGAATTCAGCACCGTGAACAAGGAACTAAGGGTA 
TGTAGACCTGGGCAACGTCTCATAGAACTGTGTAAATACATTCTCCTTCTGCAGAAAGAGGACCACAGGCACCTCACCCAGTTTATTGCCCATGC 
CGCCTTGGATTTGGTGGATGAGCACAAATGGAAGACGGCCAATATGCAGCTGAAATCCATTGATAGATTCAATCAGTGGTTTGTTTCGGCCTTCA 
TCACAGCCAGCCAAATACGATTCATCATCGTTCATGACAACAAAAACGACGAGGGAATCAAGAACTTCTTTAACGAGATGTATGACACGTATATC 
AAAAACTCCATGAATGCCTTCTATCGCATCAACACACCCATTAAATCTCCGATGTTTGAGAAGAAGTCGGAGATCTTTGGTCGGAAGTATTTGTT 
ATCATAAATTTGTTAAAACCTGTAGCCATTAAGTAATTGTGTAATATAAAACTTGAAATAATTTAAAAGCATAACTTTACCCTTTCTTATACAGT 
TGAGAAGACCCGAAACCAATGAATAATCTAGTTTATATTTTATTTAATTTATTTAGAAAAGCAGCCTAGGCTTAACATTGATGGACTTGTTTTAT 
TCACTTTTTACTCAATGCGCCCTCGAATCAAATCTAAAACGATTTAACGCCCGTCTCCAGACGACAGTCCCAGACTGGAGTAAATTAGATGTGTG 
GGTTGAATAGATGAACAACGCATGGAATCAATTTGAAACTAAAGCAACAAAATATACGGAATTTGTAATGGGTCTTTGCAAAGAAATCTCTTGGT 
GGTGGCGAGGAAACCGTCCGTCCTGCAGCTTAGTGCTTCTTGTTCTTTCCAGAGCGCTCATATTTGCTCTTGGACTGGTGCCACAGAGCCAGAGG 
AATGGCCAGGGAAGGCAGAGTCATCACGGCGAAGGCCACCCAGTCGAAGAAATGGGAGGAGAAGCGCTTGTTGTACTCGTTCAGGTTGACAATTC 
CACCCTCACCGGGCTCAGAGCTGACCGCCAGTTGGAGCTGCAAGAATTGTTGGGTATTAAGTTCTTTGATCGATCTTTCAACAGGTAACGGGCAT 
TTTTAAAAGACTGTTGTGTATAAATTCACTTAGTTGAGATTATTAAATTTAAAAGTGAATGATTTTGGTTTGCTTTGGCGGTGGCTTCTGTAACA 
CTGAGATTAGATAAGATTTTTCAGGGCGGACACTGAGATTTGATGCCCCCACATTCAGCTATAACTTACCGTCTCCGACTCCTCGACAGCCTTGT 
AGGACACCTCGGCGGCGGTGAAGTTGAAGTAGCCGAAGGCCTTGGGCCTCACCACCAGGACATGGGTGAAGTTGGTCTGGGGAGCGATCCTATCG 
ACCACGGCGGTGGGCTGTCCTCCGACAACGTCGAAGGCCTCCGGGTGGAAGCCGGAGTCTACCAATCGCACCTTGGTGGCGGCACCGCTGCCCAC 
GTTGAAGATGGTGTAGCGTACCAACAGGTCGCTCTTCTCCACCAGGTACTTGTTCAGGATCTGCTTAGAGACCAGGAGACGGGCACGGGTCTCGT 
CTTCCGACAGGCAGGTGCTAAACACCGCGCACAGGGCGAACAAAGTCAGCAGGTGCTTGAACATTCTACAATCACAGATAATATGAAGTTTTATT 
TTCACGTTTCACTTTTTTGTTGACGTGGAAGGGTCGCCGAAAAACGAGCGGCACATAAGGGCGACCGCTTTCCGGTGTGTTTTCTAGACATTCAC 
GCATCATAGCTGCCCCGAGCACACGTTCAGCATAAACGTACACATTTTAATTCATAATTTAACTACTTACTTGTAATTTGGGTATTTCCTTATCG 
AGTATTTCAGTACAACTAAGCAGCAGTCGTAGCAGGCAGCCGCTTTCTCGGTGGTTCGTGACGTGGAATGGATTCGCTGTCAAACTTCGTGCGTC 
AGTGGTGCCAGGGCTGCGTCAACATATCGATGTATCTGCTGGCTGTCGGCAGTATATTAATACCAACACCATAGTATTTTCGTCGGTTATGAACA 
CAACACAACTAAATTTTTTTGTGAAATTCATAAAGCTTAATTTGTTTTTTTATCACCAGTGAATATATTTAGTAATAATGAACGAAATAGTTTCA 
AATTAAAACCGTTAGAAGCCGCTATGGTGTAGCTGTGTGCAGTGCTACATAAAAAGGAATTATGTTTATAATAACAAAAATTAAAAAAAAAAACG 
TTTTAACGTAAACGTATACACTTAAATCTTTTCGAAATTAAATTATATTAAAATCGGTTTCAAATTGTTTTCTATAAAGTTTAATTTAGCTTTAA 
ACAAAAAATTAAGTCGTTATTTTTAATTTGCATTCATATATTATTTAATAAAAATTTAAATTGCATGCAAAAGAAGTAAGGAAGCCTGAAATTTT 
AATTATTTCTGAAATTTTAATTATTTCTGAATAGCGCACAACTACTCGTGGAGATTCGGCAACTCCGCTCCCCAGCGTCTGTTAAGCCGCTGTTA 
ATAAGCAGCAGCTGTTGTTTGAAACACCATAAACAAATGTAAACAAAACTCTTTTTATGCCTTTTATGACTTTAACTAATGTTTTGCTACGCGGT 
TTTAAAATAGTTAGGCATTTTATCTGCTTTTGTTCGTACATTTGGCATACATTTCAAATAGAACAGCTGTTAGCCGGCACCCGGGAATGCCCTTG 
A AT G AC AT T GC AC C C T AC AT C AAGGT T C T C G GC C T C T C AG T T C AC T AAT G CAT CAT GAT AAAATT AC C TC AAAT C AC GAT AAAGAAATG AAA T T C 
AAGGGGACAATAGTAGATAACTGGATTAATGGGTTCTTCAGCATTTTAACGACGCACCCCCAATACTAGATTAGGACTGACTAATAGCACGCACT 
AATGATAGAGGTTATCTTGATCGCCATGACCGAACCGATTTGGCTTATCAGTGGGAGGCAAATTTCGCTGTGCACACTCAATTTTCATCGCACGC 
AACCGCGGTTAAGTCGTCCAGTTGGAA 
(SEQ ID NO: 613) 

Exon: 2447.. 2439 
Exon: 2336.. 2256 
Exon: 2059.. 1685 
Exon: 1462.. 1001 

Start ATG: 2447 (Reverse strand: CAT) 
Transcript No. : CT17318 

ATGGTGTTGAACCACCGAGAAAGCGGCTGCCTGCTACGACTGCTGCTTAGTTGTACTGAAATACTCGATAAGGAAATACCCAAATTACAAATGTT 

CAAGCACCTGCTGACTTTGTTCGCCCTGTGCGCGGTGTTTAGCACCTGCCTGTCGGAAGACGAGACCCGTGCCCGTCTCCTGGTCTCTAAGCAGA 

TCCTGAACAAGTACCTGGTGGAGAAGAGCGACCTGTTGGTACGCTACACCATCTTCAACGTGGGCAGCGGTGCCGCCACCAAGGTGCGATTGGTA 

GACTCCGGCTTCCACCCGGAGGCCTTCGACGTTGTCGGAGGACAGCCCACCGCCGTGGTCGATAGGATCGCTCCCCAGACCAACTTCACCCATGT 

CCTGGTGGTGAGGCCCAAGGCCTTCGGCTACTTCAACTTCACCGCCGCCGAGGTGTCCTACAAGGCTGTCGAGGAGTCGGAGACGCTCCAACTGG 

CGGTCAGCTCTGAGCCCGGTGAGGGTGGAATTGTCAACCTGAACGAGTACAACAAGCGCTTCTCCTCCCATTTCTTCGACTGGGTGGCCTTCGCC 

GTGATGACTCTGCCTTCCCTGGCCATTCCTCTGGCTCTGTGGCACCAGTCCAAGAGCAAATATGAGCGCTCTGGAAAGAACAAGAAGCACTAAGC 

TGCAGGACGGACGGTTTCCTCGCCACCACCAAGAGATTTCTTTGCAAAGACCCATTACAAATTCCGTATATTTTGTTGCTTTAGTTTCAAATTGA 

TTCCATGCGTTGTTCATCTATTCAACCCACACATCTAATTTACTCCAGTCTGGGACTGTCGTCTGGAGACGGGCGTTAAATCGTTTTAGATTTGA 

TTCGAGGGCGCATTGAGTAAAAAGTGAATAAAACAAGTCCATC AAT GTTAAGCCTAGGCTGCTTTTCT AAAT 

(SEQ ID NO: 614) 




Start ATG: 1 (Reverse strand: CAT) 



MVLNHRESGCLLRLLLSCTEILDKEIPKLQMFKHLLTLFALCAVFSTCLSEDETRARLLVSKQILNKYLVEKSDLLVKYTIFNVGSGAATKVRLV 
DSGFHPEAFDVVGGQPTAVVDRIAPQTKFTHVLVVRPKAFGYFNFTAAEVSYKAVEESETLQLAVSSEPGEGGIVNLNEYNKRFSSHFFDWVAFA 
VMTLPSLAIPLALWHQSKSKYERSGKNKKH* 
(SEQ ID NO: 615) 

Classification: hypothetical 
Gene Symbol: SsRbeta 
FlyBase ID: FBgnOO11016 



Celera Sequence No. : 142000013385214 

AATTAGCGGAATAGCTTTTATTGGACGGGCTAAAAAAAAAAATCAAAGCCAATCAGTGGCGTGATCCGAAATCAACACCATCTTGGCCATCACCT 
AGATGGGTGGTGGAAGATAGTGGGCTGGTGAACTCGATCCCTTTTGACATGTTTACACATTTAATTAACTTACACGCGCCGAAAGTTTCCGGACA 
CGTTAGATAGTCCCCGTCTAATCACACCCAAGCGAAAACCCCACCCGCATGAACTTCCAAAATGGGGCTAAGAAAACAAAAGGATAAAGACCCTT 
TCGCAACCCAACCCCTAAGTCTTCCCTTTTTGACAATTTGAAACCAGTTCAAGTGGTTTAATCATTTAACGCACATCATAAAAATCGCATCAAGT 
GATTTTTATTTACACGCCCAGGACAAAAAGGATCCCCAAAACCTAAACGGAATACAATGATTGGGGCGGGGGTGCGAAATGGTTGGGGGTGAGCC 
ATCCCTTTGAAACCCAAGGCGTTGGCTAAGAACAAGAAAAATGAACTCGCTCACTAGATAAGGTTTCGGTTAGTTGATAGACTTTTACGCTGTAA 
TGAAATCTTAGAAATTCCATTTAAGAAGGAAATATATATGTATTACAAATAAGCTTAACCGCAAAAGGTATTAGCTTTTAATCTTAAGGAATATA 
TAAGTCTTTATATAATGAGTACGAACTTCCTAAACATTTTTCCTTACTCCAATTTGTGATTAACTAATTTTATAGAACTATTTAACCTCAGAACA 
AGAGCACATATCAGTAATTAAATTCAAGTTTTCATGTAGGTTGTGTTAGCGAAGTATTTAATCATATAGTTTTATTTTTTAAATACATGTTAAGC 
TGGTGAGGCGTAGTGTATATGTGAATTTTATTCAAAATTTAATGCGCAACCGCATAATGGACATAAGATGTGAGACTTTAGATGAATTTGCGTGA 
AGATTGATCTTGATCAGAGGCAGACGACCGAAGAATGCGGATTACACTGCTTAACTGAAGGTGTTGACCTGTGAATAAATAAAAACATTCATTTA 
AATTTAAATCATTTCATTCATTTGTATAACACTTACAATATCGTCGATCTTCAAGATCGATCGAATCGTCTCCGTGGCCAGGGTGATCGACGAAA 
TACTGACCAGCAGCGGCTGCACAACGTTCTCCGCGAAAATGTCTGTGATGGCGCCCTTGCGCACATTAATACCGGCGTTCTTCTCACCCTGAGCG 
TGGCGGTTGCGTAGCTCCGTGACCGTGGCAATGGGATTCAAACCGGCGTTCTCAGCCAGAGTGGAGGGAATCACCTCCAAAGCATCGGCAAATGC 
ACGGAAGCAATACGCATCCACACCCTCCACGGTTTGTGCCAATGCGGCTAGCTGCAGGGCCATCTCAATCTCGGGCGCACCACCGCCGACAATCT 
GGGCGCGCAGCTTCACCAGGCAACGGACGACACAGAGGGCATCGTGCAGGGAGCGAGCAGCCTCCTCGAGCACCAGCTTGTTGGATCCGCGGCAG 
ATGATCGAGACAGTGCGTCCCATGTTCTGAATGCCGGTGATCTTCACAAACTTGTTGGTACCACTGGCCACCTCCTCGACCAGATCGGCACTGGA 
AAGGTTCTCGGCTGTGAAGTGATCCAGCGAAGCGATCGGTCGGCAGTGCAGGGTCTTGCACACGAACTCAATATCCTCGCGCTCCACATCCTTAA 
C GAC CAT GC ACT TG AT CT TGTC CAGG AAAT GCT G AGC C AG AT C AGAG AC AG C AT CGC T AG AT AAGTGGAGAAC T A TC AGC AC AAA GTGG A C AAAT 
AACAAGTAAGCTAAACCTACCGCAAGATAGACTTCTGAACGAGAAGAACATTGCATCCTGACTTCTTGATCTGCTTGACAATGTTCAGGATGTAT 
GAACGCTCCTCCTTGAGCACACGATCCATGGCAGCGTAATCCGATACAATCACATTGTGATCCATCTGTAATTTCGGCATAAATTATAAAAGACA 
ATAAATTGTTAGGATCATAGCGATGTACTCACATCGGTCTTGGGGGCCGAAATGCAGAACTGGATAAGACCGATCTTGGCCTTCTCAATGCGCTT 
GGGAGCGTTGGATCCGGCAGAGCGGCAGGTGAAGACCAATCCATCGACCAGCTCGGTGTCCTCAACAGTGCCACCCAGGCTGGAGATGACCTTGA 
TGTTCTTGAGATCCACAGAGGTTTCCTTGCCGGGATCCGTAACCTTCAGCACGGCATCCACGGCAATGGGGGCCAGCAGGCTGCTCTGCTGCGAC 
ACCACCTTGGAGTTCAACGAGGTGGAGGCGCTCTTGATCAGCGTCTCGCGGTCGTCCAGCTCGATGGGAGTGGACATCTGTTTAAGAATCTCGAC 
GGCCTTGTTCGAGCAGCGCTGGAACGAGTCCGAAATGGCCGTGGGATGCAGACCCTTCTGCAGCAACTTCTCGCACGCCTCAAGCAGAGCTCCGG 
CAATGACAACCACGGAGGTGGTGCCGTCACCAGCGGCCACATCCTGGGCACGCGACAGCTCCACCAGCATCTTGGCAGCCGGGTGCAGTACATTC 
ATCTGCTTCAGGATGGTGGCGCCATCGTTGGTGATGGACACCTCGCCGTTGCCGGCCTGGATCATCTTGTCCATGCCGCGGGGGCCCAAACTGGT 
GCGGATAGCATCGGAAACGGCTGGAAAACAGAATTATTTCAATAGGTTATAAATGTATTTTAATATTTCGTTAAAACTTAATGAGAAACTCAGAT 
AAGTGGAATTCGTGTGGTTAGTTTTACACAGCAGCATGACTAATTTACATACATACATACAAAAAGACATATGGTGTTTTATGTGTTAATTCAAA 
AAATCTACTGTAACTTATCTTAAGAGATAAACAATAAAATCCTTGTAGCGGATAAAAGGGTATTGGGAATAGAGTATTAAAAATCCTCAATTTTT 
AGGTTGTTTCAATTCATCACATGAGTAATGCGCGCTCTTCTTCTTCTTCCGCCCAAGAAGTTTCCAGACAGCCGGCATCCAACTGCTTTTTCAAT 
AGGCCATTTGGCCGGAAATTCTGATGCACCACTGCTCGGAATTCAGTCACCTCGGTTCTTAGCTCAATAAATAGCACATACGCCAACATTTTGAT 
ACAAAAATGTGTTTAATTTTGTTAATTTCCTCGCTTTTTCTCACCTTTGGCCGCCTGGATGTTGGACAAACGCACATCGGTGGGCTTCGACTTGT 
CCTTGAAGGCCTTGGCCGTTGGCTTGATGTTGACAGCTGCTGCCTTGGGCGCCATTGTAGGGTTAAGTTATAGTTTTTTAACAAAACACAGTTAT 
TCTACAATAAATTCACAGCGATTTACACAGAACACGTTGCTGCGAAATGTTAAGGAAAGTGTGGCCGCACTCGCTGAGCTCAAATATACCGCCAA 
CTTATCGGCATCGATAACATCATCCCTTGTGAATGGCAAATTGGTGTGACCGTATTTTGCGTTGTGACTCGGCTGCTTTATTTACATTGAATTTT 
TGTTCGATTTTTGAATTTAAAATAAAAATAATGCCGCACGGACACTCTCACGATCACGGCGGCTGCAGCCACGAGGCCTCGGACGTGGACCACGC 
CCTAGAAATGGGCATCGAGTACAGCTTGTACACCAAAATCGATCTGGACAATGTGGAGTGCCTGAACGAGGAGACGGATGGCCAGGGAAAAAGTG 
TCTTTAAGCCGTACGAGAAGCGCCAGGATCTGTCCAAGTACGTGGAGAGCGATGCGGATGAAGAGCTCCTCTTCAACATTCCCTTCACCGGCAAC 
ATCAAGCTCAAGGGCATCATCATAAGTGGGGCCAATGATGACTCCCATCCAAACATGGTCAAAATGTAAGGAACAAAAGTCAAGGCTGCTTTAAT 
ATTACTATATTCATCATATATCTTATTTTTCAGCTTCAAGAACCGACCCAGGATGACCTTTGATGATGCCAGAGCAAAGCCCGACCAGGAGTTCC 
AGCTGACCCGCGATGCCCGCGGAGAAATCGAGTACTCACCCAAAGTGGTCACCTTCTCCTCCGTGCACCATCTCTCCCTCTACTTTCCCAGCAAT 
TTTGGCGAAGACATAACACGCATTTACTATATAGGTAAGCTCACAACAATCTATGTTTTATCTGTGTTCCCAACTAAAACCAAGTCACTATTCAA 
AGGTCTTCGAGGAGAGTTCACAGAAGCTCATTACCATGGCGTAACCATATGCAACTACGAATCCCGTGCCAATGCAGCGGATCACAAGGAAAAGG 
CTTTCGATGGAGTCGGCAGAGCCATCCAATAGGGTTATAGAGATTAAAATTTACAATTTAATTGAGTATCTCTGTAGTGTATATATTTTGTGATC 
(SEQ ID NO: 616} 

Exon: 3370.. 3180 
Exon: 2680. .2028 
Exon: 1965 . .1826 
Exon: 1766.. 1082 
Exon: 1018.. 1001 

Start ATG: 3285 (Reverse strand: CAT) 
Transcript No. : CT17430 

TCGCAGCAACGTGTTCTGTGTAAATCGCTGTGAATTTATTGTAGAATAACTGTGTTTTGTTAAAAAACTATAACTTAACCCTACAATGGCGCCCA 
AGGCAGCAGCTGTCAACATCAAGCCAACGGCCAAGGCCTTCAAGGACAAGTCGAAGCCCACCGATGTGCGTTTGTCCAACATCCAGGCGGCCAAA 



FIGURE SHEET 333 



GCCGTTTCCGATGCTATCCGCACCAGTTTGGGCCCCCGCGGCATGGACAAGATGATCCAGGCCGGCAACGGCGAGGTGTCCATCACCAACGATGG 
CGCCACCATCCTGAAGCAGATGAATGTACTGCACCCGGCTGCCAAGATGCTGGTGGAGCTGTCGCGTGCCCAGGATGTGGCCGCTGGTGACGGCA 
CCACCTCCGTGGTTGTCATTGCCGGAGCTCTGCTTGAGGCGTGCGAGAAGTTGCTGCAGAAGGGTCTGCATCCCACGGCCATTTCGGACTCGTTC 
CAGCGCTGCTCGAACAAGGCCGTCGAGATTCTTAAACAGATGTCCACTCCCATCGAGCTGGACGACCGCGAGACGCTGATCAAGAGCGCCTCCAC 
CTCGTTGAACTCCAAGGTGGTGTCGCAGCAGAGCAGCCTGCTGGCCCCCATTGCCGTGGATGCCGTGCTGAAGGTTACGGATCCCGGCAAGGAAA 
CCTCTGTGGATCTCAAGAACATCAAGGTCATCTCCAGCCTGGGTGGCACTGTTGAGGACACCGAGCTGGTCGATGGATTGGTCTTCACCTGCCGC 
TCTGCCGGATCCAACGCTCCCAAGCGCATTGAGAAGGCCAAGATCGGTCTTATCCAGTTCTGCATTTCGGCCCCCAAGACCGATATGGATCACAA 
TGTGATTGTATCGGATTACGCTGCCATGGATCGTGTGCTCAAGGAGGAGCGTTCATACATCCTGAACATTGTCAAGCAGATCAAGAAGTCAGGAT 
GCAATGTTCTTCTCGTTCAGAAGTCTATCTTGCGCGATGCTGTCTCTGATCTGGCTCAGCATTTCCTGGACAAGATCAAGTGCATGGTCGTTAAG 
GATGTGGAGCGCGAGGATATTGAGTTCGTGTGCAAGACCCTGCACTGCCGACCGATCGCTTCGCTGGATCACTTCACAGCCGAGAACCTTTCCAG 
TGCCGATCTGGTCGAGGAGGTGGCCAGTGGTACCAACAAGTTTGTGAAGATCACCGGCATTCAGAACATGGGACGCACTGTCTCGATCATCTGCC 
GCGGATCCAACAAGCTGGTGCTCGAGGAGGCTGCTCGCTCCCTGCACGATGCCCTCTGTGTCGTCCGTTGCCTGGTGAAGCTGCGCGCCCAGATT 
GTCGGCGGTGGTGCGCCCGAGATTGAGATGGCCCTGCAGCTAGCCGCATTGGCACAAACCGTGGAGGGTGTGGATGCGTATTGCTTCCGTGCATT 
TGCCGATGCTTTGGAGGTGATTCCCTCCACTCTGGCTGAGAACGCCGGTTTGAATCCCATTGCCACGGTCACGGAGCTACGCAACCGCCACGCTC 
AGGGTGAGAAGAACGCCGGTATTAATGTGCGCAAGGGCGCCATCACAGACATTTTCGCGGAGAACGTTGTGCAGCCGCTGCTGGTCAGTATTTCG 
TCGATCACCCTGGCCACGGAGACGATTCGATCGATCTTGAAGATCGACGATATTGTCAACACCTTCAGTTAA 
(SEQ ID NO; 617) 

Start ATG: 86 (Reverse strand: CAT) 

mPKAAAVNIKPTAKAFKDKSKPTDVRLSNIQAAKAVSDAIRTSLGPRGMDKMIQAGNGEVSITNDGATILKQMNVLHPAAKMLVELSRAQDV^ 
GDGTTSVVVIAGALLEACEKLLQKGLHPTAISDSFQRCSNKAVEILKQMSTPIELDDRETLIKSASTSLKSKVVSQQSSLLAPIAVDAVLKVTDP 
GKETSVDLKNIKVISSLGGTVEDTELVDGLVFTCRSAGSNAPKRIEKAKIGLIQFCISAPKTDMDHNVIVSDYAAMDRVLKEERSYILNIVKQIK 
KSGCNVLLVQKSILRDAVSDLAQHFLDKIKCMVVKDVEREDIEFVCKTLHCRPIASLDHFTAENLSSADLVEEVASGTNKFVKITGIQNMGRTVS 
IICRGSNKLVLEEAARSLHDALCVVRCLVKLRAQIVGGGAPEIEMALQLAALAQTVEGVDAYCFRAFADALEVIPSTLAENAGLNPIATVTELRN 
RH AQGEKN AG I N VRKG AI T DI F AEN V VQPLL V S I S S I T L ATE T I RS I LK I D D I VNT F S * 
(SEQ ID NO: 618) 

Classification: chaperone 



Celera Sequence No. : 142000013384254 

AAGTTACGAAACCAAAGAAAAAAAATGAATGTTTCCTATTTCCCAACCGAAAAC 

TTTGAAAAATTCATTTGTTTATATACTTTAAATCGTCATATAGATGGCGCTACATTTGCCATTGTCACTCACAGCGCAACGCGCACACCTTATCA 
GCTGAATTTGTTGCCATTCAGAATGCAGTGTTTGTGAAGGCGTAGTTCTGTTGTTTTTTATATTTTCCCTCTTTAAAAATACACAAAATGTTGAA 
ACCCAAAGCTTTGACGCAGGTTCTCAGCCAGGCGAACACCGGCGGAGTGGAGAACACCCTGTAGGATAGTCCCTTACCATGCCATTCCTTCAAAT 
ATCACTTATCCCTGCATCCACAGCCTGCTCAGCCAGGAGGGAGCTCTATTGGCCTACTCCGGTTATGGCGATAAGGATGCCAGGATAACGGCGGC 
CATAGCCAGCAACATATGGGCGGCCTACGAGAAGCATGGACGCAACGCCTTTCGCGAAGGTCGCCTCACTTTCGTGCTCATCGATTGTGAAAACG 
GTCATGTGGCCATCACACAGGTGGGTGATCCTCTTTTTTTTAGCAGGGATTCGTTCTATAAAATCTTAACTATATTCCTTTTTACAGGTTGCCAG 
TGTCCTTTTGTGCCTGTACGCCAAGCAAACGGTGGGATTGGGTCTTCTGAAGCAAAAGGCCATGTCACTGGCCTCCTATTTGGAGCGACCGCTTA 
AGCAGATCTCGGCCTCATAAGGATAAAACAATGATGTACCAAGTTCCGAAATGAATAAGTTAATAAGTGTATGCATCAAAAGTGAATCAAATGAA 
TATGAATAGTCTGCGGAAAAGTTTAAACGATAATATATTAGTACTTGAAATTATGAATCTTTAAGGATTTTGTTAGTATTTAAGGAAAAACTTGT 
CTGCTGGTTTCCAAAAGTAACCCAAAGATATTTCGTATTCTGAATTATTTGGATAGAGGGTCAGGTTTATTATTCTGCAAGATTTGTTCAAGAAA 
ACACACAATCATTAGGGAATATTGTATAAGAAAATTTTGGACTTAATTACTAAGTGTGGACTTATTCGATGGCGTAGAGCTTCAGGGTGCGGTCC 
ATGCTGGTGGATGCCAGATACTGGGCGTGCTTGCCGAATCTCACGCCCGTTGCCAGGGCCGTGTGGTCGTTGAATACCTTCAGTTCCTGCCACTG 
CTTGCACAGGTAAACCCTGTAAATAAAACGAAGCAACCGTCAGAATTTGTATTTGTACCGTGGCAACCAAATAAACAAGTTAACTAAGGGAATCT 
TCTTTGGCGAATTTAACTTAACAATGCGAAAGTTTAAAAGAAAAGTTTACCGCGAATATGAAAGTAAGTATTATATTCTAATATGCGAAGACTTA 
AAAATGCATGCTGCTCTTCAAGATTTCGTGCGACCCAAGACCCATGTGGAACCGGAGCCTCCAAAGCCCATAAACTTATCGATTGAGGACTTCAA 
TGCGGTGCGAAATAATTAATTTTTATATTTCCAAAGTTAAGTTTTATTAATATTCCTCAGCGCGGAGAGATCGCCAATCCCCGTGCTCCTTGTGT 
GGTCACCACCATCGATCCCGATTATATTAAGGACGCCAGTGCGCAACGTTTGGCCAGAAAATTTCGCATGAAAACGTAAGGAATCAAATGAATTA 
TGTGCTGGTGCAATTTCACAGCTTTTTCAGGGACATCCTACTCTTGAGGGAGATTACCCGCAATCGCTTTATGTACTATGCTACGCAAGAACTTT 
TTTTGGACCGAAAACGCGAGAAGGAATACCAAGACAATGTATACGAAAAGGCCGAGGAGTTTCACAAGTTTGCAGAGGAGTCGCTCAAGAAAATG 
GAAGCGGAGGAGTTTAAGAAAATGTTAGAGGCCCAAGAAAAGCTGAAGACACTGAAGGAAAGCAAACTGAGCGAAGAGCTGAACGAGGTGAAGCT 
GGAGCATCAGCGAGTGCTGATGGAAGTGCAGGAGGTCTACGGTCGCTTTCAGTATTTACTCAAACTGATCAGCTACTTTGATGACACAGTTGAGG 
AGGAGCAAACCGAGGGAGACGATGCCAGCAATAGGCTTAGCATGCCGCAGGAAATCGTAGCCATGGAGATCACCGAACGACATCCAGCGGGTCTC 
GAGCAAGTGCGTCAAGTCCAAGACTACATGGAGAAGATAGTGCGTCCGTATTTGGCAAAGCGGAATCATCTCAATGCAGAGATTTGGATACGTGG 
CTATGAGAGAAATCGCTTAAAGGTGTCCAACTACAACAGGCGCTTTTCTCAGTTGGCAATCTTGAACCACATCGTAGCCATCCTCCATGACAAGG 
CCCAGAAGACGTACGAGCGTCAGGTGAAAGCCAATGTGTTCCTCAAGGATCCCGAGTTCCTGCAGCGACGGGTGGCAGGTCTTCAGGAGCGTGCC 
AAGGGTCTATTAAATGACTTCGAGCAGAGATACTGCAAGGAGTGAGTTGGGATGCGTACATATTTGAATCAACTATTTTATATTCTTAATTTCCA 
GCAAGCTGAATATCAAGCTAAATGCTATTGTACCAATTATTCTCAAGCAAATCCAAAACAAAGTTAAGCCGGAGCAAAAGCGAGAGGTTACGCAG 
CCGCCATCCAAGCCAGCAGCCCAGAAGTCGTCCAAGCAATGGCGAACACTGGAAGACATGCAGCCTCCACCACCGCCACCCAAGTCTACATACGC 
GGACGAACAGGACACGACTCTTGCCAAGGTGGAGAAAATCCAGGGCTACGTTTTGGAGCTCTTGGCTAAACTGGACCACATCGAACCGAGCAAGC 
TGCGAAGTGTCGAGCAGCAGGTTCGACGTCGCATGGCAGCGGAGAAGGAAATGTCTCGCCGTGCGTTAGTCAAGCAGAACCGGCTACACAACTGG 
ATGGCGCATTTCAAGAAGCATCATGATCAGCAGGTTGAGCAACGATGGAAAAATAAAGTCTAAGCTAGACAACAACACCATACTTAAAACATTAA 
ATGAATACAATTTAAAACTTATTTTAACCTCATTCTGAATGATTTCAATAAGAAAAGTACAATTCACTTACCTGACATCGCTGCCGGCGATAGCC 
AGATAGGTTCCGCTTTGGTCGAAGCACAGATCCTTGACCTCGTAGCCGTCATCAAGCTGGATTGTCTTAAAGTTCTTCAGCTTTCGCAGATCCCA 
CAGCTTAACGCAGGCATCATCTGCAGCGGTGGCCAGGTAGTAACCGTTCTCCGAGAAGGAGATGGCCGAAATGGGACCGGTATGACCAGGGAAGT 
TGGCCACATTGCTCTGCTCCTTCAAATCCCAGATCTTAACCTGCGAGTCCACGGTGCCCGTACCGAAGATCAGACCATCAGGATGGAATTGCGCT 
GTCGTCAGCCCGACTTCGGCGGTATCGATTACCTTGGTGAGCAGGCGTCCTGTGCGGATATCGGAGAATGCCCAGTGTTTGTCGGATGATGTGGA 
CAGCAAGTAGTCGCCAGTGGGGTGCAGCGAAAGGCCAGTAACAGGTCCTTCGTGGCAGCGCAGGAGAAGCTGCGTCTGCGAAGTGGGCACGTGCC 
AGATGCGTATGTTCATGTCGGGCGAGCCGGTTATGACTGTATCCTCGTTAGGATGATATATGACTTTGGTGATCTTCTTGGTGTGTCCCTTCAGG 
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ATGGCCACCATCTGCTCTGTGTCCTTGTTGAACACGGTGGCGTTTTTATCGTTGCCTCCGGTCAGAATCTTACTGTGATCAGCACTGTTGATGTC 

CAAGGCCAGAATGCCGGGAACGGAGGCCGAGTGTAGGCCGGGATGGGATGCCACAGTAAGGAAGTTCTTAACCTGATCCGTTGTAACCAGATCCT 

CGGGCACAGTGCGTCCGCGCTTCTTACGCTCCTGCGTCAGCACCGTGGCCTTGTCCTGCAGTTTCTGTATGACCTCGGCGCTCATGCCCGCCTGC 

TCCATGGGATGGGCAGCAGCTCCGCCAGCCTCAGAGGCCAGGGCCGGTTGTGGTATTGCCGTGGGTGCGTTGGCAATGCCCGCCTGCGGTTTCAA 

CGTGGCCAGAGCTTCGCGAGCGGCGGCCACCTCCTTATTGAGGCGGGCAATCACACGACAGGCGGCATCGTGCTGATACAGCGCATGGGACAACT 

CCTGGCGCGTCGTTTGCAGCTGCTGGCGTTGTGTGAACGAGTGAATCATGAGGGCGTCCCACTCGTCCTGCATCGTTTTCAAAGTGGCCGGAATG 

CTGGTGGCACTTGGCGGTTTCGGTTTCACCACAGCCGGCGTCTTGATCTCGATCAGCTCCTCCGGCTTCAGCTCCTTGCCGCTAATGGGATCGCA 

GCCATTCTCTAACAGATACTTCTCAATCACCCGCTTTTCGAAAACGGCTCCCGAGTGCGGGGAAACCACGGGCGTCTCGGGGACTTCGTTTGTCA 

CTTGGAATGGAAAGTGCGATTCATAATGGCCCAATTCAAATCAAAATCTAAATTATTCACTTACGTGCGCAAACCAAAGCCATTGCTGTTTGTGA 

TTATGTCGCTTTTCGCGTAGTCTTGTTGGTAAATTAAATGTTTTACACGTTTTTCCCGAGCCAAGCGCAAAATTGTCGGCAAAGCCGCTTCTTCT 

TATTATGCCAGTTACTCGGCGACGCGGGAGTTTCGAGCACAGGGCTGCCAGATCAAATGGATTTTGGCTAGAAATTTGCACAGAAAAAACTAGCT 

AGAATTTAAACAATTTATTTTATTGATAACACAACACATTCAATAAGTATTTAATCCATAACAATAGTTTTTATAGATAAATGACATTTACTGAT 

AGAGAAGTGAGTAACATATCTTAAATCAATCAGCTATCTCACCGTTTTGCCAATCACAGTTCCGCACGCACAATCCATAATTGCTGGCAACCCTA 

GCACGTTGTTTCTGTTATTCAATTCACGATCTAGTTTTATAACATAGTTTACTGCCATTAAGGTATATTTTTAAAATGCTGAAGCATTTTGCACG 

ATGGCGTCTGGGCAGCCAGTTGCTAAAGGGATGTGCCGCACCAGTGAGGCAAGCCTCCAAAACCTCCAGCGCAGAGAACCTGATAGCCGGCACAG 

AGGAACCCCAAAAAAAGTTCGTCAATCCGTTTTCCCAGCCCGCTCCTGCGTTAAGTAATGATACGATATCAGAGAACAAGGAGGAACGGGATAAG 

CGACTCAAGGTGCTGCAGCTGGAGGCGGACATTGCCCACCAGGAGGGTCGGCGGGTGCCGTCACTGGAATTCTTTAAGGATCATCATTGGGAGCA 

CGTGCTGACGCTACCCACAAAATCAGCTAGGATCAAGTACTTTGGCTATCTCTGGCAAATCGAGATGAAGAAGGAGGCGGATCAACGCAAGAAGG 

CAGAGCGAGCGAAGGAAGCCGAGCGGCGGGTTGCGGAGATGCGAAAGGAGCGCGAGGAGAATACGCACATTATCTATGGCCTGGGACACACATCG 

TTGTTTCTGCGTATCTATGATACCACTATTAATCACTGGCAGAACAACCGACTCACACGGGCCATGCAGTTTGCCCCCAAAATGGTTTTGGATTG 

CTCCTACGATGAGCACATGAACAATCGGGAGGCTACCTATGCAGCAAAGCAATTGATGATGTGCTTTGCGGAGAACCGGATGAATGATGAGCCC 

(SEQ ID NO: 619) 

Exon: 4699.. 4530 
Exon: 4465.. 3112 
Exon: 1251,. 1001 

Start ATG: 4548 (Reverse strand: CAT) 
Transcript No. : CT17476 

CCCTGTGCTCGAAACTCCCGCGTCGCCGAGTAACTGGCATAATAAGAAGAAGCGGCTTTGCCGACAATTTTGCGCTTGGCTCGGGAAAAACGTGT 

AAAACATTTAATTTACCAACAAGACTACGCGAAAAGCGACATAATCACAAACAGCAATGGCTTTGGTTTGCGCACTGACAAACGAAGTCCCCGAG 

ACGCCCGTGGTTTCCCCGCACTCGGGAGCCGTTTTCGAAAAGCGGGTGATTGAGAAGTATCTGTTAGAGAATGGCTGCGATCCCATTAGCGGCAA 

GGAGCTGAAGCCGGAGGAGCTGATCGAGATCAAGACGCCGGCTGTGGTGAAACCGAAACCGCCAAGTGCCACCAGCATTCCGGCCACTTTGAAAA 

CGATGCAGGACGAGTGGGACGCCCTCATGATTCACTCGTTCACACAACGCCAGCAGCTGCAAACGACGCGCCAGGAGTTGTCCCATGCGCTGTAT 

CAGCACGATGCCGCCTGTCGTGTGATTGCCCGCCTCAATAAGGAGGTGGCCGCCGCTCGCGAAGCTCTGGCCACGTTGAAACCGCAGGCGGGCAT 

TGCCAACGCACCCACGGCAATACCACAACCGGCCCTGGCCTCTGAGGCTGGCGGAGCTGCTGCCCATCCCATGGAGCAGGCGGGCATGAGCGCCG 

AGGTCATACAGAAACTGCAGGACAAGGCCACGGTGCTGACGCAGGAGCGTAAGAAGCGCGGACGCACTGTGCCCGAGGATCTGGTTACAACGGAT 

CAGGTTAAGAACTTCCTTACTGTGGCATCCCATCCCGGCCTACACTCGGCCTCCGTTCCCGGCATTCTGGCCTTGGACATCAACAGTGCTGATCA 

CAGTAAGATTCTGACCGGAGGCAACGATAAAAACGCCACCGTGTTCAACAAGGACACAGAGCAGATGGTGGCCATCCTGAAGGGACACACCAAGA 

AGATCACCAAAGTCATATATCATCCTAACGAGGATACAGTCATAACCGGCTCGCCCGACATGAACATACGCATCTGGCACGTGCCCACTTCGCAG 

ACGCAGCTTCTCCTGCGCTGCCACGAAGGACCTGTTACTGGCCTTTCGCTGCACCCCACTGGCGACTACTTGCTGTCCACATCATCCGACAAACA 

CTGGGCATTCTCCGATATCCGCACAGGACGCCTGCTCACCAAGGTAATCGATACCGCCGAAGTCGGGCTGACGACAGCGCAATTCCATCCTGATG 

GTCTGATCTTCGGTACGGGCACCGTGGACTCGCAGGTTAAGATCTGGGATTTGAAGGAGCAGAGCAATGTGGCCAACTTCCCTGGTCATACCGGT 

CCCATTTCGGCCATCTCCTTCTCGGAGAACGGTTACTACCTGGCCACCGCTGCAGATGATGCCTGCGTTAAGCTGTGGGATCTGCGAAAGCTGAA 

GAACTTTAAGACAATCCAGCTTGATGACGGCTACGAGGTCAAGGATCTGTGCTTCGACCAAAGCGGAACCTATCTGGCTATCGCCGGCAGCGATG 

TCAGGGTTTACCTGTGCAAGCAGTGGCAGGAACTGAAGGTATTCAACGACCACACGGCCCTGGCAACGGGCGTGAGATTCGGCAAGCACGCCCAG 

TATCTGGCATCCACCAGCATGGACCGCACCCTGAAGCTCTACGCCATCGAATAAGTCCACACTTAGTAATTAAGTCCAAAATTTTCTTATACAAT 

ATTCCCTAATGATTGTGTGTTTTCTTGAACAAATCTTGCAGAATAATAAACCTGACCCTCTATCC 

(SEQ ID NO: 620J 

Start ATG: 152 (Reverse strand: CAT) 

MALVCALTNEVPETPVVSPHSGAVFEKRVIEKYLLENGCDPISGKELKPEELIEIKTPAVVKPKPPSATSIPATLKTMQDEWDALMIHSFTQRQQ 
LQTTRQELSHALYQHDAACRVIARLNKEVAAAREALATLKPQAGIAHAPTAIPQPALASEAGGAAAHPMEQAGMSAEVIQKLQDKATVLTQERKK 
RGRTVPEDLVTTDQVKNFLTVASHPGLHSASVPGILALDINSADHSKILTGGNDKNATVFNKDTEQMVAILKGHTKKITKVIYHPNEDTVITGSP 
DMNIRIWHVPTSQTQLLLRCHEGPVTGLSLHPTGDYLLSTSSDKHWAFSDIRTGRLLTKVIDTAEVGLTTAQFHPDGLIFGTGTVDSQVKIWDLK 
EQSNVANFPGHTGPISAISFSENGYYLATAADDACVKLWDLRKLKNFKTIQLDDGYEVKDLCFDQSGTYLAIAGSDVRVYLCKQWQELKVFNDHT 
ALATGVRFGKHAQYLASTSMDRTLKLYAIE* 
(SEQ ID NO: 621) 

Name: GTP-binding-protein 
Classification: signal_trans duct ion 
Gene Symbol: Gbp 
FlyBase ID: FBgn0013969 



Celera Sequence No. : 142000013384 665 

CAACCATCAGCAGTGTCATCCGCATCCGCGCCAGCTATCACAGCACCAGCCCAAGCCATCCGATCAGCTCATCCGCCGGCTGCACGTTCCTAGGC 
ACTACTGGTAGTACTTTGGGCCGCCCGCGCAGTGCGGCCGCCCGATCGTTTGCGTCTGCATGGAGTCAATGAGACGTAACGAGGCGAACACTAGC 
CACCACCCGAATAAGGTAAGCTAATTGTGGCACCTTAATCGGAAAACTATCCACGCATCATTTCTGAAGCATTTTAATCATATTCAGTATTGATT 
TCTTATCAATAAATTGATTTAAATTGTAAATTTTTATGCTACAAAGCTAAACTTTTCAAATACCTAAGTGTATACATATCTTGTAAATGTCTGTA 
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TGTATTTAAGCAAAGACGTCAGGCGAAGTAAACGTTTTTTTTTCTTTGAATACCATATAATTTTTTATTGCGAGTAATAACTTACTAGTGGGAAG 
TACTTATAGCTTATAGCTTTGGATCTTTAATACTTGGTAACACTATTTACCTTCCTCCACTGTATTTAGTGGGTGATAAGAGCGTAATTATTTTA 
TTGTCATCATTTTGGCCCACTCGAGTGGCGTTTTTTTCGTGCCAGTTTCTCAAGTAGCCACTACACTTTTGCAGTGTGCGATAAGCGAAAGGGGC 
GATAATGAACTGACCGGCGAACTACATGCTTTACTCCTCCTTAGTTTGTTGGTTTATTGGGTCAGTCCAGGAAACTGATTTATTAGTTCAAAGGG 
TGGAAAATGAAAAACAACAACAACGAAGATCAACGTGCACTTAAAGTTGTCAGTTGCCACTCGCGTTTTCCAACACTGAAAGCGTTTTGTTTTGT 
TTGCCTTCGTCTCTCCGCCGCTTCTCTTCCGCTCACATAATCTCTTTCTCGCTCTCCCTCTCTTCACCCGTTTTCGGTGGTTGCTCTCCGGCTCT 
TTCCTCTTCTTCTTCACTTTTTTTTTGGTTTTCCGTTCAGTCTCATCCGCCAAACGAAAAAGGACGCCGCGGAGAAGGCTCGCGCAAAATAATCG 
AGAGTAATACGACGTTGCTTATTTATTTATTGTTTTTGTTTTTTGTTTTTCTTGTAAATTGAAGCGTTAAAAATACGCGCACGCACAAAACTATC 
AACT AATT ATGGT T AT GGAATGGAAAAGAAGTT GGCAAAGGT AAAGAGCGAAAGCGGCGAAAGAAAT AACAAT T TT TGTT T GAGTTT TTT TT T TT 
GAGTTTTCTTCACGTCATGCCCTACTCCTCACACTTACGCACTCACACCCATGCATATGTATGTAAGCTACAACTTATTTTGTTGGATTTCTTTC 
TTTTTTTTTGTGCAATCGCAGGTGGCCCTGAATAGCAGCAACGTAGCAACGGAAAGCAATATTAAGAATAATAAATTGTTAGCTAATTTAAGTGC 
GGCAGGAGCAGCTACAGCTACAACGGCAACAACAGGAACAGCAACAACAACGACGGCGACAACAGCGAATCAAGCGCTGAATTTCAATAACAAAA 
CGAAAGCAACAGCAAACGCGACTGCAGCATCGGCGAATAATCGGCACAATAACAACAACAATTCCAGTGCGATCAAGAAGCACACAAATACGAGT 
AAGTAACTGCAAATCTCTCGCTGCCTCTCGCTTTTTTTTTATCAACAGTGGCGTAGACAGCATTGTTTTAAGGGGTTACAAATTAAATTACAGAA 
AAAAGTGCTATTTCACACACACAGAAGTGTATAGAAATCGATAGCGTCCTTTCGATTAATGTGAATTCCCCCCTCCCCCATTTTTATTATTAATT 
CATTAAAAGAATTGAAAATAAGTAATTAATATAATAAAAATTAAA 

TTAAACTAAATACAATTCAGCAACTCCCCATTTAGTACGCCACTGACCTCTAAGCTTTAATGCGACAACTCCAATCTAATCCATCTCTCCTTTTT 
CGCAGAACAACTAGCTGGCAAGAGCCCCGCGTGCAACTCCTCCTCCTCGTCGCTCTCCTCGTCGAGCAGCTCCAACTCGAGCGAGTCGAAGGACA 
CGAACTTTGAGTACGAGGACGAGTGGAATATTGGCGGCATACCAGAGCTGCTGGACGACTTGGACGCGGATATTGAGAAGTCCGCGCATTCTTCG 
GGTGGTGGCAACCAGGCTACCGCGCTAAATGCCAAGCAGGCAAACAGCTCCTCCACATCCTCCTCATCATCCTCCAAGGGCGGCGCTTCATCCTC 
GTCATCCTCAGCAGCAGCGACGGGATCATCATCGTCGCATAAATCGCACAAAACCACATTGCACAGCAATTTGTCGGCCACATCGCCAACCACAA 
TTAAGTTCACACGCCAACCGGTGGCCATTGGCGGAGCTAACTCCTCCTCATCTTCATCAGCAGCTGCAGCGCCCAGTGGTGGCAGCGCGAACGTG 
GTGGCCAAGGGATCATCTTCTTCCTCGTCCTCCACATCTTCCTCATCTTCGGGGAAGCATCATCACCATCACCACCATCACTCAAACTCAAGCAG 
CAGTGGAAGCAGTTCCAAGGGCTACAAGTCCGCTTTGGTGGCTCAACTGAACAGTCCGAGTCCACTGAACAGTAACTCCAAGTCCCTGAGTGGAT 
CGGGGAGTGGAAGCGGGAACACAAACGGAGCAGCGGGAGCTGGAGCCGGCAGCACATTGTCATCCTCGACATTTGCCGGTTTCTCCAAAGGCGGC 
AGCTTAGTGTCTTCGTCGGCAGGAGCAGCAGCGGCTTTAGCGGCTGGCAGTGGACAACAGGGCTCCAAATTCTCCGCTGGCGGCATGTCCTCGCA 
AACGGGCAGCGGCAGCGGCGGCAACAACACTAGCAATAGCAACAACAGCAGCAGCGGAAGCGGAGGCAGCGGCTCAGGTAGCAGCACAGGGAACA 
CCGCCAGCGGTAGCGGGAACAACAACAGCACAAGTGCCGGTGGGCCGCCGAGTTCGCAAGGCGGCAACAACGGAAACGGTAGCGGTAGCAGCTCC 
AGCTCCTCCAGTGGTAAATCGAGCGCAAAGATGTCCATAGACCACCAAGCGACGCTCGACAAAGGACTCAAAATGAAGATCAAGCGCACCAAGCC 
GGGCACCAAAAGCTCGGAGGCCAAACACGAGATTGTGAAGGCCACCGATCAGCAACAGAACGGAGCCCTGGGCGCCGGATCCAATAACTCGGCCA 
ACGAGGATGGGAGTTCCGGTTCCAGTTCCACCAACGCGTCTTCCCTGGGGAGCACCAATTCATCGAGCAGCGCAAGCAGCGGCAGCTCCTCCAGC 
AGCGGCAGTTCTTCGAGCAGCAGCAAGAAGCACCTAAACAATGCCAGTAGCGGTAGTGGCTCCTCCTCTTCCGGAGGAGGGAGCCAAAACAATGC 
CAGTGGCCATGCCAGCGGAGGGGGATCCTCCGGTAGCAGTCAGTCTACGCCGCAGGGCACTAAGCGCGGTAGTTCGGGTCATCGGCGTGAGAAGA 
CCAAGGACAAGAACGCACATTCCAATCGCATGTCCGTGGACAAGTCAGCAGCTGCTGCCTCGGCGGCCGGCGAGAAGGATACACCTGAGAAGTGT 
TCCGGTACGGGAGCTGGTGGATCTCCCTGCTCCTGCAACGGAGATGTGGGAGCTCCTTGCTCACATCATGCCTGCATTCGCCGTGCCGCACACAT 
GTCCAACTCCGCAGGCAATGCGAATTCGAGTGCCGGCACTGGGCAATCCGGTGGTTCATCCTCCATGTCAGCAGTGCCACCGGGTGTGTTCACAC 
CTTCAGCGGGCTCTCCTTCCACCGGTTCACCATCGACCGTGGTGCCTGCCGCTGCCTCACTTCTGGCAGCGACCGGAGCAGCCTCCTCCTCCGCC 
TCTCAAATGGCCAGCAGTAGTGCCGGCGGCGTTGGCGGTTCGGGAGGAGGTGCCAATGCACCAGGACCGCCGGGTAAGGAGTCTGCCGGCAGCAT 
CAAAATCTCCTCGCACATTGCCGCCCAACTGGCAGCAGCGGCCGCTTCCAATAGTTACAGCGGGAGTGGAGCCAACACGAATCAAGGCCAGAACA 
GCAATGCTGGCGGCAACGGCGGTTCGGAGAGCAAAGCAGCAGCGGCTGCTCAGGCCAAGTTAATGGCACCCGGCATGATCTCAGCCACCATGCAC 
CATACGATCTCGGTGCCGGCGGGCACAGGAACAGGCGACGACGATACCAAATCTCCGCCTGCCAAAAGGGTCAAGCATGAGGCTGGCGCTAGTGG 
AGCAGGAGGCGGCAAGGAGATGGTGGACATCTGCATTGGCACCTCGGTGGGCACCATCACCGAACCGGACTGCCTGGGTCCCTGCGAACCTGGCA 
CTTCCGTAACCCTCGAAGGGATTGTGTGGCACGAGACCGAAGGCGGCGTTCTCGTGGTCAATGTCACGTGGCGGGGCAAGACCTACGTGGGCACT 
CTGCTAGACTGCACCCGACACGATTGGGCTCCTCCAAGGTGAGTTGGTGTGGCTTGGGAATTGAGTTGAGGCTTATGTTGACTTGCTTTTCAAAC 
TGTTTCCAAAATTATTGTAGAGACTAGATAAATGAATAAAGATGCCTTTCAATATTATGTTGCAAGTTGATTTATTTTTATTATAAATTATCTTT 
TCTATCTACCCATTGTTTTGGGGATCAGTTTGAAATATAGTCTATGGTTTATATACGAGAAACATAATTATCAAAACTCAGTGGCGTTTAAGTAA 
ATTTGTCAGCACCTTATCTACTTGCCTAATATTTGCATCAAGCTGTTAAGTTATTTGTAAATTTAATCTAAATCTGATGCAACAGCAATCATTTG 
CCGTTCGTTTACTGATGCGATTATTTATTAAGTGCTTAATGTGCCACTCAAGATTTCCACGAATGTACATATGTAGTTTGTTTTGATTGAAGATA 
TGTGATATCTGTAGGGGGATTGCAAACATCACTAGTTTTAGATTTTACCGATTACTCATTCATTTTCGTTGCTTTCAGATTCTGTGATTCACCAA 
CTGAAGAATTAGATTCTCGAACTCCAAAGGGACGCGGAAAGCGCGGACGCAGTGCAGGTCTCACTCCTGACCTGAGCAACTTCACCGAGACCAGA 
AGTTCGGTAAGTAAAGCATACAGAATTCAGTTTTAGGCTTTTGAAAGCAGAAATAAATACGATGGTTAATAATATATATCTCTCATCAAATCGTT 
ACAGATTTACTTCTCACACGCCCAGGTGCACTCCAAGCTGCGTAATGGTGCCACAAAGGGACGCGGTGCGACACGCAGTGCATCCGGTAATGCGG 
CAGCGAACAGCAACAGCAGTTCATCGGGCAACGGAGGCGGGGCCACGCCCAGCACATCGCCGACGGCATTTTTGCCGCCACGGCCAGAGAAGCGC 
AAGTCGAAGGATGAGGCGCCTTCACCGCTTAACGGCGACGCCTCGGATGGAGCGTCTGTTGGCGGTATTGGTGGAGCCGGTGGAGTGAACATGGT 
GAACGCCAGTGGCATCCCCATATCGGCCAGCGGCGGTGGCCTGGCCACGCAGCCGCAGAGCCTGCTCAATCCGGTCACTGGTCTCAATGTGCAGA 
TCAGCACCAAGAAATGTAAGACTGCTTCGCCCTGCGCGATCTCCCCAGTTCTGCTTGAGTGTCCCGAGCAGGACTGCAGCAAGAAGTACAAGCAC 
GCAAATGGGCTGCGTTATCACCAGTCGCATGCCCATGGAGCGGGCGGTGGCGCAAGCTCCATGGATGAGGACTCCATGCAGGCGCCCGAGGATCC 
GGCCACGCCGCCCTCGCCAGGAGTCGCAAGTGGAACGGGATCAGGAGCATCTGTAGCATCATCGGCAGTACCAGCGACAGCACCATCAGCGGGTC 
AAGGAACTGTCGCAGTGTCCCCAAACACACCCCTCGCAAATTCAAGCAACCCCGTCACCAATGGCAATGTTGCACCATCGGCACCAGCTACTGGA 
TCGGTAACTATCGCCGCTCCCAACACTACTCCCTCTGTGGTAGAGACGCAAGCGCCGCTTACTGGTCCTCCACCAGTCACGCCACCAGCTCCAAC 
TCCCATCTGCGCAGTTGCAACACCCGGAGCGGAGCAATCTGTATCATCAGTATTGCCCCTGGGAAATCTTCCACTGACGGCAGGTCCAAATAGCG 
CGACACAACAGCAACAGCCTCCTACACAGCAGCAACAACCACAGTTGCTCGTCCCAGGAGGCAGTGCGGCAAGCCTTCAGCAGCAGCAGCAGCAG 
CAGCAACCAGTGGCCGGAGGCAGCATCACTGCTGGAATCTCCGGACAGGCTCTGTCACAGCATCAACAGCAACTAATGGGTGGATTGCCTGCCAT 
GTTGTCAGACCAGCAGCAGCAGGCATTGTTGCAGCAGGGAGCACGTAAGTATTCGATTTTTCTACCTTTTGGAAAAGATGTGATTGAAATTATAT 
TAATCGTACTAAATTCTTATCTATTTCTTTTTTAGTTAAAGCGGGTGTACTGCGTTTTGGCCCGCCAGATGGAAATCCCCTGCAGCAGCAGCCTG 
GTCAAGCATCGGTTAATCCACAGACACAACAGTCTCCTCCAAGGCCTCCCAGTCATGTCCAGGATCAACAGACGCCATCGGCCTACGCCCAGCAG 
GCAGGTTTAAAGACATCGCCTGGATTTGGCTCCGTTGGCGTGGGTGCCGCTAGCAGCAAACAAAAGAAGAACCGCAAGTCACCTGGACCCAGTGA 
CTTTGAGGGCCGCGTATCGCGCGAGGATGTACAGAGTCCGGCATACAGTGATATCTCCGATGACTCCACGCCAGTTGCCGAACAGGAAA TGCTGG 
ACAAGTCAGTCGGCCAGGCGGTGACGGCAAAGCACATCGAATTGATGGGCAAGAAGCCGACGGAAGTGGGCGTTGGTGTTCCGCCACCACCAGCT 
CCCAATATGTATGTGCCGGGAATGTATCAGTTCTATCCGGCGCAGCAGCAGTCAGCACCTCCACCACAGCAACAGCAACAACAGCCGCAATACAT 
GGTGCAGACAGAGCCAGGCAAACCACCAGGATTGCCACCTGCTTTGACACAAGCTCAGCAGCAGCAACAATTGCAGCCGGGTGCTCCTCCGCCTA 
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CCTCACAGCCCCCGAGTCATTTGCTAGGTCCACCCGGTCAGCAGTCGGTGGCTGCCCACCTGGCAGACTACAGTGGCAAGAACAAGGATCCACCA 

TTGGATCTGATGACCAAACCGCAGCCACAGCCGGGTCAGCCGCCTTCACAGCAGCAGCAGTCGGGCCAACTGTCGGGGCAGGAAAACAATGGCAA 

GGATGTTGGACCGCCCACCTCTCAGCCAGGATCTCAGCCGCCGCCGGTTAACCTAAGTGCAGTGGCTGGGCCGCCGCCAGGAAGCCTGCCACCCG 

GTTTGGGTGGTCTCTCGGCTTTGGGAGCAGCCGGTCTAGGTGGTCCTGGACCGGGCAAGGGCATGCCCCACTTCTATCCCTTCAAGTAAGTTGTT 

GATCAATCGCTAAGGTCAGTTGCCATTGCATAACTAACCAATCGTTTACCTTTTAGTTTTATTCCGCCTGCGTATCCATACAATGTGGACCCCAA 

CTTTGGGTCAGTTTCTATTGTTGCTTCAGAGGAGGCCGCCAAGCTGAGTGGGCATCCGGGTCTGCCACCCAGCTCACAAGCGCAGCAGTTATCCG 

GCATAAGTATCAAGGAGGAGCGTCTGAAGGAAAGCCCCAGTCCGCACGACCAGCCGAAGCACATGCCATCTCAGCAACAGGTAAGTCCTTACCAA 

CGTTATAATATTTTGGCAAAACTAATGCAATATGATTTCATATCCTCCGCAGATGATAGCCAGCAAGCTAATCAAGCAGGAGCCGATGACCAAGC 

AGGAGATCAAGCAGGAGCCCAACTCAAATCCGGGCCAGCAGCATCCGCCTCCGCAACAGCAGCCAGCGCCTCAGCCGCAGCAACAACAGCCGCCA 

CCCCCGCAGCCACAACAGCCGCATGCCCTGCATCCCAAGGATTTACAGGCACTGGGCGCCTATCCCGCCATCTACCAGCGCCATTCCATAAACCT 

GGCCGTGCAGCAGGCTCGCGAGGAGGAGCTGAGACGGTAAGTGAAGTATAAGCATCATAAACGCGTCTTTGAGGAATACAAATCATAATAAATTT 

AAAATTGTTTCTAACTTTTGGAAGTTGTAACTAACTTGTGCATCTTTTCACATCAGGTATTACATGTTCACTGGACGACAGAATTCGGCAGCCGC 

AGCAGCAGCCGCAGCAGCTCAAAACGCTGCCAGTGGTGGTCTTCCACCGCATCCCGGCATGATGCACAAGGATGAGCCTGGCATGGGATCGGCGG 

CGCAACAACAGCAGCAGCAACAACAGCAGCAGATGCAAATTGCACAGCAGCAGCAGCAGGCCATTCAGCAGCATCATCAGCACCTGCAGCAGCAG 

C ACC AGGC C C AGC AAC AGC AGC AGC AG CAAC AACACC AGC AGC AGC AGC AAC AAC AGC AGC AGC AGC AA C AA C AA C A GC A GC AGC AGC AACAAC A 

GCAAAAGTTAAAGCAATCGCAGGCGGCTAGTGCGGGGGCAAATAATAAGGCCACCAACCTGACGAAGGATTCTCCCAAACAAAAGGGCGGCGATG 

AC GAT CAAC C GC T G AAGGT G AAGC AGG AG GGCC AAAAGC CG AC C AT G GAG AC GC AGG GT C C GC C AC C AC C AC C C AC GT CGC AGT ACTT CC TT C AC 

CCCTCATACATTTCCCCGACGCCCTTTGGGTTTGATCCCAATCATCCGATGTATCGCAACGTGTTGATGTCAGCCGCCGGACCGTATAATACGGC 

ACCGTACCACCTGCCCATCCCTCGCCCGTACCATGCCCCAGAAGATCTCTCGCGTAACACCGGCACCAAGGCGTTGGATGCACTCCATCACGCGG 

CCAGTCAGTACTACACCACCCACAAGATCCATGAGCTCAGCGAACGGGCCCTCAAGTCGCCCACCAGCGGCAGCGGCCCCGTCAAGGTGAGCGTC 

AGCAGTCCCAGCATCGGGCCGCCTCAACCAGGCGGACCCACGAGCAGCGGCCCCGGATCCGGACCAGTTTCCGGTGTCCTCGGCCCCGGCAGCGG 

TTCCAACCAGCAGCCCGGCTCGGCACCTGGGTCTGCGGGCGGTGTGCCGCTGAACCTACAGCCACCACCCGGAGGAATGGGCCCGTCACCCGGCA 

GCAAGCCGGATCTGTCCGGCCCGAAAGGACATGGCGGAGTGACGCCCGGCTCGTCATTGGACGGCCACAAGCAGTCGATGCCCGGAGGTCCGCCA 

CCGAACGGGCCATCGGGCAATGGAGCCGTTGGCGGTGTGGGAGGCGCTGCCGGCAATGGCGCAGCGGGAGGCGGCGGTGCAGGTGCCGCCGACTC 

ACGCAGCCCACCACCGCAGCGCCATGTGCACACCCACCACCACACGCACGTCGGCCTCGGCTATCCCATGTACCCGGCGCCGTATGGAGGTGAGT 

CCCCAACTGTGCTATTCAGTTCGGAAAGTTGAATGCTAATCATTTAAATGTTTACTCCCATAGCGGCTGTTTTGGCTAGCCAGCAGGCGGCTGCT 

GTAGCTGTGATAAACCCGTTTCCGCCGGGTCCGTCGAAATGAGATCCACTGAACGGTAACAACGAGAAGAAGGCGTATAACGTAAAGAACAACTA 

GTGTGGAGCGGAAAGTCATGGTGCCCGCCCCATAGTGCATTTGCTATTTTCTGAGATCCCCAGACAGACAGATAGAATGGGAGAAGGATAAGGCT 

TTGTTCCCCGTTGGATTCCAACTTCTTCCCGTCGAAACCTGTCACAGTCTTGGCTGCCTCAACGAGTCCTTTCGATGCTGATGCATCGGCGGTCG 

AGGAACCGAGAGGAACACAGAATCGACTATTTATATGCATAAGTATTTTTTGATGTTTCGTTCGATGTGGTTAACTATAACTAGCCTAAGGTATC 

ACTACTATTACCACATACAACCTATACATATATAAATATATATAGCATAACTACATATAGCGACTTTTTCCTAAGTGTATTTGCATCTGCATATA 

GAT AT CT AT AT T T AT AT T T AT AT C AT ACGAG AG AGGC GT AATC CG AAC AT G AAC AAAC AC AACC AAC AAAC AG A TT AC AT GAAAC A T GCAAAAAA 

TGAAAGGGAAAAAAACGAATCAAACTCGATACTGTAGTGTGTTGATTTCTTTGTAAAAAAAAATAAAAACTAAACGTACGAGAAATGAAATTAAG 

AAAAGATCGAAATGTATGAATATTGTTTAACTAAATGATTAAATTATGCAAGTAGAACGTACAAGATTAGCATTTAGCGTGTGTATTAATTATCC 

TTATCTGTGTTGCGGATGGAGCTGCGCGGGATTTGCATCTAAAAGACAAATTGGGCGCGCCGCCCATTCAATTCTTATAAATATAAATATATATT 

ATGTGTATCTAAATCTATATCTGTGTGTGTACTACTCGTATCCGTAGTTTCTACTTTATAAGTTCTCCGTGAATGACTACGATGAATTTTTCGGC 

CAGCAGCCGCACATTAATTTTAATTGTTTCTAAGCATCAATTTGTACTTTAAATATTTATACGCATATGCTGCATATATGATATGATAACGA 

(SEQ ID WO: 622) 

Ex on : 1001,. 1180 

Exon: 1352.. 1613 

Exon: 2001., 4503 

Exon: 5019.. 5136 

Exon: 5230. .5290 

Exon: 5357.. 6409 

Exon: 6496.. 7495 

Exon: 7562.. 7775 

Exon: 7843, .8111 

Exon: 8227.. 9494 

Exon: 9564.. 9637 

Start ATG: 1160 

Transcript No. : CT1764 6 

CAAACGAAAAAGGACGCCGCGGAGAAGGCTCGCGCAAAATAATCGAGAGTAATACGACGTTGCTTATTTATTTATTGTTTTTGTTTTTTGTTTTT 
CTTGTAAATTGAAGCGTTAAAAATACGCGCACGCACAAAACTATCAACTAATTATGGTTATGGAATGGAAAAGAAGTTGGCAAAGGTGGCCCTGA 
ATAGCAGCAACGTAGCAACGGAAAGCAATATTAAGAATAATAAATTGTTAGCTAATTTAAGTGCGGCAGGAGCAGCTACAGCTACAACGGCAACA 
ACAGGAACAGCAACAACAACGACGGCGACAACAGCGAATCAAGCGCTGAATTTCAATAACAAAACGAAAGCAACAGCAAACGCGACTGCAGCATC 
G GC G AAT AAT CGGC AC AAT AAC AAC AAC AAT T CC AGTG C G ATC AAG AAGC AC A C AAA T ACG AAAC AACT AGC TGGCAAGAGCCCCGCGTG C AACT 
CCTCCTCCTCGTCGCTCTCCTCGTCGAGCAGCTCCAACTCGAGCGAGTCGAAGGACACGAACTTTGAGTACGAGGACGAGTGGAATATTGGCGGC 
ATACCAGAGCTGCTGGACGACTTGGACGCGGATATTGAGAAGTCCGCGCATTCTTCGGGTGGTGGCAACCAGGCTACCGCGCTAAATGCCAAGCA 
GGCAAACAGCTCCTCCACATCCTCCTCATCATCCTCCAAGGGCGGCGCTTCATCCTCGTCATCCTCAGCAGCAGCGACGGGATCATCATCGTCGC 
ATAAATCGCACAAAACCACATTGCACAGCAATTTGTCGGCCACATCGCCAACCACAATTAAGTTCACACGCCAACCGGTGGCCATTGGCGGAGCT 
AACTCCTCCTCATCTTCATCAGCAGCTGCAGCGCCCAGTGGTGGCAGCGCGAACGTGGTGGCCAAGGGATCATCTTCTTCCTCGTCCTCCACATC 
TTCCTCATCTTCGGGGAAGCATCATCACCATCACCACCATCACTCAAACTCAAGCAGCAGTGGAAGCAGTTCCAAGGGCTACAAGTCCGCTTTGG 
TGGCTC AACT GAACAGTC CGAGTC CACTGAAC AGT AACTCCAAGTCCCTGAGTGGATCGGGGAGTGGAAGCGGGAACACAAACGGAGCAGCGGGA 
GCTGGAGCCGGCAGCACATTGTCATCCTCGACATTTGCCGGTTTCTCCAAAGGCGGCAGCTTAGTGTCTTCGTCGGCAGGAGCAGCAGCGGCTTT 
AGCGGCTGGCAGTGGACAACAGGGCTCCAAATTCTCCGCTGGCGGCATGTCCTCGCAAACGGGCAGCGGCAGCGGCGGCAACAACACTAGCAATA 
GCAACAACAGCAGCAGCGGAAGCGGAGGCAGCGGCTCAGGTAGCAGCACAGGGAACACCGCCAGCGGTAGCGGGAACAACAACAGCACAAGTGCC 
GGTGGGCCGCCGAGTTCGCAAGGCGGCAACAACGGAAACGGTAGCGGTAGCAGCTCCAGCTCCTCCAGTGGTAAATCGAGCGCAAAGATGTCCAT 
AGACCACCAAGCGACGCTCGACAAAGGACTCAAAATGAAGATCAAGCGCACCAAGCCGGGCACCAAAAGCTCGGAGGCCAAACACGAGATTGTGA 
AGGCCACCGATCAGCAACAGAACGGAGCCCTGGGCGCCGGATCCAATAACTCGGCCAACGAGGATGGGAGTTCCGGTTCCAGTTCCACCAACGCG 
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TCTTCCCTGGGGAGCACCAATTCATCGAGCAGCGCAAGCAGCGGCAGCTCCTCCAGCAGCGGCAGTTCTTCGAGCAGCAGCAAGAAGCACCTAAA 
CAATGCCAGTAGCGGTAGTGGCTCCTCCTCTTCCGGAGGAGGGAGCCAAAACAATGCCAGTGGCCATGCCAGCGGAGGGGGATCCTCCGGTAGCA 
GTCAGTCTACGCCGCAGGGCACTAAGCGCGGTAGTTCGGGTCATCGGCGTGAGAAGACCAAGGACAAGAACGCACATTCCAATCGCATGTCCGTG 
GACAAGTCAGCAGCTGCTGCCTCGGCGGCCGGCGAGAAGGATACACCTGAGAAGTGTTCCGGTACGGGAGCTGGTGGATCTCCCTGCTCCTGCAA 
CGGAGATGTGGGAGCTCCTTGCTCACATCATGCCTGCATTCGCCGTGCCGCACACATGTCCAACTCCGCAGGCAATGCGAATTCGAGTGCCGGCA 
CTGGGCAATCCGGTGGTTCATCCTCCATGTCAGCAGTGCCACCGGGTGTGTTCACACCTTCAGCGGGCTCTCCTTCCACCGGTTCACCATCGACC 
GTGGTGCCTGCCGCTGCCTCACTTCTGGCAGCGACCGGAGCAGCCTCCTCCTCCGCCTCTCAAATGGCCAGCAGTAGTGCCGGCGGCGTTGGCGG 
TTCGGGAGGAGGTGCCAATGCACCAGGACCGCCGGGTAAGGAGTCTGCCGGCAGCATCAAAATCTCCTCGCACATTGCCGCCCAACTGGCAGCAG 
CGGCCGCTTCCAATAGTTACAGCGGGAGTGGAGCCAACACGAATCAAGGCCAGAACAGCAATGCTGGCGGCAACGGCGGTTCGGAGAGCAAAGCA 
GCAGCGGCTGCTCAGGCCAAGTTAATGGCACCCGGCATGATCTCAGCCACCATGCACCATACGATCTCGGTGCCGGCGGGCACAGGAACAGGCGA 
CGACGATACCAAATCTCCGCCTGCCAAAAGGGTCAAGCATGAGGCTGGCGCTAGTGGAGCAGGAGGCGGCAAGGAGATGGTGGACATCTGCATTG 
GCACCTCGGTGGGCACCATCACCGAACCGGACTGCCTGGGTCCCTGCGAACCTGGCACTTCCGTAACCCTCGAAGGGATTGTGTGGCACGAGACC 
GAAGGCGGCGTTCTCGTGGTCAATGTCACGTGGCGGGGCAAGACCTACGTGGGCACTCTGCTAGACTGCACCCGACACGATTGGGCTCCTCCAAG 
ATTCTGTGATTCACCAACTGAAGAATTAGATTCTCGAACTCCAAAGGGACGCGGAAAGCGCGGACGCAGTGCAGGTCTCACTCCTGACCTGAGCA 
ACTTCACCGAGACCAGAAGTTCGATTTACTTCTCACACGCCCAGGTGCACTCCAAGCTGCGTAATGGTGCCACAAAGGGACGCGGCGGGGCCACG 
CCCAGCACATCGCCGACGGCATTTTTGCCGCCACGGCCAGAGAAGCGCAAGTCGAAGGATGAGGCGCCTTCACCGCTTAACGGCGACGCCTCGGA 
TGGAGCGTCTGTTGGCGGTATTGGTGGAGCCGGTGGAGTGAACATGGTGAACGCCAGTGGCATCCCCATATCGGCCAGCGGCGGTGGCCTGGCCA 
CGCAGCCGCAGAGCCTGCTCAATCCGGTCACTGGTCTCAATGTGCAGATCAGCACCAAGAAATGTAAGACTGCTTCGCCCTGCGCGATCTCCCCA 
GTTCTGCTTGAGTGTCCCGAGCAGGACTGCAGCAAGAAGTACAAGCACGCAAATGGGCTGCGTTATCACCAGTCGCATGCCCATGGAGCGGGCGG 
TGGCGCAAGCTCCATGGATGAGGACTCCATGCAGGCGCCCGAGGATCCGGCCACGCCGCCCTCGCCAGGAGTCGCAAGTGGAACGGGATCAGGAG 
CATCTGTAGCATCATCGGCAGTACCAGCGACAGCACCATCAGCGGGTCAAGGAACTGTCGCAGTGTCCCCAAACACACCCCTCGCAAATTCAAGC 
AACCCCGTCACCAATGGCAATGTTGCACCATCGGCACCAGCTACTGGATCGGTAACTATCGCCGCTCCCAACACTACTCCCTCTGTGGTAGAGAC 
GCAAGCGCCGCTTACTGGTCCTCCACCAGTCACGCCACCAGCTCCAACTCCCATCTGCGCAGTTGCAACACCCGGAGCGGAGCAATCTGTATCAT 
CAGTATTGCCCCTGGGAAATCTTCCACTGACGGCAGGTCCAAATAGCGCGACACAACAGCAACAGCCTCCTACACAGCAGCAACAACCACAGTTG 
CTCGTCCCAGGAGGCAGTGCGGCAAGCCTTCAGCAGCAGCAGCAGCAGCAGCAACCAGTGGCCGGAGGCAGCATCACTGCTGGAATCTCCGGACA 
GGCTCTGTCACAGCATCAACAGCAACTAATGGGTGGATTGCCTGCCATGTTGTCAGACCAGCAGCAGCAGGCATTGTTGCAGCAGGGAGCACTTA 
AAGCGGGTGTACTGCGTTTTGGCCCGCCAGATGGAAATCCCCTGCAGCAGCAGCCTGGTCAAGCATCGGTTAATCCACAGACACAACAGTCTCCT 
CCAAGGCCTCCCAGTCATGTCCAGGATCAACAGACGCCATCGGCCTACGCCCAGCAGGCAGGTTTAAAGACATCGCCTGGATTTGGCTCCGTTGG 
C GT GGGT GC C GC T AGC AGC AAAC AAAAG AAG AAC C GCAAGTCACCTGG AC CC AG TG AC T T T GAGGGC C GC GT AT CG CGC GAGGAT GT AC AG AGT C 
CGGCATACAGTGATATCTCCGATGACTCCACGCCAGTTGCCGAACAGGAAATGCTGGACAAGTCAGTCGGCCAGGCGGTGACGGCAAAGCACATC 
GAATTGATGGGCAAGAAGCCGACGGAAGTGGGCGTTGGTGTTCCGCCACCACCAGCTCCCAATATGTATGTGCCGGGAATGTATCAGTTCTATCC 
GGCGC AGC A GCAGTCAGC AC C TC C AC C A CAG CAAC AGC AAC AACAGCCGC AAT AC AT GG T GC AG AC AG AGC C AG GC AAACC AC C AGGAT T GCC AC 
CTGCTTTGACACAAGCTCAGCAGCAGCAACAATTGCAGCCGGGTGCTCCTCCGCCTACCTCACAGCCCCCGAGTCATTTGCTAGGTCCACCCGGT 
CAGCAGTCGGTGGCTGCCCACCTGGCAGACTACAGTGGCAAGAACAAGGATCCACCATTGGATCTGATGACCAAACCGCAGCCACAGCCGGGTCA 
GCCGCCTTCACAGCAGCAGCAGTCGGGCCAACTGTCGGGGCAGGAAAACAATGGCAAGGATGTTGGACCGCCCACCTCTCAGCCAGGATCTCAGC 
CGCCGCCGGTTAACCTAAGTGCAGTGGCTGGGCCGCCGCCAGGAAGCCTGCCACCCGGTTTGGGTGGTCTCTCGGCTTTGGGAGCAGCCGGTCTA 
GGTGGTCCTGGACCGGGCAAGGGCATGCCCCACTTCTATCCCTTCAATTTTATTCCGCCTGCGTATCCATACAATGTGGACCCCAACTTTGGGTC 
AGTTTCTATTGTTGCTTCAGAGGAGGCCGCCAAGCTGAGTGGGCATCCGGGTCTGCCACCCAGCTCACAAGCGCAGCAGTTATCCGGCATAAGTA 
TCAAGGAGGAGCGTCTGAAGGAAAGCCCCAGTCCGCACGACCAGCCGAAGCACATGCCATCTCAGCAACAGATGATAGCCAGCAAGCTAATCAAG 
CAGGAGCCGATGACCAAGCAGGAGATCAAGCAGGAGCCCAACTCAAATCCGGGCCAGCAGCATCCGCCTCCGCAACAGCAGCCAGCGCCTCAGCC 
GCAGCAACAACAGCCGCCACCCCCGCAGCCACAACAGCCGCATGCCCTGCATCCCAAGGATTTACAGGCACTGGGCGCCTATCCCGCCATCTACC 
AGCGCCATTCCATAAACCTGGCCGTGCAGCAGGCTCGCGAGGAGGAGCTGAGACGGTATTACATGTTCACTGGACGACAGAATTCGGCAGCCGCA 
GCAGCAGCCGCAGCAGCTCAAAACGCTGCCAGTGGTGGTCTTCCACCGCATCCCGGCATGATGCACAAGGATGAGCCTGGCATGGGATCGGCGGC 
GCAACAACAGCAGCAGCAACAACAGCAGCAGATGCAAATTGCACAGCAGCAGCAGCAGGCCATTCAGCAGCATCATCAGCACCTGCAGCAGCAGC 
ACCAGGCCCAGCAACAGCAGCAGCAGCAACAACACCAGCAGCAGCAGCAACAACAGCAGCAGCAGCAACAACAACAGCAGCAGCAGCAACAACAG 
CAAAAGTTAAAGCAATCGCAGGCGGCTAGTGCGGGGGCAAATAATAAGGCCACCAACCTGACGAAGGATTCTCCCAAACAAAAGGGCGGCGATGA 
CGATCAACCGCTGAAGGTGAAGCAGGAGGGCCAAAAGCCGACCATGGAGACGCAGGGTCCGCCACCACCACCCACGTCGCAGTACTTCCTTCACC 
CCTCATACATTTCCCCGACGCCCTTTGGGTTTGATCCCAATCATCCGATGTATCGCAACGTGTTGATGTCAGCCGCCGGACCGTATAATACGGCA 
CCGTACCACCTGCCCATCCCTCGCCCGTACCATGCCCCAGAAGATCTCTCGCGTAACACCGGCACCAAGGCGTTGGATGCACTCCATCACGCGGC 
CAGTCAGTACTACACCACCCACAAGATCCATGAGCTCAGCGAACGGGCCCTCAAGTCGCCCACCAGCGGCAGCGGCCCCGTCAAGGTGAGCGTCA 
GCAGTCCCAGCATCGGGCCGCCTCAACCAGGCGGACCCACGAGCAGCGGCCCCGGATCCGGACCAGTTTCCGGTGTCCTCGGCCCCGGCAGCGGT 
TCCAACCAGCAGCCCGGCTCGGCACCTGGGTCTGCGGGCGGTGTGCCGCTGAACCTACAGCCACCACCCGGAGGAATGGGCCCGTCACCCGGCAG 
CAAGCCGGATCTGTCCGGCCCGAAAGGACATGGCGGAGTGACGCCCGGCTCGTCATTGGACGGCCACAAGCAGTCGATGCCCGGAGGTCCGCCAC 
CGAACGGGCCATCGGGCAATGGAGCCGTTGGCGGTGTGGGAGGCGCTGCCGGCAATGGCGCAGCGGGAGGCGGCGGTGCAGGTGCCGCCGACTCA 
CGCAGCCCACCACCGCAGCGCCATGTGCACACCCACCACCACACGCACGTCGGCCTCGGCTATCCCATGTACCCGGCGCCGTATGGAGCGGCTGT 
TTTGGCTAGCCAGCAGGCGGCTGCTGTAGCTGTGATAAACCCGTTTCCGCCGGGTCCGTCGAAATGA 
(SEQ ID NO: 623} 

Start ATG: 160 

MEKKLAKVALNSSNVATESNIKWNKLLAWLSAAGAATATTATTGTATTTTATTANQALNFNNKTKATANATAASANNRHNNNNNSSAIKKHTNTK 
QLAGKSPACNSSSSSLSSSSSSNSSESKDTNFEYEDEWNIGGIPELLDDLDADIEKSAHSSGGGNQATALNAKQANSSSTSSSSSSKGGASSSSS 
SAAATGSSSSHKSHKTTLHSNLSATSPTTIKFTRQPVAIGGANSSSSSSAAAAPSGGSAMVVAKGSSSSSSSTSSSSSGKHHHHHHHHSNSSSSG 
SSSKGYKSALVAQLNSPSPLNSNSKSLSGSGSGSGNTNGAAGAGAGSTLSSSTFAGFSKGGSLVSSSAGAAAALAAGSGQQGSKFSAGGMSSQTG 
SGSGGNNTSNSNWSSSGSGGSGSGSSTGNTASGSGNNNSTSAGGPPSSQGGWNGNGSGSSSSSSSGKSSAKMSIDHQATLDKGLKMKIKRTKPGT 
KSSEAKHEIVKATDQQQNGALGAGSNNSANEDGSSGSSSTNASSLGSTNSSSSASSGSSSSSGSSSSSSKKHLNWASSGSGSSSSGGGSQNNASG 
HASGGGSSGSSQSTPQGTKRGSSGHRREKTKDKNAHSKRMSVDKSAAAASAAGEKDTPEKCSGTGAGGSPCSCNGDVGAPCSHHACIRRAAHMSN 
SAGNANSSAGTGQSGGSSSMSAVPPGVFTPSAGSPSTGSPSTVVPAAASLLAATGAASSSASQMASSSAGGVGGSGGGANAPGPPGKESAGSIKI 
SSHIAAQLAAAAASNSYSGSGANTNQGQNSmGGNGGSESKAAAAAQAKLMAPGMISATMHHTISVPAGTGTGDDDTKSPPAKRVKHEAGASGAG 
GGKEMVDICIGTSVGTITEPDCLGPCEPGTSVTLEGIVWHETEGGVLVVNVTWRGKTYVGTLLDCTRHDWAPPRFCDSPTEELDSRTPKGRGKRG 
RSAGLTPDLSNFTETRSSIYFSHAQVHSKLRNGATKGRGGATPSTSPTAFLPPRPEKRKSKDEAPSPLNGDASDGASVGGIGGAGGVNMVNASGI 
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PISASGGGLATQPQSLLNPVTGLNVQISTKKCKTASPCAISPVLLECPEQDCSKKYKHANGLRYHQSHAHGAGGGASSMDEDSMQAPEDPATPPS 

PGVASGTGSGASVASSAVPATAPSAGQGTVAVSPNTPLANSSNPVTNGKVAPSAPATGSVTIAAPNTTPSVVETQAPLTGPPPVTPPAPTPICAV 

ATPGAEQSVSSVLPLGNLPLTAGPNSATQQQQPPTQQQQPQLLVPGGSAASLQQQQQQQQPVAGGSITAGISGQALSQHQQQLMGGLPAMLSDQQ 

QQALLQQGALKAGVLRFGPPDGWPLQQQPGQASVNPQTQQSPPRPPSHVQDQQTPSAYAQQAGLKTSPGFGSVGVGAASSKQKKNRKSPGPSDFE 

GRVSREDVQSPAYSDISDDSTPVAEQEMLDKSVGQAVTAKHIELMGKKPTEVGVGVPPPPAPNMYVPGMYQFYPAQQQSAPPPQQQQQQPQYMVQ 

TEPGKPPGLPPALTQAQQQQQLQPGAPPPTSQPPSHLLGPPGQQSVAAHLADYSGKNKDPPLDLMTKPQPQPGQPPSQQQQSGQLSGQENNGKDV 

GPPTSQPGSQPPPVNLSAVAGPPPGSLPPGLGGLSALGAAGLGGPGPGKGMPHFYPFNFIPPAYPYNVDPNFGSVSIVASEEAAKLSGHPGLPPS 

SQAQQLSGISIKEERLKESPSPHDQPKHMPSQQQMIASKLIKQEPMTKQEIKQEPNSNPGQQHPPPQQQPAPQPQQQQPPPPQPQQPHALHPKDL 

QALGAYPAI YQRHSINLAVQQAREEELRRYYMFTGRQNSAAAAAAAAAQNAASGGLPPHPGMMHKDEPGMGSAAQQQQQQQQQQMQIAQQQQQAI 

QQHHQHLQQQHQAQQQQQQQQHQQQQQQQQQQQQQQQQQQQQQKLKQSQAASAGANNKATNLTKDSPKQKGGDDDQPLKVKQEGQKPTMETQGPP 

PPPTSQYFLHPSYISPTPFGFDPNHPMYRNVLMSAAGPYKTAPYHLPIPRPYHAPEDLSRNTGTKALDALHHAASQYYTTHKIHELSERALKSPT 

SGSGPVKVSVSSPSIGPPQPGGPTSSGPGSGPVSGVLGPGSGSNQQPGSAPGSAGGVPLNLQPPPGGMGPSPGSKPDLSGPKGHGGVTPGSSLDG 

HKQSMPGGPPPNGPSGNGAVGGVGGAAGNGAAGGGGAGAADSRSPPPQRHVHTHHHTHVGLGYPMYPAPYGAAVLASQQAAAVAVINPFPPGPSK 
* 

(SEQ ID NO: 624) 



Celera Sequence No. : 142000013383837 

CGCGCATCCAGCGGGCCACCACACTCACCTCGTCCATGTCCCTGCCGGCGGAGTGCCTGGTTCTCTCCGTATCCATGCCCTCGCTCAACTCGGAG 
CCCAAGCGTCCCAAGAGCGCGGGACCCAAGAAGAAGCCCGGCCAGGCCGTCCACTCGAAGCCATTCCGCCTATATTGAGCCTGTCGAGGAAGGAG 
CTGCTCCACAAGTCGCACGTGACGTCACCCAATCGAAGCACTCAGAGATTTGTGCACAAGTTGATATCGAGGCACGCACTTCTAAAACATGCATA 
TGCAGAATACATTTTATCAAACCGTTGATTGTTTAGAACCGAAATGGAAACGAAACTTGGCATTTACAACATAACAATAAACGACTAAATGAGAG 
GCAAAGTTAAGCACAAAATACAGCTGTGGTTTGTTTGTTTGTTTCTTTATATAATTACAGTATATATTTTCGTAAGTATTAAATCTGTTTCACGG 
CTACTACATCGCTGGAGCCAAACGGCCTTCGTTTACATTTCATTTACATAAGACTTGCGTTTGATTTTCTATATATAATTAGTACAAAAATTGTC 
CTGGGATTTCGTTACGTCTCATTGCGTCTTTATCATCGTGCTCATTGTGTGTTTGGAAAACTGTGTGCTTTTCTTGGTTTCTTTAGGTTTTGTAT 
TCGTTTTTGATTCTTGTTCCTTTCGATTATTCGAGTCGCGTCACATTTAAATGCATTTAGTTGCCTTTTTCCATTTACATCATCCTTTTGCTCTA 
CTTAAGCCATAAAAGTTGTTTATACAATTCGTTTTTAAAGTTAGGGTAATAAATAGGTCATTATAAAATAATAAACAACAGTTGTTACATTTTCA 
AAAGACATCGTCAAATGAGCTTCAAGTGATTATGATTATGATTCGAGGGTCGAGTTAGAATTAAAAAAATGCGCATGCTTGAATATCATAAAAAG 
TGGCTTCTCTAAGAGCTCGTCGTCAAGTGAATTGGCCTGGTTTCAGATTTGGTTAGCTGCGGATTGGCTGCCTTAGCTCATGGCTATATATAACT 
ATAATACATTTAATGGGAAACTAGGAAATGCATAGATGCTCGTGATATGCGAAACCAATGTTTGCTGCGGTTGTTATAATTTTCTCTTTCTGCCA 
GATACTATTCGTACAATACATTCGATTCAATTGACTAGATTTCTACATATGTAGTGCATTAGATTTATGATTTATCTGTGGTACCGATTTTGTTA 
ACAATGTTTTTTTTAGTTGTTGCGGGAGGGAATCGTGGTTTGCCAAATGTGTGACAGCAACGATCCGCAGAAAAAAACAGCCAAACAAAGAGAAA 
TTTCATACGAATTTTGATGGCAGTACATTTCGATTCACAGAATTTCGTACAATATTTATCGTTTGTTTAGAGTTTCTTTCCGGGTTCGGTGCCTT 
T T AC G AGAGT AC GG AGAAT T T T AT AAAT AAAT AC AT GT AAT T G AT T T GT C G ATG AGG GC G CT G C GT G AGT GT C AAT T T GT TG T GG C C AT T GTG C C 

GGTGTACGTGTCGCTTCCGAGGAATCTCTATACTTCGCCAAAGCCAATAACATTCTACCTTATTTACATTATTAAACACTTTCGCTGGCGTCGCT 

TTAGCGGCTCTGGTCTGAGCACCGCGCGCACCGCTTCCTCGAACACCTGAATTTGCAATCGGAGTGGGAAGGGAAATTTAATGATGAATAAGGGG 

TAGGTGGAGGGTGTGCCGGGCACCGATGACTTACCGGCTTGAGACCGCGCTGCGTCAAGGCGGAGCACTCCATGTATTTCACAGCGCGTATCTTG 

TTTGCCAGCTTCTGGCCCTGCTCGCGCTTCAGCGGCGTCAGTCCCTGCTCTGCCAGGCCGCTGAGTGTCTCTCGATCTTCGCGCAAATCGATTTT 

GGTGCCTGGAATACGCCCAAAGATATATGTATGATTCTCGTTTTCTGATTGCAGTTTTCATGTCGCTTGGCAACTTACCAACTAGAATGATGGGC 

GCATCGGGACAGTGGTGCTTTATCTCCGGATACCATTTCGAGGTGACGTTCTCAAAGGACGAGGGACTCGCCACGCTGTAGCATATCAGGAAAAC 

GTCTGTCTGCGGGTAGGATAGCGGTCTCAGGCGGTCGTAGTCCTCCTGACCCGCCGTATCCCACAGTCCCAGCGAGACCTGTATTGTGTCCACTT 

GCATGGGCGCCGAGTAGTTGTCGAAGCTGTGAAAGATATCATAGTAAAAGTTAGATATCGATCTGCGGTTTGCGGCTATCGAGGCACAAACTTAC 

ACTGTGGGCACATATTCGCCGGGAAAGCAGTCTGTCGTGTAGGAGATTAGCATGCAGGTCTTTCCGACGGTGCCGTCACCAACGACTACACACTT 

TATGGGCCTTCCGGTTGACATCGTGAGTGTGCTGAGTGTGCCTGTGCCGATAGCTGTGTACTGTCTGTGTGCTGTGGGCAAAATAAGATGGAAGA 

TGAATGTCTTCGCCAGCTTAATTCAATTAATTAAGTTCTATTCGAGGCAATAAACCCGCCAGTGCCAACTGACCGCAAATTATTTCACAGTCGTA 

CTTCCTTAATAAAAATTCTTTAAAATAGAAACAAATGCTGTAGAAATACATATATGACACAATAAATTATCCTATTCGTTTTATTAGGTTTTTAT 

AAATAAATTAACTACCACATCGTTTTTGGCGCGCAACTACATATATATAAGATAAACAATTGCTTAAATGGAATTTGGCAACTTTTAAAATTTGA 

ATATCTAGACAAATTGAATCAGGATACTTGCTTCCTGAGTCGCAAACTATTTATATTGTATTAATGGAAGTTATACTTATTGGAGTATGCTACAT 

CGTAAAGTGGGAATGTCCAAGCTGTGGAAGTACTACTGTACAGAGTATAATGGCGAAGGTCTACGGTATATCTCTGGGATGATCAAAGGCAGGGC 

TCTCGAAAGGTGTGTCCCGGCCGCAAAAAGAAAGCAACCGCAGCGGCGGAGGCCCATAAAGCCTAATAAAACGAACGAGGGTCTCATTTCGGTTG 

CGGTGGCACTTTGTTTTCGCAAATAAAATTTTCAAAAATAGATCATGATGGGCGACACAGACGGGGAAAAACAAAAAGCATAAATGGGAGCAGCG 

GACCAAATGGCGACCGCCCCTCCAGTGGCGCCCCTTCGGCGGTGGGCGGGGTCGCCCGAGTGTTGTGGTGACGCGGAAGCCACCGGCAGACGCAC 

CGCATCGAGAAAAAGAAACACAAAATGGGAAAATGAACCGCCCGAAAATAGACACAATTAAGTTCTCATAACCGATTCAATTAAAGTTCAACACA 

CAGAGGCGGCACATATTGCGCCACATGCAACGGGGCTGGGCGTGGCCGCGGGGGCGGGTGCGGTAGCAATGAAAATCTTTTGTTGTGGCACAACA 

CCGAATGGGAATACAATATTATTTTACCGTTTTCACCACCACCTATCCTTTTCCTTTGTCTGTCGGCAGGCGGCGACTGTTTTGTACTTGGCTTA 

TATCACTGTGCGTTTTCCGGCTACCGATATGCGTTCGAGGCGATGGTCCATGCGGCGCATTTAAATAATTGATTCGCACTATTTCGCAGCGGCTT 

TTACTTATAAATATCTGTTTGAGGGACAAAACGCAGCGTCCCCGAATAATTTTCGTGTTTTTCCTTTTTGTTTCCACTCCACACAAACACACACC 

CACCTTTTTGACAGCGCAGCGACGCTCTCTCTCTCGCCGTTGACAGTGGTGCCAACTTAGGGCCAGAAAATAGCTGACGGTAAATATAAAAAAAT 

CACTTAAATGTATGGTACCAATAACTAAACTTAATAAGCATATCATCCAGTATAAAGTCGTTTTTCTTTTTCCAAAACTAAAATAAACTTTTTTT 

TACAATTGGCACTTAAAAATCAGCTAGACTTGGCGTGAAAAAAGAGAGCGGGTAGTCCAGCTCCGCTCTCTCGAAACTGACAAGCAGTGCTGCAA 

AAGGTGCAGGGCTCAGGCGGATTTGGCATTATACAATACTAATAGGCATGGTTGTGGGAGAGGGGTTGCAACCAATTTTCACATTGTAAACAAAA 

TTATTTATTTAATTCTATTTCGAGCAGAATATTGTAATAATTACCTGTTAATTAAAAAATTATCATTTTAAAATTATCTTATGTAAGTTCAATAC 

CCTTTAATTGCTATCGACTACGCCCATGCTGCCTATGTCCACTGTTGTTGTTTCGAGCGTCGTAATGAGCAGATTTGTTTATAAACTTTTACAGA 

GTGCAAACCAAACTAGTTCGGGCTGCAGACGCATTTCTTTGGCTACTGTAAGAAATGCGGAAAGCGAATCTCAAGAGGGTGCCCCTGCCCGCACA 

GTTCTGGTGGAGAAGGACTCGCACATAACACTTATTGGCCTAAATCGCGAGCA 

{ SEQ ID NO: 625) 

Exon: 3423.. 3231 
Exon: 2446.. 2281 
Exon: 2211.. 1979 
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Exon: 1905.. 1745 
Exon: 1661.. 1001 

Start ATG: 2396 (Reverse strand: CAT) 
Transcript No. : CT17672 

CGGTGTTGTGCCACAACAMAGATTTTCATTGCTACCGCACCCGCCCCCGCGGCCACGCCCAGCCCCGTTGCATGTGGCGCAATATGTGCCGCCT 
CTGTGTGTTGAACTTTAATTGAATCGGTTATGAGAACTTAATTGTGTCTATTTTCGGGCGGTTCATTTTCCCATTTTGTGTTTCTTTTTCTCGAT 
GCGCACACAGACAGTACACAGCTATCGGCACAGGCACACTCAGCACACTCACGATGTCAACCGGAAGGCCCATAAAGTGTGTAGTCGTTGGTGAC 
GGCACCGTCGGAAAGACCTGCATGCTAATCTCCTACACGACAGACTGCTTTCCCGGCGAATATGTGCCCACAGTCTTCGACAACTACTCGGCGCC 
CATGCAAGTGGACACAATACAGGTCTCGCTGGGACTGTGGGATACGGCGGGTCAGGAGGACTACGACCGCCTGAGACCGCTATCCTACCCGCAGA 
CAGACGTTTTCCTGATATGCTACAGCGTGGCGAGTCCCTCGTCCTTTGAGAACGTCACCTCGAAATGGTATCCGGAGATAAAGCACCACTGTCCC 
GAT G C GC C CATC AT T CT A GT T GGC AC C AAAAT C GAT T T GC GC G AAG AT C GAG AG AC AC T C AGC GGC CT G GC AGAG C AG GG AC T G AC G C C G C TG AA 
GCGCGAGCAGGGCCAGAAGCTGGCAAACAAGATACGCGCTGTGAAATACATGGAGTGCTCCGCCTTGACGCAGCGCGGTCTCAAGCCGGTGTTCG 
AGGAAGCGGTGCGCGCGGTGCTCAGACCAGAGCCGCTAAAGCGACGCCAGCGAAAGTGTTTAATAATGTAAATAAGGTAGAATGTTATTGGCTTT 
GGCGAAGTATAGAGATTCCTCGGAAGCGACACGTACACCGGCACAATGGCCACAACAAATTGACACTCACGCAGCGCCCTCATCGACAAATCAAT 
TACATGTATTTATTTATAAAATTCTCCGTACTCTCGTAAAAGGCACCGAACCCGGAAAGAAACTCTAAACAAACGATAAATATTGTACGAAATTC 
TGTGAATCGAAATGTACTGCCATCAAAATTCGTATGAAATTTCTCTTTGTTTGGCTGTTTTTTTCTGCGGATCGTTGCTGTCACACATTTGGCAA 
ACCACGATTCCCTCCCGCAACAACTAAAAAAAACATTGTTAACAAAATCGGTACCACAGATAAATCATAAATCTAATGCACTACATATGTAGAAA 
TCTAGTCAATTGAATCGAATGTATTGTACGAATAGTATCTGGCAGAAAGAGAAAATTATAACAACCGCAGCAAACATTGGTTTCGCATATCACGA 
GCATCTATGCATTTCCTAGTTTCCCATTAAATGTATTATAGTTATATATAGCCATGAGCTAAGGCAGCCAATCCGCAGCTAACC 
(SEQ ID NO: 626) 

Start ATG: 244 (Reverse strand: CAT) 

MSTGRPIKCVVVGDGTVGKTCMLISYTTDCFPGEYVPTVFDNYSAPMQVDTIQVSLGLWDTAGQEDYDRLRPLSYPQTDVFLICYSVASPSSFEN 
VTSKWYPEIKHHCPDAPIILVGTKIDLREDRETLSGLAEQGLTPLKREQGQKLANKIRAVKYMECSALTQRGLKPVFEEAVRAVLRPEPLKRRQR 
KCLIM* 

(SEQ ID NO: 627) 
Classification: enzyme 



Celera Sequence No. : 142000013384557 

ACTGCGTGAGTTCTTTCTGTTGTTTTTTGTTTGTTTGCTGCTGCCAAGAGGAAACCTTACGATTAGTTTGTGATTTAATTGGAATTTTGTTGGTA 
AATGTTGGATCTCTCATCTAGATTGCATGTACAATATAATCGTTTGATTTGTTTTGGGCGCGTTCTCATATCAAGAGCTATTTGTATTTGTATAT 
GTATAACTACTATCTAAATAGCGTTAGTAATTTACGTATATTATTTACACTTGCATTGTTTTTCTATTTTTTTGTCGGTCTATGTCTTGATCCAC 
TTATGTCTTGCGTTGTAAATATTTGCACGGATATTAAAAACAGAAATATTTCTATTATTGAACACAGTATTGCATTTCTCCGTCACCTCGCTCCT 
TCTCCTTCTCCTTCTCCTTGTACATGTCTGCCTGTCCTCTGGTTTGGTTCATCACTTTTTTTTTTGCTAAGATCAATTGAATGTCTTTTGTTTTT 
AACTCTGTTTCTGTCGCGGCATATGGTGCAACTGCTGCCGGGGCAGCTCACAAAATTTGCATAAATAAATTCTAACAACACGCTTAGGCTATGTG 
GTTAAGTAAAAAAAATACATTAATAATTTGAATACAGCTGTTGTTGCCACCGGGCTTAGCGCACGCGGTACCCGGTATAAATAATAAATTAAACA 
TTTATAACTATTGCAACGCTACTCGACCTATCAACTATATCAAATTGTATATAACCGTATATAAATATGTCCTTTTGCTAAAAATTATTATTATT 
TGTTTGTTTCTTTCTTTCCTTCTTTCTTGCATTCTCGTTTCGTTACTAGTCTAATGCATTAAAGTAATAATAACTGCGGATTATCAACTTGCTAG 
CTTGACTTTTGTGTGTGGTTAGTAGAAGAAAGTCGAGGCAAATATTCTATATACTCGTACCTATTCGCTAGTTACATATTATCATAATTATTATC 
ATTATCGTTATCATTATTTCTTTATGTAGGTAAGCATAGTGTTACGGTTATTATGTACAACTGTCTGGGTGCTGGCTCTGCTTGCGCTTTGACTT 
TGGCTCCTTCGGATGCTGATCGATCGACGGACGACGCGGATTGCCCAACGTGACTAAGGCGCTGGCCACAGCCAGACCAGCGGACTGGCCAGAAG 
GCGCTTGCGAAAGGCCACCTGCGAGTAGAGTGATTCATAAGTAAGACACATACATTCTATAAGATGATTACTTCATCCTTACCCGTGGGCAGCCG 
GACTAGAAGCGAGAGCACAGCGGCTCGGTTGCCACTGCAAGGCTCCAAAGCGATGGGACTAGGGCATTCCTGTGTACAAATAAGATACAAAATAT 
TTCATAGTTTAATCTCAAACTTGTAATGTCCAAATTAGGCCTAGCTTTATATAATCTGTTCTGTATGGATTCCGTCCGCTAAAAAGTCGGCTCAT 
AATTTTAAGCACTGAACGATTGATTCGCCTACCTTTTTGCGTTTTTTGGACAGGTGGCCAATGCCCTCCAGCGTCAGACTCAGATCAATACTTTC 
GCGGTTCTCGATATCGACCAGCACCTGTGCAAAAATGTTTGCAGATTAGAGTCAACGCAATCAAGTTTCCTTTTTTAGATCTCACCTGGGAAAAC 
GGTCGACTCGCCATCTGTTCCATTTCGACGAACAATCGCTGACGCCGCCGGAACATGTCGTACTTCTGCTTAATCTTCCACAGGACGGCGGCCAT 
AAGCAGCAGCAGTAGGAAGCAGGAGGAGAATGTGATGAAGAATTGCTGCAGGTTGAGCTTGGGATACTGTGAGAAGGCAATTTGTATCCAGATTG 
GGGGCTGGAAGTCGTGCACGAACACGTAGAACGTGGTCAGGGAGCTGTTGTCATCGGGCGAGTGCCCGAATTGGTAGTCCGTCTTGGGGAACCGA 
TGACGGAACGTTGAGCAGTTAACGCCCACCAAGATCAGCATGTCCGGCGAGCCAGCCCTCTTGACCGAAATGTCCATTTTGGCCGGCACGCTGCA 
CGTAATGGTGAAATCCGCATCGATCTCCGGCTTTCCCGGTGAGTTCCTGAAGTTAATTTGTGTAAAGTGGCGATCCTCCTTCTTTGAAAGATTAA 
ACGTGAACTGGTAGTCGATGGTCAGCTCGTAGTAGCAGGAGCCCTTTAGCGGATCACCGTGGTAATGGTTCTGGGAGTCGCACTTCTCGCAGTGA 
TCACCGACTATACCCTTGGTGGTGCAGAAACACTTCCCAGTGTCCGGATGACAGTAAACGCCTTGGCCGTTGCAATCGCATCGCTGGCATTTTCC 
CCCGTTTATGGGATTACCCCAGTATCCAGTGCGACACTTCTCGCAGTGCGCTCCTGTCGTCAGATTATTGCATGGCTGTTCACAATGCTGTTGGT 
CATTACAATACGAATGCCCGTTGCAATTGCACCGGGGGCAGGACGTAAAGAACCAGTGCTTTAGGGCGCACTCCGTGTCATCGTAGGGTGCCAGT 
GCTCCGCCAACCACGCACCTGCCCAGACCGGTGTTAGAGCCATTGTCACACCAACCGCAAGCCGGATCGTCCAGGCACATCTGGCAGCTGTTGTA 
GTAGCCACATTGAGCACTTGACAGTGCCGTAGTGCTGCCCACGGCCAGTGCCGTTGATTGGATGGGAGCCGACCGGCACTTGGCTGTGAAGGTGG 
TCCACTCTCGGCACTGACCGTAAGGAAAGCTGGCCGTGTAGGCATTGCGGTCCACGCAGCGCTGTTCGTTTTGGCACCAGATGCACTCGTCCTCG 
GTGCAATTCTGGCAGTTGGTCAATGAAGCGCAGGACGGCGAGCAGGCCACCTCATCCTGCGTCCGGTTGAGCGCCAAACCACCGTAGGAAGTGTA 
GCTTTTGCAGCGGTTGTGCTCCGAGTCCCAGCGACACGCCAGGTTGGCAGAGCAGGCGCGACAATTGTGCAGCTGATCGCACACAGATGACACCA 
GGTACTCCTCGGAGATGGGGCACGAATCCAGATGCTTGGCATTTAGCGGCGGACTAGCCGTGGACACTGCCATTTGTGTGGACGACTCGAAGAAG 
ACCGAAGCCATTGAGGTGGTTTCACGACACCGCTCCTTGCTGCACACTCCGTTTCCGCAGTATGTGCAACCAAAGGCGGTGGACACGCACGATTG 
ACAGCTGGCCAGTTCCTGGCAACGGTGCACATCGTGGAGTAACTCTGAAGTGAGGGTAAGACGGCTCTTCGACGGGCAAGCCACATAGTCGTATT 
GCTCGCGTCCGTAGATCGCCGATCTTTGAACCTGGGTAATGGCGATGCAACGCATCTTTTGCACGTCCCAAATACACTTCACTCCAGGTCGCGCT 
GCCGTGCACTTCTCCTGTTTGGTGTAGTAGCTGCAGTAGCCCGGCTGGTAGCGCAGCATATCGTTTAGCAGCTGGCCATTGAAGCCGCCATAAAT 
GTACAGCGACTCCTCGAACACAACCGAACTGTGACCGAAGCGGGCCAGATCCGCCTGCAGGTGGCCCGGCATCGGATGGTAGTGCCACGAGTCGC 
AGTACACGTCGTACACGAGCAGGTCCTGGCTATAGCATTTCGCGCCGTAGCTCTGCGATGTGTCGTTGTGCGTATTGCCACCGAAGACCATCATT 
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AGTCCCTGGTTCACAAAGTTTGCCGTGTGAAGTAGACGCGCACTTGGCGCCGCAGACAGCAGACTCCAAACTCGCGTTGCCGGTTCATAGGCATA 
CAACCTAGAGCTCAGCACCTGACTGGACTCACTCTCGGAGACTATTCCACCGTAGACGTACACCTTCTCCGTAAGAAAGTCGTACGCGGCACTGT 
GTCCGTATCCGCCTTTCACCACATAGCCGGTGGTGGGGACGATGCGCCACTCTCGTGATGCAAAGTTGAACTCCTGCACTGTGTTGAGGTAGCCG 
TAGTTTGGAGAGTGACCGAAGATAACAACCATGTACTGGTAGTTGTTTTTGTCCCCGTAACCGGGAACCAAGGTAGCTGTGTGACCCACCACATG 
CAGGGGTCCGCACATAGCGGTGGTTCCTCCAGTGGCGTTGCACGAGGGATCCGCTCTTACCGAGATGTTCGCCCACGTTCGCGCCGAGACGTCGA 
AGGCCCACAGTTCGTTGGAGATGCCGTGGCCCTTGACCACTCCACCGTACATGAAGATCTTGTCGCCGTACATCACTGTCGAGGCACCGTATCGC 
TTGTCAGGAACCTCGCTACCATCCTCCGGGTGCACAGTCTCCCATACGTTGCCATTGAAGTCATAGGTGCTCATTAGTTTGCCTCGTCCGTACGA 
TTCGCCGCCCACAATGTGCAACGTGTCTCGCCAGATGGTGGCGCCATGGGAGGCGCTTCCGGCTGGCGCCGGCGAGTGCTTCGGGTGGACCGTGG 
ACCAGGCGCCGTGGGCGCTGATCTGACTGCAATCGTCGCCGGCGAAGCCCTCGTAACAGGAGCAGCTGTAAGAAAGTACGAGACTTAATGAGAAA 
TTGCAAATTCGAAACGAGCCATTTGCGCCCTTTTTGCACATAGATTAGTTTCTGCTTGCATTTCCTCGCTTTTCTAGTTTAGTTTCTGGTTTACA 
CTTAGTTAAGCTACTCAGGCACGTAATTGCATTCTAAATTTAGACAAGTGACAAACAACTAGTTATCTACAGATAACCGGTAAAGAGTTTGTTAT 
ATAACACAAACAATAACAAATGTCTTAAAGGGCTTTTTGAATACTATAATAAAACTCAATACGATGCAAGCAAAATAATTAAGGGCTTCATATAT 
AGAGCCAACTTACCGTTCCTGGTCAAGACGACAATGGCCCTGGTTCTTGGACTCGAAGCAGTTGTTGGGACAGGCGGCTATGTTGCAGGCTTCAC 
CCCTGTACATGGGATCACAGATGCAGTCGCCATCGCGACACTTGCCATGGCCGGAACACTCCACCTCTGAAATGGTATAAGTAGTAACTATTTAG 
AATCTTCTCTGTTTAAGGTAAGCTAAATGACAATCGTACCATCGCTATCCGAGGGACAACTGTTCATCTTGTAGGTGAGGTTGAAACCGGACATG 
TTGTAGGCATCGTCACTGAAGAAGTGCACCAGCGCCGTGCCGGATGTCGCTATCACCTGTGGCACTCGACGAATGGAGAAGTTGCCGCGATACAT 
G AGGCC AC T G AAC G AT AAT AC GG AAAG GAAGG AAAGC C AAT GGT T AGT C AT GAAAT AAAC AT GT T GT C C AAGC AAA C T AAGC T C AC C T G AAC AC C 
GCCAGCAGGGGCGAGTCCACGCTGTCGCCATCGTATATGTACAGATGATCCCAGCCGCACTCCGTGGCGAACTCTCGCAGGTGGATCCGTATGTT 
GGCGGTTCGCGACGGATTGGTATTGTGTCGGCGGTTCCAGTGCGGATGTCTGGCGTCGATTAGCCAGCTGCACTTGACACTCACCGAATAGTTGC 
CCCATCCGTCGTGTATTGTGCCCATGGGGTGGTACATGCTGCAGATTGGAAAGATGGCAAGAGAGAGTGGATATGGCATCCATTAGCTTCGAGGT 
GGCTAAATAAATCAATTCCTCTATCGATTTAGCGATATGGAGTGCACACGCAAATGGCAATAGCCCGATGCTGTGAGGAGATAATTGAATTTGCC 
GGTCAGCCGGCCAGCGCAATTGCAATCTACAATTGATATGGGTACAGTGTATGTGAGATGGTATAGGTTGCTATAGGCTGCATCGTCCGTATTGC 
ATAGACTATCTGTGTCCGAACTGAAGCCGTTCAACTGTTGACCGGCCAAGTCCGTTGTGCATTCTCCACTCTTTGGGCATTTGTTTCCCGTGGCT 
TTTGTTCTTTGAATGCGGATTACGAGTGAAAATATGTCTGGAAGACAGTCTTCAACGAAATGTGACAAGTTAAGAGCTTGAAACTCGAGTAGGGT 
ATCCGCCAGTCGCATGCCAGTGCCTTCCTTTGTCTGTCTTATTATTTTCTTAGTGTTTTGTTTGCACATCCGCTTATAAATAACTGCTTTCCCCT 
GATTTAGTCGCACTGATTTCAATTTACAGCTGTGCGAGCACAAAACGCAGTGCTCAACATACTCATATGTACATATGTGATAATTAAATGCTACC 
CTGTAGTTGCAGCAGCCGTCTTAATAGTTGATCTTTCGTTTCTTAAATTAAAAGGTCAAACGAACATTTGCTCGTTCTCTAACAACAAAAAGTTT 
ATCGGAAAATAAAAATATTCATTAATTAAATCATGAAGCAGATTTTAACTATTATACGTTAATGGCATAGTAGTGAGTAAACACTCTTTATAAGC 
C C T T TG AAC C GT GT GC AG AT AAAGAAT G GCT T TT C AC C AAG C AAAACT C C AT AT C T T AC T AAG C T GG C AAAG AT AAAAC GC G AC AT AAT G AT T C T 
ATATCATTATAAATGGATGGTACAGCTCGTAAAATTTACATATATTAATTATAATTTGCAATATGTATGCACAGCTTCAAGTACATTGTTTGCCA 
TTTAAAGAAACTTTGCACACTCTTCTCGGTTATTTGTTTGTTCTAGTTTTAACAATTAACTCACATAGTCAGTGTACTCGGAGAAAGTGGAATCA 
GCAGCCTTCCGCATTCTAAATGCGACTCACGTTACACCAAGCAAGTGCCGATAAGTACTGCGTATAGTACACTAACAATTGCGGCCACAACAATA 
GTGCCGATGGGCATTTTTGATTATTAGAAAACACTTTCCGTGTGCCTACACTAATAATTGCTTATTTATCACAATGACTTATCTAGAGGGTACAA 
TTTGGTTTGCTTGTTTAATCGTTTTATTTTTGTCATCGCTGGCCATCATATTTTTATTATTTTCGCTTGTCAAAAATATGAAATAGACCAGATCA 
TTAAACGAGATTTTTTTTAAGTTTATGGTATCTAGCTTTGTGGCATCATAAAGTCAGAAAGTCATAAAATATTATTTATGTGGATTGCAAGGCCA 
T AT C TT T C AGT AGT T C T AAAAC CT C T T T GAT AT T GAT TAT AT C T AT GC T AAAT T C AT AT AGAT T C T TAG AT TC T CC C C AAG AAAC CT C AT AT T T T 
TACGCCCGTAGCTGTAGGAGACTGCGTCGTGTGGGCGTTGTGTCTCGGTCAGTTCTAAACTTTGAGTAAACACTCCGTCCAGCCGGGATGCAGTC 
GTGCCTGGGGGGGCATATCAAACGATAGCCACGAAAAATCAATGTCGGCCAAAGATATATTTACACACTCATTGGCAGCCAGCCGGTGAGTCACT 
CTGTGCACTCGAGCACAGCTTAGCTTACGCACAGCTTTGGAGATGGTAGATGGAACCACATATATGTATAGCAATAGCTGGGGCATCGGCACCGT 
GAGCTGGAGTAGGTGAATAGGCGCCCTCGTCATGTGGCCATTCGGCTGGAATGCATTTGGAGTGAGGGAAACCAGTTCGATTGCGCCCAGTTTTT 
GCATAGAAAGCCGGCAAAGCAGTTGCCTTTCGAAGGCAACTTGAGCTGGTGAATCGTTGGCAGCGGTTAGATCCGAACTAGCAGCGAAAAGACTG 
AAAACTCACAGGCGATAAGCGGACACACACCGCTGATAAGGCCATGCCCTTATCACTGCTCAATTTGAGCGGACAAGTGTGTGGCCCCGGAATTT 
ATGGCCCGCTAAATGCTGCGATCCAACGATCTCAAACGCAGGCCGGGATCAGGGGTCTGTGTGGAGCCTGAAACTGGTTTCTGGATGGTTGGGGC 
AGGTGAGGCGACGACCTCAATGCTCGAGTGCGCAAATTGCAGGTCCCATCATCAGCAGCACGGCAGGCAGGTGAATGGGTTTAATTATTGTCAAG 
AT AGAACGCTCGCTTATCGGGCTTCGCGAGC AAAT AGCCGCTGTTAAGAAAC AT AAAC AAACT ATT TCATGAGCTTTATGACGCCAGCCGT AAGC 
GTGAGCAACCAAACTGGCCCAACGACTTCTCACATACATATCTATTAAATATAATGTATAGCATTCGTGCAACACTTTAAATACATCAGCCTATG 
CCGGGCAAAAAGCTGTTATCTAATGTTTGGCAAAATAAACATTATTTTACTGGGATAATATTTTGGGATTGATGTGAAAGAGTTCTTTTAGGTGT 
ATATACACCTCATTATTTATTAGTATGTCATTTGACATTTCTTGAATTAGATATTTTTATAGCACTGTTCCTACTTGGTTCGTCTTAAAGAAGCT 
TTACAGTAGATAAGCAAAGCGTTTTGTATCTGACTCAGGTAATATGACGAATATAACGTTTTCCAGAAATATATGTACATATATTTTCCCTGCTC 
AATCAGCACATTAGATGGCAAAGAAGCTAATGTGGCCAAAAAGCAGTGGCCTCTGCCCCAAACAAGTTCAAAGTCAAAGCGAAATAATTGAATAA 
TCCGCATTGGAAGCGTCAGGAGGCGTAGAAAGCGGCCAGCAGGTGAAAGAATGTCTAGAGGGAAATGGGAGAAAGAGAACGGGAATTGGGAGAAA 
GTGGATGGTGGAGCGGTGAAGAGACGAGATAAACACGGCCAAGAACCCGTCCGACCAGATGAAACTGAACCGTTGGCTGACACAAATAGGCCGGA 
CCGAAAAGGCAGAGGAGATCAGACCGAAAAGAATGGAAAATTGATAGGATATTGATGGCTTAACCACCACATCTTACTGGACTGGAGCTGATCTT 
TGGGGTAGCTGATCTATGGTCAATTATGTGACCTAATATGGAGATCTTACTCTATCTATGTGGCATGCTGTGCATAACCAGCCACTTACCGCACT 
TTTCCGCCACAGAACTGGCACTCCGGTCCCTGCCAGCCGTCGGCGCAGAGGCACTGGCCGTCCTCCTGGCACTGGCCGTCGTTCTGGCAGCGGCC 
GCCGTGGGCGGTGCAGTTGAAGGCGGCCGCCCGATGGAGTCCGCTGAAGCAGCCGTGGAAGAGGAGCAGCGTGAGGAGGAGCAGGCATTTTCGCC 
TGTACTTGGCGTTGAATGTGGGTGAGGTGCTGCTCCGGTTCGGTTGCTTCTGCGGTGGCTCCGCCGCCAGGCACATGATTTGCGTAGTTGATTCC 
AATTCACCGCCCGCCCGCAGCTTCACATCGTCGTCGTCCTGCAGCGAGTCCTTCGATTTTCGCGACTATTGCACAATTTTCCTTTGGGCCTTTCT 
CTCTGGCCGTTTTTCTTTTCTTTTCTATTTGCAATCTTTGCAATTCGGGGCAACCCAAGGCCCCGCACTTGAACGCACACACACACACACGCACA 
CACTGGCACACTCACTCACACTCGCGAACGCGGCTGCACAACAAACAATATTGGCAAAAAGAAACAACAACCTGCATCCAGCGACTGGCAAAGTT 
ATTGACAATTTATTTGTGAAACGAAACGAGAAATCTGCTGGGGGTCCAGTGCTGCACAGCGTTGCACTGCTTCGGCGTTGCAAGCACTGCTTAAC 
AGGGTTGTCAAGTCACTGTCGGTATCGATAAGATATCTAGCAGGGATTACTTTAAAATTATTAAACAAGCTTAGTGTGGTACATTTACTTATCAC 
ATACATAAAGGAGTGCCAAATTTATATGAATAATAATATTAAAAATAATATAAGAGAATATTTTTAAAAAGCTTAATTTTAAAGCGTTAACAGTG 
CAAGTAATTTAACTGTGGCACACTTTTGTTGCTATAGTTTTTCGAAACGTTTTCAAAGTGGTAAAATTGGCAAAATTGTAATACCAATTTCATTA 
CTTTAGAATATTCATAATGCGACATCGACATTTTTTATATATTTTCAGATATTTTTGTTATTTAAAAACAGCTCGTACAATCGATATCTTATCGC 
TGCAAACATTTCAGCCCTCGCAGTCGTTACAACCGTAACAGCACAGTTACAGTTGCAGCTTTCTGTTAGTAGGACTGCTGTTTATTTGTTTATTC 
AAAAATTAGAAATAAATGTTGAAACCAAAGTAATATTTATAAAAAAAGCACTTAAAATAGAGCAAGAATTGTAGTCTTAGAAAGAATACATAGAA 
CTTAGCTTATGAACAAAGAATTAAACACAAATAAATAGAAATACAGTAACCAGAAAATCAATACCCGTTAACAACAACCGGAGCGCGTAAATTCC 
GTCAGTTTGACAGTTGCGTTGATAAGAACGATGCGGACGGACTTTTGAGTGCGCAGGCGCGCTCCGAGCGCACCCGTCCCCCCTTTCGTCATCGT 
GGCTGTTTGCCTGCCAACTACTTTCCGGAATCCGATTTCTCAGACAGTCCACGTTCGGGCGTCGTTGCAAGCAGTGTGTGCTAACGGTACTCCAA 
CCGCGCTGCATGTGCGTTCCCCGTTTGCATCTGTGTGCGTGTGTGAGCGGAGAAG 
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{SEQ ID NO: 628) 



Exon: 9315.. 8735 
Exon: 5453.. 5217 
Exon: 5137. .4980 
Exon: 4911.. 4764 
Exon: 4396.. 3957 
Exon: 3878.. 1606 
Exon: 1544.. 1458 
Exon: 1304.. 1223 
Exon: 1158.. 1001 

Start ATG: 9006 (Reverse strand: CAT) 
Transcript No. : CT17798 

TCAATAACTTTGCCAGTCGCTGGATGCAGGTTGTTGTTTCTTTTTGCCAATATTGTTTGTTGTGCAGCCGCGTTCGCGAGTGTGAGTGAGTGTGC 
CAGTGTGTGCGTGTGTGTGTGTGTGCGTTCAAGTGCGGGGCCTTGGGTTGCCCCGAATTGCAAAGATTGCAAATAGAAAAGAAAAGAAAAACGGC 
CAGAGAGAAAGGCCCAAAGGAAAATTGTGCAATAGTCGCGAAAATCGAAGGACTCGCTGCAGGACGACGACGATGTGAAGCTGCGGGCGGGCGGT 
GAATTGGAATCAACTACGCAAATCATGTGCCTGGCGGCGGAGCCACCGCAGAAGCAACCGAACCGGAGCAGCACCTCACCCACATTCAACGCCAA 
GTACAGGCGAAAATGCCTGCTCCTCCTCACGCTGCTCCTCTTCCACGGCTGCTTCAGCGGACTCCATCGGGCGGCCGCCTTCAACTGCACCGCCC 
ACGGCGGCCGCTGCCAGAACGACGGCCAGTGCCAGGAGGACGGCCAGTGCCTCTGCGCCGACGGCTGGCAGGGACCGGAGTGCCAGTTCTGTGGC 
GGAAAAGTGCGCATGTACCACCCCATGGGCACAATACACGACGGATGGGGCAACTATTCGGTGAGTGTCAAGTGCAGCTGGCTAATCGACGCCAG 
ACATCCGCACTGGAACCGCCGACACAATACCAATCCGTCGCGAACCGCCAACATACGGATCCACCTGCGAGAGTTCGCCACGGAGTGCGGCTGGG 
ATCATCTGTACATATACGATGGCGACAGCGTGGACTCGCCCCTGCTGGCGGTGTTCAGTGGCCTCATGTATCGCGGCAACTTCTCCATTCGTCGA 
GTGCCACAGGTGATAGCGACATCCGGCACGGCGCTGGTGCACTTCTTCAGTGACGATGCCTACAACATGTCCGGTTTCAACCTCACCTACAAGAT 
GAACAGTTGTCCCTCGGATAGCGATGAGGTGGAGTGTTCCGGCCATGGCAAGTGTCGCGATGGCGACTGCATCTGTGATCCCATGTACAGGGGTG 
AAGCCTGCAACATAGCCGCCTGTCCCAACAACTGCTTCGAGTCCAAGAACCAGGGCCATTGTCGTCTTGACCAGGAACGTCAGATCAGCGCCCAC 
GGCGCCTGGTCCACGGTCCACCCGAAGCACTCGCCGGCGCCAGCCGGAAGCGCCTCCCATGGCGCCACCATCTGGCGAGACACGTTGCACATTGT 
GGGCGGCG7AATCGTACGGACGAGGCAAACTAATGAGCACCTATGACTTCAATGGCAACGTATGGGAGACTGTGCACCCGGAGGATGGTAGCGAGG 
TTCCTGACAAGCGATACGGTGCCTCGACAGTGATGTACGGCGACAAGATCTTCATGTACGGTGGAGTGGTCAAGGGCCACGGCATCTCCAACGAA 
CTGTGGGCCTTCGACGTCTCGGCGCGAACGTGGGCGAACATCTCGGTAAGAGCGGATCCCTCGTGCAACGCCACTGGAGGAACCACCGCTATGTG 
CGGACCCCTGCATGTGGTGGGTCACACAGCTACCTTGGTTCCCGTGCAGGAGTTCAACTTTGCATCACGAGAGTGGCGCATCGTCCCCACCACCG 
GCTATGTGGTGAAAGGCGGATACGGACACAGTGCCGCGTACGACTTTCTTACGGAGAAGGTGTACGTCTACGGTGGAATAGTCTCCGAGAGTGAG 
TCCAGTCAGGTGCTGAGCTCTAGGTTGTATGCCTATGAACCGGCAACGCGAGTTTGGAGTCTGCTGTCTGCGGCGCCAAGTGCGCGTCTACTTCA 
CACGGCAAACTTTGTGAACCAGGGACTAATGATGGTCTTCGGTGGCAATACGCACAACGACACATCGCAGAGCTACGGCGCGAAATGCTATAGCC 
AGGACCTGCTCGTGTACGACGTGTACTGCGACTCGTGGCACTACCATCCGATGCCGGGCCACCTGCAGGCGGATCTGGCCCGCTTCGGTCACAGT 
TCGGTTGTGTTCGAGGAGTCGCTGTACATTTATGGCGGCTTCAATGGCCAGCTGCTAAACGATATGCTGCGCTACCAGCCGGGCTACTGCAGCTA 
CTACACCAAACAGGAGAAGTGCACGGCAGCGCGACCTGGAGTGAAGTGTATTTGGGACGTGCAAAAGATGCGTTGCATCGCCATTACCCAGGTTC 
AAAGATCGGCGATCTACGGACGCGAGCAATACGACTATGTGGCTTGCCCGTCGAAGAGCCGTCTTACCCTCACTTCAGAGTTACTCCACGATGTG 
C AC C GT T GC C AG G AAC T GGC C AGC TGT C AAT CGTGCGTGTCCACCGCCTTTGGTT G C AC AT AC T GC GG AAAC G G AGT GT G C AGC AAG GAG C GG T G 
TCGTGAAACCACCTCAATGGCTTCGGTCTTCTTCGAGTCGTCCACACAAATGGCAGTGTCCACGGCTAGTCCGCCGCTAAATGCCAAGCATCTGG 
ATTCGTGCCCCATCTCCGAGGAGTACCTGGTGTCATCTGTGTGCGATCAGCTGCACAATTGTCGCGCCTGCTCTGCCAACCTGGCGTGTCGCTGG 
GACTCGGAGCACAACCGCTGCAAAAGCTACACTTCCTACGGTGGTTTGGCGCTCAACCGGACGCAGGATGAGGTGGCCTGCTCGCCGTCCTGCGC 
TTCATTGACCAACTGCCAGAATTGCACCGAGGACGAGTGCATCTGGTGCCAAAACGAACAGCGCTGCGTGGACCGCAATGCCTACACGGCCAGCT 
TTCCTTACGGTCAGTGCCGAGAGTGGACCACCTTCACAGCCAAGTGCCGGTCGGCTCCCATCCAATCAACGGCACTGGCCGTGGGCAGCACTACG 
GCACTGTCAAGTGCTCAATGTGGCTACTACAACAGCTGCCAGATGTGCCTGGACGATCCGGCTTGCGGTTGGTGTGACAATGGCTCTAACACCGG 
TCTGGGCAGGTGCGTGGTTGGCGGAGCACTGGCACCCTACGATGACACGGAGTGCGCCCTAAAGCACTGGTTCTTTACGTCCTGCCCCCGGTGCA 
ATTGCAACGGGCATTCGTATTGTAATGACCAACAGCATTGTGAACAGCCATGCAATAATCTGACGACAGGAGCGCACTGCGAGAAGTGTCGCACT 
GGATACTGGGGTAATCCCATAAACGGGGGAAAATGCCAGCGATGCGATTGCAACGGCCAAGGCGTTTACTGTCATCCGGACACTGGGAAGTGTTT 
CTGCACCACCAAGGGTATAGTCGGTGATCACTGCGAGAAGTGCGACTCCCAGAACCATTACCACGGTGATCCGCTAAAGGGCTCCTGCTACTACG 
AGCTGACCATCGACTACCAGTTCACGTTTAATCTTTCAAAGAAGGAGGATCGCCACTTTACACAAATTAACTTCAGGAACTCACCGGGAAAGCCG 
GAGATCGATGCGGATTTCACCATTACGTGCAGCGTGCCGGCCAAAATGGACATTTCGGTCAAGAGGGCTGGCTCGCCGGACATGCTGATCTTGGT 
GGGCGTTAACTGCTCAACGTTCCGTCATCGGTTCCCCAAGACGGACTACCAATTCGGGCACTCGCCCGATGACAACAGCTCCCTGACCACGTTCT 
ACGTGTTCGTGCACGACTTCCAGCCCCCAATCTGGATACAAATTGCCTTCTCACAGTATCCCAAGCTCAACCTGCAGCAATTCTTCATCACATTC 
TCCTCCTGCTTCCTACTGCTGCTGCTTATGGCCGCCGTCCTGTGGAAGATTAAGCAGAAGTACGACATGTTCCGGCGGCGTCAGCGATTGTTCGT 
CGAAATGGAACAGATGGCGAGTCGACCGTTTTCCCAGGTGCTGGTCGATATCGAGAACCGCGAAAGTATTGATCTGAGTCTGACGCTGGAGGGCA 
TTGGCCACCTGTCCAAAAAACGCAAAAAGGAATGCCCTAGTCCCATCGCTTTGGAGCCTTGCAGTGGCAACCGAGCCGCTGTGCTCTCGCTTCTA 
GTCCGGCTGCCCACGGGTGGCCTTTCGCAAGCGCCTTCTGGCCAGTCCGCTGGTCTGGCTGTGGCCAGCGCCTTAGTCACGTTGGGCAATCCGCG 
TCGTCCGTCGATCGATCAGCATCCGAAGGAGCCAAAGTCAAAGCGCAAGCAGAGCCAGCACCCAGACAGTTGTACATAA 
(SEQ ID NO: 629) 

Start ATG: 310 {Reverse strand: CAT) 

MCLAAEPPQKQPNRSSTSPTFNAKYRRKCLLLLTLLLFHGCFSGLHRAAAFNCTAHGGRCQNDGQCQEDGQCLCADGWQGPECQFCGGKVRMYHP 
MGTIHDGWGNYSVSVKCSWLIDARHPHWNRRHNTNPSRTANIRIHLREFATECGWDHLYI YDGDSVDSPLLAVFSGLMYRGNFSIRRVPQVIATS 
GTALVHFFSDDAYNMSGFNLTYKMNSCPSDSDEVECSGHGKCRDGDCICDPMYRGEACNIAACPNNCFESKNQGHCRLDQERQISAHGAWSTVHP 
KHSPAPAGSASHGATIWRDTLHIVGGESYGRGKLMSTYDFNGNVWETVHPEDGSEVPDKRYGASTVMYGDKIFMYGGVVKGHGISNELWAFDVSA 
RTWANISVRADPSCNATGGTTAMCGPLHVVGHTATLVPVQEFNFASREWRIVPTTGYVVKGGYGHSAAYDFLTEKVYVYGGIVSESESSQVLSSR 
LYAYEPATRVWSLLSAAPSARLLHTANFVNQGLMMVFGGNTHNDTSQSYGAKCYSQDLLVYDVYCDSWHYHPMPGHLQADLARFGHSSVVFEESL 
YIYGGFNGQLLNDMLRYQPGYCSYYTKQEKCTAARPGVKCIWDVQKMRCIAITQVQRSAIYGREQYDYVACPSKSRLTLTSELLHDVHRCQELAS 
CQSCVSTAFGCTYCGNGVCSKERCRETTSMASVFFESSTQMAVSTASPPLNAKHLDSCPISEEYLVSSVCDQLHNCRACSANLACRWDSEHNRCK 
SYTSYGGLALNRTQDEVACSPSCASLTNCQNCTEDECIWCQNEQRCVDRNAYTASFPYGQCREWTTFTAKCRSAPIQSTALAVGSTTALSSAQCG 




YYNSCQMCLDDPACGWCDKGSNTGLGRCVVGGALAPYDDTECALKHWFFTSCPRCNCNGHSYCNDQQHCEQPCNNLTTGAHCEKCRTGYWGNPIN 

GGKCQRCDCNGQGVYCHPDTGKCFCTTKGIVGDHCEKCD3QNHYHGDPLKGSCYYELTIDYQFTFNLSKKEDRHFTQINFRNSPGKPEIDADFTI 

TCSVPAKMDISVKRAGSPDMLILVGVNCSTFRHRFPKTDYQFGHSPDDNSSLTTFYVFVHDFQPPIWIQIAFSQYPKLNLQQFFITFSSCFLLLL 

LMAAVLWKIKQKYDMFRRRQRLFVEMEQMASRPFSQVLVDIENRESIDLSLTLEGIGHLSKKRKKECPSPIALEPCSGNRAAVLSLLVRLPTGGL 

SQAPSGQSAGLAVASALVTLGNPRRPSIDQHPKEPKSKRKQSQHPDSCT* 

(SEQ ID NO: 630) 

Name: LanBl-like 
Classification: cell_adhesion 



Celera Sequence No. : 142000013384557 

CTCGCTCTCCTTTTGTGTGGCTAATTTTTTCCTTTTAGTCCTTTTTTCGGTTGCCTTCGTGCAGAAATTCCCTGCTTTATGTAATTTAGAACACA 
ATGTTTGCGTTGCACTGTTGGCCGACTGCAGCTGAAAACCGCACAAAAAAAGCGACGCACACACACACCTTCGAGCATCCGGACTTCCAATTTGC 
ACTGCTTTTCTAACAGTCTATCGCGCACGCACTGCCACAATTTCACCCGCTTATCGCGCCGCAATATGGGACGTTCGCTTTTCAATCGATTTTGG 
TAACCGGCAGGTGCGCTTTATGCGAGAATTACGAATTTTCTCCAGCGGCAACTAGAATCGTTCACACAGACACCGTAGAAGTGACGCCATCTTTG 
AAGCGATTAGTGTTGCAAAGCGCGTCTGCAAAAGTGCTGGCTACCAACTACGGTTGATAGGGATGAGGGCTATGGCTTGTTGACCGAAAATGATA 
AAT GT T AAC T AT GC C AAT AT ATT T T AG AT T T T AC T AAT AC GT T AC AAAAAT AAG AAAAGAC T ATG AT AAAAT T GT GT T T T AGC T T G GC T CAT AAG 
TATTGATCCTTTATAAGCTATGGTCTTGCAGTGGCTAACATTTCTCAAATTAGGGCAACCATCTGATAAGATTTCTAATTCGAATTCTAATAAAT 
GAATTTAAACTGCTTTTTCTTAAAGTTAAAAAAAAATTAGTTTTAAATCAAACGGCATGCATACATTAACCATCAAACAATGTAATTATATATAT 
AAT AT T AC C AAT T G GGC T AC T AAAT AT T GC T GAT AT GT AAGC T AT GGT C AT G AGGT AAAGGC AT AT T AT T T AT AT AATT T T C AT AGT AT C T TT GT 
TGTAGGGGAAAAATAACAAGTAATTTAAAATTGTTCAATTGACTTAAAACAAAATATAACAGATATTATTTTTAAATCCAACGACATACCATATA 
AAG TTGGTATTTCTTT C T AAT CTGCTTGGTATTCTCAT CG AC C C C AAT C GGT C AC AC T G GT C GT G AAT C AG AAAC GC ACT AAT T AGT C AAAAAAA 
CGGTACAAAAGATCGCTTAATTGCTGAAATTTCACCTGTCGCAGTTAAGTTTACCCACCAGCAGTTCAAAACCAGCCAGAGGCTAGCTAAAAACC 
CCAAAGATCGACCCCAAAACATCGCCGCAAAGTGGACAACTCCCAGTTTTCAGAGCACCTGCCCCTCAAAAAGGAGCATCGCGACAAAAAGGCGA 
GACTTCGCAGCAGAATGTCCTCGCTCTCTGCGTCCGCGGAGAATGTGTCCAGTTTGGGACTGGGATCGGGCGGAGGAGGAACTACTTCCCACGAC 
GGCAACTCCCAGCAGGGTTCCGGATCCGACGGAGGATCCAGTATGTGCCTGGAACTGGCCCTCGAGGGTGAACGCCTCTGCAAGGCGGGTGACTG 
CCGGGCGGGCGTGGCCTTCTTCCAGGCGGCCATTCAGGCCGGAACCGAGGATCTTCGCACTCTGTCCGCCATCTACTCCCAGCTGGGCAACGCCT 
ACTTCTACCTAGGCGACTACAACAAGGCGATGCAGTACCACAAACACGATCTTACGCTGGCCAAGAGCATGAACGACCGACTGGGCGAGGCCAAG 
TCCTCTGGCAATCTGGGCAACACCCTCAAGGTAATGGGTCGCTTCGACGAGGCGGCCATCTGTTGCGAGAGGCACCTAACGCTGGCCAGGCAGCT 
GGGAGATCGTCTGTCCGAGGGACGCGCTCTTTACAACCTAGGAAATGTCTACCACGCCAAGGGCAAGCACTTGGGTCAGAGAAACCCTGGAAAGT 
TCGGCGACGATGTCAAGGAGGCGCTCACCCGTGCCGTGGAGTTTTACCAGGAGAATCTGAAGCTCATGCGGGATCTGGGTGATCGTGGTGCTCAA 
GGTCGAGCTTGTGGTAATCTGGGAAATACCTACTACCTACTAGGCGATTTCCAAGCGGCCATTGAGCATCATCAGGAGCGACTGCGCATTGCCAG 
AGAGTTTGGTGATCGCGCGGCTGAGCGAAGGGCCAACAGTAATCTTGGCAACTCGCACATATTCCTGGGACAATTCGAGGACGCCGCCGAACACT 
ACAAACGCACCTTGGCTTTGGCCGTTGAGTTGGGTGAACGCGAAGTGGAGGCCCAGAGTTGCTACAGCTTGGGCAACACCTACACGTTGCTGCAC 
GAGTTCAACACTGCCATCGAGTATCACAATCGACACTTGGCCATTGCCCAGGAGTTGGGCGATCG AATT GGT GAGGCCAGGGCCTGCTGGTC ACT 
GGGCAATGCCCACTCTGCCATCGGTGGTCATGAGCGGGCCCTAAAGTATGCGGAACAGCATCTTCAACTGGCCAAGGAGCTGCACGACCCGGTGG 
GTGAAAGCACAGCCAGGGTGAACATCTCCGATCTACGAAAGCTACTCGGAATGCCCGACTCCGAGCCTTCACCCACCGAGGAGGAGGCTCGATCC 
ACCGCCTCCGATCACTCGGCCAGCGGAAATCAGTCGGATGGATCGGAGAACTCGCAAGGAAGAATGGTAAGAGAAACGAGCTGCAACAACATAAA 
GCAATATGAATGATTCCTGAGATATTTTTAAACAATACTGGCAGTCCGTTTGTTCTATTTTTCGAAGTGGACACCTGCTGCTTGCAGTGTTTGCC 
CCTTTATGTTAGCGT AAAT ATT TATACTTTGTACCAAAACACATCGTTAAACTTTCGATTCG AGC ACATGCTGAGAAATTGAAACCCCTGAAAGA 
TACATAGCTAAATTTATTACTTTAATTGCTTCCCGTCAGGTGCGCGTTCGTCGCCAGAGCATGGAGCAGTTGGACCTGATCAAGATCACTCCCGA 
TGGAAAGCGCATGCAGGAGGAAAAGCTACGAGCACAGGCGACCAGGAAAGCCAAAGACGACGACTTTTTCGAGATGCTTTCGCGGTCGCAGTCGA 
AGCGCATGGATGACCAGCGCTGTTCGATCAAGGTGAATCCCGCCGGAGCTCCGGCGGTGGCCACTGGCGCCACGCGTAAGCCGCTTGTGCAGCAG 
AACTCGTTGTTCGTGGATCCCACCAACCTGCCTGGCCTCAAGTCACCGTCGTCGGCGAATCCATCAGCCATTGGACATGGGCCGCTCGCCAGGAG 
CGCCACCACTACGCAGCAGCCCGATGATGACTTCCTAGACATGCTTATGCGCTGCCAAGGCTCGCGTCTGGAGGAGCAGCGGTCCGAGTTACCGC 
GCCCCAATGTCACCATGGACGCAGAGGCGGAGGCACCACCGCGATCCGTCCCAGAGGCTGCTGTTCCAGGGGCACCACGCGGACAGACCGGACGT 
GGAGCTACCGTCCCCGATGAGGACTTCTTCTCACTGATCATGAAGGTCCAGAGCGGTCGCATGGAGGATCAGCGCGCCTCGATTCCGTTCCGAAA 
TGCCAACAATAACAACAATAGCCGCAGCAACAACAACGGTTCAGCCGGCGGAGCTGGAAAGTAGAGAGTTGTAGTCCAAAAAGCTGAAGTCCAAC 
CAAACAATCCTATAAATCCATCCCTGGCCGCAGCTCCTGACTCATTTCGTTGTAAGCCCATCTTGCGGCCATTTATTCAGTCGTTTATTTTCGGA 
AATTGGCTGCTAAACATTTTGTTCTTTTACAAGATTTATATATACACACGTACACGAAAAGCATTAATTTATGAATAGCAACTAACCACGTAACC 
AAATATTAGGGGTGTATAGGGCCTCGTCTACACGAGCAGAGGGAGGAACCTCCAACGGATGCACACACTCATCCACTCATGACTCATTACATACT 
CCACCATGAGCTGGCCATGATATTGAACGATATGCACTCGAGTCTTACGATATTAACGTTTTAATTGCATGTGCAAGTAGTTTAAGCCGGATAAT 
ACTAATGTACTAATCCAATTTGCTCAATAGACGGAACCAAGTCTTCTCTACGATCAACGATCTTACATTCTCGTAGCCACATTCAGCTAAATATT 
AAGTTTTAAGTCCATCGAACCCACTCTACATGTAAGCAATGCATTTTCCAAATCAAAATCTAACTACAGATATATCCAATTGTAACTATGAGTAT 
TGCATTTTTCTATCAATCTATCGATCGATCTAAACTCTGAACATATTTTAAGCCACTTACAATTTTTGTTAGTATATTAACAAGACACAAAGAAT 
AGCAGCTCGGAATTCCATTTGAGGACCGGCGCTTGTGCAAGCACTTAGCATACGATATACACAACCAATCATACTTACACATATACAAGAAACAT 
ATGAACAATATATATATAGTTATACCAATGTCTAACGATTTGCCTCTTCGGGTATTGTTCTACACAATATATACAATACAAGTTATATATACACA 
AAAC AAAAAC AC AAAACTGAGTTCGAGTTGTACGAAATGAAGCC AAAAT TT ATT AATGTT AT ACTTTTTTGT AGC ACGAGATCTACGGGATCTGC 
GAATGAAAGATACCATACA 
(SEQ ID NO: 631) 

Exon: 1001.. 2536 
Exon: 2795.. 3484 
Start ATG: 1250 

Transcript No. : CT17944 

GTCACACTGGTCGTGAATCAGAAACGCACTAATTAGTCAAAAAAACGGTACAAAAGATCGCTTAATTGCTGAAATTTCACCTGTCGCAGTTAAGT 
TTACCCACCAGCAGTTCAAAACCAGCCAGAGGCTAGCTAAAAACCCCAAAGATCGACCCCAAAACATCGCCGCAAAGTGGACAACTCCCAGTTTT 
CAGAGCACCTGCCCCTCAAAAAGGAGCATCGCGACAAAAAGGCGAGACTTCGCAGCAGAATGTCCTCGCTCTCTGCGTCCGCGGAGAATGTGTCC 
AGTTTGGGACTGGGATCGGGCGGAGGAGGAACTACTTCCCACGACGGCAACTCCCAGCAGGGTTCCGGATCCGACGGAGGATCCAGTATGTGCCT 
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GGAACTGGCCCTCGAGGGTGAACGCCTCTGCAAGGCGGGTGACTGCCGGGCGGGCGTGGCCTTCTTCCAGGCGGCCATTCAGGCCGGAACCGAGG 

ATCTTCGCACTCTGTCCGCCATCTACTCCCAGCTGGGCAACGCCTACTTCTACCTAGGCGACTACAACAAGGCGATGCAGTACCACAAACACGAT 

CTTACGCTGGCCAAGAGCATGAACGACCGACTGGGCGAGGCCAAGTCCTCTGGCAATCTGGGCAACACCCTCAAGGTAATGGGTCGCTTCGACGA 

GGCGGCCATCTGTTGCGAGAGGCACCTAACGCTGGCCAGGCAGCTGGGAGATCGTCTGTCCGAGGGACGCGCTCTTTACAACCTAGGAAATGTCT 

ACCACGCCAAGGGCAAGCACTTGGGTCAGAGAAACCCTGGAAAGTTCGGCGACGATGTCAAGGAGGCGCTCACCCGTGCCGTGGAGTTTTACCAG 

GAGAATCTGAAGCTCATGCGGGATCTGGGTGATCGTGGTGCTCAAGGTCGAGCTTGTGGTAATCTGGGAAATACCTACTACCTACTAGGCGATTT 

CCAAGCGGCCATTGAGCATCATCAGGAGCGACTGCGCATTGCCAGAGAGTTTGGTGATCGCGCGGCTGAGCGAAGGGCCAACAGTAATCTTGGCA 

ACTCGCACATATTCCTGGGACAATTCGAGGACGCCGCCGAACACTACAAACGCACCTTGGCTTTGGCCGTTGAGTTGGGTGAACGCGAAGTGGAG 

GCCCAGAGTTGCTACAGCTTGGGCAACACCTACACGTTGCTGCACGAGTTCAACACTGCCATCGAGTATCACAATCGACACTTGGCCATTGCCCA 

GGAGTTGGGCGATCGAATTGGTGAGGCCAGGGCCTGCTGGTCACTGGGCAATGCCCACTCTGCCATCGGTGGTCATGAGCGGGCCCTAAAGTATG 

C GG AAC AGC AT CT T C AAC T GGC C AAG GAG C T GC AC GAC CCGGTGGGT G AAAG C AC AGC C AGGGT GAAC AT C T C C GAT C T AC G AAAGC T ACT C GG A 

ATGCCCGACTCCGAGCCTTCACCCACCGAGGAGGAGGCTCGATCCACCGCCTCCGATCACTCGGCCAGCGGAAATCAGTCGGATGGATCGGAGAA 

CTCGCAAGGAAGAATGGTGCGCGTTCGTCGCCAGAGCATGGAGCAGTTGGACCTGATCAAGATCACTCCCGATGGAAAGCGCATGCAGGAGGAAA 

AGC T AC G AGC AC AGGC GACC AGG AAAGC C AAAGAC GAC GAC T T T T T CG AGAT GC T T T C GC GGT C GC AGT C G AAG CG C AT GG AT G ACC AGC GC T GT 

TCGATCAAGGTGAATCCCGCCGGAGCTCCGGCGGTGGCCACTGGCGCCACGCGTAAGCCGCTTGTGCAGCAGAACTCGTTGTTCGTGGATCCCAC 

CAACCTGCCTGGCCTCAAGTCACCGTCGTCGGCGAATCCATCAGCCATTGGACATGGGCCGCTCGCCAGGAGCGCCACCACTACGCAGCAGCCCG 

ATGATGACTTCCTAGACATGCTTATGCGCTGCCAAGGCTCGCGTCTGGAGGAGCAGCGGTCCGAGTTACCGCGCCCCAATGTCACCATGGACGCA 

GAGGCGGAGGCACCACCGCGATCCGTCCCAGAGGCTGCTGTTCCAGGGGCACCACGCGGACAGACCGGACGTGGAGCTACCGTCCCCGATGAGGA 

CTTCTTCTCACTGATCATGAAGGTCCAGAGCGGTCGCATGGAGGATCAGCGCGCCTCGATTCCGTTCCGAAATGCCAACAATAACAACAATAGCC 

GCAGCAACAACAACGGTTCAGCCGGCGGAGCTGGAAAGTAG 

{SEQ ID NO: 632) 

Start ATG: 250 

MSSLSASAEMVSSLGLGSGGGGTTSHDGNSQQGSGSDGGSSMCLELALEGERLCKAGDCRAGVAFFQAAIQAGTEDLRTLSAIYSQLGNAYFYLG 
DYNKAMQYHKHDLTLAKSMNDRLGEAKSSGNLGNTLKV^ 

KEALTRAVEFYQENLKLMRDLGDRGAQGRACGNLGNTYYLLGDFQAAIEHHQERLRIAREFGDRAAERRANSNLGNSHIFLGQFEDAAEHYKRTL 

ALAVELGEREVEAQSCYSLGNTYTLLHEFNTAIEYHNRHLAIAQELGDRIGEARACWSLGNAHSAIGGHERALKYAEQHLQLAKELHDPVGESTA 

RVNISDLRKLLGMPDSEPSPTEEEARSTASDHSASGNQSDGSENSQGRMVRVRRQSMEQLDLIKITPDGKRMQEEKLRAQATRKAKDDDFFEMLS 

RSQSKRMDD QRC S I K VN PAG AP AV AT G AT RK PL VQQN SLFVDPTNLPGLKSPSS AN P S A I GHGPL ARS AT T T QQ P D D D FL DMLMRC QG S RL EE QR 

SELPRPNVTMDAEAEAPPRSVPEAAVPGAPRGQTGRGATVPDEDFFSLIMKVQSGRMEDQRASIPFRNANNNNNSRSNNNGSAGGAGK* 

( SEQ ID NO: 633) 

Classification: hypothetical 



Celera Sequence No. : 142000013384669 

TTCAATGCCCTTTTTCTTTGGCTACCGATCGAAATACCCACTTTATGGTCTACCTCCCGCCTTCCTCGCCCGCACACATGTCATGCATATATATT 
TTCCTGCTCGATCGGCACACGCATAGCGACAGAAAACAGGCGAGATGTATAATTTTATATAATAGGGCAAAGTTATTTTTAAAGGCACTCGACAA 
TATTATTATGCTAATAAAAGAAATGAACTGCTTGCTACGCGGCGGCGGCGAAAAGACAAAACCGTTTTTCACACCAAATAACGGTCCATTCGAGT 
ATGATTGTAAGTAAAAGTACCGTAACAGGGACTTACCGCGTTTTCGGCGCTTATCCTTGCGCTTACGCGGCTGCACATCGCCCGACATTTTGCTC 
GCTAAAGGCACTAAAACCTATGCCAATATAATGGAGAGTGTTTGTTTTTCCTTTTCGAGCGGACCGGGCAAAACGGGCGATCTGGATCGACGAAC 
AGCACGTTACTAAATGCAAATACCGCTCGCACCGTGGGAAAATTATTTATAAAGAGACAGTGGCGTTGCCAGATCCGGCGGGAATGATGAAAGCT 
AGAGATGACAGCTGGGCAGTAAAGGCACCGCTATTTCAATTTTATTATTCTGGATGGAAAACTCATGACAGCTCTCAAAAACTATATTTACGCAT 
AAATATTTATACATTTATTTAATTCTAATAATTATGTTTTATTTTAAATCGTGTTGTGTCGATAGATTGAATTTAATTTTAACAGTTCAGCAACG 
CAAACGTTTAGAAAACCCAAATGTAATTTTTTCTGTTACTCGTGGATTTTTCATATTTTATGTCTATGCGCTATCCATACAACAGAACAAATTGA 
TTTATTTGATTTAAAATGTAAAGTGCTCAAAGTATTAAATACAAGTAAATAATTTAAATTAAAAAAATTAGCTTTTTTTTAAATATCGCATGTCA 
TTAAAATTCATGCGCACTGCAGTCTCCAGTATCGTCACGCGCCCATCTCGAAGGGAACCAACCCCGACCAAGAAAAAAAAAAACAAGAAAATAAT 
ACACAATTGGCGACGGCGGTGCAGAGGTTCTCCGTGTGATCCGATCGAACGGGACTAGGACCGACACGGACCAGGTCTAAATCCGCGAGGTCAGC 
GGTATTATGGCCGGCGACATGTCCATCGGCTGTGCGGCCCTGAAGCGATCCAGCAAGCTGAGGTCCTGCACCAGCGCCGCCGACACGGACTCGAT 
GCGGGTGCCCAGCGACCACAGGGACAACGGGAGCCTGATCAAGCCGCCTCCGGTGCCGCCGCAGAATATGCGCGGCGGTCTGCGGGTCTATAAGA 
TTGTCATCCTGGGCGACGGTGGCGTTGGCAAGTCAGGTGAGCTATTCTAGCCATGGATTAGACGAAAGAAAATGCCGAGTTATCACTGCGCACTT 
TGATTGGGGCTGCGAAAAATCAATAAATTATCTCAATTTCTTTTACGCTTTGTTTTGACCCCCGTACCACCGACACCCACGAAACTCCACCAACC 
ACCACTCCCAACGCCCTCTGAAAACGACGACGAAACCCGAACAGCTGTCACCCTTCAGTTCGTGAGCCACAGTTTCCTGGACTACCACGACCCCA 
CAATTGGTGAGTAAGGCGCAGGGTGCGCAAGGCGTCTAAGGCACAAGCCCCCACAACCCCACTAAATGTATACCGGTTTTTTTTTTCTTTTTTGC 
TGCTGCAGAGGACTCCTACCAGCAACAGGCGGTCATCGACAATGAGGCCGCCCTGCTCGACATCCTTGACACCGCCGGCCAGGTGGAGTTCACGG 
CCATGCGGGACCAATACATGCGTTGCGGCGAAGGTTTCATCATATGCTACTCGGTCACCGACCGCCACAGCTTCCAGGAGGCCTCCGAGTACAGG 
AAACTAATAACCCGTGTCCGCCTGTCCGAGGACATTCCGCTGGTTCTGATTGCCAACAAGGTGGACCTGGAGTCGCAGCGACGCGTGACCACCGA 
GGAGGGCCGGAATCTCGCCAACCAGTTCGGCTGCCCGTTTTTTGAGACATCGGCTGCACTGCGTCATTACATCGACGAGGCATTCTACACGTTGG 
TCCGCGAGATTCGACGCAAGGAAATGCACAAGGCCCTGGGCACGGACTCCAATTCGGAAAAGATTCACACGGGGCGACGAAGTCGCTGGTGGCGC 
ATCCGATCCATTTTCGCATTGGTTTTCCGGCGCAGACGAAACCTTAACTAGACCAGGAGGCGGATTCCTTGAAATGAGTGCAATATTAACCACAA 
AAACCAAGTAACTTTTGTTTGCTATATTTTGTATATAGCACATACCCATCAGATACAACATACAAAAGAGAAATTAACCGATCGAGGAGCAGATA 
ACTAATTGTTATGGCCACAGTGGTTTATTTATTATTTGTTATGGAAACTATTTAAAGTACTTGTTGTTGGCCGGAAAGGATAGCGAGGCAGTAGA 
TCAGCGATAGAGTATATCAATACGTTTAAGCACCGATCCAAAAGATACATATTAACAATGCACGTACATATGTTTATGTAGTAAATATTACTAAT 
GTTGATTCCTAGACGAGAGCGATTAAGCAAGTGTGGCAATTGGCTAAAGCTCAAAAGCATTTAAATTCAGCAAGATCGAGGGGTCGAGTCTTCCC 
CCACTAACAATCTGAACAACTCTGAATCCTAGCCAGGAGATTAACCCAAAAGCCAATTGCCAGGTGCGAAGGGCAGAGAATTGGAGGATTTTTAA 
ACTTATGTATGTACGTATTTACCCAGAATACACGCACATGACATACGTTACACGAGCCACACATTTAACTAATCGAATAAATGTTGAATTTAATG 
TATACTAATATTATAAATCGATCGATCGACCTAAGAAGTATGTAAAACGCGGGAACATTGCAAATTATGAAATATGAATGAGAATGTGCGTTCAT 
TTCAGATCCAAGCCCAAATTGATAATTTAAGTCCACGACAGCCGATGCATTTGTTGTTTAAGTTTGAGAAAAGTCTATTGTTTATATTGTTAAGG 
ACACACGTGTTACGCCTAAGTTTATTTATTATTAGTTTAACGAAATCTGTTGATTACTTTTACCACAAGAAACCACACATCACGTTCTCTAAGCA 
ATCACCATGAGTTCCTCTCATCAATGTTAACTGCATCACCGGAAACAAAATACTAACACGGCTTTAATTACTCTGTTTGTTACATGATCCCAAAA 
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AACAGTTGAATGGACTACCCATAATTACTAATAATTAATTGTTATTGCGAATCCTGCAACCGTTCGAACCAAACAGATCGACTGACGAAAAAAAG 
AAAAACAACAGCTCCAACACGAAAACGAATCAAATGCGGGTAGAAATATAATAGAATTGCTTTCGCCGCGAAAGTTAAGCGATTAAAAAGTACGT 
TTTAAGAGAAAAGGCTTGGGAACTAGATGTTCAAAGCAATGTGTCGTGTCCATGTTGTTTCGAAAACGACATACCTGTTCGGGAATAATATTGTG 
AAACCTAATACCTAATCGATGAAAAACCAAAACATTTTGATTCATAATGTTATGTACATTTTAAATGTAAAAAAAAAAATTAAATACAAAAGCAC 
AAAATAAACAAAAACAATACCCTCAATAAAGATTTTTTTAACCCCGTTAAAGATAAGTGTTTTGACTTTAAAGGGCGGGTATTCCCCGGATTTAA 
ATCCATTATCTTAGCCTAATAAAATTCGATGCCCGGCCAAATAAGAAAACAGTTAGTTAAACCAAGAGTAGATCAAATGTACTGTTTATTCATAA 
GTATAATATAGTTTGGTTGTATCTGATGCATCTTCGAGTTTTTCAACTTCTGTGTGAGAATTACACGATTGACGGTACAAAATACAAAAAAAGAG 
ACATACACATATGTACTGTATAGTATAATTATATGTATAATCGTATTTTATAAAATCATAAAATACTCTTCATATAGTTGTTTGGTTGTTTTGTA 
GTATCTTTTTTTTTTTATGTTTTCCACTCGTTTTAATGCAAAGTACAAATATTAAAAACTCGCTTAATGCGCTCGAAATAGTTTTTCACTTCATC 
AAAAATTTTCCCTCTTAGCATCTCCCACGCGAAAACTGAATGCATCTCATTTGTGGCTTACGTTTAGTTAAATAAACATTTATCGTTTTTGTTTT 
GTTTTGTTTTGTGTTTAGTTCGGCTCTTGTAATGTAGTTTCGCTTCCAACTCCTATTGGCGGCCTGGTCTCTCTGTATTTGATAGATTTAATGGG 
AACATAACTCTCATCTTGACGCCTACGACCTCCTTGGACCCCGGGATCCAACATCCGATTCCGTTTCCGATTCTGTTCCTAATTCCACTGATTCA 
C C T G AC TCACTCACCCTC GC AT AC G AC AC AAT CT C GT C C AG C C AC AGC C G AC T C T AC AC T TGAC AAC AC AGTT AAC GAAAT AAT AAC AT AC AT CT 
AGGAGGAGGGGGTGGACAGAAACGGACACAACTTGAATATGTTTACATATATCGATATATATTTTTTGCTTGCAGTGGCTGACTGATT 
{SEQ ID NO: 634) 

Exon: 1001.. 1366 

Exon: 1565.. 1621 

Exon: 1719.. 3553 

Start ATG: 1147 

Transcript No. : CT17993 

AAGGGAACCAACCCCGACCAAGAAA7A7A7AAAAAACAAGAAAATAATACACAATTGGCGACGGCGGTGCAGAGGTTCTCCGTGTGATCCGATCGAAC 
GGGACTAGGACCGACACGGACCAGGTCTAAATCCGCGAGGTCAGCGGTATTATGGCCGGCGACATGTCCATCGGCTGTGCGGCCCTGAAGCGATC 
CAGCAAGCTGAGGTCCTGCACCAGCGCCGCCGACACGGACTCGATGCGGGTGCCCAGCGACCACAGGGACAACGGGAGCCTGATCAAGCCGCCTC 
CGGTGCCGCCGCAGAATATGCGCGGCGGTCTGCGGGTCTATAAGATTGTCATCCTGGGCGACGGTGGCGTTGGCAAGTCAGCTGTCACCCTTCAG 
TTCGTGAGCCACAGTTTCCTGGACTACCACGACCCCACAATTGAGGACTCCTACCAGCAACAGGCGGTCATCGAC7AATGAGGCCGCCCTGCTCGA 
CATCCTTGACACCGCCGGCCAGGTGGAGTTCACGGCCATGCGGGACCAATACATGCGTTGCGGCGAAGGTTTCATCATATGCTACTCGGTCACCG 
ACCGCCACAGCTTCCAGGAGGCCTCCGAGTACAGGAAACT7AATAACCCGTGTCCGCCTGTCCGAGGACATTCCGCTGGTTCTGATTGCCAACAAG 
GTGGACCTGGAGTCGCAGCGACGCGTGACCACCGAGGAGGGCCGGAATCTCGCCAACCAGTTCGGCTGCCCGTTTTTTGAGACATCGGCTGCACT 
GCGTCATTACATCGACGAGGCATTCTACACGTTGGTCCGCGAGATTCGACGCAAGGAAATGCACAAGGCCCTGGGCACGGACTCCAATTCGGAAA 
AGATTCACACGGGGCGACGAAGTCGCTGGTGGCGCATCCGATCCATTTTCGCATTGGTTTTCCGGCGCAGACG7AAACCTTAACTAGACCAGGAGG 
CGGATTCCTTGAAATGAGTGCAATATTAACCACAAAAACCAAGTAACTTTTGTTTGCTATATTTTGTATATAGCACATACCCATCAGATAC7AACA 
TACAAAAGAGA7AATTAACCGATCGAGGAGCAGATAACTAATTGTTATGGCCACAGTGGTTTATTTATTATTTGTTATGGAAACTATTTAAAGTAC 
TTGTTGTTGGCCGGAAAGGATAGCGAGGCAGTAGATCAGCGATAGAGTATATCAATACGTTTAAGCACCGATCCAAAAGATACATATT7AACAATG 
CACGTACATATGTTTATGTAGTAAATATTACTAATGTTGATTCCTAGACGAGAGCGATTAAGCAAGTGTGGCAATTGGCTAAAGCTCAAAAGCAT 
TT7AAATTCAGCAAGATCGAGGGGTCGAGTCTTCCCCCACTAACAATCTGAACAACTCTGAATCCTAGCCAGGAGATTAACCCAAAAGCCAATTGC 
CAGGTGCG7AAGGGCAGAGAATTGGAGGATTTTTAAACTTATGTATGTACGTATTTACCCAGAATACACGCACATGACATACGTTACACGAGCCAC 
ACATTTAACTAATCGAATAAATGTTGAATTTAATGTATACT7AATATTATAAATCGATCGATCGACCTAAGAAGTATGTAA7AACGCGGGAACATTG 
CAAATTATGAAATATGAATGAGAATGTGCGTTCATTTCAGATCCAAGCCCAAATTGATAATTTAAGTCCACGACAGCCGATGCATTTGTTGTTTA 
AGTTTGAGAAAAGTCTATTGTTTATATTGTTAAGGACACACGTGTTACGCCTAAGTTTATTTATTATTAGTTTAACGAAATCTGTTGATTACTTT 
TACCACAAGAAACCACACATCACGTTCTCTAAGCAATCACCATGAGTTCCTCTCATCAATGTTAACTGCATCACCGGAAACAAAATACTAACACG 
GCTTTAATTACTCTGTTTGTTACATGATCCC7AAAAAACAGTTGAATGGACTACCCAT7AATTACTAATAATTAATTGTTATTGCGAATCCTGCAAC 
CGTTCGAACCAAACAGATCGACTGACGAAAAAAAGAAAAACAACAGCTCCAACACGAAAACGAATCAA 

TTTCGCCGCGAAAGTT7AAGCGATTAAAAAGTACGTTTTAAGAGAAAAGGCTTGGGAACTAGATGTTCAAAGC7AATGTGTCGTGTCCATGTTGTTT 
CGAAAACGACATACCTGTTCGGGAATAATATTGTGAAACCTAATACCTAATCGATGAAAAACCAAAACATTTT 
(SEQ ID NO: 635) 

Start ATG: 147 

MAGDMSIGCAALKRSSKLRSCTSAADTDSMRVPSDHRDNGSLIKPPPVPPQNMRGGLRVYKIVILGDGGVGKSAVTLQFVSHSFLDYHDPTIEDS 
YQQQAVIDNEAALLDILDTAGQVEFTAMRDQYMRCGEGFIICYSVTDRHSFQEASEYRKLITRVRLSEDIPLVLIANKVDLESQRRVTTEEGRNL 
ANQFGCPFFETSAALRHYIDEAFYTLVREIRRKEMHKALGTDSNSEKIHTGRRSRWWRIRSIFALVFRRRRNLN* 
(SEQ ID NO: 636) 

Classification: enzyme 
Gene Symbol: Ric 
FlyBase ID: FBgn0017549 



Celera Sequence No. : 142000013383733 

GTAATCCCGTTTCAGAGATCTATCGCATTGTGTCGCAGAAACAGATCAGAGATCCGCCGGAAGGCGACGTCATCCGCCCGTCGAACGTGGAGCCC 
ATCGACGTAAAGCCGACTGTCACCGCCGATGTGCGCAAACAGTGCTGTCAGTGACCGCCCCAAGCAATTTTCGCTATTCCCCGC7AACCGCCAGAA 
CAGCAACATAAACAGAAGCAACAGCAATTACAACTACGACTACAAGAGGAGCAGCAGCGGCCAGAACTGCAACACCAAGATCAGGAGCAGCGTAC 
ACCCGATGGACCGCACAAGATGTCTATGCGTTTGTCAGCCACAGTTTGTCATGTGCGCATAAATTGATAATTCAAATCTGAGCCGCCACTCTCCA 
CCACC AAC ACCACCACCATCATCAG7AAGC AGC AGC AGC AGC AAAAGAATCGA7AAGC AAC AACATCCGGCTGT AGTT AACTGCGGTTTAAACTGAA 
GTTTTATATATATACAGATCCATGAGTATCGCATAAATCGACCGCCGCGGAAAGAAAGCAAGCAAAACTAAACACTCCAAATTCAGTTTAAGCTG 
TAGAACTATAATTTATATAAATATACATATATGCTAATATTAAAATATTAATATTATTAATTTTTCGCAACAT7AAATATATAGTTGTAATTTCGC 
GAGCAGAACAACAACAAACAACCGGAAATTCTTCGACTTAAACGCAAATTCATTACCACAGCACTGCAGCTCGGATTTTCTTCTTCGGTTTCTCG 
CAACATTTTAAACGATTGGCCACTTGGTAAAACTAGCTGATAAAGAGAAATCAAACAAAAACA 

AATAACATCATTTGTGTTTGCTATGTTAAGTGTGATT7AAATAAATATAATTTAAATAAAACAGAAGTGCTTTCAATTGGGCGGAAAATCGAAATT 
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TAAAATGGGATTAGAGTGACCAAAGGCAGACGTCGGAGTGGGTGTTTGGTATTTTCCGCGCGAGCAGCCACACTGATCCATACACGTGCGGGAGA 

AAAGTGAACAAAAACATTGAAACATGGGCGGTAAGAAGAAAGTGCATCCGAAGACGCGCACGGCGGCCTTCAAGGCCAGCGAGCCGAGCGAGATC 

GTGGAGGCGCCGCACTCCTTCGTCATCCATCGCGGCCTCGCCTGTCCGTACATCACGGACCTCACCCTGGACTTCCGCCGCATTATGGAGCCCTT 

TACGGCCAGCAATCTGCGCGAGAAGCGGATGAACCGCATCAAGGACTTTGTCAGCCTGAGCAGCTTCTTCCACGTCTCGCACATGGGCATCTTCA 

ACAAGGCCTCCACGCAGCTGTCCTTCAAAGTGGTCCGCTTGCCGCGCGGTCCTTCCCTGACTTTCAAGGTTTGTTTGTTTCATCGCTTAGTTTAT 

ACTGGATATTTTACCGCGAAGTTATTCTCTGTCAGGTTCACCAGTTCACCCTGGCCAGAGATGTGATCTCGCTAAGCAAAAAGCAGATGATTGAC 

AACGATCACTTCAAGCACGCTCCGCTCGTCATCATGAACAACTTCAGTGGCGATGGCAAGCACCTTAAGCTGATGGCCACCACGTTCCAAAACAT 

GTTCCCCTCAATTAACTTGGCCACGGTTAATATCGGCACCATTCGTCGCTGCGTCCTGTTCTCCTACAATCCGGACACGAAGCTGGTCGAGATGC 

GGCACTACTCCGTGCAGGTGGTGCCGGTTGGCTTAAAGCGGGCGGTCCAGAAAATCGTCAAAGGAACGGTACCCAATCTGGGCAAGTGCAACGAA 

GTCGTGGACTTTGTGACCAAGTAGGAAATGACCGTTTGTTCTCATTCCATCAAATAATCATAATATTGATTTTCAGGGACGGCTATGCCTCCGAA 

TCAGAGGCCGAAGATGATGAGCAATCACATGTGGTGCTGGCACAGACTCTTAAAAGCAAGGGCAACTTGGAGGACAAAAAGAGCTCCATAAAGCT 

ACACGAGATCGGACCTCGACTAACCATGCAGCTCATCAAAATAGAAGAGGGCCTTCTGACGGGCGAAGTACTGTATCACGACCATGTGGTCAAGA 

C AG AGGAC GAAAAGG AAA CC C T GC GC AAGCT GG T C G AAAAGAAGC G AAAGC AGAAGG AGC AG C GC AAG AAGG AGC AGG C AG AG AAC CGTGCCCGC 

AACTTGAAGCTTAAGAAGGATGAAAAATGGGCTGCCAAGAGGGCGGCAGAGGGACGGACTGACAGCGATCCCGAGGATGATGCAGAGTACTACAA 

GGAGGAAGTCGGCGAGGAACCAGACGAAGGTGAGTTGTGCTACATGCTGCGTAGAATATAAACTAATGTTTCCTTTGCAGAACTGTTCAAAATGG 

AAGC G AAGT C C T C C C G AAAGC GGC C T AGT TT GGG C GGC GG AAT G AAAT AC AAG AAC AAG C GC GC T AAAC T GGAT AC C AAGG AC AAG AAT G AC AAA 

TCTGAGCGCACGGACAAGTACGATCGTAAGGACAAGTTCGATCGCAAGGACAAGAAGGACAAATTCGACCCCAAGAACAGGAGAGCCAAGTTCGA 

TCCCAAAAACAAGCGCGCCAAGTTTGACCACAGAAAGAGTAGAAAATCAAAGTAGTACATTTAACATAAGCTATTTTATAAAGACCATTGGTTAA 

CACGACATCGCAATGTTATCAATGGGAACTAACTAAACATACGTTGCACCACCTCCTTGTAATGGGCCTTGCTATCCTTCCAGCTAACAGCGATG 

TTGTCGCGCAGCTTGCTGAACTCTCGCTTCTCCTCGTAGTTGGGCATTCGCTCGATGGTGGCTTGCGCCTTGGCGAACTCCTTCAGGTGATAGTA 

GGCGAGTGCACGCCGGTAGATGCTCTTCTCGCTGGGACTCTCCTCCGTCTCCACAAACTGCGTGTGGTAGATTACATGCTCATAGCGTTTCTCCT 

GCAGCAAACAGGCGGCCAGGTTGGTCTGTATCTGAATAAAGAGCTCCTCCACCGCCTGGCCGTTGATGCCATTGGTCTTTTTGGTCAGTTTATCG 

AAGGGTTTGTAGGTGATCAGCAACTTTGCGGCACGCACAAAGTAATCAAAGGCGAATTTCGGGAACGTCTTAAAGGTGGCCACGCCGCTTTCCTT 

GAGACGTAGTGCCACCTCATAGATCTCTGCAGCGCTTAGCTTCTCCACCTGAGTGTTCTTCACAATTTTCTCCAGCTTCAGTTCAAATTCTATTC 

GTTCGCCGGTTTTGGTCGTGATGCTACAGGCTGCCGTTTCACCCGGCACAAATTGCTGCAGCAGCAGCTCCAAGTAGCAGTCCACTGCGGTCAGG 

CTGGTGCCCATTTCCAGGCTGTGCTTTTGAGGATACGTAGCGGACTCATCGCTGACCAAATATTTGCTAGGTCGTCCTCCCAAGTTGGTGGCTTT 

CTTGTTGACACTGAATTCCACCGTACTGATTATCGTGGGCTTGTCCATTTCGGTGAATTGGAGGCCAATTACCTTCTTGGCCAGACCACGGTTCT 

TGTCCTCCCACTCTGTGCCCACGTACCAGTCCATTTTGGAATATTTGCAAATTGGGAGCTTCGAAATTTAGCAGTAAGCAAAGTTTGCGATTTGG 

CGATGTGACCAGAAAACCGTCAAAAATGTACCTATCGCGAATTTCACAAAAAAT 

<SEQ ID NO: 637) 

Exon: 1001.. 1398 
Exon: 1461.. 1825 
Exon: 1882.. 2309 
Exon: 2361.. 2664 
Start ATG: 1069 

Transcript No. : CT18136 

ATTTTCCGCGCGAGCAGCCACACTGATCCATACACGTGCGGGAGAAAAGTGAACAAAAACATTGAAACATGGGCGGTAAGAAGAAAGTGCATCCG 
AAGACGCGCACGGCGGCCTTCAAGGCCAGCGAGCCGAGCGAGATCGTGGAGGCGCCGCACTCCTTCGTCATCCATCGCGGCCTCGCCTGTCCGTA 
CATCACGGACCTCACCCTGGACTTCCGCCGCATTATGGAGCCCTTTACGGCCAGCAATCTGCGCGAGAAGCGGATGAACCGCATCAAGGACTTTG 
TCAGCCTGAGCAGCTTCTTCCACGTCTCGCACATGGGCATCTTCAACAAGGCCTCCACGCAGCTGTCCTTCAAAGTGGTCCGCTTGCCGCGCGGT 
CCTTCCCTGACTTTCAAGGTTCACCAGTTCACCCTGGCCAGAGATGTGATCTCGCTAAGCAAAAAGCAGATGATTGACAACGATCACTTCAAGCA 
CGCTCCGCTCGTCATCATGAACAACTTCAGTGGCGATGGCAAGCACCTTAAGCTGATGGCCACCACGTTCCAAAACATGTTCCCCTCAATTAACT 
TGGCCACGGTTAATATCGGCACCATTCGTCGCTGCGTCCTGTTCTCCTACAATCCGGACACGAAGCTGGTCGAGATGCGGCACTACTCCGTGCAG 
GTGGTGCCGGTTGGCTTAAAGCGGGCGGTCCAGAAAATCGTCAAAGGAACGGTACCCAATCTGGGCAAGTGCAACGAAGTCGTGGACTTTGTGAC 
CAAGGACGGCTATGCCTCCGAATCAGAGGCCGAAGATGATGAGCAATCACATGTGGTGCTGGCACAGACTCTTAAAAGCAAGGGCAACTTGGAGG 
ACAAAAAGAGCTCCATAAAGCTACACGAGATCGGACCTCGACTAACCATGCAGCTCATCAAAATAGAAGAGGGCCTTCTGACGGGCGAAGTACTG 
TATCACGACCATGTGGTCAAGACAGAGGACGAAAAGGAAACCCTGCGCAAGCTGGTCGAAAAGAAGCGAAAGCAGAAGGAGCAGCGCAAGAAGGA 
GCAGGCAGAGAACCGTGCCCGCAACTTGAAGCTTAAGAAGGATGAAAAATGGGCTGCCAAGAGGGCGGCAGAGGGACGGACTGACAGCGATCCCG 
AGGATGATGCAGAGTACTACAAGGAGGAAGTCGGCGAGGAACCAGACGAAGAACTGTTCAAAATGGAAGCGAAGTCCTCCCGAAAGCGGCCTAGT 
TTGGGCGGCGGAATG AAAT AC AAGAACAAGCGCGCTAAACTGGATACCAAGGAC AAG AATG AC AAAT CTGAGCGCACGGAC AAGT ACGATCGTAA 
GGACAAGTTCGATCGCAAGGACAAGAAGGACAAATTCGACCCCAAGAACAGGAGAGCCAAGTTCGATCCCAAAAACAAGCGCGCCAAGTTTGACC 
ACAGAAAGAGTAGAAAATCAAAGTAGTACATTTAACATAAGCTATTTTATAAAGACCATTGGTTAACACG 
{SEQ ID NO: 638) 

Start ATG: 69 

MGGKKKVHPKTRTAAFKASEPSEIVEAPHSFVIHRGLACPYITDLTLDFRRIMEPFTASNLREKRMNRIKDFVSLSSFFHVSHMGIFNKASTQLS 
FKVVRLPRGPSLTFKVHQFTLARDVISLSKKQMIDNDHFKHAPLVIMNNFSGDGKHLKLMATTFQNMFPSINLATVNIGTIRRCVLFSYNPDTKL 
VEMRHYSVQVVPVGLKRAVQKIVKGTVPNLGKCNEVVDFVTKDGYASESEAEDDEQSHVVLAQTLKSKGNLEDKKSSIKLHEIGPRLTMQLIKIE 
EGLLTGEVLYHDHVVKTEDEKETLRKLVEKKRKQKEQRKKEQAENRARNLKLKKDEKWAAKRAAEGRTDSDPEDDAEYYKEEVGEEPDEELFKME 
AKSSRKRPSLGGGMKYKNKRAKLDTKDKNDKSERTDKYDRKDKFDRKDKKDKFDPKNRRAKFDPKNKRAKFDHRKSRKSK* 
(SEQ ID NO: 639) 

Classification: hypothetical 
Gene Symbol : ppan 
FlyBase ID: FBgn0010770 



Celera Sequence No. : 142000013385214 
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TGAATGGCAAATTGGTGTGACCGTATTTTGCGTTGTGACTCGGCTGCTTTATTTACATTGAATTTTTGTTCGATTTTTGAATTTAAAATAAAAAT 

AATGCCGCACGGACACTCTCACGATCACGGCGGCTGCAGCCACGAGGCCTCGGACGTGGACCACGCCCTAGAAATGGGCATCGAGTACAGCTTGT 

ACACCAAAATCGATCTGGACAATGTGGAGTGCCTGAACGAGGAGACGGATGGCCAGGGAAAAAGTGTCTTTAAGCCGTACGAGAAGCGCCAGGAT 

CTGTCCAAGTACGTGGAGAGCGATGCGGATGAAGAGCTCCTCTTCAACATTCCCTTCACCGGCAACATCAAGCTCAAGGGCATCATCATAAGTGG 

GGCCAATGATGACTCCCATCCAAACATGGTCAAAATGTAAGGAACAAAAGTCAAGGCTGCTTTAATATTACTATATTCATCATATATCTTATTTT 

T C AGC T T C AAG A AC C G AC C C AG GAT G AC C T T T GAT GAT GC CAG AGC AAAGC C C G AC C AGG AGT TCCAGCTGACCCGCGAT GC CC GC GG AG AAAT C 

GAGTACTCACCCAAAGTGGTCACCTTCTCCTCCGTGCACCATCTCTCCCTCTACTTTCCCAGCAATTTTGGCGAAGACATAACACGCATTTACTA 

TATAGGTAAGCTCACAACAATCTATGTTTTATCTGTGTTCCCAACTAAAACCAAGTCACTATTCAAAGGTCTTCGAGGAGAGTTCACAGAAGCTC 

ATTACCATGGCGTAACCATATGCAACTACGAATCCCGTGCCAATGCAGCGGATCACAAGGAAAAGGCTTTCGATGGAGTCGGCAGAGCCATCCAA 

TAGGGTTATAGAGATTAAAATTTACAATTTAATTGAGTATCTCTGTAGTGTATATATTTTGTGATCATTTAAGGCAAGAAATGTTTTTGTATTCG 

CTTAACTTATTGAGCATTTGTATTGCACAGATATTGTGGGCTAAATCATACTAAGTATATTGTGTGGGTGGAATCACATCGGGTCCTATTTTCAT 

GCCCATATTGGGCACTTTATCCGAATGGATTTCAGCCGTGACCTCGGTGAACAGCTTGTCCCACTGCCAGAGATCGTCCTTCTCGTTGAGGCTCT 

CCTGTGGAACCAGGCAATCGGTGAGGATTGAGAGATCAAAGCCATCCACAGCTGAAAAGGCGTACTGGGAAAGCAAATCGGAATTGGCCTCCTTC 

AAAACAGCAGATCGTGAGGTGCCAATACTACAAATTACAACTTATTATCGTAGTGAAGAGAAATATAGCAGCTTCTGCTTACGTTGGCGGTTCTA 

CAATCTCATTGAGCATGATTTCGTCCTTGACATCATCCATGTCTGGGATGACGGGAATGTCATCCGTTGGTATGGAATTTGTAGCCGACTTTGTT 

TGTTTAAAGCGCTCACTAAAATGAATTTTATCATTATAACAGTCGGTGGAAAGTGAAGTTATTGTGATAAACTTGTACTCACTCATCAAAGGAGT 

TTTTCTTGGACCTAAAAAGATATAGAGCAATCAGTAATGTGCCTAATTCGAGCCTAATCATACCCACTTTAATCCTTTGAGTCCAGAATCCGCCC 

AGCCACCTGAAATGCGACGAATGGGTGGTCGATGTGACTCGAATTCATTGTTGTATGTGGACGAGGTGGCCGTGTCCGAGGTGGTCGGTGGGGTG 

CCCGCCTGGACATTCAGCGTAATGTCGATGGAAATGTCCTTGGAGCCCCCAGCCGCCTGGACATCCTCCTCCTTCAGAACATCGCGACTCTTGGA 

GCGCCGACCCTTCCTGGCCGTCGTCTAGGGTGGCAAATAAATCGAAACTTCAATTTGCATCCATTTAAACAGGGTTACTTACCTTTCTTATGGTC 

ATCTTGAGTTCTTCGGCCCAGTCCATCATCTTTTGTTTACGTTTATCCTCAGCGTTTCCAAGGCAACAAGTGGTCATGGTTACTCCGTGGGAATG 

TTCAAGTTTACTAATGGAGTTTGCCATTCGCAACGCGATGCGATAGTTTTCAAATCAGTTACCTATCGCCGATGACAGTTGGCGGAGCTTCAAGT 

GGC CAAAATAAAC AAAT ATT T AT T T T CT AAT AAC G C AAAAT GC T GACGG AC AC GG AAG AT G AC TTCTTTTT CG AAG AGG AT AAG AT GT AAGT GGC 

CATTGAAAGCAATAAATTGCTGCAACACCTACGCCCAGCTCTATTTACGCACAACAATCTGTTGCGATGTGCCGAAAAGTGTCTGTTAAAAATCA 

ATAATTGCCACCAAACTGAAACATATTGTTGGAATCAGGCCAGAACCGCTGCTCATCCAGTTGGATGGGGATGGTGCTCAGGCTGTCCGGCAGCC 

CAAGGAACGCACGGAGCAGCGAGAGCATGCCTCCGGTTCCGTCGCTCCCAACGAGGCCACCGTTATGCTCTGCTCCCCGGCGGGATTCTCCTGCG 

ATTCCGCATCCGGGGACAGGTTCGGCATCGAGTCCGCCAGTAGGCAGAAGTATGAGTTAGACCGTCTGTAAGTCAACTCCGTTGACCCGGAATCG 

TTCCCATACTATTTCCTGTGGCAGCTCTGTTTTCTAAGTTCTGGTTTTTATAGTCTTTATCCTGTAGTTTCTACCAAATTAGCTTACTCTTGGTT 

TTTTATTGATTTTATTCTTTTAAATTCATAAATTGGTGTATTTATATTAATAGTAGCATATACACTTCTTTATTTAACTATTAAAAATTTAATTT 

TGATTCAGTTGCTGTTTTTTTTTTAATTATTTACTTTTTAAAGAGCGAACACAGTACTATATCCATTTTTTGTATAGTAAGAACTACAGAACTAG 

AACTAACAAGATTGTTGTTTATTTAGTTTATTAATTCCACCGAAAAAGGACAAATGTGAATTTCCTTTGTAGAGCCTTAGATGCTTATACTTAGG 

CTAGCTAGTATACAAACTATATTGAATTCGAGGTGTC 

(SEQ ID NO: 640) 

Exon: 1982.. 1888 
Exon: 1829.. 1588 
Exon: 1531.. 1508 
Exon: 1440.. 1318 
Exon: 1262.. 1001 

Start ATG: 1977 (Reverse strand: CAT) 
Transcript No. : CT18140 

TAACCATGACCACTTGTTGCCTTGGAAACGCTGAGGATAAACGTAAACAAAAGATGATGGACTGGGCCGAAGAACTCAAGATGACCATAAGAAAG 
ACGACGGCCAGGAAGGGTCGGCGCTCCAAGAGTCGCGATGTTCTGAAGGAGGAGGATGTCCAGGCGGCTGGGGGCTCCAAGGACATTTCCATCGA 
CATTACGCTGAATGTCCAGGCGGGCACCCCACCGACCACCTCGGACACGGCCACCTCGTCCACATACAACAATGAATTCGAGTCACATCGACCAC 
CCATTCGTCGCATTTCAGGTGGCTGGGCGGATTCTGGACTCAAAGGATTAAAGTCCAAGAAAAACTCCTTTGATGATGAGCGCTTTAAACAAACA 
AAGTCGGCTACAAATTCCATACCAACGGATGACATTCCCGTCATCCCAGACATGGATGATGTCAAGGACGAAATCATGCTCAATGAGATTGTAGA 
ACCGCCAACTATTGGCACCTCACGATCTGCTGTTTTGAAGGAGGCCAATTCCGATTTGCTTTCCCAGTACGCCTTTTCAGCTGTGGATGGCTTTG 
ATCTCTCAATCCTCACCGATTGCCTGGTTCCACAGGAGAGCCTCAACGAGAAGGACGATCTCTGGCAGTGGGACAAGCTGTTCACCGAGGTCACG 
GCTGAAATCCATTCGGATAAAGTGCCCAATATGGGCATGAAAATAGGACCCGATGTGATTCCACCCACACAATATACTTAG 
(SEQ ID NO: 641) 

Start ATG: 6 (Reverse strand: CAT) 

MTTCCLGNAEDKRKQKMMDWAEELKMTIRKTTARKGRRSKSRDVLKEEDVQAAGGSKDISIDITLNVQAGTPPTTSDTATSSTYNNEFESHRPPI 
RRISGGWADSGLKGLKSKKNSFDDERFKQTKSATNSIPTDDIPVIPDMDDVKDEIMLNEIVEPPTIGTSRSAVLKEANSDLLSQYAFSAVDGFDL 
SILTDCLVPQESLNEKDDLWQWDKLFTEVTAEIHSDKVPNMGMKIGPDVIPPTQYT* 
(SEQ ID NO: 642) 



Celera Sequence No. : 142000013384550 

ATGCGTCTGGCACGCGGCTCCTCCTCCGCCAATCCATTTGCCTCCATAATGCGCTGTCTCTGCGAATGAATGTGCTGCCACAGAGAGACTGCGTC 
CTGGTGCTCGCTACTGCCCGGTATGTAGAAGGCCTTGGCATTGCCAATAAGCAGCTCCAGGTCGGCCATCAGATCATCCAAATCGTCGTAGGAAT 
CGGTTTTCAGCTTCTGCTGAACCTTGAGCAGATCAATGGGATTGACCACCACGTCATAGTACGAGGGCTCTTGCCGGCGCTTCGGCACGCGGATG 
AAGGTGTCGCACAGCATTGAACCGTCCTCCTTCTTTATGTTCCTTATGGAATCGTACAATTGCTGGCACAGTTCTGTGGGATCGAGACGGCGCTT 
TTTTCGCGCCGATTGTGTCGTCGTCTGCTGCACCGGTGACTGCTCCGGCGTCGAGTCGTCCAGCTGCAGCGGATCCTCGTCCTGCCGGCTGGATA 
TGCTGCTCGCCCGGCGCTTGCGGCTCAGCATCGTGTTATCCCTGTTCTTATTACAGTTAAAATCTAAATATCCGATAATAAGACGCCACACAAAA 
TGCCACAAATTCTGGGAATGAGCGAAAAAGTAGTGGTGGGCGAAAGGTTCGATAGTGCGCGGTCGATATATAGTTCGATAGCCGGCCACTTGCGT 
GGTGAGTTATCGTAATCGAAGCGCGCGGTGGTGAATATTATTTTAAATGGTATGTTCCCACCAAAAATCATATTTCTTAACCAAATAAACAAGCT 
GAACATCATTTTACTTAAGTTCTTTATTATAGCATCCCTAAAGCATCATTTAAAGTTTTAATTAAGTCATTGAATCACAGCAATATCGTAGGTAG 
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TTTTCCTTAAATTAGCTCCTTCTCGAAAAAAGGCATTCAAGCCAATAGAAACAACGAAGTTAGGCTGAAAAAAAAGTTAAACAGAACAATATTTA 

ATTTAAAAGTCAGCAAAACGCAAGCGCGCAATTAAAAAGCCATGATATTTCGATTAATTTAGTAAAATGCTTTGTTTATTATTATTTTTTATCAC 

ATAAGTTTTAGACGCTTGTTAGTGTGTGATTGTGTGAATATTGGGTTACAGATTTGAGTATGCCGTAAAATGAAATGAACACGCCTCATCGTTTC 

GCTGCTTAAGGAACATTGTTAAAATCTTATAAAAATAACTGAAATATACGTGGATAAATAGCGAAAAACATGTATGAAATGGAATTCACTCTGAA 

CGACAAGAAACGAAATTCTAAACTACGCCTACTGCAAATTGACTGCGGCTTATGTTGCACAATTACATCTTCTATTTACATGCTAGTTATTGGCG 

AGATTTCTGCGAATATCATGGTACACGAGTGCCATTGTTCTGTGTTTACTGGCACCACAGTGATATGCTGGTGCACTGTGAGGACTCTGTCTGTA 

TAAAGTACAACAATTAAGCACAACAGTTAACATTGCAAAATTTAAATATTAGTATCAGTTCTGCACAACTATACGTGATCATATAACATATGTAA 

AGGTATATTATATAGTTTCAGCTGCCCATCTCCCATGTTTTTCTCGTGTCTTAACAGTTCATTTCGTTTGCCTTGCGACTAGCGGTGGTATTGCG 

ACTGCTCTTACGACTAAGGTTACGATACTTGGTATAGAAACTTGTTAGGCTCTTTAGCGGCGACTCGACAAGGCGCCTCACCTCCTGTCGGTTTA 

TACAAAACGCATTGTCGTGTCGCTTGCGCCAATATCAGTTGATGAAGCTAATGTAGTTAGTAAAGAGCACATAGTTCTCCACTAACCGGGCTCTT 

AGCTCCCGGCGCACCGTAGCCGTGAATTTGTTTGCCAAGCACAGGGCAGTTGTGGTTCCAAAAACCAAATTGTAAAGCAATACAATCTGGAAGTT 

TCCCAACCACTCGATGGCTCCAAAATCACCCAGCAGGTCAAAGTTTGTGATGCCTGTAAGAATCAATAATAGTCGCTTTTAAACACTGTATTATT 

CATTAGTAATTTGCCACAGCCTACCTATGATCCTGCTGAGAAGCGGCAATGCCGACGACAAAACCAGCATGAAGCCACAGTTCAGCATTAGCTGT 

GGCAGCGAAGTCTGGCGCCGCTTGGGACAAACCTTGCGCATGAACGGCATGCTGTAGAAGCCCACCACCGAGGTGGCGCCGAGATAGAAGATGAG 

ACACACCTCCAGGCCGGCACCAAAGGGACCCAACTTCGAGAGCGAAGAGATGCCCAATGCGAATTGCTAAATAAGGGAAACAAACTCAATAAGCA 

TTCGTCTTACGATCCCTGCAGTTAGACCTACCCTTGTGCTTAGTGGCAGCGCCTTAATGCCAATAAGGAGCTCTAGTGTATTCTGCACAACCAGC 

AGAATGGTGACAGCTGTGCAGAAGAGCAGCAGCAGCATGGCCAGCGGATAGACGAATGTGCGTTGGAAGGTGCTCGATGTGCGCAGCTTGTCTAG 

TTCCTTGCGCTCGGAGTCCAGTTCTCTTAGTCTATCGTTGAGCTCGTCCACGTCACTGGCCATCGCCTTGCGCTGGTATAGATGAGCCTGGGAGT 

GTTGCAGCAGTGGCTGGGGATAGAAGGTGCGCCCACGTCCCAGACCAATCCGATTGCCAAGGCGACCTCCGTTGGCGGCATCCGCAATGCTCACG 

TTGTGCAGCTCAATGTGCGCTAGCTTCCGCTTCACGCTCGCCTCCTCCATGTAGAAGGCGCTAAACTCCTCGTTGACGTCGCGCAGTAGCATAGG 

TCGCACCAGCACCTGGTTAACAACGCCAAAGAGGCGCACAAAGCCATACGGGGTGCAAACTGTAAGAAAACAAAACGGTTGTCCTGGTTACCAAA 

CTTGTACATGCCTTAATATTAGTATGTAAGATCTAGAACTTATTTCTCGTTGTGTACGAAACGTAAACAAAGGTCTCTTTGACATCCCACTCAAC 

TTAAAAGACATTCTGTTCTGAGCAGTCGAACCAGTCGGTATCTTAACTAAATTTCTATTCCTTACATCTTTAAAATGGCAAACGAAGAGGTTAAA 

G AC AAT G C C C AAC T T AAGGC AAC T AT GT T AAAC TGC AAT T T GAAC C C C AGT GT C GC GG AT G AT TT T G C T G AGT T CG AGG C AAC C C T T GC G AAG AT 

CGATTGTATACTGCAGAACAAGGCACCCTGTGACGATGAAGATTCAAAGGCAGGTGGCGATGCCAAGGAGAAGATCAACTTTGACAACTTGGATG 

TGGATAAGGTGCGGCTTAAAGTGAAAGAAAATCGAACAGTCATCAACAGAAAGTCTCTAGAAGAAGACAATGAGAAGCAAGTCAAGGATATGAAC 

CAGAAGAGTTTCATGGAGCAGGTGGAGAAGGATGCCAATGATCGTGCAGAGGCACGTGCCAAAGCCGAATACGAAGCAGAACTACAGAGAAGGTA 

AACTTAATGGTCTCCCTAAAATTATAATATATTGTATATAAGGATAATTTTTTAGTCAGGGAAACGAAGCATTTCGCAGCCAGAAGTACGAGAAG 

GCAATCCTACATTATGACAAGGCTATCATCAAAGTTAAGGACAGCGCTATTACATATTGCAATCGCGCCTTGTGCTACATCAAGTGAGTTACAAT 

CTTCAAGCCTTATGAGCTATCCTAACAAGCCTTCTTCATCATTTTATTTCCATCTACCAGGCTACAGAACTATAAGCGCGCCCTCAAGGACTGCC 

AGTACGTGTTGGAGAAGCTGCAGGAGAGTAATCTCCGCGCCTGGCTCTACCAGGCCCACGCCTACAAGGGTTTGAAACAAGACGACAAGTTCGAG 

GAAAGCGTGGTCAAGGCACGTGAACACAATCCCAAGCAACTGGCGTACATTGACAAGTACATTAAGCAACTGGAAGCCGATCTTAAAGCACTAGA 

AATATAACTTATTTAATTTTGCATTATATACTCACTCAGCATAAGCATGACACCCAAAAACGAGACACACGAATAAAGAAACGGCAGGTGAACAC 

TGCCCAAATCTGTGAAGCATGAAAAGCGGAAATTAGAAATGCATTGTGCATCAGAATCCAATCCCAATTTGGGGATGGGCTTGTCAGGGGCGCGG 

AAAG AC AAT T GAC AT T GG AAT T GAT CTCTTCCGCACCCGTGCATCATT AC AT TAT AT AC T C AC T T AGAAAC C AG AAG AAT TGAAGC T T T T C AAT G 

CCAAAGACTGCACTTAGGACGGCAGTTAGTACGAGGACAATAATGGCCATCAGCATAAATACTGTAAAGGTTTCGTAGACTCGCGGCAGGATGCC 

CTTCTTGTTGCCGACGAATCCCGTGGATTCCGAGAACAAATAGACAAATGGCAGGAAGATGAACAGTGACAGATTGGAGAACAGAAATACATGGT 

TCCACAAGCCTGAAATTGGCATATTAAACGGGTTAGTCAGTGTTCCTCGGTGGATAATGTAGTACTTACCCTGTATGAGTGAACTGTTTAGCCAT 

TTCACATAGTAACTGTTGGGATAGAGGAGTAGCACCTCATTGCTGGCAATGGAAACGGGCAGCAGCATGGCAGCCCCTTCGGCAACGGCCAGTGT 

GAATGTGCACAGCCAAAAGCTATGGGGTTGGTTGGGTGGAAAAGGCAAAAGCAAAGAAAACCATTCATTATTGCGAGTGCAAGTCAAGTGACGCG 

GCAATTAGCGACACAGGTTGTTGACCACTGGATGCCTCACCTTATGCGATAGACCAGCACCTCGTCCTCGTCATTCGAATAGAGATCGTCACGGT 

CGCGCCGCCTAAAACGTGAGACGACCACATAGGAGCTGGAGTACAGCAAGATTATCAGTAGCAAGAATATCTGAAAAGAACAATGCAACATGGCA 

AGTTAGTAAGCTTTATGGTTTTATCAGCTAAGTAATATTTTACTATAATATAATCAAATTTATTTTCCCTTAGAAACTGCAACATTTCGATCGCC 

CTCCTCAGGTTGGTTAATTACGTAAAACCCAAAGTAAATTTAGACTCGATTGTCTTTTTTGTTTTATTTTATTGAGGTTTTCATAGCACATTTCC 

CCACATATTCTTGTGTTTAAACAAATGTCAAAAAGTAAACACATCACCCACGCATCTCCTTTGTTTTGTTTTTATAAATATTTGTCATTACCGCC 

AACTGTGTGCACATTAGCCAAAAACATTTTCGATTCGTTAAGTTAGGGAAATCTGTATTAATCTAGCTCGGTGAATATATTACTTATTGATACTG 

ATATGTGTACATGATTCAATTTAGTGTCAAAGTCTGGCATTTTAAGCGTTTCAGGGTCTAATCGTCAAAAACATTGCCAGAGTGATTTGAACTTT 

TGTTTGCGGACATTTCAATTTTTCAATTTCCGGTATGTATGCTTAAAATAATATTATGTATGTATATAGAACAGGTACTCAGGTTTCTTTTGCCT 

ACAACCCTATATCACCATAAACCACTTGGCAATCGATGAAATATTATGACCGTCAATTAGACCAATAAAGTTAATTTCGTCTGCGTTCAAAGGTC 

ACAAAAGAAAGCTTATCTGTGGCCAACGACAAAACATGAAACATTTGATAAGAACTAAACCGATAGCGGTCATTTAAATCCCATTGAATGTCAGC 

AACATTTGATTATCCGTACTCTTGACTATTAAATCCGAGTAGGTTGGCAATTTGAATCTTCATCAGCGATTTCCAAACATTGGACCCATCATTTG 

CAGGGTAAATATATTTTTGGGACCGCGAAATATAGATGCCAATCACATGGCACACGTCATATTTTCGCTTTTTTTGTGTATGCGAAAAGCGAAAA 

CAAAACAAAACGTAAATGGATACGAAAATACAATAGTCTAGGCGCACAAATAAATCATTAAGTGAAGATACTTTTCATGTTATAGCTAGCAAAAA 

AAAACTTGGGGAAAAAATGCAGAGAGAGCAAAAAACATTAGTTTATACTCACGTCCCTGCTGTGTTTTGCTATTCTTTGTTTGCTATTCGAGATA 

TTTTCAGTGGTATCTGTGAGATACTCTATTGCGTTATACAACGCGAGTTTTAATTAACTTATAACTTTATTCTTCTGCCTGCATTTGACCAGATC 

AATTAAATGAAATTTTCTTTAATTGCTCGCAGCGTTTCGCATTTCGGGATTCCAACCTTTCATCTTGCCATTCGAAATGACCTTCTTTTCAGTCA 

AC AT T T AAAC A GT T T T CT C T AC AC AGT AG AC AAAAAAT AT G AT C AAAT AT T AAAAAT T AAAT AT A G AG AC GT AAAT C GC C T GC T AT C AAC AG C AG 

AGTGCGTCGCTTTATTTATAGAACCCAAGTGTAGATCATGGAAAAATGTAGATACACTGAATGGCAAAACTATAAGGTTGCAGTATAATTAAACA 

TCCAAAAGTACACATTTTGCGCATCTAGATAACGAGGCATTCGAATACTAATGTGATATGCACAGAGCATTTTTATAGAATCTAGGCCAGCTATG 

CAGCTATGAAACGCAGTGTATTTTCTAGACTATGACGTGTTAAACTTGAAACACATAGGCGACGAATACCAACTGATATGAGAATCTTATAATAT 

GTACGCCAGGCATTCCTGAATGGAAACAATTGAGCCAGCGATCTTCAAGCTTAGTTTAGTGCTTGTCAGTCGAGAAAACGTGCAACTGCAATGCG 

CAGCCCCAAGTGGCATTTGTCACGCTCCAAGCGATTTGGATTTGGCCAAACTAAACGATCTGTGCCTCGGAATTAGCCGCACTTGCATCGTAGGC 

GAACGACGAGTCAATGACTTGACATGAACTCGAGGGGGAACACAACACGCCCATTGAAATGTGGACGACGATTTGGCTAAAAATAGCAGCTACGC 

CCAAAATGCAGCCAGAAGCCAGCGAAAACGACGACACTGTGGCCAGGCCAAAAGGAGAAGGGAAGAGGATCGCCGGTCGGACGGGTTTTGGCTTG 

ATTTGTAGACAACTTTGCCGGTGAATGGGCTGCG7VAATATTTATACTTTTCTATACCCAATAGTTGTGCCGAACAGGGGTATATATTGTATCCAT 

ATGTGCTGCAAGGTGGGTTGGTCTACGCCTATTGATACCTCTTCATTTTATCATTTCAGATAATTTTTTAGAATGTAGCGTCTCTTCAGATTCTC 

AAAGTGGAGAAGAGCTGTATGAGTCGTCCCAATTGTTTTAATGACTCCAAATAAAAATGTGTTTTCTAATGAGGGGAACTGCATTTGCAAGACTT 

TTGGTCATGGTTTAAGTTACCCAATTCAGTTTCTTCATCAGACTTTTCCCTAATCACTACTTGGGAAATGACTAAGAACATAAGTGTGGTGTTAC 

ACAATAAGTTTGTATGTATTTTCTCAGAATTTCCTGCTACTACACTTGGCATTAGTGGTTTTAATTCGGGTTTGGCTTTAGGTTCCTGAATTATT 

TATGAGAGCTTTGCGTTGTTTGCATTTCATTATTGCATGGAGTCCCACACATCACGTATTCGGTATGTTGACCCACATTCTGACGAATTATTCAA 

TTTTCGGCCATCCAATGCAATCAGAACCGCTGACTGAAACCAGTTTCCATTTCCTGTTCCTGTTCCAGTTCCACAGAATCATCCTCAATGAAAGA 
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AATGTGTATTTATAGACGTACAAATCATAATCGGTGGCATAAAAAGCAACCTGCAAGTGTATTGTATCTACGCCAGTCTATGCCAACCCTTTCAA 

ATGTGTCCATTTAACGCCTCGTCTGCCATTGTTTAAACAGATCCAACAGCTGATCACGACAGCAGACACTTTCAGTGACCCAGACGGTACCAAAT 

CGTGACCACTAGCCGTGCACTTGAAATCTGGCCTGCAAGTGTTTTGGCCATTTAACCGAATTTTCACAAAAGGAAATTAATGCCTTGCGCAATAT 

AATAAACATGTGTTTTTATTCGATTCAGTCTGTTGTTTACATAAATCACTGGTTTCGTTTATTACACGCCCAACATTTCTATAAATAGTAGCTCG 

AGTGAAACACTTTCACTTGTTGCTTTTGCTTATGTTGCCTTTCTTTGTTTTTGCCTGAATTGTTTAAAATTTGATTTCAATTGAAAGTTCTACGA 

TATTGGTACCAATTAAGTGAAGTTTGTTGATTATCTTATCAATACCTTGGACTTCTAGTTACCTTCCATTTTTCAAATAGCAAAACCTAAGCATT 

CTGATAGGGCTTATGAACTCACAAGAACCAATTAACTTAAGGGCAACATTACTGGGAACTATTAATGAAACTATTTAAGCCTGCCTTCCATTTGG 

AT AT T GG AAT T T AA AT AT T C AAC AAGAAT GG AG AGT AAT ATT T T AAT G AAAT T T AAAAC AT AT AT T T T AAG AAC C T AT T G T T CT T AAT T AAT C T A 

ATTGTAAACAAGAACATGTAAAAAAACATTTGTTTATAAAGTTCTAATAATAAAATAGAAACGAATAAAGCTTGGTAATGGTAGCAAGTATCTCG 

CCGTCGAATCATCGATTGTGTCTAAACTCTTTGATTAACAACAGGTGTCCGTCCGTGTGGGCCACTTTTGGTGTGTCTGCGTGATAATTTTCGCA 

GCTCAAATAGAAAGTGCAAGAAGAGAGTGGCTATCGCCGCCACTTCCTTCCCCGATTTTATATACAAAGCCCAAACGCAGGCGCCGAATGTGCTG 

ACCACTGTAGTCCGAAACCGATAACAGAACCATAAACAAATACATTTCTGTAGACGACCAGCAACTGATTTCTCTGTTTATCAAAAGACCAGGCG 

AAGTGTGTGTAATTTCCAGAAGTCGCACGCATTTTACACCCAGCCACATGTACAGTTAGAACAACAACAATAGGGACAGCTGCGCCCTGAGTTCT 

TGTTGTTGTTTTATGCGTAGTGGGGTGGGAAATTGGGTGGAAAGGGGATGAATTCAATCCAGAAAAACACTTACAATGTGCTCGCGTACTGTATT 

GTGGAACAATTGCAGCTTTAGATCCGTAACCTCCTCCTCCTCCTCTTCGTCCATGCCCACGAAATATTTGGACAATTTATCGCTGGTAGCCAGTT 

TTCACAATTTTTCCTAGATCACGCCTCACTTAAAAATGTTACATACACATTTGACAGTCGCCAGAGTCTGTCAAATTCCTATGGGCAACAGTCCT 

TCTTAGTAATGCGCTCCAGGTTACAAATTTTCTGAATTAAAGTCGCTGTGGCGCTGCACTGTCTGATTTCGCACGGTAATTGTGATTAATTTAGT 

TAACTTTTTGATGCCGCATTGCCTTTCGACCCGTTTTGCGCTGGTCACTAATTTTCTACTAGCATTCGAGCCGATTTACGACCGAAACGTTAAAA 

GATTTCTATAAACTGGCGTCAGATTGACCGCATCTGCTAGAGGTGGACAATTGACGATAGGTCGATAGCTTTCGATGTATGTGCTTACGTTTAGA 

AAGG AAAAT AT T G AAT CT CT AAAAAAAAAAAAAT T G AAT T GGAAAG AAAAT AC T GAAT T T AAAAT T T T T AT TGAC C T C T AT T AT T AGAG AAAAAA 

ATATTATACCAATGCAATATCTTCAAGTTAAGAAAACTGTAACCCTAAAAACTATCAATTGGTTCAGACATTCTGGCAACCTTACATCTGTCAGC 

TGTTACCATACAAAAATGTCCAATTTGCGGCGCCAGGTAAATTAACTTAAAATCATAAATAATCGGTTAATATTGTTTGTATTCCAGGTTCTCAG 

TGCTTTCAAGAAGCTCCACCGTACACGACAATACGTTTTTCAGGGGGATGCCAATGCTTTGGCGGCGGGACGACTAAAAATCAACGAATCTTTCC 

T G C AG AAT C GT AAT G AAAGC AGC GAGG AT GAAAT T C AAAAGGT T T AAC C AC T T T T T GT AAAT T C T AAC T AAGT AC AT AT AT GG C AAT T TG GC AAA 

TTACAGATGATCAAGCTGGCGCAGGACGTGGATTTGGAGCTGCGAACGAATGTCATCCAAGCCCAGAAAAAAGAGGATGGTGTTTATGGTCAGTG 

GGGC ATGGC GAT T ACATAAAAGAGCGCTCAAAATT AAAAT TTGGCT AT ATTTACAGAACTAAGGATCACGCCAGAGACAACGCGACTCGACAATG 

TGGTCTTTAATCCTGACGCAATTATCGAAAAACCGCGCAGGCAACGCGGCGACAAAAACACTGAAGGTTGCTGCGGTGGCGCTGCGATGGCGGCG 

CTGGAAGCCGAAGTCCAGGCCCGCAACAAATAAAAATATATTCAATATTTAGCTGTAGTCAAGATGGTTTTGTTCTGATTAATTACGTCGTTTAT 

CTTTAATTTCTATTATGTTTCTTCTTCGATTTGTGACTACTCTTTGACTTGGACTTCTTTTGCTTGGATTTTGTACTCTTTTTGCG 

(SEQ ID NO: 643) 

Exon: 9346.. 8910 
Exon: 4820.. 4696 
Exon: 4579.. 4440 
Exon: 4379.. 4149 
Exon: 3999.. 3931 
Exon: 2814.. 2312 
Exon: 2251. .2020 
Exon: 1953. . 1001 

Start ATG: 8984 (Reverse strand: CAT) 
Transcript No. : CT18210 

CAGATGCGGTCAATCTGACGCCAGTTTATAGAAATCTTTTAACGTTTCGGTCGTAAATCGGCTCGAATGCTAGTAGAAAATTAGTGACCAGCGCA 
AAACGGGTCGAAAGGCAATGCGGCATCAAAAAGTTAACTAAATTAATCACAATTACCGTGCGAAATCAGACAGTGCAGCGCCACAGCGACTTTAA 
TTCAGAAAATTTGTAACCTGGAGCGCATTACTAAGAAGGACTGTTGCCCATAGGAATTTGACAGACTCTGGCGACTGTCAAATGTGTATGTAACA 
TTTTTAAGTGAGGCGTGATCTAGGAAAAATTGTGAAAACTGGCTACCAGCGATAAATTGTCCAAATATTTCGTGGGCATGGACGAAGAGGAGGAG 
GAGGAGGTTACGGATCTAAAGCTGCAATTGTTCCACAATACAGTACGCGAGCACATTATATTCTTGCTACTGATAATCTTGCTGTACTCCAGCTC 
CTATGTGGTCGTCTCACGTTTTAGGCGGCGCGACCGTGACGATCTCTATTCGAATGACGAGGACGAGGTGCTGGTCTATCGCATAAGCTTTTGGC 
TGTGCACATTCACACTGGCCGTTGCCGAAGGGGCTGCCATGCTGCTGCCCGTTTCCATTGCCAGCAATGAGGTGCTACTCCTCTATCCCAACAGT 
TACTATGTGAAATGGCTAAACAGTTCACTCATACAGGGCTTGTGGAACCATGTATTTCTGTTCTCCAATCTGTCACTGTTCATCTTCCTGCCATT 
TGTCTATTTGTTCTCGGAATCCACGGGATTCGTCGGCAACAAGAAGGGCATCCTGCCGCGAGTCTACGAAACCTTTACAGTATTTATGCTGATGG 
CCATTATTGTCCTCGTACTAACTGCCGTCCTAAGTGCAGTCTTTGGCATTGAAAAGCTTCAATTCTTCTGGTTTCTAAATTTGGGCAGTGTTCAC 
CTGCCGTTTCTTTATTCGTGTGTCTCGTTTTTGGGTGTCATGCTTATGCTGATTTGCACCCCGTATGGCTTTGTGCGCCTCTTTGGCGTTGTTAA 
CCAGGTGCTGGTGCGACCTATGCTACTGCGCGACGTCAACGAGGAGTTTAGCGCCTTCTACATGGAGGAGGCGAGCGTGAAGCGGAAGCTAGCGC 
ACATTGAGCTGCACAACGTGAGCATTGCGGATGCCGCCAACGGAGGTCGCCTTGGCAATCGGATTGGTCTGGGACGTGGGCGCACCTTCTATCCC 
CAGCCACTGCTGCAACACTCCCAGGCTCATCTATACCAGCGCAAGGCGATGGCCAGTGACGTGGACGAGCTCAACGATAGACTAAGAGAACTGGA 
CTCCGAGCGCAAGGAACTAGACAAGCTGCGCACATCGAGCACCTTCCAACGCACATTCGTCTATCCGCTGGCCATGCTGCTGCTGCTCTTCTGCA 
CAGCTGTCACCATTCTGCTGGTTGTGCAGAATACACTAGAGCTCCTTATTGGCATT7AAGGCGCTGCCACTAAGCACAAGGCAATTCGCATTGGGC 
ATCTCTTCGCTCTCGAAGTTGGGTCCCTTTGGTGCCGGCCTGGAGGTGTGTCTCATCTTCTATCTCGGCGCCACCTCGGTGGTGGGCTTCTACAG 
CATGCCGTTCATGCGCAAGGTTTGTCCCAAGCGGCGCCAGACTTCGCTGCCACAGCTAATGCTGAACTGTGGCTTCATGCTGGTTTTGTCGTCGG 
CATTGCCGCTTCTCAGCAGGATCATAGGCATCACAAACTTTGACCTGCTGGGTGATTTTGGAGCCATCGAGTGGTTGGGAAACTTCCAGATTGTA 
TTGCTTTACAATTTGGTTTTTGG7AACCACAACTGCCCTGTGCTTGGC7AAACAAATTCACGGCTACGGTGCGCCGGGAGCTAAGAGCCCGGTTAGT 
GGAGAACTATGTGCTCTTTACTAACTACATTAGCTTCATCAACTGATATTGGCGCAAGCGACACGACAATGCGTTTTGTATAAACCGACAGGAGG 
TGAGGCGCCTTGTCGAGTCGCCGCTAAAGAGCCTAACAAGTTTCTATACCAAGTATCGTAACCTTAGTCGTAAGAGCAGTCGCAATACCACCGCT 
AGTCGCAAGGCAAACGAAATGAACTGTTAAGACACGAGA7AAAACATGGGAGATGGGCAGCTGAAACTATATAATATACCTTTACATATGTTATAT 
GATCACGTATAGTTGTGCAGAACTGATACTAATATTTAAATTTTGCAATGTTAACTGTTGTGCTTAATTGTTGTACTTTATACAGACAGAGTCCT 
CACAGTGCACCAGCATATCACTGTGGTGCCAGTAAACACAGT^ACAATGGCACTCGTGTACCATGATATTCGCAGAAATCTCGCCAATAACTAGCA 
TGTAAATAGAAGATGTAATTGTGCAACATAAGCCGCAGTCAATTTGCAGTAGGCGTAGTTTAGAATTTCGTTTCTTGTCGTTCAGAGTG7AATTCC 
ATTTCATACATGTTTTTCGCTATTTATCCACGTATATTTCAGTTATTTTTATAAGATTTTAACAATGTTCCTTAAGCAGCGAAACGATGAGGCGT 
GTTCATTTCATTTTACGGCATACTCAAATCTGTAACCCAATATTCACACAATCACACACTAACAAGCGTCTAAAACTTATGTGATAAAAAATAAT 
AAT AAAC AAAGC AT T T T AC T AAAT T AAT C G 
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(SEQ ID NO: 644) 



Start ATG: 363 (Reverse strand: CAT) 

MDEEEEEEVTDLKLQLFHNTVREHIIFLLLIILLYSSSYVVVSRFRRRDRDDLYSNDEDEVLVYRISFWLCTFTLAVAEGAAMLLPVSIASNEVL 

LLYPNSYYVKWLNSSLIQGLWNHVFLFSNLSLFIFLPFVYLFSESTGFVGNKKGILPRVYETFTVFMLMAIIVLVLTAVLSAVFGIEKLQFFWFL 

NLGSVHLPFLYSCVSFLGVMLMLICTPYGFVRLFGVVKQVLVRPMLLRDVNEEFSAFYMEEASVKRKLAHIELHNVSIADAANGGRLGNRIGLGR 

GRTFYPQPLLQHSQAHLYQRKAMASDVDELNDRLRELDSERKELDKLRTSSTFQRT FVYPLAMLLLLFCTAVTILLVVQNTLELLIGIKALPLST 

RQFALGISSLSKLGPFGAGLEVCLIFYLGATSVVGFYSMPFMRKVCPKRRQTSLPQLMLNCGFMLVLSSALPLLSRIIGITNFDLLGDFGAIEWL 

GNFQIVLLYNLVFGTTTALCLAKfKFTATVRRELRARLVENYVLFTNYISFIN* 

{SEQ ID NO: 645) 

Classification: known_f lybase_gene 
Gene Symbol: BcDNA;GHl2663 
FlyBase ID: FBgn0027539 



Celera Sequence No. : 142000013384830 

CTCCTCCTCCTTACCGTGGTCGCGCATCGAGTTGCGCCCCATAACCACCATTTTGCACATAGTCTCTTCCTGCAGCTGCCGCAGTGTGTTGCCCT 
TGGGCCCCAGGATCTTGCCAACGAAATTGAACTTGGGGTACTCCTTGATGGGGAACAAGACTTTCTGGGCCACGCGGATGGGCTTTTCGCTGTAC 
ACGTTGGCATATATCTCGGGCTTTGGTATGCGTCCGCTGACCAAGATCTTCTCCACCTCTGCGGGGCCATGAAAAATTGCAATTAGATCCACTAT 
AGCTTACATGTCGCAGCGCCTAACTCACCGTCGTCAAGTAAGCGCTTGGTGATGATGTGCTTCTTCTCGAGCGTCTTCTTTTCCAGCAGGCACTC 
CTGCAGATACGCGTTGGTCTTCTCATTCAGCTGGGGCGTGTCCCGCAGGGCGGCGTGGCTGGTGGTGTGTCCGCCGCCATCCGCCTCAGGGGTCT 
CTGGAGCACCGGTAGCCGCTGGCTCCTCGTCGCGTCGCTTCCGCTTGTAGTCGGCGGCGTCCTCGTTGTAGTCTCTGTCGTAGTCGCGCGGCATT 
GCGTTTTGTAGGATTTCTGGCCTGGAGGATTCGTGAATTTTAAAACTCCTACTCTTGAATACAAAATTCTACTATTGCAGAATTAGTTGGCTGAA 
AAAATATGTATGTGGTAGGACATCGATAGGTAAATCGGCATCACCGATTCGATAGCGTATTATCGGTTACTGCCGATAACAACCCGCCATCATGT 
GCATTAGTAGTTTATGACTTATCAATTTTATTAAAAACTTTCCCATAACAAAGAGAGCATTTAAATTATATTTGTAATTTAAAAATCACTAAGTG 
CATATTATAAATTATTGCCATGCACCTGACGACGCATCTATGTTAATATATAGCGTTAGATAGTGTCTGCAAGCCGATAACAACTATGCGCCCTT 
AACAGCACCGTCGATTGGCAGCGAACAGCACGCTGCAGAAGCACGAGAGACAGCCCGAGAGACCGGAGTTGCCGGACCGAGCCGCTGACATAGTA 
AATGTCCGAATAAAATACACACTTGGAAAAAGAGAGAAAAATTAGGGCCATTGGGAAGGCTGAAGTGTGCTAAGTTAGCAATCATACCGAATCGA 
CAGCAAAAGCGATCGCCCGCAATCTGCGCACGGAAAATCCGAGACTCGGCGACTTAGCAGGACGATAGAGCGACAGCGATCCCGAAGGCAGGACG 
ACAAATGGCGGAGAACGGACTGCGCGTGCCAGGGTCATCCGCGGTGGCCTCTGCGGTGGCTTCCACTGCGACGGGAGTCGGCGGTGGATCAGCAG 
CGGGAACTGCGGGGGCCGGGACAGGAGTGGTGCCCACCGGCGTCAGCGGCGGGACGAGCAACGGGGAGCATGAGAACGAACACAACGCCAACGCA 
G AC GGC GAG AAG GCC C AGC C GGCGCCGGCGGTC C AG AAGT AC AT GC AGG AGC TC AT G AC GG AGCG AT C GCGC AT GG AAAACC AC TTCCCCCTGGC 
GGTGAAGCTAATTGACGAAGGTAAATTGATTTGTCAGGGGCACTTCATGGTAATTACTTTGATTTCCCTCTGTTTTTGCAGCTCTGGAGCGTGTG 
CAGCTAAACGGACGCATTCCCACGAGAGACCAGTACGCCGATGTCTACCAGCAGCGCACCATCAAGCTGTCCCAAAAAGTGCACGTGCCCATCAA 
GGACAAGAAGTTCAACTATGTGGGCAAGCTACTGGGGCCCAAGGGCAACTCACTGCGTCGCCTGCAGGAGGAGACGCAGTGCAAGATCGTCATAC 
TTGGTCGCTTCTCAATGAAGGATCGCGCACGTGAGGAGGAGCTGCGCAACTCCGCGGACGCGAAGTACGCCCACCTGAATCTTCCTCTGCACGTC 
GAGGTGTCCACCATTGCTCCGCCCGCCGAGGCGTACGCTCGGGTGGCGTATGCCCTGGCCGAGATTCGGCGGTACCTCACGCCGGACAAGCACGA 
TGACATCCGCCAAGAGCAGTACCGCGAACTCATGGAGGATCCGGAGGCGGCCAAGAAGCTGACGCTTCGCCAGCTGCAACAGCAGTCGAATGCGG 
CAGCTAGTGGTGCAGGAGGAGGAGGCGGTGGTGGAGGCGGAAACGGAAATGGCGGCGCAGCTGGATCCGGCAGCAATAATGGCAATGGGAACCAG 
CGCTCCGGCGGCAACTACAGGTGGGTTGCAGTTCAATGTCCTCTACGAGCCTCGAGTCTAGCTAAACTCCCATGTTTCTTACAGACAAAAGTTTC 
AGCAACAATCACACTATCGCCACAACGAGGAGACTGTCTATTTTCGCTCACACAACAATGCGTACCACCAGCCAAAACCCTATGTGCCAGGTAAG 
TCTGGGCTCTCCTGGTCCATAGACCATAGATTCTATGTTCGCTTTTGCAGCCGCCCAACGGGGCAATGCCATGCACACCACCTTGCCACCGCAAG 
CAATTGTGCGAGCCTCACCGCCCGGAATGGTCGTCAGTGCGGCCAGCTTCAATGGCCGAAATGCCGGACTGGGTAATGCCGGCGGTGGCGGAATC 
ATGGCCAATAGCATCGTGGCCACCACTGGCGGTGGCATCGGTGGCGCTGTGCCGCCTAACATGCGTTACCGTCCCGCTGCCCCCTACCAGTTTGT 
CAAGAAATAATACAGATGGGAGGATGCACCGCCCGCTGCCGTTCGCTGTTTGTTCAACTATTCCTCACCATCATTGATGCCTCATCCGGAAGTAT 
TCACGCCCATCTGTCATACCCATCCAAACACCCACAACCACACACAGCTCCAAAGGTAGATGTAGTAACTATGAGAACATTGGAAACTTGTCATG 
CATTGATTTCATCAAGATTTCGGCAGGGTGGAAACCAAAACGAAACGAATGAGTAAATCTATGTTTAAGAAAACTAGCCTAAGTAGACAAATGCT 
CCGAGGAAC AGACGAACACCTCAGCCACCCAACC AAAACC AAAACTAACCGACC AG AC AAACACCCACCACCCCACACGCACGCAACCTCTTCTC 
GAGGATTGGGTTCGAAGGATCACGAGGGTCTGCGATCTAAGACTTTGGAATCCGTGTGTTGGGCCCTGCGAGCATTTGGAAGAGAAGAATTGAAG 
AAGAAGAACGAGGAGGAGGTGGAGAACGCCTCGATGTTGGATCGGGCCGAATCGAATGGACGCATTCAAGGTTGGTCCCAGTCGAGCCAAATGTC 
CTTGTTTTGTTTTGTCGCATATTATTAACATTTTAATTTTACCACTGACGCGTGCGAAGATTTCATCACTTTGTAATTTATATAAATGAATATAT 
ATATCTCCTTATGCTTACTATCATACGTATCCTTCTCGCAGTCGCTTTAACTCTATTGCTTTTCCAACTCTGCGTTTAAGTTTATCCCATTTTTT 
GTTGTAATGCGTTTTGTATTATGTTTACGTTTAGAAGAGGAGGCTTCTTATTTTTATTTGTTACGTCGCCAGGACAGAAAAAGCAACTGCAAACG 
ATCATGAATAATATATATAACAAGTATATACAAGCGGTATGATTTATAACACATAGTGCATATGCAACACCACACGATCATACGCGCGTACGCAT 
ATTCAAAACATCAGCATATTCACCTAAAGCCGTAAAGTTTAAACCAAATGAACATTAGAA 
{SEQ ID NO: 646) 

Exon: 1001.. 1540 
Exon: 1602.. 2205 
Exon: 2270.. 2370 
Exon: 2426.. 2670 
Start ATG: 1240 

Transcript No. : CT18271 

CAGCCCGAGAGACCGGAGTTGCCGGACCGAGCCGCTGACATAGTAAATGTCCGAATAAAATACACACTTGGAAAAAGAGAGAAAAATTAGGGCCA 
TTGGGAAGGCTGAAGTGTGCTAAGTTAGCAATCATACCGAATCGACAGCAAAAGCGATCGCCCGCAATCTGCGCACGGAAAATCCGAGACTCGGC 
GACTTAGCAGGACGATAGAGCGACAGCGATCCCGAAGGCAGGACGACAAATGGCGGAGAACGGACTGCGCGTGCCAGGGTCATCCGCGGTGGCCT 
CTGCGGTGGCTTCCACTGCGACGGGAGTCGGCGGTGGATCAGCAGCGGGAACTGCGGGGGCCGGGACAGGAGTGGTGCCCACCGGCGTCAGCGGC 
GGGACGAGCAACGGGGAGCATGAGAACGAACACAACGCCAACGCAGACGGCGAGAAGGCCCAGCCGGCGCCGGCGGTCCAGAAGTACATGCAGGA 
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GCTCATGACGGAGCGATCGCGCATGGAAAACCACTTCCCCCTGGCGGTGAAGCTAATTGACGAAGCTCTGGAGCGTGTGCAGCTAAACGGACGCA 
TTCCCACGAGAGACCAGTACGCCGATGTCTACCAGCAGCGCACCATCAAGCTGTCCCAAAAAGTGCACGTGCCCATCAAGGACAAGAAGTTCAAC 
TATGTGGGCAAGCTACTGGGGCCCAAGGGCAACTCACTGCGTCGCCTGCAGGAGGAGACGCAGTGCAAGATCGTCATACTTGGTCGCTTCTCAAT 
GAAGGATCGCGCACGTGAGGAGGAGCTGCGCAACTCCGCGGACGCGAAGTACGCCCACCTGAATCTTCCTCTGCACGTCGAGGTGTCCACCATTG 
CTCCGCCCGCCGAGGCGTACGCTCGGGTGGCGTATGCCCTGGCCGAGATTCGGCGGTACCTCACGCCGGACAAGCACGATGACATCCGCCAAGAG 
CAGTACCGCGAACTCATGGAGGATCCGGAGGCGGCCAAGAAGCTGACGCTTCGCCAGCTGCAACAGCAGTCGAATGCGGCAGCTAGTGGTGCAGG 
AGGAGGAGGCGGTGGTGGAGGCGGAAACGGAAATGGCGGCGCAGCTGGATCCGGCAGCAATAATGGCAATGGGAACCAGCGCTCCGGCGGCAACT 
ACAGACAAAAGTTTCAGCAACAATCACACTATCGCCACAACGAGGAGACTGTCTATTTTCGCTCACACAACAATGCGTACCACCAGCCAAAACCC 
TATGTGCCAGCCGCCCAACGGGGCAATGCCATGCACACCACCTTGCCACCGCAAGCAATTGTGCGAGCCTCACCGCCCGGAATGGTCGTCAGTGC 
GGCCAGCTTCAATGGCCGAAATGCCGGACTGGGTAATGCCGGCGGTGGCGGAATCATGGCCAATAGCATCGTGGCCACCACTGGCGGTGGCATCG 
GTGGCGCTGTGCCGCCTAACATGCGTTACCGTCCCGCTGCCCCCTACCAGTTTGTCAAGAAATAA 
(SEQ ID NO: 647) 

Start ATG: 240 

MAENGLRVPGSSAVASAVASTATGVGGGSAAGTAGAGTGVVPTGVSGGTSNGEHENEHNANADGEKAQPAPAVQKYMQELMTERSRMENHFPLAV 
KL I DEALERVQLNGRI PTRDQYADV YQQRT I KLSQKVH VP I KDKKFN YVGKLLGPKGN SLRRLQEETQCKI VI LGRFSMKDRAREEELRN S ADAK 
YAH LNL PLHVEVSTIAP P AE A Y ARV A Y AL AE I RRY LTPDKHDDI RQEQ YRE LME D P E AAK KLT L RQ L QQ Q S N AAAS G AG GGG GGG GGN GN GG AAG 
SGSNNGNGNQRSGGNYRQKFQQQSHYRHNEETVYFRSHNNAYHQPKPYVPAAQRGNAMHTTLPPQAIVRASPPGMVVSAASFNGRNAGLGNAGGG 
G I MAN S I VAT T GG G I GGAVP PNMR YRP AA P Y Q FVKK* 
(SEQ ID NO: 648) 

Name: KH domain RNA-binding protein 
Clas si f ication : RNAJbinding 
Gene Symbol: qrk58E-2 
FlyBase ID: FBgn0022985 



Celera Sequence No. : 142000013384530 

TAATTAGATACAAAAGGATAGGACGTTAAGGATTTCATCATATAATCGCCAAATTTCCTAATTCATTTCCTAGGCCTGGCCAAACCCAATACCCT 
GATAGCCAGCGTCCATCCGGCAGATAAGCTCTCCCTGCAGTCCTTTCAATCCCTGGGAGTGGAAACAGTGATCAAGAATGCACCCGTTGTGCAGC 
AATCGGATGTGGTCTTCGTGTCCGTGAAACCCCAAGTGGTTCCCTCGGTTTTGTCGGAGATTCAGCCATTGAGCTCGGGCAAACTATTCCTTTCC 
GTGGCCATGGGCATCACTTTGTCCACCATCGAGTCCAGTTTATCTCCACAAGCTCGCGTCATCCGTGTGATGCCCAATCTGCCGGCTGTCGTGTG 
TTCCGGTTGTTCGGTTTTCGTTCGCGGATCCAAGGCCACAGATGCAGATGCGGATATTACCCAGAAGCTGCTGCAGTCCGTGGGCACTTGTGAGC 
CGGTGGATGAGTCCCAGTTGGATGTGGTTACTGCTCTGAGCGGAAGTGGACCGGCCTATGTATTCGTGATGATTGAGGCCTTGGCGGATGGAGCC 
GTTCACATGGGCATGCCCAGGGATCTGGCCTATCGTTTGGCATCGCAAACAGTCCTGGGCGCAGGTCACATGGTGCGGGACAGTGGTATGCATCC 
CGGACAACTCAAGGATGGGGTCACAAGTCCGGCGGGATCGACAGCAGCGGCACTCAGACAACTCGAGCTGTCAGGTAAGTTGGGATGATGACGCC 
GATGACGCACTCGCTTATCGCCGCCACTTCAACCTTTGATGTCGATTGCAGGTTTTCGGGCTGCGGTGTCTGGAGCGGTGGAACAGGCGACTCTG 
CGATGCCGGCAAATCTCGGGAAAGACGAAGTAGGCACATAGGATTATCATATACATATACGTACATTGGGGTGGGTCAATTTTGTTATTAAAGTT 
ATAGTAGTTAAAGGAAGTCGTTAAATTTTGTTGCTTCTTGAAGAAACATGTTTTTAATTTGGAAATGTATGTTGGTGCTATAGTGCTTTCAAAGA 
GCAATGATGAGATGTTTCATTTTAATCGACCCACCCTTGTGCTTAATCTGGGTTAGTAGCTTTGCATAACTATAAAAACTGTGAAGGCCGATTTC 
GTGATACTAAAATATAAGCAACACTGCTATATTCTTTACTCTCAGCTTTGTTTATAGTCTAGCTAAAAATAACTAATTCAAAGTCGAGCCTAGAA 
CTTCTTCTTCTTTTTGGCCCGGGACAGCTCATTGAGCTCGCCCTGTTGCACGGATGCCTCCTGGTAAATGCGCTTAACCTTCTCGTGGCGCTCCT 
TGTCACTGTGGATATGAAATACAATAGATTAGAAAGGAACGGTTAAATTGTAAATGGAACACAAACCTTTGCAGGGTCTGTTTGAGCCTAAGCAT 
CTGAACGAAGGACGTATCTCTGCGTATTTCTCGAATGGCACCCTTCTTCTCGCGCTTGATCTTGTGCAGCAGTTTGGCCCGCTCTTCCTTGGCCT 
TGGACATCTTGGGACGACGTTTGTCATCGTAAACCGCCTCGAATCGAGGTTCCAGCAGCCTAAGAGCCTTGGGTTTCTTTTCCGCCGGAGCCAAG 
GGTTTCATTTTCTGCGCAGCTAGCTTCTCCGCCAGTTCGGTGGCATCCTTGTGGTGCTGATGCACATGCTCGGGGTAAGACTCCAGTGGAAGGCG 
GGAGAGTAGTGCCAAAAAGGGTTGGGCCAGGTAGCAGGCACCCACATGCTCTTCCACCAGTTGGAGAGCTTCCTTGATTAGCAGGAGGGAAGTGT 
CCAGTGCTCGAACCTTGAAATCTGGAGTGATCGTCTGGGTGACCAAGTCAGTAGGCTGCAGCTGTTGGGGCTCCAACTTCGTGGATTCCGTGTTA 
GCTGGAAGTGCTAGGAGCTTGCTCAATGGACCATCACGCTCAAAAGGCGGCGTGATCTCCAACTGTTCCACATCCCTCTTCGGTATGGACATGTG 
TACAATGCCCTGCAGGAAGTTGAATACGGCTGGAACCAGTCTCTTGGATTGGGACACAAACTCCAAGACTACGGTGACCAGGAACAAACCCATAG 
ATATTTCCTGACGCGTTCGCACTCGTGATCTGGACAGGACGTGTTGTATGAATATAAAGCAAGGAGTGACCACCGGATGGCGGAAATCGGAAGTC 
GAGTACAAGTTGGCCACTAGTTTGAAGTACACCAAAGTGTCAAGTGAAGGATACATTTTGTGATTTTTCCTGAACTCTTCGTACTTCTCCTTGAT 
CACCTCCAGTAATGTATTGGACATGCGCTCGGGATTCAGTTGCGTTAATTCATATAAGTAAGGCATGAGCTTTGAGAGCAACTGGAAATGTTCTC 
GTATGTCCTGCTCGCTGGCATCTTCAAACAAATCCTTCAGATATTGCAGCAAAAATGAGTACAACTTAACCACGTTCTCGCGGTTAACACCTTCC 
AGTTTGGGATGATTACATTTTATAATCCTTTCGATAATGATCGCCTTCTGGGCCGTAGCATGCTTGGAGAGCAGCTCCGTAAAGTCCTCGTAGGT 
CTTGGGCATCTTGATGGTAAACGGAATACTGGTGTCCAAGGAGGCGGGAATTGTATCCGCAGACTTTTTCGCTTTCTTTTTGCTAGCCTTCGGAG 
CTTCTTCTGGCTCTGACTCACTTTCAGATTCCTTGAGATCGCTCAGATTGTCGACCTCTGAATCACTCTCCTCATCCGAGTCCTCCTCTTCCTCT 
TCTTCCCCCTCCTTGTCATCGTCATCCTCATTTTCGTCTCCCTTCAACACGGCTTCCTTCTTTCCATTGAGATGTGTGCCCAGATTGCCGTCCAA 
ATCGTAAGCGAGCGTATCATCTCCCTCGTCATCTTCGCCGGCCAAGAAATAGCCATCATCCAGGTCATCTGCGGATCGGTGCTTGGGTTTGGCAA 
CGGAGGCCTCCTCATCCTCCTCGCCATCTGCTCGCATGCGACGAAGCCGTTCGTTCTCGAGCTTCTCCAATCGAGCGGCCTCCTGTTTGGCCAGC 
TCGTCGGGATTGATTAGTTTATCCGCGACACTGCCACGCGGCTCGAAGATCATCTCCTTGAGCAGTTTATCGTAGGCATCTGGCGGTGGCTTGGC 
ATCCTGCTCATCCTTCGTGACCTTGGCGACCAGCGGCAGCAGAAGCTTATAGTTGGCATCCAGTTTCTCTGTGAGATCATACACTTCGTCCTTCT 
CCTTGGCGATTTCGTTCTTTCTTCGTTTTTGTTCAACGATCATCTCATCGATGGCCGCCTGTCTATCCTGGGCAGAGTCTCCCTCTCCGCCAAAA 
TGAGCAGCAGCAGTAAAATCGGCTGAAAGAATATGAGAGTTAGTTGTGCGCTTCAAAGTACAATCATTTGGGTATACCATCTAATGCCTCGTCGT 
CTAGCTCCTCGTCGTCGGAGCGCTCGTCCCTGTACTGCTCGATCTCCTCCAGAGTTTGTCCCCTGTGCGTCAGTAGTTCGTCGTCGTTGAGGTTG 
AACTTCTCCGCTTTCTGGTTGGAGCGCACCTGGGACATCTTTTCGGCCAGGTAGCGGGCATTCATCACGGACTCGGTTAGCTGATCGCCACTCAA 
GTGTTTTCCAATTCGGTTATCCTTGAAGCGATTGGTCTTGTGCTTCACGGCAAACTGTTGACCCAATGTCTGCGCTCGTTTCTGCAGAGCCTTGG 
CCCGCGACACGCCCGGCATTCCCCTGTCATGCTTGCAGATCCTCCCCAGAATCTTGAACTTCTCCTTGTTCACGTGCACATCGAAGGGATTGCCC 
CGCTTGCTGGACTGTGCCGTCGAGTTGTCGAAGGGATTGGCCGACCTGGTTGTCTTTTTGGCGTAAACCGCATCTGCGCTGGCCTTCTTTCCTTT 
GGCCACCATTGTTTTATATCGCGTTCTACGGATAAATTAAACCGAGTTCATTTAGCAAAACAAAACACCTCACACGTGCAACCCAATAACGGACT 



FIGURE SHEET 351 



AGCGCGTCTACGAAATGCCGAAATCAGTGTTGGCTAATCCTGCGAATAGTCACGTACAATGGTAATGGGATCAGTTTCAATTTCAACTGTAACTA 
CAAATTAATCATAATTTACTGTATAACAATGTATTTTTTCCTTGTTAATGTAATTGTAAATCTACAAGGGCATTTAAATATTACACAATTAAAAT 
CTTTGTTTTGGTATCTACTTCGAAAAACTATTGTATATTACGAAACACCGGTACATACGCTGTATGATCTGAGTCATTTAACACAACAATTTTAA 
GGGTAGACCAAGAAAACGATGCTTCAATTTGAAAATTTTGTAATCGAAGCAATCAAGTTGTACATTTTTGAATGGGTGACTGAATTAGTAGTTAT 
ATTGTTATCACATTCTATTTATAATAGCTAAAATGTTAAATCGATAAATATTAGTTTTCGGGGAATTCCTTGGAACCTTATCCGCTGAAATGTAA 
CAGTTACCGCTGCTCATCACTAATAGCATGACTTTTCCAGCACTGAACCACGTCGCCATTTTCGTTTCGTTCCTTGCCACAAAAATTTTCACACG 
CAACGCCGGATACTGAAAAATTGGTTTTTAAAGCGTGTTAGGAGCTTTTATAACTTTGCGTGGCTACAGTCAGTCAGTTCTTGGTGTGAAGGTGC 
TGGAAAAAGCATGAGCGCCACGCCGCCGGACCTACTTTCGTTGGCGGCGTCCCGAGAAGTCTCGGCCGCCGTCGCAGTCGCCGCCTCTGCCAACG 
GAGGCAGCGGCAGAAGCGCTGATAAGGAGCGTTCACGTTCCAGAGATCGTGGGCGGGACAAGGATAAGGATCGGGACAAGGACAGGAAGAAGAGG 
TGCGTGCGTTGCAGGGAGAATGATTTTGTACCCTTGCCCTTTTAGCTTCCCCTTTTTACTTCCTTTTCCGGTGTACACATTTTTCAACAATGTGC 
T TT T AT AAAAG C AG AAC T T T T AT T AAC GC C AT AGC AT A 
(SEQ ID NO: 649) 

Exon: 3978.. 3403 
Exon: 3347.. 2806 
Exon: 2691.. 1397 
Exon: 1335.. 1001 

Start ATG: 3904 (Reverse strand: CAT) 
Transcript No. : CT18277 

GGTTGCACGTGTGAGGTGTTTTGTTTTGCTAAATGAACTCGGTTTAATTTATCCGTAGAACGCGATATAAAACAATGGTGGCCAAAGGAAAGAAG 
GCCAGCGCAGATGCGGTTTACGCCAAAAAGACAACCAGGTCGGCCAATCCCTTCGACAACTCGACGGCACAGTCCAGCAAGCGGGGCAATCCCTT 
CGATGTGCACGTGAACAAGGAGAAGTTCAAGATTCTGGGGAGGATCTGCAAGCATGACAGGGGAATGCCGGGCGTGTCGCGGGCCAAGGCTCTGC 
AGAAACGAGCGCAGACATTGGGTCAACAGTTTGCCGTGAAGCACAAGACCAATCGCTTCAAGGATAACCGAATTGGAAAACACTTGAGTGGCGAT 
CAGCTAACCGAGTCCGTGATGAATGCCCGCTACCTGGCCGAAAAGATGTCCCAGGTGCGCTCCAACCAGAAAGCGGAGAAGTTCAACCTCAACGA 
CGACGAACTACTGACGCACAGGGGACAAACTCTGGAGGAGATCGAGCAGTACAGGGACGAGCGCTCCGACGACGAGGAGCTAGACGACGAGGCAT 
T AG AT GCCGATTTTACTGCTGCTGCT C AT T T T GGC GG AG AG GG AGACT C T GC CC AGG AT AGAC AGGC GG CC AT CGAT GAG AT G AT CGT T GAAC AA 
AAACGAAGAAAGAACGAAATCGCCAAGGAGAAGGACGAAGTGTATGATCTCACAGAGAAACTGGATGCCAACTATAAGCTTCTGCTGCCGCTGGT 
CGCCAAGGTCACGAAGGATGAGCAGGATGCCAAGCCACCGCCAGATGCCTACGATAAACTGCTCAAGGAGATGATCTTCGAGCCGCGTGGCAGTG 
TCGCGGATAAACTAATCAATCCCGACGAGCTGGCCAAACAGGAGGCCGCTCGATTGGAGAAGCTCGAGAACGAACGGCTTCGTCGCATGCGAGCA 
GATGGCGAGGAGGATGAGGAGGCCTCCGTTGCCAAACCCAAGCACCGATCCGCAGATGACCTGGATGATGGCTATTTCTTGGCCGGCGAAGATGA 
CGAGGGAGATGATACGCTCGCTTACGATTTGGACGGCAATCTGGGCACACATCTCAATGGAAAGAAGGAAGCCGAATCTGAAAGTGAGTCAGAGC 
CAGAAGAAGCTCCGAAGGCTAGCAAAAAGAAAGCGAAAAAGTCTGCGGATACAATTCCCGCCTCCTTGGACACCAGTATTCCGTTTACCATCAAG 
ATGCCCAAGACCTACGAGGACTTTACGGAGCTGCTCTCCAAGCATGCTACGGCCCAGAAGGCGATCATTATCGAAAGGATTATAAAATGTAATCA 
TCCCAAACTGGAAGGTGTTAACCGCGAGAACGTGGTTAAGTTGTACTCATTTTTGCTGCAATATCTGAAGGATTTGTTTGAAGATGCCAGCGAGC 
AG GAC AT AC GAG AAC AT T T C C AGT T GCT C T C AAAGC T C AT GCC T T ACT T AT AT G AAT T AACGC AAC T G AAT C CC GAG CGC AT GT C C AAT AC AT T A 
CTGGAGGTGATCAAGGAGAAGTACGAAGAGTTCAGGAAAAATCACAAAATGTATCCTTCACTTGACACTTTGGTGTACTTCAAACTAGTGGCCAA 
CTTGTACTCGACTTCCGATTTCCGCCATCCGGTGGTCACTCCTTGCTTTATATTCATACAACACGTCCTGTCCAGATCACGAGTGCGAACGCGTC 
AGGAAATATCTATGGGTTTGTTCCTGGTCACCGTAGTCTTGGAGTTTGTGTCCCAATCCAAGAGACTGGTTCCAGCCGTATTCAACTTCCTGCAG 
GGCATTGTACACATGTCCATACCGAAGAGGGATGTGGAACAGTTGGAGATCACGCCGCCTTTTGAGCGTGATGGTCCATTGAGCAAGCTCCTAGC 
ACTTCCAGCTAACACGGAATCCACGAAGTTGGAGCCCCAACAGCTGCAGCCTACTGACTTGGTCACCCAGACGATCACTCCAGATTTCAAGGTTC 
GAGCACTGGACACTTCCCTCCTGCTAATCAAGGAAGCTCTCCAACTGGTGGAAGAGCATGTGGGTGCCTGCTACCTGGCCCAACCCTTTTTGGCA 
CTACTCTCCCGCCTTCCACTGGAGTCTTACCCCGAGCATGTGCATCAGCACCACAAGGATGCCACCGAACTGGCGGAGAAGCTAGCTGCGCAGAA 
AATGAAACCCTTGGCTCCGGCGGAAAAGAAACCCAAGGCTCTTAGGCTGCTGGAACCTCGATTCGAGGCGGTTTACGATGACAAACGTCGTCCCA 
AGATGTCCAAGGCCAAGGAAGAGCGGGCCAAACTGCTGCACAAGATCAAGCGCGAGAAGAAGGGTGCCATTCGAGAAATACGCAGAGATACGTCC 
TTCGTTCAGATGCTTAGGCTCAAACAGACCCTGCAAAGTGACAAGGAGCGCCACGAGAAGGTTAAGCGCATTTACCAGGAGGCATCCGTGCAACA 
GGGCGAGCTCAATGAGCTGTCCCGGGCCAAAAAGAAGAAGAAGTTCTAGGCTCGACTTTGAATTAGTTATTTTTAGCTAGACTATAAACAAAGCT 
GAGAGTAAAGAATATAGCAGTGTTGCTTATATTTTAGTATCACGAAATCGGCCTTCACAGTTTTTATAGTTATGCAAAGCTACTAACCCAGATTA 
AGCACAAGGGTGGGTCGATTAAAATGAAACATCTCATCATTGCTCTTTGAAAGCACTATAGCACCAACATACATTTCCAAATTAAAAA 
(SEQ ID NO: 650) 

Start ATG: 75 (Reverse strand: CAT) 

MVAKGKKASADAVYAKKTTRSANPFDHSTAQSSKRGNPFDVHVNKEKFKILGRICKHDRGMPGVSRAKALQKRAQTLGQQFAVKHKTNRFKDNRI 
GKHLSGDQLTESVMNARYLAEKMSQVRSNQKAEKFNLNDDELLTHRGQTLEEIEQYRDERSDDEELDDEALDADFTAAAHFGGEGDSAQDRQAAI 
DEMIVEQKRRKNEIAKEKDEVYDLTEKLDANYKLLLPLVAKVTKDEQDAKPPPDAYDKLLKEMIFEPRGSVADKLINPDELAKQEAARLEKLENE 
RLRRMRADGEEDEEASVAKPKHRSADDLDDGYFLAGEDDEGDDTLAYDLDGNLGTHLNGKKEAESESESEPEEAPKASKKKAKKSADTIPASLDT 
SIPFTIKMPKTYEDFTELLSKHATAQKAIIIERIIKCNHPKLEGVNRENVVKLYSFLLQYLKDLFEDASEQDIREHFQLLSKLMPYLYELTQLNP 
ERMSNTLLEVIKEKYEEFRKNHKMYPSLDTLVYFKLVANLYSTSDFRHPVVTPCFIFIQHVLSRSRVRTRQEISMGLFLVTVVLEFVSQSKRLVP 
AVFNFLQGIVHMSIPKRDVEQLEITPPFERDGPLSKLLALPANTESTKLEPQQLQPTDLVTQTITPDFKVRALDTSLLLIKEALQLVEEHVGACY 
LAQPFLALLSRLPLESYPEHVHQHHKDATELAEKLAAQKMKPLAPAEKKPKALRLLEPRFEAVYDDKRRPKMSKAKEERAKLLHKIKREKKGAIR 
EIRRDTSFVQMLRLKQTLQSDKERHEKVKRIYQEASVQQGELNELSRAKKKKKF* 
(SEQ ID NO: 651) 

Classification: hypothetical 



Celera Sequence No. : 142000013384574 

GTTAACAATGCGGCTATTGGCATTGTCACGAGTTTCTTCCTCAAGTACATGAACTCCATTCTAAAGACCTTCGCGAGCGCTCTGGAGTTGCTCTT 
CACGGCAGTCCTATGCTACTTCCTCTTCTCCATACCCATTTACATGAACACCGCGCTGGCCATCGCGGTTGTATCCTACGCCATCTATCTCTACA 
CCCAAAGCCCGGTGGTTAATTTGGGCAAGGTGCGTCCGCTGTCCAATTTAAGTGATGCCACGACCAAATCAACGGATAAACGAAAGCTGATCGAC 
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GAGGAAGCGGCAGAATCGGACCTCGATATGGTGTAATGCTGTAGCTTGTCCTAGACGTAATCTTAGACTGTAAATAGTTGTGGAAAAGTGCCTGC 
AAATGTTTTAAACGCCAGACGAATGTATTTGACCCCAGGCTTTGTGCAATGCAATAGCAACAGGCTTGTAAATACCAAAACTTTTGCTCCTCTAG 
TTTAAGATTCTTATTTTAATCAGAACATAAATGCAATCAAACGAATATTGTGCATGCTTCTTCAAGAAGCTTTTTATTTTATGGTTCATATACTA 
TCACATATGTACATGGTATATTTGGATGCCTGCTTGTCTTAGCCTAAAATGGTGTGTCCACATCATGTCCATCTTCCTGTTTGGGCGGTATGCGA 
CTACGAAGCGCTATCTGCACCGCCTGGTTTTCGATGCTGCGGAGATAGTAGTCACACTCCTTCAGATGCTTCTCCATGTCCTGGACTCGGTGTGT 
GGCGTTGTACTTGCAGGTCTGCAGCGGCGGTCCACAGTAGTTCTTCTCGCACTTAATTAAGTGCTTGGGCATCCGGAAGAGCAAGATGCGGTGCG 
ATTTGTCGTAAGGGCAAATTCCGTACTCGAAATCGGACATTTTGAAAAAGTCAAGAGCCGTTTGTCAAAACAACAAATTCATAAAGCTTAAGGTA 
CCGCTATGTTTTAAAGTTTTAGTTATCGGTTGTGTTATCGGAAGTAAGGAATTTTCGATTTATTTATTTTTCAAACCAGCTGTCAAGATATTGAC 
CTTATACGGAATACCCCCGATGAATATAGGGTCACACTGAGCAGACGCGCGTAAACCACCCCGTCCCTCCCTTTTACTTTTCATTCAGCTTGAAA 
ATTTCATAGAGTTCGGTCAATTGTCAAATCTGCTCAAAACCACTTAATTACCAGGTAAACGATGGCACGTACCAAGCAAACAGCCCGTAAATCGA 
CCGGAGGCAAGGCGCCCCGCAAGCAGCTGGCCACCAAGGCGGCCCGTAAATCGGCGCCATCCACCGGCGGAGTGAAGAAGCCACATCGCTACCGT 
CCTGGAACGGTGGCCCTGCGTGAGATTCGTCGCTACCAGAAGTCCACGGAGCTGCTCATCCGCAAGCTGCCGTTCCAGCGTCTGGTGCGCGAGAT 
AGCCCAGGACTTCAAGACCGATCTGCGCTTCCAGTCGGCGGCCATTGGAGCCCTACAGGTATGGTGTACAGCACTCCCTGCACCGTGACTATTTC 
ATACGGTCATGTAATAATGCGTCTCTCCCCCATTTCAAATGCGCAATTTGCAGGAGGCCAGCGAGGCGTACCTGGTCGGTCTGTTCGAGGACACC 
AATCTGTGCGCCATTCACGCCAAGCGCGTCACCATTATGCCCAAGGACATCCAGCTGGCCAGACGCATCCGTGGCGAGCGGGCCTAAATGCCCAT 
GACCGCCATCTTGGATTGGAAAAGCAGTGGCGGCAAAACTACCAACCCACAGCTGTCGCTTGGAGGGGGGAACTCGGAGCTAGAGGTGGAGGTGG 
AGAAGCGGAAAACCAAACCCCCGGCCGCAGCTGCCGCCGCTTAGTACTTAGGTCTATCTATACTTACCGATAAGTAATCAATACACAATGTTCTA 
ATAAAATCGAATTCATTTTTGTTTGTAAATGTTTAACCACAAAGGAAAGCGTTCATTCATATATTTTGTTCAGACTGTACATTATTAGTTGTGTC 
TCTTAAATTGTAATTGTTGTCTTGCTGTCTCGGCAACAATTGAACATTAATTTGAATGGTTTCGTACAAAAATACATATATTCTATTGTATCATT 
T T G T AAAAT GAG GT AGAC CAT GT AG T AAAT GC AT T CAT T T T T CGT T AT T CT AGT AC AT AT GT AT AT C T AGC AT GT AC TGG AAT G AAAAGT AT CT T 
TAAACTATGTAGATTTCTGTGTGGTGGTGAGCTGCTCCCCTGGAGCCAAGCAGTTGAAGGTCCGGACAAAAACAGGCAGCCTTGTCAGGGAATTT 
ATGGGTGAACCGCCCTCGACGAAATCGCTGCCGAAATACAGGGTTTGGCCGAAGGAACTCTGGCCAAATCTGAAGAAGAACAATTCCCTCAGAGA 
CATGGTGGTGAAGTTGGGCTGCCTCTGGCTGAACTTGGAACCTTCTGCGAAGTAGGCTTCGTGGACCAAATGGAGCACGACGAGGTACTGGTCGC 
GTTGGCTAAGCAAATTATCTTGGGTCCATGCGCTTACACACTCATCGCCGGCGGTAGTAATTTGGTTATCGTCGAACAATTGGAGCATATTGACC 
TTATGATGTTTGCAATTAAAAAGATCGAAACTTTTTAACATCTCAAGGCCAAAGCGAATGCCCAAGGGTGTCACTTTGAAGATATCGTGGCTGTC 
GGGATCAAATGAAGCATCTGCCAGAAAGTCGTATGCCACATAAATCGTATTTCCATTTTCCTCATACGCAGTTTCGAAGAAGAAATTCGAAATGG 
GATATGAATGGAGCATGTATCGCCGGTCTTTGAAGTTCGGTTGGTCCAGTAAGCTTAGTTCGGCACGCGTTCGGTGCTCCAGAACCTTTAGGTGA 
GCAGTCGCAAGTTCCTCGTCGTCCGCAAATTCAAGCTCCTTGTAGTATCGGGTTACGAAACGTCGATGGCCCGCCTTTCCCGGCAGATTGCCGAT 

GCT 

(SEQ ID NO: 652) 

Exon: 1001.. 1483 
Exon: 1574.. 1948 
Start ATG: 1202 

Transcript No. : CT18281 

ATTTTCGATTTATTTATTTTTCAAACCAGCTGTCAAGATATTGACCTTATACGGAATACCCCCGATGAATATAGGGTCACACTGAGCAGACGCGC 
GTAAACCACCCCGTCCCTCCCTTTTACTTTTCATTCAGCTTGAAAATTTCATAGAGTTCGGTCAATTGTCAAATCTGCTCAAAACCACTTAATTA 
CCAGGTAAACGATGGCACGTACCAAGCAAACAGCCCGTAAATCGACCGGAGGCAAGGCGCCCCGCAAGCAGCTGGCCACCAAGGCGGCCCGTAAA 
TCGGCGCCATCCACCGGCGGAGTGAAGAAGCCACATCGCTACCGTCCTGGAACGGTGGCCCTGCGTGAGATTCGTCGCTACCAGAAGTCCACGGA 
GCTGCTCATCCGCAAGCTGCCGTTCCAGCGTCTGGTGCGCGAGATAGCCCAGGACTTCAAGACCGATCTGCGCTTCCAGTCGGCGGCCATTGGAG 
CCCTACAGGAGGCCAGCGAGGCGTACCTGGTCGGTCTGTTCGAGGACACCAATCTGTGCGCCATTCACGCCAAGCGCGTCACCATTATGCCCAAG 
GACATCCAGCTGGCCAGACGCATCCGTGGCGAGCGGGCCTAAATGCCCATGACCGCCATCTTGGATTGGAAAAGCAGTGGCGGCAAAACTACCAA 
CCCACAGCTGTCGCTTGGAGGGGGGAACTCGGAGCTAGAGGTGGAGGTGGAGAAGCGGAAAACCAAACCCCCGGCCGCAGCTGCCGCCGCTTAGT 
ACTTAGGTCTATCTATACTTACCGATAAGTAATCAATACACAATGTTCTAATAAAATCGAATTCATTTTTGTTTGTAAATGTTTAACCACAAAGG 
AAA 

(SEQ ID NO: 653} 
Start ATG: 202 

MART KQT ARK S T GGK A PRKQL AT K AARK S AP S T GG VK K P HRY RP GT V AL RE I RRY QK S T E L L I RKL P FQRL VRE I AQD F KT D L RFQS AA I G AL QE 
ASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA* 
(SEQ ID NO: 654) 

Name: Histone H3 . 3A 
Classification: DNA_binding 
Gene Symbol: His3.3A 
FlyBase ID: FBgn0014857 



Celera Sequence No. : 142000013385213 

ACAATGAGCAGCGGGACCACGATGATGGCGAAGATGACATAGGTGATAATGAGCAGCGGGCGTATCCAGATGCGCCACTCCTCGAAGAAGCGTCT 
TACGTCCAAGCTGCACATCTTTGCCTCTGGTGCACTCCTTGGGAATGCTCCTTGAAATGTGCTGTTCACCGGAGTACCTAATATCCGGGTTGATA 
TCGTTGATTCGTTGACGTCGACGCCGCCGCCCCGATTCTGTTTTCACTTGGGTTTCCGCTTTCTTCCGCTCCGATCCGCTCTTCTCCGCCAGAAA 
TTCTCTCACAGCAGATTACAGGACACGGCGGTGGCACCCACGCTGGAGGCGCATTTTCAGAGATCTCCTGCTGGCAGAGCAAGTTCTTCGATCTA 
TTGGTTTCAGTCCTATTGGCAGTAATATCGCGGCTTTCGTGCCCTTCTTTTGTGAGGAATCCAGTTCGTGGGTCGGGTGCCTACGTCTAATTGGT 
GTGACCAGCCAATGCGAAATCCGAAATGACTAAAATACAGACTTCCTCCCGGTTTTATATTATTTAAAATTTTTACTATTTGTTTTATTTTAAAG 
TTTAAAAGCTATCAGTAAAAAAATTAACTTAATTAAAAAACAAATGTCGAAGATTATTTAGATTTTGTTTGTCGCAGACGAAAAATATGAAAATT 
TTCCGATGATTTAAAAAAGGGAAATTTCGAGGGCTGCAGAGTAAAAAAATAAAACATATTTTGAAAATTAAAGAAATATGTCAAATATATAAAGT 
AGCTAACTCCAAACCGACTACTAATTGTTTCGTAGTTCAGAAATTTTTTTAATACTGAATTAATTGTAAAATATCTAATTTTAGTTAGTTCAATA 
AAACAATTCACGAAAGTTAGATGCATTTTATTTATTTATAAGATTAAAATATCAATTATGTACAACTAAGTTAAGTATTTAATTAGTTCTCTTGA 
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AATGGATATAAGGGGATTTACAAGTGAGCTTGATAAGCGTGATATTTTTTCGTTATGGATTTATTAAACAAATATTCTTAGATTATCTGTTTGAT 
TTGAATATATGATGTCAAAATGTTCCCTTAACATTTATACGTCGATAACTTAAGATGTTCCGTGTACGTCCCAAGGCTGTCTTTTGGTTAAAAGT 
ACTGCGGAATCGCTATTACTATTTGCCTCGCCCCTGGTAGTCATTTTCATATACACAAGATCGGATTACGTTAAAAGTAATTAAAAAGTATTTGC 
ATTATAAGAAACAGTAGTCGCCAGTGCAGACTAATTGTTGTGATGATGAAGAAAGAGGATGGCGTCGATGTGACTACCGGTGAGCATGAATCGCT 
TGTTGAAGTCGTATATTTGGTCTCTCAGCGCCTTCTTGAAGCTGAGCGTCACCACGCCGGGCACACTATGGTACAGGCTGGCCAAATCGCGGAGC 
GTTCCATAAATGTGTCCCATCTTGAGCCACCACTCGATGGGCGACTCATCCAGCTGAACGACCACCTCGTTGCGATATCGCTTAAGATCGGACTC 
CACGCTGCTGTCCGCATCTCGCTTCGGAGGATTTGGTATCTCGTTACTGTCAAACAGCAGTTTCAGACCTACGGGCATTTTCGTTTAGTCCATGG 
TTTATGATAATCGATTTAGTTCGATGCACTTACCCAACTTCTTTGGTGCACTGCCGTCCACTACTCTTGGTTCTGACTTAATGGCCACCACGCTG 
GAGGTAGCTGCCACTTCGTTGGATTCCCCTTCGCCCACTCCACCAAAGTTAATGTTATACTTATGGGTCAGCATACGAACGGTTCGCTCTAGGTT 
GTCAATGGTGGTCAGGCGGTGGAAGCGTGGGTCCAACAATGCAGCAGCGGTCAGATTGTCATTGGATATAACGTGCGCCTGGAGGACACTGGAAA 
TGGTCCGCTTCAAATTCATCATTAGGGGATCGTCCTGCTCGGCGATGCCCAACTTCTTGATAAGAATCTTTGAAGTGATTGGCGAAAGCATGCTA 

C AC AAGG G AAT G T C T T C G CC AC GC AG C GT GT C C AAAGC AAT C T AAAAG AT T T C AC AC AAT AAG AAAAC T GC TT AAAC GT C T C AT T T AGCT T GAT G 
GCTTACCACCAGAGCACTCAGGGCCTCCGTCACCGTCTTGGCTGATATAACCAGGTGATCCATGTCCTCGGACCAAGATATGGACTCGGCAAAGA 
ACTTCAATGTTTCGTACAGTGTCCAGGGGAAGTGCTCGTTGTACGTGGGCATGGGCACCTCGGACACCTTGGAGGCAATCTCCGTTGAATGACGC 
TGCATTAGGTCCTTCACCTGCTCCAATAAAGTGGCCACCTCTTCGATCTCAAACACACGACGCAAACATTTGTCAATTACCGAAACGTAGCACAG 
CGAAATGGGTATATCTGCAATAAAAGATTCATATTCATTAGGAAGTCCAATATTTGATATATATTCTATATAAGGTAAATAATTCTGACTTGCTA 
ATGCACTCACTTTTCTCTCGCAGATAGGTGTGCAAAAGGTCCTCTTCCTCGTCGTAGTTTATGATGGCCAGTGTGCAATTGGCCAAATAGAAGTC 
GGTCAGCAGATCGTCAAAGACGATGTAGTCGTTGTACTCCACGGTGGCATACAGCATACCGCGAACTGAGTGCGTCTCCTCGTCCAGAAAGTGAT 
GGAAGATGCTAAGGAAGCGCCGCTGTTCAACATTCACCCACATCTCAAAGGCGAGAGAGAAGGGCTTCTCGGAGGTGAAATCGCGCGTGATTTCG 
CCAATCTCCAGAAACTTGGAGGCGTGCATCTCATTGATCAAAGAGTCGATCTTGTGTGGCTCGGGCATGGCTGCCGAATTGCCTAGGACTTGACG 
CAGGAATTCACCAAAACCGGCATCGTACAGAGAATCCACATTGCGCAAATCCTTAATCACAATGTCCGTAAGGGCTTCGGCCAGATTGGCGATGT 
TCGCCACGGCGGCATAGCCGCCTTCCTCGTGCTTGATGACACGCCCGTGCATCAGGGCACTCGATGGAGTGGCCACTTTGCGTTTGCGTCCGTAC 
TTTGGTTCCATAACTAGGCGCTCCTCCTCCTCTACCAGATTTTCGGCCTCGTCGTAGGTGAAGGGCACCATGGTCTCGTACAGCATGGTTTGCGA 
GGAGGCGTCCTCCTCCGCCTGCATGGCCGCAGTTGCCACGGCTGCTTCGTCCAGTTCATCATCGGAAGCGTTATCGTGGTTCCGGTTATTAATGA 
ACTCCAGTTTGACCCGCCGCGAGGAAACATTCCGCTTGGACAGCGGTGGATGGGATACATTACGCGGCGTCTTGCGCGGCGGCACATGGATGTCG 
TGCTCCTGGCACAGCTCCTTGAACAGATCCTTGTGTCGGTGCTGGAGATGCGCACGCAAATTGGTGGTGTTGCCATGGTTGGTCAGCACCTTGTG 
ACACTTAATGCAGACCACATTCTGTTTGGTAATCACCTCATTATTGTCGTTCGAGGGGAATCCAAAGTAGCGCCAGTATACGCTTTTCATCTTGG 
GCGAGTACAGTTTGGAAAAATTTAGCTGCGACACATCCTCGTACTTGAGTTCGCCAATGGCCACGGGCGACGCTGGTACCCCTTCGCTCATCTTG 
TCTTCTGTGGGGGATTGGGATCTTCTTGTTTGTTTTGAATTAGATTCGACAACGATTGCTGTGCCCAATGCGTGTATTAGCGCGCAATTTCTCGA 
T T AT AG AT AG AC GGT T C AAT AGT C AAT AGT T T T T T AT C TGT AT C AG AAG T AC AGT T C AG AGT T GC C AC GC G AC AC GCCGT C AAT CG AT AAGGC AA 
TAAAACGTTGCTAAATGCTTCAAATGCACTCTCATTTATTTGAGATTATTTTTACAGACTGCTCAAATATGAGAGTCACCGCTTTTTAATTTCAA 
TACGTCTGACTGTCGTAAAAAATGCGATTTGTTGAGTATTCTCGCTAACACTGATTGCATCACCAATCGATAGCTGGGGAAATTTCTCGCTCGGC 
CGGGGGAATTTCTCTAGGGAAATTTTCGCTTGGCCTACGACGTGAAGTTGGTTGCGATTTCCTTTTTAATTCGTTGTTGGGAAGAAAAGGTGTGT 
AAATTTCGTGAGCGTCACGAAATTCAATGAAAATAAAAGATAAACGAAATGCAAATCGAGCCATCGAGTGTGCCAAGTGTGGAGGACACAAGTGG 
GCATCATTGACACCCGTAGGCATGAAGAAGTGCTCGGAATTACGGCTTAAACAAATCGAAATTCGCCAGCCACGTGGGCCTGCAGCTTGGTTATC 
CCATATATACGAGTTTACAAAGAGACATTGATGTTATTATTGTGTTTTTGCCCTCGTTGCAGCCAATCTAAACCGCCAAGATGGGTAAGGGTAAC 
AATATGATTCCGAATCAGCACTACCACAAGTGGTGGCAGCGGCATGTGAAGACCTGGTTCAACCAGCCGGCCCGCAAGGTCCGCAGGCATGCGAA 
CCGCGTCAAGAAGGCTAAGGCCGTCTTCCCCCGCCCAGCCAGCGGTGCCCTGCGCCCTGTGGTTCGCTGCCCCACCATCCGCTACCACACCAAGC 
TGCGTGCCGGCCGTGGTTTCACCCTGGAGGAGCTGAAGGTAAGTGAAGCCGAAAACATTATCGGAATCCATTGGCCACTAACAAAATCTCTGTTT 
ACAGGGTGCCGGCATTGGCGCCAACTTCGCCAAGACCATCGGCATTGCCGTCGACAGGAGGCGCAAGAACAAATCCCTGGAGTCCCGCCAGCGTA 
ACATCCAGCGCCTCAAGGAGTACCGCAGCAAGTTGATCCTGTTCCCCATCAACGAGAAGAAGATCCGCGCCGGCGAGTCCTCTCTGGAGGAGTGC 
AAGCTGGCTACCCAGCTTAAGGGACCCGTCCTGCCCATCAAAAATGAGCAGCCCGCCGTGGTCGAGTTCCGTGAGGTGACCAAGGATGAGAAGAA 
GTTCAAGGCCTTCGCCACGCTGCGCAAG 
<SEQ ID NO: 655) 

Exon: 3968.. 2481 
Exon: 2389.. 2097 
Exon: 2036.. 1649 
Exon: 1588 .. 1001 

Start ATG: 3606 (Reverse strand: CAT) 
Transcript No. : CT18313 

CTATCGATTGGTGATGCAATCAGTGTTAGCGAGAATACTCAACAAATCGCATTTTTTACGACAGTCAGACGTATTGAAATTAAAAAGCGGTGACT 
CTCATATTTGAGCAGTCTGTAAAAATAATCTCAAATAAATGAGAGTGCATTTGAAGCATTTAGCAACGTTTTATTGCCTTATCGATTGACGGCGT 
GTCGCGTGGCAACTCTGAACTGTACTTCTGATACAGATAAAAAACTATTGACTATTGAACCGTCTATCTATAATCGAGAAATTGCGCGCTAATAC 
ACGC ATTGGGCACAGC AAT CGTTGTCGAATCTAATTC AAAAC AAAC AAG AAGATCCCAATCCCCC AC AGAAGACAAGATGAGCGAAGGGGTACCA 
GCGTCGCCCGTGGCCATTGGCGAACTCAAGTACGAGGATGTGTCGCAGCTAAATTTTTCCAAACTGTACTCGCCCAAGATGAAAAGCGTATACTG 
GCGCTACTTTGGATTCCCCTCGAACGACAATAATGAGGTGATTACCAAACAGAATGTGGTCTGCATTAAGTGTCACAAGGTGCTGACCAACCATG 
GCAACACCACCAATTTGCGTGCGCATCTCCAGCACCGACACAAGGATCTGTTCAAGGAGCTGTGCCAGGAGCACGACATCCATGTGCCGCCGCGC 
AAGACGCCGCGTAATGTATCCCATCCACCGCTGTCCAAGCGGAATGTTTCCTCGCGGCGGGTCAAACTGGAGTTCATTAATAACCGGAACCACGA 
TAACGCTTCCGATGATGAACTGGACGAAGCAGCCGTGGCAACTGCGGCCATGCAGGCGGAGGAGGACGCCTCCTCGCAAACCATGCTGTACGAGA 
CCATGGTGCCCTTCACCTACGACGAGGCCGAAAATCTGGTAGAGGAGGAGGAGCGCCTAGTTATGGAACCAAAGTACGGACGCAAACGCAAAGTG 
GCCACTCCATCGAGTGCCCTGATGCACGGGCGTGTCATCAAGCACGAGGAAGGCGGCTATGCCGCCGTGGCGAACATCGCCAATCTGGCCGAAGC 
CCTTACGGACATTGTGATTAAGGATTTGCGCAATGTGGATTCTCTGTACGATGCCGGTTTTGGTGAATTCCTGCGTCAAGTCCTAGGCAATTCGG 
CAGCCATGCCCGAGCCACACAAGATCGACTCTTTGATCAATGAGATGCACGCCTCCAAGTTTCTGGAGATTGGCGAAATCACGCGCGATTTCACC 
TCCGAGAAGCCCTTCTCTCTCGCCTTTGAGATGTGGGTGAATGTTGAACAGCGGCGCTTCCTTAGCATCTTCCATCACTTTCTGGACGAGGAGAC 
GCACTCAGTTCGCGGTATGCTGTATGCCACCGTGGAGTACAACGACTACATCGTCTTTGACGATCTGCTGACCGACTTCTATTTGGCCAATTGCA 
CACTGGCCATCATAAACTACGACGAGGAAGAGGACCTTTTGCACACCTATCTGCGAGAGAAAAATATACCCATTTCGCTGTGCTACGTTTCGGTA 
ATTGACAAATGTTTGCGTCGTGTGTTTGAGATCGAAGAGGTGGCCACTTTATTGGAGCAGGTGAAGGACCTAATGCAGCGTCATTCAACGGAGAT 
TGCCTCCAAGGTGTCCGAGGTGCCCATGCCCACGTACAACGAGCACTTCCCCTGGACACTGTACGAAACATTGAAGTTCTTTGCCGAGTCCATAT 
CTTGGTCCGAGGACATGGATCACCTGGTTATATCAGCCAAGACGGTGACGGAGGCCCTGAGTGCTCTGGTGATTGCTTTGGACACGCTGCGTGGC 
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GAAGACATTCCCTTGTGTAGCATGCTTTCGCCAATCACTTCAAAGATTCTTATCAAGAAGTTGGGCATCGCCGAGCAGGACGATCCCCTAATGAT 
GAATTTGAAGCGGACCATTTCCAGTGTCCTCCAGGCGCACGTTATATCCAATGACAATCTGACCGCTGCTGCATTGTTGGACCCACGCTTCCACC 
GCCTGACCACCATTGACAACCTAGAGCGAACCGTTCGTATGCTGACCCATAAGTATAACATTAACTTTGGTGGAGTGGGCGAAGGGGAATCCAAC 
GAAGTGGCAGCTACCTCCAGCGTGGTGGCCATTAAGTCAGAACCAAGAGTAGTGGACGGCAGTGCACCAAAGAAGTTGGGTCTGAAACTGCTGTT 
TGACAGTAACGAGATACCAAATCCTCCGAAGCGAGATGCGGACAGCAGCGTGGAGTCCGATCTTAAGCGATATCGCAACGAGGTGGTCGTTCAGC 
TGGATGAGTCGCCCATCGAGTGGTGGCTCAAGATGGGACACATTTATGGAACGCTCCGCGATTTGGCCAGCCTGTACCATAGTGTGCCCGGCGTG 
GTGACGCTCAGCTTCAAGAAGGCGCTGAGAGACCAAATATACGACTTCAACAAGCGATTCATGCTCACCGGTAGTCACATCGACGCCATCCTCTT 
TCTTCATCATCACAACAATTAGTCTGCACTGGCGACTACTGTTTCTTATAATGCAAATACTTTTTAATTACTTTTAACGTAATCCGATCTTGTGT 
ATATGAAAATGACTACCAGGGGCGAGGCAAATAGTAATAGCGATTCCGCAGTACTTTTAACCAAAAGACAGCCTTGGGACGTACACGGAACATCT 
TAAGTTATCGACGTATAAATGTTAAGGGAACATTTTGACATCATATATTCAAATCAAACAGATAATCTAAGAATATTTGTTTAATAAATCCATAA 

CG 

(SEQ ID NO: 656) 

Start ATG: 3 63 (Reverse strand: CAT) 

MSEGVPASPVAIGELKYEDVSQLNFSKLYSPKMKSVYWRYFGFPSNDNNEVITKQNVVCIKCHKVLTNHGNTTNLRAHLQHRHKDLFKELCQEHD 
IHVPPRKTPRNVSHPPLSKRNVSSRRVKLEFINNRNHDNASDDELDEAAVATAAMQAEEDASSQTMLYETMVPFTYDEAENLVEEEERLVMEPKY 
GRKRKVATPSSALMHGRVIKHEEGGYAAVANIANLAEALTDIVIKDLRNVDSLYDAGFGEFLRQVLGNSAAMPEPHKIDSLINEMHASKFLEIGE 
ITRDFTSEKPFSLAFEMWVNVEQRRFLSIFHHFLDEETHSVRGMLYATVEYNDYIVFDDLLTDFYLANCTLAIINYDEEEDLLHTYLREKNIPIS 
LCYVSVIDKCLRRVFEIEEVATLLEQVKDLMQRHSTEIASKVSEVPMPTYNEHFPWTLYETLKFFAESISWSEDMDHLVISAKTVTEALSALVIA 
LDTLRGEDIPLCSMLSPITSKILIKKLGIAEQDDPL!)MNLKRTISSVLQAHVISNDNLTAAALLDPRFHRLTTIDNLERTVRMLTHKYNINFGGV 
GEGESNEVAATSSVVAIKSEPRVVDGSAPKKLGLKLLFDSNEIPNPPKRDADSSVESDLKRYRNEVVVQLDESPIEWWLKMGHIYGTLRDLASLY 
HSVPGVVTLSFKKALRDQIYDFNKRFMLTGSHIDAILFLHHHNN* 
( SEQ ID NO: 657) 

Classification: known_flybase_gene 
Gene Symbol: Dref 
FlyBase ID: FBgn0015664 



Celera Sequence No. : 142000013384668 

GCGGGAGATGATGCCGGTATCGAAAAGTTTAATCGCCGATTGGTCCGGGTAACGAAGGAGCACGCCAAAGAGGCCAAGGAACTGCTCACACTAAT 

GGGTGTGCCCTATGTTGATGCACCGTGCGAAGCGGAGGCCCAGTGTGCCGCTCTGGTGAAAGCTGGAAAGGTTTATGCCACCGCCACGGAGGATA 

T GG AT GC C C T C ACAT T C GG AT C T ACAAAAC T GT T G AG AT ACC T T AC CT AC AGCGAGGC ACG AAAG ATGC CC GT C AAGGAGT T C AGC T AC G AC AAG 

CTGTTGGAAGGTCTGGCCATTAACAATCGAGAGTTCATTGATCTATGTATTCTGCTGGGTTGCGATTACTGTGAGAGCATCAAGGGTATTGGACC 

CAAGCGAGCGATCGAACTGATCAACACCTATCGGGATATAGAGACTATTCTGGATAACCTGGACTCTAGCAAATACACCGTGCCCGAGAACTGGA 

ACTACAAGGTGGCGCGGGAACTCTTCATCGAACCGGAGGTAGCTGATGCCGACTCCATAGATCTCAAATGGGTCGAGCCGGATGAGGAGGGCCTT 

GTCAAGTTTCTCTGCGGCGACCGGCAGTTCAACGAAGAGCGCGTTCGCAACGGTGCCAAAAAGCTGATGAAATCCAAGCAGGCCCAGACTCAGGT 

GAGACTCGATAGCTTCTTTAAGACACTGCCCAGCACTCCGAATGCCACAAACGCTGCAAAACGGAAGGCCGAGGAAGCAAAGAAGAGCGCCAATA 

ACAAAAAGGCCAAGACCAGCGGCGGTGGACGAGGCCGGCGACCCAAGTAGTTCATTCGTGTGTTATCTGGAAAAACGTACAATTCTTTAATCTTA 

ACTTAACCGTAAAGCCTATGCATACAATATAAACTACATCTTTGAGCTTAATGCTAGTTGGTCTGCTTGTGGGTGCGGGTGAGGTTGGGATGTTG 

GATTATGGATATTGGATTTTGAATTAGAGTTTGGACTGTGGGGGCTGCGACTACAGCAAATACTGCTTGGCCAGGGCACGCAGAAATCTCGATTT 

CCGCACCATGGCAGATTCTCGTTTGATCTCCTCATTGTTGTTTGCATTCAGCTCCAGAGTGCCCATCATAGAAATCTGTTTTTTAAAAATGTTAT 

AAAG AT AAAT GAT T AC C C G C AAAT G AGC AC AG AGT AAT AG AT TCCTACCGCTT C AAG T T T C C G AC TGTGTTCGCCTCGCT T AGCGT GC AG AT T G A 

CGATGAACTCCAGCAGAGTTGTATCCCAGATGCAGCTGTAATATGCATCCATGGCATCGGTGAAATTCGACGATTTCTCGCTGAGATTCTTGAAC 

ACAATGCCGTAGTCGATCTCATCAAGGAACTGACACAGAACGGTGGCCTGCATGTGACAGCCCAGATTGGCGGCACATCGAATCATCTTGCGGAT 

CACGTAGTCGTCGGCCACATTTCTCTGCAGCGGCATGGTGCAGTAGTCGGTGCCAGTAACCAGCGCATTTACATAGCAACGCATTGCGGACTCAT 

TATTGCCGATGGCTAGATCTAGATCGCCTTTCATCTTCAGCCATGAAATGGTTTGCGGATAGTAGGACAAAGCCTCGGAAAGAAGCCATTGCAGG 

GTCTCGCAAACGCTCCTCAAACTGTAGCTGACAGGGCTGAAATGAGGATTAAAAGTTATTATTTAAAAAAAAACTATATTACTGTCTGTCTACTC 

ACTTGTTAATGCTGGTAGGCCACAGCTGCATGTACTCGCAGCTTAAATCATGATTGGGGTCATCCTTAAGTATGTTGTGGACTTTGCCTATCATG 

GAGATGGCGAGTCCGAAAACGCTTTGGTGGCGGATTCGCTGTAGGAAGGCCTGCAGGGAACTGATACCGCCAACTCCTGAGGAGCCACGCTTCGG 

AACGTTAAGGAACATGCTTAGCATCAGTTCCCAGGCATCGCGACATACCTTCTTGGGGCCCTTCATCTTCTCCATCTCGATGAAGGTGGCGGCGA 

AGGCCAGGGGTAGTTCCAGGGGACCCGCCCGTTTATCCAGGTAAAGGAGAGGTGGAAATTCTGTGAGGTTAAGTAGCATGATAGCGCAACTCTCT 

AGGATATCCGGACGTGGGACGATTGAATCTCCTGCCTGCAGGGCTCCCAAACATTGCTTGCATCTTCCTCCCAAGGATTGCAGATCGAGTGGCTT 

CTGGTGCCACTCCTCCAGGCACTGTGTTATCTGAATGTGCAGGATTTCCCAGGTGATCAGCTTACACATCAGCTGCACATTGGCCGCCATCTCCT 

GGCGCTGATTTTCCGACTTCAGTACACTCAGCATTGAAACGGCATCGATGTAGTTGCCCCTTGTGGCCAATTCTCGCGCCTTGCCGAGCAGAACA 

TAGCTGAAGTCTTGCAAAAAGCTAACCGGCATGGCCATAATAAGAGTTTTCAGGGGCTGTGGTACGTCCCAACTGGGATTAATGGCCCACAGAGG 

TTTTCCAGGAGCAGTTGCGCATAATTTCACTAGCAATATCCTCACTGTGTTGGCATTGGTGCAGCTTATCAGTTGTCGCTCCAGAGCGCCCACGT 

CAACCCTGCTCACTAGAAGGGGTAAGTTAAAATATAATAAATGAATTTTGTTACAGGAATGGAGATGCATCTCTTACTCTTGGAGTGACACATCC 

AGTCGGAAAGCGAGGCCAACGAGTTGGTGGGCACCGGCGGCGTCGCCTGCAGCATCTGTTCGTCAATATCCTGGAGTTCATCTGACGAGAACAGT 

CCAAATCCTCGCACCGTTTGTAGCAACTGTCGTGACTGCCCACCCTGATGAAGCAAACAGTCGATGAGGAAGTGCTTGAGTGCCGCCTGTTCTTT 

GAAAGTACAGTGGGGAAGCATTTCCTGCAAAGCCTCCACCAGCGGTGGCAGGCAGTCCAGTTCCCGGTACTTCTCAAAGTACTCAACGCTGCCAA 

AGATGCTGCCCCACTCAAAGATGTTTCGCACCACATTCAATGCAGCCACCTGCAATTGCAGCTGAACTGTCTCCTTGAGTATGCCCGTGGAGATG 

GAACCCTCAATGTCCAGTTCCACTTGCCGCCGGCTTATAAAGGGGATTTCTCGCCTTCTGTTGTCCATTCGCAGAATTGATACAATGTCTGTGTA 

GTTGTTAAGTAGCGATTGCTGGAACTTCTCCAGCAGTGACTGCTCCTCGGCGCTAATTCCACAGGCCTGCAGTAGACCTTCCAGTTCTGACGGTC 

GTATGTGGCAGTAATATAGAGTTGTCTGTGGCGGGATTGCCTGCAGATTAGTATTACACGCGGCGGCCGCTTCCCGGGCCAAAACATACTGTTTG 

GTGTACAGGTAATATTGGGCGAGCTCGTAGTGCAGTTGGGCTCTCAATTCCTGGGGTGTTATTAGGTGCATAACATCGTAGTTTTGGCCATTGCC 

GATATCATCACATCGCAGCGGAACAAAGCTGTCGTAGTGAAATATGTAGAAAGGTTTTGTCGAGGCGCAGAGTTGTTGCAAATACTCGATGGATG 

GTTGAGTCGGTGTGTCCAGAAACTGCTGGAGCTGGTCCACAGTTTGGTACCTATAATTAAGCCTTATTTGAGCATTAGAGTATGGATAACTTAAC 

CACATCTTACATTGTGTTTTGAGGACGTGGCGCTTTTAACGCCTGCTCCTTTATCAGTTGCATTCGCAGCAGCATGCGGTGGTATATGGTCAAGG 

CGAATCTTGCTGCTCTTTCGTTTCCAGCAGCGGCTGAAAGGATTCACAGAGGATATAGAGAAGTATATGTGGAGGCTAGGAATTAACATACAAAT 
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TCTCGCGTCAAAATCATTGGGCAGGGGAATACTTAAGGAGCAGCCGTAGCTTATGGTGCACAGATCGCGCAGCAAGGCCAGCTGCATAATCACAG 
GCAGGTTCTTCTCCAATGCATCCAGATCCCACTTCAGCCAAGTGGCTACCTTCAGCTCGAGGATTTTCAGTGCCAACTGCTTGCGGTTCAATTGC 
AGGCCCTTCTCCGCGGGCCTTTGTGGGCTATGGGGCATGCCCACTCCGCCGGAGGCAGTGGGCGTTGTCGGCGCAGGGGCACCACCAACCACGCC 
GGGAATGGGTCCCGAATTCGATGGCGTCTGGTTTAGATTCTGCAAATCACTGCCAGGCGTCCCCACTGACTCCTGCGCCGTGTTCGGCGTCATGC 
TGATGAACTGCACGATCAGCTCCATGGCGCTGGGCTCTATAAATGGTGGGCATGGATTGTAATCTCAATTTGTCGGCCGGATTCCACAAGTGAGT 
CGAACATACCGGGATGCGGGCGCTGCAGATGCTGCGTAATCTTGTGCGGATCGAGCAGGAACTCGAACCACAGTACCGTTTCGGCAGCCAGTGGA 
ACTGGCTTGGGCTTAAGAGGATCGTCCATGCTAATCCCTGCCAAAATTCACCAATCAAAACAAAAATTAAAGAATGGCGAAGGGAGACCAGCTGG 
CTGTGGCTATACAATCGATGTATCTATCGATGGGCGATAGATTCTGTCCGCTTTTGATTTCGATAAATTTGCCAAATAATTTGATTTGTGTTATC 
GATACATCCCTGCCAACTCGTTTTATGGACAATTGAAAGTTGGTTTTTGAAGATGCTGTTAAATTGAAATATTTAAATCTTTTTTTAAATAGACC 
AGTAGCAGAATGTACTGTTATCTTCTTGAGATTCTTGTTGGCAATTAAAGTTCTCCTTATTCGTTTGGGACACAAAGAAGTATTTCCGCTTTTCT 
AAAAAACAATTTTCCTCTTTTTGTTTAGGCATAATTTTACTGTTAATAATAATTATTTAGTTATCCATTTATATATGGGCAATTAACAGCCTTTA 
CCTAAAATTTTACAATAATAAATACCCCTTAACCCAAATGAAGATTTCGAAATCCAGCTGTAATAATAAAATATAATACCGAATTTAATGTAGGT 
CTAAAGTTTATGAAATAGTTTTTACTTCTTTCATTTTAGTTTATATGTTTGTTTTTATTTTATCCATACTTCATTAGTACCATAAAATGTTTAAC 
TCCATCTCCTTTGCACATTGATCGGAAAAACACAAACAAAAACGACCAAAAGGGGAGTACCTAATCATTTCAATATGCCTAAGACTATTTTATTG 
TTGTTCATTAAAAATGTAAAATAGCCTCAAAAACTTTTTGGTTAATTTATGCAAATATTCTGTTTGTTGTATATCGAGTCAGGTGCTCTAGAGTT 
GGGTTATCTCTCTCTTTCAGTTTGATTTCGATGCAGACACTGAAAGAAATCTATTACAAAATTGTATAGTTTATAGTTGATGTTTTGCGGAATGT 
TCATGGGTATTTTTTTCCACTGTAAGTTCCTCTAAGTTTATTGTACACCCTTTTTTTAGATGGTGATCTTTATGAGCATTTCTCGACTCCATAAT 
CTGAATCTTAAATAAAATATTTAACGTTCAATCAGGTGATTTTATGCAAATTACAC 
(SEQ ID NO: 658) 

Exon: 4471.. 4285 
Exon: 4216.. 3797 
Exon: 3738.. 3621 
Exon: 3565.. 2643 
Exon: 2577.. 1713 
Exon: 1651.. 1189 
Exon: 1120.. 1001 

Start ATG: 4399 (Reverse strand: CAT) 
Transcript No. : CT18357 

CCACAGCCAGCTGGTCTCCCTTCGCCATTCTTTAATTTTTGTTTTGATTGGTGAATTTTGGCAGGGATTAGCATGGACGATCCTCTTAAGCCCAA 
GCCAGTTCCACTGGCTGCCGAAACGGTACTGTGGTTCGAGTTCCTGCTCGATCCGCACAAGATTACGCAGCATCTGCAGCGCCCGCATCCCGAGC 
CCAGCGCCATGGAGCTGATCGTGCAGTTCATCAGCATGACGCCGAACACGGCGCAGGAGTCAGTGGGGACGCCTGGCAGTGATTTGCAGAATCTA 
AACCAGACGCCATCGAATTCGGGACCCATTCCCGGCGTGGTTGGTGGTGCCCCTGCGCCGACAACGCCCACTGCCTCCGGCGGAGTGGGCATGCC 
CCATAGCCCACAAAGGCCCGCGGAGAAGGGCCTGCAATTGAACCGCAAGCAGTTGGCACTGAAAATCCTCGAGCTGAAGGTAGCCACTTGGCTGA 
AGTGGGATCTGGATGCATTGGAGAAGAACCTGCCTGTGATTATGCAGCTGGCCTTGCTGCGCGATCTGTGCACCATAAGCTACGGCTGCTCCTTA 
AGTATTCCCCTGCCCAATGATTTTGACGCGAGAATTTCCGCTGCTGGAAACGAAAGAGCAGCAAGATTCGCCTTGACCATATACCACCGCATGCT 
GCTGCGAATGCAACTGATAAAGGAGCAGGCGTTAAAAGCGCCACGTCCTCAAAACACAATGTACCAAACTGTGGACCAGCTCCAGCAGTTTCTGG 
ACACACCGACTCAACCATCCATCGAGTATTTGCAACAACTCTGCGCCTCGACAAAACCTTTCTACATATTTCACTACGACAGCTTTGTTCCGCTG 
CGATGTGATGATATCGGCAATGGCCAAAACTACGATGTTATGCACCTAATAACACCCCAGGAATTGAGAGCCCAACTGCACTACGAGCTCGCCCA 
ATATTACCTGTACACCAAACAGTATGTTTTGGCCCGGGAAGCGGCCGCCGCGTGTAATACTAATCTGCAGGCAATCCCGCCACAGACAACTCTAT 
ATTACTGCCACATACGACCGTCAGAACTGGAAGGTCTACTGCAGGCCTGTGGAATTAGCGCCGAGGAGCAGTCACTGCTGGAGAAGTTCCAGCAA 
TCGCTACTTAACAACTACACAGACATTGTATCAATTCTGCGAATGGACAACAGAAGGCGAGAAATCCCCTTTATAAGCCGGCGGCAAGTGGAACT 
GGACATTGAGGGTTCCATCTCCACGGGCATACTCAAGGAGACAGTTCAGCTGCAATTGCAGGTGGCTGCATTGAATGTGGTGCGAAACATCTTTG 
AGTGGGGCAGCATCTTTGGCAGCGTTGAGTACTTTGAGAAGTACCGGGAACTGGACTGCCTGCCACCGCTGGTGGAGGCTTTGCAGGAAATGCTT 
CCCCACTGTACTTTCAAAGAACAGGCGGCACTCAAGCACTTCCTCATCGACTGTTTGCTTCATCAGGGTGGGCAGTCACGACAGTTGCTACAAAC 
GGTGCGAGGATTTGGACTGTTCTCGTCAGATGAACTCCAGGATATTGACGAACAGATGCTGCAGGCGACGCCGCCGGTGCCCACCAACTCGTTGG 
CCTCGCTTTCCGACTGGATGTGTCACTCCAAGATGAGCAGGGTTGACGTGGGCGCTCTGGAGCGACAACTGATAAGCTGCACCAATGCCAACACA 
GTGAGGATATTGCTAGTGAAATTATGCGCAACTGCTCCTGGAAAACCTCTGTGGGCCATTAATCCCAGTTGGGACGTACCACAGCCCCTGAAAAC 
TCTTATTATGGCCATGCCGGTTAGCTTTTTGCAAGACTTCAGCTATGTTCTGCTCGGCAAGGCGCGAGAATTGGCCACAAGGGGCAACTACATCG 
ATGCCGTTTCAATGCTGAGTGTACTGAAGTCGGAAAATCAGCGCCAGGAGATGGCGGCCAATGTGCAGCTGATGTGTAAGCTGATCACCTGGGAA 
ATCCTGCACATTCAGATAACACAGTGCCTGGAGGAGTGGCACCAGAAGCCACTCGATCTGCAATCCTTGGGAGGAAGATGCAAGCAATGTTTGGG 
AGCCCTGCAGGCAGGAGATTCAATCGTCCCACGTCCGGATATCCTAGAGAGTTGCGCTATCATGCTACTTAACCTCACAGAATTTCCACCTCTCC 
TTTACCTGGATAAACGGGCGGGTCCCCTGGAACTACCCCTGGCCTTCGCCGCCACCTTCATCGAGATGGAGAAGATGAAGGGCCCCAAGAAGGTA 
TGTCGCGATGCCTGGGAACTGATGCTAAGCATGTTCCTTAACGTTCCGAAGCGTGGCTCCTCAGGAGTTGGCGGTATCAGTTCCCTGCAGGCCTT 
CCTACAGCGAATCCGCCACCAAAGCGTTTTCGGACTCGCCATCTCCATGATAGGCAAAGTCCACAACATACTTAAGGATGACCCCAATCATGATT 
TAAGCTGCGAGTACATGCAGCTGTGGCCTACCAGCATTAACAACCCTGTCAGCTACAGTTTGAGGAGCGTTTGCGAGACCCTGCAATGGCTTCTT 
TCCGAGGCTTTGTCCTACTATCCGCAAACCATTTCATGGCTGAAGATGAAAGGCGATCTAGATCTAGCCATCGGCAATAATGAGTCCGCAATGCG 
TTGCTATGTAAATGCGCTGGTTACTGGCACCGACTACTGCACCATGCCGCTGCAGAGAAATGTGGCCGACGACTACGTGATCCGCAAGATGATTC 
GATGTGCCGCCAATCTGGGCTGTCACATGCAGGCCACCGTTCTGTGTCAGTTCCTTGATGAGATCGACTACGGCATTGTGTTCAAGAATCTCAGC 
GAGAAATCGTCGAATTTCACCGATGCCATGGATGCATATTACAGCTGCATCTGGGATACAACTCTGCTGGAGTTCATCGTCAATCTGCACGCTAA 
GCGAGGCGAACACAGTCGGAAACTTGAAGCGATTTCTATGATGGGCACTCTGGAGCTGAATGCAAACAACAATGAGGAGATCAAACGAGAATCTG 
CCATGGTGCGGAAATCGAGATTTCTGCGTGCCCTGGCCAAGCAGTATTTGCTGTAG 
(SEQ ID NO: 659) 

Start ATG: 73 (Reverse strand: CAT) 

MDDPLKPKPVPLAAETVLWFEFLLDPHKITQHLQRPHPEPSAMELIVQFISMTPNTAQESVGTPGSDLQNLNQTPSNSGPIPGVVGGAPAPTTPT 
ASGGVGMPHSPQRPAEKGLQLNRKQLALKILELKVATWLKWDLDALEKNLPVIMQLALLRDLCTISYGCSLSIPLPNDFDARISAAGNERAARFA 
LTIYHRMLLRMQLIKEQALKAPRPQNTMYQTVDQLQQFLDTPTQPSIEYLQQLCASTKPFYIFHYDSFVPLRCDDIGNGQNYDVMHLITPQELRA 
QLHYELAQYYLYTKQYVLAREAAAACNTNLQAIPPQTTLYYCHIRPSELEGLLQACGISAEEQSLLEKFQQSLLNNYTDIVSILRMDNRRREIPF 
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ISRRQVELDIEGSISTGILKETVQLQLQVAALNVVRNIFEWGSIFGSVEYFEKYRELDCLPPLVEALQEMLPHCTFKEQAALKHFLIDCLLHQGG 

QSRQLLQTVRGFGLFSSDELQDIDEQMLQATPPVPTNSLASLSDWMCHSKMSRVDVGALERQLISCTNANTVRILLVKLCATAPGKPLWAINPSW 

DVPQPLKTLIMAMPVSFLQDFSYVLLGKARELATRGNYIDAVSMLSVLKSENQRQEMAANVQLMCKLITWEILHIQITQCLEEWHQKPLDLQSLG 

GRCKQCLGALQAGDSIVPRPDILESCAIMLLNLTEFPPLLYLDKRAGPLELPLAFAATFIEMEKMKGPKKVCRDAWELMLSMFLNVPKRGSSGVG 

GISSLQAFLQRIRHQSVFGLAISMIGKVHNILKDDPNHDLSCEYMQLWPTSINNPVSYSLRSVCETLQWLLSEALSYYPQTISWLKMKGDLDLAI 

GNNESAMRCYVHALVTGTDYCTMPLQRNVADDYVIRKMIRCAANLGCHMQATVLCQFLDEIDYGIVFKNLSEKSSNFTDAMDAYYSCIWDTTLLE 

FIVNLHAKRGEHSRKLEAISMMGTLELNANNNEEIKRESAMVRKSRFLRALAKQYLL* 

{SEQ ID NO: 660) 

Classification : known_f lybase_gene 
Gene Symbol: EG:EG0003.5 
FlyBase ID: FBgnO025830 



Celera Sequence No. : 142000013384556 

CACTTATTGTGAATTTTGTTTAAATGCCTTGGCCTCTTCGCACTTTGCTTTCCTGCCTTTTGTTGTTCTGCTCTATCTTCTTGCTTTTTCACTTT 

TCGTGCTCCTCACACGCGTGACTCTGCGGTCGCATTAGGTGGAGCTGCATTTCCCTCGCGGAAACTGCGTCGGTTGTCCTGTGGCGGCCTGCAGA 

GGCAGTGCGTCAAAGCGGAGGCTGACCACTCGAATGTCGGGCGACTCGGATTGCAAACTTTTCCATTAAACTCGCGCAGCACTCATCCGCCCTTC 

GCGTTGCGATGCGATTTCCGTCGACAGCGGAGCGTCCGTGCGTTCGCAGCAAAGTTTTTTTTTCGATTAATGTGACCAGTCGGCGGGGTCTGCGA 

TTAGTACTGCCAAGAGTATGGCGCTCTTCGGTTCGGGGGAAAATGCTGAGGACGTGTAAAAACTATCCTACACCGTTCACACTAGCAATGCGATA 

TTGTTGTACTGTTGGATTTTCAATGTTTGACATAAGCTTATAGCTAATTGTATATCTTATCACTAAAGTTGGTAAACTGTCGAATTCGAACTATT 

AACATTCGTATTTGCCAAAATCAGCTTAGTATAAACACATTGAATAATTTTAATCAATAATTAATTAATGATAATAATAATAATAATAAATTAAT 

AATAATAATAATAAAATAATAATATTAACAATGATAATAATTAATTATTACATAATATATAATGTTGTATTTCACATAGCTTAATGAATAAATTA 

GAATGGGTACAAACATTTACGGCTTGAAACGCTTTAAATTTACTCAAAGTTCGAAAGTGAACTAAACTTTAGGCTAACAGGTTCGCTTCCCCACT 

GCTTGTCCGTTCGAAAAGGTTCGAGTGCGCACTGTCCGCCAGTGGAGTACCGCACCCTGCATTCGCCTCGCACATCTGTCATTTTTTAGTGCCAT 

TCAAGTTTCGCTGGCGCTTTCGCGTCGCTTGGTCACTAGTGTCGTTTTCGTAAGTTGTTGGGGTGGGTTAAAATCCGCGCGCAGAGCTATAATTT 

TATTCCTGGAAAATCGTCGTTGTCGCTGACCGCGTCGCATTGCAAAGTTACCGTACGCCACCGAAAGTATTAGCCCTTTGTAGCCGGACAACTGT 

CAAACGGATTTGACTGACACAAGAGCAGTGATATTAGCAGCGCAACGGAAAGCAATTATACGACATAAAAATGTTCTCACGACCCAGGTAAGCTT 

TTCTTTTTCGTTTGGTTTTTTATGCAAGGACTCGCCTCGAAGGATTTGTGTGCGCGTGGGTGTGTGTGTGCCTCGGGCGCAAATTATGTAAGATT 

TCACGATGCGGGAGGAGCAACTGCCCCAGCTGGTAGCCCAGTGACCTAGTCACTCGGATCAGACGAGCGGCCGCCAAGGCGTCCGTGTATATATA 

CAATATATTCACATAGTATATCATCCCATTTCCCCTTCTCTTTTCCTCCTCCTTTTGCTGCGCAGCTTTCGTTTTCACTGGGAAGCGCTATGTGC 

GAGCGAGAGGGATTCCGATTGGGAATCGCACATGGCCTGGCATTGACCCTTCGGCTTCCCTCCCTTTCTCGCACTTGCTGGTCCCGTTCGGTGAA 

CAGCTGTTTTCAGAATTTTCACTTTTCCCTTGCAATTGCATAAGCGTTTTCGAACATGCATGTCTTCGTTTTTTGGCTGCCAAACAAGAAACAGC 

TGAAAAAATCAATGGGCTGCAGAGAGAGAAACGGGGTGCTACGTATCATAAGTATATGTGCTTTAATTTTGCTTTTCTTCGGTGGAGGAATATCG 

CCGCACATAATTCGTATGAATTTTCTATGATTCACAGTTGGATAGGAAGTACAAATACGAATGTATTCCATTGTTTAGTAGTATTCTAAGAAAAA 

AAAAGGAATGGAAAGTTTATCATCATACTTTTACAATCGAAAACATTTTGTTTCCTGTGTAGCGAAAACACAACACCTTCAATACCATGGACATG 

GACATATATTCGAAAGATGGTCTGGCGGAATTCTAAATTTAGTGCATATACCGAAAAAATGTGTGCTTTGTTTCGCCGCTGTCAGCGTTTTTCGT 

TGCCAAGTTCATCAAGAATGTTTTCAGCTGGTCTGCTATCTTGCCTCTCGTTCCCACGCAAACCGTTAGCCAGAAATTTGGCTAACAAAAAATGG 

AAAGACGTAAAGAGTTTAATGAACTTGCCCAGCTTTTTCCTTCAGACTCCAGTTCTTAGGGCTTTTAAGACCCTAAAAATAATTATTGGTAGCAC 

TCGGAGTATGGGGGTATCATTTACGACACGCACTTATTCCGTGTATTTGGCTCAGATATGCGAAATTGTTTAGCACTCGACGTGGTGGGGGCTCA 

TGTGAGTGCCATTCAAATAAATGGAAAAAATTGACGAGTGATAGCGTTACCTAAACGATAATTATCAACCCAAAGTGCTCGCCTTTTGTCGGTGC 

AAGGGATCTCGGGGTTTCCAGCCCGGCGCACGTCACTAACCGGTTTCAACTTGACCAAAATCCAGCAACGACACCGATACCGACCCTCTGCCAAT 

CCCAATTCCACTCCCCGCGAGCATGATTCCATCAATGAACCGCAAACACAAATTAGTCACACGCCTCTGTCCCGCGATATTTGCCTCTCTCGTAA 

TTAGCACCATCTATTCGAACTGCCAGCTAGCAAAAACCAAACAGCAGGGGTCAAAATTATAGGGTATCTTATAATATATGATCTTCGCTTGGCAC 

TTCGATTACAGTAAGAGTGTTGCATAATCAGAACCTGTTTTTAATTAACTGATCTAGTCCGTTGGGGTCCATAATATTGTATATTTTTGTAAGCC 

CTTCACTACGAGTTGTTAGTTTTGCAATCTTATCTGGGGGTATTTACGCTTCAAGAACAGAAAAACATTTTCAAGTTCAATGATCCAATTGGTTT 

TGTAATTCATTAAATTGATACGATCACAAGTAAGTGCACAGAAAATTACCTACTTTAGTTGTCCCTAGAGAATAAATAATAATGCACTTCAATCC 

AAGCTGTAGAACTGAACTTGTAGTTTTTAAATATTTTTTATTCTAAAGAATTTAGATAAGTTATTTATAAGAATTTATTACAAGTTATTAAGATT 

CTGAGGATTAGATGCTAGATTTATGACCGCGGCTGTGCATAGATTTTTATCGTGATGTTGCCAGGCCATCCATCCCCCTTATCGCTGGCGCTTTG 

GATTCCGGGGGACAGCTGTAAACAATGCGGCTTAGTTTCCTTCTTTTCTGCTTTTCATTTTATATTTTGGCGTGCCCAGCGCGGTGATAAACAAA 

AAGCAGCTGATGTCGACCTTGGGCATGGCGTTTTTATTCCATAAATATATGCATATGTGTGTGTTTGTGGGTGGACAGAGGGGGCCCCCCCATTG 

CCAGTGGCAATGTCAAGAATGACAGTGGGCCCACCTCAAAGTCAAGGGTTCCCAGGGGGTTCTCCAATTGCCGCGCTGATAAAGTGATTCACGTA 

ATGCACTTTTGCAATTTGCAGCTTATGTGGAACTGTGGGCAAATTGTGTCGATGCGGCACTTCAGCAACAGGAAAGACTGCTGTGGCAGCAGCAA 

CAGTTCCCACCGCTCGACACTATCATGAGGTGGTGGGCGATATCATCTGCCCGTCTCAGGTGCGCGGCATAGATCACATCCGTGATCCGCGACTC 

AACAAGGTAAGGAGCACTAAATTAATTGTGTGACACTTTTGTTTTGTATAAACAAATGCTTTTGTTTCTAGGGCCTTGCCTTTACACTGGAGGAG 

CGCCAGACTCTGGGCATCCATGGGCTGCAGCCAGCGCGCTTCAAGACGCAAGAGGAGCAGCTGCAGCTATGCAAGATCGCCGTAAACCGCTACAC 

GGAGCCGCTGAACAAGTATCTCTATCTGAGCGATCTGTACGATCGCAATGAGCGTCTGTTCTTCCGCTTCCTATCGGAGAACATCGAGGATCTGA 

TGCCCATTGTGTACACGCCGACGGTGGGCTTGGCCTGCCAGCGCTTTGGCCTGATCTACCGACGTCCTCATGGTCTGTTCATCACCTACAACGAT 

CGAGGACACATCTTTGATGTGATGAAGAACTGGCCGGAGCCGAATGTGCGTGCCATCTGTGTGACGGATGGTGAGCGCATCCTGGGACTGGGAGA 

TTTGGGCGCCTGCGGCATGGGCATTCCCGTGGGCAAGCTGGCCTTGTACACGGCTCTGGCCGGAATCAAGCCCCATCAGTGCCTGCCCATTGTGG 

TGGATGTGGGCACCAACAACATCGATCTGCTGGAGGATCCCCTGTACGTGGGTCTGCGTCAGAAGCGAGTGGTTGGTCGGGAGTACGACGAGTTC 

ATTGACGAGTTTATGGAGGCCGTGGTGCAGCGCTATGGCCAGAACACTCTGATCCAGTTTGAGGACTTTGGCAACCACAATGCATTTAGGTTCCT 

GGACAAATACCGCAACACCTATTGCACTTTCAACGACGACATCCAGGGAACCGCCTCCGTGGCCGTTGCTGGGCTCTATGCCTCCAAGCGTATTA 

CGGGTAAGTCCTTCAAGGACTACACCTTCCTGTTTGCAGGAGCCGGTGAGGCCGCCATCGGCATCGCCGATCTCACGGTCAAGGCCATGGTGCAG 

GATGGTGTGCCCATCGAGGAGGCCTACAACAGAATTTACATGGTCGATATCGATGGTCTGCTGACCAAGAGCCGCAAGGTGGGCAACTTGGATGG 

CCACAAAATCCACTACGCCAAGGACATCAATCCGATGTCAGATCTCGCCGAGATTGTCTCCACCATCAAACCCAGTGTATGTTATAACAATTGTT 

ATTTCTGATGGAACTTTTGGTAATCCTAAACTAATCAACAGGTGCTTATTGGTGCCTCGGCTGCTGCCGGCATTTTCACACCGGAAATCCTGCGC 

ACCATGGCGGATAACAATGAGAGGCCCGTGGTGTTCGCCTTGTCCAATCCCACCAGCAAGGCAGAATGCACTGCCGAAGATGCTTACAAGCACAC 

GGATGTAAGTTATTTTCATTGGATATTGTAATGACCAACTAATGAGATGATTGAAATGCACAGGCTCGCGTGATCTTCTCGTCGGGCTCTCCCTT 

CCCACCGGTCCAGATCGGTGACAAGACCTTCTACCCGGGCCAGGGCAACAATGCCTACATCTTCCCAGGCGTTGGCTTGGGCGTGATCTGCACGG 

GTACTCACCACATACCCGACGAAATGTTCCTCATCGCCGCCCAGGAGTTGGCCAACTTCGTGGAGCCCAGCGATATTGAGCGCGGATCGCTGTAT 
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CCTCCGCTGTCGAGCATCCGCAACGTGTCCATGAACATTGCCGTTGGCGTGACCAAATGTGCCTACGATAGAGGTAAGATACCAATAGCATACCA 
ATCTTGGTGGAACAGAAAATATAACTAATTTGGATTATTTTTGTAGGCTTGGCATCTACCTACCCTGAGCCACAGGACAAGCGCAAGTGGCTGGA 
GAATCAGCTGTACAACTTCAACTACGAGAGCTCGATGCCCGCTTCCTGGGTTTGGCCGCGGATGCCCTACATTAAGACTCGTGATTTAGTCCCCA 
CAAAGCTCTATGGAAAACAAAGTGAAAATTAATTTCTTTCTTTATTTTATTCTAGGCGAAGAAAGCCCGCTCATTGCCGCCATCAAATAAAATAG 
CATTAATGCGTTCAACTGGTTGATGGCACCGTTCTGTTTTGCATCTAACACACTCTACTACTTCCAAGCGTACACCATCAGCACCAAGCAACAAA 
ACAACCAACCGACTACTAACTACCCTAGACTAGATAAATTTCCAGTGCTCACTCTTTTTAAATGTTTGTTAATCGATCCGTTGGCATCGACAGTC 
AAACTCAGTCGCAACTAAAAAATGATTAAAATGTTTTTGATATTAAATAATTAGAACACCTAACAAGCTGCGAGGTCAAGGGATCGGTTCTTAAA 
TGTATTTTCAAATTGTTTAATAGTATTATGTATGTATGCCATGTTTTTATATATGTAAAAATTATCATGTTATTAATTTTGTGATACCGAAACGT 
ACGATACTAAATGACGACATTGACATGACGACTGTGACGGTGACTACGCCTTCGAAAAAGCAATTACGCAGTATTCTATTCTATCCTATCACTAT 
CCTATCTGTATTTAATCTATGCTATGTTTATGGTTTGCTGTAACTTAAACACCCGATTGGCTTCGCAlATTCCATTCCTCGCCGCTTGCCATAGCT 
ATCCCGCATTATAAGTGTACCCCAGCTTCCGATATGATCTGCTGTGTCTACTACTACTACTAACTCTGACGGCGGCATCAAGCGAGCTCTGAAGC 
TATACTTATTTATCAAATTCACGCTTAATCACAATCCACCCGAAATTCGACAATTGATTGATTTATCTATTGCTATATGCTCCTTTCGTTCACAC 
ACAGAGAGCGAGGAAGTAATGTAAATTATCGATTACCCCCCTACAAACCGACCGCCTAGAACAATTGGCTAGACAGCAGAGCTAACTGGTGAGAA 
CAACCCAAACAAGCAAACAACCAAACAACTAAACAGCCAACCATCCAACCGATTCCACATGTAAAACAACTTAACTCGCTGCCGTTAACCACCAC 
CAACTATCTAAGTTAGAACATTATGCTTTCTATCGATTAGTTCACTTATATCTGTAGATATATTCGAGCGACGATTATGTTTGTTGTTTAGCCAT 
ATCTGATATTTATTAACGCAATTAAGCAGCTTTGTTATAATTTATTGTTAAAAGGTCGAGCCTAATGTATCAGTAACTAGAAAACCTAACCCCTT 
AT GAT C G AT C GG AC C GAT T C AGC C AT T GT AC C ACT GAC T AG AT G AT G AT GAT GT GC G AT AAAC AAT AAAAC G AAAAC AC AAGT T AAAC AC T C T C G 
ATCCTTTATGCCAATTGCTTGCTGGAGATCTCTTGAGAT 
(SEQ ID NO: 661) 

Exon: 1001.. 1227 

Exon: 3537.. 3711 

Exon: 3777.. 4826 

Exon: 4887.. 5039 

Exon: 5099.. 5393 

Exon: 5462.. 5590 

Exon: 5661.. 5974 

Start ATG: 1211 

Transcript No. : CT18483 

TAAGTTGTTGGGGTGGGTTAAAATCCGCGCGCAGAGCTATAATTTTATTCCTGGAAAATCGTCGTTGTCGCTGACCGCGTCGCATTGCAAAGTTA 
CCGTACGCCACCGAAAGTATTAGCCCTTTGTAGCCGGACAACTGTCAAACGGATTTGACTGACACAAGAGCAGTGATATTAGCAGCGCAACGGAA 
AGCAATTATACGACATAAAAATGTTCTCACGACCCAGCTTATGTGGAACTGTGGGCAAATTGTGTCGATGCGGCACTTCAGCAACAGGAAAGACT 
GCTGTGGCAGCAGCAACAGTTCCCACCGCTCGACACTATCATGAGGTGGTGGGCGATATCATCTGCCCGTCTCAGGTGCGCGGCATAGATCACAT 
CCGTGATCCGCGACTCAACAAGGGCCTTGCCTTTACACTGGAGGAGCGCCAGACTCTGGGCATCCATGGGCTGCAGCCAGCGCGCTTCAAGACGC 
AAGAGGAGCAGCTGCAGCTATGCAAGATCGCCGTAAACCGCTACACGGAGCCGCTGAACAAGTATCTCTATCTGAGCGATCTGTACGATCGCAAT 
GAGCGTCTGTTCTTCCGCTTCCTATCGGAGAACATCGAGGATCTGATGCCCATTGTGTACACGCCGACGGTGGGCTTGGCCTGCCAGCGCTTTGG 
CCTGATCTACCGACGTCCTCATGGTCTGTTCATCACCTACAACGATCGAGGACACATCTTTGATGTGATGAAGAACTGGCCGGAGCCGAATGTGC 
GTGCCATCTGTGTGACGGATGGTGAGCGCATCCTGGGACTGGGAGATTTGGGCGCCTGCGGCATGGGCATTCCCGTGGGCAAGCTGGCCTTGTAC 
ACGGCTCTGGCCGGAATCAAGCCCCATCAGTGCCTGCCCATTGTGGTGGATGTGGGCACCAACAACATCGATCTGCTGGAGGATCCCCTGTACGT 
GGGTCTGCGTCAGAAGCGAGTGGTTGGTCGGGAGTACGACGAGTTCATTGACGAGTTTATGGAGGCCGTGGTGCAGCGCTATGGCCAGAACACTC 
TGATCCAGTTTGAGGACTTTGGCAACCACAATGCATTTAGGTTCCTGGACAAATACCGCAACACCTATTGCACTTTCAACGACGACATCCAGGGA 
ACCGCCTCCGTGGCCGTTGCTGGGCTCTATGCCTCCAAGCGTATTACGGGTAAGTCCTTCAAGGACTACACCTTCCTGTTTGCAGGAGCCGGTGA 
GGCCGCCATCGGCATCGCCGATCTCACGGTCAAGGCCATGGTGCAGGATGGTGTGCCCATCGAGGAGGCCTACAACAGAATTTACATGGTCGATA 
TCGATGGTCTGCTGACCAAGAGCCGCAAGGTGGGCAACTTGGATGGCCACAAAATCCACTACGCCAAGGACATCAATCCGATGTCAGATCTCGCC 
GAGATTGTCTCCACCATCAAACCCAGTGTGCTTATTGGTGCCTCGGCTGCTGCCGGCATTTTCACACCGGAAATCCTGCGCACCATGGCGGATAA 
CAATGAGAGGCCCGTGGTGTTCGCCTTGTCCAATCCCACCAGCAAGGCAGAATGCACTGCCGAAGATGCTTACAAGCACACGGATGCTCGCGTGA 
TCTTCTCGTCGGGCTCTCCCTTCCCACCGGTCCAGATCGGTGACAAGACCTTCTACCCGGGCCAGGGCAACAATGCCTACATCTTCCCAGGCGTT 
GGCTTGGGCGTGATCTGCACGGGTACTCACCACATACCCGACGAAATGTTCCTCATCGCCGCCCAGGAGTTGGCCAACTTCGTGGAGCCCAGCGA 
TATTGAGCGCGGATCGCTGTATCCTCCGCTGTCGAGCATCCGCAACGTGTCCATGAACATTGCCGTTGGCGTGACCAAATGTGCCTACGATAGAG 
GCTTGGCATCTACCTACCCTGAGCCACAGGACAAGCGCAAGTGGCTGGAGAATCAGCTGTACAACTTCAACTACGAGAGCTCGATGCCCGCTTCC 
TGGGTTTGGCCGCGGATGCCCTACATTAAGACTCGCGAAGAAAGCCCGCTCATTGCCGCCATCAAATAAAATAGCATTAATGCGTTCAACTGGTT 
GATGGCACCGTTCTGTTTTGCATCTAACACACTCTACTACTTCCAAGCGTACACCATCAGCACCAAGCAACAAAACAACCAACCGACTACTAACT 
ACCCTAGACTAGATAAATTTCCAGTGCTCACTCTTTTTAAATGTTTGTTAATCGATCCGTTGGCATCGACAGTCAAACTCAGTCGCAACTAAAAA 
ATGATTAAAATGTTTTTGATATTAAATAATTAGAACACCTAACAAGCTGCGAGGTCAAGGGAT 
(SEQ ID NO: 662) 

Start ATG: 211 

MFSRPSLCGTVGKLCRCGTSATGKTAVAAATVPTARHYHEVVGDIICPSQVRGIDHIRDPRLNKGLAFTLEERQTLGIHGLQPARFKTQEEQLQL 

CKIAVNRYTEPLNKYLYLSDLYDRNERLFFRFLSENIEDLMPIVYTPTVGLACQRFGLIYRRPHGLFITYNDRGHIFDVMKNWPEPNVRAICVTD 

GERILGLGDLGACGMGIPVGKLALYTALAGIKPHQCLPIVVDVGTNNIDLLEDPLYVGLRQKRVVGREYDEFIDEFMEAVVQRYGQNTLIQFEDF 

GNHNAFRFLDKYRNTYCTFNDDIQGTASVAVAGLYASKRITGKSFKDYTFLFAGAGEAAIGIADLTVKAMVQDGVPIEEAYNRIYMVDIDGLLTK 

SRKVGNLDGHKIHYAKDINPMSDLAEIVSTIKPSVLIGASAAAGIFTPEILRTMADNNERPVVFALSNPTSKAECTAEDAYKHTDARVIFSSGSP 

FPPVQIGDKTFYPGQGNNAYIFPGVGLGVICTGTHHIPDEMFLIAAQELANFVEPSDIERGSLYPPLSSIRNVSMNIAVGVTKCAYDRGLASTYP 

EPQDKRKWLENQLYNFNYESSMPASWVWPRMPYIKTREESPLIAAIK* 

(SEQ ID NO: 663) 

Name: MALATE OXIDOREDUCTASE 
Classification: enzyme 
Gene Symbol: Mdh 
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FlyBase ID: FBgn0029155 



Celera Sequence No. : 142000013385213 

CCGCTCGCAAACAGTTTGTCTATTTTTAAATAGAATTCAAGAAAGTATTCGGGGAGGAAACGTATGTACTTATGCATGAATCACATAATCCCGAT 
TCGTAAGAGCAAAATATAAAGGATCGGGGATGTAATTGGATTTAAACACATTTAACTACGGGACAATGGAACCTTGGAATTTTGGTTTTACTTTA 
GGTTTCCATGGGTTCGTTGCTGTCGATCGAGACACTATATCTGCGGGGTATATCTATTTATCGATGTACGCCGTATTATGTGAGCATGTATGTAT 
ATTCGCACGTATGTATGTATATGGATGTGAGTAACGCTGCCGGCTGCTACACTGGGCGCAGGCTTTAACTTAAAAATTAAATCATCGGCGCGAGC 
GATCAGTTGAAGACTAAAGACTTGATGAATCGCCGCGCCAGCCGAAAGCTCTTCATTACAGTTGTGCTCTCTGGAAAGCGATTCTCCGCTTCTCT 
TTGCACCACACAACCAAAAGAATCTCGTGGGAAAATGCGAGAGCGATCCGTTTAGGGTTGTCGAGCGGCTCGAACTATAGAGCAAATATGTAGAA 
AT AAAT GAAC AT GAT C GC AT T CGT AC AC AAAAT G AT CTTTATCC T T AT AAC AAT AGT T T C AAT T C G A T AC AT AAAC AG AAAT T AC AG AAAT G AAA 
ACTAATCATGATGACAAATCAATAAAACAAGAATATTAACTTGTCTGCCTTGGTTTATCGTGGAAATGTTGCTGTTACTAATGAAGAAAAACTAA 
TGTCGCCTCAACGGTGAAAAATTACTAACTCGATTTATGATCCACTTTTGGCATACCATACACCAAATCTCTCCTTGCATACAAGTTTAAAAAAT 
ATACAAATACAATATTGTTAGAATAATTCAAACGATGTTAACAGGATTTGACAGCGGTAGGACGATTAGACCAAGTTGATCAAATGCTCGCGCAG 
AATAAATACGGAGGTGGGGACAAGGACGAGAACGAGGACAACGAGAATGCCTATTTGGCGGGCGAGGCTATGGAGTTGTTTAGTGAATTGTTTAG 
CTGCTCCTCCTGATCCTTGTCCAAGCCAAGGGCTGTGGTCAACAGCTTGACCACACATCGCTGCTCGTGCACCTCGCCCTTCTCGTTGGAAGTAA 
TGTCCTTTTCTAGTTTTAGCTTCTCTTCGGCGGCCATCAGGATGCCCTCCTCGATGGTACTCTCGGAAATGAGACGGTAAATGGTCACCGGACGC 
TGCTGTCCCATGCGATGACACCGATCCTCGGCCTGTTTATCGTTGTACGGGTTGAAGTCGATATCGTGAATGACACAGGTGTCGGCAGCCGTCAG 
GTTGATGCCAACGCCACCGGCTTTGGTGGAGAGTAGGAACACAAAGATGCTATCGTCCCCATTGAAGTCGGTGATCAAATCCTGGCGGACATTTA 
CGGCGGTCGCCCCGTCCAGGCGACAGAAACCAAACTTTCGTATTCGCAGATACTCCTCCACAATGTCCAGCATCATGGTGAACTGGCTGAAAAGC 
AACACACGGTGTCCTTCTGCCTTCAGTTTTGGAAGCAACGTATCCAGGTAGAGGAATTTTCCAGAATCGCATATCAAATTATCGGGTATTTTCAC 
GTCGTAAAACTCCTGAAAGATGCACAAATTTAGTCATTGTCATTGGACAACCTCTTTGATAGTCACTTACATGCTTGTTCATCATCTGATAGACC 
TGAAAGTCTGACATAACGGCTAGCTCTTCAAAGATGTATTGCTCGTTGGTCTTTTTAAAGGAACTGGCATTAGCCAGGCGCTTAGAGAATCCACG 
TAGATTGGCGTCGGTGAAGTAGTGACGCATCAGCAACGGATGGTTTGCAATGCGGCGCATCTCCATCATGATCGCTATGCCGGCCCTTTCGCTGC 
TGCTGCACACCTCGCCCTTGTTATTGGAATAATAATCGACCAGCTCGTGATAGTATATTTTCTGTTGACTGCTCATGGGCACTTTTTCCTTTAAA 
TAAAAAACAATTTAATTATTGTATAATATTTATTTTCGCCCGACGTACTTACCACCAGGCTTAGCTTCTTGGGCAGGTTCTTAAGGACATCTTTC 
TTAAGCCGTCTGAGTACAAATGGCTTCATGATACGCTTTGCTCTCTGGATTTGTGTTTCCTGGAACTGTGAAACCTCATCCTGATCTCCATCGCT 
TTTTCCCTTCTGCAAGGAACATGAAGTTTATATATTGTTTTGTGTGAAACCAATCTTTGTTTTACCTTAGCAAACAACGATTTTATGTCCTCAAT 
GCTCTTGGCAAAGAATTTGGGCATCACGAAGCAGAGCAGAGAGATTAGCTCCAGCAGATTGTTCTGCAGCGGAGTTCCAGTGAGCAGGATTCGCA 
TCCTAGCGTTTATGGTTATAAGGTTAGCATAGCGTTGCGTGGTCATGTTCTTCAGCATATGGGCTTCATCAAAAATGACGTAGTCCAACTTGCAA 
ACCCGGAACATTTTCCTCTCCTCCGGCGTGGATCCCACAATGTGATATCTGTTCAAAATGTACAAGAAATTTAAGCTTAGATTCTTTTATGGCTT 
TGTGGATATTTACGTGGTCAGAAGAACATCAAAGCCGGTAAATCCATCCTTGGCAAAGCGGCCTCGCATTCTCCTTCGCTCGTCCTGCGAGCCAT 
GGTACTTCTCCACCACCAGTTCGGGACACCAGCGGCTAATCTCTGCCTCCCAATTATCCAGCGTGGACGAGGGCACCACAATTAAATGTGCAGCC 
TGGCTAAGACCATTCTCCTTTAGATAGGCCAGAAATGCAATAACCTGAATTGTCTTGCCCAGACCCATTTCATCTGCCAAAATGCCATTCATTTC 
TTGTTTGTGCATTACGGTGAGCCAATTAAGGCCAATTATCTGATAATCAGCCAGCTGTAGACTGCAAGAGGAACGAAAATAGAAACACAAATGGG 
GACATGCATGCTGGTTAAGGCTCACCCGCTGCTAAGAAGCTTAGGTTGCTCCACAATGCCAGCACCATTGGAAATCGCCTTCTCCAGACGCGACA 
CCATGTTATTGCACTTACTTAGAATAGCGGCCACCGTGTTCTGTTTGTTAATCAGCTCTTGAGCGTAATTCAAAAGATCACCCGACATCCTGATG 
CTCTCCAGCTTTTGGCGCAAATCAGACCAATCACTAAATGGTCGCACATCAATGATGGCTAATGCCTTCTTCTCGGACAGCGTTTTAACTGACTG 
GAGTTCAATTAGCGAAGCCTCGTTCATAAACTGGAAAACTTTTTTACGTTGCCCCGTCATCTTGGTGGACATCTCGCTATCGCTGTCATCGCTAT 
CATACACCTGGTCCTTGGACTGCTTCACATCATCGTCGTCCGAATGGTCGTTATCCGAGAAGTTGCCTCCACTAATTCCGTTTGACCTGGGTTTC 
AGCTTGGACTTGGCCAGTGGTCCATTGTGCCCCTTGGGCTTGCAGTTCTCCCTTAGGTATCGCACGGAAGCGGCCACATCCCAATTGGTGCGGGA 
AAGAGACTCCTGGATGGCCTACGGAACATATTTGAAATTT ATT TCAGTTAAAATCGAGATTTATGC AAAAT ATT GCATTAAAAATGAGCAATTTT 
AAGCTATAAAATATTCTGATTAATTTTAAAATTCAAATAAGAACGGTGATAATGTCCGGAAATATTTTATATGAGGCACACCAAAGTAGTGGAAA 
CAGTTACACGAGAGGGTTATTACTGCTCCCATCCCCGTCAGTGACGCCCCTGGCTAACGACAGCTGTTCGCAACGCCGGGCCGAACCGAATCGCC 
ATGTTCCCCGAACTCAGATGCAGAGACGCGCGTTTCCCCGCGAATCGCCCCGTTTTACCTTGGTGGCATTTCGGCCGGAAACCTACCATTGTGTC 
AAAGTGGGGCGAAATCTTGGCGGCTGCCATGTAGCGCTCCTCCTTTTCCTTAACGGTCAGCTCCAGTTTGGTCTTCTTGGGCGTCTGGGAGTCGT 
TGCCATCGCTGTCACTATCGGCCATCACCTGGATGCGCTTCTTTCCAGGCACACGCTCTGAAATCAGGCAGTTAGCTACCGTGTCAATCGCTTGG 
AATTTTAGCTCAGCAAGTACCAGTTCTCGAAGGACTAGCGACGACAGAGGAAGCATTCTTGTTGATGCGGAACTGACGCAGATCGCTGAGGCTCG 
ACTTCGCGGACGATGAGGCGGAGGCCGACGCAGATGCGGCCACTGTGCTGTCCGACATCGTGGGACAAGGCGAAACGCTTCCTGGCAATCTGTCA 
GTCCTTATTAACAAATTCTGTTTGCAACTCCTACAATCTTAAGTTGGTATTTGTGAATATTCGCATGCTAGTGCGGCCGTTTGATTATCCCGATA 
AAACCTATTTCTATTTCTCCAACGGAATGCGGTCACACTGTGCGCCAATTTAAGTATTAGCTTAATGATTATTATTATTTTTCATTTATTTTTAT 
CGAACTGTAACGCATTAGTGTAAGCTTCTATGAGATTATCTAACAATTTGCTTTTGTATATATCGTATATGATTTGATTTCGTCAACCTAACCGT 
AAGTAATTAAAACCGATGCCAAAATGAAAGAAAGAAACTTAAAAAAATCCCAAACTGGCGGTAAATTAAGAAAGTTGCCAGAGTTGCCAGATGTT 
ATCGGCTTACCGCAAGCAATCGGCCAACAATCGAAACATTGGGACGCGTTGCAACCCTCTCTTTTTTTCGCTTGCTCTTGTAGGTTAATTGAAAA 
CACTTTTATTGCAAATAAACCGCTTGCCGAGCCAAGGGCAAACCGATAAACTGGCAAACATGGCTGGCGGCAACACCCCGCGTGTGATCCAGGTG 
ACCAACATAGCGCCGCAGGCCACCAAGGACCAGATGCAGACGCTGTTCGGGAACATTGGCAAGATAGAGGAGATCCGCCTGTACCCGACAATCCG 
TGATGTCTCCTGTCCGGTGCAGTCGCGCATCTGCTATGTGAAATACACGGACACGACCAGTGTGCCGGTGGCCCAGCACTTGACCAACACGGTGT 
TCATCGATCGCGCCCTAATTGTCATACCCGTTCTGGCCATACCCGAGGAGTATCGGGCCCTGGAGATGCTCAAGAACGGAACCATTGTGCCGGGA 
CTCCAGAAGCCGGACTCCAAGCTACCGCCCGAAGTCATTAACCGCATCGAGGGACAGCTGCCGCAGCAAGTGATCAAGACGTACGACCCCAAGTT 
GGTGGAATTCAATCTGCCGGAGTACCCGGCCTTACCCTCGTTCTACGATGCGCGCAAAATCGAGGAGATTCGGCGCACCATTATCGTGTGCGATG 
TTAAGA 

(SEQ ID NO: 664) 

Exon: 4326.. 4106 

Exon: 4047.. 3887 

Exon: 3533.. 2971 

Exon: 2911.. 2579 

Exon: 2518. .2251 

Exon: 2194 . .2048 

Exon: 1988.. 1686 

Exon: 1627.. 1001 
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Start ATG : 4238 (Reverse strand: CAT) 
Transcript No. : CT18505 

AATACCAACTTAAGATTGTAGGAGTTGCAAACAGAATTTGTTAATAAGGACTGACAGATTGCCAGGAAGCGTTTCGCCTTGTCCCACGATGTCGG 
ACAGCACAGTGGCCGCATCTGCGTCGGCCTCCGCCTCATCGTCCGCGAAGTCGAGCCTCAGCGATCTGCGTCAGTTCCGCATCAACAAGAATGCT 
TCCTCTGTCGTCGCTAGTCCTTCGAGAACTGAGCGTGTGCCTGGAAAGAAGCGCATCCAGGTGATGGCCGATAGTGACAGCGATGGCAACGACTC 
CCAGACGCCCAAGAAGACCAAACTGGAGCTGACCGTTAAGGAAAAGGAGGAGCGCTACATGGCAGCCGCCAAGATTTCGCCCCACTTTGACACAA 
TGGCCATCCAGGAGTCTCTTTCCCGCACCAATTGGGATGTGGCCGCTTCCGTGCGATACCTAAGGGAGAACTGCAAGCCCAAGGGGCACAATGGA 
CCACTGGCCAAGTCCAAGCTGAAACCCAGGTCAAACGGAATTAGTGGAGGCAACTTCTCGGATAACGACCATTCGGACGACGATGATGTGAAGCA 
GTCCAAGGACCAGGTGTATGATAGCGATGACAGCGATAGCGAGATGTCCACCAAGATGACGGGGCAACGTAAAAAAGTTTTCCAGTTTATGAACG 
AGGCTTCGCTAATTGAACTCCAGTCAGTTAAAACGCTGTCCGAGAAGAAGGCATTAGCCATCATTGATGTGCGACCATTTAGTGATTGGTCTGAT 
TTGCGCCAAAAGCTGGAGAGCATCAGGATGTCGGGTGATCTTTTGAATTACGCTCAAGAGCTGATTAACAAACAGAACACGGTGGCCGCTATTCT 
AAGTAAGTGCAATAACATGGTGTCGCGTCTGGAGAAGGCGATTTCCAATGGTGCTGGCATTGTGGAGCAACCTAAGCTTCTTAGCAGCGGTCTAC 
AGCTGGCTGATTATCAGATAATTGGCCTTAATTGGCTCACCGTAATGCACAAACAAGAAATGAATGGCATTTTGGCAGATGAAATGGGTCTGGGC 
AAGACAATTCAGGTTATTGCATTTCTGGCCTATCTAAAGGAGAATGGTCTTAGCCAGGCTGCACATTTAATTGTGGTGCCCTCGTCCACGCTGGA 
TAATTGGGAGGCAGAGATTAGCCGCTGGTGTCCCGAACTGGTGGTGGAGAAGTACCATGGCTCGCAGGACGAGCGAAGGAGAATGCGAGGCCGCT 
TTGCCAAGGATGGATTTACCGGCTTTGATGTTCTTCTGACCACATATCACATTGTGGGATCCACGCCGGAGGAGAGGAAAATGTTCCGGGTTTGC 
AAGTTGGACTACGTCATTTTTGATGAAGCCCATATGCTGAAGAACATGACCACGCAACGCTATGCTAACCTTATAACCATAAACGCTAGGATGCG 
AATCCTGCTCACTGGAACTCCGCTGCAGAACAATCTGCTGGAGCTAATCTCTCTGCTCTGCTTCGTGATGCCCAAATTCTTTGCCAAGAGCATTG 
AGG ACAT AAAAT C GT T GT TT GC T AAG AAGGG AAAAAGC GAT GG AGATC AG GAT GAGGTTTCACAGTTC C AGGAAAC AC AAAT C C AGAG AGC AAAG 
CGTATCATGAAGCCATTTGTACTCAGACGGCTTAAGAAAGATGTCCTTAAGAACCTGCCCAAGAAGCTAAGCCTGGTGGAAAAAGTGCCCATGAG 
CAGTCAACAGAAAATATACTATCACGAGCTGGTCGATTATTATTCCAATAACAAGGGCGAGGTGTGCAGCAGCAGCGAAAGGGCCGGCATAGCGA 
TCATGATGGAGATGCGCCGCATTGCAAACCATCCGTTGCTGATGCGTCACTACTTCACCGACGCCAATCTACGTGGATTCTCTAAGCGCCTGGCT 
AATGCCAGTTCCTTTAAAAAGACCAACGAGCAATACATCTTTGAAGAGCTAGCCGTTATGTCAGACTTTCAGGTCTATCAGATGATGAACAAGCA 
T G AGT T T T AC G AC GT G AAAAT AC C C G AT AAT T TG AT AT GC G AT T C T GG AAAAT TCCTCTACCT GG AT AC GT TGCTT CC AAAAC T G AAGGC AG AAG 
GACACCGTGTGTTGCTTTTCAGCCAGTTCACCATGATGCTGGACATTGTGGAGGAGTATCTGCGAATACGAAAGTTTGGTTTCTGTCGCCTGGAC 
GGGGCGACCGCCGTAAATGTCCGCCAGGATTTGATCACCGACTTCAATGGGGACGATAGCATCTTTGTGTTCCTACTCTCCACCAAAGCCGGTGG 
CGTTGGCATCAACCTGACGGCTGCCGACACCTGTGTCATTCACGATATCGACTTCAACCCGTACAACGATAAACAGGCCGAGGATCGGTGTCATC 
GCATGGGACAGCAGCGTCCGGTGACCATTTACCGTCTCATTTCCGAGAGTACCATCGAGGAGGGCATCCTGATGGCCGCCGAAGAGAAGCTAAAA 
CTAGAAAAGGACATTACTTCCAACGAGAAGGGCGAGGTGCACGAGCAGCGATGTGTGGTCAAGCTGTTGACCACAGCCCTTGGCTTGGACAAGGA 
TCAGGAGGAGCAGCTAAACAATTCACTAAACAACTCCATAGCCTCGCCCGCCAAATAG 
(SEQ ID NO: 665) 

Start ATG: 89 (Reverse strand: CAT) 

MSDSTVAASASASASSSAKSSLSDLRQFRINKNASSVVASPSRTERVPGKKRIQVMADSDSDGNDSQTPKKTKLELTVKEKEERYMAAAKISPHF 
DTMAIQESLSRTKWDVAASVRYLRENCKPKGHNGPLAKSKLKPRSNGISGGNFSDNDHSDDDDVKQSKDQVYDSDDSDSEMSTKMTGQRKKVFQF 
MNEASLIELQSVKTLSEKKALAIIDVRPFSDWSDLRQKLESIRMSGDLLNYAQELINKQNTVAAILSKCNNMVSRLEKAISNGAGIVEQPKLLSS 
GLQLADYQIIGLNWLTVMHKQEMKGILADEMGLGKTIQVIAFLAYLKENGLSQAAHLIVVPSSTLDNWEAEISRWCPELVVEKYHGSQDERRRMR 
GRFAKDGFTGFDVLLTTYHIVGSTPEERKMFRVCKLDYVIFDEAHMLKNMTTQRYANLITINARMRILLTGTPLQNNLLELISLLCFVMPKFFAK 
SIEDIKSLFAKKGKSDGDQDEVSQFQETQIQRAKRIMKPFVLRRLKKDVLKNLPKKLSLVEKVPMSSQQKIYYHELVDYYSNNKGEVCSSSERAG 
IAIMMEMRRIANHPLLMRHYFTDANLRGFSKRLANASSFKKTNEQYIFEELAVMSDFQVYQMMNKHEFYDVKIPDNLICDSGKFLYLDTLLPKLK 
AEGHRVLLFSQFTMMLDIVEEYLRIRKFGFCRLDGATAVNVRQDLITDFNGDDSIFVFLLSTKAGGVGINLTAADTCVIHDIDFNPYNDKQAEDR 
CHRMGQQRPVT I YRLI SE S T I EEGILMAAEEKLKLEKD I T SNEKGE VHEQRC VVKLLTT ALGLDKDQEEQLNNSLNN S I AS PAK* 
(SEQ ID NO: 666) 

Name: putative helicase 
Classification: DNA_binding 



Celera Sequence No. : 142000013384321 

GTATTTTTAGAAAACATTGCTGTAATCCTAATTGGCTGTTAGCGAATAATTTAGAAGGCATATCATAGAAAATTTTGACACTTTTATGATTTTTC 
TGCTTGTGTAAAAATGTTGTAATACCCCTGAGATCTGTATAGCATATATATTTGGGCTCTATATTTGGCTGGGGTTTCATGCTCTAGCCTCTTTT 
CTTTTTGATAATAATATATGCTAAATCAATTTGAAATCAATCGATTCAAATATTTACATATATAGTCTCTAACGCTTAAATGTATTCAATATTCT 
AGATGTTCTAATGTTCTAGAAACCGTTTCCAAAAGTCAAATCATTCTATTTAAGTTTGCTCCCAGAATTATATCTCAATCTTTATAGTCTAGGGA 
CTACGTGCTGGAATACTGGTTTGGCACTACCTTTTAAATAATATCCGAGGCCCCTGAACCAAGAAAATGTGCCAATAAATATAAATTATGAACTT 
TATGATCTGCAGAGGATGCCAAATATTTCAACATGACTTATATGTATGTACGTTCGAGAATTGGGAAAATTTATGTGTAGATACATTCTGGCCAG 
ACGCTAGATACACAAAATGAAACGCACAAATGACACAGGCTAATTACAGGTAAGCAATAGAAAAGAAAAGACAAGAAAAGCAGGCGTACTTCGTG 
TTTTTGAGTTTATTTTCGTAGGGGGCAAGCTCTCAGGCAAACTAGCGTGAAAATGCGTTCACACAAGCTTCGGCCCAAGTGAAATGAAAAATCAA 
AGAATTTTCCACACAACTGAGCAATTAACACAACAACACGGCCGCGTGAAAAGTCACTCGAAAGTCCCACTCACATTCACATAGGTGGTGGTGGT 
GGCTTATAGATATATCTCAGGGTATATCCCACTCGGATCCGATCGGCGGGGAGCCGATGGCGCCCGACGGCGACCGAAGGCTCCCGAAGGGAACC 
GAATAAATCGTTGAAATAGACGAATTCAGCTGGCAGGCCAAACCAATTCAACGAAATACTCTTTATTCGAACGACAACGGGATCTCTTGTTCAGT 
AGATCACCAGCCTTACTCCGACGTTTATTACGAATTTTCTCGGTGACTCGGTGACAACTACAAAACTGCGTGCGGTGCTGTATAGATTTTCGCTT 
GAGAGGAACACACAAATCATAATACTAGATACGCGAACCGATCGCACATTAATTTATAGAACCAAAATCGAAAATCTGACCTAGCAAACATGTCG 
TCATCGCAGGCTGTCCGCTCCTCGAAATACTCCTACCGTGCCACGTCCACCGGACCCGGTACCGCCGATGTGAACATCGAGTACATCCAGGACCT 
GAGCTCGCTCTCCCGCTTGGAGGTAAGGATCCCGTCCCGCTCCGTCTCGGAATCTCGAATTCGAATTCGTAATTGGAAGCGGACCGGTATCGTTT 
AAGAAATCGGCATCGTATTGGGCACCAAATCCCCAAAAAGACGTTGGAATTCCGACCAGGGAATAGGAATACGAAATGCGTGTGCGCCAAATTTT 
CTTCAAAATCTACGAGAATTTCTCTCTCTTGTTTCGGTGTCTGTGTAGAGACTGAAAAAGAATCGGAATATAATCAAGAAAAAATACGAAAAACA 
AACTAAGAAACCCGAAAAATTCATCTACACTCTATATCAAATGCACATATATACCGTAGATATATATTCGTGTGTTAAGTTCTCGGCGCGTTCTT 
GTGTGTGGAAATATAGAAAATTCGGTGAAGCAAAATTCTAGATTTGTATATGTATCTAAACGTGTCTGTATATACATATGCATTGCTAAAAAATT 
ATGTAATGGCCACCTCTGGGCAATACATATACACGAGAAAACATCGAAGAAAAATTTGTGAAGAAATCAATCAACTGTGCTGTAAAGTGCTTTTT 
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AATGAATGTTTTAATCAAATAATATGAAATAACGAGCCCATAAAAGATCAAACTCAATAAATGTATCCACCATTTTCTTGCAGGACAAGATCCGT 

TTGCTCCAAGATGATCTTGAAGTTGAACGTGAGTTGCGCCAAAGGGTGAGTCATTCAATATGGAAATATTCATAATTTTAAATATTTTTAAGTTA 

TTTTTGAACCGAGTATTTTTAAGGTATTACTATAGGTTTCTTTGGAATATTTTTGAACATTCTTAATTTATTGAGAGCATCGTTTTTAGTCAATA 

AGATTATTTGCTGATGAAATCGGCGTCAGAGATATGACATATCCGAGGAAGTTAGAAAAGATCAGACCGCATTCTCCGAGACCCAATGGAAATCA 

AAGCGCCAGGGAAAACCATTTTTACTTTTTTCACCTCGCAAATGTCACCCTAAAAACAGCAAAAATATGTGGAAAGCCAAAATAAAATCAAAATA 

GACCAAAGTAGCAGTAGCGATAGAACCGATAGAAAAGTCAGCAAACAAATGCAAATCTAAGAATAAACTACCCTTCGTAGAGTTTCGAAAGTGTG 

AAAGTAAATGATAATGATGTGCATTGAACTTCAATCGCTTTCGTTTTTGGCTTTTCCTTCCCTGCGGAGCATTGATAAATGATTTTTCCCCTGAC 

TTTGCAAAGACAGCGGCCGCAAAGACAGCAAAGTCAGGAACAGCCAGAGAAAAGTGGGAAAATTCGCAGGCCGAAAGTGCACTTCATTCAGCGCA 

ATCCTCTTGGGCGTTACAAATTTAGAAAATGCGAAGAGCTCTTGCCGGAATGGAATGGGCTGGCCAGGAATGTTCTGAAAATTTTGTTAAAAAAT 

CAAATTTAATTACTTTGTGCCTTATGGATTAGCACGCACTCGCACACAGATACAACGCCCCCCAACACACACTCACAATATATGGAACAAAATGG 

CTAAAATTAGCGCACGACATTCAATACAGAGAAGCGAATGATTAACTTGAAAACAACCAAAATATGAGCAGAGAAGAAAAGAAAATAAAGTAGCC 

AAAAAGTCATCATCATCACCATCTTCATGATCAGCGGCGGCAGAGAGGAAAATTCGCTCGCTCACTACGGGAAAATCTTCGCTTTTTGTCTGTGG 

GCGTGTGTCTGTGGGAAAATTAAATAATTTTCATGCACATTAAAAGTCTTCACACCGCCTGGCATTCTTTCTACATATCCACATATGCATATATA 

GGCGGCAAAAACATGTATCTACAAGATACATATGGCGCAAATGGATTTACGGCCCGTACTTACGTGGCCTTTGTGGCAATCGGCAACTTTTTGTT 

AT GC C AAAT G AAC GG AC AAAC AC AT GT T C G C C TAGGT AT TAT C AT AAAC AT AAAT G T AC AAAC AT AT T C AGC G AGAG AT T T AAAAG AGAT T T AT C 

AGACCAATACGTTCTTTATTTAAGCTTGGGGCATTTCAGTTAGCTGAATTTTTTATGATTGCTTCGGACAGCTCCAAATACCAGAAATCCCAATG 

GCCGCACATAGCTTTCATAAAATAGCATTTAATTGCAATTCGCAAAATTGACCAAACGAGGTCGTCGACAAAGGCGAAAAGCAAATGTTTAGATT 

CCGAGGGAGCCAACAAAAAATGTTTCTGTTCAAGCGAAAAATGTTCGAATGCCTTTTTGGAAATGCAATCTAGGAAAGCGGTGTGCGGCCGCCCA 

CAAAAGATATATACGCATATATAGACGCATATATCGTGGATGCAATCGGACTATGGCTACACCTCTGCGTGGAAGATTTCTTGGCAAGATGCCAT 

TGTTATTTTAGGTGTTGCATGTGCATAATTATTTTTGACTGATTAATGTGATTTATAATTTTTATTATGGGCTGTTTGAGTAGCCACAGAGCATG 

GCTAGCGCAGGGTATTTATAGATCCTTCACATTATACCAAGTCGCTAATGAAATCTCTTCTTAATCTCTGCTTATATTCTTCAAATTTCAGATCG 

A GC GC G AG AAAGC C GAT C T C AGC GT T C AG GT CAT T CAGATGT C C GAGC GTCT C GAGG AGGC C G AGGG T GG T GC T G AAC AT C AAGT GAGT AC CAT G 

TGGTAAACCAATATTGCAATCCAGCCATAACTGATGTATCTTCCAATCCCAGTTTGAAGCCAACCGCAAGCGCGACGCTGAGCTGCTGAAGCTGC 

GCAAGCTGCTTGAGGATGTTCATCTTGAGTCGGAGGAGACCACTTTGCTTTTGAAGAAAAAGCACAACGAAATTATCACGGATTTCCAGGAACAG 

GTTGAAATCCTAACGAAAAACAAGGCCAGGTAGGCATTGCAACAGGCTCTGGCCATCGCCACCCGACGTCCCCATTTCCATTCCATTGACCCAGA 

AACCATACCATATCCAAAACATAAACATCTCCATCCCGAGCTGTCATGCATCCGCAGAACATATTTATTTGCCCCCAATGATGATGATCAAGTGG 

CGTCAGCTGGCTCTTGCGGACATTTCAATGTCTCGAGATCTCGTGCAGTAACTAACCAATACAATGCCTAACCTTGCGCTGCCCCGTTGACCCAA 

AAAAAAAAAAAAAAACAGAGAAAATCGAAGCAGATCATAAACCATTCATCAAAATGCCATGACATGCGAAATGCAAAGCGAAAAAAATC 

CATAAACCTGCATGGCTTGCAAGTTCTATTGGCACAAATCGCGGCCGAAAGGCCAAACGAATGAGAAAAAAATTCGAGAAACTTTCGGTTGCTGG 

CCGCCTCTCATATTTCCAAATCGATGCGAGCGATTTCCAAAAATAAATTCCTAGGAATTTTGAAATCGCAACAATTCAAGGCTATCAAAAGATAA 

CGAGTCATTTAGATTAATATGCATTCGGATAAATCAAAATCAACTCATATCCTAAGATTAAATCAAGTTCAAAGTTGAGTTTCTTTACCGTTCTT 

TAAGATTTCCACCTTTAAGATAGGTGCGCCCGAGTTTGCGCACTTCCGTTCGTAGACAAGGACAGAAGGTCGGAAGATCAAGCAGCCGTCCCACT 

C T AT C G AAAAAG CT T T GGAAAGC T TT C GC AC AGC G AAACT T T C T T C C AT AT GTT GT G CCC AC C ACC CC AT AC CAT AT T T T CT T T CT G AGCG GAAA 

TTTATTTATAGAAATTTTGAAATTTTCATCCACATTCTTGAACAGAGAGGCTCTTTGTTGCTGTTGCCATTCCGCTCCCACCGCCTCCCTGCATC 

ACTCCCTCTCGCTCGCACCAACGCTCCAACCCAAATATGTCTTTGATTGAATCATATAGAAAATGTGTTTATTGTTTTTGTTGCTGTCCCGCTCC 

GACCCATCGCGTATTATGGAAATCCGATACGTTTTTTCGCTGGCTTTTCAGCTTGTGGGTGGGAAAGGTCGGTTGCACCGATTCTAGCCCCCTGA 

CAGGTGGAAAAGCTGGCAAATGCCACGAATTGCACACGGAAAATTCGATCGACAATCAAGGCGATACATAATGGGATATTACTTTGATCGATTCA 

GAGCACACTGGACTTAATATATGACCACTATGATTAGAAAAGATTACAATGCGCCTTAGGTACAAAATTCACATATCTATAAGTAACATTCTCAA 

ATTTACCAAACGTATCAAATTGATTTACACTGATCGGAGTAATTCATATTACATATCCTATCTATTCGTCCCAGCTTATGCATATTTATTTTACA 

ATCAATTATCATAGCTTGCTTCCAGGCAGACCCAACATGATCTTAACCCTTGGCCTGCTGAGCCTTCGACCACCGTTCGATAATCCTACGATAAA 

CTTTCGAACGGTGGCCGAAAGCTCGCAGTTCCACTCAAGCTTGCTAGCGACTCACAGCCTGGCCGATCTCACAAACCATGACTAAGTATCGGGAT 

TGTGATCATAGACCCCAAAAACTAAAGTGGAGAACCCTCTTCTCCCTATTTTTAACCGAATGCTTGCAGAGCCGAAAAGGACAAGGCCAAATTCC 

AGACCGAAGTCTACGAGCTGCTCTCCCAGATCGAGTCCT AC AAC AAGGAGAAGATCGTGTCGGAGAAGCACATCTCC AAACT GGAGGTCTCGATC 

TCCGAGCTCAACGTGAAGATCGAGGAGCTTAACCGCACCGTTATCGACATTAGCTCCCACCGATCCCGCCTCTCGCAGGAGAACATTGAGCTGAC 

CAAGGACGTCCAAGATCTGAAGGTTCAGCTGGACACGGTCTCGTTCTCCAAGAGCCAGGTCATTTCCCAGCTGGAGGACGCCCGCCGTCGCTTGG 

AGGACGAGGACCGTCGTCGCTCGCTGCTCGAATCCTCTCTGCATCAGGTTGAGATCGAATTGGACTCGGTTCGTAATCAACTCGAGGAGGAGTCC 

GAGGCCCGCATCGACTTGG AAC GTCAGTTGGTCAAGGCCAATGCC GAT GCCACCTCCTGGCAGAAC AAGT GGAACTCCGAGGTGGCTGCCCGCGC 

CGAGGAGGTCGAGGAGATCCGTCGCAAGTACCAGGTCCGCATCACCGAGCTGGAGGAGCACATCGAGTCCCTCATCGTCAAGGTCAACAATCTGG 

AGAAGATGAAGACGCGCCTGGCCAGCGAAGTGGAGGTGCTCATCATCGATCTGGAGAAGTCGAACAACAGCTGCCGCGAGCTGACCAAGTCGGTC 

AACACCCTCGAGAAGCACAACGTTGAGCTGAAGAGCCGTCTGGACGAGACCATCATCCTGTACGAGACCAGCCAGCGCGACCTGAAGAACAAGCA 

CGCCGACCTCGTCCGCACTGTCCACGAACTGGACAAGGTCAAGGACAACAACAACCAGCTGACCCGCGAGAACAAGAAACTGGGAGGTGCGTTAC 

CATACTAAAAGACTTTCACAATTGTATATTGTAGTTATCATGATATAATATTTAATAATAACCTATTATTAACCAGATGATCTTCATGAGGCCAA 

GGGTGCCATCAACGAACTGAACCGTCGTCTGCACGAATTGGAACTGGAGCTGCGTCGTCTGGAGAACGAGCGTGATGAGCTGACCGCTGCCTACA 

AGGAAGCCGAGGCTGTAAGTAATCACAAAAAAAAAAAAAAACATACACATAACATTTTTAGTTATAATCCTAATAATTTGTGTAATATTATTTTA 

TAGGGCCGCAAGGCTGAGGAGCAGCGCGGACAGCGCCTGGCCGCCGACTTCAACCAGTACCGCCACGACGCCGAGCGTCGTCTGGCCGAGAAGGA 

TGAGGAGATCGAGGCCATTCGGTAAGTTACTGACCAGGGATTGGTAACTAGAGTTAACCCCCTTTTTGGAAGTAGCCCCACGCCACTGCACCACG 

AGTG AC AGCG AGAACAGACGAAGGTTTTCGTTTCCCCTGCCAAAGCGGAAGT CCC AAAAAAAAT AAAC ACC CCTCAAGGCCCCAAGGGGGAAATT 

ATTCGATTGATTCGCAATCGCTTTCGCAATTGCCAACACAATTGATAGTGAACTGACCCATTTAGCAATTAGCAATCAGTTTGAGTTCTTTAGTC 

GCACAAAAAGCAACGGCATTAGTTTTCTATTCAGGAAGCCAGGCAAAAGATCCTGATCCCACTGAAGGTTCAAAGGCTGAGCTCAAGAAGTGGGT 

TATATCCATCTGGTTTTCAAAACCATACCGATTTTTTAAAAATCCGTATCTAATTCTCTGCAGAAAATTGCGCAGTTGCTGTGATTACACAATCA 

AAGTGGGTTCATAGTTGAAAATCGAAAGAACCCATTGAGAAAACTTTCGATTTGAAGAGTGGCTCTCCAGCAAGTGGTAGGTTTTTGGATCAATC 

ATCAATCGAGTTGGTGGTCAGTGTTAGTCTGTGATTGAAATGATCTTATGTCTTAAGCAGAGATCAGTTCTAGCTTTCTTAAAAACAGATAAGTT 

AGGTGTGCATAATGCTTGATCTTAATAGCTATTGTTGATTTTGTAAAATTTTTACACACGTTTCCGGTAATTAAATGATATTGTGAAATCGATAG 

AAGTATCTATCTGAACTTGTACAAATTAAAATCGGTTGATTTGGAATGTTCAAATAACTTTACATAAATAGATACAATAACTGAATAGAAAATTG 

ATTATATGCACTTTTAGACACTTTCCCTAGTTCCTTAGAGTCTTCCATGACTATTGCATAAAATGTCTTGCTATTATGACATTGTCTATGAGCAC 

TTTGCCAAAATGAATCAATCATAGCCGCACACATTGACTGCCAACAAATGATGATAATAATGATGATGGTGATGATGATGTTATTGTCGTCATTA 

TGCTCATACCTAGCCCGATGCCAACAGCTGAAAGAAACGAAATGCAATCTTTTTATAGATTTCGACAAATTTGAAAATTTATTTGCAAAGCAGCA 

GCAACTGCCACTGCTGCCAACCGAGGAGAACCCAAGAAACCCGAAGCTACAGAGGCGACAAGGCGACACACAGAGGCGGCGACAATTGTAGCTTT 

AATTAGAGTACTCTTAAGAAAACACATTAGGCCATTAGACCATTAGACCATCGCAACCCCCTTGAGAAATCCGCCCCCCCTTCACAAAGATCCCC 

TTCCTAAACCATGGCCCTAGCCCTAAAACAAAGACCTTCTCGCTTCCGTCCGACGACGACAACGTATGAGGATAACTATGGGTACACCATGAACT 

TCTACCAGCCCATGCTGGACTATCTGGACGCCAAGGCCAAGGGTCTGGAGGTCAAGAAGCCGCACCTGCCGTGGGTCAGCGAGAGGGGACTGAAG 
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CAGTACCGCCCCTCGAATGCGGTGCGACAATATAACGCCGACGAGATCGTCCGATTGTCGCGCACCTGCGCCGCTCGGGCCGACGAGATATTGCT 
TAACTTTCGGGCCCAGAAACGCTCGCCGTTCAGTGTAC71AAAGTTGGTCGACGCATCGCGAGTGACCAAGCACCTTGAACCGGACACAGTTGTCG 
AAAGATCGCGCCAGCGCCGCCGTCGCCGTCAGGAGGAGCTCGAGGATTTGATCAAACGCGACACTCTTAAGATATTGCAGCGTATCCGTAAGATT 
GAATTGGATAACGAACTCGACAAGATGTCCGACGATTTCAAGCGCTCGATTCGCGGCAAATCCGCCAGTGCCATCGCCCAGGCGCTGCTCTCCGA 
ATCGGAGAAGAACATCAAGACGGCCAAGAAGGAGGAGGAGGACTATATCGCCCAAACTTTAGTGCGCTCGAGTCGTGCAGTTTCGCGTGCCCGTT 
CGCGTTCCTCCTCCCCCCTCGATGGCCAGTATCGCGCCCATGCGCTCCACATCGAGCTCATGGACGACCGGCTGGTGGACAAGCTGGACCATCGC 
GTCTCCTCGTCCCTTCACAATGTGAAGCGCCAATTGTCGACACTGAATCAGCGCACTGTGGAGTTCTACGCGGACAGCAGGTGCGGTATTGAGGA 
TAAGTGCTGGATCTGTCGCAAGGTCTTGCGCACCCATCCCAAGCTCAAGGACTATAGCTACGTGTATTGGAAGCACTAGGTCCAGGTCGATGTCC 
CTAATCCCAATTCCTTGCCCAATTCCAGACCCTCTTTTCGCTCATAGCTTCATTCGATATCTTAGATTTCAAGCACAAGGTAACTTAATCTTTTT 
TGACCATCCGAAATGTATCTCAAATGTATACCAAAGGATATATTTGTGTTACTGCACACATCGAAGAGGAGAGAAAACCTTTTAAAAATCAAATA 
TATGATTTTTAGAGGCCTCTATTTTGCTTACTGTTCACTACACGAAACGGTTCTTTACGAATACTAAAAGACATGGAGTGGTTAAATATATTTAT 
CTCGAGATATTAAAAGATCCATCTTCAGAAAGCCAAAAAAATGTGTTAATATCAAGTGGATCGAACTGCGTTTCTCTGTGTACTTTCCCATCCGG 
GCAATTTGAATAAATGCGAACATTTTTGAATTGCAGCTCAATTTTTTGTCTGAGCATCCTAGCATCCACCTCCCCCGACCCGGCAATTTGGCATC 
TCTATTTGTCTGTTTGTCGTTTCTCGTCTGGCACAAAATGTATTTTTGTTTCAAGAGTTTTTGCAAATCGCGTTCTTGTTTTGTTCCTCTAGCAT 
TGTGTCTGTTGTGCAGTCTTCTGAAAATTATTTATACAGAATTATGCGCAGCATATTTGACCACGGCTAATTGCAGAAAATCGCCAGCAATTTGG 
TCATTAATTAGAAACTTTGTGGGTGTTTCTTATGCTTCTTGTGTTGGGCATCTAAACAAGCCTTTGAGTGGGAAATATCTTTGTTTTAATTGTGT 
AT AT AAT C AAC AC T CT T AT C T GC C GAAT AT GT C AAG T T AGT AT AAT CC C T AAAT GC C AAGT T AAT C AG AAT GC AC AT AT ATT CATCACTTTCCCC 
TGAAATAACGACTTCTTAGCCCAGTCCTATCATATTTCGCACCCGCACCTTAGAAATTTCGAGGCTGTCAGTCCGATTTGCGCGTAGATGGTCCG 
ACCAACTATTTCTTTGGGTTTGTCCTTCAAATCGAGATTTATTTTATTTTCAACTGAAAAGAAAACTGAGGCGCATGACAGCAAAAACAATCACA 
AG AAG AGG T GGC G ACAAAAAAAAAAAAAAC A GAAC AAC AT T AG AC AC AGC AAAT AT T T T T G C GT AAAC AT T G AAAC AAT GGCGGT G AAAAATT AA 
AATCTAAATGAGAATGCTTAAGCTTTCTACCTAAAACCAAATACCTTGGCAAAACGACAGCAATGGGCACACACACATGATTTTCAGATTAAGTA 
AC C AAC AC AGGG GC T T T T AT AAGAAAACT CAACT AAAC AAGAAT AAT C AT AGGAAT T AGG AAAT ACTG AC T GAT G AGGG ATT AT AAT AT T GTT AT 
AT AAC C AGG G AAAAG GAAAT GAT T CT GC AAAT T AAGC ATC AT T AAG AT AC T C CT T C C AAAC T C AAAAAT T GT TAG AT AC T GG AAAT C C AT T AGC T 
TCCTATGTTTTCGATGGTAATATATTCTATCACCACCTTGTAAAACGACCTATCGCCACTGAAATAGAATTCAGTAGTCATGTTCACTTATCACT 
AATTGCAACAGGTACACGACTTTGAATTAAATTAGAAAGTGAATCAGGTAGAGTATCAGAAGTGCAAGCATAACTCAAATTTAGTTGGCATCGCT 
GCACCTTCGCCCAAGAATAGACGCATGATGCACTTACCCCAAAAACAAAAACAAAAATGCATTTTAGGGGGAGAAGGAACTAGTCGGCAAATAGC 
TTGGCATGTATGTCCCAACTATTTGTGACAAGTTCGATGTCTTTATCATGCGTAAAAATACACCGAAATTCGCGAATGCAGCAGCCAGCGAAACA 
ATGAAACATTTCTGACAATGCCAGTGCACCTGAGTTGAGTGGTTGAGTGCTTGGTGGTTGAGTGCCCAACTGCTGAGTGGTGCGAAAAATGTAAA 
ATTGTGGGTCAGGCATTGGGGCCAAAAATAGAGCCAGCAATGCGAATCGACGGAGGGTCATTGTGTGGTAAACTGGGGCTACTTCTCGGTATCAA 
TTCAATTGATAAATGGTGGAAAAATATGTCAAAGTTCTAATGAATGCATTCAATCCGTAGCAAGCAGACCTCGATCGAGATCGAGCAGCTCAATG 
CCCGCGTCATCGAGGCGGAGACCCGGCTGAAGACCGAGGTGACCCGCATCAAGAAGAAGCTGCAGATCCAGATCACCGAGCTGGAGATGTCGCTG 
GATGTGGCCAACAAGACCAACATCGATCTGCAGAAGGTGATCAAGAAGCAGTCGCTGCAGCTGACCGAACTGCAGGCCCACTACGAGGATGTCCA 
GCGCCAGTTGCAGGCCACCTTGGACCAGTATGCCGTTGCCCAGCGCCGTCTGGCCGGTCTCAATGGCGAGCTGGAGGAGGTGCGCTCGCACCTGG 
ACAGCGCCAACCGTGCCAAGCGCACCGTGGAGCTGCAGTACGAGGAGGCTGCCTCCCGCATCAACGAACTGACCACCGCCAATGTCAGCCTCGTC 
TCCATCAAGTCCAAGCTGGAGCAGGAGCTCTCCGTGGTCGCCTCCGACTACGAGGAGGTGTCCAAGGAGCTGCGCATCAGCGACGAGCGCTACCA 
GAAGGTCCAGGTGGAGCTCAAGCATGTCGTCGAGCAGGTGCATGAGGAGCAGGAGCGCATCGTTAAGCTGGAGACCATCAAGAAGTCCTTGGAAG 
TCGAAGTCAAGGTGGGTGTTTGGGTTTTGGGACATCTATCTAACTAATATTTCATAGGGATAAAACTTACTACCTATCTGGTACTATATGACTCA 
ACGAACCTTGCAATCTCTCCCGCAGAACTTGTCCATCCGCTTGGAGGAGGTCGAGCTGAACGCCGTCGCCGGCAGCAAGCGCATCATCAGCAAGC 
TGGAAGCCCGCATCCGCGATCTGGAGCTGGAGCTGGAGGAGGAGAAGCGCCGCCACGCCGAGACCATCAAGATTCTGCGCAAGAAGGAGCGCACC 
GTCAAGGAGGTGCTGGTGCAGTGCGAGGAGGACCAGAAGAACCTCATCCTGCTGCAGGACGCGCTGGACAAGTCCACTGCCAAGATCAACATCTA 
CCGCCGCCAGCTGTCCGAGCAGGAGGGTGTCTCCCAGCAGACCACCACCCGTGTGCGCCGCTTCCAGCGCGAGCTGGAGGCTGCCGAGGATCGCG 
CCGACACCGCCGAGTCCAGCCTGAACATCATCCGCGCCAAGCACCGCACATTCGTCACCACCTCGACGGTGCCCGGATCCCAGGTGTACATCCAG 
GAGACCACAAGGACGATCACCGAATAGAGCCCCACGCTCCGCACCGAGATCATTTCATTATCAAACGCTAACCGAAGAACCGAACACAACCAATC 
CCGAACCCATGAAGAAGACCATCTGGCGGAGGGCAGAGTTCAGAGTGCAGGACGAGAACGAGAATTGCGTGAGAAAGTTTTTATAATTTAAATTT 
TTTTGTCAAACCCCCAAAACGCCGCGGCAGCGATTAGAACCCAGTGGACTTCGGCGGGGGCTCGTATAGGCTACACACTATAAATACATAACACT 
AC AC ATCCT ATCT ATAT AATAAT AAT ACT ATT T TGAAGTT C AT GT AATTT AAT T AAAATAAT GAAC CTGAAAAACCT AAAAAC AAAAT AAAAAC A 
AACCAACGAAAACTTGGCAGAGTTTTGCTAGCGCTAGAGATATGATATATACCATATACCATATACCACATACGATATATAATACTTGTACGTAG 
AGAGATGAGAGAGATCAACCGAGTTGGCTGAACAATAACGTTTAATGGAAATGGAGCCCATTTGAACGAAAAGCACTAAAACTTAATAACTTGAA 
ATGGAAATGATAATCGTCACCGAAATATCCCACTAAGCACGCCACCGCAGCATCGAGTTAACCGAGTTACAACCGCTTCTACTATTTGTTTTTAG 
TTCACTTTATGTTTAGTAGGTAAATTTCTGTGTGCTCGTAACGAATTTGAATTTTACATTAAATCAGCATCAACAAAACTGAGCAATTTTCACAT 
TTTTTACTTTGTATCTTGCAATAAAAACGAAATTCTATTCATAATAATAACAATAAGAAATTGTAAGTGAAATTCGGTTTTACTAATGGGGGGGT 
CTTCAAGATTTCAAGAAATGGTGTAGCTGCGAACCTTAGTGGACTTCAAGTAATGAAGTTTGGCATAGCCAACTCAAAGTTTTTGAAGTTTTTTT 
TTTAGCCATCTAAAGTTTCATGGCATTCAAAAATAGGCAGAAGTTGTCGGAGTAAACAAACATTCTCATTTGCAAATAAAGATTCGCTCAAGTTT 
GGTTTCCCAGCCCAATAAATGAATTTCCATTGATTTATATTGCTTTAAGCTATCACTATTTATTAACACTTTTGGAGCTATACAATCCAAGCTCT 
TAGATGGCCACAACACCCAGTTTGGCGTCGGTCTTCAGGTACGCATTCTCGCCGTAAGATCCGCCGTATTGGCTCTTCTTGAAGACTGAAAGGTA 
GAGAAGATACAAAAGAAGGTAAGTCTGGGTGGCAAAGCGGTGCTATGAAGTCAATCCAAAATCATAGGATAAACAATATTATCCAGTTAAGAACT 
CCATGTTGTAGCTTAGTTTAGGAACCCACCTGGATCGACCTCAGCCACGAAGGAGCGTGGTGGCAGAGCAGCAGCGTATAGCGGGTCCAGAGGGC 
CGTAGGGCTTGGCCAAAAGTGGGCGACCCTGCTCCCATCCATAACGGTTGAGGGAATTCCAGTCAGCACCGTCCAGCTTGGCGGCGGAGGCAGCT 
GCTCCGGCAGCGTTGGTCAGGGCAGAAGCCTGAGCCTCGGCGGTCTCAGCGGCCTTAACCACATCGCCGTAGCTGGCTCCGTATCCGGATCCGCC 
GTAGTTGGGGCGATTGGCCACAGCCAGGGCGACGCAGCAGGCCATCAGGCAGAGAAGACGTAGGGTGGCGGACATTTTTTTTTGGTGGTGGTGCG 
AGGAACGGACTGTTTCGAAAACTAATTAACCAGAGCGTGTGTCCAAATTGCTTTTATAGCAACACCATTTGAGCTGCTCCGCGATTTGTGAGCTC 
ACCAGACGTGACCAAACC 
{SEQ ID NO: 667) 

Exon: 1001 . .1352 

Exon: 1984.. 2040 

Exon: 3892.. 3978 

Exon: 4043.. 4209 

Exon: 5865 . . 6736 

Exon: 6822 . .6949 

Exon: 7034.. 7146 
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Exon: 11461.. 12076 
Exon: 12186.. 13458 
Start ATG: 1230 

Transcript No. : CT18593 

ACGAAATACTCTTTATTCGAACGACAACGGGATCTCTTGTTCAGTAGATCACCAGCCTTACTCCGACGTTTATTACGAATTTTCTCGGTGACTCG 
GTGACAACTACAAAACTGCGTGCGGTGCTGTATAGATTTTCGCTTGAGAGGAACACACAAATCATAATACTAGATACGCGAACCGATCGCACATT 
AATTTATAGAACCAAAATCGAAAATCTGACCTAGCAAACATGTCGTCATCGCAGGCTGTCCGCTCCTCGAAATACTCCTACCGTGCCACGTCCAC 
CGGACCCGGTACCGCCGATGTGAACATCGAGTACATCCAGGACCTGAGCTCGCTCTCCCGCTTGGAGGACAAGATCCGTTTGCTCCAAGATGATC 
TTGAAGTTGAACGTGAGTTGCGCCAAAGGATCGAGCGCGAGAAAGCCGATCTCAGCGTTCAGGTCATTCAGATGTCCGAGCGTCTCGAGGAGGCC 
GAGGGTGGTGCTGAACATCAATTTGAAGCCAACCGCAAGCGCGACGCTGAGCTGCTGAAGCTGCGCAAGCTGCTTGAGGATGTTCATCTTGAGTC 
GGAGGAGACCACTTTGCTTTTGAAGAAAAAGCACAACGAAATTATCACGGATTTCCAGGAACAGGTTGAAATCCTAACGAAAAACAAGGCCAGAG 
C C G AAAAG G AC AAG GC C AAAT T C C AGAC C G AAGT C T AC G AGC T GC T C T C C C AG AT C G AGT C C T AC AAC AAG G AG AAG AT C GT GT C GG AG AAGC AC 
ATCTCCAAACTGGAGGTCTCGATCTCCGAGCTCAACGTGAAGATCGAGGAGCTTAACCGCACCGTTATCGACATTAGCTCCCACCGATCCCGCCT 
CTCGCAGGAGAACATTGAGCTGACCAAGGACGTCCAAGATCTGAAGGTTCAGCTGGACACGGTCTCGTTCTCCAAGAGCCAGGTCATTTCCCAGC 
TGGAGGACGCCCGCCGTCGCTTGGAGGACGAGGACCGTCGTCGCTCGCTGCTCGAATCCTCTCTGCATCAGGTTGAGATCGAATTGGACTCGGTT 
CGTAATCAACTCGAGGAGGAGTCCGAGGCCCGCATCGACTTGGAACGTCAGTTGGTCAAGGCCAATGCCGATGCCACCTCCTGGCAGAACAAGTG 
GAACTCCGAGGTGGCTGCCCGCGCCGAGGAGGTCGAGGAGATCCGTCGCAAGTACCAGGTCCGCATCACCGAGCTGGAGGAGCACATCGAGTCCC 
TCATCGTCAAGGTCAACAATCTGGAGAAGATGAAGACGCGCCTGGCCAGCGAAGTGGAGGTGCTCATCATCGATCTGGAGAAGTCGAACAACAGC 
TGCCGCGAGCTGACCAAGTCGGTCAACACCCTCGAGAAGCACAACGTTGAGCTGAAGAGCCGTCTGGACGAGACCATCATCCTGTACGAGACCAG 
CCAGCGCGACCTGAAGAACAAGCACGCCGACCTCGTCCGCACTGTCCACGAACTGGACAAGGTCAAGGACAACAACAACCAGCTGACCCGCGAGA 
ACAAGAAACTGGGAGATGATCTTCATGAGGCCAAGGGTGCCATCAACGAACTGAACCGTCGTCTGCACGAATTGGAACTGGAGCTGCGTCGTCTG 
GAGAACGAGCGTGATGAGCTGACCGCTGCCTACAAGGAAGCCGAGGCTGGCCGCAAGGCTGAGGAGCAGCGCGGACAGCGCCTGGCCGCCGACTT 
CAACCAGTACCGCCACGACGCCGAGCGTCGTCTGGCCGAGAAGGATGAGGAGATCGAGGCCATTCGCAAGCAGACCTCGATCGAGATCGAGCAGC 
TCAATGCCCGCGTCATCGAGGCGGAGACCCGGCTGAAGACCGAGGTGACCCGCATCAAGAAGAAGCTGCAGATCCAGATCACCGAGCTGGAGATG 
TCGCTGGATGTGGCCAACAAGACCAACATCGATCTGCAGAAGGTGATCAAGAAGCAGTCGCTGCAGCTGACCGAACTGCAGGCCCACTACGAGGA 
TGTCCAGCGCCAGTTGCAGGCCACCTTGGACCAGTATGCCGTTGCCCAGCGCCGTCTGGCCGGTCTCAATGGCGAGCTGGAGGAGGTGCGCTCGC 
ACCTGGACAGCGCCAACCGTGCCAAGCGCACCGTGGAGCTGCAGTACGAGGAGGCTGCCTCCCGCATCAACGAACTGACCACCGCCAATGTCAGC 
CT CGT CT C CAT C AAGT C C AAG C T G GAG C AG GAG CTCTCCGTGGTCGCCTCC G ACT AC G AGG AGG T GTC C AAGG AGC T GC GC AT C AGC G ACG AGC G 
CTACCAGAAGGTCCAGGTGGAGCTCAAGCATGTCGTCGAGCAGGTGCATGAGGAGCAGGAGCGCATCGTTAAGCTGGAGACCATCAAGAAGTCCT 
TGGAAGTCGAAGTCAAGAACTTGTCCATCCGCTTGGAGGAGGTCGAGCTGAACGCCGTCGCCGGCAGCAAGCGCATCATCAGCAAGCTGGAAGCC 
CGCATCCGCGATCTGGAGCTGGAGCTGGAGGAGGAGAAGCGCCGCCACGCCGAGACCATCAAGATTCTGCGCAAGAAGGAGCGCACCGTCAAGGA 
GGTGCTGGTGCAGTGCGAGGAGGACCAGAAGAACCTCATCCTGCTGCAGGACGCGCTGGACAAGTCCACTGCCAAGATCAACATCTACCGCCGCC 
AGCTGTCCGAGCAGGAGGGTGTCTCCCAGCAGACCACCACCCGTGTGCGCCGCTTCCAGCGCGAGCTGGAGGCTGCCGAGGATCGCGCCGACACC 
GCCGAGTCCAGCCTGAACATCATCCGCGCCAAGCACCGCACATTCGTCACCACCTCGACGGTGCCCGGATCCCAGGTGTACATCCAGGAGACCAC 
AAGGACGATCACCGAATAGAGCCCCACGCTCCGCACCGAGATCATTTCATTATCAAACGCTAACCGAAGAACCGAACACAACCAATCCCGAACCC 
ATGAAGAAGACCATCTGGCGGAGGGCAGAGTTCAGAGTGCAGGACGAGAACGAGAATTGCGTGAGAAAGTTTTTATAATTTAAATTTTTTTGTCA 
AACCCCCAAAACGCCGCGGCAGCGATTAGAACCCAGTGGACTTCGGCGGGGGCTCGTATAGGCTACACACTATAAATACATAACACTACACATCC 
T ATCT AT AT AAT AAT AAT ACT AT TTT GAAGTTCATGT AAT TTAATT AAAAT AAT GAACCT GAAAAACCTAAAAAC AAAAT AAAAAC AAACCAACG 
AAAAC T T G GC AG AG T T T T GCT AG C GC T AGAGAT AT GAT AT AT AC CAT AT AC CAT AT AC C AC AT AC G AT AT AT AAT AC T T GT AC GT AG AG AGAT G A 
GAGAGATCAACCGAGTTGGCTGAACAATAACGTTTAATGGAAATGGAGCCCATTTGAACGAAAAGCACTAAAACTTAATAACTTGAAATGGAAAT 
GATAATCGTCACCGAAATATCCCACTAAGCACGCCACCGCAGCATCGAGTTAACCGAGTTACAACCGCTTCTACTATTTGTTTTTAGTTCACTTT 
ATGTTTAGTAGGTAAATTTCTGTGTGCTCGTAACGAATTTGAATTTTACATTAAATCAGCATCAACAAAACTGAGCAATTTTCACATTTTTTACT 
TTGTATCTTGCAATAAAAACGAAATTCTATTCATAATAATAACAATAAGAAATTG 
(SEQ ID NO: 668) 

Start ATG: 230 

MSSSQAVRSSKYSYRATSTGPGTADVNIEYIQDLSSLSRLEDKIRLLQDDLEVERELRQRIEREKADLSVQVIQMSERLEEAEGGAEHQFEANRK 
RDAELLKLRKLLEDVHLESEETTLLLKKKHNEIITDFQEQVEILTKMKARAEKDKAKFQTEVYELLSQIESYNKEKIVSEKHISKLEVSISELNV 
KIEELNRTVIDISSHRSRLSQENIELTKDVQDLKVQLDTVSFSKSQVISQLEDARRRLEDEDRRRSLLESSLHQVEIELDSVRNQLEEESEARID 
LERQLVKANADATSWQNKWNSEVAARAEEVEEIRRKYQVRITELEEHIESLIVKVNNLEKMKTRLASEVEVLIIDLEKSNNSCRELTKSVNTLEK 
HNVELKSRLDETIILYETSQRDLKNKHADLVRTVHELDKVKDNKNQLTRENKKLGDDLHEAKGAINELNRRLHELELELRRLENERDELTAAYKE 
AEAGRKAEEQRGQRLAADFNQYRHDAERRLAEKDEEIEAIRKQTSIEIEQLNARVIEAETRLKTEVTRI KKKLQIQITELEMSLDVANKTNIDLQ 
KVIKKQSLQLTELQAHYEDVQRQLQATLDQYAVAQRRLAGLNGELEEVRSHLDSANRAKRTVELQYEEAASRINELTTANVSLVSIKSKLEQELS 
VVASDYEEVSKELRISDERYQKVQVELKHVVEQVHEEQERIVKLETIKKSLEVEVKNLSIRLEEVELNAVAGSKRIISKLEARIRDLELELEEEK 
RRHAETIKILRKKERTVKEVLVQCEEDQKNLILLQDALDKSTAKINIYRRQLSEQEGVSQQTTTRVRRFQRELEAAEDRADTAESSLNIIRAKHR 
TFVTTSTVPGSQVYIQETTRTITE* 
(SEQ ID NO: 669) 

Name: PARAMYOSIN, LONG FORM 
Classification: motor_protein 
Gene Symbol: Prm 
FlyBase ID: FBgn0003149 



Celera Sequence No. : 142000013384548 

ACATAATTTCCTGCACCATCGAATTGGCTACTCTATTGTAATTGGCTGAGCTCCACAAAACTTCTTTATGCATAACATTCGAGCGTGAATTTGGT 
TTAGTTATAACGTTACTCGATATATCCTATAACTATGAATAGTATTCCATGTAAAATTGCTCCTTGTTTTTGGTAATACATATATGTAAATTGTG 
TATATTTCGAATTTATTTTTTTCCAAAGATTCATTTTATGAATTCTCATGCTCTTTGGTTTAATTTAAAGAATAAACTTGAACAATTTACAATAA 
AAGTCAAATAAGCTTATTTTGATATTTTTATGTGGGCTTAATTTACTTTGTGCCCACTGTAGAACAGATGCAACCAGGTGAACACAGAATATCGC 
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TTATCGATAACTCAGCCCTGTTAATATTTGTCTGCGAAAACAGCTGTGGCTATTGTCTGTTAACTCGCTTAGTCTATTTATTTAACTAAATACCA 
TGTCGAATATTTCAAGTGTGGCACCACCTCCTCCCCCACCACCCATGATAGTCACTCCTAGCACACCTGCTACCACAAAGGAACGTCCAGTTGGG 
GACAATATAACCACCGATGAGTGCACTTGGGCATGTGAGTATCCTGAGGTGCAGAAAGGATGCTATTTAAGTCGCGTCTGCCAATATACACCACC 
GAAACACTGTCAATACTACTCCGTAAACAGCATAGTGCAGGTGAGTTAGAAAACAATAAAGATTTCTCAGTTTGTTATTCGATAACATATGTATG 
TGCACCGAAGGTGGGACCTACTTGTGGCTTAGTGGCCCTCAGCATGTTGCTTGGTGGCAGTCCCACGGCAGATGATCTCCTCAAGGATGCAATCG 
ACCAGGAGTATACCCTTAATGGTGAACTATTTAGTGCTCAGTATTTGTTTGAACTGACCCGGAAGCATATGCCAGGACCTGCTGCGTGTCAGTTG 
CATGTAGGTCCTTTGGACTGCAAAAAAGTCAAGGAACTTTTAAAGGCCGGAGGCTGCCTACTAGTGCCGTATCCTTTTTCAAAAATATTGCCATT 
TAAACCCGGGTTGAGTTTGGTATTCCCCTGCATTTAAAGCCGTTGGCGCCTAACAGCACTTGTAAACGGTGAACAGCACACTGTTGTCTGGCAGT 
GTTGTAAAACTCGCGGTAGTTCAGTGGTGCAACTTATAAACAGCGTTTTGTGTTTCGCAAAATAATATACAATTTTCAGTCAGTAAGTAAACGAG 
AAGTGCAAATACTTTTATTAAATGGCCGACGAAACAGAGCAGGTATGTGTAAGAATAAAAGCGAAAAGGATATTCTCTATATTCCGTACTTGCAG 
CTCAGTCAGGTGATTTTGCCGGTAAAGGAGCCTTTGGATCTCATCCGATTGAGTTTAGATGAGAAGGTGTACGTAAAGATGCGCAACGAGCGGGA 
ACTGCGAGGACGTCTTCACGTGAGTTGATGCCTCTTTTTACTGTTGCCCAGCAATCTTTCAATATACACCTTTATCTAGGCCTTTGATCAGCATT 
T G AAC AT GGT GC T C G GC G AT GCGGAGG AG AC GGT GAC C AC T GT G GAG AT CG AC G AGGAG ACC T AT G AGG AGGT GT AC AAG AC C G C C AAGC GC AC T 
ATTCCCATGCTATTCGTTAGAGGCGATGGAGTCATCCTGGTTTCGCCACCCATGCGGGTGGGCTAGCGGGTCTTGTTTAGCTTTAAATCAATTAC 
CTGCATTTATACAAGCTATATCTAAATAGGTTTATAAATGTCTTTTAAAAAAAATAATGTCTTGTTTAATTTTTGCTTACTATATAAATATATAA 
AAATCTTTTCTATGGGTTGTCCTTAATCAATTGTAATTTCAAAAGCTACGATGCCGATGTAAACCATGCGCCATGCGTTAAAAATGGACACCGAG 
CCCATTGGGCTTTGATTGTAGGTTATCTGGTGGACACGCAGGACCGGGTGAGTGGAAAAAGCTAGGCATATTAAGACCAATATTAATGTATAAGC 
CCTTCCGCAGTTTTATGTCCTGGCTCGTCATGGTAAATCTCGTAACCTAGCCGTTTGGCCACTGGACACGCTCAGCCAAAGCAACGAAAATTTAA 
AAGAGTTTGCCCAACCCAAGGGCTACCCAGATGATGAGTTTCTGCTGCCACCGGGTGGAATCGGTGGATCTCTGGGCCTCAACGAGCGCTGCATT 
CTTGTCAACGGGCTGCCAAAGCAGGTGATTCATGTTCGCTGGTCCTAAATATATGTGCATCGAAATATTTTCAATAAAAAAAACCTTTTTTAATT 
ACGTAGTTAACTTTAATATATTTAATAGTACAGTTATAAATTACACGCCTTACAATATTTCACTTCGCAACGCAATCAACGCACCTCGAACTCCT 
TTCCATCATTCAGTTCAAACGGGGAGAGAAACACGCAGAGTTTTCGTATTTAATCGGTTCGCAACTACTACGGGGACACGAATGCGATTTTAGGG 
GGAATAGAATATAATATACAATAATGATATATCGZ^ACTTTCATTTGGTTTAGAGCCAGCTGTGTATAAATTTATGTATACAAGTATAAATTTACG 
CTTTTCCAACATCCGCTAATAAGACTGGGCTGCCTTGAAATTTCAAAAATCACAACAATAAGAGTAGTACACATGGTTTTGTATATATGGTAGAT 
AAGATAGGTAACTATATATAGTATGTTTTGTATATAATTTCACACACAGATATTTTTCTGATTTAGGTAGTACATAGACGACGGGCTTGGGTATA 
ATGTCAAATTTTTGTCAAGTAGATTTAGTATGCTGCTGTACTTTTAGGTTCATTTGATAGTTGGCCT^AACGGGAGCGGCGATTTTGAGATCATGG 
GGCCTAATGCTCCCGATTGCTTGGAAACTGTCGGTTAAAGAATATATATATATTTTATTATATATGCTAGTACATGCCAACAGGCAAGAGCAACG 
TGTCCATAGAAATTTTTGAAACGGCAACTACAAAGTTTGGTTTTAGTTCTAGAACTTCTTGTACAGGATATTTTGATACGCGTTGTTCATCTCTG 
ACGAGTTTATAATTACTTTGAGCATACCATTAAACTAGTTAGTGTCTGTTTGTCTGTATGTGTCTGGTTGCGGGTCGAAAGTATCTCTGATAAAT 
TCACAAGTTGCGTGCGAAAATGAACTGAACTTAAATGAACGTTACAGGCTAACGGTTTATTAACTAAAAATTTACAAATACTAGTTATAAATAAA 
TTA 

(SEQ ID NO: 670) 

Exon: 1001.. 1277 
Exon: 1327.. 1444 
Exon: 1505.. 1672 
Exon: 2006.. 2233 
Start ATG: 1409 

Transcript No. : CT18597 

AGGCTGCCTACTAGTGCCGTATCCTTTTTCAAAAATATTGCCATTTAAACCCGGGTTGAGTTTGGTATTCCCCTGCATTTAAAGCCGTTGGCGCC 
TAACAGCACTTGTAAACGGTGAACAGCACACTGTTGTCTGGCAGTGTTGTAAAACTCGCGGTAGTTCAGTGGTGCAACTTATAAACAGCGTTTTG 
TGTTTCGCAAAATAATATACAATTTTCAGTCAGTAAGTAAACGAGAAGTGCAAATACTTTTATTAAATGGCCGACGAAACAGAGCAGGCAGCTCA 
GTCAGGTGATTTTGCCGGTAAAGGAGCCTTTGGATCTCATCCGATTGAGTTTAGATGAGAAGGTGTACGTAAAGATGCGCAACGAGCGGGAACTG 
CGAGGACGTCTTCACGCCTTTGATCAGCATTTGAACATGGTGCTCGGCGATGCGGAGGAGACGGTGACCACTGTGGAGATCGACGAGGAGACCTA 
TGAGGAGGTGTACAAGACCGCCAAGCGCACTATTCCCATGCTATTCGTTAGAGGCGATGGAGTCATCCTGGTTTCGCCACCCATGCGGTTTTATG 
TCCTGGCTCGTCATGGTAAATCTCGTAACCTAGCCGTTTGGCCACTGGACACGCTCAGCCAAAGCAACGAAAATTTAAAAGAGTTTGCCCAACCC 
AAGGGCTACCCAGATGATGAGTTTCTGCTGCCACCGGGTGGAATCGGTGGATCTCTGGGCCTCAACGAGCGCTGCATTCTTGTCAACGGGCTGCC 
AAAGCAGGTGATTCATGTTCGCTGGTCCTAA 
{ SEQ ID NO: 671) 

Start ATG: 360 

MRNERELRGRLHAFDQHLNMVLGDAEETVTTVEIDEETYEEVYKTAKRTIPMLFVRGDGVILVSPPMRFYVLARHGKSRNLAVWPLDTLSQSNEN 
LKEFAQPKGYPDDEFLLPPGGIGGSLGLNERCILVNGLPKQVIHVRWS* 
(SEQ ID NO: 672) 

Name: LSM3; U6 core snRNP protein 
Classification: RNA_binding 



Celera Sequence No. : 142000013384321 

CTGTTTGCGGCTGGTTAACTGAGTGGGTGTATTGGCCACATCGTAGTCCAGCTCATCACCAGTTTGCTTAAGGCTGTGATACAGGAACTTGGACA 
GCAAATTCTTTGTTTGCAGCGCCTCCCGGGACCGCTCCTTTAGGCAGTAGTTGTGCTGTGTATCTCCATAGGTCAGGACGTGCTCCAGAATAAAG 
GATCTTAGCCACTTCCAGTCGTCATCCGCCTTTCGCAGCAGATACTTCTGGATGTCGAACTCATCCAGGAGCAACGTGTTGCCCACCTTGTGCAC 
AACCATGCTGATGGCGCTCTTCGCGCTGTAAGGCAACTTGAGTAGCTGCTTGATATTCTCCGCGTCGCTGACCACATCCACCTCTCCCACGCAGT 
CCGGAAACATCTGGCCCAATCGAAAACTGGCAAAGCCAGCACTTTGGTAGAGGGCCTGGTGTAATCCATGACTGCTGTGCGACGTTGTCAGCCAG 
TTGAGAGGCGGAAGATTCAAGTTGGTGTTGCACTCCAGCCGCTTGAAGTGGGCTGGCACTGGAACGGGAGACAACTTAATCACCGCCTGGCTTTT 
TACGACGTCCAACTCATGTGGTCCAATACTCCTGCACGGTTCAATCTATCGAAAAACATGTGTTATTTATTTAAGCTTCTTTGCGCACATTCTTA 
CTTCTTCGTTTCTATCTTCCATTATTGCGCTCGCTCATTGGTAACTCCCTGAAAATCCAATCCCGTGGAATATTCAAGAAACTTTCGAGCAGATG 
TGTTTTTGTTTTGCCGGCTCAGCTGATAGCTGCCGCAGTTTTCCAACACCGGACCAGACCGGCGCTGCCAACTTAGTGCCAGCACAGTTGCATGG 
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GATCAGGTAGACTTCAAAACCTCTACAATTTATAAGGGTTTAAACTAAGTTTCAAGACTAATTATATATGGCATACAATAAACCTCATATTTAGT 
TATTCAATACTTCCAACACACTTTTGCTCTCTAGATGATCGAAATGAAACTTACAATGAAATTTCTTTCAGTGGTAGCTCAAAATTTAAGCATAA 
TAATTATTTCAAAAACTATGTGAAAATATAACTGTGGGAAACCGTCCCCTCTCCTACGCCACTGGCCAATTCCAAGCTGATGACTTTCTGATGCT 
CGTTATCCAGCACTGGTATATTATTTTGTTTTCCATTCAGTAACACGATTATCACAAACTGTCGCAAATTTTCCCGAATTGAATACAAATCTGAC 
ACAAAAGCATGACACTGTTTGCCCTCATCGCCCTGCTCAGCCTGCTGAACACAATCCTGCCAGATTTCATAAGGAACTACGTGAGTACCATCCGC 
AAGAGAAAACCCCAGTGTAGAACCTTATCCTGACCTGCAGCTCAAAGTGTCGCGGTTGTGGGGCGTAAGGAACTCCGCGGAAATTCGGCAGTTGC 
AGACAGAACTGGACGATGCCCGGAAGCAAGTGGAGGTGGTCAGCAATGCGGAGCATTCCGGAGAGTACGCCAGGACCATCAAAATTATGCGCGCC 
GAGCGTAAAGTATCAGAGGCGGAGGCGAAACTGAAATCTGCCCGGGGAATGGAGAACTTCATGCGGATAAGCATCGATACGGCCATGTTCTACGG 
ATCTAAGGTGCTGCTCTCAGCAATAACCGTGTTTATCAGCATCCGGAATCGTGGAACGCCCGTCATGATCATCGATGAAGCGATCAGCCTGGCTC 
CGTTCACAGGACTTCTTAGTTTTCCTACTGGCGTGGCCAATGCCATTTCGGTGCCCGCCTGGGCATTCTCCTGCAACCTGACATTTCGACTCATC 
TACGGCTTTGTGAAAAATCGTGGGGCGTGATTCCATCCGAGCGAAATCCCTCCATTTCATTTAGCACTTAGAAGGCGCTCGAAACTATGTACTTT 
GTTATGGTTTGCTAAGCTTCATCATGCATTGACTCTTGAATGTAAAGGCCCAATTTTATGTAAGCACCACATCAAATAAATGTATTTTCAGACAA 
GATTCAGTTTGAATCACGCGCCTTGGTCTAAGGGTGTTTTCCATCACTGCTCGTAAACATAGTGTTGCGCTGTGCTCACTCTAGATGAGTGCGTG 
TCATTTTCCCCATCCAATTTGTTTGGTGTTTTGTTAGCCGAAATGGACAAACTAAATGAAAATGCTCGACAGCGGAAGCAGATCAAGCTGGGAGA 
GATCGACACGGGCCCCATTATAGTGGCCCTCCTGCTAGGCTTCATTGCAGTGGGTGAGTCCGCCACGTGGAGTTCGCCTTTTGCCCCAGGGAGTC 
TCACCTGATGTCTATAATCCCGTTCACACGTTTTCTCTTTTAGCCATATTTGTGATCCTGCGAAGACGCTCTGCTGGTCGCAAGGACTTTCTGCT 
TACTGGGCTCAGTGAATCCGGCAAAAGTGCCATCTTCATGCAGCTGATCCACGGCAAATTCCCGGCCACTTTTACTTCCATCAAGGAGAACGTGG 
GAGATTACCGGACGGGCAGCGCGTCTGCCAGGTTAGTGGATATACCCGGACATTATAGGGTGCGCGACAAGTGCTTGGAGCTATATAAACACCGT 
GCCAAGGGCATTGTCTTTGTGGTGGACTCGGTTACTGCCCACAAGGATATCCGGGATGTGGCAGAGTGAGTTTTCAATTAATGTACAACCTAGAT 
GCGGTGCTAATCTTATTGGTTTTCCAGCTTTCTGTACACCATCCTATCAGACAGTGCCACACAACCCTGCTCTGTGCTAGTCCTTTGCAACAAGC 
AGGATCAAACCACCGCCAAGAGTGCCCAGGTCATTAAAAGCTTATTGGAGTCGGAACTGCACACAGTGCGCGACACCAGAAGCCGCAAACTTCAA 
TCCGTGGGCGACGAGGATGGCAGCAAATCCATCACATTGGGCAAGCCAGGCCGTGACTTCGAATTCTCACACATCGCACAGAATATCCAGTTTGC 
GGAAGCCTCCGCCAAGGACACTGAACTTGATCCCCTCACCGATTGGCTCGCTCGGTTGCTGTGAATTGGGCACACAAGGCTCTCAGACTGATAAA 
AAACTGGAGTCCTGAAGGAAAGAATGCGGAACAAAAAAACAACAGATTGATATTTATTTTAACGTTTAGCGCTCTGAATGTCGTATTTATTATGC 
TAAGCCGATATACACCGCTGAGTATAAATCCGTGCATTGTTAAATATCCTCTTTTCCCATCCTTTGCCTTCCACCTATTTTCCCGTCTAATTTTA 
GGTTATTGTGGAAATTTTATATACTTTTAGCATTAATTTGTCGGTAGATAAGTGATAAAGTCATTTATAAACCGATTAAAAAGAAAAAAACCCAA 
ATAAGATTTACGAGTTTTTAAAAGCAGGTTGTGGGATCAGCAAAGGCTCCAAATAATAGCATGCTGGGGGGCTTAAATTTATGTCGTACATGTTT 
TTCTTTATTTTTAAGAAATGCCGCCAAAACTGAGTTTAGTTCATTGACTATCGATAGCACAGATATGTTTAACTTTTTAGTTCAATTTTCAAACA 
CAATTATCTGGTGGCGCCATTGTTTAGGCATGTAAAATTTTACGTAGCAGTCATGTAGAATTAGCACAACCTTATTTTTAAAATATAGCTTAGAA 
TAATAAGCATTAAATAATTTAAAGCTCAGAAAAAAATACATGTTTTGAAATTCTACATAAATTGGCTATAATAAAAATTTTTTTAATGTTATAAA 
AAAATGGAAAAAGTACAATTAACATTGAGAGTATAAATATTTCTTCATTTGAGTAAGGCTTCTATGAAAAAGTTCGATTTATATATATGTTTATA 
TCTACAGCAACAATTCGCTTCCTTTTATTTCAACAGCTGTATCTAATTTGATTATCACATGACCGAAATTCATGAAAAAGGGGTTTTGCTCAACT 
AGCAGTTGG C AAAGC C AT AT T T T T GACC AAC T GT C C C AGAAAC G AGG AT TAT T GT TAG AT C G CTGAGC AAAC GT T T GGC GC AT GC GC AAGC TT T C 
AAACCTTTTTCGCGATTTG 
(SEQ ID NO: 673) 

Exon: 1001.. 1315 
Exon: 1371.. 1824 
Exon: 2080.. 2238 
Exon: 2324.. 2630 
Exon: 2688.. 3009 
Start ATG: 1244 

Transcript No. : CT18686 

T T AC AAT G AAAT T TC T T T C AG T GG T AGCT C AAAAT T T AAG C AT AAT AAT TAT TT C AAAAAC T AT GT G AAAAT AT AAC T GT GGG AAAC C GT C CC C T 
CTCCTACGCCACTGGCCAATTCCAAGCTGATGACTTTCTGATGCTCGTTATCCAGCACTGGTATATTATTTTGTTTTCCATTCAGTAACACGATT 
ATCACAAACTGTCGCAAATTTTCCCGAATTGAATACAAATCTGACACAAAAGCATGACACTGTTTGCCCTCATCGCCCTGCTCAGCCTGCTGAAC 
ACAATCCTGCCAGATTTCATAAGGAACTACCTCAAAGTGTCGCGGTTGTGGGGCGTAAGGAACTCCGCGGAAATTCGGCAGTTGCAGACAGAACT 
GGACGATGCCCGGAAGCAAGTGGAGGTGGTCAGCAATGCGGAGCATTCCGGAGAGTACGCCAGGACCATCAAAATTATGCGCGCCGAGCGTAAAG 
TATCAGAGGCGGAGGCGAAACTGAAATCTGCCCGGGGAATGGAGAACTTCATGCGGATAAGCATCGATACGGCCATGTTCTACGGATCTAAGGTG 
CTGCTCTCAGCAATAACCGTGTTTATCAGCATCCGGAATCGTGGAACGCCCGTCATGATCATCGATGAAGCGATCAGCCTGGCTCCGTTCACAGG 
ACTTCTTAGTTTTCCTACTGGCGTGGCCAATGCCATTTCGGTGCCCGCCTGGGCATTCTCCTGCAACCTGACATTTCGACTCATCTACGGCTTTG 
TGAAAAATCATGAGTGCGTGTCATTTTCCCCATCCAATTTGTTTGGTGTTTTGTTAGCCGAAATGGACAAACTAAATGAAAATGCTCGACAGCGG 
AAGCAGATCAAGCTGGGAGAGATCGACACGGGCCCCATTATAGTGGCCCTCCTGCTAGGCTTCATTGCAGTGGCCATATTTGTGATCCTGCGAAG 
ACGCTCTGCTGGTCGCAAGGACTTTCTGCTTACTGGGCTCAGTGAATCCGGCAAAAGTGCCATCTTCATGCAGCTGATCCACGGCAAATTCCCGG 
CCACTTTTACTTCCATCAAGGAGAACGTGGGAGATTACCGGACGGGCAGCGCGTCTGCCAGGTTAGTGGATATACCCGGACATTATAGGGTGCGC 
GACAAGTGCTTGGAGCTATATAAACACCGTGCCAAGGGCATTGTCTTTGTGGTGGACTCGGTTACTGCCCACAAGGATATCCGGGATGTGGCAGA 
CTTTCTGTACACCATCCTATCAGACAGTGCCACACAACCCTGCTCTGTGCTAGTCCTTTGCAACAAGCAGGATCAAACCACCGCCAAGAGTGCCC 
AGGTCATTAAAAGCTTATTGGAGTCGGAACTGCACACAGTGCGCGACACCAGAAGCCGCAAACTTCAATCCGTGGGCGACGAGGATGGCAGCAAA 
TCCATCACATTGGGCAAGCCAGGCCGTGACTTCGAATTCTCACACATCGCACAGAATATCCAGTTTGCGGAAGCCTCCGCCAAGGACACTGAACT 
TGATCCCCTCACCGATTGGCTCGCTCGGTTGCTGTGA 
{SEQ ID NO: 674) 

Start ATG: 244 

MTLFALIALLSLLNTILPDFIRNYLKVSRLWGVRNSAEIRQLQTELDDARKQVEVVSNAEHSGEYARTIKIMRAERKVSEAEAKLKSARGMENFM 

RISIDTAMFYGSKVLLSAITVFISIRNRGTPVMIIDEAISLAPFTGLLSFPTGVANAISVPAWAFSCNLTFRLIYGFVKNHECVSFSPSNLFGVL 

LAEMDKLWENARQRKQIKLGEIDTGPIIVALLLGFIAVAIFVILRRRSAGRKDFLLTGLSESGKSAIFMQLIHGKFPATFTSIKENVGDYRTGSA 

SARLVDIPGHYRVRDKCLELYKHRAKGIVFVVDSVTAHKDIRDVADFLYTILSDSATQPCSVLVLCNKQDQTTAKSAQVIKSLLESELHTVRDTR 

SRKLQSVGDEDGSKSITLGKPGRDFEFSHIAQNIQFAEASAKDTELDPLTDWLARLL* 

(SEQ ID NO: 675) 
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Classification: enzyme 
Gene Symbol: BcDNA:GM0477 9 
FlyBase ID: FBgnO028478 



Celera Sequence No. : 142000013384548 

CTGGGTAAGACAGTCGAGAAAAGCTACCATGGCAGCATCAAACTTGGTGTCCCAGAAAAACTTAAATCCGCCGCTGCCATAAAGCGGGAGCTCTC 

GATTCTCGCCGAGCACCTCAACATACGAATGATTTCCAAAGGGTACTACACGATACCGCTCGAAGGTGAGCCCTATCTTACGAGCAAGCGCAGAG 

AGCAGAAGCACCGTCTGGCCCCAAGCGGCGTTGATCTCTGACCAGTCCACGGATACAGAGGGCAATCGACCCAGCCTAAAGTTATTAATGGTACC 

GAAATGCCCGGCATGCCAGATGTGAAAGGTGATGTTGAATATGTTGGTGTCGCGCAGTTTGTCCAGCTGCTGTTTCGAGTAGGCGATCTGACACT 

CCAGACTTCGTTTGTCATCCTCGGTGAGCATTAGCTCACGCCTGTGCTTGGTATACTCGCGCCAGTAGCTTTCCTCCTGCTCGTGCAGCTCCTCT 

CGCTCCTGTTCCTCCTCGGCAATGGCATCATTTAGCGATTGTTCCTCCTTTTTGAGCTCTTTTAGCTCCGACAGAAGCTGTTGCTCGCTGCGCTT 

TAGTTCGTCGAGCTCCTTGTCCAGGGCCTCAACGTTGGGTGCTACACGCTGTTGCTCTAGTTCATCCAAATAAGCCTTGTACACATCCCACTCGT 

CCTCAGCGATGCGCAGTTCCCTGTCCATGATTTCCAGCATGGAGTCGGCACACTCTTCGCACAGCGGATGGTCAATCTCAGAGTTGGAGGAGAGG 

CAGTCAAACAGCTCCGCTTTCAGCTTAAAAGCAGCACTCATTTTCTTGTTGTCCCTGCCATCGGAAACCAGCATAAAACCAGTGCCATTTATAGA 

GTCCGTAAGCCTGTAGGGCGGCACAAAGTGGTCGAAGCTGCTGGCGTCCTGCGGGTCCAATGTATTGCCATTGTCTCCGTAGATGGGCACTGAAA 

AAGTGATTTATTAGACTAATTTCCAGAGTGTATCACCCATCATCACCTACAAGATAACTCCGCCATTGCGTGCACGCTAATCTTCTCCAGCTGCT 

CGTCCAGGACGATGGGCTGCAGGCAGCGCTGACAGGCGAAGGACACCGCCTGCTTTTCCGCCTCACTCATTGTCGTTCCTGTTTGTTGTCCTTCG 

ATT T T T G C AT AC AAT T AAAT CTTTCACC G AGT GT AAAAAC AC AAAT AAT C CAT C AGC T GT TT GC AACT GCT AC C AGAGT T GC C AC CC C C T C C G AA 

AAAG AGT GT G G A AAAAT GGC GC AGT T AAT C GG AC AGT AAAT C GT C AAC C AAC AAC AAC C T G AG AC AGT AAAAAAG AGAG AAAT AACAG AAAAC C A 

GTTTTAAACAATAACATTGCCTAATTGTGTCCGCCGGAGATGCTTAAGCGTGCCGCGAAGCTGCAGGCCTTGACCACAATTCCCGCTGATAAGAG 

AGAACTCAAGGGGTTGGCTTTTGGTTTGTTTTGGTAAGCGACAGAACATTAGACGGAGTGCCGTCGCCAGTGCTGCCAACCTATTTTAGCTATAG 

CAAATATTTCATAGTAGCAAAGAGGTAGAACCCAAATACAATTTTTATAAATAATAGTTTATAGTTAGCGAACGGACTAAAAGAGAAACCTGATA 

TCGAACGACTTATTATATTTTTTGTTAACTCAATTTTAAACTTACTCCAGCGTAAACCATACTTTATAAATAGTTTACCAAAAATGATAGCACAA 

GT GC C AG AAAGC T AT T C ACT T C AT T T T T AAT T TG ATT AGGT AAAT T GT GC G AAGG AAGT ATT T AAAAAAAT GT GAAAT G AC GT AC TAAGC AC C C C 

AC AT AT AGAC T AAG AAT T AAAGGT T AAGG AC ATT GACT T T C G C AAT CT T T GT AT T GT AGT T AAGCC AC AAGC AT T G AT AAGC AAAGAAAT T G T AA 

GAG T GAAAT AAAAT CAT AAT AAG AT G AAT TAATTAAAGT A GAGAAACT GAT AAAG AC T T GC ACT T GAAGC T G AC AC AT GT GT T T AAT AAC TAT AT 

AATTGCTAAGGGAAATTTTGTTTTTGAAAAGTTTCCGTTATCAAGCTCGAAATTTCTAGCAATGGGGCAGCACGAAATTTGCCGCTGTAAATCTA 

GCTTTCAAAACCTGGCAGCACTGTCGGTGGTCACATTGTAGCCTCTTCGCATTTGACGACCCATGGAAAACTTTCAGTCTCTTGCAGGCAGCACA 

GCGATACGGAAGCAACAATTGAAACGTCCAGAACTCCTCCTCCCACGAGTATTTAAATTCTGACACAAGAGTGCCTGAACCAAAATCCCACTCAC 

CGAATGCTGGTGCTTTGATTTGCAACTGTTCCTAGGTTCCGGCTGAATCCCTAGATTTGCGGATCTACGCAACTCATTCACTCAAAAGCCAGCCG 

AATTTATCGCGCCAGATTAAACAAACAGAAACAAAAAACAGAAGCCGAGACCAGTTCCACATCGTGCCATTTTTATCTGGGCATATAGGCAGACG 

TGATTACGTGATTAGTGCGGCTGGGGCTGCCGTTTATCAATAGCGAGAGCAACAATGGTTTCCTACTTCGTACCTCGTGGCCGCTTCCTGCTGAA 

GGCCGGCAATCTCAGACAGGTGGTGCAGCAGCAGCATCAGCCGGCCCAGCTACAGCTGCAGCCCATTAAAGGACCACAGCCGCAGGCTCAGAATG 

CCAGTCTGCCGGTGGCCCGTCACTTGCGCCAATTCTCCTCGAATCCAGCCTCCAAGGAGGCACCATTACACCACCGGCGCCCGCAACATAAACAA 

CAACCGAATCCTAGTCAGGAGTTAGCTCAGATTCGGCGCAATATACTCTCACGGTGGACGGGCTTCCTGTTGCGTTGGGCTCCCATGGGCATCTG 

TGTGTTTGGCGCCATCGAGTGGCAGTTGCAGAAGAACCGCTGCGAAAAGGAAGGAAAACCTCGGACAGCGTCCGAGCTCCAGTCGCGCATTTACT 

GCTCCCTGCCACTGCGCATAATCAGCCGTTGCTGGGGCTGGCTGGCCGCTTGCTACTTGCCTCCCAGTCTGCGTCCCTACGTCTACGGATGGTAT 

TCCAACACGTTTGATGTTAATTTGAGCGAGGCCATGTATCCGGAGTATGAGCACTACAATAGTCTGGCTGAGTTCTTTACCAGGCCACTTAAGGA 

GGGCGTTCGTGTCATCGATCAGCAGGCTCCACTGGTCTCCCCCGCCGACGGTAAGGTTCTACATTTTGGCAGCGCCTCGGACTCGCTAATAGAGC 

AGGTCAAGGGTGTTAGCTACAGTATCGAGGACTTCCTTGGCCCGCTGGAGACTGTGGAGCAGGCAAATTCCGGTGCCTCCTATGCCCAGGCCCTC 

AAAAAGAAGAGCGATGGTTCTACGGAGCTGTACCAGTGCGTGATATATCTGGCTCCCGGAGATTACCATCGATTCCACTCTCCTACCGCTTGGAA 

GCCCACCATTCGTCGTCACTTCTCCGGCGAACTGCTGTCCGTGAGCCCCAAAGTTGCCGGCTGGCTGCCTGGTCTGTTTTGCCTCAACGAACGTG 

TGCTGTACATGGGCCAGTGGAAGCACGGATTCTTCTCCTACACCGCCGTGGGTGCCACAAACGTGGGATCCGTCGAAATCTACATGGATGCCGAT 

CTGAAGACGAACCGTTGGACTGGATTC AAT GTCGGCAAGCATCCGCCC AGC AC CTACGAGTACGATGAACTGGTCTTGAAC AAAG AACT AAC AGA 

AGCGCCCAAGGAATTCGGCAAGGGGGATCTGGTGGGCCAGTTCAATATGGGCAGCACCATAGTCCTGCTCTTCGAGGCGCCAAAGAATTTCAAAT 

TTGATATTATCGCCGGTCAGAAGATCCGCGTTGGCGAGTCCCTCGGCCACATCGTCGGCTCCAAATGAAAATGGGAGCCACAATTATGTACCGAA 

TCAGTGCAAAGTCCTGTCCCCCAACTGCTCAGATCACTTGTTGTATTAGCATTAGACGCTCGTGTCCTAGGCAAATGTATTTTGTGAACATCCTG 

CGAAGCTAACTCTGTGTATTGTTGCCATTTCAGTAGTTTTAAGCTATATTTTATACAATAAACGTCGAAATTATATTTGAGAACAACAACAGCAC 

CATCCATCAATTAACTCATTCAGGCACGCGCTTCTGAAGTAGAACACCGATTGTTGAGGTGGGGCGTTATCAGGGAACGTTTTGATAAGAGATAC 

CAGGCTGAAACGC AGT CGTTGAACCCCTCGTTGCTGAGGATTCAGAAAGACTAAGTCTGTGTCTGTAGCCAGCTTATCGGGCC AAAT AACGCTCC 

GGGGGTCCACACTCACCTCCTTTTGCGGAGTCCACGCTATCAGTGAACCTGAAAATGAAGCAAAACCTTTTTGGGCTAGACAAGCGCGTGCCGCT 

GCCGACGACAAGATGACAAGTCGTCTATGCCTTGCGCCACCCAGTGTTTACTTTAGCGCAGAATCCGGGATCGCAATCCATTCCGTTTAATTAAT 

TCAATCAAGCCCGGACGCCGGGAATCTTGTTCGTTGAACTTATCAGGCGGTGACGGGTGATTTGGTTCCTGGCCTTGACACACACACTCCACGAA 

GTCCGTGAGTGTTAATTATTCGCCATGGCAACCGATTTGATTGCAAATCGAGCATCGGCTTAAGGGCAATCACTAAGGGAACTTCGAATATGGCG 

CCAACAGTGCTGTAAATGTTTAAATAAAAGTATAAGTCGTGGAATTGCCACTCTTTAAAGCTATTCGTTGTGAAAAAAAAACATAAGGAACTTTT 

ATTTTGGCATATCACGCCATATTCTTATCAAATAATTTTTTTTCGGTTTAATTTTAAAAATTCGCGCCAACTCGATAAATGAACGTTCAGTTAAA 

TTCACCTCATTGTACAAACGAGGTGAACTGTGTTAAACAGCTGACAGCCGGCATTCATTTGTTGTTCTCTGTGATTTTACGTAATATTGGTATTT 

ATTCTGAATATAATTGTTTATAAAGCACATTAAAATGATGATGCGACACTGTATGATGGCCTTGAGGGTCGCTAGATCCCAGATCCCGTTCAGCA 

CTGCCAGGAATACTCAGAGCAATCTCCTCCAACGTT 

{ SEQ ID NO: 676) 

Exon: 1001.. 1458 
Exon: 2121.. 4071 
Start ATG: 2525 

Transcript No. : CT18825 

AAGATAACTCCGCCATTGCGTGCACGCTAATCTTCTCCAGCTGCTCGTCCAGGACGATGGGCTGCAGGCAGCGCTGACAGGCGAAGGACACCGCC 
TGCTTTTCCGCCTCACTCATTGTCGTTCCTGTTTGTTGTCCTTCGATTTTTGCATACAATTAAATCTTTCACCGAGTGTAAAAACACAAATAATC 
CATCAGCTGTTTGCAACTGCTACCAGAGTTGCCACCCCCTCCGAAAAAGAGTGTGGAAAAATGGCGCAGTTAATCGGACAGTAAATCGTCAACCA 
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ACAACAACCTGAGACAGTAAAAAAGAGAGAAATAACAGAAAACCAGTTTTAAACAATAACATTGCCTAATTGTGTCCGCCGGAGATGCTTAAGCG 
TGCCGCGAAGCTGCAGGCCTTGACCACAATTCCCGCTGATAAGAGAGAACTCAAGGGGTTGGCTTTTGGTTTGTTTTGTCACATTGTAGCCTCTT 
CGCATTTGACGACCCATGGAAAACTTTCAGTCTCTTGCAGGCAGCACAGCGATACGGAAGCAACAATTGAAACGTCCAGAACTCCTCCTCCCACG 
AGTATTTAAATTCTGACACAAGAGTGCCTGAACCAAAATCCCACTCACCGAATGCTGGTGCTTTGATTTGCAACTGTTCCTAGGTTCCGGCTGAA 
TCCCTAGATTTGCGGATCTACGCAACTCATTCACTCAAAAGCCAGCCGAATTTATCGCGCCAGATTAAACAAACAGAAACAAAAAACAGAAGCCG 
AGACCAGTTCCACATCGTGCCATTTTTATCTGGGCATATAGGCAGACGTGATTACGTGATTAGTGCGGCTGGGGCTGCCGTTTATCAATAGCGAG 
AGCAACAATGGTTTCCTACTTCGTACCTCGTGGCCGCTTCCTGCTGAAGGCCGGCAATCTCAGACAGGTGGTGCAGCAGCAGCATCAGCCGGCCC 
AGCTACAGCTGCAGCCCATTAAAGGACCACAGCCGCAGGCTCAGAATGCCAGTCTGCCGGTGGCCCGTCACTTGCGCCAATTCTCCTCGAATCCA 
GCCTCCAAGGAGGCACCATTACACCACCGGCGCCCGCAACATAAACAACAACCGAATCCTAGTCAGGAGTTAGCTCAGATTCGGCGCAATATACT 
CTCACGGTGGACGGGCTTCCTGTTGCGTTGGGCTCCCATGGGCATCTGTGTGTTTGGCGCCATCGAGTGGCAGTTGCAGAAGAACCGCTGCGAAA 
AGGAAGGAAAACCTCGGACAGCGTCCGAGCTCCAGTCGCGCATTTACTGCTCCCTGCCACTGCGCATAATCAGCCGTTGCTGGGGCTGGCTGGCC 
GCTTGCTACTTGCCTCCCAGTCTGCGTCCCTACGTCTACGGATGGTATTCCAACACGTTTGATGTTAATTTGAGCGAGGCCATGTATCCGGAGTA 
TGAGCACTACAATAGTCTGGCTGAGTTCTTTACCAGGCCACTTAAGGAGGGCGTTCGTGTCATCGATCAGCAGGCTCCACTGGTCTCCCCCGCCG 
ACGGTAAGGTTCTACATTTTGGCAGCGCCTCGGACTCGCTAATAGAGCAGGTCAAGGGTGTTAGCTACAGTATCGAGGACTTCCTTGGCCCGCTG 
GAGACTGTGGAGCAGGCAAATTCCGGTGCCTCCTATGCCCAGGCCCTCAAAAAGAAGAGCGATGGTTCTACGGAGCTGTACCAGTGCGTGATATA 
TCTGGCTCCCGGAGATTACCATCGATTCCACTCTCCTACCGCTTGGAAGCCCACCATTCGTCGTCACTTCTCCGGCGAACTGCTGTCCGTGAGCC 
CCAAAGTTGCCGGCTGGCTGCCTGGTCTGTTTTGCCTCAACGAACGTGTGCTGTACATGGGCCAGTGGAAGCACGGATTCTTCTCCTACACCGCC 
GTGGGTGCCACAAACGTGGGATCCGTCGAAATCTACATGGATGCCGATCTGAAGACGAACCGTTGGACTGGATTCAATGTCGGCAAGCATCCGCC 
CAGCACCTACGAGTACGATGAACTGGTCTTGAACAAAGAACTAACAGAAGCGCCCAAGGAATTCGGCAAGGGGGATCTGGTGGGCCAGTTCAATA 
TGGGCAGCACCATAGTCCTGCTCTTCGAGGCGCCAAAGAATTTCAAATTTGATATTATCGCCGGTCAGAAGATCCGCGTTGGCGAGTCCCTCGGC 
CACATCGTCGGCTCCAAATGAAAATGGGAGCCACAATTATGTACCGAATCAGTGCAAAGTCCTGTCCCCCAACTGCTCAGATCACTTGTTGTATT 
AGCATTAGACGCTCGTGTCCTAGGCAAATGTATTTTGTGAACATCCTGCGAAGCTAACTCTGTGTATTGTTGCCATTTCAGTAGTTTTAAGCTAT 

ATTTTATACAATAAACGTCGAAATTATATTTGAG 
(SEQ ID NO: 677) 

Start ATG: 863 

MVSYFVPRGRFLLKAGNLRQVVQQQHQPAQLQLQPIKGPQPQAQNASLPVARHLRQFSSNPASKEAPLHHRRPQHKQQPNPSQELAQIRRNILSR 
WTGFLLRWAPMGICVFGAIEWQLQKNRCEKEGKPRTASELQSRIYCSLPLRIISRCWGWLAACYLPPSLRPYVYGWYSNTFDVNLSEAMYPEYEH 
YNSLAEFFTRPLKEGVRVIDQQAPLVSPADGKVLHFGSASDSLIEQVKGVSYSIEDFLGPLETVEQANSGASYAQALKKKSDGSTELYQCVIYLA 
PGDYHRFHSPTAWKPTIRRHFSGELLSVSPKVAGWLPGLFCLNERVLYMGQWKHGFFSYTAVGATNVGSVEIYMDADLKTNRWTGFNVGKHPPST 
YEYDELVLNKELTEAPKEFGKGDLVGQFNMGSTIVLLFEAPKNFKFDIIAGQKIRVGESLGHIVGSK* 
(SEQ ID NO: 678) 

Name: PHOSPHATIDYLSERINE DECARBOXYLASE PROENZYME 
Classification: enzyme 

Celera Sequence No. : 142000013384303 

AAATGGCGACCAGGGTCTGTGGGGCTCACCAGATTAATATCAGGGTGGTGACCACATTGAGGATCAGTCCCACGATGGTAATGAGATTGGGGGCC 
AGCCAAAGGGGCGTCTGGGCGACCAGCCAGTTCCACCACGGCTGCAGCAGCGGATCGAGCAGGCTGACACTGAAGCAGGAGTACTTGTGTTCGCT 
GAGCTTCCTCAGCTGCTGGGCACTCAGGATGTGCTTGTCCCGGTAGGCGAGCAGCGCCATTTTCCCCCACCTGGTTACGTCGTCCGATTCCTGCG 
TCGTCGCGCTCTCCTCACGCTTTTCACTTTTCACTTTCCTCTGACCTAGCGCTTTTCCCTCGCGCGCATTTGTGTACGTGTACTTGACGTATGCG 
CGCGTGTGCGTGTGTGTCCGTTTGTGTGTTCGCTTTTCCTGGCTATTTTCACACGTGGAGCGCGATGCTGGCGCAACTACGAAGCGCGTCTATGA 
TTTGCGTTTATCAACTGCGGCGTGGTCAGTTTTACGATCGCTTGGATGCGCACGGAGTATCTGCACTTGTATGTTTACTTAGAAAAATTAACTTA 
ATTGAACGACCGTGCCGTGGAGTTTTTTTTTCTTATTGCCTATGACCAGTGTGACCGCGATTGTGACTGCAAAATAAACCATCTGCGCTTGCAAT 
GAAATACCAGAAAAGAACGATGTCCATTTGCGATAACACATAGCCCAGACTCAACCCAATTTAACAGCAGTCTTGGTGTTCTCTGTTAGTTGCAT 
TTTGCGCACAGCTATTTAATATCTTAGCGATATTGTTTGTGCAACTTAAAAAAAATAAATTTTCAATATTGAAGCAACTCAATTAATGACAATTT 
TGTATAGACTAACAATGACACGATCATTTGCTCTGTTAATGTAACATCAACATTTGCAATTTCCTGTTAATAGCCCGTTATAAGTCTTTTACTAT 
CGGTCGTCTATCGATATCGGCCTTTTAGCCGCCCATTCAAATTCAAATGTATATATCATAGTTTATTAGATAAATCAGTTTGTTGCTTAAACTAC 
GTCCTCAACTTAACATACCGTATGCAATGTTTCATTTGTGTATTCGCTTGTTAGTTAATATCTAAACCTTGTCAATTGCCTGCTTTATTGTTACG 
ATTAGAATTTATGTTTAAATATGCGGCTACTACTGGTTCTGGGCCGGGAAATGCTCGCCAACGCCTGCAAATATGCTCAAAAAGTACAAAACTGT 
CGTGCGTCCAGCTGCTGGCCAATATAAATTGCGGCTTATGTAAACTATGGCCTATCAAAAGTTATTTTTGATGTGTTGTTTCCATGAGGCATCGT 
CGTCGTGTTTCTTAGTTTGGTGGATATTAGGAAAATGTATAAATGTATTTGCTTCGATATATAATATTTATATATAGATAAAAAAATAGGCTTAA 
AATGTTTCGTTTTAGCATTTTGTTGTTGTTTAAATTACTTTATATTGGCAACTCTCATTACATTTGTTAATGCATATAGTTATGTAGTTGAAAAA 
CGCTCGAACGTGCAACTATTTCGCTTTGCTGATGACAAAAATGGTTACACACAGCACACTCAACCAATTATATAAATAGTAGTAGTAATAATTTA 
TAGCACAATATGATATAGCAGCAGTTTGAAACTTTTTACACATGTCACAAATGTTTTCTTCCCGTTTGCAAGATTCATTTTTCGTTCATGGAGTT 
TTTGTGGTCAGAAGACAGAGTGCAAATTATAATTACTGACAATTTTCAACTTTAAAGCGAAGGGACAAAATTTCACATAATCAATTTACAGTCGC 
CGCAAAAGTTGCATGTGCAGCCGAGGCCTAATATCGTTCAATTTGCATTAAGAAGATACATATTTATTTATAGATGCGTGCATGTATGCATATAG 
TAACCAGCGATGCAAAAATTAATTTAATTTATGTGGTAGGCATTTACGATAATATAATATAAATTTTGTTTTAATTGTCTTCAGCTTTGAGACAT 
TTAGGCAAAAGTTTTCTTTTCATAGTGAATTTGTTTTGTTTTCTATAAAACGTGTTAAGCATTAGGTTGGTTGGTTGGTAATTCGTTATCATATA 
ATCATAATAATATAAGTGTTCACTCGCTGCGAAACGCACTCCTTGAAAACATCAAAACATAAACTAAAGAGCACATGTAGGGCAAAATCCTCAAC 
ACACGTTCAAAAGCACAGGACAGAACTTCTCTTTATCGAACTTAAATTATATATTTATATCAAAAATCCGCTTGTTTACAATAATAATAATAGTG 
CCTGGTCTGGTATGTAGACATACTTGCATATGCCTATAGTTGCGGTCAAGATTGTAGCAAGTGAGTTTCGTCCTCTGTGCGGAAGGATTGCAAAG 
GAAAACTCCGATCAAATGCCTTTTTCTTCGGCCGCAGCTATAAATACACAAATAAATTTACATATCTAGTTTAAAGTGTCACTCAGTTGCTATTA 
AACGATTTCGTAATGTTATCGTAATGTAATTGTTTTATCAAAAAACGTCAAAACGTAAAAAATGCATTCGGCTTAAAGGCTTAAACTTGTGTTGT 
TCTATACGTTGTCTACAGAATTTATGTCCACGTTTCTTGTATATAAATATTTAAAATTTGTGTTTTTTAGTTTCTTCTTGTTGTTATTCGCAATT 
GCTGATAATGCTGCCTAATCGTTATGGGTGTGTGTTGTGTTTTTCGCGTTTCCCGGAGAGGATTGTGTGTGGCGTCCCTCCTTAGAAGTTATCTT 
ATTATGAATGATATGGCGTCACATAAGTTGCTGGGAAGATGCCTTTCCTATTGCCGATCTCGCCGTTCCACCAGTTCTCATCGGAGCGATCTGTG 
ACGGTGATGACATCGCCGCGTCTGAAGTCCAATTCCCCGGATTCCTGTGGCACAAAATCGTACAGCGCCTGCACGAGCATCTGCAATAAAAAAAA 
TCGTTCGGACATAATGAATCCAATCTAATACTCTTGGCATCGCTGTTTGTTTATAGTTCAAATGGGTGGGCGGTGGCGGCGGCATCGTACGCAAA 
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CATGACCTCAATGTGTTATGCGAAATTTGTATTTTTAAAATGAATCTGCATGTTTGCTGAGCAAACTCATCACATCAAAATGGAATGGAATGGAC 

GGCACACCCACCTCTTCAGGTATCATATCACGCAATTTGACATCCTGCGACCGGGACACGCTCGCCGTCCGATGATATTCGACCAGTTCGTTGAG 

GGAGTTGAACTTGACCACCCAGAGGAAGAATTTGCTTTGGGCATCGCGCAGAACCTTGAAATGCTGAACGCCATCGGGGCATCTGAAGGGGCAAA 

GAATTTCGATTATATACGGTTTTCAATCATAAATATGAAAGCTGAAGCCATTCAAGCGCAACTCACTTGACTGATAAGGAGAAATCGCCGGGACT 

GGATTCGCTGATGCGTATCAAGAAGGCGCCTTCGTGCTTATTTGACAGCAGCTTCTCAGCATCGGCGCGTGTGATGCGTCCGTAATACCAGCTAC 

AATAGAAATGATCATTATTTGGGTCTCGATATTGGAAATTTCAGGCTCGACTTACTCGTGATTCTTCATTTCTATGTAATTACTAGGTATGAGGC 

CTTCCTTTCCATCCAGCTCCGCGCGATACCAATTTGAATCGTCTTCCATATTTAATATCTGTGTAAAAAGAAGGGATGTAAGTTAATTTAAGCAA 

TACAAATCAATGGGTTTGTAAATGTAATGAACCATATTTTTGATTAAATAACTAATAAAGAAAGCTTGATTTGTGGAAAAAAGGATCTTAATGGG 

CATAAAATTACCTAAATCTTCTACCCCTGCCTAATAAAACAATCATATATTCAGCTTCATGAGGCTGAGTTAAATGTTAAATATGTTCTTTGGTT 

TTTCCATGAAGTTATTAAAACCGGAGCATCTGATTTTAATTCCCATTTCTCACGAAACCAAAGTACCAGCTGCACAGTGAGTAATCCGATCCCCG 

AAAATTTGTTGTAAGAACCTGCCTCCGTTGTTTCCATTTTGAGGGCCACTCCCCTTTAATCCCAACCCCCCCAGACAATGGCGATCAATCGAGAC 

ATTCTTCCGCTTTGTTTATTGGCGACTGGCCCGATTCCCCAGTGGATTCGATGAACAGTTTTCAATTGTTTGTTGATTGAGCGTTGTCCCAAAAA 

ACTGTGTCAGTGCGCAATCAGCGAAGATAGGCTTATGGTTTGGTATGAAACGTACATATTTATCTATTATCTCCACTATTCTCATATCTTTTTTT 

TCTTGCGAAACGCTAAAAATAGGCAAGCATTATGGCCGCGACATGTGATAAGAATGTTAAGTGAAACGGAGCACTCACTCGAATGGACGAATTTC 

ACTTTTGTCTATCTAATGAACTTTTTCCTAGCCCATTTCGCATGCCAATGACACAACAACAGGCTGCTGCCTACGTAATCATCGACTACTTTTAT 

GATATCTTACGATTGACAGCCGAAATTGAAGATTGTCTATCTGCAAAGGTCTCGCCATTTGCTTGGCCACTGGGAATTCCCAGATTCGAAGGTGC 

TGAGTTGAGCTTTTATGAATGAATCTCGTCGCTTATGACGGCGCCCAGCAGCCTTCAGTTTTCGGAATCTGTTTGCTTTTCTCAAGGAAATCGGT 

TGCACT^AATACCCAGGAAATGTGTTTATTGTGCAAAGGTCAGGATTTCGACTTTTAGTTAAGTCTTAAGTAAAGCATATTCCTATCTTTATGGAT 

TTCTGCCATTTTTCCCTTGAGCAGTTCAGTGTTTATTTGGTAGCTTTGTTGTTTTCCAATATAGCTTGTTCGTCAAGTCGGAATAATGGAATGGC 

ATTCAAATGAGTCACCCACAAAAAAAACCCCTGTTGATACTGTGTAGTTGAGGGTTGGTTACTTTTCTGAAGTTGCAACTTGTAATTACCCCATG 

AATGTCTCATCGTTAATTATACACGCTACTAACATACGAGGTTAATAACAAACATAATACCTTAACCCACAGATCGATTTCTTGTTAATTAAAGA 

TACAAATATATATATATAAATATGAGATATATCATATTCGATTCGTATAATGATGTTTCGAGTTTTTTGGTTACATTTTCGCACGTCGGAATGTG 

GAGGGGATTTGGGTGATGTGGAGCTGGAGGTGGATGGGGTGTCAATGTCGTTTCCAGCTTATGTAAAACCCAATTATTGTGCTCACTGTTGGCTG 

TGGCAGTGGGATTTTGCCCGCTCAGCTGGATCTGGCCGACTAATTGGCATAGCTCCATAGTTCTAATCCACCCCGCGATCACCGACCATCCCGCC 

CGCATCTGTCACCCAGACATTGTTCATCGTATATTTATTTTATTTTTAGATTGTTTCATTCATTGAAAACAGCCGCCGAG7VAATTCATTTCCACG 

GATTCGTCGTCTATTACAGAAAGTTTTCTCGCCGCTCAGCACTGGGAAAAACCAGAGGGTTTAGAAATAATATAAACTTAAATATGCTGAAATGC 

TTCAAGAACATAAACTTTCCAAGTATTCCATGATCGCATATTTATTTTCCCAACTGCATAAGGAAGTGCACCATTTTTTTTTCAGTGCAGGCAAT 

GTGCAATGTGCCTGTTGGATAATGCATAAATATTTAGCGGCGCTTTCGGAAGGCAGGCACGTAGTGGCCATGATTGTTATTATCAATTGTTTATA 

CTCGTATGCTTATGTAATTACTAGAGAAAAAATTCTATTGTCGTTTCGGAAATCCGACGCTGAAAAAAGTGTCAAAGCAACACGAACGTTCACTT 

TTCGCTGTTTTCGCGACAAGCGAAAGTTGAGCACAGAATCCCATTTCGTGTGTTTCACATTGTTATGATAAAACGTGGCCACTTTTGGTGCCCAC 

AC AAC AAG GC CGT T AG AAT T G GAGGC AT T TTTTTTTTT GG AAAAT GGAGC T C C AAAT AT CC AAGAAAT AC GC T G AC C TGT AAAC C AGT TT T C T G G 

AAAGAGAACGAACCCCAATGAAATGTTTGATGATTCACAGCGCTCCTCTCTCTATCGCCTTCTATGCCTGCTGTATGTTTGCTGGATTGGGGCAT 

CTTTTCAGATAAGGCTATCGAGAAGTGACTAATCGCTGGCAAACAAACAAATGCGCTGGGAGTTCGAACAGATATGCGTAACATACATATCTCCA 

ATGCGAGATAATCGTGTATTTCTATTTATCCCCCGCATCAATGGAGCCCAGAACTATGGGCTATTAAACGAACTATGACTATCAGTGCGGATTGT 

TTACCCATTGATAAGACCGTGCAGCTGGACAGAGGGCTCATCTACGGCGTAGAAGCCCCTCTAATGGACGGGAGCAGCGCACACACTTGACCTTC 

GGGC GG C G AT AAAG AAGC C T C T T C GGC G AGT CAT T G AC AC CAT C AAT C AGC AGC AGC GG C AG C AG CAGC AAC AAAT GC T GC AAAT GC AGC GGC GG 

CATATGGAAAAGAGGCGCAAAAAAAAGAGGCACAAAAAGGGAGAAAACCTGCCACGGGCCTGGGTTCCTGCTGTGACTAACCTACGACAAGGTAG 

ACACACAGCAACGAAAATCAGCTTACATCGCGCCCTGGCAACAAGAAACCGGAAAATTCGCCAGCCAAAACAAAGGGTTTGGCCTAGGACAGCCA 

GCCAGCCAGCCATCGAGCCAGTGAGCCAACACAAACACAAACACCTGGAAGGAAGCAAGGCGGCGAAAAAGATCGAATCGCAGCCTGCCAGCAGC 

CTTTTTCTTGGCCAGAAAAGCCAGCGAATACAAGCAACACGGGTAAGCATCGGATGTCCCCCACAAAGTTCCGCAAAGTCAAAGGGGGCATCGAA 

TATATCATCACTACAAACTGTTGGAATTGTTTTTCTAAGCATTCGGAAGAAAACGAATAAGGAAAGTTGAAATAGTATATATACTATAGGAATGC 

TAACAATAAATAAGTTCATTCAAGGAAACACAGTGAAAAATGCTAGATTTGCAAAGTATAGTATATAGATTAAATCATATATGCATGTTTATAGC 

TTTAGAAACCCCCTCCTCACATCATTTTTTAAACTTACTTTGTAATGCTAATCACAACTATATGCAGATTAAATGGGGTCAAAAATATGGTCTTG 

ATAGATTTGAAGTGCAGTTGAGAAATGAAAAAATGTAGAGTAGAATGTACAATATTTCAGTATGTGATGCTTCACAAGGATTTTGTACTGATAGT 

GGAATATATTTGACTGAATCTTTTCGTTCTAGTAAGAGTATTCGGGCTGCGGTGGCTAATGAGTCGTAATCCTTTCGTGTTGCCTTCTGCGAATG 

CACCTTTTGAAACTCTTTAGCCAGCTTTTGCGACTGCTGTCTGTTTGGTCGCGATTTAGTAACAAAATCTGTCTGGGGATTGAACCTAGGATGTG 

GGTTAAGGCAACGACCTGGACGGCGGCAGCTGCATCGCAGCGGATTATGCATGCAGGACCTGCCACGTCCCCCTGTGCACCCCCTTACACCCTTG 

CGTGCCCCCATTCAGTGTGAAGTCTATGAATAAATGCAGATATTATGATTTGAGGTGGGAATTGATGATTTTGGAACAGCACTCACCTTTAGAAT 

CTGAGTTTTGCGAAAACTCAGCTCGTCGTCAGCCGTCGCAGAGAAATCGTGTTTGGCAATCGCTTCCATTTTGCTGTGTTGTTTTGTGCGCAGTT 

AGAAAACGCTTTCGGAGTTTGTTTTAAAATTTAGTTGCAATTGTAGTTTTGGATTTAGTTCTATCAGCCTTGATGCACCACACACACGCACACAC 

AGAGGCACAGACACACACGCTCGTTCTCTGCTCTTTCTCTCTAACTCTCTTTGAATTTCTCTCTCACTCACTACTCAGTGCTGTGGAAATGAAAA 

CGAAACGAGACGAATCCGCGATTAGTTAGCGCACATTATATAAACGATAAATCATGGATATTCGGCAGTGCGCTTTTCTCAGTTCGGTTAATAAT 

AATATATCTTGTGCGAAGGCAAGGGAAGAAAGCGATTAAAAATACACCGACGACGACGACATTCCCAGGCGGAGCAGCAACAGAGAATGAACACA 

GACACGAGCCGCTCTCTCCACGGCGGCCACTGTGTGGGTGTGTGTGCGTGAGCGTGGCAGCGTTGCCAGCAGCTTGCGATTTGGAATTTCGGGGA 

AAAGCAGGTACCGCAAAAGCAATTAGACGATGTTTGGCTATTACTTGTTGTTATGCGAAGTAATTTGCCTTAAATAATATGAAATGTGTCGCTTA 

GAACACAATTAACATTTGGCAATCTAGGGCGTGGGTGCTTGCTGTTGGCAATGCTGCCGCGGTGACGTTGTTGTTTGTTTTATTTTTTGTAACCC 

CAATAAACACACACACACACACACACGCGGCGCTTATATTTCTATATGCGTAAGTTGCGGCTAACTATTTTCGAGTCTCTGGCACGTGCAAAGAG 

GCCGCCGGGCTTACAATTTCAAAGGGCTAGGGTGGCTGCAAGCTTAAGCGACACTAATTAGGCGCGGGAGCGGTTTGGCTAACATATTACTACTG 

CTAAATTTGTAATTCACTTTTATTTTCATAGCAGCGTGACCGTGGCCGGGTCGCTTTATTATTCCAAGAAAATATACTGAACACAACACTGTGGG 

AGAGGCATACCGCAGGGCTTTCGTAGTTTTCCTTCACCTGGTCACACCGCAAGACGCTTCGCCATTTTAAAAATCAAAACAGGAGACGCTTCGCC 

GGAAAAGTCAGCATTACGTTTCATTCCCAACCGGACACTTCAGTTTGACTCGCCCATCCGTCACAGCAGTCGCCATGCAGCGCCACAAGGAACTG 

TACAAGGAGCAATCGCTGGTGCTGAGTCCGCGCAATCACTGCCAGGAGAACAGGGATCGGTTGCAGGCGGCGCGCGCCAAAAAACGAGAGGATTG 

TTTTTATCAGAACCGCATCATCAGCGTGAGCCCGACGCCCGTCAAAATAAAGCAATTGGCGGCGGCCCAAGCTGCTCTGACCCAGGAAAATGTGG 

CACCCAAACTGGAGAGTCCGGAGCGGCTGGACACGAAGCCCGCAGAACTGCTGAAGGAATCCAACCCGAAGGTCTCCCGCCAGAAGCTCTACTTA 

CAACGCTACATGGAGTGGAAGATAGCCAAGACCAAGGAGCATAAGCAGCAAGACCAGAAACGTCGAGGCGCAGCCATCAATGTGCCGACAGTAAA 

ACAATCAAAGGCCTTGCCCAAATCACAAACATTTCGGGTTCCCGATAATCTGGCTTCTGCAAAGCAGAAGGAGGCGGCTCCCATGTTCCAGCCTC 

CCAAGCGGTGTTCTCTCTACATGATTGCCAATCCGACGGGCAAGGGAAAAGCCGCAGAACCGATTAAACCTTCAATACCCAAGCCAACGTCAGCA 

GCAGCTCCTCCGTCATCCAATACCGTGGCTGCATCTTCCGCCTTGGCCAGGCATAAATCAGCTGCTTCTGCAACAAAAATAGTGCCTGCAATTCG 

TCAGAACAACAATCCAGTGGCCTTGGCCAGGCAAAAAGCAGCTGCACGACCCATTCCCAACACAACGAAGCAAACTACCAGTGTAAGGCAGCCT 

(SEQ ID NO: 679) 
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Exon: 8594.. 8470 
Exon: 7870.. 7592 
Exon: 3668.. 3571 
Exon: 3511.. 3392 
Exon: 3312.. 3147 
Exon: 2930.. 1001 

Start ATG: 7669 (Reverse strand: CAT) 
Transcript No. : CT18920 

CACGGTCACGCTGCTATGAAAATAAAAGTGAATTACAAATTTAGCAGTAGTAATATGTTAGCCAAACCGCTCCCGCGCCTAATTAGTGTCGCTTA 
AGCTTGCAGCCACCCTAGCCCTTTGAAATTCACTGAGTAGTGAGTGAGAGAGAAATTCAAAGAGAGTTAGAGAGAAAGAGCAGAGAACGAGCGTG 
TGTGTCTGTGCCTCTGTGTGTGCGTGTGTGTGGTGCATCAAGGCTGATAGAACTAAATCCAAAACTACAATTGCAACTAAATTTTAAAACAAACT 
CCGAAAGCGTTTTCTAACTGCGCACAAAACAACACAGCAAAATGGAAGCGATTGCCAAACACGATTTCTCTGCGACGGCTGACGACGAGCTGAGT 
TTTCGCAAAACTCAGATTCTAAAGATATTAAATATGGAAGACGATTCAAATTGGTATCGCGCGGAGCTGGATGGAAAGGAAGGCCTCATACCTAG 
TAATTACATAGAAATGAAGAATCACGACTGGTATTACGGACGCATCACACGCGCCGATGCTGAGAAGCTGCTGTCAAATAAGCACGAAGGCGCCT 
T C T T GAT AC GC AT C AGC GAAT C C AGT C C C GGC G AT T T CTC CT T AT C AGT C AAAT G C C C C GAT GGC GT T C AG CAT T T C AAG GT T CT G C GC GAT GC C 
CAAAGCAAATTCTTCCTCTGGGTGGTCAAGTTCAACTCCCTCAACGAACTGGTCGAATATCATCGGACGGCGAGCGTGTCCCGGTCGCAGGATGT 
CAAATTGCGTGATATGATACCTGAAGAGATGCTCGTGCAGGCGCTGTACGATTTTGTGCCACAGGAATCCGGGGAATTGGACTTCAGACGCGGCG 
ATGTCATCACCGTCACAGATCGCTCCGATGAGAACTGGTGGAACGGCGAGATCGGCAATAGGAAAGGCATCTTCCCAGCAACTTATGTGACGCCA 
TATCATTCATAATAAGATAACTTCTAAGGAGGGACGCCACACACAATCCTCTCCGGGAAACGCGAAAAACACAACACACACCCATAACGATTAGG 
CAGCATTATCAGCAATTGCGAATAACAACAAGAAGAAACTAAAAAACACAAATTTTAAATATTTATATACAAGAAACGTGGACATAAATTCTGTA 
G AC AAC GT AT AG AAC AAC AC AAGT TT AAGC CT T T AAG C CG AAT GC AT TTTTTACGTTTT GAC GTT T T T T G AT AAAAC AAT T AC AT T AC G AT AAC A 
TTACGAAATCGTTTAATAGCAACTGAGTGACACTTTAAACTAGATATGTAAATTTATTTGTGTATTTATAGCTGCGGCCGAAGAAAAAGGCATTT 
GATCGGAGTTTTCCTTTGCAATCCTTCCGCACAGAGGACGAAACTCACTTGCTACAATCTTGACCGCAACTATAGGCATATGCAAGTATGTCTAC 
ATACCAGACCAGGCACTATTATTATTATTGTAAACAAGCGGATTTTTGATATAAATATATAATTTAAGTTCGATAAAGAGAAGTTCTGTCCTGTG 
CTTTTGAACGTGTGTTGAGGATTTTGCCCTACATGTGCTCTTTAGTTTATGTTTTGATGTTTTCAAGGAGTGCGTTTCGCAGCGAGTGAACACTT 
ATATTATTATGATTATATGATAACGAATTACCAACCAACCAACCTAATGCTTAACACGTTTTATAGAAAACAAAACAAATTCACTATGAAAAGAA 
AACTTTTGCCTAAATGTCTCAAAGCTGAAGACAATTAAAACAAAATTTATATTATATTATCGTAAATGCCTACCACATAAATTAAATTAATTTTT 
GCATCGCTGGTTACTATATGCATACATGCACGCATCTATAAATAAATATGTATCTTCTTAATGCAAATTGAACGATATTAGGCCTCGGCTGCACA 
TGCAACTTTTGCGGCGACTGTAAATTGATTATGTGAAATTTTGTCCCTTCGCTTTAAAGTTGAAAATTGTCAGTAATTATAATTTGCACTCTGTC 
TTCTGACCACAAAAACTCCATGAACGAAAAATGAATCTTGCAAACGGGAAGAAAACATTTGTGACATGTGTAAAAAGTTTCAAACTGCTGCTATA 
TCATATTGTGCTATAAATTATTACTACTACTATTTATATAATTGGTTGAGTGTGCTGTGTGTAACCATTTTTGTCATCAGCAAAGCGAAATAGTT 
GCACGTTCGAGCGTTTTTCAACTACATAACTATATGCATTAACAAATGTAATGAGAGTTGCCAATATAAAGTAATTTAAACAACAACAAAATGCT 
AAAACGAAACATTTTAAGCCTATTTTTTTATCTATATATAAATATTATATATCGAAGCAAATACATTTATACATTTTCCTAATATCCACCAAACT 
AAGAAACACGACGACGATGCCTCATGGAAACAACACATCAAAAATAACTTTTGATAGGCCATAGTTTACATAAGCCGCAATTTATATTGGCCAGC 
AGCTGGACGCACGACAGTTTTGTACTTTTTGAGCATATTTGCAGGCGTTGGCGAGCATTTCCCGGCCCAGAACCAGTAGTAGCCGCATATTTAAA 
CATAAATTCTAATCGTAACAATAAAGCAGGCAATTGACAAGGTTTAGATATTAACTAACAAGCGAATACACAAATGAAACATTGCATACGGTATG 
T T AAGT T G AGG AC GT AGT T T AAGC AAC AAAC T G AT T T AT C T AAT AAAC T AT GAT AT AT 
(SEQ ID NO: 680) 

Start ATG: 327 (Reverse strand: CAT) 

MEAIAKHDFSATADDELSFRKTQILKILNMEDDSNWYRAELDGKEGLIPSNYIEMKNHDWYYGRITRADAEKLLSNKHEGAFLIRISESSPGDFS 
LSVKCPDGVQHFKVLRDAQSKFFLWVVKFNSLNELVEYHRTASVSRSQDVKLRDMIPEEMLVQALYDFVPQESGELDFRRGDVITVTDRSDENWW 
NGEI GNRKGI FPATYVT PYHS* 
(SEQ ID NO: 681) 

Name: downstream of receptor kinase 
Classification: signal__transduction 
Gene Symbol: drk 
FlyBase ID: FBgn0004638 



Celera Sequence No. : 142000013384288 

TCCTGCGTGGGCATGTAGTCCTCCACGCGCACCTTCTTCTTTTTGCGCTTGCGGGCATCGTCCAACTCTTTGAGGCGTATTTTGGTAGCCTGTGT 
CTTGATGGCCGTGGAAGTGCGTATGGATTTGCGGCCGGAATCGAGGACCGTGAAGCGTGGACGTGGCCGCCGTTTCGTTACCCCACCACCAGGTC 
GTTTCTTGTGCAGGGCCGGCGTTGCCTTTGTTTCCTTTTTGACTGCAGGCTTGGTTTCCTGCGGAAATCATATGTACCATTTAAACACATTCAGT 
AATCACAGGATTTTATCGCATTTT2VAGAATGGTTCGCACCTTATAAGCCTTTGTGTTAACGACACCGCGTTTCCGCTTTTTCTCGGGCGCCTCTT 
CCTGATCCGAAACGGGCTCGTCGTTCTCATCGATACTGAAGTCGGAGTCAACCACATCTTCCTCTTCGTCTTTTTGCCTGCAGAGTGAATTTTAT 
ATGGCTTGAGCATTTAAATTTATTCAAATGATTAGTGCTCACTCGTATTCCTTATCCTCTTCGTCCTCCTGAAATCCACCATAGGACGTTTTGTA 
AAAGTCGTCCTCCTCTTCCTCATTGAGCAAATGAGCTATTTTGTTGCCGGCATTGTTGCGCCGTGATCGCGAGGCAGCCATTTTTGTTATTTATA 
TATAGAAACTATGATTTAAAGCAGCTATTAAATAATAATATTTTACAATTTGTAAATGTAAACAAGCAGAGCAGTGTGACCAATGTAGCAAACAT 
AGTATATATCGCAGAGGCTTTACAACTTTGGTATGCGTTTCAGCCCTGGTAAAGAAAATGGTTTCTCCTGCATTGGCAGTTATTCAAACACATTT 
AAAACATAGCTAAATTTATTTATTTATCTATTACATGTATCACAACAAAAAGCAGGCAGCTGCTTTAAAATCACATTAAAAAGCGGTCGGTAATT 
CAACTACCGAAAGTAGACCACATTACTTTACACTTGCGCTGGCCACACTCTTGGCTTGTTGCTTTTTATTCTGCCGGCTGCCTGCATTTTCTGAT 
TTAATTTAATACCATCGATAAAAAGACTGCTCGCAACATGTCGGAACAATGGGAGGTGGTCTCCAAGTCGCGCAAGCAAAAGAATCTGGACAAGA 
AGGTGTCTGCACACAACGAGCAGAAGCGCATCGCGGCCCAGTTGCCCAAACTGGAGGAGCTGTGTAAGAATCTTTTAAAAGGTGCACCTTGCAAT 
GCGATTACATGTTTGCCCCTGCCATTTTATTCACTTCGCAGTGCCAACTCAGCGGTATCGCAATTTGTTCGGCAGCAGCAACAACAACAACAGCA 
AGTCCCATTCCCCTGCGAAATCTAGTTCAAGTGCCTCATCTTCCTCAAAAGCCAATAAGTCGCCAGTTAAAAAGCACACCGCAAAGAATGCCTCT 
ACCCAAAAAAGGACCACGTCGGCAGCCGCATCGAAGCCAAAGACTCTGGAGCTGGCCTTGAGAAACATCACCCGGGATGATTTTGCTGCCCAACT 
GGAGCAGGTGAAGCTCAGTTGTCCTGGCTCCGAGCTGCGCTGGTTAAGCCATGTATGTTTCTAAAGGAAATGTGCCTAAATATTTGCTCACCTAT 
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CGTTGTCCCCCAGATTGCGCTATACTTCAATGAAGCCTTGTCCTATGACTGCGATCCCATCTTCTCCGGCCGCTCGGCTCAGTATCCCAGCAATT 

TGGCCAGCGCCTCGCTCAAGTACTCAATTGTGGAGTTTCTGGGCAGTGTGGGCGAACAAAACCTGGAGTATTTCTTCTACTCCCTGCTGGATAGC 

ATGTCAACGGATCTAAACAACAATCAGACAGTGGCAGGCTACAAGCTCATCCTACAGTTGATCGGTCAAAACTGGCCGAACATTTGTTCTAGGAA 

CCTGGCCAAGACTGCGTTGCTAAGGAACTCCTATCAGAACCGTAGCAACATCTGCCTGAGCATCCTGTGGGCAATTGGACAAGGTGGCTATCAGT 

CCCTAAACGAGGGCGTTCGCGTGTGGCAGAACCTGATGCTGCCCAACCTGGAGCTAAAGCAGTACACCAAGTTTGTGGTCGAGTATATGGAGCGG 

GCTCTGAGTGCAGCAGCTGTCCGGAAGACAGCGGATCCTCTGCAGATCAACCAGCAGGAGTTCTTTGCCACTTACAATGCGCTCAATGCACCCTA 

TAACCACCTGCCCAAGGAATGGCAGCAAAGTCTGAAGCGTAGTGCGCGTTTGTTGCTTGTAAGGACGAAAACAATTGATAGTTGTCCCATTGTTT 

CACTGACCTTTCTTCCTCCTGTGTGTAGCAACACTATATCAACAGTCCTGTGAGGCATGCCAACATATTCCTTACGCTCTTCCGGGAAATAAGCG 

CCGGATCCAAGCAGACTAATGAGATCGAAGGCTGTATTAGTTGTCTGCTCAGCAGCGGCAGAGATGATTGCCTCAGGGTGTGGCGCATGAACTAC 

AAGAAGCAACAGTTGCCTAGTCTACTGCTTCTGAAAGCTATAAGTACGATCTACAGCTGTGAAAATTCATTTAATTGGATTTTTGTTTAAAAGTA 

CCACCCACACTTAACGTTAATGTGTTATCTAATATTGAATTTCTATCTTTTCCTAGACGACAATTGGACCACGTCTACCCATGAGTTGGCTACGT 

CG AC T G T GT AT C AC T CT T TT C T GC AAG AT GT GGC GAAT C T G AAC G AAGAAC T G C AG GG AAGC AAG AG AAAC G AAGGT C AC C T AG ATG ATT T AAAG 

GAGGTTTTGTTGGTGAGTGCGACTTAAGTATATCACTCGATTCCAAGCATAAGGCTCTGATCCCAGATCGATAAAAGGTGAATGTTGAATGTGTG 

TTGTTTTATCTTTTATAGTTTTATTGGAAAACTAGTATATAGGTAATATATTTATTTTGTTAGCTTTTTTAACCCAAACTTATTATTCAAACTTA 

AATGAAATCAGAAACACTATAAATTTGAATTTTCTCACACCTCTGGTAACACTGTTTAAGTTAAGTGGTCTGGCTTAGTACATTACGTACTCTGG 

TCACATTGCTTTAGTGAAAAGAAAAACGAAGGTATAAACTTGTAGAACTGCCGTCTAAAAGTGAATAATTTATTGCAATCGGTGCTAAAAAGAAT 

GAT AT G AGC T TAT T AC AC TGC AGC T AAC T AAT GT AAAACT CT T C AC GT AGAAGT GGC C GAAC T GT T AG C C G T T AAT GAAGT T AG AGT T CT T TAG G 

AGGACGCTGCCAACGCGACGTCGCTGCGGGAAAGAGATGGAAAGCGTTAGCCGGCGTTCGTCCGAAATTTCTCCGCTATTCAACTGGCTTTTGAA 

GCCTGGAGTGAGCATAAATTAATGGTCCGCACCTTAATTATCGCGTTGTCGCAATTATTGTTGCTGTTGCAGTAGTGCAAAGTGCGTTTCGTGCA 

TATGTGTGCGTGTCAGTATTAACGGTTGTGTATTCCGCCGTTCTGCACTGGTATTGGTGGCAGCTATAGCTGCACTTTTCCCATAGCGTTGCCAT 

GCGGGTTCCAGTTGCCGCTAAATTTCTCCGAGCAGAGCAGACGTTTTGCTTTCGAAAAGAGAGTGGGCATGGCCATCGGGGATCATAAATTGCCG 

CAT TAT C AGC C AAC AT T CT T ATC AGT AC AC C C AAT T T T CG GCT GAT C T GAGGC T TAT T T T T AT TT T CC T AT T G AC T G AT GT GT T C C AG CT GC G AT 

TGGCCACTCGCCTCCTGCTCCCTCACCCACTCCCCTCTCACATCCCGCTTGTGCGCACTCTCCTCCTCTCTTCGCCT 

(SEQ ID NO: 682) 

Exon: 1001.. 1203 

Exon: 1277.. 1572 

Exon: 1629.. 2243 

Exon: 2309.. 2513 

Exon: 2622.. 2782 

Start ATG: 1083 

Transcript No. : CT19152 

TTGGCTTGTTGCTTTTTATTCTGCCGGCTGCCTGCATTTTCTGATTTAATTTAATACCATCGATAAAAAGACTGCTCGCAACATGTCGGAACAAT 
GGGAGGTGGTCTCCAAGTCGCGCAAGCAAAAGAATCTGGACAAGAAGGTGTCTGCACACAACGAGCAGAAGCGCATCGCGGCCCAGTTGCCCAAA 
CTGGAGGAGCTGTTGCCAACTCAGCGGTATCGCAATTTGTTCGGCAGCAGCAACAACAACAACAGCAAGTCCCATTCCCCTGCGAAATCTAGTTC 
AAGTGCCTCATCTTCCTCAAAAGCCAATAAGTCGCCAGTTAAAAAGCACACCGCAAAGAATGCCTCTACCCAAAAAAGGACCACGTCGGCAGCCG 
CATCGAAGCCAAAGACTCTGGAGCTGGCCTTGAGAAACATCACCCGGGATGATTTTGCTGCCCAACTGGAGCAGGTGAAGCTCAGTTGTCCTGGC 
TCCGAGCTGCGCTGGTTAAGCCATATTGCGCTATACTTCAATGAAGCCTTGTCCTATGACTGCGATCCCATCTTCTCCGGCCGCTCGGCTCAGTA 
TCCCAGCAATTTGGCCAGCGCCTCGCTCAAGTACTCAATTGTGGAGTTTCTGGGCAGTGTGGGCGAACAAAACCTGGAGTATTTCTTCTACTCCC 
TGCTGGATAGCATGTCAACGGATCTAAACAACAATCAGACAGTGGCAGGCTACAAGCTCATCCTACAGTTGATCGGTCAAAACTGGCCGAACATT 
TGTTCTAGGAACCTGGCCAAGACTGCGTTGCTAAGGAACTCCTATCAGAACCGTAGCAACATCTGCCTGAGCATCCTGTGGGCAATTGGACAAGG 
TGGCTATCAGTCCCTAAACGAGGGCGTTCGCGTGTGGCAGAACCTGATGCTGCCCAACCTGGAGCTAAAGCAGTACACCAAGTTTGTGGTCGAGT 
ATATGGAGCGGGCTCTGAGTGCAGCAGCTGTCCGGAAGACAGCGGATCCTCTGCAGATCAACCAGCAGGAGTTCTTTGCCACTTACAATGCGCTC 
AATGCACCCTATAACCACCTGCCCAAGGAATGGCAGCAAAGTCTGAAGCGTAGTGCGCGTTTGTTGCTTCAACACTATATCAACAGTCCTGTGAG 
GCATGCCAACATATTCCTTACGCTCTTCCGGGAAATAAGCGCCGGATCCAAGCAGACTAATGAGATCGAAGGCTGTATTAGTTGTCTGCTCAGCA 
GCGGCAGAGATGATTGCCTCAGGGTGTGGCGCATGAACTACAAGAAGCAACAGTTGCCTAGTCTACTGCTTCTGAAAGCTATAAACGACAATTGG 
ACCACGTCTACCCATGAGTTGGCTACGTCGACTGTGTATCACTCTTTTCTGCAAGATGTGGCGAATCTGAACGAAGAACTGCAGGGAAGCAAGAG 
AAAC GAAGGT C AC C T AGAT GAT T T AAAG GAGGT T T T GT T GGT G AGT G C G AC T T AA 
(SEQ ID NO: 683) 

Start ATG: 83 

MSEQWEVVSKSRKQKNLDKKVSAHNEQKRIAAQLPKLEELLPTQRYRNLFGSSNNNNSKSHSPAKSSSSASSSSKANKSPVKKHTAKNASTQKRT 
TSAAASKPKTLELALRNITRDDFAAQLEQVKLSCPGSELRWLSHIALYFNEALSYDCDPIFSGRSAQYPSNLASASLKYSIVEFLGSVGEQNLEY 
FFYSLLDSMSTDLNNNQTVAGYKLILQLIGQNWPNICSRNLAKTALLRNSYQNRSNICLSILWAIGQGGYQSLNEGVRVWQNLMLPNLELKQYTK 
FVVEYMERALSAAAVRKTADPLQINQQEFFATYNALNAPYNHLPKEWQQSLKRSARLLLQHYINSPVRHANIFLTLFREISAGSKQTNEIEGCIS 
CLLSSGRDDCLRVWRMNYKKQQLPSLLLLKAINDNWTTSTHELATSTVYHSFLQDVANLNEELQGSKRNEGHLDDLKEVLLVSAT* 
(SEQ ID NO: 684) 



Celera Sequence No. : 142000013384288 

GCGGGCCACATGCACGGGTTCTATGGATCGTTTGGGCAGCGGCATGGTCGGGCGCGATTGACAATTATCTCCAAATTGGCAAACCCGTAAATTAG 
TGCGTCCTCATCGACTGAAAACGGCATTAATAAGTAATAAATTGTTTTAATACGAATTAATGCATTTAAAATTAATTAAATATAAAACCCTTTAT 
TTACTGATTTACTACATAATCTCATGCCAAATTGGGATCAATAATAAAAAGATTTCTTTTTATGCTGATTATCAATTGAAGACATTTCATTCCGA 
TAAAATTGTACTTCCCAAATTGTCGTTTAACTTTGTTTATAAATTTTCCCGTAAAAGTAGACGTTTAAAAAGTTAATTGGATACTATCAAAATTG 
GTTTAAAAGAAATTTTGTCATCACTATACTAAACCGTCTATGAACAAGACTCAGAATGGTATTGCAGAAGATTTCACGAGACGCTTATCCATGTA 
T C C AT G AG AC GT G T T CAT T AC AAT C AC C AAAAT GC CAT T T GAAT T GT T T T GT GAT AT GCGC G AC T G AC GAAC T C G AC G T C T GG AC C AC AC C C C AC 
GAAACGATACCCACAGCGAGGTGATAAGGGCGTAACGCCCAAGTCACCCCCGCAAGATGCACCTTGCAGTCCCGGCGAGATTAGTCGATAATGCC 
CCCCTTTTCGGTGGCAGTTTCTGGGTGCTACGACCTGGGGCCTTTACGTGGGCACCTATTCCATACGTGCATATACGTATATCGTCGGCGGCCTC 
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CCACCAACGCAAATTAGACCGTCACGCACTCACACACACGCATGAACCGCAGCAGCAGCAGCACCCAAGTAAATGTACATACATATGTGTGGGAC 
GGAACGGCTGACTCTTTCAGATTTTCGCACTGAAACATTCGCACTAAATCGCAACAATGCACACACAAAGGTATCTCTCACACTTTCTAATCGTA 
GTGGCTGCTGCGATTTTGATTTTGGTTTTGAACGGCGCAGCTTAGTTTTACACAGTCCACCTCACTTTACAGCATATCGTACACTCACCGCCCGT 
CACACTTTTCGTTTGGCGGCTGGCACCACAACAAACATTAAGCAGCCACTTCCGTTTCGGGGGTTCGGTGGCAAATTCGATCCGATGCGAACTCT 
AATTAGCCAATTTGGGGCTTGAAAACTGCAAATATTTGCTCGTTTTGTGGAAACTAAGATTTCCACACCTTTGCAGTGTGACCGAAGCTTTGTGG 
TTCGAAAATACTGCAGGGATTCATCATTTCGGAAAATCAGACAAATATACTGCACGATGCGGTGGGGAAAAAGTTATTTTTATATTAAATGTAAT 
ATGTATTTTAATTACTATATTTAAATAAATTTTGCAAGGGAATTTTCAGTAACTCATTTCATATTGTTTATTTACTGTATAGAATGAAATATTTT 
AAATATAAATATATATTCAATTTATTTTATTTTATTTTAACTCTATATTATCCATAATAAGCTATTTTATTAACGTTATTTAAAATTACTGTTTG 
CAGTTTCTTCCATTAACTTTAGAAATGTATATTTGGTATTTTTGGTATTTATCCGCCGATCGATAACTAGTGCTCCAGCCCTGGCCCCAATGCAT 
AACATCTCTGTTTCGTCGCACACTGAATAGGTAAGTTTTCCGCACATTTCGTTGATTTCCTCTTGTATAATTCCAACTGCTTGCAGAACCAAATT 
GAAAAGATGGCGAGTTTGGCTACCAAGGGATCAGGACTTGTGAACAGTGAGTGTCGTGAATCGAATGCAATCGAATGGATACACCACCATTGGCG 
CCTGTCAATCGCAAAATTATGTAATGATCACTTTTTGCAGGGCTCCTCACACAGGCGAGGCCCCAACTGGACGTGTTCCTGAAGTACGCCAAGGT 
GGAACTGACGCCCCCGACGCCCGCCGATATTCCGGCCATTCGCCAAGGACTGGGCAACATCATCAAGGGAGCCAAGACCGGCGCCTACAAGAACC 
TCACGGTTCGCGAGGCCTGGCTTAACACCCTGGTGACCGCCGAGGTCATCTTCTGGTTCTACATCGGCGAGTGCATCGGCAAGCGTCACATTGTA 
GGCTACAATGTCTAAGCTTACTATAGTCTCCGCTTGGCAGTCACTGGAATGGGCAACGTAATCCCTT^ACAGATGTGTATATTTATATGTCTGCGA 
ACATTTCGACTCTGAATAAAGTGAAATAGTAATTTAAAATTCCGAAAATTTTCGAAAAATACATTGTTTTTTGAAAACCGTTAGAACGTTTGCGC 
GG GAT T T GT GT AAG CT AAAG ATG AGGT GAT GT AAAAC C AAGT T T GG AAT T AAAAGT T G AC GAT AT T T AAT G AT AAAAAT T AAAAAAAT AT AT GT A 
ATGTTATACATCAAATGTTTATGAAACGGTGTCGTAATCAAAGAGGCTAATGGTTCAGAAATACATAATATACTTAGAGCATTTAAAAGCACTCA 
AGAAT AATTTT AT TTAAAAAAAAAAAT AAATAAATCT AAATTTGCTTTTCAT AGATAATT C AT TT AC ACATT TTTTT AAACAAAGTAAAGT AAGA 
TGTATTGAATTATTTTATTTATAAATAACGTTTTTTATTTGAATCTTGAAAAGCATTAATATTATTATTATTATACTTATATTTTTTAACAACAA 
AACTTTTGTAGACAGAGTAAATTTTTGTAATCTAACTGCGGTCACACTTTACTTTAGTTACCTTCCGATCGGAAGAAGAACCCGGCTGACATTAG 
GAATAGGAACTTGGCAGAACTTTCCCAGGATCTGTCAGAATGGCCTACAAGAAGCGCCTACAATCGCAGAACCTCGTGCATCTGCTGCAGAACCG 
C GAGT C C GG AT AC AC C AACGT GGGC C AAC AGC C GG GC AGG AT GC C GC T GTT GGCCTACGAGCGTCTCTT C T AC AAGT GC AT C AC GC C T TGC C T G A 
CAATTGACTCGATTACCATTCCGCCGATCTACCTGCGCAAGTTTACGCCGGATGGCAGAAAACTGTTGGCCTTCTCGCAGGATCAGCGCAGTCTG 
CTCATATACAGCTACGGGGGTTCTAGTTGTGCGGCGGTGGGTGAACTGATTCGCCAGGCGGATGTAGGCAGTGGCGAGTGCTTCAGTAGCCAGGA 
CACCATACTCAAGAGCAGGATCTTTGAACGCCTGTTTCCCACAAAAGAGACGCTGAATCTATGCCAAGGCGACTTTGGCCTCTACTATCTGCACC 
GGGAGTTCAGTGTGTTCCTGGAAGAGGGTCGTTATGCCATGCTGGCTGCCATGACCGTTGTGCGCGGCGCACTGCCGGTCGATGACTACGTACGG 
TACCCAGATCTGTTCGACAAAGTGGACGCCTTCTCGTATGTGTTCTTTCTGGTGGATTTGAAGCTAGGCGTCGTCACAGATAGACTTATCCTGCC 
TAACGACTCCATCGTCATTGCCCACAATCATGGAATTTCCGTGTTTGGCTCAAC 
(SEQ ID NO: 685) 

Exon: 1001.. 1296 
Exon: 1524.. 1645 
Exon: 1702.. 1756 
Exon: 1846.. 2474 
Start ATG: 1717 

Transcript No. : CT19171 

CACAGTCCACCTCACTTTACAGCATATCGTACACTCACCGCCCGTCACACTTTTCGTTTGGCGGCTGGCACCACAACAAACATTAAGCAGCCACT 

TCCGTTTCGGGGGTTCGGTGGCAAATTCGATCCGATGCGAACTCTAATTAGCCAATTTGGGGCTTGAAAACTGCAAATATTTGCTCGTTTTGTGG 

AAACTAAGATTTCCACACCTTTGCAGTGTGACCGAAGCTTTGTGGTTCGAAAATACTGCAGGGATTCATCATTTCGGAAAATCAGACAAATATAC 

TGCACGATGCGTTTCTTCCATTAACTTTAGAAATGTATATTTGGTATTTTTGGTATTTATCCGCCGATCGATAACTAGTGCTCCAGCCCTGGCCC 

CAATGCATAACATCTCTGTTTCGTCGCACACTGAATAGAACCAAATTGAAAAGATGGCGAGTTTGGCTACCAAGGGATCAGGACTTGTGAACAGG 

CTCCTCACACAGGCGAGGCCCCAACTGGACGTGTTCCTGAAGTACGCCAAGGTGGAACTGACGCCCCCGACGCCCGCCGATATTCCGGCCATTCG 

CCAAGGACTGGGCAACATCATCAAGGGAGCCAAGACCGGCGCCTACAAGAACCTCACGGTTCGCGAGGCCTGGCTTAACACCCTGGTGACCGCCG 

AGGTCATCTTCTGGTTCTACATCGGCGAGTGCATCGGCAAGCGTCACATTGTAGGCTACAATGTCTAAGCTTACTATAGTCTCCGCTTGGCAGTC 

ACTGGAATGGGCAACGTAATCCCTAACAGATGTGTATATTTATATGTCTGCGAACATTTCGACTCTGAATAAAGTGAAATAGTAATTTAAAATTC 

CGAAAATTTTCGAAAAATACATTGTTTTTTGAAAACCGTTAGAACGTTTGCGCGGGATTTGTGTAAGCTAAAGATGAGGTGATGTAAAACCAAGT 

TTGGAATTAAAAGTTGACGATATTTAATGATAAAAATTAAAAAAATATATGTAATGTTATACATCAAATGTTTATGAAACGGTGTCGTAATCAAA 

GAGGCTAATGGTTCAGAAATACATAATATACTTAGAGCATTTAAAAGCACTCAAGAA 

(SEQ ID NO: 686} 

Start ATG: 434 

MASLATKGSGLVNRLLTQARPQLDVFLKYAKVELTPPTPADIPAIRQGLGNIIKGAKTGAYKNLTVREAWLNTLVTAEVIFWFYIGECIGKRHIV 
GYNV* 

(SEQ ID NO: 687) 

Name: ATP synthase g subunit 
Classification: enzyme 



Celera Sequence No. : 142000013384288 

TTTCCGATAGAGTAGCCGGCATTCGGATATACCTGGAAGCTTTCGATGATAGGACAGATGCCACGCGCCTAGTTATTTTTGCCAAAGTGCGTAAA 
CTAACGCCATGGGCCATTGCAGTTAATTTGGGGTGCTTAAATAAGAAAAAAATT T AGT AAC AAATCTTAGTTTTCATTTTTGGCGGGC AATTTT G 
AAAGCCAGTTCACACGGTTTTGATGTACATACAAATTCGCGAGTCTGGCATACAGAAATGTTTACTGTACCTCTGTGCCGCCCCACAAGCGAATA 
GCGGATAGAATCTGCAAATCTATTGGAACTTGCTCCCTCTTTAACGGCTCAAAATATTCAATTTCCAAGTCATCCCGTACAATTTCAATAATTCT 
GCGAAGGTTTTCCTTAGTGTATCGGTACTTTAGTCGAAATTTATGCTCATTTAGATCGGTCAATGGATTAAATCTTGGGTTCCTTTTCTTTATAT 
GCGTTGCCCGCGCTCGAGAATTGTGTTTGATATTCAGAAAACATTGAAAGTCTTTGAAATAATCAAAGTCTTTTAGAGACCTTTGAAATCTTTGT 
ATATCTTCCATTTTCACTTGTTTACTTGCATAGCAGGGTGGCTGTTTTGTATATATTAGAAAATACCTCATACGTACATGCGGTATTTAAGGCTC 
TTGCAACGATTGAAATTTTATGATGAAACTTTACACTTTAAAAATAATCTTTTTAAATGTTGTTAATTAAAAAAATGTAAGTAAGTAAATTAATA 
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CTTTTATTTTTTTTAAGCACGTTATGTACTTTTGGAGTTAAATAATTTTTTTTTATGTATTTCTATACTTTTAAAAAAACAGAACAGAAAGAAAT 
CAAAGCTCGAAGTACCAAAAAATAAATACTAGGGAAATAGAACCTATTGCGAAGATTTGCAAAAGCGATTGTTTTCAGTTGTGAAGACTAATATC 
TACACATTCCAAAAGCTTAACGATCTTTGAAATCTGGCCATTTGGTCACACTGAGTCCAAAGCGTCGTTTTCAAAGTACTCTTTCAGTTTCCATT 
GTGAAGTTTTAAGTGATCGCGAGTGCCAAAAAGTAACAATGGCTGATGATCAGGGACGTGGACGCAGGCGTCCACTTAACGAAGATGATTCCTCT 
ACTTCCCGAGGTAGTGGTGATGGGCCGGTATGTAAACTCTTATTCAAGCTAGGAAAAAAAATCATAGCGGTAAAGCAAATTCCTTATAATTCCGT 
TATGTCGCAAACACAAACACAGCCATGTACATAGATTGTAAATGCGCGCATTTCTTCGGAATTCGACACCAAAAGGAAGCACTAAAACTAACAAA 
CAAAAATCTGCTATGAATTTGAGCTTGACTGCATAAAATCTTCTTGAATTTCTGTTGACCAAGCACTTTTTTCAAAGGGAATTCATTAAACATAT 
TTCCAAAAACGCTTCTTTCAATTTGAATTTCCAAGGAAATGTAAAGCATTATCTGTTTCGTTTCATTTATTTTTTTTTAATAAAAACTGCTAAAA 
TAACCATCACCTGACAGGAAGGTGGCTGTGGCAAGCCGAATCTTATTTACCCTTGCAGTCTATGTCTTTGGAATCTGCATGCCCGACGTTGATAT 
GGACGGACATGACCAAATGGACCCGGCTAGTGATACTCATCAAAAATCATGGTCGGAAACGCTTCCTTCTATCTGTCACATACATTTGATTCCTA 
GAAACTCTACGAGTAACGGGTATTTCTCCGATATTCATCCCAATTCTATACAACTTTGAGATGGCTTTCAGCATTGATATACTCTGAAATTGCAG 
GCAAAGCATACTACAAAAACTATTTTTTACCATTTTTTTTCTGCTTCATCTTTGATTATGAGTAACAATATTTTCGTAGCTTGACCGTTAGTTTT 
ATGATTTAAAATCTGTCGTACTTTAGGTAGTTGTTTAAATGGCTCACATTTAGGAATCACAACTATTTGTTTAAACCCCTTAATATTTTTTCTTT 
TAATAATAAATAATAAACAGGTAATACTTTTTGATTGTTCATCGCGATTGTTTTCAATGTACAAAGTTGAGTCTTTCTGGTGTTTATTGCTGTCT 
CGTGTTTATTGCTGAATCTCGCACGATCTCTCTCTTTCTTCTATGTTATTTTTTCCATCGGCGTCAATTATTCATAACTGCCGCTAGGTGGAGCT 
AGTGTGCAGTCATTCACTTCTATTAGGAGACAGCGTCCTCTATATCTAATATTATAATTTTTTTTAAATGAGTAACTTACATCATTTAATGCTTT 
C AT GC AGC GGG T G AAAG TAT T C AG AGG AT C T T C AT C AGGT GAC CCG AG AG C GG AT C C TC G T AT AG AGGC T T C AAG AG AGAG AAG AGC T C T C G AGG 
AAGCTCCCAGGCGTGAAGGTGGCCCGACAGAGCGAAAGCCGTGGGGTGACCAATATGATTACCTGAATACCCGTCCGGCTGAGCTGGTATCCAAG 
AAG GGAAC CG AT GG CGTCCCGGTCATGCT GC AGAC G AAC T T T T TC C GAT T AAAAAC C AAGCC GGAAT GG C GG AT C GT T CAT TAT C AC GT GG AGT T 
TGAGCCGAGCATCGAGAATCCTCGTGTCCGTATGGGAGTTTTGTCCAATCATGCTAACCTTCTGGGATCAGGCTATCTATTCGATGGACTGCAAC 
TGTTCACCACCAGGAAATTCGAGCAGGAAATCACGGTGCTCAGCGGAAAGTCGAAGCTGGACATTGAATACAAGATATCCATAAAGTTCGTTGGA 
TTCATATCGTGTGCTGAGCCCCGCTTTTTGCAAGTCTTAAATCTAATATTGCGCCGCTCCATGAAGGGCCTAAATTTGGAATTAGTTGGCCGTAA 
TCTCTTTGATCCCCGAGCTAAGGTAATTTCTTTTGGTATCGTTTTAAGTGCTGTTTGGTCAAATATGATTTATTTTAGATCGAAATAAGGGAGTT 
C AAAAT G GAGCT AT G GC C GGGC TAT GAG AC AT CG AT T C GT C AGC AC G AAAAAG AT AT T T T AT T G GGC AC C G AAAT AAC T C AC AAAGT T AT GC GC A 
CCGAGACGATCTACGACATAATGCGACGTTGCTCACACAATCCGGCTCGTCATCAGGACGAAGTACGGGTAAATGTTTTGGACTTGATTGTCCTT 
ACGGATTACAATAACAGAACTTATCGTATCAATGATGTCGACTTTGGACAAACTCCGAAATCAACATTCAGTTGCAAGGGTAGAGATATCAGTTT 
CGTGGAATACTATCTCACTGTAAGTCTAATATTTACCACGGATATACATCGGCGTCTTATTTCTCCCAACATTTGTTCTCTTTAGAAATATAATA 
TACGCATTCGCGACCACAATCAGCCGCTGCTGATCTCCAAAAATAGGGACAAGGCTCTAAAAACTAACGCTAGCGAATTAGTGGTACTAATTCCT 
GAGCTCTGCCGAGTGACTGGGCTCAATGCCGAGATGCGCTCAAACTTTCAGTAAGACTTTAAACTATATTTAAATTAACAAGCTCTTGTGTCGCA 
AACTATCTATTTCTTAAGTTATATTCTGTGTATCCTTTCTTTACATGACTAAAACTATATTAATTTCGCTAGTCCGTCCGATTCTTGACCTGTTT 
GGAGTCAACGAGTTCATGCCGATCGGTTCGAAAAAACTTCTTACTGTTTGCAAACCTGAGATTACCTCCTTATCCCAGTAGACTAATGTTATTAC 
AATAGAACTAATAACTAATAAGACTAATATGAGATAGAATTATTCCCCGACTATCAGATACCCGTTACTCAGCTAGTGGAACGGTAATTTTCATC 
AT TTTTCTGTG GT AT C GAT AG AT AT T G AGG AAT AAAAT GAG AAC AT T T T T T AAAAAT T TAAAACT AAAAT T TT AAAAGT C T GGT CGT GAC C GT T T 
TGGCCGTTTCGTATGTGAAATGAGAGGCGCGGCTAAAGTGTTTTAGCTATACAAATAAAAATAGGCAAGACAAACAATAAAACGAAGAAAAATCT 
GAGCACTTTTCAAAAGTGTGGGCGTGGCAGTTTAGGGCGGTTTTATGGCCCATAAAAAATGTAATATTTAATTATTATAAACAGGCTTATGCGTG 
CCATGAGCAGTTATACGCGAATGAACCCCAAACAACGCACTGATCGATTGCGCGCTTTTAACCACCGTTTACAAAACACTCCAGAAAGTGTGAAG 
GTCTTGAGAGACTGGAACATGGAACTGGACAAGAACGTCACAGAAGTACAAGGCCGGATAATTGGACAGCAGAACATCGTGTTTCATAATGGAAA 
GTGAGTTGAGCAAGTTAAGAGCACTGCAAGTTTATAGTTAATAATTTTTTCAGGGTTCCTGCTGGAGAAAACGCTGATTGGCAAAGGCACTTCAG 
AGACCAAAGGATGCTTACCACTCCGAGCGATGGCCTCGATCGTTGGGCTGTCATCGCGCCGCAAAGGAATTCCCATGAACTCCGAACTCTACTTG 
ACTCTTTGTATAGAGCAGCTAGTGGAATGGGTCTTAGAATTCGAAGCCCCCAGGAGTGAGTTTTTATAAAAAAACACGTATACCTCACATTACGT 
CTTTACTTAATTCGTAAAGAGAGGTTTCGTGCTTCGTTTTCTGGGTATATATCCACTTTAACTATCGTTTTACAAACAGATTCATAATTTATGAT 
GATCGCACTGGAACTTATGTGAGAGCAATGGATGATTGTGTGCGCTCAGATCCCAAACTTATATTATGCCTCGTACCCAATGATAACGCCGAAAG 
GTATTTATTCCCTAATAAGTGTACAAATCTAAACTTAATGACTATATTTATCGAAGACATTGTTTAATTTTGGTACACTAAAGTTAATATAAATG 
AAAGTACTGAAGTAAACCTTTTTATTTTAGATACTCATCAATCAAAAAGAGAGGATACGTTGACAGGGCGGTGCCAACTCAAGTTGTGACCCTTA 
AAACGACCAAGAACCGTAGCCTTATGAGCATTGCCACCAAAATAGCAATCCAACTGAATTGCAAGTTGGGATATACACCCTGGATGATCGAACTA 
CCCTTGTCCGGACTGATGACAATTGGCTTTGACATTGCGAAGAGCACACGAGATCGGAAGAGGGCCTACGGAGCATTGATTGCCTCAATGGATCT 
ACAGCAAAACTCCACGTACTTCAGCACAGTCACGGAGTGCAGCGCCTTTGATGTGCTCGCTAACACCCTTTGGCCGATGATAGCAAAGGCCCTGC 
GCCAATATCAACATGAGCATAGGAAGCTGCCATCTCGAATCGTATTTTATCGAGACGGTGTGAGCTCCGGCTCTCTAAAGCAGCTTTTTGAATTT 
GAAGTCAAGGACATCATTGAGAAGTTGAAAACTGAATACGCCCGCGTCCAGCTAAGCCCACCGCAATTAGCTTATATTGTGGTAACCAGATCCAT 
GAACACGCGCTTCTTCCTCAACGGACAAAATCCTCCGCCTGGTACTATAGTTGATGACGTTATAACTCTGCCCGAGAGATACGACTTTTATCTGG 
TCTCGCAACAAGTTCGTCAGGGTACAGTGTCGCCGACCAGCTACAATGTTCTTTATAGCAGCATGGGTCTCTCACCGGAGAAAATGCAAAAACTT 
ACGTACAAGATGTGCCACTTGTACTACAATTGGTCGGGCACCACACGAGTGCCAGCAGTTTGCCAGTACGCTAAGAAGCTAGCTACCCTCGTGGG 
TACGAACTTGCACTCTATTCCGCAAAACGCGCTCGAAAAGAAGTTTTATTATCTATAATTGGATATAATTTAGAATGGAGTATTAATCCTTACTA 
AGAGGCCATATATGAAACTAGCCCAGACATTTATACTTTTTCAATACTTTCTTACTTTTGCTAAGCACTTCAGCATTTATGACTAAATATTTTGT 
ATTTGAAATGCATTACTGCTCTTTTTTCAAACAAAAGCAAAATTGAGGATTAAGATTCTGGTATTTAAGCATAAGACCAGAGGAAATTCCCAAAC 
AAACATTTAAAGTTATCTATCAAGACATGTTCATTAATTTGGAATATAATTACTTTATTTTTTATTGTATATTTTAGTTTATGTAAAGAAAAATT 
ACATACATCCATGTTTGCTTACTTAACCACACATTCATGGCTGCTTATATTCGTGAAACATATTTTATTAATTATACAGGTATTTAACTTTCACA 
AGGATAATACAATTTCATAGCTTGTTGAGTATACTGAAATGCTACGGAGTTCGATATATTTTAATGTGTTTGTATTTACAAAATTCTAAATTTTC 
ATTTTGGATAAATGTTTTATATTAATCGTATCGATTGCAAGTTGGGGTAGGATTGAGTTCAGTGATAATATCGATTATTTATAATTTAATATCCA 
AACACCGACTACTATGATACAGAAGCCCATATCTACTCAGACCTTGACTCCAAAGAAGCGAACAAAATTAGCAAGTCTTTTAAATCGCTTTCCTT 
GGATTGTCGTCTAGGCTTAAAATGCAAATGCGTTCGTTCACTATAGCGATTAATTTAACGATAATTACTCGTTGGTGTATATACAATTCAATGAA 
ACTTTTGATTTTAGACATATTATTTAGGATTAGGTGTATATTGTATAAAGGGTGGTAGAAAAGCTTTGGTGCTTTTGTGTAATCCAATTTCATAC 
CACGCTTGGTTTCGACCATCGTATGGATTAATTCTCTTTGGAATTACTTCTTCTTTTTGTTTTTTAGAATTATGTCAGGCTTTAACGAGTTAATC 
AGGGATCAAGCCGATGTTTCGTTCGGTTCCATCCAGCCCTCGCTGTCGTCTTCGCTATCCGAATCCTCGGACTCGGATAATTCGATAGCCACACG 
TCTGGCCAAGATTGAAGCCACATCCAATGGCGCCGCGTTTCGTTCAATTTCCTTTTGCTCTGACTTTTCAACTTTACGTAGAGTAATACCTGGAA 
AGATTAATTTGATGATATTTTTGCATTCGAACCTTGAATGGCAATTACTTACCATCTCGAATGGCCTTCATCAAGTCGTTGCGTGGGTCATGGAA 
TGGAGGCAGCATCTTCTTGGGTGCGACAATGTGAGGCACACCATTTTGTTGGCCGGGCTGCTGCATTTCGCCATTGGCCAGTGATTTGGGAAGAA 
TTTGATGCGGGCGCAGCGTGTGTTGGTTGCCACTGCCCATGCCTTCCTCGACTGGAGGCGGTGGCGGCGGCGGTGGTGGCGCGGCATTAGGTGGC 
GACATCTTAGGCGAGTGCTGTAAGGTTTTGCCACAAATTAAACCAAATTGTGATATATTAACACCCATCAGTGAACAACAAACCTGATCTGGAAC 
CGGCGGCGGTGGCGGCAAGTCGCTAAGGGAGTTCAGTTGATTGCCCGTCATACCGATGCTGTGGAAAGTATTGCTCAGTTGCGTCATGACCATAT 
CATTGGCATTCTGGACGCTGTTGGGAGATCCAGCTCCCGACGAACTGTTCGCCCGGCCAAGCAGTTTGGCCGCCATATGCCCGCTGTTGACTCCG 
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TTCATGCCAGACATTGGTGAAATGACGTCTGGAACTGGTGGAGGCGGTGGCAAGGCATCCCGACTGGTCGACATACTTCTGCCCCTTGTCGGAGT 
GTTGGCATTGGAGGCTGTGGGCGTACCACCACCGCTGCCATTCGATGGCGGAGCCGGTGGCGGCTGC 
(SEQ ID NO: 688) 

Exon: 1001.. 1167 
Exon: 2288.. 2872 
Exon: 2929.. 3249 
Exon: 3316.. 3470 
Exon: 4075.. 4275 
Exon: 4329. .4520 
Exon: 4640. .4750 
Exon: 4876.-6667 
Start ATG: 1084 

Transcript No. : CT19209 

CTGAGTCCAAAGCGTCGTTTTCAAAGTACTCTTTCAGTTTCCATTGTGAAGTTTTAAGTGATCGCGAGTGCCAAAAAGTAACAATGGCTGATGAT 
CAGGGACGTGGACGCAGGCGTCCACTTAACGAAGATGATTCCTCTACTTCCCGAGGTAGTGGTGATGGGCCGCGGGTGAAAGTATTCAGAGGATC 
TTCATCAGGTGACCCGAGAGCGGATCCTCGTATAGAGGCTTCAAGAGAGAGAAGAGCTCTCGAGGAAGCTCCCAGGCGTGAAGGTGGCCCGACAG 
AGCGAAAGCCGTGGGGTGACCAATATGATTACCTGAATACCCGTCCGGCTGAGCTGGTATCCAAGAAGGGAACCGATGGCGTCCCGGTCATGCTG 
CAGACGAACTTTTTCCGATTAAAAACCAAGCCGGAATGGCGGATCGTTCATTATCACGTGGAGTTTGAGCCGAGCATCGAGAATCCTCGTGTCCG 
TATGGGAGTTTTGTCCAATCATGCTAACCTTCTGGGATCAGGCTATCTATTCGATGGACTGCAACTGTTCACCACCAGGAAATTCGAGCAGGAAA 
TCACGGTGCTCAGCGGAAAGTCGAAGCTGGACATTGAATACAAGATATCCATAAAGTTCGTTGGATTCATATCGTGTGCTGAGCCCCGCTTTTTG 

C AAGT C T T AAAT C T AAT AT TGCGCCGCTC CAT GAAG GGCCT AAAT T T GGAAT T AGT T GGC C GT AAT C T CT T T GAT C CC C GAG CT AAG AT CG AAAT 
AAGGGAGTTCAAAATGGAGCTATGGCCGGGCTATGAGACATCGATTCGTCAGCACGAAAAAGATATTTTATTGGGCACCGAAATAACTCACAAAG 
TTATGCGCACCGAGACGATCTACGACATAATGCGACGTTGCTCACACAATCCGGCTCGTCATCAGGACGAAGTACGGGTAAATGTTTTGGACTTG 

AT T GTCC T T AC GG AT T AC AAT AAC AG AAC TT AT C GT AT C AAT G AT GT C G AC T T T GG AC AAAC T CC GAAAT C AAC AT T C AG T T GC AAGGGT AG AG A 
TATCAGTTTCGTGGAATACTATCTCACTAAATATAATATACGCATTCGCGACCACAATCAGCCGCTGCTGATCTCCAAAAATAGGGACAAGGCTC 
TAAAAACTAACGCTAGCGAATTAGTGGTACTAATTCCTGAGCTCTGCCGAGTGACTGGGCTCAATGCCGAGATGCGCTCAAACTTTCAGCTTATG 
CGTGCCATGAGCAGTTATACGCGAATGAACCCCAAACAACGCACTGATCGATTGCGCGCTTTTAACCACCGTTTACAAAACACTCCAGAAAGTGT 
GAAGGTCTTGAGAGACTGGAACATGGAACTGGACAAGAACGTCACAGAAGTACAAGGCCGGATAATTGGACAGCAGAACATCGTGTTTCATAATG 
GAAAGGTTCCTGCTGGAGAAAACGCTGATTGGCAAAGGCACTTCAGAGACCAAAGGATGCTTACCACTCCGAGCGATGGCCTCGATCGTTGGGCT 
GTCATCGCGCCGCAAAGGAATTCCCATGAACTCCGAACTCTACTTGACTCTTTGTATAGAGCAGCTAGTGGAATGGGTCTTAGAATTCGAAGCCC 
CCAGGAATTCATAATTTATGATGATCGCACTGGAACTTATGTGAGAGCAATGGATGATTGTGTGCGCTCAGATCCCAAACTTATATTATGCCTCG 
TACCCAATGATAACGCCGAAAGATACTCATCAATCAAAAAGAGAGGATACGTTGACAGGGCGGTGCCAACTCAAGTTGTGACCCTTAAAACGACC 
AAGAACCGTAGCCTTATGAGCATTGCCACCAAAATAGCAATCCAACTGAATTGCAAGTTGGGATATACACCCTGGATGATCGAACTACCCTTGTC 
CGGACTGATGACAATTGGCTTTGACATTGCGAAGAGCACACGAGATCGGAAGAGGGCCTACGGAGCATTGATTGCCTCAATGGATCTACAGCAAA 
ACTCCACGTACTTCAGCACAGTCACGGAGTGCAGCGCCTTTGATGTGCTCGCTAACACCCTTTGGCCGATGATAGCAAAGGCCCTGCGCCAATAT 
CAACATGAGCATAGGAAGCTGCCATCTCGAATCGTATTTTATCGAGACGGTGTGAGCTCCGGCTCTCTAAAGCAGCTTTTTGAATTTGAAGTCAA 
GGACATCATTGAGAAGTTGAAAACTGAATACGCCCGCGTCCAGCTAAGCCCACCGCAATTAGCTTATATTGTGGTAACCAGATCCATGAACACGC 
GCTTCTTCCTCAACGGACAAAATCCTCCGCCTGGTACTATAGTTGATGACGTTATAACTCTGCCCGAGAGATACGACTTTTATCTGGTCTCGCAA 
CAAGTTCGTCAGGGTACAGTGTCGCCGACCAGCTACAATGTTCTTTATAGCAGCATGGGTCTCTCACCGGAGAAAATGCAAAAACTTACGTACAA 
GATGTGCCACTTGTACTACAATTGGTCGGGCACCACACGAGTGCCAGCAGTTTGCCAGTACGCTAAGAAGCTAGCTACCCTCGTGGGTACGAACT 
TGCACTCTATTCCGCAAAACGCGCTCGAAAAGAAGTTTTATTATCTATAATTGGATATAATTTAGAATGGAGTATTAATCCTTACTAAGAGGCCA 
TATATGAAACTAGCCCAGACATTTATACTTTTTCAATACTTTCTTACTTTTGCTAAGCACTTCAGCATTTATGACTAAATATTTTGTATTTGAAA 
T G C AT T AC TGCTCTTTTTT C AAAC AAAAGC AAAAT T GAG GAT T AAGAT T CT GGT ATT T AAG C AT AAGAC C AG AGG AAAT T CC C AAAC AAAC AT T T 
AAAGTTATCTATCAAGACATGTTCATTAATTTGGAATATAATTACTTTATTTTTTATTGTATATTTTAGTTTATGTAAAGAAAAATTACATACAT 
CCATGTTTGCTTACTTAACCACACATTCATGGCTGCTTATATTCGTGAAACATATTTTATTAATTATACAGGTATTTAACTTTCACAAGGATAAT 
ACAATTTCATAGCTTGTTGAGTATACTGAAATGCTACGGAGTTCGATATATTTTAATGTGTTTGTATTTACAAAATTCTAAATTTTCATTTTGGA 
TAAATGTTTTATATTAATCGTATCGATTGCAAGTTGGGGTAGGATTGAGTTCAGTGATAATATCGATTATTTATAATTTAATATCCAAACACCGA 
CTACTATGATACAGAAGCCCATATCTACTCAGACCTTGACTCCAAAGAAGCGAACAAAATTAGCAAGTCTTTTAAATCGCTTTCCTTGGATTGTC 
GTCTAGGCTTAAAATGCAAATGCGTTCGTTCACTATAGCGATTAATTTAACGATAATTACTCGTTGGTGTATATACAATTCAATGAAACTTTTGA 
TTTTAGACATATTATTTAGGATTAGGTGTATATTGTATAAAGGGTGGTAGAAAAGCTTTGGTGCTTTTGTGTAATCCAATTTCATACCACGCTTG 

GTTTCGACC 

{SEQ ID NO: 689) 

Start ATG: 84 

MADDQGRGRRRPLNEDDSSTSRGSGDGPRVKVFRGSSSGDPRADPRIEASRERRALEEAPRREGGPTERKPWGDQYDYLNTRPAELVSKKGTDGV 
PVMLQTNFFRLKTKPEWRIVHYHVEFEPSIENPRVRMGVLSNHANLLGSGYLFDGLQLFTTRKFEQEITVLSGKSKLDIEYKISIKFVGFISCAE 
PRFLQVLNLILRRSMKGLNLELVGRNLFDPRAKIEIREFKMELWPGYETSIRQHEKDILLGTEITHKVMRTETIYDIMRRCSHNPARHQDEVRVN 
VLDLIVLTDYNNRTYRINDVDFGQTPKSTFSCKGRDISFVEYYLTKYNIRIRDHNQPLLISKNRDKALKTNASELVVLIPELCRVTGLNAEMRSN 

FQLMPJ^MSSYTRMNPKQRTDRLRAFNHRLQNTPES 

DRWAVIAPQRNSHELRTLLDSLYRAASGMGLRIRSPQEFIIYDDRTGTYVRAMDDCVRSDPKLILCLVPNDNAERYSSIKKRGYVDRAVPTQVVT 
LKTTKNRSLMSIATKIAIQLNCKLGYTPWMIELPLSGLMTIGFDIAKSTRDRKRAYGALIASMDLQQNSTYFSTVTECSAFDVLANTLWPMIAKA 
LRQYQHEHRKLPSRIVFYRDGVSSGSLKQLFEFEVKDIIEKLKTEYARVQLSPPQLAYIVVTRSMNTRFFLNGQNPPPGTIVDDVITLPERYDFY 
LVSQQVRQGTVSPTSYNVLYSSMGLSPEKMQKLTYKMCHLYYNWSGTTRVPAVCQYAKKLATLVGTNLHSIPQNALEKKFYYL* 

(SEQ ID NO: 690) 

Classification : known_f lybase_gene 
Gene Symbol : piwi 
FlyBase ID: FBgn0004872 
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Celera Sequence No. : 142000013384554 

TAAGTTCTCGAATATTTTATTTAAAATTGTGTTGTACTTCAGTTTTAGTCTGATAATAAGACATACAATTTAATAAAACAACAAACCAGTTTTCA 

TGCCACTTAAAATGAAAACCAAGAAAACACATTTAAAAACAAGCCCAACAAAAAAATTGGGTACAAAGTAAAAGAGATAAACAATT 

ATAACACTTTAATTATGTATTTCGTCTTTTTATTAGGCGGGCTAAATATTAACTAAATAAAAACAAGATTGAACGCTACAGTCTGGATCCTCGAA 

TTGCAGATAAACGATACTCTGGTAGTAGAAGTTCCCGCCAGAAATCACTATTTTTTACGAATACTCCAACCATAAATAAAATAAAACTAAAATTT 

TGGATTTATTGTATGTTTATTTATTTGAAATATCAGTTTGTATGCTGCGGTTCTCTATAATTTATTTCAATTATCCATAATGATTTAGTATTATC 

TGAACCTTCTTTGTTGGTATTATGGTGTTAAATACAGATCTTGAGCATCGTTAATATTGCTTAACTTTTCCGTTTTCATGTTAGATGCCTTGCTA 

AATGTGCATTTTTATTTCATACAAACTACGTAATTTCCGTACGTTATGAAGATGCTGAGTTGATTGTATCTTTGTTTGAAGAGTAGATGCGTTTA 

CTTATTTTGGTTTTGATTTTAGATGCTGATGTAGTTTTATGCTATAAATATTTAGCTACGTTCAATTCTGCTTAATATTTAGAGAACATTAAATT 

TTGAAGATTTTATTTCCTAAATATTCCAATGATTTGCTACAAATCCAACAGATGAACGTATGAGGTTTAATTCTTAGCCTAAAACTTTATTATTA 

ATGTATATGTGTTTCAACTCCATGAAATTCCGCTCGTAACTACTTACTAAGAAATGTTAAAATAAATATTTTACGTTAATGTGTAAATTTAAATA 

CAATATATATTTATTGCAAATTGCTTTGTAAATTTAATATAATTTTGTCCTTATGTTGATTTTTGTAGATTTAATTTATTTAAACGTTGTGAAGG 

GGTACAAAGTGTGACGTTCAGGCTACGCTTAACTAACAACAAAATTCACATATAACAACAATAAAATGAAAATGTTACAAAAAATATATATCCTT 

CAATTAGTTTATGGTGTATGGTAAAAATACAATGTAAATATAGAGACTTTTTGTTTCGTTCGTGTGTACTATTCATTTAACATTTTGTGCTGCTT 

CAACGGTTTCCCATATAAATAAGTTCCCCTGACACAAGGAGGGAAAAATCAAAAATTATTTCGATGCAGGTAAGTTGGATTCGACAAAATTCGTT 

TCAAGTGTGGGTGATTTTAAAGGTTTCGAACTCACAGCATGGTCAACAATCACACGAACTTATCTATATATCCTTGCAGTTCTTACAATCAATAA 

TTATATCTGCAAGGGTACACAAACATCTGTTTAAGATAGACGTACGATCATATTGACCATTATACTGTTTGGGGATCGTTTGAGGAGTAAATATA 

CAGAATGCTGTTTGTCAACAGAATGGCCAACACTTACATATAATTTCCTAACGTTTTGCTTTTGCTCCTAGCTTATGCTCAAATAAATAGATCAT 

AAATAAATTAAATTACAATTTTGTAAATACATAAAATATTTGGCATTTAATTTCGCGAAGCTCTACAAATTGAATGGTATTCGATTTTAGCACAA 

CATTTTGCACAAGGTTCAGAAGTGATTTGAAAATATATTTTCGGCTGGGTCTTTGGATGGCAATGCGTTTTAACCAGAACAAAAATCTGTCAATC 

ATTCTCATTCTTGTGGAATGCTATAATCATATGTTACATCATACGTCTATATGATTTCATATCTCTTTCTTTTGGCATTGTGTTAGCATACAAAA 

AT GGC AAC AT C AAAAAAAAAAAGT AAT CG AT T C AAAAGTC T T AT GG GAGG AAAC AAAT C GT AAGAGAAAT GT AC AT G ATT T C T T T T T T AC AT T T T 

CCCATTTGTTTTTGGCATTTAACATTATACTTTTGTGCTTATGCTTTGATCTATATATTTAATATATAGTATGTATCTATGAATTCAAAATTGTT 

TCGCTAGGCTAAGTATTATGGAGCTGCTTATGTTGTTCGTACATATATGAAGTTTACTGTCTGTGTTTTTTGTGGTAGAAATATTCTGTTAAAAA 

AGAGTTTTATGTTTTTGCTCTCCTGGCATATCAAGTTCATATATAAAGTCGCTTCTGAGGCTCCTGTGGATTGTGCTTTAAAAACGTTTTACAAA 

TTCCAAATGGTTGCCTCATCTCTAATACAATTGCTCATCTGTCTCGTGTTGTTTTCGCTTGTCTTGTTGTTAAAAGTACTACAAAAGTGTGTTGT 

TTCTGTCGTCAGTAGTGTTGTTAAAGTATTGACCAGGGAACATCGTACATTATCTCGCTAGCCTTCGCTGCTAAATGCTTTTCCCCATTCGCACG 

TCTTCAGTGATAACGCTCGCTATGCATCCGTTTCGATTTGGTTTGTTAGTACTTCTCATCGTCCGTAATTTCCCTATGGGAATTCTGTAATTCTC 

AGGCCTTGTAGTGCTGATTCCTCACCGCGCAGCAGAGGCTCAGGGCGAAAGTCAGGGACAGCAGCTCCAGCAGAATGACAGCGGCGGTGACGGCG 

AAAATGGTGACCCAGTTCTCGTAGATGATTTCAATCAGCTTGTCCACGCAGCCCTGCAAGAGACGTTTAATATGAAACTCATTAATCGGTTTGTG 

TAAAAGTCAGCTCGATTTTCATTCAGTTTGTTTACAGCTCGCAATCTCTGGATCTACTACCCAACCAGACTGGCAGACAGAAGTTTGAGATGCGT 

GTTGAATATCAATATTAATTAATACGTTTTTATCGTCTCAGCATTGGCGGGCGTGAATGTTTATTAACCCAATGCTTTCAGACCGCAGAAAAAAA 

GGATTATTAGTTTTTATGGCCAGGCGGCGATAGCGATCGGTAGGCCAAGAAATTATGATTCGGGCACACCTATCAATTGAGTTTTGATGATTTTT 

TGGAAATTCACTGAATCGAAAAACGATCTTTTCTGGTAAAATTCATATGACTCACGGAATCGAAAAACTCTTTGGTTGGTCATTGGCATTACCTG 

CTGATAAATGGCGTTGTTCAATGGGCCACCAAATTTCAGGCGTCTGGACAGCTCACACTCATTATCCTTTAGATTATCCTTGCAGCAGGACTCGG 

GTATGTTATAGAAGACATTCATGGAAGACACGGCGATCTCCACAATATTTGTGCGATCCACATTATTAAACCGACTCGTGGCCCAATCTCCAGGT 

CCATCGGCGCCGCAGCATTTCAACTGAAACAGATCAGGAAAGTTCGCATAAATCGTTTTTCCGACGACGTGGTGGGTATGTCCAAGTACTTACAT 

TCTTTTGCAGCGTATCAAAGGTGACGGTGCGCGAACTCATCGTGGACTGGCCGTACTCCTCCTGGACTGACGACTTGACAGCCGCCCGCACGATG 

TCATCCAGCTTGTCTTTGTTGTGGAAGGCCCATGCTCCGGCTGCAATTTGTGCCACCATCACGATGAGTATGACACAGAAGAACTGCAAGAGAAT 

TTCAGATTTATGAATGACGGAGTCCGGCCCGTGGGCGTTGAGTAAATAAACTGGTTGCGATTTAAGACAGATATGTTTGCACTCGCAATAAGGAA 

CTCCAATTAAGTGCAACTTTTGTTTTCTTTACATCATTTATATGTATAAGATATACTTTCTGTTATATATATTTAATAAGTACTTTAAATGGCAA 

TACTATTGGATAAATTTTCTGTATTCACAGTCCAATTTAGTTAGGTATAAAATTATTTTGGCAAGTGCATCGTTTTGCCAAACTGCCAAGACATT 

GT G GGCG AAC C T C C G T T GCAAT C AT AT G AGCGG AAC C GT G T GG GCC AC AC AT T T GC AT GC AT T T T AAT T G G AAT T C C C AG AC AC T GGT TAT GAT A 

TTACTTTTCACTTACCGAAACCAGCAGACATTGCGACTCACGACAGACTCCGCAACAGCCAAAGAATGCGCCCAATGTGATCAGTATGCCGATGG 

CGAGGAAAACATACAGGGCTATGTGATAATGATTGTAGTTCTGTGTCATCGACAGCATAAATGTGGGATCCGTCAGCATCCAAACGGAGGTCACC 

ACGATGGTCAGTCCGATGAGCTGCAAGCGAAAAGAATACAAAATATAGTAGAGCATTACCTATTTTTACACATTTTCGCTGTAATTCTCAAATTG 

TTGCATCCATCTCACGCTGGGCACACGCGTCAATGACGCCATCTGGCGGCAAGGCGGCAATTAAAGTGGGAGACTGCCCCGCCCAGTTTCGCCTT 

GCTTTAAACTTCTATAGTTTCATACATATGATGTCAGCTTAGTTGGACGCAGACTATATACATAGCAACTGCTGTTGCGCTTCTTTTAATTTAAT 

TAAAGGCAGTAAACAGAAACCAAGTACAAATTACAATTTTCCCCATTCCATTCAATTTGTTAAAAAAAAAATCAGTAATTCTATCCAGTATACAG 

ATTCAAAGGCAGCTTTTTGAGCTCCGAACTTTCAAAACCCGATAAGAAAAGGCCCATTAAGTTTGAAAATCAATGATGCGATAAAATTTGGTTTA 

AAGTTAAATTTTGGCCTTCATCGCCAAGACATACTGTATGTATATACATATAGAAGTGGCAAGACATTGATCCCAAGCGAGAATATGAGAATTGC 

GAGCTCGTGTTATCTCACTTCTCGGGTTATCAGCAAACGAAATATGAACTTTTCTTTGGATTTCCGCTAAAAGAATTCCGGTAATGTATTTCTTC 

TTAAATTTTCCTACCCAGATATTAAACACTAAACTTGCTATTGCCAATCCATCCACCAGAAAAATGTTGTTCCGCTCGTGCAATGAGTAAGAGAA 

TCTAATATAGAGGTAATGACTTAATGGGTTGGTCGAGGTGTCTATGCAAGTTCATTAATTATTCAGCAAAATTATTCATCGCAGGGAAATTGAGT 

GTGCGATGAGAGCTAATCAAGAGATGTAAACAAGTGCACATATTACATGGGAAATATGTAAATTTAGAACCACTTAAGAACCGATGGCTAACTGA 

GAAGCATTTCCAAATACAAAACACTTCTTATCTCGTATTTTTTCTCAGCCCGCAAGCTTTTGTAATTGCCAAGAGCATCAGTGCTTTCCGATAAG 

ATTACGCAGTGGATCTCGCAATAAGCAAATATCTATTTAACTTAAAGATACAGCAAAATATAAGATACAGTTTTTGGCGCCCAGGAAAAACGGAT 

GAAAAAGCATGAGGAAGTCTCGTAAATTTCATGAAAATCTTATGAAATATAAGAGAAGCGGAGACCGGCTTTTATGTGGGTGGTCCACGAAAGAG 

AGAGCTGCCATAAAGTGAGCGAGACAAAAAGTGTGCCACAAAGTGCACGATATGCTGGCCCTAAAGTTCATGAATGTGCATGTGTGCAGCTAACG 

GAAAAACTACTCATGGAAACGTAATACAATGTTGCCCCATTCGAGGGCAGTTTATTGAGTAAGTACATTGTAACCAGAAGCCCTTGTTTGCATAG 

ATTTTACTTATTTTTCCAGGTCAGGTTTTCGGGTATGGATCAGTAACATAATCTTGAATTTGTTCATGTGTTGAATTAAAAAAAAAAATGTTTCT 

GAAGAAGAGGTTTATTCCTAGAGCTCTTATAAAGTATGTGTATGTATATACTACTTTGGATAAACCTCAATTCCCTTATCCTTTTAGGGAACAAA 

AGTTGACCTCGTTTTGAATCAGTCCCATTATCCCTGAACGAACTGAATCCACGCCTTGTTTGTAGCTCTGTTTGTTGGAATTTTTACGATTCCCC 

GAACGTTTTCACACACATATATTTTTACACATCTGACACTATGTCGCGTGTCCAGGACAAACAAACAAAAATCCTGCCGAATGGGCCAATAAATG 

TACAGAGTCCTTCGTTGACCCAGTTTTCAGAGCCGAGGCGTCGAAAGCGGAAAACGAGAGAGCATTTTACTGCTGCTAAATTGCAAAAAAAAAAA 

TAAAATGGGGCCAAAAGGCAGCTGGGCTTGCATTGCATGTCCAGTCACAGTTTTGTCCCACAATTAACCGCATCGATGGCAAATCGCAGCCAACA 

GCCATCATGCATCAGCCATAAGCTGAAAATACAAACTGAAAAATGAATGAAGCGCCACTCATGAATAATTGAGTGAAAAATAGCACAGCGAATTA 

AGTATGTAGATGTGACATGTCACAGCGAATTGAAAATAGATACAGGCCAGCTTCGTTACTTGGAGAAATATTTTTAAATATTATTTTATTCACAG 

CAAAAGATTACATTTTTTTCAGGAATTTAAGAATTCTGCACTATGGAAGTCTGACTGTGCTGAAATAAATGGTTTTTTTAGTGGGTGGACAGGAT 
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AGGATTGTCAGATGGTCGACCGAATATTTGGAAATCTTTCAGTTTTGGTCACTGTCAACTTCAATGACGACAGCGCCGGCCGTTTAGCATTTTAG 

ACGCTTATGGCATAGGCCTCCCCCAGCATCTTGGATGTGGGATATCGGCCCCATTGGAGCGTCAGCGTCCTCAGAGATCGTGGCGTTTGATCACG 

ATGCCCCACGGCGACGATGAAAATGGAAATGTTTGGAAATCGATGCACAGTGGAGAGCGAGATTCAAAACTAATCGGACAGTGGTCATTTTTTGC. 

GAAGTTATATAAAAATACTATTATTAAATTTATTATGGCTTTGTGGGGTAATTGATAACATGCCCTGGCTAAGAATAGCCTCAAAACAATGTATA 

AAAAGTCTTATATAAAATAATATCTGACGCACTTTCATCCATAAATCGCTCTAAAAATAGTTCAAACGATTAGTGAAACGAACTGCATCACAAAC 

ACATCACAATTTCTATCGCTCATTGTTCCTCCACTGAAATCAAACACACCACTGTGCATCCAACAACAACAACGATGAATACTAAAACATCACAG 

ATCAAGGAACATTACACAGAAACCAATTATTGGATATAATCTTATTGGGATCGGTTAATTTTAATTTTTTTTACATGACATATATACAATAAACG 

AAAAATAATTGAAAGGTATTCGAAAGTTTCCAAACACGAAACTTGGCTAGGCATCAGTATTTTCTTTCTGTGTACAACAACGAAAAACTGAACGG 

ACGCGTAAGTTTGTGTTGTTCCTCGCCCGTATTCGAAAATAGCTTTTGACTAATAAATACTATAGACAGAAATCGAAAAAAGGCAGCAACAATCA 

ACAAACGAACGAAAAAAAAATTAGAAATCAGCCCGCATAAGAATCAAAATTTTAAATTTGTTGCCCAAAGGAAAAGCGAAACTGATTTCGCAAGC 

CGCGGAAGACAAGAGGAAACCCCAAGAAAAACTCAAGGAAACTCAAAACAAACGCAAACAATCGCTGTGGCAGATAGTAATTGAATGGAAACAAT 

TGAAAGCCTAACGAGGCTATGTTGAAAGTCGAAAATTGCGTAAAACAACAAGCAGATAAGTTGGCCAAAGAAAGTAAAACCCTTTTTTTTGGGGT 

GAGGAGAGGTCCTTTTTTACCGAATTAAGCCAACCGAAGTTTGTAAAAATATCAACGGCACGTATACGTTATAAAAATAATAAAACAAAAAAAGA 

GAGACAGGAAGAGACAGAAGCAiW^GAAAAACAACTGAACAAACATGTGGAGGGAAAAGAAAACAGTGGGAGGTGGGAATGGAGATGCTTAAATG 

TTTATCTGTCTGTGTGTGTGTGTGTCCCCTCCCACAACGGAACTTCTATTGTTTTTATGCGGGAGTACGAGCAGAGCGGTAGGGAGTGCGAGCGG 

GAG AT G GAG AG AT AC AGT GAT GT C GCTT T T AAAAGC C AGT T G AAGT T T T G AAG T T AT AAAT G GT AG AAAAT AAT T T AAAGT C AAAT AGT T C AT AA 
CAAACAAAACCGATTTGTTTAATTTCAAAAAGAAAAACCCTTCTATTAATGTTTTTAATTAACCGCGTACTTGAATATTTATATAATCTTGTCCA 
TTTAATGTTTTTAAGCCCGCTCATTAAAAAAAAAATATTTAAGTTAATATATTATTATATTTATTTTATTATTTATTTATTTATATTTTTTACTT 
CCTTTTCGATT AAAT ATTTCAAGCCGTACTT AAAT AAGT TTCAACTT AT GTTTTGGGCACCTCTGGGATCGAGAGAGTGGAAAGAGGGACTGAAA 
GAGCGGTAGGACTCCAAAGAGCGTGCGCACGGCTCCTCCCTCCGCCCGCGCGCCAAGCGTGAGAAAGAGGGGCGTAGGGAATACACAACACACAC 

AC AT TC GC T C A C GC AAC AC C C AC AC AC AC GC GC GC G AAAAGG AGGT C C GT GT GT T C C AGT T GT T GT T GTG AT GG GAG AG AAC CGAAT C T C GC AT T 
GTTGTTGTTGCTGCTGCTGCTGGAATCTACATACCCAGAATAAAATGTTTATAAGGACCATCAGATATTTGACGCACGAACAGCAGCCGTTGAGA 
CCCATGGCGCGGATTGATATTCTTAAATAGGTAACTCCTTAAAAAGTAGCTGCAATTTAACTGGTTTTTCAACCACTTGGAGATGCTCTTTTTCC 
AAGGAACCCCGAACTGCAGTGTCGCCTGCACTTTTCGACCTTTTGCACTTTTTCAAAACACCCGCTCGATTGTACTATATTTTTCAAATTTTTTT 
AAAATTTCCCGTGCTAAAATCCACCGATTTGAGCCACTGTTTGCGCGATTGGCCGTTGCACCCGATTTGTTTTAAAAATTTTATTTGTGCATCGA 
GGCGTCCGTCCAACGTAAAATTCTTGTATCAGTGCGACCATTAGCTCGGCCGTTCAAATATACCAGCGGGTATAGAAAGGCTAAAAACCGAAAAG 
TGGTGACCCTTAGAAGATACCAAAGAAGTGGTTGAATAACTTTGATGAGGCTTAGCATGGCTTGCAACCTGCTTGGATTTGCAGCCTATACGCAA 
ACCAAAGTGGCCTATCGATATTTTGGATAAGTTCTATTGCCAATTTTCCCAATAAAATAAAAACTTCACTTGAATGTTAAAAGTAATTTAGTTTC 
AGTGTTTACATAATGTTCAAATAACCTTTTGATTGTTATTATGAAAGTTAAAAATTAAAAATTTAGATATTCTAACTAACTATTATCCCCAATGT 
TTCAGCTATAATCACCATATTTAAAGTAGATTTCATTTTAAAGTATAAATTTGTGGTTGTTTACATTATTTTAACATGTCTATTTAAT TAT TACT 
ATGAAAGTGGGTAATTGAAAATTAGGATATTCCAGGTTTTTGATTTACTTTGGCTATGATTTTCCCAAAATATTTAAATCTGTAAATGTAAACAG 
TTTGAAATGTGACCTTAATAGTTCATGCTTTGTTCATTCTTCTGCGAAACGGTCACACTGTTCGCGCAGGCGAAAATGGCCTAAAGCGTGAAATG 
TTTAATTCGTTGTTGTTTCTCCTCCGCAATAGTTACTAATAGAAACAAATGCAAAATGGCTGTCTAACGTTTAAGTGCAGCGACCGCCGAAGATC 
GTGGAAATATGCATGAAAAGCGGCTCGCAGGCAAATGAAAGGCGCGTGAAAATGTAGCAGACACCTTTTTATCGCGAAAACGAACATCGGTGCAA 
AGGGAAAAACAAAAACCAAGGAAAATACAGCCAAACGCGAATGTGAATGTGAAGGCAAAGTAAGCGAAAAGTCCCCCAACTTTTTTGGGGAGTAT 

CGAGTGCACCACATACGTTTTTCAATATATATTTCGATTGCATTGAAATCAGAAACATGACAA 

(SEQ ID NO: 691) 

Exon: 8848.. 8490 
Exon: 4200.. 4006 
Exon: 3598.. 3419 
Exon: 3348.. 3133 
Exon: 2713.. 1001 

Start ATG: 8555 (Reverse strand: CAT) 
Transcript No. : CT19213 

TTGGACGGACGCCTCGATGCACAAATAAAATTTTTAAAACAAATCGGGTGCAACGGCCAATCGCGCAAACAGTGGCTCAAATCGGTGGATTTTAG 
CACGGGAAATTTTAAAAAAATTTGAAAAATATAGTACAATCGAGCGGGTGTTTTGAAAAAGTGCAAAAGGTCGAAAAGTGCAGGCGACACTGCAG 
TTCGGGGTTCCTTGGAAAAAGAGCATCTCCAAGTGGTTGAAAAACCAGTTAAATTGCAGCTACTTTTTAAGGAGTTACCTATTTAAGAATATCAA 
TCCGCGCCATGGGTCTCAACGGCTGCTGTTCGTGCGTCAAATATCTGATGGTCCTTATAAACATTTTATTCTGGCTCATCGGACTGACCATCGTG 
GTGACCTCCGTTTGGATGCTGACGGATCCCACATTTATGCTGTCGATGACACAGAACTACAATCATTATCACATAGCCCTGTATGTTTTCCTCGC 
CATCGGCATACTGATCACATTGGGCGCATTCTTTGGCTGTTGCGGAGTCTGTCGTGAGTCGCAATGTCTGCTGGTTTCGTTCTTCTGTGTCATAC 
TCATCGTGATGGTGGCACAAATTGCAGCCGGAGCATGGGCCTTCCACAACAAAGACAAGCTGGATGACATCGTGCGGGCGGCTGTCAAGTCGTCA 
GTCCAGGAGGAGTACGGCCAGTCCACGATGAGTTCGCGCACCGTCACCTTTGATACGCTGCAAAAGAATTTGAAATGCTGCGGCGCCGATGGACC 
TGGAGATTGGGCCACGAGTCGGTTTAATAATGTGGATCGCACAAATATTGTGGAGATCGCCGTGTCTTCCATGAATGTCTTCTATAACATACCCG 
AGTCCTGCTGCAAGGATAATCTAAAGGATAATGAGTGTGAGCTGTCCAGACGCCTGAAATTTGGTGGCCCATTGAACAACGCCATTTATCAGCAG 
GGCTGCGTGGACAAGCTGATTGAAATCATCTACGAGAACTGGGTCACCATTTTCGCCGTCACCGCCGCTGTCATTCTGCTGGAGCTGCTGTCCCT 
GACTTTCGCCCTGAGCCTCTGCTGCGCGGTGAGGAATCAGCACTACAAGGCCTGAGAATTACAGAATTCCCATAGGGAAATTACGGACGATGAGA 
AGTACTAACAAACCAAATCGAAACGGATGCATAGCGAGCGTTATCACTGAAGACGTGCGAATGGGGAAAAGCATTTAGCAGCGAAGGCTAGCGAG 
ATAATGTACGATGTTCCCTGGTCAATACTTTAACAACACTACTGACGACAGAAACAACACACTTTTGTAGTACTTTTAACAACAAGACAAGCGAA 
AACAACACGAGACAGATGAGCAATTGTATTAGAGATGAGGCAACCATTTGGAATTTGTAAAACGTTTTTAAAGCACAATCCACAGGAGCCTCAGA 
AGCGACTTTATATATGAACTTGATATGCCAGGAGAGCAAAAACATAAAACTCTTTTTTAACAGAATATTTCTACCACAAAAAACACAGACAGTAA 
ACTTCATATATGTACGAACAACATAAGCAGCTCCATAATACTTAGCCTAGCGAAACAATTTTGAATTCATAGATACATACTATATATTAAATATA 
TAGATCAAAGCATAAGCACAAAAGTATAATGTTAAATGCCAAAAACAAATGGGAAAATGTAAAAAAGAAATCATGTACATTTCTCTTACGATTTG 
TTTCCTCCCATAAGACTTTTGAATCGATTACTTTTTTTTTTTGATGTTGCCATTTTTGTATGCTAACACAATGCCAAAAGAAAGAGATATGAAAT 
CATATAGACGTATGATGTAACATATGATTATAGCATTCCACAAGAATGAGAATGATTGACAGATTTTTGTTCTGGTTAAAACGCATTGCCATCCA 
AAGACCCAGCCGAAAATATATTTTCAAATCACTTCTGAACCTTGTGCAAAATGTTGTGCTAAAATCGAATACCATTCAATTTGTAGAGCTTCGCG 
AAATTAAATGCCAAATATTTTATGTATTTACAAAATTGTAATTTAATTTATTTATGATCTATTTATTTGAGCATAAGCTAGGAGCAAAAGCAAAA 
CGTTAGGAAATTATATGTAAGTGTTGGCCATTCTGTTGACAAACAGCATTCTGTATATTTACTCCTCAAACGATCCCCAAACAGTATAATGGTCA 
ATATGATCGTACGTCTATCTTAAACAGATGTTTGTGTACCCTTGCAGATATAATTATTGATTGTAAGAACTGCAAGGATATATAGATAAGTTCGT 
GTGATTGTTGACCATGCTGTGAGTTCGAAACCTTTAAAATCACCCACACTTGAAACGAATTTTGTCGAATCCAACTTACCTGCATCGAAATAATT 
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TTTGATTTTTCCCTCCTTGTGTCAGGGGAACTTATTTATATGGGAAACCGTTGAAGCAGCACAAAATGTTAAATGAATAGTACACACGAACGAAA 
CAAAAAGTCTCTATATTTACATTGTATTTTTACCATACACCATAAACTAATTGAAGGATATATATTTTTTGTAACATTTTCATTTTATTGTTGTT 
ATATGTGAATTTTGTTGTTAGTTAAGCGTAGCCTGAACGTCACACTTTGTACCCCTTCACAACGTTTAAATAAATTAAATCTACAAAAATCAACA 

TAA 

(SEQ ID NO: 692) 

Start ATG: 294 {Reverse strand: CAT) 
MGLNGCCSCVKYLMVLINILFWLIG 

MVAQIAAGAWAFHNKDKLDDIVRAAVKSSVQEEYGQSTMSSRTVTFDTLQKNLKCCGADGPGDWATSRFNNVDRTNIVEIAVSSMNVFYNIPESC 
CKDNLKDNECEL SRRLKFGGPLNN AI YQQGC VDKL I EI I YENWVT I FAVTAAV I LLELL SLT FALS LCCAVRNQHYKA* 
(SEQ ID NO: 693) 

Classification: known_f lybase_gene 
Gene Symbol: BcDNA:LDl9727 
FlyBase ID: FBgn0027865 



Celera Sequence No. : 142000013385214 

GTGGTGCCGTCACCAGCGGCCACATCCTGGGCACGCGACAGCTCCACCAGCATCTTGGCAGCCGGGTGCAGTACATTCATCTGCTTCAGGATGGT 
GGCGCCATCGTTGGTGATGGACACCTCGCCGTTGCCGGCCTGGATCATCTTGTCCATGCCGCGGGGGCCCAAACTGGTGCGGATAGCATCGGAAA 
CGGCTGGAAAACAGAATTATTTCAATAGGTTATAAATGTATTTTAATATTTCGTTAAAACTTAATGAGAAACTCAGATAAGTGGAATTCGTGTGG 
TTAGTTTTACACAGCAGCATGACTAATTTACATACATACATACAAAAAGACATATGGTGTTTTATGTGTTAATTCAAAAAATCTACTGTAACTTA 
TCTTAAGAGATAAACAATAAAATCCTTGTAGCGGATAAAAGGGTATTGGGAATAGAGTATTAAAAATCCTCAATTTTTAGGTTGTTTCAATTCAT 
CACATGAGTAATGCGCGCTCTTCTTCTTCTTCCGCCCAAGAAGTTTCCAGACAGCCGGCATCCAACTGCTTTTTCAATAGGCCATTTGGCCGGAA 
ATTCTGATGCACCACTGCTCGGAATTCAGTCACCTCGGTTCTTAGCTCAATAAATAGCACATACGCCAACATTTTGATACAAAAATGTGTTTAAT 
TTTGTTAATTTCCTCGCTTTTTCTCACCTTTGGCCGCCTGGATGTTGGACAAACGCACATCGGTGGGCTTCGACTTGTCCTTGAAGGCCTTGGCC 
GTTGGCTTGATGTTGACAGCTGCTGCCTTGGGCGCCATTGTAGGGTTAAGTTATAGTTTTTTAACAAAACACAGTTATTCTACAATAAATTCACA 
GCGATTTACACAGAACACGTTGCTGCGAAATGTTAAGGAAAGTGTGGCCGCACTCGCTGAGCTCAAATATACCGCCAACTTATCGGCATCGATAA 
CATCATCCCTTGTGAATGGCAAATTGGTGTGACCGTATTTTGCGTTGTGACTCGGCTGCTTTATTTACATTGAATTTTTGTTCGATTTTTGAATT 
TAAAATAAAAATAATGCCGCACGGACACTCTCACGATCACGGCGGCTGCAGCCACGAGGCCTCGGACGTGGACCACGCCCTAGAAATGGGCATCG 
AGTACAGCTTGTACACCAAAATCGATCTGGACAATGTGGAGTGCCTGAACGAGGAGACGGATGGCCAGGGAAAAAGTGTCTTTAAGCCGTACGAG 
AAGCGCCAGGATCTGTCCAAGTACGTGGAGAGCGATGCGGATGAAGAGCTCCTCTTCAACATTCCCTTCACCGGCAACATCAAGCTCAAGGGCAT 

CAT C AT AAGT GGG GC C AAT GATGAC T C C C AT CC AAAC AT GG T C AAAAT GT AAGG AAC AAAAGT C AAGGC T GC T T TAAT AT T AC T ATAT TC AT CAT 
ATATCTTATTTTTCAGCTTCAAGAACCGACCCAGGATGACCTTTGATGATGCCAGAGCAAAGCCCGACCAGGAGTTCCAGCTGACCCGCGATGCC 
CGCGGAGAAATCGAGTACTCACCCAAAGTGGTCACCTTCTCCTCCGTGCACCATCTCTCCCTCTACTTTCCCAGCAATTTTGGCGAAGACATAAC 
ACGCATTTACTATATAGGTAAGCTCACAACAATCTATGTTTTATCTGTGTTCCCAACTAAAACCAAGTCACTATTCAAAGGTCTTCGAGGAGAGT 
TCACAGAAGCTCATTACCATGGCGTAACCATATGCAACTACGAATCCCGTGCCAATGCAGCGGATCACAAGGAAAAGGCTTTCGATGGAGTCGGC 
AGAGCCATCCAATAGGGTTATAGAGATTAAAATTTACAATTTAATTGAGTATCTCTGTAGTGTATATATTTTGTGATCATTTAAGGCAAGAAATG 
TTTTTGTATTCGCTTAACTTATTGAGCATTTGTATTGCACAGATATTGTGGGCTAAATCATACTAAGTATATTGTGTGGGTGGAATCACATCGGG 
TCCTATTTTCATGCCCATATTGGGCACTTTATCCGAATGGATTTCAGCCGTGACCTCGGTGAACAGCTTGTCCCACTGCCAGAGATCGTCCTTCT 
CGTTGAGGCTCTCCTGTGGAACCAGGCAATCGGTGAGGATTGAGAGATCAAAGCCATCCACAGCTGAAAAGGCGTACTGGGAAAGCAAATCGGAA 
TTGGCCTCCTTCAAAACAGCAGATCGTGAGGTGCCAATACTACAAATTACAACTTATTATCGTAGTGAAGAGAAATATAGCAGCTTCTGCTTACG 
TTGGCGGTTCTACAATCTCATTGAGCATGATTTCGTCCTTGACATCATCCATGTCTGGGATGACGGGAATGTCATCCGTTGGTATGGAATTTGTA 
GCCGACTTTGTTTGTTTAAAGCGCTC ACT AAAAT GAATTTTATC ATT AT AAC AGTCGGTGGAAAGTGAAGTT ATT GT GAT AAACTTGT ACT CACT 
CATCAAAGGAGTTTTTCTTGGACCTAAAAAGATATAGAGCAATCAGTAATGTGCCTAATTCGAGCCTAATCATACCCACTTTAATCCTTTGAGTC 
CAGAATCCGCCCAGCCACCTGAAATGCGACGAATGGGTGGTCGATGTGACTCGAATTCATTGTTGTATGTGGACGAGGTGGCCGTGTCCGAGGTG 
GTCGGTGGGGTGCCCGCCTGGACATTCAGCGTAATGTCGATGGAAATGTCCTTGGAGCCCCCAGCCGCCTGGACATCCTCCTCCTTCAGAACATC 
GCGACTCTTGGAGCGCCGACCCTTCCTGGCCGTCGTCTAGGGTGGCAAATAAATCGAAACTTCAATTTGCATCCATTTAAACAGGGTTACTTACC 

TTTCTTATGGTCATCTTG 
{SEQ ID NO: 694) 

Exon: 1001.. 1378 

Exon: 1442.. 1632 

Exon: 1696.. 1868 

Start ATG: 1059 

Transcript No. : CT19326 

CTCGGCTGCTTTATTTACATTGAATTTTTGTTCGATTTTTGAATTTAAAATAAAAATAATGCCGCACGGACACTCTCACGATCACGGCGGCTGCA 
GCCACGAGGCCTCGGACGTGGACCACGCCCTAGAAATGGGCATCGAGTACAGCTTGTACACCAAAATCGATCTGGACAATGTGGAGTGCCTGAAC 
GAGGAGACGGATGGCCAGGGAAAAAGTGTCTTTAAGCCGTACGAGAAGCGCCAGGATCTGTCCAAGTACGTGGAGAGCGATGCGGATGAAGAGCT 
CCTCTTCAACATTCCCTTCACCGGCAACATCAAGCTCAAGGGCATCATCATAAGTGGGGCCAATGATGACTCCCATCCAAACATGGTCAAAATCT 
TCAAGAACCGACCCAGGATGACCTTTGATGATGCCAGAGCAAAGCCCGACCAGGAGTTCCAGCTGACCCGCGATGCCCGCGGAGAAATCGAGTAC 
TCACCCAAAGTGGTCACCTTCTCCTCCGTGCACCATCTCTCCCTCTACTTTCCCAGCAATTTTGGCGAAGACATAACACGCATTTACTATATAGG 
TCTTCGAGGAGAGTTCACAGAAGCTCATTACCATGGCGTAACCATATGCAACTACGAATCCCGTGCCAATGCAGCGGATCACAAGGAAAAGGCTT 
TCGATGGAGTCGGCAGAGCCATCCAATAGGGTTATAGAGATTAAAATTTACAATTTAATTGAGTATCTCTGTAGTGT 

{SEQ ID NO: 695) 
Start ATG: 59 
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MPHGHSHDHGGCSHEASDVDHALEMGIEYSLYTKIDLDNVECLNEETDGQGKSVFKPYEKRQDLSKYVESDADEELLFNIPFTGNIKLKGIIISG 
ANDDSHPNMVKIFKNRPRMTFDDARAKPDQEFQLTRDARGEIEYSPKVVTFSSVHHLSLYFPSNFGEDITRIYYIGLRGEFTEAHYHGVTICKYE 

SRANAADHKEKAFDGVGRAIQ* 
{SEQ ID NO: 696) 

Classification: hypothetical 



Celera Sequence No. : 142000013384548 

GTAACCAAGTACACCTGTTGGAAGTTCTCCAGCGATCCATTAGCGGGATGTGGATGGAATATGTCCAGCAGACCGATTACGTTCTCATGATCCAT 
ATGCTTTAAAAGTCGAAGCTCCCGGTACGTCCTCTTTGCATGGACAGCTGATTGAAAAGGCCTGGCAAGCTTTTTAATGGCCACATGCATATTGG 
TGCCACGAACAACTGCCTTTGACACCTGTCCGTAAGCTCCCGATCCCACGGGCTGCAGATCCTGGTATATATCCGGGATCTCCCATTCCGTTCGA 
TTTATATCCAACTTGTAAAACTTTTTTGTAATGGACACTGACATGGTATTGCTTGAGGAAGGGCGTCGTCACCTTAGTTAATGGTGCCTATAAGC 
GAGATATGTATTGATTCTTAAAATGCCATGCGAACGATAATTCCATGCTGATTACGTTTATCTTGAGGCAGCTCTGCGTCAGTAGTACTTACCTC 
TGTTCCACGCCCAAGAATATCAACAACTAAGTACTGGTCCCTGGCATTTTTCCTATATTTGCGCTATTACTCCATTCCGATGCGTGTTAGGCATT 
CATGAATCAGATCGTCAGTGGATCAGATCGATCAGAAGCGACACACGAATGCGCTACATTTGATCAGAGTAAAGTTGTTAAAATCATTTTTGGCA 
CGATATTAAAAAAAATCCTTAATATATAAGTAAATAAATAAAATAATATATTAACCATATCATTGCATTGTAACAAGTCTCTTTTTCCACTCTTA 
TCGATGTCCTTAAATGTGAATAAAGGGATTAAAATTAAATAATTTTGTTAGTTTTTAATAATATTCTAAAAATTAGGTAATATAATATATTTTTA 
AAGCCCATATACCGTAAAATTTGGAGTGTAGAGCGTTTAAATGAAAATAAACACACGGCGTTGCCAGACCGTCAGCTGACAGTTCAATAAGAATT 
TATCGAATAAGTATTTGAATCAGCTGTCTCTTGTTTTTCCTGCAAATTTGAGCAGCACCATCCCCTGACCCATTTTTTGTTGCACTTTTGGTGTA 
CTTTGAATGAAAAATAGTCAGAGACACTACTCGCGCCTGCGCAGTTTCAACGCCGGGGTTATCAAATAAGCTAATCTAATCGTTCCGTAAACGCA 
GCCTGACATTCCGCCCTACCATTCTCCGACGAACAGTTAGTTTCGCACGCAAGCCGCTCCCGCTCGCATCGTAACCTAGAAAACTTTAGCGTATG 
TACATAAGTAAATTGACGAGTTTATCTGAATGATACTGAATATATTAGCGGTATAAAAGATAAACATTACATGGTTTAACGAGTGTTCGCTTGAA 
AGCCTAAATGTTAGACCGCAGGGGAGCTGGAAGAAGCCACTTCAACCGCTGTGAACTGCGACCTTTGGAACCTGGACTGGCAAACATGCAGATTT 
TCTGTAGTCCACTATCTCAGATAGCGGGAACATTGTTAACATCGATGGCATGTTCAGCACAAGTAAAAGATCTGCGAACCGAATGCGATTAGATA 
GTCCGGGGGGTTCAAAGTATGGTTTGAAGGGTTGCTATAACTTCAACCCCTGAACCGTTTTTCTTTTAATATTTTCGAGGAAAACTTCACACTCC 
CCAAAGGCGTGTATAATTTTTGGTAGATAACCCACAGTTCGTTATCTTCCATTATAGCTCATCATAAAATGCAAAACTATCATGTTTCTTTCTCT 
TTGCAGACACAAAATGACTTCAAAGCTACTGCCCGGAAACATTGTGTACGGAGGTCCTGTGACTGAACGACAGGCCCAGGATAGCAGATCCTTGG 
GTCAGTACATCCTCGACAAGTACAAGAGCTTTGGCGACCGGACGGTGCTGGTGGATGCCGTCAATGGAGTGGAGTACTCTGCCAGTTTCATGCAC 
AAGTCCATTGTACGGCTGGCATACATCCTTCAAAAACTGGGAGTCAAACAGAATGACGTCGTTGGTTTGTCTAGCGAAAACAGCGTCAACTTCGC 
CCTGGCCATGTTCGCTGGTCTAGCAGTTGGGGCTACGGTTGCTCCCCTTAACGTAACATATTCCGATCGTGAGGTGGACCACGCCATTAACTTGT 

C C AAGC C AAAG AT C AT AT T CGCC T C T AAG AT T AC CAT T GAT C GT GT T GC C AAAGT GGC C AGC AAG AAT AAGT T CGT C AAGG GC AT C AT TG C GC T C 
AGTGGAACTTCCAAGAAATTTAAGAACATCTATGATCTTAAGGAGCTGATGGAGGACGAGAAGTTCAAGACACAGCCTGACTTCACGAGCCCTGC 
GGCCAATAAGGACGAGGACGTGTCTCTTATTGTGTGCTCTTCTGGAACCACCGGACTTCCTAAAGGAGTGCAGCTGACCCAAATGAACCTGCTGG 
CCACTCTCGACTCACAAATGTAAGTTTAATAATATGCTATAAAGGCGTTTCTTTTAAATGATTTTTCTTTAATAGCCAACCCACTGTCATTCCAA 
TGGAGGAGGTCACTCTACTCACTGTCATTCCCTGGTTCCACGCCTTCGGCTGTCTGACGCTTATCACCACCGCCTGCGTTGGCGCACGATTGGTA 
TACCTGCCCAAGTTCGAGGAAAAGCTCTTCCTTTCTGCCATTGAAAAGTATCGCGTGATGATGGCCTTCATGGTGCCACCACTGATGGTCTTTTT 
GGCTAAACACCCCATCGTGGATAAGTACGATTTGTCCTCTTTGATGGTCCTGCTGTGTGGAGCAGCTCCACTCAGTCGCGAAACTGAGGATCAGA 
TCAAGGAGCGTATTGGAGTGCCATTCATCCGACAGGGATACGGCCTCAGCGAATCAACGCTGAGTGTTCTGGTGCAGAACGATGAGTTCTGCAAG 
CCAGGCAGTGTGGGCGTTCTTAAGGTGGGAATCTATGCCAAGGTGATCGATCCCGACACCGGCAAGCTATTGGGGGCCAACGAGCGCGGCGAGCT 
TTGTTTTAAAGGCGACGGCATCATGAAGGGCTACATCGGAGATACGAAGTCCACGCAGACCGCCATCAAGGACGGTTGGTTGCATACTGGCGATA 
TTGGCTACTATGATGATGATTTTGAGTTCTTCATCGTGGACCGCATCAAGGAGCTGATCAAATACAAGGGATACCAGGTGCCGCCGGCAGAGATT 
GAGGCTCTGCTGCTCACCAACGACAAGATTAAGGATGCGGCGGTCATTGGAAAGCCAGACGAGGAGGCTGGCGAGCTGCCGCTGGCATTTGTGGT 
CAAACAGGCTAATGTTCAACTGACCGAGAACGAAGTCATTCAGTTTGTCAACGACAACGCCTCGCCCGCCAAGCGTCTAAGGGGTGGCGTGATCT 
TTGTTGACGAAATTCCAAAGAACCCCAGTGGCAAGATTCTGCGTCGCATTCTGCGGGAAATGCTTAAGAAGCAAAAATCCAAGTTGTAAGGGATT 
TCTCCCCTTAAGGGTGTGTGTACAAACAATTTACTATGTGTATGTTTTATTATTGCTAAACTTTGTTTTATAAACTACCAACTGGTTGGGTTTCG 
ATTTACTCATATTTGGCTTTTCTTTCGAAGTATTCGCTTAAAACTTAAATACAAATAAAAACTGATGGATTTCCATGTCGCAGATAGCATGTGAC 
ATCGCTGAAGCTCTAGAACTCAACTCCAGCTTTAATTACGTAGATGCCTGAACTATCAATAACAAGTTATATATAAACAACTACATACATTCGTT 
CTCAACTTGTAACAAACATGTATTTTTTCCATATTGTTGTCGCTTTTATATAGACTCATTATTAATTTCGTATGGTGTATACATATATTTATATA 
TTTATTTGAAATATTTAACTACTCATCTGCTCTGCTTTGTAAATAGTTGTTTAACATTTTTTGTTTTGCTTGGGAAAAGTTTCTATCTTGCTTTG 
TCGCAGCAAATTGCATATATTGGGTTCATTTGTGTTTCCAGTATATAAAGAGGTTTATATGTCTGTCTCTAGCATTAGTTTACTAATTATTGTGT 
ATGAAACCTTAAATGTAAGAGAAATGCCTTGGTTTCTATAACGGAAATATCATAGTTTGTGTAAAATCCTAAATACTGCTCATCCAGCACAGGCG 
CACCCCCGTCAAAATCCTGCTTCCAATCTTTTCGAGCTGCGCTGCTACCAGTGCTTATTTGCAGTACATACTTATTGTTGCAATGCTAATCGACT 
TGGTGCACGATTTACACTTAAACCTAGTTCGACAAATTAGATGAATAAATTCAGTTGTTCACTTCGTTAGTTTCTTTTCGTTTAACTTTACATAG 
TTAAATAAAAACGGTGCATTTTAAAATTAGCTCGCTCTTTTTTCTTTTACCATTGGTACAATTCTAGTTTTCTGCTTTTGGCACAATCCTGACAA 

C AA AT T GGT T ACT CT GAT T GT C AC T TGT GAAC G A AACT AAAT T G 
(SEQ ID NO: 697) 

Exon: 1001.. 1230 

Exon: 1717. .2394 

Exon: 2451. .3414 

Start ATG: 1724 

Transcript No. : CT19388 

AGCAGCACCATCCCCTGACCCATTTTTTGTTGCACTTTTGGTGTACTTTGAATGAAAAATAGTCAGAGACACTACTCGCGCCTGCGCAGTTTCAA 
CGCCGGGGTTATCAAATAAGCTAATCTAATCGTTCCGTAAACGCAGCCTGACATTCCGCCCTACCATTCTCCGACGAACAGTTAGTTTCGCACGC 
AAGCCGCTCCCGCTCGCATCGTAACCTAGAAAACTTTAGCACACAAAATGACTTCAAAGCTACTGCCCGGAAACATTGTGTACGGAGGTCCTGTG 
ACTGAACGACAGGCCCAGGATAGCAGATCCTTGGGTCAGTACATCCTCGACAAGTACAAGAGCTTTGGCGACCGGACGGTGCTGGTGGATGCCGT 
CAATGGAGTGGAGTACTCTGCCAGTTTCATGCACAAGTCCATTGTACGGCTGGCATACATCCTTCAAAAACTGGGAGTCAAACAGAATGACGTCG 
TTGGTTTGTCTAGCGAAAACAGCGTCAACTTCGCCCTGGCCATGTTCGCTGGTCTAGCAGTTGGGGCTACGGTTGCTCCCCTTAACGTAACATAT 
TCCGATCGTGAGGTGGACCACGCCATTAACTTGTCCAAGCCAAAGATCATATTCGCCTCTAAGATTACCATTGATCGTGTTGCCAAAGTGGCCAG 
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CAAGAATAAGTTCGTCAAGGGCATCATTGCGCTCAGTGGAACTTCCAAGAAATTTAAGAACATCTATGATCTTAAGGAGCTGATGGAGGACGAGA 

AGTTCAAGACACAGCCTGACTTCACGAGCCCTGCGGCCAATAAGGACGAGGACGTGTCTCTTATTGTGTGCTCTTCTGGAACCACCGGACTTCCT 

AAAGGAGTGCAGCTGACCCAAATGAACCTGCTGGCCACTCTCGACTCACAAATCCAACCCACTGTCATTCCAATGGAGGAGGTCACTCTACTCAC 

TGTCATTCCCTGGTTCCACGCCTTCGGCTGTCTGACGCTTATCACCACCGCCTGCGTTGGCGCACGATTGGTATACCTGCCCAAGTTCGAGGAAA 

AGCTCTTCCTTTCTGCCATTGAAAAGTATCGCGTGATGATGGCCTTCATGGTGCCACCACTGATGGTCTTTTTGGCTAAACACCCCATCGTGGAT 

AAGTACGATTTGTCCTCTTTGATGGTCCTGCTGTGTGGAGCAGCTCCACTCAGTCGCGAAACTGAGGATCAGATCAAGGAGCGTATTGGAGTGCC 

ATTCATCCGACAGGGATACGGCCTCAGCGAATCAACGCTGAGTGTTCTGGTGCAGAACGATGAGTTCTGCAAGCCAGGCAGTGTGGGCGTTCTTA 

AGGTGGGAATCTATGCCAAGGTGATCGATCCCGACACCGGCAAGCTATTGGGGGCCAACGAGCGCGGCGAGCTTTGTTTTAAAGGCGACGGCATC 

ATGAAGGGCTACATCGGAGATACGAAGTCCACGCAGACCGCCATCAAGGACGGTTGGTTGCATACTGGCGATATTGGCTACTATGATGATGATTT 

TGAGTTCTTCATCGTGGACCGCATCAAGGAGCTGATCAAATACAAGGGATACCAGGTGCCGCCGGCAGAGATTGAGGCTCTGCTGCTCACCAACG 

ACAAGATTAAGGATGCGGCGGTCATTGGAAAGCCAGACGAGGAGGCTGGCGAGCTGCCGCTGGCATTTGTGGTCAAACAGGCTAATGTTCAACTG 

ACCGAGAACGAAGTCATTCAGTTTGTCAACGACAACGCCTCGCCCGCCAAGCGTCTAAGGGGTGGCGTGATCTTTGTTGACGAAATTCCAAAGAA 

CCCCAGTGGCAAGATTCTGCGTCGCATTCTGCGGGAAATGCTTAAGAAGCAAAAATCCAAGTTGTAA 

(SEQ ID NO: 698) 

Start ATG: 238 

MTSKLLPGNIVYGGPVTERQAQDSRSLGQYILDKYKSFGDRTVLVDAVNGVEYSASFMHKSIVRLAYILQKLGVKQNDVVGLSSENSVNFALAMF 
AGLAVGATVAPLNVTYSDREVDHAINLSKPKIIFASKITIDRVAKVASKNKFVKGIIALSGTSKKFKNIYDLKELMEDEKFKTQPDFTSPAANKD 
EDVSLIVCSSGTTGLPKGVQLTQMNLLATLDSQIQPTVIPMEEVTLLTVIPWFHAFGCLTLITTACVGARLVYLPKFEEKLFLSAIEKYRVMMAF 
MVPPLMVFLAKHPIVDKYDLSSLMVLLCGAAPLSRETEDQIKERIGVPFIRQGYGLSESTLSVLVQNDEFCKPGSVGVLKVGIYAKVIDPDTGKL 
LGANERGELCFKGDGIMKGYIGDTKSTQTAIKDGWLHTGDIGYYDDDFEFFIVDRIKELIKYKGYQVPPAEIEALLLTNDKIKDAAVIGKPDEEA 
GELPLAFVVKQANVQLTENEVIQFVNDNASPAKRLRGGVIFVDEIPKNPSGKILRRILREMLKKQKSKL* 
(SEQ ID NO: 699) 

Name: lucif erase-like 
Classification: enzyme 



Celera Sequence No. : 142000013384303 

CCACTCCCCTCTCCAACACCACACTACCAATACCCTGCCAGCACGTTTGAGTTGCCCGTATTTTTGGGTTATCGCCCGGTCGTATTTGTTTATAG 
AGAAAGAGAGAATTTTTTTTTTTGTCTTTTTTGACCTGACCACCTGTTGCTGGGCGCTGAGCGCCTGTTGCGCTAGCAAATAATCGATCGTCAGT 
TATTTATTGATTTCTGATTGATGGGCCAACCAAGTCCCATCCAATTTGAATCCAAACGACGTGGCTTAAGGGCGGGGGATTAGCGTCTGCTACGT 
GGGATCTTCGTGTGTGCCGGGGCGGATTAGCTGGCTTTGGTCATGCGGCTAACAGAAAAACGGCAGCAGGCAGAGATCGCGTCGCCAGTGGAAAT 
TTGAATAAAATTAACATTACAACTACAGTAGTCGGCGGCACAGAATGCTGATAAGCGCTTAACTTATCAAATGCATACATAAGTGGTTTTCGTTT 
GGTTAACTTTAGTTTATTCATTTTTGAAATGTCTATAACAGAAACCCAAGAAAACATATTTAAACATAAACACTTAGGAGTCTAGGAATCAGGGA 
CGAAGGTAATTTTGGGATATACTCGTATATGTAAATTGTTTTTTTTCTAAGTTTTTTAATTTGTTTTGTTTTTTTTTTTGTAGTTTTGTGACTGT 
TTTAGCAAAACTTATTTAGCTACGCCCTAAGCCTAGGCAATATTATCAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTTTACGATGATGTTT 
TTATTATTTGATCAACGCACTGTTACCCATGCGGCAACTTAATTTGATCTGCTTAAATTATTTTATTTTACAATGTGTCTTGGTTACTTAAGACT 

AACAG AT T T T T AAT C C T AAAAAT G AT AG GG AG AAT T AT AAT AGT T GAT AT AAC AT AGT AAT T AT T ATT T AT T T GAT TAT T CT AAT G AAT T T T AAA 
ACTAAGACTTTCTAGGTCAAAACCAGGTTAGGGAGGTCATGAGGGTGGTGAATATTATCAATACGATCGTTGAAAACCCAACACCCGAAAGTATG 
CTATAAGTACGAACGCCTACTTGCGACTGCCCTTGCGACACAATTGCTGTACCGCCTGCGCTGCCGCCAGGAAACTCTTCTCGGCGCAGCGCTTG 
GGCCAGTCCTAGGACTCGTAGAGCAGCCGCTGATAGTGTGGCAGGACCTCGTGCCGGTTTACATGCAGGCTAAACTCCAGAGCCACAAGCACAGC 
GAACTCCGACGAGATCAGCTCCTTGCGATTCAACCGGAACACGCTCTCCGTCTTCTCGATGAGGCTCTTGAGTGCGTCACCCTTCACGTCATTCA 
TCTTGGCGCTGAGCAGCAGACAGGCGCCGGCGCACAGCTTGCGGTTGCTCTTGTTAATCAGATTGGCCAGGATGAGCTTCTCGAAGTACACGTAC 
GCCTGCGATATGGTCACCAGATCGATGCGCGAGTCCAACTTGTTGATTCGGCGCATTTCGCGCTTAATGCTGCAAGTGAAACGTAAATGGATAAG 
TAGTGTTCCTTTAAATGGTTACTGAAAGAAGGTTATGGGTTCACTTTCTCAATATTACTTTCTATTGAGAGTAACTTTTTCCATTCCTTTTAAAA 
GTGAAACCATAGTCCTTCAGATTCCTTGTCACTTACCTCCTCAGCTTACTCAGTGTGAGCTGGATGGTGGGGAACTTCTCGCGAAACTTATCGTT 
CAGCTCCTTCTTAAGATCGGATGGACGCACATAGTCAATGACCGAGGTCATGTAAGAGGTGAAGGTAAGCAACGTGCGATGCTTTCCGGCAATCA 
ACTCCGGATCGTCCAAAATGCTAGCGGAGTACTGCGGATTACGCAAGACATAACTCTAGTTAGTAAGGATATACATTCATAGAGATAACCATTCA 
ACTTACCACCTGACCACGCGACGAGTCTCCGCCCAGGATAACACTCTCTATGTCCATATCCAGTTTCATGTCTGGCGTCGGACTGGTTGCACTAT 
TGCGATTGTTGTTCTCATAAGTGAAGCACCTGCGAAGTCAATTTCGATTGATTAGGATGCACACTAAATACGAATTATATGCTAAGATAGCGCGT 
GATAGAAGGCGGGATTGGGTTAGTCGAAGCATTGCTGTTCGTCGGATGTGTGCGAGAGTGGGTGGCTGGAGGTGTGTATGTGGGGATAGGTGAAC 
TTGGTTAGTGGGCTAGAAGCTTTTGGCAGTCGACCAGGCTTTGAAACTACAGCAATTCAACTACTGAAGCTAGTAGTAACATAGCTAGGTTGGCT 
GTTTTGTTTTTGTCATATTAGCTGTTTTTCAAAATTTACCATTTGCCTTGACGTTTGGTTATGGTTTTTGTGCTGAAACGAAGTAGTGCAAAGCG 
TTAAATGAATTAGCAAAAGTTAATGGCACATAATGGTTTTATTTTTGTTTTGTTTTGTTTTTTTTTTTTGGGTCAACTATTACCGAACCAAAGCA 
AAGAAAACAATTGAACATTGTAGCTTATCAAGCGATAAAAATATCTTTCGATTTTGGGGATTATATAAATGCGAACAATGTGCTGAGCTTCAGTA 
AATATATAATTATTGGTTTGGGTAACATTCGCGCTCAACTCTTGATGAGTCAGATTTTCAGGTCTTGCTGGGTCAGCTTTATGGCTGTTGACAAA 
CACGAGCTAAACCAGTGTCTCCACTGGCAACCGAGTAATGCGAGAAGGTAACAAAGAAGCATTATGTTTACTTTTCGTTTTAAACGGAATGGCTA 
CTTGCTCTTCGATCGGATAAAGGAATTATTAGTTTACAGTGGTCGTTCGGTATGGCTAACCATGCAATATGACGCTCAATGAATGTTTAGAATGG 
TTCATGGGCATGACATGACATGAAATGGAGCTGATTGAATGTGTTCAACAAAAATGATACGAAAGATGATCCAAAATATGCTGGCAAACAGGAAG 
CCGGCACCAACCTCGCAATGCCATGATTCTTAAGCGCCGCTGTGCTCGTGATCTCCATTTGGTTCTCAAAGATGGAAGGATTCGCCGATGCATTG 
CCATGCTTCTTGCGCTCCTTTTCGTACTGACGTGAAGGTATCAGCAGGTGACCATATGATATGTCCTGTAAAAAACAAGTTGTAGTTAAGATCAG 
TTACAGTTGAGCTAAACTTATTGACTTACCTGACCACTTTCGGCCTTCTGTATACCCAGCAAATCAAACGGATCGAATGGCGCATCGTTGATCGA 
GCTCAAAGGACGCGAAGTGGATGGATTCCGACGACGACGATCCTCCTTGCGAAACTCAGCTCTGCAAATTGGAACATAATTATTATAATACTACA 
CTAATAATTAGAATCTAGTTTTGCATGTCACCTTCCATTCTTGCCCTTGTAATAGGGAAGTGAAGAGAATATGAAGAACGGAATCTTGCGCGACA 
CAAGCACCATTCGCTCATCCCGTGCCTGTCCAAGTCCCAATGGCTGGCCGGAGGTTCCACGCCGGCTATCGCTGATCTGGACCGACTTGCTGAGG 
CTGTTCCGGAAACTGCCCGCATAGTTTCCCGCCGGAGTCAGGCTCTCCGTGCTGCTGTCGTCGGTGATGTGGCTAACGCTCGATCCACTCAAGAC 
ATAGGTGCGCTTCAGACCGCCGCCCATGGCAATGCTCGAATTGGAGCCCACCCGCATACCCGGCGCCGTGTTCAAACGTGCCCTCCGCTCCGGCT 
GCAAGTTCTGATCGCTTGCCCCATTGCTGAAGGTGCTGCAAGCGAGAAGCAGGAGATCGGTTAGAATCGTAGCGATCTAAGGAGCAACAGGCGCT 
CAACAACTCAAGTAGCATGAAAAACGCGCAGAAAACGATACAAAAACGCGCTGGGCTGGACCTTGAAGGCCCCAAGTTGTTGCTGCTTTAAAATA 
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CGATTTTTTTTTGGTATTTATAAATTCAATGACAAATGTTAGCGAAGAGACTAGTGATGTCACTTTAGCTATTAACCAAAAAAACATACTTTTAG 
GATACTTTCTTTAAAGAAATTACTTTTTAAATTTTTAAAATAAAAGCATTGATTATAATGTAACATTACTATTCATAGTTCTACTATACTTATTA 
TAGCCTAACTAGCTATGCCTACGTTCTTAGCTCTAAATTGTTTAGCAATCTGTTTTAGTTTATAGCTAATTTTAGCCATTTTATGGGTATCTTGT 
GGGAGCATCATCTCTGGAACTCTCTTCATCGCGGGGGCCCATTGTCACACCCCTCACGCAATGATCGATATTTTCACAGGCCCGCAGAAACGGAG 
AGAAAAATCAGCGGAGGCAGAAAGATTTTCCCTTGTGCATTTAAAGGGCGATCAGGCCCGCTGCTTATCGCTTGTGTATTTTTGGAAATCAAACA 
AACACATCAACACAAACACACACACGCATACACATTCACACGCTCGTTGGCACACACATGTGCAATGCCGAAGGGCAGCCGCAACGCAAGTCAAT 
TAGCCAACTGAAGTCAACATCTTCACGGCATCTGTGCGCCTGGTTGATTACCTTGGGATAATTGGTCTAGGTCGAGGTTACAGTCAAAAAAAGTG 
TGGTCTCTTGAAGCGGAAGGGGGCATGGGGCAGGGGGGTCTGTGCGCAAACTCCCCTCAAGTTCGTCGTGGTGACGATTCTGCGTTAATTAGCTG 
GATGAAGTGCAAGGCAGCTCGCTTTTCATTAAAAAACCCCTTGCAAACCCAACCCCAAAAATATGATGGAAGTGGCGCCTAGGCTGTCATAACCC 
GTTAATTAGTTGGAGCATTTTAACAGGCGCACAGAATTAGCAGCAGGAATTTAGTTTTACATGCTTACAACAGCAGTGAAGAAATGAAGAGGGTG 
GGGATAGCTCCAAAATGTGTGTGTGTCTGCGTGTGTATGTTTGGCCGAAGAAGAAGAAGTAAAGGGCCGAGCAAACATTTATTTTTGTGCGTTTG 
CTTTTGTGTTTTCTGTTCGTCTAACTATTGACTCGCTAATGAGTTTGTGTGCGTGTGTGTGTGTGTGCGTTGATAAGAGGTTGGGCAAAGTGCGT 
GTGCGAGCGAAAGGGAGTCTATAGACCACCACTGCAACTCAAACAATAAAGCAAGTGAGCATGTATAGGTCCGGTATTTATAGTTTTTCTATTTC 
AATTTCGTTTTTCGAAACTCCGTCTATGAAGTGCTGTGCCCCTGTGCCAAAAGCAAACAATATTTAAATTCTAACAATAGATTTGATGAATTGAA 

AG AAAAAT GAT T C AGC AT T T T T T G AACC G AC AAAG AATT G GAAGAAAGT T C G AC GAAT T T T T T C AAT T CT T C AT GT T AC AT AAC AT T C G AAT AT G 
TACAGTGCTATTTATAGATCGCCAGAAATGCGTTAGAAAATTCTCATAACAGAATCACAGTACTGTAATTTTAACAGAATGTTTTACAAAATTCT 

T T AC AC AT T C T G CT AAT AGC AT AAT GTT T AAAAC AT G AAC AAT CC G AC T T G AC T TC T GT T GT T GAC AT AAC C GAT AT GT T AAAAAT AT AAAAC T C 
AAATCACTTAGCAAATTTGCATGCCTTGTAGCATATAAAGAAAGAAATCCTATAGCTCTTATCTCCCAGCTGTGCAGATCTAAAGATATGTACAT 
ATGTCTTTCTCCTAGAGACTATCTCTCTGATTTTATTTTGATTGCAAATATTGACCAAGAACTGCTACTAAAAGCTTTGGTAAATATACTTTTCA 

C GT C T GC AT G C G AGAGAGC G AGC AAAT AAAT C T GC C C AG AGT T T AAT C GAAG AAAAC AC AAT AT T T T AAT T GC AAT AT C T T T CC AT T T T AAT GGC 
TTTTTCTTTGTGTGTATGTGTGTGTGAAGCAATCAAATCAACTGAGTCAATCGCGCTTTTATCAGCAATCGCTTTTGTTATTGTCATCGTGCCTT 
GT T GTT T T T GC T GT T AT CT GC T GG CT AT GTGC AAAC AGAT GT T AAT GG AAGT AAGT T TT AT T AT AT AAGT TTCCGTTT G ACC AG AT C AC GT GC C C 
ATGAGCTTAAGCATGCAGCTCTGCTGATAAAAGCTTTCAGCCAGACATAAACAAAAACTAAAAAATAGACTAAATTAAATATGCAAATTTATTAA 
AACATGTTTGACAACACCACAACTTTTAAAAAAAATTTCTAAAAACGCCCGCAAACCGCTTTCGAATCGCTAGAGAAACGCTAGCAAACCGCTAG 
CAAATGCAACCAACTTCCATAAAAGTATGTACTCATTTCTTATGTTCATAATATTTTAACTGGAAATAACCACTACTCTACTATTATCTGAAACA 
ACCCTAGTAGAAGATACACTGAAAGTAATAACTAGCAATTGCCACTTGAGGCTTTGTTTTGGCGGGCAATGACGCAAATTGATTCCAACATTATC 

AAT ACT AGC T C C C T T T C ACC T CG AAAC AAAAGTGG AT T T C GG AAC G AAAT GG T GT T CT AG AT AAGGT GG AAAC AC AT C C AGC G AAAAC CGCC GAC 
AAGCAACAATAACAATAATCGCGTTAGGAAAGGAAGGCGTAACCGAAGAATAAGCGTTTTGTAAACAAACAGTGGGAGCAAAACGGAATGCCAAA 
AAACGCCAGCGATTTCGGGCGAGACGGGCAAAAACGTCGAACGAAATGTTTACACAAGCCGAAAGTAATTACATAAAGCAAATAGGCAAAAAAAA 

AAT AAAAAAT T G C CT AT T T AC AAGT CC AAT T GAAGGC T GT CGATAAAAAGC AC AAAC AAAG AAAC AC AAT G AT GTGT CG AT AT G AAAC G AGT AC A 
TTTGTGTTCCATTATTTGATTTCAATACTTTGTGTGAATTAAAAAAATTGGATGTTATAACACGTTCCTACTGTTTTCCTCTTCTACTGACAGAT 
CTTAACTTCAAGAGCAAAAACTTTGGCAAGAGCGAAAACCGAGAAATCAGTGAAACGGGAACCCATTTTATCCGCTTTTAACCGCTTTTACTACC 
TTCAAATACAATGGAACAGATAAAACATAATAAAGCGACTTGAGAATGACGTGGAGTTCGGGCATGTGTCGATTAAATATATATTAAGTCGCAGT 
TACTAGCAGAGAAGAGCATTAAGTTTAGTCGATACGAAGTGCATAGAGCTTTATTTATCGCAAGGGGTTCTTTGTTTTGTGTACTACATATGTGC 
AATGAAGTCCACAAAGGCGGTGCAATAAGTATATTTACGAATCGGATTAGGTGTGTCTATGTCATCTTTGGGCTTATCGTTCATTTCTTCACCGA 

C G AGAG AG AC AAT GG AGC T T T AT GGC GGT C T GT G T T G ACT T T AG C AG AGCT T AC AT AT G C AC AG AAAAC AAT T GGAT AGC T T T T T C AC AAATT T T 
TAAAAAAAAAAGTTTAAGAACAAAGTTTTTTCCAGTGTTTATTAACTAATTGTTCTCCGTGTGTACTGAAAGCAAATCAAGTTCTAGGGGTTGGT 
TGCATAAACTGGGATGCAGTGGCAAAATGCAGCATGCACTTCAAGCTTTTTGCACCACTGACACACATTTCGCAACCCGCCGGCACAAAGGAACC 
CCTTTTTCGAAACGGCGTGTCCTTGAACACAAGGATTAGAACCCCTGAAACGCACAAGTAGTGCGAAT AAAAAT AAC GAAAACAAACAAG AAC AG 
GGAAAACCGGTTCCTATTACGTCGTCGATTGCATAAAACTCTGCCTCAAGGTTACCCAGTTTGATGGAGTGAACGGGATGGGAGGGGGTACGCAG 
AGGATTTTCTCGCTGTTTGGACAGGTTTGCTGGCTGGCTGGGGCGACATTGACATTGAAATTGGTACGAGGCAGGGAGCAGGATGAGGCCACAAA 
AAAAGCTCAAGTTCAATGCTAAGCTTTGCCTTTTGCCGGCTGAAGGTCCTTGGTTGACTGTTGAATGAACAAAAAGGGCCGGGCCACTCCCCTAA 
ATCAAGTTAATTGCCATGACGCAACCTTTGGCCAAAGGAAGTGAAAACTGAGAATGAAAAAGAAAAAAATGATAGATGGGGCAATAAAAAAAAAT 
TGTTTACGTTTTCGGTAGAGTTGCGGATTTTTTTTCAGTTTTTTTTTTTCACTCTTCCATATTCCCATTCATTGGCCAACTCCACATAAAGCAGC 
TCAGGTTTCAAAACAAAAACTGGACATAAGTAGTTTGTTTTGGTTTTGCCAATTGCGACGCAGTTTGTTTTTGCGCCTGCGCAGCTTTTTGTGGC 
AAGATTCCCACACCGAAAAATGCGGACGATTCCGCATAGCACAATATTGTTGTTTGCAACAATGTGCCTCTTCAACAAATTCTGGCATCAACAAA 
ATTGCTATCCCATTCCGGAGCGGAAATAGTAATGAAACCCTAGATTGGTGGGTAAAAAACAGAACTTGCTTTTTGGATTCAACGAAGAGCATAAT 
ATTTTTTTTTCTTTTGCTGCGAAATGAGCGAAACTGGGAATACATATTTGAAATCGCTTAAAGACTTTGTAACTCGATGATGTGGCAAGAAATTC 
ACATTTATTGTTCACGAACTGAGTCATGATAAAGGGTTAATATATAATTTTAAAATCAGCTGACACAAAAAATACCCCTTAATTAAAGTCTTTAA 
CGTCACTGAGATATCTCATCGAGTGCTTTTATAATCTTCGAGGCTTCGATCATTTCCACATGCCAGATAAACCTGCTGAGTGTAAAAGATCTCAT 
GATAGGACTATCCGTCATATATCAAGATCATTCTAACCAATCTTATCAGTTGATTGCTTTGTGTATATACATACATATGTACATATATGATCTAT 

T G C T AGT T C AT C GC T AC GAT AAGGT G C AAAAAT AT C AAAAT T T C AAGCT T T C G AC T T T AT C AAAAT C G AT C AAAG AAC T C AT T GC AAGG AAC C C T 
GATAAAAATGTGTACATTTCACTATTTGCGAAGTATATATTTCTAGTTTTCCCTATGATATTCATTGACTATTTGCTATTCAGCGTGCTAAGTGT 
TATGAATTTGCATTTTTATAGCTCGTTCGATAGTTTAGCTTTCATGGCGCAGTTAGATCGTATATGTATAAATGTATATGTATATTAATAAACGT 
TTTATGGTATAATTAGAGTTATAGTAATAGCATCCTTCTCAAGCTTTTGCATAGTCATTGCCAAACTAAAACCAAATCCCCGGGAATTTGATTTG 
CCACAGCAAATGGCAATGACAGGTCGCTGACAAGCCAATTGGCTGCAAGCCAAATGCATATCTACAAGTATGTAACCAACGGATGGATGACAATA 
AGTCCCCTTCCCCACCCCTGCGGAATCCGAGGAAAAACGGGTTAGCCACCAGTTGAGTGGCGGCGGCATGCGAAAGAAACCGAAAACGAAAACGC 
TAGGCAAAAATAGCGAAACGCCAGCGGAGCACGACTGCAAAGCTGCATGTGGCAAACAAGGTCAGTTAGCCAAAGCGCATCCAACGAGTTCAACG 
AAACTTGTATCTGGCATTGGAAGTGAAAGCTCTGAGATGCACCTTAGCTTTGCCCAGCTTTATGCACGGAGAAAGACAAAGAAAGTCCCCCCGAG 
TAGGGGGGGGGGGGGTAGAGTGGGGCGTAAGTTCAAGGCAGTTTGCGAGCGGTTTACGAGCGTTTCGCTAGCGATTTACGAGCGATTCGATGGCA 
CACGCATTTGGACATTGCGTAATTACTTTGCGCTAATACAACCAGATATAATGCCCATGATTCAACGTTGACAGGCCGTTTAAGTGCGTGACAAA 
ATGACTGGCGCCTCGGACAAAAGTTCCGAATTAAACGAACACCGCGACGCCCCTCGACATGCAATTCAATTCAACTTAATTCAAACAATTGTTTG 
CACAGCTTTCGGCCGGGCACGCTCTCTAATTTACGTTAATTGAGCCGGCACACGTTGATGAATGAAGCCGATTCGATTACAGGCCCCACCCTCCC 
ATCCAACGCCCAACCCCTTGCCCCTCATGAGATCAAAATATTTGATGTGTATTTTTGGGGTAGGCGATATTTTTAGCGCCAAGCGACCGGCAACA 
GACAGACTCCTAATTATGCAGCATCAGGCGTTGAAAACGTGCAAATAGTCCCTCGCCATCTGCGAAAATACGTCACAATCGGAAAAGCGCTCTCT 
CAATTTTCACAAGCTTATGTTTATAAATAATTTTCGATTTCAGAACGCCCAGCGAAGTGTTTTGCTTTCGCGGTACTTTCGATCAAGCTGAGCCT 
TTTTCGTAAGCAAATTAGAGAACCAGTGTCAATTGGCTAACGTAAACTTAGCGGAAATTTCAATTAACAAATTTGGGGAGGGGAAAAATCGATGC 
CAATTATGACGACAGCTTAATTGAACTCAACAAAAGTTCAGTGATCAAGCGGAATGATTCATTCATTCTCGTAGCGTAGAAATTATTTATAAATC 
AACAAAGAGCAAAGGAATATTAAGCGAATTTCATTCGTACGTGTATGGTTGGAAAATATTGCGATTGGACTAAGGAAAACTGTCCACTTGACGTC 
ACGTGTCAATCTATTTCTGAGTGTCTACAAGATAGAAGATCGTTTAATTGGAAAAACATAAATTTGAAGCTGCAATAACTTAACACATTTCCTGT 
TTGAATATTTTGGTAGGTTCCCTTATAGTCGATCTAATATAACATTTATAATATTATATAAACTTATGGTAAGAATCTTTACAGTTGCCTCTTCA 
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TACAGCTAATTTGATTACAACTTCTAGCAACTGCTTAACAAAATGCTGATTAGAAACCATAATAAGCTAAGATAGTTCACTGATATGACAAACAT 
GAATTGTGCATTGATTACTCACCGTTCCCTCAGCGGCAATAGCTTGCTGCCCGCTCCGCCGGCCACGCCACCGCCGCTGCCCAGATTGGACACCT 
TGAGCGGTATCTTGACGCTGTCCGAATCGCTGCCACTCTCGGCACCCGATCCTCCGCTGCCGGCGGCGCAACTTCGTGCCTGCCGCTGTTGCTGC 
TGCAACTTGTGCTGCTGCTGCTGCTGCTGTTGGAGCTTATTGCCCATATTGCCATTGCCCGGTGCCTGGTGGTTATGACGACCCGACGAGGTCGG 
GCGCTTTATGTTGCGTCTCTCGGCCGGCACAATGCTGCTGCGATTGTGATCGGCGGCCGGCAGGACGTTCATCAGGTATTGACCCATATTTTCAG 
TTTCACTAAAATGCCCGTCCCCATCGCTGGCACCGGTGGTGCAGTCCTCCAGTCCCAATCCCAGCCCGATGCCGCCATCGATGCCCATGTGGACA 
TCGGCGCTGCAGAGGAGCGGCGAACAGAGGGCGCCATTCTGGTGTTGCTGCTGCTGCTGATGCTGCTGCTGATGTTGCTGCTGCTGCTGCTGCTG 
TTGGGTGTGTTGCTGTTGTTGCTGTTGCTGCTGCTGCTGCGTGTGATGCCGCTTCTGGTGGTTGTGGTGCGTCGCCGAGTGGCTGCGCTGATGGT 
GGTGCTGCTGATAGTGCGGATGAAAGCCCCCCGAGCCGAGTCCCAGCGAGGATGGGTTCTTATTCTGGTTGCAGAGCAGGCTGCTGCCGAATGTT 
GCGCCGAATTTGGTGTCCCGATGGGTGCCATCTAACGAGATATTCGATAGGAACGAGATCGCAGCCAGTCTGCGCCTGTGTCTCGTCTTGTTGAG 
CGAACTTGCCATTAGGCGATGAACCCTTAATTACCAAGCGAGATTAACCGAAGTGGGTGCTGTGGGTCGTTTCCCTGGATGCTCCGATGTTGCTA 
TATATGTACCTATATGTATATATAGTGTGTTTGTGTGTGTGTGCATATACTGTGTGGGAATGTGATGTTGCACTTTTCTGGAACACTCTTCGCGA 
TTCTTTACGAAAAAACTGGTTTCGAAAGTGATTATGCTGCTGTTGTGTATTTTCACTGCTGGTTGCTCACTTTTCTTGATCGAATCAATTAAGTT 
TTTTAGTCTCTAACTTGGCAGCCGAATTAAAAAACAAACTACTTGAACGCGACATGGTGACGAATTTTTAGCACTTTAGATGACGGAAGATGGTT 
GGAATACGAATAATTTTATGCACTTTTATTATTAACTAGCGTTTATGCGTCGATGCGTTGCTTTGGCTGCAAGTAGAAATAAGAACAGAAGAGAG 
GAATGGAATCAATAAATGAGACTATAAAAAGCGGCAGGAAAGCTGCTTTTGGCCAAGAGAAGCTTTCTCCGAGCGTTTCTGGAATATTCGTTGAA 

GT G C AG T GG GT GGC T TAGGGG AAAT T GT GGT C AG C AC AT T C T C AT T CT C AG T GG GC AAAAT AT T G AAT AT G AT AT AAT AT T GT T AGC AG AAGT C A 
TCTCAATATGATATTTTATAATTAGCAATGTTTTTAAGCTATTTTTAAGTGAATATGCTATGGTTTTTTCAACTCTTTTAGATTATTTACAATTT 
TTAGACATAATAATCATTTTAAAAAATCTCATTTGAAAACAACGGGCATTTTTAATTTTAAGAATTAAAAGTTTTCCTTGAAATAAACCCGAATC 
TTTGCTGGGATTTTAGCCAACTGAATTGATAAAGCTTTTCCGCTGGCAGTTAAAAAAGCTGCATGCGGCTCAAAGCTGCAGTGGTTTTAGCCGCC 
TTTGTTGTTGCTGTTGTTGCTGGCGGGCGGCTTAGTGTGTCAGTTTAGTGGGTGGCAACAGTGCGAAGAGAGGGGGTGGCAAAGAGAGAGATATT 
GGGCCACAGCCGGCAAGGAAAATCTAAGCGCTTTTCCGCACTTTCGTGTGTGTTTTCTGTTTACATTCAATACGGGAGGCCCTAAAATGCCCAAA 
ATTTTCCCTCGGGCAACAAGAGCGAGTGGAAAATTTTTGAGAACCATCATAAACAGCTGTTCGGGAAGCGGGGAAATAGATTTTCCGGTGGTGGT 
TTTTTTTTACCTAGGATCCAGTCTAATTGGATTTACCTGGACCAATTGGAGTGCGGCGGCACACACAAGTTTGAGCCTGTGCGAAAAATATAAAA 
T C AAT GC AGG AAC AAGT GT GT GC GAAAAT AT T T AT GAC ACC C AC C C AC T C AC AC GC T C GC AC AC AC AC AC T T G C AAGC AC AC GC AC ACGT GT GAC 
GTGTTTTTTCGTGTTTTCCCCTCGTCTTTTACTTACCACTGTGCTTGGTTTTTCCACGGTCGTTGGGTATTACTTAAAACTTAATTCGATTGATA 
GTCGTTCTATGCTGAGAAATGTTTCAAGCGCGTCATTTTGATCTATTTTTATAGTGTTTTTTTCGATAGACTTTGATTTTTTGCCTTTGGCTGCC 
AACTCGCCTGCGCTCCTCACTCTCTTTGGCTCTCTTTTGCACGGCTGTGTGTGTGTGTGTGTGTGTTGGTGCACTCACGCAGTTGCCTTCGTTGC 
GTAAATTATGCAAAATGCAGGCCAATTGCAATGTGTTTACAAAAACAGAACAAACGATCGCGTTAAGTGTGTTTTTTGCGATTTTACGAGCGCGT 

T AAC T GGT C T CT GC G AAAT AC G CAAAC GGT ATG AAC GT C AAAAT AT T GGC AGC AGC GAC G AAAAC AT CT GC AG AT GGG CT T AC G AAAAAT T C AC A 
AACTGAGGCCAGTGTTGGGCAGCGGACGAAGGGCGAAATACCAATTCAAAGGGAGGCAAAATATACCGCAATGGTACTCTAGAAAATTCACGATT 
GTTTTGGTTTGTCCATAACTTCAACGTTTAAATAAGGTGCACTTAGAAAATTCTAGATGTTCACAATTTGTAAAAAAAATGTACACAAGCCAAAG 
TTATCTTAGTTTTATGAACAATTTTTCTGTATTCTTAAAATATTGTTTACATGTTAATTAGGATTTAATTACTTTGGTATTAGCTGCGGTATCTT 
AATACCACTACTTTTCAAACCTTCTGGCAACGCCATTTCGATCAGCTGTTTTCGTGGGACGCGCATGAGAAAATTATTGTTTGCGGCATTTCATG 
AAAATCGCGAAAAATGGTGACAAAAACGATAAAATCTCTGAATTTAGAGATCAATTTCGTAAGTACCATGAAATATATGTAAATCACAACTAACC 
TATCAAGTTTTTCACACACAGGAAGTGGAGGATGTGCCGTATGAGGAGGAAATCCTTCGAAACGCCTACTCCGTAAAACATTGGCTGCGCTACAT 
TGATCACAAAGCGAAGGCGCCGAATAATGGTGTTAACATGGTCTATGAGCGTGCTCTGAAAGAGCTTCCTGGCAGCTATAAGATCTGGCACAATT 
ACCTGAGAACTCGCAGGAAGCAGGTGCGCGGCAAGATACCCACGGATCCCATGTACGAGGAGGTCAACAGTGCCTTCGAACGTGCACTGGTGTTT 
ATGCACAAAATGCCACGTATCTGGATGGACTACGGTGCCTTCATGACCTCGCAATGCAAGATTACCCGCACCCGGCACGTTTTCGATCGGGCTCT 
AAGGGCTCTGCCCATCACGCAGCACGGCAGGATTTGGCCACTGTATTTGCAGTTTGTACGCCGCTTTGAAATGCCGGAAACAGCACTGCGAGTCT 

ACCGTCGATATCTGAAACTCTTTC 
(SEQ ID NO: 700) 

Exon: 13464.. 13052 
Exon: 12036.. 10758 
Exon: 3740.. 3357 
Exon: 3291.. 3165 
Exon: 3105.. 2957 
Exon: 2024 -.1907 
Exon: 1836. .1652 
Exon: 1494.. 1001 

Start ATG: 11602 (Reverse strand: CAT) 
Transcript No. : CT19424 

AGATGTTTTCGTCGCTGCTGCCAATATTTTGACGTTCATACCGTTTGCGTATTTCGCAGAGACCAGTTAACGCGCTCGTAAAATCGCAAAAAACA 
CACTTAACGCGATCGTTTGTTCTGTTTTTGTAAACACATTGCAATTGGCCTGCATTTTGCATAATTTACGCAACGAAGGCAACTGCGTGAGTGCA 
CCAACACACACACACACACACACAGCCGTGCAAAAGAGAGCCAAAGAGAGTGAGGAGCGCAGGCGAGTTGGCAGCCAAAGGCAAAAAATCAAAGT 
CTATCGAAAAAAACACTATAAAAATAGATCAAAATGACGCGCTTGAAACATTTCTCAGCATAGAACGACTATCAATCGAATTAAGTTTTAAGTAA 
TACCCAACGACCGTGGAAAAACCAAGCACAGTGCCAAAGCAACGCATCGACGCATAAACGCTAGTTAATAATAAAAGTGCATAAAATTATTCGTA 
TTCCAACCATCTTCCGTCATCTAAAGTGCTAAAAATTCGTCACCATGTCGCGTTCAAGTAGTTTGTTTTTTAATTCGGCTGCCAAGTTAGAGACT 
AAAAAACTTAATTGATTCGATCAAGAAAAGTGAGCAACCAGCAGTGAAAATACACAACAGCAGCATAATCACTTTCGAAACCAGTTTTTTCGTAA 
AGAATCGCGAAGAGTGTTCCAGAAAAGTGCAACATCACATTCCCACACAGTATATGCACACACACACAAACACACTATATATACATATAGGTACA 
TATATAGCAACATCGGAGCATCCAGGGAAACGACCCACAGCACCCACTTCGGTTAATCTCGCTTGGTAATTAAGGGTTCATCGCCTAATGGCAAG 
TTCGCTCAACAAGACGAGACACAGGCGCAGACTGGCTGCGATCTCGTTCCTATCGAATATCTCGTTAGATGGCACCCATCGGGACACCAAATTCG 
GCGCAACATTCGGCAGCAGCCTGCTCTGCAACCAGAATAAGAACCCATCCTCGCTGGGACTCGGCTCGGGGGGCTTTCATCCGCACTATCAGCAG 
CACCACCATCAGCGCAGCCACTCGGCGACGCACCACAACCACCAGAAGCGGCATCACACGCAGCAGCAGCAGCAACAGCAACAACAGCAACACAC 
CCAACAGCAGCAGCAGCAGCAGCAACATCAGCAGCAGCATCAGCAGCAGCAGCAACACCAGAATGGCGCCCTCTGTTCGCCGCTCCTCTGCAGCG 
CCGATGTCCACATGGGCATCGATGGCGGCATCGGGCTGGGATTGGGACTGGAGGACTGCACCACCGGTGCCAGCGATGGGGACGGGCATTTTAGT 
GAAACTGAAAATATGGGTCAATACCTGATGAACGTCCTGCCGGCCGCCGATCACAATCGCAGCAGCATTGTGCCGGCCGAGAGACGCAACATAAA 
GCGCCCGACCTCGTCGGGTCGTCATAACCACCAGGCACCGGGCAATGGCAATATGGGCAATAAGCTCCAACAGCAGCAGCAGCAGCAGCACAAGT 
TGCAGCAGCAACAGCGGCAGGCACGAAGTTGCGCCGCCGGCAGCGGAGGATCGGGTGCCGAGAGTGGCAGCGATTCGGACAGCGTCAAGATACCG 
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CTCAAGGTGTCCAATCTGGGCAGCGGCGGTGGCGTGGCCGGCGGAGCGGGCAGCAAGCTATTGCCGCTGAGGGAACGCACCTTCAGCAATGGGGC 
AAGCGATCAGAACTTGCAGCCGGAGCGGAGGGCACGTTTGAACACGGCGCCGGGTATGCGGGTGGGCTCCAATTCGAGCATTGCCATGGGCGGCG 
GTCTGAAGCGCACCTATGTCTTGAGTGGATCGAGCGTTAGCCACATCACCGACGACAGCAGCACGGAGAGCCTGACTCCGGCGGGAAACTATGCG 
GGCAGTTTCCGGAACAGCCTCAGCAAGTCGGTCCAGATCAGCGATAGCCGGCGTGGAACCTCCGGCCAGCCATTGGGACTTGGACAGGCACGGGA 
TGAGCGAATGGTGCTTGTGTCGCGCAAGATTCCGTTCTTCATATTCTCTTCACTTCCCTATTACAAGGGCAAGAATGGAAGAGCTGAGTTTCGCA 
AGGAGGATCGTCGTCGTCGGAATCCATCCACTTCGCGTCCTTTGAGCTCGATCAACGATGCGCCATTCGATCCGTTTGATTTGCTGGGTATACAG 
AAGGCCGAAAGTGGTCAGGACATATCATATGGTCACCTGCTGATACCTTCACGTCAGTACGAAAAGGAGCGCAAGAAGCATGGCAATGCATCGGC 
GAATCCTTCCATCTTTGAGAACCAAATGGAGATCACGAGCACAGCGGCGCTTAAGAATCATGGCATTGCGAGGTGCTTCACTTATGAGAACAACA 
ATCGCAATAGTGCAACCAGTCCGACGCCAGACATGAAACTGGATATGGACATAGAGAGTGTTATCCTGGGCGGAGACTCGTCGCGTGGTCAGGTG 
TACTCCGCTAGCATTTTGGACGATCCGGAGTTGATTGCCGGAAAGCATCGCACGTTGCTTACCTTCACCTCTTACATGACCTCGGTCATTGACTA 
TGTGCGTCCATCCGATCTTAAGAAGGAGCTGAACGATAAGTTTCGCGAGAAGTTCCCCACCATCCAGCTCACACTGAGTAAGCTGAGGAGCATTA 

AGC GC G AAAT GCGCC G AAT C AAC AAG T T G GAC T C GC GCAT C GATC T GGT G ACC AT AT C GC AGG C GT AC GT G T AC T T C GAG AAG C T CAT C C TGGC C 
AATCTGATTAACAAGAGCAACCGCAAGCTGTGCGCCGGCGCCTGTCTGCTGCTCAGCGCCAAGATGAATGACGTGAAGGGTGACGCACTCAAGAG 
CCTCATCGAGAAGACGGAGAGCGTGTTCCGGTTGAATCGCAAGGAGCTGATCTCGTCGGAGTTCGCTGTGCTTGTGGCTCTGGAGTTTAGCCTGC 
ATGTAAACCGGCACGAGGTCCTGCCACACTATCAGCGGCTGCTCTACGAGTCCTAGGACTGGCCCAAGCGCTGCGCCGAGAAGAGTTTCCTGGCG 
GCAGCGCAGGCGGTACAGCAATTGTGTCGCAAGGGCAGTCGCAAGTAGGCGTTCGTACTTATAGCATACTTTCGGGTGTTGGGTTTTCAACGATC 

GTATTGATAATATT 
(SEQ ID NO: 701) 

Start ATG: 848 (Reverse strand: CAT) 

MASSLNKTRHRRRLAAISFLSNISLDGTHRDTKFGATFGSSLLCNQNKNPSSLGLGSGGFHPHYQQHHHQRSHSATHHNHQKRHHTQQQQQQQQQ 
QHTQQQQQQQQHQQQHQQQQQHQNGALCSPLLCSADVHMGIDGGIGLGLGLEDCTTGASDGDGHFSETENMGQYLMNVLPAADHNRSSIVPAERR 
NIKRPTSSGRHNHQAPGHGNMGNKLQQQQQQQHKLQQQQRQARSCAAGSGGSGAESGSDSDSVKIPLKVSNLGSGGGVAGGAGSKLLPLRERTFS 
NGASDQNLQPERRARLNTAPGMRVGSNSSIAMGGGLKRTYVLSGSSVSHITDDSSTESLTPAGNYAGSFRNSLSKSVQISDSRRGTSGQPLGLGQ 
ARDERMVLVSRKIPFFIFSSLPYYKGKNGRAEFRKEDRRRRNPSTSRPLSSINDAPFDPFDLLGIQKAESGQDISYGHLLIPSRQYEKERKKHGN 
ASANPSIFENQMEITSTAALKNHGIARCFTYENNNRNSATSPTPDMKLDMDIESVILGGDSSRGQVYSASILDDPELIAGKHRTLLTFTSYMTSV 
IDYVRPSDLKKELNDKFREKFPTIQLTLSKLRSIKREMRRINKLDSRIDLVTISQAYVYFEKLILANLINKSNRKLCAGACLLLSAKMNDVKGDA 
LKSLIEKTESVFRLNRKELISSEFAVLVALEFSLHVNRHEVLPHYQRLLYES* 
(SEQ ID NO: 702) 

Classification: known_f lybase_gene 
Gene Symbol: BcDNA: GH06193 
FlyBase ID: FBgn0027581 



Celera Sequence No. : 142000013384674 

TGTACAACAGCCCTAATATTAGATATTTAATGTTGTTCATATAACAAATATCCCGAACATCAACAATATTTCCTTATGATCTAATGACCCTTATG 
ATCAGTTAAGCTTTCAGCCCTTTATGCAATTGGTTATTTCGGTCCAGAAGGCCTGAACTTCAGGATCCTGCATTAAACTACGTTTCTTAGCGTTC 
AAAACTATGTTGAGGATGTCGGCATTTTTGCAGCAATCCCGTCTGGCGGATTTGCCATCCGGCGCCTCCGGGGAAAAGCCTTCCGTCTCCTGGTC 
ATCCATTGGATCCACTTCACCCGCCGGACTGCCCAATTGTCCAACTGCAAAATAACAATGTATTTGGGGGAATCTATTGCTTAATCAACTTAAAT 
ACCTTTCTTGCACTTGGTCGCCTCTGTTTTCGGTTTTTCTGTCCGTTTTTGGTAGGGTTTTCTTGGTTTTGAGACATCTTTCGTCTTGCTGCGCT 
TGGCATTCGCATCGGTGGGCAGCAATTCGGGACTCTCGGCGGGATTATCCGCATCCTGCTGCAGGCGCCGCTTGACGTGCAGATTCAGGCGCTCC 
TTGATGTTTTGCTGTTCCAGTTTTATTTGCACACCAGTGGTTTCTGCCATGTTTTTGCGTTTTGAATATAAAAACAATATCCAAAACAGTCCACC 
AGTGTGACCCCTAAATAAATCTTACAAAAAAATAAACAGTAACGTAAGATACAACTTTCTCACTTGTATTTGATTGTTCCTAATTATTTCTAACT 
ATGTAAATATCGTTAGTATTGGATGCAAAGCTATGGCGGTAGACGTAAGGAAGGAATTATCAGTGCAATTTAGAAGTTAAATAAATTTGTTATCT 
AATAATTATTTGGCAAAATTTGTGGCAGTGTGAACGGAACTGGGTTGAGTCTCCGCTACCCCATATCCGAGCCACGCACCCCAGGTCTGCATTTG 
TGTCGATGGCAAATCTGGTATTTTTCAGCGACGCCATGTTGGATTGCGAACAACAGTTAACAGCAGCAAACGAGAAATCAGCTCGCAGCGCAGAA 
AAATTAAAAAATATCGCCGTAAAAGTGAGCCCAACCGACCCAAAACATATGAAAAGTGTGAATTAAAGGACAATAAAGATATAAATCATTAATTA 
AGCTTTGAGCAAGCGCGAAAAGTACGCGAAATCAACGTTTTGCAAAACGTTCGCCATACGCGTCGCCATTGGTGTGAGCGATAGCTTGCCTGTGT 
GTGCGTGTGGTATTGGAGGAGGTGAGAGCGCCTGTGTGTGTGTGTGTTTGCGTGCGTGTGTGAGAGCGAGCGAGCGGCCCCTATGAAAAGCAACA 

GC AAC AAC GC C AGC AAAG GAC C C G C AAC AACAAC AAC AAC AAC AGC AAG AGT AAC AAG C AC AGC G T T T T AAT AT C C AAAT AAT GAG TGTCGCTGC 
AATGACTATGGACGATCAAAGACCCTGTATGAACAGTTACGACAAAATGCCCCCGACCAAATACGAGCAAAACCTAAATATCCTAAACAGCAGCC 
AAAACAGTGGGGCCACTGGCGGCCCCGCCTCCCCAACGCCCTCCGGCTTGGAGGACCACCATCATCATCACCATCCGCATCCGCACCATCATCAC 
CATCAGGAGCAGCAGCAGCAGCAGCAGCAACAGCAACACTTGCAGCAGCAGCAACAACAGCAGCAGCAGCAACATGCAGCAGCTGTTGCAGAGGC 
TGTTGCAGCCGCCGAACAGCGACAGCGCTTGCTCGAAGGTATTTTGCACATTTTGGTACACTGCTATATGATCAGAGATCTATCTGATCTATATA 
ATACCCCCTATATACTACATACATACTCGTTTCGAATACCGACAAAAAAAAAAAGAAAAGGAAAAAACCACGATCCAAAGTGTTGACTCTGTGTA 
AATTGTATAAATAATGCAATATATAAACAACAAGTAAACGTAGTCTTTTAGTTTATTTCTGCCCCCACTCCACTTGGGTTTCTTGTGTCATTTGA 
ATTTGTTTATGAAAAACACTCGGTAAAGAACTCAACGTATATATTATGTGAATCGAATCGAAGCGAACACTCTTTGCAACGAACATTCTTTCAGC 
GGCGTTTTGGTTAACTTTATTTATTTATTTTTTTCAACACCACCTCGAGCAAACCGTCGAATGTAGTGAAAGTGCTTTCAAAAAGGACCATGCAT 
ATAATAATCTATAATACCTAAATATGAACTAAGTTAAAAGGCAACTGACGCCTAAAGTTATCAAGTGATTACAAAGTGTTGTGCAACAGAACTAT 
CTAAAATATAAAATATGAGTTAAATAATCATAAATAATATATGAGAATCGTAACCAATTATAACCAGTAAGCTGAGCTAAGCTTTAATATTGAAA 
GCAATCGTACTTAAAATAAAAAATTATAAAATGAATCATAAGAAGATAAGGTTAACGAAGTAAAGTTTAAGATTTTATATAATCTTAATGAGAAA 
ACCGGGCTGATAATAACATCAAATGAATAGAGCATATCAAGTAATAATAGTAGAGAATCCTAATCGGAGATTTTCCTCGTTTTCCGTCGTTTTTG 
CGGTGTACAGTTTTCGTTTTGCGACCACTTTTCGTTTGCCTACGTGCCCGTTGAAGAAGAAGAAGAAGCACAGCAGCAGAGAAGTGGAGAGGAGA 
AGAAGAGAGCCAACGTGTCTGCGTCCCCTGCCCCACCCCCTGGACCCATCTTATTCTCTCTCTTCGCGCCCCTCACGCTCTTTTCGTCGCGCCTA 
AATATAAATAATAATATTAAATAATCAATTGAAAATGTGACAGTGCATAACCAAAGAGAAAATAAAGTTGAAATTCTTTTTCTGTTGAAATTGTG 
TTAAAGAAGTGAACGTCAAATGAAAAAAACATTGCCTCGTGTGTTCAAACAAATACAAATGTTTAAATAAATGGCGCCAAAATACGGCGAACATG 
TGTGCGTTTGTGTGTGAGTGTGTATATTCACAATTGTTAAGCTTACTTAATTGATTTTTAAACGCTTAATTGGCCGCCAATCAATCAATTAACAT 
GCAATGTGGGACACACGCACATGCACACATTTGTTTGCCCCGATCAAATAGTATATATAAATATAAAATAATTGCTTGGCTAAGTAAATACAAAA 
GCGAAGTTATATATGTACGTGTTTGCACAACAAATCCAACAAAAAATGCAAAGTCAATAAACTTTTTGTATGACAAAAGAGTGTTCGAAAATGCC 
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GCAAAAAAGAGAAGGGGAAAAGCTTTTGAAATCCATGCATACATACATCCATGCATAGGCATACGAAAAAGCTTCCGAAAGCCCATTGTGTGTGC 
GTCGAGTTTTTTGCAAGTGTTTTTCAAAAATATTTTCAGTTTTGCCACAAAAAATATGTATACCAATCAGTCATCGTTGTTTTCCACAGTGTATG 
CAAGTTCGAAAATGCCGCGAAGATTCAAATTCAATCGCCAA 

ATAATGGCAGGTGTTCAAGTTATTCAAGTGCCAATGATAATGCCAGAGATAACGAAGCAATCGAAATCGAAATCCAAACCGCAAGCGAATTCGAT 
ACCGAAATCGATATCGTGTTGTCCAAGTTCAAGTGGTTCAAGTTGTGACACTCGCATCGAATCAAAATCTAAATCAATATCATTATCAGCCAGAG 
ATTGACGACGGGATGCGGCCTTTTCCACCGACCATGTCACGAACTTACACGAATGATTGCAACGCATTGGTATGATTTCGTATTAAGTTCCACCT 
AAAAAAACAAACACTTTGCCTTGCCCATTAAGAACTTGTAAAATATCTATGTATATATAGTACGATATTGTTATATCTCGTGTGCGAACCAAAAA 
AAAAAAAAAAAGACAGTGGCTCTCTCAGCTTGGAACGCCCGTTATCTAGACCCCATTATTAGCTAATCCTTATCCTTCGATTCGCAGATGAGATC 
GAGAATCTCAAGCTGGAACAGGTCCGCATGGCTCAGCAATGCGCGGACGCCCAGCGACGCGAGAAGATCCTCATGCGACGGCTGGCCAACAAGGA 
GCAAGAATTCCAGGACTATGTGGTGGGTACTCGCCTTAATATACACAAATTTCTCAAATACTAATGCACCCGTAATCCCCATTAGAGCCAAATTG 
CCGAGTACAAGGCCCAACAGGCGCCAACTGCCCTCGCATTGCGCACTGCTCTGCTCGATCCTGCGGTCAATCTCCTGTTCGAGCGGCTCAAGAAG 
GAACTTAAGGCCACCAAGGCCAAGCTGGAGGAGACCCAGAACGAGCTGTCCGCATGGAAGTTCACGCCGGACTCTAATACCGGCAAACGCCTAAT 
GGCCAAGTGCCGACTGCTCTATCAGGAGAACGAGGAGCTGGGCAAGATGACCTCCAACGGCAGACTGGCCAAGCTGGAGACCGAACTGGCCATGC 
AGAAGAGCTTCAGCGAAGAGGTCAAGAAATCGCAGTCGGGTAGGTTAAATCTGAACTAGGATTGGGATCGAAACTTATAGATATGTAAACTCCTT 
TACCATTTAGAGTTGGACGACTTCCTGCAGGAGCTGGACGAGGATGTGGAGGGCATGCAGAGCACCATTCTGTTTTTGCAGCAAGAACTAAAGAC 
CACACGCGATCGCATACAAACGCTTGAGAAGGAGAACGCCCAGTTGAAACAGGCCATCAAGGATGAGGTTGTGGCTCCAGCGGCTGCCACCAACG 

GT GGCAC C AAC AC G AC G AT AAAT AAG C T AG AG AC C AT T C AC G AGG ACGCT T GC AT GGC GAAT AAC C C AAC G AAT CC G GATT GT T ACAAT GGG AAT 
ACCAACAACGAACAGATCGCGGCCGTGCCACAGATACCACTGTCGGATGATGGCAGCAATATGAACGGAAATGCAGCGCGATTAGCCCGCAAACG 
AAACTACCAAGAGGAGGAAGCCCTTCCAACGGTAGTGGTGGTGCCGACTCCCACCCCAGTAGGAAATAACGTCCAGGAAGCGCCGCCCATCAGGG 
AAGTTACCGCCCCACGTACTCTGCCGCCCAAAAAGTCCAAGCTGCGAGGCATTACCACGCGTCGCAACTCCCAGCTGGAGGAGGATCATCAGCCA 
GTGACAACACCAGTGGCCGTCCCCATGATCGTGGACAACGCTGTGGCCGGAATGGCGAGCGAAGAGGCTGCTGCCGCCGCTGCGGCTGTTAATAA 
CAGCAATACGGGAATAATCCCGGAAACGGGAGTGCAAGTGGGTGTCCCAGTTGAGGGCGGCGATCCAGGTGCTCCGGCGGCACCGGGCCGCATCC 
TGACCCGTCGTCGGTCCGTCCGCATGCAGCAGAACGGCAGCGGAGCAGTCGACTACTCCACCTAAGCGCTGGCGAACCTTTTTCCAAAAAAGGGT 
GCACTCCATATGAGTTCGTCAAATTGATCGTTAATTCCTGACGTTTACTTTTGTGAATTGAGATTTCAAAACAGAGAGAGACTTCATTTTATATT 
TCTAACCTATCTTTTAGATTCTTTAATTTTTTTTACACATTGTCTTGTAAAATACGTTTATTATTTCGTTTGCATGCGGCGTAGACATACATAAT 
TCGCAAATTATTTAGAAGTCACTGCTTTACATACGAAATGCATTATGTAGAACAGTGTAAAACTTCAGCACATATATTTACAATAATACGATCGA 
GCTAAATGATAATGCTAAAAACTGATACACTATTTATGTTTTTTAACCACGGCATCTACTGAAAAGCTGAAAAGAAAGCTTGGTCAAAACAAAAC 

TTGTCATT T AGC AAAAT T AGGT T TT AAT T AT C AAC T T AAGG C AT T T T T GT AAT AC TAT G AAAGT T G TGT AAAAT AAT AT AAAC T T GT GT T AAAAT 
AAATTGTTAAGCATAACAAATATTTCAAAAAACCAGTGTCCTTTAATTTCAATTGCCGCCCAGCAAAACAATCGTTTTGATTTTCGAAATTATCT 
TGTGTTTATTATCAGTTTCTTCTTTTAGCACATTAGACAAATGGTTTTACTTGTATTTGATTGAATTCGAAATATGTTTATGTATGTAGTACAAT 
TGTGGCTGCCAGCGAGTGTGTTTGTATCCTTTGTTTCGTTTGTGTCTGTGAGAAGAGTTTAAACTTGAGATCGGGCTAAAATGTTCTGGTAAAGA 
GGCCTGCAAGCTTGGCATAGTTTGATATTTGCTTTACCCTACTTTCAATCGAAATGTACAAAATGAGTAGGGATGTGGTGAGACAACCTGCAAGG 
GGGTTAAACGTACTTCCGATTCCATATTAAATAGTTAATAACAAATACTAGGCAGTTCGAATGGCACAAGTTGATTACACTACCATGGGGAAATG 
TGGGGTGTCTAAAGCTTAGCGAGTTCTGTTCCGATCGGAGGAGATATCCTTTCGGCTGAGAGTTTGTTTGTCTAGATGTACGATAGTTTGTTATG 
GTAAATTTCAACTAGTCAATGGTTTAAGTGTAGTATTATTAATAATTTAGTCGCTAGATGCTGCTGCGTTTACTCAATCGTCTCAGTTCTGTGGC 
TCGTATCAAGTAGTTAGTAATAAAAAGTATTTCTCATATATGTATGTAGGTACTTGTGGGTAATGCTAGGTATCGTCATCATCAGTGTATATAAA 
AACAATGCCTTTAAATCGGATCTCTTTTTGGTTTTGCTTGTTCTTGAGTGGCCTACAATGTCCTTTTTAACGAATGTCTATAAAAGTATGTAGTT 
TTCCAAGGCTCCGAGGCTGAACTTTCGGTTAGTTTTTCTCTGGGCTAGGCTAAGCAAATATACAAAAGCTACTTAAATTAGTTAGCATGTACAGT 
ACCGGGCGTACTCGTAGGTATATATCGTATTATCGTATGAGTGTGTGTATATATAATTAAATGGTTCCATTATTCG 

(SEQ ID WO: 703) 

Exon: 1001.. 1748 
Exon: 3983.. 4107 
Exon: 4171.. 4504 
Exon: 4571.. 5916 
Start ATG: 1412 

Transcript No. : CT19764 

CAACAGTTAACAGCAGCAAACGAGAAATCAGCTCGCAGCGCAGAAAAATTAAAAAATATCGCCGTAAAAGTGAGCCCAACCGACCCAAAACATAT 
GAAAAGTGTGAATTAAAGGACAATAAAGATATAAATCATTAATTAAGCTTTGAGCAAGCGCGAAAAGTACGCGAAATCAACGTTTTGCAAAACGT 
TCGCCATACGCGTCGCCATTGGTGTGAGCGATAGCTTGCCTGTGTGTGCGTGTGGTATTGGAGGAGGTGAGAGCGCCTGTGTGTGTGTGTGTTTG 
CGTGCGTGTGTGAGAGCGAGCGAGCGGCCCCTATGAAAAGCAACAGCAACAACGCCAGC7AAAGGACCCGCAACAACAACAACAACAACAGCAAGA 
GTAACAAGCACAGCGTTTTAATATCCAAATAATGAGTGTCGCTGCAATGACTATGGACGATCAAAGACCCTGTATGAACAGTTACGACAAAATGC 
CCCCGACCAAATACGAGCAAAACCTAAATATCCTAAACAGCAGCCAAAACAGTGGGGCCACTGGCGGCCCCGCCTCCCCAACGCCCTCCGGCTTG 
GAGGACCACCATCATCATCACCATCCGCATCCGCACCATCATCACCATCAGGAGCAGCAGCAGCAGCAGCAGCAACAGCAACACTTGCAGCAGCA 
GCAACAACAGCAGCAGCAGCAACATGCAGCAGCTGTTGCAGAGGCTGTTGCAGCCGCCGAACAGCGACAGCGCTTGCTCGAAGATGAGATCGAGA 
ATCTCAAGCTGGAACAGGTCCGCATGGCTCAGCAATGCGCGGACGCCCAGCGACGCGAGAAGATCCTCATGCGACGGCTGGCCAACAAGGAGCAA 
GAATTCCAGGACTATGTGAGCCAAATTGCCGAGTACAAGGCCCAACAGGCGCCAACTGCCCTCGCATTGCGCACTGCTCTGCTCGATCCTGCGGT 
CAATCTCCTGTTCGAGCGGCTCAAGAAGGAACTTAAGGCCACCAAGGCCAAGCTGGAGGAGACCCAGAACGAGCTGTCCGCATGGAAGTTCACGC 
CGGACTCTAATACCGGCAAACGCCTAATGGCCAAGTGCCGACTGCTCTATCAGGAGAACGAGGAGCTGGGCAAGATGACCTCCAACGGCAGACTG 
GCCAAGCTGGAGACCGAACTGGCCATGCAGAAGAGCTTCAGCGAAGAGGTCAAGAAATCGCAGTCGGAGTTGGACGACTTCCTGCAGGAGCTGGA 
CGAGGATGTGGAGGGCATGCAGAGCACCATTCTGTTTTTGCAGCAAGAACTAAAGACCACACGCGATCGCATACAAACGCTTGAGAAGGAGAACG 
CCCAGTTGAAACAGGCCATCAAGGATGAGGTTGTGGCTCCAGCGGCTGCCACCAACGGTGGCACCAACACGACGATAAATAAGCTAGAGACCATT 
CACGAGGACGCTTGCATGGCGAATAACCCAACGAATCCGGATTGTTACAATGGGAATACCAACAACGAACAGATCGCGGCCGTGCCACAGATACC 
ACTGTCGGATGATGGCAGCAATATGAACGGAAATGCAGCGCGATTAGCCCGCAAACGAAACTACCAAGAGGAGGAAGCCCTTCCAACGGTAGTGG 
TGGTGCCGACTCCCACCCCAGTAGGAAATAACGTCCAGGAAGCGCCGCCCATCAGGGAAGTTACCGCCCCACGTACTCTGCCGCCCAAAAAGTCC 
AAGCTGCGAGGCATTACCACGCGTCGCAACTCCCAGCTGGAGGAGGATCATCAGCCAGTGACAACACCAGTGGCCGTCCCCATGATCGTGGACAA 
CGCTGTGGCCGGAATGGCGAGCGAAGAGGCTGCTGCCGCCGCTGCGGCTGTTAATAACAGCAATACGGGAATAATCCCGGAAACGGGAGTGCAAG 
TGGGTGTCCCAGTTGAGGGCGGCGATCCAGGTGCTCCGGCGGCACCGGGCCGCATCCTGACCCGTCGTCGGTCCGTCCGCATGCAGCAGAACGGC 
AGCGGAGCAGTCGACTACTCCACCTAAGCGCTGGCGAACCTTTTTCCAAAAAAGGGTGCACTCCATATGAGTTCGTCAAATTGATCGTTAATTCC 
TGACGTTTACTTTTGTGAATTGAGATTTCAAAACAGAGAGAGACTTCATTTTATATTTCTAACCTATCTTTTAGATTCTTTAATTTTTTTTACAC 
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ATTGTCTTGTAAAATACGTTTATTATTTCGTTTGCATGCGGCGTAGACATACATAATTCGCAAATTATTTAGAAGTCACTGCTTTACATACGAAA 
TGCATTATGTAGAACAGTGTAAAACTTCAGCACATA'TATTTACAATAATACGATCGAGCTAAATGATAATGCTAAAAACTGATACACTATTTATG 
TTTTTTAACCACGGCATCTACTGAAAAGCTGAAAAGAAAGCTTGGTCAAAACAAAACTTGTCATTTAGCAAAATTAGGTTTTAATTATCAACTTA 
AGGCATTTTTGTAATACTATGAAAGTTGTGTAAAATAATATAAACTTGTGTTAAAATAAATTGTTAAGCATAACAAATATTTC 

(SEQ ID NO: 704) 
Start ATG: 412 

MSVAAMTMDDQRPCMNSYDKMPPTKYEQNLNILNSSQKSGATGGPASPTPSGLEDHHHHHHPHPHHHHHQEQQQQQQQQQHLQQQQQQQQQQHAA 
AVAEAVAAAEQRQRLLEDEIENLKLEQVRMAQQCADAQRREKILMRRLANKEQEFQDYVSQIAEYKAQQAPTALALRTALLDPAVWLLFERLKKE 
LKATKAKLEETQNELSAWKFTPDSNTGKRLMAKCRLLYQENEELGKMTSNGRLAKLETELAMQKSFSEEVKKSQSELDDFLQELDEDVEGMQSTI 
LFLQQELKTTRDRIQTLEKENAQLKQAIKDEVVAPAAATNGGTNTTINKLETIHEDACMANNPTNPDCYNGNTNNEQIAAVPQIPLSDDGSNMNG 
N AARL ARKRN Y QEE E AL P T VW V PT P TP VGNN V QE AP PIREVTAPRTLPP KKS KLRG I T T RRN SQLEEDHQPVTTP V AV PM I V DN A V AGMAS E E A 
AAAAAAVNNSNTGIIPETGVQVGVPVEGGDPGAPAAPGRI LTRRRSVRMQQNGSGAVDYST* 
(SEQ ID NO: 705) 

Classification: known_f lybase_gene 
Gene Symbol: fl(2)d 
FlyBase ID: FBgn0000662 



Celera Sequence No. : 142000013384544 

TTTTCTCGGATATAGGCAAAATGGAACAATTTAGCCATTTCTAGCCAATGACACGCCCCTTCACCTCGACGTTTCTACATCTGTGCAAAGTCATA 

AAAGTTAAACGATTGCCAGAGCTATTAAATATTTCAGACACATTTTGTGGGCAAAAATTGTAACAGACTTCAAGCGAACGAAATAAACCGAAACC 

CAACCCAAATTTAAGCAAAAACCTAAAATGTCTTTTGGAAATGTTAAGCGCGAAATTGGCAAACTTTATGTGCCTGAATTTTGGACTGCAATTTG 

CGTTTAATTTGGGGTCACACACATTTCGGGCGAACCCCGGGCAATGTTAAGTTATGTAAAGGCAAATTTAACAATGTGCCAAGGGACGGAGCTGA 

GGGTAATCAATGTGAAGCAGTTTTAGTCTCCGAAGGCATTTTAATTGCCACAGAGCAACTTGCAACATTGCAACCTGCCCCGCGGCCAAGTTAAA 

TGACACATGTGTCTGCCCCGGCCATTTTTCTCTTTCTCCCCCAATATCTTATCTCTCTCTCTCTCTCTGTCTGTCTCTGTATATCTTGGCCTGTT 

TGTTGCTGCTGCTGCCGATGCTGTTGAAAAGACTTTCACTCTGGCCGAGCGCTATAGATGAAATGGCATTTGACTTCATTTTTGGCCAAAAATAA 

AAC C G AACC AT T AACT AAT T AGC T T AC AT T T T G AACG AT T T AT T G AGC T CT AG AC T T AAGCG AAG C T C GT C T AACT AAAAAT AC TACT T AAAT GC 

TATGGTTTGCTCTCTCTT CAT GAG AGC AGGC AAT G AT T AAT T T T AAAGG AAGG C AC AAAAGT C GAT GT T T C AGT TT T AAC AAAT ATT GTG G AAT A 

TATTGAGCTCTAGTTAAGTTATATTTACTGAAATAACTCCTCCTTTTATTTTATGGATTTTGGTTTTTAATAAATATCTAGAAACACTCAGAAAA 

CGTTCTCACTTGTCCTAATTATTTCTTTTATATTAACCATATCTGATCACTCACCTTCTGGCCTGCCTGGGAATGGACTGCGTCCAGCACACGGG 

TGCCTTGTGTAGTTGATGCATATTTCGGGAGGACTTGTAGGTGAGATCGCCGCACTCCGTCTGCCACACCTGGAGATTCCCCTCCGCCGGATAGA 

AGGT AT AC AGGC T GT GG GT GGC C GAG GC G AT G AC CG GCG AGGAGC C AC C GC CC T CGACC AC AT CGACC AC GCCCGCTGCC AC C AGT GG AG AT AGG 

GTGGCACTAGAAATCGGTTGGACGAGTAGGGCCTGTCCGGCTGCCGGTGCTCCGGCTGCAGTGCCTCCGCCAGGCGACTTGTAGTTCAGCATGCG 

GGTGGCGGTGGCACCGGCATTGCTGCTGCTTATATTGCCATTTCCGTCGTGCTTCAGGATCAGGCTACTTGGATCGTAGCCGTGGCGATTGTGCC 

GCGAAATGGCGTCCACCGAATGGGATCTAATGCCGGTTCGAGGTGGATTCACGTGGAGCCGACGTCCGCGCCGGTAGTTAGCAGTGCCAGTTCCG 

GTTCCCAGGCTACTATTGCCCAATCCCGAAGATCCCTGCTGCATCGACGCGGTCAACCGGTGATTGTGGACACTGGAGGCGGCACTTGGAACGAG 

GGGCTGTCGTTGTTTTTGGTACTGGAGGCGTGGCTGTTGCGGGGGTGCCTGATGCCTTAGTTTCCCCTGATCCTTGGCAAAGAACTCCCGCCTCT 

GCAGCTGGGGGGCGGCCAAAGTGAGTGCGCCCCTGACGGAGTTCACCATTGAGGGGCTGGATGCATTTGCTCCGGACGTCGTCTTAGTGGCGCCC 

CCAGTAGCCGCCGAACTAAGCTGCGCGAGCTTACTGGCATTGTTGTTGCTGACCGTATTTGCTGCATTGCTGTTGTTGCAAGTGCTGGTGCTAGT 

GGTTCCACTTGTCTGATGCTCCGAATCGGTGGAAACCCGATCCGGATCACTTTCGAAACCTTCGTCCAGGGATCGAGTTGGACTCAGAATGGGCT 

TCGAGTGCTGCTGCTCCAGTTTCGTAGACATGGCTGCTGGCCACTAAGGGGTATTATTTGTTGTTGTTTCTTTTGTGTTTTTTTATTATTTTTTC 

TACAGCTTTAGCCCGGGTATTGGGTATCGTTACACTTGCTGCACATATGCACCTTATTTCTCTCTTTCTCTGCCGCCTGCCATGCACACTTGTTT 

TTCACATTTAATTTCGCTTGGCTTTCAGCTCGTTCACTTCGCGAAAAGTCGCGCCAAAGTTGCAGTTTGCAGTTGGTTTGGATTTTGGTTTTGGT 

TCGAATAAAAAATAGCTACTGCGTTATACACGTCTGTCTCGCACGAAAGCTTTTTATCGACTGTGGAGTGCGAGTCGGAAACGGAATTAACTTGG 

TTCGCTCTGTTATCGCCATGTTAAGTAACATTCGTTAACATTTACCAACGGAAAGAGCAAAAGCTTCGATCGACACTTGTTGCGGTGCACCTGTT 

ACATTTAAAAAGTGCCTGAGAGAGAGAGAGAGTAACATCGCTCAGCTGTTTGCCAAATGTTATTCGCGAAACTTCCAAGTGAGAAAGATCATCAC 

AGAAT AAC AAAGTAAAATGAAGAAAGACAGGAAAAT AT AAACACGGAAGTGTCGAGACATCATCCT AAAAAT GCTTTTTCCTGTTCAC ATT TGCT 

TTGCACGAAAATGTTATCAATTTACAATATTTTTTAAGTGTTTTATCAAAGCTGTTGTTTTAGTAAATACGATTGTGAAAATCTTTTAATTGACA 

CTACACATTCAATATCTCCCAGATATATATCTATATTTTTCCGATATTCGGTAAAACTTTAAAACCACCATGCCATATATTTTAGAAGCGACTGT 

GATTTTGTTTTCGGCTCCACTGGGATTTATCAATGAAAATACCAACTCCAAAGCAAGAGCAAAAACAACAACCGGCACAAGCCGAGAGAGGGAGC 

GAAGCAAACAAAAACAAAACGAAGGCTTGAAAAGCTCGCTGTGCTGAGAGCAAAAGAGCAAGGAGCTTTCGACACCTGCTGCACCTGTACAACAA 

CAGCATAAATATTTGTGGGAGGGTGGCGATAGGAAGTATTTTGCTTATTTTGTTGCTTAGATTCGATTGAATTTTGAGCAATGTCTTTTGTAATT 

TCCTTTTTATATATTTTTTTAATTTTTGCAATATGCCTAACTGACTGTATGAACCATATGAATTAACTCCTATTGAATATATTTTAAAAACTTAC 

GCCTATTGACTATATATTGTTCGAAAAATTATTGTAATAATCACTTGGCGGCACAGATCCTTCCGCAATCCTAATCCAATTCATGGA 

(SEQ ID NO: 706) 
Exon: 2317. .1001 

Start ATG: 2026 (Reverse strand: CAT) 
Transcript No. : CT19790 

ACAGACGTGTATAACGCAGTAGCTATTTTTTATTCGAACCAAAACCAAAATCCAAACCAACTGCAAACTGCAACTTTGGCGCGACTTTTCGCGAA 
GTGAACGAGCTGAAAGCCAAGCGAAATTAAATGTGAAAAACAAGTGTGCATGGCAGGCGGCAGAGAAAGAGAGAAATAAGGTGCATATGTGCAGC 
AAGTGTAACGATACCCAATACCCGGGCTAAAGCTGTAGAAAAAATAATAAAAAAACACAAAAGAAACAACAACAAATAATACCCCTTAGTGGCCA 
GCAGCCATGTCTACGAAACTGGAGCAGCAGCACTCGAAGCCCATTCTGAGTCCAACTCGATCCCTGGACGAAGGTTTCGAAAGTGATCCGGATCG 
GGTTTCCACCGATTCGGAGCATCAGACAAGTGGAACCACTAGCACCAGCACTTGCAACAACAGCAATGCAGCAAATACGGTCAGCAACAACAATG 
CCAGTAAGCTCGCGCAGCTTAGTTCGGCGGCTACTGGGGGCGCCACTAAGACGACGTCCGGAGCAAATGCATCCAGCCCCTCAATGGTGAACTCC 
GTCAGGGGCGCACTCACTTTGGCCGCCCCCCAGCTGCAGAGGCGGGAGTTCTTTGCCAAGGATCAGGGGAAACTAAGGCATCAGGCACCCCCGCA 
ACAGCCACGCCTCCAGTACCAAAAACAACGACAGCCCCTCGTTCCAAGTGCCGCCTCCAGTGTCCACAATCACCGGTTGACCGCGTCGATGCAGC 
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AGGGATCTTCGGGATTGGGCAATAGTAGCCTGGGAACCGGAACTGGCACTGCTAACTACCGGCGCGGACGTCGGCTCCACGTGAATCCACCTCGA 
ACCGGCATTAGATCCCATTCGGTGGACGCCATTTCGCGGCACAATCGCCACGGCTACGATCCAAGTAGCCTGATCCTGAAGCACGACGGAAATGG 
CAATATAAGCAGCAGCAATGCCGGTGCCACCGCCACCCGCATGCTGAACTACAAGTCGCCTGGCGGAGGCACTGCAGCCGGAGCACCGGCAGCCG 
GACAGGCCCTACTCGTCCAACCGATTTCTAGTGCCACCCTATCTCCACTGGTGGCAGCGGGCGTGGTCGATGTGGTCGAGGGCGGTGGCTCCTCG 
CCGGTCATCGCCTCGGCCACCCACAGCCTGTATACCTTCTATCCGGCGGAGGGGAATCTCCAGGTGTGGCAGACGGAGTGCGGCGATCTCACCTA 
CAAGTCCTCCCGAAATATGCATCAACTACACAAGGCACCCGTGTGCTGGACGCAGTCCATTCCCAGGCAGGCCAGAAGGTGA 

(SEQ ID NO: 707) 

Start ATG: 292 {Reverse strand: CAT) 

MSTKLEQQHSKPILSPTRSLDEGFESDPDRVSTDSEHQTSGTTSTSTCNNSNAANTVSNNNASKLAQLSSAATGGATKTTSGANASSPSMVNSVR 

GALTLAAPQLQRREFFAKDQGKLRHQAPPQQPRLQYQKQRQPLVPSAASSVHNHRLTASMQQGSSGLGNSSLGTGTGTANYRRGRRLHVNPPRTG 

IRSHSVDAISRHNRHGYDPSSLILKHDGNGNISSSNAGATATRMLNYKSPGGGTAAGAPAAGQALLVQPISSATLSPLVAAGVVDVVEGGGSSPV 

IASATHSLYTFYPAEGNLQVWQTECGDLTYKSSRNMHQLHKAPVCWTQSIPRQARR* 

(SEQ ID NO: 708) 



Celera Sequence No. : 142000013384512 

AATATGCATGTTCTTGAATCCCACAACGTGACTATTTGTTCATTGACCCCAGGGCGTATTTGTGGGTCAATTCGTTTGCGGAAGCCACCCGTGGC 
CCCACCGCAACCCCTTAAGAGCCGATCCTATAACCGGCAAAACTGATGCCGGGTTCAGTGCGCCGGGTTTTAGAGCTGAACAAAGTGCAGTCATG 
AAGAAACCCAGCAGCGGGCTTCAAAATTTTTGGTAATTCGGCATAAAAATGTCGAGCATGCCAGCAAAACAGAACGCCAATTGCCGGGATCAGCA 
AATTTAGCCAATGGAAATTCGGATCAGAACCGATCGACCAGGAAACCTGATGCGCTGGCAGCGGCCATTGCACCGTGCTTATAAGAGCTGGATGG 
CTAACTCTCAGGTTTTAGTTCTGGAATAGTACGTGTCGCGGTTGGTAAATCGAGGTACTTCGATACCCCGAGGAATCAGTTAAAATCATACATAA 
TTGTTGACCACAATTGCCTCGGGAACTTCGATCGCGTCCTGAATTTATATCGAGTTATCTAAGGAAATAGTAGCCGTGATTTTACCCAAGAATTT 
TTCACATATCACAGTTGCTGTTTCTTTTAGATAGCTCTTTTGTAAACAGCCTAGATCCGAATGCGAGTATCCGAGTGCCTTGCGTGAATCCTGGA 

T GAAAAGGT C CT T C GG T T T C AGT T T AT GGGAT AC AC AC AGT T C AGT T T T GT CAC ACT T C AAGC AT C AC T G GGCT TT GTT T AT C T C AT G AGAGC G A 
AATTTATTCAACAGGAAACCTTAAATCATACCAATATATAAAAAAATCTGTATATAAAACATTTGAATTTCTTGGATCAAGTTTTTAAGCTTAAT 

CAGAAAAGTAAGTACAAAAGTGAWAAAGTAGGTTGGTGGATAAATAT 

TTTTCATTGAAATCAATCTTTCATTTCCCAGAAAAACAAAGTACACCCAATTTAATTAGATACCAAGAATAAGGGCTTACTTTATTGGTTTGACC 

AGGTTGGGGAAGCACCACCACAATTTAAAACTTAGGGAACTACAATTTACAGTGTAGAAATGCATGTGGAATATACATATATAAAAACAAAATAA 

GATGAGTTGCTTAAGAGAGCATTTTGCGAATCATGTAGTTGAGGATTCCACCATTCTTGTAGTAGGTGATGTCCACCTCAGTGTCGAAGCGCAGA 

ATAGTCTCAAAGACGGTGCCATCAGCCTTGAAAAAAGATAGTTCATTAATATAACCAAAATCGCTAGCACCAGGTTTCCAATAGCCTACCTCCAC 

TTGGATCTTCTGGCCCGGCTTTAAACCGCTCTCTGGCAGGGCGATGTTGTACACCTCCCGGCCAGTCAGGTTCAATGTCTCGGCACTCTGTCCCG 

GCAGGAATTGCAGCGGAATGATGCCCATGCCCACCAGGTTGGAACGATGAATGCGCTCGTAGGACTCCGCGATCACCGCCTTGACGCCCAGCAGG 

AAGGGACCCTTGGCGGCCCAATCCCGAGAGCTTCCGCTGCCGTAGTCCTTGCCTACAACGAGGACCAGAGGAGTACCCTCCTCGCGGTACCGCTC 

GGCGGCATCGAAGATATCCAGCTCTTCCTGGCTGGGAATGTGCACGGTGCGGGGTCCTGTCTTTTCGACCAGTTTGTTGACCAACCGAATGTTGG 

CGAAGGTGCCGCGCGACATGATGGCATCGTTGCCACGACGCGATCCGTACGAGTTGAAGTCACGGGGTGTGATGTTCCGCTCGGACAAGAATCGG 

GCAGCAGGCGAGGTTCTAGCAATAGATCCAGCCGGCGAGATGTGATCCGTGGTGACGGAATCGCCTAGGAAGAGCAGGCAACGTGCCTTCTGAAT 

GCTCTGCAGCTTGGGAAGATCTCTGGTCATGCCCTCGAAGAAGGGTGGTCGCTTAATGTAGGTGGAATCGGCACTCCAGGAGAAGAGCTTGCCCT 

CGGACACCTGCAGAGTCTGCCAATCCTGGGAGCCCAGCTCAATTTTGCCTGCAGATGGAAGTGAAATTAAACCAAACTCAAATAAAGATGGCGCA 

CGAAGTACTCACTGTACACCTCCTGGAACATGGCGGGGATTACATGCTTGTTCTCCACTTCCTGTATCTCCGATCGCGTTGGCCAGATGTCCTGC 

AAGAACACATTCTTGCCATTGGCATCCACTCCCAGAGGTTCCTTCTCGAAGTCGATGTCCACGCGTCCTGCGATGGCATATGCGATCACCAACAG 

AGGACTGGCCAGATAGTTGGCCCTGGTGTTGGGATGGATGCGACCCTCGAAGTTTCGGTTTCCGGACAAAACTCCAGCACACACGAGGCCGTTCT 

TCTCGATGGTGTTCACCACATTCTCCTCGAGAGGACCGGAGTTACCAATGCAGGTCATGCATCCATAGCCCACAATATCGAAGCCCAACTTCTCC 

AGATAGGGAATGACTCCGGACTCTTTCAGGTAGTAGGTAACCACACCGGATCCAGGCGACAGCGAGGTCTTGATATAGGGCAGGATGCTCAGACC 

CTTCTCAACGGCCTTTTTGGCCAGCAGACCGGCACCCAGCATCACCGAAGGATTCGATGTGTTCGTGCAGGATGTGATGGCCGCAATGACAACAG 

ATCCATGATGCAGTTTGTAGGTCTTGCCATCATCCCACTGGAACTCCCCAAAAGCGGATTGAGCTTCCGGAGCAATGGCGAATCCCTTGAAACCC 

ACCTAAGGTAATGGTGGAGCAAAACAATGTTTAATTTAATTCCCTTTTTCTCAATGAATTTGAAACTTACTGGGCTGGACAGGCACGACTTAAAA 

TCCTCAGGCATGTCGGAGACAGAAACGCGATCGTGAGGACGCTTGGGTCCCGAAACGGAGGTGACCACCGTCGACAGATCCAAGGTGATACTCTG 

TAGAAGGGATGTATGTTAGCTTAAGTTTTTAGTGGTATATGTAAGGCAATTTCAACCTGAGTAAACTTGGGATCTTGGGCCGCATCAGCATAGTT 

GCGCAGCTGCTGAGTTGCCTTCAGATACTGGCGAATGATGTCGATCTTCTTTTCAGACCGATCTAAAAAGTTTGTTATGATATAGGTAAAATATT 

GTTTTATGAAACAGTCTGGTTTTCCGTACTTGTCTGCTTCATGTAACCAAGAGTATTCTCATCGATGGGGAAGTATCCCACGGTGGCTCCGTATT 

CGGGACACATATTGCTGATGGTGGCCCTGTCAGCAATGCTGAGTTCCGCAACTCCCGGGCCGTAAAACTCCACAAACTTTCCCACCACACCCAAC 

TGGCGCAGGTGCTTGGTAATGGTCAGCACCAGATCGGTGGATGTGACCAGTGGGCTCAGTTTGCCCTCCAACTTGTAGCCAATTACCTCGGGCAA 

CAACATGGAAATCGATTGGCCCAACATCACAGCTTCCGCTTCGATGCCTCCCACTCCCCAACCAAGTACGCCCAATCCATTTATCATCGTGGTAT 

GGGAATCAGTGCCCACCACACTGTCGGGATACAGAATCTTGGAGCCATCGGTTGCGTCATTCTCGAACACCACGCGGGCCAAGTACTCCAGATTC 

ACCTGGTGGACAATACCGGAGCCAGGTGGCACGATCAGCATGTTGTTGAAGGCCTTGGCTCCCCACTAATTAGGAAAAACAATATTATTAAAAAT 

ACTATCTGGATAATAAGCTAAGGAAAAGTAACCTTCAGGAAGGTGAAGCGCTCCTTGTTGCGCTCAAATTCCAGTGACTGGTTCTTGGCCAAAGC 

ATCAGGCGCACGGGCGAAGTCCACCTGGACCGAGTGATCGATGACCAAGTCGGCGGGGCAGATGGGATTGATTTTCTCCGGGTCGCCGCCCAGAT 

CCAAAACAGCATCGCGCATAGCTGCAAAGTCCACCACAGCAGGAACTCCAGTAAAGTCCTAGTAGCAAGATATTATATAATTTAAATATTACTTT 

GCTAGATTGCCGTATGAAACTCACCTGCAGGATGACACGGGCCGGCTTGAAGGACACCTCCACATCGTTGGATCCCTGCTTGAGGGCGGGCGACC 

ATCCCAGGATGCTCTGCACGTCCTTCTCTAGGATATGGAAGTTGTCGCAGTTGCGAACCGCCGACTCCAAGAGCACTCGGATCGAGTAGGGCAGT 

TGATCTACGCGTTGGAAATGTTAAGTGTTAATCAAAGTGTTTAGAATTTCATCACAGGTACAGTAAAGCCTCTTGGTGAGAGGACCTAAGACAAT 

TTGCCGCACTAAACGTATGTATCTTTAAAAAACCTTTAAGTTGCACTTGTTGAATTAAAATTCACGAGTAGAGTAAAGAACCTCCCTTTTTCTAT 

AACCGTCCACCTTAGTTAGTTTAAAGTGTGCTAGTCAGATAGCAGTCACTTCTTATCACAGGCACAATGATTACGTTTGTCCAGCACTTAATGCG 

ATTCACAGGCGTCACTATATGGGAGGAGCCACCAAAGAAGCCCTCACCTCAGAACCCCGATTTCTGGACACGCGCGGTTCCAGTAAAAACATTCC 

ACTGATCTTGAGATTTTTCGGAGTCATCGGACTACGACCCCAAATCAGCAGATCTGCGAAGTTTTAATTCGCTGATAAGTGAGCACGTAAAATTG 

GGAAATGGAAATCAAATTTCGGAGGCACACGATTCGTTATCACCGCTAGACGAGCCCCATTATGAGGCAATCTGGCGGTGATAACTTTCGCTTTC 

GGCGCAAGACAAGCCCGCCCATAGAAAACTATAGAAAATCCACTTAATGCTCCGGGAGCACCTTCTACTTGACATTGCCTGACTTGCAAATTGAG 

CTGTTCTTCTACGCTTCTTTTTCAATTCAGGGGTCAAACTCCAACTTCAACAGGTGCTTAATTAATCCCATTTTTGATATGCATGTTTAGCCTAG 
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TTTATTGCGACCACGTAGACAGACACGCCCACTTTTAAAGCCTTCTTAAGACCACATCCACTTACCATATTTACTATCGATGGAGGCGAGGTCAA 
AGTACTTGTATGTTGTGCCGGCCTGGGAGAAAGTTTTCTCGAACTGGGCGAAGGGATTGGCGCCTGCGACGATACGAATGTGTTATATAAGCCAC 
AGGATTCGGGGGACAGGGTATATGGTAGTCGAGGGAACTGCAAAGCTCCGGACTGGCAACCCACCTGACATCTTATTGGAACTCTTTACTGAACT 
TATTCCGGACGAGCAAAGCAAGAAACTGATTGGAGAAGCAGTTCGGATAGGAGCTGGCCAGCCTTCGCTGTCTATCGATAGCCAGCGATAGTTGC 
GTATATCGCCAGTCGGATCGACAGTGCATTTTATACATTATATATGGTGCATTATATTTTATAAAACGATAGCTTTATGTTCACTTCTCTTTTTC 
TAAAATGAGATTATAGTGTTCTGCGCTTGGTATGTTATTATCATATGTATTAAATGCAGTAATAACTATTCGGATATTTATGTAATACACCATCT 
TTTGAAAAGCCTGAAATTTGTGCAATTATCAGTAAACTAAATCGACGAAAATAATTGAAAGTTGTATAACATATGGTTCATTTTTAAAATGAAAA 
CTCCACCCGCCAAACCTATCAGTAATCGACAGAGCTCGTGCAATCGCAACATCATATTGCACCGATATGTCGAGGGAAAATTGAAATCGTTATCG 
AAAATACGAGGCGATAGCACGCGCACGTTGTTTTCTTTGAATATTAGGAAAATATAATATGAATTTACCAGCGAAAATGGACGTGGAAAAGGATA 
AGCTTCGCCAGATGCACTGCGAATTCAATCCCTTGTCGCGTTGCGACGGATCCGTGATGTACAGCCAGGGTATAGCCGTGTTTTCAACTCTAGTA 
CATGGCCAGAACCGGCTAACATGGTCTTGCCAATTTCAGGTGCTACAGGTCTCATAGGCGCAGTTCTCGGCCCCATCGAGGTGAAGACCCAAAAC 
CTGAGCATAGATGGCAGCTACTTGGAGTGTAACTACCGACCAAAAGCGGGCCTGCCGCAGGTTACCGATCGGATTCGGGAGGCGGCCATTCGGGA 
TGTGCTGGAACTGGCCCTCCTGTCGGAGGCTCATCCGCGCTCCAAGATGTCCGTGCAGATCCAGGAGCTGGAAGATCGTGGCAGTGTATGTTTTT 
GAATCTCCTTCATCTGTAATCCTTCTAACAATATATATATTCTATCAGATTGATGCCTGCGCCGTAAATTGCGCCTGCCTAGCCATGCTCAT 

(SEQ ID NO: 709) 

Exon: 5267. .5195 
Exon: 5098.. 5006 
Exon: 4184.. 4015 
Exon: 3953.. 3738 
Exon: 3675.. 3165 
Exon: 3102.. 3002 
Exon: 2942.. 2826 
Exon: 2757.. 2103 
Exon: 2043.. 1325 
Exon: 1261.. 1001 

Start ATG: 5201 {Reverse strand: CAT) 
Transcript No. : CT19806 

ACTGCTTCTCCAATCAGTTTCTTGCTTTGCTCGTCCGGAATAAGTTCAGTAAAGAGTTCCAATAAGATGTCAGGCGCCAATCCCTTCGCCCAGTT 
CGAGAAAACTTTCTCCCAGGCCGGCACAACATACAAGTACTTTGACCTCGCCTCCATCGATAGTAAATATGATCAACTGCCCTACTCGATCCGAG 

T GC T CT T GG AGT C GG C GGT T C GC AAC T GC G AC AAC T T C CAT AT C C T AGAG AAGGAC GT G C AG AGC AT CCTGGGATGGTCGCCCGCCCT C AAGC AG 
GGATCCAACGATGTGGAGGTGTCCTTCAAGCCGGCCCGTGTCATCCTGCAGGACTTTACTGGAGTTCCTGCTGTGGTGGACTTTGCAGCTATGCG 
CGATGCTGTTTTGGATCTGGGCGGCGACCCGGAGAAAATCAATCCCATCTGCCCCGCCGACTTGGTCATCGATCACTCGGTCCAGGTGGACTTCG 
CCCGTGCGCCTGATGCTTTGGCCAAGAACCAGTCACTGGAATTTGAGCGCAACAAGGAGCGCTTCACCTTCCTGAAGTGGGGAGCCAAGGCCTTC 
AACAACATGCTGATCGTGCCACCTGGCTCCGGTATTGTCCACCAGGTGAATCTGGAGTACTTGGCCCGCGTGGTGTTCGAGAATGACGCAACCGA 
TGGCTCCAAGATTCTGTATCCCGACAGTGTGGTGGGCACTGATTCCCATACCACGATGATAAATGGATTGGGCGTACTTGGTTGGGGAGTGGGAG 
GCATCGAAGCGGAAGCTGTGATGTTGGGCCAATCGATTTCCATGTTGTTGCCCGAGGTAATTGGCTACAAGTTGGAGGGCAAACTGAGCCCACTG 
GTCACATCCACCGATCTGGTGCTGACCATTACCAAGCACCTGCGCCAGTTGGGTGTGGTGGGAAAGTTTGTGGAGTTTTACGGCCCGGGAGTTGC 
GGAACTCAGCATTGCTGACAGGGCCACCATCAGCAATATGTGTCCCGAATACGGAGCCACCGTGGGATACTTCCCCATCGATGAGAATACTCTTG 
GTTACATGAAGCAGACAAATCGGTCTGAAAAGAAGATCGACATCATTCGCCAGTATCTGAAGGCAACTCAGCAGCTGCGCAACTATGCTGATGCG 
GCCCAAGATCCCAAGTTTACTCAGAGTATCACCTTGGATCTGTCGACGGTGGTCACCTCCGTTTCGGGACCCAAGCGTCCTCACGATCGCGTTTC 
TGTCTCCGACATGCCTGAGGATTTTAAGTCGTGCCTGTCCAGCCCAGTGGGTTTCAAGGGATTCGCCATTGCTCCGGAAGCTCAATCCGCTTTTG 
GGGAGTTCCAGTGGGATGATGGCAAGACCTACAAACTGCATCATGGATCTGTTGTCATTGCGGCCATCACATCCTGCACGAACACATCGAATCCT 
TCGGTGATGCTGGGTGCCGGTCTGCTGGCCAAAAAGGCCGTTGAGAAGGGTCTGAGCATCCTGCCCTATATCAAGACCTCGCTGTCGCCTGGATC 
CGGTGTGGTTACCTACTACCTGAAAGAGTCCGGAGTCATTCCCTATCTGGAGAAGTTGGGCTTCGATATTGTGGGCTATGGATGCATGACCTGCA 
TTGGTAACTCCGGTCCTCTCGAGGAGAATGTGGTGAACACCATCGAGAAGAACGGCCTCGTGTGTGCTGGAGTTTTGTCCGGAAACCGAAACTTC 
GAGGGTCGCATCCATCCCAACACCAGGGCCAACTATCTGGCCAGTCCTCTGTTGGTGATCGCATATGCCATCGCAGGACGCGTGGACATCGACTT 
CGAGAAGGAACCTCTGGGAGTGGATGCCAATGGCAAGAATGTGTTCTTGCAGGACATCTGGCCAACGCGATCGGAGATACAGGAAGTGGAGAACA 
AGCATGTAATCCCCGCCATGTTCCAGGAGGTGTACAGCAAAATTGAGCTGGGCTCCCAGGATTGGCAGACTCTGCAGGTGTCCGAGGGCAAGCTC 
TTCTCCTGGAGTGCCGATTCCACCTACATTAAGCGACCACCCTTCTTCGAGGGCATGACCAGAGATCTTCCCAAGCTGCAGAGCATTCAGAAGGC 
ACGTTGCCTGCTCTTCCTAGGCGATTCCGTCACCACGGATCACATCTCGCCGGCTGGATCTATTGCTAGAACCTCGCCTGCTGCCCGATTCTTGT 
CCGAGCGGAACATCACACCCCGTGACTTCAACTCGTACGGATCGCGTCGTGGCAACGATGCCATCATGTCGCGCGGCACCTTCGCCAACATTCGG 
TTGGTCAACAAACTGGTCGAAAAGACAGGACCCCGCACCGTGCACATTCCCAGCCAGGAAGAGCTGGATATCTTCGATGCCGCCGAGCGGTACCG 
CGAGGAGGGTACTCCTCTGGTCCTCGTTGTAGGCAAGGACTACGGCAGCGGAAGCTCTCGGGATTGGGCCGCCAAGGGTCCCTTCCTGCTGGGCG 
TCAAGGCGGTGATCGCGGAGTCCTACGAGCGCATTCATCGTTCCAACCTGGTGGGCATGGGCATCATTCCGCTGCAATTCCTGCCGGGACAGAGT 
GCCGAGACATTGAACCTGACTGGCCGGGAGGTGTACAACATCGCCCTGCCAGAGAGCGGTTTAAAGCCGGGCCAGAAGATCCAAGTGGAGGCTGA 
TGGCACCGTCTTTGAGACTATTCTGCGCTTCGACACTGAGGTGGACATCACCTACTACAAGAATGGTGGAATCCTCAACTACATGATTCGCAAAA 

TGCTCTCTTAAGCAACTCATCTTATTTTGTTTTTATATATGTATATTCCACATGCATTTCTACACTGTAAATTGTAGTTCCCTAAGTTTTAAATT 

GTGGTGGTGCTTCCCCAACCTGGTCAAACCAATAAAGTAAGCCCTTATTCTTGGTATCTAATTAAA 

(SEQ ID NO: 710) 

Start ATG: 67 (Reverse strand: CAT) 

MSGANPFAQFEKTFSQAGTTYKYFDLASIDSKYDQLPYSIRVLLESAVRNCDNFHILEKDVQSILGWSPALKQGSNDVEVSFKPARVILQDFTGV 
PAVVDFAAMRDAVLDLGGDPEKINPICPADLVIDHSVQVDFARAPDALAKNQSLEFERNKERFTFLKWGAKAFNNMLIVPPGSGIVHQVNLEYLA 
RVVFENDATDGSKILYPDSVVGTDSHTTMINGLGVLGWGVGGIEAEAVMLGQSISMLLPEVIGYKLEGKLSPLVTSTDLVLTITKHLRQLGVVGK 
FVEFYGPGVAELSIADRATISNMCPEYGATVGYFPIDENTLGYMKQTNRSEKKIDIIRQYLKATQQLRNYADAAQDPKFTQSITLDLSTVVTSVS 
GPKRPHDRVSVSDMPEDFKSCLSSPVGFKGFAIAPEAQSAFGEFQWDDGKTYKLHHGSVVIAAITSCTNTSNPSVMLGAGLLAKKAVEKGLSILP 
YIKTSLSPGSGVVTYYLKESGVIPYLEKLGFDIVGYGCMTCIGNSGPLEENVVNTIEKNGLVCAGVLSGNRNFEGRIHPNTRANYLASPLLVIAY 
AIAGRVDIDFEKEPLGVDANGKNVFLQDIWPTRSEIQEVENKHVIPAMFQEVYSKIELGSQDWQTLQVSEGKLFSWSADSTYIKRPPFFEGMTRD 
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LPKLQSIQKARCLLFLGDSVTTDHISPAGSIARTSPAARFLSERNITPRDFNSYGSRRGNDAIMSRGTFANIRLVNKLVEKTGPRTVHIPSQEEL 
DIFDAAERYREEGTPLVLVVGKDYGSGSSRDWAAKGPFLLGVKAVIAESYERIHRSNLVGMGIIPLQFLPGQSAETLNLTGREVYNIALPESGLK 

PGQKI QVEADGT VFET I LRFDTEVDI T Y YKNGGI LNYMI RKMLS * 
( SEQ ID NO: 711) 

Name: RNA-binding protein 
Classification: RNA_binding 
Gene Symbol: Irp-lB 
FlyBase ID: FBgn0024957 

Celera Sequence No, : 142000013384287 

CTTTTCCATTACCTCGATGATCGTTAGTTGGTTCGGTTCATCCGGTCGCGTGGGCAACGTATTGGTCAACTATCGAGCCGTGGCCCACGAGGACA 
AGTCCTTCGCCCAGGGGCTGGCGCTGATGATGATTAGCCTTCTGGCCCTCATACCGGGACCCATCATATTCGGCCGCCTCATCGACTCCACGTGC 
CTGGTGTGGACAAAGACCTGCAATGGCAATGGCAACTGTCAGTTGTACGACCAGACCCGTTTCCGTTACTCCCTCAACTTTTTATCCTGCCGTAA 
GTTTGATTCAAAATATGTAATAAGATAACATATTCTCTACCTTCTTGCTAACTTCCAGTGCTCACCTTTATGGGACTCCTCTTCGATTACCTCGT 
TTGGTACTACGGACGCAACCTGGACATTTATGGCGACAAGGAGGCCAAGGAGGAGGAGCGTGCCAACAGAAAAGACCAGCCCATTACTCCGCTCC 
TGGCCAAGAAATCCGAACAGGAGTAAAATCAACAAAAAAAAGAAAGTGCAACCCATCCGTGATACTCAATTATAAGGAGTAGATTCATATGAAAC 
ACGCCAGTTCCGAGTATTTCATAACAATAATTCAAAGCACATTCCTAGAGTTAGTTTTTGAAATCAAAATAGAGAAAGATTGGGCAAAGCCTGTA 
CTCCAACGCCATATGTACAAATCTTCCGCTTTTCGATAGGAATTTAGCTATTGAACTCGCGCTGTCAAATGTCTTCTTTCAGTGTACAATTGTTA 
ATTACTTTTACTTTGTGCTAAGATCAATTAAATATACCATGCTTAGTTTAACAACAAAAATGAAAAGAATTTATGTGTACGTATGTATAAAAAAA 
AATGTTTCAAATAAAAATGTATGTTTATTAAAATATTATGAGGAGTCTAAATCATAAATATTTATATTTTATATATATTAATTTTGCCAAAATAT 
TCGATTGCAATCCATAAAACCAGAATATAATTTCTTAGGTTTCATTTTTTTTTTTTTTTTGAAATTTTCTTTTCACTTTTATTGATAAAAAATGT 
TCAAATTTATGCAGATCGTTTGTTTTTCATTCCATACGTATATGTATAATAATAATACACTTTAATTTCGATTTTTGTATAAATATTCATTCCCG 
TTTGTCTATATAATTTCTATTATTACGATCGCTCTTATTCCTCAGTGGCGTAGTTTTCGACTTCCGTTCCGATTTCGCGAGTGATTTGTTGCTTG 
AGTGTCAGTGTTGGGTGTCTGTGTGAAACATTTTGAATGAATTTCCGCGTTGTCAACAATTATATCGCACTCAGTATTATTACAATAATAATTAA 
TTAATTAATTATCATTCGCGTTCGATTCGTAGGAGGCTGCGAGAAGCGTTAAAAGCTGGCTTAAGTCCTTCATTTACTAGGTCCGCAATGTAAGT 
ATTAATCAGTTAATTAAATTAACTTAGACTTCAGGGGACTGCTCAACGTGCTTAATTGATATTATTTTATTTACTTGCGTCTGTTACGGCAACAA 
CATTCAATTTAACAAAAACTGAATTTTCGACAATGGCAGAGGGGAAGACAGATTTTAAAAAATGGCTATGTGAATATTTTGAAACCGCCTTGGTC 
CGGAATTTGAATGAGTGAATAGCGTATATAAACAATAAAGGGGTAAACTTAAATAAAAAGCTTTACTAGTTTGTGTTTTTGTTTCTGCTCGGTAG 
CAAAAGGCATTTTATTTTTTCATATATGGGATTTCGTGTTCAACTCGCACGAATTCTTCAATTAACTAAATTATACAAAATCAAATGTGTTTTAA 
GTAGCAAAATCAAATGGTTTTAAATGATTAACAATTAGGTGTTTCCGTAAATTTGAATTTTCTCCGTTCTGCTTTTACAGACTTTACTCCATGTT 
CTTTAAGCTTAAACTTAAATTAAAGCATATAGATTATATCATTTAAAAAGAAGAGCAGAAATATCTTAATGCATTTTGCAGGGTATTTAACAAAT 
TTGCAATTTAAAGGCGCATACAGGAAGATGGATGGGGAAAGGATTCTCGGGATCGTACTTTCGCTTACAAATACAAAAATTGGGTAGCTTGCAGC 
TTTAGTTTTAGTATTTTAAAGAAATTCACGTATTGCGCGCGATTTAGGCATGTACGTGTAGGCTATTTAATGCTTACAGTTTATACAAAATAGTA 
AACCAAAAGAAAACGACGACTGCGACAAGTGCTCAATATCCTTGCGAGATGGGCTCAACAAACAAAATCGTTATCAATGCAACTCCTCTTCTTCA 
ACTCTTTCTCTCTCTTTGCATATATACGTGTGAGTTGTTTGTTTTGTATACATATATAAATAGTTCATCTTGTTTCGCATTCGCTGCCTCCGATT 
TTGCTGCTTAGTTTTACATAATATAGTTTATAATTCGTATATAATGCATTTCTGTTTCTATCTCCGAACTCGTCCTTGTTGTAAATTAACTTTTC 
GCCTGCCTACATTGCCGTTCCGTTCCGTTTTGAATCGATCTGGATTCTATAATCGATTGCTATTGCTGGTTGAAAGTAATGTGAGTGAGAGCTCG 
AGCGATGCTGAGTACTGTTTTTGTTGATTTGTCAAATGTGAAGAAAGCGTGGCGTGGCGTTGCTGTTGTTGTGGCTGTGGCTGTGTTCTCTCGTC 
GGGCTTGCAATGCTATCTCAAAAATCGCAATGAAAGGCGGGCGCTCGAATTTAGCTAGGCTCAATTTATTCGACCACCTTCAGCACCTTGCCAAT 

GG C AATGG T T T T GT C T G AAAG AT AG AAAT GAT AAT TAG AC GG GG AT C T C C C G AGT C AC C AAT C G AC T T AC T T TC AT C T C T G AGCGT AAAAC GGC C 
CATTTGAGGGAATAGCTTAAATTGTTCAAGGCAAATCATTCCGGAGCACTCGATTCGCATAATTGCAACCTGATCCTGCTTAACGAACCGTGGAC 
GTGTTTTTGATTTGTCACCAGACTTTTTGTCGACCAAACAGATGAGGGCCTATTGAAAAATAATACATATTCAGTATCTGGCTAATGTATGGGAA 
TTATAGACCTAAAACTTACCTTAACTGTGACTTCTTCGGCTGCACAGTGTATGTGCATGACAGCCGAGTAACCCGCACAGATAATTGATTTGTGT 
TCTAAAATTACGACCTGAGCATCAAAGATTTTTCCCGTCTTGATGGGATTGGCTGCATCGCAGAGAACAAAACCGGGCGAGACATCCTCCTCCTC 
AATACCCTAAAAAAAAGTTTAAGAATCAATTAAAATAATCCAAGGGTATAACAAAAGAATCATGTACTCACTTTCAGCTTAATCTTGACGTTCTC 
GCCGGGACCAACAGAAGTGACTTCAAAGTCGTCCGAGAACAACTGATCCACCGCCACTTGCGTCTATGTGAAAAACAAGCTTAGTTTCAAATATT 
CAGCTGATTGCAAAGTGTATAAACTTACCCTATTTGGCATCACTAACAGGTTTTGACCCTTGCGCGCTGTTCCGGACTCAACCTTGCCCATCACC 
ACGGTGCCCATATCCTTGTACTTGTCCACAATTGGCATGATGAAAGGCCCGTCCGACTTGCGGTTGAGCGAGGGCAGCTCGTCAATGAACGGAAT 
GAAGGCGGGTCCCCGGTACCAGGGGCAGAGCGTCTCCGGGATTTGGTCTTTCAGCCCGTATCCGCTGAGGCCCGAGCAAGGCATAAAGGTAAGGT 
CCTTGGCCGGGTTGAAGCCCAGCTTCTTGAGGTATGGTAGTATCTTGTCTTTGCATTCGTTGTAGCGCGTCTGATCCCAATTGACCGTTGGGTCA 
TCCATTTTGTTGACCAGCACGACCAGATGCTTAACGCCGGCGGTCTTGGCCAACATGGCATGTTCTCGGGTCTGGCCGCCGCGATCGAAGCCCGT 
CTCGAATTCACCCTTTCGCGCCGAGATGACCAGCACTGCGAGATCGGCCTGCGCCGCGCCACCGATCATGTTAGGCACGAAGCTCTTGTGACCGG 
GTGCGTCGAGGATAGTGAAATGCTTTCGGTCCGTTTCGAAGAAGGCTCGTCCCACCTCGACGGTCTTGCCCTTGTCGCGTTCCTCCTGGTTGGTG 
TCCAGCGCCCAGGACAAGTACCAACTCTCACGCGACTTCTCACGTGCCTCCCGCTCGTACTTCTCCAGTGTTCGCTTGTCCACCATGCCCGTGAG 
CGACATGATCTGTCCGCCAATTGTTGATTTGCCAGCATCTACCCAAAGCAAAGTGCGAATGTTAATAATGGCTTTTGGTGGGACATATGTTTATA 
AGTAATGATAACTCACCAACATGGCCAATGAACACAACATTCACGTGCTCACGCTTGCTCCTATTTTCCTCCACCTTAACCACCTTCTTCTTGGA 
CACCTTCGGCGTGGCCTCGCCCTCCGTGAACTCAGCATCTTCCACCTCCTCATCCTCGGGCGTGATCACTGCATCGTCCACATCCCAGCTGTCAG 
CAGGATCGGTTTCTAAGCAAATAAAAAAAAACCAAATAAAAGGTCATTAGTCGAGACATTTACAAGGCGATTCTACTGTGCTTGGTGGTGAAAAT 
AATATGGCTTGTTGAGGAATATATAATTGTTTGTTGTTTATTTTCTATCGATTACTAAAATTGTTATCGCGCAAATATATAATTTATTTACATTA 
GACAATCTTGGGAAAACTAGTAATTTTAAAAATTGTTCAAAATAGTGTCGGTTTGTCCAGATCAATCCAGTTAAAAACCGAATAAAGCAATGAAT 
TTTTTTTATTTTTGGAATTGTCCACATCATCGTAATAGTTTAAAGAAACTTTAGTGATGGGAGTAAAATTCACTTTAACTTTCTCTATTTCACAA 
TATCGAAACCTTTGTATTTTGAGCGACATTTAGTAAATGCAAACATGTCTCTTTTTATGCACAAAAATAAAAATGCAAATGTCATGCCTTTTATT 
TTTGCACATGAAACAACTGTGTAAATAGTTTCTTTTTAATTTAGGTCGGTTTGTTAGAAACTTATAGCACTTCAGATGATTTAAACTCAGGGAAG 
TTGAGCTATCACTTCACCAATTATATACTTCGTACTACCACTCATGCACTCGCCGTGTTTGACCTGGCGGCCACATGCTAATTGCTAATCGCTGA 
CTCTAGCCACCCGGGCGGATGAGTCATGTGCGGGATTGGATTGGTGGAGTGCTGTTCTGTGCTGGTGTCAACGTGCTTGGTTAATAGTAGTTCCC 
GAACCTTTGACGTAAATGTCGGAGGATCATTTGACCGTCTGGAGATGGGCTGTAACGAATGGTATCCCAACTCCACGATTACTAACTTTTTCTCC 
ACTGCGAAAACGAGTAAATGTAGCGAGATCCGGCGCCCTTACTCATGATTCCGATGCATACAACTCAATTCTAATCGGATTTCGGAACTAGGAGC 
GGATGTAGATATAGCCGGAGAAACGAGCTTCGATCGCGCCAAAGTGTAAGAGCCAACAGCAAATGCTATTTTTTTTCGGCGCTTTGGCAAGCGCG 
CCGCACAGCCAAACGCCATGTAATCAAGTTCGAAGCGGTTTCAACCGCCGGCGAGCTGCACTAAGCGCTTCCATTCGGGCCAAGCAACTACAAAA 
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GTCGCCACATTGGAGTCGCTGCGATAACGACGAGTGAGCCATTCCCATTCCCATGCCCATTCCATTCCAGTCCACACCGCCAGTCGAAGCTGGGC 
GCAGCGTAGGCTCCAATGAATGGCTGACAAAAGAGTAAACGTGTCCAAAACACAAGAATTGGGGGGCAAGCATTTGCGAGTGTGGGTGCGGAAGA 
AGGGGACAAGGCAATGGCCAAATAAACGACACCACAAAGCGAAGCGGCAGTAAAATGGCAGCACAAACAGCTGCTTGTCAACGTCGCCCGGCGAA 
TGGCAAGTAGGTGAGTGGCCACCAGAGATGCGCCGTGATTGTTAACAGTTTAACAAGTGACAGTGATAAGTCGTTACATCGAGAAATCAACAAAG 
CTAAACAAGAAAGTGTTTTCTAACTATGAAATTTTCAGAATATTAAAATTATGAACAAGTTTATAGAATTGAAAAATACACAAGGCTAGAATATT 
TATATTAGCATATTCGTGTAAAATTTTCATGGCTTAACTCTTAAATATCCCAATTCTTGAAACTCAAAATACTGAACGACGATTAAGAATATAAG 
AATACTATTCGGACGAAAGTTTTCACAAATCCCAAATCCCAAGATCACAGCGAGGCGTGGATTGTTTGATGTTCATGTACATATAGATTCAACAA 
GTATTTTTAAATGTATGTACCTACATTTGTTTAAAGCAAAAATGTCAAAGTGCATGATTAATTCCCAACCATTAGCTGGTTTCTAGAAAACAACT 
AGACCTGATTTGCGGTTATTTCACCATTTGGTGACACCTCTGTAGTGATTCTGGTCGTGATTCCTGGGCATCTCTGCGCTGAGCCACTGCTGGCG 
ACACGACGACGAGAAAAGCGGCCAAAGAAAGGGCCCTGGCCGGACGAACTGTTTTTGCGCGCAGGCCGAAAGCGATTAGCCATTTGTTTAGCGGC 
ACAATAATGCCCGCAATTCGAATAGAAGTCAAGCAACACATGCGACGCCGACGTCGGCAGTGCTGACGCAGATTGAGGTTATGTACTAGGGCCAG 
CGATAATAACCTAAACGGAACCTGGTGAATGCAACCTATCTTTGGGGTTTTCCCTATCCCATTCCCCATTGTTGTGTCTCTGCCAACGGTCTTGC 
GACCAGCAGGTAGCAACAATAACACGCTTCTTTGTTGACGTCACAGAAGAAGCGAGTTCAATGGACGTGGGGCATGTGTATTTGGCAGGGGGTAT 
TTAGGTAGCTAGCAAGCTCTATTTACCATTGTTTGCGATTTTGTCCGAGGGGTTGAGACCATCGATGGGCGTTGGATCCGCCGCCGCTGCTCCTG 
CTGAAGGCGTGGTGGCTGGCGAACCCGGCGCTGGCGATGCGGAATTGGAGGATGATGTGGGCGGCTGTGCTGAGGGTGCTGCTGCCCCCGCCCCC 
TGTGCAGTTGGAGGCGGTGGAGCTGCTCCCGCTGTGGATCCCCCGCTGCCGACGGAATCGGGTGTGGTGGCTGGCGTGGCTGAACCGGAGGCGGG 
ACCCGGATCCGCCGTCTCCTCGGCCGCAGCCGCTGCAACCACTACTGCCGCCGCCTCCTCAGCTACATTCACAACGCTGTTGTAACTGAAACTGG 
GCACAAATTCTACCGCGTTCACGTTCAGCGTTGAGAACTTTGTGCTGATCTCCGTGTTTTCCTGGGCGGCCATCGTTGCAGTGCGTCCGTAATCC 
TCCGCAGTGGTTTCCTCCGATGAAAATCTCTCTCTCGCTTTTTTCACGCGACGACACACTTTCGCCCTCCTTCTACAAACTGCTTCAGCTGCTTC 
TTCTTGCTGCTCCCGCACTCACACAGAACACACGCGGTCGTTCGGAATTCTTGGGATTGCCTATGGTAATGGAAATTGGGAATGGCAAAAGCAAT 
GCCACCAACACACGCACGCCTTGATGGTTGTTGGCCGGGGGGGTGGCTTGTGGGGGGTTTAGGCCAGACTTTCGGCTCAATGTGGTCTACACTTT 
TGTTGTGACTCGGTGTTGTGTTTGTTGTTTGTGCTTTGCTAGCCTGGCCAACTCGGGTTGAACTCGAATTCCCTTTAATTTGATCGACTATCGCC 

T GT T T GG GGCAGT GGT G AC CACC T G GT G AT C T GC AGGT AT G AT T C G AC T AC C T GAACAC T AAAT GC AGT GC T CGG AACT AG C AAT CT T CT AT TAT 
TTTACACGTTTTAAGCCACGCATCAAGTCAGGGTTGCCACTTGTATGCGGTGTTATCGGTGCATCGATATGTGCTGGCTGTCACAGTGGCCTATA 
ATTGATTTGGGGCAAAGTATTAAGATATTTTTAAATTTTTTTAGTAAGATTTTTAATACTTTTAAAATTATTCAACATAAATAATTATATTGCTT 
TTGTACTTTGTTCTGATTAATATAATTTTTTAAGGCTAAATATAACGTAGTCCACATGTACATATATAAATACAAACTGTATTTGTTTTATGTGT 
GACTAGACATTAATAATTTTTTATTCACTTTCAGTGAGTTTTTATAATTATTAATTTAGTTATTTTGTATAATAGAAGTGTAATGCGAGTTTTGT 
AAATATTGATAAGTGTGTAGTGAGCCTTGGGGTTTCAATACATTTCTAAGCTCATCATCAATGGTATCGAATTACAATCGATGTTGTTCCAGCTC 
TAACGTCGTGCCAGACGCCATTTTAACAGTAGTGAAAAAATCGTCTGGAGTTTGGCTCGTTTTCAGCATTCGCATTAGCATTTTAGCGGTAAAAC 
AAAT ATCCAAATAGACAATGAGCAACGGTGGTGGTGCCGGGGGATTGGGCGCAGCGCGTCCACCTCCACGGATCGATGGAAT GGT CTCGCTTAAG 
GCAAGTATCTGCTAAGTAAAATGCGACTAAATCGCCCACAAAATGGCGGCCGGCTGGCAACGTTGTGTGGTTAGGTAACCTCTCCAAAAGCATGT 
AAATTAATCCGCGTGTAATTATGCCTACAGGTCGACAATCTCACATATCGCACCACGCCGGAGGATCTGCGTCGGGTCTTCGAGCGGTGCGGCGA 
GGTGG GGG AC AT C T AC AT AC C C AGGGATC GC T AC AC AC GT G AG AGC C G C GG ATT C GC AT T T GT T C GGT AAGC AT T C AC TTCTCTTC C GGG AAAAT 
TGTGAAAGTCTTAGGTCTTTTTTGTTTTGTTACTTTCACTTGCCGGCGGTGGATTTTAATTATCATGGGACCTAAAATGTA 

(SEQ ID NO: 712) 

Exon: 7821.. 6867 
Exon: 4477.. 4292 
Exon: 4218.. 3449 
Exon: 3388.. 3302 
Exon: 3236.. 3060 
Exon: 2994. .2826 
Exon: 2769.. 1697 
Exon: 1007.. 1001 

Start ATG: 7293 (Reverse strand: CAT) 
Transcript No. : CT19890 

CTGACTTGATGCGTGGCTTAAAACGTGTAAAATAATAGAAGATTGCTAGTTCCGAGCACTGCATTTAGTGTTCAGGTAGTCGAATCATACCTGCA 
GAT CACC AGGT GGTCACC ACT GCCCCAAACAGGCGAT AGT CGATC AAAT TAAAGGGAATTCGAGTTCAACCCGAGTTGGCCAGGCT AGC AAAGC A 
CAAACAACAAACACAACACCGAGTCACAACAAAAGTGTAGACCACATTGAGCCGAAAGTCTGGCCTAAACCCCCCACAAGCCACCCCCCCGGCCA 
ACAACCATCAAGGCGTGCGTGTGTTGGTGGCATTGCTTTTGCCATTCCCAATTTCCATTACCATAGGCAATCCCAAGAATTCCGAACGACCGCGT 
GTGTTCTGTGTGAGTGCGGGAGCAGCAAGAAGAAGCAGCTGAAGCAGTTTGTAGAAGGAGGGCGAAAGTGTGTCGTCGCGTGAAAAAAGCGAGAG 
AGAGATTTTCATCGGAGGAAACCACTGCGGAGGATTACGGACGCACTGCAACGATGGCCGCCCAGGAAAACACGGAGATCAGCACAAAGTTCTCA 
ACGCTGAACGTGAACGCGGTAGAATTTGTGCCCAGTTTCAGTTACAACAGCGTTGTGAATGTAGCTGAGGAGGCGGCGGCAGTAGTGGTTGCAGC 
GGCTGCGGCCGAGGAGACGGCGGATCCGGGTCCCGCCTCCGGTTCAGCCACGCCAGCCACCACACCCGATTCCGTCGGCAGCGGGGGATCCACAG 
CGGGAGCAGCTCCACCGCCTCCAACTGCACAGGGGGCGGGGGCAGCAGCACCCTCAGCACAGCCGCCCACATCATCCTCCAATTCCGCATCGCCA 
GCGCCGGGTTCGCCAGCCACCACGCCTTCAGCAGGAGCAGCGGCGGCGGATCCAACGCCCATCGATGGTCTCAACCCCTCGGACAAAATCGCAAA 
CAATGAAACCGATCCTGCTGACAGCTGGGATGTGGACGATGCAGTGATCACGCCCGAGGATGAGGAGGTGGAAGATGCTGAGTTCACGGAGGGCG 
AGGCCACGCCGAAGGTGTCCAAGAAGAAGGTGGTTAAGGTGGAGGAAAATAGGAGCAAGCGTGAGCACGTGAATGTTGTGTTCATTGGCCATGTT 
GATGCTGGCAAATCAACAATTGGCGGACAGATCATGTCGCTCACGGGCATGGTGGACAAGCGAACACTGGAGAAGTACGAGCGGGAGGCACGTGA 
GAAGTCGCGTGAGAGTTGGTACTTGTCCTGGGCGCTGGACACCAACCAGGAGGAACGCGACAAGGGCAAGACCGTCGAGGTGGGACGAGCCTTCT 
TCGAAACGGACCGAAAGCATTTCACTATCCTCGACGCACCCGGTCACAAGAGCTTCGTGCCTAACATGATCGGTGGCGCGGCGCAGGCCGATCTC 
GCAGTGCTGGTCATCTCGGCGCGAAAGGGTGAATTCGAGACGGGCTTCGATCGCGGCGGCCAGACCCGAGAACATGCCATGTTGGCCAAGACCGC 
CGGCGTTAAGCATCTGGTCGTGCTGGTCAACAAAATGGATGACCCAACGGTCAATTGGGATCAGACGCGCTACAACGAATGCAAAGACAAGATAC 
TACCATACCTCAAGAAGCTGGGCTTCAACCCGGCCAAGGACCTTACCTTTATGCCTTGCTCGGGCCTCAGCGGATACGGGCTGAAAGACCAAATC 
CCGGAGACGCTCTGCCCCTGGTACCGGGGACCCGCCTTCATTCCGTTCATTGACGAGCTGCCCTCGCTCAACCGCAAGTCGGACGGGCCTTTCAT 
CATGCCAATTGTGGACAAGTACAAGGATATGGGCACCGTGGTGATGGGCAAGGTTGAGTCCGGAACAGCGCGCAAGGGTCAAAACCTGTTAGTGA 
TGCCAAATAGGACGCAAGTGGCGGTGGATCAGTTGTTCTCGGACGACTTTGAAGTCACTTCTGTTGGTCCCGGCGAGAACGTCAAGATTAAGCTG 
AAAGGTATTGAGGAGGAGGATGTCTCGCCCGGTTTTGTTCTCTGCGATGCAGCCAATCCCATCAAGACGGGAAAAATCTTTGATGCTCAGGTCGT 
AATTTTAGAACACAAATCAATTATCTGTGCGGGTTACTCGGCTGTCATGCACATACACTGTGCAGCCGAAGAAGTCACAGTTAAGGCCCTCATCT 
GTTTGGTCGACAAAAAGTCTGGTGACAAATCAAAAACACGTCCACGGTTCGTTAAGCAGGATCAGGTTGCAATTATGCGAATCGAGTGCTCCGGA 
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ATGATTTGCCTTGAACAATTTAAGCTATTCCCTCAAATGGGCCGTTTTACGCTCAGAGATGAAAACAAAACCATTGCCATTGGCAAGGTGCTGAA 
GGTGGTCGAATAAATTGAGCCTAGCTAAATTCGAGCGCCCGCCTTTCATTGCGATTTTTGAGATAGCATTGCAAGCCCGACGAGAGMCACAGCC 
ACAGCCACAACAACAGCAACGCCACGCCACGCTTTCTTCACATTTGACAAATCAACAAAAACAGTACTCAGCATCGCTCGAGCTCTCACTCACAT 
TACTTTCAACCAGCAATAGCAATCGATTATAGAATCCAGATCGATTCAAAACGGAACGGAACGGCAATGTAGGCAGGCGAAAAGTTAATTTACAA 
CAAGGACGAGTTCGGAGATAGAAACAGAAATGCATTATATACGAATTATAAACTATATTATGTAAAACTAAGCAGCAAAATCGGAGGCAGCGAAT 
GCGAAACAAGATGAACTATTTATATATGTATACAAAACAAACAACTCACACGTATATATGCAAAGAGAGAGAAAGAGTTGAAGAAGAGGAGTTGC 
ATTGATAACGATTTTGTTTGTTGAGCCCATCTCGCAAGGATATTGAGCACTTGTCGCAGTCGTCGTTTTCTTTTGGTTTACTATTTTGTATAAAC 
TGTAAGCATTAAATAGCCTACACGTACATGCCTAAATCGCGCGCAATACGTGAATTTCTTTAAAATACTAAAACTAAAGCTGCAAGCTACCCAAT 
TTTTGTATTTGTAAGCGAAAGTACGATCCCGAGAATCCTTTCCCCATCCATCTTCCTGTATGCGCCTTTAAATTGCAAATTTGTTAAATACCCTG 
CAAAATGCATTAAGATATTTCTGCTCTTCTTTTTAAATGATATAATCTATATGCTTTAATTTAAGTTTAAGCTTAAAGAACATGGAGTAAAGTCT 
GTAAAAGCAGAACGGAGAAAATTCAAATTTACGGAAACACCTAATTGTTAATCATTTAAAACCATTTGATTTTGCTACTTAAAACACATTTGATT 
TTGTATAATTTAGTTAATTGAAGAATTCGTGCGAGTTGAACACGAAATCCCATATATGAAAAAATAAAATGCCTTTTGCTACCGAGCAGAAAAAA 

AAAA 

(SEQ ID NO: 713) 

Start ATG: 529 (Reverse strand: CAT) 

MAAQENTEISTKFSTLNVNAVEFVPSFSYNSVVNVAEEAAAVVVAAAAAEETADPGPASGSATPATTPDSVGSGGSTAGAAPPPPTAQGAGAAAP 
SAQPPTSSSNSASPAPGSPATTPSAGAAAADPTPIDGLNPSDKIANNETDPADSWDVDDAVITPEDEEVEDAEFTEGEATPKVSKKKVVKVEENR 
SKREHVNVVFIGHVDAGKSTIGGQIMSLTGMVDKRTLEKYEREAREKSRESWYLSWALDTNQEERDKGKTVEVGRAFFETDRKHFTILDAPGHKS 
FVPNMIGGAAQADLAVLVISARKGEFETGFDRGGQTREHAMLAKTAGVKHLVVLVNKMDDPTVNWDQTRYNECKDKILPYLKKLGFNPAKDLTFM 
PCSGLSGYGLKDQIPETLCPWYRGPAFIPFIDELPSLNRKSDGPFIMPIVDKYKDMGTVVMGKVESGTARKGQNLLVMPNRTQVAVDQLFSDDFE 
VTSVGPGENVKIKLKGIEEEDVSPGFVLCDAANPIKTGKIFDAQVVILEHKSIICAGYSAVMHIHCAAEEVTVKALICLVDKKSGDKSKTRPRFV 
KQDQVAIMRIECSGMICLEQFKLFPQMGRFTLRDENKTIAIGKVLKVVE* 
(SEQ ID NO: 714) 

Name : eRF-3 

Classification: translation_f actor 
Gene Symbol: Elf 
FlyBase ID: FBgn0020443 



Celera Sequence No. : 142000013384669 

AGCTAGCTCCGTTGGGCTCCATGCTGCAGTACATACTGGATCATGGTCATGAGATCACGGCCAATGCGGAGCTGAAAGTTTGGGCGTCACAGATT 
G C GT GT G GT AAGT C AAAT T C AAG ACAAT T T GAT T C AAT T C AAT T AAT T AAT T AAT GT T T GT AC C AG GC AT GC AT T AT CT GGAATC AC AGC AT T T T 
GTACACCGCGATCTGGCCGCCAGAAATATATTGCTGACCGCACGCCACCAGGCGAAGATCAGTGACTTCGGCATGTCGCGATCACTGAGACCCGG 
TAGCACGGAATACCAATTCACCCAGGGCGGACGTTGGCCCATCCGCTGGTACGCACCGGAAAGCTTCAACCTGGGCATCTTCTCGCACGCCAGTG 
ATGTGTGGTCCTTTGGCGTGACCATCTGGGAGATGTTTTCCCTCGGCGCACCGCCATATGGGGAAATATCTAATGTCGATGCCATCAAGTTGGTG 
GACAGTGGCGAACGCCTGCCGCAGCCCAATCTCTGTCCAGCTTACATCTATGCCGTGATGCAGAGCTGCTGGAAGGAACGACCCAAAGATCGGCC 
AACCTTTGTCTATCTCACCGAGTTTTTCGCCCGCGATCCTGATTACCAAAACCTGCCGGAATTGGTGCAAACGGTTCACATTTAAGCCAGTTTCC 
ATTTTCCATTTTTTCCGTTCTGAGTGCAATTTTCTGTAGCTATTTATGCAATGTCTTCTATTTCGTTTTCAACGTATTAACGATCTGGAACCCAC 
AGATCACTATTGTTGTTCCGTATCCTAGTATTCATGTTTATAGCTTGTAATATATTTTTTAATCATATGTACAACCAATTTGTGTCAAAAGAAAG 
CGAACAAGTGCCTTAAAGATTTAGCATATAAGCTTTGTATTCATTTTTACAAATCCATTTTGTTTCATTGGTTATACAAAATAAAAACTATTTTT 
ATACATACTGCGCTTTTTGATTTAGTATTAAATAATACAGCAGTATGCTTTAATAATCAATGTTTAAACATTTAATGCAGTTCTTTTATATAAAA 
CGTGGTAACAAAAGTAAATAGTTTGTTGGTAAACCGCTTAAATACTAGGTTAGTAATATGGTTTCTTGGAAAGATTATAACTACGTTAGCGTGGT 
TAGTGCTCTAGAAATGGATTATATATTTTTTTGTTAATTTTTTCGAGCTAGCACAACAGTAATCAAGCTGATCCGTTTATTGCCTGCTTCACATT 
CGATGCGCTGACGAGCTTAAGGGCGCCAAGTATTCCGTTCAGGTTCTCCTCGTTGTAGGGCTTGCCCTGGAGCGGAGTCACCACCGGGACGAGCT 
GATCGCTCTGCGGCAGCTGGATGCCGATCTCCTTCATTAATCCCTACAAATTTAGTAAAATCATAAGTGATAATTCTCTAAAACAGGGCAAAACA 
AATCTTACCGCTTCCACTTCCACGCTTAGTTTCAGTTTCATGTCCTGCTCATCGCCCTTCACCTCCAGTTCCTTCTGCACGGTGAACTTGGGAGT 
CTTGCCGATGACCCAGTCCCAAGAGCTGTAGTTGGAGCGCAATTCCTCAATGCCGGGGAACCATTTCTCTGTGGGATTGACCAGCTGAAAGCCGC 
GCTGCTGCATCGTCTGTGTGCTGCCGCCGTCTTCAAGCGTAGTTGCAGCCGTTCTCAAATACTCGTAACCCACGGCGGAACGCAGCTGAGCAACG 
TTCACCGTGCGATTGACATCAACCAGGTTGCGAATGGGCGACGGCACCGAAGCCGTGGCCTTGCTAATATAATTGGCCGGCTCCCGCACCAGAGA 
TTCGCCCAAGTGCAATTTGTTGGCGCTGGCCAAAATGGTGCAGTGATGGTACGAGTTCGGGTGTCCCAGCTTGGCCGCAGTGCCGGAAATCTGTA 
ATTGAACAAGGGCATACTCGTTGAGTGGGCAAATAAAGACCAGATGCACAGCCCCAGCTGTTGGTTAAGTGAGCTGATGCCAATGTGAGGGATCA 
GCCGGGGTCGCATCATTCCTTGATAAGAGCCACCGCTCCTCCTCACCTTCTTATTCATCACTACAATGTCGTCGCGCTCATTTATCTCTGCCTTG 
ATGGCCCATTCGCGGAACAAGGCCCTGGTCACGATGTTCAGGTTGTACTTGCGATCGTAGCGCTCTCTGGGCGAGAAAAAGGTGCAGTTGAGGTT 
CCCAAGATCATGGTACACCGCTCCACCGCCACTATTGCGGCGTGCAAGTGTTATGCCGCGTTCCACCAGCTGCGAAACATTAGCCTCGGTGAAGG 
GATTCTGGTGGCGACCAATGACTACGCACGGGTCGTTGGCCCACAGCAGCAGCACATGGTGGCGGCTGAAATCAAAGTTTTTGTACAGCCAATCC 
TCTAGAGCGAGGTTGGTGAACACATCGCTGGACTGCGAGATGAACACGGACTTCTTGATCTCCGAATCGGGAACCACGGGCGGTGGCTGGCGCCT 
CTGCTTTTTAATCTCCTGCTGTCCAGCAGATGGTGGCTCCACTCCGCTGGATGCCAGGCGCGTGGCCTGAGCGGCGGAGGAGAGAAGTAGCCGCA 
GCTGGCTGGAATTGCGTCGCAACAGCATGCTCATGGCTTACACAAGGAAGTGGGGCGTACGGGTGTGTGTGATTTCTGTTGCCAGGTGATCTGGG 
ACACAATGCACTTGCAAGCGAAACAAGGTCAATGGTGTCGAAAAAAGCTTTTGGCTGGTTGATTGGTTGGTTGGTTGGTTGGTTTCTACGGTTGT 
CGTCTGCCTGGTTCTGTGTGGTCCGTGTGTGCTCTTCTTTAGGTTATGTATGGTCTACTATCTACGCTATTCGTTTTGGTCTTTAAATGGCTTTG 
GTCGCTCTGGACTTTAGACTGTGGCGTTCTTAGTGGCTCGGGATGTGTTTGCAGCCACGGAGCTGGCCGAGATGAGGTATTCGTGCAGGGCGGGC 
GTCTGATGCAGCCACTTCCACGTTTGATGCAGACCACAGACACGCTTACGGAGACCCAGGCAGTCCTTGTTGGCACAGTTCACTGGCATACTACG 
CCTCACACCCATCCAGGTGGGGGATACAACGAATCCGGTGTTTACGGGGGGCCGCAACGTTACGGGGCCCAAATAAGATGGCTAAAGGGTTTCCT 
TTTTAGCTGACACCAAAATTTTCCACTCTTTGTTCGCGGCGCACGGGAACTGCGGGGGGCGGGGGCAAGGCGGATGATCGCAGTGGGCGGGGGTA 
GCAGTTATAGCACAAACAAACGAACGAACACAGAGCCGATCGAAAAGCAGCTGCGCGTGTGGAAATATCACGACCGATTGGCTTTCGTTTCAGAT 
GAGCAGTGGTGCTCGCTGCTCTTGTTCTTCCTCGACGGTCCCCAAATGCCATTCGCAGCGAACGACGAAAGCTCCGGTCATTAAAAAAAAAAAAA 
AACAACGGCACTCTGGCGCCGCACTTAATTTATTTTCATCTTTTTAATTCGAACTTGTATAATTTACGCTAAACATGTGTGATAATCCGAGACGT 
CTACAACAGATGTGACATTAATATGTGTTATATTTTATAAGCAGCATACTTTGAGAGCCCCTTCAGTTTGGAATACCCCCATTATTAATGTAAGC 
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CGCCTCTTCGGGATCAATTGGCTTGCGGGCAAATCGTGCCGGCAGCTCGTAGTCCTCAATTGCTGGTCCACCGGCCAACTAAATGTAGAAGGGAC 
AATTACGAAGATTAGTTATGATGCACAGAAGAAGCCGTGGTATTGTGTATAAATGAAATGGGCTGGGTTAACCTTTTTGCCTCCTGCGGCCTGGG 
AACTTGCCTTCTGGTTTGCTGTCTGGGCGCCCGGCACTCCTGCACCACCCTTGCGGAAGTGTATGAGCGGTACGCGCTTAGCCAGCGCAACTAGT 
GTTGGTCGAACCATAGCAACAGCAATTGAATTTCTACCAAAAAGCCAAGGCAAAAGTAAAAGTTACGTGAATAACAGCTGACTGTAAACTCATAC 
AAGCGCTGCCAGATCGTTAAAATTGCATGCCATTCGCCGCGCAATGGATTATTATCGTTATTGAAATTTACGTGATCCTGTGAATGGCTAAGGCA 
GATAAGAGGAAGCGCAAAAAACTATTTTGGTCCAAATCGGTAATATTTTTATATTTAAAATGAATTGGGGACAGAAGATAACCGGCAGCCGACGG 
CCCTAACTTCTGGACGGACATCGCCAGTATATGGATTGCTCCTAAAACTAGCAAACCAACGTACTTATGCTCTATGCAACAATAAATGTCATATG 
AAAACAAACTATTGAAGACGTTTTTATGAGTATAACCGGTAACTATTTCATTTGTTTATTATATTAATGACAAATAAAC 

(SEQ ID NO: 715) 

Exon: 3354.. 2042 
Exon: 1895.. 1434 
Exon: 1373.. 1001 

Start ATG: 2593 (Reverse strand: CAT } 
Transcript No. : CT19977 

AAGAACAAGAGCAGCGAGCACCACTGCTCATCTGAAACGAAAGCCAATCGGTCGTGATATTTCCACACGCGCAGCTGCTTTTCGATCGGCTCTGT 
GTTCGTTCGTTTGTTTGTGCTATAACTGCTACCCCCGCCCACTGCGATCATCCGCCTTGCCCCCGCCCCCCGCAGTTCCCGTGCGCCGCGAACAA 
AGAGTGGAAAATTTTGGTGTCAGCTAAAAAGGAAACCCTTTAGCCATCTTATTTGGGCCCCGTAACGTTGCGGCCCCCCGTAAACACCGGATTCG 
TTGTATCCCCCACCTGGATGGGTGTGAGGCGTAGTATGCCAGTGAACTGTGCCAACAAGGACTGCCTGGGTCTCCGTAAGCGTGTCTGTGGTCTG 
CATCAAACGTGGAAGTGGCTGCATCAGACGCCCGCCCTGCACGAATACCTCATCTCGGCCAGCTCCGTGGCTGCAAACACATCCCGAGCCACTAA 
GAACGCCACAGTCTAAAGTCCAGAGCGACCAAAGCCATTTAAAGACCAAAACGAATAGCGTAGATAGTAGACCATACATAACCTAAAGAAGAGCA 
CACACGGACCACACAGAACCAGGCAGACGACAACCGTAGAAACCAACCAACCAACCAACCAATCAACCAGCCAAAAGCTTTTTTCGACACCATTG 
ACCTTGTTTCGCTTGCAAGTGCATTGTGTCCCAGATCACCTGGCAACAGAAATCACACACACCCGTACGCCCCACTTCCTTGTGTAAGCCATGAG 
CATGCTGTTGCGACGCAATTCCAGCCAGCTGCGGCTACTTCTCTCCTCCGCCGCTCAGGCCACGCGCCTGGCATCCAGCGGAGTGGAGCCACCAT 
CTGCTGGACAGCAGGAGATTAAAAAGCAGAGGCGCCAGCCACCGCCCGTGGTTCCCGATTCGGAGATCAAGAAGTCCGTGTTCATCTCGCAGTCC 
AGCGATGTGTTCACCAACCTCGCTCTAGAGGATTGGCTGTACAAAAACTTTGATTTCAGCCGCCACCATGTGCTGCTGCTGTGGGCCAACGACCC 
GTGCGTAGTCATTGGTCGCCACCAGAATCCCTTCACCGAGGCTAATGTTTCGCAGCTGGTGGAACGCGGCATAACACTTGCACGCCGCAATAGTG 
GCGGTGGAGCGGTGTACCATGATCTTGGGAACCTCAACTGCACCTTTTTCTCGCCCAGAGAGCGCTACGATCGCAAGTACAACCTGAACATCGTG 
ACCAGGGCCTTGTTCCGCGAATGGGCCATCAAGGCAGAGATAAATGAGCGCGACGACATTGTAGTGATGAATAAGAAGATTTCCGGCACTGCGGC 
CAAGCTGGGACACCCGAACTCGTACCATCACTGCACCATTTTGGCCAGCGCCAACAAATTGCACTTGGGCGAATCTCTGGTGCGGGAGCCGGCCA 
ATTATATTAGCAAGGCCACGGCTTCGGTGCCGTCGCCCATTCGCAACCTGGTTGATGTCAATCGCACGGTGAACGTTGCTCAGCTGCGTTCCGCC 
GTGGGTTACGAGTATTTGAGAACGGCTGCAACTACGCTTGAAGACGGCGGCAGCACACAGACGATGCAGCAGCGCGGCTTTCAGCTGGTCAATCC 
CACAGAGAAATGGTTCCCCGGCATTGAGGAATTGCGCTCCAACTACAGCTCTTGGGACTGGGTCATCGGCAAGACTCCCAAGTTCACCGTGCAGA 
AGGAACTGGAGGTGAAGGGCGATGAGCAGGACATGAAACTGAAACTAAGCGTGGAAGTGGAAGCGGGATTAATGAAGGAGATCGGCATCCAGCTG 
CCGCAGAGCGATCAGCTCGTCCCGGTGGTGACTCCGCTCCAGGGCAAGCCCTACAACGAGGAGAACCTGAACGGAATACTTGGCGCCCTTAAGCT 
CGTCAGCGCATCGAATGTGAAGCAGGCAATAAACGGATCAGCTTGATTACTGTTGTGCTAGCTCGAAAAAATTAACAAAAAAATATATAATCCAT 
TTCTAGAGCACTAACCACGCTAACGTAGTTATAATCTTTCCAAGAAACCATATTACTAACCTAGTATTTAAGCGGTTTACCAACAAACTATTTAC 
TTTTGTTACCACGTTTTATATAAAAGAACTGCATTAAATGTTTAAACATTGATTATTA 
(SEQ ID NO: 716) 

Start ATG: 762 (Reverse strand: CAT) 

MLLRRNSSQLRLLLSSAAQATRLASSGVEPPSAGQQEIKKQRRQPPPVVPDSEIKKSVFISQSSDVFTNLALEDWLYKNFDFSRHHVLLLWANDP 
CVVIGRHQNPFTEANVSQLVERGITLARRNSGGGAVYHDLGNLNCTFFSPRERYDRKYNLNIVTRALFREWAIKAEINERDDIVVMNKKISGTAA 
KLGHPNSYHHCTILASANKLHLGESLVREPANYISKATASVPSPIRNLVDVNRTVNVAQLRSAVGYEYLRTAATTLEDGGSTQTMQQRGFQLVNP 
TEKWFPGIEELRSNYSSWDWVIGKTPKFTVQKELEVKGDEQDMKLKLSVEVEAGLMKEIGIQLPQSDQLVPVVTPLQGKPYNEENLNGILGALKL 

VSASNVKQAINGSA* 
(SEQ ID NO: 717) 

Classification: hypothetical 



Celera Sequence No. : 142000013384513 

TTAATCCCGAGGACGACGTGGGCAACATAAATGCTGCCCAGAGCAAATCGACGCCGAGAAGTAAACAGCTAAATGATTCAGCTCACAAGCCACGA 
ACAAGCACGACGGCATCTGCACCAACAAAACGAAAGTCTGCGGAAAATTCTTTATCCGAGGTGTTTGCTAAAACTGTGCGATTGATACAGCCCAT 
CCCAAAGATGTGGCCTACTCCTTCTGATGCCGACTCGGGGATATCTTCGATGGGCTCATCATTGGCATCGACAGCCCGCCTGAAGTAGTAGCCGT 
GTTTCGTCTTAAATGTGCTTAGTAGTTCTTTGTAGATAAGATTCCTTGCGTCCTGACCCAGTAGTAGAGTCTAGTTTGTGTGCCCCCTTATTCGA 
ATTCAATTTTTATGTATATTATTTTCAGGCCGAGAGATCAATGCGCTTTCCCATGCGGTACAACAAGATCAAGAGGCTGCATTAGATATATGTTC 
TTGCCAGATTATTTACCCGCAATCGTGTACAAATGTAAGTTTTAGTTACGTATTATTAGGAAGCTTGCAATAGAAAGAATGCTTTAGCTTAAGCT 
TATCGATGTATGAATCAATATTTCTTTAGAGTAGTAAACCAGTATTATCTAAGCTCGATTTTCCTAAGCATGAAGCCGTTTTTTAGTTTTTCTTT 
CACTTCACTTTTTTAACTTTTTACGACGCTTTTAGTTGCTTATGAAATTTTCACATTTTTATTAATTTCTATTTCTTTTGAACATTTTTGATTTA 
TTTTACTTTACCTACGAATAGATTTATTTCGTATTCTTATTATTTCTATGATTTCCTAAAAATTTAACACTATACCTCCTTTGATTTAGAACAAA 
AATAGACAAATGACTTTTTGATTGACGAAGCGCGTGTTTTGGAAATCCTCATGAGCTGATAGTACTAAATTCAAGTACCTGATCTCGTCTGTACA 
7AAAGTATAAACATAAGAACTCTTCGGACATAAGACAAAGTTCAAGATAAATTAAGGATAAAGTGTATATATACGTATGTATTTACCAGAGTGCAA 
GCTAGTGTCTATTTACAATCATTTTAAGATATAATGTTTTTTTCGTAAGTATAACGTGTAGTTCTTAAGGAATAAAACGGAACCATGCAATTACT 
TAATGTGTTTTAATGAAAGTAAGTCGGCGATTCTCCCAATGCTACTCCGAGTCGCTGCTGCCTGACTTGTGCTCCTCCTCTTTCTTCTCTTCAGT 
CTTGTAGTGGCGATATTCTTTCTTGAGCTCCCACATGTTCTTGTGCGGGTTCTTCATATTGTAATCGCAGACATCCTTGAGAATCTCCTTCAGGT 
AGCTAATCGGCTGGTTGGTGATCTTGACCAGATCCTTGATGTTATAGTATTGATGCTTCTCGAATGCGTGGAAGAGCATATCCATCACGGCATTC 
TTGTCGTCGCGCGCCTTCTTGCCCTCGGCCTTCTTTCGCTCCCGATATTCAATCTGCAGGAATGGAGGAGTTATATATACATATATATTACCAAA 
ACAGAGTGTATTATTGTTCTCACGTTGTGTGCGTGGTCTTTTACAGGCTTAAAGTTCTGCACGATTTTGTCGATAGGCTGCACACGTCGCTGTGG 
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TTCCGATGCCTTGCGTATGGACTCCAGCTTTAGTTTCATGTAGCAGTTGTCGGCGATGGGTCGGCACTCTAGCTTTTGCACAATCCTGCCCTCCA 
TATACAACTTTTCGTTATCCGGCTGTGCCGCCGAGGTAGTCGAGTTCTCCTTGCCATCGGACGGTGCCATGTGCGAGAATACGCCGAGCGTCTGC 
TTGGTGACCTGCGACACGTCCAGGATGTGCTCGGTGGGTATTTTCTCCTCGGGATCTAGAGCCAAAACGGCGGGCGTGAGGGAAAGCGAGACCTG 
GGCCTTTTGGCCAGGTGTCTTGTTTATCCGGAGCTTGCCCACGTCCATGTTCGTCGGCGCCTTCTCCCATTTCTGGGCAATGTACTTGGGCACCT 
TCACCAGCCATACGCCTCGCCCAGCATTGGACAGGTCCAGGTCCTTGTCGATGATCTGTGTCTTTTCCTTATCCTCCTTCGACATTCTGTTTGCT 
TTAAATTATTAAAAACGAATTTGCAAAGCCACTTGTACCGTTTTCGGCTTTGCTTGTTGTTGTTGTTTTGCCGTAAATTTAGTGCGGTCGAATAA 
ACAGTTTCGAAAAAATGCTAGTTTAGTATATCACGGTATTTTTCTTTCGGTATTTATTCGTTATGTTATTATTAGGCGCCAAGTTTGAAAAAGTC 
CGAAGTGCAGATAACGAAATAGAACTATTTACATATCTGAATTTTATATAGTTAGCAAGAACTTTAGAACATAAGAAATAACTTTTATAATGAAA 
TGTCTTAGAAAGAATATGATATCTTAATTTTGATTTATTCAGCAAATTTTATGACTAAATTTAGAACACAACATTGTATTGATTTAAGATGAATT 
TATGAATTGATGTGACACCTTATCCAGCATTATCCACCCCATTGCCCTATTAAATTGTTTTTCTAATATTTTTATTGTTTTTTCATTCTTTTTAA 
ATTAGGTTGACAGCATGGTGATTGTGTAGAATCTATTCCTCCTCCTCTTCCTCGTCCTCCTCCTCCTCCGCATCGAGATCCTCCTCGTAGCTATT 
ACCACCAGTGGGGGTAACATTAGAGCCGAGCTCGATGATCGGTTCCTGTTCCTCAACGGGCGTGACGGGCTCTTCGACCACAGGCTCGCTGGTCT 
CGTTCTCGTTCTTCTGGATGGCCTGCAGTTCCTGGGCGGCAAGCTCGGATTCCTGGCGGGCGATCTCGGCCAGCTCGGCGGCCTGGCGAGCAAGT 
TCTCCCTCACTCTCCTTCTCGATCTCCGCCAGGCGCTCCAGCTCCTGCTCACGGACACGCTGCTCCTCCTCCAGGCGAGCCTCTTCAGCGACACG 
AGCCTCTTCGGCGGCCTTTTGGGCAGCTTCTTCGGCGGCACGAGCCTCCTCTACAGCCTTCAGTGCGGCTTCTTCGGCGGCCTTTTGGGCAGCTT 
CTTCAGCCAGACGAGCCTCTTCGGCTGCCTTTAAAGCGGCTTCTTCGGCAGCCTTTTGGGCAGCTTCCTCGGCGGCCTTTTGGGCGGCTGCCTCC 
TCTGCTAGGCGAGCTTCCTCAGCAGCTTTTTGGGCAGCTTCCTCGGCGACCTTTTGGGCGGCAGCCTCCTCCGCTAGACGCGCTTCTTCGGCAGC 
TTTCTGGGCAGCATCTTCAGCGGCTTTTAGGGCTTTAGCTTCTTCGGCTGCTTTTTGGGCGGCAGCTTCAGCCAGGAGAGCCTCTTCGGCAGCGA 

TT 

(SEQ ID NO: 718) 

Exon: 2327. .1544 
Exon: 1478.. 1001 

Start ATG : 2080 (Reverse strand: CAT) 
Transcript No. : CT20349 

ATAAAATTCAGATATGTAAATAGTTCTATTTCGTTATCTGCACTTCGGACTTTTTCAAACTTGGCGCCTAATAATAACATAACGAATAAATACCG 
AAAGAAAAATACCGTGATATACTAAACTAGCATTTTTTCGAAACTGTTTATTCGACCGCACTAAATTTACGGCAAAACAACAACAACAAGCAAAG 
CCGAAAACGGTACAAGTGGCTTTGCAAATTCGTTTTTAATAATTTAAAGCAAACAGAATGTCGAAGGAGGATAAGGAAAAGACACAGATCATCGA 
CAAGGACCTGGACCTGTCCAATGCTGGGCGAGGCGTATGGCTGGTGAAGGTGCCCAAGTACATTGCCCAGAAATGGGAGAAGGCGCCGACGAACA 
TGGACGTGGGCAAGCTCCGGATAAACAAGACACCTGGCCAAAAGGCCCAGGTCTCGCTTTCCCTCACGCCCGCCGTTTTGGCTCTAGATCCCGAG 
GAGAAAATACCCACCGAGCACATCCTGGACGTGTCGCAGGTCACCAAGCAGACGCTCGGCGTATTCTCGCACATGGCACCGTCCGATGGCAAGGA 
GAACTCGACTACCTCGGCGGCACAGCCGGATAACGAAAAGTTGTATATGGAGGGCAGGATTGTGCAAAAGCTAGAGTGCCGACCCATCGCCGACA 
ACTGCTACATGAAACTAAAGCTGGAGTCCATACGCAAGGCATCGGAACCACAGCGACGTGTGCAGCCTATCGACAAAATCGTGCAGAACTTTAAG 
CCTGTAAAAGACCACGCACACAACATTGAATATCGGGAGCGAAAGAAGGCCGAGGGCAAGAAGGCGCGCGACGACAAGAATGCCGTGATGGATAT 
GCTCTTCCACGCATTCGAGAAGCATCAATACTATAACATCAAGGATCTGGTCAAGATCACCAACCAGCCGATTAGCTACCTGAAGGAGATTCTCA 
AGGATGTCTGCGATTACAATATGAAGAACCCGCACAAGAACATGTGGGAGCTCAAGAAAGAATATCGCCACTACAAGACTGAAGAGAAGAAAGAG 
GAGGAGCACAAGTCAGGCAGCAGCGACTCGGAGTAGCATTGGGAGAATCGCCGACTTACTTTCATTAAAACACATTAAGTAATTGCATGGTTCCG 
TTTTATTCCTTAAGAACTACACGTTATACTTACGAAAAAAACATTATATCTTAAAATGATTGTAAATAGACACTAGCTTGCACTCTGGTAAATAC 

ATACGTATATATACACTTTATCCTTAA 
(SEQ ID NO: 719) 

Start ATG: 248 (Reverse strand: CAT) 

MSKEDKEKTQIIDKDLDLSNAGRGVWLVKVPKYIAQKWEKAPTNMDVGKLRINKTPGQKAQVSLSLTPAVLALDPEEKIPTEHILDVSQVTKQTL 
GVFSHMAPSDGKENSTTSAAQPDNEKLYMEGRIVQKLECRPIADNCYMKLKLESIRKASEPQRRVQPIDKIVQNFKPVKDHAHNIEYRERKKAEG 
KKARDDKNAVMDMLFHAFEKHQYYNIKDLVKITNQPISYLKEILKDVCDYNMKNPHKNMWELKKEYRHYKTEEKKEEEHKSGSSDSE* 

(SEQ ID NO: 720) 

Name: TRANSCRIPTION INITIATION FACTOR II 
Classification: transcription_f actor 
Gene Symbol: TfllFbeta 
FlyBase ID: FBgn0010421 

Celera Sequence No. : 142000013384643 

AC C AG CAT C C AGT GAT TGTATTTAT AGC C C AAT T G AAG C GT TAG AT T AC GAT T C ACG GAGGG AT AG AC C GT T T T T AG AT TGGCATTCGCCTCGAT 
GACCGGCTGAAGATTTGAGCCCAAGTGCAGGGATGTCAGCAGCGCAAAGTAACCAATGTCTCGTATTTTGGGTATGAACATGGTTCTCGAATTTG 
GAGTTCAAAATGAAAAAATTTTAAGTCGTTTTTTAAATTTTGTATCTAATTTTGTGTGACCTCAATGTGCTCTGACATTCCCAGTCTGATTGCTT 
AAAAAACGTTATATTATAAACATTTTTTGATTTTTATTTTAAATTATACGTAATTAAAATTACGTAATTAAACAATTAGGTAAATAATAAAAGAT 
TTTATCGTTGTTTGTTATAAATTAATTTATTAAATTGAAAGGCAGCGTGCTGAAACTACTTAACTGCATCAACTTATTTTAGCGACATAAATTAT 
TTGAAATTCCCGCCAGCTGGTACAGTTTGAGAGCAACTATTAGCTGTGCTAAAACCCCTGCATTTAAGTGCTTGACAATCTTGCCTTATTCCAAC 
CTATTGGAAATATAAATTTCATATTAAAACAATCAACGGTACGATGTACATATTTTTTTATTGTAAAACAATATATTGTTAAGATTTCAAAAGGA 
ATTAATTTTTTATTTTTAGTTCAATTGGTTGGAGTGCTTGGTAAATAGTTAGCACAGATAGCAACCTTGGTGTGAGCCTTCGCTAATATTTCGAC 
ATAAGATTTAAATACAGAAAATAATATTTAATTCTAACAATTATACCTTATTTCTGTGTGAAATTATGAAAAATGTAAACATTTCTCTGAATACA 
AATGACATTTCGTTACTGTAGGTTTTGTTCTGTGTAAATAGGTTTAATATTCACGGATAATTCGTTTGTAGTATATTTTAGTATATAATTATTCG 
CGTATGATGGATTTTTTGGCCGGCAATAAAAAAGTGTAAATTGTAGCAAGGCAACCGAATCGTTTCAAAATTTTGTCCAAAAAGTGGAAATTTAC 
GCGAAATGGAGCGCGAAATAGCGCACAGTTTAGCTGGCGGCGAGGAACGAAGTAGCGATGTGGCACCTGGACAGGTTAAGACCTTCGAGGAACTG 
CGTCTATACCGAAATCTGTTGAACGGATTGAAGCGCAACAATTTCGTGACGCCAACGAAGATTCAGGCAGCTGCCATACCCATGGCACTGGCCAA 
AATGGGTAAATATCTTGCATGCAACAGAAGATTATCAATAATAGTTGCTATATTAATCTGCAGATCTAATAATACAATCGAAGAGTGGAACTGGC 
AAAACTCTAATCTATGTTATAGCCGTCGTTCAGAGCTTTAATCCCAACATAAACCAACCGCATGCCATGATCGTGGTCCCAACTCGAGAGCTGGC 
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CATTCAGGTGCAGGACACATTTTTCCATCTCTGCAAATCGTTTCGGGACTTTAAATGCTCTGCTTTCATCGGTGGCACCGATGTGGCCAAGGATC 
GCAAGCGAATGAATGAAAGTCGAGTCATTATCGGTACACCGGGACGTTTGCTGCATCTGTATGAGAATAGAGTATTCGATGTGTCCAAACTGCGA 
CTGCTGGTGCTCGACGAGGCCGATCAACTTTATCAGACAAAGAGCCTGCAGCATACGGTCAGTAAACTCATTGAGGCAATGCCCAAGAATCGGCA 
AATAATTGCCTGCAGTGCCACATACGATCAGAATTTGGATGAGCGTCTGGCCAAGGTCATGGACAAACCCATGCTTATATCGAATTCCGAAAGGG 
CTACCGTTCTACTGGGCATTCGGCAGTTTGTCTACGAATTGCCACAGCAGAATAACAGTGTGGAGGAGATGCGTCTTAAACTGCAGATTCTTGGC 
CAGATCTTTAATCAGCTGCCTTACGAACAGGCCATCATCTTTGCTAGCTCACAAATGCGAGCGGATTCTTACAAGAACTATCTTACAGCCAGTGG 
TATCGATTGTCATTTAATTTCGGGAGCCATGGAACAATCGGAGCGTTTGCATGTTTTCGAGGGCTATAGAAACTTTACTATGCGCATCCTGGTGG 
CCACTGATTTGATGGCACGCGGAGTGGACTCCCCGCATGCCAACCTGGTTATCAACATTGATCCACCACAGGATCATGTCACCTATTTGCACCGC 
ATTGGACGAGCTGGTCGGTTTGGCTCCAAAGGCATAGCCATCACGTTTATTGCCTCCAAGAAGGAGAGCCAGAGATTTAGGGAAATGTCAAAAAA 
AATCGCTACGGCTTGGTCGGTTCTGGAGTTTCCCAAGGAGCCAATGCCCAATGAATTCAATTTTTGGGATTTCGAAAAGTACAACTTTGATTATT 
ACATTAAGGAAGAGAATCCGCTACAGGAAATGCCTATGCCTATTAAAGAAAATAGAAGTAAAGAAAATGTTGATGCTTCTTCAGTTGACTTGGAA 
AATCTTCAAAAGGATCAGGATGGCAAAAGGAGAGATCCAGACAAACTACCAGTGGCCTTAGAAAATGTTGAAACCCAAAAAGAGTTGGAGCTCGA 
AAACTTACCAGAAAGTTCACATAATAATAAGAATCTTAGGGTCAAAGAGAAAGAAATTGTAAGGCAAGGAAAGCTTAAAGAAACTAATTCCAAAG 

C T GGAG GT T CT AAAAC GAAC AAG AC GGAG AG AAGG AAAAAAT CT AAC AC T C CAT C AAAGT T AC AG AAAC AAAT AAC T G AAGT GC AGC AAAC AC C A 
GAGATCACCGTAAATAAATACTATATGCAAGAAGACATACAACGAACAGCAGACGGCAAAGAGTATCATCCTGTTAATGGAGCGGGAAACCACCA 
GGTTCCGTCAAAATCTAATCAAAAACCGGTGGAGTTTATATCTCCCGAATTGACACCAACCCTAACACCATTGGAACTGACCCAGGAGCCGTCTC 
CAACCACAGTGACACCACCTGCACCGCCAGCCAACTCGATTAACAACAAAACCTATTGCCTAGCTGCTCCAACGCAAACCAGTTCGATGACCATA 
CAGAACATGGTAATATCGAACACGGTGGACGATGCCAGTAGCATCAGTTCGGATAGCATGGTGGTATGTGGCTATCATAGCGATAGGTCCTACGA 
CACATACTATGCAACCAGTGATGAGGAGGAGATCTGGAATAGATTAATGTCGAAACAAAGGCGTCAGCTGAAGAGCAAGAGAGGCCATGGAAAAC 
GGCGTGTGATACTATACAAGAAAGTATTTCCTAAACTAAAGGCGAATGTTAAACGCAAGCTTAAAAAACGGATGAGCTCGTCATGCAGATCCAAA 

AAAGC AC AC C AT TT GCGAAAAC GT C ATGT AT AC AAAAAT AT AT C AC T T T T GC C ACAC AT G AAT T T GT GTC AT C T GC T T C AAAAAT T T AC AAAT C A 

AGAGAGATTTTTGAAACGGCTCCACAAATATGCAAAGAATCAATTCATTGACAATGCGAGGGTTGGCTTTTTGCTGGATAAGCTCTATAAATCCA 

TCCTGGAAATGTACTATAACAGTGCTAAGGAGAGGAAGAAGCATTTTAAGGAAGCTATGAAGGAGTCCCTAAAATCCATTGATAACTACGAGAGC 

CAAGAGTCTTCGGAATCAGAGGAGGAGGAGGAGGAGGAGAAGGATCTAAAGAACAAGTTAAGTGTTCATGTGACGAATCAGATGGACATACCACC 

AATGCCACAGGCCGCTGTTGAAATTGGTTCAGGAAGTGAATCGGATACGGAGGACTCGCAGGAATACGGCGAAGGCGAAGAGATCATTGTGGACG 

ATGAGGACGATGGCAGCGATGGCGGGCCAAATTCATCGAGTGGTTTTGTAGAGAGTCAGGAGAGTGTTTCGTCTGGAATAGACACTTCCGTTTAC 

GAAACGGAATCTACGGGCGCATCTGGAAATAATTCAAGTGCATTCTTTATTTCGGATGATGAATCAGATACTGAATCCTTAACAGATTCGGAAGC 

TGGAAGCAATGTCAGCGCGACAGGATCAAGCTCACAGCGATCTTCAGTAGCGAGCAGCACCTATGAATCTTCGGATGATGATTCTGTAACTGTAA 

CCAATGTCAGTTCAATGCAGAACGCTCAATCATTGTGGTTGCAGACTTTCAACATGCAGTACCAATTCATTGCATCCCATGTGGCCAAGAATCTA 

CAGAACTATTACTAAGTGGTTATGAATGAAAAACTAGTGAAAACTCAGGCATTAAAATTAAACTCAACTAATCACTTCATTCGGAAACAAAACGG 

ATCAGTTCCAAGAGAGGCTGTATCAGTCTCTTTCCTTACAATTGTATATCGTAGCCAGTTAAACCGTTTTCTTAATTAGAATACTCAATGTCGAA 

ATAGTTGATAGCATGTTGGATACCATGAATGTTTTTTTATGCAAGCAATAAATTGCAGAATTTGTTACCAAAAAAATAAATGAATTAAAGCAGGA 

AACTAACTATTTTGAATTTATTAAACAATAAGTTCATTGCTCAACTTCTTTGATTTCAAAATTGATTGTTTTTCATGTTTATTCGCTACATAATC 

GCATCAAAC AAAT T AATCCACAATGTGAAAGTT AAAT CCCTTT AAT C ACTCGC AAT GATTTATCTAAGAGATGGGAAAACAAACATTTTTCTTGG 

TTTCGGCAAGCCGGATGGATGAAGGTAAAGAAGCGGCGCCAACCGACTGCCGGGACATAAATCATATAGTCAGTGGGATCGGGATCGGCATCGAC 

CATCGACCATCGGGCAACGGGCATCAGCTGTGTGGCAGACAACATTGTTGCACGGGCGCCGCTCACTTTGCAACTGACTGCAGAGGTGTTAACCC 

ATTAATTAGCTGTTGTCAAAAGTCGTTCACTCGATTTTGAAAGCCACTCGAAAATCAGAATGGATTGTACAAAAATATGACATATATTTATGTTT 

TATTTATTTATTTAATGCATTTTGTGAATTATTTGTGCAACGAATGCCATGTTTGTCAGCAAATAAATGAGTTCTACTATTTTCTTTTTATACTC 

CACTGCTGTATTTAATTTTTAAAATACTTCGATTCATAAATAATTTATTTTTAAATTGATTAAATGCAGTTAAAATTTTTAAGTTTCCTTTTACG 

AGTTTTGTGTCCCCATTTGCGAGTGCTTCGCAATGGGTTAAGCGAATGAGACGGTCCACAGGCGGCATGCCGCGTCGGTGGCATAGTCAAAACAT 

TCGCCCTGCGGTCGAGCACGCAAAAAGCCGGCATCTGGATGGTACTGCTCATTGGTCGTCGATCTGCAGTGCCCCAAGTGCCCCTCTTTTGGAGT 

GCAACACCCCTCCCCCCCCTGCGTTGCAAGTCCAATAAGGAGCGTGAGAAAACGCAAATTCAAAGCGAGCTGGAAGCGACAACATTTTATAGATC 

CTAGAGACATGGATAGCGTTGCATTTTGTCGCTGCCGCAGTGGCAGCAAACT 

{SEQ ID NO: 721) 

Exon: 1001.. 1240 
Exon: 1299.. 4467 
Start ATG: 1051 

Transcript No. : CT20361 

GCAACCGAATCGTTTCAAAATTTTGTCCAAAAAGTGGAAATTTACGCGAAATGGAGCGCGAAATAGCGCACAGTTTAGCTGGCGGCGAGGAACGA 
AGTAGCGATGTGGCACCTGGACAGGTTAAGACCTTCGAGGAACTGCGTCTATACCGAAATCTGTTGAACGGATTGAAGCGCAACAATTTCGTGAC 
GCCAACGAAGATTCAGGCAGCTGCCATACCCATGGCACTGGCCAAAATGGATCTAATAATACAATCGAAGAGTGGAACTGGCAAAACTCTAATCT 
ATGTTATAGCCGTCGTTCAGAGCTTTAATCCCAACATAAACCAACCGCATGCCATGATCGTGGTCCCAACTCGAGAGCTGGCCATTCAGGTGCAG 
GACACATTTTTCCATCTCTGCAAATCGTTTCGGGACTTTAAATGCTCTGCTTTCATCGGTGGCACCGATGTGGCCAAGGATCGCAAGCGAATGAA 
TGAAAGTCGAGTCATTATCGGTACACCGGGACGTTTGCTGCATCTGTATGAGAATAGAGTATTCGATGTGTCCAAACTGCGACTGCTGGTGCTCG 
ACGAGGCCGATCAACTTTATCAGACAAAGAGCCTGCAGCATACGGTCAGTAAACTCATTGAGGCAATGCCCAAGAATCGGCAAATAATTGCCTGC 
AGTGCCACATACGATCAGAATTTGGATGAGCGTCTGGCCAAGGTCATGGACAAACCCATGCTTATATCGAATTCCGAAAGGGCTACCGTTCTACT 
GGGCATTCGGCAGTTTGTCTACGAATTGCCACAGCAGAATAACAGTGTGGAGGAGATGCGTCTTAAACTGCAGATTCTTGGCCAGATCTTTAATC 
AGCTGCCTTACGAACAGGCCATCATCTTTGCTAGCTCACAAATGCGAGCGGATTCTTACAAGAACTATCTTACAGCCAGTGGTATCGATTGTCAT 
TTAATTTCGGGAGCCATGGAACAATCGGAGCGTTTGCATGTTTTCGAGGGCTATAGAAACTTTACTATGCGCATCCTGGTGGCCACTGATTTGAT 
GGCACGCGGAGTGGACTCCCCGCATGCCAACCTGGTTATCAACATTGATCCACCACAGGATCATGTCACCTATTTGCACCGCATTGGACGAGCTG 
GTCGGTTTGGCTCCAAAGGCATAGCCATCACGTTTATTGCCTCCAAGAAGGAGAGCCAGAGATTTAGGGAAATGTCAAAAAAAATCGCTACGGCT 
TGGTCGGTTCTGGAGTTTCCCAAGGAGCCAATGCCCAATGAATTCAATTTTTGGGATTTCGAAAAGTACAACTTTGATTATTACATTAAGGAAGA 
GAATCCGCTACAGGAAATGCCTATGCCTATTAAAGAAAATAGAAGTAAAGAAAATGTTGATGCTTCTTCAGTTGACTTGGAAAATCTTCAAAAGG 
ATCAGGATGGCAAAAGGAGAGATCCAGACAAACTACCAGTGGCCTTAGAAAATGTTGAAACCCAAAAAGAGTTGGAGCTCGAAAACTTACCAGAA 
AGTTCACATAATAATAAGAATCTTAGGGTCAAAGAGAAAGAAATTGTAAGGCAAGGAAAGCTTAAAGAAACTAATTCCAAAGCTGGAGGTTCTAA 
AAC GAAC AAGACGGAGAGAAGGAAAAAATCT AAC ACTCCATCAAAGTT AC AGAAAC AAAT AACTGAAGTGCAGCAAACACCAGAGATCACCGTAA 
ATAAATACTATATGCAAGAAGACATACAACGAACAGCAGACGGCAAAGAGTATCATCCTGTTAATGGAGCGGGAAACCACCAGGTTCCGTCAAAA 
TCTAATCAAAAACCGGTGGAGTTTATATCTCCCGAATTGACACCAACCCTAACACCATTGGAACTGACCCAGGAGCCGTCTCCAACCACAGTGAC 
ACCACCTGCACCGCCAGCCAACTCGATTAACAACAAAACCTATTGCCTAGCTGCTCCAACGCAAACCAGTTCGATGACCATACAGAACATGGTAA 
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TATCGAACACGGTGGACGATGCCAGTAGCATCAGTTCGGATAGCATGGTGGTATGTGGCTATCATAGCGATAGGTCCTACGACACATACTATGCA 
ACCAGTGATGAGGAGGAGATCTGGAATAGATTAATGTCGAAACAAAGGCGTCAGCTGAAGAGCAAGAGAGGCCATGGAAAACGGCGTGTGATACT 
ATACAAGAAAGTATTTCCTAAACTAAAGGCGAATGTTAAACGCAAGCTTAAAAAACGGATGAGCTCGTCATGCAGATCCAAAAAAGCACACCATT 
T G C G AAA AC GT C AT G T AT AC AAAAAT AT AT C AC T T T T GC C AC AC AT G AAT TTGTGTCATCTGCTT C AAA AAT T T AC AAAT C AAG AG AG AT T T T T G 
AAACGGCTCCACAAATATGCAAAGAATCAATTCATTGACAATGCGAGGGTTGGCTTTTTGCTGGATAAGCTCTATAAATCCATCCTGGAAATGTA 
CTATAACAGTGCTAAGGAGAGGAAGAAGCATTTTAAGGAAGCTATGAAGGAGTCCCTAAAATCCATTGATAACTACGAGAGCCAAGAGTCTTCGG 
AATCAGAGGAGGAGGAGGAGGAGGAGAAGGATCTAAAGAACAAGTTAAGTGTTCATGTGACGAATCAGATGGACATACCACCAATGCCACAGGCC 
GCTGTTGAAATTGGTTCAGGAAGTGAATCGGATACGGAGGACTCGCAGGAATACGGCGAAGGCGAAGAGATCATTGTGGACGATGAGGACGATGG 
CAGCGATGGCGGGCCAAATTCATCGAGTGGTTTTGTAGAGAGTCAGGAGAGTGTTTCGTCTGGAATAGACACTTCCGTTTACGAAACGGAATCTA 
CGGGCGCATCTGGAAATAATTCAAGTGCATTCTTTATTTCGGATGATGAATCAGATACTGAATCCTTAACAGATTCGGAAGCTGGAAGCAATGTC 
AGCGCGACAGGATCAAGCTCACAGCGATCTTCAGTAGCGAGCAGCACCTATGAATCTTCGGATGATGATTCTGTAACTGTAACCAATGTCAGTTC 
AATGCAGAACGCTCAATCATTGTGGTTGCAGACTTTCAACATGCAGTACCAATTCATTGCATCCCATGTGGCCAAGAATCTACAGAACTATTACT 
AAGTGGTTATGAATGAAAAACTAGTGAAAACTCAGGCATTAAAATTAAACTCAACTAATCACTTCATTCGGAAACAAAACGGATCAGTTCCAAGA 
GAGGCTGTATCAGTCTCTTTCCTTACAATTGTATATCGTAGCCAGTTAAACCGTTTTCTTAATTAGAATACTCAATGTCGAAATAGTTGATAGCA 
TGTTGGATACCATGAATGTTTTTTTATGCAAGCAATAAATTGCAGAATTTGTTACCAAAAAAATAAATGAATTAAAGCAGGAAA 

(SEQ ID NO: 722) 
Start ATG: 51 

MEREIAHSLAGGEERSSDVAPGQVKTFEELRLYRNLLNGLKRNNFVTPTKIQAAAIPMALAKMDLIIQSKSGTGKTLIYVIAVVQSFNPNINQPH 

AMIVVPTRELAIQVQDTFFHLCKSFRDFKCSAFIGGTDVAKDRKRMNESRVIIGTPGRLLHLYENRVFDVSKLRLLVLDEADQLYQTKSLQHTVS 

KLIEAMPKNRQIIACSATYDQNLDERLAKVMDKPMLISNSERATVLLGIRQFVYELPQQNNSVEEMRLKLQILGQIFNQLPYEQAIIFASSQMRA 

DSYKNYLTASGIDCHLISGAMEQSERLHVFEGYRNFTMRILVATDLMARGVDSPHANLVINIDPPQDHVTYLHRIGRAGRFGSKGIAITFIASKK 

ESQRFREMSKKIATAWSVLEFPKEPMPNEFNFWDFEKYNFDYYIKEENPLQEMPMPIKENRSKENVDASSVDLENLQKDQDGKRRDPDKLPVALE 

NVETQKELELENLPESSHNNKNLRVKEKEIVRQGKLKETNSKAGGSKTNKTERRKKSNTPSKLQKQITEVQQTPEITVNKYYMQEDIQRTADGKE 

YHPVNGAGNHQVPSKSNQKPVEFISPELTPTLTPLELTQEPSPTTVTPPAPPANSINNKTYCLAAPTQTSSMTIQNMVISNTVDDASSISSDSMV 

VCGYHSDRSYDTYYATSDEEEIWNRLMSKQRRQLKSKRGHGKRRVILYKKVFPKLKANVKRKLKKRMSSSCRSKKAHHLRKRHVYKNISLLPHMN 

LCHLLQKFTNQERFLKRLHKYAKNQFIDNARVGFLLDKLYKSILEMYYNSAKERKKHFKEAMKESLKSIDNYESQESSESEEEEEEEKDLKNKLS 

VHVTWQMDIPPMPQAAVEIGSGSESDTEDSQEYGEGEEIIVDDEDDGSDGGPNSSSGFVESQESVSSGIDTSVYETESTGASGNNSSAFFISDDE 

SDTESLTDSEAGSNVSATGSSSQRSSVASSTYESSDDDSVTVTNVSSMQNAQSLWLQTFNMQYQFIASHVAKNLQNYY* 

{SEQ ID NO: 723) 

Name: DEAD -box helicase 
Classification: RNA_binding 
Gene Symbol: Dhhl 
FlyBase ID: FBgn0011802 



Celera Sequence No. : 142000013384666 

GCGATATGACTGTCATGATCTACTTCTATGGCCCGATGCTGTTGCTAATCGCTTTCAACATAATAATGTTTGTCCTTTCGGCGATTTACATATAT 
AACATAAAGAAAAATGTGAAAGGCCTTGTCCACAAGCAACAAACAAACCAACAGATAAATGACCAACAAATGTAATTATATTATATTATAAGATA 
ATAACTATAATGTCAAATGGCACTTTTTGTAATCTTTTATAGGTTTGCTATATTCCTGCGACTTTTCATCTTAATGGGTTTGTCGTGGAGCTTTG 
AGATATTATCCTTTTTGTTAACCAAACAGCAAGCTTGGGCTAGGGCTTTAATGGTGGCTGACTACTTTAATTGGTCCCAGGGTACCATCATATTC 
GTGCTTTTTATTTTAAAGCCTAGCATTCTAAAACTTATAATAGCAGGGTAAGTAAAGTATTACAAATAACCTTGAAATTTGACCGTCTCATACGT 
ATGAACAATCTTTTGATTTAGGGGACGTCAAAACCTCCCAGGAAGTCACCATAATTCGAGATCAAAAGCAGCTCGATATAATTCAACTCATACGG 
CTTGTGAAGGATCAATTGCGGATCCAAACGCTTACTGCTAAAAAAATGGAATTCAATAAGATGCAACGTCTCTGTTATTGATCACATTATTTTAA 
TTTTCTATGGAGCAAAAGTTTAAAAAAAATATTTGTTATCAAGACAATAAATAATGTCTTGGAACATATAAAGCAAAATATATTAAAGGGTACAA 
AAAACAGGAAAAGTATGGGGCAATACAGCAAAATATTTTTCGGATCATCAACTTTTTTGTAAGACAAAATGTAGAAATATTTCTGGCACCGTTGT 
TTCAAACCAGGGTTTGAGTCAACAATGCTGGCCCACCACTGGCAAGTACCAAACGATTACAGAAAACTTTTTCTAATCGATTAAGCATCGGGGGT 
CATCGATGTTTCTCCACCACTACAATTTCATGTTGTTTTTTTAGTCGCGGCTGTTTGACGTTTGTCGCTTTCCTCCGTTGCGAATATAATATATT 
ACGTAGCTCATT TTTATACAAACGGAATTACGAGCGCAACGACGACAGCAACACTAGTAGCACT AAT CGTAAGCGC AGGGGCCAAAAATT AAAT T 
GCGTTTGCGGCCGCAAAGATTTGATGACGTCGCATACGCCGTCTTCTAGGGCGTAAAAAGCAAAGCAAAGCAAACAAACGCGAAAGCGAAACGTG 
TAAACGGCGTAGAAGCGATAAACGCGACTCAAATACGCAGCAGATAAAATACAATACGCGAGAAGAGAAAAGTCACGGGAAATATTGTTCATATT 
CGGCGTCTTTCTGCGAGCGTAAACGTGTGTGCGTGGGCTTGTGCTTTTGCCAGTGTCGTGTGTGTTAAGTGCCTGTGTGTGTGTGTGATTAAGAA 
GATATAAAGGAATATAACGGTAAATGCAGCGCCGCAAAAATGTGCAGCCTGACGCCAAACCACATGGTAAACGTAACGCAGCAGCACCTACACGA 
TCTATTGGAAACTTTCGAAAAAAAGTCCTTCGAGGCGGCGGCCTTTGAGGAAGGGACGGCGGTAAGTTAACGGTACCAGTGTGTGCTTGTGTCTG 
TGACTGTGTGTGTGACATACATTTAGGCGCACGTGCACGTGCTGCCGCCGCAAAAACACGCAGACACACACACACCCATGCATGATATGCCTGTT 
GTGCTAGTGTGCGTGGCGCTTACACCGAGGACACGTCATAATGTCCTGTCCTCCTATTGCTCTTATTCTCTTCCCTTTCTTGCGCTCCTCTCCCT 
CGCCGTCTTCCTTCTGCCGCGCCCCGAAAATACTGAGCCACTCAAAATGGCTGCCATTAAGCTTCCTGCGAGGAAAATCGCATTTTCATGGCAAT 
CAATAGGCGGTAAAACATGTCGTTTGAAGGACTTAATGGTTCGTTTTCTCAAAGAGCAGAATTAATCACGGGATAGCTTTTCCAATTTGGTAGTC 
AATCAAGGTTGTTACCTATAAGAGCACTAATCACTAATTAAAATATCTTTTTTAAATAAGATGAGCAATAAACGGTATTTTTGTATAATTTCTAT 
TAACATACATTTTATGGTCCTAATATTATTTAATTTAATTGATTTTTCTATTCCGTTTCTTAGCAAATTTTTAATTAATTGATAAGAATTAAATA 
TTAAGTTTCAGCTTTCAGAGTTCCTTATCGAGTTGCATTCATTTTGATTATTAAATCAATTTCAACTAGGAAATCCCGAGTAAGCAGGTTTTCCG 
AAACCGGAAACTTCCTAATGGCTTTTGTTCTCGAATTGCCATTGAAACTCAATATCCCACTAAGCTGTGCTAAGCGATAATTAAAAGCCTAACAA 
CGGCCTTTTTAGGCCCGCCCCTGACCCACGCATAAATGTGCATTTTCTTTCCACGACACGCACTCGAATTCAACTTTTGTCTTTTGCCTTCAGCT 
CATCCCCTTGCACCCGTTCATCCATTTTACCCCCCTCGAATGTCACGCACGCCTTTTAAATAGTCCAGTGTGTAATGCCATCTTATCGCCCGCTT 
AACCGGCAATAACACAGAATAACAATTGTAATGGCAACTTGGCAACAGCCGCTGCTCATTTGCATGCCAGTGCTGGTGAATATATTTAAGCCAAA 
TCCCCGAGCCAGGCTGCCTGGAAACAGAGCGAGGAAAGCCAAGTGGGAACTTGGCACGTAACTCACGTTTCCGCGACGATCTTACATCGGTTTGT 
TCAGCTGCAATCGGCACATAAATATGTACATACATACATACATATGTGCATACTTACTTGTGAATGTTTAACCGCTACCGCAAACTTTGTTGCAA 
ACGTATGTTTTTATCAAACAATGGCTAGGTCTTATCACCAACCAGGGCGCCAACCTTCGATTTGATTTTCCACAAGTATTTTCCTATTCCTTATC 
TATCTGCCAGCTATTCGAAGGCACAAACAAAACGCTGCTACTGCCCCTTGATACAAAGCATTGCATTTTTGATGGGGTGCAAAGGGTGAATTATG 
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TGTGCGCTTCTCTCTCACTTGTTTATTTCCCAGTCTCCTCGGGGTTCAGGGTTAAGTTTAGCCACACCGATTAGATTAGGCGTTCGCCCAGCCAA 
AGTCACAATACCTAAGCAAATGACGTAGGCAGGCGATGAGGTTTATAGCTTACAGAGAACGTATGTAAGAACCATTAAATGGGTTAAATATACGA 
GCTAATAAGGTCAAAATGGCATTTAAATTGCAGCTAAAGATAATATTGCTACCTGAAATATATTTGGAAATCTTTAAATTGAAATTGAAGAGCTT 

tccaaattcataagttttactcctcgtcttatcgcagcaaccaaaaacaagcaatcctcttaaaaggactcagaaaaaaatcgagtgaa™ 

GATTGAGTTTGCTCCAATTTACATCAAGACATGGAAATATTTAATATTCAACATCCATCAAAAAAAATTACACTTATTTATGTTGGTTCACCACA 
AAAAAAAATACATTTATATATGTAGATTTGCTAAAAAAAAAAAATCAACGTACATTTAAAAAGTTTCGCAGTAGACGCCGACCGACAATCGTTGG 
ACAATTATTGTATGCTTATGGACTTAGAAACCCTGGAAACTCCGGCGGAACTCTTCCTCGAAATCCGCGTGGCCAAGATCAAGCTTGTGGTAGTC 
GGTGAATCCCTCGCGATCCAGCTTGGCCTGGGCCCAAAGGATTAGCTTGATGAGGAACATCATTCTTGGCTCCATGGGCTCCTCTTTGCTCTTAC 
TCTCATCCTCGTGGCAGCGTAACATGGCTGCATTCATTTCACCGGCCACCTTTTGGCGATATGAGTAGTACATCAGCTCCCCGTAGGGACTGTAT 
TCCGGCCGATCGAAGGCCAGCAGACCCATGGTCCGCTCCACTTCGTGGTAGTGACTTGGATCGACCTTGGAAAAGCCCGCTGCCTTGCTCTGGGC 
AAACTTCAAGGCCTTCTCCATCTTCTGATCCCTGATCATCTCGATCAGGCGCAACTGTTGCATGTGAAAAAATACATAGTTGTCGGTCTCGAAGA 
GCCGTGGATAGATCCGTGTGGCCAGGTCCATGGCGTACTTGACCTGACCCACACGCACGGCATCCTGAATCCTTAAACGATCCCCGATGGTGTCC 
ATGTGTGGACCCGGCTTCACAGACGCCTCGGTCATGAAACGCTTGGCGGCCTCTTGATATCCCTCTAAATTTGAAGTTATGTACTTCAGTTAGGG 
GGGAAGCATACTAGTGAGGCTCACCTGTAACCAGATAGTTCATCACCAGGCGATTCAGGTCCGCCTGCCGGCACTGGAAGCTCATCATCCGATGG 

GGC C AC G AAGT GG AT GGT C TATACTCGGCCATTC T C AAT T T T T AT AGT T T GAT T T AG GT AC T G AAAAT T G AAAC AT AGG T AAAC AT GC AAAT T G A 
CATTAGTCCTGCTTTACTTCCTTAGTTCTTTAATTATTTCTCGTTTTTATCTATCTACTAATTGTAAACATGTTCTTTCCATTTATGTGAAGACA 
TTAGAAAGCTGTCGAGTTTTGTCTTCTTTGTTTTATTGATTCCCACACATTTTGTTGCCATATAGGAAAGTTCCGCTGATGATCATCTATTAACA 
ATGTGCTGATCATTATATCACTTTTGTTTTCAGCGAGTGAGTTACACATCATTTTTGCGCATTGGTTGTACGTACTATCGCCAACTGATTTATTT 

T TAT GG C C T GGC AT T G AAAACT GT T GC AGT T GC G AGT GT G T AT T GG GAT T T GC ATT T GT AT T AAGT AT CT GC GC G GGT GGGGT G G AC CAT C T GT C 

TGCTCGTGTGGGCGCCAAAAAGTCAATCCCTGGCCAAGCAGAGAAATGCCAGAAATAGCAGGGAAGGCTTTTTCTTCATAAATATCTCACTTTCG 

TAAACATTTTTTGTCCGCGCGTCTTGCCGTGTGTTTGCCGTTTTTTGTTGTTTTTATTTTTATATATTTTATTTGTTTATTTGCACAGTGTTAAA 

ACCCACCTGTTGTTGTTTTTGTTGCTGCTTTCGTTGTCAATCAAAGATTGTTGCAAAAATTTCACCTGCTGAAAACGAAGAAAAGGCTGAAATGA 

TTTTGCAATACAATGGAATATGGAAAACGTACTCTGTTGTATGTATTGTTTTGGATGGCTAGCGATGGCTAGCTGCAGTTTTTCTTCTATTTTCT 

TTTTGGCAACAATCGGCTTAATCCAAACCAATTGGACCAAAAATGGCCGCGTTGCGAGCGCTGCCTTTGGCGCCTGCTGGAACACTACTCTGGTC 

CCAGGAGCTGCAGGACCTGGACTGCAGCTACTCCTCCGCATTAATGTGTATCCCGCCTGGAGCCTTCCAGTCTGCAGCTCGCTTCCAGTTCGGAG 

ATTGCACTCGTTTTGGGGGTTGCAACCAGTTCCAGGAACAGTGGAGACACCGTATGACAAACGCCGGTGTACGGAGCATTGAGTGTTGGACCCTT 

CTGGTCGGTCCTGCTCGCCTGCTGTTGTTGGCAGCAGTCTGTGCCGTGGCCACGCCCCCGTCAAACGGCAGCTTCCCACCACCGCCCCCGGAAAC 

GGATGCAGCTGCGCTGCAAGCCGTGCGGCGTCATTTGTGTGGGCGCTTTGGAAAATGCGGCACAGGCGCGCAGGAATCGCCGGAAGAGCGGAAGT 

GCGACGCCTTCCAGCGCAACAATTGCAACCAAAGTTGGGTACTGTGGGATAAGTTCGAACAAAATGGGTCAAATATGAGGAATCTATGAGCAATA 

GAGTTTATGTTGCGCCTTATCAGTCGAAGATTTCAGGATCAATAAGCTTTTAAAAGCTATTTATTGTTTCCTAAAATTATTATTATATCTAAACG 

TTGAAAAATATATAAAAAGTACATTGAAATTGACAAGGCTAACGCGTTTTTGGTGATTGATTTTTAATGTATACATTATGGTAGCTCGTCAAGCA 

TGCCAAAAATAATTATTTGATGACCTCAGTGTGTAGAGTTGGGAAATGAGAACTGCCATGAATTGTGGGTCAATGATTGTTTATTTATATCAATT 

ATAAAGAAATATAGAAATATAAAATTGTTCAAACTTTTTGTTCCATTTGTTCCACTTTTCTAAAGAGCGCCTGTTAAGATACTCCCACTGTGCGA 

CTTGGCACGACTTGGCACCCTGCACCCTTTTATGACAGCCTGTTGCACGTTGAACTTTGCTCCCACGCGTTTGGATGCTCCTCCCCGCGCCGCCC 

CCTTTGTAGGCAGTGTGTGTGTGTGTGTGTGCAAAGTGTGTTCGATGATGGCCAGCTGTTTGCTGCAAGGTGCGTGGGTCGCACCACAGACACTT 

GTGCCAGGAACGCAGGAGTGGAAGTAATCGCCCCCTTGAAGGGAATGGCCAGACGGGAGGAGTGTCTAGCGGAGTGGAGGGGAAGATGCATCTCC 

GCATACCCGCATCTGTAGCAGTGCGAGCAGCTGCGTTTCCAACATTTATCGGGGATGCAGCAGCAGCTTTTCGGGCGCATGTCCACAATGAAGAG 

AGCCGGCGAGTTCGATTTGATTAAGGGCCACTCCGATAAAGGGGTTCCAATGCTTCCAGTTACATGCTCATATATATGAACATATGTGTGAATGG 

ATATGGCTCCGCCAGTCGAAGTGCAGTAAGTCGAATCACCCTGGTGCGGAACTCCTTGCGCATTCCGGACTAACTGCCTAGCATCCGGACCCACC 

ATGCAAGCATGATTAGAAGAAGGGCACAACTCGGCATCGCCGTGTTCGCTACTACTCCTTCGATTTTCCAGCTACGCGTGGTTTCGGATGCTGCC 

GATTTTTAGCCACCAACTTGACATTCATGCTCGCCTGTTCGCTAACAGATAAAATCGATAATGTGCGAAATGGCAATGCCGCTTTACTCAAATAT 

TTGGTGAGAGTTCGGATTCAGATTCGGATTCGTTACGTTGGTTGGGGGATCGGACTTCGTTTGGTTATGACCTAAGCATAAAATATAGGGGATGT 

TGGCTGAAGGCAAAACGAATTATTTAGTTGTGTGCTTGATTTACAAAGCGCCAACAGGAGCGCAGAAATTTTATTCTACTTATTGATTTTAATTT 

GTGGTAGTGCTGGATTAAAGCACTGGGAAAAATGTAACCCACATTAAACCCAAGTGTTGCCTTATTTGTATGAGTTAATATATGATTAGTGTAAT 

ATTTTCGGATGTAATAGAACTTCATATCGTTTCTGCTAGCCCATGCTGGCATTGCCAAACTGAATCGATTATTCAGATGACCTTCTTTTCGCATC 

ACAGTGAATTGTCTGATGGGAAGCCAGAAAGCGTGAAAAGCGATCGGCGCCTGGCGGAACTCAGTTCTGTTTGTGGCAACTAGTGTGGGCGGGGA 

AAATCTCTTCCTGGCTGGCTGGCGAATAAATTAAGCAAATGATAAATAGATTTCTGAAACAACCCTGCCTCGTACTCATTATTCGTAACGCCCAC 

TTACTGCGCTCAATTATTGTCATTGCGAAGTGTTTGAAGTGCTTTCAGGCGTTTGTTTGGCAAGCAAAAATCATTTAGCTCTTCAGGGCGGCAAA 

AAACTCCTCCATTGGATACATCTTCTTCGCATATTATCCACAAAGCGGTTTCAATGCCTTTCGAAAGTTGCCCCACACACTGTCAAGTGTTTTTC 

TATACCCGTGGTGAT?LATTATGTACATCTTAAAGAAATAAATGCAAGTTAACTGCACCGGCAATTTGCATACTAAAAGTTCATTAATATCCACAA 

TCGATGTGGATGATGTTGCATAAATTAAAAGAAATGCTAGTCGTTAACGTAAAGATTAGTTGAAATTATAAAAGTTGTGGTTAAAAGTTTTAGCA 

T ATT ATTTTCAATTCAGTCATTTACATACATATGCGTCGTAATAAACCTT AAAAT ATTTATTTTACAATGCACAATGTAAACAATGCCTTGTTTT 

TGGTTGCTTTATGTTGGAAAAGAAAGGCATTGTTTATACTTATATAGATAAATATGTTTATTTATTTAACTTATAATAAACCCAACCGAACTGTA 

CTGCTTATGGCAGTTCCTTGATTTGGAACCTGATCCCAACGCCAACTCAGAGTTTGAAACGTGGGCGCAACTTGTTTTCATATCGGGGGGGTCTT 

TGAACGCCGACGAAAACAACACGCTGCACATTTTAAAGCGCAAGCATAAATCATGTAAACAAGAACAACGAGAATCAAAGGCGGAACTCCCCTCT 

CTGTCGCAGTCATTTTGGGTGTGGTCAGTGGCCCCAGTTTAATGCCACTGAATAATTCATTCAAGAAAAGCCATTTCCCTGTCGCCGGCGGATTG 

ATGAATATGCATCGTGTGGCCAGCACCGAGGCCTCCTTCAGTTTTTCAGGTTATTTGGCAAAAAAAAAACCACACAAGACAAGAGTCAGAGCAGG 

AGCTCAAGAGTCAGCTTATACAGTAAACATAACTTAAGGCGGAATATGACATGAAAATACGTATTAAAAATAAGTGGTAGCTTGGTTGAAGTAGT 

TGTTGGTCTGAATAGTTTTATTCATTTATACAATGTTGCAACTCAATGGTGTTTACTTGACTACTTTCCATTACGTTCCCCTGAGGAAAGTGCAG 

CTCCGTTAATGTCACAGTTAAAGAAGTCGTACTGTACTTAATGGGCGACCTCTCCACCCTTTTCTTTTGCTGATTCCCTGTGCTTGGTCGAGTTG 

GTTCGGTTCGGTTGGTGGTTTTCTCGGTTCGATTGCACACGAGTGCGACTACGATTATTTTCAGCTGTGCGGTTTCTTACCGCTGTTGTTCAGTT 

TTTTCTTTCTTTCTTCTCTCTTTTTTTTAATTTTTGTAAGCTTTTCTGCTTTATTTTCTGTTCGGTTCCATTTTTCAAGAGAGAGCGAAGAAAAG 

TGTGTCGCACTCTGCAGCTTCTGCTGCATTTCTGGTGGCCCAGCATCGGCAGAGTCTTTAAAATCCGGGTTTGTGAGGCGAGCTTGGGTTCCGAG 

CCAGCCTTTCGGTTTGAGTTTCACTATTTGGGTCAGCGATGATGAAAGCGCCAGCGCCGTTCGTGATGTTGATGATGACGATGGGGCTTAATGAT 

CACCACGCTCGCAAACTATTTTCCGCGCTGTAACTAAACGAGCGCGCGTCTGTTTGTTGTTTTTAAGTTTTACAGCGCCTTTGGAACAGGCGCTC 

AGT TT T TC A AG C AGC GC TGC T GC AC T T AC AGT T G C AC TTGCTCGC T T AC AAT G AAAAAAC G AC AG C GG C AGC AAAAG C AA C G AAAC AAC GGC AAC 
AGAAGCAGTGGGCAAGCAGTTCAATGCACACAACAACAGTAGCCACCCGTACATGTGTGCATGTGTGACTTGACTGCTTTGTAACGGCAGCTTCT 
CCGACTTGGAGCGCACTCACACTCAAACACTCACTCGCGCATCGGCCATCGATGGGGCTTATATTTTCTTTCCGACAGTCCCATCTCTCTCACAC 
TCATCACCTCGCCTGCTCCCGCTCGCATCATCTGTCAAACAGTGCTGCCGAAAGTAGCTGCATGGAACCTCAAAAGTAGCTAAACTCTTTTAAGA 
CTAGCTTTGGCATTTCGAGCTTTGATATTTGCTATTTTTATTTATTATTATTATTATTTTGATCCGTAGATCCATTTTTCACCTTCGTATTAATA 
GTTCAACCGCTAAAACAGTGAACGCATTT AT CATT AAAT T AAT TTCC AAC ACTGTAAGCC AAAT ACTCGATTAAATTGT AAT C AAT AAGT AT TT^ 
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ATAAATAGTAAAGCAATAGTAGTTTTGTACTGATGTGTTTTATCTTAAACCTATCGTTATGCAATTCAAAAATGCTAAAAATACAATAAAAGCAC 

CATAAAAAGCGATTGGCTTTTGCGTGTTTCCGTCGGCGTTTTTCTTGGTAATTGTAATTGCGTTTTAGTTGTTTTGCTCGTTCGCTTCGCACTGC 

GCACTTGTGTGCAGTAGTTTTGAGTCCCCACACACAGACACACATGCGCCCCACTAAGTTTTCATCGGGTTTGTTTTGCTCGCTGCTTCGCTTTT 

GTTGTTGCCAATTTTTGCGCAGCTACGGCGGTCTTCGTCTTTTTATAGTTTTTGTTTTTCTCTCCTCCATTTGTTTTGCACATTTCTCTCTGTGA 

GCTAGAAACCTGTTTTCCCAAGGTGTTTGTTTTTGCACGAGTCGTTTTATTTTGAGGCTATATTATTATTGTGCTTTTATCTAGTCGCAGTTGTT 

TGTCTGGGACAGCTGGCTGGGGGCCACGCCCCCATTCGCCGCCACCGCCCCCTTGCTCGCCTTTCCTTCTATTTTTAGCCAGTCGCCTAAACAAC 

CTTTAGAATTCTTTTTGATTAGCACACAAAATACGTATTATTTTGTTTCTTATGAATTGGTTATACTTGTTGTGAATTTATTACTGTATCTCGAT 

GGTTAATTGGCGTTGTAACTTGGCCATTACCCCCCTTAGCTTATCTGTTTTAACCCTTGAGCGGCAGATAGCAACCAGTCGAAGCCGAACGATCC 

AGATTCCAGTTCTCCTACCCTGGTAGCGTCCAAGTTCCATTGAACTCGGGTTAGGAATGTCTATATAGTATATGTATACACACATTGCGAATGCA 

TGAAAACCAGTTGCACTCCGCGTTCTAATCTCGCTGTCCATTAAGAAGCGTCTCCGGCATCGCTGATTTATGCAAACCATTGACTTTACACACTG 

CCCGTCAAATACTTAATGTTTACCCGAAGTGAATGGGTACTAAGGTCATTCGTAAATGGATCCCATTTGTGGCAAGAAAAGTTGCACAACTTTAT 

TCAAACGATTATTTTACCACATAATCCTTACCTATTACATCTACTTTTAGCATATATGTATTATATTAACTCCCTATTCATGGTATCCTGATCTT 

TTTCCTTTCCTTTTAGGAGTACGACATCTCCAAAAAATGCGAATACCTGTTTAAGCTCGACTACAGCCTAATTGAGCTGGATAATACGAACGGAT 

TGCTCAGTCCGCGATATCCTGGCCGAATACTCATCCCGGAATATGAGCACGGGCACATGGCCAAGACGCTGGTACCGGGAAATGGACTCTTCGGG 

CAAGTTGGTGGGGGAGTGGGAGGTGGAGGCTCCTCGGGAACAACCGCCACTGCCACGCCTCTGAACAGCAGTGCAGGAAGCACCGGAAGTGAGGG 

TGTGGGCATCCAAGCCTTTGTGACCTTTGCCAATCCCCTGCAGACGCAACAACAGCATCCGCTCCAGCAACAATATCCCTCGCAGCAGATGCATC 

CCCTCCACGCGCAATATCCCTCCCAGCAGCCACATCCACTCCAGCAGCAGCAGCAGCAGCCATCGCAACAGCAACCAC7VAAATACGATATACGAG 

GATCAGTATGATATCCAGCGAATGCGGGAATTGGTAACGATGGCCAAATATGCGAGATGCCGTCAAAGATTCGCCGTGCCTGTGATTATGTATCG 

CGGAAAGTACATATGCCGCTCTGCCACGCTATCCGTCATGCCAGAAACCTACGGCCGAAAAGTGGTGGACTATGCCTACGACTGCCTGAGTGGCG 

GCAATTACACCGCGCCAAACGGAGAAGAGAACGATGCTGACTCCACGGACGAGTCGCTGATCACCCACATGCACGACCAGGCGCAGTCGCAGTTC 

AGCTACGACGAAGTCATCAAGAGTGACATCCAGCTGCTGCATACGCTCAATGTCTCAACCATTGTGGACCTCATGGTCGAAAACCGCAAAATCAA 

ATACTTCATGGCGTGAGTACACTTTAAGATACCGAGTACAAAACAAAAACTGACCACCTGTTTTTTTTGTTCTTTTAGCGTTTCCTCGTCAGAGA 

AAGCGGATCCCAACAAGCACTATAAGAGCTTTAACCTTCTATCCCTGCCGTATCCGGGCTGTGAGTTCTTCAAAAAGTTCCGGGACAATAATTAC 

ATGGCTCGCAACCTGCACTACAACTGGAAGCAAACGTTCAACGATGCGAATATCAACATTCCCAACATGGGACCCGCTGCGGATATCGATGTGGC 

GTGGTCGGAGTACCGGGATTGGGATCTGGTGGCAATCACCCAAAACTATTTGAGAGCTACACTGAAATACGTGCAAGAGGAAAACTCCGGCCTGC 

TGATTCACTGCATCAGCGGTTGGGATCGCACGCCACTGTTTGTCTCCTTGGTCAGGCTGTCTCTGTGGGCAGATGGACTCATCCATCAGTCGCTG 

AACGCCATGCAAATGGCCTATTTCACACTGGCCTACGACTGGTACCTGTTTGGCCATCAACTTCCCGATCGCCTGAAACGAGGCGAAGACATCAT 

GTTCTTCTGCTTCCACGTGCTGAAGTTTATCACGGACGAGGAGTTCAGCATTGTGGAGCACCGCAAGCGCACCAAGACATCCAGCAGCAGCGGCA 

GTAGTGTAATAGTAATCAAATCCGATTGCTGCGACGATGAACCGCTCAAGGAGTGAGTATAGCCAAAGACACTAGAATTATAGCCAAGTATTGTA 

AATGAACAGTACCATTATCAGTCCATTTACTTTGAATTGCAGAGACTACATCCTTTCGTTCGATCAAGATAGCAACGACAGCTACTCAAACTGTT 

CCAACTGTGATATGTCCATAACAGATAACTTCTATGCCACGACGCCGGCGCAAGTCAATCCGTTGACCAGCAGGTCGCCAAATCCGAAGAGGTTG 

AACAGCTGAATACTAAAATTAGAAACCTGACTAACTGTGAACTCAACTTGTTTTAGATCTAGAACCAGCCCCATTTCAGTGCCCGGATCAAATGC 

GCGGCAAAGACAGGAGTCTACATCGTCCAATGGTAGCTGGCAGGTGGTTACCGACACGGGTTCAATTGACTCCATGATGAACGGCAGCTACATGA 

TGCGCTTTGTGGCGCAACAGGCAGCCGATGGTGGCGGCTCCTCAAACATTCCTTTATGCAATGGCGGCAATGGTTACCACTGCAGCATCAATGCA 

GCATCGAGTGGCAGTGGGAGCGGAAGTGGTAGCAGTATCAGTAACGGCAGCTCGACGCACGGTTTCGCAAACGGTTCCTCCAAAGACGTAGGCGG 

CAGCACTATGGCCAGCAAGCAATGCATCAACTTGTAAGTTGTAGTCTGATTATTCGACAGCAAATACATAATGACATAATTTTGTTTTACAATTG 

GAGACGAAAGCAACGCCTGAATGCAGTGCGCGCCATTTTTATACAAGCCTACGGCAAGACGATTGGACTGAAATTCAAGGAGGGCTCATCCATGA 

ACCTGGCCACGTTCATTGGGAACCTGGCGGACCAACTGTTTTGAGAAAGGATTAGCAGCTATGACCGTTTGTTAGATTTCATCAACATTGTTTGT 

TTCTTGTTAAGTGCTACGCTTTCAACCAACGACTGTATTTATAGACCTGTTAAATTGCGTTCTTTAGTATCTGTTGACGATTGTGTAGTGTAACG 

GAT TACT G ATT ACCGATAAGCATCGAAAGAACCAGGCGATCGTAGGCAACGAGCAAACAAAATCCTGTCGTCCTTCTGCCCCGGCTAAACCGCTC 

TCCTGTTCTCTAGGTTCTCCCGCTCCTTGTTTGTTTAGTTATTAGTTCTTCGAGAATGCAAAGCAAGTGAAATATAAAATGCTATTTGGATTATA 

CAATAGTTTTAAGTTGTAATTTATAGTAATGTATTTTTTAATAAACTTATGTGCATTTTATACGAAAGAAAGTCATGTTCTCAGCGTAGGCCGCC 

TTGAGCGCCGTATTCTAGCTAAGGGTAAATAATCATCAATTTCCACTCGCTTCAGCCGGATGCGACACCTTTTAATGTAGTATTTCGCGCTTTTA 

AGAGGAACGAAGCGTGGTGTTTCTATCGGTCGTTCGTCACATTTTGTTGTATCTTTAATTTTAGGTTCAATAGCTGTTTCTATTTTGATTTTGAC 

ATCATTTGATCCTTTGCAAGCATTATTATATCTGATTCTTTTATTGGCACTTGCGACATTGGGTCTTATTTTGTGTGGGCTAATGCCGAATCTGT 

CCATGGCCGGACTGAGTAAATTGTCTGCTGGGAGCTCTTTCACCCGTATCGCTTTAAGAAATTCGTCGTATTCGCTCTGAACCGTCTCCAGGTCC 

TTAGGACCCAGCTTTCGAGTAGCTTTCTCTTGGCGCTCTTGCTCCAGGATGAGGGCGAGGACCTCCACTTCTAAATGCCGGCGTTCCGCTTGCGC 

CATAGATTTCACGAAAGGCAACTGACTGAGAAGAAACTCCTGACACTTGGTTTTCGGCACATCCGCCGGATTCGGGGCTTGCATGCGAATGTGCA 

CATCGTGCATCACCCGCCACAGAGGTTCAGGGATTTGCCCTTTGGAACCGGCCAAGGCTCTGACCAGCGGGCGCAGCTTCACCCAACGCAGCTTC 

AGTTCACTTACGGAGAACGGAGTACTCAGCTCAACGTGCTCGTAGCTTTGGTTAAAGGACTTGGCCAGCTGATCCCAACCCCAGGAGTCGTAGTC 

AACGTCCATTTTGGGCTCCACCTGCGGGATGATAAGGAAGGGATACCGGGCGAGCATCTCCATGAGCAGCAAGTCGTTTAAAACAATCTGGCCGC 

TGATGTTGGTGTTTGATATGTTAATCTCCAAATGGGGAACAGTGGAATGGATCACCGTTTCCTCGGCGCTGTCGGTACTCATTTTAATATTTTTC 

GCACTGTGTTTCTCCAACG 

(SEQ ID WO; 724) 

Exon: 1001.. 1581 

Exon: 10942.. 11732 

Exon: 11859.. 12497 

Exon: 12583.. 12726 

Exon: 12787.. 13143 

Exon: 13209.. 13744 

Start ATG: 1465 

Transcript No. : CT20377 

CTGTTTGACGTTTGTCGCTTTCCTCCGTTGCGAATATAATATATTACGTAGCTCATTTTTATACAAACGGAATTACGAGCGCAACGACGACAGCA 
ACACTAGTAGCACTAATCGTAAGCGCAGGGGCCAAAAATTAAATTGCGTTTGCGGCCGCAAAGATTTGATGACGTCGCATACGCCGTCTTCTAGG 
GCGTAAAAAGCAAAGCAAAGCAAACAAACGCGAAAGCGAAACGTGTAAACGGCGTAGAAGCGATAAACGCGACTCAAATACGCAGCAGATAAAAT 
ACAATACGCGAGAAGAGAAAAGTCACGGGAAATATTGTTCATATTCGGCGTCTTTCTGCGAGCGTAAACGTGTGTGCGTGGGCTTGTGCTTTTGC 
CAGTGTCGTGTGTGTTAAGTGCCTGTGTGTGTGTGTGATTAAGAAGATATAAAGGAATATAACGGTAAATGCAGCGCCGCAAAAATGTGCAGCCT 
GACGCCAAACCACATGGTAAACGTAACGCAGCAGCACCTACACGATCTATTGGAAACTTTCGAAAAAAAGTCCTTCGAGGCGGCGGCCTTTGAGG 
AAGGGACGGCGGAGTACGACATCTCCAAAAAATGCGAATACCTGTTTAAGCTCGACTACAGCCTAATTGAGCTGGATAATACGAACGGATTGCTC 
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AGTCCGCGATATCCTGGCCGAATACTCATCCCGGAATATGAGCACGGGCACATGGCCAAGACGCTGGTACCGGGAAATGGACTCTTCGGGCAAGT 

TGGTGGGGGAGTGGGAGGTGGAGGCTCCTCGGGAACAACCGCCACTGCCACGCCTCTGAACAGCAGTGCAGGAAGCACCGGAAGTGAGGGTGTGG 

GCATCCAAGCCTTTGTGACCTTTGCCAATCCCCTGCAGACGCAACAACAGCATCCGCTCCAGCAACAATATCCCTCGCAGCAGATGCATCCCCTC 

CACGCGCAATATCCCTCCCAGCAGCCACATCCACTCCAGCAGCAGCAGCAGCAGCCATCGCAACAGCAACCACAAAATACGATATACGAGGATCA 

GTATGATATCCAGCGAATGCGGGAATTGGTAACGATGGCCAAATATGCGAGATGCCGTCAAAGATTCGCCGTGCCTGTGATTATGTATCGCGGAA 

AGTACATATGCCGCTCTGCCACGCTATCCGTCATGCCAGAAACCTACGGCCGAAAAGTGGTGGACTATGCCTACGACTGCCTGAGTGGCGGCAAT 

TACACCGCGCCAAACGGAGAAGAGAACGATGCTGACTCCACGGACGAGTCGCTGATCACCCACATGCACGACCAGGCGCAGTCGCAGTTCAGCTA 

CGACGAAGTCATCAAGAGTGACATCCAGCTGCTGCATACGCTCAATGTCTCAACCATTGTGGACCTCATGGTCGAAAACCGCAAAATCAAATACT 

TCATGGCCGTTTCCTCGTCAGAGAAAGCGGATCCCAACAAGCACTATAAGAGCTTTAACCTTCTATCCCTGCCGTATCCGGGCTGTGAGTTCTTC 

AAAAAGTTCCGGGACAATAATTACATGGCTCGCAACCTGCACTACAACTGGAAGCAAACGTTCAACGATGCGAATATCAACATTCCCAACATGGG 

ACCCGCTGCGGATATCGATGTGGCGTGGTCGGAGTACCGGGATTGGGATCTGGTGGCAATCACCCAAAACTATTTGAGAGCTACACTGAAATACG 

TGCAAGAGGAAAACTCCGGCCTGCTGATTCACTGCATCAGCGGTTGGGATCGCACGCCACTGTTTGTCTCCTTGGTCAGGCTGTCTCTGTGGGCA 

GATGGACTCATCCATCAGTCGCTGAACGCCATGCAAATGGCCTATTTCACACTGGCCTACGACTGGTACCTGTTTGGCCATCAACTTCCCGATCG 

CCTGAAACGAGGCGAAGACATCATGTTCTTCTGCTTCCACGTGCTGAAGTTTATCACGGACGAGGAGTTCAGCATTGTGGAGCACCGCAAGCGCA 

CCAAGACATCCAGCAGCAGCGGCAGTAGTGTAATAGTAATCAAATCCGATTGCTGCGACGATGAACCGCTCAAGGAAGACTACATCCTTTCGTTC 

GATCAAGATAGCAACGACAGCTACTCAAACTGTTCCAACTGTGATATGTCCATAACAGATAACTTCTATGCCACGACGCCGGCGCAAGTCAATCC 

GTTGACCAGCAGGTCGCCAAATCCGAAGAGATCTAGAACCAGCCCCATTTCAGTGCCCGGATCAAATGCGCGGCAAAGACAGGAGTCTACATCGT 

CCAATGGTAGCTGGCAGGTGGTTACCGACACGGGTTCAATTGACTCCATGATGAACGGCAGCTACATGATGCGCTTTGTGGCGCAACAGGCAGCC 

GATGGTGGCGGCTCCTCAAACATTCCTTTATGCAATGGCGGCAATGGTTACCACTGCAGCATCAATGCAGCATCGAGTGGCAGTGGGAGCGGAAG 

TGGTAGCAGTATCAGTAACGGCAGCTCGACGCACGGTTTCGCAAACGGTTCCTCCAAAGACGTAGGCGGCAGCACTATGGCCAGCAAGCAATGCA 

TCAACTTACGAAAGCAACGCCTGAATGCAGTGCGCGCCATTTTTATACAAGCCTACGGCAAGACGATTGGACTGAAATTCAAGGAGGGCTCATCC 

ATGAACCTGGCCACGTTCATTGGGAACCTGGCGGACCAACTGTTTTGAGAAAGGATTAGCAGCTATGACCGTTTGTTAGATTTCATCAACATTGT 

TTGTTTCTTGTTAAGTGCTACGCTTTCAACCAACGACTGTATTTATAGACCTGTTAAATTGCGTTCTTTAGTATCTGTTGACGATTGTGTAGTGT 

AACGGATTACTGATTACCGATAAGCATCGAAAGAACCAGGCGATCGTAGGCAACGAGCAAACAAAATCCTGTCGTCCTTCTGCCCCGGCTAAACC 

GCTCTCCTGTTCTCTAGGTTCTCCCGCTCCTTGTTTGTTTAGTTATTAGTTCTTCGAGAATGCAAAGCAAGTGAAATATAAAATGCTATTTGGAT 

TATACAATAGTTTTAAGTTGTAATTTATAGTAATGTATTTTTTAATAAACTTATGTGCATTTTATACG 

(SEQ ID NO: 725} 

Start ATG: 465 

MCSLTPNHMVNVTQQHLHDLLETFEKKSFEAAAFEEGTAEYDISKKCEYLFKLDYSLIELDMTNGLLSPRYPGRILIPEYEHGH^KTLVPGNGL 

FGQVGGGVGGGGSSGTTATATPLNSSAGSTGSEGVGIQAFVTFANPLQTQQQHPLQQQYPSQQMHPLHAQYPSQQPHPLQQQQQQPSQQQPQNTI 

YEDQYDIQRMRELVTMAKYARCRQRFAVPVIMYRGKYICRSATLSVMPETYGRKVVDYAYDCLSGGNYTAPNGEENDADSTDESLITHMHDQAQS 

QFSYDEVIKSDIQLLHTLNVSTIVDLMVENRKIKYFMAVSSSEKADPNKHYKSFNLLSLPYPGCEFFKKFRDNNYMARNLHYNWKQTFNDANINI 

PNMGP AAD I D V AW S E YRDW DL VA I T QN Y LRAT L K Y VQE EN S GL L I HC I S GWDRT PL F VS L V RL S LWAD GL I HQ S LN AMQMA Y FT L A Y DW YL FGHQ 

LPDRLKRGEDIMFFCFHVLKFITDEEFSIVEHRKRTKTSSSSGSSVIVIKSDCCDDEPLKEDYILSFDQDSNDSYSNCSNCDMSITDNFYATTPA 

QVNPLTSRSPNPKRSRTSPISVPGSNARQRQESTSSNGSWQVVTDTGSIDSI^NGSYMMRFVAQQAADGGGSSNIPLCNGGNGYHCSINAASSGS 

GSGSGSSISNGSSTHGFANGSSKDVGGSTMASKQCINLRKQRLNAVRAIFIQAYGKTIGLKFKEGSSMNLATFIGNLADQLF* 

(SEQ ID NO: 726) 

Classification: protein_phosphatase 
Gene Symbol: BcDNA : LD23181 
FlyBase ID: FBgn0027506 



Celera Sequence No. : 142000013384666 

TGCGGCATTAAAAAGTCGTTCGTTACCTGTCGGAGCTAGCGGAACTCGATGGGTTGTAGAACTTGATTTGTTTCGTCTGGGTCACAGACATTTCG 
ACAAGAGAATAGTGATTGCAGCAGCCAGTTGCAGCGATTAGAAAAAAACACGCGGTGATTTACTTCGTCGTGCCTCTCCCTCTCCGAGCAAGTGA 
CTGATTCGTTTCTGAAGGCATTGCGTTGGAAAAACGTGCGCCGCCTTCGTATACATACATATGTATACATCTACCCGAGCTTGAATGGTCAAGTA 
AAGTGCTAATTGTCAGCTGGGGAAAGGCACTGGAGCGGAGGGACGCTCATTGCCTTGTTTATCTCCATTTGATGCTGTCATTGAGGCATTTATTT 
ATATGTTTTGGGGCAATTCAATTCCATTCGCAAGGCGCTCAGCTGTTTGATTGGGGCGGCGACAGTGTTGCGAAGCGCTGTTGGCTGTTGGGTTG 
CAGTAAATTCGAATGTGAATGGCTAAATCCGAAGCGGGGTTTTTAAATGTTTGATAATTATTACTTCGAGCTTGTGTCAATATAAGTGAATATTT 
TACATAAAACTTTGTGGTTTCTAAAAGATTAGCTTACAACATGATCTTTAGGTACTAACAAATAAAAGGTTGTAAATAGCGGTAGTGCTCGGAAA 
ATATATCAACATGAGACAATTACCAATTCGATTATTTTTTGTTGGTCCTCTATCGATAACTGTTACTCTTACTCTCTTGCACTGCAAGCCAAATT 
CATATTAATTAAGACTATAAAAATAAACACATACAGAAACATGTGGCTGTTTCCTGGCTATCTTGGATGGAGAAAGTAGTATGTAGTAGTATCAT 
AATTTATAGTTTTCTTTCCTCTTGTTCTGTCCCTTAAAGAAACTTTGAATAAATTGCATTTTGTTCAGTTTTAAAATGCATTAAATAATCGATTG 
CCACGCTAGTTCGATAGTATTTAACGAGCACCGAGTATTGTTCGAGCTCTAAAAGATGGTCACTCCGGACCAGAGAGACGCAACAATATAGCAAT 
TCTGAACTTGGTTCGAGTGTTTTTGCAGCACGAATTGATAATATGCCGTAGGTGCGGCCCAGCGCGGATCATCAGCAGCAGCAGCGATGGCCTAC 
ATCAACATCGCCGAGTGGACGCCCGACCAGGTCACCGACTGGATAAAAGGTAATGCTCGAGGTCAAGGAAAGCGCAGCTGAGAAAAAAAAAACTG 
CATCTCGCAGCCACTGCCCCAAAAAGCAGCAGCAACAACAAGTGCAACAAGAATTTCCGCTTTGTTCAAGGTTTCAACAGGCACATCATAATAAT 
TATGTTTTCCTGATTCTCGCAGGTCTGGATGAGTCTATGAAGGGGTATTTGTACGAGTTCTCCAAGCAGGAGATCGGCGGACGGGCGTTGCTCAA 
CATACGGCCATACGAGCTGGAGAATCTGGGCATGCTGCGCATTGGCCATCAGGAGATCGTGCTGGAGGCGGTGGAGAACTTAAGAAACTTCGTGA 
GACCCCAAACAAATATTGTGACATACAAACCCGTTAATTGAGCTTGATATTCCAGCATTATCACCTGAAAAACGACAATCTGCAGTTCATGGCCT 
TGCATGTGGCCACGGCAGCTAAGAATCTGCATCGCGAACTGGCCAGGAATCATGCGGAGAGCACTAAGATCGATACCCGGATACTGCACGATATT 
ACCAGGACGATAGCTACTCTAAAGCCATTGGTGGGCAGCCTGGAGCGAACACCGTTCCGTAAGCAGGAGATGTACCGCGAGTATTGTGGCAATGT 
GCTCAAATGCGGCCTGGAGCTGGCCACAATTGCCCACCGTGATCGTTTTGCCCTGCAGCCGGTGCCAGCGATTCGACAGTCAGCGGAACGGCTCG 
AGAATCTTGCCAACTTTGTAATCCAGGACATATCTGATCCCATGGTACTGCAGCCAGCCTCTCTGAACCTTGTCACCCTTAAAAAACGCGAATCC 
GAGTTGGGTTTCAACATAGAGTCCAGTTATAATGGCATTCACCGTGTCACGGATATCAAGTACAACTCGCCAGCTCACAATTCCGGAAAAATCGA 
AGATGGCGACGAGATTGTGCAGATCAACTACCAGACCGTTGTGGGCTGGCAGCATCGCACAGTACTGGAGCATCTGAGGGAAGCACTTCCAGATG 
TGGTGCTGACGGTGAAGAAACGGCCGAAGCACACAAAGATGTTTGGACAGATCTATATGCAGCCGTACAGGTTGCCCAGCAAGAAGAGGAATATG 
GCCGCCCGCTGGGCGGCACAGATGCCAAGTCCCCGTGCTGCATTTTTAACCCTGGACACGGAGCAACTGGCAACAGGAGGAACAGGAAGCGCGGT 
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GCCAGATCCCAGTAAATCCCTAAGCAGTGAGAAGAGGGAGGTACTTACCAAGGCCAATCCAGTGTCCTCTGCCTCCGACTCTGATTCCAGTTGCA 
GTGATATTCCAACACCAACGGATCCCAAGCTAGCCGCACGGGAGATCCGTTTGTATTACCCAAAGCCAAGAGCTTTGCTCCAACGAAGAAACACT 
ATTTGTGGATGCGAGTATCTTAGCCTGAAAAACTCGGATCTAGTCGTTCCATCGTGGCACGAAAGAAAGCCAGGAATTGGTTCGCCCACCAATTG 
CGACCCTGGCTCGCCCAGTATACGGGATAAGTCGATATCCTTCGGTTATGGCCTAGAGATGGCAGCAAGGCCGACCACTTGCATTGGGATTGCGG 
GCGATACCTCCACGGACAAGGCCAGGCGAATGTTTCATGAGGCGCGAAAACTAAAGCAGCTGACGGAGGACTCGCAGCGCGAATTCCTCGTGGAT 
CGTTACAAGCCGGGTGTCAGCAAAGTAGTTCGGTTCGACGCCAAGGAAGATTATGTGATGAAGAATGAAAAATTTATCTGTAACGTTGAGAACAC 
CATACTGGAGACCTTTGAACCGATTCCCTTTGCGGATGAGGGAGACGAGGACGCCTTGGAAACGCTGCGCAACTGCAAAACAGAAAACGCAGAGG 
AGCTCCTAGAGGCAATTAATCTGGCTACAAAAGGTCAGGACTTGCCCCTGGCCGAGGCCATTAACATGCCTTTGTTGCAGCAAGGTCGAAGGGGC 
CGATTAGACAAGAGTCATAGTACTCCGGCATACGATAATTCGGGTGAAGAATCGGATACGCCGCCAGCGATTGAGCCAAGAAAGGAGTTCCTGCT 
TGTCACACCGCCAGCACCGCCACCGCGACCCCGCAAGCAAAGGGAAATGACGCCACCGGCGGTGCCTCCACCTCCACCAAAACCAGCCAGCATGC 
AGCCGGCTAGTAGTATCACCAGTATCTCCATTCCCGTGCCAGTACCTGTACCAGCCGCAGTTGATCCATCGGAGATCAGTGAGCTTCATACGCCC 
AGCAAATCCCGGACACTGACCCTTAAAAAGAAGCACAGCTTAATGGCCAAGCGGCGGAACACAAATCTCAAGCTGTTGGGGACTGGGGATATCCA 
AGGTCATCTGTACAGGCGCAAGAAAAATCATCGCGGAGTGACTTACTGGGCGAGAATCTATTTCGTGATGCTGGATACGATACTGTACGGATTTC 
GCAGTAAGCAGAGTACCAGCGCCAGCTTGGTAATATTTTTACCGGGCTTTACGGTTTCATTGGCAAAAGAAGTGCACTCCAAACCGCATGCCTTT 
AAAGTTTACCACACGGCCAAGAGCTTCTACTTTGCGGCCGAGTCGCTGGACGCACTTAACCAATGGGTGGACTTCTTGCGCCAGGCTTCGCTCAA 
AGTGCCGCCTAGTACGGGTTCAAAAGGCGGTGGTGACGCAAAGGATTTGTATTCGGAGAACGATAGCTCAGGCGAGGAATGCGATGCCCTTGTGA 
TCCAGAATTTAAGCACGCCCTCGCCGCAAGGCAACAAGGAGTCAATGTCCATGTCGATGACACTATCTGGCGGAACACCACCTTCCTCTGCACCC 
ATTAAGCATGAACGCGGCTACTTGGACTCGTTTCGCAAATTTACGAATACATTCAAGAGCAGTGCAGCGAAACCGTCGAGCGACATCCCTGTGCC 
AACGGAGCAATACCGCAGCTATAGGAAGGTGCCAGGCGGAAGTTTTGGCATTCAGATTGGTGCCAATACACCGGGCTACCATGATCCCGCAATGC 
CGCCGACACAGATTCCACCTTTGGTGGGTCCTAAACTGTCGCGTAGCTCAAGTAGGAGTTCGGTGGTCAGTGGAACCGAGTCTGCCGTATCATTA 
TCTGCTTCGATTCTGGGTCCGCCCGCGTCGCCAGCGCCAACGCCCACGCCTACAGCCACTCCCACATCGCTGCAGCATCAGAGCTCCGAGGAGAG 
TCCAAGTCAGAGCCAGAGCCAGAGTCCCAGTAGCAAGTCGAGTTTGAAGAAGGCACCATTCAACTTCTTGCACGCCTCAAATCCGAATTTGGTGG 
AATTCGATTTCCATACCTCGAAAACACTACTCCCCAAGATGAGTGTGGGTAATACACTCGACCATGGCCATAATATACAGGGCTTTGTGACGCTT 
AAGGATCTGATGCTGCGCAAACAGGAGGAGGAGGCCCAGGAAATGTACAACAATCGAGTGCACTTGGGTGTGGAGAAGCACAAGCACGCACGGAC 
GGAATCCACGGCCAGTCAGCAGAGCGCCATGAGCAGCAGCGTGACAAAGCCGCTGGAAAAGTTGCCCAAGATCCAGAGCGTGAGTCTGCCCAAAA 
CACCCGACTACGAGATCAGTTTTAAGCCTGACGACGAGAGTATTAAGAGAACCAGAACTAAGGAAGGTCAGAAGTTGCGCGACTTTGGTTACGAG 
CTGATATGTGGGGATGAGCCGAGCACTAGCTCCATCAGCCGGCAAGAGCACCACCATCATCATCATCATCACCAGCACCACCTGCAGCATGTGCA 
ACAGCAGCAGCAGCTACAACATCAGCAGACGCAGCAGCAACATAGCAGCAAGGCCAAGCACTTCTTGCGCTCACAGCAGTTACAGCTCTCCAGTT 
TCCTGCACAAGAACAGCAGCGGCGGCAGCGGCAAGAGTTCCGGATCCGGTGCGGATCAAAAGAAGTCCAAAGGAAAGTCTAGCGGCGATCGTCGG 
TTTCCCTTTTCCAAGCACTCAACTGGCTCCGCGGGCAGTGGCTGTGGGCCGGGTCATGCGATGACTATGACGCTGCCGCTGAACAAGAAGTCCAA 
ATCGAATCACGCCTTGGATGGAGCCGTAGGCAATGGAGTCACCATAATCGGCAGCGGCAGCAGCATCAAGAAAAGCCAGACGTACAATCAGGATC 
TGCGAGACAAGATAGTGGGCACCAAGTACGATGCGCATCGCAAGAACTCGGCCCCGATACCAATCTTTTCGAAACTCTCCATATCGGGTGGCACG 
CCGGCGAAGCCCTCCAAGGAGAATCGTTTCCTGGGCTCACCGCTGCTGCATCGCACGCTGTTCGGGCACCATCACCATCAGCAGCAGGCAGTAAC 
GCCACCAAGCAGTGCGGATCCCGATTGCGATCAGGAGATCTTCTCCCAAATCACCCTGCCCACGCACAATCAGGCGGGATATCGAAGTTACCGTG 
GACCTGGCATACTAACCACCTCGACCACTAATTTGTGCTCGTCGGAGGGCTTGCAGCCGCCGTTGCCCCCAGCGCCACCGCCTCCCGTCAACACC 
AGCAGGCAGAGAGAGGAGGAGCAAGCGGAGGCAACGATATCAGAGTGTGCGGCCACCCCAAAAGTGTCCAGTTCGGGATCTGTGGATTCGAAGGC 
GGCAACGCCGGACTATCCAAATATGGAGTGTCCGCCTGTCTTTGAGCCGGAAATCTACTCGCTCAGCGATACCAGCTTGTCCCGCATCGTGATGC 
GGACGCCCAGCAGCAGTGGTGCCACCAGTGCCACCGCCAGCACCACCACCACCAACAACAACAACAACAATAATAACACCAATAACAACACCAAC 
ACCAACAGCAGTCCCAGTGGCGGCGAACACCATCAGACCTAGATGTGTGATGAAAACTATTGTGAATGCTCGCGGGGATTAGCCCCCGTTTACTT 
ATGTATTTATACACACGCATATAGATATATATATATATATTATATATAGCGATATTCTCGTACGATTTAGGTATACAACTTGTTCATTTTGATCG 
TCTTCCCTGTCTGTCCACCAGAAACAAACACACGCATACCTGCAATCCCTTCGATTCGAAACCCACTTAGATGTAAATGTTTAATCTGCCTTATT 
TGCATTGTATTTCGGTTCTAGGCAAAATGTTTTACACATGTAGTCACGCAAGCAAAACAATGAATTATAGCGAAAATGGAGAATAACTTTAGAAG 
TCTGTGCATCGAACAAAGACGGTTCTTTATGATTTGCCTTTGTACGCGATTAACCAAGTTAATCCGAGCTGTTTGATTTGCTCAACAGTTTGTGT 
CTAGATTGCGTGTAATATACATATATTATTATGCGAAATGAGATATTATTGTAATGGCATCCAGCGACGCCAATGGAGCCAGAAATGTTGTAGCG 
G AAT GC AAG AAAT ATAT AT AT AC AC AAT T AC AC G AAAT AT AC AT GC AT AC G AAT AAAGT C AAC T GT AC AT AAT AGAAT AAC GAT CTTTTTATTTT 
GGGATTTTTCAAGAAACAGGTTGTTTCATTAGAGGAGGATTGAAAATCTTTTTATGCACGCAACTTATTCAAAGACCTAAGAAATGAGAATCTCT 
GGACAGAAGTGCTGAACCCTGCTTCTTTTTACTAATGTAACCAAATCAACTTGATCAAAACGAGTATTACCTTCTATTATTAGTTCGATTTAGAA 
TCCCTTTCTTTATTCACATTACATTTGAAATTAGATCAATTTAAAACCACTGCCGCTTACGCAATGTTCTTAATGTGCTGCTCCACCTCCTCCTT 
GGTGAACATGTAGAACTCTCTTTCTTTCGTCATGGTCATCACCTCCACATTGGTGGAGTTTAGTTTCTCCTCCATAACCTGTTTAAGTGTGTTGA 
GCGAAATGTCGATAGCCTCATCGAGAGTCAAATCGGGTCTAAATAAGTCCTGCAGATTCTGCTGAGCACCTTCGCTGCCCGAGCCAATGGCCTTG 
GCTCCGTGGCGCACGAATGTGCCGGAGGGATCCATGTGCCACAACTGGGGTTGTCCCGCCTCGATGCCGGCAAATAGAATGGCCACACCAAAGGG 
ACGACTCATGGCGGCGGCACCATCGCTGTCGCCACTATCACCGAACTGGATGGCCAAAGTGGACACAGCCTGGGCGCAGGACTCGATGGACATGC 
GCTCGTTGTAGACGAACCAGTGGTTCTGGCACTCCACGCGTGCCCTCTCGATCAGAGTCCTGGCATCGGCCATCAAGCCGGAGGTGGCGCAACCA 
ATGTGCTTGTCCACCTCCACAATCTTCTCCACAGTACTGGGTACCATTAGCGGCGAAGTGATGCGCTTCTCCACCGCCAAAACCACTCCTGGAAT 
GCAATTTTACCATTAGATCTCCGCGTGTACTTATGGCAGCAAAGAACATACCTTCTGGCGTGCAAATTCCAATTGCTGTGGAGCCCAATTTGATG 
GCCTCAATGGCATATTCCACTTGGAAGAGGCGGCC 
(SEQ ID NO: 727) 

Exon: 1001.. 1189 

Exon: 1353.. 1516 

Exon: 1576.. 6635 

Start ATG: 1132 

Transcript No. : CT20428 

AAAAGATGGTCACTCCGGACCAGAGAGACGCAACAATATAGCAATTCTGAACTTGGTTCGAGTGTTTTTGCAGCACGAATTGATAATATGCCGTA 
GGTGCGGCCCAGCGCGGATCATCAGCAGCAGCAGCGATGGCCTACATCAACATCGCCGAGTGGACGCCCGACCAGGTCACCGACTGGATAAAAGG 
TCTGGATGAGTCTATGAAGGGGTATTTGTACGAGTTCTCCAAGCAGGAGATCGGCGGACGGGCGTTGCTCAACATACGGCCATACGAGCTGGAGA 
ATCTGGGCATGCTGCGCATTGGCCATCAGGAGATCGTGCTGGAGGCGGTGGAGAACTTAAGAAACTTCCATTATCACCTGAAAAACGACAATCTG 
CAGTTCATGGCCTTGCATGTGGCCACGGCAGCTAAGAATCTGCATCGCGAACTGGCCAGGAATCATGCGGAGAGCACTAAGATCGATACCCGGAT 
ACTGCACGATATTACCAGGACGATAGCTACTCTAAAGCCATTGGTGGGCAGCCTGGAGCGAACACCGTTCCGTAAGCAGGAGATGTACCGCGAGT 
ATTGTGGCAATGTGCTCAAATGCGGCCTGGAGCTGGCCACAATTGCCCACCGTGATCGTTTTGCCCTGCAGCCGGTGCCAGCGATTCGACAGTCA 
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GCGGAACGGCTCGAGAATCTTGCCAACTTTGTAATCCAGGACATATCTGATCCCATGGTACTGCAGCCAGCCTCTCTGAACCTTGTCACCCTTAA 
AAAACGCGAATCCGAGTTGGGTTTCAACATAGAGTCCAGTTATAATGGCATTCACCGTGTCACGGATATCAAGTACAACTCGCCAGCTCACAATT 
CCGGAAAAATCGAAGATGGCGACGAGATTGTGCAGATCAACTACCAGACCGTTGTGGGCTGGCAGCATCGCACAGTACTGGAGCATCTGAGGGAA 
GCACTTCCAGATGTGGTGCTGACGGTGAAGAAACGGCCGAAGCACACAAAGATGTTTGGACAGATCTATATGCAGCCGTACAGGTTGCCCAGCAA 
GAAGAGGAATATGGCCGCCCGCTGGGCGGCACAGATGCCAAGTCCCCGTGCTGCATTTTTAACCCTGGACACGGAGCAACTGGCAACAGGAGGAA 
CAGGAAGCGCGGTGCCAGATCCCAGTAAATCCCTAAGCAGTGAGAAGAGGGAGGTACTTACCAAGGCCAATCCAGTGTCCTCTGCCTCCGACTCT 
GATTCCAGTTGCAGTGATATTCCAACACCAACGGATCCCAAGCTAGCCGCACGGGAGATCCGTTTGTATTACCCAAAGCCAAGAGCTTTGCTCCA 
ACGAAGAAACACTATTTGTGGATGCGAGTATCTTAGCCTGAAAAACTCGGATCTAGTCGTTCCATCGTGGCACGAAAGAAAGCCAGGAATTGGTT 
CGCCCACCAATTGCGACCCTGGCTCGCCCAGTATACGGGATAAGTCGATATCCTTCGGTTATGGCCTAGAGATGGCAGCAAGGCCGACCACTTGC 
ATTGGGATTGCGGGCGATACCTCCACGGACAAGGCCAGGCGAATGTTTCATGAGGCGCGAAAACTAAAGCAGCTGACGGAGGACTCGCAGCGCGA 
ATTCCTCGTGGATCGTTACAAGCCGGGTGTCAGCAAAGTAGTTCGGTTCGACGCCAAGGAAGATTATGTGATGAAGAATGAAAAATTTATCTGTA 
ACGTTGAGAACACCATACTGGAGACCTTTGAACCGATTCCCTTTGCGGATGAGGGAGACGAGGACGCCTTGGAAACGCTGCGCAACTGCAAAACA 
GAAAACGCAGAGGAGCTCCTAGAGGCAATTAATCTGGCTACAAAAGGTCAGGACTTGCCCCTGGCCGAGGCCATTAACATGCCTTTGTTGCAGCA 
AGGTCGAAGGGGCCGATTAGACAAGAGTCATAGTACTCCGGCATACGATAATTCGGGTGAAGAATCGGATACGCCGCCAGCGATTGAGCCAAGAA 
AGGAGTTCCTGCTTGTCACACCGCCAGCACCGCCACCGCGACCCCGCAAGCAAAGGGAAATGACGCCACCGGCGGTGCCTCCACCTCCACCAAAA 
CCAGCCAGCATGCAGCCGGCTAGTAGTATCACCAGTATCTCCATTCCCGTGCCAGTACCTGTACCAGCCGCAGTTGATCCATCGGAGATCAGTGA 
GCTTCATACGCCCAGCAAATCCCGGACACTGACCCTTAAAAAGAAGCACAGCTTAATGGCCAAGCGGCGGAACACAAATCTCAAGCTGTTGGGGA 
CTGGGGATATCCAAGGTCATCTGTACAGGCGCAAGAAAAATCATCGCGGAGTGACTTACTGGGCGAGAATCTATTTCGTGATGCTGGATACGATA 
CTGTACGGATTTCGCAGTAAGCAGAGTACCAGCGCCAGCTTGGTAATATTTTTACCGGGCTTTACGGTTTCATTGGCAAAAGAAGTGCACTCCAA 
ACCGCATGCCTTTAAAGTTTACCACACGGCCAAGAGCTTCTACTTTGCGGCCGAGTCGCTGGACGCACTTAACCAATGGGTGGACTTCTTGCGCC 
AGGCTTCGCTCAAAGTGCCGCCTAGTACGGGTTCAAAAGGCGGTGGTGACGCAAAGGATTTGTATTCGGAGAACGATAGCTCAGGCGAGGAATGC 
GATGCCCTTGTGATCCAGAATTTAAGCACGCCCTCGCCGCAAGGCAACAAGGAGTCAATGTCCATGTCGATGACACTATCTGGCGGAACACCACC 
TTCCTCTGCACCCATTAAGCATGAACGCGGCTACTTGGACTCGTTTCGCAAATTTACGAATACATTCAAGAGCAGTGCAGCGAAACCGTCGAGCG 
ACATCCCTGTGCCAACGGAGCAATACCGCAGCTATAGGAAGGTGCCAGGCGGAAGTTTTGGCATTCAGATTGGTGCCAATACACCGGGCTACCAT 
GATCCCGCAATGCCGCCGACACAGATTCCACCTTTGGTGGGTCCTAAACTGTCGCGTAGCTCAAGTAGGAGTTCGGTGGTCAGTGGAACCGAGTC 
TGCCGTATCATTATCTGCTTCGATTCTGGGTCCGCCCGCGTCGCCAGCGCCAACGCCCACGCCTACAGCCACTCCCACATCGCTGCAGCATCAGA 
GCTCCGAGGAGAGTCCAAGTCAGAGCCAGAGCCAGAGTCCCAGTAGCAAGTCGAGTTTGAAGAAGGCACCATTCAACTTCTTGCACGCCTCAAAT 
CCGAATTTGGTGGAATTCGATTTCCATACCTCGAAAACACTACTCCCCAAGATGAGTGTGGGTAATACACTCGACCATGGCCATAATATACAGGG 
CTTTGTGACGCTTAAGGATCTGATGCTGCGCAAACAGGAGGAGGAGGCCCAGGAAATGTACAACAATCGAGTGCACTTGGGTGTGGAGAAGCACA 
AGCACGCACGGACGGAATCCACGGCCAGTCAGCAGAGCGCCATGAGCAGCAGCGTGACAAAGCCGCTGGAAAAGTTGCCCAAGATCCAGAGCGTG 
AGTCTGCCCAAAACACCCGACTACGAGATCAGTTTTAAGCCTGACGACGAGAGTATTAAGAGAACCAGAACTAAGGAAGGTCAGAAGTTGCGCGA 
CTTTGGTTACGAGCTGATATGTGGGGATGAGCCGAGCACTAGCTCCATCAGCCGGCAAGAGCACCACCATCATCATCATCATCACCAGCACCACC 
TGCAGCATGTGCAACAGCAGCAGCAGCTACAACATCAGCAGACGCAGCAGCAACATAGCAGCAAGGCCAAGCACTTCTTGCGCTCACAGCAGTTA 
CAGCTCTCCAGTTTCCTGCACAAGAACAGCAGCGGCGGCAGCGGCAAGAGTTCCGGATCCGGTGCGGATCAAAAGAAGTCCAAAGGAAAGTCTAG 
CGGCGATCGTCGGTTTCCCTTTTCCAAGCACTCAACTGGCTCCGCGGGCAGTGGCTGTGGGCCGGGTCATGCGATGACTATGACGCTGCCGCTGA 
ACAAGAAGTCCAAATCGAATCACGCCTTGGATGGAGCCGTAGGCAATGGAGTCACCATAATCGGCAGCGGCAGCAGCATCAAGAAAAGCCAGACG 
T ACAAT C AGGA T C TGC GAGAC AAGAT A GTG GGC ACC AAGT AC GAT G C GC AT C GC AAG AAC T C GG CC C C GAT AC CAAT C T T T TC GAAAC T C T C C AT 
ATCGGGTGGCACGCCGGCGAAGCCCTCCAAGGAGAATCGTTTCCTGGGCTCACCGCTGCTGCATCGCACGCTGTTCGGGCACCATCACCATCAGC 
AGCAGGCAGTAACGCCACCAAGCAGTGCGGATCCCGATTGCGATCAGGAGATCTTCTCCCAAATCACCCTGCCCACGCACAATCAGGCGGGATAT 
CGAAGTTACCGTGGACCTGGCATACTAACCACCTCGACCACTAATTTGTGCTCGTCGGAGGGCTTGCAGCCGCCGTTGCCCCCAGCGCCACCGCC 
TCCCGTCAACACCAGCAGGCAGAGAGAGGAGGAGCAAGCGGAGGCAACGATATCAGAGTGTGCGGCCACCCCAAAAGTGTCCAGTTCGGGATCTG 
TGGATTCGAAGGCGGCAACGCCGGACTATCCAAATATGGAGTGTCCGCCTGTCTTTGAGCCGGAAATCTACTCGCTCAGCGATACCAGCTTGTCC 
CGCATCGTGATGCGGACGCCCAGCAGCAGTGGTGCCACCAGTGCCACCGCCAGCACCACCACCACCAACAACAACAACAACAATAATAACACCAA 
TAACAACACCAACACCAACAGCAGTCCCAGTGGCGGCGAACACCATCAGACCTAGATGTGTGATGAAAACTATTGTGAATGCTCGCGGGGATTAG 
CCCCCGTTTACTTATGTATTTATACACACGCATATAGATATATATATATATATTATATATAGCGATATTCTCGTACGATTTAGGTATACAACTTG 
TTCATTTTGATCGTCTTCCCTGTCTGTCCACCAGAAACAAACACACGCATACCTGCAATCCCTTCGATTCGAAACCCACTTAGATGTAAATGTTT 
AATCTGCCTTATTTGCATTGTATTTCGGTTCTAGGCAAAATGTTTTACACATGTAGTCACGCAAGCAAAACAATGAATTATAGCGAAAATGGAGA 
ATAACTTTAGAAGTCTGTGCATCGAACAAAGACGGTTCTTTATGATTTGCCTTTGTACGCGATTAACCAAGTTAATCCGAGCTGTTTGATTTGCT 
CAACAGTTTGTGTCTAGATTGCGTGTAATATACATATATTATTATGCGAAATGAGATATTATTGTAATGGCATCCAGCGACGCCAATGGAGCCAG 
AAATGTTGTAGCGGAATGCAAGAAATATATATATACACAATTACACGAAATATACATGCATACGAATAAAGTCAACTGTACATAATAGAATAA 
(SEQ ID NO: 728) 

Start ATG: 132 

^YINIAEWTPDQVTDWIKGLDESMKGYLYEFSKQEIGGRALLNIRPYELENLGMLRIGHQEIVLEAVENLRNFHYHLKNDWLQFMALHVATAAK 
NLHRELARNHAESTKIDTRILHDITRTIATLKPLVGSLERTPFRKQEMYREYCGNVLKCGLELATIAHRDRFALQPVPAIRQSAERLENLANFVI 
QDISDPMVLQPASLNLVTLKKRESELGFNIESSYNGIHRVTDIKYNSPAHNSGKIEDGDEIVQINYQTVVGWQHRTVLEHLREALPDVVLTVKKR 
PKHTKMFGQIYMQPYRLPSKKRNMAARWAAQMPSPRAAFLTLDTEQLATGGTGSAVPDPSKSLSSEKREVLTKANPVSSASDSDSSCSDIPTPTD 
PKLAAREIRLYYPKPRALLQRP^TICGCEYLSLKNSDLVVPSWHERKPGIGSPTMCDPGSPSIRDKSISFGYGLEMAARPTTCIGIAGDTSTDKA 
RRMFHEARKLKQLTEDSQREFLVDRYKPGVSKVVRFDAKEDYVMKNEKFICNVENTILETFEPIPFADEGDEDALETLRNCKTENAEELLEAINL 
ATKGQDLPLAEAINMPLLQQGRRGRLDKSHSTPAYDNSGEESDTPPAIEPRKEFLLVTPPAPPPRPRKQREMTPPAVPPPPPKPASMQPASSITS 
ISIPVPVPVPAAVDPSEISELHTPSKSRTLTLKKKHSLMAKRRNTNLKLLGTGDIQGHLYRRKKNHRGVTYWARIYFVMLDTILYGFRSKQSTSA 
SLVIFLPGFTVSLAKEVHSKPHAFKVYHTAKSFYFAAESLDALNQWVDFLRQASLKVPPSTGSKGGGDAKDLYSENDSSGEECDALVIQNLSTPS 
PQGNKESMSMSMTLSGGTPPSSAPIKHERGYLDSFRKFTNTFKSSAAKPSSDIPVPTEQYRSYRKVPGGSFGIQIGANTPGYHDPAMPPTQIPPL 
VGPKLSRSSSRSSVVSGTESAVSLSASILGPPASPAPTPTPTATPTSLQHQSSEESPSQSQSQSPSSKSSLKKAPFNFLHASNPNLVEFDFHTSK 
TLLPKMSVGNTLDHGHNIQGFVTLKDLMLRKQEEEAQEMYNNRVHLGVEKHKHARTESTASQQSAMSSSVTKPLEKLPKIQSVSLPKTPDYEISF 
KPDDESIKRTRTKEGQKLRDFGYELICGDEPSTSSISRQEHHHHHHHHQHHLQHVQQQQQLQHQQTQQQHSSKAKHFLRSQQLQLSSFLHKNSSG 
GSGKSSGSGADQKKSKGKSSGDRRFPFSKHSTGSAGSGCGPGHAMTMTLPLNKKSKSNHALDGAVGNGVTIIGSGSSIKKSQTYNQDLRDKIVGT 
KYDAHRKNSAPIPIFSKLSISGGTPAKPSKENRFLGSPLLHRTLFGHHHHQQQAVTPPSSADPDCDQEIFSQITLPTHNQAGYRSYRGPGILTTS 
TTNLCSSEGLQPPLPPAPPPPVNTSRQREEEQAEATISECAATPKVSSSGSVDSKAATPDYPNMECPPVFEPEXYSLSDTSLSRIVMRTPSSSGA 
TSATASTTTTNNNNNWNNTNNNTNTNSSPSGGEHHQT* 
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(SEQ ID NO: 729) 



Name: connector enhancer of ksr 
Classification : signal_transduction 
Gene Symbol: cnk 
FlyBase ID: FBgn0021818 



Celera Sequence No. : 142000013384589 

ATAAAAGCAATTCGATTCAATATACGATTATTAAGCTTCCCCAACTAAAAATCGTATTTTTCTGATCCACCTTAAGCCGCCTATTTCACTCCAGT 

TTCTTGACGAATCAGTGGAATTTGTCAGCAATTGAGGACTAGTCATGCCGACACAGAGGTGCGCTGTAGTCCATTCAGGAACGATTTGGAATGAT 

AAGATAAGTCGGCCCACATTTGGCGGAAAACTTTTTGGGAAAGGTGTGCGTGCACCCACAACGTATGAGTCATGTTTTGAATTTATTTTTTTGTA 

TTAATTTATTGCTAGTATCTTGTATACTTTTCATTTTCATAATTTGATTTATTAAAAATTGCATCAATAGTTTATGTTTTTATTCAATTATCAAT 

TTTGTAAGCTTTTGCTTTTCGCGTAAAAAATGCACAAAGTTCTCTCGCTAATCGAAGCAAAAATCGTTAATAATATTGTTAAGTGTTTGCTCTGC 

TCGTGCTCTGCTTATCTGTTTCAGTTTTAGTCATAAAAATACATATTTTATATTTGTGTATATTTATTCATAAGATGCACGTGGCCAAGTCCTAT 

GCCATCCACAGTTCCAGTTAGCAGTCGCGTTCGCTGTGCTAAAAAAAAAACCGAAATCCTAACCAAAACTGGCGAAACAAAAGCCGCAAACAGAA 

CAGAACGGGGGCTAAAATTGGAAACAAAATGCCTGGCTGGTGGAATCATTGAACACAGACTGGCTGCGTGGTATCTACGACTGCTGGAGATGCCG 

AGAGCTGGAATGTCGGAGGATCCTGGGACCGCCTGGCCTGACATAAGTACATGGTCTTGGGTTTTTTTTCGTTTAGAGGGTTTAGGGGGTATCGG 

AAATGCACTACACTATATTCTTGTCTGGTAGCTAACTCGCAAATATTACTCTGATCTTGATCCCTGATCTCGTGTCAATGAGTTAGCATTCGCAC 

TCGCCTGCGCTCGGGATTAGCTCGTCAATTAACAGTAATTAGTTAGCCGATTTCGTTAATTGTTGTTTAGTCTATTTGCCTTTGCTATCATTTGT 

TACT T T A T AAA T AAAC T T AAACGTAAAGT GT G T AT GT GT GT AT AG AT GT T G AGG AAT C C GC C GC AGAT C C GGC C AGT AC AT AC AC CT G AC AC CT G 

TGTACACCAGTGCTGCTGGCCGATCTCCTGCATCTGCATCTGCTCCTGCTGTTGAAAAAGGGGGAGAGGATGGTTGGATCGTCGGCTCAACCGGT 

TCTAGCAGGCGCAGCCGCCCTCGGGATCCTTTGTCTCGTTGGGTGAGTCCTTTAGCACAACCTCCTGCATGTCCTCGGAGGGCTTGTTCTCCGTG 

GAATCCATGCCGGGCAGTGCCGCGGCCACCCGTCGGAACAATTGCTTCACATTGTAGCCGGCCTTGGCGCTTGTCTCGATGAACATCACGTTAAG 

CTCCTTCGCCTTGCGCTCACCCTCCTCGGTGGACACCTGACGCTTGTCGGAGAGATCCGTCTTGTTGCCCACCAGCATGATGATGACGTCGCTAC 

CCCGCTCCGTGCGCACGTCATCGATCCACTTGGAGGTCTGGTGGAACGAGTTGGTGTTGGTGATATCGTAAACGACCACTGCCACCGTGGAGTCG 

CGTATGTACGAGGGTATCAGCGAGCGGAATCGCTCCTGTCCCGCCGTATCCCACAGCTGCAGGCGCACAGTGCGATCCTCCAGGTACATGGTCTT 

CGATAGGAAATCAATTCCGATCGTCGCCTGGTACGTGTTGTCGAAGCTGTCGTACATGAAGCGCGTAATCAGCGAGGTCTTGCCCACACTCTGCT 

CGCCGAGGAAGACGAGCTTGAACTTCCGCAGCGGATTGCCAAAATCTCCGGATGACATCTTCCTCAACGTACGTGTCTACTTTTTCTTAATTACG 

GGTTTGCTGAACTCTCTTCTCGCTCCGCTCCGCTCTCACCAATCTCCACCCCTCCTCACTGTTTATAATGGGGCTGGGCCAGCGGTTTTCGGGGC 

CCTTTCGCACTGCGATTTCTCGGGATGCACCGCTTTCAAGTTGCCGCACTCTATTACACACGCTCTAAATGCTTTCCGTCTGGCTAAAGACTCAA 

TTGCGGCTCGAATGGCTTGGTAAATCGCTAGGTTGAGTCGTGGAAAAATTACGGTCTCTGCTTCTTCTTCTTTAACTTCTGAGTCAGTTGCTGTG 

TGACCAGAGTGGCAATCAAAAGCAATTTCTCTGCCACGCTATCGATAAACGCATATTTGTACTAAATCGATGTTTTATGATTTAAAAAAAAATCG 

ATTGATATTCGTTACCGTTATTAAATAATTTTAACTAAAATTTACCTGCTTAAAATGTTTAAATACTGATGAAGTACAAGGTAATTGCTTAACTT 

TTTATTTCATATTAAAAAGTGAATACTAAGTGTAAGAAATACTAACAGGTGATAACGCGATTTTCAAATATAAAGCTCTCTAAAGTGATTTGCTT 

GTTTTTAGATAAACGAGCGTAGGTATTTGCCCATCCTATCGGTGGAACTTGACGTACTTCTGTCAAAGAGTCACTGTTCGGGCGTCGCTTAATCT 

CAAATTATAATAAGTGGTTATCTGAAGAACACACGGTCCTTAATTGAAAGCCTTATCTGCTGGCGCTGGCAAAATTTTGGAATGGCGAATTAATT 

TAAACGTGCCGCGATAAAGGTCGATGTAAACAGTTCCGCGAAAACTATGTTAGCGAACGGACAGCAAACAAAGTCGTTGGCCAGTGGGCTTCTTC 

TCCTACTGGGCATATGCCGAATATCCGGAGTGGCTATCGGCGCACCCGAGGGGCGTGTCGTGGGTGGATCTCCGGCAGCGGTGAATAGTGCTCCC 

TATGCGGTGTCCATGCAGTACGGTGGCACCCATTACTGTGCCGCCAGCATTCTGAATGCCAACTGGCTAGTCACCGCTGCCCACTGCCTGACCAA 

TAGTAACCAGGTGCTGGGCAGCACCTTGGTGGCCGGAAGCATCGCGGTGGACGGAACTGCAAGCACTACGCAGACGCGGAGCATCACGTACTTCG 

TGATCAATGACTTGTACACCGGCGGAACTGTGCCCTATGACATCGGAATGATCTACACGCCCACCGCCTTCGTTTGGAGCGCCGCCGTTGCCCCA 

GTGACGCTACCATCCTCGGGAGTGGTGCCCACTGGCACCGCCAATCTGTACGG 

(SEQ ID NO: 730) 

Exon: 2188. .1001 

Start ATG: 1863 (Reverse strand: CAT) 
Transcript No. : CT20524 

TCACACAGCAACTGACTCAGAAGTTAAAGAAGAAGAAGCAGAGACCGTAATTTTTCCACGACTCAACCTAGCGATTTACCAAGCCATTCGAGCCG 
CAATTGAGTCTTTAGCCAGACGGAAAGCATTTAGAGCGTGTGTAATAGAGTGCGGCAACTTGAAAGCGGTGCATCCCGAGAAATCGCAGTGCGAA 
AGGGCCCCGAAAACCGCTGGCCCAGCCCCATTATAAACAGTGAGGAGGGGTGGAGATTGGTGAGAGCGGAGCGGAGCGAGAAGAGAGTTCAGCAA 
ACCCGTAATTAAGAAAAAGTAGACACGTACGTTGAGGAAGATGTCATCCGGAGATTTTGGCAATCCGCTGCGGAAGTTCAAGCTCGTCTTCCTCG 
GCGAGCAGAGTGTGGGCAAGACCTCGCTGATTACGCGCTTCATGTACGACAGCTTCGACAACACGTACCAGGCGACGATCGGAATTGATTTCCTA 
TCGAAGACCATGTACCTGGAGGATCGCACTGTGCGCCTGCAGCTGTGGGATACGGCGGGACAGGAGCGATTCCGCTCGCTGATACCCTCGTACAT 
ACGCGACTCCACGGTGGCAGTGGTCGTTTACGATATCACCAACACCAACTCGTTCCACCAGACCTCCAAGTGGATCGATGACGTGCGCACGGAGC 
GGGGTAGCGACGTCATCATCATGCTGGTGGGCAACAAGACGGATCTCTCCGACAAGCGTCAGGTGTCCACCGAGGAGGGTGAGCGCAAGGCGAAG 
GAGCTTAACGTGATGTTCATCGAGACAAGCGCCAAGGCCGGCTACAATGTGAAGCAATTGTTCCGACGGGTGGCCGCGGCACTGCCCGGCATGGA 
TTCCACGGAGAACAAGCCCTCCGAGGACATGCAGGAGGTTGTGCTAAAGGACTCACCCAACGAGACAAAGGATCCCGAGGGCGGCTGCGCCTGCT 
AGAACCGGTTGAGCCGACGATCCAACCATCCTCTCCCCCTTTTTCAACAGCAGGAGCAGATGCAGATGCAGGAGATCGGCCAGCAGCACTGGTGT 
ACACAGGTGTCAGGTGTATGTACTGGCCGGATCTGCGGCGGATTCCTCAACATCTATACACACATACACACTTTACGTTTAAGTTTATTTATAAA 
GTAACAAATGATAGCAAAGGCAAATAGACTAAACAACAATTAACGAAA 
(SEQ ID NO: 731) 

Start ATG: 326 (Reverse strand: CAT) 

MSSGDFGNPLRKFKLVFLGEQSVGKTSLITRFMYDSFDNTYQATIGIDFLSKTMYLEDRTVRLQLWDTAGQERFRSLIPSYIRDSTVAVVVYDIT 
NTNSFHQTSKWIDDVRTERGSDVIIMLVGNKTDLSDKRQVSTEEGERKAKELWMFIETSAKAGYNVKQLFRRVAAALPGMDSTENKPSEDMQEV 
VLKDS PNETKDPEGGCAC* 
(SEQ ID NO: 732) 
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Name; RAB6 

Classification : signal_transduction 
Gene Symbol: Rab6 
FlyBase ID: FBgn0015797 



Celera Sequence No. : 142000012790778 

ATTCTTTATTGGAAATCTTCACACTACAACTATCTGCTGAAACTTAAAAACCTTCATACATTTACACATCATATCTTCACAAAAGGCTCCACCCT 
CGATCACGGACTTAACTCGCGCACAACAAAATATAAATAAAACTCCAATGTCACCGAGTCGAGTTGATAAAAAGCATCGCTAAAGGCCTTTTAAT 
TGCCGCTGCAGTCGATAAATCATCAAACGGGAATATGAAAGCATCTCAGGAATTCAGTGCGTAAAATTAGAATGCAACAACAACAGCAGTAGCAG 
CAGCAACAACAACAACACGCGGGCGGAGGCAGGAAAGATACAAAATAGCCAAGGAGCCAATATACATACATATACATGTACATATACATCCACGT 
CGTAAAGTGCGTCTCTCTCTTGCACTCTCTCTCTCTCTCTGTGTTAGAGCTCCATCTCCCTCTCCCCACAGCCACCTACTTCCTGCACATATGCT 
GC GAGC AAC AAC AAC AC AAC GT GC T C C T CT T GGG GAT GC T GT GT GCG AGC G AG AAAGAG AG AAAGC C AG AG AG AAC GGC AG AGT AACT TC G AAT C 
AAAGAGCGCAACTACACTCGTAAAAATTATTACAGCTGCCAAGTTAAAAGCTATTACATCGTAATAATGTTGTTGTGGAATTTTTTTAAAACAAA 
TAT AACTT T T T AT AAAT AC GT T AC AT T T G AAT AAG AT AT AATT AC GAG AAG GTT AGGGAAT AT AT TG T T AT T T AAT GT T T T T AC AATA GA TT TGC 
GTTTTACTCTATTACTATTACTCTAACCAAACTAAAAAAAAGTTATAGGAATCTTTAATGAAATTCCTTTATTCCTTTTGTTTGAGTTGACTGAT 
AACTGAATTTCCACCCTAAATTTTTGCGTTATCGTTTTCCCGTTTAGGGCATCCATTTCTCTGAGTGCAGGGCTCAGTCATCGCAGCCTTTTCGC 
CTTGGCCACGAACACACGTCTCATTTGAACGCGAAACGCCGAGTCAAACGGACGCAGTCGCTCGTCGCAGTGGTTGTTAAACTCGAGTTTCGTAC 
TTTCGAGCGCGCGTATTGAAACATAAGTGAGCGATCGTGTGAACAGTGATATAAAGTGCACCGTGTATGCGGGATAGAGATACCTAACCTAGTGC 
CACACGAAAGTTGAAGTAATACCAGCAGGATATCTACTCAGTTGGCAGTGAAAAGCGTGGTGAAGTGTTCGGTCATAATGAGTGTGTGAGCGGGG 
ACAAGCTGACAAACTGAATGCGATAAGGTTACAGATGTCCTAAACACACAGGCACTGGCACACAAGGATTCCCGGGGTCCTGGTGACATGTAACT 
GGATCCGGATCTGGATACATGTTGGCTCCTAAGACAACAGCCCGCTCCTTTGCCTATCGTCTGGCCGCACTGCAGAAACGGGAGCAACAAAAGCG 
AGACACCGGGTAAGTAGCGATGCTTAACTATAATAAGAAACCTGTCCCCTCGGACAGGATAAAAACAAGCCGCACACACAGTAGCACATGCGGAC 
AAAGGGGCGCACTGCAGTGGTCGTCCGATGCTTCCTGCGCCGCTAAGTATGCCACTGCAATGCTATTTTAAGGTGCAAAGTGCATTGAGTAGGAG 
GAGGCAAAGTGCCAGCCAGCTAGCCCTAACCCACACACACACTTCGCATAGACACACATGGGCAGTGGACTACGGCGACAACACACAAAACGTAT 
TCAAGTGTAACTAAAGACGACCAATTCACAAACAAACAAGCGCGCCAGGAGCAGTTGGCGGATGTGTGGGAGGGAGGGAGTTGGGGGAAACGGAA 
CCGACTGAAGCGGAGCGGAGCGGTCGACCCTCGAACCGTGTGACCGCGGGGCAGGAGGGGGCGGGGTTCTGAAGGGACGACAGGGGCCGCTCGCT 
AATGTTTATTATGAAAAGCGTGCCCTTTTGAACGCCATCGTGTATTTTTATTTTTATTTGTATTTTATGATTGCGCTTAGCCATGTGTATTTGCT 
GTGAGTGGAACTGTGGTGAGCGCTAGTGCACAGGAAGAAAAAGATATAGACTTGGTTTTTTATTTGAGGTTAAAGGGAGATCCCATTTCAAAGCA 
AAGGAAAACAATTCTATTGAAGGTTTCGATTAGGAGACGTTTCGTTTTTTGAAATCTGAACTCTACACTGTGGAATTATTGAAGTTCATACCATG 
TTTGGTTTACGATCAATTAAAATGGGATACATTATACATCATGAATACTATGTTTGGTTTACGATCAATTAAAATGGGATAGATTATACATCGTC 
AAATGATAAGACAGTTTAAAACAATGATTAGTGAATCATATAATGTGTATTGAAACCTTTTCGATTGGAATACGATTATAAAATCCATCTAAAGG 
CCTGGGAATAATCCCCCGAGAGCTCTTACAAACCATTCTCGATCGATTCCGAGTACTACATTTTCTCTCAGTATGTTTATTTATGTTTTTCTTGT 
CACGGAGCTGGATCAGAGGGGGTGAAGGGGCGGCTGTCGAGAGGCCCTCATGTGAGTGCAGCACTCAATTCATTGATAAAAGATGCAGCTACAAT 
TATAATAATAATAATAAGAACTCCGGCCAACGGGATGCCGCCGCGATCAACATGCCGCAATTACAGTTATAATTTGGGCCCGTAAGCGTTGTTCA 
AATGTAAATTCAAATGTGGATGAATTGCAATTACCAACCCTCCGCGGCGACAAACGTTGGCTGCTTCGCTGAGTGGGGGTAACAAACAATCGCCA 
ATTGGCATATAATGATCCCCTTTGGACGTCGAAACGTCCTAACTGACTGACTGACGTACCAACTGGCTGACTGACTGACTGACAGTCGGACGATA 
TACATACGAGTATATGCGCCATGAGGTGGGTATGGTAAAAAGTGGCATGGGACTCAGACAACCAGCAGCGACATTCGGAACAATTCTCAGGCCTT 
GACAAATGCACGTAATGATTGGACTCACACTTTGGTTTCTGCTTTTTTZVACAATTTACGCGCAGCTCTTACTCTTACCCCTTCTTCGCTGGTTCT 
CACACTCACTCGGCCTAGTCATAATTTTAGGGGTTGTGCAACCCGGCACAGTAATCGAAACTCAAACCCAAAAACTCGTAGATACAGATACAGCT 
GTTCGCCTTGATGGGCAAACAAGGGCTTGGAGGTAGACCTCCGCCCTTCGACTTTCGATTTTTCGGGACATGGGAACGGAACGTTGCATGAACAT 
CTTAGACTCCTGCGGTGATTAATTGTCATTCCAGGGCCCTACAGCATATGTAGAGGCCACTCGGGCTCCTATGTATATGCGTTATGTAGGATCCA 
GGGTTCAGGGTCTTAACACTGGATTAGATGTCGATCGAAGTGAAAGAATAGGCTCTTTTCAGTTTCTTTCGGCAACTGGCTTTGTTCCGAGCCTA 
GGTAGCTGATGTTGGTAAGGGCTTAGCGTCGTAAATCGTTGGATTGAACCATTCCGATCGATGGCGGAGAGAATCGGTAGGGCATTAACATGCCT 
AGGTAGTAGAACTTTCTATACGCCAGACTCTTATCATTGCATCTTCGACATGCGTAGGCAATACGTTTGCTTTCCATTTTTCTCTGGCTATCATC 
AGCTGGCATCATGTGGCAGTAGTTTTGGCAACATAAATTAGGCCACGGGCATCATTTGTTACAACCGGACACATTAGAACTTTGCTAATACGATA 
CAAATGATAAGGGCCTTTCATCAGTTGTTCAGTTTGTGCTACTTATGCCGCTGATGGGATAGAATCATAAATAATTTGTTTATTGCATGCAGAGG 
GGGTCACAAGTATGCTGGCTACGATTGTATCTTGTGTATGCTGTGTATCTTGTAGTAACATGAGCCTGTGTAGCTTGGCCCATAACTCATGCTTG 
AAACTTTTGTTTCTACTTCTATTTAATTCCTCGACAATTACGCTTAATTAATTTAAAGATGTTTGTTGCCGTTGTCGTGCTCGCATCCCCCGTCG 
AGTTGTTCGATCAATTTTGAGCAGTTATGACGGCGGCTCGGCCATGGGGGAGGATTTTTTAAATAGATTAATTAAAATCATCTTTAATTGTTTGA 
TTACAAGGCAAGCGCGGCCCGAAAAAAATGCGAAGCGAACGCAGATAGCTCACAATAGCTGCTTTCATACGTTCTCAAGTTGCGGAGTGTGTGGG 
CAAAATGCGCAACATGTGAAATACAGATGCCTCAGAGTTACCAGCACGGTCCCTTGGGCATTAAAGATTTCATTTGTGCTACAGATGTTCCGGCG 
AT TGT AC AAC AT AAC T AC AAGG AGG AG T C T C A C GC AAT T GA GGCAACGGCATC T GCC AC AACC AC AC C AAC AC T CAC AG C C AC AT C C AC AT C C AC 
ATCCATGGATCCATGGTTCCACTTCAGCGCAGTCACCAAGCGACTGGCGCAGCGTTTCGAGTGGCATTTGATTTTAGTGCTCGAGAGTGTGTGTG 
CGAGTGCTTCAGAAGTATAAAAATTATTGCTGCCAAGCAACTGAGTCAGTCAGTCATCAATTATTTTATAGAATGCTCACTTGTTTTGTGTGTTC 
TGGCATTGTTCCCCGACCAGACCCAATCTCCATCTCCCCGACCCATTTCGGCATCCCCATGCCCCGAAATGCCAAAACCGAATCTGTGTTCTTGG 
AGTTCACTTGGAAGTTCTCGTTCGGTACAGGAAGGGCTTCTCCGCCCTCTACACTTCACTTCACTGGGACTCCGGCAATAGTTGTGAAATTAAAT 
TGAATACGACGCCGATTATCCCACGCAGTCTTCCATTCGCATTTCCCATTCATTCAGGCAGGCATCCACGGGCTGCCTAGGAAGCAAGTGCTGGC 
GACACAATCTCATCCTCATACTCCTCCGCGGAATCCGAGAGCTTGGGAAATCATCAAAGTTCAGCGGGACTTCGGTGGCGTCTCCTGCTCACAAT 
ATAAGATGATTACATGCACACGCAACTTGCCGTATCTGGCCATAAATCTCAAAGAGTGCACAGGGAAAAAAGGATATTAATATATACAAATTATG 
AAACGGTATAGTGTTTCTACTGCGCAAGGCTTTAATATTAATAGACATGAAATCAATTGAAAAATGTATAAAATTCTTATCTGAAATACACGTTC 
CTTGACGCAGTATTCCGTTCTATGTAAAGAACAAGAAATATCACCAATTTGGCAGAAGTTCTAATATTTATTTTAATATTTTGTTTAAAATTCTC 
CTTTTCTTTGATTTTTTCGCTCAGTGCATGTGCGTGTGTGTCCCACAGTTTGTCACTCAGTCATTCCATTCAGAGGATGAGGAAGTGGATGTAGA 
GGGGGTTTTCGCCGGATGCTTGCGTGCCAAAATCAATGCAGGTTTTTTTATTTGTCAACGACAAGTTTGCATTCAATAAATAAAACCCATTCGAT 
TATAAAGTACACCCGCAGACAATGAAATGTCGCTGATAAGCGCCTCTTACAATCCATTTGGCCTACTTCAGTTCATTAAAAATCTATTAAAATGT 
AATGT AC AAT AC CC C AC C TCCCCTATG CAT G AAG CTTTCGGTCC G AT AAGGC AGAC AGC C ATC C AAT CG AT C G AAT C GGG GCT ATT T AAT GT GC G 
AGACTTTGATTAATCTCGCCATCAGTTCGGTTGTTCCCTTGTCATGCGGAATATATGTACATATGTACAGCTATCTGTATCTGTCTGCAATTCAG 
CATTATTCTTGAGAGTAGCTGCAGTAATAAATAAGCCATGTTTGTAAGAGCTGAAAATTTGTTCGATTTTCGCTGATGGCGATATCGATTTAGTT 
TAGCCGAAATGTAATTTGGCTAAGCAGCCCGGCAAACAATGTGGCCCAATATCGGCCTCGTCAAACAAAGGTCTAGACAAATCGAAAACAAAACG 
AACTTCAATATCCCATCTATAATTGAATGCACACACAAAACAAAGCGGATAGAGTTGGAAGACTATTATAATCTGGTCAGTTACATGAGCACCGC 
CCGCCCACACACTCCATATCCACAAATATAGCAGATATGAGATAGCACATAGTATATATGGCATATATGGCATTTACCATTACTATTACCCCAAC 
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CCATCCACACACAAAGGAGAATATTGACAGACACTCAAGTGTCCACAAGCACCCTGTAAATCCAATTGAATTACGCATAAAGGCGCTTAAGCCGA 
ACGAAACGAGTCTCTCGATGATGACGACCCATCAAATGTATGATTTCCCTTTAACTTCTGATTACCCGATATTTTATTACGTTTCGTGGTCACGC 
CGGCTACTGGGTACCACTGATTTGGTGTAGCTATGGAGCTGTGGCGCATGGATCCCCCAATAATAATCACTTATCGCATATCAGCGTTTCCGCAA 
AATTTACATATCGACTTTTACTTGATGCGCCTTTTAATGGCCCACGTCGACGGCAATAAAATATTGAGAGACTGAGCGTGCGGGGGCCTTTGAGT 
TTTAACTTATGCATGATTAATACATAATTACAAAGACAAACGCATAAACTCCAACTCAAACTCCAGCTCTAAGTGAGGCCCAAAGAGGGGGGCCG 
GACTTATCGCGGTATTCGTGCAGGTATGCCTCCTCCATATAACCATTTGCCACTCCAACGTCCACCACACCACCACTTAATAACTCTAAAAGACA 
AAAGGCACTTTTCCCACAGCCAAAGGCAAAGGCAAAGGCAAAGCCCGTGGAGGAGACCACGTCCTGAACCGAGCCAAGCTGCCTAGGTGCCCGGA 
GAACTGCAGTGTGGCGGAGCATACTTATACCTGTTATTAATATTTCGATTAGGTACCAACGATTCGTAGAACCTGACGCATGACTTGCATGGCTA 
ATGGGAAATTTTGTAACTTGTGACCACGCTGCCCTACAATTTGTGGGTCGTCGGACGGGGTCGTCGGATTGGTCCCCTGATTGATGGAGACAACT 
CATCGATTGTGCGGCGAGGACAGGAAGTGTCCCTAAGTGCAGCCGATACATCCTCATTTTCCCATAGATCCTGACAGTAGATTGGAATCTGGGCC 
TACCTTCGCCAGGAAATGATATCTAAACTCATGCGTACAACTCGAGATAGTTTCATTCGTGAAAGGAGCATATTTTTTGACCTTTTGGAAGGGTT 
TGTTGCGTGATCATTTCATCTTCTCTTCGCAGACACCACCCTTAAAATATTGTTGATTCTGGGAAATGTGATTGTAAGCAGACACCTTTTTCTTC 
CTTACTCATAAAGAGGATTTGCGTCATAAAAAAGCAGGTGTACATCTGAGGGTTCCATTAAATGTTTGAATAACACTGTAATAGGATATGCAGCC 
TAAATACAGATACAGGTTAGCAATATCTACAACAAAATGTAACATGATGTTTAGGTTAGGTTCTATAATGAATGAATTCAATATAAATTCACTGA 
ACAGCAGATTTCTTAGGCAATAGAGTGTGTACTAGTCGTCTCAGATAGCAAAATCCAAGTGCAAGACCATAAGTTTGTTAATGCTGAGCGTGTGC 
GTC AAAG AT AT GAAC AGT T T A GT CAAAC C GT GG AC AC TTTCGTGGCTT G AC AG AC C GCT T T GT AAAT AAT C GCC T T AT GGCC AAGT GC GC AAAT T 
GCCTGGTCGAAGGAGCAAAAGTAAATGTAGCTAACTGGCGTTTCTTATCTGACTTTCCAGATCGTATTCACCACGTGAACAAAGCGAGTGTGCGG 
CTAATGCTAATGCCAGCTCCTTCAGCTAGCTCGCCACATCGCCAGCAGCAGCAACAACAATTGCAGGTGCCCCTGCAGGTGCATCAGTTGCACAG 
GAAGCGCCGGACAAAGGGACACCTACACCACCTTCACCATCTCAACCACGAAACGACAGGTCAACTGACAGCGAAGCATGGCGGTGATTAATAAA 
GCCGGAAATGTGATTGCCCTCCTGCTGGTTAAGCTTCAGCTAATCCTGCTGTTCACGCTGTCAGTTTCCGGAGAGTTGCCGCAGCTGGACTACGG 
CAGCTTGTCCGCATCCCTGGAGGAGGATGCTATTGACCCACTCACGGCGATGGCACCCTTCGCCAACTCCCTGGTCACCGAAGAATCGGCGCAGC 
ACAAGAACAACGCCGAGCTGCTTGGCAATAGCAGTGAGGATGAGAATGTGAGACCGCAGCAAGGGTCCTCTTCCTCGGGACTAGGATCATCGGGA 
GCAGCAGGTGGATCAGGCATACTGCTCGAGGAGTTTAACAGCGGCAAACTGAGTCCCGGAGAGGCGAGTAATACGCTGCCCATATTCCTCATCGA 
GCCGGAGAGCGTGTTCGTGGTCAAGAACCGGCCGGCGGTGCTTAAGTGCAAGGCCTCCCACTCGCTGCAGGTGATCTTCAAGTGTAGCGGCAGTT 
CGCAACCACCGCCGTCCACGCACGAAACTCACGTGGATCCGCACACGGGGGTCAACATGGAGGAGGTCACCGCCACCATCCACCGCGACCTGGTC 
GACGAGTTCTTCGGCGACGGACCCTTCAAGTGCGAGTGCCACGCGTGGTCGTCGCGTGGCGTGGTCAAGAGTCAGGCGGCCACCGTGCACATTGC 
TTGTGAGTACACTGTTCAAAAAGTTGTAGTTTACAAATTTATTATGAGCGAAACCTTATCCTAGGATTTTCGTATCGGAAAAAAAGTTAAAATAA 
ATATTTATTCTACAATAGTTAATTTTCCTTACGAATTTTTGTTTCAGTATAATGGCATGCAGTTGAAAATGTAATTTTTGTAGTTAACTTTGCAC 
CTTTGTTCGGGGGTGGCTTCCTGTCAACAACTGACCCTCGCTTAGCTTACCCCTATCTGTATCTGAATCTACATCGGTAACCTGCATCTGTCCGT 
TTAACCCCCACAGATATTCGCAAGTCCTTCAACCAGTCGCCCACCTCGCTGCGCCTGGAGCTGGGCAGTCGGGCGGAACTGCGCTGCGAACCACC 
CGGCGGTTTCCCCGAACCGAAGCTCACCTGGCACAAAAACAACGCGGTCATCACGGCGGACAGCGAGCCGGGGATCACCGTTTCGGCCGGCACAC 
TCATCTTCCGCCAGGTGGCCCTGCAGCATATGGCCAACTATAGCTGCAGTGCGGAGAATATCGCTGGCAGACGCGTCTCCGATTCCGCCGTGCTC 
ATCGTTTATGGTCAGTACAGATGGGCTTTGGGGCAAATTAAAAATTTATAGGTTTCATGCTGTAAATTCCAGAAAGAAACCCCATTTACTGGGGG 
TTCTCGGTTTTAATCATTTTCCATGAAATTTTATTAAATTTATTGCTAATAGAACTGAGGAACTATAATCATTTTACACTAATATACTCTTAAAT 
TATATGGCCAAAAATCGCATAGCACCTTAAATGGAGTAGCGAAAAATCATTTGGTATTGCAGGTCGATAAAGCTCTTAATTGTTTTATTTTCCGT 
TTTCATCCCACAGTCAACGGTGGCTGGAGCACCTGGAGTCCATGGCGCGAATGCAAGTGTGCGGGCAAGCCCAGCCAGGGAAGGAAGCGTTCGCG 
CACCTGCAACAATCCGATGCCATTGAATGGGGGCGCCCAGTGTCCGGGCCCCCAGATCCAGAAGTCCGCCGACTGCGCCGCATGTCCAGGTACGT 
TCCTGCGGTCATCCGCAAAATTGTGGGCGCATCGAATTGCGGCCAACACGCCCATTATACGGGCCCATTATATGTGCTGCTGTGGGGTTTTTGGG 
GGGAAAATGGGAATGGGAAAATGCTTAAAGGCAGATGTTAAGCAGAAGCCAAAATTGTCAATTTAAATGGTATTTTGCGGCCGGGATTCCAGCTT 
GGGCGCCACCTGAAAATCCCCATCTGGTTGCCCCGCTGCTTCTGTCGATACATCTCCCGGAGCGTCGAGAACAACGCGTATTTTTCACACCAGGG 
CGGACATGAGGATACCAGGATGGTGTGGAGTGGCAAGGACTGCCCAAAATACAAAGCACTGCACAACGCAAAAGTTCAGGGGGCGTTTCACAAAA 
AAAAAAGATGAGAAATAAAAGACAAGATAAGCTTTTGAGGGTCAATGCGAACTAGACATACTCTAAGGACCTCAGTGAATTATCATAATCCGAAA 
GGGAGCGGATTTACCTGAATGATGGGAGTGGCATTGGGATTAATTTACAAAGTACAACAGCTGAACGAAATATTTTAATATTAAATTTATTTAAG 
ACTAATCCGCATGGTTCTTACAAATTTCTACAGATTTTTATCAGCAATCAAAGTTACTAGATGTCAAAATATTTTCCCGGTTTAAAGTTAGTCTC 
ATTATAACTTTTTGACACACCTTATTTCAACAATTAGCGATAACTATTTCTTACAATAATGAGTACATCAAAAAATGCCTATTTCAAGATTAATA 
ACTTAATACCCACTAGAACTAAATATAGACCAACCCCAGTACATAGTCAACATGCCCCGACCGAACAATGGTATAATACACTTTTGAGTGAGTGC 
TGAGTATTTCATTTTAAACACGCATTCGGTTCCTCGGACCGACGACAGATATCTCCACATGCGAGATGCGTATATAGATGGATAGATTGGGCCGC 
CGCCTTTGGCAGTGGGGAACGCCTGCCGCCGGCGTGCCACCGACTAAATGGTGACAAAGGAATGAATCCGGGGGAAGGCAGTCCAGAGACTTGGT 
GCGACTCAGCATTGAGGACTGGAGATGTGAGGATGAAATCTGGAGTGGAGTGGCATGGAATGGGTCGGACGGTGGTAGCTGGGGCGCTCTTTTTG 
ACAGACGCACGGGCGGCATGTGTTTGAGACTCTTCACTTTTATTCATGTAAATTTATTGTATTGCATTGTATTGTATTGTAGTGCTCGCACAGAT 
ACTCACCCACACGGAGATTTCGTAGGATACTTGGCCAGGCGCACAGCGGGCACAGCAATGAGCAATTGTTGACAGACAAGCGGGCCAAATGGGTG 
ATGGGAAATGGGTGGGTGGACTACGAGTATATACTGTGCTCCAGAAGGGGGCCTTTCGGTCGTCAAGAGGGGGACGCCATTGTGGGCTCGGGTAT 
TTCTTTTCTATGAAAAGTGTGTCATAAAATTAATTTTAAATTTCGCTTGGCTTTCGTTTTTCTTCCTTTTTTTTTTTTGTCCCCGGACGACTTTT 
CATCCCCATGGAACCGTGTCAACCTTATGGTCCTGCCGCTGCCACGCCCACCAACGCCCCGTGATTACGCTGTCGCTCTGACGTCCCGTCGTCCT 
GACGTCCCGTCGCCATCGTCGCCATCGTCGCCCGCCAACAACAGAGGACACTCAAATCGTGAGCCCTGATGGATTTGACATTTCGTCGAGTAAGC 
GCAGTAAGTTGAACCGAAAGCCCGACAATATGCATTTCCCTCAAAGGCTTCCGAGCCGTTTCAGGATCCAATGGGATGGGGCTACTCGAATGGGT 
GGATGGGGGTTCCAAAGGATTCAATACATCGACATTGTGTTCGAGCAGTTAAACTTGAATGTTTAACGAGCCGACGACAGTTAAATTTTTTAGTT 
GGGGTCGCCTCGCTATCCACTGGGTCCTTAACAATCCAATAAACAATCGTGTTAACCCGAACCGGAGGCATCAGCATCCAACCAACCGTGGCGTC 
ATTCGTTTCGGGACTTCCCCCAAAAAAAAATCCTCCGCACTTTAACTGACTTTAACTGAAGTGACTTACTTTGCGACGGGGTAGTTGTTGGTGAC 
GTCACTGCGGCAGATAGCAACTTAAAAATCACTTACCACGACTCACGCTCTCTGATTCATGCAACAGTCTGAGTCTGGGTCTGGGTCTAGTTCTG 
GGTCTCGGTTTTTGGCTGGGGTCGAAAAGGATACGTCCTGGGATGGGTATTGACAGGCGGGATGGGTTGGGGATCAGGATGCGCACGGCGTGGCA 
GCGGGGAAAACTAAATGCCTGGCAATGTGATGATGGCTGGCGACAAGTCCGGAATTTTGTGCGCCCTTTCTGAGGGTGGTTTTATGTTTCCAAAA 
CAGGAAGTGCCAAATGCCGATGTATTTATACCAGGATGTGCGCACACTTAGCGAAAAACTAGCCCAGGAACTAAATACATAATTAAAATAAACCA 
T CGC AGAC AT GT GGAAAT ATGAATATTATTGC AAAAGAAG TAAC TTAATGAAAACATTTTGAATT CGAT TAAATCAAACC AG AG AAAAGAGT GT A 
CACATATGTATTATGTTTATAAAGTAGGCTTGTTAATTGTATCTGTTTTCATTAAGAATAACTTGAATAACTTGAATCGCTTTGAATAGATTCAA 
GTCAACTTCTTTTTCTCAGTGCATAGCTGTCGAACATATATTTCAGCTGCCATTGGCATGCATCACACCATGGACATTTTTGGCTTTTTGTTTAG 
CTTTCGGACTTTCCGCTTTCTCCCCCTCATTTAGTGGAAATTGAAATAGCAAAGAAAATGCGAAAATAGAAAAGCATTAAAATGACAATCACTTT 
TACTCACCCACCCACGCAGAAACCAGCCCAAACCAAACCCCACAAATCAACAGAGGCGTCGTCAGTGGAGTTTTGGGGGTTAAGAAAAAAACGGG 
AAATGGGAATCAGGAGTGGAGGAAGACCCACTAATTTTATGGTGGGAACAAACTTTCGATTTCTCCCCATGGACCATCTATTAGACCGTAATTGT 
GCAGCCGGAGAAGCTGTGAACAAAGGGGGGGGGGGGGGGTTGGAATGCCACGTCCATGGTTGTGGCCCCGATGGTCTTGATATATGACCCGGGGA 
GTGCCGCTGTCGGGAATGTGTGCTAAATCATGTTAATATGACTTTTGATTTCTAAACTGCCTGTGCTTTTGCCTCTGCTCTTCCATTTACAGTGG 
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CCCGCTGGTCGGCGTGGAGTGACTGGAGCATCTGCTCCGCGGAGTGCATTCAAGTGCGTCGCCGGAAGTGCCTGACTCAGGGCCAGACTCAGATT 
TCCTCCGAGGCGGAGGAAGCCGGTGATCTGCTCCTGGGAGCTCCCGGAGTGGGCATGGCCGCCCTCATCGCCGCTGCAGGAGTGGGCGCAGTAGG 
AAGCCCCAGCGAAGCGACTGGCTCCAGTTCCGACATTATCCCGGGATATGGCAAATCATTGTGCGCCGGAAAAGACATACAAACGGCCGAATGCC 
GCGGCGAACAGTGCCAGATTGGCAAGGATGGTAAGTCTGTGCTCATGACTAGGATGAATGCCTGGCAGGATCAACAAAGTAGACAGTTAGACAAC 
AATAACAGCAAAGCCGGAGTGCAGCATCCACAAAACCAGAATCCCAGAATCCCTCGCACAAAAGATGAGCACTCAACGCTGATTGTCACAATGGG 
AAATTAACCAAGTGCATTTGTTTGTGCACGACCACGGGCTTATTTATATATATATAATGGAAGTATAGATGGTATAGAAAGCATGGCCAGGACAT 
TTCACTTTTCACCCCAAAGGACCCCCAGCTGTAAGCTCTCAACTTGCGACTTTCAATCGCCCCAAATGAATCCCGGACCCAATCGAAGAAGGGTC 
GCAAATCAATCGCAGCAGCACCACTTGGAATTGAGATTTAAGTGGGGGTCCCACACAAGTCTTCCGAGGGTTTATGTGCTTTGCCAGATAATTAG 
GCGTGCGGCAAAGACAGCAGCATCAGCATCAGCATCCACACACCCATTTGCCACCCCAATCCGGGGCTTTTCCCGTGTCATTGTTGCTGTTTGCG 
ATTCGATTTTCATCGGCAGGGCGGCTAGAGGGGAAAATTGTGCGATTGACCCAACTCAAACATAGTTTTGTGCGGTCCATGGGGATCAATTAGAT 
GGCTGTCACAGTGTGTAATTGAAAGGATCAGGATACACAGGGAGTACCAGGAGTTGTTCGGCTTTTTTGGCAGGTTTGTCCTTTATTTGATTGCC 
TGCTGAAAAGTTTTCGAAGAGCTGTGACACTCTGCAGCTTATTTTCGATTTTATTACGATTCGCTAAAGCTCGAGCAATCCGGGCAAATCCCTGG 
GGCCACCGATACCTGCGGCTTCATCATGGCCCATTGTCAGCATCTCGTACTCAACCATCTGATATTAAAAGCGTATTTGTCAGACCAAGGAGACG 
AATCCCCTTTTGATGTCCCTTGCAGGAGGCA 
(SEQ ID NO; 733) 

Exon: 1001.. 1434 
Exon: 7661. .7873 
Exon: 7950. .8335 
Exon: 8423..8S52 
Exon: 8849.. 9130 
Exon: 9419.. 9589 
Exon: 11255.. 11308 
Exon: 12823. .13307 
Start ATG: 7705 

Transcript No. : CT20824 

GACGCAGTCGCTCGTCGCAGTGGTTGTTAAACTCGAGTTTCGTACTTTCGAGCGCGCGTATTGAAACATAAGTGAGCGATCGTGTGAACAGTGAT 
ATAAAGTGCACCGTGTATGCGGGATAGAGATACCTAACCTAGTGCCACACGAAAGTTGAAGTAATACCAGCAGGATATCTACTCAGTTGGCAGTG 
AAAAGCGTGGTGAAGTGTTCGGTCATAATGAGTGTGTGAGCGGGGACAAGCTGACAAACTGAATGCGATAAGGTTACAGATGTCCTAAACACACA 
GGCACTGGCACACAAGGATTCCCGGGGTCCTGGTGACATGTAACTGGATCCGGATCTGGATACATGTTGGCTCCTAAGACAACAGCCCGCTCCTT 
TGCCTATCGTCTGGCCGCACTGCAGAAACGGGAGCAACAAAAGCGAGACACCGGATCGTATTCACCACGTGAACAAAGCGAGTGTGCGGCTAATG 
CTAATGCCAGCTCCTTCAGCTAGCTCGCCACATCGCCAGCAGCAGCAACAACAATTGCAGGTGCCCCTGCAGGTGCATCAGTTGCACAGGAAGCG 
CCGGACAAAGGGACACCTACACCACCTTCACCATCTCAACCACGAAACGACAGGTCAACTGACAGCGAAGCATGGCGTTTCCGGAGAGTTGCCGC 
AGCTGGACTACGGCAGCTTGTCCGCATCCCTGGAGGAGGATGCTATTGACCCACTCACGGCGATGGCACCCTTCGCCAACTCCCTGGTCACCGAA 
GAATCGGCGCAGCACAAGAACAACGCCGAGCTGCTTGGCAATAGCAGTGAGGATGAGAATGTGAGACCGCAGCAAGGGTCCTCTTCCTCGGGACT 
AGGATCATCGGGAGCAGCAGGTGGATCAGGCATACTGCTCGAGGAGTTTAACAGCGGCAAACTGAGTCCCGGAGAGGCGAGTAATACGCTGCCCA 
TATTCCTCATCGAGCCGGAGAGCGTGTTCGTGGTCAAGAACCGGCCGGCGGTGCTTAAGTGCAAGGCCTCCCACTCGCTGCAGGAGGTCACCGCC 
ACCATCCACCGCGACCTGGTCGACGAGTTCTTCGGCGACGGACCCTTCAAGTGCGAGTGCCACGCGTGGTCGTCGCGTGGCGTGGTCAAGAGTCA 
GGCGGCCACCGTGCACATTGCTTATATTCGCAAGTCCTTCAACCAGTCGCCCACCTCGCTGCGCCTGGAGCTGGGCAGTCGGGCGGAACTGCGCT 
GCGAACCACCCGGCGGTTTCCCCGAACCGAAGCTCACCTGGCACAAAAACAACGCGGTCATCACGGCGGACAGCGAGCCGGGGATCACCGTTTCG 
GCCGGCACACTCATCTTCCGCCAGGTGGCCCTGCAGCATATGGCCAACTATAGCTGCAGTGCGGAGAATATCGCTGGCAGACGCGTCTCCGATTC 
CGCCGTGCTCATCGTTTATGTCAACGGTGGCTGGAGCACCTGGAGTCCATGGCGCGAATGCAAGTGTGCGGGCAAGCCCAGCCAGGGAAGGAAGC 
GTTCGCGCACCTGCAACAATCCGATGCCATTGAATGGGGGCGCCCAGTGTCCGGGCCCCCAGATCCAGAAGTCCGCCGACTGCGCCGCATGTCCA 
GAGGACACTCAAATCGTGAGCCCTGATGGATTTGACATTTCGTCGAGTAAGCGCATGGCCCGCTGGTCGGCGTGGAGTGACTGGAGCATCTGCTC 
CGCGGAGTGCATTCAAGTGCGTCGCCGGAAGTGCCTGACTCAGGGCCAGACTCAGATTTCCTCCGAGGCGGAGGAAGCCGGTGATCTGCTCCTGG 
GAGCTCCCGGAGTGGGCATGGCCGCCCTCATCGCCGCTGCAGGAGTGGGCGCAGTAGGAAGCCCCAGCGAAGCGACTGGCTCCAGTTCCGACATT 
ATCCCGGGATATGGCAAATCATTGTGCGCCGGAAAAGACATACAAACGGCCGAATGCCGCGGCGAACAGTGCCAGATTGGCAAGGATGGTAAGTC 
TGTGCTCATGACTAGGATGAATGCCTGGCAGGATCAACAAAGTAGACAGTTAGACAACAATAACAGCAAAGCCGGAGTGCAGCATCCACAAAACC 
AG AAT C C C AGAAT CC C T C GC AC AAAAG ATG AGC AC T C AAC GC T GAT T GT C AC AATG GG AAAT T AA 
{ SEQ ID NO: 734) 

Start ATG: 479 

MPAPSASSPHRQQQQQQLQVPLQVHQLHRKRRTKGHLHHLHHLNHETTGQLTAKHGVSGELPQLDYGSLSASLEEDAIDPLTAMAPFANSLVTEE 

SAQHKNNAELLGNSSEDENVRPQQGSSSSGLGSSGAAGGSGILLEEFNSGKLSPGEASNTLPIFLIEPESVFVVKNRPAVLKCKASHSLQEVTAT 

IHRDLVDEFFGDGPFKCECHAWSSRGVVKSQAATVHIAYIRKSFNQSPTSLRLELGSRAELRCEPPGGFPEPKLTWHKNNAVITADSEPGITVSA 

GTLIFRQVALQHMANYSCSAENIAGRRVSDSAVLIVYVNGGWSTWSPWRECKCAGKPSQGRKRSRTCNNPMPLNGGAQCPGPQIQKSADCAACPE 

DTQIVSPDGFDISSSKRMARWSAWSDWSICSAECIQVRRRKCLTQGQTQISSEAEEAGDLLLGAPGVGMAALIAAAGVGAVGSPSEATGSSSDII 

PGYGKSLCAGKDIQTAECRGEQCQIGKDGKSVLMTRMNAWQDQQSRQLDNNNSKAGVQHPQNQNPRIPRTKDEHSTLIVTMGN* 

(SEQ ID NO: 735} 

Name: UNC-5 like protein 
Classification: cell adhesion 



Celera Sequence No. : 142000013384585 

TATATATTTACGATTAATTGTGTAATGAAAAACGTATTTTATATAAACAAAGATATATCAAATAAAACTAAAACTTAAACATAGCGATCGCCTTA 
CCTGTGGACCGATTAATGAGTCGAAAAGTAATTTTATAATATTTTTTATTTAAGTTCCTTTCCATTTGCGATTTCAGTTTTGTTTTGGTTACGAA 
AAAAACTTACAATTACTTTGTTACGAAATCTTGTTCTCAGTTTTTTTTTCAATGAAGTGACGACGATGACTGGTTTAGTCATTAAAACTACAAAA 
TAGTAGGAAGTTGGTTTTTATTTATATTCTTTTTTTCTTTTCTTTTCCTTTATATATCAGCACTATACTCGTAATTGCCTTTCTTTAAAATAACA 
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AATTTCGCTGTGGTGGAGCCTTTTTAATGTTCAAGTCTAAAATCATTATTAGTGAAAGGTAGTATTTGACCAAAAAAACAAATTTCGAACGCTTA 
AAAATATCACATTAAAAAATCAAACCGAATCCGATTTCATATATAAATAACACAAATTTATTTAATTTTGTTTTGGAATTTGGAATAGGTTGTCT 
TAAGTTTTGTATACATAGATTTATTTCGTTTTTCGTTTACCACTTCATTTTTAGTCTTTTGTTGTTGAGAATTTTCCTTGCTGTGGCCAGTTTTT 
TTTTTTTTTTTTGAATGAAAATCATCTTTTACGAAGTTTACTGCCCTGACATTGGTTGCATTAATATTACGCATTACGCATTAAAATAGGCTTTT 
CTCTTTACTCGTTTAGCTCCATTTTCACTTAACACTGTACACATATAATGTAAACTAATTGTATTTATATTTATTATGCAGTTGAATAGTTTATA 
GTTTAATATGTCAATGTGCAATTCATGATGCAGTTTTTCATCGAATGTGACTCGTTACAATGTTTTCGGTTGCTAAGCGAATTAAAAACAAAACT 
AGTTTCCGAAAAATACAACTCAAGAAATCTCCAAATGTTTTTTTTTTTTTACAACTTCTGTATTAACGTGATATAAATATAGACTGAATTTAGAT 
TGGATTGGATCGACTGGCTGTTGGCGGTTGCTTTTCGTGTTACGGTAGTTGTTACTGTTAATTTAGTAGTTATCGTACTTTAATATATATTTATA 
AAATAGACTTTCTCCGTTGCAAGATGCGAATTTTCGTTTCTGTTGTTGCAGTTTTCGGTTTGTGGTACATGCACGTTGCCAAAACTTACGCATCA 
TGTTGCGTACCTAAAAGACAAGCACACAGATTACTAAGTTTGTTTACTCGCTTTTCCAGCGACAACTCACCTTTTTGTCCAGAGACGCGCAATTC 
GATCGTGCTCCTCACGATTCTGCAAATATTGCGTGGCAATGCTGCCCACCAGCGGATCGGCTGTAAAGACACATGATATAAACATGAGATCATAT 
TTACTACAGTCCTAGGAAAAGTTAAATCAAAGTGGAGAACACGACACTGTATATCAAAGTCTAAAAAAAATAAGTCTCACCTGGATTACAGTCTG 
TGAGCAGGGAGCAAATTGACAGCAAGACCTTTGATATGGTCAGCGCCGGCGACCAGTTGTCCTTGAGTATGTCCAGGCAAATGACGCCCTGGCTG 
TTGATGTTGCAGTGATAGATGCGCGTGCGGAACGTGACTTTGGGTGGCTTGAAGGGATACTCCGGCGAGAAGTGTATGTCGAGGAAGAAGACGCC 
GCCCTCGTAGACGGATCCGGGTGGTCCCAGTATGGTGGATACCCACTCGTACAGGTTGTCGCCCTTGGGCCCGGCACTGCAGTTGGGCGGTGGGT 
CCAGTGTGATCTCAGCCAGCTCCTTCTGTATGCGCTTGGCCGAGGTGCCCAGCGCCCTGGATATCCTGGGCGTTGTTTTGGCTTCCTTTCGCGGC 
TCATCTCCGCCACCGGCATTGCTCCCGGAGGCACCACCGTTGGCGTTGCTGCCCCTGCCGCCACGTGCCTGCGGAGTTCCGGCAGTCGTCGGCTG 
GCTGGTATTGCTCACGTTGCTGGCCGTTGTGCTGGGTGCACTGGGCGCATTTGAGGTGGCACTGGAGGTGGCCACCTCGGCGGCACTGCCCGCTG 
CTGGGGTTGAAGACATTTTTGGTCTCGTTTCTTCGAACTGAAATGAGTATTTTGATGTTTGGTTAGATAATGGTTTATGTAAACGATCCACTATG 
TTCAGCAGGGAATTTCCATAGAAAACTGGTGAAATATGAAGCCAAAGCCTACTTGGGATTCACATTTATTTTTAAAGGGAAATCCCTTCTTGCCT 
GCACTTTGTGCGTACGAATGTAGTTGATGTGAAAAGTATGGCCAGTGGAGTCGGCAATTATACGCGAATCTTGGACGGGGCTTCCCCAGGGTCCT 
TGACCTGGTTTTCCACCCAGTTTGTGCCCCTTTATGATGACAGGGGTTTGCTTTTTTTCGCACAAACGCACGCATTTCTATTGAGCAGCGGCCAT 
TTTGGCAAACCCCCGACAGTAAAAAGTGGGTTTTGCATGGGATTCGTTGTCAGGCCAGTGGATGCTGGACAGGATTAGGTGATAGCTGGACAAGC 
ATTTGCCCCCAAAGTTTCGAGAGCATGTGCTGGATGCCGCAAAAAATCAATCAGCAGACATTTGCAAAATGCCGCCGCCGACGGCGGAGGATGAT 
GATGACGGTGCTGACTCCGCCGCCTGCAGCTCGATTGCACAATTTCGCGGAATTTGGACGCTGATTAGGTGGGGTCTATCTTGGGACCAGCCCGT 
GAAGCACTATACACACCTCACTATTGCAGTTTCTTTCACTTTTTTCTATCACAACACTTGCGAAAATGATTCTCCCTGTGTCGCCTGCTGCCCTC 
TCGCTCGCACTCTCGCGCACACGGCCTTCGCAGTGGATGCACGGCTGTTCCGCTGGCGTCCTTGGTGGGTGGCAGTGTCGGCGCTCCAAACTCCT 
CGGCGGCTCACGGTCGCGATCGTTGTTATTTCGCTAAATTACAGTTGGCAAATTACGAGGCGACGCAGAGCCGACTTTATCTTTGTTAGAAACTC 
GATAATTTGTATGTTTGTAGCCAGCTATCGGTCCGGGTCCGTCTCTAGGTGCTCGCGCTGTCCGCTCATCCTTAGCGAACTATCGATGTTCGGCT 
TCTGGCAACCGTTTTTTGCCTTTTCTAGCACTGTGGCAACTCGGAAGCGGATACCATAAAAATAATATTCAAGTATTGAAATCTAACTATTTCGT 
AAATGAAATAAAAAACAAGAATATTTACACCATGTATAATACACTTTGAATACCTCTCTGCTTCACCCACTTCGATAATTTAAGTGTAACACCCC 
AGATGTAAAATCTTTCAATATTTACTTTAAAGTTTAAACTAGTTTTGTATTTATAAAAAGTTTACAAGTTTTGGGATTACTTCGTCTCCGCTTCT 
TCCGCTAGCGCCTTGCGAGATTTGAATACGCGCACCGTGTTGTCGACACCACCAGATACAATGTACTTGGGATTGGACCAGTCGATGTCCAGGAC 
CTTTTCACCGTGACCCAGAAGATCGTACAGTGGGGCCTTGGGACTCCTGCAGTCCCACAGCTTGTTCTGGTTGTCATAAGCGCCGGAGACAAATA 
GGAACTCCTCCGTGGTGGACCACATGACGGTCTGCACCCAGGCATTGTGTCCCAGATAGGTATTCCTTACCACTGATCCCTCTGAAAGATCCGAA 
TTATATTGGTTAGGTGGAAGTTCCATTGTTGATATTGGAGTATTAGATCTTACGGTTTGTCCTAGGATCATAGAGCCGCAGATTCTTGTCCGCCG 
AAGCAGTCAGTATCAGCCGATTCAGCTTTGAGTAGCTCGCATCGAATATGGACTTGTTGGTGGATATTTCGGTCTTGATGCCCTCAAGACTCAGA 
TCCCACACCTTGAGTGTGTGATCCCAGCTGCCAGTTAGCAGAGTGGTAGCATCCATCCACTGCACGGCTGATACGCTCTCACGATGTCCCTGCAG 
CGTAATTTTGGGGGTCTGAAAGGAATTGTAAAGCTGATTACAAGTCTAATTATAAATGGCAAGACTACT 
{SEQ ID NO: 736) 

Exon: 3059.. 2772 
Exon: 2127.. 1506 
Exon: 1390.. 1306 
Exon: 1245.. 1001 

Start ATG: 2106 (Reverse strand: CAT) 
Transcript No. : CT20873 

TACAAACATACAAATTATCGAGTTTCTAACAAAGATAAAGTCGGCTCTGCGTCGCCTCGTAATTTGCCAACTGTAATTTAGCGAAATAACAACGA 
TCGCGACCGTGAGCCGCCGAGGAGTTTGGAGCGCCGACACTGCCACCCACCAAGGACGCCAGCGGAACAGCCGTGCATCCACTGCGAAGGCCGTG 
TGCGCGAGAGTGCGAGCGAGAGGGCAGCAGGCGACACAGGGAGAATCATTTTCGCAAGTGTTGTGATAGAAAAAAGTGAAAGAAACTGCAATAGT 
GAGTTCGAAGAAACGAGACCAAAAATGTCTTCAACCCCAGCAGCGGGCAGTGCCGCCGAGGTGGCCACCTCCAGTGCCACCTCAAATGCGCCCAG 
TGCACCCAGCACAACGGCCAGCAACGTGAGCAATACCAGCCAGCCGACGACTGCCGGAACTCCGCAGGCACGTGGCGGCAGGGGCAGCAACGCCA 
ACGGTGGTGCCTCOGGGAGC AATGCCGGTGGCGGAGATGAGCCGCGAAAGGAAGCCAAAACAACGCCCAGGATATCCAGGGCGCTGGGCACCTCG 
GCCAAGCGCATACAGAAGGAGCTGGCTGAGATCACACTGGACCCACCGCCCAACTGCAGTGCCGGGCCCAAGGGCGACAACCTGTACGAGTGGGT 
ATCCACCATACTGGGACCACCCGGATCCGTCTACGAGGGCGGCGTCTTCTTCCTCGACATACACTTCTCGCCGGAGTATCCCTTCAAGCCACCCA 
AAGTCACGTTCCGCACGCGCATCTATCACTGCAACATCAACAGCCAGGGCGTCATTTGCCTGGACATACTCAAGGACAACTGGTCGCCGGCGCTG 
ACCATATCAAAGGTCTTGCTGTCAATTTGCTCCCTGCTCACAGACTGTAATCCAGCCGATCCGCTGGTGGGCAGCATTGCCACGCAATATTTGCA 
GAATCGTGAGGAGCACGATCGAATTGCGCGTCTCTGGACAAAAAGGTACGCAACATGATGCGTAAGTTTTGGCAACGTGCATGTACCACAAACCG 
AAAACTGCAACAACAGAAACGAAAATTCGCATCTTGCAACGGAGAAAGTCTATTTTATAAATATATATTAAAGTACGATAACTACTAAATTAACA 
GTAACAACTACCGTAACACGAAAAGCAACCGCCAACAGCCAGTCGATCCAATCCAATCTAAATTCAGTCTATATTTATATCACGTTAATACAGAA 
GTTGT 

{ SEQ ID NO: 737) 

Start ATG: 310 (Reverse strand: CAT) 

MSSTPAAGSAAEVATSSATSNAPSAPSTTASNVSNTSQPTTAGTPQARGGRGSNANGGASGSNAGGGDEPRKEAKTTPRISRALGTSAKRIQKEL 
AEITLDPPPNCSAGPKGDNLYEWVSTILGPPGSVYEGGVFFLDIHFSPEYPFKPPKVTFRTRIYHCWINSQGVICLDILKDNWSPALTISKVLLS 
ICSLLTDCNPADPLVGSIATQYLQNREEHDRIARLWTKRYAT* 
(SEQ ID NO: 738) 
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Name: Ubiquitin conjugating enzyme 2 
Classification: enzyme 
Gene Symbol: UbcD2 
FlyBase ID: FBgn0015320 



Celera Sequence No. : 142000013384588 

AGCGACACGTGATCACCACCCaGACGGAGCACAAGTGCGTGCTGGACTCTTGTCGTGCTTTAGAGAACGAGGGCTTTAAAGTCACGTACCTGCCA 
GTGCTGGCCAACGGACTCATCGACCTCCAGCAACTGGAGGAAACCATCACATCGGAGACCTCGCTGGTGTCCATTATGACGGTGAATAACGAAAT 
CGGCGTACGGCAACCCGTCGATGAGATTGGCAAACTGTGCCGGTCTCGCCGAGTATTCTTTCACACGGATGCCGCTCAGGCGGTGGGAAAGGTCC 
CGCTGGACGTTAACGCCATGAACATTGACCTGATGTCTATATCCGGCCACAAAATCTACGGCCCAAAGGGTGTAGGCGCCCTGTATGTCCGACGA 
AGGCCGAGGGTTCGCCTCGAGCCCATCCAAAGTGGCGGCGGCCAGGAGCGTGGCCTAAGGAGCGGAACGGTTCCAGCCCCGCTCGCCGTGGGATT 
GGGAGCTGCTGCGGAACTTTCTTTGCGTGAGATGGACTACGACAAGAAATGGGTGGACTTTCTATCCAATCGCCTGCTGGACAGGATTTCAAGTG 
CCTTGCCGCATGTCATCAGGAATGGTGATGCCAAAGCCACATACAATGGTTGCCTGAATTTGTCGTTTGCCTATGTCGAGGGCGAGTCACTACTG 
ATGGCTCTTAAGGACGTGGCCCTGAGCAGTGGGTCCGCCTGCACCTCCGCCTCCCTGGAGCCCTCCTATGTTCTGCGTGCCATCGGCACCGACGA 
GGATCTTGCCCACAGCTCCATACGCTTTGGCATCGGGCGCTTTACCACCGTCGAGGAGGTGGACTACACCGCCGATAAGTGCATCAAGCATGTGG 
AGCGACTGCGCGAGATGTCGCCGCTGTGGGAGATGGTCCAAGAGGGCATCGACCTAAAGACCATCCAATGGTCGCAGCATTAATGTACTAATATT 
TAAGCCGTTACTGTTACACCACACTTTGAAAGCGAAGGGAAAATAAATTTGTTGGAAGAGTATCTAGGTTTATTTGTACACAAAACCACTTATAA 
CTAAGCTTAGTCCGAACGAATGTGCTTGTGTAAGAGGTGAGGCAAGAGAGACCCCAAAAAGCGAAAACGGCTCAAAGTGCGACGGCCAATGATAT 
GCAAATGATGCAATAATTTGCATTCGCCGCTGGTCAAGGTTCAGTTGGCCTTCTTCTGGGGTGAGCCGCTGGATCCACCGCCCAGGATCTTACGG 
CGCTGGGCGAACATGTGCAAGTACAACTGCGGGAAGATGGGGATGTAGCCGAGCATGACGATCCACAGGAATCCAAAATACGAGAAGGTGGCGTT 
CCACTTGTTGGGCATCACCACACTCCACACGCTGTTCTCACGGGCATAGCTCTGGGCCCACCAGAAGCACAGCAGCTCCCCAGTCACGCCGATGG 
GATACAGGACAATGAATGTGGTGTAGCGCAGGAACACCACAAAATGCGGCACCACCTTGACGATGTTCAGAGCGTAATATCCATATCGGATGATC 
TCAGTGATGGCCCAGGCCAGCAGGGCGATGGGCAGACCGGGTGAAACCTTTCCAGTAGGCGTGGCCATCACCACGCCGACGACCACCATCATGCG 
GCTGAACACCTGGAAGCCAGTGACCACCGGGTTCGACTTGACCAATCCAAAAGAAGCATTGAGTATCTCCACGAACGCGGCATTCTGGAAGATGA 
TCACGGCCAGCCGAGTATACTCCCACAACGTCACCTGGGCACGGAACTCCGGACCCTGCAATATGTAGTAGTTGACCAGCTGCCACAGGATGTAG 
CTCCATCCGACCACCTGGCCGGCGTTGTACGCAAACAGGTACAATTTAGTGACCGCCGACGGCTCCTTGGAAGCGCCTGGTTTGCTGGACTTGGA 
CACTGCCTTTGCTGACATGTTGGAGCGGAGATGTGTGAACTTTCACCTTGCCGATCGCCGCTGGTTAACTAATCAGTCGTCACAGTCTTATCGCC 
CTGTAGCGCGATAATTCCGAAGTCGCTTTCACCTGATTGGGTCAGTATGACCGCACGGCGACCACACAAAAAATACCAAAAAAAAGGGCTAAGAA 
ATATATATAACTATTAAATACCCTTCCATATAATATCTTTTATAAATAAAAACGCTTTGATATCGAAAATTGTTCATTTAATCATTTGTAAATTA 
ATAATCATTTAATTAAAATTTCTTGATATAAAATTTCATTCGCCTTTATTTTCTCTGAATTTTGTTTTTTGCTATTCACAATTTTTCCAACGAGT 
ATAAATAAAAGATGAACTTCATTCTTTCAAAGTCAACGGGTACAAAGTGCATAGCCTTATAACCTTTCCTCTATTCACAGATCGCAAATCTAAGT 
TAACAATGCCACAACCATTAGGCATGCTTAGCAATTAAGTCGTTTTTATGTGCAAAGTTATTTCACTTAAGTTAAGTTAAGTCTCAAAACGTTAA 
AATTCTTTGAAAACTATGATTTTAAATCCTATCAACTTACTTTAAAGCTGAATTACCGTTTCTTATGTTAGTACAAATTAAAGTAAATAAACTAA 
ATAATAACACAATTTATATTTCTTCGAAAATATTCTTAGTATCATTAGTAAAACTATAATAATTTTTAATTATAGTAAAAATTAATACCTATCAT 
ATATATTCTTTCTTTAATGTATAGTAAAGCATGGTTAAGAACAATGGATAATGGGTGGTTATGAGCCGACATATAAAAATTTATCGCTAACGACA 
AGGTGGAGAAATCAATTTTTTTGATAAGAACTAGCCCATTGTACGCTTTGCATTTCTAACCACATTTCGCTTAAAGCAGCCCACTGTAGGCGGCA 
GCTGACGTTTCCCAGCGAACGACGGCTGGTTTGCGCGATGAGCCAGCAAGGCAGCTAAACTTTCTGGTCGCCTGGTGGAAACCGAACCGAAAACC 
GAACGTGTCTGAAAAAGTCCGCCACCACCACGGACCGGTGCACGTCCCCACCTGCCAAGTCGCGTGTGTGTGGTGTTGCCTGTGGACCGACCCCA 
G 

{ SEQ ID NO; 739) 
Exon: 2041.. 1001 

Start ATG: 1918 (Reverse strand: CAT) 
Transcript No. : CT20927 

ACTGACCCAATCAGGTGAAAGCGACTTCGGAATTATCGCGCTACAGGGCGATAAGACTGTGACGACTGATTAGTTAACCAGCGGCGATCGGCAAG 

GTGAAAGTTCACACATCTCCGCTCCAACATGTCAGCAAAGGCAGTGTCCAAGTCCAGCAAACCAGGCGCTTCCAAGGAGCCGTCGGCGGTCACTA 

AATTGTACCTGTTTGCGTACAACGCCGGCCAGGTGGTCGGATGGAGCTACATCCTGTGGCAGCTGGTCAACTACTACATATTGCAGGGTCCGGAG 

TTCCGTGCCCAGGTGACGTTGTGGGAGTATACTCGGCTGGCCGTGATCATCTTCCAGAATGCCGCGTTCGTGGAGATACTCAATGCTTCTTTTGG 

ATTGGTCAAGTCGAACCCGGTGGTCACTGGCTTCCAGGTGTTCAGCCGCATGATGGTGGTCGTCGGCGTGGTGATGGCCACGCCTACTGGAAAGG 

TTTCACCCGGTCTGCCCATCGCCCTGCTGGCCTGGGCCATCACTGAGATCATCCGATATGGATATTACGCTCTGAACATCGTCAAGGTGGTGCCG 

CATTTTGTGGTGTTCCTGCGCTACACCACATTCATTGTCCTGTATCCCATCGGCGTGACTGGGGAGCTGCTGTGCTTCTGGTGGGCCCAGAGCTA 

TGCCCGTGAGAACAGCGTGTGGAGTGTGGTGATGCCCAACAAGTGGAACGCCACCTTCTCGTATTTTGGATTCCTGTGGATCGTCATGCTCGGCT 

ACATCCCCATCTTCCCGCAGTTGTACTTGCACATGTTCGCCCAGCGCCGTAAGATCCTGGGCGGTGGATCCAGCGGCTCACCCCAGAAGAAGGCC 

AACTGAACCTTGACCAGCGGCGAATGCAAATTATTGCATCATTTGCATATCATTGGCCGTCGCACTTTGAGCCGTTTTCGCTTTTTGGGGTCTCT 

CTTGCCTCACCTCTTACACAAGCACATTCGTTCGGACTAAGCTTAGTTATAAGTGGTTTTGTGTACAAATAAACCTAGATACTCTTCCAAC 

(SEQ ID NO: 740) 

Start ATG: 124 (Reverse strand: CAT) 

MSAKAVSKSSKPGASKEPSAVTKLYLFAYNAGQVVGWSYILWQLVNYYILQGPEFRAQVTLWKYTRLAVIIFQWAAFVEILNASFGLVKSNPVVT 
G FQV FS RMMV VVGV VMAT PTGKVS PGLP I ALL AWAI TE I I 

VMPNKWNATFSYFGFLWIVMLGYIPIFPQLYLHMFAQRRKILGGGSSGSPQKKAN* 
(SEQ ID NO: 741) 

Classification: hypothetical 



Celera Sequence No. : 142000013384588 
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GCGTACTCAGTGGCGGTGAGATCGGCATCACGATGGTCCAGTTGGTCAGCGGGTTGCTTATTGCGGAGCAGGATTGGCCCGTGTTCTTCTATCTG 

GTTGGTGGAGGAGCCGTGGCCTGGTTCTTGGGATTTGTAAGTCTATACGAAGTCTAAAGTCATGATAATCATAGTTTAAACCAATTTGTTCCCAA 

GACCTTAGTTTGCTATAGCACTCCGGATCATTGTCCGTTTATTCAGAGCGAAGAAAGGGAATACATAAGGTGCAATACTAGCAACTCCTTTTTGT 

TGACCACTGGCAGAGAAAGAGAGGAGATGGACGGGGAAGATGGATATGAAGGTGAGGATAGGGAGCACCGCAGAGAAGTGGAAGCAACGTGCAAC 

ACTGCACCTTGGCGGAGCATGCTTAACAGCACGCCACTCTGGGCCCTGGTTTCCACCTCCATGCAGCAGGAATTCCAACAGAAATTGCCCCAGGA 

GCTGCAAATTGCGCTAGAGGAAGTAAGAGCCCGTGGCACCAGTTTCTCAGAACTAACCACCATCATCGAAACGATAGCACCGTCCGTTGGAAACT 

GGATAGCTTCACTGACAACAGGGCGGCTGAGCGACGTGCTAATCGAGCAGCAGATTCTCACGCGCACTCAGACACGCCGCCTCATGTCCTGGTTG 

GTCTTCCTATGCGGATCCATGTACATGCTACAGATCAAGATGAGCGGAGCACGAATCTGGAGCGTTCTTGGCATGGGAGCCTACTATGCGAGCAT 

TAAGCTGCTACCGCTGGATATGAGTCCCAACTATGCTGGCACTTTGATGGGCATTTCGGGCGGCATGGGCGCGCTTCCAGCACTTCTGATGCCCT 

ATCTGGAACAACTGGAGACCGACTACAAGCTGGTGAGCAGTGTGAGGGCCGCCATGTGGGTAATCGGCGCCAGCTACATATCCGGCGATGTCCAG 

GCGTTTAACCAACCGGAACGGGAGCCACAATGAGGAGACTTGCTTGTTTTAAGTTTAATCTTACGCTTAAACATAAGCTACTGGATTTGTCTTAG 

TGGATTCCTTAGGCAACAGATTCTAGTTTACATATGTACCATAATACATGACAATTTCTATGCTAAAGCTAGGAGCATAAACGTAAACAAGCTTT 

ATATATATATATATAAATAACAGGTATGGTCGATATTAACTTAATTAGAGTAGGACTGCTGGCTAAATGCTCCGCGCTGGCTCTTATTTTCTCGG 

GATATCAATCCCAACACACACCGGTTTCATACTCTATGCAACGCCCTGGGCCATCAATCCGGATTGGGCCATTGCCGACACTGAGTTCATGTCGA 

TACCCAGCTTTTTGGTGACGTTGATTTGGGCCACTGCCGCGTTGCGCAGCGAGTAAACGTGCACCAACTCCGCCTGACTCTTGGCATAAAGCAGT 

GCCTTTTCAAAAAGCTCGACGGCCCGTGTGAGTTGCGCCCGCTGAACTTCGACTGTTCCGAGTGTCTCGTAGGCCAGCTCGCATTTGGGATCTAC 

TTCGATCGCCTTGTTGAGCAGCTGGACAGCCAAATTAATGTCTCCGCGCCACTGAAGCACCATGATGGCCTGGTGAACGATAAGTGCAGGATTTG 

TTGGCGCCAA7ACCATGGCCTTTTTGTAGTATTCCTCCGCCTGGGTAAACTGTTGCTGGTCGGCCAACACCTGAGCCGTTAGACTATAGCACTCA 

ACGCAGCTCGGGAAACGCTCGATGGCATTTTGGAAGGTGTGCATTACGCTCTCCAAGCGCCGCTGGTCGCCGGCCAGGAGGGATAGTCGGTACTC 

GGCATAGCACTTCTGGACGAAGGCGATGGCGTGGTTGGGTGCAATGCTCACCGCCTTTTCGAACTCGGCCAAGGCGGGCTCTATCTGCTCCAGCA 

GCAGAAGTATCTGGGCGCGCTGGTGGTACACATCTGGGTTCTCTGGGTTCAGGCGTTCCGCCTCAGCAAAATCGGCTATACCCTTTTCGCGCTGA 

TCCAGCTGGATGTAGAGAGCTGCCCGCTTTATGTAGGCATACGCACGCAAGTTGGGATCAGCATAGTCTGTAGTGAAATAAAGTTAATAGAATAG 

AAATCAAAAAACAACTGAAGAAAACCGACTAATTAATGAATTAAACTTAAACAAATTCTTACCATTAGCTATAATAGCATCAAAGTCCTGCTGGC 

TTTCTACATAGGAGCCGCACAGCAGATGGAAAGTTCCCCGCATAAGCAGAGCCTCCACTTTGTACTGTGCTTCGGCCTCGGAGGATTCGATCTCT 

TCGGTGCAGGCGGGAATAATTTCCTCGTACTTCTCCTCGAGGAAAGCGAGGTGGGCGCGCAGGAAACCCTTTGGGGGTGCATCGGATTTAGGGGC 

TGGGACTTTCATGGTCTGCAGCGGATCCGCGATGAAGGAACGCGTGTAGGTGTTTACGAAGCAAGCCGACGGCACGACTGGCACCCGATTGCGCA 

TACCCTTCTCGGCGTCCAGTCGGCCGGTCTCCTTGAGCACACGGTCAGCGAACATGATAGTTTGATTATTTTGGAACATCTCCAGGATACAGGTG 

GC T GT G AC AT C GT C C AAG C AC T C GT T CAT AT CCTTGGTGGCTTCGT G AGC ACGGGC AC G AC G AT AAT ATGC CT TGGCGT AAC G AGG ATTGAAC TC 

CAGCGAGGCTGTGCAGTCCTCCTTGACGTTGGACCACTTCTTGAGCATCTCGTAAGAGGCGGCGCGGTTCTGGTAGAAGATGGCCATGTCAGTGC 

GGTGCTCCTTGGGACACTTGTCGATGGCCTTGTCATAGAATTTAATGGCCTCATCGTACTTCCCATTCCTGTAGCAGTTGTTACCCTCCGTCTTG 

TAGTTGTTGGCCTCCTTAAGAGGCGACAGCTTCTCGCCCAGCTCAGCGGACTTTTGATTGCGCTCTAGCTCCTGGTCCGGTGCGGTGCCGTCCAG 

GGAGATGGCCTGCTTCTCGATCTTGGCCTTTGGTCTCTTCTTCTCCCCGTCCAACTCCTTGGACGCAGCCGTCCAGCTCTTCACCGCATAGGTGC 

CCAGTCCAATGGCCAGGGGAGTTCCAAGGATGAGCGCCAGCTGCCACTTTGTCAGCTTCATCGAGCCAATGTTCAGAGTCATCGCCATTGAGCCG 

GTCTTTTTATGGATGTGTGTTATGTGTGATGAACTGCAAGAAAAGAGCTGAATCAGTTTGGGTGCCGCCACAAATTGTGTAATACTCCGCTGGCT 

TATCGCTCAGCGCCTTTTGAGCTTGCTAAATGCCGGGCGGGGGGCACGTTTCTGCAGTACCTAACCCCCAACCCAATTGATTTTTCACACCCCCA 

ACTGGCCGGGATGTGAATCAGTGCAGTCAGCGAACGGAACTTACACTTTTTTAAAACTTTACACTGCTGTGAATTTGGCAATTGTTAAACGTCGA 

TTGTTCTATTTTCAGTTAGAACTCGGGAAACTTAGATTCTATTGTATATCAGCCTGCTGAGCACACGACATCGAAAAACGATAGACATCGATAGG 

TATGGAGTAATAACGGCACTGCGAATGCTTATTATGGAAGAAGAAGATCAGTGAGAGAATAAATTGTGTGAATAGTTTAAAAAAAAACTAAGGAG 

TTCGAATATGATGTATTTGTAATAAAGCCTGAATTTATACTTTACGAATACTGTACGAATAAATATTTTAGATCTCTATATTACATTCTATCAGA 

CCGAAACAGAGTTCTTAAATTTTAAAATCGGTTTCATCCACTTTACAAATTGAAATACAAATGCATTGTAATATTACAATCAATGTATTTATAGG 

AATTCACTAGAATACACGGACAGCCTATGGGTGATTCTCCTCATCTGTGCTGGAAGTTGGAGCTGGAGCAGGAGCAGAAGCGGAACTGGTAGTCG 

ACTTATCGGTTTCCGTAACCAAGATCTTGTCCAGATTCTCCAGTTTAACTAGGCCCTGGGAATCGGCGTTGTGGATGAGATCACATATTGTGGGC 

GACTCCACGACCAGGATATACTGGCACGTCTTCGGCTCAAGCAGGTACATGGATACAGCGGTGCCGCTGTTCGTTACAGGTGTGCAGGTTAACTT 

CACGTCCACTTCGCGGGGCACTCCAATCTGTTCGCAGTGGGTTCCCTTGCCATAATGGTGCCAGATGGATGTCGTGAAGCCGGAACGGCGTGCAC 

CCTTGTCCGGGTTGGCGTTGGACCAAGCACGATGCGCTTCCTCTGAGAAGTAGCCAAGAAAGAGCTCCACTTCGCTGGTCTTATCTTTGTGGAAC 

TGACGCACATGTCGGCCATAGCAGAACTCGTACTTCCACCAGCCATTGCCCTAAAAAAAAGAGTATTATTGAAGCAAATGCATGGGTCAGCTGCA 

TGTTATTCCTACCCCAGTTAGGCAGTTCTTGCCCGATATAAAGTCCTTAAGGGGCGTCAGATCAGTGATGGGTGTTGGTCCAGTTTCAGAGGCTG 

C TAT AG AC AGC AGG A 

(SEQ ID NO: 742) 

Exon: 3480.. 3370 
Exon: 3122. .2153 
Exon: 2062.. 1001 

Start ATG: 3122 (Reverse strand: CAT) 
Transcript No. : CT20965 

TCAGCAGGCTGATATACAATAGAATCTAAGTTTCCCGAGTTCTAACTGAAAATAGAACAATCGACGTTTAACAATTGCCAAATTCACAGCAGTGT 
AAAGTTTTAAAAAAGTATGACTCTGAACATTGGCTCGATGAAGCTGACAAAGTGGCAGCTGGCGCTCATCCTTGGAACTCCCCTGGCCATTGGAC 
TGGGCACCTATGCGGTGAAGAGCTGGACGGCTGCGTCCAAGGAGTTGGACGGGGAGAAGAAGAGACCAAAGGCCAAGATCGAGAAGCAGGCCATC 
TCCCTGGACGGCACCGCACCGGACCAGGAGCTAGAGCGCAATCAAAAGTCCGCTGAGCTGGGCGAGAAGCTGTCGCCTCTTAAGGAGGCCAACAA 
CTACAAGACGGAGGGTAACAACTGCTACAGGAATGGGAAGTACGATGAGGCCATTAAATTCTATGACAAGGCCATCGACAAGTGTCCCAAGGAGC 
ACCGCACTGACATGGCCATCTTCTACCAGAACCGCGCCGCCTCTTACGAGATGCTCAAGAAGTGGTCCAACGTCAAGGAGGACTGCACAGCCTCG 
CTGGAGTTCAATCCTCGTTACGCCAAGGCATATTATCGTCGTGCCCGTGCTCACGAAGCCACCAAGGATATGAACGAGTGCTTGGACGATGTCAC 
AGCCACCTGTATCCTGGAGATGTTCCAAAATAATCAAACTATCATGTTCGCTGACCGTGTGCTCAAGGAGACCGGCCGACTGGACGCCGAGAAGG 
GTATGCGCAATCGGGTGCCAGTCGTGCCGTCGGCTTGCTTCGTAAACACCTACACGCGTTCCTTCATCGCGGATCCGCTGCAGACCATGAAAGTC 
CCAGCCCCTAAATCCGATGCACCCCCAAAGGGTTTCCTGCGCGCCCACCTCGCTTTCCTCGAGGAGAAGTACGAGGAAATTATTCCCGCCTGCAC 
CGAAGAGATCGAATCCTCCGAGGCCGAAGCACAGTACAAAGTGGAGGCTCTGCTTATGCGGGGAACTTTCCATCTGCTGTGCGGCTCCTATGTAG 
AAAGCCAGCAGGACTTTGATGCTATTATAGCTAATGACTATGCTGATCCCAACTTGCGTGCGTATGCCTACATAAAGCGGGCAGCTCTCTACATC 
CAGCTGGATCAGCGCGAAAAGGGTATAGCCGATTTTGCTGAGGCGGAACGCCTGAACCCAGAGAACCCAGATGTGTACCACCAGCGCGCCCAGAT 
ACTTCTGCTGCTGGAGCAGATAGAGCCCGCCTTGGCCGAGTTCGAAAAGGCGGTGAGCATTGCACCCAACCACGCCATCGCCTTCGTCCAGAAGT 
GCTATGCCGAGTACCGACTATCCCTCCTGGCCGGCGACCAGCGGCGCTTGGAGAGCGTAATGCACACCTTCCAAAATGCCATCGAGCGTTTCCCG 
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AGCTGCGTTGAGTGCTATAGTCTAACGGCTCAGGTGTTGGCCGACCAGCAACAGTTTACCCAGGCGGAGGAATACTACAAAAAGGCCATGGTATT 

GGCGCCAACAAATCCTGCACTTATCGTTCACCAGGCCATCATGGTGCTTCAGTGGCGCGGAGACATTAATTTGGCTGTCCAGCTGCTCAACAAGG 

CGATCGAAGTAGATCCCAAATGCGAGCTGGCCTACGAGACACTCGGAACAGTCGAAGTTCAGCGGGCGCAACTCACACGGGCCGTCGAGCTTTTT 

GAAAAGGCACTGCTTTATGCCAAGAGTCAGGCGGAGTTGGTGCACGTTTACTCGCTGCGCAACGCGGCAGTGGCCCAAATCAACGTCACCAAAAA 

GCTGGGTATCGACATGAACTCAGTGTCGGCAATGGCCCAATCCGGATTGATGGCCCAGGGCGTTGCATAGAGTATGAAACCGGTGTGTGTTGGGA 

TTGATATCCCGAGAAAATAAGAGCCAGCGCGGAGCATTTAGCCAGCAGTCCTACTCTAATTAAGTTAATATCGACCATACCTGTTATTTATATAT 

ATATATATAAAGCTTGTTTACGTTTATGCTCCTAGCTTTAGCATAGAAATTGTCATGTATTATGGTACATATGTAAACTAGAATCTGTTGCCTAA 

GGAATCCACTAAGACAAATCCAGTAGCTTATGTTTAAGCGTAAGATTAAACTT 

(SEQ ID NO: 743) 

Start ATG: 112 (Reverse strand: CAT) 

MTLNIGSMKLTKWQLALILGTPLAIGLGTYAVKSWTAASKELDGEKKRPKAKIEKQAISLDGTAPDQELERNQKSAELGEKLSPLKEANNYKTEG 
NNCYRWGKYDEAIKFYDKAIDKCPKEHRTDMAIFYQNRAASYEMLKKWSNVKEDCTASLEFNPRYAKAYYRRARAHEATKDMNECLDDVTATCIL 
EMFQNNQTIMFADRVLKETGRLDAEKGMRNRVPVVPSACFVNTYTRSFIADPLQTMKVPAPKSDAPPKGFLRAHLAFLEEKYEEIIPACTEEIES 
SEAEAQYKVEALLMRGTFHLLCGSYVESQQDFDAIIAKDYADPNLRAYAYIKRAALYIQLDQREKGIADFAEAERLNPENPDVYHQRAQILLLLE 
QIEPALAEFEKAVSIAPNHAIAFVQKCYAEYRLSLLAGDQRRLESVMHTFQNAIERFPSCVECYSLTAQVLADQQQFTQAEEYYKKAMVLAPTNP 
ALIVHQAIMVLQWRGDINLAVQLLNKAIEVDPKCELAYETLGTVEVQRAQLTRAVELFEKALLYAKSQAELVHVYSLRMAAVAQINVTKKLGIDM 
N S VS AMAQS GLMAQGVA* 
(SEQ ID NO: 744) 

Classification: hypothetical 



Celera Sequence No. : 142000013384588 

ACTT T T GT AAT GC GAG T TT AAC GAAT CT T T AAT T T AGAAAT AC ATT T AAT AAAAAT AAAAAAC C GAT TCT AAGGC AC TT T T AAC ACG CAATAT AT 
TGCAATGTTTAATTTTATAAAAAAACAGGTTTTTCATCACCTTCTATTTGTTTGAAGTTTGGAACACCACATTTTTAGATAGATCAATGATTCAG 
ACGAAATGCTATTTAATGAAGATAAGGAATATTTATGACTAACCAAAATGCGAATTTAAACATACATAAGAGTCTTGGAATTTTAATTGTCAATA 
TGTTCGAACAATTTCATTAATATGCATTATAGCAAAACCACCTGATCGAGAGTCTGTTTTATTGTTTACCAAATATCAACAACGCAATTCAAAAA 
GCCGGCAAATGGCGTTTATAGAGAGAGAGAGAGCGAGAGAGAGGAGAGAATTCCATTAGAGTGGACCTTTCTGGAAAAGTTTGTTCGGTTTATTT 
TTGTACGTGTGCTTGTTTGTTTGTGAGTGGAAAATAACATAGCTTTTGTAATTTTGGACACCTAATGTAAAAATATCGTTTAGTATTGAAAGTCA 
ATAAAAGCGGCGCAATATTCATAAATGCAACCTTACACGCAAATTTAAACCGGAAGAAATGATGCAGGAATATGTGTGTGCAAGTTTAATTACCT 
GTGTAACTGGTGAACTTTCAATGGAAGGTCTCGGGAATATTTCCGTTTGCAAACTAACATACATACATTACATACACAAATAAATATGTTCATAC 
ATGTTCTCTTTTTGGTCGATCGCTGTAAACAAAGAAAAAAGTCGACTGCAACTGCTTTTGCGATAATCGAGGAGCTGAGTGCAAATGAGTTCAAC 
TAAAAGGCTATAAATATGCACAACAGCACACCAACGGCATTCAAAAAGCTCTCTCGCTTGGAAAGGTGCGCTGCAAATGCAAATGAAAATAAGAC 
GACCAGGCTATTCAGTTTTGGTTTTCTCTTTATTTTATATGTATTACCTAAGTCAAACTTCATTATTTGAAACGCTCTAAGGCTTAGCAACTAAA 
ACTTGCAAGTCTGAATTAGTTTAGGAATATATTAGAAGCTAAGAGTAGTATGTGCCTTTCTTTTTTTGTGTTTGTGAAATTATAAGCGAAATAAA 
ATTATTATAATGTTCACAAAGTTTAGCGCCGCATGAAAAGCCAATGCACAATACAAATGCAATGTGTATTTCCAGCTGGCTTTTGTTTTTCTTTC 
TTTCCTTCCAGCAAAACTGCTGCTTGTTCTTCTTTTTCTTCTCTCCTTTGCTTTTGCTTTCCTTTTAAATTCGAAGATTTCTTTTCTCCTTTCAG 
CGCGGCCCCGTTACTTCTTTCTCTCCTGTTCCACCGCCTCCGTCGATGCCCTGCATCAGTTCTTGCAGGCGGCGGCTTTCTTGTTGTTGTTGTTG 
GTGTTCATCAGCTTGGTCAGAATCTTGCGGGAATGACCGGAGGGATCGAAGTGGTTGGTGTGCTCATTGGCCTCCCTCTTGTTGCGCTGCTTGCC 
GCTGTGTCCAGAGAAGATGTGCTTGTCATGGTCGAAGTTGTAGTGCTCGTACTCATCGAAGTGGGCCTCGGACATAATGGCGATTTAAGGTTGCA 
AAACTAACTGCGTTCTTGGTTGGTGTCTGCTTTTTGTTTCTCTGCAAACTTGCAGGCTGCTTTCCAAATTTCTCTTCTGCTTTTTTTTGTTCACT 
TTTCGCTGAGCTGCGAACGGAATTGAATTTTTTCGGTCGAGCAGCGGCTTTTATAGGCGCGCAAGCTTTCTGTTCCGTAACCAAAGCTCTCTCTC 
CGTCAGAGT TGCCGGATTTAGCTAAAGAAACT ACT AGAAGGGCATTTAATTGTTGCTTACGTGTAGTTACT AAAAAT AT TTAATTTTTCTAT AAC 
TTTCTACTGAACTTTTACAAAATATATTTGCTCTTTTTTATGTAAAAGAATTTTTAGTTATTAAAATTCATAGTTATTGGGCCTGACAACCCTTT 
TGCGCTCAAGAGAGTGCCTCCTTGACAATTTTTGCTGCGCAATTTAATTTTTTGCTGTTTACAAAGTGCGCGGTAAACAAAAAAACTGAAATCGC 
CAAACATATTATTTCGCCTTGTTCATTTATTTTAATAATTTAAGAACTTTTAAGCATACACACGTTCCTGAGTGGTTACATAACACACACCCACC 
TTGGCCCTCCGCAAACGACCATCAAAGTGCATTTACCCTGCATTTGTTTCTCCACCCAATAAATATAGAATTAAATGCATCATATGTTCTGGATA 
CACTGCTCAAAATTTGTTTATTTTTTTACACTCAACAATATTTGAATGTTAGGGCACAAGAAACAATTTAGCAATAATTATAATTCGAATAACAT 
TTTTGATTTACAAAAGATATACTCATAATGGATATTTATTTTGGTGCATTAGAACCTGATCGGTTGACCCACTTCCCTCCCACCAGGCAACGCAT 
TTGTTTATTTCTGAATAGCGAAAAGCTTTCAGTTTGTTTTGGTTTAATGGATTTTATATAAACAAATGTTACGTACATACATATATACATAGCCT 
TCTGGCGTTGATAAGGTGCTAACTATCGCTCTAGTTAGTCATTTGGCAGGTATTCGTTCTGAAAGTTCGTCTTCAAACGACGCAATAGATCGTAG 
TCACCATCTTAGATAAGATAGAACGGCCTCAACAATGTTGTTTCAAGAACCTGCCACAAT 
(SEQ ID NO: 745) 

Exon: 1720.. 1001 

Start ATG: 1595 (Reverse strand: CAT) 
Transcript No. : CT21019 

CTC AGCGAAAAGTGAAC AAAAAAAAGCAGAAGAGAAATT TGGAAAGC AGCCT GC AAGTT T GCAG AGAAAC AAAAAGC AG AC ACC AACCAAG AACG 
CAGTTAGTTTTGCAACCTTAAATCGCCATTATGTCCGAGGCCCACTTCGATGAGTACGAGCACTACAACTTCGACCATGACAAGCACATCTTCTC 
TGGACACAGCGGCAAGCAGCGCAACAAGAGGGAGGCCAATGAGCACACCAACCACTTCGATCCCTCCGGTCATTCCCGCAAGATTCTGACCAAGC 
TGATGAACACCAACAACAACAACAAGAAAGCCGCCGCCTGCAAGAACTGATGCAGGGCATCGACGGAGGCGGTGGAACAGGAGAGAAAGAAGTAA 
CGGGGCCGCGCTGAAAGGAGAAAAGAAATCTTCGAATTTAAAAGGAAAGCAAAAGCAAAGGAGAGAAGAAAAAGAAGAACAAGCAGCAGTTTTGC 
TGGAAGGAAAGAAAGAAAAACAAAAGCCAGCTGGAAATACACATTGCATTTGTATTGTGCATTGGCTTTTCATGCGGCGCTAAACTTTGTGAACA 
TTATAATAATTTTATTTCGCTTATAATTTCACAAACACAAAAAAAGAAAGGCACATACTACTCTTAGCTTCTAATATATTCCTAAACTAATTCAG 
ACTTGCAAGTTTTAGTTGCTAAGCCTTAGAGCGTTTCAAATAATGAAGTTTGACT 
(SEQ ID NO: 746) 

Start ATG: 126 (Reverse strand: CAT) 
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MSEAHFDEYEHYNFDHDKHIFSGHSGKQRNKREANEHTNHFDPSGHSRKILTKLMNTNNNNKKAAACKN* 
(SEQ ID NO: 747) 

Name: p8-like 



Celera Sequence No. : 142000013384588 

TGGTGGCCATGGCAGATTTCGTACGACCAGTAGGCCTCGATGCGGTACGTGCAAGTCAGGGCGGAGAAAATGGGCTGCAGCAGAGTAATTGGAGC 
TGCGGAATTCGGAAATTATGTATAATCCAATGGGATGTTAGTTGCAATGCCAATTTCTTACACAACTCTGGCTTGTCGGACTTTTCCTCCTCTTT 
TTGGTGTTCCAAAGTGGGAATCAGGCAGTCGTACTTCTCCTTATCCGGCGTGTAGAATGTTCGCAGCTGATTGCCCAAGCTCTAAACAGCATTAC 
TTAAGGAAATTTGTTTATGTCTGCATTAATAAATTTGTACTCACTGGCTGCACATCTAAATCCGGCACTTCGAAATCAATTTTATACAAAATTGA 
ATCGTCAAAATCTTTGGCTTCGTGGGAAACCGCTGCAATTATTAAATTCAGCAGCAATGCTATTCTCGCAATTATCACGTAGGACTTGCCCATTT 
TCGTGTTTAAGGCAGCGATTTCTTTACATGAGTATTTTCGTGACAATATACAAATAGTTGCAAATGCTACGTGTTGTTTTTCTCAAAAAAATAAG 
GAACATATGTTGATTGTGTGTGCGGCACTGTGAATGGTTACTGCGAAAGAAGAAGAAAATCTGCTGGCCTGCATGGGCATTTTAGCTTGGAAATA 
GCTAAATATCAAAAAATAACAGTCGCAGTCGACGGCACATACAAATTTTTTGAATTCAAATTCTTTAAAATCTAAGTTTTGCATTTAAATCTAGT 
CATTGAATATTTCTCATTAACAAAATACAAAATTTTATCTCTTAAACTTAACTGAATGTTTTCGTATTTATTGCAGATTTAATGGTCATTCATCA 
TAAAATCTTTTAAGAATCTGTATGAAAATCCAAAATAAGTCGAGAAACTAAAGCCACGTCCTCCAACAAATTAAACTCAGTTCCCAGTTTATCTT 
CAACATAATTATATTTTCCTTAACAAATATCGCATATGGTAGTCAGTTCCCTATCTCATTCCCAGCGCCGGCGTGTCCTTTTGCGCCGTGGACCA 
GCCGAGCTCCTTGTCCTTATCATGATCCACTGGCACCTTGTCGATGGCATCCAGTATAAACTGGATGGAATGCGCCGTCCGTGGCAGCCACTCCA 
TGATACGAGAGGTGGTCGAGACGGGCGGTGTCGCTGCATCGCCGTCATCCACGAAATCCGCATCTCCGCCGGACAACGAACGAACTCCACCCAGG 
TAATCCGTGTAGTCGTAATCACTGGTCAGCGGATTCACCTTGGAGTAGTAAATGGCAGCGAACGTCGAGTAGATAATCAGCAGTGCTGTGCCCAC 
CTATAAGTTTTATATACATATTTAAAACGATTATATTACACATTGCACTCATATACTTTATGTATTTTCACTCACCTGAAAGACGGCCCAGAACT 
TGTAGGAGCCCTCCGTGATGACGAAATCGTAGTATCCTTTGAGAGGATCACCCTTGATATAGCGACCCTCGCCCACATCCGTGTCGGCCTTGTCC 
TCCGCCTCGGCATCGCGTCTTTGGCGCAGTCCACGCACCGATCTCAGATTTCCCTTTTCCCCCTTTAGTGCCGAGACTTTGGTGGCGGGTGCCGA 
TGGTGTGACTGTCTCAACTTTCTCCACTGTCTTCTGCGACTGTTCCTCCTCCTCCGCTTCCTCCTCCTCCTCATCAGAGACATCTGGAGCAGCTG 
TCGTCGTCGTCGTCGTGGTTGCCTTAGTTGTCGTTGTAGCCGGCCTCTTGTTCTGGTTCTTTTTGTTTCCCTCGATGCGCGGCGGTTTTTTGGAG 
TTTATCCTGTTCTTCTGCTGCTGCTGCTGACGCAGTTGCTCCTTCTCGGCTTTCTTCGCCGCCTGCTCTTGGCGGAACTTCTCAGTGCTCTGCTG 
ATGGCGTTCGCTGACCGCGTTCCAGGCACTCATCAGTGCCTGATACAGATCCCAGATCTGCTTGAGGCTCTTCAGCACGGTGACCAGTGTGGATT 
CACTTTGGATTTGGACTACTGCATTGTCGGACGTGGGCTGAGATTGTTTGGCCATCGCTGTTTCCGGATGTGGGCGCGGCTGGAAGCCCATGGGA 
TAGTCGTACATCTCATCGTAGGCGGGATAAGGCGATCTGTAGAAAAATACATACGTTTCGTTTATGTTGTTGTAATAATTTCATCCAACACCCAT 
GACCCAGATCTAGTTGGATTAAGCATTTCTCACCTGCTAGGAGCTAAAGACGCCTGAAGCTTTCTATTATTGCCCTCGGCAGCGGGATGAGTAGC 
GACTGGAGCTGGCTGCTGTTTATATGACCACGGAGCGTAGAAGGAATCGTAGGGACGTGCCTGCTGTGACCTGCCCTCACCGGTGTATTCCTTAT 
CATAGAAACGCTCATCCTCTTCCGACTCCTGCTTGGACTCTGCACTGGGATATCGTTCAGGTCCGTAGTCATCGGCCACATCGTAGTACGGATCC 
T GCT CGT AGG AAT C C T CGC G AC T CT GC GGC GG AC C GT AGG C AT C C T GGC GT T GT C C C AAC AC GGG AGAAT C CAT GC TC T C T G CAT T C ACC AC C AC 
CGCCTGCTGCTCCTCGTAGAAGTTTCTGCCCTGCTGCGATGCGGATGCACTCGGATTGCTTGCATTTCCGGCATCCCTAGCAATTCGCTCCACCG 
TGGCAATTTCTGTATCATTATCGGCACCGGACTTGAAATGATTCCACATGCGCGTAAGGATTAGGAGATCGCTTTCTCCGCCCGCGTTGCTCTGT 
GCCCGTCCAGCTTCGATCACGGCCAGCGCCGTTAAGGTGGTGGCAAAGAGAAGAAACGTGACCACCAGAAACCATATGGGTGAGAGGCGTCGTGG 
CGCCTTCGCCGGCATTTTCTCGTAGGCAGACATTATCCGTTTGTCACTATTGATTCTATTGTTTCGAATCCTGGTTCAGCACACACCGCACGGCC 
AGTTGGCTCGGAGTTAAGTCGACAACTAACCGAAGATGTCCCAATGCTGAGCCAGAAAAACTCCGCCCGGCAATGGTGCTAATGGCGGGTTTTTG 
GGTATCTAGAACATCCATCCGTAGTGGCTCGTTGAAAGGTGCTAATCCGCGGGCCGCGTGCATCATCGTTATTGGGCCAGTGCTCCAGTTTCGAT 
TTAACGGTTCATCCAAGCCACCAAGTGCCGCGGCAATTAGCACCATCCATATCACCAATGTAGCCAGTACTACGATCCCATCCCATCCGATGCTA 
ACTTCCATTCTCCATTCAGTTATGCATAATTCCCGATTTTGTTGTTCGCCACAAATTCGTTTTTTTTTTTCGCGGTGAAAGTGTGCAATATGATT 
CTCTTTCCAGAAGATGGACTATGGAATTTAGGAAATCGTAATTGGAATGGCTTTGTGCCGTTGGAGCACAAAGATCCACGCCTGTGGAGTGGCCG 
CCATCGAGGTCCGTCGTCGGTCGTGTGCCAGGCGAAGATCACGATCTAGGAAGCTGCATACATGCCCATATTTAGACACAAATCGTGGCAAGCGT 
GCGACTTTCATTAATGGGGTTGAAAAGAAATCTATTTCTACAATTAAGATACGTTAAGTCAACATCAATGGGGTGTTTGTACATTGGTGAGATCG 
CCTTAACGATTCGAAATTTGAAGACAACACCATTTTAAAAATGCTAATACACAGTTTTTTTTGGGAAATTGGTATACTACACAAAGTAACCAAAA 
AGAAAGGATATCGTATTAGAATATTGTATACACATATGAGCTTATCATAGTTTATTATATAGATACATATAGGTAGCCACCTATTAAAACTCATA 
TCTTAGAAAAGAAAGCGATGTTAACAATTTTATTCTTTAATGCGAATTGAAATGAAATGAAAATTGCTCATGTTATGAAGTTA 
(SEQ ID NO; 748) 

Exon: 2883.. 2219 
Exon: 2126.. 1406 
Exon: 1330.. 1001 

Start ATG: 2883 (Reverse strand: CAT) 
Transcript No. : CT21031 

ATGTCTGCCTACGAGAAAATGCCGGCGAAGGCGCCACGACGCCTCTCACCCATATGGTTTCTGGTGGTCACGTTTCTTCTCTTTGCCACCACCTT 
AACGGCGCTGGCCGTGATCGAAGCTGGACGGGCACAGAGCAACGCGGGCGGAGAAAGCGATCTCCTAATCCTTACGCGCATGTGGAATCATTTCA 
AGTCCGGTGCCGATAATGATACAGAAATTGCCACGGTGGAGCGAATTGCTAGGGATGCCGGAAATGCAAGCAATCCGAGTGCATCCGCATCGCAG 
CAGGGCAGAAACTTCTACGAGGAGCAGCAGGCGGTGGTGGTGAATGCAGAGAGCATGGATTCTCCCGTGTTGGGACAACGCCAGGATGCCTACGG 
TCCGCCGCAGAGTCGCGAGGATTCCTACGAGCAGGATCCGTACTACGATGTGGCCGATGACTACGGACCTGAACGATATCCCAGTGCAGAGTCCA 
AGCAGGAGTCGGAAGAGGATGAGCGTTTCTATGATAAGGAATACACCGGTGAGGGCAGGTCACAGCAGGCACGTCCCTACGATTCCTTCTACGCT 
CCGTGGTCATATAAACAGCAGCCAGCTCCAGTCGCTACTCATCCCGCTGCCGAGGGCAATAATAGAAAGCTTCAGGCGTCTTTAGCTCCTAGCAG 
ATCGCCTTATCCCGCCTACGATGAGATGTACGACTATCCCATGGGCTTCCAGCCGCGCCCACATCCGGAAACAGCGATGGCCAAACAATCTCAGC 
CCACGTCCGACAATGCAGTAGTCCAAATCCAAAGTGAATCCACACTGGTCACCGTGCTGAAGAGCCTCAAGCAGATCTGGGATCTGTATCAGGCA 
CTGATGAGTGCCTGGAACGCGGTCAGCGAACGCCATCAGCAGAGCACTGAGAAGTTCCGCCAAGAGCAGGCGGCGAAGAAAGCCGAGAAGGAGCA 
ACTGCGTCAGCAGCAGCAGCAGAAGAACAGGATAAACTCCAAAAAACCGCCGCGCATCGAGGGAAACAAAAAGAACCAGAACAAGAGGCCGGCTA 
CAACGACAACTAAGGCAACCACGACGACGACGACGACAGCTGCTCCAGATGTCTCTGATGAGGAGGAGGAGGAAGCGGAGGAGGAGGAACAGTCG 
CAGAAGACAGTGGAGAAAGTTGAGACAGTCACACCATCGGCACCCGCCACCAAAGTCTCGGCACTAAAGGGGGAAAAGGGAAATCTGAGATCGGT 
GCGTGGACTGCGCCAAAGACGCGATGCCGAGGCGGAGGACAAGGCCGACACGGATGTGGGCGAGGGTCGCTATATCAAGGGTGATCCTCTCAAAG 
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GATACTACGATTTCGTCATCACGGAGGGCTCCTACAAGTTCTGGGCCGTCTTTCAGGTGGGCACAGCACTGCTGATTATCTACTCGACGTTCGCT 
GCCATTTACTACTCCAAGGTGAATCCGCTGACCAGTGATTACGACTACACGGATTACCTGGGTGGAGTTCGTTCGTTGTCCGGCGGAGATGCGGA 
TTTCGTGGATGACGGCGATGCAGCGACACCGCCCGTCTCGACCACCTCTCGTATCATGGAGTGGCTGCCACGGACGGCGCATTCCATCCAGTTTA 
TACTGGATGCCATCGACAAGGTGCCAGTGGATCATGATAAGGACAAGGAGCTCGGCTGGTCCACGGCGCAAAAGGACACGCCGGCGCTGGGAATG 
AGATAG 

(SEQ ID NO: 749) 

Start ATG: 1 (Reverse strand: CAT) 

MSAYEKMPAKAPRRLSPIWFLVVTFLLFATTLTALAVIEAGRAQSNAGGESDLLILTRMWNHFKSGADNDTEIATVERIARDAGNASNPSASASQ 
QGRNFYEEQQAVVVNAESMDSPVLGQRQDAYGPPQSREDSYEQDPYYDVADDYGPERYPSAESKQESEEDERFYDKEYTGEGRSQQARPYDSFyA 
PWSYKQQPAPVATHPAAEGNNRKLQASLAPSRSPYPAYDEMYDYPMGFQPRPHPETAMAKQSQPTSDNAVVQIQSESTLVTVLKSLKQIWDLYQA 
LMSAWNAVSERHQQSTEKFRQEQAAKKAEKEQLRQQQQQKNRINSKKPPRIEGNKKNQKKRPATTTTKATTTTTTTAAPDVSDEEEEEAEEEEQS 
QKTVEKVETVTPSAPATKVSALKGEKGNLRSVRGLRQRRDAEAEDKADTDVGEGRYIKGDPLKGYYDFVITEGSYKFWAVFQVGTALLIIYSTFA 
AIYYSKVNPLTSDYDYTDYLGGVRSLSGGDADFVDDGDAATPPVSTTSRIMEWLPRTAHSIQFILDAIDKVPVDHDKDKELGWSTAQKDTPALGM 
R* 

(SEQ ID NO: 750) 



Celera Sequence No. : 142000013384638 

CTTCATCTCGCCCGTCATTTTCCAGATAACCTGGACCCGCTTGCGTGAGTCTCGCAAATCGACGCAGTGCCTTGCCCAAAGCTCGAGCAGCAGTT 

CGACCGCGTGGAATGACCACTCGGACGGCTGAAAGGAGTTCCAGCTCAGGCGGCGCTCCAACGAATCCTGCAAGTGAGACATGGTAATTTGGACT 

ACTATTGACTTCTAATGGCGTCGTTGTACGTACGTCTAGTTTAGGGGGATGGATTTCCGCGATCCCCTCCTCGCACAGTTCCTCGGTACGCAGGT 

AGTCCACATGGCACTTGTGGGTGCGCAGATTGGACGTGTTGAAGGATCGCATGACGCGCTTGCAGCCCGTGCAGAAGTAGAGTCCATTGATTATG 

GCACCATCGGTGCGCACGATTTCCCGATACACCTTCCACACGGAGCCGCGCTGCTTGCGTCGTAGTAGCCGGTACTCGCCGCGGAGCACCTTCGC 

CGAGATCACCTGGTTGGATTCGAATTTTGATACGTCCTGCATCGTGCGGGCCTCAATCATGTCGTCCTTACGCTCGAAAAGTATAAACAAAATGT 

GCCTAGAGATGGGCAGCAGCTCGATATCGATTTTTCTAGAAATGGTCGGTATTGGCTTCATATATCTATCGATAGTTCTATGATATCGATTTAAA 

TTATATCTAAAATACAAATAAAAAAACATTTTACGAATTGAGAAACTAAATGTAACAAATACGATTTAAAACAAATTTATAATATTAAAGTAAAG 

TAAGTAATTAAAGTTTTGCTAAAAAAAGTAAAATATAATTTCATTGCATTTATTTTTACTTAAGAAACTAATTTGTAAGATCATAACAATGCAAA 

ATCTAGCTTTTTACAATATATATATATATATTTAGATTAATTTGAAGTTGAAGCAATATTACATTAAATATTCACTGTTTTAGTTGTATATTTTG 

CTGAAAAAGTAACACCACGCTACTTCTACGTATAACACTTAATATTTACACTACTGATTGTTATGCTTAAGCAAATAAACAATATCTTTGTGAAA 

TGCGTTTAGCTCCAGCAGTGCATCCTTCATAACCTGCAGCTTTTCTCGCTCGATCGCCAGTTTTTCCTCCTCTATTTCAAGGAGTCTGTCCGAAT 

GGTTGTTCGATGCTTTCCGTTTTGTCTGACATTTTGAAATTGGCGATGGGGAGAGATCCTTCATGTGGTCATCATCCTCCTCCTCCTCCTCCTCC 

TCCTCGTCTTCTTCCGGGATTTCCGGAATACTGGGAGAGGGAGTTCGCGTTCTTTTCATGGCTTTGCTAGCAACCATATCACTTTCGAATACGTT 

CCCATCGTCCCCGGCCATAGGCTCGTCTTTTAATGAAGACAATTCCTCAGGCTCATAATCCTCATCGCTGAACAAAGCTTGAGCTGTAAAATTAG 

CGTATTTTTAAGACATAATGACATAACTAGCATTTATTGCACTACGACTTTACCTGGATTTTCGGCGGTAATATCTTCGAAGACATCAACTGATT 

TCGTGCCAATCAGAAGTGCTTGAAGCTTATGAAAGTGCTCCCACTTGGATGGCACTCCAGTTTCCCGAACTCTCTCAGCTTCTATTCTGTAAATA 

CCAATATAAAAGTTAATATACTATAAAAATAAAAACTTTTGGTATGCAAAATATAAGAGCGCACCTATACTTTCGCGACAAATTGTCCATTTTGG 

TTTTAAGCTCCGTGTGGCTGGGCCCAAACTGACTCATCTCCCTAGCCATTTGCCGGTAGATGATCACATTCTTGGACTCTCCACGAAAATCCTTC 

AAATGCTGTGCCCACAGTTGCAGCAACATCTTCTCCGCATCCTTTTTCCAAACGTGTCGCTCCGTTTTCGTATAGTAGTCCTCAAACTTTTCCTC 

CTTGCCAACATTCTTGGGCAATGGCGAGATGCCATTGGCTCGGATCGCTGGCTCAACGACCATTTTCTGTGCAACGTTTATCTTGGCGGTTCGCT 

GTGGTCCCGTCGGTGATCCTTCATCGCTCAACAGGGCTTGGAGTGAGCTCTTCATGCGCGGTCCTGGCTTACGACGTTCTGGTGTTATATTCTAG 

AAGAGTTAGTTGGTTTAGAAGCATGGTTACTGAGTGATTAATTAGAATTTTAGTATGTTCGCTAGTGGTCCATGTTGGACTATGTAACTGAAGTG 

G AC T TT C AT AC AG AAATT AGG T C AAAT T T GAT AT C C T AT C AC AT G T T T T AAC GAAT C TC GC T TT GT AGTG C C AAAAC AC A TT A TG AAAAGG T AAA 

CAGGTCTATTGCCACCCATGTGGGATAGGTAAACAAGATATTTGGTGTGTGGCTGTACTCGGTAAAACCGGGTAAGTACCCCAACAACCCATGTG 

GGGCGATGGACTCACTTCCTCCGGCTGCAAATCCTCCGCCAGGAAGACGGTCTCGTAGAACTTCTCGTACTCCGATTTTGGCGGGAATGACAGCT 

GCGACTGAGTTTTCTTTTCGGCGGCAGCAAGACTGCGATTGCGACTGTTTAGGGCGATTCGCTTGCTCTTCGGGCCGATCAGTGAATTCCTTGTA 

CTTCGGTGAACCTCCTTGGCGCCGGCGGCATCGTCAATATTTCTGGACGGCATTACGTTGCGGTCTTTTGTGTCCAGCGCTAGATAGCGTGAATT 

GTTGTGTTTTTATTTTGATAATTGGTGGAAAAATATCTTTTGTAATTCCTGTTCCTGTTGTCCTAGAGATGCAACAAGAGTCGATAGATGTGCAG 

TAATATCGATTGTTGCAGTTAGTGAAGTTTTTGAACACTGTTTATTGGTTTGGCTTATCGATAATTTAACATTAATACGCAGCCCTGGTTAAAAT 

GGCGGGCAATTTAAAAGTTTTTATTAGTTAAGACTTCAAATGTTCGTAATTTGGATATTGTTAAAGTAAACGTTATCAGTAAATCAAACGCGATT 

TAAGTTAAATGTATCAAAGTCCTCTTGTTTGGCTTATAGCTCAATAATAAAACTAAGTGTTCCAACAAAAATATCTCTCTTCAATACATATTTTC 

TTGCATCAATTATTTTATTTACTTTACCAATTCAAATGAAGTGCATTTCATTTGCTTTAGCATTTATATTTGTTAGGCACTGCCTCCTAATACAA 

TAAAGATCATTTAATTGAAATGAATTGATGTAAGGACAATTACCACAGATGTCATTCTATGTGGATTTCGCACGCACTTTTAGCTAATCAAGATG 

GGTCAGCAGGCATTTGCCGCAGCAAATGTATCTGCATCTGCGAACAGCATCTGCATCTGCATCTGTATCTGGAAAAAGCACAAGGCACAACGGAC 

GACGGCAATCAGCGACAATTGCCCAAAGATTGCGACTAATGGTTATAGTTGGACCGACGCACGTCGTCGCATCAGACAGAAGGTGCCTGTGGGAG 

ACTTGCAACCGGTTTCGAATGATTCTCTTGTCTTCCGACTCTTGCCGTTCTCACATGGAGAAAAAGTGAACAAATCCGGCCAACAATAGGGTTAC 

AATATCAGTCTGTCCATATATTAATATGTTACATTCTTTAAAACAATACAATTCATTTGGGAAAAAATTCAGACATTGTTTCACAGATTAATACA 

TTGAAAACTGCTCCATTTAATTTGTAATTCTCAGTGTAGTCTCCTGGCTCCTGCATTTCGGCCGGCTGCAAAAGTTTGTGGCTCGTGCTCGGTGA 

TTGCCAGCT 

(SEQ ID NO: 751) 

Exon : 2714.. 2391 
Exon: 2086.. 1680 
Exon: 1606.. 1479 
Exon: 1413.. 1001 

Start ATG: 2618 (Reverse strand: CAT) 
Transcript No. : CT21Q95 
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GAACAGGAATTACAAAAGATATTTTTCCACCAATTATCAAAATAAAAACACAACAATTCACGCTATCTAGCGCTGGACACAAAAGACCGCAACGT 
AATGCCGTCCAGAAATATTGACGATGCCGCCGGCGCCAAGGAGGTTCACCGAAGTACAAGGAATTCACTGATCGGCCCGAAGAGCAAGCGAATCG 
CCCTAAACAGTCGCAATCGCAGTCTTGCTGCCGCCGAAAAGAAAACTCAGTCGCAGCTGTCATTCCCGCCAAAATCGGAGTACGAGAAGTTCTAC 
GAGACCGTCTTCCTGGCGGAGGATTTGCAGCCGGAGGAAAATATAACACCAGAACGTCGTAAGCCAGGACCGCGCATGAAGAGCTCACTCCAAGC 

C C T GT T G AGC G AT G AAG GAT C AC C G AC GGG ACC AC AGC G AAC C GC C AAG AT AAAC GT T G C AC AG AAAAT GGT C GT T GAGC C AG C GAT C C G AGC C A 
ATGGCATCTCGCCATTGCCCAAGAATGTTGGCAAGGAGGAAAAGTTTGAGGACTACTATACGAAAACGGAGCGACACGTTTGGAAAAAGGATGCG 
GAGAAGATGTTGCTGCAACTGTGGGCACAGCATTTGAAGGATTTTCGTGGAGAGTCCAAGAATGTGATCATCTACCGGCAAATGGCTAGGGAGAT 
GAGTCAGTTTGGGCCCAGCCACACGGAGCTTAAAACCAAAATGGACAATTTGTCGCGAAAGTATAGAATAGAAGCTGAGAGAGTTCGGGAAACTG 
GAGTGCCATCCAAGTGGGAGCACTTTCATAAGCTTCAAGCACTTCTGATTGGCACGAAATCAGTTGATGTCTTCGAAGATATTACCGCCGAAAAT 
CCAGCTCAAGCTTTGTTCAGCGATGAGGATTATGAGCCTGAGGAATTGTCTTCATTAAAAGACGAGCCTATGGCCGGGGACGATGGGAACGTATT 
CGAAAGTGATATGGTTGCTAGCAAAGCCATGAAAAGAACGCGAACTCCCTCTCCCAGTATTCCGGAAATCCCGGAAGAAGACGAGGAGGAGGAGG 
AGGAGGAGGAGGATGATGACCACATGAAGGATCTCTCCCCATCGCCAATTTCAAAATGTCAGACAAAACGGAAAGCATCGAACAACCATTCGGAC 
AGACTCCTTGAAATAGAGGAGGAAAAACTGGCGATCGAGCGAGAAAAGCTGCAGGTTATGAAGGATGCACTGCTGGAGCTAAACGCATTTCACAA 

AG AT AT TGTTTATT T GCT T AAG C AT AAC AAT C AGT AG 
(SEQ ID NO: 752) 

Start ATG: 97 {Reverse strand: CAT) 

MPSRNIDDAAGAKEVHRSTRWSLIGPKSKRIALNSRNRSLAAAEKKTQSQLSFPPKSEYEKFYETVFLAEDLQPEENITPERRKPGPRMKSSLQA 
LLSDEGSPTGPQRTAKINVAQKMWEPAIRANGISP 

SQFGPSHTELKTKMDNLSRKYRIEAERVRETGVPSKWEHFHKLQALLIGTKSVDVFEDITAENPAQALFSDEDYEPEELSSLKDEPMAGDDGNVF 
ESDMVASKAMKRTRTPSPSIPEIPEEDEEEEEEEEDDDHMKDLSPSPISKCQTKRKASNNHSDRLLEIEEEKLAIEREKLQVMKDALLELNAFHK 

DIVYLLKHNNQ* 
(SEQ ID NO: 753) 



Celera Sequence No. : 142000013384638 

GTGTTGATGCTGTGCCTGGTGTCGCTGCTCTTTGTCATCTGCACGGTTTTGTGAGAAAGGATCATGCTGGAGGTGTACGAGGAACGATACGGAAA 
GGTAGTGCCCATGCAGGATGCGGCTGTTCAGCTGCTGAATCGCTACCGGATGCTGGCCAGTGGAAATGTGCACAATTGAAGGTGATTGCTTTTTG 
GCACTGAACGGATTGTGGGTGGACAATAGCGGCACTTGCCACAGCAGTAAGTGTGGAATATCCTGTATTCCTGTAGAATTTCGTATTTTCATTTA 
ACGTTTGTGATAGTTTTTATATTCTGTTGAAACAAAGTGAGCGGACGACGACCATATCGATAAGCTTAATTATCAACCTACGATGGTACATAATC 

AAT AC AAC AAACT GT AAT C T T CC AT T TAG AAC C GT T GAC CG AT AC C GAT GT T AAGT GC AT T AT CAT TAT T AAT CCC AT AGT T GAAC T T AGGT AT G 
ATGTAGTGAAATCCATTCCTGGAAATACCGGGAACTTACGAGTAGAAGGGTATAGCCCCCATAGTCACAACGATATACAATTATTACAAGGCATG 
AACACATGTGTAGATTAGTTATGGAACAACGATGCGGATACTTAATATTGATAATGGATTGTGGAAGAGGAAGCCCACCCAGTGTTATTCTACCT 
TAACAAGTTACATGTGACTTAGACTGTTGTTAGATAATAACAATATTATACATGTTCATCATGACAAACCAAAATATTCCAGTACATCTAGTAGA 

T AAAC C A C AAAC GAAT AAC AAAT AAC AC AAAAC C AAAATC G T AGT AAT C AAAAT TAG C GT TAT AC TAT AT T GT T AC G AT T G ACT T GAC AC C AAGT 
AAATCAAAAACATGCAGATATTAATATTTGGAATAGCCTATACTTATACACATATATATATATATTTTAGTATACAAAATTATATTGAATAAACT 

T T AAG AG ACT GAAT AT CC AG T AGT T TCCTTAGAGTGTC AAAAAC G GT G AGT C C AAC AC AAC C AAT T T CAT TTATTTGTAGTTCTCTTTTATT G AA 

AGATAAAAATTTTAATTTTTTCATTGTCTCTTTTTTGTAATTATATATATTTTTTTATTGCTATATATATCTATATACACATGCTAATGTATATT 

T GAAAATTTCAAGTCAAGACGTTC AAAAT GC ATT TTCCAAAAATACAATTTGTTTTT AAT GCT TTGTGTT ACT AAAGTAGCCGATTTTGTTT AGC 

CACAAAATGATTTTCATGTTCTTTTGTGTGTAAAGTAGAAGACATTTTTAAATTAAACGTAAAGCGAGAAAACATATTAAAAATAATGGAATGTT 

CGTTCTTACATACTCGCGCAATAGAAATACATAAAATAGGTGGCTTTTGTTGAGCTACTCGGGCAGCTCCCTTAAATAAGTTTTTAACTCAATTT 

TGCAAACAAAAAGTTCATAAAATTAACTAAAAATGGACGGTATTCTGTGATAGAAGAAATTTCAATAATGCCGTCAGACAGAACCAAAGACTAAA 

AAAAAAAACATAAGTGCAAAATGTAGCCAACCAATTATAAATCAAATCTTAGCATTTACTTGCGCAAAACAATTGGCAGTGCATGGAAAAGTATT 

TTCTGTCTGTTATGCGTGGTGAGCCTTTTCCGTGTCTATTTGATTTTCACTTCGTTCTCATCTATTTTTTTTTTATCTTTTGGTGTGTCTGAGTG 

GTCGCTGTTGCGTGCCTAAGGCGATTTAAAAACGATTTATGTGCATATTCTTGCGACCGGTTTAGATAAAGATCTCTGTTCGCTTCCTGACTTGA 

TGGCCAGTAGTTGAGTGCAGGACAGCATCGTCTGGATCGTCAGCTCCATCGGCACCCTGATCCTCGTGCGTCAGGTAGTCGCCTTTGTGTCGGTG 

CAAATAGCGACCGATAAGGAAGAACATAAGGATCAGCAGCAGGAAGAGAATCACAAGCAGACCTGAAAGAAAGTTTTGTATCACTAATATGTTTC 

AGTGCCAGAAAATGATTTTACTTACATGCCAACAGCACAGAGTCAACTTCATTGTATGCCTTGCGCAACTTCTCCTCATCCACCAAGGGTGGTGG 

CCTAGTCTCGATTTCAATGGGCGGATGCGTAACTGGCTCCACTCCGCAGAAATCTTCAGTCAATTGAGCTATAACAACATACGATTAGGTTTAGA 

TTGTACTACTAACTATTCTGTAATTGAAAATACTTACTGCCCAATGATTTGACATTCTTGGGAGGATTCTGCTGGAACATCAGCTTCAGTGGATA 

GATATCATCGAATTGAACTCGCGACACACAACCCACAAAGCCATCTGTCATAGATTCATTTTTGCCAATGTACATGTACTGGATATTGTTGAATT 

GAGCATCCGCTGAGGCCTTAATATCAAAATTATACTCGACCGGCTCGTAGTTGTCAACCTTCAAAACCACTGTGGATCCACCATTCTTGCGCATA 

AAGTGCATATCGTGGTACTGACCCAGGCCAAAATGTTTTTTGGGGAATATGATTTCCTGCCTCTCGAAACCGAAATCAAATACACAACGCAAATG 

ACCTGAAAGAAAACGATTTGAATATGAAGTGGTTCTCACTAAAAAATGTATGCCAATCAATTTACCAGAATTCGATATTTGAATAGTTAAATATT 

CGCCAGTTAGGTTGGAAGAGAAGCCAAGCAGGAATCCCTTCGGAATGGTGGTGGTGAAACCGACGCGAATGTTCTCGGCAATTGTGGAGCGGAAG 

GAACCCTCGAACTCGTAGCGAATAATCGAACTGGAGCGTAGGTT GAC AC CAATCTCTGCAAT AAGT AAAT AGAAATT GAAT AAGATGCAATT ACT 

GCACTATCCAATTACCCACCATCTGCGCAAATAGGTCCTTTGAATGCGCTCCAACGGCAATCGCAGCTGTAGCCATCGTATCGCTCGATACAAGT 

ACCATTGTTTAGGCATGGATTTGATTCACAGCGACCCACGCAACCGGTGCTGATGCCGTATAATCCACGCTTGGAGTATTCCTTCAAGTCAACCA 

TCTTTCCGTTCAGCAGCAGTGCACGAATGCATCCTACGTAACCATCGCGGTACTCAGTAGTGGCGCCAATCACCAGATCCGATGTTAAGTGAAGT 

GCACGCACTGGACCCGGCGGCTCTCGAACCTCAGCCTTGATAGAACCATCAACCACCAGACGAGCCTCCTTCCTGTTGCGCTCGACACTAACCGT 

GTGCCAGTTGTTGTCGTTCAGGTGATAGCTTGTGCCCACATTAACGCCCAGTGGACCACTTCCCGCCTGATACTGGAATTGCAATTTGTTGCCGC 

CATTGAGACTTAGCTTAATGTAATCTGTAGGTCCGGTAGCATGGAAGATCACAGAGTTCTCTTGTGTGGTACGGAATTCCAGATAAATATCTCCA 

GAGTGACCCATATCGAATGGTGGTAAGTTAATAGAAGCATCGGCAATGCGGAAGGTCACCACATTACTGAACAGATCATCGCCCTCACAGCGCAG 

CGGTCCCAGAGTATAGCGACCCATTTTCTCGTCCAACGGAGTTCCAGTGTCGCCAAACTTCACAGCTCGCACGGGAAGGTATTCCTTCTCCCGGA 

TATCACCACCATCCTCCATCCATTCCAGACTGTTCGAATCGCAGTTGCACCACTTTGTGGGATCGTGACACTTGCCCAGAATTCCACATTCGCAC 

TTACGAGAGCCAGGTAGAGCTCCAGCCCAATAGTCCATCGGTTGATTGTGGCGAGATATCCACCAAGAGAAGGGACGGAAATTTCCGGCCTCAGC 

TGGATACAAAATTCTATTATAACTCTGAATCAATTCAAGGAATCGCATTACTACTCACATGGCGAGTTGAAGAGTCGGGAGGAGCGGCAGGAGTA 

GCTCAGGCGCTGCCAGCAGCTATGAGACCGATTCAACAGAGCTTCAATTTGCAGTTGGTTGGCATCGTACATGATGGACTGCTCAAAGGAGCCAG 

GCTCCTGAAAACCATCTACTGTTGTGGTGTGCTCTTGACTGTGACTGAGGGTGGTAATCACGCGTCCATCCGCTGCAAAAACATCAATCAATGTC 
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ATAAGTTTTATAAAATTACACAATTTAGGGTGTATCTACACATGTTTTAAACCATATAATATGTATGACATATTACAGAGGATATAATAATTTGA 
TTGATGGTAATTTTTCCATTTTCAATTTAAGGAGTAAACAAAGAAATGTAGGAGTTCTATACAATTCAAGTGATATCCACTTACAGTAGAACTCG 
CAGGTCACTGGGAAAGGTTCTAGGGGACCACTGCCATCCACATCGAGATTCAAGTTCACACGCTGCTGCACATGCTGGACATTCTTGAGAGCCAA 
GCAGGACAATGGATTGTTTGCTAAATAAGAAAAAACAAATAATATGTAAAAACAAGATCGAGTTGAAAACATGTGAGAACTCACATGTATGGCAA 
ACAGCTCCGGCGTAGCCAGTATGTCCGCAATCACAGAAGAACTCCCTGGAGTTTTGGTGACAAAGTCCCTTATGCTGGCAAGGATTCGGGTTGCA 
GCGATCAATCATCTGGCAGGCATCGACGACGACATCATCGCCACAGCAAACCTCCTCGCCCTTCACCCAGTCCTGCGGTAATTTGTAATTTCCAT 
CCACCGAAATTAAGCGCATGCAACCGACAAATCCATTTTTGTCCTTTCCACCAGCAATATAGTACTGTGCTCCAGTGGCAACCTGCATGCTCTTT 
GTTGTGGTCATCGGTCTCTGATCAATATTTAAAATCAAGCGATTCTTTTCGATGGATATAACGAACGAATGCCATTTGCCATCATTGAACTGATC 
ATCGTAGTTGTCCAGAATGATTCGCGCTTTGTCCTTGACTTTAAGATCAATTTTCACTTTGCCAAACTCCAAAAAGACCTTTAGATAACCACCCG 
AATAGAAGTCGTGATGCAGCATTACTCCAGTTTCCTCGTAAGTACGGAAATAGAAGGAGACGTTAAGACGCTGCGAATTTTCATAACCCTTCAAA 
CGGACGAACGACGAACGGGTAGTGAAGGTCACCGGATAAATCGGAGGCGAAGGACACGCGTAGATCGTGTTTACCCTAGTGAAGAGATATCCCTC 
TCCCAGTTCAGTGCTGTCTTTCATCACTCGAATAAAGTTGGTCGAATTAAAGTAGATGTTTTCTAAGCAGCCGGAGAAATTTTGTTGGACAATCA 
AGCCCTCTTGTACGTTGGGCACACCACCTAGATACAGCTCCCTATTAAGATTCAGTCGTGTGAACTCGCCTTGGATGCGTCCTCTTACGATGACA 
CGATCCACTGAGAAGATTATGTCCCTTTGATTCCTTGAGATCACAACGTCATGCCACACATTATCATCCAGCAGGCTGCCCACCGATAAGGAGGT 
CATCACCCGCGATCCCAAATCCAAATTCAGCACCATTTTGTTGTCCTTAAGCTGCAGGGCATAGTAATCCCCTTGTGTGCCACGCGAATACATCA 
TCACTCCGTTGGCAAAGGCCGTCTTGAAGCGGAAGCGGATGGACTCTTTGGTGGAGGTAATGGGTTCCCGCCTCAGATCATAGCGCACCAAGCCA 
GTGCCATTAAAGTATAGGTTTTCAGAGACTGAAAGAAAAGGTTTGGGATTAGAAACAACACAATATCAATGAATCCATTCAAAATTTACTGTAAT 
CGCAGCCGTAGAGCTCCACACGCATGGAAATACGATCGTGCCACCTGGTGGGATTGATCCTCACCCATTGGGCAATAATGGGCACCTCGAAGACG 
TTGTAATGGATAGAGTTGCCATCGGAGTTGCCCTTGAACATCTGTGATGGATATCATACATATACATTAATATTAAAACAATTAATAAACAAAAA 
AGGGTAGTTCAAAGCCAGATTCTAAAGCATTAATCCATCGCCATCCTTTAAGCCGGGATTAGTTGTGTTTTAAGTGTAGTCTTAGTTCTAGGATT 
AGTCACAGTACCTGCGGCTCGCTCGTAGGATTCACGTAGGAGCGCCAGAACTCGCCATCATCCGAATATTGGACAATGTACTCCGTCACGAACTC 
ATCCGTATGCATGCGTCCCATGGTGGCGATCTTGCGCACCATGCGCGGATCTCCCAGGTCGAGTGTTAGAAAGTGGTTATAGGTATTCTCGACCG 
GTGTCCAAGCAGCGTTACCTGCACAAGGTGCATTTACCAATCTAGTTAGTCGCAGTGTCAGAGTGTGGAGAAGGGTGTGGTTTTTTTTGTAACAG 
AAGT AAT C T AC AG AAC CAT T AGC C AAAT GAGT AAC AAACGC AT GCGGG ACGAATT AAATC AT AAAAT AAAAAC C AAGC AC AAAAGC AAGC AAAAG 
TGTTAATGAATTTATGTAAGGATAATCCGCTTTCGCAGATTCAGGGTTCACCCAACCACCATTGGTTTTCACAGAGAAAAAAATTAATATGTATC 
CTCATCGAAATGAAATATGAACGGGAATGCTTAATGGAAACATACTAGTTTCTTTTCTGTGCAATAAACCACCACCAAAAGCGATCCATTTTCTC 
TAGGATTAGCGATACACATTTCCAATTGTATGTGGTGATCAATTGTTTACCTTGACATTGCCGCCCGGCTCTTTATAGTCGGCAAACTCTAGATC 
GGTTATGCCATAGCTAATGGTGTACTCGGTGACGTACTCATTGCTATGTGGTCGTCCTTGCAGGGCGATGTGCGTCACGTTCCGCACCACGCCCA 
GGTCAATAATCAGACGCTGGTCGAAATCCGAGTTCTTCGCGGACCACGAGGTGCCAGCTGGGGTTCGCAAATGGTCATGGGATGATGGGCAATAA 
AATAAAAATGATACAATTTAAACCAAAACAAAGGCAACATAAAATAAACCTAAAGTGGGGGGGGGGGGGGGCAAACAAACAGCAAATGGGAGTTC 
TACGCTCTATAATTGCCTAAAACTAAGGGGTTTAAAATTACGCTAAATCTGCAGCAGGGGGTGTTTTCAATTAATAAACCATAATTTAGGGGCGA 
TCGCGATTGACAGCTGCAAGTGGTCGAAGTCCTTCAGCACAAGGATCAACCCCTGCCCAAAAAGTGAAAAAGGATTTGGGTAGCCCGAGGGTTGA 
GTTGATGGTGTCAACACAAAATGGGAGAGTTCGCAAATGGTTAGGCATAAACAATTGCAGTCGTGTGCTTTTGTTTTAATTGCTCAGCGCGCCAA 
ATTGCAAGCCGGTGTGACAAAATAGTGTGAGAAAGTCCAAACTGCACAGATTTCTGAGCGCACAATCCATTCCCCAATATTTTTATAAGCCGCAT 
TCAAGTTGTCTTTGTCCGACGTTGGCAACGTGGCATATCAGCCAGCCATCGTGTTCCCATCGATAAAACTTTCGGCAATTCCAAAAATAATTCAA 
ATAAACGAGCCATTCAGTGCTACCCTTGCGGATGGTTATCCTGTTGGCGCATAAATTCAACCAAAACCAATGTCTGGCCCCTGTCTGAGTTCCTT 
CCCGTTTACGGGTTCCTAAATTTTGGTTTATGGCCTTTTAAGCATATTTCCGGCTTCGTTTCGTAGCGTTTGTTTATCCAGTGCCAGGCAAACAA 
ACGATTTTTTGTTGGGGTGTGAGCGCTGCGCAACATGACAACACAAGGATTACAAACTATTCCAAATTTGGCACACAACCATTGAGAAAATATAA 
ATGCATGTTTATTCTAGGGATAACGCTATTTTTAGGAGCTACGTTTGCTTGGCTTAGATTCCTTATAGTTCCTATACATCATGATCGCTTGTAGG 
GCTTAAGAAGAGCCTTTTGGCAGTAGCTGATCACTTCCTCAAGTTCCGAATCTTCACTCTTCACGAGAAGTTCCTCAACTTGCAGAAACTTGAAC 
TTTCCACGCTCCTGTGGCAAGCAATTCCGGAATGCAATGGCCGCACAGTAGTCCTCCTGCAGTAAATGCTCCTCGAAATGCCAATTGTCCATGGG 
TTGGTCGTGGCAACGCAGACTAGTACCGATCAGAGGGGAAACATCTGTCTCCAAACTGCGAGTAGTGTCCACTGAGAAGCTGATCTTTTGCAGAT 
CTACAGTGATGCCAACACCCAGCTCCTTGACGTACGCCTCCTTGAGACACCAGTGACGCATAAAGGCCTTCACCTGCTCCCGTTCATCATGATGC 
GGTCTTCCAATATAACTCCATTCCTCCGCCGAAAATTTGCTCTTCATCAGGCCAAAGAATTCCGACAGAGGCTTGCCACCATTGTACTCGATCTT 
CATGACATCTGTGCCGATTCCAAAGTCGGGATCACTGCTCTCTCCTGCAATGCCCGCTAGAAGTACCAGGCTTCCCTGATGGGAGACATTGAAGC 
TGAGAGGTGGCCCATCGTAGTCCTCCCCTTTCACCCAGTAAGGTTTACCCCTCACATCCCGGGCGAACTTCACTTCGGCCGACGGCAATCCGCTG 
CACGTGCTCACATATTTTCGCATGAAAAGACGACCAATTAGCGAGGACAGAAAGTCATCGATAAAGTGGAACTTCATAAGCCGAGCTCGTTCCTC 
CGGTTGAATTGAGGCCACCGCCTGGCTCAGTTGAGGTAGTGTGGGCCTCCAGCTGCCCAGGTCAAAGGCCCAGCGGGTGCAGATGTGTTTGCTCA 
TTTCAATGCGTTTTGTAACTATCTAATTGCTTTTATAATCATTGTTTTCAGGGCGTTTTCGCTTTGCAATGTTGTTTTTAATTCGCTATTGTGAT 
GCAGTATGGCCAGAACTGGAGAGGGTAAAAATCCCCCAGCGATTACGCCTTATAAATATACTGCAAATATACTGCTCCACCAAATGCGGGGTGTA 
CCCATTTTCATTTTAAAAAATTAAGAAATAATGACTATCTTTAATTTAAAAAAAAAAATGGGATAGCTATATGTAGCATTTAAAAAAAGCAATCT 
AATAACCTGTATATATATAAAAATGTGTACTTAAAAAGGTTCTATTAAAATAAAAACATATTATCCTATGAATTTTTGGCGCAGCATTTCGAAAT 
AATAATTGCTAATAAAACAACTGAAAGGCAACTAAACTTTCGAACGAATAACTTTCAACAAAAAATCAATATTACTCACCGTTTAGTCGTGCTTT 
GTCGGGTCCTCTTTCCGTCAAGGATGATGTGGCCGTGAGGACGGCTCTTTCCATCAGCGGCTGGTTGCAATCGTAGTCGGAGAAATAGTCCGTGA 
AGGCGTCTGAAAGAGTGCAATATCGATTTATAAATATCCGGAATTCCATGACCCATATATGTACGAGTGTACATTGTTGGGCGAAAAAATGAAAC 
GCAAACTGTCACGGACGCAATTAAAATTGAGATATGAAGAATCTGTTTGGATGGCGGCCAATAAGCTGAAGTACTCCAGCCGGGACATAATCATT 
GGCCTCCTTTTTATGGGCCCATAAAGAAAGCCACATCTATAAGGCAGCAAGAAGTAAACAAATTGACTGAGGAGTAAATATTTGCTATTTATTTT 
AAGAGTTCTTAGTTCTCTGTTTTCCGGGTAATTTAAGTCGTACAAGATATATTGAATTTGCTTGTAATGAATAAATATCATATAAATAGAATCTA 
TACAATTGAATCTTTAAATTTCACATGAAGTCCAAGGAAAATTCCTATGAATTAAGACAATCAGCATTAAATTGGACATAAACTGATGTAATTAG 
TTGCTTTATTTAGAATATTTTACTCTTAAACATGTATTAAGAGTACTAGGTACTAGTTAGAAATATTTCTTATTAAGATGGTTCCACTGTTTTAA 
GAGGCAGATTCTTGTCTCATATTTGCACCATAATATTTTCCGTTGACGAGAGTCTTTTAGCCCCTAGGATCCGTGACAAAAGCATGCGGATGTGG 
GGGCCAGGTCGTGTCGGAAATCCAGACAGTTGCCCCTGTCTGCCGCAACCCCCAAGAAATTGTTTACAATGGGTGCAGCAGTAGCTCCTCCTAGG 
CAAACAGCACACTGGCATTTTCTTAGCGGAGTGGGGGAAAACAACCCTTGCGAGGTCATTCGGAGTTCGTGGGCACTTTAAGCTGACTTTGAGTC 
TTGGGTGGAGTGGTACAAATCAACCTGATAAGCTCGAAAGGAAACCGAGGAAACACGCATGACAATCTTCTGGTAAATAACACAGGAACATATGT 
AC AT AC AT AT AT AGC CAGT T T G AAC T AC TTTTTGCT AC AT AAT T G C C C AAT G GT GAT T AG AAAG AAAAC GAAAT AC AT AG AAT TTGGCTAATGTT 
CTAAATGAAACAAATATGACGAGGTCTGAAAGGCCATTTCCGTCCATCTGGGGGAAACTCGAAGTGCGAATGATGCTATTTTCGAGGCAATAAGA 
ACCGAAATAAATTATGGAAATCGACCAAAGTCTGGGAGGATATATGTCGGCTGTCTGCTTTCGACGCTTCAAGGCAAATTAATTGCGCGGCATTG 
TTTATTTTTAGTCATAAAAATGTTTGCATATCATCGATAGATTGTTTATTTTTCAAGATATGTGGAAATTTGTGGGGAAAAGCGCTTAAGTCTGT 
TGACATATGGAAGGATAATGATAAAATCACCTGTTTCACACAGCATCATCAATCAGCAGATGCTTCTTATTTGCCGCATGCGCATTTGAAAAGCC 
GGAAAAGAAGGCAAGCACAACTGGGGATGCAACCGGCAAAATTTTCGAATTGCTCCCCCGGGAAGCTTCATAAAAGTCCGGAAATCAAGAATCTA 
AAACCCCAATGCCTTTGCTCAAAACGAACGAGTAAAAACGGGCGTTGTTAAGCGGCATTTTTGTTTTGCCACCGCTATTCGTAAGCAACCGTTTG 
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ATCGCTCTCCCGTTCTCACCAAAACTCTCGCGATCGCTAAGAGCGAGTTAAAAAGCGAACAACGGGCCAAAGGAAAATGGAAGAGCGAGTCTTCA 
GCTTTGGGGAACACGCCCAAAAAGTTGGCTTACAAACAATCGACGTTTTATGTCACATATTGCTGGGAAAATCGAGCCGAAAGGATCTGAATTCC 
TGGGTTAGTCAATGGGTATCACTGGTTTTTACCTGCTTGCACCGATTTAATGCCATTATTGACCAGCAATAGGAGGCATAACAATCCAAACTGAA 
GCGAGGAAAATGCTGCTTTCGTATTGCTCCTGGGCGGCCTCATCTTGGAGGCTGTTAATTAAACACTATCGCTTGGTAGATTTATAAACACATCT 
ATATTCGATTTCCGAAGAAGTTTAACACCAAAATGCACTTTTCCGCTGGAAAAACCACCGTGAAAAGCTGATGATTTGTGGCAACGCGTCCGCGG 
TGTTAAAAACGTTGGACTGCACTTTACGCTCCGCTTTAAATCGATTTTGAATTTATGTAATTTATCCACTGGGTGTACGGCTCCAGCAGATCACG 
CCCACTTCGGAAGAGTTTCGTTTGAATTAAGAAAATAATTATCGCAACGCGGATGACACACGAACAGAATGTCTCGCTGCCAGTGTGACCGTGAC 
TCAAACGTGGCAAGATACCGAGGGAGTCACCAACTGGGTGCTTGTCGCGTGAGCTAAGCTATCGTGTGATTAAGTATATGAAGTATTTACAAATT 

AT T T C AAC T GC T ACT AT G AAAT T AAAT T T GGT C ACT AAT AAT AGC T T T CC GT T AAC AT T TAT AT T AAT T C G T G AT AGAAAT T T T T T T GC AT A GT T 
TTAAAGCCATGTAATATCATTGCAAAATGAATGATATAAGCAAAGTATACAAATGTAATGGCAAATAAGTCAAATCCCCTCTAAAATATTTTTTA 
AAAATCTAGTTTTCTTATCTATCCAGAAATTTGAAATAAAATACATATTTTACAAAAAATTATTTCGAATACCAAATCGATAGGGGCGTAAAACA 
TCGATAGTCGTAGTAGTATCATTATCGATAGTTGGTCACACTGTTTATCCCTCTTTTCGCCAAAAATTTGTTGGCGTGTTGTCAATTTCCTAAGC 
ATTTGATCGGTTTTTTCGTTTTGTGTGTAACCCGGAGAGGATAAAACAGCGACAATGTACGGTGGAGACGAATACGCAACCAACTCGGTGAGTGC 
AGCCAGCGAGGAATGATTCATTCTCGGGAAACCAATCCCCGGACTCCGCCGACAACATAACCCATCTTTTGTTGCCCTCGCACAAATACAAAACT 
CACTGGGATGATTTTAAAGCCGTTTTACGACTTGGCATAGTTGGTGTTTAGATCATCATAGGTCATTAAGCACTGATAATCCGTGGGCCACTCCT 
TTGACACGTTCCTTTGCTATTTGGAAACGAATCCTGTAGTCCGGGGGACACCCCGGCGCCATGTGGTATATTATTGGTATCGGTGTACATTATTA 
ATAAGTCATAATACCTTCTACATTTTTGATTTGCGTGCAATTTTAGATGGTGTGTACACACCTTTAAAGAACACATGAATATATTCTA 

(SEQ ID NO: 754) 

Exon: 11438. .11053 
Exon: 9221.. 9105 
Exon: 6193.. 5997 
Exon: 5836.. 5695 
Exon: 5633.. 4455 
Exon: 4390.. 4265 
Exon: 4062.. 3859 
Exon: 3799.. 2870 
Exon: 2810. .2631 
Exon: 2567.. 2223 
Exon: 2158.. 2021 
Exon: 1962.. 1001 

Start ATG: 11158 (Reverse strand: CAT) 
TTATTTTCTTAATTCAAA 

TAAAGCGGAGCGTAAAGTGCAGTCCAACGTTTTTAACACCGCGGACGCGTTGCCACAAATCATCAGCTTTTCACGGTGGTTTTTCCAGCGGAAAA 
GTGCATTTTGGTGTTAAACTTCTTCGGAAATCGAATATAGATGTGTTTATAAATCTACCAAGCGATAGTGTTTAATTAACAGCCTCCAAGATGAG 
GCCGCCCAGGAGCAATACGAAAGCAGCATTTTCCTCGCTTCAGTTTGGATTGTTATGCCTCCTATTGCTGGTCAATAATGGCATTAAATCGGTGC 

AAGC AG AC GC C T T C AC GG AC T AT T T C T CCG AC T AC GAT T GC AAC C AGC C GCT GAT GG AAAG AGC CGT C C TC AC G GC C AC ATC AT C CT T GAC GG AA 
AGAGGACCCGACAAAGCACGACTAAACGGTAACGCTGCTTGGACACCGGTCGAGAATACCTATAACCACTTTCTAACACTCGACCTGGGAGATCC 
GCGCATGGTGCGCAAGATCGCCACCATGGGACGCATGCATACGGATGAGTTCGTGACGGAGTACATTGTCCAATATTCGGATGATGGCGAGTTCT 
GGCGCTCCTACGTGAATCCTACGAGCGAGCCGCAGATGTTCAAGGGCAACTCCGATGGCAACTCTATCCATTACAACGTCTTCGAGGTGCCCATT 
ATTGCCCAATGGGTGAGGATCAATCCCACCAGGTGGCACGATCGTATTTCCATGCGTGTGGAGCTCTACGGCTGCGATTACATCTCTGAAAACCT 
ATACTTTAATGGCACTGGCTTGGTGCGCTATGATCTGAGGCGGGAACCCATTACCTCCACCAAAGAGTCCATCCGCTTCCGCTTCAAGACGGCCT 
TTGCCAACGGAGTGATGATGTATTCGCGTGGCACACAAGGGGATTACTATGCCCTGCAGCTTAAGGACAACAAAATGGTGCTGAATTTGGATTTG 
GGATCGCGGGTGATGACCTCCTTATCGGTGGGCAGCCTGCTGGATGATAATGTGTGGCATGACGTTGTGATCTCAAGGAATCAAAGGGACATAAT 
CTTCTCAGTGGATCGTGTCATCGTAAGAGGACGCATCCAAGGCGAGTTCACACGACTGAATCTTAATAGGGAGCTGTATCTAGGTGGTGTGCCCA 
ACGTACAAGAGGGCTTGATTGTCCAACAAAATTTCTCCGGCTGCTTAGAAAACATCTACTTTAATTCGACCAACTTTATTCGAGTGATGAAAGAC 
AGCACTGAACTGGGAGAGGGATATCTCTTCACTAGGGTAAACACGATCTACGCGTGTCCTTCGCCTCCGATTTATCCGGTGACCTTCACTACCCG 
TTCGTCGTTCGTCCGTTTGAAGGGTTATGAAAATTCGCAGCGTCTTAACGTCTCCTTCTATTTCCGTACTTACGAGGAAACTGGAGTAATGCTGC 
ATCACGACTTCTATTCGGGTGGTTATCTAAAGGTCTTTTTGGAGTTTGGCAAAGTGAAAATTGATCTTAAAGTCAAGGACAAAGCGCGAATCATT 
CTGGACAACTACGATGATCAGTTCAATGATGGCAAATGGCATTCGTTCGTTATATCCATCGAAAAGAATCGCTTGATTTTAAATATTGATCAGAG 

AC C GAT G AC C AC AAC AAAG AGC AT G C AG GT T GC C AC T GG AGC AC AGT AC TAT AT T G C T G GT G GAAAG G AC AAAAAT GG AT TTGTCGGTTGCATGC 
GCTTAATTTCGGTGGATGGAAATTACAAATTACCGCAGGACTGGGTGAAGGGCGAGGAGGTTTGCTGTGGCGATGATGTCGTCGTCGATGCCTGC 
CAGATGATTGATCGCTGCAACCCGAATCCTTGCCAGCATAAGGGACTTTGTCACCAAAACTCCAGGGAGTTCTTCTGTGATTGCGGACATACTGG 
CTACGCCGGAGCTGTTTGCCATACATCAAACAATCCATTGTCCTGCTTGGCTCTCAAGAATGTCCAGCATGTGCAGCAGCGTGTGAACTTGAATC 
TCGATGTGGATGGCAGTGGTCCCCTAGAACCTTTCCCAGTGACCTGCGAGTTCTACTCGGATGGACGCGTGATTACCACCCTCAGTCACAGTCAA 
GAGCACACCACAACAGTAGATGGTTTTCAGGAGCCTGGCTCCTTTGAGCAGTCCATCATGTACGATGCCAACCAACTGCAAATTGAAGCTCTGTT 
GAATCGGTCTCATAGCTGCTGGCAGCGCCTGAGCTACTCCTGCCGCTCCTCCCGACTCTTCAACTCGCCATCTGAGGCCGGAAATTTCCGTCCCT 
TCTCTTGGTGGATATCTCGCCACAATCAACCGATGGACTATTGGGCTGGAGCTCTACCTGGCTCTCGTAAGTGCGAATGTGGAATTCTGGGCAAG 
TGTCACGATCCCACAAAGTGGTGCAACTGCGATTCGAACAGTCTGGAATGGATGGAGGATGGTGGTGATATCCGGGAGAAGGAATACCTTCCCGT 
GCGAGCTGTGAAGTTTGGCGACACTGGAACTCCGTTGGACGAGAAAATGGGTCGCTATACTCTGGGACCGCTGCGCTGTGAGGGCGATGATCTGT 
TCAGTAATGTGGTGACCTTCCGCATTGCCGATGCTTCTATTAACTTACCACCATTCGATATGGGTCACTCTGGAGATATTTATCTGGAATTCCGT 
ACCACACAAGAGAACTCTGTGATCTTCCATGCTACCGGACCTACAGATTACATTAAGCTAAGTCTCAATGGCGGCAACAAATTGCAATTCCAGTA 
TCAGGCGGGAAGTGGTCCACTGGGCGTTAATGTGGGCACAAGCTATCACCTGAACGACAACAACTGGCACACGGTTAGTGTCGAGCGCAACAGGA 
AGGAGGCTCGTCTGGTGGTTGATGGTTCTATCAAGGCTGAGGTTCGAGAGCCGCCGGGTCCAGTGCGTGCACTTCACTTAACATCGGATCTGGTG 
ATTGGCGCCACTACTGAGTACCGCGATGGTTACGTAGGATGCATTCGTGCACTGCTGCTGAACGGAAAGATGGTTGACTTGAAGGAATACTCCAA 
GCGTGGATTATACGGCATCAGCACCGGTTGCGTGGGTCGCTGTGAATCAAATCCATGCCTAAACAATGGTACTTGTATCGAGCGATACGATGGCT 
ACAGCTGCGATTGCCGTTGGAGCGCATTCAAAGGACCTATTTGCGCAGATGAGATTGGTGTCAACCTACGCTCCAGTTCGATTATTCGCTACGAG 
TTCGAGGGTTCCTTCCGCTCCACAATTGCCGAGAACATTCGCGTCGGTTTCACCACCACCATTCCGAAGGGATTCCTGCTTGGCTTCTCTTCCAA 
CCTAACTGGCGAATATTTAACTATTCAAATATCGAATTCTGGTCATTTGCGTTGTGTATTTGATTTCGGTTTCGAGAGGCAGGAAATCATATTCC 
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CCAAAAAACATTTTGGCCTGGGTCAGTACCACGATATGCACTTTATGCGCAAGAATGGTGGATCCACAGTGGTTTTGAAGGTTGACAACTACGAG 
CCGGTCGAGTATAATTTTGATATTAAGGCCTCAGCGGATGCTCAATTCAACAATATCCAGTACATGTACATTGGCAAAAATGAATCTATGACAGA 
TGGCTTTGTGGGTTGTGTGTCGCGAGTTCAATTCGATGATATCTATCCACTGAAGCTGATGTTCCAGCAGAATCCTCCCAAGAATGTCAAATCAT 
TGGGCACTCAATTGACTGAAGATTTCTGCGGAGTGGAGCCAGTTACGCATCCGCCCATTGAAATCGAGACTAGGCCACCACCCTTGGTGGATGAG 
GAGAAGTTGCGCAAGGCATACAATGAAGTTGACTCTGTGCTGTTGGCATGTCTGCTTGTGATTCTCTTCCTGCTGCTGATCCTTATGTTCTTCCT 
TATCGGTCGCTATTTGCACCGACACAAAGGCGACTACCTGACGCACGAGGATCAGGGTGCCGATGGAGCTGACGATCCAGACGATGCTGTCCTGC 
ACTCAACTACTGGCCATCAAGTCAGGAAGCGAACAGAGATCTTTATCTAAACCGGTCGCAAGAATATGCACATAAATCGTTTTTAAATCGCCTTA 
G GC ACGC AAC AGC G AC C AC T C AG AC AC AC C AAAAG AT AAAAAAAAAAT AGAT GAG AAC G AAG T G AAAAT C AAAT AG AC AC GG AAAAGGC T C ACC A 
CGCATAACAGACAGAAAATACTTTTCCATGCACTGCCAATTGTTTTGCGCAAGTAAATGCTAAGATTTGATTTATAATTGGTTGGCTACATTTTG 
CACTTATGTTTTTTTTTTTAGTCTTTGGTTCTGTCTGACGGCATTATTGAAATTTCTTCTATCACAGAATACCGTCCATTTTTAGTTAATTTTAT 
GAACTTTTTGTTTGCAAAATTGAGTTAAAAACTTATTTAAGGGAGCTGCCCGAGTAGCTCAACAAAAGCCACCTATTTTATGTATTTCTATTGCG 
CGAGTATGTAAGAACGAACATTCCATTATTTTTAATATGTTTTCTCGCTTTACGTTTAATTTAAAAATGTCTTCTACTTTACACACAAAAGAACA 
TGAAAATCATTTTGTGGCTAAACAAAATCGGCTACTTTAGTAACACAAAGCATTAAAAACAAATTGTATTTTTGGAAAATGCATTTTGAACGTCT 
TGACTTGAAATTTTCAAATATACATTAGCATGTGTATATAGATATATATAGCAATAAAAAAATATATATAATTACAAAAAAGAGACAATGAAAAA 
ATTAAAATTTTTATCTTTCAATAAAAGAGAACTACAAATAAATGAAATTGGTTGTGTTGGA 
(SEQ ID NO; 755} 

Start ATG: 281 (Reverse strand: CAT) 

MRPPRSNTKAAFSSLQFGLLCLLLLVKNGIKSVQADAFTDYFSDyDCNQPLMERAVLTATSSLTERGPDKARLNGNAAWTPVENTYNHFLTLDLG 
DPRMVRKIATMGRMHTDEFVTEYIVQYSDDGEFWRSYVNPTSEPQMFKGNSDGNSIHYNVFEVPIIAQWVRINPTRWHDRISMRVELYGCDYISE 
NLYFNGTGLVRYDLRREPITSTKESIRFRFKTAFANGVMMYSRG^ 

IIFSVDRVIVRGRIQGEFTRLNLNRELYLGGVPNVQEGLIVQQNFSGCLENIYFNSTNFIRVMKDSTELGEGYLFTRVNTIYACPSPPIYPVTFT 

TRSSFVRLKGYENSQRLNVSFYFRTYEETGVMLHHDFYSGGYLKVFLEFGKVKIDLKVKDKARIILDNYDDQFNDGKWHSFVISIEKNRLILNID 

QRPMTTTKSMQVATGAQYYIAGGKDKNGFVGCMRLISVDGNYKLPQDWVKGEEVCCGDDVVVDACQMIDRCNPNPCQHKGLCHQKSREFFCDCGH 

TGYAGAVCHTSNNPLSCLALKNVQHVQQRVNLNLDVDGSGPLEPFPVTCEFYSDGRVITTLSHSQEHTTTVDGFQEPGSFEQSIMYDAWQLQIEA 

LLNRSHSCWQRLSYSCRSSRLFNSPSEAGNFRPFSWWISRHNQPMDYWAGALPGSRKCECGILGKCHDPTKWCNCDSNSLEWMEDGGDIREKEYL 

PVRAVKFGDTGTPLDEKMGRYTLGPLRCEGDDLFSNWTFRIADASINLPPFDMGHSGDIYLEFRTTQENSVIFHATGPTDYIKLSLNGGNKLQF 

QYQAGSGPLGVNVGTSYHLNDNNWHTVSVERNRKEARLVVDGSIKAEVREPPGPVRALHLTSDLVIGATTEYRDGYVGCIRALLLNGKMVDLKEY 

SKRGLYGISTGCVGRCESNPCLNNGTCIERYDGYSCDCRWSAFKGPICADEIGVNLRSSSIIRYEFEGSFRSTIAENIRVGFTTTIPKGFLLGFS 

SNLTGEYLTIQISNSGHLRCVFDFGFERQEIIFPKKHFGLGQYHDMHFMRKNGGSTVVLKVDNYEPVEYNFDIKASADAQFNNIQYMYIGKNESM 

TDGFVGCVSRVQFDDIYPLKLMFQQNPPKNVKSLGTQLTEDFCGVEPVTHPPIEIETRPPPLVDEEKLRKAYNEVDSVLLACLLVILFLLLILMF 

FLIGRYLHRHKGDYLTHEDQGADGADDPDDAVLHSTTGHQVRKRTEIFI* 

(SEQ ID NO: 756) 

Name: Neurexin IV 
Classification: cell_adhesion 
Gene Symbol: Nrx 
FlyBase ID: FBgnOQ13997 



Celera Sequence No. : 142000013384 650 

TGGCTGTTATAAAAAGTTAATAGCCGTTCAAGTCCCCCTTCCAAACCACTTGGAAACTGGTCCTGCTCCTGCGCAGTGAAAGTATATAGGAAACG 
GAGCACATAGGCGAAGCTACCATCCAACCGCACACCAGAACTGTGTAACCCATTCCCCGATCCCGTAGAGAATTTATTGCATGGGCCGCGACTGC 
CGTTGGCTTCTGTCCATCTATCTTTGGCTGTGCTTTTCCTGGGATTTTCCCCAATGCGATGGAATGGGTACTCAGACCACAGAGGACCTGCCCAC 
AATGCCACCATTGGCAGTTACATTTTATGGCCAAAGTTCACCAGAACCCAACTCGTTCCCCAGCCCCACGCATTCTGGCGACCAAGTGGCGCCAT 
TTTTGCCGCATTGTCATCGATGGGAGGATGAAGATGGGGATGGGCAACTGACAGGGATTCTAGTTTATGCTTGCGAGGACTTTTCAAGAAATAAA 
CTTTTTCCTCTGTCGACATCAGCGAAAATAATATCGCATTCGGAAAAGTATAAAAATAAAATGAGGACGCAGATTAAAACTATCAAGTTGAACAG 
TTGGAGATAGATTTTTCTATAAATTGATTTTTATTTTCACTTTAAACATTTCTATCGAGACTGGAATGGACAACATATATAAAAACACACTTCAC 
TTCAATTGTTTTATAATTTATCAGTATGTATATTAAGATTATTCAGTAGAAAACTTCCTTACATCGCACCAGTGACGATGCGAACCGAACAAAAA 
CTTAGTGAGACAGTTCATCATAATTATTAAATTAATTCACTTTCTGGCGTAATAACAAAAATGCTGGCTAACTAGCTTTGTTGACCATTTGGCGC 
ATGAAAATGGGAAAAAAGATTTATGTTGCCAATTCAGTTTCAAGAGCCGAGCCTGATGGATTCATCTCAACTCTAAAGTATACAAGATATTACAA 
AAATTTGCTACGTTTACGGAATTTTCGTACTTATATAGATTTTGCCATTTTTATTTTTATTTTTTTTTAGATAGATTTTCTTGTGCTTCTTATAA 
ATGCCTGCACACATGATATGCAGAGTTGAGAATGGAGTGAAACTTTCGACTTGACTTGCTAAAACAAATATGAAACATCTTCTGTTGGCTATTAC 
TAAGGGAAGTTTGTCTTGTGGCTTCTTCGACGGTTTCTTCGTTTTGTATTGCGTGGCATTGTCATGAGGCATACATCATTGATCAAATTTCTTGT 
TCTTGCTTTGATTAAGTAACGACTTGAGTTTGTATTCGTGACCTGACTTCTTAAACAGACTTCTATTTAAAAAACAAACAGTTACACATAATGGT 
AGAGCATGCTACGCTGAAGTCTTAAATCTTTCATTTTAACTAAATTTAGTTGCTGCCTAGTCAGGTGTGCGAATCTGTTTCCTCGTGATTTCTGT 
GGGTGTATGCTTGTGTGTTTGTTATTCGTTACGATCCCAATAAGTCGCGAATTCGCCTAATAGAACTTTTTCGCCTGTGCTTCGCTGCGGTTTCT 
GCAAGGAATAATAAGTTAGATTGTTTAGTTTCGAATCCACGCTTAAAGATTCCATGAACTCACTTGAAGACGTTGTAGATGAAGACTAGCAGCAG 
TTGCACAACAGTTACGCTTAGGAACAATGTGACTCCAACGCAATTGCCCGTGGGACAATTGGCCACCTGCGCTGCTCCCTGTGGCGCTGCCAATC 
TGCAAGGTAAGGGAATGCATATTGAGTATCAAGGAGGTACTTAAAAGATGACAATAAAGTATAATAGTCTAAAACTCATAAACATGTTCCTCTGA 
ATCAAACTTGTTAAAACTCTAATTTCATAATGTGGAATACGAAAAATTAGTATAAATATGGCTGAAAAGGTTAAAATTAATTCGAGCTCAGAATA 
ACTTCGATGACAACAGCTTTAGGAAACTTAAAATAGACTTTGAGCGTATTCCTCACCTTTGGCCCACGTGGGTGATGCCCTGCTTGACCTGATTC 
ATGCCATCGCGCATCTCGGCGATGAGCGTTTGTACATCATAGCCGGTGGATTGGATTTGTGCTGTGGGCGCATGCTTCTGATTCGTTTGAATGTT 
ATCAGTGCGCACATTGATATCGCCAACCAGTTGACGAATCTCGCGGATGGAACTGAGCAGCATGTTCTGATTAGTGAGCAGAAGATCTACATCGC 
TCCTGCTGACTGCGCCAACGGGCAGCTGTTGCTGTGGCACTCCGCCGGCGGCGGGCGGAGGCAGAGCCTGTCCAGCATTACGTGAGACCAGCGAC 
AGCGTGGTCTCCTGGCGGCCAATAATCTCATCCACTTTGCGGGAAAGTTCGCGTAGGTGATCGGCAATTTGGCTCTGACCCTGCCAGATCTGACG 
CAGCTCACGCTGGTTCTCCGACTCGTAGAACTCCTCCCAGTCCTCCTCGTCTTTGTGCTGTGGGATTATACGAAAGTTGATTATCAAAACTAGCG 
ATAAGACATCTTTATCAGCTGGCTGCGTGGCAATTACCTCGTCGGGATGGTCCTTCTTGTACTCCTGCTTCTGTTTCTCCAGTTTATCTTGGTAT 
TCCTTGTACTCTTGCGTAAGCTTCTCCTGGTTTTCCACCTTGGGCTGCTCCTGAACTTGTCCAGCGGCATGCAACGACGTGGTCAGGAAATGGAA 
CACATCGTGGTCATCGGCCAGACCGCCCGTGGCGGCGGAAATGCCAAAGTAGCCGTTCTTGGGCAGATTGACGCCATCCGCTCTCAGACACAGCT 
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CGTAGTCATCGTTGTTGTTGGACATTCCGTTGTGGATCATGACGGTAAGCACGTTGTTGTAGTACTCGATTCGCGCCCGTGTAGGGAAGGGCTTG 
TTACGGAAATCCCTGAGGCAACCGCTCAGGAGCTGAGTGGTTCCATCTTCGGCATGGTCATACAGCTTGGTGCCATCATTGAGCACGGCACTGAT 
GTAAGGATTGTTGTGCTTGTTGTCATTATCGAAGGAATCGAACATGATGGCCAGACCATTCCAGCGGTCCGAGGATCCGAACACAGGCCCATTGT 
AGTCACCCTTTTCCGTGGTGTACCAGAAGGCCAATCCATCGGCTCCGATCCTGCCACGTCCCGTCACCCGGAACACGATCTCCACGTCCCACCAG 
TCGAAGTTCGTCTGTGACTTTGTCCAGATGGCACCCTTCTGTGAGCGCAGCGATGGCGCCACGCGCACACTTTCCGAACTGGCGATGGCATCTGA 
AATGAGGAGCAACAAGTTGTGGCTATAGAAACACCCGAAACCGTGGTCAAAACGACATCCAAACAAAGTGCCTGCGTGTACACTTGGCCGCCTCG 
CGCCATCTCGCTCGCACCCATGGCCGCAGACCAGCAGCGAGTGGGCCAAAACAACTGACGTGGAGCTGCGTCTTCCACTCGTACTCACTTCCCCC 
GTACTCCCAAAACGGCACGGTGCCGTCTTTCTGTGCCAAATACGGCGGCTTGAACGAGTACTTGTACTCGAAGCGACGGTGCACGCCCACGGCTC 
CAGGACTCAGATTGCCGGTCGCCTCGGAGCTGGGATTCCACGCCAGGAAGCACAAAATGGCCAGGAATGTGGGCCGCATCTTTGCAATTATATGT 
TGACAACTAGTGGTGGGTGGGCGACGATGGTGCAAACTGATGATATCGATAACCAGCCGGCACCGATAGGCGGTGGTATCGCACTGGATGTTCTA 
AAAGCATTGTATAATTTAAAATCAAATATTATTTTATGTATTGTCTACCATTAACAATTAAGGTAATATAATATAAATTGATTAGAACATTTATT 
TAGGCCATCCTTCGTATATAAATGTTGAACTTAAATAAAACCAGCCATCCTATAACAATTAACTTCAGCTTAGATCACAATTTAAAATTCTATAA 
GATAAAAGTTTTTCCATGCATTCTAAGAAACAAGAACCAAGTGATCTAAAAAATATTCAGGCAGCGACAATACAATCGCTATCGATTTCGGTAGA 
TTTTATATCGATATCGCAAATGCCCACCAAATTCAAAAGATCTTGTTATCACTTTATGATGTTGAGGTGGAAAAACATTTATTTTGAAAATTTAA 
AGTGTTTACGTCCCGTTAGAAAATAAACATAAATATAGCAAAATACGCGCCATTCGCATGTGCTTACATAGATACATGTTTATCAATACGTTATT 
GTAAAATTATTTCGTTAGTTGTTTCAATTGCCTTTTGCGTGATATCACCAAAGTAGTATTACATCCATATCTCCAAAATTCCTTTAGCACAGTTA 
GCACAGTGTGTGTATATATGTTTACATCTTTATTCTTTGTTTCTGATTTCCTACCGTTTGTAGTTTTCGTAGAGCTTACAGCTAGAATTATTATA 
GGTTTTAATGCCTCGGTAGCACATAGTGTGGGCTTCATAAAATATTGTTAGTTGCTCAGTTTACACAAGTGATAGCTTTGTGTGCAGATCGGTGT 
GTGGGGGATAGTCATTAGGGGCGATCGTGCCGCAAGTTTCGCCATTTTACAAATGTTAGTTACTTATTTTGAATCACAATCACTTTCGTGTTAGT 
TCTGTGCGCGAT C AAAAG AAAAAC T C AAAT C G AT T AAAAC AAAAT C AAAGT CT 
(SEQ ID NO: 757) 

Exon: 3613.. 3414 
Exon: 3226.. 2508 
Exon: 2432.. 1957 
Exon: 1709.. 1584 
Exon: 1518.. 1001 

Start ATG: 3594 (Reverse strand: CAT) 

tcaacatataattgcaaaS 

CTGGAGCCGTGGGCGTGCACCGTCGCTTCGAGTACAAGTACTCGTTCAAGCCGCCGTATTTGGCACAGAAAGACGGCACCGTGCCGTTTTGGGAG 
TACGGGGGAAATGCCATCGCCAGTTCGGAAAGTGTGCGCGTGGCGCCATCGCTGCGCTCACAGAAGGGTGCCATCTGGACAAAGTCACAGACGAA 
CTTCGACTGGTGGGACGTGGAGATCGTGTTCCGGGTGACGGGACGTGGCAGGATCGGAGCCGATGGATTGGCCTTCTGGTACACCACGGAAAAGG 
GTGACTACAATGGGCCTGTGTTCGGATCCTCGGACCGCTGGAATGGTCTGGCCATCATGTTCGATTCCTTCGATAATGACAACAAGCACAACAAT 
CCTTACATCAGTGCCGTGCTCAATGATGGCACCAAGCTGTATGACCATGCCGAAGATGGAACCACTCAGCTCCTGAGCGGTTGCCTCAGGGATTT 
CCGTAACAAGCCCTTCCCTACACGGGCGCGAATCGAGTACTACAACAACGTGCTTACCGTCATGATCCACAACGGAATGTCCAACAACAACGATG 
ACTACGAGCTGTGTCTGAGAGCGGATGGCGTCAATCTGCCCAAGAACGGCTACTTTGGCATTTCCGCCGCCACGGGCGGTCTGGCCGATGACCAC 
GATGTGTTCCATTTCCTGACCACGTCGTTGCATGCCGCTGGACAAGTTCAGGAGCAGCCCAAGGTGGAAAACCAGGAGAAGCTTACGCAAGAGTA 
CAAGGAATACCAAGATAAACTGGAGAAACAGAAGCAGGAGTACAAGAAGGACCATCCCGACGAGCACAAAGACGAGGAGGACTGGGAGGAGTTCT 

AC GAG T C G G AG AAC C AGC GT GAG C TG C GT C AG AT C T GGC AGGGT C AG AGC C AAAT T GC C GAT C AC C T AC GC G AAC T T T C C CGC AAAGT GG ATG AG 
ATTATTGGCCGCCAGGAGACCACGCTGTCGCTGGTCTCACGTAATGCTGGACAGGCTCTGCCTCCGCCCGCCGCCGGCGGAGTGCCACAGCAACA 
GCTGCCCGTTGGCGCAGTCAGCAGGAGCGATGTAGATCTTCTGCTCACTAATCAGAACATGCTGCTCAGTTCCATCCGCGAGATTCGTCAACTGG 
TTGGCGATATCAATGTGCGCACTGATAACATTCAAACGAATCAGAAGCATGCGCCCACAGCACAAATCCAATCCACCGGCTATGATGTACAAACG 
CTCATCGCCGAGATGCGCGATGGCATGAATCAGGTCAAGCAGGGCATCACCCACGTGGGCCAAAGATTGGCAGCGCCACAGGGAGCAGCGCAGGT 
GGCCAATTGTCCCACGGGCAATTGCGTTGGAGTCACATTGTTCCTAAGCGTAACTGTTGTGCAACTGCTGCTAGTCTTCATCTACAACGTCTTCA 
AAAACCGCAGCGAAGCACAGGCGAAAAAGTTCTATTAGGCGAATTCGCGACTTATTGGGATCGTAACGAATAACAAACACACAAGCATACACCCA 
CAGAAATCACGAGGAAACAGATTCGCACACCTGACTAGGCAGCAACTAAATTTAGTTAAAATGAAAGATTTAAGACTTCAGCGTAGCATGCTCTA 
CCATTATGTGTAACTGTTTGTTTTTTAAATAGAAGTCTGTTTAAGAAGTCAGGTCACGAATACAAACTCAAGTCGTTACTTAATCAAAGCAAGAA 

C AAG AAAT TT GAT C AAT GAT G T AT GC C T C AT GAC AAT GC C AC G CAAT AC AAAAC G AAG AAAC C GT C G AAG AAGC C AC AAG AC AAAC T T C C C T T AG 
TAATAGCCAACAGAAGATGTTTCATATTTGTTTTAGCAAGTCAAGTCGAAAGTTTCACTCCATTCTCAACTCTGCATATCATGTGTGCAGGCATT 

TATAAGAAGCACAAGAAAATCTATCTAAAAAAAAATAAAAATAA 

(SEQ ID NO: 758) 

Start ATG: 20 {Reverse strand: CAT) 

MRPTFLAILCFLAWNPSSEATGNLSPGAVGVHRRFEYKYSFKPPYLAQKDGTVPFWEYGGNAIASSESVRVAPSLRSQKGAIWTKSQTNFDWWDV 
EIVFRVTGRGRIGADGLAFWYTTEKGDYNGPVFGSSDRWNGLAIMFDSFDNDNKHNNPYISAVLNDGTKLYDHAEDGTTQLLSGCLRDFRNKPFP 
TRARIEYYNNVLTVMIHNGMSNNNDDYELCLRADGVNLPKNGYFGISAATGGLADDHDVFHFLTTSLHAAGQVQEQPKVENQEKLTQEYKEYQDK 
LEKQKQEYKKDHPDEHKDEEDWEEFYESENQRELRQIWQGQSQIADHLRELSRKVDEIIGRQETTLSLVSRNAGQALPPPAAGGVPQQQLPVGAV 
SRSDVDLLLTNQNMLLSSIREIRQLVGDINVRTDNIQTNQKHAPTAQIQSTGYDVQTLIAEMRDGMNQVKQGITHVGQRLAAPQGAAQVANCPTG 
NCVGVTLFLSVTVVQLLLVFIYNVFKNRSEAQAKKFY* 
(SEQ ID NO: 759) 

Name: mannose-specific lectin 
Classification: chaperone 

Celera Sequence No. : 142000013384 670 

GAGACCACGGATCCTGGAGATAACTCGCAAACGAAGGAAAGAACAAACACGCATCTTGAGGAGACGCAAAACCAAGGAAATACTACACTCTGGGA 
T T C T GC AC AC AG AAAAG AT T G AC T C G GC T AT TG ATC C T GAC T AAT AAT AT AT AT AT T AAC AAT AT AT AC A CT T T AT T AAGT C GG AAT C GT T T AAT 
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TCTACATACAAATTTCATAGGATGTGTCACTTCATGTGAGAATATTAACAAGATTATCTTGGCAATAAGCGCTTCGTCACTTCGTGGATCGCTAG 
ATGACGTTTATTTGGGAACTCGTCATCTTCATCACAACCTTTTCATCTGTAAAACGTGACTCTACTAATTATTATATATTTTTATTATTATAATA 
CAATAATAATACAAAAACATATTTACAGAAAATAACTTTGAAACCTATGATTTGTTTCATAACT 

ATCTTAGATACTGAGATATTACGGGGTATTATGAATAGGTTGGTTTTGATAAAAACGGTAAAGGTATAAAACAAGACATACACCCACAAAAAATA 

AATTATATTATTCATGCAAGGATGTACCAAGGAGTTTTTCGGACAATTCGGGGATGATTGCAAAACAAAGGTAAATTATTGAACGCCGTGTTGAA 

CAAATCATGCTCAGGTATACCCAGGCTCATTTACCCTGCAAATTATGGCACTGAATGTGGCATGTTTACCTTTCAAATTGAGCGCACAATACAAA 

CCAAATGCGTTTGACACTGAAGTCAGAATATGCCTTTCTCGCAAATCCAGGGACCTTCTTCGTCTCGAAGGATATCAGATTGAGAAAATTAGCTG 

TTGGAGTGGGGAAAATAATGAGGGATCATAGACACACATTGAGGAAGGCACAGTTACTATTTCATTATATTTTGAAGACTCAGTTTTAATCTAAA 

TAGCAATCTATAGGTTTGTTCAATTGTTTGTTCGGTGTTTTCTTTTATTTCGTTTGTTGTTGTATATGGGCTTTAATTTGAATATTTATTTACAA 

TTTATTTAAGACTCGTTACACCCTTGCTCAAGTCAAATATAGTAACAGTTGCAATAAACAAGCTTTCAACTTATTGCTTACGTGTCTCATCAATA 

TAATTTACAAGAAAAACTTACTTAAGGCTAATCTCCGTTGATGTTTTGCTTTATGTCTCTGTGCTTCACGCATAATTTACTTTACGGATATTTAC 

CGGCTGCTTATTTTGATGCTTCTTTTATTTTGTAGGTATGTGTATTTTTACTAGTGAGAATATTGGTTGGTTAATTGAGAGTATATATAATTACT 

ATATATATCTGACTTCTATATACCATTTTTAACATTTGTAATTTATGTACGTTCGTTTTTACGAATTGAGTTTTTACTATGCGCTTGCGGTTGGT 

TTTCTTTGTATAAATAAAATAGAATATTTTCGCTTTTGCTTCGCTTCGAATAATACATGCCATATACATACATATAAGAAAATGCTCGATTACGA 

TTCTCATAACTTATGGTTCGACGTTAACCAAATGAGGCCCTCGGACAGGCCATCGCCGGATGTGGCACATGATGGTTGTACATACCAATTGCGAT 

CTCTTATTCTAGTTAAACCTAGTTTTTCCTGGATTTCATGGGGTTTCATTGCTGCAAGAGAGAACAATAAGCGGATGGAAACAATTGGCTGAGCA 

CTATACTAACCATCAGGCAGATCCTGCTTGTTAGCAAATATCAGTATGATGGCGTCGCGCATCTCCCTATCGTTAATTATCCTATGCAGCTCCGT 

GCGCGCCTCGTCTATCCGATCCCGATCCGCACAATCCACGACGAAGATCAGACCCTGTGTACCCGTGTAATAGTGTCGCCATAGCGGTCGAATCT 

TATCCTGCCCACCGACGTCCCACACATTGAACTTGACATTCTTATAGGTGACGGTTTCCACATTAAAGCCCACAGTGGGTATCGTTGTAACAGAT 

TGGCCAAGTTTCAGTTTGTACAGAATCGCTGTGGGTGGACAAAAACAATATTACTTTAGACACGCAAAGGATGTATAAACAAAATAGCTCGAATT 

CGATTAGTGATAGTACTGGTGTACGTATTGAAAGGTCTTTATATTGTACATTTTTTCAAAGTACAATAAGAATAAAAACAAAACAATAATTGCCT 

CGCCTCAAGAGTATTTTTTAAATGGGATTGGTTTCCACTTAGGAAACTGATGAAACTAATGGGTACTAATCAGGCCCACAATCGAATTCGAGAAC 

CCTTTATTGTTTATGTGCTTACTTGTTTTTCCAGCCGCGTCCAGTCCGAGCATGAGAATTCGCATTTCCTTGTTGCCGAAAATTTTTGATAGTAA 

CTTTCCCATGGTGCTGATGTGTCATCCATTAAACCTTAGCCAATCTGAAAAACAGTTCGAGTAGAGTTGGCGAGGTCTTCCAAATAAGCATATCG 

ATAAGAATAGCTGGGCGAGATTCAGGGCAGGGTCAGTGAATTTCCAGATAATAGGGCCATTGTAATACAGTTTGCAGTTAACATATCAATTTATG 

ATATAAGAACGCCGTCGCGCGAAGGCGCGTAAGTGATATATGAACATACAAATGTAGCTACATATGTACACATACGGCGGGCACAGGCAATTTGG 

TGGCCATATAGAGCTTGTACGACGGCGGAAGCTAAGACCATTATTTTGCATAATAATTTCACTTACATTGTCATCAGCTCGTTGCTAAAAAGCGA 

GAAAAAAGTGGAGAGGCAGAATTTATCAGTGAAGAGCTTGCAGCACACAGTCGCACACACAAACACACGCACACATGCTTACACATAGGAAGTAC 

AATTGGACAGATAAACACACTATAGTGTCTCGCAAATTGAGGCAGAGTTATCTCAATTGCGGGCTTATCGCTGGCTCCACTGAATGGGCGGGTGG 

ATCCGACCGCTCCACCCCCGCCCCCTGAACAGCCACCGCACCCCTCGACGACCCAGTAAAGCCGGCTTGGGAGCAACGATATCGGATGGTTTTCG 

GATTCTACGGAATTTCGCTCGCTCTCTCGCTGCTGCTTGCCTACGTCACAAACACGCCCGTAACACACATACACATGTATACACACCCAAGTCTC 

CGACGACCGCCGTTACGCACCGCTTTCCAGCCCAATTCGTTCCACGCCAGCCTGAAATCCGTTAACATGTAGTGCCAAAGTGCGCGGGCCGGGGC 

ATAAAATGCATTTGCTAGCACACGGCGGCCAAAAAACTATAACTAACAACAACAATTTTGCACTAATCTTCTTGTATTTATATACACATACACGA 

AGCAATGTGCCCGTTAGCGCAGAGCGGAACAAGATCTTCGGGAATTCCAGAAAATACGCTAAAAAATGCAAAGAATGCGGAGGATAAATACCAAA 

AATACTGAATTTATCAGATGAAAAACGGTGACTGGCGGGGATATCGTGATTTTTTGGCCAATAAGTTATTACTTTTGCAATTCAAAATTGAAAAA 

GATTTATTTTCTTATATTCATTTATTATTCATATTAATTTAGCAAACAATGTAGCTGATATATTCTCCTCGTACGTTCAACAATTTGTTAATTTT 

TTTTAATTTTAAAAATTTTTGTTTTGTGTATTTAGTGATTTGGTATTTTAATGTAGCCCTGGTGAACTTACAAAGCCCTAGGGCGGCCACACTGA 

AACC G AT G AT T G AAAT C AAGT C GAT T G AT T T C AGT T TCCTTCTTTTGTATTGTTT TAT AT T T GAGC AAT AC T T AC GAT AC T G AC AT C GC T C AG AG 
GTACAATGGAACTTAGTTAGTCGTCGACCAGAACAGACTCGGAAAACATGGCTGGCATTTGCCCCCGGGAGGCGGTTCGGGTGAAAGTGAAGGCA 
AGTGAAAGGTGGCCAGAAGGCGAGACAACAAAAGGGCTGCTTAGTTTGTGTTCCTCAATCTAATTTTTTTGAAATCGCCCACAGAAATGCGAACC 
CACTGCTCGGCCGGAGTGGCGAAAGTTTTCAGTGGACCCGCAGATCACCACGCTGGAGGTGTTGTACTCCCTGCTGGCCAAGGCCTTCGATGTCA 
AGTCGGACTTCTCCATTAAGTACAAGGCCTTTGATCCGGCGGGCAACGAGATCTACCTGGCCGTGCGCTCCGACTGGGATCTGGATGCCGCCTTT 
CTGAGGATTCACAACATTTCCATTCAGACGGCCTCAGAACCCTGCCTCACCCTGCAGATCGATGTGAAGCCATTTACGGTGGTCAGGGAGTGTGA 
GACCGAGGCCTCCCCGGGTAGATCCATAACGGGAGCGCCTGCTCCCGCAACTCCTGCACCT 
{SEQ ID NO: 760) 

Exon: 3336.. 3127 

Exon: 2419.. 2303 

Exon: 2023.. 1721 

Exon: 1666.. 1001 

Start ATG: 2384 (Reverse strand: CAT) 



Transcript No. : CT21197 

GGCACATTGCTTCGTGTATGTGTATATAAATACAAGAAGATTAGTGCAAAATTGTTGTTGTTAGTTATAGTTTTTTGGCCGCCGTGTGCTAGCAA 
ATGCATTTTATGCCCCGGCCCGCGCACTTTGGCACTACATGTTAACGGATTTCAGGCTGGCGTGGAACGAATTGGGCTGGAAAGCGGTGCGTAAC 
GGCGGTCGTCGGAGACTTGGATTGGCTAAGGTTTAATGGATGACACATCAGCACCATGGGAAAGTTACTATCAAAAATTTTCGGCAACAAGGAAA 
TGCGAATTCTCATGCTCGGACTGGACGCGGCTGGAAAAACAACGATTCTGTACAAACTGAAACTTGGCCAATCTGTTACAACGATACCCACTGTG 
GGCTTTAATGTGGAAACCGTCACCTATAAGAATGTCAAGTTCAATGTGTGGGACGTCGGTGGGCAGGATAAGATTCGACCGCTATGGCGACACTA 
TTACACGGGTACACAGGGTCTGATCTTCGTCGTGGATTGTGCGGATCGGGATCGGATAGACGAGGCGCGCACGGAGCTGCATAGGATAATTAACG 
ATAGGGAGATGCGCGACGCCATCATACTGATATTTGCTAACAAGCAGGATCTGCCTGATGCAATGAAACCCCATGAAATCCAGGAAAAACTAGGT 
TTAACTAGAATAAGAGATCGCAATTGGTATGTACAACCATCATGTGCCACATCCGGCGATGGCCTGTCCGAGGGCCTCATTTGGTTAACGTCGAA 
CCATAAGTTATGAGAATCGTAATCGAGCATTTTCTTATATGTATGTATATGGCATGTATTATTCGAAGCGAAGCAAAAGCGAAAATATTCTATTT 
TATTTATACAAAGAAAACCAACCGCAAGCGCATAGTAAAAACTCAATTCGTAAAAACGAACGTACATAAATTACAAATGTTAAAAATGGTATATA 
GAAGTCAGATATATATAGTAATTATATATACTCTCAATTAACCAACCAATATTCTCACTAGTAAAAATACACATACCTACAAAATAAAAGAAGCA 
TCAAAATAAGCAGCCGGTAAATATCCGTAAAGTAAATTATGCGTGAAGCACAGAGACATAAAGCAAAACATCAACGGAGATTAGCCTTAAGTAAG 
TTTTTCTTGTAAATTATATTGATGAGACACGTAAGCAATAAGTTGAAAGCTTGTTTATTGCAACTGTTACTATATTTGACTTGAGCAAGGGTGTA 
ACGAGTCTT AAAT AAAT TGT AAAT AAAT AT TCAAATTAAAGCCC AT AT ACAACAACAAACG 
(SEQ ID NO: 761) 

Start ATG: 24 6 (Reverse strand: CAT) 
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MGKLLSKI FGNKEMRI LMLGLDAAGKTT I LYKLKLGQS VTT I PT VGFNVET VT YKNVKFNVWDVGGQDKIRPLWRHY YTGTQGLI FVVDCADRDR 
IDEARTELHRIINDREMRDAIILIFANKQDLPDAMKPHEIQEKLGLTRIRDRNWYVQPSCATSGDGLSEGLIWLTSWHKL* 

(SEQ ID NO: 7 62) 

Name: ADP ribosylation factor 51F 
Classification: ligand_binding_or_carrier 
Gene Symbol: Arf5lF 
FlyBase ID: FBgn0013750 



Celera Sequence No. : 142000013384809 

GGACCATACAAAAGAAGTGCAAGAAAGGTGCGCGCGGTAACTTCTATATGTATCATAATCCTACACAGCGCCTCCCCAAAAGGGGAACGTAGTAG 
AATCGGGCATTCCCATAAAAGAGCAAGGCGGTGTCATTGCCGGCTTAATGGGTCCAAAGGTCACAAAATGCAAAAATCACTCTTGTTGACCCATA 
TACAAGAATGCTTCCTTCTGGGGAGGATTGTGCTCTCTTCGACCCACTTCGAGGTTCTCCCCGCCACAGCTTCGTTTCGTTTCCTAGGAAATACT 
GCAGTTCCTGTTCCTAACAATGCAGCGAGTGCATCCTTCATAATAGCAAATACTTTCCAAAATTAATTTG7AAATTTTGCGTACTTCCTTCGGTAG 
GCTGCCGCTATGAAAAGGGTGAATGGTCTCAATGCGTCGGAGGACAAATAACCAGAGAGGATAAGTTGGAACCAGAGGCAACTGGCGGTAGTGAT 
CAAAACTGCAATCCTGTGCGTACCGTCAGCAAGAAGTGTAAAGCAAATGGCAATTCCAGTGGAGGAAAACAACACGGCCAGAGTCGTCGAACAAA 
AGAGCAAAAGCAAAAGGATAAGGGTAAAAGTGGTGAATTTGACCCCTTTGTGAAAAGTTTGATCCAAATTCTCTGTTTACAGGAGCCAGTCACAT 
TTCGTCTTGATAATGAAAATTATTTATTTTATACGTATACGGAAGCAACATTTTTAATATTAAATAAGCGATTTTTAAATGCTAGCCCGTACGGA 

C T ACT T G AAC AAC AAT AAAG AAT T GGC C T T C TT AGG GC T AAAT G AAAG C AC T C T T T T AGAAAT T C T T GAAAC AAGGC T T T C AC AAAC C C C AAT AC 
CGCGAAAACCTGCACAAAATGAGACCTGTTATTGAGTAATCAAAGAGCATTGATAGAACTGTTAGTTGGCTCTGAGTTTTTATTTAATATTAGAC 
GCGACAGTAAAATAACCGTCTGCGACAAAATTTAATTAACATTACACAAAAGAAAACGTTACCTCTAACTCGGCTTGGTAATATAAAGATGTTAA 
ATTTAATAGTGTTTCTTTACTTAACTGCAAGCCCCCTTTAAATTGATCGATCTTTCGAGGCAAATCTTCAAGTTACCAATGTCTTTAATAGCCAT 
TGAAAGGGTGTTATGTGGCTGGCCCAAGATTTGCCGCAAACTTATTGTAACGTCGCGGTCGCTGACCTCCGGCTTGCGGCGGGCACTGGTCAAGC 
AGCCCCGTAAAGGCGGAGATGTTGGAAAACCCGGAATGGAGCTTGGGCGCTGTTCGTGCTTTGGACTCCGTGTGAACCTCAGCAATGCATCCGTC 
GTATACGTTGGCCACCGACGCTACTCGACGTACGAGAAGACGTCTACGCAAATTTTAACGAAGCTCTTTCCACAAACCTCTGAGGAGTCCAACGA 
CGAGGAAAGTCGTGAGCGGCGGAAACTTGAGGAGGAGGAGGAGCAGAAGGAATTGGAGCGGGCATTTAGGCGGATGAAACTGGGCTTTGGGCTTT 
TCGGCATAGGTAGCATGTTGTTCTCGTTTTGGGCTATATATTTCTATGGAAGGCCGTCCTTGGATGAGCACGGAAATGAAGTAATCGACGAATTT 

AGT T GT C T G CC AC AG AT G C AG C AGCT T AT GT GGC GG AC AT GG AAGT C GGT G AAC C GC T T T C AAC G GT T T T T C AAAG AGC C GT C GAG AAAAAAGCT 
GCTTCCTGATCCCCTTCAGCCCCCCTACGTGCAGCCGCCGTATACTTTGGTACTTGAGATTAAGGATGTGCTGGTACATCCCGATTGGACCTACG 
AAACAGGATGGCGCTTCAAGAAGCGCCCTGGCGTTGACGTGTTTTTGAAAGAATGTGCCAAGTATTTTGAGATTGTTGTTTACACGGCGGAGCAA 
GGAGTTACTGTGTTCCCGCTGGTCGACGCGCTTGATCCGAACGGCTGTATTATGTATCGCCTGGTTCGCGACTCTACGCACTTTGATGGAGGGCA 

C C AC GT G AAG AACCT T G AC AAC C T C AAC C G AG AC CT AAAGC G AGT GGT T GT CGT GGAC T GGG AT AGAAAT T C C AC C AAGT T T C AC C CT T C G AAT T 
CATTCTCGATCCCGCGGTGGTCTGGAAACGACAATGACACTACCCTCTTTGAACTCACGTCGTTTCTTAGTGTTCTAGGAACGAGTGAGATCGAC 
GATGTCCGAGAAGTGCTGCAATACTACAATCAGTTTAGTGACTCGTTATCCCAGTTCAGAGAAAACCAGCGAAAGCTGGGTGAGCTAATGCACGC 
CGAGGAGGTAGAAAAAACTAGCAAGTCTAGGCCCGTTGTCAAGAACTGGACCCGTGGCTTCATAAATCATTAAAAAGATTGTGCGTTTGATGTGT 
CCCTTAAATACTGTTAATTCACCCAAAAAATTACATTAAAACTTTCCTAGCATTCTTCAATGTCTACCTACTAACAGCAGTTACTTTTGCAGTTT 
TTTGCGTATTCTGATTGAAAAAGTCTGAAAACATATGATTTCTGATTTGGCATTTAAAGCAGACTGCCCTGCGGAACGTCAATGCTTCCTTGAAG 
GTAACAGTGCAGCCACTAGGCGGATACATAGCCGTAATGTTGTGAAAAATGTTGTGTTACGGTTTTATATGAATGCAGGCTTCCAGAAATCCTCA 
GTCCAGCACGAGCACGCAAGCTCAGCAACATGTTGCTGCCATGTGATGGTGTGCGTGAGCACGTGCCAAAATAACTACGACACATGTGTACTCTT 
GAGGGAATGTTTCCACAATCAACAACTACGCACTTCACACCACTAGGTGAGTCATGCGCAGTATCGTCCCTGCGAACATTCATCCGAATGTTGGC 
ATAAGTAAGTAAAGGGATCAGGTGTACTGTAAGCCTTCCGTCTAGTTACTTTCGATGCAGATTTTGATTCTGGCTGAGGGTTCGTGGCTGGCTGG 
GCCTCTTCTCCTTATGTTGAAAACACACAACAATTCGGTGGTGGAAGTATACTGCGGACGTCAAACTGGCGAGCCAGGGCGCAGGCGTTGGTTCA 
TTTGGCACTTTGTCTGGTACTCTCCCACATCGGCATCCTGCAATCTCTTTGGCTCAGCATCCTTGTAAGCTGTTGTCGGAGCAATCTGGGCATGG 
GGCAACTGAAAGGAAAGCAACTTTCCCATTCAGGAGCGCACAAGCCGCAGGAATCGCGAGCGCAGACAATTCAGTTCGGCTTTAACGTAATGGAC 
GGAAGGTTTGAATATCGCTTCTCGCCCCAGTTGTACAGATAAAGTCCCACTGCACGCAATAGGGCACTTGGAAGATACAATAGAACGCCGAGCAG 

TGTGTGTATATAGGTTTGAATGCTTACG 
{SEQ ID NO: 763) 

Exon: 1001.. 2353 
Start ATG: 1124 

Transcript No. : CT21286 m ^ m ^* 
AGAAAACGTTACCTCTAACTCGGCTTGGTAATATAAAGATGTTAAATTTAATAGTGTTTCTTTACTTAACTGCAAGCCCCCTTTAAATTGATCGA 

TCTTTCGAGGCAAATCTTCAAGTTACCAATGTCTTTAATAGCCATTGAAAGGGTGTTATGTGGCTGGCCCAAGATTTGCCGCAAACTTATTGTAA 

CGTCGCGGTCGCTGACCTCCGGCTTGCGGCGGGCACTGGTCAAGCAGCCCCGTAAAGGCGGAGATGTTGGAAAACCCGGAATGGAGCTTGGGCGC 

TGTTCGTGCTTTGGACTCCGTGTGAACCTCAGCAATGCATCCGTCGTATACGTTGGCCACCGACGCTACTCGACGTACGAGAAGACGTCTACGCA 

AAT T T T AAC G AAGCT C T T T C C AC A AAC CT CT G AGG AGT C C AAC G AC G A GG AAAG T CG T G AGC G GC GGAAAC T T G A GG AG G AGG AGG AG C AG AAGG 
AATTGGAGCGGGCATTTAGGCGGATGAAACTGGGCTTTGGGCTTTTCGGCATAGGTAGCATGTTGTTCTCGTTTTGGGCTATATATTTCTATGGA 
AGGCCGTCCTTGGATGAGCACGGAAATGAAGTAATCGACGAATTTAGTTGTCTGCCACAGATGCAGCAGCTTATGTGGCGGACATGGAAGTCGGT 
GAACCGCTTTCAACGGTTTTTCAAAGAGCCGTCGAGAAAAAAGCTGCTTCCTGATCCCCTTCAGCCCCCCTACGTGCAGCCGCCGTATACTTTGG 
TACTTGAGATTAAGGATGTGCTGGTACATCCCGATTGGACCTACGAAACAGGATGGCGCTTCAAGAAGCGCCCTGGCGTTGACGTGTTTTTGAAA 
GAATGTGCCAAGTATTTTGAGATTGTTGTTTACACGGCGGAGCAAGGAGTTACTGTGTTCCCGCTGGTCGACGCGCTTGATCCGAACGGCTGTAT 
TATGTATCGCCTGGTTCGCGACTCTACGCACTTTGATGGAGGGCACCACGTGAAGAACCTTGACAACCTCAACCGAGACCTAAAGCGAGTGGTTG 
TCGTGGACTGGGATAGAAATTCCACCAAGTTTCACCCTTCGAATTCATTCTCGATCCCGCGGTGGTCTGGAAACGACAATGACACTACCCTCTTT 
GAACTCACGTCGTTTCTTAGTGTTCTAGGAACGAGTGAGATCGACGATGTCCGAGAAGTGCTGCAATACTACAATCAGTTTAGTGACTCGTTATC 
CCAGTTCAGAGAAAACCAGCGAAAGCTGGGTGAGCTAATGCACGCCGAGGAGGTAGAAAAAACTAGCAAGTCTAGGCCCGTTGTCAAGAACTGGA 

CCCGTGGCTTCATAAATCATTAA 
{SEQ ID NO: 764) 

Start ATG: 124 
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MSLIAIERVLCGWPKICRKLIVTSRSLTSGLRRALVKQPRKGGDVGKPGMELGRCSCFGLRVNLSNASVVYVGHRRYSTYEKTSTQILTKLFPQT 
SEESNDEESRERRKLEEEEEQKELERAFRRMKLGFGLFGIGSMLFSFWAIYFYGRPSLDEHGNEVIDEFSCLPQMQQLMWRTWKSVNRFQRFFKE 
PSRKKLLPDPLQPPYVQPPYTLVLEIKDVLVHPDWTYETGWRFKKRPGVDVFLKECAKYFEIVVYTAEQGVTVFPLVDALDPNGCIMYRLVRDST 
HFDGGHHVKNLDNLNRDLKRVVVVDWDRNSTKFHPSNSFSIPRWSGNDNDTTLFELTSFLSVLGTSEIDDVREVLQYYNQFSDSLSQFRENQRKL 

GELMHAEEVEKTSKSRPVVKNWTRGFINH* 
(SEQ ID NO: 765) 

Classification: hypothetical 



Celera Sequence No. : 142000013384615 

CATTTCGTACAATGCTTCTCCCTTCTCTTTTTGTCTTTTTTGCCTTTCTTGTTTTCTTATTTTCCTGCCGGCACACACACGCACACACATGCAGT 
TGCGCTCTCTCGCGCTCGCACACCAACACACTCACCGTTTGGGAGTTTTTGTAGTCAATCAACTGATTGGAAGCCGCATCGCGTCCAGCCATTTT 

G CT T AAAAT T T AGG AT AT T T GC T C AAC GG GAT AAC C AG AAT T T AG AAAC ACT C G AACT GG AAAGT T G T T GAT T T C T AGCC GC CTTCGGTTTTTTG 
CTGTGACTGCGTCGAGAATTCTGTCAACTAAAGCAACTGCAGCGCAGTTCCATTTGTTTTTATAGCCCAGCACCACGTACTTGTTGTTGCTCAGG 
GTTGTCGAGTGGGCCGAAGATATCGATATTTCGATTATAGTTGGAAATATATCGCTTAAATATTACATAATATCCATTATTATTCATATTTGTAT 
TTAAATAGTATGAAATTCATTTTCAAAGTTTTTTTTTTGCACTGTCCATTTAAAAACGTGTCCCAACACCATTAAGATTTATTATTGATAATTTA 
TAGTACCTTCGCTCCATTTGCTGCCCAGAAAGGTGAATAAATTTAAACTGTTTGTTAAGTTATTTATTTTTTTAATTTCATGAAAATGTCGGCAT 
CATTAAGTTATAAACTAAGTCGGCTATCTATTGGTGCTCCTGCCAAATAAATATCTATGGCCTCTTTTTTTCTTAAGTTGTGAACCTCTTCCAGT 
GGTGCCTTTAAAATAAATGTACTTCATTCATATAATATATGTAAATATCATTAAACATGTTAAAATTATAAACTTCAAGGAAATCTCCAAATGGC 
GGCTATCTTTCTGTGGCTCCTGCCAAGCTAACATCTCTGTCTTTCTTTATTTTAGCGTACAAAAATATCTAATTAAATGAAACTTAGAAACTAGC 
ATCAACTAGCATATGTATCTTGTTCTCTAGTCACAACTTAAAAAAATCACCTAACCACCTGTGGTTCCCATCAAGTAAATATCCATGGACTGCTT 
GTCCAGGACGGTCGGGCGCCTCTTTGGAATCGGCTGAATGTGCGGCAGTCGCGTGCCATTCACGGCGTTCAAACTGAGATTCGAGTTGGGGTCGT 
AGTCGTCGCAATCCGGAGTAGTCTGGGCCAAGCTCAGCCTCTGCGATGGTTTCACCATTAGGTTTGCCCTCGCAAGCTGGCGTTTTTGGATCGCT 
GCAGCGCTGGTAGTGGGCACATTGTCTGAATCTTGACTTAAGCTTCTGTGGGAGCACAGCTCTTCATCGGGAGTCACAATGCCCCCCTCACTGTA 

T GAG GC AC CG AATT C GG AAC T T T GGC C AG AG AC AT GC AAAG AGGG CGCACTTTG GT GC G AC G AT G AC T G ACT GC C G CT G AC GT C C GT AAT GC T C G 
CGGCAGTCTCCCTGGCCATCTTGTGCCGTTTGGCATTGTCGTTCTTCTTCCGGTAGAGTCGCACCAGGTCCGTGTAGCACTTGCAGCAGATGAGG 
TTGCCCGCCCGAATTTCGCTGTTCAGCAAACAGGCGAAGCGCACCAAATATTTGTTGTTCAGTTGGCGATACGGCTTCCGCAGCGTGTGGTTGCC 
GACTGCGCAGAATCTCCGCTCAACGCGCGAGCCCTCGTCAATGGAGGCCATCTGGAATAGATCATGCACCTAGTTCTCCACAACTCGGGCTTACA 
AAACAATGCTTACACTGGCATTTTAACTTCCAAACGCAACAAAACGAAACATGAAGCCAATTGTCCCGCCACTTTGGAAACATATAGTGTGACCA 
CATATGGATTGCTGAAATATACCAAAATTAGGCGTAAAATAAAATGCATTTAACTAATAAACAAATTCAATTTTTTTAAATGTTATAAAATCTAA 

AAAC AT ACT G AC AT GGT C AC AGT T T CTC AC AAT T T AAAT GTGGCT T AT T GT AAT GC C AGT T AAGT AAT T AAT AT G AAG AAAC T AT T C GT T T AAAA 
TTGACAATAATCTAATTATTGGCGTGTTCGATGTTCATGAAATACTTATGTTATATCAGTTAGTTTGAAAATTGACTGTATTTCTTCTGTTGGTT 
GTATCAATTAAAATGAGTTTAAAAACACATTTTTTAAAACCCTAAAACCACATTCAAATCAAACTGTAATTATACCAACTAATACTGAAAATACC 
GAAAACTGCAAAATCAGCTGCTCGCCGAACAGCACTCAGCGTGCTGTTGGACGCGAAATGTTAGCCCATTAAACCCAGGTTATTTTTTGTCATTC 
ATTGTTCATACGTTCGTCTCGTCTGATTTTCTTTTTATAAAAACACAATTAAAATTTCAAACGAATACACATAATTTTTATAAATATAATGTAGT 
GCAACACGTATATGTATAATATAGCTTAATACCAAGTGCAATCCCTCGAAAGATAACTTCAAATGGCAATCTAATTTACGTAAAGAGAAGAGACA 
AAAAAAGAAACGGCGCGTGTGAAATTTTTTCAAGCAAAGCAAAGGAGAGAAGTTGGAGCAACA 
CACACAAAGTAGGCCAACAACAACAGCAAGAAAAAAGCAAGCAAGCG 
AACCTTGTAATTGTGTC AAT AAAT GTTTTAAATAAT ATT TCCTTTGG 
(SEQ ID NO: 766) 

Exon: 1707.. 1001 

Start ATG: 1666 (Reverse strand: CAT) 
Transcript No. : CT21292 

AAGCCCGAGTTGTGGAGAACTAGGTGCATGATCTATTCCAGATGGCCTCCATTGACGAGGGCTCGCGCGTTGAGCGGAGATTCTGCGCAGTCGGC 
AACCACACGCTGCGGAAGCCGTATCGCCAACTGAACAACAAATATTTGGTGCGCTTCGCCTGTTTGCTGAACAGCGAAATTCGGGCGGGCAACCT 
CATCTGCTGCAAGTGCTACACGGACCTGGTGCGACTCTACCGGAAGAAGAACGACAATGCCAAACGGCACAAGATGGCCAGGGAGACTGCCGCGA 
GCATTACGGACGTCAGCGGCAGTCAGTCATCGTCGCACCAAAGTGCGCCCTCTTTGCATGTCTCTGGCCAAAGTTCCGAATTCGGTGCCTCATAC 
AGTGAGGGGGGCATTGTGACTCCCGATGAAGAGCTGTGCTCCCACAGAAGCTTAAGTCAAGATTCAGACAATGTGCCCACTACCAGCGCTGCAGC 
GATCCAAAAACGCCAGCTTGCGAGGGCAAACCTAATGGTGAAACCATCGCAGAGGCTGAGCTTGGCCCAGACTACTCCGGATTGCGACGACTACG 
ACCCCAACTCGAATCTCAGTTTGAACGCCGTGAATGGCACGCGACTGCCGCACATTCAGCCGATTCCAAAGAGGCGCCCGACCGTCCTGGACAAG 
CAGTCCATGGATATTTACTTGATGGGAACCACAGGTGGTTAG 
(SEQ ID NO: 767) 

Start ATG: 42 (Reverse strand: CAT) 
I^SIDEGSRVERRFCAVGNHTLRKPYRQLNNKYLVRFACLLNS 

SAPSLHVSGQSSEFGASYSEGGIVTPDEELCSHRSLSQDSDNVPTTSAAAIQKRQLARANLMVKPSQRLSLAQTTPDCDDYDPNSNLSLNAVNGT 

RLPHIQPIPKRRPTVLDKQSMDIYLMGTTGG* 
(SEQ ID NO: 7 68) 



Celera Sequence No. : 14200001338454 6 

ACAACATTTGACAACAGCGTGCTTGGCCATCGGATTCCGGAAGTGCCGGTGCCCTTATTCAGCGTTTATCCGGCCACGAAGCATGCAATTACGGC 
CCTCTGCCAAACGGTGCGCCAGGAAATACATTTTCTCAAATTAAATATTAAATTAACGGTAAGTTAAATGCAGTGCATACATGGCGCGCCTCATC 
CTCCATCTTAATCCTCGGGAAATGCGGCAAACATGACGCGTCTGCTCATTGGTTACCCTCTCCATTTTTATCCCTCCATTGCAGAGCATTTGCCC 
GGGCATGGTAGACACGGATTTCCTTAGTGTTTACTCGCAGGCGGTAGCCGAGCTGCCCAAACTGCAGGCAAGGGATGTGGCCAAGGCCGTGTTGT 
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ATGCGCTGAACACTCCCGATGGCGTCCAGGTGGAGGACATCATCCTGCAGCAGATGCGGAAGGTCGATTGAATTTTAGTTTGAATTTCAAGAGAA 
TTACATTTTACCCAAAACGATATTATTTGTACATTTTGTATATGTTCTTCTATGTAATAAGAGAACAAATGTACATATTTTTAAATATAAAAAAT 
TACATTATTGCATTTCAAGCAACTGTTTCGACACTTTATATTCTTTAAGATGTTATCTGTAGATAACATCTGTATTAACATCTTATTACAAATAA 
CATAAAAATGCAATAAAACGATAATTAAACATTTATAATATTTTGCATATTCTCTAATCGATATAACATTAAATGCGTGTAAATAGTTTGGCAAC 
ATTATGAATGCATTTATACATTCATTCATTTGAATTGGAATTGCCCGCGCATGTAGTGGAGTTGCCACCTTGCGGTAGGGCTGCCAGATTGAGTC 
AAAAGACCAGGGCTGTACAGCTATCGCTAGCGAGTACAACAACCACTCTTGATTTGGCTCGCTTCGCTCTAGCAGAGTTATCCACCAAAACATCG 
ATTACTCGTGCTACCCCCAAAATCGATGGTACGAAACAAACATCGATATTTCACAGCTTTCGCTTATTGGTCACACTGGGGCTTATGATTAAAAA 
AAACTGAACTGACTCCAATAAAAACGTGAGGTAGAAAAAGCGAATTAGGTTGACAATTAATACACACAGCAGACGCACGGTAACCAGCCGCCAAT 
ACCGATACGAACGCGCGGCGGCAACAGCAAGTGCGCAAGTAAAAGTCCATATCAAACGCAAGCTACCCGCAGCCGCGGGCAGTTAAATTCCGAAA 
TGCGTCCACGTAGTGCAAAATCCGAATCTCAATTCGAATTCGAATCGGAAGTCCGGATGTCTGCGCCCTTTTTATGCTTTTTTCGAGGGTGCGGC 
TTAGATCGGCGTGCACCTGCCGTCCAAAGTGTGATTACGAAATGTGCTCCAACGTGAAATGATATTATAAAGTATAATATAATGCATGTTACCAC 
GTAGTTGCACACAGACAGGCAGCACACGCACACACACGCGCACACCCAGTGCCACTCACACACACACACAGACAGTGAGCGAGCGAGAGGGCAGG 
AGGCAGAGAAAGAATGAAATAAAGAAAGTGCTGCCCAGTTGACAAAGAACAGGTTTTAACCTAATTAACTAAATAATGGGCCCAAGTGAATGTTT 
CGCCGCCAGGAGGAATTTGCTTCCTTATATCCGCAAGGTCCCCGCCGCTCGAGTGTGCGTGTTGTTGTATGTGTGTCAAGGTTCCGGAAGTTGAG 
CCGCCGTAAAATTGATTTCCTATACACCAAGCCGCTGTTGCTGCCGCTGCTGTTGTTCTTATTGTGTTTCTGTGGCTGGACATTGAACTTGAGTC 
GAAATGCGCACTAACCCATTATCCATGCCATATCTCTTTTCGATCCGGCAGCCGTCTAGCCCAGCAGCTCTCTTCCCCCCACAATAAACACACAC 
AAGCTGCTCTTGGAGCCAATCACCCGCAACCGATAGCCACACGAAACCATCAGGATGTTCATCTGGGACTGGTTCACCGGAGTGCTGGGATACCT 
GGGTAAGCAACGTCGCCATTGCCGACCTTGTGCCTTGGGGCGGCGTTTCTTGAACGAGTGCCAACAAGGCAGCCGCACAAAAGAAACCCAAATTC 
GACGGCTGTACAGCGGAGGTCGTGGTGATAGCTCTAAACGTGCGATTATCTCGAAATCTAGAGGTTAAACTGTGTCTATAAGGTGTCGAAAGCTT 
ATAACTTTCCATTAGGTCTCTCTATCACAACAAATGCGCGTTGAATATGCATTACTCTAGGGTGCATATACAATATACACTTCTGTAAAAGGTAT 
TTTGATTGCATAGCCATGTACTCTTCAGCCACAATTATTTAAGCAGACATTTTTGATAGGGTTTTCCTCGCTTCCAGATGGCTGCTACACTTTTA 
TTTGCTTTGTAATGCTAAAAGTTAAAATTATTCCATACGAAAAGTACTGTTCTACAGTGCAGGCCAAACTGATAGCAGTTGTTATAAGCGCATTT 
TTGTTATAGCTTGATCTTTAATCAGGTTAATGATAGCCGAACATTTTGGGTAGCAAAGGTAATAATTAGCCCGAAGTCCAAGAGCTATTGTCTCG 
ACCTCAAATGTATATTCGGCTGTCTGTGGCTGGGCAAACACAAGTGAGCCGCCACTAGTGTTAGTTTAGCTGCGTATATTAATTGATGACGCGTA 

AGT C AAAAG GC T AT T C AAC AC ACT T TCTCGTTTT C AG GT C T GT GGAAAAAGTCT GG C AAAT TAT TGTTCCTGGGCCTG GAT AAT GC T GGC AAAAC 
CACACTCTTGCATATGCTCAAAGATGATAAGCTGGCGCAGCATGTGCCCACACTGCATCCAAGTGAGTTTCCTTCATGGACTAAATAGTGGTTAT 
GTGTGTGACTTTAATTAACTAATATGATTGACTTGCAGCATCCGAGGAGCTGTCCATCGGCAACATGCGCTTCACTACATTCGACTTGGGTGGCC 
ACACTCAGGGTAAGTGCCTAATCTTGACCAGAGCTCAGGGGAATGTGAATGTGCATCCAAATCGTTGCTATTTTCCGTACGTGCCCACGACTAAA 
TATAGACCAGTATGCAAATATCGCAGGGCTAAGCAAGCGCAGCCGCCAGATCCACGACAATCAGCATGAAATATAGCGCATCAGGTCACCCATAC 
AGCCAGTGCGTCTTACACTCGTAGCATTTGTTGCGATCAATCAGCGCCACACACTCGAAGCAATGCCACCACAGTGGTCACAAGCAGCTCATCCA 

GAGATC T C AAT T AT CAT C TT AT CT C AAC T GT G AT T AT G AAGAC CT C AG AAT AC TAG AAT AT T T AAC C CAT T GC AT AC AAT C AAT AT GT T C AT T TA 
TGTTTAGAAAGCAATTAGTGTCTTGTTGATTTTCGCAAATACTATCTTCCTTACCACTGTGCACATATAGGCGACGTATTTATTATTAATATTAT 
GAAGGCAAACATTCTCTATGTAGATAACAAACAGAAACGAAAAACTCGCATACATGCAGTTGATATATGTATATAAATATGAACATTATGAAAAG 
TGATCAAACATCGTGAAATCCAGTTGTTTTGATAACGAAGCTCTAGCACTCATTTTAAGGCAGCTCGTCATCGTGACGATCGTTTTCGCTTCGGA 
ACTACTAGGTACACTAAAGTAGAAAGGTTCTTAGCGGCGGAAAGATATCAGATTTGTGCCTGATATCAGTGACTTGCGCTTTGTTATTCTAGCGA 

T GT T AT AT AAAT T AC AG AT T GT T GGT C AAC AGCAGCGAC T AC T AAG GTGCCCTTG C AAAT CT T G AT T C AAAAAT CGG AAGAC TAACC CAT C T AAA 
GACTAACTGCTGCTTTTGTATTTATGAGACATATCTACTTATTTCTGCAGCACGACGCGTCTGGAAGGACTACTTCCCTGCTGTGGACGCCATCG 
TTTTCTTAATAGACGCCTGGGACCGTGGCCGCTTCCAGGAGAGCAAAAACGAGCTGGATTCGCTGCTCACGGATGAGGCGCTGTCCAACTGCCCC 
GTGCTCATATTGGGCAACAAAATCGATAAGCCCGGCGCGGCTAGCGAGGATGAGCTGAGAAACGTGTTCGGACTGTATCAGCTAACAACCGGCAA 
GGTGAGCACCAAAACCACCTCCCTCCCTCCCTCCCTCCTTACTCCTGCTACATAATTGATATCCATTTGACTTGCAGGGCAAAGTTGCACGCGCC 
GATTTGCCCGGCCGTCCTCTGGAATTGTTCATGTGCTCCGTGCTGAAGCGACAGGGCTACGGCGAGGGTTTCCGTTGGCTGGCGCAGTATATCGA 
TTAAGTCAGCATTAGCAACCACCACCAGCACCATATTTTCAAGAACACCACTCAACACTCAAAACCGAAAACTTGGCTACAAAATTTCCAAAAAT 
GATTAGAGACCGCAGAAAGAACAGAACCCAAGGGATGTGAACCCAGCCCAATTCAAACCGTCAGACCTTAAGCAAAACGAAGCTGCGTGCGCAAT 
TTCTAATTTATACAAAACAATTACAAATATTTCATAATAATAAAAAAAAAAAAACAGAAAACCATACAATGT 

GCGGGAAAAACAGAACACAACATCAAAGTGAAGACAACTTTTCTATCTGGAACATCGGGATACATTGTAAGGAGCTGAAGGATGCGCGGAGTGCG 
CAAGGGAAACTTTTAATAATAATTATGAGTAACAAATTATAGCAAATTAAAGGTGATTTATTTAAGAGCATGTGCTTTCAACGAACGCTGTTTCG 
CCAATAAAATGTAGCCACCTACCTTTATTTCAAGTTTATGAATTGTAAATGTCGCTATTGATTTCCTAAATAGTTTACTTTGGCCGCAAAACATT 
ATGTACATCCCTATAAGGATTCGACCTCCGATAAATGGTACACCTTAGTTTAGCTAAAAATTTTAAGTTGCATGTCATCGATGTACTTTGTTTCT 
TTTATACAATACGAAAGGGAAAGCAATTTATATAAAAAAGCATTTTTGTTTGCAGACAGAATTCTAAATAAAACGGAGAAAAGTAATTTTGTAAA 
ATACATAAATTGGTGACTATTGTTGTTTTGCATTGGATTTATGTGCCGAGCTTGAATGAAACCCAGCGATGTACAAACATATTGGATTCATTTAT 
TTATTTGCATTTAGGGTTTACGCTAATGAGTCACGTCCGGAGCATCTATTGAAATCAGGGAGCAGGCATTCGATGAGGGCGAGCACGGGATCCAG 
GAGCTGCAAACATGCTGGTCCAAGTAGAAAAATACACAAAACTCTGCAGAGTATGATCGCGACCTCAGTGCTAACTGAGCCGGCAGCGGCGGGGC 
AGTCCTCTCAGCAGCATCAGTTGGCGTTGAAAGAGGCGCATGAGCAAGCCCTTGCGCTTGGCCCGCTCCCTGGATCAGGAACGCGATGATCCGCC 
CATGGAGCCGGACCCGGAGCCGCCGCCAGCCATGCGCCGCTTCTTTTTCTTCTTGGCCGTCAGACTCTCCTGGCCGCTGTCCGTGTTGGAATCGC 

CGCCAT 

(SEQ ID NO: 769) 

Exon: 1001.. 1075 
Exon: 1857.. 1997 
Exon: 2698. .2817 
Exon: 2889.-2954 
Exon: 3851.. 4086 
Exon: 4163.. 4516 
Start ATG: 1955 

Transcript No. : CT21873 

TCACAGCTTTCGCTTATTGGTCACACTGGGGCTTATGATTAAAAAAAACTGAACTGACTCCAATAAAAACGTGAGCCGTCTAGCCCAGCAGCTCT 
CTTCCCCCCACAATAAACACACACAAGCTGCTCTTGGAGCCAATCACCCGCAACCGATAGCCACACGAAACCATCAGGATGTTCATCTGGGACTG 
GTTCACCGGAGTGCTGGGATACCTGGGTCTGTGGAAAAAGTCTGGCAAATTATTGTTCCTGGGCCTGGATAATGCTGGCAAAACCACACTCTTGC 
ATATGCTCAAAGATGATAAGCTGGCGCAGCATGTGCCCACACTGCATCCAACATCCGAGGAGCTGTCCATCGGCAACATGCGCTTCACTACATTC 
GACTTGGGTGGCCACACTCAGGCACGACGCGTCTGGAAGGACTACTTCCCTGCTGTGGACGCCATCGTTTTCTTAATAGACGCCTGGGACCGTGG 
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CCGCTTCCAGGAGAGCAAAAACGAGCTGGATTCGCTGCTCACGGATGAGGCGCTGTCCAACTGCCCCGTGCTCATATTGGGCAACAAAATCGATA 
AGCCCGGCGCGGCTAGCGAGGATGAGCTGAGAAACGTGTTCGGACTGTATCAGCTAACAACCGGCAAGGGCAAAGTTGCACGCGCCGATTTGCCC 
GGCCGTCCTCTGGAATTGTTCATGTGCTCCGTGCTGAAGCGACAGGGCTACGGCGAGGGTTTCCGTTGGCTGGCGCAGTATATCGATTAAGTCAG 
CATTAGCAACCACCACCAGCACCATATTTTCAAGAACACCACTCAACACTCAAAACCGAAAACTTGGCTACAAAATTTCCAAAAATGATTAGAGA 
CC GC AG AAAG AAC AGAAC C C A AGG G AT GT G AAC C C AGCC C AATT C AAAC C GT C AG ACC T T AAGC AAAAC G AAGC T GC GT G C GC AAT T T C T AAT T T 
AT AC AAAACAATTACAAAT AT T TC AT AATAAT AAAAAAAAAA 



<SEQ ID NO: 770) 
Start ATG: 174 

MFIWDWFTGVLGYLGLWKKSGKLLFLGLDNAGKTTLLHMLKDDKLAQHVPTLHPTSEELSIGNMRFTTFDLGGHTQARRVWKDYFPAVDAIVFLI 
DAWDRGRFQESKNELDSLLTDEALSNCPVLILGNKIDKPGAASEDELRNVFGLYQLTTGKGKVARADLPGRPLELFMCSVLKRQGYGEGFRWLAQ 

YID* 

(SEQ ID NO: 771) 

Name: GTP-BINDING RELATED PROTEIN SARA 
Classification: ligandj3inding_or_carrier 

Celera Sequence No. : 142000013384669 

AATGGCCAAAACGCGGCGGTTGACGTCAGCCTTGTTCATTTCGCTGCTGCTGGTCGCTTCTATTTTCCTATATTACCCTTTATTTCCGCCAGGTG 

TAACAGCTGTTGTGTGTGGCAAATGGGAAGAGAAGAGGAGAGAAGACGAGTGGTTACGGGAGGGTTGATAAAAAAGGAAGTAACGCGACAGAGAA 

GTAGAGAACTGACAACGTGCACAAATCTCAATAAGGCAATGTTCGAGGTTCGGTTCACGCTTCTCGTTTTTTTTTTTCCTGCTTTTGTTGTTTTC 

TCTGCCCTGTTTTTTGGCATTCACTTCTATGGTCTGTTATTTTGTAGTACAAAAAGCAACACTGAGCTGCTGGAACATCTAGAAATTTCAGATAA 

TAATCTACTTGCCTCGGGCTCCCACTTGTTCCCAATTAACACGGCTGCTGTGTCGATTTCTGCACAAAATTAGCACAAAAAATCTTCGCTAAACG 

GCAATCCCAAAAACTCGATTCCAGTGTGACCACAGCGATAAAGTCACAAAATACCATCGGGCTCTGGCAAAGAATGGGGAAAAAAAAATGAAATT 

AGTGATGGAATTATATTTTATACGATACAAAAAACAATAGGTGTATGACATTTGTATACTAGCTTAATTATTTGTAGGTTTGTAGGGAACAGCCT 

ACCTTGATAATACATACATTTTCTTATTACATTTTTTTTTATCAAATCCAAAGCAAAACCCATACTTTTAGAAAAAAAATTTTTTTTAACATTTT 

TTGGAAACATAAAATATGTATCGTACACATTAAAGACCAGCTTCACTTTATTATTTATATAATAAATAAAACTTCTGCTACAAGCCAGAAAAAGC 

TAAAATGTAGCCAAATATAGCATGGACCCAAACCAGCGGGAGCACTAACAACTCGTAACGTAAGCGCAGCACGTAACATTTCCGCTATCTCACTC 

GCACGTTCAGCTACATAGTGCGGACTCACTTGCACCTAGTAGTATTCGATCCGCTTGGAAATTTTGCTCGCCGTTTGCTTTCTCAAAGGCTCAAA 

AAAATGCAGCTACCACCGACTTACCTGGCTTCGCAAGTGAACGCGAACTAAGCCAATAGCATCTCGATTCGCCGGCGTTGCGCAGCGTTCGCAAA 

TGCAGCTGAAACGAGCAAAAAACACGCGCACAACGAAAAATCCGCAAGTGGAAAAAACTGCTACGGAAAAATCAACTCTGCCAGCAAACAGTGTT 

GCAAATCGATGGAGTTAAACAGCTGAGTGGTTAGTGTTGACTGACAGCCGAGTACGGCAACTTACAACACTGTAGGCGTTTTGGCGCGATTTTAG 

GTTTTGGAGGGAAACTGCATCTAGTGAAAAATCGTGGCCAGTTGTAATTCGAGCAGTGCGCTGGCACGACGTGGTGATAAAAAAGGAAAACTAAT 

AGAATTATACTGACTAGTGATAATTAACCGGACGTGCCCATCTGTTAATTCAGCCAAGTGGTATCTTTACAACCGTTTAGCCAACAACCAAATTG 

CTATCAGAAATATATATTATACACACTTACTATGGCCCAGCCATCGGTAGCTGCATTTTTCACAAACCGCAAACGCGCCGCCTTGGATGATGCTA 

TCAGTATCAAGAACAGGGTAAGACTTCCGTCCCAGCAACGTGACTCCAGATACAAAAGTATCTGCAAGGACGTTCTGCCAATACAGGCATTTTAA 

AAATGGATTTGTTTTGCAGCGTTTGGTGGAACCCGCTGAAACCGTCTCTCCTGCCTCCGCCCCTTCCCAGTTGCCAGCCGGCGACCAGGATGCGG 

ATCTAGACACCCTGAAGGCGGCGGCCACGGGCATGCGTACCCGATCCGGACGCACTGCCCGACTAATTGTCACCGCCGCTCAAGAGAGCAAAAAG 

AAGACACCGGCTGCCGCCAAGATGGAGCCACACATCAAGCAGCCCAAGCTGGTGCAATTCATTAAAAAGGGCACTCTGTCGCCCAGGAAACAGGC 

TCAGTCCAGTAAGCTGGACGAGGAGGAGCTGCAGCAGTCGTCGGCCATAAGCGAGCACACGCCCAAGGTTAACTTCACCATCACAAGCCAGCAGA 

ATGCGGACAATGTGCAGCGTGGCCTGCGCACACCCACCAAGCAGATCCTCAAGGATGCCTCGCCGATCAAGGCGGATCTCCGCCGTCAGCTCACT 

TTCGACGAGGTAAAAACGAAGGTATCGCGGAGTGCCAAGCTGCAGGAACTCAAGGCAGTGCTGGCCCTTAAGGCGGCGCTCGAGCAGAAGCGCAA 

GGAGCAGGAGGAGCGCAACAGGAAACTCCGCGACGCTGGCCCCTCCCCATCGAAGTCCAAGATGAGCGTGCAGCTCAAGGAATTCGACACAATCG 

AACTGGAGGTGCTTATAAGGTGAGCTTTATGCTTATATAAGAAGTTTTATGGGTACTGAACCAAATTCGGCTTCAAATACTTAAAAGAGATTGAA 

GCTATATTGAACATCATGTTGTGTCCTATAGTTTCAAGTCCTCTCGAATCAACAATTTCTTTTTAAATTTCTTTGAGGTTGTTTGATACTCGCTT 

TATCATTCGAAGAGGCATCCTTAGGGTATTAAACCTTGATTGAATGATATAATTTGACATGCCTTTCGCATCCTCTTCAAATCGAGTAATCATGG 

GTGTCGAAGAAAATCAGCCATGTAAGAAGCTCAAGATATGTGCCTGCCGCAACAATTTTGATTGAGCTGGCTGGTGGCCTAGAATCGGCATAAAC 

AAAC CACGGACTGTGGAGTGTGGTCT ATG AAAAAACCGGGT AAC ATCATTTAGCGAAAGGCCAGCAGTTGCTGCCGTGCAGTTCTGCTGC AAT CG 

AGGTCGTTCTCCGGGTCTCCAGCTTATTCAAGCTCTCGACTGCTGAAGTTGCATATTTTCCGCCCAAGAGTCGTCAAGTCACCGATGGCTTGGCT 

TTGGTTCGGCAATATGGCTATTTTTGTACGAGCAATTGCTGCTGATTCGTGACGTCTGTACAACATGCATAAATACCCAACTGGCTGACTGACCC 

TCCCCATGCGGCGACGCTGTGTGTGTGTATCGCTATGCGCAGCATGTTTACCCTTCGCAGACGTTACAATCTAGCCTGACCTTCCCTTTGAAAAT 

GCTTATTTTCCGCCCGCATTCGGTGGTTGCTTTCCCTTCGGCACTCGGAAAACAAGGATCTTAACTGCTGTACTGCATGGGAATGTGCATGTGTA 

TATCTGTGTATATGTATTTTGGGTCTGACGCTCAGGTAGCAACCTTTTTTTGTAACGGTCGTCTTGAAGAAAAGTCCCAAGCAGCAAGCAGGATA 

AATTTTCAAATAGGACATGCCTGTTTGTATTTCGTATTAATTTCATGAAATAAATAAGTGCATTAGCAACCCTAATGTTTAGAAGATTTTTTTTC 

TTCTATTTTTATATAATGAAATTATTGTTTCGCGCAAACTTAAGCACCGAATTTTAAAATGTTGATATAGATAACTTATAATTTTCGTTTTTTTT 

TTTTTCACAATCAGCCCTTTGAAGACCTTCAAGACTCCCACAAAAATACCGCCACCCACCCCGGACAAACATGAGCTTATGTCGCCGCGTCACAC 

TGACGTCTCCAAGCGCCTTCTCTTCAGTCCGGCCAAAAATGGATCTCCTGTCAAATTGGTGGAGGTGCCGGCTTATAAGCGCTACGCCAGTCTGG 

TGGAAAGCAGCCGCGCCGGCCAACTTCCATTGCCTTATAAGTACCGACACCTGTTGGACGTGTTCAAGGGTCTCGATTCCGTGGTGGCCATGTTT 

CACAACCGCAAGGAGACCATCACATTCAAGAAGCTTAAGCCGGCAGTGCAGCGCATGCTCCGGAAGAACTTCACTGAGACCCATTTGGCCCAGAT 

CAAGCACATCTATCCAGACGCATTTATCTTCAGTCAGGTGAAGACGCGCAATTTCGGTTCTGTCTCCAAAGCGGACTATTTCCAGCTGATCATCG 

CTCCCAATGTGGAGCCACTGCCCGAGCAACAGCAATCGGAGAAGCCACAACATTTCACCAAGATCAACGAAGACGATGTGCTCGCGTCGGCCCAG 

TCGACGTCCATGAATCCACACGTTATGACGGCGCGCATGCAGCGCTTTCAGAGCCTTCTGCTCGATCGGGCAATGCGGGCGCACGACCAGTTCCT 

TCGCTCCCAGGATCCGCCGATCATCATCGAAAAGGCGCTGACTCGCTGGCATCCGCAGTTTGATTTGGAGAGTTGTCCGGAAGTGGAGCTGTCCC 

CTTTGCCGCAGCCGCCCAATGTCGAGAAATACTCGTCAGCTAAGGATATTCTGTCTACCGCTCGAAATCTCTTCAATTGCGCCACGCCCATGGAG 

AGAGCAATGGATCGCTATGAGGCCAAGTTGGAGTCGGAAAAACAGCAAGCAGCTGAAAGCAATAAAAAGACGGAGGAACAGCAGGCAGGAGAAGT 

TACCAGGACAAGTACAGCGATTCAAACCAGTCAGGAGGTGCCTGGAATATCTGGATCATCGAAAAACCCAACTGTCCCTGAAACAACAAACCAAA 

CAGAAACTGCCAAGCCCACCGTGAAGGATTGCACAGTGCCAGATGCCAGCTCCAATCTACTGAAGGGACTGCCAAAATCACTGATCGAGAAAATT 

CGAGCCAAACAGGCAGCAAAGGCTTTGGAGGCGATGACTCGACGTCCTTCGCAGGACCAGGAGGCCACGAAGTACTCACGTTTGCCCGAATTGGC 

CAGACACCTGCGCAATGTCTTCGTT ACC GAGCGTAAAGGAGTGCTCACCCTGGAAGTAATC AT CAAGAAGATCC AGAAC AGCTTCCGGGCGAACC 
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TCACACCGCAGGAAATTGAGGCGCATCTGAAACTACTTGCCAAGGAGCTACCCAGCTGGGCTTCCTTTCATGAAGTGCGCAAGACCATGTACCTC 
AAGGTTGCCAAGGACATGGACATGAACAAGATCATCGAGAAGCTGGAGAGCGTGGCTAACGCCAAGAGCAATTAGACCGGAAACACGGAAACGCA 
ACCAGGACGCGGGACTCCGTCGAGCAATGACCCTGTGGCTATATCCAGATATATGTTTAAGTACTTGTCAAACTCGCTGGCCATTACTTGATTAA 
ACAACCGGATTAGGCTAACCATCTTGACATTTCCACATTCAACAAGCACCTTGACAACCAGGATTGGATTCATCACATTTACCCAAGCAATCATT 
ATCATTATATGTGTAATTTTTGGACCAAACGTTTATACATTCATATTTATTAGTTTTAAGACCATTTTCATGTTGTACCACGTTGAATAAATAAA 
TATTTTTCCTGGAAATAATAAACGTACTCATTCAATTGTGTAGACAGGAAATATAAAATTTATAGCCTGGCTCACAAGCATTAAATTCATCTCTT 
ATCTTCCAAGTCTATCTATGTATATATAAGCTATTTAAATCAAAATCATTTTCAGTTCCAAAATGCAGCTCATCGTGAAATTACTGGTGAGTAAG 
CTAATCAGTGAAAAGGTATCTTTCGCTAACATTCCATCTCCCTCAGATTGTCCTTTTGGCATTATTTGAATTGCTGTCTGGCAATTCAAATAATC 
GACAAGAGACCTGGGGCCACCCTCCATGGGGGTAAATTAAATGGCGTGAAAATTAGATACAAAAGATTCAACTGTAGTTGTTCTTGATGTCTTAT 
TAAAAAGTGAAATACTTTTGACTTGTGCATTGGCTTTTATTTATAAAAATTAATGGTGATGCGTCTAAAAAACTCGAACTTAATAGTTTTTTTGC 
GCAATATAATACTTATATACAAAGGCGACAGCGCCCTCTAGCGCTTTTTATTTTACAATAATGCGGGACTATAGCTAATCCCATTGTCCCAGCTG 
CCAAGTACCATATTCAAAAGATCTGGAAATTCGGGACCCGGCTGAGAGTGGAAACTTTTGGATGCCGTGGAGAGGCCCTGATACGGAATGTGTCA 

TCCAAGTCTACTGACGTGGCGTATTTATTC 
(SEQ ID NO: 772) 

Exon: 1001. .1632 
Exon: 1730.. 2394 
Exon: 3530.. 4106 
Exon: 4191.. 5015 
Start ATG: 1552 

CCGCTTGGAAATTTTGCTCGCC^ 

AGCCAATAGCATCTCGATTCGCCGGCGTTGCGCAGCGTTCGCAAATGCAGCTGAAACGAGCAAAAAACACGCGCACAACGAAAAATCCGCAAGTG 
GAAAAAACTGCTACGGAAAAATCAACTCTGCCAGCAAACAGTGTTGCAAATCGATGGAGTTAAACAGCTGAGTGGTTAGTGTTGACTGACAGCCG 
AGTACGGCAACTTACAACACTGTAGGCGTTTTGGCGCGATTTTAGGTTTTGGAGGGAAACTGCATCTAGTGAAAAATCGTGGCCAGTTGTAATTC 
GAGCAGTGCGCTGGCACGACGTGGTGATAAAAAAGGAAAACTAATAGAATTATACTGACTAGTGATAATTAACCGGACGTGCCCATCTGTTAATT 
CAGCCAAGTGGTATCTTTACAACCGTTTAGCCAACAACCAAATTGCTATCAGAAATATATATTATACACACTTACTATGGCCCAGCCATCGGTAG 
CTGCATTTTTCACAAACCGCAAACGCGCCGCCTTGGATGATGCTATCAGTATCAAGAACAGGCGTTTGGTGGAACCCGCTGAAACCGTCTCTCCT 
GCCTCCGCCCCTTCCCAGTTGCCAGCCGGCGACCAGGATGCGGATCTAGACACCCTGAAGGCGGCGGCCACGGGCATGCGTACCCGATCCGGACG 
CACTGCCCGACTAATTGTCACCGCCGCTCAAGAGAGCAAAAAGAAGACACCGGCTGCCGCCAAGATGGAGCCACACATCAAGCAGCCCAAGCTGG 
TGCAATTCATTAAAAAGGGCACTCTGTCGCCCAGGAAACAGGCTCAGTCCAGTAAGCTGGACGAGGAGGAGCTGCAGCAGTCGTCGGCCATAAGC 
GAGCACACGCCCAAGGTTAACTTCACCATCACAAGCCAGCAGAATGCGGACAATGTGCAGCGTGGCCTGCGCACACCCACCAAGCAGATCCTCAA 

GG AT GC C T C GC C GAT C AAGGC GGAT CT C C GC C GT C AG C T C AC T TT C GAC G AGGT AAAAACG AAGGT AT CGC GG AGT G C C AAGCT GC AGG AACT C A 
AGGCAGTGCTGGCCCTTAAGGCGGCGCTCGAGCAGAAGCGCAAGGAGCAGGAGGAGCGCAACAGGAAACTCCGCGACGCTGGCCCCTCCCCATCG 
AAGTCCAAGATGAGCGTGCAGCTCAAGGAATTCGACACAATCGAACTGGAGGTGCTTATAAGCCCTTTGAAGACCTTCAAGACTCCCACAAAAAT 
ACCGCCACCCACCCCGGACAAACATGAGCTTATGTCGCCGCGTCACACTGACGTCTCCAAGCGCCTTCTCTTCAGTCCGGCCAAAAATGGATCTC 
CTGTCAAATTGGTGGAGGTGCCGGCTTATAAGCGCTACGCCAGTCTGGTGGAAAGCAGCCGCGCCGGCCAACTTCCATTGCCTTATAAGTACCGA 
CACCTGTTGGACGTGTTCAAGGGTCTCGATTCCGTGGTGGCCATGTTTCACAACCGCAAGGAGACCATCACATTCAAGAAGCTTAAGCCGGCAGT 
GCAGCGCATGCTCCGGAAGAACTTCACTGAGACCCATTTGGCCCAGATCAAGCACATCTATCCAGACGCATTTATCTTCAGTCAGGTGAAGACGC 
GCAATTTCGGTTCTGTCTCCAAAGCGGACTATTTCCAGCTGATCATCGCTCCCAATGTGGAGCCACTGCCCGAGCAACAGCAATCGGAGAAGCCA 
CAACATTTCACCAAGATCAACGAAGACGATGTGCTCGCGTCGGCCCAGTCGACGTCCATGAATCCACACGATCCGCCGATCATCATCGAAAAGGC 
GCTGACTCGCTGGCATCCGCAGTTTGATTTGGAGAGTTGTCCGGAAGTGGAGCTGTCCCCTTTGCCGCAGCCGCCCAATGTCGAGAAATACTCGT 
CAGCTAAGGATATTCTGTCTACCGCTCGAAATCTCTTCAATTGCGCCACGCCCATGGAGAGAGCAATGGATCGCTATGAGGCCAAGTTGGAGTCG 
GAAAAACAGCAAGCAGCTGAAAGCAATAAAAAGACGGAGGAACAGCAGGCAGGAGAAGTTACCAGGACAAGTACAGCGATTCAAACCAGTCAGGA 
GGTGCCTGGAATATCTGGATCATCGAAAAACCCAACTGTCCCTGAAACAACAAACCAAACAGAAACTGCCAAGCCCACCGTGAAGGATTGCACAG 
TGCCAGATGCCAGCTCCAATCTACTGAAGGGACTGCCAAAATCACTGATCGAGAAAATTCGAGCCAAACAGGCAGCAAAGGCTTTGGAGGCGATG 
ACTCGACGTCCTTCGCAGGACCAGGAGGCCACGAAGTACTCACGTTTGCCCGAATTGGCCAGACACCTGCGCAATGTCTTCGTTACCGAGCGTAA 
AGGAGTGCTCACCCTGGAAGTAATCATCAAGAAGATCCAGAACAGCTTCCGGGCGAACCTCACACCGCAGGAAATTGAGGCGCATCTGAAACTAC 
TTGCCAAGGAGCTACCCAGCTGGGCTTCCTTTCATGAAGTGCGCAAGACC AT GTACCTC AAGGT TGCCAAGGACATGGAC AT GAACAAGAT CATC 

GAGAAGCTGGAGAGCGTGGCTAACGCCAAGAGCAATTAG 
(SEQ ID NO: 773) 

Start ATG: 552 

MAQPSVAAFFTNRKRAALDDAISIKNRRLVE 

IKQPKLVQFIKKGTLSPRKQAQSSKLDEEELQQSSAISEHTPKVNFTITSQQNADNVQRGLRTPTKQILKDASPIKADLRRQLTFDEVKTKVSRS 
AKLQELKAVLALKAALEQKRKEQEERNRKLRDAGPSPSKSKMSVQLKEFDTIELEVLISPLKTFKTPTKIPPPTPDKHELMSPRHTDVSKRLLFS 
PAKNGSPVKLVEVPAYKRYASLVESSRAGQLPLPYKYRHLLDVFKGLDSVVAMFHNRKETITFKKLKPAVQRMLRKNFTETHLAQIKHIYPDAFI 
FSQVKTRNFGSVSKADYFQLIIAPNVEPLPEQQQSEKPQHFTKINEDDVLASAQSTSMNPHDPPIIIEKALTRWHPQFDLESCPEVELSPLPQPP 
NVEKYSSAKDILSTAP^LFNCATPMEPJyv[DRYEAKLESEKQQAAESNKKTEEQQAGEVTRTSTAIQTSQEVPGISGSSKNPTVPETTNQTETAKP 
TVKDCTVPDASSNLLKGLPKSLIEKIRAKQAAKALEAMTRRPSQDQEATKYSRLPELARHLRNVFVTERKGVLTLEVIIKKIQNSFRANLTPQEI 
EAHLKLLAKELPSWASFHEVRKTMYLKVAKDMDMNKIIEKLESVANAKSN* 
{SEQ ID NO: 774) 



Celera Sequence No. : 142000013384581 

AAG CGAGCTTTTT GC AC TTGTCTTGGAT T T AAAT AT T GT AAC G AAAAAC GT A T AAAC AC AAT C AG GC AGC G AAG AAAAGC T C T C T T C T GAG AT AG 
TTAAAAAAATAACATTATAATTGCACTAATTTTTCACACCAAACTATGATCGCATTTTCCCCGGCGATTTATGTATTTTTGGATATTTCGAGTGC 
GAATGGGGAGCCCAGCGACCCGAGCAGTTGGAGATTCCGGAAAGTTGTGAGTTAATTGTTCATCTCGGTTCATTTTGCTAGACCAGGGCTTGAGA 
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AACACACGAAAGTATCGATTAAATGCTCGATATTTTTTTTTGAAATGCGTTCCGTTACTTTTACATTCACCTGGAGTGGAAAATGTATACATTTT 

ttggaccLaaacaaactggtaatttttatgacaagcg^ 

TGATAGTAAGTGTAATAGTAATTTAAATTAATATAATTTGTTTTAGCTTGATATCTTAAGAACAATCAGAGTGCAATTCCAAAATGTAAATAAAA 
TTTTTGCTTTATTTGAAA^^ 

ATAAAAGTAAATTGGATTCATAGTCTTTTTTCTCTAAAAT^^ 
CCTTATTACTAGCTTCTATTTC^ 

TTTTTGAAATGTGTTCTCTTTACATTCGATTTAAAAAGTCTTCGTCTTTCGACGACTATCGATCCTCTGTTATCGAAGCATCGCAAAGTATCGAG 

TACGGATTCCCCACTGAAGTTTGAAATCATTCTCGCGAGTGGATCTCTCGAGAGCGGACGTTTTTCGAGTCGGAGCCGGGTCGAGTCGAGGGCCC 

CAACTTGGAATTTTCTTTCCGATTGACTTGGAAAATAGCTTCTTCGTCGCGTGCGAGTGTGTGTGTGTGTATGTGTGTGCGCGCGGTGTGGAGTT 

TATAAAAAGAAGTCGAAAAGTGTTTGCCGCATCAACGAAAATAAAAACACCCAAGAAGAGTGTTTGTTGTTTTGTCTGTATAACAGAAAACTGAT 

TTTCTCAACTTCACCTCTGCACATCGATCAAGGACTAGATATAGATGCATCATATGTAGCCGAATTAAAGTATCTGGTAAAGTAGCTGTTGATGA 

ATCTTAAGAAACCGATTTACAGTTAGTTTTTGGATCCACGCGTAGTTTTTACGCAGGTTTCAACAGCTGCAGTCCCCCCGTCTCGCTCACTCCTT 

CTCCCTGCCTATTCTGTTTCTGTCTTGAGGTTTATCTTGTGGCCTGTTTTTGTTGTTACTTCACTTGCGTACTTCTATTTCCACTCCTTCGCATC 

CGCTGCCGTTGTTGTTTTTGTTGTGGCACACTGCCTCTTTTTATCGCCTCTCTGTTACGAACAGAAAACTCAAGATAACTCCAAAAAACTCGAAG 

GGGGCAAGGGCTTCAAATGCCTTTTAAGATCCCCAAATCATTTAAGGGATTTCGGGATAATTCTAAAAAATGAATATATTATTATCTATTAAAAA 

TTAATATTTTATATTAAGGTAACCTCTTTTTTAGTTTTGAAAAAATAAATAATTTTTACAGTTCTTTTTAGAGATTTTCCGGTTAGATTTTTTTC 

AAATGTTAAAAGTTTTTTGGTCGTACTATTTTAAAATGTAGAGGGGTTACTATTGTTAGATAGTTTTAATACTTAAGTTATATTTCATTTTCGTT 

ATCTTTCTATAATTCATTTGTTATAGAAAGCGTGCAAGTGAGCATCGATGTCGATCTCTAGATTTTAGCAATATATTTCTGTTTCTTCTAGATAT 

TCTTATTTCAGTATCTCATAAATATCTCATGTACTGTGGGACTTTTGTTAAGTTCCTGATAAGCTTGTTTAAAATATTTTCCATCTTGTGTCTCC 

ATTTTCTAATCTTTTCAAAAGTTACTCTACTATGTCACAGAGCAGTGCTAAGAAGCATTTGAATGCACATCTACTTTGGCTAGGGAATTTTTGTA 

GATTAAGCATACAAACCCGAACATATAGAAACATGATGCCATAACGGCAAGCCAAGTGAATGACTGCATGTGGACTAAAA^ 

CTGATTTTCTTTATTTCAGCGACCATGGAGAACGGGCTAATTAAAATGTGAGCCGGGCCTGTCGCCTTGATGCAGATGTGCAGCCGACGAATTAA 

TTTATCTAGATTGGATTGCGGACTGTGGGCAGAAAGGATTTAGTATTTAGTCGGTGTCTTCTAATTATAGCCGCTCTATCGCGGTCTGATCTCAT 

ATTTAGACGGTAGACTTCCACAACATGTGTGCTCCAGACCGAGAATTGAAATTCGGTGCTGGGGTTGGAATGGTGCATATGTGTACAGTGTAAAC 

ATACAGTGGTCAGTCGCGAGGAAAGTTTTACTATGCGAAGATATTAGCTTTTGAACATCAAAGTTTCAGAACATTTCTAGTGCCGGATACACATA 

TCATTTAAGGGTTTTGGATTGATTAAAGTCATGTACTAACAGAGCCATGTGATTTTTTTTTCTAAATAAAAGCATATGTTTGAATTAAATCATTG 

GCATCAGTGATGACAATTGAATTTGAATTCCCACTGTCCACTGAGTGACTGCATGAAATCATATTTTTTATTTGTATGGTCAAATAAGCTCTGTG 

CCTTGATGAAATTGGAAATATAAACAACGTCGCCCGGCAAAACTAAGAAAACATGAGCCACGGCAATTGTTCGCGCAAATTTTTATGAACTGGCT 

ATACACAGATATGCACGAATATAAAATGTATATACAGACCGACCCCTACACTTAGATACAGATATGTACACATCCAGCGAAGTCGAGAATCGTGG 

AGCGCTCCAGTGCTCCATTATTTACCCACATACACATATGTACATTGTATGAACCCATATTCTTATAGTGTATTTGGCTTGTATGTCTGGTAAAT 

TTACGTTTTGTTTCTTTACTTATTTGTTTATTTGCCAGTTTTCTCGGCGGCAAAACGAACGAAAATGAAATTAACCAACGCACAAATAGCAAACA 

AAAAAAATTTAAACACACTCCAAGTAATTAAGTTTGGAGCGGAGTCAAAAACTTCCCATGTGCCACAGAGCTCCATGGAAAAAGTGTATCTGTGA 

ATGAACTCTTATCGGGGCAGTTGAAATTATGAAATATGTATTATTAGACTAGGGAGAGCAGCTAGCTCCATGTTTGGGTCTAGAAAATGTGTTAG 

TCTTAAAAACTTGGGTTCTTATCTCACCGGCTCGTAGCTCAATATTTGATTCTCGATTTTGGTGGCCGATTTTTCCCTGCCCCGTCGAAATCTTC 

ATGATAAGAACATAAAGGTCATCGAAACATTGTGTCTAATTTCTAATATTTAATCCTTACATATAGTTAAAAAATATTTATTTATATAGATATTT 

ATATAGGTCTTAAGTCTGATTCTGCTCATCTTTTTTGGTAGAGAGATTCTGATTCCATTTCTACAGCTTAAAAACGAAATTATACACAAATCACA 

ATTCTTCAATTGTATTCTGTGCGAGAATCCCTCGTACTACAAATGAATACTGATAAAAACAGCGATCCAACGCACCAGGCAACAACGGATTATGA 

T T AT GCT AT AC CAT TAT C GGG AC C T TAT AT T GAG AGT AGT CG GAT T GAT AAT AGG AT C C ATT C AGT T T C AGT T C AAG AT TCGGTTTGCGTC T C AG 
TTTGTGCGTAAGATTCGTTGGACGCGGATGCCTTTTTTTTAATGGTACGAGACGAAAATAAATCGGTTTTTTTTATTCTGATTTTGCTGTGTAGA 
AAATTAGAGCATCTCTGTTGATAAGCGGTTTACAGTGATGATAGAAGCGTGAAGTTTGTTTAGAGAGTAAAATTGAAATCAAAGGAATATTATTA 
ATTAATAAGGCAATCTGTTAATTTTTCAGCTACAGGACTAGTTGGAAAATGCCACAATGAATAAATAAATATTTTGATCAAAATAAGGAAGAGAG 
CCGTGGTTAAGTAAGTAGCAAAGCACTGTTTTTAATTCATGATCTAATAATAGTTGGTGGAATTCACAAATGTTAATCACTGTTAATTTGGAGAA 

GT G AT G AAAACT G A T AC T T G AAT C AC C AAC CT T T T T T T TGAT AAC T T T T AC AAAC T AAAAT G AAT t T T T T T T AGAT T T T CAT ACT G AGG AT C AC G 
TGGACAAGTGTAAGAGATCCCCTTAAAAA7\.CGAACGAAAAGAAAAGCAACTGATTAAAGTCCCGGCCGGAAGTGCCCCAGTGGAACGCGACCCCA 
ACAACTGAGAGCCACATCGAGATGTTGGAGCTGCGGCTTATCGTGGTCATTGTGCTCGCCCTGCTCAGCTGGCAGTGGGACCCAGTGGACTCCCA 
AAGACCGCCCCAGCATGGCCGACGTGATCGTCCGAAATATCCACCGAACAAGTTTATCAAGACTCATCCATGCGAGAGATCCTCCTGCTACCCGG 
CAACGGGTAATCTTCTGATCGGCCGAGAAAACCGCCTGACTGCCAGTTCCACGTGTGGTCTGCACTCGCCGGAGAGATTCTGCATCCTGTCCCAT 

T T GC AGG AT AAG AAGT G TTTCCTCT GC G AC AC G C GT GAG GAG AC C AAG CAT GAT C CGT AC AAAAAT CAT CGC AT C G G AC AG AT T AT AT AT AAAAC 

TAAGCCAGGCACCAACATACCCACCTGGTGGCAGTCCGAGAATGGCAAGGAAAATGCTACCATTCAACTGGATCTGGAGGCGGAGTTTCACTTTA 

CCCATTTGATCATAACCTTCACCACCTTCCGACCAGCGGCCATGTACATCGAGAGATCCTTTGATTTCGGACAGACGTGGCACATCTACAGATAT 

TTCGCCTACGACTGCAAGGAATCCTTTCCGGGAGTGCCCACTGTCTTGGAAAACATCACCGATGTCATGTGCACCTCGAGGTATTCCAACGTGGA 

GCCCTCGAGGAATGGTGAAGTGATATTTAGGGTGCTGCCACCGAACATCAACGTTACCGATCCCTATGCGGAGCACGTGCAGAATCAGCTGAAGA 

TGACCAATCTGAGGATCCAGATGACGAAGCTGCACAAGCTGGGAGACAATCTTTTGGATAGCAGGCTGGAGAACGAGGAGAAGTACTATTATGGC 

ATCTCCAACATGGTGGTTCGCGGATCCTGCTCCTGCTATGGCCACGCCTCTCAGTGTCTACCGTTGGATCCCGCATTTTCCCAGGCGGACAACGA 

AGATGGCATGGTGCATGGCCGATGTGAGTGTACCCACAACACCAAGGGAATGAATTGTGAAGAGTGCGAGGATTTCTTCAACGATCTCCCATGGA 

AACCCGCTTTTGGAAAGAAAACGAATGCCTGTAAGAAATGTGAGTGCAACGATCACGCCGTTAGTTGTCACTTCGATGAGGCTGTCTTCACTGCA 

TCCGGTTTCGTTTCCGGCGGAGTGTGCGACAATTGCTTGCACAATACTCGGGGTCAGCATTGCGAGGAGTGCATGCCGTACTTCTACCGCGATCC 

AGAGCAGGATATAACGTCCGAAAGGGTTTGCCAGCCCTGTGACTGCGATCCTCAAGGTTCGTCGGACGACGGCATCTGTGACTCCCTAAATGAAC 

TGGAGGAGGGAGCAGTAGCAGGTGCCTGCCATTGCAAGGCCTTTGTCACGGGACGTCGTTGTAATCAGTGCAAGGATGGTTACTGGAATCTGCAG 

TCGGATAACCCCGAGGGCTGCGAGCCGTGCACCTGCAACCCCCTGGGTACACTGAACAACTCCGGATGCGTTATGCGCACCGGCGAGTGCAAGTG 

TAAGAAATATGTGACGGGCAAGGACTGTAACCAGTGTATGCCGGAGACCTATGGTCTTTCGGAATCTCCGGAGGGTTGCTCTCTGTGTAACTGCG 

ATGCTGGTGGATCCTACGATAATTACTGCGATGTCATAAGTGGTCAGTGTCGCTGCAGGCCCCACATGACGGGTAGGTCATGTTCGCAGCCGAAG 

CAGAATTACTTCATTCCCCTGCTGCCCGAGGTTCATGAAGCAGAGGTGGTGGACGAGTGCATCTCGTATGGCGCCAACGGTAACTGCAGCCTGGT 

GGCCGAAACTCCTGATGGCAGTTTCACGGGCATTGGTTTTACCAGGGTTCCTGAGAACTCGGAACTGGTTTTCACTGTCGGAGATATTCCTCGAT 

CCATGCCATACGATGCGGTTATACGTTACCAGAGCACTTCACGAGGAGATTGGGAGAACGCCTTTATCACTCTTGTCAGACCCGATCAAGTGGAT 

CCTGAAGGAGGATGCGGTGAACTAGCTGCAGCAACTTCTTCGGAAACCAGGATTCCGTTCTCCCTTCCTGATCGCAGCCGTCAGGTGGTTGCTTT 

GAATGAAGTATGCTTGGAAGCTGGAAAGGTTTATAAATTCCGGATTTACTTTGAGCGAAAGCGACACGATGTGGACAGTCCCACAGCGACGATCT 

TGGTGGATTCTCTTACGCTCATTCCCCGCATCGATGTAACGCCCATCTTCCAAGGATCCGTTCTGGCTGATATTCGCAAAAAGGATTACGAAAAA 

TACAATTGCAAATCCTCTTTGTACGATATGAACTACAAGTCGGATCCTAAATGTCAAAATCTGGACAATATCCTCAGTGTGTTTGTCCACGATGG 

AGCCAGTATGTGTAATTGCAATCCAACCGGATCGCTGAGCAAGGTGTGCGAATCCAATGGAGGTTACTGCCAATGTAAGCCCAATGTGGTGGGCA 

GGCAGTGCGATCAGTGTGCTCCGGGCACCTATGGTTTTGGTCCCGAAGGTTGCAAGGCTTGCGATTGCAACAGCATCGGATCGAAGGATAAATAC 
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TGCGACCTGATAACCGGCCAGTGCCAGTGTGTGCCGAATACATACGGCAGGGAGTGTAATCAGTGCCAGCCGGGCTATTGGAATTTCCCCGAGTG 

CAGAGTGTGCCAGTGCAATGGTCATGCGGCCACGTGTGATCCCATTCAAGGAACCTGTATAGATTGTCAGGTATGATGATATATAATATAGACAA 

TATCTTGCTTGTACTTAATGAACTTTTTAAACTTGCTTCTCTTATCACCAGGACTCGACCACGGGCTATAGCTGTGACAGCTGTTTAGATGGATA 

TTATGGAAACCCACTTTTCGGCAGTGAGATCGGATGCCGACCATGCCGCTGCCCGGAGACGGTAGCCAGTGGCCTGGCGCACGCAGATGGCTGCT 

CCCTGGACACACGGAACAATAACATGTTGTGCCACTGCCAGGAGGGATATTCCGGCTCCCGTTGCGAGATCTGTGCGGACAACTTCTTTGGCAAT 

CCGGACAACGGTGGTACCTGCTCCAAGTGCGAGTGCAGCAACAATGTCGATCTTTATGACACTGGAAACTGTGATCGGCAGACGGGAGCTTGTCT 

AAAGTGCTTGTATCAGACGACTGGCGATCACTGCGAGCTCTGCAAGGATGGATTCTTTGGGGATGCACTGCAGCAGAATTGCCAGCAGTGCGAAT 

GCGATTTCCTCGGAACGAATAATACCATAGCTCATTGCGATCGTTTTACGGGTCAGTGCCCCTGCTTGCCGAATGTCCAGGGTGTGCGTTGTGAT 

CAGTGTGCCGAGAACCACTGGAAGATTGCCTCTGGCGAGGGTTGTGAAAGCTGCAACTGCGATCCAATTGGAGCACTCCATGAACAGTGTAATTC 

CTACACGGGGCAATGCCAATGCAAGCCAGGATTTGGCGGCAGAGCGTGTAATCAGTGCCAGGCTCATTACTGGGGCAATCCCAACGAGAAGTGCC 

AGCCGTGCGAGTGCGATCAGTTTGGAGCTGCTGACTTCCAGTGCGACCGGGAGACGGGTAATTGCGTCTGTCACGAGGGAATTGGGGGCTACAAG 

TGCAACGAGTGCGCCAGGGGCTATATTGGGCAGTTTCCACACTGCTCACCTTGTGGCGAATGCTTCAACAACTGGGATTTGATTTTGAGTGCTCT 

GGAGGATGCCACAACTGCCACTATCCTACGAGCCAAGGAAATCAAGCAAGTGGGAGCGACAGGAGCCTACACGTCCGAGTTCAGTGAACTGGATA 

AGAAATTGCAGCACATTAGGAATCTGCTGCAGAACACCTCGGTCAGTCTGGTTGATATAGAGAAACTGGACTATGAAACCCAATCACTTAGGGAC 

CAACTTCAGGCGTCCCATGGCCGATTAAGTGAAACCGAGCAAAACCTGGATGACATTTACAATTCGCTAAGTTTGTCGGGCGTTGAGCTAGAGAG 

CCTACAAAATCACTCCAGATTGGTGCAACAACTGTCCAAGGAACTGAAGGAAAACGGCATACAACTGCAGGAGTCAAATATCGAGGGAGCTTTGA 

ACCTCACACGTCACGCCTACGAAAGAGTCAGCAATTTGTCGACCCTTAAAGACGAGGCCAATGAACTTGCATCGAACACCGATAGGAATTGCAAG 

AGAGTGGAAAATCTGTCTAATAAGATTCAGGCCGAAGCCGATGACCTGGCAAACAATAACAAGCTCATTGAAGACTACCGAGCAGAGTTGACTTC 

GCTAACCTCCCAAATTCCCGAGCTGAACAATCAGGTTTGCGGCAAGCCAGGTGATCCCTGTGATAGTTTGTGCGGAGGTGCTGGCTGCGGTCATT 

GTGGCGGTTTCCTGTCCTGCGAACATGGCGCCAAAACCCATTCGGAGGAAGCCCTTAAGGTGGCCAAGGATGCTGAGACGGCCATTACCAGCAAG 

AAGG AT C AGG C GG AC C AAAC C AT C AGGGC T C T G AC C C AG GC C AAAC T G AAT GC C TC C G AGG C AT AT GAG AAGGC C AAG AGGGG T TT C G AAC AAT C 

CGAGCGTTATCTGAACCAAACCAATGCCAATATCAAGCTGGCCGAAAACCTATTCATAGCCCTCAACAATTTCCAGGAGAACAAGACAGCTTCTC 

CTTCGGAGAGCAAAGAACTGGCACAGAAAACTCTCGACTTGGACCTGAAACTGGAACCCGAAGAGATTGAGACCCTCGGCGACCAAATCAACAGG 

GCTGTCTCATCGCTAAAGAATGTTGAAGCCATAATCTACAGGACCAAACCCGATTTGGACAGGGTCAATAATCTGCAGAGCATTGCCAATGCCAC 

CAAGGAGAAGGCCGATAAGATCCTGGACTCTGCCAATTCGGTCGTGGAGAGCCTGGCGGCGGCAGATGAATCCCAGGGTAAAGCCAAGGATGCCA 

TTCAGCAGGCGAACAGTAACATTGAGTTGGCCGGTCAGGATCTGGAGAAGATCGATGAGGAAACGTATTCCGCCGAAGCTCCGGCCAATAACACA 

GCAC A GC A GGT T G AGAAAT T GGC C AAG AAAGT GC AG AAACT GC AG AAT AAC AT AAT G AAG AAC GAT CGGG AT GC C AAGGAGAT AAC C AAGGAAGC 

GGGTAGCGTCAAACTGGAGGCCATGAGGGCTCGCGGAGAGGCCAATAATCTTCAGTCTGCCACAAGTGCCACCAATCAGACCCTCACCGATCGCG 

CCAGT CGCTCCGAAAATGCCAGGGAAAGGGCCAAGCAGCTGCTCCAGAGGGCCTCC AAACT AACAGTCGATACGAACGCCAAGCTAAAGGATCTG 

AACGATCTGCAGACCGTCTATCTGAACAAGAACCAACAACTACTGCGATTGCAGGCCGAGATCGGCCCCTTGAACAAGGAGCTCAACGAGCACCT 

GATTCACATCAAGGAGCGGGGGTCGCACTACAGGCAGTGCTATACGTAGATTGGGAAACGTAAGTAATGCAGTTTTAATACTATTACAAGCAAAT 

AACTAAAATCAGTAATATTTTTTAGATCCATTTGCGTGGCCATCTAAATTGCCATATTCATGAACAAATAAGCAAATTGCGAAATACGAGAATCA 

AGTAAAGAAC T AC AAGAAC AAC T AG GAAGGC GAC A GT TAT TAT GT AACC AC T GT T AAGT T AC C GT T AC GTT TTGT ACT CT AAGC GAAAACC AAC A 

CTGCAATGTTAATTTTAAAAATTGAGCGAAAGAAATCGATGCCTTCACTTTTAATGTGTACACACTGAGCTAAGTTTTTCTTATACTATCCCTAT 

ACCATTTTGTAATGCAATTAATGTAATTAAATAAAACTACTTAAATGTGGAAATGTCTAGCATTTGTTTGGGCTCGATCGGGAGAGGAATGAGAT 

AATGGTCATTTAAATGGAGAGCGACAGGTGAGAGTATTTTTGCAACACAGTTCTATTTCCCAACAATAAATCTTTTTCGCTGAACAAATGCAACT 

ACTTTTAAGT ATAAT AACAG AAATATATTATTTAC AAACGTTGAATAAAACAAT AT T AAT AAC GT T GGAAATGAAGCCTAAAAGCT T AAT AT AGA 

TAAAAAGTATTTAAACAGAAGTTTGTTGGAGTTCAAATATATACAAAAATTCGGCTTATGCTATTTTCACTGTACTTCGATTTCTATACTATTGT 

ATACTTATGCTGCCATAAACATTGCCAATTTATTCCATTTTCCGTTTTGTTTTAGTTCTTCTGCGCGTCGCGTGCGGCTTGTTTTTGTTTTTAAT 

TTTCGTCCAAATAATTTTCCACATTTTTTTTTCGGGGGCCAGAGGTTCTTTTAAAACCGGAAGTTGTTACGZVATGGATCACCGTTACAGAAGATC 

GTTAAGGAGCTCAGCAGGTATTTAATAACAATTAAAAAATTAATAATGCAGTTGGCACAGTTGCAGAGAAGCCCACTTCAAGCACAAAGGTCAAG 

GTAATGCAAAAAATGCAAAAGTGCTTGAAATCATGCAATTTCTAGTTATCCCTGCCTGCTCGAAAATCACTAAAGTTTTGCATGAAAAGAGTACC 

CCTGCTGGCTATCGATAAGAGTTATCGCTTCGGCCCCTATCGATATTTCAAAAGTTCATCAATTTCAGCTGAAATCTCAACAAGTGCAAGTTTCG 

AGCTCTGTTTTAATTAAACACAGACGTGTCGCCGAACAAGTCGGCATTGTTTGGATCGGTTTTCAGGTTGCAGTGCGTGAAAATATATATATATT 

TTTTATTTATTTATTGCGACATCCGTGCGGCCACTCACACACACAGGCGCGATTGACAGATGTGCGACGTGCATACGTATAAAGAGCACACACTC 

GCACACAC 

{SEQ ID NO: 775) 

Exon: 1001.. 1311 

Exon: 4115. .4190 

Exon: 4350.. 7195 

Exon: 7272. .9939 

Exon: 10001.. 10313 

Start ATG: 4487 

Transcript No. : CT22015 

AGAGCGGACGTTTTTCGAGTCGGAGCCGGGTCGAGTCGAGGGCCCC7AACTTGGAATTTTCTTTCCGATTGACTTGGAAAATAGCTTCTTCGTCGC 

GTGCGAGTGTGTGTGTGTGTATGTGTGTGCGCGCGGTGTGGAGTTTATAAAAAGAAGTCGAAAAGTGTTTGCCGCATC7AACGAA7AAT7AAAAACAC 

CCAAGAAGAGTGTTTGTTGTTTTGTCTGTATAACAGAAAACTGATTTTCTCAACTTCACCTCTGCACATCGATCAAGGACTAGATATAGATGCAT 

CATATGTAGCCGAATTAAAGTATCTGCTACAGGACTAGTTGGAAAATGCCACAATGAATAAATAAATATTTTGATCAA7AATAAGGAAGAG 

T GGT T ATAAT TTTC AT ACT GAGGATCACGTGGACAAGTGTAAGAGATCCCCTTAAAAAACGAACGAAAAGAAAAGCAACTGATTAAAGTCCCGGCC 

GGAAGTGCCCCAGTGGAACGCGACCCCAACAACTGAGAGCCACATCGAGATGTTGGAGCTGCGGCTTATCGTGGTCATTGTGCTCGCCCTGCTCA 

GCTGGCAGTGGGACCCAGTGGACTCCCAAAGACCGCCCCAGCATGGCCGACGTGATCGTCCGAAATATCCACCGAACAAGTTTATCAAGACTCAT 

CCATGCGAGAGATCCTCCTGCTACCCGGCAACGGGTAATCTTCTGATCGGCCGAGAAAACCGCCTGACTGCCAGTTCCACGTGTGGTCTGCACTC 

GCCGGAGAGATTCTGCATCCTGTCCCATTTGCAGGATAAG/AAGTGTTTCCTCTGCGACACGCGTGAGGAGACCAAGCATGATCCGTACA7AAAATC 

ATCGCATCGGACAGATTATATATAAAACTAAGCCAGGCACCAACATACCCACCTGGTGGCAGTCCGAGAATGGCAAGGA7AAATGCTACCATTCAA 

CTGGATCTGGAGGCGGAGTTTCACTTTACCCATTTGATCATAACCTTCACCACCTTCCGACCAGCGGCCATGTACATCGAGAGATCCTTTGATTT 

CGGACAGACGTGGCACATCTACAGATATTTCGCCTACGACTGCAAGGAATCCTTTCCGGGAGTGCCCACTGTCTTGGAAAACATCACCGATGTCA 

TGTGCACCTCGAGGTATTCCAACGTGGAGCCCTCGAGGAATGGTGAAGTGATATTTAGGGTGCTGCCACCGAACATCAACGTTACCGATCCCTAT 

GCGGAGCACGTGCAGAATCAGCTGAAGATGACCAATCTGAGGATCCAGATGACGAAGCTGCACAAGCTGGGAGACAATCTTTTGGATAGCAGGCT 

GGAGAACGAGGAGAAGTACTATTATGGCATCTCCAACATGGTGGTTCGCGGATCCTGCTCCTGCTATGGCCACGCCTCTCAGTGTCTACCGTTGG 
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ATCCCGCATTTTCCCAGGCGGACAACGAAGATGGCATGGTGCATGGCCGATGTGAGTGTACCCACAACACCAAGGGAATGAATTGTGAAGAGTGC 
GAGGATTTCTTCAACGATCTCCCATGGAAACCCGCTTTTGGAAAGAAAACGAATGCCTGTAAGAAATGTGAGTGCAACGATCACGCCGTTAGTTG 
TCACTTCGATGAGGCTGTCTTCACTGCATCCGGTTTCGTTTCCGGCGGAGTGTGCGACAATTGCTTGCACAATACTCGGGGTCAGCATTGCGAGG 
AGTGCATGCCGTACTTCTACCGCGATCCAGAGCAGGATATAACGTCCGAAAGGGTTTGCCAGCCCTGTGACTGCGATCCTCAAGGTTCGTCGGAC 
GACGGCATCTGTGACTCCCTAAATGAACTGGAGGAGGGAGCAGTAGCAGGTGCCTGCCATTGCAAGGCCTTTGTCACGGGACGTCGTTGTAATCA 
GTGCAAGGATGGTTACTGGAATCTGCAGTCGGATAACCCCGAGGGCTGCGAGCCGTGCACCTGCAACCCCCTGGGTACACTGAACAACTCCGGAT 
GCGTTATGCGCACCGGCGAGTGCAAGTGTAAGAAATATGTGACGGGCAAGGACTGTAACCAGTGTATGCCGGAGACCTATGGTCTTTCGGAATCT 
CCGGAGGGTTGCTCTCTGTGTAACTGCGATGCTGGTGGATCCTACGATAATTACTGCGATGTCATAAGTGGTCAGTGTCGCTGCAGGCCCCACAT 
GACGGGTAGGTCATGTTCGCAGCCGAAGCAGAATTACTTCATTCCCCTGCTGCCCGAGGTTCATGAAGCAGAGGTGGTGGACGAGTGCATCTCGT 
ATGGCGCCAACGGTAACTGCAGCCTGGTGGCCGAAACTCCTGATGGCAGTTTCACGGGCATTGGTTTTACCAGGGTTCCTGAGAACTCGGAACTG 
GTTTTCACTGTCGGAGATATTCCTCGATCCATGCCATACGATGCGGTTATACGTTACCAGAGCACTTCACGAGGAGATTGGGAGAACGCCTTTAT 
CACTCTTGTCAGACCCGATCAAGTGGATCCTGAAGGAGGATGCGGTGAACTAGCTGCAGCAACTTCTTCGGAAACCAGGATTCCGTTCTCCCTTC 
CTGATCGCAGCCGTCAGGTGGTTGCTTTGAATGAAGTATGCTTGGAAGCTGGAAAGGTTTATAAATTCCGGATTTACTTTGAGCGAAAGCGACAC 
GATGTGGACAGTCCCACAGCGACGATCTTGGTGGATTCTCTTACGCTCATTCCCCGCATCGATGTAACGCCCATCTTCCAAGGATCCGTTCTGGC 

T GAT AT T C GC AAAAAGG AT T AC GAAAAAT AC AAT T GC AAATC C T C T T T G T ACG AT AT G AACT AC AAG T CGG AT CC T AAAT GT C AAAAT C T G GAC A 
ATATCCTCAGTGTGTTTGTCCACGATGGAGCCAGTATGTGTAATTGCAATCCAACCGGATCGCTGAGCAAGGTGTGCGAATCCAATGGAGGTTAC 
TGCCAATGTAAGCCCAATGTGGTGGGCAGGCAGTGCGATCAGTGTGCTCCGGGCACCTATGGTTTTGGTCCCGAAGGTTGCAAGGCTTGCGATTG 
CAACAGCATCGGATCGAAGGATAAATACTGCGACCTGATAACCGGCCAGTGCCAGTGTGTGCCGAATACATACGGCAGGGAGTGTAATCAGTGCC 
AGCCGGGCTATTGGAATTTCCCCGAGTGCAGAGTGTGCCAGTGCAATGGTCATGCGGCCACGTGTGATCCCATTCAAGGAACCTGTATAGATTGT 
CAGGACTCGACCACGGGCTATAGCTGTGACAGCTGTTTAGATGGATATTATGGAAACCCACTTTTCGGCAGTGAGATCGGATGCCGACCATGCCG 
CTGCCCGGAGACGGTAGCCAGTGGCCTGGCGCACGCAGATGGCTGCTCCCTGGACACACGGAACAATAACATGTTGTGCCACTGCCAGGAGGGAT 
ATTCCGGCTCCCGTTGCGAGATCTGTGCGGACAACTTCTTTGGCAATCCGGACAACGGTGGTACCTGCTCCAAGTGCGAGTGCAGCAACAATGTC 
GATCTTTATGACACTGGAAACTGTGATCGGCAGACGGGAGCTTGTCTAAAGTGCTTGTATCAGACGACTGGCGATCACTGCGAGCTCTGCAAGGA 
TGGATTCTTTGGGGATGCACTGCAGCAGAATTGCCAGCAGTGCGAATGCGATTTCCTCGGAACGAATAATACCATAGCTCATTGCGATCGTTTTA 
CGGGTCAGTGCCCCTGCTTGCCGAATGTCCAGGGTGTGCGTTGTGATCAGTGTGCCGAGAACCACTGGAAGATTGCCTCTGGCGAGGGTTGTGAA 
AGC T GC AACT GCGAT C C AAT T GGAGC AC T C C ATGAAC AGT GT AAT T C CT AC AC GGG GC AATGC C AAT G C AAGC C AGG AT T T GGC GGC AG AGC GT G 
TAATCAGTGCCAGGCTCATTACTGGGGCAATCCCAACGAGAAGTGCCAGCCGTGCGAGTGCGATCAGTTTGGAGCTGCTGACTTCCAGTGCGACC 
GGGAGACGGGTAATTGCGTCTGTCACGAGGGAATTGGGGGCTACAAGTGCAACGAGTGCGCCAGGGGCTATATTGGGCAGTTTCCACACTGCTCA 
CCTTGTGGCGAATGCTTCAACAACTGGGATTTGATTTTGAGTGCTCTGGAGGATGCCACAACTGCCACTATCCTACGAGCCAAGGAAATCAAGCA 
AGTGGGAGCGACAGGAGCCTACACGTCCGAGTTCAGTGAACTGGATAAGAAATTGCAGCACATTAGGAATCTGCTGCAGAACACCTCGGTCAGTC 
TGGTTGATATAGAGAAACTGGACTATGAAACCCAATCACTTAGGGACCAACTTCAGGCGTCCCATGGCCGATTAAGTGAAACCGAGCAAAACCTG 
GATGACATTTACAATTCGCTAAGTTTGTCGGGCGTTGAGCTAGAGAGCCTACAAAATCACTCCAGATTGGTGCAACAACTGTCCAAGGAACTGAA 
GGAAAACGGCATACAACTGCAGGAGTCAAATATCGAGGGAGCTTTGAACCTCACACGTCACGCCTACGAAAGAGTCAGCAATTTGTCGACCCTTA 
AAGACGAGGCCAATGAACTTGCATCGAACACCGATAGGAATTGCAAGAGAGTGGAAAATCTGTCTAATAAGATTCAGGCCGAAGCCGATGACCTG 

GCAAAC AAT AAC AAGC T C AT T G AAGAC T AC CGAG C AG AGT T GAC T T C GC T AACC T C C C AAAT TCCCGAGCT G AAC AAT C AGGT T T GC GGC AAGC C 
AGGTGATCCCTGTGATAGTTTGTGCGGAGGTGCTGGCTGCGGTCATTGTGGCGGTTTCCTGTCCTGCGAACATGGCGCCAAAACCCATTCGGAGG 
AAGCCCTTAAGGTGGCCAAGGATGCTGAGACGGCCATTACCAGCAAGAAGGATCAGGCGGACCAAACCATCAGGGCTCTGACCCAGGCCAAACTG 
AATGCCTCCGAGGCATATGAGAAGGCCAAGAGGGGTTTCGAACAATCCGAGCGTTATCTGAACCAAACCAATGCCAATATCAAGCTGGCCGAAAA 
CCTATTCATAGCCCTCAACAATTTCCAGGAGAACAAGACAGCTTCTCCTTCGGAGAGCAAAGAACTGGCACAGAAAACTCTCGACTTGGACCTGA 
AACTGGAACCCGAAGAGATTGAGACCCTCGGCGACCAAATCAACAGGGCTGTCTCATCGCTAAAGAATGTTGAAGCCATAATCTACAGGACCAAA 
CCCGATTTGGACAGGGTCAATAATCTGCAGAGCATTGCCAATGCCACCAAGGAGAAGGCCGATAAGATCCTGGACTCTGCCAATTCGGTCGTGGA 
GAGCCTGGCGGCGGCAGATGAATCCCAGGGTAAAGCCAAGGATGCCATTCAGCAGGCGAACAGTAACATTGAGTTGGCCGGTCAGGATCTGGAGA 
AGATCGATGAGGAAACGTATTCCGCCGAAGCTCCGGCCAATAACACAGCACAGCAGGTTGAGAAATTGGCCAAGAAAGTGCAGAAACTGCAGAAT 
7^ACATAATGAAGAACGATCGGGATGCCAAGGAGATAACCAAGGAAGCGGGTAGCGTCAAACTGGAGGCCATGAGGGCTCGCGGAGAGGCCAATAA 
TCTTCAGTCTGCCACAAGTGCCACCAATCAGACCCTCACCGATCGCGCCAGTCGCTCCGAAAATGCCAGGGAAAGGGCCAAGCAGCTGCTCCAGA 
GGGCCTCCAAACTAACAGTCGATACGAACGCCAAGCTAAAGGATCTGAACGATCTGCAGACCGTCTATCTGAACAAGAACCAACAACTACTGCGA 
TTGCAGGCCGAGATCGGCCCCTTGAACAAGGAGCTCAACGAGCACCTGATTCACATCAAGGAGCGGGGGTCGCACTACAGGCAGTGCTATACGTA 
GAT TGGGAAAC AT CCATTTGCGTGGCCATCTAAATTGCC AT AT TC ATGAAC AAAT AAGC AAATTGCGAAATACGAGAATC AAG TAAAGAACTACA 
AGAACAACTAGGAAGGCGACAGTTATTATGTAACCACTGTTAAGTTACCGTTACGTTTTGTACTCTAAGCGAAAACCAACACTGCAATGTTAATT 
TTAAAAATTGAGCGAAAGAAATCGATGCCTTCACTTTTAATGTGTACACACTGAGCTAAGTTTTTCTTATACTATCCCTATACCATTTTGTAATG 
CAATT AAT GTAATTAAATAAAACTACTT AAAT GTGGAAA 
(SEQ ID NO: 776) 

Start ATG: 525 

MLELRLIVVIVLALLSWQWDPVDSQRPPQHGRRDRPKYPPNKFIKTHPCERSSCYPATGNLLIGRENRLTASSTCGLHSPERFCILSHLQDKKCF 
LCDTREETKHDP YKNHRI GQI I YKTKPGTN I PTWWQSENGKENAT I QLDLEAEFHFTHLI I TFTTFRPAAMYI ERS FDFGQTWHI YRY FAYDCKE 
SFPGVPTVLENITDVMCTSRYSNVEPSRNGEVIFRVLPPNINVTDPYAEHVQKQLKMTNLRIQMTKLHKLGDNLLDSRLENEEKYYYGISNMVVR 
GSCSCYGHASQCLPLDPAFSQADNEDGMVHGRCECTHNTKGMNCEECEDFFNDLPWKPAFGKKTKACKKCECNDHAVSCHFDEAVFTASGFVSGG 
VCDNCLHMTRGQHCEECMPYFYRDPEQDITSERVCQPCDCDPQGSSDDGICDSLNELEEGAVAGACHCKAFVTGRRCNQCKDGYWNLQSDNPEGC 
EPCTCWPLGTLNNSGCVMRTGECKCKKYVTGKDCNQCMPETYGLSESPEGCSLCNCDAGGSYDNYCDVISGQCRCRPHMTGRSCSQPKQNYFIPL 
LPEVHEAEVVDECISYGANGNCSLVAETPDGSFTGIGFTRVPENSELVFTVGDIPRSMPYDAVIRYQSTSRGDWENAFITLVRPDQVDPEGGCGE 
LAAATSSETRIPFSLPDRSRQVVALNEVCLEAGKVYKFRIYFERKRHDVDSPTATILVDSLTLIPRI DVTPI FQGSVLADIRKKDYEKYNCKSSL 
YDMNYKSDPKCQNLDNILSVFVHDGASMCNCNPTGSLSKVCESNGGYCQCKPNVVGRQCDQCAPGTYGFGPEGCKACDCNSIGSKDKYCDLITGQ 
CQCVPNTYGRECNQCQPGYWNFPECRVCQCNGHAATCDPIQGTCIDCQDSTTGYSCDSCLDGYYGNPLFGSEIGCRPCRCPETVASGLAHADGCS 
LDTRNNNMLCHCQEGYSGSRCEICADNFFGNPDNGGTCSKCECSNNVDLYDTGNCDRQTGACLKCLYQTTGDHCELCKDGFFGDALQQNCQQCEC 
DFLGTNNTIAHCDRFTGQCPCLPNVQGVRCDQCAENHWKIASGEGCESCNCDPIGALHEQCNSYTGQCQCKPGFGGRACNQCQAHYWGNPNEKCQ 
PCECDQFGAADFQCDRETGNCVCHEGIGGYKCNECARGYIGQFPHCSPCGECFNNWDLILSALEDATTATILRAKEIKQVGATGAYTSEFSELDK 
KLQHIRNLLQMTSVSLVDIEKLDYETQSLRDQLQASHGRLSETEQNLDDIYWSLSLSGVELESLQNHSRLVQQLSKELKENGIQLQESNIEGALN 
LTRHAYERVSNLSTLKDEANELASNTDRWCKRVENLSNKIQAEADDLANNNKLIEDYRAELTSLTSQIPELNNQVCGKPGDPCDSLCGGAGCGHC 
GGFLSCEHGAKTHSEEALKVAKDAETAITSKKDQADQTIRALTQAKLNASEAYEKAKRGFEQSERYLNQTNANIKLAENLFIALNNFQENKTASP 
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SESKELAQKTLDLDLKLEPEEIETLGDQINRAVSSLKNVEAIIYRTKPDLDRVNNLQSIANATKEKADKILDSANSVVESLAAADESQGKAKDAI 
QQAKSNIELAGQDLEKIDEETYSAEAPANNTAQQVEKLAKKVQKLQNNIMKNDRDAKEITKEAGSVKLEAMRARGEAKNLQSATSATNQTLTDRA 
SRSENARERAKQLLQRASKLTVDTNAKLKDLNDLQTVYLNKNQQLLRLQAEIGPLNKELNEHLIHIKERGSHYRQCYT* 
(SEQ ID NO: 777) 

Name: lamininBl 
Classification: cell_adhesion 
Gene Symbol: LanBl 
FlyBase ID: FBgn0002527 



Celera Sequence No. : 142000013384840 

AGCCGCCGAAAAGCATCCTTAATCTCACTGTCGGTGGCATTTCTCGGTAAGCCCAGAACCTGGTAGTGATCTTCGTAGACATCGCTCATTTTGGC 
CAATTGGTCTATTTGATGGGAGAATAATTAGATGATTTTATCGATAGTAGCACGATGGTCAGTGGTTTTTGGTGCAGAAGTCGGTATTTATTGAT 
GTTTATGTTGCGGGATGTAAGGCACTGGGAAAACCTGAGAAAACACTAAAGCTGATTCTTTCATTACAAATAGAGATGCACATCTGAAATGTGTG 
TTATCTTTGTATGGAAACAACATCGATAAATACATCGTGTCATTTGTCGTGTCATCGTGTCAGCGAAATAATATTCGCGACAAGAACATATTGTT 
TAAGGAAAATAAATGCGTCTTTCGTTTCTTCATTTGCCATCATATTTTATTGAGAATACGAGTCATTAGGTGGCCAACAGAAACAAAATCTAGAA 
AAATATGTCTTTTTTTTCAGAACAAACCATTATATTTCCGTCTTATCTGGTACATACCAAAGACACTAGAATATTCTAGTGTTATTCTATTCTAT 
TCTATTATTATTATTCTAGTGTCTTTGTACATACTATATGTAGTAGCAGACTGGCTCTGCTTAGCAACAATTTGTAATAAGCAAGTGCATTTAAT 
TTTCTCTACTATGGCATTGTTATATTTTTTACTATTTGTATCGCATAAACGTTTTTTAAATTAAAAAAAAACAAAACTATTAGTTCGTCTTACAG 
TATTTATCCGTTTTATCATTCGATTTGGCTAAAGGTATTAGCTATTTTCGAGCAACAGTCGGACGAATTCCATAATTTTTATTTATCTTACTAAA 
TAAATAAATAAATTAATCAATAAGTAACGCAGGATTCATTAGATAAACAGTCAATAATTAAGTTCCTATAAATTAATCTTAAAGACTATTCAGCG 
AAGAACGCTAGGTACCCGAGATTGCTTCATCAAGCCCTTGGATCCTTAGCTCAGTCCGAGGAGGATCCATCGCCGTGGGTAGCCTCTTTTTTCCG 
GTTGAAGAACTGGAATGTCTTGTAGGCGGCGAACGCAAAGAGCACGGTTCCTCCGACGGCGCACATGAAGGGGAGCATGTCTGGCGTGGGCTGGG 
CGAAGGTCGCAGCAGGCTCGTCATCACATTTGAGACCCTCCTCGCCGTGCTGGTCGTATATCTCGCGCTTTTCCTTATCAAAGAGCACTTCGAAG 
GCGGCCACCACCTCCCTAAACTGCTCCTCGGCCTGCGGATGGTCGTTCTTGTCCGGATGGTAGCGGAGAGCCATCCGGCGGTATCCCTTCTTGAC 
GTCTTCGCTGGACGCATTCCTCTCGATGCCCAGAATCTTGTAGTAATCCTTACCCATTGTGGTGAAATGGTATTCTCCTTGTTCTGGCTGAGATG 
GTCGTCTAGCGGTGTTGCGCCGTGTATGGAATGTGAGGCAGCGTCTCGACTTGTTGGTGCTTATGTGGTAATGCGAGTATATATTGCTGATTCGA 
TTCGCTGATTTAAAACATTATTACAAATAAACAGTCAATCGATAAATTATCGGCCGCCAGTGGCCAGAGTGACCCTACCTTACCACCAGAAGCAC 
TGTTAGTTAACAGAAGAGATTCTGACGATCGTAGCAGCTGTATAGGAGCACTATTAGCCTATTAGTAATTAATGATCTGTTAAATCTATCTTTTT 
TGAGTACTTTAAAAGGAATTACACATTTCGAAAATAGAAAATATTCATATCGCTTACCGGTTTTGGTATTTTTTCAGACCGTGCGGTATCTTATT 
CGCCATCGAAGCGGTCACACTGGGTGCCGCCGCCAACTTCACTCTTTCCGTTCTGTGAGCGAAAACCGAAAAGTCTGTGCTTTGGTAAGTGTTGC 
TAAAAGTTCGGAATAATGTTGCATCCCGAGCATTTTCGGGTACATAACTGTTCCACGGCGGTGGTCCAGCAAAGACTAATCGTTATCACGCCTTT 
CGCAGTTCTTAAATTCACCCGACGAGTCCCTAATACACAATTAAAATGGTTAGGGAGAACAAGGCAGCGTGGAAGGCTCAGTACTTCATCAAGGT 
TGTGGTAAGTATAGAACCTTATAGAATTCGCTCACTAGCTGGCGCCTGGCTTATGCTGTTAACTGATCCCTCCTCCAGGAACTGTTCGATGAGTT 
CCCAAAGTGCTTCATCGTGGGCGCCGACAACGTGGGCTCCAAGCAGATGCAGAACATCCGTACCAGCCTGCGTGGACTGGCCGTCGTGCTTATGG 
GC AAG AAC AC CAT GAT G C GC AAG GC C AT C C GC GGT CAT C T GGAG AAC AACC C GC AGC T GG AG AAGC T GC T AC CC C AC AT C AAGGGC AACGT GGG A 
TTCGTGTTCACCAAGGGCGATCTCGCCGAGGTGCGCGACAAGCTGCTGGAGTCCAAGGTGCGCGCCCCCGCCCGT 

(SEQ ID NO: 778) 
Exon: 1450.. 1001 

Start ATG: 1387 (Reverse strand: CAT) 
Transcript No. : CT22047 

ACACGGCGCAACACCGCTAGACGACCATCTCAGCCAGAACAAGGAGAATACCATTTCACCACAATGGGTAAGGATTACTACAAGATTCTGGGCAT 
CGAGAGGAATGCGTCCAGCGAAGACGTCAAGAAGGGATACCGCCGGATGGCTCTCCGCTACCATCCGGACAAGAACGACCATCCGCAGGCCGAGG 
AGCAGTTTAGGGAGGTGGTGGCCGCCTTCGAAGTGCTCTTTGATAAGGAAAAGCGCGAGATATACGACCAGCACGGCGAGGAGGGTCTCAAATGT 
GATGACGAGCCTGCTGCGACCTTCGCCCAGCCCACGCCAGACATGCTCCCCTTCATGTGCGCCGTCGGAGGAACCGTGCTCTTTGCGTTCGCCGC 
C T AC AAG AC AT TCCAGTTCTT C AAC C GG AAAAAAG AG GC T ACC C AC G GC G AT GG ATC C T C C T C GG ACT G A 
(SEQ ID NO: 779) 

Start ATG: 64 (Reverse strand: CAT) 

MGKD YYKI LGI ERNAS SEDVKKGYRRMALRYHPDKNDHPQAEEQFRE VVAAFEVLFDKEKRE I YDQHGEEGLKCDDEP AAT FAQPT PDMLP FMCA 
VGGTVLFAFAAYKTFQFFNRKKEATHGDGSSSD* 
{SEQ ID NO: 780) 

Classification: chaperone 



Celera Sequence No. : 142000013384534 

AACAACAAGTGCTACACGAGCAACCCCAACAACAACAACAACAACAACAATAACGGCGGCAGCAACAGTGGCAACAACAACAGCAGCCCGAGTGT 
AAACAACTTCAATTGTGATAGTGTCGCCGGTAATGGACGTTACCTGCACGGCGGACACCAGCATCCGCATCCGCACCAGGCGGGATTTGCAGCGG 
TTGCCGCCGCCGTCGGAGCAGCTCCCTCCGCCCTGCAGAGCCTGCAGCAGCTCCACCAGCAACACCACGCTCAGCAGCAGGCCACCCTGAGCTTT 
CAGCGGGAAAAACTCAAGTCGGGTAAGCAGGTGTTTCAGTCGATGAATAACCCATGATTTGATCATAAGTGGGTATTAAGTAAACATAGTGTGTG 
CACATCTAATACCTCCATTTGATCATCATTTTGTGATTCGGCAGATTCTCACAAGAAGAGCGCACTGAAGAACAAGCGCGTCCGGACAATCTTCA 
CGCCGGAGCAGTTGGAGTGCCTGGAGGCGGAGTTCGAGCGCCAGCAGTACATGGTCGGCCCGGAGCGGCTCTACTTGGCACACACCCTAAAGTTA 
ACGGAGGCTCAGGTGAAGGTGTGGTTCCAGAATAGGCGGATCAAGTGGCGCAAGCACCACTTGGAACTGACACAACAACGGCTGGCATTGATAAG 
GCAAACTCAACTGCCAGGTACATCGTTATTGGGCAATCAAGTATCCGTGTCGGCCAATGCGGCCCATTCTGTGTCCACGGAACGCACAAACGGAT 
GCAGTGCAGCCTCGCCCAGCCTGCAGGAGGAGGACAACGAGGATAGCAAGCACTCGCTGTCGCTTTCGGGCGCATTACCGCCGTTGAGTCGAGCG 
CAGTCCGAGTCGGATTTGTCCATCTGTAACGATTCCCTGGACGGCGACAGCCTCATGGACGGCAGCGAGGAGGCCTAAAAACCACGCCAGTGCAA 
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AGCAAGTTTCCTGTAAATATAGTCAAATAAATATGTACAATATCGGTTTTCTATACTTCTGCTTGTCTTTAATTGTTGATGATTAGCTGCAAGTT 
GATTGTTACAATCAATCAATGCAAACTATATTATATTGTATAAGCATTGCAGTATATCCATTACATACGAGTACTGAAAAAGCTTATTTAAAAAC 
AGAATAATACAGACATTTACACTCAATTCGCACTTTATACCTTGATCAGTTATCACATTGGAGTATTTCACTTATTGCTGAGCAGCCGCCCTGGC 
GAGCCGATCGTGGCAGGATAGGTGCACGATGTGCCCATTTGGATAGCGCACGAAGGCGGACTGGGTTTGAAAACGCTTCTTGCACTCGGAACACA 
CACTAGACTCGTTTAACTCGAAGCTAATGTTTCGCTGCTCCTCCAAGGCATTCTCCAGGCGCGTGGACTCTGCCTCCAGGAAGCCGCACATCATC 
TGCATCTCGTGCTTGTCGGCCATCATTTTACGTATAGATTTCTCCAAGTATTTCTCCAGCTGGGGCATGGGCATGTCATCGGGAAGGTGCTGCAA 
AATCCAAAATGGTTGAGAAATATTTCTAATAAGTCATGCATTCAATGCATTCTTCACCTCGAAGATCTCGAATGGATCGATTCTGGTGGCGTGGG 
TGTTAAGTATTTCCAGGGCCACCTCTCGATTCACTTGCGAGAAATCCGGATGCAAGGGCACACCATCATAGAGCGGTTGAGTTGGTGGTATTAAT 
ATGCACTTAATGAGCGTGTGAAATATGTGCTTGTCCTCTTTGTAATGGGCTTCCGCATAGGCCGTTGCCTTGGCCACATCTCCGAGAACATGAAT 
ATAGATGGACAGGACGTTGTCGTGCTTTTTGAGCCTGCCCAGGATTAGGGCACGCTCCTCCAAGAGTATATTGGTTGGAAATTCCTCTAGGAGGC 
GATCCGGCGAATAGTCGTTGGACTCCTCCAACATCTTGTAGAGCTTGGCTCGCATCGGAATGAGTTCGGGAACTTCCTCTCTAGAAAACAGGTGA 
GTATAAGCATTTTGATCCAGTACAATGACAGACTCTCACCCTTTTTCCTGCTGAGCAAGAAGCCGTTGCACCTTTTCGCGATACTGCTTGAGGAG 
CACATTGTGGAGCAGGGTGTTGCCGTCCTTCCATTCAGTGATTACGTGCTCCAAATAGGGAATGACAAGTGCTTTGTGCTTGCTAATAAAAAAAT 
CAAGAACTTTGGCGCGCGGCAGGGATTCCACTTCAATAAGTTCGTCGGTGAAAATGGTTAGGCCTTCCTCGGGATTATCGTTTAAGACCCAGTCA 
GCGAACTCAAAGATCAGCGGCAGATGATCACCACCCAGCTCCTGAAGATAGCGTATGGTACGCTTGCGACCCTGCAGCACAGATCCTTCGATGCT 
CGCCTGCTCTCGAAGCAGTTTAAGGGCCTCCTTGTGCTTGCCCTTCATCTGGTAGAGGATGATAAGTTCAGAGATCTTGTTGTGCTTCTTCAACG 
TCTTTTCCGACTCCTCTAAGTGGCACTGGTTTAGGCGTAGCAGTGGTGCCACCAGAGAGTCGTTTGTCTGCAGGTAACACTTCAGCAACGTTGTA 

T CG AT G AT C T C C AGC AGGGAC T T T G AAC T AC T T T T AGT GT CGC GC AG C T T G ACC AC CTCCCGCTGTCGGGCC C AT GC C AG AT ACT C GAT C AGT GC 
CAAGTAGGCGTTCTCCAGGTCGCTATCCTCAAGCGCCGGAGTACTGGACGTAGGCACGGTGACATCTTCGGTGCCGGGCTTTGGTTCTGGCACCA 
AGTTTGGGAACAGTCTTATCACATCGTAGGGATCTATAGCCGCCTTTTCAAACTCCTTCATGGCCGCCGAGAACTCCTTATTGGTAAACAGCTCT 
TTCGCATAGAGCATGTGTATCTGGCGAATGGTTTGGGCTCGGTCCGCGGCAGGTTCATCGGAGATTTGCTGAACATAAAGTTTGTTGTAGTCGTC 
GTATTTTGATTTCGTTTATCGGACTCACCGTCAGCTCAATAGCCAGTTGGAATTTCTTTTGCTGTAGCAACTGCTGACGCTGGATAGGGATTTCC 
ACCATTCGAATGCACCAAAGCTCGGAGGTGGCGGCGGCAAAGATGGTTCCCTTGTCGGCGTGAACCAGGAATTTCGTCTTCTGTAGCTCGGGAAT 
GGTCTGGACCAGGGTGTCCTTGCCCACTAGGCTGCGCACCTCGATGGCGTTGTTGACACGACCCACAGCGAATGGCTCATCCCAGACTGTAATGA 
AGATAATGTGATTTAGAAATTGGCTTGAAGTGAATAGTGAGTATCGTACCCAGATCTAAAAGCGGACTTGACCAGAGCAGAGGAGTTAGGGAACT 
ATTGCTGTCCATGGCAGCTGGCCGGACATCTGTTGAGTTATTGATTCCATCCGATTCTTTGTACTGACTCGGATCTACTACGACGAGGTAGCTGT 

C C T T AGAG AT GC C C AAC ATGT T GT T C C GG AT AAGGC AAAT GC AGGG AT C T CT G C T G AT AC T AG AGG AGGT AAG GAAG AGGT T GT GC TTCTTTGGT 
GCTTTTCCTGATATCTAGAATTGGATATGCAGACCTTTAAAATACGAAGTAAATCGACTTTAAAGCATTTCTCACATCGTAAACAACATATTCGT 

C T T T G AAGC C C AC G C AG AC AGC AT GG C C C AC C CAGC AC AGT GT CC T GG GC AC AT CGC T G AGCT C GAT GAT C AAT TC C AG GG AAT T AAG CT T GT C C 
TCCTTCCAAAAGAAGAACACAAGACGCCGCCGGATAGCGCAGCACACCCGTATGACGGTGGCCACACGTCCGGTTGTGGACTTCGGCGTGTCCAC 
GTCCATGGTGAACAGGGTGCATCCCTTCGTGTCGGGCGCACTGTGCATGAAAGCGAAGTTGCTCTCAATCCGCCGAATGTCGCACACCTGCACCT 

GT AAGT C GGT G AGT AC GAAG AGC AGGT T CT C C G AGGC AAT C AC T T C CAT T T G GGT GAT CGGT T T C CGGC T G AAGTT C T T G T T G AACAT CC GC AT G 
TCCACGCCCGTCTCCTCGTCCACAGAGTACATGATCAGCTGTCCGCTGCGGGTGCCCAAGATGACATGGTTGCCTGAAAAAATGCGTCCGGTGAG 
TAATAGCTGTGCACCGAAAATTCCACTCACCATATGCCGCAATTGACTCTATCTGCACGCCCTGTTTCAGGATCGAGTGAACACTGTAGGCCTGG 
TGCATCTTGGCCAAGTGGAAAAACTTCGGTCTCCTCCGCCTCAATTTATTTTCGTAAAAAAGCTTATAAACAAACCAGATGGGAGTACCACTAAT 
GATAGCTGATGATGATAGATATCGATAGCTTTAGCGGTGGTAAATCGATTTCTTCCTCTTATGGTGATAGCGAAATAACTTAAATATACAAAGTA 
GAAAATAAAATTTGTACATGTTATAATTTAAGATTCAATCGCTAATGACTAAAAAGTTGAGTGAAATTTTAAATTACGATACTTTAAACGAATGC 
TGCCAGACTCTGCATTGCTGTCGATAGCACCAGTTCGATATACTGGTTCCCCGCGAAACCGTCCACCTCTAGTCCTTTTCACTTTAGCGAGGCGA 
AATTGTCACATTTACATTTGACAAAATTGTAAGTTTTGAAACAACTCTGTCGCTTTATTTCGGGTTGAATGTTTTCATTGCATTACTACCCACCC 
AGTGAAGTCGTTGAGTAACAGTTTTACCCCGTTCCCGCTCCAGGGTCGCCAACAAGCCCGCAGACAGCAACAACATCGAAAGTAGCTGCACAGCA 
ACAACAGCAAACAGCCAAAATGCCCAAGAATAAAGGAAAAGGAGGCAAGAATCGTCGTCGTGGTAAGAACGAGAACGAGTTCGAGAAGCGTGAGC 
TGATCTTCAAGGAGGACCAACAGGAGTACGCGCAGGTGACCAAGATGCTGGGCAACGGTCGTCTGGAGGCAATGTGCTTTGATGGCGTCAAACGC 
CTGTGTCACATTCGGGGGAAACTTCGCAAGAAGGTGAGTGCTGCTCTGACTACGCTCTTTGTTTCGCCTGTGCTCTCTTCGCTTGCGTTGAGTGG 
GTGCCCCTCTCTAAGGTGGCTCTCTTAAGCCATTTTGGCATTGATTGCCAGCGAATGCTACGTATTTACGTTATTGTTTTTGTTGAAATGCAGGC 
AGTTTTTGTGATTTTGGGGGATTTTGCATTACCCTGCTAACTTTCGACCTTTCTATACCTAATTTTCAACTCTCTTTTTCTGCAATTCTCTTTTA 

CTGGTCCATTATTGAGTGGGTGTCATCGTCCTTGTGC 
(SEQ ID NO: 781) 

Exon: 4262. .4116 
Exon: 4063.. 3591 
Exon: 3529.. 3280 
Exon: 3221.. 2974 
Exon: 2918.. 2035 
Exon: 1980.. 1578 
Exon: 1514.. 1001 

Start ATG: 4185 (Reverse strand: CAT) 
Transcript Wo. : CT22081 

CCATCTGGTTTGTTTATAAGCTTTTTTACGAAAATAAATTGAGGCGGAGGAGACCGAAGTTTTTCCACTTGGCCAAGATGCACCAGGCCTACAGT 
GTTCACTCGATCCTGAAACAGGGCGTGCAGATAGAGTCAATTGCGGCATATGGCAACCATGTCATCTTGGGCACCCGCAGCGGACAGCTGATCAT 
GTACTCTGTGGACGAGGAGACGGGCGTGGACATGCGGATGTTCAACAAGAACTTCAGCCGGAAACCGATCACCCAAATGGAAGTGATTGCCTCGG 
AGAACCTGCTCTTCGTACTCACCGACTTACAGGTGCAGGTGTGCGACATTCGGCGGATTGAGAGCAACTTCGCTTTCATGCACAGTGCGCCCGAC 
ACGAAGGGATGCACCCTGTTCACCATGGACGTGGACACGCCGAAGTCCACAACCGGACGTGTGGCCACCGTCATACGGGTGTGCTGCGCTATCCG 
GCGGCGTCTTGTGTTCTTCTTTTGGAAGGAGGACAAGCTTAATTCCCTGGAATTGATCATCGAGCTCAGCGATGTGCCCAGGACACTGTGCTGGG 
TGGGCCATGCTGTCTGCGTGGGCTTCAAAGACGAATATGTTGTTTACGATATATCAGGAAAAGCACCAAAGAAGCACAACCTCTTCCTTACCTCC 
TCTAGTATCAGCAGAGATCCCTGCATTTGCCTTATCCGGAACAACATGTTGGGCATCTCTAAGGACAGCTACCTCGTCGTAGTAGATCCGAGTCA 
GTACAAAGAATCGGATGGAATCAATAACTCAACAGATGTCCGGCCAGCTGCCATGGACAGCAATAGTTCCCTAACTCCTCTGCTCTGGTCAAGTC 
CGCTTTTAGATCTGGTCTGGGATGAGCCATTCGCTGTGGGTCGTGTCAACAACGCCATCGAGGTGCGCAGCCTAGTGGGCAAGGACACCCTGGTC 
CAGACCATTCCCGAGCTACAGAAGACGAAATTCCTGGTTCACGCCGACAAGGGAACCATCTTTGCCGCCGCCACCTCCGAGCTTTGGTGCATTCG 
AATGGTGGAAATCCCTATCCAGCGTCAGCAGTTGCTACAGCAAAAGAAATTCCAACTGGCTATTGAGCTGACGCAAATCTCCGATGAACCTGCCG 
CGGACCGAGCCCAAACCATTCGCCAGATACACATGCTCTATGCGAAAGAGCTGTTTACCAATAAGGAGTTCTCGGCGGCCATGAAGGAGTTTGAA 
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AAGGCGGCTATAGATCCCTACGATGTGATAAGACTGTTCCCAAACTTGGTGCCAGAACCAAAGCCCGGCACCGAAGATGTCACCGTGCCTACGTC 
CAGTACTCCGGCGCTTGAGGATAGCGACCTGGAGAACGCCTACTTGGCACTGATCGAGTATCTGGCATGGGCCCGACAGCGGGAGGTGGTCAAGC 
TGCGCGACACTAAAAGTAGTTCAAAGTCCCTGCTGGAGATCATCGATACAACGTTGCTGAAGTGTTACCTGCAGACAAACGACTCTCTGGTGGCA 
CCACTGCTACGCCTAAACCAGTGCCACTTAGAGGAGTCGGAAAAGACGTTGAAGAAGCACAACAAGATCTCTGAACTTATCATCCTCTACCAGAT 
GAAGGGCAAGCACAAGGAGGCCCTTAAACTGCTTCGAGAGCAGGCGAGCATCGAAGGATCTGTGCTGCAGGGTCGCAAGCGTACCATACGCTATC 
TTCAGGAGCTGGGTGGTGATCATCTGCCGCTGATCTTTGAGTTCGCTGACTGGGTCTTAAACGATAATCCCGAGGAAGGCCTAACCATTTTCACC 
GACGAACTTATTGAAGTGGAATCCCTGCCGCGCGCCAAAGTTCTTGATTTTTTTATTAGCAAGCACAAAGCACTTGTCATTCCCTATTTGGAGCA 
CGTAATCACTGAATGGAAGGACGGCAACACCCTGCTCCACAATGTGCTCCTCAAGCAGTATCGCGAAAAGGTGCAACGGCTTCTTGCTCAGCAGG 
AAAAAGGAGAGGAAGTTCCCGAACTCATTCCGATGCGAGCCAAGCTCTACAAGATGTTGGAGGAGTCCAACGACTATTCGCCGGATCGCCTCCTA 
GAGGAATTTCCAACCAATATACTCTTGGAGGAGCGTGCCCTAATCCTGGGCAGGCTCAAAAAGCACGACAACGTCCTGTCCATCTATATTCATGT 
TCTCGGAGATGTGGCCAAGGCAACGGCCTATGCGGAAGCCCATTACAAAGAGGACAAGCACATATTTCACACGCTCATTAAGTGCATATTAATAC 
CACCAACTCAACCGCTCTATGATGGTGTGCCCTTGCATCCGGATTTCTCGCAAGTGAATCGAGAGGTGGCCCTGGAAATACTTAACACCCACGCC 
ACCAGAATCGATCCATTCGAGATCTTCGAGCACCTTCCCGATGACATGCCCATGCCCCAGCTGGAGAAATACTTGGAGAAATCTATACGTAAAAT 
GATGGCCGACAAGCACGAGATGCAGATGATGTGCGGCTTCCTGGAGGCAGAGTCCACGCGCCTGGAGAATGCCTTGGAGGAGCAGCGAAACATTA 
GCTTCGAGTTAAACGAGTCTAGTGTGTGTTCCGAGTGCAAGAAGCGTTTTCAAACCCAGTCCGCCTTCGTGCGCTATCCAAATGGGCACATCGTG 
CACCTATCCTGCCACGATCGGCTCGCCAGGGCGGCTGCTCAGCAATAAGTGAAATACTCCAATGTGATAACTGATCAAGGTATAAAGTGCGAATT 
GAGTGTAAATGTCTGTATTATTCTGTTTTTAAATAAGCTTTTTCAGTACTCGTATGTAATGGATATACTGCAATGCTTATACAATATAATATAGT 
T T GC AT T GAT T GAT T GT AAC AAT C AACT T G C AG C T AATC AT C AAC AAT T AAAG AC AAGC AG AAGT AT AG 
(SEQ ID NO: 782) 

Start ATG: 78 (Reverse strand: CAT) 

MHQAYSVHSILKQGVQIESIAAYGNHVILGTRSGQLIMYSVDEETGVDMRMFNKNFSRKPITQMEVIASENLLFVLTDLQVQVCDIRRIESNFAF 
MHSAPDTKGCTLFTMDVDTPKSTTGRVATVIRVCCAIRRRLVFFFWKEDKLNSLELIIELSDVPRTLCWVGHAVCVGFKDEYVVYDISGKAPKKH 
NLFLTSSSISRDPCICLIRNNMLGISKDSYLVVVDPSQYKESDGINNSTDVRPAAMDSNSSLTPLLWSSPLLDLVWDEPFAVGRVNNAIEVRSLV 
GKDTLVQTIPELQKTKFLVHADKGTIFAAATSELWCIRMVEIPIQRQQLLQQKKFQLAIELTQISDEPAADRAQTIRQIHMLYAKELFTNKEFSA 
AMKEFEKAAIDPYDVIRLFPNLVPEPKPGTEDVTVPTSSTPALEDSDLENAYLALIEYLAWARQREVVKLRDTKSSSKSLLEIIDTTLLKCYLQT 
NDSLVAPLLRLNQCHLEESEKTLKKHNKISELIILYQMKGKHKEALKLLREQASIEGSVLQGRKRTIRYLQELGGDHLPLIFEFADWVLNDKPEE 
GLTIFTDELIEVESLPRAKVLDFFISKHKALVIPYLEHVITEWKDGNTLLHNVLLKQYREKVQRLLAQQEKGEEVPELIPMRAKLYKMLEESNDY 
SPDRLLEEFPTNILLEERALILGRLKKHDNVLSIYIHVLGDVAKATAYAEAHYKEDKHIFHTLIKCILIPPTQPLYDGVPLHPDFSQVNREVALE 
ILNTHATRIDPFEIFEHLPDDMPMPQLEKYLEKSIRKMMADKHEMQMMCGFLEAESTRLENALEEQRNISFELNESSVCSEC 

PNGHIVHLSCHDRLARAAAQQ* 
{ SEQ ID NO: 7 83) 

Classification: hypothetical 



Celera Sequence No. : 142000013384235 

CATGTTGTTGGTTCGGTTTGCTCCTGGTCGCCATTTTCCTGCAGCCATCCGCGAGCCAATACTACGGTTATCCCTATTACGGCGGCGCTAATGCG 

GGAGCGGGATCGGGATACGGGAACATATACGGCAGCGGCATGTACGGATACCCCTACAGCTACGGCGGATATCCGTACTACTCGTATCCCGGCTA 

CAACCCGTACTCGATGTACGGTGGATACGGGCAGTACGGATACCCCTATGGAGGATATCCCTACGGCGGCAATGGCATGAGTGAGTTTCCCGACT 

GATAGCACCTCGGTGCCTAGCACACAAGTAATCTACAAGTGTTTTGGACCTGTTTCTTCCTCTAGACGTGGCCTACGCCAGCAGCAGTGGAGGAT 

TCGCAACAGCCAGTGCGGGTGGAAGCGGATACTACGGCTAGGAAGCGCGAAACCTGATATAGTTTAGTGATAGGTATACGGTTAAGTCATCCTAC 

CCCATAAGTGTGTTAGTGCTAATAAATATGATATTTATGTTCTATAAGTCAAGACTTATTCTTATGACGTGTCCAAAACTAGTGTACCTGGTAAA 

AAATTGTACACCTAACTAGTGCCTTCATGTATACCACCATAATTTTATTAGCTCCCGGAACTGATGTGATAATTTGTATGTCTTTTAGCACATTT 

GACCCTATGTATGTTCCGCCAAGTTTATGCTGCCATGACTCTTTGTGGGATCATTAGATCCTTTTCTCAGCTTTTAGCTTTGAGTCAATAACTAC 

GTACGCATATCTGGTCGTTTTAATAGTTATAAAAAAGCCTTGAGGCTGTGATCCCATATATAACCATATACCGAAAAATATTTTTTGCTTCGTGA 

TAACAAATCGACAAAGTTTGATCCATGTGCATTCTTGGCGCCTTATCGATAGCTATGGACAGTGTTGTAAGGTGCACGATAGTAAGGGGGGCAAA 

ATTGTATCGGTCTGCCGATAGTTCCCGGCCGTTAGCCAAGTCCAAATTCGACCGACCGACCAATAGTCTCGCAAAACACAAAACAAAGGCACACC 

ACACTTCGTGCTCTCGCCGAATTCCGCCGCCGTCGCCTGAAACTCACACAAAGTAAGCCCGAAATCAGTGGAAAAGCGCCGAATAGTCGAATTGC 

AGGTTCAAATGGGATGGACGGCGACTTAAAAGGCGGAGGCCAATAGGCGGCATAAAAACATTGTATTGCGTAATTGGGGCCAATAAGAAAAGCAA 

AAAAACAAAGCCAACCGACTGCATTCTATTGCGCCTGTGTGTGTGCTTGTGTATCTACAGCTGAGCAGTAAATAAATTTGTTTTACATTTGGCCT 

CCAAAGGCAATTTCTCGGTGTTCCCGCTCCCTGCGTTTCCATAAACTTCCTCTCCTTTCGCATTAATATCAATGTCTTTAGTTCATCGTTACATA 

TTATTAAGCACTTAACAAGCAGTGAGTGGAAAGAGGAAGACAAATAGCCAAAGTATGCTTACGTTAAAAGAGCTTGTTCCACATATTCGCCACCT 

TTCTCCCGCACTCTTCCTCTCCCACTCTATCAGCCATCTCTATCCCTCTCACAGGGGCTGCTCACGCGCACCGTGGGGTGTTTTTTCGATTCTCT 

ACCATTTTCGCTGCTATTGCACCCCTTCCCCAGCATTTGTGCCGCTTAATTTCATAAATCAAGGCAGCGTCAGTTAAAAAACACAAGAACTACAT 

ACGTATGTACTTCTGCCCACATCCCCTTCAGCCACCTTCTCTTCATCTTTCTGGTTTCGGCAATGGGAAATTTCCGTAGCCACCGCACAGTGGAA 

AGAGACAACAGCACGCAATTTAAATTACAAGATTGCAATCTCGCTTTTTGGGCAGTTGGTTAAATTTTCATCGAGACACAAGCTGAGCCAAATAA 

ACATTCACTTTTTAAACGAACAGTGCACACGAATTGGGTAGCAATGCCTTGTTCTAAACATAGTGTAAAATGTTTTTAAGATAATTTTTATTTTT 

TAGTCCAGCGTTCAAGGCACTCGACAAATAAAACAGCATTCATAAATTTACTAATTTGCGATTAACTAGATTGTGCCAGCTTTGACACGCTCTCA 

TCCCACTGTGCGCTTCGTAGACAGAACAGCTGATGCTCGTCAAATTTTAGCTTCTGCTTGTGCAAAAAGCGCTCTGAGTGCGCTTGTGTGTGAGT 

GCGTGTGTGAGCATGGGAAAGGGAACGTGGTTATCGATAATAAAGTAAACTGAAAATAAATCAGTCATCCGCTCGGTTGACCAACTGCCCAAGTT 

AGTGCACCTTCTAGAATTTCTTGCAAGTGCACATTCTTTTGGTTCGCAGTGTTTCTGTTTTGCGCCTTATCTGGGGGGTGTGTCGTCGTCGGGAG 

CTTCACGTTTGGCTGGTTATCAAAATCACGCGACTTGACGCGGACCCCCAGTGAATTGCCAAGGTTGCTGATAAATGAATACTAATCAAACGTTT 

GCTCTTTTCGCGGGCCTATCGCTCAGATAGCAAAATTCTAGGTGGTACGAGCCGACGACGACGAAGAATCTGAGAGACCGCTTATATAATCCGGC 

AGCGAACGAGACAGGCCCCAAATCCCGACAAGATGTTGCGAATGATGCGAAGTTCCTCCTCCAGATGCTCTAGTCTTCAGCAGAGCTTGTTGAAA 

AGGTATGCCTGAGGTTATATCCTTATATTTCATAATTTACAACTGACACAATATGTATAAGTAACCATATTATGCATACACTAGCTAGTAGTTTA 

TTAGTTTTATTACATTGAACGCGCTTCATTTTTATCAGTTCGACACGCTGTCTGGGTTCCGTTATACCGGACCTTAAACTGAAGGACTTCCCGAT 

CGCCAATGAGCCAATCCTGGGATATCTCAAGGACTCTAAGGAGCGTAAGGCCCTCGAGCAGGCTCTAAAGGGCACGGCCTCGAGTTGCGAAGACA 

TCCCCATCGTGATCGGCGGCAAGGAGTACAAGACGCCGGAGGTTCGATACCAGGTCATGCCCCATGACCACCAACACAAGCTGGCCAGCTTCTAC 

TACGCCGACAAGAAGCTTATCGAGAAGGCCATCAAAACGGCGGTGGAAACACAGCCCAAGTGGGACCGTGTGTCGATTGCAGACCGTCTGAAGAT 
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CTGGGAGAAGGCAGCGGACCTAATGGCAACGACGTACCGCCAGGATCTCAACGCCGCCACCATGCTGGGTCAGTCAAAGACGGCGATTCAGGCGG 
AAATCGATTCAGCAGCGGAGCTTATCGACTTTATACGGTGAGTGTCCCCATAATTTATTACAAGGAACATGCAGAACCAGAACACAATCTGTTCC 
AATCGATTTTTTTTAAAGGATTTTCCCAATGGAAGAAGCAATCATTTCCTCATTATGATATTCAATATAAATACAAATTCTATTTCGCAGAATGA 
ATGCCTATTTTCTGAAGGAGGTCACTAAATACCAGCCCATCAGCGAGAACATAAAGGTCACGAAGAACTCCTTGCGCTATCGTGGCATTGATGGC 
TTCATCGCCGCTGTCAGCCCCTTTAACTTTACGGCCATCGGCGGCAACTTGTCCTACACGCCAGCTCTGATGGTAAGACTAGCCAATGGCTGGAA 
ATACCAATTAACTAAGCAACCGCCTTCGATTTTCGCAGGGAAACGGCGTGCTGTGGAAGCCCTCGGACACTGCCATGCTGTCCAACTGGATTATT 
TTCAAGATTATGCGCGAAGCGGGAGTGCCCGATGGAGTCGTTAACTTTGTGCCCGCCGATGGGCCCGTGTTTGGAGACACGATTACGGCCAGTCC 
CCACCTGGCAGGCATCAATTTCACCGGCTCTGTGCCGTAAGTAATTGTCGGTCTATTGATTTTCCCGTTCAATACCATTTCCTCCGACGACAGAA 
CCTTTAACCGCCTGTGGAAGCAGGTGGGCAACAACATTGACAACTACGTGAACTTTCCTCGCTTGACTGGTGAGTGTGGCGGCAAGAATTTCCAC 
TTTATACATGCATCAGCCGACGTTGAGTCAGTGGTGACCTCCACGATACGCTCAGCATTTGAGTACTGCGGCCAGAAGTGTTCCGCCTGCTCGCG 
AATGTATGTGCCGGAATCCCTTTGGCCACAGGTAAGTTTAAAAAGCATCAGATTGCGATGTTTTCGCAACGCTGACCTGTTTAACCCCTAACCCG 
CAGATCAAGGAGGGCCTGGTATGCGAGGCAGCAAAGCTGAAGATCGGTGACGTTCAAGACTTCAGCAGCTTCACGTCGGCTGTGATTGATGACAA 
AGCATTTAAACGCATTACGGGCTACATTGAGCATGCCAAAAAATCCCCGAACCTCGAGATTCTCGCCGGAGGCACTTACTCAGACAGCAAGGGCT 
ACTTCGTTAATCCGACCATTGTACTCAGCAAGGATCCCAAGGACCGGATCATGACCGAAGAGATTTTCGGTCCCGTGCTATCTATTTATGTGTAC 
AAGGAGTCGGACCTCCTTGAGACCATGAAACTGGTGCACACCTCTACCAAGTTTGCTTTAACCGGAGCAGTATTCGGGCAGGACGAGTAAGAGAC 
TTCTATACAAAGGGTTATCTTTAGTTCATAACGTATGCTTCCAGGGATTTCGTCAAGTGCGCTCTGCAGGAGTTTAAAATGGCCGCTGGAAACTT 
CTACATCAACGACAAGTCCACTGGATCCGTGGTGGGCCAGCAGCCATTCGGAGGCGGCCGGATGTCCGGCACCAATGACAAGGCCGGAGGACCAC 
ATTATATTCTGCGCTGGACCTCTCCGCAGTCCATCAAGGAGACCTTTGTGCCGCTGCGGGACGTCAACTATCCATACATGTGCGAGTAAGCTGGC 
CGACAGAAGATGCGCAAGGGGAATGGGATGAGCAAAAGTAGTGAAAACCAGACAGGAATCTCATATAACTACTAACTATATAAACATACTTAAAT 
GTATTTTGTTGAAATACTTTTAAACATCTCCAACACCCAAAGACCTGCCCCGTAACCCTACATACCTAGTTCAGATAGCTTCGATTTGTATACAC 

AC AC AC T C T AAG AAT C T C AAAAC AAC CC AT G AG AAT GCC T CG AAAGT T T GT AAATT GAT AT GC T GAT AC C T T C GAACT AC AAAC AC AC C ATGAC A 
TGGATATTTTAAATGAATACTATGTAATATCATTATTTTACGATTTAAACGCAATGTGATTATTTAATTATATATGATTTATAATTAAATGGCCC 
GCCTTATGTTTTATTCGTTATCGAAATTGCTTGAATTGGTTGAACCGTTTTCAATTATAAAATACTTAATGTACACACCTATGTATGTGAAATCA 

AAC G AC AAT AC ATT C T G AAT AC T AAATGT GAAAG AT C AT G AAG AG AT AAAGT G AAAC CT AGT T AC AT GT AAC GT AAGC ATC AAAT T CAT AT T AAT 
AAATATAACAAAGAATTAAAAAGAATTATTTGGGTTCCGTTTTCAACATTTCGTTTAACCCAAAATGCTATCCGCATCGATTGGGGCAGTTCCTT 
TACCCGAAATAAGATGAGTTCTTAGAGATTAGAAACCCGTAGAATTTAAACTTCAAAACTTGATGGTCTGAAGAGAACATGCAAAGGTTTAGTTT 
CATTTGATTCAAATTTGTAATCAAACTACAAACAATTCTGTTATTAAAAAATACATGTATTTCTTTCCGTGTTTTCTTAGTTCAATTAAATACAT 
TGTCCAATGTGACAAAACAAATTAATCTTTTACATATTGAAGTTGTCATAAGCATTCATAAACCGGCTATCACCACAAGTTAGCGGAAAATTCAA 

CCCTTCATTTAACCCAAGCGTCTGAAAAATACCTTGGCTTACTC 
(SEQ ID NO: 784) 

Exon: 1001.. 1097 

Exon: 2497.. 2662 

Exon: 2794.. 3267 

Exon: 3416.. 3587 

Exon: 3649.. 3836 

Exon: 3894. .4116 

Exon: 4184.. 4551 

Exon: 4605.. 4839 

Start ATG: 2598 

Transcript No. : CT22083 

ACCGACCGACCAATAGTCTCGCAAAACACAAAACAAAGGCACACCACACTTCGTGCTCTCGCCGAATTCCGCCGCCGTCGCCTGAAACTCACACA 
AAATAGCAAAATTCTAGGTGGTACGAGCCGACGACGACGAAGAATCTGAGAGACCGCTTATATAATCCGGCAGCGAACGAGACAGGCCCCAAATC 
CCGACAAGATGTTGCGAATGATGCGAAGTTCCTCCTCCAGATGCTCTAGTCTTCAGCAGAGCTTGTTGAAAAGTTCGACACGCTGTCTGGGTTCC 
GTTATACCGGACCTTAAACTGAAGGACTTCCCGATCGCCAATGAGCCAATCCTGGGATATCTCAAGGACTCTAAGGAGCGTAAGGCCCTCGAGCA 
GGCTCTAAAGGGCACGGCCTCGAGTTGCGAAGACATCCCCATCGTGATCGGCGGCAAGGAGTACAAGACGCCGGAGGTTCGATACCAGGTCATGC 
CCCATGACCACCAACACAAGCTGGCCAGCTTCTACTACGCCGACAAGAAGCTTATCGAGAAGGCCATCAAAACGGCGGTGGAAACACAGCCCAAG 
TGGGACCGTGTGTCGATTGCAGACCGTCTGAAGATCTGGGAGAAGGCAGCGGACCTAATGGCAACGACGTACCGCCAGGATCTCAACGCCGCCAC 
CATGCTGGGTCAGTCAAAGACGGCGATTCAGGCGGAAATCGATTCAGCAGCGGAGCTTATCGACTTTATACGAATGAATGCCTATTTTCTGAAGG 
AGGTCACTAAATACCAGCCCATCAGCGAGAACATAAAGGTCACGAAGAACTCCTTGCGCTATCGTGGCATTGATGGCTTCATCGCCGCTGTCAGC 
CCCTTTAACTTTACGGCCATCGGCGGCAACTTGTCCTACACGCCAGCTCTGATGGGAAACGGCGTGCTGTGGAAGCCCTCGGACACTGCCATGCT 
GTCCAACTGGATTATTTTCAAGATTATGCGCGAAGCGGGAGTGCCCGATGGAGTCGTTAACTTTGTGCCCGCCGATGGGCCCGTGTTTGGAGACA 
CGATTACGGCCAGTCCCCACCTGGCAGGCATCAATTTCACCGGCTCTGTGCCAACCTTTAACCGCCTGTGGAAGCAGGTGGGCAACAACATTGAC 
AACTACGTGAACTTTCCTCGCTTGACTGGTGAGTGTGGCGGCAAGAATTTCCACTTTATACATGCATCAGCCGACGTTGAGTCAGTGGTGACCTC 
CACGATACGCTCAGCATTTGAGTACTGCGGCCAGAAGTGTTCCGCCTGCTCGCGAATGTATGTGCCGGAATCCCTTTGGCCACAGATCAAGGAGG 
GCCTGGTATGCGAGGCAGCAAAGCTGAAGATCGGTGACGTTCAAGACTTCAGCAGCTTCACGTCGGCTGTGATTGATGACAAAGCATTTAAACGC 
ATTACGGGCTACATTGAGCATGCCAAAAAATCCCCGAACCTCGAGATTCTCGCCGGAGGCACTTACTCAGACAGCAAGGGCTACTTCGTTAATCC 
GACCATTGTACTCAGCAAGGATCCCAAGGACCGGATCATGACCGAAGAGATTTTCGGTCCCGTGCTATCTATTTATGTGTACAAGGAGTCGGACC 
TCCTTGAGACCATGAAACTGGTGCACACCTCTACCAAGTTTGCTTTAACCGGAGCAGTATTCGGGCAGGACGAGGATTTCGTCAAGTGCGCTCTG 
CAGGAGTTTAAAATGGCCGCTGGAAACTTCTACATCAACGACAAGTCCACTGGATCCGTGGTGGGCCAGCAGCCATTCGGAGGCGGCCGGATGTC 
CGGCACCAATGACAAGGCCGGAGGACCACATTATATTCTGCGCTGGACCTCTCCGCAGTCCATCAAGGAGACCTTTGTGCCGCTGCGGGACGTCA 

ACTATCCATACATGTGCGAGTAA 
(SEQ ID NO: 785) 

Start ATG: 199 

MLRMMRSSSSRCSSLQQSLLKSSTRCLGSVIPDLKLKDFPIANEPILGYLKDSKERKALEQALKGTASSCEDIPIVIGGKEYKTPEVRYQVMPHD 
HQHKLASFYYADKKLIEKAIKTAVETQPKWDRVSIADRLKIWEKAADLMATTYRQDLNAATMLGQSKTAIQAEIDSAAELIDFIRMNAYFLKEVT 
KYQPISENIKVTKNSLRYRGIDGFIAAVSPFNFTAIGGNLSYTPALMGNGVLWKPSDTAMLSNWIIFKIMREAGVPDGVVNFVPADGPVFGDTIT 
ASPHLAGINFTGSVPTFNRLWKQVGNNIDNYVNFPRLTGECGGKNFHFIHASADVESVVTSTIRSAFEYCGQKCSACSRMYVPESLWPQIKEGLV 
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CEAAKLKIGDVQDFSSFTSAVIDDKAFKRITGYIEHAKKSPNLEILAGGTYSDSKGYFVNPTIVLSKDPKDRIMTEEIFGPVLSIYVYKESDLLE 
TMKLVHTSTKFALTGAVFGQDEDFVKCALQEFKMAAGNFYINDKSTGSVVGQQPFGGGRMSGTNDKAGGPHYILRWTSPQSIKETFVPLRDVNYP 

YMCE* 

(SEQ ID NO: 786) 

Name: DELTA- l-PYRROLINE-5-CARBOXYLATE DEHYDROGENASE-LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384663 

CCAAAACTCTGCCTTCGACACGATTTCTTTTCTATTTCAACAGCGAGCTGACATTTTTTTGCATGTTGTTGTGGCTGCGAATTAAAAATTGCATA 
CAAAAAATGGCAGCAAAAAACTAGTCGATCTTTACTCGACTTTGGAATACCAACTAATCGGTTTTCACTATATTTGCTTGATCAAAATAAAGCAA 
GTTAGTTATTTATATATATTTATAATATATAGATCATTTCCAATGCAAATGTGAGAAAAACGATTATAATATTGGTAATTTTTAATGTAGTGATT 
GCTTTGATCCTTTTGAATTATTTTAATAAGAATCGCAAAACTTGCTTTCAAGTTAATTAATTCAAAACTTCAAATCGCTAGCAGTAAACTTACTT 
CAATAATTAAAGAAATTCGATATTGTTTTAACATACCCCTGTCCTTTAGAAACACAGGGTGCAAATAGCGAAATGAAATTCCTGGCACAGAACGT 
AATATTATTAGCGGGGAAGCCTTGTCGACCGCCTCCACTTGACTGCGAGCAACTCGGGGCCATAAATATTGCATTACAATCGCTTTACCCCCTTT 
TTTTTGGGTTGGCGAGTTGTCGTGCAGTCGGTGGTTCCACCTCCACGTGGTGGGTGGTGACCCCTCCAACCCCATCCCGTCGGGATATGCAATTG 
CGAATTGGATTTCATGTTAATGCAGGGTCCTTATCGCCGGTCATGTGCTTGGACGGTCAGCAATGGGTTCATTTGTGTTCTATTTGGCCGGTAAT 
TTCACTTAATTGCGATGACGTTGTGTGCAAAAAGCCATGGCAAGTGGGGGCGTTTTGGGGCGGGGGATGGAGACCCAGTGGATTGCTGATTTATA 
TCGCCATCGAGCGAGGTAGCACGGTCCATCAAATATTCAGCGGTCCGTGGCTTATTGAAAAGCACAAATGGCAATTTCAGCAAGTGCGAATGCCC 
AGGGAGCCGCCAGTTTGCCTTCCAGAAAAGCACTTAATCGGTGTTTTTTTGTGTGTGTACATTTAGGTTTTATTCTTGTATTTGAATGCAATCGT 
GGTTTACAATATATAGAGTTTTCATTTCAACGTTGCCTTGCACATTCAATATATAATATACGTAATAAAAAGTAATTATAAACAACAGTTGTAAT 
AAAAATAATAAACATGTAGATGCATTTTCTCGTAGACGGTTGCGGTAACGGATAATTGGGAATTTATCTTTTTATGTTTCTTAGTGTTATGTCAT 
GTGGGTCATCAGCTCGAGCTCAAATATACACTCGTGTGTATATATATATATATTTGTTTAAATTATATTTAGATATAGGTTGTATAAAAGGTGTA 
CTGCAAGTGCTGCGTATGCGTGGGCTGGTGTTCATGTGTGTGAGTGTGTGTGCGGGTTTACTTAGAAAATCATCCGAATGGTTTAAAAATAATTA 
TAAAATCTACTGCGTTTGCTGCTGAATTTCAATCTTCTCGACTGTCATGTCCGGATTCATGGCCGTCGCTTCCTGTAATCGCGTAAAAAACAATT 
GGTATCAGTTAATGTATCTACTAACGTTGAAGTTAATCCATTAGATACTCACTTGTATTGCCTCGGCCAAGGCTTTGTCATGATCGATCGGATCT 
CCATCGGACTGAATGGTTATCTTCTGTTCCACTCGGGTCTCCACAATGCCATCGCGTTCGGTTTTGTACTGTTCATTTAGAGTAGAGTATTAATT 
AATGTTATTTTATAAATGTGGTACAACCTGAACCCATTCGAACAAGGACTATGCATGGAATATACAGTGTGGTGCATACGTTCACAGGAAATATC 
TTTTGGATAGATAGATAGAAAGCCTACCTCTGGCTACTTACGCAACCTAATCGAACGGGATCAGTTGGATTAGTTCCGGTTACTTACGGTAATGG 
TTTCCACAGTGCGGGTTTTGCTGCTCACGGTTTGGGTGGAGATGATTTCGCCATGTCCGGAGAATCCGGGTGTGTCCTCACCGAGAACCACTCGG 
GTGCTTTCCACATGGGGTCCAGTGGTGGCGGAGGTCGTAAACAGCGGAGATTTCTGTGCAAGATTCGAGATTATTAAGCTTTGAATGTAACTTTT 
TTTTTTTGGAAGCGCAACTAATTTTAGAGGGCCTAATTATCGATTGCTTATTGGAAAGTAGCCATGCATGACCCACCTGGTTATCTGTGCGCACC 
ACACTGGCACCATCGTAGGGTCCGTCGATGGGCGTGCCCGAATCGCCGGAACTGGTCGAGGAAACGCTATCCCTCCTCTGATACTACACAACACA 

T AAT AT AAC AT AAC AT AT C G AAAT CAT AAT G AC AAC AATC AAC T C AT GC AAT T GG GAT GC AGC C AT GC GAT CTTACCTGATCCCCACTGGTGACC 

GTCGCCGTCGTCTTCACCTCCTGGGTAACGACGCGCTGCTGCTGCTTGTTGGGATCGTGGGTGCGGGTGGTGGCCACCGTCTTCTCCTCCAGCTG 

CTCCGTCTTGGCGTTCTTGCCCAGATCCTCGTGCGTGGTGGCGGTGCGGGTGGTGACTGCGGTGGCCGTGACATAGGCGCCACTCAGGTCGGTGG 

GTGTAGCATCAGCCTAGGTGGATTGAGTGGCAAGTGGGTGGTTCAGTGGGTTAGTGCGTACGGCATAAAAATACAACAGCAATAATACAACAATT 

T AAGC AAC AAC AT C AAAT AC AGGC GG AAGT T AGGC AAAG GC AG AG AT T AAG ATT AAT AT AAC T T T GGAAAGC GAAGC G C AAA GAG AC AGAAAG AT 

AGACGTTTCCGGTTTTCAGAGCCCTAGGTTTTGATCCAATTCGATTGCGATTTAATTAAGATTTTCTAAATATTATATTGTTTGCGATGATTGGA 

AAAACCTCAATTGAATACCAATATCTTTTATGCCCCATTAATTAAAATCGAATTCGATCAAAGCCCTTAAGAAGTGAATTCCAGTTTATCCATAA 

GGGACGCAGTCCCTCGCTAACCTTGTGTTCCTGCGTAGAGTAGGTGACCTCGCCGGTGCCCAGATTGCGCACCTCCTCTTCCACATTGTGGGTCA 

CGCCATCGATGTTCTTGGTCAGCACCTGTTTGGTGGTGGTCTTCACCACCGTGGGCGTGCTCACTGGCTTGCCCAGCTTGCCACCCGCCTGTCCC 

GCCGGACTGCCGATCAAGATCTCATTGACCTGGTCGGCGCCATACTGTGCGGTCATGTCGTCATCGGAGCTGCCCTCGGAATCCCGCCGCAGAGC 

ACCCTTGCTTATGTGCTTGACTCCGTCGAAGAACTGCTGGCGGGTGATGGGTACCGTTTCCGTGGTGGTTTGCACCTGCGCCTGTTTGCCGAAGT 

GGGAGGGATCGATCTGGGCATAGTTCAGGATCAGTTTTCCGGTCACGGGATCCGTGACAGCCGTGGCGGGATCAATATCTCCCGTGTTCGGATCA 

ATGTCGCCATACTCGCTCTTAATCCTGCCGGTCACGGGATCAATCTCTCCTGTCAGCGTCATCGTGTGGGTGACCGTGGTGCGGGCGCCAGCGAA 

ACGCGGGTCATTGGGGTCCACCTGAACAATTTCACCGGTCACAGGATCAATCATATTGTAAACATAAACGGTCTTATTGAAGAACTTAAGAACGC 

GGCCGGAGACGGGATCGATTTCGCATGTGGAGGGATCGATCTCCTCCGGTCGACCACGTTCGTCGGTCTTCGAGATAAGCAGGATGCGCACGATC 

ACGATCTTGCCCGTCTTGGGATCGACGCGGGCCAGCTTGGTGTAGACCTCGCCGGATTCGGGATCCACCTGGGTGGCCGCATACAACGGCTCCCC 

GGTGGCCTGATCCACCTCCCCGGTGGCAGTGTATATCTTTCCGGACGTGGGTTCGATTTTGATTGTGTTGGGGTCGAGCTTGGTCTGTTTCTTGA 

TCTCGTTGGTCTTGGGGTCCAGGTAGCCATAGATGGTGATGACGTATCCCGTCTTGGGGTCCACGCTGCTGGAGATGTACTGCTGCTCCTTGGTG 

GCAGGATCGGTGGCTCCAGTGGGCACCCAAATCTGGTTGGACTTGGGATCCACCACGATGTCCTTGTCCTTGGAGGCACTTACTTCAGTGATGGG 

CGCCTTAGTGGCCGGATCCACGCGCTGTTGCACGATCCTCAAAGAGATAATCCTGGCATAATCCTTGTCGGGCTTGTTGTTCTTCGGATTGACCT 

GGTTCTTCAGCACAATCTGTCCGGTGGTTGGTTCAATCTTAATGTCCCTAGTAACCGTCTTACCTGATTTCGGGTCATTGAACGTGATCGTATGC 

TTAACCAGATCAATCTGTCCATATTTGGTATCAATCTTGCCGGCCTTCGGATCGAACTGACCTCCAGAGGTCTCCACCTGCGCCGTCTTGGGATC 

GATGATGTTCCTCTTGGAATCGAACTTGCCTGCCAAGGAAGTGACCTCCACAATTGGGTCAATCTGTTGGCCCACCTCAATGAGGCGACCTTGGT 

TCGGATCGATCTTGTTCGTTTTGGGATCCACAATGCCCAGGATAGTGATCTGGCCAGTACCCGGATCCACTTTCACGTTCTTGCTGACTACAATC 

TTGCCAGACTTGGGATCAATGGCCTTAAGTTCGCCGGTGCGTGTGTTAATCTCTCCATACTTGGTATCCAGCAGGCCGGTATCCAAGTTCAGGAC 

GCCCAAGGAACGCTCAACATCTCCGTTAACTGTGTCAATCTTTCCGGTTGCAGGATCAATGCGACTGGTGATCACTGTTAGCTCCACCACAGGAT 

TATCCTGGGGTGTGATGCAAACGATCTGACCAAGGGTGTCGTCAAGGCGTCCGGTCTTTGGATCCGCCACACCGGATACCAAATGAAGATTGCCG 

GTCTTGCCATCAACATCGCCCTGGAACACTTCCTTCTTTCCGGTCTTGGTATTCAGAGCCTCCAATGTTCCCTTCTTGGGATCAATCACGCCATA 

TTTGGTGTCAATGAGTCCGGTGGCGGGGTCTAGAATGCCGGTGGAGTGCTCAATACTTCCGTTCTCCGCATCAATGCGTTTGGTGGAAGGATCGA 

ATTTGGAGGTGATGACCATGATCTTGACTCGCCTCTTTTTCGGCTTGGCCGCTGCTGCTGCAGCTCCTGCGGCTCCAGCTGCTGCCGCAGCAGCA 

GCTCCTGGAGCTCCCTTTCCTGTCTTGGTGGGAGAACCAGGCTTCTTAGAGCCAGCAGCACCTGCAAGACCAGCAGCAACATCAGCTTTGGAGGG 

ACCAACGTAACCGGGCTCTTTGTTGTAGCGCTGTGATTCCAGGAAATCAGCCGTGGGATCTGAGATGGGCTTGTAGCTTCCTGGAGCTCCAGAGG 

AGTAGCCTTGGGGTCCCTTGGTGGGCGAGTAGGATCGCTTCTGGGAATCGCTCAGAGGAACAGCAGCCGCATCCACAGCGCTGCCCTGGAGCTTA 

GCACGAGTGGTGGGCGAGAGGAGTGGATCCTGGAGTAGCTTAGCCCGCGACTTGGGCGACAATTGACCGCGGTTCAGCTTATCTTGGGTGCGTGG 

AGACAGCTGTCCTGCCTTCAGCTTCTCAGCCGTTTCCTTCAAGGCGTTCTCGTTACCAGGAGCATAGTTAAAGGCCAAACCAATCTTTTTTTGCT 

GCGAGGTAGGAGACAGCTGCTCCTGGCCATCGGCACCGGATTTCCTAGAGTTAGGATCCTGATCGTAACGGAAGCCACCAGGGGTGTATGATTTC 

CTTGTGGGACTGGTATTGCCATCATTATCCACGGCATACTCATACTGCCTAGTGACACCGGGAGTGGTGGATCCTCGGTTCTTAGCCGCATTCCT 
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AGCCGCCTCGTCCAGGTTTTGATCCTGCTGATTATCGCCCAGACCAGACGTAACGACCAGACCAAGTCGACCCACCTCCTTGTCCTTATCCTTTC 
CGGACTTATCCTTGCCATCCTTGGATGGGGAACCGCTCTTGCTCTTGCGACCCGAGGAGAATATGCCAACGCCCTGTGGATCGTCGTTGGATCGT 
TTTTTTTTCGAGGATCAAATGAAAAGGTCAAACATAAAACGAAAAGAAATGGGAGGTAAAAATTAACAAAAGGAAA 

ACCAATGAGTTCAACTTAAGATCAATTTTCGGGTTTCACTACAAACTATCATAACTTACGGGTGAAACTTAAGTGATTTGGTATGTGGATTTGGG 
TGGGTTTTGGGTCGGGGGGGTTAAGAGTATTTACACGAGGGCGGGCGCATTCGGTGCGGGGATATCGCTTAGCTCAGTACACCTAGTCTTGTGTG 
TTCCTATGATCTAGCTTAAGTTGGGGATAGTCAGATAGGATACTTACACGTCTGCCTGAGGACTTGTCAGACTTGTTGGAATCATTCAGCTCGTC 
ATTGCCTAGATAATATTTTCAGGGAGGTTAGTAAGATTGCTTCTTTAGATACTTTATCCTGGGGATACTTACCATTAACAGCCGCTCCTGCAGCG 
GCACCAGCGGCAAGTGCTGCCTTGGCAGCCTTTTCGGCCTTCTTCTTCTCCTCCAACTCCCGCTTCTCCCGCTCCTTTCGCTCCTTCTCCTCCTT 
CTCCTTTTGTTTCTTCTCACGCAACTTAGCCTCCTTATCAGCATCCTGAGAAAACATCGATCATTAGTTAGTTATAGTTTTTAGCGTATAATTCA 
GAGGAAACAGATATTTGGGGCGAAAAGGCGACATGCTACAAAATGGCTGAGATGTACAATATATATTTACAATACAAGCAACAAAATTATAATAT 
TTCGTTAATTAGGCATTACAGATGAAACTTTTTTAATACAACTGCATGTTTTGTGTGGACTGTACTGGTGTGGCCCTAAATGGTATTTACAGCTC 
GAGAAAGGACTCTTACAGACGTCACAAAATAATCAAATCAAAGTTAAATAAAACACGTAAAAAAAGTCAGATAGTTATGATGTAATGCACATTTA 
ACAGACAATTGACGGCACTGATTTCTTTTTTTTTACAAAAATTACCATCAGATCAAGGTACAGCTCGTTCGGGTTCTATCACATTGGTTAGAGGA 
TAGTCGGTTATGAAAACAAAAAATATAAATTAATAGGTTCAGAACATTCATAAATTACTGGGGTAAAGTAGCCCTAAGCTGAACAAATAAACGAA 
TAACTGAGGGAGACTTCAAATCAAATCGTTTCATACGTAAGCTCACAACATTCGAAAAGCAGAGTTCAAAGCCGATAAAAAAGTTAGGAAGGATA 
CATTGTAGAAGATTCGATTCGGAGGATACTTTTTTTTTGGTTTTTTTTTTTTTCGATTTTTTTTTGAGTTGTAGGTAGTAAAGTGTTAACCTGAA 
CGGGGGGCTCAGCCTCGATGGCCTCAATCTCTAGGTTGGTCTCGTAGTCACCCTCGAGGGAACTTTGCGAGGAGGCCGAAGCCGTGCCGGTGCTC 
GACTGTCGCATCAACTGGTTAGTGTAATGGGGGGTTAAGGTGTGTTTTTTTTTTTTTTTAGGGGACGAAGGGTTAACATTACGAGCACATCATTT 
CATATTCAGAATCACAGAGTGGATGTGTGTGTGGTTGTTTGGTTTGTGAGTGGTTGTGAGTTAGTAACAAACCGAGAAGAAAGTACAAAAAACTC 
TGGTTAACTCCGAAATCATCGTTAAACATGTTCATCTCAATAGCTATGTTTCGTCAGGCATTAGTTGGTTAAAAACTGGATGGGTTACCTTCTCC 
TTCTTGTTCTTGATGGGACTGCGGCTGTGGGCATCGCCATCGGCATTGCGATCTCCCCGATGATCCAGGGTGCTGCTCTTCCTAGCCACCTTTTC 
CTTTTCGGCCAAGGCCAGAGCTGAAAGGATCAGGTAATTAGAAACAATTCAAACCACATAGATTCAAAACTATCAGGTTATATATAAGCAAGCTC 
TACTCAAGACTAGGTGTAATTACCATCCATGCTCCTGGAAGTGAGACGAGCTCCGGACAGAGTCCTATTGAACTTCGGAGGAGTACGATCCACGG 
GTGTGTTCGTGCTCTCGGCCTGAGTGCGACCCTTGTAGCGATAGGTGGACCCGAAGACTGGGAACATCTTGGACTTGCTGACCGGTTCGGGGGTC 

AT C AGGCG GAAGAAGGT GTGGTGCTCCAC GC AGGAT TTCCACAGTTTCTTGGCGG C ACG AT GGT T GG C C AGC T T AAAGCC AAT GGT GG AT T C GT A 

CTGCTCGAATTCACCCGGTCGGATCTTGATGTAGAAATGGTGGCGCTTGTAGGAGATCTTCAGAATCTTGGGCCAGGCAAAGCGGTTGATGCGCA 

ATCTAGTTGGATAGTGGTTTAAATATATTAGCAATTGTTTAATTTAAATATAGTAAGAGCTGCTCACTTATCGCGGTAGACGAGCAAACCGGAGG 

CACAAACGCCCAGCATGATGTCCACGCCCTCAGAATCCTTAGCGGGATGCAAGTCCACGCCGTACATGGCCAGCTTCTTGGCATTCTCCAGGTAG 

TGTAGCTCAGCCTCGGCGGGCGATTGTCCCCTAGAATACAAAATATTAAAGTGAGTATAAATCCTTGATATAATAGTCCAAAACTTACTTGTGGG 

TCTTGTGAAGATCCATGACCTTATCCTCCAACTCAGCCGTCTGATTTGGAGCGATCTTGAAGTCCTTCAGGTAGGCCCTGGTGGGCATTTCCTCT 

GCATCGTAGTCGCCCATCTCCGACTGCACCAAGTAGGATCCGAGCAGAGCGTGGGTGACGAATGTGCAGGGCAGTCGACCCTCCAGGATGTCATT 

GCGCACCTGCAGGCACAAATGGTAGCGCGTGATGTCCTCCTTCAGCTGCGATGGCTCCGGCGGATAGAACTTGACGGCGAAGGTGAGAGGCCAAG 

TGTCCGTGCGGAAGAACTTGGATACCGGCTTCTCCAGATCCAGCCAAGTGCGCGGATCTGTGGGCGTCTCATAGGTCAGACCAAAGTAGTCCTTT 

TCGATGAGGTTCAGACCGGCGCAGATTGAGTTGATCACATCACGGCCAATGGCTTTGCGCTGTGGAAATCAAATTGAACAATTCATCATTGTAAA 

TGTTTTTAATCATAACCTTAAAGGGTTTGACTTGTTTAACACATCATCATCCCTGCTTACTCACATCAATGGACACGTCCAGAAGGGAGCCATCC 

AGCAGGGTGACTCGTGCTAGGGCAGGCTTTCCGTGCGAGGATGAGCTGGACTTGGGTTTGCTCTTGGTCGGCGTTTCCGGCTCAGCGGGCGCGGA 

TGGTTTAATTTCCGCCGGCATCTTGGATTGGGGGACTTGGCTACGCCAGCGACAGCTTGGAGTTGGCTGCAAGAATGTAAAGACTTTAGTAAATT 

GCACTGGATCCCAGTTAAATGGCCGGAAGTGTCCACTCCGCGGGGTCATGTGTCCCAGTGTGTGTGTGTGTGTGTGTGTGGGCGCCTTTCGCTCA 

TTAGAAGTCTGCAGTCTGGACATAAAACACGACATAAACTTTCTCAATTTGCCTGGCGCCACTGCGCAAATAAACTCAGCACACACACATTTAAG 

CACAGGACCCTCAGCCAGGATGTGTACAACTCCTCCTTCTCCACTCGCTGATGACTATGATGTCACCAGTTGCAATTGCGTACTCACTTCCAAAG 

AGCTCTAGAATTTCTCCGGCAATGATTTCCAGCGAGCTGTAAGAGAAATTTCGCCATTAAAAGCTGAACTGCAAATTGAATTACCATTTTACTAG 

CGAAAGCAGATAGTAAAAAAAACTTTCAGGTCTGAGACCATTACAATGAGTTGACCATAAACCGCACAACAGTCAGATTTCGCAGATCTATTATT 

AAGCAGAGTGAATACAGATAATGCCCCAGTAATTAAAGTCATTAAATATACAAAAGTACATGGGAATAAAAGTTGGGAACGTCCTTTGAGTAAGC 

CGAAAAGTCAAGTGATCGGCGACAGCTCACGTTCCAGATCTGGATTCAAGTTCAGCAGTGACATAAGTCTTCCATATATACATATATATAAATAT 

GTATATGTATGCTCCATATCGGTGTGGGTTATATAAGCCGGCAAGTCGAGGCAGCGTCATGTTCTCGCTTTTATCCCCTTCCGTTTCCATTTGAT 

TCCGTTTGCGGTCGGAATTTAGCGGGACTGCAAATTGCGCAAAATAATGTTGTTTTCTTCTGAAAAGAGACGATATCGGCATTTTGCCACGGATG 

TGGACACACTGCCCCACCCAGAAAAGAGCCTTCAATCAACGCTGATTGGGAATGTACTTGGGTGTAGCCATAGTTATGGGTATGGTTAACGTGCG 

AGGATCGAGAGTCAATTGAGCCCAGATGCGCTGACCGATTGCACAGTGAGTTCAATTTACAATCGGAGTGTTTAATTGGCTGGGAGTCATGAGCC 

AAGAGATGTGTCCATCGATCCCGCAGAGGAAACTAGCTGAACCAGTTTGCCACAGACTTCGTGATAGAGGTCTTCTATGGCGCTATAATACTGTT 

TCAGTTTCGGACACGACCCCCAAAGCGAATTAACTCTGGTGGCGTCATCGTGCGTTCAAAATTAAACAAAAACACACACATGAAATATGTATGCT 

GAGAACCAGAACCCGAAAACCAAAAAAAAAAAAAGCAGACAAACATCTAATTTATATTATTTTATTTATTCTCGCGTCTTTGTAGCGCGCAGACC 

ATGTGTATATGGACTCCATATAGCATTACTTTTGGCAGCGTGCTTGTTCTCATAGTTGAAGGTTGCGCATACGCAATGTTGACATCATCGACGGG 

CGGTTGCAACCACCTCGTTATTTTCACTTGGCCAAATGCAGGGCCCCATGAGTAGTGGGAATGGGTATGGGAATACTTGACTAAGATCGGCGGTG 

AATCAGCGGCATTATCTTGCCGTGCCGACTTACAAATTCGTTTGGCTGGCAAACCAAAACAAAGTAAATCGCTCGAATTGTCGATAGCGCATAGC 

GATTTTACAATGCGGGGGCATTCGTACATGATTTATTTTCGATGACATCATCAGCTAAGCGATCAGCACCAGCTAAGGCATAATATCAAGAGCTA 

TTTCTAGAGTTTATACAAATATATTTGCAGAAGATGCCGACACCCAAAAACATTTTCAGCACTTGGCGCATTCGCCTCAATTGAAGGAGGTCGAG 

GTGAAATTCATCAAGTGGCGCGCGTCTAATCTTTAGGTTACAATATTGCTGGCCAGGCGAGAATAAATACGTCCAAATTGAGCAGAGAAGAAGCA 

GCCTAACCAAGTGAGTAGAATATATACATATATATGTATATATATATCTGGTATTATTGTGAAAGTAACGAAGTTCTTGTGGCGGACAGAAGGCA 

GAAGCACAAAAGATGGCAATCGCGTCGTTTCAATTGAGATTCGGAATGAGAACATGGGCACTCGTACTTTGGGTGCAGCCGGCTCCCCTTTAATT 

AAGTTTCATTTAATACAACTTAAATCAAATTCAATTGGCGACGGGTCTGCGCCAGGTTTTTCATATCCGCGGGGATGGCTCATACATGTGTCATC 

GAGTTATTACTCTGACATCAACATGGGTCACCTGGTGGGCGGTGGCGTCACTTCCCCGGGAACGGCGTGCCCCCCACACATTGAATTTCTCCCTT 

GTATGTACATAATACATATATAGTACATTCCCTCTAGTCATTGTGACTAAACTTTCGCTTTTATGCCCATTAACATTAAAATTTCCCCCGGGGCT 

GTGAAGTGCACAAAACTTTTCAAACTTGCCATTTTCTGAGCTCAACGAAGCGTTGCCAATAAATCGCATTTAAAATGTATTATATACTGACTTCA 

CATCCGTTTATAAACATGGCCGCATTTCCGCAAGGGAAGCTGTTACATTTTCCGTTGTGGTGCATAAATTGAAATATGTACATTAATTTGACAGA 

CATCTGAGTGTCATTTTAGCTAATTAGAGTTATATTATTATATCAATTTAAGCTTTTAAGAATACTTTCATTCTCTTTATCACTTTACCCCTTTG 

AACCAATCATTCACTCGAATGAACAATGGATTTCATGCCTGGTGCGTCAGGTCTTCCAAGTTATTTTGTATATTTTTCGAGGAAATATGGAGTGG 

GAAATACGTAAACATTCGCGCATTCTGGACTGTGTGCTAAAGTGGGAAAGTGGGTCAATCAATACAGAACGATTAGAGGAGGGTGTGTTTCGCTC 

TTGCGAATCCATAGACTTATCGACGGAGTGGAACCGCCTGGCAGTCGAGGATGGCGACTCTATGGAACCCGTCAGTCGATAAGCCAGATCCGTTC 

CCATTTCCCATCGCACTGGCAGCCCTGAGACAGAGATGACCACTAGCTAGTAACTCAGAAGTGGATCAGCCACTTGAGGGGAACTGGTTTCTGGT 

CACCAGATGCCACTTTTCGACAGCTCAAGTGACGGCTCTGTGGAGCAGTGTGTCGCTTGACTAATTACCGCTATGATCGACGTTCGGATGACAAG 

GCGACATAACCGCACTGGATCACACCCACACATAGCCATCTGCACCACCATCGCTCCACCACCCGGCACCATCAGTTCTCCACTGTCAATGGCTA 
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TTTAAAGCATTTGACTTCGCCTGGCCAACGAACTCTGAAATGCATCCACAGTTGCTGCTATTTCAACACCATAAAATATAAGCTGAACCAAAATC 
ATTTTAGATGTTGTATCCGTATGCAAGTCTTTTTAAAATAATATATATAATATAGAAAAGGGTTGAAACAATTTTTTGGTTTAAGGAACATAACT 
ATTTTATTTAGTAGAACAGAATAAGTGGTTTTAGGTTAATATTTTTCACTGTGTTGGTATGCCGCACCCACATGCACATCGCGTTTTGATTTCTG 
CCCAGAAGGTGAGGGAAAAAGTGTCGCGAAGGGCGGAGAAAGTAAACAAACGAACCGCCCTCGAGGAGAATTTTGTCCACTTCAGCGCTTTGCAC 
AT T AAT CAT AG AC C T AT AG ACG C C C CAT C C GC AC AAGT AAAAAT C GAT GC G AC AAGG AAT AAAAAT AG AAAAC GG AAAAT C G AAT GGGC AT T TAG 
GAGGAGCGCAAGAGGGAAACTGATTGGCTAAAAAGGGGGCTGGAAAAGCTGCCTTCCATCCGATTAAGTTTGTTGAAAGATTAGATTGGTTTTTC 
ACCTCGCTGAAGGAGGGTGGAAATCTTCCGGTGAACTAAGAATAGCTAGCGATTCTAAAGAGGATTCATAAAGATGAAGATATTAAGTGCTACAC 
ATAGTAGGTGAGATCCCCAATCTTAACAACGATCTGGCCACGCTGGAAAATCGCTGGTTGGGGCTTAGTCTCCGTTTTCCGACGAGAGAATCCGA 
GAGAATACGGAGAGACGCGCTCTGGGAGCTTCTAACGCTGGAAGACACACCTAGTTGGTAACGCTCGAATTCGGAGTCGTGTTCGAATTCAGGTT 
TTCGAGGGGGCTCTGATTCCGACTCATTTTGTTTTTTTTTTAGCCATGTTCATTCGCGTGCTGCCAAGTCGTTAGCCCTCCGAAACTCTCTCACT 
CTCCAGATTTTTGTAGTATAGTTTTCCACTCGCTGGTTGTACAGTTATTGGCATTCAGTTGGCTGGCACTCCTCGTTTGCTTTGCTCGGCTGCTT 
TATCTGTTTGTCTGCTGTGCAGCCAACTTCTTGGCCAGCAACAACCGGTTTTTTTTATTTCTTTCTTTTCTAATGCAAGCTCAACCTCATGTTAG 
CCAGTTCGTTTCCATTCCCACTTTGCATGGCCTGGCCAAGAAGACGAAGTCTACATGTTAATGTTTCTTACTATTTGGGTCAAAGCCCACACCTC 
CCGGAAAAGGTGGAAGAAAAAAGACTTCACGTAAACAGCGGTTTTCGAAGAACGGTTTTTTGGTTGCCAAGTCAACGCCCCAGGTTTTCCGGGGA 
TGGTGAAAATCGTGCCGTGACTCAGTGGATGGGGGATGAGGGGACACCTAAAATATCGCCACCAGTCACCTGCCATTCATAACTGGCCTTTGTTG 
TGGTGGCCTTATCTTATCTTAGCCATGTTTCCTAGTGGCTGGGATAAAGACCGACGTTGTTTACCCCAGCCGATATCAAACATATGAAAAATGCC 

AAT GC AAA AAG C GCT C AAT GT T GT G AT AAG AAAT GT GG AAAC C T GC T C G C AAAC CC AAT GG AT TCTATTTT GGG AAC AGT G ACGT C T T GATGC T T 
GGGGAATTTAGCCGCCATATGAAACTATTATATATGCCATATGAATGCGACGAAATTCCAAAATTATATTTAAGCGAGAGAAGGAAATGAAAGGA 
ATCGGTTTCATAAAGTGCGCAGTTCTAGGAATTTTGATTT TTCGGTTGGTAGTTGGCCTTCTTT AAAAT AT AAATTCAAAGTATAAATTCTACTC 
AATTGATATTTACACTTAAGAAGTTTTCTATTCATAATCATCATTATGAATAACCTTTTCAATTTGCGTAGAATTCAAAAGTGATTGTATGCCTT 
TCCACATGCGTATCTTTGAGATACATATACATACATACGTTGTAGTTTTACCGATACTGAGTAGGTTTTATAAAGATATAAAGTATTTGCCCCCT 
TTTTTAAAGTTCACTGCTAAACGAAACTACCTTTGAGATACATTTGTATCCGATCGGTGTTGTGGAACGTTTCCTTGACTACCGCGGGCCCAATA 
AGAATTTCCCACCCTGCGCCCCGAAATGCGAAGCAAACGAGGAAAACACAGAAGAAGGAAAATTGGAGTAAGGCAAAGACAGAGGC AAAAT C AAA 
AGAGAGCTGGCGACAACAACAATGTGGGCCGAGTTGACAAAAACAAGAAGTCGAAAGAATAAGAATATCGTGCCAAAAGATACACTCGAAAAGCT 
TGACAGCTCACAGATACAGATACGCAGATACAAATGCCGCGTGAGTGAGTCGAGCTCTCTTTTCGCTCCCGCTCGCAAAGCTGCTCTCAATTAAC 
TCTGCTTTCGGGTGTGTGTGTGTGTGTGTGGTGTGTGCTATGAAGTCATACTGCCCAAGTCATTCTGCGCTTGTGTTGGTATCGAAAACATAAGC 
C AC C AAAAACT C C C G AC C AC AGCC C AAAAG AAAAAAACT T GAAT AAAAAT GT AAT T AAC CT AT GT C GC AAT CCC C AAAT T GT ACT C AC CC T T T T A 
AGACGCCTCTAGAATCCTTTTTAATCCACAGCAAGGACACGCAGTTACCGTTAGTTTGGCGGTTGGCTGATTTTTACTCCTGTATTTGTGTTGTT 
TTCACTTTTCTCCCGTGGGCACTGCACTCACTCGCTTTCACACTAGTAAACAAATTCTTTGCCACTTTTTTCTATTCGTTCGCGAAAAAAGACAA 
TTTGTTGTTGTCGTCGTCGTCGTCGTTGAATACCTGTTGAAAATTTGACGTTGAGTTTCTTTGAGTGAGTAATAGAGAACGCAAGAGTGGGAGTT 
GAAGGGGTGTGTGTGCTGTGAGGGGGGTTGGGCTCAGGGGTATAATAGAGAGCGGGGTGGGCGTGGGCGGAAAATGCGCGAATAGTGGTTGCTAA 
TAAATTAATTTCCCATTTTATTTATAAACACACTGCTGCCGAATTGCGCGCTTTTCTTTTCTATCCGCTCGTCGCCGTTTTGCAATTACTTTGCT 
GCTGGCCAAACAACAAATCTTCTCACAATTCTGGCAGCGCGATCACACAAAACGTACCGTCTTCAC AAAT ATT CGATGTGCTGTGCACGAAGAGC 
AAACTGGTACTGAAACCGAAAACTGAAACTGAGAGCCATCCGCTTGCCGCTGCTCGATTTGGATTTGAGTCGATTTCAGTTGAGACCACTGAGTT 
TTGCTGAGAAACCACCACCGAGTGGTTGCAGTGCTGGCAAATGCCGTTGGTTCATCTGAGTGCTTCGGTTTGCTTTTCAAATGTTTATAGCAATA 
GCGCTGTTTGATATCGCTACAAAATGTATATAAATCTATGTTTTCTTAATAAATGTATGGTCCGATTTCAATCGTTAAATTTTTTAAAAATTATA 
TGACCTCTAATGGAAATTCGAGTGATTTTTTATCTTTTATACATTTTTGTCGCATATATCAAGGCGCTATGTATTCGACATACATTTTTAATCTG 
TGCCCTTTTTTGTTGCTAAATTATCGCGCCTTATTATCAATTTGTATATTTTCGACCGAATAATAAGAGATCAAAACACATTTGTAAGAATTCCA 
AATTTATTGCGGTTTCTCCTACATAAAATTAATGTAACTATCTATAATTGCATTCGCTGTCTTGGTTTCTTTAACAGTTCGCAAAGTCGGAAATC 
TACTAATCTGAATAGGATAACATCCTATCGCTTGCGATATAAGCAAGGTTTGAGAGAGAAAGCTTTCATCTTGGTGTTCTTGGGCACATGACGCA 

TCTTCTTGAATTGGAAGAAGTAGAAATAGTTAACAGCCACATCCTTCTTGTTAAGATCGAATCCACATGCATTGAGGCAACGCACAAACTCCCGC 
ACATCCTGGAATCGCGACTGAATCTCCGCGATATAGACAGTGCCGTGCAGCTTGAGCACTCGATTGGCCTCCAGGAAGAACTCGTTCAGATCCGT 

GCCCATTAGTGAGAGGCAGTATACGGCCACGTCCAGAGTTCGAGCTTGGAGTGGAGTATCCGTGATGTTGCAGGCAATGATGTCGCTACGAGCAG 

CGACCAAGTCCATGGAGTACACCTTGTTGGGCACGGATTGTGCCAATTTGCCCTCGCCGCAA 

(SEQ ID NO: 787) 

Exon: 15687.. 15634 
Exon: 15328. .15099 
Exon: 9631.. 9588 
Exon: 9281.. 9090 
Exon: 8989.. 8544 
Exon: 8485.. 8333 
Exon: 8267.. 7909 
Exon: 7810.. 7689 
Exon: 7424.. 7311 
Exon: 6600.. 6438 
Exon: 6370.. 6318 
Exon: 2578.. 2357 
Exon: 2048.. 1893 
Exon: 1683.. 1573 
Exon: 1497.. 1001 

Start ATG: 9236 (Reverse strand: CAT) 
Transcript No. : CT22161 

CAGTACCAGTTTGCTCTTCGTGCACAGCACATCGAATATTTGTGAAGACGGTACGTATTCAACGACGACGACGACGACAACAACAAATTGTCTTT 
TTTCGCGAACGAATAGAAAAAAGTGGCAAAGAATTTGTTTACTAGTGTGAAAGCGAGTGAGTGCAGTGCCCACGGGAGAAAAGTGAAAACAACAC 
AAATACAGGAGTAAAAATCAGCCAACCGCCAAACTAACGGTAACTGCGTGTCCTTGCTGTGGATTAAAAAGGATTCTAGAGGCGTCTTAAAAGGC 
TCGCTGGAAATCATTGCCGGAGAAATTCTAGAGCTCTTTGGAACCAACTCCAAGCTGTCGCTGGCGTAGCCAAGTCCCCCAATCCAAGATGCCGG 
CGGAAATTAAACCATCCGCGCCCGCTGAGCCGGAAACGCCGACCAAGAGCAAACCCAAGTCCAGCTCATCCTCGCACGGAAAGCCTGCCCTAGCA 
CGAGTCACCCTGCTGGATGGCTCCCTTCTGGACGTGTCCATTGATCGCAAAGCCATTGGCCGTGATGTGATCAACTCAATCTGCGCCGGTCTGAA 
CCTCATCGAAAAGGACTACTTTGGTCTGACCTATGAGACGCCCACAGATCCGCGCACTTGGCTGGATCTGGAGAAGCCGGTATCCAAGTTCTTCC 
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GCACGGACACTTGGCCTCTCACCTTCGCCGTCAAGTTCTATCCGCCGGAGCCATCGCAGCTGAAGGAGGACATCACGCGCTACCATTTGTGCCTG 
CAGGTGCGCAATGACATCCTGGAGGGTCGACTGCCCTGCACATTCGTCACCCACGCTCTGCTCGGATCCTACTTGGTGCAGTCGGAGATGGGCGA 

CT ACG AT GC AG AG G AAAT G C C C AC C AGG GC C T AC C T G AAGG ACT T C AAG AT C G C T C CAAAT C AG AC G GC T GAG T T G G AGG AT AAGGT C AT G GAT C 
TTCACAAGACCCACAAGGGACAATCGCCCGCCGAGGCTGAGCTACACTACCTGGAGAATGCCAAGAAGCTGGCCATGTACGGCGTGGACTTGCAT 
CCCGCTAAGGATTCTGAGGGCGTGGACATCATGCTGGGCGTTTGTGCCTCCGGTTTGCTCGTCTACCGCGATAAATTGCGCATCAACCGCTTTGC 
CTGGCCCAAGATTCTGAAGATCTCCTACAAGCGCCACCATTTCTACATCAAGATCCGACCGGGTGAATTCGAGCAGTACGAATCCACCATTGGCT 
TTAAGCTGGCCAACCATCGTGCCGCCAAGAAACTGTGGAAATCCTGCGTGGAGCACCACACCTTCTTCCGCCTGATGACCCCCGAACCGGTCAGC 
AAGTCCAAGATGTTCCCAGTCTTCGGGTCCACCTATCGCTACAAGGGTCGCACTCAGGCCGAGAGCACGAACACACCCGTGGATCGTACTCCTCC 
GAAGTTCAATAGGACTCTGTCCGGAGCTCGTCTCACTTCCAGGAGCATGGATGCTCTGGCCTTGGCCGAAAAGGAAAAGGTGGCTAGGAAGAGCA 
GCACCCTGGATCATCGGGGAGATCGCAATGCCGATGGCGATGCCCACAGCCGCAGTCCCATCAAGAACAAGAAGGAGAAGTTGATGCGACAGTCG 
AGCACCGGCACGGCTTCGGCCTCCTCGCAAAGTTCCCTCGAGGGTGACTACGAGACCAACCTAGAGATTGAGGCCATCGAGGCTGAGCCCCCCGT 
TCAGGATGCTGATAAGGAGGCTAAGTTGCGTGAGAAGAAACAAAAGGAGAAGGAGGAGAAGGAGCGAAAGGAGCGGGAGAAGCGGGAGTTGGAGG 
AGAAGAAGAAGGCCGAAAAGGCTGCCAAGGCAGCACTTGCCGCTGGTGCCGCTGCAGGAGCGGCTGTTAATGGCAATGACGAGCTGAATGATTCC 

AAC AAG T C T GAC AAGT CC T C AGGC AG AC GT G C T G AT GCT AC AC C C ACC G AC C T G AGT GG C GC CT AT GT C AC GGC C ACCGC AGT C AC C AC C CGC AC 
CGCCACCACGCACGAGGATCTGGGCAAGAACGCCAAGACGGAGCAGCTGGAGGAGAAGACGGTGGCCACCACCCGCACCCACGATCCCAACAAGC 
AGCAGCAGCGCGTCGTTACCCAGGAGGTGAAGACGACGGCGACGGTCACCAGTGGGGATCAGAAATCTCCGCTGTTTACGACCTCCGCCACCACT 
GGACCCCATGTGGAAAGCACCCGAGTGGTTCTCGGTGAGGACACACCCGGATTCTCCGGACATGGCGAAATCATCTCCACCCAAACCGTGAGCAG 
CAAAACCCGCACTGTGGAAACCATTACCTACAAAACCGAACGCGATGGCATTGTGGAGACCCGAGTGGAACAGAAGATAACCATTCAGTCCGATG 
GAGATCCGATCGATCATGACAAAGCCTTGGCCGAGGCAATACAAGAAGCGACGGCCATGAATCCGGACATGACAGTCGAGAAGATTGAAATTCAG 
CAGCAAACGCAGTAGATTTTATAATTATTTTTAAACCATTCGGATGATTTTCTAAGTAAACCCGCACACACACTCACACACATGAACACCAGCCC 
ACGCATACGCAGCACTTGCAGTACACCTTTTATACAACCTATATCTAAATATAATTTAAACAAATATATATATATATACACACGAGTGTATATTT 
GAGCTCGAGCTGATGACCCACATGACATAACACTAAGAAACATAAAAAGATAAATTCCCAATTATCCGTTACCGCAACCGTCTACGAGAAAATGC 
ATCTACATGTTTATTATTTTTATTACAACTGTTGTTTATAATTACTTTTTATTACGTATATTATATATTGAATGTGCAAGGCAACGTTGAAATGA 
AAACTCTATATATTGTAAACCACGATTGCATTCAAATACAAGAATAAAACCTAAATGTACACACAC 
( SEQ ID NO: 788) 

Start ATG: 374 {Reverse strand: CAT) 

MPAEIKPSAPAEPETPTKSKPKSSSSSHGKPALARVTLLDGSLLDVSIDRKAIGRDVINSICAGLNLIEKDYFGLTYETPTDPRTWLDLEKPVSK 
FFRTDTWPLTFAVKFYPPEPSQLKEDITRYHLCLQVRNDILEGRLPCTFVTHALLGSYLVQSEMGDYDAEEMPTRAYLKDFKIAPNQTAELEDKV 
MDLHKTHKGQSPAEAELHYLENAKKLAMYGVDLHPAKDSEGVDIMLGVCASGLLVYRDKLRINRFAWPKILKISYKRHHFYIKIRPGEFEQYEST 
IGFKLANHRAAKKLWKSCVEHHTFFRLMTPEPVSKSKMFPVFGSTYRYKGRTQAESTNTPVDRTPPKFNRTLSGARLTSRSMDALALAEKEKVAR 
KSSTLDHRGDRNADGDAHSRSPIKNKKEKLMRQSSTGTASASSQSSLEGDYETNLEIEAIEAEPPVQDADKEAKLREKKQKEKEEKERKEREKRE 
LEEKKKAEKAAKAALAAGAAAGAAVNGNDELNDSNKSDKSSGRRADATPTDLSGAYVTATAVTTRTATTHEDLGKNAKTEQLEEKTVATTRTHDP 
NKQQQRVVTQEVKTTATVTSGDQKSPLFTTSATTGPHVESTRVVLGEDTPGFSGHGEIISTQTVSSKTRTVETITYKTERDGIVETRVEQKITIQ 

SDGDPIDHDKALAEAIQEATAMNPDMTVEKIEIQQQTQ* 
(SEQ ID NO: 789) 

Name : coracle 

Classification: actin_binding 
Gene Symbol: cora 
FlyBase ID: FBgn0010434 



Celera Sequence No. : 142000013384322 

TCTGAATACAATTAATTACTATAATTAGATTATCGTAAACTTATATTTGATAAGGATAGATGGACTCTTTTATTAACTTGTCAGTCGAAAAATTA 

AAACCATAATTTTATTAGGTTTTCTTTTTTATTTCTAGCGAATTTTGTATTAACAGTTTCATAAAGTTCAAGGGTCAAAAGTCATCATGCAATCT 

TTTTATTTCAACAATAAATATTTCTGGGACTAGAAAAACACCAGATGAGAAAATATCAGATTAGATCTAGATAATTTATTCGCCCCGTCATTACC 

AGCTTATTTACCAATCGCAAGTCTTATCAATATCGTACTATGTTCGAATTCTCCCAGTTGGACTAGAAAAAAAAACGCTTGTGAAATTCGTTTGG 

TGAAAGCATAAATATTGTTATTATATATAATTGTGGGCGTGGCCTCAGCTTATGGCGTGTTCGGCGATCGCGCCACCTCAATATGTTAATTAGGT 

ACACATGTACCTATAAAGACTTAGAGATATTTATATTAAAGTCAGACAATGGATCTATTAAATATGAGAAGAAAAATCCAAAAAAACATCACGCA 

ATTGCTATTTTAATTTGGAAGTAAATGGCATTGACTGAGTAGATAAGAATGGTAGACGTTTCAAGATAATCAGAGCGAGCCTCCTTCGCAACAAA 

AAAAAAAAAAGATTGGCGCCCAAGTTCAAATATTATCTAAGCTTTAACATGTCAGTTCGACTATCGTTATCGATAGTACCGCACGTACCAGCTGA 

TTTTGAATTCGGGTGAAGTTGAATAAATAAGCTTAATGCCACGCGCTTTTCTAATTTGTATTCCTTATCAAAACAAGTCGGCCGTGAACTTTAAA 

TAAAAGTGGCTAGCCCGCAATAGTGGAAAAGTTTTTCGCCAGCTGATAAACGCAACAGTTGTCATTCGGAAGAGAGGGACAATGCCACCCGCAAT 

GCGAAAGCGATTGACAAGTCAGCCGGCGATAACGATTACATGTTTACGCGTTCGCCGAGTGCTTATTGTTTTTTATACTCTCGTTACATTATTAA 

TTAAAACACTTGCTTTTAAGTGTGCCGAGTGTTGTGAGAGCAGCTTATCAAAGTCCGAGGTTCTAAATCTTATCTGACTGGGCATTTAGAACGTT 

GTGATTACGCCGATTGCGCAACAATTTGACGATTGTACGCCACCCAAGAGAGTCGGAAACTGTAGCGCTTAACGGCACTTTACAGTTATCAGTTA 

CGGTTAGAGCGCGTCTGACGCCCACAGCATTTAATTTTGCTTTGTCCGAAAATTGTTCTCTTTCCCGGGACTCCAGTGTCATATTGTCTCGTATT 

GTTTCACATTGGTGACAAACACTATCGTCTCTCGTTTCCAGTCGTCGGTCGGTTCATTCATAATAATAATACCCACATCCACACCAATCCGGTTG 

TAAAATGTTCGCCCTGCGCCGCACAGCCGCAATGATGACGTCGGTCCACCGGCAGCCGCATCTCAGCCAGGCCGTTTTTCGGGCCAATTTCGCCA 

TCAGCGCCGCGGTAGGTCCTCCTTGTATCCTCGCCCTCTGTCGCTCTAAATCTCTTTGCTTACTAAATTTCGGAATAGGGTGGGTCGATGAGCAG 

GAGAAAAAGATATTTTGAATTTTTTGAAGTCGGTCTCAATATTAAGATTATAAACAATCACGTAGTCACCTGTGCTGCTAGCATCCGATAAATAG 

GATTACTATATTTACTGTTTATACAAAAAAATTGAACAATTTTTTCTTAAATTTATAACCTATGAATGCAAAGTAGAGATATTAACTTTTAGGCA 

AGTACCTTAATAATTACGTCGTCACGACGAAATATTAAATATTACAAAGTAAAGATAAGGGATTACTTTTGAAACACGGTGTATTTTAATTTCCA 

TCCATCAATATCGACCCACCCTATCCCAGGCTGATCAAAGTTTACTTTGCTTTGACCAGTTGTCAGTTTATTATCGTCATACATATAAGTTTCGC 

CTGTATCGCCTGTGCTTAGGACCCCAAAACCCATAAGATATCCATGGATAATGAAAGGATGTCCGCCGTGAGCGAGGTGAATACCCCAATACCGC 

AGCATTCAGACACAAATCCATTTTTCATGTAGCTAATAAGATTCCCATCCTGCCCTTCAAGCACTCAAGTGTGGTATGCTCTCCCGAACAGATGG 

CCCAGAAGATCAAAGCCGGTCCCGTTGTGGATGTCCTCGGTGACGAGATGACCCGCATCATTTGGGACTCCATTAAGAGCCAACTGATTCTGCCC 

TTCCTGGACATTGAGCTGCATACCTACGATCTTGGTATTGAGAACCGCGATAAAACCGAGGACCAGGTCACCATTGACTGTGCTGAGGCCATCAA 

GAAGTACAACGTGGGCATCAAGTGCGCCACAATCACTCCCGACGAGAAGCGCGTAGAGGAGTTCAACCTGAAGAAGATGTGGAAGTCGCCCAACG 
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GTACCATCCGTAACATCTTGGGAGGAACCGTCTTCCGTGAGGCAATCATCTGCAAGAACGTGCCCCGTCTGGTGACTGGCTGGCAGAAGCCTATT 
GTGATCGGTCGCCATGCCCACGCCGATCAGTACAAGGCCGTCGACTACGTGGTTCCCGGACCCGGCAAGCTGACCCTCACCTGGAAGGGAACCGA 
CGGTCAGGTGATCGATGAGGTCATTAATGACTTCAAGGGCCCCGGTATTGCCCTGGGAATGTTCAACACGGACGATTCCATTGTGGACTTTGCCC 
ACGCCTCTTTCAAATATGCTCTGGACCGCAAGCTGCCGCTGTACATGAGCACCAAGAACACCATTCTGAAGAAGTACGATGGACGCTTCAAGGAC 
ATATTCGAGGACCTGTACAACAAGCAGTACAAGAAGGAGTACGAGGCGGCGGGCATCTGGTATGAACACCGTCTCATCGACGATATGGTTGCTTA 
TGCCATGAAGTCGGAGGGTGGCTTCGTGTGGGCCTGCAAGAACTACGACGGTGATGTGCAGTCGGACTCCGTTGCCCAGGGCTACGGATCTCTGG 
GTCTGATGACCTCCGTGCTGCTATGCCCCGATGGCAAGACCGTGGAGGCTGAAGCCGCTCACGGAACGGTGACGCGTCACTTCCGCTTCTACCAG 
CAGGGCAAGGAGACCTCCACCAACCCCATTGCCTCGATATTCGCCTGGACCCGTGGCCTGCTGCATCGTGCCAAGCTGGACAACAATGAGCCACT 

G AAGC AG T T C GC C GAC AC C C T T G AGC AGG T GTGC AT C G A C AC AAT T G AGAG C GG C GCC AT G AC C AAG GAT CT G GCC AT CT GC AT T AAAGGC AAC A 
TCAACGCCGTTACTCGCAGGGACTACCAGGAGACTTTTGAGTTCATTAACACGCTGGCCAAGAACCTGGAGGGTGCCCTGGCCAAGAATGCCGTT 
GCTGCCAAATGACTATCCGGCACTCAGTCCGAGCAGCAGGCCTCGCATCTCTAGAAAACTACTCAGCGATGCCTTGTCCTCGTCACCATATTCAC 
AATATTTCCATTCGTTTTTGTAGATGTGCATGTCCTAGGCTAAAAATCCGTGTCCTCGATGCATGTTTCCCACTGTTTCATTGTCTCTGCGAGAC 
GCAAAAAAAAACCCTGATTTCCCTTTAACGTCTAACGGAAAAGAAATTTCAAATAAATGTATTTTTATACCGGACTCGAGAGGTCTTTAACCCAG 
TCGAGGGTGGGCTCGTTACGAGGCCCGGTCCACTTGTGCGGCTTCTGATGGCTCTGGGTTTATTGGATTTGCCAGCGAAATGGCCACAGTTGTGT 
TCTAACACTTGGCGGCACCAGACCGAGACGGTTCAAGCGGAAGGGGCTCAGGATCCTGTTAGAAGGACGAAACTATCGATGCTAGCCGAAGCATT 
GGGCTATGGCTACCTGCTGGCCTACGTGATCTGCCTGCTCTGCTGGCCAGGAGAGGCGTATCCCGCCAGCAGTAGCTACCGGGATCTGGATATAT 

GC AAT CAT T GGG AC GG AAG AC GT C AC T T CC T GG AAC T GGG AAG T C C AGCT GG AG AGGT GC ACGC C AGGAAT GT G AC T AC T AC T GC C T AT AG AGT A 
AGTGTATTTACAAGCTTCTCCAGACGGGATTTGGTCAGCAATTATGAACTTAAAAACAGAGCTCGCCGCTGGTTTTTAAGAACGATGCAGTGGCC 
GGGGATGTGTGGTACCAGTGCAGCCTGGAGCTGGTCACCTGTGCCGAGTGCGTCATCCGAGTGGCCTTCACCTATGCCAACTTCTCCAAAAGTTG 
CGGCAATACGGGTGGCAAGTCCAGCATGTGTCCCTGCGAGCATATCCAGTTCTCGGAGCCGCCCTATGATTCCACCATTTCCGGCCAAGAGTTCT 
GTGGAGATGGCAAGGTGTTTCGGAGCAAAACCAGGACACTGCAGTTGAAATTCTTCTACAGA 
(SEQ ID NO: 790) 

Exon: 1001.. 1531 
Exon: 2182.. 3432 
Start ATG: 1430 

TTCGCCGAGTGCTTATTGTTTTTT^ 

AAGTCCGAGGTTCTAAATCTTATCTGACTGGGCATTTAGAACGTTGTGATTACGCCGATTGCGCAACAATTTGACGATTGTACGCCACCCAAGAG 
AGTCGGAAACTGTAGCGCTTAACGGCACTTTACAGTTATCAGTTACGGTTAGAGCGCGTCTGACGCCCACAGCATTTAATTTTGCTTTGTCCGAA 
AATTGTTCTCTTTCCCGGGACTCCAGTGTCATATTGTCTCGTATTGTTTCACATTGGTGACAAACACTATCGTCTCTCGTTTCCAGTCGTCGGTC 
GGTTCATTCATAATAATAATACCCACATCCACACCAATCCGGTTGTAAAATGTTCGCCCTGCGCCGCACAGCCGCAATGATGACGTCGGTCCACC 
GGCAGCCGCATCTCAGCCAGGCCGTTTTTCGGGCCAATTTCGCCATCAGCGCCGCGATGGCCCAGAAGATCAAAGCCGGTCCCGTTGTGGATGTC 
CTCGGTGACGAGATGACCCGCATCATTTGGGACTCCATTAAGAGCCAACTGATTCTGCCCTTCCTGGACATTGAGCTGCATACCTACGATCTTGG 
TATTGAGAACCGCGATAAAACCGAGGACCAGGTCACCATTGACTGTGCTGAGGCCATCAAGAAGTACAACGTGGGCATCAAGTGCGCCACAATCA 
CTCCCGACGAGAAGCGCGTAGAGGAGTTCAACCTGAAGAAGATGTGGAAGTCGCCCAACGGTACCATCCGTAACATCTTGGGAGGAACCGTCTTC 
CGTGAGGCAATCATCTGCAAGAACGTGCCCCGTCTGGTGACTGGCTGGCAGAAGCCTATTGTGATCGGTCGCCATGCCCACGCCGATCAGTACAA 
GGCCGTCGACTACGTGGTTCCCGGACCCGGCAAGCTGACCCTCACCTGGAAGGGAACCGACGGTCAGGTGATCGATGAGGTCATTAATGACTTCA 
AGGGCCCCGGTATTGCCCTGGGAATGTTCAACACGGACGATTCCATTGTGGACTTTGCCCACGCCTCTTTCAAATATGCTCTGGACCGCAAGCTG 
CCGCTGTACATGAGCACCAAGAACACCATTCTGAAGAAGTACGATGGACGCTTCAAGGACATATTCGAGGACCTGTACAACAAGCAGTACAAGAA 
GGAGTACGAGGCGGCGGGCATCTGGTATGAACACCGTCTCATCGACGATATGGTTGCTTATGCCATGAAGTCGGAGGGTGGCTTCGTGTGGGCCT 
GCAAGAACTACGACGGTGATGTGCAGTCGGACTCCGTTGCCCAGGGCTACGGATCTCTGGGTCTGATGACCTCCGTGCTGCTATGCCCCGATGGC 
AAGACCGTGGAGGCTGAAGCCGCTCACGGAACGGTGACGCGTCACTTCCGCTTCTACCAGCAGGGCAAGGAGACCTCCACCAACCCCATTGCCTC 
GATATTCGCCTGGACCCGTGGCCTGCTGCATCGTGCCAAGCTGGACAACAATGAGCCACTGAAGCAGTTCGCCGACACCCTTGAGCAGGTGTGCA 
TCGACACAATTGAGAGCGGCGCCATGACCAAGGATCTGGCCATCTGCATTAAAGGCAACATCAACGCCGTTACTCGCAGGGACTACCAGGAGACT 
TTTGAGTTCATTAACACGCTGGCCAAGAACCTGGAGGGTGCCCTGGCCAAGAATGCCGTTGCTGCCAAATGA 
(SEQ ID NO: 791) 

Start ATG: 430 

MFALRRTAAMMTSVHRQPHLSQAVFRANFAISAAMAQKIKAGPVVDVLGDEMTRIIWDSIKSQLILPFLDIELHTYDLGIENRDKTEDQVTIDCA 
EAIKKYNVGIKCATITPDEKRVEEFNLKKMWKSPNGTIRNILGGTVFREAIICKNVPRLVTGWQKPIVIGRHAHADQYKAVDYVVPGPGKLTLTW 
KGTDGQVIDEVINDFKGPGIALGMFNTDDSIVDFAHASFKYALDRKLPLYMSTKNTILKKYDGRFKDIFEDLYNKQYKKEYEAAGIWYEHRLIDD 
MVAYAMKSEGGFVWACKNYDGDVQSDSVAQGYGSLGLMTSVLLCPDGKTVEAEAAHGTVTRHFRFYQQGKETSTNPIASIFAWTRGLLHRAKLDN 
NEPLKQFADTLEQVCIDTIESGAMTKDLAICIKGNINAVTRRDYQETFEFINTLAKNLEGALAKNAVAAK* 

(SEQ ID NO: 792) 

Name: NADP dependent isocitrate dehydrogenase 
Classification: enzyme 

Celera Sequence No. : 142000013384534 

TTTAATTATACCATTCATATGCTTGCTTTCGTTTTGCTGCCTTGCTAAGATAAAAAAAAAAAACAAAAACAATTGAACTGCGATTATGTGGTATG 
GTCTGATTTCTACATGAGCAAGTGAATTGAATCGGTATTATTGGGATCTTGGGAGTTTGCAGTGAGGTGAGTGCTTGCAACAACTAAACTAAACA 
CAAGGCTCACTAAACTTGTTGCATACATGTTTAACATTACTTTAAGCAAACAGTCGAATGTCCGCAAGGCGAACTATATTCGTTACATAGATCTA 
GTCCGACTTACGAATATTCGACCGCATTTGTTTGTTCATTACGATTACAGTTTTTGTTGCTTGGTTTTGTTTTCATTACATTTCAGTATTATCTC 
AAGTTTTGCCGGAAAACACTTCCATTCTGATCACTGGCCACTGCCCAGCTGTGGCCTTCTCCTGAGTTCCACTGCCATTTTTTACGAGTTAAAAT 
TTCCAAACAAAAACTAAAGTTAAGCATAGAATACAAGAACGAAACGAACTAATCTAAAATCTATAGCGTGAAATCCAGGACTAAGGACAGTGGCA 
GTGGCAACAGAACACTTCTGCGCCCGTTTGATCCTTTCACAATTTTACATTTGCTGCTTTTAGTCTATTATATTTAAGGGCTTTGCTTTTTCGCT 
TATTTTTCTTTTTTTTTTTGCTTCGCTTTTTCTATTTCTCATACATGCGTTAAATGTTAATTACTTTATGCTTATGTATCAAGGATTTCTACTAC 
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CGCGTTCTTATTTGCTGCGTGTTGTGGTGTTGCGTGTGGTCATCGCCTTCAGAAATTGTTGTTGCCATTACTGAGCAGATTGGAGGCGGCGGCCC 
CAAGAGCAGCAGCCGCAATGGAAGCGGCCGAGGAGCTGCCCGCCGCCGATCCGTTGGACGCCGCAGCCGCCACCACAGCTGCCACAGCGGCTGCC 
ACCGTGTTGAGATGCTGCTGATGCTGCTGCTGCTGTTGCTGGTGATGTTGTTATGGCATAAAGTAGTCCGGATGATCTGTGTCTTTCTCGAACCG 
CTTGGCCTCCTCGTGCATGCTCTCCTTGATCTTCAGAATGGCCGCCGATTCCGTTTGATCGTTCTCGCCGGGTCCGCCGAATCCGCCCAGATTGT 
AGTCACCCATGCCGCTGCCAGAGTCTTCGCGCGGCGTCCCAGGTCCTCCGTTGGCCGGCGAGTTTTTCATGCCATCCAAACCGGATCCGTCCGGA 
CCGCCGCCGCCATTAAGTACGGGGCCCATTGAATTTGGCCCGCCGCCCATCGGTCCCATTCCGCCGGGACCACCTCCGCCACCGCCGCCATCCGG 
CCCTCCGCCCATCGGGAACTCGGGTTTCATCAGGGCGTACATGCTGTCGCCCGGTCCGCCCACTGGTCCACCTTGCGTGTTATCCTGCGGCGACG 
GCATTATTGGCGTTCCCGGTCCCGGCGGTCCATTCGAGCCGGGTCCATAGTTTCCTGGCGATGATGACGAGTACGCATTTAGTGGCGCCGAGGCG 
TTTGGCTGCCATTGCGGCGGTCTTCCGCCCGCTCCGCCCATGCCCATGGGCGGCATTCCGCCGGGTCCCGGTGCTGGGCCGCGCATTCCTCCTGG 
TCCACCATAGCCCATCGGTCCCATGCCACCAGGACCGCGTGGCGGATTCATGCGCGGATTCATCGGTCCCATCCCTCCGCCTGGCCGTGTCGGAT 
CCATACTGTTGGGCATCATGGGCTGGCCGGGGGGTCCATTGAATTCATTGCCCATGCCCTGCATACGCACGCCGGGCCGCGGTCCGCCCGGGTAT 
CGGGGTCCGCCCATGAACGGCGGCTGGCCTGGCATCATTTGCGACGGCGGCGGTTGCGGACTGGAGGCGTGCGTCGGCGGCGACGGTCGCATCGT 
CGAGTTAGGGAAGAAATTCGGTCCCATTGGTCCGCCGGGACCCATTGGACCGCCATCGCCGCCGGGCGGCATTTGCCCGAGGGGACTGGGCGCTG 
GTCCGCCAGCATTGTGGGGTCCACCCGGTCCAATTCCATTCACTCCGTAGCCGGAGCTAACAAAGCCGTAGTCGTGGAAGGCCTTGGCCTCCGAG 
CTGTGATCGCACTGATCTCGGCGCTCTGGCGCCGCACAGTACAGGTCCCAGAAGACGCACCACCAGGTGTGCAGGAATCCCGGCGGCTCGCCCAG 
CGTTATGTTCTTCTCCCATCGGATCTCCGAGAGGAATGTCTGTGCCGCCTTCTGGGCGCCAACGTGCAGCAGATATTCGTACACGTACAGGGCCA 
ACTTTTCGCGGGCCTGGGCGTCCGATGGAACCGCCGATGTCTTTGATTTGCCGTACATTTTGTTTGTGTGATGTTGGTTGTTGGCTGTAGTTGCG 
TTTTGTTTCTGCTGTTGTTGTTGTTGATGATGATGTTGCTGTTGTTGCTATTGGTGCGCTGGGTAAGGTGGTTTTAAGATTTCGGAGTTTGGGTT 
TTCTCGCCTCTCTTTCGTCGCTAGTTCGCTTTTTGTCTACCGACTTCGAGCGTATTTAAATTTTTATACTTTTTTTATGTACTTTTCGAGGCTCT 
TGCTGTCTTGCTGCCTCCTCTTCGGTTGTTTCAGCTTCAGCTTCTTCGTCGTCGACGTCTTCTTCTTCTTCTTCCCTTTAGCGGCTGCTACTGCT 
GTGCTGTTGTTGTTTTTTAATTGGCCGCGCTTTGGCTCACGCAGCCGTTCTCCGCCGCTCTATTACTTCCACCATTTTTAATAAAATATATATTT 
TTGCGAATGGAGAATGAAACGCACCGTGGTCTTTACTTTTGCCTTTACTTTTCAGATTTTACTTAATAAACTTTTTAGCGCGCTCTCGCCGTACG 
TGTGTGCGTGTGTGTGTTGCCACTTCTTCGTTTTACCGTTTATTTTTTTCAGGTATTTTTTTGCCTGATTGTTGCTGCTTCTTCTTGCACAACCA 

CT AAAAAC AG AAAAAG AAG AGC AG AGGGAG AG AGAG AAAAAG AG AGC AT G AGT GAG AG C G AGC G AC AGT T C GT GT GT AT GT GT GT T GC AAAT T T T 
CGTGAGCCTGTGTGAAGTGCGGTGTGTCGTGTATTTGTGTCAGTGTGTGCGACCGTACTTGCGTGAGTGCGAGAGAGGGAGCGCGCGCATTGTCG 
ACAAAACGGCACAAGTGCCACACTAACCGTCAAAACTTTTGTACCACCTTGAACATTTTGCTAAACAGTTTAAATAAAATCAGGAGTAAAGAAGT 
GGCGAAAAATTTATTTCTAATTTAACACTTTAAACTCAATTTAATGACGTAATTTTTGTATTTTTTCTTAAAATTCTTAAAATTTCTTTTCGTGA 
CACATAAATTAAATTGGAAATTAAAAGTGTCTCGCTTTTACTTTGATTAAGTAATTGGATTATTTTGATTTATTTTAATTAAAAGTATAACTTGC 
TAGTGCATTTCTTAAGGGGAAGACGACCTTGTCACAAACTATTTTGATAACCATTTAGAATTAGAACATTAATCTTTTACAACTTTTTACTGGGT 
TTTTGTAAAACAAAAATTAGACTAACAGAGGTTTAAGTACTTAACGTGTTCGCTTTCACTAAGAGTTTGTTAGAGCTGTATTAAAGGTAAAAACT 
TTATTATTTGGTGGAATACTTTTAGAATCCACTTTGTTGGATTCCCTTCTCCCTCGTTCCACTCAGGTGAGTGAGGAGCAGACAGTGATGCTCTT 
GGCCTCTCGGCTTAGTACACATTGTACACAGGTAGGTTGCTACCTAAAAGCCTAAAGGCCAGCGGGCAGCATAGTCCGCCTGTCAGTCAAACAGA 
CATTCAAATCAGAACTCGTCTCCGGGGTCCCAAAATATTGGTGGCACTGTGCGGGGGCGAGCGGGTGCAATTGGGGTTGGGTGGGTGGTTGGTTT 
GGCAGAGTGACAGCTCGGAGAGACAGGGCGCGTAGCGATAAGGTCAATTGGCTCTGCGGACAGGTGCATCAGTTACTAGGCACTATCGGCTATAT 
ATCCAATGGCTGGGAAAACTGTACGACCGATTTGAATGCGGTCGAAGGCGAACTCAACTCAAACTAGCGAATGTACACAGAGAGCGAAAACTATG 
GGCTGTCAGCTGTTATCATTTAAAACAGAAGGTGTTATCGGAATCTATTATGAAACCAGGTGGCGAAACTATTCGAGTGCCCAAAACCTGTGATC 
AACGACGAAGCCCAAGGGCTGAGTACCTAAACTTGCAGTGTGGCTGCTACAAAAAATACCGGTCCATCTCTCACTCTTTCTTGCTGCCGCTGCCC 
GCGAGCTCTGCTGGCGTCGCTGTCAGCATGTACAACAATAACAAAAAATATGAGTAAAATCTACATTAGTGCCGATATTGCTGCATTGTTGTTGT 
CTATTTGAAGACGTCATCAAATTGGGTACGCGGGTTTTCTCTCGCACGTTCTACTTTTTGCCACACACTCTCGCACACACCACACGCACGCACAC 
TCGAACGCACAGCTTCACAGATGGCCGTTTGTTGATATTGTTGCTGCTGTTGCATACGCACCTTTGTTGTTTATTGTTGGCTTTGGCTATCTGTC 
GTTTTGTGAACGCCCTAATACTTTGAATACTTTTATTTTGGCGAGTGTCGTGTGCTGCTGATTGCAATTTTATCACAAAATCAACAAGAAACACA 

AAC AC A C AAT T C G AG AG GC T G CT G CT GC T GC GC T C G C G AAAG AG AGT AGAAGAG AGGC AG AG AGAG C GG GAG C G AGT GC G AG AGC GGC AGGGC G A 
GAGGGAGGAGCGATAGCGAGAATACGGAATTTTATTACAATCATTGATTTTTCGTATTCGTATTTGTATTAGCGGCTCCCCGGCTCTTCACATGT 
TTTGCGCGCATACATATAATTATTTATTATTAAACCGCGCGCTATTTTACTTCGTTTTTTTATTATTAGCTTTTTGCACCAATTCTCTTGTATTT 
ATGTGACGGTGTGTGTGTGTATGTTTATTCGTTTTGCATGGATGTGTGTATGTATATGTTCGAGTGTTTTTAATTTCTTTTCTAAACACACTTTT 
TTGCCTGCCGCCGCTGTGTTGTTGCTGCTGCTGTAATTTTTGCAAGTTTTCTCTTCACACGCCTTTCATTCGCCGTGCGCTTTGTCCGTGTATTC 
GACGATTGCTTTTACTGCCTGTTCGGCTGCTGCTGCTTCTTCTTCCACTAAAAATCGGCTCTTTTTTGCCTGTACGAAATTTTCACAGAGAGCTA 
TTGCGTTGAAATGCATTTTAAATTGATGTCATTTCGAAAGCAAACACAACGTTCACATTTTCACTTTGATTGCAAATAACTATTAGTTTGGTTTA 
ATCGTTTAAACACGGCAAAATCGTATAAATATTTTTACTGCTACAAGTTGCGTCGTCGTCGTCTTCGTCTCGTTCGAGTGTGACCGTTTGCCGAG 
CATTTAAATAATCCGCTGCGAGCTGAGAAACGCACTAAAATTTACTAAAAGCACGCCATTTTCAGTGCTTTTAATAAATGAATATTTATTATTTA 
GGTGTGGTAAAGGTTAAGCAGTACGCTTTATATGAAATAAATGGGAGGTTGTGATTAAATGAGCTAAATTCTATTAAGGTTTAATTATATTATCG 
ATTAATTATGACAGGGCTGCCAGCGTGCTGGAAAAAGCTCATTTGGCGTTGCCACGGTCGCCTAAGGGTGTCTGGCAGCGTAGTTGGCAATTCAC 
CAAGGAGGACCGTACAGAAGTTCGCGGATAAAATATATCTTAAATGCTAAAATAGTACGCCTTAATGATAAAGAGTTCACGATTACCTAAGTTGC 
CAATGGGTAAGTTGTTTGCTTGCATTATAGACTGCGGTCTGCACCTAAACTGCTGGGCGGGGACTTACAAAAGCCGGCGAATTTCCATGTATATG 
CATTTACATTGTTGTTGGCCCGCAACTGCAGTGCTGTTAGGTTAACAGCGGCAGTGTGACCAGACGCCCGTTTGTTTATGTTTGGGGCTCGAAAA 
ATAACAGTGTCTGTTCAAAACGCATTCAGTGCGCAAGAACCTCCGCTTGCCGCCAAATGGAAACCCCCCAAGCGAAGCAATATTGAAGAGATCCG 
AGGCATAAACAGTGCTGAGGTTTAGTGCAACTGCAACTGCTCGCGATTGCGCTACGTGCGCTGCGGTTTCCTGCTGTAATTCAAAGTTCTGCAAT 
TCGCGCGATTTCCACGCCATGCAGTTGAAGCAAATGGACTGTTGATTGACTGGCCCTCAAGAGAGGAAAGCGGATTAAACGGCTTTAACAGTTAA 
GTTCGAAAATGGTTGAAAAACCAATGAAACGTATTTTTTTCTTTGCACTTACAAAACGAGCACGTGATGACCGAATTTTAATGGTACCCCCCCCC 

CCCCCCCACAAGGCACTTTAAACCATTCAAGTT 
(SEQ ID NO: 793) 

Exon: 5398. .4527 
Exon: 2945.. 1001 

Start ATG: 2338 (Reverse strand: CAT) 
Transcript No. : CT22195 

CTCGAACGAGACGAAGACGACGACGACGCAACTTGTAGCAGTAAAAATATTTATACGATTTTGCCGTGTTTAAACGATTAAACCAAACTAATAGT 
TATTTGCAATCAAAGTGAAAATGTGAACGTTGTGTTTGCTTTCGAAATGACATCAATTTAAAATGCATTTCAACGCAATAGCTCTCTGTGAAAAT 
TTCGTACAGGCAAAAAAGAGCCGATTTTTAGTGGAAGAAGAAGCAGCAGCAGCCGAACAGGCAGTAAAAGCAATCGTCGAATACACGGACAAAGC 
GCACGGCGAATGAAAGGCGTGTGAAGAGAAAACTTGCAAAAATTACAGCAGCAGCAACAACACAGCGGCGGCAGGCAAAAAAGTGTGTTTAGAAA 
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AGAAATTAAAAACACTCGAACATATACATACACACATCCATGCAAAACGAATAAACATACACACACACACCGTCACATAAATACAAGAGAATTGG 
TGCAAAAAGCTAATAATAAAAAAACGAAGTAAAA 

GAGCCGCTAATACAAATACGAATACGAAAAATCAATGATTGTAATAAAATTCCGTATTCTCGCTATCGCTCCTCCCTCTCGCCCTGCCGCTCTCG 
CACTCGCTCCCGCTCTCTCTGCCTCTCTTCTACTCTCTTTCGCGAGCGCAGCAGCAGCAGCCTCTCGAATTGTGTGTTTGTGTTTCTTGTTGATT 
TTGTGATAAAATTGCAATCAGCAGCACACGACACTCGCCAAAATAAAAGTATTCAAAGTATTAGGGCGTTCACAAAACGACAGATAGCCAAAGCC 

AAC AAT AAAC AAC AAAGT G GT T GT GC AAG AAG AAGC AGC AAC AAT C AG GC AAAAAAAT ACCT G AAAAAAAT AAAC G GT AAAAC G AAG AA GT G GC A 
ACACACACACGCACACACGTACGGCGAGAGCGCGCTAAAAAGTTTATTAAGTAAAATCTGAAAAGTAAAGGCAAAAGTAAAGACCACGGTGCGTT 
TCATTCTCCATTCGCAAAAATATATATTTTATTAAAAATGGTGGAAGTAATAGAGCGGCGGAGAACGGCTGCGTGAGCCAAAGCGCGGCCAATTA 
AAAAACAACAACAGCACAGCAGTAGCAGCCGCTAAAGGGAAGAAGAAGAAGAAGACGTCGACGACGAAGAAGCTGAAGCTGAAACAACCGAAGAG 
GAGGCAGCAAGACAGCAAGAGCCTCGAAAAGTACATAAAAAAAGTATAAAAATTTAAATACGCTCGAAGTCGGTAGACAAAAAGCGAACTAGCGA 
CGAAAGAGAGGCGAGAAAACCCAAACTCCGAAATCTTAAAACCACCTTACCCAGCGCACCAATAGCAACAACAGCAACAT CAT CATC AAC AAC AA 

C AAC AGC AG AAAC AAAAC G C AACT AC AGC C AAC AAC C AAC AT C AC AC AAAC AAAAT GT AC GGC AAAT C AAAG AC AT CGGCGGTTC C AT CGG ACG C 
CCAGGCCCGCGAAAAGTTGGCCCTGTACGTGTACGAATATCTGCTGCACGTTGGCGCCCAGAAGGCGGCACAGACATTCCTCTCGGAGATCCGAT 
GGGAGAAGAACATAACGCTGGGCGAGCCGCCGGGATTCCTGCACACCTGGTGGTGCGTCTTCTGGGACCTGTACTGTGCGGCGCCAGAGCGCCGA 
GATCAGTGCGATCACAGCTCGGAGGCCAAGGCCTTCCACGACTACGGCTTTGTTAGCTCCGGCTACGGAGTGAATGGAATTGGACCGGGTGGACC 
CCACAATGCTGGCGGACCAGCGCCCAGTCCCCTCGGGCAAATGCCGCCCGGCGGCGATGGCGGTCCAATGGGTCCCGGCGGACCAATGGGACCGA 
ATTTCTTCCCTAACTCGACGATGCGACCGTCGCCGCCGACGCACGCCTCCAGTCCGCAACCGCCGCCGTCGCAAATGATGCCAGGCCAGCCGCCG 
TTCATGGGCGGACCCCGATACCCGGGCGGACCGCGGCCCGGCGTGCGTATGCAGGGCATGGGCAATGAATTCAATGGACCCCCCGGCCAGCCCAT 
GATGCCCAACAGTATGGATCCGACACGGCCAGGCGGAGGGATGGGACCGATGAATCCGCGCATGAATCCGCCACGCGGTCCTGGTGGCATGGGAC 
CGATGGGCTATGGTGGACCAGGAGGAATGCGCGGCCCAGCACCGGGACCCGGCGGAATGCCGCCCATGGGCATGGGCGGAGCGGGCGGAAGACCG 
CCGCAATGGCAGCCAAACGCCTCGGCGCCACTAAATGCGTACTCGTCATCATCGCCAGGAAACTATGGACCCGGCTCGAATGGACCGCCGGGACC 
GGGAACGCCAATAATGCCGTCGCCGCAGGATAACACGCAAGGTGGACCAGTGGGCGGACCGGGCGACAGCATGTACGCCCTGATGAAACCCGAGT 
TCCCGATGGGCGGAGGGCCGGATGGCGGCGGTGGCGGAGGTGGTCCCGGCGGAATGGGACCGATGGGCGGCGGGCCAAATTCAATGGGCCCCGTA 
CTTAATGGCGGCGGCGGTCCGGACGGATCCGGTTTGGATGGCATGAAAAACTCGCCGGCCAACGGAGGACCTGGGACGCCGCGCGAAGACTCTGG 
CAGCGGCATGGGTGACTACAATCTGGGCGGATTCGGCGGACCCGGCGAGAACGATCAAACGGAATCGGCGGCCATTCTGAAGATCAAGGAGAGCA 
T GC ACG AGGAGGC C AAGC GGT T C GAG AAAGAC AC AGAT CAT C C GG ACT AC T T T AT GC C AT AA 
(SEQ ID NO: 794) 

Start ATG: 1480 (Reverse strand: CAT) 
MYGKSKTSAVPSDAQAREKLALYVYEYLLHVGAQKAAQ^ 

SSGYGVNGIGPGGPHNAGGPAPSPLGQMPPGGDGGPMGPGGPMGPNFFPNSTMRPSPPTHASSPQPPPSQMMPGQPPFMGGPRYPGGPRPGVRMQ 
GMGNEFNGPPGQPiyMPNSMDPTRPGGGMGPMNPRMNPPRGPGGMGPMGYGGPGGMRGPAPGPGGMPPMGMGGAGGRPPQWQPNASAPLNAYSSSS 
PGNYGPGSNGPPGPGT PIMPS PQDNTQGGPVGGPGDSMYALMKPEFPMGGGPDGGGGGGGPGGMGPMGGGPNSMGPVLNGGGGPDGSGLDGMKNS 
PANGGPGTPREDSGSGMGDYNLGGFGGPGENDQTESAAILKIKESMHEEAKRFEKDTDHPDYFMP* 
(SEQ ID NO: 795) 

Name : SSDP-like 
Classification: DNA_binding 



Celera Sequence No. : 142000013384839 

ATAATAAAACATTCGCGTGTACTTATGTGAAGCAGACGATGTTAAATTATGCCTCCAGTACAAGGACACTTCGTGCCATTTAGACTTACAATCCG 

ATCTAGATCGATTTCAAATATGGTGCCGTGATAACGTGTTAGACTTAAATGGCTCCAAGTTGTTGTGCCAACCCAATACGCACGACTTACACTCT 

AAGTGGGTGCTCCTTGGACAGAATAACACGAGTTGATGATCTTGGATGATCTCTGGACCCAAAACTAAAATTTTCTGACCATATTTCGTCTATTG 

TCAATAAGGCCAGGGTTGTTCTTGGTTTTATAAAAAGGTGGTCTAAGGAATTTGATGATCCTTACTTGACTAAAACCTTATTTATTTCGTTAGTC 

CGTCCGATTCTCGAGTACGGATCACCTGTTTGGAGTCCACAATACGCAGTCCACTCGGACCGCATTGAATTGGTCCAAAAAAAAATTCTTACTTT 

TTGCCTTGCGGCGCCTAAATTGGGATGCAAACCGTATATTACCTCCTTATATTAGTAGACTTCCTTTAATTAATTTACCGTCCCTAGCTAACCGT 

AGAACTATGCTTGGAACAGTCTTTATTTGTAAGCTTATTCGTGGGGATGTTGAGAGTCCCGACTTTATTAGTCGGCTTAACTTTTCGGTTCCAAG 

TAGATTCACTAGAAACTATATACCCCTTATCTTAAATCATTGTAGATCTAACTATGAGTTGCATGACCCTTACAGAGTTTTATGTTCTGACTATA 

ATAGACTTAACTCTCTGCCGCTCTTAAAGCAATCGATTTTAACTTTTTTATTACATAATTAGATCCTACTAACAGTAATATTTAATATAGTTATT 

TTACATTATATTCATTTCCTCGTTCCTGTTCTCTTTTTTATATCGCGTCTATATCTTCTCGCGAATCGAGCCGTACGATACACGGCAGCGCCCCT 

CGGTCGGTTGGGCGGGAGGTGTGGACGTAGGACCCGTGCGAACAAAAAAAAAAAAAGAATCGAATTCTTTTAGCCTTCTCAAACGGTTCTTTCTT 

CTACATGCGATTTGTGCCCAGTTTTCCTGTGTGGGTTTGCAAGTGTGTTAGCAGTGCGGATTTCCGCGAAAAAGTCCTTGGGCAGCTGGAACACT 

TAAACGGTTTTCTTTGTGCGTTACTATGTGCCGGCTTAGATCCTGAAGATTTGCAAGGACCTTCGAGCAATGGCTTATTTTTGGTGTGTGATTTC 

AAGTGAGTGGACAGAGCCGAACGGTCAGTGAAAGTTTTTGGGCACTTACCACACTTAAGTGCCTTTGAGTGGCCTAGTGTTTTGTGAGCAGATGC 

TTTTGAGCTTTGAGAGCATCTCGCGTTCCTTGACAGGTGTCGCTTGAGATACGCATGTAGCGCAAAGTCCTTTGAGCAATGGCCACAAGTAAAGA 

GGCGTTCTTGGTGCACTCGCATGTGCATCCAAAGTTCATGTTCGACTTGAAAGGACTTCGAGCACTGGGAACACTTGAACTGCGTTTCATCAGTG 

TGCGTGGCCAAATGCTGCTTGAGAATTAAAGAAGTTAGAAAGCACTTTTGGCAGTATGGGCACCTTAACGATCCTCTTCTTTCGTGCATCTGTAT 

GTGGGCCTTGAGGTGGTGCCCAGCTGTAAAGGACCTTGGGCAATGCGAGCACTGGAAGGGTCGTTCTCCTGTGTGGGTTCGCATGTGATTCTGAA 

GACCAGAGCTATAGCCAAAGGACTTGGGGCAGAGGGAACACGTGAAGGGACGTTCTCCTGTGTGTGAACGCATGTGTACCTTCAGATTTGATTTT 

ACTTGAAAAGATTTAGGGCAGTGAGAACAGCTAAAGCCTGGTTTCTCCTTGGCCTTGATATCACATTCGTTAATATCAGGTTCATTGCCGGAATC 

GACGTCGTCGGCACCATCCTGGAGATCCCTCTCAGCTGCCTCCACTTCTTCTGAGCATCCGGATCCATCATTTTCACTCTCCTCCTCCCGTACAT 

CCTTGAGGAAGCGGTAAAACTGGTGACTTCTCTCGTACGTTTCTATGATATCGAATGCATTCTGCGCATCCTCCAGACAGGACTTGCATATGGTG 

TTGGACAGTGGATCGTGCTTCTCAACTTGCCAGCCGGTGCACTTGTATATTATGTACGCAATGGATAGGCCCGGTTCATGAGACTCATCGAAAAC 

ACTGGCCATATTTTCGTAACTGACCAGGCAAACTCGGCATATTGACGATTCCATCGATCTGAAATGACACGAACTGTGTGAATATAAAATTGCGA 

ATATACAAAATCGTGCACTTAAACGGTTTCAAAAAAGTATAATGGTTATTCCCAGATCTTTGAGTATTAGCGTTTCTGGAATCTACATCCTGAAT 

GCTAAATACACTGAGATCTCAACTTTCAAACAAACGGACAGAATAGGTACGACTAGATCGTAGTCATGGATGACACATACACTTTATAAGGTCAG 

TGACAAACGTCATCTTTTAGTTGCATAGCATCAAAAAGAACCCTCTGCACTCTTTCTTATATTTAATAATCAATGAAATTTAATACTTGTGTAGA 

GCTTTAAACATAAATTGCACTAATTTCCAAATACATCTTGGTTTATTTATGACTTTGCTAACTCGGCACGAATTGACCCACAATGGCAATCCTTA 

CCTTAAACTTACCTTATTTTCGTGTTTGTTCACGGCGAATGCGTATTTAAAATGTGGCAATAAGCACAGGAATTGGCAATCTGTAAAGATGTTGC 
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TATGACAAGGGTTGTACTCCTTCTGAAGTGAATAGGTAAGTCGTAAGGCTTATATACAAACATGTCCCAAGGTCGTTTGGTTCTGTAAAATGGGC 
AACATAGCATATTTAACAATGTTACCTGGGTCAGGTGTCATCAAAGAAGGGCTTTCCAGCAGCTGCACTCATCTTTTTACGGTGCGTCCCAACTA 
ATCTTACGATTTTCTGTAGATCCAAGCAGGTCAATGCATAATGTGATACATGCAGGTAAAAGTMCTATGGTTAAAGAAAAAATCTTTCCTTGGT 
CGACCCCAAAGTGCTAAATCATTATTGCTGAACTCTAAAACCATGTGTTTGTAACGAAATATACTTATTGGGGTGGTGGAAAACAGGGGAAAACC 
ATTTACCCAATGACCAATAAAGCTAGCCATACCCTTCTATTCCCAATATACCCCATGGCGGAATCATTCTAGTAGCAAATTATTCAATAAAAAAC 
(SEQ ID NO: 796} 

Exon: 2230. .1001 

Start ATG: 2194 {Reverse strand: CAT) 
Transcript No. : CT22233 

TCAATATGCCGAGTTTGCCTGGTCAGTTACGAAAATATGGCCAGTGTTTTCGATGAGTCTCATGAACCGGGCCTATCCATTGCGTACATAATATA 
CAAGTGCACCGGCTGGCAAGTTGAGAAGCACGATCCACTGTCCAACACCATATGCAAGTCCTGTCTGGAGGATGCGCAGAATGCATTCGATATCA 
TAGAAACGTACGAGAGAAGTCACCAGTTTTACCGCTTCCTCAAGGATGTACGGGAGGAGGAGAGTGAAAATGATGGATCCGGATGCTCAGAAGAA 
GTGGAGGCAGCTGAGAGGGATCTCCAGGATGGTGCCGACGACGTCGATTCCGGCAATGAACCTGATATTAACGAATGTGATATCAAGGCCAAGGA 
GAAACCAGGCTTTAGCTGTTCTCACTGCCCTAAATCTTTTCAAGTAAAATCAAATCTGAAGGTACACATGCGTTCACACACAGGAGAACGTCCCT 
TCACGTGTTCCCTCTGCCCCAAGTCCTTTGGCTATAGCTCTGGTCTTCAGAATCACATGCGAACCCACACAGGAGAACGACCCTTCCAGTGCTCG 
CATTGCCCAAGGTCCTTTACAGCTGGGCACCACCTCAAGGCCCACATACAGATGCACGAAAGAAGAGGATCGTTAAGGTGCCCATACTGCCAAAA 
GTGCTTTCTAACTTCTTTAATTCTCAAGCAGCATTTGGCCACGCACACTGATGAAACGCAGTTCAAGTGTTCCCAGTGCTCGAAGTCCTTTCAAG 
TCGAACATGAACTTTGGATGCACATGCGAGTGCACCAAGAACGCCTCTTTACTTGTGGCCATTGCTCAAAGGACTTTGCGCTACATGCGTATCTC 
AAGC G AC A C C T GT C AAG GAAC GC GAG AT GCT C T C AAAGCT C AAAAG C AT C TGC TC AC AAAAC AC T AGGC C AC T C AAAGG C ACT T AAG T GT GGT AA 
GTGCCCAAAAACTTTCACTGACCGTTCGGCTCTGTCCACTCACTTGAAATCACACACCAAAAATAAGCCATTGCTCGAAGGTCCTTGCAAATCTT 
CAGGATCTAAGCCGGCACATAGTAACGCACAAAGAAAACCGTTTAAGTGTTCCAGCTGCCCAAGGACTTTTTCGCGGAAATCCGCACTGCTAACA 
CACTTGCAAACCCACACAGGAAAACTGGGCACAAATCGCATGTAGAAGAAAGAACCGTTTGAGAAGGCTAAAAGAATTCGATTCTTTTTT 
(SEQ ID NO: 797) 

Start ATG: 37 (Reverse strand: CAT) 

MASVFDESHEPGLSIAYIiyKCTGWQVEKHDPLSNTICKSCLEDAQNAFDIIETYERSHQFyRFLKDVREEESENDGSGCSEEVEAAERDLQDGA 
DDVDSGNEPDINECDIKAKEKPGFSCSHCPKSFQVKSNLKVHMRSHTGERPFTCSLCPKSFGYSSGLQNHMRTHTGERPFQCSHCPRSFTAGHHL 
KAHIQMHERRGSLRCPYCQKCFLTSLILKQHLATHTDETQFKCSQCSKSFQVEHELWMHMRVHQERLFTCGHCSKDFALHAYLKRHLSRNARCSQ 
SSKASAHKTLGHSKALKCGKCPKTFTDRSALSTHLKSHTKNKPLLEGPCKSSGSKPAHSNAQRKPFKCSSCPRTFSRKSALLTHLQTHTGKLGTN 
RM* 

(SEQ ID NO: 798) 

Name: zinc finger protein 
Classification: transcription_f actor 



Celera Sequence No. : 142000013384534 

ATGCATACATAGATAGCTATATGACATCCAGACATTCATGCCACATGCCACTATTATACACACATACTATATAAAACATGTTTACTACTCCACGA 
CTCAAATTTGCAAATATTGTACGCTATTTAGCCTAGCAGTTAGAGCAGCATACCCATAATATATTTATATGAAACAAATCGCCCAGCAGGACACA 
TAAATTGTGGTATGTATGTGCGTGAGTGTATTTTTGTATGTACTACCTATTAAATATAGAAATATATATATATATATATGTGCGGAGAACCGTGT 
TCCTGAACACCGATCAAAAGGAATTAGGCATTTTTGTTTGTCCTGGACTGGCCGTGGATGATCCTGCGGACCGCCAAGGAATAGAGATGTGCCAA 
TTTCTGGTCCACCAAGCACAGTTGATAACCTATAAGCATCCTTACAACGTCACCACAACAGACTGGGACAAATTGCTGGTCGTAGCGTCTACTTC 
TATACGTTAATCTATATCTATATGTTAAACTGTAATTTTGGATCCGTGTCAACCAGTCGTTTAGTTCCATAGTCAAGTCATTACCGGGTCCGTTA 
GACAAGGATTTATGCAGATACTCTCAACTAATCTAGAATCTAGACAAAGGCGACGAAGCAGATCAAAAGAGTTTTGCTACCACTACCAAGTGCTG 
TACACGCCAAAAACGAAAGGCTTCAGTTTAGCTATAGACACAAATATATATATATATATATATATTATATAAAGAGATGTATTTACATATTTTTG 
GCATTATAGATACATACACAATGTGGAAAATTGCAAAACAATAATATGGGTGGAGAGGTAAAGGGAAAGTAATTCTTAGAAAGGCAGATTTTTTC 
ATACAGGAAACGGTGAAAATCAAACTTTTCAACATAAGAATTAAACATATATATATATATGGGAAAACCAAAACCTATAAAACATTTACTAAAAG 
TGAAACCAATAAATGAAAAATGAGTTATACACCCTTAAAACAATATATATGTGCAAATTGTAAATGGAGCATGTAATTTTGTAAAAATTATAAAT 
AAAAGTTTATCAAGTCGAGAACTTTTGAACTTAAGTTAAAATTGGAGACAATTTACAGAGCTGACGGCCTATTTAAATACAAACTAAATACAATT 
ATAGAAACTAGATCAAAATCCGGATGGATAAAGATGTCATTTGGCCACTTATTGCATACACTCATTTCACTGGAACTTTTCTGTAAGTCCTCGGG 
TCTTCATTGCCATTTCCGACAAATGTTGTTGCAATAACCTCTTGTTGACCTTTTCCCTGTGAAAAAATTTATAAAATTAATACAATCTTTAAGGA 
TCTATATATAAGTAACGCCTCTCACTTGATTATTAGAGCAGTGTAGCGCGTTTTATCCGCGTCGGTTATGAGCTGTGTGGGGAAATTGGGAGTGG 
ACATTAGGGACTTCATCCAGACGGCCATCTCGGCGGCATATTTTCTGTTCATGGCAAGAAGGATGTCTGCAAACTTATCCACCTGCGACCGCGGC 
GTTAGGTAGCCCACGCACATCATCGCCGTGTACAGCGTCTGCTCGCCGGTGGCCAGGACAACTTCTGTGACATGAGCATGGTTACGCGACTGCAT 
CACAAAGTGGGTAAGAAACTGTATGCTATTTCGGATGGCACCACTCTCCGGCAAAGTCATTCCACGCTGGGCGTAAAAGACCAGACGATCGTAGG 
CCAGTGTCTTGTCCTCCAAAACTTGAGGGATTTTCTTAATGATCTGCGTCAGACAGCCGAAGAAGGTTTCCATGGTGTCCGAAATGTTGGAGAAG 
TTCTGCTCGGGAGTGCTCTCAAATAGCTTAAAGCTGTGTTGGATGAACTCCCTCAGAAGTTGCTGCATCAGCGGCTTGCAGCCTTCGTCCTTAAA 
GAACATAACAATGGCCTGAAACAAAGGATTCTATTAGATCACTCAGTCATTTGCAATATATTGGTAAGGCTCACCGTTTTGGATATTTCTAGAGT 
AGGAGCGCAGCATCGGGTCTGAAAGCTGGCAACAATGAAGAGACAAAGATCCTGCAGCATTGGCTGGAAGCTGCTCCTTAAATTCGTTATCGCAT 
GCTTCATCGCACTGCAAGCGGCCTATATGAATACGATAACATTTTATTAAAAATATGCAAAATTTTGTGTTTATTGCCTTACCTCTAGGACATCG 
ATCTCCTCCACCCACATCTCGGCGATTCTCTTGAAGATGGGCATCGTGCGCTGCATGACGAGCAGAACTGGCTGCACAATCGGTTGGTCCGTCGC 
CTGCTCGTCCACATCCGTGTTTAGCGAGGAGAAGAGAGTGGAGATCATATTGAGCCGGAAGATTGTGCGAATTCGAGCAGCGGGAGTTTTCTAGT 
GCAACAAAAGAAATGGTTAGCAACTGATCTCTTTATATATTTTATATTTATATTTACTCACCGAATCTGCCTGGCAAATGGCCTGTAGCTCCTCG 
AAGCAGGGACTGACGATAATATCCAAGTATTTGGGTATTTCCTCTGGCCGCAGCAGGCTCATTAGCTTACCGATGCTGAACATAAGCCTCACCGA 
ATCGGAGTTCTTCATGCGGCCCGTGTTTAGGGAAGCGTGGCATGCGTTTAGGAGCGGATCCGCATACGGCTTCAGCTGCAGCTGACAATCGCGAC 
ACAGCTCCTTCAGGCCGAGCGTCGCCTGTGCTGACATTGAGGAGTTGAGGCCGCGCACCAGTAGGTTAATGGCTGGCGGAATGTACGCCGGATTC 
TCCATCAGCCAATTGCAATAGGAGCCCATTGTCTCTAGCGCAGTGCCCAGTAGCTTGACATTGAGCTTCTCGTAGGGAATCTCGGCCAGCACGCG 
CATCAGCCGTGGTATTTGCCGCTTCTCCTCGCCACCAAAGTGCTCGGCCACCGACTGAAAGGAATAGATGCACGCCTCCAGCTTTGTCCAGTGTG 
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TGGGATGTCTTTGCAGATCGGCAATGGCCTCGTCCAACATGGCAGCCAGAATCTCCAAGATATAGTCATTCAACACATCATAGCAATACATCTAA 
AATAAATAAAATAAATGCTATATTATGCCTGTTTGTTAATAGTTGAAAACGGTAGCCAATCCTTTCGATTTGGAGACACTCCAACTGTTGCAACC 
ACTCACAAAGGTATCGGATATATCCTGTCTGTAGCACCTAAAGCATTCCAAATCATCGGAGCTCCACTTGGCGAGCGACTTTTCGTCTGGCTGCT 
CTGACTTTCTTACTAAAATCCTGGTTAGATGGGCGTACAATGGTTTGATGTACTCCCAGCACTTGTGTTTCTGCTCATCGTTAGACATGGCGAAC 
ACTTCGTCTTGCAGCATATACCAAAACGCCAAGGCCATTGTGCTGCAGGACTCCTCCACCGGGTAAATGCCCGGCTTGTCCGTGCAGTGGAGAAT 
CTCCTGCACGATGCGATGCACTAGGATAGACAATTCCGGATCCGCGGACGTGATGCCGCTGAGCAACAGCGTCGAATGTCGCTCCACCGAAGAGA 
CGAACAGCATGTAGATATGTACGATAATGTCCTCGTTGTCGTTCTCCCTCTTCCATTCCGTCTTTGTGATTTCGGATAGCGAGTCCAGGAACATT 
TTTATAAGCACAAATGCCGTCTTTGGATAGTTGTGGCAGTCTGGCTGGATGATAATGTTCACCATGGTCTTCAGACAGGACTCGGCCAGTTCGTT 
CTCATCGGCGGTCATGCAGCCGTCGCCGGCGTGTATGCAGGGCCAATAGCACTTGTGCACCACTTCCAGGAGCACGGCGGTGATGGTGACGCAGC 
CCTCAATGGTGTAGCCGATGTTTCTGTCCAAGTTGGAAACTCACTTAGAACGTGTAAAGAATAATTTTGTAAATTGAATACTCACTTGATCCAGG 
TGCCGACGCACTTAACGGCCCTGTTCATGTTACTGTACGCCTCGGCATCCCACACGCGGTTCATTTGCAGCTTCAGATATCGCTCAACGGTGTGA 
ATGACCAACTGCACTCGCTTGGCGATCTCGGCGCGCAGAACAACCCGCTTGACAGACGTATGTATGACCTGCGCTTCCTCGGGAATGGCGGTCAG 
CACCTCCAGCATGATCCACAATTGCACATCGGCGCTGACATTTGGCATCCGTTGATTCTGAAACGTGTTGATCACCTCCTCGATGGCGCCGGGCC 
ATTCGCCCAGCATGTGAACGATGTAAGCGCCCAGCGAGATGCACAACCGATTGAGGACGATCTTCGGTCCGCCAGCGAAGCGAACGATGGATTCG 
AGTATCTTTTGCTTTAACTCCTCGCGATTCTCTGGCGGCACTTCGTGCCAGTGCTTCATAAGCTTGGAGTGCAGCGTGATGGCGCCGAAGAACTG 
CACCTCCTGACTCTGTGTGGGCGGTGGGGTGATTGGGTGGGCGGTAGGTGCATGAGTATACATATCCGGGTAATTGATGCATTGATGCGCAGGTC 
CTGCGTGGTACTCACCTTGCCGAGCTGCATTAGCTGCCAGGAAAACTGCCAGGCCTGTGGACTGGCCTCCGCATCGGTAAGCCATTCATGGGTGA 
TCGCCTGGTTCTGCGAGTTCGAGCGGTAGAACGACACGACCGCCTCCTCCAGACGCGCAATGTCAATGGGCTCCATGTGGAATGGCTGCTGCTGC 
TGCTAATGCTGCTGCTGCCGAGAGGCAGTCTCCTGTCCGCTAATCCTCGGGTGGTGGTTTTCTTTGCTTCTGCCTGTGGGCACTGGATTCGCGAG 
TGTGACCAACGCGCGAAGTCACCAATTCGCGGACGGGCCGAGTGTGCGGTCACTCCATTTTCGACTTTGCCGCGCTTTTTTTCCTCCTGTATTTT 
TTATAAAAAGTGTTGAAATGTACGTATTTAATAATAAATAAATATGTAATATAGTATAAATTGCTTGTGGCCTATATGCTCGATATATGCAGGAA 
AAATTCAATTTATAACAACATTTATTCATTCGTACACTTCATAAAATTTTACTATTTCTATTTAGTAAATACTCTCTGCAATTGAGGAGTATCTT 
AACAAGTTAACAGTGAATTTGGAAAGCCCAAAAAGTCCTATATAAAATCGAATTTAAGTTGCCAGAAACAGTATCGCAATTTGGAACACCGGTCG 
GCGATGAAATCCCTGCTATCCTCATCGATATACACCTCAATAAGGACGTTGTCGCAGAATTGAGCGACCTGCCTCTCCACGTCTTGGCAGTCCAA 
AGTGGAGCGATTAACGACGGCCCATCTGTCGGTATTCAAGCCCGGGTTCTGTGACTCCAGTAATGCGAAGATCACACAGAGCGACTTATTCGGAC 
TGACCAAGGGCAGGACAAAGGTCGTCCTCACTAGATCGAAGAACTTAATGAAGGGCTGGCAGGAGTGGGGCGGCTCCACACCTTCGACTGCTTCG 
ATGACCCATTGGGCCGTTTCGTTATCCTCGCAGACCACCAGCATGTTGCAGTCGAATGTCTGCGTTGAGGCACACTCACTAAAATTGAGATCGCT 

G ACGCT C G AT C C AC T GT C CT C G AAT AT T G AAT G AAAGT GC T C C AT T AGG GC G AT AGT GT C C AC AC C AT CC GAGC AC GAACT C TGG T T AGT T AT CA 
CCAAGGCGTAACGTCGGTTCTCGGGCATTGTGTTCGCATACGGCGGCCGAGTATCCCTCTTGCGACAATTCTTTGGCGTGCAGCAGTCCTTCTTC 

CTCTCCTTGCACTCCTCGGCCTGCTCCACTGAGCAGCTTGT 

(SEQ ID NO: 799) 

Exon: 4741.. 4481 
Exon: 4382.. 3886 
Exon: 3823. .3142 
Exon: 3036.. 2437 
Exon: 2370.. 2173 
Exon: 2112. .1975 
Exon: 1915.. 1356 
Exon: 1291.. 1001 

Start ATG: 4636 {Reverse strand: CAT) 
Transcript No. : CT22245 

CCAGTGCCCACAGGCAGAAGCAAAGAAAACCACCACCCGAGGATTAGCGGACAGGAGACTGCCTCTCGGCAGCAGCAGCATTAGCAGCAGCAGCA 
GCCATTCCACATGGAGCCCATTGACATTGCGCGTCTGGAGGAGGCGGTCGTGTCGTTCTACCGCTCGAACTCGCAGAACCAGGCGATCACCCATG 
AATGGCTTACCGATGCGGAGGCCAGTCCACAGGCCTGGCAGTTTTCCTGGCAGCTAATGCAGCTCGGCAAGAGTCAGGAGGTGCAGTTCTTCGGC 
GCCATCACGCTGCACTCCAAGCTTATGAAGCACTGGCACGAAGTGCCGCCAGAGAATCGCGAGGAGTTAAAGCAAAAGATACTCGAATCCATCGT 
TCGCTTCGCTGGCGGACCGAAGATCGTCCTCAATCGGTTGTGCATCTCGCTGGGCGCTTACATCGTTCACATGCTGGGCGAATGGCCCGGCGCCA 
TCGAGGAGGTGATCAACACGTTTCAGAATCAACGGATGCCAAATGTCAGCGCCGATGTGCAATTGTGGATCATGCTGGAGGTGCTGACCGCCATT 
CCCGAGGAAGCGCAGGTCATACATACGTCTGTCAAGCGGGTTGTTCTGCGCGCCGAGATCGCCAAGCGAGTGCAGTTGGTCATTCACACCGTTGA 
GCGATATCTGAAGCTGCAAATGAACCGCGTGTGGGATGCCGAGGCGTACAGTAACATGAACAGGGCCGTTAAGTGCGTCGGCACCTGGATCAAAA 
ACATCGGCTACACCATTGAGGGCTGCGTCACCATCACCGCCGTGCTCCTGGAAGTGGTGCACAAGTGCTATTGGCCCTGCATACACGCCGGCGAC 
GGCTGCATGACCGCCGATGAGAACGAACTGGCCGAGTCCTGTCTGAAGACCATGGTGAACATTATCATCCAGCCAGACTGCCACAACTATCCAAA 
GACGGCATTTGTGCTTATAAAAATGTTCCTGGACTCGCTATCCGAAATCACAAAGACGGAATGGAAGAGGGAGAACGACAACGAGGACATTATCG 
TACATATCTACATGCTGTTCGTCTCTTCGGTGGAGCGACATTCGACGCTGTTGCTCAGCGGCATCACGTCCGCGGATCCGGAATTGTCTATCCTA 
GTGCATCGCATCGTGCAGGAGATTCTCCACTGCACGGACAAGCCGGGCATTTACCCGGTGGAGGAGTCCTGCAGCACAATGGCCTTGGCGTTTTG 
GTATATGCTGCAAGACGAAGTGTTCGCCATGTCTAACGATGAGCAGAAACACAAGTGCTGGGAGTACATCAAACCATTGTACGCCCATCTAACCA 
GGATTTTAGTAAGAAAGTCAGAGCAGCCAGACGAAAAGTCGCTCGCCAAGTGGAGCTCCGATGATTTGGAATGCTTTAGGTGCTACAGACAGGAT 
ATATCCGATACCTTTATGTATTGCTATGATGTGTTGAATGACTATATCTTGGAGATTCTGGCTGCCATGTTGGACGAGGCCATTGCCGATCTGCA 
AAGACATCCCACACACTGGACAAAGCTGGAGGCGTGCATCTATTCCTTTCAGTCGGTGGCCGAGCACTTTGGTGGCGAGGAGAAGCGGCAAATAC 
CACGGCTGATGCGCGTGCTGGCCGAGATTCCCTACGAGAAGCTCAATGTCAAGCTACTGGGCACTGCGCTAGAGACAATGGGCTCCTATTGCAAT 
TGGCTGATGGAGAATCCGGCGTACATTCCGCCAGCCATTAACCTACTGGTGCGCGGCCTCAACTCCTCAATGTCAGCACAGGCGACGCTCGGCCT 
GAAGGAGCTGTGTCGCGATTGTCAGCTGCAGCTGAAGCCGTATGCGGATCCGCTCCTAAACGCATGCCACGCTTCCCTAAACACGGGCCGCATGA 
AGAACTCCGATTCGGTGAGGCTTATGTTCAGCATCGGTAAGCTAATGAGCCTGCTGCGGCCAGAGGAAATACCCAAATACTTGGATATTATCGTC 
AGTCCCTGCTTCGAGGAGCTACAGGCCATTTGCCAGGCAGATTCGAAAACTCCCGCTGCTCGAATTCGCACAATCTTCCGGCTCAATATGATCTC 
CACTCTCTTCTCCTCGCTAAACACGGATGTGGACGAGCAGGCGACGGACCAACCGATTGTGCAGCCAGTTCTGCTCGTCATGCAGCGCACGATGC 
CCATCTTCAAGAGAATCGCCGAGATGTGGGTGGAGGAGATCGATGTCCTAGAGGCCGCTTGCAGTGCGATGAAGCATGCGATAACGAATTTAAGG 
AGCAGCTTCCAGCCAATGCTGCAGGATCTTTGTCTCTTCATTGTTGCCAGCTTTCAGACCCGATGCTGCGCTCCTACTCTAGAAATATCCAAAAC 
GGCCATTGTTATGTTCTTTAAGGACGAAGGCTGCAAGCCGCTGATGCAGCAACTTCTGAGGGAGTTCATCCAACACAGCTTTAAGCTATTTGAGA 
GCACTCCCGAGCAGAACTTCTCCAACATTTCGGACACCATGGAAACCTTCTTCGGCTGTCTGACGCAGATCATTAAGAAAATCCCTCAAGTTTTG 
GAGGACAAGACACTGGCCTACGATCGTCTGGTCTTTTACGCCCAGCGTGGAATGACTTTGCCGGAGAGTGGTGCCATCCGAAATAGCATACAGTT 
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TCTTACCCACTTTGTGATGCAGTCGCGTAACCATGCTCATGTCACAGAAGTTGTCCTGGCCACCGGCGAGCAGACGCTGTACACGGCGATGATGT 

GCGTGGGCTACCTAACGCCGCGGTCGCAGGTGGATAAGTTTGCAGACATCCTTCTTGCCATGAACAGAAAATATGCCGCCGAGATGGCCGTCTGG 

ATGAAGTCCCTAATGTCCACTCCCAATTTCCCCACACAGCTCATAACCGACGCGGATAAAACGCGCTACACTGCTCTAATAATCAAGGAAAAGGT 

CAACAAGAGGTTATTGCAACAACATTTGTCGGAAATGGCAATGAAGACCCGAGGACTTACAGAAAAGTTCCAGTGAAATGAGTGTATGCAATAAG 

TGGCCAAATGACATCTTTATCCATCCGGATTTTGATCTAGTTTCTATAATTGTATTTAGTTTGTATTTAAATAGGCCGTCAGCTCTGTAAATTGT 

CTCCAATTTTAACTTAAGTTCAAAAGTTCTCGACTTGATAAACTTTTATTTATAATTTTTACAAAATTACATGCTCCATTTACAATTTGCAC 

(SEQ ID NO: 800) 

Start ATG: 106 (Reverse strand: CAT) 

MEPIDIARLEEAVVSFYRSNSQNQAITHEWLTDAEASPQAWQFSWQLMQLGKSQEVQFFGAITLHSKLMKHWHEVPPEKREELKQKILESIVRFA 

GGPKIVLNRLCISLGAYIVHMLGEWPGAIEEVIKTFQWQRMPNVSADVQLWIMLEVLTAIPEEAQVIHTSVKRVVLRAEIAKRVQLVIHTVERYL 

KLQMNRWDAEAYSNMNRAVKCVGTWIKNIGYTIEGCVTITAVLLEWHKCYWPCIHAGDGCMTADENELAESCLKTMVNIIIQPDCHNYPKTAF 

VLIKMFLDSLSEITKTEWKRENDNEDIIVHIYMLFVSSVERHSTLLLSGITSADPELSILVHRIVQEILHCTDKPGIYPVEESCSTMALAFWYML 

QDEVFAMSNDEQKHKCWEYIKPLYAHLTRILVRKSEQPDEKSLAKWSSDDLECFRCYRQDISDTFMYCYDVLNDYILEILAAMLDEAIADLQRHP 

THWTKLEACIYSFQSVAEHFGGEEKRQIPRLMRVLAEIPYEKLNVKLLGTALETMGSYCNWLMENPAYIPPAINLLVRGLNSSMSAQATLGLKEL 

CRDCQLQLKPYADPLLNACHASLNTGRMKNSDSVRLMFSIGKLMSLLRPEEIPKYLDIIVSPCFEELQAICQADSKTPAARIRTIFRLNMISTLF 

SSLNTDVDEQATDQPIVQPVLLVMQRTMPIFKRIAEMWVEEIDVLEAACSAMKHAITNLRSSFQPMLQDLCLFIVASFQTRCCAPTLEISKTAIV 

MFFKDEGCKPLMQQLLREFIQHSFKLFESTPEQNFSNISDTMETFFGCLTQIIKKIPQVLEDKTLAYDRLVFYAQRGMTLPESGAIRNSIQFLTH 

FVMQSP^HAHVTEVVLATGEQTLYTAMMCVGYLTPRSQ 

LLQQHLSEMAMKTRGLTEKFQ* 

{SEQ ID NO: 801) 

Name: nuclear transport receptor 
Classification: receptor 



Celera Sequence No. : 142000013384139 

AGGCCACCACGGCGGCGCAGGTGTGAATGAATATGGAGGAGAACAGGGCCCACAGAAAGACGTGGTACCACATTTCGCGAAACGTATTGAGACGC 
GCCTGTGACCGGAGTCCCATCACATCGGCGATGGTGTCCAACTCATCGGTGTCGTCCGAGAACATGGCTAAAACGTAGTCGATGGATGTTGACGT 
GGTATCCCTAGTTACTAGTTGCTAGTTATTGCTTTTCCGTTGGTCAGTTGAGTGTCCAGCCGCATGTCTGGTGGGGATTCCCTTTCTATAGCATG 
TATCCAGCCAATATTCATGGACCTGCAAACAATAACTCCCTTTAGACTTGACCTTCTGGCATTTAATTATACTCACGCCAACACATGTGTGTGCT 
AATGGCCACACCCATACCACATTTGTCGTATTCTATAGCCGGATTTGGATTGCAGTCTTTTTCAGTTGCTTGATCCTATTTAGCATGGCCTTAAC 
TTCCGGTTAGCACTGTAGTTTAGCAAATGTTTGGCTTATTTTTAATTCGGTCAACAATCGATTGAACTTTCGTCACGGCACTCAATTAGCGCTCG 
GCTGGCGGCAGTGAATGCTAAATCGGGCCTAATTACCAGGCTACTGGTGTGCGGCAGCCCTAACTCCACATTTTGAGCAACACCAAACACTTTTA 
GTTGGTTTAATTTAATTTTATATTGTTCTTGTCTGCGATTTACTATGGCAATGGGAATGGAACGTTTGGGATTGGGCCAGTTCTGATAAGCAAAC 
AGCTGTGACTCAGTTGCGAACTGGTTCGGGGCTTGCAGCGCAGCTCTAAATTATTATACTCAAGATATCCATATGTAAGGCAGATATCGTAGTGA 
AAAGTCACAGGGTTTTTAAGGTATATTCTTTTGTAAGCTGTGTTCAAATGCACTGCTTTCAGCACAAGCATGCTTATCGTTATCGAGCAGCGATA 
ATCGGCGAACTAGTTCAACATAAACCTTCTTGGCACAAGTGACAACTCTGAAACCAATCCATAAACAAAAAATTTTGTGTAAACTAAAAACAAAA 
AGCGGAAGTTGAAAAACAACACCTTAGCGTATAATTTGGCCCCTTATGATTTGTTAAGTTTCTGTTTTGGAAAATGGGATGCTGAGATAGCCCTG 
AACAGGTGTAAGTCGTCCTCCTACACTATGACACATTTCTTCCGAGACGAACTTACATACATACGCACATATGTACACATATGTATGCATGTTTG 
TAACTGCATCCATATACGATATATGTATCTTACGCACTCGCGGCTATATACATATATATGCGGAATATATATGTATGTAAGCCGTCAATATGAGA 
ATCGGTTGAAGTTAAGTCTTCATTTAAAGTGGCTATAGTACATATGTATATACACACTTTTAACCAGCATATGCTTGTGAATCGAGATTAATCAA 
AGGCCCTGCATTGCAGACTTCATATACTTCAAGCCCCACAAAACCAACGACGCATTTAATCAATGTTCTCCAACGGATTGCCGTGCAATCTGTCG 
TTTCCGAGCTGTCTCAGTGTACCAAAAGACGAGATCGGTAATGAAGAGCTACTGCCATCCAATATGGGTACTCGCGAGCCTGTTATACTCAATGT 
CTACGATATGGTAAGCTTACTTACTCCTGTTGCCCATGCGGACATCGCACAACGATAAATGTAACCTTGCATTTGCAGTACTGGATCAACGAGTA 
CACCACCTCAATCGGCCTGGGCGTATTTCACTCCGGCGTAGAAGCCTTTGGCACCGAATTCGCATACGGCGGACATCCATTTCCGTTTACGGGGG 
TATTCGAGATCTCACCGAGGGATCACGACGAGTTGGGCGATCAGTTCCAGTTCCGGCAGAGCATCCAAATCGGCTGCACCGACTTTACGTACGAG 
GAGGTGCGACGTATCGTCGAGGAGCTGGGTAATCAGTTCCGTGGCGACCGATATCATCTCATGAACAACAATTGCAACCACTTCTCGGGCTCATT 
AACTCAGGTGAGACATAAATCGAGGAAACTTTCCACTGGGTGGATCCAATATCAAGCAATATTTTGGGTGTGCTTATGCATGATTCACCGATTGT 
CAGATACCGCCTGTTACAATCAGAAGTTCTTATTGTGAATCACCTCAAAAAGTAGGACAGATTACCATCCTAAACACGTCCAATACAATTAGATG 
CTTCCAAACGATTCGAGTTTAGGAGCAAGAAAATGCAAGCTTAAAAGCCCAAATGAATATTCATTTGACTATATCTGCAATTCCAATTACTCGTT 
CTAGATACTTTGCGGCCAAGAAATACCCAGCTGGGTCAATCGTCTAGCGCATTTCAGCTCCTGTGTGCCATTTCTACAAAGATGTTTGCCAAAGT 
AAGC C T T T C T A AGT G AAAC AAAAG C C AAGT T CAT GAT TAT T AAC T ACT C T AT GT T G T T T T T TAG AG AAT GGC T G AC A C C G AAT GCCTTGCAG C AA 
AG CAT T AC C AC AAT C C AAG AG CGC G AAG AC T C C G ACAAC AG TCCCCTTT GAG AC GT T AC C T G A CC G AAAAT AT G TCACTTGGC G AC AC AC A AGC G 
ATCATAGTTCACATAAAACAACAGTCCTGAGACCAACATTGTTAAGTTACGCAGAGCAAAAAGCCGGGTTACTCATGTCACATCAATTTCCAAGC 
CAACAACGCCCCCTAAATAGAAAGCAACAACTATTATTAATAGACTTAAATTAGGCGATATACATATACATACTTACATAGAACAAACCTACATT 
GAGACATACTTATACATACATGCGAAGTGACTAAATTATTGGCTATTGTAATTTGTATTTGTAATATGACTATTCAGAATTAAGATATTATTAAG 
TGGCCGATAAATTTAAGTAGCAGGCGCCGTTCAGCAGCAGTGTCATTTATAGAATTTTAAATTAACTACGAAGACATCGAGAAATCATCATTAGG 
CACTTGCTAAAATTGTACTAACAATACCGCAGACATATTGGTGAACGATATCAAAATGTTGCCCAATTTACTTGTTTACTTTTGTCAGCGGTGGG 
GATTTGGTTTTTTTGAATTCCCACCAAACATCGGAGCTCGATTATTGAGATGATTTCCTGGTTGATTTCGTTTACACAATTATACAGAGTTCATA 
AGCCGCATGTCTATCAAAACACCAAAATGATCTGTTATCAATAAATATATGTACTTGACATTCTGATATTTACACATGCATAAATACTATTATTA 
AACATTTTTAAAGAAAAATATATTTTGATCGAAAACATTCGGTTGCGTTTGATTTACAAGCGAGCAGGTTGGAACGTTTTTAATATTTTGGGTAT 
TTTTGGGGTCAAAAGTGGAATAACCAAATAATTAAGTACACGATATTCTCATAAATTACGCAGAATATGAAAACACTCAGACCCAAAACTCCTCA 
ACAAAAAACAACAAAATTATTGTACATATATCTTTGTATTATATTTATATATTTTAGACCCTCCAGATTAGATGCATCAGTTGAAAATAAGTTAA 
CTTTTC 

(SEQ ID NO: 802) 

Exon: 1001.. 1147 
Exon: 1442.. 1625 
Exon: 1694 . .2002 
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Exon: 2285. .2373 
Exon: 2440. .2521 
Start ATG: 1488 



Transcript No. : CT22269 

AAACCAATCCATAAACAAAAAATTTTGTGTAAACTAAAAACAAAAAGCGGAAGTTGAAAAACAACACCTTAGCGTATAATTT^ 
TTGTTAAGTTTCTGTTTTGGAAAATGGGATGCTGAGATAGCCCTGAACAGGTACTTCATATACTTCAAGCCCCACAAAACCAACGACGCATTTAA 
TCAATGTTCTCCAACGGATTGCCGTGCAATCTGTCGTTTCCGAGCTGTCTCAGTGTACCAAAAGACGAGATCGGTAATGAAGAGCTACTGCCATC 
CAATATGGGTACTCGCGAGCCTGTTATACTCAATGTCTACGATATGTACTGGATCAACGAGTACACCACCTCAATCGGCCTGGGCGTATTTCACT 
CCGGCGTAGAAGCCTTTGGCACCGAATTCGCATACGGCGGACATCCATTTCCGTTTACGGGGGTATTCGAGATCTCACCGAGGGATCACGACGAG 
TTGGGCGATCAGTTCCAGTTCCGGCAGAGCATCCAAATCGGCTGCACCGACTTTACGTACGAGGAGGTGCGACGTATCGTCGAGGAGCTGGGTAA 
TCAGTTCCGTGGCGACCGATATCATCTCATGAACAACAATTGCAACCACTTCTCGGGCTCATTAACTCAGATACTTTGCGGCCAAGAAATACCCA 
GCTGGGTCAATCGTCTAGCGCATTTCAGCTCCTGTGTGCCATTTCTACAAAGATGTTTGCCAAAAGAATGGCTGACACCGAATGCCTTGCAGCAA 
AGCATTACCACAATCCAAGAGCGCGAAGACTCCGACAACAGTCCCCTTTGA 
{SEQ ID NO: 803} 

Start ATG: 194 

MFSNGLPCNLSFPSCLSVPKDEIGNEELLPSNMGTREPVILNVYDMYWINEYTTSIGLGVFHSGVEAFGTEFAYGGHPFPFTGVFEISPRDHDEL 
GDQFQFRQSIQIGCTDFTYEEVRRIVEELGNQFRGDRYHLMNNNCNHFSGSLTQILCGQEIPSWVNRLAHFSSCVPFLQRCLPKEWLTPNALQQS 
ITTIQEREDSDNSPL* 
{ SEQ ID NO: 804) 



Celera Sequence No. : 142000013384094 

T C AC GT C AC AT AC C AC C AC AT T ACCT AC C AC GC AT GC C C C AAT C GC AT T AGG AGT T C CC T C AAC AC AT C T GC AC GC T T T GC GG A C AGC G AGAG AG 
ATTTATAAGATTATTTATTTTAGTGCCAAATAACAGCAGAAAACGCAAGCAAAAAGTCATTAACTATTTTAGTAGGATAAACTAAATTGTATATA 
GAGCGACTCGCTTAGAGCGCCCCATTACTCATTTTCCTTAATTCTTGAAGTTTAAACTGCGCATGCCCAAAGTATTTATATTTATATAATAATGC 
ATAACATTTGTAGAGCGATATTTAAGATTTTAATGCAGATTGATCTGAAGTTATATAGAAATAATATTGAACAGAGAACATAGGTAGCGAGAATT 
GTTGCTCATTTACATAGGTAGGCCGAAAAACTAAAAATTATAGTTGTTGTTTTAAGTCTTGTACAAATAACGCTAGCACTGCAAAGATTACGCAT 
AT C C T AT GT AC AAAT GT C GAT C AGT GT AAG AT CT AAAT AC C T AGAT T AT AAAGC AC AG C T GT G AAC C C C GG AAAT AT T T C C AAAAAT T C C T T AAG 
ATAAATGAATAGAAAAAATTATTAAAATCAAGTGATCAAAGAGGGACTTTCTGAAAATGGCAGTAGTTACTAAAACTACGTATGGAGACCCTGAA 
AAAAAAATATCCAAACATATCTTCATATAAATTTTAAATTCGAAATCCAATCAATTTATACATTTGTTACTATTAAACTTTACATTTGTCTTTGC 
CTTGAGAAACATATTCCCTTAAATAAGTTTCTTTAAATCCACACAATACACTTGAATAATTAAATGTGTTTTAAAATTCATCTTTTTTATATAAG 
ATGACTCGTCATCCAAGGCGATTTCCAACAATTCCACCTAAACGAAAATAAAACTGTCAACACATAAAAGTATAAGTTTTAATTATGTTTTCTGG 
AAGTTGTATAAAAATTGCTCTGATTACCGTACTTATATTAATTACGTATATTATAACCGCTGCATCCAGCTGGACAGAGCCCAAAAACAAAGGCT 
CATGAAACTGAAGCCAAACAGGGATTTGAAGGTGCTGGCTCCGTTGCCGGGATTGTCCAGGTTCAATTGTGAGGCCAGCTCCTCCGGCACATCCA 
GATAGACTCTATCCAAGCCGAAGCTGAAGATCAGTTTGCGCAACTTATCCACATCCACATAGTCACCGGCAGACGACCAGATGGTCAGGGTGCTC 
TCGTTTGTCTCGTTCACCGCGGCCACCAGTCGATGCAGTTGCTCCTCGCTGTTGGCGGCAATGCCAGCGCGCACGGGAAAGGTTATGGCCTGGCC 
AGTGGATAGGACATTGTTCTCGCTCAGCGTTTCCAGCATGTCATCACACTGCTGCTGCGTGTACTCGCCATCCCGGAAGTCGGCTCCCCAGTTAG 
TTGTCCAGCCAATGGAGAGCACTGCCTGTTTGTAGCGCATACAGCCGGCGAAGAATCGGTCCGCATCAACGGGTACGGTTCTATTCTGTTCCACG 
GGACCACTTAGAATATCGGCGTTTATCCACACAGGATACGTGGTAGGCTGCACAACACAGAGATATTGTTTAATTTGTTCTGATATCTGAGATTC 
GATGACCAAACTTACGTTCGGAATATTCACGTCCAGAATGTCCAGGGAACCCTCGAATACCTCGATCGACTTGAAGTCGAGCTTAACGCCCTTTT 
TTTGGTCCTCATGATCACGGTTGAAATTAATGATCTGGTTGAGAAATTCCGACAGCGTAAGATCAGAGACATTGGCCGGCGGATGGGCCATTATG 
GGCATGTCCTCGCCATCGCCATTCAGTTTGCCCAGCACAATGTCCGCTTCGATGAAGTCAATGCCACCTTGGGCAGGGAAATGATACTATTGATA 
AGTTGGTTACACGCGCATATTTGGCCAATGATCTTACTGGTTTCGGTTAGGACCTCGTCCAGCAATTGCTGGCTGTTGACGGCATGGGCCCAGGT 
AATGGCAGTGAGATTTGTGCCCCATACCTTCATCATATCCGACTGTTTGAGGAATATATATTCGATCTGGGAATCGTCTGGCGCTGTCTGCTCGA 
CCACCACTTGGTACTGATAATTGCTGGCGTTCAGAGCAGACCCATTGATTTTGGCCACGAGCACTGCGAGTGGACGTGAAATAGACGGCATTAAT 
ATTCCACAATTGGACAATGGGACGCACAGAGGGCATAATATGGCAGATACGCGTAGATAAGCCCATTTGCCTTCCTTACCACCCTACCAGCGAGA 
ATTGATACGCAAACACATGCAATCCAATCCAATGTGAATTTGTTTACTTCGCAATGAGCGTGTACATATGTATGTGTGCGCGAATGGACTCTAAT 
TAACTCCCTTTGTTATGCGATTTTTATGGAATGCAAAGCGCGTTGTGCCCTTCTTCCGCTTTCACCGAATGCTGCAATGCAACTGCACATCAATA 
ATTCTCTTCAATCTGTCAGCTGTGCAATGGCGCAAGCGTAGAGAGCAGACGTGAAGTGAGAATGGCGAGAAATGTGCTTAACTTCAAGAGACCGA 
CGCTCATATACCCTAGCTCTAAGAGTTTACATGACAACTTATTGACAAAAAATAACAAAATGTTGCTATTTTACGTTTTACCCCCTTTTTGCGAT 
TGTGTAAATGTATTATCAAGTAATGTTACCCAACACACTCGTCAGATTTAAATGTTTTAATTGAGTACAGGAGCATCGACTATCAGATACCTGTT 
AGTTTTTTTAAGTTATGTCAAAACTAAACTGTCTGGATCCTAATGTTTCCAAGCGACCCAGGGGTCGTATGTACATGAGTTAAAAACATTAAAAA 
ATGATGATATGTAATGATTTTGAAAAGTGCAAAAATACGAATTACCAAATAGTAGCGTTGTAAGTAGAAAGGTAATTTTTGAGTCCATTTTTTCT 
ATAGGTCTAAGGGCACTTGATCATGAACTTGGGACCGCCACGGTTAGAAACGAACTAAATGGCAATTTTTCAGTTTGGCTTATCGATAGCATTTG 
CTGATTTAATTAAACATTGATTAAGTATTGTAAATTGGTAGTGACAATCTATCCGCATATCCCAAAATCTAATCTTTTAAGTGGTCACGTTATCC 
GATGAACTCGCGGACTAGTTAAACTAGTTACTACATTATAAAGAAGAAAGAACAAACAAACGTTTTTATGGAAATTATTACCACACATCTTGTAA 
AATCATATATTTCATTAAGTACGATTTACATTTATTTGCTTATTAAAAGATAAAATGTTATTTAAATAAGTTTAGACCGCGGTCTCTGTTAGAAA 
AGCGTTGCATTTTAACATTTCTGTTAAATGAGCAGCAGTAATAGAAGAGAAAACGTAAGAGAGCGGTTCAAAGAGCGGCACTAGAGTGACCATGC 
GGCGGTACACAAGAAATGCGGCCAATAAGCGAGAGCACAAACTTTACCGAAACGAGCGCTAGAAGAACTGGCCACACTACACCCCCGACACTCAG 
TTTTAGTTCGAAAATTTAACGGCTCTGGAACCGTTCGGACGTGTCGCTCTCGGTGCGCGTGTGTGTGAGTGGCGAATATAAATATCGGGGGAAAA 
GCTCGCACGTGCGACGCTTGCGAAAAAAAAAATTGAAAATATATAGTAAAATTGTTGTTTTATTTATGAGTCGAACTAATGTCCCGTCAGTTGAC 
TAAGGACAACGCCAAGAAATAAAAAATAGTTT TAGGAGAAATAGTTTCACGACTT TATTCT TTGGATCTCAGCTTAAGACT AAAAAT C AAAT GC A 
AATTGGATTGAGGAGAAGCCTCAGTATTTGAACAATATTTGTATTTCGGGAGAGCCTCGCTCTTGGGTTCTTAAGATTAGGTAGCGTTGAAAGAG 
GGCAGTCAAATCTGCTTAGGTCAGGCTTTAGGATGTTTACAAGAACGGCTGCGCTGCCAAAGATAAACGAAT 
(SEQ ID NO: 805) 

Exon: 3715.. 3468 



FIGURE SHEET 436 



Exon: 3165. .2896 
Exon: 2153. .1938 
Exon: 1872.. 1001 

Start ATG : 2031 (Reverse strand: CAT) 
Transcript No. : CT22297 

GTTGTCCTTAGTCAACTGACGGGACATTAGTTCGACTCATAAATAAAACAACAATTTTACTATATATTTTCAATTTTTTTTTTCGCAAGCGTCGC 
ACGTGCGAGCTTTTCCCCCGATATTTATATTCGCCACTCACACACACGCGCACCGAGAGCGACACGTCCGAACGGTTCCAGAGCCGTTAAATTTT 
CGAACTAAAACTGAGTGTCGGGGGTGTAGTGTGGCCAGTTCTTCTAGCGCTCGTTTCGTAACTAGTTTAACTAGTCCGCGAGTTCATCGGATAAC 
GTGACCACTTAAAAGATTAGATTTTGGGATATGCGGATAGATTGTCACTACCAATTTACAATACTTAATCAATGTTTAATTAAATCAGCAAATGC 
TATCGATAAGCCAAACTGAAAAATTGCCATTTAGTTCGTTTCTAACCGTGGCGGTCCCAAGTTCATGATCAAGTGCCCTTAGACCTATAGAAAAA 
ATGGACTCAAAAATTACCTTTCTACTTACAACGCTACTATTTGTGCTCGTGGCCAAAATCAATGGGTCTGCTCTGAACGCCAGCAATTATCAGTA 
CCAAGTGGTGGTCGAGCAGACAGCGCCAGACGATTCCCAGATCGAATATATATTCCTCAAACAGTCGGATATGATGAAGGTATGGGGCACAAATC 
TCACTGCCATTACCTGGGCCCATGCCGTCAACAGCCAGCAATTGCTGGACGAGGTCCTAACCGAAACCAGTGGCATTGACTTCATCGAAGCGGAC 
AT T GTGC T GG GC AAAC T G AAT GGCG AT GGC G AGG AC AT GC C C AT AATG GC C CAT C C GC C GGC C AAT GT C T C T GAT CTTACGCTGT CGG AAT T T C T 
CAACCAGATCATTAATTTCAACCGTGATCATGAGGACCAAAAAAAGGGCGTTAAGCTCGACTTCAAGTCGATCGAGGTATTCGAGGGTTCCCTGG 
ACATTCTGGACGTGAATATTCCGAACGTAAGTTTGGTCATCGAATCTCAGATATCAGAACAAATTAAACAATATCTCTGTGTTGTGCAGCCTACC 
ACGTATCCTGTGTGGATAAACGCCGATATTCTAAGTGGTCCCGTGGAACAGAATAGAACCGTACCCGTTGATGCGGACCGATTCTTCGCCGGCTG 
TAT GC GC T AC AAAC AGGC AGT GC T C T CC AT T GGCT GG AC AAC T AAC TGG GG AGC C GAC T T CC GG GAT GGC G AG T AC AC GC AGC AGCAGT GT GAT G 
ACATGCTGGAAACGCTGAGCGAGAACAATGTCCTATCCACTGGCCAGGCCATAACCTTTCCCGTGCGCGCTGGCATTGCCGCCAACAGCGAGGAG 
CAACTGCATCGACTGGTGGCCGCGGTGAACGAGACAAACGAGAGCACCCTGACCATCTGGTCGTCTGCCGGTGACTATGTGGATGTGGATAAGTT 
GCGCAAACTGATCTTCAGCTTCGGCTTGGATAGAGTCTATCTGGATGTGCCGGAGGAGCTGGCCTCACAATTGAACCTGGACAATCCCGGCAACG 
GAGCCAGCACCTTCAAATCCCTGTTTGGCTTCAGTTTCATGAGCCTTTGTTTTTGGGCTCTGTCCAGCTGGATGCAGCGGTTATAA 
(SEQ ID NO: 806) 

Start ATG: 641 (Reverse strand: CAT) 

MMKWGTNLTAITWAHAVNSQQLLDEVLTETSGIDFIEADIVLGKLNGDGEDMPIMAHPPANVSDLTLSEFLNQIINFNRDHEDQKKGVKLDFKS 
IEVFEGSLDILDVNIPNVSLVIESQISEQIKQYLCVVQPTTYPVWINADILSGPVEQNRTVPVDADRFFAGCMRYKQAVLSIGWTTNWGADFRDG 
EYTQQQCDDMLETLSENNVLSTGQAITFPVRAGIAANSEEQLHRLVAAVNETNESTLTIWSSAGDYVDVDKLRKLIFSFGLDRVYLDVPEELASQ 
LNLDNPGNGASTFKSLFGFSFMSLCFWALSSWMQRL* 
(SEQ ID NO: 807) 

Classification: hypothetical 



Celera Sequence No. : 142000013384094 

TCACGTCACATACCACCACATTACCTACCACGCATGCCCCAATCGCATTAGGAGTTCCCTCAACACATCTGCACGCTTTGCGGACAGCGAGAGAG 
ATTTATAAGATTATTTATTTTAGTGCCAAATAACAGCAGAAAACGCAAGCAAAAAGTCATTAACTATTTTAGTAGGATAAACTAAATTGTATATA 
GAGCGACTCGCTTAGAGCGCCCCATTACTCATTTTCCTTAATTCTTGAAGTTTAAACTGCGCATGCCCAAAGTATTTATATTTATATAATAATGC 
ATAACATTTGTAGAGCGATATTTAAGATTTTAATGCAGATTGATCTGAAGTTATATAGAAATAATATTGAACAGAGAACATAGGTAGCGAGAATT 
GTTGCTCATTTACATAGGTAGGCCGAAAAACTAAAAATTATAGTTGTTGTTTTAAGTCTTGTACAAATAACGCTAGCACTGCAAAGATTACGCAT 
ATCCTATGTACAAATGTCGATCAGTGTAAGATCTAAATACCTAGATTATAAAGCACAGCTGTGAACCCCGGAAATATTTCCAAAAATTCCTTAAG 
ATAAATGAATAGAAAAAATTATTAAAATCAAGTGATCAAAGAGGGACTTTCTGAAAATGGCAGTAGTTACTAAAACTACGTATGGAGACCCTGAA 
AAAAAAATATCCAAACATATCTTCATATAAATTTTAAATTCGAAATCCAATCAATTTATACATTTGTTACTATTAAACTTTACATTTGTCTTTGC 
C T T G AGAAAC AT AT T C CC T T AAAT AAGT T T CT T T AAAT C C AC AC AAT AC AC T T G AAT AAT T AAAT GT G T TT T AAAAT T CAT C T TT T T T AT AT AAG 
ATGACTCGTCATCCAAGGCGATTTCCAACAATTCCACCTAAACGAAAATAAAACTGTCAACACATAAAAGTATAAGTTTTAATTATGTTTTCTGG 
AAGTTGTATAAAAATTGCTCTGATTACCGTACTTATATTAATTACGTATATTATAACCGCTGCATCCAGCTGGACAGAGCCCAAAAACAAAGGCT 
CATGAAACTGAAGCCAAACAGGGATTTGAAGGTGCTGGCTCCGTTGCCGGGATTGTCCAGGTTCAATTGTGAGGCCAGCTCCTCCGGCACATCCA 
GATAGACTCTATCCAAGCCGAAGCTGAAGATCAGTTTGCGCAACTTATCCACATCCACATAGTCACCGGCAGACGACCAGATGGTCAGGGTGCTC 
TCGTTTGTCTCGTTCACCGCGGCCACCAGTCGATGCAGTTGCTCCTCGCTGTTGGCGGCAATGCCAGCGCGCACGGGAAAGGTTATGGCCTGGCC 
AGTGGATAGGACATTGTTCTCGCTCAGCGTTTCCAGCATGTCATCACACTGCTGCTGCGTGTACTCGCCATCCCGGAAGTCGGCTCCCCAGTTAG 
TTGTCCAGCCAATGGAGAGCACTGCCTGTTTGTAGCGCATACAGCCGGCGAAGAATCGGTCCGCATCAACGGGTACGGTTCTATTCTGTTCCACG 
GGACCACTTAGAATATCGGCGTTTATCCACACAGGATACGTGGTAGGCTGCACAACACAGAGATATTGTTTAATTTGTTCTGATATCTGAGATTC 
GATGACCAAACTTACGTTCGGAATATTCACGTCCAGAATGTCCAGGGAACCCTCGAATACCTCGATCGACTTGAAGTCGAGCTTAACGCCCTTTT 
TTTGGTCCTCATGATCACGGTTGAAATTAATGATCTGGTTGAGAAATTCCGACAGCGTAAGATCAGAGACATTGGCCGGCGGATGGGCCATTATG 
GGCATGTCCTCGCCATCGCCATTCAGTTTGCCCAGCACAATGTCCGCTTCGATGAAGTCAATGCCACCTTGGGCAGGGAAATGATACTATTGATA 
AGTTGGTTACACGCGCATATTTGGCCAATGATCTTACTGGTTTCGGTTAGGACCTCGTCCAGCAATTGCTGGCTGTTGACGGCATGGGCCCAGGT 
AATGGCAGTGAGATTTGTGCCCCATACCTTCATCATATCCGACTGTTTGAGGAATATATATTCGATCTGGGAATCGTCTGGCGCTGTCTGCTCGA 
CCACCACTTGGTACTGATAATTGCTGGCGTTCAGAGCAGACCCATTGATTTTGGCCACGAGCACTGCGAGTGGACGTGAAATAGACGGCATTAAT 
ATTCCACAATTGGACAATGGGACGCACAGAGGGCATAATATGGCAGATACGCGTAGATAAGCCCATTTGCCTTCCTTACCACCCTACCAGCGAGA 
ATTGATACGCAAACACATGCAATCCAATCCAATGTGAATTTGTTTACTTCGCAATGAGCGTGTACATATGTATGTGTGCGCGAATGGACTCTAAT 
TAACTCCCTTTGTTATGCGATTTTTATGGAATGCAAAGCGCGTTGTGCCCTTCTTCCGCTTTCACCGAATGCTGCAATGCAACTGCACATCAATA 
ATTCTCTTCAATCTGTCAGCTGTGCAATGGCGCAAGCGTAGAGAGCAGACGTGAAGTGAGAATGGCGAGAAATGTGCTTAACTTCAAGAGACCGA 
CGCTCATATACCCTAGCTCTAAGAGTTTACATGACAACTTATTGACAAAAAATAACAAAATGTTGCTATTTTACGTTTTACCCCCTTTTTGCGAT 
TGTGTAAATGTATTATCAAGTAATGTTACCCAACACACTCGTCAGATTTAAATGTTTTAATTGAGTACAGGAGCATCGACTATCAGATACCTGTT 
AGTTTTTTTAAGTTATGTCAAAACTAAACTGTCTGGATCCTAATGTTTCCAAGCGACCCAGGGGTCGTATGTACATGAGTTAAAAACATTAAAAA 
ATGATGATATGTAATGATTTTGAAAAGTGCAAAAATACGAATTACCAAATAGTAGCGTTGTAAGTAGAAAGGTAATTTTTGAGTCCATTTTTTCT 
ATAGGTCTAAGGGCACTTGATCATGAACTTGGGACCGCCACGGTTAGAAACGAACTAAATGGCAATTTTTCAGTTTGGCTTATCGATAGCATTTG 
CTGATTTAATTAAACATTGATTAAGTATTGTAAATTGGTAGTGACAATCTATCCGCATATCCCAAAATCTAATCTTTTAAGTGGTCACGTTATCC 
GATGAACTCGCGGACTAGTTAAACTAGTTACTACATTATAAAGAAGAAAGAACAAACAAACGTTTTTATGGAAATTATTACCACACATCTTGTAA 



FIGURE 




AATCATATATTTCATTAAGTACGATTTACATTTATTTGCTTATTAAAAGATAAAATGTTATTTAAATAAGTTTAGACCGCGGTCTCTGTTAGAAA 
AGCGTTGCATTTTAACATTTCTGTTAAATGAGCAGCAGTAATAGAAGAGAAAACGTAAGAGAGCGGTTCAAAGAGCGGCACTAGAGTGACCATGC 

GGC 

(SEQ ID NO: 808) 

Exon: 2423. .2273 
Exon: 2153. .1938 
Exon: 1872.. 1001 

Start ATG: 2031 (Reverse strand: CAT) 
Transcript No. : CT22299 

GCACAACGCGCTTTGCATTCCATAAAAATCGCATAACAAAGGGAGTTAATTAGAGTCCATTCGCGCACACATACATATGTACACGCTCATTGCGA 
AGTAAACAAATTCACATTGGATTGGATTGCATGTGTTTGCGTATCAATTCTCGCTGTGCTCGTGGCCAAAATCAATGGGTCTGCTCTGAACGCCA 
GCAATTATCAGTACCAAGTGGTGGTCGAGCAGACAGCGCCAGACGATTCCCAGATCGAATATATATTCCTCAAACAGTCGGATATGATGAAGGTA 
TGGGGCACAAATCTCACTGCCATTACCTGGGCCCATGCCGTCAACAGCCAGCAATTGCTGGACGAGGTCCTAACCGAAACCAGTGGCATTGACTT 
CATCGAAGCGGACATTGTGCTGGGCAAACTGAATGGCGATGGCGAGGACATGCCCATAATGGCCCATCCGCCGGCCAATGTCTCTGATCTTACGC 
TGTCGGAATTTCTCAACCAGATCATTAATTTCAACCGTGATCATGAGGACCAAAAAAAGGGCGTTAAGCTCGACTTCAAGTCGATCGAGGTATTC 
G AGGG T T CC C T GG AC AT T CT GG AC GT G AAT AT T C CG AAC GT AAGT T T GGT C AT C GAAT C T C AG AT AT C AG AAC AAAT T AAAC AAT AT C T C T GT GT 
TGTGCAGCCTACCACGTATCCTGTGTGGATAAACGCCGATATTCTAAGTGGTCCCGTGGAACAGAATAGAACCGTACCCGTTGATGCGGACCGAT 
TCTTCGCCGGCTGTATGCGCTACAAACAGGCAGTGCTCTCCATTGGCTGGACAACTAACTGGGGAGCCGACTTCCGGGATGGCGAGTACACGCAG 
CAGCAGTGTGATGACATGCTGGAAACGCTGAGCGAGAACAATGTCCTATCCACTGGCCAGGCCATAACCTTTCCCGTGCGCGCTGGCATTGCCGC 
CAACAGCGAGGAGCAACTGCATCGACTGGTGGCCGCGGTGAACGAGACAAACGAGAGCACCCTGACCATCTGGTCGTCTGCCGGTGACTATGTGG 
ATGTGGATAAGTTGCGCAAACTGATCTTCAGCTTCGGCTTGGATAGAGTCTATCTGGATGTGCCGGAGGAGCTGGCCTCACAATTGAACCTGGAC 
AATCCCGGCAACGGAGCCAGCACCTTCAAATCCCTGTTTGGCTTCAGTTTCATGAGCCTTTGTTTTTGGGCTCTGTCCAGCTGGATGCAGCGGTT 

ATAA 

(SEQ ID NO: 809) 

Start ATG: 274 (Reverse strand: CAT) 

MMKVWGTNLTAITWAHAVNSQQLLDEVLTETSGIDFIEADIVLGKLNGDGEDMPIMAHPPANVSDLTLSEFLNQIINFNRDHEDQKKGVKLDFKS 
IEVFEGSLDILDVNIPNVSLVIESQISEQIKQYLCVVQPTTYPVWINADILSGPVEQNRTVPVDADRFFAGCMRYKQAVLSIGWTTNWGADFRDG 
EYTQQQCDDMLETLSENNVLSTGQAITFPVRAGIAANSEEQLHRLVAAVNETNESTLTIWSSAGDYVDVDKLRKLIFSFGLDRVYLDVPEELASQ 
LNLDNPGNGASTFKSLFGFS FMS LCFWALS SWMQRL * 
(SEQ ID NO: 810) 

Classification: hypothetical 



Celera Sequence No. : 142000013384576 

ACTGGGCTTGTACAGACCCAGTGCCGGATAGAAGGCGGTGTTTAGGATTCGCACTGGTCGCGGCGTGGTCAGTGGCACATCGTTTGGGTGCTCCG 
CTGGCGGCGGAGGCGGCTTCTTCGACTTGTACCACGGCACCGGGGCAGTGTAGAATATGTGCACTGCACTGATCACCACATCCAAAGTGGTACGC 
GACTGAACAAAGTGTACATTTTTAAGATTTAAGGACTAAGATTGGCCACCTTCTTGGACTGAACTGGAAACTTACAGACTGCTCCGCCTCGTGGT 
CCCGCTGCTGCTGCTGCTTCATCAGATCGATTTTGCTCTGAATGATGCGCTCCTTGATCTTCTGCTCGTCATACTTCTCGGGCACAATCACTCCG 
TTCTCCCGCTGCGGCTGGAGCAGCGCGACGGCCACCGGCGGCGGCTTCTCCGCCACTTGGGGCTGATCCCCGGGCCTGTAGGCGCGTGCGATGAA 
TCGCTCGTCCAGCAGCAGTTCGATGGCCGCATTGCGTCCGCTGGAGCCCAGCACTATGATGGTGATGACCAGCAGGGCGACGATGGCCAGCAGGA 
GTAGAATCTTCAGCTTACGCCGCGTTTTCACATACTTTTTTAGCAGCATTTTCGCAAGCGTGCATTTTCTCTCATTGTTTGGACGTCCTCGTTGT 
TTTTCGTCCCCAGTTTTTCGTACCTCCACCCCACAACCCCTTCTCTTCCTGTCGCAATGCAGCTGCCGGAAGTTCCTGGCCATTGCGCATGTCCT 
GAAACGGACACACTAAACTATGCCCAGCGAGTTCCCAATTACCATTTCACCATTAAAGCATATGCGAGTTTGTTATTTTTTTTGGACATTTTATT 
GTGAGTTGATTTATAATATTGTACACATTAATATTGCTAAGTTATTTAAGGGAATATGCCCCAGAAATTTTCACAAATAAAATAGTTTAAACGGA 
AATGTGAAACAAGTGGGTTTTAGAATCCATTCCCAAAATAAATACATTACTTATAATTCCGTTTTTGTTTGTCAACACATTGTCTGAATAGTTTT 
TATTTAGTTATAAAGTAGAGTTTTAAAAAAGGTTTTAGGTTTGTATCTATTCGAAGTGCATAGTCTTGTTCCGATTAACATGCACCTTCGAGCTC 
TAGCGTCCCTCAACTGGAAAAGAAAATCAAGTTAGCACTGCTGGTGGCTGGGGATTACTCATCATTGCAACTTACTGTATGTAGAGAGATCGATT 
TCCTCGGGCAGCTCACTGATGTTCACATCGAAACGATCCTGTACTTCGTTAAGTATCTTGGCGTCGTTCTCGTCCGAAACGAATGTGATTGCCAG 
TCCCTTGGTGCCGAAGCGACCGGCACGGGCCACGCGATGCAAGTAGGTGTCGGAATCCTCGGGCATGTCGTAGTTGAACACGATGTTCACACGCT 
CGATGTCCATGCCGCGGCCAAAGAGATTGGTGGCCACCAGAATGCGCTTCTGGAAGTCCTTGAACTGCTGGTAGCGATTCAGACGCTCCTCCTGG 
GTCATCCCACGATGGATGCCGATGGCGGGGAAGTTCTGCTCCGTCAGCAGCTGCGACAGAGCCACGCAACGTTGCACAGACTTCACAAAGATGAC 
CACCTATGACAGGGATAATCAAGTAAGTGGTAAATATAAGTTAAGACCACCGAATGACCTACCTGATTGAACTCGAGCACGTCGAGCAGTTCGAA 
CAGTTTCTTGTTCTTCTCGTTCTCCTTCAGATTGACGTAGTGCTGCTGCAGTCCGTGCAGCGTCAGCTTGGCCTCATCGTCGACGTAGACCTCCA 
TGGGCTGGATTTGGGAAATAGAATGTGGCGCATTTTAAAGGACACAATTAGGAGGGGTACAGAGAAATCGAACTTAGACTAGGCTCGCCTCGGAA 
CATACAACAATATGCATAGAATATTCTACATACGGCCACGCTACCACCACATCTCATCAACTAGTCTCTGAACTGCTCCCGAGGATCGATGATCG 
CTCTGAGAAGTGTGCGGATTACAGATGAGCGGGCCTATCGCATCGCATTCGCTCTGGATTGATTCAACTCGTCTCATCGGCCAGTCGGCGGGATC 
TTTGCTGTTTGCTTGTAGTTTTATTTAGGTATTGGTTTTTGCTAGCGTTGGTATTGTTAAATCTTTTGCGATTTCAATTGATTCATTCAGCGGTT 
ACTGCTTGTTTATGCTTTTGCAACGGTTGCCAATTTCCACGATTAACCATTTAGAAAATGGTGTCCGATAGTCATTAACTAAGTTTTTCGATACG 
AGTTATGCTCACAGAGCGCTGGACGTTTGGTTGTACGCGTTTAACCATTTGCAAGATGGTGGTTGGCTTATACGGAGGGGATCAAGATCGAATGT 
TCGATATCGCGTGGCATTAACAAAATAATAACTAATTGTCGTGATAGCGGTGGATGCGAGGTGGAGAAGTACGAGCATTTTGGTTATGAACACGT 
TGATATTTACATCTTGCATGAACTTTTTGCAAACGGGACGAATGTCCTTGCTCAATGTGGCAGAGAACATCATCACTTGTTTGCCGTGCGGCGTG 
CTACGGAAAATCTCTTGAACGTCACGACGCATATCTAAAAGAATGGTTTGGTTATTAGTAAAACGTCCGTCATTTGGTGATTATTAGTCAACTTA 
CCCAGCTGCTCCAGCATCTTGTCGCACTCGTCGAGCACAAAGTGCTTCAAAAGCTTCAGATTAAGTTTCTTGTTGCGAATGAGGGCGAGAATTCG 
GCCAGGGGTGCCCACCACAATATGCGGGGTGCCGCTCTTGAGGGTCTCCTCGTCCTTTTGAATAGCCATTCCGCCAAAGAAGACAGCCACCTAAA 
TAGAAAACATATCGAATTAATATATAGTAACTATATAGTCGAACCAAACTTTTTGTTGAATTATATACAATGGATTAAAACGTAGACTCACCTTG 
ACTGTGGGCATGTACTTGGAGAATCGCTCATACTCCTTGCTGATCTGGAAGGCCAGCTCGCGGGTGTGGCACATGACCAGGACGTGGCAGGTGTT 
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GTTGTCCGACGGCTCCAGCTGCTGCAGCGTGGCCAGAACGAAGACGGCGGTCTTACCCATGCCGGACTTGGCCTGACAGAGGATGTCCATGCCCA 
GTACGGCCTGCGGAATGCACTCGTGCTGAACTGCAAGCGGAGAGCATCGTTAGTGACTGTACCTTTCGTCACTCCAGGTGGAGTGCAAGTACTCA 
CCCTCCGAGGGATGCTCGAAGCCGCAGTCCACGATGGCGCGCAGGATCTCCGGTTTCAGGAGGAAATCGCGGAAGCCGGAACTGTGAATGGACAC 
ATAGGTGCCCTTGACATCCTTCTTGGGGGCCTCCTGGTTTTCGACCGCAGTGGTCTCGGTCTGCTCCTCATCCTCGTAGTCCAAAAGATCGTCAT 
TGTCGGCCATTTCTGCGGCTTTCTTACCTTTAACTAGTTATCGGCTATAAATTCGCAACTAAATGCTGGGAAATAAAATGGGTGCGACTAGATAA 
GTTACATGTGTTTGTTAAAGAAAATGCCGCTTTTTTTTGCGGAACCGGTGCCGGCGTTCCCTCCGCCATTTTGTTTTTTCACCGCACCGCATTAC 
ATGTATTTCGAAATAAAACGAAATACCCATTACAAGCGACGATTTTATTTAGTGCAGTGTGATTTTCAGTGTCAAACTCACCTTTTGTGCACGTT 
TATCACTTTAAAATACAGCTTTTAAAGTGAAATTAACTATTTTAAAATCTTTAACGTGATGCGGAGGAAAAAACAGCGCAGAGCAGTGTGACCGA 
AGTACGTTTAAGAACTCATGCGAGCTGAGCCGGGTGATATCGTTAAGTTATCGATAGTTGTGCACAACAAATCCATTTGTAAGCTTCAACTAAAA 
AGTTTTTATAAAATATATGCCTTCTAACGCTTTAAATTAAAATGGCTTGTTGAGCTTAAATTTAATTTTTTTTTTATCATTTTGGGGTCAAGCGG 
TTCTTAGTCTTTTATCTAGCGCGGCGTTTGGTCACACCAGGCATGCCGGAAAGTTATCGATTAGTCTCTGAAACCATCGCAACAAATTCGATTAC 
ACGTTCCAGCGTCGGCCGCGTTTTCCTGCTGCAAGATACTTCATTTTCAAATTTGTGCATTTTCCGTTCGTTATAAAAAGTCAGGTGAGTTGCGC 
TGCTATCTACGAGCATTAGGAAAACTTGTAGAAAAAGTTGAAATAACTGATTAAGTGCCGAACAATAGAGGCGGCACTGTTTCACTCAAAATCGA 
AATAACCCCCCTGCCTGTATTAGTGTAAAGTCGCTGTAAATATTGAATTAAAATCGCCATATTGTGCAATTTTTGGCTCCTTATCAAATCGTTTA 
AAGTGCGTCGACTGGATTAGCTAAGCGTTGCGATTGATTAAACTAGTATTAGCCGCGCCATTTGCCTCCACGCCCATAAACAAACGCCGACAAAC 
TCACTTCGTTTTCCTCGTCCGCTTTAGTACAGTCGCTTGAGTGGAAAGCGATGTGCGTGTGTGTGAGTGAGACGTAGAGGAGACTGCGAGAGAGA 
GAGAGAGCGCGTTAGAGAAACTCGTTTAGCTTCTGTTTTTGAGCGTTTTGTGCGTTTCCCCTCTTTCGTGTTCGGTTCTGTTTGGAATTTTATCA 
GGGCGCAAAAATAACATACCCTACACCGAACGAAAAAGTTCAAATATAAGTATTGCATATAAAACGCGTTTAGAAGGCACGCAATTGTGTTATTC 
AACTTTTCTTCAAGTGTTGTGTCCTAAAGAGGTCAGTCACAAAA 
(SEQ ID NO: 811) 

Exon: 3794.. 3692 
Exon: 3485.. 3232 
Exon: 3165.. 2942 
Exon: 2845.-2662 
Exon: 2599.. 2481 
Exon: 1809.. 1678 
Exon: 1618.. 1216 
Exon: 1153.. 1001 

Start ATG: 3430 (Reverse strand: CAT) 
Transcript No. : CT22389 

ACACTGCTCTGCGCTGTTTTTTCCTCCGCATCACGTTAAAGATTTTAAAATAGTTAATTTCACTTTAAAAGCTGTATTTTAAAGTGATAAACGTG 
CACAAAAGCATTTAGTTGCGAATTTATAGCCGATAACTAGTTAAAGGTAAGAAAGCCGCAGAAATGGCCGACAATGACGATCTTTTGGACTACGA 
GGATGAGGAGCAGACCGAGACCACTGCGGTCGAAAACCAGGAGGCCCCCAAGAAGGATGTCAAGGGCACCTATGTGTCCATTCACAGTTCCGGCT 
TCCGCGATTTCCTCCTGAAACCGGAGATCCTGCGCGCCATCGTGGACTGCGGCTTCGAGCATCCCTCGGAGGTTCAGCACGAGTGCATTCCGCAG 
GCCGTACTGGGCATGGACATCCTCTGTCAGGCCAAGTCCGGCATGGGTAAGACCGCCGTCTTCGTTCTGGCCACGCTGCAGCAGCTGGAGCCGTC 
GGACAACAACACCTGCCACGTCCTGGTCATGTGCCACACCCGCGAGCTGGCCTTCCAGATCAGCAAGGAGTATGAGCGATTCTCCAAGTACATGC 
CCACAGTCAAGGTGGCTGTCTTCTTTGGCGGAATGGCTATTCAAAAGGACGAGGAGACCCTCAAGAGCGGCACCCCGCATATTGTGGTGGGCACC 
CCTGGCCGAATTCTCGCCCTCATTCGCAACAAGAAACTTAATCTGAAGCTTTTGAAGCACTTTGTGCTCGACGAGTGCGACAAGATGCTGGAGCA 
GCTGGATATGCGTCGTGACGTTCAAGAGATTTTCCGTAGCACGCCGCACGGCAAACAAGTGATGATGTTCTCTGCCACATTGAGCAAGGACATTC 
GTCCCGTTTGCAAAAAGTTCATGCAAGATCCCATGGAGGTCTACGTCGACGATGAGGCCAAGCTGACGCTGCACGGACTGCAGCAGCACTACGTC 
AAT C T G AAGG AGAAC G AG AAG AAC AAGAAAC T GT T C G AAC T GC T C GAC GT G C TC G AGT T C A AT C AGGT G GT C AT C T T T GT G AAGT C T GTGC AAC G 
TTGCGTGGCTCTGTCGCAGCTGCTGACGGAGCAGAACTTCCCCGCCATCGGCATCCATCGTGGGATGACCCAGGAGGAGCGTCTGAATCGCTACC 
AGCAGTTCAAGGACTTCCAGAAGCGCATTCTGGTGGCCACCAATCTCTTTGGCCGCGGCATGGACATCGAGCGTGTGAACATCGTGTTCAACTAC 
GACATGCCCGAGGATTCCGACACCTACTTGCATCGCGTGGCCCGTGCCGGTCGCTTCGGCACCAAGGGACTGGCAATCACATTCGTTTCGGACGA 
GAACGACGCCAAGATACTTAACGAAGTACAGGATCGTTTCGATGTGAACATCAGTGAGCTGCCCGAGGAAATCGATCTCTCTACATACATTGAGG 
GACGCTAGAGCTCGAAGGTGCATGTTAATCGGAACAAGACTATGCACTTCGAATAGATACAAACCTAAAACCTTTTTTAAAACTCTACTTTATAA 
CTAAATAAAAACTATTCAGACAATGTGTTGACAAACAAAAACGGAATTATAA 
(SEQ ID NO: 812) 

Start ATG: 159 (Reverse strand: CAT) 

MADNDDLLDYEDEEQTETTAVENQEAPKKDVKGTYVSIHSSGFRDFLLKPEILRAIVDCGFEHPSEVQHECIPQAVLGMDILCQAKSGMGKTAVF 
VLATLQQLEPSDNNTCHVLVMCHTRELAFQISKEYERFSKYMPTVKVAVFFGGMAIQKDEETLKSGTPHIVVGTPGRILALIRNKKLNLKLLKHF 
VLDECDKMLEQLDMRRDVQEIFRSTPHGKQVMMFSATLSKDIRPVCKKFMQDPMEVYVDDEAKLTLHGLQQHYVNLKENEKNKKLFELLDVLEFN 
QVVIFVKSVQRCVALSQLLTEQNFPAIGIHRGMTQEERLNRYQQFKDFQKRI LVATNLFGRGMDIERVNIVFNYDMPEDSDTYLHRVARAGRFGT 
KGLAITFVSDENDAKILNEVQDRFDVNISELPEEIDLSTYIEGR* 
(SEQ ID NO: 813) 

Name: DEAD box helicase 
Classification: RNA_bmding 
Gene Symbol: Hel25E 
FlyBase ID: FBgn0014189 



Celera Sequence No. : 142000013383821 

TTGTTGTTGCATGTGAACTTTTTCGTATCCCCCAACGAACGAGAGAGCGAGATGGGGCTATGCTATGCGACTGCCGAAATAAGAGGGCCAATTTG 
TAGTTGTAGTTCTTGTTGTTGATGTTCTTGCTTCTGTTATTATTGTTGATGTGGTTCTTGCTGTTGTTGTTGATTTTGCAGGGGACCCCCAGTCG 
TTGTTATTGTTGTTAACGCTCCTCTTTTATCCGCTCTGCGCTTTGCAGTCCGCTTTGCCGTTAATTATTTTTTCACTCCTTCGCTTTGTGGCTCA 
TCTTCGCTTTATCCTGCGCGCACCGCACTCACTCTTTCTCGCTCGCCCTCACTCACTCACACAGCAGTAGCAGCAGCAACAACAACGTTGCACAC 
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AACATCAATATATTGGATATATATCAACACTTTTTCGCCTGCGTGCTTGTCAGTGTGTGTGTGCATGCGTGCGCTTGTCGATTTTCACTGCCTTC 
TTTTCTTTGTGTGTGTATATGCGCTGCGCGTTGTGTTTGTCTGCGTCTGTGCGTGTGTTAGGTTTACGTGTGTGTGTGAGTGAACGAACGAAATG 
CGTTTAGCGAACGTTTTTTTAAAAAATCCTTAAATTTTTTGTGGATTTTGGCGATTACCGACACTTAAAACGGCACTGCACACGCGTCCCCGAAA 
ACAACGGCACCTAGATTCGCTGGCCGAGTTACGATTTGTTTGATGCTACGCACGGTTTTCGGACACTTTTACACGGGGCTTTGTTTTTACCCATC 
GCATCGTCGACTCTTTTGCTCCTAAACGTTGTTGGTGTTAGTGGTCAGCGATAGTGCTCACACACTCGATATGGGTTTTCGATCATTTTCGGCTG 
AATATTCGATAAGGTCAATAGTATCGGTTTTCCGCGTACGATTTCCCAGTTTTCAAATGACGGTCACACTGACATTCATTCTTGACAATCGATAG 
TTTTTGTCGAGTGACGCTCTTAATATCGAAAAACCATAAGTAACGATAGGCCCATTTTGTTGGTGGTGTTCACAATCTTCTCGCGCTTTCCCAGG 
AACTGGTCAAGATGCCCATCGCCGGGCTGTGCATCGAACTGCGCCCGAACCTACGCAGCGGCGTGGTCTTCCTGCGATTCGACCAACAGGTTTCG 
CAGAGCCAGAAGACTCGCGTCGTCGTCCGCGACTACAATGTTTATATCAGCGAGAAACCTAAGAATCAGGCAATCGACTTGGATTCCAGCTCCGA 
AAGCGACAGCGAGGACTCCGACAAGCTGATGCTCATCCGGCACGCGCAGTATGGTATGGACATTGTCTGCCTGTCCACGTTCTTGGTCAGTGGCA 
GGAACATCAGTTTCCGCTTCAACTACAACCAGATTGATCTGGCCAGCGTGGATGGAAGCGCTGTGGACGTGCCTCTGGCGCCACTACTCCTCTCC 
TGCCAGGAGAACGAGCCCATCACTATCAACTGTCGTGATTGCCGAGCGGAATTGGTGGCAGGCCGAAGCTATCGCAGGTTGAGGGAGTTTCCCAG 
CCTCTTGGTGGATCCCACTGAGTTCTTTTGCCACAATCATGGACCAGCGGGCAAGACCCAACCAATTAGCTTAGTGCCCGCCGAAACGGATCTTT 
TCTATGGCCTCAACTATGTGGTCATCAACTTTAACGAAGAAAGTTCGTGCATGCTCAATCGCGACGATCACTTGTACTGCCAGCGCTGCATGCGT 
TATTTGGGGCTGACCATGTTCGATGGCGCCGCCGCTCGCATCTGGGCTGATGCAGTACGTTGGCGTCCCGCAGGAGCGGCGACAAATGCCCCCGA 
TCGCCACTTCTTCCAAAACTCCACGCTAACGCAGCTTTTCAAGCGTCTGCTGCACTCGCTGTGGCCTCAACCGTTGCCTCAGCTCTGTTTGAACA 
CCAGTCGTGCGGTGCTGGTCACCTCGCTGCCCAACCGAAATCAGCAATACATGTTTTTGCACGTGGTGGAGTCACAGCTGCGTGTGCTGCGTCGC 
ATTCGCCCAAATTCGAACCGACTACGCTGCTTTCGCGCCTGCAAGCTGTACTACGGCGTTTTTGGAGCCAATCCGCCGCTGCTGGAGAAGTGGCA 
GGCGCAGCAGACGCTGCCCCAAATGGATGTATCACCGAACATGTTCCTAAAGATTCAGAAGCGCCTCGAGACGAATGGCCACTTGATTCCGGACG 
CCTTGAGCACGAACTGTGCGGAGGAGCACCTGCAGTTGTCGTACTTCTTTTATGAGAACGAGGATCAACACGACGGCGGGGTCGTCGCTTCGGAT 
GT GT T T ACGG AT C AAAT GAG C AAC GT G T AT C AGAAGG AGGT GG AT C CG T AC G AG AC GG AT GC AGGAC AT GC C AGC GAG AGC GAT G AC G AGT AC T C 
GGATACGGACACCGCTTCGGGGGACACGGCGGCTTACTCGCTCGGCATGCGTACCCTGCAGAAACGCAGCCCCACTCCGCCGCGGTCTTTCGCCA 
AAATGGCCACCTCCGCCTCGCCGGAGGACAAGTAGTGACCCAATCCGATTCAAAAATCAAGCATAGCTTGTATGTCGGAAACTATGTTACTTGTT 
TTCACTATACAAAGACGGACTTGGCCCAATAATTTGTCAAAGATATTTTTTTTTTAAAGTTAGCCCTTGATTGATAATCAATTCCATCCAAAATA 
CTTTACTATGCCTTTGATTCAAATCCTCACATCCTTTGCTTTTGTTTATCGTTATACATTCGCTTAGAATGAATTAACAGTACGCGGAATTGATA 
ACGCCAAAGACTATTTATTGTTTTGTGAAGAAAATTTTGATTTTATTGCTTGACTCGATCGTTATATATATATATGTATATTATATATAGTTTAA 
CAAACGTTTTGCAACAAGTATTAGCTGGATAAACGACTTCTATATATATGTATGTACTAGCACAAATGCAGTGCGTTCCGTGACTGTAAGTGCCA 
ATGTCTTGGCCTTACAGCTGCGAAACGCACTGCACATGTTTTGAAGAAAAGTAAGAAGATTGTTTGCTTTTGCTTACACTTTGACGTTCCCTATT 
AATTCACTTAAGTTAACGAGAAAAAAATGAGAGTAGAGTAATAGTTTAAAAGTAAAATTGAAATATTGAAAATATTCAATTTTAATATATTTTCA 
AAGAACTGAATTAACAAGTTTTCCAAAATGTGACTTACCAAAACAAAAAAAAAAACAAAAAATGGACTAAACGAAAAAAGAAAAGTTATTGCTTA 
CGAAAAACATATATTATAAATATATACTAATATACTTCAATATGTTTCCAGAAACAAGATTAAAACAGAATATGTATGAACTATGATTTAGCTGC 
GTAAACATTTACACACAACTATTATTGTCCAACATTTGTAATTCTGTTCGCAACTACGCATGTCAACGAATAGCCGAGAAAAATTGTAATTTCAA 
AACTCTGATGTAAAGTATATGGAAAACAACTATTTATATGTATATATTTAAAGTCAAAATGGGATGAATGTAAGGCGGAGCAGTG 
(SEQ ID NO: 814) 

Exon: 1001.. 2505 
Start ATG: 1057 

Transcript No. : CT22601 

CCCATTTTGTTGGTGGTGTTCACAATCTTCTCGCGCTTTCCCAGGAACTGGTCAAGATGCCCATCGCCGGGCTGTGCATCGAACTGCGCCCGAAC 

CTACGCAGCGGCGTGGTCTTCCTGCGATTCGACCAACAGGTTTCGCAGAGCCAGAAGACTCGCGTCGTCGTCCGCGACTACAATGTTTATATCAG 

CGAGAAACCTAAGAATCAGGCAATCGACTTGGATTCCAGCTCCGAAAGCGACAGCGAGGACTCCGACAAGCTGATGCTCATCCGGCACGCGCAGT 

ATGGTATGGACATTGTCTGCCTGTCCACGTTCTTGGTCAGTGGCAGGAACATCAGTTTCCGCTTCAACTACAACCAGATTGATCTGGCCAGCGTG 

GATGGAAGCGCTGTGGACGTGCCTCTGGCGCCACTACTCCTCTCCTGCCAGGAGAACGAGCCCATCACTATCAACTGTCGTGATTGCCGAGCGGA 

ATTGGTGGCAGGCCGAAGCTATCGCAGGTTGAGGGAGTTTCCCAGCCTCTTGGTGGATCCCACTGAGTTCTTTTGCCACAATCATGGACCAGCGG 

GCAAGACCCAACCAATTAGCTTAGTGCCCGCCGAAACGGATCTTTTCTATGGCCTCAACTATGTGGTCATCAACTTTAACGAAGAAAGTTCGTGC 

ATGCTCAATCGCGACGATCACTTGTACTGCCAGCGCTGCATGCGTTATTTGGGGCTGACCATGTTCGATGGCGCCGCCGCTCGCATCTGGGCTGA 

TGCAGTACGTTGGCGTCCCGCAGGAGCGGCGACAAATGCCCCCGATCGCCACTTCTTCCAAAACTCCACGCTAACGCAGCTTTTCAAGCGTCTGC 

TGCACTCGCTGTGGCCTCAACCGTTGCCTCAGCTCTGTTTGAACACCAGTCGTGCGGTGCTGGTCACCTCGCTGCCCAACCGAAATCAGCAATAC 

ATGTTTTTGCACGTGGTGGAGTCACAGCTGCGTGTGCTGCGTCGCATTCGCCCAAATTCGAACCGACTACGCTGCTTTCGCGCCTGCAAGCTGTA 

CTACGGCGTTTTTGGAGCCAATCCGCCGCTGCTGGAGAAGTGGCAGGCGCAGCAGACGCTGCCCCAAATGGATGTATCACCGAACATGTTCCTAA 

AGATTCAGAAGCGCCTCGAGACGAATGGCCACTTGATTCCGGACGCCTTGAGCACGAACTGTGCGGAGGAGCACCTGCAGTTGTCGTACTTCTTT 

TATGAGAACGAGGATCAACACGACGGCGGGGTCGTCGCTTCGGATGTGTTTACGGATCAAATGAGCAACGTGTATCAGAAGGAGGTGGATCCGTA 

CGAGACGGATGCAGGACATGCCAGCGAGAGCGATGACGAGTACTCGGATACGGACACCGCTTCGGGGGACACGGCGGCTTACTCGCTCGGCATGC 

GTACCCTGCAGAAACGCAGCCCCACTCCGCCGCGGTCTTTCGCCAAAATGGCCACCTCCGCCTCGCCGGAGGACAAGTAG 

{ SEQ ID NO: 815) 

Start ATG: 57 

MPIAGLCIELRPNLRSGVVFLRFDQQVSQSQKTRVVVRDYNVYISEKPKNQAIDLDSSSESDSEDSDKLMLIRHAQYGMDIVCLSTFLVSGRNIS 
FRFNYNQIDLASVDGSAVDVPLAPLLLSCQENEPITINCRDCRAELVAGRSYRRLREFPSLLVDPTEFFCHNHGPAGKTQPISLVPAETDLFYGL 
NYVVINFNEESSCMLNRDDHLYCQRCMRYLGLTMFDGAAARIWADAVRWRPAGAATNAPDRHFFQNSTLTQLFKRLLHSLWPQPLPQLCLNTSRA 
VLVTSLPNRNQQYMFLHVVESQLRVLRRIRPNSNRLRCFRACKLYYGVFGANPPLLEKWQAQQTLPQMDVSPNMFLKIQKRLETNGHLIPDALST 
NCAEEHLQLSYFFYENEDQHDGGVVASDVFTDQMSNVYQKEVDPYETDAGHASESDDEYSDTDTASGDTAAYSLGMRTLQKRSPTPPRSFAKMAT 

SASPEDK* 

{SEQ ID NO: 816) 



Celera Sequence No. : 142000013384221 



FIGURE SHEET 440 



GGTTCCAGCCCAGCAATTAATTTGTGGCCTGGAGCGTTTTAATTACTCAGCATCAGTGTGCGCGCGAATCTCGGTCAGGTGAGACAAAGCCACAT 
ATGTATATCCGTTTCCCCGTAACTCCCAACTCACCCATCCAACAACTAAGGGCCAGGCCACAAAGCCAGCAAAGCGCTAACATCTCTGTACCAAA 
GTCGACGTCACTGTGCCACGGGAATCGGGTGTGTGGATTTTATAGCGTACGATAAGCTAGACGAGGGCCAGGTGTCAATCAAGACTGGAGTTTTG 
GACGGACTGCTGGCCAGGTTGAAGCCTCGGTCGGAAGGGTCTCGGCGTTATCATTCCAACTCAGTTGGGCGTCCCAAGCGCGGCGCTGTTAAATT 
GGGTTAAGCCCAGTTCATCGGTGCATACTAATCTAAGGTGCGTTGCCCCGGTATTGAAAAGTGGAACAATATAAATATTTGAAAATATGTTCGGT 
AATTTAGTTAAATAAGTTACAATAACAATTACATTTTTATAAGTGTTTTTATGAAAAATGCTACATCTTATGTACACATAGATTAAATATTATTT 
AGATTTTCAGCGATGGCTATGTAAATTTTTCACAAATGCAATACTTTAATGCACTAACATTATAACAAATTTCAAATTTATAGAAAACTTTAAGT 
ATTCAAAATCTAAAACGGTCATGTAAAATACCATACATCAAACCATGTAAGCATACATCTTACATGTAAAATATTGCTGGCATTTGACCTAAGTA 
GATTAAGTAGATTACAAAAATATGTAATCCGCAATTTCAAAGAAACATTTTTGAGTGCATCTTATAAAACTCTATTTTTTGCCAATGCTATGTAA 
AAGAATAATTAAACTTTGTTAATTGTTGGGTGCGCAAATGAGTAAAGCCAGAATTACGTGGCCTATTACATTGCCTAATTTTGTTTAGTATATCT 
AGATGTGTTGTTGCATTTTCGAAAATATCACTTGCTGCCACACTGGCTGTGCGTCATTTTCAGAAACGCCGCGAATATTTTGCCATTGAAATCTA 
GCT AAG C AT T GG AAT T T CGC AAC GT GC AGC AGCAAAGC AAAC T AC AAAC AT G T C G GGC AC AT C G C AG AAAC AC A GG AAC T T C GT T GC G G AGC C AA 
TGGGCAACAAGTCGGTGACGGAACTGGCCGGAATTGGGGAAACCCTCGGTGGACGCTTGAAGGACGCTGGATTCGATATGGTGCGTGTGTAAATA 
CGATGACGCCATGTGCGCAAATAAACAATGCCAGTCAAGTGCTAACGATCATTTCCATTCGTTTTAGGCCTACACCGTTTTGGGACAGTATCTGG 
TGCTGAAAAAGGACGAGGAGCTGTTCAAGGACTGGATGAAGGAGGTGTGCCACGCCAGCTCCAAACAGGCATCCGATTGCTACAACTGTCTCAAC 
GATTGGTGCGAGGAGTTCTTGTAAGAGGTGGACACACAAGCCATCCGAGTAATTCGATCAAAAGGAATCTGCTAAAGGCATCGGCCAATCATTTT 
TATTCGCAGTCAACGAAAGTGGCCCAATGCTAGCTAAGCATTTGCCGTGTTCACTGTTAGTATGTATAGCTGTAGGGTGTAATTTAATGTCTAAT 
CCGAATAAATGTACAGTCATGGAAACGATCCTGCATTTGGATCTCTGTTCTGCGAAAAGTACAGTAGGTGTAAAACATTTGTTGCTGGCCGGCGA 
AGCAGCCTAATGCAGATTTTTATTAATTTGATTAGGGGACAACAATAAGCAACACTATAGAAGGCAGATCGGTTGGCGACTATATCTTCTCTATG 
TTTTCGATTGTTGGCGAGTAGGAGTCTCTGACTCGCTGAAATAAGCCTCTGTCTTTTTGATGTTTCTTTATCAGCGGGCTGATGTCGCCTAATTT 
TGAAATCTATTTCTTTTTGTTTTTCGGAAATGATCTTTTATAATGATGGACTTAGAACCTTTTAAATCCCAATCGCTGTTTTGTTTGGGGATTAT 
GTGTAAGCGGTTAATATATATTTTATAATGAAACCCCTTTTGTCAAAATTAATTGTCCTCGAGTAATTAATTAGTACCGAGTATTTGATTATAAA 
TACAACTTTTTTTAAATAAAAATATATATTTTTAAAAACTTATTTTTAAAGTGAGTGGTATTAAATAGGATTTCTTTTCCCTGAAGTGGTATCAT 
ATGGTATTTATTTTTGGTCCCGTGGTATTTTTTATCGTTATCGGCGCGGTCACGCTGCTAAGAATTACATTACGCGTGAATACCAATGAAAATTA 
AGAGAGAAATTGCAAAACACAGCCAAGCGGTGCAATAAATCGCAAAGTAAGAGGCGCAAAACGCGCAAATACCGTTTGCGCGGTTAAAAAATGCA 
AAAACACCAAACGATAGATTAAACAAAGTGAAATTAACTCGGCTTTGGCATATGCGAACACCAGTGTGCCAGTGTGTGTGTGTTTTTCTTTGTAT 
ATAGACTCGCAGAGCGAGACGGCCATACACGACGGCAGACGGGACGGTGCAGGTGCGCGAGTGCGAGCGGGAAAAAGGCTGTGCGGTCACTGCTA 
CTCTGTGTGTTATTTCAGAGTTACAGGCCAAACATTCTGATGTGCGGCCGAATCAAATCGACTTTTTGTGTGCATCGCCCCAGCGTTCGTCTTAA 
TTAA 

(SEQ ID NO: 817) 

Exon: 1001.. 1220 
Exon: 1303.. 1664 
Start ATG: 1095 

Transcript No. : CT22713 

GCGTCATTTTCAGAAACGCCGCGAATATTTTGCCATTGAAATCTAGCTAAGCATTGGAATTTCGCAACGTGCAGCAGCAAAGCAAACTACAAACA 
TGTCGGGCACATCGCAGAAACACAGGAACTTCGTTGCGGAGCCAATGGGCAACAAGTCGGTGACGGAACTGGCCGGAATTGGGGAAACCCTCGGT 
GGACGCTTGAAGGACGCTGGATTCGATATGGCCTACACCGTTTTGGGACAGTATCTGGTGCTGAAAAAGGACGAGGAGCTGTTCAAGGACTGGAT 
GAAGGAGGTGTGCCACGCCAGCTCCAAACAGGCATCCGATTGCTACAACTGTCTCAACGATTGGTGCGAGGAGTTCTTGTAAGAGGTGGACACAC 
AAGCCATCCGAGTAATTCGATCAAAAGGAATCTGCTAAAGGCATCGGCCAATCATTTTTATTCGCAGTCAACGAAAGTGGCCCAATGCTAGCTAA 
GCATTTGCCGTGTTCACTGTTAGTATGTATAGCTGTAGGGTGTAATTTAATGTCTAATCCGAATAAATGTACAGTCATGGAAACGATCCTGCATT 
TGGATCTCTGTT 
(SEQ ID NO: 818) 

Start ATG: 95 

MSGTSQKHRNFVAEPMGNKSVTELAGIGETLGGRLKDAGFDI^YTVLGQYLVLKKDEELFKDWMKEVCHASSKQASDCYNCLNDWCEEFL* 
(SEQ ID NO: 819) 

Classxf ication: hypothetical 



Celera Sequence No. : 142000013383755 

TTGCCCGACAGATCCAGCTCGCCGAGCGACTGACAGAGCTTAGTGCCCCGCAGCCAGCACCAGTCATTCTGAGTACCCAGTAAGTTGTTGCCCAG 
GTGCAGCGAGGTGAGCGGAAGACGACCCAGCTCCGAGGGAATCTAAAAGGCAGAATTAAGCTTAACTCAGTGGTCTAAAATGATAACTAATTAGT 
CAATCCACCTTGGTCAGCTTGTTACCGCTGACATCCAGCTTTGTGAGGTTGCGCAGGGTGCATATCTCGAAGCTCAGCTTCACTAGCTGGCTGTT 
GTTGATGGTCAGCGATTTGAGGGTGCGCGGAAATCCCTTGATGGGATACTCGCTCCGCTTGCTGATTACCATGCGCACCTGGGGCTGTGCCTTCT 
GCGGAATTGCCGTAGCGGCGTTTATGTTCAGTCTCAAATTGATGGCGTCCTTGCCATCCATGCCCAGCTTGAGGGTCTGCAGGAATCCTTTCAGC 
TGGATAGGATCGCACTTAATAAGCAGGTTATCTGGCGGCTGTATGAAACCAATGGTGGTCTTCCCATCCAGCACAAACTTGGTGTGCACGACATG 
AATGTTATCCTTGACCTTGTAGCGATTTCCAGTCTTGTTTTGTCCACTAAAGAGCACCATTTCCAGCTCCTTCTTGGTGTTTCCATTCGCATCCC 
TGGCGCTTTGCTTGTAGCCAATGGCCAGTGTGGATTTCACGGGACGTGGCGTTTTGTTCGTCTGGGTGGCGCGATTCACCACCTGAACTTCGCAC 
AAAATCTTCATGGCGAGGCAGTGATTAGACAATAAACAAAAAAAAAACCCGCTAAATAAAACTAAACATTTGTTTTGGCCTCAAGAGTTGCCACC 
TGATGGTTGGGACAGCTGATGATATCGAGCAGCCATTCGATAGTTAGTGGTGGACCACATACCGATGTTTTGGATTTTAGGAACTAGTTTCTAAG 
AGCAGCATCAAAAGCTCTCGAAAAAAAAACAATTGGCGCAGGACAAGTTACAAAACCAAAACAAGCAACGGCAATTGTTTAAAAACTGCAAAGTT 
ACTCCCATTTAAATACCGGAAAATGTTATTTCGCAAGGCTCGCCGACACTTTCTGCCGTTCTACATGTTCAATGGATTCTTGTGACAGCGCAACG 
AACGATATCAGCGTTTGCGTTACAATAGCTCAAATATTGGTCATCCGAGGCATGAGCTTACAGTAAGTAATGTGTGCAACATATGGAGCAAGACA 
GTGGTCATGCAATACAAAGAACACTCAAATTTTAAAAACTGAAATAAAATGCAAAAATGTTTCTGCAGCACAACAGCCTGTTGGACAACTATGCG 
AACAGCAATGTGATAATAAACATTGACAACTGCGCGCTCTACAGCACCAATGTAAGAATTTAAATCTCAAAACCCATTTCGAATACTAACGATCT 
TATTTTCTTTTCGCTGATCTGGCAACGCCAAGCGGCCAGCTGATCGAAAGTCGATAACCGACGGCTCGCACGATCGACAGCGACGTCGCCGCCAG 
CGGTTTTTAGTCATTTTTCGGTTTTTCTGCGCAACGGTCGCTTTCGGTTTGGCAATCGCCTGCGTCATCTACACGCTCCACACAATGGGCTGGAT 
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TTTTGCTGTGCTCGTCGCACTGGTGGCTCTGCTACTGACGAAACCCGGCTGGCGCTGGTTCTACATAGCGGGTGCGACCGCTAGTCGCGATCTAA 
CGTGAGTGCTGCTCTTGTACAGGGAGGGCATACTGTACCCAAGACCAAGTCCAGTGATAAGTTTACAATACCGCCCCCGTGCGGTTTGCATTTCA 
ATACACAACAACATCTTGGGGGCTTTTCTTTTGTGAATCGATTGAGAACATAAGCGAACTGAAAAATCAGCCGAGTGCGAGGTTCACTTCGTAGC 
CGAAAGTCAGAAACCACAGTGGAAAAAACTCGTTTTCCATCTATTTGGGTCGCCAACAGCAGCATTCATTTCCAGTTGGGAAAACCATAAAAGAG 
AAAACTTAACTGGCGAAATGATCCATGCGACATTGTGACGAATAGTCAACGCAAATTATTAGTTTTTGCCACTCGATTACGAGCAGTTTATTGAC 
TTCTTCAGGTTCAAGGTCGCACAGCCCAAAAAAAAAAAAAAACCTACGAGAAACCAAAACAAAACTCGCTTTTATTTACCATTGACTTCATACGA 
TCGTCGCCCAGCGAAACCCTCCGCGCATACTAATCGATCCGAATGCGATCGAGTGCGTCAGGCGGAAGACCACTGGCACACTAACCGATCTGTTC 
TTGAAAACGTACGCCAAAGAGCCAAAGCTTGGGTCATTATGATGGCGGAAGTGTCGCCTGCGTCTTGGAATTTTCGTTTTTCCAGGGTTGCCAGG 
GCTACTTACATTTTGAAATTTTGCAGAAAGGTTATTAAGTATTCAGAATTTTTAAGCAGCTCTTTACTTTTAAAATTGTTATAGTTAAGAATGTC 
TAGGACTTGGATATCAAAGTAAGTTTGCTAACCTCATAAATAGTATAAATTTAAAGAAATTACAATAATATTATAAACCACATAGATTTTTAATT 
TTTAAGTGATTCACTTTTTTATGGCTACAGTGGAAGTGTTAAGATTCCAGGGTTGCCACGGCAAGCCAAACTATACCAATTGTAGACCAAGCAAT 
AAGCGAACGAAAAGCTTGTCAAAATTATGTATGTCTCATCAATATACTTTGTGTCGTCTGTCGTTACGATAATTAAAAAAGCGGAAGTGTCGCCT 
TTTGACTTGTGTATGTATTTTGAAGACGGCTATAGATATTGCTTTTTAGCACAGGTATTGTTAGTAATATACATTTTTACAACCAAAACCATCTA 
ACCAAACACCTTAACTAATAGGAGCAAAATCTTTACACTTCATTTTAACTAAGTATTTTAAGATTCAATTAAGTTTGAATTGAGCTATAGCCTTC 
GCAAATATTCACACATTGCAACGCATTTTACAAAATTATGTTTGACCATAAATAAGGTTAAATACGTAAAGTTATAGCTATTAAAATAGCAAGTA 
TTCTCAGCAGCATTTAATTTGCTCGGCAGTTTTCGCTTAATGAGTCAAACTCGGCAGATCAAGTTCCATGCCAAGGCCAACACTGGTTTCCCATT 
TACCCAGCTCGTCTATATGGCCATTCGTATATTTCGCGCTCACAACAGATTCTTAAGATCAACTTCAAGTCTGGCCGGTCAACCTTGGCCACGTA 
TCGAAGTTTTTTGCTCGGCCACCAAGTAGACGAATCCAATCTGCGGGCAAACATGTGCAGCAACTCCAAAAGGAGCCAAGCAAACAAAGCACTCG 
ATGGGGGTCATAAGGCCTGATTAAAAACCCTATAAATACGCGGCTAATGTGCGTTAAGCGCTAAGCTAAGTTAACTAGCTGTTACCGGCAATCTT 
ATCGACACTTCACTGCCCCGTTGACGGTGCAAAGTACGCACGGGTGCAAAGCGAATTCAAATTGAGATCGTCGTTGTGGTCGAGTCCCAAGTCGA 
GTGTAGGTTTTTTTTCCCCCTTTTCTAATACAGTTCCCATCGAACTATTGTATTTGCGATTTGGATTAGTTCTACCTCACACTCTTTCCTATCGT 
ATCAAATCTCTGTGGATCTTAAGTGCTTGCAAATGCCTTTCAGACCGTAGCATCAGCTCTTCTAGAACTTGTAGAGGAAACTATTAGTATTACAG 
AGGCAGCTCTTTATAGTTTATACAAATTAATACCGTGTAACAACTTGAAGGCATCTATCTTTATAAGAAATACACATGATTACAAAAAAATCAAA 
T T T C T AAAC T C AC AT T T GT C AAT AC AAGT G C AAT T TAT T T C AAC C GC T AGT C AAT GAT AAAC ATT AT CT G C C T T CG T TC AGT GT GC AGGT T T AT T 
TATTTTGCCGATAGCTTCTTCAATATTGATAAACATGGGCCAAACAACAGTTACAGTTTAAGTATAAAACTCTTAATCCTCATAAGTGATAAGAT 
ACTAAGATGTGTAAGAGTTCTGTTCCAATGATTATACGATTGCAAATGAAAATATAAACGTCAGATTTAGATATCTGTAGCTATTGGTTAAGCTT 
TCTCCCGGAAAATTCAAAAGCCCTTGAAACTGGCTAAACAAATACGATTTTATCTCAACCTTCCCGTGAACTTTGCATTACTCAGCTCGAAATTA 
GCAGCGTCATTAAGTTGTGGCTCGCATGGTCGGACTAGGTTACCACGTTCGGATTGAATTGCAACCACTTCGAGCGGGTTTATCTCTGGGTTTAT 
CGCATCACGGCTGTTGCGCTTGTCGAGCTTCGATTTCTGTTGGCTATTTATACAGACCCCAACCACTAACCATAAAACACACGCGATTGATGGCA 
AAGATAATGCCGGTACGGCTACTATCCTCGACTAGCCGCTAATCAAACACTTTCCCTCCAAAACTGAGTAACATATCAATATCCGTTGGTTATAC 
ACGTTATTTGTGGATTTTCCATTGCAAAGAAGATACGCTATCTTCAAAAATATTGTCTAGTCAACACACTTTTATATAGCACTAACTAATTTAGC 
ATTGTTCGTTT G AAG AT T C C T C G ATT TAT C AG GT T T GAT AT T T AAT T T AC T AAT C AAAT T T C C CC T T AAC AG AGCT C T CT GGGC C T AT AT C AAGC 
TGCTGAGGTACACGAAGCGCCATGAGCGGCTCAACTACACGGTGGCGGACGTCTTCGAACGAAATGTTCAGGCCCATCCGGACAAGGTGGCTGTG 
GTCAGTGAGACGCAACGCTGGACCTTCCGTCAGGTGAACGAGCATGCGAACAAGGTGGCCAATGTGCTGCAGGCTCAGGGCTACAAAAAGGGCGA 
TGTGGTGGCCCTGTTGCTGGAGAACCGCGCCGAGTACGTGGCCACCTGGCTGGGTCTCTCCAAGATCGGTGTGATCACACCGCTGATCAACACGA 
ATCTGCGCGGTCCCTCCCTGCTGCACAGCATCACGGTGGCCCATTGCTCGGCTCTCATTTACGGCGAGGACTTCCTGGAAGCTGTCACCGACGTG 
GCCAAGGATCTGCCAGCGAACCTCACACTCTTCCAGTTCAACAACGAGAACAACAACAGCGAGACGGAAAAGAACATACCGCAGGCCAAGAATCT 
GAACGCGCTGCTGACCACGGCCAGCTATGAGAAGCCTAACAAGACGCAGGTTAACCACCACGACAAGCTGGTCTACATCTACACCTCCGGCACCA 
CAGGATTGCCAAAGGCTGCGGTTATCTCTCACTCCCGGTGAGTGCAATAGCTTTTCTGAACTAAGTTTAAATGGAAAGGTTCTAAACAAATTTGT 
ACTTTAGTTATCTGTTTATCGCTGCTGGCATCCACTACACCATGGGTTTCCAGGAGGAGGACATCTTCTACACGCCCTTGCCTTTGTACCACACC 
GCTGGTGGCATTATGTGCATGGGTCAGTCGGTGCTCTTTGGCTCCACGGTCTCCATTCGCAAGAAGTTCTCGGCATCCAACTATTTCGCCGACTG 
CGCCAAGTATAATGCAACTGTAAGTGAGATTTATGGCCTAACTAAATCACTCTCTAATTTGGCTTTTGTGATTTCCAGATTGGTCAGTATATCGG 
TGAGATGGCTCGCTACATTCTAGCTACGAAACCCTCGGAATACGACCAGAAACACCGAGTGCGTCTGGTCTTTGGAAACGGACTGCGACCGCAGA 
TTTGGCCACAGTTTGTGCAGCGCTTCAACATTGCCAAGGTTGGCGAGTTCTACGGCGCCACCGAGGGTAATGCGAACATCATGAATCATGACAAC 
ACGGTGGGCGCCATCGGCTTTGTGTCGCGCATCCTGCCCAAGATCTACCCAATCTCGATCATTCGCGCCGATCCGGACACCGGAGAGCCCATTAG 
AGATAGGAATGGCCTATGCCAACTGTGCGCTCCCAACGAGCCAGGCGTATTCATCGGCAAGATCGTCAAAGGAAATCCTTCTCGCGAATTCCTCG 
GATACGTCGATGAAAAGGCCTCCGCGAAGAAGATTGTTAAGGATGTGTTCAAGCATGGCGATATGGCTTTCATCTCCGGAGATCTGCTGGTTGCC 
GACGAGAAGGGTTATCTGTACTTCAAGGATCGCACCGGTGACACCTTCCGCTGGAAGGGCGAGAATGTTTCCACCAGCGAGGTGGAGGCGCAAGT 
CAGCAATGTGGCCGGTTACAAGGATACCGTCGTTTACGGCGTAACCATTCCGCACACCGAGGGAAGGGCCGGCATGGCCGCCATCTATGATCCGG 
AGCGAGAATTGGACCTCGACGTCTTCGCCGCTAGCTTGGCCAAGGTGCTGCCCGCGTACGCTCGTCCCCAGATCATTCGATTGCTCACCAAGGTG 
GACCTGACTGGAACCTTTAAGCTGCGCAAGGTAGACCTGCAGAAGGAGGGCTACGATCCGAACGCGATCAAGGACGCGCTGTACTACCAGACTTC 
CAAGGGTCGGTACGAGCTGCTCACGCCCCAGGTTTACGACCAGGTGCAGCGCAACGAAATCCGCTTCTAAGAGCTGCAATAGAGTTGTGTCTGAA 
CCTTGCCTTTTGCCCAATATGCTGTTAATTAGTTTGTAAGGCTAAGTGTAGTAGAGGAAAATCGGGGGAAATCGGCAGCAAAGATCATTCAGCCT 
AGGAGAGATGCATCCGAAGCACATTTCCATGTCAACAATGCACTTTTGTATATCGTAAGCATATATATATCGTATATCGTAAACGTAGTTGTATC 
TGCATTTGTGTAGATGATAGCCTCCTATACGCATTTCAATTGTTTTTAGCGTGCTAAAGAACCTTGTTAAATGCAATTTCAGCTATTGTTTAGTC 
AGTTTTAGTGGCATTTACACTTCCATTCTCGTTGCGTTTCGTTTTTGCCTGTACATATGAGAAGCTCTGATGTTTTTGTATCAAATAAAGTTTAT 
TCCTTCACCACGGACCACGTATTTAAATTCGCAGTCTATCGTGCGGCAGAATGACACCATTCCACCTGCATACAAATGATTATGTGCGATTCGCT 
CCTCGACTCTCCGATCTATAGAAATGCATAGAGATATACTTACACTCATACCCATGCTGTGAGTATTCAATTAACCAAACTTGAACCTTGTCGAC 
GTTCAAGTGCATGAAATTAACGAGAATTTTTAATAGCATTATATAATAATCTGTGAATCAGCTAACGACAATAACGTTTAATAGAAGATGCGCCC 
TTAATTTAATTATGGTTCTACTCCGCTTTCTGTGATTTTTGAAAAGTTGAGAATATAACTACTCACTGCTTGCTTACTCTTTCATTATCGCACTA 
TTAGAGATACCCATTTAATGGACAGCAATTAAGGGGTAATTATATTCTAGCAGTTAGCTTGTTTAAGCCAGTGCTAGTATTGGTAGCATCATTCA 
ATCAGAAAACGTTTCAAATCTTTTTGATATTTTAGAAGAATAATGTAACACAATGAGTCTTTCTTTCAAAGAAAAACCAAATAAAGTACTTACAC 
ATCCACTTTTAAGAGAACTTAAAGAATATCTCAATTTCTTTGGAACATGCTTACTATATCTGGATATTAGGATGATAGTTCCTAATAGTTCAGAG 
CTATTCCATTTGATATTAAACTTTGCAATGGAAAATTCAAATCAGTTGGAGGGCGGGCGAAAGAAAAAGCTCTTTCCCTCGCAGATCCTAAATAT 
GACCAAGTGGGTGAGTCTTCCGGGTGACTGGCAGATCGGTAGGAAGCAGCAGCTAGTCGAGAACCGTTATCAGTGGCGGCAATAACAAGCGAGGC 
TTCAATTCGCTTCCAGTGCAAAGAAGATTATTGCAGTTGCATTGCCGGGTGCGATCCGCCCAGTTGTCCACGTGTGAGTACGGATGATTGTCGAT 
CGCTCGATCGATTTCATACAAAGTGTGTGAGGTGTTTACCTCGAGTCCAGAAAGTCAAGACC 
(SEQ ID NO: 820) 

Exon: 1001. .1202 
Exon: 1312.. 1381 
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Exon: 1458. .1711 
Exon: 4633. .5262 
Exon: 5328. .5533 
Exon: 5629.. 6947 
Start ATG: 1192 

Transcript No. : CT22721 

CAAAACCAAAACAAGCAACGGCAATTGTTTAAAAACTGCAAAGTTACTCCCATTTAAATACCGGAAAATGTTATTTCGCAAGGCTCGCCGACACT 
TTCTGCCGTTCTACATGTTCAATGGATTCTTGTGACAGCGCAACGAACGATATCAGCGTTTGCGTTACAATAGCTCAAATATTGGTCATCCGAGG 
CATGAGCTTACACCTGTTGGACAACTATGCGAACAGCAATGTGATAATAAACATTGACAACTGCGCGCTCTACAGCACCAATCGGCCAGCTGATC 
GAAAGTCGATAACCGACGGCTCGCACGATCGACAGCGACGTCGCCGCCAGCGGTTTTTAGTCATTTTTCGGTTTTTCTGCGCAACGGTCGCTTTC 
GGTTTGGCAATCGCCTGCGTCATCTACACGCTCCACACAATGGGCTGGATTTTTGCTGTGCTCGTCGCACTGGTGGCTCTGCTACTGACGAAACC 
CGGCTGGCGCTGGTTCTACATAGCGGGTGCGACCGCTAGTCGCGATCTAACAGCTCTCTGGGCCTATATCAAGCTGCTGAGGTACACGAAGCGCC 
ATGAGCGGCTCAACTACACGGTGGCGGACGTCTTCGAACGAAATGTTCAGGCCCATCCGGACAAGGTGGCTGTGGTCAGTGAGACGCAACGCTGG 
ACCTTCCGTCAGGTGAACGAGCATGCGAACAAGGTGGCCAATGTGCTGCAGGCTCAGGGCTACAAAAAGGGCGATGTGGTGGCCCTGTTGCTGGA 
GAACCGCGCCGAGTACGTGGCCACCTGGCTGGGTCTCTCCAAGATCGGTGTGATCACACCGCTGATCAACACGAATCTGCGCGGTCCCTCCCTGC 

T GC AC AGC AT C AC GGT GGCCCATTGCTCGGCTCT CAT T T AC GGC G AGG AC T T C C T GG AAGCT GT C AC C G AC GT GGC C AAG GAT C T GC C AGC G AAC 
CTCACACTCTTCCAGTTCAACAACGAGAACAACAACAGCGAGACGGAAAAGAACATACCGCAGGCCAAGAATCTGAACGCGCTGCTGACCACGGC 
CAGCTATGAGAAGCCTAACAAGACGCAGGTTAACCACCACGACAAGCTGGTCTACATCTACACCTCCGGCACCACAGGATTGCCAAAGGCTGCGG 
TTATCTCTCACTCCCGTTATCTGTTTATCGCTGCTGGCATCCACTACACCATGGGTTTCCAGGAGGAGGACATCTTCTACACGCCCTTGCCTTTG 
TACCACACCGCTGGTGGCATTATGTGCATGGGTCAGTCGGTGCTCTTTGGCTCCACGGTCTCCATTCGCAAGAAGTTCTCGGCATCCAACTATTT 
CGCCGACTGCGCCAAGTATAATGCAACTGTAACTACGAAACCCTCGGAATACGACCAGAAACACCGAGTGCGTCTGGTCTTTGGAAACGGACTGC 
GACCGCAGATTTGGCCACAGTTTGTGCAGCGCTTCAACATTGCCAAGGTTGGCGAGTTCTACGGCGCCACCGAGGGTAATGCGAACATCATGAAT 
CATGACAACACGGTGGGCGCCATCGGCTTTGTGTCGCGCATCCTGCCCAAGATCTACCCAATCTCGATCATTCGCGCCGATCCGGACACCGGAGA 
GCCCATTAGAGATAGGAATGGCCTATGCCAACTGTGCGCTCCCAACGAGCCAGGCGTATTCATCGGCAAGATCGTCAAAGGAAATCCTTCTCGCG 
AATTCCTCGGATACGTCGATGAAAAGGCCTCCGCGAAGAAGATTGTTAAGGATGTGTTCAAGCATGGCGATATGGCTTTCATCTCCGGAGATCTG 
CTGGTTGCCGACGAGAAGGGTTATCTGTACTTCAAGGATCGCACCGGTGACACCTTCCGCTGGAAGGGCGAGAATGTTTCCACCAGCGAGGTGGA 
GGCGCAAGTCAGCAATGTGGCCGGTTACAAGGATACCGTCGTTTACGGCGTAACCATTCCGCACACCGAGGGAAGGGCCGGCATGGCCGCCATCT 
ATGATCCGGAGCGAGAATTGGACCTCGACGTCTTCGCCGCTAGCTTGGCCAAGGTGCTGCCCGCGTACGCTCGTCCCCAGATCATTCGATTGCTC 
ACCAAGGTGGACCTGACTGGAACCTTTAAGCTGCGCAAGGTAGACCTGCAGAAGGAGGGCTACGATCCGAACGCGATCAAGGACGCGCTGTACTA 
CCAGACTTCCAAGGGTCGGTACGAGCTGCTCACGCCCCAGGTTTACGACCAGGTGCAGCGCAACGAAATCCGCTTCTAAGAGCTGCAATAGAGTT 
GTGTCTGAACCTTGCCTTTTGCCCAATATGCTGTTAATTAGTTTGTAAGGCTAAGTGTAGTAGAGGAAAATCGGGGGAAATCGGCAGCAAAGATC 
ATTCAGCCTAGGAGAGATGCATCCGAAGCACATTTCCATGTCAACAATGCACTTTTGTATATCGTAAGCATATATATATCGTATATCGTAAACGT 
AGTTGTATCTGCATTTGTGTAGATGATAGCCTCCTATACGCATTTCAATTGTTTTTAGCGTGCTAAAGAACCTTGTTAAATGCAATTTCAGCTAT 
TGTTTAGTCAGTTTTAGTGGCATTTACACTTCCATTCTCGTTGCGTTTCGTTTTTGCCTGTACATATGAGAAGCTCTGATGTTTTTGTATCAAAT 

AAAGTTTATTCCTTCACCACG 
(SEQ ID NO: 821) 

Start ATG: 192 

MSLHLLDNYANSNVIINIDNCALYSTNRPADRKSITDGSHDRQRRRRQRFLVIFRFFCATVAFGLAIACVIYTLHTMGWIFAVLVALVALLLTKP 
GWRWFYIAGATASRDLTALWAYIKLLRYTKRHERLNYTVADVFERNVQAHPDKVAVVSETQRWTFRQVNEHANKVANVLQAQGYKKGDVVALLLE 
NRAEYVATWLGLSKIGVITPLINTNLRGPSLLHSITVAHCSALIYGEDFLEAVTDVAKDLPANLTLFQFNNENNNSETEKNIPQAKNLNALLTTA 
SYEKPNKTQVNHHDKLVYIYTSGTTGLPKAAVISHSRYLFIAAGIHYTMGFQEEDIFYTPLPLYHTAGGIMCMGQSVLFGSTVSIRKKFSASNYF 
ADCAKYNATVTTKPSEYDQKHRVRLVFGNGLRPQIWPQFVQRFNIAKVGEFYGATEGNANIMNHDNTVGAIGFVSRILPKIYPISIIRADPDTGE 
PIRDRNGLCQLCAPNEPGVFIGKIVKGNPSREFLGYVDEKASAKKIVKDVFKHGDMAFISGDLLVADEKGYLYFKDRTGDTFRWKGENVSTSEVE 
AQVSNVAGYKDTVVYGVTIPHTEGPAGMAAIYDPERELDLDVFAASLAKVLPAYARPQIIRLLTKVDLTGTFKLRKVDLQKEGYDPNAIKDALYY 
QTSKGRYELLTPQVYDQVQRNEIRF* 
{SEQ ID NO: 822) 

Name: fatty acid transporter-like 
Classification: transporter 



Celera Sequence No. : 142000013384839 

CTTTATATGGTCACGAATGTCTCTGTTGCACACTACAGACTAAAATCAAAATATTCTCTGAAAGGTATTCCCAACAAAAAACGCTATATCCTGGG 
ATAACCATTGTATTCTTACGATTAATTTCACAAAATGAATTAAGAAACTATAATTGTACAGAAACGATTATTTAAAGGATTATTGCCGTTCAAAA 
ACGAAAGAGTAATTCCCGCTACTGTTCCGCTGTTTTCACGTCCGCAGATTTAAGTGACCCAATTGCAAAAACAAAAGCAAAACAATATTAATTCA 
ACTCTGAGCCGACTGATGCTGCAGGCACCTCAAAGTTAAAAAATGCGTAGCACGTGCGTTCAGAAATTCGGCCAAGAAAAAAGGAAACTGTTAAC 
TGGGTCCATAAATTCAACTTTGTTGTATATACATACATACATACTGTATGTTCATATAAGTACGTCACTTTAAACATCTGCCTTCGTTTCCTTTA 
ACTCTTTAATTTATTTTTGATCCAGCTTTAATAACTATTAATTTATTATTCCTATAACTTGTACGTTGAATGGGCATAGGTAATTTTCTTGAAAT 
ATGAAATATTTAAAATGGAATGCATTCTGTATGAAATGTATTCGGGCCTAAAAAGATGATTGTAGTTATTTAAATTTGAAACTAGATTTTTAATA 
CAAGAAGATTGATAAAAAGTTATTTTTGATTTATATTTAATAAATATTTTAGAACTAAAACCAAAATAAATGATGTTTTCCCTACTAAAATTATA 
CTCGAACTTGTAGCTGTATTAAAACGTGTTCACTTATTGATTAAATCATAGCCCATACAGTTAGTTATACGTTCTCAATTTTTGCCAATCTTGCG 
CTTCTGATTTCCAATGAAATAAAAGTGACTATTGTACGAATCACCCTAATATTTTTTCACAACTGGTATTTTTCTCCGTCCCGCAATCGGTAACA 
CCATGCAGGGTGTCTGAGTTTCGAATCGGCATAAATTCGCCGGCAGCTGAGCAAGATGTATCCTGTGAGCTCGCAAAAGAGGTCCTGGACATTCG 
CCAATGAGGGCCAGCTCATGGAGTTCCGCGTGGAGCAGAACAGCAAGTACATCGAGTCGCACGAGGAGGAGGCGCAGGGTCGCGACCTCAATGAG 
CACTTTCTCACGTCGGCGGAGGAGCGCCTGTTGCTGAAGCAGTACGAGATCTACCTGTTCGATTTCTGCCGCCGCTTCGAACCGACGATGCCCAA 
GTGCGTTGTGGGCACGGCCTTCCACTACTTCAAGCGGTTCTATCTGAACAACTCCCCCATGGACTATCACCCCAAGGAGATTCTGTAGGAACGCT 
CACCTGAGAGCAGCAAACACCCAGCTGAAAAGCAA7AACATTCTAAAAATGTATAAACCTTTTCAGAGCCACATGCGTGTTCGTTGCCTGCAAAGT 
TGAGGAGTTCAACGTGTCCATCAACCAGTTCGTAAACAACATCAAGGGCGACAGGAACAAGGCCACCGACATAGTGTTGTCCAATGAATTACTGC 
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TGATTGGACAGCTCAACTACTACCTCACCATACACAATCCGTTCAGACCCATCGAGGGTTTCCTGATAGATATAAAAGTGCGATATATATTCACA 

TTTGCTCTCCAGATTGTTTCGTTCATATTGGTTTGTTTCAGACTCGCAGCAATATGCAGAATCCAGATCGTCTGCGGCCACATATTGATAGTTTC 

ATTGATTCCACGTACTACTCGGATGCCTGTCTTCTGCATACGCCTTCGCAAATTGCATTGGCTGCCGTCCTCCACGCGGCCAGCAGAGAGCAAGA 

GAATCTCGATAGCTATGTGACGGATCTTCTGTTTGTCTCCGCCAGGGAGAAGCTACCCGGACTCATAGATGCCGTGCGAAGTAAGTTTGTGCTGT 

GAGATGGTGCGCCCAGCCCTACATGCCTTTTTCCCGACAGAAATTCGCATAATGGTGAAGCAATATCAGCAGCCCGATCGGGAGAAGGTCAAGGC 

CATC GAGAAAAAGTTGGACAAGTGCCGAAATCAAGCCAATAATCCT GAT AGCGAACTGTAAGACGCCCTAATTTGTTCATGC ATT CCGAATTTAA 

TAACCCTTATACTTTAAAAGCTATAAGGAGCGCCTACGCCGATTGTACACCGATGAGGATGACATGCCCGCCGAAGATGCCTCATTCCACATTGC 

AGATGTGAGCTCGGACACATCTGCTATGAACATCAGCCAATAGACTTAAGAATATTTATTTAAATGATGTGATGATCTACTACTGCGTGGATTTC 

ATCGATATTAAAGCATTTTGTAATTTACCATTTCTTGATTGTTAAAATGTATGCGTTTAGTGTTAGTTTACTAAACAAAGTTGGATTAGGTACTT 

CACTTTTCCAATATATAAAATATTAAAAAAAAAACCTTTTGAAAATCCGCTTAGCGCTGCTTGGCAATATTTCGTTCGCAGCTAACGGCCACACT 

GCGCCCCAAGAAAAACAAGAATAGTGAACTCAGTCACAACAAATAGCAAAAAGAGTTCGAAATGCAGGATTTTCTGCCTTCGCTGGATGTTCTCT 

TTCTTGGACTACCAGGCCACTACTCGGTAGCGCTGGTCACCCTTCTGCTGTGCGTTTTGGTCGCCTTCTACTTCGCCAACATCTTCAAGGACAAA 

GGCGTGGGTGTTTTTGCCGGAAGAAGTAGACGTCTCCTGGAATATTTCCTTACATTATCTTGCAGGAACTGGAGGACGAGGGCCTGGGGGCCGAC 

GAGCCGCGCGATGAGGAGGATTCCGAGTCACGGACTCCGCAAGGGGAGGAAACCGACGATGACGAGGCCTACGAAGAGAACTCCACCGAAAGCGA 

AGTTGAGCTAATCGAGGAACAGCTGCCCGAGCACAGCTACAACCACCTGATGGGCCAGCTCAAGGCCAAGAGGCTCCAGCATAAGATGGAGAAGA 

CCCTGACTCCCAGCCAAATCGAGGAGGAGCGCCGAATAGAGCGCGAGCAGCTGGCGGCCATCTTTGAGCTCCTGCGCAAACAGGAGGCGGAATTG 

AACCTCCAGGACCGAATAAGCGACCAGGATCTCAAGCAGCAGGTGCGGCTCTATCGCTGAAGTGGCGCCGAGCCTGGGCTCACCCTACCTGAATG 

TACAGACTTGTGTGATCTATTAGTTTAATGTTTAAAATTCCAAGATTTGGTGTTGCCTCATACGGTCCACTAAGTGCGGGCAAGTCTTTAGTCAG 

AGAGCTATCTCATCAGTCGATGATGTCGGAATATAATATCTTAGTTGTATGTAAATTATATGTATTCCACTTCGATCTATGTATTTTGTGAGTTC 

CTGGTGTTATTAATATCTGATTTAGATTTGTAGTAAATGGCTGTTCGGCTCTTAGTCTCTTAAATCAGCTTTCGAATTAAGAAT 

(SEQ ID NO: 823) 

Exon: 1001.. 1319 
Exon: 1396.. 1597 
Exon: 1657.. 1885 
Exon: 1941.. 2052 
Exon: 2111.. 2409 
Start ATG: 1006 

Transcript No. : CT22750 

GCAAGATGTATCCTGTGAGCTCGCAAAAGAGGTCCTGGACATTCGCCAATGAGGGCCAGCTCATGGAGTTCCGCGTGGAGCAGAACAGCAAGTAC 
ATCGAGTCGCACGAGGAGGAGGCGCAGGGTCGCGACCTCAATGAGCACTTTCTCACGTCGGCGGAGGAGCGCCTGTTGCTGAAGCAGTACGAGAT 
CTACCTGTTCGATTTCTGCCGCCGCTTCGAACCGACGATGCCCAAGTGCGTTGTGGGCACGGCCTTCCACTACTTCAAGCGGTTCTATCTGAACA 
ACTCCCCCATGGACTATCACCCCAAGGAGATTCTAGCCACATGCGTGTTCGTTGCCTGCAAAGTTGAGGAGTTCAACGTGTCCATCAACCAGTTC 
GTAAACAACATCAAGGGCGACAGGAACAAGGCCACCGACATAGTGTTGTCCAATGAATTACTGCTGATTGGACAGCTCAACTACTACCTCACCAT 
ACACAATCCGTTCAGACCCATCGAGGGTTTCCTGATAGATATAAAAACTCGCAGCAATATGCAGAATCCAGATCGTCTGCGGCCACATATTGATA 
GTTTCATTGATTCCACGTACTACTCGGATGCCTGTCTTCTGCATACGCCTTCGCAAATTGCATTGGCTGCCGTCCTCCACGCGGCCAGCAGAGAG 
CAAGAGAATCTCGATAGCTATGTGACGGATCTTCTGTTTGTCTCCGCCAGGGAGAAGCTACCCGGACTCATAGATGCCGTGCGAAAAATTCGCAT 
AATGGTGAAGCAATATCAGCAGCCCGATCGGGAGAAGGTCAAGGCCATCGAGAAAAAGTTGGACAAGTGCCGAAATCAAGCCAATAATCCTGATA 
GCGAACTCTATAAGGAGCGCCTACGCCGATTGTACACCGATGAGGATGACATGCCCGCCGAAGATGCCTCATTCCACATTGCAGATGTGAGCTCG 
GACACATCTGCTATGAACATCAGCCAATAGACTTAAGAATATTTATTTAAATGATGTGATGATCTACTACTGCGTGGATTTCATCGATATTAAAG 
CATTTTGTAATTTACCATTTCTTGATTGTTAAAATGTATGCGTTTAGTGTTAGTTTACTAAACAAAGTTGGATTAGGTACTTCACTTTTCCAATA 
TATAAAATATTAAAAAAAAAA 
(SEQ ID NO: 824) 

Start ATG: 6 

MYPVSSQKRSWTFANEGQLMEFRVEQKSKYIESHEEEAQGRDLNEHFLTSAEERLLLKQYEIYLFDFCRRFEPTMPKCVVGTAFHYFKRFYLNNS 

PMDYHPKEILATCVFVACKVEEFNVSINQFVNNIKGDRWKATDIVLSNELLLIGQLNYYLTIHNPFRPIEGFLIDIKTRSNMQNPDRLRPHIDSF 

IDSTYYSDACLLHTPSQIALAAVLHAASREQENLDSYVTDLLFVSAREKLPGLIDAVRKIRIMVKQYQQPDREKVKAIEKKLDKCRNQANNPDSE 

LYKERLRRLYTDEDDMPAEDASFHIADVSSDTSAMNISQ* 

(SEQ ID NO: 825) 

Name : Cyclin H 

Classification : cell_cycle_regulator 
Gene Symbol : CycH 
FlyBase ID: FBgn0022936 



Celera Sequence No. : 142000013384658 

TGCTTGCTTTTCGCTTTCACAATGCGTAAAAAAACACCTAATTTTGTATAGCATATGAATATCATTAAGTGTATCAGTAAATTTGTATCAGTAAC 
AGTTGCAATATTATCGTTATCCTTGTATTATATCCTTTTTTCCTTTTTCTTTTGTTTTACCCGACGAGATACCACGAACGACGATTACGATTACG 
AAC G AGC G AC GAAAAT AC GC G AACG C G AAAC C GT T AG AACC C GAAAC GC G AC G AT T T T T T T C GAT T C CAT TTTTTGTTT GT T AT AT AAAT G ACG A 
AAGTGTTGGATACTTCTGGGAAATTGGAATCCGCTGTTGATGAATGCTTTCTATGGTTTTTGAACGGAGTTGAACATCAGTCTTTGTTTGATTTG 
TAACAATTTTGTGTGCACCTGCCATTGAATCCATTTTTCCATTTCTGATCTTAGGATTTCACAGGAGACCCAACACTTTCATCCTTTAAAGATTT 
TATTTAGTTATTTAGTTTTTAATTACCGCGACGAGGACCCGCAATATCATTTAGTTTCGACGATTTTTCATTTAGCTTTTCTATTACTATATGAG 
AAACACCCGTGTGTTTAACATATTCAACTCTATTTTCGGTTTGGGTTTTGGGGCAGAGTGTATTGGAGAATGGGGCATATCTGCTATAGCACTGA 
CTCCGGTTAAGTACGGCCGGAACACATAATTTTCTAGTGGTTCCTACTAAGTTTTCTGCACAAGAAAGGCATTTTATATGCTGTAAGCGGACAGT 
CGTGTGCCTAAAGAGGTTCTACAGATCAAAGATCCTCTAGCTTGAATTCAAATATATACGATTCTTCTCTATTCGATTTTTCGATATATGTTTGT 
GGATTTAAATCTAGTTATGTAGTTTCTTTTATGTGGATTTGCTGTTTTTTTCGATTTCTATAAGCTTAGATTTTCAATGTTAACTTAACATTCGA 
ACTGCATCGAATAACTTAAGGAGGCTAAAGATAAAGACGACACGCTTATTACGCATACTTGTACATAATTGTCTAGTTAAAAAGTTGCTGCTTGG 
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CTCGCTAATGGGTGAAGGTGGGTGTTTTATGGCATGGCTAGCCTTAGATGTTATACATTGTGGTCAGGCGCTCGGCCAAGGTTGGCGGGAATTGC 
TCCACAAATCGACGCCAGTTCTCCTCGCCCACTTGGGTCTTAAAACCGTGCAGAATCTTGAAAGAAACATGTGTGAAATAAAGAAAATAGTTACT 
ACCTGTAGGGTACAAACCTTCTGTATCATCTGATGCAGATCCTCTGGTGGGTTAACCCAGGAAGCGATGGCATCGCAGAAGAATATAAAGTCTGC 
TACCACGCCAGCTGGATTCACCGTGATCATATGACACATTCCACGGAAGGCTGAGTCCTTCTCATCATTGTCTCGTATGTGCCGCAACGATGTGC 
ACCTGCATTTTTCAAAGAAAATGTTTAAATGTCGTTTTCAGGAAAAGTAAACTAAATTAGCAGGCTTTGGCAAACACGAAAGTTCAAAAGCAAAT 
T T AAAT T CC AGT T AAAT GAT GT G GAT T GT T AGTG AGT T G T G G AAC AAAG G AAT AG AT C GAT GT G C T T AT AG GT T T C AGT G T T C AAGC GT AAT T T A 
AGAGTAATGGGAAGAACGGGAAGGATAGATAAGAGAATAGAAGGAAAGAATAAGCATTTTAAATATAAAACGTGTTACACACCACTGTCGTACAA 
ATT CGGGC AAAT AAGGAGCCACTTCAACTGGGCACACATAACCTAGACGACCGATTGTT ATT GCTAAAAGAACAGAAGAAATAAGTAT AAAT AAA 
TCAACTGAATAAATCAATATCATACGTAGTTCGCTGCGATTGATTGCTAAATTCGAATTGAATATCAATTGCGATTCTGGTCTATCTATATCTAT 
GCATGTTGTATGGGAATGAGGTTCGATTTCGTTTCGTTGCTGCTCATGCAACAAGTAACACAAATTAAAACAGAAACAAATTTCCGGACATCCAC 
ATACACGCATACATTCAGAGTAATATATATACAAACAGATCGCATATAAGTTCTTAGATAGCTCGGGAAATTCGAAAGATTTCTATCAGCTCAAA 
CGAATTACCTGTGTTCTCCAGCAGTGTCTTGGGCGTGTTCGGGCGGTTGATAATGATGAAAAGGTCGCTGAGTACAAGGCGTATGTACTGCTTGG 
TCTCCTCACCTTTGAATCAAAATATATATTAAAACGATCCTAATACTCCTAAGAAGCTCCTCCCACTTACCCAATTTCATGCAGATTTCGCCTAT 
GGCCCAAGTAGCATTATTGCAGACCGAAATGAAGTCTGGGTTAAGGTTTTGACCCAAGATGGGGAAGAAATCGGCCATGAAGGGATGCACGTGGG 
GGAAACAGGCCTTAGTCAAATCACCCAGCAGGGCAAACGAAGATTGGCGAACCTGGAATACAACAACATTATAGAAATTCAAAGGTTAATTTCGA 
AAAGATTGGTTTTAAGGGAGACATACCTCAGGCAAAACGTCCTGCATACACTGGTAGAGTAGATGCATAATGTTGCTGTTGGCCACCAATGTCTC 
GATGTGGCGATCCAAACCCTCGGCTAGGCCAGACAGCAGATCTAGGGCGACAATCATGCGCTCTTTGTCGGGATGGTCGTACGTTTGGTTTTGTT 
TACACAGCTGTTGGCGATTTAGATTGATGAAAAAGATTCCTTAGAAAATTGACAAGTAGATTACCATTTCCTGGTTGATAGTCTGCTCAATAAGA 
GAGATGCACCTTCGATACACCGGGTCGCAGTAGGGCAGAAAGCCGGACTGCAAGGCAGTGGCGATGCTCGACAGGCACTCCAACAATGGGAACAG 
ATCCTTATCGTCATCCTTCAGCAGGTTCCACTTATCAATTAGCGGAGGCATTAGGATGTCAATATATTGCGGCTTATTCAGATGATGACCTACGG 
AGTCCGCCAAAGTACCCACGGCATCGTACAGTATCAATAAGTTCTTGTGCTGGTACTTGGAGAAGGCAAAGACGAGCGTCTTTAGAATATACTCC 
AGGTAGGGCACCAGTTCCGTGCAGGCCTCCTCCTCCAGAGTAGCAAAGGCAGAGCACGCAGCTTCCTGAACGCGCTTATTGGAGTCTAGAATGCG 
CTTCAGCAGTTCTTCCATTAGAGGCTTCAAGTACTGGTCGTGCGGCTGGTTGACAACCCAATTGGCGTATCGCGAGAGCGTCCAGCATGTGATGG 
AGCGCACCAGTGCCTTCTTGTCGGAAAGGCAGCTAATTAGGTAGGGAATCAGCTCTGGCAAGTGTTGGATCATGCCCTGCATGCAGCCCTCCGCA 
ATAGCTCCCAAGGCCAGCACACCACTCTCTTTGATCACCCATTCTTGATGGAACAGAGTCTCTTTCAGGATGGGCAGCACAACGGGCAGACAATC 
CTCTCGGAAAACATTCGCCAATACATCGAGAGCAGCTGCGCTGCACTTGCGCAAGTTCCATTCCGATAGCGAGCTATCGTCGTCCATTCCATCTT 
CAAATTCGTCGTCGTCATCATCGCCTGTCGCTCCAGCTCCTCCCTCTTGTGTACTCCTGATTGTGTGTGCACGGGACTTGTGGAAACGCGGGCGG 
ATATCCTCTTCTCGATCGGGCACCATGTCATCCTCCTCAACGTTTCCCTTGAGGAGAATTATGTCGACCTCTGAGTACCGCATGCCTCGTACAAG 
AACTGGCGCTAATTGAGCAAGATAAGGGGCAAGCACGTCCTTGCAGATGCTTTGCTCGGCCAGTGAAAGCCAGAACTCGGATGCTTCCAGGGCCA 
CACCCTCATCGGTGTCCTGGGTGCGCAACAGCATGTACTAAAATTCGTTAAATAAGATATTTAAACAAAAATTTATCAGAATAATTATTATTAAT 
TATATATTTTATTTTAAAGTTATACCTCAATAATCTGCGACATGTGCGGCATCAGACGGTCCATTCGCACCTCCAGCAGCATGACCAATCCGTGG 
C ACACGT T T T T G C GC AC C TCGTGGTC C TC AT C T GAT G AC AGGT GAAAG AGGT T C T C AAT G AAGC T GTC T AT GT T T AGC AT CAGAGC CT GT G AT C T 
GTTGATGATGAACTGGTTGATGCAGGCAATGGCGTGGGAGCGGATCTTTGGACTGCTGTGCTTGAAGTACTCCAGGAACTTTGGAATCATTATGT 
TTAATGGCCTGTTGAGCGCTGCGGAATCCAAAATCTCGGCAGAGTCCTCGCAAATTTTCTGCAGGGCACTGAATGCGCCTTCGCACACATTATAG 
TCCTGGTTGTCGAGCATTTCGCAGAGAGATGGAAGCAGCTGCGGCCAGTTGTGCAGACCGCCATTGCTGGCGATGGTGGTAATCAGGATGCCCAC 
GGTGGCACGGATCAGGGGCGAAGCGTCGCCCACTGCCTGCAGGCACTCGTGTTTGATATACTCCACGATCTCCGGCTGCAGAGTGGTGCCGTGCA 
TGCGGATGTTGTTCTTGAGGATTAGGCCGCTGAGTGATCTCGTGGGCTCGTCCTCTGTCTTCAGTTTCGTCAGCACATAGATAAGATAGTTGTTG 
AAGTCGGGGTAGCGATTGAATTCCTCCAGTTTCTGTTGAGAAGGACACAATATGTGCATGCAGTTCGAGGAACCGTACCTGGTAGAAGTTATTAC 
TCACCATCTGTACGGCCATTTGAGTGGCTGTGTCCGGTGACTGCGACTCCTTGAGGATCGCTATGATCTGCTGCAGACCTTCTCCCTGTGGTTCC 
CACGTCATTTTGCTGCTGCAAGGGAAGATCGCAAGGGTTTTAGGTTTGATTAGTGAATTCGGAATTAGAAATTAGGGTTAATGGCCTCGCAAAGG 
GTAGCAAATGGGGGAGAAAGAGAGAGCACGAGAGACACTCTGTTCGAGGCCACTCACGCACACCAGGGCATTTTGCACCCACACCCGCAAACAGA 
CATCTGCTGTTGTGAAATTTACGTTTTCCGTTCGATGTTTCCTTGGCACTTAATGGTTTTACGGCGGAAGGCGGGAGTTGAAAGCCCTGCCCCGA 
GAGTAACGTGCGATTTTCTGCCCACAACCCCAGACAAAAAAAAACACACGAATAGAACTCGCCTGCCTCCGCCATATTTCGTGACATCGGGCGTG 
CGGTTGAAAACGCTTTTCCGAGATAAGTTCTGCGTTGTCCGCATACCCCCTACTTACCTTTGGGTGTTGGCTGTGCTTTAAAACTATTTTAGTTG 
GAATTTTCTTAAGATTTTGCGTAAATTGTCTCTTTTTTCACCTTTTCTTTGCACAATTAGCGAGATGTATACAGTGTGACCGTGCAAGAGAGAGG 
GAGTATGCTATTTTTTATTAGAGATGGCAACGTGTCGTCACGGGTTTTAAAATTATATTTAATGTAGTTGATTTTTTAATTTATTGTGCTTAATT 
TAATATTTTTAAGCAGCTTAAAATGTTTGAATCACTCTGATATATAATTTCCGAACAACAAATTATTTTATTGCCAAAATGAAAAAAAATGAATA 
TTAAATCAAATTTTGCAGTCCAAAGCTGCTCCGATGTAACGAACTTCCTCTCACATATGTATCTAAGTGCTGCTGCACGGTCACACTCGATACGG 
CTCTCGAAATGCGTCCACACTCTCGCCATTTAGAGTGCCCACATTCGCCACCCTGTTCAATTTTGTTTGGGTTTGAGGCCAGACAGAGGAGTCAT 
TAATTTTAAACAAAGGACCACGCTTCAGGACATTCATCCCGGGTCAGACAAAGCCGCCACACACTTTCCTCGAGGCCGAGTGAAAGTTTCTCGCC 
TTGTCGCCGCTTAACATCCGAAGCGACCGGACATGAACAAGCGCAACCATGTGAGCAGAAAACCTACGGCTCCACACACAAAGCAAATGTATTGA 
GTGAAAAGTGCTATTTCTTTTGCAGATCCGCCTGCCGCCAGAGGTGAATCGGCTATTGTACGTGCGGAACCTGCCGTACAAAATCACCTCGGACG 
AGATGTACGACATATTCGGCAAATTTGGAGCCATTCGACAGATACGCGTGTAAGATTGAAGATGATAATGCTATTCGCCGTGAATCCTCTCAATC 
AATTTGTGTATCTTTTGCAGAGGCAACACTCCGGAAACGCGTGGCACCGCCTTTGTCGTCTACGAGGACATTTTCGATGCCAAGAACGCCTGTGA 
CCATTTATCGGGTTTTAATGTGTGCAATCGCTATCTGGTGGTGCTCTACTACCAATCCAACAAGGCCTTCAAGCGCGTGGACATGGACAAGAAAC 
AGGAGGAACTGAACAACATTAAGGCCAAGTACAA 
(SEQ ID NO: 826) 

Exon: 5304 . .5188 
Exon: 4765. .4660 
Exon: 4592. .3921 
Exon: 3837. .2725 
Exon: 2667.. 2497 
Exon: 2427.. 2256 
Exon: 2194. .2099 
Exon: 1773. .1698 
Exon: 1427.. 1253 
Exon: 1196.. 1001 

Start ATG: 4758 (Reverse strand: CAT) 
Transcript No. : CT22759 
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TCACACTGTATACATCTCGCTAATTGTGCAAAGAAAAGGTGAAAAAAGAGACAATTTACGCAAAATCTTAAGAAAATTCCAACTAAAATAGTTTT 

AAAGCACAGCCAACACCCAAAGCAGCAAAATGACGTGGGAACCACAGGGAGAAGGTCTGCAGCAGATCATAGCGATCCTCAAGGAGTCGCAGTCA 

CCGGACACAGCCACTCAAATGGCCGTACAGATGAAACTGGAGGAATTCAATCGCTACCCCGACTTCAACAACTATCTTATCTATGTGCTGACGAA 

ACTGAAGACAGAGGACGAGCCCACGAGATCACTCAGCGGCCTAATCCTCAAGAACAACATCCGCATGCACGGCACCACTCTGCAGCCGGAGATCG 

TGGAGTATATCAAACACGAGTGCCTGCAGGCAGTGGGCGACGCTTCGCCCCTGATCCGTGCCACCGTGGGCATCCTGATTACCACCATCGCCAGC 

AATGGCGGTCTGCACAACTGGCCGCAGCTGCTTCCATCTCTCTGCGAAATGCTCGACAACCAGGACTATAATGTGTGCGAAGGCGCATTCAGTGC 

CCTGCAGAAAATTTGCGAGGACTCTGCCGAGATTTTGGATTCCGCAGCGCTCAACAGGCCATTAAACATAATGATTCCAAAGTTCCTGGAGTACT 

T C AAG C AC AGC AGT C C AAAG AT CCGCTCCCACGCCATTGCCT G C AT C AAC C AGT TC AT C AT C AAC AG AT C AC AGG C T C T GAT GC T AAAC AT AG AC 

AGCTTCATTGAGAACCTCTTTCACCTGTCATCAGATGAGGACCACGAGGTGCGCAAAAACGTGTGCCACGGATTGGTCATGCTGCTGGAGGTGCG 

AATGGACCGTCTGATGCCGCACATGTCGCAGATTATTGAGTACATGCTGTTGCGCACCCAGGACACCGATGAGGGTGTGGCCCTGGAAGCATCCG 

AGTTCTGGCTTTCACTGGCCGAGCAAAGCATCTGCAAGGACGTGCTTGCCCCTTATCTTGCTCAATTAGCGCCAGTTCTTGTACGAGGCATGCGG 

T AC T C AG AGGT C GAC AT AAT T C T C C T C AAGGG AAAC GT T G AGG AGG AT G AC AT G GT GCC C G AT CG AGAAG AGG AT AT C CGCCCGCGTTTC C AC AA 

GTCCCGTGCACACACAATCAGGAGTACACAAGAGGGAGGAGCTGGAGCGACAGGCGATGATGACGACGACGAATTTGAAGATGGAATGGACGACG 

ATAGCTCGCTATCGGAATGGAACTTGCGCAAGTGCAGCGCAGCTGCTCTCGATGTATTGGCGAATGTTTTCCGAGAGGATTGTCTGCCCGTTGTG 

CTGCCCATCCTGAAAGAGACTCTGTTCCATCAAGAATGGGTGATCAAAGAGAGTGGTGTGCTGGCCTTGGGAGCTATTGCGGAGGGCTGCATGCA 

GGGCATGATCCAACACTTGCCAGAGCTGATTCCCTACCTAATTAGCTGCCTTTCCGACAAGAAGGCACTGGTGCGCTCCATCACATGCTGGACGC 

TCTCGCGATACGCCAATTGGGTTGTCAACCAGCCGCACGACCAGTACTTGAAGCCTCTAATGGAAGAACTGCTGAAGCGCATTCTAGACTCCAAT 

AAGCGCGTTCAGGAAGCTGCGTGCTCTGCCTTTGCTACTCTGGAGGAGGAGGCCTGCACGGAACTGGTGCCCTACCTGGAGTATATTCTAAAGAC 

GCTCGTCTTTGCCTTCTCCAAGTACCAGCACAAGAACTTATTGATACTGTACGATGCCGT GGGTACTT TGGCGGACTCCGT AGGT CAT CATCTGA 

ATAAGCCGCAATATATTGACATCCTAATGCCTCCGCTAATTGATAAGTGGAACCTGCTGAAGGATGACGATAAGGATCTGTTCCCATTGTTGGAG 

TGCCTGTCGAGCATCGCCACTGCCTTGCAGTCCGGCTTTCTGCCCTACTGCGACCCGGTGTATCGAAGGTGCATCTCTCTTATTGAGCAGACTAT 

CAACCAGGAAATGCTGTGTAAACAAAACCAAACGTACGACCATCCCGACAAAGAGCGCATGATTGTCGCCCTAGATCTGCTGTCTGGCCTAGCCG 

AGGGTTTGGATCGCCACATCGAGACATTGGTGGCCAACAGCAACATTATGCATCTACTCTACCAGTGTATGCAGGACGTTTTGCCTGAGGTTCGC 

CAATCTTCGTTTGCCCTGCTGGGTGATTTGACTAAGGCCTGTTTCCCCCACGTGCATCCCTTCATGGCCGATTTCTTCCCCATCTTGGGTCAAAA 

CCTTAACCCAGACTTCATTTCGGTCTGCAATAATGCTACTTGGGCCATAGGCGAAATCTGCATGAAATTGGGTGAGGAGACCAAGCAGTACATAC 

GCCTTGTACTCAGCGACCTTTTCATCATTATCAACCGCCCGAACACGCCCAAGACACTGCTGGAGAACACAGCAATAACAATCGGTCGTCTAGGT 

TATGTGTGCCCAGTTGAAGTGGCTCCTTATTTGCCCGAATTTGTACGACAGTGGTGCACATCGTTGCGGCACATACGAGACAATGATGAGAAGGA 

CTCAGCCTTCCGTGGAATGTGTCATATGATCACGGTGAATCCAGCTGGCGTGGTAGCAGACTTTATATTCTTCTGCGATGCCATCGCTTCCTGGG 

TTAACCCACCAGAGGATCTGCATCAGATGATACAGAAGATTCTGCACGGTTTTAAGACCCAAGTGGGCGAGGAGAACTGGCGTCGATTTGTGGAG 

CAATTCCCGCCAACCTTGGCCGAGCGCCTGACCACAATGTATAACATCTAAGGCTAGCCATGCCATAAAACACCCACCTTCACCCATTAGCGAGC 

CAAGCAGCAACTTTTTAACTAGACAATTATGTACAAGTATGCGT 

(SEQ ID NO: 827) 

Start ATG: 125 (Reverse strand: CAT) 

MTWEPQGEGLQQIIAILKESQSPDTATQMAVQMKLEEFNRYPDFNNYLIYVLTKLKTEDEPTRSLSGLILKNNIRMHGTTLQPEIVEYIKHECLQ 

AVGDASPLIRATVGILITTIASNGGLHNWPQLLPSLCEMLDNQDYNVCEGAFSALQKICEDSAEILDSAALNRPLNIMIPKFLEYFKHSSPKIRS 

HAIACINQFIINRSQALMLNIDSFIENLFHLSSDEDHEVRKNVCHGLVMLLEVP^DRLMPHMSQIIEYMLLRTQDTDEGVALEASEFWLSLAEQS 

ICKDVLAPYLAQLAPVLVRGMRYSEVDIILLKGNVEEDDMVPDREEDIRPRFHKSRAHTIRSTQEGGAGATGDDDDDEFEDGMDDDSSLSEWNLR 

KCSAAALDVLANVFREDCLPWLPILKETLFHQEWVIKESGVLALGAIAEGCMQGMIQHLPELIPYLISCLSDKKALVRSITCWTLSRYANWVVN 

QPHDQYLKPLMEELLKRILDSNKRVQEAACSAFATLEEEACTELVPYLEYILKTLVFAFSKYQHKNLLILYDAVGTLADSVGHHLNKPQYIDILM 

PPLIDKWNLLKDDDKDLFPLLECLSSIATALQSGFLPYCDPVYRRCISLIEQTINQEMLCKQNQTYDHPDKERMIVALDLLSGLAEGLDRHIETL 

VANSNIMHLLYQCMQDVLPEVRQSSFALLGDLTKACFPHVHPFMADFFPILGQNLNPDFISVCNNATWAIGEICMKLGEETKQYIRLVLSDLFII 

INRPNTPKTLLENTAITIGRLGYVCPVEVAPYLPEFVRQWCTSLRHIRDNDEKDSAFRGMCHMITVMPAGVVADFIFFCDAIASWVNPPEDLHQM 

IQKILHGFKTQVGEENWRRFVEQFPPTLAERLTTMYNI* 

(SEQ ID NO: 828) 

Classification : known_f lybase_gene 
Gene Symbol: Trn 
FlyBase ID: FBgn0024921 



Celera Sequence No. : 142000013384640 

TAACTGTAGAGGGGAAAATGGGCTTAATTCAAGGACTAATAAATTAATAATTCCATAATATATTTTAAAATGAAACGTGTAAGTACAACATTAAT 
TTTTTATGTATATAAAATTTAATACTTTTCAGTAAAATATGGACATTTAAAACAGTTTCGGTGTAAGCATTTCAAATTAAAAACTCCAGAGGACG 
AGAGCTGCAGTTACAAATAAAATTCGTGTTAAAAATGTTTAACTTGCATTCGTCTCGATTTTCTTACGTCGCTTTAGTGTGTTTAATTCCCTAGA 
ACTTCATAAAAGCCGTAAATTGCACCTGAAACTGGGTAGAACTTCATAACTAAATCGGGTTTCTAAAAATCCAATTAAGCGTATTCAGTTTATGG 
TTGGAAGCAGACACAATCACTTGCTGAAATCTAAAGAAACCAACCACATGGAGCCGTTCCCAAAATTCGTTATAATATTTCTAGCTGCCTTTATC 
AATGTGGCATATGGCATGCATAATCCAATGATGCGTGATAGTGAGCTATTCTAAGACATATCTATATAAAATATGAACTAAAATTTATTTTCATA 
TATTTTAGATTGTAATTTGGGAGAGCATCTAACTTGTAGGCTTAAGTTTATTCACTGTTGGCTGTACGCGTGCGTTTGTCTTCCGAATCACTCTT 
AC T T GGG AC T T GGAAAT GG A T GC C TT C AC AT C AG ATAC GCT T AAAT AT AAG T T AT GAAAT C TAT AT T C G T T T C T TC ATT C AT C T T G AAT C AT GT G 
GATATGATTTTCCTATTTTATTGTCAAAAGGAAAATGGTCTTGCCACAACACATATGCATGAACATCCTCAACTGTTAATAAACATTGAAAACTA 
TCCTGTTTTTTGAAGTAACAACTCCTTGGTCACAATTCTCAAATATATAATTTACAACCGTCGTAATAAAACGCATAAGACTTTCAGCTTACATG 
CTAATCACGAACTACTACTTTATTTGCACTGAACGGCGGGGAGAGGATGACTACTTCGCTTTGTCCAGAAAGTCTTTGGCCTCGTTGATCTTGGC 
CGCCAAATAGGGAGATCCTCCTCGATCCGGATGGTTCAACAGCATGATCTTCTTGTGCGCGTCCTTAATCTGTAAACAAATATAGGAATAAGAAT 
ATTTGTATTTAAGCCAAAGGAGTATTTGATTTAAGCCAATACCATAAATATTATATAAACAAATTTGACAATCACAGGAACTACTTAGTTTCGGC 
TTTTTCATTGCTTGGAATGCCTTTATAGAAGTATACAGCTCGGACTTCTGAGAGATCCTAAGCAAGAGAACTCCACATTACCTTTATCTTGGACG 
CACTGGGGCTGACACCTAGGATTAGGGACGCCTCCCGCTTGTTCATCTTGGGATCGAAGCCGCCCTTGTAGTATTTGGAGGCGGCCATGCTCTCC 
GCATCGTATTTGGGCAGGTTCTTGAGGGCCTCGTTGAATTTGGTGGTCATCTGGGGCATGCGGCGCATCAGGTGCTTTCCGGCGAATCCCACGGC 
GGCCACGCTAAGACCCGCCAGAATTACGGAGCTCGCCTTCGGAATGGAATAATTCGTATTGGTCGACATCGCACGGACAATTTAATATTATGAAA 
CGAAAGAGATTCGCAATATTTTACATTTGTTTTTTGATTAAAACAATGTGATTTAACACAGTACCATTTTTATGTAATTTTAACAGCTGTAAGTC 
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AGTGGAACTGCATAGCCTGCTGTCAGCTGTTGGATTATCGATAGACTCAGGGTTGTCGCTCTATTGAAAAAAATATTGCTAATTTAATTAGTATT 
TTCATTTAATACTGTTGAAATTATAGTTAAATACTAATTAAAAGGATAATTTGTATGCACTATGCTTTTATATTATTTGCTTTAATATCTACATT 
TTAAATCCCCATTTACTACGACACTGCAACCAAGCTGCTGATTGTCAACAACCAGGGCTGCAGGCTCACAACAGCAGTTGTTAAAACGATATTAA 
GGTGATCTATCACCAACAGTTTGCAGCTGCAAATACATTAAGTGCTTAAACCAAACAGGTATTTCTAGAAATTCAGTGAATTTTATATCATGCTC 
ACAAAATCGGCGCATCTGCTATCGAATGTGTGCGCAGACGTTGCCTATCGATAACAATCAGGTCGCATCGATAACGGGACTCAACAAAGTGCACA 
AACAAGTGAAGAAGAAGCAGGAAAAAATTTTGATGTGCAAAGTTTGGAAACCAAATAATATTGATTGACGAAAGTAAGAGTTACGAAATTGGTGA 
TCTCCACATCCTGGGTGGCGTGGTGGTGTGCGTGAATGATCGTCAGCGTACCGCTAATCGTGAGCGTGCGTGCGTGTGTGCGTGCGTGCCGTGTG 
TCTGTGTCCGTCGGTCGCCGAGAGTGTGTGCGTTAGCTCCTCAGCTGGGGAAAAACTTTTGGAAAATTAGCAGTTCTCCGAGATAAACAGCCGTC 
AAACTGGAGCGAATTCCGGAATATCTGCGCGACACTGTCGATTGGCTTATGGGGTGAGTACCGTTACACCACTAAAGACAATGTAATCTATGGCT 
CGAAAGTGTTAGTCCCGAAAAAAA 
(SEQ ID NO: 829) 

Exon: 1589.. 1317 
Exon: 1114.. 1001 

Start ATG: 1589 (Reverse strand: CAT) 
Transcript No. : CT22766 

ATGTCGACCAATACGAATTATTCCATTCCGAAGGCGAGCTCCGTAATTCTGGCGGGTCTTAGCGTGGCCGCCGTGGGATTCGCCGGAAAGCACCT 
GATGCGCCGCATGCCCCAGATGACCACCAAATTCAACGAGGCCCTCAAGAACCTGCCCAAATACGATGCGGAGAGCATGGCCGCCTCCAAATACT 
ACAAGGGCGGCTTCGATCCCAAGATGAACAAGCGGGAGGCGTCCCTAATCCTAGGTGTCAGCCCCAGTGCGTCCAAGATAAAGATTAAGGACGCG 
CACAAGAAGATCATGCTGTTGAACCATCCGGATCGAGGAGGATCTCCCTATTTGGCGGCCAAGATCAACGAGGCCAAAGACTTTCTGGACAAAGC 

GAAGTAG 

(SEQ ID NO: 830) 

Start ATG: 1 (Reverse strand: CAT) 

MSTNTNYSIPKASSVILAGLSVAAVGFAGKHLMRRMPQMTTKFNEALKNLPKYDAESMAASKYYKGGFDPKMNKR 

HKKIMLLNHPDRGGSPYLAAKINEAKDFLDKAK* 

(SEQ ID NO: 831) 

Classification: hypothetical 



Celera Sequence No. : 142000013384523 

TGCTCTAACTAGGCCACACACGGAACAGGGATTGGAAATTTCACTTTGAGCTGCAAAGTTCTGGCGATCACCTCCACCTCTCGCAACGCGGCAAA 
ATATCAACTGAATATTTTCAGACCTCGAGTGTGTTATTTGCGTTAAAAGAAAGAGTCGCGAATCGCTGCGAAGAGAAGAAAGAGAAAAGAAATAC 
TAAAAGCTCGCTCCGAAGCTTTGGCGACGCGAGCATGATTCTTCTCATATACAGTATATAGCATTTGAATTGATTAAATGCTAATTTAGCATGCT 

GT GGT C GT T GC C GCC AT C GGAG AT C T T AC AC T G AAAC T T GAAAAGC GG AT T C TG AAT AT C AAT T CT T C T TG AAAAAAGGC TT T T AT AT GT AT AC A 
TAGATCTATAGCTTCCTAAAAATCATTGCAGCTCATTATCAAACATGCTTTAATGCTGATTCGTCTGTATAAATATTTAATTATTGTCTACCAAG 
TCATTGGAAAATTTTCACCACTATGCTTATTCGCCAACACTCTCGGAATATTTTATTTTTTCCATGGTCTATTTGTATAATTTCTTACCTTAATG 
CCAAGACCATTTGAATATTTATACCCTGTCCTTTGCTGTTTTGTTCTCTTATCAATGCCCTTCGCATTGACCGAGTTTTCAGATTTCCTTGCCTT 
TGGCATCATTAATCCCTTTCAACATGGCCAAAAGCCATTCAAAACTGAATTGTTGAGAGCTGTCACTTGGCATTTTATTGCCATCAGATAGCTGT 
ACTCACAACAAAATTCTACGACAACCCAACCGACAAAGCCCACACGATGATAGTTAATTAAAAAGTTGTTGGCACACTCAGAATATCATGCAAAA 
TTAGCCTGGCTAACTGGCCTTATCATAATTATCAGCAATCCCCAAACAAAACTTTACAACATGATAATTATTAAATAAAAAGCAAATAACCACTA 
ACAGTAGAACCGAATTAACATTTGTGAGCTCAGAAAACAAAAGCAAAATACAGGTGAAACAAAATGCAGCAGCATCCGTTTACTAATTTATACGC 
AATCTCAAATAATTTACAAAACAAATGGTTAACCGAAAGAAATATTTTAACAAGCTTTCTTGAGGCATTACAAAAATTAAAATAATATATTTCAG 
ACAGAGCAAGATATCTATTTAAATATTATTTTATACAAAATGAAGCAATTGTTAAACAATTTGGACAACGCATGCAATCGACCCTATTTGTAATT 
TAATTGATCAAAAGCGAATGTGTCTTAAAGCAGTACCTCTCTACTACACGCTTGATGTTAAATTGAATTTTTGCAATTTTTATTTTTCCGGTTCT 
TTAAATATAATATAATAATAATAATAATAATTTTTCCAGCTGATTTTTATTTGTAGTTTTGTTTTTCTTTGTATTTAAACTTATTATGTTGATGC 
TGTTGTTGGTAGCTGTCTAGCTTTTAATGTTTAATGCTTACATCATTTTTTCAGTAATTATTATTAAATTATTTCCGTTTATGCTGTTCTGTTTC 
TGTTCGGTCAAACACACAATTAATCGCATGTTCGCCACCAATCCGCCAGTTGCTGCTCCAATTGCCACTGCCACAGCTGCAGCGGCTGTTGCTGC 
TGATGTTGCTGCAGCTGCTGATGTTGCTGCGGCCCCAAGGACGCCTTTGCCGCTGCCGTTGCTCTTGGTACCGCCGCCAAATTTGCTGCTCCTAA 
TGCTAATGCCGCTGCGAAGGCGGTAGCAGCGGCTGCTGCGGCGAAAATGACTGCGGCAGTTGATGATCTTGCTGTTGTTGCTGCTGCAGCTGCTG 
CTGATGATGTTGCTGCTGTGGCGAGTGTTGGTGTTGTGTTTGTTGCTGCTATGGTGGTGACCGCAGTTCCTGCTGATATTGTGGCTATTGTTGTG 
GACAACAGCACGGCGGCCATTGCCACTTACAATGACGCTGCTGCAGCTCCTCCTCTTCTGAATCTGACTATTGTAAACTGTTGATGTTATCTTTG 
TTGCTTTTACTGTTGTTGTTGCATCTGCTTTTACTTCTGCTATTGTTGATGTTGAATTTGCTGATTCTGTTGGACTGCTGCAATCTGGAACGCAT 
CACATAGCATACTTTCTAGTCCACTCTCGTGCCAGCTCATTGTATTTTTCCCGATCGGTTTTATATATTCTGGCAATCTCTGGAACAAGAGGATC 
GTCTGGATTGGGATCACAGAGTAGAGAGCAAATTGATAATAAAACTGAAAGAGAATATGGAAACACATGTTAGTATCATTTTCGATGTAAGAACT 
TTAGATGAACTCACCTTTTGAAATAGTTAATGCTGGCGACCACTGAGATCTTAATATATCGAGACAAATCGATCCATTGCTGTTGATGTTTGGAT 
GGTATATGCGCGTTGTAAAAGCCACTTTGGGTGGTTTAAAGGGATAGTCTGTTGGAAAATGTATAGTTAAGAAGAATACACCTCCTTGATAAGGG 
CTGTCCGGCTGTAAGAAAATTGATAACAAATACTATTAGTACAAGAACACTAGTTTACTACACACCTAGTTTATTTTATTTTGGTTTATTTGGAG 
GTCAAATCATAGATTGCGTTCACTTGAAAAATTTGGTTAAAAGAATGCTTTGAAATAATAAGAAGTTCTGAAGTAAATTATCGCGAGCTTCAGAC 
GCATGCCACAATGTTTACACGTTTAATGGATTGTTCTACACATGTACTGGATTTCTATTTAAACACACGATTACAAATGTACTTGGCTATTATAT 
ATTATCTATCATTTCAAAGCATCTAATCAACTGACGTAGCTTATTTATAAATACAAATAGAGTATTGTTTTATTATTGCATTCTAATTTACATTA 
ATGATTTATTAATCTGCTATTTTTAACTGTATACGGCAATCCCTGATCAGACCAAAAATAAACCAATATAAAATATTTCAGGCTACAAATTAAGA 
TAGAATCGATTCGATCTTTCGGTGAAGTCCTCTTACCGGGCCCATTATTGTAGCTTGCCAGTGAAATACTGTAAGAAAACAAGAGATACATTAAT 
CAAAACTGTTAAAAACGTAGTTGTGATATAATCCTAAAGAGCAAACATAAACTGCGCTGCAGTTAAACGTTTTCTGTTTGCTTTGATATGAGTCC 
TATATCTACCTATATCTGCCCTAGATGGGAAATTACTGTGTGTACGTAAAGCATCTTGGATATTTACATTTGATAATGAGATGATGACAGAGTAT 
ATTAAAATTTTCCCTTCGCTCAAAACTAAAACGTATTTTTCCTTGCCCAAGTTATATTCAAGTAATGAGTTTAGATACAGGAACTGATTTGCTAA 
ATAACTAGGTAAATAAAATTAAGCTTCCAACAAGCCACAGAAATGTAATCTTCTTGTAGTACAATACCACTTTCCCCTAGTTCGCCACGTCTAGG 
CCCCACTGTATATGCATCCAATGCAGCAAACTCGCTCTCGTCTAAATAAATGCACCACTCTCTCAGCGCCGCCCACCACCCCACTTCACCACTTC 
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CATTCTCATGGCCACGCTCTTTAGCGATTCCCCCCCACAAAAGAACAAACACAAACAATGCCAGCCAAAGAGCCTTGGCACCGACAATTGGCAGC 

AGCGTCAGCAGCAGAAAAAGCACCAAATGCTCTCCCCCTTGAAACAAAAACAAAGTTTTTAAAGTAAGGAATATACCTATACCGAGATACCCTGT 

AACCAACCAGGCTTTAAAGCACACATACGAAAAATTCATTTCTATATAAAACTGAGATATTTTTATAAAATCATGTTTTCTTAATGGTGGAACGC 

AATATATTCGATTTTTCAACCGATTTCATGCTAAACAAACCTTGACTATTATTGGCATACATTTCCATGAGTTCTATATATCCTTTCACACTGCA 

GAACATCACATATGTATATATATACCCTTGTACCCTGTGAGAGAGCACAAAACGTGTGGCAGTTGGCTCAGAATCATGGCCAAAACAGAGCTACT 

GGCTATGCCGTCAATGTCATTCTGGTGAACGAATCGAATGCGACTGGGAAACTCTCTCTGTTGTGGCAGAGCACTGAGAACTGAACTGGGAACTG 

GGCCCTCGGTAACCAGCTGCTGCCTTGACCATAATTACACGAAATGTTCATTTGTCCATCTGCAGTAGGGAGTTTGGGTCTGGAAAATGCGTCGG 

CATTGTTTTTGTCATAGCTCAGACGAAAGTAATCTCATAGCTGCAGCCATTTTGATTAATACCACTTAGACAACAATCAATAATTTCTGAGCATT 

GCTAAGGCAACAAAAACCCTGCAACCAGATGATATTAATAAAAGCAATGCATATTTAGATAAAAACATCCTTAGTGCATTTAAAGAATTGATTAA 

TCCGACCATGGTAGCTTAGCTCAAAAGCATTCAATCAATGTTGAGTTTAAATGTTTAAATACATGATTAAATCATAAGGTTTCCTTTGCCATACT 

ACAATTACAATGGATTAATTTCACACACATACACACACACACACACACACACACAGCGCCACACTGAGCTGAAAATGGATGGGAGTTTCCATTGC 

ACTCGAGTGAAAATTCGCCACCCACCCAACCAACACCTAAAAAACCAAAGTAAGCCCCGCCCTGAGCTGCAATTCCTTTCTTTTACCCCAGTTGC 

TGGGAATTCGAAGGCAAGGGTTCGGAACAGCCAAATTGATAATAAAAGCAAAGTGATAAGAACTGCGGAAATTATTGAGCTGACTCCTCAATAAT 

CATAGAAATATTCACGCTTATATACAAACATACTTTTAATTGTATATAATAGCGACGTTTAAACGCTGTTAATAGAAATCGAGTGAAATCGCCAC 

ACTGGTCGTTCATTATCAGCAAACGATGACAATTATTAATATTATTATTTACTCTGCCTCGCGGGAACTTGAATGAGTTTGATTTCCTTACGACC 

AGAGAAACTGGGAAATGAAACACAAAGAAATGAGCTAACGAAAGTGCTGGGATATATATAAAATATTTCAATATTTTGCCGTTCTGTATAATATT 

TTCATTTCCACAAAAGTGCGTTCACTAAATTATAATATATATATAATACTGCAAAGCATTACGAACAAAATTTCTATAAGTTCCTATAGTTAAAA 

CAATAGTTTATATAGCTTTTTGCAGAATAATTTCAGTTCCTAGCCATTTTCCCTAAAACTAAGTATTTAGTTTGCCATTTTCTTTTTAAAATTTT 

ATCGCTTACAAAGTATGCCGGTGTTTTTTTTTTTTZ^ACCGAAATCGTCGTCAGCAAGACTATATAATTTACGATTTCACTTTTTGTGACGTCATT 

GGAGAGTGACCCCCATATGGCTGGCACCTCTTCCTCTTCCCGCTTGAGTGGTGAAGGGGTGAAGGGAGGGGGGGGGGGTGCTTTTCAGTAACGGT 

TGTCTGTATGCAGTACAGCTCTACGTGCTATAACGTAACCGAATCCGATCCACAATCAGCGTGAATTTGTTGGCTAATTGCATGACGGCTTCGTG 

AGTTTTTTAAATCGAGTTTAAGACAGCGAAGAAAAAAGAACCAAGTCATGGCGATCAGCTGTTTGGAGGCCGTCGCTATCACAAAATAAAAAGAA 

C C G AAT C C AAAC T T GGC GC AAAAG AAT T C C AAAT T AGC AAAGT CC AAAC T AAAAGT AAC AGC C AAT AGGAAAAC AAAC G AT GT A C AAAT T T AGT G 

T GT T AT ATG C C T AGT C AC G AT CG AAAT CT AT GAC G T GGT G AT GG T C GAT CGAC AAT AT T TGCATTTTT T AT T AACAAT T C C GAG AT AGTGGT AT T 

GTGAATTTTGCCTAATATATATATAGTTCCTATTCCTTGCATCAGACTATATACTATACTGACTCAGACTTGGCAACTCAGAAATATTTCACATA 

ATTTCATGTCTTGGAGCTCCCACACGATATGGTGAAATCGAAACTTGGGTTAAAGCCCCCCCAACATAATACAACTGACTCACGGCAAATCAACG 

AGTTGTTGAATAACATTAGCAATACTTACAATCATCTCCAACTGGACCGGCTGAACATTGTGCAGGTGGATCTCTGCCCAGATCTTGCAGTTCCT 

GTTGAACAAAGACATTTTCAAATTAGTCATTGAAATAAAAGTTAAATATAGGTATGTAAGCTCAGTCAATTTCTGATTGTTTTTAACAGTTATGT 

CAAGTCCTACAAAAAGAACTGGTTTTTAAAGAGTATATCTAATTTCTAGTCAAACCTTATATACTACCTACTTTCTGAAAGCATAGGTAACAAAC 

CTAGCAATTGCAATACAGGAAACCCCAAAAAGAATTCTCGAAATGGGAAATGGTCAAAAGTGCATTTCATGGATATCTGCTTATGTTTTCCGGTA 

GTCGGTTCTTTTGTTTTGCTCTATGATCCAGCTATATGTACATACATATGTATGTATGTACCACGGTTTTCCGGGTGTGCGTACATTTTATGCTG 

TGTGGTTGGTATGCAAATGATGTAAATTTTAAATGCAGAATTTAAAAACAGATATGTGGGGATACAAAATAAACATTTTTATGCGAGAGCTTCGT 

TCGAAGCGAGTGCGAGCGAGAGGGAGAGACGTGAAAAATCAAGTTCGTTGGCCCGGACCACAAATGGCGCTGTCTCCTTCTGCCCCACCCCCCGA 

CCAGTCGCTCCATCCCTGTTGCACTCGCACGCACACAGCTGAGCGGCTCCCCTCCCTCTCTCTTACACTCCGACTCTTCTTCACTTATCACAGCT 

GC C C C AC AC AC AC AC ACT C C AAAC GGGGAGAG AG AAT G AGGTGT AAG GT AAACC GGC AAAAAAAAAGAAT AT AT AT AT AAAAAG AAT AAT G AAG A 

TGAGCTGCGAAAGAGTAGCGCGACTCAGTGGAGAAGTAAGAGGAGGAGCAGCAGGAGGAGGAAGAATGAAGGAGAGTGAGAAGACTGAGCTTCAG 

TGTATGCGTGTGTTTGTAACGGCACCACAAAATCAGAGTGGAGCACAACAAAAATAGCCGTCGCGAAAAACACCGTGTTTCTTTGCCCCATCTCA 

CTTACTCATTTTGGGAATTGTACCTGTTGAGCAAAAATAAATTACAAGAAATTGTTACATTTCTTTTTTCGTTGGGAGTCGGAGCGGTTATGACT 

CATATCAGACGAACAAATGAAGTAAAAACAAACACGCACACTTGTGCACAGCAAGAAAATTCGATAATATTATCAATATTAGACCAGGTGCACTA 

CCAATACAAACTTCTATTTAATTCTTATTCGAATAATTAATATTGTGAACTTTAAGAATTGGTTCAGTAAGACTAAATGCCATATGCATTGGTTC 

TAAAGCCCATTTCGTTCAGTGTACGGTTCAAGTGAGCTCCGATCGGCGTTTAAGCCGTTCTCTCTCGCTCTTTAACTCTCTTTTTTGGCAAATGA 

GGCCATCTAGAGAAAAAAGGCTTTTGAAGAATTGACCACAAAAAGGGGGCGTGAGTGATTTTAAGAGTTTACTACAATGTATACTTTTATTTCCG 

GTTTGCCGCTTGCCATTTCGTTTTCATTTAGGCAGGCATTAAAATGTTATAATGGATAGAAAACTACGAAAAACGAGGAAAACTCAATGGCCATG 

ACGCCGACGCAGGCAGCGCTGCTTTTGTCGAGACAGTCAGCAGACCCCCGCCCCACACATTCAAACACAAATACACACCACGCTCACACATACGC 

ATAGATATGAACAAATTCCGCATTCGCTGCTCATTTTGTATGTAAATTATGCTTAATTATAATTTATAACTAATTATTTGGGGGCCCCCTTTTGG 

CTGACGGTGTTTTGCAACACTAAGCCCCGCTAGCACTCACACACTTGCGCTCTCTTTGGCTCTGCCCCATTGCGCGAGCGGGACCGAATGAGGCC 

TCAGCTGTCGCGGGCAAAGGGAATGCGCCAGGGCCAGAGCGAGATGGCATGCAATGGCCATAGCCAAGTGATGGCGAGACTTTGTATTTGCGAGC 

GTTTTAATGTGTGTGTGTGTGGGGTATGACTATGATAACCGGCTTCAAACGTAAAAAAAGATCCCAATTTGCCAAGTAACATTTCCACAACTTCG 

AGATATTACTAATGCTTTTGAGGAAGGCTGGGATGACTTTGAAAAAATATCGGGTTGTCAGGGCCTTGAACATTGGGGTGTAAAAAAGCATTTAC 

TGGACAAGGACAAACGACGCGCAGGCACTTTAAAAATAATTATCGAAGCACGCACGCTAGAACCACTCTACCTAAATACATATATAAATATATAT 

GCTTGGGGGCAGCACTATTTCTGAATTATCAGCGCAAAAAGTAGGGTTAGGTTAATTTTATACGCTTCTTGCTTTTGCTCTGTTTTATTTTTTGC 

TGGATCTTTCTGCGGCGCTGGCAGTGTGTGCGTTTATGTGTGACTGTTGCGAGGCTGAGAGTGTGTGCGAGTGCCGTGTACAGCGTGTATTGGGT 

AACAGCGCCCGTAGAGCTGCGAAAACTCCGCCGTATAGTTGCATTAATGTAAATGCATCCAGGTGCCCGGCACACACGCACACAAACAATGCACA 

TTTGCATTGCACTTTCACGCACACGCATCGCACACACACTTATCCCACACTCTCGCAAAAGTAACGGAAGCGATGACTGCACTACACGTTGCCTT 

GCCCACCACACACACACATCCTTATACCTTTTTCACCACCCTCAAAATTTCTAATCTGCTCCTAATTACTAACTTATTAATTAAATGGTAACTCC 

ACAGCTCATTTTCTCCCTTCCCTCTTGCACGCTTCTTTCCCCCCGCTTCTTCTTTCTTCTCCTCGGTTACTACCTACGTTTCACATTTCTTGCAT 

AAAATTTTTCTCCCGTTGAAAACTGATGTCTGTTCGCTTTTTTTTTAGACTATCATTTTGGTGTATCGCAAAGGGTTCTGTTCTTTCTTTTTTTA 

GTCCGTTTCGCAAAGTTTTTTGAGATATTCCTTTTTTCACATACCTTATTGATTCTTTTTAACGCCATTTTCTGATTGAAATTTGATTTTCTGGG 

TTGTGTGTGTGTTTGGTTGGTTGTGTGTGTGTTTGGTTGGTTGTGTGCTGGTGTTGGTGTCTCGCGTATTTCTTTTGTTCTTCGCACAGAATTGG 

GTTTTATTCAATATATAATCCTCCTGCTTAGAATTGAACGAAAAACGAAACACAAAGCTGCGTTGCGCCTCTAGAACTTTGATGCTTAAATATGG 

CTCTTTCGGTTGAGTGTCCCTTGTCACGAAATAAAGCAGCACCAGCGCACTGCAAGTAAAAACGGCGACCCGACGAAGACTACGGCGACGATTAT 

TTTTTCACAATCAATGCGGGGCCTTTCGACCGTTTGTGCGATAGAGACGACCGTTAGTGAGCTGAGAACGCAACGAAGCACAATCGTTACACCGA 

AGTGTAAGCCCGAGCGAGAGGAATGGTGTGTCTGTGTCGTGTGGTAGTGTTGTGAAGCGGAAGTCCAGCAGAGCAGTCCAGCTATATGACCTTTT 

CGCTAGGGGTGGCCAGATCGATTGGGTGGCAACGCCTTTTAGGTGAGGGATGAATCAGTTAATTACAATTTGATTGCCAATGCAAGACTCTTCAT 

ACTCACTTTAGTTCGGGAACAGCCAGAAGAAATAGTTTTACCGGCAGCGAGGAGGTGTTATATTTTATTCGAACTACCTTCTTATCGATGGTGCG 

TGCGCCCAAAAGCTTGCAACTTGTATGTAAATTTGTTAACACCCAACGTGTTGCAGGCTGCCGGAATGTTTTTACTGAGCAATTTTGAGATGTTG 

GTAATCCGAGTTCTTGAAAACTTTCCGACTAGCTACTGAGTTCTACAACTGAATTGACAGGGCGTTATTTTAAAATTTAAATTCACTCCAGAGCG 

CTGGTCTGGCAAGGCCGACGTGCCAATTCAGGATACTTGTCGATATTTTCCGTTGTGCTATCGATAGTTTTTGGGAATAACAGGCGTTAGTAGAT 

TTTTAGTATTTAGATTTTATTATCATAAGGATGAAAATATTTTTAACAAAATAAACTGGTTCATTTAATTTAACTTTTTATTTTCTCACTACTCC 

AAATGGCGCTTCCGCTGTCTGAAACTGCCGTTATAATAGTATGAACAGTTAAAACAATCGACGGTAGCTGAATTTAATCAATGCAGCCCTCTTAC 

CCTATACATCGATAGTATCGCATAAACATCGATTTTGGTTCCAGCTCTAACTTTAACTGTCTGCAAAATATTTCAAACAAAAGTGTTTGTTAATT 
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AAGTAATATACTTAATATTAGTGGCTGTTTTGCTTCGACATTGATCGAATTGCGGTTGTGCAAGTGTTTTGTTTGATTTGTTGGCGTGTTTTTCC 
TAGTTTGGCGGCCCTTTGAACTCGTTCGTGTATCGCCATCTCTCTCCCACCCCCGCAATCGGCTTCGTGCGTATTTTTTTGTGAATTTTTACACA 
GAACTCACACTTTTGTATGGCCCAAGAATTTGTACAATATTTTTGCACTTGAAAAACAAGAAATAATGAAGAAAGCCAAGATAAACGCGACGATG 
TGCAAGTGCTAATCATTCAGTTTTGGCAAGCGCTGAACTTCGCTGACTTCTTGAACAGCAAAAAAGATGGAGGAGGATAATGCCAAGGTCCTAAG 
TAATAACAACAAGAACAACAGAAACAACAATAAGGGACGCCAGCAGCTGCAGGGACACGCCCACAGCTCCGCCACCGCCCACGCCTCCGTCGATA 
TATTTATATTCGATTTATAGTAGCTGTAGTTGTAGTCATGCCAACAACAAAAACGCAAACACAGACAACAAC 
(SEQ ID NO: 832) 

Exon: 9807.. 9602 
Exon: 9535.. 8975 
Exon: 6078.. 6015 
Exon: 3013.. 2982 
Exon: 2478.. 2295 
Exon: 2229. .1001 

Start ATG: 8998 (Reverse strand: CAT) 
Transcript No. : CT22777 

AGTTTTCAAGAACTCGGATTACCAACATCTCAAAATTGCTCAGTAAAAACATTCCGGCAGCCTGCAACACGTTGGGTGTTAACAAATTTACATAC 
AAGTTGCAAGCTTTTGGGCGCACGCACCATCGATAAGAAGGTAGTTCGAATAAAATATAACACCTCCTCGCTGCCGGTAAAACTATTTCTTCTGG 
CTGTTCCCGAACTAAAGCGTTGCCACCCAATCGATCTGGCCACCCCTAGCGAAAAGGTCATATAGCTGGACTGCTCTGCTGGACTTCCGCTTCAC 
AACACTACCACACGACACAGACACACCATTCCTCTCGCTCGGGCTTACACTTCGGTGTAACGATTGTGCTTCGTTGCGTTCTCAGCTCACTAACG 
GTCGTCTCTATCGCACAAACGGTCGAAAGGCCCCGCATTGATTGTGAAAAAATAATCGTCGCCGTAGTCTTCGTCGGGTCGCCGTTTTTACTTGC 
AGTGCGCTGGTGCTGCTTTATTTCGTGACAAGGGACACTCAACCGAAAGAGCCATATTTAAGCATCAAAGTTCTAGAGGCGCAACGCAGCTTTGT 
GTTTCGTTTTTCGTTCAATTCTAAGCAGGAGGATTATATATTGAATAAAACCCAATTCTGTGCGAAGAACAAAAGAAATACGCGAGACACCAACA 
CCAGCACACAACCAACCAAACACACACACAACCAACCAAACACACACACAACCCAGAAAATCAAATTTCAATCAGAAAATGGCGTTAAAAAGAAT 
CAATAAGGAACTGCAAGATCTGGGCAGAGATCCACCTGCACAATGTTCAGCCGGTCCAGTTGGAGATGATTTATTTCACTGGCAAGCTACAATAA 
TGGGCCCGCCGGACAGCCCTTATCAAGGAGGTGTATTCTTCTTAACTATACATTTTCCAACAGACTATCCCTTTAAACCACCCAAAGTGGCTTTT 
ACAACGCGCATATACCATCCAAACATCAACAGCAATGGATCGATTTGTCTCGATATATTAAGATCTCAGTGGTCGCCAGCATTAACTATTTCAAA 
AGTTTTATTATCAATTTGCTCTCTACTCTGTGATCCCAATCCAGACGATCCTCTTGTTCCAGAGATTGCCAGAATATATAAAACCGATCGGGAAA 
AATACAATGAGCTGGCACGAGAGTGGACTAGAAAGTATGCTATGTGATGCGTTCCAGATTGCAGCAGTCCAACAGAATCAGCAAATTCAACATCA 
ACAATAGCAGAAGTAAAAGCAGATGCAACAACAACAGTAAAAGCAACAAAGATAACATCAACAGTTTACAATAGTCAGATTCAGAAGAGGAGGAG 
CTGCAGCAGCGTCATTGTAAGTGGCAATGGCCGCCGTGCTGTTGTCCACAACAATAGCCACAATATCAGCAGGAACTGCGGTCACCACCATAGCA 
GC AAC AAAC AC AAC ACC AAC AC T C GC C AC AGC AGC AAC AT CAT C AGC AGC AGCT GC AGC AGC AAC AAC AGC AAG AT C AT C AAC T GC C GC AGTC AT 
TTTCGCCGCAGCAGCCGCTGCTACCGCCTTCGCAGCGGCATTAGCATTAGGAGCAGCAAATTTGGCGGCGGTACCAAGAGCAACGGCAGCGGCAA 
AGGCGTCCTTGGGGCCGCAGCAACATCAGCAGCTGCAGCAACATCAGCAGCAACAGCCGCTGCAGCTGTGGCAGTGGCAATTGGAGCAGCAACTG 
G C GG AT T GGT GGC GAAC AT GC G AT T AATT GT GT GT T T G AC C GAAC AG AAAC AGAAC AGC AT AAAC G GAAAT AAT T T AAT AAT AAT T AC T G AAAAA 
AT G AT GT AAG CAT T AAAC AT T AAAAG CT AGAC AGC T AC CAAC AAC AGC AT C AAC AT AAT AAGT T T AAAT AC AAAG AAAAAC AAAAC T AC AAAT AA 
AAATCAGCTGGAAAAATTATTATTATTATTATTATATTATATTTAAAGAACCGGAAAAATAAAAATTGCAAAAATTCAATTTAACATCAAGCGTG 
TAGTAGAGAGGTACTGCTTTAAGACACATTCGCTTTTGATCAATTAAATTACAAATAGGGTCGATTGCATGCGTTGTCCAAATTGTTTAACAATT 
GCTTCATTTTGTATAAAATAATATTTAAATAGATATCTTGCTCTGTCTGAAATATATTATTTTAATTTTTGTAATGCCTCAAGAAAGCTTGTTAA 
AATATTTCTTTCGGTTAACCATTTGTTTTGTAAATTATTTGAGATTGCGTATAAATTAGTAAACGGATGCTGCTGCATTTTGTTTCACCTG 

{SEQ ID NO: 833) 

Start ATG: 744 (Reverse strand: CAT) 

MALKRINKELQDLGRDPPAQCSAGPVGDDLFHWQATIMGPPDSPYQGGVFFLTIHFPTDYPFKPPKVAFTTRIYHPNINSNGSICLDILRSQWSP 
ALT I S KVLLS I C SLLCDPNPDDPLVPEI ARI YKTDREKYNELAREWTRK YAM* 
(SEQ ID NO: 834) 

Name: effete 
Classification: enzyme 
Gene Symbol: eff 
FlyBase ID: FBgn0011217 



Celera Sequence No. : 142000013384523 

TGCTCTAACTAGGCCACACACGGAACAGGGATTGGAAATTTCACTTTGAGCTGCAAAGTTCTGGCGATCACCTCCACCTCTCGCAACGCGGCAAA 
ATATCAACTGAATATTTTCAGACCTCGAGTGTGTTATTTGCGTTAAAAGAAAGAGTCGCGAATCGCTGCGAAGAGAAGAAAGAGAAAAGAAATAC 
TAAAAGCTCGCTCCGAAGCTTTGGCGACGCGAGCATGATTCTTCTCATATACAGTATATAGCATTTGAATTGATTAAATGCTAATTTAGCATGCT 
GTGGTCGTTGCCGCCATCGGAGATCTTACACTGAAACTTGAAAAGCGGATTCTGAATATCAATTCTTCTTGAAAAAAGGCTTTTATATGTATACA 
TAGATCTATAGCTTCCTAAAAATCATTGCAGCTCATTATCAAACATGCTTTAATGCTGATTCGTCTGTATAAATATTTAATTATTGTCTACCAAG 
TCATTGGAAAATTTTCACCACTATGCTTATTCGCCAACACTCTCGGAATATTTTATTTTTTCCATGGTCTATTTGTATAATTTCTTACCTTAATG 
CCAAGACCATTTGAATATTTATACCCTGTCCTTTGCTGTTTTGTTCTCTTATCAATGCCCTTCGCATTGACCGAGTTTTCAGATTTCCTTGCCTT 
TGGCATCATTAATCCCTTTCAACATGGCCAAAAGCCATTCAAAACTGAATTGTTGAGAGCTGTCACTTGGCATTTTATTGCCATCAGATAGCTGT 
ACTCACAACAAAATTCTACGACAACCCAACCGACAAAGCCCACACGATGATAGTTAATTAAAAAGTTGTTGGCACACTCAGAATATCATGCAAAA 
TTAGCCTGGCTAACTGGCCTTATCATAATTATCAGCAATCCCCAAACAAAACTTTACAACATGATAATTATTAAATAAAAAGCAAATAACCACTA 
ACAGTAGAACCGAATTAACATTTGTGAGCTCAGAAAACAAAAGCAAAATACAGGTGAAACAAAATGCAGCAGCATCCGTTTACTAATTTATACGC 
AATCTCAAATAATTTACAAAACAAATGGTTAACCGAAAGAAATATTTTAACAAGCTTTCTTGAGGCATTACAAAAATTAAAATAATATATTTCAG 
ACAGAGCAAGATATCTATTTAAATATTATTTTATACAAAATGAAGCAATTGTTAAACAATTTGGACAACGCATGCAATCGACCCTATTTGTAATT 
TAATTGATCAAAAGCGAATGTGTCTTAAAGCAGTACCTCTCTACTACACGCTTGATGTTAAATTGAATTTTTGCAATTTTTATTTTTCCGGTTCT 
TT AAAT AT AAT ATAAT AAT AAT AAT AAT AATT TTTCC AGCT GAT TTTTATTTGTAGTTTTGTTTTTCTTTGTATTTAAACTTATTATGTTGATGC 
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TGTTGTTGGTAGCTGTCTAGCTTTTAATGTTTAATGCTTACATCATTTTTTCAGTAATTATTATTAAATTATTTCCGTTTATGCTGTTCTGTTTC 
TGTTCGGTCAAACACACAATTAATCGCATGTTCGCCACCAATCCGCCAGTTGCTGCTCCAATTGCCACTGCCACAGCTGCAGCGGCTGTTGCTGC 
TGATGTTGCTGCAGCTGCTGATGTTGCTGCGGCCCCAAGGACGCCTTTGCCGCTGCCGTTGCTCTTGGTACCGCCGCCAAATTTGCTGCTCCTAA 
TGCTAATGCCGCTGCGAAGGCGGTAGCAGCGGCTGCTGCGGCGAAAATGACTGCGGCAGTTGATGATCTTGCTGTTGTTGCTGCTGCAGCTGCTG 
CTGATGATGTTGCTGCTGTGGCGAGTGTTGGTGTTGTGTTTGTTGCTGCTATGGTGGTGACCGCAGTTCCTGCTGATATTGTGGCTATTGTTGTG 
GACAACAGCACGGCGGCCATTGCCACTTACAATGACGCTGCTGCAGCTCCTCCTCTTCTGAATCTGACTATTGTAAACTGTTGATGTTATCTTTG 
TTGCTTTTACTGTTGTTGTTGCATCTGCTTTTACTTCTGCTATTGTTGATGTTGAATTTGCTGATTCTGTTGGACTGCTGCAATCTGGAACGCAT 
CACATAGCATACTTTCTAGTCCACTCTCGTGCCAGCTCATTGTATTTTTCCCGATCGGTTTTATATATTCTGGCAATCTCTGGAACAAGAGGATC 
GTCTGGATTGGGATCACAGAGTAGAGAGCAAATTGATAATAAAACTGAAAGAGAATATGGAAACACATGTTAGTATCATTTTCGATGTAAGAACT 

T TAG AT G AAC TCACCTTTT G AAAT AG T T AAT GC T GGC GAC C AC T GAG AT C T T AAT AT AT C G AGAC AAAT C GAT CCATTGCT GTT G AT GT T T G GAT 
GGTATATGCGCGTTGTAAAAGCCACTTTGGGTGGTTTAAAGGGATAGTCTGTTGGAAAATGTATAGTTAAGAAGAATACACCTCCTTGATAAGGG 
CTGTCCGGCTGTAAGAAAATTGATAACAAATACTATTAGTACAAGAACACTAGTTTACTACACACCTAGTTTATTTTATTTTGGTTTATTTGGAG 
GTCAAATCATAGATTGCGTTCACTTGAAAAATTTGGTTAAAAGAATGCTTTGAAATAATAAGAAGTTCTGAAGTAAATTATCGCGAGCTTCAGAC 
GCATGCCACAATGTTTACACGTTTAATGGATTGTTCTACACATGTACTGGATTTCTATTTAAACACACGATTACAAATGTACTTGGCTATTATAT 
ATTATCTATCATTTCAAAGCATCTAATCAACTGACGTAGCTTATTTATAAATACAAATAGAGTATTGTTTTATTATTGCATTCTAATTTACATTA 
ATGATTTATTAATCTGCTATTTTTAACTGTATACGGCAATCCCTGATCAGACCAAAAATAAACCAATATAAAATATTTCAGGCTACAAATTAAGA 
TAGAATCGATTCGATCTTTCGGTGAAGTCCTCTTACCGGGCCCATTATTGTAGCTTGCCAGTGAAATACTGTAAGAAAACAAGAGATACATTAAT 
CAAAACTGTTAAAAACGTAGTTGTGATATAATCCTAAAGAGCAAACATAAACTGCGCTGCAGTTAAACGTTTTCTGTTTGCTTTGATATGAGTCC 
TATATCTACCTATATCTGCCCTAGATGGGAAATTACTGTGTGTACGTAAAGCATCTTGGATATTTACATTTGATAATGAGATGATGACAGAGTAT 
ATTAAAATTTTCCCTTCGCTCAAAACTAAAACGTATTTTTCCTTGCCCAAGTTATATTCAAGTAATGAGTTTAGATACAGGAACTGATTTGCTAA 
ATAACTAGGTAAATAAAATTAAGCTTCCAACAAGCCACAGAAATGTAATCTTCTTGTAGTACAATACCACTTTCCCCTAGTTCGCCACGTCTAGG 
CCCCACTGTATATGCATCCAATGCAGCAAACTCGCTCTCGTCTAAATAAATGCACCACTCTCTCAGCGCCGCCCACCACCCCACTTCACCACTTC 
CATTCTCATGGCCACGCTCTTTAGCGATTCCCCCCCACAAAAGAACAAACACAAACAATGCCAGCCAAAGAGCCTTGGCACCGACAATTGGCAGC 

AGC G T C AGC AGC AG AAAAAG C AC C AAAT G CT C T C C CC CTT G AAAC AAAAAC AAAGT T T T T AAAGT AAG GAAT AT ACC T AT AC C G AG AT AC C CT GT 
AACCAACCAGGCTTTAAAGCACACATACGAAAAATTCATTTCTATATAAAACTGAGATATTTTTATAAAATCATGTTTTCTTAATGGTGGAACGC 

AAT AT AT T C GAT T T T T C AAC CG AT TT CAT GC T AAAC AAAC CTT GAC TAT TAT TGGC AT AC AT T TC C AT G AGT T CT AT AT AT C C T T T C AC AC TGC A 

GAACATCACATATGTATATATATACCCTTGTACCCTGTGAGAGAGCACAAAACGTGTGGCAGTTGGCTCAGAATCATGGCCAAAACAGAGCTACT 

GGCTATGCCGTCAATGTCATTCTGGTGAACGAATCGAATGCGACTGGGAAACTCTCTCTGTTGTGGCAGAGCACTGAGAACTGAACTGGGAACTG 

GGCCCTCGGTAACCAGCTGCTGCCTTGACCATAATTACACGAAATGTTCATTTGTCCATCTGCAGTAGGGAGTTTGGGTCTGGAAAATGCGTCGG 

CAT TGTTTTTGT C AT AGCT C AGACGAAAGT AAT C TC AT AGC TGC AGC CAT T T TG AT T AAT AC C ACT TAG AC AAC AAT C AAT AAT T T C T G AGC AT T 

GC TAAGG C AAC AAAAAC C C T GC AACC AG AT G AT AT T AAT AAAAGC AAT GC AT AT T T AGAT AAAAAC AT C CT T AGT GC AT T T AAAG AAT T G ATT AA 

TCCGACCATGGTAGCTTAGCTCAAAAGCATTCAATCAATGTTGAGTTTAAATGTTTAAATACATGATTAAATCATAAGGTTTCCTTTGCCATACT 

ACAATTACAATGGATTAATTTCACACACATACACACACACACACACACACACACAGCGCCACACTGAGCTGAAAATGGATGGGAGTTTCCATTGC 

ACTCGAGTGAAAATTCGCCACCCACCCAACCAACACCTAAAAAACCAAAGTAAGCCCCGCCCTGAGCTGCAATTCCTTTCTTTTACCCCAGTTGC 

TGGGAATTCGAAGGCAAGGGTTCGGAACAGCCAAATTGAT AAT AAAAGC AAAGTGATAAGAACTGCGGAAATTATTGAGCTGACTCCTC AAT AAT 

CAT AG AAAT AT T C AC G CT TAT AT AC AAAC AT AC T T T T AAT T GT AT AT AAT AGC GACGT T T AAAC GCT GT T AAT AG AAAT C G AGT GAAAT C GCC AC 

ACTGGTCGTTCATTATCAGCAAACGATGACAATTATTAATATTATTATTTACTCTGCCTCGCGGGAACTTGAATGAGTTTGATTTCCTTACGACC 

AGAGAAACTGGGAAATGAAACACAAAGAAATGAGCTAACGAAAGTGCTGGGATATATATAAAATATTTCAATATTTTGCCGTTCTGTATAATATT 

TTCATTTCCACAAAAGTGCGTTCACTAAATTATAATATATATATAATACTGCAAAGCATTACGAACAAAATTTCTATAAGTTCCTATAGTTAAAA 

CAATAGTTTATATAGCTTTTTGCAGAATAATTTCAGTTCCTAGCCATTTTCCCTAAAACTAAGTATTTAGTTTGCCATTTTCTTTTTAAAATTTT 

ATCGCTTACAAAGTATGCCGGTGTTTTTTTTTTTTAACCGAAATCGTCGTCAGCAAGACTATATAATTTACGATTTCACTTTTTGTGACGTCATT 

GGAGAGTGACCCCCATATGGCTGGCACCTCTTCCTCTTCCCGCTTGAGTGGTGAAGGGGTGAAGGGAGGGGGGGGGGGTGCTTTTCAGTAACGGT 

TGTCTGTATGCAGTACAGCTCTACGTGCTATAACGTAACCGAATCCGATCCACAATCAGCGTGAATTTGTTGGCTAATTGCATGACGGCTTCGTG 

AGT t T TT T AAAT C GAG T T T AAG AC A GC G AAG AAAAAAG AAC C AAG T CAT GGC GAT C AG CT GTT T GG AG GC C GT C GCT AT C AC AAAAT AAAAAG AA 

CCGAATCCAAACTTGGCGCAAAAGAATTCCAAATTAGCAAAGTCCAAACTAAAAGTAACAGCCAATAGGAAAACAAACGATGTACAAATTTAGTG 

TGTTATATGCCTAGTCACGATCGAAATCTATGACGTGGTGATGGTCGATCGACAATATTTGCATTTTTTATTAACAATTCCGAGATAGTGGTATT 

GTGAATTTTGCCTAATATATATATAGTTCCTATTCCTTGCATCAGACTATATACTATACTGACTCAGACTTGGCAACTCAGAAATATTTCACATA 

ATTTCATGTCTTGGAGCTCCCACACGATATGGTGAAATCGAAACTTGGGTTAAAGCCCCCCCAACATAATACAACTGACTCACGGCAAATCAACG 

AGTTGTTGAATAACATTAGCAATACTTACAATCATCTCCAACTGGACCGGCTGAACATTGTGCAGGTGGATCTCTGCCCAGATCTTGCAGTTCCT 

GTTGAACAAAGACATTTTCAAATTAGTCATTGAAATAAAAGTTAAATATAGGTATGTAAGCTCAGTCAATTTCTGATTGTTTTTAACAGTTATGT 

CAAGTCCTACAAAAAGAACTGGTTTTTAAAGAGTATATCTAATTTCTAGTCAAACCTTATATACTACCTACTTTCTGAAAGCATAGGTAACAAAC 

CTAGCAATTGCAATACAGGAAACCCCAAAAAGAATTCTCGAAATGGGAAATGGTCAAAAGTGCATTTCATGGATATCTGCTTATGTTTTCCGGTA 

GTCGGTTCTTTTGTTTTGCTCTATGATCCAGCTATATGTACATACATATGTATGTATGTACCACGGTTTTCCGGGTGTGCGTACATTTTATGCTG 

TGTGGTTGGTATGCAAATGATGTAAATTTTAAATGCAGAATTTAAAAACAGATATGTGGGGATACAAAATAAACATTTTTATGCGAGAGCTTCGT 

TCGAAGCGAGTGCGAGCGAGAGGGAGAGACGTGAAAAATCAAGTTCGTTGGCCCGGACCACAAATGGCGCTGTCTCCTTCTGCCCCACCCCCCGA 

CCAGTCGCTCCATCCCTGTTGCACTCGCACGCACACAGCTGAGCGGCTCCCCTCCCTCTCTCTTACACTCCGACTCTTCTTCACTTATCACAGCT 

GC C C C AC AC AC AC AC AC T C C AAAC GGGG AG AG AGAAT GAGGT GT AAG GT AAAC CG GC AAAA AAAAAG AAT AT AT AT AT AAAAAG AAT AAT G AAG A 

TGAGCTGCGAAAGAGTAGCGCGACTCAGTGGAGAAGTAAGAGGAGGAGCAGCAGGAGGAGGAAGAATGAAGGAGAGTGAGAAGACTGAGCTTCAG 

TGTATGCGTGTGTTTGTAACGGCACCACAAAATCAGAGTGGAGCACAACAAAAATAGCCGTCGCGAAAAACACCGTGTTTCTTTGCCCCATCTCA 

CTTACTCATTTTGGGAATTGTACCTGTTGAGCAAAAATAAATTACAAGAAATTGTTACATTTCTTTTTTCGTTGGGAGTCGGAGCGGTTATGACT 

CATATCAGACGAACAAATGAAGTAAAAACAAACACGCACACTTGTGCACAGCAAGAAAATTCGATAATATTATCAATATTAGACCAGGTGCACTA 

CCAATACAAACTTCTATTTAATTCTTATTCGAATAATTAATATTGTGAACTTTAAGAATTGGTTCAGTAAGACTAAATGCCATATGCATTGGTTC 

TAAAGCCCATTTCGTTCAGTGTACGGTTCAAGTGAGCTCCGATCGGCGTTTAAGCCGTTCTCTCTCGCTCTTTAACTCTCTTTTTTGGCAAATGA 

GGCCATCTAGAGAAAAAAGGCTTTTGAAGAATTGACCACAAAAAGGGGGCGTGAGTGATTTTAAGAGTTTACTACAATGTATACTTTTATTTCCG 

GTTTGCCGCTTGCCATTTCGTTTTCATTTAGGCAGGCATTT^AAATGTTATAATGGATAGAAAACTACGAAAAACGAGGAAAACTCAATGGCCATG 

ACGCCGACGCAGGCAGCGCTGCTTTTGTCGAGACAGTCAGCAGACCCCCGCCCCACACATTCAAACACAAATACACACCACGCTCACACATACGC 

ATAGATATGAACAAATTCCGCATTCGCTGCTCATTTTGTATGTAAATTATGCTTAATTATAATTTATAACTAATTATTTGGGGGCCCCCTTTTGG 

CTGACGGTGTTTTGCAACACTAAGCCCCGCTAGCACTCACACACTTGCGCTCTCTTTGGCTCTGCCCCATTGCGCGAGCGGGACCGAATGAGGCC 

TCAGCTGTCGCGGGCAAAGGGAATGCGCCAGGGCCAGAGCGAGATGGCATGCAATGGCCATAGCCAAGTGATGGCGAGACTTTGTATTTGCGAGC 

GTTTTAATGTGTGTGTGTGTGGGGTATGACTATGATAACCGGCTTCAAACGTAAAAAAAGATCCCAATTTGCCAAGTAACATTTCCACAACTTCG 

AGATATTACTAATGCTTTTGAGGAAGGCTGGGATGACTTTGAAAAAATATCGGGTTGTCAGGGCCTTGAACATTGGGGTGTAAAAAAGCATTTAC 
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TGGACAAGGACAAACGACGCGCAGGCACTTTAAAAATAATTATCGAAGCACGCACGCTAGAACCACTCTACCTAAATACATATATAAATATATAT 
GCTTGGGGGCAGCACTATTTCTGAATTATCAGCGCAAAAAGTAGGGTTAGGTTAATTTTATACGCTTCTTGCTTTTGCTCTGTTTTATTTTTTGC 
TGGATCTTTCTGCGGCGCTGGCAGTGTGTGCGTTTATGTGTGACTGTTGCGAGGCTGAGAGTGTGTGCGAGTGCCGTGTACAGCGTGTATTGGGT 
AACAGCGCCCGTAGAGCTGCGAAAACTCCGCCGTATAGTTGCATTAATGTAAATGCATCCAGGTGCCCGGCACACACGCACACAAACAATGCACA 
TTTGCATTGCACTTTCACGCACACGCATCGCACACACACTTATCCCACACTCTCGCAAAAGTAACGGAAGCGATGACTGCACTACACGTTGCCTT 
GCCCACCACACACACACATCCTTATACCTTTTTCACCACCCTCAAAATTTCTAATCTGCTCCTAATTACTAACTTATTAATTAAATGGTAACTCC 
ACAGCTCATTTTCTCCCTTCCCTCTTGCACGCTTCTTTCCCCCCGCTTCTTCTTTCTTCTCCTCGGTTACTACCTACGTTTCACATTTCTTGCAT 
AAAATTTTTCTCCCGTTGAAAACTGATGTCTGTTCGCTTTTTTTTTAGACTATCATTTTGGTGTATCGCAAAGGGTTCTGTTCTTTCTTTTTTTA 
GTCCGTTTCGCAAAGTTTTTTGAGATATTCCTTTTTTCACATACCTTATTGATTCTTTTTAACGCCATTTTCTGATTGAAATTTGATTTTCTGGG 
TTGTGTGTGTGTTTGGTTGGTTGTGTGTGTGTTTGGTTGGTTGTGTGCTGGTGTTGGTGTCTCGCGTATTTCTTTTGTTCTTCGCACAGAATTGG 
GTTTTATTCAATATATAATCCTCCTGCTTAGAATTGAACGAAAAACGAAACACAAAGCTGCGTTGCGCCTCTAGAACTTTGATGCTTAAATATGG 
CTCTTTCGGTTGAGTGTCCCTTGTCACGAAATAAAGCAGCACCAGCGCACTGCAAGTAAAAACGGCGACCCGACGAAGACTACGGCGACGATTAT 
TTTTTCACAATCAATGCGGGGCCTTTCGACCGTTTGTGCGATAGAGACGACCGTTAGTGAGCTGAGAACGCAACGAAGCACAATCGTTACACCGA 
AGTGTAAGCCCGAGCGAGAGGAATGGTGTGTCTGTGTCGTGTGGTAGTGTTGTGAAGCGGAAGTCCAGCAGAGCAGTCCAGCTATATGACCTTTT 
CGCTAGGGGTGGCCAGATCGATTGGGTGGCAACGCCTTTTAGGTGAGGGATGAATCAGTTAATTACAATTTGATTGCCAATGCAAGACTCTTCAT 
ACTCACTTTAGTTCGGGAACAGCCAGAAGAAATAGTTTTACCGGCAGCGAGGAGGTGTTATATTTTATTCGAACTACCTTCTTATCGATGGTGCG 
TGCGCCCAAAAGCTTGCAACTTGTATGTAAATTTGTTAACACCCAACGTGTTGCAGGCTGCCGGAATGTTTTTACTGAGCAATTTTGAGATGTTG 
GT AAT C C GAGT T C T T G AAAACTT T C CGAC T AGCT ACT GAGT T C T AC AAC T GAAT T G AC AGGG CGT TAT T T T AAAAT T T AAAT T C ACT C C AG AGC G 
CTGGTCTGGCAAGGCCGACGTGCCAATTCAGGATACTTGTCGATATTTTCCGTTGTGCTATCGATAGTTTTTGGGAATAACAGGCGTTAGTAGAT 
TTTTAGTATTTAGATTTTATTATCATAAGGATGAAAATATTTTTAACAAAATAAACTGGTTCATTTAATTTAACTTTTTATTTTCTCACTACTCC 
AAATGGCGCTTCCGCTGTCTGAAACTGCCGTTATAATAGTATGAACAGTTAAAACAATCGACGGTAGCTGAATTTAATCAATGCAGCCCTCTTAC 
C C T AT AC AT CGAT AGT AT C GC AT AAAC AT C GAT T T TGGT T C C AGC T C T AAC T T T AAC T GT CT G C AAAAT AT T T C AAAC AAAAGT GTT T GT T AAT T 
AAGTAATATACTTAATATTAGTGGCTGTTTTGCTTCGACATTGATCGAATTGCGGTTGTGCAAGTGTTTTGTTTGATTTGTTGGCGTGTTTTTCC 

TAGTTTGGCGGCCCTTTGAACTCGTTCGTG 
(SEQ ID NO: 835) 

Exon: 9385.. 8975 
Exon: 6078.. 6015 
Exon: 3013.. 2982 
Exon: 2478.. 2295 
Exon: 2229.. 1001 

Start ATG: 8998 (Reverse strand: CAT) 
Transcript No. : CT22799 

CGTTGCGTTCTCAGCTCACTAACGGTCGTCTCTATCGCACAAACGGTCGAAAGGCCCCGCATTGATTGTGAAAAAATAATCGTCGCCGTAGTCTT 
CGTCGGGTCGCCGTTTTTACTTGCAGTGCGCTGGTGCTGCTTTATTTCGTGACAAGGGACACTCAACCGAAAGAGCCATATTTAAGCATCAAAGT 
TCTAGAGGCGCAACGCAGCTTTGTGTTTCGTTTTTCGTTCAATTCTAAGCAGGAGGATTATATATTGAATAAAACCCAATTCTGTGCGAAGAACA 
AAAGAAATACGCGAGACACCAACACCAGCACACAACCAACCAAACACACACACAACCAACCAAACACACACACAACCCAGAAAATCAAATTTCAA 
TCAGAAAATGGCGTTAAAAAGAATCAATAAGGAACTGCAAGATCTGGGCAGAGATCCACCTGCACAATGTTCAGCCGGTCCAGTTGGAGATGATT 
TATTTCACTGGCAAGCTACAATAATGGGCCCGCCGGACAGCCCTTATCAAGGAGGTGTATTCTTCTTAACTATACATTTTCCAACAGACTATCCC 
TTTAAACCACCCAAAGTGGCTTTTACAACGCGCATATACCATCCAAACATCAACAGCAATGGATCGATTTGTCTCGATATATTAAGATCTCAGTG 
GTCGCCAGCATTAACTATTTCAAAAGTTTTATTATCAATTTGCTCTCTACTCTGTGATCCCAATCCAGACGATCCTCTTGTTCCAGAGATTGCCA 
GAATATATAAAACCGATCGGGAAAAATACAATGAGCTGGCACGAGAGTGGACTAGAAAGTATGCTATGTGATGCGTTCCAGATTGCAGCAGTCCA 
ACAGAATCAGCAAATTCAACATCAACAATAGCAGAAGTAAAAGCAGATGCAACAACAACAGTAAAAGCAACAAAGATAACATCAACAGTTTACAA 
TAGTCAGATTCAGAAGAGGAGGAGCTGCAGCAGCGTCATTGTAAGTGGCAATGGCCGCCGTGCTGTTGTCCACAACAATAGCCACAATATCAGCA 
GGAACTGCGGTCACCACCATAGCAGCAACAAACACAACACCAACACTCGCCACAGCAGCAACATCATCAGCAGCAGCTGCAGCAGCAACAACAGC 
AAGATCATCAACTGCCGCAGTCATTTTCGCCGCAGCAGCCGCTGCTACCGCCTTCGCAGCGGCATTAGCATTAGGAGCAGCAAATTTGGC GGCGG 
TACCAAGAGCAACGGCAGCGGCAAAGGCGTCCTTGGGGCCGC AGC AACATC AGC AGCT GC AGC AAC ATC AGC AGC AACAGCCGCTGC AGC TGTGG 
CAGTGGCAATTGGAGCAGCAACTGGCGGATTGGTGGCGAACATGCGATTAATTGTGTGTTTGACCGAACAGAAACAGAACAGCATAAACGGAAAT 
AATTTAATAATAATTACTGAAAAAATGATGTAAGCATTAAACATTAAAAGCTAGACAGCTACCAACAACAGCATCAACATAATAAGTTTAAATAC 
AAAGAAAAACAAAACTACAAATAAAAATCAGCTGGAAAAATTATTATTATTATTATTATATTATATTTAAAGAACCGGAAAAATAAAAATTGCAA 
AAATTCAATTTAACATCAAGCGTGTAGTAGAGAGGTACTGCTTTAAGACACATTCGCTTTTGATCAATTAAATTACAAATAGGGTCGATTGCATG 
CGTTGTCCAAATTGTTTAACAATTGCTTCATTTTGTATAAAATAATATTTAAATAGATATCTTGCTCTGTCTGAAATATATTATTTTAATTTTTG 
TAATGCCTCAAGAAAGCTTGTTAAAATATTTCTTTCGGTTAACCATTTGTTTTGTAAATTATTTGAGATTGCGTATAAATTAGTAAACGGATGCT 

GCTGCATTTTGTTTCACCTG 
(SEQ ID NO: 836) 

Start ATG: 388 (Reverse strand: CAT) 

MALKRINKELQDLGRDPPAQCSAGPVGDDLFHWQATIMGPPDSPYQGGVFFLTIHFPTDYPFKPPKVAFTTRIYHPNINSNGSICLDILRSQWSP 
ALTISKVLLSICSLLCDPNPDDPLVPEIARIYKTDREKYNELAREWTRKYAM* 
(SEQ ID NO: 837) 

Name: effete 
Classification: enzyme 
Gene Symbol: eff 
FlyBase ID: FBgn0011217 



Celera Sequence No. : 142000013384651 
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CTTTTTCCGGATGGCTTCCAACCTCTGGCTCTCCGTCATCAGCTACCATACGTGGAAAGTCTTGACGTCGCTCAATCGAGTCGACCCTAACTATC 
GGTTCCTGCGGTACAACGCCTTCGTCTGGAGCACAGCCGCAATCATGACGGGAAGTATTTATATAGTTAATCAGATTTGGGAAAACGATCCCAGT 
AAATGGAACTGGTTGCCTCTGGTCGGTTTTATTCGGTGCTCGGTCAAAGGTAAACTTTCAATGCGATTGCTTTTAAATCGTTACCATGGTTACGT 
ATTGCTTTTCCATATCAGATTGGCACCCATCCGTCTGGATCTATATAAGTGGACCGTCGCTGGCCCTGAGCACTTTCAATGTCGCCATGTTCGCC 
CTGACAGCCATTTACATTAGGAAAGTGAAGGGGGGTATAAATAAGTTCACAAATGAGGAGGAGGGAAGGATTAACTGCATAAACTTTGACAGCCA 
GACGTAAGATATGATAAAACCTTGGACCTTGGTAAACTAGTTATCCTTGCCGTATTATCCACATAGTTACCTACAGTTCCTGCGGCTCTCCATCG 
TGATGGGCCTTACTTGGATATTCAATGTCATTCCGTATTCTGCACGGCTCCACATTTTCTGGGAATGGGTCGGGATAATATCCGAGTATTTTCAC 
AGCGCGTTTGGAATTGTTTTGTTCGTTCTGCTCGTCCTGAAGCGCAGCACATGGACTCTAATGATGGATTCTTAAGCTTGTTTCTATATAATTCT 
AATAATTGTTTACAATTAAAGCTTTAAACAATTAAAGCCCCTCATTTTATTAATATCCTTTGCTGAACGAAGGAATTGAATTGAATTTTATTGAA 
CTCAATAAATAAATATAGTTTTAGAAAACGATGTACTTTAAACTGCCAAGAGGTCACCGCGACAAGCTTTTCACTTCGCCACCCTCTGGTAACAC 
TGATTCTCCAGCCACTTTCGATTCTTTAAAAAACATCGATAGGTATCGCCAAAAAAATCAGCTGCTCTGCTTGTAGTTTTGCGATTTCATTGCAA 
TTTTAATTTGCTTTACTGTTCAAAAAGAAATACAGTGCAAAATATGCCGAACGAAAACGCAGAGGACCTCTCGGGTCAAGAGCTAAAGCAGAAGG 
CCAAGGAGGTGGCCGACGCATCGGAGGCAATGCTCGAAAAGGTTGTTGCCGGCTTAAATATCCAGGACACGGCATCGACGAATGCAGCAGGAAAC 
GAGGATGCGGAGCAGCCTGATGGTGCCAAGAATGAGGCTTCAGTGTCTGCGAATGCAAGTAAGTTTTCAATATGCCTGTGAAAAGTCATTTAGAG 
GCTCCTCCTGCGGCTGAAAAAATGAGGTTAGGTTTTGTCGGGAACTACAGTCCAACCTCTCTAACTTGAACTGCTGTGCTATTCATTTCAATTCA 
ATTTTCAATTTTCATTTTCTGTGCAATTACTCCAAAGATTAGCCCTAACGGAAAAATAAACAACTCATAAATCATAAATACATATTTGATTTTGA 

T G AAAT AT G AAT AT AC T T AC AT AGAT AC AAT T G AGTT T T T T AT T AC G AAT AAAAT AT GT GT AAT AT AAGAT T T C TAG CT T GT T GAT T C CAT T GAT 
TGATGCTTCATTGAAACCCCAAAATTGGGCAAATTTGTTGTCATGGGAAACTAGTCCACTGTCGTGATGCCTTAAGCTTTTTTTGGGGCAGATTA 
GTCATTCCAGTTTAAACTAATCGAAAAATGTTATAATACTATTCAACGCATCATCGTTTCTTTCAGGCAAACAGGCCTTGCTACAAGCCGTTTCC 
GAT GCTATGGCCAGCACCCGTC AGAT GGCCAAGAAGTTTGCATTTTGGTCCACACAGCCAGTCACCAAGCTGGACGAGCAGGTGACCACCAACGA 
ATGCATTGAACCGAACAAGGAAATTAGTGAGATTAGAGCATTGCCATACACTCTGCCAGGCGGCTTCAAATGGGTGACACTAGACCTGAACGACG 
CCAATGATCTCAAGGAGCTGTACACGCTGCTCAATGAGAACTATGTGGAGGACGATGATGCCATGTTCCGTTTCGATTACCAACCCGAGTTTCTA 
AAGTGGTCGCTGCAGCCACCTGGCTGGAAACGCGATTGGCATGTTGGCGTTCGTGTGGAGAAGTCCGGCAAACTGGTTGGCTTCATCTCAGCCAT 
TCCCAGCAAGCTGAAGTCGTACGACAAGGTGCTAAAGGTTGTGGACATCAACTTCCTATGCGTCCACAAGAAGTTGCGCAGCAAACGGGTGGCTC 
CAGTCCTGATTCGGGAGATAACGCGTCGCGTCAATCTGACGGGGATTTTCCAGGCGGCCTATACAGCTGGAGTTGTGTTGCCCACGCCAGTGGCC 
ACCTGTCGCTATTGGCATCGATCGCTCAATCCCAAGAAGCTGGTAGACGTGCGTTTCTCTCATCTCGCACGTAACATGACCATGCAGCGGACAAT 
GAAGCTGTACAAGCTACCCGATCAGCCCAAGACGAAGGGCTATCGCCGAATCACAGCGAAGGACATGGACAAGGCCCACAAGCTGCTGGAGGACT 
ACCTAAAGCGCTTTCAGCTGAGCCCGGTGTTCAGCAAGGAAGAGTTTCGTCATTGGTTTACCCCCAAAGAAGGTATTATCGATTGCTTCGTGGTT 
GCCGACGAAAAGGGCAACATCACTGATCTGACTAGCTACTACTGCCTGCCATCGTCCGTGATGCACCATCCGGTGCATAAGACGGTTCGTGCCGC 
ATATTCGTTCTATAATGTGTCTACAAAGACCCCGTGGCTGGACCTGATGAACGATGCACTGATCTCGGCCCGCAATGTCCAGATGGACGTGTACA 
AT GCCCTCGACCTCATGGAGAAC AAGAAGT ACT TTGCACCGCT AAAAT TCGGCGCCGGGGATGGAAACCTGCAGTACTACCTCTACAATTGGCGC 
TGTCCATCGATGCAGCCGGAGGAGATTGCTCTGATACTCATGTAGTCACCTTGTCACCTAAGCTCACATTCTCGCCCTGCCTTTCACAGCTCCCC 
TCCTCCGCTTTCTTCACCCTGCTGGAAGTGCCGTGCGGCTGCTTTGCTTTGCTCCTATTTACGTAGCTATTACCTGTAATCTTCAGTTATTCCAG 
CGAGGAAAGCCCTTTGCGAGCATACACCTAAATAGCATCATTGTGAAAATTGCATTTTGCAACGGCCAAGCTTCGATATTTTTTACTACAAGAAC 
GAACACAAATTGTATAAAATGCGCAGCAATATCAATAAAAATGTAACGTTTACTTGGACCTTTATTGCCTATAATTGACAATAAATTATTTTGCT 
GCTCTTTTAAAGATGACAAGTTTTTATTTTTTTTAAGCAGATTATTGTATTTTTGCCCACTGTCCCAGCATGTGATAGTCTGGCCACTCTGTCAC 
ATGGCCATTGGTCTGGCAAGGTCGGTGAAAAAGCGCGCGCTAAGCAAAACAGCTGCACTTGTGTTGGTTTATTAGTTAACGAAATCTGATTCCAA 
TGTTGGTTTCCTCATAATATTGACAATTAAAGACAGCCGCCGACATGGACGATTCGGTGTTCAAGCTAAAATACCAAAAGTACAAGTTGCGCGTC 
AAGGTGTGGGAGAAGGACTTCAAGAAGAAGAATGGCCGTGTTCCATCGAAGGTGAGATTGGGATGCGGATATGACAAGTATGCTAACTGGTTGAC 
CTGTTGCAGTATGATATAAGGGACGCCAGTCAGGAGATTCGGGACTCATACAAAATGTACTACAAACTGAAGACATCCTTCCTGGAAGAGACGCT 
GAACGATGTTCTCAGCGAAGATGGATACGATATTCTGGAGATGTCGCAGGCCAGTGATTTCGGTGTGAGCATGCTGGACCAGGACGTCAGTCTTA 
ATGAAGGTCCTCAGCTACCGTTGGATATATCTGCACTGGTGGGGCCGCAGAGCTCTGGTAATCTTGAGGAAATTCCACAATCGGTGGAGGGCTCA 
TTCTCAAATCTCATCGACCTGCCCAATCGTCAAGTACTCACCAATCTGGTCAATCGCGATGAAAACCATGTCATTCGCAAATTCGAGGCGGTGGA 
GGAGCTGCCAATCAACC AAAAT GCCTGGGGCTTGAATGTGAGTAAGAAGCCACCAGCTCCACCACAGCCTGTAGAGGCCTCC AAAT CCGCTCCGG 
GGCACGGAAAGCAACCCAAAGCGGGCGCTAGTCTTAAGCCCAGCTTGAGTGCCAAGTTATTCCAATCCTCACGCGGTTTTGCTAAACGAAATCCC 

CGAAAGC 

{SEQ ID NO: 838) 

Exon: 1001.. 1293 
Exon: 1777.. 3282 
Start ATG: 1089 

Transcript No. : CT22825 

AAAAAAATCAGCTGCTCTGCTTGTAGTTTTGCGATTTCATTGCAATTTTAATTTGCTTTACTGTTCAAAAAGAAATACAGTGCAAAATATGCCGA 
ACGAAAACGCAGAGGACCTCTCGGGTCAAGAGCTAAAGCAGAAGGCCAAGGAGGTGGCCGACGCATCGGAGGCAATGCTCGAAAAGGTTGTTGCC 
GGCTTAAATATCCAGGACACGGCATCGACGAATGCAGCAGGAAACGAGGATGCGGAGCAGCCTGATGGTGCCAAGAATGAGGCTTCAGTGTCTGC 
GAATGCAAGCAAACAGGCCTTGCTACAAGCCGTTTCCGATGCTATGGCCAGCACCCGTCAGATGGCCAAGAAGTTTGCATTTTGGTCCACACAGC 
CAGTCACCAAGCTGGACGAGCAGGTGACCACCAACGAATGCATTGAACCGAACAAGGAAATTAGTGAGATTAGAGCATTGCCATACACTCTGCCA 
GGCGGCTTCAAATGGGTGACACTAGACCTGAACGACGCCAATGATCTCAAGGAGCTGTACACGCTGCTCAATGAGAACTATGTGGAGGACGATGA 
TGCCATGTTCCGTTTCGATTACCAACCCGAGTTTCTAAAGTGGTCGCTGCAGCCACCTGGCTGGAAACGCGATTGGCATGTTGGCGTTCGTGTGG 
AGAAGTCCGGCAAACTGGTTGGCTTCATCTCAGCCATTCCCAGCAAGCTGAAGTCGTACGACAAGGTGCTAAAGGTTGTGGACATCAACTTCCTA 
TGCGTCCACAAGAAGTTGCGCAGCAAACGGGTGGCTCCAGTCCTGATTCGGGAGATAACGCGTCGCGTCAATCTGACGGGGATTTTCCAGGCGGC 
CTATACAGCTGGAGTTGTGTTGCCCACGCCAGTGGCCACCTGTCGCTATTGGCATCGATCGCTCAATCCCAAGAAGCTGGTAGACGTGCGTTTCT 
CTCATCTCGCACGTAACATGACCATGCAGCGGACAATGAAGCTGTACAAGCTACCCGATCAGCCCAAGACGAAGGGCTATCGCCGAATCACAGCG 
AAGGACATGGACAAGGCCCACAAGCTGCTGGAGGACTACCTAAAGCGCTTTCAGCTGAGCCCGGTGTTCAGCAAGGAAGAGTTTCGTCATTGGTT 
TACCCCCAAAGAAGGTATTATCGATTGCTTCGTGGTTGCCGACGAAAAGGGCAACATCACTGATCTGACTAGCTACTACTGCCTGCCATCGTCCG 
TGATGCACCATCCGGTGCATAAGACGGTTCGTGCCGCATATTCGTTCTATAATGTGTCTACAAAGACCCCGTGGCTGGACCTGATGAACGATGCA 
CTGATCTCGGCCCGCAATGTCCAGATGGACGTGTACAATGCCCTCGACCTCATGGAGAACAAGAAGTACTTTGCACCGCTAAAATTCGGCGCCGG 
GGATGGAAACCTGCAGTACTACCTCTACAATTGGCGCTGTCCATCGATGCAGCCGGAGGAGATTGCTCTGATACTCATGTAGTCACCTTGTCACC 
TAAGCTCACATTCTCGCCCTGCCTTTCACAGCTCCCCTCCTCCGCTTTCTTCACCCTGCTGGAAGTGCCGTGCGGCTGCTTTGCTTTGCTCCTAT 
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TTACGTAGCTATTACCTGTAATCTTCAGTTATTCCAGCGAGGAAAGCCCTTTGCGAGCATACACCTAAATAGCATCATTGTGAAAATTGCATTTT 
GCAACGGCCAAGCTTCGATATTTTTTACTACAAGAACGAACACAAATTGTATAAAATGCGCAGCAATATCAATAAAAATGTAACGTTTA 

(SEQ ID NO: 839) 
Start ATG: 8 9 

MPNENAEDLSGQELKQKAKEVADASEAMLEKVVAGLNIQDTAS™ 

TQPVTKLDEQVTTNECIEPNKEISEII^LPYTLPGGFKWVTLDLNDANDLKELYTLLNENYVEDDDAMFRFDYQPEFLKWSLQPPGWKRDWHVGV 
RVEKSGKLVGFISAIPSKLKSYDKVLKVVDINFLCVHKKLRSKRVAPVLIREITRRVKLTGIFQAAYTAGVVLPTPVATCRYWHRSLNPKKLVDV 
RFSHLARNMTMQRTMKLYKLPDQPKTKGYRRITAKDMDKAHKLLEDYLKRFQLSPVFSKEEFRHWFTPKEGIIDCFVVADEKGNITDLTSYYCLP 
SSVMHHPVHKTVRAAYSFYNVSTKTPWLDLMNDALISARNVQMDVYNALDLMENKKYFAPLKFGAGDGNLQYYLYNWRCPSMQPEEIALILM* 

(SEQ ID NO: 840) 

Classification: enzyme 
Gene Symbol: Nmt 
FlyBase ID: FBgn0020392 



Celera Sequence No. : 142000013384620 

ACATGCCAACAACGTGGCGGCCATGGTGAAGATGTTCTCTCGCAGAAGCGGCGGATTGTGGGGCATGGTAATAGAGTGCAAATCCACAATACACA 

CGGTGACATCGTCACGAGCATTCTGCAGCTGCACCCATTTGCGCACGGCGCCGAGATAATTACCCAGATGCAGGGAACCCGTCGGCTGGATGCCA 

CTAAAGACTTTCCTCGGCCAGCGGGTGTTATGCTGGGTTGGGGTAATATATTAGAGTCTTGTGTTATGTAAACTATTTCTACGAATGCACTTACT 

TCTTCGTGGCCACTGTTAACGGAATTACCATGGGGGTGTACGTGCTGCTTGTTTCCACCGGTTACCTGGGCCGTCGCAGAGGGCGTTCCAGCCGT 

C C T AC T A GC C AAT C C C GGC AAG AAAC T CCTCGCCT GAAT GC AAC GGC GCTG GGAAT AC AC AGC T GTCTGTCCCTTTTTGGC C AC C T T AGAC AG AC 

T GC GAT GGAG CT T GC C AAAT C GG AAC AT T GC AAT GAGC T AT GAAT TTTGTACCC C AAAAC T T T TG AGT AAAAAAT AT TG GG AAT AAT GCGGTGCT 

AAATGTGTGAACAAAGCCCAAAAATAAATTATTTGTTCGAAACGGCCGAAATTTTGACACTGAGCGTTCGCGAGCGTTGCCAGATGGCTTTTGAA 

AAGCCGGTACGACAGCTGTGAACAGTGATACTCAGCTTGGCGAGTTTTCCGCTAAATCGTGGATTAATATTAATATTCAAATTTAATAAAGAATG 

TTATTTATTATCATTTTTTTTATAACTCTCTATTTAAGGGTATTACCTATTTACATCTTACTTGTGAACGTATGGCTTGAAATTTAATGTAATTG 

T T T GAT GGT AAAT AAAT T GT AGAG AC C AAT T G AAC AAT TT T T T T T T T T T AAC AGG AC T C C TAT TAT T T T T AT GC T T AT AGCT T T AT TAG T GGT AC 

AC C AG AAAAT CT AAC G AC AT T GAAAAAGAAT T AAT C T GC AGAAC T GC AGT T T CGAGT T T T C AAAC AGT GT T GT AT TTATTCTAGCT T AAT AC AC A 

TATTACATTGCAATTAACGCCTGGCGCCATTACCATTATTTATATTTTCCACTTTTCTGTTCCTACATTGAGAGTTACAAATATGCGGAGACTGT 

CGTTTCGGCCGTCAGGATAATTTTTCGAATAGGGGTTATTCAAAGATTTCAAACGCGGTGCAAAATTCTTCACTATCTGGCAGAGATTGGGAAGT 

GCTTAGAAGTTGATGGGTTTGCCAAAGGCAGCTGCCACATTGGCCTCACGCAGGGCCTCGGAACATGTCTGAAAGAAGGAATTGTGCGCATTAGG 

TTCGAAAGGTTC AAAT CAGATTAGGTAATC ACT TACTGGATGCGCATGGCAAACGCGGGCAACGTCCTCGGCAGCGGCT CCGT ACT CCATGGCCA 

GCACAGCCTCGTTAATGAGCTCGCCGGCACCAGGTCCGATAATATGGGTTCCCAGGATCTTATCGGTGGCCTGGTCGGCCAGAACCTTGACGAAA 

CCGTCGGTGTCGTTGTTGGTCTTGGCACGAGAGTTGGCCAGGAAGGGGAACTTGCCCACCTTGTAGGCAACACCCTCCTGCTTCAGTTGCTCCTC 

GCTCTTGCCAACCCACGCGACCTCGGGATGCGTGTAGACTACACTGGGCACGCAGTTGTAGTCGATGTGCACGTGGCCGCCGTTGATGCCCTCAA 

TGGTGATGAGGCCCTCATCTTCAGCCTTGTGCGCCAACATGGGTCCGTGGATGCAGTCGCCGATGGCGTAAATATTGGGCACCACAGTTTGGAAA 

GTGGCGTTGACGGGGATCCTGCCGCGGTCATCCTTAACAATGCCCACGGCCTCCAGACCCAAACCTTCGGTGTAGGGACGACGTCCAACACTGAC 

CAGCAGGGCGTCGCACTGGATCTCCTCTTTCTCGCCAGACTTGGCGTTCTCCACGGACACGGTGACGTTGTCGCCGCTGCGAGAGGCAGCCGTCA 

CCTTGGTGCCCAGCTTGAACTTAAGACCCTGCTTGGTGAGCACCTTTTGGAACGTCTTGGAGACTTCGTTATCGATGCCCACGCCGCCGATCGTA 

TCCATGAACTCAATGGCAGTGACCTCAGCGCCCAGACGCGACCATACCGAACCCAGCTCCAGGCCAATGACACCGGCGCCGATGACAACCAGATG 

CTTGGGCACCTTGGCCAGCTTCAGGGCGCCAGTGCTGCTCACAATAACTTCCTCGTCGATCTGTGAAGGTTTTTGAATGGCATGCATTAGTAATC 

AAGCGCGCTGTAGAGGTTTTGCACTTACTTCAATGCCAGGGAAGGGGGTAACCTCCGAGCCGGTGGCGATGAGGATGTTCTTGGTCTTGACAGTC 

TCCGTGGAGCCGTCGGATTTCTTAACCTCCACCTCGTTGGGATTGACGATGGTGCCGAAGCCCGTCAGCTGTGTGACCTTGTTCTTCTTGAAAAG 

C AT GGC AAT GCCACCGGTCAAAGCCTTCACGGCGTTGGACTTTTGACCCATAAGCTTCTCCAGATCCAGGCT AAC GCTGCCGC AGCT AAT ACC AC 

GTTTCTCCAGATCACCGGAGTGGGCCATGTGGTAGTAGTGGGAGTTGTTCAGCAGCGCCTTTGAGGGAATGCAGCCAACATTCAGGCAGGTGCCG 

CCCAGAGTTGCCTCCTTCTCCACGCTGACCGTCTTCATGCCCATCTGGGCGGCCTTGATGGCGGCGACATAGCCACCGGGTCCGGAGCCAATCAC 

CACGATGTCCGCCTCGTGGGTGCTGGAGTAGCAGCGGGCGTTGAGGGCACCCAGAATGGCGGCATTGGTGCGCAGAGGAGTCTGTAAAGACGGCA 

GATTACTACATGTTGTGGCACCTTATCATTTATCACGCCAGTCGCACGCATTTCGCACTAATCTTCGGCCGACTATCAGCTGATAATTGGCCTAA 

TTAGCGACCTTGAAAACGGGGCCAAACAAATCCCCAAACGCCAGTGAATCACCCACCGGGATCATATGAGCACTGCAGGTGATCCAAGTGCATTT 

TCCAGCCTTTTCCTGCCCATTTCTCGACCCCCTCGGTGAGGTTAGATTGCATAACTTGAGGGGCGATGGCGTTTGGGGAATCGGTTATGTCGCCC 

GATCCTTACCTTGGCTACTGCCGAAACAACATGACGGAGCGTGAATTGCATGGCTGGTGCGGGCGAGTTATGAGGATAATCGTTATTTAACTACA 

AACACAACACTAGTTTGATGTGTGTTTTTCCGACTGTTTTTCGTGTTGTGGGGCTCGGAGCGGTCGCAGGGCTGGCGATGTGCATTCGATAGTGG 

TCATATGATGGCCGCGATGCTATCGATTGTTTTGGTGATTTTCCACTCTGTTGGAATTTGAAATTCTAAAACATTTTCGTATGAAAAAGATGTGT 

ATAAAAATATTAGCTTGATGATCGTTCAAAATACGATATGTCCGTCACTCAATTGCTAGTTTATACAATATTCACTTTATGGGTCATTTGTTGTA 

GCAAAAGCTACCTAAAAACAAGCCAACTATACGTGTAAGCCGGCTTTTATAGTTTTGGCACCCATTAATTAATGTTTTTGTACGTATAAAATAAA 

CTGTAATAAATAAACTGTAAAATAACTGTTTTAAAAATATATAACTCAACGTACATACGATACATATATAATAAGAGTTCCAAAGTAATATTAAC 

TTTATTATATCTATAGGTAATTAAAAATGTATTATACGATGAATATTCGATAGTTTGCCCTACAGTACAACCCTGTGTGGTATTCCCCATCTGGG 

ATATTTCCCACAACACTTTTGGTATTTTCACGTCGCGTTCATCTCGCGTTTTCGTTGAAGCCTCAGTATTTCGCTAGCTGTCGCAATGCGGTCAC 

ACTGGTTCGCGGACACAACAAGATGGCGAGACTTGCCCCCGAAATCTGTTAGAAAACGCCACAAGAGCAAATAATTAGTTCAGTTTTCGACCAGA 

AAAGAGTCAGCCGAAGTGCCGGGCAGACGCGGCGCATTTTGCAGAGTGTGAGCTTGGCCAGCGTTGCCCGAGAGCCGCGACAGGCGACACCTTCT 

GCTGGCCTTGCCTTGTTGCTGTGCCCCTGTGTGCGTGAGCGTGTGCGGTGCGGTGCGATTGTGTGCGTGGGTTGCCAAGAGTGAAGAGAATCGCT 

GAGTTTCCAAGATGATGAGCAACTACGGCAACATGATGGACTCACCGAAGTCATCGCCCCACGGCGGCGGCCGATCACCCGTGGTGGCCCGTCAA 

GACTCATCCGGCACCCTGAAAACGACCATTTCGCTGGGCAAGACGCCAACGATCATTCACACGGGGCCATTTTACTCCATGAAGGAACCACCCGC 

CAAGGCAGAACTTACCGGCGACAAGGATCTCATGACCGAGTACGGACTGCACCACACACTCACCAAGTTCAAGGAGAAGAAGTTCAAGGAGTCGC 

TG 

{SEQ ID NO: 841) 

Exon: 3467.. 3145 
Exon: 2836.. 2309 
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Exon: 2245. .1367 
Ekoii: 1303.. 1001 

Start ATG: 3186 (Reverse strand: CAT) 
Transcript No. : CT22859 

T G AC GG AC AT AT C GT AT T TT GAAC GAT CATC AAG C T AAT AT T T T TAT AC AC AT C T T T T T C AT AC G AAAAT GTT T T AG AAT T T C AAAT T C C AAC AG 

AGTGGAAAATCACCAAAACAATCGATAGCATCGCGGCCATCATATGACCACTATCGAATGCACATCGCCAGCCCTGCGACCGCTCCGAGCCCCAC 

AACACGAAAAACAGTCGGAAAAACACACATCAAACTAGTGTTGTGTTTGTAGTTAAATAACGATTATCCTCATAACTCGCCCGCACCAGCCATGC 

AATTCACGCTCCGTCATGTTGTTTCGGCAGTAGCCAAGACTCCTCTGCGCACCAATGCCGCCATTCTGGGTGCCCTCAACGCCCGCTGCTACTCC 

AGCACCCACGAGGCGGACATCGTGGTGATTGGCTCCGGACCCGGTGGCTATGTCGCCGCCATCAAGGCCGCCCAGATGGGCATGAAGACGGTCAG 

CGTGGAGAAGGAGGCAACTCTGGGCGGCACCTGCCTGAATGTTGGCTGCATTCCCTCAAAGGCGCTGCTGAACAACTCCCACTACTACCACATGG 

CCCACTCCGGTGATCTGGAGAAACGTGGTATTAGCTGCGGCAGCGTTAGCCTGGATCTGGAGAAGCTTATGGGTCAAAAGTCCAACGCCGTGAAG 

GCTTTGACCGGTGGCATTGCCATGCTTTTCAAGAAGAACAAGGTCACACAGCTGACGGGCTTCGGCACCATCGTCAATCCCAACGAGGTGGAGGT 

TAAGAAATCCGACGGCTCCACGGAGACTGTCAAGACCAAGAACATCCTCATCGCCACCGGCTCGGAGGTTACCCCCTTCCCTGGCATTGAAATCG 

ACGAGGAAGTTATTGTGAGCAGCACTGGCGCCCTGAAGCTGGCCAAGGTGCCCAAGCATCTGGTTGTCATCGGCGCCGGTGTCATTGGCCTGGAG 

CTGGGTTCGGTATGGTCGCGTCTGGGCGCTGAGGTCACTGCCATTGAGTTCATGGATACGATCGGCGGCGTGGGCATCGATAACGAAGTCTCCAA 

GACGTTCCAAAAGGTGCTCACCAAGCAGGGTCTTAAGTTCAAGCTGGGCACCAAGGTGACGGCTGCCTCTCGCAGCGGCGACAACGTCACCGTGT 

CCGTGGAGAACGCCAAGTCTGGCGAGAAAGAGGAGATCCAGTGCGACGCCCTGCTGGTCAGTGTTGGACGTCGTCCCTACACCGAAGGTTTGGGT 

CTGGAGGCCGTGGGCATTGTTAAGGATGACCGCGGCAGGATCCCCGTCAACGCCACTTTCCAAACTGTGGTGCCCAATATTTACGCCATCGGCGA 

CTGCATCCACGGACCCATGTTGGCGCACAAGGCTGAAGATGAGGGCCTCATCACCATTGAGGGCATCAACGGCGGCCACGTGCACATCGACTACA 

ACTGCGTGCCCAGTGTAGTCTACACGCATCCCGAGGTCGCGTGGGTTGGCAAGAGCGAGGAGCAACTGAAGCAGGAGGGTGTTGCCTACAAGGTG 

GGCAAGTTCCCCTTCCTGGCCAACTCTCGTGCCAAGACCAACAACGACACCGACGGTTTCGTCAAGGTTCTGGCCGACCAGGCCACCGATAAGAT 

CCTGGGAACCCATATTATCGGACCTGGTGCCGGCGAGCTCATTAACGAGGCTGTGCTGGCCATGGAGTACGGAGCCGCTGCCGAGGACGTTGCCC 

GCGTTTGCCATGCGCATCCAACATGTTCCGAGGCCCTGCGTGAGGCCAATGTGGCAGCTGCCTTTGGCAAACCCATCAACTTCTAAGCACTTCCC 

AATCTCTGCCAGATAGTGAAGAATTTTGCACCGCGTTTGAAATCTTTGAATAACCCCTATTCGAAAAATTATCCTGACGGCCGAAACGACAGTCT 

CCGCATATTTGTAACTCTCAATGTAGGAACAGAAAAGTGGAAAATATAAATAATGGTAATGGCGCCAGGCGTTAATTGCAATGTAATATGTGTAT 

T AAG CT AGAAT AAAT AC AAC AC T GTT T G AAAACT C GAA 

(SEQ ID NO: 842) 

Start ATG: 282 (Reverse strand: CAT) 

MQFTLRHVVSAVAKTPLRTNAAILGALNARCYSSTHEADIVVIGSGPGGYVAAIKAAQMGMKTVSVEKEATLGGTCLNVGCIPSKALLNNSHYYH 
MAHSGDLEKRGISCGSVSLDLEKLMGQKSNAVKALTGGIAMLFKKNKVTQLTGFGTIVNPNEVEVKKSDGSTETVKTKNILIATGSEVTPFPGIE 
IDEEVIVSSTGALKLAKVPKHLVVIGAGVIGLELGSVWSRLGAEVTAIEFMDTIGGVGIDNEVSKTFQKVLTKQGLKFKLGTKVTAASRSGDNVT 
VSVENAKSGEKEEIQCDALLVSVGRRPYTEGLGLEAVGIVKDDRGRIPVNATFQTVVPNIYAIGDCIHGPMLAHKAEDEGLITIEGIKGGHVHID 
YNCVPSVVYTHPEVAWVGKSEEQLKQEGVAYKVGKFPFLANSRAKTNNDTDGFVKVLADQATDKILGTHIIGPGAGELINEAVLAMEYGAAAEDV 
ARVCHAHPTCSEALREANVAAAFGKPINF* 
(SEQ ID NO: 843) 

Name: Didydrolipoamide dehydrogenase, DLD 
Classification: enzyme 

Celera Sequence No. : 142000013384523 

CTCCGGCGATAATGGAGACCGACCCGCACACACACACACACAACACGGCCACACGGTGACGGGCGTAGAGGCACACACCGAGCCAACTCGAATAC 
ACACGCGCGCAGGGGAGAGAGAGCGGGAGAGCGCGGGCGCGCTCTCAATTATTTATTTACTTTTTTCGTAACTGCATTCCATTCCAGCTCAAACA 
CACGCACACATATGGGGGACGAACACAAGCGCACACGATCACTGCTCACAGTTGGTGCTGCTTCTTTTCATTTCTTGTTGTTGCAAGCGTTTGAA 
CTCCTTTACTTATTTTCTTATTTTCGCCTAATTGAAACTAATTAACGACGATAATCCAACCATCCACTTGGTGTTCACAAAAAAATATGTACTTC 
AGTTTAGTGAGTTCGCGCCGTCGCCGATCTCTTCGGTTCCGGAATTCCTTGGGGGAGCAGCAAAAAACACGAGTAGCGCGCTTGATGACCGTCCG 
ATTCGGATTCGGAATTTGTTTTGGTAACATTCGCTGCTAGTGGTGTAGCAGAAAACATCGCATTGCATATCGATACACTCGCGGCGATGTTATTC 
GATGTATTGGTTGATATCGATATTATAGGCTGTATTGACGCGGTTGCAGAGATGCGATATTTAACAATCATGCACGAAAAGTATTTATTGTTATT 
TGGCTTGTGTAGTCCCCAAGCATAGGTGTAATTGTTGAGGGAAGAAAATAGGAGTTGTACGTTCAAAGATCGGCTTGGTAACAGGAACTAGTTCG 
GTTCTTCAAGCGATGTTAATCAAAGCCAGTTCCGATAGATCATGTTAACTCGCTCTGTTAACAATTACTGTTATCTTAATCCTAACATTTCCGTT 
ACATTTTAAAAACAGAAAAAGAAAAAATTTAAAAAAAAAGACTGTAAACTGAGTTACGCATTTCGAAATCAATATGTATCCCAACCATAGATAGA 
AAAATAAATTATGTTTCCGAAACATAACTTCAATAAAACCACAATTTCGCTTGCTTTGTACGTTTGATATTGTATTTGCATTTTATGAATTGTAA 
AAATTGAGTCAACAAGACTAACACAGCTTCAGCTCAAGACACATGCAATTTCATTGACATTGTTTAATTGTATATATTAAAGGAAATCAATTAGG 
ATATAAAGAGAGAAACCTTTTTGCCGCTTATGCGATGCCAACTATCTGGATCTGGATTGTCTAACACGAAACCCTAACTAATTGGTAAATTAAAC 
ATAAATTAATCATATCGGTCAACAACTACAACTAAATATTAGTTATTAGCGGATAAAGAGCGCCCCATAAACTATTTACACTAAGCTTTAGTCTA 
CTAGAGTCCCGGATTCCATCTCCGCTCCTCGGATCTTCGGATCTTAACTCAGATGTTCGGATCTTCAAGTTCTAGGAGAGAACTGGTGCAGTGCA 
TCCGTTATTCAGGCGATACTCGGCATAGACCAGTCCGGTGTTGTGCCAGTGATAGAAGGCGGCCACGATGATCAGGTGCCACCAGTTGTGCGAAT 
GCCCAACGAAGTCCACCTTGCCGGTGAACCAGCGCTCCGGAATCTTGGCCGCATAGAAAACGAAGGCGACGAGGCAGAGCAAGTACATGAGCACG 
ATGCGGGGCACCATCAGCTGCAAATGGTAATGATTGATGGATTAGCAAATGACAAGATTCCATAAATTTCCAACTGATGCCTACCCTGACTAGCT 
CGTTCTCCAGACCGCCCATGGCCACCGCCCAGTGGCCCAGAGGTATGATTCCATAGGCAGACCAGAGTAGCAGCACCGCCACCTTGCCGTTCATG 
GACACATTC7\AACGGGGTATCTGCACGGCAATGGCCAGTGCGAACATGCCCAGTGCAATCGTGGAGTACAGGGTGCGCAGGAAGGTGTGGCACCA 
GAAGGCATAGTACATGCCACTGATATAGATGGCCACCAGCGACAGCGAAATGCCCAGGAAGTCAACGGACAGAAAGAGCTCGTAGTGCTCCTCCG 
ATTTGCAGGAGAAGATGTGGTATATGGCGGACATCAGCATGCACAGACAGAAGCAGACCAGCAGGCAGACCACCAGCACCTGGTCGCTCAGGGAG 
GCGTGGAGGCGCAGGAACTGCAGATCGAAGATGGTCAGTCCAATGAACAGGATGCAACCGGCCAGATGGCTCCAGATGTTGATCTATGAAAAATA 
TAGCAATAATAATGTTATAATAATAGGGAATAGAGTTTAAGGAAAACTATTGTAGCTCCAATTGGAATGGAGTATCATTCTAGTGTGGCAATCTT 
ACCGTCTCATTGGTCCACCAGAAAATGCTCTGCAGGCACATTTTTGTGGAAGGTAGAAAGGTGCGGTATCCGCGACGAATGTACGGGTTGAACTT 
CAGGTGACTGGGGGCCTGTATCGAAAGCACAAGTGAATAAACATTCCAAATAAATACAAAATTTTCCAAGCCGATAAGACATGCAAATCGCATGC 
GTGTAGCATTGAAAATAAAATAAAAACCAAATAGACAAAATGACAAAATAAAATAGAAACAAACGGAATAATTCGAGGATCTGGCACCCCGCCGA 
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ATGTAAAAAAAAAAAACCAAAAAGCTTTCCATTTGCTTCAATTGAAAGAA^ 

GGAATTGATTAAATGCCAAATACTGTGTATATTATTTACTTTCTTTGAATTTAACTAGCTAAGTTTTATATATTATAAAAAATCCATTTAATTAG 

TTAAACAAGTGCTAAGCTAGTTTTGCACTGCATTTATGTATTATTACCTTAATTAATGCCTTTCAATCGGAAACAGTAATTATTTCAAGCCCGTT 

TTAAGCCCAGTTCGAAGTCGAATATTTAACTTGGTCAAAACTAAGCATGGTCCGAAAAACCCAAGAAATTCGATTCGGGAGTTACCTTGAGGATT 

TTCACAAACATATATGTATGTAACTTGCATACCACCCAAAGAAAACAACAGTAGCTTCTTTTTATTTGCCACCGCTATTTAAATATTTAATGCCC 

GCGAAGCACAGAGCTTTTTATTTGTTGCTAGTCGCGACCTTGATTCAGTTGGCATTTGGTGGGTTTGTTCGTTGTCTAATTGTATTATTTGGCAA 

TTGCTGCCAACTCGACCGCCATGGCGGATTTCACATATTTAAGCGTTTTGGCCATTTTACAACGGTCGCTTAATAATGCTTTTCTGGCCATACAT 

ACATACGTATGACGATAACAATTGGCCAAAATCAGTGTTAAAGCCATGTAAAATGCAATGAGTGAATATTTTCTGCGATTTGTCCCTGATTCTGA 

CGTATTTTTTCAATTCGCCAAACAAAATTTGCGATCAATTAGAATTCAGTGCGCGACAATTGGCGACAAATTGACCCTACCAAAGTTCGCGCGTG 

GGGTTAACTCACATCATCGAAATTGCACAGCCATTTGAATTTGGACAGCTTGTCGTCGTAGTTGGGTCCATGCCCCAGAATATTGGGCTCCTCAT 

TGGACGGAAATTGATGCGCTGCGTCCACCGAATCCGCAACCAGTTGGCCATCAAGCTGCGAAAAGAGATTTAAGATGGAATTAGTTAAAATGGAG 

GAAA CC CT AT C G AT T AGCGGC AAT T C G AAA C T T AAT T T GAAT T G AGGAC AAG AAC G AT GC TC AAG AT C T AT AC T CAT AT T C T T C C GC C TT AGAAA 

ACTGACGTTGGCAACTGGCGGGTTTGTTTCTCTGCACATTTGTGTATATTTTTAGCCAATAGAGCAACTGCAGTCGAAAAATTGATTTGTTTGCT 

TTTATTTTATTATTATCTTTTCTGCGGTTTCTGCTATTTTATACGTTGTTGTTGTTGTTGTTGTTTTCGTCTGGGGGTTTTCGGTTTCGTGTGTA 

AACAACCAACCAACCAACCAACCGACTAACCAGCATTGCAAAAATGCAATCAAAACATCGCTACTATTAGTGCCTCCAGCAAACTGCAGCAAAAA 

TTGAGAAAGGGGGCTTGACTCTAATTTGTTTATACAGCTTCTAAATCTAATGCATAACAACAAACAACAACAACAACAACAAGTGTGGGAGGGCA 

AGT GC AAGT GC CT AAGC AAAAGT AT C T T AT C C AAC GT GAT G AC GT T T T GTT T AC AAT T T GC AT TGTTTTTTTT GC AT TC C GC T T T AAAGG AAAAT 

CATGTGCAGTAAGCGTCGTGCCCATAAATGCAGCAGTAAGTATAAAAATATATGCATTTTTCAATACAAATTAAATATAACCGCTCATTAAAACG 

AAATACAGGAACAATTGAATTCCCAATCCTAATTCTTTTGTACAAGCAACAATTTCGTAGTTACTTAAATGCTAATTAAAAATCCATGATGTCAA 

AACTTAGCTACATTTATCAATTTTTAAAATTTACTCATGATCTAATAGTCACTAAAAGGCTAAATAGCTTTCGCTACCTATTTGTAAACAATGTT 

ACTAGTAAGTGCTTTGTGGTTTACTCTGTCGTCATTATTAAATATTTATTTGACACTCAGGAACGTGTGGATTTATAGTAAATTTAGGCAAAAAA 

T AT AT AG AAATT T T T C AAT AAT T T AG AAAC AT AAAAAT AT GT T T AT G AAAT G AAAAT G AAC CGCT AT T T AAC T T AAAGAC AT T C GT T AAG AC AAG 

AAAT G T AC T AT TAT AAT AT T GAT T GT T T T AAT T T T AAT AAT T AAGG AAC T AAAAGT TAT T T G T GT AAAAT GAT T T T G AAC AAAAC T AAAAT AAAA 

TATAGAACAAATAATTTTACATAAAAATTATTATACACGAGCACGTCAACTTGACATTGCGGCTTGTTAAATATTTTCCAGCAAATGCACTTCAC 

GAAAATGTTGTAGATTCCAAAAGATAAACACGGTAATTGTTGGCAAATAGAAAATAAAATGGAATGCCATCGATAAACAGGATGTGAAATTCATT 

AATTGGCCATTGACGTAGTTTAGCCAATTGTTTGTTCTACTGAATTGAAAACGTCAGATTTTTAGAGTTTGGAACGCAGCTTACATCTTTCTTTT 

TACTGCCGCGCATAGTGCATTTAAACTCAGTAGTAGTTGTTGTAGTAACAATTGTAGTTTCCATTTTACACAGGCACTTTTAACTAGTTGTTGAA 

ACGTTTTGAAAGTATTTAGTTTTTAATATTTTATTCGCAGATAACTATTATTTTATTTCAACACACAAAAGCCAGCCAAGTGGACATTCCAAAAC 

AATGTTTTCAAAGTTTTTTTTTTCGCAACTCCTTGGAGAGATTTTCAACTGAGCGCGGAAATTTTAGGGTGAACCGTTGCGATCGCCAACTTGCT 

GGCAACTAAAAAACCCCCCGATATCCAATGTGGCCAATGGTGTGAATCGAAAAAGCTCGCCGAAAAGCTTTCTATAATTTCGCAGCAGCCGAGAG 

ACGACGAATGCACTATCAGTTTTGTTTTATCAAGTCGCAAATGCTGAGCAATCATTTTGCCAGAGGGACGGTTTTCAGATTTTTCAACGGACAGT 

TATGCTGAAATTATGCGAAATTTCGTGGCCAATGAGCTTTCGCTTCTTGTGCAAGCAGCCTGTTTTGGTCAAGTCGGTTTGATTATCGAATACAC 

TGAAAAACATCGGCCTGCATTTTCATCCAAGTAAGCTTATAGCTTCGATCTAACATAAGAAAAATGAATAAATAATTTTAAAATAAATAAAAAAT 

AAAAT G CAT C T AC AC AT AT T T T AC T T T T T T T AGAAGC T AGT AAAAT AC AT TAT TAT AC AT T C G AC AT T T T T T TAT AT G AAT T ATT T GT T AAGT GT 

ATATTTTCCAAAAAAAAAGTTGTTTGTGCCTTTCTCAACACTTTTTTTTGTCGCAGTCAAGCCCCCGTATTATATAACCAATCGAGCTGATTATC 

G ATC C AG C AT T G AG AACC C T T T T GT AAAC AC C AGT T GC AC AAC AT G AAAAGC T G ATG C C AAT T C GC AAT AAT T GT T T AC AC G ATC C C AAT AC AC A 

C GG AT GG GT T T T T AT C GC T GG AC GG G AG AC GT AAAT AAAAG AC G AAAAAC AAAT C T G AAAAT C G AGC C C AAAAAC C AGT T T C AAT T T T C GAC G AA 

GTTCAGGAATTCCGCGTGCGGACTTTCGCAATTCCCATGTTTATACCCTTACATACCTACGCACATACATATGTACATATGTGTAAACATAATGC 

ATAACAAGATATCGGTGTGGCAAGCTCTAATCGGTTACAAATAACGCCGACTCGTCATGACCTTGGCTCCATTCGTGGCGATCTGCTGGGCTTTT 

TGACCTTGTGCTGTGGGGAAAATCGCGCACTCTGACACTTTGTGTAAGGTTATCTCCAAACTGCTGGCTCGTCTTTTCGGATGACAAATCATTTT 

TTTTTTGAACTGATTGCATGTGCTGCGATGCACGTGCCGTTTTCAAGTCGCCGAGAACAGGACCGGATCTGGTATGATCAGTTTCCGGGTCCATT 

CGATGCAT CAT AAC ATTGAT AAAAT TTTCGAAAT ATT TAAAATAGCT CAT AAT TTCCACATTTTAAAGCTTATGGTGGAAAAAAAGT AT ACAAAT 

AAGTACTTATATATTATACTTTTGCTACATTATATTACATATATATTTGCTATTATATATTACACTTTTGATCAAATGAAATATTTATATAGTAT 

GTATGTAAACCCCCGATAAGATTGAACTGCCCCCGTCTGAGCTTAATCCATTCAAAAAAATCTTTGTTGAGGAATGCATTGTAAACAGGAATATT 

CCACTCACCAGGCAGCTGTCGTCACAATCCTGTCCAGATTTTCGCAAGTAGCAGGAGGACGAGTCCAGCGACAGTTCTGGTTCCTCGAGGATCTC 

ACGGGGACGGGGACTCATGGCCACTCGGTAGCCTGATGGGGGTATTCACAGAAATACAGCTCGTTTAGAGATCGAGTGTCCGGGTAAAAATGTAA 

ACATATACATAAGCTGCGATCGCCAGTTGGTGACCGGCTGTTATCAACAACCCCCCCCCCCCCCACCGTCACCGCTACCATTTCGCAGAAGAGTG 

CGAAAATCGAGTCGAGCTGCCTAACAAAAGACAACTTGAGTGGTGCCAAGTGGGGAGCGCAGCCATGTTGCCATGTTGGAGCCAACAAAGACAGC 

GAGCTTCGCTCTCTGCGTAGCCTCTCTTTTCTCACTTGGACTTTCTTCGGAGCTTAATCGATTAGAGGCCGAAGAAGCCTGCTCTACAAGGGTCT 

T G AAAAT AGT AT AT T T T T GT G AT T G AGT T C T T T T T AAT T AAC AC T AT AT AAG AT AT AT T GAC C T T AT C GAT AGT ATT C T T AAG AAA AAG AAAAT C 

AAAAGTTTTATTTTTCAGATAAAAATAGTTTTCAAATAAGATTGCTCTGCAATGAAGGTGGTGCGTTGAGCACATGAAATACATATATGCATATA 

ATGAATACAAAATATATAGTTTGCTTTGTTTCCACGACTGCAGTTTCGAGTTTATTTCTTTTACAGCACAGCACAGCCACACGAATTTCTCTCAC 

AAAACCGTTTGCATAACCAAAATTTCTAAAAGTTTTCCAGGTCACTTGTTTCGCAGGCAGCTTTCGCAGCACGTGTCTCGACGAATGCAGTTGTA 

TTGACATTCCCAAGGGTCCAGATAACCTTCAATTGATTTTGATGTACTTGCTCTAACTAGGCCACACACGGAACAGGGATTGGAAATTTCACTTT 

GAGCTGCAAAGTTCTGGCGATCACCTCCACCTCTCGCAACGCGGCAAAATATCAACTGAATATTTTCAGACCTCGAGTGTGTTATTTGCGTTAAA 

AGAAAGAGTCGCGAATCGCTGCGAAGAGAAGAAAGAGAAAAGAAATACTAAAAGCTCGCTCCGAAGCTTTGGCGACGCGAGCATGATTCTTCTCA 

TATACAGTATATAGCATTTGAATTGATTAAATGCTAATTTAGCATGCTGTGGTCGTTGCCGCCATCGGAGATCTTACACTGAAACTTGAAAAGCG 

GATTCTGAATATCAATTCTTCTTGAAAAAAGGCTTTTATATGTATACATAGATCTATAGCTTCCTAAAAATCATTGCAGCTCATTATCAAACATG 

CTTTAATGCTGATTCGTCTGTATAAATATTTAATTATTGTCTACCAAGTCATTGGAAAATTTTCACCACTATGCTTATTCGCCAACACTCTCGGA 

ATATTTTATTTTTTCCATGGTCTATTTGTATAATTTCTTACCTTAATGCCAAGACCATTTGAATATTTATACCCTGTCCTTTGCTGTTTTGTTCT 

CTTATCAATGCCCTTCGCATTGACCGAGTTTTCAGATTTCCTTGCCTTTGGCATCATTAATCCCTTTCAACATGGCCAAAAGCCATTCAAAACTG 

AATTGTTGAGAGCTGTCACTTGGCATTTTATTGCCATCAGATAGCTGTACTCACAACAAAATTCTACGACAACCCAACCGACAAAGCCCACACGA 

TGATAGTTAATTAAAAAGTTGTTGGCACACTCAGAATATCATGCAAAATTAGCCTGGCTAACTGGCCTTATCATAATTATCAGCAATCCCCAAAC 

AAAACTTTACAACATGATAATTATTAAATAAAAAGCAAATAACCACTAACAGTAGAACCGAATTAACATTTGTGAGCTCAGAAAACAAAAGCAAA 

ATACAGGTGAAACAAAATGCAGCAGCATCCGTTTACTAATTTATACGCAATCTCAAATAATTTACAAAACAAATGGTTAACCGAAAGAAAT 

(SEQ ID NO: 844) 

Exon: 7736. .7648 

Exon: 6872.. 6754 

Exon: 3570.. 3433 

Exon: 2390.. 2283 
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Exon: 2173.. 1700 
Exon: 1632.. 1001 

Start ATG: 6858 (Reverse strand: CAT} 
Transcript No. : CT22875 

CGTTGCGAGAGGTGGAGGTGATCGCCAGAACTTTGCAGCTCAAAGTGAAATTTCCAATCCCTGTTCCGTGTGTGGCCTAGTTAGAGCAAGCTACC 
GAGTGGCCATGAGTCCCCGTCCCCGTGAGATCCTCGAGGAACCAGAACTGTCGCTGGACTCGTCCTCCTGCTACTTGCGAAAATCTGGACAGGAT 
TGTGACGACAGCTGCCTGCTTGATGGCCAACTGGTTGCGGATTCGGTGGACGCAGCGCATCAATTTCCGTCCAATGAGGAGCCCAATATTCTGGG 
GCATGGACCCAACTACGACGACAAGCTGTCCAAATTCAAATGGCTGTGCAATTTCGATGATGCCCCCAGTCACCTGAAGTTCAACCCGTACATTC 
GTCGCGGATACCGCACCTTTCTACCTTCCACAAAAATGTGCCTGCAGAGCATTTTCTGGTGGACCAATGAGACGATCAACATCTGGAGCCATCTG 
GCCGGTTGCATCCTGTTCATTGGACTGACCATCTTCGATCTGCAGTTCCTGCGCCTCCACGCCTCCCTGAGCGACCAGGTGCTGGTGGTCTGCCT 
GCTGGTCTGCTTCTGTCTGTGCATGCTGATGTCCGCCATATACCACATCTTCTCCTGCAAATCGGAGGAGCACTACGAGCTCTTTCTGTCCGTTG 
ACTTCCTGGGCATTTCGCTGTCGCTGGTGGCCATCTATATCAGTGGCATGTACTATGCCTTCTGGTGCCACACCTTCCTGCGCACCCTGTACTCC 
ACGATTGCACTGGGCATGTTCGCACTGGCCATTGCCGTGCAGATACCCCGTTTGAATGTGTCCATGAACGGCAAGGTGGCGGTGCTGCTACTCTG 
GTCTGCCTATGGAATCATACCTCTGGGCCACTGGGCGGTGGCCATGGGCGGTCTGGAGAACGAGCTAGTCAGGCTGATGGTGCCCCGCATCGTGC 
TCATGTACTTGCTCTGCCTCGTCGCCTTCGTTTTCTATGCGGCCAAGATTCCGGAGCGCTGGTTCACCGGCAAGGTGGACTTCGTTGGGCATTCG 
CACAACTGGTGGCACCTGATCATCGTGGCCGCCTTCTATCACTGGCACAACACCGGACTGGTCTATGCCGAGTATCGCCTGAATAACGGATGCAC 
TGCACCAGTTCTCTCCTAGAACTTGAAGATCCGAACATCTGAGTTAAGATCCGAAGATCCGAGGAGCGGAGATGGAATCCGGGACTCTAGTAGAC 
TAAAGCTTAGTGTAAATAGTTTATGGGGCGCTCTTTATCCGCTAATAACTAATATTTAGTTGTAGTTGTTGACCGATATGATTAATTTATGTTTA 
ATTTACCAATTAGTTAGGGTTTCGTGTTAGACAATCCAGATCCAGATAGTTGGCATCGCATAAGCGGCAAAAAGGTTTCTCTCTTTATATCCTAA 
TTGATTTCCTTTAATATATACAATTAAACAATGTCAATGAAATTGCATGTGTCTTGAGCTGAAGCTGTGTTAGTCTTGTTGACTCAATTTTTACA 
ATTCATAAAATGCAAATACAATATCAAACGTACAAAGCAA 
(SEQ ID NO: 845) 

Start ATG: 104 (Reverse strand: CAT) 

MSPRPREILEEPELSLDSSSCYLRKSGQDCDDSCLLDGQLVADSVDAAHQFPSNEEPNILGHGPNYDDKLSKFKWLCNFDDAPSHLKFNPYIRRG 

YRTFLPSTKMCLQSIFWWTNETINIWSHLAGCILFIGLTIFDLQFLRLHASLSDQVLVVCLLVCFCLCMLMSAIYHIFSCKSEEHYELFLSVDFL 

GISLSLVAIYISGMYYAFWCHTFLRTLYSTIALGMFALAIAVQIPPJ.NVSMNGKVAVLLLWSAYGIIPLGHWAVAMGGLENELVRLMVPRIVLMY 

LLCLVAFVFYAAKIPERWFTGKVDFVGHSHNWWHLIIVAAFYHWHNTGLVYAEYRLNNGCTAPVLS* 

(SEQ ID NO: 846) 

Classification: hypothetical 



Celera Sequence No. : 142000013384523 

CTCCGGCGATAATGGAGACCGACCCGCACACACACACACACAACACGGCCACACGGTGACGGGCGTAGAGGCACACACCGAGCCAACTCGAATAC 
ACACGCGCGCAGGGGAGAGAGAGCGGGAGAGCGCGGGCGCGCTCTCAATTATTTATTTACTTTTTTCGTAACTGCATTCCATTCCAGCTCAAACA 
CACGCACACATATGGGGGACGAACACAAGCGCACACGATCACTGCTCACAGTTGGTGCTGCTTCTTTTCATTTCTTGTTGTTGCAAGCGTTTGAA 
CTCCTTTACTTATTTTCTTATTTTCGCCTAATTGAAACTAATTAACGACGATAATCCAACCATCCACTTGGTGTTCACAAAAAAATATGTACTTC 
AGTTTAGTGAGTTCGCGCCGTCGCCGATCTCTTCGGTTCCGGAATTCCTTGGGGGAGCAGCAAAAAACACGAGTAGCGCGCTTGATGACCGTCCG 
ATTCGGATTCGGAATTTGTTTTGGTAACATTCGCTGCTAGTGGTGTAGCAGAAAACATCGCATTGCATATCGATACACTCGCGGCGATGTTATTC 
GATGTATTGGTTGATATCGATATTATAGGCTGTATTGACGCGGTTGCAGAGATGCGATATTTAACAATCATGCACGAAAAGTATTTATTGTTATT 
TGGCTTGTGTAGTCCCCAAGCATAGGTGTAATTGTTGAGGGAAGAAAATAGGAGTTGTACGTTCAAAGATCGGCTTGGTAACAGGAACTAGTTCG 
GTTCTTCAAGCGATGTTAATCAAAGCCAGTTCCGATAGATCATGTTAACTCGCTCTGTTAACAATTACTGTTATCTTAATCCTAACATTTCCGTT 
ACATTTTAAAAACAGAAAAAGAAAAAATTTAAAAAAAAAGACTGTAAACTGAGTTACGCATTTCGAAATCAATATGTATCCCAACCATAGATAGA 
AAAATAAATTATGTTTCCGAAACATAACTTCAATAAAACCACAATTTCGCTTGCTTTGTACGTTTGATATTGTATTTGCATTTTATGAATTGTAA 
AAATTGAGTCAACAAGACTAACACAGCTTCAGCTCAAGACACATGCAATTTCATTGACATTGTTTAATTGTATATATTAAAGGAAATCAATTAGG 
ATATAAAGAGAGAAACCTTTTTGCCGCTTATGCGATGCCAACTATCTGGATCTGGATTGTCTAACACGAAACCCTAACTAATTGGTAAATTAAAC 
ATAAATTAATCATATCGGTCAACAACTACAACTAAATATTAGTTATTAGCGGATAAAGAGCGCCCCATAAACTATTTACACTAAGCTTTAGTCTA 
CTAGAGTCCCGGATTCCATCTCCGCTCCTCGGATCTTCGGATCTTAACTCAGATGTTCGGATCTTCAAGTTCTAGGAGAGAACTGGTGCAGTGCA 
TCCGTTATTCAGGCGATACTCGGCATAGACCAGTCCGGTGTTGTGCCAGTGATAGAAGGCGGCCACGATGATCAGGTGCCACCAGTTGTGCGAAT 
GCCCAACGAAGTCCACCTTGCCGGTGAACCAGCGCTCCGGAATCTTGGCCGCATAGAAAACGAAGGCGACGAGGCAGAGCAAGTACATGAGCACG 
ATGCGGGGCACCATCAGCTGCAAATGGTAATGATTGATGGATTAGCAAATGACAAGATTCCATAAATTTCCAACTGATGCCTACCCTGACTAGCT 
CGTTCTCCAGACCGCCCATGGCCACCGCCCAGTGGCCCAGAGGTATGATTCCATAGGCAGACCAGAGTAGCAGCACCGCCACCTTGCCGTTCATG 
GACACATTCAAACGGGGTATCTGCACGGCAATGGCCAGTGCGAACATGCCCAGTGCAATCGTGGAGTACAGGGTGCGCAGGAAGGTGTGGCACCA 
GAAGGCATAGTACATGCCACTGATATAGATGGCCACCAGCGACAGCGAAATGCCCAGGAAGTCAACGGACAGAAAGAGCTCGTAGTGCTCCTCCG 
ATTTGCAGGAGAAGATGTGGTATATGGCGGACATCAGCATGCACAGACAGAAGCAGACCAGCAGGCAGACCACCAGCACCTGGTCGCTCAGGGAG 
GCGTGGAGGCGCAGGAACTGCAGATCGAAGATGGTCAGTCCAATGAACAGGATGCAACCGGCCAGATGGCTCCAGATGTTGATCTATGAAAAATA 
TAGCAATAATAATGTTATAATAATAGGGAATAGAGTTTAAGGAAAACTATTGTAGCTCCAATTGGAATGGAGTATCATTCTAGTGTGGCAATCTT 
ACCGTCTCATTGGTCCACCAGAAAATGCTCTGCAGGCACATTTTTGTGGAAGGTAGAAAGGTGCGGTATCCGCGACGAATGTACGGGTTGAACTT 
CAGGTGACTGGGGGCCTGTATCGAAAGCACAAGTGAATAAACATTCCAAATAAATACAAAATTTTCCAAGCCGATAAGACATGCAAATCGCATGC 
GTGTAGCATTGAAAATAAAATAAAAACCAAATAGACAAAATGACAAAATAAAATAGAAACAAACGGAATAATTCGAGGATCTGGCACCCCGCCGA 
ATGTAAAAAAAAAAAACCAAAAAGCTTTCCATTTGCTTCAATTGAAAGAAAATAAAACTAATTAATCCAATTTGTGTTGTTAGCAATCTTGTATG 
GGAATTGATTAAATGCCAAATACTGTGTATATTATTTACTTTCTTTGAATTTAACTAGCTAAGTTTTATATATTATAAAAAATCCATTTAATTAG 
TTAAACAAGTGCTAAGCTAGTTTTGCACTGCATTTATGTATTATTACCTTAATTAATGCCTTTCAATCGGAAACAGTAATTATTTCAAGCCCGTT 
TTAAGCCCAGTTCGAAGTCGAATATTTAACTTGGTCAAAACTAAGCATGGTCCGAAAAACCCAAGAAATTCGATTCGGGAGTTACCTTGAGGATT 
TTCACAAACATATATGTATGTAACTTGCATACCACCCAAAGAAAACAACAGTAGCTTCTTTTTATTTGCCACCGCTATTTAAATATTTAATGCCC 
GCGAAGCACAGAGCTTTTTATTTGTTGCTAGTCGCGACCTTGATTCAGTTGGCATTTGGTGGGTTTGTTCGTTGTCTAATTGTATTATTTGGCAA 
TTGCTGCCAACTCGACCGCCATGGCGGATTTCACATATTTAAGCGTTTTGGCCATTTTACAACGGTCGCTTAATAATGCTTTTCTGGCCATACAT 
ACATACGTATGACGATAACAATTGGCCAAAATCAGTGTTAAAGCCATGTAAAATGCAATGAGTGAATATTTTCTGCGATTTGTCCCTGATTCTGA 
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CGTATTTTTTCAATTCGCCAAACAAAATTTGCGATCAATTAGAATTCAGTGCGCGACAATTGGCGACAAATTGACCCTACCAAAGTTCGCGCGTG 

GGGTTAACTCACATCATCGAAATTGCACAGCCATTTGAATTTGGACAGCTTGTCGTCGTAGTTGGGTCCATGCCCCAGAATATTGGGCTCCTCAT 

TGGACGGAAATTGATGCGCTGCGTCCACCGAATCCGCAACCAGTTGGCCATCAAGCTGCGAAAAGAGATTTAAGATGGAATTAGTTAAAATGGAG 

GAAACCCTATCGATTAGCGGCAATTCGAAACTTAATTTGAATTGAGGACAAGAACGATGCTCAAGATCTATACTCATATTCTTCCGCCTTAGAAA 

ACTGACGTTGGCAACTGGCGGGTTTGTTTCTCTGCACATTTGTGTATATTTTTAGCCAATAGAGCAACTGCAGTCGAAAAATTGATTTGTTTGCT 

TTTATTTTATTATTATCTTTTCTGCGGTTTCTGCTATTTTATACGTTGTTGTTGTTGTTGTTGTTTTCGTCTGGGGGTTTTCGGTTTCGTGTGTA 

AACAACCAACCAACCAACCAACCGACTAACCAGCATTGCAAAAATGCAATCAAAACATCGCTACTATTAGTGCCTCCAGCAAACTGCAGCAAAAA 

TTGAGAAAGGGGGCTTGACTCTAATTTGTTTATACAGCTTCTAAATCTAATGCATAACAACAAACAACAACAACAACAACAAGTGTGGGAGGGCA 

AGTGCAAGTGCCTAAGCAAAAGTATCTTATCCAACGTGATGACGTTTTGTTTACAATTTGCATTGTTTTTTTTGCATTCCGCTTTAAAGGAAAAT 

CATGTGCAGTAAGCGTCGTGCCCATAAATGCAGCAGTAAGTATAAAAATATATGCATTTTTCAATACAAATTAAATATAACCGCTCATTAAAACG 

AAATACAGGAACAATTGAATTCCCAATCCTAATTCTTTTGTACAAGCAACAATTTCGTAGTTACTTAAATGCTAATTAAAAATCCATGATGTCAA 

AACTTAGCTACATTTATCAATTTTTAAAATTTACTCATGATCTAATAGTCACTAAAAGGCTAAATAGCTTTCGCTACCTATTTGTAAACAATGTT 

ACTAGTAAGTGCTTTGTGGTTTACTCTGTCGTCATTATTAAATATTTATTTGACACTCAGGAACGTGTGGATTTATAGTAAATTTAGGCAAAAAA 

TAT AT AG AAAT T T T T C AAT AAT T T AG AAAC AT AAAAAT AT GT T T AT G AAAT G AAAAT G AACC GCT AT T T AACT T AAAG AC AT T C GTT AAG AC AAG 

AAATGTACTATTATAATATTGATTGTTTTAATTTTAATAATTAAGGAACTAAAAGTTATTTGTGTAAAATGATTTTGAACAAAACTAAAATAAAA 

TATAGAACAAATAATTTTACATAAAAATTATTATACACGAGCACGTCAACTTGACATTGCGGCTTGTTAAATATTTTCCAGCAAATGCACTTCAC 

GAAAAT GTT GT AG AT T C C AAAAG AT AAAC AC GGT AAT T GT T GGC AAAT AGAAAAT AAAAT GG AAT G CC AT C GAT AAAC AGG AT GT GAAAT T C AT T 

AATTGGCCATTGACGTAGTTTAGCCAATTGTTTGTTCTACTGAATTGAAAACGTCAGATTTTTAGAGTTTGGAACGCAGCTTACATCTTTCTTTT 

TACTGCCGCGCATAGTGCATTTAAACTCAGTAGTAGTTGTTGTAGTAACAATTGTAGTTTCCATTTTACACAGGCACTTTTAACTAGTTGTTGAA 

AC GT T T T G AAAGT AT T T AGT T TT T AAT AT T T TAT T CGC AG AT AAC T AT TAT T T TAT T T C AAC AC AC AAAAGC C AGC C AAG T GG AC AT T CC AAAAC 

AATGTTTTCAAAGTTTTTTTTTTCGCAACTCCTTGGAGAGATTTTCAACTGAGCGCGGAAATTTTAGGGTGAACCGTTGCGATCGCCAACTTGCT 

GGCAACTAAAAAACCCCCCGATATCCAATGTGGCCAATGGTGTGAATCGAAAAAGCTCGCCGAAAAGCTTTCTATAATTTCGCAGCAGCCGAGAG 

ACGACGAATGCACTATCAGTTTTGTTTTATCAAGTCGCAAATGCTGAGCAATCATTTTGCCAGAGGGACGGTTTTCAGATTTTTCAACGGACAGT 

TATGCTGAAATTATGCGAAATTTCGTGGCCAATGAGCTTTCGCTTCTTGTGCAAGCAGCCTGTTTTGGTCAAGTCGGTTTGATTATCGAATACAC 

TGAAAAACATCGGCCTGCATTTTCATCCAAGTAAGCTTATAGCTTCGATCTAACATAAGAAAAATGAATAAATAATTTTAAAATAAATAAAAAAT 

AAAATGCATCTACACATATTTTACTTTTTTTAGAAGCTAGTAAAATACATTATTATACATTCGACATTTTTTTATATGAATTATTTGTTAAGTGT 

ATATTTTCCAAAAAAAAAGTTGTTTGTGCCTTTCTCAACACTTTTTTTTGTCGCAGTCAAGCCCCCGTATTATATAACCAATCGAGCTGATTATC 

GAT C C AGC AT TG AG AAC C CT T T T GTAAAC AC C AGT T GC AC AAC AT G AAAAGC TG AT G CC AAT T C GC AAT AAT T GT T T AC ACG AT C C C AAT AC AC A 

CGGATGGGTTTTTATCGCTGGACGGGAGACGTAAATAAAAGACGAAAAACAAATCTGAAAATCGAGCCCAAAAACCAGTTTCAATTTTCGACGAA 

GTTCAGGAATTCCGCGTGCGGACTTTCGCAATTCCCATGTTTATACCCTTACATACCTACGCACATACATATGTACATATGTGTAAACATAATGC 

ATAACAAGATATCGGTGTGGCAAGCTCTAATCGGTTACAAATAACGCCGACTCGTCATGACCTTGGCTCCATTCGTGGCGATCTGCTGGGCTTTT 

TGACCTTGTGCTGTGGGGAAAATCGCGCACTCTGACACTTTGTGTAAGGTTATCTCCAAACTGCTGGCTCGTCTTTTCGGATGACAAATCATTTT 

TTTTTTGAACTGATTGCATGTGCTGCGATGCACGTGCCGTTTTCAAGTCGCCGAGAACAGGACCGGATCTGGTATGATCAGTTTCCGGGTCCATT 

C GAT GC AT CAT AAC AT T GAT AAAAT T T TC GAAAT AT T T AAAAT AGC T CAT AAT T T CC AC AT T T T AAAG CTT AT GGT GG AAAAAAAGT AT AC AAAT 

AAGTACTTATATATTATACTTTTGCTACATTATATTACATATATATTTGCTATTATATATTACACTTTTGATCAAATGAAATATTTATATAGTAT 

GTATGTAAACCCCCGATAAGATTGAACTGCCCCCGTCTGAGCTTAATCCATTCAAAAAAATCTTTGTTGAGGAATGCATTGTAAACAGGAATATT 

CCACTCACCAGGCAGCTGTCGTCACAATCCTGTCCAGATTTTCGCAAGTAGCAGGAGGACGAGTCCAGCGACAGTTCTGGTTCCTCGAGGATCTC 

ACGGGGACGGGGACTCATGGCCACTCGGTAGCCTGATGGGGGTATTCACAGAAATACAGCTCGTTTAGAGATCGAGTGTCCGGGTSlAAAATGTAA 

ACATATACATAAGCTGCGATCGCCAGTTGGTGACCGGCTGTTATCAACAACCCCCCCCCCCCCCACCGTCACCGCTACCATTTCGCAGAAGAGTG 

CGAAAATCGAGTCGAGCTGCCTAACAAAAGACAACTTGAGTGGTGCCAAGTGGGGAGCGCAGCCATGTTGCCATGTTGGAGCCAACAAAGACAGC 

GAGCTTCGCTCTCTGCGTAGCCTCTCTTTTCTCACTTGGACTTTCTTCGGAGCTTAATCGATTAGAGGCCGAAGAAGCCTGCTCTACAAGGGTCT 

TGAAAATAGTATATTTTTGTGATTGAGTTCTTTTTAATTAACACTATATAAGATATATTGACCTTATCGATAGTATTCTTAAGAAAAAGAAAATC 

AAAAGTTTTATTTTTCAGATAAAAATAGTTTTCAAATAAGATTGCTCTGCAATGAAGGTGGTGCGTTGAGCACATGAAATACATATATGCATATA 

ATGAATAC AAAAT ATATAGTTTGCTTTGTTTCCACGACTGCAGTTTCGAGTTTATTTCTTTTAC AGC ACAGCACAGCCAC AC GAATTTCTCTCAC 

AAAACCGTTTGCATAACCAAAATTTCTAAAAGTTTTCCAGGTCACTTGTTTCGCAGGCAGCTTTCGCAGCACGTGTCTCGACGAATGCAGTTGTA 

TTGACATTCCCAAGGGTCCAGATAACCTTCAATTGATTTTGATGTACTTGCTCTAACTAGGCCACACACGGAACAGGGATTGGAAATTTCACTTT 

GAGCTGCAAAGTTCTGGCGATCACCTCCACCTCTCGCAACGCGGCAAAATATCAACTGAATATTTTCAGACCTCGAGTGTGTTATTTGCGTTAAA 

AG AAAG AGT C GC G AAT C GC T GC G AAG AG AAG AAAG AG AAAAG AAAT ACT AAAAG C T C GC T C CG AAGC T T T GGC GACGC G AGC AT GATTCTTCTCA 

TATACAGTATATAGCATTTGAATTGATTAAATGCTAATTTAGCATGCTGTGGTCGTTGCCGCCATCGGAGATCTTACACTGAAACTTGAAAAGCG 

GATTCTGAATATCAATTCTTCTTG 
(SEQ ID NO: 847) 

Exon: 7004.. 6754 
Exon: 3570.. 3433 
Exon: 2390. .2283 
Exon: 2173. .1700 
Exon: 1632.. 1001 

Start ATG: 6858 (Reverse strand: CAT) 
Transcript No. : CT22885 

ACGGTGGGGGGGGGGGGGGTTGTTGATAACAGCCGGTCACCAACTGGCGATCGCAGCTTATGTATATGTTTACATTTTTACCCGGACACTCGATC 
TCTAAACGAGCTGTATTTCTGTGAATACCCCCATCAGGCTACCGAGTGGCCATGAGTCCCCGTCCCCGTGAGATCCTCGAGGAACCAGAACTGTC 
GCTGGACTCGTCCTCCTGCTACTTGCGAAAATCTGGACAGGATTGTGACGACAGCTGCCTGCTTGATGGCCAACTGGTTGCGGATTCGGTGGACG 
CAGCGCATCAATTTCCGTCCAATGAGGAGCCCAATATTCTGGGGCATGGACCCAACTACGACGACAAGCTGTCCAAATTCAAATGGCTGTGCAAT 
TTCGATGATGCCCCCAGTCACCTGAAGTTCAACCCGTACATTCGTCGCGGATACCGCACCTTTCTACCTTCCACAAAAATGTGCCTGCAGAGCAT 
TTTCTGGTGGACCAATGAGACGATCAACATCTGGAGCCATCTGGCCGGTTGCATCCTGTTCATTGGACTGACCATCTTCGATCTGCAGTTCCTGC 
GCCTCCACGCCTCCCTGAGCGACCAGGTGCTGGTGGTCTGCCTGCTGGTCTGCTTCTGTCTGTGCATGCTGATGTCCGCCATATACCACATCTTC 
TCCTGCAAATCGGAGGAGCACTACGAGCTCTTTCTGTCCGTTGACTTCCTGGGCATTTCGCTGTCGCTGGTGGCCATCTATATCAGTGGCATGTA 
CTATGCCTTCTGGTGCCACACCTTCCTGCGCACCCTGTACTCCACGATTGCACTGGGCATGTTCGCACTGGCCATTGCCGTGCAGATACCCCGTT 
TGAATGTGTCCATGAACGGCAAGGTGGCGGTGCTGCTACTCTGGTCTGCCTATGGAATCATACCTCTGGGCCACTGGGCGGTGGCCATGGGCGGT 
CTGGAGAACGAGCTAGTCAGGCTGATGGTGCCCCGCATCGTGCTCATGTACTTGCTCTGCCTCGTCGCCTTCGTTTTCTATGCGGCCAAGATTCC 



FIGURE SHEET 457 



GGAGCGCTGGTTCACCGGCAAGGTGGACTTCGTTGGGCATTCGCACAACTGGTGGCACCTGATCATCGTGGCCGCCTTCTATCACTGGCACAACA 
CCGGACTGGTCTATGCCGAGTATCGCCTGAATAACGGATGCACTGCACCAGTTCTCTCCTAGAACTTGAAGATCCGAACATCTGAGTTAAGATCC 
GAAGATCCGAGGAGCGGAGATGGAATCCGGGACTCTAGTAGACTAAAGCTTAGTGTAAATAGTTTATGGGGCGCTCTTTATCCGCTAATAACTAA 
TATTTAGTTGTAGTTGTTGACCGATATGATTAATTTATGTTTAATTTACCAATTAGTTAGGGTTTCGTGTTAGACAATCCAGATCCAGATAGTTG 
GCATCGCATAAGCGGCAAAAAGGTTTCTCTCTTTATATCCTAATTGATTTCCTTTAATATATACAATTAAACAATGTCAATGAAATTGCATGTGT 
CTTGAGCTGAAGCTGTGTTAGTCTTGTTGACTCAATTTTTACAATTCATAAAATGCAAATACAATATCAAACGTACAAAGCAA 

(SEQ ID NO: 848) 

Start ATG: 147 {Reverse strand: CAT) 

MSPRPREILEEPELSLDSSSCYLRKSGQDCDDSCLLDGQLVADSVDAAHQFPSNEEPNILGHGPNYDDKLSKFKWLCNFDDAPSHLKFNPYIRRG 
YRTFLPSTKMCLQSIFWWTNETINIWSHLAGCILFIGLTIFDLQFLRLHASLSDQVLVVCLLVCFCLCMLMSAIYHIFSCKSEEHYELFLSVDFL 
G I S L S L V AI Y I S GM Y Y A FWCHT FLRT L Y S T I ALGMF AL A I AVQ I P RLN V SMN GK V A VL L LWS A Y G 1 1 P L GHWA V AMG GL E MEL V RLM V PR I VLM Y 
LLCLVAFVFYAAKIPERWFTGKVDFVGHSHNWWHLIIVAAFYHWHNTGLVYAEYRLNNGCTAPVLS* 
(SEQ ID NO: 849) 

Classification: hypothetical 



Celera Sequence No. : 142000013384703 

GTCGCGCAAAGCCCAGGAGAAGGAGCGTCCCACCAAATCGGAACGCATTGATAGCACCAACCTGCTGCACTTCCGCATTCGTGCCTTGGATTTGT 
T G GAG C T C T T C AT AAC C7AAG AAAC C AAC T C AAT C GGT AAT CCTGGACGTGC T AC AT T GCGT GT TC C AGGT GT AC AGC CAT T GC GC C G C AG AC AGC 
AAACTACAATCCTTGCGAGAAGCCAGTTTGAAGTTGCTCAAAAAGATACTCGCTAGGAATATCGAACTCAAGGA7AAATCAGAGCACCGCACCCAT 
TCTGGAAGCCATTGAGCAGCTGATGTCGTCTGGCGAGGAGCATTCAGAAGAGGACCAAGAGAACGGCAAGCAGCCCGCCAGCCGACAGGCCAAAA 
GAGACATCATTATTTGGCGGGACAGGTGCTTCGCTTACCTGGTCAGCCAGGCATCAGCAGATGGCGAGCCCAAGAAAAGTGCCGTCTGGCCCCTT 
CTAGTCGAATTCCTGGAGCTGTGGGTAGCCAAGCGACGTAGTCGCCTCTCTCTGGCCAGTTTCGAAGCCCTCTTTCAGTCTGGCCAATGGCAGGG 
AGTTGCTCCGCTGGCCGTTGTCCTGGCCTCTCACTTGGACGTACAGAAAACACGCAGTTTTCGGCGGGCACAGATACTGAAGCTGCTCAGCGAGC 
AAGGT C G C C GGC T CG AAGT C GC T T T C AAGG AC AAC AAC T CGTC T T C C AAG AAAT T TG AG AAGC AG AT AG C C AGAT AT GT AAAT C AGC T AG AAAC G 
AAGGCTAGCAGTTCCAAGGAACTCAATCTGCTGCTTAAGATCCTCGCCCAAGGAGGCGAGAAGTGGCAGAAACTGCGTGAACAAATTCAGCTCGT 
CGCCAAGGATCTTCAACCCAACAAGAAGGTGGCGAAGCAGAAGAAGCAGGCAGCCGCCAAACCTATGGTCGTGGAAGATGAAGAGTCCACATAAC 
GTAAAAGAAGGAAACACCCCAATAAGAAACCTTACAATTAGATGTATAAATATCTTGCATAATATAGGATACTATTAATTTTTTTGTTCTTTAAA 
AAATTCTGACGAAAAGTACTAAATTAGTATTGAAGTCATTGAAAAAATAGCAAAAGTACACTAAACTTTATAAAAATGTATTTTTACCTTTATAG 
AGCAATTTGAGTGGACGTACAACTGCGCTAACGTTAAAAACCAAAATCACAGACGAGGTGGAAAGTACACGTGCTGGCGAAATGTGCTTTTAACA 

T T CGGC AC C AT GGAC T C GT C AT T AC TGCCTCGCTTTT TAG AC AC T T C GT T GGT GAC AAT C GT C AGG AT AT C GGC GAT G T GT T C AC T AAAGT AAC A 
AAGAACAACTTATGCATTAATTCGGGGCTAGCTTGCTTTGGCAGCTTTTTACCTGGCGGAACGGCGATCGATGGCATCGCACAGGCTCCCGATAA 
ACCAGCTACCTGTTTGGGTGTGGCGCAGAGCAGCATATCCATTTACCGTGGACATGGCCCGCAGCATGTCGATGTGCTGGTCCGGCGACACGGTG 
GTCACGTCTATCCTAAACTGAAAGAGGAATACGAAAATGTATAGGCTTTCCCATTCGACGACTTCATGCTTATAAATCTCCTTACGAGCTCATTT 
GGCTTCTTTTTATGGACAAGCTTCTCCTGGCAGGCTTGTATGATTAGCAACTTCGGTTTGTAGTACAGGGTGTCGTAGCTGCAGAGCAAGTCCTC 
GATATCCGTGATCTTCATGGCGATGCTGTTGGATGCGTAGACGGCCTCCTCGAAGCCGTGGCTCAGGATGAAGACCACTAGGGAATCGCGCACGA 
GCGATCTATCGCACGCACTGCGAATGCGCTCTATGATGCCCATGTGATCCACGTTGTCGTACGCCTCCACATTGTATCCCATCGAGGAAAACACC 
TCGATTAGTCGTTCTTTATCCACATCCGTGCCATCCCGTCTACGCAGTGGGTCGGGCGACAGAAATTTCTGTGGAGAAGGGCAATGTTGTACTTC 
TTTTTGTTACTCAATATTGCAGAGATGAAACCATATTATCCCTGCTAACATTCCGGTGAAACTTCTGCTGGTTGATAATCAAGGCGATTCCTGCG 
TTCTCCCGGGTCAACTTAAGAGCATCTATTTGTGTGGAACAGTACGACTGCTGGTTATCCGATTCAATCTCTTGCTTGACTGCCATCGCAGCAGT 
GCCAGCCGCATCTGGCTCCGGAGCGTTACTAATGATGGTGTCTTTGAGCAAATTTGCTTGCGCCTGCAGACCGTTGGACTTCAAATGGCCGACCA 
GCAGCTGAACATCGCTGCCTGCGGCATTTATGTCCCCTAACTTTATGCTCCTTCTGGTCAGCCAGTCCAGCAAAAAGATCTCCAGGTACGCGGGA 
TCGTAGAAGCGAAGTGGATCTCCTGCCTGCTGGCTCTCCACCTTTTCGCGAACATCTAGGAGCAAGCGGCCACTCTGCACCAACGACAACTCCTC 
ACACATCCTGTAGAGGCTCTTCAACAGAGGATGGACATGCAGCGTGATCCCAGCGATATGCGGCAAGTAGTGCATTCGCAGCTCCTGCCAGCAGA 
AACCCAGTCTCCGGAGCACCTTGCGGGCCCCAATAATGCACAGGGCCTCCACGAGATGTCTTCTCCAGGTTTCTGGCCGGGACTTGGCGAACTTT 
ATGAGTAGATCACTTTGCGGGAAGTCTGATCGTGTCATGGCCAAAAGTTTCTGCAGAATGTAGGTGGCATCCGAGTGGTCGTCGCCATAAAGCAG 
AAAGCAGAGGCCCACCTAGAAGAAAACGAAGGTTCTACTGTTTAATTAACACTTTGGTCTAGCGGATGTGTAAAACATTTGAGTCATATTCGAAA 
AGTCCTCCCAGGGAGGTGCCATACCTTTTGGGCAAAGTTCATGTCACGTTCCACGTAGATCAGATCGTTCTGATCGATGGTGTCTAGATGAATCA 
ACAGGTTTGATCCGGCCATGGCGATCGGGAATATACAAAAATGTTTCACTTTAGGAAATGGCCGATAAATTGCACTCGCTGACATCCGATAGCCG 
TGGCCTGAAGAGCTTCCACCGATCGATATCGATAAATGGATATGTTTTCTGCAATAGCGAAAGATCGGGGTGCTACCTATTAATTATACTTAAGA 
AACAGATGATGTAGTTTTTTGATATAGAAGTTTAACAACCGGTTCAACAATCGGTTAATATTTAAAAATGTCGCAGCCAGCGAGCCTTTAGCTTT 
AAAAGTAAGCTTGCGTTCACGGTTTTTGGGGAAATACCAAATATACTAGGTAAAAACATCGATTAACTAATATCATCCAACACGACGATATGTTC 
TTTGCGAAAAATATCGATGTACTGGTATTTTTATGTTTCCACACCTCTAGTTCGTTTAAGTCGATCTGTTGAACTGTTAACAACACAACTCCTGA 
ATATTATGTTTTGGACGAAAAGTTTTATTAATTTAATGTTTTTGTATATTGTATTCTGTCCATGAGCGTAACAGAGACAGACAGAGAGAAAGGGA 
CAGATACTGAGATACGGGGCCCTCAGGCAGAGAAAGAGGGCGAAAGCAAGCTACAAGCGGGTGCGAGCGAGAGCGCGCGCTTGCCTCACGCACAT 
ATATATAAATCTGTACATACATATAGTATCAGAAGAAAAAACTAAAGCCGCTATTCGATTTACGAAAACTGGCGAAAAATTCGACGATTTGTCTC 
GTACCTATCTATGTACCTCACAACCACTTCGTCATCAAATACGTATAAGTCAGATGTATGTCAAACATATATATACATATGCATGTATATACACA 
TCTATGTGAGGCCACTGCTATATCCCATACATGTATGCCTCTATATATATGTATATATTTATCCACCTACTGTACCACTTCCACATTATGTAAGC 
AGCGACGCAAACAAAACACTAAAGCAGAAAATCCGAAAAGCAAAATGCGAAATGCAAAACAACGAACCAACAAAATCCCATCTATAAACCGA^ 
C C GAAC AGC AAT G T TAT AAC AAAT AC C GAAAAGT T G T T AAT G TT C C AC AAAAAAG C AT AAAGC AGT C AC GC T AAG AGAG T AAG AT C G AAC AAAT G 
AAATGTTTGTATGGCATAAGTAGTTCCCAGATTTTTGTATGTACTTCAATGCATATGCATGTGTGCGTATGCATGTATGTATGTATGTTTCTCGG 

GGCCATGTCTCCCCACGCCCACGTACGT 
(SEQ ID NO: 850) 

Exon: 3208. .2875 

Exon: 2770. .2027 

Exon: 1968.. 1606 

Exon: 1537.. 1383 
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Exon: 1319.. 1001 

Start ATG: 2964 {Reverse strand: CAT) 



Transcript No. : CT22949 

CGACATTTTTAAATATTAACCGATTGTTGAACCGGTTGTTAAACTTCTATATCAAAAAACTACATCATCTGTTTCTTAAGTATAATTAATAGGTA 
GCACCCCGATCTTTCGCTATTGCAGAAAACATATCCATTTATCGATATCGATCGGTGGAAGCTCTTCAGGCCACGGCTATCGGATGTCAGCGAGT 
GCAATTTATCGGCCATTTCCTAAAGTGAAACATTTTTGTATATTCCCGATCGCCATGGCCGGATCAAACCTGTTGATTCATCTAGACACCATCGA 
TCAGAACGATCTGATCTACGTGGAACGTGACATGAACTTTGCCCAAAAGGTGGGCCTCTGCTTTCTGCTTTATGGCGACGACCACTCGGATGCCA 
CCTACATTCTGCAGAAACTTTTGGCCATGACACGATCAGACTTCCCGCAAAGTGATCTACTCATAAAGTTCGCCAAGTCCCGGCCAGAAACCTGG 
AGAAGACATCTCGTGGAGGCCCTGTGCATTATTGGGGCCCGCAAGGTGCTCCGGAGACTGGGTTTCTGCTGGCAGGAGCTGCGAATGCACTACTT 
GCCGCATATCGCTGGGATCACGCTGCATGTCCATCCTCTGTTGAAGAGCCTCTACAGGATGTGTGAGGAGTTGTCGTTGGTGCAGAGTGGCCGCT 
TGCTCCTAGATGTTCGCGAAAAGGTGGAGAGCCAGCAGGCAGGAGATCCACTTCGCTTCTACGATCCCGCGTACCTGGAGATCTTTTTGCTGGAC 
TGGCTGACCAGAAGGAGCATAAAGTTAGGGGACATAAATGCCGCAGGCAGCGATGTTCAGCTGCTGGTCGGCCATTTGAAGTCCAACGGTCTGCA 
GGCGCAAGCAAATTTGCTCAAAGACACCATCATTAGTAACGCTCCGGAGCCAGATGCGGCTGGCACTGCTGCGATGGCAGTCAAGCAAGAGATTG 
AATCGGATAACCAGCAGTCGTACTGTTCCACACAAATAGATGCTCTTAAGTTGACCCGGGAGAACGCAGGAATCGCCTTGATTATCAACCAGCAG 
AAGTTTCACCGGAATGTTAGCAGGGATAATATGAAATTTCTGTCGCCCGACCCACTGCGTAGACGGGATGGCACGGATGTGGATAAAGAACGACT 
AATCGAGGTGTTTTCCTCGATGGGATACAATGTGGAGGCGTACGACAACGTGGATCACATGGGCATCATAGAGCGCATTCGCAGTGCGTGCGATA 
GAT C GC TCGTGCGC GAT T CC C T AGT G GT C T T CAT C C T G AGC C AC GGCT T CG AGG AGGC C GTC T AC GC AT C C AAC AGC AT C GCC AT GAAGAT C AC G 
GATATCGAGGACTTGCTCTGCAGCTACGACACCCTGTACTACAAACCGAAGTTGCTAATCATACAAGCCTGCCAGGAGAAGCTTGTCCATAAAAA 
GAAGCCAAATGAGCTCTTTAGGATAGACGTGACCACCGTGTCGCCGGACCAGCACATCGACATGCTGCGGGCCATGTCCACGGTAAATGGATATG 
CTGCTCTGCGCCACACCCAAACAGGTAGCTGGTTTATCGGGAGCCTGTGCGATGCCATCGATCGCCGTTCCGCCAGTGAACACATCGCCGATATC 
CTGACGATTGTCACCAACGAAGTGTCTAAAAAGCGAGGCAGTAATGACGAGTCCATGGTGCCGAATGTTAAAAGCACATTTCGCCAGCACGTGTA 
CTTTCCACCTCGTCTGTGATTTTGGTTTTTAACGTTAGCGCAGTTGTACGTCCACTCAAATTGCTCTATAAAGGTAAAAATACATTTTTATAAAG 
TTTAGTGTACTTTTGCTATTTTTTCAATGACTTCAATACTAATTTAGTACTTTTCGTCAGAATTTTTTAAAGAACAAAAAAATTAATAGTATCCT 

AT AT TAT GC AAG AT A 
(SEQ ID NO: 851) 

Start ATG: 245 (Reverse strand: CAT) 

MAGSNLLIHLDTIDQNDLIYVERDl^FAQKVGLCFLLYGDDHSDATYILQKLLAMTRSDFPQSDLLIKFAKSRPETWRRHLVEALCIIGARKVLR 
RLGFCWQELRMHYLPHIAGITLHVHPLLKSLYRMCEELSLVQSGRLLLDVREKVESQQAGDPLRFYDPAYLEIFLLDWLTRRSIKLGDINAAGSD 
VQLLVGHLKSNGLQAQANLLKDTIISNAPEPDAAGTAAMAVKQEIESDNQQSYCSTQIDALKLTRENAGIALIINQQKFHRNVSRDNMKFLSPDP 
LRRRDGTDVDKERLIEVFSSMGYNVEAYDNVDHMGIIERIRSACDRSLVRDSLVVFILSHGFEEAVYASNSIAMKITDIEDLLCSYDTLYYKPKL 
LIIQACQEKLVHKKKPNELFRIDVTTVSPDQHIDMLRAMSTVNGYAALRHTQTGSWFIGSLCDAIDRRSASEHIADILTIVTNEVSKKRGSNDES 

MVPNVKSTFRQHVYFPPRL* 
(SEQ ID NO: 852) 

Name: DreD protein 
Classification: endopeptidase 
Gene Symbol: Dredd 
FlyBase ID: FBgn0020381 



Celera Sequence No. : 142000013384651 

GTGCTCGACATTTCGCTTGCGTCCCGTAAAGAAATTGTCCACCACGATGACCTCGTGCCCCTGGACCATCAGATCGTCGACCAAGTGGGAGCCCA 
CGAAGCCAGCTCCTCCGGTAATTAGTATGCGTTTGCGATTCTTATAGTTCAGGTATTTAACCTTCGGATACTTGCGCGGTGTGCTGGTCTGGAGG 
CTACGAACTTGTTCCTCCAATCTGGCCAAATTCTCCCTTGTGCGCTGGAGTTCGGCCTTTTGTTCTCGAATAAGAGAGCTCTGCTCCTCGTACGC 
CATCTGAAGGCTCCTTTGTAGCGGACTTTCCGTGGGCGGCCATTTGGCCTGAACCTCCTCGACGCCGGGAACGGAAACGGCGACCTTTCCGCTTG 
GGCAGAAGCTGGCCATTCGGTAAAGGTAGACCAGGAGCAGGAGGAGCAGCGAGATGGCTGCAACGATCTTCAGACGCTTTTTGGTGGCAGTCATC 
GTCGGTTATGGGTTGTGTACTGCAGGGTTCCAGTGCTTATCAGCAACTGATATCGCCACATTGCATTCTCGTTCGGCTTTAGAAACTATATAAAC 
AATACAATTGCGTTATCAGGAGGTGCCAGAGTGGCCAGATGGCTCAAACGGAATTTCTAAATGGATTAATAATCTAACGCCAGGGAAAAGCTCCC 
GCCTAATTTTAAAATTTACATCAAAAGTATCGACAGGCGAACATTTTGATTTTGGATTATAAATTATAATGAATTGATAATGAATTAGACATTTT 
TGTTGGTGATCTTCCCCAGGGACGAATATTTTTCGTTTTAACGATAAAAGGTCGGAAATTGCCAAAAAATGAGTTTGAAAACTAAATTTGTTTGA 
GGGCTTTCTTAGGCTGGTCAGCAGGGACTACTTTCCGAATAAAGTGTTTGCCCAAACGGCTATTGAAGGCTATTTAAACTGCCAACGATGAGCAT 
CGATCGCCAGTCGGGCAAAGTTCTTAGTGCCAAGTGTTGTAGTGCTCAGCATGAGGCTTCCTTGGGTGATTTTCTGTACAGTTCTGCTTTTAATT 
TTTACGAACAACTCAAATGCAGATATTCCCGGGTGCAACTACTACGACACGGTTGATATCTCATACATTGAAAGGCAAAACGATTCGTATTTATA 
TGATGACATCGAAATTCCTGCTAGCTTAACTGGATACTACGAGTTCAGGCAGTTTGGGGACGGTTCGATTACGCCGATTGAAAAGCATTTAAGGG 
CCTGTGTTTGCAGCGTGCGGCCCTGTATTCGAATCTGCTGCCCAGCCAAGAACTTTTTGGCCAACGGAAAATGCGATGATGGTCTCAAAGAGGAG 
CTCGCCCGGTTCAAACCCTATATATACTTCACATACATGGACCTACAGGCACGAGTACCACTCACCGATATGGCTATTATCAGAGATGAGTTCTT 
TGATTGTGATGAAATGATTTACATCAGCGACTTTAACTATTTCTTAGAAGAGGTTAGCATTCAAATCTTCAATAAATGTGGACTTATAGTTTGGT 
TCCAGGATGGAAAATTTTGGGTTACCGTTGACCTCTTCATGGAAAAACAGGACTACTGCTTGTACCGGCACAACTTTGATTCGGATTTTCCAAAG 
TCCATGTGGATAATACGACACCGCTGCACAAGCCACATATCTCCTGGATCCTTAGAGAGTAATTCAATTATTATTTATAACAAAAGTCATCAATA 
TCGTTTTCATAGCTATATAGAAAGAAACGAAAATAACTCCTTATTGGTGTGCAAACTGCTGACTGCATTCTGTGTGCCCTTTGCAAGTTTTTTAA 
TTACAATAAGCTGGTTTTTTATTTCTTATATTTTCTTTTTCAGTTCTAATTATAACAATGATATGCTTTGTCCTAACAATCGCAGTATATCTATA 
CATTAAGAAGCTGCGAAATGTTACTGGCAAGTGCATTGTATGCTGTATAGTTTCGAGGTTTATCCAGTGCTTGATCATGATACTAGATCATTTAA 
ATCTATTGAATGGCATTTGCTCTCCAGCTGGTTAGTTCGAATTGGAGATATCTCTGGAAGCTTTAAATGTTATATCGTTGACTTCTTGCAGGTTA 
CAGCTCGCACTTTTTCCGGATGGCTTCCAACCTCTGGCTCTCCGTCATCAGCTACCATACGTGGAAAGTCTTGACGTCGCTCAATCGAGTCGACC 
CTAACTATCGGTTCCTGCGGTACAACGCCTTCGTCTGGAGCACAGCCGCAATCATGACGGGAAGTATTTATATAGTTAATCAGATTTGGGAAAAC 
GATCCCAGTAAATGGAACTGGTTGCCTCTGGTCGGTTTTATTCGGTGCTCGGTCAAAGGTAAACTTTCAATGCGATTGCTTTTAAATCGTTACCA 
TGGTTACGTATTGCTTTTCCATATCAGATTGGCACCCATCCGTCTGGATCTATATAAGTGGACCGTCGCTGGCCCTGAGCACTTTCAATGTCGCC 
ATGTTCGCCCTGACAGCCATTTACATTAGGAAAGTGAAGGGGGGTATAAATAAGTTCACAAATGAGGAGGAGGGAAGGATTAACTGCATAAACTT 
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TGACAGCCAGACGTAAGATATGATAAAACCTTGGACCTTGGTAAACTAGTTATCCTTGCCGTATTATCCACATAGTTACCTACAGTTCCTGCGGC 
TCTCCATCGTGATGGGCCTTACTTGGATATTCAATGTCATTCCGTATTCTGCACGGCTCCACATTTTCTGGGAATGGGTCGGGATAATATCCGAG 
TATTTTCACAGCGCGTTTGGAATTGTTTTGTTCGTTCTGCTCGTCCTGAAGCGCAGCACATGGACTCTAATGATGGATTCTTAAGCTTGTTTCTA 
TATAATTCTAATAATTGTTTACAATTAAAGCTTTAAACAATTAAAGCCCCTCATTTTATTAATATCCTTTGCTGAACGAAGGAATTGAATTGAAT 
TTTATTGAACTCAATAAATAAATATAGTTTTAGAAAACGATGTACTTTAAACTGCCAAGAGGTCACCGCGACAAGCTTTTCACTTCGCCACCCTC 
TGGTAACACTGATTCTCCAGCCACTTTCGATTCTTTAAAAAACATCGATAGGTATCGCCAAAAAAATCAGCTGCTCTGCTTGTAGTTTTGCGATT 
TCATTGCAATTTTAATTTGCTTTACTGTTCAAAAAGAAATACAGTGCAAAATATGCCGAACGAAAACGCAGAGGACCTCTCGGGTCAAGAGCTAA 
AGCAGAAGGCCAAGGAGGTGGCCGACGCATCGGAGGCAATGCTCGAAAAGGTTGTTGCCGGCTTAAATATCCAGGACACGGCATCGACGAATGCA 
GCAGGAAACGAGGATGCGGAGCAGCCTGATGGTGCCAAGAATGAGGCTTCAGTGTCTGCGAATGCAAGTAAGTTTTCAATATGCCTGTGAAAAGT 
CATTTAGAGGCTCCTCCTGCGGCTGAAAAAATGAGGTTAGGTTTTGTCGGGAACTACAGTCCAACCTCTCTAACTTGAACTGCTGTGCTATTCAT 
TTCAATTCAATTTTCAATTTTCATTTTCTGTGCAATTACTCCAAAGATTAGCCCTAACGGAAAAATAAACAACTCATAAATCATAAATACATATT 
TGATTTTGATGAAATATGAATATACTTACATAGATACAATTGAGTTTTTTATTACGAATAAAATATGTGTAATATAAGATTTCTACCTTGTTGAT 
TCCATTGATTGATGCTTCATTGAAACCCCAAAATTGGGCAAATTTGTTGTCATGGGAAACTAGTCCACTGTCGTGATGCCTTAAGCTTTTTTTGG 
GGCAGATTAGTCATTCCAGTTTAAACTAATCGAAAA 
(SEQ ID NO: 853) 

Exon: 1001.. 1673 

Exon: 1849.. 2025 

Exon: 2087.. 2338 

Exon: 2403.. 2836 

Start ATG: 1001 

Transcript No. : CT22963 

ATGAGGCTTCCTTGGGTGATTTTCTGTACAGTTCTGCTTTTAATTTTTACGAACAACTCAAATGCAGATATTCCCGGGTGCAACTACTACGACAC 
GGTTGATATCTCATACATTGAAAGGCAAAACGATTCGTATTTATATGATGACATCGAAATTCCTGCTAGCTTAACTGGATACTACGAGTTCAGGC 
AGTTTGGGGACGGTTCGATTACGCCGATTGAAAAGCATTTAAGGGCCTGTGTTTGCAGCGTGCGGCCCTGTATTCGAATCTGCTGCCCAGCCAAG 
7AACTTTTTGGCCAACGGAAAATGCGATGATGGTCTCAAAGAGGAGCTCGCCCGGTTCAAACCCTATATATACTTCACATACATGGACCTACAGGC 
ACGAGTACCACTCACCGATATGGCTATTATCAGAGATGAGTTCTTTGATTGTGATGAAATGATTTACATCAGCGACTTTAACTATTTCTTAGAAG 
AGGTTAGCATTCAAATCTTCAATAAATGTGGACTTATAGTTTGGTTCCAGGATGGAAAATTTTGGGTTACCGTTGACCTCTTCATGGAAAAACAG 
GACTACTGCTTGTACCGGCACAACTTTGATTCGGATTTTCCAAAGTCCATGTGGATAATACGACACCGCTGCACAAGCCACATATCTCCTGGATC 
CTTAGAGATTCTAATTATAACAATGATATGCTTTGTCCTAACAATCGCAGTATATCTATACATTAAGAAGCTGCGAAATGTTACTGGCAAGTGCA 
TTGTATGCTGTATAGTTTCGAGGTTTATCCAGTGCTTGATCATGATACTAGATCATTTAAATCTATTGAATGGCATTTGCTCTCCAGCTGGTTAC 
AGCTCGCACTTTTTCCGGATGGCTTCCAACCTCTGGCTCTCCGTCATCAGCTACCATACGTGGAAAGTCTTGACGTCGCTCAATCGAGTCGACCC 
TAACTATCGGTTCCTGCGGTACAACGCCTTCGTCTGGAGCACAGCCGCAATCATGACGGGAAGTATTTATATAGTTAATCAGATTTGGGAAAACG 
ATCCCAGT7AAATGGAACTGGTTGCCTCTGGTCGGTTTTATTCGGTGCTCGGTCAAAGATTGGCACCCATCCGTCTGGATCTATATAAGTGGACCG 
TCGCTGGCCCTGAGCACTTTCAATGTCGCCATGTTCGCCCTGACAGCCATTTACATTAGGAAAGTGAAGGGGGGTATAAATAAGTTCACAAATGA 
GGAGGAGGGAAGGATTAACTGCATAAACTTTGACAGCCAGACGTAAGATATGATAAAACCTTGGACCTTGGTAAACTAGTTATCCTTGCCGTATT 
ATCCACATAGTTACCTACAGTTCCTGCGGCTCTCCATCGTGATGGGCCTTACTTGGATATTCAATGTCATTCCGTATTCTGCACGGCTCCACATT 
TTCTGGGAATGGGTCGGGATAATATCCGAGTATTTTCACAGCGCGTTTGG7AATTGTTTTGTTCGTTCTGCTCGTCCTGAAGCGCAGCACATGGAC 
TCTAATGATGGATTCT 
(SEQ ID NO: 854) 

Start ATG: 1 

MRLPWVIFCTVLLLIFTKNSNADIPGCNYYDTVDISYIERQNDSYLYDDIEIPASLTGYYEFRQFGDGSITPIEKHLRACVCSVRPCIRICCPAK 
NFLANGKCDDGLKEELARFKPYIYFTYMDLQARVPLTDMAIIRDEFFDCDEMIYISDFNYFLEEVSIQIFNKCGLIVWFQDGKFWVTVDLFMEKQ 
DYCLYRHNFDSDFPKSMWIIRHRCTSHISPGSLEILIITMICFVLTIAVYLYIKKLRNVTGKCIVCCIVSRFIQCLIMILDHLNLLNGICSPAGY 
SSHFFRmSNLWLSVISYHTWKVLTSLNRVDPNYRFLRYNAFVWSTAAIMTGSIYIVNQIWENDPSKWNWLPLVGFIRCSVKDWHPSVWIYISGP 
SLALSTFNVAMFALTAIYIRKVKGGINKFTNEEEGRINCINFDSQT* 
(SEQ ID NO: 855) 

Name: ruth-like 7 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384804 

GAACTCTCCCTGCGAATTTCTATGGAATTTCACATGCAGCGGCAACTGCAGCAAACAGAAAACAACAAACGTGAGAGGCAAATAAATAAAAACAC 
ATTATGCAGAAAGCAGCACACGTCCCTATCTACAGATGTACAAATGTACATACATACATATGTACATATGTATGTACACACCTACAGCCGTATGC 
GTTATAAACAAATTGCAATCAAAAGCCAAGTTTGGGAAACCAAAAGATACACTTTCGGTCGACTCAAAAATGCCCTAGAGAAACCGAAGTTGCGA 
CACAAAAAAAAAAAATAGATAGATCCCTTGACAAAATGTATCTTTATCTTTGTATCACAGGTTACGCTTCTTCTCGAATTATTTATATATGTATA 
TACATATATACAACACTTGGATATGAATTCATAAAGTCAAATCAAATCATGCTAATTTGTTAATGCGACAAAAGTTCAGTCGCAATATGCGATTG 
ATTCGTGACCAGAGAATATATGAATCGATTCAGTAACATAAGCATATTAGTATCATTCCCAAATCGACTCATAGTCATGACCTCCCAGTGTGACT 
CCCAGGGTACTTCAGCACAACTTTTGGCTCCCCAAACGGAAGGTTGAGTAACTTTTTTAGCCCTCGGAGAAGGGGTTCACCGACTACAGATCCAG 
CCCAAAACAATCGCATGAGCCGTGGATACGGAGATACCTAGATAACTAAGCTACAGATACATTTGCTGACTTTAATTGATGTGACAAGTAACAAG 
TGGCGAAACAATGGCCAACCCGAAGACGAAGAGACTGAGAGAACGTATTGTACATGGCAAATTGCAAGTGACAGTGGGGTACCCACTCGAAAAAC 
CAACTAAAGTCATTTGCTTGCTTAAAATAATATGAACAAAGAAATCAATCTTTCTCATTCTCTTAATTAAGCTTCATGAAATATAAGTCACGACT 
TTAAGCTTGATTCAACGTTGATACGAAATGACTTTCCCACTATTTATAGATTATTTAAAGATCGTTGGCCGGAAATTCC7AAAATCGGAATTTGGA 
TGTATGAATAATGCCTAGACTAATATATATGTTTTTCAATTCATCTTTCAAGACATTTCCCCACCCACTCACCATGTCCTCCTCGTCCTCGCAGC 
AGTTTACATCCTGCAGCGTTTCCCGCCACGCCAGCACGGAGTCCGTGAGCAGCAGCCTCGCATTGGAGCGAACGCGCAGTTTGCGTCCCGTCCTC 
GGAGCCACGCCCCTCTTACCGGCCACCGCCCCCGCGGCGAGCACGGCCCGCTCCCACTCCTCCTTCAAGGGCTTCGGAATGGCCGGCGTGGGGTT 
GTGCTGGTGGTTATACGAATCCTCCGCCTCGCAACTGCAGCCACTATCATCGCTGCGTGGTGCCAGACTCGTGGTGTTTGAGGTGGCCAGTTGGG 
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CGGGTGGCGGTGGTGCTGGTCGGCGGATCCGGGTCATGAACCGCTGGTGGACCAGCGAAAGATGGTAGTCGTAGAATATAGCGCTGTACTTCTTC 

GGATCTTTTTGGCTGTAGAAAATGCAAATGTTCCTTAGGATCGGATGCACCACGACTGATGGCTGCTGATCCGCTGGCAGATCCTTTGGCGTATC 

CTTTGGCTGCCCGGTGACGCTGCTGCCAGTGCTGCTGATGGAGCCCAGGGTCGCTGTGCTGTTCGTCACCGTACTGCCCAGGCTTAGCGTTTCAT 

CCGCCTCCTCCTTACTCTCCCGCGAGTACATGAACTCCATATCCGTCTCCCACGCTCCATGGTCCATGGAGGCGGTGGGCGAGGGCAGGGGCGTC 

GGCATCATCATCTTGGAACGCTTGGTCAGCGTCGATGTGGACAACTTATGGCGCACTCCGGCGGCAAAGACCGCCGCCGCTTGGAGAGCCATCAC 

ATCCGGATCAGCATCGGATGGCAGCGACTCGTCGTAGTCCCCCAGATCCTCCTCCTCATCATCCACGTCGTCGTCTTCCTCGTCTTCGTCCTCCT 

CATCTTCGCCCTCGACGGGAAGGGCTTCGATTATCTCCTGATCTACGGGTTCGATTGTCTTGAACTGCCAAATGGGTTGGTAAATGGACTCCTCC 

TCCTGCTCCTCTGGGCTGGAATTCTCCGCGGCAAAGGAGTTGCCTCTCAGGTTATCCTTTTTCCCTCCGTTCGGCTTCGTCTGCTCCTCTTCAAT 

CTGGCGGGGATTGCTCAAAGATGTGTTGGGATTTCGAACGGGTACCTTTGGCGCCGGTCGTCGAACTTTAATAACGCAATTCTAAAAGTTGAGGT 

GGAAACATACTATATTGTCAAAACAAAAAATAATATCCTGTTTCGACTGTTGTGTTTTACTCGCTGCAATAATTGACGAACGGTGGGATATCGTT 

TTGATTTCATTTCGTTTCAATTATAATAAAAACATTTAAGGAAACCCCATCTTTGAATAAAGTTACGAATTTAGGGTTACTTCTCACAAACTGAT 

AAACATAGTTAATTAGTTAGGTAATTACGTAAAATAACAAAAGTGGAGGCGGATTTCCCGATCAGCTTTTAATGGAAATTATCATTAATATGCCG 

CCTGCGAAAACAATGAATTACTCCATTAATAACTTCTTTGCTAGGAATCACAATATTGAACTTTTTTGTGCAGTTGTGATGGCAACTTTACTGCT 

TTTTACGCAAATTTTGCAATCAGTAAAAAAAAAATATTCAAAAAAATAGTGTGAAAAATGACGTTAATTAAACGCTGATTATTTTATAGTTATCT 

TGTGCTTTCATTGCAAACTTGTTTGTTTTACTTCACATTTTTTTCATTGTTCAATGGTTGTTTTTAATTAGATAAGTTTGTTTATTTATTTATTT 

TTTTTTTTCGTTTCGCTGGCCAATCGAATGTTGGCATCCCTTTCGCACGTGAACCGTTAGTCAATTTTGAACTGCAGTGAAACGTAACAGAATCA 

AGTACACAGAGAGAAAAGTATAATTGCATCACAATATTAACGTATTGCCCTCAAACTGTTTTTCATATATAAGAACAGAAAAAAAAGTAGGGATC 

AATTTTAATCGTTCCTGATTGATGTTGAACTGATTTGACGGATCGCAATTGGGAATTATAAATTTTATATTTATTTACATGCACTATTTGTATTA 

CTTTGATTTACGGAAATCAAACCAATTATAGTTAATAAAAAAATTGGTAAAATTCAACCGTGTCAATTGAATTTTTAAATAATATATTTTTTTTT 

TGTGGCGCTCGTTGATACCGCCCAAACTTTTTCTCTCTGTGCATTTTAATTTTGCCTTTGATTGATGCGGCTTTTGTTTTTGCTTTTTATTAATG 

AGTGTTGACCTGTGTGATTGGCTGCTGTTGTTGTTGTTGCTGTTGTTCATTGACATTGCAATTGGCGTTGCCGTCGTTATTCTTGTTGTTGTTGA 

TACCGCTTTCATTGTCAACGGTGTGATGGTTGTTGTTGTTATTGCTGATGTTGTCATTGTTGTTGTTGCTGCTGCAGTTGATGGTGTTGTTGTTG 

CTGTTTGTTGAGGGGCGCTCTGCCTCGCCATTTGGCTTAGGCGCTGCCATCAAAGCTCCGCTGTTGTTGTTACTGTTATTATTATTGTTTTTCTT 

GCTGTTGTTACTGCTAGTGGAGCTCTCGCCCCCGCACACACCCTCTTCCCTGACTATCGCTGTCCCTCTTCGCCTCGTACTTCGATTCGCGCGGC 

GTCGCGACGCAGCAGCAGCACACACGTCTACGCTCTCTGAAATCGCTGGCGAAACTGAAACCGAAGCCGAGGCCACCGAACGATCGTTGCCGCCG 

GCGAAGGAGAAGGCGAAGAGAGGCAGGCGCCGTCGCCAATCCCCCACAACGCCAACAACTGAGTCACGCGCTCTCCTTCGCCGATAACCGTTGAT 

CGGTGGTTGTTGTCGCTGTTGTTGTTGCTGGTGCCACAGCCACAAGCTGAACGATTTGTCATAGGCGGTGAGAAAGCCACGCACACTCACGTGCA 

AGCTGGCCGAAAGAGCGAGACTCTGCATCAAGCGCAAGAGAGATACAGAGACTTGAGGCTGAATTGCGATTGCGACGACTGCGACGTGCTCTTCG 

ATTGCGGCCACCTGTTGATGTTGCACTGCCTCTTTCTCTTCGCCCAGTGCGCTCAGATAGTCATTCACAAAGTTGGCCGTGATGGCGCAGCGCCG 

TTGCTCTCTTTCCGGCTCCACGTTGTGCGCTCTCTTCTGCTGCAGCTTTCGTTTAAAGTCCTTGACCAACTTCGCCTGCTCGTCCGCACACTTTG 

CACTCTCCTCCGTGCTTCTCCGCAGTGTGACCGTGCTCTCCTCGGCGGAGGATGAAGGATCGCGCGCTGGTACGCTCTTTGGCTGCTGCCACGGC 

CCCTCGTCGTAGTGGTGCACCTGCCGCCTCAGGGAGGTCATCCACTGGCAGATGCTCTCGTTGAGATACAATGATGAGGATCCTGATGTGCTTTC 

GTCCTGCAGCAGGCGAATGCGAGAGACGCGCGGCCTCTTGCGCTCTCTCTTGTGGCTCTGGCTCTCTTCGGGCTCTCTTACGCGGCCGTAGATGT 

GCTGATCGCTTTGGCTGCCCTTCAGTTGCTGCTGCTCTAGCTTGAGTTCGCAAATGCGCTGGAGGTCAAAGGTCTCCTGGGTGCGGAACACCTCG 

TCCACGTTGTGGATGATCTCCAGGGAGCCAGCGCTGCTACTGCCAGTCGTGGGTGTCCTCGGGAATATCCTGCGCTGCAGCGACGACGAACGGTT 

CAGCTGCTTAAGAGAGCCAAAGAGACGGGTCAAGGTCGCCGCACTGGACTTGGAGTAGCCCAAGAGCGACCGCGACTTGACCTTGGTGGGTGGTG 

TGGGCGGTGTGGTTGGTGTGGACGGTGCGGGCGCTTTGGCTGGCGTCTCTGACCTTTGACACCGGCACCGCTCGTTGTCGCTATAAATGCCGCGG 

CACAAATCGCAGATGTTCTCATAGATGTTGCTGCTCTCGCCGGAAATCGTCGAAAGGGGCGGTGGCGGTGGGGGAGGCGGTGTACTCACCACGCC 

CCCGCTCACGATGCACTGGCAGAAATGGCAGTAGCGTCCTTGGGCGGCAAACACGCCGTAGCCGCAACCCCGACAAAGATTCTCGTAGATCGCCT 

CGCGCGGAAAATACTTGCGCAGCTCCGCATTCAGGTGACCTTTGGGGGTGGAGGTCAGGCCCGGCAGATTCTCGTAGATGTGCGCCTCCATGCCG 

ACGTTATCGTTAATGTCCTTCGCCTGCTGTGGCATCTCTCTTCTTTATTTCTTCTTCTGCCTCCCTACTTTCTCTTATCGCCGCTGCGTCACATC 

GTAATTCATTATTAAGTGAGAAAATAATCGCCGAGCGGGGCAATTGGAGAGATAGTAAGTGAGAGCACTGTCATCGAGAGAATTTCGTGCACCGC 

AAAAGAAATGTCGCGAAGGTATTTAATTAACATAAATTCGATTTGGAAATGCTCGAATGGCTCTTCTTAAATTTCTCACTCGAGTGAATTGAATA 

ACTTGTAATCTTCGATTACTCACTATTATTTTTTGAGAAACACCCCAAAAACGACCTTAGTTCTCGCAGTGTAATCACTGAGAAAAGTCAATGGG 

CTTGAACTCTTTAAGTGATGCCACTACCCTCATTTTCTTTCTTTCTGTCTCTTTCCCTCCCGCTTCCGACTGCAATCAGCGATATTCATTATCTC 

AGCCAAAGTCGCTGCACTCTGCACTCAGTTTTCCTCCAAACCATCGGCTTTGCCTTCAAAACACAATTCTTTGGCTTGACTTTTTTTAAATCTGG 

GAATTTAAAGGATGAATTTTTCGAATCTAATTGCAAACAGCTTTTAATAATATGACCCTGTTATCGTGAATTTCTTTACCATTTCTTCTAATATT 

TACATATATTAGCTATTTTGATAAAATATTATTGGAACCCATTGCTGTATTCAATATATGTGTAACTAGACCCAATATCTCTGTATATTTCACGC 

CCAAAAAGATCCAGAGCACTTATCACTAATCAGTTTAGAGCTCTAAAATGAATTGTTTGTTTGCCCTAAAACATTTGATGCTTAAATGCAAATAA 

TGTTTAAAGTAAAAAAAACCCCAATGCATTTCATCTGACTAGAATTCCACAGTGCTGCTTCTGCTTTTGAGCTTTGTTTTTCGTTGATATACGGA 

GATGAGTCACTAACCGATCCACCAAACCATGTCGGTCCTCTCGTAACCCCCGTCCTTATCCCTCGGACTCAAAATGATGGCCATG 

(SEQ ID NO: 856) 

Exon: 5260. .3335 
Exon: 2266. .1001 

Start ATG: 5260 (Reverse strand: CAT) 
Transcript No. : CT23013 

ATGCCACAGCAGGCGAAGGACATTAACGATAACGTCGGCATGGAGGCGCACATCTACGAGAATCTGCCGGGCCTGACCTCCACCCCCAAAGGTCA 
CCTGAATGCGGAGCTGCGCAAGTATTTTCCGCGCGAGGCGATCTACGAGAATCTTTGTCGGGGTTGCGGCTACGGCGTGTTTGCCGCCCAAGGAC 
GCTACTGCCATTTCTGCCAGTGCATCGTGAGCGGGGGCGTGGTGAGTACACCGCCTCCCCCACCGCCACCGCCCCTTTCGACGATTTCCGGCGAG 
AGCAGCAACATCTATGAGAACATCTGCGATTTGTGCCGCGGCATTTATAGCGACAACGAGCGGTGCCGGTGTCAAAGGTCAGAGACGCCAGCCAA 
AGCGCCCGCACCGTCCACACCAACCACACCGCCCACACCACCCACCAAGGTCAAGTCGCGGTCGCTCTTGGGCTACTCCAAGTCCAGTGCGGCGA 
CCTTGACCCGTCTCTTTGGCTCTCTTAAGCAGCTGAACCGTTCGTCGTCGCTGCAGCGCAGGATATTCCCGAGGACACCCACGACTGGCAGTAGC 
AGCGCTGGCTCCCTGGAGATCATCCACAACGTGGACGAGGTGTTCCGCACCCAGGAGACCTTTGACCTCCAGCGCATTTGCGAACTCAAGCTAGA 
GCAGCAGCAACTGAAGGGCAGCCAAAGCGATCAGCACATCTACGGCCGCGTAAGAGAGCCCGAAGAGAGCCAGAGCCACAAGAGAGAGCGCAAGA 
GGCCGCGCGTCTCTCGCATTCGCCTGCTGCAGGACGAAAGCACATCAGGATCCTCATCATTGTATCTCAACGAGAGCATCTGCCAGTGGATGACC 
TCCCTGAGGCGGCAGGTGCACCACTACGACGAGGGGCCGTGGCAGCAGCCAAAGAGCGTACCAGCGCGCGATCCTTCATCCTCCGCCGAGGAGAG 
CACGGTCACACTGCGGAGAAGCACGGAGGAGAGTGCAAAGTGTGCGGACGAGCAGGCGAAGTTGGTCAAGGACTTTAAACGAAAGCTGCAGCAGA 
AGAGAGCGCACAACGTGGAGCCGGAAAGAGAGCAACGGCGCTGCGCCATCACGGCCAACTTTGTGAATGACTATCTGAGCGCACTGGGCGAAGAG 
AAAGAGGCAGTGCAACATCAACAGGTGGCCGCAATCGAAGAGCACGTCGCAGTCGTCGCAATCGCAATTCAGCCTCAAGTCTCTGTATCTCTCTT 
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GCGCTTGATGCAGAGTCTCGCTCTTTCGGCCAGCTTGCACGTGAGTGTGCGTGGCTTTCTCACCGCCTATGACAAATCGTTCAGCTTGTGGCTGT 
GGCACCAGCAACAACAACAGCGACAACAACCACCGATCAACGGTTATCGGCGAAGGAGAGCGCGTGACTCAGTTGTTGGCGTTGTGGGGGATTGG 
CGACGGCGCCTGCCTCTCTTCGCCTTCTCCTTCGCCGGCGGCAACGATCGTTCGGTGGCCTCGGCTTCGGTTTCAGTTTCGCCAGCGATTTCAGA 
GAGCGTAGACGTGTGTGCTGCTGCTGCGTCGCGACGCCGCGCGAATCGAAGTACGAGGCGAAGAGGGACAGCGATAGTCAGGGAAGAGGGTGTGT 
GCGGGGGCGAGAGCTCCACTAGCAGTAACAACAGCAAGAAAAACAATAATAATAACAGTAACAACAACAGCGGAGCTTTGATGGCAGCGCCTAAG 
CCAAATGGCGAGGCAGAGCGCCCCTCAACAAACAGCAACAACAACACCATCAACTGCAGCAGCAACAACAACAATGACAACATCAGCAATAACAA 
CAACAACCATCACACCGTTGACAATGAAAGCGGTATCAACAACAACAAGAATAACGACGGCAACGCCAATTGCAATGTCAATGAACAACAGCAAC 
AACAACAACAGCAGCCAATCACACAGAATTGCGTTATTAAAGTTCGACGACCGGCGCCAAAGGTACCCGTTCGAAATCCCAACACATCTTTGAGC 
AATCCCCGCCAGATTGAAGAGGAGCAGACGAAGCCGAACGGAGGGAAAAAGGATAACCTGAGAGGCAACTCCTTTGCCGCGGAGAATTCCAGCCC 
AGAGGAGCAGGAGGAGGAGTCCATTTACCAACCCATTTGGCAGTTCAAGACAATCGAACCCGTAGATCAGGAGATAATCGAAGCCCTTCCCGTCG 
AGGGCGAAGATGAGGAGGACGAAGACGAGGAAGACGACGACGTGGATGATGAGGAGGAGGATCTGGGGGACTACGACGAGTCGCTGCCATCCGAT 
GCTGATCCGGATGTGATGGCTCTCCAAGCGGCGGCGGTCTTTGCCGCCGGAGTGCGCCATAAGTTGTCCACATCGACGCTGACCAAGCGTTCCAA 
GATGATGATGCCGACGCCCCTGCCCTCGCCCACCGCCTCCATGGACCATGGAGCGTGGGAGACGGATATGGAGTTCATGTACTCGCGGGAGAGTA 
AGGAGGAGGCGGATGAAACGCTAAGCCTGGGCAGTACGGTGACGAACAGCACAGCGACCCTGGGCTCCATCAGCAGCACTGGCAGCAGCGTCACC 
GGGCAGCCAAAGGATACGCCAAAGGATCTGCCAGCGGATCAGCAGCCATCAGTCGTGGTGCATCCGATCCTAAGGAACATTTGCATTTTCTACAG 
CCAAAAAGATCCGAAGAAGTACAGCGCTATATTCTACGACTACCATCTTTCGCTGGTCCACCAGCGGTTCATGACCCGGATCCGCCGACCAGCAC 
CACCGCCACCCGCCCAACTGGCCACCTCAAACACCACGAGTCTGGCACCACGCAGCGATGATAGTGGCTGCAGTTGCGAGGCGGAGGATTCGTAT 
AACCACCAGCACAACCCCACGCCGGCCATTCCGAAGCCCTTGAAGGAGGAGTGGGAGCGGGCCGTGCTCGCCGCGGGGGCGGTGGCCGGTAAGAG 
GGGCGTGGCTCCGAGGACGGGACGCAAACTGCGCGTTCGCTCCAATGCGAGGCTGCTGCTCACGGACTCCGTGCTGGCGTGGCGGGAAACGCTGC 
AGGATGTAAACTGCTGCGAGGACGAGGAGGACATGGTGAGTGGGTGGGGAAATGTCTTGAAAGATGAATTGAAAAACATATATATTAGTCTAGGC 
AT T ATT CAT AC AT C C AAAT T C C G ATT T T GG AAT T T C C GGC C AAC GAT C T T T AAAT AA 
(SEQ ID NO: 857) 

Start ATG: 1 (Reverse strand: CAT) 

MPQQAKDIWDNVGMEAHIYENLPGLTSTPKGHLNAELRKYFPREAIYENLCRGCGYGVFAAQGRYCHFCQCIVSGGVVSTPPPPPPPPLSTISGE 
SSNIYENICDLCRGIYSDNERCRCQRSETPAKAPAPSTPTTPPTPPTKVKSRSLLGYSKSSAATLTRLFGSLKQLNRSSSLQRRIFPRTPTTGSS 
SAGSLEIIHNVDEVFRTQETFDLQRICELKLEQQQLKGSQSDQHIYGRVREPEESQSHKRERKRPRVSRIRLLQDESTSGSSSLYLNESICQWMT 
SLRRQVHHYDEGPWQQPKSVPARDPSSSAEESTVTLRRSTEESAKCADEQAKLVKDFKRKLQQKRAHNVEPEREQRRCAITANFVNDYLSALGEE 
KEAVQHQQVAAIEEHVAVVAIAIQPQVSVSLLRLMQSLALSASLHVSVRGFLTAYDKSFSLWLWHQQQQQRQQPPINGYRRRRARDSWGVVGDW 
RRRLPLFAFSFAGGNDRSVASASVSVSPAISESVDVCAAAASRRRAKRSTRRRGTAIVREEGVCGGESSTSSNNSKKNNNNNSNNNSGALMAAPK 
PNGEAERPSTNSNKNTINCSSNNNNDNISNNNNNHHTVDNESGINNNKNNDGNANCNVNEQQQQQQQQPITQNCVIKVRRPAPKVPVRNPNTSLS 
NPRQIEEEQTKPNGGKKDNLRGNSFAAENSSPEEQEEESIYQPIWQFKTIEPVDQEIIEALPVEGEDEEDEDEEDDDVDDEEEDLGDYDESLPSD 
ADPDVMALQAAAVFAAGVRHKLSTSTLTKRSKMMMPTPLPSPTASMDHGAWETDMEFM 

GQPKDTPKDLPADQQPSVVVHPILRNICIFYSQKDPKKYSAIFYDYHLSLVHQRFMTRIRRPAPPPPAQLATSNTTSLAPRSDDSGCSCEAEDSY 
NH QHN PT PA I P K P LKEEWE RA VL AAG A VAG KRG V AP RT G RKL RVRS N ARL L L T D S VL AWRE T L QDVN C C E DEE DMVS GWGN VL K DE LKN I Y I S L G 
1 1 HT S KFRFWNFRPT I FK * 
(SEQ ID NO: 858) 



Celera Sequence No. : 142000013384502 

GCTCCGTCCTTGCCTACCAGTGAGGTCACATATTTATCTAGCTGGACAATGGTGAACTCCTCTAATTCTCCATTGTGCTGCATCACGTTCAGCTG 
ACTGGCCAGCATCGCATAGCTGTTAAAGTACTTGCCGTCGGAGATTAGGATGCGATAGCGCTCTGAGTCCGCTGCGCTGTTAATCTTCTTGATTG 
CCAGGATCTGGAGAACCGGGGCATCCACAACCTCGCCATGCATAATGCGCTGTGGCCGGTAGTACGATGTGTTACTATACCAATCCATTATTTTA 
AGCTGGCATACTTACAGCAATCACACCAGTGGACAAAGATGCCAGGACCATTTTGATTATAAAAATAAGTTGGGCAATTTCACAAGCAAATATAA 
TTCATGCGTCGAGGGCTGGGTACAAAGTGAAGCTAGAAGAGGCGGGAAGGGCTTCTGTGTAATTTTGAATCAGTTGTTGCGGAACTAGTTCAATT 
TGCTTTCCAGTTAGAGCTGTATTTCAGGGCAGTAAAGTTTTGGTTGCTTAAGAGAAAACATTTATTTTACTTTATTATTACTATTTCCTACAGCA 
ACATATTGAGAATAATTGTCTATTATTTTGACACATAGCTTTAAGCTTGACTTACATTGTTTGTGTTAATGCTCACAACTAGTTATTGTAAGCAT 
TTCTATTTCAATGGAATTATAAATATTTAATAACTGTGTGATATTTTATTGACACATTCTTTGTATTAAAGCCTACATTCAGTGCCTTGTAAACT 
TTTTAAATTGAATGGTGTAATATATATATTTAAAATTTTTTGTAAGCTCTTTAAACTCGGTGATATTTTATTGACACAAGGAAAGTTGTTACAGT 
CGGGTGTTCTTATAAACATGTATGCAGAAGCGGGTGTCCACTGTATGACCCAGACGTTAACAGCACTGCTTATCGGAACTAGACCGCAGCCCTGA 
GTTGGCTCGATTAGCAAACATTTTCCCGCGCATTTTCGCACGCGTCGTCACAACAAGATCGGCGGTACAAAACAAAACATATTACAGATAAACGC 
TAAGCACAACAGACGCACGTAATGGACAATCCCAGTTCACCACCGCCAAATACGCCCAGCGATGCCGCCGAACGCCGCGATCTGCGGGCAGCAAT 
GACGTCGCCGGTTGGCGACTTTGAGCCGTTCGAGAACGAGGACGAGATCCTGGGAGATCAGACCGTTCGCGATGAGGCCGAGGAGGAGGATGGCG 
AGGAGCTGTTCGGGGACAACATGGAGAACGATTACCGGCCCATGCCTGAGCTGGATCACTACGATCCGGCGCTGCTGGACGACGAGGATGACTTC 
TCGGAGATGTCCCAGGGCGATCGCTTTGCCGCCGAGTCGGAGATGCGTCGACGCGACCGGGCCGCCGGAATCCACCGCGACGATCGTGATCTGGG 
TTTCGGTCAGTCCGATGATGAGGACGACGTTGGACCCCGGGCCAAGAGACGAGCTGGCGAGAAGGCAGCTGTTGGCGAGGTAGAGGACACCGAGA 
TGGTGGAGTCTATTGAAAACCTGGAGGATACCAAGGGACACTCAACTAAAGAGTGGGTCAGTATGCTAGGACCACGCACGGAAATCGCCAATCGC 
TTTCAATCCTTTTTGAGAACTTTTGTGGATGAAAGGGGCGCATACACCTATCGTGATCGCATACGTCGGATGTGCGAGCAGAACATGTCTTCGTT 
CGTCGTCTCCTACACTGATCTGGCCAACAAGGAACACGTATTGGCCTATTTCCTGCCCGAGGCTCCCTTTCAAATGCTCGAGATTTTCGATAAAG 
TGGCTAAGGACATGGTGCTCTCCATTTTCCCCACTTACGAACGTGTCACCACAGAGATTCACGTGCGCATCTCGGAGCTGCCGCTGATCGAGGAG 
CTGCGCACATTCAGAAAACTGCATCTGAACCAGTTGGTTCGCACTTTGGGCGTGGTAACTGCCACCACCGGAGTTCTTCCCCAGCTATCCGTGAT 
CAAGTACGACTGCGTTAAGTGCGGCTATGTGCTGGGTCCCTTCGTTCAGTCGCAAAATACGGAGATAAAGCCCGGTTCCTGCCCAGAGTGCCAGA 
GCACTGGACCCTTCTCAATCAACATGGAGCAGACGTTGTATCGAAATTATCAAAAGATCACGTTGCAGGAGTCGCCAGGCAGGATTCCAGCCGGA 
CGCATTCCTCGCAGCAAGGATGTCATTTTGCTCGCTGATTTGTGTGATCAATGCAAGCCTGGAGATGAACTTGAGGTCACAGGTATATATACCAA 
TAACTACGATGGCTCCTTGAACACCGACCAAGGATTTCCCGTGTTTGCCACCGTGATTATTGCCAATCATGTGGTTGTAAAGGACTCCAAGCAGG 
TAGTGCAGTCGCTTACGGACGAGGACATTGCCACCATTCAGAAGTTGAGCAAGGATCCGCGTATCGTAGAGCGTGTAGTAGCATCCATGGCGCCT 
TCTATATACGGACATGATTATATCAAGAGGGCGCTGGCCTTAGCTCTCTTTGGGGGTGAGTCAAAGAATCCCGGTGAGAAACACAAGGTTAGAGG 
TGATATAAACCTACTCATCTGTGGAGATCCCGGAACGGCAAAGTCGCAGTTCCTCAAGTACACCGAAAAGGTTGCGCCACGTGCGGTTTTCACGA 
CTGGGCAGGGAGCCAGTGCCGTGGGTCTCACGGCTTATGTGCGCCGCAATCCAGTATCTCGGGAGTGGACATTGGAAGCGGGTGCTCTGGTTCTG 
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GCTGATCAGGGAGTTTGTCTAATCGACGAGTTCGACAAAATGAACGACCAGGATCGTACCTCTATTCACGAAGCCATGGAGCAGCAGTCTATTTC 
GATTTCCAAGGCTGGTATTGTCACCTCTCTTCAAGCTCGTTGCACCGTGATTGCTGCTGCCAATCCCATTGGCGGTCGCTATGATCCGTCGATGA 
CCTTCTCGGAGAACGTGAATCTCTCGGAGCCCATCCTGTCCCGTTTCGATGTGCTGTGCGTGGTGAAGGACGAATTTGACCCCATGCAGGATCAG 
CAGCTCGCCAAATTCGTGGTACACTCGCATATGAAGCATCACCCGAGTGAGGAGGAGCAGCCGGAGTTGGAGGAACCACAGCTGAAGACCGTTGA 
TGAGATCCCGCAGGACTTGTTGCGACAATACATTGTATACGCTAAGGAGAACATTCGGCCCAAGCTAACGGTAAGCGAACTTATTGAAATCACAG 
GAAAAGTGTAACAACATTTCCGTTTCATACAGAACATCGACGAGGACAAGATCGCCAAGATGTACGCCCAGCTGCGACAGGAGTCTTTTGCCACG 
GGCTCACTGCCCATTACGGTGCGTCACATTGAGAGTGTCATCCGAATGTCCGAAGCGCACGCGCGGATGCATCTGCGCGAAAACGTAATGGAGGC 
AGATGTCAGCATGGCCATCCGCATGATGTTGGAGAGCTTTATCGAGGCGCAGAAGTTCAGCGTTATGAAGAAGATGCGCAGCACATTCCAGAAGT 
ACCTGTCCTTCCAAAAGGACCATTCCGAGCTGCTGTTCTTTATTCTGCGACAGCTGACGCTCGACCAGCTCGCCTACATTCGCTGCAAGGACGGG 
CCTGGCGCCACACATGTGGAGATTATGGAGCGGGATTTGATCGAACGGGCTAAGCAGCTGGATATTGTAAACCTGAAGCCGTTCTACGAATCAGA 
TCTGTTTCGCACAAATGGCTTCTCTTACGATCCCAAGCGACGCATCATCCTTCAAATTGTGGTCGACGGCAACACAGCTTAAGTTTCAGCCATTT 
ACTTCATCTCATTTAGTTATAGTATTCATGTTTTGTTACCCCCATTTTCGGTCTAATAAAGTTTGATATACTTATGTATGTATTTATACTAAAAT 
AGGTATTTGAATTGAAATTTATACTTCCGTAAATATTTTTTATTGTGCATACAAATCAGCCCCAGAGTATGTCAATCGTTGTTCCCTTCGATGCC 
CATGGCTTTGTCGAAATCTTCCTCGCTAATCTTGTTCTTTTTCAGCCGCTTGAAGAGCCGAATATCGCTGGCCAGTTCATCGAGGTCCTCCTGGC 
TGAACTGCTGCCTCTTTTTTCCCTTCCCGGACCTGCTCGCCTCCGCCTCTGCCGCTTTCTTTCGCTGCTTCTTCGCCTTGCGCAGCTCCTTCTTA 
GACTTGGCATCCAGTTTTGCCTTCTTTGTCTGATCCCAGGACTCAACACGCTTTTTGTGCTGCTTTTGGCCCGGCCAGCTTCCCGTTTGCTCGTA 
AGTTTCCATTTTCTTTTGCCGGACTTGTTCTTTCTGGGCGTTTTTGTAGGTAAGTTTGGATAAATCCACTTCAAAAGCAGGCGCCACATAGCCGC 
CGCCCTGGTAGTTCTTTAGCTCCGGCATGCGTGGCAGTTGCAGGAGACCGTAAGCAGTGGCCATCTTGCCCAGGTCCAGATCCTTTAGTCGCAGA 
ATAGCGCTGCATTCGTGTTTGGTGTATGCTCTAACATGGGAAACAAAAGCGCGCATGCCTTTGTCATAGACGCCCTTGTCGGCCGCCTGCAGCCG 
ATGGAGTTGATCTAAGACGGCTGGTAGCTTCTTTTTTTCACGATCTGCATCCTCAGCCTCCTCGGTTAGCAGTTTAGTTAGCTCCACTTTCTGGT 

T GAT C T T C AGG AAGT GC AC AT AGG C AT CC T C GCT GGG C AG GAG AAAC AC C AG AGC GT TGCCCTCGTTTCCCT GC CG AGC C GT GC G AC CT AC AC GG 
TGTACAAAGCTGGAGGCAGTTGATGGTGGATCCCACTGCACTACCCACTCGATCTCAGGTACATCCAAACCACGTGCGAGGACATCTGTGCAGAG 

G 

(SEQ ID NO: 859) 

Exon: 1001.. 3205 
Exon: 3263.. 3846 
Start ATG: 1067 

Transcript No. : CT23073 

CAACAAGATCGGCGGTACAAAACAAAACATATTACAGATAAACGCTAAGCACAACAGACGCACGTAATGGACAATCCCAGTTCACCACCGCCAAA 
TACGCCCAGCGATGCCGCCGAACGCCGCGATCTGCGGGCAGCAATGACGTCGCCGGTTGGCGACTTTGAGCCGTTCGAGAACGAGGACGAGATCC 
TGGGAGATCAGACCGTTCGCGATGAGGCCGAGGAGGAGGATGGCGAGGAGCTGTTCGGGGACAACATGGAGAACGATTACCGGCCCATGCCTGAG 
CTGGATCACTACGATCCGGCGCTGCTGGACGACGAGGATGACTTCTCGGAGATGTCCCAGGGCGATCGCTTTGCCGCCGAGTCGGAGATGCGTCG 
ACGCGACCGGGCCGCCGGAATCCACCGCGACGATCGTGATCTGGGTTTCGGTCAGTCCGATGATGAGGACGACGTTGGACCCCGGGCCAAGAGAC 
GAGCTGGCGAGAAGGCAGCTGTTGGCGAGGTAGAGGACACCGAGATGGTGGAGTCTATTGAAAACCTGGAGGATACCAAGGGACACTCAACTAAA 
GAGTGGGTCAGTATGCTAGGACCACGCACGGAAATCGCCAATCGCTTTCAATCCTTTTTGAGAACTTTTGTGGATGAAAGGGGCGCATACACCTA 
TCGTGATCGCATACGTCGGATGTGCGAGCAGAACATGTCTTCGTTCGTCGTCTCCTACACTGATCTGGCCAACAAGGAACACGTATTGGCCTATT 
TCCTGCCCGAGGCTCCCTTTCAAATGCTCGAGATTTTCGATAAAGTGGCTAAGGACATGGTGCTCTCCATTTTCCCCACTTACGAACGTGTCACC 
ACAGAGATTCACGTGCGCATCTCGGAGCTGCCGCTGATCGAGGAGCTGCGCACATTCAGAAAACTGCATCTGAACCAGTTGGTTCGCACTTTGGG 
CGTGGTAACTGCCACCACCGGAGTTCTTCCCCAGCTATCCGTGATCAAGTACGACTGCGTTAAGTGCGGCTATGTGCTGGGTCCCTTCGTTCAGT 
CGCAAAATACGGAGATAAAGCCCGGTTCCTGCCCAGAGTGCCAGAGCACTGGACCCTTCTCAATCAACATGGAGCAGACGTTGTATCGAAATTAT 
CAAAAGATCACGTTGCAGGAGTCGCCAGGCAGGATTCCAGCCGGACGCATTCCTCGCAGCAAGGATGTCATTTTGCTCGCTGATTTGTGTGATCA 
ATGCAAGCCTGGAGATGAACTTGAGGTCACAGGTATATATACCAATAACTACGATGGCTCCTTGAACACCGACCAAGGATTTCCCGTGTTTGCCA 
CCGTGATTATTGCCAATCATGTGGTTGTAAAGGACTCCAAGCAGGTAGTGCAGTCGCTTACGGACGAGGACATTGCCACCATTCAGAAGTTGAGC 
AAGGATCCGCGTATCGTAGAGCGTGTAGTAGCATCCATGGCGCCTTCTATATACGGACATGATTATATCAAGAGGGCGCTGGCCTTAGCTCTCTT 
TGGGGGTGAGTCAAAGAATCCCGGTGAGAAACACAAGGTTAGAGGTGATATAAACCTACTCATCTGTGGAGATCCCGGAACGGCAAAGTCGCAGT 
TCCTCAAGTACACCGAAAAGGTTGCGCCACGTGCGGTTTTCACGACTGGGCAGGGAGCCAGTGCCGTGGGTCTCACGGCTTATGTGCGCCGCAAT 
CCAGTATCTCGGGAGTGGACATTGGAAGCGGGTGCTCTGGTTCTGGCTGATCAGGGAGTTTGTCTAATCGACGAGTTCGACAAAATGAACGACCA 
GGATCGTACCTCTATTCACGAAGCCATGGAGCAGCAGTCTATTTCGATTTCCAAGGCTGGTATTGTCACCTCTCTTCAAGCTCGTTGCACCGTGA 
TTGCTGCTGCCAATCCCATTGGCGGTCGCTATGATCCGTCGATGACCTTCTCGGAGAACGTGAATCTCTCGGAGCCCATCCTGTCCCGTTTCGAT 
GTGCTGTGCGTGGTGAAGGACGAATTTGACCCCATGCAGGATCAGCAGCTCGCCAAATTCGTGGTACACTCGCATATGAAGCATCACCCGAGTGA 
GGAGGAGCAGCCGGAGTTGGAGGAACCACAGCTGAAGACCGTTGATGAGATCCCGCAGGACTTGTTGCGACAATACATTGTATACGCTAAGGAGA 
ACATTCGGCCCAAGCTAACGAACATCGACGAGGACAAGATCGCCAAGATGTACGCCCAGCTGCGACAGGAGTCTTTTGCCACGGGCTCACTGCCC 
ATTACGGTGCGTCACATTGAGAGTGTCATCCGAATGTCCGAAGCGCACGCGCGGATGCATCTGCGCGAAAACGTAATGGAGGCAGATGTCAGCAT 
GGCCATCCGCATGATGTTGGAGAGCTTTATCGAGGCGCAGAAGTTCAGCGTTATGAAGAAGATGCGCAGCACATTCCAGAAGTACCTGTCCTTCC 
AAAAGGACCATTCCGAGCTGCTGTTCTTTATTCTGCGACAGCTGACGCTCGACCAGCTCGCCTACATTCGCTGCAAGGACGGGCCTGGCGCCACA 
CATGTGGAGATTATGGAGCGGGATTTGATCGAACGGGCTAAGCAGCTGGATATTGTAAACCTGAAGCCGTTCTACGAATCAGATCTGTTTCGCAC 
AAATGGCTTCTCTTACGATCCCAAGCGACGCATCATCCTTCAAATTGTGGTCGACGGCAACACAGCTTAAGTTTCAGCCATTTACTTCATCTCAT 

TTAGTTATAGTATTCATGTTTTGTTACCCCCATT 
{SEQ ID NO: 860) 

Start ATG: 67 

MDNPSSPPPNTPSDAAERRDLRAAMTSPVGDFEPFENEDEILGDQTVRDEAEEEDGEELFGDNMENDYRPMPELDHYDPALLDDEDDFSEMSQGD 
RFAAESEMRRRDRAAGIHRDDRDLGFGQSDDEDDVGPRAKRRAGEKAAVGEVEDTEMVESIENLEDTKGHSTKEWVSMLGPRTEIANRFQSFLRT 
FVDERGAYTYRDRIRRMCEQNMSSFVVSYTDLANKEHVLAYFLPEAPFQMLEIFDKVAKDMVLSIFPTYERVTTEIHVRISELPLIEELRTFRKL 
HLNQLVRTLGVVTATTGVLPQLSVIKYDCVKCGYVLGPFVQSQNTEIKPGSCPECQSTGPFSINMEQTLYRNYQKITLQESPGRIPAGRIPRSKD 
VILLADLCDQCKPGDELEVTGIYTNNYDGSLNTDQGFPVFATVIIANHVVVKDSKQVVQSLTDEDIATIQKLSKDPRIVERVVASMAPSIYGHDY 
IKRALALALFGGESKNPGEKHKVRGDINLLICGDPGTAKSQFLKYTEKVAPRAVFTTGQGASAVGLTAYVRRNPVSREWTLEAGALVLADQGVCL 
IDEFDKMNDQDRTSIHEAMEQQSISISKAGIVTSLQARCTVIAAANPIGGRYDPSMTFSENVNLSEPILSRFDVLCVVKDEFDPMQDQQLAKFVV 
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HSHMKHHPSEEEQPELEEPQLKTVDEIPQDLLRQYIVYAKENIRPKLTNIDEDKIAKMYAQLRQESFATGSLPITVRHIESVIRMSEAHARMHLR 
ENVMEADVSMAIRJYMLESFIEAQKFSVMKKMRS 
PFYESDLFRTNGFS YDPKRRI I LQI VVDGNT A* 
(SEQ ID NO: 861) 

Name: Minichromosome maintenance 2 
Classification: DNA_repl i cat ion_f actor 
Gene Symbol: Mcm2 
FlyBase ID: FBgn0014861 



Celera Sequence No. : 142000013384523 

TTTGCAATTGTTTATGGGCAAAGCAACAGACAGAGATGGCGAGTGGGGCGGCAGCGAGGGGCGCGAGCGAGATGGAAGACAAAAAATGTGAGCGA 
ACGGCAAACTAAAATGTATCTATGAGATACATGGCCTGATCGCCAGCGCTCTCGCTCGCTCTCCTTTGGCTCTCTCGCACTCATTACGTATGAGT 
TTGTTGGTTTTGTTTTCCTTTTATTTTACAGATCATTCGCCGAGCTTTATTTAACTTTTGCTCTCTCTCACTCGAATTTTTGTTGTTGTATTTCC 
CTGGCTTTACAATCCATTTTTTGTTTGTTGCCACTCTCTGTTCAGTTCAGTTATATTCGGAGTCTTAATCATCGCATCGGAATGTTTACTCCAAG 
TTTTTCCCCCGTTCCGTTGCGTTTTGTTTTGCCTGCTGCCGCTGCTGTCTTGTTTAAAATATGTAATTTATTTTGCTGCTGCCTCTTTATTTTAT 
TTTCCTTTTATTTTTCTTTTTTTTTTTTGCTTCATTTCGCCTCCACCGATCGCATTCCTTTACACACACAGCTGCAGCACGCACCTTTTCTTTCC 
GGTTCAAGAGCGCTTCGGAATCGGAATTTATATGGGTTTTTTTTTTAGCTCGTTTGCCGGAGCTTCCTGAATGAGTGAACTGCTGATCTCTCACC 
TGCGTTTTATGCCTTCTTAAGCCACTAACAACTTGCCAAGAATTTTATTCTATTTTAATTTGATTCTCAATGAGTTATCTGCTTCACTTATTAGC 
CATTAAGCAGCAATTGGGGTTTATAAGCACTCTTGATAGTTTGCGATGTTAAAGCATCTATAACTTATTTAAAACACAACCTGTTGGTTTGGTCA 
TTCGTTATGTATGCCTCATTTCTGATAATTGTTCTAGTAGTTTTAATAAGTTTAAACTAACTTTCCAAATGCAAATATCTTATAATTATATTTCA 
GATCGCCCTTGTTTCAATTTCGTTTCTTTAATTGACCCTTTCCGGAACACTTACCAATCTCTGGGATCCAGCCAGGTGGTCTTCTTGTTGATATG 
ATCTATGTAATAGGTCTTGCCATCGAAATCCTTGGCGATATCCCAGCCATCGGGAAGTGGGAAGTCGCTGTGGTGTTGTTGCTGCTGCTGATGAT 
GCGTGTGTTGTTGTTGTTGCTGTTGTTGATGATGATGATGATGTTGCTGTTGCTGCTGTTGGGGTCGCAGGTGGTGGGGATGCAGATGATGCTGC 
GATTGCGATGCGGTTTGTTGCAGATTCGGCATTTTAACTTTGTTTCTCGCCGTGTACTTTGGTTAGTCTGAGAAAACGTCGTTGTTGCTGGATTT 
TTCACCCGACGCGCGCCTTTTCTTTCTTTTCGCTATTGCTATCTACGTTTGCTTCTTTTTTCGGGGGGTTTCTCTTTGCGCTGTTTACATTAGCA 
AAGACATTTCAATTCCGTTTCTTTTGCTCCGGCGATAATGGAGACCGACCCGCACACACACACACACAACACGGCCACACGGTGACGGGCGTAGA 
G GC AC AC AC C G AGCC AAC T C G AAT AC AC ACGC GC G C AGGGGAG AG AG AGC GGG AG AG C GC GGGC GC GCT CT C AAT T AT T T AT T T ACT T T T T T C GT 
AACTGCATTCCATTCCAGCTCAAACACACGCACACATATGGGGGACGAACACAAGCGCACACGATCACTGCTCACAGTTGGTGCTGCTTCTTTTC 
ATTTCTTGTTGTTGCAAGCGTTTGAACTCCTTTACTTATTTTCTTATTTTCGCCTAATTGAAACTAATTAACGACGATAATCCAACCATCCACTT 
GGTGTTCACAAAAAAATATGTACTTCAGTTTAGTGAGTTCGCGCCGTCGCCGATCTCTTCGGTTCCGGAATTCCTTGGGGGAGCAGCAAAAAACA 
CGAGTAGCGCGCTTGATGACCGTCCGATTCGGATTCGGAATTTGTTTTGGTAACATTCGCTGCTAGTGGTGTAGCAGAAAACATCGCATTGCATA 
TCGATACACTCGCGGCGATGTTATTCGATGTATTGGTTGATATCGATATTATAGGCTGTATTGACGCGGTTGCAGAGATGCGATATTTAACAATC 
ATGCACGAAAAGTATTTATTGTTATTTGGCTTGTGTAGTCCCCAAGCATAGGTGTAATTGTTGAGGGAAGAAAATAGGAGTTGTACGTTCAAAGA 
TCGGCTTGGTAACAGGAACTAGTTCGGTTCTTCAAGCGATGTTAATCAAAGCCAGTTCCGATAGATCATGTTAACTCGCTCTGTTAACAATTACT 
GTTATCTTAATCCTAACATTTCCGTTACATTTTAAAAACAGAAAAAGAAAAAATTTAAAAAAAAAGACTGTAAACTGAGTTACGCATTTCGAAAT 
CAATATGTATCCCAACCATAGATAGAAAAATAAATTATGTTTCCGAAACATAACTTCAATAAAACCACAATTTCGCTTGCTTTGTACGTTTGATA 
TTGTATTTGCATTTTATGAATTGTAAAAATTGAGTCAACAAGACTAACACAGCTTCAGCTCAAGACACATGCAATTTCATTGACATTGTTTAATT 
GTATATATTAAAGGAAATCAATTAGGATATAAAGAGAGAAACCTTTTTGCCGCTTATGCGATGCCAACTATCTGGATCTGGATTGTCTAACACGA 
AACCCTAACTAATTGGTAAATTAAACATAAATTAATCATATCGGTCAACAACTACAACTAAATATTAGTTATTAGCGGATAAAGAGCGCCCCATA 
AACTATTTACACTAAGCTTTAGTCTACTAGAGTCCCGGATTCCATCTCCGCTCCTCGGATCTTCGGATCTTAACTCAGATGTTCGGATCTTCAAG 
TTCTAGGAGAGAACTGGTGCAGTGCATCCGTTATTCAGGCGATACTCGGCATAGACCAGTCCGGTGTTGTGCCAGTGATAGA 
(SEQ ID NO: 862) 

Exon: 1932.. 1880 
Exon: 1462.. 1001 

Start ATG: 12 67 (Reverse strand: CAT) 
Transcript No. : CT23083 

CCGAATCGGACGGTCATCAAGCGCGCTACTCGTGTTTTTTGCTGCTCCCCCAATATCGCCGGAGCAAAAGAAACGGAATTGAAATGTCTTTGCTA 

ATGTAAACAGCGCAAAGAGAAACCCCCCGAAAAAAGAAGCAAACGTAGATAGCAATAGCGAAAAGAAAGAAAAGGCGCGCGTCGGGTGAAAAATC 

CAGCAACAACGACGTTTTCTCAGACTAACCAAAGTACACGGCGAGAAACAAAGTTAAAATGCCGAATCTGCAACAAACCGCATCGCAATCGCAGC 

ATCATCTGCATCCCCACCACCTGCGACCCCAACAGCAGCAACAGCAACATCATCATCATCATCAACAACAGCAACAACAACAACACACGCATCAT 

CAGCAGCAGCAACAACACCACAGCGACTTCCCACTTCCCGATGGCTGGGATATCGCCAAGGATTTCGATGGCAAGACCTATTACATAGATCATAT 

CAACAAGAAGACCACCTGGCTGGATCCCAGAGATTGGTAA 

(SEQ ID NO: 863) 

Start ATG: 249 (Reverse strand: CAT) 

MPNLQQTASQSQHHLHPHHLRPQQQQQQHHHHHQQQQQQQHTHHQQQQQHHSDFPLPDGWDIAKDFDGKTYYIDHINKKTTWLDPRDW* 
(SEQ ID NO: 864) 



Celera Sequence No. : 142000013384651 

CAATTCAGTGGATTTCCGAAGAATCACACGAAACTCGCAATAAATATGTACACATATAGAAATATATATAGCATACATACATATGTAGGTGCCCC 
AAAGATATCGACTTGTTGCCTCTGGGGCAACAACATAGAAGTCGGGATAATGGTAAACACATAAAGGGCAAACTTTTTGACCGATATCGGATTAC 
ACTATCCACACCCACACACTTGTACAACGACCAGAGTGATGAAGTGCGGGAGCGAGGGAGGTTAAAAGCGAACAATTAAAAAAGCACATGGTTTT 
CAGGCGATGGCGCCACCTCCAAAGACCTTTCGAATTCAAGTGCAGGCGTTTATTTTTGGGAGCAAACTCACCTGGGCCAGTTGATCCTCGGCCGC 
CTTGCGCTGTTCCGGATTCGGATCAATAGTTGCGCGCAACAGTTCCGTGAGTTTTTGTGCCTCCATTTTTGCTACTTTTGCTTGATTCTATGTTT 
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CGCCGGTCTTTCTGCTGCACTTCGTCGTTTGGCGGTCACCTTGTGCCCACTGTGCACCCGCTGTTCTTGCCTTTATATAGACCTTGGTTTATGCT 
AGCCGAACCAGTGGCTCCAATGTGTGGGCCAAATTGCAAATCTCTTTTTCCGGAGGAAAAAACGTGGAAAAGACTCCACACACGCACACACACAC 
CGCCGAAATTCGCGAGACGAACGAAGCTTAGAAGAAGAAGAAACAGCGCCTGTGTGTGTTGACTAACGCCGCAGTGTATGGGGCCCTACAGGGCT 
GCATTATGTTCGCCGGTAACAGTTTAGCCAAAAATTCGAGCAATATTGAAAAGCCTTCGCTGCGGAAAATCTAGGCACCATTTATATATAGACCA 
CTTGCTTCATAAGGTAACTATATTTAAAGGCTTAAAAAACAGCGACTTTTCACAATTCCCGATAAATTGCCATTCAAGTTGCATGGCCAATCGAT 
TGTTGCAGCCCTGGCAGCTGCTCGTACGTGAACACGCGAATGTGTGTGTGACGAAGAGCAAAGTGACACATAGTTGGAGGTGTGGTTGAAAATAT 
ATAGAAATCTGGGTTTATCTGCAACTGAAACGAAGGGAAATTTGTGTTAAAGGCGAGCGAAGATCGCAACTGAAGAGCTTTAACTGGCAGCAAGG 
CAAAAGGACGAGCAGCGACCATGGCAGGCAGGCTACCGGCATGCGTAATCGATGTGGGCACCGGGTGAGTTTTTAGTTTCGATTGCCCGTGGGGG 
TCTCCAATTGGACGGAAAGGAGGTCGACTGACTAACACTCTGCTTGTCTGCACATCCATCAGCTACTCCAAACTGGGATTTGCCGGGAATAAAGA 
GCCGCAGTTCATTATTCCCTCGGCGATTGCCATCAAGGAGTCGGCACGAGTGGGAGACACCAACACACGGCGCATCACAAAGGGCATCGAGGACC 
TGGACTTCTTCATCGGCGATGAGGCCTTCGATGCCACCGGCTACTCCATCAAGGTATGCTCTTGTATCTATGGTCATTAAGCAACCGACATCACA 
TTTATCATTGCATTCCAATAGTATCCGGTGCGTCATGGTCTGGTGGAGGACTGGGACTTGATGGAGCGCTTTCTGGAGCAGTGCGTCTTTAAGTA 
TCTGCGCGCCGAGCCTGAGGATCACTACTTCCTGCTGACTGAGCCGCCGCTAAATACGCCCGAAAATCGGGAGTACACCGCGGAGATAATGTTTG 
AGACGTTCAACGTTCCTGGTCTGTATATCGCCGTCCAGGCGGTGCTTGCCCTAGCCGCCAGTTGGGCCTCCCGGTCCGCAGAGGAGCGTACCCTT 
ACAGGCATCGTGGTGGACAGCGGCGATGGAGTGACGCACGTCATACCCGTGGTAAGTAGCACAGATCTTATCTTATCAGCAGAGTTGGGTTGGCA 
CTGGATTCGGTAAAAACTGTTTGAAAAGAGAACCCAAAACCGAAACAGTCGACGGATTTATTTAATAAATAAATAGCACATTCTGAGGCTATTAT 
TAATACTACAGAATGTTATATTAAGAGGGAATTATTAAGATAAAATAGCGTATTAGTTATAACCATATAAATATATTAAGCTGTTTTTCATTTCT 
AGGCCGAGGGCTATGTGATCGGCTCCTGCATCAAGCACATTCCCATCGCGGGCCGCAATATCACCTCGTTCATCCAGAGCCTGCTACGCGAACGG 
GAGGTGGGCATTCCGCCGGAACAGAGCCTCGAGACTGCTAAAGCGATCAAGGAGAAGCACTGCTACATCTGTCCCGATATCGCCAAAGAATTTGC 
C AAG T AC GAC AC GGAGC C GGGC AAGT GG AT AC GC AAC T T T AGC GG C GT T AAC ACG GT AAC C AAGGC G C C G T T C AAT GTGG AC G T GG GC T AC G AGC 
G AT T C CT GGG AC C C GAG AT CTTCTTCCATCCC G AGT T CTC AAACC C C G ACT T T AC CAT T CC ACTGT C GG AAAT AGT G GAT AAT GT C AT C C AAAAC 
TGCCCAATCGATGTGCGGCGTCCTCTATACAATAACATTGTTCTGAGCGGTGGATCAACGATGTTCAAGGACTTCGGCAGGAGGTTGCAGCGAGA 
TAT C AAGCGATC GG T GG AC AC AC G TCT C C GG AT C AGC GAG AAT T T G T C C GAGGGAC GC AT T AAGGT GGGT GAT AT CACAGCT GC T AAT G AGTC AT 
ACTTCTTACTTT GAT TAT T T T T GT T AC AGC C AAAAC C C AT T G AT GT T C AG GT G AT T ACG C AC C AC AT GC AG AGGT AC GC C GT GT GGT T T GGAGGC 
AGCATGTTGGCTTCAACGGTAAGTTATAGTGGCCTAAGATCCGTAAAATCCTTTATCTCAATATTCTTCCCATTTCTAGCCCGAGTTCTATCAGG 
TGTGTCACACGAAGGCTGCCTACGAGGAGTATGGCCCCAGCATTTGCCGTCACAATCCCGTCTTTGGCACCATGACATAATTGATCCGATCCAAT 
CGGCTTGCAGCAGTTATTTTTGATTTATTTTTCCATTTAAAATTGATTTATTGATTTATTCTCCAACTATATATATATATATATACATTGCGATT 
CGGTATTCCACGCTAAAGT ATT AGTAATTATTCTTTTCTATATGTGTTTAGCTATTAACTCTGTGT AAAT CAGTTT CGTTCTGTTT AAT TTCGTG 
TGATTTTGTTAGGCTAACTATAAAGCAACAACTCCGATGTCTGCCAGTTCAACCCCCTATCTAAGGTGCATTATTTCTAGAAGTCGACTTTTATA 
CCATACTCAAACACGATCACACACAGTCACACCGATCTAAGCTCTTGGAGAAATCCCGGCTCTATACACAATTAAATACGAAAACGCTCTACTTG 
T C AT AGAAAT AT T TAT AAT T G C AAT AACT GC AAC C AATT C AAAC ACT G AAT G AAC AAAAAAGG AAAAACG AAAA G AAAT AT T GAT GAG AAAT C T G 
TTTGTTATATTGATAAGCGTAATTAAAAAGGCATTAAATAAATCAATAAAACATTCTTTAAGCTCAAACACTTTCGTCGCTTTCGATCTAAAGCT 
T T T AC T GG AAAAT TT T AT T C AAT GC AT AAGT T AC AT AT AT AT AT T T TT T T GT G T T CAAAT T G C T CT AT GT AT T GT AT AT AT AAAT T T T GT GAC GG 
TGTCGCTAAGCTGAGAAAATAGCAAAAGGTGGTTTCCAAAAAACATGTCCCGAGCTAGAAATCCGAAACAACTTCGTTACGCCTCGCGTAATTGA 
CAAACTGACTAGTCAATGTAGTCACTGGAGCCCTCCTCCTCGTCGTTGCTGCTCGTGGATTGTGCCGCCGCCGTTTCCGGCGAACGCAAGTGCTC 
TGCAATCCGGAAGTTTTGCAGGAAGTCTCGCATCGCGTCCATCATTAGCGTTAATGACTGCTGCAAACCAGCTTCTGCAATAATGTTGGAGGAGA 

AGTGTAACTACGATATCCGCTAGCGATTCTTTTGG 
(SEQ ID NO: 865) 

Exon: 1001.. 1204 

Exon: 1298.. 1478 

Exon: 1542.. 1856 

Exon: 2093.. 2629 

Exon: 2690 . .2773 

Exon: 2835 . .2930 

Start ATG: 1161 

Transcript No. : CT23123 

AC G AAG AGC AAA GT GAC AC AT AGT T GG AGGT GT G GT TG AAAAT AT AT AG AAAT CTGGGTTTATCT GC AAC T G AAAC G AAG GG AAAT T T GT GT T AA 
AGGCGAGCGAAGATCGCAACTGAAGAGCTTTAACTGGCAGCAAGGCAAAAGGACGAGCAGCGACCATGGCAGGCAGGCTACCGGCATGCGTAATC 
GATGTGGGCACCGGCTACTCCAAACTGGGATTTGCCGGGAATAAAGAGCCGCAGTTCATTATTCCCTCGGCGATTGCCATCAAGGAGTCGGCACG 
AGTGGGAGACACCAACACACGGCGCATCACAAAGGGCATCGAGGACCTGGACTTCTTCATCGGCGATGAGGCCTTCGATGCCACCGGCTACTCCA 
TCAAGTATCCGGTGCGTCATGGTCTGGTGGAGGACTGGGACTTGATGGAGCGCTTTCTGGAGCAGTGCGTCTTTAAGTATCTGCGCGCCGAGCCT 
GAGGATCACTACTTCCTGCTGACTGAGCCGCCGCTAAATACGCCCGAAAATCGGGAGTACACCGCGGAGATAATGTTTGAGACGTTCAACGTTCC 
TGGTCTGTATATCGCCGTCCAGGCGGTGCTTGCCCTAGCCGCCAGTTGGGCCTCCCGGTCCGCAGAGGAGCGTACCCTTACAGGCATCGTGGTGG 
ACAGCGGCGATGGAGTGACGCACGTCATACCCGTGGCCGAGGGCTATGTGATCGGCTCCTGCATCAAGCACATTCCCATCGCGGGCCGCAATATC 
ACCTCGTTCATCCAGAGCCTGCTACGCGAACGGGAGGTGGGCATTCCGCCGGAACAGAGCCTCGAGACTGCTAAAGCGATCAAGGAGAAGCACTG 
CTACATCTGTCCCGATATCGCCAAAGAATTTGCCAAGTACGACACGGAGCCGGGCAAGTGGATACGCAACTTTAGCGGCGTTAACACGGTAACCA 
AGGCGCCGTTCAATGTGGACGTGGGCTACGAGCGATTCCTGGGACCCGAGATCTTCTTCCATCCCGAGTTCTCAAACCCCGACTTTACCATTCCA 
CTGTCGGAAATAGTGGATAATGTCATCCAAAACTGCCCAATCGATGTGCGGCGTCCTCTATACAATAACATTGTTCTGAGCGGTGGATCAACGAT 
GTTCAAGGACTTCGGCAGGAGGTTGCAGCGAGATATCAAGCGATCGGTGGACACACGTCTCCGGATCAGCGAGAATTTGTCCGAGGGACGCATTA 
AGCCAAAACCCATTGATGTTCAGGTGATTACGCACCACATGCAGAGGTACGCCGTGTGGTTTGGAGGCAGCATGTTGGCTTCAACGCCCGAGTTC 
TATCAGGTGTGTCACACGAAGGCTGCCTACGAGGAGTATGGCCCCAGCATTTGCCGTCACAATCCCGTCTTTGGCACCATGACATAA 
(SEQ ID NO: 866) 

Start ATG: 161 

MAGRLPACVIDVGTGYSKLGFAGNKEPQFIIPSAIAIKESARVGDTNTRRITKGIEDLDFFIGDEAFDATGYSIKYPVRHGLVEDWDLMERFLEQ 
CVFKYLRAEPEDHYFLLTEPPLNTPENREYTAEIMFETFNVPGLYIAVQAVLALAASWASRSAEERTLTGIVVDSGDGVTHVIPVAEGYVIGSCI 
KHIPIAGRNITSFIQSLLREREVGIPPEQSLETAKAIKEKHCYICPDIAKEFAKYDTEPGKWIRNFSGVNTVTKAPFNVDVGYERFLGPEIFFHP 



FIGURE SHEET 465 



EFSNPDFTIPLSEIVDNVIQNCPIDVRRPLYNNIVLSGGSTMFKDFGRRLQRDIKRSVDTRLRISENLSEGRIKPKPIDVQVITHHMQRYAVWFG 
GSMLASTPE FYQVCHTKAAYEEYGPS I CRHN P VFGTMT * 
{SEQ ID NO: 867) 

Name: ACT IN -RELATED PROTEIN 

Classification: cytoskeletal_structural_protein 
Gene Symbol: Arp66B 
FlyBase ID: FBgn0011744 



Celera Sequence No. : 142000013384139 

TGTGCCTCGTCGACGACGCTGCTTAATGTCAAGGTTTTTTGGCTGCTGTTGCAAGTGCAGCAGCAGCAGGAAAAACAGGAAAACCAGAAACGGCC 
GAATGAGAACGGCACGTGGCCAGCCCCCGCAACGAAAAGGGGCCAGAAAGGGGGTCGATTATGCCATTACCAGGGCAAAGCTGCACCTTCCGCCC 
CCGGAACCCCCAACCCCCTGCCAGTGCGCCGCGCATGAATGCCGCGACAGACCCGATTCGAAAGCCAATGTCAAGTGCACGCGGCACAAGGATAT 
GGAGGCAGGAGTATATTACAGGCCGCCATCTGCCCCACTGTCAGTTCCCCTTCCTCAGTGCCCACAGCAGCTACAGTGGAGCTTCTATAAATTGA 
CATTGGGCTTTCATCAATATTGAACTCTTCGATATGGACTGGCTTATGAAGACTCGTTTATAATAACACATAAATCCTATTTTCATATACGGCTT 
AAGTTTGACATACACAGGCTCGACTGTACCTCATCATCGAGCATTAGGACGGTCGTTGTTGTGACCTCCAAATTGGTTGGCACACTCACCTAAAT 
GGGAGTTTATCTATATTTTATGTGCACGACTTATACCTCAGGCCCATTTCACTTGATCACCAATTAGTGACTATATGCTCAATATTTGCCAATAT 
CAAACATCAGGTGCTTCGTCAATTAACTTGTTAATTAGATAGGAAATAGATTCTGTGGTTATAATTGTTTTTGTACCGTATTATAATTGTTACTA 
CTTATTGTAGACGAATTTACATTAATCGATATCCATCCAAATTAGTCTGCGAATTCATTAGTTCAACAAACTACTCAAGGATGATGAACTTTTAC 
TAAAACTAATTCCGTTCATCGGCTTTAAATAAAATAAATCAACTGTTTCATAATAGTAATTAATAAGTCATATTAGTCATATAAATAATAAGGGT 
CCACACACAAAATTGAATACAAGATAAAACGCTCTTCTCATTTGATATTTATTCAAATTATTATTTAATGTAAGTTTTGAACTTATTTATGTTAA 
ATAACCTAGTGAGAATGAAATACAATGTTTTGTTTGTTTCTTCTTTACTGACTGCAGTGGATTGACTGCTTGGCATAACTTGTTTATTACTTCTT 
AAAAATTCTTCAATAATCTCTTTATAACTTAGGCGTATAAACTCATAATTCCGGCTAGTTTCGAGAACGGATTACCAAGGACAAAAGTAAATTAC 
ATATCTTCTACTAAAACTAGAATCTAAGGAAGAGAACAGACCAATCCGTCTACGACCAGGTCTTGGGAACACTGAACGGCCCGAATCGTTCGTGA 
CTATCAACGCCCCGCACAGCGAAGTAATACCTCTGGTTCTCCTGGAACTGATTTAAAGTCACTGCCATTGGCAGCAACATAGCGCTGACATCGCC 
CACATGCCGCCAGCTATCCGTGCTGGGCTCGTTAATCGTCTCCTGGTATGCATAGATCTGGTACATTACGCACTCTGCGAACCGCGGGCTCGAAT 

C C T CC AG AGT C C AGGAT AT AAC G AT AC C AGT AT C AAGG AG ACT AAT C C GG AT AAC GGGT CG C G AAGG AGGC AC C T T C C AG GAT GG AT T GAAGGGC 

TGGGGAGGAGACGTGGGCAGTGGTGCAGGATGTGAGTATCTTAGACGAGACGCTGGAGGCGGGGTGTTTTCCGAACGTATCTGCATAGTTACGTT 

TGAGCCCAAGGGACTTTGCTGACCTTTGAATACGTTTGAAAATAAGTACGCCGAATATTTTGAATATAATGATTTTCCAACTCACCTCCTCCATT 

ATTATGGACTATCTGTCGAGCATTTACGCGAGGCAATTGCATTGGAGAAGCTCGCACCACATTTCCGCCTCCTCGTCCTCCTCTCGTCGCTGCTG 

CAGTCTGAGCATTCCGCTTAAGCATCTCCATACGCAAACGAGCGTTTTGTGCTTGGGCAGCCCTAGCCGCCGCCGCAGCCGCTGCATCATCCTCA 

TCTGTGAGGTCAATGACTGCCTTCTCCTTGGGCTTGGCCGGCGCCAAGCTGGACTCCTGCTGCTGTCCATAGCTGCCCAATTCCGATCCACTGCT 

GCTAGATGAGATGGGCACCGAGACCGTGGCCCCCGAAAGCGGTGGCTGCTTGGCGCGCACTTTAGTTGTGCACTTCGGCACTTGCTTGGCGGGAC 

TCGAACCTGGCGTGGGACCGGGACTAGGTCGAGGTCTGCAATATGAAACAATTGTAAGTTAGTAATGTGAATATGACATGTTTCTTTTGACGAAG 

AATTTGAGATCTCGGCGATTATACGGACGGACAAATGGACGGACGGACATAAACATATTGATTCGACAAGTATATTTTTATATTGGTAACGCTTT 

TAT TTACCCATTTGGGTTTATTACC C C AAC T AT AAAG AT T T T C AAT AGC AAT AAAAT AT AC T T G AAT GT T AAGAAAAC AAAAT TTTTTCTGG CAT 

ATCAATACAAATTGGGAAATAGAATAATAAAATTTAAAAAAAAATCTATGTATGCTTTTATAGCCACTGAGTTCGAGGCGTTCATATGGACACAC 

GGACGGACATGGCCTGATTGATCCAGATAAAAAAAATATATACGGAAACTCTGCCTTCTACCAGTTATATACTCTGTAACGAATCTAGTATACCG 

TATTACTCTACAAGTAAAAATATGAATCTATTTAGCTAATTGGGTCTGAAACGATTCCCAATAAATTGCTAATTGTAAACTTCTGCACAAGCGTG 

GAATTCGCTTTTAATCGAAGATCACGATTAGGATTATACTCACAGTAGGAACTGTTGCTGCGCCTGTTGTTGCTTTTGTTGTTGTTGCAGGTGTT 

GTTGCTGTTGTTGCTGCTGTTGCTGGTGCTGTTTCTGCTGTTGTTTTTGCGCCAAGAAAGCGGCTTCTGAGTTCCGGGCACGCATGACGGTTTTA 

TTAATCGCATTGGCCACCGATGAAGGCGTGGAGCCAGGCGTTCCGGTGTGCTTGGAGGCGTGCATTGTCCTTTGAGCTATAGGCGGTGAGGACTG 

CAGACGTTGAACGCTGGACTGATTCATAAGCATTGGCGCGGGCTGTAAGTTAGTAGGTACCGGCCGATGGGGCGTGACCTTCTGCAAGCAGCCCC 

GGCGCACAGGTGGCGTGACACTAGCCGTTGTTAAGGATTGAGATGACCGTGCTGCTGGCTGCTGTTGCTGGTGATTGGAATTCGGAGTGACACTG 

ACGCCCGCACCCGTTTCCGATGAGGATGTGGATGGTCCAAAGGTAGGCGGAGGAGGTCTTGGTCGCACGGTCGCACGCGTCGGGGAGGTGCTGGC 

CTTGGGTGGTGTTCCGCTGGGCGGCGCCAACACGCTGCCAGCAGCATGAGTTGACTCTGGTGCGGCGACGGTGGGCTTTATAGCCTTGAACGGAA 

TGCCCACCTGAAGACCAACAGCTCGTGTTATACGCACAGGAGCGGTGAAATGAGAGTTCTTAGCCTCCAAGTCCTTAAGCACACGCTTGTGCACC 

GTCTCCAAATCCAGGAATTGCTTCGACACCTCGGCAATCTTGCGGCGATAATTGGCGAGAGTGCTCTCGCATTTGTCCAGCTGCATACGTATTTC 

TGCGAAATCACTTTTAACCATCATCGCCTCGACGACTTTCTGCAGCACCAGCTCCTCCAGGTCGTCACGGGTCATTGTGTCGAAACTACGTCGGA 

AGGTCTTAAAGAATTCCAAGCGCAGGGGCTTAACATTCTGTTTAATTTCTGGAACGGGGTTAAGGTCGAAGTCCAGTTTCTTTTGGGTATCTTTT 

TTGGGGGATTCTTCTTTGTCTTCCTGACCACCCATATCAATTTGTAGCGTGCCTTCGTGGGAGCTATTTGAGTCAGAGTCTTCAGTCTTAGCTTT 

CTTATTAGGTATTTCCTCCTGAGGGCTGCTGTCCAGATGTTCATGGCTCCGCTTGGTTACATCCTTCAAACCACCCTCTGCTCTGCTGAGTATTG 

GTAATGATTCTTGATTCGTCTCCGCTTCGAATTCATTTTTTTCATCGGTACTAAGACGCAGCCGTTTTGGAACCGACTCTTCCGCGTACTCGGCG 

GGTCGCAATAGATTGCTGGAATTACTGTTCCCATCCTCTGTGGTAGTTTGCTTCACCATATCATGGGTCTTCAGTTTCTCGAACTGATCGCATGC 

ATTGTCGGAATTGTCTATTTGGAGATTCTTTGCTTCAACTGATTCTGTAGTTGACTCATTGGGAAGTCCTTTTAAAGCTGTTTCCTTTTGGATAG 

TTTGAACTTCCTTTTCCGATATCCCATTCTGCGGTGTTTCATCCTCATCCTCGGAGACGAGAACAACTTCGTCATCCTTAGCTTGTGGTTTCACA 

GAATTTTCGTTCGTTGGATTGGCTGTGGTGCCCACATTTTCGGTCTTATCTAGGGGTTCAAAGAATATAACTTCGTCATCGGACTCCGTCTCTTT 

GGTCTCCTCTTCAACAACCGTTGATTCATTGCTTTCCAAATCCTTTAAATTTCCATCAGTAGAAATTTCGCATGCTCCATTTGTCGTGGGTTCCT 

TATCGCCTTTTGGTATCACCCCGAGTTCGTCCTCTGCAGCCTTCTTTTCTTCTATCTCTACAACATCCTTGACCGCTTCATCAAGCTTATCATCT 

ATGGTTGTGTAGTTCATTTCGTGGTCTGCAGTTCGAACATCAAATTCTTTTTGGACTTTCTTTTGGGGATCTTCTGTTTCTTCACCTTCGATCAC 

CACAGGTTCCTCAAGCTCTTCAGTGTCTTCTATTTCCTTTTTATTTTCTGGCTCTTCTGTTTCCTTAACGTCTTCCACTTCCTGGGTTGGGCTTG 

GATCTTCCTCTTCCTTGGTGTCTCCCAGTATCTGTGCTACTTTCGAAGCTTTGATGATTTTGATCTCCGACTTTTCTTTTTCAGTCGCCTCTTCT 

GAATCTTTTGTTTCCTTGCAGTAATCAGGCTCTTCTTTTTCCTTGTGCTCTTCAAGTTCCTCGGATTCCTTGGATTCTTCTGGTTCATCCACTTG 

TATTGACTCTTCGCTCTGGATTGGGATAGACTCTTTCGACTTTTCGGTTACTTCGTCTCCAACTAACATGGGTTCTTTACCTTCTGTCAAATGTT 

GATCAGCTTTGACAGACTCATATTCGGCCGCACTCGCTGACTTTTCTGAGATATTATTTTGCAAATTCAATGATTTCTTCGTAGAAACCTTACCG 

TGAGTAAGGACTTCGGTATCTTCTGTGTTAGCTTCAGTTTCCTCGACCTCTGATTCCGCTTCGCAAGGCACACATTCACTCGACGAGTCTAAATC 

TATAACCTCAGTGTTGGGTTTTTCCGTTTCCTTAATTATATCATCGTCATCAGATGAAATGTCTTCTAAATCATTGACAGATGAGTTGTTTTCCG 

TGTTTAGTTTTTTGCTTGGCTTCGCCTTCTTGGAGTCCTGAGAGGCAAAGGCATCAAACTCGTCGGCACTCGATATACAGTCTAAAGCATACATA 

AACACATCATCTGTTGATGTATTTTTCCGGTCTTTTTCAATGTTATCCGCAGTATTGGCTGGGACTTCATCCTCTTCCTTTGACTTTAATATTTC 

ATCTGAATCAGCTAAATCTTCGGTTTTCTGTGTGGATTCTTTAGCTGAAGCGTGTTCTTCAGTTTCATCTAATTTCTCCTCAGTCTTTAATTCAG 
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AATCAGGATCGATGCACTCTTCTCCTTCTAGAGAACTAAGCACCTCTTCGCCTTCTTCTTCTTCTTCTGGTTTTTCTCTTTGCTCTTTTAGATTT 
TCGTCCGTATCAGCAGCTGTTCTCTTCTTCGCCGACGCAGAGTCGCATTCCTCGTCTCTATCGAGTGAATCGATATCGTCGGAATTCCTGGGACT 
ACAATCCACAATAGAGCTAATCTTGTCCAGAAGAGCGTCCAAATCACTACCTGGATTACGCTCTCGGTCCGATATCTTTTTTGGTTTGTCAATAA 
CTTTGACGTTATCGGAGCTATCGACGCCATTGGTTAGCTCACAAAGATCCACTTCATCCACAGAAGTTTGACTGGATTCGTTGAGAATGGCTGTA 
ATTCAGATATCGATATATCCCAAACGGATTTATCCAAATAACTAAGTCGCACTTACCGGGGGTGGAGAGAGGTTCCACTTCATTGCCGAGCTCTG 
TGGATAGGGTTCGCATCAAAGCTTCCTCTGTAGAGAGTTCCTTCAGTTCCATGGTCTGGTTTACTCCCATCATTTTGAATGTTTTTATCGATTTA 
CTTAATCGTCATTGAATTGCTAAAAGTAAAACCATATTATAGTTCACGACGTATGTTGACAACTCTGGGATATAACAAACATGTCACAACCCTGC 
TACCGCCATATTTTTTTTAAAAATTAAATTCAATCTCATTCAAAAATTTCTTGTAAATAGCTTTTAAAAGAAAGCTCGCTAAAAGCTTATTCCCA 
GTTCTCGCCAGTCAAAATGCTAATGCAATAACTTGGGAATTATACCTTTAAGTGAGTCTGAAGCCAGCGATATTTTATTAAGACTCCGCCAAAAC 
TCATAAACCGCTTTCCCGAATTCATTTTGAAGTCCTTTCTAACTCTAAAGTATACGAAGCCATTAAAGCAATGTTATCTTTGAGTATGGATGTTC 
CATAGTCAAAGGATGATAGGAACATCATAACTCAAGACCAACATTTTTGTTTTTTTTTTTTACTTTTTTTTGTGTTCGGTTTGGTGTGTATCTAC 
TCCGTTGTTATTGTTCTTGTTGTTGGTAGTATTAAGCGTTCGTCGCTTTCCGAACGCAACAATAACTACACGATTGTAGCGAAAAGACAAAGCAA 
AGCGACGATAAATATTCGTGCAAGGGAAAAAAGTGCGTGACGAAAAATGCGAATATGGGGAGAAAGAGAAAATTAACTGCTACCTATTAATTTAT 
GCTCATAACTTTATAGTTATAGTTTTTACTATCGAAAGTGGGGTTATGTTGAGTTTGTTGCGTTTTTACGTGATAGCGTAACTGCACATTGTTAT 
ATT GAGT TT AGT AAC T T AC AT GC T T AAAC T T C C T G AT ACC C T GT T AAC GAT T T AT T T T GAT T T GGC T AC ACT C G AAG AAAT C AAAAT G AC T T C AA 
CATCTTGCCGCGTTTAGTGCAAAAATGAACGAGGTGAACGCCAAACCAGCTCTCGACGAACGAAAATGAGAGCGAGAGAGCAAACAGCTGAGAGC 
GGTAAAAGCGGCAGGGGCGTACGGGTAAGAGGAAGATAGTCGGCGGGAGGGCGTGTCTCGGGATGATATGATAAGATTCGATAATGTAATTCCAC 
CTGGAGTTTAAGGAAACTACATTCACATCTTGAACTTTAACCTTTCGCGAAAAATAGGGCAAAGCTACAGTCCTCATTGTTATTGTGCTCTATGG 
CAAGTTGGCAGTCTCTTTTGGTATGGCAGTTTCTCCATAAAATTATAAAAATAAAAGCAATAGGGGATGGTATAAAAATCGAAATGAAGTGAGCA 
GGGATCCTCATTGAATCAAAAAACATTGTTTGATAACAACGAGTAAAAAGCAAAGCGTTTAATAATAACGCAAATAGAACGCGTTTTGTTGGTCG 
CGAGCGGCAGGGTGAGATGAGGAAACAGGCAGCTAGAGCGGGACGACTATACGTTTACAAATTAATAGGGGAGCAATATAAGTAAAGAACATTAT 
GGAATATCCATTTCTTAGTGCAACTACACACAGCTCCAACCAGTGTTAATAAATAGATGATTTGTAGATGGAGGCCGAACGCAAAGATTATGTGG 
AATCAGAGAAGGCTGAAGCAAAAATGTAACAGAATGTAACCCTGTCAAAAAAAGAAACTTTCCAAACTTTCCAAACTTTCAAAACTTAGAACCTA 
TAGTCCGTCCGTCCTTCTGGTCTTTTCGAAACGAACTTAGGACCTTGCCAGGATACCGGTTTTGTAGAAAGTTGCGGAGTGAGCGGGACGGACAG 
AGAGAGCGGCCGGAGAGTAATTGCAAGAGAGAGCAAGAGGCCAAATTTATATATGCACGTGGCTATATATATTTGCCTGTATGTAGGAATTGAAT 
AAAGTTCGCGTAGCGCCAAACACAGCACGACAAAGATACAGCAGCAAGCCGAAAGCAACAACTACAACACCGGGCAAACAAAAATAGAGGCGTAA 
AGTC G T T G T T T T T GT AG AT GAAAT GC T AAAAC C T T C GAT AT C AGC T CT AT T AAAGAAAT T GAT C C T AG AAGC T AGT G AC ATT G GAT CAT T AAAT T 
TCGACTTGCTCTGATGGCAATACTTTTTTCAAGATGCTTAGGTCCATTAAAACCGTTGCTTATACACACAAATGTAGATAACAAACATGTTTAGG 
TGTCGTATTTTTGCAACTTAAAGTATATATGGCCAATTTATCGTCTTTAAAGTTCGGTCTTGTATTTAAGCCAATTTTGGGCTATATGTCTATTA 
TAGCTGTCCCGCTCTAACACGCGCACACACACCAGTTAAGTTCGTTGTAGCAACGGCACATCGGTGTGAGCGTGTATGTATATCAGAACGTGTAT 
TTGTGCGATTTTGCAACGCATTTACATTCTTTTTAGGTTACGTATCGATTTTGATGCAGAGTGAGGTTAGATCGAAAAACATCCAAGTTTTTGCA 
TGCTTTACGAAAGCAAGGACATTTTTGAAATCCCCTCGGCGCCCTCTGCATGTTTCTGGTATCAGACGTTCGATAAAGACTAAACCCCGAAAAAC 
CTAAAACGAGCAGGTTGCAAGCAGAAGCAAAAGCAAAACGGGCGGCGTTTTTATTCACCTATTGTACAATTTTATGTTTTTAGGGTTGCCAGGTA 
TGCATTATGTTTATCATGAATATAATATAAAACCTCGATGTGTCCGATTAGACGCAAATCAAATGTCTTTAAATGTATAATTGAGACCATGATGT 
CGCGCAGGGTTGCCACCGGCAGGTCATTTTTTGTCCTATTGCAATGGCATTGCGGTATATAGTCTTGCCTACACGTCGGTAGTCTAGTCTAAAAT 
ATAGCATAGTAACAAATCCAGATATATATCAGATTTCGCCTGATTTGCAAATCGCGGTCACTTGACAACCCTGAAAACACAAATGTCTTTAAAGA 
GACTCGTACCATATCTGGCGTATGGTACCCGTACCCAGTCCAAATAAAAAAAAACTTCAAAACCCAGTAAAAAAAATGCAAATCCCAAATGTACC 
ATTCGATTGGACCATTATATTACAGGGTATTTTTATAAAGAAAATTATAAATATAACATAAAATGGATAAAAAAAACTACACTTACCTTTCTACG 
ACACGACAAACTCGTTTATTAACTTATATATTACCGAAAACTATTGAAATATGACTTGTGGTCTAACGGCAACTATCAAATGTCAATGATTTGAA 
TGAATTT AAC AGC TTCTCCGTGATTTGTTTACAATTCGCAGGCAACTCCACTTCCGATGCCCTTAAACGACGAAGACTGAAACACAGACT AAAAT 
CGCACAACAACAGAAGGTGCCGAAAAGAACCAGTCTATGCACAACTGGTTCCGAAGAGCAGCGCCCAGATTGCAAGCTAGCGGCACTTGGAATAG 
AGATTGGCGCGTGATTCCGGATGCCGATTTCTGATACGCATCGAGAAGACATTTTTTATGGACGTTTCTAACAATACTTATTATTTTTGTATTTA 
TTTAGGTTGGATTTTGGCAAAACTTATTTTATCTTATATTCAACTTTGCATTTTTACGCAATTTGTATTATTATTATTATTATTATTATTAACAT 
GATGACCAGATATAACTTAAACTGGTTACCTTCTCATTACGATAGTAATATTATAAAAGGATGTATATTTAATAGCTGAGGAGAAGCAGCATTTC 
TATAGTATTTATATACAAAAATAAAAAACGTTATGCAGTTCATGCACGTGTACATCCCTAGTTTGGAACCAAACAAAATCATATGTTGTTACTAG 
GGCTGTATAGCTAAACGTTTTATTTATGGGAGCAGAACTATTAAGTGCCAGTTGGAACTAAGACGTGATCTCCCCTGTCCCTTCTTTTGCTTAAA 
ACTGAATCATATTGACAAATGTTTTCGGCCCGTTTAAGTAATGTGACGTTCTCATGAAATACAACGGTGAACATTGCAAAAATCGTGTTATTTTC 
GTTAATTGTTCCAATGGGTGAACTAGTGAACGGAGTCAGCATGTGCCAGCCCTGATGGCTGCTTTGTTGAACTAGTTCCAGTTCCGGAGCTAGTG 
AACCAATAGCAAACGGAACAAAACAAAAACATCTGTAGATTATAGGTTAATTTGTTTTATAATAACAAATGCGGGAATTATTCAATTAAGATGAG 
CAATCTGCTGAATTTTGCCCTGCGGCAGGGCGTCAACTCCATCAAGCGCCTTAGCCTCCAGAGTTTCAGACTGGCAGCTCCGGCCATTGTAACAA 
AAACGCCGGTTACTTCCACGATAAAGAGATTCTCCTC 
(SEQ ID NO: 868) 

Exon: 9487.. 9302 
Exon: 6289.. 6137 
Exon: 6075.. 2799 
Exon: 2220.. 1796 
Exon: 1733.. 1001 

Start ATG: 6248 (Reverse strand: CAT) 
Transcript No. : CT23297 

TTTCAGTCTTCGTCGTTTAAGGGCATCGGAAGTGGAGTTGCCTGCGAATTGTAAACAAATCACGGAGAAGCTGTTAAATTCATTCAAATCATTGA 
CATTTGATAGTTGCCGTTAGACCACAAGTCATATTTCAATAGTTTTCGGTAATATATAAGTTAATAAACGAGTTTGTCGTGTCGTAGAAAGCAAT 
TCAATGACGATTAAGTAAATCGATAAAAACATTCAAAATGATGGGAGTAAACCAGACCATGGAACTGAAGGAACTCTCTACAGAGGAAGCTTTGA 
TGCGAACCCTATCCACAGAGCTCGGCAATGAAGTGGAACCTCTCTCCACCCCCGCCATTCTCAACGAATCCAGTCAAACTTCTGTGGATGAAGTG 
GATCTTTGTGAGCTAACCAATGGCGTCGATAGCTCCGATAACGTCAAAGTTATTGACAAACCAAAAAAGATATCGGACCGAGAGCGTAATCCAGG 
TAGTGATTTGGACGCTCTTCTGGACAAGATTAGCTCTATTGTGGATTGTAGTCCCAGGAATTCCGACGATATCGATTCACTCGATAGAGACGAGG 
AATGCGACTCTGCGTCGGCGAAGAAGAGAACAGCTGCTGATACGGACGAAAATCTAAAAGAGCAAAGAGAAAAACCAGAAGAAGAAGAAGAAGGC 
GAAGAGGTGCTTAGTTCTCTAGAAGGAGAAGAGTGCATCGATCCTGATTCTGAATTAAAGACTGAGGAGAAATTAGATGAAACTGAAGAACACGC 
T T C AG C T AAAG AAT C C AC AC AG AAAAC C G AAG AT TT AGC T G AT T C AG AT GAAAT AT T AAAG T C AAAGG AAG AGGAT G AAGT C C C AGC C AAT AC T G 
CGGATAACATTGAAAAAGACCGGAAAAATACATCAACAGATGATGTGTTTATGTATGCTTTAGACTGTATATCGAGTGCCGACGAGTTTGATGCC 
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TTTGCCTCTCAGGACTCCAAGAAGGCGAAGCCAAGCAAAAAACTAAACACGGAAAACAACTCATCTGTCAATGATTTAGAAGACATTTCATCTGA 
TGACGATGATATAATTAAGGAAACGGAAAAACCCAACACTGAGGTTATAGATTTAGACTCGTCGAGTGAATGTGTGCCTTGCGAAGCGGAATCAG 
AGGTCGAGGAAACTGAAGCTAACACAGAAGATACCGAAGTCCTTACTCACGGTAAGGTTTCTACGAAGAAATCATTGAATTTGCAAAATAATATC 
TCAGAAAAGTCAGCGAGTGCGGCCGAATATGAGTCTGTCAAAGCTGATCAACATTTGACAGAAGGTAAAGAACCCATGTTAGTTGGAGACGAAGT 
AACCGAAAAGTCGAAAGAGTCTATCCCAATCCAGAGCGAAGAGTCAATACAAGTGGATGAACCAGAAGAATCCAAGGAATCCGAGGAACTTGAAG 
AGCACAAGGAAAAAGAAGAGCCTG ATT ACTGCAAGGAAACAAAAGATTCAGAAGAGGCGACTGAAAAAGAAAAGTCGGAGATCAAAAT CATC AAA 
GCTTCGAAAGTAGCACAGATACTGGGAGACACCAAGGAAGAGGAAGATCCAAGCCCAACCCAGGAAGTGGAAGACGTTAAGGAAACAGAAGAGCC 

AGAAAAT AAAAAGG AAAT AG AA G AC AC T G AAG AGC T T G AGG AAC C T GT GGT G AT C G AAGGT G AAG AAAC A GAAGAT C C C C AAAAG AAAGT C C AAA 
AAGAATTTGATGTTCGAACTGCAGACCACGAAATGAACTACACAACCATAGATGATAAGCTTGATGAAGCGGTCAAGGATGTTGTAGAGATAGAA 
GAAAAGAAGGCTGCAGAGGACGAACTCGGGGTGATACCAAAAGGCGATAAGGAACCCACGACAAATGGAGCATGCGAAATTTC TACT GAT GGAAA 
T T T AAAGG AT T T GG AAAGC AAT G AAT C AAC GGT T GT T G AAG AG G AGAC C AAAG AG AC G G AGT C C GAT G AC G AAGT T AT AT T CT T T G AACC C C T AG 
ATAAGACCGAAAATGTGGGCACCACAGCCAATCCAACGAACGAAAATTCTGTGAAACCACAAGCTAAGGATGACGAAGTTGTTCTCGTCTCCGAG 
GATGAGGATGAAACACCGCAGAATGGGATATCGGAAAAGGAAGTTCAAACTATCCAAAAGGAAACAGCTTTAAAAGGACTTCCCAATGAGTCAAC 
TACAGAATCAGTTGAAGCAAAGAATCTCCAAATAGACAATTCCGACAATGCATGCGATCAGTTCGAGAAACTGAAGACCCATGATATGGTGAAGC 
AAACTACCACAGAGGATGGGAACAGTAATTCCAGCAATCTATTGCGACCCGCCGAGTACGCGGAAGAGTCGGTTCCAAAACGGCTGCGTCTTAGT 
ACCGATGAAAAAAATGAATTCGAAGCGGAGACGAATCAAGAATCATTACCAATACTCAGCAGAGCAGAGGGTGGTTTGAAGGATGTAACCAAGCG 
GAGCCATGAACATCTGGACAGCAGCCCTCAGGAGGAAATACCTAATAAGAAAGCTAAGACTGAAGACTCTGACTCAAATAGCTCCCACGAAGGCA 
CGCTACAAATTGATATGGGTGGTCAGGAAGACAAAGAAGAATCCCCCAAAAAAGATACCCAAAAGAAACTGGACTTCGACCTTAACCCCGTTCCA 
GAAATTAAACAGAATGTTAAGCCCCTGCGCTTGGAATTCTTTAAGACCTTCCGACGTAGTTTCGACACAATGACCCGTGACGACCTGGAGGAGCT 
GGTGCTGCAGAAAGTCGTCGAGGCGATGATGGTTAAAAGTGATTTCGCAGAAATACGTATGCAGCTGGACAAATGCGAGAGCACTCTCGCCAATT 
ATCGCCGCAAGATTGCCGAGGTGTCGAAGCAATTCCTGGATTTGGAGACGGTGCACAAGCGTGTGCTTAAGGACTTGGAGGCTAAGAACTCTCAT 
TTCACCGCTCCTGTGCGTATAACACGAGCTGTTGGTCTTCAGGTGGGCATTCCGTTCAAGGCTATAAAGCCCACCGTCGCCGCACCAGAGTCAAC 
TCATGCTGCT GGC AGC GT GT T GGC GC C GC C C AGC GG AAC AC C AC C C AAG GC C AGC AC C T C CC CG AC GC GT G C G AC CG TGC G AC C AAG AC C TCC TC 
CGCCTACCTTTGGACCAT C C AC AT CCT C AT C GG AAAC GGG T GC GGG C GT C AGTGT C AC T C C G AAT T CC AAT C AC C AGC AAC AGC AGC C AGC AGC A 
CGGTCATCTCAATCCTTAACAACGGCTAGTGTCACGCCACCTGTGCGCCGGGGCTGCTTGCAGAAGGTCACGCCCCATCGGCCGGTACCTACTAA 
CTTACAGCCCGCGCCAATGCTTATGAATCAGTCCAGCGTTCAACGTCTGCAGTCCTCACCGCCTATAGCTCAAAGGACAATGCACGCCTCCAAGC 
ACACCGGAACGCCTGGCTCCACGCCTTCATCGGTGGCCAATGCGATTAATAAAACCGTCATGCGTGCCCGGAACTCAGAAGCCGCTTTCTTGGCG 
C AAAAAC AAC AG C AGAAAC AGC AC C AGC AAC AGC AGC AAC AAC AGC AAC AAC AC C TGCAAC AAC AACAAAAGCAAC AAC AGG C GC AGC AAC AGT T 
CCTACTACCTCGACCTAGTCCCGGTCCCACGCCAGGTTCGAGTCCCGCCAAGCAAGTGCCGAAGTGCACAACTAAAGTGCGCGCCAAGCAGCCAC 
CGCTTTCGGGGGCCACGGTCTCGGTGCCCATCTCATCTAGCAGCAGTGGATCGGAATTGGGCAGCTATGGACAGCAGCAGGAGTCCAGCTTGGCG 
CCGGCCAAGCCCAAGGAGAAGGCAGTCATTGACCTCACAGATGAGGATGATGCAGCGGCTGCGGCGGCGGCTAGGGCTGCCCAAGCACAAAACGC 
TCGTTTGCGTATGGAGATGCTTAAGCGGAATGCTCAGACTGCAGCAGCGACGAGAGGAGGACGAGGAGGCGGAAATGTGGTGCGAGCTTCTCCAA 
TGCAATTGCCTCGCGTAAATGCTCGACAGATAGTCCATAATAATGGAGGAGGTCAGCAAAGTCCCTTGGGCTCAAACGTAACTATGCAGATACGT 
TCGGAAAACACCCCGCCTCCAGCGTCTCGTCTAAGATACTCACATCCTGCACCACTGCCCACGTCTCCTCCCCAGCCCTTCAATCCATCCTGGAA 
GGTGCCTCCTTCGCGACCCGTTATCCGGATTAGTCTCCTTGATACTGGTATCGTTATATCCTGGACTCTGGAGGATTCGAGCCCGCGGTTCGCAG 
AGTGCGTAATGTACCAGATCTATGCATACCAGGAGACGATTAACGAGCCCAGCACGGATAGCTGGCGGCATGTGGGCGATGTCAGCGCTATGTTG 
CTGCCAATGGCAGTGACTTTAAATCAGTTCCAGGAGAACCAGAGGTATTACTTCGCTGTGCGGGGCGTTGATAGTCACGAACGATTCGGGCCGTT 
C AGT GT TCC C AAG AC C T GGT C GT AGAC GG AT T G GT CTGTTCTCTTCC T TAG AT T C T AG TT T TAG TAG AAG AT AT GT AAT T T AC TT T TGTCCTTGG 
TAATCCGTTCTCGAAACTAGCCGGAATTATGAGTTTATACGCCTAAGTTATAAAGAGATTATTGAAGAATTTTTAAGAAGTAATAAACAAGTTAT 
GCCAAGCAGTCAATCCACTGCAGTCAGTAAAGAAGAAACAAACAAAACATTGTATTTCATTCTCACTAGGTTATTTAACATAAATAAGTTCAAAA 

CTTAC ATT AAAT AAT AATTTGAAT 
(SEQ ID NO: 869) 

Start ATG: 228 {Reverse strand: CAT) 

MMGVMQTMELKELSTEEALMRTLSTELGNEVEPLSTPAILNESSQTSVDEVDLCELTNGVDSSDNVKVIDKPKKISDRERNPGSDLDALLDKISS 
IVDCSPRMSDDIDSLDRDEECDSASAKKRTAADTDENLKEQREKPEEEEEGEEVLSSLEGEECIDPDSELKTEEKLDETEEHASAKESTQKTEDL 
ADSDEILKSKEEDEVPANTADNIEKDRKNTSTDDVFMYALDCISSADEFDAFASQDSKKAKPSKKLNTENNSSVNDLEDISSDDDDIIKETEKPN 
TEVIDLDSSSECVPCEAESEVEETEANTEDTEVLTHGKVSTKKSLNLQNWISEKSASAAEYESVKADQHLTEGKEPMLVGDEVTEKSKESIPIQS 
EESIQVDEPEESKESEELEEHKEKEEPDYCKETKDSEEATEKEKSEIKIIKASKVAQILGDTKEEEDPSPTQEVEDVKETEEPENKKEIEDTEEL 
EEPVVIEGEETEDPQKKVQKEFDVRTADHEMNYTTIDDKLDEAVKDVVEIEEKKAAEDELGVIPKGDKEPTTNGACEISTDGNLKDLESNESTVV 
EEETKETESDDEVIFFEPLDKTENVGTTANPTNENSVKPQAKDDEVVLVSEDEDETPQNGISEKEVQTIQKETALKGLPNESTTESVEAKNLQID 
NSDNACDQFEKLKTHDMVKQTTTEDGNSNSSNLLRPAEYAEESVPKRLRLSTDEKNEFEAETNQESLPILSRAEGGLKDVTKRSHEHLDSSPQEE 
IPNKKAKTEDSDSHSSHEGTLQIDMGGQEDKEESPKKDTQKKLDFDLNPVPEIKQNVKPLRLEFFKTFRRSFDTMTRDDLEELVLQKVVEAMMVK 
SDFAEIRMQLDKCESTLANYRRKIAEVSKQFLDLETVHKRVLKDLEAKNSHFTAPVRITRAVGLQVGIPFKAIKPTVAAPESTHAAGSVLAPPSG 
TPPKASTSPTRATVRPRPPPPTFGPSTSSSETGAGVSVTPNSNHQQQQPAARSSQSLTTASVTPPVRRGCLQKVTPHRPVPTNLQPAPMLMHQSS 
VQRLQSSPPIAQRTMHASKHTGTPGSTPSSVANAINKTVMRARNSEAAFLAQKQQQKQHQQQQQQQQQHLQQQQKQQQAQQQFLLPRPSPGPTPG 
SSPAKQVPKCTTKVPAKQPPLSGATVSVPISSSSSGSELGSYGQQQESSLAPAKPKEKAVIDLTDEDDAAAAAAARAAQAQNARLPJyiEMLKRNAQ 
TAAATRGGRGGGNVVRASPMQLPRVNARQIVHNKGGGQQSPLGSNVTMQIRSENTPPPASRLRYSHPAPLPTSPPQPFNPSWKVPPSRPVIRISL 
LDTGIVISWTLEDSSPRFAECVMYQIYAYQETINEPSTDSWRHVGDVSAMLLPMAVTLNQFQENQRYYFAVRGVDSHERFGPFSVPKTWS* 

(SEQ ID NO: 870) 

Classification: known_f lybase_gene 
Gene Symbol: BcDNA: LD26050 
FlyBase ID: FBgn0027499 



Celera Sequence No. : 142000013384139 

TTAAGTTACGCAGAGCAAAAAGCCGGGTTACTCATGTCACATCAATTTCCAAGCCAACAACGCCCCCTAAATAGAAAGCAACAACTATTATTAAT 
AGACTTAAATTAGGCGATATACATATACATACTTACATAGAACAAACCTACATTGAGACATACTTATACATACATGCGAAGTGACTAAATTATTG 
GCTATTGTAATTTGTATTTGTAATATGACTATTCAGAATTAAGATATTATTAAGTGGCCGATAAATTTAAGTAGCAGGCGCCGTTCAGCAGCAGT 
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GTCATTTATAGAATTTTAAATTAACTACGAAGACATCGAGAAATCATCATTAGGCACTTGCTAAAATTGTACTAACAATACCGCAGACATATTGG 

TGAACGATATCAAAATGTTGCCCAATTTACTTGTTTACTTTTGTCAGCGGTGGGGATTTGGTTTTTTTGAATTCCCACCAAACATCGGAGCTCGA 

TTATTGAGATGATTTCCTGGTTGATTTCGTTTACACAATTATACAGAGTTCATAAGCCGCATGTCTATCAAAACACCAAAATGATCTGTTATCAA 

TAAATATATGTACTTGACATTCTGATATTTACACATGCATAAATACTATTATTAAACATTTTTAAAGAAAAATATATTTTGATCGAAAACATTCG 

GTTGCGTTTGATTTACAAGCGAGCAGGTTGGAACGTTTTTAATATTTTGGGTATTTTTGGGGTCAAAAGTGGAATAACCAAATAATTAAGTACAC 

GATATTCTCATAAATTACGCAGAATATGAAAACACTCAGACCCAAAACTCCTCAACAAAAAACAACAAAATTATTGTACATATATCTTTGTATTA 

TATTTATATATTTTAGACCCTCCAGATTAGATGCATCAGTTGAAAATAAGTTAACTTTTCTTTGTAGTTTAAAACTTGATTACAATTAAGATAAA 

TATGTAGGGATTAGCTATGTACAATGGGAACAGCTGTGGGATTACCGAACTCAAACTAGACTCGAGACATTGTTGTAGAGGCCAACGGCAGACAG 

CTCGTCCAGATCGATTTCATTGGATGGAAGATTGGGATGAGGCCAGTTCCTTGGCCTTTGATTAACATGGGGCGGCACAATGGCCTGAAAGGTGG 

GGGACTGCAGAGGCCTATAGGCTACCGCATCCACGTAGTCCACTTGTCTTGGTGGAGCATTTGGATGTGGAATCCTTCCCACGTAAAGATCTTCA 

CTGCGATAGAGACTCGCATTTGGTACGTAGCCGGAGTGCAGCCTTCTCTGCGGAGTTTGCGGAATCGGTGGACGGTGGGGCAGTTCATGGAAGTA 

AGCATCTTTGGGTATGTGTCTCTCGGGCAGCCCAAACTGGGCGCTACTCCGGTGGGGAATACTATACGTGGGCGAAGGGTTCGAAGAGGTGCCCT 

CCAAGTGATGGAAGTTATTGTTCAAGCGGGAGTGTGATCTTGCGGCGATCACATGTGCCGTCTGGGAGTCCTGTCGATTGGAGTCAATGGTGTTT 

TGCAGTTGGGCCACCATATCCCTCACCTGACCAGCGTTCGGCTTCTTTAGGCACTTGATTTCACTGTTCCGAATCTCCACCTTACTCATACTTTC 

ATTGCCATTTTGACGCACTTCTGTTTCCTCGGCGTACTCCATGATGGGATTGGAGTAAAATAAATTCGACATGACGGGATAGTTGTCTTCCAAGC 

AAAGAGGCTGCCGTTTCATCAGATTGGAGATACCTGCCGCCTCGCTCGCTTGCTCCTTGTAGAGCAGCTTGTCGCCTCCCATTCCCTCGGCAGTT 

TCATTGACAGCCGTCTCCTCTTTGCCCAGATCCGCGAAGGAGGCATCTTTTCGAAGCGCCTTGATCACCGAGTTCAGCATGTTCTCTCCTCTCCC 

AATGCGCTGGTAGTACTTATTGTAATCCGGGTGGGGATTGCTGCCCGTGGGTGATGCCTCCCAGTCCCAGCTGTTGGAGTTCTGGGAGGGATTTG 

AGGTCTCACTGGAGTTCGGATAGCGTCGTATCACGTTGGCGTACTCGTCGAAATTGGGACTATTAACACTCTCTTTATCTTCTCGAAGTTGGGCA 

GACTCCGCCCTTGGATGGGGATTATAAGGTGGCGATAAGGAAGAAAGCTGGCGCTTGCTCTCCTGTTCGCTGTCCCTTTCCATGATCAGGGTGGT 

TTTCTCTAAAAGCACTCCCAGTCTTTCGGAGACTTTTAGGAAATCTAGCTTAAGTTCATCTAGCTTCGAGTGCTCCTCTGGTTCCGCCTCTTCTT 

GCACCTGGTCCTTGTCACCCACTTCTGCCTCTTCTTCATCCCGAGCTTCATTGGAATCGTCATCGCTGTCGCTATCCACCTGGACCTCCGCTGAT 

C GGT G T ACT AT AGC CT C C AC T GC CT GAT C T T C C AGG AC AT CT AC ACTC C GT G AGT T GGT GAG C CT AC C ACC T T C GT G GT T GT T T TC C AAG AGC T C 

TAGACTGCGGCACCTGGACATTGTTCTTTCGATCCTTGAGCTCTTGTTCTGCTGATCGAAGGTGGACCGCATAGGTTCACCTCGTCGCAGGAACT 

CACTGAGGTGCTGCCTCTTGCTCTCGCATCGCAATCTGTCCTCGATCATCGTTCTTGTGAGCTGTTCAATCTGATCGTTCGTCTTGATTGCCAAT 

GCCTGAACTTTGTGACCTAGACGTTCCTTGGTTTTGTGAACGTATTTCTTCAAACTCTTTCGATTGCAATTTAGTTTGTTTTCAATGTGACGGAA 

GAAGGGACCGCAATAGCATGTGTTGCCCACGCTGACCGGACCCTTGTGAATATGACCAGCCAAATCCAGGAAGTATTGCTCCCCCGACTTAAGTG 

GCTCCTTGGGCAGTGTCTCCAGTTTTGGTGAAGGAAAGGCACGCGGATCGGGAAAGTAGTGACAGCCATAGGGTGACCAGTTGATACCCTTGTCC 

ACCACATCCCTTTCACGATCTCTGTCCTTGTCCGATCTGCTGCCGCCCTCCTTGGGTTTCTCCTTGCGATTCCGTATCTTCTTCGGCGTGTTCAG 

GATCTCGATGATTTCACGGTAGTTCTTCCTGATCGCTATGTGCATGGGGCAGTCGCCTTGTGCGTTTTTAATGCCCAGCCGAGCATCTGCCTCTA 

GGAGAATCCTTGTCAGCTTCCTGCGACCCATGGCACAGGTGATGTGCAAAGCGGTGTCGCCGTTCAGATTTGTTTTATTCGGATCGCAAAGGGCG 

GAGAGCAGGATCCGGGAGACTCCTGCATGTCCATATCTGCAGGCGGTGTGCAGCGGAGTATCTCCATACTGAAATTGTTAAACATGTTAAACATG 

GTTATCACTGATATGATCCAACACTTACATTATTCCGGACATCGGGATCGGCGCCGGCCAGCAGTAGCTCCCTAGAGCTCTGATTGTGACCATTT 

TGAGCGGCCAAGTGCAGGGCGGACAACCCTTCGTTGTTGGCAATACCCAGGGCGCTATTGTGTAGGGCTTCGATGGTCTGGTGGGCCACCTTCAC 

TCTGCCCTTGTTCTTATCCTTGTCTTTGTCCTTCTGTCCCTTTGCTGCTTTTCCCACCCCCGAGGAAGCGGGTGGCGTGGGGGGCGTGCAGATCA 

ATTTCACGCAGCGACTGAACCCCCGACTGGCGGCCTCATGGAGCGGGGTATTCCCCTGCACGGATTCCCTGGCCAGTCGATCAGCATTGCTGAAC 

AGGAGACCCTCCACCCGGCGATCATCTCCGGCGGCTACCGCCTCGTGCAGCGAAGTCCAATTGCTGGAGTCGGGCTTAAATCAGAAAAAGAAGTT 

ACATCGGTGTAAGATTTGTTGGTTCAGGCAAGTGCCCTTACTTACCCCGGATATCGCATTCGGGAGAGCTTCCTGTTTCGGAATGCGGGAGGATC 

CATGCTGCATTGTCTGCGAAAATATTCATTATTTATCAAATTTGTTAAATTATAATTTCATTCATTTATCATGATGTGTTGCAATTATAACTTTT 

TAAGTAACACTTTATCAAAACCAGATAAAATTAAAAAAATTATTTATATTAGAACTACATTTTGTCTTTAATAAATGAAAAATCTTACTTAAAG 

AC CGT AAT T AC C GTT AG AAT CGT T TAT AT GT C G AAAAT C AT AT G AAAG AT GG T AT T C T T T G AT AT AAC CTT GT T G AC GAC GAAAG AGCT T AAC T T 

GACCTGGGATCAAATGAATCTGATCTTCGATCGCACATTTTCTAGCCGTGTATGCGCATGTATGTGTGCCTCACGTAATGGCGTACCACGTAGTA 

ATTGTCATTACAAATAGZ^AGTGGGAGTTACCAAAAGCGATCAAATAAAGGACTCGTAATTTGAAATACAACCTTTTTGAGGGTAGATTAAATGTC 

TTTCATTATAACACATTGATAAAACATGTATGGTTTCTTAAATGTTCAACACTATTACAGTAAATTTCTAGACCTGAAAACCAGGTTTAAAGTGC 

GCCTAGGTTTTCTCGATTCCCGACTTACTTTTCACTAATTTTAAATCCCGTCTTTGGACCGCAATTTACGCATTGCAATTCGCCACTCTCACGTA 

TTTACACATGTTTACTTTCGACTATTGTTCAGTACCCGACTTTGCAGTATTTGGCCATTGATGGATTGATGGAACGCCCAGAGTTAATTTGGCTT 

CAAGCCAGGCCTAAAAAGAGACTAAAAGAAAGCCAATCACAGGCAAAATTGGCAACAGGTTTCGAGCTGGGTTGCCGTACCTGCGACAGCTAACG 

GTGAAGCCGATAGTCTGCATTATTGCTCCAACAGAAGTACGGTCACTCTACAAGTAGGCAACGAATTTTGTTTGTCATCGGCATTTCGCATTCAA 

CGTTTCCAATTGTTTTTTAAGGAGCTTTAAGAATGGCTTTAGCTGAAATCTGCAAGATATCGAATGCTCCGTACATGCGGCCCAATGCCTGGTCA 

TCGGCGGATGTGGAGGAAGAGCAAAAAGGGCTTATGTGCAATCTGGCCAATCCCTACACACTGGCTGCTCCGCCATTCGAGAACGTATGTAGTTA 

ATTAGCTGATCTTCTTGGGAAATTTACTAAAAAATTTGACAATTTCCTCAGCCCCTGCACAACCTTAATCAGATCCAGGCCAATGGCGACAAGAC 

CGGCGTCAAAATCAACTTCGATCACGGCACCACAACGTTGGGCTTCAAGTTCAAGGGCGGCGTTCTCCTGGCAGTCGATTCCCGTGCCACGGGTG 

GATCGTACATTGGCTCCCAGTCGATGAAGAAGATCGTGGAGATCAATCAGTTCATGCTGGGCACCTTGGCCGGTGGCGCAGCCGATTGCGTTTAC 

TGGGACAGGGTCCTTTCGAAGGAATGCCGCCTTCACGAGCTTCGAAACAAGGAACGCATCTCGGTGGCCGCCGCCAGCAAGATAATGGCCAACAT 

TGCCCACGAATACAAGGGAATGGGTCTGAGCATGGGCATGATGCTGGCCGGTTACGATAAGCGTGGTCCAGGCCTCTACTATGTGGACTCCGAGG 

GATCTCGCACGCCTGGCAATTTGTTCTCTGTTGGTAGTGGATCGCTGTACGCCTACGGTGTCCTGGACTCTGGCTATCATTGGGACCTGGAGGAC 

AAGGAGGCCCAGGAGCTGGGACGTCGCGCCATCTACCATGCCACCTT 
{SEQ ID NO: 871) 

Exon: 4652.. 4494 
Exon: 3908.. 3846 
Exon: 3768.. 3354 
Exon: 3298 . . 1001 

Start ATG: 3898 (Reverse strand: CAT) 
Transcript No. : CT23371 

CCAAATTAACTCTGGGCGTTCCATCAATCCATCAATGGCCAAATACTGCAAAGTCGGGTACTGAACAATAGTCGAAAGTAAACATGTGTAAATAC 
GTGAGAGTGGCGAATTGCAATGCGTAAATTGCGGTCCAAAGACGGGATTTAAAATTAGTGAAAAACAATGCAGCATGGATCCTCCCGCATTCCGA 
AACAGGAAGCTCTCCCGAATGCGATATCCGGGCAATTGGACTTCGCTGCACGAGGCGGTAGCCGCCGGAGATGATCGCCGGGTGGAGGGTCTCCT 
GTTCAGCAATGCTGATCGACTGGCCAGGGAATCCGTGCAGGGGAATACCCCGCTCCATGAGGCCGCCAGTCGGGGGTTCAGTCGCTGCGTGAAAT 
TGATCTGCACGCCCCCCACGCCACCCGCTTCCTCGGGGGTGGGAAAAGCAGCAAAGGGACAGAAGGACAAAGACAAGGATAAGAACAAGGGCAGA 
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GTGAAGGTGGCCCACCAGACCATCGAAGCCCTACACAATAGCGCCCTGGGTATTGCCAACAACGAAGGGTTGTCCGCCCTGCACTTGGCCGCTCA 
AAATGGTCACAATCAGAGCTCTAGGGAGCTACTGCTGGCCGGCGCCGATCCCGATGTCCGGAATAATTATGGAGATACTCCGCTGCACACCGCCT 
GCAGATATGGACATGCAGGAGTCTCCCGGATCCTGCTCTCCGCCCTTTGCGATCCGAATAAAACAAATCTGAACGGCGACACCGCTTTGCACATC 
ACCTGTGCCATGGGTCGCAGGAAGCTGACAAGGATTCTCCTAGAGGCAGATGCTCGGCTGGGCATTAAAAACGCACAAGGCGACTGCCCCATGCA 
CATAGCGATCAGGAAGAACTACCGTGAAATCATCGAGATCCTGAACACGCCGAAGAAGATACGGAATCGCAAGGAGAAACCCAAGGAGGGCGGCA 
GCAGATCGGACAAGGACAGAGATCGTGAAAGGGATGTGGTGGACAAGGGTATCAACTGGTCACCCTATGGCTGTCACTACTTTCCCGATCCGCGT 
GCCTTTCCTTCACCAAAACTGGAGACACTGCCCAAGGAGCCACTTAAGTCGGGGGAGCAATACTTCCTGGATTTGGCTGGTCATATTCACAAGGG 
TCCGGTCAGCGTGGGCAACACATGCTATTGCGGTCCCTTCTTCCGTCACATTGAAAACAAACTAAATTGCAATCGAAAGAGTTTGAAGAAATACG 
TTCACAAAACCAAGGAACGTCTAGGTCACAAAGTTCAGGCATTGGCAATCAAGACGAACGATCAGATTGAACAGCTCACAAGAACGATGATCGAG 
GACAGATTGCGATGCGAGAGCAAGAGGCAGCACCTCAGTGAGTTCCTGCGACGAGGTGAACCTATGCGGTCCACCTTCGATCAGCAGAACAAGAG 
CTCAAGGATCGAAAGAACAATGTCCAGGTGCCGCAGTCTAGAGCTCTTGGAAAACAACCACGAAGGTGGTAGGCTCACCAACTCACGGAGTGTAG 
AT G T CC T GG AAG AT C AGGC AGT GGAGGC T AT AGT AC AC C GAT C AG C GG AGGT C C AGG TGG AT AG C G AC AGC G AT G AC GAT T C C AAT G AAGC T C G G 
GATGAAGAAGAGGCAGAAGTGGGTGACAAGGACCAGGTGCAAGAAGAGGCGGAACCAGAGGAGCACTCGAAGCTAGATGAACTTAAGCTAGATTT 
CCTAAAAGTCTCCGAAAGACTGGGAGTGCTTTTAGAGAAAACCACCCTGATCATGGAAAGGGACAGCGAACAGGAGAGCAAGCGCCAGCTTTCTT 

CCTTATCGCCACCT T AT AAT C C C CAT C C AAGGGCGGAGT C T GC C C AAC T T C G AG AAG AT AAAGAG AGT GT T AAT AGT C C C AAT T T CG ACG AGT AC 
GCCAACGTGATACGACGCTATCCGAACTCCAGTGAGACCTCAAATCCCTCCCAGAACTCCAACAGCTGGGACTGGGAGGCATCACCCACGGGCAG 
CAATCCCCACCCGGATTACAATAAGTACTACCAGCGCATTGGGAGAGGAGAGAACATGCTGAACTCGGTGATCAAGGCGCTTCGAAAAGATGCCT 
CCTTCGCGGATCTGGGCAAAGAGGAGACGGCTGTCAATGAAACTGCCGAGGGAATGGGAGGCGACAAGCTGCTCTACAAGGAGCAAGCGAGCGAG 
GCGGCAGGTATCTCCAATCTGATGAAACGGCAGCCTCTTTGCTTGGAAGACAACTATCCCGTCATGTCGAATTTATTTTACTCCAATCCCATCAT 
GGAGTACGCCGAGGAAACAGAAGTGCGTCAAAATGGCAATGAAAGTATGAGTAAGGTGGAGATTCGGAACAGTGAAATCAAGTGCCTAAAGAAGC 
CGAACGCTGGTCAGGTGAGGGATATGGTGGCCCAACTGCAAAACACCATTGACTCCAATCGACAGGACTCCCAGACGGCACATGTGATCGCCGCA 
AGATCACACTCCCGCTTGAACAATAACTTCCATCACTTGGAGGGCACCTCTTCGAACCCTTCGCCCACGTATAGTATTCCCCACCGGAGTAGCGC 
CCAGTTTGGGCTGCCCGAGAGACACATACCCAAAGATGCTTACTTCCATGAACTGCCCCACCGTCCACCGATTCCGCAAACTCCGCAGAGAAGGC 
T G C AC T C C GGCT AC GT AC C AAAT GCG AGT C T C T AT C G C AG T G AAG AT C T TT AC GT GGG AAG G ATT CC AC AT C C AAAT GC T CC ACC AAG AC AAGT G 
GACTACGTGGATGCGGTAGCCTATAGGCCTCTGCAGTCCCCCACCTTTCAGGCCATTGTGCCGCCCCATGTTAATCAAAGGCCAAGGAACTGGCC 
TCATCCCAATCTTCCATCCAATGAAATCGATCTGGACGAGCTGTCTGCCGTTGGCCTCTACAACAATGTCTCGAGTCTAGTTTGA 

(SEQ ID NO: 872) 

Start ATG: 170 (Reverse strand: CAT) 

MDPPAFRNRKLSRMRYPGNWTSLHEAVAAGDDRRVEGLLFSNADRLARESVQGNTPLHEAASRGFSRCVKLICTPPTPPASSGVGKAAKGQKDKD 
KDKNKGRVKVAHQTIEALHNSALGIANNEGLSALHLAAQNGHNQSSRELLLAGADPDVRNNYGDTPLHTACRYGHAGVSRILLSALCDPNKTNLN 
GDTALHITCAMGRRKLTRILLEADARLGIKNAQGDCPMHIAIRKNYREIIEILNTPKKIRNRKEKPKEGGSRSDKDRDRERDVVDKGINWSPYGC 
HYFPDPRAFPSPKLETLPKEPLKSGEQYFLDLAGHIHKGPVSVGNTCYCGPFFRHIENKLNCNRKSLKKYVHKTKERLGHKVQALAIKTNDQIEQ 
LTRTMIEDRLRCESKRQHLSEFLRRGEPMRSTFDQQNKSSRIERTMSRCRSLELLENNHEGGRLTNSRSVDVLEDQAVEAIVHRSAEVQVDSDSD 
DDSNEARDEEEAEVGDKDQVQEEAEPEEHSKLDELKLDFLKVSERLGVLLEKTTLIMERDSEQESKRQLSSLSPPYNPHPRAESAQLREDKESVN 
SPNFDEYANVIRRYPNSSETSNPSQNSNSWDWEASPTGSNPHPDYNKYYQRIGRGENMLNSVIKALRKDASFADLGKEETAVNETAEGMGGDKLL 
YKEQASEAAGISNLMKRQPLCLEDNYPVMSNLFYSNPIMEYAEETEVRQNGNESMSKVEIRNSEIKCLKKPNAGQVRDMVAQLQNTIDSNRQDSQ 
TAHVIAARSHSRLNNNFHHLEGTSSNPSPTYSIPHRSSAQFGLPERHIPKDAYFHELPHRPPIPQTPQRRLHSGYVPNASLYRSEDLYVGRI PHP 
NAPPRQVDYVDAVAYRPLQSPTFQAIVPPHVNQRPRNWPHPNLPSNEIDLDELSAVGLYNNVSSLV* 
( SEQ ID NO: 873) 

Name: Ankynn-repeats containing protein 
Classification : cytoskeletal_structural_protein 



Celera Sequence No. : 142000013384684 

ACCCAGGCCGGAGGCTCCACGCATCCCGCTGCCAGTGCCAATTGCCACGGTTACGGGAAGCCGGACGCTTATCCCACTTTCCGGATACGATTGCC 
TAGCAGATCTGCCGTCCGTACATATTGAACAGACCTTTGAGCTTAACGATGGTAAGTACAAGCCCAAAAATTAGCTAAAATCGCTTCTAATAGTT 
CGTCTTCCCTTTGCAGCCCTAACAGGCGTCTCCTCGGAGAATCGATATGTGGTGCGATCTCCACTGGGTGATGCCATTTTCGCGGCCAACGAGAG 
TTCCACAGAAAAAAATCGACTACTTTGGGGAGCTGGTCGACCCTTTCAGATGCACCTGCTGGATAAAACGCACCAGGAAGCTCTGGTTTTTCGCA 
AAAAACTAGCGATGGGATCCATGTGCTGCCAGGCAAAAAGTCTAGAAATTTGGATACCACCTGGTAATTTGCTGGGTAAAGTGGTGCAGTCGCCC 
ACCTTCATGCAGCCGGAGTTCTTTATCGAAGACGAAAGCACAGGACAACTAACATTCTGTGTAGAGGGTCCTGTCGGTTTAGGATTCTGCTGCTT 
CAGTTTGCCCAAAGATTGTTATTTTAAGGTTGGTAGTAGCTTTAACATTCGGAACAGAATATTCAATTATAGTGTTTTTGTTCCCTTACAGATCC 
ATTCGGGAGGCAATATGAGGGCTTCCATAGACCACAAATGGCTCGCCAGCAAGTCTCAGTACACCACAAATATATACTTTAGTGACGCCAAGCTA 
ACCGCCAAGGAGCGGGCTTTGATACTGGGATCAGCCTTTCTACTGGTACTGTGCTCCTAGTTTATATACATATATCATTTGCCACAATAATGTTT 
TGTATTCGTTTTTCAGGAATATCTATTCTTCCAAACCCGCTTTTGACGTGATTAATCAATAAAAAAAAACATTGTATATTCACATTGATTTCTTT 
ATTTTTTGTCATTTTTTTGTATAAGATTTTATGTTTGTTTGTGTTTTGTTTTTTTTTTTTCTCAGTTTTTTGATATATTTTACAATAATGTAATA 
GATCACTTCATTTTCTTGTTGTTTAACCACATACACACATTCAAATATATAGATATATATCTATGTGTATATATAGATTTTGCTGCAGCGCTTGC 
CATAGCAGTTTGTACAATATTTCATTTATATCTATCAACTTTGTGTAGTTTGTTTATCTAACCTACCGCTTAATGCATATATGACAACATATTGA 
TTATTTACAGTTAGCGTTTTTTTTCATCATTTCGTTTCTCTTCTGTCAAATACAAAATTAAAAGAAACAATTAAATGAGTCTCGTGGTCTCGTTG 
TTTAAAGAATTAAAACAAAACTAGGGGCGCAAGTCATTTCTCATTCACCGTTAATAGTTGCGCTTCAAAACTTGGTGTGATAACAATTTGAAATA 
AGGAAGAACTACGTGATTTTCAAAGGGCAAAACTTGGATTTGGACATAGAAAACCAATATACTTAGTGTGCGGGTGTGTTCTGGTTCGGTTTGGA 
GGTTCATTTTCGGGGCTAGGCCTTAAAAATAACACACTTCTCTCGCTCTCGGATCTTTACATTTGTAGATAGACTCACAGCACTTCAGGAGTTTT 
TGCATTTATGCATTTCGAAAACAAATTATGGTTAAGTGTATACAATATAGATAGATTTATAAGTGTGCTCGTTGGTTGGCTTATAACTAGGGCTA 
GCTGATGACGATTCCTTAACCTAAATACTTAAGAGGCAGATCGAAACGATCTGTGTGGGGTGACTGCCTTTTAAAACCGTAAAAGGCTTTGTATA 
TGATTATTTTAGCGCAATTCGCCTAGAGTGTTTACACTGGAATAGCTGACTCGCTTATCTGGCTCCCCTATCGTTTGATTACTACTCGCGATTAG 
CCCAATACTAGTTGGGCCCATCATCGAAGAGAGGTGGGTCTTATAGGCTAGCTCTAGAGTAAGTTGTGTGTGTGCCCATATGCGATATGCTTAGA 
CATTTGATATGCGCTCGTTGGAGGTGGTTTGGTCATGGCGGCCCTCGATCCTGGAATTGTAGATGCCGTACTGACTGCTGTCCAGGTGGTAGACC 
GCGGGATACTGATAACCTCAGCTGGGCTGCAGCATGTACATGTACTGTCAGTTGCTTGCCGCTGCTGAAGCAGCCACGGGAACGGCCACGGGAAC 
AGGCAGCTCCTGGCCAGAACGAGCCCGCTTCTCGCCACTGCTTTCCGGTGGATTATCGCTCTCCTCGCCCGTACTGCTGAAACTGCCCGGCAAAC 
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GACGCTTCAGGCTATGGTTTAGGCCATTGGCCACCGTTTCATCCTCGGACATCGAGCTGGCTCGCTTGGGTGTGTTGCCAGCACTGTCGTTGTCC 
AGCGAAGCTATAGCCGAGGCAATTGCGGCACTAAAGCCATCCTCGGACGCACTATCCTGCGAGACGGGACCTGCTGCATCCTGGGCCAGCTGCAT 
CATGCCAGAAGGTGTGGTATAGCTACTGAGCTTACCAGGATCCTTCTCCTGCAGTTTCTGGGCCAGCAACTTGAGTGATATCAGCGAATTTTCAC 
AGTCGCTTGTATCGATTTCAGTGAGACTAATTCCGGCACGTTCAATTTCAGCATAGGTCCCGAAGGGCAACTTCTGCAGACACTCTAGCTTCTGA 
ACATGCGTCTTGGAGCGCAAATGCTTGGCCAAATGCCCATGGATGCGGAAGGCGATCTTGCAGTCGGTGCACTCGAAGGGTCGCGGATTTGAGGT 
GGTTGGAACACCAAAAGTCGACGTCATCATAACCTTGTTTTTGTGCGCTTCCGATCGCTCGTGCTTCTGCAGATGTCCTTGCGTGCGGAAGGATA 
TGCTGCATGGCTCGCACTTAAACTTTCGCTCCATGTAATGAATATTCATGTGCAGGGTCAGCTGATGTTGCCGCTGAAACGTCTTGGAGCACATG 
GTGCAGGTGGGACGTGCATCACCTGCAATGGGAGCTGGCGGCACTCCTCTGCCATACGATACATGCTGGAGATCGCCGCTCTTGCTAGTGGGAGT 
AGAGCTTTTGAAGCCATCTTCACCCACAATCACAGTACGCTGCGCGGGCTGAGTTGAGTTGGCAGTGGATGAGGGTGCCACAGGTGCAGCTGTTG 
TTGTGCTTACTGAAGTAGCAGTAACAGACGATGTTGGTGGTGCAGTCGGATTTCCAAGTACTCCAGTAAATTCATTGACATCCGGTTGCTGCACT 
GCAGCAGGCAGATTGGCATTGTGTGGCTCTGCGATATTAGGCTCCTCGTTTTCTTCCTGGGCAATCTGAGCAAGAATAACCGGGGCCTGTTCAGG 
TGCACTCCGAGGTGACTCAATGGCCTCTGGGGTCTTTACTTCGTGCACTTCGATGCTAGGTACCTCATTTACGCTCATAACGGGCCTGCTGTTGG 
CGGTAAAGAGAGTGCTACTACTGCTCATTACAGGAGACGCCGCTGTGTATAATGTGTTGATCAGATAGCTGCGGCTAAACTGACTCTGCTTTATC 
TTGGCTTGATGCAGCGCATGCTCCTTAATGTAGGTCTGTAGTTGAACATCCTCGCCCTCGGCCTCGTGCTTGGCCTGCTCTTCCATTTCGGCAGA 
TATTCGCACTTTGTCCGAAACCGAGATTAGAGTCTTCATGAAACCGCTTTCGTTGCCGGAGGTTCCAAAGATAACATCGTGCATCTGTTGGGCCT 

GC GATT T GGG AAG AT C AT G AGCGGGC AC C G AAAC AGG CG AGGG GG AT AT T AAT AAGAT GG AGC CGC GT G GC T T AGT G AGAT C C AT GGG AG C AC T C 

TCCTCGTTGGCCACACTCCGTTTCGGTTTGGACTCCTCCGGATTGGAGTAATACTGTTCCTGCTTTTGGTAGTCGAAAGGCGGCTTCGGCGATGT 

AAAGGAGCACACTAATCTTTTAGGTTGCGAACCCGAAGAGCCAATGCTGTTTGCAGGACTGGCCGCTGGTAGGGGTGTATCCTGCAACTGCGGAT 

AGGGCGATACGCTCTGCGGTATGGGCGGCGTCATTGATAACGAGAGTAGGCCTCTGGCTGCTTCGTGCTCTTTCTGTCGCGACTTGGACTCATCC 

GTATCTGTGAAGAGATTATAATGAATCAGCTATTTGAACATATAAGTGTATATGATTGCTTTCTTACCACTACTTTCAGATTCATTATCACTGTA 

GTCATCGGAATCCGATTCCCCAGCCATCGATGATGTGCGTCCTCCTGCCGAAGTGGATGACTGCTGGTCAAAGTCCATATCTACATCCAAGAACT 

CGCTGTCTGGCGGCATTGGTCCCGGATTTATTCCCAGTTCAATGCACTTCTTAAAGTGAGTTTTCGATTGCATATGCTTGGTCAGATTTCCCTTG 

GTCTTAAAACTGTGGAATGGAACAAAGTTTATAATAAGTTATCTCTTAAGTACGGAAAAAATGCAATGACTCACCTGAAGTTGCAATGGCTGCAT 

GTGAATGGCCTCACGTCCGTGTGAGTGCGAATGTGCTTCTTGAGCATCGACGGCTTCTTGCAACGAATTCCGCACTCGGAGCAAACATATCTTCC 

CCTTCCGCGGCCACGGATGTACGTGTAATCCTCATGAGACTCGTAGCCACCCACCAGGGGCTGCCCGGATGGACTTGTCTCCAACTTTTTGGCCT 

GTGCCTCCTCATTCGCCTTGGCCGCCACATTTGCCTCACTGAGAGCGGCAATCGTGACAGTGGGTGGGGTCTTTAGGGGTCCTTGATAGAAAGCA 

CTGCTAGTGGAGGACGTCGAGGGGGAGGATACGATTTGCTGGGAAGCGACTAAGTTGTAGCTAAGTTTTCCCGATCCAGCCATGGCGGTTGAGCT 

GCCATTTCCTAACATCCTTTGACGTGAGTCGTAGAGTGCCATAACTTGCTTTAGTTTGAATCCAAGTGGATGTGGCTGGTTCTCCTCGCAAACCT 

GCCACACACTATACATCGATAGTTTGTGCATTCCTGCCACATACGAAGGCTGTGGACAGTTCACCGTACAGTAAAACATCTTTGTGCTGCATTTG 

AGGCCCAGATATGTGTAGGCGTGTCCATTGAGATACAGTTGCACACAGGATTTGGACGGCCTGGGGGTCTCCGGCGAGATCAGCGGCAATGTATT 

GGCTGTCGGTGTAATGCCATGATGGCGTTTAGGAGTAAAAGTACCGGGTTTAAGTGGCAAACTAGTAGGCCGCAGAAACTTGGACTTCGCCGATG 

CTGAGGCACTGGCTACGGGACTGGTCTCTTCTGGAGCGTTTCCGTTGGGAGTATCAACATGTAGCGGTGTTAGGGGCAACTCTTGCTGTTTCCCA 

GAAATCACATTTTCCAAATTAAAGTGTCGAACTTCAGGTTCAGTGGTCTTCACATTATCAGCCATGCGTAGAAAGTTAAATGGTGCTTTCTTCGG 

TGCCTCCTTGCTGTTCCACTGCAGCGCTTGCCGCTGGGCAGGAGTTGGTGGCTGTCGACTGAACTCGCTTGGCACATTCTGAACTCCACCGAATG 

GCGATAGGGCCTTCAGTTCACCGGATCCTCCTCCCAACAGCAGCGGACTGGGTTGCTTTCGATTTGGACTGCGACCTCGACTAGAGGGAATGGGT 

AGCTGCAGCTGTTGTGGCGGAGGCGGCAGTAGCGGCATTTGTGGTGTCAGACTACCGGGTCCTGGCATTCCAGGCACATGAGGTATTGGTGTTAC 

TGGAGTGGTTTTATCTCCACCGCTCATCGGTGGCAGCGTTAGCGGATTGAAGGCAGTTATTGAGTTAATGGGCGGAAATTGAAAGTGCTGCGGAC 

CTGAGCTAACGCCGAGAAGTCCACTGCCACTGACCAGTGCCATCAGCTTGCGCTCCTTACTGGATGATCCTGTTTCGTTGTTGACTTTGACCGAA 

ACATCCTGGAAGCTTCCGCCGGACAGCAACGGTGTTCTAATCATCTTGGGGCTGTTGTCCAAGTCTGGGCACTCCGAAATGGGAATCATGCTGCC 

TCCCGAAGAGGTGAATCTGCGCAGTCGCTCCTCCTTGGAGGACTGCAATTCCGGCACACTATTACTGCTATTCTGATTATTAAAGGAAAAGTTGA 

AACTTGTCGGTTGCTGAGAGGAGGATGCTACAACCAAATCGCTAGTCTTGATGCGTTTGGCAGGCTGAGCTTCAATCACAAATTGCCGATCCTCG 

TGCATTTTGGTTATGACCACAGAATCCTGGGCCTTGCGTGGTGTACTTTGCTGAGCATAGTCCACTAAGGGAGTTTTTCCCAACAATGGACCAGG 

CGATGGTAGAGGCGCATCCACAAAGCGCAGTGCTTCAATCTGGGGGGCAGGTGCTGATGATGGAGCAGTGACCGTAGCAACAGCAGGAGCTGGGG 

CAGAAGCGGTTACAGTGGGAGCTGTTACAGTTGATGTCCTTGCTGGCGTCTGTGCTGCACTTAGGCGACTTAATACCAAACTGCTCCTTTTGGTC 

TTCAACTGCGACCAGGCCAACTTGGCCAGAGATAAGGGATTCTTCGAGTTGGCCATGTGGCTCTTCAGCTCCGGTAGACTCAGCGAGATCGAGTT 

GGAGCGGAATGGCGAAGATGCTGGACTCATTGGGGCACTTGAGCTCGCATCCTGACAGTAAGTGCTGTTGTGCAGCTTCAAATCGTCCGCACTGC 

GGAATTCGCTTGCACAGATAGGACACGAATAAACAGCATCATCGCCTTCGCCAATCGGCACGGCCAATCCTCGAGCATTAAGCAGTAGGTTCTTA 

ATTATCGAGTTATTAACCTGAGCAGGTGGTGGAGCAGTTTTGGAGCTACTAGTCGGATTGGTAGAAGTCGTTGGAGTCGGTGCCAGAGAACTGGT 

GGACAAGGAACTGACGGGCACAGTGACCAGAGTTGGTTTAGGCTTCGCTGATAGATTCAGAGGCATAACAGGCACTGCCGTCGTCGGTTGCTCAA 

TTTTCGTCACTGGTGTTGCAGGATCCTGGCGATATTGCATTGTTTGCATGGGCAACGGGGACGGCGGTGGAGGAAGTTGCTGCTGTTGCTTCACC 

GGCGTAGTCATCACTACTCCAGGCTTATAGGCATTCCTTTTAACCACCCCATTTGGTGGTTCGAGCTGTTGCTGTTGGCATTCCTGTTGCTGTTG 

GATGTACGTATAGTAGTTCTGTTGCTCGAGCTGCTGCTGCTGCTGCTGGGTTATCTGCTGCTGTTGCTGTTGCTGAAGCTGATGTTGCTGTTTCT 

GGAAGATGGCCTGTGCCAAATTCACTTTAGTCTCGCGCTCCGGCATCTGTGCGTTGGTGTTGTTTTGACTATTGTTTGCATGCAATGCCGACGCG 

TTGCTCCCGTGCTGCGAAGCATTATTGCTGTTGGCATTGTTGAAACTGCGGGAACGACTAAGCTGTTTGGGGTACTTCTTCTGCAACAATATCTC 

CTTGTTTTCAACTATGGCTTCATTTTGAGAGATCAACTTGGAAATGTGTTCCTGCACTTTCGCCACATTCTGAAAAGGATAACACAAATTTTAGT 

TTAGTTTTTCTGTAAAAATAAACTTTTAGTGCAACACTCACTACTTGCTTGCCACCACTTGAGCTGGCAAAAGTGCTGAGATTGGCGGCGGGCGT 

GGGAGTGGCTGGTGCTGTTGCTACACTGGAAATCTGGCTATTGGCACGCACCAACTGTAAACAATAAAGGTATATAGATTATAATTAATCTCCTC 

GTTCAAAAACAATCGTTCTTAACCAACCTGTTGCGGCGGGAGACTCGGAGGTTGTTGCATTGGTGGTGTCGCCTTGACCAACGGTGGGTGTTGCT 

GCTGCTGTTGTTGCTGCTGCTGTGCCAGGTGGATTTGGAATTGCTGAATGGCTAACGCATGCTGGTGAATGGCAGCCGCTGTATCTTGGTTGTAG 

TATGAAGGTTGCCCCAGGTAATACACCCCGCCAGGTGGCATGGCAGCTAGCAATGATTGCTGGGCCAAAAGGGATTGCTGCTGCTGCAATTGCAA 

CTGGATCTGATGCGAGTTAATGTGGTGCTTCATCTTCATTTGCGTGCTTGGCGAAAGCGTGGGATGCGCCAGTGACGGCTGCTGAACCAGAGGAA 

GCTGAATGGACAGCTTTTGCTGCGCCAATTGGTGTTGCTGTTGCTGTTGCTGTATCTGCAGTTGCTGCTGCTGCTGTTGCAGCTCCTTACTGCTA 

CATGAGTACAACGAGGCATTGTGGAACTTGGGCTTATATGGCTTATAGTCGATCGACTGGCGATGCTGGGCAGTGGCCGAGTGGTGATTGTCCGC 

TGTGGTGGGTGGCAATGTTCCTGCTGCCGGCGAGCCCAGAGGCAATTGCTGCATGTAAGTGGGCAGCGGAGGCTGTTGAATGTATGCGCTCTTGG 

CCGCAGACAGTGTGGAAGGTGAGGGTGTGGGCGAATTGATTGGCTCCTCGTAAGGCGAACCAGCGCGACTCGGCTGCAAAAACGTAAGGAGTTAC 

ACATGTTTGCGATAACAATCACAGTTATATTTGACTTACCAACTCTGAACTACTGTTGCTCAGCTCTGCATCCTGATCGGATAAACCATCGTCCG 

CATCGGCTGGCACTTCCAGGCCCCGCGCTCTTGCCGCATGCGATCTCGAGCGGCAATGTTTGTACAAATTACTCTTCGTCTTAAACGCAATGCCG 

CACGTGTCGCACGGATACGGCCGTTCGTTCGTGTGGGCCCGTATGTGCTTCTCCAGCACCGAGGGCTTGGCACAGATCAAGTTACAGTACTGACA 

AACGTATCGTCCGGACTTCTCGCATGCAGGATTTGGCGCTGCTGCTCCGCCTGGAATACCAGGACTCACTGTTGTTGGCTGCGTCAATTGCTGAT 

GCTGGGCAAATGAAGCCGGTGACAGTGTGGCATTTGCTGCATGGTTACTGTTGCTGTTGCTGGTGCTGTTCGTGTTACTGTTGCTGTTGGCTAGA 

GGCGTTCGCGTTCTTCCCGCCGCTTTGTAGCCACTATTGCTGATGATGCTCTCCTCATGACCAATGGCATTGACATGATTGCTATTTACAAAATT 
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GGCACTGAATTCATGTATTGCTACTGAAGGCGGCGGCGTCTGTTGCTGCTGCTGCTGCTGCTGCACTTGCTGGTGGCTTTGTTGTTGGGTCATCA 
GATGGTGACCGTTGGCCAGCGGCGTTGTTGGTGGCGGTGAAAGCGAACTGTTTGAGGAGAGCGATGTGGTGCGAACGGCATCGCCACTTGCTTGC 
AGTGCTCCGCCGCCATTCGTCGCACTCCAGCTGTCCACCTCGGTGGTGCTGGCCAAACGTTTGAACTTCTTGTGCAGATAGCGTGAAATATTTGA 
CCGATCCCCCAACGGAGCCTCATCGGTGGCGGGTGTGGCATTTACGGTGGTCTGCTGGCTGTTGATGATGCTCGAGTTGCTCATCGTGTTGGCCA 
CATTGGGAGCCACTGGCTCCTGGTTGCTGCTGACTGCTGCCATCTGTTGTAGCGTCATTTTGCTATCACTTAGACTTGCTTTATCTCTACTCTTA 
AATTCATAATTATTCTCACTAATACTATTATCACTGTGGTGGTTGTTGTTGTTATTATTACTTGTGTTGTTATTGTTATTATTATTATTGTTGTT 
GCTGATGATGTGGTTGCTGTCAACAGTCGGCAGCGGCACCTCGCCTAGCTTATCTCGTGTGCTTGTGCTGGGTTTGCTGCTGCTGTTGCTGTTGC 
CACTAGCAGCATCTGTTGTTGCTGCTGCTGTTGCTGCCGCTTTAGAAGCTGTTAATGTTGCATCTGTTGCTGTTGCTGTTGCAGATGCTGATGCT 
GATGCTGTTGCCTCTGTTGCTGCTGCCAATGCAATTGCTGCAGCGATGTTGGCCTTGTTGGTTGATCTCTGAGTGACTATGGTTGCTGTTGCAGT 
TTCCCGATATGTCCGCTTCTTGGTTGGCGTTGCTGCTGTTGTTACTGTTGTGGTTGCTGCTGCTGCTGCTGCTGCTGCCGCTGCTGCTGTGGCAC 
GTCGCGTATTGTTCACTTTATTGGTCTTGTACTGTTTGCCTGCGGTTTTGTAATGCTTGGCCGTTGTGGTGGTTGCATTTTCCATATTATTATCG 
AAT T T AT TAT C C T GT G AC GT AT T T G C AG T GGGCT C AT C T AT AC CG AAAC G AGAAGAT AG AGGG AG AGGG AGAG AG AAAAC C G AAATT AAT T C AAT 
GTGTTTCGCATAATTTAATTTAAATAGCAATGCTAATTGATACATTCATGATAAGCCCATTTATATACACTGATATCCCTGCTAGGTAATCCAGT 
AGAAATCAACCACATCTGACTTTGTGGATTTGCAAATCTTCGGACGATTAAATATTTAATATTAGTTTCTAATACACCCTTGTCCCCTATAAGTC 
CTCACATGATTAGTGAGAGGTTTGGTTTTATCTTTTATATGTTAATTGCGCTGTTATGTTACTGTTACTGCATTGTATTGATTCATCGCTTCTAA 
ATAAATAAATATATAAAAAAAAAAAACCCTTGTCCCCTGGCCAATAAACATCTGCGGTTTGCATTTGGGTTTCCCAAGACGAGTTCCCTGCCCAT 
ATAGTACACTTTTATTTCGAGCCCCATTATCAGCACTAATTCGGGCTCAATCTTATCAACGCGTAGCCGAGAATGCCAGATACGGAGCACGCTAA 
TTCTCTCCATGGCGATCAATCCGTCAGGTGAATCACAAGCTGGCGAATTCCCGTGCTCTATTGAAAATCAATTTCCCACTCTGTGAACGTGTCAA 
CATCGATAAAGCACAGTGGCGAAAATATACGAACACACACAAATGTATATTCGCCTTGTTTCTTTTTATCAACTTTCTATTTTGAATTCCCAACT 
TAGCTGGGGTTCAGGCCCCACCTGCCGAAAACCAGACATGCGACCAAAAGATACAAAAATCCAAGAAAAGTGACAAGTGAAGTGTAATGACTGGT 
GTCGCTGGCATTTCGTTTGTTGTTGGGGGCTTCCTTATCACCGCTCGTGCACACATGTGTGACAACAGTGCGTATCAGTACTGTAAAGCTGCTGG 
CAGGTGTGTGATTAGCAATGGGATTATAAAGCGTTGCCAAATATGAAGAGTGAGTATTTGAGAACTCCATTTGCTCGCATTAATTAAGGCTTCAG 
TTTGAAAAACAATTTAGGCAGTCTTTGGTAAATTCTTTGTTTTGTGCATGCCAAACGAAACGAAAAGCTGCAAAAATCTTTTAAAAATTTTTCAC 
TTTTGCTGCTATTTATGTGAGCCCCACTGTCTGCATAGATGGTATTATATGACTCTTGGCCGGCTGGGCTGTCGGGCGATTAGTAAGCCGGGGCA 
GTTCTCAAATCACAATCGCACACAGACTGAACACAAAAAACGCTCACTGGCAACCGAGCTGGGTCGCGTTAATTGACTCGACTGCGGCGACTGTT 
TAGCATTCAGCGTTATTATTATACACATTATACGGCATTATGCGATCGCAGCGACTCTCGTCGCGTACCAATGTTTACTCAATTTGTTATCAGCA 
CGGCTGGCTAATAAGGCTTTCAAACCGTGCGACCGCAAAAAAAAAAATAAATAAATAAAAAAAAAATGAAAAAAATTGCCCAAAAGTTCCATTGA 
TAATGCCAAGCTCTACATGCACAGGCGGTAAGGTAAGTATTTTCGGTACGTGTGTTTGCCAGATTCATTATCAGTTATCAGTGGGGACTATCGTG 
CGCGCCTGAGTCAGAAACAGAGCACCCGATAAGCGAGCCCCGTTGTAAGCCACCAAGTGATTGCCAATTTGCCGGAGTTCAGACGCGTTGACGTC 
AGTTATCCGATACGTAGTACTATTCGTTCGCAATCGTAAGGGGCTATATCAGATTGTCAACTCATTGGGCAATTAAGTGGCATTCGCTGGCGATT 
AGCTGTTCGAGGATACAAATCCTAATTACATAATCATGTGTAGTAAATATGTGTATGCACTCGACTTATTAATAAAACGAGTGAGTGGGTCAATT 
ACAAGGCGCCAACTGCGCCACAATTGAAATGCATACATATGTGTGGCGATTACACGGAATGTGTTGTTTCTTGCTAGTGAAAACAAAAGCGGAGC 
AG AC AGT G T GT GT G AGC AT T T T G AC T AAAT AAGT AGAAT AAAGT T T AAT AAC AC AAC GC C C AG AAT CTGTCTTTC G AAAT AAT AT T T AC AT T T GT 
GTTCATTCGCTGTGCGAGAAACTATTGAAGTTCAACTGTACATAATGACAAAGTGGGCTATACATTCATTATACAGCAGAAACTGGCGAAACTAA 
TGCCATCATCTACAGTCACGACAGTATAATAAAGTCAAGCGGGAATAAAGAAGATAAGCTAATCAGAAGTAGGCGCTTATAAAAACGCCAGCAAT 
TAAAACCAGAATACTCATATGCAAGATACCCCGCCACCCAATGTGCAAACAATTAGCGCAAAGTGCACAGTGACGTTTAAAGATCCCAAGTTCGT 
TTATCCACCATTTAGTTTATTGTTAGCCACGTAAATCCAATTGTCAGAGTCATCAATATCAATACGACGTTTTCATTATCACGAACTTTTGTTTA 
ACGATCGGGGGTCTTCTAATTACGCCTGTTTGCCAGCATTTTGAGGCGTGAAAACGGCAAGTGAACGCGAAACACATGGGAACGTATTTCAATAT 
ATGTGAACAAACTGGAACTGCATTTGCATTGGCATTTTCATTTGCATGCACATTTTTATTGAGCCAAGCGAAATACAATTGCCCCTGTTCAAAGT 
GCACATAAATAAAGCGCCACGGCAAATAATCGAATAGGGTTCGGTACACTCTTACAAATATCTAAAAATTGGTTGCAATTAGTAGTTCTTCTAAC 
CAGTAAATAAACACACAATCTTAGATTTTAAACACCAACTTCCGCGCCAAACTTCATGAATATCTAAGATACATAAATAAATCACTTCTTATTAA 
ATCATATTTACATTTGTTCAGAGAGTTTGTTTTTCGAAGCATATGGAAATGCTGGCATTGTTTCTGTGTAGAAAAGCTTGATAAGTTTGCAGCTC 
ATATTGAGCACACGCTATCTCGCCTACGTCACAAGTGCATTTCGCGTTTGACTGAGTGACTGCCTGAATTCGAAATTATAATAAAACCAAAAAAA 
AAACGAGAATACATATAAAAACACCCACACCAGCGCAGCGAAATAATGAAAAATAAATAACAAAATCAACGAGGTGCGGCGGCAACGGCGCCACA 
AAGTCCACACATCTGTCTGACGGTCAGTCTGTCAGTCCGTCACTCTGTCAATCGGTCATTCGCTCAATCCGCGCGATGACGACTACGTCATCGCA 
GAGCAAAAAGCACTAAGCACTCGCCCACTTGGCAATTGTTTATAATTTATTAGTAAGAAGCGTTTTACAGCGGTTGAGCAGAACGAACGAAAAAT 
GCATATGTAATAGTATAATACGTCAAAAAAAATTCGCGGAAAAGATTCCGAAATTCCAGCGAACGGCCAAAATGCGAAATGCCCGACAAAAATGA 
CGAAAACGGCACAAACGGTAAAGAGAAACATACCCAAGTTTAATTGTGACCACACGCATTTTTCCGCCTGTACCGCTCATTTGCATAGTTGGTGT 
TGTTGTTCGTTTGGTTTGAAGCGCAACAATTGGAAGTGCACGGCGAGAAAAATCTGGCGGCGCGAATAGCAAGAAATCGAAAAGGCGACAATCGA 
CAATCGACGACAACCGACGCGCCTGAAACATTTAAGGCAACTAGACGAGGCGACCAGCGAAATTCATTCAGAGTGTGCTCAGCGTGTGTGCTTGT 
GT GT GC AT G T GT GT G GT GT GT GCGAAAAG AAAAC GAAGAAGCG AC CGAACGAAGCAAAAAC AAC AAGT AAGAAGCCC AAAGT CGC ACT AGT CGAC 
TACAATGTACCCTTGAGTGGCAGTAAAAGTTATTCTTACTACAAATTACGTATAATTAGTAGGAATAGGCTAAACGTTTTGGCATTCTGCGATTT 
TGAGTTGAGCATAACGAAAATATCAGTTCAGAAAATAAGTATATATTAGGAAAGTTGAAGTCTTCATGCCCGAAAAGGTTTTATAGGGCAGTCTA 
TATCCCAGAATATTCGGTAGTAGCTCCAATAGATCATAAGCTTATTAATTTTTCTCATATTGAAAATACATCTGTATTCAAATAAAGATAGGTGG 
AAAGATTTCTTGCTATAGCGTTTTTTGGTAAATCTATTAATTGGTATAATATTAAGAAGTTGTTTTGTAATATCTTTACTTATTTTAAAAATGCC 
ATATACCCATGTTTTCATTCAGTTACTGAGTGTTAAAGAGGCGCGACGTCAGCGCAGGCAGCAAAAAAAGTACAATAATCAAATAATTGCGGCAG 
CTTCGTCGTCGCCCCCCGCACACTCACACACAAACACACACATGTGCTCTCAATCTCCTCTCCACTCATTCGCAATTGTCAATCAAATGTAAGCA 
CAGGGTCGCCACTTCTTTTTGTTCAGCTTTTGATTTTGCGGCAGCCACTGAGAAATGATCAAATCAAGCAAAAATTAAGACTCAAAAAAAAGTAT 
AAAAATT AAAC AAC AAC AGGTGACGCGAAGTACGAAGCGAAATGAAACGAAACGATTCGAAATGAAAGACTGTCGCCAGCCAGTCAGACGTTGCT 
GTTGTTGCTGCAGTTGTTGTTATTTCCCCCTATTGTTGATTACGTACCCCATACACACACACTCACACACAAGTGCGC 
(SEQ ID NO: 874) 

Exon: 13708.. 13524 

Exon: 10107.. 8495 

Exon: 8433.. 7723 

Exon: 7654. .7547 

Exon: 7479. .4445 

Exon: 4379. .4153 

Exon: 4089.. 1001 

Start ATG: 10060 (Reverse strand: CAT) 
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Transcript No. : CT23537 

TTTCAGGCGCGTCGGTTGTCGTCGATTGTCGATTGTCGCCTTTTCGATTTCTTGCTATTCGCGCCGCCAGATTTTTCTCGCCGTGCACTTCCAAT 
TGTTGCGCTTCAAACCAAACGAACAACAACACCAACTATGCAAATGAGCGGTACAGGCGGAAAAATGCGTGTGGTCACAATTAAACTTGGATGAG 
CCCACTGCAAATACGTCACAGGATAATAAATTCGATAATAATATGGAAAATGCAACCACCACAACGGCCAAGCATTACAAAACCGCAGGCAAACA 
GTACAAGACCAATAAAGTGAACAATACGCGACGTGCCACAGCAGCAGCGGCAGCAGCAGCAGCAGCAGCAGCAACCACAACAGTAACAACAGCAG 
CAACGCCAACCAAGAAGCGGACATATCGGGAAACTGCAACAGCAACCATAGTCACTCAGAGATCAACCAACAAGGCCAACATCGCTGCAGCAATT 
GCATTGGCAGCAGCAACAGAGGCAACAGCATCAGCATCAGCATCTGCAACAGCAACAGCAACAGATGCAACATTAACAGCTTCTAAAGCGGCAGC 
AACAGCAGCAGCAACAACAGATGCTGCTAGTGGCAACAGCAACAGCAGCAGCAAACCCAGCACAAGCACACGAGATAAGCTAGGCGAGGTGCCGC 
TGCCGACTGTTGACAGCAACCACATCATCAGCAACAACAATAATAATAATAACAATAACAACACAAGTAATAATAACAACAACAACCACCACAGT 
GATAATAGTATTAGTGAGAATAATTATGAATTTAAGAGTAGAGATAAAGCAAGTCTAAGTGATAGCAAAATGACGCTACAACAGATGGCAGCAGT 
CAGCAGCAACCAGGAGCCAGTGGCTCCCAATGTGGCCAACACGATGAGCAACTCGAGCATCATCAACAGCCAGCAGACCACCGTAAATGCCACAC 
CCGCCACCGATGAGGCTCCGTTGGGGGATCGGTCAAATATTTCACGCTATCTGCACAAGAAGTTCAAACGTTTGGCCAGCACCACCGAGGTGGAC 
AGCTGGAGTGCGACGAATGGCGGCGGAGCACTGCAAGCAAGTGGCGATGCCGTTCGCACCACATCGCTCTCCTCAAACAGTTCGCTTTCACCGCC 
ACCAACAACGCCGCTGGCCAACGGTCACCATCTGATGACCCAACAACAAAGCCACCAGCAAGTGCAGCAGCAGCAGCAGCAGCAACAGACGCCGC 
CGCCTTCAGTAGCAATACATGAATTCAGTGCCAATTTTGTAAATAGCAATCATGTCAATGCCATTGGTCATGAGGAGAGCATCATCAGCAATAGT 
GGCTACAAAGCGGCGGGAAGAACGCGAACGCCTCTAGCCAACAGCAACAGTAACACGAACAGCACCAGCAACAGCAACAGTAACCATGCAGCAAA 
TGCCACACTGTCACCGGCTTCATTTGCCCAGCATCAGCAATTGACGCAGCCAACAACAGTGAGTCCTGGTATTCCAGGCGGAGCAGCAGCGCCAA 
ATCCTGCATGCGAGAAGTCCGGACGATACGTTTGTCAGTACTGTAACTTGATCTGTGCCAAGCCCTCGGTGCTGGAGAAGCACATACGGGCCCAC 
ACGAACGAACGGCCGTATCCGTGCGACACGTGCGGCATTGCGTTTAAGACGAAGAGTAATTTGTACAAACATTGCCGCTCGAGATCGCATGCGGC 
AAGAGCGCGGGGCCTGGAAGTGCCAGCCGATGCGGACGATGGTTTATCCGATCAGGATGCAGAGCTGAGCAACAGTAGTTCAGAGTTGCCGAGTC 
GCGCTGGTTCGCCTTACGAGGAGCCAATCAATTCGCCCACACCCTCACCTTCCACACTGTCTGCGGCCAAGAGCGCATACATTCAACAGCCTCCG 
CTGCCCACTTACATGCAGCAATTGCCTCTGGGCTCGCCGGCAGCAGGAACATTGCCACCCACCACAGCGGACAATCACCACTCGGCCACTGCCCA 
GCATCGCCAGTCGATCGACTATAAGCCATATAAGCCCAAGTTCCACAATGCCTCGTTGTACTCATGTAGCAGTAAGGAGCTGCAACAGCAGCAGC 

AGC AAC T GC AG AT AC AG C AAC AGC AAC AGC AAC AC C AATT GGC GC AGC AAAAGCT GT C CAT T C AGC T TCCTCTGGTT C AGC AGC CGTCACTGGCG 
CATCCCACGCTTTCGCCAAGCACGCAAATGAAGATGAAGCACCACATTAACTCGCATCAGATCCAGTTGCAATTGCAGCAGCAGCAATCCCTTTT 
GGCCCAGCAATCATTGCTAGCTGCCATGCCACCTGGCGGGGTGTATTACCTGGGGCAACCTTCATACTACAACCAAGATACAGCGGCTGCCATTC 
ACCAGCATGCGTTAGCCATTCAGCAATTCCAAATCCACCTGGCACAGCAGCAGCAACAACAGCAGCAGCAACACCCACCGTTGGTCAAGGCGACA 
CCACCAATGCAACAACCTCCGAGTCTCCCGCCGCAACAGTTGGTGCGTGCCAATAGCCAGATTTCCAGTGTAGCAACAGCACCAGCCACTCCCAC 
GCCCGCCGCCAATCTCAGCACTTTTGCCAGCTCAAGTGGTGGCAAGCAAGTAAATGTGGCGAAAGTGCAGGAACACATTTCCAAGTTGATCTCTC 

AAAAT G AAGC C AT AGT T GAAAAC AAGGAG AT AT T GT T GC AG AAG AAGT AC C C C AAAC AGC T TAGTCGTTCCCGCAGTTT C AAC AAT GC C AAC AG C 
AATAATGCTTCGCAGCACGGGAGCAACGCGTCGGCATTGCATGCAAACAATAGTCAAAACAACACCAACGCACAGATGCCGGAGCGCGAGACTAA 
AGTGAATTTGGCACAGGCCATCTTCCAGAAACAGCAACATCAGCTTCAGCAACAGCAACAGCAGCAGATAACCCAGCAGCAGCAGCAGCAGCTCG 
AGCAACAGAACTACTATACGTACATCCAACAGCAACAGGAATGCCAACAGCAACAGCTCGAACCACCAAATGGGGTGGTTAAAAGGAATGCCTAT 
AAGCCTGGAGTAGTGATGACTACGCCGGTGAAGCAACAGCAGCAACTTCCTCCACCGCCGTCCCCGTTGCCCATGCAAACAATGCAATATCGCCA 
GGATCCTGCAACACCAGTGACGAAAATTGAGCAACCGACGACGGCAGTGCCTGTTATGCCTCTGAATCTATCAGCGAAGCCTAAACCAACTCTGG 
TCACTGTGCCCGTCAGTTCCTTGTCCACCAGTTCTCTGGCACCGACTCCAACGACTTCTACCAATCCGACTAGTAGCTCCAAAACTGCTCCACCA 
CCTGCTCAGGTTAATAACTCGATAATTAAGAACCTACTGCTTAATGCTCGAGGATTGGCCGTGCCGATTGGCGAAGGCGATGATGCTGTTTATTC 

GTGTCCTATCTGT GC AAGC G AAT T C C G C AG TGC GG AC G AT T T G AAGC T G C AC AAC AGC AC T T AC T GT C AG GAT GC G AGC T C AAGT G CC C C AAT G A 

GTCCAGCATCTTCGCCATTCCGCTCCAACTCGATCTCGCTGAGTCTACCGGAGCTGAAGAGCCACATGGCCAACTCGAAGAATCCCTTATCTCTG 

GCCAAGTTGGCCTGGTCGCAGTTGAAGACCAAAAGGAGCAGTTTGGTATTAAGTCGCCTAAGTGCAGCACAGACGCCAGCAAGGACATCAACTGT 

AACAGCTCCCACTGTAACCGCTTCTGCCCCAGCTCCTGCTGTTGCTACGGTCACTGCTCCATCATCAGCACCTGCCCCCCAGATTGAAGCACTGC 

GCTTTGTGGATGCGCCTCTACCATCGCCTGGTCCATTGTTGGGAAAAACTCCCTTAGTGGACTATGCTCAGCAAAGTACACCACGCAAGGCCCAG 

GATTCTGTGGTCATAACCAAAATGCACGAGGATCGGCAATTTGTGATTGAAGCTCAGCCTGCCAAACGCATCAAGACTAGCGATTTGGTTGTAGC 

ATCCTCCTCTCAGCAACCGACAAGTTTCAACTTTTCCTTTAATAATCAGAATAGCAGTAATAGTGTGCCGGAATTGCAGTCCTCCAAGGAGGAGC 

GACTGCGCAGATTCACCTCTTCGGGAGGCAGCATGATTCCCATTTCGGAGTGCCCAGACTTGGACAACAGCCCCAAGATGATTAGAACACCGTTG 

CTGTCCGGCGGAAGCTTCCAGGATGTTTCGGTCAAAGTCAACAACGAAACAGGATCATCCAGTAAGGAGCGCAAGCTGATGGCACTGGTCAGTGG 

CAGTGGACTTCTCGGCGTTAGCTCAGGTCCGCAGCACTTTCAATTTCCGCCCATTAACTCAATAACTGCCTTCAATCCGCTAACGCTGCCACCGA 

TGAGCGGTGGAGATAAAACCACTCCAGTAACACCAATACCTCATGTGCCTGGAATGCCAGGACCCGGTAGTCTGACACCACAAATGCCGCTACTG 

CCGCCTCCGCCACAACAGCTGCAGCTACCCATTCCCTCTAGTCGAGGTCGCAGTCCAAATCGAAAGCAACCCAGTCCGCTGCTGTTGGGAGGAGG 

ATCCGGTGAACTGAAGGCCCTATCGCCATTCGGTGGAGTTCAGAATGTGCCAAGCGAGTTCAGTCGACAGCCACCAACTCCTGCCCAGCGGCAAG 

CGCTGCAGTGGAACAGCAAGGAGGCACCGAAGAAAGCACCATTTAACTTTCTACGCATGGCTGATAATGTGAAGACCACTGAACCTGAAGTTCGA 

CACTTTAATTTGGAAAATGTGATTTCTGGGAAACAGCAAGAGTTGCCCCTAACACCGCTACATGTTGATACTCCCAACGGAAACGCTCCAGAAGA 

GACCAGTCCCGTAGCCAGTGCCTCAGCATCGGCGAAGTCCAAGTTTCTGCGGCCTACTAGTTTGCCACTTAAACCCGGTACTTTTACTCCTAAAC 

GCCATCATGGCATTACACCGACAGCCAATACATTGCCGCTGATCTCGCCGGAGACCCCCAGGCCGTCCAAATCCTGTGTGCAACTGTATCTCAAT 

GGACACGCCTACACATATCTGGGCCTCAAATGCAGCACAAAGATGTTTTACTGTACGGTGAACTGTCCACAGCCTTCGTATGTGGCAGGAATGCA 

CAAACTATCGATGTATAGTGTGTGGCAGGTTTGCGAGGAGAACCAGCCACATCCACTTGGATTCAAACTAAAGCAAGTTATGGCACTCTACGACT 

CACGTCAAAGGATGTTAGGAAATGGCAGCTCAACCGCCATGGCTGGATCGGGAAAACTTAGCTACAACTTAGTCGCTTCCCAGCAAATCGTATCC 

TCCCCCTCGACGTCCTCCACTAGCAGTGCTTTCTATCAAGGACCCCTAAAGACCCCACCCACTGTCACGATTGCCGCTCTCAGTGAGGCAAATGT 

GGCGGCCAAGGCGAATGAGGAGGCACAGGCCAAAAAGTTGGAGACAAGTCCATCCGGGCAGCCCCTGGTGGGTGGCTACGAGTCTCATGAGGATT 

ACACGTACATCCGTGGCCGCGGAAGGGGAAGATATGTTTGCTCCGAGTGCGGAATTCGTTGCAAGAAGCCGTCGATGCTCAAGAAGCACATTCGC 

ACTCACACGGACGTGAGGCCATTCACATGCAGCCATTGCAACTTCAGTTTTAAGACCAAGGGAAATCTGACCAAGCATATGCAATCGAAAACTCA 

CTTTAAGAAGTGCATTGAACTGGGAATAAATCCGGGACCAATGCCGCCAGACAGCGAGTTCTTGGATGTAGATATGGACTTTGACCAGCAGTCAT 

CCACTTCGGCAGGAGGACGCACATCATCGATGGCTGGGGAATCGGATTCCGATGACTACAGTGATAATGAATCTGAAAGTAGTGATACGGATGAG 

TCCAAGTCGCGACAGAAAGAGCACGAAGCAGCCAGAGGCCTACTCTCGTTATCAATGACGCCGCCCATACCGCAGAGCGTATCGCCCTATCCGCA 

GTTGCAGGATACACCCCTACCAGCGGCCAGTCCTGCAAACAGCATTGGCTCTTCGGGTTCGCAACCTAAAAGATTAGTGTGCTCCTTTACATCGC 

CGAAGCCGCCTTTCGACTACCAAAAGCAGGAACAGTATTACTCCAATCCGGAGGAGTCCAAACCGAAACGGAGTGTGGCCAACGAGGAGAGTGCT 

CCCATGGATCTCACTAAGCCACGCGGCTCQATCTTATTAATATCCCCCTCGCCTGTTTCGGTGCCCGCTCATGATCTTCCCAAATCGCAGGCCCA 

ACAGATGCACGATGTTATCTTTGGAACCTCCGGCAACGAAAGCGGTTTCATGAAGACTCTAATCTCGGTTTCGGACAAAGTGCGAATATCTGCCG 

AAATGGAAGAGCAGGCCAAGCACGAGGCCGAGGGCGAGGATGTTCAACTACAGACCTACATTAAGGAGCATGCGCTGCATCAAGCCAAGATAAAG 

CAGAGTCAGTTTAGCCGCAGCTATCTGATCAACACATTATACACAGCGGCGTCTCCTGTAATGAGCAGTAGTAGCACTCTCTTTACCGCCAACAG 

CAGGCCCGTTATGAGCGTAAATGAGGTACCTAGCATCGAAGTGCACGAAGTAAAGACCCCAGAGGCCATTGAGTCACCTCGGAGTGCACCTGAAC 
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AGGCCCCGGTTATTCTTGCTCAGATTGCCCAGGAAGAAAACGAGGAGCCTAATATCGCAGAGCCACACAATGCCAATCTGCCTGCTGCAGTGCAG 
CAACCGGATGTCAATGAATTTACTGGAGTACTTGGAAATCCGACTGCACCACCAACATCGTCTGTTACTGCTACTTCAGTAAGCACAACAACAGC 
TGCACCTGTGGCACCCTCATCCACTGCCAACTCAACTCAGCCCGCGCAGCGTACTGTGATTGTGGGTGAAGATGGCTTCAAAAGCTCTACTCCCA 
CTAGCAAGAGCGGCGATCTCCAGCATGTATCGTATGGCAGAGGAGTGCCGCCAGCTCCCATTGCAGGTGATGCACGTCCCACCTGCACCATGTGC 
TCCAAGACGTTTCAGCGGCAACATCAGCTGACCCTGCACATGAATATTCATTACATGGAGCGAAAGTTTAAGTGCGAGCCATGCAGCATATCCTT 
CCGCACGCAAGGACATCTGCAGAAGCACGAGCGATCGGAAGCGCACAAAAACAAGGTTATGATGACGTCGACTTTTGGTGTTCCAACCACCTCAA 
ATCCGCGACCCTTCGAGTGCACCGACTGCAAGATCGCCTTCCGCATCCATGGGCATTTGGCCAAGCATTTGCGCTCCAAGACGCATGTTCAGAAG 

C TAG AGT G T C T G C AG AAGT T GC C CT T C GGGACC T AT GC T G AAAT T G AAC GT GC C G GAAT T AG T C T C AC T G AAAT C G AT AC AAGC G AC T GT G AAAA 
TTCGCTGATATCACTCAAGTTGCTGGCCCAGAAACTGCAGGAGAAGGATCCTGGTAAGCTCAGTAGCTATACCACACCTTCTGGCATGATGCAGC 
TGGCCCAGGATGCAGCAGGTCCCGTCTCGCAGGATAGTGCGTCCGAGGATGGCTTTAGTGCCGCAATTGCCTCGGCTATAGCTTCGCTGGACAAC 
GACAGTGCTGGCAACACACCCAAGCGAGCCAGCTCGATGTCCGAGGATGAAACGGTGGCCAATGGCCTAAACCATAGCCTGAAGCGTCGTTTGCC 

GGG C AGT T T C AG C AG TAG GGG C G AGG AG AGC G AT AAT C C AC C GG AAAGC AGT GGC G AGAAGC GG GCT C GT T CT GGC C AG GAGC T GCC T GT T C C C G 
TGGCCGTTCCCGTGGCTGCTTCAGCAGCGGCAAGCAACTGACAGTACATGTACATGCTGCAGCCCAGCTGAGGTTATCAGTATCCCGCGGTCTAC 
CACCTGGACAGCAGTCAGTACGGCATCTACAATTCCAGGATCGAGGGCCGCCATGACCAAACCACCTCCAACGAGCGCATATCAAATGTCTAAGC 
ATATCGCATATGGGCACACACACAACTTACTCTAGAGCTAGCCTATAAGACCCACCTCTCTTCGATGATGGGCCCAACTAGTATTGGGCTAATCG 
CGAGTAGTAATCAAACGATAGGGGAGCCAGATAAGCGAGTCAGCTATTCCAGTGTAAACACTCTAGGCGAATTGCGCTAAAATAATCATATACAA 
AGCCTTTTACGGTTTTAAAAGGCAGTCACCCCACACAGATCGTTTCGATCTGCCTCTTAAGTATTTAGGTTAAGGAATCGTCATCAGCTAGCCCT 
AGTTATAAGCCAACCAACGAGCACACTTATAAATCTATCTATATTGTATACACTTAACCATAATTTGTTTTCGAAATGCATAAATGCAAAAACTC 

C T G AAGT GC T GT GAG T C T AT C T AC AAAT GT AAAG ATCCGAG AGC G AG AG AAGT GTGTTATTTT T AAG GC C T AG C CCC G AAAAT G AAC C T C C AAAC 
CGAACCAGAACACACCCGCACACTAAGTATATTGGTTTTCTATGTCCAAATCCAAGTTTTGCCCTTTGAAAATCACGTAGTTCTTCCTTATTTCA 
AATTGTTATCACACCAAGTTTTGAAGCGCAACTATTAACGGTGAATGAGAAATGACTTGCGCCCCTAGTTTTGTTTTAATTCTTTAAACAACGAG 
ACCACGAGACTCATTTAATTGTTTCTTTTAATTTTGTATTTGACAGAAGAGAAACGAAATGATGAAAAAAAACGCTAACTGTAAATAATCAATAT 
GTTGTCATATATGCATTAAGCGGTAGGTTAGATAAACAAACTACACAAAGTTGATAGATATAAATGAAATATTGTACAAACTGCTATGGCAAGCG 
CTGCAGCAAAATCTATATATACACATAGATATATATCTATATATTTGAATGTGTGTATGTGGTTAAACAACAAGAAAATGAAGTGATCTATTACA 

TT AT T GT AAAAT AT AT C AAAAAACTGAGAAAAAAAAAA 
(SEQ ID NO: 875) 

Start ATG: 233 (Reverse strand: CAT) 
MENATTTTAKHYKTAGKQYKTNK^ 

SATATATDATLTASKAAATAAATTDAASGNSNSSSKPSTSTRDKLGEVPLPTVDSNHIISNNNNNNNNNNTSNNNNNNHHSDNSISENNYEFKSR 
DKASLSDSKMTLQQMAAVSSKQEPVAPNVANTMSNSSIINSQQTTVNATPATDEAPLGDRSNISRYLHKKFKRLASTTEVDSWSATNGGGALQAS 
GDAVRTTSLSSHSSLSPPPTTPLANGHHLMTQQQSHQQVQQQQQQQQTPPPSVAIHEFSANFVNSNHVNAIGHEESIISNSGYKAAGRTRTPLAN 
SNSNTNSTSNSNSNHAANATLSPASFAQHQQLTQPTTVSPGIPGGAAAPNPACEKSGRYVCQYCNLICAKPSVLEKHIRAHTNERPYPCDTCGIA 
FKTKSNLYKHCRSRSHAARARGLEVPADADDGLSDQDAELSNSSSELPSRAGSPYEEPINSPTPSPSTLSAAKSAYIQQPPLPTYMQQLPLGSPA 
AGTLPPTTADNHHSATAQHRQSIDYKPYKPKFHNASLYSCSSKELQQQQQQLQIQQQQQQHQLAQQKLSIQLPLVQQPSLAHPTLSPSTQMKMKH 
HINSHQIQLQLQQQQSLLAQQSLLAAMPPGGVYYLGQPSYYNQDTAAAIHQHALAIQQFQIHLAQQQQQQQQQHPPLVKATPPMQQPPSLPPQQL 
VRANSQISSVATAPATPTPAANLSTFASSSGGKQVNVAKVQEHISKLISQNEAIVENKEILLQKKYPKQLSRSRSFNNANSNNASQHGSNASALH 
ANKSQNNTNAQMPERETKVNLAQAIFQKQQHQLQQQQQQQITQQQQQQLEQQNYYTYIQQQQECQQQQLEPPNGVVKRNAYKPGVVMTTPVKQQQ 
QLPPPPSPLPMQTMQYRQDPATPVTKIEQPTTAVPVMPLNLSAKPKPTLVTVPVSSLSTSSLAPTPTTSTWPTSSSKTAPPPAQVNNSIIKNLLL 
NARGLAVPIGEGDDAVYSCPICASEFRSADDLKLHNSTYCQDASSSAPMSPASSPFRSNSISLSLPELKSHMANSKNPLSLAKLAWSQLKTKRSS 
LVLSRLSAAQTPARTSTVTAPTVTASAPAPAVATVTAPSSAPAPQIEALRFVDAPLPSPGPLLGKTPLVDYAQQSTPRKAQDSVVITKMHEDRQF 
VIEAQPAKRIKTSDLVVASSSQQPTSFNFSFNNQNSSNSVPELQSSKEERLRRFTSSGGSMIPISECPDLDNSPKMIRTPLLSGGSFQDVSVKVN 
WETGSSSKERKLMALVSGSGLLGVSSGPQHFQFPPINSITAFNPLTLPPMSGGDKTTPVTPIPHVPGMPGPGSLTPQMPLLPPPPQQLQLPIPSS 
RGRSPNRKQPSPLLLGGGSGELKALSPFGGVQNVPSEFSRQPPTPAQRQALQWNSKEAPKKAPFNFLRMADNVKTTEPEVRHFNLENVISGKQQE 
LPLTPLHVDTPNGNAPEETSPVASASASAKSKFLRPTSLPLKPGTFTPKRHHGITPTANTLPLISPETPRPSKSCVQLYLNGHAYTYLGLKCSTK 
MFYCTVNCPQPSYVAGMHKLSMYSWQVCEENQPHPLGFKLKQVMALYDSRQRMLGNGSSTAMAGSGKLSYNLVASQQIVSSPSTSSTSSAFYQG 
PLKTPPTVTIAALSEANVAAKANEEAQAKKLETSPSGQPLVGGYESHEDYTYIRGRGRGRYVCSECGIRCKKPSMLKKHIRTHTDVRPFTCSHCN 
FSFKTKGNLTKHMQSKTHFKKCIELGINPGPMPPDSEFLDVDMDFDQQSSTSAGGRTSSMAGESDSDDYSDNESESSDTDESKSRQKEHEAARGL 
LSLSMTPPIPQSVSPYPQLQDTPLPAASPANSIGSSGSQPKRLVCSFTSPKPPFDYQKQEQYYSNPEESKPKRSVANEESAPMDLTKPRGSILLI 
SPSPVSVPAHDLPKSQAQQMHDVIFGTSGNESGFMKTLISVSDKVRISAEMEEQAKHEAEGEDVQLQTYIKEHALHQAKIKQSQFSRSYLINTLY 
TAASPVMSSSSTLFTANSRPVMSVNEVPSIEVHEVKTPEAIESPRSAPEQAPVILAQIAQEENEEPNIAEPHNANLPAAVQQPDVNEFTGVLGNP 
TAPPTSSVTATSVSTTTAAPVAPSSTANSTQPAQRTVIVGEDGFKSSTPTSKSGDLQHVSYGRGVPPAPIAGDARPTCTMCSKTFQRQHQLTLHM 
NIHYMERKFKCEPCSISFRTQGHLQKHERSEAHKNKVMMTSTFGVPTTSNPRPFECTDCKIAFRIHGHLAKHLRSKTHVQKLECLQKLPFGTYAE 
IERAGISLTEIDTSDCENSLISLKLLAQKLQEKDPGKLSSYTTPSGMMQLAQDAAGPVSQDSASEDGFSAAIASAIASLDNDSAGNTPKRASSMS 
EDETVANGLNHSLKRRLPGSFSSTGEESDNPPESSGEKRARSGQELPVPVAVPVAASAAASN* 
(SEQ ID NO: 876) 

Name: schnurri 

Classification : transcription_f actor 
Gene Symbol: shn 
FlyBase ID: FBgn0003396 



Celera Sequence No. : 142000013384684 

CACATACGCTAGCCAAATGCGTGGGACCCTCGAATATTCGGTGGTCCATTAAGTAGCCTAAGAATTGCACCACTGACCAGTATCCAAATATTATG 
ATCTTATTTATCTAAGCCTAACAACAGAAAACACAAGCAAGTAATTATTCTCTCTACCAGCTAAGATATGAAACGCAATAAAGCCTAAACATATT 
CATATCACCACAGACTTTCGTTCTGCCCAGAGGAATGCAAAGCTGATCGGTCATTAATCAACGTACAGGATTGCAATTGCAGTGGCACTTTGAAT 
GGCCGCTTGGCCAGTCGAGGGTTAACGAGCTAGGCCAATTGCCAGTTGCAGTGGCAGTTTCCGTTGCAGACAGTAGCCAGCGGGTCTTAGTAGTT 
GCAATTGCATCCCGCTTTGTGGGTAATCGGAGGGTTGGTCAAAGGCCACTCATAAATCAAGCAGCAGGACGGCTACTGAACATAACTGAAATTGG 
ACCTCAGTGGGTCACGAAGCTTTACATATCATTAGGAACTACTTTCTTTATGATGACTACACTACTTTCGCATATGCGGTTGTAAACTTTCGTGC 
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CGAAAATTCCACTATCAGCATGCGAAAATGGATCACATCCAGAGAAAACTTACCACAAAGCATCGTAGCATCCATTCCTGACCCCTTTTAACAAA 
AATGCCCAAATACACATAAATTGCAAATTTTCACATTGGAAAGACACTTTTCCATTGAGTACACTTTAATAAGAATTTAAATAAATTAAAACCGA 
TCATTATATTCTTAAACATATATAAAAAAACATAAATTAATGCAGATATGGAATCCCAATTGCTGGTCTTGACTAGCACATTTGAAAAGTGTTCG 
GTAGAACGTAGAGCTTTGCAAATGACTCATTGGTGGGAAATTTGACCAAAGCGATGCAATTACTATTATCTGACAACCGTCTAACGAATAACATA 
AGTTAAGGATAATAGATTACATTATTTACAAATACCTAGTAGCTAGCCCTTTAATCGTCGAAGAATAGTATTTCTAGGATCCGTTTCTGTGCACT 
TTTCTTCCGTTCCTTGTTCATTTTCAGCAGTTCCGTGGTGACTAATTCTCCCAGCAAATGCTCAGACAACCGCTGCACACCCGGATGATTGGGCG 
GCGCCTGGAATTTCCCGAAGTTTGCGGGATGCGAACTGCTGGCCTGATGTGGGAACTGCTGGTATAACTGGAAAAGGAAGTCCGTGTTTGACGGA 
AACATGTTCTGGTGCTGTTGCTGCTGCTGGTTGTTGTGGCTATTATTGCCGCTGGGATTGTTGCTCCCGCTATTGCTGCTACTGTTATTCCCATT 
TCCATTGCCCCGCTGCATGGGCATCGAATTGCTGTAGTGATTCTCCTGCTGCATCATAGCCTGCGGCTCCATCAGATCATGGCTGTTCTCATCGG 
ATTCGTCGTCCGAGTCATTGCCAAAGTGACTACCATTGCTGGTCATCTGGTTATGGGCATTGCCATTTAGTTGAATACGGGGTTTCTTCATATGG 
AAATGGCTCGATGACGAAGAGTCTAGACCATCGCCCTTGGGGTTATAGACATGTGACTGTTGCATCTGCAGAGTGGAACTGTTGCCGATGGAAGC 
GCCCGTGGCGCTGGAAGAGGAATGTACCACCGGAAGCTGCTGTTCAGATGCCGGCGACTGTTGCTGCGCTTGTTGCTGCTGCTGCTGCTGTTGTT 
GTTGCGTCTGCTGCTGAGGATGATGGGATCCTGCCCCAATTCCTTGCAGCGGCGACGACAGCGGTGATTTCATCGAGGATGATGTTGAGGTCAGA 
CTGCCAGCGCTGTTCTGGGCCCCGTTCATGCCCACGGAGCCATCCTTGTATTGATCCTGATAACCTTGCGAGGAATCAGCCATTACAGCTGCAAC 
GGCCGCCGCCTGTTGCTGCATTTGTGACTGTGATATGTGCTGGAGCTGCTGTGAGGCCACGGCAGCCGCAAAATCCCTGAACACCTGGTCCGCCT 
CGATCTTCACCTGGCCATTGACGTTTTCCAGCGCAGATGCAGCAACGTTACTCAGAGCACTCATTGCGTGTTGCTGATCCGGCTGGTGGTAAAGA 
TGACTTTGCCCTGAGCTGCCAAACTGAGACACATTCTTCATGGAGCCATTGGACTTATCAACCTGGTATTCGTTGTAGCCGGAGCTGTTGGCGGA 
CTGTGGAGATGTTAGGAAGTTCGGCACATGGCGGTAGGACTTGCGCGGCTGGATGTGCTGGTCCAAGAACTTCATTTTCTCCAGGTATGGACGCG 
ACAGGTGTTCGTAGACAGGTGGATCGCCCTGCTTCCGCTGCGACACGTAGCGTTCGCGTAGGTACTTCCACCGCTTCTTGCACGTGTCCACTGGA 
AAGGAGGGCAACATTTATATAGAGGATCTGCCGGAGGAGCGCGTCTTCCTCACCATCCGTGCGCAGCTTCATCGCAATCAGCTGCCAGGCCTCGT 
CCTTCGTCTCGTACTTGCCACCATTGGCGCCGCCGTTGAGGTAGTACTTCTGCCGGTTGTAGATCACTCCGTGCTGCGCTACCTCGTCAATTAGC 
TGCTCATCCATCATTTTAGTTTAATTCGTGTTTTCAGCTGCCAATTTGCCTTTGGTGTATGTCAGGCCATTTCCAACACTGTTCATGGGACATCG 
AAACAGTGTGACCGAAAATGTAGCAGCAGAGTGTCAAATATAATTTGAAGATATTGATATCGAAATATATTCGACAATTTGCATCATGTTTAAAT 
TCGTCAGCATTTTGATTTATATATAGTAATGCATTTAATTAATTAACACGCAATAAAATTAATAATTTCAGCAAAAGTAAATCATTTGCTGTTCC 
TGAATTTACTAAATAAAAATGTTTAAAAATTTAACTTATAAGTTACCTTCACAAAATTCTCGAAAAATAAAAGTCAGCTATAAAATAGTTGCTAT 
AAACGTATTATGGTTATTTGAAATTTTAAGGTGGGTTCTTTTGCGAACCGAGATTCAGTCACACTGGCAATTTGGTACCCGAAGTTGAATTGCCG 
TTTTGTGAAGCGGATAGTTACCTGCCGATAATCTTAAATAAAAATGTTTAAACTGGCCCGTATGCTCCTGCCGCAGCAGCGGATCCTGGCCAGCC 
CGCTGCGCCTGCAACGCCTGATCTCTACCAGCGACGAGGTCAACGCAGAGCCCATCATCAAGTCCATGGACACCATTGGCGGCCTCCCCACCGAA 
CTGGTCAACGAACAGAAGCTGAAGAAGACTAGCAGGTAATCAATCTACCGGTTTTCGCACTTGACCTTTGCCTTGCCTGTTTGTTTTGTTTACAT 
TTCGACCGGTATGGGCATGGGCATGGTATGCATGTATCGGAGGCCTGTTTTGGGCGTGATTTTCGAAAAGGAGTTTCGGGGTCTTTTTTTCTTGA 
TTTCAAGTGGGGGAGAAAGTTTGTATCGAGCCGCTTATGCAGTCACGTAGACCATAGATGCGTGCATGTGTGTGTGTATGTATTTGTGCCTGCCT 
GGTGGTGTCAGTTATGGGCTCTATTGTTCTTGACTTTTGTTTTGTCCACAGAACCTTATCGACGCTCCAAAATCACTCGGTTCCCATTGCCGCTC 

GCGTCACGGTGTCGAAAGATG 
(SEQ ID NO: 877) 

Exon: 2631.. 2429 
Exon: 2370. .1001 

Start ATG: 2579 (Reverse strand: CAT) 
Transcript No. : CT23567 

CCTGACATACACCAAAGGCAAATTGGCAGCTGAAAACACGAATTAAACTAAAATGATGGATGAGCAGCTAATTGACGAGGTAGCGCAGCACGGAG 
TGATCTACAACCGGCAGAAGTACTACCTCAACGGCGGCGCCAATGGTGGCAAGTACGAGACGAAGGACGAGGCCTGGCAGCTGATTGCGATGAAG 
CTGCGCACGGATGTGGACACGTGCAAGAAGCGGTGGAAGTACCTACGCGAACGCTACGTGTCGCAGCGGAAGCAGGGCGATCCACCTGTCTACGA 
ACACCTGTCGCGTCCATACCTGGAGAAAATGAAGTTCTTGGACCAGCACATCCAGCCGCGCAAGTCCTACCGCCATGTGCCGAACTTCCTAACAT 
CTCCACAGTCCGCCAACAGCTCCGGCTACAACGAATACCAGGTTGATAAGTCCAATGGCTCCATGAAGAATGTGTCTCAGTTTGGCAGCTCAGGG 
CAAAGTCATCTTTACCACCAGCCGGATCAGCAACACGCAATGAGTGCTCTGAGTAACGTTGCTGCATCTGCGCTGGAAAACGTCAATGGCCAGGT 
GAAGATCGAGGCGGACCAGGTGTTCAGGGATTTTGCGGCTGCCGTGGCCTCACAGCAGCTCCAGCACATATCACAGTCACAAATGCAGCAACAGG 
CGGCGGCCGTTGCAGCTGTAATGGCTGATTCCTCGCAAGGTTATCAGGATCAATACAAGGATGGCTCCGTGGGCATGAACGGGGCCCAGAACAGC 
GCTGGCAGTCTGACCTCAACATCATCCTCGATGAAATCACCGCTGTCGTCGCCGCTGCAAGGAATTGGGGCAGGATCCCATCATCCTCAGCAGCA 
GACGCAACAACAACAGCAGCAGCAGCAGCAACAAGCGCAGCAACAGTCGCCGGCATCTGAACAGCAGCTTCCGGTGGTACATTCCTCTTCCAGCG 
CCACGGGCGCTTCCATCGGCAACAGTTCCACTCTGCAGATGCAACAGTCACATGTCTATAACCCCAAGGGCGATGGTCTAGACTCTTCGTCATCG 
AGCCATTTCCATATGAAGAAACCCCGTATTCAACTAAATGGCAATGCCCATAACCAGATGACCAGCAATGGTAGTCACTTTGGCAATGACTCGGA 
CGACGAATCCGATGAGAACAGCCATGATCTGATGGAGCCGCAGGCTATGATGCAGCAGGAGAATCACTACAGCAATTCGATGCCCATGCAGCGGG 
GCAATGGAAATGGGAATAACAGTAGCAGCAATAGCGGGAGCAACAATCCCAGCGGCAATAATAGCCACAACAACCAGCAGCAGCAACAGCACCAG 
AACATGTTTCCGTCAAACACGGACTTCCTTTTCCAGTTATACCAGCAGTTCCCACATCAGGCCAGCAGTTCGCATCCCGCAAACTTCGGGAAATT 
CCAGGCGCCGCCCAATCATCCGGGTGTGCAGCGGTTGTCTGAGCATTTGCTGGGAGAATTAGTCACCACGGAACTGCTGAAAATGAACAAGGAAC 
GGAAGAAAAGTGCACAGAAACGGATCCTAGAAATACTATTCTTCGACGATTAA 
(SEQ ID NO: 878) 

Start ATG: 53 (Reverse strand: CAT) 

1MDEQLIDEVAQHGVIYNRQKYYLNGGANGGKYETKDEAWQLIAMKLRTDVDTCKKRWKYLRERYVSQRKQGDPPVYEHLSRPYLEKMKFLDQHI 
QPRKSYRHVPNFLTSPQSANSSGYNEYQVDKSNGSMKNVSQFGSSGQSHLYHQPDQQHAMSALSNVAASALENVNGQVKIEADQVFRDFAAAVAS 
QQLQHISQSQMQQQAAAVAAVMADSSQGYQDQYKDGSVGMNGAQNSAGSLTSTSSSMKSPLSSPLQGIGAGSHHPQQQTQQQQQQQQQQAQQQSP 
ASEQQLPVVHSSSSATGASIGNSSTLQMQQSHVYNPKGDGLDSSSSSHFHMKKPRIQLNGNAHNQMTSNGSHFGNDSDDESDENSHDLMEPQAMM 
QQENHYSNSMPMQRGNGNGNNSSSNSGSNNPSGNNSHNNQQQQQHQNMFPSNTDFLFQLYQQFPHQASSSHPANFGKFQAPPNHPGVQRLSEHLL 

GELVTTELLKMNKERKKSAQKRILEILFFDD* 
(SEQ ID NO: 879) 
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Celera Sequence No. : 142000013384668 

CGCACGCTTCACAAGGATCTTCGTCTTCCTCTGGCTTGCAACGAACACAGTACGCAGTGGGATACCACCGATCATTGCCGGACGTAGATCTTTCA 
TGGTGCACACCTAAGAACTCTTCATTTGGTGGCTCAGGCTCTAGAATCATCTTGTAGAAGGTGTACCCGCACCTGGTTTCACTGCAGCTGGGACT 
TAAAAAAGCTAGAGGCGATATGAATGTGTCCTCCGAAAAATTGGGTGGGAAATGTCCTTCCTTAATGGGACAGTTAGCTTCCTCCTGTGTAAAAA 
TAGAAGATAATGGTTCACTATTCAATATCCAGTGCAAAAACCATCCACTCACCTCCAAGTTTCCCAAGAAATGCAAGTCCTCGAAGTAGGAGTAC 
CTTGGACTATAACTATAACCCTCAATTTGACTATTTCGGATGGCGGCCAATTCCTTGGAGTAGTAGTGCCGCAGACCGAGCACCTGCAGCTCCAC 
CTGGTCAGGAGTCAGGCCGCAGTTCATCCGCCGGGTGATTTGTCGCCAGGCGTCATCCTTCGCAGAACCACTATGGAAACCAGGTGACTTTGGAT 
TCCAGAGGCACTCGTTCGATCGATAAAGTCGGATGAGCTTCTGCGTCTTCTTCAATCGCTCCAAATTCAAATACGACGTAATATCCTTCTGTGCC 
ATCGTGCCCCTCTGTTCTCCACAATCCTTTCCAGCGACTCGGTGATCAATCTTCCATAGCGTTTTCCTATTGAACCTACAGAGGTTGTATGCCTA 
GTTTGGAGCTATATATTGCGCCCAAGCCGAAATATTTGGACAGGAAAAACAAAAGTGACAATAAACAAGCAATTTTGAATAATTTTTGAATGATT 
TTTAAACGGTATCGCTTAATTAGGAATATTATATGCCGTTACCATTTTCAAGATCTGCAGGGTGTCTCATACATTCGAAGCTTTTGAGAGTATTA 
ATTCAGTATTATTGTGCAATTAAATGGCGATTGGTATTTAGCTGGGCGAGCAGAACGGTCACACTGGCGCACAGTAGCGAAAATAATAGTCGAAT 
AATAAAATAGTTTTGAGTGCAACTAATTCCGACCTACTAACGTATGCGAGTGCATTTAGCGGCTGCACTTAATCATTCCGGGCAAAAAAGTGCAA 
AACAGTACGACGCGGAGCAGTCAGAATCGAGGTTGCGCTCCAGCGTTATCGCCATTTGACCAATTCAACTTTTGTGTCTGGTATAACCACAAAAT 
TTTCACTGCATCTTCTTATTTGAGCCGCCCCCAACCCAATTTTCGCGAGTGTGTCGGAATCAGCAGAGAATCGGAACAAAAGGGGCCAAACGGAG 
TACCAGAATCGGCACAAGCAAACAGTGAACAGCAGATAGTGCGGACGATAGCGAGTCCGTTGATTATCCACGTAAACAGTGAGCTGCATGCGCAG 
AGGTCGCGCCCAGAGCTGCAGAAATCCAACAATCTCCGCCAGAAAGATCTATCTCCGCCCTGTAAGCAATGTCCGCGGAACGAGAGATTCCCGCC 

G AGGAC AGC AT C AAAGT G GTCTGCCGATTCCGACCGCT G AAC G AC AGC G AAG AG AAGGC C GG C T C C AAG T T C GT GGT C AAG T T C C CC AAC AAT GT 
GGAGGAGAACTGCATATCCATAGCGGTATGTGTTCCAGATCTATAGTTTTCGCCAGCCCCCAGTGCCTTTTCCCTATGTGTTATGCATTAATCTC 
GGCGTGCCGCGC CAT G AC G T C AT GC C T AT T AGCGT AC AT T AC C C T T C C C C AAAAT T G C C G AC C AAT T GG AAT GC C AGGC GAT AAAAAG AG AG AG G 
C AC T AAAGT TACC CAT GAAT AAT T GC AC AAAAC AAAAC TAG AT GT AC AT AT AC AT GT AT AT G ATG AAACC AAC T AACTT T AC T C C AAAT C AAT AT 
TGCTTTGCACCAAATTGATTTTCCCCCAGTCGAAAAAGAATAGGGTGTGTAAATCGAAAGAAGCGCGATGACGGGGTGGAGAAATGCCACCCACT 
AGACGGTTTTCTGCTAAAAGCGGGTGTTCCATTCTTTTGGGCCGCATAAATATGCGCTTTTCTAGTTTCCAGTCCGCCTTCAGTGCTTACCTCAC 
TTACATTTCCACACAAACACATACACACATTGACGGTAGCCCCCGCTCCACTTCCACTCCCCTGTCCGCGCGCCAATGAGTCATGCACGCAAAGC 
^TACAAAAAAAAAAAATATGAGGAAAAACAATAAGAGCAACTTTGGAGGCAATGGAATTTGGCTGTATTGCTTTTATGCCACATGAAATGTTATT 
GCTTTGTATGCTATGCTGAATTTCCCGCTTTTCCGCCAGGTGTAATCCGTCATATTCCTGGTAAAGCTTGGGCAATTGAACCAGTTCCAAAAAAG 
TCAGCACTAATATAGTGCCTCCATACACAGACATTATCATTACGCTCTCTCTCTCTCTCTGACTAACTCTATCTATCTCTCTTTCGCTGTGGCTT 
TCTTCTGGCGCTCGTTATTCACTATTCTTCCCCCTTCCACTTAACATGTCTTAATGCTCATAATTACAATCTGCTTCGCGGCCATGCGTTGAAAA 
TGATTCAAAGGCAGTCAGCGGGAAAAGATCGGGGACAGGGGGAATGAATCTCCGAAAGAAATTTCACCGCAACTGACACCAGTTTATAGCTAATG 
TAACTGGCTGCTATGTACTAAAGAAAAAAACACATTATACTAAACCTAATCTCCATCGTCTGCAGGGCAAGGTGTATTTGTTCGACAAGGTCTTC 
AAACCGAATGCATCCCAGGAGAAGGTCTACAATGAGGCGGCCAAGTCCATTGTTACGGATGTCCTGGCCGGGTACAATGGAACGATATTCGCATA 
TGGTCAGACGTCCTCCGGAAAAACGCATACGATGGAGGGCGTGATCGGGGACTCCGTAAAACAGGGTATCATACCACGTATCGTCAACGACATTT 

T C AAT C AC AT CT AC GC G AT GG AGGT G AAC C T GG AGT T T C AC AT C AAG GT CT C C TACT AC GAG AT C T AC AT GG AC AAG AT T C GAG AT C T GT T GG AC 

GTCTCCAAGGTGAACTTAAGCGTGCACGAGGACAAGAACCGGGTGCCGTACGTCAAGGGCGCTACGGAACGGTTCGTCTCGTCGCCGGAGGATGT 

TTTCGAGGTGATCGAGGAGGGCAAATCCAATCGTCACATCGCTGTGACAAGTGAGTACTAATTAAAATCGATGGTCAGGACCATTTCAGCCAAAC 

AAGTCTAGGAGTTATTCCAGCATTCGCTCTGACCTCGATTGTAGTGTTTCATGTACCAGATCATCTTTTATAACGATTTATCCATCCTTTCAGAC 

ATGAACGAGCATTCTTCGCGATCCCACTCAGTATTCCTTATCAATGTGAAGCAGGAGAACCTGGAGAACCAGAAGAAACTATCCGGCAAACTCTA 

CCTGGTGGATTTGGCCGGTTCCGAGAAGGTTTCCAAGACTGGAGCGGAGGGAACCGTTCTTGATGAAGCCAAGAACATCAACAAGTCGCTGTCGG 

CCTTGGGCAACGTAATTTCAGCCCTGGCGGACGGAAACAAAACGCACATCCCCTACCGTGATTCCAAGCTCACGCGCATCCTGCAGGAGTCGCTG 

GGAGGCAATGCACGCACAACCATCGTCATCTGCTGCTCTCCAGCCAGTTTCAACGAATCTGAAACGAAGTCAACGCTGGACTTCGGTCGTAGAGC 

CAAGACAGTGAAGAACGTGGTCTGCGTTAACGAGGAGCTTACTGCCGAGGAATGGAAGCGACGCTATGAAAAGGAGAAGGAAAAGAACGCCCGAC 

TAAAGGGTAAGGTGGAGAAGCTGGAGATCGAGCTTGCGCGCTGGAGAGCGGGTGAAACTGTTAAGGCGGAGGAGCAAATCAACATGGAGGATCTC 

ATGGAGGCAAGCACGCCCAACCTGGAAGTGGAGGCAGCACAGACGGCGGCGGCCGAGGCCGCTTTGGCCGCCCAGCGAACGGCTCTCGCCAATAT 

GTCCGCATCGGTTGCCGTGAACGAGCAGGCCAGGCTGGCTACAGAGTGCGAGCGTCTCTACCAGCAGCTGGACGACAAGGATGAGGAGATCAATC 

AGCAGAGCCAGTACGCCGAGCAGCTCAAGGAGCAGGTGATGGAGCAGGAGGAACTCATCGCTAACGCTCGGCGTGAGTATGAGACTTTGCAGTCG 

GAGATGGCGCGAATCCAACAGGAGAACGAGTCCGCCAAGGAAGAGGTTAAGGAGGTGCTCCAAGCTCTCGAAGAGCTGGCTGTAAACTACGACCA 

GAAATCCCAGGAGATCGATAACAAGAACAAGGATATCGATGCCCTCAACGAGGAGCTGCAGCAGAAGCAGTCTGTGTTCAACGCCGCCTCCACAG 

AGCTACAGCAGCTCAAGGACATGTCCTCACACCAGAAGAAGCGCATCACGGAAATGCTAACCAACCTACTGCGCGACCTTGGCGAAGTGGGCCAG 

GCCATTGCCCCCGGCGAGTCCAGCATTGACCTCAAGATGAGTGCTCTGGCTGGCACGGATGCCAGCAAGGTGGAGGAAGATTTCACCATGGCGCG 

TTTGTTTATCAGCAAGATGAAGAGGGAGGCCAAGAACATTGCCCAGCGATGCTCCAACATGGAAACACAGCAGGCTGACTCCAACAAGAAGATCT 

CCGAATATGAGAAAGATCTGGGCGAGTACCGGCTACTCATTTCGCAGCACGAGGCACGCATGAAGTCGCTGCAGGAGTCGATGCGGGAGGCAGAG 

AACAAGAAGCGCACGCTCGAGGAACAAATCGATTCGCTGCGCGAGGAATGCGCCAAGCTCAAGGCCGCCGAGCACGTTTCCGCCGTTAACGCCGA 

GGAGAAACAGCGGGCTGAGGAGCTGCGCTCCATGTTCGATTCTCAGATGGACGAGCTACGCGAAGCCCACACCCGGCAGGTGTCCGAGCTCCGGG 

ATGAGATTGCCGCCAAGCAGCACGAAATGGACGAGATGAAGGATGTCCATCAAAAGCTGCTCTTGGCGCACCAACAGATGACGGCCGACTACGAG 

AAGGTGCGCCAGGAGGATGCCGAGAAGTCCAGCGAGCTTCAGAACATCATCCTCACCAACGAGCGTCGGGAGCAAGCGCGCAAAGACCTCAAGGG 

CCTGGAGGACACGGTGGCCAAGGAGTTGCAGACGCTACACAACCTGCGAAAACTTTTCGTTCAGGATCTACAGGTAAAGTTTAAGCAATCCTATC 

TATTATTGCTACGTACAGATCTTAAAATGTATTTCCCTTGTGCCAACAGCAACGAATCCGAAAGAATGTCGTAAACGAGGAGAGCGAGGAGGACG 

GTGGATCACTCGCGCAGAAACAGAAGATTTCCTTCTTGGAGAACAACCTCGACCAGCTGACCAAGGTGCACAAGCAATTGGTGCGGGACAACGCC 

GATCTGCGGTGCGAGCTGCCCAAGCTGGAGAAGCGTCTACGCTGTACCATGGAGCGGGTGAAAGCTCTGGAGACAGCGCTCAAGGAGGCGAAGGA 

GGGCGCAATGCGGGATCGCAAGCGCTACCAATACGAGGTGGACCGCATCAAGGAGGCGGTGCGACAGAAGCATCTGGGCAGACGTGGCCCACAGG 

CACAGATCGCCAAGCCGATCCGGTCCGGCCAGGGTGCAATCGCCATTCGTGGTGGTGGCGCCGTTGGAGGACCATCCCCGCTGGCCCAGGTTAAT 

CCTGTCAACTCGTAGATCCAATCACCACCTGTCGCCGCCCAGTTCAGCTCCGCTTTAAACTAAACTAAATGGATTACGTTATACATACTTAACAT 

AACTGATAATTGCCTTCGCTTAGATGAGATGTGTCGCGATCATGTGCAGCGCTTTAAATATACATACATATAATTTAATTAAATAAATGAAAGGA 

AACCGGAAATTAACTAAATTTTACAAACCGAAAATAATAAAACCCACAGATATGTAAGGACATCTATATACGTTAAGAGTATTTATAAACTTTTC 

AAACATAAACCTAAATAAAAGTCGCAGACAAATTAAGGTCTCTTTAGTTTGATTCTTATTGCAGTTCAAAAAAATTATGCCGATTATTTGGCTAT 

TGGTTGTTATTACATCGTCTGCTGTTACAATCGCATGTTGTAACCTGTCCCGCGTGTGTTATGGTGCTCCCAATCGTCTGGAGAATTGACCACCT 

GCCATTCGTACCACACTTTGGTTGCATCCACGCATCGCCAAAACTGGATGGATATAGTCTGTCCCTCCCTCAGAGTTCTTGGACGCTGAGTAGTA 

AAAATATATATTAATGTAGCAGTTATCAAAGGCTAACTAATCACATCACACGTACCGTTGCGAAGAACATGGGGAACCAGGAGAACATGCCAGGC 

GTATGGGTCAGAGGATTGATGCTTAGGCAGATGTCCTTGTACAAATGGGTATCAAAGTATCCGCCGATTCCGTGCAGCACGCAATCTTTATTGAC 

TTTAAATGAAACCGTCTTGCAGCGAGTATTGTCAATATTTTCCGCTCGATTAGGATGTACGAATTCAAACAGAGCCTGCGGCTCGTCAATATGGT 

AAATATTCTTTAGCAGCGACACATAGCCGTAGTCGAAGGCAGGATAGGTTGGTAACAATTGGCAGACATTTTGGTGCAGCACTGCTGACATCAGT 
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GGATTAATGTAGGAGGTGGATTTGTAGGGTATACTGATTCCATCCGGTTTGAGCAGCTTCAGTGCTCCATCAAGGCACTCGGGCGAAAGTTCATT 

ATCGCCAAAGGAACCAAGTAACTCCGAAACCATGATGTCAGCCAGCTCTGGAGGGGAAAAGTCCCGCATGTCCTTTGAAAAAATGTGAACATCTG 

AAAGATTTGTTGCGGATTAAGACTGGATTAAAGCAAGGTGTTTTTTTTTTTCTTATTCTTACCCTTATCGGCCCAAAGCGTTTTAACCATGTTGG 

AGAGGGTGCGGATGGCATTGGGATTCTTTTCTATGATGTAGAGGCGAACTTTGCGCTTCGTTAACTCGGCAGCATTGAA 

(SEQ ID NO: 880} 

Exon: 1001. .1640 
Exon: 2726. .3185 
Exon: 3324.. 5203 
Exon: 5275.. 6014 
Start ATG: 14 94 

Transcript No. : CT23616 

CAGAACGGTCACACTGGCGCACAGTAGCGAAAATAATAGTCGAATAATAAAATAGTTTTGAGTGCAACTAATTCCGACCTACTAACGTATGCGAG 
TGCATTTAGCGGCTGCACTTAATCATTCCGGGCAAAAAAGTGCAAAACAGTACGACGCGGAGCAGTCAGAATCGAGGTTGCGCTCCAGCGTTATC 
GCCATTTGACCAATTCAACTTTTGTGTCTGGTATAACCACAAAATTTTCACTGCATCTTCTTATTTGAGCCGCCCCCAACCCAATTTTCGCGAGT 
GT GT C GG AAT C AGC AG AGAAT C GGAAC AAAAGGG GCC AAAC GG AGT AC C AG AAT C GGC AC AAGC AAAC AGTGAAC AGC AG AT AGT G C GG AC G AT A 
GCGAGTCCGTTGATTATCCACGTAAACAGTGAGCTGCATGCGCAGAGGTCGCGCCCAGAGCTGCAGAAATCCAACAATCTCCGCCAGAAAGATCT 
ATCTCCGCCCTGTAAGCAATGTCCGCGGAACGAGAGATTCCCGCCGAGGACAGCATCAAAGTGGTCTGCCGATTCCGACCGCTGAACGACAGCGA 
AGAGAAGGCCGGCTCCAAGTTCGTGGTCAAGTTCCCCAACAATGTGGAGGAGAACTGCATATCCATAGCGGGCAAGGTGTATTTGTTCGACAAGG 
TCTTCAAACCGAATGCATCCCAGGAGAAGGTCTACAATGAGGCGGCCAAGTCCATTGTTACGGATGTCCTGGCCGGGTACAATGGAACGATATTC 
GCATATGGTCAGACGTCCTCCGGAAAAACGCATACGATGGAGGGCGTGATCGGGGACTCCGTAAAACAGGGTATCATACCACGTATCGTCAACGA 
CATTTTCAATCACATCTACGCGATGGAGGTGAACCTGGAGTTTCACATCAAGGTCTCCTACTACGAGATCTACATGGACAAGATTCGAGATCTGT 
TGGACGTCTCCAAGGTGAACTTAAGCGTGCACGAGGACAAGAACCGGGTGCCGTACGTCAAGGGCGCTACGGAACGGTTCGTCTCGTCGCCGGAG 
GATGTTTTCGAGGTGATCGAGGAGGGCAAATCCAATCGTCACATCGCTGTGACAAACATGAACGAGCATTCTTCGCGATCCCACTCAGTATTCCT 
TATCAATGTGAAGCAGGAGAACCTGGAGAACCAGAAGAAACTATCCGGCAAACTCTACCTGGTGGATTTGGCCGGTTCCGAGAAGGTTTCCAAGA 
CTGGAGCGGAGGGAACCGTTCTTGATGAAGCCAAGAACATCAACAAGTCGCTGTCGGCCTTGGGCAACGTAATTTCAGCCCTGGCGGACGGAAAC 
AAAACGCACATCCCCTACCGTGATTCCAAGCTCACGCGCATCCTGCAGGAGTCGCTGGGAGGCAATGCACGCACAACCATCGTCATCTGCTGCTC 
TCC AGC C AGT T T C AACGAAT C T GAAAC G AAGT C AAC GC T GG AC T T C GGT C GT AGAGC C AAGAC AGT G AAG AAC GT GGT CTG CGT T AAC G AGG AG C 
TTACTGCCGAGGAATGGAAGCGACGCTATGAAAAGGAGAAGGAAAAGAACGCCCGACTAAAGGGTAAGGTGGAGAAGCTGGAGATCGAGCTTGCG 
CGCTGGAGAGCGGGTGAAACTGTTAAGGCGGAGGAGCAAATCAACATGGAGGATCTCATGGAGGCAAGCACGCCCAACCTGGAAGTGGAGGCAGC 
ACAGACGGCGGCGGCCGAGGCCGCTTTGGCCGCCCAGCGAACGGCTCTCGCCAATATGTCCGCATCGGTTGCCGTGAACGAGCAGGCCAGGCTGG 
CTACAGAGTGCGAGCGTCTCTACCAGCAGCTGGACGACAAGGATGAGGAGATCAATCAGCAGAGCCAGTACGCCGAGCAGCTCAAGGAGCAGGTG 
ATGGAGCAGGAGGAACTCATCGCTAACGCTCGGCGTGAGTATGAGACTTTGCAGTCGGAGATGGCGCGAATCCAACAGGAGAACGAGTCCGCCAA 
GGAAGAGGTTAAGGAGGTGCTCCAAGCTCTCGAAGAGCTGGCTGTAAACTACGACCAGAAATCCCAGGAGATCGATAACAAGAACAAGGATATCG 
ATGCCCTCAACGAGGAGCTGCAGCAGAAGCAGTCTGTGTTCAACGCCGCCTCCACAGAGCTACAGCAGCTCAAGGACATGTCCTCACACCAGAAG 
AAGCGCATCACGGAAATGCTAACCAACCTACTGCGCGACCTTGGCGAAGTGGGCCAGGCCATTGCCCCCGGCGAGTCCAGCATTGACCTCAAGAT 
GAGTGCTCTGGCTGGCACGGATGCCAGCAAGGTGGAGGAAGATTTCACCATGGCGCGTTTGTTTATCAGCAAGATGAAGACGGAGGCCAAGAACA 
TTGCCCAGCGATGCTCCAACATGGAAACACAGCAGGCTGACTCCAACAAGAAGATCTCCGAATATGAGAAAGATCTGGGCGAGTACCGGCTACTC 
ATTTCGCAGCACGAGGCACGCATGAAGTCGCTGCAGGAGTCGATGCGGGAGGCAGAGAACAAGAAGCGCACGCTCGAGGAACAAATCGATTCGCT 
GCGCGAGGAATGCGCCAAGCTCAAGGCCGCCGAGCACGTTTCCGCCGTTAACGCCGAGGAGAAACAGCGGGCTGAGGAGCTGCGCTCCATGTTCG 
ATTCTCAGATGGACGAGCTACGCGAAGCCCACACCCGGCAGGTGTCCGAGCTCCGGGATGAGATTGCCGCCAAGCAGCACGAAATGGACGAGATG 
AAGGATGTCCATCAAAAGCTGCTCTTGGCGCACCAACAGATGACGGCCGACTACGAGAAGGTGCGCCAGGAGGATGCCGAGAAGTCCAGCGAGCT 
TCAGAACATCATCCTCACCAACGAGCGTCGGGAGCAAGCGCGCAAAGACCTCAAGGGCCTGGAGGACACGGTGGCCAAGGAGTTGCAGACGCTAC 
ACAACCTGCGAAAACTTTTCGTTCAGGATCTACAGCAACGAATCCGAAAGAATGTCGTAAACGAGGAGAGCGAGGAGGACGGTGGATCACTCGCG 
CAGAAACAGAAGATTTCCTTCTTGGAGAACAACCTCGACCAGCTGACCAAGGTGCACAAGCAATTGGTGCGGGACAACGCCGATCTGCGGTGCGA 
GCTGCCCAAGCTGGAGAAGCGTCTACGCTGTACCATGGAGCGGGTGAAAGCTCTGGAGACAGCGCTCAAGGAGGCGAAGGAGGGCGCAATGCGGG 
ATCGCAAGCGCTACCAATACGAGGTGGACCGCATCAAGGAGGCGGTGCGACAGAAGCATCTGGGCAGACGTGGCCCACAGGCACAGATCGCCAAG 
CCGATCCGGTCCGGCCAGGGTGCAATCGCCATTCGTGGTGGTGGCGCCGTTGGAGGACCATCCCCGCTGGCCCAGGTTAATCCTGTCAACTCGTA 
GATCCAATCACCACCTGTCGCCGCCCAGTTCAGCTCCGCTTTAAACTAAACTAAATGGATTACGTTATACATACTTAACATAACTGATAATTGCC 
TTCGCTTAGATGAGATGTGTCGCGATCATGTGCAGCGCTTTAAATATACATACATATAATTTAATTAAATAAATGAAAGGAAACCGGAAATTAAC 
TAAATTTTACAAACCGAAAATAATAAAACCCACAGATATGTAAGGACATCTATATACGTTAAGAGTATTTATAAACTTTTCAAACATAAACCTAA 
ATAAAAGTCGCAGAC 
(SEQ ID NO: 881) 

Start ATG: 494 

MSAEREIPAEDSIKVVCRFRPLNDSEEKAGSKFVVKFPNNVEENCISIAGKVYLFDKVFKPNASQEKVYNEAAKSIVTDVLAGYNGTIFAYGQTS 
SGKTHTMEGVIGDSVKQGIIPRIVNDIFNHIYAMEVNLEFHIKVSYYEIYMDKIRDLLDVSKVNLSVHEDKNRVPYVKGATERFVSSPEDVFEVI 
EEGKSNRHIAVTNMNEHSSRSHSVFLINVKQENLENQKKLSGKLYLVDLAGSEKVSKTGAEGTVLDEAKNINKSLSALGNVISALADGNKTHIPY 
RDSKLTRILQESLGGNARTTIVICCSPASFNESETKSTLDFGRRAKTVKNVVCVNEELTAEEWKRRYEKEKEKNARLKGKVEKLEIELARWRAGE 
TVKAEEQINMEDLMEASTPNLEVEAAQTAAAEAALAAQRTALANMSASVAVNEQARLATECERLYQQLDDKDEEINQQSQYAEQLKEQVMEQEEL 
IANARREYETLQSEMARIQQENESAKEEVKEVLQALEELAVNYDQKSQEIDNKNKDIDALNEELQQKQSVFNAASTELQQLKDMSSHQKKRITEM 
LTNLLRDLGEVGQAIAPGESSIDLKMSALAGTDASKVEEDFTMARLFISKMKTEAKNIAQRCSNMETQQADSNKKISEYEKDLGEYRLLISQHEA 
RMKSLQESMREAENKKRTLEEQIDSLREECAKLKAAEHVSAVNAEEKQRAEELRSMFDSQMDELREAHTRQVSELRDEIAAKQHEMDEMKDVHQK 
LLLAHQQMTADYEKVRQEDAEKSSELQNIILTNERREQARKDLKGLEDTVAKELQTLHNLRKLFVQDLQQRIRKNVVNEESEEDGGSLAQKQKIS 
FLENNLDQLTKVHKQLVRDNADLRCELPKLEKRLRCTMERVKALETALKEAKEGAMRDRKRYQYEVDRI KEAVRQKHLGRRGPQAQIAKPIRSGQ 
GAIAI RGGGAVGGP S PLAQVN PVNS * 
(SEQ ID NO: 882) 

Name: KINESIN HEAVY CHAIN 



FIGURE 




Classification: motor_protein 
Gene Symbol: Khc 
FlyBase ID: FBgn0001308 



Celera Sequence No. : 142000013383872 

ATAAGGTGAAGAATGCGGCGCGAAAGCAAAAAGAAGAGGAAATCAAGGACGGCATTCGAGAAAAGTCGCGGTCGCCAAGTCCGATTGTGCTCGAG 
AAACCCCTACCCAAAAAGAGCAACAAGCGAGTTTATCGGTGAGTGTTGGTTTTCCATCAAAAATTTTCACACAAAAACTAGAAATATTTGCACTT 
ATTTTCTTGCGCAGATCCAAGAGTCGCAGTCGCTCCCGCTCAAAAACGCCGCCCCATCGAGATAGATCGTTGTCTCGCTCCCGATCCCGCTCGCG 
ATCCATGGAGCGCTCTTTAACGCCGCCACCGATGAATCGGAGCAGGTTGGGCGGTGGTAACAACCGCAAGGGTCGCAATCGATCTCCTCGCCACG 
AACGGGACAACAACAGCGGCAGCACAAATACCAGAGAGGGCCGTGGTGATCGTGGCAATTCCAATAGCCGAGACAATGGCAACAGCACCAACCTA 
AGACGAGTGGACCGCGACAGATCTCTAACACCTCCGAGCTTTCTGTAAATCATGGCTATAGTACAATTTCATGACAGTCGGACCAATGAGCTATC 
TTTGTTTTTCCAGGGGAAGCACATTTGCCAAGCCAGCGGATTTCATCGAGGAATCGAACAAGGGCCACCAAATGCTCATGAAAATGGGTTGGGCG 
GGCACGGGAACTGGACTGGGCTCAAAGAATCAAGGCATCGACACGCCCATTTCGGGAGGCGAGGTGCGCGATCGTCGAGAGATGTACAAGGTGAA 
GTGGCAAGAGTGCGCAAGAAAGGACAACTTATAATTTTCTATCCACAGGGAGTGGGCGCTAATATGCACGATCCCTTCGAGAGCTTTCGGAAGAA 
CAAAGGAGCTGCCTTCGCGCACCGGATGCGATCGCGAGACGACAAGAGTTAGGATCTGCCAACGGTCATAACTTCCCAGTTTTTTGTTTCCTTTA 
GCCCAGAAAATATACATTTTTATAAAAGCGAACTGAATTTTTATACAAGATTAGGCTATAACATTAAAGCTGGATTGAGCGGCCTTGGCATTTAC 

T TT T T C C AG GAC CT GGAC C AGGCC CT TGC C AG GACC GC AT T C G AAGG T GC GT GG AAAGT C GAC GC C C T GC T T GC GCT C GT AC AT C T C GT GG AG AG 
TCTGCTCCCACTTGACGGGCCTCACGATTTGCTTGGGCAGCTGCGTTAGGATGTGCTTGGCATGGCGATAGGGTTTGCCATCCACATTGGAGTAC 
ACTCTAATGACGGGGTCCTGAAGTCGCACCGTTTTCAGGGCTTTGGTAAATGGCTCCACGGCGCTTTGCATTAGAGGAGTGTGAAAGGCGCCACT 
GACAGCCAATCTCTTCATCCGCCGGATTTTGAAGGACTTGGCATTTTGTTCCAGGAACTCCAGGGCCTCTACGTTGCCGGCCACTACCTTGCAGT 
GCGGATACATGTAGTTGGCGATGCCACAGTAGGGCGACTCCACTCCTTTGTCCAGACACCATTGCTGCGCACGAGCGCAGGCCTCTCCCAGATTG 
GTATCCGGTCCGTAGAGAGTCATCGCCATGGCTCCGGCTGCCTGGTCACATGCCGCCTGCATGGCGGTGGCCCGCACCTGCACCAATCGCAATGC 
CTTGTCGAAAGGCAGGGCATCCGCGTACACCAATGCGGTAATCTCGCCCAAACTAAAGCCTGCGGCAGCTACACACGTTTCAATGGCCTTGGGTC 
GCTCCTCACGCAACTGCTCCAGTGCTGCCAGCGAGGATACCATCACAGCCAGCTGAGCGTGCTCCGTGCGATTCAGTTTCTCACGTGGTCCCTCC 
AGGCAGATCTTGAGCAGATCGTATTTCAGCACCTCGTTGGCCAGCTCGAAGATGCGACGGGCGCCCGGGAACCGGAGCAGGTCTTTGGCCATGCC 
CACGTACTGGGTGCCCTGACCGGGAAAGAGCATCACACTTGTTTCCTTGGGATCGATGGCGGGCCGGCCTTTCTGTTCCAGCGGCTCAGAAAGCT 
CGTTGAGCAGTTGCTGCCGCTTCTCCGCGTTCTGCAAAAGGGCGGAGGTTTCTGTTGTGGCCTTCGCGGCATCGCTGCTCCAGCGGGACCAGCTT 
AGGGCACCGGTAATCCGCGGTGATCGTAGTAATCTGCGGGCGGCCAACATTTTTGTTGAAGGAGTTCGCGGCATAACATAACAGCTGATTGGAGC 
AG AG AAAAG T AG AT CAT AT C GAT AAC T GT AAT GC GGC C AT CG AT AGCT T T AGT AC T T C G ATG AT T T T AGT AC AATT GT AT T T T G AAT T T T GGC GG 
C T T AT GCTC AAAAAC AAT AT GAT C TT AAG AAAT GG AAAAC AT T C AGT GC AT T GT GAT AAT C GAC ACT T TAT T GT AAATCT T AAT T AT T AAAGT C A 
ACATGCTATATACGCTATTAAGTGGTTCAAGTATTTACTATTAACTTTAAATAAATCCAATGGCCAATTGGAATTGGTAAAAAGATCAAAATTCC 
CAGGTATTTTATTTCCCCAGACATCTTAGTCATGGTCTGCGGTCTTTTTCCACTATTTATAGCCAAACTTGATTTATGTTGCTAGGCTCTTATTT 
AGTCGTTGATGGTCATGACGAAATAATGTTGGTTGTTTATTCATGAGTCACTGACTACATAGAAACTGAGCCAAGTTTTTAGTTGAGTACAGAGT 
AACAAAGTTACTGTGCTTAGTAACATTTCATTTAAGATACTATTTTAAAATATTAATAATTTTTTTTTTTTTTGGTGTAGCCTTTTATTTATTAT 
TTAAAATCTGTCCTAGTGGACAATAATTAAATTTTATGGAGGGAAACATTAACGTAAGGGGAGTACATCGAGTGTACTTGGGTAGGGGAATATTA 
ATAATGACAATATTTTTCTGAAGAAAATTTGGAAGCTAAGTTACGACTTTTCGTTTCATTTGCATTGTATTCTATAATTTACGTTTTCTAAAGTC 
AGACCTTAAATATCTATGAAAATTTAAAGAAATAATAGCTCTTTATATGTTTAAATTCAAAGACATAAGATTAAAACCCTGCCAAGACAAAAGAG 
TATTTTTTGGCCATGTGCTATTCGGAACCCCCGAAATAAGAGTTTCTAATTTGAAATGCGATCCCAGCAAGGCAATCGCAGAGTTTTTTGAACCC 

AGGGCAAAAGAATAACC 
{SEQ ID NO: 883) 

Exon: 2152.. 1001 

Start ATG: 2140 (Reverse strand: CAT) 
Transcript No * CT237 94 

CCTTCAACAAAAATGTTGGCCGCCCGCAGATTACTACGATCACCGCGGATTACCGGTGCCCTAAGCTGGTCCCGCTGGAGCAGCGATGCCGCGAA 
GGCCACAACAGAAACCTCCGCCCTTTTGCAGAACGCGGAGAAGCGGCAGCAACTGCTCAACGAGCTTTCTGAGCCGCTGGAACAGAAAGGCCGGC 
CCGCCATCGATCCCAAGGAAACAAGTGTGATGCTCTTTCCCGGTCAGGGCACCCAGTACGTGGGCATGGCCAAAGACCTGCTCCGGTTCCCGGGC 
GCCCGTCGCATCTTCGAGCTGGCCAACGAGGTGCTGAAATACGATCTGCTCAAGATCTGCCTGGAGGGACCACGTGAGAAACTGAATCGCACGGA 
GCACGCTCAGCTGGCTGTGATGGTATCCTCGCTGGCAGCACTGGAGCAGTTGCGTGAGGAGCGACCCAAGGCCATTGAAACGTGTGTAGCTGCCG 
CAGGCTTTAGTTTGGGCGAGATTACCGCATTGGTGTACGCGGATGCCCTGCCTTTCGACAAGGCATTGCGATTGGTGCAGGTGCGGGCCACCGCC 
ATGCAGGCGGCATGTGACCAGGCAGCCGGAGCCATGGCGATGACTCTCTACGGACCGGATACCAATCTGGGAGAGGCCTGCGCTCGTGCGCAGCA 
ATGGTGTCTGGACAAAGGAGTGGAGTCGCCCTACTGTGGCATCGCCAACTACATGTATCCGCACTGCAAGGTAGTGGCCGGCAACGTAGAGGCCC 
TGGAGTTCCTGGAACAAAATGCCAAGTCCTTCAAAATCCGGCGGATGAAGAGATTGGCTGTCAGTGGCGCCTTTCACACTCCTCTAATGCAAAGC 
GCCGTGGAGCCATTTACCAAAGCCCTGAAAACGGTGCGACTTCAGGACCCCGTCATTAGAGTGTACTCCAATGTGGATGGCAAACCCTATCGCCA 
TGCCAAGCACATCCTAACGCAGCTGCCCAAGCAAATCGTGAGGCCCGTCAAGTGGGAGCAGACTCTCCACGAGATGTACGAGCGCAAGCAGGGCG 
TCGACTTTCCACGCACCTTCGAATGCGGTCCTGGCAAGGGCCTGGTCCAGGTCCTGGAAAAAGTAAATGCCAAGGCCGCTCAATCCAGCTTTAAT 

GTTATAGCCTAA 
(SEQ ID NO: 884) 

Start ATG: 13 {Reverse strand: CAT) 



MLAARRLLRSPRITGALSWSRWSSDAAKATTETSALLQNAEKRQQLLNELSEPLEQKGRPAIDPKETSVMLFPGQGTQYVGMAKDLLRFPGARRI 
FELANEVLKYDLLKICLEGPREKLNRTEHAQLAVMVSSLAALEQLREERPKAIETCVAAAGFSLGEITALVYADALPFDKALRLVQVRATAMQAA 

™™*r:aMaMTT.vr;pnTNT.fiF.ar.ARAOOW^ 




{ SEQ ID NO: 885) 



Name: malonyl coenzyme A-acyl carrier protein transacylase-like 
Classificatxon: enzyme 
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Celera Sequence No. : 142000013384534 mm 
GTTTGTGCAGGTTTCGGCGGTTCCTGCAAAAACAAAAGAAAAATGAATGATAAGTATGAATCAAATATAAAGTATATGCATAGATTCTAATGATT 

TAAGTAAAATAGATAACGATTGATTACAAATCATTGATTATACATCCTTATCCTTAGTTAAAACAACTTCAAAAATTGCAAATTATGATATATTA 

GGTTTTTAAGAGCTTTATCTGGAATTTAAGTTAATTAAAAATCTCTCTAGAACTTTAAATATAGCATCAAATCTAAGGCAAATGTGTATGTTATG 

ATCAAAAGTAAAGTTGCTATGAAATAGCGCGAAACCCAATTAAAATTAGTTAACGT^^ 

AATGTTTTTCCAATCAGTAAATCCGTGTTTACCGCTCTGCAGTCGCTGCTGCGCTGAGCCCGCTGCCGGCGTCGCAGTCGGCGTCGCTGCAACGT 
CATCCTGGGCGAGCGGCGCCTTTTGCTTCTGCTCCTCCTGCTGCTGATGATGCAGTTCCTCCACTCGGCTGCCGGTGACGTCGCCGCGAAAGCGC 
AGCTGACGCTTGCCGCCCGCGGAATTGAGTGTGGCGTTCATTAAGTACGGTGTGCGGAGTCCATAAAATTAACAATAACAACGAGCTACAAAATT 
TCGGGGTGAAACAGCAGTTATATGAAAATGTACTAACGCGTGCCGAAAATACCCAAACGTAAACAGTGTGTGTATCGAATGTACAGAACTATCGC 
CTAAAGTAACAGCTTAGTTTATATTATATTAACATGGTTAATATTTGCAATGATTTTTATTCTAAGCTTAAAGAATAAGTTCCCGTTAGGGCCAT 

GAG AG AAAT TG AT AT AT AAGT AT AC AAC G AAT T T T AAAAT AAT AT AT T T T T GC T AC C C AGT AAAC TT T T TAT G C AC T AGCT C AC CG AT T CAT AT T 
TCAAAGTGCTATCGAAGTATCAGTCAATAGAGTGCCGTCACTAAATAAAAGCAGAGAAAATATTTCAATAGTTTGTAGCTGAAAAATATGTTCGC 
ATCTGTGGAATTAAGCTCATATCGCGATCAGCACTTCAAGGTGAGTACATCCGAAGGCGCCACAGCAGATCTCCGACTCAATAAACCAATTGCAG 
GGCTCGCGCTCCGAGCAAGAGCGATCGCTCCGCGACTCGTGTACACTTTACGTGGGAAACTTGAGCTTCTACACCACCGAGGAGCAGATCCACGA 
GCTCTTCTCCCGCTGCGGCGATGTTCGTGTGATTGTGATGGGCCTGGACAAGTATAAGAAGACGCCCTGCGGCTTCTGCTTCGTGGAGTACTATG 
TCCGATCGGAGGCGGAGGCGGCCATGAGGTGAGCCATATCCATCTCTCCACGCTAGAAAAAGCCGCCTTTTTTCCGGCATTTGTAGAAGTAATCA 
GATGGGGTATCATTATCTCTTAACTAACTTCAACACTCTTTTCTTCTCCCACAAAAGATTTGTGAATGGCACTCGCTTGGACGACCGTCTGATTC 
GTGTGGACTGGGACGCCGGCTTCGTGGAGGGCAGGCAGTATGGACGCGGCAAAACCGGCGGACAGGTGCGAGATGAATACCGCACGGATTACGAT 
GCCGGCCGCGGGGGTTATGGCAAACTGTTGTCACAGAAGATTGCACCCAACACGGACAATCGCTAGTTATTAGATAAAATGACTGTAAAGACTAT 
AAGCCCCCCAATTAGATATTAAAGCATACCCCTTAGACATAACATCTCTAATCGCATATATATTGTACGTTTTACATAGTTCTTTTGAAGACGTA 
ATTGATATCATTGCGGCCTAGCATTAGCATGTATGAGTTTCTAGTGTTTTCGGGCTGACTGGATTCCATTTACTTCTTGCCAATGTTGTTGGCCA 
GCGAGCAGCTGAGTCCGGTGCCATCGGGCTCGACATTCAGCTCGGTCACCTTGCCGTTCTCCACCACCAGCGAGTAGCGCTTTGAGCGCACGCCG 
CCAAGTGGTGGCAGATCGATGGTCACATCCAGGGCCTTGGTGAAACCGCCGGCGGGATCAGCTAGGAGGCGCACCTTGCCCGCGGCTCCGTGCTC 
CTTGCCCCAGGCGGACATCACAAAGGGATCGTTGACCGAAACGCAGACAATCTCGTCCACGCCCTGCTTGGACTTCAGCTCATCGGCGGAGCTCA 
CATAGCCGGGCAAGTGGGTCTGTGGGTGTGGGGAAATGCATTATTCAGTTGGAATACATAAGCTGCATAAATTACACTATACTGCTGGAAGTTTC 
TACAATATTCGGAAACTCAAAGCAGATGACTTTGCAACTATGCTACCTACGCACCTTTGAGCAGCCTGGAGTGAAGGCGCCGGGAACGCCGAAGA 
TGATCACCTTCTTGCCATTGACGAGATCGCCGGTGTTGATTTTGTTGGCTGGCGAGTCCTCGAACAGATCCACCGATGGCAGGGAGTCTCCTACC 
TGGAAATAAAGTGGGAGGCAGACAGGCGTACTTAGAGAAAAGAGGCAGCAAGATGGAGGCGATAACCTTGAAACGCTGTAATTTCTGGGTGCCGC 
CGCAATCAATATTTATGCAGCTCGGCACCGGCTTATGTAATGGAATCCGGACTGCCGCACAGCCGGAAATCCGCGTTTGCCGGTCAACCTGCACT 
TACTTTCACCATAGCTGCACTGGTTTTGGACAGGGATCGCAGTGAAATGATTTGTTGGGGCAGCGCGGAGTTAACAACTCGGCCAA 

(SEQ ID NO: 886) 

Exon: 1001.. 1085 
Exon: 1141.. 1358 
Exon: 1483.. 1746 
Start ATG: 1038 

GCAGAGZ^ 

CCGAGCAAGAGCGATCGCTCCGCGACTCGTGTACACTTTACGTGGGAAACTTGAGCTTCTACACCACCGAGGAGCAGATCCACGAGCTCTTCTCC 
CGCTGCGGCGATGTTCGTGTGATTGTGATGGGCCTGGACAAGTATAAGAAGACGCCCTGCGGCTTCTGCTTCGTGGAGTACTATGTCCGATCGGA 
GGCGGAGGCGGCCATGAGATTTGTGAATGGCACTCGCTTGGACGACCGTCTGATTCGTGTGGACTGGGACGCCGGCTTCGTGGAGGGCAGGCAGT 
ATGGACGCGGCAAAACCGGCGGACAGGTGCGAGATGAATACCGCACGGATTACGATGCCGGCCGCGGGGGTTATGGCAAACTGTTGTCACAGAAG 
ATTGCACCCAACACGGACAATCGCTAGTTATTAGATAAAATGACTGTAAAGACTATAAGCCCCCCAATTAGATATTAAAGCATACCCCTTAG 

(SEQ ID NO: 887) 
Start ATG: 38 

MFASVELSSYRDQHFKGSRSEQERSLRDSCTLYVGNLSFYTTEEQIHELFSRCGDVRVIVMGLDKYKKTPCGFCFVEyYVRSEAEAAMRFVNGTR 
LDDRLIRVDWDAGFVEGRQYGRGKTGGQVRDEYRTDYDAGRGGYGKLLSQKIAPNTDNR* 
(SEQ ID NO: 888) 

Name: RRM-type RNA binding protein 
Classification: RNA_binding 
Gene Symbol: Cbp20 
FlyBase ID: FBgn0022943 

Celera Sequence No. : 142000013384534 

ATGTGGCCGCTCTGTTGACTATGCTGTCCGCCAGTCTCTCCAGGTCGTCCAGGCTCTGCTCGTTGATTTTCCGCTCGAAGTCGGCCATAAACTCA 
TTGGTCATACGCTCAGCTAGGGGCTCCGAATAGTGCTTGCGGAATCCCCGCAGGATGCTGTCCCTTAAACAGGGCTTTATCACCACCAGATTGAG 
TTTGGTGCCCTCCTTGATGTTGGGATATGACGCAATTGTCTGTTCATTGTTAAGCGGTCGCCCCAACAGCAGTAGCTTTTGATTTGTAGCCGATA 
TCTGCAGTTCCGCCTCGATCTGGTGTTTCACCTCCAGAATCGTCGATGTCGGCGCCACCTGTGTGAAATATTACCATATTAGCGACCTCACTTGA 
TTTTGGCAAGGTTCATTCTTACCTCGATGGTGCAGTCCTTGCCCTTCAGCACCTTGATTGTTATCTGCATAGCAATTGCCGTAAAATTTATTAAA 

AAAT T7AAAAGT C AAAC T C G AGAAG T C G AGC GAT AT AAAC AAT CTGTCTTTGGCGT T GC C AC C T G TAT T G C C AC C7AAGT GT C AT T AG GCC AAGC G G 
CCCTGGTTTGTAAAACAAGTGCCGCAATATTTGCTAATATATAATTTACAAAAATATTTCCTTATATTAATATATAGTACATATATATGATTCTT 
GTAAATTGCCAAATGTTATTTAAAATTATTAATATTTTAATGCAGCTAGCACAAATTATGTACAAGCTATCGAAAACCACTCGAAGTTATTATTT 
TACAAAATTAAAGAAT7AATGTGAGATCAACAATTAAATAACAGTTTGT7AAACATTCGGTTTATTATAATTAATAATGATCTTTAAGTCTTTCGCA 
TTTGGC AAT AAGAC AGT AGAGGAGAGTAAGGT AT A7AAGAT AGT A7AAT AT AAAT AAGT AAAT AGC AT AAGCAAAAGCTCGTTTT AGC TTTT AAAC G 
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GAGTCGGTAAAATCTTCCTAAAAGTAACTATCGAGGGTCGGCTGGGTCAGGCAACGCCGCCGCACGCCTGCAGCCCTGGTAACTCTATTTTCTAT 
TGAGCACCGACAACGTTGCGTGTATAAGACAGTTTACATAAATTATTATTTACAATTGCACAGAGCGTTGATGTTGTGCGTTCTAAGCGAAAAGG 
TGAACTTGACCCCGGTGCCGATAGACCGCCGAGCTATTGGGTGTGAAATTCGCGAGCGAGCCTTGTGGAATTCGCACGCGAAAATCGAAGCGAAT 
CGCAGCGATCGAAGAAATTGCAGCGCAAATAATTCGTGTAGCCTCGACGACTGAATTTTAACGCGAGAGCGGGGTGTGCGTGTGTGTGTGTGCGC 
TGTGCTGTGATTCTGTGTGCGTGTGCAAGCGTTTTTGTTAGAATTCGCATTTTACGCAAAGAACTAGCTTTTATCAGCAGTGAACTTGATGTTGG 
CGAACCTATATCGCAGCAGGGCCAAATTGAACAAGTTCTTGCAGCTTCTGGCGGTAAATTGGCGAAATAAGACCCATTAAAAACCCCCCTCTGAA 
AATATAACCCACAAATTAACCTCGCCCCTTGACATTCAAAGTTATATTTGTTTCCCCACTTATATATGTGCACATATATATATATAAATATATAT 

AT ATT T AAT AAT AAT T GTG T GT T T G AGT C T T GG AAGT G C AACT AC AAC T AT T AAT C GTGCCTAGTACACATTT T AC T T T T AC GC T C T C G AC T GT C 
TCCGTCTCTCTCCCTCTCTGTTTCACCCCTTCCTTGCAACAACAAAACTTATGAGTACTTGCTGGGTGCCCCAACAATACTTGTGTTGATGATTG 
CCCATCCTTGCACCTCCAAAAAGAAATGAAAAAAAAATTTGAAAAACACCAACACCACCCAAAAGCGCTCTTCCTTCCCCATTTCTATTGGCATG 
TCTGGCAACCCTGGCAATTGGTCGACGGCCATTGCTTCTCAACTTGTTGCTCGTCTCGCTTGCTCTCCTCTCTCTCTCTACTCTCTTTTCAAGTA 
CGCTCTCTCTGCGCCAGTGTTTTGGCATTTTTGTTACTTATTTTCGTTTCGAGTATAGTTTTATGCGAAATGAACAATGCCAAACAATGAAACGC 
CTGAAGACCTGCATAAAGTGTTGATGCTAACATTTAAATGGGAATCCGAAATTCCACATCACCGACAACACCTGTCCACTTGGAAGTTACAGCAC 
TAATCGGAGGAACGCTGCATGAACGTGTTTTCGGTACCGTTTAGAACCGGATTTTCTATGCTAGAGAATGAGAATATGATCGCGGTTCTGTGGGG 
ACATCTTGTGACAACTTAGGAACTCCGATACGAAGCGGTTTTCCGCCATTAAGGCATTTCCAATTGGAAAGCGCGCAGCGAGAGATTCTTAAAGG 
TTGCTCCGTTTCCCATTGGTAATGTGTTGTTTACAGTTATGGTTAAGTTAATAGCAGCGTGTTTATGTAATATATAGATAATTCGAGTTATGCAC 
GTTCCACTGTACAGTGATCAAAGGGAAAAGAAACAGTATTGGAATTTCCATGAAGTTGCATAACAAATGAGGAATAGTAGTGATGAGAAGATTCG 
TGGTAGATTTCAGTAAACCCTTAATTTACGTTGAGCGCCCCTCGAATGCGAATGTTAATGCGTGTGTGTGTGTGTGTGTGTCTGCTTGAGTGTGT 
TGTTTTGTGGCTTAATTGCACGGTGTGTTGTGATGCTATTGTTATTTTTCCTTTAGTTATTGCTACGTTAATTTTTTTCTTTCGACCCTCCACAG 
TGCAAATATAATTTTCAGATAGCAATCAACTGAGAGCGGTCCAGCCCGGCCTGTTGTATTTGTTTGTTTGTCCGGTTGTTTACGTTTTTTTCTCT 
CGTTCTCGCTGCTTTTCATACGTATTTTATATTTTGTAAAAATAAAAAATCGTTATAAAAAACCAAAAACACACATAGAGGAGAGCACATCCAAA 
GAGCCAAAAAGAGTCAGAACCGATTGCGTATTATGGATAACGAGGCGTTAGATCCAACAATAAAAAGCAGTACGTCGGCAGCGACGCCAACAGCA 
GCAGCATCAGAAACGACAACAACAGCAGCATCTTCAGTTGTGGAAACCACAACAACAATCGCGGCGGCCACAGCCTCAGCGGCAGAATCAAAAAA 
CGAAACAACGGCCACAAACAACAATAGCAACACAAGCGGCAGCATCAGCAGTAGTAGCAGCAACAATATAGTCATACCGGCATCGGCCACTAACG 
GTATCAAAGAGAGTAACAGTAACTTAAGTACAACAACAACAGCAGCAGCAGTAGCGGCAGCAACAACTGTAGAAGGAGTAGCTCCAGCAATAACG 
TCCACAATCGTGGTGACCGGCGGCACTCCTCCATTGAGCAGTTTGGTGAGTAGCTCTCCCCCAAAACACTAGCCCTCCCCACCACACTCCCTCAC 
TCACTCTCTTTCTCTTGCTGTCTTTAAGCCACAGTCTCTCTCTGTTTCTGTTACGATCTCTCCCTGTCTTTCACTTGTCGCCCTATTTGCATGCC 
TACCGTTATTTTGGCGTAACGTAAACCTCACTCGTAATGCTTACACGCACACTGTGCATTTGTGTGTGTGCCTAAGCCTTATCTTATCTAATCAG 

T GC T AAT GAT GT G T GT T ATT GT AT CGCTGCTT C AC C T T ACC C C GC ACGAT T C AAGT TAG AT T G AC C AAC AC C AAT TAG AT T C AC AAT G T T G C C AA 
TTGTTAAATGAGTGTATGTCGTTCGGATAACAAATCGAATCGTAATTGTATCGCAAGTGCTTGACAAACCGATATTCACTTTCCCCCTTTTCCTC 
ATCTCCAATTTGACCAATTCGTTAGTCCTTAGAGAGATAAACTTATTAGATTTTAATTTTTTTAATTTTAAAAAATGATTTTTTAATTAATTTCA 

T AAGT x T TT T TAT ACGAAAT AT GAT GC AT T T AAT T AT T T C T T T AAAT AG AGG AAAT AC TCTTTATT T AAC AAAC GAT AT AAAT AGCAT TT AAT GA 
CCTTGTAAGTGGTAACAGTAATCTAGCAATCTTTATTGCTTTTCAATTTTGGTTTACAAATCGTTATCGTTTCAATTATATTACCTTGGGTTCTA 
CTTTCTGATAAGACTTTTGCGATTGAATGAATAGATTGCCCTAAATCCTAGATTTT AT ACGAT TTCCCGAAAGTTTTGTGTCAACAAAAGACGGA 
GTTGATTATACAAATCTTTATACTCACATGCCTGTTATTAACTAGAAGAACTTTTAATGAAAATTTTTTACATTTTTGTCTCTCCAACGACAAAA 
ATATTGAACATTTGCAATAAAACATAAAGTATAAGGCGAGCACTTTGCTTTTTGAAACTG 

CATAAATGTTAGTGCCTGGAAAATATATACATATATGCCACCACCTGATTCCCTGGTAGATGCCAGCAATATAATCCTATACGTGTATGGAGAGC 
CTACGGAACTCCAGCTTCCTGATCCCAGAGCACATTTCAATGTTCGGCATTACTAAACTCAGACTAAAACACACCCACAGGCAGCTCAACTTAGG 
CGTCAAGGATTCGCACAGTCCAAGGTAAAAGCGATGGCCGGCCAAAATGGCAGCGAGGGACCATCCCGAACTGAACTGCTTTGCTCTGATTTTGC 
AAGTGCTACGTTTTGAGTATGCTTTTTATTTGACCGCATATTGCTTTGCATTCGTTGTTAAAATTATTTTCATTTACTAAGCATAAGCCAAACCG 
TTGACATGGGAAAAGATAATCAGATAAGTGTCATTTATGCCCCTTTGCAGGCTAACAAACTCAAGGACAACACACCACCGTACGATGCACCGCCG 
CCCACGCCGATCAGCAAGGTCCTGAACATCACCGGCACCCCGATCGTTCGCAAGGAGAAGCGCCAGACCAGCGCCCGGTACAATGCCTCCAAGAA 
CTGCGAACTGACGGCCCTCATTCCGCTAAACGAGAGTGAGTAGCCTTGGAGTAGATCCTACTACTTAGTATATTACTTTATAATACATGTACTGC 
CAGCTGTTCAGATCGATATCTCTTAAGCCTGCAGATCGGTTGC AT GGGGTAGAACTTGGTCGTAGCCATGTCGT AGCAT ACATTTTAGGGCAGGG 
TTCCGTTTAGGCGATATAAAATTACCTTTCTTACTGTCTACCAACATCGTCAATTAAGTTTTTATGCAACTATACACTCAAAACAGATTGTATAA 
ATCGTAGTAATTCCCATTAACGATGCATAATTACATATATTATGTCAATTAACTCCGGTAAATTTCCATGATTATAAGTTCAGCTTTAACTTACA 
AGAGGCATTCTCTGTAAAGTGTACCTACGACCTGCAAGCTTGTTGGGAATATTGATTGGCCGGCGCTTGATTAATTTCGCTCAGCTGTTGTAATT 
TGTTTATTATTTGTTAAAGTTAAAGGTCAAGGGAGTGTCGTCTGATTAGGTGGGTGGCAACACTGACGCACGGCCACTCACACATGTGAACCAAG 
CAGCCGCATCAATTCACACCTCTCTCCACCTCTCTCACTCACCGGAATTTCAACACATGCGCTCTCTCCGCAGCCTCTCATTACACTGGGAGAAA 
GCATCGTTAGAGGATTTCTCAGACCGTCTTAAATTTCATATATACACCGGCATTTCTTTATGACACAAATGTATTCGATTGATTTAAACTTGTGT 
ATACAAATAATTAGAAAAACTGTATCCTGCAGGGAATTAATTTAAAAAAAAGCAAATTCAATATTTTTGAAAATCATATATGTACATATCTGCGC 
GTAATGATTATGAAATTGCTACACTTTTTTTTTCTGTGCAGCATGAATGTCATCGCATTGCGTTATTCACATACCTTTCTGCTAATTGCTTGGGC 
GGTTTCGACTCACAAATCGAATAAACAAATTTCGCTTGTTCAATACTATTTTATTTTATTAGGTTTTATTGTTATTGTGGCTCGCTCGCCCAC 

(SEQ ID NO: 889) 

Exon: 1001.. 1139 

Exon: 2929. .3370 

Exon: 4801. .4983 

Start ATG: 2978 

Transcript No. : CT23882 

GCAACGCCGCCGCACGCCTGCAGCCCTGGTAACTCTATTTTCTATTGAGCACCGACAACGTTGCGTGTATAAGACAGTTTACATAAATTATTATT 
TACAATTGCACAGAGCGTTGATGTTGTGCGTTCTAAGCGAAAAGAGGAGAGCACATCCAAAGAGCCAAAAAGAGTCAGAACCGATTGCGTATTAT 
GGATAACGAGGCGTTAGATCCAACAATAAAAAGCAGTACGTCGGCAGCGACGCCAACAGCAGCAGCATCAGAAACGACAACAACAGCAGCATCTT 
CAGTTGTGGAAACCACAACAACAATCGCGGCGGCCACAGCCTCAGCGGCAGAATCAAAAAACGAAACAACGGCCACAAACAACAATAGCAACACA 
AGCGGCAGCATCAGCAGTAGTAGCAGCAACAATATAGTCATACCGGCATCGGCCACTAACGGTATCAAAGAGAGTAACAGTAACTTAAGTACAAC 
AACAACAGCAGCAGCAGTAGCGGCAGCAACAACTGTAGAAGGAGTAGCTCCAGCAATAACGTCCACAATCGTGGTGACCGGCGGCACTCCTCCAT 
TGAGCAGTTTGGCTAACAAACTCAAGGACAACACACCACCGTACGATGCACCGCCGCCCACGCCGATCAGCAAGGTCCTGAACATCACCGGCACC 
CCGATCGTTCGCAAGGAGAAGCGCCAGACCAGCGCCCGGTACAATGCCTCCAAGAACTGCGAACTGACGGCCCTCATTCCGCTAAACGAGAGTGA 

GTAG 

(SEQ ID NO: 890) 
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Start ATG: 18 9 



MDNEALDPTIKSSTSZVATPTAAASETTTTAASSVVETTTTIAAATASAAESKNETTATNNNSNTSGSISSSSSNNIVIPASATNGIKESNSNLST 
TTTAAAVAAATTVEGVAPAITSTIVVTGGTPPLSSLANKLKDWTPPYDAPPPTPISKVLNITGTPIVRKEKRQTSARYNASKNCELTALIPLNES 

E* 

(SEQ ID NO: 891) 



Celera Sequence No. : 142000013384543 

GCTATTTCCAATTGTGCTATTAATCTGCCACATTTGCACTTCCGCGGAAAAGAGTCGGCGGGCGGATGTGATCTCTTTCGCTCGTCAGTTGGCTG 
ACTCGCTCTGCAGGGGTGATTTGCGGGCGAGCGAGTGAGACAGAGCGAGGCAGAGTGGTAACGCTTGGCTAACCTGAAAATAGAAACAATCGAAA 
TAAAAATACATTCTCTTCATGCTCTCGTCTAATAAAAAACCCCAAAAGCAAAACGTTTTCCATTTAAAATGTCGTCGCAACTGCCAGCGGGAAGT 
GGACGTCGAGAGTGGCAAGTGTGTGGGGCAAACTCCGGCCAGCGGCCAATGGCGGTTGGCCGAGGCCACGCCCCGTGGCTCAATTACCGACCAGT 
GCTGCCCATCGGCGGGCAGAGCGAAACCTGTTTCCACGAGCGAGAGGCAGTCAGGCGCATGTGCGATCAGAGCGGGATGGATGGCTCAAGGGTGG 
ACCAACCAGAAGAGGCAGGTTGCTGGTATTATGGTAGGTTAGGTTCAAACAAGGTGGCAAACTGGCTCTTAACCGAAATAGGAAGGAAGTCGGCA 
GCGGCGTCGACGTCGGGCAACTGGTTTGACTTGTTTCCGGCTCACTCCAACAGCCCAAACCCACCAGCAATCGGCAGAGGCAGCCAAGTCGGCAG 
AGGGCAGGCAGAGCAGCGGCCGTCGGCACAATTTGATTGCATTTATGAAAAGCGTTTTCGCCCGCTAGCACGCGTCCGCTCCCAGCTCGCAATTC 
GCCATCCGTAATCCGTAAACTTTAATCCGTAGACCGTAATCCGTGAATCCAAAGCAGTTCTCAGTTGGAATTTCCCGCAACGGGTAGTGCTTAAG 
CGCGGAAACTGAATTGAAATCAATCGGACGATTAGTGTGTGCTTCGCGGTTGAAACACTCTACATCTAACTATTAAGAGAAAAGAGTGCCCAATA 
AGATTCGAAAAGTGCAGTGAGCGAGCTCCAAATTGGCGCCACTAATTGCCAAAATGTCCAGCCACGAGGAGGATGACCATGAGCACGAGAACGAG 
CACGGCGAAGAGGTGGAGGAACAGGAGATCCACGTGGATGTGGACTCGGACTCGCGGATGTCATGTGGCAGCGGATCCGATGTGGATATGGACGG 
CGGCAGCTGCTACGACGAGTCCGAAACGCCATTGAGGTGAGAAGGAGAGCTTTTAAATAGAACCAGGAAGTAGGAAAAATGTCCCAAAAATATAG 
ATATTATCAGTATAGATATACCATAAATAAGCAGATCTTCCAATTACAATAAGACTTCTACATATACATACATATATTATTTTTTTAACTAGATT 
GTGATAAAATGCTTTGGCTTCCGTACTACTTACTTAAAACTGAAGAGGCCTTTTTTTTTAAAATTCATTTTGGTAAATATTTTTCAAATATGAAT 
AACGTCTGAATTTATGTACGTTGCAAAATCGAACAATAAAAAATATTTTTGGTTTAATTGTATCGTAAATAATCTGCTACACTAAACTTGAAAAG 
AGATTGCATCTCTATTAATAATAATAATGGCTTTACTTTCAAATTATACGATGTACAAACATAATAATTGGTCTGAAACAATTCATTACGGAAAA 

T C C C C AAAAAAAAT AT AT T AAAG AAAAAGC T C AGT TT T T C GT T T ACT CT TGAT GGT TAT GAT CAT CAT G C AAAT TAAT AAAT AT T TT T TT GT AAG 

TACTGGGCTTGATTTAGTTGGCCACATTATTCGTTAATTACTCATAAGTCAAAATTTAGATCGACAGCAATTTGATCGACTAACTGAGAGAAAAA 

GATCTATAAGCTATAGTTATAATAGTTGGCATACAGTCTTATTTTATTGCCCTATAAAAGTCAAACAAGTATTGTGGCGGAGAAAAGCTATTTGC 

ATTTGTTATTACGAGCTCATAACTCCCACTGTTTTACGATCTTTAACATTGTCACTCATAATGCTCACAAATCCTATTTTTCGAATCGTGGCTCA 

ACGCCATTGAAATGCGTTGATGTTTAAGCCCCATTGTCCGCTCTTACAAATGGTCCCATTTTCCAGCGACAAAAGGGGATAACAAGATGCCAGAG 

CATCCAGCAGACCGTGTTAACTGATCCCATTCAAGCGTTTGTGCGCTTAGAAGAACAGTGTGAACAAACCTCTTAAACGCTAATGAATTGACCAC 

AAAGCCACCAAAAGAATCAAGTCGAAATGTCCCATAAAGTGCCAAAGAAACTCCCTGAATTTACATAGTTCACGGATGGCCAGGGTTTCCAATAG 

ACGACGACGATAAAACGCACATTAGTAAGGCGCCAAAAACGGGCTGTGATAAGACATTGGGGTCCATTTGGGGCTCAATCTGGAGTTGGATCTTC 

GGCTTGAATGCCGGACAAAGACGAAATCGTTTGTCGGATTAACATTTTAATAGAGATCGTCATAAAAATGCCATAAAACCCGCGTTCCAGCGAAT 

GTAGATCACCAGATAGCAGCAAGTTTTGGCCAGTGAAATAATATGTGTTTGCTCATAACTTTTGATATTGGAAAGTGTCTGTGTGTCGGTGGTTG 

GCGATTTGAGATCATTGTTCCGATACAATTCCATTCCTGTTCCAAATGGGTTTTTACTTTTTGGATTTATAAATTAAATGCCGTGCGTGACTTGT 

TTTCGGTGCGTCAGTTGATGGCATTTGATGATTTATGGGCGCGTATTAAATGCCTCTAACGGCCGTCAGCATTTGCATAATTACTTCATTTCCAG 

CCGCATTGCGTATACGCAGCGGTGGCCACACGGGTTCTGCATGTCATGTCGTCAACTCGGCTTATCCGCTCGCGTGGGCTTTTGGTTAGTGGTTC 

CGAGACTGGTACGCACTGCAACAGACTTAATTATAGTCAATGTTCTCAGGCTTTCCCGCTTATCGGGCGACACATTTTAATGAGCCCGCACCTCC 

AGAGGGCTTCTTTCCCTTCGATTCTGATGTCTTTTCTCCGGTTTTTATTCCCAGCGAGTCCCTGCAGTCGGAGCAGACGCGCTCCTCGTCCAGTG 

AGAATCTGCCCTTCAGCATATCACGCCTGCTATCCAAGCCCTTCGAGACGAGTCACCACCACCACAACAACAACAACCACCTGCTCAGCAGCAGT 

CCGGGCTCCAGCAGCAACAACAACAACAGCGGCGAAAAGGGCGAGGAGAAGGAGCTACTGCAGCAGGAGGACCACGACCTGGCCTATAAGCTGGC 

CACCAGCATCGCCAACTCGACATACGGAAGCGCCGCAGCACTATACTCGTATCCGCATCTCTATCCCTCAGCGGCCGGCGGTCATGTGCTACGTG 

TACCGCCCCAGAGAACGCCGCTCACCTGGGCACTACCGCCACTGCATCACGCGGCGCTGGCACATCAGGCGGTCAAGGATCGGTTGGCAGGTGAG 

TGAATAGCAGCGAAGTCAAGAGGGTTTGCCAAATCCAACCCATTCGGGGCCCCCATGTCGGGATGGCCCATGTTCGCCGAGATCTTTCGGGGGAG 

CGATCCTCCAATGACCCGCGGTGGCAATTGCTACTGTTCCCCAAATAAGCACGATCAGAGAAGACTTTTAAGGCAACATGCCTATCGGCATGTCA 

TTGTTGGCCAACACATTCGTGCACCAAGAGAAAATGTGGCATTTGAAATAAAAGAATTGGTAACATTAAGAATAAAAATGTTATATATGTGCCTA 

CATAACGCGTAGTTGGATGCATTCAATTGTGCAATAGGGTGTAGGAACATAATTTTAAACTTCTTGATAAAACATATTTTTGTATATTCATTGTT 

GAAAATAAGGTGCGAAGAACGTTGTCTAATGGTCTAAAAGGGCTTACTTACATATTTCCTTATAATATAATTACGATGGTCTATTAGATGGTATT 

TTCTGTGGGTAAGAAACAGCTTAAGGCTAGAACACAGGTCTCATAATACGAATTTTCGCTCAGTGTACACTCACTCTGGTGAGCTTGGGATGCTA 

AAGTATAAAAACATGTTTGCCTTCAAATGCGCAATCGCAGCTTAAGTTAAAAACTTAACGCTGAGCCGCCGCCCAGAAGCGACTCAGGAATGAGA 

TGCGCCTTCTTATTTGCATGCGCATGCGCAGTTGAATTTGTATCTGTATCTGTCGCTGGTCCTCTGAATCCGTATCCGAATCTGTATCTGTATTC 

CATCTGAATCTGCAACTTCACGTGCAGCTGGGTCTTCTGGCTCCCTTTTTGTTAATTTGTTTACATTATGAGCTTCACTTGTAGCTACAAGCGAT 

GGGCACAGCGCGAGCCCTTTGAAATAATGAGACGGCGTTGATTTTATTACGCCTCGGCAGCCGAGGGATCTTCAATGGTGTGGAAGGGGGATCCC 

GATAGGGTTCGGATCCTATAACTCATAATATTCGGGCCTTTTTCGGCCTTGTGATGTGGAAAAGCTGCTTAGGAGCAGCGCAGCCCGGCAATCTG 

ACGGGCTTTTATGGCCGACATCCGCCCAGTGGTTCTGGTTAGCTGTTTGGCCTTTTTCTTCATTTTAGTTATTTTAATGAGCTGGTTTACATGAC 

TTTTTAATTTCGCGCTGCGCTCGAAAAGGAATTAATTACGCATTGGGCGGTCGCTGGGCTCAGCGAGTGAGTTTTCTGTCCTCGCCGGCTTATTT 

ATCGCCGCATACCGCACTTCACATTTGAGTCCAACTGACATGGTCAGCTTTTAATTAGCCAGTCGAATGCTGGCGATCCGAGGGCTGCATATCCA 

TATCCATACTTGTGTGTATATCCGCCGAGGAGGGGCATTCGATCGAGTGGATCGTATAGAATGGAATGGGATCGGATCAGTCACTTCGAGTCGAC 

TATTCCACCTGAGGGGCACACTTATGAGCCCGCCGGGTGATCCAAGCTTCTGCCGCTCGGTTTCGTTTTGTCAACATCTTCGTTTTCGTCTTGGT 

CTTGGTCTCGGCCAACGTCTGTGTCTTTTTGTGTTTATTGTGTCGCTTGCCTTGTGGCTGATAAGCTGGCGCCTTGGCTTTTTTCATTAACAAGA 

CAAACGAGCGTCCAGGGTGCGGATCCGATCCGATCCGATCCGATCCCTTCCCGCGACCAAAGTGCTCCTCTCTCTTCCGCCGGGGCCCTTAGCGC 

CATATGATTAATAACCATCGCATGTTAATTCTCGTTATTTTTTGGGAAACTACGTCGCAGCAGGCGCCAATTACTTTTCGGCCATGCACGCCCTA 

ATGCAGTTTCATAACAGCCATAATAGCCTGATTGGAAGGGAGTTGGGCTTTTGTCACGACACTTGCCCACACGAATGTTACCAATCAATGGCGGA 

GTAGGATCTATAGGCTAGTGTATAGAAATTCCCTTTACTAGGGAGCTCCTTTAGTAATTTAGTATTTAATATTAAAAAATGATGTAAACTCCGTG 

AGACTGTATACTAATTTGCCATTTAACAGTTCCCATTTGCATGCGCTTTGATTGGTTACGTGCTAGTCTTGAGGTAACTAGTATATAACTAATCC 

AAATCACTAGCAAAGGTCAGAGTGCAGCACTTTTGAAAAGAGACCAAAATAGCGGAGTGGGAGGAAACCAGTCCCAGCCACAGGCCGTCATCAAT 

AATGCATGCTAAAAGCTTTTTGCATTATTTTAGGTATTTCTGGTGAGACTTCTCGGCCGCAGCCACACACTCACCGCCATCTGTATCTTAAAGAT 
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ACTTGTGTTAGCTTGTGCGAAACATCATAAATAAGCTCGGTGGAGCACGGTGGAGTGCCGGAAAGCTATAAAATTATGCTTATGCGCCTTAACCT 

CCCTCTAAGCGGATATGTTACATCCTTGAAGGATACACACATCTGAGCTGCGGCAGAGGGGAAAATCCCGAGGCTGCTGCTTATGTGGTGCGATA 

AATGGCTTTTTATTGAAATTGATATTGCCTGAAATATTCAAATGATCGGCAAGCCACAAGACCTCCGACCCGAAAACCTTTGAAAAATTTATGCA 

CCCAGCCTCGAAAGTTTATCCCCTGGGCGGGAAAAGCCCGAAAAAAGGGTTACCGAGAAAAGTCGCAATAAGATGCATGATTTGTCAGCTTAACA 

CCCGCCATTCGTTCGCCGGCATTTGATGGCTTTTGACAGCCCAACGCCATTTTCCTACGCCAACTTGGACACCCATAAACACAACTCAGTTTCAA 

TGGGAGCTGGTCGGGTTTAAAACGAAAAGTTATATGCACATTGAAAAGGTTTTCATACTTTAGAAAGTATTCCAAGGGATTGGGAAAATCCGACT 

TACGCTGCAAATCATTGAACATTTACGCTCCAGTTAACGAGTTGCGCGTGTATTTTCAAAATCATGTTGAATGTATCGCTAAAGTTTTTAGGTTC 

CTTAAATAAGTTAAGAACTGAGAAAGCAGTACATTCTGCTTTATGCAATGCTAATGAGCACACTTGAAATTAGATCATTTAAACATTTGCCACCT 

TTACTGAAACTACACGTATTTTTCTCTACCAGGAATGATATAAACAAATTGCTTTTTGCATAAATGTATTAAGATTTCTTTCAATCATGTTTTTT 

CGTTGTCTGCGATTAAAATGGCATTCAAAGAATTGAGTGACAGTTGATTAATTTGAATAGTTTTGTGGCATTATCAGTATGGTTTTAAATTAATC 

ATTGCTAAATTGTGTTAAAAATCTTTATTAATAATGAAATAGTTGCTATTTTCCCGGCGGCGTTTGGCCATTAAATTGCACAAATGCTGACCAGC 

TAATTTCCCTCCAATTTTTCACTTGACCTCATCTGCGGATCAGTCAGTCGGTAATCCGGTTTCCTAATTGTCTCTCGTGTTGCGTTGTTGTGAGC 

GAGTGGCCGAACGCCTTACAGTTTCGGCGCGCACTGTGCCCGCCTTATCGCGCCCAATAAACAACAAAACACACACAACGGCGACTGGCAGGAAT 

TTTAAAAGGTTCAGCGCCAGCACAGGCTCAGCCACCGTTCCGCGCCCGAATCCACAAATCGAGAAATCCACAAACCAAAAACCCAGAGATCCAGA 

CTCGGACACTGGGTCACTTATGCGTCCTCCGAGCGCGACTTTATGACTCAACCTCAATCCGAATGGAAATGGATGTCGCGTGCTTTGTCTACAAG 

AGGCGGCGATAAGGGCTCCTGCTAGACCGCCGCCCGAAACCTCAATCCTTGGCCATTAGGCAGCAAGGCAGCGAGCCAGCGCGTCAGCGGCCTCC 

AAATTGATTTCCCAGCGGGAGGGGTCGTATTTAGCCACTTAGCCATTTAGGCAGCCAGGTAACAGCTATGAGCTAACAGGTATCGAGTAGCCAGC 

AGCGCTAAGTACTCGCAGGCGCACAGGTGCCGCCGCGCAACTCCGGAAGCGGCGCTCGGAAATGGCCAGTTATCTAGGAATTTATTGTTTGCTAA 

TTGCCCGCCAGGACACTTACCGCACGCAGGGGAGACGGATACAGATGCAATCGCAGGACTCGCATCTAAAATTCGAGAGGGTTTCCTTGCTGAAA 

ATTGTATAACAAGCCCCTTGTTTAAAAGCGTTTCAAGTCTAAGAAATGATAAACATTCCCATTTTGTGAGTGCACTCAATTGCTTATCGGTTCTT 

AAT G AAC AT AGT T AC TT AAC AAAT T G AAGCT T T T C G AAT AAGC AAG AGC AT AAAT C T TT T AC TAT T GC C GGT AT T AT CT T T GG AT AT GGAAAGT T 

T C AT T AAAAC T T AAAT AAG AAAC AC C T T AT AGAAAAT GC T T C TAT AC TACT C TAG AT GC AC AT AT T AT T AATT T T T T GGAG ATC AT CCCTTCAGT 

TTATTATATGACTACAGGTGTATGTAGAGAGAATAAAGAGAATCTGAGCGAGCGTTATTAAGTACTTAACATCCCTCTCGTTACAGCTGCCTTTC 

CGATCGCCCGACGGATCGGACATCCCTACCAGAATCGCACGCCGCCGAAGAGGAAAAAGCCACGCACATCCTTCACGCGCATCCAGGTGGCCGAG 

TTGGAGAAGCGCTTCCACAAGCAAAAGTATCTGGCATCCGCGGAGCGAGCGGCACTGGCCCGCGGACTGAAGATGACCGATGCCCAGGTGAAGAC 

GTGGTTCCAGAACCGACGCACCAAGTGGAGGTGAGTGTCCAGATTTCCAATTTTCCCTTTCGCAGCGCATCCCGCTGGGCTAGAGGCAAAAATCA 

ATACCCATACCCTTGAGCAGCTACTGTTTTTCCGTGCTTGGAAGTGCTGACAGCTTGACACTTACCCTCTTCTATACCCCCCACCCCCCCGCTTT 

TTCGTTTTTTACGTGCCGTTTTAAACTACAATATGCACAAAAGGCTTTTGTTAAACAGACCGAGGGCAGAAAATAGCCGACGTGAAAATGCCCTA 

GAAATGCGTGCTTAGTAATTATTATTATTTTATAGTAATTTAATGATGCTCGTTCAAACATTTCATGTAATTTTATTTCTCTTTCTTTTTATTTC 

AAATAACAATCCGCCAAATAGAAACATTACTGTAGATTAAGTTCGTTCATACTAAATTTAGATAATTATTCTTTGTATAATTCAGAAACTTTCTT 

CTTACACGGCTAATGCCCAGTACGGTGTAACTCAACTAGATTATAGGGCATCAAAGCCCGACGCGCATAAACGGGGAGCAAATGGCTTATACAAT 

ATGCATGAGCACATTAAATGAACAGCACCCAGCCCCAAGAAAAAGGGGAGCATATTTACGAGGCTTACGGAGAGGGGTAGGTAGCTGCATTTTGT 

AGCCACAAAGAGCGCTCTTGTTAAACAAGCTAATTGCCAAGTTTTTTATGGCAGTTGCAATGCCTGGCTAACTATATGGCTTAGAGTCGCATTAA 

GGCAGACATAGCACTCGTATTAAAAAATTCCATTCAAAATCATAGTTATACGTTTTGTAGTTGCAATAATTACTTGGGCAGTTAATTCGATTAAC 

AT T T T G AT AC AC C AAT AAAGT GT T AT T AC GT ACT CT T GCG AGC AT AC T C TAG AT AAAT AT GAC AAT T T T C T AAT T AT TT AC AT T T T C C G AATT GG 

TTACTTACATTGTATTGTATCAACATTATTTCTTTTAGTTTCCTTTTAAATAATTGGGTGTACCAAAGCTAGGTGTCAGATGACAAAGGAATTTC 

TGATTTATGTGGCCCCTATTAGTAAAACTCTTGTACTCACTTTCACATGCAAATGACTATTTATTTGAAACCGAAATTAGTCAGACAAAACATGT 

GCCCAATTTGGCGGGGTATTGATGCGTGATATTGCGTGCGCCTAATTGGTTTCTTTTTTATTACCGTCAACTAAAAGTATTTGAATATGAAATTA 

TTTGAGCGGATGAGTTCGGATAGTCTTGCACTCATATGCGATTAGTGGCCGTCATAAAGACGATGGCAAATGGCGGCTAAATGGCCGAAGTTTCC 

CACTTTCCCAGGAAAGGGAGCAGGGAAAGCGGCAAGTCGGGAAAGCAGCAAGGCGGGAAAGCTGGCGGGACGGCGCTTAAGTCGCGTTAATTGCC 

GCTTAGCAGGCGGCGTTTTCAGGCATTTCAGACATTTTTAGAGCCCAGTCCTCCTGGCTTATCAGCCATTTTGGGCCTATTCCAGCTCGTAAAAA 

TTAATGATGTAAAACGCAATTTTGTTAACGGCAGCGTCACCGGCAACCGCCTCGGCCACGGCTCGTTTGCGGTTAAGTGAAATGGCAATGGCACA 

TTATAGTTGCCATGATTACGAGGCGGCACTACTCGTAAATATCCAGGGTATATGTAATGTGCTCACATAATTGCCTTAATGCACACTGCGCTGCA 

TTTGCCAGTAATTTGAATAATTAGCCCTGTCCAACGTGGCGTATGTGCTATGTGCCAGTTGCCCATTAGACAGCAAATAAAGATTGCGAAACTGT 

TGTCTCTTGCCCAATTCTTAATAAATAGCAACTCGTTTGCGACAAGGTGAGTGCCTCATTACGAGAATTGTCTGTATATTGAGTGATTCGATTCG 

ATTGGATTCAACAAGAACAGCAACACTTGCCATCAATTATTAATAACAAATTGTGCGGCCAAATGAATGAACTCCTGTTGCCCGTCACTGTCTGA 

CAATGTGTTTAGCAAAAAGGCAAGAAAAAAAGACGAGGAGCAATCTGCCAAAGCAGATACATACACAACGCAGATACAGATACAAATACGGATAC 

AAAAAAGTGTAGCACACTTCCGGGCGTTTTATCCATGTGGCTCCGAAACAAATGGCAGAGTCGGACTTCTGTCTGGGTCCTTTTCCCTCATTCTC 

CGGGTCTATAACTCAATTAGTTTTACACCCATAAATCAAGCTGCCGCTAAACCCTTTTTAATTACACGCTCGTTCATCATCCGAAACATTTCCAT 

CCCACTCAAAAATATTCTTGCAAAATCTTAGTTTAGCTTAGCTTAGCTGGTACTTCATGGTAATTGCCTGGCATGTGCGAAAGTTTTCCTGCTGA 

AAGCTGGCTAATTAAAAATTTACACACATGCCAGAAAGCAACTTTGACCGCAGAACAAAAATCAATTTGACTGAAAGCAAAGTTTTTGCGGCCGG 

AGCCGAAAAATCAAATAAAATTAATTACAATGGCGAGGGCCATTTGTAAATTAACTCGCAATCTTGTTTCGCCCGTCGGCACGTGTCCTGCGAAG 

AATCTTCAAGTTGGTGGTGGGCCGTTCCCTTGGGATCTTGGCCTGAAAGCTCCTTGGACGCCTTGGCCAGTCAATTGATGGTCTGGCTAAATTGT 

TATGTCTTTGGGGATAATTACCACAGGACCTTTTTGGGGACCGCTATCTTCATCTCGACTTGCCGACATCGTTCGCGTGCAATGTGTGAAATGAT 

TTTAATTACCAGGACATGACCGGAAAAGTCTTAAACAGCAGCTGCCTCCGTTGGCGACAATTGGTACCCTGGGGCGGGGGAGCTAAGGAGATTTA 

TGCGCCGGGTTTTAATAAACATTTTAGGCTAATGTCGGGCCATTACTTATCGCGGATGGCAACAGAAACTTTCCACATCAACGCGATGGCGCCTA 

GAGACTAACATTTAACATTTGTAAAGCGTTTGTTACATTTCCCAGCATCGTGCCCGAAGATAAATTACAATCGTGTCTCTGTCTCAACCCCAAAC 

AGCGAATTAAGCCCGCACACATCCCAAGCTCACAGAGCCGGCTTCGTTAGCCTGGCCTTTTTGGAGTTTTTAATCAATTTTCGTTGAATTTTTCA 

TGCGGTAGCTGGTAAAAATGCTGCCCGATAAAGCCAGCTTGAAGGCGAAAAGGTGAAAAAGTAAAAGCGAAGTGCTTGCTGCATAATAACTCACT 

TCCAACGCGAAAACGACAATTGAATTGTTTTGCCAGCAATTGGAAATCGGGGTGTTAGTTGTGTTAGTTGTGAGTAAGAAAGCAATTTGCAGTTA 

AAAAAAAAAACACTTTTCAGAATTACCTAAAATAAATGTATAATTGATTAACTAAAGATGTTATAGATCATATTAAATGAAATCATTTTGCTAAC 

TAGCCATAT AT CCACTTCTCATTTAAGTTGGCTGCT AGT CAGCCGTAAAGACAC AAGCT AGTTATCCAATTTGTGGGCCCAAAAGTTTGC ACT AA 

ACGCCCAGGCTAATGTATTTTTAATGAAGCCCACTTGCGAGTTACGAAAGCGTAGTTTGAACGATGACAATGCTGATGAAAACCTGAAAACCGTA 

ATGAAAGCATATTAAACAATAATTATGATATATGTTTGCCATGTGCTAGCCTCGCCTAGTTGTGGTTATAATTCGGGCAACCCCTTCCTCCGCCT 

CTTTAGTCGAAACATAAGAACTTCATTAGCTCATGAAATATTCAAGATGGCGCAGAACAACCCCTAGCTCTGCGGGGGTGGATATACCACTGGAT 

GGAGCATCGCTTGTGAAGTGGAGGTGGAGGGTCTCTCCGGTGCTCGGGCGGTGGTGGTGATGGTGGTGCGGGTCCATGGCCACAATTAGAGCAAC 

ATGCTGCGCAAACGCGGGCCGGAGTTGCTGCATCTGCAGTGCAATTCAGCCCTGAATTATTTATCAGCTGGCAGGGCACAAGTTGGAAGTTGGGG 

GAGTTGGCCAAATCCCTTTAAGCTGGGGGCATTCAAAATGCAATTTACGTACACAGCTCCAGCTCGGCACAAAATTAATTTTCCATTTTCCGCAG 

AAACAAGCACAATCGATGGAGCACTGTGCCTGTACAGCAGTCAGCATATTTGATTTGATTAGTTTTAAATGGTCTTTGTAGGCCGCCCACATAAT 

GACTTGATGAGCCCCTGATTAGGCCCCAATTGGACAGGTGTGAGCCACTGAAATGGGTTTGCGGCAAATTGGTTTTCCATAATGAATATGTACAT 

TAGGCGATACTATTTGCCAGATAAGATTCACGGTCATCCAATGTATCGATTGAGCCGGGGCAGATGTTGCAACTTTGAAATTCGTTAAATAAATT 
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CAATCCGAACCCTATATCCTATACTTCCCATAAGCAATTTCGTAGTAATTAGGGGGAATATGTCAGAATTTAAATGTTAATCAAGGAAATGAATT 
AAGAGTGGATCAAACTATCACAATTTTTAATGTATTCTAGCAAACGTCACCACTTTCATTTTCTTTGAACAACTTAAGTTTCATAAAAAGCGTAT 

AG T AC AG AAC T T C C AAAG AG A A C T T GAG C CT T AC GAAA C AC AAGT T AT CC T T AT T A AGC CT T AAAC CAT AT T T C AAT AAT AT T T AAAT AG AT T C T 
TAATGAAAGTTTTATCCATGACCTGGCCAGTTACTTACCATTACCTTCTTTTTCTCCATTTCAGACGACAGACGGCCGAGGAGCGCGAGGCGGAG 
CGGCAGGCGGCGAACAGACTGATGCTGTCCCTCCAGGCGGAGGCCATCAGCAAGGGATTCGCGCCTCCTTCGGCACCCCTGGGTTCCCAGGGGGG 
CGTGAACGGGGCACCGCTGGCTGCGTTGCACGGACTACAGCCGTGGGCAGAGGCGTCGCATGCGGCGGGCTGCTAATCGGGGCGCCATCGGCGGA 
CGGAGCCTGAAGTTTGGCGGGAGCTGGTTTATTTGTGATACACTTCGCTTTGTACATAGTTCGGATGAACGGACAGTTTGCACTGTACTGTACAT 
AGGAAAGAGGGGCGTCGCCTCGAGCGCCC ACT GT AAAT ACCTAGCTTTACTCTGTAGTGACGCTTACCTAACCGATGCCGGGATGACGAGGATTT 
GCGATGGTGACCCCGAAACAGCGCTTTAAGTTTGTGTAGAATCTGGAATTTCCGTTTTGCGTTTCGTAGCGGTGCATTTAGGATCTTAGTTATTG 
TTCGACAGTTTCGACAGTTCGGCAGAAGCAAACCGCATCTCAAAACGTCGCAGTGAGCTTGGAAGCCACATTTAGTAATAATATATTTAATATAC 
AAAATTATATACGGATTGTAATTGTAATGTATTGTAATTGTAACATCTGTCTGTAGGCTTAAGCGAATGGCATTACCTAGTATTATAAATTATGA 
ACCCTAAAAACATGTACGATTATTGAAAAGTAGAGAGGATGAGCAATGAAAATAAACATATTATTGAATCAAAGAACAGAGGTATTTGGTTTTGG 
CTTTTTTACAATAGTATAGAACTGGATCTTTCAGTTCTGGAGAATAAGTTTCTTTTGGTTATTTGACCACATCAATTCTTGTTTGGATACCAAAT 
AGGTAGCAATTGATAAAAAACTCCCCCAAAACAAACGAAATCAAAAGTCAGGGGATGTTTCCCTGCCTTTTCAGTTGAGTTCCGCATTTAATCGG 
CTCTGTCTCAACCCTTAACTAGTACAAATACCCCATTGAAGATGATTTATAGACGCTTATTGACAGGACTGGGTGTTTCGATCAATAGACCAATT 
TTGATAGAAACCTGCTCCTCAACCGGATACCATTCGACGTGCTAATGACTGGGATGCTCATAAAAAAATCAGCCATATGCTCGACCCCGAAAAAC 
C AAC AAC AAT T AG CGGC G AAT AT CC G AGG CC GT T C AT T AT AAGTGT AT T AC GT GTTTTTGCGCT G AAAGC AAAAC AAAC AGGCGC C GC C GG AC AG 
GTAGCCCTATACTTAGAG 
(SEQ ID NO: 892) 

Exon: 1001.. 1176 
Exon: 3000.. 3415 
Exon: 7877.. 8105 
Exon: 12795.. 13078 
Start ATG: 1103 

Transcript No. : CT23946 

AAAATGTCCAGCCACGAGGAGGATGACCATGAGCACGAGAACGAGCACGGCGAAGAGGTGGAGGAACAGGAGATCCACGTGGATGTGGACTCGGA 
CTCGCGGATGTCATGTGGCAGCGGATCCGATGTGGATATGGACGGCGGCAGCTGCTACGACGAGTCCGAAACGCCATTGAGCGAGTCCCTGCAGT 
CGGAGCAGACGCGCTCCTCGTCCAGTGAGAATCTGCCCTTCAGCATATCACGCCTGCTATCCAAGCCCTTCGAGACGAGTCACCACCACCACAAC 
AACAACAACCACCTGCTCAGCAGCAGTCCGGGCTCCAGCAGCAACAACAACAACAGCGGCGAAAAGGGCGAGGAGAAGGAGCTACTGCAGCAGGA 
GGACCACGACCTGGCCTATAAGCTGGCCACCAGCATCGCCAACTCGACATACGGAAGCGCCGCAGCACTATACTCGTATCCGCATCTCTATCCCT 
CAGCGGCCGGCGGTCATGTGCTACGTGTACCGCCCCAGAGAACGCCGCTCACCTGGGCACTACCGCCACTGCATCACGCGGCGCTGGCACATCAG 
GCGGTCAAGGATCGGTTGGCAGCTGCCTTTCCGATCGCCCGACGGATCGGACATCCCTACCAGAATCGCACGCCGCCGAAGAGGAAAAAGCCACG 
CACATCCTTCACGCGCATCCAGGTGGCCGAGTTGGAGAAGCGCTTCCACAAGCAAAAGTATCTGGCATCCGCGGAGCGAGCGGCACTGGCCCGCG 

G AC T GAAG AT G AC C G AT GC C C AGGTG AAG AC GT GGT T C CAG AAC C GAC GC AC C AAGT GG AG AC G AC AG AC GGC C G AGG AGC G C G AGGC GG AGC GG 
CAGGCGGCGAACAGACTGATGCTGTCCCTCCAGGCGGAGGCCATCAGCAAGGGATTCGCGCCTCCTTCGGCACCCCTGGGTTCCCAGGGGGGCGT 
GAACGGGGCACCGCTGGCTGCGTTGCACGGACTACAGCCGTGGGCAGAGGCGTCGCATGCGGCGGGCTGCTAATCGGGGCGCCATCGGCGGACGG 
AGCCTGAAGTTTGGCGGGAGCTGGTTTATTTGTGATACACTTCGCTTTGTACATAGTTCG 
(SEQ ID NO: 893) 

Start ATG: 103 

MSCGSGSDVDMDGGSCYDESETPLSESLQSEQTRSSSSENLPFSISRLLSKPFETSHHHHNNNNHLLSSSPGSSSNNNNSGEKGEEKELLQQEDH 
DLAYKLATSIANSTYGSAAALYSYPHLYPSAAGGHVLRVPPQRTPLTWALPPLHHAALAHQAVKDRLAAAFPIARRIGHPYQNRTPPKRKKPRTS 
FTRIQVAELEKRFHKQKYLASAERAALARGLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQAANRLMLSLQAEAISKGFAPPSAPLGSQGGVNG 

APLAALHGLQPWAEASHAAGC* 
(SEQ ID NO: 894) 

Name: CIS 

Classification : transcriptional actor 
Gene Symbol: C15 
FlyBase ID: FBgn0004863 



Celera Sequence No. : 142000013384668 

AGGATTATGACAAGTGCTGGCAACTGGATTTCCGAGGAGCTGGAACCAGCTACTCTACTGTCGATTCAGGAGGATAATGGTATTGTTACCAAGGT 
GTACGTGGCGCCCTGTTTGCAGCCGATAAAAGAGGAGAAAGTTTTCCTGGAGGACAACACGCAACGCTTCCACTGTCCGCTTTGCTCTAATGTGA 
TTCGCACGGCCGGTGCCTACAAGGTCCATCTGATGGCCTGCCAAAGACGCTTGGCACGCATGATGAAGGATGTGCCGGACGTGTCCCAGCATCAG 
TGCAACATTTGCAAGAAAATCTTCTCCTCGGTGGATGCCCTGATCGGCCACAGGCTCCTGCATGGAAGCCCATCAATGAATCGCACCTGTGCATC 
CTGCCATGTCAAGTTCGGGACGGATCTGGAGTATCGCACCCATTTGGAGAGCCATCTGATACCGTGCGAGTCCACGGATGAATCCTACGAAGGGG 
CGTACACCATCACCAAGACCTTCAATTGCGTCTTTTGTCGCACGGAGTTCAAGGCCTGCTTTAAGCCTGGCCAAGTGACCCGTCGATACGCGTGC 
GACGCATGTATCGTAAGGCTGAAGGCCCAGGAGGAGGAGAAAAAGATCTTAGGGAAGAGAAGGCCCGAATTAATCTGTGATCGTTGTGGGAAAAA 
GTACAAATATGAGGGTTTCCTTCACCGTCATCTGAAAACTTGCCAGATGCCGGATAAATGCAAAAGAAAGCGTGAGATTGAAATCACCGAGGTCA 
GCGAGGTAACATTTACAGAAGTTGTGCAAAGTTTTAATAATTGATTATACAAATTCGCACATTTTAATTTTAAATTAAATAAATATATTTTTTAA 
TTGTAATATTTTTTATAATTTTCAAATTATGGCGCCCATCCCACCGATATCTATTTCGATCTGCTATCGATGTGGTCACACTTATCGATAAATCC 
ACCGTGGCGTGGTATTTTTGATTTAAAGCGTTAGTCGATTTCCCGCCCAGCTGCACGTGTATATTTTTTTGCGGTCCAAAGAAAAGTTCGTAAAT 
AAACGAATTTGCGTGCATTTCCAGCGAACACGATGAGTTCAGACGAGTCCAGCGACGGTTTGGAGGAGCTGCAGAGTCTGAAGGCCCTCTACGGC 
CAGCAGGAGCAAGAGAAACCGGCAAAGATCAAAAGGGAACGTTATATTCCAAAGGCCTCGCAGGCCAAGGAGCTCAACTATGTGGAGGTGCCCAT 
GGAAAAGGTGCTCTTCGGCGACAGGCAGCGACTGCTCACCAATCTAGCCAAGTCCGTGGGACAAAAGTTGCCGAACGATGACGAAGACGAGCAGG 
AGGAAAATCCCGGCCAGGCAAAGCCAGGCGACAAGCGGAAAGCCGCCTGGTCGGATTCCGATGACGAGGACCTGCAGGTGGGCGATGTCAAGAAG 
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GCCACAAAGCACACGGGTCCACTGAACCACCTGCGCAAGGATAAATCGTACAAGGAGTATCTTACGGCGCGATTCCAGCGCACGCTTAACCAACC 

CAAGTGGGCGGAAAAGAAGGTCAAAAATGAGGACGACGAGGACGTATCCTCCGACGAAGAACTGCTGCGGACGGTTGGCTTTATCGATCGTAAAG 

CCCGGAACAGTGATCTGCCCCAGAAGACTCTGAATTTTAAACGGGTCAAAGACCTGAACAGGGCCACATATGCCGAGGGTAATGCCACCAGCATT 

CAGTTTCATCCAACCAGTACTGCTGCTCTAGTGGCGGGCATGAATGGCCTAGCCACCATCTACGCCGTGGATGGGCAAAAGAACGAAAGGTTGCA 

CAACATGCGCTTCAAGMGTTCCCGCTGGCGTGCTCCAGAATTGCTCCATGCGGCACAAGGGCGTTCTTTGGTTCTGTGAAACCATTCTACTACT 

CCTACGACTTGCTGGAAGCCAAGGAATCGAAGCTAAAGCTGCCGGGCGCAATGGAATTTATGCACCGCTTTGAGGTGTCTCCATGTGGCAAATTT 

ATCGTCACTGCTGGCAAGTTTGGAGCCATTCACCTGCTCACGGCCAAAACAAATGAGCTGCTGCACAGCTTCAAGCAGGAGGGCAAAGTGAAGGG 

ATTCACATGGTCTAGTGACTCCAAGAGGATCCTCGTCTGCGGTTCCACTTCCAATGTCAGCGTGCTGAACCTGCGCCAGAACCTCATCGAGCACA 

TCTTCATGGACGACGGCTGTATACATGGAGAGTCCATTCAACTGTCTCCCAACCAACGCCTACTGGCCACCGGCAGTCAAGAGGGCGTGGTCAAC 

ATCTACGACTACGAGAGCATTTTCGCATCCAAGGCCCCACAACCCGAGAAGCGCTTCATGAACCTACGGACTGCCATCACGGATCTCCAGTTCAA 

TCACTCATCTGAGCTACTGGCGATGTGCTCCAGTGAGGCGCCGAATGCTTTTAAGCTGGCCCATTTCCCCAGCGCCACTGTGTACTCCAATTTCC 

CGGCACAGAACGAGAAGGTTGGCTTTGTGACCTCCATGGCCTTTTCGCCGCACAGCAGCTTCCTGGCCTTTGCCACTAAGGGCAAACAGGTGCCG 

CTCTTTAGGCTTAAGTACTTCAAGGGTTATTAATTGAAGGCAGTATGATGTTTACAATAAATTCTTAAATAAAATCATTACTCTTTCTGTGGTTA 

CAATGGAACCATCTTTATTTAAGTTAACAAATTAAGGGGACTTATCTTTAGAAAGGAGCTCGGAAAATATCGCGTTTCTGGGAGTCTTGAATCTT 

GCCAATGTCTTCCAGCTTCTTGCTGATGTTCGTCGACTGCGATTCGATCCACTGCAACGAATTCATATGGGCATTAAGGATTTTTCCGATCTGTA 

TGATGGGATCAGTGGTGTCCTGGCCCTTGTTGGCCTCGTTCAAGTTGTCTATTATCTCCTTTAGATCCTCGGACATCTGCTTCAGTTGTGTGTCC 

AGATTCTCCACCATCAGGTAGGTTTGGCTGCGTTCCATGTCCACCCTCGGGAGATTAACGAATTCCTTCTCAAGCGGTCCCAGGCTGTCCTCAAG 

CTCTTTTTGCTGCGTGGCTATAAACTCCAGCTCCTGGTCCAGCACTTGCTGATCGGTTTTCACTTTTTTCACGGCATCGTTGAGCTCGACGATCT 

TCCCGTTGTTGCTGATGAGCAGTTTGTCCCATGCGTTGATCTGAGTGGCCTGTTCGGTGAATACCTTTTCCTGTTCCTCGAACTCGAGAGTCCAC 

TTGTTGATATGCTCCTCAAGTTGGTGGTAGGACAATTGGGATGCCGTGGAAACGGCCGATGAGTCTGTGGTCTTGGTTTCGGTAGTGAGATTGGC 

AAAAGCACCACCAGTACTTAAAGTTGGGGCCGGTGCAGCTCCCTGCGTGGAGGCTATGGCCGACGGAGCTGCAGTCAGAGTTGTGGACGCAGCGG 

CCGTGGTCTTGGGAGCTCCCAATCCAGACCCGCCCAGCAATCCAGTTGTGGTAGCTGCGGTGTTGGGTGCACTGGTTGCTATAGTTGGCACCACA 

GATGCCGCTGGTTGAGCCGTAAAGCCTCCGAAGGCGGGAGGAGCAGCAGAGGTTCCAGTCGCGGTCGCAGAAGTGGTGGACGC 

(SEQ ID NO: 895) 

Exon: 1001.. 2598 
Start ATG: 1078 

Transcript No. : CT24010 

CTGCACGTGTATATTTTTTTGCGGTCCAAAGAAAAGTTCGTAAATAAACGAATTTGCGTGCATTTCCAGCGAACACGATGAGTTCAGACGAGTCC 

AGCGACGGTTTGGAGGAGCTGCAGAGTCTGAAGGCCCTCTACGGCCAGCAGGAGCAAGAGAAACCGGCAAAGATCAAAAGGGAACGTTATATTCC 

AAAGGCCTCGCAGGCCAAGGAGCTCAACTATGTGGAGGTGCCCATGGAAAAGGTGCTCTTCGGCGACAGGCAGCGACTGCTCACCAATCTAGCCA 

AGTCCGTGGGACAAAAGTTGCCGAACGATGACGAAGACGAGCAGGAGGAAAATCCCGGCCAGGCAAAGCCAGGCGACAAGCGGAAAGCCGCCTGG 

TCGGATTCCGATGACGAGGACCTGCAGGTGGGCGATGTCAAGAAGGCCACAAAGCACACGGGTCCACTGAACCACCTGCGCAAGGATAAATCGTA 

CAAGGAGTATCTTACGGCGCGATTCCAGCGCACGCTTAACCAACCCAAGTGGGCGGAAAAGAAGGTCAAAAATGAGGACGACGAGGACGTATCCT 

CCGACGAAGAACTGCTGCGGACGGTTGGCTTTATCGATCGTAAAGCCCGGAACAGTGATCTGCCCCAGAAGACTCTGAATTTTAAACGGGTCAAA 

GACCTGAACAGGGCCACATATGCCGAGGGTAATGCCACCAGCATTCAGTTTCATCCAACCAGTACTGCTGCTCTAGTGGCGGGCATGAATGGCCT 

AGCCACCATCTACGCCGTGGATGGGCAAAAGAACGAAAGGTTGCACAACATGCGCTTCAAGAAGTTCCCGCTGGCGTGCTCCAGAATTGCTCCAT 

GCGGCACAAGGGCGTTCTTTGGTTCTGTGAAACCATTCTACTACTCCTACGACTTGCTGGAAGCCAAGGAATCGAAGCTAAAGCTGCCGGGCGCA 

ATGGAATTTATGCACCGCTTTGAGGTGTCTCCATGTGGCAAATTTATCGTCACTGCTGGCAAGTTTGGAGCCATTCACCTGCTCACGGCCAAAAC 

AAATGAGCTGCTGCACAGCTTCAAGCAGGAGGGCAAAGTGAAGGGATTCACATGGTCTAGTGACTCCAAGAGGATCCTCGTCTGCGGTTCCACTT 

CCAATGTCAGCGTGCTGAACCTGCGCCAGAACCTCATCGAGCACATCTTCATGGACGACGGCTGTATACATGGAGAGTCCATTCAACTGTCTCCC 

AACCAACGCCTACTGGCCACCGGCAGTCAAGAGGGCGTGGTCAACATCTACGACTACGAGAGCATTTTCGCATCCAAGGCCCCACAACCCGAGAA 

GCGCTTCATGAACCTACGGACTGCCATCACGGATCTCCAGTTCAATCACTCATCTGAGCTACTGGCGATGTGCTCCAGTGAGGCGCCGAATGCTT 

TTAAGCTGGCCCATTTCCCCAGCGCCACTGTGTACTCCAATTTCCCGGCACAGAACGAGAAGGTTGGCTTTGTGACCTCCATGGCCTTTTCGCCG 

CACAGCAGCTTCCTGGCCTTTGCCACTAAGGGCAAACAGGTGCCGCTCTTTAGGCTTAAGTACTTCAAGGGTTATTAA 

(SEQ ID NO: 896) 

Start ATG: 78 

MSSDESSDGLEELQSLKALYGQQEQEKPAKIKRERYIPKASQAKELNYVEVPMEKVLFGDRQRLLTNLAKSVGQKLPNDDEDEQEENPGQAKPGD 
KRKAAWSDSDDEDLQVGDVKKATKHTGPLNHLRKDKSYKEYLTARFQRTLNQPKWAEKKVKNEDDEDVSSDEELLRTVGFIDRKARNSDLPQKTL 
NFKRVKDLNRATYAEGNATSIQFHPTSTAALVAGMNGLATIYAVDGQKNERLHNMRFKKFPLACSRIAPCGTRAFFGSVKPFYYSYDLLEAKESK 
LKLPGAMEFMHRFEVSPCGKFIVTAGKFGAIHLLTAKTNELLHSFKQEGKVKGFTWSSDSKRILVCGSTSNVSVLNLRQNLIEHIFMDDGCIHGE 
SIQLSPNQRLLATGSQEGVVNIYDYESIFASKAPQPEKRFMNLRTAITDLQFNHSSELLAMCSSEAPNAFKLAHFPSATVYSNFPAQNEKVGFVT 
SMAFSPHSSFLAFATKGKQVPLFRLKYFKGY* 
(SEQ ID NO: 897} 

Classification: hypothetical 



Celera Sequence No. : 142000013384534 

TTGCAGGATGTACTCCTTACGCCGTTGGATGACTGCCGCATCGAATCTCTTGAAATATGTGCTATCTGTGGGCACTGGCAACTTGCCGGGTAGTT 
GGAGACTCTTGTGGCGCCGGCTAAGCTCGCGATGCAGCCGCTTGACATCGTGGAACCGTTTCCACACCACCAGGCAGGTCAGAGCTTGAGGTACC 
GACCTTGGGAAGACCTAAAAGTGGGTGAGGTTTACTATTTGGAACAGTTCATCTGCCCCCAAAAAAAACGACTTACAATCGAGGTGATTTTGTAT 
ATTGTGTACCCGCCCTTGTGCTCCTGGGTGTCCGTCACTGTGAAGCTGTGGATCCAGCCGCCCAGGTTGAGAGTCATTCTGGTTTCTGCTGCAAG 
ACCTGGTGGTTTAGTTTCTAAACATATTTCCCATATCAAATGTCGTTGTTCCCCAACTTTTTTGCAACATTAAAGCTAGTGTGACCAAAGGACGC 
CAAAATAACCACTTCCGGACTTCGAAATTAAATAAGAGCATTTAGTAGCTTATTTTTTTAACTTAGAAGGATTAAATTAAACCATAAAAATATTA 
ACTGTTTTTTAAACACTTTTATTTAAAATAAAAAACGGTCACATTTTAAAATTTAAAATTTATACTCTAATCGAAAGTATCATAAGTATCGATAG 
TCTGCAGTGCATGTGGGTTATGTCATGACTAATGACAAATTGAATTGCACAGAAAAGGTGGAAGTCCCTGCTAACGACATTGCTTCATGTGTTCA 
TTTAATACAACATATTTATAGCTATAGCTTTTTTAAATGTGTC7AATTTACTATAACATTTTAAAACTTCCGAATAAAAGCTTCACACAAACGGAT 
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TTTCACAAGTAATATCCAGATAGAAATACCATTTTAAAAAGAAATTTTTCCGATTTTACCACCATCGCTTGCGCATGCGCACGGAACGCGTGATG 
CGTTTTTAAGTCCGTCATTAGGCCATCGCCCATCAATTTTAGTGTTTCAACCGCCGCCCGATACGGTCACACCGCTCGAGAAGTAGAGAAAAAAG 
AAGGCGACGAACCACAGGCAGCAAGCAAAATCGACGCCGCACGTACCGGGAGCATTTAAATAATTTTTTCTGTGGATTAATTTTACCGAAAAGGC 
TTCAACACAATGACTGAGCGCGAGAACAATGTGTACAAGGCAAAGCTGGCCGAACAGGCCGAGCGCTACGACGGTGAGTAGTGATTCCGAAAAAA 
AATGGTAGCAAAAATGGTAGAAAACGGTAGACAAATGCTAGTAGAGTAGGGAAAAAATTTTCTCGACTGAGGAGCGCCAACGTTTTTGGGGAGGG 
GTCGCCTGGGATTTTTTTTTGTCTTCCCGATCCATTGCGTCGTCGCGTGGTCGTTGAAAAAGTCTAGAGCTCCGCTTTTCGTCCGAATGTGCCAA 
TTAAAAGGTTTTCTTCCAGAGAAAACGATGGAAATTTTCAATTTTCATCGCTGTAGCTGCATTCGAAGCGACGCAGCGTCGCGATGAGTAGTATT 
TCTTGCGAAAAGAACGGGAGATGGAGGAAACAGTAAGAAGAGGAAGAAGCCGAAGAAGAAGCAGAGCGTCGGAAGAACGAAAAGAGATGAATAGA 
GTGTAAAACCTGATTTTTACCGGCAAAAATTCGCGTGGTTTTGCACAAAAAGAGAAAGAAACGGAATTGCGTATTATAATTTACAGCAAATAGTT 
TGCTCCGCTCTTTCTTTTTTGGAACTATATAGTATTAGTTTAGAAGGGGAAAATTTTGTGTATTTGGCGTTGCGAGCAAGGGCTGCGCGGCAGCT 
AGCATTTTATTTTCTTCCTTGGTTGCTAAGGGCTGTTTGCAGTTGATTCCAGTTCGCGTTCTTTTTTTTTCTTTCTTAGTAGTTGTCAAGGCGTA 
ACTTTCTAGAAACGACACATTTTTTGGGGATTCTAGTTTCTTAAATACACTTATTCCACACCCAAAATCAATGATTGTGGGGCAGGGGTAGGTGC 
AGTCCTCTCTTTCAGACCCCCTCCCCCTTTTTTTCCACGAAATGTGCTTTTCATGCGCAGCAGCGGCGTCGTCACCATGCCGTTGCTGCGAAAAA 
TCGATGAATGAATCGATTGTGTCGCCGGCTGTCTCCCTCTCTCACTTTCTGCCTATTGCTTAATTCTTTATTTATGCAAATTAGTTGCGGCTAGT 
TGCAAGCAGTTATGCGGCCACTTGGCAACCCTATCTGATAATGCGAAAAAAGGAAATATAAAATTTTTCTGTGCTTCTTCTGTTACTCTCGCATT 
ACTCATGCTTATGCGCATGTATGCATGCGTGTGTGTGTGTGTTTGTACGGAGGTGTGCGTGTGCCGATGAGTGTCCGCCGGGCGGTTTTTATTGT 
TGCTTCTTCTGTAGCCTTTGCTTATATTTAATTAAAAATACTCCAATTAATTAAAAAGCCAGCTCAAAATGCAAAGAAAGTCGCAGTGGGAGGGG 
GACGGCAAATTCGGCATTTGCAAAAAGCGCGCGCTTAACCAATTGCCATAAACGCTGCGTGCGCCAAAACGAGATGGCTAGTGTGTCGAAGTGTC 
GGCGACGCTGCACAGTGGCTGTAAAAAGTTAACAAACATTTGGCGATTCCTCTGTCTGCCGTTCCTTCCTGCTGCTGTACTTGGGCAGCGACTAT 
AGCAACAATAATTGCAAACGGTACTCTGCACCTAATTGCCTGCAAAACTGACAACATTAACGGGCGGTGGCGGCGGCAAAGAAGTCAATCAACTA 
TTAGCACACACAAACAGGGCATACATGCATACATACACCACTGGTGGGGATATTTGTGTCTGTATCACATGTTTGTCTTCTTATTTTCGCCCATC 
GGTGTGCGTTGGTGTGCAGCACTTGGCGTTGGCCTTGTCACACTCGGACATTTTGCGGTGGAGCTCATGCTGGAGCACCAATACCATTGCGCGCG 

AAG AGAG AGAAAG AGC GCGGC GGAG AGC G AAGCGG AGC G GC AAAGC CG C C AGC C GGC AAT GT C T GC GCC C CC AAT C T G AAC T TGCCTCGCCCTCT 
CCGCCCCTGATCTCATCTCCTCTTCAAACCCCTGCTCCCCTTTTCTGCACACATTAACGTCAGCCTTTAAGTGTGCTTTCTCAGGTGCTGCCCCC 
TGCGCCCACCATCCCCCGCTCCATGCTCTTTCCATCTTGCGCTCTCTGCGTTCTATCTACATTTTTTTCGAGGTCGCGCGCTGCTTTTTCCGTTG 
ATGTTCGTTCTCGTCAATGTCGTCAATATGCGCAAAAGGCAGACAAAAAAAAAATGAGTGGAAAAAGTAGCATACATACGGTTGATTGATGGGCG 
GTGGGTGGCGGTGGTGTTAGGTGTGTTTTGTTTGTTGCTCGCTTTTTGCTCAAATTGCACCATTTTAGGGGCTGTTGTTGCTGCTTCCTTTGTCT 
AATTACACATTCTCTGCCCGCTGTAACGGTATGTGCGCATGTTTTTTCAGCTGCTGCGCCCTTATGCGACTCACACACACACACATACACACACA 
CACAAAACCACGCGCACTCATACCCACACGAGCGCAAGACACACCCACAGGCGTCTCATAAGGGAGGGGTGGATCTCTGGGGGCGTGTTGGTAGG 
AGGGGGTGGAAAATCCTGCCCATTCCTTTTGCTCCCTATTTCGTCCCAATATTTCATTCCAGCTCGCGTGACTTCTTTTTACTTTTGGGTGGTGG 
TGGATTGCCTTTGCAGGGTTGTCGCTAGTTTATCAAATTCCAATACTTATTTCTTGGAAAAACAACACAATTGAGATGACCACGCTCATCTTTGC 

AAT C AACT T GTT T AT GAAAAAGAT T AAT CAT T T T T T AT AAT AAAAAAG AT G AAT AC T T T T C AAAT GT AG T T C AACT ACT C ACG AAT C T AC T T AG A 
ATCTATTCCATACTATTTTCGCCAAATTTGCTAGACTTTCGTGAACTTTGCACTTTCTACTGAAAGTTCATATGATTCGAGAAGTTTCCATACAT 
TTCTTTTTACCACTTCGTGGTTCCGCAAATGATTTATCATGCCCGCGGGACTTGGCTCTTTATTTTTGGGACGCGTTTATTGATTTTCGCCTTGC 
TATATGTACATGATTTAGTGACTGGTCTGACGCGCATGTCTCATAATTGGTTGTGGATTACTTACTACTAAGCAACATGTGCTGTATACTTTTAA 
AGTTGCAAGGATCCCAGCCACCCTTCACCACCCAACTGCGCTCCCCCTCCCCTTCCCCCTCCCTACCCACTCTCAAGTTCATTTTTTATGCGTAA 
CTCAGCATAGCTTGCAAGCGTTGTAAAATGCGTGTGCACATCATCATCCACATTGGTTTCAAACAGCCCGCCAATGCGAACAACAATGTTTGAAG 

T T C AAT C AC AGC C AC AAAT AT T C GT G AAT C AT GC AAC T T T ATT G GC AAT T T T T C AC AAAT AC GT GC T AC AAC AAAAAT C GC AC T AAT AAG C AG AC 

AGTCGGCAAATGTCCCGATGTCTTTGTGTGTGAGTGCTAGTGTGTGTGTATGTATGGGGGTGTTGGTGTGCGAAATACGTCAGCAGCTGACGGGC 

AGCGGGTGCGAGCGAGAGAGGGCAGACAGTGCAAGTCAGCAAACGCGCTACAATTACAATATTTTGAATAGTAGTGTAGTCTAGCTCTGTATTGT 

TATCAGCCACCAGATATGCCCGAGATAGTTAGCCTGCAACTGGCTTTACCCAGACAGGCAGCACTGCAATGATAATTTAGATGATTTGGTTCGGA 

AATATCTTACATTTCCTTTGGTCAGATATCAAAAACTAATCTTGCATAATTCCAAATACTAATCATATGCCCGAAAAGCATTTAAATGAACTTCG 

AAATATATTTAGACTTCAATTGACTGACTCGAAAAAGGAAATGAAATGGCCAAATGATATCAGCTCTTAACCTACTCACTTTAATTTGATAAACG 

AGCTCCTTGTTTGCTTCCGTGTTGAGTTCCGAACATGTGATTCATGACGCAGTTTTAGATGGTGATGCAGACGCAGATAGAGATGCCCCCGATGG 

GGACGATAGGATTGGGTGCAACAATGTGGCGGAAGCCCTTATCACCGGCTGATTGGATGGCCCCGTATTGGTATTGGTATTGGTATTGGTATGAC 

AGAACCAGCTAAATCAGGCGTAGACGCAACGCCACCTTATGCATAAATATATCTGCATAACTTATATATCATCGCGGATTGGGCGTGTCGGCGAT 

AAGAAATGGAAAATAGGTAGTCTAGGCTGTCGCGAGGGGACAAGGACAAGCGGGAGGAAAGTCCAGATAACAAGCCTTCCATACCTTGGATAAAT 

TTATCTACGCAAAATGTATATAATGTATGTATCTTGTTGGCTTTCCAATTGAAATGTTTGGTTTTTATTTTTAATCTTAACATGAAATATATGAA 

AGCAATTCAAAAATTATATGAACAATCAAACAAACATTTGTATTCTACGAAATTGTTGATAAAATAGCGTCCGATAGCATTGTTTAAACTACATC 

ATAAACATTACTTCTTACTCACTCAAATGAAGTATTTGTTTAATGATGTATATTTTCCTTAGGCATTGCAGAATGAATATACATATATGCGAACT 

AATGATTAAACTAATCGTTAGTTAATGGGCAATCCCTTAAATTTGGTTTGAGATGTTACACTTGATTATTGTAAACATTTACTCTTATATACTAA 

AGACCACACCTTTCATTTGTTATGTCCTCTTTACAGAAATGGTGGAGGCCATGAAGAAGGTCGCCTCCATGGACGTAGAGCTGACCGTCGAGGAG 

CGAAATCTGCTGTCGGTGGCGTACAAGAATGTGATTGGAGCACGCCGTGCCTCGTGGCGCATCATCACCTCGATCGAACAGAAGGAGGAGAACAA 

GGGGGCCGAGGAGAAATTGGAGATGATCAAAACCTACCGCGGACAGGTGGAGAAGGAGCTGCGCGACATCTGCTCGGATATACTGAACGTGCTCG 

AGAAGCATCTCATTCCATGCGCCACATCCGGCGAAAGCAAAGTATTCTACTATAAGATGAAGGGCGACTACCATCGCTACCTGGCCGAATTCGCC 

ACCGGCTCCGACCGCAAGGATGCGGCAGAGAACTCGCTGATTGCCTACAAGGCGGCCAGCGATATTGCCATGAACGATCTGCCACCAACACACCC 

CATCCGTTTGGGCTTGGCATTGAACTTCTCGGTGAGTGATTGTGAAGAGACACGATGTCGTCATATGTTTTCTATATATATATTTTAAAATCCTT 

GCAGGTGTTCTACTATGAGATTCTCAACTCGCCGGACCGCGCTTGCCGCTTGGCGAAAGCCGCTTTCGATGATGCCATTGCCGAGTTGGATACAC 

TGAGCGAAGAGAGCTACAAAGACTCGACACTCATCATGCAGCTGCTGCGCGACAACCTCACATTATGGACGTCCGATATGCAGGCAGAAGGTAAG 

TTGTAATATCACATCCACTTGTTCGGCTCCGTTATTAAGTTCTAAATATTATACTAAGAAGTAGACCCCAACGCAGGTGATGGCGAGCCCAAAGA 

GCAAATCCAGGATGTTGAGGATCAGGACGTGTCGTAATTTAAGTAAACCCCCGCCCTTCCATTTAAAACACATTATATACCTTCTAAAGTAAATA 

TAATACCATATAATATATATGCATACAACAACAACAACAAAATCAACAACAACACGACAACAACTTCAAGAACAACAACATGGAACCAATGCGGC 

AACAACAACCATAGCAGTCGATCACCACAACAGCAGCACCCGTATGTGTATCAACATATTTGCAACATGATAACAGCAAGCAAAGCAGCAGGAAC 

ACAAATTGTCATCCACAATAAACAGCATGGAAAACAACAGCACAAAAACAGAAAAAAAAAACAACTATCCAAGCAAAATGTTTGAAAATTGGGCG 

GCCTGGTGGGCGTGGCGCGGCTGTTGCTGCGGCCCAGAATCGAATCGCAACATAAAAAATTCGCGGTACTATTAAGTTGGCGTCGCTTCGCAATT 

TGGCTCCTTAACGTTAGAAAACTGCAACATCAAGTCTACTACATTTGAATTTTTTGCATTCGAGAAACCAAGCATTTTATTATATTTGAATAAAT 

ATTATATTTAATTATGAGAAGCAAAAACGAAACAAATAAAATTCATTTAAACCACAAAACACATTAAAGCACAAGCAGAAAAAGAACGAAAGTCG 

AAACACTAAACGAAAATTGAGAAACAATCACACGAGAGTAAAT C AC AAAGC AAATTT CAT CGAGTTGGCATGC ATT AATTTTTAAAACAAACTTT 

AAATACGACAGCAACACGTAAAATACAACTTTAATTTTAAGCCAATAATAAACGTGAGAACGAGGTAATAAAAATGAAAAACGATTTAATAAGCG 

CAAACGTAAAAGTAAAACATAAAGTCGAAAATGTAAGGAAAAAATACAGCAATTAATTTAAATTTAATACATGAATAAAACGTAAATGGAGTAAA 

CGAAACGAAATGAGCAAACAGAATGAAAGACGCAAAAAGCTAAAAACTATATTAAATGAGA 
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ATTCTTAAGCTTAAATTTGTATGTATGTAAGTGTACTGTTCAATAACACCGACACACCCTCACAGACACACACACACACGCGCAGACTAACACAC 
ACCCACCACCCACACAGACACCCATATTACGCATACGAGCCGTTTGTTTTCTCTTAGCGTTTCCTTGCTCTCTCACACTCGCACTCATATTCACA 

CTCACACACCCATGCATGGTCAAATCACATGAAACGAAAC^ 

TrTCCATCGCAATCGCAAACAATTTTCTTCATCATCTCTTACCTTGCGCAACAAAGAAACATATTAATAATATAGAAAAGTATTTGTAGTTGCTA 
^TACAAGAAACAAAAAGAATACAAAACAAAACAACGGTACATATATATACGTATATATATATATATATATTGTATACATATATACTTGTATATT 
TTCTATGCCTACCACTTATGTAATTTGTAGAGATTTCGAACTTTTCGCCCCACCACCCAACTAACCATCTCTCCAGAGATTCCGATTCCAAAACT 

CCC 

(SEQ ID NO: 898) 

Exon: 1001.. 1213 
Exon: 5737.. 6206 
Exon: 6275.. 6455 
Exon: 6525.. 7173 
Start ATG: 1150 

ccgccgcccLtacggtcacaccLt 

AGCATTTAAATAATTTTTTCTGTGGATTAATTTTACCGAAAAGGCTTCAACACAATGACTGAGCGCGAGAACAATGTGTACAAGGCAAAGCTGGC 
CGAACAGGCCGAGCGCTACGACGAAATGGTGGAGGCCATGAAGAAGGTCGCCTC^^^ 

CGGTGGCGTACAAGAATGTGATTGGAGCACGCCGTGCCTCGTGGCGCATCATCACCTCGATCGAACAGAAGGAGGAGAACAAGGGGGCCGAGGAG 
AAATTGGAGATGATCAAAACCTACCGCGGACAGGTGGAGAAGGAGCTGCGCGACATCTGCTCGGATATACTGAACGTGCTCGAGAAGCATCTCAT 
TCCATGCGCCACATCCGGCGAAAGCAAAGTATTCTACTATAAGATGAAGGGCGACTACCATCGCTACCTGGCCGAATTCGCCACCGGCTCCGACC 
GCAAGGATGCGGCAGAGAACTCGCTGATTGCCTACAAGGCGGCCAGCGATATTGCCATGAACGATCTGCCACCAACACACCCCATCCGTTTGGGC 
TTGGCATTGAACTTCTCGGTGTTCTACTATGAGATTCTCAACTCGCCGGACCGCGCTTGCCGCTTGGCGAAAGCCGCTTTCGATGATGCCATTGC 
CGAGTTGGATACACTGAGCGAAGAGAGCTACAAAGACTCGACACTCATCATGCAGCTGCTGCGCGACAACCTCACATTATGGACGTCCGATATGC 
AGGCAGAAGACCCCAACGCAGGTGATGGCGAGCCCAAAGAGCAAATCCAGGATGTTGAGGATCAGGACGTGTCGTAATTTAAGTAAACCCCCGCC 
CTTCCATTTAAAACACATTATATACCTTCTAAAGTAAATATAATACCATATAATATATATGCATACAACAACAACAACAAAATCAACAACAACAC 
G AC AAC AAC T T C AAG AAC AAC AAC AT GGAAC C AAT GC GGC AAC AAC AAC C AT AGC AGT C G AT C AC C AC AAC AG C AGC AC C C GT AT GT GT AT C AAC 
AT AT T T GC AAC AT G AT AAC AG C AAGC AAAGC AGC AGG AAC AC AAAT T GT C AT CC AC AAT AAAC AGC AT GGAAAAC AAC AG C AC AAAAAC AG AAAA 
AAAAAACAACTATCCAAGCAAAATGTTTGAAAATTGGGCGGCCTGGTGGGCGTGGCGCGGCTGTTGCTGCGGCCCAGAATCGAATCGCAACATAA 

AAAAT T C GC GGT AC T AT T AAGT TGGCGTCGCTTC G C AATT T GGC T C C T T AACGT T AG AAAAC T GC AAC AT C AAGT CT ACT AC AT T T G AAT T TT T T 
GC AT T CG AG AAAC C AAGC AT T T T ATT AT AT T T G AAT AAAT AT T AT AT T T AAT TAT GAGAAGC AAAAAC G AAAC AAAT AAAAT T C AT T T 
(SEQ ID NO: 899) 

Start ATG: 150 

MTERENNVYKAKLAEQAERYDEMVE AMKKVASMDVELTVEERNLLS VAYKNVI GARRASWRI I T S I EQKEENKGAEEKLEMI KT YRGQVEKELRD 
ICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDR 

ACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEDPNAGDGEPKEQIQDVEDQDVS* 
(SEQ ID NO: 900) 

Name: diacylglycerol-activated/phosholipid dependent protein kinase C inhibitor 
Classification: enzyme_inhibitor 
Gene Symbol: 14-3-3epsilon 
FlyBase ID: FBgn0020238 

Celera Sequence No. : 142000013384504 _ mm 
CCAGCGACCTATGGATTTCTATTGCATAATTTTCCAACTACAGTCCACCGCTCTACCAACTGAGCTATCGAAGGTTGAGTTCAAACTTGGCCGTT 

GTTCGTTTTATGCATGCGCACAAACGAAATGCACCGTATTTGGTTATTTATTTTTGATTTTTACTTATTTAGGTCAAACAAAATTGATCAACTTT 

CTCTATTCAGTCCATTGCAGTAAGCCTATAGTTTCCAAGCCTAATAGCCTAGGCGCTTTTCACAAACTATATCAATTTAGTTACGGTTTATTTCT 

TCCAAATAAAATCCTTCTTATATTATTTAGTTTAAACAGCTCAGTTATATTCTGAATGTCTGTAGATGAATTAAATTGATTTAAATTGTGCACTA 

CAAAATATATAAAAAAAAAGGTATGTGCACTACAAAATATATAAAAAAAAGGTATTTATATCTTTTCGATCCTTCGAGCCGGATTTGAACCAGCG 

ACCTATGGATTTCTATTGCATGTTTTTCCAACTACAGTCCACCGCTCTACCAACTGAGCTATCGAAGGTTAAATTCGCTTGCGGCTAATTCACCT 

TTTGTTCAGGAGCGAGGTGGGGAAAGTGACAAAAAGAGAGGGCACTTCTCTTAAACACCGCTGCTCTTTAATACTCGCGGCCAGAAAAATAGGAC 

CTT AAAT TCTTGTAAACTTATATT AAAT ACT AAAGC ATTGTAAATCTTAATGAGAAAAT ATT CAGGT AC AACTGTTCATGGCTT AAT TTTAGACC 

AAAATTGTGCATGAATTTAGAAAGGCAGCAATGCACTATTTGAAACTAACTAGTTACCCACCCACAGCTGTGCACCGATAAAGGGGTGGTGTGCG 

AGCAAAGATGCCATATTTATACATATATCCATCTCGTTCCAGCGCCGACGATAGACGATACCGATACTTTGCGAGTCCGCTCCCAAGATTGAGCC 

CCCACAGCGTAAAAGCGACCGAAGTGCCGGCGACGAGAGCAGTTCGTTCCCAGAAGCCGGCAGCAGTGGTCCGTTGATAACTCCGACTCGAGCTA 

CTTATCCACCACGTCGATACTCTTGTCTGTCTATTGCAAACATAGAGGAGTCACCCTTTGGCTAACGAATCCGAAACTCCCGAGTTTTGTTGTGC 

AACATATCAAAAACCGCATTTGTTCGGATCCAATTTACATTCGCATTTCAAAGATGTCGTACCACAAGCCCGAAGCTAAGACCGTCCAGGATCTG 

AAAGATCTGGCCCAGAAACTGCGAATTCACTCTATCAACGCCACCCAGGCCTCCAAATCGGGGTAAGTTTATCAATAACTGAAAGTCCATAGGCA 

TATTAAAGTGTGCGTATTTAGATTTGTTTGTGTAGATGTGAATTTTAAAAGTTGTGCACCTTGCACTTGTTGCTTCAAAGGTGACCCCCTCCATT 

TCCAGGGTGGGTGCGAAAAAGTTTTTGAGGGGGCTGCTAAATACATACATACATATGTATGTGTTTTGAAACATTACAAAAGAAAATAAAACAAA 

AATAGTTACGCATATACATAGTTATGCATCTATGTTTGTATGAATGTGAGAAGTAAAATATGGGAAGACCGGTTCCTTTAGTGACCCCTCCTGGG 

ATTCAGTTGCCAACCCACGTGTTGTCCAGTAGCTTTAGTGAATTTGCGAATTTGTTGCCCTGCCTGCGTGCTCAGTGATAAGCCGCGTTCACCTA 

CCTTAAATTGTGAGCTAATATTTTTTGACCTTAGACAACCTTGAAACGTTAGGTTTTTCTTACTGCCTTTTAATTTCTTTTGCTTTTGTTTTGTG 

CGTGGCGTTCAAAACGCCGAAAGCACAT AC AT ATGTACATGGGGT AAAAAC AAAC AT ACT TACATATC AAAC AGAGGCATTT AAT AT ACGTATGT 

ACATATGTACATACAAGTGTTGGAAAGCACAGCTTGTGACGGGAGTGGAATTGGAAAGAGAGGGATGAGAATAGTAATGGAATGATCCCCTACAC 

TTGAGATTTACGACTGTATGGCTTTTAATGTAATTATATCTCGGATAGACGCGATCAAATTATTACTGCAACAGTAAATAAAAAAGTTACTATGC 

TGATAAGGAAAACACTTTTATCGCCTCCATTAGTTTTTCCATTCTGTTTCGTTTTTTTTTTTTTTTGTGACGGTTCAAGATAAGATCTGCTCTGT 
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TTGCTCTAAATGTGACCTCTGCCGACGAGATAAGCCCGGTTTGGTTACTCftTTGACCACGATATGACCGATTAGAGGTGACCAGCTCAAGTGTGT 

TCCCCGCATGATAATCGAAAATCACTGAAAACTGATAAAAAAAAAAAAAGCGCGCCGGGTCACTGCTAACACTCCGTATCACCTTCAATAGTTCT 

TGATATGGCTGCAGGAAACGGGGACGCACTTCGGAACTATCTAGAAGCACGTGCGTTCACAAAGATCGCCATGCAACACAATCGTGGAAGCCAAG 

AATTTGGTTTTTAAGGCTATTATTTAGGTTTTTTGGTGATCAGAAAAGTTCAAGGTCCCAGAAACCGGTGAGGAATTGCGATGGACTCACGTCGC 

GACAAACACACATATCACACCATATAACGCCAATAAATACATATGCATGTACTCGTTATAGTTTATTTATATGTCTGGGGTAAGACGAGCCCCCC 

ATTTTCTCTGGGGTTGCGTGTTTTCTTTGCGGTTGGAGTTGTGCCAGATCTGCGATTCAAAGCAATTTTTTTTGGAACACGATCGCATGG 

TTGAATTCCAAGAATGGATCTTATCTTATTTATATGTACATTGCAATTGGATACTCGAGAAATCTTCATTTGAAATATGAACAGAGGGGGGGGGG 

GGGGTTTAAATAAAAGCTTCAATAGACAAATGGATTTCTTTTATTAAACAACGTTTTTAACAAGTATGCACAGTGTTCAGTTCCACTTTTCCTTT 

GTTGATCGGCCTTCACTTACCCACCGTGAGGTCGACTTCTATAATTCATTAGGCACTTACCTTAACCTAGAACTACATGCAAACGAAATGCCTTG 

TCACAATCTAATATTTTGCCTACTTTTCCCTCTTTATAGCCATCCCACATCATGTGCCTCGATCGCTGAGATCATGTCCGTGCTGTTCTTCCAGC 

AGCTGCGCCTGAACCTGAAGCACCCACGCGATCCCTCCAGCGATCGCTTCATCCTGTCCAAAGGACACGCTGCTCCCATTCTGTATGCCGCCTGG 

GCTGAGGCTGGTCTCTTCCCCATCGCGGATCTGAACAACCTGCGCAAGATCGACAGCGATCTGGAGGGTCACCCGACGCCTCGTCTGAACTTCAT 

CGATGTGGGCACTGGATCCCTGGGACAGGGTGTCGCCGTCGGCGCTGGCATGGCCTATGTGGGCAAGAACTTCGACAAGGCCGACTACCGCACCT 

ACGTGGTGGTGGGTGATGGTGAGTCCGCCGAGGGTTCCATCTGGGAGTCGCTGCACTTTGCCGGCCACTATAAACTGGACAACCTTTGCGTGATC 

TTTGATGTGAACCGCTTGGGCCAGTCGGAGGCCACCTCCCTGCAGCATAAGCTGGATGTTTACCGTGATCGTCTGGAGGCCTTCGGCTTTAATGC 

CGTCGTTGTCGATGGTCATGATGTCGAGGAATTGAGCAAGGCTTTCCATTGTGCTGCAATCACCAAGAACAAGCCCACCGCCATCATTGCAAAGA 

CGTTCAAGGGCAGGGACTTCCCCAACATCGAGGATTTGGACAACTGGCACGGCAAGCCACTGGGCGACAAGGCCGCCGAAGTGGTGAAGCATCTG 

GAGGGACTGATTGTCAACAAGAACGTCAAGCTGACCCCTAAGCCGGTGCCCAAGACTGGTGCTGCTCCCGACGTGGACATCAACAACATTAAGTT 

GAGCTCGCCACCCGCCTACAAGCTGGGTGATTCCATTGCCACCCGTTTGGCCTATGGAACAGCTCTGGCCMGATTGGCCAGAACAACCTGCGTG 

TTGTTGCCCTAGACGGAGACACCAAGAACTCAACTTTCTCCGACAAGCTGAAGAACCTCGACCCGCAGCGCTACATTGAGTGCTTCATCGCTGAG 

CAGAATCTTGTGGGAGTGGCCGTTGGAGCCGCCTGCCGTCGCCGCACTGTGGCCTTTGTGTCCACCTTTGCCACCTTCTTCACCCGTGCCTTCGA 

TCAGATCCGTATGGGCGCCATCTCGCAGACGAACGTTAACTTTGTGGGCTCCCACTGCGGCTGCAGTATCGGTGAGGATGGTCCTTCGCAGATGG 

GTCTGGAGGACATTGCCATGTTCCGCACCATTCCGGGCAGCACCATCTTCTATCCCTCCGATGCAGTGAGCACGGAGCGTGCCGTCGAACTGGCC 

GCCAACACCAAGGGTGTCTGCTTCATCCGTACGTCCCGTCCCAACACCTGTGTGATCTACGACAACGAGGAACCCTTCACCATTGGCCGCGGCAA 

GGTGGTGCGCCAAAAGAGCTCCGACGAGGTGCTTCTGATCGGAGCTGGAATCACTTTGTACGAGTGCCTGGCTGCCGCCGATCAGCTGGAGAAGA 

ACTGCATCACCGTCCGTGTGATCGATCCGTTCACCGTGAAGCCATTGGACGCCGAGCTGATCATCGAGCACGGCAAGCAGTGCGGAGGACGCGTA 

GTCGTCGTGGAGGATCACTACCAGTAAGTGTTCTTTGTAGTCTTTTTATTTGCCTCCTTTGCTAACTTGAAATTTACTTAATTTCTTATAGGCAA 

ggaggcctgggtgaIgctgtgctgagtgccctgg^ 

TCCGTCGGTACTCATCGATATGTTCGGCATCTCCGCACGTCACGTCGTCAACGCCGTTAACGAGATCCTCAAGGATTAGGGGACCAGTCGTGATC 
TGGCTACATTCATTTACCGCCTACAGGAATTCCCCCAGTTACTACTTACTTCCAATATTACCATTCCCTCTAAGATTGTTAATTGCTTCATTGAT 
TTGAACCTTACATCATGGAAATAAAATATCTTTAATAAAAATAGTTAAGCCGAACTATATACCTTATTTACATATTATATTACTTATTTACTATA 
TTACTTATTACATGTTAAGCATGCCTTTAAAATGGCTGGACGATTTCATTGCGTTTGCCTTTGGTTTGCTTTAGACTTGCTTTGGTTTGTTTTTA 
GTTTGCTGTGGTTTTGCTTTGGTTTATCTCTGGATTTGCTATGCGCTTGATGTTGATTTGACATAGATACTGAAATAAAGATAAGATGAGAAAAT 
GAATAAAATTATAAATTTTTTAAATTTAAGCATACTGCATACTTTTTTAATTTACAAATTTTTAATATTTTTCTAAAAAAAATCGGAGCTTAAAA 
CTAACAAGTTGGCAGTACCCTTCCAATTATAAAGGGTTATCGATAACTCTGCTGTAGCTATCGGTGCTGCCAGCTGATTATAGTGCGGCATGATT 
ATCTAAACATCAATTTGTTTCAGCCGATCCCGAAAACTATTCAAAATGAACTTAAGCACAAGAGTTATGCTGAATAGGTTGACACAGCAGTGTGT 
CTACAAGCGGATTGGTAAGGAGAGACGTTTTATCTTGTGCCACACATTCAGGAAACGTATTTTCAGTGACCTTCTCCACCAAAACTGCGGAACAT 
GTAATTGAAAATCAGGAGGAACAGAAGAAAGAGGCTCCGCCAACCACGCCGACTTCACCGGTGAACCGGAAGACAATCATTCCGGCCAATTATCG 
GTTTGTTTACCCAGAATTCCTGCCCGATCCGAAGGTGGAGTGGCGCAACCTCGTGCGCGAGAAGCTGGAGCGACTGGACATGCTGGATCGCCGCA 
AACAGATCGACCTGCCCGAGTTCTACGTTGGATCTGTGCTTGCGGTGACAAGTTCGGATCCCCATGCCGCTGGCAAAACCAGTCGATTTGTGGGT 
ATCTGTATCAATAGGGATCGCTGTGGACTGCGCGCCCGTTTCATTCTTCGCAACGTGATTGATCACCAGGGAATGGAGGTGGTCTACGAGCTGTA 

TGATCCTACCATCCTAAAGA 
(SEQ ID NO: 901) 

Exon: 1001.. 1297 
Exon: 3080.. 4678 
Exon: 4747.. 5100 
Start ATG: 1194 

CAGAAGCCGGCAGCAGTGGTC 

TCACCCTTTGGCTAACGAATCCGAAACTCCCGAGTTTTGTTGTGCAACATATCAAAAACCGCATTTGTTCGGATCCAATTTACATTCGCATTTCA 
AAGATGTCGTACCACAAGCCCGAAGCTAAGACCGTCCAGGATCTGAAAGATCTGGCCCAGAAACTGCGAATTCACTCTATCAACGCCACCCAGGC 
CTCCAAATCGGGCCATCCCACATCATGTGCCTCGATCGCTGAGATCATGTCCGTGCTGTTCTTCCAGCAGCTGCGCCTGAACCTGAAGCACCCAC 
GCGATCCCTCCAGCGATCGCTTCATCCTGTCCAAAGGACACGCTGCTCCCATTCTGTATGCCGCCTGGGCTGAGGCTGGTCTCTTCCCCATCGCG 
GATCTGAACAACCTGCGCAAGATCGACAGCGATCTGGAGGGTCACCCGACGCCTCGTCTGAACTTCATCGATGTGGGCACTGGATCCCTGGGACA 
GGGTGTCGCCGTCGGCGCTGGCATGGCCTATGTGGGCAAGAACTTCGACAAGGCCGACTACCGCACCTACGTGGTGGTGGGTGATGGTGAGTCCG 
CCGAGGGTTCCATCTGGGAGTCGCTGCACTTTGCCGGCCACTATAAACTGGACAACCTTTGCGTGATCTTTGATGTGAACCGCTTGGGCCAGTCG 
GAGGCCACCTCCCTGCAGCATAAGCTGGATGTTTACCGTGATCGTCTGGAGGCCTTCGGCTTTAATGCCGTCGTTGTCGATGGTCATGATGTCGA 
GGAATTGAGCAAGGCTTTCCATTGTGCTGCAATCACCAAGAACAAGCCCACCGCCATCATTGCAAAGACGTTCAAGGGCAGGGACTTCCCCAACA 
TCGAGGATTTGGACAACTGGCACGGCAAGCCACTGGGCGACAAGGCCGCCGAAGTGGTGAAGCATCTGGAGGGACTGATTGTCAACAAGAACGTC 
AAGCTGACCCCTAAGCCGGTGCCCAAGACTGGTGCTGCTCCCGACGTGGACATCAACAACATTAAGTTGAGCTCGCCACCCGCCTACAAGCTGGG 
TGATTCCATTGCCACCCGTTTGGCCTATGGAACAGCTCTGGCCAAGATTGGCCAGAACAACCTGCGTGTTGTTGCCCTAGACGGAGACACCAAGA 
ACTCAACTTTCTCCGACAAGCTGAAGAACCTCGACCCGCAGCGCTACATTGAGTGCTTCATCGCTGAGCAGAATCTTGTGGGAGTGGCCGTTGGA 
GCCGCCTGCCGTCGCCGCACTGTGGCCTTTGTGTCCACCTTTGCCACCTTCTTCACCCGTGCCTTCGATCAGATCCGTATGGGCGCCATCTCGCA 
GACGAACGTTAACTTTGTGGGCTCCCACTGCGGCTGCAGTATCGGTGAGGATGGTCCTTCGCAGATGGGTCTGGAGGACATTGCCATGTTCCGCA 
CCATTCCGGGCAGCACCATCTTCTATCCCTCCGATGCAGTGAGCACGGAGCGTGCCGTCGAACTGGCCGCCAACACCAAGGGTGTCTGCTTCATC 
CGTACGTCCCGTCCCAACACCTGTGTGATCTACGACAACGAGGAACCCTTCACCATTGGCCGCGGCAAGGTGGTGCGCCAAAAGAGCTCCGACGA 
GGTGCTTCTGATCGGAGCTGGAATCACTTTGTACGAGTGCCTGGCTGCCGCCGATCAGCTGGAGAAGAACTGCATCACCGTCCGTGTGATCGATC 
CGTTCACCGTGAAGCCATTGGACGCCGAGCTGATCATCGAGCACGGCAAGCAGTGCGGAGGACGCGTAGTCGTCGTGGAGGATCACTACCAGCAA 
GGAGGCCTGGGTGAAGCTGTGCTGAGTGCCCTGGCTGGCGAGCGCAACTTCGTTGTCAAGCACTTGTACGTGCCCACCGTGCCACGTTCGGGTCC 
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TCCGTCGGTACTCATCGATATGTTCGGCATCTCCGCACGTCACGTCGTCAACGCCGTTAACGAGATCCTCAAGGATTAGGGGACCAGTCGTGATC 
TGGCTACATTCATTTACCGCCTACAGGAATTCCCCCAGTTACTACTTACTTCCAATATTACCATTCCCTCTAAGATTGTTAATTGCTTCATTGAT 
TTGAACCTTACATCATGGAAATAAAATATCTTTAATAAAAATAGTTAAGCCGAACTATATACCTT 
(SEQ ID NO: 902) 

Start ATG: 194 

MSYHKPEAKTVQDIiKDLAQKLRIHSINATQASKSGHPTSCASIAEIMSVLFFQQLRLNLKHPRDPSSDRFILSKGHAAPILYAAWAEAGLFPIAD 

LNNLRKIDSDLEGHPTPRLNFIDVGTGSLGQGVAVGAGMAYVGKNFDKADYRTYWVGDGESAEGSIWESLHFAGHYKLDNLCVIFDVMRLGQSE 

ATSLQHKLDVYRDRLEAFGFNAVVVDGHDVEELSKAFHCAAITKNKPTAIIAKTFKGRDFPNIEDLDNWHGKPLGDKAAEWKHLEGLIVNKNVK 

LTPKPVPKTGAAPDVDINNIKLSSPPAYKLGDSIATRLAYGTALAKIGQNNLRVVALDGDTKNSTFSDKLKNLDPQRYIECFIAEQNLVGVAVGA 

ACRRRTVAFVSTFATFFTRAFDQIRMGAISQTNVNFVGSHCGCSIGEDGPSQMGLEDIAMFRTIPGSTIFYPSDAVSTERAVELAANTKGVCFIR 

TSRPNTCVIYDNEEPFTIGRGKVVRQKSSDEVLLIGAGITLYECLAAADQLEKNCITVRVIDPFTVKPLDAELIIEHGKQCGGRVVVVEDHYQQG 

GLGEAVLSALAGERNFVVKHLYVPTVPRSGPPSVLIDMFGISARHVVNAVNEILKD* 

(SEQ ID NO: 903) 

Classification: enzyme 



Celera Sequence No. : 142000013383969 

GTGCTGGAATTATATCCTGTTCCATATTTTCGTGATATTCAAAACATTCATCTCATATCTCTGCCGAAAGTTTGACTTGAAAAGAAGGAGCATTA 
TGTATGTACATACATAAGTAACGAGTGGGATGAGCTTAGGAAATTTTTATTAATTTGGCCTAGAGCTAAATGTTTTTTTTTTTTTCGAAACCAAG 
GCTAGAAAACATATGTAGTAGCAGTAGTTCAAGATCGTAATCGTATTAAAAACGCCATTCTAATTGGCCTAGAAACCGTATTCATAGACCAACGC 
CAAACGTTTCAGGTTTTAGGGAATTCCACCATCGTTTGCTCGCTTTAAATTAAAATGGCTAGTTCGGCGGCTGATACTCGTGAATATTGTGTTTT 
CCAATTAATCAAATCACACTTTGCCCCACGGTGGCGTTGTCGGTCGATGGCAAAATCAAATTCTAATCGTCGACGCTATCATAATGTATTTATGA 
GTTGTGTTATTTGTTTTTGGTTTTCGCGTACCGTGCGTATACGTAATGCGCTCAGTACGGAGTTATCTGTGTGTGCGCCGATGTCCGTGGGTGTG 
TGTGGTGCATATGTATGTATATACATACATATGCCCATATTTGCCGCTGATAATCGGGTCTGCGACGACAAAACTAATTACAACAACAGTAAAGC 
AAATACAGTGGTAACACGGAATAATGCCCTTATCAAAAAACGCGGCAAGGCGGCGCAACAACAAATGCAAACACAGGCGAAAAAGAAAGAAAAAC 
AAAAAAAAAAAAATTGGGAAGATGAGGCGGAGCGAGCAGGAGCAACGGTAAAGGTGAAGCTACAGCCGAAGCTAAAGACCAAAATAACACGCACA 
CAACGACAAACGGGAGAAGTGGAAACAAACGGCGAGCGGGCCATTGCATTGATGTGTGCCAGTGTGTGCGTTGTGTTTGGCTGTTTAATACTATA 
ATTCACAAAATTGTTCAGTTCAGTTCGCGCTGGTGAACGTAACGGAAAAGTCGTCGCCTGTTGAATATTTACATTAGTTCTATATATGTTTTTAT 
ACAAATATATATATTCTATTTACTTGAATTGTTTTTGTCGCCGCCGCCGCTCGTTTCGTTTTGAATTCTTCATCTTCTTCTTTTAATTTTTTTTT 
TTTGGTGCTTTTGTTTTTGTTATTGTTTTGTATCTCAACTTTACTGCAGCATAACAACCACAACAACAACAGACGACGAAGGAGAGGGGAAATCA 
TTTACAAAAATAAAACGTACGGCCCAAATACATAAATATATGTACATATATTTATATGTATATATATATAGTGTGTGTGTGAGCGTTAGCATTTG 
TATTTAAGTTTAGTAGCAATTAGCACTAGAGCAGCAAACTAAATGATATATTTTTTTTTACCCGAAGCGCGCGCTTTTCGCGAGACACGTGCGCC 
ATACTGAATAATATGTGGGGCGTATGTAACGCAAGTATCTGAAAGATACTTTCTCGAAATGGATGTGAACTTCAAAAGACGTTTTTCGAACACCT 
TCGCCATGTGATTACAACTCACCAAAATCACTTCTAATGGTGAAATTACCGGTAAAAACAATTTGATATCAGATAGTCAAATATTCTTGGAATTC 
CCAGTGTACATATTTTACGTGTGTCTTATCAAAGTTTGCGAGTTTCAAGTTTATGATTAACAAAAAATCAGATACCTATGGTGCAATACATCATT 
TGTAATGAAGGTCTTTTGGAAATCTAGTTATTGCAAGCAATTTCACCACTTCATTATTATATATTCAGATATATTTTATAAAAATAGTCGGTTTA 
TTGTGAAATGTCGACGACGACATGGGATGAAATAATTTTGAGCGTTACGTATCATTTCGATAGCGGTGCATACGTGCTGTGCTATTCTCCCGCCA 
ATCAATCAATCAATCGAATATCAATCAAAGTGGCTACTCCTCTCAAACGGAGCGATTTCGCCTCATAGATACAAATGTATGTACGTATGTATGTA 
GAGCCACAAACCGTGGCAAACAGTGGCTACCAAAAGGCAGTCTACCGGCCGTAAATGTTTTTACGGCATCATCTCCATTCCCGACATCTTTGTTA 
GGCCAATGCAAATACTCCTTTATGGTGTGTGAATATAATCATCATACCCCTTGCCTGATGGGTCGGAACGGAAAGTGAATGGTGGGAGGGGGGGG 
GGAGGGGGGGTAGTCATATAATGCGGAAAGCTCTAATTATACAAATGTTTGTGTTTGGCTTTAGACGGCTCTTGATCGCCTTAAACGACGCGTGT 
TAATCAAGCACTTGAGGCACCTGGAGTACTTAGATGGCTTTTCCGAAACGCTAATCTTAAATCTTGAATTTTGATACTAATGAATGTACATACAT 
ACATATGTGCGAATAAACCAAGTAAAAAACAAACATTTAGGAATGTTAATTATCCTCAAGTTTCTTTGTGGAAAGTGATAAAATCAGGAAACGAC 
AGTTAATCAGCGGTCAAAGTTTACACATCGAATCGATTCACAATTAGCTGATTATAAAAATTGTATGATTTCATTACGCAAAAGCACTACAGTGG 
TTTAAAGTATTATTATTTGATTATTTATTGTTTGAAAAACAATGAGTAGCAGCTGTTCATTCGATTATATTTGATGTGTGTCAACTAAATATGTA 
TATCTGTTTCATCTCTGTAGCCTTTCCAGTCATTGTGCAACAAATGTAATTATTCACATGAAGATAGACTTAATTGAATTACGTGGGTACAACAA 
AAA AG AT AC AAT T GAAAT AAT CTG AT T G GT T AAT CT GAT A AAC GAAT T T C AC T GG AT T T C C AAAC AAAT C T C GC AAC AAG AAC T AG TT T CG AAT C 
ATGATAATTCAATTATCCCATTCAGTTATGCATGCTCATAACTCACACCACAAGATGGCAATTAAATATGCATCTTAAAGTGCGTTCTTTTCTCT 
GCGTACTTTTTCCTTCTTAGGTGCAAAACCTTATCACTTATCACGTGTGTCTTGTCTTGCCGTTATGACCAATTTCTGCCATGTGCCTGCTGATG 
GATGGATGGATGATTGGCATTGAAATATTCCACAGTTGTGGTCATAATCATATCCACTCTTGCTTCGATAACGCAACGAGCTGATTTACTTGGAC 
TGAAAAAACTGAAAGCTGAGGTCAATTAGGCCTATTATATGCGGTTATATCGAGTACCGAAACACACTTCGATTGATAAGCACTCGTCGCAGAGT 
CAAAAGTCTGATCCTACCCACTGTGTGTTTGTTTGGCATGCATAGAAATGTGTATGTTGCTGATTGCCTGGCAATTAAAAAGCACTGTAAAATTG 
CTATATGAACAGCGTTTTACTGGCCAACTAACTTGTTCCGTGTGGCGCGACCTTTAGTACAAGCCGCATACCGCTTTCTTAGTGTATCCGCCTAC 
GAGTATAAGGCGAACTTTGCCAGCGAGTCTTATCACTTGCCCCCATATATTTTTGCAGTGGTCATTCGCGGATCGGGTTCGTTTGGGTTCGGTTC 
GGCTGGGTTCGGTTCGGTTCAGCTGCTGGAGCAGCTGTCGCCAGACCACTGGTGGCCATTCGGAGTGCGATAGAGCAGTGAGTAAGTGGGACAGT 
GGGTGCTACGAGAAAAACTCGAACACAATTCCATAAACATTGTTTGTTGCAATGATGTGCGGAGTGGAATTAAATATATAGTTTAGTTTAAAAGG 
AGCAATGAAGCGACATTGCAGTTGTAGTACAGTGATTCCGGCAATTGATTGTGGTTGTGTAAACCCTTGTGTGGCGGATATAACCGAGGTGCTAC 
AATCTAATTAGACGATATCGGCGCGGGCAAAGGAGTGCGCCTTATCACATGGTACCCCCACCCCCACCGCCACCGCCATATGATCTCATTTAAAG 
CCCCACATGCCGACAGGCCCTGATACATGTCATACATTCATACATATACATATGTATCTCGATGATTCGGTGCCTTTGTCGCCCAAGTTGGCTCA 
AGTATTTACGATGGACACTTCTGACTTCTGGTCAGCAAACTCTCGGATACGGGTTCCGAATTCGAGAAAAAAGCAAATAAAGTTCGTAACTTTTG 
CTCTCCGAAATGAAATCGCTGGCGGTCGCGATAAGGCGGCGCTCTCCCGACCAACCAACCAACCAAACAAACTGACTAACTAACCAACCAAACGA 
TCATCATCATAATCATCGCAAACGACAATCGAACCCATTGACGTTTTGATGAGACGTTGTTGGTTGGTGTTACCGTTTTTATATAATGGGTGCTA 
AATCGATACTATTTACATACCTGTCGCTTTTAGGCAGTCACGTTTCCAGGCCTCCAAAAATTTCACCCGCCCGCCCTCGAAAAAACTCAAAAATA 
TCAAATCGTTTAGCACACGATGAGTGAGAGTTGCCAAAAATATTACGCAGCCCGATTGATCAGTCAGTCGAGGTTGTGGCTTTCGTCCCGCCGCC 
TCGCAACATCTTGTTCCGATCTAAACAAAAAAAAGAAAAAAAAAATTAATAAAAGTCCAATCAATTGAGGCTGTAAACGGCTGTCTGGCTGCCTG 
TGTAAAAAGTTCAATTATTGCTTAGTTTTATTTCCTTTATTCTCTAGTTAATATAAAATTACATTTGAAATATTTTCATGAGAGCGAACACTTTA 
AAT GAAT AAGC AAT GAAT G T G C AAAAT G AAACT G AAC TCAGC ATG C CGAAAT AAAT AAAG T GAT GT T T AA T AC AAAAC T AAT T TAGAT T CT AAC G 
ATTCCACTAACGAAGTTTGCAATCTTCCCGCTTTTCAATGCACAGAATTGCAATCACTGGTTCCAGCATTTGATCGCACCCATCAATCCGATTCG 
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AGCAGTAATGCAACAACCGGTCAAGAGCCCGCAGCGCTAAACCAATCCAAGCCCCACGCTCCAAGTTCCAAGCCCACCTAATCCTAGTCCAGGAC 
GCCTAGCCGTATACACAGTTCCAAGATGACCGATAATGGGAACAATCATAACAACAACAACAACAACAACAACGATGAGGCACTCAGACTGAGGC 
TACCGGAAGAGCAGGCAGAGCAGCTGCTGCCCAAGAATGGATCAGCCGGCCAGACCAAGATAGCGCCACCTACGCATACTCATTCCATAGCCGCC 
GCTTCGGTGCCACAGAAGGTACCAGCGAAAAAAAAGCGAGACAAGAGCGATCTGGGTGAAGATTTTGTGGCGCCCGACGGCGGATGGGGCTGGCT 
GGTGTCCGTCGCCAGTGGCGTTAACATTGTGAGTGACCATCGCATTTACTCGCCTGTCTGCACATAGATATTCACTCATGCCTCTTTTTCTTCAT 
TCCCATTCCCATATCCATTCTCCACGCCAGCTAGTGACGTTCGCATTGGCCCAGCAGTTTGGCATCCTTTTCCGCGATCGCATGGCTGGTCTAGG 
AATCTCCAGCTCCGAACTGACCACCATTATCAACACACAGATTGCTGTATCCGCATTTACAGGTGAGACGATGAAAACCATATATACGAGATTAT 
AGAGTGATAGATACAGAGAGATATTCAAAGCCATGAGCAGCCGTAACTCACATTTGTATGTAGTCCACGTAGTGACGAATCGCAACGAGGGTGTG 
ACTATAAGATTTATAATATCGCACTATATCACAATTTTTTTACGATTCTCTGATGAAAAAAATCGATAGTATTTTAAATTGAGTAAGTTTTGAAT 

AT T AAAT AT AAAT AGT T C AC AT TAT G A AGC C AAAAT ACT AC T T C T T C AC AAAT T T AAAC T AC G T G AC AAC AAT T AAAT AT T AAT T T C T T AAT C AA 
AGGTCTGCTGAACGGTCCGCTTTTCCGACGCTACACATATCGAGTGGTGGCCCTGGTGGGATCCGTGCTGACCTTTCTGGGTCTCTTCTGGATGG 
TGTTCGCCGACACCTTCGCCGTCTACATAATGAGCTTCTCGATCTTGTACGGATTCGGACGAGGACTGACGGTATCCGCATCCTCGCTGGCGGTC 
AATACATACTTCAAGGTGAAGCGACGCACTGCCACAGCTTTTCAGTTCGGAGTAGCCGGCCTGGGACCAATTGTGTGTCCCTACTTTGCCACCTA 
CATGCTGTACGAGTTTGGTGTCCAGGGGACGACACTACTCTTTGCCGGCGCCTCACTGCACACCATCGCCTGCTCGCTGATCTACCAGCCAGTCA 
AATGGCACGTGGTGAAGCGAGATCGCGATGCGGAGGCACTGCAGCAGGTCCAGCCACTGGCAGAGCGTGAGGATCAGGACGAGGATATTATCAAG 
AATGTTGTAGAGCCGGAAACCCCGGTGCTGCCACGTGCCAATGACGGTTGGTTTGGCTCGCGGGCCTCGTTGAACAGCGTGGGCACTCGGAATCG 
ACTGAACACCTGGGAGAAATCGGACAGCAATGGCCACGTGGAGCTGAAGAGGCTAAGCAGTAAGGACTCAAATCCGGGACGCCAGCTGCGCAGCA 
TATCCGTGAGTCACAGCATTAAGGAGGAGGAGGCCACTGGCTACAATTCTGACCACGAAGTGCAACACACCGAGCTAACGGAGAAGGAGAAGCAA 
GAGCTGGAGGATGAGAAGGAGCGCCAGCGCCGCAAGAAGCTGCCATTCTACATGAAAGTGGTGATATTCTTTGATATGGATCTGCTGAGGGATAT 
CACGTATGTCAACCTGGCTGTGGGCATCACACTGATCAACTTTGTGGAGATCAACTTTGCCATCCTGACACCCTTCATCCTAAGCGATTTGGGTT 
TCAATAAGGATCAGATTGCCCTGGCCATGTCCACGCTGGGATTCTTCGATCTGGTTGTTCGATTCCTTATACCCCTGATCACGGCCAAAATCAAT 
CTAAGCAACCGTACCTTCTTCGTTGTGGGCATTTTGGGCATGTGCATTGGCCGCATGTTCCTCTCGATGACCTCCAATTTCTACGTCATGATGGC 
CATCTTCCTGTGGCTGGGCCTTAACAAGGCCTTCCGCACGGTCTTCTGGTCCCTGATCATACCCAGTTATGTGCCCCTGAAGAGATTGCCGGCGG 
CAGCTGGGCTGCAGCTGCTGATGTCCGGCACCTTTTCGATGATCTTCGGACCGCTGATTGGTGAGCAATAGATATACATAGACCAATATAATTGC 
AATTTCTAATCTTCTAATTGTGTAACCCAACGATCATCCAGGCTTAATCCGGGATCACACCAGCTATGCGGTGACCCTCAACCTGCTGAATGCCC 
TTTGCGTGATGGCCTTTGCCGGCTGGTATCTGGAAGATTTCATCCGCGCGCGCAGCCGGAAATCGCCACCGGTCAAGTCAATCAACTGATCAACA 
TACGATGGCGTAGTTTTACTCAGTGCCAGCGAATTCCAGGACCAACACCGTTCCCCCATCCATCCTAACTTTAGCAAAATGTTATTCTTGTTATC 
GCATTTGCCCAGTGCTTTTTTTTTTATCTGTATGTTTATGTAAACTATATTCAGCGTGTACATTGTGTATATACTTTTCGAGTGTAGCAAATTGT 
ATTAGATATTGATAATTGTTAAATAATATGCGTAAGCTGCCAATACTTAAACCTTAAAACAAGCGTTTCATTGAATGTTATCTTCGAATGTTATA 
CCTAAAGTTATCTAAGATCTATGCGACAATTTATAGCGTTTGAAATATTCTAAACCATCGGACAACCACAGACTGTCAATTTTTCTTGTGGTGCG 
ATTCGTTACTACGTGGCCCTGGTAAACGATCAGTCTCAACATTATATGATCCAATCTCTTGTGGCGCAGCAATAAAGTCGAATTAAAATCTGTTT 
GTCCATCCGTCTGACAGCATCCAAAGTTCAACATCTAACAAAAATCTTATGACACAACGGAAGAAAGTCCAACTGAATTGGAAGGAGCACGTGAA 

TAT C AC GAG T CT AC T CAT CAT AAC AT C C AAAT C AAT G ATC AAT GAC AC T GAG AAT T GAT G CGG AAAAAAAAAT T G AAC AAAC AC T GGT GGC AAG T 

TGACTTCTGAGAATTTATATTTATACCCGTTACTTGTAGAGTAAAAGGGTATACTAGATTAGTTGTAAAGTATGTAACAGGCAGAAGGAAGCGTT 

TCCGACCATATAAAGTATATATATTCTTGATCTGGATCAAATTCTAGAAGGTTGAGATTCAACATACTGATTCTAGAAACAAAGACGCAGCGCAA 

GTTTGTTGACAAACCGTACAAAACTGCCGCGCCCACACTTTTGAAAAATGAGTTGATATCATTTCACATTTTTATAAGTCTCGTAAATTTCTATC 

GATTTGCCAAATAAAACTTTTTGCCACGCCCACTCTAACGCCCTAGGCCGTTGAACCGTTCACGTCCACACTTTGAAACAATTTGTATATTTTTT 

CTCATTCTATCTCATTATACTGAGTAGGCTGAGCACTTGACTAATGAACATCGGAAGCTCGGAAATAAATAAATAAATAAATACCTAGATATAGA 

TATCCAGATATAACGGCGTGAAAGGTTAAATCAAACCTTTAAAACATATTAGATTTGATTGAAACTGTTATCAATGCAAAACACCCACACTGAAT 

GGATTTGTAAAATTATTTGACCAAACAATTAAATTTGCAAAATTTGACGCGTGCTAAAGAATAAACCAGTTTTGGTCGATGACAACAAAACATAC 

AACAATGAATGGGTTAAGAACCGGGAATCAAAATGATATGTAACTCCGCGCCAGCGGAAAGTTACAGGCTCAAAAAAAAGTGTTTAAAAACAATC 

TCATAACGAGGCGATCAGGTTGAATGCACCGCGGTGCTTATCAAATAGATCGAGAGGAAACATACTGAAAACATATGCAAAAATCGGGCAGGAAA 

CTCGGGCCAAGTTCACTGTCTCTACCAAGTCAATGGCAAATAAATTTGTCGCAGTCGGTTTTTGTAACCACGAGTATTGCTGTCGCCATCAGCGG 

AACTAAATGGAAAAAAAATCTGAGTGAATCAAACCAAATCGACCAGCTTGGGCATACCGAACACTCTTTTCATTTCAAAAAAGAAAAAAACCAAT 

TAAATCCAAATTAACTTATTATATATTGTATGTATCTAAACATATAGTGTAAGTACTAACAATTTTTTAAGGTGTTTCGTGTGGGTCAGTGATCC 

ATTGCGCGCATAGTGGGTGGTAATCCATCTTTCGCACAACTGAACCTTCGCCCTACTAAATCCTCTTTTCATCAGAGATAAACGAAGGCACACAT 

ATAAAGCCACCCACTTGTGGGCATCCCACCCACAGAGAGTAGGTGGCGTAATCAACTGAGCAACAAGTGGAAACAGGAAACAGGACTCAGCAGCT 

GTGCTGCCCATCTAGCCTTTTTATAGCTACAATTTCAATACCATATATGTTGGTGTACGATTTGTTATTATTT AAAAT AAAAATTTTTTTTTTAA 

ATATAAGCTCAATAATTCATTTTGGTAATGTCAAGTTTGACCCTAAAATTGTGCGAAATTAGATTTTACACATATTTTTTTGCCAATTATAGAAA 

ACGGGCCTGTTTTCTAGAAACGTTTAAAAAAATTCGAGTTTATTTTGCAAGTCCTAACAAAATAACTCACTGCACGGATGTGTGGCGCCAGGACT 

GCCTAGTTTGTTCGAGATCGGGTCAATAGACTGTTAGCAGGTGTGTAACCAATAGGGGTGGTTTAGTGGCAACATCGTACAGCAAATCAACAATC 

TGTTGCCGGCAGTTTCAGCTTCAGGTACTCGCATGCAAATGCCACGTGGTTGTGTGCGGTGTCATTGCCCCACTTCTCAACTATCCCCACAGGAA 

CTCCAGCAATTATCGATTATTTGTCAATCTTTTGGGACACCGCCGTGACCTTAGATATACAGTTTAATATCCTTAATTAAAAATAAATACCAATG 

TTGGATAAAAATGTACACATGACATACATGATTTGCAAACAAAAAGAACTAACAAACAAAATCCAGATACAATATTTAAAATCTCCTGATCTCAT 

TTACGAGGGCGCACTGTTATCACTCTTTAACAAAGACGCAACAAGTTAACAGAAGAGAAAAGAGTGAGCGAGAGTAAAGAGCGAAGAGCTGAAGA 

GAGAGCTGGCTCATTTGCGCAACTGCAGCAGCGTCAAAAGGCCAAGTGAAGAACGTAAAGAGGAGAACGAGAGGAGAGGAAGAGCCTGCGAATGC 

AGC GC AAC AA C T G C AG AC AAAG AAC G AAT GT AAAC AAAG AAGGAAGG CGCATTGT T TGT AG AT T T T G GC AAGC G AAGT AAG CG ACT T C T C T C AC T 
CCACGTTTTATTTCTCTTTCGATCGGCATTAGAACGCGATCTTGAAATACTGCAACACCGACACACACACACACACACACACACACACACACACA 
CACACACACACACACATGGACATGGGGACTCTCTCTGAATGAAACGACACTGTGGCACAGTGGGCCCAAATGTAACCACTCTACTCCAATTTCAT 
TTGAACCAGTTCCGATTTTCATTGTTTAGTAAGAGATCGGCAGGCTAAATATATCTAGTATCAAAAAACCGATAGTAATTCATAAGTTAATAATA 
ATTAGTAATTCCCACCTTGCAGCCTTCTAATTTTTCGCGTTCTTTACAAAAGTGTGATCAAAAGTTCTAGAAATCTGGATCACCAATACCCATTG 
CTAGCTAGTTAACCAATTAATTTGGCCCACTGTGCGACGACGGTAACGGCGACTGAGACAGCGGCAGCGACGGGACGGCAAACGGCGACGGCAGT 
CAGTGTTCAGTATTCAGTATCGATTGAACAGCCGCGCGTAGCTCATCGAAATCGAGGAAGGAAGAGACGCGACGAGAGAGCGAGCAAGACGGACA 
GGCGACCAGACAGAGACAGCGACAGCAGGGAAAGGGAGACCAAGACGGAGACCGAGAAGGAGACATTGAGCGCCTGCCGCCCCCTCTTTTGTAAT 
AAATAGCGCTGTGCGAAGCGACCACACTACCAAACGGCAAAAGTCCGCTGAGAGCGACGCAACGCAGGCCACAAAGGCCCGCCCCCCTCCCCCCT 
GCGATAGACACAACAGAACAGGAACAACTATTGCAACAACACTATCACACACACACACACACACACACACACCCGCAGTCAACAACACCCCCCTG 
AACGAAAGGGAGAACAGTGAAACAGCTGGAACAAGAAACCAGAAAACAAAACAGGCTGACAAAACAAATGTCAAAATTAGCGACTGACCAAAGGG 
AATTTTAATCAAAAGTTAATACGAAAAAACCAACAAACCACCACGCGTTTCTCCACCAAGTGCCTTAAAAGAAAAAAAAAAAACAGAAAGAACAT 
AACAAACAAACACCGATTCAAATGAATTTAAATAACAAATTCCAATGTGATAAAGCTGAAATAATCGTAAAAACCTAACAAACAAGAGTTCACGT 
CTGTTCTTTTTTTTCGAATTTTTTTCGTAAGTGTGTTGTCTAAAAATCAAAGAAATGGAAAAATATAATTAGAACATCAGAAACAACAAACCCCA 
CGGAGACGGCGGGCATAAATTAAGAAAATCGAAAAACACAAAGCCACAACGTAAAAAAAGTATCCATAAATCTATATGTGTGCTACACCTCATAT 
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AT AG AG AC C T A AAAT AAAAC C AAAAC AAAAAC AAAG C AAAT C C AGG AAAAGC AG AC AAC C G AGG A G AAC TC AGC AC AGC AC AGC AC AGC AC AAC A 

AAGCAGAAATGGCAAGCAAACAACCCAGAAAAGTGGCGCCCGACGGTGGATGGGGATGGGTCGCTTGCTTTGGCGTTAGTTTGGTCAACGTAAGT 

TCAAATTTCCAGTTCTTAGTTGCCAGTTCACTGATTAATACTAAATAGATTCACCCAACGACACAACATTCCAATGCCAATTAGCTAATACGTAT 

TGGCCAAGCTTTTATCAAGCAATTCCCTTCGGCCATCTCACTTACTTTCCCCATTTGCACTTAGGATAGGTTAGTTCCAAAAGCCCATGGCATTT 

CATAGTTGTATATTTTTATATGGCTCGTGGTAAACGTGTTTTCACATCGACGATAAGATTCAAATGAAATATGACGAAACTCAATATGCTTCTGG 

TCCTACCCGCATTCTCACATGTTTGTACACGGATTTATGATGGCTACCTGTAATATTCGTATGTTTGGTGTAGGGGCGGCATTTAGACCATTTTC 

ATTAATAATAAATCGTCAAATGGACGTGTTTTGGTCAGCAAAAAGGGGAAGAGGTTATTGGAAGGGCGATGTTGGTGTTTATCAAATTTCAAACG 

CCCTTAAAATCACTGCATATC 

(SEQ ID NO: 904) 

Exon: 1001.. 1251 
Exon: 4796. .5253 
Exon: 5351.. 5477 
Exon: 5798.. 7090 
Exon: 10702.. 11276 
Start ATG: 4966 

Transcript No. : CT24124 

TCGTCGCCTGTTGAATATTTACATTAGTTCTATATATGTTTTTATACAAATATATATATTCTATTTACTTGAATTGTTTTTGTCGCCGCCGCCGC 

TCGTTTCGTTTTGAATTCTTCATCTTCTTCTTTTAATTTTTTTTTTTTGGTGCTTTTGTTTTTGTTATTGTTTTGTATCTCAACTTTACTGCAGC 

ATAACAACCACAACAACAACAGACGACGAAGGAGAGGGGAAATCATTTACAAAAATAAAACAATTGCAATCACTGGTTCCAGCATTTGATCGCAC 

CCATCAATCCGATTCGAGCAGTAATGCAACAACCGGTCAAGAGCCCGCAGCGCTAAACCAATCCAAGCCCCACGCTCCAAGTTCCAAGCCCACCT 

AATCCTAGTCCAGGACGCCTAGCCGTATACACAGTTCCAAGATGACCGATAATGGGAACAATCATAACAACAACAACAACAACAACAACGATGAG 

GCACTCAGACTGAGGCTACCGGAAGAGCAGGCAGAGCAGCTGCTGCCCAAGAATGGATCAGCCGGCCAGACCAAGATAGCGCCACCTACGCATAC 

TCATTCCATAGCCGCCGCTTCGGTGCCACAGAAGGTACCAGCGAAAAAAAAGCGAGACAAGAGCGATCTGGGTGAAGATTTTGTGGCGCCCGACG 

GCGGATGGGGCTGGCTGGTGTCCGTCGCCAGTGGCGTTAACATTCTAGTGACGTTCGCATTGGCCCAGCAGTTTGGCATCCTTTTCCGCGATCGC 

ATGGCTGGTCTAGGAATCTCCAGCTCCGAACTGACCACCATTATCAACACACAGATTGCTGTATCCGCATTTACAGGTCTGCTGAACGGTCCGCT 

TTTCCGACGCTACACATATCGAGTGGTGGCCCTGGTGGGATCCGTGCTGACCTTTCTGGGTCTCTTCTGGATGGTGTTCGCCGACACCTTCGCCG 

TCTACATAATGAGCTTCTCGATCTTGTACGGATTCGGACGAGGACTGACGGTATCCGCATCCTCGCTGGCGGTCAATACATACTTCAAGGTGAAG 

CGACGCACTGCCACAGCTTTTCAGTTCGGAGTAGCCGGCCTGGGACCAATTGTGTGTCCCTACTTTGCCACCTACATGCTGTACGAGTTTGGTGT 

CCAGGGGACGACACTACTCTTTGCCGGCGCCTCACTGCACACCATCGCCTGCTCGCTGATCTACCAGCCAGTCAAATGGCACGTGGTGAAGCGAG 

ATCGCGATGCGGAGGCACTGCAGCAGGTCCAGCCACTGGCAGAGCGTGAGGATCAGGACGAGGATATTATCAAGAATGTTGTAGAGCCGGAAACC 

CCGGTGCTGCCACGTGCCAATGACGGTTGGTTTGGCTCGCGGGCCTCGTTGAACAGCGTGGGCACTCGGAATCGACTGAACACCTGGGAGAAATC 

GGACAGCAATGGCCACGTGGAGCTGAAGAGGCTAAGCAGTAAGGACTCAAATCCGGGACGCCAGCTGCGCAGCATATCCGTGAGTCACAGCATTA 

AGGAGGAGGAGGCCACTGGCTACAATTCTGACCACGAAGTGCAACACACCGAGCTAACGGAGAAGGAGAAGCAAGAGCTGGAGGATGAGAAGGAG 

CGCCAGCGCCGCAAGAAGCTGCCATTCTACATGAAAGTGGTGATATTCTTTGATATGGATCTGCTGAGGGATATCACGTATGTCAACCTGGCTGT 

GGGCATCACACTGATCAACTTTGTGGAGATCAACTTTGCCATCCTGACACCCTTCATCCTAAGCGATTTGGGTTTCAATAAGGATCAGATTGCCC 

TGGCCATGTCCACGCTGGGATTCTTCGATCTGGTTGTTCGATTCCTTATACCCCTGATCACGGCCAAAATCAATCTAAGCAACCGTACCTTCTTC 

GTTGTGGGCATTTTGGGCATGTGCATTGGCCGCATGTTCCTCTCGATGACCTCCAATTTCTACGTCATGATGGCCATCTTCCTGTGGCTGGGCCT 

TAACAAGGCCTTCCGCACGGTCTTCTGGTCCCTGATCATACCCAGTTATGTGCCCCTGAAGAGATTGCCGGCGGCAGCTGGGCTGCAGCTGCTGA 

TGTCCGGCACCTTTTCGATGATCTTCGGACCGCTGATTGCGGCAGCGACGGGACGGCAAACGGCGACGGCAGTCAGTGTTCAGTATTCAGTATCG 

ATTGAACAGCCGCGCGTAGCTCATCGAAATCGAGGAAGGAAGAGACGCGACGAGAGAGCGAGCAAGACGGACAGGCGACCAGACAGAGACAGCGA 

CAGCAGGGAAAGGGAGACCAAGACGGAGACCGAGAAGGAGACATTGAGCGCCTGCCGCCCCCTCTTTTGTAATAAATAGCGCTGTGCGAAGCGAC 

CACACTACCAAACGGCAAAAGTCCGCTGAGAGCGACGCAACGCAGGCCACAAAGGCCCGCCCCCCTCCCCCCTGCGATAGACACAACAGAACAGG 

AACAACTATTGCAACAACACTATCACACACACACACACACACACACACACCCGCAGTCAACAACACCCCCCTGAACGAAAGGGAGAACAGTGAAA 

CAGCTGGAACAAGAAACCAGAAAACAAAACAGGCTGACAAAACAAATGTCAAAATTAGCGACTGACCAAAGGGAATTTTAATCAAAAGT 

GAAAAAACCAACAAACCACCACGCGTTTCTCCACCAAGTGCCTT 

{SEQ ID NO: 905) 

Start ATG: 422 

MTDNGNNHNNNNNNNNDEALRLRLPEEQAEQLLPKKGSAGQTKIAPPTHTHSIAAASVPQKVPAKKKRDKSDLGEDFVAPDGGWGWLVSVASGVN 
ILVTFALAQQFGILFRDRMAGLGISSSELTTIINTQIAVSAFTGLLNGPLFRRYTYRVVALVGSVLTFLGLFWMVFADTFAVYIMSFSILYGFGR 
GLTVSASSLAVNTYFKVKRRTATAFQFGVAGLGPIVCPYFATYMLYEFGVQGTTLLFAGASLHTIACSLIYQPVKWHVVKRDRDAEALQQVQPLA 
EREDQDEDIIKNVVEPETPVLPRANDGWFGSRASLNSVGTRNRLNTWEKSDSNGHVELKRLSSKDSNPGRQLRSISVSHSIKEEEATGYNSDHEV 
QHTELTEKEKQELEDEKERQRRKKLPFYMKVVIFFDMDLLRDITYVNLAVGITLIKFVEINFAILTPFILSDLGFNKDQIALAMSTLGFFDLVVR 
FLI PLI TAKIKLSKRTFFVVGI LGMCIGRMFLSMTSNFYVMMAI FLWLGLNKAFRTVFWSLI I PS YVPLKRLPAAAGLQLLMSGTFSMI FGPLI A 
AATGRQTATAVSVQYSVSIEQPRVAHRNRGRKRRDERASKTDRRPDRDSDSRERETKTETEKETLSACRPLFCNK* 
(SEQ ID NO: 906) 

Name: monocarboxylate transporter-like 
Classification: transporter 



Celera Sequence No. : 142000013384832 

GACTGATAACACACAAAAAGAACACGCGTCTATTGTACAAAAAAGAGAAATCGATACGAAGAAACAATTTATTTACCTATTAGTGAGAGATTATT 
AATGGAAACTATCAGCGGAAACAATTCCCAGCCCCAAAAACTACTAAAGGAAACTGTAGCGAACATTTTACTGTATTGCTGTAAATAATCATGAA 
ACATTGGGCAAAGGACAGAGTTAGAAACCATTGCGGCCGAAACAATACACTGAATACAACCGAATTGAAGAACTTGCGAGTTCAAAAACTAGCTA 
AAGAAAACACAACACGAATATTATATTTCTATTAACTAGGCGAATGCAAAGCATCTTGGAGCATTGAACATTGAATTTGAATTCCAAATCGAAAC 
TGACTTAGCCACAATAAAATGCGCATAGCATATGAATGTACGTTGAGCGTAATGTTGAGTTTGAAATATATATTTTATGGTGTTTGGCCAAAAAC 
ACGAAAATAATT AGAACTC AAGGCGAGGACT CGAAATGC AAAAT T GCAACT T GAAAGCGGAGGTGC AAAG AAC AC ACT TT GT AAATGAAGAGC AT 
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AATTTAATTTATATTTATTCATGTATGATAACCGTAAAACACACAAATTATACAATATGCATACAGAATAAACTAAACTAACCAAAATCCCGCGT 
CTTCTTCTTTTTAATCAACACGTATCTGGACCTAACTTCCCGGGGATTTCCCTTCTGGACGTGTCCTTTGGAAAGATAAAAGAGGAAAGCTTAAA 
GATCGCTGTAAGGTGCTTGCACCGATTGAGAAGGCTCAATGCAACCGTTTTTATAGATGATGCATCCTATGGCCTGCGAAAATATTTAAGTTGTG 
CATTTTATTGTTTGATTATAGTAAATGACTTTTTAAAAAGTAACGTTAAATACCATTTCCAATGATTGGTATTTCTACAAACTGAAATGGTATAT 
AACGCCATTTTCTGCCAGGTCCGAAGGAATTCTTATCGTTTGTAGCACGGTCACACTGCTCAGTTATTTCAATTGTTTGTCCCGTTTGCTCGAGC 
ACTTGTTCCCAACGTAAGTCTAAAAAGACTTATTTTCCGAGATAATAAATCGCCGTGGCTGCGCATTATGTAAAGTGGAGTGGTTCCGTGCCGAT 
TTCGCTGCGTTGGGGCCGTTCCAAACATATGGAATCTAAACGCAGCGTATTTCACTCCTGCTCGTGTGTGTCTGTGTGTGTTTATGGTTGTAGXG 
GGGCTTCCGTGTCGCAAGTGGAAAACAAATGAAATTGAGTTCTCGCTTTGAGTCATATTCGAGTGACAAATAAAGCGCGTCATGCGTTGTCCATT 
GAATTACAGTTTAATTTGATTACAGTTTAATTTGGTACCGCCAAGACATGTACGTCCAACCACCGATCGATCTGTGAGGGCCTCAAATATAGACA 
TGGTGTACCCGCACAGGAATGAAGATGCGGGAGCATCTCTGGCGTGTTTGTGTGCTAATTAATTAATTGCTCATACTTGACCTGGACACTAAGCA 
GACGCTGTTGAAATCGACAATACATATAACACTTTGTGCTCGCGTAGATGAAATGGGCGAAAATAGTGCGTGCTTGGTTCAAGTTACGTGAGCCC 
CTCGCTTCGTGTGTGTTGGTAGGGAGCATTCAATGCGCGCGAGTGTGTGTGAGCAGCGGTGTGGGGATGGGCACCTCCTCTTTGCTTGTACCTAG 
GTGGAATAGTTTGGTACCCGTGACTTTCCCAGGAAAAGAGCATTTTATAAAGTTTCCCTATTCTTTTGTTTTGTAAATTACTGTGAACATTGGGT 
GATTTATTCAAAGTTGATTCAAAATAAATTACTAAGTACTAAGGAAACTTACAGAATAAAATATTTCTGAAACATCTCCTCGCGTTCGGAAATGT 
AGGAAGCATGACTCAATTTGTATTCGTGAAATCATGAACCCACAATGCAGTACCCGAGGTTCAAAGAACTAGTACCCTCCACAGCAGTGTTTAAG 
CAGGGTTTCTCTCAGTAGCGCAGTTCATTTGGCTTAATTTCTCTTTATTTACACTCGGTTCGCACAGATCAGGTTTTGTTCGCTGCTCGACGGGG 
CAGCTGCATTCTTTCTCGCTCCCACCCTGCCCCTGGATTCCCCTGCACCTGTTCGTTCTCGAGGTGCGAGCGAGAGGCCAAGAGGCAGAGGTAGA 
GCGGCACTTGCCACGCTTGACTTGCAGCAGGTCTGCAGGCGACGCGGAAATGCCACTAGAAGGTCCAGCTACAGCCGCCGATAGACACCAGTTCC 
AGTCCCGATAAGGAAGCGATAGACTGAAGTAGACAGCTAGCTGGAAAGACAGATAGCGACGGGGAGAGTGGATTTATTTATTATTCAAATTCTGA 
ATGGAATGACGATGTGATGGGCTCGTGCTCGTGTACGCGGCGACACTTTTTGCTGTCAATATGCTTCCTGCAAGTGGTGAGTAGACCGCGATCCG 
AATGATTACAATTCGAAATCTACAGCCGTTCCCCATAGATAACGATTATCGAGCGCCAGGTATTCGACTTCCTCGGCTACATGTGGGCGCCTATC 
CTAGTGAACTTTTTCCACATTTTGTTCATCATATTCGGGTTTTACGGCGCCTACCACTTTCGCGTTAAATACATCATCACCGTAAGTGCCCTGAT 
TACCATATCTCTGCGA'TAACCCCATTTTATTTTGTGTGCCCGCCTGCAGTATCTAATATGGAACTTCCTGTGGATCGGGTGGAATACCTTCCTCA 
TTTGCTTCTACTTAAATGTGGGGCAGTTGAACAGGGTGAGTTTCGTGGCTCCACAATAGACATATCTCTTCTTAACTAGAAGGAAATCTCTTCAC 
AGGACAGTGACCTGCTCAACCTGGGCACTGGCAGCGTCTCCTGGTTCGAGGCGAATGGGTACGGTTGCAAGCCCACCTATAACATGGCCGCGGAC 
GATACGTTCCGTCCCCAGCGGCCGGAGCGCGTGGAAGGCTGCCTGCTGGACTACCCGCTGGTGGAGATCACCCACTCGGGGGTACAGTGCGCTCT 
GGCGGTGAGTGATCTACTAGCTTAAGTTCCCTGAATTGTGTAAATTGTCTCGCCCACTCCCACAGCTGCTCGGCATACTCGGTGCGATTCTGATT 
AGTTGCATATTTCTTGACGAGGACGACAGATGTAAGTGGCGCTCTCTGAATGCGAATAAGAAACATGCTCGCCTGTCATAAACACCACTAACCGG 
GAACGGTCATTGCTAACCCCTCTCATCCCCATAAGATCTTTCACTTTCGGGTCTGTGCACTGAGGTGTATGTTTTCTTTTCTCACAAAACCTAAA 
ATAATACCCTATGTGTAAGCAATGTTTGTAAAGTTATCATCGAGAATTTGGCCAATGGCTTTATATTATTCTCGATTGTATAAGGCTCTCGATAA 
GAAGTCTCTTGCCTCCCTACCAAAAAACCCGCAACCTGGAATGCGACTGTACCCCCAGAAAATTATGGTTTCGATTTAAACATAAATTGTGTCCT 
ATATTAACCATTATGACTTGTCCATTTGAATTGTGTGCATCTATACACATGCCCTTTAAAACATTTTACTCACATACCACATAATCGTATTGTTA 
TTCATGTTATGTATATGTCATTTACTCATAATCAAGAAATCAGAAACATTGTATTTGCATATGGGCAGCGACACTGTGCAGCTCTAAACCATGTA 
AGACGACATTGAGATAGGTGCTCCGAAArGAATATAATTAATTGATATGAAAGTATGTAAATGTAAAAATGTAGACAAAATGAGCAAACAATATA 
AAAACATATTTATTTACACAAATAAGAAGCTAAATCAATTACTCTTGACTAGTTCTATAATTAGTAAAAAGCTTGTGGGTGCTGCACGGTGTCCT 
GCTGCCTCCGATGTTTTCCTCTGTTTATTGTTAAAGCATCTTAATCAATTTCTGGTTTTCAGTCGATTTCATGAACGGCGACGCGAAGAGTCCAC 
AGCATACCGTGGTGCACCCAATGTACGTGAGCTATACAAGTATACCCACAACATCCGCAAGCGCCACCATGCAATCAAATAAGCATCTGCAACTG 
CAGCACCAGCAGCCGCAGCAGAACTCACTGAAACTCTATCATCATCAGCAACAACAGCAACCCAAACTACACCACTTCAATAAGAACTACCAGTT 
GAGCGGCAGCAACAATAATACACTGAATAATAATCTCCACCAGCGTGCGCCTGCGCTGCTGCCGCCAAACACAACAAATAACCGCAGCGCATCAT 
TCCAAACTCAAAGTCACCCTTCAAACAACCATGTAACTCAACGCACAGGCGGAGAGGGGAGCAACTGCAGTAGTCTACGCCGCCATCGACAACAC 
CATTCCAAGGCACTCGTCAGCCCCAGTCCCATGTCACCTCAGACCACGCCGTCGCTTTCGTACGCTTCCCTGCAAAACTCGAGTCCCTACCTAGC 
CGGAAACTCCCTTAGCAACAGCAACTACAGCATATTCCAGAGCCCCGACTCGCTCCAGGGCAGTTCTCATTTCGCCCGTATTCATCACAAGCCCA 
AGCCGCCCAAATCTGATTATCCTGTTTCTGGAGAATTCAATCCGGGCGGGAATATATCCTCGCCTGTCCGTCCGCTCGATCGCCTGTCGCGCAGT 
CTCGAGGACGACGAAGACAACTTTTCGCTGCAGAAGTTCGCTCCCGGCGAACACGGTGTCACCTACGTCCCTTTTCAAAGTCCTACTCCAAACAG 
CCTTTTTCTGGGCGAGAACAACAACTCCCAGCCCCACCTCGTCTTTCACACCAACTCTCGGTCTAGTCCCAACAACAATGCTTATCCCTACGACC 
AGAGCGGTTTGCCCTCTTCCCTCCGTATGGGGTCAAACTCCAATGCCCGACGGCCGACTCACATTCCTCTGCCCACGGTTCCCATGCACAACTGC 
CAGGAGGTAGAAAATGATGAGGACGCAGATGGCGAATCCGAGCAAGATCACGATCAGATGCTGACGCCTCCGCCTCCGCCGTTAGTGCGGCCTCA 
CATCCATCAACGCTTGGGTCAGGCGCCCTACCTCGATCTCTCTCCGGAAGTGGCAGAGCGCTATGCGATTCCCAGCAAGCTTGGACCCAGCCTTC 
CCATTCAAGTGCCACTCCCCGTCCCACATGGTTCGCCCATGGTACGTCGCAGTAATCGTCGACCTAGGCCATCCAATCCCGTGAACTTTTGCGAC 
CAGATTCGCGCTACACCTCCTGGTTATGTGGTACGCGCCCAGAGCGACGACCGTCTAATGGAGCAGGTCGAGGCTGATGCAGCACCGCACGTGAA 
TCGTCGGAGCGGTCGCGGCGGTAGCGGTCAAAAGACCCGTCCTCGGTCGTTCTGCAACTCGATAGTGGGCGTGCAGGGTTAGACTTAAGTACGTC 
CCGCTGCACTTGCTCCTCTCCTCTCTCTTACTCACGCTTTCTGCCTAGGCAGGATGGCTGTTACTAACTCTACCAGGATCACTCGATCCCTAAAC 
ACATACAAAAAATTTCATATACATAGGTGTTGCTGTTTCTTTGTTGTGCCTAACCAGCCGGCTAACCTGGCTAAGAACAGGTCCTAACCATAGCT 
TCCCATGCAAGCGTTTATCGATTTTAATGATACCTTGCAGAGCTGACTTTAATTGTGTAAATCTAATTAATAATTCAGATACGTGCGATCAGTAG 
AGACTTTCCTCGAATTGTGATAGAAGGCGTGGAAAGTCGTGACAGAATGCAAGCCTAACAATTTTATGTTGATCTGTGATAACTGAAAACAAATA 
TGAACGACAAACGATAATTTGAAAAACGAAAATGTCACAAAATCATTATCAAGAAAAGGATATGACAATTTGTTGAGAATACGAGCGATAAATGA 
AATTTTGAGCCAATTGCAACCAGATGCCTTGGATGTATGACGGAATAGAGATTAGAATATACAACATACATATATATACCGATGTTATGGGCGGG 
AGAGCCCAGCCTCCCCAAGAGTTGCCTTTGTTGTTTAACTAATAATACATATGAATGTATATTTGCCCTTAATAACGAAATGTACTACGATGCAA 
ACTTTAGTTGTCGAACACAAACAAGAAGTTGCCTATTCCGTTAAGTGAGAGATTCGTTATACAAGCGCAAGCAAACGTAAAAACGTTTAGAATTA 
CGTATAGAAATGTTAAGCCTAGTATTAAGCTTGAGGCCACAAAACGCAAGAGCAGAGCAGATCTAAATCCATTTAGCGTAGACACTATTTACTGT 
TTTAGATAGATCCCTCATATTTAGCCATGCGAATTGTGCGTAGACTGGTGCCAATTCTCGAAGAGATTGAACGCGTTTCCAATAAAAATGTTCAT 
TTACACCATAGTACGGGTAATCTTTCTCTGTCTCGGAAGCTTTTTTCGTAGCCGATCCTTCTATCTTCCTAAGTTGCTCTCTTACTGGTAGTTGG 
CTTTTTGTAGCAATTGTGCCTCTAACCAACCGCTAACTATTGATTCTCTCCGAACGTAGCTAATCCAACTTGATTAACAGCTCCAATCACAAAAG 
CACACTCAAAAGCAAGCCAAGGTAACTTTGAGCATTGTATCCGCGGCTTAACGGTCTAAACAATCTTCCACTGATCCTATATTGTTCTCTCCGGC 
GCAGTGAAGCATATTAGGAAGCAGTCTAAACACCGCCAGTCGCTGTACTCCATCGAATTCGGCAGCAGTACCGACACCATCCGGCATGGCGGCCA 
CTCCTTGGGACTGGGCGATCCTCGGGGCGACCAAGATGGCGGCGAACTGAGTCCTAAGCCGATGACACCACGACGCGTCAAGCGGCGATCGGTCA 
TGGCGCGCGGCACACAAGGAAGGCAGTCAAATGGAGGAGGCTCTAGCAGGCGCTCATACCACGGATCAAGCGGCACCCTGAGATCCAAGCACGGA 
CATAGTGGACGTGGAGGCGGAGCAGACAGCAGTAGCGGAAGCTATGTGAGGAGCAGCACTCGAAGCTCTCGCCGGAAGTATCATCAGAATCCAGT 
GACTAAAATCATCGATCAGCAGCTTCAGCAGGAGCAACCTCACCTGGGTTCCTTCCACCGACAGGATAAACTCTCGCTGACTGCCTCCAACCTGC 
AATCACTCAACAATGACTTTGTGAACAACACCAATGCTTCCGGCAGTCAATTGGGAAGTCTCGGCAAACCCAATGAGCACTACTACCACAGCTAC 
CGCAAAAACATTCCGCTGGATCCCATTTACTACAATACCAACGGTCTAGGAGCGCTGGAGGAACCACCGAGCCTTCCACCGCCACCACCTCTGCC 
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ATCTGCAGCTGGAGCCTCAATCCTGCACGCGGGCGGTGGAGGCCACTACAACCCCAGTTACCAACACTCCACGACGCACCTAAACGACGTTGGTC 

ACGCTCCCGATGAAGTATACAACAATCGACCGCCCTCGGTGCGCTCCAGTTACTCGAATTTTCATGGCTCACGGCCACTCTCCACCGCTTACGGG 

AATGCCACTGGTGAGAATGTATTCGCTGGACTGAACGGAGCCAGCGCCAGCCCACCGCAGGCTCCTTGTACGCCACCCCTCTACCAACAGCATCC 

CATGCAACTACAGCAGCACCAACAGCAGCAGTATCCTCCACCGCCGCCAATGGATCCGGCGCCTGCTTACGTCAGCACCATGTCGTTCTCCAAAA 

GAACTGCCTCTCGGGAAAGCATTCGCTCAATGGCCTTTCTCAACAACGGCCCGCCCGCCTACAATCTGAATTACCACACGCCGCCCGACTCGGAG 

ACGACCATGTAATTTGGTTCGCCCTTTGTCTTTACCGTCGCACATGGATCTTGAACTATAACACAATCATCATCAAGCCATCAGCCCATGCCCAA 

CACATATTATATCTTGTGCAACAAGCCAAGGCTACCACTTGGTGCTTTAACCACATGGATAGCATATACTTGATCAGTAAAATAACGATATCTGT 

AGTGCTTAATTTATACGTAAGTTTCTGTCGTTTTACCAAAGTGCCTTTGTAAATAAATACTTAAAATAATGGATCGGTTATTTAAATATCAGAAA 

ATATTGTTAATACTTGATTGTTTATGTTCATTTATATTTCTTCTTAGGTAATTGGCTTATAGTTTTATACCTGTAGATAAGTACGTGCTGTACTT 

GGTATCTTAGTCCCTTCGACACACTGTAAGATGTACGAAAACTTAGGAAACTGCAAAAGTTGGATATTTGCTCCAATACTTACATTCATCTGCAT 

CCCAGCCCGGCTCGATGTCCAGTTGCTTGTTGCGCTCCTTCAGCCACGTGAAAAGTGGGGCAAAGTACTCAAGGATACCGCGTCCGCTCAACTTT 

CGTTCACCGGTGGCTATCTCCATGACATCCCGCCAATGTCGAGTGGCTCCCAGCTGCATCATGTCGCGAATCTTTTTTCCTGCTTCTTTGCTGTC 

GTAGAAATCGCAATTGTGAAGTGGGAAATTGGGATCACCGGGCTTGTACTGGCCACTTGCCAGGCAAAAGGAACGGTAGAACTGGAATCCCAGGA 

TCTCGGCCAGGAATTTTCTGTGGGTCCCGGGGAATATGAAATTCATGATTATCAAAAACAAATTATGGAAACTGACTTGGTATTGGAGGTTTCCG 

GATTTAGTCCTCGATAGAACTTGAAATCCACATCAAAGTCCTTGTTAAGACGTTGTAATGGCGGCTGAACTCCGGCGAATTTGTCCTGGAGGCCC 

CAGTAAGCGCAGTTATAGTCCTTTACTCCGATTCGCCCGTCCATTACGTCCACGAGGAATTT 

(SEQ ID NO: 907) 

Exon: 1001.. 1058 
Exon: 2063.. 2451 
Exon: 2509, .2646 
Exon: 2710. .2790 
Exon: 2853.. 3044 
Exon: 3106.. 3166 
Exon: 3958. .5397 
Exon: 6655.. 7802 
Start ATG: 2392 

Transcript No. : CT24170 

TCACACTGCTCAGTTATTTCAATTGTTTGTCCCGTTTGCTCGAGCACTTGTTCCCAACATCAGGTTTTGTTCGCTGCTCGACGGGGCAGCTGCAT 
TCTTTCTCGCTCCCACCCTGCCCCTGGATTCCCCTGCACCTGTTCGTTCTCGAGGTGCGAGCGAGAGGCCAAGAGGCAGAGGTAGAGCGGCACTT 
GCCACGCTTGACTTGCAGCAGGTCTGCAGGCGACGCGGAAATGCCACTAGAAGGTCCAGCTACAGCCGCCGATAGACACCAGTTCCAGTCCCGAT 
AAGGAAGCGATAGACTGAAGTAGACAGCTAGCTGG7AAAGACAGATAGCGACGGGGAGAGTGGATTTATTTATTATTCAAATTCTGAATGGAATGA 
CGATGTGATGGGCTCGTGCTCGTGTACGCGGCGACACTTTTTGCTGTCAATATGCTTCCTGCAAGTGATAACGATTATCGAGCGCCAGGTATTCG 
ACTTCCTCGGCTACATGTGGGCGCCTATCCTAGTGAACTTTTTCCACATTTTGTTCATCATATTCGGGTTTTACGGCGCCTACCACTTTCGCGTT 
AAATACATCATCACCTATCTAATATGG7AACTTCCTGTGGATCGGGTGGAATACCTTCCTCATTTGCTTCTACTTAAATGTGGGGCAGTTGAACAG 
GGACAGTGACCTGCTCAACCTGGGCACTGGCAGCGTCTCCTGGTTCGAGGCGAATGGGTACGGTTGCAAGCCCACCTATAACATGGCCGCGGACG 
ATACGTTCCGTCCCCAGCGGCCGGAGCGCGTGGAAGGCTGCCTGCTGGACTACCCGCTGGTGGAGATCACCCACTCGGGGGTACAGTGCGCTCTG 
GCGCTGCTCGGCATACTCGGTGCGATTCTGATTAGTTGCATATTTCTTGACGAGGACGACAGATTCGATTTCATG7AACGGCGACGCGAAGAGTCC 
ACAGCATACCGTGGTGCACCCAATGTACGTGAGCTATACAAGTATACCCACAACATCCGCAAGCGCCACCATGC7AATCAAATAAGCATCTGC7AAC 
TGCAGCACCAGCAGCCGCAGCAGAACTCACTGAAACTCTATCATCATCAGCAACAACAGCAACCCAAACTACACCACTTC7AATAAGAACTACCAG 
TTGAGCGGCAGCAACAATAATACACTGAATAATAATCTCCACCAGCGTGCGCCTGCGCTGCTGCCGCCAAACACAACAAATAACCGCAGCGCATC 
ATTCC7AAACTCAAAGTCACCCTTCAAACAACCATGTAACTCAACGCACAGGCGGAGAGGGGAGCAACTGCAGTAGTCTACGCCGCCATCGACAAC 
ACCATTCCAAGGCACTCGTCAGCCCCAGTCCCATGTCACCTCAGACCACGCCGTCGCTTTCGTACGCTTCCCTGCAAAACTCGAGTCCCTACCTA 
GCCGGAAACTCCCTTAGCAACAGCAACTACAGCATATTCCAGAGCCCCGACTCGCTCCAGGGCAGTTCTCATTTCGCCCGTATTCATCACAAGCC 
CAAGCCGCCCAAATCTGATTATCCTGTTTCTGGAGAATTCAATCCGGGCGGGAATATATCCTCGCCTGTCCGTCCGCTCGATCGCCTGTCGCGCA 
GTCTCGAGGACGACGAAGACAACTTTTCGCTGCAGAAGTTCGCTCCCGGCG7AACACGGTGTCACCTACGTCCCTTTTCAAAGTCCTACTCCAAAC 
AGCCTTTTTCTGGGCGAGAACAACAACTCCCAGCCCCACCTCGTCTTTCACACCAACTCTCGGTCTAGTCCCAACAACAATGCTTATCCCTACGA 
CCAGAGCGGTTTGCCCTCTTCCCTCCGTATGGGGTCA7AACTCCAATGCCCGACGGCCGACTCACATTCCTCTGCCCACGGTTCCCATGCAC7AACT 
GCCAGGAGGTAGAAAATGATGAGGACGCAGATGGCGAATCCGAGCAAGATCACGATCAGATGCTGACGCCTCCGCCTCCGCCGTTAGTGCGGCCT 
CACATCCATCAACGCTTGGGTCAGGCGCCCTACCTCGATCTCTCTCCGGAAGTGGCAGAGCGCTATGCGATTCCCAGCAAGCTTGGACCCAGCCT 
TCCCATTCAAGTGCCACTCCCCGTCCCACATGGTTCGCCCATGGTACGTCGCAGTAATCGTCGACCTAGGCCATCC7AATCCCGTGAACTTTTGCG 
ACCAGATTCGCGCTACACCTCCTGGTTATGTGGTACGCGCCCAGAGCGACGACCGTCTAATGGAGCAGGTCGAGGCTGATGCAGCACCGCACGTG 
AATCGTCGGAGCGGTCGCGGCGGTAGCGGTCAAAAGACCCGTCCTCGGTCGTTCTGCAACTCGATAGTGGGCGTGCAGGTGAAGCATATTAGGAA 
GCAGTCTAAACACCGCCAGTCGCTGTACTCCATCGAATTCGGCAGCAGTACCGACACCATCCGGCATGGCGGCCACTCCTTGGGACTGGGCGATC 
CTCGGGGCGACCAAGATGGCGGCGAACTGAGTCCT7AAGCCGATGACACCACGACGCGTCAAGCGGCGATCGGTCATGGCGCGCGGCACACAAGGA 
AGGCAGTCAAATGGAGGAGGCTCTAGCAGGCGCTCATACCACGGATCAAGCGGCACCCTGAGATCCAAGCACGGACATAGTGGACGTGGAGGCGG 
AGCAGACAGCAGTAGCGGAAGCTATGTGAGGAGCAGCACTCGAAGCTCTCGCCGGAAGTATCATCAGAATCCAGTGACTAAAATCATCGATCAGC 
AGCTTCAGCAGGAGCAACCTCACCTGGGTTCCTTCCACCGACAGGATAAACTCTCGCTGACTGCCTCCAACCTGCAATCACTCAAC7AATGACTTT 
GTGAACAACACC7AATGCTTCCGGCAGTC7AATTGGGAAGTCTCGGCAAACCCAATGAGCACTACTACCACAGCTACCGCAAAAACATTCCGCTGGA 
TCCCATTTACTACAATACCAACGGTCTAGGAGCGCTGGAGGAACCACCGAGCCTTCCACCGCCACCACCTCTGCCATCTGCAGCTGGAGCCTCAA 
TCCTGCACGCGGGCGGTGGAGGCCACTACAACCCCAGTTACC7AACACTCCACGACGCACCTAAACGACGTTGGTCACGCTCCCGATGAAGTATAC 
7AACAATCGACCGCCCTCGGTGCGCTCCAGTTACTCGAATTTTCATGGCTCACGGCCACTCTCCACCGCTTACGGGAATGCCACTGGTGAGAATGT 
ATTCGCTGGACTGAACGGAGCCAGCGCCAGCCCACCGCAGGCTCCTTGTACGCCACCCCTCTACCAACAGCATCCCATGCAACTACAGCAGCACC 
AACAGCAGCAGTATCCTCCACCGCCGCCAATGGATCCGGCGCCTGCTTACGTCAGCACCATGTCGTTCTCCAAAAGAACTGCCTCTCGGGAAAGC 
ATTCGCTCAATGGCCTTTCTCAACAACGGCCCGCCCGCCTACAATCTGAATTACCACACGCCGCCCGACTCGGAGACGACCATGTAA 
(SEQ ID NO: 908) 

Start ATG: 388 
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MGSCSCTRRHFLLSICFLQVITIIERQVFDFLGYMWAPILVNFFHILFIIFGFYGAYHFRVKYIITYLIWNFLWIGWNTFLICFYLNVGQLNRDS 
DLLNLGTGSVSWFEANGYGCKPTYNMAADDTFRPQRPERVEGCLLDYPLVEITHSGVQCALALLGILGAILISCIFLDEDDRFDFMNGDAKSPQH 
TVVHPMYVSYTSIPTTSASATMQSNKHLQLQHQQPQQHSLKLYHHQQQQQPKLHHFNKNYQLSGSNNNTLNNNLHQRAPALLPPNTTNNRSASFQ 
TQSHPSNNHVTQRTGGEGSNCSSLRRHRQHHSKALVSPSPMSPQTTPSLSYASLQNSSPYLAGNSLSNSNYSIFQSPDSLQGSSHFARIHHKPKP 
PKSDYPVSGEFNPGGWISSPVRPLDRLSRSLEDDEDNFSLQKFAPGEHGVTYVPFQSPTPNSLFLGENNNSQPHLVFHTNSRSSPNNNAYPYDQS 
GLPSSLRMGSNSNARRPTHIPLPTVPMHHCQEVENDEDADGESEQDHDQMLTPPPPPLVRPHIHQRLGQAPYLDLSPEVAERYAIPSKLGPSLPI 
QVPLPVPHGSPMVRRSNRRPRPSNPVNFCDQIRATPPGYVVRAQSDDRLMEQVEADAAPHVNRRSGRGGSGQKTRPRSFCHSIVGVQVKHIRKQS 
KHRQSLYSIEFGSSTDTIRHGGHSLGLGDPRGDQDGGELSPKPMTPRRVKRRSVMARGTQGRQSNGGGSSRRSYHGSSGTLRSKHGHSGRGGGAD 
SSSGSYVRSSTRSSRRKYHQNPVTKIIDQQLQQEQPHLGSFHRQDKLSLTASNLQSLNNDFVNNTNASGSQLGSLGKPNEHYYHSYRKNIPLDPI 
YYNTNGLGALEEPPSLPPPPPLPSAAGASILHAGGGGHYNPSYQHSTTHLNDVGHAPDEVYNNRPPSVRSSYSNFHGSRPLSTAYGNATGENVFA 
GLNGASASPPQAPCTPPLYQQHPMQLQQHQQQQYPPPPPMDPAPAYVSTMSFSKRTASRESIRSMAFLNNGPPAYNLNYHTPPDSETTM* 
{SEQ ID NO: 909) 



Celera Sequence No. : 142000013383969 

CATAGAGCGGCTCTATAAAGATCTATTTTCCCCTTACTCTCTGTCTCTCTCTCTCTCTCTGTCTGTTTGTAGCTCTCTCTTTTTTCCCATAAGAT 
GATTTCTGAACGCAGATAATTATTCTTATTCTTATTATATTTTTGTATTAATTTGAGCCAATAATTTTGCCTTTTTCCAAAATTAGTTAAGCATA 
ATAATTTCTATTATATTTCTTATTAATTTAGAAACGTGTTCAGCTCAAAGCTAAACAAAAACATGTCAAAAGAAACGGAATTTTTGTTCAGTGCA 
ATAAGATCTCTGAGCGGATCTATTCAACAGTCCAAGTTCTACGTCGTCATCATGATGGTGTAAACACATTTGCCAAATCACATCACAGCCCACTG 
TTGAGCCGGTCTTGTGAGAGTGCGATAGAGAGGGGTCGCCCAGGCCAAGACAGAGAGACAGAGAAAGAGACAGGTGGAGAGAGAGGGAAGGCTAT 
TGCCCGGCAACTGCTAATTGGGATGGCTAAATGCTGCATTGCATCCTCTGCCGCGATTCCTGCTGCACTTACGGTGCATCGTCGTCTATCCTGGC 
GCTGCCCACGCACGAAGAGAGAAGTTCACACTCTTCCCGCCTTCCGCCGGCGGTACAGAGCAAAAAAAAATTTTATTAATTTCGGTTAATTACAA 
ATTTGTACCAGGGACTTAAGCTAACGAATTTCAGAACCGAAACATTCATAAGCTTATGCTTAACAACGTAATTATATAAAATAAATAAAATTGGG 
AATTTGCTAGCATGCCAAGCATGTGATGTGTAATGTGATCCTTGGTCAAGAGATTTACATTCTCTCTTGCTTGCCCAATGCAAATTTCTCTCAGT 
GTACGGGCCAGCTCTCTCCGACCATCTCGCTCTCGTTCCGCCGTTTTGCTTTGCCAGTGACTTTGCTTTGCTTTGGCGTAGAGAATGCTATAAAG 
C AGAC C GC AGC AGT GGAAT AG C AG C AAGT C G ACT GCT GGC C GC GT AAC AGT AAAC AC AC CAACG AGC GG AC GT AC AC GG AGC G AAAG AAG AAGT T 
GGCTACCACGAAAGAGTTATTGTTTGTTAGAAACACCAAGCGCAGCAGCAGCAGCAGCAGTACAAATATTATTGTGTGCCGTCCGAAGAGAACAG 
CTGAGCCACACGCTGTTTTTTAACGCTGTTCCGTTCTGCGGCACTTGCTGTTTGTTATCCGTATCCGTCGTGTTTCGTCGTCGTATTTGAACTCT 
CTTTTTGTTACTGTGCATATATTTTTGAAATCTCTGTGCTGCACACAGCGCAGGGGGAAAAGTGCTAGAATTAGCAGATCATCAAGACATCTGGC 
TAACCTGTGATTTCCAACACACCCGAAGGCAAATTGGCAACAAGTTACCACCGTAAGTCTCGTGTCGATCAACAAGTGAAGATGTTAGTTAAGTG 
TGGCAAAAGAACTGAACAAGAATTTATTGCATTTGCAAATATACATATGTACTATACCGATAGGAACGCGTTTTAAGAAAGTAATCATGCTTTTA 
ATTTACGTGATGTGTGACTGTTATTTGTGATTCAATATAAGAAGATTAAGAAGATTAATAAGATTTTGTGAAATTTTCAAACTATGCCCAAGACA 
AATGTAGCCCACTGTTTTGGGGGTAGGGGATTTTCAGAAGCTTTGCTTTTGAGCATAAACCGTAATTAGAAGGCATTAAAAACGTGTGATTTGGA 
AATTCGTGAATTAAACCTTATAGGATTTCTCTCTCTCTCTCTCTCTGTGAGAGAGTGTGAGAACCCAAATTGAGTTTAAGAATTAGCTGAAGTTT 
CCATAATAACCACTGGCCAAGTGATTAGTTCACATGGTGGGTGGAATCCGTCTAGTCCAAACCATAATCCCCTGCCCAGTCTGATCTGATCCCGG 
AACCAAATTGAGTTTTGCCAAAGGTTTCAGCATTTTTCATGTGCATTTTTTTTTCATCTAACCATCGACGGGGAGCAGAAAATGTTTACCAAATT 
GAATTACACATTTTTATGATGTGCTGGTGCCTTGCTCTTCCCCCTTCCTCAAAGCCACCGCCACCCCGCCGCCTGACAGGCCTCAGGTGCTGTCC 
TGAArCTGGAAATCCGTGTGTCCGGGTGCCGAAGCCGGAATGGAAGGAGTCCACGTTTTCGAGAGCGGGAATGCATTTATGGCGCCGCCATTAGG 
GT GG AC C AGT AAAT T AGGT T AAT T GT AG AAAGT AAT C G AAAGCT T T AAC AGC ACC AAAT TGCATATTCT G AAT C AAG GAG AC TGTATTTTTCCCA 
CACTGCAAGTCACATTTCAATATGTAAGTGCTGCATTGAATATGTACATGTTCAACTTATAAATGTGTATTCTATATATATATATATATAGGCAC 
ATAATCTATAATCTGTTTTGTAATCACTACTGCTTTTTGCAACTTTATGGTAAAAAATACTTTGTAAATCCTTTGGTGTGCACCAAAATGAGCAT 
GAATGCATAAGCAAACATAAGCGCTTAATTATGAAAACATACCGATTTAACTAGAGTCCTGGATTATAATTGACCGATTATTTGATTAAAAGGCA 
TCCGAGGCATTCAAGAGCAACCAAATATTGTTGGCCGTTGATAACCATTGCTGGTTGCCAAGGTTTCCATAGTATTTGCAATCGGCTCAGAAGTG 
AAATGCTCTGTCTGAAATTCCGACTTAACTCCCCACCAGATAACCTGTTGCATTTCGCAATGGGGTAAACAATGGAATCACCTGTGCGCCCCCAC 
ATTTTGGCCTTCATATGGAAGCACATTCGTGCGCATTCATCTGGAAAATTGATGGTAAATTTCGCATTTAGTTTGATTCTCAATTGAATCAACAG 
TTCTAAGCCTACTGCTTACGTGTTTGTGCGATGATGATGATGATGTATAATGGTCTATCCACCTTTCGCAAATTTGCAATGGACAAGGAGGCGGT 
GCAAAGGTCTGTACATAAGTCCAGAATCTCAAGGGGTGATAAGCCTAAAAAAAAGGCAGACTGTTATATGTCAGATACTCGTAGATAGTTAGAGG 
GGTTTTTCAATGAAAAACTTAGAAGTGTTATTTTGCCTTCTGACACTTCACAATGAATATTCAGAAAAAAAAAAATGGGGAATTAACAAAGCAAT 
ATGATTATGATATGATATTATATCAATAGTTTTATTTATCAACATTCAGGTGTGCACTAATTTCTTGCCGCCAAAAACAAAGCCAATCAATAATT 
TATGTAGATAAGCATTTTGCTTATAACCATGCATTTTCATTATTGAATTGTCATTTTTTATCTTAGTTGAACCCAACTCCGCCTCCTCCGCGTAG 
ATCGTGTGTCAGCCACTTGTCTTGTTTGACAGCACACCCAAGTGGCATTAGCATTTGGGGACTGAAACTGGAACCATTGGCCAGTTTGCCCAAGA 
GAATTTCTGGGATTGGTTTTGGGTTTCTCCAGTGCACCGCAGTGCACTCCTCTCGGGCCACCACTTCAGTTCACGGAAGGCACCACTTGGTGGGA 
GTCGGCT TGAAGT ACGTGAGT TTATTT GGCT GAGCAC AAAAAT C AGAGACTCGAAC AACAC AC AAGATCGAAAAC AAAAAAC AAAATT AAAC AAA 
AAATATAATAAAATAAAAGAAGTGAAGTGAAGAGCAGGGTCGAGAGGAGAGGAGGGGAGAAGTGCCTAGAAATGGTCTGAAATGTGCATTAAACG 
CTTTGCATAGGAAAGCGACATGGACATGGTATGGGGATTGCGCCGGAGATTGACATTCCCATTCCCATTCCCGTTCGTATTCCTATTCCATTGGG 
CTACCCACTCCTATTTCCATTTCCCATTCCCTAACCCAAATCCCTACTATACCATTTTCACTTCGAGAATCGCAGAGAACACACAAAACACACAT 
GTGAGAATGTGCCGGGTGTTGGGATTATTTTGTGGCCGGGTGAGGGGCCTTCAGGCCTTCAGGCCTTCGGGCAATAACAATCATTCGAAATGCGC 
CTTAAAATAGCAAACACCAAACACGTTGCTCATAATCTCCGATGCAATTGAGTGCTGGCAGCCAGCACATGAATGATAACTTTGTCCAGGCCCGC 
TGGAGTATTAATAGCCATTTCTAGTCCCTATTGAAAGCGAGCATGACGCCCGCTGCCATGCCAACCAGCATCCCCATTGAGCTCCTAACCATGGC 
CGACCAACCAACATCGTCCGAACCTGCAGAACAAAGCCGAAAAAACGTCTCTAAGCCACAGTCCAACTTCAGTTTTGGTTGTCGCCACCAGGTGT 
CAATGCTTTAAATTATGTTTTACATTTCCACCAAAAAACACATCATCAGCCAACCAACAACACCATCATTTGCACTTGTTGTTGGCTAATTAGCG 
GCGAGAAGTTTTTTCTAACACAGCAAAGCCCATCCATTCACCCAACCGTTTGTTCGAGTTAGCGAAGTTCAACTGTATGGCTTGGTGCCCGAAAT 
TTTTGCCCACTCCTGGGATTTGGTCTTATGCATTTTGCAATTTGTTTTCCACCACGAAAGTAGCAGGCGCCAAAAACTCTGGGTTCGGAATAAAG 
GCCATAGATTCCGTGAAACCAAGGGGGTCTTCTTTATTAGAGTTTACGATTCGATGCGTTTATATTACCGAAATAAACCCTAATTGGACACATTG 
ATGCTGCAAAACGCAACACACATTTGTGCTTTTGACTTTCCTCCTCCCATCGACGTTGATCTCTAAAAACCGGTTTTCCAGCCTTATATGCATAC 
AAATCACTTCCTCTAAACAACGATTCAACGATTTCAGCGTCATTATACCGTCAATTAAAGAGTACGCCCCTAAAAACGAGAGTTTGACATATTTG 
TACATATGTAAACCCGTTTTGAACGAGGAACTATTTTAGTTAATAATAGGTATGTATATAACTACAGATTTCTAGCCAACAAAATGACTTCCGAA 
ACAAATAATTAAGTAAAAAGTATTCTATAGAACTCTATTGTTTTGATAGCTAGATTAGTGCGCAGGTATAAAAAACTGAATATTTCCTCTAACAC 
ATTTTCTAGACACTCAACTTGAAAGCATTGTGGCATTATTTGTTTTCGCCTTCCTCGAATACTAAATCATTTAATAATGTTCCTCAATCAGATCA 
AAGCGGAATCCGTTCTTATTGTAATCTTCTGAAAAAGAGGTAAGCTCTTCGGGCAAATATGTATCTCATTGATAGATGGATGATATCTCCCATAT 
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AGTATAGCCATCCATACCAGTGGGTGATTATAATAATTATATTTACAACGAAGCGTTCAGTTCACAAGTCCGATGTTGCCCTTTTCCCTTCTCAA 

ACTTTAAGTACACCTCCAACCAGATTGTACATATGTACATACATATATGTACATACATATATGTTTGTTCTCTTATTTGATCAGCTGAATGTTGG 

GCGTTCTCTTACTTTTCTCATTAGCAGCGAATCAATTACGCGCGTATCGCTGGGTGTTTAATCAATTCTCATTGACACAATTTAATTGAAGCGTT 

TGCTTGTGAAATAAAACGATCTGAGCGCCAATTCGTTGACGCCTCTCGTTGCGCATACCCATGTATTCACTATATGTTGTGAATGAATATACTCA 

TATACTCATATGTTGTATATATACGATTCGATTCCGTTTATTACACACAAAGCAAACCGATTAATCTCAATTAGGGGCCAATTGGCTGAGCGAAA 

ATGCTGGAGAAAACTTCCGTTGCGTGTGTAAACACTGGGAGAAAATCTTGTACGTTAGAACTGAAGAAGTGATCGTAATAATATAATATAATATA 

AT AAT AAT AA T AATAATAAT AGC AT AAT ATGAT AACGT AT T T A TAT TAT TAT T GT A T ATAAT T AAAT T AT T T T AT GT GT G AAT AAT T GT T AAC AC 

AATTGATATGTCTTTCTCTGAGTGTATCAATGGGAAGCTTGTTTGATAGTTGTTCGCCCATTAGCTGCACCTGTTCTACACGCTTGCTCCTTGTA 

TTCTTTTTTTGTCATCTGTGTGCCTGAGTGGGCATGTCGTGAACTCGAGAAAGTACAAGTGTTTCCAGCACTTTGTTCTCTGGCTCCACTTCGTC 

CCGCCCAGAGATAACCGGCGATCGGCCTAATCAATAGTGCCGCGCCATCGGGGCCCTCTATCTATATATTTATCTTTCTATCTTTCCGAGCGTAT 

GACTCATCACAGGCCCGACCGTTTGCCATTCCGGGTGCGATAAGGCCTAGAATTTGGTAGACCCGTTAATTGGTGCACAGAGAAAAAAAAATGAT 

GTCATTGGTAAGCCGCATTCCTTGATACACAGTATCAAGCTGCTCTTCACTCCGAAGCCTTGCTCATTTTGAGATTGGCTTTTCAAGAGCAATTC 

TTTGAAGATTCATCTTTGATTCCACATTACATCATGGCGGAATCGAAATCTTCGAAAAACACCATTTACCGATGCGGACTTGAAAAGAAGATAAT 

TTGGGATTTGGAACTGCGTTTTTTTTTTTTGGCTTTAGTCTTCCAATTTTTGCGTTATCTTATCTCATAAGTGACATGATCGTATAACCACCAAT 

ATTCCAGTTTGCATGCAAGTTTCTAGACAGACAGATTCGCGATATATATGTACATATATAGCACGTGCTTTTTGGCTGTATTCGATTCTATTATC 

TGGCCATTGCGAAAAGAATTTGCCAATCCTTGGGCGCATTCGGTTCCTTTGTCCGATCGCTTGGGGAACACTCCGGCTTTCCATTCGTGGTGGCC 

CGTCTTATCGTTTATAAAAGATAACACCATTCGACGGCAATAAGCCCAGCGAATCCCTTGTTTATTGACTTCTTATTCACCGGGCCCCGGTCATT 

AAAATGCAAAATTATAGGCGGCACACAAGCGTCTTATCGATGAACGGGTTATCCAAAAATAGGTTTCAACCATCCAGACAAATTGCGATTTGTAT 

TTTATTGCATGTTGGCAAATCGAAATGTGCGCGAATTAAAAAGAAAAAAATTAAATAATAAAAAAAAACATAATTGTGAAATAACCAA 

CAACTAACCCAACCAAACAACAGCCAGCACTTACCAACATTATCTCAGCAGCTATTTGTATTTGTTTTTTTTTTTTCACAAAAAATTGTTGCCCA 

TGCGGCATGACGGCGCAAAATACATTGTCATTTATAAGAAAACATCAACGACAAATTGCGGAAACCACTCGGCAAATGCCCCACCGTAACCCGAA 

GCCCCAAAGACCGGAATATGGTGGATCGATCGATGGAGATTGGGCGATCTCTCGAACGCTCCAACTTGCTCGATATTCCGACGACTCAGAGAGAA 

AGTATCGCCACTTAATGCTCTCACGATAGCGTAGAGCCATATTGAATATGTGGAATGTCATGCGATGAGTTTGCAAGTACAGTGTAGCAAACTAG 

TCGAGTGTGTTTGTTTTTAATCACTTCTAGTCGATAATGTTAATAATGTAGTGGCATAATTTTAATGTTATCGCATTGGAAGTGTTGTAGTTCAT 

GGAAAAGACTTCAGTTTTATCGAAAAGGTGTCTAAAAGTAGGCTACAACATCTTACAATAGGCGTACTACTTAACCAAGTAAAAATGTTAGAAGT 

TAGAAGAATACTTATGCCACCACATGGTAACCAAATTATTTGTTTAAATACATAATTCCTTTAAACTTTCTTTAGAAACTGCATTGCTAATACAT 

TAACTTCCGTAGAAATGTTAACACAATCAATTCCTCTTAACTTTTGTATATTCATTAACTCATCCTGTGCAGATGGCCAGACTCTCATCCAGCTT 

TTCTCAGTTGATCGATGGCTTGGAACTTAGAGTCTGCTAAGTGCCATCGTCGGGGGTTTTGTACTTCTGGTTCTACCCCTTCTGCCCTCGATCCC 

AGACTGACTGCGTCAGCGGGCGGGCGTCAATTTATGGCACGCTTCGATGAAGTTCCATGGATGATGATGATGATGATGATGATGGGATGGAGCGT 

TGATGATGGGACGGGGCTTCCGGGGAAGCACTGTCTGAATGAAGACATATCCCCAGTGCACCAGCTGTACTCGTTCCCGCATTATCGATGCATAA 

ATAAACGAGATGCCGAGGTTGTGCGATGCCACACGATCCACGATACGCGATCCGCGATCCGTCGAGCTATGCTTTTATTTTTAAAACTTGTCCTT 

TGATTATCGCAAGAACCGAACGATGACAACACGTTGGTGTCTCGCATTGCCGCATTTTGATGGTCCGGTCTGGCGGCTTTCGCACTCTGCTCCAA 

TCTCCCAATCAAAGCACACACACCCGCTAAGTGATTTCCCTTCAAGATCACCAAATTGCATCAGTTTTTTTTTAGCGGTACAGGCAGAGCGTGCC 

TT GC ATAACC AAGCTCCGT GCGCTGT GT GTCGCACCCATGATAAAGGCAAATCAAAAAAAAAAAAAC AAAAAAAAGAAAAAAT AGAACTATCTAT 

ACGATGGAGCCACTCGATCGAGTGGTCTATCGACCCCACCCCGTTCATAAATGCCTATGACTGTCATATAAAAATGACTCCAGCAATTCCAGCTA 

TCTGCTGCGATCCCAATTAGCAATTAGAGCATTATCTCCCCCGTCACCACCCTTTACCACGTTACCACTGCAAAGAGTCGTTAAAGGTGTGTCGC 

CGATTTATATGATATTGCCGATGGAGTCTGATACCTTGTAATTTGGAAAATCGCCGGATGGGCATATGTTGTCCCAAGTGATGTGCATTTGTAGA 

TTCGTTAGATCGCTGGCATTGGTATGGGTTGAGCATTTAACGCCTGCTTTAGAACTTATAACGCATTGTGAGAGAGGTCTCAGGAAAAATAGGTG 

T T ACC T AA T G AAAAG T GT GGAT GC C T T T T AA T T T ACA C GA T TT ACT C AGC AAT T T GC AC T T T AAAGT G GT G GG AAAT CG AAGT C AAGC AC T AAG A 

TTCTTGAAGTACCACAAATTAGGTACATAAATAAAGCGCTTAATAGTTGTAAGTAGTTAAACCCTTTACAAGTTGGTATATGGGCTCACCTGCAA 

AGTTGCTAATTTCTGCTGTTTCGACGAGAGCACCAACTAATTTCTTCGAAAAATGTGACATTTGCTTATCATTTAGCCTTGTATGTACATACAAG 

TATTTATTCAATTAGCACACTATCGAGTAGATCATTAGTCGCATTATAGGATGGTTTCAACAGCGCAAAATTCCTACTCCTTAAATATTAAATCA 

GTGAATACTACTATAGTAGGCCCAGTGTGTTCrGCCTTCAGCGACAAATTAAGGCCTGAACTAAAAAAAGAAACGACCCAGGCAGTGCCATTAAT 

C AAC G TCTTTGCCCTTTT C AGG AT C AAAT C T C GC C AC AAT GG AAAAAT C C AGCT T AAC GGAG AAG AAT C AC AC C C AAGT GT AC AAT G AC AC AACG 

AAGCCAAAGAAGCCCAAGCGTCGCGACAAGAGCGATCTGGGTCCGGATTTTGTGGCGCCCGACGGGGGATGGGGCTGGGTGGTCTGTCTGGCAGC 

CGGTTTGAATAACGTAAGTGATATATAGCGTGGCCGAATTGCAGCAGTCTGGATGGTGCATAATAATCTCCCTATCCTCATCCTCACATTCCCGC 

ACAGTTCTTCCTTTTCCCGGCTCTGCAGCAGTATGGCCTGATCTACAGGGTGCGGATGCAATCGCTGGGCTTCGATGCCAAGCAAACCACGACCA 

TTGTGAACGTGGTGATGGCAATCTCCTCGCTAGTGGGTGAGTTTCACCTGGCCAACGTTCAATCGATTATGCATTTACTGATATATATATATATA 

TACATTAAATTCTCCGTGCATAGGCATTGTCAATGGCGCCATGTTCCGGCGGTTCACATTCCGTCAGGTGGCCCTGACCGGCACCAGTCTGGCCT 

TCCTGGGCGTCTTCTTGTCGGCCTTTTGCACCACCTTCTGGCAGTATATCATCTGTCTGTCGGCGATATTTGGTATCGGTTTGGGTCTTGCCATG 

GCCGCCACCTCACTGGCGGTCAATACGTACTTCAAGCTGAAGAGGCGTCGTGCCACGGGTTTCTCGTGGACGATCACTGGCCTGGGACCCATATT 

CTTTCCGCAGGTGTCTACCGTCCTGTTGGGCTACTATGGTGCCCAGGGCACCATTCTGATCTATGCTGGAATCGCCATGAATGCCATTCTCTGCG 

CTTTGACCCTGCAACCCGTACTTTGGCATGTGAAGAAGCCCGAAAAACCAGTTCACAATACAATCGAGGGAATTGCCGAGACGGAGAAGCTGCAA 

CCAGAACTTTTGGAGGCCAATGGCAATCTGTTGTCGCCCAGCAATGATCCGTGGAAGGATTACGAGTGCAAGTACTGTCAGTATCAGAAACGTTC 

TAAGCGAGGCCTTTTCTCGTCCCAATACCTCTTCAATGTCGATGATCCCGAGAGGCCGGGGTACGAGATCACCGAGCCAGGAACTCCCATGTTGG 

CTCGCGCCAATGACGGTTGGTTCGGGTCGAAGCTCTCCCTGACCTCGGAGAGCGCGGGAGGAGCTCGCTCTCGCACTCGTCAAGCTCTGATGCGT 

CAGGTTTCATCCAGAAGTCGTGAGAATCTCGATAGACTGGAACAGAATCGTCATGACCAGGGTCCAGACACTCCATCCGCTGCGGCCTTGTACAA 

GCCAAACTACTTCAATCGGGAGCGCGAGGATCTCGATCGCTATGCCAGCAAGACGAGCGTGTATTCTCGGCCTGGTCAGGATGAGCTGCTGCGCT 

GCACCTGTGCAGAGGACAAGGCACTGTTGCAAAAGACAGCCGAATCGCTTCAGGTGAACCTGTTCAATAATTCGGCCGACAACACGGAGGCCGAG 

GAGGAGGCCAAAAGGCGCATGACCTTCTTCCAGAAGGTGAGCAAGTTCTTTGATTTGGATCTACTGCGGGACTTTACATTTGTCAACCTGGCCGT 

TGGCATGAGCATCATGATGTTTGGCGAGATGAACTTCTCGGTGCTGACACCATTCATTCTGAACAGCTTTGGGTACACGGATACACAGATCTCGC 

TGGTGATGTCGCTGCTGGCCTGCATGGATATATCGGTGAGATTTCTCGCTCCGCTCGCCTTGGAGAAGGTGAAGCTGGATAACCGTGTGCTGTTT 

GCCTTCGGCATCCTATGCATCGCCGTGGGCCGCGTGGTTGTGGCCTTTACCGACTCCTACGAGATCATGATCGGCGTCTTCCTGCTGATCGGCTT 

TGGCAAGGCATTCCGCACCATCTTCTCACCGCTGATAATACCCAGTTACGTTCCGCTCAACCGCTTGCCAGCTGCATCTGGTCTGCAGCTCATCT 

TCAACACAATATTCTCGTTCGCCATGGGCCCAATTCTTGGTGAGTTTAATGTTAATTAGATTGTGTCTGAAACGTAAATGACCTTTACTATTTTC 

CTTAAAGGTATCTTGACGGAGGCCTACGGTTATGCAGCCACCATTCATACCATCAATGCACTCACCCTTTTGGCTTTACTCCTTTGGTTAGCTGA 

GTCCGTGGTGCGCCGCATTTTGGGTATACCCTCCAAAGGATTGGGCCAATAGAAGAGGCATTCATACTAATCCGATTTGAAATTGCGACGAGTGT 

GAGCTGAAAACCTTCGTCGGAACTGAACTACTTGCTATATGTCCATCCCAGTGTGCGAGATCATGGCTAACTTGGAGCTGTAGACATAACCTAGT 

CCATACTTCCCGATTGATGTGTGCGAATAGTATTAATTGAGCGTAAAGATACCTAAGAAACATATCTCTTAACTAGCTCATAATCACCCATCGCT 

ACATAAAATTTACGGTCCATTTTTTTTTTTAGCTTTTGATACTCGTACTAGAATAATTGAATGTACTTAGTGATACGTTATAAGTGTAATAATAT 

TGTTCTGCAGTTGATAAAGAAATATATATATGTATATTAGTTAGAAAAAAGTTAATGCACTATTGGTAAGGTGACTTAATTGAAATACACCATAT 
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GGTTTGAACATGTGAGTCAGCATATTTATAAAATGGGAAATTGAACTACTTTTGTGTGCTTCGTTTATACATAAATGTAGTGATTAGAGGGTAGG 
TTTAAAGGAAATGTTAAAGCATATCTCAATGTTTGGGGATGATTTCATGATGAGTTCATACTACATTTAGTATGCACTTTACTCAAAGATTACGT 
AGAAGAAATAAATATTTACCAGAAATTCGAGGGCTTGTTCAATATTTGTCAAGGTTCTAATTAAGATATCGGCCAGTTZ^AAAATAAATACAATCT 
CAAGTTTGTTTTGTTCTACAATATATGTGCAAATGCAGCTTGTTGAACTAGCTTATTAAGATTACTTACAATCTTTGTGTATGTAAAGTTAAAAA 
CTGTGAGAAACTATTTTCAATAGCTATAGTCGGTGTTTTCCATTTGAGAACGATTTAATAAAGTTCATATAATAAATTTGCCCGTTGGTAGAATA 
TGTGTAAACATTTTTATTTGCATATAAAGCTTAACAAT7^AACCAATAAATAGATAAGTAAATGTTGTGTACGTTTCTGGTTCGATATTGAATATT 
TAAAATG 

(SEQ ID NO: 910) 

Exon: 1001.. 1382 
Exon: 9332.. 9513 
Exon: 9600,. 9726 
Exon: 9809.. 11344 
Exon: 11408.. 11547 
Start ATG: 9349 

Transcript No. : CT24172 

TAAACACACCAACGAGCGGACGTACACGGAGCGAAAGAAGAAGTTGGCTACCACGAAAGAGTTATTGTTTGTTAGAAACACCAAGCGCAGCAGCA 

GCAGCAGCAGTACAAATATTATTGTGTGCCGTCCGAAGAGAACAGCTGAGCCACACGCTGTTTTTTAACGCTGTTCCGTTCTGCGGCACTTGCTG 

TTTGTTATCCGTATCCGTCGTGTTTCGTCGTCGTATTTGAACTCTCTTTTTGTTACTGTGCATATATTTTTGAAATCTCTGTGCTGCACACAGCG 

CAGGGGGAAAAGTGCTAGAATTAGCAGATCATCAAGACATCTGGCTAACCTGTGATTTCCAACACACCCGAAGGCAAATTGGCAACAAGTTACCA 

CCGATCAAATCTCGCCACAATGGAAAAATCCAGCTTAACGGAGAAGAATCACACCCAAGTGTACAATGACACAACGAAGCCAAAGAAGCCCAAGC 

GTCGCGACAAGAGCGATCTGGGTCCGGATTTTGTGGCGCCCGACGGGGGATGGGGCTGGGTGGTCTGTCTGGCAGCCGGTTTGAATAACTTCTTC 

CTTTTCCCGGCTCTGCAGCAGTATGGCCTGATCTACAGGGTGCGGATGCAATCGCTGGGCTTCGATGCCAAGCAAACCACGACCATTGTGAACGT 

GGTGATGGCAATCTCCTCGCTAGTGGGCATTGTCAATGGCGCCATGTTCCGGCGGTTCACATTCCGTCAGGTGGCCCTGACCGGCACCAGTCTGG 

CCTTCCTGGGCGTCTTCTTGTCGGCCTTTTGCACCACCTTCTGGCAGTATATCATCTGTCTGTCGGCGATATTTGGTATCGGTTTGGGTCTTGCC 

ATGGCCGCCACCTCACTGGCGGTCAATACGTACTTCAAGCTGAAGAGGCGTCGTGCCACGGGTTTCTCGTGGACGATCACTGGCCTGGGACCCAT 

ATTCTTTCCGCAGGTGTCTACCGTCCTGTTGGGCTACTATGGTGCCCAGGGCACCATTCTGATCTATGCTGGAATCGCCATGAATGCCATTCTCT 

GCGCTTTGACCCTGCAACCCGTACTTTGGCATGTGAAGAAGCCCGAAAAACCAGTTCACAATACAATCGAGGGAATTGCCGAGACGGAGAAGCTG 

CAACCAGAACTTTTGGAGGCCAATGGCAATCTGTTGTCGCCCAGCAATGATCCGTGGAAGGATTACGAGTGCAAGTACTGTCAGTATCAGAAACG 

TTCTAAGCGAGGCCTTTTCTCGTCCCAATACCTCTTCAATGTCGATGATCCCGAGAGGCCGGGGTACGAGATCACCGAGCCAGGAACTCCCATGT 

TGGCTCGCGCCAATGACGGTTGGTTCGGGTCGAAGCTCTCCCTGACCTCGGAGAGCGCGGGAGGAGCTCGCTCTCGCACTCGTCAAGCTCTGATG 

CGTCAGGTTTCATCCAGAAGTCGTGAGAATCTCGATAGACTGGAACAGAATCGTCATGACCAGGGTCCAGACACTCCATCCGCTGCGGCCTTGTA 

CAAGCCAAACTACTTCAATCGGGAGCGCGAGGATCTCGATCGCTATGCCAGCAAGACGAGCGTGTATTCTCGGCCTGGTCAGGATGAGCTGCTGC 

GCTGCACCTGTGCAGAGGACAAGGCACTGTTGCAAAAGACAGCCGAATCGCTTCAGGTGAACCTGTTCAATAATTCGGCCGACAACACGGAGGCC 

GAGGAGGAGGCCAAAAGGCGCATGACCTTCTTCCAGAAGGTGAGCAAGTTCTTTGATTTGGATCTACTGCGGGACTTTACATTTGTCAACCTGGC 

CGTTGGCATGAGCATCATGATGTTTGGCGAGATGAACTTCTCGGTGCTGACACCATTCATTCTGAACAGCTTTGGGTACACGGATACACAGATCT 

CGCTGGTGATGTCGCTGCTGGCCTGCATGGATATATCGGTGAGATTTCTCGCTCCGCTCGCCTTGGAGAAGGTGAAGCTGGATAACCGTGTGCTG 

TTTGCCTTCGGCATCCTATGCATCGCCGTGGGCCGCGTGGTTGTGGCCTTTACCGACTCCTACGAGATCATGATCGGCGTCTTCCTGCTGATCGG 

CTTTGGCAAGGCATTCCGCACCATCTTCTCACCGCTGATAATACCCAGTTACGTTCCGCTCAACCGCTTGCCAGCTGCATCTGGTCTGCAGCTCA 

TCTTCAACACAATATTCTCGTTCGCCATGGGCCCAATTCTTGGTATCTTGACGGAGGCCTACGGTTATGCAGCCACCATTCATACCATCAATGCA 

CTCACCCTTTTGGCTTTACTCCTTTGGTTAGCTGAGTCCGTGGTGCGCCGCATTTTGGGTATACCCTCCAAAGGATTGGGCCAATAG 

(SEQ ID NO: 911) 

Start ATG: 400 

MEKSSLTEKNHTQVYNDTTKPKKPKRRDKSDLGPDFVAPDGGWGWVVCLAAGLNNFFLFPALQQYGLIYRVRMQSLGFDAKQTTTIVNVVMAISS 
LVGIVNGAMFRRFTFRQVALTGTSLAFLGVFLSAFCTTFWQYIICLSAIFGIGLGLAMAATSLAVNTYFKLKRRRATGFSWTITGLGPIFFPQVS 
TVLLGYYGAQGTILIYAGIAMNAILCALTLQPVLWHVKKPEKPVHNTIEGIAETEKLQPELLEANGNLLSPSNDPWKDYECKYCQYQKRSKRGLF 
SSQYLFNVDDPERPGYEITEPGTPMLARANDGWFGSKLSLTSESAGGARSRTRQALMRQVSSRSRENLDRLEQNRHDQGPDTPSAAALYKPNYFN 
REREDLDRYASKTSVYSRPGQDELLRCTCAEDKALLQKTAESLQVNLFNNSADNTEAEEEAKRRMTFFQKVSKFFDLDLLRDFTFVNLAVGMSIM 
MFGEMNFSVLTPFILNSFGYTDTQISLVMSLLACMDISVRFLAPLALEKVKLDNRVLFAFGILCIAVGRVVVAFTDSYEIMIGVFLLIGFGKAFR 
TIFSPLIIPSYVPLNRLPAASGLQLIFNTIFSFAMGPILGILTEAYGYAATIHTINALTLLALLLWLAESVVRRILGIPSKGLGQ* 
{SEQ ID NO: 912) 

Name: monocarboxylic acid transporter; putative 
Classification: transporter 
Gene Symbol: BcDNA: LD28120 
FlyBase ID: FBgn0028469 



Celera Sequence No. : 142000013384504 

TCTTCAGTTCGTTTCGTGTCTTTGAACTTCCCAGCTGAACCCTAAAGCTATGGTTTCTTATTTTACATCAGGCTGCCTGATGCTAACTTATGAAA 
TACTACGTGAGCATTGACGAGTGCGrCCTTGGAAGAGCATTCCTCGGCGGCATGGCATCCTCCGCCTAACCGCAGAGATTCGCGATGGCCTGAGC 
CGCATTAGCGGCTGCCACCTGTGGAGCGTCCTCTGCCGGAATGCGTTTGCGTCGGAATAGAATCACATGCGGCTCCGGCGCGTGGACCATGTAGT 
GCACCCAGCCGGGGCTCTGCTGCACGCCCAGGTTGCGCCACTCCGTCTCGGTCATCAGATGAGCCTTCGGCACATGTTTGGCCAAATCCGGTGGC 
AAGATAACATGCCTGCAGGGAGGGAAAGTGGTTTGTGTGTAAACAAACAGTTCACGGATGCTGAAACGCAAGTTGAACGTACCTGTATTCAAAGT 
TGTCGTCGAAGTACTTCTCTGAGTATTGAATTTGATCGGCCGGCATTGCGAAAAGCCAGAAAACCTGAGGTAAATTAAGCAATTTTACCAAAATA 
GGTCAATAAATTCAGATAGATATGCCACAACTGATTCGATTCTAAATACTATTGTTATCGCTGCACGTATCGAACATAAAGTTATCGATTTGGTT 
ATCAGCTGTCCAGTTAACAGCACCTTTGGCGGTAAACTCAAATGTACTGTTTTTAATGGCACGATCTTAGCGGGCTTTAATATATTGAACATAAA 
CTTTTTTCTGTATAACAATATTTTTGCAATGTTTAAAGCCATTTTAAAGCCATTTTGTTAAAAAAACGAGTACTTTGTAAAAAGATGCTGTTAGA 
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AAATGTTAACATATTTAGCGAATGAAAAACTCAATTGGATAATATGTGATTTCCAGTGTGACCGGAACTCGCAGGCTCGCACATTTTCCAAATGC 
AATCGGCAATTTCTCCGTTACTCGCCCATGGCCACATTTGTGCATGTGCGTTTGAGGTTCCTTTCCTTTTTATATAAGCCCCTCGTCTCTTGCCA 
ACAAACCAACAACAACAACAACCAGCCAGGCCAGCGGGCAACAACZ\ACGCACAACAACGCGTATAAATAAACGTGAACGCATGTACTTAATATTA 
GCATAGAGTTGTCGCCTGCGTGCATTTCATATACATAACATAACATCCACTTACTCCCATGACAACGGGACTTTAGACAGTAACGGTGGAGCAGC 
TAGAAGAAGAGGAAAGGCGAAGTAGAACGGGAACAAGAAGAAAAAGCAGTGGAAGAGGTGAGGACGATACGTAAATACAGTCAAACTTCCATAAG 
TAGATACGACTTTACTACTACAATAAAAAGTGTCCTTTAGTGATGAAGAAAATGGGTTCTAAGAACCTAAAACCTTTATGTATATGTACTCCCTA 
TTGTCGCTTAATGAATTTACCGTTACGGTATGCTAGTAAATAAATGCATACAATGTACATATGTAGATACATAGGCACAGCAAAACCACCTATAA 
AT T C G AT T GAT T T T AT G AAGT T C T AGT T G AGG AAGT TTGACTGTATGTATGCGCGCATT G GT AAAT AC G AGC AG AC C G CAAT T AGC AAAT ATTT T 
TCAGTCCAGCAGGTATCACTCACCACGTACCGCCCAAAGGTCCCAATGCTGGCGTAGAGAAGCCGAAACACCTGCTGCTGCCAGTGGAAAAGCGG 
GAGGAGCTGCTCCAAACTGCCCAGGTGACATCCAATCATTTAGAGGCGACGCACCTGCCGCCTTGAGGAACTGGTCGCCACGCAGGAGGACACGG 
ATACGGCAACGGTTACGGACTCGTTTCCGCAGTAGCGGGCACCATGAACGCCAGCCTCATCTATGAGATACTTATGTATCTGAACTCGTTCTACT 
TTGGCATGTACGCCGCCTTCGAAGTGGGCGTGGGCGTTCTTAAGGCGATTAATCTAAACTACGGCGAGAACGCGTTGTCCCGGGAGGCCAGCATC 
CTGCTCTCACTGTGCATCATCGAGACCGTGCGCATTGTCTTCGGGCGGAAGAGCAGTCTAAGCGATCGTGGTGAGTAATCCCCATGATCCCCATG 
TTACTGCAAGGGTGGGAAAAGTCCACCGGCAGCGATGATGACATGGCGAAATTATTTGTTTGCCCGGTCACAATGCCCATTTCATGCCATCCCCC 
TACCTCCGGGGCCATAACCCATAAACGTTTGGACGGGCATACATACATATATGTATGATGTAGCCCACTCGGCGTATGCGTAATTATGGTCATCA 
CCCCGGATGAAAACACGCGTTTTATGGTGGCAGGGGAAAGGCGCCCTTCTGGCACATTAAGCAATATGCAAAGGAAATCTATCAAAGCGAATCAT 
CGAAATGAGTTATTGGAAAAGCAATTGGAGCTGTTAATATTAATTATCGCAATGTGTAAACATGAAATTATTAGTATAATACTTTAAAACACTGA 
ATCACTTTCAATCGAATACACTTAATTATAAACAAGTAGACGAACTATACGATTTTCCGTGCATCTGTAGCTAACATTTTCCTTGTTTTTCTCTC 
TTTGCAGGCTGGCAAGCAACCGCATCCGTAATATTAACACTGCCCAGTCTTGCTATTGTTATCTACTTGTGTTGTTTTCAAACCTTTGTTCTCAA 
ACTCGAAATTATTCTGAGTGCCTTAATGATCACCCTACAAGGTGCTGAACTAGTCTACGCCAGCATATTCATTTGTACAATGTGTCGCCCCGTCA 
CATACACCTAGCAGTTGATAAGCCGCGTTGCCGCCGCCACGTCAACGACCAAGATCGAGATCAAGACCAAGCCAGCACTAACACTCCCACCTGGA 
GGAGAAGGAGTAGAAGCATCCACAGCCGAGCGACGACTACGACGAAGACGAAACACTTTCCTCTGGCAATTTCACATCAAACGATTCGCCAAGGC 
CTCAAAACTAAGCCACTTAAATGCACACTAAGCCACCGCACATAAAAACAAAACAAAAAATCTAGATCCTACTAGACCTCCTAGGAATCCGTGCC 
AAGACAACCCGATAACATACCCAATGGCTGGCAGTTGAGGAGGACTTTCAACAAGGTACGTGAACCTTTATAAACTTATATAATCCGCACAGATC 
ATTCGCGCCCGCGGAAATCACATCGCAGCGAAACACGACGCGTCCGATTCTATCAAAAGCAAAGCTGCATTTAATTTGCATTTCTTGGTTGCATT 
CTTTTGCCCGGCCTTTGTGGCGTTGCCTCCCTGCTGGCGCAGTTGCTCCTCGGACCCGCTCGGTTCTGGTATTTTTATACGGCTCTGATGGTCTA 
AAATTATATGCGAACTGCACGGCTCGTGCAACACCGCTGCTGTCCCCTAGGTGCAATGTACACTTCTAATTTGAATGTAATATACACGGCCCACT 
GCGGGCAAGTTAGCCTGACTGTAATGGCCAAGATGAGAGGGCTCTCCGTTTGGTCTTTGGTCTAGTGAACATGCCCTGTACGATTCACAGGAGAG 
AGATTCACGGGACTTTAGCGAGGCATAACGCTTTGAGTTTTTAAAAATAATTAAAATTCCAACAATGGTTTTCATATATTCACCCTAAACCTATC 
CTGAATTTCAAGGTAACCATTTAATGGCCCTGGGCCCTGTAGAGCATAACGCCCAGGGAGTTTTTAGTAGTTTTTTCCCCCCAACTAGCCGCCAG 
TGAAATCATTCTCCCCCACATTTCTAGGCAGGCGATCTGCCGGCCAATGCTGACCCAAACAGCGAGCAGAGAAAACAAA 
{SEQ ID NO; 913) 

Exon: 1001. .1292 

Exon: 1620.. 2065 

Exon: 2573.. 2784 

Start ATG: 1849 

Transcript No. : CT242 60 

TTTGAGGTTCCTTTCCTTTTTATATAAGCCCCTCGTCTCTTGCCAAC7AAACCAACAACAACAACAACCAGCCAGGCCAGCGGGCAACAACAACGC 
ACAACAACGCGTATAAATAAACGTGAACGCATGTACTTAATATTAGCATAGAGTTGTCGCCTGCGTGCATTTCATATACATAACATAACATCCAC 
T T A CT C C CAT G AC AACG GG ACT T T AG AC AGT AAC GGT GG AGC AGCT AG AAG AAG AGG AAAG GC G AAG T AG AAC GGG AAC AAG AAG AAAAAG C AGT 
GGAAGAGTCCAGCAGGTATCACTCACCACGTACCGCCCAAAGGTCCCAATGCTGGCGTAGAGAAGCCGAAACACCTGCTGCTGCCAGTGGAAAAG 
CGGGAGGAGCTGCTCCAAACTGCCCAGGTGACATCCAATCATTTAGAGGCGACGCACCTGCCGCCTTGAGGAACTGGTCGCCACGCAGGAGGACA 
CGGATACGGCAACGGTTACGGACTCGTTTCCGCAGTAGCGGGCACCATGAACGCCAGCCTCATCTATGAGATACTTATGTATCTGAACTCGTTCT 
ACTTTGGCATGTACGCCGCCTTCGAAGTGGGCGTGGGCGTTCTTAAGGCGATTAATCTAAACTACGGCGAGAACGCGTTGTCCCGGGAGGCCAGC 
ATCCTGCTCTCACTGTGCATCATCGAGACCGTGCGCATTGTCTTCGGGCGGAAGAGCAGTCTAAGCGATCGTGGCTGGCAAGCAACCGCATCCGT 
AATATTAACACTGCCCAGTCTTGCTATTGTTATCTACTTGTGTTGTTTTCAAACCTTTGTTCTCAAACTCGAAATTATTCTGAGTGCCTTAATGA 
TCACCCTACAAGGTGCTGAACTAGTCTACGCCAGCATATTCATTTGTACAATGTGTCGCCCCGTCACATACACCTAGCAGTTGATAAGCCGCGTT 
(SEQ ID NO: 914) 

Start ATG: 522 

MNASLIYEILMYLNSFYFGMYAAFEVGVGVLKAINLNYGENALSREASILLSLCIIETVRIVFGRKSSLSDRGWQATASVILTLPSLAIVIYLCC 

FQTFVLKLEIILSALMITLQGAELVYASIFICTMCRPVTYT* 

(SEQ ID NO: 915) 



Celera Sequence Wo. : 142000013384474 

GCACATACAGACAAACAGGCGACATTTGAAGAAATAAAAAGTGAATAATAACCAATGCGTGGCCGTTTAGAGCATTTTACATTTAAATCTACGAT 
TCGGCTTGTTCATTTTGCGGCCCCTCCGCCCCTTGTTGTCCTTGTCCTTGCGGATTTCGCGCACCAGTGTGTAAAATGCATCGTCCACGCCCATG 
CGCGTCTTGGCGGATGTCTCAATGTATGGAATGCCGTACTGTTTGGCCACCTGAGAACAACCATAATTAACGGAATGTTCGCTCGGAACTCTGAA 
GGCCGGTACTCACCTCTCTTGCCTGCTCGTTGTTAACGTTCCACGAGGCCAGATCACATTTGTTGCCCACCAGCACCATGGGCACCTCTTCGGCA 
TCCTTTACGCGCTTGATCTGCTCACGGTAGGTGCCGATATCCTCGAAGGACTTCGCACTGTTGACGGCAAAGACCAGCAGGAATCCCTCGCCAGT 
CCGCATATACTGATCCCGCATGGCCGAGTACTCCTCTTGGCCGGCGGTGTCCAGGATGTCCAGCAGGCAGGTCTCTCCATCTGCAGGGGAATATA 
AAATGCATTTGAATATCGCAACTTCTTAAGGTCTCTTCTAGTATAAACGGGAAACCAAACCAGCGA7AAGGGTGCAGAAATCTCTAGAGTATTTGG 
ACCTGTGCAGCCCTAGCTTTGCCAGATCTAAGTACATATAACTCACCGATAACCACTTGCTTTCGGTAAGAGTCCTCGATTGTGGGGTCGTACTC 
GTCCACGAAATGGTTCTGGATTAGCTGGATGGTGAGCGCGGACTTGCCCACGCCTCCGGCTCCAACGACGACCAGTTTGTATTCCGTCATTTGGC 
TGTGGCGTCCGTTTTCGATTTGCCGTAGTCAAATGTCACAGTAACAGTCTCTTTCTCGCGATCGATGCTTACTCTCTTGTGAAGCGCCCGCAGAT 
GCAGTCTCTCTCTCCGTACGCTTGTTGCGTGTGTTTGTGTGGGTGACTGTGCGTGTATGGGCTGCAAGTGGAGTAAATTATTAATTAAGATTGAT 
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TAAAATAAATTAAACGCATAATTAGGCAGGAATTAATGCACGTCACTGAGTCACTGTGACTAGGCGCTTACTCGCACAAACACACACGCACCACT 

GCATGC ACACAGCCAG ACGGGCAAC ACAAAAAAAAATGGGAAAAAGAGAT C AAT GACTC AAACT AAAAT T AAAACCC ATT CGCTCAATGC AAAT T 

AGTTAGTCTCCCCCATAGCCGAGTTGTTAACTAGGTGAATTTGCTAATTCGGCGGCGTTGTTTTTAGCCCTTTTCTTTATTATTTTTCTTTTGCC 

TGCCGGGGCTGCGGACCTTCTTTGTTGCTGCCTCTTCAGCTAGCTCCTCCTTGTTTTTCCGCTCCGCTCCTCTCTATCAATACTCTGCCTCTCTC 

TTCTCGCGCGCGGCGCAGTTGCTGGGCACGTTTCACGTCACTTTCGCGGCTAACGGTGCGACAGGCGGTACTCCTCGATGGCACCAATTTAATTG 

CCGTGCGGTAATTGCAATTAACAGCGCAGGAAAACAATAAATCCGTATGTGAAATGAAGATGAAATCTAGGCTGTACGTAGCTGTGAAGATACrA 

GAGCTGGCACCAAGCCGATGGCACTATCGATAGCGATGGCTGCATTCTGGCCGGCACCATCGATGGACTTGCAATAGCGATTGCTATATGAAAAC 

TAATCTAAAGAGGTGGATGCACTTCAGTCGACTCTCTATAATTTGCTTAAACTAATAAATGATTTGATCAATACAGCTTTCTGTAAAAACTGGCA 

GACGCTTTCTGCTTTTAATAATTGTTAATTTAAGTTCAACGGGCTGGCATCACCGTTTCTTAGCACGGACTCAAGCCTGAGTCTATTATTTCAAC 

CACCACTGTAACGAAAACAGCATGGACAGATTGAAATTCAATAATTTGGTAAATAAACGATTTTATTTAAAATTATAGAGTTTTAATTAAAAAGA 

ACTTTTACAGGTGATATCCAACMGAAGGTCATTCAAAAGGCACGCGCCCAGACCATCGCCAAAATCGTACAAAAGCTGCGAAAGACAAAAGAGG 

CTCTGGCCAAAAATCCAGACAGTGAAAAGGAAAAGATTCGTTTACGCAAGAACACAGAATGCCTGGCCCAGTTGAAGGCTCT AAAAT AT ATGGAT 

ATGGTACGCCAGTCGCTCCTGCAGGAGGGCACAAATCCCAATGCGGTGATTGCTAACGAGCGCTCCACGCCGGATGAACTGGGCATTGCCATGCT 

GCGGCTAAACAAATTGATGCACGGACTGGTGGACAAGTTTGTGGAAACCTTGAAGCTGAGCACCACCGATAAGGAGGCCAAGTGGCGTGAGGAGA 

TCCTAGAGACGAGCAAGCGAAGGGCCAAAATAGAACGCACTGAGGAGCGGAAGAGGAAGCGCAAGGAGCTGAAGGAGCAGAAGGCCCAAACCAAA 

AAC C GAC T G G AGT GGT TAG AG C AG AAC AAAGT GGT GG AT GC AG AT GT T AAT GG AGC AACTGCGG AAAC G C C CAC TC TT C AAGT CAGC AAAGT AAA 

TGATCAGGAAATACCACAGTCTTTTGCAAAAACTGAAAATTCTCCAGTTCTCAAAGTCAAAAAGGAAAAGACACCTAAAACACCAGCCAAAAAAG 

AAAGAAACCTTCAGAAAAAACAAGCTTTTGATGAGCAGATCAACCCGGAGATAACTAAGCTCAGTTCCAAAAAGCAGAATTTAAATACAATCAAG 

GAGAATAATGCAGAGCCACAACTAGAAGGCAGACCGAAAGAATCTAAGCCAAAACCCTTCGAGAAAAAACCAGCCAGAGAGCAGAAGCCTAAGCC 

ACGATTAGAAAGGCAGCCAAGTATCGATGAAGAAGAACCACAACTTGACAATAATCGTCCCACACATGTCGTTGATCCCTTCTTCATTACGGAAT 

CAGGCCAGCCCTACTTGTCCAGCGCCGTGGTTCTTTCCGGGGACAATGACAGCGAAGCTGATGAGGAACAGGCACCGCCGCCAGTCAAAAGGTTC 

CGCAATGAGGATCGACGGTCGAACACATATCGGGAGAAACCGGAAAGACAGCCAGAGTTCAAAGCGAGAAAGCCAACTGATGATCGTCATCCCTC 

CT GGCT GG C T AAGC AACAGC AG AAAC C AAT CAT T GGT GAT T T T AG AGGC AAG AAGAT T AC AT T CGGCGAC G AC GGC C AGG C GGCGG AAAT TAT T G 

CTCCAAGTATCAATCAATTAACCGCTACTGCTGCTCCTTTGCCCACCGATGGCATGCACCCGTCTTGGGTGGCCAAACAGAAGTTTAAGCCGAAA 

ATTGCCGCGTTTCAGGGCACAAAGATTAAGTTTGACGAAGATTAAAGATGATGTCAATGGTATTTTGTTGTTTTAATAACTAAATAACTGCAGCT 

TTGCTGCTCACACGCCAACTTTGGGCACATAGTGACCACGAATCTTGTCCTCGTTGCGCAGCTTGTCCACGTTGACATCGGGGAACTGGCGACGC 

AGCACATCCAGATCGAAATTGGGGTTGCGACGCAGGATCATCATGACTTCGTTGCGGGAGCGGCTGCAAAATAACAATTTTTATTGAGATATCCC 

ATTTAACTAAGGGCAACATACTTCTTGGCCTTGCGCTTGGCATTGTAGAGTTTCTCCCTGTACTTTAAGAGGAATCTGTTCACAGCTCTGCCGCC 

GTAGCCAATGTTGCGGGATTTAATCCATTTGAGGTTCATGAAGGGCGGCAGGGCGTGTTCGCATGCTTTGTATGAGACCTGCAGGCCAAATGCGT 

TCTTCACCGCCTTCTGGGGCCGTTCACCCGTTTCACCTGTTTCCAGCAGGTGATACGCCTTGTTCCGCTCCCTCACTACTGTCTCCAGGTTTTCC 

ATGGAGATTTTCACCTGCAATTCTAAAGTTTTAGAAAAGGATCTCCACCATCAAATATCGCCCACCTTATCGATGCGCTCCGGACTAGGGAAGAC 

TTCCATTTTGTCATTGCACTCGTGCTCCATGGTCAGCAGCATGTTGCGCTCCTTGAGGAGCACGAACCACAGCTGGTGCAGCTCCTTGTTGGACT 

TGATACGCAGCTCCTCCGTCCGCCAGGCGCGGCCCACCTTCACCTCATTCTCGCTCCAGTTCTTCTTGTCGTCGAAGAACTCCATAAGATCCCGG 

CGAACAGCGGAAGTGTGCATTTGCATCGGCATTTGAACATGCGGCGTTTGCAGGGCAGCCGCACTACATATTACATAAAAACAAAGGCGCGTTTA 

TATCGGTTACTCTGTGTAAAAGTCGGATTTGTGAAAGAGAACGGACCTGCACGACTGCCATGGCTGCTTGGGAGCTGCCAAAAAGCTCCTGACTG 

TCGCATTTCCCACAGTT 

{SEQ ID NO: 916) 

Exon: 1001.. 1722 

Exon: 1787.. 1948 

Exon: 2006. .3387 

Start ATG: 1922 

Transcript No. : CT24354 

GCGTGTATGGGCTGCAAGTGGAGTAAATTATTAATTAAGATTGATTAAAATAAATTAAACGCATAATTAGGCAGGAATTAATGCACGTCACTGAG 
TCACTGTGACTAGGCGCTTACTCGCACAAACACACACGCACCACTGCATGCACACAGCCAGACGGGCAACACAAAAAAAAATGGGAAAAAGAGAT 
CAATGACTCAAACTAAAATTAAAACCCATTCGCTCAATGCAAATTAGTTAGTCTCCCCCATAGCCGAGTTGTTAACTAGGTGAATTTGCTAATTC 
GGCGGCGTTGTTTTTAGCCCTTTTCTTTATTATTTTTCTTTTGCCTGCCGGGGCTGCGGACCTTCTTTGTTGCTGCCTCTTCAGCTAGCTCCTCC 
TTGTTTTTCCGCTCCGCTCCTCTCTATCAATACTCTGCCTCTCTCTTCTCGCGCGCGGCGCAGTTGCTGGGCACGTTTCACGTCACTTTCGCGGC 
TAACGGTGCGACAGGCGGTACTCCTCGATGGCACCAATTTAATTGCCGTGCGGTAATTGCAATTAACAGCGCAGGAAAACAATAAATCCGTATGT 
GAAATGAAGATGAAATCTAGGCTGTACGTAGCTGTGAAGATACTAGAGCTGGCACCAAGCCGATGGCACTATCGATAGCGATGGCTGCATTCTGG 
CCGGCACCATCGATGGACTTGCAATAGCGATTGCTATATGAAAACTAATCTAAAGAGCTTTCTGTAAAAACTGGCAGACGCTTTCTGCTTTTAAT 
AATTGTTAATTTAAGTTCAACGGGCTGGCATCACCGTTTCTTAGCACGGACTCAAGCCTGAGTCTATTATTTCAACCACCACTGTAACGAAAACA 
GCATGGACAGATTGAAATTCAATAATTTGGTGATATCCAACAAGAAGGTCATTCAAAAGGCACGCGCCCAGACCATCGCCAAAATCGTACAAAAG 
CTGCGAAAGACAAAAGAGGCTCTGGCCAAAAATCCAGACAGTGAAAAGGAAAAGATTCGTTTACGCAAGAACACAGAATGCCTGGCCCAGTTGAA 
GGCTCTAAAATATATGGATATGGTACGCCAGTCGCTCCTGCAGGAGGGCACAAATCCCAATGCGGTGATTGCTAACGAGCGCTCCACGCCGGATG 
AACTGGGCATTGCCATGCTGCGGCTAAACAAATTGATGCACGGACTGGTGGACAAGTTTGTGGAAACCTTGAAGCTGAGCACCACCGATAAGGAG 
GCCAAGTGGCGTGAGGAGATCCTAGAGACGAGCAAGCGAAGGGCCAAAATAGAACGCACTGAGGAGCGGAAGAGGAAGCGCAAGGAGCTGAAGGA 
G C A GAAG GCC C AAAC C AAAAAC C G AC T GG AGT GGTT AG AGC AG AAC AAAGT GGT GG AT GC AG AT GT T AAT GG AG C AAC T GC GG AAAC G C C CAC T C 
TTCAAGTC AGC AAAGT AAAT GATCAGGAAATACCACAGTCTTTTGCAAAAACTGAAAATTCTCCAGTTCTCAAAGTCAAAAAGGAAAAGACACCT 
AAAACACCAGCCAAAAAAGAAAGAAACCTTCAGAAAAAACAAGCTTTTGATGAGCAGATCAACCCGGAGATAACTAAGCTCAGTTCCAAAAAGCA 
GAATTTAAATACAATCAAGGAGAATAATGCAGAGCCACAACTAGAAGGCAGACCGAAAGAATCTAAGCCAAAACCCTTCGAGAAAAAACCAGCCA 
GAGAGCAGAAGCCTAAGCCACGATTAGAAAGGCAGCCAAGTATCGATGAAGAAGAACCACAACTTGACAATAATCGTCCCACACATGTCGTTGAT 
CCCTTCTTCATTACGGAATCAGGCCAGCCCTACTTGTCCAGCGCCGTGGTTCTTTCCGGGGACAATGACAGCGAAGCTGATGAGGAACAGGCACC 
GCC GC C AGT C AAA AGGT T C C GC AAT G AGG AT C G ACG GT C GAAC AC AT AT C G GG AG AAACC GG AAAG AC AGCC AG AGTTC AAAGC GAGA AAGC CAA 
CTGATGATCGTCATCCCTCCTGGCTGGCTAAGCAACAGCAGAAACCAATCATTGGTGATTTTAGAGGCAAGAAGATTACATTCGGCGACGACGGC 
CAGGCGGCGGAAATTATTGCTCCAAGTATCAATCAATTAACCGCTACTGCTGCTCCTTTGCCCACCGATGGCATGCACCCGTCTTGGGTGGCCAA 
ACAGAAGTTTAAGCCGAAAATTGCCGCGTTTCAGGGCACAAAGATTAAGTTTGACGAAGATTAAAGATGATGTCAATGGTA 
(SEQ ID NO: 917) 

Start ATG: 858 
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MDRLKFNNLVISNKKVIQKARAQTIAKIVQKLRKTKEALAKNPDSEKEKIRLRKNTECLAQLKALKYMDMVRQSLLQEGTNPNAVIANERSTPDE 

LGIAMLRLNKLMHGLVDKFVETLKLSTTDKEAKWREEILETSKRRAKIERTEERKRKRKELKEQKAQTKNRLEWLEQNKVVDADVNGATAETPTL 

QVSKVNDQEIPQSFAKTENSPVLKVKKEKTPKTPAKKERNLQKKQAFDEQINPEITKLSSKKQNLNTIKENKAEPQLEGRPKESKPKPFEKKPAR 

EQKPKPRLERQPSIDEEEPQLDNWRPTHVVDPFFITESGQPYLSSAVVLSGDNDSEADEEQAPPPVKRFRNEDRRSNTYREKPERQPEFKARKPT 

DDRHPSWLAKQQQKPIIGDFRGKKITFGDDGQAAEIIAPSINQLTATAAPLPTDGMHPSWVAKQKFKPKIAAFQGTKIKFDED* 

(SEQ ID NO: 918) 

Classification: known__f lybase_gene 
Gene Symbol: Rlbl 
FlyBase ID: FBgn0014022 



Celera Sequence No. : 142000013384474 

TTTATTGAGATATCCCATTTAACTAAGGGCAACATACTTCTTGGCCTTGCGCTTGGCATTGTAGAGTTTCTCCCTGTACTTTAAGAGGAATCTGT 
TCACAGCTCTGCCGCCGTAGCCAATGTTGCGGGATTTAATCCATTTGAGGTTCATGAAGGGCGGCAGGGCGTGTTCGCATGCTTTGTATGAGACC 
TGCAGGCCAAATGCGTTCTTCACCGCCTTCTGGGGCCGTTCACCCGTTTCACCTGTTTCCAGCAGGTGATACGCCTTGTTCCGCTCCCTCACTAC 
TGTCTCCAGGTTTTCCATGGAGATTTTCACCTGCAATTCTAAAGTTTTAGAAAAGGATCTCCACCATCAAATATCGCCCACCTTATCGATGCGCT 
CCGGACTAGGGAAGACTTCCATTTTGTCATTGCACTCGTGCTCCATGGTCAGCAGCATGTTGCGCTCCTTGAGGAGCACGAACCACAGCTGGTGC 
AGCTCCTTGTTGGACTTGATACGCAGCTCCTCCGTCCGCCAGGCGCGGCCCACCTTCACCTCATTCTCGCTCCAGTTCTTCTTGTCGTCGAAGAA 
CTCCATAAGATCCCGGCGAACAGCGGAAGTGTGCATTTGCATCGGCATTTGAACATGCGGCGTTTGCAGGGCAGCCGCACTACATATTACATAAA 
AACAAAGGCGCGTTTATATCGGTTACTCTGTGTAAAAGTCGGATTTGTGAAAGAGAACGGACCTGCACGACTGCCATGGCTGCTTGGGAGCTGCC 
AAAAAGCTCCTGACTGTCGCATTTCCCACAGTTTTTGCCAAATTTAAACATTTTATTAACGCCGAAGACATTTTTTCGACCGAACAAAACAGCTG 
AAGCAATAACAATCGATACTTTTTTGTATCGATTTATTGGATTGCGTGCGTACTACATTTTTAGCCGGGCTTTACTCCATTTCAGGGCTGCCATG 
CATTTCAACTGAAATTAGTCGATAGTATAATTTCAGCATCGACTTGTTGGCCAGCTCTAGTTGCGCTTCGCTCAGTGCTAATGAAAAATCGAATT 
TTTCGCTT C AAAT C GAC T G AAGT TT ACAC GGGCT AAAGT AT T AC AG AAGT TT GGC TC AAGC CAT T AC A C AAC AT GC AAT AAGTAAA T GT AAG T G A 
AGTGGCCACTCCCCGAAAAAAACAACGCGCTGCACTGTCATCTTCTCTGTTCCATCCAATCCAATTTGTATGCCAAGGAACGCCAACTTAATGTT 
TGGCTGTGTGCATGCATGTGTGATTGTGTGCTGGTGTTTGTGGGTTTCAGTACACAAATTTAAATTGATAAAACTCGACAAAGGCAGCGGTTCTG 
AAT T GC AGC G AGT GC AGGG AAT GC AC T GCC AAGT G C C T GT T AAT T GT G C ATT T AT C AC AGC G AAT TC AAGC AAT TT GC AC CT C G AAAT GGGC AT T 
CAAAAAC AAATTC ATTGAAGAGT AACGCCAGGAT GAAAAAAAAAAAAACAAAT GT AGAACGGTAT AGTAACAGGAC AGTGTTGCCGAGC TGC TCC 
AAGTTCCCCGGTACTATTGCCGTGCAGCGCAGTGTAGCGTACATAAATATTTATCTCACTGCCTTATCAGGCAGCATTATTGTTGAAGCGACTGT 
GTATCTGTGGGCATATTAATTGTTTGTTTGTCTGCTCTTAGCAAGTGTTATATAAAGTGCAGAACATCATTGAGTTAAAAGGTCAGCGAGTGAAT 
TGATTGATCAATTTGGCAAGGGGAAGGCGCAATGTGGGGGCTACGGAGTAACTTGTAGGGTTTTCTTCGCTCGGCTTTCGTGCTTTTATCATTTC 
CCCTTTCCCACTTTTCGAACAACTTTTCTGCTCACGGTGAACGCCCAAATAAAAAAAAACACAAACAAGTCTTCTTGTTTGTACTGTCGCCATGC 
GCGCGTGTGTGTGAGTGCTTCGGCTTTTTGTGACCAGGGGTTAATTTAACGGTGTCGCCTTAATTATCCGGGTTCTGCATATTCCTCCGGTGTTT 
GCTTTCTCCGGACTTGTGCTGCCACTTGTTGCCCCTGTTAACGGCGCTGTTAGCTGCAATCCGAATCCCCCGGTTTCGTCGCCATATCGTGCGAT 
AACCTTTCATTTGCCAATCGGTTGATTATCATGTTCTTAATGAAAGTAGCAACAAATAGCTGAATTTTCGCTTGCCCAAACTTACTCAGCGATGT 
TTTAGAGGGGACCCTCATCACTACTCCTACAGTTTTTAGTTTGCGACGCCCATTGGTGATAAATACTTGTGTACATACATACAACATGTGATTAT 
TTTTATGGAATGTGTTTATCTGGAACCCTCTCTATATGCGTCAACACATAATTGCACATGTTTAAATTTCCCGAAAATAAAATGTATATACAATG 
CCTGCAGCTATAGCTGCTGTCATTGACTTTACTTTCATTTTAGCAGTGTAAGTTTTAAATTTTGTTTCCTCCCATCCAAGATATGAAACTGTAAC 
TATGAAAACTCACTCCTGAGCTGCTCCGAAAATATCATGAATACGACAATTATGTGCACATCATTTAGTATACGTGATATTTTAGGAGCAGCTAA 
ATTATATTTAAAGTTCCGTAATTTATTGCTTTCTCGTGATATACTAGGCAAATACATTCACATACACTGATAAAGAAACTCAGTTATCGTTTATT 
TACAAACACAACCAGTGTTGACCTCAAGTTAAAGTTAAAGAATCGGTAAATGGCTTGCCAATGCTCGATTGCTTCATCAAACTCACCATTGCTTA 
TAACAGTGCCAATTTTAATGCTGAAATCTTATCTACGAATATTAACTACAACCATTTCTCAGTTCTTTAGTCTAATGTAAACCAACTTAATGTAA 
GCTACTTCTGTGTGTCTGGTAATGTGGAGGCAGTGAGTAGTTGTGCTGTATTCTCTTAAAGCTCTGCTCTATTCATTAAGATGTGCGTGTGCATT 
GAGGAAGAAAGCCAACGAAAGTTGTTCGCCACATTACCGGCAATGTTGCAACATCGGGCGCGTTGTAATCGTGTGCAAGATGCGCGAAATCCAGT 
CAAACGCGAACAGAAGAACGCGCCTCGAGGCGCAATGTTATTAAATAACAATGCAGAAAGAGCAGGGAGCACGGAGCAGCAGCCACCTCCTCATG 
GGAAACATTACCAGGGTGCCAGTTGAAGAGTCTACAATGAACCACCTCAAACCACGATCCTTCTTTTTTCCTCTTTTCTCGTTTCCAAAGACAGA 
GGAGCTAAAATGTCGCAAAAAGAATTGGCGGCTCTGGTTGCCACGAGAACGAGTCGCAAACGCACCATATCCTCGACCTCATTCTCTATTTCCAC 
TCGCACTGATCCAAAATCCATCATAGAGATACACATATTCAATGACTCCTTGCCCATAAAAAGAGCTTGCTGCGAAGTGGAGAATCCAACTTTAA 
AAGCGGAACAGCGAGTGCAGCAGGAAGAAGAGAGCCTGGGCCAAGTGCCGCCGCTGACCGAGGAAGAGCAGCAGCGGCACGATGAGTTCCGAAAT 
AGTGGTTCGATATTCCTGCAGGATTTGCCCTGCTTTAAGCTGCAGCAGGCACAGCCGATAGCAGGATCATCACCCATTCCCAAGTGTCGGGAGTG 
CAGAAGAAGAAATTTGGCTACAACAGAGGGCGAACCCAGTTCAGTTAGCGACGTTTACTGCCGGTTCTACGAGTTCCGACGCCTTCAGTTCAATG 
AGAATGGAGAGCTGTGCGTGGTTGGCTTCCCAAATCCCTACAGCGAGCCCTCACCAGAGGACATCGCCATTTGGCAGCCGGATAAAAATACACCG 
CCGACCAGCGGTTACATGGATATACAGGTTTGCAGGTACATCCTACTCCATGCCGGCGACCAGTTCTGCTATATTTGGCACCAGGAAGCAGAGGC 
TTTAAGTTTGCATCAGAATACGGACGGAACCATTGCCTGGAAGAAGGCCGTAAAAGGAACACGAGAGATTTGCGACGTGTGTGACACCACATTGT 
TCAACTACCATTGGACTTGCCGCAAGTGCGGCTTTGGAGTGTGTCTGGACTGCGTTAAAGATCGCAAGGAGGGATTGCGACTGCGGCGAGCGGAG 
AACGCTGCCCAAAAGGGATGCGATGAGTACCACTGGCTGCTCTGCAGCGACCCCAGTGGCCCACAGGAGCACGTCCTTACCGAGCTGATGCTAAC 
ACAGATCATAGCCGGAGATGCCCTCAACGTCTTGGGCAGGCTGCTGCACGAGGTTCGTACTTTGTGGCAGGTGCCACAGGTCTGCGGCTGCCTTC 
TGAGCAAGCAAGCGATTGAAGATGCACAATCAAAGGAAGTGATCCAGGATATGATTAAGGAGTCTCAGCTGAAGCAGCACACCAGCTACTCGTCG 
CTGGCCTCCGAGCAGAAGGTGCACCAGCAACAGCGCCTCGATCAACTGCACGCCACGAAGTTGGAGTTTGCCAGAGAGCTAGGTGTAGATTATGT 
TCCAGGTCGAGTGTGGACCAAGGAAACCTTGGGAAAGGATCCAATAACAACGGCGTTTGATAATTTGAAGCACATTAATTTTCTGAGAAAGGGAT 
TGGCCGGCTTGAGAAGGTTTCTTCCACCAAGGACCATGACTTTTGCTTACTCCACTCAGTTGGCTCCAGGAGTTCCTCACGAGTTTCTGTGCGAT 
GGCAGACTGCTTAGACTCACCGATGCCATGCATCCGGATAATCGAGTTCTTTTCCAGGAGGTTTGGAAATGCGGCCAGCCAGTTATGATTTCGGA 
GGTGGCTCGCTCTTTGAACTTAGACTTATGGCATCCACAAGCCTTTTGCCGGGACTTTGGCGACAAGCCCAATGATCTTATTAATTGCCTGAATG 
GCAATCTGGTGCCCAATCAGCCAATGCGACACTTTTGGGAGGGTTTTCAGTGCATGACCAAGCGTCTGCCAGATGCATATGGCAAGCCAATGCTC 
TTAAAGCTTAAGGATTGGCCGCCGGGTGATGATTTTGCCGAAATACTGCCCACCAGGTTTGCTGATTTGATGAAGGGACTGCCCATGCCGGAGTA 
CACACTTCGCACGGGAAACCTTAACATAGCTAGCTGCTTACCCAAGATGTTTGTGCCACCAGACTTAGGACCCAAAATGTACAACGCATATGGCT 
CAGCATTGCACCCCGACAAGGGTACAACCAATCTTCACTTGGACATTTCGGATGCAGTGAACATAATGGTTTATGTGGGAATACCCCAGGATGGG 
GACACGAGACCTCAGATGGCGGCCACGCAGAAAGCCATAGAAATTGGCGGATGCGATTACATCACTAGAGCTCGTTGCCAATCGCCAGATGTATT 
ACCCGGAGCACTTTGGCACATCTTTCCAGCTCGCGATGCAGACAAAATAAGAGATTTACTTAACCGCGTGACCCTGGAGAAGGGTTTCCGTCTGG 
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AGCCGGATCACGATCCAATTCACGATCAGAATTGGTACTTGGACGACAAGTTGCGAGCTCGTTTATTCAAAGAGTATGGCGTGGAGGGGCATCCG 
ATTGTCCAGTGCTTGGGCGACGCAGTCTTTATACCAGCTGGCGCCCCACATCAGGTTCAAAATCTCCACAACTGTATTAAGGTGGCCGAGGATTT 
TGTATCGCCGGAAAATATCACGCACTGCTATCATCTAACTCACGAGTTTAGAAGGCTTTCGCACTCGCACACAAATCACGAGGACAAGCTTCAGA 
TCAAGAATATTATCTACCATGCCATCAAGGACTGTTGCACCATCCTGACCAGAGCTGTGGATGAGAGACTTAATGCGGAATTAACCAAGCTAAAT 
GCAGATTAGAATTTTCTTACCTTTTAGTTTTTTTCAACTATTTAACATTTATTATTATTTTTTAAGATCAAAAATTGTCAAATATTCAATAAATA 
CTTACTTAGAACTAAATAAAAGGTACATTTATTTATTTGTCATCCAGTGCATAATTAAGTAAATTTTCTATTTTAAAAACCTAAATAAATATCAA 
TTCAAATTAATTGATTTAATTGAAATTTCTAGTATTTACATAGACAACACTAAGCCTAAGCAAGATACTCTCCAGCAACAAATGCCCATGCATGC 
TGAGGACGGAGCTGAATCTGATGATGAGGAGTCCAGCCCAGGCAGACATTAGCATCGTAAGGATCATACTACTAATTTGGCGCCCATATAGAAGT 
CAAGTCAATTACAACGCCAGAAAGCCTTTTATGTTAATCGAGAGTGGAAAGGGAAAATAATTTTTTTGCTAAACTAGCCTAGAGATCGAAATATT 
AAG AAT AC AAC T AAAAT G AC GG T GG AT T T C AAC G AT T AT T T T T GGGT G AGT T AAAT AAGGT GC AT AT GGAC GG AT CAT T GC T T AAAT AT ATT CT T 
CCATTTAATATCCAGGGGGAGAAGAACAATGGCTACGATGTTCTGTACCACAACATGAAGTTCGGCCTGATTGCCAGCAAGGAGTTATCCGAGTT 
TCTGCGCGAGAAGTCTAACATTGAGGAGCAGAACTCAAAGATGATGTCCAAGCTGGCCCACAAGGCTGGTACACTAAATAGCACATTCGCTCCTG 
TTTGGACCATACTGCGAACATCGGCAGAGAAGCTCTCCACGCTCCACCTGCAGATGGTGCAAAAGCTGACTGAGCTGGTCAAGGATGTTGCCAAG 
TATGCCGACGAGCTGCACAAAAAGCACAAATCCGTCAAGGAGGAGGAATCGCAGACTCTTGAGTGTGTCCAAGCCATCCAGACGTCGACGGTGGC 
AGTGCAAAAGTTGCGCGATCTGTACGCCAGCAAGGTCCAGGAGCTGGAGAAGCTGCGCA 
(SEQ ID NO: 919) 

Exon: 1001.. 1132 
Exon: 3226.. 5804 
Start ATG: 3240 

Transcript No. : CT2437 6 

CCAGCTCTAGTTGCGCTTCGCTCAGTGCTAATGAAAAATCGAATTTTTCGCTTCAAATCGACTGAAGTTTACACGGGCTAAAGTATTACAGAAGT 
TTGGCTCAAGCCATTACACAACATGCAATAAGTAAATACAGAGGAGCTAAAATGTCGCAAAAAGAATTGGCGGCTCTGGTTGCCACGAGAACGAG 
TCGCAAACGCACCATATCCTCGACCTCATTCTCTATTTCCACTCGCACTGATCCAAAATCCATCATAGAGATACACATATTCAATGACTCCTTGC 
C C AT AAAAAG AGCT T GC T GC GAAGT GGAG AAT C C AACTT T AAAAGC GG AAC AGC GAGT GCAGC AG GAAGAAG AGAGC C TGGGC C AAGT G C CG CCG 
CTGACCGAGGAAGAGCAGCAGCGGCACGATGAGTTCCGAAATAGTGGTTCGATATTCCTGCAGGATTTGCCCTGCTTTAAGCTGCAGCAGGCACA 
GCCGATAGCAGGATCATCACCCATTCCCAAGTGTCGGGAGTGCAGAAGAAGAAATTTGGCTACAACAGAGGGCGAACCCAGTTCAGTTAGCGACG 
TTTACTGCCGGTTCTACGAGTTCCGACGCCTTCAGTTCAATGAGAATGGAGAGCTGTGCGTGGTTGGCTTCCCAAATCCCTACAGCGAGCCCTCA 
CCAGAGGACATCGCCATTTGGCAGCCGGATAAAAATACACCGCCGACCAGCGGTTACATGGATATACAGGTTTGCAGGTACATCCTACTCCATGC 
CGGCGACCAGTTCTGCTATATTTGGCACCAGGAAGCAGAGGCTTTAAGTTTGCATCAGAATACGGACGGAACCATTGCCTGGAAGAAGGCCGTAA 
AAGGAACACGAGAGATTTGCGACGTGTGTGACACCACATTGTTCAACTACCATTGGACTTGCCGCAAGTGCGGCTTTGGAGTGTGTCTGGACTGC 
GTTAAAGATCGCAAGGAGGGATTGCGACTGCGGCGAGCGGAGAACGCTGCCCAAAAGGGATGCGATGAGTACCACTGGCTGCTCTGCAGCGACCC 
CAGTGGCCCACAGGAGCACGTCCTTACCGAGCTGATGCTAACACAGATCATAGCCGGAGATGCCCTCAACGTCTTGGGCAGGCTGCTGCACGAGG 
TTCGTACTTTGTGGCAGGTGCCACAGGTCTGCGGCTGCCTTCTGAGCAAGCAAGCGATTGAAGATGCACAATCAAAGGAAGTGATCCAGGATATG 
ATTAAGGAGTCTCAGCTGAAGCAGCACACCAGCTACTCGTCGCTGGCCTCCGAGCAGAAGGTGCACCAGCAACAGCGCCTCGATCAACTGCACGC 
CACGZ\AGTTGGAGTTTGCCAGAGAGCTAGGTGTAGATTATGTTCCAGGTCGAGTGTGGACCAAGGAAACCTTGGGAAAGGATCCAATAACAACGG 
CGTTTGATAATTTGAAGCACATTAATTTTCTGAGAAAGGGATTGGCCGGCTTGAGAAGGTTTCTTCCACCAAGGACCATGACTTTTGCTTACTCC 
ACTCAGTTGGCTCCAGGAGTTCCTCACGAGTTTCTGTGCGATGGCAGACTGCTTAGACTCACCGATGCCATGCATCCGGATAATCGAGTTCTTTT 
CCAGGAGGTTTGGAAATGCGGCCAGCCAGTTATGATTTCGGAGGTGGCTCGCTCTTTGAACTTAGACTTATGGCATCCACAAGCCTTTTGCCGGG 
ACTTTGGCGACAAGCCCAATGATCTTATTAATTGCCTGAATGGCAATCTGGTGCCCAATCAGCCAATGCGACACTTTTGGGAGGGTTTTCAGTGC 
ATGACCAAGCGTCTGCCAGATGCATATGGCAAGCCAATGCTCTTAAAGCTTAAGGATTGGCCGCCGGGTGATGATTTTGCCGAAATACTGCCCAC 
CAGGTTTGCTGATTTGATGAAGGGACTGCCCATGCCGGAGTACACACTTCGCACGGGAAACCTTAACATAGCTAGCTGCTTACCCAAGATGTTTG 
TGCCACCAGACTTAGGACCCAAAATGTACAACGCATATGGCTCAGCATTGCACCCCGACAAGGGTACAACCAATCTTCACTTGGACATTTCGGAT 
GCAGTGAACATAATGGTTTATGTGGGAATACCCCAGGATGGGGACACGAGACCTCAGATGGCGGCCACGCAGAAAGCCATAGAAATTGGCGGATG 
CGATTACATCACTAGAGCTCGTTGCCAATCGCCAGATGTATTACCCGGAGCACTTTGGCACATCTTTCCAGCTCGCGATGCAGACAAAATAAGAG 
ATTTACTTAACCGCGTGACCCTGGAGAAGGGTTTCCGTCTGGAGCCGGATCACGATCCAATTCACGATCAGAATTGGTACTTGGACGACAAGTTG 
CGAGCTCGTTTATTCAAAGAGTATGGCGTGGAGGGGCATCCGATTGTCCAGTGCTTGGGCGACGCAGTCTTTATACCAGCTGGCGCCCCACATCA 
GGTTCAAAATCTCCACAACTGTATTAAGGTGGCCGAGGATTTTGTATCGCCGGAAAATATCACGCACTGCTATCATCTAACTCACGAGTTTAGAA 
GGCTTTCGCACTCGCACACAAATCACGAGGACAAGCTTCAGATCAAGAATATTATCTACCATGCCATCAAGGACTGTTGCACCATCCTGACCAGA 
GCTGTGGATGAGAGACTTAATGCGGAATTAACCAAGCTAAATGCAGATTAG 
(SEQ ID NO: 920) 

Start ATG: 147 

MSQKELAALVATRTSRKRTISSTSFSISTRTDPKSIIEIHIFNDSLPIKRACCEVENPTLKAEQRVQQEEESLGQVPPLTEEEQQRHDEFRNSGS 
IFLQDLPCFKLQQAQPIAGSSPIPKCRECRRRNLATTEGEPSSVSDVYCRFYEFRRLQFNENGELCVVGFPNPYSEPSPEDIAIWQPDKNTPPTS 
GYMDIQVCRYILLHAGDQFCYIWHQEAEALSLHQNTDGTIAWKKAVKGTREICDVCDTTLFNYHWTCRKCGFGVCLDCVKDRKEGLRLRRAENAA 
QKGCDEYHWLLCSDPSGPQEHVLTELMLTQIIAGDALNVLGRLLHEVRTLWQVPQVCGCLLSKQAIEDAQSKEVIQDMIKESQLKQHTSYSSLAS 
EQKVHQQQRLDQLHATKLEFARELGVDYVPGRVWTKETLGKDPITTAFDNLKHINFLRKGLAGLRRFLPPRTMTFAYSTQLAPGVPHEFLCDGRL 
LRLTDAMHPDNRVLFQEVWKCGQPVMISEVARSLNLDLWHPQAFCRDFGDKPNDLINCLNGNLVPNQPMRHFWEGFQCMTKRLPDAYGKPMLLKL 
KDWPPGDDFAEILPTRFADLMKGLPMPEYTLRTGNLNIASCLPKMFVPPDLGPKMYNAYGSALHPDKGTTNLHLDISDAVNIMVYVGIPQDGDTR 
PQMAATQKAIEIGGCDYITRARCQSPDVLPGALWHIFPARDADKIRDLLNRVTLEKGFRLEPDHDPIHDQNWYLDDKLRARLFKEYGVEGHPIVQ 
CLGDAVFIPAGAPHQVQNLHKCIKVAEDFVSPENITHCYHLTHEFRRLSHSHTNHEDKLQIKNIIYHAIKDCCTILTRAVDERLNAELTKLNAD* 
(SEQ ID NO: 921) 

Name: TESTIS SPECIFIC PROTEIN A LIKE 
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TGCGGGGCTGTAGCTCCAGCTTATGACCCTCCTGCGTAATGTATTCCTGTAGAATTTTCGAGTCGGTGGTCTGCGGGTAGCCAAAGTCGAGCAAT 
TCATCCAGCAGCTCGTAGATGATCACGAAATTGTCCCGAATGGACTCCTCCTCCAGCTCCTTGAAGTATTCCACAAAGACCTGCGCGATCTTGTG 
CAGAAAGACGAAGACCAGAGCGATATTAACGTTCTTATTGCGCGGCGTCGTGGATACGATATAAAGATTGTTCGTTTTGATGTAGGCAAATGTAG 
TCTCCGCCGTCTGAAGGATCGGAGTGATTAGGCCCTCCTCCTCGCGCTCCATCAGCAATGGCATGAATTTGTCGATCACGGCCATGTCGATGTTG 
TCACCGCGGTAGTTCCGCGATATCAGCACCTTGCCCTTCACGTCCAGAACGAAAATGGCCGACGAAGACATGTCTCTTGGTTCCCAGCAGCTGCC 
ACTTGTCCCAGGTGTAAAATTCCCAATTGGATCGAGTGGGAGGTCGCTTCGATGGTTCTACTGCTCGCGGCAGGCGTTGCGCGTGGAAGTTGGCA 
ACTGTCGCGAGGTGGCGATGAGTGGGCTAGGTCTATCAATGGGCTGGATTTACATATTCCGTTTCGTAGACTGGAAAATGCGAATGAAAACTCCG 
GAATTTCTTACGTCGCCTTTTCTGCTGCTTCTTTTTCTTATTAGTTGGCCCACCAGTGCTGTGAAATGTTCACATCAGGTGTTGGGCAGTGTGGC 
CAGGTGAGATTGGTCTACTGCTGATTAACTAGCGAATTAAGCTTAAAGGAAATGAAAAATGTGCAAGGTGTAACATCTTAAGGTTTCTGTTTTTA 
AATAAATACCTTTTAAATTATTTTCATTTTCCCATTGGTACCTTTAAATTCAAATCATAAATATATTGGAGAATGTAATCTAGCATCACTATTGT 
GCCATCCGGCATTTGTGGCCATCCGCAGTACGGCCACACTGCTAGCCCAGACGCTGCTTTTGTGGAATTGTTAATTATTTATTTCGGCGAGCATG 
CAAATGAACCCGAGCGTCACAATCGAGCGCAAACGCGTCGACTGCAGTGTCCTGCCAAAGGGATGGCAGCGCGACGAGGTCCGAAAGTCCGGCAG 
CAGCGCCAATAACAACGCCAGCAGCAACAACAACAGCAGTGCCACAGCCAGCAGCAACAACAACAATAACAAGGTGGATGTTTTCTACTACAGGT 
AAAGAAGAAGAGCAGAATTACTATATAGACTACTATATACCTAAAATCGGAACACTGCACGCTGTTTAATCGGATAAACGTTTATAATCACTTTC 

ACTACCTTTCT GTT AT T AC AT CC G C AT AAC C AC AC AC AC AC AC AC CAT AT AC AC C AC AC AC ACGC AT AC GC C T AT GT AT AC AC AC GC AC AC G AAC 
TACCACCTAACCCACCAAAAATATCCATCCACCGCCGTGTCTGTTCAGCAGTCCCACAGGAAAAAGGGCGGAGGGAAAGCCACAGGACATTGCAA 
TCCCTGATTTCCAGCCTGGCAAAATGCCGCACTGCGCCCTGCCCTCGCCCTCCATTTCGCTGTACCGCTGCAGCGCCATGCCGCTGCCTATAGCA 
AGCGGGGGCGGTAATGGGGCCACCAGCGGATCGGCGGCCAATGCCCTCAAGCGCAAGTTCGCCCGGAGTCAAGGAGGCAATGCTGCAGGAGCAGC 

AGG AGC T GCTCCTCC GGC AGC G AC C G C CT CGT C AGC AGCAAC AGC GAC AGC AGC AT C TGC T T C AC C AT C AAC G GC C AAT C GC C AGC AG C AAC AGA 
TCGAACTCAGGTAGAGCCAAGTGGATGAGCACTTGACTCTTGCATGTCAGCGTGTCATGTCTTGTTTGCCAGGACGCAAGGTCTACAGCCTGCGG 
CAATTAAAAAGCATCAAATGAAAAAATACCCAAAGAAACACACAATTGGCAAAACAATTCGTAGTTGTCTTCAAAAACGCTGTAGCTCTGTGCCA 
CCTGAGATTTATGTTCCACCATCTCTGCATGCTAAAGCTTTACTTTACGCTTTGTCTAATGTGCTGTTCGGTGTTTTAGCTTTTTGAAGGCGTTT 
CTCTAATACATTTCCAATCTTTAATCCCTTACAGCCGAGCCCTGCGCACTGATGTCTCCCTGGTGCCTCCCATTCGACAGACTGCCTCGATCTTC 
AAGCAACCGGTGACGGTGATTCGCAACCATAAACAGGATCCCGCCAAAGCGAAGAATGAACCCAAGCATGGCACTCGGGAGAAGCCGAAACAGCT 
GTTCTGGGAGAAGCGACTGGAACGACTACGCGCCTGCCACGACAGTGGCGAGGAGCTAGACGACATCTCACTGCCCAAGACCATACGCACCGTTG 
GCCCCAATGTCAACGAACAGACCGTCCTCCAGTCGGTGGCCACGGCCCTGCATATGCTCAACGCCGGCGTACACGGCCAGAGTTCTACGAAGGCT 
GATTTGACCAAGAATGCCATGGCGTTTATGAATCCTGAACAACCGCTAATGCACGCGGTGATCATTTCGGAGGACGACATACGCAAACAGGAGGA 
TCGGGTCGGAGTTGCGCGGCGAAAGCTGCAGGATGCACTCAAGACATGAGTTCAATGGTTAACCCCCATGCATCTACCAATTCCATTGTAGGCTC 
TAACATCTCTATCGTATTGACCCCTATTCGAAATCTACATTTAAGACTAGTGTGTAATAAACCAAATCCAACTCCAATCGAGGAATTCTGGTTTC 
TTATTGGACATCGAGAAAGCTTCGCTTGTCTTTTCCATGACAACAGGGCTCTGTTATCCAATTTTATTACGCTCAGATGACTTTAAAATGCTATT 
TTATACAAGACAGTTTAAAAACGAATTCGAACTATAATACATAAACCGAATTTTCATACTTAGTCGATTTGTTTGGCAAATCCTTGGAACGAAGC 
GTTATCTACGACGAGGGGATCATTACCTGACGAAATTTTCGGTCCAAAGATTTATGGCCAAGTAAATCGATCTCCTTCAGTGTTTAGGTAAACCT 
TTAAAGCTGGACAGTTTTTGTGCTTCATCCCCACATTGTATTTTAAAATATATATACTATACACATAGATATCGTTGCAGAGATGCGATTAGTCT 
AGGCCCTCCTCCATCTCGTCATCCTCGTAATCTATGCCATCGTAGTCCGTCGAAGAGCGTGTCGTTAGCTGCAAGGATGAATCAATCGGAATTTG 
CAATTAACTGAGAATTTTGCACGTGCTCAAGCTGAGGCTGTCTACAAGAGTGTCTGGAGCCAACAGAAGACATTGAACCAGAACTATCGAATAAG 
CTCCATCAAGTCAAGAGTAAAGAACTACAAATTTCAGCCAAAAATACCCACCTGGTTGTAAATCCCACGCCATGCCGCAAACGCTCCCATGGCCA 
AGACGGCTCCCACGCCGGCTACTATTTTGGTGTTCAGCTCGGAGTCTAGGAACTTGCGCTCCGAGGCCGTCAGACTGGCCTCCGTGCCACTGGGT 
GTCTTTGTGAGCTTGACGGAACTGTATCCCTCGATGCGGAATATGTCCTTCGTGATGCGGTTGATAAAGTTCACTAGGTTTTGGCAGCCCTTAAT 
GTGATTCTGCAGTGGATTGTTAGGCAGTGCGATCTTAAAGATGATGGCAAGATAGCTGTAGACAATGGCATCAAACTCGCTGTAGGTGTCGCCAA 

AGAACCACACCTTGCGACCCAATTTCCG 
(SEQ ID NO: 922) 

Exon: 1001.. 1233 
Exon: 2125.. 2733 
Start ATG: 1043 

Transcript No * CT24429 

ACGCTGCTTTTGTGGAATTGTTAATTATTTATTTCGGCGAGCATGCAAATGAACCCGAGCGTCACAATCGAGCGCAAACGCGTCGACTGCAGTGT 
CCTGCCAAAGGGATGGCAGCGCGACGAGGTCCGAAAGTCCGGCAGCAGCGCCAATAACAACGCCAGCAGCAACAACAACAGCAGTGCCACAGCCA 
GCAGCAACAACAACAATAACAAGGTGGATGTTTTCTACTACAGCCGAGCCCTGCGCACTGATGTCTCCCTGGTGCCTCCCATTCGACAGACTGCC 
TCGATCTTCAAGCAACCGGTGACGGTGATTCGCAACCATAAACAGGATCCCGCCAAAGCGAAGAATGAACCCAAGCATGGCACTCGGGAGAAGCC 
GAAACAGCTGTTCTGGGAGAAGCGACTGGAACGACTACGCGCCTGCCACGACAGTGGCGAGGAGCTAGACGACATCTCACTGCCCAAGACCATAC 
GCACCGTTGGCCCCAATGTCAACGAACAGACCGTCCTCCAGTCGGTGGCCACGGCCCTGCATATGCTCAACGCCGGCGTACACGGCCAGAGTTCT 
ACGAAGGCTGATTTGACCAAGAATGCCATGGCGTTTATGAATCCTGAACAACCGCTAATGCACGCGGTGATCATTTCGGAGGACGACATACGCAA 
ACAGGAGGATCGGGTCGGAGTTGCGCGGCGAAAGCTGCAGGATGCACTCAAGACATGAGTTCAATGGTTAACCCCCATGCATCTACCAATTCCAT 
TGTAGGCTCTAACATCTCTATCGTATTGACCCCTATTCGAAATCTACATTTAAGACTAGTGTGTAATAAACCAAATCCAACT 

{SEQ ID NO: 923) 
Start ATG: 43 

MQMNPSVTIERKRVDCSVLPKGWQRDEVRKSGSSANNNASSNNNSSATASSNNNNNKVDVFYYSRALRTDVSLVPPIRQTASIFKQPVTVIRNHK 
QDPAKAKNEPKHGTREKPKQLFWEKRLERLRACHDSGEELDD 
PEQPLMH AVI ISEDDI RKQEDRVG VARRKLQDALKT * 
(SEQ ID NO: 924) 

Classification: known_f lybase_gene 

Gene Symbol: methyl-CpG-binding-domain-like-protein 
FlyBase ID: FBgn0027950 

Celera Sequence No. : 142000013384654 
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TGATGATGAGGACTGTGCAAGCATGCGAAGCAGTTCGCCGTCAftCACAATCCGAGATGCTAGTGGTACCGCCACTTTCGGTGCAACCCACGCGGA 

AAAACAAGGCCGTTGTGGGCCGCATTTAGAGCTGATGATGATGATGACTCCACAGTCAAATTGTGTATTCCATTTTGGAAGTCTCTTTTTGGAGA 

GACTCGCTTGCAGTGCTATTCCTTGAGCATTGGTCGCCCATGTCCGAGTGCATATACAAGCTTCGAATTCGTAATCTCACAACACAACCAGAAAC 

AAATTATCGAGCGTCCAATATTGTTTAGATTGTAAAATGTATTGTTAAGTTGCACCTGTCGATAGGAAGGCTCGCAAATGTTGAAGCGTTTGCCC 

GAGCTGTCGATTGATCATGATTAACTTGTAAATAGTTCTCCATATGTGTACTCCCGTGCCCGTATAATTGTGTCGCGTCGCTAAATGTAATCAAC 

TACTCATTGCAGTGACTAGGGATCGAATTCACGCACGCACCGAACGATCGATTCGGATGGTTTTCGGTCGGATGATCGCCACACCGCAACAAGCT 

rTGAATGTTTACTGAGTAAATTTTTTAAAAACCCTATCTTACCATATTGATTATAACTCAACGATTTTGCTTGAGTAGGCAATTACGACCCTAAG 

TCGTAAGGCGTTCAAAGCATGAAGATGTTTAAGTTAATATATGAACAATGATTAAAGGAAAACATTTTTAAATCAAAGCATTTTTTATTCTTAGC 

TTTGAAGCTGCAAAGTAGGCAACGAATTACACAAACTTCGACAGTTACCACAATCTCACCCGAAAAGATGCAATTGTTGCGCATAGAAAATTAGG 

TTTTTAATTTAGTCTATAGAACAATTTTGTAGGGGCAATTCCAAGTAAATATTCATATACACCAAGCCAATCATATCTTCCACACTTACGAATCT 

GGTGTACAAAAATAAAAATATACTAACGTAGAGGACCTAAACAATCGCTTTCAGAGTGTATAAACCGATTTGACGCCCGAATTGAATTTGGACAT 

TGCATCGGAATGAAGTTGATACTGCAAACTATGCGATTGCAAATGAAGGAGTATCTTCAGCTTCTGGCCGATCTCCAGAATGGCCATTTCGTTAC 

TCCCGTTGGCAGTCCAAACCGCTGTAATAGATACAGTTTTTTATCAATGTGCAGAACACATTTGATAAACTTACAGATTTTGTTGCTCTTGTTAC 

GTATATTGATGACAGCTCCGCAGATCTCGTCGGAGTATTCAAAATTTTGGCCCACCATCAGCAGTAGAATGTCCAGCCAAATTTGATCCAATTCC 

GCTTTGGCACTCTTACTCACGGTGACCAGCCAGCGACCACCCTTGATGTTGGCTTCATCCTCCCACATGGGTCTTTGGGAGTAGAGGATTTAAGT 

TAGCATCTCTCAATGAACAATGATTTATTCGTACTTTATGCCCTTCTTGAACACGGAGTAATCGCATCCAATCTTCAGCTCCGCTGGGGTCTTGA 

TGGTATGGTACAAGCTCCAGAAGGTCTCCACACTGTCAATCTCGGTAATCTCGTTTAGCATGTCCTTCCAGTGCTTGGTGCGATCGTTTTCCAAA 

TACCACAAAGTCCACACATGCTCCAAAGGATGCTTGTACTCGACTTGGTATATCTCTTCATCGATAGGTTCTGGGCTATTGCTCTCCAGTGGAAC 

ACCTTCGTGGTGATGATGATGCTTATGGATACTATTCAGCTGATGTTGCTGATCCTGCTGCTTCATAATATCAAAACTTTGCACTTGTGCACTGG 

CCATGTTGTTGGTTGACTTGTTTTTTTTTTTTCGATACGAACAGGTAACCTTAATTTATTTGCTGACGGCTGTGTTCTCAACGAATTTTGTCGCG 

ATTCGAGTGGGAATTGCTTACTAATAACCAAAAATACTTTAGCACATTTTTCTAGACCTAAACGAAACCTGTCGAAAAGAAGGAAATGAAATGAC 

AATGACACCTGGGTGTAAGCATACTATGCTGGTTTTTTCTTTTAATATAAATTATTTTTATTCATTTATAAATACTGCAAAACTTAAGACAAATC 

AATTTACATATTGTTAACATAGAAATGCGCGCTTGGTCCTTAAAAAAAAAAATAATAACATACAAAATGAAAATGTATGCACTCTATTCCAATGG 

TCAATTAAACACTAAGCGGAGGCACTAGAGAAAACAAATCGAAACAGCTACGGGGTGTTCAAAATAAAAAAAAACACGGTTTTCGTAACCATATT 

TGAAAAGATTCTCAATTCTTTTAGAAATACGTTAATAAATTATCTCAAGGCTGGTTTTGAATCATCCACCCGTCTAGACCCATAAAACAGCGGCT 

GCATCCTCGTTGGACAAAGTGGAGAAGCGCTGGTAGCGATTCGCATTGCGATAGGTTTTCGAGGGCGATGGAGCGGTTGATATCTGAGCTTGAAC 

CTGTACTTGAACTTGAGTCTGATTTTGCATGGGCAGCCCCACCATATTTAATCCGGACTTTAGTGTGGAGTTCCTGCTGTCCGAGGACATGGACA 

GGCGTCGCTGGTTTCCATTCATCAAGGTGGAACTCAGGGGCAAGCCGACATCGTGAAAGGAGCTAATCGAACTGAAGCCACTCTCATCCTCGGTA 

CTGGAACCGCTCTGCTTATTGCTGGTGGTTTTGTTATTGCTATTGCTGATGGCTGTGTTGCTGGCGGTGGTCTGTGTCTGCGTCTCCTCATCCGT 

TTCCACTTTCGAGGAGGAGGATGCCGTCGACGAGATGGTGGAATGCACGGAACTGTCGGTGGAGCTGCGCAACGATAGCCTCGAGTCGCGCATGT 

TGTTGGCCCGCAGAAAGATGAGTTCCGCATTTGATTTCCCGCCCGGTGATGGAGTCTCCAGTGTGGCACTCATGATACTGGTGGAATTGTGCAAT 

GGTGGCGTCAGTA 
(SEQ ID NO: 925) 

Exon: 1958.. 1460 
Exon: 1402.. 1215 
Exon: 1161.. 1001 

Start ATG: 1776 (Reverse strand: CAT) 

Transcript No * CT24507 
GGTCTAGAAAAATGTGCTAAAGTATTTTT^ 

TAAATTAAGGTTACCTGTTCGTATCGAAAAAAAAAAAACAAGTCAACCAACAACATGGCCAGTGCACAAGTGCAAAGTTTTGATATTATGAAGCA 
GCAGGATCAGCAACATCAGCTGAATAGTATCCATAAGCATCATCATCACCACGAAGGTGTTCCACTGGAGAGCAATAGCCCAGAACCTATCGATG 
AAGAGATATACCAAGTCGAGTACAAGCATCCTTTGGAGCATGTGTGGACTTTGTGGTATTTGGAAAACGATCGCACCAAGCACTGGAAGGACATG 
CTAAACGAGATTACCGAGATTGACAGTGTGGAGACCTTCTGGAGCTTGTACCATACCATCAAGACCCCAGCGGAGCTGAAGATTGGATGCGATTA 
CTCCGTGTTCAAGAAGGGCATAAAACCCATGTGGGAGGATGAAGCCAACATCAAGGGTGGTCGCTGGCTGGTCACCGTGAGTAAGAGTGCCAAAG 
CGGAATTGGATCAAATTTGGCTGGACATTCTACTGCTGATGGTGGGCCAAAATTTTGAATACTCCGACGAGATCTGCGGAGCTGTCATCAATATA 
CGTAACAAGAGCAACAAAATCTCGGTTTGGACTGCCAACGGGAGTAACGAAATGGCCATTCTGGAGATCGGCCAGAAGCTGAAGATACTCCTTCA 
TTTGCAATCGCATAGTTTGCAGTATCAACTTCATTCCGATGCAATGTCCAAATTCAATTCGGGCGTCAAATCGGTTTATACACTCTGA 

(SEQ ID NO: 926) 

Start ATG: 183 {Reverse strand: CAT) 

MKGQDQQHQLNSIHKHHHHHEGVPLESNSPEPID 
CDYSVFKKGIKPMWEDEANIKGGRWLVW 
LLHLQSHSLQYQLHSDAMSKFNSGVKSVYTL* 
(SEQ ID NO: 927) 

Classification : translation_f actor 

Celera Sequence No. : 142000013384654 ^ mw ™ 
TATATTGCTCATTTAATTGAAAGAATTTATTTATAAATTATAAATTTATTTATAAATTTATTTATATTTATTTATAAATTTATATTTTCAATAAA 

TCGAATGAATTATTAGCATTTATTGAAGTTCCCGCAATATTTAAATTACGCTCCATTAATTTATGATGTGTAAGCCACTGTAAATCCTATTATTA 

CATTTTTGTTAGCTTAATTGAGAACTAAATTTGATGAAAATTTTCAAGAACATTTTTATTTACAGCTTACTACCCATTTTTTTTTTTTGTGCTAA 

GTTACCCCGATGCAACTTTCAATTCCACTTGATGTTTGCTGGCAAATTCATTGAGACACTTAATAAATGTTACATATACGCCTTTCAACCACTTT 

AGACCCCCAACTATGCCCCCTTATCACCCTTATCAATGCCAAGTGAAGTTTACTCTCCAATTTCGACCCACACACGACACTTTTCGCACATAAAT 

CAAGTTGACCTTAAAAAAATCAGTGAAACAAAAACTTTGTCATATCATGGCCAGCAACAACAACAATAATAACAACAACAACTGCAAGACGCCAA 

AGGAGGCGAGGATTTTTGTGGGCAGAAGAGGGGGTAGTGGGGTTGTTAAGGGGAGGCTGTCAAGGCTACGTGACAAACAACAAGGTGACATAAAT 

ATCCAGTTGCTCCAAAAAGGAAACTGAACTTTTTTCTTTAATGCCAAGGCAACATAGCAAAGTTTAAGGGAGCCGGGGGATGTGGGGCAGAGTGA 

GAGGGGAGACGGAGACGCGTTCCAAGCAAAGAAAGCATAAAGTGATACAGTGGGCACTCGCGTAACAGAACTTCAGGGAATTCCAAGTCAAAGAA 
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AAGTGAACTTGAATTCAGAAAAAGTTGTTAATTAAATATGTACATAGTATTAACTGTTTTTAATATTGTTTGTAAATAATTTTAACAAAATATTT 
GAAAATCACGTTGTTTCTATAAATTTCCCTTAACGCATATACATATGTACTTAATTTTTTTTGTAGCCCAAGCATTTCCTCATAACGTTAAATGT 
TGGCATTACTGCGGAATTTAAATTCCGAGAATTCTCGTTTGGCGCAAAGATTACTGTACTAACCTGCTGCTGTGTTGGCTGATGTGTCGGTTGCA 
TTGGCCAGACGAGATTGAGATTCCTGGCGGATTGTGCCGGTGGTTGTGGATGCTGAGGGGGATGGGGCGGCTGGGCGGTGAGCAGCTGATTAGAG 
GCGCCAAAGTGTATCATGTTGGCGGTGTGACGTCGTGCTGGCAGAGCGGGGGGCGGGGTGGCGCAGGGGGAATCCTGTTCGTTGTCTCTCTCTAG 
CGATTCCTGATTCTCCAGCTGCTGCTGTTGTTGTTGTTGTTGTAGTTTGGCCTCGTTTGTTGGCGGCTTCTTGGGCAATGTCAAGGCATTGTTGT 
TGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGAGCGAAAGCGAACGCGCAAAAGAACTTTTGAAATAGCGACACGGCATCTGGAAATCA 
AAGTCAAGATCCGTTGTCTGCTGCATGTTTGTTGTGGTTTTGTGCATTTCTATTGGCTAAAACGGGATTTCTTTATTTATATATTTGAATTTTCC 
CTTGTGCTGCCTCTTTCTCTGCTATTTTTTTTTGTTGGTCTCTTTTAAATTGGCATTGTTGATGTTTTCCTCTTTAGCGGTATTTCCCTTTGGAT 

TAT T GC AT T TT T T C AC G C AC CC AT AAGT T C AC T T TC AC AC AC T C AC GC AC AAAT C AAT T AGT TT T T G AC AC T C G AAGG AT x T T T T T C AAT AC AAT 

TTAATAGAAATTCAATGGAAATCAATTTCTGGGATTTATTCGCGCGCGCATAACACAAACAAAATTCATAAAGCCGCAGCTAGTTTTCGTTTGCG 

CCCGAATTGATGAGTTTTACTGTACTAAAATCCAGAAAAATTGGGAAATATTGATTCGTTGAAATGTTGTTGGGTTTGAATTTTAGTGCGAACTG 

TTGTTATTAAATGTTCAAGAATAATGTTCTACT AAT AAAT AGCTTGTTTTTCAACTTTACAACAGTTTGCTGCTTCTCGTAATTCACGCGACTTT 

TCCCGGCCGAGGTGGAAACCTAACTGGAAGACCAACTGTATGTTACCGCCAAATTCGCAGACAGTTTGTGCTTGGGTCACCCTGCTTGGGTTATT 

ATTTCGATCTCCAATGTTAATGTTAATGTTATTAACAGTAAGCTCGTTCTCTCTGAAATAAATGAATGAAGAAGTTACCAAGGCAACCGAATAAA 

ACAGAGTCTGCAAAGAATGTCTTAACAGCTAGCTATTCTCTTAATAACAGTGCAACAACAGTAAAAAAAATATGCTTAAAAGTATGCTAAGCATT 

T T GT T AAAC AAT CGT T T T T AT GT C AT AAG AT C AGAT C GT G AAACC T AAAT T TAAC AAAT T AAAT G AAAG AAAC GT C T T GAAC GT AT AC AT T T AAG 

T GAT T T T AT AG AGAC TTT T T T T T T AGCT T AT AGC AT T T AGT C AAT AGC T TAGT T AAC T GC AC AAG AGT CT AT AAT AAT AAAAGT C T AT AAAT AC A 

TTAAATTCGTTAACATGTTTTCAAATGTAGAAATGTTAATGATTTGGCGCGCCAAGAAATTTGAAAAATGTAGATTGTATTTTACAAGAAAATAT 

GAAAAAGTATATTAAAAAGATAAAGTTTATTTTTGTTCTTCTTTTAGTTAAATAGATTGCATCTTAGTATTCATTATAAATATTATTCCAAATGT 

CTTAAAATTAATTAGTTTTTATATTTTATCATGAGTAAGAAAAGGGTTTTTTTCCAGTGCAAATTTTGTTCAACGAAACTAATAATATCGGTTGA 

GAAATGGAAATTGTTTTACCATTCCCCCTTCTAAGGAGGCATTAGCATTCGTTCCTCCGCTGACTAATCCCCGGCCTAATTCCCATACCTTATGT 

GGTATCTATATTGGGGAGACAGCTTGCCAATCAATTTGATTGCGCTGCGCTGATGTTTGCCCCTGTTTATCTAGCTTAGGGGAAGGATTTTCAAG 

CTTTTATCATAACAGAAGTGCTTTAATTCCCTTTGATGAACATTTGTACATAACCCCCTTTTGA 

(SEQ ID NO: 928) 

Exon: 2104.. 1001 

Start ATG: 1546 (Reverse strand: CAT) 
Transcript No • CT24537 

CACCTCGGCCGGGAAAAGTCGCGTGAATTACGAGAAGCAGCAAACTGTTGTAAAGTTGAAAAACAAGCTATTTATTAGTAGAACATTATTCTTGA 
ACATTTAATAACAACAGTTCGCACTAAAATTCAAACCCAACAACATTTCAACGAATCAATATTTCCCAATTTTTCTGGATTTTAGTACAGTAAAA 
CTCATCAATTCGGGCGCAAACGAAAACTAGCTGCGGCTTTATGAATTTTGTTTGTGTTATGCGCGCGCGAATAAATCCCAGAAATTGATTTCCAT 
TGAATTTCTATTAAATTGTATTGAAAAAAATCCTTCGAGTGTCAAAAACTAATTGATTTGTGCGTGAGTGTGTGAAAGTGAACTTATGGGTGCGT 
GAAAAAATGCAATAATCCAAAGGGAAATACCGCTAAAGAGGAAAACATCAACAATGCCAATTTAAAAGAGACCAACAAAAAAAAA 
AGAGGCAGCACAAGGGAAAATTCAAATATATAAATAAAGAAATCCCGTTTTAGCCAATAGAAATGCACAAAACCACAACAAACATGCAGCAGACA 
ACGGATCTTGACTTTGATTTCCAGATGCCGTGTCGCTATTTCAAAAGTTCTTTTGCGCGTTCGCTTTCGCTCAACAACAACAACAACAACAACAA 
CAACAACAACAACAACAACAATGCCTTGACATTGCCCAAGAAGCCGCCAACAAACGAGGCCAAACTACAACAACAACAACAACAGCAGCAGCTGG 
AGAATCAGGAATCGCTAGAGAGAGACAACGAACAGGATTCCCCCTGCGCCACCCCGCCCCCCGCTCTGCCAGCACGACGTCACACCGCCAACATG 
ATACACTTTGGCGCCTCTAATCAGCTGCTCACCGCCCAGCCGCCCCATCCCCCTCAGCATCCACAACCACCGGCACAATCCGCCAGGAATCTCAA 
TCTCGTCTGGCCAATGCAACCGACACATCAGCCAACACAGCAGCAGGTTAGTACAGTAATCTTTGCGCCAAACGAGAATTCTCGGAATTTAAATT 
CCGCAGTAATGCCAACATTTAACGTTATGAGGAAATGCTTGGGCTACAAAAAAAATTAA 
(SEQ ID NO: 929} 

Start ATG: 559 (Reverse strand: CAT) 

MQQTTDLDFDFQMPCRYFKSSFARSLSLNNNNNNNNNNNNNNNALTLPKKPPTNEAKLQQQQQQQQLENQESLERDNEQDSPCATPPPALPARRH 
TANMIHFGASNQLLTAQPPHPPQHPQPPAQSARNLNLVWPMQPTHQPTQQQVSTVIFAPNENSRNLNSAVMPTFNVMRKCLGYKKN* 

(SEQ ID NO: 930) 



Celera Sequence No. : 14200001338447 4 

AATGAAGGCTTCTGTGTCCGCAGCGAATGCCAAGCCCAAGCTAACGCGACCGTCGGACTCGATTTTAGTTGAGCAACTGAGCGCCGCCGTTTTGT 
CACATTCGGTGCTTAATTTTGGCTAGCAGCGAACTGGGAACGGGCAGAGCTGTTTTACCACCTCTCATCGACGTTCTCGCAGGCAAGCCGGTTTC 
TTGACTCAGCCTGATTTCGATTTAAAAAGCTTAACTAAGCATTCATACATTTAACGGATTCGCAATACGATAATTAACACTTTTGGCTTAGTGCA 
AATATTGGGAAGAAGTTACTATGAAGGCTAATAAACTTTTTATTACTTGTCAATAACTATGCTATTATATTTATTGTATTATATTTATGTACTTC 
TTTGTTAGATAAGCATAATTTAAAGATACAGTATCTCACCTGTTTCATATTCAACTTAAGCTGCTCCTTGATTTCCTGGCATATATTTTCAAGTA 
CCGAGTAATATGTGAAAGTCTGCCATATCAGAAGAGACAGCATAAAGGATAGTATTATATTCACACTTAGTCTGTTCATGTCTGCAATAAATATA 
CCAACTAGTAACTACGTTTTCCATATCAAACTAATTTGCATGGCCAATTTCACGGAAGCGTTTAAAATACAATTATTTGCTTTATTTGAGTCTAA 
ACGGTGTAGCGCGAACTATTTAAATGCAATCGATTGGCGACTGTGGCTACAGTTATCGATAGATTTTACTTTTAAAAACCGTGACTCAATGAATC 
TAAATTATCAACTATGAATTATAATGAATTACTATTTTCAATTTCTTGTTATTTTATTAACAACATAACTTTGCTTACCTTTCATTTATAAATAC 
CGT GAT CGAAAATTGGAGTC ACT AAAGTCACGGC AG AAAAGT TCATCTCT ATT AAGCCAAAGCC AAAC AAAGAAGC AC AC AAAAGT TT AT ACC AT 
AGTTTATTGTTTACCAAAAATATGACCTGTAGTTAAACAGCAGTTAAGCAATGGAGGACCTGTGTCGAATTTGTGGCGGGGCCTCGGAAAACATG 
CTGGGCATTTTCGACGATCAGGTGGAAGAATATGTTGATGGAGCCAAGCTCGCTGAGATGGTGAAGACATGTGCCGACGTGCAGTTGGATCCCGA 
CGACGCTATGCCCCAAAAAATGTGCATTTCCTGTGTCCACGATGCACGAACTGCTTACGGATTCAAGCGAAGATGCGAAGAGAACTACAAGAAGT 
TCTATTTGGCGATATTAAATGGGCAGGTCATCAAGGACGAGCCCAACGAGGAGGATTTCCTATTCATTGAGAACCCAGACAAAGGCAATCTGGAA 
GCAAAGAAGAAACTCAACAAGGAAATTAAAAAGACACATCAAACAGCTTCCAGAACAACCAAGTCTTCCATAACAACCAGTAGAGCTGGCAGACA 
GCTAAGGAGTATAAAGAATCAGACTTTCAAATGCGAACTATGCATCAAACAGTTCAAGCGCCAAATAAACCTCCTGGACCACATGAAGTAATTAC 
AATCAATACATATGATTTCACCTGTTCTTAAAATTGTATATCTCATTCCACAGGGTTCACAGCAATTCACACGTATGCCAAAACTGCGAAGAGCG 
TTTTCTGTTCAAGGCCGATCTGGATAACCACCAATGTTATCGCAATAGCAATTCCACCGTCGAGTGCCCGGAATGCTTAAAGGTCTTTTCAAGCA 
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CCCAGAGCCTGGACAGCCACAAATGCAAGGATATGCAGGAGCGTTCGCCCTTTCAGTGCCCCCACTGTCAGCAGGCCTTTACTCGCGAACAAAAC 
CTCAAGGCGCACCTGTTGATCCACGCGGAATCGAAACAGGGAAACGGACCCCACAAATGTTCATACTGCCAAACGGGATTCTTCAACAAGTCGGC 
CCTCAAGGTGCACATCCATGCGCACATGGGCGAACGGCCGCATGCCTGTCCCTTCTGCGTGTCCAACTTCCGTTCCAAGCAGGCCCTTAAGGTCC 
ACATCCGCATACACACGGGCGAGAAGCCCTACCAATGCCCCCACTGCCCCAAGACTTTTTCGGACAACAATAACCTGGCGAAGCACCGACGACGC 
CACTCGGATGAACGGCCCTACAAGTGTTCCATCTGCCTGCAGGACTTCCGGGAGAAACATCATCTGAAGCGGCACTTTCTGGGCAAGCATCGGGA 
TGGAGATCAAAAGCTAAAGCTAAAGTGATACCTTAAATTCTTGACCAAATACCACGGCTTACTCGAATAAATAAATTAGGATTCGGTTATGTTTA 
TATCTCATTGTTTCTATAGTTCATATTGGGATTTAATTGAATTTAGGATTTATATTTTCGTAGAATTCGAACAGCACTTTGAAAACCATTTGGAG 
TGGCAACACCGTTTTTCTGTAAATCAAAATTCGATGTAGACACGACGATATAGACAGCGTTGCCATTCGGCCGTGATTTGTTCACGTGGCACCAC 

CAT T T T GAT AAAC AAGC T AAAC AAC C C T T AAAT C GC T C G AAAAGAG C AAT GG AAAAAAAC AAC AAGT T GT AT GT T T AT AAAT T AT AAGT CT T AC A 
AATTGTAATTGTGGTTTCCGTTTGCTTAGAACATCAAAGCAGCTACCAACACACAAAAAATACCGAATATTTCTCGACGTAATTTTCCCTAGACG 
TCCTTACAATTCGTCGTGTTCGCTTTGATCTTTGTCTTGCTTAGATTCATCATCCTTCTCCTCATCGTCATCATCCTCGTCCTCATCGTCCTTGT 
CGGAAGCCTTTTTCGCCTCCTCCTCGTCCTTCTTGCGCTGCACCTCATCCTGGGCCTCCTTCATTTTCTTCTCGCCCGCCTGGGTGTTCTTGACT 
TCGGCAGCAGCCTTAGCGGCCAGCTCCACGTCGTCCGTGATCAGGACGTTGTCGAAGATAGTGCCGGACTTGACCTGCCAGAGATCGAAGCCCAG 
GGTGCAGATCTCCTTGCGCAGATACAGCTTGTCGTCGGGCACATACTCGGGATTGGCGATCTCGGGATGCTCCCAGGCGCCCTTGTAGTTGGGGT 
TGTCCAGCTGCTTTGGCTGCCACTCGCCCTTGAACTCAGGGTTGTCGATCATCGGTGGCTCCCACTCGCCGTCCATCTCATCGTCCCAATCCTCA 
GGCTTTGTGGCATCTGGGTCGGGAATGTGCTCTGGTTTGTCCCAGTCCTCGGGCTTCTTGTCATCGGGATCGGGAATG 

(SEQ ID NO: 931) 

Exon: 1001.. 1512 
Exon: 1574.. 2213 
Start ATG: 1001 

ATGGAGGACCTGTGTC^ 

CGCTGAGATGGTGAAGACATGTGCCGACGTGCAGTTGGATCCCGACGACGCTATGCCCCAAAAAATGTGCATTTCCTGTGTCCACGATGCACGAA 
CTGCTTACGGATTCAAGCGAAGATGCGAAGAGAACTACAAGAAGTTCTATTTGGCGATATTAAATGGGCAGGTCATCAAGGACGAGCCCAACGAG 
GAGGATTTCCTATTCATTGAGAACCCAGACAAAGGCAATCTGGAAGCAAAGAAGAAACTC AAC AAGGAAATTAAAAAGACACATC AAAC AGCTTC 
CAGAACAACCAAGTCTTCCATAACAACCAGTAGAGCTGGCAGACAGCTAAGGAGTATAAAGAATCAGACTTTCAAATGCGAACTATGCATCAAAC 
AGTTCAAGCGCCAAATAAACCTCCTGGACCACATGAAGGTTCACAGCAATTCACACGTATGCCAAAACTGCGAAGAGCGTTTTCTGTTCAAGGCC 

GAT C T G GAT AAC C AC C AAT GT TAT C GC AAT AGC AATT C C AC C GT CG AGT GCC C GG AAT G CTT AAAG GT C T TT T C AAGC AC C C AG AGC C TG GAC AG 
CCACAAATGCAAGGATATGCAGGAGCGTTCGCCCTTTCAGTGCCCCCACTGTCAGCAGGCCTTTACTCGCGAACAAAACCTCAAGGCGCACCTGT 
TGATCCACGCGGAATCGAAACAGGGAAACGGACCCCACAAATGTTCATACTGCCAAACGGGATTCTTCAACAAGTCGGCCCTCAAGGTGCACATC 
CATGCGCACATGGGCGAACGGCCGCATGCCTGTCCCTTCTGCGTGTCCAACTTCCGTTCCAAGCAGGCCCTTAAGGTCCACATCCGCATACACAC 
GGGCGAGAAGCCCTACCAATGCCCCCACTGCCCCAAGACTTTTTCGGACAACAATAACCTGGCGAAGCACCGACGACGCCACTCGGATGAACGGC 
CCTACAAGTGTTCCATCTGCCTGCAGGACTTCCGGGAGAAACATCATCTGAAGCGGCACTTTCTGGGCAAGCATCGGGATGGAGATCAAAAGCTA 

AAGCTAAAGTGA 
{SEQ ID NO: 932) 

Start ATG: 1 

MEDLCRICGGASENMLGIFDDQVEEYVDGAKLAEMVKTCADVQLDPDDAMPQKMCISCVHDARTAYGFKRRCEENYKKFYLAILNGQVIKDEPNE 
EDFLFIENPDKGNLEAKKKLNKEIKKTHQTASRTTKSSITTSRAGRQLRSIKNQTFKCELCIKQFKRQINLLDHMKVHSNSHVCQWCEERFLFKA 
DLDNHQCYRNSNSTVECPECLKVFSSTQSLDSHKCKDMQERSPFQCPHCQQAFTREQNLKAHLLIHAESKQGNGPHKCSYCQTGFFNKSALKVHI 
HAHMGERPHACPFCVSNFRSKQALKVHIRIHTGEKPYQCPHCPKTFSDNNNLAKHRRRHSDERPYKCSICLQDFREKHHLKRHFLGKHRDGDQKL 

KLK* 

(SEQ ID NO: 933) 

Name: zinc finger protein 
Classification: transcription_f actor 

Celera Sequence No. : 142000013384474 

CGCGATGACTACCACTTACCGTTGTCGAAATTCTCCTTCAGATAAACCTCGGCACTAATAAAGCCGACTGTCGCCAGCAACACTATCACTGTTTT 
GCACCACATCATGTGAACGATCCCGCACCACTTTTTGGACTGACTCTTCCAACTCCGATGGTTGTCCGAACTCGGTTAAAGTTAAGTACTAAATG 
CCGGCAAGAAAGTGCTATTTATTTATTGCATCTAAATCGTGCGTGTCGGAGCGGGACGGCTTGTTACTGAAATTGGTTTGCTGGAACGTTTTCAT 
TGGCCCAAATCAGGCCAATCAGGGATGTCTACGTATTCTTTTTCACCGGCCAGGGCTGATTTCGATAAGCAATCGCATGCTGCCCTGTCGCTATC 
GATTGCTTGAGCTGGAATTAGAAACACTAAAGTGACCGTTTGCTAATTAAATTAAATTATAAAATATATAAAATTAATATATTCTGGCATACAAT 
TAATCACTTGCAGCAGGCTAAGCTAAGAAAACATAAAATTACTCCAATAACAATTCTTCTACTACTTAACAATTGTAGTAGTTACAGGAAGAACA 
GTTGTGCTTCCAGAACATTCCTAAGCTTTTTAACGGCTTAAGGGTTTTGAAAATTTTAACTCACACTTACATCACATGTTAACAATTAAGTTTTC 
ACCCGATAAAGTTCATAGTATTTAAAAAGTTAGAATGTTAATTATACATTATAATACAAATCAAAAACAAATAAAGGAATTCCAAAACTTTGCAC 
CTGTATTTCGTTAATTTTAGGAACTTTTTTATCTATAAGTGCTAACTTTTACATTTGACTTTATTACATTAAAAAATTACAATTTTCGCTGGAAA 
TGTATTATATTAATAAAAAAGATAAATAGCCAGAAATAGAAAAGTGAAATTAAATTAATTAAAACAAAAAAGCTTTGGTATTTATACCAATAGAC 
CTTCTAAGGTATTTTTTAAGACCTAGCTTTGTGGCAGATACGGTAAGTTCGTGTCGAGCCCACGTGTTGAGCCAATCTTATCAAACGCCGATTCG 
AATGCGGCATTAAACAAATGGATTCCCTTAACAACGGATGGTTCAGCGAACTGCAGGCCGATCTCTGGCCCGGTCAGTCATTCTCCCTGAAAGTC 
AAGGAGGTCATCCACAAGGAGAAGTCCCGGTTCCAAGACATCCAGATCGTTGAAACGTAAGATAAATAATAGATAAACAAATACTTGGACACACA 
ACCCCATTTGTTTACATCCGAAATTCCTCAGTCTAGACTGAAGTTTGAAGATAGATGTCAAAATAAATTAGCTTAGATTGAGATTAAAAAACTAA 
ATTAAAATCATTATAAACACGCAAGTCACACAAATCCGAAATTTTTAACAGCTTAAGCTGTTGAGATTTTAAGGACTAACCTACAATAGCCTAAT 
CCCATATGTATCCTACATTGAAGGTCCCTAAATCATAGATTGTAGAAATCGAAAATTTTCCACTGGTTGTTCAAACACAAATTGCCCAGCCAGCA 
TTTTCGGACCCTTTGGTTTTCGTGTGTTTATTCTTGATCTGCACCAACAGCCTTATCGATGGAGTCATCTAGGGCCAAAAACGACAGACAGAATA 
CTGATAGCAAGTCAGCTCGTCTTTCAGGTTTGAGTTCCTCCAGTCGAACATTGGGTACACTTGCGCCGCTGAAAAAATCAATAATTCGATAATTC 
TATTACAGCGAAACCTATGGACGGTGCCTAATTCTGGACGGAATCATTCAGTGCACGGCCAGGGATGAGTTCTCGTACCAGGAGATGATATCTTT 
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CCTGCCGCTCTGCGCCCATCCCAATCCCAAAAAGGTCCTGATCGTGGGCGGTGGCGATGGCGGCGTTGCTCGCGAGGTGGTAAAGCATCCACTGG 

TCGAGGAAGTGCATCAGGTGGAAATCGACGACCGTGTCGTCGAGCTGTCCAAGCAATATCTCCCAGCGATGGCCTGTGGTTTCGCCAACGAGAAG 

TTGAAGCTTACCATTGGCGATGGATTCGACTATATGAAGAAACACAAGAACGAATTTGATGTCATCATCACCGACAGCTCGGATCCCATTGGTCC 

GGCAGTGAGCCTGTTTCAGGAAAGCTACTACGAGCTAATGAAACACGCGCTGAAGGATGACGGAATCGTGTGCTCCCAGGGCGGTAGCTTCTGGC 

TGGACCTGGACTACATCAAGAAGACCATGTCCGGCTGCAAGGAGCACTTTGCTAAGGTGGCCTATGCCGTCACCTCCGTTCCGTCCTATCCCTGC 

GGCCACATTGGCTTCGTCATGGGCTCCCTGAACAAAAACCAGGACTTCGCTACTCCCAAGCTTGGAAAGTCCGAAATCGATTCTATAGACCTAAG 

GTACTACTCCTCCGAGGTTCACTCCGCCGCCTTTGCCTTGCCTCGATGGGTGCAGAAGCACTTTTACGAATGACCAGGTTGCGGAAAGATTCGTG 

TGTATCTGTTTGTTAAAAATAAATATGATAATAATATGACTTGTTTATCATCGCTTCTTAAATTCTTGGATATTTTCAACAAATGCTGACTCTTG 

CAATGATTCATTTTTAGTCGAATTTTTGTAGCCTGAGTATTTTCCTTCCAATCCGACTACCATCTTTCAGTTATTTTACTGTTGAAAATAGAACA 

AATTGTAATGTTTATTACGTAATGATATATACAGTTGGCTCTATTTAATTTTTGTATTTGTAAGTGGATGCTGGGTTAACATAATGAGACTTATT 

GCAATATTTTACACGACCGAATGTTACTCTCATTTATATATCGCATATATTGCAAAACTTTGAAAGGAAGCGTTTCTTATATGCAAAAATGACCT 

TAACTGCATGTGAATAATTAACTAAAAACTGATATTCAATTTCAATAACTTAATAGAAATTAAATCTTCAACCTGTTTAATGACTAATTAAATAT 

TGTTATAGTTAGGAAACGCAATTGATACATAATTACAAAATTAGATGTAAAATATGTACAATCATTTAATATTGGTTTTTATTTTCTCGCCTATT 

CTAAAGTTAATTATCATTTCTTTTGTGTCGGATAGATTTTGGTGTCTAATGTGCGTGTTTTTATGTGTAATAAATATAATGTATGTGCATTGCGA 

TTTTGGTAGCCGAGCTCCAGTTTTCGAAAAAGTCGAATCATTTTGTGTCCAATTGAGAACTATTTCACATTTTTCGAAATAGATGGATAAACTAA 

TTATTGTGTGTGTACGAATACTCCACTCCTTTGACTGGGTAAATTTGAATAAACCTATAAAAATTCCTAATTCAAGTATAATAATTTTGTCTAAA 

TAAATCAATTTGTAAATGTGTATTCCAAAAAAAGAATCACGAGCAGTTGGAAAAAAATTAATAATAATGGTTTTGAAAACCGGAGCTTGACTGTG 

TATAATTTGTTATGCGATTGATTTTGTATAGGGAATTTCATGTCTTCGCGATTACATTTGTTATAGCGTGTTTTATATTATCAAT 

{SEQ ID NO: 934) 

Exon: 1001.. 1196 
Exon: 1719 ..2505 
Start ATG: 1063 

Transcript No. : CT24589 

GTGTCGAGCCCACGTGTTGAGCCAATCTTATCAAACGCCGATTCGAATGCGGCATTAAACAAATGGATTCCCTTAACAACGGATGGTTCAGCGAA 
CTGCAGGCCGATCTCTGGCCCGGTCAGTCATTCTCCCTGAAAGTCAAGGAGGTCATCCACAAGGAGAAGTCCCGGTTCCAAGACATCCAGATCGT 
TGAAACCGAAACCTATGGACGGTGCCTAATTCTGGACGGAATCATTCAGTGCACGGCCAGGGATGAGTTCTCGTACCAGGAGATGATATCTTTCC 
TGCCGCTCTGCGCCCATCCCAATCCCAAAAAGGTCCTGATCGTGGGCGGTGGCGATGGCGGCGTTGCTCGCGAGGTGGTAAAGCATCCACTGGTC 
GAGGAAGTGCATCAGGTGGAAATCGACGACCGTGTCGTCGAGCTGTCCAAGCAATATCTCCCAGCGATGGCCTGTGGTTTCGCCAACGAGAAGTT 
GAAGCTTACCATTGGCGATGGATTCGACTATATGAAGAAACACAAGAACGAATTTGATGTCATCATCACCGACAGCTCGGATCCCATTGGTCCGG 
CAGTGAGCCTGTTTCAGGAAAGCTACTACGAGCTAATGAAACACGCGCTGAAGGATGACGGAATCGTGTGCTCCCAGGGCGGTAGCTTCTGGCTG 
GACCTGGACTACATCAAGAAGACCATGTCCGGCTGCAAGGAGCACTTTGCTAAGGTGGCCTATGCCGTCACCTCCGTTCCGTCCTATCCCTGCGG 
CCACATTGGCTTCGTCATGGGCTCCCTGAACAAAAACCAGGACTTCGCTACTCCCAAGCTTGGAAAGTCCGAAATCGATTCTATAGACCTAAGGT 
ACTACTCCTCCGAGGTTCACTCCGCCGCCTTTGCCTTGCCTCGATGGGTGCAGAAGCACTTTTACGAATGACCAGGTTGCGGAAAGATTCGTGTG 
TATCTGT TTGT T AAAAATAAAT AT GAT AAT AAT 
(SEQ ID NO; 935) 

Start ATG: 63 

MDSLNNGWFSELQADLWPGQSFSLKVKEVIHKEKSRFQDIQIVETETYGRCLILDGIIQCTARDEFSYQEMISFLPLCAHPWPKKVLIVGGGDGG 
VAREVVKHPLVEEVHQVEIDDRVVELSKQYLPAMACGFANEKLKLTIGDGFDYMKKHKNEFDVIITDSSDPIGPAVSLFQESYYELMKHALKDDG 
IVCSQGGSFWLDLDYIKKTMSGCKEHFAKVAYAVTSVPSYPCGHIGFVMGSLNKNQDFATPKLGKSEIDSIDLRYYSSEVHSAAFALPRWVQKHF 
YE* 

(SEQ ID NO: 936) 

Name: spermidine synthase 
Classification: enzyme 

Celera Sequence No. : 142000013383927 

GCCCTGGTCGACTTTGAGAAAGTCAGCGTCCCAGGATGAGATGTCATCAGTGCGCTTCTCCTTGTTCTCAACCTCTTCGGTAACCACAGGATCGT 
CCTTGTGATAGGTGGCCCAGTGGAGCACTTTCTTCAGGATCAGCGAGTTGACATTCGGCAACGGCAGCACACTGTTGTCGCTCTCATCGCCCAAA 
TCCTCTATTGCAATGCGAATCGTTTCCGAGCACTTGGCGATCTCCTGATCCGTGTCAAAGATCTCCTTGTCCGCAGACTCCAGCCGAATGATGGG 
CATCTCTGCGTAAAAATTTTTATTTTTGCTTATATCTATATTTTCAATTTTATTCGTACACACCTCCTGCTGATTTGGATCCCAAGATTTGATAC 
GAAATCGGTTTGATTTATTTGCAGACGCAGCCAAACAAAATTTTTAGACACAACACGATGTCACTTAGCTGTGGCCTACTGGCTAAAAATAATAT 
TCCCTTGCCTTCAACTTGAACCGGACTGGTGCGGCGGCCGCCTGAGAACAGAGCGTTTGTTAAATAAAGAACTTGGCTGCCAATAATTCCCCGGA 
TTCTTAGAAAACTAGAACGTCGTACGATTTGAAGAGGAAATTTGTTAGTGAAACGGCCGCCAGTGTGCACATACTCGGAATGGCTTTAAAAGATG 
GCGCCTATTACAGTTAATGGTGTGGAGACACTGTGACAGCTAGCGGTAAACAAAAGCTCAGTTATATTTTTTTATAATGTATATATATATATTTT 
TTTTAAACATATATATTTTGGCACAATTCAAGCGTGCTGGGC GTTGTATTATTTAAACAGAAACCTACAAAATTCTGCCTTTATAGTTAGTTTTT 
ATCCTGTTTTTATTTAATGAATTTCAAATTTCAAATATTTTTGTAAACAATTTTAAGCTAGGCCAAGGGTTTGTGTCGCGGCTAGTGCAAGCAGT 
CAACTTGCGGCATTTTTTAGTACATTTCCCTCTGTGCTGGGTTATATATTCGCAGGCTATACTCCACACTGCTTCCAGCGGAATCGTCAAAAAAA 
TAACATTAAAATCGCCGTGTGCATAATTAGAATATTTCCGAATAGTAATCGCCCGTGAAATCGCTAATTAGCGGTCAGCACTTGCGCCAGTTAAA 
GAAATCGCGGCACCGGCAGTGGGCCAAAAGAACATTTTTTCCGGCTGAAAAAAACAGGCGGCGGCGTCGACACGCGGACCGAACCAAAGGATCCG 
AAGCCATCCGATCAGATCTAATCCGATCCGGAGAAAATTCCAGGACACTAAGGACTTCCGAAACTACCCCGCACAAGAGATGGACGTGTACGACG 
TGAGCGATGATCTGATAAAGAAGCTGCAGGACTGGAAACTAATCGGCATCATATCCGGCAAGTTCAACGAGCTGAAGGAGAACCATCGCAAGGTT 
GACTTCGTGATGCGCGCTCTTATTGACTTCAAGTACATTGAAAAGATATTCCTGAATGTAACGCTTCGAGAGGACAAGTGCAACAAGCGGAGCGT 
GGAGTTCCGTATGCTGGGCAACGAGCAGTTTTCGCTCAAGAACAGGAATTACTTCCAGGTAATCATTGCCCCCTCAATCGGCAAACTTAAGCTAA 
TCCCTGCGACTCGTTTCAGGCTCTAGAGCTGTACAACAAAAGCATCTGTTATGCCGAACCGAATTCGGAGCACCTGTCCATCGGCTATGCTAATC 
GGTCGGCGGTACTATTCGAGTGGAAGCGCTATCGGCAATGCCTGGACAATATCAAGCTGGCAAGGCAGGCCAACTATCCGGCGAGACTGAGCCAC 
AAACTAGACAAGCGAGAAAGGGACTGCCAACAGCTACTCGATCAGCAACCGCCGGATGTGGTGCCCTATGAGTTCAAGCTCAGTTTCGAACCGCA 
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TGCCCAGGTGCCCTTCATCGCCGACTGTTTGGAGTTGCGCGAAACCGCCGCCGAGGGTCGCTTTGTGGTGACCAATCGGGATTTGGCCGTTGGGG 
ATCTTGTGTCCGTGGAGGAGCCCTTCTGCTCCACACTGCTGACGCCGATGCGATACATTCGGTGTGCCACCTGCAAGCGGGAGAACTATCTGACC 
CTCATACCCTGCGACAGCTGCTGTTCCACGATGTTCTGCTCCGAGGAGTGCAAGTCGATTGCCATGCAAACCTATCACCGCTACGAGTGCCCCAT 
CATTGACTTTCTGAACCGCATGTTCAACAAGATCCACTGTATCGCACTGCGCACCACACTGGTCGCGCTGAATATCTTTCCCAGCATCGAGGAGC 
TAATTGACTTCTGTGAGCAGGAGCAGAACCAGGACAAGTGCGCCTTTGACCTCAACTACAACGAGCTGACGCCGGAGGAGCACTACCGGGCCATC 
CATGGACTGGTGACAAACCAGCACCTGCGCTCCGTCTCCGATCTCTTCCAGCGCTCCGTGGTCTGTGCTGTGCTCAAGCACTTCATAATCGAATA 
CACACCCGTCAAGGAATATTTGGGCGGCGAAGAAGGCGTGAACTTCTTCACGGACCTCCTGTTCCGACATCTGCAGACATCTCCCTCGAATATGC 
ACGGCATCGACCTGGTCGAGCAGGTTAACGAGACCAAGGACGACCAGACGCACTCGTCTGGGGCGTACGCCTTCCTCTCGCTCATAAATCACTCT 
TGTGCGCCGAATACTGTAAGGATATACGAGGGCACCAAGGCGTACATGTTCGTCCTGCGGCCTATTAAGGCGGGAAATGTGCTCTACGACAATTA 
CGGGTAAGTAGTAACCAAAGCATTCTGACCACACCACTTTAGTTCAAGCTAATAGGGGTCCAAGTCACTGCAGCTCTCTTCAAATACTAAAACTA 
ATAGATATCCAACTTTTTAGTGCTCACTTCGCCATCTGCAGCAAGGAGCAGCGTTTGAAAAGGCTGTCCCTGCAGTACCGCTTCGACTGCAAATG 
CGAGGGCTGCGAGCTGAACTATCCTATGTTCGGAATGATGCCGCACAAGGCGACGGTGCCATCGGTTACCGATGACACGGAATTGGCACTGAGCT 
CCTACAACTATGATTTCGCGGTGAGCAACTATCGAAAGTACTGCGACTTCCTCACGCAATACGGTGACGACTATCCGTGCGAGCAGATTAGCTCC 
GCGGAGGAGTGCCTAAAGATGGCCCTGCATATCATGGCGGACGCAGTGCCGCTCAAGGCGAAGATGTAATCGGATCCTGCCGCTGAACCAACCCA 
CACAACCAACACGGACGAACTTTTCCTTTGCTAACTTTAACTGGAGACAGCGTATTACAAGCGAGACACAAATCAAGGAGAGTAATTGATTATAC 
ATACATGTTGTTGCCATAAGTCTGTAGCGCAGGCAGCAGCGCGATGGGGAGGGAGCATGCCCTTAACTTTTATATTTGTAATTGTACATTTTCGA 
CCTTTGATTATGTTTTACCCCGCACCAAGCCCGCACACCACACCCACACAACGCCCAGCACACCCACACAACGCCCAGCACACCCACACAACGCC 
CAGCACACCCACCACACATTTCACAACAATTCGAAAGTTCAAAATTCTAAAGTTTATTTATTTTTTGATCACAACAATGTCTTCGATGAAAACAA 
AC AAAC AAAG GAT TAT TAT G C AAC CT AC G T CGT T T T T AT AAT T ATT T T AC AAAAAC C AACT G AAGC AG AAAT AAAAT AT T TT C AAAAC G AAGT G A 
CTTTGGCCTCAAGGCATTTTGCTGTACAGGTCCAGCAGGTTCTGGTTCAGCAAGGATCCTCCCCGGGACATGGACGTCGCCACCTTTTCCGAGGA 
CGATGCCGATGTCGTGGGCTGCAGCTGTACAAACTCGCAGCCAGAGGAGCTGGCCGCTGTGTCATTGAAAAGCACATTGCCGAAGGCCTGCTCGA 
ACTGCTGCTGGAACTGGCGGGCGTTGATGGGCTGCAGATCCTCACTTTGCGCCGGATTTATGTTCAGCTGAGCGGCCAGCTGATCCATGCTGTTG 
GGCTCCCGTTGTTGGCGCTCCCGCTGGTCGGCACTCTTCGCATAGATATTATCGCCGGCTGACTCAACGTATTCCGCGTAGTAATCTTGATCCAC 
GTCCCTGATGAAACTACCGTCCATGGCCTGAAAGCCCATGCGAATCAAAAGCGATATCTTTGTGTTAAAGCACTGCAGCCAGGCCCGATTCGTCT 
GGTAGATGGACAGCTGCGCATCCTCACCGCTGGCATTCCAAAAGTCCAGCAGCGTGTTTTTCGTGAAGTCAATCAGCATATGCCGTTTGGCATCC 
GGATACAACCTGTTGATCGCCTCCAAGTGCCGAAATGTAATCTCCAGCAGATGGTGCTGATGGCGGGAAACGAAAAGCTGCCACTTGTAGACCAT 
AGTCATCAGATTCCACAGCTTGGACATCGACTGCTCGTCGAGTCGCATCAAGGAGCAGGTGGCAATGTCGTTCAGCATAAACTTGCAATGCTCGG 
CGGTCAAAAGCTGCGCACTATTGTGTTTCGATCCGTTTAGGAGCGAGTCTATAAACTTTGGTTCGAGCAGCACAGAAGTCACATCGTGGATAACT 
GGTGGAAGATATATATATAGTGTCTTTTAAATTTGAATTTGAATTTGAATTGAAGCATATAGCTATGCAGAAGACTC 
(SEQ ID NO: 937) 

Exon: 1001.. 1578 
Exon: 1635.. 2758 
Exon: 2871.. 3637 
Start ATG; 1315 

Transcript No. : CT24 669 

CGCAGGCTATACTCCACACTGCTTCCAGCGGAATCGTCAAAAAAATAACATTAAAATCGCCGTGTGCATAATTAGAATATTTCCGAATAGTAATC 
GCCCGTGAAATCGCTAATTAGCGGTCAGCACTTGCGCCAGTTAAAGAAATCGCGGCACCGGCAGTGGGCCAAAAGAACATTTTTTCCGGCTGAAA 
AAAACAGGCGGCGGCGTCGACACGCGGACCGAACCAAAGGATCCGAAGCCATCCGATCAGATCTAATCCGATCCGGAGAAAATTCCAGGACACTA 
AGGACTTCCGAAACTACCCCGCACAAGAGATGGACGTGTACGACGTGAGCGATGATCTGATAAAGAAGCTGCAGGACTGGAAACTAATCGGCATC 
AT AT CC GGC AAG T T C AAC G AGC T G AAGG AG AAC CAT C GC AAGGT TGACTTCGT GAT GCGCGCTCT TAT T GAC T T C AAGT AC AT TGAAAAG AT AT T 
CCTGAATGTAACGCTTCGAGAGGACAAGTGCAACAAGCGGAGCGTGGAGTTCCGTATGCTGGGCAACGAGCAGTTTTCGCTCAAGAACAGGAATT 
ACTTCCAGGCTCTAGAGCTGTACAACAAAAGCATCTGTTATGCCGAACCGAATTCGGAGCACCTGTCCATCGGCTATGCTAATCGGTCGGCGGTA 
CTATTCGAGTGGAAGCGCTATCGGCAATGCCTGGACAATATCAAGCTGGCAAGGCAGGCCAACTATCCGGCGAGACTGAGCCACAAACTAGACAA 
GCGAGAAAGGGACTGCCAACAGCTACTCGATCAGCAACCGCCGGATGTGGTGCCCTATGAGTTCAAGCTCAGTTTCGAACCGCATGCCCAGGTGC 
CCTTCATCGCCGACTGTTTGGAGTTGCGCGAAACCGCCGCCGAGGGTCGCTTTGTGGTGACCAATCGGGATTTGGCCGTTGGGGATCTTGTGTCC 
GTGGAGGAGCCCTTCTGCTCCACACTGCTGACGCCGATGCGATACATTCGGTGTGCCACCTGCAAGCGGGAGAACTATCTGACCCTCATACCCTG 
CGACAGCTGCTGTTCCACGATGTTCTGCTCCGAGGAGTGCAAGTCGATTGCCATGCAAACCTATCACCGC TACGAGTGCCCCATCATTGACTTTC 
TGAACCGCATGTTCAACAAGATCCACTGTATCGCACTGCGCACCACACTGGTCGCGCTGAATATCTTTCCCAGCATCGAGGAGCTAATTGACTTC 
TGTGAGCAGGAGCAGAACCAGGACAAGTGCGCCTTTGACCTCAACTACAACGAGCTGACGCCGGAGGAGCACTACCGGGCCATCCATGGACTGGT 
GACAAACCAGCACCTGCGCTCCGTCTCCGATCTCTTCCAGCGCTCCGTGGTCTGTGCTGTGCTCAAGCACTTCATAATCGAATACACACCCGTCA 
AGGAATATTTGGGCGGCGAAGAAGGCGTGAACTTCTTCACGGACCTCCTGTTCCGACATCTGCAGACATCTCCCTCGAATATGCACGGCATCGAC 
CTGGTCGAGCAGGTTAACGAGACCAAGGACGACCAGACGCACTCGTCTGGGGCGTACGCCTTCCTCTCGCTCATAAATCACTCTTGTGCGCCGAA 
TACTGTAAGGATATACGAGGGCACCAAGGCGTACATGTTCGTCCTGCGGCCTATTAAGGCGGGAAATGTGCTCTACGACAATTACGGTGCTCACT 
TCGCCATCTGCAGCAAGGAGCAGCGTTTGAAAAGGCTGTCCCTGCAGTACCGCTTCGACTGCAAATGCGAGGGCTGCGAGCTGAACTATCCTATG 
TTCGGAATGATGCCGCACAAGGCGACGGTGCCATCGGTTACCGATGACACGGAATTGGCACTGAGCTCCTACAACTATGATTTCGCGGTGAGCAA 
CTATCGAAAGTACTGCGACTTCCTCACGCAATACGGTGACGACTATCCGTGCGAGCAGATTAGCTCCGCGGAGGAGTGCCTAAAGATGGCCCTGC 
ATATCATGGCGGACGCAGTGCCGCTCAAGGCGAAGATGTAATCGGATCCTGCCGCTGAACCAACCCACACAACCAACACGGACGAACTTTTCCTT 
TGCTAACTTTAACTGGAGACAGCGTATTACAAGCGAGACACAAATCAAGGAGAGTAATTGATTATACATACATGTTGTTGCCATAAGTCTGTAGC 
GCAGGCAGCAGCGCGATGGGGAGGGAGCATGCCCTTAACTTTTATATTTGTAATTGTACATTTTCGACCTTTGATTATGTTTTACCCCGCACCAA 
GCCCGCACACCACACCCACACAACGCCCAGCACACCCACACAACGCCCAGCACACCCACACAACGCCCAGCACACCCACCACACATTTCACAACA 
ATTCGAAAGTTCAAAATTCTAAAGTTTATTTATTTTTTGATCACAACAATGTCTTCGATGAAAACAAACAAACAAAGGATTATTATGCAACCTA 
(SEQ ID NO: 938} 

Start ATG: 315 

MDVYDVSDDLIKKLQDWKLIGIISGKFNELKENHR^ 

SICYAEPNSEHLSIGYANRSAVLFEWKRYRQCLDNIKLARQANYPARLSHKLDKRERDCQQLLDQQPPDVVPYEFKLSFEPHAQVPFIADCLELR 
ETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDSCCSTMFCSEECKSIAMQTYHRYECPIIDFLNRMFNKIHC 
I ALRTTLVALNI FPS IEELI DFCEQEQNQDKCAFDLNYNELTPEEHYRAIHGLVTNQHLRSVSDLFQRS VVCAVLKHFI I EYTPVKEYLGGEEGV 



FIGURE SHEET 505 



NFFTDLLFRHLQTSPSKMHGIDLVEQVNETKDDQTHSSGAYAFLSLINHSCAPNTVRIYEGTKAYMFVLRPIKAGKVLYDNYGAHFAICSKEQRL 
KRLSLQYRFDCKCEGCELNYPMFGMMPHKATVPSVTDDTELALSSYKYDFAVSNYRKYCDFLTQYGDDYPCEQISSAEECLKMALHIMADAVPLK 

AKM* 

(SEQ ID NO: 939) 

Classification : known_f lybase_gene 
Gene Symbol: BcDNA:LD29892 
FlyBase ID: FBgn0027495 

rplera Seauence No. : 142000013384673 

CGAGACCCCCAATTCCACACTATGGAGTAGCGACGGATGCATTTGGAGCGATGAAAGCAGCAAGAGCAGCCGGCATCGTTCGCAACTCTCTGTCA 
AAATGGCCCGAGAACACTGTGTCGTATACGCAATGCCGCGCAGCCACTAGTTGCTGGTGTTGTGATGATGATGATGATGAAGGTGCTATTTGGTA 
TGGMGTGACTGTTGACTTTGCCTCCGTTTCGATTTGAGTTTTTGGGTGTGTTTGCTGTGGCGGTTCTCTTTCGTTTTTTTTTTTTTGGGACTTT 
TTCCGGAGAAGAAACAACACATATACTCACTTGTATATCGTGTTCCTGAACAACAATATCAATAGTTTTGGTTAGCTCGTGTATTTGCCCTCGTG 
CTGTGGCCATTTCTATTGTGGTGTGTGTGCGTGGAGGGTTTTTGGCTTTTTCCACCAGCAAAGAGATAAACTCAAAAATGTGAATGGGAGCTCAT 
TCACCGGGTTTTCGGGGAGTTCGGGACCTATGTATGTATGTATGTTTGCTCATCAGCAGGTTGCCAGCTTTCTGCTTGCTAATGGCCCAACAACA 
AACAGTCGGACATTTCGAAATTGTTTCTAGAAATTATGAAATCGGAGCAGCGTCGAAAAAAGAAGATAGATAAACCCAATACGGTTGTAATTAGT 
AGTACAAAAATAAAAAAAAAAAACGAAAGTCAGGCACTGCTTCGCATTCATTTGAAGGGCCATGAACTGTGAGCACTTGAAGAGTGCTAATTGAT 

TAAGGCAAAAGCAAAAGCAAG^^AACCGCGAT 

CTTCCGGGCGACTCTCGAACTTTGTTCCCTTTACTTCCTGGGAAACTTTATTAATGAATTAATTCAAGTACAGTTGCGGTTAAAATAATAACACT 

ACTGCAGTTGCAAAGTTGATTTCCTAAAAAAAATTATTAAATATTTATTTAAGTCAGATTGCATGAATAATAAGTACCATATGTAGTCTCTTTAA 

GTAGTAAATGTTAACTCTCTCAATGTAACGGTTCTTTTTGTTTTTGGTCACTTGCTGCAAAAGTGCGCGAAATTAGGCGGTAACAAAAATAGTAC 

TGATTACGTATTTGCTGAATAAAATTAATAAAAAATACAAAAATAAATTTGATTAAAACAAAATCAAACAAAATGATCCGCAAAGCACTGCTGTT 

TGTCGAAAAGAACGAAACACGGCGAAGAAAGCCCTCAATGTCCAACGTGGAGATCAGGCGCTTGGTTCGGCAAAGCAAGAAGGATCAAGATGAGA 

GCAAAATTGTGTTGTTTGGTGGGAAAGGCTCTTGGTCTTATGTTCGGCGTCCACCACGAACTGAATATAATCATCGCTTCACCTTTAAGGCGGTA 

AGCACATCATGGATCAGCACATTTACACAGATATCCTGGAAAATCTGATGCTGCCATATGCCGAGGATGAAATGCCGTTGGTTTGGACATTTCAA 

CAGGATAACGACTCAAAACACACGAGCAAATCACTGTAGTATTTAAGTTTAGTCAGCAATATGATGAAAAAGTGTTAGAAAATACAGAAACGGTG 

GGCCAAACACGAAATGTGCTTATGGTGCTATTATTGTTACCGCAACTGTACAATACGACCATGGATATTCGATATCGTTTCGGGTCAATTCCGCT 

TTTAGTTTCGTTTCTATCGCATCGACCAATCGGGCCGTGCGACAAATTTGCGTTTTTTATTGATAAGGCGCTCCGCCGTACCCTCGAAAACCCCT 

TGTGCAGTGATAAGCGCCGAAATTTATATGACCATGGGTGGCACGTCATAAACATATATTAGATTAGAGAAATTAAATTCTTCTTTCAGCGCCAA 

CGGCCAGCCAAACATCTTACCTAACTTGGCCGGATTTAGTGTTTAATCTAATCGCAGTATTACGGCGCACATAAATATGTACTTCGGTCTCGACT 

GGGTGCAATAAAAAAAACAAACGTGTAATATATTTTAAGTCCGGATCTTACATAACAGTCTCGCAAAGAATGCCGGGTGCATTCACAGCCAGAAA 

TTATTATGTTTTATTTTATGCGGCATCTCTGAATGTTAACTAAATGTTGCACATCGCCTTTATTACTATTGTTTCAACGATTAATGTTTGATTAG 

GAATTTCTGTCATGTCAGCATATATAGCGAAAATATGCCCCAACTAAATATCATGGAATAAAGTAGCATCATCACTCCCCGAAGCGAGTTCCAAA 

ATGATGATTCATTTGCCTCCAAATTCGGTTATCACAAAGGGGTACAAGCCCCAAAAATGTTGACGCATCTTTGGGTACATTTGGCTTTTGTTAAT 

AATATGGGATTCACCTTGAAGGTACGCAAATGTGCCAAAATTACTGCACAGCATTAGTTATTTTTATATAGAATAAGCAAATAGGTTTTATTTTT 

AGAATTTCAAAAAATCAGTTATCTTCTTTACAAAAAAACCAGGGCTGCGCCTTTTGTTGGTTTGAGTCCAGGGTGGATGAACGTATTTGTCAGCA 

CTTTGTGACCATATGTTTGCGATAAGCATATATGCGTAAGCATATGTTAAATATTTGAATGTGCGTCAAATTAGCGTCGTTGAGTTAGCCGCGAT 

rrrTGAACATTGGAAAATTTTTTTATTTATTAATTTTTTGTTATATAGAGGTGAATTCAAAGAAATAAAAATTCAAAAATGTGTCGCGCGTGCTT 

GTCCCGCCGTATCTCGCCAACGTTATCTAGTGGCTAATTTACTTTCTTTAGTTAAATCTTTCTTTTCGTAACATTATTGCATATCTGTTTTTTGA 

AATGCACCGCGAAATAACTATATATTCAATAAAACAATACCTAATAGTTATTGCAACTAAACCAATTAAATTTTTATTTAGTTGTATAAAATATT 

TGTACGATATATCGTGGTTAAATCTTGCAAACTATCGTCGCTGGTGTTCCGGGTGAACCACGCGGCGGTCACACTGAAAACATCGCTGGTATTTC 

GTAACGCTCAGCAACGGTCACTCTGCCATCCTTCTTTTTATTTCCGCCGTTTGCCGAGAAAGTTGGAAAATTATGGGTGAGTTGCTGAAAAGCGT 

AGTTTTTGTGCGAATTAAGCGAACAGCGCTGACAAACAGTGAACAAACGGAGCTTACAATGCGGCGCGGCTGAAAGAGTAAGTGAATTTGCGCCG 

AAAGTGCGAAAATATTGGCAAATGCAAGTCGCGGAAACTTGCGATTTCAGCTGTCCGAAAATCGCACAAAATCCAGTGAATCTTTGGTTCCAGCG 

AACAAATATTTCGTTTCACATGCAAAAGCAATCGATGTGCACAAGGCACTAAAGTTATTTGTGTGAAAAATGCATTTGGCAGTATTTCCGAGCCT 

GGACTCTGTCTAACCTCAAAACCGAAGCCGCACAAGCTGCCATTTGTTTACATTGTGTTGTTGCATCGCAGACGCATATTAACTATAGCTAATGT 

ATACGTTTAATCTACAACAGGCAAGCCAAGAGGTCTGCGCACTGCCAGGAAGCATGTGAACCACCGTCGCGACCAGCGTTGGGCCGACAAGGACT 

ACAAGAAGGCTCATTTGGGCACCAGATGGAAGGCTAATCCCTTCGGAGGTGCTTCCCACGCCAAGGGAATCGTCCTGGAGAAGGTGTAAGTACAC 

GGTGGTTGACCCCACCATTGAAGTGCTATGCCCGCCATCCGGATGAGATGGCGGGGTTAATAACACGTCTCGACGCCTCGCAGTATATTTTTGTG 

TTTTGTTTCGTTTTGCAATCCAAATCACAGGATTATACTGTGAGATGACCAGCGTGCCAGCAGCGAAAAGCGACTGATATGATATCAATTATATT 

AATCTCTTAGTAGTATATCAAAACTAATCGGTTTCCTCTACTCCACAGCGGCGTCGAGGCCAAGCAGCCTAACTCTGCCATCCGCAAGTGCGTGA 

GGGTGCAGCTCATCAAGAACGGCAAGAAGATCACCGCCTTCGTGCCCCGTGACGGTAGCTTGAACTACATTGAGGAGAACGACGAGGTCCTGGTT 

GCCGGTTTCGGTCGTAAGGGTCATGCCGTCGGTGATATTCCCGGTGTGCGCTTCAAGGTTGTCAAGGTCGCCAACGTCTCCCTGTTGGCCCTCTA 

CAAGGAGAAGAAGGAACGCCCAAGATCTTAGATGAACATGCATCTAATCTTTTTCCAGACAAGCAGAGTTAGGGTTTTACAAATCTAAATAAACA 

AAAAACAGAAACAAACTTGTTAAATGTTTCAATTAAATGTGATTTCCTTGAATATATTTGTGGCTCGCATGTTTGTGTTTACTGATGGrAGAATA 

TAGTGATTCCAATCGATTCTGTTTTATAAGGCCGATGTACTTAATGAGTTACATACTTGCATATTTAAACTATAACAGTTTACAGTTTTCGTTGT 

TTTCAAGTATTTTCTGTAAAATTTAAAAATGATTATTGCCCTATATTCACATCTATGCGAAATATATTTCCCAAATTTTCGAATTTCAAAAAATA 

TATAGGTATTTTTGAATCCTCTTATTGGAAAAACAATAAATATGAACATATATATTTTAAAAAGAATATATGTTTATCTTATTATTAATCAAGTT 

TTTAGACTACCTGGCAACCATGGTAGTGATGCAAAGCGAAGATGCTCGAGTGCTTTCCAGCACTATTAGCCGCGTAAATTCGAATTTTTTAAATT 

TGCCTAAGCGAACTCAGGACTTGCTCTTAGTTTCAGTTTCGAAAACTATGCATTCCACTAATAACTTAATTGTCAACCACACATTGGCCACCAAA 

TTGTGCGATGCCGGAACACTCCTTCGATGCGGCGAACTGTGCGATCTGCAGGCGGTCGTACCTACCGCAGTTGCTATTGCCAAGTATTTCTTACA 

TAACAGAACGGCAAGAGCCGCGGAAACAGACAGCCATCATCAAAGCATGAGGCCCGGACCTGGACCACAGCAACGTGTGCCGGGCTCCATCAGCG 

ATAAGGAGTCACAAACTGCACCTCCTACCTGGCGGTGCACAGCGAGAAATCGGAGCACTTCGAATCGGGCTCGATTTGGAGTGGAGCTGTCTGAT 

TTCTTGCAGTGCCTCGGTTCGATCTGCTTATCAATAATTTGTTTTTTTTTCTCCTCTTTGCCGCATTTAGGCAGAGATTCGCTGGCGATAGCCGC 

TCAAGTGTCACACGCCCACATTTTTGGAGCAGTCGATGTCGATGGAGCGGCTATTTGCACATACGCCACGCCAATAAACTTGTACCCA 

(SEQ ID NO: 940) 



Exon: 1001.. 1348 
Exon: 1420.. 1501 
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Exon: 3052,. 3116 
Exon: 3536.. 3695 
Exon: 3944.. 4313 
Start ATG: 1273 

Transcript No. : CT24703 

AAGTCAGATTGCATGAATAATAAGTACCATATGTAGTCTCTTTAAGTAGTAAATGTTAACTCTCTCAATGTAACGGTTCTTTTTGTTTTTGGTCA 
CTTGCTGCAAAAGTGCGCGAAATTAGGCGGTAACAAAAATAGTACTGATTACGTATTTGCTGAATAAAATTAATAAAAAATACAAAAATAAATTT 
GATTAAAACAAAATCAAACAAAATGATCCGCAAAGCACTGCTGTTTGTCGAAAAGAACGAAACACGGCGAAGAAAGCCCTCAATGTCCAACGTGG 
AGATCAGGCGCTTGGTTCGGCAAAGCAAGAAGGATCAAGATGAGAGCAAAATTGTGTTGTTTGGCGGTAAGCACATCATGGATCAGCACATTTAC 
ACAGATATCCTGGAAAATCTGATGCTGCCATATGCCGAGGATGAAATGCCCAACGGTCACTCTGCCATCCTTCTTTTTATTTCCGCCGTTTGCCG 
AGAAAGTTGGAAAATTATGGGCAAGCCAAGAGGTCTGCGCACTGCCAGGAAGCATGTGAACCACCGTCGCGACCAGCGTTGGGCCGACAAGGACT 
ACAAGAAGGCTCATTTGGGCACCAGATGGAAGGCTAATCCCTTCGGAGGTGCTTCCCACGCCAAGGGAATCGTCCTGGAGAAGGTCGGCGTCGAG 
GCCAAGCAGCCTAACTCTGCCATCCGCAAGTGCGTGAGGGTGCAGCTCATCAAGAACGGCAAGAAGATCACCGCCTTCGTGCCCCGTGACGGTAG 
CTTGAACTACATTGAGGAGAACGACGAGGTCCTGGTTGCCGGTTTCGGTCGTAAGGGTCATGCCGTCGGTGATATTCCCGGTGTGCGCTTCAAGG 
TTGTCAAGGTCGCCAACGTCTCCCTGTTGGCCCTCTACAAGGAGAAGAAGGAACGCCCAAGATCTTAGATGAACATGCATCTAATCTTTTTCCAG 
ACAAGCAGAGTTAGGGTTTTACAAATCTAAATAAACAAAAAACAGAAACAAACTTGTTAAATGTTTCAATTAAAT 
(SEQ ID NO: 941) 

Start ATG: 273 

MSNVEIRRLVRQSKKDQDESKIVLFGGKHIMDQHIYTDILENLMLPYAEDEMPNGHSAILLFISAVCRESWKIMGKPRGLRTARKHVNHRRDQRW 
ADKDYKKAHLGTRWKANPFGGASHAKGIVLEKVGVEAKQPNSAIRKCVRVQLIKNGKKITAFVPRDGSLNYIEENDEVLVAGFGRKGHAVGDIPG 
VRFKVVKVANVSLLAL YKEKKERPRS * 
(SEQ ID NO: 942) 

Name: ribosomal protein S23-like 
Classification: ribosomal_protein 



Celera Sequence No. : 142000013384 673 

CGAGGAGGCAGACGCCACAAATGCCACGGACCAAATCATGAATACTTTGAATGCAGCCAAAGCAACGCTTCAACAAGAGCAATATTTTAATAACA 
AT AC C AATCT T AG T AGG AACT T C AAT C AC AAT C AC AAC C AG AGC AT C AT C C AC AT C AGT AGC CAG AAAGC C AACGG C AAGAAGTC C AACG AAAC A 
CCAAAAGATATTTATAATCTGAACTCAATCGAGCATATCGACTTGATTTTGGAGGATAAAAGCCTTACCAAAGCAATACACGTTACCGGCTCTAA 
ATGCTTTGTCACCATGAAGAAAATCCGACGGCTGGGAAAATATTTTCCCGTGATGAACTTCTCTTAATAGACACAGTTACTTCTTTGCAAAGTGC 
ATTCTAAATATTAAAAAGAAACTCGGTTAAGCGAACTACATAGAAAATATGCATGTATTAATTAAAACGTATAAACGTCAACGTTATCCCCCGTA 
TATAGGAAAGAGCGCGAGAATATTTAAATTTAGTGGTAGCGTAATTGTTTTGTCTGTCGTCGTCATTACGCTTAAATTATTCTAAATATATTAAT 
TTATGTTAATTATAAGTCTAATCACTCAATGCAATATTTAGTGAAAATACATTAATGCCCATATGACAATTGACTAAAAAGGTTATTTGTTTTCT 
TTTTAACAAATTGTGACGAATTAAAATGGAATGAAACTCGCAACTCGCACAATCTAAAATTGGCAATTCAAATGTTTGCGCCCAACTACAGTGTG 
TACACACCACAAATGTTGTAGAGTAGGTAAAGGAAATAAAGCAATAAACATTTAACTACTTAAAGCTCTTAAGATGTAGTTAAGCAATGATGGAT 
GTCACCAAGCAATTTTTTTTTATTGTAAGAAAAATATACAATATACAAAAATATATTCTTTTATTTTACAAAATGATGCGTGTAAAATACATTGT 
ACCTAATGCAGGTACCCCAAACGAGTACAAGCCACCCTTTTCAACTCTGGTCACACAGAGCACATCCCGTAAAGTTTTTCAGAGATGAGTTTGCG 
GCAGGAAAATTGAATTATTCACGGAAATAAACAAAGTAAACTAAGTGGAAACTGTAATCCTGACCAGCAATTATGTTGCGCATCTATCAAAATAC 
TTACCGGTGAGAGACCACGCCCGAAACGCGAAAGCCTTACGTGCAGATTTTCTGCACGATTTCGTCATCGGCGAATCTCATCATTCAATGTGTTT 
TGCCCACAGGCGCACCGCGAGAAAAGCTGTTGTCTACTCCACCAAGGTCGCCTGCTGCAATCATTCCACGCTCTGTGGCATCACCAGCCATCCAC 
GTCGGGCGCAGGATAGCGGGAGCTCCAGTTCGAATGGCAGGCACCGTGGCCACGAGGAGTTCCTACTTGCCGGCAATCCGGCGCGGGGCTGGCAA 
ATGCCTCCGCCGTCGCGTGGTTACGGAATGCTGGTGGTGCGCATACTGCGGGGAGCCCTCAAGCTGCGATACATCGTCCTGGGCGGCGCCATCGG 
CGGGGGCGTGTCGCTGAGCAAAGTGAGTTTCGCAAGTTATATAACTTGTATAGAACTTAGGAGACATGATTTGTTTTCCGCAGAAATACGAGGAA 
TGGAAGGATGGACTGCCGAATTTTAAGTGGCTGGAGGATGCCATGCCGCAGGGCGAACGGTGGAGCCAGTTCTCGCGGAATCTCATTGAGGTGGG 
CTCGCTGGTGAAGAACGCCATCGAAGTTGGTGAGTGTCCATGTAGACCTACCATCGCCCTTTCAATCTCTCACTTTAAATGGTCATATACCTTGG 
TCAGATTCGCATTGAACTCTTCGTTTTGATTGATTGATACTGATGTTATTGGGGCGTTTGCTTGTAGATCCAAAGCTCAAGCAGCTGGGCGAGGA 
TAAGTTGTCGGAATGGCGCAACTGGTTTGACAGTCGTCTGGACGATGCCATCGAGGCGGCCGATTATCAAGGAGTTCAGATTGTCGAAAGTAGGT 
TACACAAATTACACAGACTCCCAACGTTGGGACCATTTAAAGTGAGAGATCCCTATCTACCGCATTCGAGACTTATTCCTAACATCTGTTGCAGC 
CAAAGATGACTTAAAGGCCAAAACAACGGTGGCCGCTTTAGGCATAACATCCGATGAGAGTCGTAAGAAGTATGGTAGGTTTTTTGCACACCTGC 
CGAAAGATATACTATATCTAATCCTTCTGTGATTCCGCAGAGAAACTCCAGAGCCAGGTGGAGACGCTGCAGACGGAGATCATGAACGTCCAGAT 
TAAGTACCAAAAGGAGCTGGAGAAGATGGAG7AAGGAGAACCGTGAACTACGCCAGCAATACCTCATCCTGAAAACAAACAAAAAGACCACGGCAA 
AAAAAATCAAAAAATCCCTGATCGACATGTACTCTGAGGTCCTTGATGAGCTATCCGGCTACGATACGGGCTACACCATGGCCGATCACCTACCC 
CGTGTTGTGGTAGTGGGAGATCAGAGCAGCGGCAAGACCTCTGTCCTGGAATCCATTGCTAAGGCTCGCATCTTTCCCCGTGGCAGTGGAGAGAT 
GATGACGCGAGCCCCAGTCAAAGTTACTCTGGCTGAAGGACCATACCATGTGGCTCAGTTCCGTGACTCTGACCGGGAATACGATCTCACCAAGG 
AGTCTGATCTGCAGGACCTCCGTCGGGATGTGGAGTTCCGAATGAAGGCGTCGGTGAGAGGTGGTAAAACCGTCAGCAATGAAGTCATTGCCATG 
ACGGTCAAGGGTCCCGGTCTGCAACGCATGGTTCTAGTCGATTTGCCAGGAATAATTTCGGTGAGCAAGTCAATACAAGTAAAACTGCGCACATC 
TAATGCTAATATCATTTTAGACCATGACTGTCGACATGGCCTCAGACACAAAAGATTCCATTCACCAGATGACCAAGCATTATATGAGCAATCCG 
AACGCCATCATTCTCTGCATTCAGGATGGATCTGTGGACGCTGAGCGCAGTAATGTGACAGACTTGGTCATGCAGTGCGATCCCTTGGGTCGACG 
CACTATTTTTGTGCTCACAAAGGTGGATCTGGCCGAGGAGCTCGCCGATCCTGATAGAATAAGGAAAATCCTTTCGGGCAAACTCTTCCCCATGA 
AGGCTTTGGGCTACTATGCCGTCGTTACCGGTCGTGGACGAAAGGATGACAGCATAGATGCTATTCGGCAGTATGAGGAAGACTTCTTTAAGAAC 
TCCAAGCTTTTCCAGTAAGTATAGAGAGGATGTGATTGGTAAATTCCAAAGTTCACCTCTCTCCATACTTAGTCGTCGAGGGGTTATCATGCCCC 
ATCAGGTGACCAGCCGCAATCTGAGCTTGGCGGTGTCAGATCGTTTCTGGAAGATGGTGAGGGAAACCATTGAGCAGCAGGCGGATGCATTTAAG 
GCGACTAGATTTAATCTGGAAACCGAATGGAAAAATAACTTCCCCAGGTTTGTAAAAATGTTTTAAAAAATATAAGAGATCTATAACATTTGAGG 
TATACTCTTTTAGATTGCGCGAGTCTGGTCGAGATGAGCTGTTCGACAAGGCAAAGGGCGAGATACTGGACGAGGTGGTAACGCTCTCGCAAATC 
TCCGCTAAAAAGTGGGACGACGCTCTGAGCACCAAGCTATGGGAAAAGCTCTCCAATTATGTGTTTGAAAGCATCTATCTGCCCGCTGCACAGTC 
AGGTTCTCAAAGTAATATTTTGTAGTACTAAGTGTAGGTTGTAAGCAGATTTCGCCGACGTCATCGATGGGGGCACGTCCATCCAATCGCCCGCC 
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TTGTGTGCTCCGTCATTGTGTGTGTTTCTTGCAAGAAGATATGGCTTGGCCTCGAACCTCACGTGTCTCTGCAAAACGTTGTGTCGTTTAMTGT 
TTTGTTTAATTTTGTATTTATCATGGCCCGCATTCAGCGTCGAATTCCCGTGCAAGGGAATCACATGCGTTAAATAATTTCAAACCGGTTTGACT 
GATATTCCTGCAACATTTCTTGAATCACAAGTGTAAATTTAATGTGTACTCCCCGGCGGACGATTTCGAATCCCATTAACACCGTTTGCCTTTGC 
CCCAAAATAGTGACCCGGGATTGTCACAGGTTCTGCGTTGCGATCATGTGCCGGAAAGTCGCTTATAACGAAGATTTACGAGCACCTGTACCGAA 
ACCAGATTTTGGTTTATGTTCACATTATAAAAACGTATGCGATATTAGCGATTTATGACCGAAAAACCCGTTTAAGCCATCGCTCAGCTTGTGTT 
TATTTAGCATTTTATGACAGACCTGTTAATTTGCTCAATGAATTCATCCAAATCAATGTCTAATTGAATTAAATTGCAGTTATCGAGTTAGGAAT 
CTGAATGTTTCGTCGCTTTGTTTATCATTCCCCCCGTAGTTGTGTGTCTAGTTAATTTGGCTCAACTAATTTCATACGATTACTATGCTCTTTAT 
AGATTCCTTCMCACGATGGTAGACATCMGTTGCGCCAGTGGGCCGAGCAGGCACTGCCCGCTAAGTCGGTGGAAGCTGGCTGGGAAGCGTTGC 

agcagSt^tatcot 

ATACGTCGGCACAGCTGGGAGGACAAGGCCATCGACATGCTTCGCGTTATCCAGCTGAATACGCTGGAGGATCGGTTCGTGCACGACAAGCAGGA 
GTGGGATTCGGCGGTTAAGTTCCTGGAGAGTTCGGTGA^^^ 

TGCGACGAATTACCCACTGGCAGTACCTGACGCAGGATCAGCAGAAGAGGCGAAGCGTTAAGAACGAACTCGACAAAATACTCAAAAACGATACG 
GTATGGGTAGTATACATTATATAGTTAGTTATTCCTCATTTACACGCATTACATTATGTTGGTAGAAACATTTGCCCACCCTGACCCATGATGAA 

ctgIctacggttcgcaagaaccttcagcgtgataacgtggatgtggacacggactacataaggc^ 

TGTAATGCTAAACCATTTAGACTGCTTATCAGTTAGCACATATGTCTCTTTTTTCAGACACTTTCTGCAGCAGGCGTTACAGAGGGCAAAGGATT 

gcckaIag^attacctctacacgcagcagggagccgagtgtgaggtaagtgttaaggttttccctctta^ 

TTGCTTTCTCAGATATCCTGCAGCGACGTCGTGCTCTTCTGGCGCATCCAGCAGGTGATCAAGATAACAGGCAACGCACTGAGACAGCAAGTAAT 

caItcgggaggcgcggcggctggacaaggagatcaaagcggtgctggacgagttcagcgatgacgaggagaagaagggttacct^ 

agcgcgtgctactagccgaggagctaagtgagtgctttgcttaattgaatttggcgccaatttaattgctttcaattcaacagtcaaagtgcgg^ 

agatccaagagaagctggaggagttcatcaattcacttaatcaggaaaagtagtgtgccatgaccaaagacacaacacccgaggcgttacttatt 

taacttgatagtttaagaccacatttgaaa^^^ 

aaatagatcgttagttcgtattattgaaatcgtctttattgtttatgaagtgtaaccttatgtcgcatgcaatgcaatgtaaactcattaattta 

AATGTAATATAAATCATTCATTCTGTTTTAATGTAATTGTAGTTGATAAACGTTAACGCGAAACGCACTCATCGTGCTGAGTATGTAGCGGTCCG 
TGTGTGTTTCTGAATCTGTATCGTGTAATATATATATATACATATATATTTAAATCGGTTTGCATAAGAGCACAGCACTTTTACCAGATGCCTCA 
TCGTGCGGTTGATTGTTTGTTTCGTTTTGTCCACGATCGGCTAATCCCGCGATGCGCCGGCATCTGTATCGGTATCGAGATCGGTGGTGGGTGCA 
TTAACCCCGACCGATCGTCTGGCCGCGTCTCTAATTGATTGATTGTTTGATTGATAAACGTATATCTAGCTAACTATGTATGTAGCTAGCTATGT 
AACATAACAAGTCCGTGATTGCGGTGCAGTACCATATCTCTTACAAGGTAAACAAATGCTCGTTCCCTATCATGTTAAATAACTCAACAAACGAT 
TCTTAACTTGGCAGTCGACTTGTGGCCCAGCCATAGCCATGTGCTCAGCTGCTTAAGGCTATCGTAATAATATTAATAATGTGTTCGGTTCTCCT 
rCACTCCGATCCTCCGCTTAGTCTAGCTGCTAAATTCTAGTCCACATATGGGCTCCTCCACTAAAGCTTATAGAAACATAATTTACACTCGCGCT 
CCACTCTGCTCGTATGCGTGTGATTTGGGATAGGGTTGGTTGTGTGGTATTAGGGGTGTGTGTGGTAGGCACTCGATATCGCTGAAGGCACAACA 
TTACGATTAGTCTTATCTTAATTGTATTAAGTACTCGGAACACGCTTTCGCACTCGCTTCCCTCCGAAATCGGGAGTTCCTTCGACAGGATGAGT 

G 

(SEQ ID NO: 943) 

Exon: 1001.. 1146 
Exon: 1245.. 1542 
Exon: 1604.. 1739 
Exon: 1873.. 1989 
Exon: 2090.. 2164 
Exon: 2226.. 2815 
Exon: 2871.. 3244 
Exon: 3303.. 3467 
Exon: 3529.. 3707 
Exon: 4468.-4535 
Exon: 4725.. 4940 
Exon: 5006.. 5124 
Exon: 5188.. 5272 
Exon: 5333.-5537 
Exon: 5594.-5746 
Start ATG: 1118 

Transcript No * CT24795 

TCACACAGAGCACATCCCGTAAAGTTTTTCAGAGATGAGTTTGCGGCAGGAAAATTGAATTATTCACGGAAATAAACAAAGTAAACTAAGTGGAA 
ACTGTAATCCTGACCAGCAATTATGTTGCGCATCTATCAAAATACTTACCGGCGCACCGCGAGAAAAGCTGTTGTCTACTCCACCAAGGTCGCCT 
GCTGCAATCATTCCACGCTCTGTGGCATCACCAGCCATCCACGTCGGGCGCAGGATAGCGGGAGCTCCAGTTCGAATGGCAGGCACCGTGGCCAC 
GAGGAGTTCCTACTTGCCGGCAATCCGGCGCGGGGCTGGCAAATGCCTCCGCCGTCGCGTGGTTACGGAATGCTGGTGGTGCGCATACTGCGGGG 
AGCCCTCAAGCTGCGATACATCGTCCTGGGCGGCGCCATCGGCGGGGGCGTGTCGCTGAGCAAAAAATACGAGGAATGGAAGGATGGACTGCCGA 
ATTTTAAGTGGCTGGAGGATGCCATGCCGCAGGGCGAACGGTGGAGCCAGTTCTCGCGGAATCTCATTGAGGTGGGCTCGCTGGTGAAGAACGCC 
ATCGAAGTTGATCCAAAGCTCAAGCAGCTGGGCGAGGATAAGTTGTCGGAATGGCGCAACTGGTTTGACAGTCGTCTGGACGATGCCATCGAGGC 
GGCCGATTATCAAGGAGTTCAGATTGTCGAAACCAAAGATGACTTAAAGGCCAAAACAACGGTGGCCGCTTTAGGCATAACATCCGATGAGAGTC 
GTAAGAAGTATGAGAAACTCCAGAGCCAGGTGGAGACGCTGCAGACGGAGATCATGAACGTCCAGATTAAGTACCAAAAGGAGCTGGAGAAGATG 
GAGAAGGAGAACCGTGAACTACGCCAGCAATACCTCATCCTGAAAACAAACAAAAAGACCACGGCAAAAAAAATCAAAAAATCCCTGATCGACAT 
GTACTCTGAGGTCCTTGATGAGCTATCCGGCTACGATACGGGCTACACCATGGCCGATCACCTACCCCGTGTTGTGGTAGTGGGAGATCAGAGCA 
GCGGCAAGACCTCTGTCCTGGAATCCATTGCTAAGGCTCGCATCTTTCCCCGTGGCAGTGGAGAGATGATGACGCGAGCCCCAGTCAAAGTTACT 
CTGGCTGAAGGACCATACCATGTGGCTCAGTTCCGTGACTCTGACCGGGAATACGATCTCACCAAGGAGTCTGATCTGCAGGACCTCCGTCGGGA 
TGTGGAGTTCCGAATGAAGGCGTCGGTGAGAGGTGGTAAAACCGTCAGCAATGAAGTCATTGCCATGACGGTCAAGGGTCCCGGTCTGCAACGCA 
TGGTTCTAGTCGATTTGCCAGGAATAATTTCGACCATGACTGTCGACATGGCCTCAGACACAAAAGATTCCATTCACCAGATGACCAAGCATTAT 
ATGAGCAATCCGAACGCCATCATTCTCTGCATTCAGGATGGATCTGTGGACGCTGAGCGCAGTAATGTGACAGACTTGGTCATGCAGTGCGATCC 
CTTGGGTCGACGCACTATTTTTGTGCTCACAAAGGTGGATCTGGCCGAGGAGCTCGCCGATCCTGATAGAATAAGGAAAATCCTTTCGGGCAAAC 
TCTTCCCCATGAAGGCTTTGGGCTACTATGCCGTCGTTACCGGTCGTGGACGAAAGGATGACAGCATAGATGCTATTCGGCAGTATGAGGAAGAC 
TTCTTTAAGAACTCCAAGCTTTTCCATCGTCGAGGGGTTATCATGCCCCATCAGGTGACCAGCCGCAATCTGAGCTTGGCGGTGTCAGATCGTTT 
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CTGGAAGATGGTGAGGGAAACCATTGAGCAGCAGGCGGATGCATTTAAGGCGACTAGATTTAATCTGGAAACCGAATGGAAAAATAACTTCCCCA 
GATTGCGCGAGTCTGGTCGAGATGAGCTGTTCGACAAGGCAAAGGGCGAGATACTGGACGAGGTGGTAACGCTCTCGCAAATCTCCGCTAAAAAG 
TGGGACGACGCTCTGAGCACCAAGCTATGGGAAAAGCTCTCCAATTATGTGTTTGAAAGCATCTATCTGCCCGCTGCACAGTCAGATTCCTTCAA 
CACGATGGTAGACATCAAGTTGCGCCAGTGGGCCGAGCAGGCACTGCCCGCTAAGTCGGATCGGTTCGTGCACGACAAGCAGGAGTGGGATTCGG 
CGGTTAAGTTCCTGGAGAGTTCGGTGAATGCCAAGCTCGTGCAGACGGAGGAGACTTTAGCACAAATGTTTGGTCCCGGCCAAATGCGACGAATT 
ACCCACTGGCAGTACCTGACGCAGGATCAGCAGAAGAGGCGAAGCGTTAAGAACGAACTCGACAAAATACTCAAAAACGATACGAAACATTTGCC 
CACCCTGACCCATGATGAACTGACTACGGTTCGCAAGAACCTTCAGCGTGATAACGTGGATGTGGACACGGACTACATAAGGCAAACGTGGTTCC 
CGGTCTACAGAAAACACTTTCTGCAGCAGGCGTTACAGAGGGCAAAGGATTGCCGCAAAGCATATTACCTCTACACGCAGCAGGGAGCCGAGTGT 
GAGATATCCTGCAGCGACGTCGTGCTCTTCTGGCGCATCCAGCAGGTGATCAAGATAACAGGCAACGCACTGAGACAGCAAGTAATCAATCGGGA 
GGCGCGGCGGCTGGACAAGGAGATCAAAGCGGTGCTGGACGAGTTCAGCGATGACGAGGAGAAGAAGGGTTACCTGCTCACCGGCAAGCGCGTGC 
TACTAGCCGAGGAGCTAATCAAAGTGCGGCAGATCCAAGAGAAGCTGGAGGAGTTCATCAATTCACTTAATCAGGAAAAGTAGTGTGCCATGACC 
AAAGACACAACACCCGAGGCGTTACTTATTTAACTTGATAGTTTAAGACCACATTTGAAATGTTTGTTCATAGTCA 
{SEQ ID NO: 944) 

Start ATG: 118 

ML RI Y QNT Y RRT ARK A VV Y S T K VAC CN H STLCGITSHP RRAQ D S G S S S S N GRHRGHEE FLL AGN P ARGWQM P P P SRGYGML V V RI L RG ALKLRY I 

VLGGAIGGGVSLSKKYEEWKDGLPNFKWLEDAMPQGERWSQFSRNLIEVGSLVKNAIEVDPKLKQLGEDKLSEWRNWFDSRLDDAIEAADYQGVQ 

IVETKDDLKAKTTVAALGITSDESRKKYEKLQSQVETLQTEIMNVQIKYQKELEKMEKENRELRQQYLILKTNKKTTAKKIKKSLIDMYSEVLDE 

LSGYDTGYTMADHLPRVVVVGDQSSGKTSVLESIAKARIFPRGSGEmTRAPVKVTLAEGPYHVAQFRDSDREYDLTKESDLQDLRRDVEFRMKA 

SVRGGKTVSNEVIAMTVKGPGLQPJyiVLVDLPGIISTMTVDMASDTKDSIHQMTKHYMSNPNAIILCIQDGSVDAERSNVTDLVMQCDPLGRRTIF 

VLTKVDLAEELADPDRIRKILSGKLFPMKALGYYAVVTGRGRKDDSIDAIRQYEEDFFKNSKLFHRRGVIMPHQVTSRNLSLAVSDRFWKMVRET 

IEQQADAFKATRFNLETEWKNHFPRLRESGRDELFDKAKGEILDEVVTLSQISAKKWDDALSTKLWEKLSNYVFESIYLPAAQSDSFNTMVDIKL 

RQWAEQALPAKSDRFVHDKQEWDSAVKFLESSVNAKLVQTEETLAQMFGPGQMRRITHWQYLTQDQQKRRSVKNELDKILKNDTKHLPTLTHDEL 

TTVRKNLQRDNVDVDTDYIRQTWFPVYRKHFLQQALQRAKDCRKAYYLYTQQGAECEISCSDVVLFWRIQQVIKITGNALRQQVINREARRLDKE 

IKAVLDEFSDDEEKKGYLLTGKRVLLAEELIKVRQIQEKLEEFINSLNQEK* 

(SEQ ID NO: 945) 

Classification: enzyme 



Celera Sequence No. : 142000013384522 

TCATCCATGTGGTGGTGGTGGCCCGCAATCGCATTGGAGTATTTGTTATGGATATACAAAAGGGGAAGTTCGCCGCTAGTGGAATGCATTTGTAG 
GCTTTGCCGCAGGGATTAACTGATGATATTCCTCAGATAGGCGGAGTTTTCCGATTTAGGGTTCTAACTTTGGGGTTGATTAAAGCTAAAATCAT 
CATGGAACTAATAGTTTTACATTTTTAAAAGTAATGCCGATTTTTTCCTTACTTGGGATACTTCCGAAATCCTTGTTATTAATTTGTATCTTTTC 
TTTTTTGGTCATCTATGATAAATCTTATGTTGTTATATTGATATGACCGCATAGAGATATTGCATGCTATCGGATGCATAAGCTTAAACCTTAAA 
ATCTCGCAGGATTTCAGAAATATCCCTTAAATTACATAATTTCACTGGTGTGCGGTATTCTTTTTCGTTTTACTTTACAATTCTTCTAGAGTTTT 
TTGACTTATTTACCCAATAATTCGAATTCTAAAAGTATATCCGATCATTTAAATAATATTAGATAAAAACAAAGAATTTTTCAAGCATACTAACA 
GTTTTTTCGCTCTACAAAATATTATTTTTCAAAAGTATAATTTTTTTCGTCAAATTAAGTAAAACATAAATTTAGAATCAAACTATAGCACAAAA 
TGGTTACTTGTTTGTTTCGTTCTAAATCCTGATTTTTACGCCTAAGACTACAACATGCATGCACAACAACAACAACAATTATGCCAACGACAAGG 
TTAACAACAACGACAACAACATGGAACACAGCTTGCCTCAAAACTACAATGTGTGACGGTTCGGATTATGGTGAATATCAGACGGATTCATAATA 
CTGACTGGCTACCCGGGTGCGCGATCGATCCTTTCGCCTATGTGAGGATGGTGGCCTCGCCGCCACCCAGTGGGCGAGGAGCGCCCACATGCGGA 
CGCTGCTGCTGCTGCTGGAGCCGCCCACCGCCCACCCTCAGCTTCTTGTGTCATTTCTTGCCACCGAAAGTGGGTTTCGGTAGCGCACCCATGAA 
CGCACCAGCATTTGTGGGCAAGGCCGGTGCAACGAAGCTGAGATCGGGGATCAGGGAGATGCGTGGAGCCACCTTGGCGTTGCCGATTTGGAACT 
GCTGCCGCTGATGCTGCTGCTGGTAGGTGGCACCCACACCCAATCCTCCAGACACGGGTGCAGTCGAGAAGTCGTGCATGATGTGTATTTGCTGT 
GGAATGGCTGGCACATTGTTGTTATTGCTGCTGCTGGTGCTGCCCTTGGGCACTAGATTGCTATGAGGCTGTTGCTGCAGCTGCTGCTGCGCCTG 
GTACAAACTCAAGGGATTGTTGTAGAAATTAGCGCTCTGCCACGCACCGGAATTGAATTGTGGAGCCGGTCGTGCTGGCGGAGTACTGCTCACAA 
AATCCGACCAATCATCTTCTTTGGGAGCAGCTGGTTTCCTAGCGTTGCTGTTGACTATATTATTGTTGTTAGCAATTGCGTTTTGCTGATGATGC 
GGTTGCTGTTGGGTGACTGGAACAGAGACAAAGTCCGACCATTCATCGTCTTCTTGAGGCTGAGACTGAGCTGGCGTGGAAGGAGCTGGTTTCTT 
GGCCACCGAAGCGCTTATGGTGAGCGGTTTGGCCTGCGAGGAGAACAACTCCTCTATCCTGGCCATTTCCTCGGCATTTATGCTAGACAGAGCAT 
TGTCGCCCCATTCAATAGTTGCAGCTGGCTTCTTTGCCAGCGGAATGGCTTGGGGCAGCAGAATAGCCGGACTAAGGATCATCTGTTCGCCAAAA 
GTGGGTGTATCAATTCGAGACCTTGGCGCCGGCATAGATTGAAAATCACTAAAGTCGTCATCTTCCGTCGTCTGTGGTGGCAACTGAAAGCTGCT 
GGCAGCTCCATTGCCACTCAGTTTGCTGGGCGAATCCAGACCAAATGGGGGTTTGTACTCCACGTCGCGATACTCCTTCTTCACCACCGTCTCAT 
CGATGATAATGCCCGTCGCAGCCATCTCGTAGTCGAAATCAATGGGCGCTAGTGTGGTGGTCTTGGCCACCGCCGTATCCGATTTGGAGGGCGTA 
TAGATATGGGTCTCCTTCAGCGGCACCGAATAGCTGGCCATCGAGCTGCCCACGATCTCCTCCTGAGCTGGCGCTGCGGCCGGAGGAGTGGTCCT 
CAAATCAAACTGCTCCGGCTCCGACCGCTGCTGATGCAGCTGCTCACCGCCGTTGAGTTTGTGGGCAGGGCTGTCTGCTGATAGAGTAGTCGGTA 
GCGTCGCTGCTGCTGCAGCAACAGCTGCTGCTGATGCCGCTGCTGTGGATTCAATATGGTTCGATTCGGGGCTGGGGACTGCAGTTGGCTGGTGG 
CTATGACTCGGTCGCTCTTCGCCGTCGGAATTGCTGGCGGAACAGGAGCCGGTGGAGCCTTCCCCATGAGACGGGCCAGTCCGCGTTTCAGTTTG 
GCTCGCCGTTGTTGCGGCGGCGTTTTCCAATCCTGTGCGAAGGAAAGAAAACAATTACTTGGCTGTCATGAATGAAAACTATGAGATAAACAAAG 
ATAATTAATTAGTTATTCGGCTATAATCATCTGTTATGTTGCAGATAACTAAAAAAACACTGTGCTTATGAAAGATATGGAATGGATATGGATAC 
TAATCTTGCATTAATCATTTTATTTTACTAGATACATTGTATCTTTTATAGAATCAAAGAAATCAGAAAATGGTTACATTTCAAGCATTACTTTT 
AGTTACCAGATTTAGCAACACATCAGATGATGGCTTAGTTTAGAGACAATATTTAGGTATTAGTATGATCAACTGTATCTAGCTAAGTAAAATGT 
GAAAATGAAGGGAACTCTTGGGCGGGCTTCAACTACTTGTGCTAGAACTGAGCTGGCTGAGATTGGATGCGTTGTGCCAAAGTAACAAGAACTGG 
GGGAACTGTGGATCGGTAGGACGGTACGGGTAGATAGAGAGATAGAGCTGAGAAGTATATAGAGAATCGAAACTGAACTGAACGAACCAAAAAAG 
AGTGGAGATTCAACTACAACACACGCACAGAGCCAACGAAACTTACTAAAATCCAGCAAGGCGTCGTGAGCAGTGAGGTTTGTGTTAATTGTTAT 
TGTCATTGTTGCAGTATTTTTTGCATGATTGTTGTTAAGATTGTTGGGATTGTTATTATCGTTATTATTGTTGTTGCAGATGGTGGCAAAATGTG 
TGAGTGATGGAGAGTGGTGCAAATCGATTGTCAAATTGTGATCGTTATGGGTCATCAAATCCATCACAGGTGGCATACAATCCAGTTGATCAACG 
GGTGCAGCAACGTCGTTATCACCATTATCGTTGTTGTTATTGGTGTGGGAGTTAGAGGTAACAAGATTTATTTTATCAGCATTAGCCACCACCGT 
TAGTTGGTCTAGTTGTTCCAATTGTAATGTCTGTACATGTGATTCTAAATACAATTTATCGTAAGGATAAACTTTTTGATTGTTTGTTGAGATAG 
TTAGAAGTAATGTGGTCACAAGGAAAAACAAGGCAAACGAAAATCAATAAGAATATATAGAAAGGAGCAGCCAATCGAACGGCAGGCCAATCCGG 
GAAGCGTGTGTCAATAGGATCGGGAAATAGGCAACGAAAAGCGTTTGAAAGTAAAGCGAGCGTTAGGCGATAGGGTGGCAATTTCTCGATTTACG 
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AAACTTGTGTAGCGGTTAATTTGCGGCAACTAAGTGGAGCAATCTCGCAGAGACTCTAAGGACACTCACCCTGGTAGCTGGTGGAATTGGAAATG 

GGCTCCAGGCTGCTTGCGGCGCCGGTTGATTGAGGTTTCAGCGGCTTCAGCGGATCAAAGCTCAGGTTGGCAGCAAAACGTGGCATCGATGAGTT 

CCACTTGTCGCCGAAGAGCTGGAAGTTTATGAATTTCAGTTATTTGAAATACATTTTGAAGCCCTATCTAACTTACTATATTCCTTGAATCGATG 

CCCAGCGATCGAACAAGGGCGTGTCTCATTTCCGAGCTGCTCCATTGGTAATCAAGGGCTTGGGCATGCTCTATGGCACCAAAATTGAATCGCTG 

GCACTGAGCCAAAGGCCTGCGATTGTCCATACTTTTGCTGTCCAGTTCCGTTGGTTGTGGAAACATCATATCAAGGACTGGCTTAACCCTTTCAT 

CTATGCTCAGATGGACAGCAGCTGGCGGCGGTGGCGGAGGTACAGAAACAGGAGTCACTACGGGCGCGGCGAAGTCGCCAAAATCATCGTCGTCT 

TCGCTTTGACCGTGGTTTCCATTCGCAGGAGTGGCAAACTCCTGAAAATCATCGAAGCCATCATCGGTCTCTGCAGGAGGTAACTTAGAGGATTC 

TTCTGTGCTAGGAGCCGTCTTAACCTCTAGAGGTTGAGATTCCGTTCGGGAATTGATTGCTAAGGAAGGTGTCTCAAAGGCAGACGTTGAATTCT 

CCGTGATTTCTAGTTGGGGTTCTGCTGCTGAGAAATCCGCAAAGTCACCGAAGTCATCATCATCTTCTTGCTCTTGCATTTCGGGAACAATTTCT 

TTGTGTACTAAGTCTTTTGGCGTCGCATCAAGATTTGTAAATGAATATGTATCACTATTTGCTGGTTCACCCCATTGGTGAGGGGATTCCTCCTG 

ATTTGGAGGATTTGATAAGCCACGGCTTTGATTTTCCATAGGCTTTTCTAAGGTATTTGAACTTTCCTCCTGGGCTTCCAAACTTCTAAAGCTTT 

CTTCTGGCTGGGAGGTATTCTCTACGTATTCTCCTTTCCCTGTATCCACATTCTCCTCCGAAAAGCCTTCTCCATATGGTAGGGTTGGAGTGGCC 

AATACCTCTATGGCAAAGAAATCCACAGCTCCCTCGCGTATAAATAGTTCTTTGGGCTTCTTTGGCGAGCACTCCTCGTCGCTGTCATCGGACAC 

ATCCTCCAGCGTGACTTGGTGGCTGAGGATATGGGCTTTGGTTGTTTGCATAGCCGGCGGAGCAACTGCACTAGTCTCCGCAACCGGTGAGAGTG 

TGGAGGCGGCAGAAACGCTTTCCGTGGAATAGAGACTTAGACTGTCCAACTTAAGGGAGGGCACATGCTCCTCCAGTTCCTCATCATCCAGTTCC 

AAGCTCAACGGAGGCGGCGTCGGCGGTGCTGTTTCCTCGAAAACATCGTAATCCATCTTCTGTTTCACCTGGTAATTAAATGCCTCGACGGTTTG 

TTTGTCTTCCAGCAAATCAGGTGGCTTTTCTCGAAATTCCTCCACGGGATCTGGTGGTGGAGGGGGTTCTAAAAAGCGAGAATTAACTGAGTATC 

ACTTTGAGATTCGATAAATATCTAAGCTTGCCTTTCGAGCTGCTGACTAAGCCAAAATCTGAGTATTCGTCACCCTCGTCCAAATGGATGGTGGA 

TTGTAAACCGGAATCGTCGTCTTCCTCACCAAAATCGATGGGCGGGGGGGTGCTGCAGAGCAGCGGTGGTACATTCACCATAATGCCAGCTGCAA 

AATTCAGTAAAAGGATCCCCTAAAATTCACTTCCTGCTTCAAGAAGAACAACAAAAACGCTGGGAACGGTTCGATCTTCTATCTCTCTCGCTCCA 

CTTTATCACTGTGTGTGTGGTGTGTATGTGTGACGCAGGTGTGGAGGGCCTGCAAGTGTGTGTGTGTGTGTTAGCCACTCAAGGTCTGTGTTGGA 

ATTTTCTTGTTTGCATTTAACACGCGTCAATTTTGTTGTGTCTCTGTATTATGTCTAATTGTCGCCTTATTGGTGGCTCTAAAAACAGATTTGCC 

GAATTTGCAAGTCGCGTTTTATCTGCAATTAGCCATGTTTTCCAGAAAACAATACTTTTTCCTCTGCCCACAGTGTGAACGTTTAACTTTTGGCC 

AACCGACAGTGCTGAAAAACGTTGCGCAACATTGATAACATACAAAATTTGAATCTTATCTTTTTCAATTCCCATTTTCGATTTAAATCATTATC 

CATAGTTCAGATAGAATAGTTATATTAGTTATATTTCTATAATAGATTTTTATTCTTTCCAATTCTTTTACCGTTAAAATTGTATTTGAAAATTA 

TTAGTAACAAGAGTAAAATATACATGAAAGTCTAGGCTGCAGTATGTAATACGTCAAATTTTGTTGGGACACTAAACAACAAATAGTCGTCATAA 

AGGCTTCTTTTAAATTGTTATGATATTTTCCACAATTTCTGCATACTTTTTTAAGCTTGCTGGCAACGCCGCCACCGTACCAATACAACTATCGA 

TAGCCGCCTGTCAGTTTTGACAGCTGTTTTAAGGTTATACTGCGCTGCCGTTGCAATCGCTCCACCAATTTGGTAATTTTGCTTATTTAACATCA 

AATATTGCATAAAACTAGGAAAACCAGGAATCTGTAAAGTGGAGCGAGACGGGGAACGCTATTAAGGAAGTCGGATAACTAAAGTACATCCTTTT 

CTTGCAGTTCACGGTCGGACAATTACATAACTTGAAGAAGAAACCATGTCGCAGGCTCCCGTTCGCGTTTCCCCGCTGATCAAGTTCGGCAGATG 

GTCCCTGCTCCTGGTGGGCATTGCCTACGGCGCCGCCCACCAGAGCCGCCTGTCCAAGAAGGAGGAGAAGTTGCGCGAGATCGAGGCACAGCAGA 

AGGCCGTCCGTGATGCCAAGCTCGCCGAGGAGAAGAAGCGCAGCGCAGAGGCTGAGGCCCGTGCCTTGGCTGAGCTTTCGAAGCCCACTCCCAAG 

CACTAGGATCCTTCAGGATGCAACATCCGCAGTTCCCTTAGCTTAAAAACATTGAAAAACCACAACATTTACACACGAAGTTCGCCCAAAAGGAG 

GAGCATCATCGATAGTGGAGCCACC 
(SEQ ID NO: 946} 

Exon: 5865.-5352 
Exon: 5293.. 3972 
Exon: 3913.. 3775 
Exon: 2502.. 1001 

Start ATG: 549 6 (Reverse strand: CAT) 
Transcript No * CT24913 

TGGGCAGAGGAAAAAGTATTGTTTTCTGGAAAACATGGCT^^ 

AT AAG GC G AC AAT T AG AC AT AAT A C AG AG AC AC AAC AAAAT T GAC GC GT GT T AAAT GC AAAC AAG AAAAT T C C AAC AC AGAC C T T G AGT GGC T AA 
CACACACACACACACTTGCAGGCCCTCCACACCTGCGTCACACATACACACCACACACACAGTGATAAAGTGGAGCGAGAGAGATAGAAGATCGA 
ACCGTTCCCAGCGTTTTTGTTGTTCTTCTTGAAGCAGGAAGTGAATTTTAGGGGATCCTTTTACTGAATTTTGCAGCTGGCATTATGGTGAATGT 
ACCACCGCTGCTCTGCAGCACCCCCCCGCCCATCGATTTTGGTGAGGAAGACGACGATTCCGGTTTACAATCCACCATCCATTTGGACGAGGGTG 
ACGAATACTCAGATTTTGGCTTAGTCAGCAGCTCGAAAGAACCCCCTCCACCACCAGATCCCGTGGAGGAATTTCGAGAAAAGCCACCTGATTTG 
CTGGAAGACAAACAAACCGTCGAGGCATTTAATTACCAGGTGAAACAGAAGATGGATTACGATGTTTTCGAGGAAACAGCACCGCCGACGCCGCC 
TCCGTTGAGCTTGGAACTGGATGATGAGGAACTGGAGGAGCATGTGCCCTCCCTTAAGTTGGACAGTCTAAGTCTCTATTCCACGGAAAGCGTTT 
CTGCCGCCTCCACACTCTCACCGGTTGCGGAGACTAGTGCAGTTGCTCCGCCGGCTATGCAAACAACCAAAGCCCATATCCTCAGCCACCAAGTC 
ACGCTGGAGGATGTGTCCGATGACAGCGACGAGGAGTGCTCGCCAAAGAAGCCCAAAGAACTATTTATACGCGAGGGAGCTGTGGATTTCTTTGC 
CATAGAGGTATTGGCCACTCCAACCCTACCATATGGAGAAGGCTTTTCGGAGGAGAATGTGGATACAGGGAAAGGAGAATACGTAGAGAATACCT 
CCCAGCCAGAAGAAAGCTTTAGAAGTTTGGAAGCCCAGGAGGAAAGTTCAAATACCTTAGAAAAGCCTATGGAAAATCAAAGCCGTGGCTTATCA 
AATCCTCCAAATCAGGAGGAATCCCCTCACCAATGGGGTGAACCAGCAAATAGTGATACATATTCATTTACAAATCTTGATGCGACGCCAAAAGA 
CTTAGTACACAAAGAAATTGTTCCCGAAATGCAAGAGCAAGAAGATGATGATGACTTCGGTGACTTTGCGGATTTCTCAGCAGCAGAACCCCAAC 
TAGAAATCACGGAGAATTCAACGTCTGCCTTTGAGACACCTTCCTTAGCAATCAATTCCCGAACGGAATCTCAACCTCTAGAGGTTAAGACGGCT 
CCTAGCACAGAAGAATCCTCTAAGTTACCTCCTGCAGAGACCGATGATGGCTTCGATGATTTTCAGGAGTTTGCCACTCCTGCGAATGGAAACCA 
CGGTCAAAGCGAAGACGACGATGATTTTGGCGACTTCGCCGCGCCCGTAGTGACTCCTGTTTCTGTACCTCCGCCACCGCCGCCAGCTGCTGTCC 
ATCTGAGCATAGATGAAAGGGTTAAGCCAGTCCTTGATATGATGTTTCCACAACCAACGGAACTGGACAGCAAAAGTATGGACAATCGCAGGCCT 
TTGGCTCAGTGCCAGCGATTCAATTTTGGTGCCATAGAGCATGCCCAAGCCCTTGATTACCAATGGAGCAGCTCGGAAATGAGACACGCCCTTGT 
TCGATCGCTGGGCATCGATTCAAGGAATATACTCTTCGGCGACAAGTGGAACTCATCGATGCCACGTTTTGCTGCCAACCTGAGCTTTGATCCGC 
TGAAGCCGCTGAAACCTCAATCAACCGGCGCCGCAAGCAGCCTGGAGCCCATTTCCAATTCCACCAGCTACCAGGGATTGGAAAACGCCGCCGCA 
ACAACGGCGAGCCAAACTGAAACGCGGACTGGCCCGTCTCATGGGGAAGGCTCCACCGGCTCCTGTTCCGCCAGCAATTCCGACGGCGAAGAGCG 
ACCGAGTCATAGCCACCAGCCAACTGCAGTCCCCAGCCCCGAATCGAACCATATTGAATCCACAGCAGCGGCATCAGCAGCAGCTGTTGCTGCAG 
CAGCAGCGACGCTACCGACTACTCTATCAGCAGACAGCCCTGCCCACAAACTCAACGGCGGTGAGCAGCTGCATCAGCAGCGGTCGGAGCCGGAG 
CAGTTTGATTTGAGGACCACTCCTCCGGCCGCAGCGCCAGCTCAGGAGGAGATCGTGGGCAGCTCGATGGCCAGCTATTCGGTGCCGCTGAAGGA 
GACCCATATCTATACGCCCTCCAAATCGGATACGGCGGTGGCCAAGACCACCACACTAGCGCCCATTGATTTCGACTACGAGATGGCTGCGACGG 
GCATTATCATCGATGAGACGGTGGTGAAGAAGGAGTATCGCGACGTGGAGTACAAACCCCCATTTGGTCTGGATTCGCCCAGCAAACTGAGTGGC 
AATGGAGCTGCCAGCAGCTTTCAGTTGCCACCACAGACGACGGAAGATGACGACTTTAGTGATTTTCAATCTATGCCGGCGCCAAGGTCTCGAAT 
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TGATACACCCACTTTTGGCGAACAGATGATCCTTAGTCCGGCTATTCTGCTGCCCCAAGCCATTCCGCTGGCAAAGAAGCCAGCTGCAACTATTG 
AATGGGGCGACAATGCTCTGTCTAGCATAAATGCCGAGGAAATGGCCAGGATAGAGGAGTTGTTCTCCTCGCAGGCCAAACCGCTCACCATAAGC 
GCTTCGGTGGCCAAGAAACCAGCTCCTTCCACGCCAGCTCAGTCTCAGCCTCAAGAAGACGATGAATGGTCGGACTTTGTCTCTGTTCCAGTCAC 
CCAACAGCAACCGCATCATCAGCAAAACGCAATTGCTAACAACAATAATATAGTCAACAGCAACGCTAGGAAACCAGCTGCTCCCAAAGAAGATG 
ATTGGTCGGATTTTGTGAGCAGTACTCCGCCAGCACGACCGGCTCCACAATTCAATTCCGGTGCGTGGCAGAGCGCTAATTTCTACAACAATCCC 
TTGAGTTTGTACCAGGCGCAGCAGCAGCTGCAGCAACAGCCTCATAGCAATCTAGTGCCCAAGGGCAGCACCAGCAGCAGCAATAACAACAATGT 
GCCAGCCATTCCACAGCAAATACACATCATGCACGACTTCTCGACTGCACCCGTGTCTGGAGGATTGGGTGTGGGTGCCACCTACCAGCAGCAGC 
ATCAGCGGCAGCAGTTCCAAATCGGCAACGCCAAGGTGGCTCCACGCATCTCCCTGATCCCCGATCTCAGCTTCGTTGCACCGGCCTTGCCCACA 
AATGCTGGTGCGTTCATGGGTGCGCTACCGAAACCCACTTTCGGTGGCAAGAAATGA 
( SEQ ID NO: 947) 

Start ATG: 370 (Reverse strand: CAT) 

MVNVPPLLCSTPPPIDFGEEDDDSGLQSTIHLDEGDEYSDFGLVSSSKEPPPPPDPVEEFREKPPDLLEDKQTVEAFNYQVKQKMDYDVFEETAP 
PTPPPLSLELDDEELEEHVPSLKLDSLSLYSTESVSAASTLSPVAETSAVAPPAMQTTKAHILSHQVTLEDVSDDSDEECSPKKPKELFIREGAV 
DFFAIEVLATPTLPYGEGFSEENVDTGKGEYVENTSQPEESFRSLEAQEESSNTLEKPMENQSRGLSNPPNQEESPHQWGEPANSDTYSFTNLDA 
TPKDLVHKEIVPEMQEQEDDDDFGDFADFSAAEPQLEITENSTSAFETPSLAINSRTESQPLEVKTAPSTEESSKLPPAETDDGFDDFQEFATPA 
NGNHGQSEDDDDFGDFAAPVVTPVSVPPPPPPAAVHLSIDERVKPVLDMMFPQPTELDSKSMDNRRPLAQCQRFNFGAIEHAQALDYQWSSSEMR 
HALVRSLGIDSRNILFGDKWNSSMPRFAAKLSFDPLKPLKPQSTGAASSLEPISNSTSYQGLENAAATTASQTETRTGPSHGEGSTGSCSASNSD 
GEERPSHSHQPTAVPSPESNHIESTAAASAAAVAAAAATLPTTLSADSPAHKLNGGEQLHQQRSEPEQFDLRTTPPAAAPAQEEIVGSSMASYSV 
PLKETHIYTPSKSDTAVAKTTTLAPIDFDYEMAATGIIIDETVVKKEYRDVEYKPPFGLDSPSKLSGNGAASSFQLPPQTTEDDDFSDFQSMPAP 
RSRIDTPTFGEQMILSPAILLPQAIPLAKKPAATIEWGDNALSSINAEEMARIEELFSSQAKPLTISASVAKKPAPSTPAQSQPQEDDEWSDFVS 
VPVTQQQPHHQQNAIANNNNIVNSNARKPAAPKEDDWSDFVSSTPPARPAPQFNSGAWQSANFYNNPLSLYQAQQQLQQQPHSNLVPKGSTSSSN 
NNNVPAIPQQIHIMHDFSTAPVSGGLGVGATYQQQHQRQQFQIGNAKVAPRISLIPDLSFVAPALPTNAGAFMGALPKPTFGGKK* 
(SEQ ID NO: 948) 



Celera Sequence No. : 142000013384 673 

TCGCGTGGACATTGAGACTCCAGTGCGCAAGCGTGAAATCGAGACGGCGGTTGCCTCGCCCAACCAACAGCATGTGCCATCGGTGAACAAGGCGG 
GAAAAGAGGTGTACTATCCACCCGGACACGACACCATCGTCATGAAGCGCGAGGAGATGCATGCTGGATCTGCCTCAGGCGTGAGTATTGGATAC 
TTACCAAGGTGGAATATGCCCAACTGATAATCCTACATCCACAGGGTCGCTGGGCCAAGGGATCCGGAATGTACGAGTACGAGTCCGGCTCGAAG 
TCAAAGACGAAAACCAAGTCCGGTGGCGCCGTTGTGCCCGTGTGCCTGCCCCTCTGCTGTGCCATGCCCTGCTCCATTATGTAGTGGGCGGACAT 
CGCCGGAAGACCGTATCTTGTACATAGGTTTCTAATATTTGCCGTTAATAGTTCTAACGCTAAATCCTGCCCCGCGTTTTTGATTGCTCTCTACT 
AGGTCGTCTAACTTCTATCGGGCTTGCTGCTTGTCAATTGGTTGTACGCGTAATTTGTTTGCCAAATTGTATGTCTATATATCTATACGTTATTG 
TCTATGTATATAATTTGTGAAGCGTCATCTACGGATGGGCCGGTTGTTATACCATTTTATTTTGGATTTAGTAGTTCACCCCAAACTAGTTTGCT 
CAAAATTGTTGCTCAAGCATTTAAATTCGCCAGTGGGATGCCCAAAAAGTTTACTCGATGCCAATTGTTTTAACTTAAATTTGCATACCGCATAC 
CAAACACTAATAAATGAGCCGTGGAATCCAATTCCCTCTATTATTCGTTTCGCCAGCGCCCCCAAATACCCACTAATCATAAACCATAATCGTAA 
CAGCGACTTAAAAGGTAACTCAGTCGTTACAGAGCCATAAATACTATTTCTAATACACGTAGTAACCATAATAAAAAACGAGATCTAACAAACGA 
AGCGATGAACTGTTTGAATGCCTCTTTTGAATGGATCGATTTGCGTGGGGTTTTGGTATAGTAAAAGTTTTTATAAGTTTATTTTGGTATAAATC 
GTTCTGATCGAATAACAATTTGCATATCACAAAAGTAATATCTACATTTCACCTTGTAGGAATAGGTGTAGCTTTCTTTTGAGAAGTTATTAAAA 
AAC ACT ATAGTG AAAC AAC AAAAAACT AAAAAC AACAT AAAAC AAT ACAAAAATCAACAGAGGTAT ATTAT TT AGAT AAGT AT CAT TGAGCACT T 
TGAGTAGTGAAACGGGAGGATTATTGCTACAGAGACGAGTTCTTATTGTTGTTCTTGGATTACGAATTGAAGCTAAATGATGGATTAAAAGTATA 
CTACATCGTAATATGAGTATAGTTACTGGACTTACTACCATACACACAATATGTGCCTAATATTCTAACCATGCCGCCATGATCTGTCATTCTAT 
TAAAAGAGACTACCGTTTGTGAAAGTAATGTTGTCCTTTGTACATACGCATATTTGAGAGTTGGAAAGGTCACGTATTGTAACGCTGTCTTTTAT 
GCAGGCTAACTTTTGCGCTCGGAGTTCTTTAGCTCTGCTAAGTACCAAGTGAAAGTTGGAAATTGTTTGATGCTGTAGATAGAATACTTTATCGA 
GTTGATTCCATCCTGTTCTATTTTGCTCATGCCATTCTGTAAGTTTTCATAGCTGTCAAAAAAAGAAGGGTAAAAATTCCATATTTGTTAGTCTA 
AATTATGGATAATAGTATGGTCTACTGACCGCTTAGGATTGGCCTTTTCCTTCTGATGCTTCAGCATCTTGTAGCGAGCTATGTTGACCGGATAC 
CTTGATATGAATAGGTTTGCGTGCTTCAACCTGTTGGACATGTCGTCATCCTCGCCGCCCCAGCCAAAGAACGAGTTTGAGAACCCATTTACGGC 
CTGAAAGTGCTCACGCGTCATTGCGGAAACACCTCCAAATATTGATCGATAAGGCAACCTATAAAGAATTGAATCAATTTACCATTGATAGTTGT 
TATCTCTTTGCTCTACTTGCCTGAAGTTCAGCGTGTCTATAGCCACTGACATGTGTCGCGGCTGACGTGGACAGTTGTAAAGATTGCGGTCGTCC 
AAAGGCAGAAGATCGACATCGTGGAATATAAAACAATCCCACTGGTACAGCTTTAAGGCCTCCAAATAACCAATGTTCATCATGGCAGCCCGATT 
AAAGGGCTTCCCGTTGGTCTGCTCTACAATGAAAATGCGATAGGCGATGCGCTGCTTCATCAGAAATGGGTGGATGTTGCGCAGGAAAAGTAATA 
GATGGGCGTATCGATCGCGGAAGGGCACAACAATAGCCACGTGATGCTGGGCATTGCAGTTTTCAGGCTCGAAGGCGCCACCAGGGCGCAAAAGA 
GGTCCAAGCTCCGCCTCAATAACGTCCAGTGACTCCAGTGTCGTGTTGGGCGTGATGGGTCCACCTAAACAAGGTGGACGTATTTTAAATCGTGC 
TAAAGAAAGAGCCAAGCCGAGCTTACCATCACGGGGATCGGGATCAGTGCAGTTGGCAAGGAGGGCGGATGTGGATTTCTGTCGCGGGGGCAAAA 
ATTTCTTTGTGCCCAGCAGGAAGCGCGTTAAGTCTTGTGGAATGGAAGTAAAGTTTGCCGTGGCAATCACAGTTCTTATGGTAGAGTTGGGTCCG 
CCATTGAGCTCCTGGTCCCGCTCTCTGTCTTTTGATAAGGCGAGCGGGGAAGCGGGCAGCCTGGATGCTTCACTGCCTCCGGCTCCATCGCTGCT 
AGCGTTCCCGTAGATATGAGCATACTTGTGCACGCGCCGAATGCTAATATCCGTTCCCAAAAATAATTGATAAGATAACCTCTGCCAAGGCAATA 
TGATAGATTTAGTTACCTGAGCTTGCTGAGGGAGGTGGCGCTACCTCCGTCGGAACGGAAGCCCACAAAGTTAAGCACCAGCAACAAGCATATCG 
CGCCGGCTAGGAAGCGTATGAGATTCGCCTTGGTGAAGAGGTACATGGTCCGAGGATTGCATCATACCTTGCTCGACGCCCAGGCGCTGCGAACT 
GGCATTTTCAAGCGCAAAACGTTCGACAACCGAACAAAAACGTTAAAATTCGCAACTCCAGGGTTCAATCAGAAAAGTCAGCTGAGAGTCCCCCT 
CGACCAGGGTTGCCAGAGTTGCGCAATTGCTGCAGCCATAAATCGCCGACTGCCGATAGTTTGCAGGCGCCAAATTCAAAAAACAAACTTCACAG 
C AT ATT TTT ATG ATC T ATT AT T AC T AAAAA TAT T AAATG C AT T TT AAAA T T GAAC GAAAAT CT AAAT GT T T T AT C T AC AAAAC AAT G AAT AC AT A 
TAATATAAGGCAAATTAATTTGTTATTCTAAAGTCCGAAGTTCTCTTGGTTTTGTGATTTATTGCGCTGTTCGGATCTCGTAAGTATCCCAAAAA 
TAGCGAGTTACAGAGTGGTATTGTAAACTGGGGAAATATACGATCTGACATTTGCCCCCAAGTCCTGGGAATGGGTCTTTTCTATCAAGAATATA 
AACAAGTTACAAACACAACTGAAGATTTAAACTAATTGTCGTGCAGAGCAGTAAGGCCCCACTATCCTGACAGCGGTTGTCGCAGCGGCAGTCGC 
AGCGCCTCGTTGTCCTTGATGTGAATCACTTCGAACTGGTTTTGATAGGTGTACTCTATCCGCAGAATCTTGTCTTCGCCGATGCCCGGATCATA 
GAAGCCGGGCAGGTCGCTCTACAAAGACGTAAGGAATGGTATTATATCAACCCAGAACACAATCAGTGAAAGTGTCTCACCTTTGATGAATCATG 
CAGCTGCAGGGTCCCGTTCTTGACCATGCATTGAATGGCTACGGTGACGTCGAGCGATTGCTCTGGCTTGAACTGTCTGCCGCCCGCCAAGGTAC 
AGCCGTAGACAGCTCGTGTGACAATCAGCCCATTCCTCGCCAGCTCCTCCGTCATGATCCTATTGTACGTGGCCTGCATTAGATGAACGGCAGCA 
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CTCGCCTCGTGCCTCTTGGCGGACAGTCTCTGCTCATTCTGACGCTTCGTCCTTTCCACCTCGATGTTCTTTCGCTCCGCCTCCATTGGGTCCAT 

CACAGTCC 
(SEQ ID NO: 949) 

Exon: 3093.. 2867 
Exon: 2798. .2497 
Exon: 2439. .2016 
Exon: 1958.. 1740 
Exon: 1667.. 1001 

Start ATG: 2991 {Reverse strand: CAT) 
GCGAATTTTAACGTTTTTGTTC 

TCGGACCATGTACCTCTTCACCAAGGCGAATCTCATACGCTTCCTAGCCGGCGCGATATGCTTGTTGCTGGTGCTTAACTTTGTGGGCTTCCGTT 
CCGACGGAGGTAGCGCCACCTCCCTCAGCAAGCTCAGCATTCGGCGCGTGCACAAGTATGCTCATATCTACGGGAACGCTAGCAGCGATGGAGCC 
GGAGGCAGTGAAGCATCCAGGCTGCCCGCTTCCCCGCTCGCCTTATCAAAAGACAGAGAGCGGGACCAGGAGCTCAATGGCGGACCCAACTCTAC 

C AT AAG AACT GT GAT T GC C AC GGC AAAC T T T AC T T C C AT T CC AC AAG AC T T AAC GC GC T T CC T GC T GGGC AC AAAGAAATT TTTGCCCC C GCG AC 

AGAAATCCACATCCGCCCTCCTTGCCAACTGCACTGATCCCGATCCCCGTGATGGTGGACCCATCACGCCCAACACGACACTGGAGTCACTGGAC 

GTTATTGAGGCGGAGCTTGGACCTCTTTTGCGCCCTGGTGGCGCCTTCGAGCCTGAAAACTGCAATGCCCAGCATCACGTGGCTATTGTTGTGCC 

CTTCCGCGATCGATACGCCCATCTATTACTTTTCCTGCGCAACATCCACCCATTTCTGATGAAGCAGCGCATCGCCTATCGCATTTTCATTGTAG 

AGCAGACCAACGGGAAGCCCTTTAATCGGGCTGCCATGATGAACATTGGTTATTTGGAGGCCTTAAAGCTGTACCAGTGGGATTGTTTTATATTC 

CACGATGTCGATCTTCTGCCTTTGGACGACCGCAATCTTTACAACTGTCCACGTCAGCCGCGACACATGTCAGTGGCTATAGACACGCTGAACTT 

CAGGTTGCCTTATCGATCAATATTTGGAGGTGTTTCCGCAATGACGCGTGAGCACTTTCAGGCCGTAAATGGGTTCTCAAACTCGTTCTTTGGCT 

GGGGCGGCGAGGATGACGACATGTCCAACAGGTTGAAGCACGCAAACCTATTCATATCAAGGTATCCGGTCAACATAGCTCG^ 

AAGCATCAGAAGGAAAAGGCCAATCCTAAGCGCTATGAAAACTTACAGAATGGCATGAGCAAAATAGAACAGGATGGAATCAACTCGATAAAGTA 

TTCTATCTACAGCATCAAACAATTTCCAACTTTCACTTGGTACTTAGCAGAGCTAAAGAACTCCGAGCGCAAAAGTTAGCCTGCATAAAAGACAG 

CGTTACAATACGTGACCTTTCCAACTCTCAAATATGCGTATGTACAAAGGACAACATTACTTTCACAAACGGTAGTCTCTTTTAATAGAATGACA 

GATCATGGCGGCATGGTTAGAATATTAGGCACATATTGTGTGTATGGTAGTAAGTCCAGTAACTATACTCATATTACGATGTAGTATACTTTTAA 

TCCATCATTTAGCTTCAATTCGTAATCCAAGAACAACAATAAGAACTCGTCTCTGTAGCAATAATCCTCCCGTTTCACTACTCAAAGTGCTCAAT 

GATACTTATCTAAATAATATACCTCTGTTGATTTTTGTATTGTTTTATGTTGTTTTTAGTTTTTTGTTGTTTCACTATAGTGTTTTTTAATAACT 

TCTCAAAAGAAAGCTACACCTATTCCTACAAGGTGAAATGTAGATATTACTTTTGTGATATGCAAATTGTTATTCGATCAGAACGATTTATACCA 

AAATAAACTTATAAAAACTTTTACTATACCAAAA 
(SEQ ID NO: 950) 

Start ATG: 103 (Reverse strand: CAT) 

MYLFTKANLIRFLAGAICLLLVLNFVGFRSDGGSATSLSKLSIRRVHKYAHIYGNASSDGAGGSEASRLPASPLALSKDRERDQELNGGPNSTIR 
TVIATANFTSIPQDLTRFLLGTKKFLPPRQKSTSALLANCTDPDPRDGGPITPNTTLESLDVIEAELGPLLRPGGAFEPENCNAQHHVAIVVPFR 
DRYAHLLLFLRNIHPFLMKQRIAYRIFIVEQTNGKPFNRAAMMNIGYLEALKLYQWDCFIFHDVDLLPLDDRNLYNCPRQPRHMSVAIDTLNFRL 
PYRSIFGGVSAMTREHFQAVNGFSNSFFGWGGEDDDMSNRLKHANLFISRYPVNIARYKMLKHQKEKANPKRYENLQNGMSKIEQDGINSIKYSI 

YSIKQFPTFTWYLAELKNSERKS* 
(SEQ ID NO: 951) 

Name : beta-1 , 4-galactosyltransf erase 
Classification: enzyme 
Gene Symbol: BcDNA: GH1335 6 
FlyBase ID: FBgn0027538 

Celera Sequence No. : 142000013385198 _ m „ m 
ATGTATATACAACAGGATATTCTGCACGATACTTATCCTGCAGCTGATGCATTTTCTCCACCAGTTTAACCACTCCCATCGAGTACCACGCTTCT 

TCGTAAAGGCGCATGAGCTGCAAGACACGGCGGTCACTATATCTATCAGCCGGGTCAGGCATCCTTAACTGAATAACCCAACGCATGGAATCATT 

GGCGGCCACATCAACTCGCTCATCAATGAATACACCATTCAAGCCGTATACGAGAAATTTTTTAGGATGCAAAATCCACTGATTGACCAGTTCTT 

CGGGACGAGGATTCCCGTGAAGCTTTCGCGTATCCTCCTTGCAGGCCTTGTAGTATTGTCGCATTTGCTTTATCAGTCGGGGTGCCAGTGTATCG 

TTTTCACCCAAGTGGCTCTCCATGAATTCCACCATGGCAGTGGCGAACTGATCCTCGACTCGCTCAAAGTTGTCCACATATTTAGAGGCACTGGA 

GAAGCGACCGCAGGCAAAATTCCAGAAATTCGTGCACGGTGATGTTGTTCTATTGAGATTATTGTGAATATCGTTCGACGTTGCCACGTTGATGT 

CCACTATTGCCTGGATGGGAATGAATAATGCCACCAGGCAAGAGCACAGTAAAAGTAAACATCCCATTAAGCTCGCTCGATTCACAATGAAACAA 

CAGATTATCGCAACGCTCTGCTGAGCTCGGCCACCTTCGAACTACCTATAGAGCTTTGGGGCTGATAGTGACAGCTTGGTCAATACTAGAATTCA 

CATTTGAAATGTAGAGCAATTACCTCTAATGAAATGTGCCTGCCACCTATATAAACCCGCTTGGTGAGTACATTAAACCGATCTTAACGTTTATT 

GAATCTTAATTACACCGCTGTTGTAGTTCAATCTTCAATCGTTCGCTAAGACAGTTCGCAATATAAAATTGGTAATCGCGTCGGGACTACCCCAA 

CTATATGTCACACGCTTATAGTCTACTTAACAAAACTGGATCTAAGGTGTTCACTTCTTGAACTTTTGCTTCTTTCCCGCATTCGCACCCGCCGT 

GGACTTTTTCAGCTGGCCGATCTGTTTGGATTTTTGCTTCAGCTTGTTCTGCTTCTTGATGGGTTTTCCCTCGAGAATGGGCACATCGTCGATAT 

CCACTCGCGGCGCCTTGGCCAGTTTGGCGTTTCCCTTGGGCGACTGTTTTCCTTTCGGCTGTGGCTTCGTTTTCGTTTCCTTAATTGCTGATTTG 

CTAGGGGAGTTTGTGCCATTGGTTGTTGGCTTATTGTCATCTGGTTTAGCTGGTGATCCGGCTCCATTTTCCTTGGGTTGCGATCCTTTGTTCTT 

TTTATTGAATGATTTCGCATCTGAATTGGAAACAGTGGTTTTCCCATTGGTTTGTTTTTGTTTAGGTGCAGACTTGGAAGGTGACTCGCTTGAAT 

TTGTTTGCTTGTCCGGTGCCTGGGATTTCTCATTTTTCTTCTCTGCTGGCTTTTCTGGCTTTGGCTTTTTGACAGGCTGCTCAAACACCTTGGCC 

ACATACTTGGTCTTGCTCTCTTCCCGCTTCTTTTTCTTCAGTTCCTGTTCGATTTCGGTGAAACTGGGTTTTCCGGCGCGCTTTCCAAGCAGTTT 

TCGGGGACCCTTCTCCTTTGACGCTACCTCCTCCTCCACCTCCTCAGACTCCGCAGCCGCTTCTGTAGTGGCGTCCGCTTCGATGGCCTCATCCA 

GCTGCTTTTCATCCTCCTCCTGCTGCTCCTTAGTTATGGGTATGGTGTTGGAGAATTTCTTGAGCTTGGCCACGTAGAACCCGTCCATGTTGTGG 

GTGTGCGGATAGTAGCGCTTGGTAAGGTTTAAGCTAGGATGGAACCGATGCTGACGGAATTTGGTAAAGCCCTCCACGCCGAAGTCCAGACCAGT 

GGGCACAAGTTTGACATTTCGCTTCTTCAGTGCATAGTCGATGACCCATTCGTTCTCCTCGGGCAACACCGAACAAGTTGAGTATACAATGTAGC 
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CGCCAGTTGATGACTTCGCATCCGTACAATCTATGGCAGTCAGCAGAAGCTTCCTCTGCAGGTTGTAGCACCTTTGTACATCTACTTCCGACTTG 

GTGGTCTTCACACTGGGATCCTTAGAGACGACGCCAGTTCCAGTGCACGGAGCATCCAGCAGGACACGGTCAAAGCCAGTCATAATATTCCGGAA 

CTTAGTGCCATCCTCGCAGCTGACCACTGCATTTACAATTCCCAGGCGATGGAAATTGGCAACAATGGCCTTGATGCGATCCCTGTTAGAATCGT 

TGGCGAAGAGCACACCAGTGTTCTTCATGATGCTTGCGATGTGGGATCCCTTTCCACCTGGGGCGGAGCACATGTCCAGGATGCGTTCGTTCTCT 

TGCGGGGCGAGTGCCATTACAGGCAACAGCGAAGAGGCGCCCTGGATCATGTAGTGGCCAGCCAGATACTCTGGAGTTGCACCCAGTGGCACCTG 

TGAGTTAAAGACAACCAATCCAACTTTCGTCCATTTACCAAGGGGATCCAAGTTGACGCCACGATTGATCAGAGCTCCGGCCAAATCCCTACGAC 

GTGTTTTTAGAGTATTGGTGCGTATGGTTAGGGGACGAGCAATCTAAAGGGGTTAATTTTTAATGATTAATCTTGTAACTATATTTTTTATGAAA 

TATACTAACCTCTGAGGCCTCCAGATATTCCATTAGCTCTGTCAGCGGCAACATATCCATCAGCTTCTCCATCAAAAATTCGTTGTAGCTGTAAT 

ACAAACACAGATCCCTGCGCAGCAAATCGATGTACTCCCCGCGAGACCTATCCGCTTGACGATACCTTTTGAAGTCGGAAAGAACCAGACTGACG 

TCCTTTATGCGCTGTTGCACTTCCTGCAGGGTTAGGTCCTTCTCGGTTTCTTCGCCTTCGACGGGTAGCTGGAAAACATCCTGCTTGTCAACGGT 

TTCCAACATTTCGTCGTCGGCCAGTTGGGCTTCCTCTGCCTCGCGCTTTTTCAACTTCTTGTTAGCCCGCTCGATGGGCAGCTTGTCATCATCAT 

CATCATCATCATCATCGTCGTCGTCCTCATCCTCCTCTTCATCACTGTCTGCCTGAGCACCATCTTCCTCGCCAGATATGTCAAAGTCATCGTCG 

GAGTTGGCATCATCATCGTCGGATAGATCGGCCAGCTTGCCAACCTGAGCGGTATCATCATCAGATTCCAATCCTTCCTCATCGTCTCCCTC 

TTCCTCTTCACATTCTTCTCCCAACTCATCTTCGGAGTCCTCCTCCGACTCCTCATCCTCCTCCTCCTCCTCCTCGGACTCCGTTTCTGGCTGCT 

TCTGTTGCTTCTTGGGCTTCAGCCAGGCATCATTATCATCTGTAAAACCCTTGGGAGCTGTAGCTTTCTGGCCCTTCGGAGCCGGCACCAGTTGA 

GGCACCTCCTCATCGGAGGATGCAACCTGTGGTTCCTCTTCATTATCCTCAGGATCGCTGTCGCTGTTGTAGGCCGTCTTCCGCTTAGCCGGTTG 

TGGTAACTTTCCCTTGGCCCTCTTCTCCTGTAGTTTGGCCTTCTGGACTACTTTCTTTTGCTCTCGCTTAACGAACCTCTGCTTCTGTCGGTGGG 

AAAGCTTCTTGTCCTCCTCTTCCATGGGAGCTGTGAAGTAAAACATTTGAATTGAACACCCATTGAATAGGGTAATATTCAGGAGCTCTCACCAA 

ACGATTGTTTGCGAAAAACAGGCGGTCCTTGCTTGCGCGCCTTGCGCCCAGGTCCCTTTTTCGGTTTCTCACTGTATTCAGCCTTCCGACCCATA 

TTTTTAACTTAGTTTATCCAGTTTCTTGGAGCAGAACTTAAAAAAAAACGTGAGCACTTCGATTTTGCTCCACGTGCGCCGCTCAAAAGACAGCG 

AGTGTTTTCCAGCACTGGTCAGAGGGAATTGCTTGAGCCAGTGCTGTCAACTTTTCAGCTTTTAACTAGCTATGAACAATGTTTTTTTTTCAACA 

TATATTAATAACTAATAAAATTATATATTACTTTAAATTTTATTTTGTCGGACTTTGCTTGATTCTTACACAGTTTATTTATACAGATTACGTTA 

GTATATATATTTGGAAAACACGTGGAACTTTTTAATTAAGATATTTTAGCTACTTGCTCTCCCAGCTGTTTAGCAGCAAACGTATTTCTCAGCAC 

TATAAAAACGAGTTGGTAAACTAATTGGTAAATCCAAAAATATCATAAATAAAAAAAATCTGGAATATATCTAATTAACAGAATTCTTGCTTATA 

AGTAAGTATTATATCCATTCAATAATTAGATATGATCTTTAGTTGTAATTTAAAATGAAAAATTAATTTAAATAACCGTTATAGTTGCTGCCAAC 

TTGATGCGAATAGTAGTTTGTTGACAACCCTGCCATATTTGATTGTGTTGGTGAATGGCTGGTTCAAACGACTGCGCCTTTTTGCGAAAAATTTA 

ATTAGTTGCAAATCATGGCATACCAGGAGAAAATAACAGAATTTAAGAACTGGGCCACTAATCTTGGCTGCCCGCCCACTGCCCTGCCCACCGAT 

GATGCACTCCGAAGGTAAGTTACAGCTGTGTTAGGCAAATCGGCAACTATCCGTACTTAAAACCCGCTTTTTGCTTCACAGAATTTTTAAGTCCG 

GCTCCAGTTTACTTTTGAACCAGCTGCAATCCCGCATCCAGCCGGTGGATTACGTGCGGGAGGTGCGCGAGAACCTGCTGATTGCCCAAGTGGCT 

CGCTACAAGGACAAAATGGTGCCGCTGGCCTCTAGGAGTTTCCAGCCACCGGAGCTCCAGAGGTACCAAAAAATACAGGAGCTAAAGAAGCACAA 

GGAGAAGGCGAACCAACAGCTTACGGAGGCCCGAAAAGAGTACCAAAAGCTATCTGCCGGCATC 
(SEQ ID NO: 952) 

Exon: 3909.. 3703 
Exon: 3635.. 2670 
Exon: 2608.. 1001 

Start ATG: 3635 (Reverse strand: CAT) 

Transcript No * CT24929 

TGCTGGAAAACACTCGCTGTCTTTTGAGCGGCGCACGTGGAGCAAAATCGAAGTGCTCACGTTTT^ 

AACTAAGTTZUIAAATATGGGTCGGAAGGCTGAATACAGTGAGAAACCGAAAAAGGGACCTGGGCGCAAGGCGCGCAAGCAAGGACCGCCTGTTTT 
TCGCAAACAATCGTTTGATGGAAGAGGAGGACAAGAAGCTTTCCCACCGACAGAAGCAGAGGTTCGTTAAGCGAGAGCAAAAGAAAGTAGTCCAG 
AAGGCCAAACTACAGGAGAAGAGGGCCAAGGGAAAGTTACCACAACCGGCTAAGCGGAAGACGGCCTACAACAGCGACAGCGATCCTGAGGATAA 
TGAAGAGGAACCACAGGTTGCATCCTCCGATGAGGAGGTGCCTCAACTGGTGCCGGCTCCGAAGGGCCAGAAAGCTACAGCTCCCAAGGGTTTTA 
CAGATGATAATGATGCCTGGCTGAAGCCCAAGAAGCAACAGAAGCAGCCAGAAACGGAGTCCGAGGAGGAGGAGGAGGAGGATGAGGAGTCGGAG 

G AGG AC T C C GAAG AT G AGT T GGGAG AAG AAT GT G AAG AG G AAG AGG AGGG AG ACG AT G AGG AAGG AT T G G AAT C T GAT GAT G AT AC C GC T C AGGT 
TGGCAAGCTGGCCGATCTATCCGACGATGATGATGCCAACTCCGACGATGACTTTGACATATCTGGCGAGGAAGATGGTGCTCAGGCAGACAGTG 
ATGAAGAGGAGGATGAGGACGACGACGATGATGATGATGATGATGATGATGACAAGCTGCCCATCGAGCGGGCTAACAAGAAGTTGAAAAAGCGC 
GAGGCAGAGGAAGCCCAACTGGCCGACGACGAAATGTTGGAAACCGTTGACAAGCAGGATGTTTTCCAGCTACCCGTCGAAGGCGAAGAAACCGA 
GAAGGACCTAACCCTGCAGGAAGTGCAACAGCGCATAAAGGACGTCAGTCTGGTTCTTTCCGACTTCAAAAGGTATCGTCAAGCGGATAGGTCTC 
GCGGGGAGTACATCGATTTGCTGCGCAGGGATCTGTGTTTGTATTACAGCTACAACGAATTTTTGATGGAGAAGCTGATGGATATGTTGCCGCTG 
ACAGAGCTAATGGAATATCTGGAGGCCTCAGAGATTGCTCGTCCCCTAACCATACGCACCAATACTCTAAAAACACGTCGTAGGGATTTGGCCGG 
AGCTCTGATCAATCGTGGCGTCAACTTGGATCCCCTTGGTAAATGGACGAAAGTTGGATTGGTTGTCTTTAACTCACAGGTGCCACTGGGTGCAA 
CTCCAGAGTATCTGGCTGGCCACTACATGATCCAGGGCGCCTCTTCGCTGTTGCCTGTAATGGCACTCGCCCCGCAAGAGAACGAACGCATCCTG 
GACATGTGCTCCGCCCCAGGTGGAAAGGGATCCCACATCGCAAGCATCATGAAGAACACTGGTGTGCTCTTCGCCAACGATTCTAACAGGGATCG 
CATCAAGGCCATTGTTGCCAATTTCCATCGCCTGGGAATTGTAAATGCAGTGGTCAGCTGCGAGGATGGCACTAAGTTCCGGAATATTATGACTG 
GCTTTGACCGTGTCCTGCTGGATGCTCCGTGCACTGGAACTGGCGTCGTCTCTAAGGATCCCAGTGTGAAGACCACCAAGTCGGAAGTAGATGTA 
CAAAGGTGCTACAACCTGCAGAGGAAGCTTCTGCTGACTGCCATAGATTGTACGGATGCGAAGTCATCAACTGGCGGCTACATTGTATACTCAAC 
TTGTTCGGTGTTGCCCGAGGAGAACGAATGGGTCATCGACTATGCACTGAAGAAGCGAAATGTCAAACTTGTGCCCACTGGTCTGGACTTCGGCG 
TGGAGGGCTTTACCAAATTCCGTCAGCATCGGTTCCATCCTAGCTTAAACCTTACCAAGCGCTACTATCCGCACACCCACAACATGGACGGGTTC 
TACGTGGCCAAGCTCAAGAAATTCTCCAACACCATACCCATAACTAAGGAGCAGCAGGAGGAGGATGAAAAGCAGCTGGATGAGGCCATCGAAGC 
GGACGCCACTACAGAAGCGGCTGCGGAGTCTGAGGAGGTGGAGGAGGAGGTAGCGTCAAAGGAGAAGGGTCCCCGAAAACTGCTTGGAAAGCGCG 
CCGGAAAACCCAGTTTCACCGAAATCGAACAGGAACTGAAGAAAAAGAAGCGGGAAGAGAGCAAGACCAAGTATGTGGCCAAGGTGTTTGAGCAG 
CCTGTCAAAAAGCCAAAGCCAGAAAAGCCAGCAGAGAAGAAAAATGAGAAATCCCAGGCACCGGACAAGCAAACAAATTCAAGCGAGTCACCTTC 
CAAGTCTGCACCTAAACAAAAACAAACCAATGGGAAAACCACTGTTTCCAATTCAGATGCGAAATCATTCAATAAAAAGAACAAAGGATCGCAAC 
CCAAGGAAAATGGAGCCGGATCACCAGCTAAACCAGATGACAATAAGCCAACAACCAATGGCACAAACTCCCCTAGCAAATCAGCAATTAAGGAA 
ACGAAAACGAAGCCACAGCCGAAAGGAAAACAGTCGCCCAAGGGAAACGCCAAACTGGCCAAGGCGCCGCGAGTGGATATCGACGATGTGCCCAT 
TCTCGAGGGAAAACCCATCAAGAAGCAGAACAAGCTGAAGCAAAAATCCAAACAGATCGGCCAGCTGAAAAAGTCCACGGCGGGTGCGAATGCGG 

G AA A G AAGC A AAAGT T C AAG AAGT G A 
(SEQ ID NO: 953) 



FIGURE SHEET 513 



Start ATG: 208 (Reverse strand: CAT) 

MEEEDKKLSHRQKQRFVKREQKKVVQKAKLQEKRAKGKLPQPAKRKTAYNSDSDPEDNEEEPQVASSDEEVPQLVPAPKGQKATAPKGFTDDNDA 
WLKPKKQQKQPETESEEEEEEDEESEEDSEDELGEECEEEEEGDDEEGLESDDDTAQVGKLADLSDDDDANSDDDFDISGEEDGAQADSDEEEDE 
DDDDDDDDDDDDKLPIERANKKLKKREAEEAQLADDEMLETVDKQDVFQLPVEGEETEKDLTLQEVQQRIKDVSLVLSDFKRYRQADRSRGEYID 
LLRRDLCLYYSYNEFLMEKLMDMLPLTELMEYLEASEIARPLTIRTNTLKTRRRDLAGALINRGVNLDPLGKWTKVGLVVFNSQVPLGATPEYLA 
GHYMIQGASSLLPVMALAPQENERILDMCSAPGGKGSHIASIMKNTGVLFANDSNRDRIKAIVANFHRLGIVNAVVSCEDGTKFRNIMTGFDRVL 
LDAPCTGTGVVSKDPSVKTTKSEVDVQRCYNLQRKLLLTAIDCTDAKSSTGGYIVYSTCSVLPEENEWVIDYALKKRNVKLVPTGLDFGVEGFTK 
FROHRFHPSLNLTKRYYPHTHNMDGFYVAKLKKFSNTIPITKEQQEEDEKQLDEAIEADATTEAAAESEEVEEEVASKEKGPRKLLGKRAGKPSF 
TEIEQELKKKKREESKTKYVAKVFEQPVKKPKPEKPAEKKNEKSQAPDKQTNSSESPSKSAPKQKQTNGKTTVSNSDAKSFNKKNKGSQPKENGA 
GSPAKPDDNKPTTNGTNSPSKSAIKETKTKPQPKGKQSPKGNAKLAKAPRVDIDDVPILEGKPIKKQNKLKQKSKQIGQLKKSTAGANAGKKQKF 

KK* 

{SEQ ID NO: 954) 
Classification: hypothetical 

Celera Sequence No. : 142000013384681 

CAACGTCTAGGGGTCTCTGGACACTTGTGGCCGTTCTCCGCTTACGCATCTGTTCATTGATATGGCGAAGTAAAGACCGCAGCATTTCATCATCG 
AAGAGCAGCATCCATGCTTCCACTGCATTGGACACTGTTCGGGCGGGTCCCTTGCCACAGGCCTGTTGCCCCAAAAGGGGTAGATCCTCTGCAGA 
AATATCTGGGCTCCTGGTCACGCTCCATAGCTGCCCACAAGGAGATGTGTAGTTTCCGTTATTATTTGGTGGACGACCACGTCCTCGCTTCGGGG 
CAGCTGGCCTGGCGGAATCCTCCTTCTTCAGCACCACGTCAGCCAGTTGGTCCTCGTTCATCAGCATGTCGCTGTCCTCGAAAATCCAGGGAAAA 
TTGCATACTGCTTCCTGTAGCGGATCGGAGTCATCTCCGTTAAAGGAGGCACTTTGGACTGTGGAATCGACCGGCGGCAGCCGCTCCTCCACCTC 
CATCTTTTCACCATCCTCGTCTTCTTCCTCCTGGTCGCTACATTTAATATTGGCCAGCAGCTCCTCGTCCAGCACGCATTCCATTGTTTGTTTAT 
AAACTTGCAACAAAACAAACCTTTTCAGTCGTCGATAGGTTAGGACAGTAACGTCTATACATATCGATAATTTATATAGTTATCGAAAAAGACAT 

T T AAAAC AT AAAT C T T AAGCT T GAAAGGT T G AAAAC T T AT AAAAAAAAT AGT T T GT T C T ATT T AG AAAAAATT AT GT AC AT T C T AAAT AT AAC T T 
ATTTATTTTAAATGTAGATTTAATTTCCAGCGTAGAGATAACTAGTGTATGTTTATAATTCTTAAACAAATTATACTTAACAAAACTTGTATTTA 
TTCTTGGCGCCACTGTTCAAATTTCAAATGAACCTGCCAACTGAACTAACTCCCCAGTGGACCCCCATCCATGATTTATCGCACATAAGCGGCTA 
TCGGATTTCTTTCGCCACTAATTTGTTGTTTGGCACGCAGCAAGGAAAAGGAATAAAACGCTTTAAAACCAGTGCCAAAATAGCTTTCCCCAACG 
TTCGTTACTATATTTACGTTGTCTGCAAAAAGAGTTTGTCTAATTGGCCTCGATACTACGGTTGTGACGACTGCCCCGTCACTACAGTTTGCTGA 
GTGCAGTAACCGGGGCCTGGCTGAAACACTGGGAAAACCAGTGTTTAGTGTAAATAAATGATAGTGGCACCCATTCGGACATCGCAGGTGAATCC 
CCGGGAGCTCAAGACGCCCGTTATTTGGGCGCGCAGTGAATTTGTCAAAATATTTTGGTTAAAGTGTCAATTTGACATGTTATATGTCATGGTCT 
CTCGTGTGCCAGTGCGATAGTTATCGATAGATAGCCCCAACTTAATTTGTTTTAAATATTTTTAAATTTCTTCTTTACTTAATTCACTACTGTAA 
TACGTTCGTTTAATGTATAGTTCCTTGGAAATTAATCGATTTAGCAAGTAGACCGATTGGAAAGAAATCAAAAATCGATTTGAAATGCGTATTTG 
AACCTTAGTTGGCCATGACTTTGGTGTCTGAATGACGCCTGCCCTCAGAAAAAGTTAGTTGAATCTCAAAGTCTGTGGTGACAACACCTAAAGAA 
ACTAAAAAGTGACAGGTTTCTCTGTTTGTAAAAAAAAAAAAATTGTAATTAAAGTTCTGTTCTTTTCCTTTCAATATTTGCATACCGTTTCTGGT 
TGTGTTTTAATGAATTTTCTGTTTTCTGAAGCTTAAGTCTGAGCTGCAATGATCTCGCCGGAACAAGAAGAGGTCAACATGGTCCTGGATCGCCA 
TGTACGGCAGAATATCCAGGACATGATGCACGAGGCTCATGTCCAGGCGAGCCTGCTGGAGAACGAAGGTGAGTTTGCTATATTTCGGTCGAGTT 
CGGTCTTGTTCATGCCCGTTTTTTTTGCATGTCATCTGCAGGACGCGGCCGTTTCCACTCGGACTCAAGTTTGGACCAGGATTCTCTTCACGCTG 
ACGTCATCGTGGAGGAGGATCAGAGCACCGAACACGGCGCCAACCAGGTGCAAAACTACCACGACATGATGGTGGACAGTGAACACCACATCGAC 
ATAAATGGGTCGCTGCGCAAACGTCGTGGAAATCTTCCCAAGACCTCTGTAAAGATTTTAAAGCGTTGGCTTTATGAGCACCGCTATAACGCCTA 
TCCGAGCGATGCGGAGAAGTTCACCCTGTCCCAGGAGGCCAATCTGACGGTGCTCCAAGTGTGCAACTGGTTCATCAATGCCCGTCGCCGCATTC 
TACCGGAGATGATCAGGCGCGAGGGCAACGATCCTCTGCACTTCACCATATCGCGACGTGGCAAAAAGGTTAGCCCCAACTGTTCGAGATCAAGT 
GCGTTGGGCGCAAACTTGACTGGGCCGAATCCCGCACACGGAAGTCCTGCTTCCGAAGTGGTGTGTTATCTTCTGGTCAATCATTATACCACTGA 
AGTTACTAAATTGCTTGACTCCAGGTCGTTGGCGCTACAGAGGAGGTGGATGGCGCCGGTGAGATTCACGAGGGCATCGCCAATGTGCTGACCAA 
CTTTGAACAGTATGTGCAGGGCCCCAACGGACAAATGGTCAAGATGGAGCCGGAGTACGAGGACAGTGTCATCTACAGGTAACTGGTAACCGGTC 
TTTATGACCTGCTTCCCTAGCGTTAGTCACTTCTATCTCCTATCCCATCCCACCTATGTCCAAACTCCTTTTCCTCTTCCAAGCTGGCAGCAGGC 
CATTGCAAACAACCCGATGGGCTTTCAAAGTCTTCACTCCTCGCTTCAAGCAACCATGATCGATAAAATTAAAAACTATCAGATGCGCAAGGCTG 
CCGCCATTGGGGGTTCGGCGGTTGGCTCTGGTGGTGCTGGTGGGTCGAGCTCCAACTCCAGTCCTGCCACATCAATACTGCCATACAGCCTTTTT 
GGCCAGCTGCCGCCAGAGTTCGATGACGAAGAAAAGCCAAGACCTCCGAAGCGCGTAAGAACGCGGACGGTTGCTGCAAAGAGTCCTAGGGAGAA 
TGCCAAGCAAGCAAAGCAGAAGACTGGGAATAAGCAAGAAACCATGTACTGCTATAAGGATTCATACGGCGGAATCGTGGTGTCTCCTAGGTACA 
GCTTCTGCTCCCATTGACTGATCCAGAGCCTTAAATCTTACTCAAGTGCGAGTGGCGGTGCTAAGATTTTACTTTTACCCCCAGGTCCGAAGGCG 
AAGAGAGTGCACAGGGATACGAGTCCTGTGGGCCGAACAGCGAGGAGGAAGTACGCTTCGAGACCTCCCATGATTGGCAGAGCGTTATAAAGTAA 
GTAGCTATGTGGATGTATTAAGAAAATATTGTAATCTCGGCCTCTGCAATTCCAGAACTGTATTTGGTACTGAGGAAGTCAGCACTTCGGCTGGC 
AATAATCCAGGTACCTCTGGTTCTAAGGGATCTGTTCAGAATACCGCCAT 
{SEQ ID NO: 955) 

Exon: 1001.. 1227 
Exon: 1759.. 1873 
Exon: 1942.. 2377 
Exon: 2434.. 2470 
Start ATG: 17 59 

GAATAAAACGCTTTAAAAC^ 

CGATACTACGGTTGTGACGACTGCCCCGTCACTACAGTTTGCTGAGTGCAGTAACCGGGGCCTGGCTGAAACACTGGGAAAACCAGTGTTTAGTG 
TAAATAAATGATAGTGGCACCCATTCGGACATCGCAGATGATCTCGCCGGAACAAGAAGAGGTCAACATGGTCCTGGATCGCCATGTACGGCAGA 
ATATCCAGGACATGATGCACGAGGCTCATGTCCAGGCGAGCCTGCTGGAGAACGAAGGACGCGGCCGTTTCCACTCGGACTCAAGTTTGGACCAG 
GATTCTCTTCACGCTGACGTCATCGTGGAGGAGGATCAGAGCACCGAACACGGCGCCAACCAGGTGCAAAACTACCACGACATGATGGTGGACAG 
TGAACACCACATCGACATAAATGGGTCGCTGCGCAAACGTCGTGGAAATCTTCCCAAGACCTCTGTAAAGATTTTAAAGCGTTGGCTTTATGAGC 
ACCGCTATAACGCCTATCCGAGCGATGCGGAGAAGTTCACCCTGTCCCAGGAGGCCAATCTGACGGTGCTCCAAGTGTGCAACTGGTTCATCAAT 
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GCCCGTCGCCGCATTCTACCGGAGATGATCAGGCGCGAGGGCAACGATCCTCTGCACTTCACCATATCGCGACGTGGCAAAAAGGTTAGCCCCAA 
CTGTTCGAGATCAAGTGCTGGTGTGTTATCTTCTGGTCAATCATTATACCACTGA 
(SEQ ID NO: 956) 

Start ATG: 228 

MISPEQEEVNMVLDRHVRQNIGDiy^ 

RRGNLPKTSVKILKRWLYEHRYNAYPSDAEKFTLSQEANLTVLQVCNWFINARRRILPEMIRREGNDPLHFTISRRGKKVSPNCSRSSAGVLSSG 

* QSLYH* 
(SEQ ID NO: 95"? ) 

Name: Homebox Protein 
Classification : transcription_f actor 

Celera Sequence No. : 142000013384574 

T C AAAAT C GT C AGGT AAAT C T C AAT AG AAAT T AG AAT C T T AC G AGT C GAT T AAC AAT T GGT GT T C AC T TC T AG AAAC G ACG AAG AC GG CC T GG AG 
TTCGATCTGATTGGCGTGTATCCCGCCATTGCAAATGCCTTCCGTCGCCTGATGCTCAGCGATGTGCCCAGCATGGCCATCGAAAAGGTGTACAT 
ATACAACAACACCTCGATCATCCAGGACGAGGTACTTGCCCATCGAATGGGTCTACTTCCGTTGCGGGCTGATCCCCGTCTATTCGCATACCGCA 
CCGAGGAAAGCACTGAGGCGGGGACCGAGCAGGATACGCTTGAATTTGAGCTGAAGGTGAAGTGCTCGCGGCGACGGGATGCTGGAAAAGATCAG 
TCCAACTTCGATGACATTTACAAAAACCACAAAGTGTACTCGGGTCATCTGAAGTGGCTGCCCAAAGGAAAGCAGGCGCAGATATACAGCGAGAG 

T GC T GT G AACT G CAT T C ACG AT G AC AT AC T G ATT G C C C AAT T GCGT C C GGGT C ACGAGT T GG AT CTGCGCCTGGTGGCGGT T AAGGG AC T T GG AC 
GGGATCACGCGAAATTCTCGCCGGTGGCCACGGCCACCTACCGACTGCTGCCCATGATTAAGCTGAACCGCGAGGTGACCGGTAAGGATGCGTAC 

C TAT T AC AG AACT GT T T CTC AC C AGG AGT AAT T GGC AT T GAC GAG AAT G AAAC T GCC T AT GT AAAGG AT GC AC GC T AC G AC AC GT GC AGC C G AAA 
TGTGTATCGCTATCCCCAGCTAAACGACGCTGTGACTTTGGCCCGTATCCGAGATCACTACATTTTCTCCGTGGAATCAGTCGGCGCACTGAAGC 
CCGATGTCATCTTCCTGGAGGCGGTCAAGGTATTGAAGAGAAAGTGCCGGGCGCTAATTGACGAAATTGAGGCAGAATAGTTATTTATTTAGTTG 
TTAAAGAAATTATATTATGTACATAAGACAGCTTGATATAAATTGTAAGCCTACGAAGATCGTGCTTGCTCTCTCTTGCGCTTGAGCACCTTCAG 
GCGAAATTCCGTTAGGACCACCTGCAGACGCTCCTGGCACATTAGTGCGTCCTCCAGGTCCATGTTGTTCAGCTGCAGAGCCAAACTCTTGCCAA 
ACAAGTCAAATTCACTAATAATGGTGTCTTCCGTCATCGCGTCGACTTGGTATTCGTTATTAATCTCCGTATCTTCCGGCTTGAGCTGTATCATC 
TCTTCCTCGTAGACCACCTCTTGGTCCTGGAGATCTGTTTCATTCTGTTGTACTTCGAGGAACTCGTTCTCAACCTCAAATTCTTGCTCTTGATA 
GGGCTGATCTGTTCTGAGTAAATCATCCAGGATTGAGTCGTTCGAGGCGATTTCGTCCTCCTCCGTCTCCTCCTCCATGCTCTCCACCTTCGTGT 
CGTTCTTGATCTTTGGCTGCATTTTGCTCCTTGGCTTTTGCCTTGGATAGGTCTTTGATTTGGCTTTGGGCTTTGCTTTGGGTGAAATTATGGAG 
GCCTCCAAGTCTGGGTCATGCTCATCTTTGACAAGTTCACCCTTAATCATATTCTGTAGGTATAAATACGGAGTTGTTAATTTTATGTAAATTAG 
TAATGAGATGAGAATTTTTTTTTTGGAGCGATAAGTTGTGCATCGCATAGAGGGAGCTACTGGTTTGGTTAACCATTGCACAAGAAGTAGTCAAT 

GGAAT AGT T AAC G AAAAGT AT G GC G AAAAC T C AC AC AAAAAT CG T GT T T G AAAT T T AGT T T GG AT CGC AAAAT AT T T G AAT C AAT AAT GG ACAC G 
GGTCATTCCTATTGCTTATGCACAAACCCAATCCCACCTTGGCAAGAATGTTGTGGTACTTGAAACTTTCCTGGTGTTTCCGCGTTGCGAGGTGC 
TGCTTGATGGACGCTAGTCTGGCGAGGACATTGATGTCGCAGATCTTGCAGTGCGCCGAGTAACCGTCATTACTAGACGCCAGCCAGCTGGAGAG 
CTCCGGATATTTCTCCCACTCTGTGCGATAGCGTTGGTGGTACAGTCGCTTCACCTTCTTGTCCTCATCCGTCTTGTAACCCTTCAGCATTGTGC 
CCGCTTTCGGATGAAGCTTGTCTGATTTGTGGAAAAGTATTCTTTGAAAGATGAGCGCCAAAAGCAGTCTTT AAT AGCT GAT GGTGGT AAAT ATT 
TTGGTATATTCCAAAAATTTGCGACGCGAAAAAAATCTGAAAGTCGATGCGGTGTGACTGTAATCGAATACGTTATAATACCAGCAATTAATTCC 
AGGGTTGCGACATAGAGGTTGCTGCGCAACGGTCACACTAAGCTGTTTTGTTTATATAAACATTTTATGCCTTCAAGTTTATATTAGAGGCGTTT 
ATGGTTGTGTTATGTACTTTTTGAAAATACCTTACTTGTACATAGTTCAAATTAATTTAGTTTGAGATTTTAAATAAAATATTCTTTCATGGTTT 
GGGTCTCAGGTAAAGTTTAAGCAGTCACAATACGGCCACACTACACACTCATAGAGAAAAAATATAGTAAATAAAAAGGCTTTGCCGAATTCAGG 
ATAGCTTTTGCCTGCACCGTGGGTGCTCAGCCAACATGGTCATTCCAAAGGAGATCCTAAGCGACTTCAAGGAGCTATACGAGAAGGCCAACCGT 
CTGGTGGAGGAGGAGAGCCGCAATGATCCGCCCACGGATCCATTCCGCTCCCATTATAAAGCGCGCGATGTGCTAATTGTCTTGAAGAAGCAGCT 
GGACGACCAACTGGTATCTGTGCAGGCCAGCGAGGAGGACGGTGGTCAGGACGATCGCTGCTATCACTCTCTGTTGGCCTTCGTTTGCCGGGATC 
TAGGCAGGATCTACATCTACACAGAGGAGCAGGCAGAGGGCGAAAAGATGCTCAATCGTTGTCTGGAACTAGTCACGCCGTTTAAGGAGTGCCCT 
GAGGGCATCATTCCATTCATTGGGGCAATCAACGAGCTTAGCATTGTGCTGGCCTCCAAGGAAGAGTACAATAAGGGACTGGAAATTCTCCTGGA 

GGC AG AAAAAAT C T AC G AGG AC TT C AAAGC C AGT GGT CT G AAGC C C C TGGC C AT T C AAG AT G T GTT C AAT C C T C C GGAG GAG GG AC AAC AAT CTC 
ACGAAGCTGGTCCCAAGGAACTGGAATCCCTGTACACGCTAGTCTCATTTTACATGGCGCAAATGTACGGGCATTTGGGTGAGCCGGAAAAGTCG 

GCCAAGTGCTGC 
(SEQ ID NO: 958) 

Ex on: 2242. .1876 
Exon: 1573.. 1001 

Start ATG: 2085 (Reverse strand: CAT) 
Transcript No * CT25498 

TCACACCGCATCGACTTTCAGATTTTTTTCGCGTCGCAAATTTTTGGAATATACCAAAATATTTACCACCATCAGCTATTAAAGACTGCTTTTGG 
CGCTCATCTTTCAAAGAATACTTTTCCACAAATCAGACAAGCTTCATCCGAAAGCGGGCACAATGCTGAAGGGTTACAAGACGGATGAGGACAAG 
AAGGTGAAGCGACTGTACCACCAACGCTATCGCACAGAGTGGGAGAAATATCCGGAGCTCTCCAGCTGGCTGGCGTCTAGTAATGACGGTTACTC 
GGCGCACTGCAAGATCTGCGACATCAATGTCCTCGCCAGACTAGCGTCCATCAAGCAGCACCTCGCAACGCGGAAACACCAGAATATGATTAAGG 
GTGAACTTGTCAAAGATGAGCATGACCCAGACTTGGAGGCCTCCATAATTTCACCCAAAGCAAAGCCCAAAGCCAAATCAAAGACCTATCCAAGG 
CAAAAGCCAAGGAGCAAAATGCAGCCAAAGATCAAGAACGACACGAAGGTGGAGAGCATGGAGGAGGAGACGGAGGAGGACGAAATCGCCTCGAA 
CGACTCAATCCTGGATGATTTACTCAGAACAGATCAGCCCTATCAAGAGCAAGAATTTGAGGTTGAGAACGAGTTCCTCGAAGTACAACAGAATG 
AAACAGATCTCCAGGACCAAGAGGTGGTCTACGAGGAAGAGATGATACAGCTCAAGCCGGAAGATACGGAGATTAATAACGAATACCAAGTCGAC 
GCGATGACGGAAGACACCATTATTAGTGAATTTGACTTGTTTGGCAAGAGTTTGGCTCTGCAGCTGAACAACATGGACCTGGAGGACGCACTAAT 
GTGCCAGGAGCGTCTGCAGGTGGTCCTAACGGAATTTCGCCTGAAGGTGCTCAAGCGCAAGAGAGAGCAAGCACGATCTTCGTAG 

{SEQ ID NO: 959) 

Start ATG: 158 {Reverse strand: CAT) 
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MLKGYKTDEDKKVKRLYHQRYRTEWEKYPELSSWLASSNDGYSAHCKICDINVLaRLASIKQHLATRKHQNMIKGELVKDEHDPDLEASIISPKA 
KPKAKSKTYPRQKPRSKMQPKIKNDTKVESMEEETEEDEIASNDSILDDLLRTDQPYQEQEFEVENEFLEVQQNETDLQDQEVVYEEEMIQLKPE 
DTEINNEYQVDAMTEDTIISEFDLFGKSLALQLNNMDLEDALMCQERLQWLTEFRLKVLKRKREQARSS* 

(SEQ ID NO: 960) 



Pplpra Seauence No. : 142000013384574 

CTTTTGGGTACCAAATCTTAGGCGTTCGGTGGTCCATAGTCATGTGCTGGACTGCTACATGTCCGTGGTGGTGACGGAACGCACGCTGGAGCAGA 

TCCATGAATGTCATGGCTTTGATCACTACCTCCTCAAGAATCGTGCCTGCGATCTTCGCTCCGCTTTGGCTCTGAAACTCAAGCGAGAGGTGCTT 

CAGGCCCTGCAGAACGGTGTTCCAGCCCTCGCCGACGAGCCCGAACGCCAACAGGAAGTGCTTAAGGAATACCGCCGCTATCTGGAACCATATAC 

GCCCGAGGAAATCGATTGGTACGGCCATACCTATTTGGAGGCCATACGCAAGCTGCAAAAGCAGCTGCGCGAAGCGGAAAAGGTGGTGCCGCATA 

AGTTGGAGTTCCGCGGAAAGCTTATCGAGCAGCTGCGCCAGGCGGGGATCAGCGAAGCGGGAAAGCTGGAGAAGCCGGATGCATTAGCAGCAGAA 

TCGTCGGTCGAACACAAGGATTCGGATATCGAAGCGCTTACAi^GCTGTAAGTCACAAATACTTCTGTAATGAATTTCCTAACCAATGATTTATT 

CCAGGGAATCCTCCCCCTCTTCCAGTTGGCTGTCTAAAATCAATCCTTTCGGCAAGAAAGAAACCTAGAAAAGCTGTCCGCGTTTGTTTTTGATT 

TATGTGTTTAGCCTCGAACGAATAAACATAACTTCGGAAACATTTCGTTACAATTTGAAGGTATTAAAACGTTGTTACTCGAACGTCTTTATTTC 

AAATATTCGTTACATCTGGTACTGTATAAAAACAAGTAGAATCACACACTGAACATGAACGTACGCGGCAAAGTAGAAAAAGGAATGATTGCAAA 

TCCCCACGAGAATACAGAAATACATTGACAAATCAACTACGCTGATCACTTTACGCTTAGACTATGGTTAGTGTATAGCTCCGCAAGCGGGCAGA 

GAGCTTTGTGCCGTAAATGATCCCAGTCCGAGCTGGAAGCCAGGTAGGCCTTAAGCGATTGGATTTTTCCTCATTAATTCGATGTCAK 

CATGTCGCTAACCAATTCCACGAAGGTAACTTTAGGCGTCCAGTTTAGCTCTCGATTGGCTTTGGACGCGTCGCCTTGCAGCAGATCGACCTCCG 

TCGGCCGAAAATACTTTGGATTTATGCGGACGCGCACAATACCCGTGCCGTTCTCCACTCCCACCTCGTCCACACCCTTGCCCTTCCACGTTATC 

TCTCGATCAATATGCTTGAACGCGGCTTCCACGAATTCGCGCACACTGTGCGTTTCGCCTGTAGCTATGACGTAGTCGGAAGGAGACTCCCGCTG 

CAGCATCATCCACATGGCCTCCACGTAGTCGCTGGCATGTCCCCAGTCGCGCTTTGAGTCGAGATTGCCCAGTTCAAAATACTCCATTTGCTTGT 

GATAAATCTTGGCCACGCTACGGGTGATTTTTCTAGTCACAAAGTTCTCGCCACGTCGTGGACTTTCATGGTTGAAGAGGATTCCATTACAGGCG 

TACATGTTGTAGGCTTCCCGGTAGTTGATGACGATCCAAAAGCCGTACATCTTAGCGCAGGCTAAAGTTAGGTGGGATTAAGTATTACTTTTAGC 

ATACATTATCGAAGCTGCTGCCTTACCATATGGCGACCTGGGATAGAAGGGCGTCTGTTCGTTCTGTGGCGTCTCCACCACTTTACCATATAGCT 

CCGAAGTGGAGGCTTGGTAAAACCTCACGTTCTTTTCCATGCCACAAGTGCGGATGGCGTCCAGAATGCGCAACGTGCCCACGGCGTCCACCTCG 

GCCGTGTACTCACTGAGATCGAAGGAGACCTTGACATGGGATTGTGCTGCCAAATTGTAAATCTCCGTGGGCTTCACCATATTGATGATCTTCAC 

CAGGCTGCTGCTGTCCGTCATGTCGCCATAGTGGAGCTTCATACGCCCACCTTTGTGCGCCTTTGGATCGGCATATAGATGCTCAATGCGGGTGG 

TGTTGAAGGTGCTGGCCCGCCGAATAATGCCGTGCACCTCGTAGTCCTTCTTGAGCAGAAACTCTGCCAAATAGGAGCCATCCTGGGCGATAGGG 

AACAAGTAAGAATCGAGCACTTTGAACACGTAAGCACGGAATGAAGGAGATGCCCCTGAAACAGCCGTGACTAATTCATCAATTACACACCTGGC 

CCG^ATG^CTGTTATCAGCGCCACTTTATCACGGGAGTCGCCTTCGGCGCCGGCTTCCGTTCCGTTTTGGTCCTTGCTGCCGTTGCTGCTGCTT 

TCAGGACGTTGCTTCTTGGTTTCCGGGGCCCCATCGGAAGTCGACATAGCAATTAGCCGGGTATTTAGCATTTACTGACGCTAGTTTTAATTTTC 

ACGCCGACGCAGCCCGGATAATAAACAAACTTTTGTCTACAGCACTCAGGTAGAGGTGAATAAATAATCGATTGGTTATCGATACATCGCAATCG 

AAGCACTTGTACGGGACAGTGGGGCAATCTATTTAAGACAGCCTAAATTATTTTCAATAAGCAAAAAGTAAATATTTTATCACAAAAAACAAATT 

ATGTATGGAATAACATTTTAAATAAGAATTCAGCGCAAATTTAAAATTAAAACCAAATCTGGTTGAATAACACTTTATTTACTTTACTGCCAATG 

TTGCACTTTTTTTCGATTTTGTTCCACTTTGTGGCAGTGTTGAGTAGCAGGAAAACCCACGTAGCGATTTTTAGCGCATACCTTTCAGCGAAAGC 

CGGACGAAAATAACATAACGTTGGCACAACCACCTATCTTATCATTCCTCCTCCACTCCCAAGCCGATTCGCCACCGGGCATCAATATCTTCTTC 

GGCGGAACCCGCATTACGTGTGCGTAAACGGCGGCTTTCAATTTCATTCTCGGAGCGACCGGCCGGCCAGCTTGATAACACCGGTGCATAGTGAA 

ACCAGCGCCGCTTACTAATTAAGTACGACCACGACCTCCGAGATAAACAGAAGTCAATTGTTCGCAGCACAAAAATGACGGATCTTATCAGGCAG 

GGTGCTCTGTTCCTTATGAGCGAGAAGTGCTATGATAACTACTTCCTCTATCACAACTTTTTAGATGGTGAGTGGCTGCTAATTCAAAAAGGTAT 

GTGCCTATTAAGTATCAATTAACATTTCTCGTAGTTCCGTGCTTCAAGGCTTTGCTGAGCAAGGGCTTGGGATTGGCAATTATCGCAGGTTCCGT 

GCTGGTCAAAGTACCACAAGTGCTGAAGATCCTTAACAGCAAGTCTGGCGAGGGCATCAACATTGTGGGCGTAGTGTTGGACTTGCTGGCCATCT 

CCTTCCACCTGTCGTACAACTTTATGCATGGCTATCCGTTTAGTGCCTGGGGCGACAGTACCTTCCTGGCCATCCAGACCGTCACA 

(SEQ ID NO: 961) 

Exon: 2411.. 2181 
Exon: 2077.. 1642 
Exon: 1581.. 1001 

Start ATG: 2327 (Reverse strand: CAT) 

Transcript No * CT25524 

ACA&AAGTTTGTTTATTATCCGGGCTGC^ 

CGATGGGGCCCCGGAAACCAAGAAGCAACGTCCTGAAAGCAGCAGCAACGGCAGCAAGGACCAAAACGGAACGGAAGCCGGCGCCGAAGGCGACT 
CCCGTGATAAAGTGGCGCTGATAACAGGCATAACGGGCCAGGATGGCTCCTATTTGGCAGAGTTTCTGCTCAAGAAGGACTACGAGGTGCACGGC 
ATTATTCGGCGGGCCAGCACCTTCAACACCACCCGCATTGAGCATCTATATGCCGATCCAAAGGCGCACAAAGGTGGGCGTATGAAGCTCCACTA 

T GG CG AC AT G AC GG AC AGC AGC AGC C T GGT G AAG AT C AT C AAT AT GGT G AAGC C C A CGG AG AT T T AC AAT TT GG C AG C AC AAT C C C AT G T C AAGG 
TCTCCTTCGATCTCAGTGAGTACACGGCCGAGGTGGACGCCGTGGGCACGTTGCGCATTCTGGACGCCATCCGCACTTGTGGCATGGAAAAGAAC 
GTGAGGTTTTACCAAGCCTCCACTTCGGAGCTATATGGTAAAGTGGTGGAGACGCCACAGAACGAACAGACGCCCTTCTATCCCAGGTCGCCATA 
TGCCTGCGCTAAGATGTACGGCTTTTGGATCGTCATCAACTACCGGGAAGCCTACAACATGTACGCCTGTAATGGAATCCTCTTCAACCATGAAA 
GTCCACGACGTGGCGAGAACTTTGTGACTAGAAAAATCACCCGTAGCGTGGCCAAGATTTATCACAAGCAAATGGAGTATTTTGAACTGGGCAAT 
CTCGACTCAAAGCGCGACTGGGGACATGCCAGCGACTACGTGGAGGCCATGTGGATGATGCTGCAGCGGGAGTCTCCTTCCGACTACGTCATAGC 
TACAGGCGAAACGCACAGTGTGCGCGAATTCGTGGAAGCCGCGTTCAAGCATATTGATCGAGAGATAACGTGGAAGGGCAAGGGTGTGGACGAGG 
TGGGAGTGGAGAACGGCACGGGTATTGTGCGCGTCCGCATAAATCCAAAGTATTTTCGGCCGACGGAGGTCGATCTGCTGCAAGGCGACGCGTCC 
AAAGCCAATCGAGAGCTAAACTGGACGCCTAAAGTTACCTTCGTGGAATTGGTTAGCGACATGATGAAGGCTGACATCGAATTAATGAGGAAAAA 

TCCAATCGCTTAA 
(SEQ ID NO: 962) 

Start ATG: 85 (Reverse strand: CAT) 
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MSTSDGAPETKKQRPESSSNGSKDQNGTEAGAEGDSRDKVALITGITGQDGSYLAEFLLKKDYEVHGIIRRASTFNTTRIEHLYADPKAHKGGRM 
KLHYGDMTDSSSLVKIINMVKPTEIYNLAAQSHVKVSFDLSEYTAEVDAVGTLRILDAIRTCGMEKNVRFYQASTSELYGKVVETPQNEQTPFYP 
RSPYACAKMYGFWIVINYREAYNMYACNGILFNHESPRRGENFVTRKITRSVAKIYHKQMEYFELGNLDSKRDWGHASDYVEAl^MMLQRESPSD 
YVIATGETHSVREFVEAAFKHIDREITWKGKGVDEVGVENGTGIVRVRINPKYFRPTEVDLLQGDASKANRELNWTPKVTFVELVSDMMKADIEL 
MRKNPIA* 
(SEQ ID NO: 963) 

Name: GDP-mannose 4 , 6-dehydratase like 
Classification: enzyme 



Celera Sequence No. : 142000013384 668 

AAAAG AT AT G AAAG AGCC C CC GTT AAGC T AT T C AAAAAC AAAAC AAT G T GC AAC AT GT T T T AAAT G C TT AAAT AATT T T AATT TG AT AT AA T TC A 
CAAGCGTTATACATTTTTTCCCTTAGACAACCCTATGAAGTGAACCTGATGCTTCCCACATAAGTAAACATCTTGGATGCACTCGAATTTCGTCG 
CCAGGCACCGCCAGTCTGGATCGAACTAAGCAAAGTCCAGAGCACCGCGATTGAGTATTGGGCGTTTCAAATTTCCTGGCTAATTTCTCACAATC 
AGTTAATGCTAACAAAAGTACTTATGCGGCGGATAGTTCCTAAGAATGTGGGAGTATGACTTTAAGATACGATAAACTTACTTAATGCAAATAGG 
AGTAAGATTGAAAGTTAATAATTAGTTTTAGGTACCCATAAGATGTTTTTTTCTTCGGCTCTCAAAATCTGCTTTATTTAGTTTTCGTTCTTTAA 
ACTATGTATCTTTTAAGTGGGCAGTAAACAAGCTTCATCCACTAAAAGCCCAAACAAATATTTACTGCCTTTTTTTGCTACTATAATTGGTTTAT 
TACTGAGCTGAGAACCTCTCCCAAAGAGATCTATTTAAATGCCGACTGAGAGATTTTAAGAACAGAGAGAGAGTTCCATTTTGTTTATGAAAAGC 
TAATTGCCAAATTGGGAAGCCAAAGCCTGCTGGAATATTGCATTCCATTTAAGAAAAGAGGGTAGGAAGAAGAAGAACTGGAAACGTGGCTAAAT 
ATGTTAACTAATTCATTAACTTCGTTTCCTTTTCAACGCCCACTGGAACTAGAATCAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 
GAGCCAAACTTAAAATTAAAACACTACCTTTTTCCAGTTAACCGGCCGGCTTACACACACAGTCAAACGCCTGAAGCTTGAAAGAGAGTGTTTGC 
TTGATGAGAGCGGCAAGCGTATAAAAGCAGCGTCGCCGGCAGAAGAAAGTCAGTTGCAGTGCAGTCGTCGTGACGATCGGTTCAGTGATAGCTTC 
CCAGTGGCTGGAGAGAGTGAGAGTAGAGAGTAACGGAAAGAGAGTGCGAATCGAACCGAACGCTATTCCTTCGCCCACGATTTAAGTGAACGCTT 
ATTAGCGAATTGAAGACCCGAGAGCCCCAGTGCCCAGCTGATTTCGTATTCTTTGTTGAGGTAAGCCAAGGAATCCCTGGAACCACCTCATTTCG 
TAGCACCTTAGTGATGTAATTACGCACCAAGATTTACCCTTGCCAAAAAGTGGAAAACGCTTTTGCGAACTTCGATGCCGAATGGTGTTCGTCTC 
GAATCGGTTCTAAAATTACACACAATGGTTTTTTGGCTTCGATTGATTTGATTTGATTTTTGGGTCAATCGATTCGAAATCGAAAAGTTTTGAAC 
TCGTTTCGCAGAATCGCAGTTCGCGCGGATCTCTCGCCTCGCTCGCAAGCAAATATGTGAAGGACAACGACGTTAACCGATCAACTAGAAGCCCT 
CTTCTTCTGCCTCTTCAGAACCGATCAGATCGGAATAGAAATATAGAAACGACCATAGCACCTCCGTCGACTGTGGAGTGCGTGAACGTATTTCG 
AAGCGTGAGGTACTCGAACCGGAACCAACTTGAGTGTAGTGCATATTCGGGATCTTATCCAAACGCAGGCTTACATAACCCCTATTTTAATACCT 
T T T CAAAAAGGAAT AATT GAAATT C AAAAC AAAATT ATT TAGC AAC AAACAAAC AAT C AGT TAT AT T T T T G AACT G AT AT GC C TCGGGGCTAGGG 
GTCCCAGACTTTCAAATTCTAGTGATAACCCCCCCGATCGGCGAGTTCATCTGGAAATGTTGCCCATATGGGCCGGGTGCACGTGAATCTCTTCA 
GTGCCCATTGACGCACTTTGTGTGCCTCCACTGGGTCCTTGACCCGCTGTGATCCCTGCCCTTTCCCATGGATTGGATAATCTAGGTGTCATTAT 
CTTCCCTAACCTTTTCCCACGATTAAAGTGAAACCATTTGCCACGATTTTAGTGCCGTAGTGCCAGATCAAAGAGCCAAAATAGATCAGAGAATC 
GAAATTTAATTAAATCATGACACCGAAACACCGAGACAGACCAAATTAAATTCAATTGAAGCTGTCATGGCAACAAGTGCCAGCAAAAAAAAGGA 
AGAGCACTACGGCGAAACCAGAAGATAACACCGAAAAGTCAAATCGATTGTTGCAATTTGAGACTCTCCGAAAGCGAAAATTGCAGCGGAAGAGG 
CGGCCAATCGGCCATTACTAAACGGTGATGGGGTTGGGGAGGGGATCCGAAGTCTTGGTGACTGACTGGGCGCCCTGCCAAAATGATGGCAAAGG 
CAGAGACACCATGATTCCTCAGCCAAATCGAACGAAAACGGGTTTACGCTCTACATGGACCCCATTGCACACTGAATCCGTTTATCCAGCGAGGC 
AAAAAGTGCCGACGACAATGGAAACAGGGGCGTAGTATCCGTACCTCCTCCGTCCTACGATTCTTCCAGCAGCGCCAACGCCCCAAAAGATATTG 
ATTCCGACTTTTGCGCGATTCAGCGAAAAACATAAGGGGACCAAAAGTGGGCAGACGGGCGGAGGAGTGGTGTCTGTTTCTTTGATATGCGTGGG 
TGTGTTGTCTGTGCGGCAATCGATGCACTTCCGGTGGGTTCTATGGGGAAATACTCTTGCGTTTCAGTATATATCTACTATATATATTGCAATTT 
TATGGTAATTATATTTTCTGTCAACTTTTGCCTTGACTCCGCTTGATTTATTGCATTGTTCTTGAAGAATAGCCTCATAAGGCATGCAAGCAAAC 
ATGGCTTTCGAACTAGCGCAAAATTAGCTAGTAGCTTATAGAGTATATGTAGCTTACCCCCAACGGTTTTTATATCATTTGTTTAAACCACTTAA 
GAATATTCGTAGCCTTGTGGACACACTTTATCGTGGATCGTGGAAGTTATACGACACCGTAATATATCCTTATCACTCACGCATATTCTGGCCCA 
CTTCAGCTAGAAACGTTTGGCGCGTTCGCCTTATCATAAAAATCAGGCTATCGACTGATTAACTGAGTTTAATAAGCCAGCAAACGTAAACAAAA 
ACTACGAAAGTTCTGGAAAACTAACCAAGGCTATTTCAAAACGGCGTGAAATGGCTAGCACTCTAAATTATTGAAGTTTTCAACATATGTATATC 
TAATCTGATAGCCAAGAAAACATAATGGTTACCAACTTTCCATATACACCGTAATTTATTATTGATGAATATCAAATTATCGATAGTACTTCCCC 
GATAAGATTCTGTTAACACACATTGCTCTACTTTTCTATCAGGGAATTGCATTGAGAAAGCGCAGAGATGGCCGATGAAGCACAAGCACCACCAG 
CTGAGGGAGCACCACCAGCCGAGGGAGAGGCGCCACCACCCGCCGAGGGAGCCGAGGGTGCCGTCGAGGGCGGAGAGGCCGCTCCACCAGCAGAG 
CCCGCCGAGCCCATCAAGCACAGCTACACGCTCTTCTACTTCAACGTGAAGGCGTTGGCCGAGCCCCTGCGCTACCTGTTCGCCTACGGCAACCA 
GGAGTACGAGGATGTGCGCGTCACCCGCGACGAGTGGCCAGCCCTGAAGCCCAGTGAGTGCCATAGTCCATCATTGAGTGGTCAGTGGACTAACC 
ATAATCTACTCTGTCTCATCCAATAGCCATGCCCATGGGACAGATGCCCGTGCTGGAGGTGGACGGCAAGCGTGTGCACCAGAGCATTTCGATGG 
CTCGATTCCTGGCCAAGACTGTCGGCCTGTGCGGCGCTACTCCGTGGGAGGATCTGCAGATCGACATTGTCGTTGACACCATCAATGACTTCCGT 
CTAAGTAGCGAACAATGTGTGCCTCAGTTTCATGTTCAGTGTCCTTGATTAACTCTGGAATTTTTAATGTATAGAAATTGCAGTCGTCTCGTACG 
AGCCGGAGGACGAGATTAAAGAGAAGAAGTTGGTCACCCTGAATGCGGAGGTCATTCCATTCTACCTGGAGAAGCTCGAGCAGACCGTCAAGGAC 
AACGATGGTCACCTGGCTCTGGGCAAGGTGCGTGCTCAAATGAGAACTTGAAACCATACACTCGTTAATTTTGAATGAACAATACTAAGAACCAT 
CGATTCCTTTCACCCACAGCTGACCTGGGCCGACGTCTACTTCGCAGGCATCACCGACTACATGAACTACATGGTCAAGCGCGATCTGTTGGAAC 
CCTACCCAGCTCTGCGC GGAGTCGTGGATGCCGTCAACGCTTTGGAACCGATCAAGGCCTGGATCGAGAAGCGCCCCGTCACCGAGGTCTAAGTT 
CACCGAACAGGACTTGGGAGGATGGGAAAGCATTGCACAACCACTCAGACACAACCGCACCGCAGAAATGCACACCACCCACATCTGCACACATT 
TTGTTCCTTTTTGCGGAGGATCCGAGAGGACGGCGGATTTTCGCACGGGGTCATAAACGCGTAGGAGTTTAGCACGTATCACTATGGATAAATCG 
TATTTGTTCTTAATTTCTTGGTTATAATTCTCTTTGCATGGCGCTATATACATATTTCTATTGGGGTCCGGCAGCGGATCGCCGTTCGTTAGCTT 
CTTCTGAAATGATGAAAGATAATACTTATTGCACAGTTTATAATATTTTGGCTTACTTTTAGAACATACATTTAGCGCACCTATGCAGACATTTG 
CTGTACCTATATATATATATATATGCATAAAATACAAAAGCTACGAGTGGTTCATCTTATAGGCGTATTCTATTGCAATTTATTCAATGCGGAGG 
GCTCAATAAATATTTAAAACAACACCAGCTAAGGCTGAGAAACAATTACAAATTCTTATTTTGTTTTTTATACAACACGCCATCGCGGCCCCAGG 
AATTTTCTTCGTCTCATCCAGTTCCCTTTTAAGTAGCCATTAAAAAAAGACTCACATCAATAGAACCGGTTTCTCCAAAAAAAAAAAAAACAGAC 
TCTTACTGCCAAATTTCAAATGCACATAATCACATTATTATATACATATTATTTATTCAAATTTGTAATTGTATTTTTTGAATATTTTCACAAAA 
CACTTTTTAATAAACATAACACCTAAAAAGGAATGAGCACTTTTTAAAAGCACGACAACTGGTTTTATTTAATTTTTGTACGATGTATTGGAATA 
ATTTATTAATTGTTGACATATTTAAGAAATTTAAAGAAAGTGTTTATTTACATTATTAATTTTTATTTTTATTATCCTTTCAATTTGCATTTCCA 
AAGCCGACCCGACAGACATAAATAAATGCATGGAGTTGTGTTCAACATTGTTTTATAAACGTTTAAAAAAATAACATTTCAAATTAACATTCTTT 
TAAGTCACTTTGGACAATTCAAGCTTAAAAGTATAAATGCACAATTGGTCAGTTTTGATTCATTATTTCATATTGTATGTTGCGGTTACTGGTGC 
TTCCAGTTGGGTGGGCGCTTCTCGAAGAAGCTAGCAATGCCCTCTTTGGTGTCGCCCAGCTGGAAATTTTCGCACATCTTCTCCTGCGCAGCCGA 
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AAAGGCTTCCGCTTGGGACATGGCCAGTTGCTTGTAGTAGAACTCTTTTCCCAGAGAGATGACGGCACGACTTTTTGCCTTTATGGCATTGGTGA 
TTTCCTCGATCTCCTTATCTAGTTCCTCTGCAGGAACAGCTTTGGTGACCATTCCCGATATGTAGGCTTCTTCACCAGTAACGGGAAGACCGGTC 
ATTAGCATATAGGCGGACTTTGGACGCGACATAATCCGGGCAACTGCGACTCCGGGTGTTGAGCAAAAAACTCCGACTCCAGCCCTAAAAATATA 
AATTATAACATGATTAACTGGTTTCATAGAGGGTTTTCATCCTTACCCTGGAGTAGAGAACTTG 
(SEQ ID NO: 964} 

Exon: 1001. .1200 

Exon: 3368.. 3663 

Exon: 3732. .3899 

Exon: 3970.. 4112 

Exon: 4200. .4954 

Start ATG: 3393 

Transcript No. : CT25660 

C AGT T GC AGTGC AGT C GT C GT G ACG ATC GGT T C AGT G AT AGC T T C C C AGT G GCT GG AGAG AGT G AG AGT AGAG AGT AAC GG AAAG AG AGT GC G AA 
TCGAACCGAACGCTATTCCTTCGCCCACGATTTAAGTGAACGCTTATTAGCGAATTGAAGACCCGAGAGCCCCAGTGCCCAGCTGATTTCGTATT 
CTTTGTTGAGGGAATTGCATTGAGAAAGCGCAGAGATGGCCGATGAAGCACAAGCACCACCAGCTGAGGGAGCACCACCAGCCGAGGGAGAGGCG 
CCACCACCCGCCGAGGGAGCCGAGGGTGCCGTCGAGGGCGGAGAGGCCGCTCCACCAGCAGAGCCCGCCGAGCCCATCAAGCACAGCTACACGCT 
CTTCTACTTCAACGTGAAGGCGTTGGCCGAGCCCCTGCGCTACCTGTTCGCCTACGGCAACCAGGAGTACGAGGATGTGCGCGTCACCCGCGACG 
AGTGGCCAGCCCTGAAGCCCACCATGCCCATGGGACAGATGCCCGTGCTGGAGGTGGACGGCAAGCGTGTGCACCAGAGCATTTCGATGGCTCGA 
TTCCTGGCCAAGACTGTCGGCCTGTGCGGCGCTACTCCGTGGGAGGATCTGCAGATCGACATTGTCGTTGACACCATCAATGACTTCCGTCTAAA 
AATTGCAGTCGTCTCGTACGAGCCGGAGGACGAGATTAAAGAGAAGAAGTTGGTCACCCTGAATGCGGAGGTCATTCCATTCTACCTGGAGAAGC 
TCGAGCAGACCGTCAAGGACAACGATGGTCACCTGGCTCTGGGCAAGCTGACCTGGGCCGACGTCTACTTCGCAGGCATCACCGACTACATGAAC 
TACATGGTCAAGCGCGATCTGTTGGAACCCTACCCAGCTCTGCGCGGAGTCGTGGATGCCGTCAACGCTTTGGAACCGATCAAGGCCTGGATCGA 
GAAGCGCCCCGTCACCGAGGTCTAAGTTCACCGAACAGGACTTGGGAGGATGGGAAAGCATTGCACAACCACTCAGACACAACCGCACCGCAGAA 
AT GC AC AC C AC C C AC AT C T GC AC AC AT T T T GTTC C T T T T T G C GG AGG AT C C G AGAGG AC G GC G GAT T T T C GC AC GGGGT C AT AAACGCGT AGG AG 
T T T AGC ACGT AT C AC T AT GGAT AAAT C GT AT T TGT T CT T AAT T T C T TGGT T AT AAT TCTCTTTGCAT GGC GCT AT AT AC AT AT T T CT AT T G GGGT 
CCGGCAGCGGATCGCCGTTCGTTAGCTTCTTCTGAAATGATGAAAGATAATACTTATTGCACAGTTTATAATATTTTGGCTTACTTTTAGAACAT 
ACATTTAGCGCACCTATGCAGACATTTGCTGTACCTATATATATATATATATGCATAAAATACAAAAGCTACGAGTGGTTCATCTTATAGGCGTA 
TTCTATTGCAATTTATTCAATGCGGAGGGCTCAATAAATATTTAAAACAACACCAGCTAAGGCTGAGAAACAATTACAAATTCTTATTTTGTTTT 

TTATACAACACGCCATCGCGGCCCCAGGAATTTTCTTCGTCT 
(SEQ ID NO: 965) 

Start ATG: 226 

MADEAQAPPAEGAPPAEGEAPPPAEGAEGAVEGGEAAPPAEPAEPIKHSYTLFYFNVKALAEPLRYLFAYGNQEYEDVRVTRDEWPALKPTMPMG 
QMPVLEVDGKRVHQSISMARFLAKTVGLCGATPWEDLQIDIVVDTINDFRLKIAVVSYEPEDEIKEKKLVTLNAEVIPFYLEKLEQTVKDNDGHL 
ALGKLTWADVYFAGITDYMNYMVKRDLLEPYPALRGVVDAVNALEPIKAWIEKRPVTEV* 
(SEQ ID NO: 966) 

Name: glutathione S transferase 2 
Classification: enzyme 
Gene Symbol: Gst2 
FlyBase ID: FBgn0010226 

Celera Sequence No. : 142000013384668 

GCTTTATTTAGTTTTCGTTCTTTAAACTATGTATCTTTTAAGTGGGCAGTAAACAAGCTTCATCCACTAAAAGCCCAAACAAATATTTACTGCCT 
TTTTTTGCTACTATAATTGGTTTATTACTGAGCTGAGAACCTCTCCCAAAGAGATCTATTTAAATGCCGACTGAGAGATTTTAAGAACAGAGAGA 
GAGTTCCATTTTGTTTATGAAAAGCTAATTGCCAAATTGGGAAGCCAAAGCCTGCTGGAATATTGCATTCCATTTAAGAAAAGAGGGTAGGAAGA 
AGAAGAACTGGAAACGTGGCT AAAT AT GTT AAC TAATTCATTAACTTCGTTTCCTTTTCAACGCCCACTGGAACTAGAATC AGAG AGAGAG AGAG 
AGAGAGAGAGAGAGAGAGAGAGAGAGAGCCAAACTTAAAATTAAAACACTACCTTTTTCCAGTTAACCGGCCGGCTTACACACACAGTCAAACGC 
CTGAAGCTTGAAAGAGAGTGTTTGCTTGATGAGAGCGGCAAGCGTATAAAAGCAGCGTCGCCGGCAGAAGAAAGTCAGTTGCAGTGCAGTCGTCG 
TGACGATCGGTTCAGTGATAGCTTCCCAGTGGCTGGAGAGAGTGAGAGTAGAGAGTAACGGAAAGAGAGTGCGAATCGAACCGAACGCTATTCCT 
TCGCCCACGATTTAAGTGAACGCTTATTAGCGAATTGAAGACCCGAGAGCCCCAGTGCCCAGCTGATTTCGTATTCTTTGTTGAGGTAAGCCAAG 
GAATCCCTGGAACCACCTCATTTCGTAGCACCTTAGTGATGTAATTACGCACCAAGATTTACCCTTGCCAAAAAGTGGAAAACGCTTTTGCGAAC 
TTCGATGCCGAATGGTGTTCGTCTCGAATCGGTTCTAAAATTACACACAATGGTTTTTTGGCTTCGATTGATTTGATTTGATTTTTGGGTCAATC 
GATTCGAAATCGAAAAGTTTTGAACTCGTTTCGCAGAATCGCAGTTCGCGCGGATCTCTCGCCTCGCTCGCAAGCAAATATGTGAAGGACAACGA 
CGTTAACCGATCAACTAGAAGCCCTCTTCTTCTGCCTCTTCAGAACCGATCAGATCGGAATAGAAATATAGAAACGACCATAGCACCTCCGTCGA 
CTGTGGAGTGCGTGAACGTATTTCGAAGCGTGAGGTACTCGAACCGGAACCAACTTGAGTGTAGTGCATATTCGGGATCTTATCCAAACGCAGGC 
TTACATAACCCCTATTTTAATACCTTTTCAAAAAGGAATAATTGAAATTCAAAACAAAATTATTTAGCAACAAACAAACAATCAGTTATATTTTT 
GAACTGATATGCCTCGGGGCTAGGGGTCCCAGACTTTCAAATTCTAGTGATAACCCCCCCGATCGGCGAGTTCATCTGGAAATGTTGCCCATATG 
GGCCGGGTGCACGTGAATCTCTTCAGTGCCCATTGACGCACTTTGTGTGCCTCCACTGGGTCCTTGACCCGCTGTGATCCCTGCCCTTTCCCATG 
GATTGGATAATCTAGGTGTCATTATCTTCCCTAACCTTTTCCCACGATTAAAGTGAAACCATTTGCCACGATTTTAGTGCCGTAGTGCCAGATCA 

AAG AGC C AAAAT AG AT C AGAG AAT C GAAAT T T AAT T AAAT C ATG AC AC C GAAAC AC CG A G AC AG AC C AAATT AAAT T C AAT T G AAG CTGTCATGG 
CAACAAGTGCCAGCAAAAAAAAGGAAGAGCACTACGGCGAAACCAGAAGATAACACCGAAAAGTCAAATCGATTGTTGCAATTTGAGACTCTCCG 
AAAGCGAAAATTGCAGCGGAAGAGGCGGCCAATCGGCCATTACTAAACGGTGATGGGGTTGGGGAGGGGATCCGAAGTCTTGGTGACTGACTGGG 
CGCCCTGCCAAAATGATGGCAAAGGCAGAGACACCATGATTCCTCAGCCAAATCGAACGAAAACGGGTTTACGCTCTACATGGACCCCATTGCAC 
ACTGAATCCGTTTATCCAGCGAGGCAAAAAGTGCCGACGACAATGGAAACAGGGGCGTAGTATCCGTACCTCCTCCGTCCTACGATTCTTCCAGC 
AGCGCCAACGCCCCAAAAGATATTGATTCCGACTTTTGCGCGATTCAGCGAAAAACATAAGGGGACCAAAAGTGGGCAGACGGGCGGAGGAGTGG 
TGTCTGTTTCTTTGATATGCGTGGGTGTGTTGTCTGTGCGGCAATCGATGCACTTCCGGTGGGTTCTATGGGGAAATACTCTTGCGTTTCAGTAT 
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ATATCTACTATATATATTGCAATTTTATGGTAATTATATTTTCTGTCAACTTTTGCCTTGACTCCGCTTGATTTATTGCATTGTTCTTGAAGAAT 
AGCCTCATAAGGCATGCAAGCAAACATGGCTTTCGAACTAGCGCAAAATTAGCTAGTAGCTTATAGAGTATATGTAGCTTACCCCCAACGGTTTT 
TATATCATTTGTTTAAACCACTTAAGAATATTCGTAGCCTTGTGGACACACTTTATCGTGGATCGTGGAAGTTATACGACACCGTAATATATCCT 
TATCACTCACGCATATTCTGGCCCACTTCAGCTAGAAACGTTTGGCGCGTTCGCCTTATCATAAAAATCAGGCTATCGACTGATTAACTGAGTTT 
AATAAGCCAGCAAACGTAAACAAAAACTACGAAAGTTCTGGAAAACTAACCAAGGCTATTTCAAAACGGCGTGAAATGGCTAGCACTCTAAATTA 
TTGAAGTTTTCAACATATGTATATCTAATCTGATAGCCAAGAAAACATAATGGTTACCAACTTTCCATATACACCGTAATTTATTATTGATGAAT 
ATCAAATTATCGATAGTACTTCCCCGATAAGATTCTGTTAACACACATTGCTCTACTTTTCTATCAGGGAATTGCATTGAGAAAGCGCAGAGATG 
GCCGATGAAGCACAAGCACCACCAGCTGAGGGAGCACCACCAGCCGAGGGAGAGGCGCCACCACCCGCCGAGGGAGCCGAGGGTGCCGTCGAGGG 
CGGAGAGGCCGCTCCACCAGCAGAGCCCGCCGAGCCCATCAAGCACAGCTACACGCTCTTCTACTTCAACGTGAAGGCGTTGGCCGAGCCCCTGC 
GCTACCTGTTCGCCTACGGCAACCAGGAGTACGAGGATGTGCGCGTCACCCGCGACGAGTGGCCAGCCCTGAAGCCCAGTGAGTGCCATAGTCCA 
TCATTGAGTGGTCAGTGGACTAACCATAATCTACTCTGTCTCATCCAATAGCCATGCCCATGGGACAGATGCCCGTGCTGGAGGTGGACGGCAAG 
CGTGTGCACCAGAGCATTTCGATGGCTCGATTCCTGGCCAAGACTGTCGGCCTGTGCGGCGCTACTCCGTGGGAGGATCTGCAGATCGACATTGT 
CGTTGACACCATCAATGACTTCCGTCTAAGTAGCGAACAATGTGTGCCTCAGTTTCATGTTCAGTGTCCTTGATTAACTCTGGAATTTTTAATGT 
ATAGAAATTGCAGTCGTCTCGTACGAGCCGGAGGACGAGATTAAAGAGAAGAAGTTGGTCACCCTGAATGCGGAGGTCATTCCATTCTACCTGGA 
GAAGCTCGAGCAGACCGTCAAGGACAACGATGGTCACCTGGCTCTGGGCAAGGTGCGTGCTCAAATGAGAACTTGAAACCATACACTCGTTAATT 
TTGAATGAACAATACTAAGAACCATCGATTCCTTTCACCCACAGCTGACCTGGGCCGACGTCTACTTCGCAGGCATCACCGACTACATGAACTAC 
ATGGTCAAGCGCGATCTGTTGGAACCCTACCCAGCTCTGCGCGGAGTCGTGGATGCCGTCAACGCTTTGGAACCGATCAAGGCCTGGATCGAGAA 
GCGCCCCGTCACCGAGGTCTAAGTTCACCGAACAGGACTTGGGAGGATGGGAAAGCATTGCACAACCACTCAGACACAACCGCACCGCAGAAATG 

C AC AC C AC C C AC AT C T GC AC AC AT T TTGTTCCTTTT T GC GGAGG AT C CG AG AGG AC G GC G GAT T T T C GC AC GGGGT C AT AAACGC GT AGG AGT TT 
AGCACGTATCACTATGGATAAATCGTATTTGTTCTTAATTTCTTGGTTATAATTCTCTTTGCATGGCGCTATATACATATTTCTATTGGGGTCCG 
GCAGCGGATCGCCGTTCGTTAGCTTCTTCTGAAATGATGAAAGATAATACTTATTGCACAGTTTATAATATTTTGGCTTACTTTTAGAACATACA 
TTTAGCGCACCTATGCAGACATTTGCTGTACCTATATATATATATATATGCATAAAATACAAAAGCTACGAGTGGTTCATCTTATAGGCGTATTC 
TATTGCAATTTATTCAATGCGGAGGGCTCAATAAATATTTAAAACAACACCAGCTAAGGCTGAGAAACAATTACAAATTCTTATTTTGTTTTTTA 
TACAACACGCCATCGCGGCCCCAGGAATTTTCTTCGTCTCATCCAGTTCCCTTTTAAGTAGCCATTAAAAAAAGACTCACATCAATAGAACCGGT 
T T CT CC AAAAAAAAAAAAAAC AGAC T C T T AC T GC C AAAT T T C AAAT GC AC AT AAT C AC AT T AT T AT AT AC AT AT TAT T TAT T C AAAT T T GT AAT T 
GTATTTTTT GAAT AT T T T C AC AAAAC AC T T T T T AAT AAAC AT AAC ACC T AAAAAGG AAT G AGC ACT T T T T AAAAGC AC G AC AAC T GGT T T TAT T T 
AATTTTTGTACGATGTATTGGAATAATTTATTAATTGTTGACATATTTAAGAAATTTAAAGAAAGTGTTTATTTACATTATTAATTTTTATTTTT 

ATT ATC C TT T C AAT T T GC AT T TCC AAAGC CGACC C G AC AGAC AT AAAT AAAT GC AT GG AGT T GT GT T C AAC AT T GT T T TAT AAAC GT T T AAAAAA 
AT AAC AT TT C AAAT T AAC AT T CT T T T AAGT C ACT T TGG AC AAT T C AAGC T T AAAAG T AT AAAT GC AC AAT T GGT CAGTT T T GAT T CAT TAT T T C A 
TATTGTATGTTGCGGTTACTGGTGCTTCCAGTTGGGTGGGCGCTTCTCGAAGAAGCTAGCAATGCCCTCTTTGGTGTCGCCCAGCTGGAAATTTT 
CGCACATCTTCTCCTGCGCAGCCGAAAAGGCTTCCGCTTGGGACATGGCCAGTTGCTTGTAGTAGAACTCTTTTCCCAGAGAGATGACGGCACGA 
CTTTTTGCCTTTATGGCATTGGTGATTTCCTCGATCTCCTTATCTAGTTCCTCTGCAGGAACAGCTTTGGTGACCATTCCCGATATGTAGGCTTC 
TTCACCAGTAACGGGAAGACCGGTCATTAGCATATAGGCGGACTTTGGACGCGACATAATCCGGGCAACTGCGACTCCGGGTGTTGAGCAAAAAA 
CTCCGACTCCAGCCCTAAAAATATAAATTATAACATGATTAACTGGTTTCATAGAGGGTTTTCATCCTTACCCTGGAGTAGAGAACTTG 

(SEQ ID NO: 967) 

Exon: 1001.. 1174 
Exon: 2918.. 3213 
Exon: 3282.. 3449 
Exon: 3520.. 3662 
Exon: 3750.. 4504 
Start ATG: 2943 

CGGATCTC^ 

CAGATCGGAATAGAAATATAGAAACGACCATAGCACCTCCGTCGACTGTGGAGTGCGTGAACGTATTTCGAAGCGTGAGGGAATTGCATTGAGAA 
AGCGCAGAGATGGCCGATGAAGCACAAGCACCACCAGCTGAGGGAGCACCACCAGCCGAGGGAGAGGCGCCACCACCCGCCGAGGGAGCCGAGGG 
TGCCGTCGAGGGCGGAGAGGCCGCTCCACCAGCAGAGCCCGCCGAGCCCATCAAGCACAGCTACACGCTCTTCTACTTCAACGTGAAGGCGTTGG 
CCGAGCCCCTGCGCTACCTGTTCGCCTACGGCAACCAGGAGTACGAGGATGTGCGCGTCACCCGCGACGAGTGGCCAGCCCTGAAGCCCACCATG 
CCCATGGGACAGATGCCCGTGCTGGAGGTGGACGGCAAGCGTGTGCACCAGAGCATTTCGATGGCTCGATTCCTGGCCAAGACTGTCGGCCTGTG 
CGGCGCTACTCCGTGGGAGGATCTGCAGATCGACATTGTCGTTGACACCATCAATGACTTCCGTCTAAAAATTGCAGTCGTCTCGTACGAGCCGG 
AGGACGAGATTAAAGAGAAGAAGTTGGTCACCCTGAATGCGGAGGTCATTCCATTCTACCTGGAGAAGCTCGAGCAGACCGTCAAGGACAACGAT 
GGTCACCTGGCTCTGGGCAAGCTGACCTGGGCCGACGTCTACTTCGCAGGCATCACCGACTACATGAACTACATGGTCAAGCGCGATCTGTTGGA 
ACCCTACCCAGCTCTGCGCGGAGTCGTGGATGCCGTCAACGCTTTGGAACCGATCAAGGCCTGGATCGAGAAGCGCCCCGTCACCGAGGTCTAAG 
TTCACCGAACAGGACTTGGGAGGATGGGAAAGCATTGCACAACCACTCAGACACAACCGCACCGCAGAAATGCACACCACCCACATCTGCACACA 
TTTTGTTCCTTTTTGCGGAGGATCCGAGAGGACGGCGGATTTTCGCACGGGGTCATAAACGCGTAGGAGTTTAGCACGTATCACTATGGATAAAT 
CGTATTTGTTCTTAATTTCTTGGTTATAATTCTCTTTGCATGGCGCTATATACATATTTCTATTGGGGTCCGGCAGCGGATCGCCGTTCGTTAGC 
TTCTTCTGAAATGATGAAAGATAATACTTATTGCACAGTTTATAATATTTTGGCTTACTTTTAGAACATACATTTAGCGCACCTATGCAGACATT 
TGCTGTACCTATATATATATATATATGCATAAAATACAAAAGCTACGAGTGGTTCATCTTATAGGCGTATTCTATTGCAATTTATTCAATGCGGA 
GGGCTCAATAAATATTTAAAACAACACCAGCTAAGGCTGAGAAACAATTACAAATTCTTATTTTGTTTTTTATACAACACGCCATCGCGGCCCCA 

GGAATTTTCTTCGTCT 
(SEQ ID NO: 968) 

Start ATG: 200 

MADEAQAPPAEGAPPAEGEAPPPAEGAEGAVEGGEAAPPAEPAEPIKHSYTLFYFNVKALAEPLRyLFAYGNQEYEDVRVTRDEVJPALKPTMPMG 
QMPVLEVDGKRVHQSISMARFLAKTVGLCGATPWEDLQIDIVVDTINDFRLKIAVVSYEPEDEIKEKKLVTLNAEVIPFYLEKLEQTVKDNDGHL 
ALGKLTWADVYFAGITDYMNYMVKRDLLEPYPALRGVVDAVNALEPIKAWIEKRPVTEV* 
(SEQ ID NO: 969) 

Name: Glutathione S transferase 2 
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Classification: enzyme 
Gene Symbol: Gst2 
FlyBase ID: FBgn0010226 



Celera Seauence No. : 142000013384668 

TCACCAGTAACGGGAAGACCGGTCATTAGCATATAGGCGGACTTTGGACGCGACATAATCCGGGCAACTGCGACTCCGGGTGTTGAGCAAAAAAC 

T C C G AC T C C AGC C CT AAAAAT AT AAAT T AT AAC ATG AT T AAC T GG T T T C AT AG AG GGT T TT C AT C C T T AC C C T G GAGT AG AG AAC T TGCTGTTCT 

TGGTGCAGACTACCATGTCACATGACACCACCAGTTGGCAGCCTGCAGCAGCCGCATAGCCATTGACTTTTCCTAGTACGGGCACCGGCAGTCTT 

TGGATGTCGTTTATAACATCGGTTAGCTTCTGGAACACACAAGCCTGTATCTTGGGGTCATTGTGCAACTCCTTGAGATTGTGACCTGCCGACCA 

GATCTTTCCTTGGGCCGTCAGGACCACGCATCTCAGATCCAAGTTGTCTTTATCTTTTAGCAGGGCGTCCTGAAGGGCACACATCATGTCCAGTG 

ATAGGGAATTCAGAGTTTTTGGATGATTCAGCGTTATCTCTCGCACTCCGTTATGCTCCTTGACCAGGACCAAATCACTGGGTCCATTGCTGGTA 

AAGCGAATGACCTGTTGCACGCAACCTGTTGCATATTTTGATATTTTTAAGGAGTTGTTTAAAATACGCAACATTTTTATTTTGATTTGGCAGCA 

AGTGTTGGTAAATTCGCCGCCCAAACAGCTGGTTATCTATCGAAACAGCTGCAAGTCGATAGCAAGGCGCAACAGTTGTCAAACGGTCACTCTAA 

GCCGCAATGAGTTTGTACGATTAAAAGTTTATGTCTATTCGCGTTTTTCGAAGCTTTCCCGATTCCCGTAGCTGTCCCACTGTACAGGTGAGTAA 

CCGCTCAGGTGCGTCAACAACCGCTGCCGTTCTTTGTCCATCAGCTGAGCGGTCAGACAGAACAACAACAAAGCCAGAAGCTCGCGCGCCAGCCT 

GTGTACGTAGTAAATCTATGTGACTACTACAAAAGCAGAACTTTATCAGCCACCAAAGCCATTTGTTATTTGTCGCCCCACATTCGCGGCCCCGC 

CAATCAGTTAACAAATCGAACCACGCGAGTGCGGATCCCCCGACTTGAGCTCCACTAAAACGTACACCATTAAATTACCACATTCCAGCTTGCCA 

CACGATGCGTCCGTTCTCCGGCAGCGATTGCCTTAAGCCCGTCACCGAGGGCATCAACCGGGCGTTCGGCGCCAAGGAGCCCTGGCAGGTGGCCA 

CCATCACGGCCACCACGGTGCTGGGAGGCGTCTGGCTCTGGACTGTGATCTGCCAGGATGAAAGTGAGTTGGTGGTGTCTATGTGTGTGATAGAT 

C T C C TG G AAT T AGT T CT AAAC AAC AAT GC CT T T GC T GG AT TTTCTTCTT GC AC AGT T T AT AT GC AAT C T GCAAG AAAAT AT G AC AAAAT T C GAC T 

T AAACGAAAGT T GG ACT T AT AT AAAAC AGTG T C AAGT C AT C AC AGT C AG C C GAT TTCATTTATTTCGTGCTT GC AGGG C AG GGC AC G T C AAT T AC 

CTAAATCCATCAAACTATCTTATCTGATGGGTGCGTTGCTTATAAACATAGCTAGACTTCTTCTCTTACAATTTTCCTAAAAAACGCTAATATAA 

AACGTATCGTACAAAGAGAATTCACCTAAGAATTCACCACTTTTATAATATTGTATTAAAATAAGCACGCTTTACTGTGTACTTTGTGCCCATAG 

CTTATATAAGCCATGCACTCGTCTGGTGCCCTCCCCGAACTTCTGAATTTAATCTAGTCACAGATTACGATAGATTAGAGTTTAAATGTTTGCTA 

AAGCGTGTAGCAACTTAAAATCAATCAAATATGCAAAGTTCAACAGCGCGAAGAAAATTGATAAGATTGAGATATCCTGCCCTTGCTAGCACATA 

GTAATAGTTTAGGGGCTTAGTACCAAAACTAATGAGATTAGGTAGGTAGTTTCCCCGATACCATGACTAATGCTTTAAATTAGGAATGATGAAAA 

AATAATGTTTCTAGCATAGCTTCTCGATAAAGTGCCATAAAGCCTGTCCAAAGTTTCGTCCACAGCAAGGCATGTCATGTCCGATAAGATCGAGC 

GGGTATCATACACACTTCGTTTATCAGGCTGCAATTTTTTTCAATGATTCTTTGTTTTGGGTCTGTTAACCTCATGAATATAAAACTTTTAACGA 

CTTCAGTCTGATGAAAACAAAGGACAGTGAAAGGTTTATCTATAGTAATATAATAATAACTGTACGCAAACTTATTCCATAATTTTCTCTCCCAA 

GATCTTTACATTCGTGGCAAGCGTCAGTTCTTTAAGTTTGCCAAGAAGATTCCAGCCGTGCGTCGTCAGGTGGAGACTGAATTGGCCAAGGCCAA 

AAACGACTTCGAGACGGAAATCAAAAAGAGCAACGCCCACCTTACCTACTCGGAAACTCTGCCCGAGAAGGGACTCAGCAAGGAGGAGATCCTCC 

GACTGGTGGATGAGCACCTGAAGACTGGTCACTACAACTGGCGTGATGGTCGTGTATCTGGCGCGGTCTACGGCTACAAGCCTGATCTGGTGGAG 

CTCGTCACTGAAGTGTACGGCAAGGCCTCCTACACCAATCCCTTGCACGCAGATCTTTTCCCGGGAGTTTGCAAAATGGAGGCGGAGGTAGTGCG 

CATGGCATGCAACCTGTTCCATGGAAACTCAGCCAGCTGTGGAACCATGACCACCGGCGGCACCGAATCCATTGTAATGGCCATGAAGGCGTACA 

GGGATTTCGCTAGAGAGTACAAGGGAATCACCAGGCCAAACATCGTGGTGCCTAAGACGGTCCACGCGGCCTTCGACAAGGGCGGTCAGTACTTT 

AATATCCACGTGCGATCCGTGGATGTAGATCCGGAGACCTACGAAGTGGACATTAAGAAGTTCAAACGTGCCATTAACAGGAACACGATTCTGGT 

ATGGGTTTACATAGCAAAATGTATTCTAAAACTAATCTTGAAACTTTAATCAGCTGGTTGGGTCTGCTCCGAACTTCCCCTATGGAACCATCGAT 

GACATCGAAGCTATCGCCGCTTTGGGCGTTAAGTACGACATTCCCGTGCACGTGGACGCCTGCCTGGGCAGCTTTGTGGTGGCCTTGGTCCGCAA 

CGCCGGCTATAAGCTGCGTCCCTTCGACTTTGAGGTCAAGGGAGTGACCAGTATCTCCGCTGATACCCACAAGTATGGTTTCGCGCCCAAGGGAT 

CATCGGTGATCCTTTACTCGGACAAGAAGTACAAGGACCATCAGTTCACTGTGACTACTGACTGGCCTGGCGGCGTGTATGGTTCTCCCACAGTC 

AACGGTTCCCGTGCCGGAGGTATTATCGCCGCCTGCTGGGCTACCATGATGAGCTTTGGCTATGATGGTTATCTGGAAGCCACTAAGCGCATTGT 

GGATACGGCGCGCTATATCGAGAGGGGGTAAGTTTGTATCAAAGTTCGTTGTGACAATAATATAATTTTAAATCTATATTACCAGCGTTCGCGAC 

ATCGATGGCATCTTTATCTTTGGCAAGCCAGCTACTTCAGTGATTGCCCTGGGTTCCAATGTGTTTGACATTTTCCGGCTATCGGATTCGCTGTG 

CAAACTGGGCTGGAACCTGAATGCGCTGCAGTTTCCATCTGGGTAGGTGACTCAACTTTTAAATTAGCGGAACCATTCTTATAAGTGAAATTCCC 

TCTAAGTATCCACCTGTGCGTGACGGACATGCACACACAGCCCGGAGTCGCGGATAAATTCATTGCCGATGTGCGCAGCTGTACGGCGGAGATCA 

TGAAGGATCCCGGCCAGCCCGTCGTTGGAAAGATGGCTCTCTACGGCATGGCACAGAGCATACCCGACCGTTCGGTGATCGGAGAAGTGACTCGC 

CTATTCCTGCACTCCATGTACTACACTCCCAGCCAGAAATAGACACCTGGAGCAATCCCCGTTCTCTTCGCCCACCCCACGGAGCTAATGCATTT 

CCTGTGCTGTATTTAAACCACCAAAACACCCCGTCGTTAAACCTTCCTCAAGCAATTTATATTAGGATGCAATTAGTGCTGTAATCGAGGGTACA 

AAACGTCGTTCTACGCGAAAATCTATCTACCTATGTTCATCCCATTTGTCAACATTCGTCGCTCTAAGAGCCATGTTATTAAAGTGTTTTTCTGT 

GTAACTTGCTAGTGAAATAATAATATAATATTAATCAATTTTTGTGTACTATACATTTGTGAATGCTGTTAATGCTCGTTAATCTTGAAACTTCC 

AT T T C C T C T GC AAAAT AG AAT G AAAAC AT AT AC AT T T AGC T T AAT T G GAT T GT GGC C T C AC AAATT T T T AGC AT G GAT C T AT T T GT T AC CAT T T A 
ATTTATATTTGTTTACATTGTTTACCACTTCAGCAATTAACAATAATTACTGCTGCGTGGGTATATGCAATCCGGTCCGATCTGATCTGGATTGG 
TGTTTTTATTTCTTTTCGAACATTTTGCCACGCGTCGGCGTTTTTTAATTTAGTTGATCTTGGATGCCCAGCTGACACGCCCATTTCCTCTTTTG 
TAATTTATGGAAGTCTCGAATGGCGTGCACAAAAATTATTTCAAATACATTTCTATGGCGGAAACATTTATTTCAAAGGCAAGCGCACATTTTAA 
CAAGGCACACGTTCGTTGTCATTTTTCAAAGAAAAAATAAATATAAATAGCCAAAGCGAAGCAACAAGTGTCGCACAAAGGTGACGATCGATTTG 
TGGCATCCATATCAAAACCTGTCACCCTAACACACCTAATGCGCCCACGCAACTCTGCACTTCCTGCCCCTGTCCACCAGAAATCCGGGTGCACT 
AGCAACTCCAGACAGCCGGGCAGCCCAGAAAAATTCGGCTCAAACTGGGTTGCCAAAAAAGTGTGCATGAAATAAATTCACTTTTTTCCCCA 

(SEQ ID NO: 970) 

Exon: 1001.. 1298 
Exon: 2282.. 2943 
Exon: 2999. .3447 
Exon: 3506.. 3652 
Exon: 3712.. 3937 
Start ATG: 1145 

Transcript No. : CT25678 

CACCAAAGCCATTTGTTATTTGTCGCCCCACATTCGCGGCCCCGCCAATCAGTTAACAAATCGAACCACGCGAGTGCGGATCCCCCGACTTGAGC 
TCCACTAAAACGTACACCATTAAATTACCACATTCCAGCTTGCCACACGATGCGTCCGTTCTCCGGCAGCGATTGCCTTAAGCCCGTCACCGAGG 
GCATCAACCGGGCGTTCGGCGCCAAGGAGCCCTGGCAGGTGGCCACCATCACGGCCACCACGGTGCTGGGAGGCGTCTGGCTCTGGACTGTGATC 
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TGCCAGGATGAAAATCTTTACATTCGTGGCAAGCGTCAGTTCTTTAAGTTTGCCAAGAAGATTCCAGCCGTGCGTCGTCAGGTGGAGACTGAATT 

GGCCAAGGCCAAAAACGACTTCGAGACGGAAATCAAAAAGAGCAACGCCCACCTTACCTACTCGGAAACTCTGCCCGAGAAGGGACTCAGCAAGG 

AGGAGATCCTCCGACTGGTGGATGAGCACCTGAAGACTGGTCACTACAACTGGCGTGATGGTCGTGTATCTGGCGCGGTCTACGGCTACAAGCCT 

GATCTGGTGGAGCTCGTCACTGAAGTGTACGGCAAGGCCTCCTACACCAATCCCTTGCACGCAGATCTTTTCCCGGGAGTTTGCAAAATGGAGGC 

GGAGGTAGTGCGCATGGCATGCAACCTGTTCCATGGAAACTCAGCCAGCTGTGGAACCATGACCACCGGCGGCACCGAATCCATTGTAATGGCCA 

TGAAGGCGTACAGGGATTTCGCTAGAGAGTACAAGGGAATCACCAGGCCAAACATCGTGGTGCCTAAGACGGTCCACGCGGCCTTCGACAAGGGC 

GGTCAGTACTTTAATATCCACGTGCGATCCGTGGATGTAGATCCGGAGACCTACGAAGTGGACATTAAGAAGTTCAAACGTGCCATTAACAGGAA 

CACGATTCTGCTGGTTGGGTCTGCTCCGAACTTCCCCTATGGAACCATCGATGACATCGAAGCTATCGCCGCTTTGGGCGTTAAGTACGACATTC 

CCGTGCACGTGGACGCCTGCCTGGGCAGCTTTGTGGTGGCCTTGGTCCGCAACGCCGGCTATAAGCTGCGTCCCTTCGACTTTGAGGTCAAGGGA 

GTGACCAGTATCTCCGCTGATACCCACAAGTATGGTTTCGCGCCCAAGGGATCATCGGTGATCCTTTACTCGGACAAGAAGTACAAGGACCATCA 

GTTCACTGTGACTACTGACTGGCCTGGCGGCGTGTATGGTTCTCCCACAGTCAACGGTTCCCGTGCCGGAGGTATTATCGCCGCCTGCTGGGCTA 

CCATGATGAGCTTTGGCTATGATGGTTATCTGGAAGCCACTAAGCGCATTGTGGATACGGCGCGCTATATCGAGAGGGGCGTTCGCGACATCGAT 

GGCATCTTTATCTTTGGCAAGCCAGCTACTTCAGTGATTGCCCTGGGTTCCAATGTGTTTGACATTTTCCGGCTATCGGATTCGCTGTGCAAACT 

GGGCTGGAACCTGAATGCGCTGCAGTTTCCATCTGGTATCCACCTGTGCGTGACGGACATGCACACACAGCCCGGAGTCGCGGATAAATTCATTG 

CCGATGTGCGCAGCTGTACGGCGGAGATCATGAAGGATCCCGGCCAGCCCGTCGTTGGAAAGATGGCTCTCTACGGCATGGCACAGAGCATACCC 

GACCGTTCGGTGATCGGAGAAGTGACTCGCCTATTCCTGCACTCCATGTACTACACTCCCAGCCAGAAATAG 

(SEQ ID NO: 971) 

Start ATG: 145 

MRPFSGSDCLKPVTEGINRAFGAKEPWQVATITATTVLGGVWLWTVICQDENLYIRGKRQFFKFAKKIPAVRRQVETELAKAKNDFETEIKKSNA 
HLTYSETLPEKGLSKEEILRLVDEHLKTGHyNWRDGRVSGAVYGYKPDLVELVTEVYGKASYTNPLHADLFPGVCKMEAEWRMACNLFHGNSAS 
CGTMTTGGTESIVJ^KAYRDFAREYKGITRPNIVVPKTVHAAFDKGGQYFNIHVRSVDVDPETYEVDIKKFKRAINRNTILLVGSAPNFPYGTI 
DDIEAIAALGVKYDIPVHVDACLGSFVVALVRNAGYKLRPFDFEVKGVTSISADTHKYGFAPKGSSVILYSDKKYKDHQFTVTTDWPGGVYGSPT 
VNGSRAGGIIAACWATMMSFGYDGYLEATKRIVDTARYIERGVRDIDGIFIFGKPATSVIALGSNVFDIFRLSDSLCKLGWNLNALQFPSGIHLC 
VTDMHTQPGVADKFIADVRSCTAEIMKDPGQPWGKMALYGMAQSIPDRSVIGEVTRLFLHSMYYTPSQK* 
(SEQ ID NO: 972) 

Name: sphingosine phosphate lyase-like 
Classification: enzyme 



Celera Sequence No. : 142000013384668 

TGGGCGCTTCTCGAAGAAGCTAGCAATGCCCTCTTTGGTGTCGCCCAGCTGGAAATTTTCGCACATCTTCTCCTGCGCAGCCGAAAAGGCTTCCG 
CTTGGGACATGGCCAGTTGCTTGTAGTAGAACTCTTTTCCCAGAGAGATGACGGCACGACTTTTTGCCTTTATGGCATTGGTGATTTCCTCGATC 
TCCTTATCTAGTTCCTCTGCAGGAACAGCTTTGGTGACCATTCCCGATATGTAGGCTTCTTCACCAGTAACGGGAAGACCGGTCATTAGCATATA 
GGCGGACTTTGGACGCGACATAATCCGGGCAACTGCGACTCCGGGTGTTGAGCAAAAAACTCCGACTCCAGCCCTAAAAATATAAATTATAACAT 
GATTAACTGGTTTCATAGAGGGTTTTCATCCTTACCCTGGAGTAGAGAACTTGCTGTTCTTGGTGCAGACTACCATGTCACATGACACCACCAGT 
TGGCAGCCTGCAGCAGCCGCATAGCCATTGACTTTTCCTAGTACGGGCACCGGCAGTCTTTGGATGTCGTTTATAACATCGGTTAGCTTCTGGAA 
CACACAAGCCTGTATCTTGGGGTCATTGTGCAACTCCTTGAGATTGTGACCTGCCGACCAGATCTTTCCTTGGGCCGTCAGGACCACGCATCTCA 
GATCCAAGTTGTCTTTATCTTTTAGCAGGGCGTCCTGAAGGGCACACATCATGTCCAGTGATAGGGAATTCAGAGTTTTTGGATGATTCAGCGTT 
ATCTCTCGCACTCCGTTATGCTCCTTGACCAGGACCAAATCACTGGGTCCATTGCTGGTAAAGCGAATGACCTGTTGCACGCAACCTGTTGCATA 
TTTTGATATTTTTAAGGAGTTGTTTAAAATACGCAACATTTTTATTTTGATTTGGCAGCAAGTGTTGGTAAATTCGCCGCCCAAACAGCTGGTTA 
TCTATCGAAACAGCTGCAAGTCGATAGCAAGGCGCAACAGTTGTCAAACGGTCACTCTAAGCCGCAATGAGTTTGTACGATTAAAAGTTTATGTC 
TATTCGCGTTTTTCGAAGCTTTCCCGATTCCCGTAGCTGTCCCACTGTACAGGTGAGTAACCGCTCAGGTGCGTCAACAACCGCTGCCGTTCTTT 
GTCCATCAGCTGAGCGGTCAGACAGAACAACAACAAAGCCAGAAGCTCGCGCGCCAGCCTGTGTACGTAGTAAATCTATGTGACTACTACAAAAG 
CAGAACTTTATCAGCCACCAAAGCCATTTGTTATTTGTCGCCCCACATTCGCGGCCCCGCCAATCAGTTAACAAATCGAACCACGCGAGTGCGGA 
TCCCCCGACTTGAGCTCCACTAAAACGTACACCATTAAATTACCACATTCCAGCTTGCCACACGATGCGTCCGTTCTCCGGCAGCGATTGCCTTA 
AGCCCGTCACCGAGGGCATCAACCGGGCGTTCGGCGCCAAGGAGCCCTGGCAGGTGGCCACCATCACGGCCACCACGGTGCTGGGAGGCGTCTGG 
CTCTGGACTGTGATCTGCCAGGATGAAAGTGAGTTGGTGGTGTCTATGTGTGTGATAGATCTCCTGGAATTAGTTCTAAACAACAATGCCTTTGC 
TGGATTTTCTTCTTGCACAGTTTATATGCAATCTGCAAGAAAATATGACAAAATTCGACTTAAACGAAAGTTGGACTTATATAAAACAGTGTCAA 
GTCATCACAGTCAGCCGATTTCATTTATTTCGTGCTTGCAGGGCAGGGCACGTCAATTACCTAAATCCATCAAACTATCTTATCTGATGGGTGCG 
TTGCTTATAAACATAGCTAGACTTCTTCTCTTACAATTTTCCTAAAAAACGCTAATATAAAACGTATCGTACAAAGAGAATTCACCTAAGAATTC 
ACCACTTTTATAATATTGTATTAAAATAAGCACGCTTTACTGTGTACTTTGTGCCCATAGCTTATATAAGCCATGCACTCGTCTGGTGCCCTCCC 
CGAACTTCTGAATTTAATCTAGTCACAGATTACGATAGATTAGAGTTTAAATGTTTGCTAAAGCGTGTAGCAACTTAAAATCAATCAAATATGCA 
AAGTTCAACAGCGCGAAGAAAATTGATAAGATTGAGATATCCTGCCCTTGCTAGCACATAGTAATAGTTTAGGGGCTTAGTACCAAAACTAATGA 
GATTAGGTAGGTAGTTTCCCCGATACCATGACTAATGCTTTAAATTAGGAATGATGAAAAAATAATGTTTCTAGCATAGCTTCTCGATAAAGTGC 
CATAAAGCCTGTCCAAAGTTTCGTCCACAGCAAGGCATGTCATGTCCGATAAGATCGAGCGGGTATCATACACACTTCGTTTATCAGGCTGCAAT 
TTTTTTCAATGATTCTTTGTTTTGGGTCTGTTAACCTCATGAATATAAAACTTTTAACGACTTCAGTCTGATGAAAACAAAGGACAGTGAAAGGT 
TTATCTATAGTAATATAATAATAACTGTACGCAAACTTATTCCATAATTTTCTCTCCCAAGATCTTTACATTCGTGGCAAGCGTCAGTTCTTTAA 
GTTTGCCAAGAAGATTCCAGCCGTGCGTCGTCAGGTGGAGACTGAATTGGCCAAGGCCAAAAACGACTTCGAGACGGAAATCAAAAAGAGCAACG 
CCCACCTTACCTACTCGGAAACTCTGCCCGAGAAGGGACTCAGCAAGGAGGAGATCCTCCGACTGGTGGATGAGCACCTGAAGACTGGTCACTAC 
AACTGGCGTGATGGTCGTGTATCTGGCGCGGTCTACGGCTACAAGCCTGATCTGGTGGAGCTCGTCACTGAAGTGTACGGCAAGGCCTCCTACAC 
CAATCCCTTGCACGCAGATCTTTTCCCGGGAGTTTGCAAAATGGAGGCGGAGGTAGTGCGCATGGCATGCAACCTGTTCCATGGAAACTCAGCCA 
GCTGTGGAACCATGACCACCGGCGGCACCGAATCCATTGTAATGGCCATGAAGGCGTACAGGGATTTCGCTAGAGAGTACAAGGGAATCACCAGG 
CCAAACATCGTGGTGCCTAAGACGGTCCACGCGGCCTTCGACAAGGGCGGTCAGTACTTTAATATCCACGTGCGATCCGTGGATGTAGATCCGGA 
GACCTACGAAGTGGACATTAAGAAGTTCAAACGTGCCATTAACAGGAACACGATTCTGGTATGGGTTTACATAGCAAAATGTATTCTAAAACTAA 
TCTTGAAACTTTAATCAGCTGGTTGGGTCTGCTCCGAACTTCCCCTATGGAACCATCGATGACATCGAAGCTATCGCCGCTTTGGGCGTTAAGTA 
CGACATTCCCGTGCACGTGGACGCCTGCCTGGGCAGCTTTGTGGTGGCCTTGGTCCGCAACGCCGGCTATAAGCTGCGTCCCTTCGACTTTGAGG 
TCAAGGGAGTGACCAGTATCTCCGCTGATACCCACAAGTATGGTTTCGCGCCCAAGGGATCATCGGTGATCCTTTACTCGGACAAGAAGTACAAG 
GACCATCAGTTCACTGTGACTACTGACTGGCCTGGCGGCGTGTATGGTTCTCCCACAGTCAACGGTTCCCGTGCCGGAGGTATTATCGCCGCCTG 
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CTGGGCTACCATGATGAGCTTTGGCTATGATGGTTATCTGGAAGCCACTAAGCGCATTGTGGATACGGCGCGCTATATCGAGAGGGGGTAAGTTT 
GTATCAAAGTTCGTTGTGACAATAATATAATTTTAAATCTATATTACCAGCGTTCGCGACATCGATGGCATCTTTATCTTTGGCAAGCCAGCTAC 
TTCAGTGATTGCCCTGGGTTCCAATGTGTTTGACATTTTCCGGCTATCGGATTCGCTGTGCAAACTGGGCTGGAACCTGAATGCGCTGCAGTTTC 
CATCTGGGTAGGTGACTCAACTTTTAAATTAGCGGAACCATTCTTATAAGTGAAATTCCCTCTAAGTATCCACCTGTGCGTGACGGACATGCACA 
CACAGCCCGGAGTCGCGGATAAATTCATTGCCGATGTGCGCAGCTGTACGGCGGAGATCATGAAGGATCCCGGCCAGCCCGTCGTTGGAAAGATG 
GCTCTCTACGGCATGGCACAGAGCATACCCGACCGTTCGGTGATCGGAGAAGTGACTCGCCTATTCCTGCACTCCATGTACTACACTCCCAGCCA 
GAAATAGACACCTGGAGCAATCCCCGTTCTCTTCGCCCACCCCACGGAGCTAATGCATTTCCTGTGCTGTATTTAAACCACCAAAACACCCCGTC 
GTT^AACCTTCCTCAAGCAATTTATATTAGGATGCAATTAGTGCTGTAATCGAGGGTACAAAACGTCGTTCTACGCGAAAATCTATCTACCTATG 
TTCATCCCATTTGTCAACATTCGTCGCTCTAAGAGCCATGTTATTAAAGTGTTTTTCTGTGTAACTTGCTAGTGAAATAATAATATAATATTAAT 
CAATTTTTGTGTACTATACATTTGTGAATGCTGTTAATGCTCGTTAATCTTGAAACTTCCATTTCCTCTGCAAAATAGAATGAAAACATATACAT 
TTAGCTTAATTGGATTGTGGCCTCACAAATTTTTAGCATGGATCTATTTGTTACCATTTAATTTATATTTGTTTACATTGTTTACCACTTCAGCA 
ATTAACAATAATTACTGCTGCGTGGGTATATGCAATCCGGTCCGATCTGATCTGGATTGGTGTTTTTATTTCTTTTCGAACATTTTGCCACGCGT 
CGGCGTTTTTTAATTTAGTTGATCTTGGATGCCCAGCTGACACGCCCATTTCCTCTTTTGTAATTTATGGAAGTCTCGAATGGCGTGCACAAAAA 
TTATTTCAAATACATTTCTATGGCGGAAACATTTATTTCAAAGGCAAGCGCACATTTTAACAAGGCACACGTTCGTTGTCATTTTTCAAAGAAAA 
AATAAATATAAATAGCCAAAGCGAAGCAACAAGTGTCGCACAAAGGTGACGATCGATTTGTGGCATCCATATCAAAACCTGTCACCCTAACACAC 
CTAATGCGCCCACGCAACTCTGCACTTCCTGCCCCTGTCCACCAGAAATCCGGGTGCACTAGCAACTCCAGACAGCCGGGCAGCCCAGAAAAATT 
CGGCTCAAACTGGGTTGCCAAAAAAGTGTGCATGAAATAAATTCACTTTTTTCCCCA 
(SEQ ID NO: 973) 

Exon: 1001.. 1097 
Exon: 1384.. 1548 
Exon: 2532.. 3193 
Exon: 3249.-3697 
Exon: 3756.. 3902 
Exon: 3962.. 4187 
Start ATG: 1395 

GTCACTCTA7AGCCGCA^ 

AGC T T GC C AC AC G AT GC GT C C GT T CT C C GGC AGC G AT T GC CT T AAGCC C GT C AC C G AGGGC AT C AAC C GG GC GT TC GGC GC C AAG G AG CC C T GGC 
AGGTGGCCACCATCACGGCCACCACGGTGCTGGGAGGCGTCTGGCTCTGGACTGTGATCTGCCAGGATGAAAATCTTTACATTCGTGGCAAGCGT 
CAGTTCTTTAAGTTTGCCAAGAAGATTCCAGCCGTGCGTCGTCAGGTGGAGACTGAATTGGCCAAGGCCAAAAACGACTTCGAGACGGAAATCAA 

AAAG AG C AAC GC C C AC C T T AC C T ACT C GGAAACT C T GC CC G AG AAGG GAC T C AGC AAGG AGG AG AT CC T C C G AC T GGTGG AT G AGC AC C T G AAG A 
CTGGTCACTACAACTGGCGTGATGGTCGTGTATCTGGCGCGGTCTACGGCTACAAGCCTGATCTGGTGGAGCTCGTCACTGAAGTGTACGGCAAG 
GCCTCCTACACCAATCCCTTGCACGCAGATCTTTTCCCGGGAGTTTGCAAAATGGAGGCGGAGGTAGTGCGCATGGCATGCAACCTGTTCCATGG 
AAACTCAGCCAGCTGTGGAACCATGACCACCGGCGGCACCGAATCCATTGTAATGGCCATGAAGGCGTACAGGGATTTCGCTAGAGAGTACAAGG 
GAATCACCAGGCCAAACATCGTGGTGCCTAAGACGGTCCACGCGGCCTTCGACAAGGGCGGTCAGTACTTTAATATCCACGTGCGATCCGTGGAT 
GTAGATCCGGAGACCTACGAAGTGGACATTAAGAAGTTCAAACGTGCCATTAACAGGAACACGATTCTGCTGGTTGGGTCTGCTCCGAACTTCCC 
CTATGGAACCATCGATGACATCGAAGCTATCGCCGCTTTGGGCGTTAAGTACGACATTCCCGTGCACGTGGACGCCTGCCTGGGCAGCTTTGTGG 
TGGCCTTGGTCCGCAACGCCGGCTATAAGCTGCGTCCCTTCGACTTTGAGGTCAAGGGAGTGACCAGTATCTCCGCTGATACCCACAAGTATGGT 
TTCGCGCCCAAGGGATCATCGGTGATCCTTTACTCGGACAAGAAGTACAAGGACCATCAGTTCACTGTGACTACTGACTGGCCTGGCGGCGTGTA 
TGGTTCTCCCACAGTCAACGGTTCCCGTGCCGGAGGTATTATCGCCGCCTGCTGGGCTACCATGATGAGCTTTGGCTATGATGGTTATCTGGAAG 
CCACTAAGCGCATTGTGGATACGGCGCGCTATATCGAGAGGGGCGTTCGCGACATCGATGGCATCTTTATCTTTGGCAAGCCAGCTACTTCAGTG 
ATTGCCCTGGGTTCCAATGTGTTTGACATTTTCCGGCTATCGGATTCGCTGTGCAAACTGGGCTGGAACCTGAATGCGCTGCAGTTTCCATCTGG 
TATCCACCTGTGCGTGACGGACATGCACACACAGCCCGGAGTCGCGGATAAATTCATTGCCGATGTGCGCAGCTGTACGGCGGAGATCATGAAGG 
ATCCCGGCCAGCCCGTCGTTGGAAAGATGGCTCTCTACGGCATGGCACAGAGCATACCCGACCGTTCGGTGATCGGAGAAGTGACTCGCCTATTC 

CTGCACTCCATGTACTACACTCCCAGCCAGAAATAG 
(SEQ ID NO: 974) 

Start ATG: 109 

MRPFSGSDCLKPVTEGINRAFGAKEPWQVATITATTVLGGVWLWTVICQDENLYIRGKRQFFKFAKKIPAVRRQVETELAKAKNDFETEIKKSNA 
HLTYSETLPEKGLSKEEILRLVDEHLKTGHYNWRDGRVSGAVYGYKPDLVELVTEVYGKASYTNPLHADLFPGVCKMEAEVVRMACNLFHGNSAS 
CGTMTTGGTESIVMAMKAYRDFAREYKGITRPNIVVPKTVHAAFDKGGQYFNIHVRSVDVDPETYEVDIKKFKRAINRNTILLVGSAPNFPYGTI 
DDIEAIAALGVKYDIPVHVDACLGSFVVALVRNAGYKLRPFDFEVKGVTSISADTHKYGFAPKGSSVILYSDKKYKDHQFTVTTDWPGGVYGSPT 
VNGSRAGGIIAACWATMMSFGYDGYLEATKRIVDTARYIERGVRDIDGIFIFGKPATSVIALGSNVFDIFRLSDSLCKLGWNLNALQFPSGIHLC 
VTDMHTQPGVADKFIADVRSCTAEIMKDPGQPVVGKMALYGMAQSIPDRSVIGEVTRLFLHSMYYTPSQK* 
(SEQ ID NO: 975) 

Name: sphingosine phosphate lyase-like 
Classification: enzyme 

Celera Sequence No. : 142000013383850 ™ mnim ™ 
ATCATTATTAAACTGTTTGCTTTATTTTTGTTTCGAATCTGCGCAATTCGCAGATCCATTCCTTTTGTTTGTTGGTTTGTCAAGTTAAGATTTAA 

AAAAAAAAAACTCAAAATTACAATTGTTAGTTCTTCTTTCCCCGCCCTTCTCCCTCGTGTCTGCTTTCGCTTCGCTACGCTTATGTAACAATAAA 

ATTGTAGCTTTTGCCAAATAAATAATATATGCATATATAATAAAATATATATATATATATACGATCATATATCATAAAGACATTAATGTAGTGCA 

TACAAAAAAAAACTAAACTAAAACAAATCTATCTGCAATTTGAATCGGAGGTTAAAGAATATGGTGTGTTGTAAATATAAATTTTTATGCTTATA 

GTTTTAAAATAAAAAACATGAATTGTAATCGGTAAATAACTACAAAACAAGATATAAATTCTGCACCAATTAGCAGGATACCAAGCCGATGATCA 

GCAGATTAAACAAACATTTCGATCGCACTGTAAAAATAAATGTGGGAAGTGTGGTGGCTGCCAAACCTATAAAT7AATATATATACAAATAAATAT 

ATATATATAAAATTGTCGATTTCCTCGTCCCGAGAAGCGTGCATTCGGTTCGACTTGACTCGCATTTTGGTTGTCATCGTAATCATTTTATCATC 
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ATGATTTATCCTCCGTATTTATGTTAGGTGGATGTAAATATACACTCTTTCATATACTCTATTTATATATATTTGTTTATATATAATTTGTATAT 

AGTTTAATTAGCTTTTTATTGCAGTTGAGTATACTATGTTGACTTGTTTTTTGTTGGTGTTGTTGCTTTGCTGGTGTTGTTGATCCTATTTCGTT 

TGTTTATCCTTTTGTTTTGTTTTGTTTCTTTCGATTCGATTCGATACGAGGCTTTATCCATATTGCTTCTTGATCTTGATAGGTCGGCTGTTGGC 

TGTATTCTTCTTTTGTAGTTATCTTGAATGGTGTTGTATTTTATTTGTGGCTATTTCTCTGGCTTTTCCTCCGTCTTGCTTAATGCAATGCTTAT 

TTTAGTTTATTTTTGCTTTTAATTTACGCTTTGCTAATTATAAATTATGTTTTAGTTTTTGTGTTTCTTGTTGAAGTTCTTGTTGCAGCTGTTGG 

TTGATTTATCCTGATTTGTCCTGTGCCAAGTCCATGCAGTCCGCTCAGCCAGTTACATACTGCCACTATATAGTCGGTTGGGGACTGGAACTCGA 

TCCGAACACAAGTGAATCGCGGGCTGGTTGTGGTGGTGGGGAGAGATGCGTCGCCCAGAGGAGAGAAGTAAGAGGATGGAGCAGGATGCACGCGG 

TTTCTCAGTCATTGTCGTGGGCCACCGCCAGTCGTCGTCATCATCTAAGCCATTTCCAGATGCCCTCAATAAATAGGGTTGGATCCGGGCCTGCT 

AAACGAGGACAAGCTGCTGACGGCGGCCACAATCGACCAATCATCATCGCAATCATCATCAGCTGCGCCAGAAGAAGCTCTTTTTGAGCCTAACA 

GACAGCTTGCGATGTACGCGCAGCCTAACCTGCTCCTGATCCTAAGTATCGTCTGACATGAACGCGTTCTGAGTCTTCGACTCCTCGATACGGCG 

GATCTGGTGGCGTAAATACATAATTCTGCGAAAGGCAATACATAAAATGCATTGGTTAGAACAACTACCATTTAGATGTCATTGCAACACTCACT 

TTTGTTCCTGCAGGGTGGCATTGATTTCGGTGCTTCGCACCAGGCTGCGGGCGCACTTCAATTCCGCGCATATCCTCGTGCACGTTATGTCGCCA 

AACAGGTGCGTATCCTCACAGCACTTCTGCGCCTTCGAGAAACAATGGAAAACATTTACAATCAGTTCGAGTCCATGGATATATGTATACGTATG 

TACAGAGGTCGGGTTAGTGCCTTATTCGCAGGCAAATCCTACCTCACAGCTTACCTGACGCTCCAGCACCTCCGCGTGCCAATTGGTCACATGCG 

CGATCAGATCGGCGCGGAAGTATTTGGCCAGCTTTGGCGAAATCTCGGGACCACGTAGGTTTGCCGCCTGTCCTGAGGAACTCACGCAGCTCTGC 

AGTGAGCTCAGCGTGGCCGACGACGTCCCATCCACTGGAATAACAAATGGGCAAACATTGTCATATATTTTAAACTCGGAACACACTAGAATTTT 

GAATACATACATTTCCGCTGCTGCTGGGCCTGTTGCGCCGCCTGTAGCTGGAGGCTTGACGCTGTCGCGGCTGCCGCTGCGGCGGCCAACTGTTG 

CTGCTCCAACGCCGAGCCGCTCAAATCCAGCTCCTGGGTGGGCGTAGGCGGTCCTTCGCATAGCGTGCCGCTGTTGCTGCCCATCGAGGTGATCA 

CACTGATTCCGCAGACGGGGCCCATGCCGGGCACGATGCTTTGGCTGCTATTCGCTCCACTGGAGCCGGAGGAGGCGTTGCCACCGGGCGTGGCC 

GGTGAGGCATTCGCCACCATCACTCCGCCATCGAGGGTAGCCTTGAGGCCAAAGGGTGGATTGCAGGTGTAGGCGATGCTCATGTTGTTCGTGTG 

GCCCACAATCGTCGGTATGGGACTCTTGGTGAAGTTCAACGAGGGCGACATGCTGTGGTGCATATACAACGGCGAGTTTACGCTGTTGTCCCTGC 

AATATTGATATCCATGAAGTTTAGTTGATGATCTTGCAACATGCCTTCAATAGCTTACCGTAAGCTGCCGGAGACCGAGTTATTGACCGAATTGT 

AGGCGTGATTGGTCCCATCTGACACCGAACCAACCGAGTTAACGCCGTGATGTGATTGTACATCGAGGACGCCCAGGCCCAGGTGCATCTTCTGG 

CGCATCAGCAGCGCCTGATCCTTGCCCTTGGCACCCGGTGGCAGTGGAGCATTCGAGTTCAGATGATGCTGGTGCGGATCCTGACTTCCCAGTGT 

GGGCACAGTCGTCTGCAATGCATAATGCATATAAATAAGAATAAAGCCAAAGCTTAAGGCTATCAAACAAGCCAAACTTACGCCAGATGTGGAGC 

CAATGTGACCGACAACGGAGTTCCTCAGACTGCTGCTGCTGGCATTGCTGCTGCTGCCGCCGGCACTGTTGCTGCTGTTGCTGTTCAGCATTCCC 

GACATGGTGGGCAGCATTGTGGCGCCCGTCACCGGGCCACCGCCCGTTCCGCTGCCGTAGTGCAGTCCCCCCGTCGCACCTGTGCCCGTCGCCAC 

GGATGCGGAGGAGTTGGCAGTTGGGTGGGAGGACAAACGAGGCAGGGCATTGCCCATTGGCTGATCGCTGTTGCTACCGCCGCCGCCTCCGGGAC 

CGCCTCCGCCCGGACCGCCGCCGTGTGTGGTGGGCGTGCCGCTTAGCGAGTAACTGGCTTCCGAGGTAGGCGTCGAGTCGCCGCTTATATCCATG 

TCAGCATTGTCGTTGTGTCGACCTGTCGGTAGGATTTTAAAATCAATTAACTAGATCCTACGCAACCTGCTTTATGCTCACCATCCAAACGCCTT 

CGATGACTGGGCGGACCGCCATCGTTGTCGTAATTCCATCTCAGTCGTGAATTGTCCCGCGAAGAATTGGAATGTTTGTATGCTGCAAAGAAAAA 

CACATGTTTTGAATATCATATTTGAAATTTCTGAAGCAGGTTCTTAACTCACTTGATCTGGAGAAACGATCATCGCGGTCGCGATCGCGATCCTT 

GTCACGATAATCCTTCTCACGATGCTGATCGCGCGGCAAATTTCTGTAACCATATAACACATTAATGACTATTTAACCAAGGACTAGGGTGCTCT 

TATTACAGATTCTTCCCAAGATCTTTGCTGGTCATATTATAATGCCATTGCCTTTACCTTTCTCTGTCCACCCATTCCTTTGGCTTCTCCCACTG 

AGAGATTTCCGTCTTGCAGTTGTAATAATACATTTTGCCTGAAAGCAATCCAATATTATAGTTTGGAGCCTGGAG7U\GAATAAGCTGTCATGGGG 

GGACTCACCTGATGAGCTGACATGCTCGCTCCAGTCGCCATAACGCGCCGTCCGTTCGCCGCGATCGCTGCGCTCTCGTTTGTCTAACAAAAAAA 

AAAACAAAAGCCACAGAGTATGTTTAGTAAATAAGGGGAGATGTGGGGTAAATGGGCGCTTGGACCTACCTGCGTACTTGTCCTTCTTGTACATG 

TCCCTGTCGCGATCGGAACCGCGGTCCCGCTTGTCGCGCAAGTCGCGCAGATCTCGATCCCGATCTCGATCGCGATCGTCGGACCCGGATCGTTT 

GTCGTTCTCCCGCGAGGAGCAGAGCTTGCCGCTGCCGCCTCCGCTGCCTCCTCCGCCACGACGATCCCTATCATGGTTGGTGCGATACGACGACT 

CCGAATCGCGATCGTTGCCACGCCGATCACGTTTATCTATGAAGAATAGAATAATGTATTGTTATGGTATACAACTCCAACTTGCAGCTGCGCTA 

ACTCACCTGAATATTTATCCTTCTTGTAGACATCCCGATCTCGATCTCGCATCTTCTGCATAAAGCTGTACCGCTCGCCACTGCCGCCACGCCCA 

TTGCCGCCACCTCCGCCCCGATGATCGCTGCGGTCGCGCAGATCGTGCGAACGGGACGACGGCGAATCAATGTCCGGCGACTGGGAGCGGTAGCT 

GTTGCCATTGTTCAATGGTCCGCCGCCATTTCCGCTACCGCCGCCACCTCCTCCAGCGGATCCGCCGCCTCCTCCACCGCCAGCGCCTGGCGACA 

GGTCACGATCGCGATAATTGGAGGAACGATCGCGTTCGTAGTCGCGCTTTGAACTGCTGTACTTTGAGCTCTGCACGGGTAAATAAAAGCGCACA 

CAGTTTTTAGTCGATTTACATTCCCAAAATCAATTGCACAGTGCTACAAAATACAATGTTTGTCGAGTACTCTCTAAGAAATCGCAGTAAGGATT 

CAAGATGTGGTCAATATAATCAGTACATCAAGTAATGTAGCCAATATATGAGTTTTTCGTGTTCCAACAAATGGGATTAATTTAGGAAATCCTTA 

CTGAATCTCTTGGATAGTACCAACCCACTGGTATATTTAGTATTGCATTCTTAATAGCGGTTGATTTAAGTCTCGGTTTAGTTGTAACTTTTGAA 

GTTGGGTGGTTGGTTAGTTGAACTAAATCTACTCAATAGGAAAATCATAAGTTAATGCATAATGCCAGCTACTTTTGCCAATTTTCTCGCTTGTG 

ATTATTGCCAATGGAAACGAAAGGATACAAAAAGAAAGTTGGGTGAAATGAAACTCCCGATGGTTTGAGAAATCAAATGTGTAATCGAGAAGAAT 

TTTGAAATGTGTCGTGTGTTTGGTGTGTGTTTTGGTTTATCGTCTCCTGAGTAAATTATATATTAATTGAAACTAATTATTTAGGATTTTTTTAA 

GTGATACTTTCGTTTTTGGTTCATATTGGTCGTTGTCTTCAGGTTATCCAAACTGAAAATCTAATTGTCCAGTTTAATATTAATCAAAATTAAAA 

TCCTTGTTTGTTGTCCATCGTAAATTTTGGGATCATCCATAAAATTCCAATCTTGTGTTTATCAAAAAATTTTATAATAATTAATAAAGAATCCT 

TCATAATTGAGTGGTGTTTCATTGTTTAAGAGATATAGAATAATAACTCTTTGGGTGTTTGGTATGTAAATCATAATTTGTTGATGTTTTTTATT 

AGCGGCGATTTGTGTGTCGTTTTAGTCAGATTTTTGTTTAAAATATACGAGTTTGGATGGAAGATTTATTCCTGCTGCATGTTTCATGTTCTGTT 

GCCTTCTTTTGATTTTTATTATTCATTATTAAGTAATTTTATTTGTTCCTTAATTTGAATGTTTTTTGTTTATTCCTTATCAGGATTGCTTTTCC 

TGCGCACAGCACGATGTTCGTGGCACTTTATTGGTTTTTGATCACTTTATTTGCGTTTTTATATTTTACTATTGTTTTTAACCATTATTTTCTGG 

GATCTTTGGGTTACTTACTTCTAGGCGACAGAGACCCACTGCTGAAGGCGTTGAATTCGCTGCCAGCGTCGCTGCTGATGCGCTGTTTTTGTTAT 

TGTTGTTATTGTGGTTGCTAATGTTGCTGTTGCAATTGGCAATTGTTGTTGTCGAAAGGGAGAGATCAACAGCCATGTTGTTGCTGTTGTTGTTG 

CTGCTGCCGTAGGAGTTTTTGTTGCTCCTCATAGATAATTAATTGAAGTGTATTTGGTTTTAATTTGAATTAAAAGTCCAATTGGTCGCTTGTTT 

TCGAGGGCACACTTTATTTGGCTTTGGTTTATTTCCTATTTGTTTTGCATTAGTTATTTGTTGGTGTCTTAATTGATTGTTGTTGCTTTATGTAT 

GAGGCACATTTGTGTTTTTGTTGTTGTCGTTGATTATTTCGTTAGACAATTTGGAAATTGGTTTGTTCTTCCGTTCCGCACATTCATGTTATTTT 

TAATATTGGTGTATAATTGTTTACTACGTCGTTTTTAAGGCTATTTTAATTTGATTAGGTTCCTATTCCTTAGGCAGAGTTTAGAAATTGGTTAT 

GCATTCGATTTCTAATTAATATAGTTTGATAAATTTGTATACTTTTAACCATATATATCATTCGATTTAAAACATTTGTTGGTGCTGTTCGGTTA 

GATGTTGTAATTGATTAAATGATCGAATTCGGATTGGGTTTGTTTTTTGTTTGTTTTTTTTTTTTAGCAGTGATGCGTACAGATTCGTTTCTCAA 

GTTAAGATCAGTTCAGGCTGCAATGCCGCGTTTTTGAATGGGTTAAAATGAAACACGATTCGGTCAGTTCAAAAATGGATGAGGTTACGGGAACG 

T AAG AT CT G GG AT GC AC AAC AAAG AC AGCG ATT G AC AAAG AAGG AG AAAT ATGAAC C AAAAA T AAT C C AAAA T CCA GG C T AT AC AAT T TGC AAAT 

TTTGTTAATTTTTCTCTTTACCAAAATATCAGCAAATGACACTAGGGAAATCGACGGTTCGATAAAAATACGATTTACATTGGCTTTTACATTCT 

TTTTTTTGTGGCTTTGGGTGCACTACGGGCTAATGAGCTAAACTACAAAATTAAAAATGTCCACTAAAAGCTGGCATGTAAATGTGATTAATTGC 

GGTGTTCGAGGAGAATTTTCGGATCGGTCTGCTTACCTGGTATGATGTGTGACTGGTATGCTTCTCAAAGTACCTGGAAAAC 

(SEQ ID NO: 976) 
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Exon: 6207.. 6004 
Exon: 4820.. 4472 
Exon: 4406.. 4155 
Exon: 4073. .3999 
Exon: 3933.-3858 
Exon: 3748.. 3663 
Exon: 3597.. 3502 
Exon: 3442. .3027 
Exon: 2957. .2719 
Exon: 2656.. 2196 
Exon: 2124.. 1955 
Exon: 1837.. 1710 
Exon: 1640.. 1001 

Start ATG: 6207 (Reverse strand: CAT) 
Transcript No. : CT25711 

ATGAGGAGCAACAAAAACTCCTACGGCAGCAGCAACAACAACAGCAACAACATGGCTGTTGATCTCTCCCTTTCGACAACAACAATTGCCAATTG 

C AAC AGCAAC AT T A GC AA CCA C AA T AA CAAC AAT AAC AAAAAC AGC GC AT C AGC AGC G AC GC T GGC AGCG AAT T CAACGC CT T C AG C AG TGGGT C 

TCTGTCGCCTAGAAAGCTCAAAGTACAGCAGTTCAAAGCGCGACTACGAACGCGATCGTTCCTCCAATTATCGCGATCGTGACCTGTCGCCAGGC 

GCTGGCGGTGGAGGAGGC GGCGGATCCGCTGGAGGAGGTGGCGGCGGTAGCGGAAATGGCGGCGGACCATTGAACAATGGCAACAGCTACCGCTC 

CCAGTCGCCGGACATTGATTCGCCGTCGTCCCGTTCGCACGATCTGCGCGACCGCAGCGATCATCGGGGCGGAGGTGGCGGCAATGGGCGTGGCG 

GCAGTGGCGAGCGGTACAGCTTTATGCAGAAGATGCGAGATCGAGATCGGGATGTCTACAAGAAGGATAAATATTCAGATAAACGTGATCGGCGT 

GGCAACGATCGCGATTCGGAGTCGTCGTATCGCACCAACCATGATAGGGATCGTCGTGGCGGAGGAGGCAGCGGAGGCGGCAGCGGCAAGCTCTG 

CTCCTCGCGGGAGAACGACAAACGATCCGGGTCCGACGATCGCGATCGAGATCGGGATCGAGATCTGCGCGACTTGCGCGACAAGCGGGACCGCG 

GTTCCGATCGCGACAGGGACATGTACAAGAAGGACAAGTACGCAGACAAACGAGAGCGCAGCGATCGCGGCGAACGGACGGCGCGTTATGGCGAC 

TGGAGCGAGCATGTCAGCTCATCAGGCAAAATGTATTATTACAACTGCAAGACGGAAATCTCTCAGTGGGAGAAGCCAAAGGAATGGGTGGACAG 

AGAAAGAAATTTGCCGCGCGATCAGCATCGTGAGAAGGATTATCGTGACAAGGATCGCGATCGCGACCGCGATGATCGTTTCTCCAGATCAACAT 

ACAAACATTCCAATTCTTCGCGGGACAATTCACGACTGAGATGGAATTACGACAACGATGGCGGTCCGCCCAGTCATCGAAGGCGTTTGGATGGT 

CGACACAACGACAATGCTGACATGGATATAAGCGGCGACTCGACGCCTACCTCGGAAGCCAGTTACTCGCTAAGCGGCACGCCCACCACACACGG 

CGGCGGTCCGGGCGGAGGCGGTCCCGGAGGCGGCGGCGGTAGCAACAGCGATCAGCCAATGGGCAATGCCCTGCCTCGTTTGTCCTCCCACCCAA 

CTGCCAACTCCTCCGCATCCGTGGCGACGGGCACAGGTGCGACGGGGGGACTGCACTACGGCAGCGGAACGGGCGGTGGCCCGGTGACGGGCGCC 

ACAATGCTGCCCACCATGTCGGGAATGCTGAACAGCAACAGCAGCAACAGTGCCGGCGGCAGCAGCAGCAATGCCAGCAGCAGCAGTCTGAGGAA 

CTCCGTTGTCGGTCACATTGGCTCCACATCTGGCACGACTGTGCCCACACTGGGAAGTCAGGATCCGCACCAGCATCATCTGAACTCGAATGCTC 

CACTGCCACCGGGTGCCAAGGGCAAGGATCAGGCGCTGCTGATGCGCCAGAAGATGCACCTGGGCCTGGGCGTCCTCGATGTACAATCACATCAC 

GGCGTTAACTCGGTTGGTTCGGTGTCAGATGGGACCAATCACGCCTACAATTCGGTCAATAACTCGGTCTCCGGCAGCTTACGGGACAACAGCGT 

AAACTCGCCGTTGTATATGCACCACAGCATGTCGCCCTCGTTGAACTTCACCAAGAGTCCCATACCGACGATTGTGGGCCACACGAACAACATGA 

GCATCGCCTACACCTGCAATCCACCCTTTGGCCTCAAGGCTACCCTCGATGGCGGAGTGATGGTGGCGAATGCCTCACCGGCCACGCCCGGTGGC 

AACGCCTCCTCCGGCTCCAGTGGAGCGAATAGCAGCCAAAGCATCGTGCCCGGCATGGGCCCCGTCTGCGGAATCAGTGTGATCACCTCGATGGG 

CAGCAACAGCGGCACGCTATGCGAAGGACCGCCTACGCCCACCCAGGAGCTGGATTTGAGCGGCTCGGCGTTGGAGCAGCAACAGTTGGCCGCCG 

CAGCGGCAGCCGCGACAGCGTCAAGCCTCCAGCTACAGGCGGCGCAACAGGCCCAGCAGCAGCGGAAATTGGATGGGACGTCGTCGGCCACGCTG 

AGCTCACTGCAGAGCTGCGTGAGTTCCTCAGGACAGGCGGCAAACCTACGTGGTCCCGAGATTTCGCCAAAGCTGGCCAAATACTTCCGCGCCGA 

TCTGATCGCGCATGTGACCAATTGGCACGCGGAGGTGCTGGAGCGTCAGGCGCAGAAGTGCTGTGAGGATACGCACCTGTTTGGCGACATAACGT 

GCACGAGGATATGCGCGGAATTGAAGTGCGCCCGCAGCCTGGTGCGAAGCACCGAAATCAATGCCACCCTGCAGGAACAAAAAATTATGTATTTA 

CGCCACCAGATCCGCCGTATCGAGGAGTCGAAGACTCAGAACGCGTTCATGTCAGACGATACTTAGGATCAGGAGCAGGTTAGGCTGCGCGTACA 

TCGCAAGCTGTCTGTTAGGCTCAAAAAGAGCTTCTTCTGGCGCAGCTGATGATGATTGCGATGATGATTGGTCGATTGTGGCCGCCGTCAGCAGC 

TTGTCCTCGTTTAGCAGGCCCGGATCCAACCCTATTTATTGAGGGCATCTGGAAATGGCTTAGATGATGACGACGACTGGCGGTGGCCCACGACA 

ATGACTGAGAAACCGCGTGCATCCTGCTCCATCCTCTTACTTCTCTCCTCTGGGCGACGCATCTCTCCCCACCACCACAACCAGCCCGCGATTCA 

CTTGTGTTCGGATCGAGTTCCAGTCCCCAACCGACTATATAGTGGCAGTATGTAACTGGCTGAGCGGACTGCATGGACTTGGCACAGGACAAATC 

AGGATAAAT CAAC CAACAGCT GC AAC AAGAACTT CAAC AAGAAAC AC AAAAAC TAAAAC AT AATTTATAAT TAGCAAAGCGTAAATTAAAAGCAA 

AAAT AAACT AAAAT AAGC AT T GC AT T A AGC AAG AC GG AGG AAAAGC C AG AG AAAT AG 

(SEQ ID NO: 977) 

Start ATG: 1 (Reverse strand: CAT) 

MRSNKNSYGSSNNNSNNMAVDLSLSTTTIANCNSNISNHNNKNNKNSASAATLAANSTPSAVGLCRLESSKYSSSKRDYERDRSSNYRDRDLSPG 

AGGGGGGGSAGGGGGGSGNGGGPLNNGNSYRSQSPDIDSPSSRSHDLRDRSDHRGGGGGNGRGGSGERYSFMQKMRDRDRDVYKKDKYSDKRDRR 

GNDRDSESSYRTNHDRDRRGGGGSGGGSGKLCSSRENDKRSGSDDRDRDRDRDLRDLRDKRDRGSDRDRDMYKKDKYADKRERSDRGERTARYGD 

WSEHVSSSGKMYYYNCKTEISQWEKPKEWVDRERNLPRDQHREKDYRDKDRDRDRDDRFSRSTYKHSNSSRDNSRLRWNYDNDGGPPSHRRRLDG 

RHNDKADMDISGDSTPTSEASYSLSGTPTTHGGGPGGGGPGGGGGSNSDQPMGNALPRLSSHPTANSSASVATGTGATGGLHYGSGTGGGPVTGA 

TMLPTMSGMLWSNSSNSAGGSSSNASSSSLRNSVVGHIGSTSGTTVPTLGSQDPHQHHLNS3SFAPLPPGAKGKDQALLMRQKMHLGLGVLDVQSHH 

GVWSVGSVSDGTNHAYNSVNNSVSGSLRDNSVNSPLYMHHSMSPSLNFTKSPIPTIVGHTNHMSIAYTCNPPFGLKATLDGGVMVANASPATPGG 

NASSGSSGANSSQSIVPGMGPVCGISVITSMGSNSGTLCEGPPTPTQELDLSGSALEQQQLAAAAAAATASSLQLQAAQQAQQQRKLDGTSSATL 

SSLQSCVSSSGQAANLRGPEISPKLAKYFRADLIAHVTNWHAEVLERQAQKCCEDTHLFGDITCTRICAELKCARSLVRSTEINATLQEQKIMYL 

RHQIRRIEESKTQNAFMSDDT* 

(SEQ ID NO: 978) 



Celera Sequence No. : 142000013384682 

TGATCTATTAGAATTTGTTTAAAAAGTTTAGAAACTATAATTTTATACCAATCAACCATTTTACCTATCCAACAATTTCATAGAAAACTTCTACA 
GTTTTATAAACATTATATGATTAAGAGCCGAACCTTTCAATCAAAAGATTCAATTGGATTTAAATTTATATGTCATTGTTTTTAAATTGGTTTTG 
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CTGTATTATCAAAATTTTCCTTCGTTGCTCTTAGATAACATTTTCTCTCATAGAACATTTTAAAATTTTCTTTATTTCTTCGAGTGTGGAAAATT 
GGTAGTGGT^AGTATCTTGCAGTTTCGTCGTATATCTGCTCGTAAAAGCAGCAAGCCACCAATGGATTTCCTCCCACTGCCGACCGGCCACCCCAT 
TTTAGCTGATTCTTGCTTTGGTCATTAAGACGCATTATTGTTTGGACTCAAAAAAAAAAAAACAGGGGGAAACGGTTGGATGATGAGACAGCGGG 
CGGTTTGCATAGTGACTCTCTGATCTGATTGCAGACAACTGATAGCTTTATTTCATCAAATGAAATTAGGAAGAGTGTCGGGTTCAGCTAGGGGG 
AAT AAT C GC A AAT T AAAT ACT C AT GT T TG A T AT AAAC TC A T AAAACAAT TC TTAGTTTTT T AAT CA C AAT AAAC C AT T T T T AC C AC AT C T C AC GA 
TGATCATCTTAAGTTGGTTGCTCTTTATAAATATTATATATTTATATATATTATATAAATTAATTTTAGGTTTTGGTGTATTTTAAATTAAATGA 
AACTTGTGAGTTAAGTCTTTAAAATTCCACTTCAAACTTGCCGCGCCACAAACGACTGCTGCAGCTCGGCAGCACTGGCAGAAATACAGGGCATG 
CAAGAAATACTGGTCACGAGTAGGTATATTCCACCGGAGAGAGCGCTAACTGAACCCAGTATTTTCCGCTGAGAGAGCAGCAAAAGCCGGTTCCC 
ACCAACAACTTCGAACAGCCGGCGGAGTTCAGTAATAAACAGACTTCGAACGCGTGCAGTTCAGATTGAGTTTGAGGTTCGATTGGATTCGAGAT 
ATCCAAGTTCCTTTTGCGCTACGTTTTGTTTAGCCGAGTTTGCCCCGAACTAAAATCACCCACCAAAGCCGGTTCGGGATGCGATGATGCCAAGG 
ATGCAGCTGATGCAGCCGGGCTCTTGTATCTTTAAGATACAAATTAAGTTGATTTGTTGCGCTCGCGTTCTCTTGGGTAATTGAACCCCACAGTT 
GGGCTCCTTGCGCCCTCTGCCGCGCGGTCTTCGGTCTTCGGTCTTCGAGCATTTCGTTGCATTTTGTGTATCTAGATACCTTTGTCGGTATCCGT 
ATCCGTCGCTTGTCAATCTGGTCGCTGATAAAAAAGCAAACGCGAACGCAGCCACGAAACATAAACCAGCCGTCAACGGAAAATAAAAGTAAAAA 
GCAACAAACAGCAAGTGGCGTATCAGTTCGTGTGTTCTCGGTGATAAATGCATTTCATGCGTTCCAACTATGTAAATAGCAATTTAATTTGAGTG 
TCCACAGTAAACATCAAAAAACCGCCCTGGACTCCAGTCACCACCACCAACAACACCATCCCCGGTGGAGCCCAGTGATCCCATTGAAACCCCGG 
TAAACCCCTGAAAAGGGCAGTCGTTGTCATCGTCGTCATCGTCGTCGTCGTCGTACGTCTGTGCCATCCAAATATTGAATATTTACAAATGCGTT 
GAGTGCATTTTCGAGTGGCCGGCACTTACAGTGACTATCTGGTGGATGTGCGAGTGTATTCATCGGATTGCCAGCAGCATAAAACTTCGTTTTCG 
ATACTGATCCAATACGCGAGTGCTTATATATTTTTTGTGTCGCTAATTGAGACCAAAGGTGTCGAAGTGCGGCGGTTATACATATACGCAGATGA 
CTGCTAGGATGATCTCAATATGCGGCCTGGTCATGGCTTTGATGATGGGTAAGTGGTCTGCTACGATTGGATTTTGTGGACTAAGTGAGGGATTG 
CCTTATTCTCGGGGGGTTCTTTGCAATCAAATTCATTTCATTTGCTTAAGGGGGTTCTCAATTCCCATTTATTTGCTTATTTCGATGTCGGCATT 
TGGCGATTGCCTTTGCAACTGAATCGCTGGGAACTTCAAAGGAATCACAACATTTGTGTTGCGCAGAATATTGGCCATTCCATCTTGGGTGAGAC 
CACCCATAATATGGACTCTATTAACCTCGAGGGCTCTGCCGTCCACACAGAACTACTCCTATTTCCTTATTCTCCAAGGGCGATTTTGTTTTGCG 
AAGAAAAACCGAAAACCCAGAGAGCCTTTGTCTCTGGCTATTCAATTTGAAAGGCAAGCGGCGTGCAAAATGACTCAATTAAAAGTCAGCCAGGA 
GCCACCAAGCTGGCATTACTAATTTGGAATAGGAGAAGGAAAGCAGCGGGAGAGGAAGCCGTTTGGAAAATTATGAGTCTCAGGTCTCAGTTCAA 
AATGAAGTCATTTTGGGCAAACATCZ^ACATCGAAAATGTGACGAGCTCACTAACACAGGCAACAATTGAGGTCCTTGTGGCCAGGTTATCCTTTT 
CAATGACAGATGTTGCTCATTAAATACTTTTCTTTAGATAAAGTAGTTAATTTATAGGCAGATCAGCTAAAATACTCGCTCGGAATAAGAAAATA 
TTGTTATAAAACTAATAAATATTTCATTTCTATACCTTTTAGATCCGTTCTGTTTTAATAGATTCACTAATTAGAAACACTATATAGGCATGTTT 
CCGAATGTTTTTTCCCAGTGCTAGCACACACACACACTTGAAATATAAGGACCATCCTGCCAAGGATCACGATACCCACTTGAAGTGAGACAAGC 
GACGCCGAAATTTCGGCTAATCAAATATAGTTTACAGCGGGGATTACGAGTGGGATAAGGATCGTGCCCGAAGGAGGGGGATCTCGGGTCTCAAG 
AGGGACACGATTATGCAAACGAAGGTCCTTAACAACGCATGTGTGTGCGTGCGTTCGAGTGTGTGTGTGTGAGACAGGACAGAGCTGAAAACTGC 
TGACTGTTAAGTAGCACAAAGGCAAACAGCCGCTAACAAAAGCCAGCCGGCAGACAGCGAGGATAGGGGGATTTGGATTCGCATATTTCAAAATC 
TGTGCAACGCTACTACAGAGGTCCATTATTTGTATGGCAAATAATGTGGCTGTCAGTGTGTGTGTGTGTGTGTGTGCTTGTTTGTGTGAGTTCCT 
TTTGTGTGGCAGTCATAATTTGTTATTCAAGCATGCAAAATGCTGCATGCAACATGCCAGGCCAAAAGCAAAAGCATCGTACATCCCACATTAAG 
GGCAGCTGTGGCAACTACAAAACTCCCCTTCACCAACCTTTGCCACTGCCACTGCCACACCCCCCGCCCCTGGTCCTTCCGCCTGTCTGCATCCT 
TCGGCAACCTTAAAGACTGCCAGTCGAAAGTTCTCCAGAGGGTTCTCTCTCGAAAATTGTATGAACTAGTCATGCATTTGCTTAGAAAGCCGGGG 
TATAAAGGGAATTTCAGGATTTAATGCTGCCTTATAATGAATCCAATTTATTAGGCTGTGTTCTATGAACAACTGGTCTCCGTTTGAAATACTTT 
GAATTGAATACAGCATCCTTGAGGTTTTCTACTACACTGGATTGGCTGGATAAGGGTATTTTACCTATATGTAATAGACATAAAACCATAATTTT 
GTAAGCTAGTTATTTCCTACAAAGTAGCGTTCAAAGAAATCATATCCAAAATGGTATTTATTTTACACATAAGAAGTAAGAGACTACATAATAAC 
TTTTCATAAATAATGCTTGGCAGGCAGTACGATTCCCATTTTCATGTTGTTAAATCAATTTTGTTTGAAACACAAAGGAAATGATCCCGAGAAGC 
TCGATTGTAGTTGAAGGGAAAATCCCTGAATAAAGCCCTGATGACTGTTTATTAGCCATGCTGTAACCTTTAAATTGAATTCCAACTCTATCAAA 
GACCATCCAGTCCTTTTTCTGGCCATTTTTACCCCTGAAAAAACCCTAGATCAAACGTTTGACGCGTAATTGTCAGACACAATTCCACTTTTCAT 
TCAGTGCCTGTCTGTCTGTTGACTTGCAGTTTTCCAGCACCGCTTTTCCCAGACAAATGCCTCGCATTCCACAGTGACAGTTCAGTTCTAGCCCC 
CCATTCCCCATTCCCATTGCCACCCCCCGCACACAGACACACGCACACAAAATCAAATCAAAACGCCACCCATGGGGCAAGTTCAAGTGGCTGCC 
ACAGTGGGTTGCCTGATTATTACTACAATATTTCGCATTCATTGTGCTTAATCGACGAATGTTGCCGCACCATTTGTGGCAGTCGAAGTTGGATC 
TGTATCCGCATCTGCATCAGAATCTGAATCTGAATCTCAAAGATGCTGATGAGCATGAGGCCACAGCAGTGGGCACTCGTAATACTGCCTGCTGC 
TGACTTGCTGATTTCGGTTATTCGAGCC AAAC AAAC GAACTAATTATCAC AC ACGCACACACACCCGCGACGAGGGTAAGTGGAAGAATGGAACG 
ATGCACTGAGAAAGATTTCTTTGGGTTTAAAAAAAGAATCTTTAGAGTTCCAAGTACACACCTTAAAGTATATAAATCACAAACGTATTACTTGT 
GAAACGGTTGTTATAGAGAAGATTGTTATACCTATAATGTCAAAAGATAAAAAAAGATAAAAACATCTACCAATTCCTATCATAAACTCTGTCAT 
ATCTCATTACATGCTCATAATTTGCGAAACTGCTTACCAAAAGTTATAAATAAACATTACTTAGTTTAATAAATCAATTGAAAGACACAGTGTTG 
AAAATTCGTTTGTATTGTTTTTAAATTTAGAAAATCAAAAATTCCCACTTGGCTACCCTTCTTTTTCGCCAGTGTATGTCCACAGACAGATCGAG 
ATGGGCAGTTGGAACCGGGACTGCCCGCTGCTCCGTGGCTCTGTCGCTGATGTTTGCAGCATCAACACTTTTTCAGTTTGCTGCTCGGCTTCTGC 
GGCATCAGTGCTCCTCCACAGAGCTCCACCTCCTGCCCCACTCCTCGTCGCTTTCCGCACCCTCTCACGCATTTTTGTGCCCACATCAGCGCAAT 
CAACGGACTGCGGAGCGGGACGCATATTTCTTTAATCGACGCCCAGTAACTGACATTGTTCCCTCGTCTCGTATCGAATCGACTCAACTCGACTC 
GACTTGACTGGAACTCGAACTCGTTTTCCGCACGGCAAATTAAATCGATTCAGGGGAGGATCGAGTCGAGCAGGGCAGGGAGTTAAGCCCAGCCA 
AGAGGAGGAGGTGGAGGGCATCCCGGTGCCTGCCCAACTGCCTTTTCGGTTGGGCCGTGTATCATAACACGTCTGTCGGCGATGTGACTGGCATA 
TGAATCGCATGGAGTTCCTCCTTCGAACGTCGAAAGGCCAGTAATGATGTTGTGGCTGTTGTTGTTGGCCCGCTTATTGATTCCTCGGTGCGGTC 
GTGCTCTTGGCCAAAAAGGAACTGCGCCCGGGACCCACAATCTGAGGTGTTGTCTCAAATTGCTCCAATTAGTGGCCAAATATAATGGTTGCCGT 
TTTGTAGTTCTTGTTCATGTTCCTTGCTCCTGCTACCCAAGTCGTATAATCAAAAGTAATTGCGTTTGAATTGCTTTTCGTTTCGAGTTGCGGTT 
GGGAGTTATGTATCTTCTGGCCATATCTACTCACTTGACTTTGTAGCCTCGAATGGCGCCACTTGTGGTCTGGATTTAAACAGCTTCGATATGAA 
AAGGTTCTTCGCTCATTTGGTAATCGCCATTGAAATGCATTCATATGGGCCACACCTGACCAGTGAATTGCATTTTAAATGCAGCAAATTGCATT 
TGCGGCTGTTGATTTAATATAACAACCGATTTGCATTCACAAATAAATGAGAAACACACAAATTAAATGTATAAAAACATTTTTCCTCTCGAAAT 
GCGTTTATTTTTCGCATGGTTTTTATGGGCGCCTTTTTCTTCTGGCTTTTAATTATGTTGGTACAAAACTCTGAATACGTTTTTGTTTGTTTTTT 
TTTTTCACCATACACATATTGTAATTTTGTTTGAGGACACGCAACTTTGGACAGCAATGGCGTAGGTTTTTTCTCCATGCCTCTCTCTGATACTC 
TATAAAGGGTATTTTTTTAAATTTTAATGGCAAAGATCATTTCATTTAAGCTTTGAACTTATGTAATTATTTAGTTTGTAACAAATTGTCCAATA 
ATTCAATAATAATAATATGCTACTTAATAAGCACTTTAATCATTGCAGCTAAAGAGTATTCAAAAAGGGTTTTTAGATTTAAAAAAAAAGCCTCT 
GTTTTTTGTGCTTCCGCCCTTTTTTGAGTATTTTGTTGCTGGTGTTTCTGCCTTTGCTCCTGCCCTTGCAGAGCGCTTTACAAAATTACATTTGC 
GTTAAACACTGTAATTTAAGCAGCTTAAGCGCATTTAATTTGTTTAAACCACTTTAATTAAACAAAATTATGGTTAATGCACTTTTTTTTTTTAG 
TTAA.TTAAGCCAAATTGTTCGATGTTCAATTTAAATTTAAAGGGACTTTTGTAAGAGCTGCAAAACTCTGCCCTTTAATCGGGATAATGGCTGAC 
TGTAATGAGATCCAGTTTACTAATTTCCCTCGCTTATTTCTATAATTACAGCTTCCGTTTTGGCCAGCTCATCGCGTTTCCAGCGAGTGCCCCAA 
AGTCAGTCCGTGGTGGAAAATGAATCCGTGAAATTCGAGTGCGAATCGACCGACTCCTACAGTGAACTGCACTACGACTGGTTGCATAATGGTGA 
GCAGTGCGATTTACATATCCCATGGTTGTGCATTAAGCATTCGCCTTCCTTGCACTCCATTCCAGGTCATCGGATTGCGTATGACAAGCGTGTCC 
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ATCAAATCGGTTCCAATTTGCATATCGAGGCGGTGCGGCGCACGGAGGATGTTGGCAACTATGTTTGCATCGCGACAAATTTGGCCAGCGGCGCC 
AGGGAGGCGTCTCCGCCCGCCAAACTGAGCGTGATATGTGAGTAGTTATATTCCCAGTTTCAGTCCCCATCTCCATCCCAATCCCAGTGCCCATG 
TCATTCCAACTCCAAGACCCACAATTGTTCGCAATAAGCAAATTGACTGATGGTTAAGGGCGACAGCAAATCTGTCTTTGCTTGTGCAAATTAAC 
AAAGCGAAATTGAATTGTAAGCGGGTAGTTGAGAGCACTGCAAAAAACTTCACTTGATATTGTTACATTTTTATATCTAAAAACTGAAGTTAGTT 
CTTGAGCCATAAACTTTCAGTGGAATATTACATAGAACTTACCTAGATTGATAGTAACTGCTGCACTCTGTAAATGCTCACCGGTTTCTTTGCGT 
GTAGGGCAAATTCATTTGATTAATGCTGCATTTAATGGTATTCCAACCCCAACCCCACCTCTCCCTGGCCCAAACCCCCCTTTTCAGATACGGGC 
AAACCACAAGACAACAAGAAAAGGTAAAGAGGCATAAAAACGCAACCGAAGCGAACGGAACGCAGCCCAACAAACCGATGAAAGATAAAAACCGA 
AAGTTTTGATGTACATGCGTCAGATCCCATGCTAACCCAGTCCCTTTTGCCAACTCCCCCTTCCAAAGACCCCCCCCCACCCCTTCACCAATTAA 
CCCCTTCGCGCAGAGCTACAGGAAGTGCAGTGCAGTGCACAGAATGCATCCCATGCATCGGACCGTGAAGAAGAAGACCCTGGTCGAAGAAGAAC 
CACACAGAGAAAGATCACTTGGTTCTAAAATGCAGCTATGCATTCGAAAAGTAAACTTCTTTGAGAAAACCATCTTCAAGATACATCCTCAAGAT 
ACATCTTCTTCTTTGCAATTATGATAAATTGTAGAGTTCGTTTCAAATCTACCTTTTCAATTCTTAATAGCGGATAAAGAAACATATCTATGAGT 
CTTGAGTTAAAACAATACGATTAGATAAAATATCTCTCAGTGCATCGCTGATTTCCTTCAAGTAGCAAGTGCAGTTGCAGTTCTTGGCCTCGTGT 
TGAGGCAGCGGCGTCACGTGATTGATGTTCATTTAGTTGCGTTTGCCTTTAAGCAGAAAAATGCATAAATTTACCAAATAGCTCCCCCCACTTCC 
CCTACTCCACCGTCTTCCTTTTTCATGTAAACAAAACGCGAGTTTTATGCACAATCTGGAGTGCATGTGTTTGGGGGCCCTCCAATATACGAGAT 
ACATATATCTGGTATCTGGTATCTGCTATCTGCATCTGCATCTGCATTTGTATCTGTATCTGTAGCTGGGCTGTCAGCGCGGCCACGGCATCCGA 
GCTGTGATCAAAGCCACCCAACAGCAGGGGCATCTTTCCTCCGCCGTCGCTATTTATACACCAGCCTCATCTCAAGTGTTTTTAGCTAATTAATT 
AACTAATTAATTTGTAGTTTCGGGAGTTGATTTTAATTGCCACAGCTCCCTTCCCCCCCCAAAAAAA21AAACAGCTGCTGCTGAAAAGCGTGGAG 
AGTGTATATATATACGTATTTGTATCTTGGAGGGGATGCAGGGAAAGTATCTTGGCGCCAGCTGAAGCTAAGTGGGTTATGTGATTACCACCTAC 
TCCTGCCCAGCTTGCATCAATCTCATAATTCTTGAGCAGCAGACAAAAACTATTAAAATATGCATGGGGCAGTATGGATGGAGATGGTGTCGTTG 
CTCCTCAGTTTTAGGATTCGGTAAATATTTTATTCAGGACTCGTCGCCATTGTGTCCCCTGGCTGCATTGTCTTAGCCCGGATTATCCTTCCCCG 
TCTGCACTTTTCTCGCAAATAAGTGTTCAATTAAGCGCGAGAAAAGTGTCATTTTCCTATGCGACATTTAACAAATTTGCAATGGAGACTGGAAA 
ACAAGAAAATTGTGTGGATGGGAAAGGCAATCTGGTAAAAGCTCATTAGATAGTGGAATATATACTTCTGCATCAATATAAAATAGCAGATAGAT 
AGAGCATTAACATTTTAAATGGTAAATAGGTACTAAAAGTTCGGAAATAAACATATTTTTAAAGAAATATTATGAGTTTGTAAATATATTTTAGT 
CTTACAAAAGATACACACACATAGTTGAGTTTTAGTACTAAAGTTAATCTGAATTTATGAAGTTATTGTGAGTAAGCGCTTCTTTAATGAAAGGC 
TAATGGAAATCCCAAGGATAGTTCCAATCTTAGCGGGCAAATCAAGTAAACCTCTTTTTCGGTATTCATAAAAGCGGGAAGACAAATGCTGACAG 
TGCCCGGCGACTAACCGATGACTACAAGGAATTAAGCAGACAGCGAAGAAATTAAACAGACAGCAATAAAGGCAAAAGCGAGTCAGAGCCAAAAA 
CAAGGACCAAGAAGAAAGCAGCCGCGTCAGACCAAAACCAGACGACTGGCTCCTCTGTGAGGACGATGAGTTTGGGGCGAAAGGACATTCCAAAC 
ATTTGCTCAGCGAAAGGGGTTGTTGGACTCACAGGACTGGCAGCTCCGGTACTCAGCCAGGATACACAGACTAACCATTGAACAGCTGAGCGAAG 
CTGGTGGCTAAAACCGGAGGGAAATGTGTGTTGTGCGCTGAAATGTATGGGGGGGAAATGAAATGCAACATTAACACGAAACATAAACATAAAAT 
GAGAAATGGCAGCAGCAACAATGGCAACCGAACCGAGCCAGCAAAGTCATAAAAACCAAACAAAATGCAAAGACACAGCAAGGACGGGATTTGCA 
GGGGAATGTCAGGACGAAAAGCAACCCCTTAAATCGACCGCCCAGGCACCCGGGCACCCGAACACCCCCATCACATATATATATAGACTGTTGTA 
TCTGTGCTCTAGACACAAGTTCAAGTTGACTGCAGTTGGTAAGTTGGGTTTTTATTTTACGGATACAGCGGGGGTGGAACGCGATGCCCGGGGAG 
TCGCGCGGTCGGGTGGCGGTAGCGGTGGGGGGAACCGGATAATTTTCATTTTGTGAATGCCGGAAAAACGCATAAAATTTTACGAGCCAAGCTCG 
GAGAACATTAAAGGGAGACGACGACAGACAGACCGAAAAACAGAAAAACAGACAAGAGCTTGCAAACGGAAAATGTAGCCAACTTTTTGGCGCGC 
TAACCTGCGTCCTCTCTTCATTTCTAGTACTCTTCACACAGCAAGGATAATACCAGTTTTCATTTGAAGCTATGATTAAGAAAAACAGCAACTAG 
ATTGACTTTGATAAGCAACGAAATTAGACATTGTGAAACTACACATTTTCCTTTAATTAAAGGTTCTTAAAAATTCATATTGGATATACTATACA 
GTATATATGTGCATTGAAAATAAAAATATTGTAATAAAACATACTTGGCTAAAGATTTGAAATCAGTAGATATTTAATTTAAATAAAAACATAAA 
AATGCCCTAGCACAGACCAATTGTTTTGCTCTGTGCATCTCTTTCAACCACTCTTTTCCTAACTCTAGGATTTTCTTTATATTCCTTTTTGGCAT 
TTATTTTACTTTTAGCTTTTCTGTGCTGCTACACGAGTTGATCATTAGCAACTTTTGGGGAAAGCATTACGGGGGGGGGGCAACGGCAGCAGGGG 
CGTAGTTCAGGGAATCCCCCCTTTAAAACGGGGGCTAACTGCCCTCGCCAGAGGGCAAAGGTCACTGCAACGGAAGTCCCCTTTTTTCGCAAGCC 
ACTGGAAAACCAGAGGGCAGACAATGTGCGATAATCGAATGTTTAACGTTTTAATTTGAAAAACGTTTGGGGTGCTGAGACAGCGACTGGCCAAC 
TTTAATTAAAGATCCAACGATAACGCTATGGAGCAGGACACATCGAATGTTCTTCGGTCAGTTGTTGCAATAAATTTTCCATAAATATAAATGGC 
TGGCCACTGAAAACGGAGAGCTGGGGAAAATGGATGGTAAAATTTGGAAATTCATTGATATCGAAATCCGCTGATGAAAGCCACTCATCGATTTG 
TATCTGTGGCAAACGATCTGCGAAATAGTTAATTCGTTGAGTAAGATATCTGCAAACAATTTTGATGCATTTCCCTCTTGCATCTGCGTTAATTT 
GCGAAATGGGTCAATAACTTGTAAGCCGCTTTGTGCACCGCACACACACAAGAATGCCCAGCCAATTGAAGTGAAATGGAGTGGAGTGGCCACTA 
GCCGCTAGCCACTGGCCAGTGTCCAGTTTTCCAGTTGGACAACTGGATAGATGGAAGCGCAGAAGGGCTGCATAAGTCCATTT^ACGTTCACAATT 
ACCGCCGGCCACTTCAGAAACACAGCCACTTCTATGGGCTCCCATTTCCATTCCTCCATCTTTCGGTGGACACGGGCCACTTGGGTTTATGACAG 
CGCTGAAAACCCTTCCACTCGAGTTCGATTCCCAACTCTGCCCGTCCGAAAATAATCTATTTTTGATTATCAATTAATTATATTTTCATAAAAGC 
GGGCGATAACAAGAAGTGCAGCACTGCAGCTCAAGGAAAAAAAAAAAAAAACAAAAACTAGTTATCTTATCGGGAGGAGCGGACTCATAGTTGAC 
GAAATAAGACTCAGTCGGCACACTTATTCTGGGGTCTTTGGAGCTCCAGAGGTGGTGATGGGGAGTGAGTTCCCAAGACCCAACCGATAAGAACG 
ATCCGCTCGGGGGAACTGTCAATTTCGATATCTATAATTCTGCTTGCCTTTCGACGTTTACTTCCACTTTGTTTACCCAAGGCGTCTCTATTTTT 
TTATGGGTCCTACCCACACCCACACTCCACGGCATTCCGGCAATTGTTTACAGCGATTGTGGTTTATGCTCCATCCTCGGCCAGCGAAAAGTTTA 
ACAAATTTATTTTGTTTTGGAGTCGTGTTGGTTTACATCGTCTTATTTCCCTCGTTCCCAGTGTTTAGTGTTTACCTCGACTTGCACGTTCAACG 
TGTGAATGACGACCGCCTTCCGACCGACCCCTACTACAACTTAGAGTTACCCCTCCTCCCCCCACTCCCCGCACGCCATCCCTCGCCCACAGGTA 
CATGTAGATACAACCCATGGGTGTCAAAAGATCTGCTGTCTCTGGCGATGCATGCATCAGCCTAATGAGAAGCTCGTTAATTAGCGACTCGCGAC 
TTGCTAGTAATTAAAGTTACAGTTAGTCATACCTATTAGTCAGATTCATATAATACTTTCTAGGATCGCAGAGGTATCTCAACATGAAGCCAAAA 
AGAGGTAGCACCAAACAAGCTATATATTATAAAGATACTTACAAGGGATGCAAGTTGAGGATGTTAATCTCTGGTTCTTGTACTAGCACTTATAA 
TGTTAGATACTAATCGTACCTTACAATACTTTAGAATCTTTTACTTACGAACCATAGTAAATGCCGAACCTCTTTAACGAAAATACGAATATCAT 
TAAGCAACATTTCCCCACTTCTATGTTAAGATGCTCAACCATCAGATACTTGTGTGCTTACCTGTCCAGTGGCGAGTTGAATCCGTTCCCGTTCC 
CGTTCCACACTTTACCCGCTGCTCACTTCCCCCTAGATGTTTGGCCATCCATTAGATACTTTTGCTCCTCGGCTGTCTAAGTGCCAATTAGTTCA 
CTCTACTGTCATTTTAGTCGTAACATATACCAAAAACTTTGTGCCACAAGCCCACAGAGTATAGCCATTGGTACGTACTACGTACCACCATACAA 
AGTCGTCGTCGAGCTGACGGACGTGCCCCAAGAAGAGCAAATATACAAACACATTTCAAACTTTTGACACTTACCCAACGGAGACGAAAGACAGG 
GCACTGTTTATTGAGTTGTTGCTGTCGGTTCAGTTTGGTTCGGTTCGGTTTGGTTCTGTTTTTCCACTGCCTGCCACCAAATAAGTGTCTACAAA 
TATTTAGAGGCATTTTTTGTTCACTGACTGAGTCTGCGAGAGGCAAACAAAGTCGGTGCCCAAACAAGATTTATGTGCATATGTAAATTTTCTCT 
AACGCGGCCAGCAAACCGCAGCCATCCAGACCTGGCAAACATTCAGATAGGCAAAGAGGAGAAATATCCATAAAACATACATCATCCATTTTGCC 
TAGATCTCGGCTCGCTCCTTTGGGCCGCAGATAAAAAATGTTTTATGAAAATTATGGCACCATCTCCGAAGGCTGTAGCAATCACACATAACATA 
CATACATATATATGGAATACCAAACGCGGGCAAATAGAGAATGTAGGAAAACAATTTGAATAAATTATTTGGGGGGATACCAAATTAGAACTTTA 
AGGAGCAGCATCGCTCGGCATCCGACAATTATCGCTCGGAATACAAAAGTATCTCATTCACATCGCCACAATGGGCTGCAGATACACGATTGCAG 
CTACAAGATATTATGGAAGTCCGGTCATACAAAAGCCACACACAGAGAAATTTTTATAATTACTACAAAATCGGAACTTGAAAAACTTCAACTGT 
CGTTGATAACAATAAAACATTAAATGAACACTAAATAAGCTGATGAAATGTATTTCAAATTTAAGGAATATGCAGTGGATATTCTTATTTTGACT 
AGTATTTCAATCTTAATCATATTAATATGCATTTGCCCAACTTAGCATACCAGAGAACACTTTTAAGTTTTCTCTCTGTGCAGCGATTGTGTATG 
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CTAGCGTGTCTGCAACATGACTGGCCACGCCCCTTGCGCCTCCGGCTGCCACTAAAACATAAATTAAACCCTATGCCACGGCAGAGAGTTTAGAG 
AGCAGCGCATTCTTCTTTTGGACTGCAGTTTTTTTCCCCCCCATTTTTTATGCGCTTTTCTTTTGCGCCATAGAAACTCTTTTCTCATGCGGCAA 
TTGTTGGCAAGTACCGTTCCATACCGTTCCGTCTGGCCAAGAAGTGGAGAAAGAAGCCGGCGGGCAGGAAAATGGAGAGGGCCATGTGAGCATTT 
ACATATGGGCAGCAAGGTGGGATCGAGGTCGGGAGCGAGAGGTGGAGCCTTCTCCCGACTGACTGACAGTTCAATTTATGCTTCCTCGCCACCTT 
AAATGGGCTGCCAGGCTGTGCCGATAAAGACAAAGATTGCAGCGTCGTGTTCGGCGCTGAACTGACGATAAAATGCACTTAATGCCGCGCGAAAT 
ACCCCTGCTTTCCAGGGCGAAATATGTATTGTATGTAGTTATAACCAAAGGGCCACGGAAAGCGGAAAGCAGAAAATCTGAAATCAGAAGAGGGA 
GAGCAAGCGCAGCGAATTCGCATAGCCAAGAAAACGCATTAAGGCTAAATGTATCTTCCGGATATATTATGCATATTTATGGAATGCCAGCCTAT 
TCGCATTTTACAGGGGCTGATCAATAACTTTTGCTGGCAACGAGCCTTCCGTTGGCCGAACTGTTGTGGCAACCCTCGGCCTGCGTCTTGCATTA 
ATTCAATTAGTTTATAGCCCACACTCATTCCGCAGCCTCAATCTCCGCATCTGCTACTTCTTTTACACGGTCAGAAAATATATGAATTTACTTAG 
TTTTTTATAAATAATGTAAAATTTAAAATGTTATTCCCTGTCATCTCTTCTTTAATATATTGTGGTTCTTAGCATCACTTACTTTTATATATGTA 
TCTTTTATGAGGCATTAGATGATAAAAAATATACATACTTGGCTCGAACGTATGTTTATATAAAAACTATAATCGACTTATAATAACACCATTCT 
ACCCATTCATTCCTACCTTCAGATCTTGAATCGGCCAGTGTTCAGCTGCTGGGCAGCAATCGCAACGAGCTGCTGCTCAAGTGCCACGTGGAGGG 
CGCCTCAGGAGATTTGGAACCGCTAGAGATCGAATGGTATCGCAACAGCGAGAAACTTAGCACCTGGAAGAACGTCCAGCTGGATCAGCATCGCC 
TGATAATCCGACAGCCGGGAAGCGAGGACGATGGCCTGTACCGCTGCACCGCCTCCAATGCGGCGGGTCGAGTGATGAGCAAGCAGGGCTATGTC 
TACCAGTCCAGTGTGAAGTGCCTGCCCCGACTTCCTAGGCGCAAGAACGAAAAGATGATGGAGTCCTGGGACAAACAGACCTTCCTGTGCCGCGG 
TAAGCGCGGTGGAGCTGCTGGACTAGAGGCACTACCCGCCGCTCCCGAGGATCTGCGCATTGTGCAAGGACCCATTGGTCAGTCGATTATCAAGG 
AGGGTGAGCCCACGGCTTTGACCTGTCTGTATGAACTGCCCGACGAGCTGAAGAACCAGCGCATTCAACTCCGTTGGCGCAAAGATGGCAAACTT 
TTGCGTCAAGTGGAACTGGGTGGCTCGGCGCCCATTCCAGGTCACTCCTTCGATTCCGGCAAGGATGCTCTGTTGCGTGAGGACGCCCGTCTAGT 
GCTGCACAAGCAGAATGGCACCCTAAGCTTTGCCAGCATTATTGCCAGTGATGCGGGTCAGTACCAGTGTCAGTTGCAGCTAGAAGCTCATGCTC 
CGATTAACTCATCTCCTGGCATCTTGGAGGTCATCGAGCAGCTAAAGTTTGTTCCTCAGCCCACGTCCAAGAATTTGGAGCTGGATGCAGTGGTG 
GCCAAGGTTCACTGCAAGGCGCAGGGCACTCCAACGCCGCAGGTGCAATGGGTTAGGGTAAGTAATTCAATCCAATTAGATTCAAATAAATACTT 
ATTTTAAGGTACGAACTAAAATGGCATAGCATTCCCTTCGATATATTTACACATGGTTTCAAACAAGTACTAGCTTTCATTTATATATTTGTTAC 
ACTTACTTTTGATAAGTAATAGATTGTTCTTGCAAAATTTTAGTACACCCTATCCGGTTGACATCTGCCTCCTATTCAGTAATTGCCCAGGCACT 
TCGGTCTTCATTTCGACCGCTTCCAAATGGGAATGTTTCTGGACATTTCTGGCTCACTTTTTCGCCCACTCCACCACGGCAATGATGGCCACATG 
TCACTGACAAATTGAAAAAACGCGCATTGCGCCGCTGGGCAAACTTTTGCTCCAAACAAACCGTTTCATATGCAAAACTCTATCGTGTCTATGTG 
GCCTAAGCACCCTTCTTCTATCGCCCGAACATCCGAAAAACGAACCCACTCCATTTGGTGGCCATTGAGCGTTTTGTGTTGCCAGCTGTTTGCCA 
TGGCCAACCAATTTCGACAACTGTTTGTCTATTTGGGAACGGAGCGAAAAAAGAGGATCGAACCCTTTCAAAGAAAGTTTTTCATTTGGCCGAAA 
AACAAACCTGACTCAAACCTGTTCAAAGTGTATTGCGTCCTTTTTGTCrGCTTTTCTGCATTTTTGATGAAAACGCATGGCACACGTTGGCAAAT 
TGGTTGGTAATGGTCAGCTGTCTGTTTCTATTTCTCACTGATGTTTCACATTCAACTTTGATATTAACAGATTGGAAGTGCATAATAAATGTGCA 
TGTGTCTTTGATATTCAAATTAAGAAATGTGGGGGTGCATAAACCAGTGCACCACTGCAGTGGTrTGATATCAAAAGTAATAACCAGAAAGAAAA 
TCTATCCTTAGATTGCTATTGATATATGATAGATCCTCATAAAATCTACTAACACAACAAAAAATTATTTCTCTACAGGATGGGGAAAATACCAC 
TTTACCCGACCATGTAGAAGTGGATGCCAATGGAACACTGATTTTCAGAAACGTTAACTCGGAACATCGCGGCAACTACACTTGCTTGGCCACCA 
ACAGCCAAGGACAGATAAATGCCACTGTGGCTATAAATGTGGTGGTCACTCCCAAATTTAGTGTGCCCCCAGTGGGACCCATCGAGACTTCGGAG 
CAGGGCACCGTGGTGATGCATTGCCAGGCGATTGGGGACCCCAAGCCGACAATTCAGTGGGACAAGGATCTAAAGTATCTGAGTGAGAACAATAC 
GGACCGCGAAAGATTTAGGTTCCTGGAGAACGGCACTCTGGAGATACGGAATGTCCAGGTGGAGGACGAGGGTAGCTACGGCTGCACAATTGGAA 
ATAGTGCAGGACTCAAGCGAGAGGATGTCCAGTTGGTAGTGAAGACAACCGGTGACGGTTTCGCTCCAGAGGAATCCGGAGGTGATGGCTTCCTG 
GTCACCCGAGCTGTACTGATCACCATGACGGTGGCCCTGGCCTACATAGTGCTCGTGGTGGGCCTGATGCTGTGGTGTCGCTATCGCCGTCAAGC 
CAGAAAAGCCCGTTTAAACGATCTGAGCACAAAGGAAGCTGGCGGGGATCAGCCGGATGTTGCTGGCAATGGCAAAGGCTCCGAGCAGGAGCCCT 
GCCTGTCTAAGCAGCACAATGGCCACAGCAAATCCAGATCCAAGTCCAGTGGAGATGCCCAGAAATCCGATGACACAGCCTGCAGCCAGCAGTCG 
AGGGCCTCAAAGAAATCGGCACATATTTACGAGCAACTGGCTCTGCCAAGATCCGGGCTCAGTGAACTCATACAAATTGGACGCGGCGAGTTTGG 
GGATGTGTTTGTGGGCAAACTTAAGGCTACCTTAGTCACCTCGCCCTCGGACAAGGATGCGGATACGGAAAAGCAGCACAGCAACAGTGAAAATG 
GCAGTGGAGGCAGCGGAAGTGGAAGCACCACGCTAAGTACTCTAAATGAGAAGCGTCGCTCCAAGACCTCCATGGACGACATTGAGGAGATCAAG 
GAGGAGGAGCAGGATCAGCACAATCAATCGGGTCTCGAGCAGCTCGTTCTGGTCAAAGCCCTGAATAAAGTGAAGGATGAACAGGCCTGCCAGGA 
GTTTCGACGACAGTTAGATCTGCTGCGCGCCATCTCGCACAAGGGAGTAGTACGTCTGTTTGGCCTGTGTCGCGAAAAGGATCCGCACTACATGG 
TGCTGGAGTACACGGATTGGGGCGATCTCAAGCAGTTCCTGCTGGCCACCGCCGGAAAAGTGAACACTGCCACCGCGGGCAGCTCCTCACCGCCG 
CCACTCACCACCAGTCAGGTTTTGGCCGTCGCCTATCAAATTGCCCGAGGAATGGACGCCATCTACAGAGCTCGCTTCACCCATAGGTAAGTGAT 
TTGAGAAGGCCAGCATTTTCATAGATCCAATTGGCAATACTTTACCTTACAGGGATCTGGCCACTCGCAACTGCGTAATTTCCAGTGAGTTTATA 
GTGAAGGTATCCTATCCGGCTCTCTGCAAGGACAAGTATAGCCGCGAGTACCACAAACACCGCAACACGCTGCTCCCGATCCGCTGGCTGGCGCC 
CGAGTGCATCCAGGAAGACGAGTACACCACCAAGAGCGACATCTTTGCCTACGGTGTGGTGGTGTGGGAGCTCTTCAACCAGGCCACCAAGCTGC 
CCCACGAGGAGCTGACCAACGAGCAGGTGGTGCAGCGTTCCCAGGCGGGCTCCTTGGAGTGGTCAGTGGCCGAGGCGACGCCCGATAGCCTGCGA 
GAGATATTGGTAAGCATCCTTATAACTTAACAATTATATTCGTACCTATTAACAAATTTCAATTAAATTAATTATATTAGTCAATGATTCGATGG 
ATCCTCGCTTGGCTTAGCTATGAAATGAATGCTTAAGTCGCTGACAAACTACTGATATAATCCTTTTCTGCGTCCACTTTCAGCTGTCCTGTTGG 
GTGTCCAATCCGAAGGAGCGACCTTCGTTCAGCCAACTGGGAGCTGCTCTCAGCAAGGCGATGCAGAGTGCCGAGAAGTGAGGCCAGACTGATGG 
CTCGATCATGCCAAAAAGTAATTCGTAGTATTGTCTTAGTGTCTGCACGCCCGAGTTTTGCTATGCGCTCAGAAACCTAGTCTTAAGGATCGAAG 
GATAACTGCGCCTGTTTAACATGCGCATTGCGCACTTGGTGGCACACAGAATGAAGTTAAGCAGCTACTGTACAAACACAATGTATGGAAATGGG 
GATATCTTCGTGAATAGTATTGCGTGGTGCTGCATATTTTTAGGTATGCAGCTAAAATGTACATAATAACTGCCACAAATATGCATTTAAACCAG 
TTGAAGCATTTCTAGGATTAGATCATTTCAACCTTCGTCAGCATTTAATAATCGACTTCTAAGCTAAATTAGTAAATAAAATAAAATATAAATAC 
ACAAGCATATGCATCACACAAAACAACGAGCGATTTCATTGTTTTCAAACATTGAGAAACTAAGCCAAATTTTAAGTTTAGCATTAAATATAAAT 
AACACACACCACCCACATATTTTATCACAAGCCTAGTTATACACTTTTTAATGCGTAAGAAACAAACAGCCGAAACAATTTTATGACTAGCCATT 
TAAGCTTTGATGCATGTTTACAAAACATTTAAATAAACTCGAATAAAATAAATTTAATCAGAAGGCACAAATTTAAAGAATCAGCTTTATTTCGA 
AAGTAGAAGACCAAGGGGAGAGACGAAACCTTTCGTGGGCTACTACTTTTTATGGGCCTTTCATTTGGCATGGCAAATTTGTTGAGGGTCCTTTG 
GGCCTAAGCGGACTTGGCTCATTCACACTTCACTTGATTTGAAAAATGAGCGCCTATTGAATGTCGGGGTTAGGCCGTCTTGCCTCCACGTCTTT 
TCCCAAAACGATTGACAAACAAACATGGCAATAAACCGTAAAGTTGCCTCGGCTTATTGGGGCTTAAAGTCGCCGCACTCTCCGATTTTTTTTTC 
CCGGAATTTGCTGCTCCGGAAGTTCCAGCGCCTATGTAGCCAACAAAGTTTCGTCGTGCGAAAACTTTGGCCAGGACCAAATCCAAGCAATATTT 
GCATTTGAATCAGCAGCAAAAACAACACCGTTACACAATGAATTGCCATTTTCCTAGCCTAGTGGATAGGCTGCTTAGCTTTTCCTGGCACGTAC 
AAATATCTGCCTGTCATGAATATCAAATGCCTTCGGAAAAAGGTTGATTGAACCAGTTGGTAGCAGGAAGTGTGGCAGAAAAAAAGAAAGAGCAC 
AAATGATATGTCGAAACGGGATGGCAGAGAATTTGATACTCTATTTCACATGTAAAGGAAACCTAAAAGGATTCAAAACGTATTTACTTCCACAT 
AGGCCACTTTTTAAAATTAAAACTCCTTGATAACTTTTAGAAGCATTAGCGAACGAAATAAATGGGATACCCCTAACTCAAAGGCTCGAAAGGCA 
AACTGCATTCGCAATTGGTAAAGCTGCCTAAAATAAAATCCATGCCCTGCCGCACATGGCGTATGCGCAATATTTGCAATGTTTGTAATGGAACG 
ACAGATGTGCAACAACAAAAGCCTAAGCTCGACACAGAACGAAAACATAAAAATGCCATTTCGCATGGCTTTCAATTTATTAATATTTTATACGG 
GCCACTGGCAGAAAGGGA 
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(SEQ ID NO: 979) 



Exon: 1001.. 1948 

Exon: 6702.. 6836 

Exon: 6906.. 7067 

Exon: 14748.. 15637 

Exon: 16609. .18041 

Exon: 18103.. 18439 

Exon: 18609.. 19348 

Start ATG: 1897 

Transcript No. : CT25769 

CGCGTGCAGTTCAGATTGAGTTTGAGGTTCGATTGGATTCGAGATATCCAAGTTCCTTTTGCGCTACGTTTTGTTTAGCCGAGTTTGCCCCGAAC 
TAAAATCACCCACCAAAGCCGGTTCGGGATGCGATGATGCCAAGGATGCAGCTGATGCAGCCGGGCTCTTGTATCTTTAAGATACAAATTAAGTT 
GATTTGTTGCGCTCGCGTTCTCTTGGGTAATTGAACCCCACAGTTGGGCTCCTTGCGCCCTCTGCCGCGCGGTCTTCGGTCTTCGGTCTTCGAGC 
ATTTCGTTGCATTTTGTGTATCTAGATACCTTTGTCGGTATCCGTATCCGTCGCTTGTCAATCTGGTCGCTGATAAAAAAGCAAACGCGAACGCA 
GCCACGAAACATAAACCAGCCGTCAACGGAAAATAAAAGTAAAAAGCAACAAACAGCAAGTGGCGTATCAGTTCGTGTGTTCTCGGTGATAAATG 
CATTTCATGCGTTCCAACTATGTAAATAGCAATTTAATTTGAGTGTCCACAGTAAACATCAAAAAACCGCCCTGGACTCCAGTCACCACCACCAA 
CAACACCATCCCCGGTGGAGCCCAGTGATCCCATTGAAACCCCGGTAAACCCCTGAAAAGGGCAGTCGTTGTCATCGTCGTCATCGTCGTCGTCG 
TCGTACGTCTGTGCCATCCAAATATTGAATATTTACAAATGCGTTGAGTGCATTTTCGAGTGGCCGGCACTTACAGTGACTATCTGGTGGATGTG 
CGAGTGTATTCATCGGATTGCCAGCAGCATAAAACTTCGTTTTCGATACTGATCCAATACGCGAGTGCTTATATATTTTTTGTGTCGCTAATTGA 
GACCAAAGGTGTCGAAGTGCGGCGGTTATACATATACGCAGATGACTGCTAGGATGATCTCAATATGCGGCCTGGTCATGGCTTTGATGATGGCT 
TCCGTTTTGGCCAGCTCATCGCGTTTCCAGCGAGTGCCCCAAAGTCAGTCCGTGGTGGAAAATGAATCCGTGAAATTCGAGTGCGAATCGACCGA 
CTCCTACAGTGAACTGCACTACGACTGGTTGCATAATGGTCATCGGATTGCGTATGACAAGCGTGTCCATCAAATCGGTTCCAATTTGCATATCG 
AGGCGGTGCGGCGCACGGAGGATGTTGGCAACTATGTTTGCATCGCGACAAATTTGGCCAGCGGCGCCAGGGAGGCGTCTCCGCCCGCCAAACTG 
AGCGTGATATATCTTGAATCGGCCAGTGTTCAGCTGCTGGGCAGCAATCGCAACGAGCTGCTGCTCAAGTGCCACGTGGAGGGCGCCTCAGGAGA 
TTTGGAACCGCTAGAGATCGAATGGTATCGCAACAGCGAGAAACTTAGCACCTGGAAGAACGTCCAGCTGGATCAGCATCGCCTGATAATCCGAC 
AGCCGGGAAGCGAGGACGATGGCCTGTACCGCTGCACCGCCTCCAATGCGGCGGGTCGAGTGATGAGCAAGCAGGGCTATGTCTACCAGTCCAGT 
GTGAAGTGCCTGCCCCGACTTCCTAGGCGCAAGAACGAAAAGATGATGGAGTCCTGGGACAAACAGACCTTCCTGTGCCGCGGTAAGCGCGGTGG 
AGCTGCTGGACTAGAGGCACTACCCGCCGCTCCCGAGGATCTGCGCATTGTGCAAGGACCCATTGGTCAGTCGATTATCAAGGAGGGTGAGCCCA 
CGGCTTTGACCTGTCTGTATGAACTGCCCGACGAGCTGAAGAACCAGCGCATTCAACTCCGTTGGCGCAAAGATGGCAAACTTTTGCGTCAAGTG 
GAACTGGGTGGCTCGGCGCCCATTCCAGGTCACTCCTTCGATTCCGGCAAGGATGCTCTGTTGCGTGAGGACGCCCGTCTAGTGCTGCACAAGCA 
GAATGGCACCCTAAGCTTTGCCAGCATTATTGCCAGTGATGCGGGTCAGTACCAGTGTCAGTTGCAGCTAGAAGCTCATGCTCCGATTAACTCAT 
CTCCTGGCATCTTGGAGGTCATCGAGCAGCTAAAGTTTGTTCCTCAGCCCACGTCCAAGAATTTGGAGCTGGATGCAGTGGTGGCCAAGGTTCAC 
TGCAAGGCGCAGGGCACTCCAACGCCGCAGGTGCAATGGGTTAGGGATGGGGAAAATACCACTTTACCCGACCATGTAGAAGTGGATGCCAATGG 
AACACTGATTTTCAGAAACGTTAACTCGGAACATCGCGGCAACTACACTTGCTTGGCCACCAACAGCCAAGGACAGATAAATGCCACTGTGGCTA 
TAAATGTGGTGGTCACTCCCAAATTTAGTGTGCCCCCAGTGGGACCCATCGAGACTTCGGAGCAGGGCACCGTGGTGATGCATTGCCAGGCGATT 
GGGGACCCCAAGCCGACAATTCAGTGGGACAAGGATCTAAAGTATCTGAGTGAGAACAATACGGACCGCGAAAGATTTAGGTTCCTGGAGAACGG 
CACTCTGGAGATACGGAATGTCCAGGTGGAGGACGAGGGTAGCTACGGCTGCACAATTGGAAATAGTGCAGGACTCAAGCGAGAGGATGTCCAGT 
TGGTAGTGAAGACAACCGGTGACGGTTTCGCTCCAGAGGAATCCGGAGGTGATGGCTTCCTGGTCACCCGAGCTGTACTGATCACCATGACGGTG 
GCCCTGGCCTACATAGTGCTCGTGGTGGGCCTGATGCTGTGGTGTCGCTATCGCCGTCAAGCCAGAAAAGCCCGTTTAAACGATCTGAGCACAAA 
GGAAGCTGGCGGGGATCAGCCGGATGTTGCTGGCAATGGCAAAGGCTCCGAGCAGGAGCCCTGCCTGTCTAAGCAGCACAATGGCCACAGCAAAT 
CCAGATCCAAGTCCAGTGGAGATGCCCAGAAATCCGATGACACAGCCTGCAGCCAGCAGTCGAGGGCCTCAAAGAAATCGGCACATATTTACGAG 
CAACTGGCTCTGCCAAGATCCGGGCTCAGTGAACTCATACAAATTGGACGCGGCGAGTTTGGGGATGTGTTTGTGGGCAAACTTAAGGCTACCTT 
AGTCACCTCGCCCTCGGACAAGGATGCGGATACGGAAAAGCAGCACAGCAACAGTGAAAATGGCAGTGGAGGCAGCGGAAGTGGAAGCACCACGC 
TAAGTACTCTAAATGAGAAGCGTCGCTCCAAGACCTCCATGGACGACATTGAGGAGATCAAGGAGGAGGAGCAGGATCAGCACAATCAATCGGGT 
CTCGAGCAGCTCGTTCTGGTCAAAGCCCTGAATAAAGTGAAGGATGAACAGGCCTGCCAGGAGTTTCGACGACAGTTAGATCTGCTGCGCGCCAT 
CTCGCACAAGGGAGTAGTACGTCTGTTTGGCCTGTGTCGCGAAAAGGATCCGCACTACATGGTGCTGGAGTACACGGATTGGGGCGATCTCAAGC 
AGTTCCTGCTGGCCACCGCCGGAAAAGTGAACACTGCCACCGCGGGCAGCTCCTCACCGCCGCCACTCACCACCAGTCAGGTTTTGGCCGTCGCC 
TATCAAATTGCCCGAGGAATGGACGCCATCTACAGAGCTCGCTTCACCCATAGGGATCTGGCCACTCGCAACTGCGTAATTTCCAGTGAGTTTAT 
AGTGAAGGTATCCTATCCGGCTCTCTGCAAGGACAAGTATAGCCGCGAGTACCACAAACACCGCAACACGCTGCTCCCGATCCGCTGGCTGGCGC 
CCGAGTGCATCCAGGAAGACGAGTACACCACCAAGAGCGACATCTTTGCCTACGGTGTGGTGGTGTGGGAGCTCTTCAACCAGGCCACCAAGCTG 
CCCCACGAGGAGCTGACCAACGAGCAGGTGGTGCAGCGTTCCCAGGCGGGCTCCTTGGAGTGGTCAGTGGCCGAGGCGACGCCCGATAGCCTGCG 
AGAGATATTGCTGTCCTGTTGGGTGTCCAATCCGAAGGAGCGACCTTCGTTCAGCCAACTGGGAGCTGCTCTCAGCAAGGCGATGCAGAGTGCCG 
AGAAGTGAGGCCAGACTGATGGCTCGATCATGCCAAAAAGTAATTCGTAGTATTGTCTTAGTGTCTGCACGCCCGAGTTTTGCTATGCGCTCAGA 
AACCTAGTCTTAAGGATCGAAGGATAACTGCGCCTGTTTAACATGCGCATTGCGCACTTGGTGGCACACAGAATGAAGTTAAGCAGCTACTGTAC 
AAACACAATGTATGGAAATGGGGATATCTTCGTGAATAGTATTGCGTGGTGCTGCATATTTTTAGGTATGCAGCTAAAATGTACATAATAACTGC 
CACAAATATGCATTTAAACCAGTTGAAGCATTTCTAGGATTAGATCATTTCAACCTTCGTCAGCATTTAATAATCGACTTCTAAGCTAAATTAGT 
AAATAAAATAAAATATAAATACACAAGCATATGCATCACACAAAACAACGAGCGATTTCATTGTTTTCAAACATTGAGAAACTAAGCCAAATTTT 
AAGTTTAGCATTAAATATAAATAACACACACCACCCACATATTTTATCACAAGCCTAGTTATACACTTTTTAATGCGTAAGAAACAAACAGCCGA 
AACAATTTTATGACTAGCCATTTAAGCTTTGATGCATGTTTACAAAACATTTAAATAAACTCGAATAAAATAAATTTAATCAGAA 
(SEQ ID NO: 980) 

Start ATG: 897 

MTARMISICGLVMALMMASVLASSSRFQRVPQSQSVVENESVKFECESTDSYSELHYDWLHNGHRIAYDKRVHQIGSNLHIEAVRRTEDVGNYVC 
IATNLASGAREASPPAKLSVI YLESASVQLLGSNRNELLLKCHVEGASGDLEPLEIEWYRNSEKLSTWKNVQLDQHRLIIRQPGSEDDGLYRCTA 
SNAAGRVMSKQGYVYQSSVKCLPRLPRRKNEKMMESWDKQTFLCRGKRGGAAGLEALPAAPEDLRIVQGPIGQSIIKEGEPTALTCLYELPDELK 
NQRIQLRWRKDGKLLRQVELGGSAPIPGHSFDSGKDALLREDARLVLHKQNGTLSFASIIASDAGQYQCQLQLEAHAPINSSPGILEVIEQLKFV 
PQPTSKNLELDAVVAKVHCKAQGTPTPQVQWVRDGENTTLPDHVEVDAMGTLIFRKVNSEHRGNYTCLATNSQGQINATVAINVVVTPKFSVPPV 
GPIETSEQGTVVMHCQAIGDPKPTIQWDKDLKYLSENNTDRERFRFLENGTLEIRNVQVEDEGSYGCTIGNSAGLKREDVQLVVKTTGDGFAPEE 
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SGGDGFLVTRAVLITMTVALAYIVLVVGLMLWCRYRRQARKARLNDLSTKEAGGDQPDVAGNGKGSEQEPCLSKQHNGHSKSRSKSSGDAQKSDD 
TACSQQSRASKKSAHIYEQLALPRSGLSELIQIGRGEFGDVFVGKLKATLVTSPSDKDADTEKQHSMSENGSGGSGSGSTTLSTLNEKRRSKTSM 
DDIEEIKEEEQDQHNQSGLEQLVLVKALNKVKDEQACQEFRRQLDLLP^ISHKGVVRLFGLCP^KDPHYMVLEYTDWGDLKQFLLATAGKVNTAT 
AGSSSPPPLTTSQVLAVAYQIARGMDAIYRARFTHRDLATRNCVISSEFIVKVSYPALCKDKYSREYHKHRNTLLPIRWLAPECIQEDEYTTKSD 
IFAYGVVVWELFNQATKLPHEELTNEQVVQRSQAGSLEWSVAEATPDSLREILLSCWVSNPKERPSFSQLGAALSKAMQSAEK* 

(SEQ ID NO: 981) 

Name: off-track 
Classification: protein_kinase 
Gene Symbol: otk 
FlyBase ID: FBgn0004839 

Celera Sequence No. : 142000013384084 

TTTGCATGTGTTTCAAACTACATATAAATATTGGGAAATATTCAATGCTCTGCTCGGCGCTTTTGCTTTTATTTTGTATTGTTATGTGATTTCTG 
TCTTGGTTATTTTTATTCGATGCTCTTCCAAATCTATAGATGTGTAAGTTAATCTGATTGCTGAATCATCTTTATATCTTTTTTTCATCTCCCGA 
AGAAGAATCATCTTCTATCGTCCGCCTAAACGTATAAAATGCATGTAAATCTAAAATCTTATTTAATCAACTAACTCTGTCACGCCCATCCCTTC 
GTTATGTGTATATATATATATATAACTTCTATATATATATATATATAATGCTCTGATTATGATTTGCAGTAACGAAAAGTGACCAGAAGAAAGAG 
TATTAATATCTAGTTCTTTTTCCTTCCGCATAGGTTTATATTTATATTTATATATATATATATATTTATATATATATGTATTTATGTATATATAT 
GCATATACGTAAGTGTAAAGTTAAGTGTACTCGATTCGGTCTTCGGTGTTATATATGGATATTATCTTCATAAAAGCCGCGTGCCGTCAGTGGCA 
TCGCCAGACGGCCATGCCCTCGCCGCCCATCCAGAGATCCGGGAATCACCGCCAGGCGGTCCGCAACCGTCTGTCCGCCTTGCCACGTCCAGCCG 
ACGTTAGATTGGCCCCGATCTCCAGGCTCCGCACATCCCGGCAGCCAGATGGTGAACCTCTCCTCGATCGAAGCGATAAAGGCGAGGCAGGCGCC 

T T AAAG ATC AC GG AT T ACCT GG AAAC C G AAGC CC GCC AGCT GG AC AGT T GT T GG AGT GGT T GGT GGC G GAT GT T GC AGGC GGAC AGGC G AC AAC G 
GCCGCCCGGTGGTATTCCCTCCTTCCAAAACGAAGAGGTGGACCCGGATTTGGAGAAGAAGGAGCACGGCGCAACGTACACGGTGACACCGACGA 
CGCACAGGCGAGAAAAAAAACTTAAAAACTAATTGATAATAGTAATAATAAGAGTTATAAAATAATTTTTAAGCATTACAGTAATGTTGTGTGTT 

GT TTTGTGTGTGGT GT T T T T C TT ATCG AC AAC GT C AT GC T G AC GAT T CG AT T G AAC AAC AAAAGG AT G AT AAAT GT AT AC AT GT AAAT AT GAAT G 
ATAGATTGGATCAATGTGTCGCTGGTATAGCTCCTTGGCGTGCACCTCTCCTACAACGGTACGTGAATGGTCTACTGTTCGGGAATGGACGACAG 
T GAT CGAATCTGGCGTTCCTGCT GAT CCACCTTAAGCACGCTCGCCACGGATGCGGCGCGCCAACT GGATGTCCTT AGGC ATGATTGTCACGCGC 
TTGGCGTGGATAGCGCACAAGTTGGTGTCCTCGAAGAGACCCACCAGGTACGCCTCAGATGCTTCCTGTAAGTCAAGAGAAATTCATATGAGTTG 
GATAACCAAGTTCCTTTTAGATATTAGTAGCCCCCCCTTCATAATCCCATGAATCCTGTCATTTTTAGTTAAGTTAAGCCACTCTGATTCTTTGG 
AGCATAATTCTCACCTGCAAGGCACCGATGGCAGCCGACTGGAAACGCAGATCGGTCTTGAAATCCTGAGCGATTTCACGAACCAGACGCTGGAA 
GGGCAGCTTGCGGATGAGCAACTCGGTCGACTTCTGGTAACGACGGATCTCACGAAGAGCCACAGTTCCGGGACGATAACGATGGGGCTTCTTCA 
CTCCGCCGGTGGATGGCGCCGATTTACGGGCAGCCTTGGTGGCTAGCTGCTTACGAGGAGCCTTTCCTCCGGTCGACTTACGGGCAGTCTGCTTA 

G T AC GAGC C AT GT TTTCTTAGCTTTTTTTTT T AAAC T GCAATTAAG AG AAT GGG AAAT GC AAAT GT TAG AT T T T GC G C GT T C C AG AT T GAC AC AG 
ATTAAATAAACACTTCTACAATCATTGTTTGGTTGGTACCCAAAATAAACACGCTGATCCTCGCCAGATGAGAAAGACAACTCGATGAACTGGGC 
ACAAACTCTAAACTATTCGTCCAGCACAGGCGCAGTGCACGATGCAATTGGCCAGGATACGATTGTGCTGTGTCGTGGACTCTTGACGGATATAC 
ACGGATGGATATCGCTGGGAGCCAAAGTTTGGACCGATGCACAGCGCCGAAGCGCTGGCGGCCAACTGACGAGGAACTTTGGTTCGTGGTCATAA 
CGATAACGAGGGTTGAAGGTGAAATTTTTTTGAGGCAGCTGCCTGCTTTTCCTTCTTGCCCCCCATTTCAAATATTTTCCCATCTTCTGGGAACG 
AGGCACTCGAGAAACTTCCACTTCAGTGGCAAAGGAATAAATCGTGCCTCGGCAGCAAAACACTCAGAAAAATAATAGTGTGATCCATATTTATA 
TTGGTGTGCAAATGAAGCTTAGCTAAGATGATTCTAAAGTTTTCACATTACTTATAATTAAAAAAACTTAAGAAACATTTCTGATAATTCATAAA 
GGTGGTCCATAAAAAAACCCTAATTTATCCGTTCTAAATAGCAATAGGAATACCATCTAATTCTCAATTGGTGGTATAGATATATTACATCTGCT 
TTTCTCGCTGTGTAGTTGTTCCACGATGGAGGACCACTTGAAAATCTCATTTCTTTTCTTCGCGGGCAAACATCGTGAGTCGCCGTTTTTTGTGT 
TCCCCTTCTTTTTATGTACAGCCTCTTGAGTTTTTTTCGCGCTTTATGCGGAATCAATCACGCTGAAGTAATGAGTTGTAGTTTTCAATTTATAA 
TTACTTGTGCAGGCAAGAGGAAAATATCAAATAAAAGGAAGCCAAAGAAAACGAAAAAAATTCTTCGACTGGTAGCGCCAAAAGAGTAAACATAA 
ACAAAAGGAAGGAAAATCAAGCGGTAAAAACTTGGCGCGCATTGAAAGGCGGCAAGTCGAAGGGAGATGCCTATAGGGGCACGTTCGCGTTAGAA 

AG AG AAG AAAG GT GT GC C AG C AGT T AAT TGCGTCGTTTCTCCG C T AAAC AG CAT GT GT GT GCGT GT AGAT GT G AC AGC T GC AC AT C CT C AC AAAA 
GAGAATGCCAAAGGAAAAAGAAAATAAAAAGCAATTATGCAATAACATACGGGAAAAGCTGGCGGTGGGAAAGAGAGATGGCACACACTGCGCAT 
GACAAGCGCAGCCATGCACATTTGCATTTGTTAACCAATGTGCGTGTTTAGTATACACACACATCAACGCGCGTGCCACTTAAGAAATAAATAAA 
TAACTTTTATATGTTTATGCCAACCAGCATAAATACGATTACTCATACTTTTTGGCGCCAGTTTTCGCACATTGGCAAATCGTTTTTATTTATTT 
TTTTAATTTTCTTTTTGCCAATTTCATCAACGACTGCAACTTTTTTATTTCTTGCTATTCTCCTGCTTGGCGGCCATTTTGGAAAAAAAAAAAAA 
GCAGCGCACCTCACCATTACAACTAAAACAACAATGCAACCAACAGTCGTGCGAATACGAGACCTCTCACATCTTCGTGTATTTGTGTATGTATC 
CACCAGCGAGAATGCGCCTGTGTGTGTGCGGTGAAAAATAGGGCTCCGCTGGACGGCCGGCGGAGCCCCAAAAACGGCAAAACATAAACTCAGGC 
GATTCACAACGAAAAAAAAATTTGGAAAACAGTGCACACACGCACATATTCAGCGGCCACAAGCACGTGCGTTTTAAGAACCACACTTAGTATTA 
TAAGTATTTAGCAATAAAAAAAAACTCACCGTTAAACACGGAATAAAGTTATTCTTCACAATTTGTCACTTTCAACACGAGACGAACAAATTCGG 
ACGAGAGAAATTTTCACAATGATTTCGTTTCGTTGAGATCTAGAGATGGTAGCCTAGTAGGCGAAGGCCTACTTCGATTGTATTCGATTACACTC 
GATTGTCTGACAAGACCATGCCCGCCAAATAGTTTGGTTCTGTTCTTTGCTAATTACATTTTTATAACTTTTTAAAATTTTTTTAACTCCAACCT 
AAGATTAAAAATGAATATGTTTTAATGGGTTGGACTCTGCTAATTTTAATGAAATATAAGCTTCATTAAGCAAAAAAAAAGCCAAGTTCTTTTAT 
GGATTGTGGGTTTCTCTCTGCAGAGAACTTTATTTTAGTGCATTGTGTGTTACATTTGAGTTGAACGAACGTGCATACTACTGCGACTTCTTGTC 
CTCCTTGGCGGCCCTGCCCACGGGCTCATCCTTGAAGTGAAGGAACACGAATGAGAAGACAAACACGCCGATCATCATTGAGAAGGCGCTGGTGT 
AGTAGGGGAATGCGCTCGGGATAAAGCGCTCGTACTGTGTGTGCTCCAGCGGACGCACCGACACCTGGGTGGTGCTGTACAGATGCGTGTAGCCC 
ACGCGATTGTAGTCCACCTTGAACTGATAAACGCCGTAGACGTCGGGTATCTTGAATTTGGCCTGGTAGGCGCCCGTGTTCGTTTGCTTCAGATA 
GGTGCGGACGAAGGGATCGATGCGCACGAACTCCAGTTGAATGTCGCTGGCCTTAAAGGCGCGCCACTCGCCCTGCACCAGCTCCTCGATGCCGA 
TGGTGTAGACGACCGGATCGGTGATGGTGTACGCCTGATCCGGTGGCAGCAGCTCGCCCTCCTTGTGATGTTGCACGGATGCCACGCGCAAGCGG 
CCAGTCTCTCCAAATACCCACTTGCTAATGGACTCGGCCACGTCGCGATTACCGGCCAGCTTGTGGAACACACCGCTCTGTGCGTACTGCACGGC 
CGTTGTGAAGCTCTCGTCAGAGAAGAAGAGCAGCGAGCCGGAGAACACTACGCGGGCATTGTTCCTCGCCTGCAGGGCGGCGATTAGCA 

(SEQ ID NO: 982) 

Exon: 3839. .3735 

Exon: 1840.. 1535 

Exon: 1395. .1001 

Start ATG: 1816 (Reverse strand: CAT) 
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Transcript No. : CT25818 

ATCTCAACGAAACGAAATCATTGTGAAAATTTCTCTCGTCCGAATTTGTTCGTCTCGTGTTGAAAGTGACAAATTGTGAAGAATAACTTTATTCC 
GTGTTTAACGTTTAAAAAAAAAAGCTAAGAAAACATGGCTCGTACTAAGCAGACTGCCCGTAAGTCGACCGGAGGAAAGGCTCCTCGTAAGCAGC 
TAGCCACCAAGGCTGCCCGTAAATCGGCGCCATCCACCGGCGGAGTGAAGAAGCCCCATCGTTATCGTCCCGGAACTGTGGCTCTTCGTGAGATC 
CGTCGTTACCAGAAGTCGACCGAGTTGCTCATCCGCAAGCTGCCCTTCCAGCGTCTGGTTCGTGAAATCGCTCAGGATTTCAAGACCGATCTGCG 
TTTCCAGTCGGCTGCCATCGGTGCCTTGCAGGAAGCATCTGAGGCGTACCTGGTGGGTCTCTTCGAGGACACCAACTTGTGCGCTATCCACGCCA 
AGCGCGTGACAATCATGCCTAAGGACATCCAGTTGGCGCGCCGCATCCGTGGCGAGCGTGCTTAAGGTGGATCAGCAGGAACGCCAGATTCGATC 
ACTGTCGTCCATTCCCGAACAGTAGACCATTCACGTACCGTTGTAGGAGAGGTGCACGCCAAGGAGCTATACCAGCGACACATTGATCCAATCTA 
TCATTCATATTTACATGTATACATTTATCATCCTTTTGTTGTTCAATCGAATCGTCAGCATGACGTTGTCGATAAGAAAAACACCACACACAAAA 

C AAC AC AC AAC AT T ACT GT AAT G C T T AAAAAT T AT T T T AT AAC TC T 
(SEQ ID NO: 983) 

Start ATG: 130 (Reverse strand: CAT) 

MART KQT ARK S T GGK AP RKQL AT K AARK SAPS TGG VKK PHR YRP GT V AL RE I RR YQK S T E LL I RK£ P FQRL VRE I A QD F KT DL RFQS AAI G AL QE 
ASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA* 
(SEQ ID NO: 984) 

Name: Histone H3.3B 
Classification: DNA__binding 
Gene Symbol: His3.3B 
FlyBase ID: FBgn0004828 



Celera Sequence No. : 142000013385192 

TTTCCGCTGCCATCTGCAGCTGTAGCTGCCTAACAAATGCAATTAAAATCTTTGCGCAAGAAAATTGTTGCAGCAGGCACGCAGAGATACTCGTA 
CTGTATGTGCTGCAATAGTTTGGTGGCAGCAACGGGGCGGCAGATAACATATTGCCCACGTCCGGCTGAAATTGTACGGACAAGGCGAACTCAAA 
AAAAAAAAAAAAAAAAAGGAATCCCTCAAAAGGAGAACCAAGAAAAATGGGATTTATCTCTGGCCAAGCTCCAGCCATACAAATCTAATTTAGAC 
CACACGACGGCTTCAGATATTTTATATTCCCCACTCAGTTGCGGTCCATCCTGATCTCCTCCAGCTTGCTTGATGAGCCCAGCCGTTGGCCAAAT 
GAAATTATTTATGGGCAGTCCTAGGTCAGTGAGAATTTGGTTTTTTAAGGGTTTGAACTTTGCGGCTCAACAATGCCTGTGATTTTTTTTTTTTT 
TATTATTTTTTTGCATTGCATCCATTAGAGAAAGCACTCCTTTGGCATTCTACTCAGCTACTTGCCACATAAATATCTACGGAGGGTAGCTAACT 
AGTCCTTATCTCAACTTCCCCATTTATCCTCGCCGCAAGTGCACTTTTCCTGCTGCTAATTAAGTTGCAAGGCAGCTTTCTCAGCAGGCGTTGCA 
CGAAGTAGATAAAAGGAAAAGCCCCCGATTTGCAGACACTATGCTGGGAAAGATATATATATATATATATTTTGCCAGCATTCGCTAATCGGCTT 
AGGGCAGCCACCATGCGACCACAACAACAACAACAACAGCAGCAGCAGTGTGCTCCATGCAATATTTAATCACATTAGTTGATGGGGTAACGGCC 
ATGTACGCAACAATTTACAACTTGCATTAATTATACGTCTAGTTGGACAATGGCGCCTCTAACTATCCCACTCTAAATGTATCTCTATCTCTATC 
CTTATTTGCAGGGGACACCCGTTGATAATAAATTATTAATGCATCTGCTGGGAAGGCGGCACTGCACGGCTGTCCTTGGCAGCAAATCCAGTTGC 
AACATCGGTGAGTGACGGGCGGCAATCACAAAGGACTTGCAACAGTTGCCGTTGCTTCTGCTATTGCTGTTGCTGTTGCTGTTGCAACAATGGCC 
AGTGGCAACAATGAAACTGAGCCGCTCTACTGCGGCAGCGGCATGGATAATTTTCATACAAGGTAAGCGACTTTCACCAATGTCACGATCAAGGC 
CAAGGGGCTTTTCCACCAGACATGGCGACAACAATACCATAAGCCAACAGTTGGGGGCGTGGTCGGGGGCGGAGGCGATGGTGGCAACATCATAG 
CCATTAGCTGCTGGCGCAAGGGAACCGTTCAAAAATCGATTATCGCCCCATTTCGGGGGAGCTTCTATTTTGATTTGCCGTACAATTTTCTCGGG 
CGATTAAACGACGAAGCAGAACGAAAACAAAAAACAGATTTGTCAACAGCAAGGTCAACAATTGATGGCTGAAATCAATTTAATTGACCATATCC 
TACGGGCCCTCCAAGTGGCCATCTGCTGCACCTATAAAAAAGTGAATCCGGTCTGCGATTATTTATATATTCGTTGCATGGCAGGCGGTCGTAAA 
ACCTCGAGATGATGATTAAAAGCGGCCCTAAAAACTTAATGGCGGTTTAGGAAATTCAATTCCTGTAATTTAAGCCGAGTCACCATTCTTCGAAG 
TTCTTACATGTAAGCGATAATAAATAGTTAAGTCAATTGGCCAATAAACCTATTAATATTGTGCATTTACCACGATTAGACTTTGATTAAAGTGA 
CAATGCTGATTTCTGTAGAGGAAATCTAGTTCTAGTCTTCCCACAAAGCTATTTAGTTACTCTTGAATAAATATGTTACTTTTCTTTTGCCAAAA 

CC AAC AG AAT T TT AAAT T T AAT AAT TT GG AT T T T T T GC AAT AAACT G T AC T G AT T AAT GG GC C AC AC AAAAAT GT C T AGT T T AT TAT G GAG C T C T 
TGGTTTCATAAATTAAGAACATAATCCAATCGGCATATAAATCATTGATAGCAATTTATTTTCCGTGATGAAACTGTGCTCCGTGTGAACTGCGA 
ATTACTCATTCTACGGTTGCAAAAAAAGCCACCAACGGTCAACATTTAGACCAGGACTTTTAGTTTTAATTAGAGCCAGCCTGGCCAACAGCAGT 

GT T AAT G AC C ACAAAGT G GC T GGC C AC AGG AT C AGC AT C CC AG AAT GC G ATG C CG C AT T T GCT T T AATT AAAG GT AG T AG C T G GAG T T T G AAAG A 
TGACTGTATGGCAATTAGATGTGTAGCCAGAACACTTGGCCATTTACTTTTGTGTCAAAGTCGTGCCAAATTGCCAGCGGAGGCGACACTTGACG 
CTGTCACGCCCCAGACAGACGCAGACCGGCCCAAAAGCACCCACTCAGCCGTCTCCAGGCGCCACTCAAGCGGCAAAGGAACGCCAAAACACTAG 
GACACAGAACGCCAGAAGACTCGAAAAAAAAGTATAAAATTCAAACGCGTATACAATTCAATGTGTCACATCGAAGAAGAACCCTTTCTCTGGCG 
GCAACGTGACGTATACGCAATATTAATGGCAAAGCGAAAAGGGCCTCAAAAGCTAACGAAAAACCGCAGCATAACACGCACAAAACAAGCAATTA 
ACATAAATAATAAAGAACGCACAAAGGCGTTCGGCTCTCCGTTGCCAAAGAGAAAAGCCAACGAAAGGAATAAAAAACGCCAAAAAGCCCGAAAA 
AAAAACCGAAGACCCAAAAAATGTGTAAAATTGATTTCGGAAAATATTTAATTTACCAAATGCATGCTCGCTTTTTGCGGCCGTGTACGCCCCGA 
ACAAATAAAACAAATAGGCACACATCGTCTGGCATTTCATGTTAAAGGCTTCTCATGCTGATGGACAGGATACAGAAAATATATATACGAAGTGA 
GGTGTGTGTCTACGTGCCACAAAAACAGCGAAATTGAAACGCTTTCCGAATGCCGGCGACACTTTTTGGCGGCACAACCCTCGTCTTGTCGACCA 
ACTCCTGGGGAATTTACAACTCCATTCATCAAACTCACAAAAACTTTGCCTGGTGTACGGTCTTGGGTTCGTCGTCGCAGGATTTCACCTGGCTA 
AAAATCCAGACTCCCAGCCATCCCTGAGTCTAATTTATGGCCTTTGATAATCCATGTTGATGAATGGCGGCAACGACAACAACGTCGACGACATG 
AATGAAGTCCTGGAATTGTTTTGCACCAGGATGGCATCGGGGCTCCAGGTGGGACGTACTGGCTCAAAGTTATTGGCCCAGAAATCAGGCATAGT 
TAGCTGCCGAAATGAAACCCAAATACCGAGAAAACTAGGCAAAACAAACAGTAGTACACCGGAAATGCATATCATTGTAAAAACTACATCAGTTT 
ACCTAAAAGGCTTGGCTTTTAAGCTTTCACATTTATAAAATATTGAAAATGCATATAAAAGTATGAAATTAATTCCCTTTTGTCAATAAACTTTC 
TTTCTTTCTTTCTGTGTAATATGGGGGATACCGGTTTTTTTTTTTTTTCAATGAAATCCCTTCGAAAGGTATAAGTTCAGAATCGAGAGTTTTAT 
GCCAAGTTGGGCACAGTTTTTTTTTTCCCCAGCTACCTAAAATAATAGAGACATTTTCCTCCCACTACAACTGATTGCATTGCCGGTGCAGAAAG 
TTTTTTCAGTTGGTTCGGAAAAATTTGGTTCGCAAACAAATTAATATGAACTGGCAAGCATTTTTCGGGCAAAAAGCTCTCATCTATGTAGATTG 
GAATGGAAATTCCGGCTAGAATTGCATAAGACCACCTGCAGTGTGGGCTAACATGACTAAAAAGTTGTCCACAAATTTGGCTTAGATTCTCCAAT 
AAAACTGTCGTTCGGCCAGGAATCCCCTTTTTTGTTTCGAGTGAATGGGGAATTTCGCACGACAGACAGCAATAAAGAATTTAACTAAAGTCCTG 
ACACCGACAGCACCAGCAGGACGCACACGTGTCACTCCATTTGGAGAGCTTGGAGTATATTAAACATTTTTTCCCCACCAGTCAGCCGCAGGACT 
TGCATCGGTCTCGCCTCGCATTTTCCTATATAAATTTTATGCTAAGTCTAATTTGTTGGCTGCAACTTGCACAAAGGCAAAAAATAAACAAGGGC 
GAAATGCCGAAAGCCAAAACCCAACCGAAACCGTTGAGGGCTGCCTCGCTTTTTTCCTGTGCCGAATTCCCTAAAACTTTGCACATAAATTTGAG 
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TCCTGCGCCTGGGCTTTTCCTCTTCCACCTTTTTTTTTAGCCTCAAAGCGCTCGACGAAAACTAAACAAAAGCTGGTAAAATGTTTATCCCAGAA 
GGGGGCGGTGTGGGCGTGGCTTGCTTAAGCCCTAGACACCCGAGAGTTTATTGCCTCAGTGTTGCAAACGCTGCTGCTTGGCAAGTCAACGGAAA 
TGGCTTTCGGATGGTTTTATTGCCTACTTTTGCTGCGAGAAAGCGGACTAGACATGGAAGTAACAAATTGGTTAAGGCTAACTAGAGCATAATAA 
CAGAAACATCAGAAACATAATTTGGGAAATGCCTGCCACTCACCGACAACATTTCGATTTGTGTCTTTATTACACTTCTCTGCCGTGATGGGTTA 
TAACTTTTGGATCCTTGCAGAGAGTTGGGAAATTGCTTAAAAATAAAATTTGTTTAATTTAATTTGAGGTGGTTTTTCAAACAATTTTAATGGAT 
TAATAAGTTTAGGAAGTTAAGTATATACATCTATCTATTATACCATCTTCAAATATTGTTGAATTGAAGAGATAACTCGGAGCTAATCCCATATT 
TTTCATTTTTTTCAGCTACAAGAACATGCATGGCTATGTTTCGCTGGTGGTCTGCATCCTGGGCACCATCGCGAATACCTTGAATATCATTGTGC 
TAACCCGACGGGAGATGCGCTCCCCCACGAATGCCATACTCACGGGTCTGGCCGTGGCCGACCTGGCAGTTATGCTGGAGTATATACCCTACACC 
ATACACGACTACATCCTGACGGACAGTTTGCCGCGGGAGGAGAAGCTCAGCTACAGCTGGGCCTGCTTCATCAAGTTCCATTCGATTTTCGCCCA 
GGTTCTGCACACCATTTCCATTTGGCTGACGGTGACCCTGGCTGTTTGGCGTTATATAGCGGTGGGTTATCCGCAAAAGAATCGCGTATGGTGCG 
GTATGAGAACCACCATAATAACGATAACCACCGCTTATGTGGTGTGTGTTCTGGTGGTGTCGCCGTCGCTCTATTTGATCACGGCTATAACCGAA 
TATGTCGATCAGTTGGATATGAATGGCAAAGTGATAAACTCCATTCCCATGACCCAGTACGTAATCGATTATCGTAATGAGTTACTGAGTGCCAG 
GACGGCTGCCCTGAATGCCACGCCCACCAGTGCACCACTGAACGAAACTGTGTGGTTAAATGCGAGCACCTTGCTGACATCGACAACCACCGCTG 
CACCACCCACGCCATCGCCAGTGGTGCGAAATGTTACTGTCTATAGGCTATACCACAGCGATTTGGCGTTGCACAATGCCTCGCTGCAAAATGCC 
ACATTTCTCATATACAGTGTAGTGATTAAGCTGATACCATGCATAGCACTCACCATTCTGTCGGTTCGATTGATCCTGGCCTTACTGGAGGCCAA 
GCGGCGGCGGAAGAAGCTCACCAGCAAGCCCGCCACTCCGGGTGCCAGTAATGGAACCAAATCACCGGCCAATGGTAAAGCAGCGGATAGGCCCC 
GGAAAAATAGCAAAACTCTGGAAAAGGAAAAGCAGACGGATCGCACCACGAGAATGCTGCTGGCGGTGCTACTCCTCTTTCTCATCACTGAATTT 
CCACAAGGGATTATGGGTCTGCTGAATGCTGTGCTCGGAGATGTCTTCTATCTGCAGTGCTACCTAAGACTGAGTAAGTAAATATGTACTATGCA 

T AAC GC C AC AGT AAAT T C C AC AG AT T C C CAT T T C ACT T T AAGAT GT AAT AAC C C AAT GC CTTTTGCCTTTT GC C AC C AG GT G AC CT G ATGG AT AT 
CTTGGCCCTGATCAACTCCAGCATCAACTTCATTTTGTACTGCTCCATGAGCAAGCAATTCCGCACCACGTTCACGCTGCTCTTTCGTCCAAAAT 
TCCTGGACAAGTGGCTGCCGGTGGCGCAGGACGAAATGGCAGCTGCTCGAGCTGAACGCTCTGCGGTGGCACCGGTCCTGGAAAAGGGACGACAG 
CAGCCGCAGGTGGTGATGGCCAGCACGACCACCAACATCACGCAGGTGACAAATCTGTAGCACAGGAGGAGTCGTGGTCGCCGAACTTTGCTCAG 
TCGCCTGCTGAGTGTCCTGAAACGCGGCAGAAGGCGCTCCTCCGGCGAAGGAGGAGGCGTGGGAGGAGGTGGTGCGCCGTTGGCCGGCAACGATG 
CGGTGGAACCAGCGTTCCAGGCCATCGTGGTGGTGGTGGACAAGGTGAGCGGTGCCACGGAGAATCAGCTGTACACCGCCGAGCAAGCTCGTATT 

GT G AC GT AGT AAC CC T GAT AGT AGT AC AT AT TAT AT GC CT AGT T AT C C T T GT AAAAAGT AAAC T AGT T GT GT AAG AAT CGC AG AT AACC AC AAGT 
TATGTGCATACTTTGGTGGGAAATCGTAAAACCCGTACAGTCATAATAATAAAATGCTGGTTCATCACTCATACGCACCGTTTGACTTCACTCTA 
TTCCACATTCACAGTCAAGTAGGAGCTGTAAAAATGCTGTGCATAAACGAGCTCGTATACGCTACAGCTGGCGTCACAATAATAGTTACCATTGG 
GTTGGCTTCACAACATAGATCAAAGTCAAAAAATATATAGTTTTGTAAAATGTTTCATAAATCTATCTTCATATTTAACTAACACAATTCAGAAA 
CTTTCGCTTGCATGGAAAATGTTTTATTTTCCCCACTTGACCCAATTGTGCGCAACTGTTCAGCTTTTCGGCATGAATCAAATCAAATATAGTAA 
ACTTTTGCATCTTTTCCACTTTGATCCCATGCGTTTTCATGTTGATTTCATTCGCATAAAGTATGCAGCACACGTACTCACTCAAAACTACGAAT 
TTTCGGCAACGATTTGAGCAGAATATTTGCCATTTGTTGTACAGATTATATATACACACACACACACTTATGTCCGTATCTTTAATAATGGCAAT 

T AAG T CC TAG AAAT T T T G AC C T GAT AT T G ATG C AG GC AT T GGAAAAAT AAGT G GC AGT T G C AGT GCG GT GC GTT GAT AAAT G AAAT ACT T T GC GA 
ATTCTATTATTGGAAATCGCAATTTGGGCTGTGAATGAAATTATAGTTTTCTTGAACGCAATTGAGAATTCAGTTCCGTTTCGTGGGTTACTGAA 
TTTATGGGAAATAATT TAAGGCAGTCACTGCAGCT GACCGAAT AATCCAATGAAACATTGCAT AAAC AGT TTGCAGTTTTTCCACACTTGCGAAA 

AT T GT AAT T GC AAAT C GCT T AAGC C C C AC G AC G AAT G AC AAAT GGC C AG C G AAC G CAT A 
(SEQ ID NO: 985) 

Exon: 1001.. 1202 
Exon: 4861. .5963 
Exon: 6065.. 6282 
Exon: 6331.. 6564 
Start ATG: 1135 

Transcript No * CT25824 

GGAAGGCGGCACTGCACGGCTGTCCTTGGCAGCAAATCCAGTTGCAACATCGGTGAGTGACGGGCGGCAATCACAAAGGACTTGCAACAGTTGCC 
GTTGCTTCTGCTATTGCTGTTGCTGTTGCTGTTGCAACAATGGCCAGTGGCAACAATGAAACTGAGCCGCTCTACTGCGGCAGCGGCATGGATAA 
TTTTCATACAAGCTACAAGAACATGCATGGCTATGTTTCGCTGGTGGTCTGCATCCTGGGCACCATCGCGAATACCTTGAATATCATTGTGCTAA 
CCCGACGGGAGATGCGCTCCCCCACGAATGCCATACTCACGGGTCTGGCCGTGGCCGACCTGGCAGTTATGCTGGAGTATATACCCTACACCATA 
CACGACTACATCCTGACGGACAGTTTGCCGCGGGAGGAGAAGCTCAGCTACAGCTGGGCCTGCTTCATCAAGTTCCATTCGATTTTCGCCCAGGT 
TCTGCACACCATTTCCATTTGGCTGACGGTGACCCTGGCTGTTTGGCGTTATATAGCGGTGGGTTATCCGCAAAAGAATCGCGTATGGTGCGGTA 
TGAGAACCACCATAATAACGATAACCACCGCTTATGTGGTGTGTGTTCTGGTGGTGTCGCCGTCGCTCTATTTGATCACGGCTATAACCGAATAT 
GTCGATCAGTTGGATATGAATGGCAAAGTGATAAACTCCATTCCCATGACCCAGTACGTAATCGATTATCGTAATGAGTTACTGAGTGCCAGGAC 
GGCTGCCCTGAATGCCACGCCCACCAGTGCACCACTGAACGAAACTGTGTGGTTAAATGCGAGCACCTTGCTGACATCGACAACCACCGCTGCAC 
CACCCACGCCATCGCCAGTGGTGCGAAATGTTACTGTCTATAGGCTATACCACAGCGATTTGGCGTTGCACAATGCCTCGCTGCAAAATGCCACA 
TTTCTCATATACAGTGTAGTGATTAAGCTGATACCATGCATAGCACTCACCATTCTGTCGGTTCGATTGATCCTGGCCTTACTGGAGGCCAAGCG 
GCGGCGGAAGAAGCTCACCAGCAAGCCCGCCACTCCGGGTGCCAGTAATGGAACCAAATCACCGGCCAATGGTAAAGCAGCGGATAGGCCCCGGA 
AAAATAGCAAAACTCTGGAAAAGGAAAAGCAGACGGATCGCACCACGAGAATGCTGCTGGCGGTGCTACTCCTCTTTCTCATCACTGAATTTCCA 
CAAGGGATTATGGGTCTGCTGAATGCTGTGCTCGGAGATGTCTTCTATCTGCAGTGCTACCTAAGACTGAGTGACCTGATGGATATCTTGGCCCT 
GATCAACTCCAGCATCAACTTCATTTTGTACTGCTCCATGAGCAAGCAATTCCGCACCACGTTCACGCTGCTCTTTCGTCCAAAATTCCTGGACA 
AGTGGCTGCCGGTGGCGCAGGACGAAATGGCAGCTGCTCGAGCTGAACGCTCTGCGGTGGCACCGGTCCTGGAAAAGGGACGACAGCAGCCGCAG 
GTGCACAGGAGGAGTCGTGGTCGCCGAACTTTGCTCAGTCGCCTGCTGAGTGTCCTGAAACGCGGCAGAAGGCGCTCCTCCGGCGAAGGAGGAGG 
CGTGGGAGGAGGTGGTGCGCCGTTGGCCGGCAACGATGCGGTGGAACCAGCGTTCCAGGCCATCGTGGTGGTGGTGGACAAGGTGAGCGGTGCCA 

CGGAGAATCAGCTGTACACCGCCGAGCAAGCTCGTATTGTGACGTAG 
(SEQ ID NO: 986) 

Start ATG: 135 

MASGNNETEPLYCGSGMDNFHTSYKNMHGYVSLVVCILGTIANTLNIIVLTRREMRSPTNAILTGLAVADLAVMLEYIPYTIHDYILTDSLPREE 
KLSYSWACFIKFHSIFAQVLHTISIWLTVTLAVWRYIAVGYPQKNRVWCGMRTTIITITTAYVVCVLVVSPSLYLITAITEYVDQLDMNGKVINS 
IPMTQYVIDYRNELLSARTAALNATPTSAPLNETVWLNASTLLTSTTTAAPPTPSPVVRNVTVYRLYHSDLALHNASLQNATFLIYSVVIKLIPC 
IALTILSVRLILALLEAKRRRKKLTSKPATPGASNGTKSPANGKAADRPRKNSKTLEKEKQTDRTTRMLLAVLLLFLITEFPQGIMGLLNAVLGD 
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VFYLQCYLRLSDLMDILALIWSSINFILYCSMSKQFRTTFTLLFRPKFLDKWLPVAQDEMAAARAERSAVAPVLEKGRQQPQVHRRSRGRRTLLS 
RLLSVLKRGRRRSSGEGGGVGGGGAPLAGNDAVEPAFQAIVVVVDKVSGATENQLYTAEQARIVT* 
(SEQ ID NO: 987) 

Name: G-protein coupled receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384084 

ATGCAATAACATACGGGAAAAGCTGGCGGTGGGAAAGAGAGATGGCACACACTGCGCATGACAAGCGCAGCCATGCACATTTGCATTTGTTAACC 
AATGTGCGTGTTTAGTATACACACACATCAACGCGCGTGCCACTTAAGAAATAAATAAATAACTTTTATATGTTTATGCCAACCAGCATAAATAC 
GATTACTCATACTTTTTGGCGCCAGTTTTCGCACATTGGCAAATCGTTTTTATTTATTTTTTTAATTTTCTTTTTGCCAATTTCATCAACGACTG 
CAACTTTTTTATTTCTTGCTATTCTCCTGCTTGGCGGCCATTTTGGAAAAAAAAAAAAAGCAGCGCACCTCACCATTACAACTAAAACAACAATG 
CAACCAACAGTCGTGCGAATACGAGACCTCTCACATCTTCGTGTATTTGTGTATGTATCCACCAGCGAGAATGCGCCTGTGTGTGTGCGGTGAAA 
AATAGGGCTCCGCTGGACGGCCGGCGGAGCCCCAAAAACGGCAAAACATAAACTCAGGCGATTCACAACGAAAAAAAAATTTGGAAAACAGTGCA 
CACACGCACATATTCAGCGGCCACAAGCACGTGCGTTTTAAGAACCACACTTAGTATTATAAGTATTTAGCAATAAAAAAAAACTCACCGTTAAA 
CACGGAATAAAGTTATTCTTCACAATTTGTCACTTTCAACACGAGACGAACAAATTCGGACGAGAGAAATTTTCACAATGATTTCGTTTCGTTGA 
GATCTAGAGATGGTAGCCTAGTAGGCGAAGGCCTACTTCGATTGTATTCGATTACACTCGATTGTCTGACAAGACCATGCCCGCCAAATAGTTTG 
GTTCTGTTCTTTGCTAATTACATTTTTATAACTTTTTAAAATTTTTTTAACTCCAACCTAAGATTAAAAATGAATATGTTTTAATGGGTTGGACT 
CTGCTAATTTTAATGAAATATAAGCTTCATTAAGCAAAAAAAAAGCCAAGTTCTTTTATGGATTGTGGGTTTCTCTCTGCAGAGAACTTTATTTT 
AGTGCATTGTGTGTTACATTTGAGTTGAACGAACGTGCATACTACTGCGACTTCTTGTCCTCCTTGGCGGCCCTGCCCACGGGCTCATCCTTGAA 
GTGAAGGAACACGAATGAGAAGACAAACACGCCGATCATCATTGAGAAGGCGCTGGTGTAGTAGGGGAATGCGCTCGGGATAAAGCGCTCGTACT 
GTGTGTGCTCCAGCGGACGCACCGACACCTGGGTGGTGCTGTACAGATGCGTGTAGCCCACGCGATTGTAGTCCACCTTGAACTGATAAACGCCG 
TAGACGTCGGGTATCTTGAATTTGGCCTGGTAGGCGCCCGTGTTCGTTTGCTTCAGATAGGTGCGGACGAAGGGATCGATGCGCACGAACTCCAG 
TTGAATGTCGCTGGCCTTAAAGGCGCGCCACTCGCCCTGCACCAGCTCCTCGATGCCGATGGTGTAGACGACCGGATCGGTGATGGTGTACGCCT 
GATCCGGTGGCAGCAGCTCGCCCTCCTTGTGATGTTGCACGGATGCCACGCGCAAGCGGCCAGTCTCTCCAAATACCCACTTGCTAATGGACTCG 
GCCACGTCGCGATTACCGGCCAGCTTGTGGAACACACCGCTCTGTGCGTACTGCACGGCCGTTGTGAAGCTCTCGTCAGAGAAGAAGAGCAGCGA 
GCCGGAGAACACTACGCGGGCATTGTTCCTCGCCTGCAGGGCGGCGATTAGCAGCGTGCCTCGGCCAACGGCATGCGGATAGTCGGAGACACTTG 
CCTCGGGATTGTAGCTATAGGCGGTGCTCTCGGCGGTCAGCAGCTTGAGTACCAGCGGGTTCTCCTTGTCGGCAATCAGTCCTGTACCGCGATAG 
AGCAGCGGTGCGGCATCCGCCTGCCTGTTGGCCTTGCCCACAATGGTGTCGGCCTGTATAAGATTCTTGGCGGAGGTCAGGATGGTTGTGTGCTC 
GCCAGCATCGCTAACATCGTAGTGCAGATGGTCAATGACCGCCGCATTCTCCTCGTCCAGCTCGAAGCCGCACTCGGAGGCGAATTCCCTTAGCG 
CGTCGCCGGATTTCTCGCTGCCCGCCACCAGAACATTGCCGCCATCGTCGACGAACTGGGCCAAACGCTCCACGCTCACATCGCCGCCGAATTCC 

T C G AC AC T T GGT GC G AAT AT GAT G AC GT T C TT GT AC AGAT ATT C AC CGT AT T T GG AC AGC AGC AGGC T G GAGT C AT C CGC C AGTT T G T AG GT G AG 
CTTAAATCCGCGATCTGTAGCCCAAAGTTAAACAATTATCAACACATTTCATTGTCCACGTCAGAGGTTAACCGGCATTTTCACTCACCCTGCAG 
CGATTTGAAGAAGATCGAGTGGGTTTCCCGGATGGCCAGGTTGTCCAGCAGAACCAAGGTGTTGGCGTCCGTTTCCAGTACCGCCTGGCAGTGGG 
CAATCGCCAAAACGGCGATCAGGAGAGCCTTCCACATCATTTTATCCAACTAAAACACAAAATTCTCTCCGGCTTTTCTTGAAAAAAAAAAATGA 
CTGGCTGTCAGTGTGGCCGAATACTGGTAGGGTGTGCCAAAATATATACAAATATACCAGAAGGAATTCAAGAAATTATAAAAAATCGATGATTT 
GATTAATAATCGATTACATTTACTAGCCAATTTATAAACAAAACATTTTAATTCAAATTTTAGATTGTGAATAACATTCCTTAGCAACAACCATA 
ATGGACATATTCGAGAGCTTCTGTCGCACGTGCGGAAACGAATGCCTGGAATCGCTGTCCATTTACAATGAGTGTGCCCAAGTGCTTGACCAGAT 
GGTGCCCATTGCGGATATGCTAGCGTCCTGCCTGCCCGCCTCCTTGCCGCCACTGGACCCAGAGGACGACTATCCCAAACAGATCTGCCGCATCT 
GCGTGAAGAAGCTGTCGATGGCCTACGAGTTCAGCCACCAGTGGCTGGGCGCCCACGGCGAGTTCAACGTGGCCCTAAAGTTCGAGCAGCGCCGA 
AGGCGCAGCCAGGCCAGCAAATCCCAGTCGCACACGCAAACCCAAACTCATCCGGTGGAACCGAAGAATGAGCAACTGCCCACGGTGCTGGCGGA 
AGCAGTGTCGACCAAAAGAGGTGGGTGTACAATTTTCTTATCTAGGCTAGGTAGATTTCAAACTTCTCAATCTTCCCCAGCAGCTGCTACTCCCA 
CCAACGAGTTCAGCAGCGACTCCGGACCGGCCTTCAAGTGCGGCTTCTGCGGCGAATGTTTTTACACGGAGAAGGCCTGCAAATTTCACTTAAAG 
TTCTCGCACAAGGATCTTTAGAGAACTGTCTGTTTTTTCCCTTACTTGTAATTTGGTTTTAAGTGTATAAATCTTTTGTGAAATTAATTTTAAAT 
TAGGAATATATATATATATTCCTTCTCTTTATACAAGTGCCTTCTTTGAGTTTTTAAACCCAAAACGTTGCAATTGTAGGGATATAGAGTGCATT 
TGTGTAAGTATGAAACACAAAAAAAACCTGGAAAACTACTACCCGCTATAGTTGAACCACTGAA 
{SEQ ID NO: 988) 

Exon: 2579.. 2369 
Exon: 2294 . . 1001 

Start ATG: 2510 {Reverse strand: CAT) 
Transcript No. : CT25930 

ACACTGACAGCCAGTCATTTTTTTTTTTCAAGAAAAGCCGGAGAGAATTTTGTGTTTTAGTTGGATAAAATGATGTGGAAGGCTCTCCTGATCGC 
CGTTTTGGCGATTGCCCACTGCCAGGCGGTACTGGAAACGGACGCCAACACCTTGGTTCTGCTGGACAACCTGGCCATCCGGGAAACCCACTCGA 
TCTTCTTCAAATCGCTGCAGGATCGCGGATTTAAGCTCACCTACAAACTGGCGGATGACTCCAGCCTGCTGCTGTCCAAATACGGTGAATATCTG 
TACAAGAACGTCATCATATTCGCACCAAGTGTCGAGGAATTCGGCGGCGATGTGAGCGTGGAGCGTTTGGCCCAGTTCGTCGACGATGGCGGCAA 
TGTTCTGGTGGCGGGCAGCGAGAAATCCGGCGACGCGCTAAGGGAATTCGCCTCCGAGTGCGGCTTCGAGCTGGACGAGGAGAATGCGGCGGTCA 
TTGACCATCTGCACTACGATGTTAGCGATGCTGGCGAGCACACAACCATCCTGACCTCCGCCAAGAATCTTATACAGGCCGACACCATTGTGGGC 
AAGGCCAACAGGCAGGCGGATGCCGCACCGCTGCTCTATCGCGGTACAGGACTGATTGCCGACAAGGAGAACCCGCTGGTACTCAAGCTGCTGAC 
CGCCGAGAGCACCGCCTATAGCTACAATCCCGAGGCAAGTGTCTCCGACTATCCGCATGCCGTTGGCCGAGGCACGCTGCTAATCGCCGCCCTGC 
AGGCGAGGAACAATGCCCGCGTAGTGTTCTCCGGCTCGCTGCTCTTCTTCTCTGACGAGAGCTTCACAACGGCCGTGCAGTACGCACAGAGCGGT 
GTGTTCCACAAGCTGGCCGGTAATCGCGACGTGGCCGAGTCCATTAGCAAGTGGGTATTTGGAGAGACTGGCCGCTTGCGCGTGGCATCCGTGCA 
ACATCACAAGGAGGGCGAGCTGCTGCCACCGGATCAGGCGTACACCATCACCGATCCGGTCGTCTACACCATCGGCATCGAGGAGCTGGTGCAGG 
GCGAGTGGCGCGCCTTTAAGGCCAGCGACATTCAACTGGAGTTCGTGCGCATCGATCCCTTCGTCCGCACCTATCTGAAGCAAACGAACACGGGC 
GCCTACCAGGCCAAATTCAAGATACCCGACGTCTACGGCGTTTATCAGTTCAAGGTGGACTACAATCGCGTGGGCTACACGCATCTGTACAGCAC 
CACCCAGGTGTCGGTGCGTCCGCTGGAGCACACACAGTACGAGCGCTTTATCCCGAGCGCATTCCCCTACTACACCAGCGCCTTCTCAATGATGA 
TCGGCGTGTTTGTCTTCTCATTCGTGTTCCTTCACTTCAAGGATGAGCCCGTGGGCAGGGCCGCCAAGGAGGACAAGAAGTCGCAGTAGTATGCA 
CGTTCGTTCAACTCAAATGTAACACACAATGCACTAAAATAAAGTTCTCTGCAGAGAGAAACCCACAATCCATAAAAGAA 

{SEQ ID NO: 989) 
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Start ATG: 70 (Reverse strand: CAT) 



MMWKALLIAVLAIAHCQAVLETDANTLVLLDNLAIRETHSIFFKSLQDRGFKLTYKLADDSSLLLSKYGEYLYKNVIIFAPSVEEFGGDVSVERL 
AQFVDDGGNVLVAGSEKSGDALREFASECGFELDEENAAVIDHLHYDVSDAGEHTTILTSAKNLIQADTIVGKANRQADAAPLLYRGTGLIADKE 
NPLVLKLLTAESTAYSYNPEASVSDYPHAVGRGTLLIAALQARNNARVVFSGSLLFFSDESFTTAVQYAQSGVFHKLAGNRDVAESISKWVFGET 
GRLRVASVQHHKEGELLPPDQAYTITDPVVYTIGIEELVQGEWRAFKASDIQLEFVRIDPFVRTYLKQTNTGAYQAKFKIPDVYGVYQFKVDYNR 
VGYTHLYSTTQVSVRPLEHTQYERFIPSAFPYYTSAFSMMIGVFVFSFVFLHFKDEPVGRAAKEDKKSQ* 
(SEQ ID NO: 990) 

Classification: enzyme 
Gene Symbol: Ost48 
FlyBase ID: FBgn0014868 



Celera Sequence No. : 142000013384683 

G AAAT GCAGTCGGTAC G AAAT T AC T GT GAC T GG AACGGC C TGGACTTTATCTATT G AC T T T AAAT T AAAT AG AT AC T T AC AT AC AT AT TT T AAAT 

GAAGTCAGCCCAACGTAAATGTATTGCAAAAGGAATCCCCTCAAAGACGTTTTAGACAAGGAACTTTGGGTATTTGCCTAAGTACAAATTAAGCC 

ACCAATTTGATCAAAGTTCACTTTGATGTTTTTAATATCAGCGCAATAGCGATCTAAAGCAGAATCCTCCAAAGTGATGGATTGCATTCGGTACT 

CGATCATAGCTGAGCAATTTTTCCTTAAAACATTACAATTTTGTATAAATATAATTTAATAAGAATTTTAGTCTTTTAATCAAAGTAAGACTACA 

AAAATTTAACCTGTTGGTGAAGAATTCTATTTGTCATTATATTTTTATTTATAAATTTAATTTAATTTATTAATATTTAACTTTAATATAATTGT 

TTTTTATTTAAAATTCGATATTTTCAAATTATATATTTGAAATTGAAATCGCTAGAAATTGAATTGATATGGGAGGAAATTAAGTCTACATATCT 

CAATGCCCCTATAATTAGGAAAGATTTATTTTGACAGGACTGACATCATTAGTAATAACTTTATTCCAAATAAAATGAATCCCATATTTTTATAT 

TTTTTATGTATACGCACTTATTTTTAATACTTGTATATACAAATTTTATATGAATTTTATAACATTTACGTATACGTACTAATTGCTATCATATT 

CGAACGACAAACATTCCCGGAGAAATCTTAACATATAGATGTCATTAGGAAACTACAAAAAACCGCTTTCTAAAACTGTATTTTTAGATATAACA 

AGTAAAAAATCGGAAAAATTTACCACGGGCTACTAGCTATTTTGTAGCTATTCTCGACTGAGCGTTGCCATCCCTGGTCACGACCAGTGTGACCA 

TCGAACTGGTACGACTTCGGCAATTTTCGTAGCTGTCAAGTGACGTGACGTATGTATGGTGCTGTCGTGGCGTTCGCTTCCAGAGAGAAAGAAAG 

TCGCAGGTCGCCGAGTTGGATAGCCGTAGAGTTCCGCCTATTCCATTGATTACTAAATAAAAACCACCTCATAACTTGATCAAGATGTCCCGTCA 

ACTGTTCGCCATTTGCCTGATTGCCGCCGTTTGTGCGTCCGGCGTCTACGGTAGCTGCAAGGCCACCGTGAGCTCTTTCAGCACCCAGGATGCCA 

CGATTCTGACCCAGCTGGCCCATGTGGGCGAGTTCACGCTGCAGTGCAGCGGAGCCGCTCCCGCCAGCCTGTTCGCCGAGTTCCCCTGCGGCAAG 

GTGGTGCCGGTGGCCAAGGTGGGCGACAACAAGTACCAGGTGAGCCAGTAGTAGCCCCGTCAAGTTTTGTGCAAAGGCTAATCTCTGTTCTGCAC 

TAGGTGAGCTGGGTGGAGGATATCAAGCAGGGCAGCAGCGGCAATGTTCAGGTGCGTCTCTTCGACGAGGATGGCTATGCCAATGTGCGCAAGGC 

CCAGCGGGATGGCGACAAGGTTTCATCGGTGAAGTCGCTGCTCGACATTTCCGTGCCCACCAAGGGTGCCTACAAGGGTCCCTGGATCAAGGCTG 

AGCTTTTGGCCGCCTTCTTGGTCGGCGGATTCGCATACTTCGCCTTCACCACCAAGGGCAAGGTGCAGTCCTAAGTAGAGGATGCCTTCGTCATC 

GAGATTTTGAATGAAATTCATTCGCACTAAAGTCATGTACCATCAAATTGCCGGGTCCCGGAGATTTTTCCACATTTTGTACACCTAAGCATCGG 

ATCACCCGCCGCTCCAAAATAAACTGATTGAAAGACCAACCGAAGGAACCTAGTTGAGATTAATTTAATTTTAATTTAGAGATAACTTGTACGAA 

TACGCTTAAACCTGGCCCGTGGAGGGCGCTCCATTCATGGTCGCCTCCTCGTCGGAGGAGATGTCTATCATGCTCTCCACACGCGACGGCGACAT 

CACATACACAAAGTGCTCCGTGGAGCCCTCGGGGCCGACTGGCGGCGGGGGTATGACCACGGAGCCGTCCTGCGGCGGCGGTGGCGGAGTATCTA 

TGCTGCAGTACTTGAGATTCCCAGCTCCTCCTCCGCCTGAATCATCCGGCGGTGGCGCTCCACCGTTGCAATCAAAGTTCTCCAGCTCTAGCTCC 

AAATCACTGACGATTCGCATGCAACGCATCTTGGCGAGCGACTGGAATTTCATGGGCATAGCTCGCACCTGCCGCGCCAGACTCAAGAAGAAAAG 

AGTGTTTGCGTCAAGTTCTCTGTTGGCCTGTAGCACGGCAGAGACGTCCCAATCGCTCGGAAAACGCGGATCGCAAATCTTACGCGTTTTGGCGG 

GCGACGGCTGTTGTTGCTGTTGAATTTGTGGCATTTGCATTGACTGCTGCTGCTGCGTTTGTACGGAATTCCCTCCGGAGCAACTTAGGTTGATG 

GCCGTGGTTTGCGGCGAAGGTTGTACCCTTCGCACCTTGGAGGCTGGTGGCGCTGGCGGCGGCGGCGGAGGAGTGGGATTACCCATGGGTAAGCT 

CGGGCAGCTGCCGTTCAAACGATTGGCGTTCTGCACTTGGTTGGCCAGGTTCTGGAACAATCCCAAAGGTGCTCCAGCTGCCGCAGCCGCTGTCA 

GTTCCTTCTTGAAGTTGTCCGCCGCCGCAAAGAGATGCGGCTCGATGATCAGACAACCGACATTGCTGTTGGTTGCGCTGCCGGTGCTGCAACCA 

GGCGGCGTGATCGTAGGTGGCAGGGGAAGACCAGCAGCCACTGCCGCCGCCTTGAGCGTGGCTAGCGGGAACATCTGCTGGGCCAGTCCGCTCA 

{SEQ ID NO: 991) 

Exon: 1001.. 1369 
Exon: 1429. .1849 
Start ATG: 1130 

Transcript No. : CT25954 

TATGTATGGTGCTGTCGTGGCGTTCGCTTCCAGAGAGAAAGAAAGTCGCAGGTCGCCGAGTTGGATAGCCGTAGAGTTCCGCCTATTCCATTGAT 
TACTAAATAAAAACCACCTCATAACTTGATCAAGATGTCCCGTCAACTGTTCGCCATTTGCCTGATTGCCGCCGTTTGTGCGTCCGGCGTCTACG 
GTAGCTGCAAGGCCACCGTGAGCTCTTTCAGCACCCAGGATGCCACGATTCTGACCCAGCTGGCCCATGTGGGCGAGTTCACGCTGCAGTGCAGC 
GGAGCCGCTCCCGCCAGCCTGTTCGCCGAGTTCCCCTGCGGCAAGGTGGTGCCGGTGGCCAAGGTGGGCGACAACAAGTACCAGGTGAGCTGGGT 
GGAGGATATCAAGCAGGGCAGCAGCGGCAATGTTCAGGTGCGTCTCTTCGACGAGGATGGCTATGCCAATGTGCGCAAGGCCCAGCGGGATGGCG 
ACAAGGTTTCATCGGTGAAGTCGCTGCTCGACATTTCCGTGCCCACCAAGGGTGCCTACAAGGGTCCCTGGATCAAGGCTGAGCTTTTGGCCGCC 
TTCTTGGTCGGCGGATTCGCATACTTCGCCTTCACCACCAAGGGCAAGGTGCAGTCCTAAGTAGAGGATGCCTTCGTCATCGAGATTTTGAATGA 
AATTCATTCGCACTAAAGTCATGTACCATCAAATTGCCGGGTCCCGGAGATTTTTCCACATTTTGTACACCTAAGCATCGGATCACCCGCCGCTC 
C AAAAT AAACT G AT T GAAAGACC AACC G AA 
(SEQ ID NO: 992) 

Start ATG: 130 

MSRQLFAICLIAAVCASGVYGSCKATVSSFSTQDATILTQLAHVGEFTLQCSGAAPASLFAEFPCGKVVPVAKVGDNKYQVSWVEDIKQGSSGNV 
QVRLFDEDGYANVRKAQRDGDKVSSVKSLLDISVPTKGAYKGPWIKAELLAAFLVGGFAYFAFTTKGKVQS* 
(SEQ ID NO: 993) 

Classification : known_f lybase_gene 
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Gene Symbol: BcDNA: GM12291 
FlyBase ID: FBgnOQ27912 



Celera Sequence No. : 142000013383806 

AATCCAGCCGTATATCCCACTTCGATCCCTGATCACGGATGAGTGCCGCGAGCGCAGCTACATGGCCTTTATTATCTCGGGTAGTGTTCTTCTGG 
GACTACTGCTGCTCCTCCTGATCCTTTTGCTGTGGTGGCGAGTAGTCCAAAAGCGACGACGTCGTAAGCTGGACGTAGTGCAGCCGGAACCGCGC 
ACTTACAAGGAAACCCAGATCGTATACCAAATCGAGAACGCTGGCCTGCTAAAGACTGATTTGTAGACGGCAAGGAAAGACACATAGATCACAGA 
TGCCTGTTTTGGACTGTTTTTCAGTACCATAGCAGATCATCTACCCACAAGAGACACACAGCTGAATGAAGCTAAAAGGAGCGGTGGGCGGTGGA 
ATGGTTGGAGCACCATTGCCACCATTACCACCACCGATCATACAGAACCACCAATGCGTGACCTTTCGCTGTACCAAAACCATAGCCTACAGTTA 
GTCGTAATATATCTTAATCTAATTATTCATAACTCGCAATGTTCGTCAGTTGTAAGTGGCCCATGTATATAGTCGCATGTAGGCTGGGATCTTGC 
CCAAAAATATCCAGGTGGATTAGCCATATCCCACCAAATACGTATGTATATGGAATAAATCAATTTAAAGCCAAAATGTGGTGCTCTTAAATGTA 
TATGCTACATATGTACATATATGTTAATACTATTTGGAAGCGTAATTTAAAATAACAAAATTTGGAGGACAAAGTCAGCTAATAAAATATTGTAT 

AC AAAT T GC GG AT G AC AG AC GAG AT GT AC GT GT AAAC AAAAGT T T TAT T GAT C C CT AT AGCT T T C G AAAAT AT T T T T AG AAAC T T G AC AC AT GT T 
TACAAGATAATCATAAACTTTCAAAGGGAAATAGTGTTGGAAGGAAGCCCATTTTTGTTTACGATATTTAAATATTTATAAATAATATTATATTT 
CCCATTTCCCAAATACATTACGATTTAAATAATGATAGTACTTAAAGAACTCACCCGATGGGTAAATATGATTCCAAAGCCTGGTGAACGGTCAA 
CTGCGGTTCAGATAGGGCAAATCCTTTAGAGAGCAGGGTATGATGAGTGTTGATGGTCTCCTCAATGCCGCTCATGTCCTCCACGTCTTTGTCAT 
CTGATGT GAT ACATATTTGGTTTTAAT AT AATCCAAAGT AAAAGT AAT ACT CGCAT AAAT TACCTGGCAGATTGAGCTCCTTCTTCATCAGATTG 
GCACTGCACATGCCACCCAAATAGGCCAACTCCTGTTCAAGTCGTATCAGACACGAGTTGCTGGCATTAATCTGAGCGAGATGGAAGAACACGGT 
TTAGAAAGAGATAGTATTACACACGGAATCCACTCACCGATGTGGGATTGTCATAGCCAACCACTAGACACTTGAAGCCATAGCGATTGTTGTGC 
GGATCCTGGGCGTAAAGTGAGGCTGTTTCCAAGGAGAACTCCACACAGTTGCCGGGCAGAATGAGGACATTGTGGAGCCAGGTGGAAGCACTATA 
ATAATAAGAGCCATCTAGGATCATGACTCCTCATTGGAATATGTTGCTACTTACGTATTAAACTTCTTGACCACAATCCACTCCCTGTCATTGGG 
ATCCTGGCTGCGCAGAGGCATTGAACTGGGTGCGGCAGCAGATCCCGGAGCATTTCCACAACCTGGACTCCGGCAGCAACTGGCCATGGTCAACT 
GGACACTGGCGGAGCGTTGATGGGTGTTCGGTGCTGCGCCACTACCGGCGATGCCACCGCCCGTGTTGCGCCTCCTGTCCCCCGCTCCATTCACA 
TTGTTATCCGCCTGCTCCAAATAGTCATCCACATGCGGCACTGGCGGCGCCTGCGATGCTGGACGCATGGGAATCGACAGGAGCAGGTAATCTTC 
CAGTTGAGCAGT GCCACACTGCGGGTCAAACTCGATGGTGAGCCACTGTACGCAAGGTGGAAACTCCACGCGAT AGCT ACT AATGCTAGCACTCT 
TATACGGATGGTCACTCTCCACTACACAATAATGGTTACTGGTGGTGCTGCTCGAATTGGAATTGCCAGAACCAGAGGTCTGTGTAGCAAAAATA 

GCCGGCTCCGGCTGT C GCT GAT C CTT GG AC AC GT G C AACT GAT T T AGG GC C GCG AT AT GC GGC AG AAT C TC C T T G AT T AAGC CC AG AAT C T GG AC 

GGAACTCTTTGGATCACTGCAAGCCAAATTGGCAATATGTGCATAGACCAGTGGCAGTAACGTGTGGAACATATTGGAGGCGGCCAGATTAAAGA 

CCTCGAGCAGCTGCACTGCAGCTGCTTGACTGCCCACTCCTGCCACTCCACCAGATCCATCTGCCACACTTACTCCGCCACCGGCAGCCATGCCG 

GCAACCTCCAGCGGAGCCTCCTCCGAATCGGAGTGCAGATGCTGGGGCATTGACTGTTGAGCCGATTGCTCTGCGAACTCAAAGTCAGCGGTTCG 

TTCATTGCCATTTATCGGAACACTTTGACTTTGACTCTGACTTTGGCTTACACTTTGGCTTTGGCCATTTCTCAAGTCTGCCGATTCCTCGACTT 

CGGTGGCTCCGATTTCAATTTCCATCTCGAAGGGATTCGTGGATCGCTTGTCGAACTTTAGGGTCTCCATGAGACCCTTGCTAAACGACTCAATA 

CGTTTGCCCAAATCCCTGGCTGTGGACAGCTGCGGATCTATGGCTGTTGGCAGGGCAGCTGTTCTGGAATTGTTTGCCCAATTGATGTCCATGTG 

CTCCTCGATGGGTGTGATATTGTGCTCGGAATCGATGGTCTGTGTGTGATTGGAACTATTATCTGACGGCGAACAAGATGCAGCCATCGCATTGC 

GTGCCAAAATCAGCAGTTCAGTACACTTTTTGGTTATATCAGCGGCGATCTGCAGGGCTGTATCGCGGGTGGAGACCTCGTGGGGACCAAGAAGC 

ATAATGCGATCCTCCAAATGAGTTGCTACATTGGATCCATCGCCCGTTCGTCCACTCGCCGAGTGTCCATCCTGTTTCATAGGCGTGCTGTAGTA 

CAGGATGCAAGGCAATTGACCTCTAGCTGGAGTGGTTCCATTGAAGCTAAGATCGCATGCATAGAAGTGGAACTGGGCGCCACAAGGCCCTCGAA 

CCGCCGGCATTCCCGCGTCGCCAGAACAAGTCCTTGCTCCTTGGGAGCATAGACGAAGGGCGTAGCGGGCATTCTCCTTGATATGCACCGGATGG 

TCAAATTTGATCTCCGCCAGATCGTTGTGAACTGCATCCTGATCGTAACTGCCGGATACCGATTCCAGGGTCTCCCACTTGTGCTGCGGCTGCAG 

ATCCGCCGTATTATCGTACAGCAACTCCAATTGGTAGTCATAGCTCCCGGATCCCGAGTAAACCATTGCGCCAGCTATGGCCACACCAGGTCGAT 

CCACGGCGAAGGCAATAGCATCGGGTCCAAAGTTTCCGGTATTCCAGGTCCTACTCACATCGCAGCGGGCAAAGCGAGCTCCCGTCGAATGAAGG 

GTTCTGGGTGGCATGAACAGCTTGTCATATTCACCCATCGCCGTTTGATAGACAATCTCGGCCAGAACTCTGGCCAGCAGATGACAACAGTTGTC 

AATCAAGCCTTGATTAACACAATCTTTACCTCCGGGATAGCCAAGACTTCCTAAGCTCCGGCTGTATATCGCCTCGATGCGAGAACGTATGGGAC 

TGGCAATAATGTCTAGCAAACGCACGAGTACATCCTTAAATGTCCAGGAATTGCTCAGAAAATCACTCTTCTCCTGCTGTGTGCGATATATCATT 

TGCTCCACCAGCACGGGATACGGATGGGCATCCGTGGAGGAGAGCATGGGCAAGCCGGAATTGTCGCTGGGACTGATAATTGACTGGCGTTCGTT 

TCCAGCCGCACTAAGGAGTGAGAAAGTGGCTCTGAGTTTAACCGATGGCGAGCATAATCCAGCAAGGATCGCGCTTAGCAGGCGGGAATGCAAGG 

CTCCACGATCCATCTGAGCAAGCAGATCGCACAAGCAGTTCCACTTCAACGAGGACGTCGGATAGAATGCATCGAAACAGGCCACAAAGCTCAGA 

TGACACTCGTCGAGAATCTCCAGCGATAGTTCATAGCCTACAAAGTAAGAGATATCATAGAATAGAATAGGTAAACTATAATTTCCAGACTTAAA 

CTAACCCTCATCCGTGGCTATATCCTTGAATATGTCATCGCAAAAGATGCCAATGAGCAGAGCTCTAACATTGCCAATGCACTCCGCCAGCTGGA 

TGTTTTCCATGTTGGTCTTCCGGGTGACGGGAGTTCCGCTCCCAGGTGCCACAGCTGCCGAGGCCGATGTAGATGGAGCTCCTCCGCTGCTGGCA 

CTGCTAATCATGGCTGGGGCCACCGATGTGGAGGGATCACCAAAATACTTAGCCATTACTCCTGCGTTGCCCACCACACGAGGGGTATTCTTGTC 

TATAAATGAATATTATTGAATATTACATTAAAATGAAAGCAAAGGTGTTCACCTTTGTTTGGCTTAGACTGAACAGACTTGGCAGTGGTTACTAC 

GCCACTACCATGGGCTGCATTGGAGCCACCGCCCGTGGCCACCGATGTGGAAGAGTTCCTCTGCGGGTAAATCTCGATTGTGTACTTGCGCAACA 

GGCGGAGACATGACTTAATGACATAGCCGAGGTACTGCAGCGACTGTTGCAGCTGCAATGGACGGGAGCGATCGCGCTCCTCGCGCAGCTGCAAC 

TTAAAGCAATCCCACGACCAGCTTAGCAGGACCACCAGACTCTCGAAGCACTCACGACTCACACTGTTGGCAAAAGCCCGGGAAATACGCTGCAC 

AGGATCGGCGTCCATTTGGGCCGGCGTCTGTTTGCAATCCTGGGTGACCAATCGGTACAAAATCGCCGGTATCTGGCCAGAGTTGACATCGGTGC 

CGTTGTTTGCCTTTTTGCTGGATTTGAAGGAGAAAACCACCTGATCCTCGGTGGTCACGGACGCCTGACCGCAGGATCCGCAATCGGAGGAGGGA 

CCTGCAATCCGGGCCCACACCAAATACCATCTGCCAGCTACCGCACTGACCGGTTTGGGCAGAAGGATATGATGTTTACTCCTGGCGCCGCACTC 

GTAGGGCACCTCTTTGGTCTCGCTGATCAGGATGCCTTCCTTCTCGTAGCCACCACCGTCGCCGCCCAAATCGAAGAGCTTCAGTTTGCAACTGT 

ACTCCCCGCGGCCACCGAACATCCCGAAGCCACAGATCACAATATCGGTGTCCGCCGAGAAGCGTATGGCCTCCACACTGTGGCCGGAGTAGCCC 

CATCCGCCGCCAAAGTTATCGAACCGATTGACTACCTGGAATTCACCGGCCTGCGTTTCTGCCGTCTGTTGGGTTTGCTTCAATAGAGGCTGCTC 

AAAGCACCCGGGTATGTTGTCCTGCGCCGAAGTGAGGATGTCGAGACAGGCGAGTAAGTTCAGGGAGGCCTGCGAGCGGGCCACACGCGAATCCA 

CCCGATCCGGCAAAGATAACTCCGGCGAGAGCAACGATCGATAAGTGGCCGCGTTCTGGAAGATAGTTTAAGATAAGAGGAACAAGGAAAACTGA 

TTTCATCTATTAGCTTACCGCATCGCTGTCGTTGATGTCACTGGCCACCACATTGTGGCAGCGTAGTTTCCGTGCAGCTCCATCGAAAACCCAGA 

GAACATTGTATGTGGGATCCATGGTGAAGGCTAGTTGTGGTGGCGATGGAATGGGTTCCGGTGAACTCGTACCCTGTTTGCCACCACGACGACTC 

CTTGGCACAGACCCCGCCGATACCGGTGTCCCAGGTGCTGCTACTGCTGCTGCTGCAGCAGATGATCCCACCTCCGGTAGCTGTTCCTCAAAGAT 

CTGATTCCTGGCCTCGTGCATGCTTCGGGACATCTGCTCGTTGATGGGCACTCCAGCCGTAGCTGTGCCCGTCAGAGCCGCAAGAAGGGACGAAG 

TGGAGGCATGTGCCGGCGAATCCGCAACCGGAGTAACGGCCACATCCAAATTGGAGTTGATCTCCGGTTGCTGTTGCTCCGGCTGTGCTTGCATC 

AGTTTCACAAAGTTGGTTTGTCCACGATAATGGGCAGTACAACTACCATCCCGTCGATTTATAGAGAGCGTATGGAAGGACAGCGGAATGCTGGG 

AATGAGCACTATATCGGGGATGAGTCAAATATTAGTATTTGATTTTTTTTTAAATCTTGTGAGAAACTTACATATCGTCTTTTTATTGGCCACCA 
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CATGCATCTTGGGCAGCATTTGTGCCGTGATCAGGGATTCGTCTATCTTGATAAAGGTCTGATCTCCATTGGCTCCAATCCAGGTTACCTTCTTG 
CCATAACAGGGACCCAAACCACTGCAAAGGATAATAGATAATCATTAGCAATCCTTATTCTACTAATGTGGTTTTTTCCATACAAGACGGCTCCA 
GGTGAACAATTCCACAGAAACTTCTGACGCTGGGTGAGCACCGGCAACAGTGGAGGCATCTCCTTGGAGCCACTGGGCTGCACATTGGCCGGCGA 
TTGGGATCGTTCCGGGCCCGGCATTCCCACCGGCGCAACCGTTGGCGATCCTCCTGCTCCTCCATCAGATCCAGCTCCCACCGGAGAGTCATCAT 
CCTGGGGCGGAGGCTTCAGTCCATAATCACTGGGCAGTCGGCCCAGTTGACCCTTCTGGTAATTGCCAAACGTGTAGACCATTCCTTTCGATGTC 
AGCAGTACGGTGTGATTGCTCCCAGCGGCCACTTGGCTGATGGCACCCGGTACCTGAACACGAACTGGTCCGCTCACTGGCTGCAGATCTCCACT 
GCCCAATTGACCATACTGATTGCTTCCGAAAGTGTAGACTTCACCGGCCAAGGTGAGCACCACCGTATGGTGCAATCCACAAGCCACCTGCACTA 
CAGGCGATGAGCTGGTGAGCTGTAGCAGCTGCGGCGCCAGTGGAGCCACTCGAGGCGGATCCTTGTCCAGATCAGTGGGCGTTGCAGCTCCTCCT 
CCGGCGGCTCCTGTTTCCAGTTCGCCCTTGCGCTCCTTGCGGCGCATGATAAGGGTTTTGGAACGCTGCTGGCGCTGTTGTACATCACTGGGTGG 
AGG T T T T AGT T CCGTTTTGGCCTCT GC AAGT GG AG AT C G GC AT T AAGC AT GC AACT AGG AAT T T T T C GAT C AG TAG AT AT AC T C AC C AT C GC T AT 
CCTGGAGGGCAATGCAGGCGGCGCACAGGCCACACTTGGCACAGCCCGCATCTCCGTGACCACAGGGACATATACTGCCCGCCAAACGCTGATCC 
AGTGGTACATTTAGAGCGGAGACACAGCTAACATTGTATCCCGTGCATTCGCGACAAATGACGCACACGCGGCACTGACCCTGCACCAGATTATG 
GTCATCGTAGTCGCACAGCGTGGAGAAGTCCAATTTGAGGCGCTTGTCCAGTGGCAGCAGGGATGCAAAACAGGCGGCGGCCGTTGAGGATGAGG 
CAAATGGCGGCACATCCTTCACCGGCTGCAACTTGTTGAATATCCTGCCGCACTGACCCTTATTGTTCAGTCCGAACGAGAAGAGCTGCCCCTTG 
GCATTGAGAGCCACACAATGGGCCTTGCCCAGCGCCACCTTGGTCACATGCTGTTCCAGCAGCTCACTGACGAAGCCCATGGCATCACAGTGGGC 
CGTATCCTTGCCATACATGATCAGCTTTCCAGTTTTGGTTACAAATGCAGAGGTGCCATTGTTGCAGGCGGCATGCACCACCACATGGCCATCGA 
TTTTGGTCATCTTCTTCGGTTTAACCGCCTTCGGTTGCCTCCGGTTCTTGCTGCTATCACCATCTTCACCGCGACGAGCTGTGCCCGCAAAGAAA 

AC GGT AC CAT CAT CAT T G AC C AAC AGT GC AT G AAT GC C AT CGT GT CCC AC GGC C AC AT GG AC T AT AT T GGCCGCCT T CG AG AT G ACC AAT T C GGT 
GAGCTTGAGTGTCGGGGTCCGACCCACACACTTCAATCCGAGCGCAGCGGATTTCCCATAATAGTAGACGCGACCATTACTGGCACGCACCAAGC 
CAAACTCCTTGCCGCAGGCCATGCCCATAACATCAACATCCGCATCTCTGCAGGGAGCGGGTAGCTTTGGTTCGAAGCTATTGTGGGCCGACTGG 
ATGCGTTGAATCTGATACTGATTGAGCAGCTCATCCTCGAAGGCCGCATACCCCAAGGTCCGGAACTTCTTGCGGGCCAACTGGAGCGGCAGTTC 
AAAGGGCGGCGGATCAATGTTGTAGCTATTGTGGTACAGATTTAGTTTCTTAACCACCAGTGTATCCTATTAAAAAAAAAACAATACTCATTACA 
AAATAATCTTATATAGTTATACATCAATTTCTAAATGTTAAGGGTATCCTGTCATATTTTATATGAAATATAAAATATACAAAAATCGTTATTTT 
CTATGATTATACTGAGATTTTAAATAAAATGCAGTTACTTACATCGCGATTGCTGCAGATGGCGTGCAGTGAGTCGTCGTCCGTAAAGAGCACGT 
AGTTGAGTCCTTCGCGCATATGCAGCATACTCAGGGGACTCATTGCCTTAATGGCTAGCGTGTCCAGGCCCACCATTTGCAGTTGATCACCAGCA 
CTGCGGCGACAGGTGCGCCGAAAGTAGAGCTGCCCGCCACTGTAGCCCAGCCAGGCGCTCCTCTCGTGGCTGAAGTCCTCATTCTGGGCATAGAT 
GTGACCCTTGTAGGATCCCCCAAAGCCAGTGCCAATTTTTAGCAGCGTTCCTCCATAGAGTAAATAGACAAACACTCCATCGCAGGCCATTGCTG 
GCGAGGTGATGGGAGCACCTGTGCTCAAGGGCGGCAACTTCACGGGAAACTATTAAAAAAAAGGGGGAACATTTTTGTTTAACCACGCATACAAT 
AACAATAATAAAAAAAAACACCGATCACTACACAGCTCACGCACGCAATCTACTACAAAAATAGTAAATATATTATACAAATATTGTATAAAACA 

TTAGCTCTT T AT AAAAT AAAC AAAAAAAAAAC AC T AAC CAAGT C CAT AAC T C AC C T CAT C GAT C AGGG AAT T GT G CGG C ACCC C GT GGC T G T G AA 
AATCCGGTCTGGTCGGCTTATTCAACGCCGCACTGATCACCGTGTGCTGCAAGGTGGCCAGCACCACCGGCAGCTGGACAATCTGGGCGGAGAGC 
TGACGCGGCGTCATCAGCATGGCGGCAATAGCCTTGAGCATCTTGCCCGTATCACCGCGTGCGATGCACAGTCCCAGGAGGGCGGCACAGGCCAT 
GGCCGACCAATTGGCCTCCGCACCGGTGGCCGAATTGAGTGCGGCCGGCATGGTGGCCAAATCGAGAAGAAGATCGTACAGCGGATTGATCAGCT 
CCTCCGGTTCTAGCTTAAAACTTTCCGGCTGCTGGCCCTGGATGACATCCAGCAGGGCCTTCAGCGCCTTGGTGCACAGCAGCGGATGTACATGC 
CGGGCATCAGCGATCAGCTCGAAGAGACTGCGCAAGCCGGCGCCCACAATTAGCGGCACAGTGGCGTAGACGGGCTCCCTTTCGGTGGGCTCCTC 
TGTGTCCGAATCCGCATCGCTGTTGGCGCTATCCCGACGTGGTCGCTTCTCTGCTGGCAAACCGGGTCCGCCGACGACCACCGGCACTGGTGCCG 
GTATCCGGCCATCGACAAGGACTCCAACTGGTCCGCCAGGACCTGCATTCCCAGGTCCAACGGCGCCGTGCATAGCCATATGCGCGTGGCAAGAG 
CAATCGAGTAGATTGCTTATCATACAGCTGTGGGCACACATCATGCAGCAGGCTGAGTTCACGTTATTGCTCCACTTCTTCTGGTTTGATCCGCT 
TTGGCCACTGGGTGCTCCGCCCTCCTGGAACAGGGAGTAGCGCGTCTCTGCCTCCAGAACCGTGGATCTAATTGCCGAGAAGACCGCAAACTTGG 
ACGCATTCGGATCCAGTTCAATGTGCGCCACCGGAAACGAGGGCAAGATCATCAGTTGCTCCTTGACCAGCGGTGGATCTAGCGGATTCACCAGT 
CCCCCGGCACCCGCATTCCCTTCCAGAACAGCGGCAGCAGCTGCCGCTGCCATTGCTGCGGCCGATTTGTCCTTTCGACTTTTGCGTTTCGTTAA 
CACCTTGCGATCGTGTTGCATTTTTCTGTAATAGAAAAGAATAAATAGGTTTAAAAATATTATACAAAATGTATGATGAACTTGAATAAAAAATC 
AATAAGTGCCCACCATGAAAATAACATTAAGCTAGAAAAAATACAAGAGCTTGAGGAAAAGAAAATTCTAAAATCCATTAAACAAGAAATCATAA 
AATAATAGTTTGCGAAAAAAAAAAATTTTAGACTAAAATCCTTTTATAATGTAAGTAGCAATTTATATCCTTGCACAAATGATAATCTTTTCATT 
TTTTTAAATTGTGTATCTATTCGACAGGTGGAACATATGTATATTTTGTATTTTGAACTATGATAAACTTGAAAACTTTGATTTCGTGTGATATT 

TGTTATTGGTT T AT AC AC TAT G T AAT AGT T GC T AAT T AAAAC GC A C G AAT T T T C T T T AAAC GG AC T C AC T T AAAAG AAAAAC T T T T G AAAAT AT T 
TAGGGACACTAATCAACATCTAAGTCATAAATCAAAAACGCATTCCAACTAACTGAAAAAGGGGAACAAGCTTAATTTGCGAGGTCCTGGCGAAG 
GACTGGCATTTGTTTTTGTTTGCCTACAGGAAGCTTTTAGCTCTTATTGGCTCTTATCAGCCTGCAAGCTTTTATCCTGGATGGCGGTCGTTATT 
TTTGGACGACAGATGCACATGCATATGTACACATATCGGACACCAAGGTGCAAGCTTTGGGATGCACATATATATGCATATACCCACATGACCAT 
CTTTGCTACTGAGTGCATCCGGAACTTTCGATCCTGCAGCTGACAGGTGGTTTTCAATACTGTTTAACACATCGCAAAGTGAGCTCAAAGTCCGC 
GAAAGAAGCAATTACTGCCTCCAAAGTGTGTAAATTTCGTCACAATTTATTTGCATCAATTGAATGTCATGCTTTAAAAGAAATTTTTAAAAGGG 
GTTTATAATATAAATGGTTGAAAAGTGGCGGTATAAAATTTGTTATGATTTAAAAAAAGTATATATTCAATGTGAAATAATGTGTTTGCGAAAAT 
CAAGTTCTTATTGACAAACTGAAGTTTAGTGTACTAAATCCATGAAATCCGAAAACCCAAAACCTTGGCGCAAACGTTTAATTATTCTAAAGCAA 
TTTGCAGGCAATCAATAAAAGCAAGACAATAAGAAACTTTTTACACGCCTTCTGACAAATCGATTTTCTTGAGCTGATTCGACATTTTGCCAACT 
AACTGTTGTCTAGGTATTCAATTAAATAATCTTAAGCCCAAGGTCCCGAAACGGTAACCAGCATATATACATACCCTGTTAGAGTATTTAAGACT 
CAACTCATTAAAGCTACTTAAGCAATGTTTCCTTTTCGTAAATTTAAAACTCAATTGATCTTTGAACACTTCTAGTTTTGTTTTAAACTACAAAT 
CTGTCGTAGCATATATAATCCTGCTTTAAACCTATTTAAAGACATTAAAGTTAACATTTTTCTTCGGCAGATATTGGGATTTCGATTATATAAAT 
AATATATATTGAAATATATATGATCCCCAATCGTGTGATGTGAATAATATATATTGAAATATATATAATCCCCAATCGTTTGATGTGGTCTCCCA 
AAGATAGTGTTTTTTGGCGGTCTTAACGGGGAGCGACATTCTAAACGCCGCACCATCGCCTCGTGGGCCATCAGTGGATTAATGCAGGTCGCTAG 
GCTCATGGCAGTCAGCTGATTGAGAGCAGCGCTCAGCTGTTCGGCGCAGGACTGTCCACTTCCGCCCGTTGCAACCCCCAAAAATCGGGGGGTGC 
TGCGACAGCGCGGAGAGCGAATGTGAATTGTGTAGGGGTTGAAGACCCAGATTGGAAGCCGAAAGCCGAGTGCAGCAGATCGATTTTTCAACCCC 
AAGTCGGATGGGCTGTACGGGACGCAGGGGATGTAGGGGTTAGGGGGTGGCAACTGAATGGGACTCCGGGACATCTACTTATCCAGATGCTGGCG 
GGGATTACCTGCGCGTGCTGTTCGTGTACAGCTCGTAGAAGTGGTCGCCATACTTGAGCGGATTGAGCAGGCCAACCATTTTGGCCCGATTTTCC 
GTTAGATCCAGTGCTCCAATCGATATTTTCTATATATTCTAATGCTCCGTTACCGTAGCCGTTACTGTTGCCGTCTACCGTTACAACGTTATGCT 
CGCTCTCTCCCGTGAATCTCCTCTCTTCTTTGGCTAATGATTTATATTGCACAGCGGTTATTAAACAAAATATGAATTAACAACTAGAATTTAAC 
TAATTTGCCGAGCATCGTCTGGCAATTTGAGTGCCCTTTGTTTTCGCCATTTGCCTTTTGTCACTTTCACCCAAAAAAAAGGAAACACACACCGA 
GCGACAACGCAACGGTACAATGGGAACGAGGGAACTAAACTTTCGTCCCGCTTGCTCTTGTGCGCTTCCTGCGAGAGAGGACGCTCAGCTGTGTG 
GGAGACAGTGACAGTGTTGGTAACAGAGCGGCATGTCCTATTCTGTGCTTAAAATCTTAAATTTTTCGGAGCTTTCCGATTTAATTTCTGCCTAA 
ATTAAAGGTACTTTAAATCGCACGAGTATTTTCTTCAATTTTTTATTTTATATATCTTAAATCAACAATTTTGGTTTCCACCTTTCATCCATAGC 
TAAGGCTTTTCAGTTTTATTTTATGGCACAACGCAAAAGTATCGATCACGAAATATCACATTTATTGTTCAACTTTTAATATCTGTGTGCCTTTT 
CCCTAAAAGTTCCGTTATAAAAGTTAACATTTAAAAATATTTCACATACATTGCAATACACCAGCTGTTCGAGCAGTGCTGAAAAACGCCGACGC 
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ATCGCGAACTGGCAACGCTGTCAGCAGCTGTCACTTGCAACAAAAACAACAGGGGCAGGAAGAAGAAGAACTTTGACAGCTGCGGCGCAACCGAA 
ATCTAAGAAAAACAATTTACGTAATTTAATTGAGCGAATTTCAATGGTTTGCGAGTGTACTAAAACGTTTTTTCCCCACCCGCAGGATGGGCAAC 

GCGCTGACGCACTACATGAAACCGGACGTGATGC 
(SEQ ID NO: 994) 

Exon: 12239.. 12169 

Exon: 12077.. 11808 

Exon: 10190.. 9080 

Exon: 8884.. 8498 

Exon: 8331.. 7212 

Exon: 7148.. 6449 

Exon: 6386.. 6247 

Exon: 6183.. 5624 

Exon: 5565.-4569 

Exon: 4464.. 4186 

Exon: 4122.. 1575 

Exon: 1515.. 1368 

Exon: 1306.. 1204 

Exon: 1140. .1001 

Start ATG: 12239 (Reverse strand: CAT) 



Transcript No. : CT25974 

ATGGTTGGCCTGCTCAATCCGCTCAAGTATGGCGACCACTTCTACGAGCTGTACACGAACAGCACGCGCAGCCCATCCGACTTGGGGTTGAAAAA 
TCGATCTGCTGCACTCGGCTTTCGGCTTCCAATCTGGGTCTTCAACCCCTACACAATTCACATTCGCTCTCCGCGCTGTCGCAGCACCCCCCGAT 
TTTTGGGGGTTGCAACGGGCGGAAGTGGACAGTCCTGCGCCGAACAGCTGAGCGCTGCTCTCAATCAGCTGACTGCCATGAGCCTAGCGACCTGC 
ATTAATCCACTGATGGCCCACGAGGCGATGGTGCGGCGTTTAGAATGTCGCTCCCCAAAAATGCAACACGATCGCAAGGTGTTAACGAAACGCAA 
AAGTCGAAAGGACAAATCGGCCGCAGCAATGGCAGCGGCAGCTGCTGCCGCTGTTCTGGAAGGGAATGCGGGTGCCGGGGGACTGGTGAATCCGC 
TAGATCCACCGCTGGTCAAGGAGCAACTGATGATCTTGCCCTCGTTTCCGGTGGCGCACATTGAACTGGATCCGAATGCGTCCAAGTTTGCGGTC 
TTCTCGGCAATTAGATCCACGGTTCTGGAGGCAGAGACGCGCTACTCCCTGTTCCAGGAGGGCGGAGCACCCAGTGGCCAAAGCGGATCAAACCA 

G AAG AAG T GG AGC AAT AAC GT G AACT C AG C CT GCT GC ATG ATGT GT GC C C AC AG CT G T AT GAT AAG C AAT C T AC T CG AT T GC T CT T GC C AC GC G C 

ATATGGCTATGCACGGCGCCGTTGGACCTGGGAATGCAGGTCCTGGCGGACCAGTTGGAGTCCTTGTCGATGGCCGGATACCGGCACCAGTGCCG 

GTGGTCGTCGGCGGACCCGGTTTGCCAGCAGAGAAGCGACCACGTCGGGATAGCGCCAACAGCGATGCGGATTCGGACACAGAGGAGCCCACCGA 

AAGGGAGCCCGTCTACGCCACTGTGCCGCTAATTGTGGGCGCCGGCTTGCGCAGTCTCTTCGAGCTGATCGCTGATGCCCGGCATGTACATCCGC 

TGCTGTGCACCAAGGCGCTGAAGGCCCTGCTGGATGTCATCCAGGGCCAGCAGCCGGAAAGTTTTAAGCTAGAACCGGAGGAGCTGATCAATCCG 

CTGTACGATCTTCTTCTCGATTTGGCCACCATGCCGGCCGCACTCAATTCGGCCACCGGTGCGGAGGCCAATTGGTCGGCCATGGCCTGTGCCGC 

CCTCCTGGGACTGTGCATCGCACGCGGTGATACGGGCAAGATGCTCAAGGCTATTGCCGCCATGCTGATGACGCCGCGTCAGCTCTCCGCCCAGA 

TTGTCCAGCTGCCGGTGGTGCTGGCCACCTTGCAGCACACGGTGATCAGTGCGGCGTTGAATAAGCCGACCAGACCGGATTTTCACAGCCACGGG 

GTGCCGCACAATTCCCTGATCGATGAGTTTCCCGTGAAGTTGCCGCCCTTGAGCACAGGTGCTCCCATCACCTCGCCAGCAATGGCCTGCGATGG 

AGTGTTTGTCTATTTACTCTATGGAGGAACGCTGCTAAAAATTGGCACTGGCTTTGGGGGATCCTACAAGGGTCACATCTATGCCCAGAATGAGG 

ACTTCAGCCACGAGAGGAGCGCCTGGCTGGGCTACAGTGGCGGGCAGCTCTACTTTCGGCGCACCTGTCGCCGCAGTGCTGGTGATCAACTGCAA 

ATGGTGGGCCTGGACACGCTAGCCATTAAGGCAATGAGTCCCCTGAGTATGCTGCATATGCGCGAAGGACTCAACTACGTGCTCTTTACGGACGA 

CGACTCACTGCACGCCATCTGCAGCAATCGCGATGATACACTGGTGGTTAAGAAACTAAATCTGTACCACAATAGCTACAACATTGATCCGCCGC 

CCTTTGAACTGCCGCTCCAGTTGGCCCGCAAGAAGTTCCGGACCTTGGGGTATGCGGCCTTCGAGGATGAGCTGCTCAATCAGTATCAGATTCAA 

CGCATCCAGTCGGCCCACAATAGCTTCGAACCAAAGCTACCCGCTCCCTGCAGAGATGCGGATGTTGATGTTATGGGCATGGCCTGCGGCAAGGA 

GTTTGGCTTGGTGCGTGCCAGTAATGGTCGCGTCTACTATTATGGGAAATCCGCTGCGCTCGGATTGAAGTGTGTGGGTCGGACCCCGACACTCA 

AGCTCACCGAATTGGTCATCTCGAAGGCGGCCAATATAGTCCATGTGGCCGTGGGACACGATGGCATTCATGCACTGTTGGTCAATGATGATGGT 

ACCGTTTTCTTTGCGGGCACAGCTCGTCGCGGTGAAGATGGTGATAGCAGCAAGAACCGGAGGCAACCGAAGGCGGTTAAACCGAAGAAGATGAC 

CAAAATCGATGGCCATGTGGTGGTGCATGCCGCCTGCAACAATGGCACCTCTGCATTTGTAACCAAAACTGGAAAGCTGATCATGTATGGCAAGG 

ATACGGCCCACTGTGATGCCATGGGCTTCGTCAGTGAGCTGCTGGAACAGCATGTGACCAAGGTGGCGCTGGGCAAGGCCCATTGTGTGGCTCTC 

AATGCCAAGGGGCAGCTCTTCTCGTTCGGACTGAACAATAAGGGTCAGTGCGGCAGGATATTCAACAAGTTGCAGCCGGTGAAGGATGTGCCGCC 

ATTTGCCTCATCCTCAACGGCCGCCGCCTGTTTTGCATCCCTGCTGCCACTGGACAAGCGCCTCAAATTGGACTTCTCCACGCTGTGCGACTACG 

ATGACCATAATCTGGTGCAGGGTCAGTGCCGCGTGTGCGTCATTTGTCGCGAATGCACGGGATACAATGTTAGCTGTGTCTCCGCTCTAAATGTA 

CCACTGGATCAGCGTTTGGCGGGCAGTATATGTCCCTGTGGTCACGGAGATGCGGGCTGTGCCAAGTGTGGCCTGTGCGCCGCCTGCATTGCCCT 

CCAGGATAGCGATGAGGCCAAAACGGAACTAAAACCTCCACCCAGTGATGTACAACAGCGCCAGCAGCGTTCCAAAACCCTTATCATGCGCCGCA 

AGGAGCGCAAGGGCGAACTGGAAACAGGAGCCGCCGGAGGAGGAGCTGCAACGCCCACTGATCTGGACAAGGATCCGCCTCGAGTGGCTCCACTG 

GCGCCGCAGCTGCTACAGCTCACCAGCTCATCGCCTGTAGTGCAGGTGGCTTGTGGATTGCACCATACGGTGGTGCTCACCTTGGCCGGTGAAGT 

CTACACTTTCGGAAGCAATCAGTATGGTCAATTGGGCAGTGGAGATCTGCAGCCAGTGAGCGGACCAGTTCGTGTTCAGGTACCGGGTGCCATCA 

GCCAAGTGGCCGCTGGGAGCAATCACACCGTACTGCTGACATCGAAAGGAATGGTCTACACGTTTGGCAATTACCAGAAGGGTCAACTGGGCCGA 

CTGCCCAGTGATTATGGACTGAAGCCTCCGCCCCAGGATGATGACTCTCCGGTGGGAGCTGGATCTGATGGAGGAGCAGGAGGATCGCCAACGGT 

TGCGCCGGTGGGAATGCCGGGCCCGGAACGATCCCAATCGCCGGCCAATGTGCAGCCCAGTGGCTCCAAGGAGATGCCTCCACTGTTGCCGGTGC 

TCACCCAGCGTCAGAAGTTTCTGTGGAATTGTTCACCTGGAGCCGTCTTTGGTTTGGGTCCCTGTTATGGCAAGAAGGTAACCTGGATTGGAGCC 

AATGGAGATCAGACCTTTATCAAGATAGACGAATCCCTGATCACGGCACAAATGCTGCCCAAGATGCATGTGGTGGCCAATAAAAAGACGATATT 

GCTCATTCCCAGCATTCCGCTGTCCTTCCATACGCTCTCTATAAATCGACGGGATGGTAGTTGTACTGCCCATTATCGTGGACAAACCAACTTTG 

TGAAACTGATGCAAGCACAGCCGGAGCAACAGCAACCGGAGATCAACTCCAATTTGGATGTGGCCGTTACTCCGGTTGCGGATTCGCCGGCACAT 

GCCTCCACTTCGTCCCTTCTTGCGGCTCTGACGGGCACAGCTACGGCTGGAGTGCCCATCAACGAGCAGATGTCCCGAAGCATGCACGAGGCCAG 

GAATCAGATCTTTGAGGAACAGCTACCGGAGGTGGGATCATCTGCTGCAGCAGCAGCAGTAGCAGCACCTGGGACACCGGTATCGGCGGGGTCTG 

TGCCAAGGAGTCGTCGTGGTGGCAAACAGGGTACGAGTTCACCGGAACCCATTCCATCGCCACCACAACTAGCCTTCACCATGGATCCCACATAC 

AATGTTCTCTGGGTTTTCGATGGAGCTGCACGGAAACTACGCTGCCACAATGTGGTGGCCAGTGACATCAACGACAGCGATGCGAACGCGGCCAC 

TTATCGATCGTTGCTCTCGCCGGAGTTATCTTTGCCGGATCGGGTGGATTCGCGTGTGGCCCGCTCGCAGGCCTCCCTGAACTTACTCGCCTGTC 

TCGACATCCTCACTTCGGCGCAGGACAACATACCCGGGTGCTTTGAGCAGCCTCTATTGAAGCAAACCCAACAGACGGCAGAAACGCAGGCCGGT 

GAATTCCAGGTAGTCAATCGGTTCGATAACTTTGGCGGCGGATGGGGCTACTCCGGCCACAGTGTGGAGGCCATACGCTTCTCGGCGGACACCGA 
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TATTGTGATCTGTGGCTTCGGGATGTTCGGTGGCCGCGGGGAGTACAGTTGCAAACTGAAGCTCTTCGATTTGGGCGGCGACGGTGGTGGCTACG 

AGAAGGAAGGCATCCTGATCAGCGAGACCAAAGAGGTGCCCTACGAGTGCGGCGCCAGGAGTAAACATCATATCCTTCTGCCCAAACCGGTCAGT 

GCGGTAGCTGGCAGATGGTATTTGGTGTGGGCCCGGATTGCAGGTCCCTCCTCCGATTGCGGATCCTGCGGTCAGGCGTCCGTGACCACCGAGGA 

TCAGGTGGTTTTCTCCTTCAAATCCAGCAAAAAGGCAAACAACGGCACCGATGTCAACTCTGGCCAGATACCGGCGATTTTGTACCGATTGGTCA 

CCCAGGATTGCAAACAGACGCCGGCCCAAATGGACGCCGATCCTGTGCAGCGTATTTCCCGGGCTTTTGCCAACAGTGTGAGTCGTGAGTGCTTC 

GAGAGTCTGGTGGTCCTGCTAAGCTGGTCGTGGGATTGCTTTAAGTTGCAGCTGCGCGAGGAGCGCGATCGCTCCCGTCCATTGCAGCTGCAACA 

GTCGCTGCAGTACCTCGGCTATGTCATTAAGTCATGTCTCCGCCTGTTGCGCAAGTACACAATCGAGATTTACCCGCAGAGGAACTCTTCCACAT 

CGGTGGCCACGGGCGGTGGCTCCAATGCAGCCCATGACAAGAATACCCCTCGTGTGGTGGGCAACGCAGGAGTAATGGCTAAGTATTTTGGTGAT 

CCCTCCACATCGGTGGCCCCAGCCATGATTAGCAGTGCCAGCAGCGGAGGAGCTCCATCTACATCGGCCTCGGCAGCTGTGGCACCTGGGAGCGG 

AACTCCCGTCACCCGGAAGACCAACATGGAAAACATCCAGCTGGCGGAGTGCATTGGCAATGTTAGAGCTCTGCTCATTGGCATCTTTTGCGATG 

ACATATTCAAGGATATAGCCACGGATGAGGGCTATGAACTATCGCTGGAGATTCTCGACGAGTGTCATCTGAGCTTTGTGGCCTGTTTCGATGCA 

TTCTATCCGACGTCCTCGTTGAAGTGGAACTGCTTGTGCGATCTGCTTGCTCAGATGGATCGTGGAGCCTTGCATTCCCGCCTGCTAAGCGCGAT 

CCTTGCTGGATTATGCTCGCCATCGGTTAAACTCAGAGCCACTTTCTCACTCCTTAGTGCGGCTGGAAACGAACGCCAGTCAATTATCAGTCCCA 

GCGACAATTCCGGCTTGCCCATGCTCTCCTCCACGGATGCCCATCCGTATCCCGTGCTGGTGGAGCAAATGATATATCGCACACAGCAGGAGAAG 

AGTGATTTTCTGAGCAATTCCTGGACATTTAAGGATGTACTCGTGCGTTTGCTAGACATTATTGCCAGTCCCATACGTTCTCGCATCGAGGCGAT 

ATACAGCCGGAGCTTAGGAAGTCTTGGCTATCCCGGAGGTAAAGATTGTGTTAATCAAGGCTTGATTGACAACTGTTGTCATCTGCTGGCCAGAG 

TTCTGGCCGAGATTGTCTATCAAACGGCGATGGGTGAATATGACAAGCTGTTCATGCCACCCAGAACCCTTCATTCGACGGGAGCTCGCTTTGCC 

CGCTGCGATGTGAGTAGGACCTGGAATACCGGAAACTTTGGACCCGATGCTATTGCCTTCGCCGTGGATCGACCTGGTGTGGCCATAGCTGGCGC 

AATGGTTTACTCGGGATCCGGGAGCTATGACTACCAATTGGAGTTGCTGTACGATAATACGGCGGATCTGCAGCCGCAGCACAAGTGGGAGACCC 

TGGAATCGGTATCCGGCAGTTACGATCAGGATGCAGTTCACAACGATCTGGCGGAGATCAAATTTGACCATCCGGTGCATATCAAGGAGAATGCC 

CGCTACGCCCTTCGTCTATGCTCCCAAGGAGCAAGGACTTGTTCTGGCGACGCGGGAATGCCGGCGGTTCGAGGGCCTTGTGGCGCCCAGTTCCA 

CTTCTATGCATGCGATCTTAGCTTCAATGGAACCACTCCAGCTAGAGGTCAATTGCCTTGCATCCTGTACTACAGCACGCCTATGAAACAGGATG 

GACACTCGGCGAGTGGACGAACGGGCGATGGATCCAATGTAGCAACTCATTTGGAGGATCGCATTATGCTTCTTGGTCCCCACGAGGTCTCCACC 

CGCGATACAGCCCTGCAGATCGCCGCTGATATAACCAAAAAGTGTACTGAACTGCTGATTTTGGCACGCAATGCGATGGCTGCATCTTGTTCGCC 

GTCAGATAATAGTTCCAATCACACACAGACCATCGATTCCGAGCACAATATCACACCCATCGAGGAGCACATGGACATCAATTGGGCAAACAATT 

CCAGAACAGCTGCCCTGCCAACAGCCATAGATCCGCAGCTGTCCACAGCCAGGGATTTGGGCAAACGTATTGAGTCGTTTAGCAAGGGTCTCATG 

GAGACCCTAAAGTTCGACAAGCGATCCACGAATCCCTTCGAGATGGAAATTGAAATCGGAGCCACCGAAGTCGAGGAATCGGCAGACTTGAGAAA 

TGGCCAAAGCCAAAGTGTAAGCCAAAGTCAGAGTCAAAGTCAAAGTGTTCCGATAAATGGCAATGAACGAACCGCTGACTTTGAGTTCGCAGAGC 

AATCGGCTCAACAGTCAATGCCCCAGCATCTGCACTCCGATTCGGAGGAGGCTCCGCTGGAGGTTGCCGGCATGGCTGCCGGTGGCGGAGTAAGT 

GTGGCAGATGGATCTGGTGGAGTGGCAGGAGTGGGCAGTCAAGCAGCTGCAGTGCAGCTGCTCGAGGTCTTTAATCTGGCCGCCTCCAATATGTT 

CCACACGTTACTGCCACTGGTCTATGCACATATTGCCAATTTGGCTTGCAGTGATCCAAAGAGTTCCGTCCAGATTCTGGGCTTAATCAAGGAGA 

TTCTGCCGCATATCGCGGCCCTAAATCAGTTGCACGTGTCCAAGGATCAGCGACAGCCGGAGCCGGCTATTTTTGCTACACAGACCTCTGGTTCT 

GGCAATTCCAATTCGAGCAGCACCACCAGTAACCATTATTGTGTAGTGGAGAGTGACCATCCGTATAAGAGTGCTAGCATTAGTAGCTATCGCGT 

GGAGTTTCCACCTTGCGTACAGTGGCTCACCATCGAGTTTGACCCGCAGTGTGGCACTGCTCAACTGGAAGATTACCTGCTCCTGTCGATTCCCA 

TGCGTCCAGCATCGCAGGCGCCGCCAGTGCCGCATGTGGATGACTATTTGGAGCAGGCGGATAACAATGTGAATGGAGCGGGGGACAGGAGGCGC 

AACACGGGCGGTGGCATCGCCGGTAGTGGCGCAGCACCGAACACCCATCAACGCTCCGCCAGTGTCCAGTTGACCATGGCCAGTTGCTGCCGGAG 

TCCAGGTTGTGGAAATGCTCCGGGATCTGCTGCCGCACCCAGTTCAATGCCTCTGCGCAGCCAGGATCCCAATGACAGGGAGTGGATTGTGGTCA 

AGAAGTTTAATACTGCTTCCACCTGGCTCCACAATGTCCTCATTCTGCCCGGCAACTGTGTGGAGTTCTCCTTGGAAACAGCCTCACTTTACGCC 

CAGGATCCGCACAACAATCGCTATGGCTTCAAGTGTCTAGTGGTTGGCTATGACAATCCCACATCGATTAATGCCAGCAACTCGTGTCTGATACG 

ACTTGAACAGGAGTTGGCCTATTTGGGTGGCATGTGCAGTGCCAATCTGATGAAGAAGGAGCTCAATCTGCCAGATGACAAAGACGTGGAGGACA 

TGAGCGGCATTGAGGAGACCATCAACACTCATCATACCCTGCTCTCTAAAGGATTTGCCCTATCTGAACCGCAGTTGACCGTTCACCAGGCTTTG 

GAATCATATTTACCCATCGGGTGA 
(SEQ ID NO: 995) 

Start ATG: 1 (Reverse strand: CAT) 

MVGLLNPLKYGDHFYELYTNSTRSPSDLGLKNRSAALGFRLPIWVFNPYTIHIRSPRCRSTPRFLGVATGGSGQSCAEQLSAALNQLTAMSLATC 
INPLMAHEAMVRRLECRSPKMQHDRKVLTKRKSRKDKSAAAMAAAAAAAVLEGNAGAGGLVNPLDPPLVKEQLM 

FSAIRSTVLEAETRYSLFQEGGAPSGQSGSNQKKWSNNVNSACCMMCAHSCMISNLLDCSCHAHMAMHGAVGPGNAGPGGPVGVLVDGRIPAPVP 
VVVGGPGLPAEKRPRRDSANSDADSDTEEPTEREPVYATVPLIVGAGLRSLFELIADARHVHPLLCTKALKALLDVIQGQQPESFKLEPEELINP 
LYDLLLDLATMPAALNSATGAEANWSAMACAALLGLCIARGDTGKMLKAIAAMLMTPRQLSAQIVQLPVVLATLQHTVISAALNKPTRPDFHSHG 
VPHNSLIDEFPVKLPPLSTGAPITSPAMACDGVFVYLLYGGTLLKIGTGFGGSYKGHI YAQNEDFSHERSAWLGYSGGQLYFRRTCRRSAGDQLQ 
MVGLDTLAIKAMSPLSMLHMREGLNYVLFTDDDSLHAICSNRDDTLVVKKLNLYHNSYHIDPPPFELPLQLARKKFRTLGYAAFEDELLNQYQIQ 
RIQSAHNSFEPKLPAPCRDADVDVMGMACGKEFGLVRASNGRVYYYGKSAALGLKCVGRTPTLKLTELVISKAANIVHVAVGHDGIHALLVNDDG 
TVFFAGTARRGEDGDSSKNRRQPKAVKPKKMTKIDGHVVVHAACNNGTSAFVTKTGKLIMYGKDTAHCDAMGFVSELLEQHVTKVALGKAHCVAL 
NAKGQLFSFGLNNKGQCGRIFNKLQPVKDVPPFASSSTAAACFASLLPLDKRLKLDFSTLCDYDDHNLVQGQCRVCVICRECTGYNVSCVSALNV 
PLDQRLAGSICPCGHGDAGCAKCGLCAACIALQDSDEAKTELKPPPSDVQQRQQRSKTLIMRRKERKGELETGAAGGGAATPTDLDKDPPRVAPL 
APQLLQLTSSSPVVQVACGLHHTVVLTLAGEVYTFGSNQYGQLGSGDLQPVSGPVRVQVPGAISQVAAGSNHTVLLTSKGMVYTFGNYQKGQLGR 
LPSDYGLKPPPQDDDSPVGAGSDGGAGGSPTVAPVGMPGPERSQSPANVQPSGSKEMPPLLPVLTQRQKFLWNCSPGAVFGLGPCYGKKVTWIGA 
NGDQTFIKIDESLITAQMLPKMHVVANKKTILLIPSIPLSFHTLSINRRDGSCTAHYRGQTNFVKLMQAQPEQQQPEINSNLDVAVTPVADSPAH 
ASTSSLLAALTGTATAGVPINEQMSRSMHEARNQIFEEQLPEVGSSAAAAAVAAPGTPVSAGSVPRSRRGGKQGTSSPEPIPSPPQLAFTMDPTY 
NVLWVFDGAARKLRCHHVVASDINDSDANAATYRSLLSPELSLPDRVDSRVARSQASLNLLACLDILTSAQDKIPGCFEQPLLKQTQQTAETQAG 
EFQVVKRFDNFGGGWGYSGHSVEAIRFSADTDIVICGFGMFGGRGEYSCKLKLFDLGGDGGGYEKEGILISETKEVPYECGARSKHHILLPKPVS 
AVAGRWYLVWARIAGPSSDCGSCGQASVTTEDQVVFSFKSSKKANWGTDVNSGQIPAILYRLVTQDCKQTPAQMDADPVQRISRAFANSVSRECF 
ESLVVLLSWSWDCFKLQLREERDRSRPLQLQQSLQYLGYVIKSCLRLLRKYTIEIYPQRNSSTSVATGGGSNAAHDKNTPRVVGNAGVMAKYFGD 
PSTSVAPAMISSASSGGAPSTSASAAVAPGSGTPVTRKTNMENIQLAECIGNVRALLIGIFCDDIFKDIATDEGYELSLEILDECHLSFVACFDA 
FYPTSSLKWNCLCDLLAQMDRGALHSRLLSAILAGLCSPSVKLRATFSLLSAAGNERQSIISPSDNSGLPMLSSTDAHPYPVLVEQMIYRTQQEK 
SDFLSNSWTFKDVLVRLLDIIASPIRSRIEAIYSRSLGSLGYPGGKDCVNQGLIDNCCHLLARVLAEIVYQTAMGEYDKLFMPPRTLHSTGARFA 
RCDVSRTWNTGNFGPDAIAFAVDRPGVAIAGAMVYSGSGSYDYQLELLYDNTADLQPQHKWETLESVSGSYDQDAVHNDLAEIKFDHPVHIKENA 
RYALRLCSQGARTCSGDAGMPAVRGPCGAQFHFYACDLSFNGTTPARGQLPCILYYSTPMKQDGHSASGRTGDGSNVATHLEDRIMLLGPHEVST 
RDTALQIAADITKKCTELLILARNAMAASCSPSDNSSNHTQTIDSEHNITPIEEHMDINWAWNSRTAALPTAIDPQLSTARDLGKRIESFSKGLM 
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ETLKFDKRSTNPFEMEIEIGATEVEESADLRNGQSQSVSQSQSQSQSVPINGNERTADFEFAEQSAQQSMPQHLHSDSEEAPLEVAGMAAGGGVS 

VADGSGGVAGVGSQAAAVQLLEVFNLAASNMFHTLLPLVYAHIANLACSDPKSSVQILGLIKEILPHIAALKQLHVSKDQRQPEPAIFATQTSGS 

GNSNSSSTTSKHYCVVESDHPYKSASISSYRVEFPPCVQWLTIEFDPQCGTAQLEDYLLLSIPMRPASQAPPVPHVDDYLEQADNNVNGAGDRRR 

NTGGGIAGSGAAPNTHQRSASVQLTMASCCRSPGCGNAPGSAAAPSSMPLRSQDPNDREWIVVKKFNTASTWLHNVLILPGNCVEFSLETASLYA 

QDPHNNRYGFKCLVVGYDNPTSINASNSCLIRLEQELAYLGGMCSANLMKKELHLPDDKDVEDMSGIEETINTHHTLLSKGFALSEPQLTVHQAL 

ESYLPIG* 

(SEQ ID NO: 996) 

Name: RCCl (regulator of chromosome condensation) protein hortiolog 
Classification: DNA_binding 



Celera Sequence No. : 142000013384684 

TCTTCACCTGGCCATTGACGTTTTCCAGCGCAGATGCAGCAACGTTACTCAGAGCACTCATTGCGTGTTGCTGATCCGGCTGGTGGTAAAGATGA 
CTTTGCCCTGAGCTGCCAAACTGAGACACATTCTTCATGGAGCCATTGGACTTATCAACCTGGTATTCGTTGTAGCCGGAGCTGTTGGCGGACTG 
TGGAGATGTTAGGAAGTTCGGCACATGGCGGTAGGACTTGCGCGGCTGGATGTGCTGGTCCAAGAACTTCATTTTCTCCAGGTATGGACGCGACA 
GGTGTTCGTAGACAGGTGGATCGCCCTGCTTCCGCTGCGACACGTAGCGTTCGCGTAGGTACTTCCACCGCTTCTTGCACGTGTCCACTGGAAAG 
GAGGGCAACATTTATATAGAGGATCTGCCGGAGGAGCGCGTCTTCCTCACCATCCGTGCGCAGCTTCATCGCAATCAGCTGCCAGGCCTCGTCCT 
TCGTCTCGTACTTGCCACCATTGGCGCCGCCGTTGAGGTAGTACTTCTGCCGGTTGTAGATCACTCCGTGCTGCGCTACCTCGTCAATTAGCTGC 
TCATCCATCATTTTAGTTTAATTCGTGTTTTCAGCTGCCAATTTGCCTTTGGTGTATGTCAGGCCATTTCCAACACTGTTCATGGGACATCGAAA 
CAGTGTGACCGAAAATGTAGCAGCAGAGTGTCAAATATAATTTGAAGATATTGATATCGAAATATATTCGACAATTTGCATCATGTTTAAATTCG 
TCAGCATTTTGATTTATATATAGTAATGCATTTAATTAATTAACACGCAATAAAATTAATAATTTCAGCAAAAGTAAATCATTTGCTGTTCCTGA 
ATTTACTAAATAAAAATGTTTAAAAATTTAACTTATAAGTTACCTTCACAAAATTCTCGAAAAATAAAAGTCAGCTATAAAATAGTTGCTATAAA 
CGTATTATGGTTATTTGAAATTTTAAGGTGGGTTCTTTTGCGAACCGAGATTCAGTCACACTGGCAATTTGGTACCCGAAGTTGAATTGCCGTTT 
TGTGAAGCGGATAGTTACCTGCCGATAATCTTAAATAAAAATGTTTAAACTGGCCCGTATGCTCCTGCCGCAGCAGCGGATCCTGGCCAGCCCGC 
TGCGCCTGCAACGCCTGATCTCTACCAGCGACGAGGTCAACGCAGAGCCCATCATCAAGTCCATGGACACCATTGGCGGCCTCCCCACCGAACTG 
GTCAACGAACAGAAGCTGAAGAAGACTAGCAGGTAATCAATCTACCGGTTTTCGCACTTGACCTTTGCCTTGCCTGTTTGTTTTGTTTACATTTC 
GACCGGTATGGGCATGGGCATGGTATGCATGTATCGGAGGCCTGTTTTGGGCGTGATTTTCGAAAAGGAGTTTCGGGGTCTTTTTTTCTTGATTT 
CAAGTGGGGGAGAAAGTTTGTATCGAGCCGCTTATGCAGTCACGTAGACCATAGATGCGTGCATGTGTGTGTGTATGTATTTGTGCCTGCCTGGT 
GGTGTCAGTTATGGGCTCTATTGTTCTTGACTTTTGTTTTGTCCACAGAACCTTATCGACGCTCCAAAATCACTCGGTTCCCATTGCCGCTCGCG 
TCACGGTGTCGAAAGATGAGAGTCGCGACTTTATGGCCGTGTTCCCAGGTGAGATTTGTTTATTTATGCCCAGTAACAATTATCATGCAAATACT 
AATGCTAATTATATTTCGATGTGCAGATCTTGTGCGTGACATCACCACCGTGACGAAGGCATACAATTGCAGCGACGCAGCCAAATGGTTCGCCC 
AAGTCCTCCAGTACAATGTGCCCAGGGGCAAGAAGAACCGCGGCATCCTCACCGTGCTCACGTATAAGAATCTAGTGCCCACCCAGGATCTTACG 
CCGGAGAACATAAAGCTCGCCCAGTATCTGGGCTGGTGTGTGGAAATGGTAGGAGTTTTATGCATTAAATAGTTTTCCTAATACAAAGTTATCGA 
TCATCTTTTCAGCTCCAAAGTTTTTTCATCATCTCAGACGATGTGATGGACAACAGCACCACCAGACGCGGCCAACCCTGCTGGCACAAGGTGGA 
GAACGTGGGCCTGACTGCCATTAACGATGCTCTTATGATTGAAAACGCCATGTATGCCATTCTCAAGAAGCACTTCAGCCACTTGGACTGCTATG 
TCGCCCTAATGGAGCTCTTCCACGAAATCACATACATCACTACCTGCGGACAATCGTTGGACCAACTGAACTCGAACCGCTGTGTCTCGGAATTC 
ACAATGGAAAACTACAAAGCAATTGTCGAAAATAAGACTGCCTATTACTCCTTCTACCTTCCATTCGCACTTGCTTTGCACTTGGCTGGGTAAGT 
AATGGCCATACATATATAAGATTCCGCTTATTAATCTTCCGCTCTATTGTTTAGCTATAAAGACGCAGAGGCCTTCCGTCAATCGAAGACCATTC 
TCCTAGAAATGGGCAACTTTTTCCAGGTTCAAGATGATTTCCTCGACTGCTTCGGTAATCCGGAAGTGACTGGTAAAATTGGAACCGACATTCAG 
GACAACAAGTGCTCGTGGCTGGCAGTGGTGGCAATGCAGCGAGCCAATGTCGAGCAAAAGCAAATCATGGTCGATTGCTATGGCAAAGAAGGTAA 
AACTAGCTTAGCAAACCCGCACAATTAGACATTGACTTTTGAATATTAACCTGCAGAACCAGCTAAAGTGGAGCGAGTGAAAGAACTGTACAAGG 
AACTTGGACTGCCATCCACCTATGCCATTTTCGAGGAGGAATCCTACAACATGATTAAAACGCACATACAACAAACCTCACGCGGGGTGCCTCAC 
CAGACCTTCCTGCAAATACTCAACAAAATCTACCAACGTGACTCCTAAATTACTCAAGCGCAACGACTTTACGGATACGTTTTTAATTAAATTGT 
AGTTTTAGCATTTACATGGATGAACTGTTAGCAAGGGCTCAGCAATTACAACCGATATGAACTTTTTACTATACATACAATGTTCTTTGCTTAAT 
CGTCTGTAAAATCAAAGGGCGCAAAAGCCTTTCGGAAATCTTCCACATACGCCACTTCTTCGTCCTTATCGAGAAGGCAATCCTTCCAGTTCACT 
TTTTCATCGCAGTTAAGAAGATTCTCCGAACAGAACACGTCCCTGAAAAAATGAGTTTAGTCAAGAGTTGGTAGCTTAAATCGAGAAGTTAACGC 
ACCTTGCGAAATGAATGGGAAAGTGTTTCATTTGCCGCGTTATAAGTGTGCTATCGTCGGGTAACTCGGCCAGGAAATAAGGTCCCATTTCAGGC 
AGCATTTTCTCAGAATCCAATGCGGGAAGTGTCTCAAATTCCAGATTAAATTCCTCCGCCTTATCCTGCAATGTTGTGGGATATACATCATATGT 
TATAAAAGGTTGAATGTCGTGAGCATACCTCAAAACTATGTTTTATCTTCCAGGCATAGCCTTCTTCAAAGGCAAGCGCGTTGATTTGTAGGTGA 
ACCGATTTATAGTGCCGTTCAGTAAAACACACCACTTGGCCTAACGTTTTGAAGAACTTTCGCAAAGCTGCCTTGAACTTGTTGAGCTCCTTCCA 
ATCATCTGGACTTAGTTGAGCAGCGCAGGGCACATGCTTGGTGGACAATATCATGACATGGTGTTTGTTAATCGGTCCTTTGGCCAGAGCAAGAT 
AAAAATGTTCGCCCACAGTGATAATGAGATGCTTCTCCACATCAGGTGAGGATAAACAGAACCAGCATTTATCTACAAATATAGGAATAGTATTA 
AGGATGCAGCTTGTAAAATTAAGTACTAGACAAACCTTGCTCAATCTGCGGTATTCTAGGACGTTTATCTCTTTTGTTATTATCACCGCCTTGTC 
GCTTGCGTCTGCCACCATCCATGTCGTAGAAGTATTGTCGGTTTTCTGACTGAAAGAATATATAATTAGTCACTTGGAGTACTTATAGATATGTA 
AAGGTTACCCACCGAATCATTCTTGCCAATAGCTC 
(SEQ ID NO: 997) 

Exon: 1001.. 1267 

Exon: 1569.. 1663 

Exon: 1737.. 1948 

Exon: 2008. .2369 

Exon: 2430.. 2656 

Exon: 2717.. 3025 

Start ATG: 1086 

Transcript No. : CT26006 

TTCAGTCACACTGGCAATTTGGTACCCGAAGTTGAATTGCCGTTTTGTGAAGCGGATAGTTACCTGCCGATAATCTTAAATAAAAATGTTTAAAC 
TGGCCCGTATGCTCCTGCCGCAGCAGCGGATCCTGGCCAGCCCGCTGCGCCTGCAACGCCTGATCTCTACCAGCGACGAGGTCAACGCAGAGCCC 
ATCATCAAGTCCATGGACACCATTGGCGGCCTCCCCACCGAACTGGTCAACGAACAGAAGCTGAAGAAGACTAGCAGAACCTTATCGACGCTCCA 
AAATCACTCGGTTCCCATTGCCGCTCGCGTCACGGTGTCGAAAGATGAGAGTCGCGACTTTATGGCCGTGTTCCCAGATCTTGTGCGTGACATCA 
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CCACCGTGACGAAGGCATACAATTGCAGCGACGCAGCCAAATGGTTCGCCCAAGTCCTCCAGTACAATGTGCCCAGGGGCAAGAAGAACCGCGGC 
ATCCTCACCGTGCTCACGTATAAGAATCTAGTGCCCACCCAGGATCTTACGCCGGAGAACATAAAGCTCGCCCAGTATCTGGGCTGGTGTGTGGA 
AATGCTCCAAAGTTTTTTCATCATCTCAGACGATGTGATGGACAACAGCACCACCAGACGCGGCCAACCCTGCTGGCACAAGGTGGAGAACGTGG 
GCCTGACTGCCATTAACGATGCTCTTATGATTGAAAACGCCATGTATGCCATTCTCAAGAAGCACTTCAGCCACTTGGACTGCTATGTCGCCCTA 
ATGGAGCTCTTCCACGAAATCACATACATCACTACCTGCGGACAATCGTTGGACCAACTGAACTCGAACCGCTGTGTCTCGGAATTCACAATGGA 
AAACTACAAAGCAATTGTCGAAAATAAGACTGCCTATTACTCCTTCTACCTTCCATTCGCACTTGCTTTGCACTTGGCTGGCTATAAAGACGCAG 
AGGCCTTCCGTCAATCGAAGACCATTCTCCTAGAAATGGGCAACTTTTTCCAGGTTCAAGATGATTTCCTCGACTGCTTCGGTAATCCGGAAGTG 
ACTGGTAAAATTGGAACCGACATTCAGGACAACAAGTGCTCGTGGCTGGCAGTGGTGGCAATGCAGCGAGCCAATGTCGAGCAAAAGCAAATCAT 
GGTCGATTGCTATGGCAAAGAAGAACCAGCTAAAGTGGAGCGAGTGAAAGAACTGTACAAGGAACTTGGACTGCCATCCACCTATGCCATTTTCG 
AGGAGGAATCCTACAACATGATTAAAACGCACATACAACAAACCTCACGCGGGGTGCCTCACCAGACCTTCCTGCAAATACTCAACAAAATCTAC 
CAACGTGACTCCTAAATTACTCAAGCGCAACGACTTTACGGATACGTTTTTAATTAAATTGTAGTTTTAGCATTTACATGGATGAACTGTTAGCA 
AGG GC T C AGC AAT T AC AAC C GAT AT GAAC T T T T T AC TAT AC AT AC AA 
(SEQ ID NO: 998) 

Start ATG: 86 

MFKLARMLLPQQRILASPLRLQRLISTSDEVHAEPIIKSMDTIGGLPTELVNEQKLKKTSRTLSTLQNHSVPIAARVTVSKDESRDFMAVFPDLV 
RDITTVTKAYNCSDAAKWFAQVLQYNVPRGKKNRGILTVLTYKNLVPTQDLTPENIKLAQYLGWCVEMLQSFFIISDDVMDNSTTRRGQPCWHKV 
ENVGLTAINDALMIENAMYAILKKHFSHLDCYVALMELFHEITYITTCGQSLDQLNSKRCVSEFTMENYKAIVENKTAYYSFYLPFALALHLAGY 
KDAEAFRQSKTILLEMGMFFQVQDDFLDCFGNPEVTGKIGTDIQDNKCSWLAVVAMQRANVEQKQIMVDCYGKEEPAKVERVKELYKELGLPSTY 
AIFEEESYNMIKTHIQQTSRGVPHQTFLQILNKIYQRDS* 
(SEQ ID NO: 999) 

Classification: enzyme 
Gene Symbol: Fpps 
FlyBase ID: FBgn0025373 



Celera Sequence No. : 142000013384577 

GTTGGCCAAGTAGCGCGTACATGTGTGGGTTTTCGCCATTTAAGGCGTGCACCCAGCACGAAAGTCGCTTTGTACGCAATTTACCCAAAGTGTTG 
GTCGGTTTTCGATTTTTGTCGAACCACCCCACGTTCAGTAAGTCTAGAGAAATCTCTTTCTAAACGGATTTCCAGGAAACCCACGGCACTCACAC 
TGTTCACACGCTGCGGTAAAAATTCAGAGTGACCGTCGACAAGCGGTAGTACTGGAACACAAGTCGCTGTTCGCTCGAAACAATCGATGTGTGCC 
AAAATTCGATCACGATATTCCTCGCAATCGATTGTATTCGCTATTTGCCTTCGTAAGTCCCGGTTAGACCAGCGGCATGAAAAGTTTAAATTCAC 
TTCCACATTCGAGTACTATGAGATAAGATTACTGTACTTAAACTCAAAAAGATTCAAAACAACACTGATCGTTAAGCTACTAAGACAAATAGCTG 
GCCGGTTCCAGTCCCACGATTTAAAAACAGTTCACTGAATAGCAGCATATGTAAAATTCCATACTTTCATATATTTGTATAATATGATTCACATA 
AGGT T GT T GT TC AAAT GGT T G AAT ACT AC AATT AAG T CT T T AC AAC AAG AG AAT GT AAT C T GT T GG AAAAC AAT AT T AC AAT C AAATT TAT T AAA 
CAAATATAAGATTCGGGTTTAAATATTTATACTGGTGGAAAACTTTGATGGGCGTAATATCGATAGCAAATAAAATAGCGAAGTCAGGCACCCCA 
GCTCTGGGTCTGCAGCCCTGAACTTCAAGTGGCACCTCCAAAATAACGTAAAAAAGTGCAAAATAATACAAATTGAATTCAAAATATTTGCATTA 
TTTTATGCTTTGTTTTTCTTTACCATACACACTGCGAAGCCGCAGCGAAATTGCCGGATATCCGCTAACATCTGGCGATAGTTGGCGATCGATCT 
ATCGATAGTCGCTAATGTTTTCTTACATCACTAAAACTTTTGCAAGATGGAGCCTCTTTTGCTTGGTTTTAACAAATCAGTAGCAGCACGTGTAT 
AAAAAGTGCCAAATAATTTTCAGAAAATGTTAGTATATACCCTAGATTAAGCGTTTGAAATGTAAAAAGCCGCAAGGCGATCTTGTGCCGCATCG 
ATTCCGCAACTGCTTAGCATAATTTCCGCGTGTTTTAATGCAAAAAACGCGATTTTAAATTTCTGACGGTCAAAATGAGTGGAAAAAAATTACAC 
GCCCGTATGACAGTGCAGGGAGTGGGAGTGAGAGGCAGCCACCATGCGAGCGAGAGAGAGCGAGCGCAGTGGTTGCTCCAAGAGAGTTTTGTTGC 
TGGACTCGGTCTTTGCACCCCTCCCCCTTCGCCTTGGCGAAACCTAGAACTCCGAACTGGGGACTTTTGTTGAAGTCCGGTGACGTCGGTGTCGC 
CACTTGACTGGATTTTATGCACTCTCGTCCGTCCGTTCGCTTGACTCATGACTGGACAATGACGCAGACCGCTGATTCTATCTTTAGTGGACGAC 
GGCATTCGTGGGCTGATTTAGCCTTGCTCCAACTCAATTTTCGATTTGCTTTGGAATATTTGCCTGTCAGTCGGTTTTTTTTTTCTATCCCCACC 
CCGCCGTACTGTCAAAACTACTGCTATTTTCGATCAACTACTACCATTCGCAACTGGTGTTAGGCAACACTGGCGCCACTGTTACGTGCTAATAT 
CGCTCGGCTGCTCTCGCCCGTTGACATTGACACGAACGACACTAGGGTCGTCCCGCTCGCACACGCTCAGGGGCGGACTCTCGCCGCTTGTCGCT 
CAAAAAGGCTGCCTAGCTATCGCTTTCGGTCTCATACGTAAAGTCACCGCTTCGCAGAGCGCATCGCAGCACAAACGTAACGGACTAAAAGCAAA 
GCGCTCGATAGCTCGAGAAAACAACACAGTCGGCGTTCGGCCCGAATTCTCCGTTATTCGTTCTAGACAAATAGGCAAACAGTTGATTGAGTGAA 
CCGGGCCCAAGCCGTCGAAATTCAGGATATAAGTTTACCTAGTCAATTCTTTATTGTTGATAATGTGAGTGGTTGCCAAGCGGCGCTCGCGGTTT 
TCGATTCTCCCTCCACCAGCCTTACTTACTTTTCGCTTTTCTGCCATTCGCCATAACCAACAATTGTGGCTGGAAAATTTTTTACCCCCGTCCCG 
CTACTGCTTTTGTTGTTGTTTCCACTGTTCCGCCCCACTAACGCATTCTCGCTTTCTTGCAGGGATGACCGAAATGAGATACCTCAGGATGGCCC 
CGCCAGCATGGAACCCGAGGGCGTCATCGAGTCCACCTGGCACGAGGTGTACGACAACTTCGATGACATGAACTTGCGCGAGGAGTTGCTGCGCG 
GCATCTACGGTTATGGTTTCGAGAAGCCGTCGGCCATCCAGCAGCGCGCCATCATTCCCTGTGTGAGGGGTCGCGATGTCATTGCCCAGGCCCAG 
TCGGGTGAGTAAATGGCTCCGAGTCCCCCATCATCATCTTCACCATCATCCCCCTCGATCAGTCACAAAGAAAAACTTTCGAGAATGTGCAAAAT 
CGGCGAAAAAACAGCACGATGACTCAGCCGCTGCGCCTGTATTTCTTTGTTGAAATTCCGCTTTGCAGCACTCTCTTTCTTCTCCTTATACTCTC 
TCTCTTCCGTCCGCTCTCTTCGATTGTCGGCGTCGTTGCGCTTGCGCGCCACTTTTCGCTGACCAGCTGCGTTTCGGTTTCTCGGCCGCCAGGCC 
AGAAGAAAGTTCCCAGAAAGAAGGTCGCTCTTCGAGCGCTCTTGGCGAATTCCAGCTGCTATGCGTTTTTAAGGTCATCTGTTCCAAATGCGATC 
TACCTACATTGCTCATTCGGCACTAATCCATTTTGCAGGCTCTGCAAAAGAGGCTCTATATACATATTAATATAGTTCATTCTCAACTCCTGTTT 
TGTGTTCTGGCACTTGCACCTGCCTATCGATGTATAGGCAGGGATTAGTGGCCAGCTCGACCCTGAAATGGCACATCAGCTACCTGATGAAACGA 
TGGACAACTGTCTTTATTCCATTGATCTTTCCGTCATTCAAGTTTACGCCTAACAGAATTGCATAACAAATGGCGTACTGGTTAGTCGAAAATTG 
GAATTTTGCTGAATTTCCATAAGAATATCACATAGTTAGAAAAGTAATTGTGTATAGTCCGTAGCACACACTATTCTGCATACTAACCATTTTAC 
CTTTAATCTCCTCCGTAGGAACTGGCAAGACTGCCACCTTCTCGATTGCTATCCTTCAGCAAATCGACACGAGCATTCGCGAGTGCCAGGCGCTG 
ATCCTGGCCCCCACTCGCGAGTTGGCCACGCAGATCCAGCGCGTGGTGATGGCGCTCGGCGAGTACATGAAGGTGCACTCGCACGCCTGCATTGG 
CGGCACTAACGTGCGCGAAGACGCCCGCATCTTGGAATCCGGTTGCCATGTGGTGGTGGGCACTCCTGGCCGCGTCTACGACATGATCAACCGCA 
AGGTGCTGCGCACCCAGTACATCAAGCTGTTCGTGCTGGATGAGGCCGATGAGATGTTGTCCCGCGGTTTCAAGGATCAGATCCAGGATGTCTTC 
AAGATGCTGCCCCCAGATGTGCAGGTCATCCTGCTGTCCGCCACCATGCCGCCGGATGTGCTCGAAGTGAGCCGTTGCTTCATGCGTGATCCCGT 
CAGCATCCTGGTTAAGAAGGAGGAACTGACCCTTGAGGGTATCAAGCAGTTTTACGTCAACGTGAAGCAGGAGAACTGGAAACTGGGCACCCTTT 
GCGATCTGTACGATACGCTGTCCATCACCCAGTCGGTAATCTTCTGCAACACCCGTCGCAAGGTAAATTCACTCATTCTTATAATTGTGTGAAGC 
TGGATTAGTTTCAATCGCAAAGCCGATAAGTTGCTAACAAAAATGTCTACCCATTCTTCAGGTGGACCAACTGACCCAGGAGATGTCTATCCACA 
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ACTTCACCGTCTCGGCCATGCACGGCGACATGGAGCAGCGTGATCGCGAGGTCATCATGAAACAATTCCGTTCGGGCTCGTCTCGTGTTCTGATT 
ACCACTGATTTACTGGCGCGCGGTATTGATGTGCAGCAGGTGTCGTTGGTCATCAACTATGATCTGCCCTCGAACCGCGAGAACTACATTCATAG 
GTAAGTGCTCACCACCTCGCCCCTGAATCTACTCAGCTATGTGCTATACTAAACATGATACCATCACTTTCAATCTATTTCAGAATTGGTCGCGG 
TGGTCGTTTCGGTCGCAAGGGTGTTGCGATCAACTTTATTACAGATGATGATCGACGAATCCTTAAGGATATTGAACAGTTCTACCACACAACAA 
TTGAGGAAATGCCTGCTAATATTGCCGATTTGATTTAAATTTTTTCTGCTGAGCCGCAAAACAACTAAGAAAGAATTCTAAACAAAAAGTGAGTG 
AGCAAAACAGCAAACAAGAACTGCGAAATCGATAACCGAGCGCAACTATAAATTATTCAAAGATCAACAAGTCAAGTTAAATATTTATTATTCAA 
ATATTAATTCGAAAAGGCAGCCGACCGACCACCCAAAAAAAGGACTAAACCAGGACAGAGCAGCAACATCTTCAGCAGCGGCGCAGCAGCAGTAA 
CATCAGCAGCAGCGGAAGCGGCAGCAACACCACCACCAATTGCCGATTAGCCGGAACCCCTCTGTATCCATTTAGTTCTCTGTGTATCATCAATT 
TTAAAACTGCGTTGTCAACCAGCAACAACAAATATATACAAATATATAAAAAAAAAAAAACAAAACTTAAAACAACATTTGCATCTATCTAAAAA 
AAGTAATATCTGAAACCAGGTACAAGATACCACTTGCTAAGAAGAGCGAGGAACTAAACGCAAAAGCCACAATATCAGAACACCTTTTTGCCGCA 
CCTTTGTTTTAGCTTTTTCTAGAACTATTGATAAAAATATATTAAAACAAAACAAAATACATATGTACACATTATGTTTAAATATACCACTCCTA 
AAAAAC AGCT G AC AAC C G AAGC AACATTTT AC GC T AAAC ACG AAC AC AC AGC GAAT C C ACC CAG AT C T AT AAAT GT AT T AAC AT ATAGCT AT AT A 
TCTTTCGGGGTATGGCCAAAAAATATATGGTTGCCGCTATTGGGTGTATAGCATGAGTTGATATAGCTGTGGATATATACAATTGGGTGGTATTG 
TAGATTATGTTTAACAAAGCGGAAGTAAATTCCTTATCTCAAAACAGTAGAACGAAAATGGCTGTTTTTGAAAAGTTTTTAT'GTATCTGTTACGT 
ACGGGTTGGGTTAACCTCATATTAACCCAAACCTTTTTAGCCAATTTCATAAGCTTTCGCTTTAAAACCAGCTCTCTAAGCAAGATGAGTTCCAA 
AAGTTAAACTAATTTTTTAAAACTTTCAACACCACTCCACTGCATATACTTTTAAAACGAAAGCGCAGTGGGTAAGTTGCATTTCGTTAAGCCTT 
CACATCAACTTATCGGTTGAAATTCTGATAACAAGCTATCGATAATCTATCGATTCGACTCGCAGCCCTATCACGTCAGGTCATATTAATTTTTC 
AATCCGCCCAGTTGGATGTTTGGACAGTTTCCTTCCTCGCGCCGGTCGATAATGTTGGCCAACTAACACCGACCTGTAGCACGTCACCCGTAATA 
ACC CAT AAC C GC AAC C AGT C C AAGC ATC C AAG AT G GG ACAG AAAGT TTCGCGCACC G AC TT C G AGT G GGT GT AC AC GG AGG AGCC GC AC 
(SEQ ID NO; 1000) 

Exon: 1001.. 1073 
Exon: 2248.. 2474 
Exon: 3249.. 3862 
Exon: 3957.. 4180 
Exon: 4264 . .4789 
Start ATG: 2288 

Transcript No. : CT26044 

AGCCTCTTTTGCTTGGTTTTAACAAATCAGTAGCAGCACGTGTATAAAAAGTGCCAAATAATTTTCAGAAAATGGATGACCGAAATGAGATACCT 
CAGGATGGCCCCGCCAGCATGGAACCCGAGGGCGTCATCGAGTCCACCTGGCACGAGGTGTACGACAACTTCGATGACATGAACTTGCGCGAGGA 
GTTGCTGCGCGGCATCTACGGTTATGGTTTCGAGAAGCCGTCGGCCATCCAGCAGCGCGCCATCATTCCCTGTGTGAGGGGTCGCGATGTCATTG 
CCCAGGCCCAGTCGGGAACTGGCAAGACTGCCACCTTCTCGATTGCTATCCTTCAGCAAATCGACACGAGCATTCGCGAGTGCCAGGCGCTGATC 
CTGGCCCCCACTCGCGAGTTGGCCACGCAGATCCAGCGCGTGGTGATGGCGCTCGGCGAGTACATGAAGGTGCACTCGCACGCCTGCATTGGCGG 
CACTAACGTGCGCGAAGACGCCCGCATCTTGGAATCCGGTTGCCATGTGGTGGTGGGCACTCCTGGCCGCGTCTACGACATGATCAACCGCAAGG 
TGCTGCGCACCCAGTACATCAAGCTGTTCGTGCTGGATGAGGCCGATGAGATGTTGTCCCGCGGTTTCAAGGATCAGATCCAGGATGTCTTCAAG 
ATGCTGCCCCCAGATGTGCAGGTCATCCTGCTGTCCGCCACCATGCCGCCGGATGTGCTCGAAGTGAGCCGTTGCTTCATGCGTGATCCCGTCAG 
CATCCTGGTTAAGAAGGAGGAACTGACCCTTGAGGGTATCAAGCAGTTTTACGTCAACGTGAAGCAGGAGAACTGGAAACTGGGCACCCTTTGCG 
ATCTGTACGATACGCTGTCCATCACCCAGTCGGTAATCTTCTGCAACACCCGTCGCAAGGTGGACCAACTGACCCAGGAGATGTCTATCCACAAC 
TTCACCGTCTCGGCCATGCACGGCGACATGGAGCAGCGTGATCGCGAGGTCATCATGAAACAATTCCGTTCGGGCTCGTCTCGTGTTCTGATTAC 
CACTGATTTACTGGCGCGCGGTATTGATGTGCAGCAGGTGTCGTTGGTCATCAACTATGATCTGCCCTCGAACCGCGAGAACTACATTCATAGAA 
TTGGTCGCGGTGGTCGTTTCGGTCGCAAGGGTGTTGCGATCAACTTTATTACAGATGATGATCGACGAATCCTTAAGGATATTGAACAGTTCTAC 
CACACAACAATTGAGGAAATGCCTGCTAATATTGCCGATTTGATTTAAATTTTTTCTGCTGAGCCGCAAAACAACTAAGAAAGAATTCTAAACAA 
AAAGTGAGTGAGCAAAACAGCAAACAAGAACTGCGAAATCGATAACCGAGCGCAACTATAAATTATTCAAAGATCAACAAGTCAAGTTAAATATT 
TATTATTCAAATATTAATTCGAAAAGGCAGCCGACCGACCACCCAAAAAAAGGACTAAACCAGGACAGAGCAGCAACATCTTCAGCAGCGGCGCA 
GCAGCAGTAACATCAGCAGCAGCGGAAGCGGCAGCAACACCACCACCAATTGCCGATTAGCCGGAACCCCTCTGTATCCATTTAGTTCTCTGTGT 
ATCATCAATTTTAAAACTGCGTTGTCAACCAGCAACAACAAATATATAC 
(SEQ ID NO: 1001) 

Start ATG: 114 

MEPEGVIESTWHEVYDNFDDMNLREELLRGIYGYGFEKPSAIQQRAIIPCVRGRDVIAQAQSGTGKTATFSIAILQQIDTSIRECQALILAPTRE 
LATQIQRVVMALGEYMKVHSHACIGGTNVREDARILESGCHVVVGTPGRVYDMINRKVLRTQYIKLFVLDEADEMLSRGFKDQIQDVFKMLPPDV 
QVILLSATMPPDVLEVSRCFMRDPVSILVKKEELTLEGIKQFYVNVKQENWKLGTLCDLYDTLSITQSVIFCNTRRKVDQLTQEMSIHNFTVSAM 
HGDMEQRDREVIMKQFRSGSSRVLITTDLLARGIDVQQVSLVINYDLPSMRENYIHRIGRGGRFGRKGVAINFITDDDRRILKDIEQFYHTTIEE 
MPANIADLI* 
(SEQ ID NO: 1002) 

Name: eIF4A 

Classification: translation__f actor 
Gene Symbol: eIF-4a 
FlyBase ID: FBgn0001942 



Celera Sequence No. : 142000013384577 

CAGAGCAGCAACATCTTCAGCAGCGGCGCAGCAGCAGTAACATCAGCAGCAGCGGAAGCGGCAGCAACACCACCACCAATTGCCGATTAGCCGGA 
ACCCCTCTGTATCCATTTAGTTCTCTGTGTATCATCAATTTTAAAACTGCGTTGTCAACCAGCAACAACAAATATATACAAATATATAAAAAAAA 
AAAAAC AAAACTTAAAACAACATTTGCATCTATCTAAAAAAAGTAATATCTGAAACCAGGTACAAGATACCACTTGCTAAGAAGAGCGAGGAACT 
AAACGCAAAAGCCACAATATCAGAACACCTTTTTGCCGCACCTTTGTTTTAGCTTTTTCTAGAACTATTGATAAAAATATATTAAAACAAAACAA 
AATACATATGTACACATTATGTTTAAATATACCACTCCTAAAAAACAGCTGACAACCGAAGCAACATTTTACGCTAAACACGAACACACAGCGAA 
TCCACCCAGATCTATAAATGTATTAACATATAGCTATATATCTTTCGGGGTATGGCCAAAAAATATATGGTTGCCGCTATTGGGTGTATAGCATG 
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AGTTGATATAGCTGTGGATATATACAATTGGGTGGTATTGTAGATTATGTTTAACAAAGCGGAAGTAAATTCCTTATCTCAAAACAGTAGAACGA 
AAATGGCTGTTTTTGAAAAGTTTTTATGTATCTGTTACGTACGGGTTGGGTTAACCTCATATTAACCCAAACCTTTTTAGCCAATTTCATAAGCT 
TTCGCTTTAAAACCAGCTCTCTAAGCAAGATGAGTTCCAAAAGTTAAACTAATTTTTTAAAACTTTCAACACCACTCCACTGCATATACTTTTAA 
AACGAAAGCGCAGTGGGTAAGTTGCATTTCGTTAAGCCTTCACATCAACTTATCGGTTGAAATTCTGATAACAAGCTATCGATAATCTATCGATT 
CGACTCGCAGCCCTATCACGTCAGGTCATATTAATTTTTCAATCCGCCCAGTTGGATGTTTGGACAGTTTCCTTCCTCGCGCCGGTCGATAATGT 
TGGCCAACTAACACCGACCTGTAGCACGTCACCCGTAATAACCCATAACCGCAACCAGTCCAAGCATCCAAGATGGGACAGAAAGTTTCGCGCAC 
CGACTTCGAGTGGGTGTACACGGAGGAGCCGCACGCCTCGCGGCGCAAGATCATACTGGAGAAGTACCCGCAGATCAAGAAGCTCTTTGGCCACG 
ACCCGAATTTCAAGTGGGTGGCCGGCGCAATGGTGCTCACACAGATCCTGGCGCTGTTCGTCGTGAAGGACCTGTCGTGGTCGTGGCTCATCGTG 
GCCGCCTATTGCTTCGGCGGCATCATCAACCACTCGTTGATGCTGGCCGTTCACGAGATCTCGCACAACCTGGCCTTTGGACACAGCAGACCTAT 
GCACAACCGGATCTTGGGCTTCATCTGCAACCTGCCCATCGGGTTGCCCATGAGCATTTCCTTCAAAAAGTACCATCTGGAACATCACCGGTACG 
TGCCAATCGGTTCGAATTCATACCCTAAACCGCACCCTCACGCACACATGGGCATTAACACTTTTCTCTGGGCAATAGGGCGTAGTCACACAGTA 
AACAGCGAAAACATCATCCCATTCCGCATTTATCCATTTAAAAGCCGGCTTGTGTTGTTCAACACTGATTGCTCTCCATTGGGTTTCATGTGAAT 
TTTCACCCCGCAAACTTGTGAATCAGTTTTCATGTTGTTCCCTTTTGCCGGGCATCGAGTGCGTCTTAATTGCGCCGATTAGATAAGGTTGTAGT 
TATATGTGTATCCGAATAGGCGCTCAATTGGTCACGTAGCTCAGTGATTTGCCACTCGACTTCTCGGCTGCAAGTCGCGTTCAGTTGAGTCACTG 
CTTCGCCTTCCGCAAAAAGTTCCCGAAATATTATTCAAACTAATTTTATGAGTGTTGTTTATGTTTACAGTTACAGTCATTCACTATGTTTCTAG 
TCGTAGATAGTGTTTATCTTATCGCCCAGGTGCACCATTGGGCGGATCATTGATTGTCGTTCAACAACGTCCCAATCCCAAATGTTCGCTTCTCA 
GTCCCAACTAAGTCTAAAATAAGTCATGTACGTAAATCATCACAAACTATCTGCATTCTAAAAAAATTATATTGCTAACTACATGCTATGCATAC 
ATTGTTCACCGTTCACCAGCTCAATAAACTGTAACGTTTATTTGTCCTTTCAGTTACCAAGGTGATGAGGCAATTGACACTGACATACCCACACT 
GCTGGAGGCGCGCCTGTTTGACACCACATTTGGAAAATTTCTGTGGGTGTGCCTGCAGCCTTTTTTTTACATTTTCCGTCCACTGATTATTAACC 
CGAAGCCACCTACACGCCTGGAGATTATTAACACCGTGGTGCAGCTAACTTTCAACGCCCTGATCGTTTACTTTCTGGGCTGGAAGCCACTGGCC 
TATCTGTTGATCGGCAGTATTCTAGCAATGGGACTGCACCCGGTGGCCGGACACTTTATCTCGGAGCACTACATGTTCGCCAAGGGCTTTGAGAC 
GTACTCCTACTACGGTCCGCTCAACTGGATCACCTTCAATGTGGGCTACCACAACGAGCATCACGACTTTCCGGCGGTGCCCGGCTCCAGGCTGC 
CAGAGGTTAAGCGCATCGCTAAGGAATTCTACGACACAATGCCACAGCACACCAGCTGGACCCGGGTTCTGTACGACTTCATCATGGACCCAGCA 
GTGGGACCTTACGCACGCGTGAAGCGACGCCAGCGCGGCCTGGCCTCCTAAGCCATCGATGCTATTCAAGGTTTCCTGCCATGAATGTAATCTCA 
GCGTTAGACCTAAAACGGCTTTTTTATGTATATATACACCGTAGAGACATTTACAGCGGATGCAGAAGAAATAAGGGGAACAACCACACACATAG 
CACCAGCTAATCTCTGTGATTAGATAAGGGGCATTCGGACACTTGCAGCGGCTGATATGAACAATTACGTGGATATAGTTTTTGGTATTATTGCT 
AGAGTGTGAGCTTAGTTAGGTTCGTTGGCCTAACTGAGAGGCACTTTCGAGCGCTTTAATTAATTGTTACCTCATAAATTGGAAAGCCTGTTAAA 
TCTAGTGTGTAATACTTGTCCTATTATACTATTGTCTTAGTATTTTCGATACATATTTATATTGCTTTCAGATCATTTATGTGCAGACACGATTT 
GGTGTGATCAATCTTAATCCTTATCGCCGCCACCCTGGCCGATAGCAAGCCACTAATCAAGCAACCAACTACTATTTTTGATAGACTCTTGAAAC 
TGTTACCTAAGAATAAAGTCTAAATAATTTGACCGGGGTGGCTGCATGTACAGAATACACTATAAAATTTGTAAGCAAACTAATGAAATAAATAA 
CAACATCGTATTGTAACGACTCTGTGACCAATTAAACTTTTTGAAGCTTGCAAAGTTATTGCTTACGCCGATTGCATGTCGCACGTCAGTCATGT 
GGGCACAGTGCTATCAACTGATAATGATAATTTCGATAAGCACCTGTGGAACGACAAGGTAGGCAATGCGGTTATATGAATTGCTTGCCAGATAA 
CTCACACCTTGGAGAATGTGCACCTAGGAGATCGAGGATGTTTTTCTGCTTTGCAGCCAACTGGATTGGATTAATCATGGCTTAGTTACTTAATC 
TATATGTCTATTGTATTGTCGTATTGTATTCTACGCCATCTGGAATGCGATGTATTATCAACATGTTCCTCTTTTAAGGTGCTCTTTAAAAGAGC 
TTTGAT 

(SEQ ID NO: 1003) 

Exon: 1001.. 1515 
Ex on : 2239. .2806 
Start ATG: 1118 

Transcript No. : CT26064 

GTTGGATGTTTGGACAGTTTCCTTCCTCGCGCCGGTCGATAATGTTGGCCAACTAACACCGACCTGTAGCACGTCACCCGTAATAACCCATAACC 

GCAACCAGTCCAAGCATCCAAGATGGGACAGAAAGTTTCGCGCACCGACTTCGAGTGGGTGTACACGGAGGAGCCGCACGCCTCGCGGCGCAAGA 

TCATACTGGAGAAGTACCCGCAGATCAAGAAGCTCTTTGGCCACGACCCGAATTTCAAGTGGGTGGCCGGCGCAATGGTGCTCACACAGATCCTG 

GCGCTGTTCGTCGTGAAGGACCTGTCGTGGTCGTGGCTCATCGTGGCCGCCTATTGCTTCGGCGGCATCATCAACCACTCGTTGATGCTGGCCGT 

TCACGAGATCTCGCACAACCTGGCCTTTGGACACAGCAGACCTATGCACAACCGGATCTTGGGCTTCATCTGCAACCTGCCCATCGGGTTGCCCA 

TGAGCATTTCCTTCAAAAAGTACCATCTGGAACATCACCGTTACCAAGGTGATGAGGCAATTGACACTGACATACCCACACTGCTGGAGGCGCGC 

CTGTTTGACACCACATTTGGAAAATTTCTGTGGGTGTGCCTGCAGCCTTTTTTTTACATTTTCCGTCCACTGATTATTAACCCGAAGCCACCTAC 

ACGCCTGGAGATTATTAACACCGTGGTGCAGCTAACTTTCAACGCCCTGATCGTTTACTTTCTGGGCTGGAAGCCACTGGCCTATCTGTTGATCG 

GCAGTATTCTAGCAATGGGACTGCACCCGGTGGCCGGACACTTTATCTCGGAGCACTACATGTTCGCCAAGGGCTTTGAGACGTACTCCTACTAC 

GGTCCGCTCAACTGGATCACCTTCAATGTGGGCTACCACAACGAGCATCACGACTTTCCGGCGGTGCCCGGCTCCAGGCTGCCAGAGGTTAAGCG 

CATCGCTAAGGAATTCTACGACACAATGCCACAGCACACCAGCTGGACCCGGGTTCTGTACGACTTCATCATGGACCCAGCAGTGGGACCTTACG 

CACGCGTGAAGCGACGCCAGCGCGGCCTGGCCTCCTAA 

(SEQ ID NO: 1004} 

Start ATG: 118 

MGQKVSRTDPEWVYTEEPHASRRKIILEKYPQIKKLFGHDPNFKWVAGAMVLTQILALFVVKDLSWSWLIVAAYCFGGIINHSLMLAVHEISHNL 

AFGHSRPMHNRILGFICNLPIGLPMSISFKKYHLEHHRYQGDEAIDTDIPTLLEARLFDTTFGKFLWVCLQPFFYIFRPLIINPKPPTRLEIINT 

VVQLTFNALIVYFLGWKPLAYLLIGSILAMGLHPVAGHFISEHYMFAKGFETYSYYGPLNWITFNVGYHNEHHDFPAVPGSRLPEVKRIAKEFYD 

TMPQHTSWTRVLYDFIMDPAVGPYARVKRRQRGLAS* 

(SEQ ID NO: 1005) 

Classification : known^f lybase_gene 
Gene Symbol: ifc 
FlyBase ID: FBgn0001941 



Celera Sequence No. : 142000013383842 
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TCACGCCTGCCAACCACATCCTGCCCGAGGCAGAGGATGCGCCATATGGAGGCCACACCAACGGTGGCCAGACGGACGCCGAACGTTCGGCGAGC 
AAAAGGAGACGAGGGCGACGTAGTATACTGCCGCACTCGGGCAAGATCACAATGGCAATAGTGGAGGCGCTGCGCAAGGAGAACAGTTTGCTCGA 
GCAGCAGAACGACGCCTGTCTGCAGTCGCTCGTGCGCAAGGAGAAGCAGTTCCTGGACATGAAACATAACTTTCTGGCGTATATGGAGCGGCAGG 
AGGCAGTGCTGGCGTTAATAAAGCAAACGACCATCAAGCTGGAACCATGATCCCGACTACTAGCACCACGTGGTAAGGACATTGTATGACAGCTT 
TAGCTTTAAGTTGGGAATAGAGTCGCGGCGCTCGACCCTGTGTATGTTTTTCAATTCGTTCCTTTCTCAGACACAGATGTCTAATTCACTCGACG 
T A C AAAGC AGC AGC AT AG C G A AC C G AAG AT AT TAT AAT T TC T C AAAC GTTCTTTTTC TAG C GT CAT AAG T GT AAT GT GT AT G AAC AAT C GT C T GT 
GCAAAAGATCAATCTATACGGTCTTTAGTCTTAAATTAAAAACATTGTCTTGAGGCGAACTTGTATGGTAGCTACATTTTTAAAAAAAAAACTAA 
AACTAAACAACTGGCTAAAAAATAAATTAGATGAAAGACTATTATACTATATATATATACTATACTATATAGACTATACTATTATTTCTATTGTT 
ACAGAATATCTTGTTGATATAATAGTAACGAAATCAAACATAAATTCAAATGAAATCAGTGTGAATAGCCTATTAAGTTTTGCATTTCGATCAAC 
TTAGCGTGTGCAACAACCGCCGGTTATTTGTTCATCAACCCCTGCTGCAGCAGTGTGGCCGTAGCCGGAATGTGAAAATATCCGCAAACACCGCA 
TGCGGACCTCTCTTAGTCGCTCTCTTAGCGCACAATGCGCTCTCTGCCAACCGAAATGCCACAGACTTTTTGTAGGCTTTTCCACTTTTTCGATT 
TCGAGACTTCGAGCGAAACGGTCGTGTTTTTCGCGrGTGTGCGGGTGGCGGACGCGCGTGCCAATCGGAAAAGGCCTTTAAATCAAATGAGTGGC 
TCGCCAAAGGAGCGCATCAAGGTGTGTGCGATTGTATCGAGTGTGTGTGCTTGTGCGGTGCGGAGAAAGCCAGCCAGCCGCCAGTCTGGGATAAA 
AATAAAGATTAAGATTGGGATAAAACAAACGAACGGCAGCGAAGTGCACAAATTGCCGTGGAATCGGGGCTAAAGAAGGAAAAACAAGTGTGCAG 
CACACACGGATTGAAATGTTTGCCAATGTGTGCGAGCGTCACCTAAATAAACAACAAAAAATTCGATTTGCAGCAAAGAGAGCGAGCGAGTGAGA 
GAGCGCGAGAGTGGGAGGCTGTTATTGTTGTTGCTGCTGTTGCCGCCGCTGCATGTGGCATGCCCGAAAATCCCAATCACAATCAGCTGAGCCCT 
GCCGCGCCTCTCCCTACCTCACCCCCCGCACCCATACGCCTTCTCACACTGTTTGTTGGCCAGTGTAAAAGTTTCTCGTTCTGTTTTGCGTGTCT 
CCTCCGTTTCTCTTTCTCCAGGGCCACAACACTGGGCTACAAACTATGTGAAAACCCGTTTCGTTACATTTCCGCTTTGCCCCGCCTAACCCCCC 
CCCCCCCCTCCCATCATTAATTCGGTTTTTTTCTCTGCTGTCGCGTCACGCCTCTCACGTCTTTTTTTTACTGTTTTGTTGCAGTTGTTGCTTTT 
GCCCAGCGCCTCTTTCATTCATAGAAAACGGTCAGCAAAGCAAAGCATAGCGAGGCGACAGCAACAACAAACAGCAGGTAGAGCAGAAATATTAG 
CGCGATTTCTTTTCGTTTCGCCCAAAAGAGCAGCCAAAAAATCAACAAAATACGAAACACAAAGCCAAATCAAAAGAAATCAAATTTCAAATTCG 
TGCGGCCACCAGCAAATATCGAAATAAAGAAATTATCTAAATCATCATCAGCAGCAGCACAAAAATCGAAAATCTCCTCGCTGTGTGTGTGATGT 
AAGTGCGATAATAATATTTAACAATTGTAAAAGTTTATTTTATACATCATGCCTAACTGGCGCTATTGTTCGCCCCTTTTTTTTTCGTCTTCCGC 
CTTTTTTTTTATGAAATTTGCTACTCATATCAGACCAACTTCGTAACTACATCAAGTACAATTTTTAAGGTTTATAATGTTACCATAATTTGTTT 
AAAATTTCCCATGATTGTCGTCATTTATGGGGTTTTCCTTGAGCGCTAATCATGCGGTTCTATACCACTTGAATGATAAGATTGATAACATTCTT 
GGCAATCATTATCGAATCTATGGGAAAATCATAACAAGTTCCATTTTATTCGCAGGCAGAGAACTGAAATTAATCGTTTTGGTTAATTGCCAATT 
TGATGAGTTCCATTTTCCAATCGCGATTTAATTTTCACATTAACAACACATTTGTTAGCCCATTAAAAATTTGTTTGGCATTTGTTTTCTTTAAA 
AACAGCTCATTAAATCGCCGGAGTCCACATAAATCTTCTCGTCGTTTGTGGCTAAATATTGTCACTTAGCAAACACTCGAAAATATATAACTTAA 
TTTATGCGCACGACTATTAAAAAAGAACTTGGAGGTGTCGAAAAAAGCTAAATATAGGAGGAAGTGTATTAGGTAATTGTGTATTCCACATTGTT 
ATAAGCACTACTGTCTTTTTATTGTTATCTGACAATTCTATGCAATTTATTAAGTACATGGATCTTAAGTTGTATAAAAATAGACATGAGCTGTA 
CATAAAAAATCAACATTTCGCTGCGAAGTGGTTCGAACGGGCAATTTAACACGATCTACTTAACTTGTTGTGTTGCTGTTTATTAATAAAATATT 
TGCCAATTATACGACAGTTTGCGCGTTTAGTTCCATGTGTTTCGCCCTATTATTAAATATTTATACTTGATAAACACGTGCTTATTTACACCATT 
AGTATGATTCGATCATTTTCTGGGTAATCAAACGATAACTGAACTAAACAAATGTTTTTCTCAATAATTGCGAATATTTGTATAGTAAAACAAGC 
AGAGCAGTTTTCTGCATCACAGAGCTCAGTATCTGGTATCTGATGTCGCATATTGTAGATAAGAAACTATAGTAAGGGACTTGGTAAATTTAAGG 
CGATACCAACGCATTGCGGTGTTGGGATTACGTAGAACATAGGACCTTGGATGCGATTAGCTCGGAAAGATGTCTGGTTTTTAGATAAATAGCGA 
GAGAAACAATCTAATACACAAGATTCAAGCGGAAGTTTATAATCGTTGCCAAGAAAATTCCAGAAGGCTGATTTCTGCCTATGCACTTCAAAGCT 
TTTCGATAAATAAACAGTGTAAAATAGTTTTTCCCAGATTAAAAATGCTTTTTAGGATGAATAAGTTCTGGTGAATTTCTCTTTGCTAGAAATAC 
CCTCTTGTGCATTTTAAGTTTTTTAACATAAATTGAAATTATTATTTCTTTAATTTTTCCTCTGCCCCTTTTGACCTGAAAAATCCATACGTCGC 
TTGGTGACTTCTGAAATACAAATGACACCCCCAGGGGGTCACCCACCTACAAGTGACGTAACTTCAAAGCTCACATATGCATTTACACAAATTTT 
GTTTGTCATGGAGACAACTGAATCATCTTGAGCAACAGCATTTATTACACTGGTTGACAAGAAACCCGTTTTTAAACGTGTAACCAAATCTACGC 
TATATACACTTTTAGTTTCTTTATTTACrTTAACATATATAAATTCCAATATTTAAATATTTTTATATAAATAAGTAATTTAGTTTCTATCAAAT 
TTATATAAGAAAAGGCGTGAGGTTTTGTAGTCCACTTGGGTTAGCAAGTTCGTAAGCCTTTTCCTGGATTCCTTATCGGTATTGAATTGCCGTGT 
TGTGGTTAACCGCAAATAAGCAGTTTTGTTCATTTGGTTTTGGTTTACGTTTTACACTTTCCAAGACCCAGACCTTAGAAATCAGTCAGTTGTAA 
CGTACATCTTATGTTAAGAGAAAAAGATATCTATCATGATATCTAAAAAAGTTTGGTTTAAGAACAAAACTTAAATTAAAATTTTAGGATTTTTT 
TTTTCGTTCCTTAACTGAACTTTCTCTGTCACGCATTGAAGTTCAAAGTTATTCAAAACTTCCACTAATTGGCATTAATATCTTTTGCTTTCCTA 
TTTTACTACATTTTGTTAATGTGCTCATCATGCGCTGTGTTGTGTTGCCAGTTTCTCTTGTTTAGGCCAAATATGAAATGGGGCAGTCAGCTTTG 
TCATGACAACTGTAGGGCAACAAAATGTTTATCCAACTTCCTTTAACAATTGGGCACGCAAAACCGTCAGTTCCGTCAGTCATGGCCAGGGGTAT 
GCCGTCGGGCAGGGGAAGGGCGGTGAATAGGGATTGGGTTCGGAATTGGGATTGGAAGCGGGAGCTGGACAGGTGCGCCCCGTGTGCGGTGTTCT 
GACACCTTTCTCCACGGCCAGGGGCCATGTCATCGTCAGCATGACAAGCGCTGGGTTGCCCGGTCGGTTGCTTAATGGTTGATTGAATTGAAACG 
AGGGAAACTGGGTTGTTTAAAGGGCTGTAGTTGAGCTTGTGGGTCCTAACCCCTAGATGAGGGTTCAACGAAAGTCAGTTTACTGGGGAAAAGCG 
TTAAAACGGAGGTTAGGCAACTCTGGTTCCTTGCTTTTTATGTTGCGCTTATAAACGAACTTTGGGATAATATTAACCAGATATATTTGTTTATA 
TAAACAACTTGTGAATAGATCCATTAGTTTGATCTTTTAGTGCATTTCATTTTAGCAGGCTAATGCAATTTCTTACACCTAGTTAACAAATTTGT 
TTTGGTCTAACGCCAAATGTGTTCTGAAATGAGTTTTACTAAAAAGATGCATTACTTTTTTGCTTTTATGGATATTTAACTTTCAAGGGTATCTA 
AATTTATTCATTTTCTTAAAAATTGCAGATGCATCTAGAGTTGCTGACAAATAAGGATACTAGATGACACAATTAGAATGGCATAATTTCATTAG 
GACGAATTGCCTGAATTCAAGCGCCTGGCAATATTTATGCAGTACTTTCCTCACAAGCGTCGCTTTTCATGCACGAGCAGCAAAAGTTTAGTTGT 
AATTATTATGCAAATGTATACAGTTCAAGCGACGAGAGAAAATGTTTGTCGGCTAAAAAAGTGAAGAACTTGAAGGCGACGTCAAAGGCATTTGC 
ACAAATCATTGGTTGGCTCTGCCAGTGTTGCCTAATTGTTATGCAAAAGGGTTGCTCACTAAAGACAGACTGTTGCTATTGTTGTTTATTGCTTT 
ATTATTGTTTGTTTTGACTGGGATTCTTCTAAACTCGATTGAGCCTTTAAAACAGTCCATAGTTCACTTCAAAACATAATACATAACAGCATAGA 
TATCAAATTGAAATGCAAGCTTAGTATTGGATTTTAATATTTATATGCCAACAATGGTAAGCGACTAAGAGGTGGCACTTCAGTGGGGAAAAGAA 
CGAAATTTAATTAAAATTCGTTTGCCTTTACCCCCATCACTTATGCTCTTCGTCTTTGCAGCTTGTGTGTAGTTTTTGGCAGGCTGCCAAGTGGA 
AAAGCCGCCTGTGTTGTCAGTTTCCTTATACGTGGAGAAAAATTGTGGCAAATTTCGGTATTTCAAGCCGCAGATGTTAAACATTTTCTTAAAAA 
CTTACAGTTACTAGAGTAGATTCGTGTTGGCCCAGGAAAGTTCTTTCTCGATATTGTCAAATGCTCAAAAGGTGTTTAATGGATCGTTTTTCTAA 
GTGCAGGCGGTTGAGACGACTTAGAGATAAGACTAGGAGGATGGACCATGGGTGATGGAGTGTGGCAGCCACTCAAACTTGACATAGACTTGGGC 
CACGGGCTTTTGTTCGTGAATGTGCTTCTGAGGTTGTGTGTGCCTGAGTGTACAAGGACGTGAGCACTATGTGACCACTGTGGCACCCAGGCCAT 
GGACGGGAAGTCCTTAGTGGAAACCAATTTCATCACTTATTGTCCCCAGCTAAGCTGGCAAAGAATCCGACTCGTGAGCGCCCGGCACCACCTTT 
GATGGGATTTGAACACTGTGCCTTAACTCGGCTTGGATTCCTGTTAAGTGAATTAGTGAAAAACACATTTTAGTTTTGAACTGCATTCTTCTTGG 
GTTCTTGATTACTTAAAAGCTTTTCCTCACGCACAGACTGGCACAGGCAAAGAACAAGGGCGCTTGTGGCTTTTAAATCAATAGTGTGGCAGAGT 
GGTAAAGGAGGCTGAATGGATAGGCATAGAATGGACCTTCGGCTGGCGATCTGTTCATTTGATTGAACTGAGTGGTCGGAAACCTATTCAATTTA 
ATAAGCGCCATTGAAAATTGAATGAGCGGTAAGGACATAACGATTAGTATGGCAAAATGTTCGGAACGCTATAAACGGATTTGCAACAATGCAGA 
CTGAACCACTGTGTAATTTATTAAAAGTAAAATTCTTTTATAATTACCTTTTTCTTAACGCGTTTGAAATGGCTGGTTTGCTTTTCGTTTGCAGA 
GGGAGTTTACAAAGTATGTTGGGGTTTTTAAATAATTGCGTACTCAACGTGCACGAAATATTTTCCTTTGTTTCATTGCCTTTTCAGTTGTCTCT 
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CTTTTTTTGCCTGTGAAAAATATGGCAACGTACTTAAGTTTCTTGATGCGCATACAAAAACTGACAAATTGCCTTCATGAACTACCCACCCACTT 
TATTTTTGCCTTTGAGTACTCATTAAAACCGGGTAAGAATAAATCTTTACAAATTTCAACGTTCGCTGCGGGAAGTCTGTTTAAAAATGGGACAA 
TGTGCATATTCTCTAAGTAATTTAGTGAGCCAAAATCACATTTTCCTTGTTACCAAAACATACGGGTTCGAAAAAAGATACCAGAATAGAGTTTG 
TGTAAAACTCTGGCGAAAAACTGATTATTTGTTGGTGCATTTTTACTGTCAGTCATTTAAAGATTCATATTTTCCCCTACTAGTTTTAGTGGTAC 
TTGTTTAATTATTTTCTTTTGGTCACTTGCTGTTGTAGTGGCCTTTTTAATGTGTTTCCCTGAGAGCTTTATTTTAACTGGTTAGCATGGGAGAA 
CTGCGAGCTGGTTAAAAGGATAAGCGTTGCATGAAATATGAAAATGTCCTTGACCAAAATGCTGCCTGTTTTGAAAATCCTAATTAAACAAGCTT 
ATTTGTACAAAAATAATTCACTATTCTTGTGATAAGATTTAACTATCATTTCCACTTTAACTTAAATAAATGGTAAACTGCTCATTCAAAATGCT 
TATCGAATGCCTTTTAAATAGTAGTTTATCATAAGCTTGCTGAGTTTACCCCTCGATAGCTACACTCCTTTATTGGCCCACTGAAGCATCCATCC 
AGCCGGTAAATCAATATGCCACCTCGAGAGGAGTCATCCATAACGACCTGCTTGCCAAATATTCGTGCGCAGTTAAGAGGCGAAAAAATCATGAC 
AGCAGCTGGGTGTCGACAATGGATCCTTGTGCAGGCACCAGGATCGCGGCCCCAAAGGCGGGTAATGTCATTAACAACAAATTGAGCACATTAAT 
AAATATGAATTATGATTGTTAATAAAAAACGTATGGCCAAAAGCGACGGAAGTGGAAGTGTGGCGTAGTTTCGGCGAATTTGCACGCCCAGTGGC 
AGGAAGCTGACACCTATCCAGTGCCACGCCCACTTAGAAGGGGCGCAATAAACGGAGTTGCGGGTAAACCAAATGGTAATTCACCCTGTGGGCCA 
CAAGCTGATGTTGCATAACTTTCAGCGCTCCATGGAATGCGAGTGTGTGGCAGACATCATAAATCAATTTTGGTCACGGGAAGAGGCAGCCGAGT 
TTCCGTAGTTGAATAAAAATAGTCTATTTTATTTCGGCCAGATTCAATGGGGAGTCAAAGCTCCAAATGGTCAGCATATAGATAGAAGGAATGAA 
AAGCCACTGCCAGTTCATCGAAAATTCCCATCGAAACAAGTTTAAGTATCGACTGGAAAGTTTATTTAGTCGTAACCTATTCCCTACTTTTAAGT 
ACACTAAGCTAGCTGCTGTCAAACGTGGCCATACTTTTACCTTAAAATGCACTTTAAAAGCCATTTACAACCTAAATCCTCGTAAACCGCGCCCG 
AGCCCGCAAACGATTCATCATAATGGTGGCCACAGCCATTTTAGCCAAGTTACAGCTCATCCGCAGAACGGTTGCCATTTTGTTCTTTATTATTC 
AAGATACGGGTCGGAGGCGCTTAACTGGGGCCACAAATTGGGTACTACCTAGATATTTGTTGGCATTAAAAATGCCTTTGGTATATTCGATTTTA 
TATTGAACTCAAGTGCGCTTTCAGTGTGTTTTATTGATAACGTCAGCTTGTCACGTTTTTTTTTCTTCGTCCCGCTCGACGCGAAAAGTGACAGC 
CGTGAATTAGGAAAACGAAAAGTTTTCCCTCAAAAAATGTGTTCCTGTGTATTTGTAATGCTATCTGTGTAATTTGGACCCAACATTTGCCCAAA 
CATAAGGGGCGTCCTGTTTTTGCTTCTTTCACGACTGACACTTGTATTCGAACTTTTTCCGTACACGAGTTTTCGGTTTGTTATTTACAGCATGC 
CACATCACTTTCGTCATGATGATAATGATGATTCCTTTATCTTTGCAGCACCTTGCACATTCATCGCATTTCCTTTTGCCAGTGAAGTTTCTTGG 
CTCTAATGTAATGTGAAATGCTACAATTAAGTGCTAGACGAGATTATCTACTAGGTTGCGTGTCAAAGACCATCTTTTGTCATCACAGCCTCCTC 
CATTTTGAAACGTAACTGAAACTTCACACACAATGGACTTGAAAATTAGATGATGCCTTGCAAGATTGATTGCGATAGTGCTTAATTAAACAAAA 
CCTGCCCAGTTCACCGTGTAATGTGATTTGGTTAATTGGAAAATATTTGGTCTTAACCATCATGATTAGGCGGCCATTGAGAAAGTTTCACAATC 
CACTTGGTTTCGGTGTGAAACCGTAGCTTGACTAACAAATAACAAAAATAACTTCATTTAAAAATCGTTTATTTCCGTAATCGAAAGCTTACAAA 
AAGGGCAATCATCTCTTTAGGGCGAATATTCGTTTTTAAAATGCTCGAAATTAACAGGCAGATGTGTCATATGGAAAGAGAAATATATGCAAATC 
GAATCAGAATCTTAAATGTTTGTGGGGTTTTTTGTGTTTTTTTTTATTTTTTATTATTTGCCTTGGGCTGTGCATGCGTTTGTCATATGTATTTT 
TAAGGCCCCACTGCTTTTGGCCATCTTTAAGGTTGGCTGGTTGTTGTTGTTTTTTTTTTTGACTGATTGCCGCCTTGGGCACGAGTGCGTGTGTA 
ATACTTATCAGTCCATTCAATTTACATACAAAGTGCTAAGTATAATTGAAATTCGCAGTAATTTCCTACTGAGCTCCGTAAACTGTCAGTGAATT 
ATTTACTTTCATTTCATGAGTTTTTTATATTGCAAGGAAATATTTGTAAAATTCATGCACTGTAATTTTCTTTCTCAGGATGACTAATCTATATT 
CTTCTTCACTTTTCAACGCCTTTGTTTAATGATTTGATAAGCACTATTATAATTAAATTTCTAGCTATGTAGGTAACGCCTTTTTAATCTAATCT 
TTTGGCCTACGACAACCTTCAAATCACTTGATATCAATTCTATTAAATTTCCAGGCACCCGCCCTAATGAGCGATAAAATGGGTCTTTTAAACTT 
TTTATACTCTATTCAATCATTAATTTGTTTGTATTTATAAGCAAACCCCTTTGTTATTAAATGCAATAATCCGTTTATACTCTATGCAGCCTATA 
AAAGAACCAATAAATAATTTTGCAAAACAGTTTACGCGGATTTTTTTTTAAAGTGCAATTAATTCTGAAGAGCTCTTGACTGGCTTTAGGAATTA 
AAAGCAATTTGCTGTTAGAAGTTTAATGCACTCTTTAATTGGTCTTTTATCGTGCCACTTAAAAATCGTCTGTTTTTGGTACAATGTCAAAGCAC 
GATTCTGATTTGTTCGAATTCCAAGATAGCTTATCTAACAAATTGGTAGCCAAATTCAATTGCTGCAGAAACCAAATGTACGACATGGTTTGTTT 
TACAAGTGATTGGAAAATGCCGTCTTATCAACGAAAGGGTAACCAACTCAGTTACTAATAACTCTTGTTGTATACACGCATGTATACCAGCAAAA 
AATATCGTTTTCAATTAGCTATAGAATGTTCAACCCATTTTCAGGATAGTTATCTTTTATTGAAGACCAATTATTTGGCAGAGTGTTGGCTGTTG 
GGAACTCAACCCTTTGCCAAAATGAAGCTGATTATGTCTGCAGACTGCGCTGCTGGAAATTCAATTAATTTTGAATGCAATTTTAAGTGCACTTG 
TGTAAAGCATTCTACTTAGCAGACGCTTTACCTTCTGCCTGCAACTAAATTCACTCGGACATTTTCGTTTTAAGTTTTGCTAGGGGTTGTGGTAC 
TTTTTCGGTGTTTTCTGTTAAAAAAAGAGTTTATGAAATACGCTAATTTAAATTAGCTTCCGACCAGAGACTCAGTTTCACCACCCAGAGCTATT 
CGACAAATCCAATTTCGCTTTTATTTCTCTACCCCGCCCATCAGATCGGGTAAACAATTTTGTTTTGGGTCTGGTTGGCGGTATCTTTATGTGAG 
TCCCTAGTTTATATATTTATATAGTATATATATTGCCTGTCGTCCCAATGTATTCCATTAGCCAAGCGAATGCTTCCCCGGGGAAACGCACAGTT 
TATTTGCTTTGTTAATTAAAATTACGAGCGTCAGACAAATTTGTCTGCAATGTACATGCAATGGGATCATTAGAGGTGGTTTGATTTGTTGGTCT 
ATGGTTCATTTGCAGCTAATTTAATTAACAGTTTGAGCAGCACTTTGGGGCTATGCTACTCTTTTAGTATATTTTTTTAAGAGCATTGTCCAATT 
TGATCGGAATCGCCGAGAAAGCCTTTTTTATGCCACGGTGTGCCAGGCACACTTGCTTAATCTTCGGCTATTCAAGTACAAACCTGGGCCGTTTG 
GTTTTTATTATTGGTTCAAACCATGCAAGGACCTATTCATAGTCCTCACCACAGATGCAGCGGAAAATACAGAGTTCCCAAAAGTGTGTAAAATG 
CCGCGGAAATCGACTTGCCTGCAGTTGTTGCTTTTCCTGGCTCCAAATACCAAACCATTTCTGAACACACTTACTAAAATGTAAATATTAACAAA 
ATCGGGTTTGCTTGTGCATTTGTAGTTCTGTTCCAGCTTTTAATTAAGATAGCAGTAAAAACGAAAATATTAATCATATTCCAGTTCCGAATATT 
GATTGGTTATTTTTGTAATCTTCATTTGCAGTAGCATAGCTGGATTTCTTTTCCAATTCTAGGGCATCTCTCTTTCTCTCTCTCTGAGTTTCCAC 
TTTGCTATCCTTCGGTTTTCCCTTGCCAAACCTAATAAATTTTGCATTGTGCGCGGCGTTTGGTTGAGCGTCGCAGTTACCTGAATTCAGATATT 
GCCTTCCGGACATACAAGTTGGCAGAAGCTGAGGCTGTGGTCCTTTTTGCCCAACCCGCTATTCGAATCCTTAATATTCCCTTGTTGGCTGACCC 
TGCGAAAGTGCCATATGGCATGCTAAATATTTTAAAACACGAAGTGCATGCTGTGCAGCTTGTTTTGTGTTTTAAAGTGATATCATAAGCCAACT 
ACATGATGTTTGGATTTTGGAAAGTGTGAGCGACTGTTAAAATTTGTAGCTAGGTACTGAAACTTGGTCCTTTGTCCTTTAAGATGTTTGGAAAG 
AGTTATGTGCACATTTGTGCGGCCCTCCTTCACAAATTTTCCGAGTTACTGGCTGTAAGTTATTTGCAACAGTTAACATGCAATATAACTGTGGA 
AATTGTAAAGAAAATATTACTTTTGCTTATGTCTACCCAGAATTTAGAATTCGAGTGAGTTTTGGGAAGACAATTTTATAGCCTAATCTAAAAGT 
CCCAACATCATTTGATTTCTATTAATATTGTAGTGTATTTCCCTAACTTCAGTATTATAAGAGCACTTTTAAAGTTTCCACTCCAAGGAACTTTA 
GAAAAATCTTATTATTCAACTACAGTGCTCTTGACCTATGGCTTCCACTCGGCCTCGGTGCATCTAACTCCTCCATGTCTCAAGACAAAACAGAT 
AAATGACTTTATTTGTTGAGTTTGCTCTGGCTGCAACATCCCCAACCGTAATCCTATCACCACTCGAGAAGTCAAGTGGCGAGGACCAACTGCTG 
AGCTGGGATGAGCCCTTCTATGGATTCAATGGGTGAGCAAGGAGGTTCAGTCAAGCGGCTTTTAAGAAAACCATCTTACATCATAAAAGTATAAA 
ATTGCTGCACAATGTGTGACCACGAATTACTTATTTCAAATAACATGAAACGGAACAAATTTGCATTTGTTTTTTTAAGCTATCATCTTTGCAAA 
AATGGGTGAAAAGCATTTGCAAACTTACAACCTAAACCTCTTAGGTACAAAAAAGCGTTGAATTGCTTTGTTTTCCTTTCAGCCAGCTCCGGCTA 
CTCCTCTCAAGAAGTCCGAAGTACACACCTGCTAACACGGGCAACAATCTTGGGGCAGGGAGTCCTGGCTGCACTGCATCCCCAGGGCACCGTTT 
ATTCAGCAGACGATGACGGGGAAAATGTGCTGTGCAGAAAGGCAGTTTGCTTTACTCTACAGCGTCTTTGTGTTCCTTGGCGCATCAATTTTTAA 
ACAGACCAAATGCAGCTGAATTGCATTTTAAAGCAGACCTTAACGGTTCGTCTGCAATGCGTAGAAATTGGCCGGAGGCCTATTTTAGATACATT 
GTGTAGCTAACCCAAAATCCATTGTTTGCATTGCATTCTAGCTTGCTTTAGGCTAAAATACGAAATTGAAAATCTATTTAAAAACTAGTTACCTA 
ACTTAAATGTGAGTTTAAGATTGCACAACATATGAAATACATAGAATATGACCAATTGACTAATTACATCTATTACATATGACTAATTTTTCATA 
TTCTCTGTCTTTTCAGTAACTAGCAAAATAAATTAGGTAAGTGTTTGCCATTTTCTCTTACTCAACTTGAAGTTCAGTTTTCTGCCACAATCGCT 
TAGGTTTTGAATTGGCTAAAATGAATCTACTGTAAACATCTGTTTCAAGTAAAAAACTAATATTTTACTGAGATTAGGGTCTGTTTTGAGATGTA 
GGTTGATGGTAAGGGGGGCCTAATTTATGTTGAACAGAAGTACGTGCCAAATTTCAAAAAACTAAAGTCGAAAACTTTTTACGCCGACACCAGAG 
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AAACAAAAAGGTCTATGCTAATTTCGCTGGTTCGGGCAAGGTAATACCAATATTTGGATTATCCGGCAGTAAACATAAAGACTTCCAATTTAGTT 
AGCTGTGTCCTCGTTAATCACCGCCGTCTGTTTGTTGTCATGCTTTATTTTGTTGAGCCCGGCTTGTCGTTTTCATTAACCTTAATTAGCGTGTG 
CCGTTGACATTCGTCACTATCCACATAAATGCAATTGCCTTACTACACCCGTCACTATTCCCCGCCGCACAGCCATGATTATCAGCAGCAACATC 
ATCGCCAAATGAGAGTGATATTTTGTGTCACACGTTTTCCACGTCGCTCCTCCGCATTATATGTATGTATGTGCTTGCCAGCTTTTCTGGACTGG 
CCCGAAGACCTCTTCGTTGGCGTCTGATATTGACCATGGCTGGCATTTGTGGCCAACCACCCATCTCCGCCAGCCAGCCAGTTGCTCATTGCAAC 
TGACTTGGATGACTGCCATCGCCGTTTGTGTCAGCTGCAAATGAGTTTCTGGGCCATAAACGTGAAGTGGAATAGTGGCAATGTGGATGCCTGGG 
CC AC C G AGT T T T GAAT GG T C T AGGT T C AG AGG T C T T G AAT AT T T AGG AT AT C ATT T T AGT AT AGT GT AGT AT AGT AATC T T AG AAAC T AT AC GT T 
TATACTATACGTTTCATATGCCAATTAGAGTGTTAAATAATATACCAACCAGCACTAATTACGACGCCCATGCCTTAAGTCGTTTTCCTCCCCAA 
CTTGGCTTATAAGCCCCCAGTTGCGATGTCGCTTTTGCTCCATTTACTTGCCAATGACAGTGAGTTCGAAAAGTATGTCGACCTATCGCTTACTT 
CATTCACTGCCCCCACATGGGGAACTAGTTTTTTTGTATTGGAATCGCATTGCACGCGGCCGGATCGTGTGACTTTGAACTCAATAAGTGTGGAT 
TGGTCCGGTCGATTTTTATGACAAGTAGCGAAACCGAACAATATTAAACTGGATTAAAAAATGAACCAAAATGAGCCCGCCAGTTGTAAAAACTT 
TCACGTACTTCATTTGCCAGATGAAAAAAAAAAAAAACAAGCTATTTGTGCCTTTTTCCATCTGACTGTTTGTCACGTTTGAAGTTTGTTTGGCG 
TTTTGTTCGCAGTTTTTGCTTCCTACACACGGGTTTTCTTGGGATCTTGAGATCTTTGGGTCGCGGTCTGGGTTTTTTTTTATGATGGTTCTTCG 
AATGCCTGGCAGCTTGAGGACTGCTTGTCAATAAAGTTCTTGACACCTTTTCTTCATTTGCACAGATTTCTCTCGCAATGACTTTCGGTTTGCTT 
TCGAATGTATTGTTTTATACTCATTCGCACACATTCCATGTAAATTCAATTGCTTTTTTCGCTTAGCAAAATAAAGATACTCGAACGGAAGCTAA 
TGAATATAAGATAAGCACGAATACATTAAAAGTTTATGGTAGAGAGTTCGTTAAACTTGCCAACAGCTCTTTATGACCTAGATAATTGAGCTTTA 
AAACAACAAATGAATAGTTGGAAATGGGATGCTAAAAAATCCTGAACTATAAAGTAGTAATTGTACCAAAAAAAAAAGTAATTGTTCTGCTCTGG 
ATTTTCATCTATTTACTTTGCTTGCCTACAAATGCTTTATCATTCATAATTTGTATACTGCCGAATGAGAATGGCATAACGAAAACAATAGCTAA 
CCCACTAAATCGCAATAAATGGCATTTGCATTCAATTTGTTGCATCAAATGCTTAATGACCGAGTGAACAAAGGCGGCAGCGACCTTTGGCAAAT 
ACACGCACAAAGATATGAATATTTCTCCTTCCCGAATTTAATTGTCCAAACAAAAGCTACCCATCTCCACCTCCAAACTCAGTGCCCCGGACTAC 
TACTGGTCACCTGCCTGACCTCGGAGTTAAGTTTGTGTGTGGCTTAATTGCTTGTGACTTGTTGTTGCTTCCTTGTCTTGGCTTGTTGAACTCTC 
AAGGTAACAAGAAGTAGCAAGCTGGCCCCTTCCGCGGCTGCTATTTAATTTCCACGAAATGAAGAGGGAAGAAGCGGCAAATAAAATGTCTAAGA 
TTGCGATTATCTTTTTAATAACATCTCGAGGAAGGCGAACATTACCACAAAAAAAGGCGGGAAACTGGCAATGAAAAGAAACTACAAAAATCACG 
AATTTAATTACCACGCACCCCAGATAGCGCCTCAAATGAAGCAATAAAATCCCAGTTGCCTGGGAGTATTGGACATTGCAAAAGTATCTAAGTAG 
AAACATTGACCGAATAGTAAATACAATATATTCAGTTTTTCCTACATGATTTGTTTAGTTTTTTTCCGCATGCCTGCATCTCAAAAAGCATAGCT 
GACACAGGTCTTTCATATCAAAAGCGCCCCCTTAATCGTTTCAGAGTTTTGCTCTAAATGTTTTCAGGGTCCCCCCAAAAAATATGTGGCAGTTA 
TTTATTTTAAATCTCATTCGTTTGTAAACGCCGCTTCACAAGAAAGCCAACAGACTGCTAAGAATAGCCAACAAATATCTACATAAAATCAAATT 
TATCTCATTTTACACAACCCTTGCCAGGCCAACAAATGAATAAAATGTCATTGCTGAAATTCTGTGAGTGGCATTCATGCGCAAAAATCATTTCA 
AGTAAATTATAACTTTTCCTCTTCACTGGTTCTAAAACACAGACATTGTTATTTATGCGATTTCCTTGTAAAAATAAACATTCATTACAAGTGGT 
CAATTTGACTTGGTTATTTGAAGTATCGAAATATTTTGTTCGAAGTTCAATACCAAATGCGCCAAGTAACTATAAAGCAATATACTTTAATGAAT 
AATCAGGTGGATAAACTGAAAGCACTGAGTAATTGAACTATAATATAGATATTTAAGGCCAAATTGAAAAGCTCTTCCTGAGAGTTGCCCCGATT 
TCATCAGCCCTCCGCCGTATGAACGTGAGGCACTGGCTCCTGACCCTGCTGCTGATGCAGTTGTTGCGGAGGATTGTTCTGAATACTGGGGTGCT 
GATGGACTGTCGGCTCCTGCTCCTTATCTTCCATTTCCATCTGCTCCTGTGGCTCGAGCGGCGTCTGCGGCTGCTGGGTTTCCTCGGAGGACTCG 
GCATTAGGGGGATCCGGATCACCATCACTGCTGACCAGCGAGTAGTTCGGGGGCTTCTTAAGCAGGTAGCGTGTGTTCGGGTTCTGTCTCGTTTC 
GAGCTCCAGGCGATCGATCAGCCAGCCCTGTTTCTGGTTGTTTGAGGCCCAGAAATAGACGGTCCCCCTGTCCTTGGGACCCTGGACATGGAACT 
GCAGCTGGGCCTTGTCCTCGTCACAGCGATTCTTCTCGTTGGACAGATTAAAGCCCGACTCCTTGATGGGCTCGCCCAGCAGACCGACCGCTCCA 
CTGTGCTGGCGTAGCTGCTGGATGGCCAACTGATAGAACTCCGTCTGGCGTACACGGTCCTCAAGTCGCCAGCGCATGTAGGCCAGGCCCGAGAT 
GGAGGCGAGTGCTCCGCAGATGCATATGCTGCCCATCGTTCGCTTCGAGGGCAGACCGATACGCAACTTTAAGGTTGGCATGGCGGCCTCTTAGT 
CGGCAAGAAATCACTTTTAGAATTTTGTGTGTGTGTGTGTGGAGGTGTCAGAGGCCTACAAATACTTGAAATGCGAAACAACTCGATGACAATGA 
TCTCAAAGATTTCGAACTCGTAGCGCCGTTCCCACAAGCACCGATGTATCACTAGATGGCACTTACCACCGAGCCTTGAAATCACTGCCGCTGCC 
TGGGATCGGCGGGCGTTGCCTCCGCATAATTGCGCTGCAACTTCCTCCTTGGCCTGGCCAACTTAATGCTGCCTGCTGGCGAGCAGAGAGAACCG 
GCGGCCTGGCTCAATTTCATAGTCACCACCCAAAAGGCGGGGACCGCAAAACGACGTAGAAGAATCCACAGATAGAACACAGAACTCGTTTTGTT 
TCTCTGGCTAGCTGTACATACTAGTTTTTTTTTTTGGTTTGTAGGCCAAATGAGCGGCGCCATTAATGACAGCCCGAAAAGTGAGCCGTCCAGTG 
GAGATGGTTCACTACCTTGCATCCTTGTGGCCCTTGATCGTGGGAGGGTATCCAACAAGTAGAATAACGAACCCGAGAGCACGCGAATTCTTCAG 
ACTGGGAGTTCTCTGAATTTTACATAATAACTTATAAAGCGAGATGTTTTTTTAATATTTGGTTCTGATAGGTATTTCATAGTCAGCTCAACCGC 
GTCCATCGATATTTTTTTGTTTATCTCATTACAGCAGTTATATTTTTACTGGTCAGGGCCAGTGGAGTCGCGAATAGAAGCAGACGAGAGACCAA 
TGAGTATCCAATTAGGGAGCACATAGTTATTTAAAAATAGTTTTCCAAGAACGCAGGGTATGCGATGAGAAATGCCCCCATAGCGCGTCGGTAAT 
CGATACTCTTGCAACGCCTGTCACGGATGGGAACGGAATATTGCATCTGATACTGGCCAGAGTGCGGTTAATCTATCAATGCATCTTCAAGTTTG 
AATTTCTGCTGCGGCGAGCCTGATGTTACATTTCTGGCCTAAGTGGCCTCTGGTGTGGCGGTCCTCCTTATCGAGGGGGGGGGGGGGGGTTGGAG 
GATCATGAGTTACTCAGTTAGTTACTCTGTCTGTCTGCGTGCCCGTCCTTCTGCCCCCCACTCCGCTGGCCGGGAAATGTGACAATTTCTGTGTT 
CGTTTTTTAGCCTCATTAAGACCTCTCTAAACACTGCCACTTAAAGACCATTTTGGTGTGCTTATCTTTATGGCCAAGACATCTAGTGCCTGCCC 
CCTCCACCATCCACCGAGACATGTATGTATCTTCGACGGCAAGTACGCCTCATACATATGCTTAACATGACTTTGATGTACTCAGCGTGGTGGTC 
AGAGTAAGATCTTTAATAAATCAAATTGCCTTGGTATTCGGGAAGGGGGTATGGGGTACTTAAAGCTTAATCTCTCTGCTTAAGTATTTCCCAGA 
AAGTAATTCTATGAGTTTTATAGTCGTTCGGGAAGCCACGGGTTTTGCAGGTGTTCATCGTACGTATACTGAAATCTGGGTATTAACTCTTTAGC 
GCGATTGGCCATAAAAGCTAAGTTGAATGCAAAAATTGGTTAATAAGTGAGTCATTTCCGATCAAAAATTACAACCAATTGTAATGTACAGGAGG 
AACTTGTTATCAGAAATTACAAACAGTTTCGAATGATTATCTTTACATAGTAAGCTTTTTCAAGCCGTTTTAGCAGTGAGTAATTTTAGCATTTG 
ATGATTTCATATTCTTTACAACATAAGTTACAAGTTAAGCTGTAAAGATTAATTAATCCCCCGTATCAAATTTAACATTTTCTCTGGTAGACACT 
GACGCACAATCCGAGAAAAGGCAATTTATCAACTTTTTAATGACATATGTGCTTTTTGTTAAGTTTTTATAAGCAAGCGTCAAGGCTTTTTAATT 
AGTTTTATGCTGAAAACGAGTAAATGATGCTCAGCATAAATTAATTGTGAGTAGGGTCGGATCCGAGTTGAATCATAAAACTGGTTGCTTGTCTT 
CCCAAAGATGAAATATGCAATTGAGCACAACAGTGCCGCAAATAAGTAATTGCCGACTAGTCTGCCTACGATGTTCTTCGTTGTTGATTATGTAT 
TTTGTTTATTTAAATGATTTAGTCAATCCTAGTCGCTGCCCATCGATTCCTATACCTTCATTAGCTCAAATGGCCTTCAAGGCTCACCGGCAAAA 
TTATCCATTTGCTTAAAATCTTAAAAGACCATAAATTATGAAAGACTACTGAAGATGCCAAGCAGAAGAGGACAGCGCAGTTGTTCGTTCAATCA 
GGCACTTCTTCAGTCAGCCATTTGAGCGTTCACTCACTCAGCTATTATTTTCAATAAAAACAAAATAAATATGGCAAATTGTTGGGACCAGCCCG 
CCTCGGCAGCCAGCGGCAAAAAGTAAAAATCAAACTACAAGTCCAAGCAAAACTACAGAAAACGCTGTGCGCCTGCGCAGCACGCAACGCCAAAA 
GAAAAGCCGCTGGCCAAGTCAAAGGCAGAGGCAAAGGCAAAAGCTGCTTCTCGGATGCGGCTTAGTTGGAGCCAAAGTCGTAGCCAAAGCCAAAG 
CCAAGGTTTCTGCCTTTGTAATTGCTTGTCTCCGCTCGGCTTCAAGTACGGGCAGCAGAAAAAAAAAAACAAATATCCAGCGACATGCGTCTGAG 
TAATGTTAGCGCCATAAATAGGCAAGACAGACAGAAAGAGACGAGTTCGCCATAATGGAGTGCCTCGACTACCACATACCTGTTACTCAACTAAG 
AGAAGTGCGAGATTTATGTTGGATTCATGGGCGCAGCATACTTGTCAAAGCGTCCAGCATACCCTTTGCTCGTAGGGTAACCAACATGCTGATCG 
AACCTCGACGGTCAAACCTTGGTCCCAACTTGTGTGTTTAATTGCCAGTTGTAAAAATCTTTTAGGGTGGAGCTAATCACCAAGCTACAAATGCA 
AAAATAAAACTTCCCTATAAGTGAGTTGAAGGCCAACTCGGCTTTATTCTTAACTGAAAGTTCAGTAAGATTAAATTCCTTATCCTTTAATTTGT 
TCAATAAATTAAACATCTTAAATGCAAGCCGAATGGTAATTCTTTGCCCATTTTTAAGCCCTACTTAAATTTTTTATCTTGCTGTGGAAGCTTGG 
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CCACTCCCTTTTTTTAAGCAAATGGAAGCCGCATTGATTAGAGCAGCACATGGCACACATATGTTTGGACATGCCACCATGTGGACCATGGAACT 
GGATACAGCAAAATCAGACCTCGAACTACTCTTCCGCCTGAACTATGGGATATTATCTTATATGCCTGCTGCTGGTTATATGCCGCTGTTTCGGT 
TGCTGATACTTTTGCTGCTGTGTAGGAGCTCGGTTTCCATCTGGGAAATGGGTGAACAATGGTCACTGATAAAGTCCAGCGAAAGAGAGCTATTT 
CGAAGCACTCGTCTGTAAAGTAAAGTAGCCGACACAAAACACCAGCGAAGCTAGCAGCTCGATGTGGGATCTGTGCCTGGGGATTTCTTTCTTTT 
ATCATCGGCTGTCAGGCAATCGAAACTACATTCAAGAATTTTCTATTAAGAAAATGTAACCATAACATTTCCTGCAAAGGGAGCGGTCGGCTTTT 
GTTTTGTTGTGCAACTTATGGCAACAAGTTAATTGTTTCTTTTATGTTCGTGTAATTGCCGGCGAAAGCCAGCGCTCACTTTACACTTTTTTCTA 
CTTCAAGAACAAACTAGAATAGCTCTTTTAGTTTTGCAATATAATATATCAAACGAGCTTCAAGGGGTCCCAAGAAAAAATTTTTTTTTTTAAGT 
GCTATAAATAAAATGGGTGTAGGGTCTTTTAATCTCTCTATATACATTTCGCTACCCATTTCTTGCCTTCTTTCAGCAATTTGAACTTATACATT 
TTGGATAGATAACATTTCAATTTAAAAACACACTACACTGGCAAAATAAATAAGGCAAATAGCTCTTTGTTCGGCTTGGTCTTTTCTGAAAGTGC 
ACTGCGGTGGAGGTAGGAGTGGGAAGTCTTGGCACAGCCTCCTCGAGCTCTCCAAGATGCAGTCGCGGATTAGATGACAGATTAGTTAAATGCGC 
ATGAAATGAGGATCCGTTCGCGCCGTTGCCTGCTGGTCGGTAGAATCCCAGAACCCGTGGCTGCCTTCAGAACGCTTCCTACTTGGCCTTATGAC 
CATGCGGTGCCTCCTTCGCCCCCGGCACACGTTCCTCTAATGTCTTCATTCGTTAATCAGGCGTCGGGCCAAGCACGCGTAAATCGCCTCTATTA 
ACAGGCGTTGTAATTGCCGGTTGCATGCAGGTGATTAGCTGAAGTCCGCTCCATAGCGGAATTAGCACCGAAAGATGCGTAGTTGAGATACTTTT 
CGATTCCCACAGGAACTCTTACATTTATTAGTCGCCGGCGCATTTAATTTAAACGCGTCTGCTTGTGACATCACTGCCATTAATGAGCTCATGAT 
GTAACGCACTTAAGGGTCGTAAAAGTTGAACATTATAACTAAGCCCACAGGCTTTTTTTATTCCTCTAGGGCTCGAATAAACAATAATTTAAATT 
TAGCTAATGAACCTAATGACCTCAGGCTGCCCTTGAACGAGTTGACCGTAATTTATATAAGTGGAACAGCCATATACCGCCGCACAATCAGGCAA 
TCATTCCCACAAAATAATTTCCTAATCGACTTCCTTCTGGTGGTAGTTCATATTTTTAACCGCAAGTATCCGAGAAACCATTGGTAGTCCGCTGA 
AATAGCCACCATTTTTACAGCCCACGTGGCTATGCGATTGTATTTCCAATTGCATTTTAGCAGATCCGATTTGAGGCAATCAGCGATTTATGCCC 
CATTACGTCTCGTTTACTAATAACCCATAAATGTGCATTCTATCACATTTGCCAGCATTTCACACTAAGATCTATTGCTATAAACTATGCGAGTG 
GGTTCTCATATTTGTTTGGTTTCCACTTCATTGAGTTTTCCTTTGCTATCGGTTGGCCATCCTGTCTAATGTGCTTTGTTGGCTAAAATCGCTTC 
TGATAGGATTAATAACACACCATTCGCATAACAAAGCATCGGTATTGCACATCAACAAAGCTTGTAAGAAAATAAGAGGAAACAATAAATAATGC 
TGAGTCGACTGGAACGAATATTTAGATACACTGAATTGCTTGGTGAAGGATTGCTGATTACAATGAATATTAATTGTTCTGTTAAATATTTAGCT 
AAAAGTTACAAAGGTTGTGAATGGGTTTACATAAATCCTATACATTTTTTGTAAACACGAACTGAAAGTATCATTCACTTTAAAACTTTGGCTTT 
TAAACGAGGTACTTATATTTTAATTTTAAAGCAGTACTATGTATTTTAAGATCGATTTAAAGCCTCAACAAAAATCGCTGTTGCAATAACCATCC 
TCAAATGCTCCTGCCTGTAGCTGTTTTGTTGTAATGACAAGGGTACAAAAGAACCTGCCACACCCGAAAACTTTTCCCTCGCTCCTTGCAACTCG 
ATGTGGAAGAAATGTGCGCAAAAACCAACAAGTGATATTAAATGTTGCTGTAAAAGGGGGGAAGCTGCTCAGCCCTCTCAATATATTAATACCGG 
CTACTTGTAGATTAAAGGGGTATACAAGACTACTTCACAAATATGCAAATGTGCTTCACAAATATGCAGTAGTATTACAATCTTTTTAAGGACTA 
ACAATATTCATCTACAATCCAAACATTTTTTAAAAATTTGGTGTCCAATTAAGGTATACTCCTCTAGTGGAACTGTTGCTCAAAGTGCGAGCAAT 
GAGTATCTGATAGTCGAGCCACTAGTCTCATATCTTGCTCGTTCGCATTTTTCGTGCGGTTCAGCGAGAAGAAGAAGACGGCTGGCATACGGAAA 
AGCTGTATACGTATACGGCGAAAATGTTGAACGCAATTGTGCTCGCACTGCTGTCACTTTTCCATTTGGTTGCCGTCCGCATAAATAAATAAAAG 
TCTGCGAGTGTGTCCGCATCCTGGTTGCCGGCTTTTAGCGCCGTTGTAGCTTGTTGCCAGCTTTAAGGCGCTTTAGGGTCCTTACACTTTCCGCG 
ACTCACGATGATTCACCTCACCATTCACGTCCTGCGTTCGTTGTCATCATGACTTTGTTCTGCAAACGCACTCACCTTCTACTCGTACCGTATTC 
ATGGCGCGTGTGATGAAAATTCGTTTTCACTCCTTGCGACGGCTGGGCGAAGTAATGAACCCGTGCTCACGTAAAACGCTTTCAGTTCGCTAGAA 
GAATATTCTGTTTTTTCTTCTGCTCCACAAAGTACCTGCTAATTAGCTCTCACCGAGTGGTAATCATGTAAGTCAACAATTGAACGAGAAATTTC 
ATTAGGGCAGCCAACAATAACAGCGCGAGTATCTCAAGTACCTGGTCCGCTGTTTGGCTCCTAACTCTCCGACCTCACGATTGCGTCACTTTCCG 
CACACCGCGTCCTCTCTGACCAATCCATTAGTCGGGGAAAATCACATAATGCATGTAAACGATAATGAAAATGACATTAGTCCAGGCCGCACGAG 
AAGCGGCAAAGCCAACAACAGCAGATTGAACATCAAATTCAGATGTTTTCGCATTCCCAAGGGGAGGCAACAAAATATTTGCACAGAGATGAACT 
TGTGAATGCCCTTTGAAATACTTTTTTTTAAAAATTAAGCAATTAGCAACATTTTAACTTTGCCAAACCGAATGCAATATGCCCTTAGATGCACA 
TTGTAGCATAATAATGTAATCACAATCAAATGAGTTAACCAATTGAGAACAGGGTCTTGCTTAATCTGCAATACCCTGTAGAAATGCACAACCCA 
GCGAACGGGAAAGCGAAATTAATGAAATTAAGCTAACAAACTGGGACCAAACCTCTGAGTGTGTGTGTGTGTGGGAAAATGTATCTATGTGGATT 
GGTTTACGTGGTTCGCCCACGCGTATAGCATTAAGTTTTCCAGCTCCTTTTTTTGTCGAAGCTACAATCTTCGCCCCTCCTCGTCCCCGCCCCAC 
GGAAGTGATAATCGATTCGCACCTGTCCTGATGTGGGCGTGTGCTCTCCTGTTGCCAACTTTTCCCTTACCGCCAACTGTCGCTCATTATACGAA 
TTAATGTCATGCATTCGCGTTCGATATTTTCCATCTTATATTTAGCATTTGTCCAGGGGAAAACAAAACCGTTGACATCAGCCAGTGGACATTGG 
GTGCAATTGACTACGACATAATGGCATGCAATGACATATTATTGGCAGTTAAGAGACTGTTGACCTTTAGCTCTGCTGCAATCAATAATCCCCGG 
GACGGAATTCGTGTTAAACAATAAATATTTACCTAATAATATCTTTGTGATCAGCAAGTCAGTCGATATAGAGGATAATGTCCCAAAAATGTATC 
TGAAATTCTTACTTTATATGGATATTATTCCCTTTAGCCATATTCCCTTATCTTCACGGATTTCAACACAATTTCCTTTGCATTTTCTTCTCTCT 
GTTAACATTTGTACTATAGAAACTTTGTTTGCTCGTTTGCTACTTACTTACTATCTGGGTGTTGCGGCAATTTATTGGGTTTTTGCTTCATAACT 
TCAGTTTTTGTACATTTTGATCGCGTGATAGTTGGCGTCGCTAAATAGGCAAAAACTGGCTGGCGATATACATAAACCCGTACAAGGTTCTGTCT 
CGACCGAAATCGGTGGTCCGTGGCAGCACCTACCCTCTAATTCGTGGCTTGTTTGTTTGCCTTTCGCTTTCGCCGCGTCTTTCCTTGGACGCTCT 
CTTTTGGCCAAAAGGTTCATTAGTGCCAATGCACCCATCATCGCAGTATTTGTCGCTGCCAGTTGGCAGCAAAAAGGGAGCTCCGAATTGTGTTA 
CGCGTCCATGAGGTCAAAACCAATCGATTTTCTCGACTTTTTCGGTCGCCTTTTTTTTATATGTGATACTCCTGTTTTGTTCTTTGCATTTGAGA 
ATTTTGTTTTCTATGGCACACGCTGCGTATGAGCAATGTATTTGGATAGGTTTGAACTTATAACTTCCACGAAACTTTATACACAGCAAAGATAT 
CAAGTGAATTTTTTGAATGGGCTAAACAAGGTAATATTAAACTAACTACGGGTGAAACAATCTATTCCATATAAAAATAAAAAAGATGGCTTGAT 
TTGATATTTCCGTGACGGTTGTAATTATTGTTGAGTATGGACACAATAATTTGTTGTATTCCTTTGTGTGAGTAACATTGCACGGCTATGACAAA 
TAATTGTAGCATTAGGTGAAACGACATTATTTTTAGTGAATTAGTGGGGTCCCGCCAACCAATAGGCTAAGTAATGTGAATCCAATATCGATCGG 
TGCGAAAGTGTTTTTACGACATGCATCTAGACTTTATGTCTAATTTTCTCAAATTAGTTATATGAAATCATCCCTGGCTATCACATAAACCAATT 
TTATAGTTGTTAGACCTGAAGTATAATGTGACTAATCAAGATTCAAGAAAACGTTAGTTTTTTATATGTATTTTTTATATTCGAGATCTCTTTAT 
TTCTAGCCATCGATTGGCAATGCTAATGAAGTTAGTACGCAAGTGCCATTGGTTGGCTTGGGATCTCCTAGAATTTTACGAGGTGGCGTAAATCT 
TCTTCCGCATGGAACCATTGGAATCGCCCGACGAACTGCGTGGCCACACCCTCAGCGAAATGGTTATAACTCAATTCTGTCGTTTCCGATGTCAA 
AACAAATGAACTAATATAAATTGAAGACTTTATGAGTGTTTCTATGAGCGCGTCCTGTCCCAAGCCATAGTGTTCCTTTCTTTCCGAGTATAAAT 
ATGCCAATGGCAAACATGGTGCGACAGGTATTTGGGCATTCATCAACTTGTGTATGACAGCATGTTGTTGGAACTGCTCGTACCATAGGTTCCAC 
TCCCTCTCTGGTGTAGAGAATTCGATATTGACCATCAGGCCATCATTAAGTGGCCATAAAACTCTTTGTTAATTTATTAATGCGTAATTATCGAT 
AATTACAGCCATATCTATTGACTATGGGTTGGCAGTTTTTTGTTTGTGTTGCCTGCCCGGCAATTATGTCATGCATTGTGTTGAATTTCCTGGCA 
AATTGCTAATTTAATTAAACTTTATTGAACACTATCACTATTGATTTTCAGCCAAGCAGGCGAAGCGAACATGAACTGATTAATAAATAAATTCG 
CCACTATTTTTACTTATCTAATAGTTTGAACTGTGCGGAAAATGAACCATAGATTTCTACCCCCGATCAAGTGGGTTTCTTCGATCAGCTCTACG 
ATCCTCCCCCCATTGACCCCCGCGCCCAATCACTTTATAAGCTAGCTATCCAAAAATAATCAGCATAGGCAGAAACACAGAGGCGAAGCAAACAA 
TGTTTTCATAAACGATTTTCGCCTGTGACCGCGTCGCAGAAATTTCTGAGAGAGGCGCAAAAGGCGAAAACGAGAAATAATGAGGCATTCCATTC 
GGCAGAGCCTCCCAACCATCTGGAAATGCCCAACAAGGGGCCAGCGCAATCTAATTAGACATAATAAGGCAGGCAGGCAGGCAGTTGGAGGGGGA 
GGGGAGGCACCATCGGGTTGAGGGCACCACTGGTCATATACCAAAGAGTTTGTCAGCTCGCAGCGGATCGACTGTTGCCAGGCGAAGGTTAGTTT 
CGCTATAGCCAACCAGCCACCTCCATCCACTCCAGCCGCTCCGACAACTCTTGGAACTGACCAAACGACCCAGCCGGCGAGGCCCCAAACAGCCT 
CACGAGAATTGGCGATCGCAGTGCGGAGTTTTTCCGGCCATGCTCTGTGCTATTAAAATATATTACAGCAGCAATTTCAGTCGACTTTATTTTTG 



FIGURE SHEET 545 



CGCTCAGTTGCGGCTGTGGATGTCCGCAGTCCGCAGTCTGTGGTCCGTAGTCCGCGAACCGCTGTTCGCCGCTTATAACTTCATTTTTTATGGTC 
ATGAAAGTCAATTGCAGTTTGGCATTTCTTCGATGGATTTTCCCTCGTTCGCCCGTGGAAATGCCAGCATCATCGCTTGGCTGCGACTTGGGATC 
TGTATTGCGGAGAAGAAGTCGTAACTTGGCGTAGAAAACTATTATTCACTTTGCAATGGAAAACATAAACCTATTTTAATTGCACTGCCCACTAG 
GCAAATGTGTAAATTTTTTTTGGCAATATATATTTTCTAACCTTTGTACCTATCTTAAAGAATTTTGGTTCTTGATGTTCATCTTGAGATAACCG 
GTTTAATCAA7ATGAGAAGTT7ATTAACCTTCAGTCTTATCGATGACTAATGAAGTATGCAAATCTATATATTATAACAAACCAAGGATACAAAA 
AATGGAGATTTTTCTATACAGAACAGATTGAGCACAACATACCCTTGAACTCTAGGGTGCTTTTTTTTTTTTGTAGCAACCCACATTCATTGTTG 
CAAAACAAACGTGTGTCGCACTTCACGTTGCCGTTTTTGGGGGCACAACTTCAGTTGGCAAATTGAAAACTCATAATAATTCTCACGTTTCGCTC 
CAAATTTTTTAAGTCATACAACTTTGGGGTTTTGCATAATCACCATATATCACAACGAGTCACACACGCCGCAGCCAAAACAAAGTGACGAAGAA 
CAGCCGCAAAGTGTGTGTACGTGTGAAAAGGAACCTGAAAAATCATATAAACATATAAATAGTTTGGGTATCGAATGATTATAATGCCAAATGTG 
ATCGTTAGGGTGGCAACTAAAGCTCCAGACGTGTTGCCAAGCCGACTTCCTCTGGTTCTGGGATTAAGTTTCTCTCTTCAATCAAACTTTTATCG 
GAACTCAAATTGTTTGTGTACTTGGCTCGGTTTCCTGTGAAGTGAAGTGCGGATTGACCGACTCCTACTATGCAAATTCATCATCCGACAATGCC 
CCAGAAACACACAAAACCGGTGGCACTTTAAGTGCCCTCCAGGAGGCCCACGCGTCCGCCTACCAACCAAATTAACGATGGAGCAATGAACTCAA 
AACACGACGGTCCATACGGTCTAATAGTTTTGAATTTCGGAACTCGTGTGGACCTCTGATGGACCCCCAAGGCAAGTCCGCTAATGAATGTTTTT 
GTCTTCTCCAGAAGCCGCGATTGATTTGTATTCATCTGGCCAACACACCTTGCCAATTGTTGTTTTTTTTTTTTTGGCA2^AGAAAATATACTTTG 
TGCCTGGAAAATI^ATATTTTTATATGCGATATTCAGGTGTAAGGTCAGGTGTTTCTTATGGCAGTTCGCACAGAACAAAGGGCAGTAGATGAGCA 
AGGTTTCAATAGTGTGCTCTAGGCAATCCCATACAAACCAGTTGTTTTTCTAAGCATTTACTGGGTACTTTTTTGCTTATAAAGAATTATCCAGC 
C T AAAAAAAC T T AGC G AGT TT GG AT AAG AAC T AT C T GAACT C G AAG C GG AAG T T AAGGGC AC GC T T AAAT G AC GT AAAAT G T T T AACAAT TG AAT 
GCAGCTTTTATATTGTATATTGTATTTTAAATATGTATACTATTTAGCCAAAGTCGTTGCACCCGTGAATGAATATTTAATTTCGGTGGCATCCC 
TCAGATTTATTGTGAATCACAAAGTTGCCGTGCGCACATTTTGTATGCACAAGACCTGAACTGCTTTCAAAGTGGCATTGAGATCTTTAAACAAG 
TCGACACCCGCATCTTGGAGCAGAGAAGAGTGCTTGGCAAACGGAGACCCACGTAAGCACTTCCTGTGCGACATTTTGTTTGGTTAACCACGCTG 
AATGATAAGCAATCATTCAGTTAATTGATAACGAGCTCAAACAATACGCTAGCTGAAAAGGGAAGCATGTACGATACGTGGGTTGGAGAAAACTG 
TGCCAATTAGACAGGAATGCAGCAGTTTCATTTCAAGAGTCTGCGGCTTCAGTCTATTTGCTACCAAATGGAGCCCAATAAAAGGCAAGCAAAAC 
TACATATATTCAGTTTCAACCGAATGTTTTTAATATTTATCAATTACCGGTCTAAGAAATAGCAGCAGTTGACTTATCACATTTTCTAATAGAAA 
TATTTT GAAATT CCGTAT AAAAGCTTACATTTGAAAATGTAC AAGGAAATTAAAAAGC AATTCTACT GCT T TT TAT T ATT TAATTAAAAAT AAAT 
T CG AGT G AAC AT T AAC AAT TG ATT T AAAG AGT GT AC AAAT TT T T AAT G C AAT AACCT ACC T AT T T C AAGT GAAC AAAAAAC AT AT AAAAAGT AAA 
ACAACTTATGAGTTAAGTTTATAGATTTCTTTCTGTGCTGGTATCATTATTTAATTAAGATTGACATTTTAAAGCTGTTGAGATTTCCAAGAGTA 
ATACGCGTTAAAATATGACGTCTTCTCTATCAAACTTAATCTTGTGATTACTCGTTTATTGTGTATTTTATAATTAAAATGCTATCAATGAAGAC 
GTCTGATCAATAAAAAAAAAAACATGTTACACTCAATTGTTAAACACAGCCTGCAATTTTCAATTAAAACTACACGCTTCGGATAAGCGTTGCTT 
ATATAAGAAATAAACTTTTATATTTTTTACTTTTCTAAGTACACAATAAATCTGTGCGAGTTTATTCATAAAAAATATAAATAGTTTAATTTAAA 
TTCTGTTATCACGGCTCAATTTTCTAGCATAAGGCCAGCAATGACTTTCAAACTTATGAATTCTCAAACCTTTGAATAACGCCATCTACCGTAAA 
GGGTGTTTCTTTTAAATCTGTGAGAGGGCGCTGCTGTAGAACTACATGTTTTCTACATGGATATCTACATGCTAGCATGGACATGATTTTAGTTT 
CGCGCATCTGTTGTTTTTCAAATCATGTTTTACAAATGAACATATATTTTAATGTTAATTTTTTTTTGCAGCTACAACATGCGTTTTGAGCCTCT 
GGGCCAGAAAATGTTCACCCGCCGCAATTGGTTGTTGCGCTGCAGCATCCTTATCAATGTGGCGGTGATCCTGTACATTGGCAGTCAATTGATGA 
TCGGCGGCGGCAACAACTTCTCGAACGGCGGTGGCTTCCTGCTGCAGGAGCAGCAACAGGTGTCAATGATGCAAGTGGGCTCCGCCTCATCAGCG 
GCCCAGGTGCAGCAGCAACAGCAGCCCACTCCCAAGAACCAGAACGTAAGATACATCTAGATAATGCTCAAGCGAACTTTGGAACTAACGCCTCG 
TTTCCTCTTTAGCAGGTGGCCGAGATCTTCGAGTCCGAGGAAAAGCAGCTGGTCAATGCCAACGCAGATCCTCCAGCAGCAGCGGCTCAGGTGGC 
TGATGGCGGTGCTGGAGCCCCGCAGTTGGCGAACGATAGTGGTGCCGGGGCGGCTGCTCTGGGCGATCCCTCCCAGGTGATAGGAAACATTGGGC 
CCGGCGGTGTCGCCGAGGTGAACAATAGCCAGTCTGACGGCGGCTACATAGTCACAGGCGATGAGAAGGAGCTGGAGGAGCGTGTGCGATCGTTG 
ATCAACTGCTTCGACCACGACTACCACCAACAGACCTTGCAGCGTGGCGACTTCTGGGTGCTGCAGAACTACGTGCGGGCGGAGCATGGCGACAT 
CAAGTGCCATGAGTCGATCACCTACACCACGCACGCGGACTACACGTTCCTGGACAACCTGGTGCCGCTGTTGGAGCGCTGGAACGCCCCGGTGA 
GCATTGCCATGCACGCGCCCGGCACGGACTTCCAGCCCACTCTGGACTCGATCCGGTATCTGCGAGAGTGTCTGCCCGGCAGTCACCTGGTGCGC 
GCCTATACCACCTTCCACATCTACTTCGGAACCAAGCACATTCCTAAGTCTGTGCCCAAGCCTCACGAAGTCTTCAAGACGGGATACAACTGCAC 
ACTTCCACCTCCGTACTTCAACGTCAGCTCCGGGCACCTGTACAAGGCGCAAAAGAAGTTGCTGTATCCGGTGAACGTTGGTCGTAACATTGCCC 
GCGACTCGGCACTCACCCACTTTATTCTGGCTTCAGACATCGAGCTGTATCCGAACCCGGGTCTGGTCAAGAAGTTTCTCGAGATGATTGCTCGC 
AACGAACAGTACTTGAGACGCAAGGCACCCAGGTGAGTTGCCTTCAGTTCCTTGACAGAAAGACCTTGTGATTGAGATCTAACATCTATTGCACA 
TTACAGAGTATTTCCATTGGCCATCTTCGAGGTGGAAGAGAACTCCCCAGTGCCACACGACAAGACAGAGCTACAGGAGTTCCTTCGCACGGGCA 
AAGCCATACCATTCCACAAACGAGTGTGCGCCAGCTGCCACGGGGTGCCCAAGTCCAAGGAGTGGATGTCCGCCAACGAGACAGATGAACTAAGT 
GTGTTTCACATAGGTAATGAATTATAATTATATCTTTGTTGCATTTAGATTAATTTCTTATGTTGCCATTGCAGGAAAACGAACGGGTTACTATG 
TGCACTGGGAGCCCATATATATTGGAACTCATGCCGATCCCCATTACGATGAACGGCTCAGCTGGGAGGGTAAAAGCGACAAGATGCCTCAGGTG 
AGTCATTCCGCATTATTGCAAAGCAGCAGTAAACTTAATAATATTATTTATTCACAGGGCTATGCGCTTTGCGTAATGGACTACGAGTTCCACAT 
ATTGGACAACGCATTCCTGGTGCACAAGCCGGGCATCAAGGTGCTGAAAAAGGACAACCGACGAGCGATGTTGTCCGGCAAGACAAACCAGCTGA 
TACGGAAGATCATCTATCCAGAACTCAAAATAATGTATGGCATGCGCAAGGGATGTGCGATATAGTAACTAATACTGAAGCCAAGCTGAGGTCAG 
TCCCGATTGAACGGGAAGGAGGAAAAGGCGAAAAGACTACAACCAACTTCACTATTACTATTTGCAATGCTATAGATCGGTTGTGTTGACTACCG 
GTTCCCATTCCCTCTATAGCATTGAATGCGCACTTCAAGTGCGTGTTTTACAATTAGCTAGAGACTTAGCCCTAGAACTATAGAGCATTATGTAG 
TTAGGCTTAGCTATAGGTCAGGACGTGTAACGCCATGGGGGCCTATTGGGTTTTACCGCGTTCTGGGTTATCTTTTTCTACTCTGCAAAGCCTTG 
AGAGCATGGCTTACATAGACTTAACGTAGTTAAATGTGGCTTTTAGGTATCATGGTGTATTCCATTTAGCTGGCCATTCTTAAGTTGGAACTAGA 
CGTCAAGTAAGCAGTCGGCCCTTTTGATAGAGCGTGCTTAAGAATACACGCCCAAAATTGATTCACTTCCAAGAAATTTCTGTAAATAATAACAC 
ACTAGGACCTTTCATTTCTTCTGTTCTCTGGACATAAGCGGCCAGAGACCTGGGTTATGCGGTGTCCACCACTCATAGGCAGTCATTATCAGTTA 
CTTTTAGTCTAGGCCATAAACTTAAGTTGAACGTTATTTAGTTGAATTTTCCATATACATTCATTAACGAGTTAAAAATAGTTTTTATTGGATAT 
TCTTTTCAAAATACTCAAAAAGAGAAATTGAGAGCAACATTATTTTCGTATACCGGGGCGAGTTTCCTAGACATCAGCAAAACAAACAGTCTTAC 
CACTTTTTTGTGTTTTATATATTTCCGCATTAAAGGCATAGATTAAACAATGTTTTTTGGGCAATGTAGACTTACTGCAAAGACCAAAATTTAGA 
TTAGACCGATACCGGGCCAAAACAATTCCAACGTTAAATATTTGACTTGATTAGCAGAGTAGGAATCATTGAATTGAAACGTATTAAATAGACTT 
TGAGAGTTAAAGCGTAATAGTAATAATGATAAAGACAAGTTTCAAACAAAAAAAAACACAATTACTATTAAAATTTATACAAGAAACACACACAT 
TGAGAATTGATTTGTTTACTTACCGCACTGAATGGTAGGCAGAGCAATTAATGCGATTGAAGATTACATTTCAATAAGCATAAATGCGAACCGTG 
ATATTTGTACTAGTAAAGCGATTTGGAGATGTTAAACCTATTTGATAAGAAATGAGTGGGCGACGATCAAAAGGAAAGATCAGAGACCCATATGC 
ATTAAACTTAATCTAAATTTAGTTTTGTATATGTTTTTAATTAGTGTTGAACATTAAGTCAAACTTGTGCACATATAAATGTTAAAAAACAACAT 
TGCGAAATCGAGTCCGCGATTAGATTTCGAGTGAAATGCTGAAGGTTTTTGTTGCGTAGGCCTTAACACAATAAAGAAATGTAATTGTAACTAAG 
CTATCGGTTTTAGAGAGTTTAAAAAAGATTAGTACTTAGCTGAAAAACAGCAAATTTTACGTATGTTGTTGGAAATTGAATGATGTACGTTTTCG 
ATGTGACTTGGGAACAACATACAACAATTGGTGTTTGTATAATTACATAAGCCTTGATTCCGCTGATATCATGTTTGCAAATCAAGTCTTCGCTC 
AAAAATTGATTTAATTGCATATTTTCCATTCGTAAGACTAAAATATCACTAAACATAATAAAAATGTTGAAAATGTAGCTAAGAGCAGAATTAAA 
ATATAATTAGAATACAACAATTTGAATTAAAGAATATATTTGTATGATGTAAAGAAAAGTGTAAAAGTGCCTTAGACAAAACGATAAACATAAAT 
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TATTTGTAAGTCCATTAATCAAGCAGCTGAATACTATTTAGTTTGCAATTAAATTGGCAACGCTGTGTGTTCATGCATACAGCATATTTTTGTAT 
GTATATAAATAAAACCGAAGGGCAATGTATTAAACTATAATGTTTACTATATATCGAAATAGATTATTTTTTATAACTTGAGTCTATGTCACAAT 
TCCTATTGAATGAGTCTTCTTGATGATTCACAGCCTGCCAATTATT 
(SEQ ID NO; 1006) 

Exon: 1001.. 1161 

Exon; 1929.. 2083 

Exon: 29997.. 30255 

Exon: 30318.. 31192 

Exon: 31262.. 31458 

Exon: 31520.. 31632 

Exon: 31693.. 31890 

Exon: 32136. .32961 

Start ATG: 1132 

Transcript No. : CT26212 

CCGAAATGCCACAGACTTTTTGTAGGCTTTTCCACTTTTTCGATTTCGAGACTTCGAGCGAAACGGTCGTGTTTTTCGCGTGTGTGCGGGTGGCG 

GACGCGCGTGCCAATCGGAAAAGGCCTTTAAATCAAATGAGTGGCTCGCCAAAGGAGCGCATCAAGAGCAGCCAAAAAATCAACAAAATACGAAA 

CACAAAGCCAAATCAAAAGAAATCAAATTTCAAATTCGTGCGGCCACCAGCAAATATCGAAATAAAGAAATTATCTAAATCATCATCAGCAGCAG 

CACAAAAATCGAAAATCTCCTCGCTGTGTGTCTACAACATGCGTTTTGAGCCTCTGGGCCAGAAAATGTTCACCCGCCGCAATTGGTTGTTGCGC 

TGCAGCATCCTTATCAATGTGGCGGTGATCCTGTACATTGGCAGTCAATTGATGATCGGCGGCGGCAACAACTTCTCGAACGGCGGTGGCTTCCT 

GCTGCAGGAGCAGCAACAGGTGTCAATGATGCAAGTGGGCTCCGCCTCATCAGCGGCCCAGGTGCAGCAGCAACAGCAGCCCACTCCCAAGAACC 

AGAACCAGGTGGCCGAGATCTTCGAGTCCGAGGAAAAGCAGCTGGTCAATGCCAACGCAGATCCTCCAGCAGCAGCGGCTCAGGTGGCTGATGGC 

GGTGCTGGAGCCCCGCAGTTGGCGAACGATAGTGGTGCCGGGGCGGCTGCTCTGGGCGATCCCTCCCAGGTGATAGGAAACATTGGGCCCGGCGG 

TGTCGCCGAGGTGAACAATAGCCAGTCTGACGGCGGCTACATAGTCACAGGCGATGAGAAGGAGCTGGAGGAGCGTGTGCGATCGTTGATCAACT 

GCTTCGACCACGACTACCACCAACAGACCTTGCAGCGTGGCGACTTCTGGGTGCTGCAGAACTACGTGCGGGCGGAGCATGGCGACATCAAGTGC 

CATGAGTCGATCACCTACACCACGCACGCGGACTACACGTTCCTGGACAACCTGGTGCCGCTGTTGGAGCGCTGGAACGCCCCGGTGAGCATTGC 

CATGCACGCGCCCGGCACGGACTTCCAGCCCACTCTGGACTCGATCCGGTATCTGCGAGAGTGTCTGCCCGGCAGTCACCTGGTGCGCGCCTATA 

CCACCTTCCACATCTACTTCGGAACCAAGCACATTCCTAAGTCTGTGCCCAAGCCTCACGAAGTCTTCAAGACGGGATACAACTGCACACTTCCA 

CCTCCGTACTTCAACGTCAGCTCCGGGCACCTGTACAAGGCGCAAAAGAAGTTGCTGTATCCGGTGAACGTTGGTCGTAACATTGCCCGCGACTC 

GGCACTCACCCACTTTATTCTGGCTTCAGACATCGAGCTGTATCCGAACCCGGGTCTGGTCAAGAAGTTTCTCGAGATGATTGCTCGCAACGAAC 

AGTACTTGAGACGCAAGGCACCCAGAGTATTTCCATTGGCCATCTTCGAGGTGGAAGAGAACTCCCCAGTGCCACACGACAAGACAGAGCTACAG 

GAGTTCCTTCGCACGGGCAAAGCCATACCATTCCACAAACGAGTGTGCGCCAGCTGCCACGGGGTGCCCAAGTCCAAGGAGTGGATGTCCGCCAA 

CGAGACAGATGAACTAAGTGTGTTTCACATAGGAAAACGAACGGGTTACTATGTGCACTGGGAGCCCATATATATTGGAACTCATGCCGATCCCC 

ATTACGATGAACGGCTCAGCTGGGAGGGTAAAAGCGACAAGATGCCTCAGGGCTATGCGCTTTGCGTAATGGACTACGAGTTCCACATATTGGAC 

AACGCATTCCTGGTGCACAAGCCGGGCATCAAGGTGCTGAAAAAGGACAACCGACGAGCGATGTTGTCCGGCAAGACAAACCAGCTGATACGGAA 

GATCATCTATCCAGAACTCAAAATAATGTATGGCATGCGCAAGGGATGTGCGATATAGTGTAACGCCATGGGGGCCTATTGGGTTTTACCGCGTT 

CTGGGTTATCTTTTTCTACTCTGCAAAGCCTTGAGAGCATGGCTTACATAGACTTAACGTAGTTAAATGTGGCTTTTAGGTATCATGGTGTATTC 

CATTTAGCTGGCCATTCTTAAGTTGGAACTAGACGTCAAGTAAGCAGTCGGCCCTTTTGATAGAGCGTGCTTAAGAATACACGCCCAAAATTGAT 

TCACTTCCAAGAAATTTCTGTAAATAATAACACACTAGGACCTTTCATTTCTTCTGTTCTCTGGACATAAGCGGCCAGAGACCTGGGTTATGCGG 

TGTCCACCACTCATAGGCAGTCATTATCAGTTACTTTTAGTCTAGGCCATAAACTTAAGTTGAACGTTATTTAGTTGAATTTTCCATATACATTC 

ATTAACGAGTTAAAAATAGTTTTTATTGGATATTCTTTTCAAAATACTCAAAAAGAGAAATTGAGAGCAACATTATTTTCGTATACCGGGGCGAG 

TTTCCTAGACATCAGCAAAACAAACAGTCTTACCACTTTTTTGTGTTTTATATATTTCCGCATTAAAGGCATAGATTAAACAATGTTTTTTGGGC 

AATGTAGACTTACTGCAAAGACCAAAATTTAGATTAGACCGATACCGGGCCAAAACAATTCCAACGTTAAATATTTGACTTGATTAGCAGAGTAG 

GAATCATTGAATTGAAACGTATTAAATAGACTTTGAGAGTTAAAGCGTAATAGTAATAATGATAAAGACAAGTTTCAAACAA2^AAAAAACACAAT 

T AC TAT T AAAAT T T AT AC AAGAAAC AC AC 

{SEQ ID NO: 1007) 

Start ATG: 132 

MSGSPKERIKSSQKINKIRNTKPNQKKSNFKFVRPPANIEIKKLSKSSSAAAQKSKISSLCVYNMRFEPLGQKMFTRRNWLLRCSILINVAVILY 
IGSQLMIGGGNNFSNGGGFLLQEQQQVSMMQVGSASSAAQVQQQQQPTPKNQNQVAEIFESEEKQLVNANADPPAAAAQVADGGAGAPQLANDSG 
AGAAALGDPSQVIGNIGPGGVAEVNNSQSDGGYIVTGDEKELEERVRSLINCFDHDYHQQTLQRGDFWVLQNYVRAEHGDIKCHESITYTTHADY 
TFLDNLVPLLERWNAPVSIAMHAPGTDFQPTLDSIRYLRECLPGSHLVRAYTTFHIYFGTKHIPKSVPKPHEVFKTGYNCTLPPPYFNVSSGHLY 
KAQKKLLYPVNVGRNIARDSALTHFILASDIELYPNPGLVKKFLEMIARNEQYLRRKAPRVFPLAIFEVEENSPVPHDKTELQEFLRTGKAIPFH 
KRVCASCHGVPKSKEWMSANETDELSVFHIGKRTGYYVHWEPIYIGTHADPHYDERLSWEGKSDKMPQGYALCVMDYEFHILDNAFLVHKPGIKV 
LKKDNRRAMLSGKTNQLIRKIIYPELKIMYGMRKGCAI* 
(SEQ ID NO: 1008) 

Classification: enzyme 



Celera Sequence No. : 142000013384661 

GGGAGCGGATCTTCTTCAGACGCTCCAGATCGTCACGCAGCTTCGAGTCCAAGTTGGAGGAGGTCAGCTGCCAGTACTTGCCATCGATGATGTCC 
TTCTGCCTGTTGAGGAACCAGTTGGGCACCTTGTACTGCAGAGGGTTCGAGATGATGGTCACCACCTTGTCGACCTGCAAACATGGCATTTAGGT 
TGGTGTGCGTCTCGAACGGATTGGGCGTGCTCAACTTACCTCCTCCTCGGTGCACTCACCGGCGCGCTTGGTAAGATCGACATCGGCCTTCTTCA 
GCACAATGTTGGAGTAGCGGCGACCCACTCCCTTGATGGCGGTCATGGCGATGCCAACCTTGCGCTTGCCGTCGATGTTCGTATTCATGATACGC 
AGGATGTGCTGGAACTTCTCTGGGATGACGAGCGACTGCAAGAATGCAAAGAAAAAACATTAGTATAATGCATATAAACCACGCCATTTTTGCCA 
ACAGCTAACGGCATTTGCAATTCACTATAAATTTGTCGTTTGGCGCTCAGAATCGCACATATTTGAGGTAACTCACCATTTTGCCGTTTGTATTC 
TCGTATTTACTCCTCACTGTAAAGAATTAAGAGGTATTTTGTTTTAGAATTTTGGAAAAATCAAATGGGCAGCACACCACTCACCGGAAACGGGA 
AAAGAAAAGAAGGCCGAATTAATGTGGCTGCCTGAGGCGGAGTAAAAAACCT2\AAATTGCCTGCCACGTTCGGGACGCGTGCTGTTAATAGTTCG 
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CCGTGTGCTGTTAGGCGCACTCTGGTCCATCACTAATGCAAATTTAAAAATTCAAAAGGCGCCGAACTTTTGTTCAATGAAAATGCTTAAAGTGC 
CATTTAGATATGCTCTGTGTGCACATTTAAAAAATTAAACTTCATATTTTATGTTTCTTTATATTTTATATTTTATTCCCCACATTTCGTAATAC 
GTTTTATCGTACTGCATCCCTAGAGCACCGTTATATGAAAGCTTTTCAGAATGGGTGAAATAAATGGTAAATTATATTAGCTATGCGAATTTCCA 
ATATTGATAAAAATGCATTGCATTTAAGCACTCAGCTGCGTTGGACAGGACGATGTCGTTTCCCTGCGCATCGTGTGCACCATGGCTCGCGACGG 
CAAGCAGCACAACCTCGAGGATGTGGACATGTACGGGAATACCGCCCTATTGAAGGCCTGCTATTTGGGTCGATTTGAGTGCGCTCGTACGCTGC 
TGGAATTCGGTGCAAATATCTTTGCTATGAACTATTTTGGTCAAAATGCCCTGACACTGGCCACTTATGCTGGGCATTTGACTTTAGTAAAGGAA 
CTGTTGAGGCGAAGATCCTACAAGGATTTCAATCTGTCCTCGATGATCCCGGCCCTCTGTGTGGCTACACTGCAAAAACACTCCGCTTTGGTGGC 
CTATTTTACTCAATTGGATTCGAGGGGCGTGCAAGAAACTCAAACAGTGCATGGTTAGTAAAAATCAAGTGTATTTATATAGTAACATTAAAGTA 
AATTTATTTCAGGTCTGGGAGTTGCAGAGCTGCGAGGAATGATTAAAGCTGCAGGCCGCTTAGATAAGCGCAATGTGCGCTCTCCTCCCACTTTT 
ATTAGCAATCGTCTGCGTTAATGTATTGTCATTTTGTAATAATTTATTAATTTTAATGTTTGAAATAAATATACTCTGTAAAAAGTGTGAACATA 
ATATGTTAATCAAACCAGTTGGCAGGCCGCTCGCTGGCTGCTATCGATAGATTCAGTCGATATCGCCCGTGGCTTTTCACATCCCTATCCCGCTC 
ATTTAGCCCGCCTGAAAGTAAAAAAAAAAAACAGCCCACGTTAATCATTCATCCCAAAGTCACAGCCGCGGTAACATTACTGCTGTTAAATTCTT 
AAGCCCGTCATCAGTATTTAAATAATAAAACACATTCAATATGTTCGAGGCACGCCTGGGTCAAGCCACCATCCTGAAGAAGATCTTGGATGCCA 
TCAAGGATCTGCTCAATGAGGCAACCTTCGATTGCAGCGACTCCGGCATTCAGGTAAAAATTGCACAAAAAAGATTTTAAATGCATATCCCTAAC 
ACCTTGTTCAACTTACAGCTACAGGCCATGGACAACTCCCATGTGTCGCTTGTCTCGCTGACCCTGCGTTCCGATGGCTTCGACAAGTTTCGCTG 
CGACCGCAATCTCTCCATGGGCATGAATCTGGGCAGCATGGCCAAGATTCTGAAATGCGCCAACAACGAGGACAATGTGACGATGAAGGCGCAGG 
ATAACGCCGACACTGT C AC CATC AT GT T C G AAT C GGC T AAC C AGG AG AAGGT AT C GG AC T AC G AGAT GAAAC T GAT GAAC C T C G AC C AGG AGC AC 
CTGGGCATACCGGAGACAGACTTCTCGTGCGTGGTCCGCATGCCGGCCATGGAGTTCGCTCGCATCTGCCGCGATCTGGCGCAGTTCAGCGAATC 
CGTTGTGATCTGCTGCACCAAGGAGGGCGTCAAGTTCTCGGCCAGCGGCGATGTGGGCACCGCCAACATTAAGCTAGCCCAAACCGGCTCTGTCG 
ACAAGGAGGAGGAGGCGGTGATCATCGAGATGCAGGAGCCGGTGACGCTGACATTTGCCTGTCGCTACCTG 
{SEQ ID NO: 1009) 

Ex on : 1001.. 1016 
Exon: 1074.. 1478 
Exon: 1533.. 1636 
Start ATG: 1001 

Transcript No. : CT26230 

ATGGGTGAAATAAATGCACTCAGCTGCGTTGGACAGGACGATGTCGTTTCCCTGCGCATCGTGTGCACCATGGCTCGCGACGGCAAGCAGCACAA 
CCTCGAGGATGTGGACATGTACGGGAATACCGCCCTATTGAAGGCCTGCTATTTGGGTCGATTTGAGTGCGCTCGTACGCTGCTGGAATTCGGTG 
CAAATATCTTTGCTATGAACTATTTTGGTCAAAATGCCCTGACACTGGCCACTTATGCTGGGCATTTGACTTTAGTAAAGGAACTGTTGAGGCGA 
AGATCCTACAAGGATTTCAATCTGTCCTCGATGATCCCGGCCCTCTGTGTGGCTACACTGCAAAAACACTCCGCTTTGGTGGCCTATTTTACTCA 
ATTGGATTCGAGGGGCGTGCAAGAAACTCAAACAGTGCATGGTCTGGGAGTTGCAGAGCTGCGAGGAATGATTAAAGCTGCAGGCCGCTTAGATA 
AGCGCAATGTGCGCTCTCCTCCCACTTTTATTAGCAATCGTCTGCGTTAA 
(SEQ ID NO: 1010) 

Start ATG: 1 

MGEINALSCVGQDDVVSLRIVCTMARDGKQHNLEDVDMYGNTALLKACYLGRFECARTLLEFGANIFAMNYFGQNALTLATYAGHLTLVKELLRR 

RSYKDFNLSSMIPALCVATLQKHSALVAYFTQLDSRGVQETQTVHGLGVAELRGMIKAAGRLDKRNVRSPPTFISNRLR* 

(SEQ ID NO: 1011) 

Name: plutonium 

Classification : DNA_replication__f actor 
Gene Symbol: plu 



Celera Sequence No. : 142000013384591 

TCACTATAAGCTGCAATAAAGTATATCAAAGGCCTGTTCGGGCGGCATTCTGGGGGCTCGAGTTCCCTCCCCTTTCAGTGCAATCTTTCGTTTGT 
TCATCCATTTCATAAAAATTCCTGATAATGCTTGATTTCCACAACATGCACAAGTGCGTGTATGTATCCGTTTTAGTACTGACTATATTCTACTC 
TGTTTAAAATAAATATTACACGAGACACTCGTTATGTTATTGCCCCTTGATTAAAACATCTGAATTTCTAAAATGTAATTTTTGCATATCCGATA 
GATATTTTGTCTGTGCTGCTTCCAAAAAAAAAAAACGTTAAAACTCTAAATATAATTCACAATATTTTGTTTGTATAAATTATTATGCTGGCGGG 
TGGTTACTCGTTATGTTATTGCTATATCTTTATCAATTATCTGCAAGCACAGATGATATACATATGTATGTACCTATTTATAACACAAAGCTGAT 
CTTGAAGAGGGCAAAAATAGATTGAGTGCGTTGTCAACGCTCACTTTCAACATTTTATCACCCTTGCGCTTCAGCGAAGCCTGTCTTACGTGTCT 
ACCAACTATGAAATGCAAAGTAACCGCCACTGAGTACTCTTAGTACACTTAACGCATAATATGCCTGGGGCTGGGTTACGCCTCGTTTCTATTTA 
TGTACAAAACATCGATATCCAGATAGCAAGTACCCTCGTTACTTAGCTACTATCACACACACTGACTGAACCTGTTGCTGTCGCATTGTTGCTTT 
GTCAGAGAGGTTTTACAACTAGTATCATAGCTATTTTGTGAGTATTGCTTCATTTTTATACAGGGTTTTTTCTTTTGGTAACACTAGGCTTTTCT 
TACACTATTTATTGAATCGTTTTCCACCTTGATATTTTGCCGTCTTTGAGGGAGATAAGAACTAAGCTTTCAGTCGAATCTGAAATAGGTTGTAT 
TACTGATTGTGGTTGCAAATGAAATAAAATAAAAATTCATAACTTATGTTCTTGCTAAATATTTATTTATGTAGTACCCAATTGAAAGTGTATAT 
ATTGAAGGATCCTTCGGGCCATGCCGAAGTCATTCATTACACAAGTAAAAGTTTGGCATGTCTCCATTTCGAAACGGCTTCATATTTGTTTTCAA 
ATTAAGACTAGAAACTGGACCTTAGTGTCTAAACTCTACTTTGAGACTGCACATCTATATGGCGCGATAAAGTCACTTAATAACTTAAATGTTAA 
ATACGTGTGTTGTATGCAAAGTTATGTGAGGTCATCGAACGAAGCGGACAATTAGACAGATTCAAATATGGTTTGTGGACATGCTCGCACTTTGA 
TGTAATCGATTCGTTTGATTAACGTTGAGTTGTTGGGTGTAAAACAAGTAAACCCAACAAAACCAAGTTAATGGGTTTCCAAAAAAGAACGACAA 
AATACGAGAGATCGTGTAAAAGTTTGGCATTTTGCCCTAAAAGTTTATGCTATTTTACAAATATACAATGTATGCATATATAGATGTATATCGCT 
TATGATTTGATTTGACTTGTTGATTGCCTACGAGCTAATAATACCTTAGGTTTGAACTAAGATTTCCATATCGTATGTACGTTAGTATATGTGTA 
TAAGTATATATATATCTATATATAAGTTATGTATACAATTGTATAATATTGCAATTTCCAAATGTTCAGTTTCAATATGATGTATTTTTTTTTGT 
ATGGGTTAGGGATCTGGCCGTTTAATGGATTGAAAGTCGTGGACCGGCGGCACCCATCTGTATGTTACAATACAAAATTGCTTGTCTTCTCTTGG 
TTAATAATAATATTACAATTTCTGCTGTCATTTCAAGGGGGTTTTTCGTCGTCCACAGCGACGTCTCATTTACAAATTTGATCCCTATACTAATA 
CTACTTTCTACAAATATATGTATGTGTAATTATATGTTACGATCGAGATTTTTCATTACTTTCGTTTGGGTGTTTGTTGACCCAGATTCAAATTC 
TGTTTGAATTCACCATACATCATTCGCTTTGAGTTTTGTATTATTGCAAAATAAGATATTTTCTCTACTATTGTTGTTTTTTGTGCATCAGCGCC 
ATATTGAATGATTAAATATAAGTTCTCACACCGCCTTTTCAAAGGGCAGACAGGGAAAAGTATTTTTCCTAAAATTGAGAGCTCCTCTCACTTAT 
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ATGACAATGATCTTACATTGGTATTCTGCTGTAACCATTTCTTAAAATTGTAAGCAAGATAAAATATAAATTCATTTGATTTTGTGGCTGATTGC 
TCCAAAAGCCTATTCAAAACATTTTACCGATCTTAAGCAAGGCCTCGCAGCAAATAAATTGTGTCATTCAATTGTTCTCTGATTTAAAATTATTC 
TTATCTTTGTCTTGTACTCGAACATTCTCACATTACTAAGATTCTTTAAATATTTAAAGGAGACAAAATTTACAATGCGGTAAACGTTTGTTGTC 
CATTGCTTATCAATGTTAATATACATGTGCTATAAAATGAAACTGAGAAATACACAATAAAACCAAAACATAAATTAAATACTCTTTCTCTATTA 
AAATGTATGTTTATATTAAAGTAATCCAATGGCATGAAAATAAAGTCCTGTCAAAACAGTGGGTAATAAGTAGGGTAAGGTATGTTTTCAATGAG 
TTAATTAACAAATTAGTCTTCATTGGAGAATACTTCAATTTCTTTCGAGTACAAGACTATAAGATGGCCTTGGTAGGATCATGGGTAACATGAAT 
TCCCTTTTGCTTGCCCTTTTGTCCATCGGCGTTTTTAAAACAGGCCTTTGGCGCCGGCCAACTTACGTCGCCGGAATCACATGAGTCATCATTTG 
AGCGTATTGTAAATGTTATCCTCTCCGGCCAATTCAACGGTGGTGATGTGGGCAGCGTGCCTCTCGCCCAGCGGTTCCTCGCCCAGGTACACAGT 
ATGGTAGCTGATTGCGCTGCTCTTCACCGCCGCCGTAGCTCCATTCACCTCAATCCGACTGACCGACTGTGGTCCATTGACCGAATACTTGAGCG 
ATGCATAATCTCCATCATCGTCGGGCACATGAGTGAACTGAGTGGCGCCCGAAAAGGTGTCCACTGTGTGCGAGGGACAGATTTCGGCGTATATA 
TTCTCCTTGGTATAGTCCACAGCGGGAACTGGTCTACTGGCGGACTCCGATCGCAGGCAGTCCCCGCCGGAAATCACAAGCGAATCGCGCACACT 
GGAGTAGGGCAAGTCGTCTTCCACCAGCGATGCGGGCGATATCGCCAGTTGGTTGAGTCCTCCTCCCAACAGAGAAACTCCACCGGCTGCCCTTC 
TTCCAGTGACCGGTTCCTTGGTCAGCGCCTGTTGCTGCGCCTTTAGCAACATCAGATGTCGCTTCTCCTTTTCAGCCTCGTGAAGTTGCAACTTT 
TGCGGCAAGCTAAACAGAGGTAACTGGTGCAAACTGGACGATGACTTGCTATACTGCCTGGTCTTCACGTGAAACTCGTTGTACAGCTTCTCCGT 
GATGAAGGACTGTGGAGGTGGGGTCTGATTCTCCACATCTTCCAGCTCGGTACCTGCCTCACTGATGGCGTCATTGTGATTGTGGCTGGCCAGCA 
AATCAGGTCGTTGCTTGCATTGACCATTGAGAAGCTCAATATCCTGCAGGGACATGGACTTTTGCATCATTGTGTCCAGAGACCTCGGCTCCATG 
GTAGTCAGCATCGTTGTTTCGGTGTCGGACGCTGCTGTGCTGTTTAGGGAGATAGCATCTTGCTCTGGCTGTTCGGGATTCGAGCCAAAACGCTC 
CTGCAGAAGCTTAGACATATCCGTATAGCGCTCCATGCTCATTCGCTTGGCCTGCTCATCGAGATCTAAGTCAGATTGATGTTTCTCTATTTGTG 
GTTGCTGTTGCTGCGGATGATGCTTTCCGAACCACTTCTTCTTGGTAGACTGGAATACACGCATGCGCTCAGAGATGCGATTCAAACTGGATTCC 
GAGTGAGCTGCAGCAGAAGCTGCCGCTTCCTTGGCTTCCAGATTCTCCAGATCCGATTTGCTGAGGAAAATAATGGAATCCTCCTTTTCGTGCGC 
TGCTGCCATGGAGCTCTTCTCGCGTCTCAGGATCTTTGGAAAATGCAGTGGCTTCAAGATTCGTAGCCCCTCTCGATTCTTGCTTCCCGAACGTT 
CCGAGCGCTGAATTAACTCAATCTCCACGAGCTTCAGGAATGGATCGCAATTAAATTGACAAAACTTTAGCATATTCTGCACATTCGTGTTAGCG 
AGCATCCGTTGTTCGGAGTCAAATCCTAAATAACTTTTTACCAGCTGAATATCTCGACTGGGATAGAACTTCCTCAGCTGCAAGGGCGTCTGGGT 
ATCGACGTCCTCAATGGAACAGCCAACAACCACATCCTCCTTGCTCACTTGACTAATACATAGCTGCTCGGGAATGTCGGGTCCGCCTGGACCGG 
AGATATAGCGCAGTGTTAGCGGTAACTGCTTGGCCGCCGTCACCAAAGCACTCAAACGATGCACGCACTCCGGATTCAGCTTGCGCTGCGCATCG 
CCCAGTTTCTGAAGCAGCTGAGGAATTCCCGGTTCAATCTCGTAGAACTTGCCCTTGGTGGACAGTGGGACGTACAACACCTGCCGGTTTTCATT 
AAGCAGCTGGGCATACCGGTTCTTTTCCTTCTCCCTTCCCATGTACCCACTGCCATGGGACCTGGTCGCATTGCCGTTGCCATGTTGCGACATAT 
TGACCTGATCCTGTTTGCCATCCTCGAAGACGGCCAACAGACGGAATACCTGCCCGCCCCTTGCCGTCGTTTTCACGTACTGTGGTCGTCCCTTG 
TTTATGGGCAGGCATCCCTCGTGCGTTGGACTGCTCGTACTGTTGGGCGAGGATTCCGTAAACGCCGGCATGTTGTCCACTGAGACAAACTTGTA 
AACCCGTTCCCGAACCAGCTGTATCAGAGATCCGTATTGTGTGGCCGTGGGGAGACCCTTCTCGCTTAGCATCGAAAAGTAACCTGCGGGAAAAA 
GAAAAAATACAAATGTTAAAATTGTCCTTGTATCTTATGTTGTATGCGTATCTCTTCTATTAAAGTGGGTTCATCTAACCATTATACATTTCATA 
AATAAATTAATTACAATTGGGTCAAAATAAATGTTCACAGAAGCTTCGCCTTCTCAAGGTCATAAAAGCATTTAAAAAAAATAGCACAAATCAAT 
AATTAAAAACTAATTTTGAAATCTCTTTGAACAAGACAGATATTTTGGTTCAGTCGCTGAACAAATCTGTTTACTGTCTAAAATCTGAAAACCAT 
TTTTCCGACAGCTGACAGCTTCGAACAGAATATAGTACACAATTTGCAATCCAAAAATGAGTACAAAAAATAAAACAAATAACGTGACGTGCGAC 
TGGGCATCTCTTAGTATTGAGATATATGTATTTAATTTTCTTAAAATAAAAGCATTTTTTGTCAATTAAATGCAAAACCGACAAGTTTGATTGGA 
GGGTTTGTAAAAAATAAAATT CGAATGT AAAAAGAGTTT CAGTTAGCGC AGGT GGAT TTT ACAGAAAAAAATGC AATGC AATT AAACATT AC AT G 
TATCGATGAGTCCATTAATCATTTCATTTGGTTCAATTCGCGCCACTGAGCTTAAATTATAATGATACAATAAAAAAAATTGATGATAAAGAGAG 
ACTAATGCAATCAGCGTCGAGACCAATTGAATCAACACAAGAAAAATCCAATATGCTCGTGCAAATTGAATCACTCCAATTTGCCATCCATTTTT 
TCCGGACATTGCAAGCCTAATGAATAAAATGATAATACCCAGCGTCAGTAAGTCTATAAATGGGCAGATTGCATAATATTTGCACCCATTGGTTT 
ACATTGCGCCTCGCCAAACGGCTGTGAAAAGTCACATGGCGTTGCGATTTGAAACAATGGCTGACCGCACAGTACGGCGAAGAAGCAGCAGTGGG 
TGGTTGGTTGGGTACTGCAGGTGGTATCGGTGGTGTCGGTGGTGCCGGTGGTGTTGCTGGTGTCGGTGGTGGGCCACCGAAGAAGTCACCGCTGC 
CAGACAGACTGGGCCAACAACCAAAACCACCGGCACTGGAACCCCGAATAGAGCTCATTGCATAACGCCTGATGAGCAATAGTTGCGATTACTTT 
CCATTGTTGAACTGTGACTGTGTCACGCAAAAAAAAAAAAAAACATCAAAGTGCTGGATACATATGTCTGTATGTTTAGGGAATTGTGTACTTAG 
TAACCTCGCACTGGGAGATCAGAGTTCGCTTGGCTGACTCACTAAGCGAGTTGAGTTGGTTACGAGTGGACCGTGTAATCAGGCAAAAGTAGTAA 
GTATCTAAATGCATTCGTCATGAGAACACTAAATTGTGCCTTCAAATGTTCCCGTCTGGATTATGTAAATGGGCATGCGACTACAATCAGCCGTG 
TTCGTGCCGAATCAGTCACAGGTTTTCCTCCCATTTTTTTTTTTTTTAATCATTCGGTTATGCCCTTAGCTACTGCCAATCGGCGCATCAATCGG 
TGGGTCATTGAAGTTAATTTACTGTGAGCGATAAATTAACTTCCCTCAATGGAATGACAACTTGTTTGAAGCCGCCCTCGTCGGGGCCAACGTGT 
AATTACAATTAGCCGCAGCACTTAAGTTCTAGCTGTTCTGTGAAAAATGTATCTTTCGCTTTTACAATTGTAATAATCAATAAATAGCAGGTAAT 
TAGAACGCAAACGTACTATAAACCTGAGATAGCATACATGTAATAATATGACTACTTAACTATTGTAACAATCAATTTTAATAGTTAATAAACAA 
T AT G T T T GC T AAT T G AT T TT AG T T TAT T AC AG AT AC AT A C T G T T AAGC AAT AT G AC AAAT T GT T AT T T AT GAT T T AAAT AAAC C C AG AAC T C G A T 
GCAATCATCCAATTCAGAAACTCACCCGGAAACTCCTGTGGAATGAGCAGAGTCTTCTTGCGCTGCGATGCCTGAGTCTTGGCGTTGTGTCCGTG 
GTAGATGTTGCGCTGTCGCACCAGCCTGTAGAGCAGAAAAAGCTCATCGTGGCCGGGTGGAGTGCCACCGTTGCTGCTGTGGCGACCGTTGTTGT 
TCTGGTGACCAGAACCAGAACCAGAACCCGAGCCCGAACTAAGACCAGGACCAGAGCCAGTTCCGTTTCCGGTGGCGCTGCCTCCGCTGGACGAT 
GAATGGGTGCCGGTGTCAATGTTGCCACTGGAAATGTTTCCACCACCACTGCCACCAGAACCAAACGGATTCGCTCCCGTCGAGGAGCTGCTCGA 
TGCCGCCGTGGAGGAGCCATGCGATGAGCTGGACTGCAGCGTTTCGTTTGAGCTCTTGCTCGCATGCAGAATGCGGACGACTCGTGGCAGGGAAT 
TCCGCTCAAGGAACTCAATGGCTGTCAGCGCACCCTGTAAGATGCATTGTTTATTAGTTGAAGTTAGAGTTTATTTAAATAATTTCGAATGTGGT 
TAGGAAACTTACAAATCGACAACGAATAATTAACCAACATTTTTGCGCGATTAGAAAACTTATTGACGTCATGTACACGAGAAAATTTAATATAA 
AAAAATGTATATTCTATATTTTATTTTAATCCTAATCTACGTTAATCTGCTTTTATGATATATTGTATTTGTAGAACAAACTTTGGGGGATTTTA 
AAAACCCGTTTCTATCGTTATTCGAATTTTTTGCCATCAGATTATGTTACTATTTGAAATTGGTTTGACGTAGATTTGTGTATATATAGGTATAG 
TGTAATTCAATCTTACTTCGCTGCTGCCCTTGGAGGCCTTGCTGTCCCCGGTGGAGTAATCGCCGCTGCCATTGAGTGCGTCACAGTATCCAGAA 
TGGGCACCTCCCAGCAATCCTGAGCCCGATCCCAGGCCGTAGGTGCTGTTGTTGGCACTGCCGCGCACGCGTCGTGCCCTCCGCTGGTTCAGCTC 
GATCTCGTTGCGGCGAGCTAGGCGGGCGTTCTCGAAATCGGCCACCGAGAACATATTCGAACTGGTGTCGTGGGAGTTGAAGCGCCGCAGTTTGG 
GCTTGCGCGCCCCAGCCGCCGAATGGTGGCCGTGATGGGCGCCAACGGCTGCTCCACCTGCAGTGGCAAGAGCAAAAGGTACGAAAAAAATTAAA 
ATAAAGAATAAAGAAACGCAACGCAACGCGGAGGAGTCGTGAGTGAGTGAGGTAGCACATTGTGCATCGGTGAGGTAATCGCATAAATATAAATG 
CACATGTGAGTGTCTAATTGCCACCCCGTTGTCTGCAGTGGGAAAAACCCACACACCGTCGTATCAACTTCCACATCTACGAGTACAGTGGCAGA 
GCTTCGTCTATAATGAGTGTATATTCCCCAAGCGAATTTCAGTTGGCGTGACGCAACAACAAAAAAAAAATATAAAAAAAATAAGGCAAGTGGTT 
CAAAACTGTTGCTAAGCGCAGCTAATTTTCAGTCAACGGTCAACGAGACGATGACACACAGTGGGCAAATATCGCCCCAAAGCGTAACCGCAAAA 
TTAACAAGAATTTTCGAAATAGACAGGAAGGAATGTCGAAAGATCTTGCACTGAATTTAAAGTGGGGAAAATCGAACTTTTATTGTTCAAGACGG 
CCATTTTTAGACCACAATAAATTTAATGCACTGAATAGCTGCACCTGTACACAACATTCTTGCCTGCTCGGCTTTGTTAAACATCGTTTCAAGGC 
TAAATATAAATGAAAATCTTTTAGCCAAAGTTTTGTATATGATTTGGAATCTACGAACTTCAAGTTCAACTGGCCAGAAAGGCAATCTATCTCTA 
TCTAATAATAACATGCAAATACTAAAAACGTAGGTATACACAAAACTCAAACTGCTCTATTTGCACTCGATTCTATAATTCAAAGTTTACTAATC 
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AGTGCTTATTTTTGCGTTTTAACAACGTTTACGCZ\AACAAATTTGTCACTAACAAGACTTTATTTGCAATACAAATATATTGACTATCAGCTAAT 
TGATACTAAATTTTGCTATTGACGTTTGCCATGGGTCATTAAAAATACCTGCTTGGTTTATGAAATGCTTGAAAATTGCTTAAAAACTTGTGAAA 
ACATTTTTGATTTATTTCCAAGCATAAAAATTCGTTATCAGGTCAATGATCTATGTGTAAGCTATTAATCAACGTATTTTATTATCGATTAGGTC 
ACTTCGATTCGCATTTCCGGGTTATTGATTACTTGTTTAAAAAGTTTTGAGAACTTTTTGTTAGATTCGAATGCACTTGAGTTAATATCGAGGAA 
GAACTGCGGGGGTCTATGACTTGACAATTTTAAGGTCGAATTGTATTTTTGGTTGAAAGATCCAATCGGATCGGATATCGAGGCACTAATAAAAG 
AGTCAACAACTGGAGGCGATTAACGGCACACAAGTACGTAGACAACACATAAAGATAACACCTCCCTCATGGGCAACATATATTGTATTATATCG 
TTATCGGGCCGACATCCAAAAAAGCAATTGTCTGTCGCCCCAAACAATACCGCATGAAATCACTTAGCCATATTCGTTTTTTTCCCCGATTTTTT 
GTGTTGCTTTAGAAATATATAGGTATGTATGTAGTTGTGTAAATAAGCGAACTTCGAGCGTTGCAATTAAAAAGAGATATCGCGATTTTTGATTG 
CTCATTGTTCGAATGCGGCCCATGGTGCGATTTGGATCCGCTGATTGGTGGTCGACGGCGCATTACTCACTTTAATTGCGCACATTTGGCTTAAT 
TTTTTGCCCCACACACAACACACATTTAATAAATGCAGGCGAGGGCAGAAATCACAGCATAACACCGGGAAGGGGCTAAAACGTTTTGTGAAATT 
GCTCTTATCTTTCGTGAAATATCAATTGCAGAGTAGACTTTCAGCCAGCGAGCGAGAGAGTAAGCCATATACATACATACATATATATATACATA 
TAAATAGAGAGAGAAAGCGATAGAATGGATGAAGGGGGGTGGGTGTGAGATTGTCACGTTTCGTGTGAAATTATGCGAATCGACGCGCGCGCCTT 
TTGCGAAAAGAAATCCCCATACAGACAACACAACCACCAGTTAGACAAAAAAAAAACTACTTCTGGTAACGAAACTGTTGTACCCTTTCAATCCA 
ATCTAAAAACCCTTCTTCTGATCTTACCCTTTCCGGAATTTACCGCAGATACACATACTTATCAACAATATGACTTAATCTACACATCTTAGTCT 
TTAATGGTCGAGATTTCTTTAGAACCATTCATATTTTTGAATGGGTAGACAAAAATAAGTCTTTCTAGGTCATCTTATCAACCATATATTTTAAA 
ATTTAAGTGAGTAATGTTAATCACTTCATCTCTTTATTGTTCAGAGTGTTGATTGGTATTCCATCCAAACTAGCTGGCCATAAAAATAGCACATG 
TTTCGGTTTTCACTTTTAAACGTTCTGCTCGAATTCGATTTGCACAAAAATAAATATTTATACGATTTGGATTCGCTTATCCCAAAGACCAGTAC 
AGT T GGC T AT C AAAT T AATT AAT T T AGAT G C AAAT TC G AT AAT C AAGAACT T T AAAT CAT AC T GC AAT C T C T T T T G AAC TTTTTTCGT AT AAC AC 
CACCATTGAAATTGATTACAACTTCGTTAATATATTGCACAAATCCATTTTCAAATCATTAGTTTCACTTACTGAAGTTTACCTGTAGTTGTAAA 
CTTGGATCTTTTATTACGTGCTGTAAAATCTTGTAACGGCGCACCTTTTACGGAAAATGACGTAGAAAATACACTCATTTTTAAATAATTGGGTG 
CTTGTTGTAAAAAATTAAATTCATTATTTTACAGCAGCAACCAATTGCTCGTAATTACCATACTGTATTGTGTGAAAATAGTTAGTGCCACTACT 
GTTTCAGTCTCATAAAGTTGTATTTATCAAACTTCCTAAAGGAATCAACGCAAAAACATTTTCATTTTCGAAATGGTGTAAATTAAGTCTTCAGT 
TTAATCATTTTTTGAGTGTATTAGGTCTTATCTTTTTAGTACATTCAATTCCCTTCGCAGGATTATTAAGTGTATATAAGGATGATCTGAATTAC 
CATACCCTAAATGCAGAGCATTTACAAAATAAAAGCAAAGGCGACGTCATTGAGCAAACAATTACCCATTGCGCTCTCCCGCTCTCTCCTCTCTC 
TCTCTCACGCTCGCTTGAAGCGCGTGCCTTGTCAACCTTCCCCACCACCCCTCCTCCGCACCCACTTCCACTCTTCGCGCTAAGCTTACCGTTAC 
ATGCAGCTGCTGCTGCAGCTGCGACGCTGTTGCTGGGTGCGCCCGCTGCTGCTGCTGTTGCGACCCCGTTGCCGCCCACTGGGCCAGCGCCTGCT 
CCACCCACTGTGGAGCCAGGATTTGTAGTTGTAGCTGTTGTTGTTGTTGCGCCCTTTGTGGCTGATGCAGTGGGTGTTGTTGCTGCTGCCGCTGA 
TGTTGTTGTTGCTGCTGCTGGTCCGCCGGGCATGTGCGGTCTACTGTTCATCAGCCAGTTGGCCAGGTGCCTCTGCCGCTCGAGCTCGATATTGG 
TGCGCTGCATTCCGAGTCCGGACAGCTTTAAGGTCTCGTTCATAATCGCAATCAATCAAACCGAAAAAAAAAAATGAATGCGCCACTGCAACGAG 
CGGACGCGTCGCCGCCGTCGAAACGAATGTATTCCAACTCGGTACGTCTCGCGATGCCGATTTTCAGCGACCAACTGAGCAGACTTGTCCGACTG 
CAGCGGCGCCCTCCTCGATCGTTGCTGCTGCTGCTGTAACGAGAGAGCCACAGAGAGAGAGGAAGAGAGAGAGAGAGAAAAGCGTGAGAGAAAAA 
CAACGGAGAGCACAAAATGCGCGCACAGTGGGCAGCGTTTACATAAGTGGCGTCGCGGCCAGCGCAACTCGAATTCGCACAGTGGCCTCAATCTG 
ACAGCTGCACACTCTCACAATTATCGGCAAAATAATATTTGCCGAGCTGCTCAATTTTGTTTGGCAAATATTTTTGCCTTATTATTAGACTACCT 
ACTTATTTATTGAGTGTGGTTTGGGCATAGTAGAGTACCTTATATCGGGGAGTTGTGTCACCCCCAGTTTTAGTTTATGCATTTCTGAATCATCC 
GTATTTCAGAAGAATGAACATTACCGTGAACAATTTCTTATTTCACAAAAGGCTACTGGATGACAAATTAAAATAAGATATAGTGCTTCCTAGAA 
AGCTATTTTTCAAAGATATTCACTTTAATTGATTTGTATAAATTTACTTTAGTTTTAGTTTCACGAATTTTGTAGTTAATCTTTTCTATTCTAAT 
ATTTATGCGTTTAAGTTTCTTTTTATTGTTGCCCTAATCAAATGATCAACTAAATTAGCAGCTGTGCAAACCTTTTCGCAGAGTTCATATACCCC 
AAATCCAAGCGTAACAACAACAATTGCAAAATGTGTAAAAAGATGTTGAGCTTTAGCAAAAAAAAAAAATCTCCATCGTAATTGAGATGGTTGCT 
GTGTGCCTGTGTGTGTGAGATGGATGTGCGTGTGTTGTGCATAAATTAAAATTGCGATGACGATGACGACGTTATCAGAAACTTGGCTCTCTCGC 
TCCCTCGCTCGCTCACTCGCCCGTTCGTTCCTTGGCTTGTTCAATGTCGCCGCACCTTTTCGATTGTCGACCGCCCGCCGCCCACTTCGCCGCCC 
CCCAGCCACCACCCACTACTCGTGCAACAGCGTAGAAAGCAGAGCCAGTTCGTCTGGCGTTGGCAGCAAAAACAACAACCGCGCGTCATCTTCAA 
ACGTTATGCGACAGCGTAATCGGTGCGATTGTTAGAAGAGCGAGC 
{SEQ ID NO: 1012) 

Exon: 11870.. 11300 
Exon: 8132.. 7807 
Exon: 7444.. 6961 
Exon: 5023.. 1001 

Start ATG: 11633 (Reverse strand: CAT) 
Transcript No. : CT26270 

CTCTCACGCTTTTCTCTCTCTCTCTCTTCCTCTCTCTCTGTGGCTCTCTCGTTACAGCAGCAGCAGCAACGATCGAGGAGGGCGCCGCTGCAGTC 
GGACAAGTCTGCTCAGTTGGTCGCTGAAAATCGGCATCGCGAGACGTACCGAGTTGGAATACATTCGTTTCGACGGCGGCGACGCGTCCGCTCGT 
TGCAGTGGCGCATTCATTTTTTTTTTTCGGTTTGATTGATTGCGATTATGAACGAGACCTTAAAGCTGTCCGGACTCGGAATGCAGCGCACCAAT 
ATCGAGCTCGAGCGGCAGAGGCACCTGGCCAACTGGCTGATGAACAGTAGACCGCACATGCCCGGCGGACCAGCAGCAGCAACAACAACATCAGC 
GGCAGCAGCAACAACACCCACTGCATCAGCCACAAAGGGCGCAACAACAACAACAGCTACAACTACAAATCCTGGCTCCACAGTGGGTGGAGCAG 
GCGCTGGCCCAGTGGGCGGCAACGGGGTCGCAACAGCAGCAGCAGCGGGCGCACCCAGCAACAGCGTCGCAGCTGCAGCAGCAGCTGCATGTAAC 
GGTGGAGCAGCCGTTGGCGCCCATCACGGCCACCATTCGGCGGCTGGGGCGCGCAAGCCCAAACTGCGGCGCTTCAACTCCCACGACACCAGTTC 
GAATATGTTCTCGGTGGCCGATTTCGAGAACGCCCGCCTAGCTCGCCGCAACGAGATCGAGCTGAACCAGCGGAGGGCACGACGCGTGCGCGGCA 
GTGCCAACAACAGCACCTACGGCCTGGGATCGGGCTCAGGATTGCTGGGAGGTGCCCATTCTGGATACTGTGACGCACTCAATGGCAGCGGCGAT 
TACTCCACCGGGGACAGCAAGGCCTCCAAGGGCAGCAGCGAAGGTGCGCTGACAGCCATTGAGTTCCTTGAGCGGAATTCCCTGCCACGAGTCGT 
CCGCATTCTGCATGCGAGCAAGAGCTCAAACGAAACGCTGCAGTCCAGCTCATCGCATGGCTCCTCCACGGCGGCATCGAGCAGCTCCTCGACGG 
GAGCGAATCCGTTTGGTTCTGGTGGCAGTGGTGGTGGAAACATTTCCAGTGGCAACATTGACACCGGCACCCATTCATCGTCCAGCGGAGGCAGC 
GCCACCGGAAACGGAACTGGCTCTGGTCCTGGTCTTAGTTCGGGCTCGGGTTCTGGTTCTGGTTCTGGTCACCAGAACAACAACGGTCGCCACAG 
CAGCAACGGTGGCACTCCACCCGGCCACGATGAGCTTTTTCTGCTCTACAGGCTGGTGCGACAGCGCAACATCTACCACGGACACAACGCCAAGA 
CTCAGGCATCGCAGCGCAAGAAGACTCTGCTCATTCCACAGGAGTTTCCGGGTTACTTTTCGATGCTAAGCGAGAAGGGTCTCCCCACGGCCACA 
CAATACGGATCTCTGATACAGCTGGTTCGGGAACGGGTTTACAAGTTTGTCTCAGTGGACAACATGCCGGCGTTTACGGAATCCTCGCCCAACAG 
TACGAGCAGTCCAACGCACGAGGGATGCCTGCCCATAAACAAGGGACGACCACAGTACGTGAAAACGACGGCAAGGGGCGGGCAGGTATTCCGTC 
TGTTGGCCGTCTTCGAGGATGGCAAACAGGATCAGGTCAATATGTCGCAACATGGCAACGGCAATGCGACCAGGTCCCATGGCAGTGGGTACATG 
GGAAGGGAGAAGGAAAAGAACCGGTATGCCCAGCTGCTTAATGAAAACCGGCAGGTGTTGTACGTCCCACTGTCCACCAAGGGCAAGTTCTACGA 
GATTGAACCGGGAATTCCTCAGCTGCTTCAGAAACTGGGCGATGCGCAGCGCAAGCTGAATCCGGAGTGCGTGCATCGTTTGAGTGCTTTGGTGA 
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CGGCGGCCAAGCAGTTACCGCTAACACTGCGCTATATCTCCGGTCCAGGCGGACCCGACATTCCCGAGCAGCTATGTATTAGTCAAGTGAGCAAG 
GAGGATGTGGTTGTTGGCTGTTCCATTGAGGACGTCGATACCCAGACGCCCTTGCAGCTGAGGAAGTTCTATCCCAGTCGAGATATTCAGCTGGT 
AAAAAGTTATTTAGGATTTGACTCCGAACAACGGATGCTCGCTAACACGAATGTGCAGAATATGCTAAAGTTTTGTCAATTTAATTGCGATCCAT 
TCCTGAAGCTCGTGGAGATTGAGTTAATTCAGCGCTCGGAACGTTCGGGAAGCAAGAATCGAGAGGGGCTACGAATCTTGAAGCCACTGCATTTT 
CCAAAGATCCTGAGACGCGAGAAGAGCTCCATGGCAGCAGCGCACGAAAAGGAGGATTCCATTATTTTCCTCAGCAAATCGGATCTGGAGAATCT 
GGAAGCCAAGGAAGCGGCAGCTTCTGCTGCAGCTCACTCGGAATCCAGTTTGAATCGCATCTCTGAGCGCATGCGTGTATTCCAGTCTACCAAGA 
AGAAGTGGTTCGGAAAGCATCATCCGCAGCAACAGCAACCACAAATAGAGAAACATCAATCTGACTTAGATCTCGATGAGCAGGCCAAGCGAATG 
AGCATGGAGCGCTATACGGATATGTCTAAGCTTCTGCAGGAGCGTTTTGGCTCGAATCCCGAACAGCCAGAGCAAGATGCTATCTCCCTAAACAG 
CACAGCAGCGTCCGACACCGAAACAACGATGCTGACTACCATGGAGCCGAGGTCTCTGGACACAATGATGCAAAAGTCCATGTCCCTGCAGGATA 
TTGAGCTTCTCAATGGTCAATGCAAGCAACGACCTGATTTGCTGGCCAGCCACAATCACAATGACGCCATCAGTGAGGCAGGTACCGAGCTGGAA 
GATGTGGAGAATCAGACCCCACCTCCACAGTCCTTCATCACGGAGAAGCTGTACAACGAGTTTCACGTGAAGACCAGGCAGTATAGCAAGTCATC 
GTCCAGTTTGCACCAGTTACCTCTGTTTAGCTTGCCGCAAAAGTTGCAACTTCACGAGGCTGAAAAGGAGAAGCGACATCTGATGTTGCTAAAGG 
CGCAGCAACAGGCGCTGACCAAGGAACCGGTCACTGGAAGAAGGGCAGCCGGTGGAGTTTCTCTGTTGGGAGGAGGACTCAACCAACTGGCGATA 
TCGCCCGCATCGCTGGTGGAAGACGACTTGCCCTACTCCAGTGTGCGCGATTCGCTTGTGATTTCCGGCGGGGACTGCCTGCGATCGGAGTCCGC 
CAGTAGACCAGTTCCCGCTGTGGACTATACCAAGGAGAATATATACGCCGAAATCTGTCCCTCGCACACAGTGGACACCTTTTCGGGCGCCACTC 
AGTTCACTCATGTGCCCGACGATGATGGAGATTATGCATCGCTCAAGTATTCGGTCAATGGACCACAGTCGGTCAGTCGGATTGAGGTGAATGGA 
GCTACGGCGGCGGTGAAGAGCAGCGCAATCAGCTACCATACTGTGTACCTGGGCGAGGAACCGCTGGGCGAGAGGCACGCTGCCCACATCACCAC 
CGTTGAATTGGCCGGAGAGGATAACATTTACAATACGCTCAAATGATGACTCATGTGATTCCGGCGACGTAAGTTGGCCGGCGCCAAAGGCCTGT 
TTTAAAAACGCCGATGGACAAAAGGGCAAGCAAAAGGGAATTCATGTTACCCATGATCCTACCAAGGCCATCTTATAGTCTTGTACTCGAAAGAA 
ATT G AAG T AT T C T C C AAT G AAG ACT AAT T T GT T AAT T AAC T CAT T G AAAAC AT AC C T T AC CC T AC T T AT T AC C C AC T GT T T T G AC AGG AC T T T AT 
TTTCATGCCATTGGATTACTTTAATATAAACATACATTTTAATAGAGAAAGAGTATTTAATTTATGTTTTGGTTTTATTGTGTATTTCTCAGTTT 
CATTTTATAGCACATGTATATTAACATTGATAAGCAATGGACAACAAACGTTTACCGCATTGTAAATTTTGTCTCCTTTAAATATTTAAAGAATC 
TTAGTAATGTGAGAATGTTCGAGTACAAGACAAAGATAAGAATAATTTTAAATCAGAGAACAATTGAATGACACAATTTATTTGCTGCGAGGCCT 
TGCTTAAGATCGGTAAAATGTTTTGAATAGGCTTTTGGAGCAATCAGCCACAAAATCAAATGAATTTATATTTTATCTTGCTTACAATTTTAAGA 
AATGGTTACAGCAGAATACCAATGTAAGATCATTGTCATATAAGTGAGAGGAGCTCTCAATTTTAGGAAAAATACTTTTCCCTGTCTGCCCTTTG 
AAAAGGCGGTGTGAGAACTTATATTTAATCATTCAATATGGCGCTGATGCACAAAAAACAACAATAGTAGAGAAAATATCTTATTTTGCAATAAT 
ACAAAACTCAAAGCGAATGATGTATGGTGAATTCAAACAGAATTTGAATCTGGGTCAACAAACACCCAAACGAAAGTAATGAAAAATCTCGATCG 
TAACATATAATTACACATACATATATTTGTAGAAAGTAGTATTAGTATAGGGATCAAATTTGTAAATGAGACGTCGCTGTGGACGACGAAAAACC 
CCCTTGAAATGACAGCAGAAATTGTAATATTATTATTAACCAAGAGAAGACAAGCAATTTTGTATTGTAACATACAGATGGGTGCCGCCGGTCCA 
CGACTTTCAATCCATTAAACGGCCAGATCCCTAACCCATACAAAAAAAAATACATCATATTGAAACTGAACATTTGGAAATTGCAATATTATACA 
ATTGTATACATAACTTATATATAGATATATATATACTTATACACATATACTAACGTACATACGATATGGAAATCTTAGTTCAAACCTAAGGTATT 
ATTAGCTCGTAGGCAATCAACAAGTCAAATCAAATCATAAGCGATATACATCTATATATGCATACATTGTATATTTGTAAAATAGCATAAACTTT 
TAGGGCAAAATGCCAAACTTTTACACGATCTCTCGTATTTTGTCGTTCTTTTTTGGAAACCCATTAACTTGGTTTTGTTGGGTTTACTTGTTTTA 
CACCCAACAACTCAACGTTAATCAAACGAATCGATTACATCAAAGTGCGAGCATGTCCACAAACCATATTTGAATCTGTCTAATTGTCCGCTTCG 
T T C GAT G AC CT C ACAT AAC T T TGC AT AC AAC AC AC GT ATT T AAC AT T T AAGT T AT T AAGT G AC TT T AT C GC GC CAT A TAG AT GTGCAGTCT C AAA 
GTAGAGTTTAGACACTAAGGTCCAGTTTCTAGTCTTAATTTGAAAACAAATATGAAGCCGTTTCGAAATGGAGACATGCCAAACTTTTACTTGTG 
TAATGAATGACTTCGGCATGGCCCGAAGGATCCTTCAATATATACACTTTCAATTGGGTACTACATAAATAAATATTTAGCAAG 
(SEQ ID NO: 1013) 

Start ATG: 238 (Reverse strand: CAT) 

MNETLKLSGLGMQRTNIELERQRHLANWLMNSRPHMPGGPAAATTTSAAAATTPTASATKGATTTTATTTNPGSTVGGAGAGPVGGNGVATAAAA 

GAPSNSVAAAAAAACNGGAAVGAHHGHHSAAGARKPKLRRFNSHDTSSNMFSVADFENARLARRNEIELNQRRARRVRGSANNSTYGLGSGSGLL 

GGAHSGYCDALWGSGDYSTGDSKASKGSSEGALTAIEFLERNSLPRVVRILHASKSSNETLQSSSSHGSSTAASSSSSTGANPFGSGGSGGGNIS 

SGNIDTGTHSSSSGGSATGNGTGSGPGLSSGSGSGSGSGHQNNNGRHSSKGGTPPGHDELFLLYRLVRQRNIYHGHNAKTQASQRKKTLLIPQEF 

PGYFSMLSEKGLPTATQYGSLIQLVRERVYKFVSVDNMPAFTESSPNSTSSPTHEGCLPINKGRPQYVKTTARGGQVFRLLAVFEDGKQDQVNMS 

QHGNGNATRSHGSGYMGREKEKNRYAQLLNENRQVLYVPLSTKGKFYEIEPGIPQLLQKLGDAQRKLNPECVHRLSALVTAAKQLPLTLRYISGP 

GGPDIPEQLCISQVSKEDVVVGCSIEDVDTQTPLQLRKFYPSRDIQLVKSYLGFDSEQRMLANTNVQNMLKFCQFNCDPFLKLVEIELIQRSERS 

GSKWREGLRILKPLHFPKILRREKSSMAAAHEKEDSIIFLSKSDLENLEAKEAAASAAAHSESSLNRISERMRVFQSTKKKWFGKHHPQQQQPQI 

EKHQSDLDLDEQAKRMSMERYTDMSKLLQERFGSNPEQPEQDAISLNSTAASDTETTMLTTMEPRSLDTI^QKSMSLQDIELLNGQCKQRPDLLA 

SHNHNDAISEAGTELEDVENQTPPPQSFITEKLYNEFHVKTRQYSKSSSSLHQLPLFSLPQKLQLHEAEKEKRHLMLLKAQQQALTKEPVTGRRA 

AGGVSLLGGGLWQLAISPASLVEDDLPYSSVRDSLVISGGDCLRSESASRPVPAVDYTKENIYAEICPSHTVDTFSGATQFTHVPDDDGDYASLK 

YSVNGPQSVSRI E VNGAT AAVKS S AI S YHT VYLGEEPLGERHAAHI TT VELAGEDN I YNTLK* 

{ SEQ ID NO: 1014) 

Classification: known_f lybase_gene 
Gene Symbol: B4 
FlyBase ID: FBgn0023407 

Celera Sequence No. : 142000013384789 

CAAATTATGCATTTGAATTCTCTAGAAATGAGTACCAAACAACCCATAAATTCAAAATCAATGCTTATTGTACATCTATAAGTATTTTATATTTA 
TTTTGGAATTTATCATAAATTTTGCTTGAATTTTAAATACACATCCATAGATCGTCATTAGCTATTCTCAATTCAGTTTTAAATAGGTCGTAACA 
TAATAAGTTTGACTTGCAAGCAAAACAGAAACGAACAAAATGTATCTAAAAAATGTGAATACTTAAGTGAATCAATATATTTTATATGCTGTCTT 
CATGTACATAGATAATAGATAATATATAACAATTACGATTACAAAAAAACACGACATTTTATGTGAGTGCAGGACAGTGACGGCGGAAAAAGGGG 
GTCGGGACATGGAACAGGGGGGAGGGAGGAGAGGGGGACAGTGGAGCAGTGGGCGTAGGAGCACAGCTTGGATGAAAAATGAATTTCGCTTTCTT 
GTTGTCTCTCTGGTCGAGGCGACGAGAGATAAAGTTCTATAATGTAAGCGTTAAATATATTATTTATATTATGTTTGTTATATATATAAGATCGT 
ATATATGTGAGTGTGTGTGTGTGTGTGTGTTGCGTGTGCGAGGGAGTGGGGTTATTCCTATATCGTTTTAATGACTTAAAGAACTAGAAAACATC 
ATCATCAGCAGCAGCAGCAGCAGCATCATCGTCATCAGTGTCATCCCTTTCGCCTGATTAATCTGGGTTAGTATCAGAATCGAATTATTCCACAG 
GACAGAGACAGACACACAATAATTTAGTCTAGATACTTGTATTTGGAGGCATATATCGTGCGATTAAGATGTCCTGTCTACTTGATCCGCTGCAG 
CTGCGTGTCGAAGGCATCGTGGGCCGCTTTAGTTTTCCTTTTGTTTTTTTGTTTTCTCCTTTTTGTTTGGTTTTTTTTTCACTTTCAGTTTAGTT 
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CCAAGTTCGGTTTTGTTCCGTTTATAAATATTGTTTATGCATTGTGTGTTGTTTCTGTGTTATCATTACAGTAATTATTTGTAATAAAGACGACG 
GCATTCTAGGAATTGTTTATGCACAAAAGCAAATAGTTATTAGTTCCATGCTCCAATGCTAGCTTTAGTTTCATGTTTTTCTTTACTTTCATTTC 
GTGTTAAGCTATTTCAATGTTTATTAAGTGTTCTTTCTCCATCTCCCTCTCTCTGGCGCGGCTATCATCATCTTTTTCTGTCCCAGTTAAGCAGG 
ATGAGTGCTGCTTGAGTTCGTGGTTGATTGATTGATTCTCCTTTTGATTTGATTAGATCCCCGGAAAGCAGAGTGTCACTTAGTACCTGCTGTTG 
TTAAACTGCGAATTGATGGCCGTATTAACGGCCGAGGAGGCTGCTTCCTGGACGTGCTGGTTGCGCAGAAACTCGGTGGTGAACTCCGCCTGGGC 
CTTGGCCATACTGGCCCCTGTGCTGCGGTAGATCGAGTGAATCTTTGTGATCATCAGCACATTGGCCACGGCCACCGCTGTGAAACAGAATGCCA 
CGTTCAACAGCAGAATGCCGACGATAATACCCGACGCTTGCCCATCGAACTGCCCGATGGCCGTAATAAAGCCACAGTAGCCCATCTTGTTGAAG 
CCCACCGCCTGCACAATCGAGTACAGCGTTTGAAAGAAGTAGATGAAGAAGAAGACCATGAAGTTGAAGCTCGAATCGTTGCGGAATGCCTTGTA 
CGCCGGTCGGAACCTGAAAATTAATCACACATCATCACATTATTCATGTTTTAAGAATCTTCCAGGTATAAACTCACCAGCAGACATAGGAGGCG 
GGCGAGAAGAGCATCGTGTAGAAGATGGCCAGGAAGAAGGTCTCGAATTCGCCAGCGTGGAACAGCAAAATTAGACCGCCTATCACGTTGGCCAA 
CAACGTCATGGTGTAAACTGTCGGAGAAAGTATTCAAAATAGACACATACAATCAGCATAATGCTATAATCTGTAGGGGAAAGCATTATTATATA 
CTATATACTATGCATGCAGTAAATAATGCTAGAAATGCATTTAAAAACTATCTTAATACTGATAACATGTAAATCAAACAAATTTCATGCTCCCC 
TCTTTGCAAGCATATTCTGCACAAAGGCACGCACACATCCAAGACCATTAGCCATTCAGCTATATCTGATTAATAATGGCATCGGCCTGTAGACG 
ACACGCGATGACTAATTCCACAACCTATGGCACCGCTTATCAGCGAAAACATGTAGAAAACACAGCCATAAAGAACCAATACTCACAAATCCAAA 
TGTAATACAATCGCTTGACCAGTTTCTGGAACTCGGGCGGTATCTCCACCTCGAAGTCCTGGTAGAAGCACGGCTTTACACAAAAGTTATCCGGC 
AGCGGTGGCCAGTTGTTCAGCTGCGGCACATTGCCCTGTAGCTGCTGCTCGCGGCGGTCCAGTTCGGCGGCCTTGCGGTCCAACTCCTCCTGCCG 
GCGCTGGATACAATGAAAATCATATGATTATCGACTGTTTTATATATACATTTTGTATGACTTAATAAATTGGCCGTATATGGTAAATACTTTAA 

AT C T T AT TC AT T T AC AT AT T T GGG AAAAC AAAGGC GAAAT T C AAT G AT AGG AAT T C AC AT C C AAAT C T T G AGT T T GT T AAAC T T C TAT C T T AAC C 
TCAGGCATTTGGCCTATCTATAGGAAATATTCTGTACAAGAATAAAGTGCAAACTTAAGCCTCCAAATCATATGTATTTAGATCTACTAACTAGT 
ATTTCTTGAATTATTGTGTAGGGCCGAAGAAGCCAAGTAAATCAAGAGAAACCGCAGTTTGCACAGTGAAGACTCGGCGCGGCAGCCGCTTCTAG 
TCTCAACCAGATAAATGCTCCAAAGCGGATATATCGTTGGCAGATCGAGATAAGCCCGCTTATCTTTCTGACTCTTGATGGAGAGTAAAATGTGA 
TATGCCTGCCCTAATCCCGGCACTCTAGCAAAGTGATTACTGTACAGACTTTTAGGGCAAACTTTTAGTGTAACTGTGTGGTGTGGTCCATTTCG 
ATTGTTTCCGAGTATTTGGATGTACATATGCGGTTTTTGGAGTATTGCGGATGTGGCCATGCCGGCACAGCACACCGCAGAGTGCTCACTGTTGT 
GTGGGCGGTAACGCGAGACGACCGGCCAAGTGGGTTACCTGCAGCTCCTCCGAGGTGATCTGTATGCTGGTGCTAGGCGCCGAGGCGCCCAGGGA 
ACTGGTCTGCGGGGGCGGAATGTTCTGCGATAGCGGCTGGACCACCGCCGCCGTGTTCGTTGAGTTGATCTGCAGCTGCGGCTTGGCCTGCTCCT 
CGAATGGATTGTAATCCTCAAGAGAGACAAGGGCGGCGCCGCTCTGGAGTCGTCGTGCCTGCTGGATGGCGGGGTCCTGCGGGGGAAAACCGGGA 
AATCCCAATCAGAAACCAATTCATTCAACATATGTGACGAATGCACCCAACACACACACACACTACACTCACAGGCACGCACACCAACACCCCAC 
CTTTGCGTACGTGTGCGTGTGTGTGCGATGGGTTTTTTCTTCGGCTTCCGCGCTCTCTCTCCTAGCTCTTCTATCGCCTCACCGCAAACGGATTG 
TCCAGATTGGGCTCCCCGAAGGGATTCTCGTCGAGACCGGAGCCGGACATGGGAATTCTACTTTGGATACGGCTGTTCAATGACCAGAAAAACGC 
ACTGACTCGCACTTTGCTTTTCGCTGTGTATATTTTCCCTAAGACTCTGCTCAGTTATCGATGTCGGTTTTAGTGATGGTTCAGCATCGATAAGC 
TTTCGTTGGGAATCAAAGTGTAGGCCTCAAATATCGAAAGAATTACTTTAAAGAATTAATGTTATTTGAATTACATATTAAATTCATTTTGGCAG 
AAAAACTTATTATTTAACACATTTAACATTAATCTAGAATTCTGAAAAATGTGAATAACTGGTATACAATGGTCTGATTTTTTAGGACCTATTAA 
TACATTTTATTAAAATGATAGCAAAAAGAAAAGATTCTATATACAATTGGATGTAAACCAGCAGTTATCATATGAATTTTAAATTTAAATGGCAA 
CAATCGTGCCAATCGGAACTGCTCCAGCTGAACCAGCTGCGTGGAACCAGTTCCCATCCCTGGGGCCGCCATTCCGCAATTGGAATTCTAGTGTC 
TTGGGTTGCGATTCGATTCATTTACCGCTGCCCGCGTCTCCAATTCGGTTGTGTGCGATTCTGTCTGCTCCGCATCCAAGTCAAACATCCAATCC 
TTGCAAGAATGTCGAAGCCCAGCTACAAAGTCGGTGAGTTGCGTTATAAATAAACCCAAACGGATGGCAAACTCTCGAATCGAATCTCCGGCCGG 
CAACAGGCGACATGTGCGGGTGAATTGCATCTATTCATGAGCACATGTTTCTGAGGTTAAAAGAGAGACAAAACAGAGGAGAGCACGTTGTTGCT 
GGGGCAAAAAACGTAGAAATTCAGACCGTATGGTGTATAAATCAACAGTGCCTGCTTATCTGACCCTATCTCCCTTTTATATCCCACTCGACAGC 
CGATATCAGTCTGGCGGAATGGGGACGCAAGGCGATCATTATTGCGGAGAACGAGATGCCCGGTCTGATGGCCTGCAGGAAGAAGTACGGCCCAT 
CTAAGCCCCTCAAGGGCGCCCGCATCACCGGCTGCCTGCACATGACCGTCCAGACCGCCGTGCTCATCGAGACCCTCGTCGAACTGGGCGCTCAG 

GTGGGTTCAGT 
(SEQ ID NO: 1015) 

Exon: 3761.. 3598 
Exon: 3401. .3174 
Exon: 2473. .2272 
Exon: 1917.. 1788 
Exon: 1723. .1001 

Start ATG: 3660 {Reverse strand: CAT) 
Transcript Wo. : CT2 627 6 

AACTGAGCAGAGTCTTAGGGAAAATATACACAGCGAAAAGCAAAGTGCGAGTCAGTGCGTTTTTCTGGTCATTGAACAGCCGTATCCAAAGTAGA 
ATTCCCATGTCCGGCTCCGGTCTCGACGAGAATCCCTTCGGGGAGCCCAATCTGGACAATCCGTTTGCGGACCCCGCCATCCAGCAGGCACGACG 
ACTCCAGAGCGGCGCCGCCCTTGTCTCTCTTGAGGATTACAATCCATTCGAGGAGCAGGCCAAGCCGCAGCTGCAGATCAACTCAACGAACACGG 
CGGCGGTGGTCCAGCCGCTATCGCAGAACATTCCGCCCCCGCAGACCAGTTCCCTGGGCGCCTCGGCGCCTAGCACCAGCATACAGATCACCTCG 
GAGGAGCTGCAGCGCCGGCAGGAGGAGTTGGACCGCAAGGCCGCCGAACTGGACCGCCGCGAGCAGCAGCTACAGGGCAATGTGCCGCAGCTGAA 
CAACTGGCCACCGCTGCCGGATAACTTTTGTGTAAAGCCGTGCTTCTACCAGGACTTCGAGGTGGAGATACCGCCCGAGTTCCAGAAACTGGTCA 
AGCGATTGTATTACATTTGGATTTTTTACACCATGACGTTGTTGGCCAACGTGATAGGCGGTCTAATTTTGCTGTTCCACGCTGGCGAATTCGAG 
ACCTTCTTCCTGGCCATCTTCTACACGATGCTCTTCTCGCCCGCCTCCTATGTCTGCTGGTTCCGACCGGCGTACAAGGCATTCCGCAACGATTC 
GAGCTTCAACTTCATGGTCTTCTTCTTCATCTACTTCTTTCAAACGCTGTACTCGATTGTGCAGGCGGTGGGCTTCAACAAGATGGGCTACTGTG 
GCTTTATTACGGCCATCGGGCAGTTCGATGGGCAAGCGTCGGGTATTATCGTCGGCATTCTGCTGTTGAACGTGGCATTCTGTTTCACAGCGGTG 
GCCGTGGCCAATGTGCTGATGATCACAAAGATTCACTCGATCTACCGCAGCACAGGGGCCAGTATGGCCAAGGCCCAGGCGGAGTTCACCACCGA 
GTTTCTGCGCAACCAGCACGTCCAGGAAGCAGCCTCCTCGGCCGTTAATACGGCCATCAATTCGCAGTTTAACAACAGCAGGTACTAAGTGACAC 
TCTGCTTTCCGGGGATCTAATCAAATCAAAAGGAGAATCAATCAATCAACCACGAACTCAAGCAGCACTCATCCTGCTTAACTGGGACAGAAAAA 
GATGATGATAGCCGCGCCAGAGAGAGGGAGATGGAGAAAGAACACTTAATAAACATTGAAATAGCTTAACACGAAATGAAAGTAAAGAAAAACAT 
GAAACTAAAGCTAGCATTGGAGCATGGAACTAATAACTATTTGCTTTTGTGCATAAACAATTCCTAGAATGCCGTCGTCTTTATTACAAATAATT 

ACTGTAATGATAACACAGAAAC 
(SEQ ID NO: 1016) 

Start ATG: 102 (Reverse strand: CAT) 
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MSGSGLDENPFGEPNLDNPFADPAIQQARRLQSGAALVSLEDYNPFEEQAKPQLQINSTNTAAVVQPLSQNIPPPQTSSLGASAPSTSIQITSEE 
LQRRQEELDRKAAELDRREQQLQGNVPQLWNWPPLPDNFCVKPCFYQDFEVEIPPEFQKLVKRLYYIWIFYTMTLLANVIGGLILLFHAGEFETF 
FLAIFYTMLFSPASYVCWFRPAYKAFRNDSSFNFMVFFFIYFFQTLYSIVQAVGFNKMGYCGFITAIGQFDGQASGIIVGILLLNVAFCFTAVAV 
ANVLMITKIHSIYRSTGASMAKAQAEFTTEFLRNQHVQEAASSAVNTAINSQFNNSRY* 
(SEQ ID NO: 1017) 

Name: SCAMPs 

Classification: transporter 



Celera Sequence No. : 142000013384663 

TGTTTTTATTTAAAGCAAATATACCGGAATTTTGGCTGTGTGTAGTTTTTCTTTTCCAGAATTTTTCCTCTTTTCATTCCATTGCCTCTAGCAAT 
TTCCTCTCCAAGTCGGCACATTTTATGCCTTTAATTACGCGCAGAACTAAAAAATTGCTAGTGGGTGTAATTGTAAAAAATCGGGTCAGCAATAA 
TGCACATTAATTATGGTTTATTGTTGCTGCAAGTGTGTCATCGTGTAGGAATCGAATGAAGTAGTCGGTTCGTTTTTAGTTGGTTCGGGTTGCAT 
TTTTTTGGCTGAGCCAAGAAATTGTTTTTCGATTTATCTTAGTCGTGTTGACATTATTTATTTATCTTTAGCATTTTTATTCGCACGGGGAATGT 
TACAACAATTTGAAGGTTAAAATTTGTGGTATTTGTACGCACCACGCCTAAAGATTTAAAGAGCAACACACAATTTGGTTTGTTTATTTTGGTTT 
CTCGCTCTTTCGTTTCGTGATCGAGATAGCTAGTAGTTCCTTCTTTTCGTTTTGATTTTCGTGGCATTAATTTTGGGTTTTCTAAAAAGCCAAAT 
GGATCTAAAAATAGTTTCACCGATTCTCGTTTATGCTGCCGCTGCTTCAGCACGTGTATATGGGGAAAAATACGGATGTACATATGCGTTTTTCC 
CCTAAAATCTAGAACTTATTTTGGCCAAAATTAGAAATTAACTAAAGTGGAAATATGCGGCAGTCCATTCTGTCGCCCCGCCACTTTCTCTGCTT 
TCCCCAAAAATACTTTTTGACATTCACAGGGAAACCTGTCAGTTTTCCGCTCTCTGAGAGTCATTCCATATCTCTCTCTATTGGTTGCATTGCAT 
TCTCTTCGTTTTCCACTCTCTCTTGTTTGCCACCCCCTTTTTGGTGCTACTGCCCTGCAGTTTTTAAGGTCAGTCGAAAATTTTCCAGCTGAGCG 
T C AAGTT G AAG AAGAACT T AT T CT AAAT T C G GG AAAAAC C CT T TTTCTCCATT T AAT T GC AGC AT T T T AACT GAAAT C AT G AC T G AAG T T G AGC A 
ACCGCCACAGAATGGCATAGATCCCACTGCCGGCGAAGATGATGACAATAGCAAAGCACGTCCCGCGGATATTGAACAGGTAAGTGCAGGCCGTC 
GAAGTCCCAACTGGATAACAACCCCTCCACCAAACACTCACCACCACCACTTTACTTTCTCCTCCTCCAGGATATGCGCGAAATGGAGCGTCGCA 
AGCGAGTCGAGGCTATAATGGGCTCGAAACTGTTCCGCGAGGAGCTGGAGCGTATCGTGGACTCGGCGCGCGATGGAGGTGCTGGCGCTAGTGGA 
ATCCTGCAGCAGCTGTCGGACATTGTGGGCGTACCAGTGTCCAGGGTGGGCAGCGTCTTCAAGAGCAGCAACTGCATGGTGCCAATCAACGATAT 
ACGCGGCGTCGAGTCCATGGGCTATGCCAAGGGCGAGAAGATACTCCGGTGCAAGCTGGCCGCCACATTCCGTCTGCTCGATCTGTACGGCTGGA 
CCCAGGGTCTGGGGGCACAGATCACCGCCCGACTCAAGGTCGATCAGGAGTACTTCCTGGTTAACCCATACGGCCTGCTTTACCACGAGATCACT 
GCCTCAGCGCTGAACAAGGTGGATATGCAAGGACAGATTGTGGAGCAGGGCACCACCAACTTTGGCGGCAACAAGAGTCGTAAGTGGCTGACAAT 
TGAACCTATAATTGAATATATCAAATATATATTTATCTGTTTTTAACTGCCTTGCAGACTTTGTCCTCCATTCGGTGGTACATGCTGCCCGTCCA 
GATATTCGATGCGCCATCTATATCGGCTGCAGTCCAGTCGTGGCTATTTCCTCACTAAAGACGGGTCTATTGCCCTTGACCAAGGATGCCTGTGT 
GCTGGGCGAGATCACCACTCACGCTTATACTGGTCTATTCGACGAGGAGGAGCGCAATCGATTGGTCCGCAGCCTTGGTCCCAACTCCAAGGTGA 
TTCTGTTAACTAATCACGGTGCTTTGTGCTGCGGAGAGACCATCGAGGAAGCCTTCTTCGCCGCCTGTCACATTGTACAGGCGTGTGAGACGCAA 
CTGAAGCTGTTGCCCGTCGGTTTGGATAACTTGGTGCTGATTCCAGAGGAGTCGCGCAAGGCCATTTACGAGCAGTCGCGCCGGCCGCCGGAGGA 
TCTGGAGAAGAAGTTTGCCGCCGTCGCCGCTGCCGAAGATGGTGCTGCTACTGCTGAAAAGGATGCAGCCGAAGCAGTACCCAAGGTTGGCAGTC 
CGCCCAAGTGGCGTGTGGGTGGTGCTGZ^ATTTGAGGCCTTGATGCGCATGCTGGACAATGCTGGCTATCGCACTGGCTATATCTACCGTCATCCA 
CTGATCAAATCGGATCCTCCCAAGCCTAAGAACGATGTCGAACTACCGCCAGCTGTTTCATCACTGGGCTATCTTCTTGAGGAGGAGGAACTCTT 
CCGTCAAGGGTAAGCGAATGCACTTAAATTGTACCTAAACAATTCCCTAATAAAATCTATTTACTTTTAGGATCTGGAAGAAGGGAGATATTCGC 
AAAGGCGGCGATCGCAGTCGGTGGCTCAACTCGCCAAACGTTTACCAAAAGGTCGAGGTTCTGGAGACCGGCACTCCGGATCCCAAGAAGATTAC 
AAAGGTAGAAATCATTACTTTCGAAGATATGAAGCAAACCAAAACCACCAACCTTAAGGAGATTGAGGGAAAATAGTTGAAATTGTTGGACACCG 
AATCCTTC71ACTCTTTACAGACATTCCACTTGCTATCCCCATATATAATTTCCATTTCGTTAGCACTCAGTTTTAGTTTCAACTTTGCCACTAAG 
CTGCAACTATATTCACGCATAAAATTTTTAGAGTTTAAGTAAATGAAAGTAATCAATATACAAATTCTATATATTTATATACTTTGGAAATAGAA 
TTAAAACTCGGGCTAAACCAAAATTATTTTACGTGCTTTGCTAGCAAGCAAACAAATATTGTGTAATGTGCATTAATCAAATAAATAAAATAAAC 
AAGAAATAAAAATTTGCATAGATGATT TTGTGCCTC ATT TAGGGATTTCGCTGCCT C AAAAAC AAG ACAATAGCGCTAAAACAATGCAAAGAAAG 
AGAAAGAAATTTGCTATGAATACTAATCACTATGTAATAAATAAAACATTTTTTAATTTACCGAATCAATGATTTCATATTTAACCGCATTATTT 
TTCATTCATAAAATTTTTCTTTGCATGTTTTATGGTATCTTTCTTTTATTCTCTTTGTGATTGGTTTTCATTGTTATTCAAGGCACAAAATCATT 
TATGCTTGTTATTTTAGTTAATTTATGTTTATTTGATCATTGAAAAGTTTGATATCTAGGAACATATTCTTATGTATGTTAAAAATAAACATTTC 
GAAAAACACTTGCATTGGAATGCCTACGATGTAACATTAATTTTAAGACTGGTGTAAACGAATTCACATAAGATCACACGTGTAAAAATCTATCG 
AACAGTTGAATGGGAATGTTTGTTTATAATTAAGAATCTTCCCATATTGATATGAATCTTAGTATTCACTCCACATTAAAATCAAATTTCTTTCT 
CTTTCTTTTCTATTTGCATTGATCCCACCCCACCCTCAACACACAACACATAATTTACAATATAAAACAAAATATGACAGTGGGTGGCTGAGGGT 
TCCCCCACCCACTCAACGCCAGTGAGGATAGAAGATCCACTCCAGTTTGTGCCTGCAGGAACCAATCCAAGGGAATTTAAGCGTGTCCAGCAACT 
AGTCAGTATTATAATACTCTTAGTCCCAATGAACTTTGTTAACAAGTGGAAGCTTTTTCAAACAGATTAAGGACAACCGCCGGGCAGATAAGATT 
TCAGCCGGACCACAGTCGCATATTCTGGAGGGCGTGACATGGGACGAGGCGAGCCGACTCAAGGATGCAACGGTCTCGCAGGCCGGCGATCATGT 
CGTCATGATGGGTGCCGCCTCCAAGGGAATCATTCAGCGTGGCTTCCAGCACAATGCCACTGTCTACAAGGCACCGTATGCTAAGAACCCATTCG 
ATAACGTCACAGACGATGAACTCAATGAATACAAACGCACGGTGGAGCGCAAAAAGAAATCGGTGCATGGTGAATGTAAGTTAGCAATCATACAA 
AACCAATAACCAAATAGTTACTTATAATGCATACAACAACTTTTGCAGATACCGACACAGACTTTTCGGAATCCGAAGCGGTCCTGCAGGCGGGG 
ACAAAGAAATACCCACAAAGCGAACCCGAGACCGGTATGTTAACTATTTAGAAACTATACTGATTACTGGAAACTAATATTATGTGGATATTCTA 
GAGCACCAGGTCATTGAGATCCAAACACAGCAAGCGCCAGTGCCAAGGCAGGCGGAAGTCGTGCTGAGCGATGGTAAGTCCAGTTTTGTACTAAA 
AAATGTGATCTACTTGTTTTTAATTTTTATATATGATTAAAATTACTTCAATCATGCACACTATAAAACATTTCAAGTACACTTGTCCAAGAGTT 
TTGTCTTACAAGTCATTATGTGATCGAAATTTTATAGTTCTATTTTTTATGTTCCGTTATTTTGTTTATCTCATTTTTATTTCTTTTGTTTTACT 
TTTGTTTTTTGTTAACTATTTGTTTAATAATTTATTAATAATTTAGGTCTTTAAGATGCTGCTATATTTAAAACGTATTTTGCGTAGTTTTTAAA 
TAATAATAATTTCCTTGTAGTATTATTTTCCTAGTTTTTTTTTTTTGGTATTCTTATTTTATTTGATTCCTAACATTTATTAACATTTTTATGCC 
ATTTCCCTCTTTTCGTTTTTTTGCTGTATTTTCGCATTATAAATTTACCAACCACACACAAACGCCTAAAACTACACACATTTCAATGGAAACCA 
ACACTTTCAAATTACTTAATGAAGTCAACAAAAAAATTCTTGTAGCAATTGATACTATCACAACTTCCTTTCTGTAAAATCCTCAAAATTATTTG 
CTGTTTTGTTAAAAATCTTAAGCTTCTAATTAGTTATGTTTTATTAGTTAAATATTGTGCAATATAATTCACATTTACACAATACTTCCAAGGAA 
CCAACTGTTGTACGGCGTTCATAAAATGATTTATAATTTCAACGTTCAACAATACGACAGCACATCAATCCCAAATGTTACAACAAAAAGAAACT 
TTTTATAGTAGAAAAGCTGCAGTTTTAGTTAATCCTTTGTTTTACAAAATTTTCGTTTAATTTTCCGAATTTGGCAATTTGTAAATCAAAATTTG 
ACCTTTCCATTATTTTTACATAGGAAAAAAGAACAAAACCAAATAAATTGGCAAATTGAACAGGAAAAATGCGAATTAAAAGCACTAGTTAGTAT 
TGATAT GAAAGCGAAGT GTGTGTAGGC T AAAAAT AAAAT AAACC AAAAAT TT AT AAC AAAACGAAAGACC AGC AACCAAC AAC AAGAAGCAACCG 
AAGTTCTCAACAGAACACAAAAATTCTACTACTACACTACCACAACTACTACATACTACCTTTTCTACTACTACTTCTGCGGCTGCTGCGCTCTA 
TAACACAAAATAAAAATAACTAAATAATATGCAGTTTAAAAAGAAAAACCCTTTTGAGTTGGAGGTGAGTGCAGCCCCAAAGGTTTAGTGTGTAG 
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TCAAAAAAATAAAAAAAAATAAATAAAAAATCAAGAAAATAGTTTATAGAAAAAATGTTCAGCATAACAAAAACATAAGCCACCTGCACACTAAA 

CCACATTCAACATGTCATATCAACTTTTGCTGAATGCTCTTAGTTGAATCACAATATGCGCTGTTAGCCAAGAAATTAAATTGCAGCTTGCTGTT 

TTGTTGGGTTTTCTCCGTGTTAACCGCAATTTTCCATAAATTTTTGTTTTAAACTGCAGCTGTTTTCTGTTGATTGCTTCTTTAGTTGCCCAGCA 

GCCATTTCGTACATTAGTGAACACAACTAATCCGTATACCATAAGTTTAAGGCATGCTTGATCGATACTAAACTCTAAAACTCATGCAACACTAC 

GTATTTATAGTTTAAGGCAACTGATTACGTCTGTCTCACAATTGTTTAGCGATTTACTTGCGATCGTCTGTCGCTGAATATGATTTTATAATTTT 

ATTTCTCAATACTATTTTAGCCACACTTAGTTTTATAAACGGCGAACGCTCTCACACAGCCACAAATCGCAAGCCGCCAACTGGCCGAGGTAACG 

AACAAGACCCATTTGAAAGATACTCCACAGTACTTACATCCTTCAATGTACTCAGGCTAAACGCCCATTGTATATAATATCGTTGAAACATTATA 

ATACCCCATATAATCCACTAGCAGTTCGTTGTTAATGGTGTATACTCAATATAACCCTTATATAATCCACAAGCCCATGGCAAATTATTGATGCA 

CCACAACTAAACACTAGCTATAGTTTGAGGTTGATATTTACCATTGAAATAATTTATTGCGGTTTTCTTAGTTCATTACATTGGTTTTGGGTGAA 

ACGATACATCGTCAACTCTTTTGCACAGGAAACTCGTTGTTTTAATTCAAATGCCCTTAAAAACTGCTCGTTTTTCCAAGCTCGAGATATTTTTA 

ATAGGTCAGACTTAAGTACAACCAAAAATGGGATATGTTTTTCAGATTTATGGATCTGTAATCCAGTACGTATTGGCATTATGTATGTATTTCGA 

GAGCCGCACTTATCTAAGATGCTGGCTCCGGACTTTTTATAGATTGTCACATATACTTACAATTTTTCGGCTTTTCTTACCTCCCACACCTGCAT 

TTAATGTCGCTTTAACTAGACGATAGTAAGAATAATGAGTATGGAGACGACATGGAGAACCAATTATATGAGCGGAACAGGAACGTACACAATAA 

CAGCAAGTGTCGACGTGATCTGTTTGCAACAAAGCGCAGGGATCCCCATCCATCACCGGAGCGTCTGTATACCTATGTCTATGCTACAGCTGCCT 

CTGTGGGCCAGTGCGTCCAACCGGAGGTGTGTTCCGCTTTGTTGCGGGCTCTGCATAGCGAGCAGTTGGGCTACCTGGCGAGTAGCTATCGCGGA 

GGAGCTGGGAACTCAAATGCCACCACCGCCAGCACATGCACTATAACCAGCGTCACGGGCTCCTCTACCGCATCGCGCAGACCATTGCAAAATGT 

GAATATTATGTATCCGCAGCCGTATCGCACGGGCTCCCATTTTGGATTCAATTCACCACGCCAGCAGCCACGCACTGAGCATAAGCATGTGCCCA 

GGGAGCGAGACTATAATGCCTTGGGCTACCGACCGATGCAACGAGGGATCAGTTACGAGGAGATCTTCGCCACACCTCGCTATGAACAGTTGTGC 

CAGGTGTGTTTCGACAAACTACTGCGCCTTAAGCCCGAGCTGCAAAGGATTGGCAGTGGCACGGCTACTAGTAGTTGTGCGGAGGAGCAGGCAAG 

TACAGGGAGCACTCCCTTGCAACGCCGCCAATTGCCGACTTTGCCTGAGTTATGCGATCATCGTAGGGAAATGGAAAAGTTGCCAGAACCTCCGA 

GGGTCCAACGTAATGCGTACACACAGACACAGAAGTGTTCTCCATCCTCTTCCACCTATAATTTCCTTAGTCAAATCGAAAAGTTTAACTTCTCG 

GAAGAATCGACGGACATAGATTATACTCCACGCCCCATTAACAGTGAAAGTAGCCTGTTGGGACTGATTGTGGAACAGGCAACCTCGAGTGCTGG 

AGCGAGTGCCACAAGTGCTCCACCTCAAAGAATTTACTGTGCAGAACAAGTGGACATTACAGAAGTGTGGAGTTTTGGGCGTTGGTTGGAACCCC 

AGGACCATATGATAGCGAGTAATGCAGATGAGTTTTCCCAACAGCCGCTGACCGAGCATTGCATCACATTGGAGTTTAATCAACAATATTCGACT 

GGCGACAATCAGGTCGATCGCCAATTGATGACACGCGAAGAGCGTCAGCGTCGAAAGTCTGAGTTCCAGGAGCTTTGGCAAGATCATGTCGAGTA 

CTTTGAATCCAAAGAACAGATGCATACCCATGAAAATATCCAGGAAGTCGCGACTCTTAGTGCTTATGATTGCTTAGACCAGGATATAGGCGAGA 

TTGAAGACGGATTCAATGATGGAACAATCGCTGATGCAAGTGATGAGTTACCGAACATTCTGCAGATGTTCGAGGATAGTCTTACATTAGCCGGC 

GAAAACCTAGAAAAGCTGGTGGAAAGTACCAAAAAACTACAAGCAATTCACAGTGAAACATCACCAGCCTCTTCCAATAGTTCCCTAGCCGAAGT 

TTGTAATCTCTTTCGGTCGAGAAGTTTCGAGAGCATTCCTGCTATCGATGATACTGTGTCTGTACAGGAACCACTTTACGATTTGCCCATAAATT 

C AAGT AT C G AAG AAG AT G AC GGACT GC AAC AT AAT T T T GT AC AGG AGG AAAAGC GTG AC G AC G AAAAAC C T C TT AC T C C C AT T T C AAC GG AGC C A 

GAGGAACCTTCGGATAAGGAGAAGGATCTTGTCGACGAGAAAGACATAGACGAAAATGAAAATCACGTGGAGGCTGAGTTCCAAAAGGCTCTTGG 

CGAAACAATGCAAGTAACCGTCCTTATGCCAATGGAACACAAAACCCCCCTGAAAGATACAGATGAAAATGATCGTTTGGTTGAGACTGAGATCC 

GAGATCCTCAAGAAGTTAGCTCCTTAGAGACAGATAACCAGTTTTTCACACCGCTGGAACGCGAGGTTCTGCATCGGATCGAGAAAGAAAGACTT 

GACGAGGCGGAGCCTATATTTGGAAAGATTGACCATAGTATTCGCAACAAGATGACGCCAAAGAAGCTGCAAGACTTGGTCAGCGAAGAGATATT 

TCGCACACAAACGCATGTATACACGAATCTAAGGACACCGATATCGAAGAATATACAGCATGACACCACTTTGACTCCCAGTGGCTCATCCCTGT 

CAGCTCATAATATCAGTGCACCGTGCAGCTCAATGGCGCACACTCATCTAGTCACCCGACCAACACCCCGAATGTCGCGATCCTGGTCCGAAGAT 

CAGCAAAGTGAGGATGACGATCAAAAGAAGAACGAAGCGGGAGAACGGGGATACATTGACAATGACACTCCCATAGATCGAGTCATCTCGTCTAC 

AAC AT T T GT GT G C AG AAG AAGT C C AAC C AG AAAGC GC AAC T T T ATT C AGG AG AAT AT T CGC AAT GC C AGC AG A C C AC G AAT TGCAGCCGGTTCGA 

TTAGAAAACATAAAAGCAACATCGTAACTTCCCCTAGAGCCAAATCAAGCCCATCCTCGGTGTATACTTTTCAGTGTGGCCAACGGGATAGCGGA 

GCACGCCTATGGGTTGCCTTGCCAACGTCTCCACGTGGTCAAAACGATGAGAAATCGTCTACTGCAGCTCGGGCTTTGTACACCGTTCGGGGTAA 

TCCCCCATCACCATTTGCCAAACGTCGTAGACCCTTGGTTATCCTTCCACCAATGCGTAATAAAAATAAAGCCCATAATCGCGAGATGAGCTCAT 

C T AA T G AATC T C T GC AC AG AAC TC T T T C C AGT AGC AC GT T C AAT GT GGAC GAT GAT G AAG ATC C T GC AAAT T C G GAT AAC AT T T C AAT GAG T AC C 

TTTCAGTTGGAGGAAGGTCAAGAAGTATCATCTCCTGCAACTCAGGATAATAGGAATGGGACTAACAGTTCTGTGTACTACAGCGCAAACGATAC 

CACGCGAGAAGAGAGAGAATCTTCACAGCCAAGCAGGGACGAAGCAGAAGAGAATCTTGGAGTCATGGAATTTATGCCCGAATCAATTCCACTTT 

TGGAAATACCAGATATCTGTAACCAACTCGATGAAGACGGATCATCGTTAGAACTCGCACGCTTAGCAGGCGGATTTGAAGTGGGTACAACAGAA 

GACGACCAAGTCGAAGAAGTGGAACAACAGAAGAAGAGCATAGATAACGAATCCAAAAATGTTCTGCCTATGGGAATAGAAGAAATTAATGGCGT 

TCAGGACAGCATTTGTTCCATGCAGGCGGATATTGATAAGGATGATATTGTTAATACCGATGATCAACCCTCGACTTCAAAAGCGGCAATGAAAA 

AAATTCGCATATATGGTGGTGCGGAGCCCGAAGAGACTCCTCATTGCGAAGATGAGGATGTGGTGACGCTGGAAACGGTTTGCATTGTGCGTGGT 

C T G GAG AT GTTTGTGC G AGAG GTG GAG C C CAT AAAGC AAG AG AC T T C AG AGG AAC T AT C AG AGC TT GGG GG AC T T T T T GAAAAT GGGT T AAT T G A 

TGAAGGTGGCGAGGAGGCCATGTCCCAGTAGCAGTGAAACCAGAAGTTGATGAAGAATTTATGAAAAAATATTGAGGAACGAAAGGATAACACTA 

ACTTTGAACACTTTTTTTTTTGAACATTTTATATTCGTTTATGATTTTAACATTGCTTGTCATTACCATTTTTTGTACTATTTTTCATATATTTA 

ACATTGTAACATTGAAAACTTTCTTCTTTTGTAATGTAATTTATAGAACCATTAAAAATGTGTGAATTATTGGTTTTGTACTCAGCAAATTTATG 

TTAAGTTATAAAGGGTATGGTGTTTATCTTTACAATTATGGGTTAAAAAAAATCGAACATTATGGGTTTATATTAATAATAAGACATTGTTTAAA 

TTTAATTATTAGTAGTAAATATTCCATCATAATGTCTTAGAATATGTCATCCAGTGTTTCACGTCCCTTGTCCTGCGTCCAGTGTGTAAATTATG 

TTGGTCGTGATAATTTTTGGAGGTTCAGGTGCTCAGAATCATACTTACATTACTTTCTATATCCACATCACCCGCCATTTGATATATATCGATTC 

TAGCACTTGTCTCTCAACTGGCTCAAAAGTATGCGTTCCTTTATTCGCCTGGCCAATACATGTATGCCTGCATGAAAATGGCTCCATTAATGCAT 

AAGGTTTATGTTATACACAAGGTCGAGCCCGTCAGCAAGCACAACTATCCGCCAGTTAACGATGGCAATATGAGTATTCATCACAACGAAAGTGG 

AGCCGGATTCATGGCCCAAGAGTCCAGCGTTATTAGTAGCACCCCCGTACGCAATGCCTTGGCATCTGTATCATTGCCGGAAGAGAGGAATCATT 

CTATTTTGGGCTTATCATCGACTCCGTATCGCACCATCTCGCATTTCGGGTTCAATTGTCCGCTTATTACGTCACCAACCATTCTCCTGCATCCC 

GAACATCGATCCATTTGGCAACGGGTGGCCGAGCAGCGCGAGAAGGTTGTGTCCTTCATAGATCTCACAACATTGAGCTTGGACAACCGCAAGCT 

GCTCAATGTGGTGACATCAACGCATCCAACCCAGTGCCAATCGCAATCGCAATCCTTTATCTCTGAGAAGCACATTCAGCTGGAAGTTACGCCAC 

CGAAGCGAAAGCAGCGTGTCTACAGCGCCACCATCAGCAGTGGTCTAGACGACAGCCTCGATGAGCTGGACTCCCTGATGAGTGGCCTAGCCATC 

AATATGCCGCGCAGTCGGGAGCAGGACAGCGGTCTGTACCGCAGCTACACCTTTCTACCCAGTAATCATGCTCTGCCCAAGGATACAGATGCCAA 

TAATAGGGATCAAACGGATCGGGAGCGACCAGAAGCGGAGCAGGAGGAAAGTTTTCACTGCGCAGGCGACAGTGGTATTGGCGATTCCACCGGCA 

GGCGTCCGCGTCTGGCCACCACCAGCAATGATTCGTCTATTCAAGAGGCCGAAGCCTATACCCAGGGCAAGCACGTCAAGTTGACCTTGAGCAGC 

TCACCCACGCCGACGGCCACACAGTCACCTGCAACCATCGAAATTCTTATCAACGTATCGCTGCGCAATGCCGAGTGCGTCCAAACGGTCCAGAC 

CCATGAGCAGGAGTTTCGCGCCAAGTTGGAGCGAGTAATCGACGAGGAGATCCACTATATTTCGCAGCAACTCGCATTTAAACAGCGACAGGCAG 

AGTTGCATGAACAACAGACTACATCGCGGGCTCCAATCGCTACGCCTTCTTTCACAACTATGCATCCGCCGGCACCGGCTTCTTCATCATCCATG 

GTTCACCGCAGCAACTCGGCACCAGAACTGTGCCACACCTATAGCTATGTGGCCGTGGGCGATTTGTCGACCAAACAGGATCAAGCCTCTCCCCA 

ACTGCCAGCGGAGGGTGAGCCGTTGAATGATATACTCAGTAGTTTGGAGAAGGAACTGGAGCGTCTGCTGAATAGCGTGGTGACAGCCCACATGC 

TGCACAACAAGGCCATCATACACGAGTGCCGGGCTCGCTTCTCGCAGCTGGCCGATGGAATTGTCAGCTCGTAGATACATATTATGATTTTGATG 
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GAAAATTTACATCTTCGCTTTTGTTGTGATATTTGAAAATCGACAACAGTGAAATGATTGTATAAAGCAACTAGATTAAGTTTAATGATGATAAT 

TTGGAAATCTTCAAATCGAACTAATGCATTTTTTACTTCTTTGTTCTCTTAATTTAAATGTTAAGACTGTAACTTAAGCACAATATGAAAATGCA 

AAGTTAAAACTGAATCTGAGCCAACCATTGTACATTTAAACATCTGTGAAATTGTTAAGTTTAATTTTTGTCGAATCTTGATTTTCGTTTCATAT 

TTTTATAATTAAAATATAATAAATATAGTTTTCATTATGTTTAATGTTTTTAAGAAAGTTTTAGGTTAAGTGTTCATACTTACGAAACATTAATT 

TCAATCAGTTAACTTTAATCAGCGCCCAGGTAAAATAATTTCATTTATTATTTATGTTTGTGTTTTGTACTGCATGTCTGCTTGCTTTTTAACAT 

ATATTGCTGATCCTGCTGCATGTCCTCGTGTTTGAATAGCGAATCCTAAACTCCATCTATGTACTTATACATCCACACCAATCAAAACATTTAAT 

ACATGTACATGTTTTAGTTAGATTATTATTCGTGCGTGTCTCATATGGTGAAAGTAATTACGAAATTAATATCAAATATTTTGTGTGCGTCCACT 

GAGTTACATTTGTAGGAGACCCATGCCAAATTACACTGCTATACAAATTACTAACGACATTTTCGACCCTTTTGCAACAGGTGAAAACGTACAAA 

AT GG C G AT C ATT C GGAGG C AC AC T TG AGC AC C T T C T C GC AG AG C AGT AAAGAGGT AAGT AC AT C C GAAC T GAT C C T GC G ATC C AAGAT C T ACGG A 

TCTCTTCTTCAACTCGAACACTCACAAGCAGTAAGCTGGGTGGTAAGCGATTAGCAAGTAGAGTTGCTACATCGACAAGACAACCAGTTTTAAAC 

AACATCATCATGCATTGAGCTAACCTCATTTGATTTACTAACCGAAGCGCAACCGATATTCTCACTGCAAATTTCTATTACGGTTCCAGGATGTC 

TCCACGGACGGATCCCCCAAGAAAGACAAGAAGAAGAAGAAGGGTCTGCGCACACCATCGTTTTTGAAAAAGAAGAAGGAGAAGAAGAAGGCCGA 

GGCCTAAGCTGACCATCAGCGTGATCCGTAGGAGTAGCAATCTTGACCAACCTCCCACCATCAGCGGGAACCGAGAAAAATAACGACAATGATAC 

C AC GAG AAAATG GAAAC T T T AGC C AG ATC AGC AAAAG AAGC AT T AT AT AAAAAAAAG AG AG AGG C C TAAAC AAGC AAC GG AT G AGC A 

{SEQ ID NO: 1018) 

Exon: 1001.. 1124 

Exon: 1211.-1694 

Exon: 1768.. 2479 

Exon: 2541.. 2664 

Exon: 3691.. 3801 

Exon: 3866.. 4160 

Exon: 4229.. 4309 

Exon: 4372.. 4443 

Exon: 10929.. 12767 

Start ATG: 1029 

Transcript No. : CT26485 

AT T T AAT T GC AGC AT T TT AAC T G AAAT CAT GAC TG AAGT T GAG C AAC CGC C AC AGAAT GGC AT AG AT CCCACTGCCGGC G AAGAT GAT G AC AAT A 

GCAAAGCACGTCCCGCGGATATTGAACAGGATATGCGCGAAATGGAGCGTCGCAAGCGAGTCGAGGCTATAATGGGCTCGAAACTGTTCCGCGAG 

GAGCTGGAGCGTATCGTGGACTCGGCGCGCGATGGAGGTGCTGGCGCTAGTGGAATCCTGCAGCAGCTGTCGGACATTGTGGGCGTACCAGTGTC 

CAGGGTGGGCAGCGTCTTCAAGAGCAGCAACTGCATGGTGCCAATCAACGATATACGCGGCGTCGAGTCCATGGGCTATGCCAAGGGCGAGAAGA 

TACTCCGGTGCAAGCTGGCCGCCACATTCCGTCTGCTCGATCTGTACGGCTGGACCCAGGGTCTGGGGGCACAGATCACCGCCCGACTCAAGGTC 

GATCAGGAGTACTTCCTGGTTAACCCATACGGCCTGCTTTACCACGAGATCACTGCCTCAGCGCTGAACAAGGTGGATATGCAAGGACAGATTGT 

GGAGCAGGGCACCACCAACTTTGGCGGCAACAAGAGTCACTTTGTCCTCCATTCGGTGGTACATGCTGCCCGTCCAGATATTCGATGCGCCATCT 

ATATCGGCTGCAGTCCAGTCGTGGCTATTTCCTCACTAAAGACGGGTCTATTGCCCTTGACCAAGGATGCCTGTGTGCTGGGCGAGATCACCACT 

CACGCTTATACTGGTCTATTCGACGAGGAGGAGCGCAATCGATTGGTCCGCAGCCTTGGTCCCAACTCCAAGGTGATTCTGTTAACTAATCACGG 

TGCTTTGTGCTGCGGAGAGACCATCGAGGAAGCCTTCTTCGCCGCCTGTCACATTGTACAGGCGTGTGAGACGCAACTGAAGCTGTTGCCCGTCG 

GTTTGGATAACTTGGTGCTGATTCCAGAGGAGTCGCGCAAGGCCATTTACGAGCAGTCGCGCCGGCCGCCGGAGGATCTGGAGAAGAAGTTTGCC 

GCCGTCGCCGCTGCCGAAGATGGTGCTGCTACTGCTGAAAAGGATGCAGCCGAAGCAGTACCCAAGGTTGGCAGTCCGCCCAAGTGGCGTGTGGG 

TGGTGCTGAATTTGAGGCCTTGATGCGCATGCTGGACAATGCTGGCTATCGCACTGGCTATATCTACCGTCATCCACTGATCAAATCGGATCCTC 

CCAAGCCTAAGAACGATGTCGAACTACCGCCAGCTGTTTCATCACTGGGCTATCTTCTTGAGGAGGAGGAACTCTTCCGTCAAGGGATCTGGAAG 

AAGGGAGATATTCGCAAAGGCGGCGATCGCAGTCGGTGGCTCAACTCGCCAAACGTTTACCAAAAGGTCGAGGTTCTGGAGACCGGCACTCCGGA 

TCCCAAGAAGATTACAAAGTGGGTGGCTGAGGGTTCCCCCACCCACTCAACGCCAGTGAGGATAGAAGATCCACTCCAGTTTGTGCCTGCAGGAA 

CCAATCCAAGGGAATTTAAGCGTGTCCAGCAACTAATTAAGGACAACCGCCGGGCAGATAAGATTTCAGCCGGACCACAGTCGCATATTCTGGAG 

GGCGTGACATGGGACGAGGCGAGCCGACTCAAGGATGCAACGGTCTCGCAGGCCGGCGATCATGTCGTCATGATGGGTGCCGCCTCCAAGGGAAT 

CATTCAGCGTGGCTTCCAGCACAATGCCACTGTCTACAAGGCACCGTATGCTAAGAACCCATTCGATAACGTCACAGACGATGAACTCAATGAAT 

ACAAACGCACGGTGGAGCGCAAAAAGAAATCGGTGCATGGTGAATATACCGACACAGACTTTTCGGAATCCGAAGCGGTCCTGCAGGCGGGGACA 

AAGAAATACCCACAAAGCGAACCCGAGACCGAGCACCAGGTCATTGAGATCCAAACACAGCAAGCGCCAGTGCCAAGGCAGGCGGAAGTCGTGCT 

GAGCGATGCACTTGTCTCTCAACTGGCTCAAAAGTATGCGTTCCTTTATTCGCCTGGCCAATACATGTATGCCTGCATGAAAATGGCTCCATTAA 

TGCATAAGGTTTATGTTATACACAAGGTCGAGCCCGTCAGCAAGCACAACTATCCGCCAGTTAACGATGGCAATATGAGTATTCATCACAACGAA 

AGTGGAGCCGGATTCATGGCCCAAGAGTCCAGCGTTATTAGTAGCACCCCCGTACGCAATGCCTTGGCATCTGTATCATTGCCGGAAGAGAGGAA 

TCATTCTATTTTGGGCTTATCATCGACTCCGTATCGCACCATCTCGCATTTCGGGTTCAATTGTCCGCTTATTACGTCACCAACCATTCTCCTGC 

ATCCCGAACATCGATCCATTTGGCAACGGGTGGCCGAGCAGCGCGAGAAGGTTGTGTCCTTCATAGATCTCACAACATTGAGCTTGGACAACCGC 

AAGCTGCTCAATGTGGTGACATCAACGCATCCAACCCAGTGCCAATCGCAATCGCAATCCTTTATCTCTGAGAAGCACATTCAGCTGGAAGTTAC 

GCCACCGAAGCGAAAGCAGCGTGTCTACAGCGCCACCATCAGCAGTGGTCTAGACGACAGCCTCGATGAGCTGGACTCCCTGATGAGTGGCCTAG 

CCATCAATATGCCGCGCAGTCGGGAGCAGGACAGCGGTCTGTACCGCAGCTACACCTTTCTACCCAGTAATCATGCTCTGCCCAAGGATACAGAT 

GCCAATAATAGGGATCAAACGGATCGGGAGCGACCAGAAGCGGAGCAGGAGGAAAGTTTTCACTGCGCAGGCGACAGTGGTATTGGCGATTCCAC 

CGGCAGGCGTCCGCGTCTGGCCACCACCAGCAATGATTCGTCTATTCAAGAGGCCGAAGCCTATACCCAGGGCAAGCACGTCAAGTTGACCTTGA 

GCAGCTCACCCACGCCGACGGCCACACAGTCACCTGCAACCATCGAAATTCTTATCAACGTATCGCTGCGCAATGCCGAGTGCGTCCAAACGGTC 

CAGACCCATGAGCAGGAGTTTCGCGCCAAGTTGGAGCGAGTAATCGACGAGGAGATCCACTATATTTCGCAGCAACTCGCATTTAAACAGCGACA 

GGCAGAGTTGCATGAACAACAGACTACATCGCGGGCTCCAATCGCTACGCCTTCTTTCACAACTATGCATCCGCCGGCACCGGCTTCTTCATCAT 

CCATGGTTCACCGCAGCAACTCGGCACCAGAACTGTGCCACACCTATAGCTATGTGGCCGTGGGCGATTTGTCGACCAAACAGGATCAAGCCTCT 

CCCCAACTGCCAGCGGAGGGTGAGCCGTTGAATGATATACTCAGTAGTTTGGAGAAGGAACTGGAGCGTCTGCTGAATAGCGTGGTGACAGCCCA 

CATGCTGCACAACAAGGCCATCATACACGAGTGCCGGGCTCGCTTCTCGCAGCTGGCCGATGGAATTGTCAGCTCGTAGATACATATTATGATTT 

TGATGGAAAATTTACATCTTCGCTTTTGTTGTGATATTTGAAAATCGACAACAGTGAAATGATTGTATAAAGCAACTAGATTAAGTTTAATGATG 

ATAATTTGGAAATCTTCAAATCGAACTAATGCATTTTTTACTTCTTTGTTCTCTTAATTTAAATGTTAAGACTGTAACTTAAGCACAATATGAAA 

ATGCAAAGTTAAAACTGAATCTGAGCCAACCATTGTACATTTAAACATCTGTGAAATTGTTAAGTTTAATTTTTGTCGAATCTTGATTTTCGTTT 

CATATTTTTATAATTAAAATATAATAAATATAGTTTTCATTA 

{SEQ ID NO: 1019) 
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Start ATG: 29 



MTEVEQPPQNGIDPTAGEDDDNSKARPADIEQDMREMERRKRVEAIMGSKLFREELERIVDSARDGGAGASGILQQLSDIVGVPVSRVGSVFKSS 
NCMVPINDIRGVESMGYAKGEKILRCKLAATFRLLDLYGWTQGLGAQITARLKVDQEYFLVNPYGLLYHEITASALNKVDMQGQIVEQGTTNFGG 
NKSHFVLHSVVHAARPDIRCAIYIGCSPVVAISSLKTGLLPLTKDACVLGEITTHAYTGLFDEEERNRLVRSLGPNSKVILLTNHGALCCGETIE 
EAFFAACHIVQACETQLKLLPVGLDNLVLIPEESRKAIYEQSRRPPEDLEKKFAAVAAAEDGAATAEKDAAEAVPKVGSPPKWRVGGAEFEALMR 
MLDNAGYRTGYIYRHPLIKSDPPKPKNDVELPPAVSSLGYLLEEEELFRQGIWKKGDIRKGGDRSRWLNSPNVYQKVEVLETGTPDPKKITKWVA 
EGSPTHSTPVRIEDPLQFVPAGTNPREFKRVQQLIKDNRRADKISAGPQSHILEGVTWDEASRLKDATVSQAGDHVVMMGAASKGIIQRGFQHNA 
TVYKAPYAKNPFDNVTDDELWEYKRTVERKKKSVHGEYTDTDFSESEAVLQAGTKKYPQSEPETEHQVIEIQTQQAPVPRQAEVVLSDALVSQLA 

QKYAFLYSPGQYMYACMKMAPLMHKVYVI^ 

PYRTISHFGFNCPLITSPTILLHPEHRSIWQRVAEQREKVVSFIDLTTLSLDNRKLLNVVTSTHPTQCQSQSQSFISEKHIQLEVTPPKRKQRVY 
SATISSGLDDSLDELDSLMSGLAINMPRSREQDSGLYRSYTFLPSNHALPKDTDANNRDQTDRERPEAEQEESFHCAGDSGIGDSTGRRPRLATT 
SNDSSIQEAEAYTQGKHVKLTLSSSPTPTATQSPATIEILINVSLRNAECVQTVQTHEQEFRAKLERVIDEEIHYISQQLAFKQRQAELHEQQTT 
SRAPIATPSFTTMHPPAPASSSSMVHRSNSAPELCHTYSYVAVGDLSTKQDQASPQLPAEGEPLNDILSSLEKELERLLNSVVTAHMLHNKAIIH 

ECRARFSQLADGIVSS * 
(SEQ ID NO: 1020) 

Name: ADDUCIN 

Classification: actin_binding 
Gene Symbol: hts 
FlyBase ID: FBgn0004873 



Celera Sequence No. : 142000013383756 

CGGATTGGGTCTGCGCAATCCTGCCGATATTACGCTCAACGAGTACAACCGGAGCGAGGGTAGCAGTGCCGAGGAGCTGCTGCGACGAACTCCAC 
TGAAGATCCGGGCTCCCGAGATGCTAACCGCACCCGCTGGTTATGGAACGGGTGCGTAGTAGATTGATTCGAAGCAAGAAGACATTTCAATAATA 
TATTTTATATACTATTTCAGAACCCTGTCGCATGACACTTAAACAGGAGCCAGAGACTGGTTACTAGAAGAATAACGAACGGTGCAATATGCAGT 
TTGCAATAGGACACCCCTTAAGCACACAACCCATACACATACAGGCCCTCTCTTGCTGTACTCCCCACCAAGTGCTATATAGAGATGAAATTGAA 
ATGAAGAACTTACTTAATTGTTATGCCTTGAACCATTTTGATACTTTTTATTAGTCCTAAGTAGGTATTTTGGAAATTGTTGCTTAATTTTTAAT 
GTTTAACGCAGTTGCAATATATTTTTGGAGTCATATTTTGCTCAAGAAGTTTATTATATACAATTATACTATATATATACACCATTTAGCATGTA 
CTGAGTTTGTTGGTTATTTGGTTATCTTATACTTGTGCGTGGATCACAAAACATTCATATAAGGCCATGCAATATATTGTTTTAGGTTAGGGTGT 
TGTCTAGATTATGCTGAAAGTGTAATATATATTTAATTTTAAACAAAGAACTATTTTTATATGAATATGTATAATATACAAACTATTTCTAAGAT 
GGCTGAATGAATAAACAAAATCAAAATAAACACTGGAGAATATGAATGTAAATCGTTTACTGCGGCATTTTTACTAGTAATAAATATTACTATAA 
TTTATAATTTATGCCATCTCTTTCAGCAACAACTTTAAATATCCAAACTGATAAAGCGATATATGTTCCTATGCTCCCACTTAGAACGTAACCCT 

G AT C C T GC AT AGAAAAAAT T C AAT AAC GT AT T AC AAAGACT AACT AACG T T TAT T T AT C GAT AT T GC T AAT GCCCGTCTTCTC G AC CT G GGC G AG 
GAGCTCTTGGAGCTTGTTGTCGAAGAGCTCGCATCGAATGGGCATCTCCAGGGAGTAGAAGGGGTTCTTGAGCACGTAGTCTGAGTACAGCTCGT 
AAACCTTTCGCAGCAGCAGATCAATGCCATTGAGACCCGTTTCCGAGATTATAATGAACTTGATTCCGGTCAGCGTTTGGAAGCAGTGCAGCGTG 
AAAGTGTCCGCCTCTAGGATCTCGATGCCGGAACTCTTGGGCTCCGGACTCAGTTGGCTGGCGATGGCGAACAGCGGGTAGAACATGCTGGCCAG 
AAAGATTTTCTCGTTGGTGGTCATCTTTGGCCGGCTGAACTTCAGGTTAATGGGATAGTTCTCCGGTGCATCCAGTGTCGTCCTCACATCGCGTC 
CATCATCCAGGGTGACTCCATTGACGGGCATACCGTTGACCGCCACCAGGACATGGCCCACTGTTCGTAAGCATATATTAAGGTGGTTATGTTTC 
TAAAATTCACAGACACACCCACCATTTATTCCATCCTTTCTGTTGAACGAGACCGATACCTTCTTGGAATCGTAGTCCAGCACCAAGTCCAGAGG 
GTATGTAAAGGTCTTCTCGTGCTCAATGCGTGGCACATTGTTGTCCAGGTTGAATATAAGGCCACCCGATTTACTAACTATATAGACGCCATATA 
TAATCATTTCTTACTATTACTAATTCTAAGTTTCCTAGGAATGTTTTTATATTTTATGCGAATTAGCGATGGTAGTGCCAGCCAGCTGTTTGCCG 
CGGAACCAGTTCCAGTTCCGGCTTAGTTGCCATTAACAGTACATTAACATTGCGCCTTAACAGTTGGGTCCATCGAAGAAATTATAATTATTAGG 
ATAAATAAACCATTAAATTATTATAAGAAGCATTATTTAAAATTTAGGAATAAAATGGGAAGTAAACTCTTTAGCCAGAATACTAAGAATACTTA 
AAAAATATTTACAAGTTCAACCAAAAATCTAGTTCTACCAGGGCGGAGTAGCTCTTTAAACTGATTCCATCTTGCCTATCGCGTTCATTTAACTA 
GTTCACGGCCGAACAGCTGTCTCGGCATTAGAATTAAAAATAAACATAATTGATTTTCTGCCCGAAAGTATACGTAAACGCGTAGGCGCTCCCAC 
CTGGCAGTGCACCCGATTGACCGACTTGCAACCTCCACGGCCACCGATGGGCTCCGACGACCAGAGTGCCGGGGACAGGATCCAGAAGGGATTCC 
AGATCAACTATATGATTCTGCGGGATGCCGACAGCGGAAAAATAATCTGGCAGGAGAACAAGGACTTCTCGGCACCGGACCAGGAGCATGAGGCC 
CGAGTGCCGGTGAAAATACTCGACATGCGTGCCGTTTCCCGGGAGATCAACTTCAGCACCATCGAGTCGATGGAGAACTTTCGCCTGGACCAGAA 
GGTGCTCTTCAAGGGCCGCATCATGGAGGAGTGGTTCTTTGAGATGGGATTCGTGGGTGCCAACACCACCAACACCTGGCAGTCGACAATCGAGG 
CGGCACCCGAATCCCAGATGATGCCTGCCAAGGTCTTGAACGGCAATGTGACTATCCAAACCAGTTTCTACGACAACGAGACACTCATCACCAAA 
TCGGTTGTGCGCCTGTACTACATTTAGGCAAATGTGAGTCACCTGCCGTTCTGCTGA 
(SEQ ID NO: 1021) 

Exon: 1717.. 1543 
Exon: 1485.. 1001 

Start ATG: 1717 (Reverse strand: CAT) 
Transcript No. : CT26491 

ATGATTATATATGGCGTCTATATAGTTAGTAAATCGGGTGGCCTTATATTCAACCTGGACAACAATGTGCCACGCATTGAGCACGAGAAGACCTT 
TACATACCCTCTGGACTTGGTGCTGGACTACGATTCCAAGAAGGTATCGGTCTCGTTCAACAGAAAGGATGGAATAAATGTGGGCCATGTCCTGG 
TGGCGGTCAACGGTATGCCCGTCAATGGAGTCACCCTGGATGATGGACGCGATGTGAGGACGACACTGGATGCACCGGAGAACTATCCCATTAAC 
CTGAAGTTCAGCCGGCCAAAGATGACCACCAACGAGAAAATCTTTCTGGCCAGCATGTTCTACCCGCTGTTCGCCATCGCCAGCCAACTGAGTCC 
GGAGCCCAAGAGTTCCGGCATCGAGATCCTAGAGGCGGACACTTTCACGCTGCACTGCTTCCAAACGCTGACCGGAATCAAGTTCATTATAATCT 
CGGAAACGGGTCTCAATGGCATTGATCTGCTGCTGCGAAAGGTTTACGAGCTGTACTCAGACTACGTGCTCAAGAACCCCTTCTACTCCCTGGAG 
ATGCCCATTCGATGCGAGCTCTTCGACAACAAGCTCCAAGAGCTCCTCGCCCAGGTCGAGAAGACGGGCATTAGCAATATCGATAAATAA 

(SEQ ID NO: 1022) 

Start ATG: 1 (Reverse strand: CAT) 
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MIIYGVYIVSKSGGLIFNLDNNVPRIEHEKTFTYPLDLVLDYDSKKVSVSFNRKDGINVGHVLVAVNGMPVNGVTLDDGRDVRTTLDAPENYPIN 
LKFSRPKMTTWEKIFLASMFYPLFAIASQLSPEPKSSGIEILEADTFTLHCFQTLTGIKFIIISETGLNGIDLLLRKVYELYSDYVLKNPFYSLE 

MPI RCELFDNKLOELLAQVEKTGI SNI DK* 
(SEQ ID NO: 1023) 

Classification: hypothetical 



Celera Sequence No. : 142000013384591 

TTTAACATTTGAGTATTATTTCTGCAACTTTTTAAACTCATTCTCAGTTTTTTGTCTCAATTGAGTGTTTTGATTGAGTGGGTGTTTTTAACTCT 
CTTAGCTGCATATTCCTCCCTTAAATGCAGTTTTTTTTAGCTCAACTACCAATTGAGTGAATGTTAACGGAATGTTAACTGCAATCTCAAGACTC 
AAGCGCAGTTCTGAGTGAGTCAAATATGTGGCTTTGAGTGGAACAGACTAAGCTCATAAGAGTCATGAGCAGTATTTTGACGATTCTTTTGCCCA 
ACCATGATATAAAATAATAATGACATATTTATTTTCAAAATGTTGGTATAATAATAAGCTATTAACTATCAAATAGCTATTGATATACATTCGTT 
TTCCCCATTCTCTGCTCGAAAAAATATAGTTTTGAAATTTGAAAGCTGGAATTGTTAGCCCATCAAAATGCTAAATTACTTAGTTCTTTCTGTAC 
CTGGTAATTAAATAAACTTTGTGATTCGTATTTGATTTTACATATACTACGTTTGCTTTTATTTCAACTCAGATCAGAGGATAGCTTGAACATAA 
AATGGCCTACAAGCAATAGTATTTTAAGTCGTAATTGTAACTACTAAAAGATTAGGGGATTACATTTAGTACATGTCTTATACGGATTTTATTTG 
AGTAGAAATTAGTCAAGGCAACACGGAACTATCGTAAATTTCAACTATTCTAGAAAATTGAAATATTTAAGCATAGTAACGGTGCTCTCGAAACT 
AAACAAAAGGGCTTTAACATTGTAAAATAGAAAAATATAGCATGGAAACTTTCGTGTGTATAGACGTATTATAAAACCAGCATAAGACATGCCTA 
TTGTGGATGTGTTATCTTATACTGCCTTATAAGGAAGTAGTATGATGCGAAGCGCTCCCCCTTCTTCATCGTCCCTCAGAGCACTAAGTCCAAAT 
TAATCTTCCTCATCCTGTCCCAATCCCAGTCGCTGCCACTAGACGCTTTGTCACAGAGTGTCGTTGAACAAGCGCTTGGCCTTGTAGCTCGGATG 
AATCGGTCCCGGTATCCAGTGTGGCTTAAGGGCACTCTTACCGGGCAACCGATTCGCGTCGTAGGGCACCTGCGGAGACAGCTTCAGCTGGCGGT 
AGTAATCGCTCTTGGGGTAGATGCAGTTGCTCTTAGTCACGATCCGCACATGCTTGCCACTGCCCGTTTGTGGAGTCGCAAACTCTGCGCGCGCA 
GGCTGTTTGGGTGTGGACTGCGGCAGCTTACTGTTGGCCTGCTTCACTTGCAGTTTTCGAGAGGGAACAAAGATGTCCTGCTCACCGTCCTCTAA 
AGGGGCATCCCATCCGTTGGTCGGAGCATCATCATCAACACCGGCAGCCTTCGTTTTGGGAGTCGGCTTGGCCTTGGACTGTCCCTCCACCTTAG 
GTGTGGGCTTGGGCTGCTCGTCTGTCTTCGGCTTCGACTGCTCCTCCTTCGTCTTGGAGTGGTCCAATTTCTTGCGCTTCGCAGGCTCTTCTTTT 
TCTGCCTTCAAGGTCGGTTGATCCTCTATCTTGGGCTTGGACTGGTCATTCTTCGGCTTAAGCTGGCTTTTCTTCTTCACCTTCGTCTGGTCATC 
TTTGTTCTGCTGTGGCGGTTCTTCCTTTTTTGCTTTCTTCGACTTGGGCTCGGCCTCCTTGACCTCCTCTACTACCCATGCGTTGTTTATGCTAG 
AACGCACCTTTGCATTGTAATTCACCTTCTTCTGCAGCTCAGGAGGCAGAGCCTTGCTAATAACCTCATCGTAGTTGTGCTCGTCCACCTCCTCA 
AAGTTAATGGACTGCAGTAGGCGTTTGCGTTTGGATTTGTTGAGCTTCTTCAGTTTTCTTCCGGTTGCAAAGACCTCGTCCTCCATCTTATCTAA 
GGCAGCCACTTTTTGCTCCACCATTTCGGACAGAGTTTGTCCCTCCACCTTTGGCATTGTCCTCACGCCAAGTGGAAACTCTCCTCTCTGGTAAG 
TCTCAAAGATCTGGATAAGTTTGCGCAATCCCTTGCGTCTCTTGCTGTTAAAATCGTTGGTGCGTAGCTGCGTCTGCAGCTCATCTAGTACGCAA 
TCAGCGTTGAGCGGCAGCTCCGGCATGTGAACGTCCACGTTGCCCGCTCGGGGATCCAAGGATGTGGCCCGCGGCTGCTCCTCCTCGGCGTTGAC 
CTCGTCATCCTCCTCGTCAAATCCAGAATCTAGTTTCTCAATGTCATCAATGGAGGCGGTGGGGAAACCCATCTGCTTCCAGGCATTGTACTTTT 
CGCTGTACTGTCGTCCTAAGTCGCTCTGGTACATCAGATGATAGAGTACCCTTGTGCGGCACTGTGCCACTAGCTTAGCATCGCGTTGTGTAGCA 
ATGTAGGTGACAAACGGACGCAAGAACATATTAACCTGAGCAGCAGTGATCTCACCATTGGCCGCCTTGGCCAGCTCCTCAAAGAAAACATCTAG 
GTAGTGCATGGTCATGCCGCGAGACTTAGGCTGCTCCGACATCACAGAAAGCTGCATGCTGCTGTTGAATGCGGCTATCAGGTCCGCATTCCAGT 
TGTTCTGTTTGAGGAATCTCAGTAGATAACGCACCATGCGGCGGGTGAGCATCAGGAATTTGTCCATGCGCCACTGATCTATGCCAAAGTACTCC 
TGGCACATTGTGCGCATAAACGCACTAAAGTAGGCCAGACTGCAGGCCGTATTACCGCCGAAGCTGTCCACCATTTGAGCCAACTTCTCGGCCAG 
CTCCTCCTGCACCAGCGGCTTATCCGACATCCACATGGTGTAGTACAATCCCTTCCAGATGCGCATGAAGTCGTCCTCGTTGAAGGGAAAGGAGC 
TACCGGCACGCGCCTTGAACCACTTGCGCAAAATGCGGATCTGGCGATTACGCTCCACCACATCGTTGCAGGCAAGAGCACGTATAATTTTCACC 
TCCTGGGCGACCACCATCAGCTCCTTGGCCACCTTCGGCTGTGAAGCGTCCTCCTCCTGCTCGACCTGGATTTCGGCCGCCTCGGCGTTGCGCTT 
AACTGGCTTCTTGCGCGTCACCATTATCCGGCTAATTCAACTTTTAGTTGCGAAAAATAAGTGAGCTGTGTATATTGAATACGGTACGGCACGAC 
CGCACGTGTTGCCGTTCGATGTTGCCAGATGTTGAAACGACATTTTCTCCAAATTTCGTCCAACAGCTCTAAATGCATGAGCTGTACTGAAAATG 
AC AAC AAAAAC T C AAGC T AAT G AAAT AAT T T T C AAA G AAT AAAAT AAAG AAAAAAAG AAT AAT AAAAAG AAT AT T T AAAT T GC T AG T T T T G A T AC 
TTCATGAAACGCTTGCTTTATACATTTTAATACAATTTTATATATTTTTAGCCATTTATAGGTTGTTATTTTATAGCTACCAGCGCTGACTTCTT 
TACGGAAAGTGGCAACTCTCTTCCACAGCTGATAATTCGGTCTGCTTTTTGTAAATTTTGCTTGTCCAGCAAATTTAGTTAAATAATTCAAAGAA 
ATCATGCCCGTGGTGAACGTGTCGGCTGCCGAGGAGTACCAGAAGTACATTAATGCGGATAAGACGACCGTAGCTCTATTCGCCGCCGAATGGGC 
AGAGCAATGCGGTCAGGTGAAAGACGCGCTGGAGGAGCTGGCCAAGATTACTGGCGAAAAACTGCAGTTCATCAGCCTAAACGCTGAACAATTTC 
CCGAGATTTCCATGAAACATCAGGTACGCTCGTATTTATTTTATGAAAATATAACAGTAGTTCCTCATTCATTTGCGATGCCACCAGATCGAGGC 
CGTGCCCACAGTCATATTCTTCGCCAAGGGCTCCGCCGTTGACCGTGTCGATGGTGTAGACATCGCCGCCATAAGCGCCAAATCCAAAAAGTTGG 
CCGAAAACGCAAGCAGCGCGGCGGCAACAGGACAAACGTTGGAGGAACGCCTAAAGGCCCTAATCAATACAGCTCCGCTGATGATATTCATGAAG 
GGCGACCGAAATGGACCGCGTTGCGGATTCTCCAAGCAGCTCATCGGCATTGTGAACGAAACCAAGTATGCAAA 
(SEQ ID NO: 1024) 

Exon: 3064.. 1001 

Start ATG: 30 64 (Reverse strand: CAT) 
Transcript No. : CT26519 

ATGGTGACGCGCAAGAAGCCAGTTAAGCGCAACGCCGAGGCGGCCGAAATCCAGGTCGAGCAGGAGGAGGACGCTTCACAGCCGAAGGTGGCCAA 
GGAGCTGATGGTGGTCGCCCAGGAGGTGAAAATTATACGTGCTCTTGCCTGCAACGATGTGGTGGAGCGTAATCGCCAGATCCGCATTTTGCGCA 
AGTGGTTCAAGGCGCGTGCCGGTAGCTCCTTTCCCTTCAACGAGGACGACTTCATGCGCATCTGGAAGGGATTGTACTACACCATGTGGATGTCG 
GATAAGCCGCTGGTGCAGGAGGAGCTGGCCGAGAAGTTGGCTCAAATGGTGGACAGCTTCGGCGGTAATACGGCCTGCAGTCTGGCCTACTTTAG 
TGCGTTTATGCGCACAATGTGCCAGGAGTACTTTGGCATAGATCAGTGGCGCATGGACAAATTCCTGATGCTCACCCGCCGCATGGTGCGTTATC 
TACTGAGATTCCTCAAACAGAACAACTGGAATGCGGACCTGATAGCCGCATTCAACAGCAGCATGCAGCTTTCTGTGATGTCGGAGCAGCCTAAG 
TCTCGCGGCATGACCATGCACTACCTAGATGTTTTCTTTGAGGAGCTGGCCAAGGCGGCCAATGGTGAGATCACTGCTGCTCAGGTTAATATGTT 
CTTGCGTCCGTTTGTCACCTACATTGCTACACAACGCGATGCTAAGCTAGTGGCACAGTGCCGCACAAGGGTACTCTATCATCTGATGTACCAGA 
GCGACTTAGGACGACAGTACAGCGAAAAGTACAATGCCTGGAAGCAGATGGGTTTCCCCACCGCCTCCATTGATGACATTGAGAAACTAGATTCT 
GGATTTGACGAGGAGGATGACGAGGTCAACGCCGAGGAGGAGCAGCCGCGGGCCACATCCTTGGATCCCCGAGCGGGCAACGTGGACGTTCACAT 
GCCGGAGCTGCCGCTCAACGCTGATTGCGTACTAGATGAGCTGCAGACGCAGCTACGCACCAACGATTTTAACAGCAAGAGACGCAAGGGATTGC 
GCAAACTTATCCAGATCTTTGAGACTTACCAGAGAGGAGAGTTTCCACTTGGCGTGAGGACAATGCCAAAGGTGGAGGGACAAACTCTGTCCGAA 
ATGGTGGAGCAAAAAGTGGCTGCCTTAGATAAGATGGAGGACGAGGTCTTTGCAACCGGAAGAAAACTGAAGAAGCTCAACAAATCCAAACGCAA 
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ACGCCTACTGCAGTCCATTAACTTTGAGGAGGTGGACGAGCACAACTACGATGAGGTTATTAGCAAGGCTCTGCCTCCTGAGCTGCAGAAGAAGG 

TGAATTACAATGCAAAGGTGCGTTCTAGCATAAACAACGCATGGGTAGTAGAGGAGGTCAAGGAGGCCGAGCCCAAGTCGAAGAAAGCAAAAAAG 

G AAG AAC C GC C AC AGC AG AAC AAAG ATG AC C AG AC G AAGG T G AAGAAGAAAAGC C AG CT T AAGC C G AAG AAT G AC C AGT C C AAGC C C A AG AT AGA 

GGATCAACCGACCTTGAAGGCAGAAAAAGAAGAGCCTGCGAAGCGCAAGAAATTGGACCACTCCAAGACGAAGGAGGAGCAGTCGAAGCCGAAGA 

CAGACGAGCAGCCCAAGCCCACACCTAAGGTGGAGGGACAGTCCAAGGCCAAGCCGACTCCCAAAACGAAGGCTGCCGGTGTTGATGATGATGCT 

CCGACCAACGGATGGGATGCCCCTTTAGAGGACGGTGAGCAGGACATCTTTGTTCCCTCTCGAAAACTGCAAGTGAAGCAGGCCAACAGTAAGCT 

GCCGCAGTCCACACCCAAACAGCCTGCGCGCGCAGAGTTTGCGACTCCACAAACGGGCAGTGGCAAGCATGTGCGGATCGTGACTAAGAGCAACT 

GCATCTACCCCAAGAGCGATTACTACCGCCAGCTGAAGCTGTCTCCGCAGGTGCCCTACGACGCGAATCGGTTGCCCGGTAAGAGTGCCCTTAAG 

CCACACTGGATACCGGGACCGATTCATCCGAGCTACAAGGCCAAGCGCTTGTTCAACGACACTCTGTGA 

(SEQ ID NO: 1025) 

Start ATG: 1 (Reverse strand: CAT) 

MVTRKKPVKRNAEAAEIQVEQEEDASQPKVAKELM^ 
DKPLVQEELAEKLAQMVDSFGGNTACSLAYFSAFMRT^ 

SRGMTMHYLDVFFEELAKAAKGEITAAQVHMFLRPFVTYIATQRDAKLVAQCRTRVLYHLMYQSDLGRQYSEKYNAWKQMGFPTASIDDIEKLDS 
GFDEEDDEVNAEEEQPRATSLDPRAGNVDVHMPELPLNADCVLDELQTQLRTNDFNSKRRKGLRKLIQIFETYQRGEFPLGVRTMPKVEGQTLSE 

MVEQKVAALDKMEDEVFATGRKLKKLNKSKRKRLLQSINFEEVDEH^^ 

EEPPQQNKDDQTKVKKKSQLKPKNDQSKPKIEDQPTLKAEKEEPAKRKKLDHSKTKEEQSKPKTDEQPKPTPKVEGQSKAKPTPKTKAAGVDDDA 
PTNGWDAPLEDGEQDIFVPSRKLQVKQANSKLPQSTPKQPARAEFATPQTGSGKHVRIVTKSNCIYPKSDYYRQLKLSPQVPYDANRLPGKSALK 

PHWIPGPIHPSYKAKRLFNDTL* 
(SEQ ID NO: 1026) 

Classification: hypothetical 



Celera Sequence No. : 142000013384832 wm ^ mwfnw 
CGTTGCCAAGCCCCAAAAAGGGGGCTCACCAGAACGAGCACAGTAAAAGAGCAAGCTGAATATATTTTCTAAGTAAAATTTATTTATTATGTATA 

aataattctcttttagctctcgcacaagcgctattaacgttagtaatttacagttcgctctgcagtcctcgttgccgaaccagggactcacagtc 

GGGTATCCAGGTCACCTTTGGTGGGATTTAGGAGATCCCCCCTGCTGTAACCACTGGGTGTCCCTGGTCGGCTGCGAAGTGGAAATCAAACGAAA 

CATAACGGCACAAATAGTTGGCTTAAACAATTACAATAACAATTGTGGAATATATAAATTTGCATACAGGCTAAAAGTGAAGTGGTC 

ACAAGCATAGCTCCCACTTCACTTACGTTCGTATTCGTAATTTAGTTAGAAATTAGATTTCTTGTACTTGTTGAACATCGTTTGCTTTAGTGTTG 

TTTTGCAGTTCCCATAGCTATCTATCCAATATATAAATACGAAAATCAATTGTAAAGAAGAGGGATATTCGGGCCGGTGCCACCGCTGCGCCATG 

GTGGTTATTTGTTGACTTGGGCGCAATGGAGCCCCAACGGGTACAGAAGGAATGATTGGGATAGATGCGACTCGCTTCTGTGATTTTTGACTTTG 

GCTACCCTTAAACCCATAAGAGAGATGCAACGCAGAGGTTACAATTTTTACAATAGAACGTTTGTTAATTTGTTGCTTAGGCGCTGCTTTATCCA 

CTACTATATCTACCAGTTATTTATCATCCTCAGTTATACATTTATACAATCCTATTGCACACATACGATTCTCATATGTCTTGCTTAGCACCCCC 

TCAAGATACATATACATAATCATATACATACATAGATATAGATATTTTGATACAGTTATAGTTAAAGTTAAAAAAATACCTCACACTTGATAATA 

ATGAGTGTAAAGACCCGCCCGCGACTATGGTATGAAGTTAACCCCGACCAGCTCGAAAGAAAGTGATAGTGTAGGAGGCATAGGAAATAGAGGGT 

CGAGAAGGCCTTGAGTTGGCGGTCCTAGACGGGCCCAATATCACTTTGGTATCTGGCGGTCAGGTAGTTGGCCGCCTCTGTCATTTGTTCCAGGC 

ACTGGTACAGACCGTTAAGATGCCTCCGGCAGCGCTTGGCCAGCTCCGTGGCCTTGTCCCCCGAGCCCGGCTTCGATTTGGCGACGTCTACAATG 

TCGCTGCCGTCGCTGCTTTGGGCGTTGTCGCTCTTGTTGCCATCGGACTCTAGCCCCGAGTCGTTTTCCTGCGATATGTCCGAGTCAGAGTCACT 

CATGTTGCTCACCGAGTTGGTGGACATCATGGATGCGGTAGAGGTACGGCGCACATGACCCTTGACACTTCCGGATGCGGTTGTGCTTGGACTGC 

CGCTGTTGAATTCAGTCTTGCTCTCGCTGGGATTGCAGCTCGAAGTGGAACTGGTACTGGCGTTGAGCTGCTGTTCGGAGTGCTGCTGGTCGTCG 

TCCTCCTGGTTGGCGGTCCAAAGCAGATCTTTTTTCAGGGAGTTCAGCATGTTGTACAGATTGAACAGCTCGGAGGCGCTGAGAAAGTCTTGGAC 

ATTTTTTGAATGCGCCGCTCCCCGCTTGCCGTGGGCACTGGAGGTCTGTGTCAGCTGGTTGGCGGTCGAGTCCTTGCCGGTTGCTGGGATGATGT 

CAGTGCTGTCGCCGATCTGGGGATAAATTATGTAAAGTGGGAGCCATAAGTTGTTGGTTCCGGATCCGGATGGCTGGTCAAATGGGGGGCTGGGG 

TTCTTGGGTTATGCGGTACCAACCTGCCGATCCATGAGCCTGCATGGCACCAGTATGGTGTCGTCCATAATGTTGACAGTCTTTACAAACTTTTC 

CATCACATTGACAATGCTGTCTTTGGAGAACTGCTGCTCATCGTGACGGGCGATGCGGCGCAGGCAGTTTCTGAAAAACAGAACAATCGAAAGAT 

GAGCTTGGGCCAGAGCACAATAAACCAGTAGAGCGTGTGCAGGGGAACCAGTTTGGATTGGACAATAGTCTGCGTGATTAGTGACTATGCCACAT 

CACGCCAGACTGGACCAAGCGCTGGCCGTTCTAAAGCGTCGTTAAGTCTGCGGCGATCAGACTTCCGCCACTCGAAGCCATCTTTCTTTGATGCT 

TTCGACTGGAGATGGGTGCGCCTCCTGCTGACCAACGTGGCATATGATATATATGAATTGGTTTCTATGCAAATCAGCGACTCGCTGGCGTTGTC 

GCAAATTCAATTTGTATAAAATAGAAAACCAGTTGGTGCTTTATACCTGACATTGCTCGTGTGGCTCTTAAAACAAAGCACAGACGGACGGACAG 

AAAGATTTCTATTTTGGTTTCCAAAGCCAATTTAACGCGTCCACGTGCTGTCGGTGGCGGGGTGCGAGGGTAAAGCCCGACTTTGCTCGCAAGCT 

TCAATGTAATGGTTATACAATCCTGGCTGGTCGCATTAACTTTAATTTCACGAAAAGATCGCGTAACATCTCCAGGTCAAGCCCCATAAACTTGC 

CTGAGCTCCCTCCATTAACCCGCCGGGAATATATGTTAGTTGCTGATATCTGGAGCTGGAGCGGCCAAAGTCTAGAGGCGGCTTGGCTTATCGTC 

GCTAGACAGTTCCCAGACAGAGCTGAAGTTGCGTATCTCCCGCACCTCCTATTTAATGCGGCCAGATTGCTCGGGGCGGATCGGCATGTGGGCAG 

CCAGTTCGGGGCCGCGTGATCGCAGCTAGTTGACAGAGCGAGCGCCGAGCGCACTGATAAATTTAGTTTCAGTGACTGCGCCATTTAAAGACCTC 

TACCCTACAGTTCTATCCCTGGCCAGCGCCCGCCCCGATGCCATGGCAATCCAACTTAGAAAATCTAGATATCGATGACTGAAGTGTAACGCGCG 

GCAGGCCGGGGTGGGGTGGCCACGCGAGTACTGCGTGCTCCGAACTCTGCTGACCGACATCGTCTCACTGTGGTCTAGAACCACAATAAGATCTA 

GATCTAGAGAAGGCTGGCGCGGGTGTCAATGTTCTTTCCACTGGCACGGCCAGATGTCGTGGACGTCGACGGAGGATTCCCAGCCACTGAACAGA 

GTGACTAATTAAGCTAATTGCAGCGGGTTCCAATCATGGAAATCCGCGTAGGAATGCGAAAACAATTACCATCTTATCAACGCTTTTCGCTTAGA 

CACTCGACCATAATTGACTGCCATCTCGTGGAAACCTGAGTAATCGCCTCTATTTATTTTAAATTATAGATATTTTCTAGCCCTGACCACGACCC 

TTGCTTCGCTTAGTCACTTAAGCTGGCCACACGATAGTTCAACTTATCACTGCATTTTGAATCCTTTCCTTGCGAATCCTCTGAACAGATCCCGA 

AAGCCCAAAAACGCCGGCTTGGGCAACACGCCAGATGCTACCAGCCAATTGACCAGGCAATAGGTACACCTGAAGTGTTATGGAAATTTGGCTAA 

GCCCCACAAGCGTCACGCGACATGCTCGCCAAGAAGGGCGGGGTAAGCAGCGGGAGAGGGAGGGTACCGCATACAACGCCGAAATCAGGCGACTT 

TTTCGCTCAGCGGTAGCTAAGGGCGGTGAAGGAGTGCTGAAGGGGTGGTGGTTTGGTGGGTGGCTGATGACGAGCTGGCTTTATGTGGCAAGCGA 

ATCAAACGCTTCTCTTCGCACTCTCTCTTCGGTCCGTTTCGCCGTTTTCTCTTTGCCGCCGAGTGGCGTGATGTTGCCCAGAAACTGTACAGCGG 

CCAGAGTCGCCTCCATCAGCTGAGCAAGATGAATGAATCTCTAGCGATGCTCTGCCTTCCCCTCACCTTCCTACCATCGCTCATGCAACCATGTT 

TAGTGGTATTTCTCGTATTTGCAATGTTGAGCAATGGCATATACATAGCCATACGAACAGAGTTGACCAATATTTGGTTTACTTTGGCTTTGGCT 

GCGCCTCGACCTCCAGCATAATCGATTTTTGATACGCCCGCAAATTAGATATTCTACCTACTCGGTGGCCGGGTTGGCCGGATTGGCTCGGCCAT 

CGGGCATAGAGCATTGACGACCGTAAGTTGGCGCTCAAGGCGACACCTGCCTCAAAAAGTGACACACTGCCAGCGCAACGAGGTGAATCAGAGAA 
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TTTTCTGCGATTTAAGCGCCCCAAGGAGGCGGGCAGCACCAGAAGCGATGAGTAATGGCTATGGCCGATAGCTGGGCTGGCGGACTGGTGTCAGT 
GGACCGGAGGTGCGCACAGATAAGGCATCTTATCGCTTATCCCAAAGCTGTGATTTGCCTCCGCTGGCTCACTCGAAACATTTGGGCAAACAAAC 
GGCAAAAAAAACACCAACATACTTCTGTATAGGTGTACAATATATAGCCGATTGTGTCATATTAGCGCTGGCTTGATTTTTTTATTTTATTTTAT 

TTTTTTTGTATTTGTTGTTG 
(SEQ ID NO: 1027) 

Exon: 3485. .3376 
Exon: 1970.. 1829 
Exon: 1726.. 1001 

Start ATG: 3380 (Reverse strand: CAT) 
IcCTATTGOT 

AGGATTCAAAATGCAAAACTGCCTGCGCCGCATCGCCCGTCACGATGAGCAGCAGTTCTCCAAAGACAGCATTGTCAATGTGATGGAAAAGTTTG 
TAAAGACTGTCAACATTATGGACGACACCATACTGGTGCCATGCAGGCTCATGGATCGGCAGATCGGCGACAGCACTGACATCATCCCAGCAACC 
GGCAAGGACTCGACCGCCAACCAGCTGACACAGACCTCCAGTGCCCACGGCAAGCGGGGAGCGGCGCATTCAAAAAATGTCCAAGACTTTCTCAG 
CGCCTCCGAGCTGTTCAATCTGTACAACATGCTGAACTCCCTGAAAAAAGATCTGCTTTGGACCGCCAACCAGGAGGACGACGACCAGCAGCACT 
CCGAACAGCAGCTCAACGCCAGTACCAGTTCCACTTCGAGCTGCAATCCCAGCGAGAGCAAGACTGAATTCAACAGCGGCAGTCCAAGCACAACC 
GCATCCGGAAGTGTCAAGGGTCATGTGCGCCGTACCTCTACCGCATCCATGATGTCCACCAACTCGGTGAGCAACATGAGTGACTCTGACTCGGA 
CATATCGCAGGAAAACGACTCGGGGCTAGAGTCCGATGGCAACAAGAGCGACAACGCCCAAAGCAGCGACGGCAGCGACATTGTAGACGTCGCCA 
AATCGAAGCCGGGCTCGGGGGACAAGGCCACGGAGCTGGCCAAGCGCTGCCGGAGGCATCTTAACGGTCTGTACCAGTGCCTGGAACAAATGACA 
GAGGCGGCCAACTACCTGACCGCCAGATACCAAAGTGATATTGGGCCCGTCTAGGACCGCCAACTCAAGGCCTTCTCGACCCTCTATTTCCTATG 

CCTCCTACACTATCACTTTCTTTCGAGC 
(SEQ ID NO: 1028) 

Start ATG: 106 (Reverse strand: CAT) 

MQNCLRRIARHDEQQFSKDSIVNVMEKFVKTVNIMDDTILVPCRLMDRQIGDSTDIIPATGKDSTANQLTQTSSAHGKRGAAHSKNVQDFLSASE 
LFNLYNMLNSLKKDLLWTANQEDDDQQHSEQQLNASTSSTSSCNPSESKTEFNSGSPSTTASGSVKGHVRRTSTASMMSTNSVSNMSDSDSDISQ 
ENDSGLESDGNKSDNAQSSDGSDIVDVAKSKPGSGDKATELAKRCRRHLNGLYQCLEQMTEAANYLTARYQSDIGPV* 

(SEQ ID NO: 1029) 



Celera Sequence No. : 142000013384828 ^„„„ mm 
TCGTCCGGATGCTTGTCCTTGAAGTGCTCGCGGATGCCGTCGAAGTCCTCGGCCAAGAATATGCACAGCTCGCAGCCCATGACTATGTAGCGCTT 

GATCAGTTCCTCGTCCTCTTTGCAAAACGGCGGCAGGCGTTCCATCTTGAATATCCTCTTGGGCGGTCCCCGCTTGACCACCATCTTCTCCTTCC 

CCTTTCTGCCCACCGACACCGTTCCTCTTTTCGTTTTAGCCGGCTTGGCCACGTCCCGTTTACGTTTGCGTCCACTCCGGGTCACGATCAGGTCG 

TCGGTGTCATCATCCTCGTCGTCCTCGTAGTCCTCATCCTGTTCATTGTTGGCATCCTCAAAGAGGAGTTCCATGTCGGGTGACTGTCGCTCTGT 

TTTGATGCCCACCTTGCTGTCTTCCGGGGACATGGGTTGGCTAAGGTCTAACTCGTCCAGGGGATTCACATTTATCTGCGCACCGACATCGTTGT 

CGACGGCAAGGACGTCGGCCGGCATGAGCACCTCCTCCGGCCAGGAGGTGTTCACCATCTCCGGATCCTGCTTGAACTTGGAGGTGGTGGCGTAT 

ATCACCTGGGCCTCCTGTATGGACACGTAGAACTGATGGAACTCGGACACCTGCGTCCAGCAGACGTTGCAAATAACCTTGGATATCTCGTCGTT 

GGGCAGTACCTGTACGCGTTCCGGAGACGAGATGGTGGTAAAAAAAGATTCCCAGGCACAAAAGTGGGTGGAGGGCAGCAGCTTACCTCGAACCA 

GAAATGTTGGCGCAGCACCTCCGCCACCTTGCTCTCGCCCGAGTCCACATCGAAGATCTGTAGACACATCTGGGCGCCGCTCACTCCGCGTAAAC 

AGAGGCGGCAGATATCCATTTCGCATTTATTTGATAAAACTATAATAAAAAGTAGTCATTTACATAGCTCTGTCGATAACTCCCATCGCAACTAT 

CGGCTGCATCGTGATTTGTTCGCTGTTCATTCTGTGCACTGGTTGTACGACGTCGGTCACACTGGCAAATTGTTGGTTTTCCTATTTTCGTATTT 

GCTGATAATGTGCATATTGTGTGGAAAAGATGAGTAAACAAGGGAATAAAAGCAAGGATTCGGCTGCCGCATCCGCCACTTTAAAAGTAAAGGTG 

TCCTGTGTAGCCGGAAAAACGAGCCTAACATTTTTCGGTCTTGTTTTCAGCGGATAAGCGAAAAGAAGTTGGAGGCCTTCACGGTGGGCACCTTC 

TCCAAACGACAATTGTCCAAAAAAGAGCTCGAGGACCAGAAAAAAAAGGAGGATGCAGCTGCTGCGGCACATGTAGGCGCCTCTAGCAAATCCCT 

CCTCCACGCATTATAATCCAATACGATTTTTAGGCCTTCAAAGAGTTCGTAGAGACTTTCCAAGATGCACCCACCCCCTCCTCGAAAGTTTGGGT 

CAAAGCCGGCACCTATGATGCAGGATCCCGGCGGGAGGACAAGTCGGAGAAGGGCAAGCTATACAAGCCGGTCTCCAAGCTGATGGAAAAGAGCT 

CCTCGGACAAGGTCGAGGATTACGCCAAGACCCTGGCCTCGGATCTCAAAAAGGACTCGGGTCCATTGAAGAAGAAAAACCAGGAGAAGAAAAAG 

AGCAACTTGGAGCTGTTCAAGGAGGAATTGAGACAGTAGGCCCACTTGGTTGTGCATGTGTGTATTCCATACTAATTGTTTCATGCCCACAGGAT 

TCAAGAGGAGCGTGAGGAGCGCCACAAGTACAAGCACATGGCCTCAAGCCATTCCGCACCAGCACAGCAACCGGCTGCATCAGCTGCTCCTGTGC 

CCAGCAGCAGTGTCTCCACCACCTCCCAGAACTCAAGCAGCTCAAAGGAGTCTGGTTCGTTTGACACGGGCGATCCAAATACCACAAACCTGTAC 

CTGGGCAACCTGAACCCCAAGATATCCGAGCAGCAGCTTATGGAAATCTTTGGGCGATACGGTCCCTTGGCCAGTATCAAGATCATGTGGCCCCG 

CTCCGAAGAGGAGAAGCAAAGGGGACGCAATTGCGGCTTTGTGGCCTACATGAGCCGCAAGGATGCGGAAAGGGCCCTAAAGACACTGAACGGGC 

GCTACATCATGGGCTACGAGATGCGTCTAGGATGGGGTAATCTATAGATCTTATATTGAAGTTCTTATTCCTCATACATCTTACTTAATATTTCA 

GGAAAGACTGTGCCCATAATGAATACGCCCATTTTTACACCGCAAGCATTGCTTGAGATGACCCTGCCTCCGCCTCCTTCAGGATTGCCCTTCAA 

T G C C C AAC C T C C AC C C AGT G AAG C T GAT G T GCT GCC C AAG AAAAAC T AC AAG GAG T T C AAC C AAG AGG AC AAAG AG AAT AT GG AGC GGGT AAGT G 
AATAATCACCTGCTAGCGTCTAAAGAAATAAAAATGAAAATGTTCAATTTTGTAGATACTTGCCAAATGCGTCGTTAAAGTCCATATACCCACAG 
AGAAGTATGAATTGCACCAAGGACATTCACTTAACAAGAACTCACGTTACTGTGCTTTTCTTTTTTTCCAGGGCTGTTCTTAATGTTATCCATCG 

GAT GAT C G AGT T T GT AAT AC GC G AGGG AC C GAT GTTTGAGGCTTT GAT CAT GAT T C GGG AAATGG AG AAT CCGCTCTTTGCGTTCC T CT T CGAC A 
ACGAGAGTCCTGCCCACATCTACTACCGATGGAAGCTGTTCTCTTTGCTACAGGGCGATACGCCCAATGAATGGCGGGAGGAGGAATTCCGTATG 
TTTAAAAACGGACCGGTTTGGAAGCCCCCGATTGCCAACTTCTACACTCAAGGCATGCCAGATGAGCTGGTTGTTGATCCCGATGCTCCTGTGGT 
GCACAAAGGAGCCTTGTCCAATGCGTACGTTTAGCATAGATAATCCTCGAAACATAAATGTTTACTGTACTGTAAATGTATACTGAAATAGGCAA 
CGCAACCGACTCGAGGATTTGATTAGGCACTTAACTCCTGAACGCGCCCGCATTGGAGATGCCATGATCTTCTGCATTGAGCACGCAGATGCGGC 
TGATGAAATCTGCGAGTGCATCGCCGAGTCTCTGTCAAATATCAATACATTGGCTTCAAAGAAGATTGCTCGACTTTATCTGATCTCAGATATAC 
TGCACAACTGCACTGTTAAAGTTGCGAATGCTTCGTTCTTCCGTAAATCGTATGTATATATGTTTATCAAAGCATACACATTGAAATATCCCCTA 
TTATTTAGTGTGGAAAAACAATTGTTGGACATCTTTGATAACCTGCATAACTACTACTTGAACATCGAGAGTAGATTGAAAGCTGAGGGCTTTAA 
ATCCAGAGTCTGCAATGTTATCCGCACATGGGAGGAATGGACCATTTATCCCAAGGATTTTATGGCCCAGCTGACGGCCAAATTCTTGGGCAAAC 
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CTGTTAGTTAATGAAATTCATTTCAAATTGTGCATGTAGTCTAATGGAATCTTGTGTTTAGTACGTCAAGCCAGTCAATAACTCACCTCAAGCCG 
AGGAAACGCGATTTGATGAGGCTCTGGACGAAGATATCGACGGTGCGCCCCTTTCCGGAGAGGAAAAGGACGACGAGGATCTAGACGGAGTGCCA 

C T AG AT GG AGC T G C T T T G C T T AAG AG C GC C T T G AAGC GAG C CAT AC C T GAT G C AG AT GC T GG TAC T C C AAAGC GT G AC AC AC C T AAG AAAGAT G A 
ATACCTCGATGAAATCGATGGCATACCATGTAAGCATAAGAAAGTGATTTAAACCTTCTTACTACTAAGTAAATAATCCTTAATTCCAGTGGATG 
ATGATCTCGACGGTGTACCGTTGGAAAAAGAAACCAAATCTACGGCTAAGTTGCCTGGCTTCATACCCTCAAAGTGGGAGACAGTTGATCCCCAG 
CAAGTAGAAGCGCAGGCGATTACCACCAGCAAATGGGACACTCTCGATCCGCCAGATCCACCGAAGTTCTTTAGCTCGGACGACGACAGTGGAGA 
AGACAATCCGCAAAAGTACACCGACGAGATGCGTCAGAAGTTAAGAGACATTGAGTCGAAAGCAATTCAGTACCAGGACGAACTGGAGTCCGGAA 
AGCGCCGCCTGGAACCAGGTTGGAGTGTCCCCGAACAAGTGGAACAATTCCGGAAGCGATTACTTAAACAGGTAAGATTTGTAACATACAGTAAG 
TCGGATGTAACAACTGAATTCTTTGCGAACGAACAGGACTCGCGATCTGAGGCAACTGTAGACTCCCCGCTCAGCTACATGCACATAAAATCCAA 
GAGCTCCAAGGTGAGTGAGAGTCCAGCGCTGTTCAGAAGCAGTTCCCGCAAACGATCCCACTCGCCGTTCTCTGGTGGCGAATCCAAACGACGTG 
ACTATTCACCCGAAACCTCGCGCTCATCCAAATCCTCGAAACGCAATCGGTCGCGTAGTCCGTCCAGTTCACCGAAGCGGTATACAGAGCGGTCG 
TCTCGCAGCTCCCGCTACGAAACTCCTGCCTCCGGATCGTCGCGATCACGCAGTTCTCCAACCTCCTCGCGCTCCTCTCGCAGAGAAGAGCCATC 
GCCGCCGAGACATCGCAGCGACAAGCACAAGCACAAACATAGGCACTAGAGTGTCTGCTCTTCGGACGATTTGTATTTAATTAAACATGGATTCT 
ACATAAAAACACTTTCGTCGTTTGTTTTAAATACACAGTATATATAATTTTAAATCTAAATCGTTTGTTAGGAAGTGGGACACACGCAAAACAGA 
CGGCCCAGGCAGTTCGATTCCGCTCTGCGAATTTGTCTAAAAACATGAATCTAAGCAAGCTATCTTAAAATGGTTACGTTTTGTTAAGGAAAGCC 
GGCCAACCAACACCAGTCCTCGAACCGCCTGAGCCTGGTCATATAAGCAGTGGTGATTATCATAAGGTGAATACATAGATGTGGCTTAAAAGTAG 
ATGCGAACAGCCTCCTCGGCATTCTGCTCCACGGGGCAGTAGGGGCACTTCAGTATGTGGCCAACACTCAGCTTGTGCAGGGCGTCGTTGGATAT 
GACATGACCGCAGGTCAACTTCTTCGGCGGATTGTCCTCGGAAGTCTGCTGGCGCAATATGGGGCAGGCAAAGATCGAGTGGAATCGGAATTCCG 
GCTGCAGATCGATCTCAATGGGCAACTCGTCGCAGCCGTTCCACATGCCAAGGACTTGGCGCGACTGCATCACCTGCTTGATATTGAGCAAGGCG 
GGCAGGGCAGTGCATCCGGCATTGACCACAACAGACAGGGCGGAGTTCTTGCTAATGCCTAGCAGTTGGCATGCGTCTTTGAGGAATATGAACGA 
CAGTTCGGTCCACTTCTCCTGGCCCAAGAAGTGCTTGTAAGGCGAGTTCTCCAGTCCACTGGGCAGATATATAAAGCTGGCCATCAAGTTGGCTA 
TTTCGTGCTCATAGCGCACAGCAAACTTCTTGAAGTTCAGTCGAGCGTAGGAGATGGCCTCGCGCTGCGAGTCCAATCCGTAGGAGACCAGCTGC 
ATGAAACGCATGCGGTGCAGCCGGAACTCTATGAGCGAGTGGCGGTCACTCAGCTGCTGGGAGTACATCTTTGCCCACTTGAGGGCGTCAGTCAG 

ATCC 

(SEQ ID NO: 1030) 

Exon: 1001.. 1131 
Exon: 1191..1307 
Exon: 1364.. 1650 
Exon: 1708.. 2126 
Exon: 2187.. 2368 
Exon: 2431.. 2474 
Exon: 2542.-2874 
Exon: 2942.. 3184 
Exon: 3239. .3422 
Exon: 3482.. 3734 
Exon: 3795.. 4156 
Exon: 4217.. 4609 
Start ATG: 1075 

Transcript No. : CT26551 ^^-n,,™** 
CGTCGGTCACACTGGCAAATTGTTGGTTTTCCTATTTTCGTATTTGCTGATAATGTGCATATTGTGTGGAAAAGATGAGTAAACAAGGGAATAAA 

AGCAAGGATTCGGCTGCCGCATCCGCCACTTTAAAACGGATAAGCGAAAAGAAGTTGGAGGCCTTCACGGTGGGCACCTTCTCCAAACGACAATT 

GTCCAAAAAAGAGCTCGAGGACCAGAAAAAAAAGGAGGATGCAGCTGCTGCGGCACATGCCTTCAAAGAGTTCGTAGAGACTTTCCAAGATGCAC 

CCACCCCCTCCTCGAAAGTTTGGGTCAAAGCCGGCACCTATGATGCAGGATCCCGGCGGGAGGACAAGTCGGAGAAGGGCAAGCTATACAAGCCG 

GTCTCCAAGCTGATGGAAAAGAGCTCCTCGGACAAGGTCGAGGATTACGCCAAGACCCTGGCCTCGGATCTCAAAAAGGACTCGGGTCCATTGAA 

GAAGAAAAACCAGGAGAAGAAAAAGAGCAACTTGGAGCTGTTCAAGGAGGAATTGAGACAGATTCAAGAGGAGCGTGAGGAGCGCCACAAGTACA 

AGCACATGGCCTCAAGCCATTCCGCACCAGCACAGCAACCGGCTGCATCAGCTGCTCCTGTGCCCAGCAGCAGTGTCTCCACCACCTCCCAGAAC 

TCAAGCAGCTCAAAGGAGTCTGGTTCGTTTGACACGGGCGATCCAAATACCACAAACCTGTACCTGGGCAACCTGAACCCCAAGATATCCGAGCA 

GCAGCTTATGGAAATCTTTGGGCGATACGGTCCCTTGGCCAGTATCAAGATCATGTGGCCCCGCTCCGAAGAGGAGAAGCAAAGGGGACGCAATT 

GCGGCTTTGTGGCCTACATGAGCCGCAAGGATGCGGAAAGGGCCCTAAAGACACTGAACGGGCGCTACATCATGGGCTACGAGATGCGTCTAGGA 

TGGGGAAAGACTGTGCCCATAATGAATACGCCCATTTTTACACCGCAAGCATTGCTTGAGATGACCCTGCCTCCGCCTCCTTCAGGATTGCCCTT 

CAATGCCCAACCTCCACCCAGTGAAGCTGATGTGCTGCCCAAGAAAAACTACAAGGAGTTCAACCAAGAGGACAAAGAGAATATGGAGCGGATAC 

TTGCCAAATGCGTCGTTAAAGTCCATATACCCACAGAGAAGGCTGTTCTTAATGTTATCCATCGGATGATCGAGTTTGTAATACGCGAGGGACCG 

ATGTTTGAGGCTTTGATCATGATTCGGGAAATGGAGAATCCGCTCTTTGCGTTCCTCTTCGACAACGAGAGTCCTGCCCACATCTACTACCGATG 

GAAGCTGTTCTCTTTGCTACAGGGCGATACGCCCAATGAATGGCGGGAGGAGGAATTCCGTATGTTTAAAAACGGACCGGTTTGGAAGCCCCCGA 

TTGCCAACTTCTACACTCAAGGCATGCCAGATGAGCTGGTTGTTGATCCCGATGCTCCTGTGGTGCACAAAGGAGCCTTGTCCAATGCGCAACGC 

AACCGACTCGAGGATTTGATTAGGCACTTAACTCCTGAACGCGCCCGCATTGGAGATGCCATGATCTTCTGCATTGAGCACGCAGATGCGGCTGA 

TGAAATCTGCGAGTGCATCGCCGAGTCTCTGTCAAATATCAATACATTGGCTTCAAAGAAGATTGCTCGACTTTATCTGATCTCAGATATACTGC 

ACAACTGCACTGTTAAAGTTGCGAATGCTTCGTTCTTCCGTAAATCTGTGGAAAAACAATTGTTGGACATCTTTGATAACCTGCATAACTACTAC 

TTGAACATCGAGAGTAGATTGAAAGCTGAGGGCTTTAAATCCAGAGTCTGCAATGTTATCCGCACATGGGAGGAATGGACCATTTATCCCAAGGA 

TTTTATGGCCCAGCTGACGGCCAAATTCTTGGGCAAACCTTACGTCAAGCCAGTCAATAACTCACCTCAAGCCGAGGAAACGCGATTTGATGAGG 

CTCTGGACGAAGATATCGACGGTGCGCCCCTTTCCGGAGAGGAAAAGGACGACGAGGATCTAGACGGAGTGCCACTAGATGGAGCTGCTTTGCTT 

AAGAGCGCCTTGAAGCGAGCCATACCTGATGCAGATGCTGGTACTCCAAAGCGTGACACACCTAAGAAAGATGAATACCTCGATGAAATCGATGG 

CATACCATTGGATGATGATCTCGACGGTGTACCGTTGGAAAAAGAAACCAAATCTACGGCTAAGTTGCCTGGCTTCATACCCTCAAAGTGGGAGA 

CAGTTGATCCCCAGCAAGTAGAAGCGCAGGCGATTACCACCAGCAAATGGGACACTCTCGATCCGCCAGATCCACCGAAGTTCTTTAGCTCGGAC 

GACGACAGTGGAGAAGACAATCCGCAAAAGTACACCGACGAGATGCGTCAGAAGTTAAGAGACATTGAGTCGAAAGCAATTCAGTACCAGGACGA 

ACTGGAGTCCGGAAAGCGCCGCCTGGAACCAGGTTGGAGTGTCCCCGAACAAGTGGAACAATTCCGGAAGCGATTACTTAAACAGGACTCGCGAT 

CTGAGGCAACTGTAGACTCCCCGCTCAGCTACATGCACATAAAATCCAAGAGCTCCAAGGTGAGTGAGAGTCCAGCGCTGTTCAGAAGCAGTTCC 

CGCAAACGATCCCACTCGCCGTTCTCTGGTGGCGAATCCAAACGACGTGACT ATT CACCCGAAACCTCGCGCTC ATCC AAATCCTCGAAACGCAA 

TCGGTCGCGTAGTCCGTCCAGTTCACCGAAGCGGTATACAGAGCGGTCGTCTCGCAGCTCCCGCTACGAAACTCCTGCCTCCGGATCGTCGCGAT 
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CACGCAGTTCTCCAACCTCCTCGCGCTCCTCTCGCAGAGAAGAGCCATCGCCGCCGAGACATCGCAGCGACAAGCACAAGCACAAACATAGGCAC 
TAG 

(SEQ ID NO; 1031) 
Start ATG: 75 

MSKQGNKSKDSAAASATLKRISEKKLEAFTVGTFSKR^^ 

KGKLYKPVSKLMEKSSSDKVEDYAKTLASDLKKDSGPLKKKNQEKKKSHLELFKEELRQIQEEREERHKYKHMASSHSAPAQQPAASAAPVPSSS 
VSTTSQNSSSSKESGSFDTGDPNTTNLYLGNLNPKISEQQLMEIFGRYGPLASIKIMWPRSEEEKQRGRNCGFVAYMSRKDAERALKTLNGRyiM 
GYEMRLGWGKTVPIMNTPIFTPQALLEMTLPPPPSGLPFNAQPPPSEADVLPKKNYKEFNQEDKENMERILAKCVVKVHIPTEKAVLNVIHRMIE 
FVIREGPMFEALIMIREMENPLFAFLFDNESPAHIYYRWKLFSLLQGDTPNEWREEEFRMFKNGPVWKPPIANFYTQGMPDELVVDPDAPVVHKG 
ALSNAQRNRLEDLIRHLTPERARIGDAMIFCIEHADAADEICECIAESLSHINTLASKKIARLYLISDILHNCTVKVANASFFRKSVEKQLLDIF 
DNLHNYYLNIESRLKAEGFKSRVCNVIRTWEEWTIYPKDFMAQLTAKFLGKPYVKPVNNSPQAEETRFDEALDEDIDGAPLSGEEKDDEDLDGVP 
LDGAALLKSALKRAIPDADAGTPKRDTPKKDEYLDEIDGIPLDDDLDGVPLEKETKSTAKLPGFIPSKWETVDPQQVEAQAITTSKWDTLDPPDP 
PKFFSSDDDSGEDNPQKYTDEMRQKLRDIESKAIQYQDELESGKRRLEPGWSVPEQVEQFRKRLLKQDSRSEATVDSPLSYMHIKSKSSKVSESP 
ALFRSSSRKRSHSPFSGGESKRRDYSPETSRSSKSSKRNRSRSPSSSPKRYTERSSRSSRYETPASGSSRSRSSPTSSRSSRREEPSPPRHRSDK 
HKHKHRH* 

(SEQ ID WO: 1032) 

Name: RRM-type RNA binding protein 
Classification: RNA_binding 



Celera Sequence No. : 142000013384474 

CCAGCAGAAAGAACGATATCTTCGGCATCAATGACGGACAGCTGGCGGCGGATGATCTTCCATATGCCACTCTGAGCTATGCCAATGGACCGGGC 
TATGATAGTAACTACCTCAGGGAAGGCGGCGCTGTGCGGAGGAAAAATCTGAGAGCCATCAATATGAAGAACAAGGACTTCATGTTCCCCAGCAC 
AGTTCCCTTGGAATCGGAAACGCACGGTGGCGATGATGTGGCCGTTTTTGCTAGTGGACCGTATGCCCAGCTTTTCACCGGAGTTTTCGAGCAGC 
ATTTCATACCTCACGCCCTTGGTTACGCCTCTTGCCTCAGTGATCGTAACATGTGCGTGGATGGGGGCGTGGCACGGAGACCACGCTGAAACTTG 
GCGGGG AT C AAAGGT T AAGT G AAGC AC AAT GAGT C T AAG CG AC AAAC GT AT TAT T C T C GT T T AAGAG AG AAGC C AC T AG AAG AT AAGC AAT CGT T 
TAAGGGGGATACTAAAATAAATAATTGAAATCTAAAAAAAGCATCGATTACTATTTGTGCCAATATTATACGACATTTATTTGCAGATAAGAGTC 
CGTTCCCGTGGTTATCTTATAATGTCGATTCTTTTTTGCACTTTAGAATTTGATAGCAAAATGTCGATTTTACGCGGGGAGGTTATCGATTGACA 
TGTGTCAACTGCTTAAATTTTGTTTCCTTTTCAGTGTTGGCTAATATCAACATGTGACTGCTTGCATCCGCACTAACTGTGGTTTTTGTCTTTAC 
TAAATCTAACAGCGTACTGTTGCCAACACTAAAATGTTAAATTATTTTAAAACTAACAGACGACTTGGCGCCAAATTTGAAATAAAATTAAAATG 
TTTTCGAATATAAAAGGGACAGACAACATTAATGACTAGGGTTTTATTGAAATATACATGGGGCTAATTGCTGGACATATCAGATTTAAATTATA 
TTTTAATATGTATAAGTGTATATCATTTTCATTTTGTGCTGTTTTTATTTATCTTTTTACAATTAATGCCTTCTGGCTGCATATATAACATAAGG 
AT T C AC AC AAGT GGC AC T T AAT T AT AC AAACT T AAC T T T AAAT TTTCTACACAGTTTTTTTCCCTTT T GC G AAAAC G AAGC AC AAAT AGT C AG GC 
GAATTAGTAGCATAATACATTACGCATAATAACAATATTTAAAATTAGGCATACACATAGTTCACAGTATTTTGCACGATTGCTTGGACGTCGTC 
TTTGGTTTCCTCTCCACGGCTCTAACCGCCTCCTCGAAGACCTGCTGCAGGTTCTGCTTCTTCTTGGCGGAGCACTCGACCAAGTTGAATGCGTG 
TATCTCTTTGCGCATCTTTTTTCCCTCCTGAAAAAGTCGAGAAAACTTAGCATACAACTGTGTCAATCGTTTAGTTACTAACTAACCTGTGTGGT 
CACGAACTTCTCCGAGTTGGGAATGCGCAAGTCCAGTTTGGTGCCCACGAGAACCACGGGAACGTGGGCGGAGAAGTGACGGATCTCCGGCCACC 
ACTTGCTTTTCACGTTTTCGAATGAGGTCCTACTGCTGATCGAATAGCACAACAGGAAGCAGTTGGTCTGTGGAGCGAGTACGAAATGTATTAGC 
AATAGGTAGGACTAATTTCCCTATCAGTAGTCAATGCTATCATACTTACACTGGGATAGCTCAGGGGGCGCAGCCTTTCGTAATCCTCCTGGCCG 
GCAGTGTCCCAGAGGGTCAGATTGTAGTCCCGATCATCTACGGCTATGTTGCAGGCGTGATTGTCGAACACTGTGGGTATGTACTCCTCGGGGAA 
CTCGTTCCGTGTGTAGGTTATCAGCATGCAGGTTTTGCCCACCATGCCATCGCCCACGATGGTTATTTTCAGCGGGCGCGGACTCTTCGTTATGT 
TCGCCGTCATTCCCGGTTTTTTTCCCCGTTCGTCTTTGTTAAGCAAGAACTGCAAAGAGGACAAAATAGAGGCGTCACAATCGGGATTGCAACAT 
TTGGTGGCAGATGTTGGCGCACTGTGTGTTTATCTTGGCGCACCCACACTTTCATCCAACCGGAAATCTCCAGGCTTTGCCCCTAGCTGGCTTCA 
C T T T GAT AC T T GC AG AAT AC AGC AG AAAT GGGATT ACT GCT C AT AC AGAAT AT AAG AAT GT A T AAG AATG AAC T TC AA T AGC AT T AGC TG GAAAT 
AT T G C T GAGT AAGT AAAT AC AAT AC AC CAGTC T AAT CT AC AT T C T AT T T C A GT T G AGT T CGC T T G AAGG AAT AT C G GT T T C CAT T G AC GT T T T G A 
AGTTTTCATTATTGCTGGAATGTGTTTTATAGAGTTAAATATATAAATGCCACATTTTAAACAAATTATACGTATCTATATCATAACAGCGAACA 
GTCCACCTGAATTAACCTTGTTGATCATACAACCCATATCGCACATGAACTTCAAAACTTTTAATCTAATCGCCGGTCGAATTGAAAACGACAAA 
AGTCTGGGCAATTGGAGCTTTTACGACGGGCCAACTGTGGACTGGACTTCATCAAGCGCAATTGATTCTGCGAAATCTCTAATGGAAATAAATGT 
GCTTTTATGGCTGTCCCCCCCGTCTTACATAACAATTAAATAAAGGCCGGGCCAAAACTGAAAGACTTTCACCATCTGTCCAGGCACAAAAACAA 
ACAAACTAACAAAACCGCATTTGTTTCATAAATTTTTCATTTTTAATTAAAACTGGTTTCGTCGGTTTCCAGTTGACCCAAAACATAGACTAGAT 
TTCCCTACGCAGCCTCCATAACGACACAAAATCCAAATCAAAAGACATTTGCCCGATTTACAAATGAGCGATGCCGTTAGGTATTCGTATTCGTA 
TTCATCTTGTGAGTGGGAATTCTTGGGGTACCCGTTTAAGCAGAAAACTCATGCGAATACCAGGAATTATTCACTGTTTGCGGCATGACTCATCG 
C GT AC AC T T GT TGAT GT C TCGGGCTC CAT T CAGTT T CC T T AC AT AAT T GC AAAT C A G AGC GG T GG AAGC GT T T AT TAG AAG AGC T AG AC C AC AAT 
CTTGGCTTGGTGGCTGACTCATCGGTGAGGGAAAACGCTTCCTTCCCTCCAACCCACACCAAATTTCATCTTTGAGCTGCATTTTAAATTATTCA 
GAAAATAAAGATTGAAAACCGGTTGCGTTGCGACATTCATCAGCATTCACCGTTCGACGGCACTGCACTTCCCAACTTTTGACCACGAAAGGCAC 
AAAACCACAGCCAAAGATCCCCCTAATACATACATATAAAATAATAATTTAATTGTTCACCCCTATAAGTACAAAAGCTTATTCATATGGATGAT 
TGAATTTAGCAAACAAAATGTTGTGTTAGAAGTCAAGTAATAAATACACTTTCCAACCAACCTATCAGTTCGCAAGAAGTAGATGATAACATTTA 
GATAAGCAGTGAAAAGTGAAGAAAAGTACCAAATACACTGCAATTTCAAGCTAGTATAAAATTCACCAATATCTGGAATGCCTTGTAAGACGCGG 
CAAATGCGTGAGTAATAAACATAAAACGCTTTCTTGGCACGCGATCCACAGATGCTGAACTTTGACTGGGGCACTTTGATGCCGGGGTCAACACA 
ATTAAGCCCCCCATCATTGATGCGAGTGGCTTCAAGTTGGGGTTAAGTGCTATTCGCTTTTTTAGGTTCGCCATTTTTTGATTGATTTTGTAAAC 
TTTTCCTAAAGCCTTGAATAGTTTAAATGTATTCCAACGATCAAGCTTCTGAACAAATTTTATTCCAATTCTGTATGAATGACGTTACCAAATTT 
AAATCGGATGAGTTCGTTCCAGCAAAATGGCAGCAGAATCCCCAAATAGAAGCAGTGCTTTTTTTTCTCTACTTTTGTTGCCAAATTAGTAAACA 
AATCCAAATGAACAATTGACCATTAGGTAGCCACTAGCGCAAAAATGGGCGAAAAAAAAATAAAGCGAACAAAAACACAAATTTGGCCAGAGAGC 
AACAACGAAACTAAATAAACAGCACACTTTTGTTTTGGCCTCTTTGGACGCGTTTTAAATTATTATAAACAAACAGCTTTGGGTGCTGTTTTAGT 
GACTACAGAAATGTGCAAATAATTAAGTCCCAATGCTACCGAATTCTTCTATTTATACAACCAGCTGATTAAGCAACACGTAGGAAGCAAACTCG 
TGAATGAGTTTCCTCAGTTCCACGATGTAAACTTGTGGATTAAAAAACAGGTGTGCGAGCAATGGTCGAAACTTTCCAGGCATGCCACCAAGTGG 
GTGGCAATACCCCCCTGCTACCCTGTACAATGCCGCATCATCATCATAACTTATGCCTTTCTATCAGTTCGGAAAACAGCGGAAACTCTTTATCA 
AGCTCGCTGGCCGATGATGATGAAAATGAAACTGTATCGAGCGAGAGCGCGCGGCTCGTAATTAATACACATAATAAATTATTTAAACATAATTG 
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CTGTGCATTTAATGCACTCCCTTGAAACATGAAGTGCACATGCCCGGCAGTCCGTGAACAACTTATTCCAGTTCATTGTTATCTACAGGCTTGGC 

CTCTTGTAAACTTTTCATATCAAGTTACGAACTCGTCGTTCTCGGAATCTTGTCCCAAGAACAAGACTTGTCACATTTCACACTCTCTCTTTCTT 

TGCATTGATTCGGTGGCGATGATTTCCCAAGATGAGGAAAGGTCTCAAAGCGACACATCTGCACTATGGCTACTCGTATTCATGTGCAATTCACT 

TGGGTCACTCCAGTTCAAACCTCGTCTTGGAGGCGTCTCTTATCAGCCTCCTTAATAAATGGGTGCATGCATCAGTTGGATTCCCGTTCCCATAT 

AGCAGACGGTCCACAAATAACTCAGTTACACAAAGGCGGAGTTAGTCAATAAGTGAATTATACGTATAGATGGATCATCCGACTACGCAATTAAT 

TAAATAATTCCGCAATAACGAATGTGAAGCGACGTCAGTAGAATGTAGGTAATTCTTTTATATAACTAAGATTACTCGGCTTGAGTTTCTTAAGA 

ATTTCGTTTTATTAGAACTTGCAGTGGAGCAGCAGTATAACTAATCGTACGCCCAAAACCAGATCGAATTGGTACTCCAAATCGTCTATATAGGT 

AGGTACATTTCTATATAGACACCTATTTCTATTCTACACCAAAGCGAAAAAAAAACACTTTTATTCAACTCTGCTTCTTGTTGCGAAAAATATAC 

GAACTGTATATGTGCATGTATTTTTATGGTTTCTCGCATGGGTTTTTGTATTTAATAAACTGATTAGATAAGATAGATGTCGGTGAATGCGGTGT 

ACATAGGGAATTGGACTTGAAAGAGCAACGGCGATAAGTGGATGTGGAACCATGGCGAAAACCCATGGTTACCTGGTCAGAGGAGCCATCGAATC 

CCCAGAATCAGATTGGATCCGGGCAGATTGCAAGGGCACGCGCTGCCAATTAGGTAAATTAATTGAGGTTAATGCGGAGTGTTTATTGTGCCATA 

GGTCGCAATGAATTGATTAGCGACATTATGGATTGTATTGGATTGGGGCGACAGGCGATTCACGTACTCGGTTTCCTGCGACGTCCTCTAAATGG 

CTGTGTAACCATATAATTGTTTGATAACTAAAAATGGTATCGAGCCCATTATGGTGGAGAATCAAGTCGCCTGCCAAGATCCCCGAAGAATAAGA 

GAGCCCCACAACAGCGATTGGAATTGGAATTGGGACTGGAAGAGGAAGCGAGAGCAAGAGAGTTGGGATCGTAAACCGAAACAACTTGGCATTCC 

AGAGCTGTGGACGGATGGGCGGTCGGATCGGGGGAGGTTTGGTGGGGGGGGGGGGGGGGTGCGGTGACGTGAGCGCAATACAAAGTGACTCAGGT 

GATGGTGAAAAAGGAGGAACGATCACGAAGATAACTCTCCCAGTGAGTGAGATCTATGCTAATAGTGTTTACAGCCAAAGATAAATGTTGTTATT 

TTAACAGCTCGTGCAATCGCAGACCCACTGAGCGCCCACACCATCGCAAGGTCGTCGCAATTGGGTGTTATTGAATTCATTCTGCGCTGGATTCG 

ACATTTGGCCAGCTAGCTGCTGGCGAGGATGAACTTAGTGGGATACGAGTGTTGCGTAGACGCTTTGAACTGGACGTACTTTTAGATACGGGACT 

TGTGTGCCCGTACTTTTATCCGTATGCGTACTATTTGCAATAATCAATCGAACAACACGTCTGGCTGGCGATGACGTGCGACTTGATCGCAGTGT 

GGATGTGGATTACTCGAACTGGCGGCGGCGAGAGTATAAAACCGGATAAACGAATAGCAACAACCGATCGCACGCGATGTCGTCGAACAACTGAA 

CCTTTGGCGATTCCGATTCGTTTAGCTCGCAGAGTTCGAGTAGCTCGCAGATACGCGAAGAGCGAACGAAGAGAGCGCGGTGGAGCAGCTCTCTC 

TCTTAGCACTTGGCACACAATGTTTCCTTATCAAAGGTTTCGGTTTGGATAAGAAGAATTTGGCGAACTACAGCTCCAGGTTTCTAATCCCTATA 

AATATTTTTGCATTTCTTTTTGAGTAAATGATGTGATGTCTGCAATTCCAACTTAATGTTTCGTATTCAAGTTGATAACATTTTGGTTTGAATGC 

TT ATTTTTAGAAT ACAGAAGCTCAATGC TCATATAAGTAC ATAAATAAGCCCAT GATTAAAAACCAC CAT AAAT AAGT TCAT AATGAT AAATTAG 

TTACTTCTATTTACGCATTTTCTACTACGAATAAAACGTTTGAATCAAACTTCGAAGTGCGTCAAATTATTAAGACTGAACTACTCACGCAATTA 

CTTAACTGAGAACGCAAAATACTACTTTAAAGCTACTTGTGAGCTACAAATCCTGTTGCGTAATATCCCGCCCTGCAACCCGAGTATCTGGCATA 

AGCCATTAATCAACGACATTCCGCAAGTGCGTGCCTCCAAACTGGGCTTTATATCTGTCCGATGTAGGGGTAATTAAGCCAAAGAACCGCCTTGC 

GGGTTAGACCGAATAAAATGCGTTTACTGCGAGTATGTGGCACTGGCGTACATGAGTTCAATTAGCCGACACGTTGATGGGGCACTACATTAATT 

AGAATTACCACTGCACTGGGCAAGTCACTGCCGTACAAAGAATAAAGGTCAAGTGACAAAGGCTTCGACCCAGACCCAAAAAGTCGGGCGGTCAC 

AGCAAAGAGAAGAGCCGGAAAATTATTAATACACAAATATGAATGTGCAGCATTTCCAATTAGGGATAGCTACTTAAGCGAACACATGAGCAACA 

TGTGTGCCACTACTGCAAACACCAAAGATACTTCTGGAAGACT 

{SEQ ID NO: 1033) 

Exon: 6358.. 6160 
Exon: 1949.. 1665 
Exon: 1587.. 1417 
Exon: 1357.. 1001 

Start ATG: 1910 {Reverse strand: CAT) 
Transcript No. : CT26615 

GTTCGACGACATCGCGTGCGATCGGTTGTTGCTATTCGTTTATCCGGTTTTATACTCTCGCCGCCGCCAGTTCGAGTAATCCACATCCACACTGC 
GATCAAGTCGCACGTCATCGCCAGCCAGACGTGTTGTTCGATTGATTATTGCAAATAGTACGCATACGGATAAAAGTACGGGCACACAAGTCCCG 
TATCTAAAATTCTTGCTTAACAAAGACGAACGGGGAAAAAAACCGGGAATGACGGCGAACATAACGAAGAGTCCGCGCCCGCTGAAAATAACCAT 
CGTGGGCGATGGCATGGTGGGCAAAACCTGCATGCTGATAACCTACACACGGAACGAGTTCCCCGAGGAGTACATACCCACAGTGTTCGACAATC 
ACGCCTGCAACATAGCCGTAGATGATCGGGACTACAATCTGACCCTCTGGGACACTGCCGGCCAGGAGGATTACGAAAGGCTGCGCCCCCTGAGC 
TATCCCAGTACCAACTGCTTCCTGTTGTGCTATTCGATCAGCAGTAGGACCTCATTCGAAAACGTGAAAAGCAAGTGGTGGCCGGAGATCCGTCA 
CTTCTCCGCCCACGTTCCCGTGGTTCTCGTGGGCACCAAACTGGACTTGCGCATTCCCAACTCGGAGAAGTTCGTGACCACACAGGAGGGAAAAA 
AGATGCGCAAAGAGATACACGCATTCAACTTGGTCGAGTGCTCCGCCAAGAAGAAGCAGAACCTGCAGCAGGTCTTCGAGGAGGCGGTTAGAGCC 
GTGGAGAGGAAACCAAAGACGACGTCCAAGCAATCGTGCAAAATACTGTGAACTATGTGTATGCCTAATTTTAAATATTGTTATTATGCGTAATG 
TATTATGCTACTAATTCGCCTGACTATTTGTGCTTCGTTTTCGCAAAAGGGAAAAAAACTGTGTAGAAAATTTAAAGTTAAGTTTGTATAATTAA 
GTGCCACTTGTGTGAATCCTTATGTTATATATGCAGCCAGAAGGCATTAATTGTAAAAAGAT 
(SEQ ID NO: 1034) 

Start ATG: 239 (Reverse strand: CAT) 

MTANITKSPRPLKITIVGDGMVGKTCMLITYTRNEFPEEYIPTVFDNHACNIAVDDRDYNLTLWDTAGQEDYERLRPLSYPSTNCFLLCYSISSR 
TSFENVKSKWWPEIRHFSAHVPVVLVGTKLDLRIPNSEKFVTTQEGKKMRKEIHAFNLVECSAKKKQNLQQVFEEAVRAVERKPKTTSKQSCKIL 
* 

(SEQ ID NO: 1035) 

Classification: enzyme 
Gene Symbol: RhoL 
FlyBase ID: FBgn0014380 



Celera Sequence No. : 142000013384474 

CACCACCACCATCTCGCACAATGCGTCCGTATTGATCGCGCTGCTCGCCGTGATCCTCCTTGACCACCAGTTCGCGGCCATTCACCTCATAGCGG 
TTCATTTTCTCCAAGGCCTTCTGTACGTTCTCCGGATCCTTGAACTCTACGATGCCACAGCCGCGAGCCTTGCCGCTCTCATCGAAGAACAGCTG 
GACGTACTCAATGGAGCCGACGATGCGGCGGAACAGATCCTTCAGATCCTGCCAACGGTAGTCGTATGGAATGTTCGAAATGTAGACCCGGCAAT 
TGCGACGTTCCCGGCTACGGTCGCGAGCAGCGACGCCACCTCCCTGACTACCAGCGCCGTTTCCATTGCCATCAGCGTCCGTGAAACGGGAGCCA 
CGTGCTCCGCGTCCACGACGGTCGCGCTCCTTCTCGCGGCTCTCCACCGAGTTACTAGCGTCCATGCTCATTGTTGTCTGCAAGGGATTCGCAAT 
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GCGTTTTGTAGTTCTTGCAATATTCGTAGTAGTACACACCCGGAGCGGCGTCCGCGAAAACAAAATGGCGACCGCGTCAATCGACTTTTTCGCCA 
AATGTTCTTTTCTATGCTAGCGTTATGGCTTACCTTTTGATGTTGGCTGTGCTCTTAATACTGGTAATATTTCTATTTTGCTGCTTTCGGCGTAG 
TAAATTTTATACAATGAACAAACGCTTGTGACAAACTACAGTTTCCGAGATCAATAGAAAACCGTACAAAAGTCTATGTTAGAGTTGACAGTGAT 
ACGATGACAGCACATTCATCGATGGTTTGATATTTGGACGTGTCTCAGATTTCTATCGACTAGACATCGCCGATATTCGAAAATGCTCTTCGAAT 
TATCGAAATGTAGGCATACTGCAATTTACGCGCGCAACGCAAAAATTCAAATAATAGAATATTTGGCTCAACAAGACAGGAGACTTTAGATGGAA 
AAATAGAATCCACAAAAGCAAAACTATGGAATAACTAAAACCACTTTTCATAAATAGTACACACAATCGATTTATTTCGTTTCTTTTTGTATGGC 
AAAAATTAATAC AAAAAATTAAATATTAAATGTATGTATGTATGT ATAAAAAGCTTAAAGCAAACT AT AT AATGT AAAT TT AAT T GGCTGTTT GT 
TTTCTCTCCGATTGATCTGTCAGTATGGTAAGTAGAAAGAAAGGCAAATCTCAAGAACTCATACGGATTATAATTTCATCATCGTAGCACCACAG 
ATATGTGTTTAACCGAACATAAATCTAGTTTGGGGCTAGCGAATTTACTCGTTTGTGCGAAAGTTGCTGTGAGCTTATCGGCTTTTGCTTGTTTG 
TTCTCTAATGATGAATACTAATGGAATTCTGCTCGTGCGTGTGTGCGAAAAAAGATTACATTTATGTATGACTGTCGGGTTGAAACCAATAGATA 
ACAGGGCTGTAAAAATACACAATATTTGTTTACATCGCGTTTCGGTTTTAAGCACGGAGTATCTGTTGAAGTACAAACAACATAGAATCAAGTTA 
AGTGCTGGTTGAAAACAAACGGATATAGCAGCTGAACCAAGGCAGTTACAACAGTATTCATAGTTTTTATGTAGAGGTACATATTCATTTTCATT 
TTCGATTTGCTTCGTTCGTTTTTGTTTGCTTGTTATTGTTAATACTCTTAAAAGGGCGCCTCGCAGCCTTTCAAACGACACAAAACACACACATT 
CACAAAATCATTTCGACGTACATAAAACTCTAGTTACTTAAATTAATTACAATTTCTCTTCTTTTATCTGACAATTGCACCGACCAACACGCTCC 
CACATGTGTGTGTTTTTCGCATTGTGTGTCCAAAATGGACAAACCATCGAGCCCAGGACGATCGGCGATTCGCCGACGCACATACAGACAAACAG 
GCGACATTTGAAGAAATAAAAAGTGAATAATAACCAATGCGTGGCCGTTTAGAGCATTTTACATTTAAATCTACGATTCGGCTTGTTCATTTTGC 
GGCCCCTCCGCCCCTTGTTGTCCTTGTCCTTGCGGATTTCGCGCACCAGTGTGTAAAATGCATCGTCCACGCCCATGCGCGTCTTGGCGGATGTC 
TCAATGTATGGAATGCCGTACTGTTTGGCCACCTGAGAACAACCATAATTAACGGAATGTTCGCTCGGAACTCTGAAGGCCGGTACTCACCTCTC 
TTGCCTGCTCGTTGTTAACGTTCCACGAGGCCAGATCACATTTGTTGCCCACCAGCACCATGGGCACCTCTTCGGCATCCTTTACGCGCTTGATC 
TGCTCACGGTAGGTGCCGATATCCTCGAAGGACTTCGCACTGTTGACGGCAAAGACCAGCAGGAATCCCTCGCCAGTCCGCATATACTGATCCCG 
CATGGCCGAGTACTCCTCTTGGCCGGCGGTGTCCAGGATGTCCAGCAGGCAGGTCTCTCCATCTGCAGGGGAATATAAAATGCATTTGAATATCG 
CAACTTCTTAAGGTCTCTTCTAGTATAAACGGGAAACCAAACCAGCGAAAGGGTGCAGAAATCTCTAGAGTATTTGGACCTGTGCAGCCCTAGCT 
TTGCCAGATCTAAGTACATATAACTCACCGATAACCACTTGCTTTCGGTAAGAGTCCTCGATTGTGGGGTCGTACTCGTCCACGAAATGGTTCTG 
GATTAGCTGGATGGTGAGCGCGGACTTGCCCACGCCTCCGGCTCCAACGACGACCAGTTTGTATTCCGTCATTTGGCTGTGGCGTCCGTTTTCGA 
TTTGCCGTAGTCAAATGTCACAGTAACAGTCTCTTTCTCGCGATCGATGCTTACTCTCTTGTGAAGCGCCCGCAGATGCAGTCTCTCTCTCCGTA 
CGCTTGTTGCGTGTGTTTGTGTGGGTGACTGTGCGTGTATGGGCTGCAAGTGGAGTAAATTATTAATTAAGATTGATTAAAATAAATTAAACGCA 
TAATTAGGCAGGAATTAATGCACGTCACTGAGTCACTGTGACTAGGCGCTTACTCGCACAAACACACACGCACCACTGCATGCACACAGCCAGAC 
GGGCAACACAAAAAAAAATGGGAAAAAGAGATCAATGACTCAAACTAAAATTAAAACCCATTCGCTCAATGCAAATTAGTTAGTCTCCCCCATAG 
CCGAGTTGTTAACTAGGTGAATTTGCTAATTCGGCGGCGTTGTTTTTAGCCCTTTTCTTTATTATTTTTCTTTTGCCTGCCGGGGCTGCGGACCT 
TCTTTGTTGCTGCCTCTTCAGCTAGCTCCTCCTTGTTTTTCCGCTCCGCTCCTCTCTATCAATACTCTGCCTCTCTCTTCTCGCGCGCGGCGCAG 
TTGCTGGGCACGTTTCACGTCACTTTCGCGGCTAACGGTGCGACAGGCGGTACTCCTCGATGGCACCAATTTAATTGCCGTGCGGTAATTGCAAT 
T AAC AGC GC AGG AAAAC AAT AAAT C C GT AT GT G AAAT G AAG AT G AAAT CT AG GC T GT ACGT AGC TGT GAAGAT ACT AG AGC TGGC AC C AAGC C G A 
TGGCACTATCGATAGCGATGGCTGCATTCTGGCCGGCACCATCGATGGACTTGCAATAGCGATTGCTATATGAAAACTAATCTAAAGAGGTGGAT 
GCACTTCAGTCGACTCTCTATAATTTGCTTAAACTAATAAATGATTTGATCAATACAGCTTTCTGTAAAAACTGGCAGACGCTTTCTGCTTTTAA 
TAATTGTTAATTTAAGTTCAACGGGCTGGCATCACCGTTTCTTAGCACGGACTCAAGCCTGAGTCTATTATTTCAACCACCACTGTAACGAAAAC 
AGCATGGACAGATTGAAATTCAATAATTTGGTAAATAAACGATTTTATTTAAAATTATAGAGTTTTAATTAAAAAGAACTTTTACAGGTGATATC 
CAACAAGAAGGTCATTCAAAAGGCACGCGCCCAGACCATCGCCAAAATCGTACAAAAGCTGCGAAAGACAAAAGAGGCTCTGGCCAAAAATCCAG 
ACAGTGAAAAGGAAAAGATTCGTTTACGCAAGAACACAGAATGCCTGGCCCAGTTGAAGGCTCTAAAATATATGGATATGGTACGCCAGTCGCTC 
CTGCAGGAGGGCACAAATCCCAATGCGGTGATTGCTAACGAGCGCTCCACGCCGGATGAACTGGGCATTGCCATGCTGCGGCTAAACAAATTGAT 
GCACGGACTGGTGGACAAGTTTGTGGAAACCTTGAAGCTGAGCACCACCGATAAGGAGGCCAAGTGGCGTGAGGAGATCCTAGAGACGAGCAAGC 
GAAGGGCCAAAATAGAACGCACTGAGGAGCGGAAGAGGAAGCGCAAGGAGCTGAAGGAGCAGAAGGCCCAAACCAAAAACCGACTGGAGTGGTTA 
GAGCAGAACAAAGTGGTGGATGCAGATGTTAATGGAGCAACTGCGGAAACGCCCACTCTTCAAGTCAGCAAAGTAAATGATCAGGAAATACCACA 
GTCTTTTGCAAAAACTG 
(SEQ ID NO: 1036) 



Exon: 3482.. 3229 
Exon: 2893..2594 
Exon: 2437. .2181 
Exon: 2122. .1001 

Start ATG: 2732 (Reverse strand: CAT } 



Transcript No. : CT26623 

TACGTACAGCCTAGATTTCATCTTCATTTCACATACGGATTTATTGTTTTCCTGCGCTGTTAATTGCAATTACCGCACGGCAATTAAATTGGTGC 
CATCGAGGAGTACCGCCTGTCGCACCGTTAGCCGCGAAAGTGACGTGAAACGTGCCCAGCAACTGCGCCGCGCGCGAGAAGAGAGAGGCAGAGTA 
TTGATAGAGAGGAGCGGAGCGGAAAAACAAGGAGGAGCTAGCTGAAGAGGCAGCAACAAAGAAGCCCATACACGCACAGTCACCCACACAAACAC 
ACGCAACAAGCGTACGGAGAGAGAGACTGCATCTGCGGGCGCTTCACAAGAGAGTAAGCATCGATCGCGAGAAAGAGACTGTTACTGTGACATTT 
GACTACGGCAAATCGAAAACGGACGCCACAGCCAAATGACGGAATACAAACTGGTCGTCGTTGGAGCCGGAGGCGTGGGCAAGTCCGCGCTCACC 
ATCCAGCTAATCCAGAACCATTTCGTGGACGAGTACGACCCCACAATCGAGGACTCTTACCGAAAGCAAGTGGTTATCGATGGAGAGACCTGCCT 
GCTGGACATCCTGGACACCGCCGGCCAAGAGGAGTACTCGGCCATGCGGGATCAGTATATGCGGACTGGCGAGGGATTCCTGCTGGTCTTTGCCG 
TCAACAGTGCGAAGTCCTTCGAGGATATCGGCACCTACCGTGAGCAGATCAAGCGCGTAAAGGATGCCGAAGAGGTGCCCATGGTGCTGGTGGGC 
AACAAATGTGATCTGGCCTCGTGGAACGTTAACAACGAGCAGGCAAGAGAGGTGGCCAAACAGTACGGCATTCCATACATTGAGACATCCGCCAA 
GACGCGCATGGGCGTGGACGATGCATTTTACACACTGGTGCGCGAAATCCGCAAGGACAAGGACAACAAGGGGCGGAGGGGCCGCAAAATGAACA 
AGCCGAATCGTAGATTTAAATGTAAAATGCTCTAAACGGCCACGCATTGGTTATTATTCACTTTTTATTTCTTCAAATGTCGCCTGTTTGTCTGT 
ATGTGCGTCGGCGAATCGCCGATCGTCCTGGGCTCGATGGTTTGTCCATTTTGGACACACAATGCGAAAAACACACACATGTGGGAGCGTGTTGG 
TCGGTGCAATTGTCAGATAAAAGAAGAGAAATTGTAATTAATTTAAGTAACTAGAGTTTTATGTACGTCGAAATGATTTTGTGAATGTGTGTGTT 
TTGTGTCGTTTGAAAGGCTGCGAGGCGCCCTTTTAAGAGTATTAACAATAACAAGCAAACAAAAACGAACGAAGCAAATCGAAAATGAAAATGAA 
TATGTACCTCTACATAAAAACTATGAATACTGTTGTAACTGCCTTGGTTCAGCTGCTATATCCGTTTGTTTTCAACCAGCACTTAACTTGATTCT 
ATGTTGTTTGTACTTCAACAGATACTCCGTGCTTAAAACCGAAACGCGATGTAAACAAATATTGTGTATTTTTACAGCCCTGTTATCTATTGGTT 
TCAACCCGACAGTCATACATAAATGTAATCTTTTTTCGCACACACGCACGAGCAGAATTCCATTAGTATTCATCATTAGAGAACAAACAAGCAAA 
AGCCGATAAGCTCACAGCAACTTTCGCACAAACGAGTAAATTCGCTAGCCCCAAACTAGATTTATGTTCGGTTAAACACATATCTGTGGTGCTAC 
GATGATGAAATTATAATCCGTATGAGTTCTTGAGATTTGCCTTTCTTTCTACTTACCATACTGACAGATCAATCGGAGAGAAAACAAACAGCCAA 
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TTAAATTTACATTATATAGTTTGCTTTAAGCTTTTTATACATACATACATACATTTAATATTTAATTTTTTGTATTAATTTTTGCCATACAAAAA 
GAAACGAAATAAATCGATTGTGTGTACTATTTA 
(SEQ ID NO: 1037) 

Start ATG: 416 {Reverse strand: CAT) 

MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLLVFAVNSAKSFEDIGT 
YREQIKRVKDAEEVPMVLVGNKCDLASWNVNNEQAREVAKQYGIPYIETSAKTRMGVDDAFYTLVREIRKDKDNKGRRGRKMNKPNRRFKCKML* 
{SEQ ID NO: 1038) 

Classification : signal_transduction 
Gene Symbol: Ras85D 
FlyBase ID: FBgn0003205 



Celera Sequence No. : 142000013384590 

GTTTGGCTTGCGTATTTCCGGCATTTCTGTATTTCTCGAAAATAACATCTTTTCTTTTTGTAAGATGATAAAAAGGTTAGGATGTTAAAGGTGGT 
CGTGGTCCCACGATGAAAGAAGACATTTTTATTAGTAGTTTATAATTCGTATATTTATTTCGATAATACACCTATTATGAATACAGTTCGCATCT 
TTGATTGTTTGTATTAATTTGCTTTGGTACAGTTAGGGACTTTTTCTTATCAGTATGCTATTTATTAAAAGTAATGAAAAACTTGTTACGATACT 
TTAGATTTGGATAAAAACCAGAATTAATTATAACAGACATTCTAAAAAAATAATCACTGATACATACATACGTTTTAAAAGATTGCATTCGTATA 
AATATTCGAGCATTAGAAGTAGAATACTGTCGGAATGGAATTAGTCGAATTAACATACAGATAAGGAAACGTCCCTAATGACTATACTAATACAA 
AGTTCAGTGGAGATTTCGCTCTTTTCAAGGGCGCCTCAAACATTGTTCCTAACACTCTGGTCAGAATTCAACAATAAAATTAGAGAATACATTTG 
AAAACAAACAAA7AATTGATTCAAATAATGTTCAGCAATTACGATACGTTAAATTTGTCGTTTTCAAAGTGAATGCTTTGGTTTCTTATATTTTT 
CGGTGTTTGGTTTCGCTTTTATAAAGGACAACTCGAACTACGGAACAATCTGAGGCTGGCTGCTTCCATCGTGGCTAACTATTAGCATGTGTACT 
AACTGGTGAATAGCTGCGCTATGCTCCCACATACAACATCAACCATACCCATAACAACGCGCTTACATATCTTTGGCGAGTGACCACTACGTTCA 
ATTGAATCGAAAAAAGGGAAGAGACTTGTGATGTGTGTAACGTATACAATGTCAAAGAGCTACAGTTAGAGAGTGTTTTTCGTGTGTGGTTTTCT 
TAAAACGACGACAGATTTTGTGTTTGACCCTTCCGAACCAGTCCGAGGTCCTAAACAACCACTGCTCCTGCCTCCGAACGGCTAACTGGTAGATC 
GGGACAATTCTTCCAGGTGTCCGAAAGCGGATCATAACATTCGACAGCATTGATAGTCACTTGGGCTCGGTAGAAGTTTACCTCGCAGGGCTGGC 
AG AG AAGC AT AC AAAAAC AATGAT T AAA T T AT GAATC AAAT AAAT T GAAAT AT ACT T GTT AC C T GAT C AC CGC C AGC T AC GT AG AGC AAGCC AT C 
GGCGGCGACCACGGCAGGAATGGCTCTGGGCACGTTTAATGCACACACTGTAGTCCACTTATCCTCGTCGAAGCTGTAGCGTTCCACGGAGCTGA 
GAATGTCCTGACTGATGCTACTGCCTCCAACCACATACAAGTAGCGATCCAGAACGGCCACACCCATTTGACAGCGAGCTGTTTGCATGCGGGCC 
AGCTCGTTCCACTCTTTTGTCACTGGATTGAAGCTGCAATGGATTTTTATAAAAATATATAACTACGAATATAAATATTTTTTGTTAAATACCTG 
ATTAAATCCGGCAGATGACGGGTTGTTGTTGTGCAGCCACCGACAATGTZVAATGAGACCCTCGAAACTGACCACACCCATGCTGAATCTAGGCTG 
CGGCATGCTGCCCATTAATTTCCAGAGATCCTGCTCCGGATCATAGCACTCCATGGAGCCGCCGATGTCGTCGCCTATCCAACCGCCTACGGCAA 
ACAGATTTCCGCCCATGGTGCAAAGACCGAACTCGCAGCGCGGCACAATCATCGGGGCAATCGGTTGCCAAACATCGTTCTGTGGATCATAGACC 
TCGCCATTGGCCAGAATCTGTGAACCGCGCTCTCCTCCTACCACATAGATCTTGCCATTCAATGCTGATACACCTGGCAGGATGCGTCCCACCTC 
CATAGGCGCCGTTTCCGTCCACTCGCGCCGGAATATATCGAATTTGGCCACCGTCTCGAAGATGCAGTCGGCCGAGTTCCAGGTTCGCGGCGTAT 
CCCGATGAGATCCACCAATGATATATATGTTCTTCTTGGCCAGCTGACGCGGACATACCCGCAGCGGCACCAATTGTCCGCGCTTGGACGCAATG 
TCCCGGCAAATGGAGCGCAGGGCTATCTTCACGGACATGTCGCGACAGTCGTCCTTCAGAGCCTTGTCGATCACCTTTACCGGAACAAGCGCCAT 
GCGCACGTGGGACAGGACATCGAAGACGTAGCAGCGGCGCTGCGTCACGTCGTGCTTAATCCAACGAAGCGCCGCCTGGAACACCTGCGACTCGG 
AGTCTACACGCAGCAGCTCCGAGTTCAGTAGCTGCGAGAGTAGAGTTTGTGGCGTTTCTAGGAACTCATCTTCAAGCGTTACCTGAACGAAGAAA 
GGAAAGTACCAATCAGTTAAAATAGAAATATATATTTTTGCCAAATGCTCAAATTTAAACGATAAATTATTAGGTTCAAGGCAGAAATGAAAATA 
ACTTTAAATATTATTATTTTGATATTTTTTATCGTTTATCTCGTTGAATATTTTCGTTTCCAATAAATATAGAACATTGCTTGACTATTTCCATG 
ACCCACTTTTCGTTTGGGCGGTACACCTACAAATCAAGCCTTTCAAGTGCATCCAACTCACCGCTGGGAAATTCGCGTGGACAAAGTTCAGTGCA 
CTCTTGGCCAACGACTCGCAGTTGTGGGCCTCGGCGAAGCGTAAAATGCCCAGGGCATTGGAGGCGTGCAGCTCGCGGCAGAGGAACTCGCAGCA 
CCCGTCGACCACCTCGTTTAGCTGCAGCATGTCGGCGGCGGCCAGGAGCTCCTGGACATTGGACTGTAAGATCAAAGAAGGGATTAGAATTTGGA 
GACCAAGCGACTAGGTGTATGGACTCACCTGGGTGATTTCGCAGCGACCCGTATAGATGAAGTCAAGCAGGATGTGCAGGATATCGCCGTCGATT 
GTGTGCAGCACAACGGATTTTTGCTTGACCTCATTTAACCCGAGCTCGGGTCGAAACATGGCCTCGAAGTAGGCACTGGCTGCACTCAGGACAGC 
CCGGTGGGCACTCAATGTTGCCATTCCGGCGATGATCTCCACGTCGCAGAAACGGGACTGCTCGCGCAGTTGGTTGAGATTGGAAAGCACCTTGA 
ACGGATATTGGGCGTTGCAGTAGGAGGGCAGGTTTGTGCGATGGCTGGCCGCTGTGGCTCCAGATCCCAGATTTTGGCCAAAGAAGTCACCGTCG 
CCGCCGCCGCCGGTGGGATTCTGATTGGCGTTCTGATTGGCATTCGCATTCGGACTGGTATTGCCATTGTATTCGTGTATCTGGTATGTTTTCAA 
CGGTGGCATTTCGTCAACTGTTGGGTTTTTACTTTTCGGCGATCTTGAATCCGATATCGCCTTAATAAATGCACTTTTCGGGGGCGACAGCACAG 
CTCAGACTCTTCGCATTCAATTGTTTGGCTTAATTAAACTTAATTGCTGCACTCTGCGCTCACTTCACCCTTTGTTGTTGCTGCTGCTGCTGTTG 
TTGTTGTCGCAGTCCGTTTATTTTTCTCCAATCAAAAGGCGAGCACACACACCCACACACGCAATACAAACGACTGCAGGCAAAAGGTTAGCATA 
TCACAAATTAAAACAACCGAAGTCCGCTACTAAGACAGCTATTTCCCCATTAAGATGAAATAGTGTTTTTAATATTTACAAACTGGCATATTTCT 
TTGACTTTTCAATGCACTTTAGTATGTAGCTGGAACTGGAGTCGATGTTTTTGGGCGACTATCGATGTTGATAGAAGTTGCACTATGCCTGAATA 
TCGATAAGTTCAGTTTTTATCCAGAACTTATTGATAGTAATCGCTGCTATCATCTTTGGTCCACCTCTTGTTGTTATTTAATTATTTGTATTTTA 
TTTTCAATGAATTTTCAATGAAAAGCTGACAAAATGATATCGCGCAAACTACTAAACCTTCGACTGGGTTACCTGGCACGCCGTTGGGCCAGCAC 
GGATCCGCTGAGCATTCAGAACGAGAAGCTGCAGGCTTATTTGGAGTCTCTACGCCAGGAATACTACGCCGTGCGAGTCAACGCCGCCGGAAACA 
GCAAATCCTACGCCCGTCTCGCCCAGCTGGAGGGCGTGGTCAGTGCCCTGGAGCAGCGACGCGTACTAGAACGGCACATCACCAGCGCCAAGGAC 
ATGGAAGCGGAGAAGGACGAGGATATGCGTGAGCTAATGCGCGAGGAGAACGAAGTCTATGTGGATCTACTGGGCAAGCAGGATCAGGCCTTACT 
GCAGGAGCTGCTTACGCTCTCTGACGATGAGGAGTATCCGGCTCTGATCTTCGGCCTGAATGCCGGCGCCGGTGGCCAGGAGGCAATGCTATTCG 
CCCAAGAATTGTACGAAATGTACACCGGCTATTTCGAGCACATGGGCTGGGAGTGGGAGGAGTTTGCCAGCGAGGGCACCGACATCGGGGGTCTG 
CGGCACGCTAGTCTGATGGTGAGCGGCGAGGATGCCTTCCGCTGGCTGCGCTATGAGGCTGGAGTTCACCGCGTACAGCGGGTGCCGGCCACTGA 
AAAGTCTGGACGCATGCACACCAGCACCGCCTCAATTACAGTGATTCCCCGACCCGCCGACATCCAAGTGCACATCGCTGAGAAGGATCTGAAGA 
TCGAGACGAAGCGAGCGAGTGGAGCTGGTGGACAGCATGTTAACACCACCGACTCGGCTGTAAGAATTGTTCATCTGCCCACGGGTCTGGCGGTG 
{SEQ ID NO: 1039) 

Exon: 3750.. 2879 
Exon: 2818. .2627 
Exon: 2362.. 1518 
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Exon: 1458.. 1203 
Exorx: 1135.. 1001 

Start ATG: 3334 (Reverse strand: CAT) 
Transcript No. : CT26720 

TCGACTCCAGTTCCAGCTACATACTAAAGTGCATTGAAAAGTCAMGAAATATGCCAGTTTGTAAATATTAAAAACACTATTTCATCTTAATGGG 
GAAATAGCTGTCTTAGTAGCGGACTTCGGTTGTTTTAATTTGTGATATGCTAACCTTTTGCCTGCAGTCGTTTGTATTGCGTGTGTGGGTGTGTG 
TGCTCGCCTTTTGATTGGAGAAAAATAAACGGACTGCGACAACAACAACAGCAGCAGCAGCAACAACAAAGGGTGAAGTGAGCGCAGAGTGCAGC 
AATTAAGTTTAATTAAGCCAAACAATTGAATGCGAAGAGTCTGAGCTGTGCTGTCGCCCCCGAAAAGTGCATTTATTAAGGCGATATCGGATTCA 
AG AT C GC C G AAAAGT AAAAAC C C AAC AG T T G AC G AAAT GC C AC C GT T G AAAAC AT AC C AG AT ACACG AAT AC AAT GGC AAT AC C AGTC C G AATG C 
GAATGCCAATCAGAACGCCAATCAGAATCCCACCGGCGGCGGCGGCGACGGTGACTTCTTTGGCCAAAATCTGGGATCTGGAGCCACAGCGGCCA 
GCCATCGCACAAACCTGCCCTCCTACTGCAACGCCCAATATCCGTTCAAGGTGCTTTCCAATCTCAACCAACTGCGCGAGCAGTCCCGTTTCTGC 
GACGTGGAGATCATCGCCGGAATGGCAACATTGAGTGCCCACCGGGCTGTCCTGAGTGCAGCCAGTGCCTACTTCGAGGCCATGTTTCGACCCGA 
GCTCGGGTTAAATGAGGTCAAGCAAAAATCCGTTGTGCTGCACACAATCGACGGCGATATCCTGCACATCCTGCTTGACTTCATCTATACGGGTC 
GCTGCGAAATCACCCAGTCCAATGTCCAGGAGCTCCTGGCCGCCGCCGACATGCTGCAGCTAAACGAGGTGGTCGACGGGTGCTGCGAGTTCCTC 
TGCCGCGAGCTGCACGCCTCCAATGCCCTGGGCATTTTACGCTTCGCCGAGGCCCACAACTGCGAGTCGTTGGCCAAGAGTGCACTGAACTTTGT 
CCACGCGAATTTCCCAGCGGTAACGCTTGAAGATGAGTTCCTAGAAACGCCACAAACTCTACTCTCGCAGCTACTGAACTCGGAGCTGCTGCGTG 
TAGACTCCGAGTCGCAGGTGTTCCAGGCGGCGCTTCGTTGGATTAAGCACGACGTGACGCAGCGCCGCTGCTACGTCTTCGATGTCCTGTCCCAC 
GTGCGCATGGCGCTTGTTCCGGTAAAGGTGATCGACAAGGCTCTGAAGGACGACTGTCGCGACATGTCCGTGAAGATAGCCCTGCGCTCCATTTG 
CCGGGACATTGCGTCCAAGCGCGGACAATTGGTGCCGCTGCGGGTATGTCCGCGTCAGCTGGCCAAGAAGAACATATATATCATTGGTGGATCTC 
ATCGGGATACGCCGCGAACCTGGAACTCGGCCGACTGCATCTTCGAGACGGTGGCCAAATTCGATATATTCCGGCGCGAGTGGACGGAAACGGCG 
CCTATGGAGGTGGGACGCATCCTGCCAGGTGTATCAGCATTGAATGGCAAGATCTATGTGGTAGGAGGAGAGCGCGGTTCACAGATTCTGGCCAA 
TGGCGAGGTCTATGATCCACAGAACGATGTTTGGCAACCGATTGCCCCGATGATTGTGCCGCGCTGCGAGTTCGGTCTTTGCACCATGGGCGGAA 
ATCTGTTTGCCGTAGGCGGTTGGATAGGCGACGACATCGGCGGCTCCATGGAGTGCTATGATCCGGAGCAGGATCTCTGGAAATTAATGGGCAGC 
ATGCCGCAGCCTAGATTCAGCATGGGTGTGGTCAGTTTCGAGGGTCTCATTTACATTGTCGGTGGCTGCACAACAACAACCCGTCATCTGCCGGA 
TTTAATCAGCTTCAATCCAGTGACAAAAGAGTGGAACGAGCTGGCCCGCATGCAAACAGCTCGCTGTCAAATGGGTGTGGCCGTTCTGGATCGCT 
ACTTGTATGTGGTTGGAGGCAGTAGCATCAGTCAGGACATTCTCAGCTCCGTGGAACGCTACAGCTTCGACGAGGATAAGTGGACTACAGTGTGT 
GCATTAAACGTGCCCAGAGCCATTCCTGCCGTGGTCGCCGCCGATGGCTTGCTCTACGTAGCTGGCGGTGATCAGCCCTGCGAGGTAAACTTCTA 
CCGAGCCCAAGTGACTATCAATGCTGTCGAATGTTATGATCCGCTTTCGGACACCTGGAAGAATTGTCCCGATCTACCAGTTAGCCGTTCGGAGG 
CAGGAGCAGTGGTTGTTTAG 
(SEQ ID NO: 1040) 

Start ATG: 417 (Reverse strand: CAT) 

MPPLKTYQIHEyNGNTSPNANANQNANQNPTGGGGDGDFFGQNLGSGATAASHRTWLPSYCNAQYPFKVLSNLNQLREQSRFCDVEIIAGMATLS 
AHRAVLSAASAYFEAMFRPELGLNEVKQKSVVLHTIDGDILHILLDFIYTGRCEITQSNVQELLAAADMLQLNEVVDGCCEFLCRELHASNALGI 
LRFAEAHNCESLAKSALNFVHANFPAVTLEDEFLETPQTLLSQLLNSELLRVDSESQVFQAALRWIKHDVTQRRCYVFDVLSHVRMALVPVKVID 
KALKDDCRDMSVKIALRSICRDIASKRGQLVPLRVCPRQLAKKNIYIIGGSHRDTPRTWNSADCIFETVAKFDIFRREWTETAPMEVGRILPGVS 
ALNGKIYVVGGERGSQILAKGEVYDPQNDVWQPIAPMIVPRCEFGLCTMGGNLFAVGGWIGDDIGGSMECYDPEQDLWKLMGSMPQPRFSMGVVS 
FEGLIYIVGGCTTTTRHLPDLISFNPVTKEWNELARMQTARCQMGVAVLDRYLYVVGGSSISQDILSSVERYSFDEDKWTTVCALNVPRAIPAVV 
AADGLLYVAGGDQPCEVWFYRAQVTIWAVECYDPLSDTWKNCPDLPVSRSEAGAVVV* 
(SEQ ID NO: 1041) 

Name: KELCH RELATIVE 
Classification: actm__binding 



Celera Sequence No. : 142000013384252 

TGGATAGTGTAGCGGATGCTACAGAAATGTTCTTGTGAATTCACACACTTGCCATGTGAGTGTGGCTATAGTTAGTTGTTTCAGAATTAAGCAAA 
AATAACAGCAGCAGCAAGAACAGCGAAAAATCATTTTGTTGGCCCACCAACTGCTTGAGTTTTCTATTTTTAACACTAAAATTGTAGGTGGCTGT 
GTCGGAAAGAGACGGATGGCCACGAATACTTTGTTGTTGGCCAACACATCACCGTGTGTGTATGTATGTACGTTGTACATATATATGTACATATG 
TATGCCACATCTGCAAGTATTGAATTTATGCTTTTTTATAGGTAAATTCCTTCCTTTCTCCAAAATTCACACAAATTCGAATATCGTCTCCGGGG 
AAAATCTCCCACGCTCGACAATTTCTATATTGTGATCCAACTTCAATAGATCAGTGCGAATCCCCAATTTTCATCCTGCTATCCACCGGCTTGTC 
CAAATCACAGGGTTAGGGATTGGGTTGTGATTCGTGGTGTGGCGGATGAGAAGTTTTGTGGGGGGGGGGTTATAGAGAAGAAGACGTCGTGCCGA 
CAAAAACATAATCTGGCGCCGAGCCCAGACATTCATCTGCATTGCTTCTGTTGTTGTAATGTTTATTATTATACGTATACGTAATATGTGTGTCT 
GCCTTGGTCGATAGCGAACGCGAAAATGAGAGATCTTGTTTTTTTAAAAAATTACTAAGAATTACTACTCGTACAGTGGTGATTGTATATATTTA 
AGGTTGTGTCCTTTACCTAGGCCATCGACGTTAACTGGTCAGCAGGAGAGCTTAAGTAAAGATTGAATAAAATTGGTTTGGCCCACGCTAATTGG 
TGCATTGTAAACTGGAATCTGAATCCCTAGCGTTCGATTTTAGCAAGCACCAACACACACTGGATTTTTCGGTTATTTGGGTTATGCAGTCAATA 
AGGAAATACATATTTACATATCGAATTGCGGGCATTGCGGAATAATCTTTTCATTTCAACTTTGGTATTACTGGAATCAGGAAAATTATGGTGGA 
ATAAGGGTAAAACATTGTCAATGAATTAATTTGGGGGAAACAAAGTTTCAAAGTAGGTAATTATAAGGAGAGTGCAAGATAGAAGGTTCCCAGTA 
CTGTTCACGGAAAACTTACCCATAACCCTGGGCGGGCATGTTTCATGATCTGGACCTGCAGTCGTTTGATAGAATAGATATTGGATTGATAAATT 
GTCCTTTTGCATTTTGGCTCAAAGAGGGTTCGAAACACCGCCGGAAGAGAAATTATTATTATATATAGATAGCTATATTAAATGTTAAGTATGCC 
GCCACTTAGCTTCGGTTAGTTACAAGTAGTATACGTATTTATAGATAGAGTACAGACATACGCAAAAGATATTGATTTATGGAATAGAATGTACA 
GAATAGGTAGTTTTGAAAGTAGTTCTAGTACCTAGCTGTATGATTTTTAGTCACAAGAGAGCTAAACTGTTTGTGTTCGCTAAAAACTGCAACTG 
CTTTGAAAAAGAATATCATCATTTCAAGTGTCCGACACTAGAGTGTTCGGAGTGCTAAACGTTTGAAGAACTATGTATGTGCTACGGTTAAGTCA 
TTCACACTTACAGAACAGAACGGCTACGAAGTCGGTGTCCTCGATGATCTTCTGCAATATCTTGGCATTGACCTCCTCGATGCGATCGGGCAAGT 
CCATGGCCTCCAAGGAGGTGAGGAAATCGAGCACTCCTTCCTCGTCCATGAGATCCCCATCATATATGATGGGCTCCTTTTCCCTGCGGTTTTCG 
AGATATGGATTCGATTCGATTAGGTCTTGTGTCTTGTTTGTGGTGGGGGAATTACCTACCTGAAGTAGGTGAGGGCGGGGAAGTTCTTGATGCCA 
TACTGTTTGGCTAGTCGTTTGTCGTTGATTTTCACGAAGTCCACACCGAAGGAGTCGGTGTCATCGTCGATTTTCTCGAGTTCCGCTAAAACCTT 
ATCACAGGTCACGCAGCTTCGCGCATCTGAAATCGTAGAAGCCGAGCGTGTGAGTACTAAAGATCGATTATATGTAGATACCGGCAGCGATTGAC 
AGTCGGCAACTGCTGCATGTGACGCCACAAGAGACCATAACTAAATTCCCATTCGAATTGAAAGCCCAAAGCCCAAAGCCCTGGCAGAAGTGGAG 
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CCAAATGCCGCAAAGATTATTCATTCCTATATAGTAGATGCAGGATGCATCATCGGACCGCGCAATCGAAAATAAGCCTGAATGAATGAATGAAA 
TGACAGCAGCGCCAGGCCAATTCTGTGCTATTAATGCGGCAGCAACTGCCAACAAATATGCTGAAATTCTAAATTAATTCATAAAATTCATGCGG 
CCACTATTAGGGAAATGAATTGCAGCCCACAAAAGACCACCGTTGGCCCACGTCTCTACAAACGTTCTGGTTGGTTAAACGGCAGACCAGACTGC 
AAAATCGGAATCATTAATACATCGCCGAGTAAACAGCCATTTCGATTGTTTGGCAATCCGTTCAAAACAATCTTTCACTGCTTGGAAAGTAATCG 
CGAGTTGGCCGAATACCCCCCCCCCCTGCCTACTCCATACTCCTAATGCTGTTATTTACGGGCTCCAACACTAAGGTGTTTCCTGTGTACACCAG 
GTGCTTTGGAAAATCCACTGCAGGATGTTTGATTTAAAATCATCAGAGTGCTATCGCATGCAGTGCAATAACATAAATAGCGTTCACTTGAAATA 
ATAAACACATTTAGCTTATCGATAGAATTCGTTTACATATTGAACTATCTGCGATTTAATTGCCAAAACTATTCAATGCTCTGCCATTTGCGTAG 
ATATTCCATGATTATTTCGATTGTCGGCCAGCGTCATTGGGCCTGAGGCTTTTGGACTTAAAATATTTTCATTGTCAACAATTATTTTTGGAATT 
GC CG C C AATG C T AATG AAAC AC AT C G T G AGGT AT ATGT AC ATAT AC AT AT AAAGG AGAA TAT T AC AAT C C GAAGAT C AC TTTCGTGTT TAT T AAC 
TTTGACATTGGGGAACAGCGCGCAAATGCCCAATCTATATTTAAACTGATTAAAGAGATTTAATAAATAAGCCATTCTTTCACGCGAACGGTAAC 
CATTTTTTTTTTTTTTTTTTTTTTGGTTACCCAAAAACAGGTCACCAATGGGACCCACTGGACAAGGGGATTGAAAATAAATTTAAAAGGGCAGG 
AAATCTGTTGGCAACTGCTGGAAGGAAATTZVAATTCTGCTTCATAGGTATGTAGGTATATTTTGTGCCAAAAGTTTGCGCCAAAATAGACTAAAA 
AACCGAAAATATAAAAAGCTGCAAAGACGATTTCTCACAGCCAAGAGAGATTTCTCTACTCATAAACAGATCGCAAAGATAGCGCAGAAATGAAA 
AAAA T AT AG AAAAGGG AAGGC AGC AC AG ACT ACT GGC AAC T C GGGG AGG AAAT GAAGAAAT AGAAAAAAAAT AAAAAAT T AAC C C G AAT GAAAAT 
CGCGGACATTTCGAGCAACGAATTTTTCACATATTTTCATTTGTATGTGAATTTTTAGAATTTTCACTAGCGAAAATTTATATAATATGGCTTCT 
GTTACTATTTCGAGTCCGGTACGGTCCGGTCTGACGATAAATGAGCGGTGCCCGGACGACCCCGAGAGCACTGGCATAACCACCCTCCCACTTAC 
ACCAGAAAACGGCCACGTAATCCTTGTCGGCCAGGAGCTTCTCCAGCTGCTTGGCATTGACCTCCTCGATGACGGCCTCCGGCTCCGGAGGCGCC 
ACTGGCTGCGAGCCCTTCTTGTTGTTGTTGTTGCCTGCACCACTCACATGTCCGGGAAAACTCAGGGCCAGCAGAGCACACACGAGCAGCGAGAG 
AGTCTTGAGGCGGGTGAAAGTCATCCTGCGATCGGTTACGGTTCTGATGTGCTTTAGGCCCTTAACGCGAACATGCGTGCGCGTGTCTGCTCGTT 
TTGGTGTGTTCTACAGTTTATAGTGTTATAATTGTATAATTGATTCGTTCTCGTCGTTTGGGGTCTTTCGTTATTAAAATTTGTATTCCTTTGAG 
AATCGGTCACTATTTTTAGTTGATAATGGTGTTTCTTCTTTCGCAATTTATTTTATGTATAACAACTTTTTGTTCACTGATACATATATACGGGG 
TTCGTATATGCACAGGCGAGATATATGTGGGGTATATACGAGAATAATTAGATTATATATGTGGTTTCTTGCTTCTATTAATTGAACGTTTTCCG 
CGGACTGTACTCTTTGCTCTTCAGATTTTGTTCCACTCCGTGCAGAGGTCCGCTCGCAACTGAAATTATTTTGGTGGAATCTCGGTGCCATTCAA 
GAGGCCTTGCCCGAGCTTTAGGGATCGGGATCGTGGACTTCGTGCTTTGGCGCAGCTTTTCCGCTCCCCCGAAAAGCTTTCGCGTCGAGCACAGG 
AGCGCCAATCGAGTGAGCAGTGTGGTGAAATCAGGCAAAGAGCAAGAGTTTCCCTTTTGAATTGCAGTTCTCTGTTTATGCATTCGGTGCAAACA 
AGAAAGTGACTACGAATAGTCCTATAAAAACAAACTCGGAGAATCGCTTTCATTATGCGCTACAACAATTCACAGTACTTTGTATTCCCCATTTT 
CCCTTTGGTAAACACTTTTAACAATGTGTTTCGAGCTAACATTTTGTCAACGTTATGCTTAGTCTAAAATAGAAAGGATTAAACTTCCCTTTGCT 
AAGAGACATAACGTGACATTGTTGAATCTTTAAGTTTTATAGTTCCAAAAATCTATGCGTTTATAGAGATAGGAGGACATGAACAGATCGAATCG 
GT T AAT GAT C CT C AAT AAG AAT AAAT GT AAAT T C T AGGG AC GG AAAC GT T TC C TT C C AGC AGT C AAC TAT TAT G 
(SEQ ID NO: 1042) 

Exon: 3919.. 3706 
Exon: 2021.. 1865 
Exon: 1793.. 1627 
Exon: 1195,. 1160 
Exon: 1020.. 1001 

Start ATG: 3919 (Reverse strand: CAT) 
Transcript No. : CT26754 

ATGACTTTCACCCGCCTCAAGACTCTCTCGCTGCTCGTGTGTGCTCTGCTGGCCCTGAGTTTTCCCGGACATGTGAGTGGTGCAGGCAACAACAA 
CAACAAGAAGGGCTCGCAGCCAGTGGCGCCTCCGGAGCCGGAGGCCGTCATCGAGGAGGTCAATGCCAAGCAGCTGGAGAAGCTCCTGGCCGACA 
AGGATTACGTGGCCGTTTTCTGGTATGCGCGAAGCTGCGTGACCTGTGATAAGGTTTTAGCGGAACTCGAGAAAATCGACGATGACACCGACTCC 
TTCGGTGTGGACTTCGTGAAAATCAACGACAAACGACTAGCCAAACAGTATGGCATCAAGAACTTCCCCGCCCTCACCTACTTCAGGGAAAAGGA 
GCCCATCATATATGATGGGGATCTCATGGACGAGGAAGGAGTGCTCGATTTCCTCACCTCCTTGGAGGCCATGGACTTGCCCGATCGCATCGAGG 
AGGTCAATGCCAAGATATTGCAGAAGATCATCGAGGACACCGACTTCGTAGCCGTTCTGTTCTGTCCAGATCATGAAACATGCCCGCCCAGGGTT 
AT GGT AAT ACC AAAGT TGAAAT GA 
{SEQ ID NO: 1043) 

Start ATG: 1 (Reverse strand: CAT) 

MTFTRLKTLSLLVCALLALSFPGHVSGAGNNNNKKGSQPVAPPEPEAVIEEVNAKQLEKLLADKDYVAVFWYARSCVTCDKVLAELEKIDDDTDS 
FGVDFVKINDKRLAKQYGIKNFPALTYFREKEPIIYDGDLMDEEGVLDFLTSLEAMDLPDRIEEVNAKILQKIIEDTDFVAVLFCPDHETCPPRV 
MVIPKLK* 

(SEQ ID NO: 1044) 



Celera Sequence No. : 142000013384828 

CTGGTGGCGCGACTGGCGCACTATCGAGTCGATTACATACAGCCCGGGTACTTTGTACTCCGGCTTGCACTTCAAGATGAACTTCTCCACACTCT 
GGACGACGTGCTTATACAGCTTGATGGCCCGCATCGCCGACTTGGTGATGGCGGCCATCTTCGCTTTGGAAATGGGCGGTCGGCTGTCATAGAGT 
CCGGAGAGCTAAGGAGAAATATAGATGTTACAGTACGGCTATAGTTTACGGATATCTGGGACATGGAGTTATCGTTATCGGGTGCTGGCCAAAAA 
GCAACCCTGAGCGGTGCTGTTCGGTGCTCTGCCTGCGCCTCCAGCAGGATCTGTATACCTCCCGCACATTGGACAATCCGAATACCGACTTCAAG 
CGGACATTGTGCAACCACGCCGGCTGCTCCATAACGAACCTCGTTTCTAATGCATCTATAGCTTACCTCATTGTTGAAGGCAACCACGGTTTCCA 
TGTTCAATTGCTTGGTTCACTCACGGCACGCACACACAATATTTAGTTTTGCATACTTTGGTTGGTC7AACACAATCGTTTAAAACTGGCGAAAAA 
CAATTTTCTCGACCTTTCTGCGGTTTCCACGGCTTTTAAAAAAACTTTTTACAAAGCGTGAAAAAAATGGCTGCCCTAAAGCCTGAAGCGTTGCC 
AGACCTGT AAAGT TGACATTTTCAAGGCATTTCTGGAAATTGTGCTGTTAATGGCAGTGTGACCGTATGT GGT GCATAACGTTTCGGT ATT AAAA 
AAT TT T CCAAATATTT AT AAAAAAT AAAACGAAATAGAATTTTC ATT ATGGAAT CCAAAAGTTT CAT AT AAAAAAAT AATG AC AAC AAAATA7AAA 
AT GACCTT AAAT AAATTT AAAT GAAATCAAATCTTCTTAAATGTACTACCTTTTAAAAATAATCGCTATCGGCGGTGTCGATCTTTTTAGCATTG 
TGTCGATGACAATACATCTGGCAACACTGGTCAATATTAAACATATATTAAACAAATTAAAATGAGCTCAAACCACCGTCCTCTGGACGATTCGG 
ATATCCTCCTAATACA7AACCATCCGCGAGACGCCGTCGCTGTACGATCCCCAGCTGCCCTCCTTTCGGCTGTCGCAGCGCAAAGAGGAGGACTGG 
GCAAAAGTGGCGGATTTGCTAAACATCTCCATCTCAGACGCTCGGCGACGGTGGACGTGTCTGCGGGATCGCTATTCGCGGGAGCTTAAGCAGAA 
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GCGCCTGCATCCGTCCGGCGAGTTTGGTCACZ^ATGACTTCTTTCGGAAGATGGACTTCCTGCGGGACTTTGTGCGCAAGAGACGCGAGCGCAGAG 
GACGCGAACGGGATCGGGAACAAAAGCCAACCGGGTGGATGAAGGTTGATTTACAGCGACGGCGTCGTACCCGCCTGCCTATTGACACAGAGACC 
TTGATCGAGGAGCAAGGTTCGCACGCCTACGATGAGGGCGAGGAGCAACACGACTACGACGCTAAGCTGGAGTCTCATACCACACAATCGGAGAC 
GTATTCGGTGGTCGTGGAGGCGGACGATGGGCAGGAGCCCGAACAGGAGAGCTTCGATGAGTTCCTGGGGGATGCGGAGTGCGAGCAAAAGGTCA 
AGGTGGTCACAATTCATCCGGAAATCGCTGCTCCGAATGCGACGTCCGCACCTGAGCCTATTGAGTCTAACCACGCGGATCTGAACTACTTGGTC 
TGTATGCCGCCCAATGCGAATCAGGAGCGAGAGCATTCCGCCCCAGAATTACCCAATCCCACGGCCGTCATCACCCAAAAGACTTGCGAAACAGA 
GGACGACTTCTTTTGCAAGTCGATTGCCGCGTATTTGCGCCAACTGTCGCGCGTGCACAAGATCAAGGCCAAGGTGGAGATGTACCAGATTCTAG 
AGAAGTACATACTGCTCGAGGAGTGCGGAAAGGGGTCGGGTGCGGGCGGATCAGGCTAAGCAGGTTGCATGCAATATGTACATAGGTTTTAACAT 
AGTTTTTTAATGCAGTACAAATATACAAAGAGACGAAAATCTTTTGAGCTTAATCTAACTTTGTGCTTAGGAAAA7^ATGGAGTCTTAGCAGAAGT 
CCTCGTTGATTATGTTGGCATCTACGGCCAGGCCTCGTTCCTTTAGTGCCTTGGATACTAGGCGGTGTTTCTCCTGCCGCTGCTCCTGTTCCCTA 
CGCTGGGCGTCCAGGATTTCATCGACGGATATCTCTTTGAGTGGCAGTTTCTGCTCCTTTTCACGGTCCTTTTACGCATATTTAAAGGGTGTTAA 
TCAGTCATATTGAAAAAGTATCGAGTAAAGCTTACTATTTCGCCCAGTAGCACCACATTCTCCCCGCGAATGATGAAGACTCCGCGAGGAATGTC 
GCCGTATTCGTTGCCCACATGTATCCGCTCGATGGTGCGTTGCAGCACCAGGTTGGCGAACTGGTCCACGGACCGCAGGTATCCAATCAGAGTCC 
GTCCGTCTCTCAAAAGTACCATCAGTTTCTCTGCAATGCGCCGGTTTTCAACGACTGGCCTTCCTTTCATTCGGATTTACTCACTGTCAACCTCT 
TCCAGGAGGTGAGCCGTGCCCGCCAGCGGATTTAAGTCGTCCATGCTATGAAAATATGTGGGATTTAGATAAATAAATAAACTGAAGCAACTAAA 
CTTTGGTCTAAATTTTAGTATTTAATATTAATGTAACCCTCGCTGAAGGGTTTAAAAAACCACAATGCTCTAGAAAAAAATACTAAAAATATTAC 
TACACTGTTCAACGAAACTTATAAGATATTGCTTACTGAAATTGATGTTTTTAGTTTTAAAATGCATAGATATTAAATAGTTATTATTAATTTTC 
AAACAGTTAAAAATAAGATTTCTGCTCTAACATCA^TATTTTTACGCTGTTCACAGAGTTACAAAATAAAAAGAAGTTGACCAAAAAAATATCA 
AAACT T GT ACT C AA 
(SEQ ID NO: 1045) 

Exon: 1001.. 1959 
Start ATG: 1012 

Transcript No. : CT26756 

AACAAATTAAAATGAGCTCAAACCACCGTCCTCTGGACGATTCGGATATCCTCCTAATACAAACCATCCGCGAGACGCCGTCGCTGTACGATCCC 
CAGCTGCCCTCCTTTCGGCTGTCGCAGCGCAAAGAGGAGGACTGGGCAAAAGTGGCGGATTTGCTAAACATCTCCATCTCAGACGCTCGGCGACG 
GTGGACGTGTCTGCGGGATCGCTATTCGCGGGAGCTTAAGCAGAAGCGCCTGCATCCGTCCGGCGAGTTTGGTCACAATGACTTCTTTCGGAAGA 
TGGACTTCCTGCGGGACTTTGTGCGCAAGAGACGCGAGCGCAGAGGACGCGAACGGGATCGGGAACAAAAGCCAACCGGGTGGATGAAGGTTGAT 
TTACAGCGACGGCGTCGTACCCGCCTGCCTATTGACACAGAGACCTTGATCGAGGAGCAAGGTTCGCACGCCTACGATGAGGGCGAGGAGCAACA 
CGACTACGACGCTAAGCTGGAGTCTCATACCACACAATCGGAGACGTATTCGGTGGTCGTGGAGGCGGACGATGGGCAGGAGCCCGAACAGGAGA 
GCTTCGATGAGTTCCTGGGGGATGCGGAGTGCGAGCAAAAGGTCAAGGTGGTCACAATTCATCCGGAAATCGCTGCTCCGAATGCGACGTCCGCA 
CCTGAGCCTATTGAGTCTAACCACGCGGATCTGAACTACTTGGTCTGTATGCCGCCCAATGCGAATCAGGAGCGAGAGCATTCCGCCCCAGAATT 
ACCCAATCCCACGGCCGTCATCACCCAAAAGACTTGCGAAACAGAGGACGACTTCTTTTGCAAGTCGATTGCCGCGTATTTGCGCCAACTGTCGC 
GCGTGCACAAGATCAAGGCCAAGGTGGAGATGTACCAGATTCTAGAGAAGTACATACTGCTCGAGGAGTGCGGAAAGGGGTCGGGTGCGGGCGGA 
TCAGGCTAA 
( SEQ ID NO: 1046) 

Start ATG: 12 

MSSNHRPLDDSDILLIQTIRETPSLYDPQLPSFRLSQRKEEDWAKVADLLNISISDARRRWTCLRDRYSRELKQKRLHPSGEFGHNDFFRKMDFL 

RDFVRKRRERRGRERDREQKPTGWMKVDLQRRRRTRLPIDTETLIEEQGSHAYDEGEEQHDYDAKLESHTTQSETYSVVVEADDGQEPEQESFDE 

FLGDAECEQKVKVVTIHPEIAAPNATSAPEPIESNHADLNYLVCMPPNANQEREHSAPELPNPTAVITQKTCETEDDFFCKSIAAYLRQLSRVHK 

IKAKVEMYQILEKYILLEECGKGSGAGGSG* 

(SEQ ID NO: 1047) 



Celera Sequence No. : 142000013384828 

AAGGAAAAATATTAGAGATATACATTATTTGAATATATCGTTATTTATTTATTTTAGTTTAACTTAAAAATTGTCCTAAATCAATTTAAAGTGCA 
TCAGCCTGAAAGTAGGCAAGCTATTTTGCAACCGACTTGGTGGTGAAGCCTTTTCATCACGGCCAACAAGTATGCAATGGTTTTACAGGCTTAGA 
GCATTCAGTTCCATGTTCAATCGACAATTCATTTTAATTGAATTACCAAATTGCTAATGAAAAAATCGTATTAATTCGATTTTAATTTATATATT 
AATTTCGATTTTATTAGGCTTTTTCTTTACAATTTGTTCGGCCTGCGTTCTCTTAAAATGCTTTTGTATAGTCTCGTCGCTCTTCGTCTCTTTCA 
CAACTACTACTCAATGCTCGCTTTTGATGACTTTCTTTCGCACCTTTTTTATCGCGCTCTCTCTGCGCCTCTCTCTTTTACGCTTATAGCCGGCA 
AGTAAAGAAGAAGCATAAATAGAAATGAAATATAAGTAAGCGGAATGCAAAATACTTTTTCTTTGGCGCCTTCCTCCTCACACACACACATTCGC 
ATTCGTATGTGAAACCAGAAAGAACAATATAGTATTCATACTGCACCTGTATATCGACTATAAAGTACGCTGTACGTTGTATGTTGTATTTTACC 
ATATGTATAGCTTATGTGTTTCCTTTTTTAATGCGGTAGCCTAGGATGTTGAAAATTATTTTAATTACATTTTTAAATCCTTTTAAAAAGACCCT 
TGGCTCGGGACTTTAAATTCACTATTTATTTATTTTTATTTTTTGTCACTTATACATTTACCCAACTTGCCTTCGTGACTATATACCCTGTGTAG 
TTCACATATAGAGGGTATAATAAATCTCCCACCGGCTTTTCTTGTGGTGGCCAACGACGTCGACTGCGCTGCCGGTGGCTCTGCATTGGCATTTA 
GGCATTTATGCCCACGGCAGTCGAACAGCCGCTTCAGATTCTCGTCATCGCTCGTGGAAGGCACATCGCAAACGCCGCTCTCGTTCTCGAATTTA 
TTGTGAAAGCTCAGCCGCCCACCGCCCGCCGGAAAAGTCTCGCGAGTGTCCCGAAAAAATCGAGTTAACCCCCCAAAAAAATCTATCATATAAAA 
T CC TAAC AAAATG AGC AGT AAGTTGGGTCAC AGCC AAATCC AGAGCAAGC AAAAAAGCAAGAAAAT AAAAGAAAAAAAGAGTGT GT GT GTGT ATG 
CATGGCTAAAGCTGGAAACTGAGAACTGAAAAGTCGATAGAAAACAGCCTGCCCTAAGTTAAAGCTCCTTTGGCGTTTCCTGCTAATCAATCAAT 
CAAGCAACAAATCGCCTATCCTTCCATTTCGTAACACTCGCTTCAGGTTGGAGTTATGCAAAATCCGAATTCGATTTGAATGCAAAATCAGAAGC 
TGAATCGAAATCATAGCGATAATCCCACTACTATAATATATATAATTATAAATCGAGCATAGAGCGCGCGCGATGTTGGTGTGGCTACGACCTTT 
ACTGACCTTCACTAGCGTACGACTCTGCCCACGGCCCCCCGCCTCATCACCATCACCGTCACCACCCCCTGTTTCGTCGCTCTCTCCCTTCGCTC 
TTAGTTTCTCTCTATTTTGAAACACTAACGTGCAGGAAAACATTTCAAAAATGCGAACATTGCACTGGAAACCGAACCGCTGCCAGTCAGTCGTC 
TGGCATATTTGTTTGTTTATGAAAAGTTTGTGCACAAAATCGCTGATAACGTGAAGCTCACATAACCTTCAAGAAGAGAGAGAGAGAGAGAGATA 
GATGGGGAGAGAGAGCGAGCTTACAGTTATGTTAATTCCGTTTTGTTGTGATTCCATTACATCGATTTCTGCGCTTTGTACCTGTCGCAATGCTG 
GGTGGTGACCCAGTAATCCCACGTAGCGCCATTAGCAGCCGCCTTATCATCCATCCGCCTTATCAGCAATGCGATTATTTCCGCATCTGCAGCCC 
ATTGAGATTAGCGGCCCATTAAGGGATGCACTTCGCCTTGGCATTTTTCACACGCATTCGCTGACGTCCGGCGATCTTGTTTGAAGCAATACAAT 
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ATTCACAATCTATATGTGGTACACCCCATGGATGTGTGATTCTATTTTTAAGTTCGATATGTATGGGTCGGAATTATGGCTGGCGACAGGCGCTA 
ACAAGTTTTGTCTGTTTCGAAAATAATATTATATTTGCCTTTCAAAAAGTTTTCCACGCCCGAGAACATTCTTTTGTGTCGCATCGCCATCGCCC 
ACTAGTAAAGTGTGTGATCTCTCACATTTTTCTCTTTCCTTGGAAATGCCATTAAATCTGCACGGATTTATGCCCGCATTAATCATCAACCTACG 
GCCGGCCACAGCAGCTGCCTTCAATTTTCTAGCCCGCAGGGATCTCCAGCATCTCGCGTTTCGGTCATCTACAGACACATCCACATCCACAGCCA 
CATCCATGAATCCATGGTGATTCAATTTTCTAAAATTGCTGCGTCGCCTGTACCGAAGCGATCCAATGCATCGCCTACCAACCAGCAGATACAAT 
CTCATTTCATTTCACTTCGCATCGCCGGCGCTATTTTTAGTGGCCTGCTCTTTCGGATACAGCATCGGCATCCACAAGATACACACGCCGCCTGC 
TGAGGTTTCCCGCAGAACTCGTTCCGCGCCTTGCACTCGCCAATTTGTGCGTTGGTGTCGCCGGCGATCCTCGACACGACTTCCTCCCAAGACCA 
CTCCCTTCATTGCATACCATTATGCACTTCCAGCGTACAGGCCAACATGGATAACTCAAACTCTATGCTAGACATAACCATATATGTAGAAGGTG 
ATGTTTGGTATATTATAAGCTACTTAAACACACAGGATAACCTACTAACTATACAGAATTTGCAATAGAGTTATTGTGCCGAGCTAGCCAGGCTT 
GACTGTACTTCATTTCTTGTGCGCCGGCTGCAAATTATGCGATGCTTTAATGGGGACCCTGACAAACGGGTTTCAGCGAGGGAAGGGATGTCCCC 
TTCAAGGTCTTGGCTGCCCATTCATCTAATTGGTGACCCCATTCATGGTGGTATCTGGCGTTAATCTGATCTGGGCCTCATCACGCCTGAACGGG 
AATTTGTTCAGTCACTTGACCCAGACCCAAAAGCTCAACCCGCAGCCCACGACCCCCGACCTTTTTTCCCAATTGCTTTGGCAGGGCGCTTTATC 
GGCTTTATTGTGCTACGGCCATACGCAATTGATAAAAGCGGGTAGTAGAAAACAGAGCGCAAACTCTGCTCAGCGAAGAGAGCGCTTAAGAGAGA 
GTTAAAGGGAGAGCATATCTCCGAAGAGAGCTCTCTCTCTCCCTACACTGTCAAAAAGTTAAGGAATTTTCCCACAGTTAGCTGTCACATAAATT 
GCTAATTTACCTATTTTTAAGTTTACGTGCTAAGCATATGTTTAAGTAAAATGTCGCGACGCATTCATTTTGACAATTTCAACAGTTGTTGAAAT 
TTTTGTATATTTAGACCGAGTGTTGTTTATTTAATTTTGTTTAAATCGCCAAATTTATGACGATTGCCGAAAATTCCTTTTCACCGTTCTGCCAC 
ATAAGTTTTATTGTTTTGTGAAATTTTTTGTCATTTTTCGCACACACATCCGGTAAGTCAGAAGTTATTAAGATTCATTCTAGTTTTGCCGAAAG 
TAAATGATTTCATTTTTATTTTAGGCCCAACGAATCATTTCACAAGCACACACAAACGCAAATAGAAAAGAAAGCATCTGTGTATCTCTCACTCG 
CACACACACCAATGCAAATACAAGACCCAGTCACACTCACCCACACACACACACACACACACATTAAAGAATCCGGCGGAGGATATAGATCGTCC 
TGATCTCAGTCCAAATCAAGATCAAGATCAAGATTAGGATCAAGTTCAGCCTAGTGAATGCATTTGAATGCCGCTGATTAGCTCGCTGTGCAGCA 
GTTACCATCAGAATGCGCATCCGCATCCGCAAATGTGTCCACATCCGTATCCGCATCCGTATCTGACTGCGAGAAGGTAGTTGTTGTTATCCACC 
TACTTGTTGTATCCTACGCTTCACAGTTCCTTTTCAGTTCGCTTCTCTAAAACCAAACAACCAACAAAACCAAGCCCATGTAAACAGAACTCTAA 
TCTGACACTTAACCCGACTAAAGTTAGCACAACGAAATGTATCAAGAATATAACATATATATACACACTTGTATATATATATATTCATCTATCTA 
TCGATTTGTAGTATTAATACCCGAATAATAATGTTCTAAATTCCATTTACTTGTCAATAATCAAAATTGGAAATAAAAAATAATTCGTACATCTT 
CTCTCTCTCTTTAACTCTAATGAATTTTTAATTATCTAAACAGCAAATCTGCATGGTTTAATAATTTATAGAAACGAGAATTTCTTCGAAATTCA 
CTTGTAAATTGTGTACCATACGGATTCTAGAACTAATTCCCCATCGAGTGAGAAATCTGCACCGAAACAAAAAAAAATCAAACTCATAATCGCCT 
CATAAACGTGTATATAGATACATTCATATGTATATCCTAGCCAGCGCCATGAATAATTATCTACAGCTTTTTTGCCCAACTAAATGCCAACTACG 
CAAAATTCTAATTGCCCGTTCGGACGCGCCTGAAATATGCTCAGCTCGATTTTGCCAATTCATTAATGGATCTTTATTATTTTTTTTGTTTTTTT 
TTGTTTTTTGCAATTTTATCCTCGCTAACAAAGCAAATGAATTTTGGTTGCATGTGGGCGATGCTGACGCAGTTGGAGATCTGCACACACCACTT 
GAACATATATATCTATATATCTGTTCTATTTCGAATGGGGAACCCGAACAATCGCTTGTTTATTTATAATCCGTAACGTTTTTTGCTGAATCCGA 
AACTAATAAATACAGCCAACTGCTGCAGTGAAACGTGCAGTGCGCAACAGTATCTTTTTGTGTTGTAACCCAATTTGTATCTCTGACTTTAGCTA 
T AT AT C T C AT CT C T CT T GGCT ACT CC ATCGAT AGCGTT T AAT T TGC AAT TG TATC TCCATCTCTCGTTTC GAAT AG AAT T CGCTT GGGT G ACGC T 
GACGACAAATGACACGTACTCACTGGGTGCCTTGGTCTTGGCCCACTCACTGAAAAGGGCCAAGACCGCCCACCAGCTGGCCGTTCTGGTCACGC 
CCAATGTCTCGCAGGCGATGCGCGACCGACTGAAGGAGGTCTACAATGTGGTGCAGGAGGTCAATGTGCTCGACTCACAGGATGCCGCCAATCTG 
GCGCTCCTCTCGCGACCCGAACTAGGCGTGACCTTCACAAAACTCCACTGCTGGCGCCTCGTTCAGTTTGAGAAGTGCGTCTTCCTGGATGCGGA 
CACACTGGTAAGATATGAAGTATTCCAAGGGCACAACATGTAACTTGTGAGCTAACAAAGACTTAAATAACTACTTTCAATTTGATTTCGTAGGT 
CTTGCAAAACTGCGATGAGCTCTTCGAGCGCGAGGAACTGTCGGCTGCTCCGGATGTCAGCTGGCCGGATTGCTTCAATTCCGGCGTGTTTGTGT 
TCAAACCTAGCGTGGACACCTTTGCCCAGATCACAGAGTTTGCCGTGAAGAATGGCAGCTTCGATGGCGGTGACCAGGGTCTGCTGAATCAGTTC 
TTTGCCGACTGGTCGACGGCGGACATTAAGAAGCATTTGCCATTTGTGTACAATGTGACGGCTTATGCCTCGTACTGTTATCTGCCCGCCTTCAA 
ACAGTAAGTTCGCTATCTCTGTGGAATGGAAAAGCAGTTTGTAATGGCATTTGTTTGCTTGCAGGTTCAGAGATAAGATTAAGATTTTGCATTTT 
GCCGGCAAGCTGAAGCCCTGGCTGATTCAGTTCAACTCGGAAACAAAGGTGGCCAGCGTTTCGTCGGAGTATGCTCATGCCCAGGATTTGATTCA 
GCTGTGGTGGAACATCTTTTGCGAAAATGTCATCCAGTCCCTGTCCACCGAAATGGTAAGTGCTATTTATACAATGGATAACGCACATTTTGGAA 
ATTTCCAGCGTTACTTGGTGCCCCCAAATGATCTTATGTCTTCCCTTGTTTTTGAACGCCAGACTGAAGCTAACCCTTTGATTTTTACTGTCTAA 
CCCAATGTATTTGCTCCAAATCTCAAAAACAAAAAACCCTGCTTCGTTTTCTAACACGCTAATCGCAAGCAAACGCCAGGAAATGTTGCTAGTGA 
TCGACCCGCGGGAGAAATGGCCCAAGGGTCCACTACTTTAGAGGTAAGCAGATCTTCAATTGCCGAATCGCACATATCGCACCAGATCTGTAGAT 
AC C T T G AGC AC AGC C C T G T GT AAAT AT C TT AATT AT AACT T AAAT ATTT ATT T AT T T AT T T AGC T AAC G AT T C AG T C A T CTTTCTGT GC GCC AAG 
TAAATAACCCAAGCAAACAAACTCAATTTCGATCTATTTGCCATTTGTTTTTTGCCGATAGCTGGCCTCGATATCGTTTCATTTCCCGCCTCTTT 
TCTATCAACTCGGTTTGAATATACTTTGTTTTCGTTTTATTTATTTTTATTTTTTTGTATTGGTTTGCAATTTTCTGTTGATTTTTTCTCGATTA 
GCAGCCACGCCCCAGAGTAACCCATTTTCCTTTTTTTTGTATATTTTCTTTCACTTTTTTGGCACCATAGGCAGACCAATTCTCAATTTATTTCT 
TCTGTATTTTTTCTGTAATATTTGTGCAACTGTGTGTGTCTCTCGTGTGTGTGTGTGTGTGTGAGTGCTCGTCTTAGTGTTCACTTTCTTTCTGT 
TTTCTTTTCCTTTC T AAT T T GC AT CG AT TT GGC AAAGAGC CAGG AAAAT G T T T C GGT T T AC GC C AT C GAT AG AT GG AAAT GAT T C AAATC AAG AT 
GGTCGGTTGCGGGTGATGCCAAAAAAAAAACTCAACTGAATCATTAGATATATCCCATAACACAAACAAACACACACACACACACACTAGCACGC 
ATTCGTGGGGATCTCCTCTCTCTTGATACACACAAAGAAATTTTGATATAGAATGGAAAAATAAATTTTATCTTATTCCAAATTCATTTAATTTA 
AGAGAAATAAATAATTTTTTTTATATTATTAATTTATCAAATTTTGCTGTGTGTACTTTTCTTTTTCTCCCTTTGTTATGTTCGTGCCGTTGGGT 
TCTTTATTTTTTGCAAATTGATTTACTTTTCATTTACTCTCTATTTTTTGAACTCTGCTCTCCTGTGTTGCGCATTCTGTAAATTTATACTTTGC 
CATAAATAATACGTGGACTGCCGTATCGCATATAACTCTGTATATCTAACATTGTCTCAGTGCCTTTGCCTAAATATAATACGCACGCAAAGCGA 
ATTCGCGGCGGATGAGCTGCATCAGCGGCAGCAGCATGAGTTCTACCAGCAGCAGAAGCTGCACCACCACCTGTTGCATGTGCGCCAGTTCCGCG 
ATC C T T GGGC AG ATT ACT ATG AG AATC TGG AG AAGG AG CC GCAGGAGCAGGA GC AGG A GC AA C AGC GC C AC C AGC AC C AG C AG C AG C AC C AC AAT 
CCCAGCCAGAATCACAGCCAGGACAATGGGAATGCGGATGCAGCTGCGCATGAGAATGATAGCACGATTGCAAATGCAAATGCCACTGACTGGCA 
TCATGACGATGCATGCCACCCACCTGCGCCGACGAGGCCTCTGCCCCAAATTCAAAACCAAGCCCACAATGCCAATGCCACTGCCAATGCCGAAG 
ACTTTTCATTCGTAGATGTGAATAGCAATGAGCTAAACATTTCACAGCCACCTGAACACGCCTCCTACTCCTCCGACACAGCCACGCCCACTGCC 
ACAGCCACGCCCACTGCCACGCCCACGCTAACGCCCACGCCTCATCATTCCGTGCATAGTCAGACAGTAGAAAAAGCGACAAGCCACCAACAAGA 
ACATGAAGTTGCAGCCACAACGCCGCCTGAGGTAAAGGCAATCAATCCAAACCCGGCCTGAGAACTACACAGAAAAAAAAAATTCCCTAATGGTT 
GTATGGGTTTTTCAAATCCAACATAGCCCTAAGCAAATGCTAGAACTTATTATTTATGATTTTCCTGGCACCTCATGCAAAATGGACTTCATGCT 
TATATATGGAAATATCAGAATATTTAGTTACAAATTGGAATTTTTTCTGTGTAGGAATCCCGATTCCCAACCAAACAAACCAAATGCATTGGATA 
ACGGTGTGCAAGTGTAAAAGTGCTAGTCCAAGTGATAGTGATAATGTAATGTCCTGGTTTTCGTCTAGTCACTGTTTTGCATACGAATTGTCATT 
GTCGTCTTCATTGTCCTTCGGTCATCGTTCATTGTTCATGGTCCTTGGTCCATGGTTCTTGGTTCATGGTTCAGCGTTGTGTTTGTGGCCTGGAT 
AAGGGGCATTTGTTGATTTATTATTAGTTGGCCTGGCACCTGGTCAAGCATTAATGTAGATGGAATATGGGGGGCCTCGCTACCCTACAACATAA 
TCAACATCATCATCATTTTGAAACCCAAACGCAGCAAAAGCCGGCACTCCATCCTCGTGTTGCATGCGCATTTTATAGCCACCTGCATCTCCTGC 
ATTCGGCCCACTTTCACATAGTAATCTGCCCGAGAGTTGCTGGGACACGCATTGCAAACGCAAGCAGCCGGAGCTCAACGCGTCGTATACTTGAT 
TTATGCCTCGAGTAGCACTTCTGTTGTCACATTAATCATACGCCCTGTGCACCGTAACTAACATCTCAATGTTGTCAACCGCTCGTCTGCGTACT 
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AGTGCTAGTAAATGTTTAACCCGTAGTGCATGAAATGTCGTAATATACCACCCGACTGCATATGTATATTCTCCTCCCTTGCTCCATCATAATGC 
AATCAAGCTCTTAATCAATTTAATTAACATACTTTCAGGCGGGTCTGGCTGGTGCTCTGTCACAGCTACGCATCGGTCAGCAGCGCACTCCCGAA 
CAGGAGGCCTACGAGAACCAGATGCGCCGCCAGTGCTGGGAGTCTGGACAGATTGACTACACCGGCGCCGATTCGTTCGAGAATATCTGGAAGAA 
AATATCGCAGACGCTGGAGAGGAAGACCGATAAGGAGTCGGAAGAAACTGGTAGCTCATAGATACAAAACCGAAACTAAACAGAACAAACAAATG 
CAAATCGTTCGTTATTCGTCGTTTTGTTGCCATTTTTTTCTTGTACACGCGCACACACGTACACACTTAACACCCACCCCTTCTCCACCCTAAAA 
TGAAAACCCCAAACACGCACACAATCGTATTTTATCCCCCCTCCCTCCCCACCTTTGGGAGGCAGGCGAACTGTTGCCAATGTCAAGGAGCCGCC 
GGTCTAATTGAAATTTAAGGTTTGGCCAAAAAATGTTGTCGAAACGAATTCTTCAGAGTGTCATAAATGCAATTTTTATTTGCCTTCCCTGTTAC 
AGCCACCCACCCCCCCGTCCACACACGCACACACACACACACACACGCGCGCATCTTGTTAAAAAGAATGAGCGAGAAAGCAGAGAAATTTTAAT 
TAAGTTTTCAAATTAAACTTTTTGCACTGCTGCTACTGCTACTGCTGCTGCTCTCGACATACGTGATGTGATGCGATGTGTATGTGGGCGACACA 
CTTGGCACAGTTGCCCCAATGCCCTGCCCCTCCCCCTGCCCCTGCCAAGCCCTTGGACAAAATCGAGAAAAAATGAGGCCCGGCCACAAAGTTTT 
ACACTTCATTGTGTTTTAATTGAAATTTTTGTACTTTCACCCAGTCGTGTGCTTACCGATTTATGGCGATGACTTTCCAGCTCATCTGCATCAAA 
ACTCATTGTGGCATAAAGGCCATTTCCTGTCCTTGCCCCACCTATCCCACCCCTGACTACCCTTTTTCCCTCTGCCGCCCCCAAACACATATGTA 
CATATGTTTTTCCTATGAATAAAACGCAAACAATGAGTGTATACAATAACTCTGGAAAAGATAAACGAAA^ 

AAAATGAAATTGTAGCCCACATGTTGTGGCTATTGTGTATATGTATCTATGTGTGTGTGTGTTTATGTGTGAGTTAGAGTGTGTGTGCGAAGTGC 
CCCGCCCCCTTTTCGACCACTTTTCGGATCGAATTCGAGCTGAGCGTTAAATGTTTGTTTCAAGTGTTTTGGCAGCCTAAAATTTCTATCTATAT 
ACGTATATATACCATTTATGTTATAATTTACATTTTGCAAATATTGTCGTAATATTTATTCGAGAACAACAACAACAAGAACAACAAAGAAATCC 
CAAACAACACAAACAAAAAATTTTCAATTTATGAAACTATGTTTTCGAATTTGCATTTTTATTTTAGCAATTAAGTTAAGTTACGTTCTTCTTTC 
CCATGTTGTCACCTTTTCAGTTGTGTACTTTTAAGCCATCTTAGCCATGGCAAGCATTGTGTCTAAGTCTTGGGTCTTAGAACCATCACATAGCC 
ATAAGAGCCAGCACCATTCACCCATCGCATCTTAGCCATCAGCCATCTTCAGCCATCACCAGCAATACAGCAACGCAGACGATGTAGCTGAATCG 
AATTAAAACTACTGTCTCGTTACCCCTAACCACATTTACAGATAATGATAACGATAGTGATCCCGCTAACCCACTAAGCTCAGCTCGATTGCTAA 
ACGAAGCCGCCAGCATACTGCTGCATCCGCCCGAGAAATCACCAACTTCAAAGGCGATGAAAGCGAATGGGGCGACTGGGAATGGCTCCCATCTC 
GACCAGTCGCATCCCACCAACTCCCCAGCAACTAACAAACAGCCTGCGAATGCTTCAGCTTCCCCCAATGCCTCCTTGAAACAAAATCCACAGAA 
GCGACAGGAGAAACCGCAGCCCCAAGCGGAACCGGAACTGGAACCGAGACCACAACCACAACCGCAGCCAACAGAGCAGCACATCAGGATGCAGC 
CAGGCGGCGGAAATGGCAGCGTCTCCAAGAGCGGCAAACAGAGGTCCGCACGCAAGTTCAAATAGCAATTACTGAAGCTGCTGATGCACTACAGC 
TGCCTGCCACACCTGGCGTGCCGCTGCAAGTCGCAGTATCGGCGGCTGGAACCCAATCTGGATACCGAACCCCAACCCGTCCACGTCTAAACCCG 
ACGATTGATAGACAGATCCGAGGAAGTACAAAGCGTATGCCACGATAAATGCTGTACAAATTGCTTGCTAAGTGGCTCTTAGACGAAAGTAAAGT 
AAT C AAATCG AAACG AAC T AAAT T TAT AT AT T T AT AT T AAATT AT T AAT GC AAC AAT T AT C AG AGC ACAC AGG ATT T C AAAAGT AT AT AAAT AGG 
TTTTCATTAACTTTATATCGCTTGAGTCTTTGAAAGTAAATGTTTAAAAGAGAAAACTCTTTTAAAATGCATAGTAGGTTATATTTCTTGAGGCC 
ATATAGCTCCACGACTTTTTTGATATCGTCTAAGGGTCCCTTGGTTTACCCAAATACCACGCTGCCTTACAAAGGAAACACATGCAATATCTTAA 
ACT T AGC AT AT AGTGT AAG AC AAG AGCT AAT ACT AG ATG T AAGC CG C AAGT GC TT T GGAT AAC C CT AG AT GC AAAC AC AC ACT C AG GAGCT T GGG 
ATGATATCGCCTTATATGCCATATATACTATACACACATGCATCCACACACCACAGAGCTACTGACCTCCGCGGACATCGAAGACGGTAAGGTGG 
AGT T C G AC AAGGC C AC GGG T GC C AC CACC AC GT ACAAGGTG AT C GAGGGT G GC T AC G AGGTG AT C ACC AC C T CT C C GC AGC CC AAT GGAAC CATC 
AAGACCCAGGTGCGAACCTTCTGGGATCCCAAGCCCGTTAGCGAGGAGGATCTGAAGAAGGAGCAGGAGAACAAGAAGCTGGTCCAGAAGCGTCT 
GGGCAAGGAGATTCGCATCGATGAGCGCACCACCATGCTGACCCGCGCCATCGAGGGTGGCTACGAGGAGGTGTACACCACCATCAACGAGGATG 
GAACTCGCAGCTCTCGCACCAAAACCTTCTACGACTCCGTTCCAACCGAGGTGGATGAGAAGACGGCCGCCAAGTTGAACAAGAACAAGAAGAAC 
GTGACCAGGAAGTCGTCGGTGGACTCTACCGATTCCACGGAATCCTCCCGTCGCATCGTCCAGAAGCAGCAGAAGAATCTGGTGCAGAACTATGA 
TGAGCAGACCAACTCCACCACGACATCAGAGCGCAGGGTGTCGGTGACTCAGAACATCGAGATCACCGAGAACAATCGCACCGTGCGCAAGAACT 
CGCTGCAGGAGCAGAACGTCAAGGCTATCACTACCACCTCGAACACCTCGTCCAGCGACAAGAAGCAGAAGAAGAAGCCCAAGTCATCGGCTCCA 
CCACCACCCGCTGATTTCTTCCAGAACGATACCACCTCTGTGGCCTCGAAGCGTGTGCCCGGCGGTGTGGAGTACACCTATTCCACGCAGCTGGA 
TTCTGGCAAGACCATCACCACCTCGAAGACTGTCTACGAGGAGGAGGAGGTGGAACTCACCGAGGAGGAGATCCGCCAGTACAAGAAGACCCTGA 
AGGAC GCC GAG AAGC AC AAG AACGT CT C C AC C AC C AAGAAGC T C AAGT C CG AGT C GGGT AC C AAG AAG AT CGTTCCTTCC G AAAAT CC C GG AG AT 
GTGACCACCGTGGAGACCATTAAGATCGAAGGTGGCACCGAGTACCACTACACCACCGTGACATCCGAAGGCATCGTGAAGAAGGCCGTGAAGAC 
CGTTTACGATCCAGTGCCCAGCCAGAAGTCCAATCCCGATGACACCGACGAGGAGGAGATTATCGAGGAGTACGAGGAGGAGATCATCGAGCCCG 
GCGAGAAGAACGTCi^AGACTATCGAGACCGTCAAGCAGCTGCCCAGCAAGTTCGAGGGTGAGGATAAGCCAGTCCACTTCATCCGCAGCTAGGTT 
ATATTAACCCATTTTCTTTTTTTTCGCAGAGGAAGTGACCATCACTCATGAGAGGAAGGAGAAGAAGGTAAAGAAGTCCATGGTCATTAGCCAGT 
AGAGATCGCCGCTTGCCCACACCGAAACACACTCAAACACACACGACACTCACACAACACACAAGGCACAAGAATACACACGAAAAAAAAAACAA 
GCTTATGAAATGGCCGGAAAATAGAGAAGAAAGGCCCTGCGAAGATCCCGATCCCAAGCTAATTCTCCTGACCTATTTCAGCATCCAGTTCAGTT 
AGTTGCTGGCGGTCTTTGCACGCTACTCGTTAGCTTTGTTATCAATTACCAGCTGTCTAGTGCCCTAGAAATGTATTACGAAATTATCCAAAATT 
TATTACAAGAAACTATCTAACATAATTGTAGCCTCTTTTGTTAACAACAAAATAAAACTCAAGTTTTGGTTCTAATTCTTCTTGGTCTTCTTCTT 
CGTCGGAGTGCTCTTCGGCTTGGAGCTCCTGCGGCAGATCAGGATTAGAACTACGATTGGCAACAGGATCACAGCGAGACCTAGACCCACAATGT 
CCAGTAGGTAGAACTGGTACCAGGGTAGATGTACACCAGCGGCCACCAGATGAGGAGCACCTCGGTGCTCGATCACATAGTTGATCCAATATATG 
GCCGTATCCATGGCACCGAGTGGTCGATCGCGGAATATCCGGGAGGCCTTCTTAATGTTATTCCTATACTTTGGATTCTCAATCAGTTCCATCAG 
CAGACCTCGCAACTCCTCCACCGTGACTTTCCGGTAATCCAGTCCCAGTGCATATTCTGCACTCTTACCCTGGTTAATGTTCTGATGCTGATCAC 
AGTAGACTGGCATTCCCAATATGGGAACACCATTGTACACCGCCTCCTGGGTGCCGAAGAGTCCTCCATGGGCAATGAAAACCTTGACATTCGGG 
TGCGCCAGAATGTCGCCCTGAGGTAACCAACTTTGCACCTTAACGTTAGCCGGCAAATTGGGCAGAGATTCATCTTCGAATTTCCAGAGAACGCG 
TTGCTTCAGACTGCCAAAGACTTCCAGGAATACTTTGAGCTTCTCAGGCGGC 
(SEQ ID NO: 1048) 

Exon: 1001.. 1157 
Exon: 11742 .. 12987 
Exon: 13045. .13112 
Start ATG: 1151 

Transcript No. : CT26824 

CTCGTGGAAGGCACATCGCAAACGCCGCTCTCGTTCTCGAATTTATTGTGAAAGCTCAGCCGCCCACCGCCCGCCGGAAAAGTCTCGCGAGTGTC 
CCGAAAAAATCGAGTTAACCCCCCAAAAAAATCTATCATATAAAATCCTAACAAAATGAGCAAGCTACTGACCTCCGCGGACATCGAAGACGGTA 
AGGTGGAGTTCGACAAGGCCACGGGTGCCACCACCACGTACAAGGTGATCGAGGGTGGCTACGAGGTGATCACCACCTCTCCGCAGCCCAATGGA 
ACCATCAAGACCCAGGTGCGAACCTTCTGGGATCCCAAGCCCGTTAGCGAGGAGGATCTGAAGAAGGAGCAGGAGAACAAGAAGCTGGTCCAGAA 
GCGTCTGGGCAAGGAGATTCGCATCGATGAGCGCACCACCATGCTGACCCGCGCCATCGAGGGTGGCTACGAGGAGGTGTACACCACCATCAACG 
AGGATGGAACTCGCAGCTCTCGCACCAAAACCTTCTACGACTCCGTTCCAACCGAGGTGGATGAGAAGACGGCCGCCAAGTTGAACAAGAACAAG 
AAGAACGTGACCAGGAAGTCGTCGGTGGACTCTACCGATTCCACGGAATCCTCCCGTCGCATCGTCCAGAAGCAGCAGAAGAATCTGGTGCAGAA 
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CTATGATGAGCAGACCAACTCCACCACGACATCAGAGCGCAGGGTGTCGGTGACTCAGAACATCGAGATCACCGAGAACAATCGCACCGTGCGCA 

AGAACTCGCTGCAGGAGCAGMCGTCAAGGCTATCACTACCACCTCGAACACCTCGTCCAGCGACAAGAAGCAGAAGAAGAAGCCCAAGTCATCG 

GCTCCACCACCACCCGCTGATTTCTTCCAGAACGATACCACCTCTGTGGCCTCGAAGCGTGTGCCCGGCGGTGTGGAGTACACCTATTCCACGCA 

GCTGGATTCTGGCAAGACCATCACCACCTCGAAGACTGTCTACGAGGAGGAGGAGGTGGAACTCACCGAGGAGGAGATCCGCCAGTACAAGAAGA 

CCCTGAAGGACGCCGAGAAGCACAAGAACGTCTCCACCACCAAGAAGCTCAAGTCCGAGTCGGGTACCAAGAAGATCGTTCCTTCCGAAAATCCC 

GGAGATGTGACCACCGTGGAGACCATTAAGATCGAAGGTGGCACCGAGTACCACTACACCACCGTGACATCCGAAGGCATCGTGAAGAAGGCCGT 

GAAGACCGTTTACGATCCAGTGCCCAGCCAGAAGTCCAATCCCGATGACACCGACGAGGAGGAGATTATCGAGGAGTACGAGGAGGAGATCATCG 

AGCCCGGCGAGAAGAACGTCAAGACTATCGAGACCGTCAAGCAGCTGCCCAGCAAGTTCGAGGGTGAGGATAAAGGAAGTGACCATCACTCATGA 

G AGG AAGG AG AAGAAGGT AAAG AAGT C C AT GGT C AT T AGC C AGTAG 

(SEQ ID NO: 1049) 

Start ATG; 151 

MSKLLTSADIEDGKVEFDKATGATTTYKVIEGGYEVITTSPQPNGTIKTQVRTFWDPKPVSEEDLKKEQEKKKLVQKRLGKEIRIDERTTMLTRA 

IEGGYEEVYTTINEDGTRSSRTKTFYDSVPTEVDEKTAAKLNKNKKNVTRKSSVDSTDSTESSRRIVQKQQKNLVQNYDEQTNSTTTSERRVSVT 

QNIEITENNRTVRKNSLQEQNVKAITTTSNTSSSDKKQKKKPKSSAPPPPADFFQNDTTSVASKRVPGGVEYTYSTQLDSGKTITTSKTVYEEEE 

VELTEEEIRQYKKTLKDAEKHKNVSTTKKLKSESGTKKIVPSENPGDVTTVETIKIEGGTEYHYTTVTSEGIVKKAVKTVYDPVPSQKSNPDDTD 

EEEIIEEYEEEIIEPGEKNVKTIETVKQLPSKFEGEDKGSDHHS* 

(SEQ ID NO: 1050) 



Celera Sequence No. : 142000013384430 

CTCTCGAATTGCTTCAAGTTCAGCCAGAAATTGAAACTGCAGTGGAGCTTGTTGGCTAGCCCAACAGCAGGGTATCAATAATAGTAAATAAAGTA 
GAATCGTATGGACGTTATTTATTCTCTGTTGAAAATTCCATAAAACCAGAAATTGATTACCCACTACTAAGAATCTACAAATACATCACAAATGG 
CCACTTCCCATTAAAATTTCAAGTATTTTCAGTGACTGCACGTGAATCCGGCTAATGATATCTACTAATATAGATAAATATAGCACACATGCTGC 
AATTAAATATGAATTTTTTGCTTATCAAATAAACAGAGCTTAATTTCCTACTAATTAGCTATTTTGAAGTGCATGGTGGTAGTCGGCTCTCCGAT 
GGATTAGATAAGACCTATCGCTCTGCAAAAAAAAATGTAGGTCAGCTGGAATTTCGAGCATCCAATTTCAGATTTTGGAGCTTTCGGGTGTTACC 
CGTTGCTAATGGGTAGATCTTAATTGGACGACCTCCATGGCTTCAAGTTCAGTGCTGGTCAATCGTGGCATATGTGAGTGCTGTCAATAATACCA 
CTGATACTTTTTTAATGCCGCTCGTTAAACGGTTCCCACCGACCCAGAGGGCGGTACTTGCTATTTGACACTCTTTCGGCTGGCGAGTGCTCCAT 
ATTATATTAAATTGAATTTGGTGGCGATGTCTCGCTATAAAGCATGATACGCCCGTAGTCGATGTGTCCAATTCCCGTTAGGCAACCAAACGACT 
GGAATCGAGCGATCCACCAATTCTCGATCGGGCTGCGAATCGATCCGATTACGATCCAAACCGAACCGATTCGGCGACATTCAGTTCGCCGTTCG 
AAGAGCTCGAGTGCGCGTGTCAACTGGATCAGCTCCCCCCCCAATAGCTCCCCACAATCCCATCCCTCTGTACCTTTAAAGGCACTCATATATGT 
ATATATATATAACTTTATTTTTTCAAGATCACGGCAGAGAGCAGGGCACCGCTCCTCCTTGCTCACACCTACAACTACACTCGCCTTTAACTCTC 
TCAATCTCTCCAGTTCGGTGCTCGTGTCGGTCGTAAACATAACTCAAAAAGCGCTGCTGAAGTCAGTAGCCGGCTAAACGCGACTCGGATCGGAA 
GTCTCCGCTCCTGGCAATCAGAAAATCCCCTCCACGCGATAAGGGTGTACATA^ 

AAAATCACATAAGAAATGCAACTTTTCGACGACTTTTGCAAAAGCTTCAACAAGGAATTGCAAAGGGCGAATTTCGGCTTTGCGTACAATCGAGT 
TCATTTATTCTACAGATCTCAGGTGAGGACAGTTTCACGTAATAGTCGTCCAAACACAGCCGTCTTCTTTATCGGTGCTGTTACAACTCAGGTGG 
TGGTATCATACGTGCCAATGCAAATAGCAGTAGCGAAAAATAAGTACATATATACATAGTAGTCACTGTCATTAGTGTTCCGTGCAAAAACTTCG 
TGTGAGATCACAGAGGCACGTGTTTCTTAAATATGAAAAGCCAAGGGGCTAGTTGTTCGTTCGCAAATGTCGAAGTCAATGCATTTGAAATGGTA 
TTCCCACAATAATTGGCCTGTACAAACGACGTCGCACTACGACCCAGGCTATCAGCAAATTAACTAGATAAGTGCAATAAAAAACATGATAAGCA 
AAACACCCGAAAAATAAAGGGTTTTTAAAGAGTAAACCATTAAAATTCCTATCATAACAATTTAACGAAGCTCCGCGCATTAAGCCAAAATATAA 
CCTGAGTACCCCAGGGCATTCTATATCTGCCATCCATTAGAAATACACTTAAAAGGTAAAACTGGCTTGTATACCAAATGTTCATTAGAGTGTGC 
TCAACAAAAAAAAAAAAGCGAAGTTCACAGAGAACATAAAATGTTATGGTGAGAACACATTGTACACGCAGGCATTGTTAATTAATTCGTATTTA 
ATTAGCACAGCTAAACTAAAAGCTGGGCTCTATTCAAAGTTGTATTGCTCTAAACAAATATCACGAATAGAACAAACTAAAGTGTAACCCAAATG 
GATATGAAACTATGTTTTTAACCAACTGTTCATGAATGAAGGCAAACCATCAAAAACCATCAGTTAAGTGTAAAAGAATCTTTTATCACTATATA 
TTTGTACACTCATGAGCAAATATAAAAGTTATATGGACAAAGATATCTACAAGACCATGATGATGATACCAAGTTAGATAAGAATTATGAAATTT 
TGAGCTCCCTTTTTCGACAATTCATTCTGAGTATAATCCGGCAAGAAAATACCAATCGGCTACATTAATGTTTTTGTGTCAACATTATGAGTCAT 
TTTAAAATCACCAAAATAAATTCCCTTAAAAAAGCCAAGTACTGAAAACCCCTGATAGTTATTGGAGCTCTTCCATAACACTGATTACACATTTC 
TCACGAATGAAATACTAATCAAATCAAAACAGGAACCAGACAATAGTTTTGGCGCTCGCCATAAATATTCTTGCTAGACCTTATCAGACAGCCAA 
CAACAAGCCATAAACATCCATGAAGGATCGCGTTATCTAACGAATCAATGTTTTTATTTCCGCAATATTTAAGTTCTAAACAAATAAATTTGAAT 
AATACTACGCACGCAACAAAACTAAAAGCTGTGATGCGACGAAGGTGATATCAGTTAGACAGGCCCTAATTCGGACTACGTCATTGCTAGTGATA 
AGCGCTTAGCATGTTTGTTAACGCGGGCAAAGGTGTGGAATATTTTAATTAAAATATTACATGATTATTAATCGATGATCATGTAAGGGTCAGAC 
TGGTGCACGCGATAAGGAATTCGCTATCAATCGCTGTCACAGCAACAATGTAACAATGAATTATTGCCCATATATAATATATTTTTAATTTAATC 
AATTGTATTACATTAATTATTAAATAGCTAAAACTATATAATTCTCGTATTGCAGTGGCAAAAAGATGAGATCAACACAATATACTTGCTCTACC 
GATGGATTTGGGCACTGTTTTTCCTGGGCGTGTACATCATGTGCGTTATCGTTCAGTTTTGCGATGGAAAGTTCTTCATCTACATGACCAACTGG 
GGATTCGGGCTCTGTACGATCACCATGCTGATTTCCGCCGTACAAGTCACCTGCTGGCACTTCGATGTGAGGAGCACCCGGAGCCTGGTACAGGA 
GTCCGGTCACAAGGCGGAGACGTCGAGGGGCCTGAAAATCTACTGGTGGCTCTACAATATGACACTGTCGCTGGCTCTGATCATATCAACAGTAT 
ACTGGGTGTTCCTCCATGGAAAAATGAGTGAGTATTGCCGTATCTGCGTTCATTATTTACATTGCTTTAAATGACAGTGGTGGGTGTGCAAGCTA 
TTACATACCTTACGATATCTATGTGCAATAGAGAGAAAGAGGGCATATTTTATAACATATTTAATGAATTTCCTTTCGTTACAGACAAGCCCATG 
CGATTCCCTGCCATCAGCATTATCACCCACGGCATGAATTCTGTGATGATGCTAATCGATTTCCTGGTCATTGCATTTCCGCTTAGGATACTTCA 
CATGGTGTACGGAATGAGTCTGGCGATATTCTTCTTTTTATTCACGCTGATCTACCATTTATGCGGGGGAACAGATGAGTGAGTACTTGAAATCT 
TACCCGTATACCTTGCAATTTAGAGCTAATCACCTTCACAATTCTCCTTCAGATTTGGCAATCACTACGTGTATCCCATTCTGGATTGGAACAAT 
CCCAACCGCTGTATGGTGACCTTTGTGGGCATCTTCCTGCTCATTATGTGCTACTGGGTGCTGCTCTTTGGACTCTACAAGTTGAAGAGAATGTT 
CAATAGGGCATTCAGTGTGGTTTGGTCTCCTCATGCTGTGGGTCTGATATGAGGGAGTTAATACAACAAAAAGTCATTGAATATCACCTCATGGG 
CACCCGCACGTATACCATGTTCTTATATGCATTGCCAGCATTATGTAACTCCCATTAACAAATCCATCCAAAAACGACACTGAAAAGGGAGTCGC 
ACACTTGCATGCATTTATTTATATCTAGTATTTCATATTGTAAGTTTTAATCATTTGGTTCCTTAACATTGAATTGGTTTAGTTAGGTTAATTCT 
TATACATTATGTTGTTATTACATATTTGGTTTAAATAGTATAAATATGAAATTTGTTGAAAATACTCTACCCAAGGGTTAGATAAAAAGAATAAT 
GGAATTGTATTAAAATATATCACGAAT AT C AAAAGTGG AAAGAAACG AAATAAAGAAAAT GT GGCGT T ATT GAATTT AATTGTGGAAATCGAATA 
TTTTATTATTATAGACATGAACAGGATAATTTTTATAAAAGTAATATCTAAATTTATTTTCGCTGTTCGTTTTTTAAACAATTATATAAGTAATT 
GAAACTCATTATATTTCATTCAAAATTAAAACAATTGTAGATAACATTTTAATTGTTCTGCATTTCGAGCAAGCTTATCTCTAGTCTTTGGAAGC 
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AATTTGTAAATAATTTTTCAAAATGTTTTTTTTTTAATACCCTTTCATATTTTATCAGCACCGTAGAGATCATTAAAACATTTACTGTGAAGTGG 
AACAGGTTGAATGACACTAAAGATTTCGGTAACCGGAATACTTACTTTTTCGCTTACTTTTCTAACCCAAATCTCGAAGACTTTTGTTATTCTTC 
AATAAAATGCTTACTAAGCTTTTTTCGTTGTAATGCTGCAATTAGGTTACAAATCCAGTTTTAAGAAAGTGCTAAGATCTTTCTAAATATTACGA 
AACAATTAAGACATTAACAAAAACGAGCTCTTTTCTTCTCTTTTATACCAAGCTGTACCCGCAGGTAAAATGTACTTTTATGTGGCTGCAAACAT 
TTGCCAAATAAAAGCACACACAAATGCACATAATTTAGGTGAGCGAGTAATTAAGCCCAAAGTGGACAAAAGAGGAGGGCAAAAACGGAGCCAAA 
AAGCATCCATCGTTTTTTATGGCTACTTGGTCGTTCGTTCTGTCAATACAATACCCAAACACTGTAAAAACCAATTGGAAGTTTTTTATTGAAAA 
CATTAATAAAGTCATTAATATTGTTGTTGTTATAGATTCCAGAGACATTAAATAGTAAATAGTATTTTAAAGTTAACAAACACATATTTATGTTA 
CAAAATCCCTGAATTGTTTTCTCTGTACAGCTTCTCTCTTTCTCTCTTGGCTCTGCCGTAAAATACATGTGCGAGCACACCTGCCTGCGTCGACG 
TCACCGTCGGCTGCGCAGTCGCCGCCGCGTCT 
(SEQ ID NO: 1051) 

Exon: 1001. .1352 
Exon: 3001. .3352 
Exon: 3505. . 3688 
Exon: 3758. .4352 
Start ATG: 1251 

Transcript No. : CT27002 

GCTCCTCCTTGCTCACACCTACAACTACACTCGCCTTTAACTCTCTCAATCTCTCCAGTTCGGTGCTCGTGTCGGTCGTAAACATAACTCAAAAA 

GCGCTGCTGAAGTCAGTAGCCGGCTAAACGCGACTCGGATCGGAAGTCTCCGCTCCTGGCAATCAGAAAATCCCCTCCACGCGATAAGGGTGTAC 

ATAAAAAAAAAAAAATAAATAAAACAAGTTAAATAAATATATAAGAAAATCACATAAGAAATGCAACTTTTCGACGACTTTTGCAAAAGCTTCAA 

CAAGGAATTGCAAAGGGCGAATTTCGGCTTTGCGTACAATCGAGTTCATTTATTCTACAGATCTCAGTGGCAAAAAGATGAGATCAACACAATAT 

ACTTGCTCTACCGATGGATTTGGGCACTGTTTTTCCTGGGCGTGTACATCATGTGCGTTATCGTTCAGTTTTGCGATGGAAAGTTCTTCATCTAC 

ATGACCAACTGGGGATTCGGGCTCTGTACGATCACCATGCTGATTTCCGCCGTACAAGTCACCTGCTGGCACTTCGATGTGAGGAGCACCCGGAG 

CCTGGTACAGGAGTCCGGTCACAAGGCGGAGACGTCGAGGGGCCTGAAAATCTACTGGTGGCTCTACAATATGACACTGTCGCTGGCTCTGATCA 

TAT C AAC AGT AT ACT G GGT GT T C C T C C AT GGAAAAAT G AAC AAGCCC AT GCG AT T CC C T GC CAT C AG CAT T AT C AC C C AC GG C ATGAAT T C T GT G 

ATGATGCTAATCGATTTCCTGGTCATTGCATTTCCGCTTAGGATACTTCACATGGTGTACGGAATGAGTCTGGCGATATTCTTCTTTTTATTCAC 

GCTGATCTACCATTTATGCGGGGGAACAGATGAATTTGGCAATCACTACGTGTATCCCATTCTGGATTGGAACAATCCCAACCGCTGTATGGTGA 

CCTTTGTGGGCATCTTCCTGCTCATTATGTGCTACTGGGTGCTGCTCTTTGGACTCTACAAGTTGAAGAGAATGTTCAATAGGGCATTCAGTGTG 

GTTTGGTCTCCTCATGCTGTGGGTCTGATATGAGGGAGTTAATACAACAAAAAGTCATTGAATATCACCTCATGGGCACCCGCACGTATACCATG 

TTCTTATATGCATTGCCAGCATTATGTAACTCCCATTAACAAATCCATCCAAAAACGACACTGAAAAGGGAGTCGCACACTTGCATGCATTTATT 

TATATCTAGTATTTCATATTGTAAGTTTTAATCATTTGGTTCCTTAACATTGAATTGGTTTAGTTAGGTTAATTCTTATACATTATGTTGTTATT 

ACATATTTGGTTTAAATAGTATAAATATGAAATTTGTTGAAAATACTCTACCCAAGGGTTAGATAAAAAGAATAATGGAATTGTATTAAAATATA 

T C ACGAAT AT CAAAAGTGGAAAGAAACGAAAT AAAGAAAATGTGGCGTT ATTGAAT TT 

(SEQ ID NO: 1052) 

Start ATG: 251 

MQLFDDFCKSFNKELQRANFGFAYNRVHLFYRSQWQKDEINTIYLLYRWIWALFFLGVYIMCVIVQFCDGKFFIYMTNWGFGLCTITMLISAVQV 
TCWHFDVRSTRSLVQESGHKAETSRGLKIYWWLYNMTLSLALIISW^ 

GMSLAIFFFLFTLIYHLCGGTDEFGNHYVYPILDWNNPNRCMVTFVGIFLLIMCYWVLLFGLYKLKRMFNRAFSVVWSPHAVGLI* 
(SEQ ID NO: 1053) 

Classification : known_f lybase_gene 
Gene Symbol: rost 
FlyBase ID: FBgnO011705 



Celera Sequence No. : 142000013384623 

TTTTTTCAATGTTGTTTTAGCTTGAGTTGCCTAATCTATGCAAACGAAATATATGAAGTGAAAATTGATATGCTGATCTCCACCATCCTATATAC 
AATCCATAAATCGTATATACAGAAAGAGAAATGTATTAAACTCGGCCAATATCGAAACGCATTTTTTAGATGTTGTTTTTGTGTTTTGAACTGCA 
TTCGACCGTTGACCGCAAATGTACCTAAGTGGCTATTGTTAAAACAAATCTGCTCTTTGGCAGTTGAATGTGCGAATACAAAGAATTACACAATT 
TATACAAAACATATTTAACCCAAATGTATACCCTTATACAGACATCTCTATATAAATATATTTATGTGCAAGTATTTGATGAAATAAATATTCAA 
ATTATTAAAAAGCATTTCTCCGATGAGTTGATATTTGTGTTTTGAATTTGGCATGGGAAATATATTTATTAATAGCTTGAAAAGAGAGGCAACGT 
CTTTTTGAAATTAAAAACCTTTGTTATATGTATACCTTGTAATTTAAAATTATTGGTATTTTACGTAAAGCGTATAACTTTAAAGGAATGAGAAA 
CCATTGTAATGTAGAAAATATTTAATAATATTTCTTTAAATTCATTGTATGGTAACTTATGTAGTTACCGCTTTATTTAATTACTAAGTACT^ATA 
TCTTGTTGTTTTATGCCGTAAGCTTGTTACGAGCGGAGCTGAAGTTCCAGTGTGTCCAGTCTGACGATAAGAAGACAGAAGATATCGTCCTCCAG 
GAAAGGTCTTGGCGTTAGTACCTCTTTTCGCCGTGATGCCAGACCGGGTTTCAGTCTGGGCGCCACAATTGAAATAATAATTTTTTATTGCTATA 
ATAGCCATGCTAAGATAATAGAAATTTTGAAAATTGAGTATTTTTTGAAAGTTGTTTCAATGACTTAATTTTATTTTAAATAAATGCGCCACTTT 
GAAAGCTTGAAAAAAATACGGCTTCAGGGTCATGGTATTTAATGTCGTGGAAATACCACACTCTGGTAACTCTGAGCGGGCGTCACTCTTTTCGT 
CTCTGGGCACACTGTCGGCCCTCGCACGCGTATTTTTCTGCCGATTTTGCGTGTAAATTTAATTGTGAAAACATAGTAAAACAGCTGCAATCATG 
GCCAATTTCGATCTGACACGCATCAATTGCCAGTTTTTGGACAGGCATTTGACCTTCCCGCTGCTGGAGTTCTTGTGCGGCAAGGAGGTTAGAAA 
AGCGACCAGGCGGTGCTCCGCTGGCGGTTGCCGAGTCACCAGTACATAACTATGGGAAATCTTCATATGTGTGACTTTCGTTGCAGATCTACAAC 
C AGC AGGAG CT G CT G G AG T AC AT TT T GG AG AC GGT GAAC AAG AC G AAC AT GAT C G A T T AC AC G AT GG AC AC C C G C AAGC GT C T C AAT CTCAGCCA 
GGAGATGCCCGAGGAGCTCGTGCAGCGCAAGGCGGAGGTCCTGGCCACGCTTAAGCAGCTGCAGAACGAGGTGGCACCCATCATGAAGGCCACCG 
ACATTCTTAAGAACGGTGAGAGCATGAAGGACTCGAAGACCTTCGTCAACGCCCTTCAGAAGGACTACAATTTTAAGGTGGAGCACCTGGAGAGC 
GCCTACAAGCTGGCCAAGTACCTCTACGAGTGCGGCAACTATCAGGAGTCCACCTCTTACCTGTACTTCTGCCTCATCGTCATGTCGCCCAACGA 
CAAGGTGCGTATGGAGCTAAATTAACATGAAAAGCGGCAAAAACCACTAAAACAAGAATATTAATTGATATTGTCTCCCAAATTTTGCAGAACTA 
CCTTAATGTGCTGTGGGGCAAGCTGGCCGCCGAGATTCTGACCCTCAACTGGAACACTGCTCTGGAGGATCTGACGCGTTTGCGCGACTACATTG 
ACAACGCCAACTTCAGCACCATCCAGGCCCTGCAGCAACGCACCTGGCTGATCCACTGGTCGGTCCTGGTCTTCTTCAATCACCCCAAGGGACGC 
GATCTCATTATTGAGATGTTCCTGTACAAGCCCCTGTATTTGAACGCCATTCAGACAATGTGTCCGCACATCATGCGTTACTTGGCCACCGCCGT 
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AGTCATCAACCGCACCCGCCGCAATGCTCTAAAGGATTTGATCAAGGTGATCCAACAGGAGTCGTACACGTACCGCGATCCCATCACCGAGTTTC 

TGGAGTGTCTGTACGTGAACTTTGATTTCGAGGGAGCGCGTCTGAAGCTGCACGAGTGCCAGACGGTGATCCTCAACGACTTCTTCATCGTGGCC 

TGCCTGAATGAGTTCGTGGAGGATGCCCGCTTGATGATCTTCGAGACATTCTGCCGCATCCACCAGTGCATCACCATCAGCATGCTGGCCGATAA 

GCTGAACATGAAGCCCAACGAGGCTGAGTGCTGGATCGTCAACCTTATCCGCAATGCCCGTCTGAATGCCAAGATCGATTCGAAACTGGGTCATG 

TGGTGATGGGCACCCAGCCCTTGAGTCCCTATCAGCAGCTGGTGGAGAAGATCGACTCGCTGTCCATGCGGTCGGAGCATTTGGCAGGTTTGATT 

GAGCGCAAGAGCAAGCAGAAGCAGAATCAGGAGTCGGCCGACTCCTGGAAGTACTACTAGTTGCTGACTTTTTATCCACAATTAAGTTAAGACTC 

TTTGTACCCTTTAAAAACCGATGATTGCGCGCAGTCGAGCGAATAAATTAACATCTTACTCATTCGCTCAATTCAACATTTGATTTTATTTCCTT 

CGGAATCAAAATTAGTTATACAAAAAACAATACCAAGGCGGGAATTTGTAATTTCGCTAGCCTACAGAATGCAAACGTCTGCTAATCCTAGAATT 

AGTTGTTTGTACCTACAAGTGGGTGCTGGATCTTTATGACGATCTGGCCCCGGATCACACTATTGATTGCTACCGCGATTTCTGCTTGACCACCA 

ACCAGGATCTGACGGGCAGGTCCTGCAAGTAAACGATCCATTTGCGCACCAGCAGCAGCCACACTATGCCGTACAGAACGCAAATGCCCGACTCC 

AAGTAGTACCCATCGAAGGTTATCTCGCACTTGCCAGCCGACGCTTCACAGCTCTGCGGGAGAAAATTAAGAAAAATATTTGTTTAGTTACTTTT 

ATCTCATAACTGGCAAAGTTTATCTTTTCCTGGTTTGGGGTAAAACTTTCACCATACATATGTATCCATATACAATAATAATAATAATCTTGTAT 

GCTGTAGAATCTCTCTCTGAGCACTCACCTGTTGCTCCTCCTTGCCCTGGCAGGTATTTTCCGCATTATTGGAGCACTGCTTCCAGGTGAGCACG 

TCCACCAACCACAACACCACTGTGTTGGGCCAATTACCGCCCAGATTGCACAATGTGTTCAAAAATGTCATGTAAGTGCCACCGACGGCTGGATC 

GGAGATCTTGGCGAAAAAGGCCATGGCGGCCACAAACATGGAGTACAGGAATACTTGGTAGATGGCATAACTGGTGATCAGCAGGACGTAGTAAT 

AAGCAGGCACGGCTCCTCCACGGACAATATAGGGTGTGATGTAGGCCAAGCCAGTGGCCACTGTTGCCATGATGATGCGATAGGGAATGGCCTTC 

AGATAGACGTCCATCGGACGTGGTCCGTTCGTATATCGACCGACCGCCAGCGGCAGAATGAGCTGCAGAGGAATTAGGGGAATGGCGAGTAAAGC 

CAGCTTATCCTTCGGTACACCAGCGTCAATG 

(SEQ ID NO: 1054) 

Exon: 1001.. 1227 
Exon: 1322.. 1714 
Exon: 1801.. 2736 
Start ATG: 1138 

Transcript No. : CT27364 

AAATACCACACTCTGGTAACTCTGAGCGGGCGTCACTCTTTTCGTCTCTGGGCACACTGTCGGCCCTCGCACGCGTATTTTTCTGCCGATTTTGC 
GTGTAAATTTAATTGTGAAAACATAGTAAAACAGCTGCAATCATGGCCAATTTCGATCTGACACGCATCAATTGCCAGTTTTTGGACAGGCATTT 
GACCTTCCCGCTGCTGGAGTTCTTGTGCGGCAAGGAGATCTACAACCAGCAGGAGCTGCTGGAGTACATTTTGGAGACGGTGAACAAGACGAACA 
TG AT C GAT T AC AC GAT GG AC AC C C GCAAGC GT C T C AATCTC AGC C AG GAG AT GCC C G AGG AGC T C G T GC AGC GC AAGG C GG AGGT CC T GGCC ACG 
CTTAAGCAGCTGCAGAACGAGGTGGCACCCATCATGAAGGCCACCGACATTCTTAAGAACGGTGAGAGCATGAAGGACTCGAAGACCTTCGTCAA 
CGCCCTTCAGAAGGACTACAATTTTAAGGTGGAGCACCTGGAGAGCGCCTACAAGCTGGCCAAGTACCTCTACGAGTGCGGCAACTATCAGGAGT 
CCACCTCTTACCTGTACTTCTGCCTCATCGTCATGTCGCCCAACGACAAGAACTACCTTAATGTGCTGTGGGGCAAGCTGGCCGCCGAGATTCTG 
ACCCTCAACTGGAACACTGCTCTGGAGGATCTGACGCGTTTGCGCGACTACATTGACAACGCCAACTTCAGCACCATCCAGGCCCTGCAGCAACG 
CACCTGGCTGATCCACTGGTCGGTCCTGGTCTTCTTCAATCACCCCAAGGGACGCGATCTCATTATTGAGATGTTCCTGTACAAGCCCCTGTATT 
TGAACGCCATTCAGACAATGTGTCCGCACATCATGCGTTACTTGGCCACCGCCGTAGTCATCAACCGCACCCGCCGCAATGCTCTAAAGGATTTG 
ATCAAGGTGATCCAACAGGAGTCGTACACGTACCGCGATCCCATCACCGAGTTTCTGGAGTGTCTGTACGTGAACTTTGATTTCGAGGGAGCGCG 
TCTGAAGCTGCACGAGTGCCAGACGGTGATCCTCAACGACTTCTTCATCGTGGCCTGCCTGAATGAGTTCGTGGAGGATGCCCGCTTGATGATCT 
TCGAGACATTCTGCCGCATCCACCAGTGCATCACCATCAGCATGCTGGCCGATAAGCTGAACATGAAGCCCAACGAGGCTGAGTGCTGGATCGTC 
AACCTTATCCGCAATGCCCGTCTGAATGCCAAGATCGATTCGAAACTGGGTCATGTGGTGATGGGCACCCAGCCCTTGAGTCCCTATCAGCAGCT 
GGTGGAGAAGATCGACTCGCTGTCCATGCGGTCGGAGCATTTGGCAGGTTTGATTGAGCGCAAGAGCAAGCAGAAGCAGAATCAGGAGTCGGCCG 
ACTCCTGGAAGTACTACTAGTTGCTGACTTTTTATCCACAATTAAGTTAAGACTCTTTGTACCCTTTAAAAACCGATGATTGCGCGCAGTCGAGC 
GAATAAATTAACATCTTACTCATTCGCTCAATTCAA 
(SEQ ID NO: 1055) 

Start ATG: 138 

MANFDLTRINCQFLDRHLTFPLLEFLCGKEIYNQQELLEYILETVNKTNMIDYTMDTRKRLNLSQEMPEELVQRKAEVLATLKQLQNEVAPIMKA 
TDILKNGESMKDSKTFVNALQKDYNFKVEHLESAYKLAKYLYECGNYQESTSYLYFCLIVMSPNDKNYLNVLWGKLAAEILTLNWNTALEDLTRL 
RDYIDNANFSTIQALQQRTWLIHWSVLVFFNHPKGRDLIIEMFLYKPLYLNAIQTMCPHIMRYLATAVVINRTRRNALKDLIKVIQQESYTYRDP 
ITEFLECLYWFDFEGARLKLHECQTVILNDFFIVACLNEFVEDARLMIFETFCRIHQCITISMLADKLNMKPNEAECWIVNLIRNARLNAKIDS 
KLGHVVMGTQPLSPYQQLVEKIDSLSMRSEHLAGLIERKSKQKQNQESADSWKYY* 
{SEQ ID NO: 1056) 

Name: eIF3-p48 

Classification : translation_f actor 
Gene Symbol: Int6 
FlyBase ID: FBgn0025582 



Celera Sequence No. : 142000012789666 

TATTGCTTACAATGCCAGCTATCGATAGTAACGATAGTAGTGTCACAGTTAACTGTTTCATGGCTGGCTTCGTGCACAGCTCACCGAGACTATCG 
ATAAATATCGCATCTGTGTGGCTGCACACCGAATTCGTAATACACCAAATACCGGTACCGTACTCGATGGCATTATTTAGGAACTTTCCCATCGC 
CGCGGCAAGACTGAAATCGACTGGCCTTTAGATGCTCTTTGGCGGTGGGCGCCGCTTGACTTGCCAGACTCTCCAGGCCGAGTGTGTGTGTGTGT 
AGTGGTTGTGCTAGCAGAGCTGTTGCAGAAACTAACTGGGAAAACACAAGATAGACGCGTGTGCGGCGAGCAGTAGCTAAATTGAACTGAAGTGA 
ACAGAAAATATTTACACAAAATAATAGAGATACAACTTCGGAAAAGCAAGCATACAACACACCGGGACTCCGCAATACGGGTTTTTTATTTCGTC 
TACTCTTCCGTACCTGGGCAGAGATTTAGATTTATCGCCTCCTCCACCCACTACTTCATCATCGACATCACCAGTGCTGTTGTTGTTGCTGTCGT 
TCCGCCTTCGCTCTCACGTAATCCGCCCCCGCACCACCTCGCCCGTCGGCCAACCGGCCCACAAACTAACTGACTGATAGACCAAAAATCGATTA 
GCGGACTCTCTTCCTCTTCAACATCTTCGGCATTCGTGTGTGCGTTGCTTTGCGTTACGTTACGGTTACCGTTACGTCGCCCGCGCGTGTGTGTT 
GGTGTGTGAGCAAGGTGTGGTGTGTATAGAAAATCGATTGGCAATTTGAACCGAAGAGCGCTATCGTCGCCGCAGTCGCCGCGTGTGTGCCGTTA 
CGCGATTTTCAACAGCAGCCGCGCAGTAATATTGCTTTTATTTTTTTGCCTGTGAGTCCTCCTAAGTGAGTGCGAAAGAGAGAGAGCGAGAGTGA 
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GCTCTCTAAAACCTCTGCAGAAAATAAAAAGGZVAAAGAAAAACACCGGACATGGATAAATCAGCGGCCACGAACTCGGCCACTGCCGATGGCCAA 

AACGAGGCAGCAGCAGCAGCCGCCGCCGCAGCAACAGCAGCTGGGTGTGGCAGCAACAACAGCAGCAGCACCAGCAGCAGCAACAACACTGCAAA 

CAGCAATAATGCACTGCAGCGTTTGGCAACAACAACAAGTGCCGTTGTGGCATCCCCGATGACGATCAACCTGAACATCAGCACCACCACCGGCG 

GCAATTTCGGCGTCAGCGTCGAGCCGCACATCAGTGTCGAGAGTCTGAAGAAGATCATCGCCAAGAAGCTGAAGGTGGCCAAGGATCGCATCTGT 

CTGCTGCACCGGGAAAAGTGAGTACCCCTTTAGTATTTTTTTTTAATTTACGTTACATTTAAAAGAAACTTGGCTTGAACTGTAATTTTTTGGCA 

GTCAAGTTTTTTGTTGTTTGTTACTAATTTAAAAATTAATAACAGTTAATATGGCTTAGATCGCCAATTTTTATGTTATTTATCTGAACATTTTC 

TTTTTGTTTAATTGACGAGACTAATGTTGAATGTTAAATATTACTTATTAATCTATGCATTATATTTGTTACATAAAAAGGATTGTTAAGGTTTA 

ATAAGCCTGTGTTTGGACAAGTGTGTCATTTGCATATTGCCCAGAATATCTTATGATTGTGCCAATGAGGTAATATCAATAAACCAGAAATAACT 

GATAATGCAAAATGTTTCGCAAAGAATATTATCTTGAAAGTCAGGTGAAAAGTTAAGTGTGATATCTTTTATAGAAAAATTCTTGAGGAGTGAAT 

AAGTCCTGTTCTAATTAATCCTTGCGTTAAGAAAATCGTATGGCTTCGGTGAAACGCCTTGTATGTTGAGAGTTTTCTTTTTTATATTTTTATTG 

GATATCCTCTTTTTTTCGAGCTTTGTTTGGTTAAAATGCTTTTGATTATCTTCTTTCACCGCTCCACCCCCCTCCACAGAGCTCCTTCTCTAACC 

CCTTTGTTGAGCTTTGTTAGAGGGAGAGATATAGAGAGAGAGCGAGAGAGGTGTGCGTGTCTGTGTGTGAGCGTGTTTTTTAGTCGCTAAGCTGC 

CATTGTTTAGTTGATTTGCTTGCTGTTCTCGTTATTATTATGATTATTATTGATATAACCTCCAAAATAATGGCATGTTTGAAGGGTTAAGCACA 

AAGAGCTTGAGTGTGGTGGAGTCCTTGTGTGGAAGGGGGGGGGGGGGTGTACCAATTGATCTAATCACACACATACACACGTACACACATGTACA 

CTTAGTCTGAAAGCGATTTCATAATCTATTTGTTATTATACAAAACAACGCAAAATAAAAGTAGCTAAATG 

(SEQ ID NO: 1057) 

Exon: 1001.. 1351 
Start ATG: 1001 

ATGGATAA^TCAGCG 

CAGCAACAACAGCAGCAGCACCAGCAGCAGCAACAACACTGCAAACAGCAATAATGCACTGCAGCGTTTGGCAACAACAACAAGTGCCGTTGTGG 
CATCCCCGATGACGATCAACCTGAACATCAGCACCACCACCGGCGGCAATTTCGGCGTCAGCGTCGAGCCGCACATCAGTGTCGAGAGTCTGAAG 
AAGATCATCGCCAAGAAGCTGAAGGTGGCCAAGGATCGCATCTGTCTGCTGCACCGGGAAAAGTGA 
(SEQ ID NO: 1058) 

Start ATG: 1 

MDKSAATNSATADGQNEAAAAAAAAATAAGCGSNNSSSTSSSNNTANSNNALQRLATTTSAWASPMTINLNISTTTGGNFGVSVEPHISVESLK 

KI I AKKLKVAKDRI CLLHREK* 
(SEQ ID NO: 1059) 



Celera Seauence No. : 142000013384504 

AAACTATGCGTCACCCAGCGAGTATGTGGTCCTCTTCGGCTGCATGTGTATGACCTGCGTGGGCGGTTCGAGACCAAAGGGGTAACTCACTTGCA 

AACAAATTTTTCGGTCGTTCGGCCATTGTGTGTGGTGGCTGAAAGTATCGATGCGCCTTTGTGTGGCTGTCCTAAGGATGTTTTGTCCGTTTTCT 

GCAGGCGCCGCGCCCCGAATACAACACGTCGACTTGATGCTCCTTTGGCCCTCCGATCGCCTGGTTACTGGGCCGCCTTTGATGGGCTCTTCGCC 

GTTTGCAACGCAACTACTTTTTCAGCAATCGAGCGCACTGATATGCACTTTTTCCGCAATTTCGCTAAACATTTACATAAATAGCAACCAAAACT 

TAAAAGAGCTAGAGATGAAACATTGATTATTTGCGGCCAATCGATAGCCATCGATAGTATGCGCACTAGCTTGCGATAAGCAGTCCTGCAAGGTG 

CAAAAGTAGTGAGGTGGGTAATAAAACGAATATAAAAATGTCACCTATAAATAATTATTTCTTTAAAACCAAGAGATTTGAGTTTCTATTTTTCA 

CAAATTAATCCCAGCTTTGTTAAAGTTTTTACTAGTTGAAAAGTTACATTTCGTCAATAAGTGACGTCGACGCCTACATAAACCAATCTGCGCCA 

CCTTAAAATCAATGCACAATCACCAGTGATTTAAAACCAGAAAATTAAAACATACATAATCATGTCCAGCGGATTTGTTCGCAGCAGCTACAAAT 

TGTTCGTCGGCAACCTGCCATGGACGATAGGTAGCAAGGAACTGCGGACCTACTTCTCAAAGTACGGGCACGTGGCCAACGCGGAGGTGGTCTTT 

GACCGGCAACTGGGCCTGTCCAAGCACTATGGCTTTGTGGTTTTCAGCCAACGGGATGCCTTCAACAGCGCCAGCAACCAGAACACCCATTTCCT 

CGATGGACGAGTGCTCACAGTCCAGCGGGCGAATGAAAGTCCTCAGAGTTAATCAATGAAGAATTTAATGATTTATTGGTTAGACCAAAGGAAAA 

GCATGAGAAATGTTTGTTAACAGCTTAGTTTTACACAAATGAATGCAGCACTTGCTTCAAATCGTGGCATCTTTGATTCATTTGGGCAAGTAGAA 

CTTGCTGTTGGTGCCCACCACGTCCGCAACATTAGTGTGACCCAGCCGTGTGATCAGCGCCGACAGCTCCGCCTTAATGTCTTCCACAAGAGCGG 

GTCCCTCGTACACTAAAGCCGTGTAGATTTGGACGTAGGAAGCTCCAGCTTCGATCTTTTCGTATGCATCATAGCCGGAAGCCACTCCGCCGACG 

CCTATGATGGGAATCTTTCCATCGGTGAGCTGGTACATCTGGGCGATCATCTCTGTGGAGCGGGCCTTGAGTGGAGGACCACTTAATCCGCCAGT 

TTCCTCGGCCAGCTTGTTCTTTTCGATGTTTTCCCGCGACACCGTGGTGTTTGAAACGATCAGGCCATCCACGCGGCTCTTCTTGCGCTTGATTA 

CCCACACGATGTCCTTCATGTCGTCCAAAGATAGATCCGGCGAAAGTTTCAAGAGTATAGGCACATTCTTGTTCTTATCCAGGGAGGATTTCGTG 

TCGTTGACCTGTTCCAGGAGCTCTCTTAGCTTCTCCTTGCTCTGCATATCGCGCAGGCCCTTGGTGTTCGGACTGCTCACATTAATGACCAGGTA 

GTCTGCTACGGGCCCGAAAACCCGAACACCCTGTACATAATCCGCTATCGGGCTCATGGTGGTCTTGTTCCGGCCCAGATTCACGCCCACAACTC 

CATTGAAGTTCTCCTTTTTGCGGAGCAAGCGCAGCCGTTGGAGCACGGCCTGATGCCCGTCGCTATTAAATCCGTACCGATTAATGATGGCCTTG 

TCCTCGGTCAACCGGAATACTCTCGGCTTGGGATTGCCCTCCTGAGCCGCCGGGGTAACTGTGCCCACCTCTATAAATCCGAATCCTAGATCCTG 

CAGGCCGTCCACCGCCTCCGCGTTCTTATCGAATCCCGCTGCAATGCCAATGGGATTGCTAAGCATCCGGCCAAAGAACGACGTGTGCAAGTTCT 

GGTCATCATGGTACTGTGAAACCGGGCACAGGCGGTACTTGCACGCCAGCACGGCCAGTTGATGGCTGGCTTCCGCCGGAAGAAGTCGCACCGCT 

GGCATCACGAAGGTCCGGAACAACTGGTCCTGGTTCTTGTATGCCGTAATGCCCGCCACCAAGGCAGCTCCTCCGACTGTAACGATTCCCAGCGA 

CCGCAAACGACCCTGTGGGCGGAATCATTCTGTAATCCGGTCCCGTTTAAGCCGTACTTTACTCACAACTCTCCGCGTGGCGTTTTTAGCATTCT 

TTAAGTGATCCTGATCCATGACGACTATCGCCGGATATCGATTGTACCCGATAGAGTATTAAAAGTATAATATGTAAAAAAGTAAATGCATGTAC 

TAGAGTTTTAACGAAAATTCAATTTGGGAGTAAGTCTGGCCATTTTGAAGGAAAACTCATGCGATATATAGTTACTACGGCAAGCATCAATAATT 

TTCTTTTTTTTCTGTGTAGCAAATGAACCCCTTCTATTCACAACCATGCCGAAGAGAATCGATAGTCCTCCAGTGTCACCTCTAATGGTTGAGTC 

AGTTTGTTATGGCCAGCTAAAAATTAGAAATTTCGTTCTCTAAACGCTGATTTGGCCACCCGAGGCAACCATGAAGGGCGGCACCTTTCGCAACA 

CGCAGTGGGATCCCACGCTGCTGTCCTCACAGATTGTGTCCATGCAGTTCTGCGTGTACTTCACCCTTGGCCTACTCGTCTTCGTGGCCAACAAG 

TTGTCCGGGGACAACTATAGTCTGGATCACTTGTTCGAATACCATGTATGCCATATGCTATATACCATATATCATGGAAACCCCTCTCCTTACTT 

TGATTTAATGTTGCTTAAAGGAGATCCACATCTACGACATGGGTGGCCGCCTGGTGATCTGCGCATTTGTTTTAAATGCGTTCTTAGCCTCCCTG 

GCACTGTGGTGCATTGTGAGAAGAGCCAAGCTCTGTCTGGACTTTAGGTGGGCTTATAGCTAGTGCACACACTTAATCTTTTGCTTTTACAACTC 

ATGCTTCCCTTTCATGGCAGTTGCACCTTCCACGTGCTGCATTTGCTGATCTGCTGGTGGTACAACCGCTCCTTTCCCGCGAACGCATCGTGGTG 
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GCTACTGAATGTCATCACCGGCACAATAATGTGCATAGGCGGCGAGTTTCTCTGCCTGCAGACCGAGATGAAGGAGATCCCAGTGGGCTATGCGG 
CCCTCAATCAAAAGTCGGACGTCTGACTCGAAGCCACCAACTTGTGATGCCTGCTACCCCCCCAACCAA 
(SEQ ID NO: 1060) 

Exon: 2394.. 2347 
Exon: 2292.. 1001 

Start ATG: 2394 (Reverse strand: CAT) 
Transcript Ho. : CT27530 

ATGGATCAGGATCACTTAAAGAATGCTAAAAACGCCACGCGGAGAGTTGGTCGTTTGCGGTCGCTGGGAATCGTTACAGTCGGAGGAGCTGCCTT 
GGTGGCGGGCATTACGGCATACAAGAACCAGGACCAGTTGTTCCGGACCTTCGTGATGCCAGCGGTGCGACTTCTTCCGGCGGAAGCCAGCCATC 
AACTGGCCGTGCTGGCGTGCAAGTACCGCCTGTGCCCGGTTTCACAGTACCATGATGACCAGAACTTGCACACGTCGTTCTTTGGCCGGATGCTT 
AGCAATCCCATTGGCATTGCAGCGGGATTCGATAAGAACGCGGAGGCGGTGGACGGCCTGCAGGATCTAGGATTCGGATTTATAGAGGTGGGCAC 
AGTTACCCCGGCGGCTCAGGAGGGCAATCCCAAGCCGAGAGTATTCCGGTTGACCGAGGACAAGGCCATCATTAATCGGTACGGATTTAATAGCG 
ACGGGCATCAGGCCGTGCTCCAACGGCTGCGCTTGCTCCGCAAAAAGGAGAACTTCAATGGAGTTGTGGGCGTGAATCTGGGCCGGAACAAGACC 

ACC AT GAGC C C GAT AGC GGAT T AT GT AC AG GGTGT TC GGGT T T T C GG GC CC GT AGC AG AC T AC C T GGT C AT T AAT GT G AGC AGT C CG AAC AC C AA 
GGGCCTGCGCGAT AT GC AGAGCAAGGAGAAGCTAAGAGAGCTCCTGGAACAGGTCAACGACACGAAATCCTCCCTGGATAAGAACAAGAATGTGC 
CTATACTCTTGAAACTTTCGCCGGATCTATCTTTGGACGACATGAAGGACATCGTGTGGGTAATCAAGCGCAAGAAGAGCCGCGTGGATGGCCTG 

AT C GT T T C AAAC ACC AC GGT GT C GC GGG AAAAC AT CG AAAAGAAC AAG CTGGC C GAG GAAAC T GGCG G AT T AAGTG GTC CT C C AC T C AAGGC C C G 
CTCCACAGAGATGATCGCCCAGATGTACCAGCTCACCGATGGAAAGATTCCCATCATAGGCGTCGGCGGAGTGGCTTCCGGCTATGATGCATACG 

AAAAGAT CG AAGC T GGAGCTTCCTACGTC C AAAT C T AC AC GG C T T T AGT GT AC G AGGGAC CCGCTCTT GT GG AAG AC AT T AAG GC GG AGCT GT C G 
GCGCTGATCACACGGCTGGGTCACACTAATGTTGCGGACGTGGTGGGCACCAACAGCAAGTTCTACTTGCCCAAATGAATCAAAGATGCCACGAT 
TTGAAGCAAGTGCTGCATTCATTTGTGTAAAACTAAGCTGTTAACAAACATTTCTCATGCTTTTCCTTTGGTCTAACCAATAAATCATTAAATTC 

TTCATTGATT 

{SEQ ID NO: 1061) 

Start ATG: 1 (Reverse strand: CAT) 

MDQDHLKNAKNATRRVGRLRSLGIVTVGGAALVAGITAYKNQDQLFRTFVMPAVRLLPAEASHQLAVLACKYRLCPVSQYHDDQNLHTSFFGRML 
SNPIGIAAGFDKNAEAVDGLQDLGFGFIEVGTVTPAAQEGNPKPRVFRLTEDKAIINRYGFNSDGHQAVLQRLRLLRKKENFNGVVGVNLGRNKT 
TMSPIADYVQGVRVFGPVADYLVINVSSPNTKGLRDMQSKEKLRELLEQVNDTKSSLDKNKNVPILLKLSPDLSLDDMKDIVWVIKRKKSRVDGL 
IVSNTTVSRENIEKNKLAEETGGLSGPPLKARSTEMIAQMYQLTDGKIPIIGVGGVASGYDAYEKIEAGASYVQIYTALVYEGPALVEDIKAELS 

ALITRLGHTNVADVVGTNSKFYLPK* 
(SEQ ID NO: 1062) 

Name: D I H YDROOROTATE DEHYDROGENASE PRECURSOR 
Classification: enzyme 
Gene Symbol : Dhod 
FlyBase ID: FBgn0000447 

Celera Sequence No. : 142000013384507 

GCAAACGTGCGAAATCAACAGAGAGAACGAGTGAGCATGCCCACAGAGTGCGAGAGAGAGCAAGAGCGGGGGTGCGAGCGCAAAAGACCGACAAT 
TTATTTTACTTTTATTAACATGCGCGCAAAATATTTATATAAATTGATTACACATTTTCTTGTTCTTTTATCATCACCCCCTTTTTTATATTCAC 
CCTGGCAAATGAATTACGCTGCCCCAGCCAAATTGTTTATATATTATTTATCAATCGATTTTAGGGGCATGCCAAATTTGATCAGACGCCAGTCG 
CACACAAACACACACTTGAGCACACACATTCACAGACCAACACTCGCGCATACATACACACGCACCAGGGTGTTTGTTTTGTTTTTGTTTGCGGC 
TTTTTTTTTTATCAATTTGTGATCTTTTTTCGCAGCCGCAAATTTGATTGTGATTTACCGCCGATTGGAAATGTAAACAATGCTGTCGAACTATC 
AGCATAAGGCTGGCAGTTTTAAAATACTTTTCATATTTTATAGCTGTTGGGTACTCACACCTTAGCTTATTTGAAAAATTTACATATTACAAGCA 
CGCCGTTCTAGGTCTGCTTCTTCTTACGCGCGAACATTTAGTGTGACCGCAGCTCGACCGTTCAAATATACCATTGCGAAGCGCACTGAGAAACA 
CTCTCCAATAGTTCTAAGATACCAAAAATTTAGCATACTACTATATGTAAGTTATAATTTCATTTACTATCCCATTAGAGAAATTAATTGATTTT 
TATGTTCTATAATGCATTTAACTAGTTGCCCTAGTTTTAATGTTGTTAAAATAGCATTACAACAATTACCGTTTATAACTTCACTTTCTTCAGAG 
TGACCATATAATGTAAAGTACAGTGAACACTGTATATAAAGAATCAATACAACCATGGATTTACATATTTTTCCATCTGGTTTATATCAATAGAT 
AATATATGTTAAATGATATATATAAAAACTAAATTACACAAAATTTTTTTAGGTATAGTAGGTTTATTTAAACTTGTATACGATTGAAATTTCTG 
CAATC AAAT CAAATTTGTTGTTACAATAAAGTTCAGTTCTACTTCGGAACTTGAAATCTTCCCC AAAT AT GTTATTTATGTGTTATTT ATT TGGT 
AATTTCAATCCAAGTTGAAATCGTTTTCCGAGCTATGTGTCGTCATTACAGTGCAGTGCAGTTGTAAACGGTTTGTCCGATTTCTCAGCAATAAA 
TCGGAATATATATGTAGACAAAGTGTGTTGTTTTAGGAGGGGAAGGTTCTCTAAAGTCCAAGTAGGTCCTATCCATTTTTGTGTTTTCTTCTCTT 
ACTTGCAGATTTTTTTTCGTTTTTTTTTTTTTGCTTTTACAACATTCTTTTTTTTTTGTGTTGATTTTTGGGGAGGGGTGTGTGGAAATGTTTAT 
AAATTGTATGTTTTAGGCAGCTACCGATTCGGAATCGTTGTAGTTATCCGAAACATAACCGTCGTTGGCATCATTTTCGCCATCGGAGTTCCATT 
TGAGGCCATCTCCTTTGGTTGGTGTCCCATCCACGGCACGGTCCTCGTCGTTCGACTCCTTCTTGGAGTTGTGTTGGCCCTGATCGGCGGAATCA 
GCATCATCTGCATCGTCATCAATATTTTTCGCTGGGCGACTACGGGGCTTTGTTGTTTCCTCTTTGGACACCTTGCCAGCGGCAAGAGAGGGACG 
TCCACGTTTCTTCGGCGCCACTACCGTCTTCTCAGGCGAGCTGTGTTCCTCCTCCCCGGAGTTCTCATCGTCCTGATCGTCGTCATTGTCCTCGT 
GCTTCTCGACGGGTGCAGCATTGGCCTTTGGACGACCGCGCGGACGACCGGTGGGTACGTAGCCAGTACTGCCACCGCTACCCGACTTGTTCTGT 
GGTGGACGACCGCGTTTCTTGGGCACTTGTGGCTCCCCATCGCCATTGGACTCGACAGCCCGCTTCTTGGGTCTTCCCAGGCCGCGTCCCGTGCG 
CGATATGTTCAGGTTGACGGATCCAGCCGACGGTCGGCCCACTGGGCGACGTTCTTCGATGCTGCTGTTGCCGTCGTCGTCCTCATCCTGATCGT 
CTGTTTTGAAAGAAACAAAGCAATGTTAGTTAATAGGTATAATAATGAAATGAATCCAGGAATATATTTAAAGTAATATTTGTGAAGGTAAATCA 
ATGGTCTATCGACCATTTACATATTTTTGATTTTGATTGATTGCATGGTTGCGGAATGTTGAAATCATGACTCTACCTGTGATCTTCGAAAAAAA 
ATCACCAGGTAGCGTAAAGATAAAAGTGCAAGTTTGGTGGAAGCAAACAGTGCTTAAATTTCCATTAATTGAATTGAGACCATGAAATGTTGGGG 
ATTTACGTACTAACTGCACTACCATAGGCTCACCTTCATTCTCGCTATCGCTGGGCTGGTGCTTCTTGGGCCTGCCGGCCTTGCGCTTCTTGGCG 
GAACCGGGCGTCTTCACCGAATCCCCGGATCCTGATCCGGCGCCACCACTGCTCTTGGGTCGTCCCCTGCCCTTCGTCGGCGAGGGTGATGAGTT 
GTTTGCAAGCTCAGCGCCGGAGCCATCGCCATCGCCGTCCTCCTCCCCCTCGTCTTCGGGATCTTCGTCATTTTGGATTTTTGACGCCTTCGGTG 
GGCGACCCCGCTGTCCACCACCGCCACTGGAGCCCTTGTTCTTGGCGGGACGACCGCGCTTTTTGATGCCTGGCGAGATTGCTGCAACCGCCGCT 
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GTTGAAGATTTACCACCGACGGATGCCTTTGACGGTCGGCCGCGCTTCTTTACCGCAACTTCCTCCATTTCTCTGTACGGGATTCAATTAGCAAG 
CGAATAAAAAAAATCCTTGTTTCTTCGCGTCCCCTCACACACACACGCGCACAAATGCTCTCTCTCACGAACACACGCTCACTCTTTTCGCTTGG 
CTCCGCTCTTCTTCTTCACCTTTTTGTTTGCCGAAAATCAAGCTGGAAAAATATACAGAAAAACAATTTTGCATTAGGGCTTGAATGCGCATAAA 
TGCTGCAAAAAATATTCTCAAAAAATTCTTAAAAATTCTAGTTTTTCCAGATCCGAAAGGGAATTACTTGCGTTACTTACTTTGCCTTCTCAATT 
TGTAATTGTAAAATACGCGCACGCTAGTACAGTTCAAAGTAGTGATGTAAAAAAAAATATGTCGGTGCATCGAGCAGCGATAATTTTTGCGATAT 
TTTTTTCGATTTGATATCGATTAAGCGCTGCATCGATGCGTTCAAAATTGAGGCCGTTTAATTTAAGTTTGCGTAAAACAATTCCCTTTCATTTG 
GTCATATTGCATTTTAACTATGTATTAACATAAAATGAAACGTAGCTTTTGTGAAAAACTCTTTTTACTTTTGGAAATTGTAAGGTGTAAAAATT 

GTCTACTTACAGTTTAGGAATATCAAAAGAAACAAATTTA^ 

CGTGAACAACCAAGTTAAGCTATCCATATGTCCAAAAGTTAAACAAGTTTGAAGTGACTACATGAGACGGCTGGCCTACTATTTTAAAATATGGT 
ATGTTGTGTTCAAATTTTTTTTTTTGCTTGAAGCGCCAAGCTCCAAGTTTTTAAAACAAGAGAGTACAGTTTAGAAGCAACCGTTTATTAAATAG 
TTACACATATTGCAGTTTATTAGGCTAATCACAGTTGAATAATTAAATACTATTTAGTGGGCGCTCACTTCCACAGCTTGATGACACCGTTTCGC 
GATGCAGATGCTATGTAGATCTGGCCCTTGTCCTTGCACATCAGCAGATCCGTGATGGCATCGTGGTGACTGGCCGTTGGCATATCCGGCCCCGA 
GCGGGGATTCTCCTCCGTGGACGATCGCAGCTGCTGCGAGTCGCCCGTGCTGGCCATGGAAATGATTTGCTCCTCGATCACTTGGCTGCCATCGA 
TTAACCGGGCACTGAAGGGTGGAATGAATTGATAATTTTATTATATTATCATTATAATAACAGATCAATAATTACCCATAGTTGAAAGCCACATC 
TGCCAGGTTGTCATTGGCAGCGGGTACTTTAAGCGATGAGTTCTTTGGGTTGGTGATGTCCCAGTACCGGATCCGCTGATCCGAACTGCCGGTGA 

GAA 

(SEQ ID NO: 1063) 

Exon: 3183.. 3121 
Exon: 2987.. 2409 
Exon: 2090.. 1001 

Start ATG: 2823 (Reverse strand: CAT) 
Transcript No. : CT27541 

ACATCACTACTTTGAACTGTACTAGCGTGCGCGTATTTTACAATTACAAATTGAGAAGGCAAACTTGATTTTCGGCAAACAAAAAGGTGAAGAAG 
AAGAGCGGAGCCAAGCGAAAAGAGTGAGCGTGTGTTCGTGAGAGAGAGCATTTGTGCGCGTGTGTGTGTGAGGGGACGCGAAGAAACAAGGATTT 
TTTTTATTCGCTTGCTAATTGAATCCCGTACAGAGAAATGGAGGAAGTTGCGGTAAAGAAGCGCGGCCGACCGTCAAAGGCATCCGTCGGTGGTA 
AATCTTCAACAGCGGCGGTTGCAGCAATCTCGCCAGGCATCAAAAAGCGCGGTCGTCCCGCCAAGAACAAGGGCTCCAGTGGCGGTGGTGGACAG 
CGGGGTCGCCCACCGAAGGCGTCAAAAATCCAAAATGACGAAGATCCCGAAGACGAGGGGGAGGAGGACGGCGATGGCGATGGCTCCGGCGCTGA 
GCTTGCAAACAACTCATCACCCTCGCCGACGAAGGGCAGGGGACGACCCAAGAGCAGTGGTGGCGCCGGATCAGGATCCGGGGATTCGGTGAAGA 
CGCCCGGTTCCGCCAAGAAGCGCAAGGCCGGCAGGCCCAAGAAGCACCAGCCCAGCGATAGCGAGAATGAAGACGATCAGGATGAGGACGACGAC 
GGCAACAGCAGCATCGAAGAACGTCGCCCAGTGGGCCGACCGTCGGCTGGATCCGTCAACCTGAACATATCGCGCACGGGACGCGGCCTGGGAAG 
ACCCAAGAAGCGGGCTGTCGAGTCCAATGGCGATGGGGAGCCACAAGTGCCCAAGAAACGCGGTCGTCCACCACAGAACAAGTCGGGTAGCGGTG 
GCAGTACTGGCTACGTACCCACCGGTCGTCCGCGCGGTCGTCCAAAGGCCAATGCTGCACCCGTCGAGAAGCACGAGGACAATGACGACGATCAG 
GACGATGAGAACTCCGGGGAGGAGGAACACAGCTCGCCTGAGAAGACGGTAGTGGCGCCGAAGAAACGTGGACGTCCCTCTCTTGCCGCTGGCAA 
GGTGTCCAAAGAGGAAACAACAAAGCCCCGTAGTCGCCCAGCGAAAAATATTGATGACGATGCAGATGATGCTGATTCCGCCGATCAGGGCCAAC 

AC AAC T C C AAG AAG GAG T C G AAC GAC GAGG ACC GT GC C GT GG AT GGG AC AC C AACC AAAGG A GAT GGC C T C AAAT GG AACT C C G AT GGC G AAAAT 
GATGCCAACGACGGTTATGTTTCGGATAACTACAACGATTCCGAATCGGTAGCTGCCTAAAACATACAATTTATAAACATTTCCACACACCCCTC 
C C C AAAAAT C AAC AC AAAAAAAAAAG AAT GT T GT AAAAGC AAAAAAAAAAAAAC G AAAAAAAAT C T GC AAGT AAG AG AAG AAAAC AC AAAAAT GG 
ATAGGACCTACTTGGACTTTAGAGAACCTTCCCCTCCTAAAACAACACACTTTGTCTACATATATATTCCGATTTATTGCTGAGAAATCGGACAA 
ACCGTTTACAACTGCACTGCACTGTAATGACGACACATAGCTCGGAAAACGATTTCAACTTGGATTGAAATTACCAAATAAATAACACATAAATA 
ACATATTTGGGGAAGATTTCAAGTTCCGAAGTAGAACTGAACTTTATTGTAACAACAAATTTGATTTGATTGCAGAAATTTCAATCGTATACAAG 

TTTAAATAAACCTACTATACCT 
(SEQ ID NO: 1064) 

Start ATG: 228 {Reverse strand: CAT) 

MEEVAVKKRGRPSKASVGGKSSTAAVAAISPGIKKRGRPAKNKGSSGGGGQRGRPPKASKIQNDEDPEDEGEEDGDGDGSGAELANNSSPSPTKG 

RGRPKSSGGAGSGSGDSVKTPGSAKKRKAGRPKKHQPSDSENEDDQDEDDDGNSSIEERRPVGRPSAGSVNLNISRTGRGLGRPKKRAVESNGDG 

EPQVPKKRGRPPQNKSGSGGSTGYVPTGRPRGRPKANAAPVEKHEDKDDDQDDENSGEEEHSSPEKTVVAPKKRGRPSLAAGKVSKEETTKPRSR 

PAKNIDDDADDADSADQGQHNSKKESNDEDRAVDGTPTKGDGLKWNSDGENDANDGYVSDNYNDSESVAA* 

(SEQ ID NO: 1065) 

Name: CHROMOSOMAL PROTEIN Dl 
Classification: DNA_binding 
Gene Symbol : Dl 
FlyBase ID: FBgn0000412 

Celera Sequence No. : 142000013384504 

GAACTAAAACCAAAGAACAGGCAACTTGCCAATTCTT AT CAATGGAGCT AACT CAAAAAGT GAT AT CAGT AAGT AAAT AAAT CTT AAT TGAAC AT 
ATTTATGAAAAGATAATAGGGCGCCAAGGAATTAGTAAACGTTGGTGCTTTACAGCGTTATCGGACACGATCTGCCAATCCCAAAAAAATCACCT 
GTTACACCAAATGCAGAAAAAATTAAAAACAACCATGCGAGTTCTTTTTCTCATTTGCTAAATTATTAAGCTTTAGTTTAGCAAACTTGTTTTTT 
TTTTGTTTTAATTACTTGTTTGTGTTGGTTTCTTGTTTCATATAGTACATTCTATCTCGATCTATATATTTAATAATATATATACCATTACTATT 
TTTATGTTTTCTTCGTTTAAAATCATCAATTGAAGTTCAAGGATTTGCGCTTTCATCGTTTCGATCCGACATTGACAATATTTATGTCTAGATAT 
ATATATATATATATATAAGAATACATATATATATATATATATATATGTATATTAAGAGTTCCAAGAAAAATGATGTGTGAGAATTGTAACTTATG 
GACAAAAATTCAAAATTTTATCATTTGCTTAAATATTCTTAATATTTGTGGTTGTTTAGCATATTTTAAATATGTATGTGTTTGCTTGCATATCC 
TTCAACTACGCGCATCGAATTCATAAAATATAATGATGGGTGAGATGGAAATGGAGGGCGGCTTGGGCTGGCTATACATGAATGTAATATACTTT 
CGCGATGGACTTGCACAGTTTATTATTAACGCTTGCTCCTGGTTTTTGAGGTTGTGAGAGAGGAGTGAATTGCTAAACGAGGGTGGGGTGGAGGG 
TGCATCTGTGTATCTTTTCTGTATTTTCGTGATTTGTGTTGCGCTGCGCACAGACAGTTTCTCTTTCACATTTGCTGTTCCTATATAGTATACGT 
GTGTACACATATATAGTTGTTATCGACTTTGTTTTACTCCGTTTGGTTATTTGCGAAAGTGGCTACGATGCGACTGCTGATGCTGCTGCTGCTGC 
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TGGCCCAGGTCCTTTGCCTGAAGCTGAGGCCTTGCTCATTGAGCCCACCAGTCGGGTGCGTTCGAAGAGTGCGAGGCACTGCCGCCGCCGTAGGA 
GCCGCCATAGGAGCCACCACCGGCGCCGCCGCCATAGTATCCACCGCCGCCCGAATTGCCGCCTGTAAAGGAAATCCACTGTTAAGTTTCTGCCT 
TTAAACTGATTACTGCTGATGGATGATTCCACACTCACCAAACTTACCGTACGAGTTTCCATTGCTGCGGTAACTGCCGCCGCCGCCTCCTGATC 
CGGAACCACCGCTACGTGGTGGCGGTCCACTGCGTCCGCCACCGCCGCCGCCAGAGCTTTGACGGTAATCCCTGGAGCCGAAGCCGCCACCATAA 
CGTCCACCACCGCCTCGTCCAGCCCGCTTGGCTCCGCCATGTCCGCGATCCGAACTCATATCCTCCATGAAACTGGGGATCTCCTGCTTCGTTTC 
GATCAGCAGCTCCAAGAGATCGGAGCAAATGTTGCGGTTCTTTTCGTTGAAAAAGGAGGTGGCCACACCAAGATTACCCATACGTCCGGTACGCC 
CGATACGGTGGACATACTCCTCCACATCCGAGGGCAGGTCAAAGTTAATAACGTGCTTAACGTGCGGAATGTCCAGGCCACGAGCGGCCACGGCT 
GTGGCCACCAGGATGGGGCAGTCGCCCGATCGGAAGCAGCGCAGTGCCTCCTCGCGCTCCTTCTGCGTGCGATCACCGTGGATGCTAGTGACGGG 
ATGATTGCACTGGTACAGGAACTCCTCCAGCGAGTCTGCACCCTTCTTAGTCTCCACAAAGATCAGTGTTAGGCTGTCTTTGGTGTATTCGGGGC 
CATCGCGGATAGACGAGAGCAGATCCAGCAGATACGAGCGCTTGTCGGGTTCATAGACCCACAGGATTGTCTGCGTAATGTTCTCAGATGTGGAG 
CCCACACGACCAACGGCCAAGAAAATGTAGTTGCTGAGGAAATCGCTGGCTAACTCCTGGATCTGCTTGGGGAATGTGGCCGAGAACATAAGCGT 
CTGGCGCTGTCCGGTTGGCGGCATGTTTAGCTGTTCGACGATGCGACGGATCTGCGGCTCGAAACCCATGTCCAACATACGATCAGCCTCATCCA 
GTACAAGGAATCGAATGTTCTCCAGTCCCACCTTTCCGCGTGTGATCATGTCCTCTAGGCGACCGGGTGTGGCCACAATCAGATGGCAACCGCGG 
TCCAGCTCGCGCATCTGCTCGCTAGTATTGTTTCCGCCGTACAGCACTGCCGGGCGCATCCGGGAGCGATAGGCGAACTTCTTGGCCTCCTCAAA 
GATCTGGGTGGCCAGTTCGCGGGTCGGCGCCAGCACCAGACCCAACGGATACTGCTTGCGCCGGCTGTACTGCCGGGTGCTTTGCGGCGGGGGCA 
CATGTCCCAGCTCGTACATCTGGTTGAGAATCGGCACCAGGAAGGCGGCCGTCTTGCCGGATCCAGTCTGGGCGCAGGCCATTAGATCCCGGCCA 
TTGATAATGATCGGAATGGCGTGCTTCTGCACCGGTGTCGGTTTGTCATAACGTGCTAAGGCCACGTTGTTGCGGATAATCTCCGTCAGCTGCAC 
ATCATCGAACGATGTGATGTTCGGGGGCACATTCTGGCCCGTCGCCTCCACGGGTATATCCTCGTATTTGTCAAAATTGATTCCCGTATTGCCCA 
CGCCGAACAGCTCCACCTCCAGGCGCTCATCTCGGGCGCCCAGCTTTGTGTAGTCGACGTTTGAGCCGCCGCCCTCCTTCCAGCGGCTGTTGTAA 
CCGCCGCCTCCGCGCTCGCCGCGATTGTTGTAGCTTCTGTTGCCGCCGGCAGACTGTCCGCCCTCGCTGCCATCGAATCCGGCAGGACGTTCTGG 
CTCCTGCCAACGATCGTTGCGCGGCTGCTGCTGCTGCTGTGCCTGGCCATCCTCGGCAGTCTGCTCGTTGAGGTTGCTACCGCTTCCGCCGCCAC 
CGCCGCGGTACGATGGTCCGCGTCCAAATCCTCGACCTCCTCCGCCGCCTCCGCCGCTGCGATTCCAGTCACCGCCGCGACGCGAATCGAATTCA 
CCACCTGCAAAAGTAATTACATTTTCTCTAATAATATGGGCATATTAGAGGACCAAGTATAAGTCGTACGCATTACGATTTTAAGGATCATTACA 
TTATGGAGCTGTAAAAATTAATTAAATGTCCTTGTGCTTCTTGATAGTGAATAGACCACCTGTTAGCCAATAGTACCCACCGTTGTAGTTGTCCT 
CGAAACGTCCACCGCCTCCACGTCGTCCTCCGCCGCCGCTTCCGTAACCGTAGTCGCCGCTCTGGCGATTGAAGCCGCGACCACCACCGCCGCCG 
CCTCTGCGGTATTCGCCTCCGCCTCCGCGCGACTGTTGAGGGTCCCTAGTCTCCTGGCGCGGATTCCCGGATCTGGAGTCGAAGCCCTGACCCTG 
GCCCTGTCCTTGGCCCTGGCTCTCTGCGTCCGCTGCGTTGTTATTGCCACCACCACCACGAAGGTGCGGGGGCACATACACACCACCGGTGACCG 
AGTTAGTGGAAGTCTTGCTGGTTATGGGGCCGCTGTAGTCAGCAGAGCCGCCATTCAAGTCCAGACCAGCAACCTGCAAGTGATAAAGATTAAAT 
GGTTAGTCTTCTTCAGAACTTGTGCAAAAAATGTGGCATCAAATGTGGTTTCCTAACACTTAAATTATAGGTTTTGTTCTATTTTTTTTGTACTT 
AATCATTGGTTTAAGATTACTTTTCTTTGTTATATACCAAGTTTACGGGAACACAGTGCATTTAATGATAAGATCTATACCAGCAATGCAAAACT 

AAGCT AC AC AAAT AAT CTTTGCTC C AC T AAT AC AAT T G AT T T AT AGT AC AC AT GGC C AAT T AACAT AT GT T GC CTGT AAGT T AGC C GG C AC C AG T 

TCAAGAATCAAGTTCAACAGTTTGGCAAAAGATTCGCCGGGGAAATTTTCTCCAGAACTGCCAAAATGAATTCGTTCCCGCTTAGCAACATGTGC 

TATTTACCGCTCTTCCTCTTAGCAACGCAACGTGTATATATACATAGATACATATGTATGTACGTGCAAATCTCCGGAATACACAGATAGAGAGA 

GGCGCAACTAGTTTAAAGCTTCGAATTTAGTGCTCTCCCTGTAAGCGTCTCTCTCGCACTCGCAGGCAAGCTATTACACAAAGAGAGTCCGCCTG 

GTAACCCAACAAAAACAACACGTGAACGAGCCACCCACACATGGGAAAACAAATCGCACCGTAGGGGCAAAACACAAAGCCCGGTTGCGGGACCC 

AAGTGGACAGAATGGTGATGAAAGCAAGGGAACTCTTTCAAGAACTAGTTTAAATGTATACAATGATTATTATTCCCGTATATCTTGTAATATTC 

AATATTTTGCCTTCTTCATTTATATTAATTAACAAAGGCAGCTAAATATGTAATTCGCAGATTTTGTGGGGCGTCACTTAACGGCTAGATCGTTG 

CTAGATTCATTGCGTGCATTTGAGGCATACAGAAATAGTAGTATTTAACTAAACTATTTCCATTTGAAAGCATTAACGTTTGTTTATTTCACATC 

TCTAATAAAGTTTAATTCTTTTTAAGGACCTAACAAAGGCTTGTTAAAATAGTGTATGTAGATTACATGCTTTGCTTTTCTTCCATTTTTTTTTG 

CCAAATCACCCTTGAAATTAATCACCAGATCGCTTACTCCTGGCAACCCTATTTACAACAGCTTCCACATGTGTGAGCAAGCGAGAGAGAGCGAG 

ATATAACACACACACACGACCAGCCTTAGAACAGCGGCACTGCTCGCACACACGACACACTCAACGTGAGAGGCCGAATGGAAACTTTCATATGT 

ATATATTATACATGTCAAATTGGAGGCAAGCGAGCAAAAATTATACAAAAGAAATTACAAATTGGAATGACGAAGAGGGACCCAGGCGCACACAC 

ACACGCACATATATGCAAGCCATTGACACCGACACAATTCACCGTTGTAAAACGAGCGATAGAGACATTAGTGTGTTATGAAATCAATAAATTTC 

TAGGCGAACTTTTTCAAATAGCCGCATGAGCATCACGAACACACACACACAGCCACCGCATACACATGCGTGTTCCATTCTTATATAGATGGTAA 

ATGGTATTTTCAGCAGTTAAAAACGCTACTTACTTGCTGCTCTAGACCTGTGCCATTTTGGTTAATAGCATTACTCATATTATCCTCCAATCAGT 

TGCTTTTTTTTTCTTGCTTTCCGTTTAGGAAAATTTTGCAATGGTTACGCTTTGAATTTTTTTTTTTGAATGCACACTTGCAATTTGCGACAAAA 

AAATGTATACGATCGTTGTGTTGGTGTTGCACTTTGTGTCTATACGTTTTTCCTTAGTTTTTCCGACAAAAAACTTCTTACGTTCCACAAGCTAC 

TGCGTTCTCGTGTGAAATTTCAGGGTTCTTTTCGTTCCTCGCACACAATTCCCTTTGTTCGGCTTGAAACTGAGACGTTTCCGTATCGGATTTGG 

CTGGCCTGCTTGTGCCAACAATTACAATTACAAGCGCTATGGGCTTGGAAAATGCTGCTTCTCCGACGAGTTCACGTTTTTTTCCGCTTTCAGAC 

GCGTGTCTAAAAAATCAGGGCTGCTTTAGCACGTATATCGATGAGGAATCGATAGATGCGGTCGGTGGAGGTAGAGCAGAGACTCCAACCTGGGC 

TCTCGGCGGCGGTTACTGGCAAAGACTCCGGCAAGGGCGTTGCCAGATCTATATCCTCGTCTCGCGGTCTATGAAATTCAAATTGAATTTTTTCA 

CTTTTGCTTTTTCGGCGCGTCGATCCCTTCTAGGCTGTTGTTGTTTTTTGAATGGAAGAGAAATGAGGCTGGGCAGTTAAATTTTGTTTCTTAAA 

GGCCAAAAATGTTTGTGTTTATGTTTAGGAATAGTTGTTTATAAGGCGTCAGCCGCTGTACCCCATTTCGTTATTTGAAGCAAAAGTCAAAGAAA 

TTCAAATTAAATTTGAAATTTAGTATATATATGTGATTGCCGTAAAATAGGTATTAAAAATTTCCAAACAATTCCATACTCAAGTTCCCTCCAAG 

TCTGTTAAAGAAAACTAAAATTCGAAGTTTCGAACTCTGGAATATATTCATTCACTTTCGTCTTCAGAATATACATATTGTGAATGTACATGTTT 

ATATAGAATAAATGCTTTTGGTATTGGCCTTAGCTTGAGATTAGTTAGTGGAACAATAAATATGGCTATAAAGTATCGTATATATGTATATAATA 

ATAATTTTCATATCGTTCAGCCCTTGAGCAGGCAGCGGGAGTAGAACTCCCGGTCGATAGCGTCGTTTGGAATGTATGAGCTCCACTTGAAGATC 

AGCGCCAGCGTGTCCCCGGCGGACATGTGATCCAGAGCGATGTTGAGGAAACAGTTCCAGTTGTAGAAGAAGTGGCGTCCCAACTGTTCCTGTTC 

CACGCCAGATATGCACTCGCAGTTGGCGCACACCAGTCGTCCGCCGCGTTGCAGTTCCACGAACTGTTCGAAGCAGCGGCCCCATGCGCTCACGC 

TCAGTTGTTTGCCCACGTCAGCCAGAAGTTCGGGGAGTCCGTAGTTGAGGACGATGGCCACCACTTTGGTGAGATTCTGTTCGGCTAGGTAAAAT 

TTAGCTAAGACGTCTAATAGTGCCGGCACTGATTGAACAACATCAAAGCACTTCAGTTGCTCGCCACGGGACCAGAAGTATCGCAGGAGCACTGC 

TCCAAGCTCATGATACTGGCAGGAGGTATCGAAGCGAAGGGGGAACGAGCAATGCTCACACTTGACAATCTCCGCATTTTGCGAAGTATTTAGCA 

GCACAAATCCGGCAGCTATGTGATCCCGCGTAGCGTCGTCCACTTCCCAGATCATCTCCGGATCCAGGTAATCGAAATACATCTCGTAGCAATTG 

CGTTGCGCTTGGTC 
(SEQ ID NO: 1066) 

Exon: 6044.. 5259 
Exon: 3683.. 3311 
Exon: 3139.. 1283 
Exon: 1202.. 1001 

Start ATG: 5303 (Reverse strand: CAT) 
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ACAGCGGCTGACGCCTTATAAACAACTATTCC 

TCTTCCATTCAAAAAACAACAACAGCCTAGAAGGGATCGACGCGCCGAAAAAGCAAAAGTGAAAAAATTCAATTTGAATTTCATAGACCGCGAGA 
CGAGGATATAGATCTGGCAACGCCCTTGCCGGAGTCTTTGCCAGTAACCGCCGCCGAGAGCCCAGGTTGGAGTCTCTGCTCTACCTCCACCGACC 
GCATCTATCGATTCCTCATCGATATACGTGCTAAAGCAGCCCTGATTTTTTAGACACGCGTCTGAAAGCGGAAAAAAACGTGAACTCGTCGGAGA 
AGCAGCATTTTCCAAGCCCATAGCGCTTGTAATTGTAATTGTTGGCACAAGCAGGCCAGCCAAATCCGATACGGAAACGTCTCAGTTTCAAGCCG 

AAC AAA GGGAAT T GT GT GC GAGG AAC GAA AAG AACC CT GAAAT T T C AC AC G AG AAC GC AGT AGC T T G T GG AAC GT AAG AAGT T T T T T GT CGG AAA 
AACTAAGGAAAAACGTATAGACACAAAGTGCAACACCT^ACACAACGATCGTATACATTTTTTTGTCGCAAATTGCAAGTGTGCATTCAAAAAAAA 
AAATTCAAAGCGTAACCATTGCAAAATTTTCCTAAACGGAAAGCAAGAAAAAAAAAGCAACTGATTGGAGGATAATATGAGTAATGCTATTAACC 
AAAATGGCACAGGTCTAGAGCAGCAAGTTGCTGGTCTGGACTTGAATGGCGGCTCTGCTGACTACAGCGGCCCCATAACCAGCAAGACTTCCACT 
AACTCGGTCACCGGTGGTGTGTATGTGCCCCCGCACCTTCGTGGTGGTGGTGGCAATAACAACGCAGCGGACGCAGAGAGCCAGGGCCAAGGACA 
GGGCCAGGGTCAGGGCTTCGACTCCAGATCCGGGAATCCGCGCCAGGAGACTAGGGACCCTCAACAGTCGCGCGGAGGCGGAGGCGAATACCGCA 
GAGGCGGCGGCGGTGGTGGTCGCGGCTTCAATCGCCAGAGCGGCGACTACGGTTACGGAAGCGGCGGCGGAGGACGACGTGGAGGCGGTGGACGT 
TTCGAGGACAACTACAACGGTGGTGAATTCGATTCGCGTCGCGGCGGTGACTGGAATCGCAGCGGCGGAGGCGGCGGAGGAGGTCGAGGATTTGG 
ACGCGGACCATCGTACCGCGGCGGTGGCGGCGGAAGCGGTAGCAACCTCAACGAGCAGACTGCCGAGGATGGCCAGGCACAGCAGCAGCAGCAGC 
CGCGCAACGATCGTTGGCAGGAGCCAGAACGTCCTGCCGGATTCGATGGCAGCGAGGGCGGACAGTCTGCCGGCGGCAACAGAAGCTACAACAAT 
CGCGGCGAGCGCGGAGGCGGCGGTTACAACAGCCGCTGGAAGGAGGGCGGCGGCTCAAACGTCGACTACACAAAGCTGGGCGCCCGAGATGAGCG 
CCTGGAGGTGGAGCTGTTCGGCGTGGGCAATACGGGAATCAATTTTGACAAATACGAGGATATACCCGTGGAGGCGACGGGCCAGAATGTGCCCC 
CGAACATCACATCGTTCGATGATGTGCAGCTGACGGAGATTATCCGCAACAACGTGGCCTTAGCACGTTATGACAAACCGACACCGGTGCAGAAG 
CACGCCATTCCGATCATTATCAATGGCCGGGATCTAATGGCCTGCGCCCAGACTGGATCCGGCAAGACGGCCGCCTTCCTGGTGCCGATTCTCAA 
CCAGATGTACGAGCTGGGACATGTGCCCCCGCCGCAAAGCACCCGGCAGTACAGCCGGCGCAAGCAGTATCCGTTGGGTCTGGTGCTGGCGCCGA 
CCCGCGAACTGGCCACCCAGATCTTTGAGGAGGCCAAGAAGTTCGCCTATCGCTCCCGGATGCGCCCGGCAGTGCTGTACGGCGGAAACAATACT 
AGCGAGCAGATGCGCGAGCTGGACCGCGGTTGCCATCTGATTGTGGCCACACCCGGTCGCCTAGAGGACATGATCACACGCGGAAAGGTGGGACT 
GGAGAACATTCGATTCCTTGTACTGGATGAGGCTGATCGTATGTTGGACATGGGTTTCGAGCCGCAGATCCGTCGCATCGTCGAACAGCTAAACA 
TGCCGCCAACCGGACAGCGCCAGACGCTTATGTTCTCGGCCACATTCCCCAAGCAGATCCAGGAGTTAGCCAGCGATTTCCTCAGCAACTACATT 
TTCTTGGCCGTTGGTCGTGTGGGCTCCACATCTGAGAACATTACGCAGACAATCCTGTGGGTCTATGAACCCGACAAGCGCTCGTATCTGCTGGA 
TCTGCTCTCGTCTATCCGCGATGGCCCCGAATACACCAAAGACAGCCTAACACTGATCTTTGTGGAGACTAAGAAGGGTGCAGACTCGCTGGAGG 
AGTTCCTGTACCAGTGCAATCATCCCGTCACTAGCATCCACGGTGATCGCACGCAGAAGGAGCGCGAGGAGGCACTGCGCTGCTTCCGATCGGGC 

G AC T GC C CC AT C C T G GT G GCC AC AGC C GT GGCCGCTCGTGGCC T GG AC ATT C C GC AC GT T AAG C AC GT T AT T AACT T TGAC CT GC CC T CGG AT GT 
GGAGGAGTATGTCCACCGTATCGGGCGTACCGGACGTATGGGTAATCTTGGTGTGGCCACCTCCTTTTTCAACGAAAAGAACCGCAACATTTGCT 
CCGATCTCTTGGAGCTGCTGATCGAAACGAAGCAGGAGATCCCCAGTTTCATGGAGGATATGAGTTCGGATCGCGGACATGGCGGAGCCAAGCGG 
GCTGGACGAGGCGGTGGTGGACGTTATGGTGGCGGCTTCGGCTCCAGGGATTACCGTCAAAGCTCTGGCGGCGGCGGTGGCGGACGCAGTGGACC 
GCCACCACGTAGCGGTGGTTCCGGATCAGGAGGCGGCGGCGGCAGTTACCGCAGCAATGGAAACTCGTACGGCGGCAATTCGGGCGGCGGTGGAT 
ACTATGGCGGCGGCGCCGGTGGTGGCTCCTATGGCGGCTCCTACGGCGGCGGCAGTGCCTCGCACTCTTCGAACGCACCCGACTGGTGGGCTCAA 
TGAGCAAGGCCTCAGCTTCAGGCAAAGGACCTGGGCCAGCAGCAGCAGCAGCATCAGCAGTCGCATCGTAGCCACTTTCGCAA 

(SEQ ID NO: 1067) 

Start ATG: 742 (Reverse strand: CAT) 

MSNAINQNGTGLEQQVAGLDLNGGSADYSGPITSKTSTNSVTGGVYVPPHLRGGGGNNNAADAESQGQGQGQGQGFDSRSGNPRQETRDPQQSRG 
GGGEYRRGGGGGGRGFNRQSGDYGYGSGGGGRRGGGGRFEDNYNGGEFDSRRGGDWNRSGGGGGGGRGFGRGPSYRGGGGGSGSNLNEQTAEDGQ 
AQQQQQPRNDRWQEPERPAGFDGSEGGQSAGGHRSYNNRGERGGGGYNSRWKEGGGSNVDYTKLGARDERLEVELFGVGNTGINFDKYEDIPVEA 
TGQWVPPNITSFDDVQLTEIIRNNVALARYDKPTPVQKHAIPIIINGRDLMACAQTGSGKTAAFLVPILNQMYELGHVPPPQSTRQYSRRKQYPL 
GLVLAPTRELATQIFEEAKKFAYRSRMRPAVLYGGNNTSEQMRELDRGCHLIVATPGRLEDMITRGKVGLENIRFLVLDEADRMLDMGFEPQIRR 
IVEQLNMPPTGQRQTLMFSATFPKQIQELASDFLSNYIFLAVGRVGSTSENITQTILWVYEPDKRSYLLDLLSSIRDGPEYTKDSLTLIFVETKK 
GADSLEEFLYQCNHPVTSIHGDRTQKEREEALRCFRSGDCPILVATAVAARGLDIPHVKHVINFDLPSDVEEYVHRIGRTGRMGNLGVATSFFNE 
KNRNICSDLLELLIETKQEIPSFMEDMSSDRGHGGAKRAGRGGGGRYGGGFGSRDYRQSSGGGGGGRSGPPPRSGGSGSGGGGGSYRSNGNSYGG 

NSGGGGYYGGGAGGGSYGGSYGGGSASHSSNAPDWWAQ* 
{SEQ ID NO: 1068) 

Name: DEAD box helicase 
Classification: RNA_binding 

Celera Sequence No. : 142000013384638 ^^^^ 
AAGTTCTTTTTTTTTGAGCGTCCTAACTAGGGAGGGTGCACTGTACATACAGCTTGTATGACTTTATTTTAAATACGTGATTGGATTGTAAATGG 

GAAAT AAATTAATAATTTTCAAGCGTAAACTGAGATTAGTACACATCCTTTTTGACACCACTGTTTCAAAAGGTGACAAATCCTTTAGGCCGCCA 

CTGTAAATGGGAGTAGGCTCCTTTGTCCCTCCACTGTTTGCACATACACACACGCAGAAGTTGAGTAAAAATTTCACAAGTTTCTTGTCGAAAAA 

CGCAATTTCTGTGCAACAACAACCGATTAATTGCAAGTAAACTGCGCATTTGTGTCTAGCCTTTTTAAATGATATTTACTTTTTTGTTATACTAA 

GTTACGAATTTCTATAAGTCACAAGAAATCATCACATTTTATACGAATGATGAATTTGTCTTAATCACGCTTTCGTTTCTCATTTCAGTTAAGTT 

TTTACTCTGTTTGTGAGTCAGATTTAGCTAGGAAAGTACCTTTTATCAGTACGGGAAAAAATCTTCGTTAATTTTATACTGCCAGACGACGTTGT 

TTTTGTCGATGGTTAGTGATGGCAAACTTTGGTTTGGCGAGCCCATTAGTGGTGCAAAAAAGCACAAATCTAACGAACACCAGAAAAACTTGTTC 

GTGTCACGGAGCAGCATGCACGAAATATCGCTAAGAACTATTTATATAGTGACTGAATTCAAGTCTGTGTATATTGACAATGTCTAATAATAAGC 

TTATAATTCTCTTCCCACGAATTACAAAATGGATTTTATATAAAAATTTAGATCGTAATAGCCGTTGTGAATTATTTTAATCTATTGTTCGCCAC 

ATCTTCTTAACCTAAACAATTTATTTATTTTTTTTAAACATTTTTAGTGCTTGAGCAGCAATAATCAACTCACTCTGCCATCTCTGCTGTTATCG 

ATATATCAAAACAGAAAAACGGTCTGCAAAAAGTGTGACCGTTTGCCGTGCTGCTCGCAACTTTGACACGACCAGTTCCAAAATATCGTGAGATT 

TACGCAAAACTACTAGAAAATGGTCGGTTGGAGCCGTTTGCTGTCGCCGGCGGCGAAGTTCGCCAGCTACCGGGCTGTCCTGCAAGAGCCCGCGT 

GCCGGAATGGTGAGCATTTCTTGAAAATATCCGCCGTAAACACACTTCACATTACGCAACCATATATTCTGGATTTCTAGCCCTGCACCAACAGC 

TGCGCAGAATGGGAGGCGATCATGGACACCACCAAATGATCATCAAGCCATCCCGCTTCCAGTGGGACAAGTTCAAGGATCTGTTGCACTTTTAC 

GTAATGCTCGGCGTCATCCCAGTCACCGCATTGGTTCTCTACGCGAACATCTTTGTGGGACCCGCGCAGCTCGCCGAGATTCCCGAGGGCTATGA 
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GCCCAAGCACTGGGAGTACGAGAAGGTATGATGGGCTCGGAAAGGCAACCAATCGATACTAAGATTGTAATTCCGAGATATCTCTTTATTATCTA 
GCACCCCATCTCGCGCTTTATTTCCCGCTACATTTTGAACTCGGATCAGCAAAACTACGAGAAATCCCTGCACTATCTCTACGAGGAGAACGAAA 

AAGC C C AG AT T CGGT AAT G C G ACGG AAAC AAC C C AG T T AT AT CC C AT T G T AAAAC AAT AT AT T T C C T TT C AGAC T C C T T G AGG AC G AAGT AC GT C 
GCAAGATGTCGGAGCGCAATGATTACCAGGCCTACTACTACCGACCATCGGTGGCCAAGTACCACAGGATTTCGAAAGAGGCTGCTGACGAGCTG 
GAGGCTCTGCGCGGAGACTAGAACCCTTGTAATATTCTTTAAATACCATTGTTAGGATCCCGTTTTGTGGGAAATAAAGTGTGAAATCGACTTCG 
CAACATCATCTTATTCACATTTAATAATATAAAAATTACACAGTAGTCGAATAAATACTTAAGACTTAGCAAAGTCATGTGCGTATCGATTTAGC 
CCCACGCCTGATTCTTGCAGTAGTAGAGCACCAGTTTCCCATTGGTTCCATGGCACTCGTACAAGTTCTTGATTATCTGATCCGGGGCAGGTGCA 
AATGTCTGATTAACGTACAGGAACTAAAAATTAAGATGACACATAAGATAGGGTTCTCTGGAACTTCATTTGTTACGTGCAATTTGCTCGCTGGC 
ATCGAGTTTCAGAAACTTGTGGATGAACGTCTGTATCCAGCCGACTGTCTTGTTGGGATCTACGGTCCAGGTTCGCTTTTTGATGATGGGCACAT 
TGCCAGTGGCGTTCAGAAGGATACAAACTGATAATGAAGCAAGTCTTTGTGATAAAACGAATGCAAAATATATAAAGTTAACTTTGATTCTCACT 
TTTGGAACCATCCTTATCTGCAGGTGTGGAGGAGGAAGTGCTCAGCGCTGCCTGGGATTCTGGTGTCTCTGCCATTTATTTGGAATGTAACTAAA 

T T AAAT AGT T C CT AAT T AAT G AAT T AAAAC AGCC AAT AAT GTAAATAAAAT AT AAT CT GT T AG AG AT C AG AAT TT T C T G AAG AAAT T AT C GAT AG 
GTGTCACGAGATTAGCGATTCAGCTAATTATTTGAGTAAAATATGCTGCATATCATCTGATTTTGTTTTAAGGAATTATTTACTCATTTTCTTGT 
CATATTTTTTATTTAAACATATTTTTATATTTGTCTATCACTTTGAGACTTTCAAAATTCGTTGTTAATTGTTCCACCTTTAGCGTTAACAGCGG 
CTATGTTATTTTACATTCGACTGTTATTGTAACACGTGTGGAGTTTATAAAACAATCCAGGGAAATAAACAAAACTATGAATAATAATCTTTT 

(SEQ ID NO: 1069) 

Exon: 1001.. 1149 
Exon: 1221.. 1450 
Exon: 1522.. 1628 
Exon: 1688.. 1848 
Start ATG: 1065 

CTGCTCGCA^ 

CGGCGAAGTTCGCCAGCTACCGGGCTGTCCTGCAAGAGCCCGCGTGCCGGAATGCCCTGCACCAACAGCTGCGCAGAATGGGAGGCGATCATGGA 
CACCACCAAATGATCATCAAGCCATCCCGCTTCCAGTGGGACAAGTTCAAGGATCTGTTGCACTTTTACGTAATGCTCGGCGTCATCCCAGTCAC 
CGCATTGGTTCTCTACGCGAACATCTTTGTGGGACCCGCGCAGCTCGCCGAGATTCCCGAGGGCTATGAGCCCAAGCACTGGGAGTACGAGAAGC 
ACCCCATCTCGCGCTTTATTTCCCGCTACATTTTGAACTCGGATCAGCAAAACTACGAGAAATCCCTGCACTATCTCTACGAGGAGAACGAAAAA 
GCCCAGATTCGACTCCTTGAGGACGAAGTACGTCGCAAGATGTCGGAGCGCAATGATTACCAGGCCTACTACTACCGACCATCGGTGGCCAAGTA 
CCACAGGATTTCGAAAGAGGCTGCTGACGAGCTGGAGGCTCTGCGCGGAGACTAGAACCCTTGTAATATTCTTTAAA 

(SEQ ID NO: 1070) 
Start ATG: 65 

MVGWSRLLSPAAKFASYRAVLQEPACRNALHQQLRRMGGDHGHHQMIIKPSRFQWDKFKDLLHFYVMLGVIPVTALVLYANIFVGPAQLAEIPEG 
YEPKHWEYEKHPISRFISRYILNSDQQNYEKSLHYLYEENEKAQIRLLEDEVRRKMSERNDYQAYYYRPSVAKYHRISKEAADELEALRGD* 

(SEQ ID NO: 1071) 
Classification: enzyme 

Celera Sequence No. : 142000013384438 

AATATGCTAACAAAGTGGATGTCAATTCGACATGCTTAATAAAAACTTACAACTGAGTTCATAAATTTTTGGGAGGAACCAGTATCTGCTAATGT 

AACACTGTAACTAATGTTATTGATTTAAAAAAAAAACTTCCGCATTCTGAGATTTAAATTTCGCCCAAAAGTATGCATTACGAATATATTTAAAT 

TGGCCACACAAACTTTCATATGAAAAGTTTTTTTTTTAATTAATTTATATATTAAGGCATACGGGGAAATCTTGGAGCCCCTTTGTGTCCTTCTT 

TCATATGAAAAGTGTGTTTTTTTATGCTCTTTAGAGTACCTTTTATTCTTCACCAGGATCAAGAATCAGCGCAGTCGATCTGGCCATGTTTGTCC 

GCATGAACGTCAAGATCTCATGAACTATAAATGCTAGAAGATTAAGGATGCAGATTGCACAGAAAAGTGTAGGCGTAGCTCAAGTTTATCCTGGT 

GCTACCAAACAATTGAAAGTTAAGTAACGCATATGATGCAACGAGGTGCTGGTTATAAAATGTAAGAAACTAGTTTTTAATAGAATGTGTTGTCA 

TATTTCGTCACAAAATTTGATATTATAACTTTTGTGTGCCGAAGGTACGATCATCAAAGTTTGATATTAAAGAATATAAATATTACTATTCTTAC 

AATAAACCCAGCACCGATGCGACTCCCTGAAACCCTATAGAATAGTGAAGCTCACCAAAGACTGAAACACAACATTTTGTTTTAGCAGTTGGGTG 

CTAAATTTTCAGTTGCACATGCATTTATTTTGTTCCTGGTGATGAATACTGTTGTGTTTAAGAGTCTTCAGTAGATTCGAATGCTCCTACAAACG 

CCACATGGTCGTTTAGTGTAGGTGTCACAGGATTCATGCTGATTCTTCAATTGAGTTATAAAAAAGATAATCCCATATATATTTTCTTTAGTATG 

GAATGTCATTCTGACCAGAGTAATCGATACAAGTTTTGGTTTTTTTTTTTGGCGATCAGAGCACTTTATTACGTAGACCCCCATTGATAAAAAAC 

ACAATCAGTTTACAATGCCACCCTTAGTAGATATTACCACTTCAACAAACGCCGTCGTTCTAGGCGGTGGGCAGGAAGGTGAGCTTTTCGTTAAG 

CTGCAGCTGGGTTTTGATGAGCTGGGAGAGCGTGATGACCAGCAGAAGGTTGCGTACGTTGGCGTTGAACATCTGAGTGAACTGCTCGTGGGTCA 

TGTGCGGCACGGAGTGGATAAGGTCTAGCAGCTGGCGTCCGACGGCGTTGTCGGGGGTTACCTTGTGAGCGATCACGTCGTCAACGTACTTGAGA 

ATAAGATCCAGCAGGGACTGCAGCTTCGTGGAGGCCTCCGAGATTTGGGCCAGGTCCAGCATTGGCGGAACGGTCTTGGGACGGTGAGCGGGCGA 

AACGCCTACGGTCTTCTGAAGCAGCTTGAGCCCAAATGTCTCCGGCTCGTAGCTGGTCAGCTCGACAGGAATGGGCGTGAACATGCAGCCGCTCT 

TGCCGCCGGGAACTCCTAGTTGAATGCAGACATAAGCACGCAGACCCATACGTCCGCCCTGCAGCGAAGTGTCCACGGTGAGGTGAACCGGGTTG 

TTGCATTCGCGGGCGTAATACTCGTGGATCACCGAGCTGTGGTTGGTAACGTCGTTGCCGGTGGCCCACCAGCCTACCACGCTCTCGTTGGAGTT 

CACCTTGCGGTTGAGGTCGTACATGTCCAGGGCGTAGCTCAGCTCTGCCTCCACCTGGTCATCGTGCTCCTTGTGCGGCACACAGAAACAGTTGG 

TAACCTCCACCACACCTTTGTCCACAGAGCCGAGCAGGGTGCCGATGACGCGGTGCGAGTCCGCGTTCCGCCGCTCGAAGGCGTCCACCACCTGG 

AAAAGCACCACCGGGTGCACGCGCACGGTCAGATTGAGAGCCGACATGGTAATTTTCGGTGGCCAAGATTTGCAAGGAAAATTTCCGCAATAATT 

TCTATTCAGCCACGCTGCGCGTATGTTTTAGGGATGCAAGAAAAGGGCCTATCGATAGTGCGGGTGGGGGTTGATGGCTTTTGGAGCTATCGGTG 

CGGTCGTGCGTGGCCTATCGTTTTATCGATGTAAAACCGGTGGCTAGCGTTAGTTAATCAAATACTATTCAAATTTGAATATGTCGGAGATGCCA 

AGCGCGACTTTTCATTTACTTCAGCGTCCATATTGTCCAGCACATCAAGTCGCCTGCGTGTTGTTTCTTCTGTCGTTCTAAGCAGACTAATTTAC 

TCAAGCGTCGCCTTCGCGATGCTTTTCTTCTATTCCTCACCTTCACTCAATTATTTCATCCTTACTTCGTTTCCAGAAACTATACAACAACAACA 

ACAGCCACACAAATGATGCCCACTCAATAACGGAACGCTTTCCGTGAATTTCATTGCTCGCTGCTCATTCTTAACATAACGGATCAATAACAAAA 

TGTCGGTTACATTCTACTACTCAATCTTGCTTGTGAAATTTTGCTGATCAAACGTGCTTAAAGCGAATTATTAAATTTAATAAAATGCCTGGAAA 
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GAGATAAACTTTGAAGTTACCCAATTAATAAACTATAACCACCAGTTGGGAAAATCTTTTCCAGAATAAGTATAAATGTTTTCTGTATCCCGTAA 
GACCGTCTACTATTTTTTAAAAGGCTCGGAAAAAGAGGACAGGCTTGAACCTAAGA 
(SEQ ID NO: 1072} 

Exon: 2058,. 1001 

Start ATG: 1947 {Reverse strand: CAT) 
Transcript No. : CT27 609 

CCGCACTATCGATAGGCCCTTTTCTTGCATCCCTAAAACATACGCGCAGCGTGGCTGAATAGAAATTATTGCGGAAATTTTCCTTGCAAATCTTG 

GCCACCGAAAATTACCATGTCGGCTCTCAATCTGACCGTGCGCGTGCACCCGGTGGTGCTTTTCCAGGTGGTGGACGCCTTCGAGCGGCGGAACG 

CGGACTCGCACCGCGTCATCGGCACCCTGCTCGGCTCTGTGGACAAAGGTGTGGTGGAGGTTACCAACTGTTTCTGTGTGCCGCACAAGGAGCAC 

GATGACCAGGTGGAGGCAGAGCTGAGCTACGCCCTGGACATGTACGACCTCAACCGCAAGGTGAACTCCAACGAGAGCGTGGTAGGCTGGTGGGC 

CACCGGCAACGACGTTACCAACCACAGCTCGGTGATCCACGAGTATTACGCCCGCGAATGCAACAACCCGGTTCACCTCACCGTGGACACTTCGC 

TGCAGGGCGGACGTATGGGTCTGCGTGCTTATGTCTGCATTCAACTAGGAGTTCCCGGCGGCAAGAGCGGCTGCATGTTCACGCCCATTCCTGTC 

GAGCTGACCAGCTACGAGCCGGAGACATTTGGGCTCAAGCTGCTTCAGAAGACCGTAGGCGTTTCGCCCGCTCACCGTCCCAAGACCGTTCCGCC 

AATGCTGGACCTGGCCCAAATCTCGGAGGCCTCCACGAAGCTGCAGTCCCTGCTGGATCTTATTCTCAAGTACGTTGACGACGTGATCGCTCACA 

AGGTAACCCCCGACAACGCCGTCGGACGCCAGCTGCTAGACCTTATCCACTCCGTGCCGCACATGACCCACGAGCAGTTCACTCAGATGTTCAAC 

GCCAACGTACGCAACCTTCTGCTGGTCATCACGCTCTCCCAGCTCATCAAAACCCAGCTGCAGCTTAACGAAAAGCTCACCTTCCTGCCCACCGC 

CTAGAACGACGGCGTTTGTTGAAGTGGTAATATCTACTAAGGGTGGCATTGTAAACTGATTGTGTTTTTTATCAATGGGGGTCTACGTAATAAAG 

TGCTCTGATCGCC 

(SEQ ID NO: 1073) 

Start ATG: 112 (Reverse strand: CAT) 

MSALNLTVRVHPVVLFQVVDAFERRNADSHRVIGTLLGSVDKGWEVTNCFCVPHKEHDDQVEAELSYALDMYDLNRKVNSNESVVGWWATGNDV 
TNHSSVIHEYYARECNNPVHLTVDTSLQGGRMGLRAYVCIQLGVPGGKSGCMFTPIPVELTSYSPETFGLKLLQKTVGVSPAHRPKTVPPMLDLA 
QISEASTKLQSLLDLILKYVDDVIAHKVTPDNAVGRQLLDLIHSVPHMTHEQFTQMFNANVRNLLLVITLSQLIKTQLQLNEKLTFLPTA* 
(SEQ ID NO: 1074) 

Name: eiF3-epsilon-like 
Classification: translation factor 



Celera Sequence No. : 142000013384852 

ATTGGCATTAATGTAAACACTCGAGAGAGGAAGGAAAAGATAACAGTACTTGCCCTTGACTCCAATCTGTTCAAACTAAAAAAGGTACAATGAGT 
CACCCAATTTTCCATTTAAAAATTCATTGTTTTTATACTATATTAAATATTTAAGAAAATAAACAAAAAATATGTCTTTTATCAAGGAAAGCATA 
ATCAAGCCAATTCAAAACAAAATCATATAGGGAAACGACAAATATAAATATTTTCGATCTAGTTCGTAATCATAGGTTCAAGAACAAAGTTCGTA 
TTGGACAGCCTGCCAAATTTAAAGATTCAGCAATATAAAAGTAACTAAAGCAAGAGAGAAGGCTATGGTCGAGTTCCTCGACTATCAGATACCCG 
TCACCCGTGGCAACATGGGTCAACTAACTTGCGCTGGGTATTTGTCTCTAGAATCTGTATGCTTAATCTCAACCTTCTAGCTTTTATAGTTCCTG 
AGATCTCGACTTTCATACGGACAGACGTACATGGCCAGATCGGCTATTGATCCTGGTCAAGAATATATGTACTTCATATGGTCGGGAACGCTTCC 
TTCTGCCTGTTTCAACGAATCTAGTATACTCTATTACTCTACGAGTAACGGGTAGAAATATAAAATTGAAACTATTTTGAGTCATCCCTCCAATT 
GGTAATTTTTTTATATATTAACGGAAATTTAAAGTGCGTTACAAAAGTAAAAATTCTTTTGGTGCTAGTATGCTTAACGGGGTGGACATTTCGGC 
GTTTTCCACACTTCTTGGGAAATCACTTTAGACAAATTTGTACCAATGTCGGTACTTAATAGTATTTAACTATAACTTACCTTTTGCCTTTGGTG 
TGAATAGGCTATTAAGCCAAAGAGTAAACATTCGTCGATTGTAACTCCCTTGTCATCGATGTTGGCCAATACGTGACAACATCGATAGTATCGAT 
AACTATCAGGTTCAGGTTCGCCTGTATTGCACTATTTTGCTTCACACACGTAATTTCGGAATTTGCCAGAAAAACACCGATCTGCTATTGATCTT 
AGTCCAAAATTGCGTTTTTAAGTCGAAAAGACACTTTTCGGCTTTTAGATGGATGAAGACTCGGCTACGATACAGCTGGGGGATAGGCGTCTCGG 
TAACGCGGTCGATCATGATCTGCTTCCAGGGCCACCCGGCGTCGATGAGAAAAGCGGAAACGAGAACTGCGAGCCCAGGAGCAGTGGGTCGGGAA 
AGCGCGGCAAGTCGAATTCCAGGCGCCGTTCCACATCCCGTGGTCGGTCGGATTCCCGCAACACATCGCCGTCGAACACCCGGCGAAGGCATTCC 
CTGTCCCGCTCGCTGACTCCACCGCGTCGTCGTTACGATTCGCCCGATTATGGGCGGTACTCCCGACGGCGCTTTGATCGTGATCGGCGACGCTC 
CCCCGAACGCCGGCGCTCTCCGCCACGTCGCGGACGACGGTACGGACGATCCCGGTCCCGCAGCATGTCGCCCCGACGGGGAGGTCGATGGTCAC 
CAAAAAAAAAACCGTCGTCGCCGCCAGTGCCCCCTCCGCATGGAGGACCACATCCTCAAATGACCCCTGGCTATGGAGCTATGGCCGCACATATC 
TACGGATCATACGTCGCTCCACCAGATGTTTACGGTCACCAATATGGAATTCCGCCTAACTCGTTTCCACCTCAGGGACCGGGGTACGGTATTCC 
AGGTGCCCCGCCGGACCAAAGCTCTTTTGGCTCTGGCTACGGTCCGCCTCCCACAAGCGCGTGGGACGTAAATGGTACGTTTCTAGACATTTAAC 
GCCAAAAGCCCTTCATTTATTAAGCATATTATTTCCTGCAGTGTACGGTCAGCAGGTGTGGATTAACCAGCCTATTTCTGGGGTCCAGCCATCCG 
CTTCAGACTCCCTGATCCAGCCACAGAAGGCCCGCGAAACTCTCTCTGGCAAGACGATAGGCGAGGAGCAGTCGGTGCCAGCGCTCGAAGGCGCA 
GGTTTGACGCCTCAGGATGGTAAGTGTGGAAATGAATTACAATGTGATCACTACGAACATTCACATGCAAACCTCTCACATTGTTTTATTACTGT 
TGTCATGATTACTTGTACTTGTAACTTAATTGCGCGCAAATGAATCGCATACCAGAAATTACCAAAAGTCTAGGGTAAAGATCAGATTAATCAGA 
ATCTAGAAGTACACGCGCTGCAAGAACTAACCAAGACCAACCCGAGCGATAGTCTCGAAATGTTAATTCGTTTCAAAAAGAAAATGCAGAGTTTC 
CGTTTTTGTGTGGCAGTCGAAACAAACAATTACGCCAACCTGGACGACCGCCGCCCCAACTGGCTCTTCATTCAACGAGGACTGGCAGAACAAGC 
AATCACGGCGACTGCATCCGAGTGTAATTAGCCGATGAAATGCACTCGAGAATGCATTTGTAAATGAAATTTTAACTTTTATTTATCTAGGAAGA 
ATG T TT ATT AAT T AAAAC C TAT AT AT T T AT C C C C CAT AT AT AT AC C GT C G AT CGGC AGCT GT T GC C C AAG AAGC GG AG AAGC AAAAGGAC GAG C T 
TAAGAAACAGCGCGCTAACTATGTCAAAAAGGTATCGCTGCTCAAGAAGGAAATGAAAGTACTTAAGGACCAGCGCGAGGACCTGGCAGCAGGCG 
ATGCCCCACCCTCGCCAACCACAAAGAATTTCATCGAAGAAAACGACCGCCTGCAGGTGAATTGGGATCCCAAAACCTTATCTGTTAGCCACTCT 
ATGTAACCTGTTTTCATCCTGTATAGATGCAAATTAAGAAGAAACTTGACACAATTGAAAACGTTATAGATATGCTGAACGGCATAATTGGCTAC 
GAAGAAGTTGAAGATGAACCATCTAACCAAAAGTCCCATGGCAAGACTCAACCGGAACAACAGGATAGTCGCTCGCACTCCTCCAGTGAGAGTTC 
TGCAGATAGCTCTGAATCCAGCTCATCATCTAGTACTGCTTCCGATTCGTCATCCGATAACGAAGACGGGGAAGGGGCAGGCGGCGCCGTAGGAA 
GTCCAACCAATCGGCTAAAGCACCGCGCCATAAAGTCGATGAAGTCAAAGCAGCTACATGGTGAAATGAAGGCAGTGCCAACCCAAGCACAAAAT 
TACAATTTTGTGTTTTTCGATCCCGAGCAGCATTGGTGCGAGAGCTGTGGAGTTTTTCCAAAGTCTGCTCGAGACTATTTAAAGCACTTGCATGC 
CGAGGAACACATGAATCGAGAAACCATAGAAACTCCGTGGCATGTGGGTATTGACCACGATCCCTTTCCCACATTTGAAAACGCACCTGCCAGGC 
GCGTCCCCGTGAGGGGCATGCAATTTCTGGTGCCGGCTAACGCTTGGTTTTGCAAGCTCTGCAGTGTTTGGATTGGAGACTTGCACTGCGCTTCT 
GCTCATCTTAAGTCTAGGCTTCACTCCAATAAGTATCAGGTGAGTAGTAATATACATCTTTTAAAGCTCTTATGAAATATTTACTGTCACAAATA 
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TTCTTTACCATGTTAATTTAGGGTTTTGTATTTAATTAATAAAATATATATATTAAATATAAAAATACGGAGCTTGTAGGGCTGGTGGATAGAAG 
TACTTCCCAAAAACAAAATTGAAACAATAGTTTTGTTAAGATTGAACGTTATTTACATATCGTGAAATCATTTCTACGTTTAGCTTACATCAAAA 
GTTTTGTGTGTTAAATTTGCACTTAAGATATTTAGGTAAATTACGTTATTCACTTGTGCTTTCATGCATAAAAAAAGTTCGGATTTCAGGCATTT 
GCCATTCGCACATAATATCAGCTGTCACTACCCTTGGCAATGATGTAGTGTATGAGCAGAACAAGTTTGAAGATTGCTCCAATTGTCAACTTTTT 
TTTTTGCTGTAAAGTTGGCTTTTACCTTTGTGTCAATTTATTAATTATTAAATTGGCTTATTTTTAAAAGGTTAATTCTTCTTTTCCTTCCCAGC 
AACAAAATTTAATGGTAAGATGGGAATGTGAATGTAAATGGAGTAGCCAACATGTGCTTTAAATATATTGTTAGTACTTATGTAATCTTACTACA 
ATCATAAGTTAACAATAAATAAATGGAAGCCTATAGAGGCATGTCCGTTTGTATGAACATCGAATTTTTGGGAACTATAAAAGATACCTAAGCAT 
GCAGATTCTATTGAGGTTTTCGTGGCGTAAGTTTCTTTTAGAACATTGTCACACCACAAAGGCCACAAACGGTGGCTATTCTATGGTATTTGACG 
CCCAAAAACTGTCTCGCCCACACTTTTTAAAAATTTCATTATACCCGTTACTCGTAGTAAAAGGGTATACTAGATTCGTTGAAAAGTATGTTACA 
GGTCGGACTCGTTTGTGGCCATATAAAATAAGGATATTCCTGATCAAGATCACTAGCCGAGTCTGGCCATGTCCGCCTGTCAGTCCGTTTGTATG 
AACGTCGAGATCTCAGAAACTATAAGAAAAACTTCGAGAAGATTTAAGTTTATTCATTTCCCAATTTCTATCGACATGCCAAAAAAAATTTTATA 
GCTCTCGTTCGCACTTCCACTGAGTCACCAAAGGGTATCTAATAGTCGGGGACCACTCTTGTCAAAACTAACATAGGCCTTCCACTTACCTTTCT 
CGTATATATTTTATAATTGCTTTTATTAATTCTTCGCCTCATGCAATTATTCTCCTGTCAATTTGGAATATATTATTGACTTTTTGTACCCGTGC 
ATTGCAGCAATTTTTAAATCAGAACCCCAACTATGAAATAGAGTGGCAAACAAACCGCGAGCGGGCCATGAATCAACAGAAAGATCACGATGATG 
C C AAG AAT AAAAAG T C C AAG AAAG ACG AGGAC AAGGC GT C C AAAAAG AGG AAAAAAAAGGGGT AAGGGTC AT AAG AAT AAT TTG AA T CC AG A T TA 
TATTTTATATTTGTTTCTTTTTTACAAACAGCGGGGACAAAAAAAAAAAGAAGCGCAAGCACGGAAAGAAGAGCAAACGTAGTACAATAGATGCA 
AACAGCACATCGTCATCCTCCGATAGCTCTTCGTCGGAAGAGGAAAATGGACAGCAACCAAAGAAAGGTCGTACCGAACAGCAACCAATTTGTCA 
ATCAGATAAGTCGACTAATGCAGCATCAATACGAGTGGCCATGAGAAAGCAGGAAAGCACACCAACAAATTTGAAGCCGGAAGTGCCTTGTCAGC 
CTGTGGTGCCGCCACCGCCACCGATCATAAAGGATTCCACAATTCCTTCAGAACGGGGTAAATGGACTGCCGCCAGTGTCGAAGACCAAAGTAAG 
GATCAAAAGAACCGGGATGATACCATGCTCAAGCAATGGAATAATGTGCAGCCTGTTATAAGTGAGAGTGAAAAAAAGTTGTTGGAGCAGCTTAA 
AGGAAGGCTGAAGAACAAGGGAGCCCGCGACCAGAACAACCACAAAGCAGTACCAACAGAAGCGAATGATGAAAAGTTTAGGCGGGGTGGAGGAC 
GTCATTCTCGTTCACGTTCTAAGCGGCGCAGCTACTCTCGAAGCATAAGTCGCGGGCGGGATCGAGATCGGCGGGATCGTGAGCGAGATAGAGAC 
CGTGGTCGTAGGCGTCGGTCACGCAGTCGTGGCCGACACTTCTCTCGTTCGCCCATAAGATCTAGTCGAAGAAGTCGATCCCGTCAGGGCCGAAG 
TCGACGCAGTCGTAGTCGCTCTATTGAACGGGTAGAGCGTCCCGTGGTCAAGCATTCCGAGTTTCGGCCCCGTACTGTGCCGGAACGCAAGCAAA 
CTGATATAAAGCGCGACAAGAAGGATGCAGAAAGCAAGTCAAAGTCATCCAAGCCCAGTAACAATTCCTTGCCTGGAGGCAAAAAACTACCTTTC 
ATCGGTAAGATGCCAGTATTTAAAAAGCAGTCTGTTCCATCTTTCAACTATGATGCTTCTGCTGTCTACACCAACGGCTGTTTGGAGGCACCACC 
TCCACCGCCACCTCCATTGGGCGGAGCGACAAGTCAGATAGCTGTGCGTCAACAAGCACCGACAGCTGCGCAGATTCAGATGGCAATGATGGAAG 
ATGCTTTTGGCAATGCTCCCCCATTTCATCCAGATGCGGGTATGATGGTAGATTACGATGAGCTAATGCCAGATCCTGTCCAGTTCGCCAATTTA 
ATGACGAGCTGCCCACCACCTCCGCCGCCTGGAGAAGGCTCTGAAGGTGATGCCCATGGCCCTGAGGATGGCGATCAGCATGACAACGAAAACCG 
AGAGGACGTCTTGCCCCCAGGCATTGATGAAGCTGAATCTGCCCTGGTACCGCATCCGTTAGACGCTGCGGCACAATCCCGAGACGGGGAACTCC 
CAAAGGACCTGGTCGAAGCGCTTGACATAATATTCCCCAGCGACGTAGCGATTTTGGAGGCATCAAATGGAAACAGTGAAAAGCAGGAAGATGTG 
ACTATCACCACCATTGTTGAAGATGACGCAGTGCTCAGTGTGATTCATCTTCAAGATTTAGCTAAAGAGGGTATTCACCTTGTAACTATCAATGA 
AGCAGAGGTACCTAAAAAGTCACATATTCAAACATCGGAGAAGATTGAAAAGTTGCAGGATTTAAATGGCAAATCGGACGACGGAGTAAACATAA 
AACTGATTGACGCAGAGGATATCCCAATGCCGAATTCGCCTCCGGCTGAGCCGTCATTTGACGAGGAGACGAAAAGCAAGGCCTTACAGAAGCAA 
GCTGATATCTCAGAAATACCAGAGCCGGCTGTATCGCCTTCAGACAGTGAGAAGCTTCGTCGTCAGTCGGAGCTTGACGAGTTGGCTATGCTGGG 
AATCGACAGGAGTGACATGGCGGCACAATGTATGTAGAACGTGTCTTTCTTTGCTATGTATATTATTCTTTTTTTTTAATTAAATTAGAAACTGT 
GCTATTTGAGTCTTAAATGAGTTCTACATTATTTTTTCTATATATGTGCATAGATGCTCTGTAAGATTTGAGTTGACTTGGAGAGGGCGAAAGGC 
AAGTCCCGACGCCGGATACACGGGCGGACGAATCGAATAACACGTCAAGGAGCCCCATAAAAAGGAACAGTTTTAGATTTTAGATTGGCACCTGT 
GTATGAGTGTTTTTGACGGAATGTTTTAAATAAATGCGCATATATGTATGTAATATATGTAAGAATCCTAAAAACGTTTGATTTATAATTGCTTT 
AACATACACTGCGCGTCATCAGATTAGAGCCCCCTGCATGTCCCCGTTTCTTCTTATGTTCTGTTTTTCTATTTTGTTTTATTTTGCCATTGCAT 
AAAAATATATCAATTTAATTTTTTTTTTGTTCACTGATATTCACCTCTTTGAGGATAAAAAAAGGTCCTGAAAAAAACCGTTATCTTTTTAGTTC 
TGCTTTGTTTTCTTTTCGGCGAAAAATTGGACACGTATATGGCATCATCAGATTAGCGCCCCATAATGAAGTTTGGAATATCTTCCATAATTTAA 
AAGGTAGGACAACAAAATTTTGTATGCTGTTAAATTGAATAGTTTGCTTTAATTTTGCGAAAAAATGGGACGAGGCACGCGAAATCATAGCTTTC 
TATGAATGCGGGCTTTCTTTTTATAAAATTGCAAACGGCAGACATCGCAAAACTATAGTCAATTTTATTAAAAACAAGTGAACTTATGGAAAAAA 
CTATAAAGGTGGCAATAACATTCCTTATCGAATACCTTGCAATTCCCACGACTCCTCCGACAAAATTCGGGGTAAAAGTGGCGAAAACCCGAGCA 
AATCTTTGATTCAAAGAGTGATTAAGAAAACTAAACATTTGAAACACAGAAAACTGAAAAAAAACATCTCTTTATAAGTTGCGAAAAAAAGCACG 
ACTGAGTTTTGCTTGAGCTTATATGATCCGGTCTAAGGAGTGGCATTATGTTGTATTTTCGGACGAGAAGAAATTTATCTGGAGAAAGCAGACGG 
CTATAGTAATTACAACCATGATTTGCGTATAGAGGAGCATATTTTGGTTCGCTTTCACAGACGGGTTGGAGGTGTCATGGTATGGAGCGCAGCAT 
C T TAT CAT GGCCTCTGT GAAT T C C AGT T T T TAACAGCAAAAAT TAT ATT ATT AT AT AT AAT TGTG T A T T AAAAAAC TGCATTTCCC T AT TT T AAA 
AACATTTTTGGAAAC 
{SEQ ID NO; 1075) 

Exon: 1001.. 1784 

Exon: 1847 , .2014 

Exon: 2529. .2716 

Exon: 2782. .3459 

Exon: 4758.. 4906 

Exon: 4 972.. 68 69 

Exon: 6989.. 7185 

Start ATG: 1094 

Transcript No. : CT27 625 

TAATTTCGGAATTTGCCAGAAAAACACCGATCTGCTATTGATCTTAGTCCAAAATTGCGTTTTTAAGTCGAAAAGACACTTTTCGGCTTTTAGAT 
GGATGAAGACTCGGCTACGATACAGCTGGGGGATAGGCGTCTCGGTAACGCGGTCGATCATGATCTGCTTCCAGGGCCACCCGGCGTCGATGAGA 
AAAGCGGAAACGAGAACTGCGAGCCCAGGAGCAGTGGGTCGGGAAAGCGCGGCAAGTCGAATTCCAGGCGCCGTTCCACATCCCGTGGTCGGTCG 
GATTCCCGCAACACATCGCCGTCGAACACCCGGCGAAGGCATTCCCTGTCCCGCTCGCTGACTCCACCGCGTCGTCGTTACGATTCGCCCGATTA 
TGGGCGGTACTCCCGACGGCGCTTTGATCGTGATCGGCGACGCTCCCCCGAACGCCGGCGCTCTCCGCCACGTCGCGGACGACGGTACGGACGAT 
CCCGGTCCCGCAGCATGTCGCCCCGACGGGGAGGTCGATGGTCACCAAAAAAAAAACCGTCGTCGCCGCCAGTGCCCCCTCCGCATGGAGGACCA 
CATCCTCAAATGACCCCTGGCTATGGAGCTATGGCCGCACATATCTACGGATCATACGTCGCTCCACCAGATGTTTACGGTCACCAATATGGAAT 
TCCGCCTAACTCGTTTCCACCTCAGGGACCGGGGTACGGTATTCCAGGTGCCCCGCCGGACCAAAGCTCTTTTGGCTCTGGCTACGGTCCGCCTC 
CCACAAGCGCGTGGGACGTAAATGTGTACGGTCAGCAGGTGTGGATTAACCAGCCTATTTCTGGGGTCCAGCCATCCGCTTCAGACTCCCTGATC 



FIGURE SHEET 580 



CAGCCACAGAAGGCCCGCGAAACTCTCTCTGGCAAGACGATAGGCGAGGAGCAGTCGGTGCCAGCGCTCGAAGGCGCAGGTTTGACGCCTCAGGA 
TGCTGTTGCCCAAGAAGCGGAGAAGCAAAAGGACGAGCTTAAGAAACAGCGCGCTAACTATGTCAAAAAGGTATCGCTGCTCAAGAAGGAAATGA 
AAGTACTTAAGGACCAGCGCGAGGACCTGGCAGCAGGCGATGCCCCACCCTCGCCAACCACAAAGAATTTCATCGAAGAAAACGACCGCCTGCAG 
AT GC AAAT T AAG AAG AAAC T T GAC AC AAT TG AAAACGT TAT AG AT AT GC T G AAC GGC AT AAT T GGC T AC G AAG AAGT T G AAG AT G AAC CAT C T AA 
CCAAAAGTCCCATGGCAAGACTCAACCGGAACAACAGGATAGTCGCTCGCACTCCTCCAGTGAGAGTTCTGCAGATAGCTCTGAATCCAGCTCAT 
CATCTAGTACTGCTTCCGATTCGTCATCCGATAACGAAGACGGGGAAGGGGCAGGCGGCGCCGTAGGAAGTCCAACCAATCGGCTAAAGCACCGC 
GCCATAAAGTCGATGAAGTCAAAGCAGCTACATGGTGAAATGAAGGCAGTGCCAACCCAAGCACAAAATTACAATTTTGTGTTTTTCGATCCCGA 
GCAGCATTGGTGCGAGAGCTGTGGAGTTTTTCCAAAGTCTGCTCGAGACTATTTAAAGCACTTGCATGCCGAGGAACACATGAATCGAGAAACCA 
TAGAAACTCCGTGGCATGTGGGTATTGACCACGATCCCTTTCCCACATTTGAAAACGCACCTGCCAGGCGCGTCCCCGTGAGGGGCATGCAATTT 
CTGGTGCCGGCTAACGCTTGGTTTTGCAAGCTCTGCAGTGTTTGGATTGGAGACTTGCACTGCGCTTCTGCTCATCTTAAGTCTAGGCTTCACTC 
C AAT AAGT AT CAGCAAT TTT TAAATCAGAACCCCAACT AT GAAATAGAG TGGC AAAC AAACC GCGAGC GGGCCATGAATCAACAGAAAGATCACG 
ATGATGCCAAGAATAAAAAGTCCAAGAAAGACGAGGACAAGGCGTCCAAAAAGAGGAAAAAAAAGGGCGGGGACAAAAAAAAAAAGAAGCGCAAG 
CACGGAAAGAAGAGCAAACGTAGTACAATAGATGCAAACAGCACATCGTCATCCTCCGATAGCTCTTCGTCGGAAGAGGAAAATGGACAGCAACC 
AAAGAAAGGTCGTACCGAACAGCAACCAATTTGTCAATCAGATAAGTCGACTAATGCAGCATCAATACGAGTGGCCATGAGAAAGCAGGAAAGCA 
CACCAACAAATTTGAAGCCGGAAGTGCCTTGTCAGCCTGTGGTGCCGCCACCGCCACCGATCATAAAGGATTCCACAATTCCTTCAGAACGGGGT 
AAATGGACTGCCGCCAGTGTCGAAGACCAAAGTAAGGATCAAAAGAACCGGGATGATACCATGCTCAAGCAATGGAATAATGTGCAGCCTGTTAT 
AAGT GAG AGT GAAAAAAAGT T GT T GG AGC AGC TT AAAGGAAGGC T G AAG AAC AAGGG AGCC C GC G AC C AG AAC AAC C AC AAAGC AGT ACC AAC AG 
AAGCGAATGATGAAAAGTTTAGGCGGGGTGGAGGACGTCATTCTCGTTCACGTTCTAAGCGGCGCAGCTACTCTCGAAGCATAAGTCGCGGGCGG 
GATCGAGATCGGCGGGATCGTGAGCGAGATAGAGACCGTGGTCGTAGGCGTCGGTCACGCAGTCGTGGCCGACACTTCTCTCGTTCGCCCATAAG 
ATCTAGTCGAAGAAGTCGATCCCGTCAGGGCCGAAGTCGACGCAGTCGTAGTCGCTCTATTGAACGGGTAGAGCGTCCCGTGGTCAAGCATTCCG 
AGTTTCGGCCCCGTACTGTGCCGGAACGCAAGCAAACTGATATAAAGCGCGACAAGAAGGATGCAGAAAGCAAGTCAAAGTCATCCAAGCCCAGT 
AACAATTCCTTGCCTGGAGGCAAAAAACTACCTTTCATCGGTAAGATGCCAGTATTTAAAAAGCAGTCTGTTCCATCTTTCAACTATGATGCTTC 
TGCTGTCTACACCAACGGCTGTTTGGAGGCACCACCTCCACCGCCACCTCCATTGGGCGGAGCGACAAGTCAGATAGCTGTGCGTCAACAAGCAC 
CGACAGCTGCGCAGATTCAGATGGCAATGATGGAAGATGCTTTTGGCAATGCTCCCCCATTTCATCCAGATGCGGGTATGATGGTAGATTACGAT 
GAGCTAATGCCAGATCCTGTCCAGTTCGCCAATTTAATGACGAGCTGCCCACCACCTCCGCCGCCTGGAGAAGGCTCTGAAGGTGATGCCCATGG 
CCCTGAGGATGGCGATCAGCATGACAACGAAAACCGAGAGGACGTCTTGCCCCCAGGCATTGATGAAGCTGAATCTGCCCTGGTACCGCATCCGT 
TAGACGCTGCGGCACAATCCCGAGACGGGGAACTCCCAAAGGACCTGGTCGAAGCGCTTGACATAATATTCCCCAGCGACGTAGCGATTTTGGAG 
GCATCAAATGGAAACAGTGAAAAGCAGGAAGATGTGACTATCACCACCATTGTTGAAGATGACGCAGTGCTCAGTGTGATTCATCTTCAAGATTT 
AGCTAAAGAGGGTATTCACCTTGTAACTATCAATGAAGCAGAGGTACCTAAAAAGTCACATATTCAAACATCGGAGAAGATTGAAAAGTTGCAGG 
ATTTAAATGGCAAATCGGACGACGGAGTAAACATAAAACTGATTGACGCAGAGGATATCCCAATGCCGAATTCGCCTCCGGCTGAGCCGTCATTT 
GAC G AGG AG AC G AAAAG C AAGGC CT T AC AGAAGC AAGCT GAT AT CT C AG AAAT AC C AG AGC C GG C TG T AT C GC CTT C AG AC AGT G AGAAGC T T C G 
TCGTCAGTCGGAGCTTGACGAGTTGGCTATGCTGGGAATCGACAGGAGTGACATGGCGGCACAATATGCTCTGTAAGATTTGAGTTGACTTGGAG 
AGGGCGAAAGGCAAGTCCCGACGCCGGATACACGGGCGGACGAATCGAATAACACGTCAAGGAGCCCCATAAAAAGGAACAGTTTTAGATTTTAG 
ATTGGCACCTGTGTATGAGTGTTTTTGACGGAATGTTTTAAATAAATGCGCATATATGTATGTAATATATGT 
{SEQ ID NO; 1076) 

Start ATG; 94 

MDEDSATIQLGDRRLGNAVDHDLLPGPPGVDEKSGNENCEPRSSGSGKRGKSNSRRRSTSRGRSDSRNTSPSNTRRRHSLSRSLTPPRRRYDSPD 

YGRYSRRRFDRDRRRSPERRRSPPRRGRRYGRSRSRSMSPRRGGRWSPKKKPSSPPVPPPHGGPHPQMTPGYGAMAAHIYGSYVAPPDVYGHQYG 

IPPHSFPPQGPGYGIPGAPPDQSSFGSGYGPPPTSAWDVNVYGQQVWINQPISGVQPSASDSLIQPQKARETLSGKTIGEEQSVPALEGAGLTPQ 

DAVAQEAEKQKDELKKQRANYVKKVSLLKKEMKVLKDQREDLAAGDAPPSPTTKNFIEENDRLQMQIKKKLDTIENVIDMLNGIIGYEEVEDEPS 

NQKSHGKTQPEQQDSRSHSSSESSADSSESSSSSSTASDSSSDNEDGEGAGGAVGSPTNRLKHRAIKSMKSKQLHGEMKAVPTQAQMYNFVFFDP 

EQHWCESCGVFPKSARDYLKHLHAEEHMNRETIETPWHVGIDHDPFPTFENAPARRVPVRGMQFLVPANAWFCKLCSVWIGDLHCASAHLKSRLH 

SNKYQQFLNQNPNYEIEWQTNRERAMNQQKDHDDAKNKKSKKDEDKASKKRKKKGGDKKKKKRKHGKKSKRSTIDANSTSSSSDSSSSEEENGQQ 

PKKGRTEQQPICQSDKSTNAASIRVAMRKQESTPTNLKPEVPCQPVVPPPPPIIKDSTIPSERGKWTAASVEDQSKDQKNRDDTMLKQWNNVQPV 

ISESEKKLLEQLKGRLKNKGARDQNNHKAVPTEANDEKFRRGGGRHSRSRSKRRSYSRSISRGRDRDRRDRERDRDRGRRRRSRSRGRHFSRSPI 

RSSRRSRSRQGRSRRSRSRSIERVERPVVKHSEFRPRTVPERKQTDIKRDKKDAESKSKSSKPSMNSLPGGKKLPFIGKMPVFKKQSVPSFNYDA 

SAVYTNGCLEAPPPPPPPLGGATSQIAVRQQAPTAAQIQMAMMEDAFGNAPPFHPDAGMMVDYDELMPDPVQFANLMTSCPPPPPPGEGSEGDAH 

GPEDGDQHDNENREDVLPPGIDEAESALVPHPLDAAAQSRDGELPKDLVEALDIIFPSDVAILEASNGNSEKQEDVTITTIVEDDAVLSVIHLQD 

LAKEGIHLVTINEAEVPKKSHIQTSEKIEKLQDLNGKSDDGVNIKLIDAEDIPMPNSPPAEPSFDEETKSKALQKQADISEIPEPAVSPSDSEKL 

RRQSELDELAMLGIDRSDMAAQYAL* 

(SEQ ID NO; 1077) 

Gene Symbol: BcDNA:LD19168 
FlyBase ID: FBgn0027866 



Celera Sequence No. : 142000013384504 

ACGTTTTTGCTTAACCTTGACGAGGAACCGACAAATTTTGTATTACATTTCTGTGCAATTAAAAGTTATAACGTTTGTGGATATGAATAAAAATT 
TTACCATACCTACGTAGGTGATAAACGCGTACTTAGTAGATGGTTTCATTAAACTAAAACAACATTTTTATGGTGTTTGTAAAAGCGCAATTTTT 
TAATGCGTTTACTCAATTATTTATTGCCCAGGCCCGTTTGCCACAGCCCATTTAGCAACTTTAGCTGTGCGTTACCAACTATTTATATGCGTTAC 
CACATTTGAAATTGTATGAGACAGCTGGTATTACAAAGAGTATATCTTATTAATTCACATTGTAGCACTGGTGTAAAAGTGGCATATACATTGAA 
TTACGAATGAGAGTGTAAAAATACGGTGTACATTTGTGTATTCTTATTATTTTTGGTAGTTTTAAACATTTGAGGCATTTCTGTCGTTTGCAATT 
TACAAATAAATGGCTAAAAACTATGGCTGACGAATGAGAACACCAAATTGAAATCAACGAACTTAAGCGAAAGCTATATGAATATCGAAGTTACA 
TGTGAGATATATCACCGTGTATGTATGCTTACGTAAATATATATATATATATGATTCCAAATGGGTATTTCTTGATCAAAGATACTAACGATTTT 
CTGAAGGGTCCTTAAATATTTACGAGTAATTCACCTTTTTGTTGGGTGAAACTATTTAGTTATTAAACTTGTACAAAAATAAAAATAAAATTTTG 
AAAATTTCACAAACTTTGTGGCAAATAATTCAACAATATATTGTAGGTCTATTACGATATAAAATGAATTGGCACTTCATGTATAAATATGTTAT 
TAAAAAATCAAAACATAATATAATTCTAGTTTATTCAAGGCCCTAAAGAGAATGGTTTTTACTTTTGAGATAGAACTACCCATGCCGGAGATGCG 
TGAGAAAAATGTATAAAAAGCTTAGTCGGCAGTCTAGCGTTTCCTCCAATTCAGAAATTCGTCTCCTTGAACTCCTTGGATTGCAGTACAGGCGA 
AATAGCTCCCCCATCGCCGTTTTCATTTCCATTATTGTGCTCCTCTCCGCCGCCCACCGTGTAGGCCATCACAGTGGCATGATCCAATTTCCCGG 
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GCAATGCTACAAAGGCGCTGCTGTCCACCACACGCTTCCTGGCCGCGCGATGCTGCATGCGGTTCAGTTGCACCTCACGCTTGGCGCCATCTGGT 
CCCGTAGAGAGTTTATAGAAGAGTTCCCTTTGCTCCAGGGTCTTCCTCCTGGCTGACGTTATCATCCGTACAAAATCGATAAGCAAGTGGCAGAG 
CCTGCGTGCCGATACGCAGGGCGAAAACGTGCCGTTGATGCCATAGCTGAGCAGATCCTCTAAGCGGTAGAGGGTAAGAGCATGCCGCTGGGGAC 
CGGTAACAAGGGGCAGTCCTGAAATGGACTTCGGAATGCGTTGGGGCACCACCGATGGCGGTGGAGCTGATACCAATCCCTGGCGTGGTTCAATA 
AGCGTTTGGGAGCGCATATACTTTGGTCGACTAGGCGCATTATTAGGCGGACTTTCCGCCTTGGACTGACGAGCTGGTCGCGTGGGCGGAGTCGA 
AGGATGCAGACGGGCATAGATGGCGGATGCACTCTTGTTCTGCCGTTTGGGTGCCAAGTTGGCGGGCTTGACATGTCCCGACGTCTGAAGCTTAA 
CATCCGGAGTCCAGGCTTCGGCAAACTTGCCCACAACTGGATCGAAGTTATCGCTGATTCCATCCGACGTTAAAAAGACTATGTCTCCACTTTCA 
ATGCAGGTCATCGAGAAGGTTAGATTGCTCAGCTCTGGCTTGTTGCCATCCGCCGGTCCAAGGGCACCCAACGCATCGCGCATGTCCCTCATGGA 
ACTGATGTCGTGGGATGCCTGTGTTAGTTCCCGTACTCCGTGCTTCTTGGAGTACACATAGCCCAGGGAATCGCCAACATTACACGAGCACACGA 
CATATTTCCCTGGCGCTCCGTCCAGCGGCAGCACAACGGCTATGGTGAGAGTAGATAGGGCTCCTCCAACTTCCAGGATACAACCGTGTCCCTCC 
CAAAGACTACGCAGCAAACTCACAAAAACCTCCTGGGTTGTGGTGGCTCGGCACTCCAGCGCCTGTCCGAAGACCGCCCGGTCCAGATAGTCCAA 
ACAACCGTGAACAGCTGAACGGGCGGCCAGACGAGCGCCATCACCCCAGTTGACACCATCTGCCATGGCCATGGCAGCTGAGTCTCCGCGCACTA 
CCATTCCGTAGCAGTCGGCGATGGGATTGCCCATGGGTTCTTTAGTCAGCATGTTGGTCTCGTAGAGGGACACCGAGATACCATATGCGAAGTCG 
CACTCCATGCGCCAGTTCTCCACGCCAGCGATCAAAGCTGCGGGATCAGAGCCACCCTGGTTGCTTACAGCTTCGCGGAGCAAACGTTCCCGCTC 
TGAGTTGGGATCCCCATCCGTCGAGACCTCAGTCTTTGCATTTAACACATTCCTATTGTTCTGATCGCTGTTGGCTTTTCCAGCATCCGTGCCAG 
CAGACGCAGCGGCCAGATTGGCTATGCTGCTTTTGGAATTCTTGCGGGAACGCATTCCCGACGCAGAAGCAGATCCGGAGGGCTGCTCCGCACTG 
CTCTGTCTGCGAGGTCCTGCCGGCGGCCGTCCGAATTTGCTGGTAAGTCGGGTTGAGTTCTGTGTGGCCACCTGAACTGGTGGTGTACTCCTCCT 
AACGCCATAGCAATCATCGTTCTCCTGCTGCGTATCTATGAAGTCAATGTCATCCGCGAAGCTGGTGCTCAAGTGCAGTTTGGAGCGCTTGATGC 
CGGTGAGCCCCTCGTCAGGACCTGTGCAGGCGGCTGTTATAGAGCAGTTTCCAGCTTCGTATGTATCCAACTTGAGAACGGGCATGTCCGTGGGA 
TTCTGGCCATTGTAAATCTCCGGACCCTCAACAAACTGACGCTGCATTCGGCTGTCGGTGAAGGTGGTCGGGTAAATATTTGAGCAACTTTCGCA 
GGCTACTAGGTTTCTATTTACTTACCCCTCCAGATATTTAGTTATGAAGCTACCATCATCATCCTCGTGCTCGCTGGCCCGCCGGCGACCCTCGC 
GAGGCTCCGCGATGAAGCTCAACTGGCGGAAGTAAGTGGTGACTTTCTGGCGCAGCGACGGCATCTTTAGTCGAGTTGCTCCCTCCTCCTCGCCA 
GCAGTTCTCGCCCTCCAGTTAGGGTTCTCTTAATTGCCAGAGATGGAGGGGCTAAAAAATGGATGCTGGGATGCTGACCAGTTCGTTCACCTCAT 

C AG T GG C AAT T GT T GT GGG AGT C G AGT C AC T CC AAAC G AAT CC AG CC G CC G AC CCT GC AAAGAGGT T T AAT T T AG AGC T T CAT T T AATT AT C AAT 
AAATTATGCAATTAAATGGGACTGGGCAAACACGAAAGAATTACAAGCAAATTGCATAGGCAGCCGCATAACTGACATTACTCCCTCCTTTCAAC 
CGCCTACTCCTAAATGACGAGATGTGCTTTTTAGTTGCCGCAGGCCCTGCATCCTTTCCCTGGCCACACAAAACCCTTCGCTCATCAGGAGCAGG 
ATGTCAGAGGAGAAATCCCACCCTGTGAATGGGCTTTTACCCAGCTCATTGCGAATGCAGTAGACACTGAACTGATGCGGACACAGAAATCCCCT 
CCTTCCTTCGGGGTTCAGATGACATAGCTTTTAGCAGGGCTCTCCGCATGTGTTCTGAACCAGAGATGCCAATTTGGTTCTTTGCTCAACTAATT 
TTAAATTGTTTATGGATTAACTTATGTGCCAAACTACTTGCCTTTCAAAATAAACCATTTATAAAATCGAACGCAATAATTTGGGACTTGATTTA 
TCACAACATTTATTTATTTATATTAAACCCTTTTCAGGATCAACTTTCAAATAGTCCTTTAGCATGAATTAAGTAACTGGCATCACTGATGTGTT 
GATGATTAATGAATGGACAAACCAACTGCCTGGGGCGACCTGCAACTCCAGCAGATGTTGGGTTTTCGGATTCTGACAGCATCAAGTTCGGTTTC 
AATGGAAAGATGATATCATATAGGGGTAAGTTTGCCGGCGAGAAACACACCAAGGTCTGCTGTACAATTATGTATCTAACTCTCCGCCGCTATCG 
CAGACGACATTTAATTGCCGTGTTAGGGCATTTAACTACCTGATCCATTCTCGTAGACAATGTAACGCCAAGACAGCCTTCATTTCGGCGAGATT 
ACAATGGATCGCAAACACAATAAATGCTAATCTTCAGCAGCAATTGCCGTGGAGATGCTTTTCCTTAGACGATGTACACAAAACGATTTGTATAA 
ATGATAAACAACAACAAAAAAGTCGGCACGGAAAGTAGTTTCGTTGATGGGGGCAATTATTCAAGATACCCCCCCTCCCTTGTTGTTGCTGAAAT 
TTTCACAAATGCTTCAAAGATATATATAAAAAGAAAAAAACACAATCAACCACTTTAATTTGTGGCATAAGTACTTCAAGTGACAAATGAGTGGT 
CAATTAAAAACTATGTCGATTAACGGTGAATACTGTGGGTCGGTACGCCTCTCTATGCGCTTCTCATAAACGGCACTTAACGGCACGCGAAAATC 

GC AT G AAC C C T G AAC C AAT T GT T C G AGT T T C GAT G AAC T T G AACT GC AC T C AC G AAT C AAT C AGC G AGT AAT G T AAT T C AAT T T C AAT C T G AT GG 
CCCCAAGATGCGCGATTTCTTGGCAATTGCTGGGCACTGATCCGATTGCCACCACACCCGCATTCCTTGCGGCAAAGCATTAAAATCAAATTAAC 

C C AC AT AA C C AT T AAGT GC AGG AGT G GG AGAT T T C C C AT T AC CAT AT C C AC AT T TC C C AG AT AC AAAT C AAAT G AAT T G AAAAGT C AG A GC C GCG 
AGACAAAGTCGATCGCTGGACGAAAGGGATGGATAGCTGAGTGCCCATGTGGTTATAAAGATATATCTTTCTGTGGATGCATTGAGGCGCGGCCA 
GCTGTGTGCACTCGTACCGGTCCAGGGCAATCAATACGGATGATTGAAAAGCCCACCTTCACATCACCACAAAACAATTCAATTTAGCCACAACA 
ATGTGCAGCTTGCGATTAAATGTGCAGCCAAAGAGATTACAACTGCACCACAAGAGCCAACATTGAAGCTCAGCCACAAGCGTTTTGTAGTTCCC 
ATCTGCAATTGCTCCAATTTCTTTACGCCTCCTCCAGATCTGCTCATCCACGTCAATCAGAGTCGTCATCACTTGACACACAATAGGCATCTGAC 
TGTGACTTACTGGCTTCTCAATTCCCTCAGCTTTTTCGCTTATAAGTGTCCATTGTGGGATCTCACAAATTATTTAAGTTCTGACTTTTGTCCAC 
CTGACGGCACGTGCCGAGAATAAAGTATAGGTTTTATAGAGATGGAAAAATCTCTGGATTGAGAAGTAAACTCATGTAAGATGCCCGTAGCAGCC 
GTTTTGTTTACCCATGTACATATATTATTTTTTAATCTTTAATTGCATTTTATGTATGTATTCAATTGTATATGAAATGACACTCCCCCAAACAT 
TAAACAAACAATTCTTCGAGGCGTGTGCCGCAATCAATCGACACTTTTCGATTGGATTTCGGGACCATCGAAATTTTCAACACTTGAAACTAATC 
AAGCGTGGAAATGGGGAAATAGGAACGCGGTGCTGTGAGATCATTGTGCAAGATCTGGGTGGGGCGATCCGTGGTGTATAAAAACAACTGAGTAA 
ATATTATCGGATGTGGCACAGATCTCGTGAGTCTTTCGAACTTTGTTTATGCTCGAAAACCAAAATAAACCATTTGCGCTTCCTCCAACCAAAAA 
TAGTTGTGTTGGCCCGTATTTATAGCTGCCGCCCGTGGAAACAAAAAATGTCTGGGTGAATTTCTCGCTTGAAAAACTCATTTGGGCAAGGCTTT 
GAAATGCATTTCCCGTCGCATTCGAAATCAGCTAAATTTTCTGAGCTAATCAAATGCGCAAAATTATTGTTTACACACTGGGTTAATTAGATGTG 
GAGATTGGTTTCTCTGCTCCGCTTCGAGATTCTTTTGTGCTCTTTCGGCTTTGTGCATACTTAAAATGTGTGTGGATTTACGAGGGAGAAGTGCT 
AAAACCCAATGCAAACAAGTCTGCTTTGGGGTCCAACCAAATGCTCCAACCAAATGCAAACACACCCGTGACATTCAGAGTCTGGATGCACTTGA 
GGCAAGTGACAGTTGTCTGCAGATTATGTGCTATGTGCTGCATATTGTGGAATCTCAAGGTGCAGTCGGTGGGCCAAAGCCGCAGTCAAAGAAAA 
AGCCTTAACAGCACAAGTCAAGTGAGGCTGCTCAAAGCCGTTTTGTGTGCTTCTGTTTGGGCCAGACAACAAACAACAAGAAGCTATAAAAGCAG 
AGAGTGAAAATTTGTTGCATTCATTGCGTGTTTTCTTCGAAAGAAAGTTGAAGGCCATCGTTAAGCTGTAGATTAAATAGAGCTGTCTGCTTGAT 
TACGAGCTCCCAAATATTTCAATCATTTCGAGCGACCCACTTTATGTATATTGCTTTTGTAGCGAGTTCCCTCCTTAACGTATAAGTTGTGTAGC 

T T AG GT A T T C T AAG AC C T TAG AT T AGATGAT T C T C C C T TAT GGG AAT TAT AAT AT AT GAAAC AAAACC GT AT G C AC AAAC C AC AT AC AC GC GT C A 
AGTGCTGATTCAACCGCAGTCAAGGCAATCTTGGATAGGAAATGCATGCTTAATTTTCATTTCCTCATATGCAATTTTTCTCATTTCAAGTCCAG 
GCTGATCAAGCC GCAT AAAAATAAAAT ATT AAGT AGC AAAT GCATTAAGCAAAATGGACGGGGTT CCCCAACCCAGAGT TTTAAACTG AC AAGCT 
TTTGGCTTAAACCTTCCCCCAATATTTGACCATGTGTTGACTCATTGGACAAGCCGTCAGCTGCATTAATAATGCATTAGACCAGCCCCAGGACA 
TAAAAGTCCCCCCATCGTGCTAAAGTGGGCGCAACACGGCCTGGGGCTTCATGAAAATCTGGCAATTAAAATGTTTTACATTATGTAAACCGCAC 
GAAAACAATCACTAAAGAGTTCCTTCACCTCGAAATGGCGTCGTGAATATGAAGCGATATGTATGATGTGGGAAACAAAGGAGGAGTTGAGGAGT 
CTCTTCCCGGTTTTCGAAGCATTTCTTATAAATTGCCACCTAATTGGCGTGTTCAACATTGTGACTGTGCGTGGTACGCCTCATTTCCTGTTGCA 
ACAATTTGCATTAATAGAGGGTCTCATACCACAGATTCAGTTCGAGATCTCCCCATCCGTATGCGAATGCGAATCTATTCCAGTGAATGCATTTC 
AGTACCATTCAATTGAAGGCGCGATGAAATGGAAAGTTAAGATCCCTTTGGAGGATTGGATGTGTGTCAGAAGTTGAGAAACATCCTTTGTTTTA 
TGTGCCTTCCATGTGCATTCGCACTTAACGCCAATGCCAGGACCTAAGAGCACCCAAACCCACCCAGTTTTTCCCCACTGCATTTCCGCTTGCAT 
GTTTTCCCTTGTTTTTCCGCTTTTCGGGTCGCGCCACTTGGCCCGCTTTTCAGCCACGAAAAACTATACTTGAAGAACTTGCCACACGCCGAATA 
CAATAAACTAAAACTGGGGCAAGGCGAAATAATTCGGGCCTAGAAAGGCAATATCAAAGGAGTGCCGGAGCTTATGTATGGTATAAGGGTAAATT 
ACCTTTCGGACTTACTGGGTATATAGCATCAAACCACCCTCACACAAAACATTTAGTATGATTGTTTTTAAAGCCATATAATTACGGCGAAGGAA 
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AGCCATTCCCATACCATTCTACTAATTGGTTGATTTTTTGTTGGATTTCGCAAACATCTGGCAGAAAAATTGATGCCGCCAAACAACTCCACACA 
CATTTGGCAAGCCCACAAGATAATTTGCATAAATTGAATTGAAGTCTTTACTTGTTGGCCAGGTAGCGACAGCTGTCGTCGGTTTATGTAGGACT 
TATATACGACTAATCCTGCATGAATAAATTAAATGGGAAGCCTCAACTAAGTCTGTCTGGCGGCGGATGGGTTTTGGGTTTGGCCTCTGCCGAAG 
CCTTTGTTGCCAAGCATACAAATGAGTTGAATCCAAACTCCTGGGCGCTGTCACTCTATGCAAATCTAGAAAACCAGTAACCAGTTGTTGTAGGA 
TGGAGAAGGCACGGTGGACGGACTTTCGGGGCGCTCAACTCACATATTAAATTAAATGTTAATGGCAACAAAGACAATTCAAACGGAGTCAAGGC 
CCGGGCCAAAGAACTGTGGCTAACAAAGAGGCAAACAACACCAATTTCTCGACTGCCCCGCAGCAATTGAGCAACCAACGAGCCACTTTGTTCGC 
CCCATATACCCCGCCCACATCCCAGCCCCTGTCCACCCGTGAAATCTGTTCTCCACCTCCAGCTACCTCTAGATAATATTGCACTGTTCGGTGTC 
AAATGGCTGCCCACTGACGGACTTCTTGGTTTGTCAGGTGCCATGGTTATTCCACCTACAGTAGAAGCTCTTGAAAGTGGAATTTCTTTCTATCT 
AATAATAAAAGATAATTGCCGAGACAGTTTTGGTTAAAAGTGGTCAAAGTGCTCAAAGACTCTCTATCGTCGGTTATGTACGTTAATCATATTAG 
AAGTAAGTGAAAATGTCCCCAGAAAGGACGTTCATTACMGGAGCCCCTACTGTACAAGACCTTGCGGCGGGTTAAGTGACCGATTGGCTGTCAG 
TCCCCAGCCAACCAACCTCACCTCTCGGACTTCTAGTTTTAATTTGAATTTCATTCCGTCTCGTGTGGTGGCCAAACGCTTTGGTGGTTCGCTTA 
GTCATATCACTCGGCTGACTAATCATTGAAAGCGGAGCGTGCTTGGAGGGAAACCTTAAGCCCCAGATATACCCCCCTTCTAACTCCAAACCATT 
TACCGAAATCTCCACCCCGCCAAAACCCCACACAGCTGCAAATTCAATGAACCTTTGCTGCTGTTGACTTTGCTGATTAATTGGAAATTGGCAAA 
ATGTTATTTATTTTCGCGCGGCTGGTCTAGCGCCAAATGGTTGTTTTTCATCAAAGGCATAAAGTTCTATAAATGTTCCCAGACAAATTGGTTTT 
AATGAGCCGGAAACTTTTTCGGTTGAGCTCGGATTTTTACGGGTTTGGCGGATAGAAGGAGAAGCCACATATTGTACCCGAT7GGCTTTAAACAA 
ACACGGACATACTATACTGAGACAACTTCTTCAGTGGCTCGCTAATGCATTATACAAGAATGAACTCCTTTGCAGGCACGGCCGAGCAAACAAAC 
TTCAACTTGTCCGGACCAACTGCGTCATAAAAGAGCCAACGGCAGCGCCAATCGCCATCTCCGTTTCCAGCGAGACAACAAAGGATGTTGGGCGC 
CGGGAAGCAGGAGGAGCAGGACCTGGGAAACCAGCCAGTTGTGGATGGAGCGCAGACAGAACAACTTCATGCAACCCCATCCCCTGGGAACCAGA 
C GGC AGC C AGGAGT CT GTGT GC C TG AT C C T T GC C AT CT C G AAAC G AAAAAG AAGC CGGCGAAAAAT AT AT AAAAT GG AG AAAC AAAAT ATT T TC C 
CGATGTGCATGCATAGTTCAAAGTTAGTTTCAGGCATCTGTGCACCACCAAATACACCACATTCCCCACTACTTACAAGCAGAATTGTGAAAATT 
TTTTGGTTGCCAAGCGAAAGTGATTGAATTCCTAGGGATTTTGTCGGGCGCCAGTTGAAAAGGAATTTTTGATGCGTTTAAGTGCCGAAGTTTCC 
CTAAAAAAGTTTGGCCTTAGAGTCAGGGGATGTATTAAAAAGAAAAATGTCGAAAGTTTTTGAAAGAAGAGATTAACAATGATGTGTGATACAAT 
GATTGTGTGTTTTGGTCTGGAGAATTTAAATCTCAATAAAAAACTACAAAATGTATATTAATGAGGCTAACGATGGTTATATTATGCAAACAAGT 
GTTGGGGGGAATATAATTATTTAAAATAATTGCATTTTTTTGCGATTCCCACTTTGCAATTCTAATCCCTAATAAAACTAGAGATAACTTATCTT 
ATGAAGCAATTTGTTTACTACTGCGACTGGTCTACAGAATATACTGATGCAATCGCTTTAGAAGTGTGGTAGGTAATCCCGTTCGTTCTCAGCCG 
GAAATCACCCCCAATAAGAAGTTTAGATAATCTTAGATACGGTATCTGCTTGTTGTTTGCCACATGCATCTACACCCTGGATTCTAAACTTAGTT 
CCCCCAAAATATCTGACCGAGTTGTCCCCTTTAATTTACAGCTTATATCCGGCATGTTTTTCTCCCGTTTGAATTTTGCCACAGTTTGCGCCAGA 
AGTTGTGAAAAGGGCAGAAAGAGAACGAGTTGAAAGTTCCCATTAGAATGCCCAATCCAAGCCATCCTGTTCCTTTATTGCGTGGCCCCTTTTTT 
TTACATTTTCCGTAATTCCCAGCCAGGCACCATCACGTGTCGGGTTTTCTCCAAGGTATTGGGCTTCGTGGAGAATCGATTTTGTGGCCTGACGA 
GCCTGTTATCCTTGGAGCGAGCGCTTTTGGCTTGTTTCAGGACATAGAAATTGCCTGCTGCATGTCCGAAAGGCTCAACGAGAAATTATGCTAGC 
TACACTGAAATCTAAACTTCAGTTCAAGGAACTCAGGAAAGAAAATACTATTCAGAAAATGAAGTTTTTATGTTTTAAAACTTTTATAACAGTTT 
ATAATATTATTGATACGATGTCAATATTTTATATTTGTTTTAGTTCGGTAAAATGAAGATTATCATGAATTTTTCTCAGTGTGCGAATGGAGGTG 
AGTGCGAAAACGAGAAGGAGGAAGCATCCATCACATGCATCTGCAGTTGGCAGGCAGGTAAAAATGTTGTACAAAAAATGAATAACTGACATTTT 
AAGTGGGAAAGTTTTTGCATCAAAATCAACTGCCTGCCAATGCTGCTACCGAAATTCAATATATTAACAAGTGCTCCGTAATTGTAAATTATCGC 
CACTTTTTACAGTCATTACTATTAGTAAGTTTATAATTATGTGGGTAACAAAGCGGTAAAATATGCCAAACGAAAAACAACACCAGCTATTAGCA 
CAGTTTCGAAATACGATAAAGCTTGAAAACCGGTTATTGCACCAACGAAAGCTGACTTTTGTTGTTCTTGTTTTGTTTGTTATTGTTATTTTTGC 
TAGAGC AGC TGAGAT TTCGCAGCT GCCCGT GAAGTTGAAAAC AAAAT AAGAGAAAAAAACAAACGGCAGGT GAATCAACAGC GGT T CAAGCGCC C 
AACCCACCCACAAGTTTGTTGGAGGGGGGACGACGACGTAAAAAACAGAATAATAAAGACGTAAACCCAGTTAAATGGAAAAATCAAGGCGAACG 
AACACACACGGAAGCAAAAGCACAAAACAAAAGGCGACAAAAACACGTCGAAGGCGGCAAAAAAATTTGGAGAACCCAAAAATCAATAGAACTTC 
GAAATATCCATAACCATGTAACACAGTGCACATTCACTTACAATAAGTGTGCCTTCCTCTTCTGGGTTGTTTTCTTATCGCTACTTTACATATGT 
TGGCGCCACTTGCAATGGGGAAATATATGTAAGTGCGTGCGTTCCGTTGGTGGTGTGTATAAATATTTAACATTTGCCGGAGGGTGGTTCCATCG 
CCCAGAACTTCGGCTTCTGCATGGAATACTAACATAAATCGCGCTGCACAGTCCACTTTTGTGTGTTGCACTTCGAGTCCCAATGGTTACCAATT 
AATGTCGCTACGAAATGGAACTGCTAACGACTGTTTCCGCCGCGGACCGACGTTCCTGCTTCGATTTGAAATTCGAAGTGAATTGGCCAAGCCAA 
GCGATATTTGTGGTAATCGAATCGTCGATATGAAACACACTATTCTATGTGCAACCAACCGAGTTGGCAGCTCCCAGCGCGATTTTTCAGTATAT 
TTTGCAGTATTTTAATTGTTGCAGACGGCTTTTTAAAGCGCCACCAGTTCGTTCACATTGTCGCTATTTTTCAGCATTTCGCGGGATCTGTATTT 
TTTATAGGCGAATATTGAAATTTGCATGGTCTAACATTGATTATAGTGTTGTATTCACCGAATATTTATTGGTTTTACTGTCCATTGCTTTGAAA 
GCAATAGAAATAATTGAGGATCAAGTAAGTTTAAGATAAATTATTACAGTTAAATGCCGATGCAGGCACTCACGCACACTAACGCACCCAAAACA 
GCGACACGCGAATTTGCGAATTTCGTTTTGGCCCGTTGTTTGTGTGTTGTTGCTGTTGTTACAACGTTGTTTGTGCTAAACATTAATTAATTGTC 
GCAATGTCTGTTTCGCTATAAATATGCGAAACGTGCCGTTGTTATTTGCGCGAAACTAAATGGCCMTTGCGCTTTACCGGTGTTAGTGTAGGTG 
TATGTGTTGTCTGTGTGTGTTTGTGGTTGCTGGCGCGGGCGAATGGCGGGTAATCCGCATACACTCGCACTTATTTTTTGCAAAGGGTTTATGCG 
ACACTCGCTTTGATCACGATATTAAAAGCTCAAACACCACTCTACCATTATACAAATATTTATTTATGTAAAATAAATGCGGTTAAATAATTTAC 
CTTAGAACTTGCCCTAAAGGATATTAAGTGCAATTAAAAGTAGAACAAACATTCATTGCTAAGTTACTAAATGAATAGCATTTTCTTTTGCAAAC 
TTGTTCATTATAAACTACTCCTTCGAATATATATTTACTTAGTATATTTTCTGTTTCCACGACTTTGTCTCACACATAAACACGTAAAAGCGCGA 
AACAGACAC 
(SEQ ID NO: 1078) 

Exon: 11929.. 11814 
Exon: 3379.. 3066 
Exon: 2996.. 1001 

Start ATG: 3199 (Reverse strand: CAT) 
Transcript No. : CT27700 

AACGTCGGTCCGCGGCGGAAACAGTCGTTAGCAGTTCCATTTCGTAGCGACATTAATTGGTAACCATTGGGACTCGAAGTGCAACACACAAAAGT 
GGACTGTGCAGCGCGATTTATGGTCGGCGGCTGGATTCGTTTGGAGTGACTCGACTCCCACAACAATTGCCACTGATGAGGTGAACGAACTGGTC 
AGCATCCCAGCATCCATTTTTTAGCCCCTCCATCTCTGGCAATTAAGAGAACCCTAACTGGAGGGCGAGAACTGCTGGCGAGGAGGAGGGAGCAA 
CTCGACTAAAGATGCCGTCGCTGCGCCAGAAAGTCACCACTTACTTCCGCCAGTTGAGCTTCATCGCGGAGCCTCGCGAGGGTCGCCGGCGGGCC 
AGCGAGCACGAGGATGATGATGGTAGCTTCATAACTAAATATCTGGAGGGCCGAATGCAGCGTCAGTTTGTTGAGGGTCCGGAGATTTACAATGG 
CCAGAATCCCACGGACATGCCCGTTCTCAAGTTGGATACATACGAAGCTGGAAACTGCTCTATAACAGCCGCCTGCACAGGTCCTGACGAGGGGC 
TCACCGGCATCAAGCGCTCCAAACTGCACTTGAGCACCAGCTTCGCGGATGACATTGACTTCATAGATACGCAGCAGGAGAACGATGATTGCTAT 
GGCGTTAGGAGGAGTACACCACCAGTTCAGGTGGCCACACAGAACTCAACCCGACTTACCAGCAAATTCGGACGGCCGCCGGCAGGACCTCGCAG 
ACAGAGCAGTGCGGAGCAGCCCTCCGGATCTGCTTCTGCGTCGGGAATGCGTTCCCGCAAGAATTCCAAAAGCAGCATAGCCAATCTGGCCGCTG 
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CGTCTGCTGGCACGGATGCTGGAAAAGCCAACAGCGATCAGAACAATAGGAATGTGTTAAATGCAAAGACTGAGGTCTCGACGGATGGGGATCCC 

AACTCAGAGCGGGAACGTTTGCTCCGCGAAGCTGTAAGCAACCAGGGTGGCTCTGATCCCGCAGCTTTGATCGCTGGCGTGGAGAACTGGCGCAT 

GGAGTGCGACTTCGCATATGGTATCTCGGTGTCCCTCTACGAGACCAACATGCTGACTAAAGAACCCATGGGCAATCCCATCGCCGACTGCTACG 

GAATGGTAGTGCGCGGAGACTCAGCTGCCATGGCCATGGCAGATGGTGTCAACTGGGGTGATGGCGCTCGTCTGGCCGCCCGTTCAGCTGTTCAC 

GGTTGTTTGGACTATCTGGACCGGGCGGTCTTCGGACAGGCGCTGGAGTGCCGAGCCACCACAACCCAGGAGGTTTTTGTGAGTTTGCTGCGTAG 

TCTTTGGGAGGGACACGGTTGTATCCTGGAAGTTGGAGGAGCCCTATCTACTCTCACCATAGCCGTTGTGCTGCCGCTGGACGGAGCGCCAGGGA 

AATATGTCGTGTGCTCGTGTAATGTTGGCGATTCCCTGGGCTATGTGTACTCCAAGAAGCACGGAGTACGGGAACTAACACAGGCATCCCACGAC 

ATCAGTTCCATGAGGGACATGCGCGATGCGTTGGGTGCCCTTGGACCGGCGGATGGCAACAAGCCAGAGCTGAGCAATCTAACCTTCTCGATGAC 

CTGCATTGAAAGTGGAGACATAGTCTTTTTAACGTCGGATGGAATCAGCGATAACTTCGATCCAGTTGTGGGCAAGTTTGCCGAAGCCTGGACTC 

CGGATGTTAAGCTTCAGACGTCGGGACATGTCAAGCCCGCCAACTTGGCACCCAAACGGCAGAACAAGAGTGCATCCGCCATCTATGCCCGTCTG 

CATCCTTCGACTCCGCCCACGCGACCAGCTCGTCAGTCCAAGGCGGAAAGTCCGCCTAATAATGCGCCTAGTCGACCAAAGTATATGCGCTCCCA 

AACGCTTATTGAACCACGCCAGGGATTGGTATCAGCTCCACCGCCATCGGTGGTGCCCCAACGCATTCCGAAGTCCATTTCAGGACTGCCCCTTG 

TTACCGGTCCCCAGCGGCATGCTCTTACCCTCTACCGCTTAGAGGATCTGCTCAGCTATGGCATCAACGGCACGTTTTCGCCCTGCGTATCGGCA 

CGCAGGCTCTGCCACTTGCTTATCGATTTTGTACGGATGATAACGTCAGCCAGGAGGAAGACCCTGGAGCAAAGGGAACTCTTCTATAAACTCTC 

TACGGGACCAGATGGCGCCAAGCGTGAGGTGCAACTGAACCGCATGCAGCATCGCGCGGCCAGGAAGCGTGTGGTGGACAGCAGCGCCTTTGTAG 

CATTGCCCGGGAAATTGGATCATGCCACTGTGATGGCCTACACGGTGGGCGGCGGAGAGGAGCACAATAATGGAAATGAAAACGGCGATGGGGGA 

GCTATTTCGCCTGTACTGCAATCCAAGGAGTTCAAGGAGACGAATTTCTGA 

{SEQ ID NO: 1079) 

Start ATG: 297 (Reverse strand: CAT) 

MPSLRQKVTTYFRQLSFIAEPREGRRRASEHEDDDGSFITKYLEGRMQRQFVEGPEIYNGQNPTDMPVLKLDTYEAGNCSITAACTGPDEGLTGI 

KRSKLHLSTSFADDIDFIDTQQENDDCYGVRRSTPPVQVATQNSTRLTSKFGRPPAGPRRQSSAEQPSGSASASGMRSRKNSKSSIANLAAASAG 

TDAGKANSDQNWRNVLNAKTEVSTDGDPNSERERLLREAVSNQGGSDPAALIAGVENWRMECDFAYGISVSLYETNMLTKEPMGNPIADCYGMVV 

RGDSAAMAMADGVNWGDGARLAARSAVHGCLDYLDRAVFGQALECRATTTQEVFVSLLRSLWEGHGCILEVGGALSTLTIAVVLPLDGAPGKYVV 

CSCNVGDSLGYVYSKKHGVRELTQASHDISSMRDMRDALGALGPADGNKPELSNLTFSMTCIESGDIVFLTSDGISDNFDPVVGKFAEAWTPDVK 

LQTSGHVKPANLAPKRQNKSASAIYARLHPSTPPTRPARQSKAESPPNNAPSRPKYMRSQTLIEPRQGLVSAPPPSVVPQRIPKSISGLPLVTGP 

QRHALTLYRLEDLLSYGINGTFSPCVSARRLCHLLIDFVRMITSARRKTLEQRELFYKLSTGPDGAKREVQLNRMQHRAARKRVVDSSAFVALPG 

KL DH AT VMAY T VG GGE E HNN GNEN GD GGA I S P VLQS KE FKETN F * 

(SEQ ID NO: 1080) 



Celera Sequence No. : 142000013384505 

GGATCAATATATCTTAAGGTAGAGGAGAGTTACAGGTGTTGATTTGCTTTTCATAGGCAAGTTGAGTGCATTTTGCGCTGCGGCTCTGCTGCTGC 
TCTGCTGCGGCGGCTGCGTTGAGTTGTTGTCGTTGAGTTTTTGTTTGAATTAGTCTTTGTGATTTGGTGGGTATACGATTTATAGTAGTTGCATT 
ATTATTATTATTAGTTAATAGGTATTATCGTGTCTCTTGAATATACTCTGTTAGCTTTTGCTGCTCTCTGCATTTATTCTTTATCAACATTTAAG 
GTACGTTGCACATTTTGTCCGCTTTTCGGGGGATAAACATAAACTGTACGTTATGTTGCGCTTCTAGCGTCCATTTTTGAATTACTCATAAATAT 
TATTATATTTATAATTTTACTTATTAATTAATTTTCAAGTCTATTTGCGTATCACGTTTAAAACAATGTTCAGTTGTGAAGTATCCGTTGTTAGA 
CATATAAAGTTCTTCTTCCTCTAATCGGGCTTTTCCTTCGCTTAAGTCCTCTTTCCATTTGGTTTTCCTTGTATTTCAGCCAGGCATCCATTGAG 
TGTTGTACTGGTTGCAGTTTGTCGATTGGCTGGCTGTCTCTATAAAGATTGCGCATTGAAGTTGACAGGGGAGTTTTGGAGTTTGGGGGAGATGT 
TCTATGTAGCGGGGAATTTTGCATCTTTTTCGCTGTGTTGATTTGTTCTCATTGTCGTTCGTTGTTGATGTTCTTTTTTGTCGGTCGGTTGTGTG 
AGGACAGTAGTTTGTGTGTTTTTTACTTGTATTTTGTTTGCTTTCAGTTTTCTTTTCTTGAAACAGTTTTTCTCTTCTCACCTAATAGCTTCTCA 
AATTGGTTTAATTGTTTTATGATACACACTTGAGATCACAAACATTACTCTTGTTTTTTTTTTTTGTTTTTGTTTAAAAAAGTTCAAATTTCCAG 
TTGAAATGTTTTCGCTTTTTTTGTAAAAGTTTTTTCATTTTCCATTTTTTGTTTTATGTTTTTGTTAATTTTTTTTTGTTGGTTTTATGTTCATA 
TCGATTTTTTTTTGCAACTCATTGACGGGTTCGGGGTCCACCGGGTGTTCTTGCCATTCATGTCGGTCCTCCAGATCCATCTTCAGATCCAGCTC 
TAGATCCAGATCCTGTTCCTGCTTTCCTACTCCAGACACTGATCCTCCAGCACCAGTGTGGTTACGACTGCACATTGAGCAGCAACTTTAGGTTG 
TGCGTGTCGAAGTCATCGAGGAGATTCCGCTTCGAGCGGCCTGTCTGCATCGGCACCTGGAACTTGGGCATCGACGGCATCATCATGGGAAGCTT 
ACCCTCCGACCTGGACAACGTCCAGTGCTGCGGTGGCTTGGGCAGCGCCGTTGGAGCTGGTGCATCAAAGCACTTGCTGCCCGCGAAATGCGTCA 
GTGCGGTGGGTATGAGTGGGTGCTGCTGCATCTGGTGCTGGGGCGATATCTTTCCGCCCAACGAAGGGCTCTTCCTCTGCTTGCGAGCCGATCCT 
CCGACGATTCCCGGCGAGAAACCTCCACCCAAAATGGTGGCTGGTGACTGCCGAACCTGCATAGCTCCCTGGGGCGAACGTCCTCCTCCGTTCCT 
TCGGTTGTTGTTGGCAAAGAAGAAGCCTCCACCGCCTACAGAGGATCCCTGCTGCTGCTGCGGCGACTGGCGCTGCTGCTGCGACTGCTGATACT 
TGTTCTTGGATCCGGTCGACTGGGACTGCAGTTGCCGTTGCCTCTGGTTCGGGGGCGTGGCAAGAGCGGATAGACCCAGGCCAAAACCGAGACTC 
GTGGGCGAGGAGCTCAGGAATTGGGCGTTATTTGTACTGCTGGCAATACTGTTGGAATGTGCAGTTACCAAGGGCGTTGCTGGTGCGGATGCGGA 
TACGGATGTGGGGGAGACGGGTGCGGATGTGGGCGAGGGGGTGGATGAGTTGGATGCGGAGGTCGCTGATGCTGCTCCCTTTTTTAATCGTATGG 
ACTTGGTGGCCGATGCGGTTGTCATTCTGATGTTGTTTCTAACGCTATTGTGCGAGTATGTCATAGTAATTTCGTCGCTTTCCGATCTCTGGATC 
GATATCTCTCTCTCTCCGATCTCCTCTTCTCTTTCTCTTATTAAAATTTCTGCTGTTTTCCCTTTTTGTTTTTGGTTAATTCGCTAAATTAATGC 
TTAACGTTAAATATCACACGCACTTAAATCAATTCTTTATAAATCGAAACCAAACGCCTTCTGATTCTGATTCTGATTCTGCTTCTGCTGCTATT 
GCTATTGAGATGTTGATCTAGCTGCGCCGCTCTATTTTCTTTGGTAGCTGATAGTCGCCGAAGTTGAGATGACCTCCAAAATGGAAACGAAAAGG 
GCGACGGGCGGCTGAGCACGTAGCTGCTTTTCCACTTCTCCTTTATGTTTTATGTACAGCTGTTGTTTACTTTACTTTTAGTTTTACTTGTAACG 
TTATGTATTGTTTATCAGTCTGTTGATTTATTTTTTTTTTACTCCGGTTAAGCTATCCTTAGCTATAATTTCGCAAACTCTCTGTTCTTCGCTGA 
TTTAACTGCTGCTTTCGCATTCCAAGCCGTGCTGATCCTGTTTTTTGGAGAAGAGCGAAAGAAACCATTGAAAATTCGAAACGATACATATTGGC 
T T AT G T T AAC AGG AAAAT AGT T G AAGTC T T AT T T C T T AT G AAAC T AT T TAT G AAAT C AGC AT C AAC CG C AT T AAT T AC AAT GC AGT T T AT TG AAA 
TACATTCTTACGTTAAAAGGTCTATATTTAAATTTAGGTTGAGTAAGAAAGATAATATGTACATATACACATGGAAAAAACGTCAGTATACGACA 
GAATACCTTTTTTACCTCAGTACGAAATGTAACGTTCCAAAGGTAAAACAAAACAGTTCTATTTCGATGGCCGCGTGTGTGTGTGTATAATAAGA 
AACGGGAACCCGAAAAGGGGAAATAAGACATGTAACGATTACATTTCCAGGTAGCAACAACCTGTAACGTAGTGCTAACTACGTAACGGCTGCAT 
TATCGTTTTAGCGTGTGACTGTGTGTAGACTGTGTGTGCTATTTACGTCAACAAAAGGTGTTTGTTTGGCTATTCAAGCAAACTTGGGACGTCAG 
AGGGACGACCCCACATGTATATATAAAGATACATATGTACATTTATTTTGCGAGAATACCTGAATTAATGAATCAACACACACACGCGCATACAT 
ACACACGCAGCGATTACTAACACAATTGCGACTTACTTTTTTGCCGTAGACCTCACACCAACGAGTGTGCTTGCGTGCGTGTGTGTGTCGCAGTT 
TTCTCAGCCACTTCGTTATACAGATTTCGATATTTCGACAAGCGTTATATACAAAATGGGTTGAGTTTGGGTTAAAAGTTGCCTTGATTTGGATG 
TTTCACTTTAAAATGTTGCGGAAATTGGTTTGTTTTGCGTTTTTTTTTTTGCTCTTTTCTTCGGATTTTATTTGGGGTTATATGTGGATGTGTTC 
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GCAGCTGTGTATATATATGTGTGTGTGTGTGTTTTGTAAAATCACTTTCAAAATGGCGACGGGCTGGCTTCGTATTTATATATTTATATATATAT 
ATATAGATGTAATTCCGCTTAAAGCTTTCTCTTCACTTACAGACAAAGGCTCAAAAAAATCACACTTTCAACTGATTCTGATTCGGATTTTGTTA 
AACTCTTATGTACACCACCGTCTATTGCGCAGCGTTTATTTCGAATTTGATTTCCGTTATTTGCGCTCGCCAAGTACTCCACTTTAGTGTGTGCT 
CTTCTGTAGCCTCACTCGTTGTACTTCGACGCGTATTAAAATATTAAACTTTACACAAATATTTGATTTATGTCACTTAACTTAGAACATTTACA 
ATATGCGTTACATATATATTTATTTTTGTAGCAGAAATTTTAATCGATATCTTTATCTTCTTCGCTTATGCTTGTTCTTTGCGCATCGGTACGCT 
TGCCTCGAATTTTCCACCATGACTCACACAAGCGCACGACTGCGCGGACTTTTATATGCCCGCAGCGGCATTGCCAAGCCTAAAAAATAAATTTC 
CCTCTTTATGTATACATCGAAATATGTTTAGGCAACTCTGGGTTCGAGACACGGGTGAATCAGTGGATACTCAAGCCGGAAAAGCTGGAGGCCTG 
TTCAGACATGCGGCGACCGCGGTTAGTAAATTAGTAATTATTTTTTATGACCAACCTATTGTTTTTACTCAAAATAGTCAATGTTGTGGATTCGG 
GACTTGCACGTGAAACAAATGTTCAATATTGGTAAAACAAATTAAATCTACCTGTTTTTGATTACTACCTACCTACTAGAGTTGTTACATCTGCC 
GTTATAAGAAAACAAAGTACGCGGCAATTTGAAATTGAATATAATACGACAGCTGGCATTTTGGAATGGATTTCGGCCTAAATCCGGATTATAGT 
TATCTTATGTAAATAAATATTCAATTTGTTTTAGTTTGGCGCATAAGCGGTTGTTTTTGTATATCATTATATGAATAGTTAATAAAAATTGTGCA 
TGTAAATTCAATAGATTAAGTGCACGTAAAATGAGAGCATTACGTTTGCAATTAAAAATTAAGCCATTAAACACGAAATTGAATCAATTAAAGAA 
TGTCATCTTCTCCATATTCGGATTACTGTCTTAATTTTAATGGTTGAAGCTCAAATTGAATTCAACGGCCTTGCTATTTTTGTGTTTTTGGCAAA 
TAAATTAGTGTTACACAATCTTTTTAAAAGAGAAGATAAATGTGATATTACATTTTATTGAACAGGATCAGTTGCACATATCTTTTGAAATCAGC 
CGAAGTAGCTGAATAACACAAGTAATTACCTTATACAAATTCAATGCATCAAGAAATCTGACACAATGGGGTATAGTGGGATGCCCCGTCAAGAG 
GATAAATAAAATAAAATAGGATGATGAGAACCGCTTACAACCGGCGACGCGTGTCACTCAAAACC 
(SSQ ID NO: 1081) 

Exon: 4005.. 3651 
Exon: 2602.. 1001 

Start ATG: 2020 (Reverse strand: CAT) 
Transcript No. : CT27750 

GGAAAATTCGAGGCAAGCGTACCGATGCGCAAAGAACAAGCATAAGCGAAGAAGATAAAGATATCGATTAAAATTTCTGCTACAAAAATAAATAT 
ATATGTAACGCATATTGTAAATGTTCTAAGTTAAGTGACATAAATCAAATATTTGTGTAAAGTTTAATATTTTAATACGCGTCGAAGTACAACGA 
GTGAGGCTACAGAAGAGCACACACTAAAGTGGAGTACTTGGCGAGCGCAAATAACGGAAATCAAATTCGAAATAAACGCTGCGCAATAGACGGTG 
GTGTACATAAGAGTTTAACAAAATCCGAATCAGAATCAGTTGAAAGTGTGATTTTTTTGAGCCTTTGTCTGATCAGCACGGCTTGGAATGCGAAA 
GCAGCAGTTAAATCAGCGAAGAACAGAGAGTTTGCGAAATTATAGCTAAGGATAGCTTAACCGGAGTAAAAAAAAAATAAATCAACAGACTGATA 
AACAATACATAACGTTACAAGTAAAACTAAAAGTAAAGTAAACAACAGCTGTACATAAAACATAAAGGAGAAGTGGAAAAGCAGCTACGTGCTCA 
GCCGCCCGTCGCCCTTTTCGTTTCCATTTTGGAGGTCATCTCAACTTCGGCGACTATCAGCTACCAAAGAAAATAGAGCGGCGCAGCTAGATCAA 
CAT C TC AAT AGC AATAGCA GC AGAAGC AG AAT C AG AAT C AGAAT C AG AAGGC GT TTGGTTTCGATT T AT AAAG AAT T G AT TT AAGT GCGT GT G AT 
ATTTAACGTTAAGCATTAATTTAGCGAATTAACCAAAAACAAAAAGGGAAAACAGCAGAAATTTTAATAAGAGAAAGAGAAGAGGAGATCGGAGA 
GAGAGAGATATCGATCCAGAGATCGGAAAGCGACGAAATTACTATGACATACTCGCACAATAGCGTTAGAAACAACATCAGAATGACAACCGCAT 
CGGCCACCAAGTCCATACGATTAAAAAAGGGAGCAGCATCAGCGACCTCCGCATCCAACTCATCCACCCCCTCGCCCACATCCGCACCCGTCTCC 
CCCACATCCGTATCCGCATCCGCACCAGCAACGCCCTTGGTAACTGCACATTCCAACAGTATTGCCAGCAGTACAAATAACGCCCAATTCCTGAG 
CTCCTCGCCCACGAGTCTCGGTTTTGGCCTGGGTCTATCCGCTCTTGCCACGCCCCCGAACCAGAGGCAACGGCAACTGCAGTCCCAGTCGACCG 
GATCCAAGAACAAGTATCAGCAGTCGCAGCAGCAGCGCCAGTCGCCGCAGCAGCAGCAGGGATCCTCTGTAGGCGGTGGAGGCTTCTTCTTTGCC 
AACAACAACCGAAGGAACGGAGGAGGACGTTCGCCCCAGGGAGCTATGCAGGTTCGGCAGTCACCAGCCACCATTTTGGGTGGAGGTTTCTCGCC 
GGGAATCGTCGGAGGATCGGCTCGCAAGCAGAGGAAGAGCCCTTCGTTGGGCGGAAAGATATCGCCCCAGCACCAGATGCAGCAGCACCCACTCA 
TACCCACCGCACTGACGCATTTCGCGGGCAGCAAGTGCTTTGATGCACCAGCTCCAACGGCGCTGCCCAAGCCACCGCAGCACTGGACGTTGTCC 
AGGTCGGAGGGTAAGCTTCCCATGATGATGCCGTCGATGCCCAAGTTCCAGGTGCCGATGCAGACAGGCCGCTCGAAGCGGAATCTCCTCGATGA 
CTTCGACACGCACAACCTAAAGTTGCTGCTCAATGTGCAGTCGTAACCACACTGGTGCTGGAGGATCAGTGTCTGGAGTAGGAAAGCAGGAACAG 
GATCTGGATCTAGAGCTGGATCTGAAGATGGATCTGGAGGACCGACATGAATGGCAAGAACACCCGGTGGACCCCGAACCCGTCAATGAGTTGCA 
AAAAAAAAT C G AT ATGAAC AT AAAAC C AAC AAAAAAAAA T T AAC AAAAAC AT AAAAC 
(SEQ ID NO; 1082) 

Start ATG: 938 {Reverse strand: CAT) 

MTTASATKSIRLKKGAASATSASHSSTPSPTSAPVSPTSVSASAPATPLVTAHSKSIASSTNNAQFLSSSPTSLGFGLGLSALATPPNQRQRQLQ 
SQSTGSKNKYQQSQQQRQSPQQQQGSSVGGGGFFFANNNRRNGGGRSPQGAMQVRQSPATILGGGFSPGIVGGSARKQRKSPSLGGKISPQHQMQ 
QHPLIPTALTHFAGSKCFDAPAPTALPKPPQHWTLSRSEGKLPM?MPSMPKFQVPMQTGRSKRNLLDDFDTHNLKLLLNVQS* 
(SEQ ID NO: 1083) 



Celera Sequence No. : 142000013384828 

TGGATGGCATTTCAAACATGCGAGTGCTGAAATGGGCCAGCTTTAACTATTCCAATACCAACTATGCCATCCGCCTGAGTGATATCTACATTATT 
AAGGTATAGTTAAGTTCTCACCACTTTGGTTGACAAAACTTTACTACTAACTTTGCTTTACAGTGCGAGGACTGGTATGCAACAAATGGTGGAAA 
CTGTGAGGAGATAACACGGATTGCGGAGCAACTGGCTCGCAGTGCCTCGATGGCTCTGCTGCCACTTCTCGATCTTTTAGCAGCTGCTGGCATTA 
ATAAGCTTGGCCTAAGAGCCACGCTCGATCAGGATAACGTATGTCTATTCAATGCGGAGGCAATTCAAATCACTAATAATATTATCCACTCCTAG 
GTGGGCTATGAAGCCGGAGCTCGTGGCTCCAAGCTACCGCCGTTGTACATGAACGCGCTGGACAACCATCTGATAGCCACGCTGCATGGCGAATC 
CTCCGGCATTCAAGAGCCCATCATTCTAGAGCTGATCTTTTACATACTGAACGCTTGACTTTAGTCCCTCGTAGTGCAATTCTCCCGCTGGATCC 
GAAGACCAAACGGAAACATTCTTAACTCTAAGCTCGACGCGATCAAAGCCAATGGACAACTACACGAGTCCATGGGGCAAGAAATGAGGACATGT 
TGGTATTGTTGGATGGCTCAGTTAACTGTGTATATATTTAAGTAGCTCATCGTTTGGTTTTTGTCTATACATTTCTACAACCCCCCCATTTCTAA 
CCTTTCACTTCAACTGTTATATTATTTCTGATTTTTATTAGTTTTGAGTTTTCATTGAAAGTAAAACGTATCTAAGAAACCTAAATATATATATA 
TTTGTATGTGATGTTCTTTAATATGCTTTACACATTTACCACTCGAGTATATATGAATAACAACTATATTAAAACTGTTGATCAAATGGCTGTTT 
CACTTTGTGGATATAAGATAAATTAGAAAAAACAAAGGCCGCGTCTTGTTAAATTATTGATAGCAATTCTTTATTCAATTTAAAGTGTTTCAATT 
GACTTCGTCTTGAGAGTGTGTCTGATTTACAATATACGATATTATACGATATTATCTTTACGATGGGTCTAAAAATCGTCACCACCCGCAGCCGA 
CACAAAATATATAACAACCTATAAATGTATATAAAAATGATGTTCATTCTCTCTCAGATTGTACATAATTTAAACTACGTCTAGTACCGGGGTTA 
CAGACACACAAACTACTAATGCACTCCTCTAATGTGTTGGTGTTTAGATGCAGATGCAGATGCTGATGCTGATGCTGATATGTATGCTATATACA 
AAATGACATCGGGAAGCATTTGGTTGCACTCGATTTATTGCTCGGCTCGTGCCTCATCTGTTGTATATGTTCTATTTTTGTGTATATATTTGGGT 
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TTTATATAACTTAAGCACTCGTGTCCCTTCGATGTTGACGGCAAACCAAGAACCCCCATTATTGAGCACTACAGCTCGTCGTGTTTGGGTCCCGA 
GAACTCATCGTGCGAGATGAAACCGTTCTTATCCTTATCCTCGTGCTGAAAGATCTCCTCCACCAGCTTGTCGTTCTCAGCCAGCATGTTCTTCA 
GCTCCTCGGAATCCTGTCCCTCCACGGCGGTCATCTGCTTCTTTAGATACTCGCTGACCTTTAGAGGAGAGATATAATATCCAGCATTAATTTAA 
TTGTAGCTACTTTATATATTGCTCTAGTCTACTGAAAGCATAGGCTGGTTAGTAACTTATCCCACATCCAACGGATATTTCAGTGTAGAGAGACA 
TGTATATGTAACAAAAGTGCGATCCATACATAGACGATTACCTCTTCGCGACTCAGCTGCTTGTCGGCATTATCGTCGATCTCCTTGAACACATT 
GGTGGTGGGCGGGGCATTGCCGATGTTGATCAGCTCCACATCGAAGAGGAGGGTGGCCTTGGGCGGTATCACGTTGCCGGCACCCTGGTCACCAT 
AGCCCAGCTGGGGAGGAATGGTCAGCTTGCGCTTCTCACCTGGATTTCAAAGTGAACTCCAAATTAATTTACACATTTTACCGAGCACTTCTCGC 
AAGCACTTACCCACGCACATGTTGAGCAGACCCTGATCCCAACCCTTAATCACCTGGCCGGCGCCCAATTGGAATGTGAAGGGCTGGTCGCGGTC 
GAAGCTGCAAAAAAAAAAAAAACGAGATGAAAAAAAACACGGGGACATACAAATTGTTAGACGCTTGTTGGCCACAAAGTTTCAAGTGCTTGCAG 
GTGTCGTGGGCTGAAAGCCACCTTAAACCGAAACAATTCATCATTTCTTTATGACTGCTCTCTTTTTTTCATATCTTTCGTTATTTGCAGTCTAT 
AAAAATTATTTATTTGATGGTTCCGCTAAGCGAATTTACTTTAATTTGCTAAACGTAAAAAAGGGGGGATATAAAGGCAATGGAGCTGCTCAGGT 
ATGCATGAGAATGTCGCGCTTGTTGAGCTTGATGTATGATGTGGCACAGTTGCAGGCCATTTCGCGATGATTATGATATATAGAAATGGATTCAC 
AGTAGCTCAGGGTTTGCTAAAAAGGTCGGGGGCATATCGCAAATGGAGCCATAAACACGCTGTAAAATGGCGTTTATAGGCCAACATGAGCTTAT 
GGGCTTTAATTCAGTTTTGAATTGTGCATTTTTCAAGGGTGTTAATACAAATTAAGGCTAAGCCGCATATACAGATACATAAAACACGAACAATA 
TAAATTCTAAGTACATTACACATCTCATTTCCACACAATTGAATTAATCAAAATTTGTTTATTTCAAAGCTTAAGTAACGATCTTCAGCTGAAAT 
AATTAGTTAAGACTGTAATATAGCTGTTCTCCTATTGTTCGAAACATTTTAAAAGATTTTTTTTTTATAATTTTTTTGAACAAATGTTCTTAAAC 
TTTCTGAAGCCTCTAGAGATCAGAAGTTGGCAATTGAAATGTGTCGCTTGGGCCTGCTGACTGCAGGAAAAGTTTCCTTGACGTTTTCCTTCGAA 
AACTCTCCTGTTGTTCATCATTATATTATTATTGCTGTTTGTTAAGAAAGTTGTTTGGCTGTTGGGTTTTTAAGAACTTGTTGGTTATCCCCGTT 
TAAATGGAGCCGAGACAGATTATTGCACTCGAGGCTGGGGAAAAGGAAACAAATGATTAACCAACCAGGAGGAGTCACCCACATGGCTGGGAAAA 
AGTTGTAGGGCATGTCTGTTAAATATTTTATGTGCATCTCTTGGGCATTTTGCAAAACATTTTTACATGCGCCGGCTGTTGTTTCAAATTCCAGT 
TCAAGTCACAACCATGATGGGGCAAGTGGCAAGTGGCAAGTGGAGGTGCGGCAAGTGGCTCGGCGGATTGCCGCTGAATAATCCACCCAACCGGG 
CCGCAAAAAAGAAAAAAAAACGAACATTCACATTCAGAGCGGTTAAGTGTTACGCTGGAGCGTAACCAAAGTGTCGTCGTTGCTGTTTTTGTCGC 
CCGCCTTTCGCTCAAGTGCAAATCTCCAGATGGAATAAAAAAGCGTAAAAAATTTGTACCACCACCCCCCTTCCCAACCCACTTGAGATTCCGCA 
TGTCCTGTGTGTGTTTCGGTTTGGTTTTCCAGGGATTACGCGGCATCTAAGACACCTTGGTGGCTTGTCTTTATTTCGGTGTCGCCGTCTGTGGC 
TGTGAATTTTTATTGCATTATTTTCCATGAATATCTAGTACAATATGTGAAACATTTTATGGAGTCTCCCTTCCTTCCATTTCATATCCTTGGAG 
TGTTTACTCCTGAATCCTCGCTAATTTATGCGATTTGCGAAGCTTACAACAAGCTCTACATGTTTATTTTTTTTTTGTACACTATTGAACGGGTA 
TCTAAAATCGCTTATTCGCTTTTTAAGTTTTGAGTTAAAAACTGCTTATAAACTTTAATAAATAAGCATTAACTGCAATGACACTTTAATTAAAT 
TTCAACAATATTACACCCCACAACTTTAAATGCGAAACCGTTTAAAAAACGGAGCTGTCGAGTCGTAAATAAATTAATAAAATCGTTTAAACGTG 
TTGATTGTTTCAGTCAGCACGATTGAGCCCCCTCCCCCCCCTCTTCTCAACGGACTTCCGTGTCCTTCGTCCTGTTTGTCCCCCTGAAGCAATCA 
AATATTTATGAATTACCGGACAAGACAAACAAAAGGTCAGCTGAGATCAGACCTGAATTCACTCGCAGGTGGGCAAACAAGTTGCAGTTGCTGTT 
GCTGTTGCAGTTGTTGTGCAATTGGCAAATATCAAACGGAATTTCATCGATATTTTGCAAGCGATTCGCAGGTTTTTCCAGCTCGTCTCCAATGG 
GGCTTAGATTGTGGTCTAAAGTGGGAAATAATGGGGGGTCTGCTGTCAGGGGGGGAGGCACAAGTGTGTGTGTTGCGGGGTGCTTCTTTTTACAT 
TTTGGCTTTATGGCCAGCGCATTATCATAAAGTCAAGCAAAAAATGTAAACTCATTCATCCACTCGCTAAATATTTAATAAGTTTCGATTTCATT 
CTTCCACTGGCTGTGCCGTGTGTTGTTCCTTTTTTAACCCAATCCCAATTCGAATCCGTGTTGGTATCTGTATCTGTATCTGTATCTGTATCTGC 
GAATTGCGAGCCAAGTCCCAACGCATCGCATAATTTAAAATGCAGCTGAATGGGCGCCCAAACATCCGTTTGCCGTTCTACGTGATTCCGCACCA 
CGGGTGGTGGCATTTCGATGGGTGGGAATCTTTTGGCTGGGCATATGGCCACCCGAAATCATACCACGTCACTCCGCCGACTTCAGACACATTCA 
ATCCAATCAAATTGAATCCTTTTGGGTGGATCCCTTGATGACCTCGTTTTATGTGTCCGGCCAAGAAAAGTTATAAAATCAAATATGCAACCGCA 
ATTCCTTTGCCTTCCGCTTTTTTGGATACCTCCCCATTGGGGCGACAACTTGCGATTAACGTGCACCAGAGAACAAGGACTCCAGCAGGACCTGC 
GGCGCAGGATTTCTCTCGGCGGGAATGGTCAACGTGCGAACAATTCACACAAAAAGTCTAAAGTTTGAACAAGTTTGCAGCCCAGCAGCACTGAG 
AGAAAACAGTTGGGTTTATGGTCTTTGAACTACTATAAGTTCTGAATGCATGGATGTTTTTTTTGTTTAATTAGGAAAATGAAATATTTGAAATA 
TATTTAGTAAGTTTATAGACTTATTTCTTTCTATCTACGTATCCTTTTTTTTCAGTGTCGGCACAAGTACCTCAAAGGACACCCAAATTTATGCC 
CCAGCTGGCGGCATCACGTTCCATCGCCCTTAACACACAACCCCGAAAAAAGCTGCTGTAAATTGTGGAAAATAAGAATTGCAACCGTTGAAGCT 
CATTAATTATTTAAACTATCTACAAATGTCTGACGCGCGAGCCGAGTAAAATATACATTCGTGGCAAGAGTTGCCATAAAAGTCAGACGGCTGTT 
GCAGTTGCAGTTGCAGAAACAGCCGCACTTGATTGCATCCACATGGTAATGAGCTTTAGATTTACTGGAGTCTGAGTCGCAGTCTGCAACGCCAC 
AACTTTGTTCCAAGTGTTTTGCCATCAATGGTGTCAAGGTTTCTGTGGTCACTCGAGCAGAGCTTAAGGTCTCCTCCCACTCCCACTCCAGCTCC 
CACTTCCACCTCCAACTCCCATTTTCCACTCCATGGCCAGCAGCACCACATTCAAGGTTGTCCAGAGTGTAAGCTAAATAAAACTTTCACTTTGA 
TGCATTTCTGGTTGCTGCTGCTATTTTTTTTTTGTTTTTGTTTTTTTACGCACTCTGCTTTTGGACATTAAATTGCACTTGCTGCAAAAGAGTTG 
ACACAAAGGGGGGAAAATTGGGAAGAAAAAGGCAGAAAAAAATGGGGGACTTTGCACTTGGAACATACAAATCATGGCAGCCAGAAAGCGACAAC 
ATCGCAAGTTATGGCCCGATGCTTCAGGTAACTGACTTGTGCTCCTCTTTATTTCCTCCCCTTGACTACTTCAACGTTACGTTGTCAAAAATTTT 
TGTGAAACATGTTTTCGAGGATGCCAACAAAAGAGCCACTCCTCTTTTCATATTCAAACTACGGCTGGCACTGAAAGAAATGCTACTTGCGTACC 
TTTGGACAGCAAAAGGTATTAATAAATCAATTAAAACATATTCATTATAGCATCGGACATTGCCTTTTGTGGTTTTATCATTTGGCTGTCTCAAC 
TTTCACCCATACGAAATATGGCCCATTAACATGTTCATCAAAAGTGTCGCATAAAGGTGTGAATTGACCCCAAATACCATATATGGCTTTATAAA 
ATACCAACTAAATATGTTTATGAAGAGCTCTAACTTAAAGCTCCTAAAATTTCCTTTACACTATCTCTCCATTTGTTTCAGCGAAACTTTAAGCG 
AAAACATCACCAAGTTTGTTGCTGCAACTCAAGTAACAAATTAATTTCCCCGGAAAAGTGACACAGCGCACATGAAAATCTTGTGATGGGGGGCA 
TACTAGTATACATATATCATAGCAGGTGAAAGGGAGCAGTGTAGTCATGTTACGCGTCAGGCTGATTAGAATATAGCGACGAAAGGCTGCATAAT 
TTATGCGGCCAAAAGAAGGTCTGCGGGTCAAAAGTGGAGCTCAACTTGTTGCCACAAAAACGGGAGGCGCCGCGTCGCGAAATTCTAATGAAGAT 
GTGGAGTGCGGACCAAAGTCATTGACAAACTGTGGCTGCACGTGACCCGGGTATGGCTATGACAGGTAGACATGATGGACTCCTCCGCCAGTTGG 
ATCCCTAACCAAGGGACAATGGAGATAAAAGGCGGCAGGCACGTGACAGGACTCCAGCTCTGGTCCTGGTCCTCATTATTGAATTAGTTACGAGT 
ATAAAATGCAAAGGGATGAGGGCAGTTGGGGGTATATTTGTAATTGTCCGAAATATAGTAGAGTTTCTTTTATGGCCAACCAAGTAGGTTGTAAT 
CCATTTTAATATCAATTAAGCACGTATAGTCGTATATTCGTTTTCCGTTTCCCTATGCTTGTGTGCTTAGGACAGTTTTTCATTTTGATTGGCAT 
TTCAATAAGAAATTCCATCGACAAACCCGTCAAAAGCAGAACAAAATCTTCGTACTTATCTCTCTCTCAAGGAGCATCGCATCTTCGTTTTAAAC 
CCATTTTTTCAGGTGCTCTGCCGGCAAAAGCTTAATTATAGCCAAGTGAAGCTGGCTTTCAGCCAGCTAATTTCCGCTGCAAGGCGGAGACTGTT 
TGGCCAAAAGCCGAAGCCGAAAATGGCAGAAGGAGCTGTGTTCGCAGCTTGAGGCTGAAAAGCAAGAAGTGGCTGAAAAGGATTTCTCGCACAAG 
GCGAACAGAAGATGGCGAATGAGGAGGACGCCTGTCGACATCCTTGACTGCACTTGACTGGCACAAAGTGCCTTGTTTGTTAGCCAACTTCGAGG 
GCATTTAAGCCGCAGCGCAACAGGTGGCAGGCCAGAGAGAAAAACGGCAAAGACGGGGACGCACTCTGATTGAGTTAGTTAGTTGGCCAACTCCG 
CCATTCCTGCCTCACAGAGACTTTGCCACGCCAAATTTAACAGCAATTGCGAAACTTTTAAGCCCCGAAAGAGACGAATTGCAACTTCCACATAA 
TTTGCAACCAAAATTTGTCTACCAAAACTCAAGGTAAGTGAAAGGACACAACATGTCCAATGAGATGGATGGGAAAATGAAATGTTTTCCCTTTT 
TAGCATATGACAAAACGGGTTAAGCCGGGGCAATTTGAAATACTTGGGGTCAGTCAGACGTTTATAAAATCGTTTCCCCTGCGAGAAACACAATT 
TTCTGGTCGTAAAATTAAATCAAAATTTGATAACTGAGAGAGAGAGTGGTGGAGTAATGCCCTCACACTCAACGTATATTTTATTTATATGCAAT 
CAACGGACAATATATCAGTTTAGTTAATCTAGTTGAGGTCGCTTATCGACAAGTTTTTGCCCCAGCTGATATTTATAAAACACTGATAATGAGTT 
TTACCGCTTGCCAAACGGTTACACAAAGATTTTTTGTTTGCCTGGTGGCAAACAACCCACTTTGGGGTGTTAATTAGCCGCACATGTGAAATAAT 
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TTGGTTATCCTTTAAATTTAATTGGTTTTTTGTATGGCTATCTGGGCTGGGAGCTCAAGTTTCGCTGTGGCTTTTATTTACAGGATTTATTACAA 
GAAAGTCAAAGGTTGAATATGAGAAAGAATATTTGAGATAAGAAAATAAGTAGCGCCTTATCATTTTCTTTAATTGAAACTACTGCTTAAGAGCT 
AACTATAGTTTGAAAACAAATTGTTTATGGGTAATATTTTTAAATGGTTTTTAAGAATCTTATTTGTACCGGAAAACGATTTCTTTGCTTAATGA 
AGCCAAAAATTGAGGGCCGCGTCACTATAAACGCCCATAACCCACAGAAAAATGAAACAAAAAGCGGGAAAATGGGTTTGAGTTTGAGTTTGGGT 
TGCAGGAAGAAGGGGGGCATTCAGGCATGGAAAATGCACCAGCAGCACAAGCAAAAAACAACAACAACAGCGTCAGCGACAATGCAACAATGAGA 
TCGCTTATCAGTTGGCGTCTAATTTACGGCAATGAGCCGCCGCGACGACAGACTGACTATGCAGCACCAGCAGCTTTAATCCCATACGGAAAACA 
CATGTATCTGCGAGTATGTTAATCTGCATCTCCAATGGATGCGTGTGTGTGGAGATTACGGACAGCATCGGCTTTAAAAATATTTCATCCTGAAC 
TCTCTGACATTCTGCCGAAGTTTTTGGGGCAATGTGAAATAATCACTTTAGCATGGGATTTAGCTTAGTTCATTGATAACAAAGTGTGCCTATTT 
TAGTTTTGTACACTTTAAGATCACTCACCTCGAGTCGAATTTCTTGCCATCGGCCTGCAGAGTTCCAGTGTAGTGCATGGTGAGGGAATCGCCGT 
TCTTGGACTTTTGTTCGCACACCTCCGGCGTGCTGATCACCTCCACTTTCAGATCCTGAGCCCGGACCAGGCTGTTGCTGATGGCCACCAGCAGA 
AGGCACGAGATAACGAGATTGGATTTCGACATGGCTGCAAGATAAATGGGCAAACTTGATTACCACAAAACGTAAGAGACTTTCAAAAAAAGGGG 
AACGTGTTTATGCACTCGACTACAAGGAGTATAGTAAACCTTATAAAGAGTTTAAGATTTGCTAATCCTTAAAATTAACTGAGTTCGAACTACAG 
ACGATAAGATCCACTAACTCGAAACTAAACCGAAGCACCAATGTCTTGCTCGGATTGATGTTCATCACTTTTATACACACAGATTCATCATCACT 
ATCATCTACTCGCCATCATCTTCGTTTTAATCGCGTATTAGCGTTGCACAAGCGTTAAATGATGAATTTAAGCCAATTACCCGGAGAGGTGTAAC 
AGTTGTTTGCTCAACAGTGTAAAGTTGAAATTGAAATCGTATGGATGGCGAGGGGTATTCGCAGCGATTGCGAATGGGATGCGCGAACAGCAAAA 

AT T AGT CAT T ATG C AAAT GT GC AAT TC AT T T T CG AGT GGGG AAC C GAAT T C GACC AACC AAC C AGC T TT G AC C AT C T CG AC C AGC T G GAG GGG AT 
TATGCAAGCCAAGACCCAAAGCCAAGCGAAATTATTGCGCGTACGCGGTGGCGTATGCGTAATGACCATCGTAGTCATTTCCAATGGGATATGTT 
GGTGGGTGTTTCTGGTTAAATGAAAGTTTTGAAAGCGCCTAGGTCACCCAAATATAAAAAAAAAGAAGCGAAAAAAATTCGAATAGCCGTAAAAT 
AAAAAAACTAATAAGGCAAACAAAAAATAAAATATGGTGGGTCTTTATATAGTCGGAAAACTTTCACTAATGGAAAGGCAATGGTCAAGCCTCAA 
TTACGTGAAATCGGCATTAACATCTCTTGATCTTCCTCTTCAACTGCTCTCTTTCGCTTCTGCTTCTTTTTCCTTATACAGTGAAATCTCAGAGA 
CATGAACACACATATGCCTTACTTTTTTGCTTAAGCTTGAAAATGATTAATTACACTAATAAGTTCATATGCTATGATATGCAATTGTTACAGTT 
AAGACCAAGCTGAAGCCAGAAAATCCACAGATTTAGGGACTAGAAATACTTCAAATACTATAAATAGATAAGACGAGATGGCCTCCTGTGGCTAA 
TAATCCTAAAATCACATTTTCAAATATATTTCAAAATCTTTTTTTTCAGAAGCCGCTTCACTGTACTGAGCGTCCACTTTTTGGCCTTAAAACCT 
GTTGGGCGATCCAGCTTCTTCGTTTACACACGTATCTTTTTGTTTCAGCTTTTTTTTTGTGTGTCTTTTATGAAAGGTTCAGTGAACTGGCGGTA 

C G AAG T T T ACC GT AAGC GAT CC AAT T T AC AGAC CGG C GC T AAAT GT T T CGG AAAAT CT T T T C C T AAG C G AC AC GT C AGG AGT T TTCCTCTTATTT 
CGAATCAGAAATGCTAATTGCTGACCCGAGCCGATCTTTAGCAACAAAAGCAAGAGCAAATGACCGGCGATATTTTTTTATTTTTATTTTCATGG 
ACTTGTAGCGATAAGCTCTCTTTCTCCTCCTCCATCATTCGGTTTTCTTGGGGCGTATGTGTGTGTATATATATTTATTTGTGTATGCATGTGTG 
ATGGCGTGGACTGAATATGAATGGAAGAGACACGACAGCAATGGCAAAGTACACGCTTTCTTTCGATTTCTCGATTTTCATTTCTTAATGAGCAC 

AT T T C C AAG AT GT T T AC GC T AAGC C AAGC AC AC G AT T T AC GAT C T G AT AT CT GT AAT AT AC CTTTGGTCTTC C AC AG AC GG AT ATC AC T CG AT T C 
CGGCGAAGGGGCGGGGGCGGATGCGGCGGGCGAACAACAGAAAACACGCGAGCAAACGGCGGATGTGAAGTGCAAACACGTCCGATTCGATCGAA 
GTGCGGGCGAGAAATGCTGGAACCGAAGCCGCCGACGTTGTTTTACTAACCGCTACTTCTACACTCAGAAACATTTTTGTGGCACTTTGCAAGTT 
AGACCAGATTATTTTGTAGTTTAGTTTAAAATCTTATTTATATATATTGCATTCCTCCAAGTTTTTCATAAGATTATAGTTTTTTTTGTATTTCA 
ATGTTTTTTTCATTATATTCCCTATCTTTTTTTTGGGTGCACCCCTATATAAAAAGAGTGGAAATCTTCAGAGAGACCCGGAAAGTGTGAGCGGG 
AGG AGG C GG GGG AT C GT G AT C C ACC AC CAT C C AC AC GTGT AAG AGT AC G AGAG AG AGT G AAAGAT C GT GT T T T T GT AG AGC C AAAAGAG AT T C AC 
ATACTGTGGGAACGCGATCCACAATACGGGGGGCAGAATAAGAGAGAGCACTTGTCCACGTAATCAAGCAGAAACTTGTTCAGGAGCGGATCAGG 
CGGACTTAGGTTAAGACACAATCAAAGGTGTTCCTTATCCCAAATAAATTAGTTAATATGTAAGATTAATTTTTTAAAATTTAACTTTAAATTTA 
TGATTAATCAGAACTTACAGGAACAATGTAATACCGAAATGTATTCCAAAAAGTTATTCATTTTTTATTAAGCTAATAATCGTTTTGAAAAATGT 
AACAAAAACGAAAAATATATGGAACACGAAGGACACACAACCTATTCAAGCAATTGCTCAAGTTTAACTAAAAGGAGTTTCCAAAGAATATGCCG 
TAGTCTACATGTTGCTGGCATCAAAGTTTAAGCAACATTATGCCTCGCCTACTTTTTCTAGTGCGCAGGCGCCTCTTTTATTTTATTTGTGGGTG 
GAGCCTGATAAAGAGTTAATTTAATGATTGGCGAATTAAATTCGGCAAAATTATGTGACATTTTCTTTTGTTTTCATACTCTAATTTAATTTATT 

ACAGTTTATTTTTAAACCGTGCTTGATAATTTTGTACACTGGTAGA 
{SEQ ID NO: 1084) 

Exon: 11016.. 10892 
Exon: 9154.. 8959 
Exon: 2189.. 2101 
Exon: 2034.. 1847 
Exon: 1673. .1001 

Start ATG: 9152 (Reverse strand: CAT) 
Transcript No. : CT27794 

ATCGAATCGGACGTGTTTGCACTTCACATCCGCCGTTTGCTCGCGTGTTTTCTGTTGTTCGCCCGCCGCATCCGCCCCCGCCCCTTCGCCGGAAT 
CGAGTGATATCCGTCTGTGGAAGACCAAAGCCATGTCGAAATCCAATCTCGTTATCTCGTGCCTTCTGCTGGTGGCCATCAGCAACAGCCTGGTC 
CGGGCTCAGGATCTGAAAGTGGAGGTGATCAGCACGCCGGAGGTGTGCGAACAAAAGTCCAAGAACGGCGATTCCCTCACCATGCACTACACTGG 
AACTCTGCAGGCCGATGGCAAGAAATTCGACTCGAGCTTCGACCGCGACCAGCCCTTCACATTCCAATTGGGCGCCGGCCAGGTGATTAAGGGTT 
GGGATCAGGGTCTGCTCAACATGTGCGTGGGTGAGAAGCGCAAGCTGACCATTCCTCCCCAGCTGGGCTATGGTGACCAGGGTGCCGGCAACGTG 
ATACCGCCCAAGGCCACCCTCCTCTTCGATGTGGAGCTGATCAACATCGGCAATGCCCCGCCCACCACCAATGTGTTCAAGGAGATCGACGATAA 

T GC C G AC AA GC AGC T GAG T C G C G AAG AGGT C AG C G AGT AT C T AAAG AAG C AG AT G AC C GC C GT GG AG GG AC AGG AT T CC G AG GAG C T G AAGAAC A 
TGCTGGCTGAGAACGACAAGCTGGTGGAGGAGATCTTTCAGCACGAGGATAAGGATAAGAACGGTTTCATCTCGCACGATGAGTTCTCGGGACCC 
AAACACGACGAGCTGTAGTGCTCAATAATGGGGGTTCTTGGTTTGCCGTCAACATCGAAGGGACACGAGTGCTTAAGTTATATAAAACCCAAATA 
TATACACAAAAATAGAACATATACAACAGATGAGGCACGAGCCGAGCAATAAATCGAGTGCAACCAAATGCTTCCCGATGTCATTTTGTATATAG 
CATACATATCAGCATCAGCATCAGCATCTGCATCTGCATCTAAACACCAACACATTAGAGGAGTGCATTAGTAGTTTGTGTGTCTGTAACCCCGG 
TACTAGACGTAGTTTAAATTATGTACAATCTGAGAGAGAATGAACATCATTTTTATATACATTTATAGGTTGTTATATATTTTGTGTCGGCTGCG 
GGTGGTGACGATTTTTAGACCCATCGTAAAGATAATATCGTATAATATCGTATATTGTAAATCAGACACACTCTCAAGACGAAGTCAATTGAAAC 

ACTTT AAAT TGAAT AAAG AATTGCT AT CAAT AAT TT 
(SEQ ID NO: 1085) 

Start ATG: 128 (Reverse strand: CAT) 
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MSKSNLVISCLLLVAISNSLVRAQDLKVEVISTPEVCEQKSKNGDSLTMHYTGTLQADGKKFDSSFDRDQPFTFQLGAGQVIKGWDQGLLNMCVG 
EKRKLTIPPQLGYGDQGAGNVIPPKATLLFDVELINIGNAPPTTNVFKEIDDNADKQLSREEVSEYLKKQMTAVEGQDSEELKNMLAENDKLVEE 
IFQHEDKDKNGFISHDEFSGPKHDEL* 
(SEQ ID NO: 1086) 

Name: FK506 binding protein 
Classification: chaperone 
Gene Symbol: Fkbpl3 
FlyBase ID: FBgn0010470 



Celera Sequence No. : 142000013384423 

TTGTGTACGGCGTAAGCATACCCAACACTGAGGGCAGAGCAGGAATGGCAGCCATCTACGATCCCACTCGGGAGG'TCAACGTCAGCCAGCTGGGC 

GTGGAACTGGCCAAATCCCTGCCCAACTACGCCAGGCCGCAGTTTCTCAGATTCCTGCGCAAAATCGACCTGACGGGCACCTTCAAGTTGCGCAA 

GGTGGAGCTTCAGCAGCAGGGCTTCAATCCGGAGATCATCGACGATGAACTCTTCTACGCCCAACCGGATGGCGTGTATGCTCCACTCACCCCAT 

CCGTATACGAGAGGATAGTTAGAAACGAGCTGCGATTCTAGGAGCGCCGGACTTTGTTCTTGGGAAGAAGAAAAGTCAAGACTGTGCCTTGTATG 

TTGGTGAAGAGGATATCCTTTGGATTCGGCTTGTGGTGCGCCAACTACTTTTAATTCTGGCTAAGTTTCTTATTGTCCGGACAATATAAACTTTA 

GCATTGGAAATTTACTGTTGAAAGAGCGTTTCCGCGGGTAAATTATTATTTTGTAACTATTTCCTTCTACAATTCCTACATACATAAGTGTCACA 

TCGTATTGTATACATTGTATAGATAATAAAAATAAAATTCAGCCAATGATAAATCCATATTGTGCTCTATTATGTGATTATAGCATTTTCCTTCT 

GTAAGACAGGATTTTGATTGGTTTTCATTAGATTTGTAATTTAGTTTATTGTGCTTAGTTAAATAGTTTTTAAAAATGAAACTTACAATTGAAAG 

AGCATCATTAGCCACTTTACGTTGGAGTAGTTAGAAAATCTAATATCCCGGCTTCCTTAAACAGTTCCAATCAAATCGGGATGGCTTTAGCTACG 

ATTACACTTTTCAATAGAACCTAGGTTACATCCATTGCTTGAGCAACTGATGCCTTCAGCTTAAGTGGAACCAGATTTTCAGGTGAACGGATTTG 

GAAACGGGGTCGTATAAAATACTTTCAGCAGTCTTTGGATCTATATCCGCTCAATCCTTGGGCGGCACATCCTTGCTCAACTCTGCACGCTTCCT 

CTGCTCCATGCGCTCCTCTACCTGGTCCAAGCGCTCGCTCATGGCATCCAGGGAGCTCAGCAGTCCCGCCTGGATCTCGCCATATCGCTTCATGG 

CCAGGACCAGACTCTCCTCGATAAGTTCGCTCTGCTGCGCGTACTCCCCGTTGTTCTGCGTTATGCCCTTTAGCTGCTCCAGACACTCGGCGGGA 

ATGCTGCGGAAGTACTCCGCGCCCAGGGCCGGTTCCAGCTGCTTAATCCTCTCCAGCTGCTCGGCCTGCGTGTGCAGCTCGGGGGCCACGGCTTG 

CAGGTACACCTCCTTGGAGCGCACCTGCTGGTGCTCCTCCAGGAAGTTGGGGTCCAGGTACTTCTCCAGCTCGTCAGATCTCTTCACGCACTGCT 

GCAGTGCAGCACTTCCTGTGGTTGCTTCGCCCAGGATGGCGTGTGCCGAGCACAGGGCGTCGACCACGCCCTCGCCCACCTCAGAGTCCTGGACC 

GGACCCAGGACTCGTGTCAGGGCATCGATGCGCTTCTCCAGAATGTCCAGTGCCTCCATTTTTCGCTGAATTTCTCTTTATTTTGTTCGTTTTGA 

TGGGAATGATGTCAGTCTGCAAAACAATCGATGTCCAACGAAAAGGTGGCAACGTTAGCTTGGTGGCAAGGTTCTTGTACTCGCTAAGCAGAACG 

AATTAACCCTTATGCGGACTAAAAAATATGGTATTTAAATAAAATATTTAGCTCCTCTGCATAAAGTTTTTGGTGCATACGTATCTATAAATATA 

GTACACACTTAAATTCCATTGCTATTTACTTAGTTTCTTTTAAACGTAAACTGTACAATACAAAAATCTCAGTTACAACAGCTTAAAATTCTCGA 

GGGACAACCGCTCCGCATCCTTGTTGATCTGCGCCAGACTATCCTCGGAGATGATGCGCACCGTATCGACGTCCGGGTCATCGCCTCCGGAATCA 

TCCAGCACACACAGCAGACCATCGTTCTCGATGTACGGGAAGAAGAACTCGGGCTTGTCCCAGCTCTCCTTATCGCCGATTCCGTGGTGCGACTC 

TTGTGCCATGTGCTCCACGAGCAGCTCCTCCTCGTCGAACTGCAGATCGCAGGTCACGCAGCGCAGCAGGCACATTTGGCGGCGCAGGTAGTTGA 

CCACCTTGACGCGTTGGTAGAAGTTCAGAGAGCTGGTGGCCTTCTCGAAATCCAAGCGATGAACCTCGTGCATGTGCTTCTTGAGGGCGGTGAAG 

TTGTCGCCGAGATGGTGGCAAAACAGGCACTTAATGGAGCTAGGCTCACCATCTGCCGCCCAATCCGACCAGTCCGAATCGTGTTCGCCATCCGA 

ATCAGGGCGAGCGAAGTGCTTGTCGAAGTCCACGCTGCCGGTTTCAGGATCAGCAACTGTGGACACGGATGCCGTTTCCCGTCTTCGCTTTTGGA 

GGTGCTGTTTCCGGGGCGTGGGGGCCGTTGGTACACGATTGTAGTTGATAAGGAAGTACTTGTCGTACTCCCTTCGGTTCGGATTGATCCGCTTG 

TGGCCCTTCTTGCGCATGTGTTCCTTGAGCGTGGGCCGATCCCTGAAGATCTTCTCGCAGTAGAGGCACATTAAC 

(SEQ ID NO: 1087) 

Exon: 1640.. 1001 

Start ATG: 1579 (Reverse strand: CAT) 
Transcript No. : CT27878 

GTTTTGCAGACTGACATCATTCCCATCAAAACGAACAAAATAAAGAGAAATTCAGCGAAAAATGGAGGCACTGGACATTCTGGAGAAGCGCATCG 
ATGCCCTGACACGAGTCCTGGGTCCGGTCCAGGACTCTGAGGTGGGCGAGGGCGTGGTCGACGCCCTGTGCTCGGCACACGCCATCCTGGGCGAA 
GCAACCACAGGAAGTGCTGCACTGCAGCAGTGCGTGAAGAGATCTGACGAGCTGGAGAAGTACCTGGACCCCAACTTCCTGGAGGAGCACCAGCA 
GGTGCGCTCCAAGGAGGTGTACCTGCAAGCCGTGGCCCCCGAGCTGCACACGCAGGCCGAGCAGCTGGAGAGGATTAAGCAGCTGGAACCGGCCC 
TGGGCGCGGAGTACTTCCGCAGCATTCCCGCCGAGTGTCTGGAGCAGCTAAAGGGCATAACGCAGAACAACGGGGAGTACGCGCAGCAGAGCGAA 
CTTATCGAGGAGAGTCTGGTCCTGGCCATGAAGCGATATGGCGAGATCCAGGCGGGACTGCTGAGCTCCCTGGATGCCATGAGCGAGCGCTTGGA 
C C AGG T AG AG GAG C GC AT G GAG C AG AGG AAGC GT GC AG AGT T G AGC AAGG ATGT GC C GC C C AAGG ATTG A 
(SEQ ID NO: 1088) 

Start ATG: 62 {Reverse strand: CAT) 

MEALDILEKRIDALTRVLGPVQDSEVGEGVVDALCSAHAILGEATTGSAALQQCVKRSDELEKYLDPNFLEEHQQVRSKEVYLQAVAPELHTQAE 
QLERIKQLEPALGAEYFRSIPAECLEQLKGITQNNGEYAQQSELIEESLVLAMKRYGEIQAGLLSSLDAMSERLDQVEERMEQRKRAELSKDVPP 
KD* 

(SEQ ID NO: 1089) 



Celera Sequence No. : 142000013383801 

AATGAATCTAAAATGAAAGGCCAAAACCATATAGAAGCTAATTTAGAGCGCAACTATTTCACAAATCCTCACCTGGTTATGTACTCGAGGGTCAC 
GGGATCCTCGACATTGAGACGATGGCTCTGGCACTCCACCTGGGCACGATTGATCATGATGCGAGCGTCGGCCGTGAGACCGGCAAAAGCCATTA 
CAACGTGGTTGTCGAGCATGCAAATCTTGCGCACCTTTCGATCCTCCTGCAGTTGGGCCACCGACTTCTTCTCCACGCCAAGCACAACGCAGTTG 
GCGCCGCGAACTCCGACCTGCGAGGCGAAAAATTGGAAAAGTCCATGTTTGTTTCGGTTTGGCGGCACTGCGAACGGTACTAAATTGCACTTACC 
GCAGTCGATCCCTTGCGAACGGCCTCCTGGGCATACTCCACCTGCAGGAGGTGACCGTCGGGAGAGAATATGGTTACAGCACGATCGTAGCGTGA 
GGACATCTTGTTGCTTTATTCAATAGCTGAGGTACTTGCAGAATATTAACTTATAGCTTTTTCCTTCTCGAATTCGCCAGGCAATGATGATGCTT 
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TTTTGAGGAAATGCCGCAGTGTGACCGCAGCATGGGTTCGATGAAAATTACTAAATTAATACCGTTTCGCAGTGAACGATATATCGTTGCTGAAA 
TCATGTATCAGAATCAATTCAATTCTTGTTACTTCAGTCTATTTAATATTCTATTCCATTTAGTCTATTTAAATATTTAATTTAATACTTTTGTA 
AACCGTTTTACAGCTTCAGGGCTGTAAATAATTTAGTAAAATATCAATTAATTTTTTTTCACATTATCTGTTATTATTGTTATTTAATAATACAA 
ATTTATAATACAAATTATTTTCAATATATATATATATATATGTATTAATAATTATAACCAACGACTGTTACTTAAATGTACAAAATGCAATCGCT 
ATGCCGGCCCTGGTAATCGGTGTTCATCGACAGCTTGCTGCAGCCCTCGTCGTCGCTGTAATTACAATGTGATGTGCATTTGTTTTTGTTAATAA 

G AAT T GT C CG AT T A AAC AT T T AGAT TATCTGTTTT T AAAC AAT GC GAT C ACG GT GAT AT C GG T T CGT AAAG GT T T T AC AC AC C GT G T GAT T T T GT 
TTCAGTGCGTTTTGTTGTTCGTTTCGTAAATAAGTGGGCTTCATTTTGCCCAAAAAAGACGGGAGCCCAGACAGCTGATCACAGACTTTGGCTCA 
CACCTGCTACAAAAAGATAATATTTATTCGAAAACCTGGCATTCGGATCGAATTTAGCAGCTGCCAGCGTATTTATATAAGAACAATTTGATTGA 
TTATGGCGCTGCCCAGAAAACTGATATCCAAGATCTGTAAGTAAACCGAATGTTCGAAAGTACCGTTGTATATTCCCACACACACACAAGCACGT 
AGAAATACGACTGCATGTATATATGTGTGTGTGCGTGCGTGTTTGAAATCCTATCGTAAATTCTAGTGTTGCATGCAACTTCAGTAACTTTTAAA 
ACAATACCACGCGTTCTTATAAACAATGCAGATATCCAATAGGAAAATCCCCAGGTGCACATATGTATGTATGTAGATACATATGAACATTGAGT 
GCAGCTGCAGCTGAATTGTTCGTTTTAAAGTAATTAATATCTTGTTTTACAACGCAATATCAATATCAAATAAACATCATACGGCTTCTTCAGAC 
TTTAAAATAATATTTGTATATAGCCTTTAGAAAACTTCAATTCAATCCCAAGAACTATAGAAGTTCAAAAAGGTTTTAATCTGGGAACAAGTTTT 
CAATTTCTTTGAAATGCGTACATTTACGTACATATGTATGTACATATATACTGTACATATGTATGTATGTACATCGATTTCCTAGCTTTATTTTA 
TCTGTTTTGCTTTTCAAAATACTAGTAATCCTTGTCATTCATAAATGTGGATAACGTAAAAGGTACTTATTTATTTATATCGACACAGTTTCCTT 
TGAAAAGATTCAAAAATCATTATAAAATTGTATTTATTGAGTATATACTAATACGAAAATATCATATTTTTATTATTATATATTTGTTGTAAGTT 
TTCCTTTTGATCTTGAAAAGCTGTTGTTTAAAGCATAGCTTAATCTGAACATACTAACTATATCATCATATCATGAAAACTTGCCGAAGAGGACT 
AACCAATTCTCATCCTTTGCAGACGATATAAAGGCACATGATAGCAGGGTCACGAGCCTTGATCTTGGCGAAACTGGACGAGTGCTGGTCACCGG 
AGGAGAGGATCGCAATGTGAACCTCTGGGCGATTGGTCAAAATGAGTGCTTTATGGTGAATATACTAACTACTTCCTAAATATATGTATGTACTA 
ATCGGTGACCCTTTCCTTTCAGTCTCTGACGGGCCACAATCGTTCCATCGATTGCGTGCGTTTTGCTTATAAGGATAACTTTGTCTACTCCGCCG 

AC G AC ATC GGT AT AAT C CG AAGAT GGG AT C T GAATT C AC AGAAAATC T AC T C C AC G C T AAAT GGC C AC AT G AAG AGT GT TCGCACTCT GG ACT T T 
AACCCGTCTGGAGAATATGTGGTGTCCGGGAGCAATGATACCACCGTCAGGTGAGCCAAGGACCGATATATAAATAGTCAGAGTAACCCATGAGT 
CATTCGGAAGCTCTAGTTTCCTTATCAAGTGGCAGATTTAAAGTTTATTCGTTGATTGAAAAGGGATGGCGATATTCTCCCTCCCTCTTTTAGCT 

GT GC T T AAT CC TAT AG AAT C GC GT AT TTTGGGTT T AAT AAT CT AC AC AAAAGAAAAAT CTC AT AT T TAG AAAT TAG AAAT T T T AT AT CGGT T T AA 
TGCACATACATATTAGATGCATTGTATCTATATGGTCTTTTCTTAGATATCACTTGACCTATATATGTGTAAAACAAAGATAATATGACTCACTA 

C AT T G C AC AAAAGG GG AC AG GCT GAC AAAGAT AAT AT C AC T T T GT GT T GT T T AGC T AAAGT T GAT AT T T CT T T T GGG AT C T CAT AT ATC T T AT CT 

CTCAGATCTATTACATCCATGATCTTAAACAACATTGCATTTTCAGTTATGTGATGTTTTATTGGGAGTTTTATAAAAGTGTTACAAAAAAATAC 

TTCTACCACCACGGGCGTTTAAACAATTGCATATGTATAGGCATATTTGTTATAGTGTATGTACAAAAAAAAACGGACGCGAAACGAATTATAAA 

GTCAATGATAATCCATTGCGACTAAGGCAGACTATGTGTCTCTAGGTGATCCGCGCTTACAGCTTGGCCTGTGAGCTCTTCAGGTCGACATAGGC 

GCCGGATATGACCTCCTTGCCGGTCTCCACGACCACGGTGCGGAAGTTGATGCGTGTGCCCTGCTTCCACAAGTCCACGCGCAAAGTCTGGCCCG 

GAATTACGGGTCCGGAGAAGCGCACTTTGACAGCCTTGAACAGAGCCGGGTTGTTGTCCGCAAACTGAGCGAGCACGGCGCGCACCGAGAAGCCC 

AGTGTGCAGAGTCCATGGAGGATGGGTGTCTTGAAGCCGGCCAGCAGGGCCATCTGGGGATCGATATGCAGGGGGTTCTTGTCGCCGGACAGACG 

GTACAGCGCCGCCTGGTCCTCGCTGGTCGTATACTGGACGGTTGCGTCCGGTTGGCGATTGGGCGCCGGCTGCAGCGGAACTACACCAGCGATGG 

GATCCTTCTTGCCTCCGAATTTGCCAGCGCCGACAATGAAAGTGGTGCTCTGGTTGCGCACCAACAGGCGGCCACTTTCGTCGAACGACTCGCTA 

TTGGTGACGACCACGGCGCCGGATCCCTTGTCCATAACATCGAAGACCTTGCCGTTGGTCAGCAAAGTGCCACTGGTGGGCAGATCGTCAACGAT 

CTCTAGGTACTGTTCGCCGTGCAGAATGTTGCTGAAGTCCACTTGGCTGTTGGGCAAAGCCTTGCTGAGCAGCTTGTCGGTGGACATTTGCAGCA 

GAAGGCCGGGCAGAACGAAGAAAGTTGGAATGGCGGCGAAATCGGCATCGTTTTCGTACAGGAAGCGCATGTCCTTCGCGTTTTTGACGGATGCC 

CCAATGCCCAGGGCGTAGGTAATAAGCTCTTTGCTATTGAACTCGAAGGCATCCTCGATGGCGTCGCCGCCGCCTTCCTTGAGCTTCTCAAGCAC 

TTCCAGCAGGGTGCCGGAGGCCTCTGCGATGGCACCCAAGTGCTTGGCCTTGGACATGTCGGTCACATTCGACCACACATCCTTGACGTACTCAA 

TGGTCACGGGGTCGTCCAGCGAAGGACGCAGCACAGCGCCCTTGCCGCGAACCATGTGCAGCTTGGTGGCCCAGCCGGCGGCACTCTCGATATAG 

CTACCTGTGGATCGGAGAAGAGGGAGAGAGAAAACGTTTAATAAACCCTATATATACACACATGTACACATATTCCTGATAAACGGTCTAAGTTC 

AAACGCTATCAGTATGAATTTCCCTAATCAGGCAGATTATGTTGACCATTTGGATTGGACTACTGGACATATTAAAGATGACTTAGCAGTGGTTG 

GCCAAAAATTGCATAATTTTGTAGTTCTCAAAGGGTTAAATTGATACAAATTGATATGATAATCAATTCTGAGATAATGCAATTCTCGCATAACT 

TCGGATCCCACTTACCATTATCCTCGCAGGACTCGTGGCACAAGTAGGCCACTACGGGAGCAATCAGCTTGGGCTTCAGCTCGTTGAAGAGAATG 

TCGGGCAGAATGCCCTCGGTCATTCGACTGGCGGCAGTTGGCACGATGACGTTGCAAAGAACATTGTTGCGGGCTCCCTCGATGGCAACCGTGTT 

GGCCAATCCGATGAGGCCCATCTTGGCGGCAGTGTAGTTGACCTGCCCGAAATTACCATAGATGCCCGAGTTGGACGATGTCATGATGATGCGGC 

CATAGTTCTGCTTCTTCATGTACGGGAATGCCGCCTGGGTGCACTTAAAGCTGCCCTTCAGATGGACATCGTTGACCAGATTCCAGTCCTGTTCG 

CTGGTCTTCACCAGACTCCTGTCGCGTAGAATGCCGGCATTGTTGACCAGGATATCGACGCGTCCGAAAGCTTTGATGGCCGTCTCGATGACCTT 

TGCCCCATCGATTACCGAGTTGTAGTCGGCCACCGCCTCGCCGCCGGCCTTACGGATCTCATCCACCACAATGTCGGCGGCCCGCTGCGATGCTC 

CATCGCCGGAGTGGGTGCCACCCAGATCGTTGACCACCACCTTGGCGCCACGCTCCGCGAAGAGCAGTGCGTACTCGCGGCCCAAACCGGCACCA 

GCTCCCGTCACCACCGCCACACGGCCATCGTAACGAAGTTTTCCATCGGATGAGGACATGGTCTGTCTATTGGTCAATAGCAATCGCAATTTCTG 

GTTTCGCTGCACAAATCCGGTCTCTGCGGCAACGCTTTTTCGGCTGTAAAACACGAATGAACCGATGAAAATAAACTGGTTGGCTAATAATCGAA 

CAAGAAAGGAGAGTAGCACTACTTTTCAATCGTACTTAATCTCAAGCGATAATGCTATCTGCCCTGCCCTCGCATATGTGAGTGTAACTCTGGAC 

TCTTCGGCTGAAGAACGTTACCCAGGCGATATGGTCCAAGTCCAGGGGTTCACGGTTCATAGTCAAGTGCTTTTCGGATGGCACCGAAATCGCCG 

AAATCCGAGATACAAGATACATTTGCACAGTGACGACAGCACAGTTATTCGAACCGCGTGAAATGGCCCACACACACACACATTAAACAAGGGTT 

AACTTACAGTTCTCGTAGATTCGCCGTATGATGGTATGGTATGCCCGAATATGCACAGATTAACCCAGTATCAGTGGTACAAAGTGCATCACTAT 

TCGCTGCCGAGATAATTTTGCAATCGTTTAATCGGTTCAACCCATGAAAATGCACTAAAATATACTGCCCGCATTCATACGTTTGTGAGCACTAG 

AAAGTAGTGCCGTCAAAATGCTGGAAAGCAGTGCATCGCGGACGTAAACCTCGGTTCGGAACTTCGCTAAGCCGTAGCAAAGACTCCAACCCGTT 

T T CAT T T T C C AG ACG AT T AAAT T T AAAAAT G AAC AAT T AT GGAAAT T AC AC T T AT CT T C C AC AT T T C AAAT T C AAC T AT GT C T C T T T AT T T AAC A 
TAGAAATGTATAAAACAACAATAAATTGAGCAACAATAAATGAATTAACTAGGAGATTACATATATTATTGAAACTTTATGAATTACATATGCTT 
TGTTGCTTATTTAGCAAACTATTAACTTTCATATGTCTGCAATCTCAGGCTTTGGGACGTGCAGAACGAAAACAACTGCATTAAGGTATGTCGCG 
GCCACATGTCCCACGTGAATTCCGTAAAATTCTCGCCGGATGGTCTTTGGATTGCCTCTGCTGGCCTGGAGGGTTCCATACTCATCTGGGACATA 
CGGAAGAGCAAACAGATAATGGAGTTCATAGCCGATCCGCCGGTTACGGCTATCACATGCGTGCAATTCCATCCGTTTGAGTTCCTGCTGGCCGC 
CGGTCGCGTCGATGGCACCGTTTCCATTTACGATCTGGAACACCAGCAGCTCGTCTCGCAGACCACGCACTTCTATGGCCAGGCCATCAGGTGTA 
TAACCTTTAGGTGAGTCCTGCAAAGAATAATCCGCAATCGTGAATTGATTTATGAAAAGAAATAGTAATATAGTAATTTTGCCAAAAAGTGCATA 
AATTCTAATTATCTGCATGTTAAACTTTATTTATTTCAGTGACAATGGCGAGTGCCTTTTTGTTGGCAGTTCTTCGGGCATTTCGGTAATTGGAT 
GGGAACCGGATCGTGAGCTGGACCATATAAAAAGCACCTGGTCCTCCTTGGCGGACATGAAAGTGGTCAACAATAAGTTGGTAAGTGAAAGCACT 
TACTCATTAAGCGAAGCACCCTTCCAAATCTCGTCATCCCCGTTATCTTTTTTGACAAAAATAATCAGTATTTTGGTCAGAATATTCTAAAAAAT 
GGTCCAAATATGGAGTGTCATACCTCGTTGAGTTTGTTTTTTTAATCCAAATCGATTTGCATTCAAATTTGGGAATTCTAGGTTTTTTAAATTTT 
TTGCAAATTTTGATGATAAAATTTTGATTATAAAAAAGGCGAAAAATTGGCCAAAAACTAATTTTACAAAATCCGTCCAAAAGTGAAAGGGGTGG 
TTAGTATTGGTAATTAGGAGCACAAAACGGTATTTCTTTTGGCGGTGTGGCCATTTTTAGTCAAGTTATACCCAGAAAACCAATCTTAAATATTG 
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CATTTTATGCCAAAACTGGTTTCACTGATTTTATGCGAAAAAAATAATCGGTATTTTGAGCAAAACATTAAAAATAATGGCACAAATATGGAATG 
TCATAACTTGTTGAGTTTCCGAATCAATTCGCATTCAAATTTAGGAAATCTAGGTTTTTCACTTTTTGAAAAACCAATCTTAGATCTTAAATGTA 
AAATGTCTACAAATGGAATATCCAATAAAAACCATTGCACTTCCTTTTGTAGATCTGTGGTTGTCATGAGATCGACACCGTTTCAATCAACACCA 
TCAGTCTGGATCGAGTGATACCATTCTATCAGCCGCCAAACTCACTACCCAACTTTAAGCACAACTCGACGAACCGAAAGAGCTTTACACGTGGC 
AACCAGAAGTTCAGGCTGTCAGTTGGCGGTGCCAAGCCGGCACAGGTGCAGGAGGAGCACGAGGGCGACAAGAGTGGATTGTCCTCGCCCAACTA 
TAGTCTGGAGGTTGTGAACGAGGCGGTGCTGGAGTCGGCGGAGACTTCTCCAATGTCCTCGTTGCCACCTGTCCACTTGGCTTCGATGGCCGGAT 
CGTCGGCGGACTTGGCGCCGCATCATGTCGTATATGGCGGAGCAATCGATTCTGCCCTAAGGCAGGGTAAGTACTCCAGAGTGCCTGATAATATG 
GCCAGCTATGGCAGTAGTTACGGGGCCATCGATACCCGCCTGATGTCCTTGAACGAACCCAGTTCACTGCACAAGCCGGACATTTACAATCTATA 

{SEQ ID NO: 1090) 

Exon: 1001.. 1366 

Exon: 2208.. 2335 

Exon: 2398. .2615 

Exon: 6319.. 6660 

Exon: 6785.. 6931 

Exon: 7004. .7075 

Start ATG: 1333 

Transcript No. : CT27906 

C GT C GC T GT AATT AC AAT GT GAT GT GC AT TTGTTTTTGT T AAT AAGAAT T GT C CG AT T AAAC AT T T AGAT T AT CT G T T T T T AAAC AAT GC G ATC A 
CGGTGATATCGGTTCGT7AAAGGTTTTACACACCGTGTGATTTTGTTTCAGTGCGTTTTGTTGTTCGTTTCGTAAATAAGTGGGCTTCATTTTGCC 
CAAAAAAGACGGGAGCCCAGACAGCTGATCACAGACTTTGGCTCACACCTGCTACAAAAAGATAATATTTATTCGAAAACCTGGCATTCGGATCG 
AATTTAGCAGCTGCCAGCGTATTTATATAAGAACAATTTGATTGATTATGGCGCTGCCCAGAAAACTGATATCCAAGATCTACGATATAAAGGCA 
CAT G AT AGC AGG GT CAC G AGC CT T GAT C TT GGC G AAAC T GG ACG AGT GC T GG T C AC CG GAGG AG AGG AT C GC AAT GT G AAC CT C T GGGC G AT TGG 
TCAAAATGAGTGCTTTATGTCTCTGACGGGCCACAATCGTTCCATCGATTGCGTGCGTTTTGCTTATAAGGATAACTTTGTCTACTCCGCCGACG 
ACATCGGTATAATCCGAAGATGGGATCTGAATTCACAGAAAATCTACTCCACGCTAAATGGCCACATGAAGAGTGTTCGCACTCTGGACTTTAAC 
CCGTCTGGAGAATATGTGGTGTCCGGGAGCAATGATACCACCGTCAGGCTTTGGGACGTGCAGAACGAAAACAACTGCATTAAGGTATGTCGCGG 
CCACATGTCCCACGTGAATTCCGTAAAATTCTCGCCGGATGGTCTTTGGATTGCCTCTGCTGGCCTGGAGGGTTCCATACTCATCTGGGACATAC 
GGAAGAGCAAACAGATAATGGAGTTCATAGCCGATCCGCCGGTTACGGCTATCACATGCGTGCAATTCCATCCGTTTGAGTTCCTGCTGGCCGCC 
GGTCGCGTCGATGGCACCGTTTCCATTTACGATCTGGAACACCAGCAGCTCGTCTCGCAGACCACGCACTTCTATGGCCAGGCCATCAGGTGTAT 

AAC CT T T AGT G AC AAT GGC G AGT GCCTTTTTGTT GGC AGT T C T T C GGGC AT T T C G GT AAT T GG AT GGG AAC C GG AT C GT G AGC T GG AC C AT AT AA 
AAAGCACCTGGTCCTCCTTGGCGGACATGAAAGTGGTCAACAATAAGTTGGTAAGTGAAAGTATTTTGGTCAGAATATTCTAAAAAATGGTCCAA 

ATATGGAGTGTCATACCTCGTTGAGTTTGTTTTTTTAA 
{SEQ ID NO: 1091) 

Start ATG: 333 

I^LPRKLISKIYDIKAHDSRVTSLDLGETGRVLVTGGEDRNVNLWAIGQNECFMSLTGHNRSIDCVRFAYKDNFVYSADDIGIIRRWDLNSQKIY 
STLNGHMKSVRTLDFNPSGEYVVSGSNDTTVRLWDVQNENNCIKVCRGHMSHVNSVKFSPDGLWIASAGLEGSILIWDIRKSKQIMEFIADPPVT 
AITCVQFHPFEFLLAAGRVDGTVSI YDLEHQQLVSQTTHFYGQAIRCITFSDNGECLFVGSSSGISVIGWEPDRELDHIKSTWSSLADMKVVNNK 

LVSESILVRIF* 
{SEQ ID NO: 1092) 

Classification : cytoskeletal_structural_protein 



Celera Sequence No. : 142000013383801 

ACTGGGAGTTAAAGGTCAATTAGGTGGTGAAAAGTACTCAGAGTAGTTGAGACCGTGCAGCTGCATTCATTACGTTTTCAGAAATGAATCACCAA 
CTAGTTTTCACCTTGAACGCCATTTTCAAATTTCGCGGCAGAAAAACCACAACAAAAGGCGCGAGAAAAAGCGGAAAACAAATTTGAAAACATTG 
TTCCGATAATTGCTGATTATTTGTGAGCAATCATTTCAAGTAATCGAAAAATAATTTTGAAACAGGCTGTCTTGTTAGGTCAATTATTTTTTATC 
GTGCCATTTAAACAAAAATTATTTATATTTATTTATTTATTTTCCATCGTTTCTCCAAATTACATTTAAAAGTTTGTAAGACTTAATACTGTAAA 

AC ACT AT T AT T AGT TAT T AAC TAT AGT T T AAAAT GGC AAAAT AG AAAAT GC G AC AC GC AC AAT T G AAC T AT CG AAAC G T T AAT AT T T AAAAC AAG 
TAATTTAAAATAATGAATACATGTATAAAAATAATTAATACATTTGTTTTTAAAAAGCTATAGAAATAGCTGCATATATGCTTTAACAATTTATG 
CTTAAAAAGTCTTTATAATAATCCTGTGATTTCGCCGTGATTTTTGGGTTTACAATGTGCTGTTTTAGAATAATGGTTATTTCAAAGTGACCACT 
ATTTTTCAATTATCTTAATTTTTTAATAGTCTTGTGTTTATTTCAACTTTAAGTTTTATATTTTAATGGCTCTTTAAAAGCAAAAAAATATTTGA 
TTTTTAGGAGTGATCGCTGATCACGATGCTGTAATTTTTTGTAAGATATTTACTTAACGAAGGCGAATGCGTTGTTTCGATTGGCGGTTTCAAAT 
GAGCGCAAGTAAATGTATTTATTTATTATAAAAATTGTAAAAAGGGCGCATGAACGCGACCATTTATCCGTTTTGATGGGGGAATGGGCAGCACA 
AAGTGCGTTTATGGGCGCCCTAAAGTGATGTAACAATTGCTGCGACCGCAGCAGCAGCGAAACGCGCGGAATGGAACGCACGGAACGATATCGTA 
GTATCGTACATGCGGTCGCAAGCCATACACGATGGTCTCTTTTTGTGCAACTCTGATTCGACAGTTTCGGCACAGGTCGGCAGCGGCATTTCAGA 
TTGCGGAGTCCGCCATTTTAGCTAACAAATCTATAACCCTAGCTTCGTCAGCGTGCAGTGTGAATAGGGGTCAGTTGCAGTTTGGAATTCAGATT 

GT T C GC G AAT AT T C G AAAG C GT C AAAG AT G AGT ACTT T GC C T AGGGT GT T T T T T GAT AT G AC C GC C GAC AAC GAGC C CC T C GGT C GG ATC GT CAT 
GGAGGTGAGTTATTTGCCCTTTTTTTTTAATTTTAAACTTAAGTTGAGGTTTTTTTTTTTGTGACATTTGACCAGAAAGTGAAAAACACGTGATT 
TCCCATAAAGCCATTTCTTACGCCTAAAGCTAAGACTGTGTTTTACATTATTTGCCTCAGGTGATAAGTACATACCGTTTTTTTCTTTTTATTTT 
TACGGATTTAAAACTTGTACCAAATATACACACAGCGCAAACTCGCCCTCTAATTTATAAGCGACAAGTGCAATTACTTTCCATGACTAATTGCA 
TGACAATGCGCTATGGGTAAATTTTCAATTTTCCAACCCAATTGGACAGGCGCGTTTAATCGGAAAATCTCTATAATTATCTCTACATCGCTGAA 
TTTTGAGCAAACAGGCAAGCAGGAGCACCAACCCCCGCAAATCGGCCATTTGGTGGTATTATCTTTATAGAATGGAATTTTCCTTATTTTAATTG 
ACGTGTCGGCAAACTTTGGCCGAAATGGTGTGCAATTTTTACCGGTATTTTATATGCATCTGTGCATGTGTGCTATATATGTACATTCCGGCGTG 
GTCTGCGTATGTGTGGCTCATACCCATACACACATGCAGATCAATACACGAACATAACCTCTAAGCGGTCACTCGCGTATGTCTGTATGTTCTTG 
TCGTGCTTGCATATGTACATGTGTGCATGCGGGTAATGTACGCGCAGATGTACAGTGTACACACAGAGATATTCGGGTGTGAATGCGCGAGTTCT 
GTAAACCCGTTTGCAAACATGAACTCACTATTAATTCCGCCGATGATGTAATGTACACCTAATTTTTGAATAGAGAAAAAGAATAGGAAACGCGA 
ATGCTTGATGTATATTTTTAGTGGCCTGGGAATAAAGTCATTGCCCGCCAAGTAAACCAAGCCACATGTCTAGGATTTTCCTTCCAATAGGACAT 
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ACATATGTACATACGCACACTTGAACATACATGCAGTGCGCGAAAAATTCTGCACTTCACGCCGAATTACTACACTAAACAATTGAAGATTTATA 
GAGAAGCGGCATTAGGAAAAGCATTTACTTGACTAATTTATACAATTGTCTCGATTACTGGGTGAATTGGAAAATTGCTCAAATTCTAGTGCATT 
TGCAAAGTTCTGTGCATTATATGTATTTTTTCCCTTGGTATAGATATGCACATACATATGAACATGCATATGTATATTATCAAAGGGCTAAAAAT 
GGCGATGAAAAAAAAGCGGCGTCAAAAAACAGCTGATCATGTTTACTGCCGCTCAACATGAATTATACAGAAAATTGTCCAGTCCTCAAAAAAAA 
AAAGAAATAAATAAGCAAAAAGTAGTGGCAAGGTCATCCGAGTTCGGATAAACAAACCTTGAAGAACTGAAAAAGAGTCATACTTTGTTAGCATT 
AATTTCTTGTTTACTAACGAAATTTCTACCCTTTTGTTTGTATTTTGCAGCTGCGCTCCGATGTCGTGCCCAAGACCGCCGAGAACTTCCGCGCC 
TTGTGCACCGGCGAGAAGGGATTCGGGTACAAGGGCTCCATTTTCCACCGCGTCATCCCCAACTTCATGTGCCAGGGCGGCGACTTCACCAACCA 
CAACGGCACTGGCGGCAAGTCCATCTACGGCAACAAGTTCCCCGATGAGAACTTCGAGCTGAAGCACACCGGCTCCGGCATCCTGTCGATGGCCA 
ACGCTGGCGCCAACACGAACGGCTCCCAGTTCTTTATTTGCACTGTGAAGACCGCCTGGTTGGACAACAAGCACGTCGTCTTCGGCGAGGTCGTC 
GAGGGACTGGATGTGGTCAAGAAGATCGAATCTTATGGATCGCAGTCTGGCAAGACCTCCAAGAAGATCATTGTGGCTAACTCTGGTTCTCTTTA 
AGAAGGTGCGATGCAAAAGCAACAGCAACATTGCAAGAAGAAAAATCTGTTAAGACGAAAACCATTTGCATTTTCTCCAAAACATTTTTTTTTAC 
CAATTACTGTACTGTAATATTACGAGAAAAATACATAACAAACCAACATATTTTCAATAGACATGTTGGGTTTCCTTTGGCCAATCAATTGTATG 
TGGGGGCCAAAAGTGAATAGAATTAGTGCCCACCAATGTCAGAAAGCGTTCAATTGGCTATATAATATGCATTAAATCTGTAACTTTAACATTTC 
CATTCACAAAAAAAAAAACAAAGAAGAAAAACAATGAGACGACAAGAGTCGAGAGGGGCTTAGAAAAATGTTTGATCTTTTGATGTTGGCTTACT 
TACTACTAATAATAGTATTAGTAATGCTGATTATCTAATTGCATATATATATATATATACCAACGTTCTACGCTATGTACCGGAATCGGAATTGT 
ATTTTCGTCTTAATAAATCTGAATAAACGGCAACCAACTACTCAATTGTTTGTTGTCACTGCTGGGATCTTTGGTTTTGGTAAATGGACATATTT 
TATTGGAAGAATATTTCGGAGCTACTCAAGGTGGCGTACAATAGATTGTATTCATGCTCGTAGGCGTTCACATGTCGTTCGGTTATCGCCACTGT 
CTGGTGAAGCTTCTCGTCCAGCTGCTCCGGCGCCGTCCGCTGCTGGCCCGCCAAAATGTCGCGCACCTTGGAGCGCTCCGCGTTCAGGTGCTGCG 
TGATCGTCTGGCTCCAGCCCAGCTCATGGAACTCCGTCTGCTGGTTCAGCATGTTGAGGATGTAGACCAGGCCGATGGCGAAGCCATCCTCGAAC 
AGCGAGATCTTGGTCTGGTCCTTTTTGTAGATCTTTAGCTTCTGCTTGACCCGGTAGTCAATGTAGTTCATGATGAGCGCCGGTGTGATGAGGTA 
GAAGGTTCGCAGGTGGTGGTTATGTGGATTGCACAAAAAGGGCTGAAAGCCTTGCAGAAGAAGCTGCAATATAAATCCATGGGTTAAGCTGGTTA 
CCTAACTTGTTAAGATCGTAGACTGCACCTTGAAGTAATTTGTGCTATCGGAATAGCACTCCTTCATGTGACCCAGGCTCTTCTCGTACTCCCTG 
ATGGAACCCTCCGTAGCCTCGTGCACTTTGGAGCCCCCGCAACTGAAGTTGCTGTCGAATCTGGTTTTATACGAGCGATTCCGGAAGTTGGCATT 
CTTGCTTCCAGCCTGCAGGAGGCGCACATAGCCAACAGCGTTGCCTAAAAGCAATATTTACGTTTATATTTTAAGCATTTTCTTGAATTATGGTC 
TTTAACCACCTACCGACTTGTGTGATAACCTTCCTGAAAAGATCCATATACGTCTCGCCATTGCTGGAGCAGCCTAGCCTTCTGATCTTCTTAAG 

AAAACTATCGGCGCGCTCATAG 
{SEQ ID NO: 1093) 

Exon: 1001.. 1334 
Exon: 2806.. 3677 
Start ATG: 1263 

GCAGCAGCGA^ 

CTCTGATTCGACAGTTTCGGCACAGGTCGGCAGCGGCATTTCAGATTGCGGAGTCCGCCATTTTAGCTAACAAATCTATAACCCTAGCTTCGTCA 
GCGTGCAGTGTGAATAGGGGTCAGTTGCAGTTTGGAATTCAGATTGTTCGCGAATATTCGAAAGCGTCAAAGATGAGTACTTTGCCTAGGGTGTT 
TTTTGATATGACCGCCGACAACGAGCCCCTCGGTCGGATCGTCATGGAGCTGCGCTCCGATGTCGTGCCCAAGACCGCCGAGAACTTCCGCGCCT 
TGTGCACCGGCGAGAAGGGATTCGGGTACAAGGGCTCCATTTTCCACCGCGTCATCCCCAACTTCATGTGCCAGGGCGGCGACTTCACCAACCAC 
AACGGCACTGGCGGCAAGTCCATCTACGGCAACAAGTTCCCCGATGAGAACTTCGAGCTGAAGCACACCGGCTCCGGCATCCTGTCGATGGCCAA 
CGCTGGCGCCAACACGAACGGCTCCCAGTTCTTTATTTGCACTGTGAAGACCGCCTGGTTGGACAACAAGCACGTCGTCTTCGGCGAGGTCGTCG 
AGGGACTGGATGTGGTCAAGAAGATCGAATCTTATGGATCGCAGTCTGGCAAGACCTCCAAGAAGATCATTGTGGCTAACTCTGGTTCTCTTTAA 
GAAGGTGCGATGCAAAAGCAACAGCAACATTGCAAGAAGAAAAATCTGTTAAGACGAAAACCATTTGCATTTTCTCCAAAACATTTTTTTTTACC 
AATTACTGTACTGTAATATTACGAGAAAAATACATAACAAACCAACATATTTTCAATAGACATGTTGGGTTTCCTTTGGCCAATCAATTGTATGT 
GGGGGCCAAAAGTGAATAGAATTAGTGCCCACCAATGTCAGAAAGCGTTCAATTGGCTATATAATATGCATTAAATCTGTAACTTTAACATTTCC 
AT T C AC AAAAAAAA AAAC AAAG AAG AAAAAC AAT G AG AC G AC A AG AGT C GAG A GGG GCT T AG AAAAAT GT T T GAT C T T T T GAT GTTGGCTT AC TT 
ACTACTAATAATAGTATTAGTAATGCTGATTATCTAATTGCATATATATATATATATACCAACGTT 

(SEQ ID NO: 1094) 
Start ATG: 2 63 

MSTLPRVFFDMTADNEPLGRIVMELRSDVVPKTAENFRALCTGEKGFGYKGSIFHRVIPNFMCQGGDFTNHNGTGGKSIYGNKFPDENFELKHTG 

SGILSMANAGANTNGSQFFICTVKTAWLDNKHVVFGEVVEGLDVVKKIESYGSQSGKTSKKIIVANSGSL* 

(SEQ ID NO: 1095) 

Name: PEPT IDYL- PROLYL CIS-TRANS ISOMERASE 7 (ROTAMASE) (CYCLOPHILIN) 
Classification: chaperone 
Gene Symbol: Cypl 
FlyBase ID: FBgn0004432 

Celera Sequence No. : 142000013385215 

GAACAAAAGGAAAGAGGAATTCGTTAAGCGATAAAAGATAAAGATGATGGTGAAATATGAAAACTTATTTGACTCAAGAATTGCAATATACATAT 
ATGTATCTAGATTGAATAAAATTATCTGTTAGGCAAATTGAATCTTGAATTAAGTGAATAAAATTCACAATGACGATAAATCTTTTTAGTACTAT 
AAATATTAATCTCCAATAGCCATAAACTACTGGTCATTAGACGACGATATTATATCAATTAAAAACTTATAAAGAAACCCATAACTTGGTTGTAT 
ACAAACATATAGATCTGTACAAGTTGTTATTACTTTGGGCTCAAGTTCTCTAAACCAGTAACAAAATTCTTATAGGCTAAGGTTCGCAAGTACTT 
GAAGACTGAATGCCCATTTGTAACCGAATCTCACCTTCCAGCAGTGGAGGCTCATCATCGAATTCGTTGCCGGCTCCTCCGCCAGCAGCTGACTT 
TCCGCTCTTGTCCTGTCCGTAGCTCGTGTCGGTGTAGGCCGTGGGATCGTAGAAGCCACCCAAGCCTTGGCTCGGCGGTTGGGCATAGGACGCAT 
CGTAGTTCGGCGGCACTGAGGCCTGTGTGTTGTCGAAGGTTTGGAAATTCCTGCAAGCAAGTGGGGCAACAATTGTGAATGCTGTGAAATGGAAT 
TCCGGGTAACTACTCACAGTTCCTGGCCAAATTCTGGCATATCGAAGTTGTAGCTGGCATCGGCCGAGGGCGCGGAGCCGTAAAAGTCATTGGGT 
CCGCCGAACTGAGACATGATTACTTTAATTTAAGCGCAATTCATATATTTACAAAATGATATTTAATTTAACAACACGTCAACTAGAGGTGACCA 
ATAGCTAAATGTAATCGATGAATTTCGATGTTACCTATCGTCTAACACCGGTGCTATCTGAAATCGATATTTTCTCGCCCATCGATTTGTTTTAA 
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ATTCAAAAAATTGCGCCACTTTATAAGAAAGTAAATAAAAACCCGATCATTTTGTAACATTCATAATACATTTTATTTCCATAGCTCCATTTGCA 
ACACACACAAGTTACTGAAATAGCTATCCTTTGGTAGTAGAGGTACATGCTTTTGGGGCTACGTTGTTATTGCGTTCTTTAAAGGTCTATAATAT 
GTAGGAATATAATATTTTGTAATTTCGTAAAGATTCAAACGCCCTTTAGACTCGTCGTTTAGTTGGTTTAGTTTCACACAATAGGGTAACGAACA 
CAATCGATAACTATATGAAAATTTACTAACAATACATAGCTGTGAATTATATTTATATATCAACATAGATAATTATACATATGTGTCATATATGT 
ATGGTATTTATGTATATGGCTGGTTTAACTGCGTTCCATTTGCGCTACCGACTAATTATTGTTCGTCTTTCCTTTCGTTTTGCGTATCTTCTGGT 

GAT TCGATTCT G AC C T GAC TAG AC C T G AAG C GG GC C AAAC T GT T T GT T AAAC T T AT AAG T AAAC T T GG AT T C AAAT AT CT T AT T AT GC AT T AT GT 
AAATATGACTAGATTGAATGGGGAATTCGAATAATAACCTAGCTTTGGTTTCTTATTTCGTTTGGTTTGCTTCTTTTTCCATTCCTCTCGCTAGG 
AAAGCCCGCAGTTGGTTTCTGTCCGTTTCCGCTTAGTCCTCTAGATTTCGGAAGGCCTGCTGCAGATCCTCGTACAGCTGGTCGTAGTCCGCCTT 
GATCTTTTGCTCGCCATCTTTCACCGGGTCCTTAAAATGTGAATTATTAATATAAATAATAGTTGGAATAGATAAACCCACTCACCTTGAACTTC 
ATTGACGACAGCTGGTACATAATTCCGCCCATCGATTCCCTGATGGTGTTCCATGTGATCTTGTTGTCCGACTGGGCTGTGGACTCAACGGCATG 
ACGGGCGGTCTCATAGAAGGCCATGATGTTTCTCAGCATGCCCACGGTCTTGTAGAAGGGACAAACGCGATCGTATGGTGAGTAGGAGTTCTGTT 
GCAGAAAGTCGTCCTTCAGCAGCTTTGCCACTTCCAGGGTCACCTTGTCGGTCTCGGCCAGTGATGCTTTGCCCACCAGCTGAACGATCTCAGAC 
AGATCCTCCTCCTCCTGCAGGATCTCCTTGACCTTGGTGCGTAGTGGCACGAACTCGGGGTAGTTCTTGTCATAGTATTCATCCAGAGCACGCAT 
GTACTTCGAGTAGGAGATGAGCCAGTTGATCGAGGGGAAGTGCTTGCGCTGGGCCAATTTCTTGTCGAGACCCCAGAACACCTGCACGATACCCA 
AAGTGGCGGAGGTCACGGGATCGGAGAAGTCACCACCAGGAGGAGACACAGCTCCGACAATGGACACGGATCCCTCGCGCTCCGGGTTACCCAAG 
CACTTGACGCGCCCAGCACGCTCGTAGAATGTGGCCAGACGAGCTCCTAGATAAGCCGGGTAGCCAGAATCGGCAGGCATCTCAGCCAAACGACC 
CGAAATCTCACGAAGTGCCTCAGCCCAACGGGAGGTGGAATCAGCCATCATGGCTACGTTGTAGCCCATATCACGGAAGTATTCAGACAGAGTGA 
TTCCAGTGTAGATGGAGGCCTCACGAGCAGCCACCGGCATGTTGGAGGTGTTGGCCACCAGAGCAGTTCGCTTCATAATGGACTCGGTGACGCCA 
TCTATCTCGCAGGTCAGTTCGGGAAAGTCACGCAGTACCTCAGACATCTCGTTACCGCGCTCGCCGCAACCGACGTAGATGATCACATCAGAGTT 
GGAGTACTTGGACAGGGCCTACGGGTGGTTGAAGTGCAGATTGGAAGGGATGGTCATTTAATGAGATGTAAGATGTGGATTAAGTGAATAATTTG 
CAACTTGCCTGCGAAATGACGGTCTTGCCGCAGCCAAAGGCACCGGGGATGGCAGTGGTGCCGCCCTGTACGCAGGGGAAGAGCGAGTCAAGGAC 
GCGTTGGCCCGTGAAGAGCGGATGGTTGGCTGGCAGCTTCTCTGTGACGGGACGTGGCTGCCGCACTGGCCAGACCTGCAACATGGTGTGCTTGG 
TTATCTCGCCGTCGAACTCCGTCTCCAGGACAATGTCCTCCAGGTTGTAGTTGCCCGCGGGGGCAATGTATCGAACGGTTCCCTTAGCCCTGGGT 
GCCACAATCATGCGCTGCTTCACCAGCGTGTTCTCGTGTACCACTCCATACAGATCTCCTCCGGTGATGTGGGATCCCACCCGCACATTCAGCGG 

ATT AAAT T C C C ACATC T CCG AGC G C G AC AAAGC AGT T GT GT T GAC AC C TT T GGGT AT AT AG AT G GAG T T G GT CAT G AC ACC AAT GT C C C GC AAAG 
GACGTTGGATGCCGTCGAAGATGCTGCCCATAATGCCGGGTCCAAGTTCCACGGAAAGAGGTTTGCCGGTACGCAGCACCGGATCGCCGACAGTC 
ACGCCAGAGGTCTCCTCGTACACCTGGATGGTGGCCATGTCGCCCTCCAGACGGATGATCTCGCCCACCAGCTCGTAGTAGCCGACGCGGACCAA 

C T C GT AC AT AG C T GAT C CAG AC AT G GC C T C AGC GGT G AC C ACT G AAG T GG AT T T AAAGT T GT T G AAGG AGG AAAAGC AGG AGAAAT AT G AT T AGC 
AAGGAGTGGAGAGAGCAGCATAATGAGGAAGATTATGCGGAACAGGAGGAGGGGGAAGAAAGCATACAGAAATATGAAAAGGAGAAGGATAATAA 
GGAAGGATAAAAAGGAAGATGTGGAGGAGTATAATTAAGAACAGAAGAAAGGCAGGATTAAGGATTACAGAACTTGGGGGTCAGCGCATGAAGGA 
CGCTGGTGACGTCGCTTACCTGGTCCGGATACCGCGTAGACACGGCCATATTCCGACTCGCGCTCCTCGTCTTTGAATTTCCTCAAGTTGGACAT 
GGTCGCGGTTGCTGCTCTTCTTCTTCTGCTTCGGTAACGGTCAAATCGTTCGTTGGCTGTGGGAAAAAGTAACCGTTTTTCTTATAAACAAAAAC 
ACGCATTTTCCATGCCCCCGAATGAGAGAGCGTTTTCCGCGAGGAAACGCATAGGTGGCGCTGATGTGCAGCAGCAGCCCCCAGTGAAAAGCTTC 
AC ACCCCCGC AGC ACAGCATCGATTTTCC AAT ACAATTTTCACACATGCCACCAACGGCAAGGTGAATTTTCC AC CTTTTACCCATCAC AAAAGC 
AATTTTCACACTTTCTTTTCATTCAACACCTTGTGGTCAGTGGTTTTGTGGCGTAAAAACTGCTTTTTCAGCTGCTCTAAAATTAAATAAGCTCA 
CATTTGTTTCAACAGCTCTTATGAAAATTCTCTTGCTTTTTTGATTTCACTCTAGTATTGATCTATCGGTTATCGTCGAGGGCTCCAGGTTCTCT 
ATAGAGATGGGCTCACATCAAACTCACGCACGTCACGATACATTAATAGTCGGATTCCACTAGCACATTCCGCAGTTAAATCTTGAAATACTTGT 
TGGTAATTATTAAAAATTTAACATAAGCTTGCATTTTTGTGCTATTTTCCAAATTCTTCAATGCAACTTGTTCACTCCAGTAATATAAATGTTTT 

GT AC C AC AAC T T T AAGC C AAT TTCTTTTT AAT T GC T TAT AAT AAAG AAC AAAT T T AT T AAT T AC T TT AT T AT T T T TAT T T AAT AT G T T C T CT AA C 
ATTTTAAATGCTTATAAGTCGCCGCCAAGTACAGAAGTATCAGCAAGAAAATATCGATTGTAAATTGATCGAATTCTGTTTTACATCACTAATCG 
GCAGCGGCTGCAATAAATTTCCGTAAAAACCGAAGTAATCCGCAATTAAAGAAAGCAAAGGACCTGCATCCGTGTGAGATCGTTCGGCACCGGTG 
CCGGCACATGAAATCCTATCGAGTGACCAGTGGAACAGTGCGTACAGCGCCCCGGGGCAACTGAAAAGATAACCTCCAAAGTCCGTAGACAGCAG 
GCAGATGCACCTGCTCCACCCTTTCCCCTCCCAAATGCAAGTGAAAATCGCGTTTAAATAGTGAATAAACCCTGCCCAGCCATGTCGACGACGTC 
GGTGGTGCAGCAGTTCGTCAACGGGCTTAAGTCCCGCAACCGGAATGTACAGAACAAGGCAACCCAGGATCTGCTCTTCTATGTGAAGACCGAGC 
TACGCGAAATGTCCCAGGAGGAGCTGGCCCAATTTTTCGACGAGTTCGACCATCACATTTTCACGATGGTGAATGCGACAGATATCAACGAGAAA 
AAAGGCGGTGCCCTGGCCATGAGTGAGTTCCTGGCTCCCCGGCAGTGCTTGAGAGGATTTAACGTCTTCCCTTTCAGAGTGCCTGATTAACTG 

(SEQ ID NO: 1096) 

Exon: 4318 -.4277 
Exon: 3951.. 3820 
Exon: 3556.. 2859 
Exon: 2773.. 1796 
Exon: 1740.. 1001 

Start ATG: 3895 (Reverse strand: CAT) 
Transcript No. : CT27956 

CAAAAAAGCAAGAGAATTTTCATAAGAGCTGTTGAAACAAATCCAACGAACGATTTGACCGTTACCGAAGCAGAAGAAGAAGAGCAGCAACCGCG 
ACCATGTCCAACTTGAGGAAATTCAAAGACGAGGAGCGCGAGTCGGAATATGGCCGTGTCTACGCGGTATCCGGACCAGTGGTCACCGCTGAGGC 
CATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCGGCTACTACGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGCGACATGGCCACCATCC 
AGGTGTACGAGGAGACCTCTGGCGTGACTGTCGGCGATCCGGTGCTGCGTACCGGCAAACCTCTTTCCGTGGAACTTGGACCCGGCATTATGGGC 
AGCATCTTCGACGGCATCCAACGTCCTTTGCGGGACATTGGTGTCATGACCAACTCCATCTATATACCCAAAGGTGTCAACACAACTGCTTTGTC 
GCGCTCGGAGATGTGGGAATTTAATCCGCTGAATGTGCGGGTGGGATCCCACATCACCGGAGGAGATCTGTATGGAGTGGTACACGAGAACACGC 
TGGTGAAGCAGCGCATGATTGTGGCACCCAGGGCTAAGGGAACCGTTCGATACATTGCCCCCGCGGGCAACTACAACCTGGAGGACATTGTCCTG 
GAGACGGAGTTCGACGGCGAGATAACCAAGCACACCATGTTGCAGGTCTGGCCAGTGCGGCAGCCACGTCCCGTCACAGAGAAGCTGCCAGCCAA 
CCATCCGCTCTTCACGGGCCAACGCGTCCTTGACTCGCTCTTCCCCTGCGTACAGGGCGGCACCACTGCCATCCCCGGTGCCTTTGGCTGCGGCA 
AGACCGTCATTTCGCAGGCCCTGTCCAAGTACTCCAACTCTGATGTGATCATCTACGTCGGTTGCGGCGAGCGCGGTAACGAGATGTCTGAGGTA 
CTGCGTGACTTTCCCGAACTGACCTGCGAGATAGATGGCGTCACCGAGTCCATTATGAAGCGAACTGCTCTGGTGGCCAACACCTCCAACATGCC 
GGTGGCTGCTCGTGAGGCCTCCATCTACACTGGAATCACTCTGTCTGAATACTTCCGTGATATGGGCTACAACGTAGCCATGATGGCTGATTCCA 
CCTCCCGTTGGGCTGAGGCACTTCGTGAGATTTCGGGTCGTTTGGCTGAGATGCCTGCCGATTCTGGCTACCCGGCTTATCTAGGAGCTCGTCTG 
GCCACATTCTACGAGCGTGCTGGGCGCGTCAAGTGCTTGGGTAACCCGGAGCGCGAGGGATCCGTGTCCATTGTCGGAGCTGTGTCTCCTCCTGG 
TGGTGACTTCTCCGATCCCGTGACCTCCGCCACTTTGGGTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAATTGGCCCAGCGCAAGCACTTCC 
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CCTCGATCAACTGGCTCATCTCCTACTCGAAGTACATGCGTGCTCTGGATGAATACTATGACAAGAACTACCCCGAGTTCGTGCCACTACGCACC 
AAGGTCAAGGAGATCCTGCAGGAGGAGGAGGATCTGTCTGAGATCGTTCAGCTGGTGGGCAAAGCATCACTGGCCGAGACCGACAAGGTGACCCT 
GGAAGTGGCAAAGCTGCTGAAGGACGACTTTCTGCAACAGAACTCCTACTCACCATACGATCGCGTTTGTCCCTTCTACAAGACCGTGGGCATGC 
TGAGAAACATCATGGCCTTCTATGAGACCGCCCGTCATGCCGTTGAGTCCACAGCCCAGTCGGACAACAAGATCACATGGAACACCATCAGGGAA 
TCGATGGGCGGAATTATGTACCAGCTGTCGTCAATGAAGTTCAAGGACCCGGTGAAAGATGGCGAGCAAAAGATCAAGGCGGACTACGACCAGCT 
GTACGAGGATCTGCAGCAGGCCTTCCGAAATCTAGAGGACTAAGCGGAAACGGACAGAAACCAACTGCGGGCTTTCCTAGCGAGAGGAATGGAAA 
AAGAAGCAAACCAAACGAAATAAGAAACCAAAGCTAGGTTATTATTCGAATTCCCCATTCAATCTAGTCATATTTACATAATGCATAATAAGATA 
TTTGAATCCAAGTTTACTTATAAGTTTAACAAACAGTTTGGCCCGCTTCAGGTCTAGTCAGGTCAGAATCGAATCACCAGAAGATACGCAAAACG 
AAAGGAAAGACGAACAATAATTAGTCGGTAGCGCAAATGGAACGCAGTTAAACCAGCCATATACATAAATACCATACATATATGACACATATGTA 
TAATTATCTATGTTGATATATAAATATAATTCACAGCTATGTATTGTTAGTAAATTTTCATATAGTTATCGATTGTGTTCGTTACCCTATTGTGT 
GAAACTAAACCAACTAAACGACGAGTCTAAAGGGCGTTTGAATCTTTACGAAATTACAAAATATTATATTCCTACATATTATAGACCTTTAAAGA 
ACGCAATAACAACGTAGCCCCAAAAGCATGTACCTCTACTACCAAAGGATAGCTATTTCAGTAACTTGTGTGTGTTGCAAATGGAGCTATGGAAA 

TAAAATGTATTATGAATGTTACAAA 
(SEQ ID NO: 1097) 

Start ATG: 99 (Reverse strand: CAT) 
MSNLRKFKDEERESEYGRVYAVSGPVVTAEAMSGSAMYEL^ 

IFDGIQRPLRDIGVHTNSIYIPKGVNTTALSRSEMWEFNPLNVRVGSHITGGDLYGVVHENTLVKQRMIVAPRAKGTVRYIAPAGNYNLEDIVLE 
TEFDGEITKHTMLQVWPVRQPRPVTEKLPANHPLFTGQRVLDSLFPCVQGGTTAIPGAFGCGKTVISQALSKYSNSDVIIYVGCGERGNEMSEVL 
RDFPELTCEIDGVTESIMKRTALVANTSNMPVAAREASIYTGITLSEYFRDMGYNVAMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLA 
TFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSINWLISYSKYMRALDEYYDKNYPEFVPLRTK 
VKEILQEEEDLSEIVQLVGKASLAETDKVTLEVAKLLKDDFLQQNSYSPYDRVCPFYKTVGMLRNIMAFYETARHAVESTAQSDNKITWNTIRES 

MGGIMYQLS SMKFKDPVKDGEQKI KADYDQLYEDLQQAFRNLED* 
(SEQ ID NO: 1098) 

Name: V-ATPASE A SUBUNIT 
Classification: enzyme 
Gene Symbol: Vha68 
FlyBase ID: FBgn0013355 

Celera Sequence No. : 142000013384830 __^ mo 
TGTGTTTATGCAGTGTGTGTGTGTGTGCTGCCCACTCTCCTCGCCTTGCTCCCTGCGCTGTCCCCCGCCCTCTCTCCTGCTGCTTATCGCGCGTC 

TCCATCGCCATGCACTTTGTTGCTGTCGGCCATAGCTGCAGTTGTTGTTGCCTGCTTTTTTCTTTTTCTCTCTGTTTCGTTTTTTTTATCGCTGC 

GTGCGTGTAAGCCGACGCTAGCGGCATCTCCCCTCTCGCTCTGGCCCCCTAGGAACACAACACTACACAACAACGCAGCAGAGCGAGCTCCATGG 

GCAGAGCTGACCGGTCTTAGCGGGAAAATAGCTAGTTCTGGCTATGAAGAGTCATGTTAAGTAAATGTCGTTTGCAAGTAATTCTATAGTGTGCG 

TTTTACTAGTATACAAAAGTCCCTTTTACAAAAGCTATGTTTTCTAAGCATAGAAACTGCAACGAAACTTCAGGTCATGTAAAAGGGACTTTACC 

TAACTTTGAAGGACCGAATGTTCTCAAATAAACTCAAATAAATTTTTACATAGAAGGTTACGGAAGTCTGACTATAAAAAGAATATACGTTTAGG 

ATATTTAGTGTATTGGATATTTTCTCATTTCAAAGCCTATTCTCAAAAAAGAAAATGTTTCTAACATCGAAAACATCACTAAACCGGCTGAGCTG 

GCAGCTCTATCCCCCGACGTCGTCAGTCTCAGTCCCTTTTGCCCCAACTCCGCCCGCTCGCCATGCTTGCCTGCTTGCCAGTGTGTGTGCGTGTT 

GTGTTGTGTGTGCCTCATTGTTTGCTTTTTGTTGTTGCTGCGTTGTTTTGTGCCTCTCTGCTCGTTTCTCGATTCGTACGAAAAAAATGTAGTAC 

AGCGTGTGGTATGGCAAACGAAAGCTATGAGTTTTCAGTGTATCTTGTCTGCCCTCTCTTTCCCGCCCGGAATCTATGGTTGGGAATTATAAGAG 

CCGCGAGCGGCCTCCTTCGCCGCACAGGCGTTGCAGTCGTGGTGGACGCATCACCTGACCAGCCAGAGGCCCAGGTTGAGCTTCTGCAGCTTGGG 

CAGCTTCATGATGATGTCGATGCCCTTGGAACTGAGTTGCGTGCAGCCGTAGAGATCGATGGTCTTGAGATTAGTCAGATCCTCGGCCAGCGTTT 

GCAGGCCCTTGTCCGTTATCCGGCTGCACTGTCCGATATTGAGGTTCTCCAACTCGTGCAATGCCTTGGCGATCTTTAGCATGCCGTGATCGGTG 

ATCTGGCACTGATTGAGGGACAGGGATCTCAGTCTGTAGAGTCCCTGGGCTATGTGTGTCAGGGCCTGGTCGCTGATCTTGTCGCAGAAACTGAC 

GTCCAGCGAGTTGATCCCGCTGCCGCCTTCCGTGAGGTAGGCCATTCCGATATCCGAGATGTTGTCGCAGGAGCGCAAGTTCAGCTGCTCGAGCT 

TGGGCATCCGGGCCAGGTGCTTTAGTCCGCTGTCCGTCACCGAGACGCAGAAGCTCAGGTTGATCGACTTCAGCGAGGTCAGGCCCTGGGCAATG 

TGGCCCAAAGCCTCGTCGCTTAGCCGCTGGCAATCCTGCAGACCCAAGTACTCCAGCTGCAGGTTGCCTTCGGCCGTTTCCCGCGAGAAACCAGC 

CAGGTGTCCGATTCCCTGGTCGCTGATGTGCCAGCAGGAGCGCAGATTGAGGTGCTTCAGCTTCTTCAGGCCCCACGCAATGAGCAGCAGTCCTG 

TGTTCGTGATGTTGCAACAGCCGCCCAGTTCCAGGGTCTCAAGATTCCTTAGATGCTGGGCGATGCGCCCCAGACTCGTGTCCGTGATCTGTTTA 

CAGAGGGAGAGGTCCAGTGTCTTGAGGTTTGGCAGATCCACGCTAAAGGCATGGCCCAGGTTCATGTCCGCCACGTTGAAGCAGCCGCTCAGGTT 

CAGAGAGGTCAGAGCCGGAACGCCCAGGACCAAATCCTTAAGCGAGCGGCGCAGCGATAGGATCTGCACCTTCTTGATGCCGCGCTTGACCAGGC 

AGTTGAACAGGCTCGGACTGGAACGCTTAAGGTGCAGCTTGGCCTCGACTCCCTTCCAAACGCTCTTGGCATAGGCCGCATCCCGCCATGCGGTG 

CAAACCTGAGCTGCCCGACCCAAATCCCTCACTGGTAGGTGCTCGAATATCTGCTCGAGGAGTTCTGGGAACAGGTTGCTTATGTGGGTGCCCTC 

CACCGGCGGCGGGCTCTCCGGAGATGCTGGTCTGTGCGGGAGATGGTGGTGATGGTGATGATGGTGCTGGGCGTGGGCCGCTGCTGCTGCCGCTG 

CCGCCGCAGCTGCTGCGGCATGCTGCAAGTAGAGGGCCGGCGGCAAGGTCTGCAGCTGGTGGTGCGGCGGTCGGTGGTGCAGGGCGTAGGGCGTG 

AAGCGCAGCAGGTGGTGGTGGGTGGTGGTGGCTCCGGTTCGCGTCAGACCGGGCAGCTCAGTCTGTTGATAGAGTTCGTCCATGGCCCAGCTGGG 

CGCTCCTCCGGCTCCTGCTCCGCCAATCGCCGCGCCGCCACTGGTGGCCGCTCCGTTGCTCCCGGATCCGCCACAGTGGCTATTCAGATTGTTCA 

ACACCGCGATTGTTTGATGATGGAAGGGCAGCAGCTCCGTGTTGGTCGCCATCTCCTACTCTTGATTTTACACTTGGCTTGTCGCTTGACTGCTA 

CTTTTGAAATTAACTGTTGGATTGGCTATCCTATATGCGCTGCATTTTACTGTTTGTATGCTTAGTTGAAGACTAGTAGGGTTCACTGTGGTTAT 

TCACTGCACTTAACTCGAGCACTGTGTGTAGTTGGAAGTCCAAGTTCTATTTGTGTGTTCTTTGTTCTTGCGGCCGAGACGCGATGTGCACGAAC 

CGGATGATAAAGAAAAACTGAACTCGCAGCGGCAAAGCGCCTTGCTTTTAGCCTGAAATACAGATGCAGATACAGATACGGGGGGTTTGTGTTGG 

TATTGGTCATATTTTTATCTGAACGTATTCAGCGCTCTCACCGCTCTCTCGAGCGAGTAGAGAGTAGCGAGTAGGTATTCGTCCGTTCAGTCTGT 

TGTATGTCCCGTGTGTGCGGTTCGGTCGTTCTTCCATTGCGATTGCACTTGGAATTGACTTTGTGCGGCATTTTTATTAGAATCATTGCAACGCG 

TGGTTTTCTTGCTAATTTTGCCATAAGTTCTTATTAAAAGGTAGCTAGCATATGCTCTTTCTGTCCTTTATCACCAGTTCTGCGAGTCAATTTTT 

AACCAGTTCGGCTTGTATATAAATAATTTAGGTTCAGACATAATCTCTATTGCTCGTAAGGAGTTTCACAAATTGTTTAAACTTAAAGTTTGAAG 

ACCGTTTTGCGTATTCAACTCATCCGTGACCCATTTTTTTTATTATTTATCATATAAATATATTTATTATCTCGACATTGTGTGGCGTTCTACTT 

AGGATCTCCTGCAGCTTATGATAAGAGAAGCTTCCCCATATTTCCTTTCTCGCCAGCTCTCTCATCACTAACGCTGGTTATAAGACCAGGCTTGG 

ATTTATCTTTCGGGTTTTTCGGTTTTGGACGCCAGCTCCATGTTTCAGGTTATTCTTTACCCGTCGAGCCAAACTCAATGCCAAACTCCGACGCA 
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TTTGTTGTTTCTGCTGCCGCTGCATTCCGAAGTCGAGCAGGTACCTGTCTTTCTCCCTCCCTCGCCATCTCTCTCGCTCTCTATCGCATCTGTGT 
GAGTGCGATCGCTGCTCGCTACCTGCGAACGCAAGAGCAGCAACAACACAATCCCGGTTTCCAGGAAAATATTAACGATAACTCCAAAAAAAGCA 
GAAACTCCGTTGCCGCACCACACACAGAAACTCAGA 
<SEQ ID NO: 1099) 

Exon: 2836.. 1001 

Start ATG: 2617 (Reverse strand: CAT) 
Transcript No. : CT28019 

CGTCTCGGCCGCAAGAACAAAGAACACACAAATAGAACTTGGACTTCCAACTACACACAGTGCTCGAGTTAAGTGCAGTGAATAACCACAGTGAA 
CCCTACTAGTCTTCAACTAAGCATACAAACAGTAAAATGCAGCGCATATAGGATAGCCAATCCAACAGTTAATTTCAAAAGTAGCAGTCAAGCGA 
CAAGCCAAGTGTAAAATCAAGAGTAGGAGATGGCGACCAACACGGAGCTGCTGCCCTTCCATCATCAAACAATCGCGGTGTTGAACAATCTGAAT 
AGCCACTGTGGCGGATCCGGGAGCAACGGAGCGGCCACCAGTGGCGGCGCGGCGATTGGCGGAGCAGGAGCCGGAGGAGCGCCCAGCTGGGCCAT 
GGACGAACTCTATCAACAGACTGAGCTGCCCGGTCTGACGCGAACCGGAGCCACCACCACCCACCACCACCTGCTGCGCTTCACGCCCTACGCCC 
TGCACCACCGACCGCCGCACCACCAGCTGCAGACCTTGCCGCCGGCCCTCTACTTGCAGCATGCCGCAGCAGCTGCGGCGGCAGCGGCAGCAGCA 
GCGGCCCACGCCCAGCACCATCATCACCATCACCACCATCTCCCGCACAGACCAGCATCTCCGGAGAGCCCGCCGCCGGTGGAGGGCACCCACAT 
AAGCAACCTGTTCCCAGAACTCCTCGAGCAGATATTCGAGCACCTACCAGTGAGGGATTTGGGTCGGGCAGCTCAGGTTTGCACCGCATGGCGGG 
ATGCGGCCTATGCCAAGAGCGTTTGGAAGGGAGTCGAGGCCAAGCTGCACCTTAAGCGTTCCAGTCCGAGCCTGTTCAACTGCCTGGTCAAGCGC 
GGCATCAAGAAGGTGCAGATCCTATCGCTGCGCCGCTCGCTTAAGGATTTGGTCCTGGGCGTTCCGGCTCTGACCTCTCTGAACCTGAGCGGCTG 
CTTCAACGTGGCGGACATGAACCTGGGCCATGCCTTTAGCGTGGATCTGCCAAACCTCAAGACACTGGACCTCTCCCTCTGTAAACAGATCACGG 
ACACGAGTCTGGGGCGCATCGCCCAGCATCTAAGGAATCTTGAGACCCTGGAACTGGGCGGCTGTTGCAACATCACGAACACAGGACTGCTGCTC 
ATTGCGTGGGGCCTGAAGAAGCTGAAGCACCTCAATCTGCGCTCCTGCTGGCACATCAGCGACCAGGGAATCGGACACCTGGCTGGTTTCTCGCG 

GG AAAC G GC C G AAGGC AAC C TGCAGCTG GAGT AC T T GG GT C T GC AGG AT T GC C AGC GGC T AAG C G AC GAGGC T T T GGG C C ACAT TGCCCAGGGCC 
TGACCTCGCTGAAGTCGATCAACCTGAGCTTCTGCGTCTCGGTGACGGACAGCGGACTAAAGCACCTGGCCCGGATGCCCAAGCTCGAGCAGCTG 
AACTTGCGCTCCTGCGACAACATCTCGGATATCGGAATGGCCTACCTCACGGAAGGCGGCAGCGGGATCAACTCGCTGGACGTCAGTTTCTGCGA 
CAAGATCAGCGACCAGGCCCTGACACACATAGCCCAGGGACTCTACAGACTGAGATCCCTGTCCCTCAATCAGTGCCAGATCACCGATCACGGCA 
TGCTAAAGATCGCCAAGGCATTGCACGAGTTGGAGAACCTCAATATCGGACAGTGCAGCCGGATAACGGACAAGGGCCTGCAAACGCTGGCCGAG 
GATCTGACTAATCTCAAGACCATCGATCTCTACGGCTGCACGCAACTCAGTTCCAAGGGCATCGACATCATCATGAAGCTGCCCAAGCTGCAGAA 
GCTCAACCTGGGCCTCTGGCTGGTCAGGTGA 
(SEQ ID NO: 1100) 

Start ATG: 220 (Reverse strand: CAT) 

MATNTELLPFHHQTIAVLNNLNSHCGGSGSNGAATSGGAAIGGAGAGGAPSWAMDELYQQTELPGLTRTGATTTHHHLLRFTPYALHHRPPHHQL 
QTLPPALYLQHAAAAAAAAAAAAAHAQHHHHHHHHLPHRPASPESPPPVEGTHISNLFPELLEQIFEHLPVRDLGRAAQVCTAWRDAAYAKSVWK 
GVEAKLHLKRSSPSLFNCLVKRGIKKVQILSLRRSLKDLVLGVPALTSLNLSGCFNVADMNLGHAFSVDLPNLKTLDLSLCKQITDTSLGRIAQH 
LRNLETLELGGCCNITNTGLLLIAWGLKKLKHLNLRSCWHISDQGIGHLAGFSRETAEGNLQLEYLGLQDCQRLSDEALGHIAQGLTSLKSINLS 
FCVSVTDSGLKHLARMPKLEQLNLRSCDNISDIGMAYLTEGGSGINSLDVSFCDKISDQALTHIAQGLYRLRSLSLNQCQITDHGMLKIAKALHE 
LENLNIGQCSRITDKGLQTLAEDLTNLKTIDLYGCTQLSSKGIDIIMKLPKLQKLNLGLWLVR* 
(SEQ ID NO: 1101) 

Classification: hypothetical 



Celera Sequence No. : 142000013384811 

GGGGAAATTGGAACAAGACCTTGGGTAACCGGCATACGAATGAAATTCTAATTTTATGAGGCCCAGCCAATTATAAAGTATGTGGGAAAAGTTTA 
AGGGTAATCCTAAAACTGGACACCAGTTTACCAAGGAAAGAAATTCTAGTTAAATCGAGTACCATAGACTTTACGACCGTAGGTAGGATTATAAA 
AAGTCATGTTAGTACCTTCAAATTTAGTATCGTAATATGACTTTTTACGAGAGTCGTTTGTAATAAAGTAAAGTCGTTCTTAAAGAAGTCGTATT 
TCTTTTAATAAGATACGTACTCCGCTCATTCAACTTTAAAATTGATATGTGTGGAAAATTAAGATGTACACATATCCTTGGCAACAGATCACTTG 
AAAATCCTTAGTGGTCCTTGGGCCATTGAACTTGTTCTCAAGCCCGCACTTCACCTCGAATCCACCCAATCCAAACTTCAAACTGACGACATGCA 
GCGCAATTTGAGATCCCAGATTCGGGAACCTTTCACGCACACATGTGCAACCTGATTTCCAAACGCTTAAAGGACACGCTGTGCCTGTGCCAGCG 
GCATCTTAAGTGATTTCGAAAAGGAGCTGGGAAGCGGTAAAAGTATCTGGAGGATGCGCGGATGTCGGATCTCAGATACTCAGATACAAGAATTC 
TTGAGTGCACGTCGTACGCAGGCACTTGGGCACACACACACTCCACTCATTGTAGGTTACTAGTTAAATCTGAAATCGAGGCAGCTGGAGAAAGA 
GACGGCTGCCGCAGGGGCCGACAAAAAGAGACGGGAC GAGGC ACTTGGCCAAGGAAC AT CTGTTGCGCTGACATAAATCCTGAGTGACTGACCGC 
TGGTGCGAAAGACCGCGAATCTCGAATGCGAATGTTCATGTGGATGCGACTTGGCCGCCGGCGCTCGTTTGAGGACCAAAACCAGTTTGCACCGA 
TTCCCAGCTGCCCGTTTGGGTACAGTGGTTCAAAAGGTAATCTATCTGATTCAAATCTTGTCCACCTAACAACAATTTGAAGTCCGAAGATAAAG 
CTCTTATTTCAATTAAAAAGAATGGCAGGATAGTAGGATAGCAGGAGAAACATTTCATTATTAAATTTGATTATCTACATTACATTTCGTACAGA 
AAAGAAGTTAAAAGTGATTTAATGCCCGTAAAACCACTGTGCCCGCCAATTTCATTGAGTAACTTTTGCGCTGCCCGTGAATGAATCTGTGGGAT 
ACTCCCCACCGCTTTCGTACTTCGATTCGGTGGGACAGGATCTGAGAATGGAAAACCAGTGAGCAGACTTACCTTCTTCTCGGTGAGCTTCTCGC 
AGCGGCGCTGCTTGCAGATCTGGTGGCTCTTGTCGTTTCGGCAGGGGGCGCACTCCCCACAGTTGTCCTTGCGCTGGCATCCCACGCATTCGCCG 
CACCGCTTGCGCTTCTTCTTGGACTGGAAAAGAGAGCAGGAAGAGGAGTTGTTGTCGTTGGAGTGGTTGTGGTTGTGGTTCTGGTGGTTATGGTT 
GTGAGTGGTGGGTGTGATTGTGATTGTGGGTGGTGGCAGTGTCATTGGCGGGTGGTTCAGTGTGGGTGGCGGAAGATGGTGGCGACGACCGTAGT 
CCTGTTAATCGCAGCGGTTGCCAGTTCGCCCCGTTTCGCAAGTTTATCGGTGGGGTTGTCGAAGATATCCGATATTTATCGATCAAAAACTATTC 
ACTTCTTTTGCTTCTCGTTTGTAATGCAAATGGATCGTACAATCGATATATACCTTCGGCTTCAACATCCCCACCCTTCGATCACCTACCTGCTT 
GTCGCATCCACTGGCATTGGCATCGTTGCTATCCCCCATGCCCAGGGCACTCAACGGACTCTTGAAGTTGGCCGTGCTGGAGACGGCGTCCACTT 
GGCCAGGATTGCTGGGGTCTACGTCCATGCCGCTGGCCTCGCTCTCGGCACTGGACTCCAGCGAATTGGGCCGCTGTTTCCGGCGCTCTTTCTTC 
CGCGCTTCTAGCCGAGCTATCGGCGACTGCAGCAACTTCTCGACCTTGACCGGAGTGGTGGTGGTGGGTGTGGGCTGGGTGGTGGTGGTGTGCGG 
CGTGTGGCTGCCACTCCCACCGCCAGCCGCCTGCGGATAGCTGTTGTTGCTGTTGTTGCCATCATACGAGGTGGGCGTGTGCATGCCCATTGGTA 
TGGGATGCGGCTGGCCGTGCTGCAGCGGATGGTGGATCAGTCCGGCGCCACCGCTGTTGCTGAGATCAAAGTCCTCCTTCTTGAGCAGCATCTCG 
AAGTCGCTGCCGCCCGAGATCGGCTCCTTCTTGACACTCAACGGGGTGAATCCCGGATATAGTGGCAGCTCCGCGTAGGGCGTGGGATAACCGCT 
GGCTCCTCGATGATGGTGGTGCGGATTGAGGGCGTGGTGGTGCGCGAAGGCGTTCGGCGGCGGTGGTGCCGGAAAACTGCCCGCTGATGGCGGTG 
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TGATGCTGGTCTGCAGACTCGAGAGTTGGGTCAAAGGTGAGCCCACACTCATCATGCCCGGCGAGCTGACCGAAATGCTGGCCGGCGACATGGCC 
ACCGTTTGCAGGGAACTCATGGCGGTGGGTCCCACGGGTCCTGCTACTGAATTGGGTCCTACGGAGCCAGGAGGTATGGAACCTGTTGCAGCTGC 
TACTGCAGCTGCCGATCCTGGTGGCACTCCTGCTCCTATGGGCACAGGCATTGGTGTTCCATCCGGTAGTAATTGCTCATTGAAGCCATGGAACT 
GGCTCTGAAACGACGGCAGCTTGGAGAACGGCTCGAAGCCGCTCTTCAGCTGCAGATGGGGTTGCGTTTGGGGGGGTTGTGGCCCATTGTCCTTG 
GACTCCATTTCCCAGGGACGCGTGTAGCTGCCGTTCGAGTTGGAATTGGAGTTCGAATTCGAGTTGGCGTTGCCGGCTGAGTTGTTCGGTTGCTG 
GTTACCCGGATTTCCTACTGCGGCGGAGGCGGCGGAAATGGCATTGGCATTGGCATCGCCGTGCCGCTGGCCACTGTTGCCTCCACTTGCAGCAT 
TAGCATGGTTGCTGCTGCTGTTGTTGCTGCTGCCCAAAGGCGCTGGAACCGTGGTATTGCTGCCGTCGGGGTTGCTGTTGTTGCCGTTGTTGCCG 
TTGTTGCCGTTGTTGCTGCTGTTGCCGTTGGTGTTGCCGGTGGCGGCCGCTGGCTGTTGCTGTTGCTGCTGCTGCTGTTGATGTTGCTGCTGCTG 
CCTGGCATAGTAGTCCCAGTGCTGCTGCTGCTGCTGCTGCTGTTGCTGCTGCTGCTGCTGATGTTGCTGCTGGGCGTGATGCTGCTGCTGTTGCT 
GTTGCTGCAGATGTTGCTGCTGCGGATAGTGCTGCTGGTAGTGATGATGCGGATGCGAATGCGAATGCGGATGGTGGTACGGCGTGGTATAGTGC 
TGGCTGAGGCTGCTCAGCTGGTTGAGATGGGTGTTGTGCAGGCTGGACACGGTGGCCTCCACAGATGTGGCTGCTGCTGCAGTTGCTCCCAAGTT 
GCTGCTGTCGACGGTAGCACCTGTGCTGCTCTGGGCTAAAATTGAAGACGCCATTTAAGCACAAACACTGGCATGGCTATGCAGTTGTATTGAAA 
AACTGCGGGGCTTTGTACTGATTTTTCGAAGATTCGTTTTTAGTTTTTGTACACTTATCGGTTCACTTTTCGTTTGTTCGTCTGCTTTTTGAGGT 
AGAGTTCTTCGTAAAAATGTTTCTGATTCGACTGGTTTGTTTGTTGGCTAGTGTAAAACACTATGATTATATCCGTATAATAGATTTGTATACTA 
TAATATAAGTTGAAGCTACTCCAAGATTTGTATTTCGTTTATTTTGAACTCACTGGTTTTGCGTGTTTGAATGCAAAGTTGAATATTTACTTGTT 
ATGGTGTTCTTCTTTGTAAAGTGCTTTATTAAATTGTTGTTTAAAAATTGTTTAAGATTTGTGGTTGGTTGTTTTGTACTTTATTCTTTTGTTTG 
TTTGTAAAACGGCAGCTAGGCAAAAATTTGTATTTTGATATTTGATTTAATTTGTAGCAGACTTGGTTTGGTTTTGGTTTTTCCTTTAAAAAATC 
TTTGTAAATTTGTAATTTGTAAAATTTGTATTTCGTATGCGTTGTTGTTGTTGTTTTATTTACGTTATTAATTACTTGCTGTTTGTGGTTGATAA 
TATTTGACGCTTGCTTGTTGTAAATTGATTAAGTTTTTGTACTCTGGTTTATGCTTCGTTTCTCAATTGAATTTGCTTGTAATATGTCGTCGGAT 
ATAAAAATAATTTAAGTATTTACATCTAGTAAATTGTACTTATGCGTAGGCTTAAAAATCTTTGTCTTCCGTTTGTTTGTTTGTTTGGGTTCTGG 
ACTTAACGATTAATTAAACATTAGCGCGTTACTTGTTATTGACTTTAATAGTAATAATATTTATATTTCATATAAGTTGTGTGTTTAGCCTAATT 
CATTTTACGCCTTTTGTTCGTTTGCTTTTCAATTTTGATTTTGGGCCGAAAGGTTTCTTTTTGTTTTTGTAAGTTGCTTAAAGAGCTTCCAAAGT 
ATTTTGTGT 
(SEQ ID WO: 1102) 

Exon: 3569.. 1800 
Exon: 1616.. 1308 
Exon: 1015.. 1001 

Start ATG: 35 69 (Reverse strand: CAT) 
Transcript No. : CT28117 

ATGGCGTCTTCAATTTTAGCCCAGAGCAGCACAGGTGCTACCGTCGACAGCAGCAACTTGGGAGCAACTGCAGCAGCAGCCACATCTGTGGAGGC 
CACCGTGTCCAGCCTGCACAACACCCATCTCAACCAGCTGAGCAGCCTCAGCCAGCACTATACCACGCCGTACCACCATCCGCATTCGCATTCGC 
ATCCGCATCATCACTACCAGCAGCACTATCCGCAGCAGCAACATCTGCAGCAACAGCAACAGCAGCAGCATCACGCCCAGCAGCAACATCAGCAG 
CAGCAGCAGCAACAGCAGCAGCAGCAGCAGCAGCACTGGGACTACTATGCCAGGCAGCAGCAGCAACATCAACAGCAGCAGCAGCAACAGCAACA 
GCCAGCGGCCGCCACCGGCAACACCAACGGCAACAGCAGCAACAACGGCAACAACGGCAACAACGGCAACAACAGCAACCCCGACGGCAGCAATA 
CCACGGTTCCAGCGCCTTTGGGCAGCAGCAACAACAGCAGCAGCAACCATGCTAATGCTGCAAGTGGAGGCAACAGTGGCCAGCGGCACGGCGAT 
GCCAATGCCAATGCCATTTCCGCCGCCTCCGCCGCAGTAGGAAATCCGGGTAACCAGCAACCGAACAACTCAGCCGGCAACGCCAACTCGAATTC 
GAACTCCAATTCCAACTCGAACGGCAGCTACACGCGTCCCTGGGAAATGGAGTCCAAGGACAATGGGCCACAACCCCCCCAAACGCAACCCCATC 
TGCAGCTGAAGAGCGGCTTCGAGCCGTTCTCCAAGCTGCCGTCGTTTCAGAGCCAGTTCCATGGCTTCAATGAGCAATTACTACCGGATGGAACA 
CCAATGCCTGTGCCCATAGGAGCAGGAGTGCCACCAGGATCGGCAGCTGCAGTAGCAGCTGCAACAGGTTCCATACCTCCTGGCTCCGTAGGACC 
CAATTCAGTAGCAGGACCCGTGGGACCCACCGCCATGAGTTCCCTGCAAACGGTGGCCATGTCGCCGGCCAGCATTTCGGTCAGCTCGCCGGGCA 
TGATGAGTGTGGGCTCACCTTTGACCCAACTCTCGAGTCTGCAGACCAGCATCACACCGCCATCAGCGGGCAGTTTTCCGGCACCACCGCCGCCG 
AACGCCTTCGCGCACCACCACGCCCTCAATCCGCACCACCATCATCGAGGAGCCAGCGGTTATCCCACGCCCTACGCGGAGCTGCCACTATATCC 
GGGATTCACCCCGTTGAGTGTCAAGAAGGAGCCGATCTCGGGCGGCAGCGACTTCGAGATGCTGCTCAAGAAGGAGGACTTTGATCTCAGCAACA 
GCGGTGGCGCCGGACTGATCCACCATCCGCTGCAGCACGGCCAGCCGCATCCCATACCAATGGGCATGCACACGCCCACCTCGTATGATGGCAAC 
AACAGCAACAACAGCTATCCGCAGGCGGCTGGCGGTGGGAGTGGCAGCCACACGCCGCACACCACCACCACCCAGCCCACACCCACCACCACCAC 
TCCGGTCAAGGTCGAGAAGTTGCTGCAGTCGCCGATAGCTCGGCTAGAAGCGCGGAAGAAAGAGCGCCGGAAACAGCGGCCCAATTCGCTGGAGT 
CCAGTGCCGAGAGCGAGGCCAGCGGCATGGACGTAGACCCCAGCAATCCTGGCCAAGTGGACGCCGTCTCCAGCACGGCCAACTTCAAGAGTCCG 
TTGAGTGCCCTGGGCATGGGGGATAGCAACGATGCCAATGCCAGTGGATGCGACAAGCAGGACTACGGTCGTCGCCACCATCTTCCGCCACCCAC 
ACTGAACCACCCGCCAATGACACTGCCACCACCCACAATCACAATCACACCCACCACTCACAACCATAACCACCAGAACCACAACCACAACCACT 
CCAACGACAACAACTCCTCTTCCTGCTCTCTTTTCCAGTCCAAGAAGAAGCGCAAGCGGTGCGGCGAATGCGTGGGATGCCAGCGCAAGGACAAC 
TGTGGGGAGTGCGCCCCCTGCCGAAACGACAAGAGCCACCAGATCTGCAAGCAGCGCCGCTGCGAGAAGCTCACCGAGAAGAAGGTGGACAAGAT 
TTGA 

(SEQ ID NO: 1103) 

Start ATG: 1 (Reverse strand: CAT) 

MASSILAQSSTGATVDSSNLGATAAAATSVEATVSSLHNTHLNQLSSLSQHYTTPYHHPHSHSHPHHHYQQHYPQQQHLQQQQQQQHHAQQQHQQ 

QQQQQQQQQQQHWDYYARQQQQHQQQQQQQQQPAAATGNTNGNSSNNGNNGNNGNNSNPDGSNTTVPAPLGSSNNSSSNHANAASGGNSGQRHGD 

ANANAISAASAAVGNPGNQQPNNSAGNANSNSNSNSNSNGSYTRPWEMESKDNGPQPPQTQPHLQLKSGFEPFSKLPSFQSQFHGFNEQLLPDGT 

PMPVPIGAGVPPGSAAAVAAATGSIPPGSVGPNSVAGPVGPTAMSSLQTVAMSPASISVSSPGMMSVGSPLTQLSSLQTSITPPSAGSFPAPPPP 

NAFAHHHALNPHHHHRGASGYPTPYAELPLYPGFTPLSVKKEPISGGSDFEMLLKKEDFDLSNSGGAGLIHHPLQHGQPHPIPMGMHTPTSYDGN 

NSNNSYPQAAGGGSGSHTPHTTTTQPTPTTTTPVKVEKLLQSPIARLEARKKERRKQRPNSLESSAESEASGMDVDPSNPGQVDAVSSTANFKSP 

LSALGMGDSNDANASGCDKQDYGRRHHLPPPTLNHPPMTLPPPTITITPTTHNHNHQNHNHNHSNDNNSSSCSLFQSKKKRKRCGECVGCQRKDN 

CGECAPCRNDKSHQICKQRRCEKLTEKKVDKI* 

(SEQ ID NO: 1104) 



Celera Sequence No. : 142000013384445 
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ATAACAAGGAAATGTAACGGTGATTTACGCAATGAAAGACCAAGAGAGTGGGCAAGGGAAAGAGATAGACAGGGCAACTCACAGTCCAAGTGCTT 
CTTCTTCGGGCCAATGCATTCCTCCGTGGTGGCCTTGCAAACGGACTTGGCGAGTCCTTGGCCCGCAAGGCTGTGACGGGCGGCCAGCAGCCTGT 
CGTTGATCGTTTGCCCTGCCATGGTCATGATGAAGCTGTTGCTGTTGTTGTTTCTGCTGCCGCTCGCTGGTTTTTATTTACTTGATTAATTTAAT 
TATGCACCAAAGGAGCTGACCACACCGACACACACACACACTCGAAAGCAGATACAGAGAGAGGGAGAGAGAGACACACACACACACGCGTATGG 
GGCGAGCGAGAGACAGAGAGGTATTGATTGATTGAGAATATTTCGAACGGGAAAACTCTGCTATGCGAGCCTCACGATTAAACTAACAGTAAAGA 
TTTAGTGGTTCCTACGTCACAATCACTACACAATTTATCGCCGCGGTTTATGCGCTCTCTAACCGGTGCCTAACGCTGAAACGCGATTTTTTAAT 
TCTCTATTTTTCCTCTGCCACAAATCGATAATGGCGACTGGCAAATGGAGCGAGCGTATGAGCAATGTCCTAAAGCGTCCACAAAGTGGAGCTAA 
CGCGTGCAATCGACTTGAGTTTACGGCGTTAGGTGGGAAAATATCGATTGCCATAACGTAAGCGAATACGAAATAGCTGAACTTAGCTTTTACGA 
T GT GT T C AT AT A AGT TC T GAAAAT AAAAAAAAATAATG AT AAAT AAAT AAAT AAAAAT AT AAAAT ATT AAGATA C TA T AAATAAAT CTTTTTTTA 
TAACTTAGCACTTAAAATTCCCCAACAGCAGTTAAAAAGTAGCTAGATTTGTCGGAAGTGTAGCCAAACGTAACGTAAGCTAAGGGGCTGTCAAC 
AGGAATTTTTAGTTTCGAGTAGGCGGTCATACTGGTCATTAAAATAAATAAAATTATACTAAAAAACAGTAGTTAGCACAATCAACAGTCTCGTA 
ATAAAATTGTAAGATGTTGAATGGACAGATTGCACACAGTTATTTGGAGAACTCGTGCACGCTGATCATACCCCGCCCGCCTGTCCTGAATCAGT 
CCTCAAAGTCGGCCCAGCCAGCAGCAGCTGCTGCTGCTTCTTCTTCCGCTGGTGAGGAGTTTGGTCAGGAAGCGGGGTCGGGGTTGGCGGAAGCG 
GAGGCGTTGCATGTGGATCGCATAAGAAACTGCGATATATTCGTTGACGCGCACGATCTTTATTTCTACGCAGATGTTAAGCTGTACAAGTGAGT 
CACATCCGGCAATCTGTCGTGCCCATGGTGGGCCATGGTGGCGAAATTCCAGCGATATCATTGCGTACTTGTACTCCCTTTTCGGACAGATTCCA 
CAGCCGTCGCACCTTCGACTTGCGAGAGCTGAAGACGCTGTTGATAAAGTTCAACACGACAGAGAATCACTACCAAATTTGCCTGCGCTGTCTGC 
CCCTAACAGCGGGTGCCCAGGAAGGACCCGGTGGAGAAAGAGGCGGAGGCGGTGGTACGGGGGATGCAGCCAAGCAGTGGAATAGTCGCAAAGAG 
TACATAGCCGCATTCCTGCACCTACCCACACTGAGTGCCAATTTGCCGGACAAGCAGCCTCTGGTGCAGGAATGGAAACTCAAACGAATCACCGA 
TCAGTGCCAATTGCAGCTCGACGACCACGAGCGCACCTACAAGCTGACCAACAATTTTGGGTACGGCTATGCTAGCGAATACAGCGGACCGCTAG 
ACCTGAGTGAGGTGAGTTCAATTGGTTAACCTGCTCTTCCGCGCACTAAACACAGATGTTTTCACAGTTGGACAGAGCCACGTGCCGCGTGTCGG 
AGCCACATCGCATGAGCCCCATCCAAAGGCGGCAAGAGCGCGTGGTCGATGAGATGAAGCGCTTTTCCCGCGAGCAGTACAAAATAAATTTTGTG 
GAGCTACAGGTTCCGTCCGGTCACCAGAATCCTTTACGCTACAAACCGAAAATCTACGAGACTTCCATGACCCAGGAGCAAGCGCATCTGCTGCA 
CAACATCAGCCGAATGCAGACACGGGAGCAGCAGCCGTGCGACGTTTTTCGGCAAAACGAGATCGATTGCGGGCTGATAAGCATTCTGTTGGCCA 
TATGCTATGATGTGCGGACGACGAACAATGAGCCGACGTGTGAATCGGGCTGGACGCGAAGCATACTGTGCCCGCTTTATTGCTACTTTGAGCAA 
TTCGACAACTATCGGGACGTACTGGTTGCCTTCCTGCGGCGAATGATCACCTACCCGCTGTACCGCAATTTCGAGTTGGGCCGCCAATGTGTACG 
GGATGCCATAGAAGTTTTAAAGGGCGGTCGCAACTGGTTAATTAATCAGCTGCTGCTAACGCACCAGCAGTTCGCCAGCAGTGATCCCAGTCGCG 
ATAGCTTCAACAACTACTATTTGGAGGACTATATACGATATGTGACCAGCCCGAGCGCCTGCAGTGATGAGCACATGCGCCTTCTTGCCCGCAAC 
CTTAAGAACGTGCTGATGGACGTAAACAAGCAACACTTGGGCCTGGGAGTGACCGAGATTGAGACTGAGCTGATCAAGGAGCTGATGCAGGAGAT 
GCGCATCGATGCGGGAAGCCAGGGTTCCTCGCCGCAGCAGCAGCATGGCGATGATGACTTGGACCTTGATAACGATACTTCTGGACAGGAGGACG 
AGACTACCACTGATGATGAAAGCGTCATAACTGACTCCTTCTACAGCGTGGACATGGACATGCGACTGATAGAGAACGAAATTGTATATGAGGAT 
GACGAGGAGGACGAAGATGACGATGAAGATGGAGATGATGATGAAGATGGGGATGGGGATGAGGATGAGGATGAGGATGAAGACGAAGACGACAG 
CAGTTCTTCCACAACCACCAGTAGCGAGGCGGAGGGAAACAGCGTCATCGAGCAGTGCAGCAACAGTGAGACAGCAGCCAGCACCACATAAATTG 
CCACTTATGTCGAGTTCGGCAAGTCCTGCTCCTGCCTTGCCCAAAACAAACAAGCAACACAACCAACTTAATGTCAAAGAACGGAGGCTAAACAG 
TGATTACTTTTAAAATTAAAACAAAGTAAAATACGAAACTAAAAACCAAGCCAAGTAAATGATGTAAGTTTACACAAGAATACCCAAAAAAGAAG 
AACTTCTTATATCTTAATTGTACAATAATTTAACGAAAAGCAAACTCAAAGATTTGTTCACGCCAAATGCGAAAAGATATTTGAATATTTTTTCC 
CCTTTGCTCTACAATTACTTCGATTATAATTACACATACAAGTAAATCAGAACTTTAGCATTTAGTATGGCATTAGAGTATGAAACATTTGAATT 
TTGAATTC TGAAAAGCGAAATC T AGT TGT GC ATT T GTATAT ACATCT ATCGAT ATTGT TGGTGCT GC AAGC AAAGAAAAC AAAAT AT GAATAAAA 
TAAAATGGAAATGGAAAACAACCTCCAAACTTATCTACACAGTGGGCAAAAGTCATTAATGATTTACATGTTAATGTTGTATGAGTGTCTTATAA 
AAAGAAAGACATGAAGTATGAACGAAAATTCTAATAAAATGGATAATACATATTTTTTTAAAAACATCGCGACATGAGTTCTTTATTATTTTATT 
CTTCGTATCAACCAAGCTGTCGTTTAAAAMGTGACAAATAGGTAGGTAAGCTTTTTGAAGGAGAAGTTATTTTTCATTTTACTTACAAACATTT 
AAGTATCAGTATTTAAAATGCATATAAAACATACTGTTACTTTTAGTACGCTTAAATCAGAAGATCAGATACTTTGTATCTTCAACACAATTTTG 
ATGGTTCAATCCGATAAGAAAAACATTTGCTTGGGTTTTATTATCTTTTCTTTATTTATCCATACTCACATACATACATATATGCATATCTATGT 
ACGTTTTAAAAGCTAATTTAAATATTTATGGACTGTTAATTGAAGA 
(SEQ ID NO: 1105) 

Exon: 1001.. 1324 

Exon: 1420.. 1816 

Exon: 1873.. 3036 

Start ATG: 1059 

Transcript No. : CT28305 

AAATTATACTAAAAAACAGTAGTTAGCACAATGAACAGTCTCGTAATAAAATTGTAAGATGTTGAATGGACAGATTGCACACAGTTATTTGGAGA 
ACTCGTGCACGCTGATCATACCCCGCCCGCCTGTCCTGAATCAGTCCTCAAAGTCGGCCCAGCCAGCAGCAGCTGCTGCTGCTTCTTCTTCCGCT 
GGTGAGGAGTTTGGTCAGGAAGCGGGGTCGGGGTTGGCGGAAGCGGAGGCGTTGCATGTGGATCGCATAAGAAACTGCGATATATTCGTTGACGC 
GCACGATCTTTATTTCTACGCAGATGTTAAGCTGTACAAATTCCACAGCCGTCGCACCTTCGACTTGCGAGAGCTGAAGACGCTGTTGATAAAGT 
TCAACACGACAGAGAATCACTACCAAATTTGCCTGCGCTGTCTGCCCCTAACAGCGGGTGCCCAGGAAGGACCCGGTGGAGAAAGAGGCGGAGGC 
GGTGGTACGGGGGATGCAGCCAAGCAGTGGAATAGTCGCAAAGAGTACATAGCCGCATTCCTGCACCTACCCACACTGAGTGCCAATTTGCCGGA 
CAAGCAGCCTCTGGTGCAGGAATGGAAACTCAAACGAATCACCGATCAGTGCCAATTGCAGCTCGACGACCACGAGCGCACCTACAAGCTGACCA 
ACAATTTTGGGTACGGCTATGCTAGCGAATACAGCGGACCGCTAGACCTGAGTGAGTTGGACAGAGCCACGTGCCGCGTGTCGGAGCCACATCGC 
ATGAGCCCCATCCAAAGGCGGCAAGAGCGCGTGGTCGATGAGATGAAGCGCTTTTCCCGCGAGCAGTACAAAATAAATTTTGTGGAGCTACAGGT 
TCCGTCCGGTCACCAGAATCCTTTACGCTACAAACCGAAAATCTACGAGACTTCCATGACCCAGGAGCAAGCGCATCTGCTGCACAACATCAGCC 
GAATGCAGACACGGGAGCAGCAGCCGTGCGACGTTTTTCGGCAAAACGAGATCGATTGCGGGCTGATAAGCATTCTGTTGGCCATATGCTATGAT 
GTGCGGACGACGAACAATGAGCCGACGTGTGAATCGGGCTGGACGCGAAGCATACTGTGCCCGCTTTATTGCTACTTTGAGCAATTCGACAACTA 
TCGGGACGTACTGGTTGCCTTCCTGCGGCGAATGATCACCTACCCGCTGTACCGCAATTTCGAGTTGGGCCGCCAATGTGTACGGGATGCCATAG 
AAGTTTTAAAGGGCGGTCGCAACTGGTTAATTAATCAGCTGCTGCTAACGCACCAGCAGTTCGCCAGCAGTGATCCCAGTCGCGATAGCTTCAAC 
AACTACTATTTGGAGGACTATATACGATATGTGACCAGCCCGAGCGCCTGCAGTGATGAGCACATGCGCCTTCTTGCCCGCAACCTTAAGAACGT 
GCTGATGGACGTAAACAAGCAACACTTGGGCCTGGGAGTGACCGAGATTGAGACTGAGCTGATCAAGGAGCTGATGCAGGAGATGCGCATCGATG 
CGGGAAGCCAGGGTTCCTCGCCGCAGCAGCAGCATGGCGATGATGACTTGGACCTTGATAACGATACTTCTGGACAGGAGGACGAGACTACCACT 
GATGATGAAAGCGTCATAACTGACTCCTTCTACAGCGTGGACATGGACATGCGACTGATAGAGAACGAAATTGTATATGAGGATGACGAGGAGGA 
CGAAGATGACGATGAAGATGGAGATGATGATGAAGATGGGGATGGGGATGAGGATGAGGATGAGGATGAAGACGAAGACGACAGCAGTTCTTCCA 
CAACCACCAGTAGCGAGGCGGAGGGAAACAGCGTCATCGAGCAGTGCAGCAACAGTGAGACAGCAGCCAGCACCACATAA 
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(SEQ ID NO: 1106) 



Start ATG: 59 

MLNGQIAHSYLENSCTLIIPRPPVLNQSSKSAQPAAAAAASSSAGEEFGQEAGSGLAEAEALHVDRIRNCDIFVDAHDLYFYADVKLYKFHSRRT 

FDLRELKTLLIKFNTTENHYQICLRCLPLTAGAQEGPGGERGGGGGTGDAAKQWNSRKEYIAAFLHLPTLSANLPDKQPLVQEWKLKRITDQCQL 

QLDDHERTYKLTNNFGYGYASEYSGPLDLSELDRATCRVSEPHRMSPIQRRQERVVDEMKRFSREQYKINFVELQVPSGHQNPLRYKPKIYETSM 

TQEQAHLLHNISRMQTREQQPCDVFRQNEIDCGLISILLAICYDVRTTNNEPTCESGWTRSILCPLYCYFEQFDNYRDVLVAFLRRMITYPLYRN 

FELGRQCVRDAIEVLKGGRNWLINQLLLTHQQFASSDPSRDSFNNYYLEDYIRYVTSPSACSDEHMRLLARNLKNVLMDVNKQHLGLGVTEIETE 

LIKELMQEMRI DAGSQGSSPQQQHGDDDLDLDNDTSGQEDETTTDDESVITDSFYSVDMDMRLIENEIVYEDDEEDEDDDEDGDDDEDGDGDEDE 

DEDEDEDDSSSSTTTSSEAEGNSVIEQCSNSETAASTT* 

(SEQ ID NO: 1107) 

Classification: hypothetical 



Celera Sequence No. : 142000013384416 

GTGGGCAGCGATCTGCAGGTGAAGCTGTGGGACTACAACAGTGGTGCTGTGGTGGGCATCGGATCGGAGCACGCCTCCTCCGTGATCAGTTGTGC 
GTACAGTCCCTGCATGACCATGTTTGTCACCGGCAGCACGGACGGATCGCTGATCATATGGGATGTGCCGCGTGACTTCTGGGGCAGGCCCAATC 
CGCCGGATGTCACCAAGTATTCGCTACCAAAGACATCGTCCAAAACCAGGATGAGTGACGTGCCGTCGCGGGTGAACTCGGGCACCAATCTAAAG 
ACGACCAAGGGCGAGAACATCGATGGCCTGCTGAAGGCCACTCCCAAGGACGATCTCTGCTGCGTGGAGTGCCCGCCGTGCGCCAAAAAGGATAC 
CAAGGGCGTGGATCCCTATGCCGAGTGCGGAATCGTTCCCGATGTCCGCAAGTGCTGAGATTTTCCTTTCCAATTTCTTTTTGTCGTGAAATAAA 
GGCAGTTGTCCTTTCTAAGCTTGATTTAACTGAGATTATTTATGGGAGCTTGATTTTCTAGGATAGATTGGCCTTCTCTTTTATAAATCGTATTG 
TAAGTCATAAGGAGATATGGCTTTGAATAATAGTCTTATTAGATTAAATTGATTGATTGCTGAAGGAAACCAATTTCAAATTCAATTTAATTTGA 
TCAAATGAGTATTTGGTCACACTTAACAACTGTGTTGCATCAAACATGTTCGTCGAACCAATTTTCAAACATGAAAATGAGCAAGCAAAATATTT 
TAAACTTAAAATTAAGTCTCAATGAAACAAAGAATATGGTAGTTCATATGTTTATGAACTATTAATTTGTTTAGAAACGAAAGTTTTATATTTTA 
GATTCACAGTATGACCCTGCTGGCGTGTGGTGTAACGGGCGGCTCGCCCCGCGGTCACACTGAATTACTATTGGCCAGCCTTGTTGTTGTTGTTT 
TTCGTGTTCGTAAACTAAGAGTTCGTGTGCATGGATTTTTAGCCGCATTGGTAGTAGTTACTGTCGCTCCGCCGCCGCGTGAAGTCAGATTTCTC 
CGAGCGCTCAAACGAACAGCACGCGACATTCGCCGGAAAAAAAAACGAGAACCATAAGTTGTGGGAAATTCGCACAAGGGACAGAAGGAAGAAGA 
AGCAGCAGCAGACGAAAGGGACGAACGGCGAAGCAAAACAAATCGAGCGACGAATAGCGGTGTGCAACGAAATGAAGGTCTGCTGCATTGGCGCT 
GGTTATGTCGGCGGTCCGACCTGCGCGGTTATGGCGCTCAAATGCCCCGATATCGTGATAACTCTGGTGGACAAGAGCTCCGAGCGGATTGCCCA 
GTGGAATTCGGATAAGTTGCCCATTTACGAGGTACGTTCGCTTTCCTCCTTCTCATCGTTTGTATGTACATAGGAAGTGCCACTGTTCTGTGTGT 
GTGTGTGCTAGAGAGAGAGAGAGAGAGAGCGAGAGGGCACGATAGGTAGTGTGTGTGTGTGTGTGCTCACGCCTTAGCCTTGCAATTTGTGTTTT 
TTTCTGCTTTTCGGGGCTTCCACTAATCATAAACAAACAAGTCATCATCCCTCAATCAATTCCCGATCCGAACCATCGAGCACAAACTAAAACGT 
CGATGACCGCTCATATCCAAGTCGGTGAATACCCCACACCTCCCTCCACACTCCCCTTAAGTAAACCAGTTCCCTGCACACAGAAAATGCCAGAT 
CTATTTGGAGATTTCTTGAATAATTCCATCCTATGAGATGGATTACATGGTTTATAGATATACTTAGCATTTATTTTGTAACACATTGAAATACT 
ATTTGTAATTTAGCCATAAATAAGTTTGCTGCTATCCTGCTTGTTTTACTAGTAATCATTTGATATTTAGCCAGACCTACAAAACTCCATTGAGT 
TTTCGCGCAGTGTAGTCCGTCTTCACTTGAACAACGCTGCGTGTCCGGCATTCTACTAAACTTCAGGTGCTTAACGCACTTGGCCCACAACAAGG 
TCAATGGTCAGTCCATTAGTCACTCAAAATAAACCATGAAAAGCGCGCAGAGCCGTTTGATTGCTGAAATATGATTCGGAACGGAAAAGTAGCGT 
GGGAATCGATTTTACTTTCCAGACTGCGTGACCATCGGCGTCGCTCCATTAGGCCTAAAATTCCAGATACTACAAAATGCGCCAAACGAAGCCAT 
AAGGTGCAAGCGGATTGCCTCCCATCCACCACACCACTGGCGCATTTCCGTCCTGGCTTCCGTCCGCCCTCATTTCCCACTGTACATATGTCGAG 
CTACCCACTCTTCTTATCGGCTCTGCGCCAAAACAAAGTCTCGGGTCGAGATAAGCTCGCTGGGCATGGCGTTTCTTTTCTTATATTTTTTTTTT 
TTGGTCTAGCGAGCGCACATTAGTTGGAAATTGTCCAATTACCCACGTTCCCGACAAACAGCCAGTATCTCTGGTGAAAAAAAGCACGAATAGAA 
GTATGTACAATCAGCCAAACACTCAGTATATATGTTTTGTTTTCAATTAGACAATAATTTGGTATTTGTATCCTATAGCAGTAGTAACCATTATC 
AGT T AAAA A C C C A T AC AT T TT G G AAG AT GGT AT AGGG C AT AT AT T T AGT T CAT T C T T AT C GC C CT C CT AT T G TAT C C C T AAAAAT AGTT AAAAAC 
CCATTAAGAAAATACACTATTGACTGCAGTCTAGACCGTAAATGTTCTTACCTAGACCGAGTAATTCAAAAATATACTTTTCAAGATAAATTCCA 
CTTAATGAAGTTCGATTGTAATCAATACCAATGTTATGTACGCAAAAAAAAAAAAAGATGCTGTTGGGTGCACTTGTAACCATTAGAGGAGGCTG 
TCTAGCCAGTTTGCCATTATCGCTACTCACCAAGATTGCAGCACCACAAGCAAACAATTATGGCCAGCACTTGGAGTACTTCAATGCAATTTGCT 
GGCGATGGCGATGTCAGGGCGAGCATGTCAACAACATCCTCTTGTTTATGGAATTTGCACAATTGGCGTACTTTCCATTGACACCGAAATATTGT 
CAACATTTCCTAATCGTTGCACTCATTAATGTTCCATTAGCTTTTTCGCACATCGCAAAAACTTATCGGGCAAACAGTTGATATATTTTTCTGTT 
TGATTTGCTTTTCTATCTAAGTTTTCAGCTAGZVATTGTTTAGACTGATAGATTGTTTATCGCTTAGTCGTATCTCTCGATTGAAATGCGATGTAG 
AAAGACAGTTATTAACACCTGCCCTCCATGGAGAAATTCGTATAAATACGTATGCAAATTTGGCTGGGCACGCAAAACGAGTTCAATGAACATTC 
AAATCGTATCTGAAACGGAAACAAGTCTGGCTGAAACTGAATGCGTTGAATGAGAATGGTGAATGGGTGGGTGAATCAGGGGTTCGTGTGAATAC 
ACGTATTGCATAAGCGGCTTGGTCATGGGATAGGGTCTTCCATAGCTTCTAAATTAATCACTAAATGCCAACAAGTGGACCACTCCAAGTGCGAT 
GGCAGCACGTAAACAGAAGTAGAGCAAAGTTAATTGTGAGATAGCTTTCTTGGGAAAGCCAATCGGATAAGATCGACGTACAAAGTATATATATA 
TATATATATCCATACAGAATGGTTATAATCTTATCGGACGACGAAGTGGGAACGTCACCAAGTGCTCAGCTTTGGGCGCGGGTTGTCGTGATTGG 
CATTTTAAAGCACTGCAAAGCACTGACTGCAAACTAGGCCAACAACTTGTGGAATAACTTCTAAATTATATTAATGAGGCAATTTGGTTCTTCTT 
TGTGGCAGCCCGGCTTGGATGAGGTGGTGAAGAAGTGCCGCAATGTGAACCTCTTCTTCTCAACGGACATCGAGACGGCCATCAAGGAGGCGGAC 
CTCATCTTCATCTCGGTGAACACGCCCACAAAAACCTGCGGCAATGGCAAGGGTAAGTACTAAGTAAACAAATATAAAAAAAATAGAGAAATTAT 
CATATAATATAGCATTATTCCAATTTGAAGGCTACAATCTGTTGATCATAAAGCTTCACAAGCTTGAATTATGGCTAATATCATGTCATAGCTTA 
AAAATAACTGCCTATTAAAGCATAGATTTGTTAGTTAAATAGCCATAAAATAACCACAAATAGCTAAGCTAAGCTAACATAAATAATTACAGCAT 
GAGTTAACACTTGAAAATATTAATTCTAAAAATAAAGAGCATAGTGCGAGTTTAATCAGAAATCGTTTGATAATCACACGCCACTAAAATCAAAT 
TAATTGAAAAAGTAATTAAAGTCATCAAAATGTTGAACATTTCATGATTTCAAACACACGTTTATCATTGCTGGTGGAAAAATACTTTAAATTAG 
ATCAAAAGTAAACATTGACATACTTAAGATCACTTAGTAATCCACTTTTTCAATCTGCAGGCCGAGCGGCGGACCTCAAGTACGTGGAGAGCGCA 
GCGCGGATGATCGCCGAGATCGCGCAGTCCAACAAGATTGTGGTGGAGAAGAGCACGGTGCCCGTGCGGGCGGCGGAGAGCATCATGCACATACT 
GCGAGCGAACCAGAAGCCGGGCATCCACTACGACATACTGTCCAATCCCGAGTTCCTCGCCGAGGGCACCGCCATCAATGACCTGCTAAATGCGG 
ATCGTGTGCTGATCGGCGGCGAGGAGACGCCCGAGGGCCACCAGGCGGTGGAGAAGCTGTCGTGGATCTACGAGCACTGGATACCCAAGCAGAAC 
ATACTCACGACGAACACGTGGAGCAGCGAGCTGTCTAAGCTGGCGGCCAACGCATTCCTCGCCCAGCGAATCTCCAGCATCAATTCGCTGTCGGC 
AGTGTGCGAGGCCACGGGCGCAGATGTTTCCGAGGTGGCGAGGGCCGTGGGTCTAGACTCTCGCATCGGTTCCAAGTTCCTGCAGGCGTCGGTGG 
GCTTCGGCGGCAGTTGCTTTCAGAAGGACATCCTCAATCTGATTTACATCTGCGAGAACCTCAATCTGCCCGAGGTGGCCGCCTACTGGCAGCAG 
GTGATCGACATGAATGAGTACCAGAAGCGTCGATTCTCGCAGAAAATCATCGAGAGTCTGTTCAACACGGTGTCGGACAAGCGGATAGCGATCCT 
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GGGCTTTGCCTTCAAGAAGMTACGGGCGATACGCGCGAAACAGCCGCCATTACCGTGTGCCAAACTCTGCTGGAGGAGGGCGCCGCGCTGGATA 
TCTACGATCCGAAGGTGGAACCCGAGCAGATCATCGATGACCTCACGCATCCCAGTGTCACGGAATCGCCGGAGAAGGTGAAGAAGGCGGTGCAG 
ATCCACAGTGATCCGTACAGCGCAGTGCGTGCCACACATGCGTTGGTCATCTGCACGGAGTGGGATGAGTTTGTTGATCTGGACTTTAAGCGCAT 
ATACCAATCGATGATGAAGCCAGCCTACATCTTCGACGGCCGCAAGATTCTCGACCACGAGCGCCTGCAGCAGATCGGCTTCCACGTCCAGACCA 
TCGGCAAGAAGTACCAGCGCACCGGACTGCTCCGCTCGTGGGGCATTGTGCCGCAGCTTTAGGCGCGTCGGGCGGCGGAGGTGGTGGCATTTACA 
CTTAAAGTTGGCGAAATTTCCAGTATTTGCGAGACTAAACTGTAAATATCCGTATTCTTTTCTGTAGCTGAGTGTATGTTAGATTGGGTCGATTA 
AGCTTAAGTTGAATCATGTTTAGGAGAATAGAATCGTTTCAAATCATCAGAGGACTCTGTCAGTTTTAAGTGAAAAGTAAAGTTGAATTCTTGCA 
TTTTGTATATTAGTTTCTAAAGGATATAAGCTTTTAAAACATTTCTGTATATAACTAGAAAGCGAATTACTTATTTGAACAACATAAGGCAATGA 
TTGATTAACCACAAAGTTATAAAACGTAGTAACTATTTTTGATCAGACAATATAATATTATGTAGCTTTCCCCATATCGACCCAAACTAACGAGT 
GTGTTTCTAAATAGTTAAAGTATTCAACTAGGCACACCCGCAAGGGAAACCCTCGAACCGTATTTGTATTCATATTCCTATTCCTATTCCCCTGT 
CGATCCGCCACTCGAGGATCGGTAGTTGCAATATTAAGTCCCAGCAGGACAATAATCCCCTCCTGCGGCGGGCCAATAAAAACCAGCATTTACAC 
GAGAGCATGTAAATTTTTATAATAAAAATAACAACAGTGCGTTTGATTTGGGTTCCCTTTTTCAGAAGGGTATCCTCTTGACTATTTGGCGGTAC 
CACAAAAAGGAGGTGGCCGCCGATAGACTGAACCAGGTCACCAGGTACGACAGATGATCGTTGCGCAGGGTAACGCGTGTCTGTCCACCGATTGG 
AGCATGTGCAGCCGTTTGCGGATCGTAGACGGCGTCCAGGAACACAGGAGCTGCTCCTGTGGCGGCGCACATGCGCGCCAAGTCGCGATAGAGAT 
AGACATTGCCCTTGTGGTCGGGTGTGAACTGCGGACGCGCCTCTCCTTTGCGCACGACGGCGGTGAGCTCCACTTCCGCCTGTTGTTGGCCCAGC 
GGCCTGGTTTCCGGCTCCACTTGCTTGCGGGACACCCAGCCCCGATTGACGAGCACTATGTCGCTGGGGGCGAAACGATATTAAGATGATTCTAT 
GGGATTTCCTTGAGCACAACTTACTCTCTATCTGCCAGCTGAAAAGGGGTTACTATTAGATAGCCATTGCCCGAGTCGCGCTGCGAGAAGAGTCC 
TCCTTGGGTTTCCACGCCATCGGGCCGTATCAGCGATCTCGGACCCAGGCGCATCTCCTTGTCGTGAAGGAAACGTCCACGGATCTTGACCAGGC 
GGTACTCCATTTGGGCCAAGTCAGTGAGACTACAGGAGGGAATACACTATAGTGGGTGGGTATATAGTGGAGAATATGTGGGCAAACCTACTCGT 
CTGGTAGAGCCACAGGCGCCGTGCTCAGCTGCTTGTTGAGATCCTTGATCAACTGCTCCTTCCAAATCTTGCGCTTCACTTGCCAGCAGCCCAAG 
CCAAAAGTGGTGGCCGGTATGAGCTGGGGATTGGTTAATGGACAACAAGGATTAGTTGTGTGTATAGGTAGAGGAGTTGGTGTCTACCCACCAGG 
AGGAACCATCCCAAAGGAGCTATCTTCTCCTTGTCCTTCGCCGCCTGATTGGGAATACTGGTTGTCCAATTCACCGGCGGTCGCTGTTGTGTCAT 
CTTTCTGGTGATTAATCTCGGATTGTTCGCGGTGCCCGGGTTTCGGAATACAGTCTGTATATTAAAGGCCACTTGCCGGCACATTTGGTTCCCCA 
AACGTATCATACTTTCTTTTATTTTGCTCTAAATTTTCTATTTATTTATTTATTTACATTTCCAACTGGAAGTTGATAGTAAGACCGTTTTGCAC 
TGTTATGAACGGATAATAACTTCATGCGGTCACACCACTTCCGAATGTAAGCTCTTCTTGATTTTATGAATTAAGACTGCAAAGGTTTAAAGTAT 
AT AAAAAAT AAAGT AAT A GT T AT T AC T T T T C AC AT G C AAAAAAAAAAAC AT T T T T C T TT CTT T AC AAAGAT T T GT AT CTGT T AC T GT T AATC AT A 
TGTTAGCAAGTCACTGTTAACATGTCGTTATACCCAAGTTGGCAGCCCCGTTACGATCAGCTGTTTTAGTCTGTTGTCGCGTGCAATTGGATTTT 
TTAAACGATTTTGTTTTAGTAAATTTAGGATACATTTAAGAATGCAGTGCACTCGTGCAGTTGCAAGACTGGCCGGCGGCCTAGCGAGAACCGCA 
TGGGTATGTTTAAAGCAGGGGTTGTACTTTATTTAGTCAAAACACGGTCATCCGAATGCACCGCCATCAACTCTTATATAAACGGATATAAAATT 
TTTAAATTAAACATATTATAGGCAACACAACATACGTATAAGCACAGGCCAGCGATACTGGGACTGCTGGAGCAGCCATGCCGCTTGTGTTCATC 
C ACC GGC GT T GT GGATAATAC C AAT AAAAC GAAACC C GGC AC AG C GG AAGCC GC AG AAGGC GGT AT T CT G AAG AG AGT G CT CAAC AAAGT GGGCT 
TCACACCGAACACAAAGGCGGTAGGACTCGGATTACTTTGCCCATTTGATTAATATTCATCGAAGTGATCCCATTTCAGCGCCTCAAGGTAACCA 
GTCATCTGCTGTACGAAAGCGTCGCCGACAAGATCAACTATGTGACCTTTTTCCGCGACTTCAATCTGCCCAATACCTTTAACTCCTGGTTCTTG 
GTCACCGAACTGCATGTTTGGCTACTCCTGATGCGCTCCATGGCCGAGGGCTCCGAAACGGGCGAGGATGGACGCTTCCTGCGCAACTGCATTGT 
GGAGGCGATGTGGGGAGATGTTAACACACGCGCCAAGAAATTGGGAGTAGGTTTTACTTTAGTTCACCTTTATATATATAATCTTTATGCTCTTC 
ATATGTGTTTCCCAACAGGCTCACAATCCTTCACGCACCCGTCAGCAAATCGAGACCTTATCCGAGCAGTTCCAAGCGGCCCTAATCGCCTACGA 
CG 

(SEQ ID NO: 1108) 

Exon: 1001. .1361 

Exon; 3809.. 3947 

Exon: 4431.. 6194 

Exon: 7542. .7552 

Start ATG: 1212 

Transcript No. : CT28337 

GTAGTAGTTACTGTCGCTCCGCCGCCGCGTGAAGTCAGATTTCTCCGAGCGCTCAAACGAACAGCACGCGACATTCGCCGGAAAAAAAAACGAGA 
ACCATAAGTTGTGGGAAATTCGCACAAGGGACAGAAGGAAGAAGAAGCAGCAGCAGACGAAAGGGACGAACGGCGAAGCAAAACAAATCGAGCGA 
CGAATAGCGGTGTGCAACGAAATGAAGGTCTGCTGCATTGGCGCTGGTTATGTCGGCGGTCCGACCTGCGCGGTTATGGCGCTCAAATGCCCCGA 
TATCGTGATAACTCTGGTGGACAAGAGCTCCGAGCGGATTGCCCAGTGGAATTCGGATAAGTTGCCCATTTACGAGCCCGGCTTGGATGAGGTGG 
TGAAGAAGTGCCGCAATGTGAACCTCTTCTTCTCAACGGACATCGAGACGGCCATCAAGGAGGCGGACCTCATCTTCATCTCGGTGAACACGCCC 
ACAAAAACCTGCGGCAATGGCAAGGGCCGAGCGGCGGACCTCAAGTACGTGGAGAGCGCAGCGCGGATGATCGCCGAGATCGCGCAGTCCAACAA 
GATTGTGGTGGAGAAGAGCACGGTGCCCGTGCGGGCGGCGGAGAGCATCATGCACATACTGCGAGCGAACCAGAAGCCGGGCATCCACTACGACA 
TACTGTCCAATCCCGAGTTCCTCGCCGAGGGCACCGCCATCAATGACCTGCTAAATGCGGATCGTGTGCTGATCGGCGGCGAGGAGACGCCCGAG 
GGCCACCAGGCGGTGGAGAAGCTGTCGTGGATCTACGAGCACTGGATACCCAAGCAGAACATACTCACGACGAACACGTGGAGCAGCGAGCTGTC 
TAAGCTGGCGGCCAACGCATTCCTCGCCCAGCGAATCTCCAGCATCAATTCGCTGTCGGCAGTGTGCGAGGCCACGGGCGCAGATGTTTCCGAGG 
TGGCGAGGGCCGTGGGTCTAGACTCTCGCATCGGTTCCAAGTTCCTGCAGGCGTCGGTGGGCTTCGGCGGCAGTTGCTTTCAGAAGGACATCCTC 
AATCTGATTTACATCTGCGAGAACCTCAATCTGCCCGAGGTGGCCGCCTACTGGCAGCAGGTGATCGACATGAATGAGTACCAGAAGCGTCGATT 
CTCGCAGAAAATCATCGAGAGTCTGTTCAACACGGTGTCGGACAAGCGGATAGCGATCCTGGGCTTTGCCTTCAAGAAGAATACGGGCGATACGC 
GCGAAACAGCCGCCATTACCGTGTGCCAAACTCTGCTGGAGGAGGGCGCCGCGCTGGATATCTACGATCCGAAGGTGGAACCCGAGCAGATCATC 
GATGACCTCACGCATCCCAGTGTCACGGAATCGCCGGAGAAGGTGAAGAAGGCGGTGCAGATCCACAGTGATCCGTACAGCGCAGTGCGTGCCAC 
ACATGCGTTGGTCATCTGCACGGAGTGGGATGAGTTTGTTGATCTGGACTTTAAGCGCATATACCAATCGATGATGAAGCCAGCCTACATCTTCG 
ACGGCCGCAAGATTCTCGACCACGAGCGCCTGCAGCAGATCGGCTTCCACGTCCAGACCATCGGCAAGAAGTACCAGCGCACCGGACTGCTCCGC 
TCGTGGGGCATTGTGCCGCAGCTTTAGGCGCGTCGGGCGGCGGAGGTGGTGGCATTTACACTTAAAGTTGGCGAAATTTCCAGTATTTGCGAGAC 
TAAACTGTAAATATCCGTATTCTTTTCTGTAGCTGAGTGTATGTTAGATTGGGTCGATTAAGCTTAAGTTGAATCATGTTTAGGAGAATAGAATC 
GTTTCAAATCATCAGAGGACTCTGTCAGTTTTAAGTGAAAAGTAAAGTTGAATTCTTGCATTTTGTATATTAGTTTCTAAAGGATATAAGCTTTT 
AAAACATTTCTGTATATAACTAGAAAGCGAATTACTTATTTGAACAACATAAGGCAATGATTGATTAACCACAAAGTTATAAAACGTAGTAACTA 
TTTTTGATCAGACAATATAATATTATGTAGCTTTCCCCATATCGACCCAAACTAACGAGTGTGTTTCTAAATAGTTAAAGTATTCAACTAGGCAC 
ACCCGCAAGGGAAACCCTCGAACCGTATTTGTATTCATATTCCTATTCCTATTCCCCTGTCGATCCGCCACTCGAGGATCGGTAGTTGCAATATT 
AAGTCCCAGCAGGACAATAATCCCCTCCTGCGGCGGGCCAATAAAAACCAGCATTTACACGAGAGCATGTAAATTTTTACAAAAAAAAAA 
(SEQ ID NO: 1109) 
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MKVCCIGAGYVGGPTCAVMALKCPDIVITLVDKSSERIAQWNSDKLPIYEPGLDEVVKKCRNVNLFFSTDIETAIKEADLIFISVNTPTKTCGNG 
KGRAADLKYVESAARMIAEIAQSNKIVVEKSTVPVRAAESIMBILRANQKPGIHYDILSNPEFLAEGTAINDLLNADRVLIGGEETPEGHQAVEK 
LSWIYEHWIPKQNILTTNTWSSELSKLAANAFLAQRISSINSLSAVCEATGADVSEVARAVGLDSRIGSKFLQASVGFGGSCFQKDILNLIYICE 
HLNLPEVAAYWQQVIDMNEYQKRRFSQKIIESLFNTVSDKRIAILGFAFKK1SITGDTRETAAITVCQTLLEEGAALDIYDPKVEPEQIIDDLTHPS 
VTESPEKVKKAVQIHSDPYSAVRATHALVICTEWDEFVDLDFKRIYQSMMKPAYIFDGRKILDHERLQQIGFHVQTIGKKYQRTGLLRSWGIVPQ 

(SEQ ID NO: 1110) 

Name: UDP-GLUCOSE 6 -DEHYDROGENASE 
Classification: enzyme 
Gene Symbol: sgl 
FlyBase ID: FBgnO010851 



Celera Sequence No. : 142000013384672 

GAATTTTTAAAAATTATTTTATTTTATCATATTGCTACGAAATTGGCAAAAACTACCCTAATATGTACTATGTAAATTCATTTCTTCGATCAGAA 
TTGATTTCGGCCCGAAAATCGTCTTCTAGCACAACACGCACACATATACAGAGTTCTCGACTCTTGTTTTTACTCACACAAGCAAGCAAATTAAA 
TTTTTAGATTTCTTACGCTCTCAGCGGGAGTGAGCGGAAAGAGAGCAATTTTGGCCGTCGCTAAAAAATTGGCTGCATAGTGCCAAACCAATGTA 
TGGCCGTTACGCATCTTGTTATTCTAGTGTCTTTGGTCTAATCCTCCAAATAGATTTCTTTCTCTCCAGGGTTTTTTAGCTGTATATGGATTTAC 
CTCTCTGTATAAGAGAAGATTGAAGCTGAAGGAATTCGAAACCATTACCAAAATGAAACAATTGAAGACATAAATTGGACGCTTTTGTGATGCGT 
CTTGCCTGTGTCACAGCTGAAGCTGATGGCATTCTTTATGTATTTTATAATATATATAATGTATATGTATACTAGATTATTCATTGTATAAACTA 
ACAACATCATCAGTATCAGTATCAGTTAGTTAACTCAATGTGCAACAGCCGATGATAGGCGTACACTCGTCGAACGAAGAAAATACACATTTCCA 
ATCGAAGATCATCCGAAAAAATGAAAGAAAACTTATAGGCGATGTATAATTAAAGAAATTGCAAACTGTCTGCTTAATTGGTACTTGGTAGTTTG 
TAGGTGGTGCGTATGTGGTGTTCATGTGTTTCATGTGTTATATGAAGGACATGTGGGTATTTGTGGGTGTCTGTGGAAGGCGCTTCGTGTATCCT 
TGTTTCGTTGAGCAACTAATTTAGTTCAAATAGCGAAAACAATTTTGAAGGCGGCGGCTAGGCGACAGGCAAACAAATTATGCAGATTGCGAGCT 
GATATATTTATGTGTACGTAGTGGTGACTCCAAAATCCCTGCCAGCTTAGCTAGCCCTCGATCTGAGATCGCAGGCGGGGCCGTGGCAAGTCCCC 
AAAAACAGCCTCGCCATGGACACTTCTGGTCAGCGCTTGGATGCTTGCCGCTATGTCGATGTTGTCCATTCCCTGACAAAAAAGAAATCAGCCAT 
TAGTTTCGTATGTTTTTTTTTTTACTGTTAAATACTGATCTCACCTCCTCGCAGCCCTGCCCGAAGTTATTGTTGTTGGCCACAGCAGCTCTCTC 
GGC C AGC GT GGT GT TTGCTATCTC GAT GGGC AG ACT C CGGGC G AAGTT C AT C AAC TGC GC ATGGC T AGG ACG C CC TCCGCCTC GC AC T GGT AT GG 
AAATGGAGCTATCCGGGTCCAGAGCATCTGTAAGTTGGAAACATATTGGGGGTAAATAAAAATGCTAGTGACGTACAATTTTCTAGAACTCACCT 
TCAGCTTCATCCGCAGCTTCGTCATCTAGGACACTTCGTAGGCCATTTTGTTGGCTCAGCTGCTGGAAGGGATTGTGCTGAGGCTGCTGCTGGTA 
GATCAGGCTACGCGTGTAGCTAGGCACAGGCACGGACTGAACCGGAACCGGAGCATCCACATCCTCCAGATCGAACAGGCAGTCATCTGCATCTT 
CGTTAACACTGGACTGACCCACTCCATTTGGGGTCAACGTCTGGGTGGGTGGACCACCAGCACCGCCGGCTGCTGATTGCTGCCGGAAGGTGGGG 
CTGTTGCCCACGCTCATGGGCGTGGCTTCTGGGGTGGCGGGCGGTGTGTCAAAGTGCGACATCTTCCTGGCCATCGAGACACTCTGCGGGGCCTG 
CTGCTGCTGCTGCTGTTGCGCCTGCAACTGTTGCTGCTGTTGCTGCGCCAGCAACTGTTGCTGCGGCTGCTGGTTCAGCGATAGGAAGTTGGGCG 
GATGTGTCAACGTAGTAGTGGGTGTGGTGGGTCCACCATTCAACTCCCGCCGGCTGCTGATGGTGTTCCTGCGCCGGGCAGTCGCACTCTGCGGC 
TGATTGGCCGCCACATCAAGGGCAGGGAGCATGGCACGATCGAGTAGCTCCGAGGCCTGCTCGGGCAACGCACGAATAACTTTGAATAGCAACGC 
GCAGTCCTGATCCGATTTCTCCCGGAAACTGTTTAGTAACTGGAACTGAGTGCGCGTATACCGCTCGATTCTCTCATTTGTCTCCACAATCTCCT 
CCTGCAATCGCGCCTGCTGGTGCGCTTGTATCATTCTCAATCTCTGCACCCTGGCATCAGTGGGATTCGAAGACAGGGATCCGATGCTTAGACTC 
GATGCGGAGTTGGATAGTCCACTGGTGGGTCGAGTGGATCCAGAGTCACTGCCTGTGTGATCCATGATCAGCAGCTCGAAGGTCTCCGAGAAGGC 
TTTGTTGGACTTCCGTTGAGCCAGCTCCTCGCTGCTGGTCAACAGCGTGCAGTTGATGAGGTAGAGAGTGGGCGTGGCCCCGGCGCCCGCCGTCA 
ATCGCTTGGCGCATAGCACCTCCTTGCAGTTGATGCACAGGCTCAGCTGCCAACTCTGGCCAGCGGCTTCCAGGGTGAGACCATACAACAGCTCC 
CGCTGATTAATCACAGCCACGGTGAGGTCTGGAATGGGACCATAGGCCTGAAACGGCAAGAAAAAAGTTGAATATTATTGCTTGTTTAATTCTAT 
TCTATTATTTAATTCCTTTATTCCTTTTTGTTAGATTTTTTATCAACCATACTACATGGTTATTGATTTGGGTATCAGGCTTTCCCTGGGTGCAA 
TCCTCACATTCATATTAAATTGGATTACCTTTAGTCAACAAGTTGATAACACTTTCCACACAGTCAGTCTGGTCAATGTCTGATAAAGAGGGATT 
ACCAATTCGCCCCATTATTATCTGCACTTTTGATCACAACAGAAAAGCCAGCCTTTCTTCGATGTTTACCCAGTCAACAACCCCGCCGCACAGTA 
GTTCAAGTGCAAATTTGTCAATTATGTTCATTTCGAGTGGCGGCACCCGATTGTGTGAAGACTTTATCGGAAAACAAGAAACCGCTGATGAACAC 
GAAAACAACAAACAATTAGTGCAATGGCAAAATGATGTTGATGATGGGTGCCATCATATAGAGCAGAAGACAACTTGAATTATTCTGGCTGTCAT 
GGTGACAAACAGAAGATGTGATGGGTAAACAACAAGTTCATGATATCGCCGGAATAGATTAAAATAGTCTGGTTTCCAAATGCCAAAATAGAAAC 
AAACAAGTTTGCAGTCTGCCAATGAAACCTAAATATATCTATCATCATAATGCCCCGCCAATTTAGACACATAGTTATGCAAAATAATTCCGGCA 
TTTTTGGAATCGTAGCACGTGTTTGCCGACTGACACAGGAGAGAAAAACACTTTTTATAGACGTGCTTGGCTAAATTTGACACTCGCGTCCACCA 
CAATCAAGTAATATTCATATAGCTTGGACAATGGAGAGCGCAGGGCGAGATGTGGAGCGGGTGGGACACTGGACACAAGTCCACACAAGTGCACA 
AACAACCATTCGCACACGTAGATACACCGAATTAAATTGAATTAACGTACGAACTGCAAACGTCAACAGAAATCAAATGACAATGTTGCCCGACG 
TCAGCTAAGCGCTTCTGAATCGCATAGCATCGTTTTCGTTTTCCACATAACCATCCTCATCACTTTTTGCCAGGTGGACAAGAGATCATCTCTCT 
CATCTCTGATCCACATATCATTGAATAATATACTTTAATATATACTTCATATACATTCAATTAAATTCTTGAACTAAAACGAAATGTTTATCATT 
CGTTCAAAAGGCTGATATTGTCATAATTTTGGCTGTCAACTCAATTTCCGGCATTTCAAAACTGCAGCTGCATTCGTTCGTCTTTATTTAGAAAT 
TTGTTTTCGCATCCATTTTGGACAATAACAAGTTGCGGTCACACCCACTCAAAAAAAAAAAATTGTTTTCAATGCAAATTCCGAACTTGTTGTTT 
ACAACGAGTGCGGTAATATCTAATCAGGCCCAGTCAGTCGGTGGCAATAATTATACATATGAATGAAACCGAAAAACAGGCACTAAAATGTGACA 
CAGCAAATGCATTGTTGTCAAGTGTTTTGGCACTCGGCTGATATTATTGCCGGCAAACGCATAAACATATTTATGGGCTCAAGCCTATTTGCTCG 
GTTTAAAATTGCAGTTTTATGGCTGTTTTCGTAACCGCAAGACCCGACTGACTGGCTGACCGACTGACTTGTTGTTCTTGCTGCTAATTTGCTGT 
GCTGATACCCTGTAAGATGGGGATTTGGGCGCAATGAGTAATTATGCCTTTCTTCGATTGAGGAGCAGATAAGATAAGTGGTACGATTTTAGATT 
TATGATTATATCTGCTGCGAAGGCGTAATTAAGAAATCGAGAAGTTACGAGTAAATGCCAACATATAAGTAACCCAAAAATATCTTGATCACATA 
TAACCATTCCCCAAAAAATACAGCAATAAATATGGGTATTTAGAACGGAAAGTTTTCGCATCTAGGGTAACTGATAAATCGATCATTGCTGATTG 
CCATCCATCATCATTCATTCAGCATTTTTTTGTTTAGCGTTATTTATGGTTTTATGGCACAAGCTGGCTAAGCTTGAAGTGGGTGTTTTTGTTGA 
CTCGTTGGACGGGCCCCAAAACGCCCGACATTATCAGTTAAGATTTAAGACAAATCATCATCAGAGCCATCATACTGATTAGCTGCATTACCTCC 
TATGCAGATGCAGTTTTACCCTTTGCTTGCCAAACATTTTGCATGCATTTTTCGCCCCTGCTGCGAATCGCGATCTGCCAAATGTCAATGACTCT 
TCTCAGTGACTTTCAGACTGCAATGACAACAATTGAAGGCAATAGTTGCCTTCGTCGTTGTCGTCGTCGTTCTCGTCGTCGTCACTTCATTGGAT 
TGCTTAATTAACCTAACTGCCTGTTCGAATGATCGACTCTTGCTTAATTGAAATTAAATGTCTGCGCGCTGTCCTGCTAATTATGTGACTGTGGC 
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GCCACCTAGTGGCCGGAACCAGTGGCTGTTAGCTAATTAAATAACCGTTTACGTTCGACTTTGACGTATGACCATTTGCTTTATGGCCCAGGCCA 
GCCAAGACGGGCATTTGATTAATTGTAATCGCGTCTTCTGGCCGAAATAATAAATAAAACATGGCGTGTTCTAACTGGTAGTGGTTGGCCGAAAG 
AGATGCAAAATCGCCAAAGGAAACGCGCCGAGGAGCACGTAACTACTCAAGTCGCACGCTCTTCGCATATCGGAAATCAGTATGAAGCGATTTAA 
TTGAATCGCGCGACATTTAAGTAATGTATGCAATCTAGACGGAATCAATTGCATTGGTGCCAACTGTCGAAAACGTTGTCCAACTGTCAGCTTGA 
GTCCGTCGATCACTTAATATGCATAGTGGATAACTATAGAAAATCAAAATACTTTATTGCAACCAAAGACATGGTGTAATAATAAATGGATTCTA 
GAACGAAGCTCAGAAACTATGGCAATTCATTAGATTAGATTTATTGATCTCTTTAATGTGTTAATGCTACGAAAGCAACTGAAACAGAATTTTAC 
C T T C GT T TAT AT CAT TAT AT G T T T GT AT AT AAT T AAAT AAAC T GC AT AAT AT GGC C GT AAAGT C G ACT AT AAC T TG AG AGT T ATG T AT AAA T C A T 
GTAGCAATCTCGGACGGGAAGTACGAGTGCATTAAGATGGAGCTATCAAGTGATTCTGCTTTTCAGTTATGCTGCAAAGCAAATAATGACGTTTT 
GCTATCATTGAAAAACCCTTGCGTACTTTTCAATTGCCTGTATCTATTTTATTTGGATTTTTTCTGTATTATACAAACGAGCGAGGCAATTGAAA 
TTCATATCAGTGGGTAATGGTGTCTGGAAATAATGCATATTCCCTCGGCGCTGCTTATGCAACCAGAACGATTCCCATAAAACAAACAACTTTAG 
AGCAAAATGTGTGTAATAATTTCTCGCATTTTGCACATTGCTGCATAATTAATTTGTTGTTGTTGTGCGAGGCGACTGTGCAGTTTAATTGATTG 
AACGGCATTAATTAAGCCTGTAGTCTGTGACACTTGATGTGTCTAAATACTGGTCGCCTAACTAACTAACCACTTACTTATATTTCGATGCATAT 
CAAATCGCTTTGCTATTGACTTGGCGAAAAACAAAAACATTAAGCTGTGTCCAAAGGGATAAATTAGTAGTTTGTCAGACATATTATAAAATATT 
TTCACTTTCACAACGGCACAAATAAAGATGACATGCATAAAAAAGATACGAAGAGTAGCAACAAATTGTATTTTAAGCCTCTTAAATTATTCTGC 
TTTTGTTATTCTCCTGTGAGATAATAATTTGCTTATCAGTTGATCTCGTATTAGTCTGATATCACACTTGAAAATCTAATAGATAACAAAAGGGA 
GGGGATTGGCTGAAAATGCCGGGAGGGGGCATTAACCTGTTGCAAGTGCTGGCACCCTCGTTAATTACCAATAAATCAGATCGTCAATCATCCGT 
AGCTGGCACACCTTTCGCCGAAAAAAAGATAGATTGAAAGAGAGAAAAGAAAATAACAATCAACCGCCCGATTGACGCATTTTCCTTTGTTTTGT 
TTTCCATTTCTAATTCTATTTTCATTGGATTTTACTTTTTTTTATAGCTTTGAGATAACTCGCCCGTGTTGCGTTTCATCGGCAGAAACCGAAAC 
T G AG CC AAAAAG AAAAT AAGTTGT AAAC AAATC AAC TGGC AAAAGAAA C T A C GAAAAT AAT CT T TT ATAG AT T T GG AT C CCC CT C G AAAT GGGG A 
TGGCAAGGAGGAGGGGTTCGCTCTGGGGTGCGACTTGGTTCGCCTCGCTGCTCGCGAATAATACAATAATATAATACCCTCTACATCTCCCTCTT 
CCACTTCTTGCATTTTTTTCTTTTTTCATTTTCATTTTTTCTTTTTCGTTTTTTCTTTTTTTTTTCTTCATTTTTAGTTAGTTTTGCCTCTTTTT 
TTTTTGTGTCGAAAAAAAGAGGCAAACACATTTCGCTCAGATACGTCGCCGTTCGTCGTTGGCCATTATCATAAACAACTTTAGACAGCTGGGGA 
CACACGCGCTGGCTGGGCGTTTTCGCTCTGCGGATTGCGATGCCAATTTGTATTGCCGCGCTCGGTTTTTTTGTAAAAAAAAAAAAAAAATTTCC 
ACTACCAATCGGCGGGCTTGCGATTTGGCCAGTCCAGTCCATCTAGGGCTTTTGTTGTGCTCCTCCGTCCGTTTCTTGGGTTCTTTCTGGGCCGC 
CAATGTGAGTCACTCGATGGCCTAAGCCCCTACATCAAAGTGCCTTTGGGTCACTTGTTGGTCGAAACTTGTTTTCCCAGCCATTTAATGGGCCT 
AATATTTAGGGTTTTGGGTTCTGTTTTTTTTTTATTGGAATTAATTGTTTACCCAATTTAAAGGGCTTCGTTTTTGAATCGTAACAACGCATTTA 
TAGCAGATTCTCATGAAGAGCAGCTTAGCTATACAAATTGTATTTTAAAGAATTTAAAATGCTGTTAGTCTTTCGACTAGTATTCAATCCACTTA 
AGTTGTGGCTTTGTAAGCCTTAGTAAAAATTGATTATCACCCCAGCTTTTATTTAGATCGTACACGTATTACAAAATTAATTGCTGTCATGTTCA 
ACTTTCTAAACCAATTTAAGCTTATTGTTTTCCAAGCCCTCCGTCCCCTGCACCCACCTCATTCGCACTTCCCTAATAAAATGTTGCCTTATCTA 
CGAAAAAAAACAAAAAATAAAAGGAAAACTAGTAAAAAAAAAAGCGTATCATTCAATGTCGTTTGAAAACAAAATCTGATAACGCCCCAGACCCA 
AAACATCGCATATAGTCTCGCCAGCCAAAGTGCCTACCGCACGCTCCCTCCTCCCCCTACCTCTCCACCGAGCCCTGCTCCCAAAAGATATATAT 
TTCAATCGGGATTAGAATGAGGAACTGGAAAAAAGCTAGGGAAACGTGCACACGTGTGCTCCCCGCAGAAGGATTAAACTTTTTTTTTGTAAGTA 
TGTAAATAAATGTGCTGAAATGAGTCGAATCTCGATTGTCTACACATGAAAATAAATTTACCTATAAAATTATTAATTATTAAATAAATAATTAT 
TAAATAATATATGCTCCATAACAGTTTGTTTAAGCAACTAGACAAGAAAACATTATGTTTACTTAGCCAGTAAGTAGGTATGCTTTTATGAATAA 
CATCAGATTAGATTATCAAATCAGATAACTTTTTGGTGTGACCTTACTTTACGAAAATATAAAAAAAGTTAAATGGTAAAAGCTTATTTTATTGA 
AATCATAAATGCTCAGTTATTAATATATTATTAAAATCAATTATCAATAATTACGAATTTAATTTTAGGTATTTGACTTTTCAAGGCCCTTCTTT 
ATATATATATTACACGGTCAAATCAACTCATCGATTTCGCTCAGTGTGTAACAGTAAATTAAGCGCAACAATGTTTGCCCATCGGCACTGCTAAA 
CAGACTTCGAACAACGAAGTGTGCGAGGGGTAAAAAAGGGGTTGGGGGTTCTGGGGGGACTGCGAGGTACATAGGTGTGCCAGTGTTATCTAAAA 
ACTTTGACAAACAAATTTACGTTGGCGTGGATTTTGTTTGCCCAGCGCGCACCGCACAGCCGGCTACAGCTCAAATTTAATAAGCAAACAGCAAA 
GATGGCGGAATTCTCGGGCCTGCCCAACGTCTGCTGTTGTTAATTGTTAGTTTTATACACACTCACACAGCTCGTATAAATAATAATATGAATAT 
ACGGCCACATATGTACATTTGTACTAGAATAATGAGATCATCTCATCTCTAGCGATCCCAAAACGCGACCTGCGCCTATTTTCGGAGCTACCGAA 
TTATAGAGTTGTGATAAGACCTGCTTATTTGTGCAAACGGTCTAATTATAGCTTTTTTCCAGGCGTAATAATTACAAGACAGCCAAATATTGATA 
A GAC T ATC AAC AC T A AGAAAAAA T AAT AAT T AAAC T GT T T G AAG AAT AC TG AAG AAC AC AAG AT GT T GAT GAT C AAAAA CGG AC G TACT AG AAT T 
CATTTGTATAAGCGTTTTGAGAACAAGATTTTTGATTTCGATCTAAGAATTCTTCTACGTGCTCTACTCTTAATATCGTAAACTATTTGAATTTT 
TTTTTTTTATTTTCCACTCCTATTTGCGTATCAAATTTAGACCCCTTGTTTCGTGTTTATCCGAAGTAAACAAATCTTTGTGGCCGCCGCACATT 
TGACACAGTTTTCATTATTTATTTATTTCTCGCCTGGCCACTACTATTATTTCTATTTTTTCGGCTGCATTGTGTGTGTTGTTGTGCCTCAGAGC 
AAATATTTATGCGAAACCATCATAATTTGGGGGAAGCGATCAAAATTAACCCCTCACCCGATAAAATCACGCATGCTAAACGTGATCGACGTGAT 
CAAAAAGAAAAAAACAAAATAGAAAGAAAGAAAGAATGAAAAAATCAGAGAATCAACCCCCCGAACCCAAAATGCCACCACCCCTTTTGGCATTC 
TTCAACCCTCTTGCACACGCAAAATAAATTATTCGCGTTTAGATATTTTGAAAAGAAGAGGTGTTTTCTTTGCTTTATTTTTATTTTTATTTTTA 
GCCGGCTTGCGATTTCTCAATTCAAGAATTTATAGTTCTGTTTACACGAGAGCATTTTTGTACCATGTATGTATATGGCACTCAGTAGCTACATA 
CATGTACATATGTATGTATTATGTACATTTTACAAGGCAAGTGAAAAAAGGTTAAGAGTTTACGAGCAAGGGCCAGGCTCATTAGATTGGCCCAA 
AGATTGGTTCGCTGAAGTGTGTGGGCTTTATAACAGTACGGCAATCGACTGCCGCTGACTTCAAGGTTATATTTGTTAAAGATTAACATTGGTTT 
TCAATTAACCGACACGCGCGCTGCAAGCAGCTGGTCATATGTATATGTATGTACATAAATGTATTTATTTTTCCATATGTATTTATGTGGGGTCA 
AAGGGGTGGTTAACTGTGATTGACCAGCGTACTCTTCTGGGACTCTGAAGCTCTGACCTTGCCATATCATAATGGCAATCTCCTGCTTGGGATGA 
TGTGTTGCTTTCCAAAGGGTCAGTAAATGAATATAAGCATAAGCAATCAGTCTAAAAGAGATAATTGAACTTGGCATATCAGGTGAAAAGTATAG 
AATACAACCTCAACAATTTGCTTCCAATTTTAACTTTTTATTTTAGATGATAAGAGCCCGCCCGACTTTTTATAAAGGGCCATCTTAGAGAGTGA 
TAATATAGCCCACAGTTTTGATAAGAAATTTCTTTTATAAGCAAATATAATACCTTCCAGTTTGCATTATTATTCATTTCCTAGACACATTTTAT 
CATCATTGTTTTTTTTTTTTAGTTTTTTCTTTTTCGGCCCAGACAGCAATTAGGCCCTACTCGCCTGATTTTTGATAAAGTATTTGTGATTTCCC 
ACTTATCATTCCACAATTCCACTTACTAGTATTGTTACACAAACCCATGCAGCAAAACCGCCTCGATTCACAGACTTTACATTTGATAATGACTA 
TTTTTAGAGTACACAAACGTTGTCATTGCAGGCGCGCCTTATATTCTCAACCATATGAGGAAAATATAACTTAAACTGATTGATAACTCGTGCTT 
GGGATTTGCTCTGTTGATATTTGCCTTGTGGGTAATCGGTGGATTGGAAAAGGGTTTTCGGTTTCATAACGCGATTAACTGGGATAAAAATAAAC 
ATGGCCATTAGCATATGAATGCAGTTGTTTTGTTTTCTCTTGTTTTAGCTTTGTTCAACAAACTCGTTTCTCGACCCATTCAAAGATCGTGTATT 
GTTTTGATTTTCAATTAACCAATTTTCACAGGTGAGAGTCCCAACAATTGAAGAAGGGCGGAGGAGATTAAAGGCTCTAATCATAAATAAACCAC 
TTGGAAAGGCGAAACTGGGAGGCAGACATCGATTGACGACAGCGCGACTGGCAGCAATCGTTTTATACATATGTACTCGTATATATATACTCTAT 
ACTACTATATAGAAAAGTGGCAGGGAGTATCAGTATTTGTGTCGCCGCGTTTTTACCGTTTGATAACCTAGAAAAGACGCCTAGGTAGGTGTGAG 
TGTATGCAAGAGTGTATATATATATGTGTGTGTGTGGGCGCTGGACAATCAAAATTGATAAGGGGCGTGCGCACAATCACAATCACAGCTACTTT 
TACTGCTGGATTAAGAGGCGATGACAAACAAACTGCAAGATACATGAATTCATTTTGTAAACAACATACATAGTCCTAATTCTAGTCCATTCGCA 
CAGAAATATCCAAAAGAATGCGCAATAATTGTACATTCTTTTAGTTTACTGATTTATACAAAAAGTTCTAAAATGTTTAAACTTCATTTAATAGC 
TACCATAAATAATATGAAACACATATTTAATTCGCTTTGTAGACAATTGTAGCCATATATTATTCATCACAAAATCATAAAAGCATAATTGAGTG 
CTTTCAAAAACAATCATAATTACAAACAGATGTCACTTTAAGTGCGCGTTTCGAGTTAGCGAGCACTATTGTCAGTCAATTTCATTAGTGTTGCA 
CAAGTCATTAGGAAGTTCGAGTTGACTAAATTCAAGCATATAGCCTTGTAATGGCCCATCACACATCTTGCCCATATGTCGTCCGGCATTTCAAC 
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AAATTTTATGCATAAATTCATAATTTAAACATTTCGGAGCCTACGCAGTGGAGAGCTTTTATGGCCAGGGTTGGGGTATTTTCAGTGGTGGGGAT 
GATGGGGAGATGGTGGTGCTCCGACTTGTCAACTGGCCCATCAAGTGGGCCCGAAAAAGCCATTCGCCAGCAGTCCGCAAGCAGGCATTTATGCC 
TTAGTGACCCCCTACCCGGCTTTCAATACATACATACATATGTATGTATGTACATAATATATGTATAGTTTTTATGGCTGCCTTTTGTACGCCTA 
CACATGGCAAAACAAAGTAACATTGAAATGTTTATTAACCATGAAGAGAAACTACTAAGGCTAGAAAATGAAAATCAATTGACTAGAATAACTTA 
ATCAAGAAAAGAGCGAATTTACAGGGTATAGGATAGTACAAGTTAAGTGTTGCTTATTGAACATTCCAAAAGCGGACAGAAATACACTCTACAGG 
GTATGCGATAGTTCAAGTTGAGAATAGAAATAATCCGATTTTGCTTTCTTGATATGAAGTTGTTTTGATCCAAAGGTCTCCCAGCTGACCAATCC 
CAAAGGATTAACAATTTATTTATCTAGCTAGCAGTATGGCACTACTGATATCGTTTGGAGTTTCTTCGAGTGAGCGCCTCTTGGAGCAACTCCCT 
CTCTTTTGGGCGCCCAGTGGGCCTTTTGTAGCCCCCTTTCTTCCGCTGCCCGGCCAACTCCGTCGACTTTACGGCGGTCCAGTGCCCGGCCGGCC 
GCATTTGGAAATCTCGAAAACTTGTTTATCCAACACACACACTCACACACACCCTCACACACACTCGAAAAAAAATATACTGCCAAGAGATGGGG 
GAAAAAATTATAAACAAGTTTTTTGTTTTGTATTTCGGCTCGACCGTTGTTGTTCTTCCTCTGCCTGTGCCTTCGCCTCTGCCCCTCGCCACGCC 
CCGCTTGCCAACCCCTCGCGACCGCTCTTTGTGCTTTTTACTTCTTGCTTGTGTGTTTTGTCGCTGGTTTCGTAATTTATTCTAAAAGCGTTCCG 
TTTTGGCTCTACATAGAAGTGTATACATATAAACACACGAGCCTGCATACTGATTTATACATACAGACTCATACTTGTGGCATATACATATGCTG 
TGAGGGGTATTTTAATAACAGTTATACCCTCGATCGATGATATCGAAACTAATTTATCAGTTGTGTAAGATAGTTTCTTTTGATTTTAGATACCG 
CTGCTCAAAAACCTATAGAGATTTTACAAGATTCAATATTTGTAATATCCTAAAAGATTTATGTTACTATTATGCTTAATTAGATATGCTTTATT 
AAATTTTTAATGTCGACACTCATTTATAATACGAAAAATATATAAGAAGTGTTTGAAAGGTGAAGATAGCGTATCTGCAATTCGGTGTAGTTGCA 
AATTTGGTTTACTTTGTTTTCATTTGGTTTTTTGTTTTGTTTTGTGAGCTTTGAATTATTCAAAATAACACTTTTGGTCTATTTGTCCGCGTTGC 
TGTTGTTGTAGCACGCCTCAGCGCTTTTGAAGTGGGTGCAACACAACACACAAAGCAACAAAAGCAACAATAGCGGCTATAGCCGACGATCTGCC 
AACCGCAAAAACAAAAACTAACAAGTCGTTGCCGTATTGGACAATTCAATTTTGCATTACAATAAATACAGCACTCGAGTTACAAACGTGCGGTG 
AAGGTACTTTTGTGCAAGCACCGCGATAACCGATTTTCTAAAATATTAATGAGTCAATCTTCAATATATGTATGTATATTCAAAATGAAAAGATT 
T CAT T AC AT AT AT AC AT AT AT T TAAGC ACT C AGGA C T GCT TT AAG T TC GG AAC AT AC AT AT GC AACT GGT AAAAT C T AAGAAAT AAAAAT GAT C C 
TCCCGAGACCACGTCCACCCACTGGCTGTCAAACCTCGGGCCAGCAACTCCAGTGGCTGTCAAAATCTGACGAGTGGCGTGGTCGAAAGTGGGGA 
TAAAACGGGGACAGAGCGAAAGAAAGGGGGGATACTGAGATTTAACCTTGAAAAGTGTGGACAAGACAAGCGGCGATGACGCAGATATTCAGTTG 
TTTTTATTTTGTTAAATGGATAAGGTAGCACCAGGAACGAAAGATGGAAAAATTCGGCAAAACTTAATAATAACAACAATTACAGACATAAAAAG 
TGAGGGGGAGAGGTTCCCGAAAAAAAAAACTTCGTATACAGCGCGGTCCACCAAAGTATTATTTGCCTGCAAATGTCAACTACTCAAGAACCAGT 
TTACCTTCTTAATAAATACACTTATGATCCGTATTTATGCACATAATTTTAATATAAATATAATAAAAAGCAAATATTAACCATTCGATTATGAA 
CTTTAATAAATTCATAATTCATAATCGAAATTTTAAATGCCAAGATAGCTACTACAAATA2VACAACGTGATTATTTCCTTATATGCTATTTGTTA 
GTGGCAAAAAAATGTATTTAAAATAAATAATTGCCGTGCTAACTTTTCATTCCAATCGCTTTACCAAAAACATTATTATTTGATTTATTCGCAAA 
AAAAATTATTTTTTGCTGGTCTTGAGTTGATAAATTCAAGTTTGGGAATTTACCACTTGTTTATATCTTTCGACTAAGTTATGAAGAAACAAAAT 
G AAAAAC AACT AC AAT AAC AAT C AAGAAAAT ATG GGAAAG T T T T T C AAGAAAT C C CC AT T T T AC AAC AAAAG ACC AG AT ATTAATTTGTGTATTT 
AAGTAATGACTAATGACGACATCGGTGATCTATAAATTATACATACATATGTATACATACAGATACATACATACATATATAATCAGATAAAAATT 
AAAAAATACATACATATGCATATATATAAATAAATATGAATGCTTGTAAGTACTACCTGTTATTTTTAATAATTAAATAATTGGTATAACAAAAT 
ATTTCCAGGTGCATTTGTATAATTATTTTCTTGGTAGGGACACCCTTAAATTCTTTTACAAGAAGGACATAAATTTGCGACTTTATTAACCATTA 
ACACACAACTTCCGAATTCCAGAACTGGAATATTCCACTTCGCTTAACCCCTGAATGCCAACCCCTTTGCAACTTTGTGACGTTTTTGGGAAAGC 
GAAACCGAACTGAAAATTTACTAAACGAAATTATGCAAATTACTTGCAACTGAGGGTGTGACAAATGAAAAGCAAGAGCGAATAAATCATAAAAA 
AACGCGAATAAAAACGGGAAAAAGAAGGCGAGTGAAACAGAAGCTGACGCAGGAAACTAAAAATACCGGCGATCAGGTTGGCAAAGAGAATACCG 
AACCAGAACAGGGTGCTCCTGCTTGAAAAGGAGGAGCGAATGATAAACAGGTGCGGAACAGGAGGAGAGTGATAATGGTGGAAGAGGGGGGGAGG 
GGAACGCGAAGGAGCGGGCAGACCAAAAATAAACCAGAAAATGTGAGGATTCCTCACGAGTTTCCTATGAGCAAATGGATGACCCTATTTCTTCT 
TGCCCTTCCATATTCCCTACATCACTCAGTAGACCTTCTCCTTCCAATTCACGATTCCAGGCCTACATATGCACACACATATGTATGTATAGGCC 
TATGTTTGTTTTGTGTGTGTGAGAATAGGCGGCACAGGCAGAAACGGCAAAAAAAATAAAGCCGGCACACAAAACATAAAGAACGATAATAAAAA 
C AC A T AAAC AC AC AC T GAC AGCGC AAC AAC G C AAAT AC AGT GAAT T T CGC T T AAC AAT T AC T TT G AT T AAT GC AAC T AAAT AG AGTGT AC T GT T A 
GGTTACACCAGCTCAATTAAGGGGAATAGCATATTAAGCGAAAAACTTTTAATACTTTTAAGCTATAATATATGCTTTTCATTGTGTTTGCATTT 
TTTGTTAATAACAATGTATTAGCGGATTATTAATGCATTGCTTGAGCGAACATTGTGTTCTATTCCCCTGTGGTCCTTTTAATCGTATCTTCACT 
GTGTGCCCGCAAAGAGAGCGAAAGAGCGCGAACTAGTGTCCCAGCATAGGCCTTTAGCCCATTGTTGTTGCCTTTTATGCTGCGCCGTTGTTGCC 
TTTGCCTACTCCCCAATCAATACATATTCCGTCCTGCTCTGGCACGCTGGGCGATAATCGGTTGACGAACTATATAACGGCTATAGTTGAATACC 
GAGAGAGACTTACGTGCTTAAAGAACTCCATGATATGGCTGTAGAAGACGAAGTCCAGAGGATACTTCTGCTGGAACACCTGCGACAGGAGGATC 
GCGGTGGGTACCTCGCCGCTGGCGACCTCTGCACCGCCCACGCGGCCCCCTGCGCCGCTGCTGCTGTTGCTGCGCTGCAATTTCGAAGTCGAAGC 
AACGAAATTCGAGCATTTGCACGAGATCAACATGATTTTGTTATTGCCGGTTAGCTGATAACGGTAATTGATGCGTTGCTCTCTTTCTCTCTCTC 
TCTTTCTATATTTCTTCTCACTTCACTAGCAAAACACTTCGATGTTCTTCTTCCTCTTCCTAATTTATATCTTGCAGCGCTTGATAATAACTAAC 
AACGAGTCCTTTAAATCACTTTCCCGCTCAATGTTGTTCGATACTTCGAGTGCAATTCAAGAACCCCTCGAGTCAGAAGTCGACGGGCATTTGCG 
GGTATTCCGAGATTTTACGAGTTTTCTTCTGGTGACCGAAAACAATCGAACATAGCATACACCCACAACCGTTCAATTTTTTTGTCGACTGTCTC 
CGATATCAGGAGGTTGTTTGCCTGAAGCAATCGGTTATCGCAATTCAGTATGACCGAGGTGGAAATAGCCGGTCATTTTGGGAATTCCCTAAAAT 
ATCTTCGCTGAAACAGTTCTTGGTAACACCAGATGAAAGTGATCTGGCAACTCTGAAAGTTACCAATGTTAACAGAGCGTGCGGCTAACAGTGAA 
TAATATTATATCTAGGTGTTTAATTGCTGACATTAAATGTTATGTTCCAGGGATTATTATAAATAAGAATGTTTATTTAGCTTGCGCAAGAATTC 
CATATCCTGTTTCATTAACAAACTTTCAGTATCGTCATGGGATAACCGTGAAAAAATCATATTTATACATGTGGGGAGTTGCCATCCAGGTTTGT 

TAACTTGATTTTTTGAATCCAATAGTTAAAAGTCATATTGAAAAGTCTAGACTTCAACCATGTCAAATATATTATGCATCTGTCAAATGTCAAAT 
ACGTAAGTAATATGTATATGCATAAAATATTTTTGTTAATTGCTTCTACATGTTATTCATATCAAGAGGGAATCCTTAACTTTATTCAAAGCTGA 
AAATTTGCACAAAAAGATGGCTGCAAAATTGAAAAGTCTACACATTTTTCTTGTTTGCCAGAATTTTGATGAAAAGGAGCAAAAGGAATTCAAAT 
AGCTGCTTATGAAATCTTTTCATGTTGTTATTATTAATTATTATTGTAAGATCGAAAAGATAATGTTTTTATTTTTGAATTCTTATTTGAAAGTA 
ATATTATTTTTTTGTAAATTTTTATTATTTATTTTTATTTATAAATATAAGGATTAATTTAACAGATTAGTTTAAAAAGTGGAAATTTCAACCTG 

GGAAAACCCCTCCTTCGCAAAATTCAGCACCACGAATCCGAGGAAGACTCTTAGAAGAGAAGCCGCAACGAGAAGAGACCACTCGAAGATGATAC 

GGATAGAAGGACATAGCGAGATTCTTCTAGTATACCCTTTTAATTTACGGGCATAAAAATGAGTCTGGTGTGAATTTACGC 

(SEQ ID NO: 1111} 

Exon; 16561.. 15974 
Exon: 2612.. 1424 
Exon: 1357.. 1185 
Exon: 1117.. 1001 

Start ATG: 16183 (Reverse strand: CAT) 
Transcript No. : CT28455 
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ATTGCTTCAGGCAAACAACCTCCTGATATCGGAGACAGTCGACAAAAAAATTGAACGGTTGTGGGTGTATGCTATGTTCGATTGTTTTCGGTCAC 
CAGAAGAAAACTCGTAAAATCTCGGAATACCCGCAAATGCCCGTCGACTTCTGACTCGAGGGGTTCTTGAATTGCACTCGAAGTATCGAACAACA 
TTGAGCGGGAAAGTGATTTAAAGGACTCGTTGTTAGTTATTATCAAGCGCTGCAAGATATAAATTAGGAAGAGGAAGAAGAACATCGAAGTGTTT 
TGCTAGTGAAGTGAGAAGAAATATAGAAAGAGAGAGAGAGAAAGAGAGCAACGCATCAATTACCGTTATCAGCTAACCGGCAATAACAAAATCAT 
GTTGATCTCGTGCAAATGCTCGAATTTCGTTGCTTCGACTTCGAAATTGCAGCGCAGCAACAGCAGCAGCGGCGCAGGGGGCCGCGTGGGCGGTG 
CAGAGGTCGCCAGCGGCGAGGTACCCACCGCGATCCTCCTGTCGCAGGTGTTCCAGCAGAAGTATCCTCTGGACTTCGTCTTCTACAGCCATATC 
ATGGAGTTCTTTAAGCACGCCTATGGTCCCATTCCAGACCTCACCGTGGCTGTGATTAATCAGCGGGAGCTGTTGTATGGTCTCACCCTGGAAGC 
CGCTGGCCAGAGTTGGCAGCTGAGCCTGTGCATCAACTGCAAGGAGGTGCTATGCGCCAAGCGATTGACGGCGGGCGCCGGGGCCACGCCCACTC 
TCTACCTCATCAACTGCACGCTGTTGACCAGCAGCGAGGAGCTGGCTCAACGGAAGTCCAACAAAGCCTTCTCGGAGACCTTCGAGCTGCTGATC 
ATGGATCACACAGGCAGTGACTCTGGATCCACTCGACCCACCAGTGGACTATCCAACTCCGCATCGAGTCTAAGCATCGGATCCCTGTCTTCGAA 
TCCCACTGATGCCAGGGTGCAGAGATTGAGAATGATACAAGCGCACCAGCAGGCGCGATTGCAGGAGGAGATTGTGGAGACAAATGAGAGAATCG 
AGCGGTATACGCGCACTCAGTTCCAGTTACTAAACAGTTTCCGGGAGAAATCGGATCAGGACTGCGCGTTGCTATTCAAAGTTATTCGTGCGTTG 
CCCGAGCAGGCCTCGGAGCTACTCGATCGTGCCATGCTCCCTGCCCTTGATGTGGCGGCCAATCAGCCGCAGAGTGCGACTGCCCGGCGCAGGAA 
CACCATCAGCAGCCGGCGGGAGTTGAATGGTGGACCCACCACACCCACTACTACGTTGACACATCCGCCCAACTTCCTATCGCTGAACCAGCAGC 
CGCAGCAACAGTTGCTGGCGCAGCAACAGCAGCAACAGTTGCAGGCGCAACAGCAGCAGCAGCAGCAGGCCCCGCAGAGTGTCTCGATGGCCAGG 
AAGATGTCGCACTTTGACACACCGCCCGCCACCCCAGAAGCCACGCCCATGAGCGTGGGCAACAGCCCCACCTTCCGGCAGCAATCAGCAGCCGG 
CGGTGCTGGTGGTCCACCCACCCAGACGTTGACCCCAAATGGAGTGGGTCAGTCCAGTGTTAACGAAGATGCAGATGACTGCCTGTTCGATCTGG 
AGGATGTGGATGCTCCGGTTCCGGTTCAGTCCGTGCCTGTGCCTAGCTACACGCGTAGCCTGATCTACCAGCAGCAGCCTCAGCACAATCCCTTC 
CAGCAGCTGAGCCAACAAAATGGCCTACGAAGTGTCCTAGATGACGAAGCTGCGGATGAAGCTGAAGATGCTCTGGACCCGGATAGCTCCATTTC 
CATACCAGTGCGAGGCGGAGGGCGTCCTAGCCATGCGCAGTTGATGAACTTCGCCCGGAGTCTGCCCATCGAGATAGCAAACACCACGCTGGCCG 
AGAGAGCTGCTGTGGCCAACAACAATAACTTCGGGCAGGGCTGCGAGGAGGGAATGGACAACATCGACATAGCGGCAAGCATCCAAGCGCTGACC 
AGAAGTGTCCATGGCGAGGCTGTTTTTGGGGACTTGCCACGGCCCCGCCTGCGATCTCAGATCGAGGGCTAG 
(SEQ ID NO: 1112) 

Start ATG: 379 (Reverse strand: CAT) 

MLISCKCSNFVASTSKLQRSNSSSGAGGRVGGAEVASGEVPTAILLSQVFQQKYPLDFVFYSHrMEFFKHAYGPIPDLTVAVINQRELLYGLTLE 

AAGQSWQLSLCINCKEVLCAKRLTAGAGATPTLYLINCTLLTSSEELAQRKSNKAFSETFELLIMDHTGSDSGSTRPTSGLSNSASSLSIGSLSS 

NPTDARVQRLRMIQAHQQARLQEEIVETNERIERYTRTQFQLLKSFREKSDQDCALLFKVIRALPEQASELLDRAMLPALDVAANQPQSATARRR 

NTISSRRELNGGPTTPTTTLTHPPNFLSLNQQPQQQLLAQQQQQQLQAQQQQQQQAPQSVSMARKMSHFDTPPATPEATPMSVGNSPTFRQQSAA 

GGAGGPPTQTLTPNGVGQSSVNEDADDCLFDLEDVDAPVPVQSVPVPSYTRSLIYQQQPQHNPFQQLSQQNGLRSVLDDEAADEAEDALDPDSSI 

SIPVRGGGRPSHAQLMNFARSLPIEIANTTLAERAAVANNNNFGQGCEEGMDNIDIAASIQALTRSVHGEAVFGDLPRPRLRSQIEG* 

(SEQ ID NO: 1113) 



Celera Sequence No. : 142000013384483 

TCCACCAACCACCAACCACCACCACCCACCAGGAATGTTATGTACTGGTGCTACTCCCAGTCGTATTTCATAATTTTGGACAAGTCACTGGGCCA 
TAATTGACCCGCCGCTGATCGATAAGCTCGAGTGACACATTGCGCATTCGAACTGTGGGCCAAATGGCACGGAATTGATGATATGTGGGCGGGCG 
TTCATTGTGGTCGCTTTTCATGTGTCGCAATCTCTATTAAGGGAACTACATTATGCGGCCAACGAAAATCGAACAAATCATAATGAAAATGTGTG 
GCCAACGGGTGGAAGAATGTAATGATGCCGAAGGGAAATGTTCGTTAGGCAATGTTCAAACATATGCATAGATGGGTTATATTCCAACGTTTTCA 
AATCTTACAAACTATTTTCTAAAGTTTACTCCATTTCTTTTTCTAGTAAAATTTCTAATTCTGTAGCACTTAGCGCGTTCATTAACCTTGGGCAA 
CTCAATGAAGATCTTATTGAAGATTGGAATGCATCGCTGGGATTCCGCCTAAAGGTACAGTGCAGCTTCTTTAAAAAGAACACTGTAAGTTTAAT 
GCTAGAACTTAACAAAAACATGTGTTTCTTACAAATTTCTTTCATTTTATATTAAAACACGATTTTCAAGATTTTTAAAGATTTTCCAGAGAAGT 
AAAACAAAATGTAAAGAACTCGTTTTTGGGACCAGTTGGTGTGTTTGTTAGTTGTTCGATTTCTATTTAGAGTTCCAGCGGCACAAACAGCTGTA 
CATAGGAAAGCACAACAACTCTTACATATGCGAATGGGAAAGCGAATATGGTGGAAGGCGAAACAACAATAATACTAAAGCCAAAAGGTGAGAGC 
GGAAAAAATAGAGAGAGCAGCGCCGCATTCGTGTTGTATTTGGTTTTTTATGACCGCCGCCGCCGACGACGACTAATTTTGTCGATGATAGCCGT 
TCGTTTCGAGTCAAAGTTTCAGTCAATTTTTAGTCGTCAAAGCAGGCGGACGTCCAAGCAAAAAAAATTCAGAATCAAAAATATTGACAGTGACT 
TGAGAGGATTACGTACTTAACTTAGCAACTCGCACCTCGCAAAAAGAAAAGAAAACACATTGCAAAAATGTCAGCACGCCGCGTCACATTGAACA 
CACGCGTTTCGCGCGCCTCCACCTCCACGCCGGTGGGCGGGGCATCCACCTCGTCAAGGGTGGGCGCCACCTCACCCACCTCGCCCACGCGCACC 
AGCCGGCAGCAGGAGAAGGAGGAACTGCAGCATTTGAACGATCGCCTGGCCTGCTACATCGATCGCATGCGCAACCTGGAGAACGAGAACAGCCG 
GCTCACCCAGGAACTGAATCTCGCCCAGGATACCGTCAACAGGGAGACCTCTAATCTGAAGGCGGTCTATGAGAAGGAGCTGGCCGCCGCTCGCA 
AGCTGCTCGATGAGACCGCCAAAGAGAAGGCCAAGCTTGAGATCGATATCAAGCGTCTGTGGGAGGAGAACGACGACTTGAAGCCCAGGTAAAGC 
TCCAAGACGGCGGGTTCAGGTTTGGGCTCGGAATTGGTAACAGCGTATTATCAGCCAGTGATGTCATATGTACATATGTACCCATGCCGGGCCAC 
T G G AGC GC G AG AC AC AGGC AG AAAATGGGGGC AACT T TGGG AG C C T T GT C AAGAAAAT GT G GAAT C AAT AAT GC AC AG C AC G AAAAGC AGC GAAA 
TTTCTTCGATTTGTAAAGTAAAAGTACAGTCAGTCGAGATTTCTAGAAATGCACTACTTTTTATCAAATTGTTTTTTTTTTTTTACTTTTTTGTT 
GTATTTTGTATTGAAATAAGCGAATTTTATATTTATTTCTTAGGTACAGTTGACCTAGAAGTTCAATTCCAGCTATGTAGTTATGTAGCGAAAAC 
AATAACAAATTGTTTTATTTATAAAGGCAATATATTTAATTTTCATATTTCATGCTATTCGTAAGAAGAAAAGAGTTTTAGTAAGATACAAACTT 
CAGTTTCCATGGTTGTTTTTTTTTTATATGTAGGGAGTGACAGAGACAGACAGAGAACGGGGCAGACTGAGAGCGAGCGAGGCCGATGATGTCAT 
GCCAAAAGCACATTTTGAAAGCCAAGCACATGGGAAACTCTCTCGACTCTTTCGCTGTTGTTTATTTTTTGCATTTACACTTCCAGAATTCTCTA 
GGCATTTTTATGCAAATGTTTAGTTCTCAGCTAATGTGTGAGTAGTAAACCAAAAGAAACCCAATGAAAATCAAAAAGGAAGAATCGAAATACCG 
CCGTAAATCTCAATGAAGGTTACAAGACTATGATGTCATATGGCACTCAGTTACAGATCCATTTGTAATACCAGTTTCTGGTGGGAAGTCAACAA 
AAGTTGGGTTGCTTAGCTTTTCCACTTGCGAGATTTCCGTTTCGCGTCGTTTGCGTTATCATATTTATTTTTGGTATTTACTGCGCGATGAGGCC 
CAAAGCTTTTTCAATTCATAATAGCCACAATTTGTTGTTTGTGCGTTTTGCAATGCGTTAACGAAGTTAAGAGTTTGGCTAAAAGGCGGGTGCCT 
AAATGTTGGGTAAACTTGATTTTTCGGCTACTTAAGGTGTGGCTTATACGTTGAGCAACGGACCGGAATAACCAAGGAAATTGTTTATGATGATT 
ACGATTAGATACGATAAGATTGCAGCAGAATTATGGAACTTGTGATAAAAGATTTATCTTTTGTTGAGTGTCCATATTGCTTAAATTATTTTCAT 
TTGTGAATGCTGGCTGTGCTTAATTATTTTAAATTTTTTTTTGAAGAATTTAGAAAGCCAAATATATTTTCATATTTATTCTTTGGTGACAATAG 
CATACATATGTACAAGCGAATTAGGTCAGCGGAAGGAATTGGGGAGGAAAAATGGAACTTTAACACGCTCACACAAAACGCACAAAAAAGGGGGA 
AATTTATGAAAATCTAGAACGCAAACAAAACAACCTTGAATCTATTTTGTTTTTATATTTCCCCATTTTTACACAACTTTTCTTGATTAGCTACG 
AAAACTGTTTACATTTTCTCGTTGCAACAGCAAAAACCGAAAGAAAATCGATAAAACAGAATAGAAGATTTTCAATCCATGAAAATAAACTTTCA 
AAAATGGTCAACACAATCGCCGTAAAAGGCGCCAAACTATATATATAATTTTAATTACGTTTTCATTGAACGCATGACATGCAAAGTCAATTATT 
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TCATAGTTGAAATTATTGGATTTGTTATAGAGAATAATTTAAAACGTGCCGACCACGCGAGTCGCCGATTGACAACCAAAATTGATATCCGAAAA 
TATGCAGGGATTATTTGTGGAATTTTCGTAGCTATTTTTTATAGCGAGTTTGTTTTTGCCTAAATAGAATTTCAAATACAAAAAATAAAAGGCGC 
TCATAAATCACAGGGGGTGGTGGGGTTTCGCAGGGGGGACGGGGTGAACTGGTCAGAAATAGCCGGGCTACTTTATGCTAATGGCATGTGACGAC 
TTCTCCCAAGGCAAACAAAAACCTCGAAATGTTTGAGTATTTGATTCTTCTATGGAGGAGGATCGATGAACGAGTTTCCAAGCGACTATAGCATT 
TAAATAAATGAAATGAATAATACATAAGCGATATAAAGAGAGCGGTTTAGCTTTCGCAAACAAATGTGAGTCGCAATGTGACTCAGTTATAATTA 
TATAGATAAGTGACGTCATTCAATGAGCACATGTAGATAAAAGCAAAAAAAACAATTGCTCCTATGAACCGCCTAAATTTCGATAAGGGATTACT 
CAGGCAAATAGGCTGATTAATTATTAAATTAAAAAGTTCTTTGTTATAAAAGATTATTAATTTTTAAACCAAATCCATTTTAAATTTAAGACTCG 
ACAAAAAGACGAAGGAGGCTACCGTGGCAGAGAACAATGCCCGCCTGTATGAGAACCGCTACAACGAGGTGAATGGCAAGTACAACCAATCGTTG 
GCCGATCGCAAGAAGTTCGAGGATCAGGCCAAGGAGCTCGCTCTGGAGAACGAACGCCTGCGCCGCCAACTGGATGATCTTCGCAAGCAGCTGGA 
GGCGGAGACGCTGGCCCGCGTCGATCTGGAGAACCAGAACCAGAGCCTGCGCGAGGAGCTGGCCTTCAAGGATCAGGTGCACACCCAGGAGCTCA 
CCGAGACCCGATCCCGTCGCCAGATCGAGATCAGCGAGATCGATGGCCGACTGTCGCGTCAATACGAGGCCAAGCTGCAGCAATCGCTCCAGGAA 
CTGCGTGATCAGTACGAGGGTCAAATGCGCATCAATCGCGAGGAGATCGAGCTGCTGTACGACAATGAGATCCAGAACCTCAAGGCTGCTGCCAA 
TAGGGCTGCCCAGGGATCCGCTCTTGCCACCGAGGAAGTCCGTCTCATGCGCACCAAGATCGATGGCTTGAACGCCAAGCTACAGAATCTGGAGG 
ACACCAATGCTGGCTTGAATGTGAGTACTGCAATTGCCAAACTAAATTCCCCGTGTGATTTTAATGTTGACATTTTCTCATTTACCATAGGCTCG 
CATTCGGGAGCTGGAGAACCTCTTGGACACGGAGCGCCAGCGCCACAACCAGTACATCGCCTCCTTGGAAGCCGAACTACAGCGTATGCGCGACG 
AGATGGCTCACCAGCTGCAGGAGTACCAGGGTCTGATGGACATCAAGGTGTCGTTGGATCTCGAAATTGCCGCTTACGACAAACTGCTGTGCGGT 
GAGGAGCGTCGCCTGAACATCGAGTCGCCCGGACGCCCGACTACCGACTCCGGTATCTCCAGCAATGGCAGCCACTTGACCGCCAGTGCCAGTTC 
CCGATCCGGCAGGGTCACGCCCAGTGGCCGTCGCTCGGCTACACCTGGCATCTCTGGATCCAGCGCCGTGAAGCGTCGCCGCACCGTGATCGATG 
AGAGCGAGGATCGCACTCTGTCCGAGTACTCGGTGAATGCCGCCGCCAAGGGAGATCTGGAGATCATCGAGGCGGACGTTGAGGGACGCTTCATC 
AAGCTGCACAACAAGGGAACCGAGGAGATCAATCTGACTGGCTGGCAGCTAACCCGCATTGCCGGCGACGAGGAACTGGCCTTCAAGTTCTCGCG 
CGGATCGAAGGTACTCGGCGGTGCTAGTGTAACCATTTGGTCCGTCGATGCTGGTACCGCACATGATCCACCCAACAATCTGGTGATGAAGAAGA 
AGT GG C C G GT G GC CAACT C GAT GCG AT C GGT GC T GGC C AATGC CG AC AAAG AGGT GAG AGT CT T GT C CAT AT AC AAC AC ATGCGGC AC T T GCGAC 
ATCTGCCGCCACTGCAGACACCTCGAATTCGTGCTCGACCAGTACGATATGCCATAGACCTTTGCATATTTCGACTATAATTTACGTTATATCTT 
TTGCCTTTCAGGATGTTGCCAGCTACGACCGTGTCCGTGCTAACGTTTCCAGCCACACCTCGCGGCATCGCAGCAGTGGCACGCCCAGTACGGGC 
TTTACCCTCGGTTCCGGAGCTGGCTCCACTGGCGTGAGGAGCCTCTTCTCCCTGCTCTTCTAGGCGGGAGCTTCAACTTCGAATCGGACTGGCTT 
GGC TTGGATCCTGAGGCCCAGCCAAACAAAACCCAACTCACCGCAGAGACTC AC AGACTCCGAGCTGCACATTCAAACCACACACAGATCTGTTC 
TGCATATCAAGCTTATTGTGAAACTATGTACAACAATCAAAACTTGTTCAACTCTTAATGTTAAATGTTTAAGTGACCTTAGTACATGTCTTTGA 
CTCTATGCGTCTCCTATGTTTAAGTAAGTTTTTTCACGCTCTACACCCTACTCTCACTCTCTCTCTCTGTAATTTATGACGAGAACATGCATTTT 
GGTAAAAATACAAAAAT2VAACAAACAAATAAATGTTACAAGAAAGTTGGCTGGAAAATCAATTAATGAGATTTAGCCGTGCGGTCGCCTGCGTTG 
ATATCTTTTTTGCGAGGAAGCCAAATCGGCTTAGGCCAGTGGAAAACTTCTTGAGATGCTGTCGCGGCATTTTGCACTATCTGGGATCCAGGACC 
ATGGGCCAGGCCATAATGGACAGGTAAGTGGGGGAAATTAACTACAAGAAGTCAGGTGGACAGGTGCAGCTGGGAGCAAAACTGTGGGCTAAAAC 
CGAACGAGGTCAACGAAAAAGTTTCGCAACTTTACAACTTGAAAATAAATTAGGTTGGATCACTGGGAAAGGTCGAGCAAACCGAAGCAAAAGTT 
TTTAATGGAGCAAATATACAGCTTCAGGGCAGTTGTAGAAATATACTATTTAACAATTCTAGGGCTGCCATTCAAACTAATAAACGACTCGTTTC 
GGATACAAAAATAATGCTTAGAAATACATATGTGCAATAAAAGCCAGTTGTACTAGTTCGTTTCTTTCTAATCTCCAACTAGAATACAGGAAAAA 
CCTAGGCGTCCCACTGTTTAATCTGCATCCTTTGGCTTACTCCAAAGGAATTAGCATTGCCAGGACGAAGAAGAAGATCCCTTCACTTGGCGCCG 
CCTTAGCCAAATTGCGTTATCTATTGCCCAGGCATTTCTCGCAGTGCATGAGCAGCCACTAAGGCAACTTAGCGGCGCTTGAGTGAGTGCACGAT 
TCAATGGAATGAATGCGAGAGAAAGGCGGCGGAGAAAGGTGAAAGCGGCTGAGACGGCTGCGAAGACTGAGAGGGCTCAGAAGACTGAGGGGGCG 
GGCAGACAGTTGACTTCAAAGTAGTTTTCCTTGGCAAAGTTTTTTTCCTCCTTGGCGCACGTTTGCATGCGCATAATAGGATTAGTCAAAGTAAT 
AAGTTGTTTTTCTTCCTCCACTTTTAGCTGCTGATGCTACCTAGATTCCTGCTGCTCACAG 
<SEQ ID NO: 1114) 

Exon: 1001.. 1513 
Exon: 3891.. 4485 
Exon: 4556.. 5278 
Exon: 5427.. 5901 
Start ATG: 1113 

Transcript No. : CT28479 

CGTCCAAGCAAAAAAAATTCAGAATCAAAAATATTGACAGTGACTTGAGAGGATTACGTACTT7AACTTAGCAACTCGCACCTCGCAAAAAGAAAA 
GAAAACACATTGCAAAAATGTCAGCACGCCGCGTCACATTGAACACACGCGTTTCGCGCGCCTCCACCTCCACGCCGGTGGGCGGGGCATCCACC 
TCGTCAAGGGTGGGCGCCACCTCACCCACCTCGCCCACGCGCACCAGCCGGCAGCAGGAGAAGGAGGAACTGCAGCATTTGAACGATCGCCTGGC 
CTGCTACATCGATCGCATGCGCAACCTGGAGAACGAGAACAGCCGGCTCACCCAGGAACTGAATCTCGCCCAGGATACCGTCAACAGGGAGACCT 
CTAATCTGAAGGCGGTCTATGAGAAGGAGCTGGCCGCCGCTCGCAAGCTGCTCGATGAGACCGCCAAAGAGAAGGCCAAGCTTGAGATCGATATC 
AAGCGTCTGTGGGAGGAGAACGACGACTTGAAGCCCAGACTCGACAAAAAGACGAAGGAGGCTACCGTGGCAGAGAACAATGCCCGCCTGTATGA 
GAACCGCTACAACGAGGTGAATGGCAAGTACAACCAATCGTTGGCCGATCGCAAGAAGTTCGAGGATCAGGCCAAGGAGCTCGCTCTGGAGAACG 
AACGCCTGCGCCGCCAACTGGATGATCTTCGCAAGCAGCTGGAGGCGGAGACGCTGGCCCGCGTCGATCTGGAGAACCAGAACCAGAGCCTGCGC 
GAGGAGCTGGCCTTCAAGGATCAGGTGCACACCCAGGAGCTCACCGAGACCCGATCCCGTCGCCAGATCGAGATCAGCGAGATCGATGGCCGACT 
GTCGCGTCAATACGAGGCCAAGCTGCAGCAATCGCTCCAGGAACTGCGTGATCAGTACGAGGGTCA/AATGCGCATCAATCGCGAGGAGATCGAGC 
TGCTGTACGACAATGAGATCCAGAACCTCAAGGCTGCTGCCAATAGGGCTGCCCAGGGATCCGCTCTTGCCACCGAGGAAGTCCGTCTCATGCGC 
ACCAAGATCGATGGCTTGAACGCCAAGCTACAGAATCTGGAGGACACCAATGCTGGCTTGAATGCTCGCATTCGGGAGCTGGAGAACCTCTTGGA 
CACGGAGCGCCAGCGCCACAACCAGTACATCGCCTCCTTGGAAGCCGAACTACAGCGTATGCGCGACGAGATGGCTCACCAGCTGCAGGAGTACC 
AGGGTCTGATGGACATCAAGGTGTCGTTGGATCTCGAAATTGCCGCTTACGACAAACTGCTGTGCGGTGAGGAGCGTCGCCTGAACATCGAGTCG 
CCCGGACGCCCGACTACCGACTCCGGTATCTCCAGCAATGGCAGCCACTTGACCGCCAGTGCCAGTTCCCGATCCGGCAGGGTCACGCCCAGTGG 
CCGTCGCTCGGCTACACCTGGCATCTCTGGATCCAGCGCCGTGAAGCGTCGCCGCACCGTGATCGATGAGAGCGAGGATCGCACTCTGTCCGAGT 
ACTCGGTGAATGCCGCCGCCAAGGGAGATCTGGAGATCATCGAGGCGGACGTTGAGGGACGCTTCATCAAGCTGCACAACAAGGGAACCGAGGAG 
ATCAATCTGACTGGCTGGCAGCTAACCCGCATTGCCGGCGACGAGGAACTGGCCTTCAAGTTCTCGCGCGGATCGAAGGTACTCGGCGGTGCTAG 
TGTAACCATTTGGTCCGTCGATGCTGGTACCGCACATGATCCACCCAACAATCTGGTGATGAAGAAGAAGTGGCCGGTGGCCAACTCGATGCGAT 
CGGTGCTGGCCAATGCCGACAAAGAGGATGTTGCCAGCTACGACCGTGTCCGTGCTAACGTTTCCAGCCACACCTCGCGGCATCGCAGCAGTGGC 
ACGCCCAGTACGGGCTTTACCCTCGGTTCCGGAGCTGGCTCCACTGGCGTGAGGAGCCTCTTCTCCCTGCTCTTCTAGGCGGGAGCTTCAACTTC 
GAATCGGACTGGCTTGGCTTGGATCCTGAGGCCCAGCCAAACAAAACCCAACTCACCGCAGAGACTCACAGACTCCGAGCTGCACATTCAAACCA 
CACACAGATCTGTTCTGCATATCAAGCTTATTGTGAAACTATGTACAACAATCAAAACTTGTTCAACTCTTAATGTTAAATGTTTAAGTGACCTT 
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AGTACATGTCTTTGACTCTATGCGTCTCCTATGTTTAAGTAAGTTTTTTCACGCTCTACACCCTACTCTCACTCTCTCTCTCTGTAATTTATGAC 
GAG AAC AT GCATTT TGGT AAAAAT AC 
(SEQ ID NO: 1115) 

Start ATG: 113 

MSARRVTLNTRVSRASTSTPVGGASTSSRVGATSPTSPTRTSRQQEKEELQHLKDRLACYIDRMRNLENENSRLTQELWLAQDTVNRETSNLKAV 

yEKELAAARKLLDETAKEKAKLEIDIKRLWEENDDLKPRLDKKTKEATVAENNARLYENRYNEVNGKYNQSLADRKKFEDQAKELALENERLRRQ 

LDDLRKQLEAETLARVDLENQNQSLREELAFKDQVHTQELTETRSRRQIEISEIDGRLSRQYEAKLQQSLQELRDQYEGQMRINREEIELLYDNE 

IQNLKAAANRAAQGSALATEEVRLMRTKIDGLNAKLQNLEDTNAGLNARIRELENLLDTERQRHNQYIASLEAELQRMRDEMAHQLQEYQGLMDI 

KVSLDLEIAAYDKLLCGEERRLNIESPGRPTTDSGISSNGSHLTASASSRSGRVTPSGRRSATPGISGSSAVKRRRTVIDESEDRTLSEYSVNAA 

AKGDLEIIEADVEGRFIKLHNKGTEEINLTGWQLTRIAGDEELAFKFSRGSKVLGGASVTIWSVDAGTAHDPPKNLVMKKKWPVANSMRSVLANA 

DKEDVASYDRVRANVSSHTSRHRSSGTPSTGFTLGSGAGSTGVRSLFSLLF* 

(SEQ ID NO; 1116) 

Name: LAMIN C 

Classification: structuraljprotein 
Gene Symbol: LamC 
FlyBase ID: FBgn0010397 



Celera Sequence No. : 142000013384657 

CTAGTGTCCTCAATAGATCCTGCCATTTAACTGTCAAATTAAAGTCGGAGCAGTCCTTGCTCAATTTCGATATCGTCGTTTTGTCTATTATTAGC 
ATTCTGGGCTGTCATGTTATCGCAACTGTTGTGCACCAATGTCAATTGCACAAGTCAAAAACTGCCTCATAGATGACTTTCGAGATACTTTGCCG 
GCAATTACGTAGCTGTTTCATTGTTCCACCATTTCGAACAAGCTTCATAATATTTTCACTTTTATGGCAGGCAACATAAAAGACGACAGCTTTGG 
CAATAGCAACAACATCATCATTATCAACACCATCAAAGCCCGGAAAATTTCACCTTCTGTGACTCCACCCCCCCTCCCAACGCCGTGCCTTTCGT 
CTCCTTTTGTCCCCCTCAATTGTCGTAAAATCATAGGAAAAAAGAAAGGAGCCAACGGAGGCGAAAAGTGTGAGTGCGTAAATTTCTCTCTGGAC 
TCGGAGTCGCTCAAGTGTCCTTCCATTCTGTGTCATAAAATTAACGCTTTATATGAACGAAAGCATAAGGCGCTCGATTTTTATCACCAAGGCCA 
AC AAG AT GGC C AT AAACG AATC AGG AAC ATG GAAAC G CAGAGG AAAAAAAT AGGC AGC T AAGT AT CT AC AAAT G AAAGT GT AAT T AC AAAAG ATT 
TATACTGATTTTTAAATAAACTTATAGCTAAATCTATATATAAATAAAAATGTATATTTTCCCAATTACAACCCACTTCGTACTGTTATTTTCTG 
GCAGTGCATAAGCCCATAGCGAACCTTATGAGAACATAAAGAGTTAAGAGTTG2^ATTAGAAGCAGAAGAAACTGGGACCGAAGTACGTTGGGTAG 
T AAG AG AGGT GGT CC AAAGAGT GC C T AAT T AT T T ACGT GC AAT GAAAG7T ATG AATTGGGG AAC AGG TC AAGT C GGC GAGG GT AAAAAT AC C C CC 
GAACCTCCCTATGCCTTTACTGTCTGTGTGAATGGCTTCAGTCTGGTTTCGTAGCCAGCCCCTTATACACATTCTCGTGTTGTGGCTGTGAAATC 
GTTTTTCCCAGGTGATCTGAAATTTTATTAGCATCCATTGTTATAAATTTGAATCAGTATAAAAGATCATTCGCCAAATACACTCAACCATGCTG 
ACGACCCAGCAGACCGAATTCAAGATGATGGACAACGGCACCGAGACGCTGGCCGATTCTCCTTCGAGTTCCAGCGAGGATCTGAAAACGCTTTC 
GGATATGGATATTCGCAAGCCCGTAACGGAGATTGTTGATTTGCGCCTTAAGCATCCGCTAGAGAACACTTGGACACTGTGGTACTTGGAGAATG 
ATCGCAGCAAGAACTGGGAGGACATGCAGAACGAGATTACTAGCTTTGACATGGTCGAGGACTTCTGGAGTCTCTACAATCACATTAAGCAGCCA 
TCTGAGATTAGAGTGGGCAGCGATTATTCTTTGTTCAAGAAGGGCATTCAGTAAGATTGGCTAGATTTATATACATGATTACCGATTTCCTATTC 
ATTGCAGGCCAATGTGGGAGGATGATGCGAACAAGTTCGGCGGCCGTTGGGTCATCAATATGGGGCGTGGCTCCAAAGCGGAGCTCGACAAGCTT 
TGGTTAGATGTGGTAAGTAGCTATTGGAGCTTTGTGTCCATCGCCTGTCATATCAGTTGCAACTGCACAACCTCTTTTCTGCCCACTGACAGCTA 
CTCATACTGATTGGCGAGGCCTTTGAAAACACCGAGGAAGTCTGCGGCGCCGTTATCAACTTGCGCGGGAAGAGCAACAAAATTTGTGAGTATTA 
CTGAAAAAAAAAAGATGTGCATGTGCATTAAAAAAGCTGGTTACAAATATTATATTTTACTAAAAACATTACACATGTTTTGTTTACTTCGCCAA 
CCTATAAATATTAAGTGGTATACACCTAGGTTGGCTCTGTTTTTTTTTTTTAAACATAGTTTACACATTGTAGTTTTGTAGTGTACTCAGCTTAA 
GTAGCCAGTGGACACCACCATGACAACTACTAACCCAGCGCCTTTACTCCGCAAGCGATTTGGACCGCCAACGGGCACAACGAGCTGGCCGTCAT 
GGAGATTGGACTCAAGTTACGCGACCTGCTCGTCCTGCCGCCCCACCAACTGCAGTACCAGCTGCACAAGGACACTATGTGCAAGCAGGGATCGG 
TGATCAAAGCGGTTTATTGCGTGTAAGCCGTCTCCGTTTTGGATGCCACCCACCACTCCATCCACCAACCACCCACATTTCCATGGCCGCTTATG 
CCTCTGAATGCTGTTTCCAGCGCGTGTATCCGAAAAATATAAAAAAAAAAATGTATCTTGAAACAGAACTGGTTTTATGAATTTGACCATGTTAA 
AAATTGCAAAGAAAGATGTAAATACATAGTGTTGTGAATGCATATACGACGAGCCATCATTAAATTCCCCAGTTTTTAAGTCCCAAGGAGCTCTT 
TTTATGCCGCTAGTTGGACCCTTTATGCTCCACAAAATGTTATGCAAACATATGCCGCACCAGAGCGCAAATAAAATGGTTTATATACACTTGCA 
GTTTGTACTTTTGGCCGGGCCAGCTAGAAGTGCACAGTGACTGCAAAAGATAAACGTCCACTTGGTTCGCTTGGCTGGCCGAGCGAAAAACAAAA 
GACCGAAAAGGGTCAACAATGGCCACGGCGGATTGTCCGTTTTAGCGAGTGCCACAGTACCCGGCGCTGAATCTCTGGGATCTCTGGTATCTGGA 
ACTGGCGTCGAGGGCTTCCAAAATGGAGACAATTGAATTTTGATGAAAGGCCGAACGAAACGCCGTTGCCTCGCGCAGTATTAAAATTCATGCTA 
TGAGCTTCGAGTTCGAGTTTGAAAAGAGACTCTTAATTTGCGTGTAAAAAATGTTATTTTAATAATTTAAAAAAAAACCTATTAAGTACGTTTTT 
AAAGTATCTTTGGTATTCTAGTATCTATAACCATTATCGAATGACAAAGCTGTAGGTTCCCTCACATGCGGCTGGTAACGTTTATTCAACCCTTT 
GTTCAGC CCCTGT GACGTAACGTTTTAGCCAACTCGGGTCACCAGGCCAT TC AATCAACATTGAAATTCCGATGGC AGC GGC AC GTGCAGTAGCA 
ACTTCATTATGAGCCACTTTAAGCTGACAAAACTCATCTCCATCCGCATAGCCCGTTGACTGCATTTTATGGGCCCAGGTGGCGGCGGCTTACAT 
GGGGAGCTGTTAAACCGAAACCGCATCAAAACCGCACACGTCCCACATGGCATCCAGGTGCCTTCACCTCTTGTTTGGCCCA 
(SEQ ID NO: 1117) 

Exon: 1001.. 1475 

Exon: 1528.. 1627 

Exon: 1708.. 1795 

Exon: 2051. .2312 

Start ATG: 1135 

Transcript No. : CT28485 

GTAGCCAGCCCCTTATACACATTCTCGTGTTGTGGCTGTGAAATCGTTTTTCCCAGGTGATCTGAAATTTTATTAGCATCCATTGTTATAAATTT 
GAATCAGTATAAAAGATCATTCGCCAAATACACTCAACCATGCTGACGACCCAGCAGACCGAATTCAAGATGATGGACAACGGCACCGAGACGCT 
GGCCGATTCTCCTTCGAGTTCCAGCGAGGATCTGAAAACGCTTTCGGATATGGATATTCGCAAGCCCGTAACGGAGATTGTTGATTTGCGCCTTA 
AGCATCCGCTAGAGAACACTTGGACACTGTGGTACTTGGAGAATGATCGCAGCAAGAACTGGGAGGACATGCAGAACGAGATTACTAGCTTTGAC 
ATGGTCGAGGACTTCTGGAGTCTCTACAATCACATTAAGCAGCCATCTGAGATTAGAGTGGGCAGCGATTATTCTTTGTTCAAGAAGGGCATTCA 
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GCCAATGTGGGAGGATGATGCGAACAAGTTCGGCGGCCGTTGGGTCATCAATATGGGGCGTGGCTCCAAAGCGGAGCTCGACAAGCTTTGGTTAG 
ATGTGCTACTCATACTGATTGGCGAGGCCTTTGAAAACACCGAGGAAGTCTGCGGCGCCGTTATCAACTTGCGCGGGAAGAGCAACAAAATTTCG 
ATTTGGACCGCCAACGGGCACAACGAGCTGGCCGTCATGGAGATTGGACTCAAGTTACGCGACCTGCTCGTCCTGCCGCCCCACCAACTGCAGTA 
CCAGCTGCACAAGGACACTATGTGCAAGCAGGGATCGGTGATCAAAGCGGTTTATTGCGTGTAAGCCGTCTCCGTTTTGGATGCCACCCACCACT 
CCATCCACCAACCACCCACATTTCCATGGCCGCTTATGCCTCTGAATGCTGTTTCCAGCGCGTGTATCCG 
(SEQ ID NO; 1118) 

Start ATG: 135 

MLTTQQTEFKI^DNGTETLADSPSSSSEDLKT^ 

QPSEIRVGSDYSLFKKGIQPMWEDDANKFGGRWVINMGRGSKAELDKLWLDVLLILIGEAFENTSEVCGAVINLRGKSNKISIWTANGHNELAVM 

EIGLKLRDLLVLPPHQLQYQLHKDTMCKQGSVIKAVYCV* 

(SEQ ID NO: 1119) 

Name: eIF4E-related 
Classification: translation factor 



Celera Sequence No. : 142000013384483 

ATATGTGCAATAATCACTTCTTTTACGATCGAACGCCCTTTGAACAATTCCTGGCAGGTACCAGGTAATATATTTATTATATGATGGTTACCGCA 

ATTATAAAATGTCTCATTTGCCAGTTCCGATCGACTGGTGATTGTAACAGATCCTCCATTTGGCTGTCGCACTGAACTGATTTCGCACACCCTGC 

GCAGCCTTAGAAAACTGCATAATCAAATCAACCGATTGCCATCCACTCCACTGTCCATATTCTGGATATATCCCTATTATTCAGCTAACCACATC 

AGGCAGGAAATGCCTGAGCTTGAAATGTGTGAATACAGGATCAATTACACTAATCACCTTAGATATACGAATGTGGGCAAGCAGAGTCGGTTTTG 

TGGATCACCCGTGAGGCTGTTCACCAACGTGCCACTGCGACTTCTTCAATTGCCCCCTGAAGAGGGCTATAAGTACTGCCAAAAATGCGATTGTT 

ACACTGCCAAGGAGAACCTGCACTGCAACCGCTGCGGGAAGTGCCCCTCGGTGAATGGCCAAACGTACAAGCACTGCGAGTCCTGCGATGCCTGT 

GTGAAACCCAACTATGTGCATTGCTCCGACTGTAGAAGGTGCACGCAGAAGGAGGGACACAACTGCTCCTTCTATCAGACCAAACAACACTGCTG 

GATGTGCGGCCAGAAAGGCCATATCGAAACCAAGTGCCCAAACTTCCGCAAGAGAAAGACGAACTATACCAAAGGCTGCCTTTTGTGCGGCAAAC 

GAAATCACAGGGAAAAGCGATGCTCGTACCGCTCAAAATACTTCAGGGAACAGTGTTTTATGAACGAAACAACAATTCAGTGTCTTTAAGATACA 

TACAAATATATATGTACAAAGATAAATACAAATTGCATTTCTTTATATACTGAGGGGCACTTTAGACTGTATGCATTTATGCAGTTTATAAGAAC 

AACACTGACAAAACCTTCACTACCATCCATAAACAACAGGGCACACATTTGTTTAATACAAAATTTATTTGTTCTTAACACTTTTCGTTGAATGT 

CTTACAAGTTCTTGAATTGCAACGGCTGCGCGCGGAATTTCTGATGCACATGACCACGATAAATTGTGGAATGAATTCTCAGCCCGCCGGATGAA 

CGAACGGATTGGGGGAAAACGATTAAATTTTGAAAATATGCATTACGTTTAAACAGGTTTTTAACTTAAGCTTAGGTATAATTAATTTCATGTGG 

AAACACACCAATGCCTCAACAAGCGGAATGGATCTCTTCTTATCTCGCCGCAAAAGAAATGATGACGTCGACATTGTTCACGCTCGTTGAGGTGA 

AAGTATTTTATTGGCTACGAAATGCATGGCTTAAGAACGATTGTATTTGCCCCAAATGTGCAGCATGAAGACGGAAGCGATGAAGAGCAGCGACA 

TGACCAGCACGGGTACGGGACCACTGCAGCAAAAGTCTCATTAGCACTGGGCCTTTAATATAAATATAGCCGAGTTATTCACTTACACTTTGATA 

CCGGGCGAGTCGTCCGTGTAGAAACGCCACATGCCACCAGTTCCGGCTCCGCCGGGTGCACGGCTCCGGGCCGCTGTGGTGCTGGTGGTGGTCTT 

GCGCTGCTTCAGGGTGCTGCCGCCTCCGGATCCGGCGCTACGTGGCGCCGACAATTTGCTGGGGGAGCGCGATCCGCTGCCCACGGACGTTGAAC 

TGGCTGGAGCGGGCTGGCAAATGTGTTTTAATTAGTTGCCGCACTCAATTGCCATCTAGTTTAACTCACCATTTTGGGCTATCTGTTCCTCTGGA 

CCGACTGACCTGGAAAATATGAACGGGCGAAGTGGAATCTGAACGTGAATCTGAATCTGACGTGCTCGGCTTCGTGTCGCTTCGTCTTTTTTCGT 

TCCACGTCACTTTGGTGGTGGCACCGCTGCCGATAGGTCGTGACGCTGAGGTGACAGCTATCGTGCACTTAGACAGCTGGAGATGACAGGCTAAG 

GCAACTCAACTATCGGCTGCTTTGGCTCTAAAATGAACTAGTAAAAAAAAAACGAAGAAATAATATATTCAAGTTATGAATTTAATAGATAACAA 

TAATAGATAAAATATTAATTCTACAAAATGAATTGTTTAAATCAATTTGAATGAATCCTATTAATATAATTGGCTATTATTAAAACTCCGATAAT 

AAATGCTATTATTCTTGATTTCCCTTGATTTAATTATATAATACATACTTAATAACTATATAATTATATAGAATAAAAACTTAATCACGCATTTA 

ATAGATCATATAGATATAGAATATATAGAAAATCAATGAAATCGATTTTGATAGCGATAATGTGCAACCTTGCATGTAAGTTATTTTTAGATTTT 

AGCTGGGCAAGCGCAATTTTCTTGGCGCGCACCAAACAATTTGTAAATAATATTTTTGCTTAACTGGTATTGGTTCGACTGCGAATCACTAAAAA 

TTATTAAGTGACTTAAGCACCGTGTTTTAAACATTGTTAATGGGAACGGCCACACCGAAGCGCGGGCTCACTAAACGCTTTTGTCATTTTTGCAC 

ATCTGTACAACGTTCACGTTAATTGCAGCGGCATCATCTTTGCTGGATTTCAAAGGAATATTTTGCGGTGAAAAGAACGAAAAAGTGCTTGCATT 

TGTGCGCTAATGGGTATATGAGGTCCTTCTGCTAAAGGAATGCCAAGTCATACAAACTTTTCGCGATTCGCCTTAATGGAAATAGATAAGACGCG 

GCGCAGTTGAAAGATTGCAAATTGATCCACCGATCGACCGACGCACATGAAAAGTGAAGTATGGAGGACATTGAATATTTAGACGAGTACAAAGA 

CTTGGTGCTGCCAGGTGCGTGATTTCAGCTTCCAATGCTATAAATACTT 

{SEQ ID NO: 1120) 

Exon: 1899.. 1780 
Exon: 1723.. 1512 
Exon: 1448.. 1001 

Start ATG: 1782 (Reverse strand: CAT) 
Transcript No. : CT28511 

CGAAAAAAGACGAAGCGACACGAAGCCGAGCACGTCAGATTCAGATTCACGTTCAGATTCCACTTCGCCCGTTCATATTTTCCAGGTCAGTCGGT 
CCAGAGGAACAGATAGCCCAAAATGCCCGCTCCAGCCAGTTCAACGTCCGTGGGCAGCGGATCGCGCTCCCCCAGCAAATTGTCGGCGCCACGTA 
GCGCCGGATCCGGAGGCGGCAGCACCCTGAAGCAGCGCAAGACCACCACCAGCACCACAGCGGCCCGGAGCCGTGCACCCGGCGGAGCCGGAACT 
GGTGGCATGTGGCGTTTCTACACGGACGACTCGCCCGGTATCAAAGTTGGTCCCGTACCCGTGCTGGTCATGTCGCTGCTCTTCATCGCTTCCGT 
CTTCATGCTGCACATTTGGGGCAAATACAATCGTTCTTAAGCCATGCATTTCGTAGCCAATAAAATACTTTCACCTCAACGAGCGTGAACAATGT 
CGACGTCATCATTTCTTTTGCGGCGAGATAAGAAGAGATCCATTCCGCTTGTTGAGGCATTGGTGTGTTTCCACATGAAATTAATTATACCTAAG 
CTTAAGTTAAAAACCTGTTTAAACGTAATGCATATTTTCAAAATTTAATCGTTTTCCCCCAATCCGTTCGTTCATCCGGCGGGCTGAGAATTCAT 
TCCACAATTTATCGTGGTCATGTGCATCAGAAATTCCGCGCGCAGCCGTTGCAATTCAAGAACTTGTAAGACATTCAACGAAAAGTGTTAAGAAC 
AAATAAATTTTGTATTAAAC 
(SEQ ID NO: 1121) 

Start ATG: 118 {Reverse strand: CAT } 



FIGURE SHEET 605 



MPAPASSTSVGSGSRSPSKLSAPRSAGSGGGSTLKQRKTTTSTTAARSRAPGGAGTGGMWRFYTDDSPGIKVGPVPVLVMSLLFIASVFMLHIWG 
KYNRS* 

{SEQ ID NO: 1122) 

Name: sec61 beta-like 
Classification: transporter 



Celera Sequence No. : 142000013384134 

TGCCACTTTTTGATAGATCCTTCAATTATTCGCACTTGCTTCAGATAAAAGTTAAATATTGTCCGCAGATTGTAACCGCTTGAGCGCCCAAGCGC 
GACTGACTTCCCATCTTCCATGAGCATCCATTTAAATTTGCTGCGTACATATATCGCACCGTTCGGGTTCAAATTCTAATTGACCTTTCTCTTGG 
GGTAGGTTCGAAGAGACTTGATTGATCAAGGAAAGAGTGTCAGTGGGAAATGTCGTCAAATACTCCGCACTGTCTCTTATATAAACAAAAAAGAT 
AGATTGAAGTCTGCTTTGCTAAGTGATGGATTTTATTTCAAGGTATAACTATAAGAAGTGATTTCTTCATCTGTACAAAGGACACATTTGCATAC 
ATT T T C T AC AT T C AC ATGC AGC GG ATT AT AATT AC AGC AT AC AAAC T T AGGC AC T T AT T T AT CT GT ACT T GT AAAG AT AC AAAAT CT AGC T GG AT 
AACAATCGCATATCACATACTGTGTTGGTTTACAAGTATTTACACTGCCTACACGAGATAGAGATTCATTAAAAAGTTATTCAAAGATTCAAACG 
CTTAGCTTATATGGCAAGTGCCAAAACAATAAGTCAATTTCCTATTAACTAAGATAGTTATATACAATGCTATTCGCAGTTCCACGCGTTTCGTA 
ATATTTGTTCAATATTTACTAGAATTTTCTTGAGCTGGTCTTATGAAGTAAAAAGGGTTTGAAGAAATTGGAATTATTAAATTTTAGTTAAACAT 
TTAAATTGAATTTAAGTTAAACTCGATATGAGACTATCTGATCTGCTGCTTGAATTGCTAATAAAAAAGTGATTACTTGACTTCGAGGATAAAAT 
CCTTGCAATAATCTTTCTTTTTCAGGCTACTAATAATATCATTTTTAGTGGTTCTGGGTGGCACAATGCCATTGCATCTTAAACCTAATTACGTG 
GCTCGAATTCCGATCAAGTCTTAACTAATCTAATCCGACTGATAATAATGTCAGACGTACACCTCTCCGGTCTTTTGTTCCGGCTTGTGTTTGGC 
CAGAGTTGAGGGGGCCGAGGATCCTCCGACTTCGCCCGTTTCCAGGCCTCCGCTCCTCTGAGGAATTGGCATCGTGCCTGAGTTGTCCACTGTGC 
CC GT GGG ACCC GCT G AC C T CAT C G AGG AT CGT T GCT GC GG AT CC T GC AG AT T G AGGCT C AGGT T G AAAC T GTTGT T TC GCG AAATGT TCT GGT T T 
CCATTGAGGGGCTTAACGGACAGCAGTGCCGTAGTGGTGATACCGCCATTGCCATTGCTCGACCCACTGTTGCTATATCCATCCTGGACATCTGC 
ACCGGTCAGCTCCACAATGGAAAGTGCTGACGGCGATCGCTGGATGGACAAGGACACCGGTATCGTGGTGATCACATCGTTAGGATCGTAGCCAC 
TTGCCGTCGTCGTTATCGTGGGCAGCTGGGCATTCGCCGAAATGGGCAGGATGCTCAGTGTCCCAGTGGGCATCGTTTCCCCACCGGTGCTGGTC 
ACGCTGAAGGGGAACGACAGAGTGGTGATGTTTGTGGCAGCTGGAATACCTGCAGGTTATCAGAGGAAACAATAAGTGAGAAAAGAATATAAATA 
TTTAATCCATTAAGCTTTTAATATCCTCTTAAGAGTATGTATTGAACTCATCATTAACTATTCAGAGGCATTAAAAACTTCTAGATCTTGGATAT 
TTCCACTTACTGCTCGGCGTGAATCCCTCCTTCATGCTGGCTGCCAAAGCTGCGGTAGCCGGATGACCTGGAATCAGAGTGTTATTGGACGATGA 
GCTATCCATCTCTTTGATGGGCGAGAGCAGCAGCGTGCCCAGCACCTGACCGCTCTCCTGGTGCAGTAGTGTGTCGGGTGCGTAAGAGGAGGGGT 
ACATGCCCGGTGGTGCGCTGGTGGTGCTCAGCTGCTGTGGTGCCATTAGGACCGAGCTGGTGGCTACATGGGAGTGGTGTGAATTGTACAAAAAG 
TTGCAGGTTTTGTGCGGGATTTGTGGGTTGTTACTCACCTGGATTCGTGTTGCCATCCTTGGCCACCTCGGGTGTGGGAATGCGCGGTCTGGTCT 
TGGCCACATTGACATTCAAGCCATTTAATTTGGAGTCTTAAACATAGAAATAATTTTAAATTTAAAGCATACTCTTGGGGGTTTAAAGAAATTGT 
GATTACTACCAAAAAAATTAGTTTGATTTTTATTAATATTAATATTTTATTTAAAAATGTTTATTTCTATTGTTTTTTACTTACTTTGCTTGATA 
AGTTGGCGTTGTGCTCTCAACGATTTACGATGCACAAACTTTCCATAGAGACTTGATTTACTTGGCTGCGATCCGGCTTGCTGCTGCTGATGCGC 
CTGCTGTAATTGCTGCTGCTGCAGGATGGTCAGTGGCAAGCCGGACTCGGGACTAGAGTAGACGGAGTTGAGGCGGTTCCGGTTGAATGGCATTC 
CGCCGGAGGAGTAGGCCGTCGACTGGCTCCTCATGGAACCCATTCGGGAGAGCTGGCGAACCAGCCAGTTGTTTTTCCTCCTTTCCACCATTGGC 
GTGTCCACGCTTTCCTGTTTGTGGTTAGTAATAATATTAAAATGTGACTCTTGACATGAAAAAGTCAGCTTCAAAAATATATGTAAATTCGTCTG 
CATTAAATTATGATAGATTACCAATAAATGTGACAATAACTTAAGCATGAGAAATATTTTATAGCCTTTTAAGTTGAAGTTGTTAACCCTAAAAA 
CGTTATGTTTGCTGGAAATATATCTCACAAAACGCAACTAATGTCTAAGTTACAGCAATAAAACGTTTGCTTTCAATGACTGTTAATATAAATGT 
TAGGAGAATCTATGTGTTGAATGCAACTACGTTGGGAGAAATGTATGAAGATACTCACATAGTCCGCGTAGACATCGTCGCTGAGCATCCTCTTG 
CCGCCGCCTGGCATTATGTTGGCGTACATGGCGTCCTCCTTCTTGACCTTGTACTTCTCGGCGGAGCCCTTGGGCTCGTCTTTCCGGTAGTGTTC 
GGATGCCACCGTATAGGGCAGATCCTTGGGCACCACACACTCCCAGTACTGATCCCGCAGCAGCTCGGGGTGCTCCTCGTGCATCTCGTCCACGA 
TCATGTAGGCAGCCTGAAAGCGGGTTTATCAAAAGTGGATCAGTCGAGGGCTGTTCAAAATCGAAACGTGTTCGCCATTTGCGATTTTTTGGGCC 
ATTTCGAGGAGGCACTAAACCCAATTACGTCACAATAATTCAACCGTGGCTACTGTCCCACATCGAAACAAAAGAAAAAGTGAAATTGTTTACAA 
GTTTTTAATTATTCCATAACAAAAATCAGCGCACAGCCCACTACAGAAAATTGCAAAAACAAAAAGAGGAAAAGAAATAACAATAAAAGTATAAG 
CAATACTCATTTGCTTAATCATTTTGCCGCACTCTGTTTTCAATTTGCCATTTATGAGCGACTGAGCCAAGAGTCACGTCTGGTCGTGTGGAGTG 
CAGTTGAAATGGGCCAGAAAACCTGGCATTAGCTCCCCGCTGATAACCGGGCTAAAAGCGACCGCTTGAGATGCATTTCCGAATTCGAGCAAACC 
CAGCGCGCTGGGGTGACCATAAAAACACACACACACACACACATATACGAAACATTCAATGATAAAGCTCTACAAAAGAGGTTTCAATGTTTTCA 
GCTCGTATGTTTTTTTTCACGATTTTCTCATTTTTTGAACAAAAACCTGTTTTCAATTTACTTTTGCTGCTGCTGCTGCTAACTTTTGTAAATAA 
TTGCAAGCAACGGCAGGCCCCAGACACAGATACACACACACGCACTGCACACATGCAAAAATCGCTTCTAACACACACAAGCATCGACTTCCCAA 
AGCCCAAGTCCAAAACTTCAATCAGCATCAGCAGCAGAAGGAAAGCCGAATAGAGATACCCTCATCTAAATCGAAATCGTAAGTTTGACTTTGGC 
GGGTAAATTTGAATTTCACCAGAAGTAATTAAAATTCCCTTTCGAAGATTTATCTACTTTTATAAATGGAACGCTTTTCTAAGTCGCATTTATTT 
AATATTTTAATAGAGTCTACTGAGACAGCTTCAAGTAAAATGTATCTAATTACACAGAATGCAAGTTCATAAATAAAGATTTATTAAGTTCGACT 
TTCAGTCACATGTAAATTTGTAAGACCTACTTATAATGCACTCCGACTTTTCTTAGCTAGCATCTACTGATTTCATAATTCAACAATTACACATA 
TTTCTTGTTTTGTTTTGGTACATTCTTTTTTTTAGGGTACTCTGCAGTTCGTTTTGTTACACATTGCATTTGGCTCCTCAGTCTTTTGTTGGCTA 
TGTAGAGAATGCCTGTTATTTTTAGCACGTGGCAGACAGACATGTTCTTTTCCTTTCAGTAGCAATCCAAGAAGTGGCTGACTCAAGCGATTATC 
GCCTTAAGAGTGCAAAATCTTTACTTAGTTAGCACGCTAACCCAATTAGCGAATTCAGATTTTAGCAGTCATAATACTTGGAAACCTTTGGAATT 
ATGATTATGCATATTATCCCCTTGAACAGATACGTATAGATAAGTATCTTGCGGATGTGGGCATCCCAAAGTTTGATTTAAAAGCTGGAAAATTA 
AGTTTGGAAGTGGGTATTTCGATACAAAAGCCATCTGTTCCGAATTTGGTACTAATTTAATTTGGAAATTCGTGTATATATGTATCTACACCATG 
CTAAATGAAATTAATTGGAAATCATTCAGGTTTTTTTGTGTGAACTTCAAAAGGGGGATCTTTTTCGCAACTCTAACGCAAAATGATTGTTTTAT 
GGTTGTATGGACCGCAGGAATTATTGCCACCACTTATTAACAATGTTTAACGGATGGCTGGCGTATAGACAATGTGCCACATCGAATTACGTCAA 
GATTGTGCGAATTATCTGCACTGATGACTCACCTTGATGTGTCGGTCAATTAGCCAGTTCAGCTCGATATCATCGTCATCTTCGCCAAAGGGATT 
GATTAGCACCTCGGCCACCTTCAGCCAGCCCACGTAGAAAACAAACTGCAAAGATTAATTGAATGCGATTGGATTTTGGTTAATTATATCATGAA 
AAACAGAGTCGGACAATTTCAGCAGAACTTGGGAATAATGTGAGAGGGTAAGCCAGGGAAAGCTTGGCAATTCCATTCACTTTTTGCGGCTTCAA 
ATTATACGGCATGAACATTAACAGTTAATAATGATAGCTAAACACTTGTCACAGATTTATATCGCGGAGAAACACAAACCAGACCAGAGCTCGGC 
AAAATGCCAATTTATTACGGGGAACGTGACCGCGAATCGTTTTACATAAGGCACTTGAAATGTGTCCCCAGCGGAGAACCGCAGAAACCAAAACG 
GAAAACAGCAAACAGTTGTAACTGGCCATATCATGATATAGCCCAATATATATATAGTGCATCCGAGCGGCGTATCGCTAACAAGGTATAGCCAT 
TACATTCGGCCACTCAGTTTGGCGGAAAAATTGGCAAAAACAATTGCGAACAACAAAAAACCGCAACAAGTTTTTAAACTTTTACAAAACGACGA 
GACGTGCAAAATTGACTTTGATTTATAGTAAATAAGGCGGAACGTGCAAATTAAGATAACTTCTAAGCTGTCTTAAATCAAGCGAGAAAAATTAA 
TGAATTCATTATATATGGAGCAATTGGAATCACCTGCAATACGGTAAACAGGGGGAAGAACAGATCGGGATCTTCGCGTCCACTTCTGTCCAGAA 
CGTTGGGCAACATTTGACGGCCCAAAAGGGCAGCTATGAAATAGGTGTACAGCACCAGGGTGACTACCTGAAAATAAAGTTTTATAATTAATATT 
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CAATTAAAGAAGGGAAGGTTGGAATATGCTAGAAATATTGAATAAAACTCACCTGAGTGTAGACCAAAGGAACACAGACCGTGTCGTATCCAATA 
AGACCACCTAAACGTCTACGAATATCCGAGAGCTCTACCAGTATGGTTTGCACAATATGATCCGAGGCGATTAGGCCATCCTTTCTGGCCCGGTT 
TATGATGTTGGTAGCCCAAACGAGAGGCATCCAGTATTTGGACATGGGACTCTTCTGATTCAGTGCCTCAAAGATTTTCATCTCCGATTCGTGCA 
TCAGACCCGCATCCACGAGATGTTGGGTTGTGGGAAAACGACGTTTTACACGAAGGGAAATTCGCTGAAGGGTAATCACATAGGCCAAGACCATA 
T AG C GC AT GAT AT T AC G ACGC ATC AG GCG AC CC GTT T C AT T C T GT A T T AAG AAT T T T T AT AG AT TAT T GT TGGGT T AT T T G AAAAAAAT T AT T T T 
CTTACATTTACACCGCCATTTGAGTTGGGAATAGCCGCGCTTATGAACAGGGCCAACGTATCTGGCCAGGGCAAAAGCCTATATTGCTCCCACCA 
ACGTTTCACCACCAGATTCACATAGAAGCCCAGCACAAAGGACATGGGTATACTTTCACTCTGCTGCCCAAAATAAAGGCGAATTTTTTTAAAAA 
TGCTGAAAATATATTAGTATTATATTATATATTATGTTACCATTTTATAACCCTATTCATACTTACGCTTGCTGACTTTCTGTCAGACCAAATCT 
GTAGATAACATTTATGGTATAGTATAAACACAGATACGCAACTAATTCGCGCCAAATCAATTTGTAAACGCTGCCTCGCCATCTATTAAAAAAGG 
GAAGGGATTTGAATTAAGTAACTTATAGGTAGTCGAGGAATTTATGGCTTTAATCTGTAACTAACTGGTTTTTGTTTTCTTATACAAACGATTTC 
CTCATTCAGACTTTCAATCATTTCCCCCATCGCATTTCAGATTAGGTTTTCGTTTCTAATGAGTCGCCAGTTCTGCAAGATTTTTTATGGGGGTA 
TTCCAATCTTGCATGTCAGTTCCAATGAATTCAAATGTCTTAAAGTGTGATTTGATACATTATAAAAAATATATATATGTATATTGATTGGCATA 
ACCGCTTCGAAACTGGTACTATTCAGGTGGGGTCTTTGTTCTGTTTGTTTTGATCAAGGTTTCGCTTACCTTCTTCTCTTCAAAAATTTAGGTTG 
GGCTCTGACATTATAAGCAAAAAGTAACTAACTCGAGGCTCCCATGAAAAACAATAAGTTGTTAGCCTTTTTTCACATTTTTCCAGTCCTAAAAG 
TATCTTTCAATTAGTGAGTTCATTACTGAACGTGTGATAAGCTACAACTGAATTTTCAGAGCTTCGGCGTTTTGGCTACCAATTATTCTTCTTTC 
TGGTTTGAATGGCAAAAACTGGAACTGTGCAACAGTAACTGGAACTGGAATTGCTATTCTAACTGCCCAGTTACGAGTGACTGCACTTGGGAAAC 
AATTAAAGCGAACTGGGGTTAAAAATAAACACACTCTAAAAACAAAATAGCCTGAAACCAGATGTTGTCGCCGCTGAGATGTCACTGTATGGTGG 
(SEQ ID NO: 1123) 

Exon: 6505.. 6371 
Exon: 6311.. 5943 
Exon: 5862.. 5734 
Exon: 5080.. 4973 
Exon: 3148.. 2909 
Exon: 2579.. 2270 
Exon: 2126.. 2034 
Exon: 1963.. 1721 
Exon: 1569.. 1001 

Start ATG: 6505 (Reverse strand: CAT) 
Transcript No. : CT28609 

ATGTCCTTTGTGCTGGGCTTCTATGTGAATCTGGTGGTGAAACGTTGGTGGGAGCAATATAGGCTTTTGCCCTGGCCAGATACGTTGGCCCTGTT 
CATAAGCGCGGCTATTCCCAACTCAAATGGCGGTGTAAATAATGAAACGGGTCGCCTGATGCGTCGTAATATCATGCGCTATATGGTCTTGGCCT 
ATGTGATTACCCTTCAGCGAATTTCCCTTCGTGTAAAACGTCGTTTTCCCACAACCCAACATCTCGTGGATGCGGGTCTGATGCACGAATCGGAG 
ATGAAAATCTTTGAGGCACTGAATCAGAAGAGTCCCATGTCCAAATACTGGATGCCTCTCGTTTGGGCTACCAACATCATAAACCGGGCCAGAAA 
GGATGGCCTAATCGCCTCGGATCATATTGTGCAAACCATACTGGTAGAGCTCTCGGATATTCGTAGACGTTTAGGTGGTCTTATTGGATACGACA 
CGGTCTGTGTTCCTTTGGTCTACACTCAGGTAGTCACCCTGGTGCTGTACACCTATTTCATAGCTGCCCTTTTGGGCCGTCAAATGTTGCCCAAC 
GTTCTGGACAGAAGTGGACGCGAAGATCCCGATCTGTTCTTCCCCCTGTTTACCGTATTGCAGTTTGTTTTCTACGTGGGCTGGCTGAAGGTGGC 
CGAGGTGCTAATCAATCCCTTTGGCGAAGATGACGATGATATCGAGCTGAACTGGCTAATTGACCGACACATCAAGGCTGCCTACATGATCGTGG 
ACGAGATGCACGAGGAGCACCCCGAGCTGCTGCGGGATCAGTACTGGGAGTGTGTGGTGCCCAAGGATCTGCCCTATACGGTGGCATCCGAACAC 
TACCGGAAAGACGAGCCCAAGGGCTCCGCCGAGAAGTACAAGGTCAAGAAGGAGGACGCCATGTACGCCAACATAATGCCAGGCGGCGGCAAGAG 
GATGCTCAGCGACGATGTCTACGCGGACTATGAAAGCGTGGACACGCCAATGGTGGAAAGGAGGAAAAACAACTGGCTGGTTCGCCAGCTCTCCC 
GAATGGGTTCCATGAGGAGGCAGTCGACGGCCTACTCCTCCGGCGGAATGCCATTCAACCGGAACCGCCTCAACTCCGTCTACTCTAGTCCCGAG 
TCCGGCTTGCCACTGACCATCCTGCAGCAGCAGCAATTACAGCAGGCGCATCAGCAGCAGCAAGCCGGATCGCAGCCAAGTAAATCAAGTCTCTA 
TGGAAAGTTTGTGCATCGTAAATCGTTGAGAGCACAACGCCAACTTATCAAGCAAAACTCCAAATTAAATGGCTTGAATGTCAATGTGGCCAAGA 
CCAGACCGCGCATTCCCACACCCGAGGTGGCCAAGGATGGCAACACGAATCCAGCCACCAGCTCGGTCCTAATGGCACCACAGCAGCTGAGCACC 
ACCAGCGCACCACCGGGCATGTACCCCTCCTCTTACGCACCCGACACACTACTGCACCAGGAGAGCGGTCAGGTGCTGGGCACGCTGCTGCTCTC 
GCCCATCAAAGAGATGGATAGCTCATCGTCCAATAACACTCTGATTCCAGGTCATCCGGCTACCGCAGCTTTGGCAGCCAGCATGAAGGAGGGAT 
TCACGCCGAGCAGTATTCCAGCTGCCACAAACATCACCACTCTGTCGTTCCCCTTCAGCGTGACCAGCACCGGTGGGGAAACGATGCCCACTGGG 
ACACTGAGCATCCTGCCCATTTCGGCGAATGCCCAGCTGCCCACGATAACGACGACGGCAAGTGGCTACGATCCTAACGATGTGATCACCACGAT 
ACCGGTGTCCTTGTCCATCCAGCGATCGCCGTCAGCACTTTCCATTGTGGAGCTGACCGGTGCAGATGTCCAGGATGGATATAGCAACAGTGGGT 
CGAGCAATGGCAATGGCGGTATCACCACTACGGCACTGCTGTCCGTTAAGCCCCTCAATGGAAACCAGAACATTTCGCGAAACAACAGTTTCAAC 
CTGAGCCTCAATCTGCAGGATCCGCAGCAACGATCCTCGATGAGGTCAGCGGGTCCCACGGGCACAGTGGACAACTCAGGCACGATGCCAATTCC 
TCAGAGGAGCGGAGGCCTGGAAACGGGCGAAGTCGGAGGATCCTCGGCCCCCTCAACTCTGGCCAAACACAAGCCGGAACAAAAGACCGGAGAGG 
TGTACGTCTGA 
(SEQ ID NO: 1124) 

Start ATG: 1 (Reverse strand: CAT) 

MSFVLGFYVNLVVKRWWEQYRLLPWPDTLALFISAAIPNSNGGVNNETGRLMRRNIMRYMVLAYVITLQRISLRVKRRFPTTQHLVDAGLMHESE 
MKIFEALNQKSPMSKYWMPLVWATNIINRARKDGLIASDHIVQTILVELSDIRRRLGGLIGYDTVCVPLVYTQVVTLVLYTYFIAALLGRQMLPN 
VLDRSGREDPDLFFPLFTVLQFVFYVGWLKVAEVLINPFGEDDDDIELNWLIDRHIKAAYMIVDEMHEEHPELLRDQYWECWPKDLPYTVASEH 
YRKDEPKGSAEKYKVKKEDAMYANIMPGGGKRiyiLSDDVYADYESVDTPMVERRKNNWLVRQLSRMGSMRSQSTAYSSGGMPFNRNRLNSVYSSPE 
SGLPLTILQQQQLQQAHQQQQAGSQPSKSSLYGKFVHRKSLRAQRQLIKQNSKLNGLNVNVAKTRPRIPTPEVAKDGNTNPATSSVLMAPQQLST 
TSAPPGMYPSSYAPDTLLHQESGQVLGTLLLSPIKEMDSSSSKNTLIPGHPATAALAASMKEGFTPSSIPAATNITTLSFPFSVTSTGGETMPTG 
TLSILPISAWAQLPTITTTASGYDPNDVITTIPVSLSIQRSPSALSIVELTGADVQDGYSNSGSSNGNGGITTTALLSVKPLNGNQNISRNNSF1S! 
LSLNLQDPQQRSSMRSAGPTGTVDNSGTMPIPQRSGGLETGEVGGSSAPSTLAKHKPEQKTGEVYV* 
(SEQ ID NO: 1125) 

Name: BESTROPHIN-LIKE 
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Celera Sequence No. : 142000013384547 

GCTGGCCCGTGCCTGCGCCGCTCAGACAAAGTCCACCTTCCTCAAGCTGGCAGGTCCCCAGTTGGTGCAAATGTTCATCGGTGATGGTGCCAAGC 
TGGTGCGCGATGCCTTCGCCTTGGCCAAGGAGAAGGCGCCTGCCATTATCTTCATTGACGAGTTGGACGCCATTGGCACCAAGCGTTTCGATTCG 
GAGAAGGCCGGTGACCGTGAGGTCCAGCGAACTATGTTGGAGCTTCTTAATCAGCTCGATGGCTTCAGTTCCACTGCAGATATTAAGGTGATTGC 
GGCCACCAATCGCGTAGACATTCTGGATCCTGCTCTGCTGCGCTCTGGTCGTCTGGATCGTAAGATCGAGTTCCCACATCCCAACGAGGAAGCCC 
GTGCCCGTATTATGCAGATTCACTCGCGTAAAATGAACGTTAGCAATGATGTGAATTTCGAGGAATTGTCCCGATCCACGGATGACTTCAACGGC 
GCCCAGTGCAAAGCCGTCTGTGTGGAAGCTGGTATGATCGCACTGCGTCGCTCCGCCAATTCGGTTACGCACGAAGACTTCATGGACGCCATCAT 
GGAAGTGCAGGCGAAGAAGAAGGCTAATCTTAACTACTACGCTTAGACTAAAAATTTGCAGTTTAACTTCACATTGTCAGCTCCCATTAAAATAA 
ACCATGGAAATGTAAAATTTTTCAGTTTCGATCTGAGTTGCCCCTTGAAGATCGCCTTTAAAATAACATCGCAACGGCTCTTAAAAGTAACTAGA 
ATGGGAGACGTACTGAAATCAATACCCAACCTTTTTAAAATAAACAAAAATATATATCAAATGTCGAGCGATTCATTGGTTTATCAGTACAACAT 
TTACATTCTGAGTACGATATTTTTTAAATATCTAAAAAGGTCGGTTTTAGAATTAATAAAAAATATTACATCGGGCGAAATTAGATGATAAATAT 
TTACACGCACACTAATTAACCTAGAGGGATGATTTACCAAGTTACTAAGACTAACCAGTGGGATCGAGTGTGTTAGCGCCATCCGGCGCCGCAGA 
TGTCGTGCCACTTCCCCCTTGTGACGCTGCATCGACAAGGGACGTGGCAGTGGTTGATGAGGTTTCCGTAGAGGAGGACGCGGCCACAGCATCGT 
GAGATGAAACTGCGGCGGCAGGTGGTGTGACGCCCACACCTTCGACCACTGCGGCTACAGGTGCTGCTTCCGCTCCAACCGGCGCCAAGTTAACT 
GGCGCAGCAGCAATGTCATCTGCAGCTGGCTGCTGTGCCGGCTGATCCGGCTGCTGAAATGGACGCCTCATGTACATGATGATGCCCAGCAGTAG 
AGT T AGAAC GGT AAT C AT GT AC AAGTC C CAATT GGC AGC AAT ATT T T C ATC C AT GT AG AAG AT G AAC CT GT AAGGT GACT AAAGGGGAAC AAC AT 
TAAAAAGCTAGTAAACAGTTTATCCATAGGGTTGGGTAATCACCTCCTTGATTGATTCAATCGTGGCCAACATTTGTAGCTTAAAAAGCGCAATG 
GCCACAGGCACTTTGGCGTCTGAATTGGTTTCCGCAGCCAAATCGTATAATCTCTTTGCCAGATGCCAGTCTTTCTTCATTCCGAGCCCTTGCTC 
GTGCATGTATCCCAAGTTGAACATGGCTTGGGCATTATATTGCTGCTCAGAGGCTTTCCTGTAGGGATTGATGCGGTTATAAGTCGTTTTCACAC 
TTCATAGTTCATATTCAACTAACCTGTAAAGAGCAGCGGCAGTCTCGAAGTCCGTCGAAGTACCCCAACCGTAGTAATAGTAATCGCCCAGCTTG 
ACTTGGGCCGCAGAATAGCCCTGTCCGGCTGCTCGTTTCCAGTAGTAGAAGGCACGAATCAGATCTTCGTGCCGATCGTTGAACACATGCACCTC 
CTTGCGGTCTAGTAAAAAGGCGGCATTACTTTGAGCCACCTCATAGCCCATTTCGGCCATCAACGAGTACTGCATGTAGGCCTCGTCGATGCGGT 
TATTTTTATAGTCGCTGTAGGCCATCATCAATCGGCTGCTCCAACGGCCACGCTCCGAAACAGTCTTGAAGAACTGGAATTAGCAAATATGAAAC 
GGGTTAGAAAATAATTAACATATTAATTGAACAATAAACTTACTTCTACTGCTGCTGGACATGAACGCAGCATTCCCATACCGTAGGCATTCATC 
ACTCCCAGATTATAATAGGCTAAAACATGGCCTGATTGGGTGGCCAGGTTGAAGTACTTAAAGGCTAACTTGTAGTCGGTTTTTACACCATTGCC 
AGCTGGAAAAGAAATTTAAATTGTATGGTTCCTAGAAACCCAATCAATAAAATGAAGGTTTATTTACTTACTGAAATACATGTTGCCCAGCTGCA 
GTTGCCCATCAACCCAACCTTGATCAGCCGCCTGAGTGAAATAGGATAATGCCTTGATGGAGTCTTTGGGAACACCTAAACCCTTTAGATACATG 
AGGCCCAGTCCGCTTTGACCAACTGGATCCCGCATTTCAGCGGCCTTGGAGAAGTACTAAGAAAAATATAATAATACATATTAGAAATATTAATC 
AAAAACTGTTATCTACATTATGCGCACCCCAAATGCTGCATCATTATCGGCCCTGATTTGATCACTTCCTTCCAGATAAAGCTTGCCTAAAAACG 
CGATGCCAACTGCACTACCTGCATCGGCGGCCATTTTGAAATATTCCAAAGCCTTTTGATGATCTTGCTCGGTGACCTTGCCGCCCTGATAGTAC 
ACCTGACCCAATCCCACCTGCGACTGGATGTCGCCTTTATCGGCCAGAAGTTTGTAATAGTCGACAATCTCAGTTTCGGAACTGCCGGGATTCTC 
TAGGTCATCCAGCAGTCTTACCCGATGCACAACCGGCCCATTGGCGAAGGTTATCTTGGAAGCCACTTTTTTGGCTACTCGCTTGTAGTGGATCA 
GCGCCTTCTCACAGCTAATGGGCACATTGATGCCATAAAGATAGCGATACCCCATGGCCATTTGCGCCGGAGTATTATCACCGAGGGCAGCCAGG 
CTATAGTGGATCAGTGCCAGGGGCTGGCTTACATTTTTACCGCCAACGCCCACCGAATAAAGGAATCCCAGGCCCATGTGAGCCTCTGGGACGCC 
TTCGTTGGCCAGACTCTCAAACTCATCGACAGCATGGTTGAAATCAAAGTCCATCCATTGACCCAGTAGGGCGGCCCAGGCTAGTTCCGCCCTCG 
CCTTGAGGTGATTAAATGCAGCAGCCTTCTTAAGCAAAATCATGGCTACTCGCTTATCCGTGCTTGGCTTGGCCAACATGTCCATTGCGTTGTCG 
TACACGGTTTGAGCTGGAAAAATTGCAATCAGCAACTGCAAAACAAATAAAACGATGCCCACTTACCTTCTATTTGGTCTGGGGTAAGTTCTAAC 
ACTTCAGTGGATTTGCGTTTGGCTTCGATGCGCTCGATCTCGGCTTTCAGCTTTTGCCGCCTGATCAGACTTTCGTTCCACAGGCGTTCGAACAG 
GGAAGCCGGCATAACCTTCGTCCTGGAAGCAAACTTTGTGGACTCCTTGGGTTTGTCTGCCTTGCGTTCGTTGCTTTCCGACGGAACCCGCGACT 
CTGATTCCGCCGTGGTCACCTCCTCCGCCTCCGCCTGGGTGGAGGGCTCATTGGGCAGCGTTGGCGTGGCTGTTCCCGTGCCCAATCCCGTTCCC 
GTACCCGTACCTGTGGCCGAACCCAAACTGCTGCCCGTACCATCAATGTTGGGCCCTTGGAGCTCCTCCGCTCGCAGCTGCCAGGCGAGAAGCAG 
CGCCAGCAGGCACATCCAGCTGTTGGCCTTCATCGTCGCCTATCTGCCTTTCTCGATGGCAATGTGGATGTACAACGGCAATTTTGGTCTTCGCC 
TTTGCTGCGTCTATATGGTGCTCTTCCACTTCTGACGTGTTGTGTTATTTACTGTTTTCTTGGGGCGCAGGGTCGGCAAAAAGAGTGGATTGCTG 
CTTCTGGGTGGATTCGTTTGACTTACTGGTAGCTCATTTTGGCCACCTCGACTTGGTGGTGGTTGGAATACCTTGGTTCGGTTTCCTTAATTTTT 
CACACGTAAGATTCTTTTTAATCTTTTTTGTTGCAATGCTTAGTGCTGCGAACGGGCCGAGGTTTGTGTGCCAGTGCTGCAAAGCACCGCGTCTA 
AAACAGTCTTTTCCAGGGGCGTCTCCAAAAATATTTTCGCGAATTAAAAAGGAGTAATAAAATCGAGACAGGCTTATAAATATTTAGTCAGGACG 
ATATCAAAACTTAATTACAACTTTTTTTTATTATTACAAGCTACATTTAATAATATTACAATTTGCCTAAGCCTGACGGTAAAACATAAAATTTT 
AAAAATATTTGTACATTTCTTTAAAACGTATATGCACCAAAATGTGGAAATTATGCTCAACAATTATTTCCCGGGCAACTGCGATAGTTAACGTC 
TAGAATGTCTTACCCCGAAAAATTCCCAGGGCTTGCTAGCTACGCCCCTGGATTTTCCGGCTGTTGTTTCGTTATGCAAAGGAACGCGCAGAGTT 
TAACCAGAAGGAAATCTGAATTGGAACTGTTGACGCGCAATCGAAAGTGATAATTACGAGCGGGAATAATTAGAATACGAGGAAGATTTCCGCCC 
GTGTGAATCACCAACTTTTGTCTCGTCCAATCCAAGTCAGCCACTGAGCAGCCGGCATGGAGATGAGTGTAAGTAGAAGCTGCGACCTGCGTGCC 
AGACACCCAGACCCAAAAGTCACAGCCCAAAACCAAATTGAAAACCGAAACCAGTTCCAACCCGCGCTTCAACCGAGTGAGTCGCACAGCGAATC 
ACTCTTCGAGCTCAGCCGTTAGTGCTGGTGAAAGTTACAGCTGTTGCGTTCGCTACGTTCGCTACCTAGTAACCGCCACCTGTGCCCCGATTCCT 
AATCCTCCAGTCGACACCTGTTATTTCAGGCTGCCATGTTCGCACGCGTTTCCTTCTACCCCACCCTGCTGTACAATGTCCTGATGGAAAAGGCA 
TCGGCCAGGAATTGGTACGATCGCATCGATGAGCATGTGATACTGGGAGCACTGCCCTTTCGCAGCCAGGCCAATGACGTAAGTATCCCCACGGA 
ACCAAGCCTGAAATCCTTCTAAATCCTTGTTTAACCGCTCAGCTCATTGAAAAGGAGAACATGAAGGCGGTGGTGTCGATGAACGAGGACTATGA 
GCTGACCGCCTTCTCCAACAACACGGAGAAGTGGCGAAAGCTTGGCATTGAGTTCCTGCAGCTGGCCACCACCGACATCTTTGAGTCGCCCAATC 
AAGAAAAGCTCTT 
(SEQ ID NO: 1126) 

Exon: 4333.. 3392 
Exon: 3338.. 2593 
Exon: 2526.. 2352 
Exon : 2282.. 2134 
Exon: 2068.. 1734 
Exon: 1673. .1469 
Exon: 1408.. 1001 

Start ATG: 3833 (Reverse strand: CAT) 
Transcript No. : CT28737 
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TAATATTATTAAATGTAGCTTGTAATAATAAAAAAAAGTTGTAATTAAGTTTTGAT^^ 

ATTACTCCTTTTTAATTCGCGAAAATATTTTTGGAGACGCCCCTGGAAAAGACTGTTTTAGACGCGGTGCTTTGCAGCACTGGCACACAAACCTC 

GGCCCGTTCGCAGCACTAAGCATTGCAACAAAAAAGATTAAAAAGAATCTTACGTGTGAAAAATTAAGGAAACCGAACCAAGGTATTCCAACCAC 

CACCAAGTCGAGGTGGCCAAAATGAGCTACCAGTAAGTCAAACGAATCCACCCAGAAGCAGCAATCCACTCTTTTTGCCGACCCTGCGCCCCAAG 

AAAACAGTAAATAACACAACACGTCAGAAGTGGAAGAGCACCATATAGACGCAGCAAAGGCGAAGACCAAAATTGCCGTTGTACATCCACATTGC 

CATCGAGAAAGGCAGATAGGCGACGATGAAGGCCAACAGCTGGATGTGCCTGCTGGCGCTGCTTCTCGCCTGGCAGCTGCGAGCGGAGGAGCTCC 

AAGGGCCCAACATTGATGGTACGGGCAGCAGTTTGGGTTCGGCCACAGGTACGGGTACGGGAACGGGATTGGGCACGGGAACAGCCACGCCAACG 

CTGCCCAATGAGCCCTCCACCCAGGCGGAGGCGGAGGAGGTGACCACGGCGGAATCAGAGTCGCGGGTTCCGTCGGAAAGCAACGAACGCAAGGC 

AGACAAACCCAAGGAGTCCACAAAGTTTGCTTCCAGGACGAAGGTTATGCCGGCTTCCCTGTTCGAACGCCTGTGGAACGAAAGTCTGATCAGGC 

GGCAAAAGCTGAAAGCCGAGATCGAGCGCATCGAAGCCAAACGCAAATCCACTGAAGTGTTAGAACTTACCCCAGACCAAATAGAAGCTCAAACC 

G TGT AC G AC AAC GC AA T GGAC A T GT T GG CC AAGC C AAGC ACG GAT AAGC GAGT AGC CAT GAT T T T GC T T AAG AAGGCT GCT G C ATT T AAT C AC CT 

CAAGGCGAGGGCGGAACTAGCCTGGGCCGCCCTACTGGGTCAATGGATGGACTTTGATTTCAACCATGCTGTCGATGAGTTTGAGAGTCTGGCCA 

ACGAAGGCGTCCCAGAGGCTCACATGGGCCTGGGATTCCTTTATTCGGTGGGCGTTGGCGGTAAAAATGTAAGCCAGCCCCTGGCACTGATCCAC 

TATAGCCTGGCTGCCCTCGGTGATAATACTCCGGCGCAAATGGCCATGGGGTATCGCTATCTTTATGGCATCAATGTGCCCATTAGCTGTGAGAA 

GGCGCTGATCCACTACAAGCGAGTAGCCAAAAAAGTGGCTTCCAAGATAACCTTCGCCAATGGGCCGGTTGTGCATCGGGTAAGACTGCTGGATG 

ACCTAGAGAATCCCGGCAGTTCCGAAACTGAGATTGTCGACTATTACAAACTTCTGGCCGATAAAGGCGACATCCAGTCGCAGGTGGGATTGGGT 

CAGGTGTACTATCAGGGCGGCAAGGTCACCGAGCAAGATCATCAAAAGGCTTTGGAATATTTCAAAATGGCCGCCGATGCAGGTAGTGCAGTTGG 

CATCGCGTTTTTAGGCAAGCTTTATCTGGAAGGAAGTGATCAAATCAGGGCCGATAATGATGCAGCATTTGGGTACTTCTCCAAGGCCGCTGAAA 

TGGGGGATCCAGTTGGTCAAAGCGGACTGGGCCTCATGTATCTAAAGGGTTTAGGTGTTCCCAAAGACTCCATCAAGGCATTATCCTATTTCACT 

CAGGCGGCTGATCAAGGTTGGGTTGATGGGCAACTGCAGCTGGGCAACATGTATTTCACTGGCAATGGTGTAAAAACCGACTACAAGTTAGCCTT 

TAAGTACTTCAACCTGGCCACCCAATCAGGCCATGTTTTAGCCTATTATAATCTGGGAGTGATGAATGCCTACGGTATGGGAATGCTGCGTTCAT 

GTCCAGCAGCAGTAGAATTCTTCAAGACTGTTTCGGAGCGTGGCCGTTGGAGCAGCCGATTGATGATGGCCTACAGCGACTATAAAAATAACCGC 

ATCGACGAGGCCTACATGCAGTACTCGTTGATGGCCGAAATGGGCTATGAGGTGGCTCAAAGTAATGCCGCCTTTTTACTAGACCGCAAGGAGGT 

GCATGTGTTCAACGATCGGCACGAAGATCTGATTCGTGCCTTCTACTACTGGAAACGAGCAGCCGGACAGGGCTATTCTGCGGCCCAAGTCAAGC 

TGGGCGATTACTATTACTACGGTTGGGGTACTTCGACGGACTTCGAGACTGCCGCTGCTCTTTACAGGAAAGCCTCTGAGCAGCAATATAATGCC 

CAAGCCATGTTCAACTTGGGATACATGCACGAGCAAGGGCTCGGAATGAAGAAAGACTGGCATCTGGCAAAGAGATTATACGATTTGGCTGCGGA 

AACCAATTCAGACGCCAAAGTGCCTGTGGCCATTGCGCTTTTTAAGCTACAAATGTTGGCCACGATTGAATCAATCAAGGAGTCACCTTACAGGT 

TCATCTTCTACATGGATGAAAATATTGCTGCCAATTGGGACTTGTACATGATTACCGTTCTAACTCTACTGCTGGGCATCATCATGTACATGAGG 

CGTCCATTTCAGCAGCCGGATCAGCCGGCACAGCAGCCAGCTGCAGATGACATTGCTGCTGCGCCAGTTAACTTGGCGCCGGTTGGAGCGGAAGC 

AGCACCTGTAGCCGCAGTGGTCGAAGGTGTGGGCGTCACACCACCTGCCGCCGCAGTTTCATCTCACGATGCTGTGGCCGCGTCCTCCTCTACGG 

AAACCTCATCAACCACTGCCACGTCCCTTGTCGATGCAGCGTCACAAGGGGGAAGTGGCACGACATCTGCGGCGCCGGATGGCGCTAACACACTC 

GATCCCACTGGTTAG 

(SEQ ID NO: 1127) 

Start ATG: 501 (Reverse strand: CAT) 

MKANSWMCLLALLLAWQLRAEELQGPNIDGTGSSLGSATGTGTGTGLGTGTATPTLPNEPSTQAEAEEVTTAESESRVPSESNERKADKPKESTK 
FASRTKViyiPASLFERLWNESLIRRQKLKAEIERIEAKRKSTEVLELTPDQIEAQTVYDNAMDMLAKPSTDKRVAMILLKKAAAFNHLKARAELAW 
AALLGQWMDFDFNHAVDEFESLANEGVPEAHMGLGFLYSVGVGGKNVSQPLALIHYSLAALGDNTPAQMAMGYRYLYGINVPISCEKALIHYKRV 
AKKVASKITFANGPVVHRVRLLDDLENPGSSETEIVDYYKLLADKGDIQSQVGLGQVYYQGGKVTEQDHQKALEYFKMAADAGSAVGIAFLGKLY 
LEGSDQIRADNDAAFGYFSKAAEMGDPVGQSGLGLMYLKGLGVPKDSIKALSYFTQAADQGWVDGQLQLGNMYFTGHGVKTDYKLAFKYFNLATQ 
SGHVLAYYNLGVMNAYGMGMLRSCPAAVEFFKTVSERGRWSSRLMMAYSDYKMNRIDEAYMQYSLMAEMGYEVAQSNAAFLLDRKEVHVFNDRHE 
DLIRAFYYWKRAAGQGYSAAQVKLGDYYYYGWGTSTDFETAAALYRKASEQQYNAQAMFNLGYMHEQGLGMKKDWHLAKRLYDLAAETNSDAKVP 
VAIALFKLQMLATIESIKESPYRFIFYMDENIAANWDLYMITVLTLLLGIIMYMRRPFQQPDQPAQQPAADDIAAAPVNLAPVGAEAAPVAAVVE 
GVGVTPPAAAVSSHDAVAASSSTETSSTTATSLVDAASQGGSGTTSAAPDGANTLDPTG* 
(SEQ ID NO: 1128) 

Classification: hypothetical 
Gene Symbol: BcDNA:LD23587 
FlyBase ID: FBgn0028475 



Celera Sequence No. : 142000013384857 

AGCAGTTTTCGTCCGATTTTGTCGCATATCCGAAAAGTATGCTTTAAATATGATATCTTTTTTATACCAATTCCTCGTGGAGTAAAATGTTATAC 
TAAATTCGTAAAGGCCACAATTTTGAAGTCTTGAGATTTTGTTGATATCATTCTATCGATAGCCAAAAACTGTCACACCCTATTGTTCTATAAAT 
ATTTTGAATTTATACTATTTTAACACAGTTTGGAAGAAGTATTTATTTTTTGAATCAATTTAAAAATGTTATGCAAATTTCCAGCGAGATTCCAT 
AAGAATTTTGAATTGATCCCCCGCACCCACAATAGCTAAGTAAGAGGTATCTGATAGTCGATGATACCGTTCCGTTTTCCCTTTGTTTTCGTTTT 
GAAATAAACTGACAGACTACGTGTCCTGGATCCTATTTAGTTTCTATTTACAAAAAAATAAAAGATAAAATATACCACAGCCAAGAACTTTTAAT 
TTAATAAGAAACAGCGATTGATTAGCTTAGGTCCATTACGTTTATTTTGTTGAAAGTAAATTAAATTTAGGTTACACTGCCACAGCATTTTTAAA 
TATATATATTTGCATCTATGCCCTAATTGATTTCGTTTTTATGGGAACAGCATTTTTGTTGCATAATTAATTATTTTTGGGAAAAAATAATCACA 
AATGTGTGTGAGTGTGGCATCGGTTTTGACTTGAAATATAATTTGATCATATTTTAAAGACTCTGCTCTTCAAAGAAAAGATGAGTTGACAATCT 
ATTGCATCAATGGATTTCAGTTTCATATGCTACACAGTAAACTTAAAATTAGTTAAAACGAGAATAATGTACAATATACAGGCCAGTGCCTCTCA 
ATCATTTGATTTAATTGTTTTCATTGTAATAGTATTGAAAAGCAGTTCTCAAAAACATCGAAATAAAAAGACGATCTTAGCTAAAACGTAACTAT 
GTTCGTGATTCAGTTTAATAGGTGTGCTGAGTGTTGCTTTAATATTTTGTTTTTTTTTTTATTTTGTAGAACAACTGAATCAAAGATTCATGCGA 
ATCAGGTAAACAATTTAATCTTAGATGTGGTAACTTGGTATTTTTTCTCTTCCATCTATCGTTTACATTTTTGGTTGGGAGTTTGCCGGCGAGGC 
TGTATTTCGTTTGCCAATTATTGGGTTAATTGTAAAATGTATATAAAATAAATCACAATTTCTATACGCACATAAATGACGCGTGTGTGTGTATC 
GTGTCTGTGTGGATTTTTGTGGGTGGGTGCCCTGGAAGTGGGACAGCGCACAGTCCAACACAAGGTTCGGCTACATATTTATTAATATTAATAAT 
AAACATATGTTAATTGTTTAAAACAAAGAAGCTGGCAGCCACAGCGCTGATCGCGAACCCAAACGAAAAACTCGTACCCAGGCTGGGCTCATGTC 
TCATCTCTCAAGGATCGACGACCACCTACAACCTACAGGCTAAGGACTCGCGGCGAACTAGCTCGCCTCATATGAGAAACTGGCGATCGGATGCG 
CTGTGCGGCAGGGCGGCCACGCCTATGCCGCCTTGCGTGGGCGTTTGCAGAAACGCCGAATCGTCGACTGTTGCTGAGTCAGCTGCAGGCGTGCT 
GATTGCTATCGGTTCGTAGATCGAACGATGCGGTGGCATATCGTCTTCGTCCTTAAAGGAGCCGCTTAAATTCGGATTGAGGATGTTGTTCGATG 
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CGGGCGAGGCTACAGCAATTGTCCCCGTCACCGGCAGACCCATGGCACTGGTACCGGACTCAACGCGCTCCAGATAGGGTGGCGCCGCCACACTA 
ATAACAGCTGGCTGCCGGCGGAAGAACGAGCTAAAAGCAGTAAAAACAGGAAGTTATGCACCTTCTACCAGTGAGCTTCATCACAAATATATAAT 
TACGCAAAATTTATTCTCGACCAGACCGAGCGACGACTCTCAGTTAACTGTTCGGCTAGGTGGGCCGGTAGATTCGGAGCACGATCAAAGGGATT 
GATGCGTTCATGTGCTCCAGAAGACGTGGCCGGGCGCACAGTAGAGGCCAGCAGAGCTGTACAAGTAATAGGGTGTGTTATTTAAAAAAATCAAA 
AAAAATATTTCGGCAGCAGACGTACCGTTCTCATCGTCAGAGCTTTGGGACTCCTCGAACGGATTTCGACCGGCGTGACCACGGCGGAAGGTTCC 
ATGTCGCGTGGTTCCAGCGACAGCCGCCGCCGCAACGCTTGAGCTGCTCCCAGCTGCTCCACCAGCCGAAGAAGCCGAGCTCACTTGACCCACCT 
GCCTACCATTGGACTGCTGTTGCTGCAGCAGCGGCGTTGTGTCGTCATCCGTATCCGTGACATTATCCAAAGAAGGCTGTCGACTCCGACTTGCA 
CGCGCTTCGCCCTTCGTAAAAACCTTGCGCTTACAAATGGGACACACCCGACGGTTTTCGGTAAGCCATGGATCAATGCAGTGCGTGTGATAGGC 
TGAAAGTAATAACGAATTAATATGTTGTCTTTGCATTAAAGAATTTTCTGGCGATGTCATATTAGTTTAAGCATTTGTATATGATATATTCTGAA 
ACTACTAAAGCTGAGCAAATAACCAAGACTGTATCCCTTACGATGTGAGCAGGGCAGCACGCGAAGCTTATCGTCCTCTATGAAGTCCTCCAGAC 
AGATCACACAGGTGTCGTACTTGTTGTTTGCGTTGTTCTTGGTATATCGGAGCACTGGCAATTTCTTCAGCATGCTCTTTGGCAGTCGATGGCGA 
CGCAAGCGGCGCTGCTCGCGGATGCACTTGTAGATCATATAGATAACCTGTTCGGCAATCAAAGCAATTCAGTTGTGTAAAACAGCGTTAAAACT 
CCAGATTTCTTACCATTATGATGAAGCACATACCGATCAGAATGGAAAACGGCAATATTAACTGTGTATTGATATTGAAGGGCAGCTCATCGTTG 
ATGATTAGCACCACCTCGGTCGTAAAGTAAGTGGCCAGCGCTTTACCCGTCGTGTGGCCCACGAAAACCGAGGGTATTCTTATACCCGTTATGTT 
TTCGGCGGACATTTGCTCTGTAATTGAATATGGGTTTAAACAAATCATATTTAGACCTGAGCACCTATGTGTAATCACTTACCCAAGTCATCGCC 
TTCATTATTGTAAACAATCACAGCAGAATAACTCGCATTCTGGGCCACTCGTATCTTACGCTCAAAGACGCATTCGCCACTACAAAAGATTTGAG 
TTAGGTCGAACTGAAGTGTTTATATCTACCACATATAGCTTACCGCGCTACAAGTGCCACAAATTTGGCTGAGGGTGGGTATTTCAGGTGCGGTG 
GTCTGTCCAGGCTATCACAACCGTAATAAGGGCGCCTAGCGGGAACCACATACACTTTGAGACCATTTGAAGGCAGATTGGGACCGAATTGAGCT 
GGCAGATCATTGAACTCCTCGATCAACTGTGAGTAAACAATATACGATGGGCGTAATAAGTATTTGCAATTTGTAATAATATTTCTAAAATATTA 
GCAAGTTAAACTTGTCATTAATTGAACTTGCCTGGCTGGTTGCTTTCCGGTAAACCAGCACGTGGCCCCCGACCAAGGTCGCCTCATGACAGACC 
AGGCACAAGCCCAACAGGGTGAGTATCTGGCAACTTCTCTTGGACATTCTGTCGCGGAGGACACACGTTCTGGAGATGATCGGGACTTGGTCTGT 
ACTGTTATTGCTCTAGTCCGCTATGTCCAGTGCTGTAGCGATTTGTCTTCAACATCCGCACCATTCGCATGTCTCATCTGTGTCCCACCGCCAGC 
GATTCTGTGCGCTTGGACTTCGGACTTAAAACAAAAGAAAATGTTTTTTAACATATTATGAATGATTATTTATTTGGGTGCTATAACTATTTATA 
TAAATAATAATAATAATTTTTGCAACTCGAGTATTATCCTGCGTGTAGTCAACCTAGGCGTAAACCTCTCCGCCTACGAGGCCATGGCAACAAAG 
AGCTCAGCTGATAAGATAAACGGCCAACTGGGGGTATTGTTGATTTGCTTACTCTGAAAAAATTTGCGTTTCTTCAGTCACGCTGCACTTACAGC 
TGGTTTGTTTGTTGACACAGTCAGGCAGACGACGGTTCGTTCGGGTTTTCGCGAATGCATTAATGGTTACTTCAAACAGAACAGAAAAATCAACG 
CGTTTCGCTCCTTGGCTGCTATTGTACTCAATTTGCCATCAATTAATTTTATCGCGATCAAATAATAGTGGCCGAGATCTTGGGCGGCAGCTGCT 
GTAGTCGGAACAAATAGGTTGATGAGTCACGAACACCGCTCCATGCTAAAACCAGAACCCAGACCGCATGCACATTTAACTATTTAGCGTCGACG 
CACCGTCGGTCCACAGTTGGGGCAGAAAAAGTGGAAAGTTTCTATTCGTACATATATTACAAATTCTGCTAATACACAGCCGAGGGGAGGTGTGC 
AAATATGTCACAAAATGTTTATTAAAGTTACGATTTGACAAAGAAAACGTCACTGGCATTAGCACATCGGTAGACAATCGGAGCAAAGGCTCCAA 
TGAGAACGTGACTTTGTTTTTATTGGATGGTTAAACAATAAATGGGAGGCCGACATTCGTTGTTCCAATGAGAAATTCAATCTGGGGAGGCGCGT 
GCGTTTAATAGAATTACGCCCTACGGATTGCCCAACAGTCTTTGCATTCAGGTATGAGTTGGTGATCCTACCATGTAACATTCTATTTCCATTTC 
ATGGCATGTTATAAAGAAACGACTACGGTGTAGAAGGAGTTTGTTTTTAAGATGATTTCAACTAGAATGCTATTCCTTTGACAAACAAGTAGTTC 
AACAAATTATTAAAATGTTGGTTACTATCGCATATGTTTCTACACAATTTTTTAGCAACTTGGCGATCTTTTGCCAATTAATAGTATGGTCATTT 
CGCAATTTTCAAAACTTGTATATAATTTTCCTTGCAAAAGCGCTTGCAAAAAAGCTAGAGTAGCTATCTGGATATGTAGAGATACATATGTATTG 
ACTGGATTGTTTTAAACAATCCTGACATACGGATTGAATGTTTAGCGACGAGGAATGCAATTCGCGTTAATTCGCTTTGTTTGGAAGGCCGATAT 
CAATTTACACATTACACAGCGTACGCAGCACCCACCCACCAACACACAAACACGCGCACACACACACACCTTGAACTTTACAAGGCACGAGTGTG 
GTTTCAAGGTTTTCCTTTTTGGGGGAGGGCATACGCTCACGCAAACACGCATACATACATACATAGGCACTAGGCAGGTGTTTCCTGAATGGTGT 
ACAACGGTGGGTTACAGGTGAAAAGCGGCCTCGATTGGAGGCAGCAGCCCACAAGTTATTGCACTTAAATTTCGTATATTCGCTTTTGTTGTTTT 
TTTTAGGCGCCTCTGTGTGCGTGCCTGCTACTGTGCTTGTGTGCGTTCAATTAGCACAACAACAAAGCAGCTGAGCGATTTCCTTTTGCTTTCCC 
GACGCCCATTCGCGTAAATCCACTGGTCACTCACTTGATTAGGGCTTCTCGTGCTCTTAGTTGACCGTTTGGAGCGACTTCTGGCCAGTTCCTGT 
ATCCAATCCTATTTTTTCAGCACTCGCTCGCCTCCTCACTCGTCGCCATCGCACTGGTATTGCGTTGTACTTTGTTTTGCTCCCTTTTGGGGCTC 
GGACTCCTCGGGTTCCTTTAGTTGGTGGTCGTGGTTCAGCGGGGACAGCGCAGTGGCTGCGAAAAGAGCATGACAGTCGCGATAAGAACGCTTTG 
GGTTTGGTTCACTTTTCTAATTATCGGATATTAGAATTTACATCCCACCCGAACACAGACACATATACAGGGCTAGAGCTGGAACTAAATCGATG 
TATAGCTGCTGCTGGCTTCCTGACGGCGATGTTTTCGTACATCGATTACTCCCATATTTCGATAGCTATCGACTAAAATTTTAAAAATATTAAAA 
TCCAGTATTCTCAGCAACTGTTTCAGATCATTGTACAGAATTTAGAATTCTCAACTTCAGAACCTGAAACCATATAGAAATATAATACTAAGATA 
TAGCATTAATAACTAGTTTTTAAGATTCTAGAAGTACAAATACAATGAATTGGTTAAAATCAGTTTCTGTGCAAAATACTACTTGCCTTTTCGAA 
TTAACCAAGAAAATTAACCGATTTTCAGTGGTGCGTTTATATTTTGTGTTGAACAATAATGCATTTGAGTTATGAAACATTAAAATCACATGACT 
GCGGGCTACTTGATCCAGAGATACATTTTCCCCTGGATGGTGTGCTTGACGGGATTGATCTTCTTCAGGATTTTCGTCATGGTCTCGACCAAACG 
GTCGCTGGAGACAGGCGTCTTCTTGTTCTTGAACTTGGTGAGCAACTCGGTGGCGGTCAGGGGCTTTCGCTTAAGATACCGACGCACCGCCTCCT 
CGGTGATGCCGTAGTCCCTAAACAAAATACATGAAATTAGAATTAGGAATTCCAGTATATCACGGCTTTACTCACTCTACTTTGCCGCCAGAGAA 
GGAAGTGGGCAGCGATGTGCTAATCTCGTTCTTAGGCCTCTTTGCTGGCGTGCTGGTGGGACTGTTGCTTGTGTCCGAAGCGGTGAGATCGCTGG 
GTAAGCTGTTCATCTTGCGCTTGCTGGCATCCGTCGAGAGCGTTGGAGTGGCCGAGCGAGATTTATTGGCATTGCTGCTGCTGGCGATGACCTTG 
CTGCTCGCAGCGCTCTCTTTCTCCTTTTCCTTTTCCTTGTCTTTGTCCTTGACGACGACCTTCTTCTTGGGCGGTCCGTTGCTAAGGTCGTCCTC 
GGAGTCTG 

(SEQ ID NO: 1129) 

Exon: 5848.. 5545 
Exon: 3824., 3547 
Exon: 3446.. 3274 
Exon: 3214. .3123 
Exon: 3057. .2864 
Exon: 2802.. 2 607 
Exon: 2469. .2116 
Exon: 2051. .1904 
Exon: 1835.. 1001 

Start ATG: 3657 (Reverse strand: CAT) 
Transcript No. : CT28877 

TTCGGGTGGGATGTAAATTCTAATATCCGATAATTAGAAAAGTGAACCAAACCCAAAGCGTTCTTATCGCGACTGTCATGCTCTTTTCGCAGCCA 
CTGCGCTGTCCCCGCTGAACCACGACCACCAACTAAAGGAACCCGAGGAGTCCGAGCCCCAAAAGGGAGCAAAACAAAGTACAACGCAATACCAG 
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TGCGATGGCGACGAGTGAGGAGGCGAGCGAGTGCTGAAAAAATAGGATTGGATACAGGAACTGGCCAGAAGTCGCTCCAAACGGTCAACTAAGAG 
CACGAGAAGCCCTAATCAATCCGAAGTCCAAGCGCACAGAATCGCTGGCGGTGGGACACAGATGAGACATGCGAATGGTGCGGATGTTGAAGACA 
AATCGCTACAGCACTGGACATAGCGGACTAGAGCAATAACAGTACAGACCAAGTCCCGATCATCTCCAGAACGTGTGTCCTCCGCGACAGAATGT 
CCAAGAGAAGTTGCCAGATACTCACCCTGTTGGGCTTGTGCCTGGTCTGTCATGAGGCGACCTTGGTCGGGGGCCACGTGCTGGTTTACCGGAAA 
GCAACCAGCCAGTTGATCGAGGAGTTCAATGATCTGCCAGCTCAATTCGGTCCCAATCTGCCTTCAAATGGTCTCAAAGTGTATGTGGTTCCCGC 
TAGGCGCCCTTATTACGGTTGTGATAGCCTGGACAGACCACCGCACCTGAAATACCCACCCTCAGCCAAATTTGTGGCACTTGTAGCGCGTGGCG 
AATGCGTCTTTGAGCGTAAGATACGAGTGGCCCAGAATGCGAGTTATTCTGCTGTGATTGTTTACAATAATGAAGGCGATGACTTGGAGCAAATG 
TCCGCCGAAAACATAACGGGTATAAGAATACCCTCGGTTTTCGTGGGCCACACGACGGGTAAAGCGCTGGCCACTTACTTTACGACCGAGGTGGT 
GCTAATCATCAACGATGAGCTGCCCTTCAATATCAATACACAGTTAATATTGCCGTTTTCCATTCTGATCGGTATGTGCTTCATCATAATGGTTA 
TCTATATGATCTACAAGTGCATCCGCGAGCAGCGCCGCTTGCGTCGCCATCGACTGCCAAAGAGCATGCTGAAGAAATTGCCAGTGCTCCGATAT 
ACCAAGAACAACGCAAACAACAAGTACGACACCTGTGTGATCTGTCTGGAGGACTTCATAGAGGACGATAAGCTTCGCGTGCTGCCCTGCTCACA 
TCCCTATCACACGCACTGCATTGATCCATGGCTTACCGAAAACCGTCGGGTGTGTCCCATTTGTAAGCGCAAGGTTTTTACGAAGGGCGAAGCGC 
GTGCAAGTCGGAGTCGACAGCCTTCTTTGGATAATGTCACGGATACGGATGACGACACAACGCCGCTGCTGCAGCAACAGCAGTCCAATGGTAGG 
CAGGTGGGTCAAGTGAGCTCGGCTTCTTCGGCTGGTGGAGCAGCTGGGAGCAGCTCAAGCGTTGCGGCGGCGGCTGTCGCTGGAACCACGCGACA 
TGGAACCTTCCGCCGTGGTCACGCCGGTCGAAATCCGTTCGAGGAGTCCCAAAGCTCTGACGATGAGAACGCTCTGCTGGCCTCTACTGTGCGCC 
CGGCCACGTCTTCTGGAGCACATGAACGCATCAATCCCTTTGATCGTGCTCCGAATCTACCGGCCCACCTAGCCGAACAGTTAACTGAGAGTCGT 
CGCTCGGTCTGGTCGAGAATAAATTTTGCCTCGTTCTTCCGCCGGCAGCCAGCTGTTATTAGTGTGGCGGCGCCACCCTATCTGGAGCGCGTTGA 
GTCCGGTACCAGTGCCATGGGTCTGCCGGTGACGGGGACAATTGCTGTAGCCTCGCCCGCATCGAACAACATCCTCAATCCGAATTTAAGCGGCT 
CCTTTAAGGACGAAGACGATATGCCACCGCATCGTTCGATCTACGAACCGATAGCAATCAGCACGCCTGCAGCTGACTCAGCAACAGTCGACGAT 
TCGGCGTTTCTGCAAACGCCCACGCAAGGCGGCATAGGCGTGGCCGCCCTGCCGCACAGCGCATCCGATCGCCAGTTTCTCATATGAGGCGAGCT 
AGTTCGCCGCGAGTCCTTAGCCTGTAGGTTGTAGGTGGTCGTCGATCCTTGAGAGATGAGACATGAGCCCAGCCTGGGTACGAGTTTTTCGTTTG 
GGTTCGCGATCAGCGCTGTGGCTGCCAGCTTCTTTGTTTTAAACAATTAACATATGTTTATTATTAATATTAATAAATATGTAGCCGAACCTTGT 
GTTGGACTGTGCGCTGTCCCACTTCCAGGGCACCCACCCACAAAAATCCACACAGACACGATACACACACACGCGTCATTTATGTGCGTATAGAA 
ATTGTGATTTATTTTATATACATTTTACAATTAACCCAATAATTGGCAAACGAAATACAGCCTCGCCGGCAAACTCCCAACCAAAAATGTAAACG 
ATAGATGGAAGAGAAAAAATACCAAGTTACCACATCTAAGATTAAATTGTTTACCTGATTCGCATGAATCTTTGATTCAGTTGTTCTACAAAATA 
AAAAAAAAA 
(SEQ ID NO: 1130) 

Start ATG: 472 (Reverse strand: CAT) 

MSKRSCQILTLLGLCLVCHEATLVGGHVLVYRKATSQLIEEFNDLPAQFGPNLPSNGLKVYVVPARRPYYGCDSLDRPPHLKyPPSAKFVALVAR 
GECVFERKIRVAQNASYSAVIVYNNEGDDLEQMSAENITGIRIPSVFVGHTTGKALATYFTTEVVLIINDELPFNINTQLILPFSILIGMCFIXM 
VIYMIYKCIREQRRLRRHRLPKSMLKKLPVLRYTKNNANNKYDTCVICLEDFIEDDKLRVLPCSHPYHTHCIDPWLTENRRVCPICKRKVFTKGE 
ARASRSRQFSLDNVTDTDDDTTPLLQOQQSNGRQVGQVSSASSAGGAAGSSSSVAAAAVAGTTRHGTFRRGHAGRNPFEESQSSDDENALLASTV 
RPATSSGAHERINPFDRAPNLPAHLAEQLTESRRSVWSRINFASFFRRQPAVISVAAPPYLERVESGTSAMGLPVTGTIAVASPASNNILNPNLS 
GSFKDEDDMPPHRSIYEPIAISTPAADSATVDDSAFLQTPTQGGIGVAALPHSASDRQFLI* 
(SEQ ID NO: 1131) 

Classification: hypothetical 



Celera Sequence No. : 142000013384547 

GCGCGGCTCTCCTCATACTCCCGGGCCACATCCGTCAGGCTCTTGGCCTCGGCGGACTCCTTGTTGTTGCTGCTGCTGCTGCAAATGGATGAGGA 
GGCAGCGGCCTCGCTGGTCTCTGTGGTCTGGGCAGCGTTCTGAATGACGCTTGAGAAGAGCAATGGCGCCGAGGACGACGTAGCAGCGGTGGAGG 
AGGAGGCAGCCTCCTGGGCTGATACCGAGCTGGCTGCCGTATCTGCATCCGGCTCAGCGGTGACCTGTTCAGCATTGGGGGCCACCACACGTTCA 
TGCACATTCTGGCCGAAGACGAACCCGCTGGACTGAACATTTGGCATGTTGTTCTTGGCTGCTGCAAACATGCTGGAGCGCTCAATGCCATTGCT 
ACGCAGCTTGGTCAGCGGATCCGGCCGCTCGTCGACATCGTCGTCCTCCTTGTCGCCAGTGGACTCGGCTGCCGCAGATCCGGTGGCCACCACCT 
CCTCGTCATCGTCGTCGTCCTTGGACTCGCGCAGAAACGGATTATTCAGCAGGGCGCTGTCTTCACCTGCACTTGGGGATGTGGAACTATTGTTG 
GAGCCGCTGCTGTTTGCCAAGACTGCTGGGCGCAGGACAGCACCGCCCAGGCGAGAAGCCTTCAAGGTGAAGCCACCTCGGAAACAGTTAGTTTC 
GACTGACGCTTCTGGAAACAAGGGGGAAAAAACAAGACTCGTTTGTCAGTTTGTTGCGAACTCTTCAAGGGCACAAGATTCGCGATATTTGGCAA 
CTTTAAGTTATAGCAAGCATTACAGCTTGTTTTGCTACGCTTACCATTATTTTCAGACATTGCAAACTAGTCTACTTGCAATTTTAAAATATAAA 
TTGCCAAATCTTTTACAGAATAGAGCGTTCTGTGTAAAGCCTGCCGTATTGTTATCGCTGTGTTATCGCTGTCACTATCGATAAGCGCCGGCGAT 
GGCGCTCTGTTGTCAACACTGACTTTTTCAGTGTTGCTTACTTGCCGCTGAAAAACAAGAAATTTACAAGTCATCGACAGAGTGTTTGCCCAAAA 
AGATCGGCAATAATAGCATTTTAACAAAGAATTACATTCAGCAACGGCTAATCAACAATCGTATTAGACATGGCTCAGACTCTGGAGGATAAGTC 
AATCTGGGAGGACGGGGAGGAGTCCCTGGGCGAGGAGGTGATGCGTATGTCCACCGATGAAATTGTGAGCAGGACTCGCCTGATGGACAACGAGA 
TCAAGATCATGAAGAGCGAAGTGATACGCATAACGCACGAGATTCAGGCGCAAAATGAGAAGATCAAGGACAACACCGAAAAGATCAAGGTAGGG 
ATGACAATGAAACGCCACCCTGCCAAATAGCCTCATTTGTTAAGTACTTCCCCATTAGGTCAACAAGACACTGCCCTACCTGGTGTCCAATGTAA 
TAGAGCTGCTGGATGTGGATCCACAGGAGGAGGAGGACGACGGCTCCGTCACCGTTCTGGACAACCAGCGAAAGGGCAAGTGCGCCGTCATCAAA 
ACATCCACTCGACAGGCGTATTTCCTGCCCGTCATCGGCCTGGTTGATGCCGAGAAACTGAAGCCAGGTGATCTGGTCGGAGTCAACAAAGACTC 
CTATTTGATCCTGGAAACCCTGCCGGCAGAGTATGATGCCCGCGTAAAAGCCATGGAAGTGGACGAGCGACCTACAGAGCAGTACTCCGACATTG 
GAGGTTTGGACAAGCAGATACAGGAGCTCATCGAGGCAGTGGTGCTGCCCATGACGCACAAGGAGAAGTTCAAAAACCTTGGCATTCACCCACCC 
AAAGGTGGGTATCTCACTCGAACAAATGTCACACAACCATATTAATCCTTGCCATTTACAGGTGTCCTTTTGTACGGCCCCCCTGGAACTGGCAA 
AACATTGCTGGCCCGTGCCTGCGCCGCTCAGACAAAGTCCACCTTCCTCAAGCTGGCAGGTCCCCAGTTGGTGCAAATGTTCATCGGTGATGGTG 
CCAAGCTGGTGCGCGATGCCTTCGCCTTGGCCAAGGAGAAGGCGCCTGCCATTATCTTCATTGACGAGTTGGACGCCATTGGCACCAAGCGTTTC 
GATTCGGAGAAGGCCGGTGACCGTGAGGTCCAGCGAACTATGTTGGAGCTTCTTAATCAGCTCGATGGCTTCAGTTCCACTGCAGATATTAAGGT 
GATTGCGGCCACCAATCGCGTAGACATTCTGGATCCTGCTCTGCTGCGCTCTGGTCGTCTGGATCGTAAGATCGAGTTCCCACATCCCAACGAGG 
AAGCCCGTGCCCGTATTATGCAGATTCACTCGCGTAAAATGAACGTTAGCAATGATGTGAATTTCGAGGAATTGTCCCGATCCACGGATGACTTC 
AACGGCGCCCAGTGCAAAGCCGTCTGTGTGGAAGCTGGTATGATCGCACTGCGTCGCTCCGCCAATTCGGTTACGCACGAAGACTTCATGGACGC 
CATCATGGAAGTGCAGGCGAAGAAGAAGGCTAATCTTAACTACTACGCTTAGACTAAAAATTTGCAGTTTAACTTCACATTGTCAGCTCCCATTA 
AAATAAACCATGGAAATGTAAAATTTTTCAGTTTCGATCTGAGTTGCCCCTTGAAGATCGCCTTTAAAATAACATCGCAACGGCTCTTAAAAGTA 
ACTAGAATGGGAGACGTACTGAAATCAATACCCAACCTTTTTAAAATAAACAAAAATATATATCAAATGTCGAGCGATTCATTGGTTTATCAGTA 
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CAACATTTACATTCTGAGTACGATATTTTTTAAATATCTAAAAAGGTCGGTTTTAGAATTAATAAAAAATATTACATCGGGCGAAATTAGATGAT 
AAATATTTACACGCACACTAATTAACCTAGAGGGATGATTTACCAAGTTACTAAGACTAACCAGTGGGATCGAGTGTGTTAGCGCCATCCGGCGC 
CGCAGATGTCGTGCCACTTCCCCCTTGTGACGCTGCATCGACAAGGGACGTGGCAGTGGTTGATGAGGTTTCCGTAGAGGAGGACGCGGCCACAG 
CATCGTGAGATGAAACTGCGGCGGCAGGTGGTGTGACGCCCACACCTTCGACCACTGCGGCTACAGGTGCTGCTTCCGCTCCAACCGGCGCCAAG 
TTAACTGGCGCAGCAGCAATGTCATCTGCAGCTGGCTGCTGTGCCGGCTGATCCGGCTGCTGAAATGGACGCCTCATGTACATGATGATGCCCAG 
CAGTAGAGTTAGAACGGTAATCATGTACAAGTCCCAATTGGCAGCAATATTTTCATCCATGTAGAAGATGAACCTGTAAGGTGACTAAAGGGGAA 
CAACATTAAAAAGCTAGTAAACAGTTTATCCATAGGGTTGGGTAATCACCTCCTTGATTGATTCAATCGTGGCCAACATTTGTAGCTTAAAAAGC 
GCAATGGCCACAGGCACTTTGGCGTCTGAATTGGTTTCCGCAGCCAAATCGTATAATCTCTTTGCCAGATGCCAGTCTTTCTTCATTCCGAGCCC 
TTGCTCGTGCATGTATCCCAAGTTGAACATGGCTTGGGCATTATATTGCTGCTCAGAGGCTTTCCTGTAGGGATTGATGCGGTTATAAGTCGTTT 
TCACACTTCATAGTTCATATTCAACTAACCTGTAAAGAGCAGCGGCAGTCTCGAAGTCCGTCGAAGTACCCCAACCGTAGTAATAGTAATCGCCC 
AGCTTGACTTGGGCCGCAGAATAGCCCTGTCCGGCTGCTCGTTTCCAGTAGTAGAAGGCACGAATCAGATCTTCGTGCCGATCGTTGAACACATG 
CACCTCCTTGCGGTCTAGTAAAAAGGCGGCATTACTTTGAGCCACCTCATAGCCCATTTCGGCCATCAACGAGTACTGCATGTAGGCCTCGTCG 
(SEQ ID NO: 1132) 

Exon: 1001.. 1324 
Exon: 1389. .1809 
Exon: 1867.. 2894 
Start ATG: 1115 

Transcript No. : CT29122 

AAAAACAAGAAATTTACAAGTCATCGACAGAGTGTTTGCCCAAAAAGATCGGCAATAATAGCATTTTAACAAAGAATTACATTCAGCAACGGCTA 

ATCAACAATCGTATTAGACATGGCTCAGACTCTGGAGGATAAGTCAATCTGGGAGGACGGGGAGGAGTCCCTGGGCGAGGAGGTGATGCGTATGT 

CCACCGATGAAATTGTGAGCAGGACTCGCCTGATGGACAACGAGATCAAGATCATGAAGAGCGAAGTGATACGCATAACGCACGAGATTCAGGCG 

CAAAATGAGAAGATCAAGGACAACACCGAAAAGATCAAGGTCAACAAGACACTGCCCTACCTGGTGTCCAATGTAATAGAGCTGCTGGATGTGGA 

TC C AC AGG AGG AGG AGG AC G AC GG CT C C GT C AC C GT T CT GG AC AAC C AG CG AAAGGGC AAGT GCG CC GT C AT C AAAAC AT C C AC T C G AC AGGC GT 

ATTTCCTGCCCGTCATCGGCCTGGTTGATGCCGAGAAACTGAAGCCAGGTGATCTGGTCGGAGTCAACAAAGACTCCTATTTGATCCTGGAAACC 

CTGCCGGCAGAGTATGATGCCCGCGTAAAAGCCATGGAAGTGGACGAGCGACCTACAGAGCAGTACTCCGACATTGGAGGTTTGGACAAGCAGAT 

ACAGGAGCTCATCGAGGCAGTGGTGCTGCCCATGACGCACAAGGAGAAGTTCAAAAACCTTGGCATTCACCCACCCAAAGGTGTCCTTTTGTACG 

GCCCCCCTGGAACTGGCAAAACATTGCTGGCCCGTGCCTGCGCCGCTCAGACAAAGTCCACCTTCCTCAAGCTGGCAGGTCCCCAGTTGGTGCAA 

ATGTTCATCGGTGATGGTGCCAAGCTGGTGCGCGATGCCTTCGCCTTGGCCAAGGAGAAGGCGCCTGCCATTATCTTCATTGACGAGTTGGACGC 

CATTGGCACCAAGCGTTTCGATTCGGAGAAGGCCGGTGACCGTGAGGTCCAGCGAACTATGTTGGAGCTTCTTAATCAGCTCGATGGCTTCAGTT 

CCACTGCAGATATTAAGGTGATTGCGGCCACCAATCGCGTAGACATTCTGGATCCTGCTCTGCTGCGCTCTGGTCGTCTGGATCGTAAGATCGAG 

TTCCCACATCCCAACGAGGAAGCCCGTGCCCGTATTATGCAGATTCACTCGCGTAAAATGAACGTTAGCAATGATGTGAATTTCGAGGAATTGTC 

CCGATCCACGGATGACTTCAACGGCGCCCAGTGCAAAGCCGTCTGTGTGGAAGCTGGTATGATCGCACTGCGTCGCTCCGCCAATTCGGTTACGC 

ACGAAGACTTCATGGACGCCATCATGGAAGTGCAGGCGAAGAAGAAGGCTAATCTTAACTACTACGCTTAGACTAAAAATTTGCAGTTTAACTTC 

ACATTGTCAGCTCCCATTAAAATAAACCATGGAAATGTAAAATTTTTCAGTTTCGATCTGAGTTGCCCCTTGAAGATCGCCTTTAAAATAACATC 

GCAACGGCTCTTAAAAGTAACTAGAATGGGAGACGTACTGAAATCAATACCCAACCTTTTTAAAATAAACAAAAATATATATCAAATGTCGAGCG 

ATTCATTGGTTTATCAGTACAACATTTACATTCTGAGTACGATATTTTTTAAATATCTAAAAAGGTCGGTTTTAGAATTAATAAAAAATATTACA 

TCGGGCGAAATTAGATGATAAATATTTACACGCACACTAATTAACCTAGAGGGATGATTTACC 

(SEQ ID NO: 1133} 

Start ATG: 115 

MAQTLEDKSIWEDGEESLGEEVMRMSTDEIVSRTRLMDNEIKIMKSEVIRITHEIQAQNEKIKDNTEKIKVNKTLPYLVSNVIELLDVDPQEEED 

DGSVTVLDNQRKGKCAVIKTSTRQAYFLPVIGLVDAEKLKPGDLVGVNKDSYLILETLPAEYDARVKAMEVDERPTEQYSDIGGLDKQIQELIEA 

VVLPMTHKEKFKNLGIHPPKGVLLYGPPGTGKTLLARACAAQTKSTFLKLAGPQLVQMFIGDGAKLVRDAFALAKEKAPAIIFIDELDAIGTKRF 

DSEKAGDREVQRTMLELLNQLDGFSSTADIKVIAATNRVDILDPALLRSGRLDRKIEFPHPNEEARARIMQIHSRKMNVSNDVNFEELSRSTDDF 

N G AQCKA V C VE AGMI ALRRS AN S V T H ED FM D A I ME V QAKKKAN LN Y Y A * 

(SEQ ID NO: 1134) 

Name: 26S proteasome regulatory subunit 6A 
Classification: endopeptidase 
Gene Symbol: Rpt5 
FlyBase ID: FBgn0028684 



Celera Sequence No. : 142000013384415 

CAGCAATTAGCATATACTACATATATACATATGTATATGTATATTTGGTGTGTGATTGGCATGAAACCGGCATTCTGTTTATTTACCCATCCAAT 
TGGCGCCAGGTGAGTCCAAATATTTCACTTTTGTCCACTTTGATGTGTTTGCGCAGCCAATTCCTAGACGCTGTCGTAGACATCGAGATTACAAC 
GGATTTTCGGGGGCGAAATATGGTAATATATGCAAACACGCTTTTATTTTATTACGTCGGGCTTGCCTGCAAATAATTTGGCATTTCAATTTTGA 
GTTGGTTGATTCATCTGCGCGTAAACATGATTTCGTTTTTCTTTGTAAACTCAGTAAATACGTTTAGTTTATAACTAGTTTAGCTACGTTGTAAC 
GAAGTGGAAAGCTTATGAATGGCGGTTACTTTTCAATTTTTAATGAATTCACGATGAGTTTGGGAATTTTTGGTGGCATTCTGGTTTGATGTGGT 
TTAATCGTTCTGAATGGCATTAACTTTTTCTGGTTCTTAAATATTAATTTCAAGTCTGGAAATTTCTAGCAATGCTTGTTTTTCACAACAAACTA 
AGGTTTAAAAATATTAATATTTGTGTTCGTTAGGGATAAAAATTAATTTGTTAGCGAAATAATGTTTGAAATCATTTAAAGTTTCCAACTATTCC 
GGACTAAGAGCAAAAATTAACCATAAAAATTTTTACTTAATTTTTGTATTACTTATTTATATAAGATTTGGCTTGGAATGTCCAACAGCACAATC 
TAATAAATATAAAGAAAGATTATAAAGCATACATATAGTTTACACTTTTTCAATACTTTAATATATTCTATTGTTTATAACGAATTTATTGATTA 
CAAATATTTTCAAATTTGAAATTGAATTGTAACCGCCTTTGACTGGTTCCGAATCGACTGGCTCTCCACGGAACATATCGCCACATGCGGTATAA 
TCAATGCAGCTCCCTACGGTCATACTATTAACCACATCCACATCAGAAGTCTGAAATCTGATTTAAAAACTCTCACGAAACTTACTTAAAACCAG 
GTGAAATAAAGACTGAATACATTCTGGAGGCAAGACAAGCTGAAGCAGTATGAGTGGCAAGCAGGTAGTCATCCTTCTGGGCAGTGTCCTGATCC 
TGGGATGCTTACAAGTGGCAGCGGCAACGGAAACGGATAACAAAACCAACGATTTCGTGGCCACCGACGAATGGCAGACGATTGCGGAAGGTTAA 
TGGCTAGCCAGGCTAACTCTCGGATGACGTGTACTAAGTGAGCACTCTATATTTCCAGGTCAAGCTATTCCAAGGGGCCTGCACGTGCGCATCAA 
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CCTACAAACGGGGCTCAAGGAAGCCAAACTCTTAGACGAAAGCGAACGTGGCACGTCGCTGCAGAGTCAACCGGATGATCAGAATGCTCGGGAAT 
CTCACGATGACAACGAACCCCTGGCTCTGGACTATAAACCGGACATAATCGAGGAGTCTATACGGCGGGTCAAGGAGCAGAAGAAGAGCTATGCC 
GAGCTACGCAAAGCGTACAAAGAGTTCCAGAAGAACTTTCGCACCGACGGAGAACTGATCGTCCAGTTAATCGATCAGTTTCGGAACTTCAGCAG 
AACGCCGCTAGAGTCAGAGATGCGCTCCAAGCTGGACTGCCTGGAAAATCTGGAGTATTTGCTCCATCAGATAGACAATGCCTTGATGTTCATTG 
ATAATGGGGGATTGGATGATGTACTACTGCCCATTGTTGTGAACGATACCAGTACATCCCTGAGAGTGTCGGCCATGCGTGTGTTGGGCTCGCTG 
GCGAGCAACAATCCCAAGGCCCAGATCAAGGTGTTCGAAAAGAATTTCGGGTCTCATCTGGCTCAGATTCTGACCAGTTCCGGAAATGTCGGTGA 
GATCTCCGCTGCATTGCACGCCTTTGGAGCCTTGTTGCGGAAATTTCCGCTAGCCCAGCAGCGAGTGCTTTCCACCTCGGGTACACAGGCTCTGA 
TCAAGGTGCTCCAGAGTCCGGATGTGGAGCTGCGGAGCAAGGCGAAGGTAGTGACTCTCATAAGCGACCTAGTGCTGGAGAAGCGGTCAGTGCTG 
GATGTCAGCAAAGATGATCCCGAAGCCTCATCCACAATGGCGCAATATGTGCTCTTGGACTTCGAGTCGTGGCTGAAAACGCCGGGCTACTGCGC 
GGCGGTGGACACTGTGCTAACCAAGGAGTTCCTGCTCCTCCTAGAGCAACCGGAGGTGGTGGAACAGTTTGCCACCGCATTGGAGACCACCGAAG 
ATATGTGCACCAGCACGTGGTCGCAAAGTTCCGGTCTCAGGCATGCACTATTAACCGTTCGCAATCGGTATGCCAACAGCACGGACGAGTACCGA 
CTGGAGGTGTCCCAGATTTTGGCCAAACTGTGCGAGAGATTGTTCAACAAGCCCAAGCACACCGAACTGTAATCCTAGCCAATTTACTATGTTAT 
TTATGTTAATTTCATTTAATTGTAAGCTGAGTTGGAATGAAATTCGCAGCCAAACCATAGAATACGATAGCTAAGATGCCAAGTACTTGTGATGC 
AAAAATAATGCTCTTTGTCATTAACTTAGGATAGCAATCAATCATGCCAAACATCTACAGGATTTTCGTTATCAAATATAGTCTCTTACTTAAGA 
TTATTTTAAGTGTGATTTTTTTTTAATAGGCAGACGATTCACCAACGAGCACATCAAAATAATTATGTTGTTACACGTCATTAAACTAGCAAAAA 
TTTGTATGCTACAGAATTGTTCTCCAAACAGATTATGAATCCTTAAAGCCTTAAGTTCTTGTTTATTGTAGCTACAATAATAAAGCAATGACCTA 
ATGTTTAATTTAGAAATTCTTTCTTAAATCAGTTGACTTCTTTTCAGTGGCATATTCTTAAAATGTAGTGCTAAATTATGAAAGGAAATGTATTT 
TCAATCAAGAAATCGCATTAGGAGATTGGATATTCCACCCCTATAGAACTACAGACCCATTTAAAGCCCGATAAGGCGAGCATAATCGACCGGGC 
CATAAAGTAAGTCAAAAGTGTCGGAAGGAAAGCAAAAACGCCGAGGAAAGCATGTCCCAAAAAACCACTTGAAGTTTGAATGTGAGCCAAACGAA 
GCCACAAGAACAGGGAAAACCTGGCACGGAACTGTACAGGAGGAGAAAGGACATCGGAGCAGGAGAGTGCAATTATGTGCCGGCAAAGTTGGAAG 
TTGGAGTCGAAGTCCAGCGCGGGAATTACAGCAATTAGCGGCAGTCACATGCAAATGCAATGTCACTGGGGCTGAGTCACGACGGTGGATGGTCG 
GATAGGTGGAGTGGTAAAGGTGAAGCAGCGCCTGGAAGGGAGCCGGCTAGGACGCCGCCTTGGTTCCCATGGCTACTCCATGCAATGTAAACAGA 
GAAAAGGGGTTGCAAAGCCGGCTCAGT 
(SEQ ID NO: 1135) 

Exon: 1001.. 1230 
Exon: 1294.. 2447 
Start ATG: 1095 

Transcript No. : CT29244 

CT G AAATC T GA T T T AAAAA CT C T C AC GAAAC T T AC T T AAAAC C AGGT G AAAT AAAG ACT G AAT AC AT T CT GG AGGC AAG AC AAGC T G AAGCAGT A 

TGAGTGGCAAGCAGGTAGTCATCCTTCTGGGCAGTGTCCTGATCCTGGGATGCTTACAAGTGGCAGCGGCAACGGAAACGGATAACAAAACCAAC 

GATTTCGTGGCCACCGACGAATGGCAGACGATTGCGGAAGGTCAAGCTATTCCAAGGGGCCTGCACGTGCGCATCAACCTACAAACGGGGCTCAA 

GGAAGCCAAACTCTTAGACGAAAGCGAACGTGGCACGTCGCTGCAGAGTCAACCGGATGATCAGAATGCTCGGGAATCTCACGATGACAACGAAC 

CCCTGGCTCTGGACTATAAACCGGACATAATCGAGGAGTCTATACGGCGGGTCAAGGAGCAGAAGAAGAGCTATGCCGAGCTACGCAAAGCGTAC 

AAAGAGTTCCAGAAGAACTTTCGCACCGACGGAGAACTGATCGTCCAGTTAATCGATCAGTTTCGGAACTTCAGCAGAACGCCGCTAGAGTCAGA 

GATGCGCTCCAAGCTGGACTGCCTGGAAAATCTGGAGTATTTGCTCCATCAGATAGACAATGCCTTGATGTTCATTGATAATGGGGGATTGGATG 

ATGTACTACTGCCCATTGTTGTGAACGATACCAGTACATCCCTGAGAGTGTCGGCCATGCGTGTGTTGGGCTCGCTGGCGAGCAACAATCCCAAG 

GCCCAGATCAAGGTGTTCGAAAAGAATTTCGGGTCTCATCTGGCTCAGATTCTGACCAGTTCCGGAAATGTCGGTGAGATCTCCGCTGCATTGCA 

CGCCTTTGGAGCCTTGTTGCGGAAATTTCCGCTAGCCCAGCAGCGAGTGCTTTCCACCTCGGGTACACAGGCTCTGATCAAGGTGCTCCAGAGTC 

CGGATGTGGAGCTGCGGAGCAAGGCGAAGGTAGTGACTCTCATAAGCGACCTAGTGCTGGAGAAGCGGTCAGTGCTGGATGTCAGCAAAGATGAT 

CCCGAAGCCTCATCCACAATGGCGCAATATGTGCTCTTGGACTTCGAGTCGTGGCTGAAAACGCCGGGCTACTGCGCGGCGGTGGACACTGTGCT 

AACCAAGGAGTTCCTGCTCCTCCTAGAGCAACCGGAGGTGGTGGAACAGTTTGCCACCGCATTGGAGACCACCGAAGATATGTGCACCAGCACGT 

GGTCGCAAAGTTCCGGTCTCAGGCATGCACTATTAACCGTTCGCAATCGGTATGCCAACAGCACGGACGAGTACCGACTGGAGGTGTCCCAGATT 

TTGGCCAAACTGTGCGAGAGATTGTTCAACAAGCCCAAGCACACCGAACTGTAA 

{SEQ ID NO: 1136) 

Start ATG: 95 

MSGKQVVILLGSVLILGCLQVAAATETDNKTNDFVATDEWQTIAEGQAIPRGLHVRIHLQTGLKEAKLLDESERGTSLQSQPDDQNARESHDDNE 
PLALDYKPDIIEESIRRVKEQKKSYAELRKAYKEFQKNFRTDGELIVQLIDQFRNFSRTPLESEMRSKLDCLENLEYLLHQIDKALMFIDNGGLD 
DVLLPIVVNDTSTSLRVSAMRVLGSLASNWPKAQIKVFEKNFGSHLAQILTSSGNVGEISAALHAFGALLRKFPLAQQRVLSTSGTQALIKVLQS 
PDVELRSKAKVVTLISDLVLEKRSVLDVSKDDPEASSTMAQYVLLDFESWLKTPGYCAAVDTVLTKEFLLLLEQPEVVEQFATALETTEDMCTST 
WSQSSGLRHALLTVRNRYANSTDEYRLEVSQILAKLCERLFNKPKHTEL* 
(SEQ ID NO: 1137) 



Celera Sequence No. : 142000013384415 

TAAAGAATGTTTCATAGGCTCTAAATCGAGATTTTGTGAGGCTTCTAATGATTGGGCATTCAGCATTTTTTCAAGAATTTTGTAACCGACTCAAA 
AAATCTTTAGAATGGTTGGTTATTCGGATCGCATATACTTAGCTTGTTTGTCTTATTTTTATTTGGATGAGCGCCAAAATTTTGCTGCGTCAGTC 
TGGAAAAAATTGAATCAAATGTGTATAGTTTTATAGAAGTTGGGAAGCGGAATTTATTTATTTATTTAAATATTTATAATTAAAAAAATGAAAAT 
AGTCACGTTGTTTAACTAGTCAGTATTCGAACCAACAAATGTAAAATGTATACTGGTTTGTGTCTAAGCTAAGCTTGTCATATTAACGGAGCTGC 
CAGATGTTAGGAAGTGGGGATGCCATACATTATTCTAAATTTGCGCGCAATTTTAGAAGCTTATCGTCGTCAGAATTACAAAAACAAATTGAATA 
TGAAAATGGGTTATTGCTACTTCATTATTATTGTCACGATATATGATAATTTATACAAAATGTGATAAATCCCAAATTGTTAAATAATGCTTTGG 
CTTGCTTTATACAAAACCACTAGATAATTAAAATATAGGTGGCCTAAATTGTTGCATGTTGTTTTATAATTAATCAGCAATTTGATTTGGTTGTG 
ATCGACCAAATCAGTGTGTATAATTGTAGTTAAAATGTAAAGTTCGTAATGGATTATTGAATCGCATTTCAAATTTCTTTAAATGCGCCCGGGTC 
AATGACCTTTTGAGGTGACCATAAATTGAAACTTATTTGTGCGACGGCAACCCTGTTCTGGGACTCGACATGATATCGATACGTTAACAACAAAG 
AGTCTGGACGCCATCATTCTTCCTCTTTCTCCTGAATTCGCAGACAGCGTGGCGTCAGGCATTTCAAACGGTAAAAAGAACCTGGCGATAAGGAA 
AGATTTAAAAGGCAAAAATCGAGTGATTTGTGTGATTTAACTTAAGAATAATGCTTAATGCACGTGACGTGTGTCCTGAGGGCAACGACGACCAG 
CAGTTGGACCACAATTTTAAGCAGATGGAGGAGCATTTGGCCTTAATGGTGGAAGGCAATGAAAACGAAGATCCGAGGAAAGCCACTTGTGAGTA 
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CGAGGATACGAACGAAGATGGTGCAACCTGCACATCGGGCGTTTTATCCGAAATCCAGGAGAACTTCGGTAGACTCCGGTTGTGTGACGTTACTG 
CACCACTCCTCGAATTCCACGGTTTGGATTGCTTGCAACAGATTCAAAAGCGCTCGCGCCATTTTGCATTCGACGGTTCTCCGGCCAAGAAGTCG 
CGATCCGGAGGCGTGTTGGTCACCGGGCCAAAGCAGAAGCAACTGCAGAAGGAAAATGTGTGGAACCGGAAGAGTAAAGGCTCTGCGTCCGCGGA 
TAATATTGAGAAACTGCCCATAACTATTGAGAAACTGCATATGATTGGTCTGCACGGCGATTGGTGAGTCTTCTGGAGTATATCCCAAATATATC 
ACATAATAAAAAGCTCCTTATCTAAACAATAGCTTAGAGCACAACGCCGTGCTGCGTTTGATGAATCTGTTCAGATCCCTGCATGATCACCTGAC 
CGCCGATTTGGGCTTCTCGCGCCAAAACTCAATGCCCTCGGACTATCTGTTCGATATGCCGGTGAAGAGCACGATGCCTAAGAGCTTGAATGTGC 
GCTACCAACTGCAGGTGCTGTGCACCAAAGTAGAGCGCTTCCTTGTCCAGCAGCGCCGCACCTTGGAGGCGAATCGCCACTTCGATTTCGAGAAA 
TACGACGAGTGTGACAAGTTGCTTAAGGGTTTCGCATCCTATTTGGACAACTTCAAACTGCTTTTAAAGCCCAAAATGCGCAATCGAAACGGAAA 
CTCGGGGAGCAATGCGGACAAGTGTAAGCTGTAGATTTGCAAGCAACCATTCAGCTATTCCTGCAACGATTTTATTATTTACAGTCCATACTCAG 
CGCATGGAGAGATTGCTAATTGGTCTGCGCGATTGGATCAAGGCTGCGCATCTCAGTGTGCACGTATTTAACTGGGAAATGGATCTGGAGCACCG 
CTACTCCGGGGCCATGACCGAAAGCCACAAGTCGTTGAACGAGCGGGCCATCCTTTTGTCCGGTGCCGAGCTAAGGGCGGCCGAAGCGCGTGGAA 
TCAGTGCGGAGGATCTGTTCATCGCCCAGAGATACAAACTGGGAGGTCCGATCTATTGCGTTCTGGAGCAGCATGAGTTCCTCTCCGCTCTGATC 
GCCAATCCAGAGACCTATTTCCCGCCCAGTGTTGTCGCCATTTGCGGGCCACAGAAGCTTGGCGCAGTGAGCATGGAGCAGCCGTCAGCGTCGGA 
GGAGGAGTTTGAGGAGACCGAGGAAGTGCCATCATCGCCACCTCGTCACACCGGACGTGTACCTCGCTTCAGAAGCTAAACTAATGCTGTGCACA 
TCGATAAAAGAATGACAGCAAATATGCAATTTAAAAAAGCTACTCTTCTCATGGGAAGCAATAATTTCGTAAAGTAAACATATCTATAGTGTAAG 
ATATATTTGTCCAATAGTGCGGACTCCATATTTGTATTCGTGAATAAGCTTATATAAGCTTTTTAAAAATATTTATCAAATCGATACAAACAAAA 
TCAAAATGAAAACGATTTATTACCCCTGTTTTGAGATTGATAACAAATTTATATAAGTTTAACTGTGTTTACATTTATTTGGCAAAACTACAAAT 
GTGTTTGCTTTTCACTTTTATAAATCTGTATTTTACTTAAACTTTAGAAATAAGAAATCCTTTAGTGCCTGAATTTATTTTGCAACTACGTTTTA 
TTTGTATGAGGAACTTACCAGTTTTTCTTATTTGCTTTGCATTTGTATTTTGAAAGTCAAATAAATATTTACGATTTGTGTTTGGACTAATTAGG 
GTGTTTGTACTAAGGTCTTAGACTACAAAGAGCTGTTCGAGCGGCAGTTTGCCAAAGATCCTGTCCACCACCGACTTGGCTATCAGTCCGAAGGC 
CTCGCCAATGCCCGGTTTCAGTTCATTGAAGATGTCCTTCCAGTTCTCATTGATAAGCGCGTGCATGTTCTCGCTGAGATCTTTCTGGCCGTTGA 
ACAGATTTTCCAGCTGGTAGGTCACATGGGATGGATCCAGCTCAACCTTTATGTTCATCACCTCGGCGTAGGTTTGATGATCGCCCTTGGAGACG 
CGCTTCAATTTGATCTGTGCACGTACCTTTGTTCTAGCTGCGAATTCCAGAAACACATTAAAAATTTTAGATTTCAAAGCTGTGGGAGGCAAATC 
TTACTGAGGCGTATGTCAGCAATGCCATTTCCGGTGATGGGCAGAATGAGTATTCTTCCGTCCACCGAGTAGGGTCCTCTGACTCCTATTTCTGG 
AAC T T CC AT GAC C AGCT CG AT GG AGC GGC T GAGA 
(SEQ ID NO; 1138) 

Exon: 1001.. 1488 
Exon: 1553.. 1923 
Exon: 1985.. 2454 
Start ATG: 1001 

Transcript No. : CT29276 

ATGCTTAATGCACGTGACGTGTGTCCTGAGGGCAACGACGACCAGCAGTTGGACCACAATTTTAAGCAGATGGAGGAGCATTTGGCCTTAATGGT 

GGAAGGCAATGAAAACGAAGATCCGAGGAAAGCCACTTGTGAGTACGAGGATACGAACGAAGATGGTGCAACCTGCACATCGGGCGTTTTATCCG 

AAATCCAGGAGAACTTCGGTAGACTCCGGTTGTGTGACGTTACTGCACCACTCCTCGAATTCCACGGTTTGGATTGCTTGCAACAGATTCAAAAG 

CGCTCGCGCCATTTTGCATTCGACGGTTCTCCGGCCAAGAAGTCGCGATCCGGAGGCGTGTTGGTCACCGGGCCAAAGCAGAAGCAACTGCAGAA 

GGAAAATGTGTGGAACCGGAAGAGTAAAGGCTCTGCGTCCGCGGATAATATTGAGAAACTGCCCATAACTATTGAGAAACTGCATATGATTGGTC 

TGCACGGCGATTGCTTAGAGCACAACGCCGTGCTGCGTTTGATGAATCTGTTCAGATCCCTGCATGATCACCTGACCGCCGATTTGGGCTTCTCG 

CGCCAAAACTCAATGCCCTCGGACTATCTGTTCGATATGCCGGTGAAGAGCACGATGCCTAAGAGCTTGAATGTGCGCTACCAACTGCAGGTGCT 

GTGCACCAAAGTAGAGCGCTTCCTTGTCCAGCAGCGCCGCACCTTGGAGGCGAATCGCCACTTCGATTTCGAGAAATACGACGAGTGTGACAAGT 

TGCTTAAGGGTTTCGCATCCTATTTGGACAACTTCAAACTGCTTTTAAAGCCCAAAATGCGCAATCGAAACGGAAACTCGGGGAGCAATGCGGAC 

AAGTTCCATACTCAGCGCATGGAGAGATTGCTAATTGGTCTGCGCGATTGGATCAAGGCTGCGCATCTCAGTGTGCACGTATTTAACTGGGAAAT 

GGATCTGGAGCACCGCTACTCCGGGGCCATGACCGAAAGCCACAAGTCGTTGAACGAGCGGGCCATCCTTTTGTCCGGTGCCGAGCTAAGGGCGG 

CCGAAGCGCGTGGAATCAGTGCGGAGGATCTGTTCATCGCCCAGAGATACAAACTGGGAGGTCCGATCTATTGCGTTCTGGAGCAGCATGAGTTC 

CTCTCCGCTCTGATCGCCAATCCAGAGACCTATTTCCCGCCCAGTGTTGTCGCCATTTGCGGGCCACAGAAGCTTGGCGCAGTGAGCATGGAGCA 

GCCGTCAGCGTCGGAGGAGGAGTTTGAGGAGACCGAGGAAGTGCCATCATCGCCACCTCGTCACACCGGACGTGTACCTCGCTTCAGAAGCTAA 

(SEQ ID NO: 1139) 

Start ATG: 1 

MLKARDVCPEGNDDQQLDHNFKQMEEHLALMVEGNEWEDPRKATCEYEDTNEDGATCTSGVLSEIQENFGRLRLCDVTAPLLEFHGLDCLQQIQK 
RSRHFAFDGSPAKKSRSGGVLVTGPKQKQLQKENVWNRKSKGSASADNIEKLPITIEKLHMIGLHGDCLEHNAVLRLMNLFRSLHDHLTADLGFS 
RQNSMPSDYLFDMPVKSTMPKSLNVRYQLQVLCTKVERFLVQQRRTLEANRHFDFEKYDECDKLLKGFASYLDNFKLLLKPKMRNRNGNSGSNAD 
KFHTQRMERLLIGLRDWIKAAHLSVHVFNWEMDLEHRYSGAMTESHKSLNERAILLSGAELRAAEARGISAEDLFIAQRYKLGGPIYCVLEQHEF 
LSALIANPETYFPPSWAICGPQKLGAVSMEQPSASEEEFEETEEVPSSPPRHTGRVPRFRS* 
(SEQ ID NO: 1140) 

Classification : known_f lybase_gene 
Gene Symbol : bam 
FlyBase ID: FBgn0000158 



Celera Sequence No. : 142000013383897 

CAAAATTACAATAAACCCAAAACGGAAAAAAACACCTGTAACTAAAGTACAATAGCTGGCA 

AACCTCTGCATCGCCTTTATTGAAGCACCCTGTATTTAGCATAAACAAAACCCGTGTCGGCGCTGCGCTGCGAATGTCATAAGATCCGGAGCGCA 
CAAGGAACTCGGCAAAAGGCTGGTGGTTATGTATTCATCTCGATAGCTCCGCTCTCCAACCAGTTCATCCATGGCATTTCGGTTCTTTGGCAAGT 
CCTTGCTGTACGCTCTGCCACTTGGCGTCACTTTCCTGGACTGCGTTGGTTATGTGGCCAGGGTGGATGGTGAGTCGTCCGGAGTTTATTCGCTC 
CATGTTAGTGTGCGCTAATTAATCCACAACTTTTTGCAGGCATCTCCATGCAGCCCGCTCTCAATCCCGTGCCGGATGAAAAGGACTATGTGTTC 
CTGCTGCGCTGGGGCACCCACAACAGTCAGGTGGAGCGCGGCGACATAATATCGCTCATTTCACCCAAGGATCCCGCCCAGAAGATTATCAAGCG 
CGTGGTGGGGCTGCAGGGCGACGTCGTGTCCACGCTGGGCTACAAGCACGAGATCGTTCGCGTACCCGAAGGACATTGTTGGGTGGAGGGCGATC 
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ACACAGGCCACTCCATGGACAGCAATACCTTTGGACCCGTCGCCCTGGGCCTGATGTCCGCGCGGGCAGTGGCCATCGTGTGGCCACCGGAACGC 
TGGCGAATACTGGAGAACGAACTGCCGCGGCGACGGAGACCCATACAGGCCTCCAAGAACTCCAGCAACTACTACAACTAGACTGACCAACTGGC 
GGAGAGCCTGCGCCCAAGATAGTTCCAATTGTGTACATACTGAACCGTATATTTTATAAACAATTTGTTGTGTTCCCGCATGCAAAGCAAAGTAG 
GATGTTCCAAAGTAAAAATTAAAGTTAAAGCCAAAGGAATAGCTCTTATACTACATCATTTTTAGCTCCCGAATGCCGAACTCCTCCATGAGGCT 
GAGTGTCGGCTTGAGAGCCACGCCGGCGCGCACTATTTCATAGTAGCCAGGAGTGGCCAGGTCAATGACCGTGGAGGCCAGTCGTCGCTCCTCGG 
TCAGACCGATTCGACCGGCATCGAAGACGGCGCCCAATTGGGGCCATAGGGAACGGAACTCGGACACCTGCAGACTGCTGGGCGCACTGGAGCGA 
TTGGCGCTGGTCAGGGCCAGGGGCTTCTCCTGCCACACGGCGCACAGGTCGCGCATGAAGTTGAAGTCCGGAATGCGTATGCCGATTTTGGAGGT 
GCTGGGATTAAGGAAACGGTTGCTCAGCTGGGACGTCCGCTCGATCACGATGGTCAGGGGGCCCGGGAGCAAACGAGTTAGCAGCTCGTCGCTGA 
GATGGGCAGCCTGACCAAAGCGTCGCAACGCATCGATGTTGTGAACGCAGATGGCCACGGGCTTGTGTTCATCGCGACCCTTAATCTCGTAGAGC 
TGCTGTATGGCTGTCTCGTTGTTGGCATCGCAGGCCAGGCCATACACGGTATCGGTGGGCAGGGCAATCACTTGGCCACCAAGCAGGCACTGGCG 
GGCCAACTGGAGAGCAGCCTCGTCGCCCACCGCGCACACTGGAGTCCTCAGCTCGCTCGCTTGGTGCTGCATTCTGCTGGATGTGTGGTGAGCAC 
GGAGCAGTCGATAAAGGCTCGTTTGAAGGCGGCGCATTAGTCTCGCCAGGTGTGGTTGCACTCGTGGTTGCAGCACTTGTAAAACGTGGTCATGG 
GCTCATCCGCCGACCGCGTCTGGATTTGCATAAAGTACGCCCGCTTATGGCCGCACGTTGGACACTCTGCGTCCGTGGAGTCTACGTTCTCCCAC 
GCCGCCTTGCCGCCCAGCACGTGATCCACCTCCTGGAAACACAGATTATCAGAGATATTATCATAAATTCGTAGCAAATTGCATCATCCAGTGGA 
AAATGATCATACCTTGAGGCGTGGAAAGGTTTTCGTGGAGATCTTGCGCCTGATCTTCGATATGTACGGGCAGGTGTTGCACGTGAAGCGATGGC 
AGTTGGTGTCCTCCTCGATAATCAATATATTTCCGCACGACGGGCAGAAAAACAACATAGTTTCGGAGAATCCGAATCCGCGCTGTTTAGTTTTT 
TGATTTGAGCACGTTGAATTCTCAGTGTGACCGCGTATACGCTTAGATATACTAACTCAATAACATCACAATGTATACTGCAATACCATATGTTC 
CACCGTTCCTCCATAGATGCTAATTTGAAGCACATTATAATAATTAATTAATAAATTAATTATTTGTAATAAATAAAAAATATTTTCAAATACTG 
TTTGAAAATTTCATGTTCGGAATTTAGTGAATTTATTACTTAGCCCTAGCATATACTAACTGTATATTTTATGGTGAAATAAACGTATTTACCAT 
CTTATAGTTTACGGTCATACTAATAGCATTATAAGAAATTTTGTTCGTTTTGCAAAAAACTATCAAAATAAACAATAAGCAACTCAATTTTTAGC 
AGACATTAGAGTAGCAAAAGTTTTAAATCATGTCGGCCGATGTGCGTGATATTCTGGACATGGAGCGTGCCAACACGCCCGAGGTGACGCGGGAT 
TCGTTTCTGGCCACCAAGAAGCGCAACTTTGAGCGGTAAACAAATGCATTTACATGTGTGCGATTCTCTAATTTACAATTTAATCATTTTAGGAC 
CAAGACTGCGTCCCGGCGTCCGGAGGGCATGCATAGGGAGGTGTTTGCCCTGCTCTACACAGACAAGAAAGATGCTCCGCCTCTCCTGCCCACGG 
ATACCGCTCTGGGAATCGGAGCGGGATACAAAGAGACGAAGGCCCGCCTGGGCATGAAGAAGGTGCGCAAGTGGGAGTGGGCGCCCTTTTCGAAT 
CCGGCACGTAACGATTCGGCGGTTTTTCACCACTGGAAACGAGTCACCGACAATTCCACCGACTATCCGTTCGCCAAGTTCAACAAGCAACTGGA 
GGTGCCCTCGTACACGATGACGGAGTACAATGCCCACTTAAGGAACAACATCAACAACTGGAGCAAGGTGCAGACCGATCACCTATTCGACCTGG 
CCAGAAGGTAGACTCTTCATTAGACTTCTTATTAAGTAGCCAAAATTGTAAGCC 
(SEQ ID NO: 1141) 

Exon: 2189.. 2008 
Exon: 1932,. 1001 

Start ATG: 2148 (Reverse strand: CAT) 
Transcript No. : CT29300 

ATCAAAAAACTAAACAGCGCGGATTCGGATTCTCCGAAACTATGTTGTTTTTCTGCCCGTCGTGCGGAAATATATTGATTATCGAGGAGGACACC 

AACTGCCATCGCTTCACGTGCAACACCTGCCCGTACATATCGAAGATCAGGCGCAAGATCTCCACGAAAACCTTTCCACGCCTCAAGGAGGTGGA 

TCACGTGCTGGGCGGCAAGGCGGCGTGGGAGAACGTAGACTCCACGGACGCAGAGTGTCCAACGTGCGGCCATAAGCGGGCGTACTTTATGCAAA 

TCCAGACGCGGTCGGCGGATGAGCCCATGACCACGTTTTACAAGTGCTGCAACCACGAGTGCAACCACACCTGGCGAGACTAATGCGCCGCCTTC 

AAACGAGCCTTTATCGACTGCTCCGTGCTCACCACACATCCAGCAGAATGCAGCACCAAGCGAGCGAGCTGAGGACTCCAGTGTGCGCGGTGGGC 

GACGAGGCTGCTCTCCAGTTGGCCCGCCAGTGCCTGCTTGGTGGCCAAGTGATTGCCCTGCCCACCGATACCGTGTATGGCCTGGCCTGCGATGC 

CAACAACGAGACAGCCATACAGCAGCTCTACGAGATTAAGGGTCGCGATGAACACAAGCCCGTGGCCATCTGCGTTCACAACATCGATGCGTTGC 

GACGCTTTGGTCAGGCTGCCCATCTCAGCGACGAGCTGCTAACTCGTTTGCTCCCGGGCCCCCTGACCATCGTGATCGAGCGGACGTCCCAGCTG 

AGCAACCGTTTCCTTAATCCCAGCACCTCCAAAATCGGCATACGCATTCCGGACTTCAACTTCATGCGCGACCTGTGCGCCGTGTGGCAGGAGAA 

GCCCCTGGCCCTGACCAGCGCCAATCGCTCCAGTGCGCCCAGCAGTCTGCAGGTGTCCGAGTTCCGTTCCCTATGGCCCCAATTGGGCGCCGTCT 

TCGATGCCGGTCGAATCGGTCTGACCGAGGAGCGACGACTGGCCTCCACGGTCATTGACCTGGCCACTCCTGGCTACTATGAAATAGTGCGCGCC 

GGCGTGGCTCTCAAGCCGACACTCAGCCTCATGGAGGAGTTCGGCATTCGGGAGCTAAAAATGATGTAG 

(SEQ ID WO; 1142) 

Start ATG: 42 (Reverse strand: CAT) 

MLFFCPSCGNILIIEEDTNCHRFTCNTCPYISKIRRKISTKTFPRLKEVDHVLGGKAAWENVDSTDAECPTCGHKRAYFMQIQTRSADEPMTTFY 

KCCNHECNHTWRD* 

(SEQ ID NO: 1143) 

Name: PUTATIVE DNA-DIRECTED RNA POLYMERASE III LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384661 

TTAATAATGCGATACTCGTATTCGTACTCCATCACGCGCATTCGCCCGTCCAGTTCTATGGCTCGATATTGGTTCAATCCCTCCATGAACTGACC 
GCGGCTGCACTGCACCGTGTCCAGGAGCTGCGAAAAGGTAAAGAGTAGCTTCTGCTCCACGCAGAACTCGTTCTCCGGCCCCGAATATCGAGTCA 
GCTGCAGTAGTTCACCCAGCTTGCGAAAGCGCGGCTTTATCTCGCGGCACTCGAAGTACTCGTGGAACACCTTGAGCACCGGCCGCTGCTCCAGA 
GTTCGCGGCACTTCCAGTTCATCCTCCGTGCTGTCATTGAGGCTGCGCTCCAGGGAGGCGTTGGCGTTGCCAGTGCGCGGACTCTTCAGC GGCGA 
GGTGCTGGTGGCGGCGGCAAACTTGAGGTCCGGTACCAGCAAAAGGCTATTGGATATTTCCGCGCCCTTCACGTCGTACGTTCGCTCATCGGTGC 
ACAGCACCAGCTTCTCGTTCTGGCCACCCTTGAAATATAGCGTCTGGCCGTCGCGGATTTGTCCAAGCATGTGCCCATCGAGCTCCAGAAGGCGC 
AGATTGTCCGAATCCACGTCCGCCGAGGGAAAGTACAGAGCCTGGGTGAGTTGGGTCAACTGGCGCTCGTCCAGTTTCGCATGCTTCACAATGGC 
CTTCACATCCTCCGGCGTGCGAACGTAGCTATGGCGGGTGGTTGGGTAACATTATAACTCCTACGCAGGCATACCAAAGGCACTCACAGCTGGGC 
AGCAGACTCATCAACCTCCATGGATTCCATGTTCGCTGCATGTAAACAAAACAAAAAATTAACCAAAACAATGAGCGGGAAAAAGTAGTGGGACC 
AAAGCACAGCAATGTTGCTTTCGTTGAGGGTATTCAATGGCCTTTCAATATGTAAGCTTAATGTGCTTGGAAAACATTACAGCATTAATCCGCAC 
ACCCTGCAAGGGCACACTCACCGCTCAGCTGATCAGTTATTTAACGGTCGTAGCCTCGGCAAGAACGTTTCTTTTTTGAGCGGAGATTTCGCGAG 
CACGCGTCTTGTTTTCCGCCGTTCGCCGTGCACTTGGACGCGCACTTTAGAAAGCGTTGACTGCGCCAAAGGTCATTGCTGGGCAAGAGAAGCGT 
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CCAATCGCCGCTGGTGGTGACGCTTGTGGAAAAATCAATTCGGACCAGTGACGGACGTCGAACTTCCAGTTCTCTGACAAGCGGAATCCAGTCAA 

CAACAACATGCCCGTGGGAACGTTTGACGCCTGGGATGCGGCCGTGCTGATCACCATCCTGGTCATATCGGCTCTGATTGGCATCTACTACCGGT 

ATACAGGCGGCAAGCAAAAGACCACACAGGAGTATCTGATGGCGGACCAGAGCATGACCACTTTTCCGGTCTCCTTCAGCCTGATGGCAAGCTTC 

ATGTCCGCCATCTCACTGATGGGCGTCTCCAACGAGTCCTACGAATTTGGTACCATCTTCTGTGTGATCAACATTGCCTACGTGCTGAGTACGCC 

CATTGCCGCGTACTTCTTCCTTCCTGTTTTCTACAGGATGCGGACGACCAGTGTGTACGAGTATCTGGAGCGGCGCTTTGGCCAGGCCACCCGGC 

TGTCCGCCTCCCTGGCCTTCACCGTGCAAATGGTGCTTTACATGGGCATTGCACTCTATGCACCAGCGCTCGCTTTAGAAGCTGTAACCGGCATT 

CATCGGTCAATGGCCATCGTGGTCATTGGACTGGTGTGCACCTTCTATTCGACACTTGGCGGCCTGAAGGCGGTGCTCATCACGGATGTGTTTCA 

GTCCTTCCTCATGTTTGCCGCCATCTACGCCGTGATTGCCGTGTCGGCCATTAAAGCCGGCGGATTCGCGGCCATCTGGGATGTGGCTGTGGAGC 

GAGGACGCGTGAACTTCATTGAGTTTTCATTGGATCCCACCGTGAGGCATACTTGGTGGTCTCTGATCATTGGCGGCATGGTCACTTACCTCTCC 

TTGTATGGCGTTAATCAAACGCAAGTACAGCGACTCCTGAGTGTCCGCAATTTGAAGAGCGCTCAGTCGGCGCTTTGGTGGAACCTGCCCATCCT 

AGGAATGCTTAGTTTTAGCACCATTTTCAGTGGACTGTCCATATTCTACTATTATCGAGATTGTGACCCTTTGCTAAAGGGACGCATCGACAAGC 

GGGATCAGATCATGCCCCTTTTCGCACTGGAGACTATGGGTGAGTTGGAAAGACATCCAATAAGAATTAGCATTATTTGATTTCTTTTTCCGGCA 

GGTCAATATCCTGGACTGTGTGGTCTGTTCGTGTCCGGAATATTCTCCGCCAGTCTTTCCACGATTTCGTCGGCAGTTACCTCGCTGTCGGCTGT 

AACACTGGAGGATTACCTGAAGCCCTTGTACAAGGCGATATTTAAACGGACGCTAATTGACTCCAAGTCCACGATGCCCACCAAGATTGTGGCGT 

GCATTTTTGGCCTGCTTTGCATCGGTCTAGCTTTTGTCGCTGGCTCCATGGGCGGAGTTCTTCAGGCATCGCTTACCATTTTCGGTGTGGTAGGC 

GGTCCGCTGCTGGCCATTTTCACACTGGGAGTCTGCACTACACGCAGCAACCAAAGAGGCGTGCTGCTTGGCTTCCTCGTTTCCCTGATAGTTTC 

CTTCTGGATGGGATTCGGCGGACCCAAACCAAAGCCAGTTACCTTGGAGTTTAGCACAGCGGGCTGTGAGAATGCCACCGCGGTAATGGCGCGTG 

CCATCGAGCTCAGCTCAAAGAGCAGCGGAGTGGTCATCGAACCCGACTACTTCTGGTTATATCGAATTTCTTACTTGTGGCTGAGCGTCATCGGC 

TTTCTTATTGCCGTGGTCGTCGGCTATTTCAGCAGCATTGTGCTGGCTCACTGTGGAAAGGCAGAAAACGCGGAAATCTATTTGGACAAGTCTCG 

GAAGCAGCTGGACTACGATCTCTTCGCACCCATGCTGTCCCGCCGCTGGCGACGCCATAAAGAAGAACAAGACCAAACTGCCCAGGACGAAACGC 

TCACCAAACTAACTGAGCAGTAGTTCCGCCCGGCTTAGATTTAGTGTAGAATATATACATACATATAAAAGAATTTCCAGAAGGCGTCGGCTATT 

TGGTTAGTCGTGTTACGGATGTCGTACCTAAAAGTGCATAACTATGTGACCTTTTTGTATACCTGTGTGAACGAAACTGGCATACAGAGCGACCA 

AAACAGAGTCTTCCATATACACGTAAACAACGATAATTAGTAGTTATTAGTCGTGTTGCAAATGTTCACAAGCGCATCATTAAGAGGCCTCTTAA 

GAATAGTCAAATGTTTTTTACAAGAACGTATCAAAAGTAGGAGTCTACCTCAATGAACTTTTACAGAGGGACAATAATCAATTTGGAGCCTAAAA 

GCGAGTTGGCCGACTACTTTCCCATGTTGAGCATTGGCAGCCACAATGCCCCACATAAATTATTGATTTAGTAGTACATCTAATTTTAATGGGAT 

ATTTATATTCATACCTCTTAGTTACTCGATATTTGCATACATAGATAACATAATAACATTACAATAATACACTACAGTGAGCACCGCCTATAAAT 

AGTTACTACATGGCTCTTAAAGCATTACATTGCGCCACAGTGTGAATTCTGTTTACTTATATACCTCTACTATATGTACACAAATATACTATAGA 

CTAATTATATACTCGTGTCGTGTCGCAATCAACCATTGAACAATAACTCGGAGCAGTGAAAGTTATGTGTATTTGTAAAAGCCAAACTGCTGATC 

ATTAAAGAAATCTGTTCTACAAAATGTGATTGCCTTTTTTTTTATTTGCTGCGGAAACATGATACTTGCAATCCCCAGCTGCAGCTTGGATTGAA 

ATAATTACTTCGGTTGGCTCATGGGTGTTTTTCAGAAATCACGATTTGTCGAATACTTCCTGCGATAAGCAAATTCGTACGCTATCAGCTACCAA 

GGTTATGCGAGGCTCTCCCAACATTGCTCACTCGATGAGGAATCTATTTCGGAACGAATGCCATAACTTTCAAGCGTTTACCTTTGCCAATTGAC 

TATTGCGCCATGCGGGAACAAAGTCTAATTGCGGGTCGGAGACAGCAAATTGCTTAAAATTGGAGAAGCCATGGGTCAGCATGTGAATATGGGTT 

CGCCTGTAAACAAATGCCAATTGATAGATTTGTCGAAACATCATTAATAAACCCAGACCACGCTCGTCTTGCTCATGGAAAGTTTTCAGAACGGA 

TTGGGAGTTCAAAGTACGAATAGACTCGCATGACACACATTTTTGGCTAATGTTCTGGCGAAAGAGGAGACCAAAAGCGGTGGATAACATAGTTA 

CGCCATAACCGATTTTTGTTTACTCTAACTTAGTGTGCGATTTACTATTTTTAACAATATTTATGTTAACCAACAAGAATCTTAAGGCATTTATT 

TGTATTCATTAGCAACGTCATAAAATATAATAACCATACAGATTTACATATAAGAGTATCAATTTGGCGACTGATGATTGTTTAGCCTTCATTTT 

TGCAACGCCAACAGATGATTTGGGCGATTTGTTAGCATTCGGAAGCAAAGCTTTCACAGATAGTTTCGCTGATACGATAAGGGTTTCGTTTGGAT 

AGCATTCATTGAAGTGATTCATTGGGGAAGATTTATGGTTCTATTGATTCTGCTTCCTCACAAACTTCATGAACACATCCTGGAAAGTTGTATGG 

CTGACCGAGTAGTACAATATCTGCATCTCATCCTTCAAGTTCTCCATGTACTCGAAGATCGCGGACCACT 

(SEQ ID NO: 1144) 

Exon: 1001.. 2224 
Exon: 2282. .3820 
Start ATG: 1369 

Transcript No. : CT29322 

TAGCCTCGGCAAGAACGTTTCTTTTTTGAGCGGAGATTTCGCGAGCACGCGTCTTGTTTTCCGCCGTTCGCCGTGCACTTGGACGCGCACTTTAG 
AAAGCGTTGACTGCGCCAAAGGTCATTGCTGGGCAAGAGAAGCGTCCAATCGCCGCTGGTGGTGACGCTTGTGGAAAAATCAATTCGGACCAGTG 
ACGGACGTCGAACTTCCAGTTCTCTGACAAGCGGAATCCAGTCAACAACAACATGCCCGTGGGAACGTTTGACGCCTGGGATGCGGCCGTGCTGA 
TCACCATCCTGGTCATATCGGCTCTGATTGGCATCTACTACCGGTATACAGGCGGCAAGCAAAAGACCACACAGGAGTATCTGATGGCGGACCAG 
AGCATGACCACTTTTCCGGTCTCCTTCAGCCTGATGGCAAGCTTCATGTCCGCCATCTCACTGATGGGCGTCTCCAACGAGTCCTACGAATTTGG 
TACCATCTTCTGTGTGATCAACATTGCCTACGTGCTGAGTACGCCCATTGCCGCGTACTTCTTCCTTCCTGTTTTCTACAGGATGCGGACGACCA 
GTGTGTACGAGTATCTGGAGCGGCGCTTTGGCCAGGCCACCCGGCTGTCCGCCTCCCTGGCCTTCACCGTGCAAATGGTGCTTTACATGGGCATT 
GCACTCTATGCACCAGCGCTCGCTTTAGAAGCTGTAACCGGCATTCATCGGTCAATGGCCATCGTGGTCATTGGACTGGTGTGCACCTTCTATTC 
GACACTTGGCGGCCTGAAGGCGGTGCTCATCACGGATGTGTTTCAGTCCTTCCTCATGTTTGCCGCCATCTACGCCGTGATTGCCGTGTCGGCCA 
TTAAAGCCGGCGGATTCGCGGCCATCTGGGATGTGGCTGTGGAGGGAGGACGCGTGAACTTCATTGAGTTTTCATTGGATCCCACCGTGAGGCAT 
ACTTGGTGGTCTCTGATCATTGGCGGCATGGTCACTTACCTCTCCTTGTATGGCGTTAATCAAACGCAAGTACAGCGACTCCTGAGTGTCCGCAA 
TTTGAAGAGCGCTCAGTCGGCGCTTTGGTGGAACCTGCCCATCCTAGGAATGCTTAGTTTTAGCACCATTTTCAGTGGACTGTCCATATTCTACT 
ATTATCGAGATTGTGACCCTTTGCTAAAGGGACGCATCGACAAGCGGGATCAGATCATGCCCCTTTTCGCACTGGAGACTATGGGTCAATATCCT 
GGACTGTGTGGTCTGTTCGTGTCCGGAATATTCTCCGCCAGTCTTTCCACGATTTCGTCGGCAGTTACCTCGCTGTCGGCTGTAACACTGGAGGA 
TTACCTGAAGCCCTTGTACAAGGCGATATTTAAACGGACGCTAATTGACTCCAAGTCCACGATGCCCACCAAGATTGTGGCGTGCATTTTTGGCC 
TGCTTTGCATCGGTCTAGCTTTTGTCGCTGGCTCCATGGGCGGAGTTCTTCAGGCATCGCTTACCATTTTCGGTGTGGTAGGCGGTCCGCTGCTG 
GCCATTTTCACACTGGGAGTCTGCACTACACGCAGCAACCAAAGAGGCGTGCTGCTTGGCTTCCTCGTTTCCCTGATAGTTTCCTTCTGGATGGG 
ATTCGGCGGACCCAAACCAAAGCCAGTTACCTTGGAGTTTAGCACAGCGGGCTGTGAGAATGCCACCGCGGTAATGGCGCGTGCCATCGAGCTCA 
GCTCAAAGAGCAGCGGAGTGGTCATCGAACCCGACTACTTCTGGTTATATCGAATTTCTTACTTGTGGCTGAGCGTCATCGGCTTTCTTATTGCC 
GTGGTCGTCGGCTATTTCAGCAGCATTGTGCTGGCTCACTGTGGAAAGGCAGAAAACGCGGAAATCTATTTGGACAAGTCTCGGAAGCAGCTGGA 
CTACGATCTCTTCGCACCCATGCTGTCCCGCCGCTGGCGACGCCATAAAGAAGAACAAGACCAAACTGCCCAGGACGAAACGCTCACCAAACTAA 
CTGAGCAGTAGTTCCGCCCGGCTTAGATTTAGTGTAGAATATATACATACATATAAAAGAATTTCCAGAAGGCGTCGGCTATTTGGTTAGTCGTG 
TTACGGATGTCGTACCTAAAAGTGCATAACTATGTGACCTTTTTGTATACCTGTGTGAACGAAACTGGCATACAGAGCGACCAAAACAGAGTCTT 
CCATATACACGTAAACAACGATAATTAGTAGTTATTAGTCGTGTTGCAAATGTTCACAAGCGCATCATTAAGAGGCCTCTTAAGAATAGTCAAAT 
GTTTTTTACAAGAACGTATCAAAAGTAGGAGTCTACCTCAATGAACTTTTACAGAGGGACAATAATCAATTTGGAGCCTAAAAGCGAGTTGGCCG 
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ACTACTTTCCCATGTTGAGCATTGGCAGCCACAATGCCCCACATAAATTATTGATTTAGTAGTACATCTAATTTTAATGGGATATTTATATTCAT 
ACCTCTTAGTTACTCGATATTTGCATACATAGATAACATAATAACATTACAATAATACACTACAGTGAGCACCGCCTATAAATAGTTACTACATG 
GCTCTTAAAGCATTACATTGCGCCACAGTGTGAATTCTGTTTACTTATATACCTCTACTATATGTACACAAATATACTATAGACTAATTATATAC 
TCGTGTCGTGTCGCAATCAACCATTGAACAATAACTCGGAGCAGTGAAAGTTATGTGTATTTGTAAAAGCCAAACTGCTGATCATTAAAGAAATC 
TGTTCTAC 

(SEQ ID NO; 1145) 
Start ATG: 369 

MADQSMTTFPVSFSI^SFMSAISLMGVSNESYEFGTIFCVIHIAYVLSTPIAAYFFLPVFYRMRTTSVYEYLERRFGQATRLSASLAFTVQMVL 

YMGI AL Y A P AL AL E A V T G I H R SMA I V V I GL VC T FY ST LG GL KAVL I T D V FQ S FLMFAA I Y A V I AV S A I K AGG F AA I WD V AVE RG R VNF I E FS L D P 

TVRHTWWSLIIGGMVTYLSLYGVMQTQVQRLLSVRNLKSAQSALWWNLPILGMLSFSTIFSGLSIFYYYRDCDPLLKGRIDKRDQIMPLFALETM 

GQYPGLCGLFVSGIFSASLSTISSAVTSLSAVTLEDYLKPLYKAIFKRTLIDSKSTMPTKIVACIFGLLCIGLAFVAGSMGGVLQASLTIFGVVG 

GPLLAIFTLGVCTTRSNQRGVLLGFLVSLIVSFWMGFGGPKPKPVTLEFSTAGCENATAVMARAIELSSKSSGVVIEPDYFWLYRISYLWLSVIG 

FLIAWVGYFSSIVLAHCGKAENAEIYLDKSRKQLDYDLFAPMLSRRWRRHKEEQDQTAQDETLTKLTEQ* 

(SEQ ID NO: 1146) 

Name : sodium- dependent multi-vitamin transporter homolog 
Classification: transporter 



Celera Sequence No. : 142000013384329 

GTAGAAATCATAATTCAATTGCAACGGATCGAGTACTTACATTATCACTCTTTTCTTGATCATTCCCCTTGACTACACAATCATCGATCTTAACA 
ACTTCCGTTAAATCAGGACTGGTACTTTTGCACTTGTACTGCTTAAAAACTAGCTTGAACATGCTTCATAAACAATAAGTACAATATAAACAATA 
AGTTAATATAAACGATAGCAATAGTGTGACTGCGTTTGGAGAAATCTTAAATGCTTTTGAGCACGTTTAGAAATCAGACATGTGGTGGAAATACT 
ATGGGATAGATATAAAAATTAAAAATGAATGCAATTCATAAGAGACATTGCCGAAAAAGCTCGAATGTCCTTTTCATGGAAAAAATTTTAAGCCT 
TTTTTCATGAGGTGATTCATTTGTGATTAATCTTATCGATATAAAGTTATCGATATATGAACGGGACTTTTAACTATTTTGAAATATTTTTGTTG 
TTTTGAGACCGATTTTTTCATTGTTTACGTTTTTGCAGTGATCGTTTTCATTATTTCGGTCGAAGATACGTCTTAAGGGTTGCACAGGAAAAAAT 
CAAGTGGGAACATTACACTTCAGAGCGATAATAGATCTTTATTTTTTATATTTTTATGAATTAATATTAAAAAAGGAGTTGAAATAAGAAAATGG 
TATATTTATATGACATATTTATATATTTATATATGTTGTAGTTATTTTACTTTCATGTGAGGGGGCGTGCTTAATACTATATTCGGTTCASAAGA 
AAGGGGTACCACATTGTTTATATAAAATAAAATATGTTTTTAAAATATAACGTCTTTTGACAAATAAACTTTGAATAAATAAAGACAAGTTGGTA 
TATTTGCCGGCTCTCTCCGTCCGCAGTGTGAACAGCCTGGCTGCCGAATATGTATCAATGCTATCGATATCCGCATACATCGATACTGCTTCCTT 
GCACCCTTAATTTGTTTCTAGGTGTTGGCTGCCAATTGTAGAGTGCAAATGAAAAGTTACAGATTGGTGGCTTCGTTAACAAAATTTTCAGTTAT 
TTGTGTAGTTGAAGACGGGAATTTGTTTTGCGTATAAAAAGTTAACACTGCGTGTAAGTAACGTATTGTCGTCATCAACAAACTGTCCAGTGAAT 
GAAAATTGAAAGAAAAATTACTGATCTATGGTAATGTGTGTGTATGTGCTAAAAGGAAAAAAAAATTTTTCATACTCGCTGAACGACGAGTGATG 
AAAATGCAGTCTCGATAGTTTTGCGCCTACATATATGTATGACTATTGCACAGATTTGGTTGTTTTTTTTTTCTAAACGATTGCCAGTCGGCATG 
GAAAATTAAATGAAATGAGCTCCATATATTTTGTGCCTGTGCCAGTGCCTTCCCAACGAAAAATGTTCAACGTTTAAAATTGATTATTTTTTTTT 
TATCTATTTCCCTCCTGCTCTTTGTCATGTGTACGCTTACGCGGTTGCTGCGCGTGCGAATGGGACACCGATTGCCGCGACTGTTTTTGTCTCAC 
CGCCCGCTTTCGGGAACTTCTGACCTGCTTTCGCTGCCGCGCTTTCAATTGCGCACGGCTCGCATCCACTAGTCGTGCTCCTTGCTCCAATCGAA 
AGGGCCACTTGCGCCGCGGTGGTGGTGCAAAACTTGCAATTTTGGTGTACAAAAAAAGCAAAAGGAATATATCCCATATATCCTACCCATCTATC 
GGAAGAACGGAATTCGAAATCCAGCACATAAAAAGCATAAATCACATCAAAAAATTCGCATTTTCTCTGGTCCTTAGCACCAGATCGCTTTTTAT 
GCATCCAATTCGGTTTCCTCTTTTTTTTTAGTATACCATCCCATCCGCGAGATAACATTCTGTATTAAACAAAACGCCCAACATCCTAATTACAA 
ATCAGACGGGACGGAAAATCGAAAAAACTATAGGAACCTCTCTCTCCCCCCCCACTAACGTAGTTTATGTGTATTTAACAAGCGAAGAACTTTCT 
GCAGAGGTCTGAAATACACCAATCGCACATACCAAACAAAAAGTAATAAAAAAGAAATAATTGAAAAACCAAATAGTTAAATTGGCAGAGCAAAC 
ACAGGACGACGACTATGAACTAAAGTCAGAAATTAATGGACATCGTACCGAGAATATAATCCGTGAATAGTACTATACGATAAAACAGAAAAAGA 
AAAAAAGTCAACTAAAAAGCAAAAAACCAACTCTTGCAGAACACAGTTCATGATTTATCCATACTAGAAGTGCACAAAATAAACGCCGTATGCAA 
GCGTGATCATCCAAGGCTCATTATCCGACTAAGCTACTGGCCAGCTTAAAAAAAGTAAGCGTAAGTACACCCACAACTGACCATATAACTAGGTT 
AATGAACGCCTCCATTTACTACTTCCCCATATGTATAGCACCATCTAACAAGAGCGAATCGAATCGAATCGACTTGGACGACCGACCGACCGACC 
GCACAAAGAAGGATCTATCTGGAAAGCCGGCTAGCCGATTCCCGCATCCGATTCCGAATACGAATACGTATCCGTATCCGAATCCGAATTTGAAA 
CCGAATCCTCAACCAAATCATCCCATAAGGACGCCGCTGTCTTCAACGATCACAAGATGAATGAACTAGACAGTCTCAGGCAGGAAGCCGAGTCC 
CTAAAGAACGCCATTCGGGATGCCCGGAAGGCGGCCTGCGACACATCACTGTTGCAAGCGGCCACCTCGCTGGAACCCATCGGCCGCATACAGAT 
GCGCACCCGTCGTACATTACGCGGCCATTTGGCGAAAATCTACGCCATGCATTGGGGCAACGATTCAAGGAATCTCGTATCAGCCTCACAGGACG 
GCAAACTGATCGTTTGGGACTCGCATACCACGAACAAAGTCCATGCCATTCCACTGCGATCCTCGTGGGTGATGACCTGTGCGTACGCCCCATCC 
GGTAGCTATGTGGCCTGCGGTGGCCTCGACAACATGTGTTCAATTTACAACCTAAAGACGCGCGAGGGCAACGTCCGGGTGTCCCGTGAGCTGCC 
CGGCCATGGTGGCTATCTATCGTGCTGCCGCTTCCTGGACGACAATCAGATCGTGACCAGCTCCGGTGATATGTCGTGCGGATTGTGGGATATCG 
AGACGGGACTGCAGGTAACCTCGTTTTTGGGCCACACCGGCGATGTGATGGCCCTCTCACTGGCGCCCCAATGCAAAACGTTCGTATCCGGCGCC 
TGCGATGCGTCCGCCAAGCTATGGGACATCCGGGAGGGTGTCTGTAAACAAACCTTCCCCGGCCACGAATCCGATATCAATGCGGTCACATTTTT 
CCCGAATGGTCAGGCATTCGCCACCGGTTCGGACGACGCAACCTGTCGATTGTTCGATATCCGTGCCGATCAGGAGTTGGCCATGTATTCGCACG 
ACAACATCATATGCGGCATCACATCGGTGGCATTCTCGAAGAGCGGACGTCTGTTATTAGCGGGCTACGATGATTTCAACTGCAATGTATGGGAC 
ACGATGAAGGCAGAACGGTCTGGCATACTCGCTGGCCACGACAACCGTGTATCCTGTTTGGGTGTCACCGAGAACGGCATGGCGGTGGCAACAGG 
ATCGTGGGACTCCTTCTTGCGTGTATGGAACTAAAAATAAAACAGAAACAGAAACAGCAGCATAAGCAGAAACAGAAACAAAACAAAACAAAAAA 
CAGAGAAAACCAAACAAACAAAGAACAACAAAAAAAGAGGAAAAAGA 

CGGTTGCGCAGTCTGCGGATAGTGTTTCACCCATCATCTACATTGTGATCTAATGTATATGATTCGGTGTTACCGTCTCTGTGGTAAATATTAAA 
TGAGTGCCGGACGTTTTGAATATGTGGGCGCAGCTAGTTGACTAGAACGTAGACCACAACAACAAAGAACTTTATATACAACATTATTTATATTG 
CTATATACTTATTAACTAAAATTATACAAAAAAACAAAATAAAACCAAAAAAA 

GCAACATACAAATAGACATACACACGCATACAAGAATATTATTAGTGTAAATGGAAAATTGAGAAAAGAAAAAATGAAAAACCAAGAACAAGCGA 
TCAAGAAGAAACCAAAACGATAAAGAAATTGATGGAGTAACGATGGGCCCGCTGATAGTCAGAAGAGTAAGAGTCAGTTGTTTAGTCAGAATTCC 
GGTTACAGCTACAGTTTAAAGCGTACGTAAATTAAACTTAGTTCAAATCGCATTTCAGCGGTGAGTGCGTGCAGCTGAGAGCACAAGGCTGCCAG 
CCCCTGGACTCCTGGCATCCCGATCCTGGTCCTGATCCCGCATCGGAACCCGCATCCGTGTCCCAGGAGCCGCAGGAGTGGTGCTGGCCAGTCAG 
TCAGTTAATCAGTGAGTCAGTCGTTTGCACTTTGTAAAAGAAAATGAAGATGGTATCATTGTAATCATGTATGTCAACAAGTAAATACATATTCT 
TTTGCGTATAATACATAAAACTTCAGAATGTGTGGATGTGGATGATCCTGATGCAAGGTGGGGATGGTGGAGAAGTCAAATAAAGTTAAAGCATT 
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TAAAAAAAATAAAATAAAAAAAGAATCACGGCAAACCTAAACGTAAAGCAAAACAAAAGAA 
AGGAAACCAAACAAGAAACAAAAACAAAAATTGCGTAATAAACTTTATAA^ 

CACTATAACAAATTTTTGTCTTATTATGTTTAAGTCCCTAATTTATATACAAACAAAAATAAGAATGATTTAAATAACATATATGTAAACTATGC 

ATTAGTGAAAAAGTGGAAAACTATTTATATATA 

(SEQ ID NO: 1147) 

Exon: 1001. .1098 
Exon: 1988. .2340 
Exon: 2414.. 3973 
Start ATG: 2622 

Transcript No. : CT29492 

GAAAAGTTACAGATTGGTGGCTTCGTTAACAAAATTTTCAGTTATTTGTGTAGTTGAAGACGGGAATTTGTTTTGCGTATAAAAAGTTAACACTG 
CGT AACTTT CTGC AGAGGTCT GAAAT AC ACCAAT CGCACAT ACCAAAC AAAAAGT AAT AAAAAAGAAAT AAT TGAAAAACCAAAT AGTTAAATT G 
GCAGAGCAAACACAGGACGACGACTATGAACTAAAGTCAGAAATTAATGGACATCGTACCGAGAATATAATCCGTGAATAGTACTATACGATAAA 
ACAGAAAAAGAAAAAAAGTCAACTAAAAAGCAAAAAACCAACTCTTGCAGAACACAGTTCATGATTTATCCATACTAGAAGTGCACAAAATAAAC 
GCCGTATGCAAGCGTGATCATCCAAGGCTCATTATCCGACTAAGCTACTGGCCAGCTTAAAAAAAGTAAGCCACCATCTAACAAGAGCGAATCGA 
ATCGAATCGACTTGGACGACCGACCGACCGACCGCACAAAGAAGGATCTATCTGGAAAGCCGGCTAGCCGATTCCCGCATCCGATTCCGAATACG 
AATACGTATCCGTATCCGAATCCGAATTTGAAACCGAATCCTCAACCAAATCATCCCATAAGGACGCCGCTGTCTTCAACGATCACAAGATGAAT 
GAACTAGACAGTCTCAGGCAGGAAGCCGAGTCCCTAAAGAACGCCATTCGGGATGCCCGGAAGGCGGCCTGCGACACATCACTGTTGCAAGCGGC 
CACCTCGCTGGAACCCATCGGCCGCATACAGATGCGCACCCGTCGTACATTACGCGGCCATTTGGCGAAAATCTACGCCATGCATTGGGGCAACG 
ATTCAAGGAATCTCGTATCAGCCTCACAGGACGGCAAACTGATCGTTTGGGACTCGCATACCACGAACAAAGTCCATGCCATTCCACTGCGATCC 
TCGTGGGTGATGACCTGTGCGTACGCCCCATCCGGTAGCTATGTGGCCTGCGGTGGCCTCGACAACATGTGTTCAATTTACAACCTAAAGACGCG 
CGAGGGCAACGTCCGGGTGTCCCGTGAGCTGCCCGGCCATGGTGGCTATCTATCGTGCTGCCGCTTCCTGGACGACAATCAGATCGTGACCAGCT 
CCGGTGATATGTCGTGCGGATTGTGGGATATCGAGACGGGACTGCAGGTAACCTCGTTTTTGGGCCACACCGGCGATGTGATGGCCCTCTCACTG 
GCGCCCCAATGCAAAACGTTCGTATCCGGCGCCTGCGATGCGTCCGCCAAGCTATGGGACATCCGGGAGGGTGTCTGTAAACAAACCTTCCCCGG 
CCACGAATCCGATATCAATGCGGTCACATTTTTCCCGAATGGTCAGGCATTCGCCACCGGTTCGGACGACGCAACCTGTCGATTGTTCGATATCC 
GTGCCGATCAGGAGTTGGCCATGTATTCGCACGACAACATCATATGCGGCATCACATCGGTGGCATTCTCGAAGAGCGGACGTCTGTTATTAGCG 
GGCTACGATGATTTCAACTGCAATGTATGGGACACGATGAAGGCAGAACGGTCTGGCATACTCGCTGGCCACGACAACCGTGTATCCTGTTTGGG 
TGTCACCGAGAACGGCATGGCGGTGGCAACAGGATCGTGGGACTCCTTCTTGCGTGTATGGAACTAAAAATAAAACAGAAACAGAAACAGCAGCA 
TAAGCAGAAACAGAAACAAAACAAAACAAAAAACAGAGAAAACC 

TGCATTAAAGGGTGGGGTGGGGTGTCAGGTCAACGGTTGCGCAGTCTGCGGATAGTGTTTCACCCATCATCTACATTGTGATCTAATGTATATGA 
TTCGGTGTTACCGTCTCTGTGGTAAATATTAAATGAGTGCCGGACGTTTTGAATATGTGGGCGCAGCTAGTTGACTAGAACGTAGACCACAACAA 
CAAAGAACTTTATATA 
(SEQ ID NO: 1148) 

Start ATG: 660 

MNELDSLRQEAESLKNAIRDARKAACDTSLLQAATSLEPIGRIQMRTRRTLRGHLAKIYAMHWGNDSRNLVSASQDGKLIVWDSHTTNKVHAIPL 
RSSWVMTCAYAPSGSYVACGGLDNMCSIYNLKTREGNVRVSRELPGHGGYLSCCRFLDDNQIVTSSGDMSCGLWDIETGLQVTSFLGHTGDVMAL 
SLAPQCKTFVSGACDASAKLWDIREGVCKQTFPGHESDINAVTFFPNGQAFATGSDDATCRLFDIRADQELAMYSHDNIICGITSVAFSKSGRLL 
LAGYDDFNCNVWDTMKAERSGILAGHDNRVSCLGVTENGMAVATGSWDSFLRVWN* 
(SEQ ID NO: 1149) 

Name: G protein beta-subumt 13F 
Classification: signal_transduction 
Gene Symbol: Gbetal3F 
FlyBase ID: FBgn0001105 



Celera Sequence No. : 142000013384690 

TTAGTACAATCGCTTGTAGGTCAACAGTAAAACTAACTAAATACTAAACCTATTCCTTGGAGACGGGTTTGCAGGCTATCATGTACACTGGAAAG 
AGTCCCTTGGGAAAGTTTTGTTGCTTGACCTTGCGTACAATGCGAAATCCGGCTTCCTTTAGGAAGTGTTCATAGCTGTCCAGAGGCCTCGTTAC 
CGACGAGTCGTTCCTATCTTCCACTGTCTTCTTCGAGCTGCTCACGTTCTCCTTCAGACACAAGAAGGCGCCGGGCGCCAGTCCCTGCTTTATGC 
GTCGAAAGAACGAGACCAGATCGCGGTCCGTGAGATGGCCCAGGACCCACTGGGTCCAAACGAGGTCGTACTGTTGCGTAGGCGTAAACTTCTGC 
AATCCCACGTTATAGATCTGTCCCACCTTTCCACGGCTTCCGTCTTCCGAAGTGCAGTACTCCCGTGCTTTATCGGCAAACGCGGGATCCTGCTC 
GACAAGATCCACGCAGCTGAAGCGGGGGATCAGGAGATTCCGAGTCACCCGACCAATGCCAGCTCCGCAGTCGAGGGCCAACCTGTTGCCCGGCA 
CGCGGATCTCGCGCAGGAACACATTCGATCCCTGTATATCTATGGCACTAATGTAGCCCAGGCCGCCGAGCATCCCGTTAACAGTCGCCGGTACC 
TCAGACCAATATTTTTGTGCCTTATTGTAAAATTCGGACTCCGGTGCGGCGACTTTGGTCGAACTATCGCTCGAAGAAGGGGACGCTGTCTTGGC 
ATCTGAACTTGCGGCTATACTGCTATCATCCTTCTGCTTCGACGACCCCTGGACCTTATCCGTGGTCTCGTCCATCATCTGCAACTTGTCTGAAA 
GCTGCTCTTCTAATGTAGTCGTCATTTTTGTCAGGATTTCGAAGGCCGCATGCTTTGTTTTGTTTTAGTGCGATAACCAGGGCTCCTAAACACAG 
TGAGCCATTTAGGTTTTACGAAAGAGTGGCAACCCTTCACCAATCAGCTGTTCAACAAGCAACAAAATTGTCCGTTTGTTTATGTTACTAGTTTT 
TTGTTTAATTTTTCTATTGTCCGGACAAAATGACTCTCGATTTGGATGCGCATAAGAAAGATGACAAGTTGCTGATCACAACCATACAGCAGGAG 
TATAAAATTCTGGCCGAATAGTGAGTTTTCAGAGTTCACTGATAACTACGTGGAGTAAAACTAAACTTATTTAATTTCTTATGTTTTAGCAAAAT 
GATTGAGTCGGAGAAGTTGAGTGGGATATATGTGATACCCAGCTATGCTAACTCCTTGCGTTAGTCTACCCACCTGCCAACCTGTATGCAGTCTT 
TAGCTAATTTTATCTTTTGTTAATAGAGTGGTTTGGAGTCTTCTTTGGCCGACAGGGCTTGTATGCGGAAAGTGTTTTTCGATTCACAATTTTGC 
TCCCAGATCGTTTTCCCGACGACAAAAGTCTTCCGGTGAGTAGGATCGGCAATAGAGCAAACGTATTTAACTCTACAAGTCCCCTTTAGAGCATA 
ATTTTCCAGCAGGATGTGATTCATCCCCATGTGTGCCCGTACACCCACAGTTTGGACGTGAGCCACGCCTTCCCGGAGTGGCGCTGTGGAGAGGA 
TCACCTTTGGCAGCTTTTAAAGTACCTGCAAGTCATTTTCTCCGATCCCTTGGACAGTATTCGGGGCATTGAGGTAGATAAGCTCAAAAACAGCG 
AGGCTGCGGAGCTGCTGATGAATAACAAGGAGGAGTATGTTGCTCGCGTACAGGAAAATATCAAGGAGAGCAAGGAGCATATATTTGACACCCCA 
CCCACCGAAGATCCGCACTACATAGTATTCGAGAAGTTCCAGCAGGACGTGCACGGGCCCGTTCTGGAGCGCATCAAGGCCGGAAGAAGCAAGCT 
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GACGGAGCCTTCAGCGCAACAAGCAAATGGTGGACATGCAACTGGTCTGTCATGGGTTAAGGAGGGCGAGTTTAAGCCACTTAGCATTGAGTAAA 
ACCAATATACATATATTCTGTAAGTTATATTCTATCGTTGTCTCAATTTTAAAACGATTAAAGTTATTCAATTATTTACAATCCTCGTTCATGGG 
AAATCGCTGAGTATTGTTAACCCACTCGTTCTGCAACTCTAAACGTTCTCCGTTCCATTGGAAATGTGTGGGGGAAGCCTCGCCAGCGAGAAACT 
CAAAGTGCTGGCTGAGCTCGATGGCCAGTTTGGAGCGGACTAGGCCTATTCCGCCCACCTGTGGCGCAGCTGGATGATAGACCCTTCCGTTTCGC 
GGATCCATGTACAGTTTGTGGGGTTCGTAAGGCAGGGTAAGGACCTGTCCCGCGTGTGCATAGCTGAGCACCTCCGTATCGTCCTTCCTCAGCTG 
TTCAGTAAACACAACAGGTGTGTCATCGCACCTAATAAAATTCCTTTCGCGGCCACAGAGGGAAATGTATGGAAATTCGCTTTCATATCTGCTCG 
TTTTGTTTAGCCGCAGTCGGTTGAAAAAGAACTTGAGGAAATCCTTCTCCTTAAAGCAGCTGGTGAAGTTTTTCATCTTTGCATCGTCCAGGAAG 
AGCTGCAAAACGGGTTATGCGAGTTGAAGCACAGATGCATAGTTGAGAGTCTCACCATTCCCTCATGATCGATATAGTAAAAGTATTCGCGAATT 
TTTGGCTCCGGAGATTGACCTTGGTTATATTGTACGTAACGGCAAGATATGCTTGGAGATCTAACGAATTTTAGGTTTGTAAGGGATCTAGAACG 
TAGTATAAACATTGTTCCTCTATCTGAGTACTGTGTCTTCTTATGAGCAACAAAAACAGCTGACTTTTGTTTACTACATCTGTAGGTGCATTTGA 
TAACCTCCCATTATCAATTTAACGTTTTTACAGTTTTGTATTTTCTCAATAGGTAGCGATTATTTTTGCAAGTACATTTTTTGGCAGTCACATAT 
TTTTGCATCGCGAATACAGCGATATACAGAAATGAAAACATTTTATTTAAAAAAAGGTGGCTATGTGACTGTCTATTGAATTCGACATTTGGTCG 
AAAAGCTCAGGTTTTGCTTAAAATAGCA 
(SEQ ID NO: 1150) 

Exon: 1001.. 1160 
Exon: 1230.. 1294 
Exon: 1357.. 14 60 
Exon: 1515. .2068 
Start ATG: 1075 

Transcript No. : CT29563 

TTCAACAAGCAACAAAATTGTCCGTTTGTTTATGTTACTAGTTTTTTGTTTAATTTTTCTATTGTCCGGACAAAATGACTCTCGATTTGGATGCG 
CATAAGAAAGATGACAAGTTGCTGATCACAACCATACAGCAGGAGTATAAAATTCTGGCCGAATACAAAATGATTGAGTCGGAGAAGTTGAGTGG 
GATATATGTGATACCCAGCTATGCTAACTCCTTGCAGTGGTTTGGAGTCTTCTTTGGCCGACAGGGCTTGTATGCGGAAAGTGTTTTTCGATTCA 
CAATTTTGCTCCCAGATCGTTTTCCCGACGACAAAAGTCTTCCGAGCATAATTTTCCAGCAGGATGTGATTCATCCCCATGTGTGCCCGTACACC 
CACAGTTTGGACGTGAGCCACGCCTTCCCGGAGTGGCGCTGTGGAGAGGATCACCTTTGGCAGCTTTTAAAGTACCTGCAAGTCATTTTCTCCGA 
TCCCTTGGACAGTATTCGGGGCATTGAGGTAGATAAGCTCAAAAACAGCGAGGCTGCGGAGCTGCTGATGAATAACAAGGAGGAGTATGTTGCTC 
GCGTACAGGAAAATATCAAGGAGAGCAAGGAGCATATATTTGACACCCCACCCACCGAAGATCCGCACTACATAGTATTCGAGAAGTTCCAGCAG 
GACGTGCACGGGCCCGTTCTGGAGCGCATCAAGGCCGGAAGAAGCAAGCTGACGGAGCCTTCAGCGCAACAAGCAAATGGTGGACATGCAACTGG 
TCTGTCATGGGTTAAGGAGGGCGAGTTTAAGCCACTTAGCATTGAGTAAAACCAATATACATATATTCTGTAAGTTATATTCTATCGTTGTCTCA 
AT TT T AAAAC G AT T AAAGT T AT T C AAT T 
(SEQ ID NO: 1151) 

Start ATG: 75 

MTLDLDAHKKDDKLLITTIQQEYKILAEYKMIESEKLSGIYVIPSYANSLQWFGVFFGRQGLYAESVFRFTILLPDRFPDDKSLPSIIFQQDVIH 
PHVCPYTHSLDVSHAFPEWRCGEDHLWQLLKYLQVIFSDPLDSIRGIEVDKLKNSEAAELLMWNKEEYVARVQENIKESKEHIFDTPPTEDPHYI 
VFEKFQQDVHGPVLERIKAGRSKLTEPSAQQANGGHATGLSWVKEGEFKPLSIE* 
(SEQ ID NO: 1152) 

Classification: enzyme 



Celera Sequence No. : 142000013384486 

ATTAAAACTTAATTTTACAAAAAAAAAATTAAACGAAATGTTTGTTTGGTTGGCGGGGGGGGGGGGGTGTTTTCTCGATGCGAAGCTTTGGGAGC 
TTTTAAGCGCCGTCAACCGAAATGGGGGCTCGTCAGCGGCTCTCAAACTGTGCCGAGGAAGTTTGAGAGCCTATGAAGGCGGTAACTAATTAATA 
AATTTCCTTTTATTAGCACATGCCTTTTAAGTTTACACAATTTTAATAGTTTCTAAATGAAAGTGAAATGTAACTATTATTTATTTGATTAGTTT 
TTTTTCTTTATTCAGTGGTTTCTATTTATGCTAGTGCCCTTTTTACGCTGCTCTTTCAAATGCCAATCATCGACGTTTCTTTATTTTTAGCCATT 
TCCCACCTGGGCACGCAAATGGCAATTGATTAGCATAATTAATGGCAAAAACTTTGCAATAAAGTAAAAAGTTCGAAGAAGGGCCCCGAGCCAAG 
TCAAATGACGGAGGCGCAAAAGCGAAAGTATGTTAGCATAAATCGATCAAGGAGGAGCAGCCGCCGGCCGAAAGGGGGAATCCCACCGAGGTGCC 
GTAAGACGGGACCGCAAGGTGCAGGAGCAGCCGGCGGCTCCTTCGCCGGATTTGAGCGCTGTGTGCCCAATTGGATTGACTGTCAGGTGACACAC 
GGCACAAGCGCATTGGACGAGCACATGCACACGGTCGTATGGCATTGGTACACTGAGAGAAAGGGAGCCGAGGAATTGGAAATATTTAGGATAAT 
AGCATATCATTCTACAATTTGGAAAAGAGGAACATTATGTGCGTTCTCAATAATGAATTTATATTACTACTCCCATGTTTATCATGCTAAGATGC 
ATCATAGTATTAGGTTAAACACCCTACTTTGGTGCAGTGTATGCTAAAGGGAAGGGGGGGGGGGAGGTATGACTGGGGCTAGTCAGCAACATAAC 
AATAAATACCGCTAGGCAACCGTAAATTCCCTAGCATGAATCGATTTAACATGACCGCGGGAGCGGAGGGCAGGCCACTCAGTTCGCCAGCGAAG 
CTCCTCCGTCCAACTCCGGCCGAGATTCCGAGCCAGGCAGCCAGCCAGCCAGGTACATACGATTTCCTGCCCGGTCTCCAGAGAAGTGTGCCAGG 
TGGGTATAATCGAGTGTAGAGATTTGCAGTGACAGTTCCACTTTTTACCCAGCGCTCCTTGCATACATTTCTTATATACATACTTCTTATGCAAT 
CTTAATCGCACCCAATCCCAGCGAGAAACATGTTGCCCAACCTGAAGTCGCGCCGCATCAGCGGCCTGCAGCTGGCCGGGAAGCACCTGGGTCCG 
GTGGCCAAGTGCCCGCCCAGATATTCCGAGGAGGACTGGGACTATAACAACAAGATCAAGTTTCGAATCACCTGCGACCAGGAGAAGCTGGCGGA 
GCGGATCGTGGAGTGAGTATATCGGGCGGGTGGCGGATTCATTGGGAGCGGATTCAAGTTGGTGACCCTATTCTATGCAGGGAGTCGCGTCGCGT 
GGTGGACGAAACGAAGGACACCACAAAGAACTGGCAACGGGAGGTGGAGCACCATATGCGCGAACGGACGAGCGAAATCCGATTCCTGGTGGACG 
AGTTGAATCGCCAGAAGAAGACCGCCGCCCTGGAGGATGAGGCCTTGAACACCTATCGCAATCGCGTGCTGAACTGCATCGAGTTCCTGAAGGAC 
AAGTCGCTGGCCATTTGCAAGCAGTGCCTGATCCTACGCGAAGGTCGGATCGGTGTGGATTTGTGCGACGACGAGGTGGATCGCTCGCTGCGCCG 
TGAGCTCAAGGTGATCAAGGGCTGTCAGGGATTGGCCGATGCCGCCCTCAAGGAAGCCGAGGAGCAGATCCGCAAGCTGCGCGCCGCCATCTATT 
TGCTCGATCAGGATCTGGCCGCCAAGGATAAGTCACTGGCCATCGACGAGAAGAACCTGAAGCTCAAGGAGTTCCAGCACGACCTGGCTAAGGGA 
CAGGATTTGTCCAAACAGCATTGGTAGGTTTTTGAAAACTAGGTTGCATGTCAATCACACATGTTGGGGTTTCCTTTTTAAGGAATGATGAAGTG 
GATTGATTATTAATCTTTTTTCTTTCACTTTTATGCGTAAGTCATTTCGGTGTTCATCAGTACAAGGAGTTTAAAAGGATAAACATTTTACTAGA 
GTCTTATCCACAAATTCCATTACAAAGTCAAGTAGTTGTTCCCCTTTTCAGTTAGGAATGGAATTGAGGAAGGTCAAACCTAACAATGAAACCGC 
AGTTTCCTTGGTGATATGGGATAGAGTTGTCCTGAGTCTATTTTAGTGCATTGGAATTAAGTGATTTATTTACAACCACTCAAATCGACCTGATA 
GTTGCCATGACTCCACCATGCCTTTACCCCCTTCTCTTCCTTACCCAGCCAATTTTCGCTGACCGAGTGGCAGGCGCAGACGTACGAGAATCTGG 
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AGGCCAACGCCAAGGCGCTGGTGTCCGCCGGGCAGCTGCGGGCCTACATCGATTTGCTGCTGAAGCAGGTGTGCGAGGATATGCAGAACCAGACG 

GACAGGACCAACGAGGCCTTTGAGCGAAGGATCTCGGAGACGAAGCACGTGAAACAGTGTTTGGAGAACAAGCACAAGGACACCATGGATCACAT 

CCATCAGGTGCAGCGCAACATGACCGAGCTGGAGAAGGAGATGATGGACAAGCAGCGGGCCATTACACTGTGCCAGACGCGGCTCAGCAATCGCG 

CCCATCGTCCTGGCCTGGAGCTGACCTGCGACATGGTCCAGGATGCGCTTTACAACGAACTGCAGGCGCTGAAGGCATCCGTATGCAAGCTCAAT 

CAGAAGCTGAACGAGAACAAGGCATCCATGCGCTACCTGATGCACGTCCAGGTGATGCAGGAGGAGGAGATCAACATCAAGGCGAACACCTGCAA 

GATCGACGAGGTCGACTGCATGACCCTGCGACAGGCCCTGAAGTACCAAACATTTTAGTCGGGCAGCACTACATCCAGTACAGCACCAGTCCAGT 

AGTCGCCACACAGCCCAGCCACCAATAATCAAATTAAAATCTAATTATTAGATGCATGTGCATTTCGTGTTCGATCCCAAATTACTTAATAAATT 

GAAATTCACTTTCGATTTCCAGTTACCAAAACTATGGCTTTTTAATGCATTTTTAAATTGTTCATTTTATTCGAAATAGTTAATCATATATTTCT 

TTTGCTTAATTTTCGTGAATTTAAATTTCCCGCTTTGAATCTAGTATTTTTAGTTTTACCTGCCTCTATTCGATATTTACCGCTAAGCACAGACA 

TATAGTTATCGATAACGTGAACGCAAATGCGAGGGAGTATTGCAAATAGTGTTACCGTGCTGCTGTGAAAATTCCCAGGTGTGACCGCAGCCCGC 

ATTCGATATTGATGAAGCCAGGCAAAACGTAGAAAAGCAAAAGAAAACCTTAATTAATGTTAAGTTAAAAGTGTATAAACCCGAGATTAAGGTAA 

GTGGCCCACCACCATCTAGATTTTGAGTACACCGAAGGCATGGCGATGTTGGCAGTGGCGGGAGCGTTACGCCTCTACCGCCACCGCCTCCGCCG 

CACACGATAAAAAATGGAGTACAGCCACGAATGGGCAAATCAATAATTTTCAACACTCCAATTGTTTTTACAACTGAAATCAGTCCTAATCTTGT 

TTTCCGCAGTGCAAGGATGTCGAAACGGAGCGCCGACGACGCCACCGGCAGCAGCTGCTTGGTGGCCGCCGCAGCCGCCGGCCAGCCGCCCATCA 

AGAAGGTGCACTTCGAGCCCCATCTGATTGGGCCGGTGTCCACGCTGGAGGAGATGGACATCAAGGTGCTGGAATTCCAGAACAAGAAGCTAGCA 

CAGCGCATCGAACAGCGGATGCGAACGGAGGCGGAGCTGCGACATCGAATCGAGCAGCTGGAAAAGCGGCAAACGCAGGACGACGCCGTTCTAAA 

CGTGGTCAATCGG 

(SEQ ID NO: 1153) 

Exon: 1001.. 1139 
Exon: 1257.. 1437 
Exon: 1506.. 2018 
Exon: 2424.. 3003 
Start ATG: 1001 

Transcript No. : CT29571 

ATGACCGCGGGAGCGGAGGGCAGGCCACTCAGTTCGCCAGCGAAGCTCCTCCGTCCAACTCCGGCCGAGATTCCGAGCCAGGCAGCCAGCCAGCC 

AGGT AC AT AC GATT T CCTGCCCGGTCT C C AGAG AAG T GT GC C AGC GAG AAAC ATGTT GC CC AACC T G AAGT CGCGCCGCATCAGCG GC CT GC AGC 

TGGCCGGGAAGCACCTGGGTCCGGTGGCCAAGTGCCCGCCCAGATATTCCGAGGAGGACTGGGACTATAACAACAAGATCAAGTTTCGAATCACC 

TGCGACCAGGAGAAGCTGGCGGAGCGGATCGTGGAGGAGTCGCGTCGCGTGGTGGACGAAACGAAGGACACCACAAAGAACTGGCAACGGGAGGT 

GGAGCACCATATGCGCGAACGGACGAGCGAAATCCGATTCCTGGTGGACGAGTTGAATCGCCAGAAGAAGACCGCCGCCCTGGAGGATGAGGCCT 

TGAACACCTATCGCAATCGCGTGCTGAACTGCATCGAGTTCCTGAAGGACAAGTCGCTGGCCATTTGCAAGCAGTGCCTGATCCTACGCGAAGGT 

CGGATCGGTGTGGATTTGTGCGACGACGAGGTGGATCGCTCGCTGCGCCGTGAGCTCAAGGTGATCAAGGGCTGTCAGGGATTGGCCGATGCCGC 

CCTCAAGGAAGCCGAGGAGCAGATCCGCAAGCTGCGCGCCGCCATCTATTTGCTCGATCAGGATCTGGCCGCCAAGGATAAGTCACTGGCCATCG 

ACGAGAAGAACCTGAAGCTCAAGGAGTTCCAGCACGACCTGGCTAAGGGACAGGATTTGTCCAAACAGCATTGCCAATTTTCGCTGACCGAGTGG 

CAGGCGCAGACGTACGAGAATCTGGAGGCCAACGCCAAGGCGCTGGTGTCCGCCGGGCAGCTGCGGGCCTACATCGATTTGCTGCTGAAGCAGGT 

GTGCGAGGATATGCAGAACCAGACGGACAGGACCAACGAGGCCTTTGAGCGAAGGATCTCGGAGACGAAGCACGTGAAACAGTGTTTGGAGAACA 

AGCACAAGGACACCATGGATCACATCCATCAGGTGCAGCGCAACATGACCGAGCTGGAGAAGGAGATGATGGACAAGCAGCGGGCCATTACACTG 

TGCCAGACGCGGCTCAGCAATCGCGCCCATCGTCCTGGCCTGGAGCTGACCTGCGACATGGTCCAGGATGCGCTTTACAACGAACTGCAGGCGCT 

GAAGGCATCCGTATGCAAGCTCAATCAGAAGCTGAACGAGAACAAGGCATCCATGCGCTACCTGATGCACGTCCAGGTGATGCAGGAGGAGGAGA 

TCAACATCAAGGCGAACACCTGCAAGATCGACGAGGTCGACTGCATGACCCTGCGACAGGCCCTGAAGTACCAAACATTTTAG 

(SEQ ID NO: 1154) 

Start ATG: 1 

MTAGAEGRPLSSPAKLLRPTPAEIPSQAASQPGTYDFLPGLQRSVPARNMLPNLKSRRISGLQLAGKHLGPVAKCPPRYSEEDWDYNNKIKFRIT 
CDQEKLAERIVEESRRVVDETKDTTKNWQREVEHHMRERTSEIRFLVDELNRQKKTAALEDEALNTYRNRVLNCIEFLKDKSLAICKQCLILREG 
RIGVDLCDDEVDRSLRRELKVIKGCQGLADAALKEAEEQIRKLRAAIYLLDQDLAAKDKSLAIDEKNLKLKEFQHDLAKGQDLSKQHCQFSLTEW 
QAQTYENLEANAKALVSAGQLRAYIDLLLKQVCEDMQNQTDRTNEAFERRISETKHVKQCLENKHKDTMDHIHQVQRNMTELEKEMMDKQRAITL 
CQTRLSNRAHRPGLELTCDMVQDALYNELQALKASVCKLNQKLNENKASMRYLMHVQVMQEEEINIKANTCKIDEVDCMTLRQALKYQTF* 
(SEQ ID NO: 1155) 

Name; TEKTIN 

Classification: cytoskeletal_structuralj?rotein 
Celera Sequence No. : 142000013384486 

ATTACACTGTGCCAGACGCGGCTCAGCAATCGCGCCCATCGTCCTGGCCTGGAGCTGACCTGCGACATGGTCCAGGATGCGCTTTACAACGAACT 
GCAGGCGCTGAAGGCATCCGTATGCAAGCTCAATCAGAAGCTGAACGAGAACAAGGCATCCATGCGCTACCTGATGCACGTCCAGGTGATGCAGG 
AGGAGGAGATCAACATCAAGGCGAACACCTGCAAGATCGACGAGGTCGACTGCATGACCCTGCGACAGGCCCTGAAGTACCAAACATTTTAGTCG 
GGCAGCACTACATCCAGTACAGCACCAGTCCAGTAGTCGCCACACAGCCCAGCCACCAATAATCAAATTAAAATCTAATTATTAGATGCATGTGC 
ATTTCGTGTTCGATCCCAAATTACTTAATAAATTGAAATTCACTTTCGATTTCCAGTTACCAAAACTATGGCTTTTTAATGCATTTTTAAATTGT 
TCATTTTATTCGAAATAGTTAATCATATATTTCTTTTGCTTAATTTTCGTGAATTTAAATTTCCCGCTTTGAATCTAGTATTTTTAGTTTTACCT 
GCCTCTATTCGATATTTACCGCTAAGCACAGACATATAGTTATCGATAACGTGAACGCAAATGCGAGGGAGTATTGCAAATAGTGTTACCGTGCT 
GCTGTGAAAATTCCCAGGTGTGACCGCAGCCCGCATTCGATATTGATGAAGCCAGGCAAAACGTAGAAAAGCAAAAGAAAACCTTAATTAATGTT 
AAGTTAAAAGTGTATAAACCCGAGATTAAGGTAAGTGGCCCACCACCATCTAGATTTTGAGTACACCGAAGGCATGGCGATGTTGGCAGTGGCGG 
GAGCGTTACGCCTCTACCGCCACCGCCTCCGCCGCACACGATAAAAAATGGAGTACAGCCACGAATGGGCAAATCAATAATTTTCAACACTCCAA 
TTGTTTTTACAACTGAAATCAGTCCTAATCTTGTTTTCCGCAGTGCAAGGATGTCGAAACGGAGCGCCGACGACGCCACCGGCAGCAGCTGCTTG 
GTGGCCGCCGCAGCCGCCGGCCAGCCGCCCATCAAGAAGGTGCACTTCGAGCCCCATCTGATTGGGCCGGTGTCCACGCTGGAGGAGATGGACAT 
CAAGGTGCTGGAATTCCAGAACAAGAAGCTAGCACAGCGCATCGAACAGCGGATGCGAACGGAGGCGGAGCTGCGACATCGAATCGAGCAGCTGG 
AAAAGCGGCAAACGCAGGACGACGCCGTTCTAAACGTGGTCAATCGGTATTGGAATCAGCTGAACGAGGACATACGAGTCCTGCTGCAGCGCTTC 
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GATGCCGAGACGGCCGACGAACTGGAGAACCGGAATGAGAACGAGGTGACCACCTCGTTTCTCGCCCAACTGTCCACCTGGGACAAGGAGGAGCT 
GGACGAGAAGCTGGCCAACCGCGTCCAAGTGTCCAAAAGGGCGGTCGCCAAGATCGTACAGGTTATTGATAGGTTAATGCAACGCAATGAAAAGA 
TAACTCATGTCTTGAAAGGTGATAGTCTGGCTTCAGCTGGCTCCGGTTCAGGAGCGGGAGCTGGCGGCGAGGAGGAGCAGCAGCAGGCCAGCGGC 
GATGCGGAGACCACAACGTCATCGGCCGGTGTCCATGCGCTAGAGGAAACCCTGAAGCAAACGCACATCGAGATCATGAGCGAAAATCACAAGCT 
GCAGAATCTGAATACCTCGCTGCACGAGAAGTTCCACACCATGTCGCTGAAGATGAAGGAGTACCAGGACGCTCATACCGCCAAGGAGACGGAGA 
ACGCCGAACTGAAGAACCAGATAGACGAACTGCAATATGACCTAGAGAAGATCCATTGCCGGAATGATAAGCTGGAGAACCACTTGGCCGAGGCT 
ATTGAAAAGCTGAAGGCCTACCACCAGATTTACGGTGATCCCAATAAGAGCACTAATAGTGCCAAGACACCAACTACCACAGGCAGCGGCGGCGC 
CACCACCAGCGTTAATAGCCAGCTATTGGAGGAGCTGCAAAAGGAGCTGGAGGAGTACCGCGAACTGGCCAACAACCGACTGCAGGAGCTGGATA 
AGCTTCACGCCACGCATCGCGAAACCCTGAAGGAGGTGGAGAAGCTTAAAATGGATGTAAGTTTTGAATGAATAATGTATTAGCTTATCTGGAGC 
ATTGCTAATTATAGTTAGTCATTTCGTTTATATTTTCTTTGTATTATACTGATTCATCCTTAAAATCCATTCCAGATACGACAGTTGCCGGAATC 
GGTAATTGTGGAAACCACGGAGTACAAGTGCCTGCAGAGCCAGTTTTCCGTGCTCTACAACGAGTCCATGCAGATAAAGACCATGCTGGACGAGA 
CCCGAAACCAGCTTCAAACGAGCAAGAACCAGCATCTGCGACAAATCGAAGTGATGGAGAGCGAGGAACTTATTGCCCAGAAGAAGGTGCGCAGC 
GAGATGATCCAGATGGAGGACGTTTTAGCCCTCATCCGAAAGGAGTACGAAACGCTACGCATTGAGTTCGAGCAGAACATGGCAGCCAACGAGCA 
GACGGCACCCATCAATCGCGAGATGCGCCACCTGATCACCTCTCTACAAAACCACAATGGCCAACTGAAGGGTGAGGTGCAGCGCTACAAGCGAA 
AGTACAAGGACACGTCCACGGACAACTTGAAGCTGCGTCAGGAGCTGGCCGATGCACTAGCTACACTCGAAGGCAACAAGTTGCAGGCGGCGACT 
GGTGCGGCTGGCGAAGAAATCAAGCAGGAGAACTCAACAGGTGTGAAGGAGGAGAACTCCAACAACGTTTCCGCCAGTGGACAAACAAACCAAAC 
AAATTCCGGCAACGATACCAACGTAGCCATCAAGGAGGAGAACCATATCTCGGCTGAAGATGAAGCTGATGACGAGGCCAGTGGTAAGGACGTAA 
AAGACGGCATCAAGCAGGAGAAGCTCAGTTCTGGAGACGCGGCGGCAGCTGAGAAGAAGGATTCCCCTGGACCCGGAAACTCAACAAGTTCTGCG 
ACCAATTCTGTTCCCGTCAAAAACGAAAAGGATTCCAAGGATGGTGTCAAGGGCAAGGATGTGAAGGCAGTGGAAAGTGAAACGGTGCGCGATCT 
GAAGGCGCAACTAAAGTAAGTTTGTTCAGCAATTACGATTATCTATTTAACATTGTTTGATTTACAGAAAAGCACTGAACGACCAGAAAGAAATG 
AAACTCCTGCTGGATATGTACAAAGGCGTATCGAAGGATCAGCGCGACAAGGTCCAGTTGATGGCCACCGAAAAGAAACTGCGCTCCGAGATCGA 
GGAGCTCCGCCAGCAGCTGAAGAAGCTGCAGGAAAGCAAGCGCGAGGAACGCAAGAAGCTGGCCGACGAGGAGGCGTTACGTAAGATTAAGCAAT 
TGGAGGAGCAGAAGTACGAGCTGCAAAAACAGATGGCCAACCACAAGCCAACGGATAATTCTTGGGGCAGTGGAGCCCCCGGCACAGCGAACTAC 
ACGCGACCGTTTGTTGGTTCCCACGAAGAGGAGGCACTGCTGAACGAGATGGAGGTTACCGGCCAGGCGTTCGAGGACATGCAGGAGCAGAACTC 
CCGGCTCATACAACAGCTGCGTGAGAAGGACGATGCGAACTTTAAGCTAATGTCGGAGCGCATCAAGGCCAACCAGCTGCACAAACTTCTGAGAG 
AGGAAAAGACGGTGCTGGAGGATCAGATGGCAACGGCGACCACGCAAATCGAAGCCATGCACATTGTGCTGCGAAAGCTGGAGGAGAAGGAACGC 
AGCCTGCAGGCCACCGTCGCGTCGATTGAGAAAGAGTTGATGCTGCGGCAGCAAGCAATGGAGATGCACAAACGCAAGGCGATTGAATCGGCGCA 
ATCGGCAGCGGACCTCAAGCTGCATCTGGAAAAGTATCACGCGCAAATGAAGGAGGCGCAGCAAGTGGTGGCGGAGAAGACCAGTTCGCTGGAGG 
CGGAGGCCTACAAGACCAAGCGGCTGCAGGAGGAGCTGGCGCAGTTTAAACGCAAGGCGGAGCGCATGAAGAAGATGGAAATGTCGGGCACCACC 
ATCGACGAGGTGATGATTGAGGAGATCCGCGAATACAAAGAAACGCTCACGTGTCCCTCGTGCAAGGTGAAGCGGAAGGATGCTGTCCTGTCAAA 
GTGTTTCCACGTCTTCTGCTACGACTGCCTGCGCACTAGATATGAAACGCGGCAGCGGAAGTGTCCCAAATGCAACTGCGCCTTCGGCGCCAACG 
ATTATCATAGGCTATATCTGCAGTAGATAAGCGATAATTTAGATACGTTCCGGTCATGTCCTATTCATACACAAATCCACCCACGCCACACACCA 
CTACCGCACACCCAGCATTCATGCCGGATTATTATTCATTATGCATATATTACCTACGTACTAATATTTGGTTAAGTAGTCTCCGAATAACTTTC 
CCCCACCAGATTTCCCATGGCAATTATTTGAATTCGTTCATTCTGGCGCCAAGCAAAAGCATCTCATTATGAGTGCCGCGAGATCGTGACGCGGT 
TTTTATGTAGCTTCTAGTTTTTGTAAATCAAACTATTTTTAAATGATTTCATTTGAAAACTAAATGTACGAACACTAATAAATGGCTAAAACCTA 
TTGTTTAGCATGCCCCGAGTTATTTAGATACCCCATATATACATAAGTGTAGGTTCTTTGTCTTGCATTCAGTTAGCATTATATTAAGCGTATTC 
TAATCTTAAGTTAAATACTCCTATTGTAATGATCAAACGTAGCATAGATATCAAATGTATAAACTTGGCGGAGAATCCGTTTAAAAGTCGCGCTT 
TGCTTTGAAGGGCTGCTAACTTGGTTATAATTGATAAGTAATTGTAATAATGTATGATCGAAAATTGTTGTTCAGCTGAACAAAACGAGTTTTGA 
TTGCTTTTTTGTTTGTGTGTAAATGTGTTCGTTCTTTATGTGTACTGTTTAACGAATAAAGAAACTAATGATCAGAAATCGTGACTTTCAATCAA 
TGTTATCTCTCTACATGAAATCCACCTGTTAGAGCTGTTAGCACCCACGCACTCATGTTAATAACATCGATGCTTATCTGAAACCTGTAACCATT 
GGCGCGCAACAGGTTATCGGGGTTTAAAACGCTATGCTGGTATTTTGTTCTCTACAGAAAGTAGGAAAGTGGCCATAACTATAATTTTATGTATA 
AACAGTTTGATCATATTCTAAATTTCTATTTTATGGCACAAGAACATAAAACGAATGGTTAATTTAACTCAACTGG 
{SEQ ID NO: 1156) 

Exon: 1001. .2146 
Exon: 2261. .2756 
Exon: 2796.. 3150 
Exon: 3203.. 4301 
Start ATG: 1001 

Transcript No. : CT29573 

ATGTCGAAACGGAGCGCCGACGACGCCACCGGCAGCAGCTGCTTGGTGGCCGCCGCAGCCGCCGGCCAGCCGCCCATCAAGAAGGTGCACTTCGA 
GCCCCATCTGATTGGGCCGGTGTCCACGCTGGAGGAGATGGACATCAAGGTGCTGGAATTCCAGAACAAGAAGCTAGCACAGCGCATCGAACAGC 
GGATGCGAACGGAGGCGGAGCTGCGACATCGAATCGAGCAGCTGGAAAAGCGGCAAACGCAGGACGACGCCGTTCTAAACGTGGTCAATCGGTAT 
TGGAATCAGCTGAACGAGGACATACGAGTCCTGCTGCAGCGCTTCGATGCCGAGACGGCCGACGAACTGGAGAACCGGAATGAGAACGAGGTGAC 
CACCTCGTTTCTCGCCCAACTGTCCACCTGGGACAAGGAGGAGCTGGACGAGAAGCTGGCCAACCGCGTCCAAGTGTCCAAAAGGGCGGTCGCCA 
AGATCGTACAGGTTATTGATAGGTTAATGCAACGCAATGAAAAGATAACTCATGTCTTGAAAGGTGATAGTCTGGCTTCAGCTGGCTCCGGTTCA 
GGAGCGGGAGCTGGCGGCGAGGAGGAGCAGCAGCAGGCCAGCGGCGATGCGGAGACCACAACGTCATCGGCCGGTGTCCATGCGCTAGAGGAAAC 
CCTGAAGCAAACGCACATCGAGATCATGAGCGAAAATCACAAGCTGCAGAATCTGAATACCTCGCTGCACGAGAAGTTCCACACCATGTCGCTGA 
AG AT G AAG G AGT AC C AGGAC GC T C AT A C C GC C AAG G AGAC GG AGAAC GCC G AAC T G AAG AAC C AG AT AG AC GAACT GC AAT ATG AC C T AG AG AAG 
ATCCATTGCCGGAATGATAAGCTGGAGAACCACTTGGCCGAGGCTATTGAAAAGCTGAAGGCCTACCACCAGATTTACGGTGATCCCAATAAGAG 
CACTAATAGTGCCAAGACACCAACTACCACAGGCAGCGGCGGCGCCACCACCAGCGTTAATAGCCAGCTATTGGAGGAGCTGCAAAAGGAGCTGG 
AGGAGTACCGCGAACTGGCCAACAACCGACTGCAGGAGCTGGATAAGCTTCACGCCACGCATCGCGAAACCCTGAAGGAGGTGGAGAAGCTTAAA 
ATGGATATACGACAGTTGCCGGAATCGGTAATTGTGGAAACCACGGAGTACAAGTGCCTGCAGAGCCAGTTTTCCGTGCTCTACAACGAGTCCAT 
GCAGATAAAGACCATGCTGGACGAGACCCGAAACCAGCTTCAAACGAGCAAGAACCAGCATCTGCGACAAATCGAAGTGATGGAGAGCGAGGAAC 
TTATTGCCCAGAAGAAGGTGCGCAGCGAGATGATCCAGATGGAGGACGTTTTAGCCCTCATCCGAAAGGAGTACGAAACGCTACGCATTGAGTTC 
GAGCAGAACATGGCAGCCAACGAGCAGACGGCACCCATCAATCGCGAGATGCGCCACCTGATCACCTCTCTACAAAACCACAATGGCCAACTGAA 
GGGTGAGGTGCAGCGCTACAAGCGAAAGTACAAGGACACGTCCACGGACAACTTGAAGCTGCGTCAGGAGCTGGCCGATGCACTAGCTACACTCG 
AAGGCAACAAGTTGCAGGCGGCGACTGGTGTGAAGGAGGAGAACTCCAACAACGTTTCCGCCAGTGGACAAACAAACCAAACAAATTCCGGCAAC 
GATACCAACGTAGCCATCAAGGAGGAGAACCATATCTCGGCTGAAGATGAAGCTGATGACGAGGCCAGTGGTAAGGACGTAAAAGACGGCATCAA 
GCAGGAGAAGCTCAGTTCTGGAGACGCGGCGGCAGCTGAGAAGAAGGATTCCCCTGGACCCGGAAACTCAACAAGTTCTGCGACCAATTCTGTTC 
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CCGTCAAAAACGAAAAGGATTCCAAGGATGGTGTCAAGGGCAAGGATGTGAAGGCAGTGGAAAGTGAAACGGTGCGCGATCTGAAGGCGCAACTA 
AAAAAAGCACTGAACGACCAGAAAGAAATGAAACTCCTGCTGGATATGTACAAAGGCGTATCGAAGGATCAGCGCGACAAGGTCCAGTTGATGGC 
CACCGAAAAGAAACTGCGCTCCGAGATCGAGGAGCTCCGCCAGCAGCTGAAGAAGCTGCAGGAAAGCAAGCGCGAGGAACGCAAGAAGCTGGCCG 
ACGAGGAGGCGTTACGTAAGATTAAGCAATTGGAGGAGCAGAAGTACGAGCTGCAAAAACAGATGGCCAACCACAAGCCAACGGATAATTCTTGG 
GGCAGTGGAGCCCCCGGCACAGCGAACTACACGCGACCGTTTGTTGGTTCCCACGAAGAGGAGGCACTGCTGAACGAGATGGAGGTTACCGGCCA 
GGCGTTCGAGGACATGCAGGAGCAGAACTCCCGGCTCATACAACAGCTGCGTGAGAAGGACGATGCGAACTTTAAGCTAATGTCGGAGCGCATCA 
AGGCCAACCAGCTGCACAAACTTCTGAGAGAGGAAAAGACGGTGCTGGAGGATCAGATGGCAACGGCGACCACGCAAATCGAAGCCATGCACATT 
GTGCTGCGAAAGCTGGAGGAGAAGGAACGCAGCCTGCAGGCCACCGTCGCGTCGATTGAGAAAGAGTTGATGCTGCGGCAGCAAGCAATGGAGAT 
GCACAAACGCAAGGCGATTGAATCGGCGCAATCGGCAGCGGACCTCAAGCTGCATCTGGAAAAGTATCACGCGCAAATGAAGGAGGCGCAGCAAG 
TGGTGGCGGAGAAGACCAGTTCGCTGGAGGCGGAGGCCTACAAGACCAAGCGGCTGCAGGAGGAGCTGGCGCAGTTTAAACGCAAGGCGGAGCGC 
ATGAAGAAGATGGAAATGTCGGGCACCACCATCGACGAGGTGATGATTGAGGAGATCCGCGAATACAAAGAAACGCTCACGTGTCCCTCGTGCAA 
GGTGAAGCGGAAGGATGCTGTCCTGTCAAAGTGTTTCCACGTCTTCTGCTACGACTGCCTGCGCACTAGATATGAAACGCGGCAGCGGAAGTGTC 
CCAAATGCAACTGCGCCTTCGGCGCCAACGATTATCATAGGCTATATCTGCAGTAG 
(SEQ ID NO: 1157) 

Start ATG: 1 

MSKRSADDATGSSCLVAAAAAGQPPIKKVHFEPHLIGPVSTLEEMDIKVLEFQNKKLAQRIEQRMRTEAELRHRIEQLEKRQTQDDAVLNVVNRY 
WWQLKEDIRVLLQRFDAETADELENRNENEVTTSFLAQLSTWDKEELDEKLANRVQVSKRAVAKIVQVIDRLMQRNEKITHVLKGDSLASAGSGS 
GAGAGGEEEQQQASGDAETTTSSAGVHALEETLKQTHIEIMSENHKLQNLNTSLHEKFHTMSLKMKEYQDAHTAKETENAELKNQIDELQYDLEK 
IHCRNDKLENHLAEAIEKLKAYHQIYGDPNKSTNSAKTPTTTGSGGATTSVNSQLLEELQKELEEYRELANNRLQELDKLHATHRETLKEVEKLK 
MDIRQLPESVIVETTEYKCLQSQFSVLYNESMQIKTMLDETRNQLQTSKNQHLRQIEVMESEELIAQKKVRSEMIQMEDVLALIRKEYETLRIEF 
EQNMAANEQTAPINREMRHLITSLQNHNGQLKGEVQRYKRKYKDTSTDNLKLRQELADALATLEGNKLQAATGVKEENSNNVSASGQTNQTNSGN 
DTNVAIKEENHISAEDEADDEASGKDVKDGIKQEKLSSGDAAAAEKKDSPGPGNSTSSATNSVPVKNEKDSKDGVKGKDVKAVESETVRDLKAQL 
KKALNDQKEMKLLLDMYKGVSKDQRDKVQLMATEKKLRSEIEELRQQLKKLQESKREERKKLADEEALRKIKQLEEQKYELQKQMANHKPTDNSW 
GSGAPGTANYTRPFVGSHEEEALLNEMEVTGQAFEDMQEQNSRLIQQLREKDDAMFKLMSERIKANQLHKLLREEKTVLEDQMATATTQIEAMHI 
VLRKLEEKERSLQATVASIEKELMLRQQAMEMHKRKAIESAQSAADLKLHLEKYHAQMKEAQQVVAEKTSSLEAEAYKTKRLQEELAQFKRKAER 
MKKMEMSGTTIDEVMIEEIREYKETLTCPSCKVKRKDAVLSKCFHVFCYDCLRTRYETRQRKCPKCNCAFGANDYHRLYLQ* 
(SEQ ID NO: 1158) 

Classification: hypothetical 



Celera Sequence No. : 14200001338448 6 

AAGAAGGATTAGGGATTATTTTTGGAACATTTCCAAATACTGCACTATATTACCAATCCCTGCCCGTAATCCTCGCCCCCCTCCTCGCCACCGGA 
TGTACCGATGTGCTCCTGGATCTTGGCCTCGAGCCCATTGACGTCCGCACCCTGGACGCGATCGATCTTGGTCCTGTTCCTGTAGAAGATGAATG 
TTGGCATGGCCGAAACGCCCTGTCCAGCAGCCGTGTCCTGGCACTTGTCCACATCCACTTTCAGGAAGATGGCCTTTGGGTACTTTGTTGGAAAC 
GTCTCGAAGATGGGCGCAATCCGCTTGCAGGGACCACACCACGAAGCTGTGAAGTCCACCACAACCAATTGAATGCCCGCTTGGGCCAACTCCGC 
CTGGAAGTGGGACTCGTCGTTGATCACGCGCACGGACATGGTGATAGGATTAGGTTTCTATTAATTGAGCTTTTGTTTCGGCAGCCGAATTGGAT 
TTAAGCAAGTAAATGTTATTATTAACGTTCAATGCAAATTTTTTTTGTTAAAGATGACTTGTAATATGCATTTAGTCCAAATTCGTGCTZIAGAAA 
AATACCGAATGCGGTATTCCACAAGCGGTCACACTGTGATGGTATCGATATTTCGAGCTCTTTGACTTCCTATTTTTAGAGGGACCATTTATGTG 
TAATAGAAAAAAACCGAAACTTAATATTTAAACTTTTATTGAAATATTAGTGGATTACAATATGTAAAACTATGAAATATTCTCATTTGATATAG 
CTCAAAGTGTTATTTAAAATTCATTCAGTGTTTACGACTAGCAATCTACGCTTTCACGCTCATCTTAAGCTTACCGCCCATTTGCCAGGGTTGTC 
AAGGCGAATGAGCGGTCCCACCATACACGCCACTGGAACTTTCGATACCTGCGCTGCGCCTGGCCACACGTTCATTACCTCGTGGTGTTTCAGTC 
GGTCGCATTTTCATTAAGTCGCCATTTTAAAATTATTAGAGTCAAGTACT^ATGGCAGATGTGGAAAAGGAGCCCGAGAAGACCATCGCCGAGGAT 
TTGGTGGTGACCAAGTATAAGTTGGCCGGCGAAATCGTCAACAGTGAGTATTCCTTGGCCGGAAACAGCGAACGCTGGCCGATTCCTGGAGTCGC 
TGCTACGTGGCGCTTACACAATGCACCGAATGCCGCTTTCCCTTGTGCGCCACGCGTTGGTTAATCTGCCTATTTCTGGACTCTGTCTGCTCGTT 
TAATTTTAGAAACCCTCAAGGCGGTAATTGGACTCTGCGTGGTTGATGCCTCCGTCCGGGAGATCTGCACCCAGGGCGACAATCAGCTCACCGAG 
GAGACCGGCAAAGTAAGTGGTGGCCACCTGGCGGTCATTCGCGCCAATTTCATGTCCAATGATTAAGACTTACACCTTTGAGGGTTTCCCGATGG 
CGAGCCATGTGCTGTGCGGGCTGGGGATCACCTCGTGGTCGCCAGGCGCACGCGGGGACTCCAATGCTCCACGTGCCCGGCTTGTGTGCTCTCCA 
AAAGGTCCCGAGGATTTACAGATTATGAGATCTGAGGACACACCGCGCACTATCATTGATATATAGTACAACGAACAAGCAATCTAATGCTTTTA 
TCGATCTTTCACAAACAGGTATACAAAAAGGAGAAAGACCTGAAGAAGGGCATTGCCTTTCCCACCTGTCTGTCCGTCAACAACTGTGTCTGCCA 
CTTCTCGCCAGCCAAAAACGATGCTGACTACACGTTAAAGGCCGGTGATGTGGTCAAAATGTAAGTTGAACCTCCTATTCCACATATACCGCCAC 
TAAATACGTAACATTTCTTTTCTACAGCGATCTGGGTGCCCACATTGATGGTTTCATTGCCGTGGCCGCTCACACAATTGTGGTAGGCGCTGCTG 
CGGATCAGAAGATCAGTGGTCGCCAGGCCGATGTCATCCTCGCCGCCTACTGGGCTGTCCAGGCTGCCTTACGTCTGCTCAAGTCCGGCGCCAAT 
GTGAGTCCTCCCTTACTTCTAGGTAATCCTCCGTTAATCCCTGCAAGAAACGGATTGTCTGCCGCGATTCTCCAGCGACTGAACATCTCAACACT 
TGCAAAGATCAGCTGTGGCAGCTGGTAATTGCCCTGGCCTATTATTCAGGACTGGAGGCTTCTTGTCAGTTGTCCACAAGGTTATTTCTTCTGCA 
GGCAACGGATTGACTGCGCTCAAACTCTGACACAGATCAGCTCAACACCTGCGGATAGAAACTGTGTCAATTTCGTGAACTGAACAAGTTCATTC 
CATAGAAGTGTTCGGTCTTTAAATTTGTCCACATCTCCAGTTTATAGATATGTCGGAATTGTAATCTGCAGGCAACGGATTGTCTGCTGCCTTAA 
CTCGTGGCTCAGCACAGCTCAACGTCTGCAGAGATCAACAGTGTCGATTTCGTGAACTGAACAAGTTTAGATACTTGAAATGTTCGGTCTTTAAA 
GTTGTCCACAATCGCAATGATAATGCCGATCAGTTATTGTTATTTTGCGTTATCTATAGTATACTATGATATTTGATTAAGATTAGTCAAAGGGA 
ATTGGAATGTTTTCTTTATCTCTGCTTTGAACTATTTCCATTTTATTTCATACTTAATATTTATGTTTCAATTCTGTATCCTTACAGAACTACTC 
CCTCACCGATGCAGTGCAACAAATCAGCGAGTCGTATAAGTGCAAGCCCATTGAGGGCATGCTCAGTCACGAGCTGAAGCAGTTCAAAATTGACG 
GCGAGAAGACGATCATACAGAACCCCAGCGAGGCGCAGCGCAAGGAGCATGAGAAGTGCACCTTCGAAACGTACGAGGTGTATGCCATCGATGTT 
ATCGTCAGTACCGGCGAAGGAGTGGTTAGTAATCCATCAATAGACACTACATCTCCACTAATTTGTTCGATGATTAAAAACACGCGCTTGAGGCT 
GACTTTGCTGGAATGCGGTGTTTGTTGCGAGAGTGACTTGTTTGCTCGGCGTTTTTTTATACTAAAATGCGGCACGTGCAGACACCAAGTTCCGG 
CTGGCTGTTGTCCGAAGATTGCAAGATTATGAGATCTGAGAACGCCAAATTTAAGCTGGATCCTGGATCATCGCAGCCAGAGCATTATTGCTAAC 
ATTATTCGTATTCGTTGCAGGGACGCGAAAAGGACACCAAGGTCTCAATTTACAAGAAGTCTGAGGAGAACTACATGCTCAAGATGAAGGCGTCC 
CGTGCTCTGCTGGCAGAGGTGAAAACCAAGTACGGAAACATGCCATTCAACATCCGCAGCTTCGAGGAGGAGACCAAGGCCCGCATGGGAGTTGT 
TGAGTGCGTCGGCCACAAGATGATTGAGCCCTTCCAAGTGCTGTACGAGAAGCCATGTAAGTGTGATGCATATTATTATTAATCCTATTCCCTAT 
TATGCGAGTTGGCAGAACTTAATTCCGGACCTGGTACACCTTCGGGTGCTAAGTGCGGCCAGACATTTTGCCAGAACAAATTCCAGGCATTGTCG 
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TCTTCAGCAGTTGCCTCAGTGTGGCCTCTGTCTGAACATGGCACTGTCACAATCGTATCCAATCTATTAACCTGTTTTCTTATACTTATTAAAGT 
TAATTTAGAGACTAAACTAGTTTGAGCAACCTTTATAAAGTTCGAATTTTAGCCGGAAGTAATAGCAAAGTTAAACAATCCTTTTCCTTATCTTG 
CATTACAGCCGAGATTGTGGCGCAGTTTAAGCACACGGTTCTGCTCATGCCTAACGGCGTCAACTTGGTCACCGGCATCCCATTCGAGGCGGAGA 
ACTATGTGAGCGAGTACAGTGTTGCGCAGGAGGAGCTCAAGGTAAGCTGCAACAATTTCCTTGTATTCACGATGCGTACTCAATGAAATCTCAAC 
TTTTTGCAGACTCTGCTCGCGCAGCCTTTGGGTCCTGTGAAGGGCAAGGGTAAGGGCAAGAAGGCAACAGCTGGGGCGGCGACAAAGGTGGAAAC 
GGCGCCGGCCGTGGAGACCAAGGCATAGACCAGCCCGCTGATGATGATCCGCACCGCCAAGCCATCAACGGAAACACAATGTGAACAATTGCGCT 
GCCCAACGCTGCGCTCCACAGATTTTTACTATCGAATTCGTTGCGTATTAGAGGACCCTTTTGACAACAGAACAGGACAGAAGAGAAGACGGCAA 
CAATTTGAGGATACATTTCCCCAGAAATCCTCCATCCATCAACAAGGCGGGCGGTCGGTCGGTCCCGCGCCAACTTTACCTCTTTATTTCCTTTA 
CTATAAGCTGCCTTCGTTTATCGGTCTGTTCAACATCATCGCAACGAAAAAGCAAAGCAAGAACTGTCATCAAATTGTAACAATTTTAACGCTAA 
ATGATCTTAAAATATAATTCAAGTGAAACGTTATTAACGCTGCGTAGTAGGTATTAAATAAAATTAACATTTTCTATAAAACAGCCGATAAATGC 
CAAACGATTTTTCATTTATTTACTTTCCGCTGGCGCCCAATTTTAATTCGATTTCGATACGCTTCTCATTCTAATAAATGCACTTGCGAGTTGTG 
TTTATTTTATACGTTTAATTTAGTTTTGATGTTCACATTCACATTATACAATTTGTAATTTAGATTTCTTGCCTTTTGTTATTTTAAATTTTACA 
GTCTCATCTTTGAACTCTTGTATTACGAAAGTTGCAAGAATAACTTCGTTATGTTAAACGTCACTTAGTGCTGTGCTCACTTGGCCACCCCAGTT 
GTCCATCCCAGATCCAATCCCAACAAGACCAGACCAATTCGATGCCGTATACGGCGACTTTGCCCAACTCGCTGACCTCTTCCCTTGCGTCAAAC 
AAAATAAAGAACAACAAAAAACGCAATTGCTGCGGATGAAGTATAGAAAACACGAGCAGCACTTGCAGACGACAAAGATATGTGGCCGGTGATCA 
AAAG AGG AT CT G GG AT T T AAT GG T CTGCC GTCGCT T AC AT AC ATGGT T T GGT GT AC TTTTTTTTTTTT T GT T AT G ATC 
(SEQ ID NO: 1159) 

Exon: 1001,. 1088 
Exon: 1245.. 1342 
Exon; 1634.. 1770 
Exon: 1833,. 1995 
Exon: 2653.. 2874 
Exon: 3156.. 3381 
Exon: 3714.. 3841 
Exon: 3905. .4018 
Start ATG: 1001 

Transcript No. : CT29664 

ATGGCAGATGTGGAAAAGGAGCCCGAGAAGACCATCGCCGAGGATTTGGTGGTGACCAAGTATAAGTTGGCCGGCGAAATCGTCAACAAAACCCT 

CAAGGCGGTAATTGGACTCTGCGTGGTTGATGCCTCCGTCCGGGAGATCTGCACCCAGGGCGACAATCAGCTCACCGAGGAGACCGGCAAAGTAT 

ACAAAAAGGAGAAAGACCTGAAGAAGGGCATTGCCTTTCCCACCTGTCTGTCCGTCAACAACTGTGTCTGCCACTTCTCGCCAGCCAAAAACGAT 

GCTGACTACACGTTAAAGGCCGGTGATGTGGTCAAAATCGATCTGGGTGCCCACATTGATGGTTTCATTGCCGTGGCCGCTCACACAATTGTGGT 

AGGCGCTGCTGCGGATCAGAAGATCAGTGGTCGCCAGGCCGATGTCATCCTCGCCGCCTACTGGGCTGTCCAGGCTGCCTTACGTCTGCTCAAGT 

CCGGCGCCAATAACTACTCCCTCACCGATGCAGTGCAACAAATCAGCGAGTCGTATAAGTGCAAGCCCATTGAGGGCATGCTCAGTCACGAGCTG 

AAGCAGTTCAAAATTGACGGCGAGAAGACGATCATACAGAACCCCAGCGAGGCGCAGCGCAAGGAGCATGAGAAGTGCACCTTCGAAACGTACGA 

GGTGTATGCCATCGATGTTATCGTCAGTACCGGCGAAGGAGTGGGACGCGAAAAGGACACCAAGGTCTCAATTTACAAGAAGTCTGAGGAGAACT 

AC A T GC T C AAGATGAA GGC GTCCCGTGCTCTGCT GGC AG AGGT G AAAAC C AAGT ACGG AAAC AT GC C AT T C AAC AT CC GC AGCTT C G AGG AGG AG 

ACCAAGGCCCGCATGGGAGTTGTTGAGTGCGTCGGCCACAAGATGATTGAGCCCTTCCAAGTGCTGTACGAGAAGCCATCCGAGATTGTGGCGCA 

GTTTAAGCACACGGTTCTGCTCATGCCTAACGGCGTCAACTTGGTCACCGGCATCCCATTCGAGGCGGAGAACTATGTGAGCGAGTACAGTGTTG 

CGCAGGAGGAGCTCAAGACTCTGCTCGCGCAGCCTTTGGGTCCTGTGAAGGGCAAGGGTAAGGGCAAGAAGGCAACAGCTGGGGCGGCGACAAAG 

GTGGAAACGGCGCCGGCCGTGGAGACCAAGGCATAG 

(SEQ ID NO: 1160) 

Start ATG: 1 

MADVEKEPEKTIAEDLVVTKYKLAGEIVNKTLKAVIGLCVVDASVREICTQGDNQLTEETGKVYKKEKDLKKGIAFPTCLSVNNCVCHFSPAKND 
ADYTLKAGDVVKIDLGAHIDGFIAVAAHTIVVGAAADQKISGRQADVILAAYWAVQAALRLLKSGANNYSLTDAVQQISESYKCKPIEGMLSHEL 
KQFKIDGEKTIIQNPSEAQRKEHEKCTFETYEVYAIDVIVSTGEGVGREKDTKVSIYKKSEENYMLKMKASRALLAEVKTKYGNMPFNIRSFEEE 
TKARMGVVECVGHKMIEPFQVLYEKPSEIVAQFKHTVLLMPNGVNLVTGIPFEAENYVSEYSVAQEELKTLLAQPLGPVKGKGKGKKATAGAATK 
VETAPAVETKA* 
(SEQ ID NO: 1161) 

Name: methionine aminopeptidase 
Classification: peptidase 



Celera Sequence No. : 142000013383791 

TCGACTATCCGCGCACCAGCTCAAATCGCGTCTGAGATTCCTTGAACTGAATTCTCTTCTTGTGCTGCACGTGGGCTTCACTGAGGAACAAAGTA 
GACGCCACCAGGGTTAACACAGTACCCACACACGCCAGGATAAAGGACCAGCCAAACCAATTGTTCGCATGCTCCGGCATCCAGCCATTCCGGTT 
GCCAAATCCAGCGAATACAATCACTGCAATGGCTGCACTTACTCCAGCACCGAGAAGCACATATCCCAACGATTTAATGAGCGTAATGAAGTGCT 
TTTGATCCGGACCAGCACACAGGATGAAGACCAGTACTCCGATGGCAGATACCAACATCCCGATGAAGGCCAAAGTGTAGAAGAACTGAGTGGCT 
ATCATGAAGGCGGGCAGCAGGAATCCACGGATCTCATCGTAACCCGTGGTAAATGGGTCATACACCCAACGGCAGCCCACGAAGAATCTCCGCTG 
GCTATCGTCGTTCACGTCCGGCAGCGACCGGAAGCAGTGCACCCACAATCCCAGGCGATCCAGCTTGGCGCCCGTGATGCGGTAATCACTGACCA 
ACCAACTGGGCGTTGCAAACGCAATCACGATGAAGGCAAAGGCGAACACGAAGACGCCAACGCCACAACTTCCGGATAGTGTGCGTCTCTTCATG 
TTCACTGTGTGACTGTTTTGCGTGTGGCTGGTGGGCAGGTGCTCCTGGGAATCCTCGCGTGCGAAGTGGGATGGTTTTTGCATCCTTTGAGTGGG 
AAATTCTATTGGAATTTCGCTGGTTTCCTGGGGAAATGTTTGTTTGGTTTGCTGTTTGGGTGTGGACTGCAGTGTGGCCGCACTGGTAATACCGC 
AGAATACCATGGGGCCACAAAATAAATACCAAATGCTACGGTACCAATAAATTTCAAAACAGTAAATTAAACAATTTAGATTATACATTTTTTAA 
ATTAAGAAATAAATGATTTTAAATTTAATTAATTTTTGATAGATGGCTAAGGTACTTCTTAAATCAAATAAAAAGCTTTTTATATAGAATATAAC 
ATATTTTGGTTATATTGTCGATAAAAACTTATACTTAAAAGGCCCAACCGTTGTAAATGCTAATGAGCGTCGTTTCATTCTGGAAGACATGCTCT 
CCTCAATCGTTCCCCTAATTAAGGGCTGCCACAGCCTTGTGGGCGGCATCATCCAAGTCACTGGCTGTCTGAATGGGCAGGCCCGAATTCTTCAG 
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GATCTCGCGAGCCTGATTTACATTGGTGCCCTCCAGTCGCACAACCAGTGGAACATTCAGTTGCAGCTTTTTGGATGCAGCCACAATGCCATTGG 
CAATGGTGGCACAATTGACAATGCCGCCAAAGACATTGACCAGGATTCCCTTGACTTTCGGGTCAGCAGTGAGGATCTCGAAGGCTTTGGCCACC 
TGATCCTCCCTGACGCCACCGCCGACGTCCAAAAAGTTGGCGGGCTCGCCTCCATTCAGCTTGATGATGTCCATGGTGGCCATGGCCAGGCCGGC 
ACCATTAACCAAGCAGCCAATGTTGCCATCCATGGCGACGTAGTTCAGATTGTACTTGGCCGCCTCCACTTCTCGGGGATCGGATTCCTCCTCGG 
TGACGTCCATGGAGAAAATATCCTTCTGACGGAACTGGGCATTGTCGTCAAAGTTGAGCTTAGCATCAACGGAGATAACTTCTCCCTTATCCGTC 
TCCGCCAGCGGGTTGATCTCGATCTGTACGGCATCCACAGCTTTGAAGAGAGTATAGAGTTTTTGAATTTCCTCAGCACAACGCTTTACAGAGTC 
TCCCTTGAACTCCAAGAACTTGGCCACCTCCAAGAGCGTCGACTCCGGTATGGGCTTGCCAATGTCTAGCGGAACAGTCTTAATTTTCTCAGGGG 
TTTCCTCGGCCACCGCCTCGATATCCATGCCACCGGCAGGTGAGGCAATCAGCACGGGTCCATTGTGCTCGCGGTCCAGCAAGATGCAGAGATAG 
GTCTCGCGGGTGATGTTGATACTGCGTGCCACCATGACCTTGTTGACCAAAATTCCCGATTTGGGCGTTTGTTTCGTTATCAGTCGGTTTCCAAT 
CATCTGCTGGGTAAGCGAAAGCACCTCGCTCTTGCTGGCAAAATGTTAAATAATTATTAGTATAAGCTGGTCTTATCACTTGTGTTTTTAAGGTA 
CACTCTGGCTTTCACAAACTTTTATCATTTTCAATATGGGCAGMTGATAAGATTAATCTTATATACAAATGATATTGTAGAACATAAATAATAT 
TAATTTTTTAGATAAAATGTGAATGTGAATAGCCGGAATAGACCTAATCTATAATACTTGGTGGTGATATGGACGCCGCCTTTGAATCCGTTGTC 
GAAAGTTCCCTTGCCACGCCCACCGGCGAGAATCTGCGCCTTCACCACGTATTCCGGGCACTCTGAAACACAGAAGCACAGCATTTAACACCAAT 
TAGCCCAGGGATTACATZVAACACCGGCATGGTTCCGTCCTCCACTTACCAAAAGTCTTGACGACCTCGGCATCTGCGTTGGAGTTATTCAAGACC 
TTGAATTGCTGTATGGCCACTCCATACTTCTGGAGGAGATCCTTGCTCTGGAACTCCAGCAGATTCAGGTTGCGCACGGGAACCTATCATCGAAA 
GATAAACAAAGCCAAGACAAAGGTCACTTCAACTTCTCTTTTCGAAGAGTTGATATATTATAGAGGAATGTGAGTGCGAGGAAACCTTATGAACG 
ATGTGACGGGCTGTGGTAACTGCTTTCAGTAGGAACGACATCTTGGCTGATCTGCGGTGGTAACTGAATACCTTGGCTAACTTGGTATTAATTTA 
TTTTCGTAACTTTGTCGGTAAATTTAGTTTAAAAACACAACCGAAAATCGCCGCCGTTCCACGATGCTTATCACTGCACAATGTGACCAAGAAGC 
ACGACTACTTTCCATGGTGATAAAAATATACCAAATGCCACTGCCATCTTGAACTAGTTCTCGCGAAGCAATGGTCTTACTGTTGCAGTCAGCTG 
AAAATCAGCTGTCGTTGTGTACTTTTTTTCTTCGGCTAATAGTCAGCGAAGAATGACGAATAACACAAATTGAATGAAAAACCTGAGTGAGACGG 
AAGCCGAAGTCACCATGCAAGAAAATGTAGTCCACGAATCATTGCCCCAGATCCCGGTGGCCACTTCGGTGAGCTGTTGCTTCGTTCTGGCTGTT 
CTGTACGTGGGTAGCCTCTACATCTGGAGCACCAAACACAACCGGGATCATCCAACCACGGTCAAGAGAAGATTCGCCAGTGTGTCCATGGTGAT 
GCTGGCAGCCCCGTTCTTCGTTTACTTCTTCTCATCGCCGGAGCTGCTCAGCCGGGTACCATTTCCCAAGCTGCTCGGTTTGCGCTTGGAGGGAT 
TGTGGCAGGCTGTTGTTATACCATACAGCCTGACGGTGCTGCTCTTCCTGGGACCCATCTTTGTCAACATGCAGAACGAGTCCGTGCGCTCCTAC 
TTCGGTAATAGTTAAAGCCCCTCTGCCTATCAGCCCCTTCTCACTGCTTGATTTGAATTGCAGATCTGGACTACTGGAGGGGATCATTTGGCAGC 
ATCATCTGGGTGCGCAACCATGTGATAGCACCGCTGAGCGAGGAGTTCGTGTTCCGAGCCTGCATGATGCCCCTGATCCTGCAGAGCTTTTCGCC 
CCTGGTCGCCGTTTTCATAACGCCGCTTTTCTTCGGAGTAGCCCACTTGCATCACATAGCCGAACGCCTGAGCTTAGGCGTGGAGTTGAGCACTG 
CCCTATTGATTGGACTGTTCCAGTTCATCTACACCACGCTATTTGGCTTCTATTCGGCCTTTCTATTTGCCCGTACAGGTCATGTGATGGCTCCC 
ATCTTGGTGCACGCGT 
(SEQ ID NO: 1162) 

Exon: 2911.. 2746 
Exon; 2648.. 2519 
Exon: 2437. .2338 
Exon: 2124.. 1001 

Start ATG: 2796 (Reverse strand: CAT) 
Transcript No. : CT29754 

TGGAACGGCGGCGATTTTCGGTTGTGTTTTTAAACTAAATTTACCGACAAAGTTACGAAAATAAATTAATACCAAGTTAGCCAAGGTATTCAGTT 
ACCACCGCAGATCAGCCAAGATGTCGTTCCTACTGAAAGCAGTTACCACAGCCCGTCACATCGTTCATAAGGTTCCCGTGCGCAACCTGAATCTG 
CTGGAGTTCCAGAGCAAGGATCTCCTCCAGAAGTATGGAGTGGCCATACAGCAATTCAAGGTCTTGAATAACTCCAAGGCAGATGCCGAGGTCGT 
CAAGACTTTTGAGTGCCCGGAATACGTGGTGAAGGCGCAGATTCTCGCCGGTGGGCGTGGCAAGGGAACTTTCGACAACGGATTCAAAGGCGGCG 
TCCATATCACCACCAACAAGAGCGAGGTGCTTTCGCTTACCCAGCAGATGATTGGAAACCGACTGATAACGAAACAAACGCCCAAATCGGGAATT 
TTGGTCAACAAGGTCATGGTGGCACGCAGTATCAACATCACCCGCGAGACCTATCTCTGCATCTTGCTGGACCGCGAGCACAATGGACCCGTGCT 
GATTGCCTCACCTGCCGGTGGCATGGATATCGAGGCGGTGGCCGAGGAAACCCCTGAGAAAATTAAGACTGTTCCGCTAGACATTGGCAAGCCCA 
TACCGGAGTCGACGCTCTTGGAGGTGGCCAAGTTCTTGGAGTTCAAGGGAGACTCTGTAAAGCGTTGTGCTGAGGAAATTCAAAAACTCTATACT 
CTCrTCAAAGCTGTGGATGCCGTACAGATCGAGATCAACCCGCTGGCGGAGACGGATAAGGGAGAAGTTATCTCCGTTGATGCTAAGCTCAACTT 
TGACGACAATGCCCAGTTCCGTCAGAAGGATATTTTCTCCATGGACGTCACCGAGGAGGAATCCGATCCCCGAGAAGTGGAGGCGGCCAAGTACA 
ATCTGAACTACGTCGCCATGGATGGCAACATTGGCTGCTTGGTTAATGGTGCCGGCCTGGCCATGGCCACCATGGACATCATCAAGCTGAATGGA 
GGCGAGCCCGCCAACTTTTTGGACGTCGGCGGTGGCGTCAGGGAGGATCAGGTGGCCAAAGCCTTCGAGATCCTCACTGCTGACCCGAAAGTCAA 
GGGAATCCTGGTCAATGTCTTTGGCGGCATTGTCAATTGTGCCACCATTGCCAATGGCATTGTGGCTGCATCCAAAAAGCTGCAACTGAATGTTC 
CACTGGTTGTGCGACTGGAGGGCACCAATGTAAATCAGGCTCGCGAGATCCTGAAGAATTCGGGCCTGCCCATTCAGACAGCCAGTGACTTGGAT 
GATGCCGCCCACAAGGCTGTGGCAGCCCTTAATTAGGGGAACGATTGAGGAGAGCATGTCTTCCAGAATGAAACGACGCTCATTAGCATTTACAA 
CGGTTGGGCCTTTTAAGTATAAGTTTTTATCGACAATATAACCAAAATATGTTATATTCTATATAAAAAGCTTTTTATTTGATTTAAGAAGTACC 
(SEQ ID NO: 1163) 

Start ATG: 116 (Reverse strand: CAT) 

MSFLLKAVTTARHIVHKVPVRNLNLLEFQSKDLLQKYGVAIQQFKVLNNSKADAEVVKTFECPEYVVKAQILAGGRGKGTFDNGFKGGVHITTNK 

SEVLSLTQQMIGWRLITKQTPKSGILVMKVMVARSINITRETYLCILLDREHNGPVLIASPAGGMDIEAVAEETPEKIKTVPLDIGKPIPESTLL 

EVAKFLEFKGDSVKRCAEEIQKLYTLFKAVDAVQIEINPLAETDKGEVISVDAKLNFDDNAQFRQKDIFSMDVTEEESDPREVEAAKYWLWYVAM 

DGNIGCLVNGAGLAMATMDIIKLNGGEPANFLDVGGGVREDQVAKAFEILTADPKVKGILVNVFGGIVNCATIANGIVAASKKLQLWVPLVVRLE 

GTNVNQAREILKN SGLP I QT AS DLDDAAHKAV AALN * 

{ SEQ ID NO: 1164) 

Name: PROBABLE SUCCINYL-COA LIGASE LIKE 
Classification: enzyme 
Gene Symbol: Sucb 
FlyBase ID: FBgn0029118 
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Celera Sequence No. : 142000013383791 

TTTCGCTTACCTTTCTCTGGGTCACTTAATTAAGGCACTAGCTTAGCTGAAGTTTATGCAATTCAATGATCTCTATGGTTAGTCGCAGGTCAGTC 

GTTCAGGTATGTTTATAACAATAATTTGTTTTCAGGCACTACGAACTACACCATAATAGTTCTATCTAATTAAGGCACGAGTGTAATATTTCGAT 

GCTAGTTCCGACTAGCCTGCGACTTTTGCTAAAGAAAAAAATAAACGTACAATTGCGGAGGGGTATGCTCTTTGTGTCTGGGCGTTAATTGAGCG 

AAAATTGATATGCAAATACTGTAGAATACTCTACATGAACATGAGTCATCTAGTTGAGCGTCTTCGTTTGATGTTAGCTGTGCTTGGTTCTGGTT 

TTCATTCGGAAAAACATTCAGTCTTCTCTTGGAACTGATCTGGTCGATTGGATGGATGATTATAAGTACCAAAAACTGAAAGCTTGTGAATCGGT 

GCGGCCGTCAATTGCATAAACTTTTTAAATTAAAAACTACGAGTACAGTCGAACTTCTCTGAACTAAAATTAATTTCTTTAGGTATGTTAGAGAG 

AAGCGATGAATATATGGCGAAAATTATTTTTATCCAAAATAACTTTGGTTCGTTAAAATTTTTCGAAATCTCCGCCAGTTGGGGAGTTATTTAAA 

CGATTTAAGGTAATATAAAACGAATGTTACCGACATGAATACTTTAACCTATGCACAGGGAAACCATTGAATAAAATATAAAACGAACATAAACC 

AACTTTAAAAACGTGCACTACCAGAAACTGTTAGTTAGAATATTTCGCCTTAGAGAAGTTCGACTGTAATGTTTAGGGTGAGAATTCTTTGAGGG 

GACGGAATGTTTTTGTTTTGTTTTCCTCTTGGTTGGCTTGTGTTTTCCTTGGTGCAAGTGTGCCAGCTGCCACGGCTAAAATAGACAATGCAAGA 

AGGTTAAGCGACCGACCCTGACTTGGATATAGTCAAGTACTTACCATCGACTATCCGCGCACCAGCTCAAATCGCGTCTGAGATTCCTTGAACTG 

AATTCTCTTCTTGTGCTGCACGTGGGCTTCACTGAGGAACAAAGTAGACGCCACCAGGGTTAACACAGTACCCACACACGCCAGGATAAAGGACC 

AGCCAAACCAATTGTTCGCATGCTCCGGCATCCAGCCATTCCGGTTGCCAAATCCAGCGAATACAATCACTGCAATGGCTGCACTTACTCCAGCA 

CCGAGAAGCACATATCCCAACGATTTAATGAGCGTAATGAAGTGCTTTTGATCCGGACCAGCACACAGGATGAAGACCAGTACTCCGATGGCAGA 

TACCAACATCCCGATGAAGGCCAAAGTGTAGAAGAACTGAGTGGCTATCATGAAGGCGGGCAGCAGGAATCCACGGATCTCATCGTAACCCGTGG 

TAAATGGGTCATACACCCAACGGCAGCCCACGAAGAATCTCCGCTGGCTATCGTCGTTCACGTCCGGCAGCGACCGGAAGCAGTGCACCCACAAT 

CCCAGGCGATCCAGCTTGGCGCCCGTGATGCGGTAATCACTGACCAACCAACTGGGCGTTGCAAACGCAATCACGATGAAGGCAAAGGCGAACAC 

GAAGACGCCAACGCCACAACTTCCGGATAGTGTGCGTCTCTTCATGTTCACTGTGTGACTGTTTTGCGTGTGGCTGGTGGGCAGGTGCTCCTGGG 

AATCCTCGCGTGCGAAGTGGGATGGTTTTTGCATCCTTTGAGTGGGAAATTCTATTGGAATTTCGCTGGTTTCCTGGGGAAATGTTTGTTTGGTT 

TGCTGTTTGGGTGTGGACTGCAGTGTGGCCGCACTGGTAATACCGCAGAATACCATGGGGCCACAAAATAAATACCAAATGCTACGGTACCAATA 

AATTTCAAAACAGTAAATTAAACAATTTAGATTATACATTTTTTAAATTAAGAAATAAATGATTTTAAATTTAATTAATTTTTGATAGATGGCTA 

AGGTACTTCTTAAATCAAATAAAAAGCTTTTTATATAGAATATAACATATTTTGGTTATATTGTCGATAAAAACTTATACTTAAAAGGCCCAACC 

GTTGTAAATGCTAATGAGCGTCGTTTCATTCTGGAAGACATGCTCTCCTCAATCGTTCCCCTAATTAAGGGCTGCCACAGCCTTGTGGGCGGCAT 

CATCCAAGTCACTGGCTGTCTGAATGGGCAGGCCCGAATTCTTCAGGATCTCGCGAGCCTGATTTACATTGGTGCCCTCCAGTCGCACAACCAGT 

GGAACATTCAGTTGCAGCTTTTTGGATGCAGCCACAATGCCATTGGCAATGGTGGCACAATTGACAATGCCGCCAAAGACATTGACCAGGATTCC 

CTTGACTTTCGGGTCAGCAGTGAGGATCTCGAAGGCTTTGGCCACCTGATCCTCCCTGACGCCACCGCCGACGTCCAAAAAGTTGGCGGGCTCGC 

CTCCATTCAGCTTGATGATGTCCATGGTGGCCATGGCCAGGCCGGCACCATTAACCAAGCAGCCAATGTTGCCATCCATGGCGACGTAGTTCAGA 

TTGTACTTGGCCGCCTCCACTTCTCGGGGATCGGATTCCTCCTCGGTGACGTCCATGGAGAAAATATCCTTCTGACGGAACTGGGCATTGTCGTC 

AAAGTTGAGCTTAGCATCAACGGAGATAACTTCTCCCTTATCCGTCTCCGCCAGCGGGTTGATCTCGATCTGTACGGCATCCACAGCTTTGAAGA 

GAGTATAGAGTTTTTGAATTTCCTCAGCACAACGCTTTACAGAGTCTCCCTTGAACTCCAAGAACTTGGCCACCTCCAAGAGCGTCGACTCCGGT 

ATGGGCTTGCC 

(SEQ ID NO: 1165) 

Exon: 1861.. 1001 

Start ATG: 1861 (Reverse strand: CAT) 
Transcript No. : CT297 60 

ATGGTATTCTGCGGTATTACCAGTGCGGCCACACTGCAGTCCACACCCAAACAGCAAACCAAACA7AACATTTCCCCAGGAAACCAGCGAAATTCC 
AATAGAATTTCCCACTCAAAGGATGC7A7AAAACCATCCCACTTCGCACGCGAGGATTCCCAGGAGCACCTGCCCACCAGCCACACGCAAAACAGTC 
ACACAGTGAACATGAAGAGACGCACACTATCCGGAAGTTGTGGCGTTGGCGTCTTCGTGTTCGCCTTTGCCTTCATCGTGATTGCGTTTGCAACG 
CCCAGTTGGTTGGTCAGTGATTACCGCATCACGGGCGCCAAGCTGGATCGCCTGGGATTGTGGGTGCACTGCTTCCGGTCGCTGCCGGACGTGAA 
CGACGATAGCCAGCGGAGATTCTTCGTGGGCTGCCGTTGGGTGTATGACCCATTTACCACGGGTTACGATGAGATCCGTGGATTCCTGCTGCCCG 
CCTTCATGATAGCCACTCAGTTCTTCTACACTTTGGCCTTCATCGGGATGTTGGTATCTGCCATCGGAGTACTGGTCTTCATCCTGTGTGCTGGT 
CCGGATCAAAAGCACTTCATTACGCTCATTAAATCGTTGGGATATGTGCTTCTCGGTGCTGGAGTAAGTGCAGCCATTGCAGTGATTGTATTCGC 
TGGATTTGGCAACCGGAATGGCTGGATGCCGGAGCATGCGAACAATTGGTTTGGCTGGTCCTTTATCCTGGCGTGTGTGGGTACTGTGTTAACCC 
TGGTGGCGTCTACTTTGTTCCTCAGTGAAGCCCACGTGCAGCACAAGAAGAGAATTCAGTTCAAGGAATCTCAGACGCGATTTGAGCTGGTGCGC 
GGATAG 

{SEQ ID NO: 1166) 

Start ATG: 1 (Reverse strand: CAT) 

iyiVFCGITSAATLQSTPKQQTKQTFPQETSEIPIEFPTQRMQKPSHFAREDSQEHLPTSHTQNSHTVNMKRRTLSGSCGVGVFVFAFAFIVIAFAT 
PSWLVSDYRITGAKLDRLGLWVHCFRSLPDVNDDSQRRFFVGCRWVYDPFTTGYDEIRGFLLPAFMIATQFFYTLAFIGMLVSAIGVLVFILCAG 
PDQKHFrTLIKSLGYVLLGAGVSAAIAVIVFAGFGNRNGWMPEHANNWFGWSFILACVGTVLTLVASTLFLSEAHVQHKKRIQFKESQTRFELVR 
G* 

(SEQ ID NO: 1167) 



Celera Sequence No. : 142000013383795 

CCTGAACTGCAGCTGAGCTCCAAGGAACTGTGCTGCGGAACGCATGC/AACCAACACCAGTGAGCTTTCGTGCTTCTGCATCGTGAATCTCAAGCA 
AACCAATCGTGCGAGATTCGCATTTACCGCCGTTGCTGGCCAGGCGGCGGAAAATGTGAGTGGACTTATATATCGCCTTAGACTAGTCATTTAAT 
CTTCTTACTCTGTTCCAAATAGGTGCTGAAAACGGCTGCCCTGCTGCGACACCGCGTTGATCTCCTCGAGAAGCAGTTCCAAACTGACAAACTCA 
CAAACGCTACGGAGGCGGAACTGCAGACGATTCGGCACAACATGTTGCACACGGACATTAAACTGCCATACGCTTTTAAGATGGACACGCTGGAA 
CGCATCACCGAGATGCTGAAGAGAATAAAAGATTCATCGCGCACAACCTTAAAGTAAGTTTAACTTATT7AAAGTAATGTGCTCCATTTCTAAACT 
CTACATTTTCAGAGAATTTGTGGACGTAGAGATGCGCACGCTGCTGCAGGAGAAGCCGTTGGACACACACCCCTTCATTCTGCACTACATCACCA 
GTTCCGCGCTCGTGGAGGAGATTCCTTTGCAACGAGCCACCAAGCTGTGCCCGGATCGGCCCATCCTCGTAATCAGCGTGTGTGATAGCGTGGTC 
AAGGCTCGCTGTTGTGTACCAGAAAAGTGTATCACGGAGAAATTC7AACGCTTCCGCCTGGCTGCAATCCTTTGCCGACACCTTTAATGGACAAAT 
AGCCGCGCCCAAGGGCCAAAATCCGCAGGCCGTGTGCAACATGAAGGGTCGACGGGTGAGCAATCTGTTCGAGGAACAGTTGGAGCAGGCAATGT 
CCAAGGCACATGCATACGCTAAGCTTTATCTATAGGTATAAGTTGTGATTACCCTCCAAAATGTGCTAGCTGTAAGCTTGCCCTTTGAATAATTG 
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CTCATGTGAGGAGCGAGTAAATGGCGAAGTTTTAAGGAATTTCAGAATCTTTAATCACTAATCACTACCAGGTCATCCTCCTCCTCATCTTCTGC 

GTTATCCTCTTGATCCTGGGGCACCAAGCTGGAGTTCATCAACTCTTCAAAGACATCCGCATTATCGGAACTTTGATTGAAGAAGTAACTTATGT 

TATCCAAGGGCTTCACTGGCGTTGGAATCTTTTCGCTGATGCGATGCTGCATCTTGAATCGCTCCACTGGCGTCCTGGCCAGCTTCTGCAGATCT 

CCCTTGACCAGCAGATCAAAACTGTCCTCCAAGGAGAATCGCTTACTCTGCGGTAGTTGATCCACTTGAGAGAGATCTCTTTTCGTTTCCACTTT 

GGGCGATTCATTCACATTTCCTAGACTCATCTGCGCCAGACGTAAGCTCAAATCTTCCGCCATGGGTTCGTAATGAAGCTGGTGACCATCGTGGT 

TGGTCAGTGCTGGCTTGGCATTCGGATTCGAGATAATGCCTTTAACTATGTCCGACATATCGAACTCTTCGGCATCGCAATCGCTTTCCAGATCG 

GAGGGCAGGAACTTGGTGATGGGGGTGGAGCATTGCTGACGTTGCGGAGCGGGAGCCTTGACCGGAGTACCTTCCTTCTTGCGCCCTAGGAATTG 

GTGCAGTAATGGCTGGTTCTCTTCAGGCTGAGCCTTTATCTTTCTGACCACTCGCTTTGGTTTCGAATTCGGCTCGTTTTCCTTATTCTCCTCTT 

CAGATGCGGTTTTCTTCTTTCGCGTGGTTTTCTTGGCGGGCTTCTCCTTCGACTTCAAAAAGGCCGCTACTAAATCGGGATACGCTGTCTCCAGC 

ATGTCCAATGGCTCAATGGTGTAGTAAAGTTCCTCGATTCCCTTGGGATGCTCCGCTTCGTACTCCGCAATCTGCTTGTCCGGAATAAGACCCTT 

GAAGATGCCACTGGGATCGTGCCAGCGCAATTCTAAGCTGGGCACACCCTTCACAGTTCGCTTTTTGATAATCTCGTGTGGCTGGATAAGTATCT 

TCTCCTGCTTCGATTGCTGCACCTGCCAGCGTGTGAGTATGGGAAAGAACTTTTGGAAGCAGTATATTTCTGGCCACTGCAAGAGATGTCCGATT 

TGTTTGATGAATTTGACAAGATTTGGTTGCCGCCAGTTGAGATTTAGATTGGGAATGGTGTCGGGTTCGCTGAGAAACTCGGCAATGATCTCTTC 

GTTGGGAAAGTCAGGAGATAATAGGGCTTTCCGCCTCAAAGTCAACTCGGACTTAATGGATAGTCGTTGTTCTCTAAGGGAAGAGTTTAATAAAC 

AAATGTTTATATTAGTTTTAGGAAATAGCTCACTTCCAGAGACTCTCATCGCAGCCTTTGTGTGTGCGACACACACTGCAGCCACTTTTTGTGTG 

GCTCTGGGTCTTGCCAATGTGGCCACAATTGCTGCAGATGGATTTGTCGTCCACTCGGATCTCCAAGGCATTGTATTTATCGGTTTCACCACGCC 

AGCTTCTCATTCTGCAGAAATAATAATCAAAGCTTATTAGCTTTAAAAATATAATATATGTAACAATGTAGTAGACTACCTATCCAAAATCTCAG 

TCTCCTTGTACTTGTTGAACAGCTTTAGGACGCCATCCTTGCCAATGCCACCTATGCCATCAGGACAATAATCACAGCCGCAGAGCAAAGCCATC 

ACAATGATCTTCTGTTGGCCAAAGTCCATTCGGGATGTGATCTCCCGCATGTCGTAGATGTCCACAGCTCCGCCGGCGGCTGCCTGGGCTCCTTG 

GGTGGAAACGGAGAAGTTGCGATAGACACGAACGGCTCCATAGGCAAAGCAATCCGAGTCCTGGCTAATGACGCCGTCCACTAGCTGCGGACAGG 

AATATAATCGGATTTGGTACAGAGTTCTTGTGAGGATGTAGTTGATATACGTACGCCATGCTTGTTGAGAAAGGCACAGTAGGCCTCCGCCTCGC 

CGGGACCTTGGACGCACTGGATTCCCATGGAGAGCAGCAGCGTTTCGCACTGCTTGAGAACATGGTTGAACCGGCTGCGTCCCTTGTCGCCCTTG 

GACGGCTGACTTTGGGTGCACTCAGGCGAATTCTTGGGCTTCACTCCCCGAAACTGGAGCTCATTCCTTTTGGCTATCACCTGGCTCTTGAGTTT 

TGGAGCAACTCCCTCCAAAACGAAAACCGGAGTAACTTGCTCCCAAATCAAGTAGCAAGTGCGAAAGAAGAGGTTCCTGTTGGTAAAGCGTAAAT 

ATTAATATTTTAAAAGAATATAAGGTATTAAGAATTTTTTAAACCTACTTTAAATGATGCCTTGGATGCACAAAGTAGTCCACAACATTAAGAGA 

TTCGCACACCCAGCCGGCCAGGTCGATGGCCACTTTCTTTCCCCGCAGTTCATTAATGGGTTTACGTTCACAATGGGGCGTTAAAACACCCCATA 

ATTCCTTGACGCCCATTTAAATAAGCTAAACAAAAACCCCGCGAGAAATTCGCAACAAATTTAAACAATCGCTTCTTCGTCAGAGTTGCCCAATC 

AGTTACTTTCTAGCTACTTGAAGCTATATTCAAAAGCAGCCAAGTGGAAAGTTGTGACAATAATTGGCAATGATAACTTTTTTAACAATAATACA 

TGATGCTATTTTCAATAACAAATATTATACATATAAATGTCATCTTATTCGATTTTAAGAACACATTTCGTTTTTAGCTTTTTTTGAGCTTGTCT 

CTAGCTACATTTGACAGCACTGCGACTTCCGCCAGTCACTGCGAAGCATGCTCGATAACCGATAGCGGGCTTTGTTTTGACAGCTGCACAAGCGA 

CCGCTGCGAATGGCCAGTGGTCAGCAGTCTCTTCTTCTTCCACGTCGACGTGTCGTGTCAGCCAAAAGAAAGTCGAAATAAAATATTGAATTGAA 

GCTGCCCCTTTGCGAGTAGGCCCGACAAAATTCAGAAGCTGCCAGTCGTCCGTACGCCAAACCACACACAACCACCACCGGCAAGATGGTTCTCG 

CGGATTTGGGTCGCAAGATAACAACGGCGCTGCACTCGCTCAGCAAGGCGACCGTAATCAATGAGGAGGCACTGAATTCCATGCTGAAGGAGATC 

TGCGCAGCGCTCCTGGAGGCTGATGTGAATATCCGACTGGTTAAGCAGCTGCGCGAGAATGTGCGTGCCGTTATTGATTTCGATGAGATGGCCGG 

TGGCCTGAACAAGAGGCGAATGATCCAATCGGCTGTGTTCAAGGAGCTCGTCAAACTGGTCGATCCCGGCGTGAAACCCTACCAACCCATCAAGG 

GCAAGGCCAATGTGGTCATGTTTGTGGGTCTGCAGGGCTCCGGTAAGACCACCACCTGTACCAAGCTGGCCTATCACTACCAGAAGCGCAACTGG 

AAGTCGTGTCTCGTGTGCGCGGATACCTTCCGTGCTGGTGCCTACGATCAGGTCAAGCAGAACGCC 

{SEQ ID NO: 1168) 

Exon: 3436. .3279 
Exon: 3211. .2905 
Exon: 2839. .2550 
Exon; 2481.. 2314 
Exon: 2258.. 1001 

Start ATG: 3436 (Reverse strand: CAT) 
Transcript No. : CT29886 

ATGGGCGTCAAGGAATTATGGGGTGTTTTAACGCCCCATTGTGAACGTAAACCCATTAATGAACTGCGGGGAAAGAAAGTGGCCATCGACCTGGC 
CGGCTGGGTGTGCGAATCTCTTAATGTTGTGGACTACTTTGTGCATCCAAGGCATCATTTAAAGAACCTCTTCTTTCGCACTTGCTACTTGATTT 
GGGAGCAAGTTACTCCGGTTTTCGTTTTGGAGGGAGTTGCTCCAAAACTCAAGAGCCAGGTGATAGCCAAAAGGAATGAGCTCCAGTTTCGGGGA 
GTGAAGCCCAAGAATTCGCCTGAGTGCACCCAAAGTGAGCCGTCCAAGGGCGACAAGGGACGCAGCCGGTTCAACCATGTTCTCAAGCAGTGCGA 
AACGCTGCTGCTCTCCATGGGAATCCAGTGCGTCCAAGGTCCCGGCGAGGCGGAGGCCTACTGTGCCTTTCTCAACAAGCATGGCCTAGTGGACG 
GCGTCATTAGCCAGGACTCGGATTGCTTTGCCTATGGAGCCGTTCGTGTCTATCGCAACTTCTCCGTTTCCACCCAAGGAGCCCAGGCAGCCGCC 
GGCGGAGCTGTGGACATCTACGACATGCGGGAGATCACATCCCGAATGGACTTTGGCCAACAGAAGATCATTGTGATGGCTTTGCTCTGCGGCTG 
TGATTATTGTCCTGATGGCATAGGTGGCATTGGCAAGGATGGCGTCCTAAAGCTGTTCAACAAGTACAAGGAGACTGAGATTTTGGATAGAATGA 
GAAGCTGGCGTGGTGAAACCGATAAATACAATGCCTTGGAGATCCGAGTGGACGACAAATCCATCTGCAGCAATTGTGGCCACATTGGCAAGACC 
CAGAGCCACACAAAAAGTGGCTGCAGTGTGTGTCGCACACACAAAGGCTGCGATGAGAGTCTCTGGAAAGAACAACGACTATCCATTAAGTCCGA 
GTTGACTTTGAGGCGGAAAGCCCTATTATCTCCTGACTTTCCCAACGAAGAGATCATTGCCGAGTTTCTCAGCGAACCCGACACCATTCCCAATC 
TAAATCTCAACTGGCGGCAACCAAATCTTGTCAAATTCATCAAACAAATCGGACATCTCTTGCAGTGGCCAGAAATATACTGCTTCCAAAAGTTC 
TTTCCCATACTCACACGCTGGCAGGTGCAGCAATCGAAGCAGGAGAAGATACTTATCCAGCCACACGAGATTATCAAAAAGCGAACTGTGAAGGG 
TGTGCCCAGCTTAGAATTGCGCTGGCACGATCCCAGTGGCATCTTCAAGGGTCTTATTCCGGACAAGCAGATTGCGGAGTACGAAGCGGAGCATC 
CCAAGGGAATCGAGGAACTTTACTACACCATTGAGCCATTGGACATGCTGGAGACAGCGTATCCCGATTTAGTAGCGGCCTTTTTGAAGTCGAAG 
GAGAAGCCCGCCAAGAAAACCACGCGAAAGAAGAAAACCGCATCTGAAGAGGAGAATAAGGAAAACGAGCCGAATTCGAAACCAAAGCGAGTGGT 
CAGAAAGATAAAGGCTCAGCCTGAAGAGAACCAGCCATTACTGCACCAATTCCTAGGGCGCAAGAAGGAAGGTACTCCGGTCAAGGCTCCCGCTC 
CGCAACGTCAGCAATGCTCCACCCCCATCACCAAGTTCCTGCCCTCCGATCTGGAAAGCGATTGCGATGCCGAAGAGTTCGATATGTCGGACATA 
GTTAAAGGCATTATCTCGAATCCGAATGCCAAGCCAGCACTGACCAACCACGATGGTCACCAGCTTCATTACGAACCCATGGCGGAAGATTTGAG 
CTTACGTCTGGCGCAGATGAGTCTAGGAAATGTGAATGAATCGCCCAAAGTGGAAACGAAAAGAGATCTCTCTCAAGTGGATCAACTACCGCAGA 
GTAAGCGATTCTCCTTGGAGGACAGTTTTGATCTGCTGGTCAAGGGAGATCTGCAGAAGCTGGCCAGGACGCCAGTGGAGCGATTCAAGATGCAG 
CATCGCATCAGCGAAAAGATTCCAACGCCAGTGAAGCCCTTGGATAACATAAGTTACTTCTTCAATCAAAGTTCCGATAATGCGGATGTCTTTGA 
AGAGTTGATGAACTCCAGCTTGGTGCCCCAGGATCAAGAGGATAACGCAGAAGATGAGGAGGAGGATGACCTGGTAGTGATTAGTGATTAA 
(SEQ ID NO: 1169) 
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Start ATG: 1 (Reverse strand: CAT) 



MGVKELWGVLTPHCERKPIKELRGKKVAIDLAGWVCESLNVVDYFVHPRHHLKNLFFRTCYLIWEQVTPVFVLEGVAPKLKSQVIAKRNELQFRG 
VKPKNSPECTQSQPSKGDKGRSRFMHVLKQCETLLLSMGIQCVQGPGEAEAYCAFLNKHGLVDGVISQDSDCFAYGAVRVYRNFSVSTQGAQAAA 
GGAVDIYDMREITSRMDFGQQKIIVMALLCGCDYCPDGIGGIGKDGVLKLFNKYKETEILDRMRSWRGETDKYNALEIRVDDKSICSNCGHIGKT 
QSHTKSGCSVCRTHKGCDESLWKEQRLSIKSELTLRRKALLSPDFPNEEIIAEFLSEPDTIPNLNLNWRQPNLVKFIKQIGHLLQWPEIYCFQKF 
FPILTRWQVQQSKQEKILIQPHEIIKKRTVKGVPSLELRWHDPSGIFKGLIPDKQIAEYEAEHPKGIEELYYTIEPLDMLETAYPDLVAAFLKSK 
EKPAKKTTRKKKTASEEENKENEPNSKPKRVVRKIKAQPEENQPLLHQFLGRKKEGTPVKAPAPQRQQCSTPITKFLPSDLESDCDAEEFDMSDI 
VKGIISNPNAKPALTNHDGHQLHYEPMAEDLSLRLAQMSLGNVNESPKVETKRDLSQVDQLPQSKRFSLEDSFDLLVKGDLQKLARTPVERFKMQ 
HRISEKIPTPVKPLDNISYFFNQSSDNADVFEELMNSSLVPQDQEDNAEDEEEDDLVVISD* 
(SEQ ID NO: 1170) 

Name: FEN 1-1 ike 

Classification : DNA_repair_protein 
Gene Symbol: BcDNA:GMl0765 
FlyBase ID: FBgn0027914 



Celera Sequence No. : 142000013384133 

TAAAAGCTGCCATTGGCAATCCGGTTGCAAACCCGCCCAATGGTTGCTCCAAAATACGGATTTCTCTCGCTCTGATAGCCCGCCAAATCGTCAAA 
TCCGAGGGTCACGTCATCTATTTGACCGCTGGCATCAGGCGTCTTTATGCTGGTAATGGTGGCTCCACGGGTGATCAATTGCACGGACATTCCAG 
CTCCATTTGTCAGGGTGAACCGCTTGATGGTGTCCTTTGACTTGGTGAACGGATTCACCGCTCCGTTGGCGAAGACGTCTTCCGTGACGTTGACC 
ATTTTGGATCCGGCTGCAGCTGTGGAATGATTACGAGTGCCAATTGCACTGCGTTTTTATCTGACCCTCTCCAAGCCGTAAAGGTTAATCAGCAC 
TCGATGTGCGTATGTGTTTGTTTTAATTATCGCAAAAAAGCCGCAAACAAAATGTACATATGTACGTACATATATAAACAGATCTAGTTATCACA 
AATTGGTGGGAAAGTCTGCGATAAGGGGGATAGACACCTTGTGGAGTCTTTAAGAATTGTGCAATGTTCGTATTCCAAATTTAGCGGTAATTCAG 
AAAAT T CT T ACT T AAAT TAT CT T T T AAT CGC CGT AGT T GC AATT C AAAT T T T AG AGT TG C CT T AC CT AT TTT AG ACC AG C TG T T TT T AC A TG T GC 
GTAGTAAGTAAAAATTCGCAGGGGCTTGTTTTAGTAGTTAAAAATTTGTTGGTAAAATTTTTTGATATTTGCTTATTAAGCCTTTTAAATAGAAC 
AATTTTATTAACATACTTGAATACAATATACATTTTTTTATATATATGTTTAAATGCCCATCTTTGCTACACAGTGGTGGCAGTTGCAAAGATCT 
GGCAGCCCTGCTCTAGAAATACCACACAGCAAATAGTCCACCTCTAAGCAACAGGTCGGTTGTTTATTGAATTCGTTTATTTAACTTTATGGGTA 
GT T T GGC AAAT AG AAAGCTAGC TAAC CGGT G C AG T T AACCC T T T T T GT T G AC AG AG AGC T C AG AGG AAACGGC AG AT T AGAT CC CC C TG AGTT C A 
TCCCCACCCAGGCACCACTAATCCGACGAGTATAAATCCCAAAATGCAATTGACGTGGCAGAGAGATCAGATAAATCTAGTACTTCCGATTGCTC 
TGATTTGTTGTTGTCTTTTGATCGACGTAACCAGCGCTCAGGAGGCCCAACAACCATGGTACGAGAATCTGCCAGCGGTGGCCATGGACTATAAG 
GTGTGTGCTGATTTTAAGACTCCTCTAGAGCATCAAATTTCACCGGGAACGTGGTTTCTCTTTCAGGTTCACATAGATGCTGGTAAGGAGGACTG 
CTACCATCAGTACGTTAAGGCAGGAGCCACCTTCTACGTGTCCTTTAGTGTAAGTTGATAGAATTTGGGATCAATAGCATGTTTCCGATCCAATG 
GATGGGTTTCCCCTTGAGCAGTCTGTGATATTCTAAAATAAATCAAATCGAATCCATTAATACAGTAATATAATCTATCTAATACAATGAAATTA 
TTCATATAAATTTTTTTTAGGTGGTGCGCGGAGGCGATGGAATGGCTGGTTTCGCCGTCCGCAATCCTGCTGGTGAAGTGGTGAAGCCCTACCAG 
TGGCAGGCTACTGCGGACTACACGGATCAGGTATCGCCCGGTGGCTACTACTCCGTCTGCATTGACAACCAGTTCTCCCGATTCGCTGGCAAGCT 
GGTCAACATCTATATCACGGTGGTGAAGTACGATGCATGGGACAAGTACGCCAAGGAGATCGAGCAGCTGCAGCTGAACATGCAAAACTTCACTG 
TMGCCAAACTGGTTTTCAGACACACCCAAACATCCATTTACAAATATTCACATTGATCTATTTCTTTAGGCCACCGTTGGCACTGTGGAGCGCA 
ACATCAACGATATGATGGGCTATCAAGCCCACAGTCGTCATCGCGAATCGCGTGATTATGCGTTGCTCCTGGACAACAATGCCTATATTCAAACC 
TTCTCGATTAGCCAAATTGTGGTCATATTGATCACGTGCTCTGTACAGGCGAGTATAGACATGTTTTTTGCTTTTCCTAAATTAATAACCTAATA 
TTTGCAGGTATTCTTTGTGCGCAAACTATTTGAAGTAAAATCGAGCTCCAAAAGTCGCATCTAAGGCGCAAAGATACTTTTCAAAGACTTCGCAA 
CATGTTTCTAAAATTCCATCCGCATTTCCAAGACATTCATTCTATTCTACCTGCTGAATGAGTTTTTGTAACTTATTTTATCACTGGCTAATTGA 
TCAAGCGCAACTGAGCAGACTTTTTTATAATTGGGACTACTTATGTCATACGAGTTATAGACAGCAAAACTTATTGTGATAAGTGTACATTTGTG 
TGTTTTGTTTAATTAAAAGATTGAAGATTTTTGTATTTGCTTTGTTTTATTTGGGAAAGAACCTTAAGGATTTATTCGAAGCCATTTGGGTTTGT 
TAATTTACTAAAAGCTCTACAAGCTTTAAAGTTAAGCTGCTTTGGTAAGCTCACTTAACCGTTAAATTTCATTATCCTGGAAAGGATTTTAAACT 
TAAAAATATATAATTCAGGTTGTTGTTTTAAGAAACCTGTTTTTCTTTTTATATGGGAGTAAGTTATTTTCAAATTACGACATTGTTGCAAAGAC 
ATCTTCTCGTTGCCTAAAACCAATCTTAAATATTAAATACATGCTGTACATGAGCTCATTAGTTTAAATTGAGAGCTGAAGACATTTCGAAGACA 
TTCCACAACTTGGGCTGGCAAATTAATCACCCCGAATGCCGCTTATCGCAAATTATCCTGCCCCATCCATCGCAGGACCTTAAATCATGTCACCA 
GGCTCACCGCTTATCTAGACGTCATTTAGCACATGTGTGTGTGCTCTGCAGCAGCTGGCATGTTAATTATATCCTTTTTGGCAGGCAGCAGGAGC 
ATCTGGGTATCTTGGCGGGTCTTTTGTGGCCACTGCTGCTCGAAATTAAAATGTTATTATGCGAGGCTTATCTGCGGGCCTTAAAGCCACCGCAA 
AACTAAGGTCCTAATGGCCTCGCACACCAGCACACGCTCTTTAACTGTTTTAAACGCCATTGTTAAAATATTGCTCATACGCACTTCCGCCCAAA 
ATGGGGTCTTCTCTCTTCGGCAGCACTAATGATGCCCCAAGAACGCGGAGCAAAGGACCCGGCGAAAGGCAGGAGCATCCACAGCCATAGCCACA 
TCCACATCCACATCCCCGATGGCACAGAGTTGGCTCCGCCCACCAGCAGCCGATGACAGCGGCAGATAACAGAGCAGCGGCAAAGTGGGCGGTGG 
ATGGGGTTGTGGTGCGGAGAGCGCGCCTGCGCATTAGCCGC 
(SEQ ID NO: 1171) 

Exon: 1001.. 1235 
Exon: 1302.. 1379 
Exon: 1541.. 1804 
Exon: 187S..2016 
Exon: 2098.. 2366 
Start ATG: 1089 

Transcript No. : CT30086 

ACAGAGAGCTCAGAGGAAACGGCAGATTAGATCCCCCTGAGTTCATCCCCACCCAGGCACCACTAATCCGACGAGTATAAATCCCAAAATGCAAT 
TGACGTGGCAGAGAGATCAGATAAATCTAGTACTTCCGATTGCTCTGATTTGTTGTTGTCTTTTGATCGACGTAACCAGCGCTCAGGAGGCCCAA 
CAACCATGGTACGAGAATCTGCCAGCGGTGGCCATGGACTATAAGGTTCACATAGATGCTGGTAAGGAGGACTGCTACCATCAGTACGTTAAGGC 
AGGAGCCACCTTCTACGTGTCCTTTAGTGTGGTGCGCGGAGGCGATGGAATGGCTGGTTTCGCCGTCCGCAATCCTGCTGGTGAAGTGGTGAAGC 
CCTACCAGTGGCAGGCTACTGCGGACTACACGGATCAGGTATCGCCCGGTGGCTACTACTCCGTCTGCATTGACAACCAGTTCTCCCGATTCGCT 
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GGCAAGCTGGTCAACATCTATATCACGGTGGTGAAGTACGATGCATGGGACAAGTACGCCAAGGAGATCGAGCAGCTGCAGCTGAACATGCAA^ 
CTTCACTGCCACCGTTGGCACTGTGGAGCGCAACATCAACGATATGATGGGCTATCAAGCCCACAGTCGTCATCGCGAATCGCGTGATTATGCGT 
TGCTCCTGGACAACAATGCCTATATTCAAACCTTCTCGATTAGCCAAATTGTGGTATTCTTTGTGCGCAAACTATTTGAAGTAAAATCGAGCTCC 
AAAAGTCGCATCTAAGGCGCAAAGATACTTTTCAAAGACTTCGCAACATGTTTCTAAAATTCCATCCGCATTTCCAAGACATTCATTCTATTCTA 
CCTGCTGAATGAGTTTTTGTAACTTATTTTATCACTGGCTAATTGATCAAGCGCAACTGAGCAGACTTTTTTATAATTGGGACTACTTATGTCAT 
ACGAGTTATAGACAGCAAAACTTATTGTGATAAGTGT 
(SEQ ID NO: 1172} 

Start ATG: 89 

MQLTWQRDQINLVLPIALICCCLLIDVTSAQEAQQPWYENLPAVAMDYKVHIDAGKEDCYHQYVKAGATFYVSFSVVRGGDGMAGFAVRNPAGEV 
VKPYQWQATADYTDQVSPGGYYSVCIDNQFSRFAGKLVNIYITVVKYDAWDKYAKEIEQLQLNMQNFTATVGTVERNINDMMGYQAHSRHRESRD 
YALLLDNNAYIQTFSISQI VVFFVRKLFEVKSSSKSRI* 
(SEQ ID NO: 1173) 



Celera Sequence No. : 1*42000013384504 

ACGTTTTTGCTTAACCTTGACGAGGAACCGACAAATTTTGTATTACATTTCTGTGCAATTAAAAGTTATAACGTTTGTGGATATGAATAAAAATT 
TTACCATACCTACGTAGGTGATAAACGCGTACTTAGTAGATGGTTTCATTAAACTAAAACAACATTTTTATGGTGTTTGTAAAAGCGCAATTTTT 
TAATGCGTTTACTCAATTATTTATTGCCCAGGCCCGTTTGCCACAGCCCATTTAGCAACTTTAGCTGTGCGTTACCAACTATTTATATGCGTTAC 
C AC AT T T G AAAT T GT AT GAG AC AGCT GGT AT T AC AAAG AGT AT AT C TT ATT AATTC AC AT TGT AG C AC T GGT GT AAAAGTGGC AT A TAC AT T GAA 
TTACGAATGAGAGTGTAAAAATACGGTGTACATTTGTGTATTCTTATTATTTTTGGTAGTTTTAAACATTTGAGGCATTTCTGTCGTTTGCAATT 
TACAAATAAATGGCTAAAAACTATGGCTGACGAATGAGAACACCAAATTGAAATCAACGAACTTAAGCGAAAGCTATATGAATATCGAAGTTACA 
TGTGAGATATATCACCGTGTATGTATGCTTACGTAAATATATATATATATATGATTCCAAATGGGTATTTCTTGATCAAAGATACTAACGATTTT 
CTGAAGGGTCCTTAAATATTTACGAGTAATTCACCTTTTTGTTGGGTGAAACTATTTAGTTATTAAACTTGTACAAAAATAAAAATAAAATTTTG 
AAAATTTCACAAACTTTGTGGCAAATAATTCAACAATATATTGTAGGTCTATTACGATATAAAATGAATTGGCACTTCATGTATAAATATGTTAT 
TAAAAAATCAAAACATAATATAATTCTAGTTTATTCAAGGCCCTAAAGAGAATGGTTTTTACTTTTGAGATAGAACTACCCATGCCGGAGATGCG 
TGAGAAAAATGTATAAAAAGCTTAGTCGGCAGTCTAGCGTTTCCTCCAATTCAGAAATTCGTCTCCTTGAACTCCTTGGATTGCAGTACAGGCGA 
AATAGCTCCCCCATCGCCGTTTTCATTTCCATTATTGTGCTCCTCTCCGCCGCCCACCGTGTAGGCCATCACAGTGGCATGATCCAATTTCCCGG 
GCAATGCTACAAAGGCGCTGCTGTCCACCACACGCTTCCTGGCCGCGCGATGCTGCATGCGGTTCAGTTGCACCTCACGCTTGGCGCCATCTGGT 
CCCGTAGAGAGTTTATAGAAGAGTTCCCTTTGCTCCAGGGTCTTCCTCCTGGCTGACGTTATCATCCGTACAAAATCGATAAGCAAGTGGCAGAG 
CCTGCGTGCCGATACGCAGGGCGAAAACGTGCCGTTGATGCCATAGCTGAGCAGATCCTCTAAGCGGTAGAGGGTAAGAGCATGCCGCTGGGGAC 
CGGTAACAAGGGGCAGTCCTGAAATGGACTTCGGAATGCGTTGGGGCACCACCGATGGCGGTGGAGCTGATACCAATCCCTGGCGTGGTTCAATA 
AGCGTTTGGGAGCGCATATACTTTGGTCGACTAGGCGCATTATTAGGCGGACTTTCCGCCTTGGACTGACGAGCTGGTCGCGTGGGCGGAGTCGA 
AGGATGCAGACGGGCATAGATGGCGGATGCACTCTTGTTCTGCCGTTTGGGTGCCAAGTTGGCGGGCTTGACATGTCCCGACGTCTGAAGCTTAA 
CATCCGGAGTCCAGGCTTCGGCAAACTTGCCCACAACTGGATCGAAGTTATCGCTGATTCCATCCGACGTTAAAAAGACTATGTCTCCACTTTCA 
ATGCAGGTCATCGAGAAGGTTAGATTGCTCAGCTCTGGCTTGTTGCCATCCGCCGGTCCAAGGGCACCCAACGCATCGCGCATGTCCCTCATGGA 
ACTGATGTCGTGGGATGCCTGTGTTAGTTCCCGTACTCCGTGCTTCTTGGAGTACACATAGCCCAGGGAATCGCCAACATTACACGAGCACACGA 
CATATTTCCCTGGCGCTCCGTCCAGCGGCAGCACAACGGCTATGGTGAGAGTAGATAGGGCTCCTCCAACTTCCAGGATACAACCGTGTCCCTCC 
CAAAGACTACGCAGCAAACTCACAAAAACCTCCTGGGTTGTGGTGGCTCGGCACTCCAGCGCCTGTCCGAAGACCGCCCGGTCCAGATAGTCCAA 
ACAACCGTGAACAGCTGAACGGGCGGCCAGACGAGCGCCATCACCCCAGTTGACACCATCTGCCATGGCCATGGCAGCTGAGTCTCCGCGCACTA 
CCATTCCGTAGCAGTCGGCGATGGGATTGCCCATGGGTTCTTTAGTCAGCATGTTGGTCTCGTAGAGGGACACCGAGATACCATATGCGAAGTCG 
CACTCCATGCGCCAGTTCTCCACGCCAGCGATCAAAGCTGCGGGATCAGAGCCACCCTGGTTGCTTACAGCTTCGCGGAGCAAACGTTCCCGCTC 
TGAGTTGGGATCCCCATCCGTCGAGACCTCAGTCTTTGCATTTAACACATTCCTATTGTTCTGATCGCTGTTGGCTTTTCCAGCATCCGTGCCAG 
CAGACGCAGCGGCCAGATTGGCTATGCTGCTTTTGGAATTCTTGCGGGAACGCATTCCCGACGCAGAAGCAGATCCGGAGGGCTGCTCCGCACTG 
CTCTGTCTGCGAGGTCCTGCCGGCGGCCGTCCGAATTTGCTGGTAAGTCGGGTTGAGTTCTGTGTGGCCACCTGAACTGGTGGTGTACTCCTCCT 
AACGCCATAGCAATCATCGTTCTCCTGCTGCGTATCTATGAAGTCAATGTCATCCGCGAAGCTGGTGCTCAAGTGCAGTTTGGAGCGCTTGATGC 
CGGTGAGCCCCTCGTCAGGACCTGTGCAGGCGGCTGTTATAGAGCAGTTTCCAGCTTCGTATGTATCCAACTTGAGAACGGGCATGTCCGTGGGA 
TTCTGGCCATTGTAAATCTCCGGACCCTCAACAAACTGACGCTGCATTCGGCTGTCGGTGAAGGTGGTCGGGTAAATATTTGAGCAACTTTCGCA 
GGCTACTAGGTTTCTATTTACTTACCCCTCCAGATATTTAGTTATGAAGCTACCATCATCATCCTCGTGCTCGCTGGCCCGCCGGCGACCCTCGC 
GAGGCTCCGCGATGAAGCTCAACTGGCGGAAGTAAGTGGTGACTTTCTGGCGCAGCGACGGCATCTTTAGTCGAGTTGCTCCCTCCTCCTCGCCA 
GCAGTTCTCGCCCTCCAGTTAGGGTTCTCTTAATTGCCAGAGATGGAGGGGCTAAAAAATGGATGCTGGGATGCTGACCAGTTCGTTCACCTCAT 
CAGTGGCAATTGTTGTGGGAGTCGAGTCACTCCAAACGAATCCAGCCGCCGACCCTGCAAAGAGGTTTAATTTAGAGCTTCATTTAATTATCAAT 
AAATTATGCAATTAAATGGGACTGGGCAAACACGAAAGAATTACAAGCAAATTGCATAGGCAGCCGCATAACTGACATTACTCCCTCCTTTCAAC 
CGCCTACTCCTAAATGACGAGATGTGCTTTTTAGTTGCCGCAGGCCCTGCATCCTTTCCCTGGCCACACAAAACCCTTCGCTCATCAGGAGCAGG 
ATGTCAGAGGAGAAATCCCACCCTGTGAATGGGCTTTTACCCAGCTCATTGCGAATGCAGTAGACACTGAACTGATGCGGACACAGAAATCCCCT 
CCTTCCTTCGGGGTTCAGATGACATAGCTTTTAGCAGGGCTCTCCGCATGTGTTCTGAACCAGAGATGCCAATTTGGTTCTTTGCTCAACTAATT 
TTAAATTGTTTATGGATTAACTTATGTGCCAAACTACTTGCCTTTCAAAATAAACCATTTATAAAATCGAACGCAATAATTTGGGACTTGATTTA 
TCACAACATTTATTTATTTATATTAAACCCTTTTCAGGATCAACTTTCAAATAGTCCTTTAGCATGAATTAAGTAACTGGCATCACTGATGTGTT 
GATGATTAATGAATGGACAAACCAACTGCCTGGGGCGACCTGCAACTCCAGCAGATGTTGGGTTTTCGGATTCTGACAGCATCAAGTTCGGTTTC 
AATGGAAAGATGATATCATATAGGGGTAAGTTTGCCGGCGAGAAACACACCAAGGTCTGCTGTACAATTATGTATCTAACTCTCCGCCGCTATCG 
CAGACGACATTTAATTGCCGTGTTAGGGCATTTAACTACCTGATCCATTCTCGTAGACAATGTAACGCCAAGACAGCCTTCATTTCGGCGAGATT 
ACAATGGATCGCAAACACAATAAATGCTAATCTTCAGCAGCAATTGCCGTGGAGATGCTTTTCCTTAGACGATGTACACAAAACGATTTGTATAA 
ATGATAAACAACAACAAAAAAGTCGGCACGGAAAGTAGTTTCGTTGATGGGGGCAATTATTCAAGATACCCCCCCTCCCTTGTTGTTGCTGAAAT 
TTTCACAAATGCTTCAAAGATATATATAAAAAGAAAAAAACACAATCAACCACTTTAATTTGTGGCATAAGTACTTCAAGTGACAAATGAGTGGT 
CAATTAAAAACTATGTCGATTAACGGTGAATACTGTGGGTCGGTACGCCTCTCTATGCGCTTCTCATAAACGGCACTTAACGGCACGCGAAAATC 
GCATGAACCCTGAACCAATTGTTCGAGTTTCGATGAACTTGAACTGCACTCACGAATCAATCAGCGAGTAATGTAATTCAATTTCAATCTGATGG 
CCCCAAGATGCGCGATTTCTTGGCAATTGCTGGGCACTGATCCGATTGCCACCACACCCGCATTCCTTGCGGCAAAGCATTAAAATCAAATTAAC 
CCACATAACCATTAAGTGCAGGAGTGGGAGATTTCCCATTACCATATCCACATTTCCCAGATACAAATCAAATGAATTGAAAAGTCAGAGCCGCG 
AGACAAAGTCGATCGCTGGACGAAAGGGATGGATAGCTGAGTGCCCATGTGGTTATAAAGATATATCTTTCTGTGGATGCATTGAGGCGCGGCCA 
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GCTGTGTGCACTCGTACCGGTCCAGGGCAATCAATACGGATGATTGAAAAGCCCACCTTCACATCACCACAAAACAATTCAATTTAGCCACAACA 
ATGTGCAGCTTGCGATTAAATGTGCAGCCAAAGAGATTACAACTGCACCACAAGAGCCAACATTGAAGCTCAGCCACAAGCGTTTTGTAGTTCCC 
ATCTGCAATTGCTCCAATTTCTTTACGCCTCCTCCAGATCTGCTCATCCACGTCAATCAGAGTCGTCATCACTTGACACACAATAGGCATCTGAC 
TGTGACTTACTGGCTTCTCAATTCCCTCAGCTTTTTCGCTTATAAGTGTCCATTGTGGGATCTCACAAATTATTTAAGTTCTGACTTTTGTCCAC 
CTGACGGCACGTGCCGAGAATAAAGTATAGGTTTTATAGAGATGGAAAAATCTCTGGATTGAGAAGTAAACTCATGTAAGATGCCCGTAGCAGCC 
GTTTTGTTTACCCATGTACATATATTATTTTTTAATCTTTAATTGCATTTTATGTATGTATTCAATTGTATATGAAATGACACTCCCCCAAACAT 
TAAACAAACAATTCTTCGAGGCGTGTGCCGCAATCAATCGACACTTTTCGATTGGATTTCGGGACCATCGAAATTTTCAACACTTGAAACTAATC 
AAGCGTGGAAATGGGGAAATAGGAACGCGGTGCTGTGAGATCATTGTGCAAGATCTGGGTGGGGCGATCCGTGGTGTATAAAAACAACTGAGTAA 
ATATTATCGGATGTGGCACAGATCTCGTGAGTCTTTCGAACTTTGTTTATGCTCGAAAACCAAAATAAACCATTTGCGCTTCCTCCAACCAAAAA 
TAGTTGTGTTGGCCCGTATTTATAGCTGCCGCCCGTGGAAACAAAAAATGTCTGGGTGAATTTCTCGCTTGAAAAACTCATTTGGGCAAGGCTTT 
GAAATGCATTTCCCGTCGCATTCGAAATCAGCTAAATTTTCTGAGCTAATCAAATGCGCAAAATTATTGTTTACACACTGGGTTAATTAGATGTG 
GAGATTGGTTTCTCTGCTCCGCTTCGAGATTCTTTTGTGCTCTTTCGGCTTTGTGCATACTTAAAATGTGTGTGGATTTACGAGGGAGAAGTGCT 
AAAACCCAATGCAAACAAGTCTGCTTTGGGGTCCAACCAAATGCTCCAACCAAATGCAAACACACCCGTGACATTCAGAGTCTGGATGCACTTGA 
GGCAAGTGACAGTTGTCTGCAGATTATGTGCTATGTGCTGCATATTGTGGAATCTCAAGGTGCAGTCGGTGGGCCAAAGCCGCAGTCAAAGAAAA 
AGCCTTAACAGCACAAGTCAAGTGAGGCTGCTCAAAGCCGTTTTGTGTGCTTCTGTTTGGGCCAGACAACAAACAACAAGAAGCTATAAAAGCAG 
AGAGTGAAAATTTGTTGCATTCATTGCGTGTT7TCTTCGAAAGAAAGTTGAAGGCCATCGTTAAGCTGTAGATTAAATAGAGCTGTCTGCTTGAT 
TACGAGCTCCCAAATATTTCAATCATTTCGAGCGACCCACTTTATGTATATTGCTTTTGTAGCGAGTTCCCTCCTTAACGTATAAGTTGTGTAGC 
TTAGGTATTCTAAGACCTTAGATTAGATGATTCTCCCTTATGGGAATTATAATATATGAAACAAAACCGTATGCACAAACCACATACACGCGTCA 
AGTGCTGATTCAACCGCAGTCAAGGCAATCTTGGATAGGAAATGCATGCTTAATTTTCATTTCCTCATATGCAATTTTTCTCATTTCAAGTCCAG 
GCTGATCAAGCCGCATAAAAATAAAATATTAAGTAGCAAATGCATTAAGCAAAATGGACGGGGTTCCCCAACCCAGAGTTTTAAACTGACAAGCT 
TTTGGCTTAAACCTTCCCCCAATATTTGACCATGTGTTGACTCATTGGACAAGCCGTCAGCTGCATTAATAATGCATTAGACCAGCCCCAGGACA 
TAAAAGTCCCCCCATCGTGCTAAAGTGGGCGCAACACGGCCTGGGGCTTCATGAAAATCTGGCAATTAAAATGTTTTACATTATGTAAACCGCAC 
GAAAACAATCACTAAAGAGTTCCTTCACCTCGAAATGGCGTCGTGAATATGAAGCGATATGTATGATGTGGGAAACAAAGGAGGAGTTGAGGAGT 
CTCTTCCCGGTTTTCGAAGCATTTCTTATAAATTGCCACCTAATTGGCGTGTTCAACATTGTGACTGTGCGTGGTACGCCTCATTTCCTGTTGCA 
ACAATTTGCATTAATAGAGGGTCTCATACCACAGATTCAGTTCGAGATCTCCCCATCCGTATGCGAATGCGAATCTATTCCAGTGAATGCATTTC 
AGTACCATTCAATTGAAGGCGCGATGAAATGGAAAGTTAAGATCCCTTTGGAGGATTGGATGTGTGTCAGAAGTTGAGAAACATCCTTTGTTTTA 
TGTGCCTTCCATGTGCATTCGCACTTAACGCCAATGCCAGGACCTAAGAGCACCCAAACCCACCCAGTTTTTCCCCACTGCATTTCCGCTTGCAT 
GTTTTCCCTTGTTTTTCCGCTTTTCGGGTCGCGCCACTTGGCCCGCTTTTCAGCCACGAAAAACTATACTTGAAGAACTTGCCACACGCCGAATA 
C AAT AAACT AAAACT GGG GC AAGG CG AAAT AAT T CGGGC C T AG AAAGGC AAT AT C AAAG GAGT GCCGG AGC TTATGT ATGG T AT AAGGGT AAATT 
ACCTTTCGGACTTACTGGGTATATAGCATCAAACCACCCTCACACAAAACATTTAGTATGATTGTTTTTAAAGCCATATAATTACGGCGAAGGAA 
AGCCATTCCCATACCATTCTACTAATTGGTTGATTTTTTGTTGGATTTCGCAAACATCTGGCAGAAAAATTGATGCCGCCAAACAACTCCACACA 
CATTTGGCAAGCCCACAAGATAATTTGCATAAATTGAATTGAAGTCTTTACTTGTTGGCCAGGTAGCGACAGCTGTCGTCGGTTTATGTAGGACT 
TATATACGACTAATCCTGCATGAATAAATTAAATGGGAAGCCTCAACTAAGTCTGTCTGGCGGCGGATGGGTTTTGGGTTTGGCCTCTGCCGAAG 
CCTTTGTTGCCAAGCATACAAATGAGTTGAATCCAAACTCCTGGGCGCTGTCACTCTATGCAAATCTAGAAAACCAGTAACCAGTTGTTGTAGGA 
TGGAGAAGGCACGGTGGACGGACTTTCGGGGCGCTCAACTCACATATTAAATTAAATGTTAATGGCAACAAAGACAATTCAAACGGAGTCAAGGC 
CCGGGCCAAAGAACTGTGGCTAACAAAGAGGCAAACAACACCAATTTCTCGACTGCCCCGCAGCAATTGAGCAACCAACGAGCCACTTTGTTCGC 
CCCATATACCCCGCCCACATCCCAGCCCCTGTCCACCCGTGAAATCTGTTCTCCACCTCCAGCTACCTCTAGATAATATTGCACTGTTCGGTGTC 
AAATGGCTGCCCACTGACGGACTTCTTGGTTTGTCAGGTGCCATGGTTATTCCACCTACAGTAGAAGCTCTTGAAAGTGGAATTTCTTTCTATCT 
AATAATAAAAGATAATTGCCGAGACAGTTTTGGTTAAAAGTGGTCAAAGTGCTCAAAGACTCTCTATCGTCGGTTATGTACGTTAATCATATTAG 
AAGTAAGTGAAAATGTCCCCAGAAAGGACGTTCATTACAAGGAGCCCCTACTGTACAAGACCTTGCGGCGGGTTAAGTGACCGATTGGCTGTCAG 
TCCCCAGCCAACCAACCTCACCTCTCGGACTTCTAGTTTTAATTTGAATTTCATTCCGTCTCGTGTGGTGGCCAAACGCTTTGGTGGTTCGCTTA 
GTCATATCACTCGGCTGACTAATCATTGAAAGCGGAGCGTGCTTGGAGGGAAACCTTAAGCCCCAGATATACCCCCCTTCTAACTCCAAACCATT 
TACCGAAATCTCCACCCCGCCAAAACCCCACACAGCTGCAAATTCAATGAACCTTTGCTGCTGTTGACTTTGCTGATTAATTGGAAATTGGCAAA 
ATGTTATTTATTTTCGCGCGGCTGGTCTAGCGCCAAATGGTTGTTTTTCATCAAAGGCATAAAGTTCTATAAATGTTCCCAGACAAATTGGTTTT 
AATGAGCCGGAAACTTTTTCGGTTGAGCTCGGATTTTTACGGGTTTGGCGGATAGAAGGAGAAGCCACATATTGTACCCGATTGGCTTTAAACAA 
ACACGGACATACTATACTGAGACAACTTCTTCAGTGGCTCGCTAATGCATTATACAAGAATGAACTCCTTTGCAGGCACGGCCGAGCAAACAAAC 
TTCAACTTGTCCGGACCAACTGCGTCATAAAAGAGCCAACGGCAGCGCCAATCGCCATCTCCGTTTCCAGCGAGACAACAAAGGATGTTGGGCGC 
CGGGAAGCAGGAGGAGCAGGACCTGGGAAACCAGCCAGTTGTGGATGGAGCGCAGACAGAACAACTTCATGCAACCCCATCCCCTGGGAACCAGA 
CGGCAGCCAGGAGTCTGTGTGCCTGATCCTTGCCATCTCGAAACGAAAAAGAAGCCGGCGAAAAATATATAAAATGGAGAAACAAAATATTTTCC 
CGATGTGCATGCATAGTTCAAAGTTAGTTTCAGGCATCTGTGCACCACCAAATACACCACATTCCCCACTACTTACAAGCAGAATTGTGAAAATT 
TTTTGGTTGCCAAGCGAAAGTGATTGAATTCCTAGGGATTTTGTCGGGCGCCAGTTGAAAAGGAATTTTTGATGCGTTTAAGTGCCGAAGTTTCC 
CTAAAAAAGTTTGGCCTTAGAGTCAGGGGATGTATTAAAAAGAAAAATGTCGAAAGTTTTTGAAAGAAGAGATTAACAATGATGTGTGATACAAT 
GATTGTGTGTTTTGGTCTGGAGAATTTAAATCTCAATAAAAAACTACAAAATGTATATTAATGAGGCTAACGATGGTTATATTATGCAAACAAGT 
GTTGGGGGGAATATAATTATTTAAAATAATTGCATTTTTTTGCGATTCCCACTTTGCAATTCTAATCCCTAATAAAACTAGAGATAACTTATCTT 
ATGAAGCAATTTGTTTACTACTGCGACTGGTCTACAGAATATACTGATGCAATCGCTTTAGAAGTGTGGTAGGTAATCCCGTTCGTTCTCAGCCG 
GAAATCACCCCCAATAAGAAGTTTAGATAATCTTAGATACGGTATCTGCTTGTTGTTTGCCACATGCATCTACACCCTGGATTCTAAACTTAGTT 
CCCCCAAAATATCTGACCGAGTTGTCCCCTTTAATTTACAGCTTATATCCGGCATGTTTTTCTCCCGTTTGAATTTTGCCACAGTTTGCGCCAGA 
AGTTGTGAAAAGGGCAGAAAGAGAACGAGTTGAAAGTTCCCATTAGAATGCCCAATCCAAGCCATCCTGTTCCTTTATTGCGTGGCCCCTTTTTT 
TTACATTTTCCGTAATTCCCAGCCAGGCACCATCACGTGTCGGGTTTTCTCCAAGGTATTGGGCTTCGTGGAGAATCGATTTTGTGGCCTGACGA 
GCCTGTTATCCTTGGAGCGAGCGCTTTTGGCTTGTTTCAGGACATAGAAATTGCCTGCTGCATGTCCGAAAGGCTCAACGAGAAATTATGCTAGC 
TACACTGAAATCTAAACTTCAGTTCAAGGAACTCAGGAAAGAAAATACTATTCAGAAAATGAAGTTTTTATGTTTTAAAACTTTTATAACAGTTT 
ATAATATTATTGATACGATGTCAATATTTTATATTTGTTTTAGTTCGGTAAAATGAAGATTATCATGAATTTTTCTCAGTGTGCGAATGGAGGTG 
AGTGCGAAAACGAGAAGGAGGAAGCATCCATCACATGCATCTGCAGTTGGCAGGCAGGTAAAAATGTTGTACAAAAAATGAATAACTGACATTTT 
AAGTGGGAAAGTTTTTGCATCAAAATCAACTGCCTGCCAATGCTGCTACCGAAATTCAATATATTAACAAGTGCTCCGTAATTGTAAATTATCGC 
CACTTTTTACAGTCATTACTATTAGTAAGTTTATAATTATGTGGGTAACAAAGCGGTAAAATATGCCAAACGAAAAACAACACCAGCTATTAGCA 
CAGTTTCGAAATACGATAAAGCTTGAAAACCGGTTATTGCACCAACGAAAGCTGACTTTTGTTGTTCTTGTTTTGTTTGTTATTGTTATTTTTGC 
TAGAGCAGCTGAGATTTCGCAGCTGCCCGTGAAGTTGAAAACAAAATAAGAGAAAAAAACAAACGGCAGGTGAATCAACAGCGGTTCAAGCGCCC 
AACCCACCCACAAGTTTGTTGGAGGGGGGACGACGACGTAAAAAACAGAATAATAAAGACGTAAACCCAGTTAAATGGAAAAATCAAGGCGAACG 
AACACACACGGAAGCAAAAGCACAAAACAAAAGGCGACAAAAACACGTCGAAGGCGGCAAAAAAATTTGGAGAACCCAAAAATCAATAGAACTTC 
GAAATATCCATAACCATGTAACACAGTGCACATTCACTTACAATAAGTGTGCCTTCCTCTTCTGGGTTGTTTTCTTATCGCTACTTTACATATGT 
TGGCGCCACTTGCAATGGGGAAATATATGTAAGTGCGTGCGTTCCGTTGGTGGTGTGTATAAATATTTAACATTTGCCGGAGGGTGGTTCCATCG 
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CCCAGAACTTCGGCTTCTGCATGGAATACTAACATAAATCGCGCTGCACAGTCCACTTTTGTGTGTTGCACTTCGAGTCCCAATGGTTACCAATT 
AATGTCGCTACGAAATGGAACTGCTAACGACTGTTTCCGCCGCGGACCGACGTTCCTGCTTCGATTTGAAATTCGAAGTGAATTGGCCAAGCCAA 
GCGATATTTGTGGTAATCGAATCGTCGATATGAAACACACTATTCTATGTGCAACCAACCGAGTTGGCAGCTCCCAGCGCGATTTTTCAGTATAT 
TTTGCAGTATTTTAATTGTTGCAGACGGCTTTTTAAAGCGCCACCAGTTCGTTCACATTGTCGCTATTTTTCAGCATTTCGCGGGATCTGTATTT 
TTTATAGGCGAATATTGAAATTTGCATGGTCTAACATTGATTATAGTGTTGTATTCACCGAATATTTATTGGTTTTACTGTCCATTGCTTTGAAA 
GCAATAGAAATAATTGAGGATCAAGTAAGTTTAAGATAAATTATTACAGTTAAATGCCGATGCAGGCACTCACGCACACTAACGCACCCAAAACA 
GCGACACGCGAATTTGCGAATTTCGTTTTGGCCCGTTGTTTGTGTGTTGTTGCTGTTGTTACAACGTTGTTTGTGCTAAACATTAATTAATTGTC 
GCAATGTCTGTTTCGCTATAAATATGCGAAACGTGCCGTTGTTATTTGCGCGAAACTAAATGGCCAATTGCGCTTTACCGGTGTTAGTGTAGGTG 
TATGTGTTGTCTGTGTGTGTTTGTGGTTGCTGGCGCGGGCGAATGGCGGGTAATCCGCATACACTCGCACTTATTTTTTGCAAAGGGTTTATGCG 
ACACTCGCTTTGATCACGATATTAAAAGCTCAAACACCACTCTACCATTATACAAATATTTATTTATGTAAAATAAATGCGGTTAAATAATTTAC 
C T T AG AAC T T GC C CT AAAG GAT AT T AAGTGC AAT T AAAAGT AG AAC AAAC AT T CAT T GC T AAGT T AC T AAATG AAT AGC ATTT TC T T TT GC AAAC 
TTGTTCATTATAAACTACTCCTTCGAATATATATTTACTTAGTATATTTTCTGTTTCCACGACTTTGTCTCACACATAAACACGTAAAAGCGCGA 
AACAGACACAACGCACAC 
(SEQ ID NO: 1174) 

Exon: 11938.. 11632 
Exon: 3379.. 3066 
Exon: 2996. . 1001 

Start ATG: 3199 (Reverse strand: CAT) 
Transcript No. : CT36839 

CGAAGCAGGAACGTCGGTCCGCGGCGGAAACAGTCGTTAGCAGTTCCATTTCGTAGCGACATTAATTGGTAACCATTGGGACTCGAAGTGCAACA 
CACAAAAGTGGACTGTGCAGCGCGATTTATGTTAGTATTCCATGCAGAAGCCGAAGTTCTGGGCGATGGAACCACCCTCCGGCAAATGTTAAATA 
TTTATACACACCACCAACGGAACGCACGCACTTACATATATTTCCCCATTGCAAGTGGCGCCAACATATGTAAAGTAGCGATAAGAAAACAACCC 
AGAAGAGGAAGGCACACTTATTGGTCGGCGGCTGGATTCGTTTGGAGTGACTCGACTCCCACAACAATTGCCACTGATGAGGTGAACGAACTGGT 
CAGCATCCCAGCATCCATTTTTTAGCCCCTCCATCTCTGGCAATTAAGAGAACCCTAACTGGAGGGCGAGAACTGCTGGCGAGGAGGAGGGAGCA 
ACTCGACTAAAGATGCCGTCGCTGCGCCAGAAAGTCACCACTTACTTCCGCCAGTTGAGCTTCATCGCGGAGCCTCGCGAGGGTCGCCGGCGGGC 
CAGCGAGCACGAGGATGATGATGGTAGCTTCATAACTAAATATCTGGAGGGCCGAATGCAGCGTCAGTTTGTTGAGGGTCCGGAGATTTACAATG 
GCCAGAATCCCACGGACATGCCCGTTCTCAAGTTGGATACATACGAAGCTGGAAACTGCTCTATAACAGCCGCCTGCACAGGTCCTGACGAGGGG 
CTCACCGGCATCAAGCGCTCCAAACTGCACTTGAGCACCAGCTTCGCGGATGACATTGACTTCATAGATACGCAGCAGGAGAACGATGATTGCTA 
TGGCGTTAGGAGGAGTACACCACCAGTTCAGGTGGCCACACAGAACTCAACCCGACTTACCAGCAAATTCGGACGGCCGCCGGCAGGACCTCGCA 
GACAGAGCAGTGCGGAGCAGCCCTCCGGATCTGCTTCTGCGTCGGGAATGCGTTCCCGCAAGAATTCCAAAAGCAGCATAGCCAATCTGGCCGCT 
GCGTCTGC T GGC A C GGA T GC T GGAAAAGC C AAC AG C GAT C AGAAC AAT AGG AAT GT GT T AAAT GC AAAG AC T G AGGT C T C G AC GG AT GGGG AT CC 
CAACTCAGAGCGGGAACGTTTGCTCCGCGAAGCTGTAAGCAACCAGGGTGGCTCTGATCCCGCAGCTTTGATCGCTGGCGTGGAGAACTGGCGCA 
TGGAGTGCGACTTCGCATATGGTATCTCGGTGTCCCTCTACGAGACCAACATGCTGACTAAAGAACCCATGGGCAATCCCATCGCCGACTGCTAC 
GGAATGGTAGTGCGCGGAGACTCAGCTGCCATGGCCATGGCAGATGGTGTCAACTGGGGTGATGGCGCTCGTCTGGCCGCCCGTTCAGCTGTTCA 
CGGTTGTTTGGACTATCTGGACCGGGCGGTCTTCGGACAGGCGCTGGAGTGCCGAGCCACCACAACCCAGGAGGTTTTTGTGAGTTTGCTGCGTA 
GTCTTTGGGAGGGACACGGTTGTATCCTGGAAGTTGGAGGAGCCCTATCTACTCTCACCATAGCCGTTGTGCTGCCGCTGGACGGAGCGCCAGGG 
AAATATGTCGTGTGCTCGTGTAATGTTGGCGATTCCCTGGGCTATGTGTACTCCAAGAAGCACGGAGTACGGGAACTAACACAGGCATCCCACGA 
CATCAGTTCCATGAGGGACATGCGCGATGCGTTGGGTGCCCTTGGACCGGCGGATGGCAACAAGCCAGAGCTGAGCAATCTAACCTTCTCGATGA 
CCTGCATTGAAAGTGGAGACATAGTCTTTTTAACGTCGGATGGAATCAGCGATAACTTCGATCCAGTTGTGGGCAAGTTTGCCGAAGCCTGGACT 
CCGGATGTTAAGCTTCAGACGTCGGGACATGTCAAGCCCGCCAACTTGGCACCCAAACGGCAGAACAAGAGTGCATCCGCCATCTATGCCCGTCT 
GCATCCTTCGACTCCGCCCACGCGACCAGCTCGTCAGTCCAAGGCGGAAAGTCCGCCTAATAATGCGCCTAGTCGACCAAAGTATATGCGCTCCC 
AAACGCTTATTGAACCACGCCAGGGATTGGTATCAGCTCCACCGCCATCGGTGGTGCCCCAACGCATTCCGAAGTCCATTTCAGGACTGCCCCTT 
GTTACCGGTCCCCAGCGGCATGCTCTTACCCTCTACCGCTTAGAGGATCTGCTCAGCTATGGCATCAACGGCACGTTTTCGCCCTGCGTATCGGC 
ACGCAGGCTCTGCCACTTGCTTATCGATTTTGTACGGATGATAACGTCAGCCAGGAGGAAGACCCTGGAGCAAAGGGAACTCTTCTATAAACTCT 
CTACGGGACCAGATGGCGCCAAGCGTGAGGTGCAACTGAACCGCATGCAGCATCGCGCGGCCAGGAAGCGTGTGGTGGACAGCAGCGCCTTTGTA 
GCATTGCCCGGGAAATTGGATCATGCCACTGTGATGGCCTACACGGTGGGCGGCGGAGAGGAGCACAATAATGGAAATGAAAACGGCGATGGGGG 
AGCTATTTCGCCTGTACTGCAATCCAAGGAGTTCAAGGAGACGAATTTCTGA 
(SEQ ID NO: 1175) 

Start ATG: 488 (Reverse strand: CAT) 

MPSLRQKVTTYFRQLSFIAEPREGRRRASEHEDDDGSFITKYLEGRMQRQFVEGPEIYNGQNPTDMPVLKLDTYEAGNCSITAACTGPDEGLTGI 
KRSKLHLSTSFADDIDFIDTQQENDDCYGVRRSTPPVQVATQNSTRLTSKFGRPPAGPRRQSSAEQPSGSASASGMRSRKNSKSSIANLAAASAG 
TDAGKANSDQNNRNVLNAKTEVSTDGDPNSERERLLREAVSNQGGSDPAALIAGVENWRMECDFAYGISVSLYETNMLTKEPMGNPIADCYGMVV 
RGDSAAMAMADGVNWGDGARLAARSAVHGCLDYLDRAVFGQALECRATTTQEVFVSLLRSLWEGHGCILEVGGALSTLTIAVVLPLDGAPGKYVV 
CSCNVGDSLGYVYSKKHGVRELTQASHDISSMRDMRDALGALGPADGNKPELSNLTFSMTCIESGDIVFLTSDGISDNFDPVVGKFAEAWTPDVK 
LQTSGHVKPANLAPKRQNKSASAIYARLHPSTPPTRPARQSKAESPPNNAPSRPKYMRSQTLIEPRQGLVSAPPPSVVPQRIPKSISGLPLVTGP 
QRHALTLYRLEDLLSYGINGTFSPCVSARRLCHLLIDFVRMITSARRKTLEQRELFYKLSTGPDGAKREVQLNRMQHRAARKRVVDSSAFVALPG 
KLDHATVMAYWGGGEEHNNGNENGDGGAISPVLQSKEFKETNF* 
{ SEQ ID NO: 1176) 



Celera Sequence No. : 142000013383859 

GACCTCTACAAGAACGGCGGCATATCGCATGGGGCACACCAAAAGTCGGTTGTCATTTCGGGGATCGAACTGTACTTTCAGCACAGGCGATGGAA 
ACCTATACTTGTGTTCCAGTTCGCCGGTTAGCACATCCCAAATGCACACATTGTTGTCCGTGGAAGCTGAAAGTAACTGGAGTGGATTGGAAAAT 
GTAGATGATCAGGTGTACGGCGAGCGAGGGCCTACCTTGTGACCATTCCTGGTCCAACTGAGACTGCAAACGGGATGCACATGGGCCGAAATAAT 
CTTGGCAATGCCCCTCGTCAGAAAATCCCAGATTACAATGCGGCCATCATTGCATCCCACGGCCAAAAGGGTTCCGTATTTGTTAAAGGCACACG 
TCACGGCCAACGAAATGCAATCCAGCGAGCCGTCGAACTCCTCTGGGTAGTTTTGTCCGAAAGATTCTTAATAAAGCTGGAGTTATTTACTACAT 
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AAGCATAAAAAATGGGCGGCAATCTCACCTAGTAGCTCCAAATTCATTTTTGTTTACTATCTTCTTCTTTTAAAATGGCAACTGACACTCACACC 
GAACGGCAATAACATATTATGTTATAAATGATAAGCATAACAGAAGACGAGTTTTTCGATAGTTAAGGTTATATTCTGTACAACTACTGAACATC 
AATTATTATATTATAATACGAATTTAAAAATCTACGTATCTACGTTGGTGTTTTTTGTGGTTTTATTAATAGAATAAGTTTAGTTAATTTTAGAG 
GTGAAATTAAATATTGAAATAGTTAAACATATAAAATAGATAAATTTGGTGAAGCGTAAGTAAATTTTAAACTGCATTAGTTATATCTTAATTCA 
AGTTCCCCACTAATTTATGAGCGTTAGATGCGATAGTGTTATGTCTGGCGCAAGCGATTGTCTCGACGTTTCTTCGATATTTTAAGATATCGATA 
ATGTCATCGGAGATTTGCCACCCCTATTTTTTGAGCTCTGTGCACGTTGTATGAGAAAAATTTATGCAGATGTATTCGCCAATTGTAACCAAGTC 
CGGATTGTGAAATTGCACGCAGACAGCGAGCATCATAATAAGCGCTCTAATTGGAGAAGGCAGAAGTTTCGAATTAAGCCATTTCGGAGACGTTT 
CGTCCGTGGCACGCGATTGTGTCAACGTCACCTTTGGAGCACTCGCCCTCTCTCGCTCGCTCCCGCTTTCGTTGCCAATTTTTTGCTTGCCCTCT 
CTCGGTCACTCCCTTCGAAAGCTTAACGGTTAGAGTGTAATAATGAAATAACCCAGTTTCGAATTTCGTTCACAACAAAAGTGCGGGCCTTTCAT 
GCCAAAATAACATTTGGTTTCGAATTGTTTTTCAAATTCGAATCGAGGTTTTCCAGCTTTCCAGTTTGACAGCGAGAGAACGAGAGAGAGCGAGG 
GCGAATTACGGTGTTCGCGCTCTGCTTGTGCTTTCCACTCCACTCCCCTTCTTAACTTCCCCCCTCCCAGCCTATTTACTCTGTGTGCATGTGTA 
AATGAATACTTTAAAACGTTTTTAATCGTTGCAGTGTATTCATCGCCAGCCACGTTAAAAGGAAGAACGTGTATGTGAATACGACCATCAGAGGA 
TCTTAGCGAAGGATTCAGGAGCCCAAATTCTTAATCCCCATCCCACACACACACATATACTCGCACGTAGGCACGCTCTCTGTGAGAGGAAGGGT 
AAATAAAAGAAGCAACGAGCAAAATGTCTGGCGAGGAAACGTCTGCCGATCGCAATGTCGAGATCTGGAAAATCAAGAAGCTCATCAAGAGCCTG 
GAAATGGCCCGCGGGTGAGTCTTATCAATGATATACATGTCCTTTAACGTCCCCCCCAACGACTGTTTTGCGTACGTGTGTGGGGTATACACAAT 
GCATGGCACGAAAACAACAACACTGGGGTGGAAGTGGGCGTACTGGACGTGGGTGTGGGTGGGGACTTGAGGGAGCGGCGAGTACAACCACCTTT 
CCTTGTTGTCGCCGTGTGAATCTCCTTTGCATTGTTTCCAATCTTGGCAAGATCCCCCGCCTCACTCTATCTCTCTTTTTTGCATACGAGCAACA 
CGTTTTTGTCGAGTTTTTATCAGTTCAAACTAGACTGATTGAGAGACGGAGAGAGAGAGAGAGAGAGAGGAGTGAGTGAGTGAGTGAGTAATCGC 
CTCGTGGCTGTTGTGCGTATGCGTGCATGTGTGTGTGTGTTTTTCCCCCAAAATGGGCCAAGCTTTGTGGACCCCTCGCTCTCACTCATCCTGCC 
TCGCTCACTCCCTTGACCCGGTCCTTACCGCTTCACACCGCTTTAGAGTGGGTAACAAGGTCAGCAAATCGAGTGACCCCCACAGGAGCATATCT 
GCTATGTATATACATATATATGTATATTGTTGTCAATATGCTCCACAATTGGAGCTAACATTACACCTCTTCCAATTGGAGTTGGCCACTGTTGC 
CAATTGGGATTCGCTTTGACAGATATCTTCCTATTGGTTTTCAGCCATATGCGAAATATATACACACATATGTATTTGTTTTTCTTTTTTTAAGT 
TCTGTGCTATTTTTTGTTGTATTTTGCATTCCAACATTTTGTACAAATTACGATTGGCACTCCTCTGTATTATAACGAACGAGAAAATGATTTGC 
ATCAGAGAAACAAACGTTTGGAAGTTACAGAAACGTATTTGTTTCAATAAATTAATCAAAGTTTAAATTTAGTCCAATAACCGAATGTCAAGTTT 
AAAAGACATATGCCACGAAATTTCACGGCTAATTAAATAACTTTAAAGTAGGTTGCTATTATGTGCATCTGACGCGCAAATATACGATTGAATAG 
CACTGTTGCTATGGGAGAAAGAGCCATTTCCGCGACTTATGTTCTCCCTTGTTTGGGTTTCTTGCAGCAATGGAACCAGCATGATTTCTTTGATT 
ATTCCGCCAAAGGATCAAATCTCGCGCGTCAGCAAGATGTTGGCCGATGAGTTTGGAACGGCGTCGAACATCAAGTCGCGTGTAAATCGGTTGTC 
CGTCCTCGGTGCCATTACGTCGGTACAGCACAGACTCAAATTATACACCAAAGGTAAGCTGAGCTAAGAAATTGTTCATAAACAATCGAACAAAC 
GAACTCTAAACCGTATCACCAACCGCTTCAATTCAGTGCCTCCCAACGGTTTGGTCATCTACTGCGGCACAATAGTCACAGAGGAGGGCAAGGAG 
AAGAAGGTGAACATAGACTTTGAGCCATTCAAGCCCATAAACACGTCGCTCTACCTCTGCGACAACAAGTTCCACACGGAGGCCCTCACTGCCCT 
GCTCGCCGACGACAACAAATTTGGATTCATCGTGATGGATGGTAACGGAGCGCTATTCGGTACCCTTCAGGGCAACACGCGCGAGGTGCTCCACA 
AGTTCACCGTCGATCTGCCGAAGAAGCACGGTCGTGGTGGTCAGTCCGCCCTTCGTTTCGCCCGTCTGCGTATGGAGAAGCGCCACAACTACGTG 
CGGAAGGTCGCCGAGGTGGCCACCCAGCTCTTCATCACGAACGACAAGCCCAACATTGCCGGACTCATCCTGGCTGGTAGTGCGGATTTCAAGAC 
TGAGCTTAGTCAGTCTGATATGTTCGATCCTGTAAGTGGGAGTTAACCTATTTCCATCGATGCCATATTAAGCCTTACACTTGAATCTTTTTTCA 
GCGTTTGCAATCAAAAGTCATCAAGCTGGTGGACGTGTCGTATGGTGGGGAAAACGGTTTTAACCAGGCCATTGAACTGGCGGCCGAATCATTGC 
AGAAC GT T AAAT T C AT AC AGG AG AAG AAAC TCATTGGTCGCTACTTT GAT G AAAT T T CT C AGGT G AGT T GC C GG AT AC G AGTT CT G AGG CAT T T C 
GAAATTAAAAACCACATCCGTATGCTTTCTATATTTAAAAGGATACTGGCAAATACTGTTTTGGAGTGGAGGACACTTTGCGGGCACTGGAACTT 
GGCTCTGTAGAGACTCTCATTTGTTGGGAGAACCTGGATATTCAACGTTATGTTCTCAAGAATCATGCCAACTCGACGTCAACGACAGTATTACA 
TTTGACGCCCGAGCAGGAAAAGGACAAGTCGCACTTCACTGACAAGGAGGTGAGAGCTAAACTGAGTAATGATACTAATCCGCACGCTAACTGGG 
AATTCAATCGTATAGAGCGGGGTAGAAATGGAGCTGATTGAGTCTCAGCCGCTGCTGGAATGGCTGGCAAACAACTACAAAATGTTCGGCGCCAC 
ACTGGAGATTATCACGGATAAGTCCCAGGAAGGAAGTCAGTTCGTGCGAGGTTTCGGTGGAATCGGCGGTAAGTTTGAGCGGCCAAACAGCACGG 
CTGGCTAGTTGTAGATTGGTTATGTACTAAACAAATGCTGCAACTATATCCATTCCTCTTCCCTCATGCTGCTTTCACACGGTCGTCAACGCTAT 
AATGTAAATGTTAATTTGCAAACTGGTACCTCAATTCGATATGTAGGCATTTTGCGCTATAAAGTAGATTTCCAATCCCTACAGGCCGATGAGCC 
GCTGGAGGATGTCGATCTCGATGACTATTAAGCCTGTTAAGGGTGGGCTGCGAATGGGTCTCGATTATGCGTTACATACATAATAAACCAGTACT 
TTTTGGGAAACCTCGTAAAGCAAAACGTTCAAATTGATTAGGAACCAATTTACTTGCCACATAGTTTTACCTATGTTCAGCGTAGCCGTTCAATA 
TTTATCACCAATAGGGTCCCTCTCATCAGTCGGATGGTGCATTCCCTTGTCGATCAGATCAGCGTGCATCTAGATATGTACCTATACGTTTTACC 
AAATTTTTTAATTAGTCGTGCACCAGGCGGAATCGCGTACTTTTAAGCAGCATATTAAATTCGTATTTCGTACTCGTAATTCGTAATTTGTATTT 
CAATGGTCATGTAAAAGCGATGCGATAGATATTAGCGCATTCTCCGTTCATCTGGTTCGATTTCGACTGCGGTATTAGGAGCAAAAGATCTTCGG 
TTAACTGCCACATGCCACTTTCGAACGTATCTGCATTTCCCCTATTCAAATTATAGCCGAGTGTTATAAGAAAGTATACAAATCTAATCCATAAA 
ATTGTAATAAAAATTAAACCAGAATAAAAACACCACACAATGAGGATGGGGTATCCGTTTATATTCAGTCGCGAAGTACAACAAATATTTTATTT 
AGTGTATGAAAAGTTAAATGTGGATACAAAGGAGGTGAGAATCTGAGTTTCGGTTGAACTATCCTGTGTGTTGATCCTTTTGTTTTTTCCTCTTC 
ATTTATTTTTCAATTTTCAATCAGCTTCTGTTCTTCTATTTTTTTTTTTTTTTTGTTTCTTATTTTAATTTAATTCAATTGCTGGGTGTGCCTCT 
GCAGGTATCTTACGCTACAAGGTGGATTTCCAGAGTATGCAGCTCGATGAATTGGACAATGATGGCTTCGATCTAGATGATTACTAGGAAATGTA 
T CT GT AG AAAT G AAT T T GC C AC T AC GT T C G AC C GC AAAC T G AAC TGC AACT AC AACT AT AT G AAT C C AGG AAAT C AAAT AG AAT ATT TAAAT AAT 
ATGAGAAAACACAAACCTTCTTCCATTTTTACGTTGTTGGCGAAAAAAGTAAAATTTTAAATAATGCATATAGATACAAACGATAAAATGTTAAA 
TCGAGGAAAACATATTAAAATGCAACCGTTTTACGTGAACAAATTTGCAATTTGAATATATTTTCGAGGGACTAGAGCTCTTGTTATATAATTAA 
TTGTTCCTCTATAAATCTAAATCTAAATCTACATTGACAATACGCCACGTTTCCTAAAGTTTTTGGGTACAAGGAGAATTGTATCTGTGAGTTTT 
AATCGTTCTATTATAGCTTGAAAACGAAATTTGCACCGAAAGCTGATAAAATTCCGACTCTGCTATCTGCTGGATAAGTAAAATGCTGGTAGCCA 
CTTGGAGTACTACCAGTACAGCAATGAAAGTAAAGTGAAATGGGCATTTCATTTTCTACTTTCTATTTGCAGGCACACTGGAAACAGTTTACTGC 
GAGATTCCCACTTCATTAGTTGGCAATGCACGTGGACGACAGCAAAAAGCACTCGTAGTACTTGTTTCGCCCTTGATTTGCATCTAAGAATCGCT 
CAGATACATCTGGTGAGCTGAAACTCGTTTCTGAGCCGCAGTGTCGTGCAGGTATCCGTACTCGTTTGAATATTTGTAAGTAATCATTGCAGCGA 
GACTTCCAAAAAGTAAGCTCAGCTAAGTTCTGCGGTGTTGGGCATCCAATTAGTTGACCAAGTTTTCTGGTGGCATGCACATGAAACTTGTTTAA 
TCGATAGGGT AAAT ATGCAAATTTCGATTGAATTTGAC GAT AAT GCGTCGAGATGAAAAGAAAACGAATTGTGCAGGATTTTGGTTACGAGATAA 
GCCGAATCAGCTAATAAGGCACGTGCTTAATTGATATATATTGTATGTTTAAGTAACCTAAATATGCACGACTTTAATATCACTTTTATTTCGCT 
TTCGTAATAAAACGAGGTTCACTCTAAAGAGCACAAGCGTGGAGGTGTATGTGTGTTCTTACAGCCCCGATCATTTGTATTTCCATTTGAGTGCG 
AGC GAAG AC G AGG GT T T AC AAC T T GAT GAG AAT T T GAT C AAGAT AAC ATT AAAAC AC AAT CTGC TGC AGTCGG AC GGGGCACTTCT GAT T 
{SEQ ID NO: 1177) 

Exon: 1001.. 1034 
Exon: 1555.. 1819 
Exon: 2918.. 3093 
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Exon: 3172. .3641 

Exon: 3707.. 3862 

Exon: 3937.. 4134 

Exon: 4196.. 4343 

Exon: 4512. .4589 

Exon: 5420.. 5740 

Start ATG: 1734 

Transcript No. : CT36935 

ATGAGAAAAATTTATGCAGATGTATTCGCCAATTTGTATTCATCGCCAGCCACGTTAAAAGGAAGAACGTGTATGTGAATACGACCATCAGAGGA 
TCTTAGCGAAGGATTCAGGAGCCCAAATTCTTAATCCCCATCCCACACACACACATATACTCGCACGTAGGCACGCTCTCTGTGAGAGGAAGGGT 
AAATAAAAGAAGCAACGAGCAAAATGTCTGGCGAGGAAACGTCTGCCGATCGCAATGTCGAGATCTGGAAAATCAAGAAGCTCATCAAGAGCCTG 
GAAATGGCCCGCGGCAATGGAACCAGCATGATTTCTTTGATTATTCCGCCAAAGGATCAAATCTCGCGCGTCAGCAAGATGTTGGCCGATGAGTT 
TGGAACGGCGTCGAACATCAAGTCGCGTGTAAATCGGTTGTCCGTCCTCGGTGCCATTACGTCGGTACAGCACAGACTCAAATTATACACCAAAG 
TGCCTCCCAACGGTTTGGTCATCTACTGCGGCACAATAGTCACAGAGGAGGGCAAGGAGAAGAAGGTGAACATAGACTTTGAGCCATTCAAGCCC 
ATAAACACGTCGCTCTACCTCTGCGACAACAAGTTCCACACGGAGGCCCTCACTGCCCTGCTCGCCGACGACAACAAATTTGGATTCATCGTGAT 
GGATGGTAACGGAGCGCTATTCGGTACCCTTCAGGGCAACACGCGCGAGGTGCTCCACAAGTTCACCGTCGATCTGCCGAAGAAGCACGGTCGTG 
GTGGTCAGTCCGCCCTTCGTTTCGCCCGTCTGCGTATGGAGAAGCGCCACAACTACGTGCGGAAGGTCGCCGAGGTGGCCACCCAGCTCTTCATC 
ACGAACGACAAGCCCAACATTGCCGGACTCATCCTGGCTGGTAGTGCGGATTTCAAGACTGAGCTTAGTCAGTCTGATATGTTCGATCCTCGTTT 
GCAATCAAAAGTCATCAAGCTGGTGGACGTGTCGTATGGTGGGGAAAACGGTTTTAACCAGGCCATTGAACTGGCGGCCGAATCATTGCAGAACG 
TTAAATTCATACAGGAGAAGAAACTCATTGGTCGCTACTTTGATGAAATTTCTCAGGATACTGGCAAATACTGTTTTGGAGTGGAGGACACTTTG 
CGGGCACTGGAACTTGGCTCTGTAGAGACTCTCATTTGTTGGGAGAACCTGGATATTCAACGTTATGTTCTCAAGAATCATGCCAACTCGACGTC 
AACGACAGTATTACATTTGACGCCCGAGCAGGAAAAGGACAAGTCGCACTTCACTGACAAGGAGAGCGGGGTAGAAATGGAGCTGATTGAGTCTC 
AGCCGCTGCTGGAATGGCTGGCAAACAACTACAAAATGTTCGGCGCCACACTGGAGATTATCACGGATAAGTCCCAGGAAGGAAGTCAGTTCGTG 
CGAGGTTTCGGTGGAATCGGCGGCATTTTGCGCTATAAAGTAGATTTCCAATCCCTACAGGCCGATGAGCCGCTGGAGGATGTCGATCTCGATGA 
CTATTGTATCTTACGCTACAAGGTGGATTTCCAGAGTATGCAGCTCGATGAATTGGACAATGATGGCTTCGATCTAGATGATTACTAGGAAATGT 
ATCTGTAGAAATGAATTTGCCACTACGTTCGACCGCAAACTGAACTGCAACTACAACTATATGAATCCAGGAAATCAAATAGAATATTTAAATAA 
TATGAGAAAACACAAACCTTCTTCCATTTTTACGTTGTTGGCGAAAAAAGTAAAATTTTAAATAATGCATATAGATACAAACGATAAAATGTTAA 
AT CG AGG AAAAC AT AT T AAAAT GC AACCGTT T T AC GT G AAC 
(SEQ ID NO: 1178) 

Start ATG: 214 

MSGEETSADRNVEIWKIKKLIKSLEl^RGNGTSMISLIIPPKDQISRVSKMLADEFGTASNIKSRVNRLSVLGAITSVQHRLKLYTKVPPNGLVI 
YCGTIVTEEGKEKKVHIDFEPFKPIHTSLYLCDNKFHTEALTALLADDNKFGFIVMDGNGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRF 
ARLRMEKRHNYVRKVAEVATQLFITNDKPNIAGLILAGSADFKTELSQSDMFDPRLQSKVIKLVDVSYGGEKGFNQAIELAAESLQNVKFIQEKK 
LIGRYFDEISQDTGKYCFGVEDTLRALELGSVETLICWENLDIQRYVLKNHANSTSTTVLHLTPEQEKDKSHFTDKESGVEMELIESQPLLEWLA 
WNYKMFGATLEIITDKSQEGSQFVRGFGGIGGILRYKVDFQSLQADEPLEDVDLDDYCILRYKVDFQSMQLDELDNDGFDLDDY* 
{ SEQ ID NO: 1179) 



Celera Sequence No. : 142000013384822 

TTCCCAATTCCTTCGAGGTGATGGCACCTTCGTTATCCTTATCCAATAAGCTATAGATATTCTTGAGCAGGTCCTGCTCTTCAACAGACAATTCA 
TCCATTATGCTTTATATTATGGGAAGGAGAATATTATTATTATGCTAGTAGATATACTTTTTCGATAGGATTGATACTCAGTTGAGGTCTGATAA 
GCTTTGAAGGAGTTCTACGATTGAAAATTGTTATTCAAAACATTCAATATAAATATTTTTGAAAAAAGCCAACTAAGAGTTAGCCTTCGGTCTTT 
AACAGATATTAAACAATCTGCGTGCTCCATGAAGTTCATAAACTTTTACTGAAGATAAAGGCATTAAAAGAGCATTCTCACGCACAGTTTGGCAA 
ACAAATCACGTGCTTTCCTGGACCCAAATGATATCATCTGGCGGAGAAACTCTTTGGCTTTACGCGTATTATTTTATAGTTGAACCAGAAATTGA 
TGTCATCGGTGGCGGTAAAAAATGGGCGCCCAATCCGCCCAATGAGGAAATTCCAGGAAGCCGGGCAACTCCAACTGTCGGTGATGACAATTTGC 
GTGGCATTTTAATTGAACTTTCTACGTGTTCGAGTGGTGTACTTTGATTTGATCCCCATCCACCGATTTTTCGTCGATTTGAGTCAATGTTAAAG 
GCGTTGCCACAAACCGCAAACTGTCGTGTCTAGGAGGTTGGGGAGGGGCTTGGTGGCTGGGGCCTCGAAACGTGCCTGAAGTATTATTACCGCCG 
TCTGTGATTGATGGCCTCATGCGAGTGAGTCAGTGGATTTTTATAACTCTACGCAGATCCCTTCCTTTCAATCAAGCGGCAAAAAATCTGGACAT 
CGATCTCGCCGATATTAGCTAACTTCATTTCAACTTAACTGCCCGGAGTGCTTTGAATAAAGTGGATTTGAGGCGATGCCTTGGGATTCTTGGCC 
AGATTAAGTCGATTTTTTTTTTTTACTTCAATGAAAAAAATGGCACGCAATTGTTTTTCATTATTAATGCATACTTTATTTGAAATCAACTAACT 
CGCTATGGTTTGAATAAAATCATTTAAAATTGAAGGAAGACCATGGAATTCTATGGGTTGCTTATATTAACTACTGCAAAACTCTTAAAATTTAA 
GCCTAATTACATAAGGTTCTACTGGAATGTTGCTTGTAGTTGGATTTTTGTTTATGCTCTGCTTTACACTAACTTTTACTTTCAACCATTTTTTT 
CCCTACAGCAAAATCGATTAACGATTTTGTTTACTCATTAAGATTAGCTTACCGGATAAGGAAACATAAATAAAAAATTGGAATTCACTCAGCAA 
ACAGCAACACTTAGTGTTGGGCCACACATTAAAAACTAATCGATAGACTCTCGTACTAAAGGCTTAAACATTTAAATCGGTAGATTTAAGCTAAA 
ATCCTAACGCATAAGAGCTCTATAAAATGCAACAACGGAGTTTGTTTCGCTTACAATTATCGCTTAACTGATTTTATTACTTAGTTTAGGTCTAT 
TGCCTTGCCTAGACATTGTACTAAAAAGTATTAAATTGTAGATCTAAATTTATTTTAATGTACTTGTGGGTCTAAAACGATAAATCCTCGTTTAA 
CCATTGGGAACAATAACTGTTTGTCATTATGATATTCAACTGTTCATAAGACCTTCTAAATGAAATACCAGAAAAAACAACCGAGATATATATTT 
ATTAGGAATACTACTACTTTTTGATATTTCAAATTCGTAAGAGCTTTGATAAATAGTTTTCCAAATTGTGTTGTTTTTCTATTCCGATTTTACCT 
CCATTAATGGATTGTTTTCTAGTTCTTTCTCGGTTTACTTCTAACGCCTCATTATGTCCTCAATGCTGAACCCTGTTCTTCTCGGTGGTGCAGGC 
GGTGGTGCTATGGGCGGTGCTGCCACTGGTGGTCCCTTTTCGGTGGTGGCTCCACTTGTTTCCGTACTCGGTGGTTTTCCCAGAAGCTGTCTGGG 
GGGATGTCGCGACATATTCAAAGCATATATGCTGGGACTATCCACAAAGGTCCTTTGACCCAATCCAGGATGGTGGTGCGGATCCATCAAGTCCA 
AGGTCGGTGCCGGAAGAATCCTCTTTCGCATGGGTATCGATCCCAGGAAGCTGGGCATAGCTTCATGCGGTGAATGCAAAGGTTGATTGGGTGGT 
TTACCCGACTCCGGCTGGGTTATTATAGTGGGTGGTGTATGTGGTGGCGGTGGAACACCAGCCTTCGAAAGGTTCAAGCTCTTGTGGTGCAAGTC 
GCGTCGAACGTGCAGAATGGGCAAATAGGGATCCCCGTTGGGTGGGAGTAAGTGTGGATGTCCTGGGTGCTGGTGGGTTGGCGGAGGAGCTCCAG 
GTGGTGCAGGCGGCGGTCCAGCTGCCCCAGGTGGTACAGAATATCGCACAAAGGAAGGTGGAACGGGTAGGCTGGCAATGCCAGAGTTCCCGGGT 
TCCTCCTGATGTTTCTCGCCATGTGGCCGCATTGTGTAGGTTTCACCACCCTCATGATGACAGTGGGTGACTCCCTGGCGTTCCGGCTTGGTGCC 
CACTGACGACGATGATTGCGATGTGGCGGCAGCCTTACTCGCCTGGAAACGCGCCACTAGTGCCACTTCTGCCTGCTCCTCGTCCTCGGGATCAA 
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AGTCGTCGTCCTCTTCCCTGGGCAAATCGTGTACTCTTGGCAGATCCTGCAGTTCCTCTTCCTCGTCGAAGTCATCGAACTCCTCTTCGCTTTCC 
TCATGATCGATGGCTGCCTTGCGCTTGAGTCTCATTCTTTCGGACTCAGATTTCTCACCCGATCCTCCAGACTCGCCCAAATTGTGAACTGGCGA 
CTGGCGAGGCGAGTCCACTTCCAGCAACGAATCCTCCTCGTCGCCACTTAAATTTCTGGCCTCATCCTCTTCGCCCACATCCACATAGTCCCCGG 
AGTTGACCTCACCCACTGCATGGGTCAAGGCATGGCGTCGTAGATCGCAGTTACGGCGGAAAACTTTGCCGCATGAAGAGCACTCGTAGGGCTTG 
TGATCCGTGTGGGTGAGCAGATGGGTCTTCAGGTTGGAGCGCTGATTGAATGATCGACTGCAGACGGGGCACTTGTGGGGTGATTCCTCCATGTG 
CAGGATCTTGTGGACAGCCAAGGTTCTCGATTGGCAGAATCCCTTGCCGCATTCCGTGCATTTGAAGGGTTTCTCCTTGGAGTGAATATACCTGG 
GCGAATACAAGTGGAGAAACAATTAGACTTTATTGTTATAGCGTGGGCGTTTTCCTGGTAGCTGCAATTGGTAGCCCAGGTTGACAGTTATCTCT 
GCTCAAATGCAGCCTTAACTTGAACCGATAAAGCTCACTGATATACAGCTTTACAATACTCTAACTAAGGATATAACTGGCTAGAGCAGACAATA 
GAATAAATAATATAATACTATAGCATATAATACCTATCTTTCTCTTTCAAAATGAAACCTAACCTGGATTTGTTTTAACCTTAAACCCAATTCTT 
TAAAGTTTTTCATTAAAGAAATCCTATGATTTTGTATCCTCTTAACTGATAGGGTATTTAACTTTCGTGTATGCGATCTGGCTGCATTAATTTTT 
TATGTTTATGGCGGCCATCTTGGCAATGGCGTCCACTTGCTGTTGGATAGCAGCAGCCACAGTTAAAAGTTAAAGTCCCCATCTTCGACGGAGCC 
ACAGATTCAGCATCGCATCGGAGGCAGCAGCTGCTGGTGGAAAAAAAAAAATAAATGTATAAAAACCCAAAACAGGTTCTGCCTCCGAAGCACCG 
AGATTCTTCTCCCATGCTCCGTGCGTTTCTGGTAAAAAAAAAATACAAAAAACTTAAGAAGAATTAGCCCCGCGGCAAAGCAAACAAACGACAAA 
TGGACCGATACCGAACGATCAAAAGACAGAACGCTCGGACAGGCCGACACATTGAAGCCGGGATGCTGGACCTGCTGCCTGGCATCGGCATGGGC 
ATCGGCACCGGGATACACAGACCCGATATATAGCCCCCAACCCCTTGGCCATCCCTCAATTTTAATGCTGGTGGAGGAGCCCCAAAGGAAAACCA 
ACAGAGTTAATTAACTTTGTGTTGTACTCTGCGGCCGAGTTGAAGTTTTACTGGCCGTCAGAGGAGCTCTGATTTTGGGTCCGCTTTATGGGGAC 
TTACTTATTTGAGTTGAGGTAGGAAAATTCGTCGTAAAGTTTAAACATTTCCTGGAAAATTCCTTATGGTAAAAGGGGTCTTTTGAAGCGATTGT 
CTGATATATCTAAAATCAATAAAACTCCATCTACGAGTAGTGGTCGCCTTTCCCTTTTGGCCGAGATTACTGTCACACCTGTTCAGAATGTGTGT 
ATAAACTGCCTAGTAAGCAATAAATTACATTGATCTAATTATCCTTTGCCTTTACAGGGTATTCCATTATTCGCCATTCGTTCACACTTTGAGCA 
CAAAGAATAACTCCAGGCCTGATGGCAATTACCCGAGCAGACACTGCCCCCTTTTGTGATTACTTGTACCCCAACCTCGACTCACCTGTGATCCC 
TCAGGTGATCCTGTCGCCGGAACGCCTTGCCGCAGATATCACAGGAATAGGGCCGCTCGTCCGTGTGTGTCCTCTCGTGGATGAGCAGATTGTAC 
GACTTGGTGAATTGTCGATTGCAGAACTTGCAGATGAACTGTTTTTTAGGCCGCGATGCCCGACCAGGAGCACGTAGCAGGCGCCTATCGAGTCC 
GGCATGCATTCCAGGACCCAGGCCAGGTGGAAATAGAGCTGCGGCGGCGGCTCCTGGTGGAAATGGAAAACTTCCGGCTCCCGTGGGCGATGGAC 
CCGGTCCGGTGAGATATGGCGGAGGTGGCACGCCAGCTGGTCCGTACAACACTCCAGGCGGTGGCACACCAGGACCGCCGGGTCCTCCAGGACCT 
CGTCCTAACATTGCCGCCGCCGCAGCCGCCTCCCTTAGGGCCAACTCCTTTCGCTTGTCCGCCATCATGGCAATGATTTCGTAATTGGAATTACT 
ACCCGGGCGCACCTTTCCGGTGGGCAGAACCGGCACAAAGGCAGAGATCTCGCGATGATGCGATGGTGGTGGCGGTGGCGGCGGACCATGCAGTC 
CGGCCAAATGCTGCTGATAGGCCAGCTGTGTTTCATACGCTGAGATCCGGCTGGCCGATGAGTTGCGCGAATCGCCGTTGCCATTCTGGGAGCCA 
CCATCCGATCCTCCGGATCCGCCGTTCCTATCCGCCGCACCAGGAAGACCGCCACCACCCACCGCCGCCGCTGCCGCCGCCATCGAGTTCATGAA 
CTGATCCATCCGGGAAGGTCGCAGCATGGGTATCGCACCGCCTCCTCCGCCGGAGATCTCGCTGGATGAGCTTTCCGTGGGCATTATATTGGAGG 
GGCTCTTCACATTCCGTTGGAACTGCCGGAGATAGGGATAGAACAGAAAAAGCCGAAAATAAAATAAGGGTAGTGGTAAATAACAATTAGTACAC 
GATTGTGCTGGGTTAATTGGGCTCTCGTTTTTTTTTCGAGGCTTATCTTAATTGCGAACCGAGTGCAATAAACAGATATTTCCACGTTGTATTCT 
TTATTGCTCCTAGGGTAGTACTGGGTATGATAAAAACTGACGCAAACTGGTTTAAAATAGATATATTACTATTAACTTAGTTTCTTCTTGGTGGG 
AAGTTTCCTTGTAATTGAAATATTCATGATCTTGGTTTGGCAAGGCACAGGAATATATACAATATATACTATAGAATATTTACTCTTTTTATCTC 
AACCGGTTGTAACTCACTTTTTAGATGTGAAATTCTTCAAATTTGCTTTTTGCACTTCTTATCACAAAATTAAACAGCTTTATGGGCAATTAACT 
TGCAAATAGCCAAGGATTATCACATTAGCTGATCTTTCAGAATTTATCCACGTGAACGTGATTTAGTAGTCACAAAAAAAAAAATAATGAAGAAA 
ATATTTTTCTCGAAAATAACCCATACGATTCTGAGAACGTTTTGGTATCACAGTACGTATATTTTCACAGGGTAACTAAGCAGTTTATCACATTA 
TGGCCCTCGTCTATGGGTTTGTTGCTGTGTTTTTTCCATAGGAATTTTATTATTATTGTTGGTTTATTATTATCAGCGGCGGACTGCCGACGACG 
GCGAGTGAAACTAAACTGAAAACGAGTCGCAGTTTGGCACCGCGATCGGTTGCTTCCAGTGCGCAGGTTAGATAAGATTGTGCGTCTAAAAAAAC 
CTGCCTCCGAACCTCTGAGACTGAGTAAGCAGCACAGTAGTAACGATATAAACACACACACACACACACAGTTATATTATATATGTGCCTGTGAT 
ATATGGCATATCGGCAGATACCCCGGCACACTCAGATACAGATACAAAGATACAGATACAGAGGCAGGCAGCCAAGTTGCCGTCGGTTTTGGTAA 
ATGACTCGAACCGAACTCGACCGACTGCTTACTGGATTTACGCAACAGGAGCCTCGACTTCTGAGTGCCTCGAAAGAAGAAGGGGCAGAGGCAGG 
GGCAGGGGCAGGGGATCGGGTCCCGGAACGTGATCATCACGATAATAACAAAATAATGCACATAATAATGATCACAATGATGGCCGGACCATAAA 
GTTTGTGGCCCCATATTGCAAACGCTTCTGGCTGCCATCTTCCCAACGCCTTGCACAGCAAAAAAAATGGCACCAATTTTGTACCCAATTCTATG 
GAATTCTATTTTAAGATCTATCTTTTACGATAAGATTATATTTGTTATAATATTAAAGAATTTATAGTAATATGCTTTAAAATTTGTGGCTCGCA 
AAGAGTGCGCCCCAGAGAGCAAAGTTTCCTCAGTAAGTCAACCTTTTTTTTCGGTGTACTATCCTCATTTAAATGTACAACCAGTTGAAAAAAAA 
AATGCTTTAAAAATAACCCAGCTGTCGGTGCAATGACAGTTTTGCTTATCATTTTGCTAGAATCGTTCTCGCTGTCTCGGTTGATTTTAATGCAA 
AATGCTGCAACGTCATTAAGACCAACATTAACATTCGGCTAACAACAAGTGAAGTGATCGTCATAGTTGGTTAAAAATGAGCTTAGCCAAGTTTT 
TTGGAGAGGCGCCCATTGTTGAAGTTACCGTTCTTTCCGTACGGATGACTAGTCATTGTGGTGGTCTCTTATCTATGGAAATTATTTTGGCATTG 
TCTGGCTGGAGAGTGCGCTGTCAGATTAGCTCTATAAAGTAGCTGAAATCATGACTTGATCATGAGTGTATCATTGTTATGGGGGAATGACTCTA 
CGGACCACGTATAGTTCAAGTGGATGAGATATAAGCAGGTCCCAATTATTAATTCAAATAGCTGTTAGATGTGGGTAGGTCTTATCGTGCTATCA 
TATCAGCGAGTCTTTAGTAGACAACGGAGCACGCAGTTAAGATATTAGGCAAATAGATGATCTAATTGTAATATTTACACTAGAAATGCTTACGA 
AAGCCATATCGAAGCGTTTTTGTTTCCCTAATACGACTAGCACATGCCAAACTAATCTCCACGTTTATTAGATAGTAATTGGATAATTCTTTCAG 
TTCAATACCTTTTAACTCTCACCCATGCGAGTTCTCGTGCCCTCACTGATAATCTGTCGCAGTCTATCTATATTCGATTCAATAAAGAACTTATC 
TGATCTAATAAGTCCCCCAACTTAGGGAACCTTTGCTGTAGATGAGCGATGGCCCCGCCATGCCTCATTCGATTACGATCCCGATAATCCCATTC 
CAATTTGGGCTCACGTATTTCTGGTAGCCTCGAAAAAAAAAAAAAACTGAAGTGTAACCCGTTTTGGGTATTTATATATATTTTTTTCGATTTCA 
GTACTTTTTCTAGGGATTGTTTCCACACTTTTGCATTTACGACCGCTTTGGACCCACGTTTCACCCCATACCCTGTGCACATACAATGTACAATT 
CAGCGTTCATTATCTTTTTTGAAACATCGGCACTTTTGAAAACGTCAAGCCAAATAAGAAAAACAAATAAAATTCATGTAAATAATGGACATCAT 
CAGCAAAAGCGACGGCACGTAGACTGCCGTAGAAATATTGAGAGAGAGTTGTGCGCACAGATACAGATACAGATGCGGAGATACAAAGATACTCG 
TACTCAAAGATCCACAGATACGTTTGCGCAAACGGTTGCCCAGTAAATGAAGCCAAAGGCTTAAAGTTTCCAATATGTGTGCGTCTGAAGGTGCA 
CTGGAAAAATTATTGTGCACTCTTGTGAATTATTTAATATTAAAAAAAAGAGTATATCAAATAATTCTGAATATATAGGGAAATGAACAAAAAAT 
AATTAGTACGTACTTrTTTATGTAATATTGAAATGTGCATATTTAATTAGAGAGGTAATATTAATATCTTTTCTTTCTCTGTATCGTACATATAA 
TAAAAAGAGCCAGAAGACCAACTACTGCTGGGAGGAAAATAAAAGATATCGAGGAGCAATTTGGCTGCCACTTTTTTCCCTCCTCTTCTTTTTGA 
GCCGTCTTGCTTCTTTTCCTTGGATTTCTGGGTCCGGGATCGGTTCGTTTGGTGGGTTTTTGTGTGTAAACAACTGAGGCCAGCAGAAACCCAGG 
GCCCAGCAGAGTGAAAAGGTTTTAGTTGATAAAAAAAAAAAACAGAAAAATATACAAACACACACAGACACACACTGAGCAAACGGATTTAGTAT 
TTGTATAGAAAACGCAGAGAGCTGAGGAGGCGAGTTTGTCCCTCTGATGATGGCCTACGATGGCTGGAAATGTCCATGCGTCTTACTGCCCAGTG 
ATTATGTATTGAAAGGGGTATATCAGGATTTCCCGACCAAGCCATTATATAACCTTTAATATCTTAGAAGGATCTGTGTATTTACAGAAAACAGA 
ATGTTGCTCAATGACAAGAAAAAAATAATTTTGAATTAAATTATCATTTAACGTTATATGAATCAGTAAGATCAATCCCATGTTACCTACTAACC 
GGAGCTCAACTATTTTAAAGGTTCTTAGCAATGGTTTTTGAAATAGGGACCGTTAGCTAGAAATACCTACCACAATTTTAAGGCCGAGGTACACC 
CGCGGTCTATCAGTAAACCTTTTATTTCGGACCTTATCTTCTGGGCCGAGTGCTTACCGGGCCCCCATATGCACTTCAGTTTGCTTACAAAACAC 
CGGCCAACAACAACAAACAAGCCAGAAGATGACGACGAAGAAAAGTGGAGTGGTGGGAGGAGTTCCCCGGAGGAGGAGAATGGGTATGGGTATGG 
GTATGGGTATTACGGGTGAAGGAATGGAAAAAGGGTCGCCGAGATCCGCTGTGCAGTGCGGAATGCTCTTGGCCAGGCTGTGTGTTTTAATCCCT 
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TCGGGATACTTTTTCTTCCACCGAGTGTGCATATGTTTGTGTGAAGGTAGCTAATAATCAAGACAGCGATGGAGAGGAGGAAAAACCGACGCAGA 

CTTATAATTATAGTCGAAGAGGAGAACCAAAGAGACCTGCACTTCACATGCGTTAATCTGAAAACCCTTTCGGGCCCAAGCAGCATCTGAGATTA 

TAGTCAAAGCTATTAAAAATCTAGCATAATCATTGCCCTCGGTGGAAAGCAAGTTGCGTTGTTGTTTTTCTCTTTTTTTTTTTTTTTTGCTCCCG 

AGGTAGGCAGGCGATCTTCCTGCGCAGCCTTGTCTATAATTTTCTTGATTAATATCCATGAGATTTACACATTTGGAGTGCTGGCCAGAACCCTG 

CACCTCAACCCTCCGCCACCCCCCGCGTTCCTGTGTGTGCTGGCCATTAATTTAAAAAGCTTTGCTAATTTCGCGGGGAAATCTGGAAACGAACA 

ACTGCTAATGAAGTTTTATGGGTTCGGGAATGCAGCTGCAGCAGGATTGATAATGTCGTTAGCATAAATTGAAAATGCTTTCAAGTTACGACCAA 

AGATTTGTTATGATTAAAGTATGGAGCACGCGGAGTGAGCAAGAATATCCACCAATGGATTTTTATTAAATGATAAATGGCGAGCTGCAGATAAC 

CAGATTCCTCTGGTTGCTGAAATAAATATTTTTTTTTGAATTGAAACCTTCTTAGGATTAACTGAAGTTCAGACAGTTAGTTCCTAAAATTGAAT 

TATTTCGGCTTAACTTTGACTCTTTCTAATCCCTCTTGAAGTAATCCACTTTGAATGCTAAGCCGATCTTCGTATAAATTCTATTGTCATATTGT 

GCTGCTCCTTTCATTCGAACTAAATCCATTCAAAAATTTCACCCTTGAATAGAACTCTTAAGATCTTTTGTTGGTCGGATTGCTACTTTTTCAGG 

CAATCTATTGTGTTATAATTTCAATGGGTCAATTCTTATTTATTGGAAGAGTTTATGCTAATAAGTGTAGAAGCACTAAGACTATGATTTATTAA 

TTCAAATGCAATTGGATTATGCTAATTGCACACAAATCTTGGTACAAATTATTTATATATTTTTTTCTTCGACAACTTTTGGAACCCAGCCCAAA 

ACTAAACATGTAAACTGATAATCACCTTTCCGCTGTCTGATTTCATTGATAGCATTAAAAATTCATATTCCCTAGACAACATCTAATTTCCCCAT 

CAAAATTGATGAGTTCCCGTTCCAGAACAACAATAAATGAACTGGTCTATCAGGTGAATTATACACACGTGCCAAAACGAGGCGTGTCCATATAG 

CCATTTCAATTTAAAACCCGAATATGGCGATCCCTAACGAGCCATTGTTCTGCGGTTTTATGGTAAATCTAAATTCATCAAGTATTTTGTCACGA 

TTTTAATTAATTTCGATTTCAATTTTCCTGCCACAAGGCAATTACCAGCAACATTTTCCCCTGCTGCTGGTGCAACCGAAAAATGATAGTTGGTG 

GCCCAAGATGGAGAGGAAAACCCACCCAAAGAGGTAAAGCGTAATAAACTTTTATGGTACTCCCTTTCTTATGCTAAAACTCTTTTTGGACACAA 

AACCAAATGGCCCATAAAGCAAGACAATGAATATCCAAAATAGATCTCCACTGAGGCGAAACAAAAGCCAATAACTTTATAGAACCGCAACCAAA 

AAATACAAAAAAAAAATAATAATAACTCAGACTGTGAGAGCGCGGGAGGCGTTCTACAAAAAAAAAGCTTAAAAATGAAGGAAAAAGTAAAGCAT 

TTAAAAGAAATCAAGAGATGTAGATGACGATGATGATGATGGTCTGAGGTTGAGGCTGAATCCGAATCCGAATCGGTATCTGAACCCAACAGAAG 

CTTGCGGCAATTGCGTTTCCTCTTTGGCCACACGTTCTGGACAATTAAAGCCCCAGCCGAGCTCCGAATCTGAATCGAGCCAAGTTAAGACAGCG 

ACCACGACCGAAAACCGAAAACAGAAAACCAAAAAAACCGGATTCCAAGCCGAGAAGGAGCGAAAAAGACGGCGAGAGCGACCACTGGCCACTGG 

CACTGGCTACAAGTGAACTGGTTTACAAAGAAAAGAAACAGAAATGAAAGAAACATGTGAAAAATCACGATCTATAAAGCCGATGCACTTAATAC 

CCTAGGCAAAATAAGTAAATAATACTATAGATCAAACTAGAGTAGACTTGTTGAAAAAAAAACACTGGAATAAATAAGAAGCTTATAAACTATCA 

GCTTATTAAACTAAGTTTGCCTAAATTTCGTAAAATATATTTTTAAAACCTAACGCGATGCTTTGAAATTCTGTAAAATTAAAGTTTTTCAGAAA 

TGTAACGTTATTAACACCGATTAATATTTTTAAATAATATTTTACAAATTGTGTACACCATTTTCAAACGAAGTACCCTTACCAAGAGCACCCAT 

AAGAGACCCCCAGAATTCCAAGCGAAGGTTTTTCGAGTCCGGCTCGAACCGACCAACTCGAACTGAGCGCCACAGCAACAACAATGGGGCAAGCG 

GGTGGTTCAGGATGGTGGGGAAGTTTGCGGGGGGATGGGGGACGTACTCCACCGTGGACTGTGCGGCACCACACTGAGTTCAGTGGCAATGACAG 

TGGCACATGCAGTTGTAGTGGAAGCCGCTTGTCGGTCGACAGTCCGCCAACCGCCAACGTTGTTGCGCTCTCTTTCCTGTTTTTTCGCCCCGTCT 

CATTCTATTTGTGGAGCAGGTTTTGAGATATATCTTGGGTCTTTCTCTCTCTCTCCTTCTCTGTCTGCGTACTCGTTATATTTTTGTACTCGCTG 

CCTTTTTGGCCAACGCAAAAAATACCAAAATAAAAAAATAATAAACGTAATTAGCTAGCCCTACGCCACAAACAGAAAACAAAAAAAAAAACAAC 

AAAAAATAAAACAAGTAAAAATAAGTGGTGTGGAATTTTTGTGTTCATCTGGGAAGGCAGGGCCTGTTCTCGGCGGAATCTTACCTGTGGGTTAA 

ACAGAAGAAAGCCCCAGGCTCTTTGGGCTTTATCTTTGCGTCCTAAAGTATCCAAGTAGTTGTTTCAATATGATTTTGCTGGTTTTTTGTCAGTT 

TGTAAGCTTTGAAATGCTTGGCATTGTTTTTGGTTTTTAAAGTAGTAATTTATATCATTTAAGTGGCACTTGAATTGTATGTTGTTTTTTCTTGT 

TGTATTTGTTTGAAGACTTCTGTTAATACGATTCTGTGAATTTAAAAGTAAATAATTTTATTAAAATTTTCAAAATATTTGTCGGAAATTTTTTG 

TTATATTTTCAGGTTTGATTTTAATTTTAACAAGTATTTCACCGAATCATTCTCAAGTAAATGTATTTAAAAAAACACGCACACTAAGGGACAAT 

TTTGAGATTCAAATCACGGCACGATCGCCCCAGGGGGTGGGGCTGAGGGGGCGGCAGGGGGGCGGTTACGGGAGCAGCAGCAGCATCAACGGCAG 

TCAGCTACCACAAACCACGTAGTTTTTTTCCTTCCAAAATCCACGTACAACTCGCGCACAAACACACGATGTATTTGTATCTCACAGATATGAAA 

ACCGAACGAAGAAAAACCCGAACCGAAACCTCAACCTCAACGAACTGAAAACCGAACGAGAACGAGAGATACACGGCAGCTGAAATGTGTTTTTC 

TACATTTTGTTTGAAATAGGAGCTGAAGCCAGTGCAGCTACCAATTCGCTCCTCAAGTAGTCCCAAACTCTTCCCACACACAGAGTCAGTTGCAG 

TTGAAAAGCGCTTTTCTCGAGGCAACTGGCGATGCAACAAGTTCACTCACACACACACACACACTAGCGCTCTGGGCCCGCTCAATAAGATACAT 

CGGCGGCGAAAGATACAGATAGATATAGAGCGAGAGGCACTCACAATGCAGTTCACTTTTCCAACAGCTATTACCAGCGATCCGCGCGCAGGTGC 

GTTGGCCACGATCAACTCTCGAAGACTGAGCCACGAACCCAAGAAACAGGGCCCGAATGGAAAAAGAGAGAATCGAGAGAATCGCGGGCTGAGAA 

ATGCGTAGAAAGAGACAAGCGACGAGTAGCGAGCAGTGGCACTAAAACCAGCTTAGTGCACTGTGGAAAAAGTTTTAACAATTCTTAAATATCTG 

AAG AGT AAG GC T C T AAT T T T C T GT AAAT AAC AAC AGT AT AAAT C T A TC TGC T T AAAT AT AC T AGA T AAAT AAT GGAT GC AT T T AC AT AT T T T T C T 

GAATTAACTAGTAACTTATATTCAGTTTGAATATAGTACGACATTTCTTTCAGTGGAGGAGAGGAATTAAACCGAACTCAACCCAAACCCAACCG 

GCCGCCAGTCGCACGCTGCTAAATGGACGGATGGGCCCGTGGACGGACTTATATGGAGACTGGCACTGGCGGAGCGTGGAACGTGCATTCGTACG 

ACGAGTTATTGTCAGTTAGAGCGCTACCTGTTTACCGACCGACTACCGACTACCACCGACTGCTTTTTTTTTTTTGGCCCAGTCGAGAGGGTAGG 

GTACAATTGGCCCGGGTGTTGCCAGATTGCCGTTGCTGCTGCTGCCCATGGATCGTTATGGTGGCTTCACCCGAAAAGGAGGAGGAGGGTGTAAC 

TCTGGTAGTTCCCTGTTTATTGTTATGCCTTTTTAGTGGGGGT 

{SEQ ID NO; 1180) 

Exon: 13103.. 12813 
Exon: 12668.. 12435 
Exon: 5437.. 4551 
Exon: 3226. .1001 

Start ATG: 5404 (Reverse strand: CAT) 
Transcript No. : CT37221 

ACATTTCAGCTGCCGTGTATCTCTCGTTCTCGTTCGGTTTTCAGTTCGTTGAGGTTGAGGTTTCGGTTCGGGTTTTTCTTCGTTCGGTTTTCATA 
TCTGTGAGATACAAATACATCGTGTGTTTGTGCGCGAGTTGTACGTGGATTTTGGAAGGAAAAAAACTACGTGGTTTGTGGTAGCTGACTGCCGT 
TGATGCTGCTGCTGCTCCCGTAACCGCCCCCCTGCCGCCCCCTCAGCCCCACCCCCTGGGGCGATCGTGCCGTGATTTGAATCTCAAAATTGTCC 
CTTAGTAATCGTATTAACAGAAGTCTTCAAACAAATACAACAAGAAAAAACAACATACAATTCAAGTGCCACTTAAATGATATAAATTACTACTT 
TAAAAACCAAAAACAATGCCAAGCATTTCAAAGCTTACAAACTGACAAAAAACCAGCAAAATCATATTGAAACAACTACTTGGATACTTTAGGAC 
GCAAAGATAAAGCCCAAAGAGCCTGGGGCTTTCTTCTGTTTAACCCACAGTTCCAACGGAATGTGAAGAGCCCCTCCAATATAATGCCCACGGAA 
AGCTCATCCAGCGAGATCTCCGGCGGAGGAGGCGGTGCGATACCCATGCTGCGACCTTCCCGGATGGATCAGTTCATGAACTCGATGGCGGCGGC 
AGCGGCGGCGGTGGGTGGTGGCGGTCTTCCTGGTGCGGCGGATAGGAACGGCGGATCCGGAGGATCGGATGGTGGCTCCCAGAATGGCAACGGCG 
ATTCGCGCAACTCATCGGCCAGCCGGATCTCAGCGTATGAAACACAGCTGGCCTATCAGCAGCATTTGGCCGGACTGCATGGTCCGCCGCCACCG 
CCACCACCATCGCATCATCGCGAGATCTCTGCCTTTGTGCCGGTTCTGCCCACCGGAAAGGTGCGCCCGGGTAGTAATTCCAATTACGAAATCAT 
TGCCATGATGGCGGACAAGCGAAAGGAGTTGGCCCTAAGGGAGGCGGCTGCGGCGGCGGCAATGTTAGGACGAGGTCCTGGAGGACCCGGCGGTC 
CTGGTGTGCCACCGCCTGGAGTGTTGTACGGACCAGCTGGCGTGCCACCTCCGCCATATCTCACCGGACCGGGTCCATCGCCCACGGGAGCCGGA 
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AGTTTTCCATTTCCACCAGGAGCCGCCGCCGCAGCTCTATTTCCACCTGGCCTGGGTCCTGGAATGCATGCCGGACTCGATAGGCGCCTGCTACG 
TGCTCCTGGTCGGGCATCGCGGCCTAAAAAACAGTTCATCTGCAAGTTCTGCAATCGACAATTCACCAAGTCGTACAATCTGCTCATCCACGAGA 
GGACACACACGGACGAGCGGCCCTATTCCTGTGATATCTGCGGCAAGGCGTTCCGGCGACAGGATCACCTGAGGGATCACAGGTATATTCACTCC 
AAGGAGAAACCCTTCAAATGCACGGAATGCGGCAAGGGATTCTGCCAATCGAGAACCTTGGCTGTCCACAAGATCCTGCACATGGAGGAATCACC 
CCACAAGTGCCCCGTCTGCAGrCGATCATTCAATCAGCGCTCCAACCTGAAGACCCATCTGCTCACCCACACGGATCACAAGCCCTACGAGTGCT 
CTTCATGCGGCAAAGTTTTCCGCCGTAACTGCGATCTACGACGCCATGCCTTGACCCATGCAGTGGGTGAGGTCAACTCCGGGGACTATGTGGAT 
GTGGGCGAAGAGGATGAGGCCAGAAATTTAAGTGGCGACGAGGAGGATTCGTTGCTGGAAGTGGACTCGCCTCGCCAGTCGCCAGTTCACAATTT 
GGGCGAGTCTGGAGGATCGGGTGAGAAATCTGAGTCCGAAAGAATGAGACTCAAGCGCAAGGCAGCCATCGATCATGAGGAAAGCGAAGAGGAGT 
TCGATGACTTCGACGAGGAAGAGGAACTGCAGGATCTGCCAAGAGTACACGATTTGCCCAGGGAAGAGGACGACGACTTTGATCCCGAGGACGAG 
GAGCAGGCAGAAGTGGCACTAGTGGCGCGTTTCCAGGCGAGTAAGGCTGCCGCCACATCGCAATCATCGTCGTCAGTGGGCACCAAGCCGGAACG 
CCAGGGAGTCACCCACTGTCATCATGAGGGTGGTGAAACCTACACAATGCGGCCACATGGCGAGAAACATCAGGAGGAACCCGGGAACTCTGGCA 
TTGCCAGCCTACCCGTTCCACCTTCCTTTGTGCGATATTCTGTACCACCTGGGGCAGCTGGACCGCCGCCTGCACCACCTGGAGCTCCTCCGCCA 
ACCCACCAGCACCCAGGACATCCACACTTACTCCCACCCAACGGGGATCCCTATTTGCCCATTCTGCACGTTCGACGCGACTTGCACCACAAGAG 
CTTGAACCTTTCGAAGGCTGGTGTTCCACCGCCACCACATACACCACCCACTATAATAACCCAGCCGGAGTCGGGTAAACCACCCAATCAACCTT 
TGCATTCACCGCATGAAGCTATGCCCAGCTTCCTGGGATCGATACCCATGCGAAAGAGGATTCTTCCGGCACCGACCTTGGACTTGATGGATCCG 
CACCACCATCCTGGATTGGGTCAAAGGACCTTTGTGGATAGTCCCAGCATATATGCTTTGAATATGTCGCGACATCCCCCCAGACAGCTTCTGGG 
AAAACCACCGAGTACGGAAACAAGTGGAGCCACCACCGAAAAGGGACCACCAGTGGCAGCACCGCCCATAGCACCACCGCCTGCACCACCGAGAA 
GAACAGGGTTCAGCATTGAGGACATAATGAGGCGTTAGAAGTAAACCGAGAAAGAACTAGAAAACAATCCATTAATGGAGGTAAAATCGGAATAG 
AAAAAC AAC AC AAT T TG GAAAACT AT T T AT C AAAGC T CT T AC GAATT T G AAAT AT C AAAAAGT AGT AG T AT T CC T AAT AAAT AT AT ATC T CGG T T 
GTTTTTTCTGGTATTTCATTTAGAAGGTCTTATGAACAGTTGAATATCATAATGACAAACAGTTATTGTTCCCAATGGTTAAACGAGGATTTATC 
GTTTTAGACCCACAAGTACATTAAAATAAATTTAGATCTACAATTTAATACTTTTTAGTACAATGTCTAGGCAAGGCAATAGACCTAAACTAAGT 
AATAAAATCAGTTAAGCGATAATTGTAAGCGAAACAAACTCCGTTGTTGCATTTTATAGAGCTCTTATGCGTTAGGATTTTAGCTTAAATCTACC 
GATTTAAATGTTTAAGCCTTTAGTACGAGAGTCTATCGATTAGTTTTTAATGTGTGGCCCAACACTAAGTGTTGCTGTTTGCTGAGTGAATTCCA 
ATTTTTTATTTATGTTTCCTTATCCGGTAAGCTAATCTTAATGAGTAAACAAAATCGTTAATCGATTTTGCTGTAGGGAAAAAAATGGTTGAAAG 
TAAAAGTTAGTGTAAAGCAGAGCATAAACAAAAATCCAACTACAAGCAACATTCCAGTAGAACCTTATGTAATTAGGCTTAAATTTTAAGAGTTT 
TGCAGTAGTTAATATAAGCAACCCATAGAATTCCATGGTCTTCCTTCAATTTTAAATGATTTTATTCAAACCATAGCGAGTTAGTTGATTTCAAA 
T AAAGT AT GC ATT AAT AAT G AAAAAC AA 
(SEQ ID WO: 1181) 

Start ATG: 559 (Reverse strand: CAT) 

MPTE S S S S EI S GGGGGAI PMLRP S RMDQFMN SMAAAAAAVGGGGLPG AADRNGGSGGSDGGS QNGNGDSRN SSASRISAYETQLAYQQHLAGLHG 
PPPPPPPSHHREISAFVPVLPTGKVRPGSNSNYEIIAMMADKRKELALREAAAAAAMLGRGPGGPGGPGVPPPGVLYGPAGVPPPPYLTGPGPSP 
TGAGSFPFPPGAAAAALFPPGLGPGMHAGLDRRLLRAPGRASRPKKQFICKFCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHR 
YIHSKEKPFKCTECGKGFCQSRTLAVHKILHMEESPHKCPVCSRSFNQRSNLKTHLLTHTDHKPYECSSCGKVFRRNCDLRRHALTHAVGEVNSG 
DYVDVGEEDEARNLSGDEEDSLLEVDSPRQSPVHNLGESGGSGEKSESERMRLKRKAAIDHEESEEEFDDFDEEEELQDLPRVHDLPREEDDDFD 
PEDEEQAEVALVARFQASKAAATSQSSSSVGTKPERQGVTHCHHEGGETYTMRPHGEKHQEEPGNSGIASLPVPPSFVRYSVPPGAAGPPPAPPG 
APPPTHQHPGHPHLLPPNGDPYLPILHVRRDLHHKSLNLSKAGVPPPPHTPPTIITQPESGKPPNQPLHSPHEAMPSFLGSIPMRKRILPAPTLD 
LMDPHHHPGLGQRTFVDSPSIYALNMSRHPPRQLLGKPPSTETSGATTEKGPPVAAPPIAPPPAPPRRTGFSIEDIMRR* 
(SEQ ID NO: 1182) 

Name: brother of odd with entrails limited 
Classification: transcript ion_f actor 
Gene Symbol: bowl 
FlyBase ID: FBgnO004893 



Celera Sequence No. : 142000013384 668 

GGGCTATCCGGAGTTCGGACGCCTGCTCAATGAAACTGGTCGACCGATGGTATACTCCTGCAGTTGGCCCGCGTATCAGGAGGATGCAGGAGAAA 
TGCCCGATTATGAGTCACTCAAGCAGCACTGTAATCTGTGGCGCAACTGGGACGACATCGAAGACTCGCTCGAGTCCCTTATGCAGATCATTGAC 
TACTTTGCCAAGAATCAGGACAGGATTCAGCCGCATGGCGGACCAGGACATTGGAACGATCCAGATATGCTGCTGCTGGGAAACTACGGCTTGAG 
CTACGATCAAAGCAAGCTGCAGATGGCGATTTGGGCCATTATGGCAGCTCCTCTAATTATGTCCAATGATCTGGCTGCAGTGCGTCCCGAGATCA 
AGGCTATACTCCAGAATCGGTTGGTTAAGCTTCGATATCTTATCTTCTTGGACTTATTGGTAAACTTTTGTTATTTGCAGTGCGGTTATTGCTGT 
GGACCAGGATGAGCTGGGCATCCAGGGGCGCCGTGTTTTGTCCCGCAACCAAATCGAAGTCTGGAAGCGTCCCATTACGCCAGTAACCAAGAGTG 
GACATCACTCCTATGCTGTCGCCTTTGTCAGTCGCCGAGATGATGGCGCTCCCTACAGGATCCCCTTCACGGTCAAGGAGCTCGGATTGACGAAT 
CCCAAGGGTTATAATGTGCAGGATCTGTACGATGCCAGCAGCAAGTTGGGCGTCTTCCAGTCTGAGAGTCAGTTCATCACACGCGTCAATCCCAA 
TGGTAAAATCCGTTGCAAAACACTAACTAGCCCTGACTTACTTCATTTTCCCCTCCCCCAGGCGTAACTTTCTACAAGTTTACAGCGTTGTAAGC 
CCAAGATATGCAAATCAGCCGCTGCCATCAAATCGAATATGCTCTTATATACACACTATTTCCACATTGTCCATTGACGTTGCATCGATGATCTG 
TGTAATAAGTTAATTCCGACAATAAATACCAGGTAATCACTGAAGAGTTTGACTTTGTTGATTTTTATTGTTTATTAAACAATTGATTGTGTGCA 
TTACACACAGGTATAAGTATTGTACAATTATAGGCATAGATTGCAGTACCGACGCTAGTTAAGACTTGTTCCATGATATAAATAAGGTCTGACTG 
GATGAAAGAGTTAATTCGCAAGCTTGTATAAATAAGAAGTGTAGTACTATATACGCGACAAGGACCCGAATAAGAGCAAAAAGGATAGCGACCCC 
TCGGGATCATACCAGTTATAAATAGATACCATCTACAGCTACTGCTTACATGGGTTTGGTCGAGAGAGTTAGAGTTATTGTCAAGTTGTAGGCGA 
AACCGAAACCAAACCGAAGGAACCATAGCCCGGGGTTAGTGCACCCCATTGCTGATGGTCGGTGTGCGCTCGTAGGTGCGTCTATGCAGCGCATC 
CACGTCCTTCAGGTAGTAGGTGCCGGGAAACAGTGCGGAGATGCTGCCCGTTGGCGTGTAGGGCGCCGCATGGTTGTTCTTCTCCCGCACCTCCA 
TCAGCGCGGAGAACTGCTCGGGCGCAACCTTCTCCCGTGAATTCAGCAACGGCTGCACGTAGTCCAGCTGCGAGACGAACTTTTCGAAAGCAGCT 
GCATCCTGGGTCACGCTGATCGAGTACATGGAAGCAGCCAGTCCGGATCCGTAGGAGAACAAACCAATGCGCTTTCCCACCAGCTCCTGGGCTGG 
TCCGCTAATCAACAGAGACACCAAACCGGAGTACACCGAGGGTGTGTACATATTGCCCACTTGGTTGGCCAGCAGCAGCGATTTCTTGGTCTTGC 
TGGCAAAGATGTTGGCGGACTGCGTCATGAAGGCCTTCTCCACATCGCGATCAAAGTAGGTGCTCTCCAGGGTAGCGGTATTAAAACGTTCCAGG 
TCGGGAAACTGCTTCGTTCGCTCCTCCTCACTGCTCAGCAGGAAATCGTTGAAGCTCAGGCGACCCACAGATTTTTGCACCAACTTGCAGAACGG 
TGTGTGGAATAGGATAGCATCGAAGGTGCTCAGGCTGGCCGGCTGCTTGGAAGTATCCTTCTGCTGCTGGTCGAACTTCTTGCGATACAGCCTGT 
AGCAGGTGTCCAGAGCAGAAAGGTAGCACTGAATAGAGAGCTTTCCATCTACCGTTGGGTATTCGGAGCTGAGGTCCGGCTTGTAAAAGTCGTAG 



FIGURE SHEET 635 



GCGTGCTCCATGTGCGTTGCGCGTAGTCCACGATCGAGGATAAGCGGCGCATTTGGACCCACCAGCATAGCCACAGCACCCGCACCACCAGTTGG 
GCGCGCTGCACCCTTGGCGTACACGGCGATATCAGCACAAACGGCCAAAGCCAGGCGGCCGTCCCAGCTAGAAGATTCCACCCAATTGACGGCAT 
TGAAGAGCGCCGCCGTGCCGCCGTAACAGGCGTTCGTGGTGTCGATGCCCTCGATGTCCGTGTTCCCGCTCTCGGCGAACAGCTGCATCAGCACG 
GACTTCACCGACTTCGACTTGTCCACAATGGTCTCGGTGCCCACCTCCAGTCGTCCGATCTCCGAGTGCTTCACATGTTGCCGCTCCAGCAGGCG 
GCTCACAACCGTCAGGCAGAGAGAGTTGACATCCTCGCGGTCCGAGCAGAAGCCCATCTTGGCCTGGCCCAGGCCAATTGTGTACTTGCCCGCGG 
AGGCACCATCGAAGGTCTCCAGCTCCGTTTGGTCCACGTACTGGGAGGGGAAGAGAATCTCGATGGCGCGTATGCCAACGTTCTCGGGCCAGTGG 
GATGCCATGATTGTATCCGGTTATCTATTTAATCCTGAGCTCTGTAAGAGAGACAATTTGTATACTTTTAGTGGGAGCTTTGGCATGAGAGTAAA 
TATTTAATTCAATATCTTTGTTATTAACAACTATAACAATCACTATTTGCGTTTCGAAATTGCTTCATGATGTTGGGAGCTAAGGCCTTGTTTAA 
TTTAATTTAAGCAGAGTTCTCGATGTGGCCCGAATTCCACTGCCAATTGGCATTGGGAAACTTTCTCCCCGCGCAATCACGGCGTATTTATAATG 
CTTACAAATTGGACGAGGGCAGAGACAACCTGTTGCACGCTCGAGCTGCGTTGGACGTGTGTAAGCCAGTGACTTTGGCTGCGGCTGAAGAAGTC 
ATCAAAGTCTGCCAGCCAGAGGCCCCCGTCTTGGGGGCCAACTGATTTCCCTGGCTGAGAAACTGAGCGTGTAGAGAGAGAGAGAGAGAGGGAGA 
GAGAGCCGCCGGAGCGGAGAAAGTTTCCCCCGCTCAGGCGAGCTCCACAAAAGAGAACACTAAACCGCTGTGAAGTAAGCCAAAACACACGCATC 
GCAATAAATGTAAGCCAACAAGACAAGCCCAAAGAGCCAAAGGGACAGGGGCGTGGATGCTGTGGCTACACGCAGAGAATTTATGATTACATTTA 
TTTACAATAGATAATCATTTCTTTTAAACTTTTTTTGTCAGAGAACAAGCTACAAAAATGATTACTTTAACACAATAACTCTTTTTTCAAAACCG 
CTTCCCCTGAAACTTCTCTACATTTTCACACATTTTAATTTCAGTTAGGCGCAAATTATAATTAAATCAAAATGATGATCGAAATATTATGATTT 
CCTATCGAAATAGAATGGAATTTTGCTCGGTGGAGCATTGTGGGACAGGCTGGGAGCGTCTTCACCCATAAGTGGGTCATGCCCACCAAGCACTA 
AGCACCAACCACCACTCTCCCTCAATCACACGCGTCCATATTGTATAACCGATAAGAGGTGCGAGTGTGGCCCAAACCCGAAAAGCGGTTGACAT 
ATTTACCGTTACGTTTAGCACCTGCGACTACTGGCCTATTGACTATAGATTTTCACTTTTCGGCCCGGAACTTTGTGGATTTGTGAAAAAAGGCT 
GGGTAGCGCGAAGCTCACTCGATCGGATCTCTCCGCGGACTGACGGCACTGGAGGCGCTGCACTGGCATATGAATATGACCATGACGACGGAATT 

G GC AGT GGAG AAGC AG AC AC CT AGC C AAC G AAG C C AC C T AGC C AC C GAGC AAAC AC AG AGGC GAG AAT AGGC GC T GT AAGT C AAT GC C AATT G C T 
GAAAGCGCTCGGCGAACAAATAGCGATCGTGCATAATTACGGGAAATATTTATCGCTTAGATACGGCCAGCGAGCACGAAACACAATCGTCGGGG 
ACTAAGAAGCCGCCAGACTATATAGATATGCACATCGATAGTCGTGGCCTTTTGCGATGGGGTAAAGATCAAAGGCTGTCCATCGCTGATAAGGC 
CGAAACAAAGCGGCTTCCTTAAAAATGCCACCGAATAAGATGTGACACTCTGTCCATTTCAGTGACAAAAACTGGGAAGTGACTTCCAATAGCTT 
TTCTCCGATTAATTACTTAAGACGTGCCCTTAGGAATTATTAATAGAAAATTATTTATAATTTGAGTTCTACTAATTTTCCTACTGGTATAACTT 
AAATTGGATTATCATCTTTTCTGAACACTATCTTGAAACGTGATTTCAAATCACCTGAACACAATTGGTAATTGATAAAATATTCTACAGAAATC 
ACTTAAAGTAAGATAATAACAAGACTACTTTAAGTAGTTGTGATATTACTTGAGTCGTACTTAAGTTAATTAATACAGTTTAGTTAAAGTAAGTA 

AAAT T AAC AAAT AAG AG AAT ACT T T GTC C T AT T GT GC AAAAG AGT AT G AAT CAT TAT C C AAAGT CAT G AAAC T C T AAT TTC AAG C AAT T T AAG TG 
ATTTAAGTCATTAATTCCTCGAGTGCCTGCGTGGAATACAGCTCACCCGGTGTTCCTAAGCAGCCCGTGGATTTAAGAATGGAAATTGATGCGCT 
GTTTATGCGGAAAAGGAGTCATCGAAGGGAAGGGAACGGGGCACTGTTGTGGTTTTGTGGGCTTATTTTTAGCACTTCGGAATTGGGAAATATTT 
GAATAAACATTTCCTGGAACTGCTGCGCACTTGGCTAATTGATTGCTCTTGAGTCATTCTTTTCTTTTTCGGCTGCAGGAGGGTGCGGAGGTGGA 
GGAGGAGGAGGAGGAGGAGGTCAAAGGTCGGGATGTTCCCGATAGAAATACACGTGTATTTGAGAGCCGGGAGATAAGCATAGTTTAGGGGTCAG 

GAT TAT G AAAC T GAT CT AT G AGGT GAC AC AT AT C AC T GG AT GT TG AT AAT T T T T AAATCT AC AC T AT T AC AC AC GGC T GG AAAAAC AC T G T G AAA 
CAGCTTTTCCCAATTTACAATAACCAATTCTAATTGCACACATATTATGTTCCTCCGACTTCTGGTTACATCAGCGCCCAAACTGACCACCTGCC 
ACGCCCACAGCCCTCATCCAGCGCAGCACGAACAGTACTCGCACTCGCAATCAACTTACTTGCTAATATTAATGGTCCAGGGCGGGCGATATGTT 
TTAAACTAAACTAATCTGGAATAAAATTAAGCAATGAAACTGCGGCACCTTAGCCGCAGGCAGTGTTGGAGAGCGCGCGGCGCTGGAACAGCTGC 
TGGCGGAGTCAGCTGTTTGGCTCGCTGGCTAAGCAACACTGGCGCCATCACGTGAATCTCGTGGGCATTGTGAACTGTCACTTTTTTCGCCAGCG 
CTTTTCTCACCACACAATTTCGTTCGCATTTGTGGGTGGCGGCGGTCGATTTGCGCTGATTTTTTGGCAAAGAAAAACGCACTTGCTGTTTTCAA 
ATAGCACAAAAAGCACACCTGCTTCGTTTACAATCAACCATATTCGCTGAAAAGTAAGTGCCAACCGTAAGAAATAGTTAATTCGGCGGGTTTTC 
CACGTCGCCGTGAAAAGTGCAAAAGTTACCCCCCGCCGAAAAAAAAGACCGATTACTACATCGCAGTAGGTCTGCTGACCAGGAAAAGTGCGCAA 
ATCTCTTGTAGATTATTGAAGATAGTGAGAAAATTAGTTTTTGCAATGTGGCTATGATTTTTTGTCGAAGCAGTGTGACCCTGAAAAAAAGAGTT 
CGCCAGTTTGTTGCGTTTCGTTCGACGGCAGATTGCCGCGAGGTCTGCGGATGTATCTGGAAATATCCAGCTGATGATCGTGTGGGCTGGGGGCG 
GGGAGTCGGGAAAATGTGCGAAAAGTGTTTTGGTTTTTGTGCTTAAATGAAAAACCCGTGAGCAGCGGAAAATTATCGATCGAAAAACTGTCATG 
TGACTGTCAACGAACAGCTGACATTGCGAGAGCAACTCACCACAGCGAGTGGTGAAAAGAGCGCAATAGGCGCTTTGAAAATTGAAAGAGAACAC 
AAAATTCTCTTTCGCTCTTTGGGTAAACAAAAAAAGTTGGAAAATGTTGTTTAAAAATTGAAATACTTTTCGGTTATTTTGCCATAAGGCACGTA 
TTTCGCTGATTAATAGATCGTGTTGTGCGAAAACAAGCAAATACACTTTGATGCGCTGCACGAGGATTAATAAATAATCGTATTGCGTGCAACAA 
GGACCTTGATGAAAACCAAGCACATATTATACATGATATATATACTCTACGAAACTGTAGGC 
(SEQ ID NO: 1183) 

Exon: 5522.. 5380 
Exon: 2796.. 1001 

Start ATG: 27 63 (Reverse strand: CAT) 
CTGACTCCGCCAG^ 

AGTTTAGTTTAAAACATATCGCCCGCCCTGGACCATTAATATTAGCAAAGCTCAGGATTAAATAGATAACCGGATACAATCATGGCATCCCACTG 
GCCCGAGAACGTTGGCATACGCGCCATCGAGATTCTCTTCCCCTCCCAGTACGTGGACCAAACGGAGCTGGAGACCTTCGATGGTGCCTCCGCGG 
GCAAGTACACAATTGGCCTGGGCCAGGCCAAGATGGGCTTCTGCTCGGACCGCGAGGATGTCAACTCTCTCTGCCTGACGGTTGTGAGCCGCCTG 
CTGGAGCGGCAACATGTGAAGCACTCGGAGATCGGACGACTGGAGGTGGGCACCGAGACCATTGTGGACAAGTCGAAGTCGGTGAAGTCCGTGCT 
GATGCAGCTGTTCGCCGAGAGCGGGAACACGGACATCGAGGGCATCGACACCACGAACGCCTGTTACGGCGGCACGGCGGCGCTCTTCAATGCCG 
TCAATTGGGTGGAATCTTCTAGCTGGGACGGCCGCCTGGCTTTGGCCGTTTGTGCTGATATCGCCGTGTACGCCAAGGGTGCAGCGCGCCCAACT 
GGTGGTGCGGGTGCTGTGGCTATGCTGGTGGGTCCAAATGCGCCGCTTATCCTCGATCGTGGACTACGCGCAACGCACATGGAGCACGCCTACGA 
CTTTTACAAGCCGGACCTCAGCTCCGAATACCCAACGGTAGATGGAAAGCTCTCTATTCAGTGCTACCTTTCTGCTCTGGACACCTGCTACAGGC 
TGTATCGCAAGAAGTTCGACCAGCAGCAGAAGGATACTTCCAAGCAGCCGGCCAGCCTGAGCACCTTCGATGCTATCCTATTCCACACACCGTTC 
TGCAAGTTGGTGCAAAAATCTGTGGGTCGCCTGAGCTTCAACGATTTCCTGCTGAGCAGTGAGGAGGAGCGAACGAAGCAGTTTCCCGACCTGGA 
ACGTTTTAATACCGCTACCCTGGAGAGCACCTACTTTGATCGCGATGTGGAGAAGGCCTTCATGACGCAGTCCGCCAACATCTTTGCCAGCAAGA 
CCAAGAAATCGCTGCTGCTGGCCAACCAAGTGGGCAATATGTACACACCCTCGGTGTACTCCGGTTTGGTGTCTCTGTTGATTAGCGGACCAGCC 
CAGGAGCTGGTGGGAAAGCGCATTGGTTTGTTCTCCTACGGATCCGGACTGGCTGCTTCCATGTACTCGATCAGCGTGACCCAGGATGCAGCTGC 
TTTCGAAAAGTTCGTCTCGCAGCTGGACTACGTGCAGCCGTTGCTGAATTCACGGGAGAAGGTTGCGCCCGAGCAGTTCTCCGCGCTGATGGAGG 
TGCGGGAGAAGAACAACCATGCGGCGCCCTACACGCCAACGGGCAGCATCTCCGCACTGTTTCCCGGCACCTACTACCTGAAGGACGTGGATGCG 
CTGCATAGACGCACCTACGAGCGCACACCGACCATCAGCAATGGGGTGCACTAACCCCGGGCTATGGTTCCTTCGGTTTGGTTTCGGTTTCGCCT 
ACAACTTGACAATAACTCTAACTCTCTCGACCAAACCCATGTAAGCAGTAGCTGTAGATGGTATCTATTTATAACTGGTATGATCCCGAGGGGTC 
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GCTATCCTTTTTGCTCTTATTCGGGTCCTTGTCGCGTATATAGTACTACACTTCTTATTTATACAAGCTTGCGAATTAACTCTTTCATCCAGTCA 
GACCTTATTTATATCATGGAACAAGTCTTAACTAGCGTCGGTACTGCAATCTATGCCTATAATTGTACAATACTTATACCTGTGTGTAATGCACA 
C AAT C AAT TGTTTAAT AAACAAT AAAAATC AAC AAAGTC 
{ SEQ ID NO: 1184) 

Start ATG: 177 (Reverse strand: CAT) 

MASHWPENVGIRAIEILFPSQYVDQTELETFDGASAGKYTIGLGQAKMGFCSDREDVNSLCLTVVSRLLERQHVKHSEIGRLEVGTETIVDKSKS 
VKSVLMQLFAESGNTDIEGIDTTNACYGGTAALFNAVNWVESSSWDGRLALAVCADIAVYAKGAARPTGGAGAVAMLVGPNAPLILDRGLRATHM 
EHAYDFYKPDLSSEYPTVDGKLSIQCYLSALDTCYRLYRKKFDQQQKDTSKQPASLSTFDAILFHTPFCKLVQKSVGRLSFNDFLLSSEEERTKQ 
FPDLERFNTATLESTYFDRDVEKAFMTQSANIFASKTKKSLLLANQVGNMYTPSVYSGLVSLLISGPAQELVGKRIGLFSYGSGLAASMYSISVT 
QDAAAFEKFVSQLDYVQPLLNSREKVAPEQFSALMEVREKNNHAAPYTPTGSISALFPGTYYLKDVDALHRRTYERTPTISNGVH* 
(SEQ ID NO: 1185) 



Celera Sequence No. : 142000013385215 

AGCACATTCGTTTTCCTCTTTAAATAAGGAGAATCCACTTAAAGAAATCTGATAATCGCTGCATATTGTGCTATTTATTTTATGTTAAAAAAGTA 
TATTCGCAAGAGCATAAAAACAATCTGATCGGGTTAATCTTAACrTTTTCGCTGCTCCATTCTCATTGATTACTACAGCACAAAAAAAAAGTGGG 
GAAAGCGTGGAATCCTAACTAGAAGAACGGAAGATAGAATAATGGATAAAAAACAAAGGGGCAGGGCAGCAAAAACTTAAATCATTGGCAAATAC 
CTTTTTACGATGCGGATTTTAATCAAAGTTTTTGCCACCGCTGTCAGTTTCTTTTTTTTACTTAAAATGGTAATAAAAAAAAAAGGGGAAACGAC 
GATAAATAAACATTTTAAAATAATTGCTAACGGCGTGTTGTTGCGCTTTAGTGAACTTATTAGTCTACCCTCGCACCGGCCAATTTCCACAATTG 
ATTAGTTTAAAATTGCATTTTTTGGACATTTAAATGGGAATCTAATTGATTGCCATTTTTATTGAGAGCACTTAACACTATACAATGGCAGGTTT 
TGTATTTAAGCTATATATTCCTCGTTTATACAGTGTCAAATTAATTATTGATTTTATAAGCAGCTAAGAAAATGATTGTTTAATTTGGACCGATT 
GATCATCGAGGACTTAAAATGGACTTAAAATAATTGGATATTAGCAACTTTGAAAAGATCCCGTTTACTCCCAAAACTTCAGAATCAACCCACGA 
ACTTCTTTAAGCCAGTGTGACCCCACCCGCTCTTTAGCTCTCTTTCGCTCCATCTCGCTTCCGCACATCGATTGGCGAAAGCTTCGCTCGCACTC 
GATATCAGCTCACAACTTCGCTTGCAAGCTCGCCGCGGAAAGCTCGCGATCACACGGGTCGAAATCGAAATCGAATCGCGGTAGAGAGCTTTTCG 
AAGTGCGAGCGTCGACGTTTTATTTCTGCGGCTCAGTCGGTTTTAGTTCGTTCTGTTGGAGAAAAGCAGCAATCACACGTTCGCAAGGTGAACGC 
GAAGACACAGCAAAGTAAGCCCTTCCCCCCCACCAACACACACACCCACCCAAAGCAAATAAGTAACAATAAATAATGGAATGGCTGGAAGACGG 
TTCTGGGCGATTTAAACAATTAGCGAAAGAAAGCGGCATTGAAATCCGTCTTGAATTCGCCCCGAAAAAGTGACGAAGCAGCGATCAAAGCGCAG 
AGCGAAAACACGCACACAGACTGCAAGTGTGTTAGATAATAAGTGCAGCACAAGTCCACACTTGAGTAAAATAATCCCTAAAAAAGCCGAATATC 
AATTAGTTTTCCAAGGAGCTTGAAAAAGTGCGGTATGAAAACGTGAAAATTTGCGCGTGGAAAATTATTTTGCCTTGTCAGCTGACCCCCTTCCC 
CGTGTTCGCTCCATCCCTGTCGCACCGCGGGTCTTGTGATCGCCGCCGCTCTTGCGCTCGCTTGCTCTCCCATTTCGAAACTCGAAACAGAAGTG 
GGAGTTATTCGTATTCCGATAATGAAAAACCAATATGGAGAACGAGCGACGTAAAAAAGGCGGCCCAAAGATTTTTACCATTTCCCTTACCCACT 
TTTTTTTCATTTGTCAGCTGACGGCAATGACAGTAGTCTTGTGATCAACGTCAAAAGCAATTGTCAAATATTCGAACTCGAATGGAGAGCGAGAG 
AGCCAGAGCGGAGAGTTGCTCTCCCACTCCACCCTCTCTTGTTTTTCTTTGCTGATAATTATGAAAACCCGCATATTTTGAAAAACATGCATTTC 
AGTTACATTCCTCCGTTGAATTTGTCAACCTGTGGTTGTTTTTTCACAGCTCTTATTTTATTTATTTAGCGATTAGTTTGACAAATTGCTCTCTT 
CGAACTTTCAAAGCTCTGTCACGTGAAACGAAAGCTCTGCTTTTAAAGTTTTACGCAGCATAATCAAAGAGGGGGAGTTAAAAAAAAAATAATTA 
AATCAATCGAAATTATTAGCTGCTAACCTACAACTTTATAACCTATAATCGAAAAATTTGGGAGCTGTGGGCTCTACAAAAACTTAACCTGTAAA 
TGTAGCAGATACACCTGCCCCTTGTCAGCTGACAGAGGGCTGAGCAAGAAATTAGTGATAAGAAAATGTTCACCTTTATCTTCGCCCTTTTGCAG 
CCAGCATTTAACAATTTTCCTCTTCTATTTTCCCTCCATTGCAGTCGAAAAAACAGAATAAAGCAAAATGTCCAACCTTAAGCGTTTCGATGATG 
AGGAGCGTGAGTCCAAATATGGACGTGTCTTCGCTGTCTCCGGTCCTGGTAAGCACCTAACTATACTGACTAACCATAACTCATGCTATCTAAAA 
GTTAATAAAAATAAGTTAATAATACCTGTGAACTGGAAACCTTACTTTGATGATTAGTCTAGAACTTACACTTCTGTGTGAAATAATGGCAACTT 
TAGAAATGTGTCCACCTATTTGTGATTAATATTCAAACAACTCAAACATTTTGGTTTCATTATTCAAAATTAAATGTGAATAATTTTAATAATTA 
ATTAATTGTTTCTTTAAACTTTTTTCTATAATTCTAACAAAAACATCATCAAGTATCATTAATAATAAAAAATTTTAAAAGAAAATGTTCAATGG 
AACCTATCTTCGTTTGCTGAGTTATAAAAACTTCTTGAATGAAATGTAGCCCCCCTAACCCGACCAACCGCTTCATTCCAGTCGTCACCGCCGAG 
GCCATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCGGCTACTACGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCACCAT 
CCAGGTGTACGAGGAGACCTCTGGCGTAACTGTCGGAGATCCGGTGCTGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATCATGG 
GCAGCATCTTTGACGGTATCCAGCGTCCCCTGAAGGACATTAACGAGCTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAGTTTG 
TCCCGCGTGGCCAGCTGGGAGTTCAACCCCCTGAACGTCAAGGTCGGCTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGAACAC 
TCTGGTCAAGCACAAGATGATTGTGAACCCCCGCGCCAAGGGAACAGTGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTCGTCC 
TGGAGACCGAGTTCGATGGAGAGATCACCAAGCACACCATGTTGCAGGTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCCCGCC 
AACCACCCCCTGCTCACCGGACAGCGTGTGCTCGACTCGCTCTTCCCCTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTTGCGG 
CAAGACTGTGATCTCGCAGGTGAGAGTCCCACAAATTGAGAATTTAAGGAGCGATGCCTCGTGTAGCCTCCATACACTCAAGTTTCATAAAAATA 
C AAT CCCT AAT AAATCATTT ACT TGCTTGCAGGCTCTGTCCAAG TACT CCAACTCCGATGTCATCATCTACGTCGGTTGCGGTGAGCGTGGTAAC 
GAGATGTCTGAGGTACTGCGTGACTTCCCCGAGCTGTCCGTGGAGATCGACGGTGTCACCGAGTCCATCATGAAGCGTACCGCCCTTGTGGCCAA 
CACCTCCAACATGCCTGTGGCTGCTCGTGAGGCCTCCATCTACACTGGTATCACCTTGTCCGAATACTTCCGTGATATGGGTTACAACGTGTCCA 
TGATGGCTGATTCCACCTCCCGTTGGGCTGAGGCTCTTCGTGAAATTTCTGGTCGTCTCGCTGAGATGCCTGCCGATTCCGGCTACCCAGCCTAC 
TTGGGAGCCCGTCTGGCCTCCTTCTACGAGCGTGCCGGTCGCGTTAAGTGCTTGGGTAACCCCGAGCGCGAGGGATCCGTGTCCATTGTCGGAGC 
TGTGTCTCCTCCTGGTGGTGACTTCTCCGATCCCGTGACCTCCGCCACTCTGGGTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAGTTGGCCC 
AGCGCAAGCACTTCCCCTCGATCAACTGGCTCATCTCCTACTCGAAGTACATGCGTGCTCTGGATGACTTCTATGACAAGAACTTCCCCGAATTC 
GTGCCGCTGCGTACCAAGGTCAAGGAGATCCTGCAGGAGGAGGAGGATCTGTCTGAGATCGTGCAACTGGTCGGCAAGGCCTCTCTGGCCGAAAC 
CGACAAGATCACGCTGGAGGTGGCCAAGCTGCTGAAGGACGATTTCCTGCAGCAGAACTCCTACTCCTCGTACGATCGCTTCTGCCCCTTCTACA 
AGACCGTGGGCATGTTGAGGAACATCATCGACTTCTACGACATGGCCCGTCACTCCGTGGAGTCTACGGCTCAGTCTGAGAACAAGATCACCTGG 
AACGTGATTCGTGAGGCAATGGGCAACATTATGTACCAGCTGTCATCCATGAAGTTCAAGGTGGGTTAACACGCAAATTTATCCATTGCCTAGAC 
ACTGGGTGACCACATTTTTCAATCCATTTCAGGACCCCGTTAAGGATGGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGACCTG 
CAGCAGGCCTTCAGAAATCTGGAGGACTAGAGACCGCGCTGGCCCTACTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAAAAAT 
GAAAGCAGTCAAAAACCATCCGAAAAAGCCTAATCAAACACCAACAATTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCACAAC 
TTCTTGGTGCCTGCGAGAGATGTAAACATTCCGGCCTGCGGTTAATACTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCTAGGC 
AAC AGCAACT AC AACGTCCTGCT AT GT ACT TCCATTT AC AAC AACAACACCAACATACACTTGAATAAAAGTAC AC GG AC AC TGGCGC AC AC ACA 
ACACATACATAAAAGACACAAATACAAATGCATGCATAAATAGTATTATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTCATGT 
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TTTCTCCCTGTTTGTTTGTTAAATTTATGTAAATATTTAAAGTATGAAATATTAAATGTACGAATAAAGTGCAACAACAAATACATTTAATGTAA 
TTGAAAGTGAATTTCACTGGCAGCAGAATGGATATTAAAAATGTGTCAACTCGATAAAAAGATAATAAGTTAAAATATTTTTTTGAATTTTAAAC 
CTTCATTATATAAACATACTTGACTATATGAAAGCTAAGAAAATGGGAATATATTGAAAAGCATAATCTAAAATCCTATATCATTCTCAGTCATC 
ACCTTAGTCTTTCCATAAACTTCTGCCCCATTTATTTTATTTATTTGACAATATTAGTAAAAATTGTACCAAGTGAAAAAATTGTATATATTTAA 
ACCAAATGGAAAAGAAGTCCTGGGTTGCACCGCTTACCTCGTTCACCAACAGTAACGCAAGTAACGAAGACAGAGCCCGCAACTCGGGATCGTTA 
AACCAAGATGGGGAGCCAAGGTCCGTGCCTCACCCAACGGTTCCGGCTAAGAAGTGCACATGTCCAACCTGCGCTTGCCAACGTAATGACCATAA 
ATCCAATCCGGTACCCCCTCCCCCTCAATCACGCAAGCCTGTAGCCGAGCCGGAAAGTCCCCATGATACAGTCAATGACGAGGACAGTTTGAAGG 
ACTTGAGACGTTCGACGGACCAATCCCACAAGAGCGCTCACATCGCCTTGGAGAAGAATGAGGACTCGGGTTTTGTGATCGAGCAGGTGGTTGAT 
ACGCACAAATATTCGTCGGATGAAGAAGAGGAGGAGGCGACGATGGGTCGCATTTTCGGAGTCTCCGGTCCGGTGGTCAATGCCGAGGAGATGGC 
CGGCGCAGCCATGTACGAGCTGGTTCGCGTTGGACACTCCCAGCTTGTTGGTGAGATCATTCGACTGGAGGGTGATATGGCCACCATTCAGGTTT 
ACGAGGATACTTCGGGTGTGAGCGTGGGTGATCCCGTCTACCAGACGGGAAAGCCGCTCTCCGTGGAATTGGGACCCGGCATCATGGGCAGCATT 
TTCGATGGTATCCAGCGACCATTGAGGTCCATCAGTGAACTAACCAACTC 

(SEQ ID NO: 1186) 

Exon: 1001.. 1037 
Exon: 2230.. 2328 
Exon: 2742. .3439 
Exon: 3548.. 4525 
Exon: 4593.. 5225 
Start ATG: 2253 

Transcript No, : CT37353 

TTCTGTTGGAGAAAAGCAGCAATCACACGTTCGCAAGTCGAAAAAACAGAATAAAGCAAAATGTCCAACCTTAAGCGTTTCGATGATGAGGAGCG 
TGAGTCCAAATATGGACGTGTCTTCGCTGTCTCCGGTCCTGTCGTCACCGCCGAGGCCATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCG 
GCTACTACGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCACCATCCAGGTGTACGAGGAGACCTCTGGCGTAACTGTCGGAGAT 
CCGGTGCTGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATCATGGGCAGCATCTTTGACGGTATCCAGCGTCCCCTGAAGGACAT 
TAACGAGCTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAGTTTGTCCCGCGTGGCCAGCTGGGAGTTCAACCCCCTGAACGTCA 
AGGTCGGCTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGAACACTCTGGTCAAGCACAAGATGATTGTGAACCCCCGCGCCAAG 
GGAACAGTGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTCGTCCTGGAGACCGAGTTCGATGGAGAGATCACCAAGCACACCAT 
GTTGCAGGTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCCCGCCAACCACCCCCTGCTCACCGGACAGCGTGTGCTCGACTCGC 
TCTTCCCCTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTTGCGGCAAGACTGTGATCTCGCAGGCTCTGTCCAAGTACTCCAAC 
TCCGATGTCATCATCTACGTCGGTTGCGGTGAGCGTGGTAACGAGATGTCTGAGGTACTGCGTGACTTCCCCGAGCTGTCCGTGGAGATCGACGG 
TGTCACCGAGTCCATCATGAAGCGTACCGCCCTTGTGGCCAACACCTCCAACATGCCTGTGGCTGCTCGTGAGGCCTCCATCTACACTGGTATCA 
CCTTGTCCGAATACTTCCGTGATATGGGTTACAACGTGTCCATGATGGCTGATTCCACCTCCCGTTGGGCTGAGGCTCTTCGTGAAATTTCTGGT 
CGTCTCGCTGAGATGCCTGCCGATTCCGGCTACCCAGCCTACTTGGGAGCCCGTCTGGCCTCCTTCTACGAGCGTGCCGGTCGCGTTAAGTGCTT 
GGGTAACCCCGAGCGCGAGGGATCCGTGTCCATTGTCGGAGCTGTGTCTCCTCCTGGTGGTGACTTCTCCGATCCCGTGACCTCCGCCACTCTGG 
GTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAGTTGGCCCAGCGCAAGCACTTCCCCTCGATCAACTGGCTCATCTCCTACTCGAAGTACATG 
CGTGCTCTGGATGACTTCTATGACAAGAACTTCCCCGAATTCGTGCCGCTGCGTACCAAGGTCAAGGAGATCCTGCAGGAGGAGGAGGATCTGTC 
TGAGATCGTGCAACTGGTCGGCAAGGCCTCTCTGGCCGAAACCGACAAGATCACGCTGGAGGTGGCCAAGCTGCTGAAGGACGATTTCCTGCAGC 
AGAACTCCTACTCCTCGTACGATCGCTTCTGCCCCTTCTACAAGACCGTGGGCATGTTGAGGAACATCATCGACTTCTACGACATGGCCCGTCAC 
TCCGTGGAGTCTACGGCTCAGTCTGAGAACAAGATCACCTGGAACGTGATTCGTGAGGCAATGGGCAACATTATGTACCAGCTGTCATCCATGAA 
GTTCAAGGACCCCGTTAAGGATGGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGACCTGCAGCAGGCCTTCAGAAATCTGGAGG 
ACTAGAGACCGCGCTGGCCCTACTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAAAAATGAAAGCAGTCAAAAACCATCCGAAA 
AAGCCTAATCAAACACCAACAATTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCACAACTTCTTGGTGCCTGCGAGAGATGTAA 
ACATTCCGGCCTGCGGTTAATACTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCTAGGCAACAGCAACTACAACGTCCTGCTAT 
GTACTTCCATTTACAACAACAACACCAACATACACTTGAATAAAAGTACACGGACACTGGCGCACACACAACACATACATAAAAGACACAAATAC 
AAATGCATGCATAAATAGTATTATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTCATGTTTTCTCCCTGTTTGTTTGTTAAATT 
TATGTAAATATTTAAAGTATGAAATATTAAATGTACGAATAAAGTGCAACAACAAATACATTTAATGTAA 
{SEQ ID NO; 1187) 

Start ATG: 61 

MSNLKRFDDEERESKYGRVFAVSGPVVTAEAMSGSAMYELVRVGYYELVGEIIRLEGDMATrQVYEETSGVTVGDPVLRTGKPLSVELGPGIMGS 
IFDGIQRPLKDINELTESIYIPKGVNVPSLSRVASWEFNPLNVKVGSHITGGDLYGLVHENTLVKHKMIVWPRAKGTVRYIAPSGNYKVDDVVLE 
TEFDGEI TKHTMLQVWPVRQPRPVTEKLPANHPLLTGQRVLDSLFPCVQGGTT AI PGAFGCGKTVI SQALSKYSNS DVI I YVGCGERGNEMSEVL 
RDFPELSVEIDGVTESIMKRTALVANTSNMPVAAREASIYTGITLSEYFRDMGYNVSMI^DSTSRWAEALREISGRLAEMPADSGYPAYLGARLA 
SFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSINWLISYSKYMRALDDFYDKNFPEFVPLRTK 
VKEILQEEEDLSEIVQLVGKASLAETDKITLEVAKLLKDDFLQQNSYSSYDRFCPFYKTVGMLRNIIDFYDMARHSVESTAQSENKITWNVIREA 
MGNIMYQLSSMKFKDPVKDGEAKIKADFEQLHEDLQQAFRNLED* 
(SEQ ID NO: 1188) 

Name: vacuolar ATPase subunit A 
Classification: enzyme 



Celera Sequence No. : 142000013385215 

AAAAAACAAAGGGGCAGGGCAGCAAAAACTTAAATCATTGGCAAATACCTTTTTACGATGCGGATTTTAATCAAAGTTTTTGCCACCGCTGTCAG 
TTTCTTTTTTTTACTTAAAATGGTAATAAAAAAAAAAGGGGAAACGACGATAAATAAACATTTTAAAATAATTGCTAACGGCGTGTTGTTGCGCT 
TTAGTGAACTTATTAGTCTACCCTCGCACCGGCCAATTTCCACAATTGATTAGTTTAAAATTGCATTTTTTGGACATTTAAATGGGAATCTAATT 
GATTGCCATTTTTATTGAGAGCACTTAACACTATACAATGGCAGGTTTTGTATTTAAGCTATATATTCCTCGTTTATACAGTGTCAAATTAATTA 
TTGATTTTATAAGCAGCTAAGAAAATGATTGTTTAATTTGGACCGATTGATCATCGAGGACTTAAAATGGACTTAAAATAATTGGATATTAGCAA 
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CTTTGAAAAGATCCCGTTTACTCCCAAAACTTCAGAATCAACCCACGAACTTCTTTAAGCCAGTGTGACCCCACCCGCTCTTTAGCTCTCTTTCG 
CTCCATCTCGCTTCCGCACATCGATTGGCGAAAGCTTCGCTCGCACTCGATATCAGCTCACAACTTCGCTTGCAAGCTCGCCGCGGAAAGCTCGC 
GATCACACGGGTCGAAATCGAAATCGAATCGCGGTAGAGAGCTTTTCGAAGTGCGAGCGTCGACGTTTTATTTCTGCGGCTCAGTCGGTTTTAGT 
TCGTTCTGTTGGAGAAAAGCAGCAATCACACGTTCGCAAGGTGAACGCGAAGACACAGCAAAGTAAGCCCTTCCCCCCCACCAACACACACACCC 
ACCCAAAGCAAATAAGTAACAATAAATAATGGAATGGCTGGAAGACGGTTCTGGGCGATTTAAACAATTAGCGAAAGAAAGCGGCATTGAAATCC 
GTCTTGAATTCGCCCCGAAAAAGTGACGAAGCAGCGATCAAAGCGCAGAGCGAAAACACGCACACAGACTGCAAGTGTGTTAGATAATAAGTGCA 
GCACAAGTCCACACTTGAGTAAAATAATCCCTAAAAAAGCCGAATATCAATTAGTTTTCCAAGGAGCTTGAAAAAGTGCGGTATGAAAACGTGAA 
AATTTGCGCGTGGAAAATTATTTTGCCTTGTCAGCTGACCCCCTTCCCCGTGTTCGCTCCATCCCTGTCGCACCGCGGGTCTTGTGATCGCCGCC 
GCTCTTGCGCTCGCTTGCTCTCCCATTTCGAAACTCGAAACAGAAGTGGGAGTTATTCGTATTCCGATAATGAAAAACCAATATGGAGAACGAGC 
GACGTAAAAAAGGCGGCCCAAAGATTTTTACCATTTCCCTTACCCACTTTTTTTTCATTTGTCAGCTGACGGCAATGACAGTAGTCTTGTGATCA 
ACGTCAAAAGCAATTGTCAAATATTCGAACTCGAATGGAGAGCGAGAGAGCCAGAGCGGAGAGTTGCTCTCCCACTCCACCCTCTCTTGTTTTTC 
TTTGCTGATAATTATGAAAACCCGCATATTTTGAAAAACATGCATTTCAGTTACATTCCTCCGTTGAATTTGTCAACCTGTGGTTGTTTTTTCAC 
AGCTCTTATTTTATTTATTTAGCGATTAGTTTGACAAATTGCTCTCTTCGAACTTTCAAAGCTCTGTCACGTGAAACGAAAGCTCTGCTTTTAAA 
GTTTTACGCAGCATAATCAAAGAGGGGGAGTTAAAAAAAAAATAATTAAATCAATCGAAATTATTAGCTGCTAACCTACAACTTTATAACCTATA 
ATCGAAAAATTTGGGAGCTGTGGGCTCTACAAAAACTTAACCTGTAAATGTAGCAGATACACCTGCCCCTTGTCAGCTGACAGAGGGCTGAGCAA 
GAAATTAGTGATAAGAAAATGTTCACCTTTATCTTCGCCCTTTTGCAGCCAGCATTTAACAATTTTCCTCTTCTATTTTCCCTCCATTGCAGTCG 
AAAAAACAGAATAAAGCAAAATGTCCAACCTTAAGCGTTTCGATGATGAGGAGCGTGAGTCCAAATATGGACGTGTCTTCGCTGTCTCCGG TCCT 
GGTAAGCACCTAACTATACTGACTAACCATAACTCATGCTATCTAAAAGTTAATAAAAATAAGTTAATAATACCTGTGAACTGGAAACCTTACTT 
TGATGATTAGTCTAGAACTTACACTTCTGTGTGAAATAATGGCAACTTTAGAAATGTGTCCACCTATTTGTGATTAATATTCAAACAACTCAAAC 
ATTTTGGTTTCATTATTCAAAATTAAATGTGAATAATTTTAATAATTAATTAATTGTTTCTTTAAACTTTTTTCTATAATTCTAACAAAAACATC 
ATCAAGTATCATTAATAATAAAAAATTTTAAAAGAAAATGTTCAATGGAACCTATCTTCGTTTGCTGAGTTATAAAAACTTCTTGAATGAAATGT 
AGCCCCCCTAACCCGACCAACCGCTTCATTCCAGTCGTCACCGCCGAGGCCATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCGGCTACTA 
CGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCACCATCCAGGTGTACGAGGAGACCTCTGGCGTAACTGTCGGAGATCCGGTGC 
TGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATCATGGGCAGCATCTTTGACGGTATCCAGCGTCCCCTGAAGGACATTAACGAG 
CTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAGTTTGTCCCGCGTGGCCAGCTGGGAGTTCAACCCCCTGAACGTCAAGGTCGG 
CTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGAACACTCTGGTCAAGCACAAGATGATTGTGAACCCCCGCGCCAAGGGAACAG 
TGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTCGTCCTGGAGACCGAGTTCGATGGAGAGATCACCAAGCACACCATGTTGCAG 
GTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCCCGCCAACCACCCCCTGCTCACCGGACAGCGTGTGCTCGACTCGCTCTTCCC 
CTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTTGCGGCAAGACTGTGATCTCGCAGGTGAGAGTCCCACAAATTGAGAATTTAA 
GGAGCGATGCCTCGTGTAGCCTCCATACACTCAAGTTTCATAAAAATACAATCCCTAATAAATCATTTACTTGCTTGCAGGCTCTGTCCAAGTAC 
TCCAACTCCGATGTCATCATCTACGTCGGTTGCGGTGAGCGTGGTAACGAGATGTCTGAGGTACTGCGTGACTTCCCCGAGCTGTCCGTGGAGAT 
CGACGGTGTCACCGAGTCCATCATGAAGCGTACCGCCCTTGTGGCCAACACCTCCAACATGCCTGTGGCTGCTCGTGAGGCCTCCATCTACACTG 
GTATCACCTTGTCCGAATACTTCCGTGATATGGGTTACAACGTGTCCATGATGGCTGATTCCACCTCCCGTTGGGCTGAGGCTCTTCGTGAAATT 
TCTGGTCGTCTCGCTGAGATGCCTGCCGATTCCGGCTACCCAGCCTACTTGGGAGCCCGTCTGGCCTCCTTCTACGAGCGTGCCGGTCGCGTTAA 
GTGCTTGGGTAACCCCGAGCGCGAGGGATCCGTGTCCATTGTCGGAGCTGTGTCTCCTCCTGGTGGTGACTTCTCCGATCCCGTGACCTCCGCCA 
CTCTGGGTATCGTGCAGGTGTTCTGGGGTCTCGACAAGAAGTTGGCCCAGCGCAAGCACTTCCCCTCGATCAACTGGCTCATCTCCTACTCGAAG 
TACATGCGTGCTCTGGATGACTTCTATGACAAGAACTTCCCCGAATTCGTGCCGCTGCGTACCAAGGTCAAGGAGATCCTGCAGGAGGAGGAGGA 
TCTGTCTGAGATCGTGCAACTGGTCGGCAAGGCCTCTCTGGCCGAAACCGACAAGATCACGCTGGAGGTGGCCAAGCTGCTGAAGGACGATTTCC 
TGCAGCAGAACTCCTACTCCTCGTACGATCGCTTCTGCCCCTTCTACAAGACCGTGGGCATGTTGAGGAACATCATCGACTTCTACGACATGGCC 
CGTCACTCCGTGGAGTCTACGGCTCAGTCTGAGAACAAGATCACCTGGAACGTGATTCGTGAGGCAATGGGCAACATTATGTACCAGCTGTCATC 
CATGAAGTTCAAGGTGGGTTAACACGCAAATTTATCCATTGCCTAGACACTGGGTGACCACATTTTTCAATCCATTTCAGGACCCCGTTAAGGAT 
GGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGACCTGCAGCAGGCCTTCAGAAATCTGGAGGACTAGAGACCGCGCTGGCCCTA 
CTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAAAAATGAAAGCAGTCAAAAACCATCCGAAAAAGCCTAATCAAACACCAACAA 
TTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCACAACTTCTTGGTGCCTGCGAGAGATGTAAACATTCCGGCCTGCGGTTAATA 
CTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCTAGGCAACAGCAACTACAACGTCCTGCTATGTACTTCCATTTACAACAACAA 
CACCAACATACACTTGAATAAAAGTACACGGACACTGGCGCACACACAACACATACATAAAAGACACAAATACAAATGCATGCATAAATAGTATT 
ATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTCATGTTTTCTCCCTGTTTGTTTGTTAAATTTATGTAAATATTTAAAGTATGA 
AATATTAAATGTACGAATAAAGTGCAACAACAAATACATTTAATGTAATTGAAAGTGAATTTCACTGGCAGCAGAATGGATATTAAAAATGTGTC 
AACTCGATAAAAAGATAATAAGTTAAAATATTTTTTTGAATTTTAAACCTTCATTATATAAACATACTTGACTATATGAAAGCTAAGAAAATGGG 
AATATATTGAAAAGCATAATCTAAAATCCTATATCATTCTCAGTCATCACCTTAGTCTTTCCATAAACTTCTGCCCCATTTATTTTATTTATTTG 
ACAATATTAGTAAAAATTGTACCAAGTGAAAAAATTGTATATATTTAAACCAAATGGAAAAGAAGTCCTGGGTTGCACCGCTTACCTCGTTCACC 
AACAGTAACGCAAGTAACGAAGACAGAGCCCGCAACTCGGGATCGTTAAACCAAGATGGGGAGCCAAGGTCCGTGCCTCACCCAACGGTTCCGGC 
TAAGAAGTGCACATGTCCAACCTGCGCTTGCCAACGTAATGACCATAAATCCAATCCGGTACCCCCTCCCCCTCAATCACGCAAGCCTGTAGCCG 
AGCCGGAAAGTCCCCATGATACAGTCAATGACGAGGACAGTTTGAAGGACTTGAGACGTTCGACGGACCAATCCCACAAGAGCGCTCACATCGCC 
TTGGAGAAGAATGAGGACTCGGGTTTTGTGATCGAGCAGGTGGTTGATACGCACAAATATTCGTCGGATGAAGAAGAGGAGGAGGCGACGATGGG 
TCGCATTTTCGGAGTCTCCGGTCCGGTGGTCAATGCCGAGGAGATGGCCGGCGCAGCCATGTACGAGCTGGTTCGCGTTGGACACTCCCAGCTTG 
TTGGTGAGATCATTCGACTGGAGGGTGATATGGCCACCATTCAGGTTTACGAGGATACTTCGGGTGTGAGCGTGGGTGATCCCGTCTACCAGACG 
GGAAAGCCGCTCTCCGTGGAATTGGGACCCGGCATCATGGGCAGCATTTTCGATGGTATCCAGCGACCATTGAGGTCCATCAGTGAACTAACCAA 
CTC 

{SEQ ID MO: 1189) 

Exon: 1001.. 1125 
Exon: 1993.. 2091 
Exon: 2505.. 3202 
Exon: 3311.. 4288 
Exon: 4356.. 4988 
Start ATG: 2016 

Transcript No. : CT37359 

CGAAAACACGCACACAGACTGCAAGTGTGTTAGATAATAAGTGCAGCACAAGTCCACACTTGAGTAAAATAATCCCTAAAAAAGCCGAATATCAA 
TTAGTTTTCCAAGGAGCTTGAAAAAGTGCGTCGAAAAAACAGAATAAAGCAAAATGTCCAACCTTAAGCGTTTCGATGATGAGGAGCGTGAGTCC 
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AAATATGGACGTGTCTTCGCTGTCTCCGGTCCTGTCGTCACCGCCGAGGCCATGTCTGGATCAGCTATGTACGAGTTGGTCCGCGTCGGCTACTA 
CGAGCTGGTGGGCGAGATCATCCGTCTGGAGGGTGACATGGCCACCATCCAGGTGTACGAGGAGACCTCTGGCGTAACTGTCGGAGATCCGGTGC 
TGCGTACCGGCAAGCCTCTTTCCGTGGAGCTGGGACCCGGTATCATGGGCAGCATCTTTGACGGTATCCAGCGTCCCCTGAAGGACATTAACGAG 
CTGACCGAATCCATCTACATCCCCAAGGGTGTGAACGTGCCCAGTTTGTCCCGCGTGGCCAGCTGGGAGTTCAACCCCCTGAACGTCAAGGTCGG 
CTCCCACATCACCGGAGGTGACCTGTACGGTCTGGTGCATGAGAACACTCTGGTCAAGCACAAGATGATTGTGAACCCCCGCGCCAAGGGAACAG 
TGCGCTACATCGCCCCCTCCGGCAACTACAAGGTCGACGATGTCGTCCTGGAGACCGAGTTCGATGGAGAGATCACCAAGCACACCATGTTGCAG 
GTGTGGCCAGTGCGTCAGCCACGTCCCGTGACCGAGAAGCTGCCCGCCAACCACCCCCTGCTCACCGGACAGCGTGTGCTCGACTCGCTCTTCCC 
CTGTGTCCAGGGCGGTACCACCGCCATTCCCGGAGCTTTCGGTTGCGGCAAGACTGTGATCTCGCAGGCTCTGTCCAAGTACTCCAACTCCGATG 
TCATCATCTACGTCGGTTGCGGTGAGCGTGGTAACGAGATGTCTGAGGTACTGCGTGACTTCCCCGAGCTGTCCGTGGAGATCGACGGTGTCACC 
GAGTCCATCATGAAGCGTACCGCCCTTGTGGCCAACACCTCCAACATGCCTGTGGCTGCTCGTGAGGCCTCCATCTACACTGGTATCACCTTGTC 
CGAATACTTCCGTGATATGGGTTACAACGTGTCCATGATGGCTGATTCCACCTCCCGTTGGGCTGAGGCTCTTCGTGAAATTTCTGGTCGTCTCG 
CTGAGATGCCTGCCGATTCCGGCTACCCAGCCTACTTGGGAGCCCGTCTGGCCTCCTTCTACGAGCGTGCCGGTCGCGTTAAGTGCTTGGGTAAC 
CCCGAGCGCGAGGGATCCGTGTCCATTGTCGGAGCTGTGTCTCCTCCTGGTGGTGACTTCTCCGATCCCGTGACCTCCGCCACTCTGGGTATCGT 
GCAGGTGTTCTGGGGTCTCGACAAGAAGTTGGCCCAGCGCAAGCACTTCCCCTCGATCAACTGGCTCATCTCCTACTCGAAGTACATGCGTGCTC 
TGGATGACTTCTATGACAAGAACTTCCCCGAATTCGTGCCGCTGCGTACCAAGGTCAAGGAGATCCTGCAGGAGGAGGAGGATCTGTCTGAGATC 
GTGCAACTGGTCGGCAAGGCCTCTCTGGCCGAAACCGACAAGATCACGCTGGAGGTGGCCAAGCTGCTGAAGGACGATTTCCTGCAGCAGAACTC 
CTACTCCTCGTACGATCGCTTCTGCCCCTTCTACAAGACCGTGGGCATGTTGAGGAACATCATCGACTTCTACGACATGGCCCGTCACTCCGTGG 
AGTCTACGGCTCAGTCTGAGAACAAGATCACCTGGAACGTGATTCGTGAGGCAATGGGCAACATTATGTACCAGCTGTCATCCATGAAGTTCAAG 
GACCCCGTTAAGGATGGTGAGGCCAAGATCAAGGCTGACTTCGAGCAGCTGCACGAGGACCTGCAGCAGGCCTTCAGAAATCTGGAGGACTAGAG 
ACCGCGCTGGCCCTACTTTTACACTCTAATCTTATATTTGTTATATAGTTAACGTTTAAAAATGAAAGCAGTCAAAAACCATCCGAAAAAGCCTA 
ATCAAACACCAACAATTCCGTGCTGCATTCGATGAAAAACAAAAGTCCAACAAATACCACAACTTCTTGGTGCCTGCGAGAGATGTAAACATTCC 
GGCCTGCGGTTAATACTTTCCCCTAACCACGCCCCCTCCGCCCCTTGAAGGGCAACTCTAGGCAACAGCAACTACAACGTCCTGCTATGTACTTC 
CATTTACAACAACAACACCAACATACACTTGAATAAAAGTACACGGACACTGGCGCACACACAACACATACATAAAAGACACAAATACAAATGCA 
TGCATAAATAGTATTATTGTTTAATGAATGGAAATTCTTGTTTATTTGTGAAAAAAGTCATGTTTTCTCCCTGTTTGTTTGTTAAATTTATGTAA 
AT AT T T AAAGT AT G AAAT AT T AAAT GTACG AAT AAAGT GC AAC AAC AAAT AC ATT T AAT GT AA 
{SEQ ID NO: 1190) 

Start ATG: 149 

MSNLKRFDDEERESKYGRVFAVSGPVVTAEAMSGSAMYELVRVGYYELVGEIIRLEGDMATIQVYEETSGVTVGDPVLRTGKPLSVELGPGIMGS 
IFDGIQRPLKDINELTESIYIPKGVNVPSLSRVASWEFNPLNVKVGSHITGGDLYGLVHENTLVKHKMIVNPRAKGTVRYIAPSGNYKVDDVVLE 
TEFDGEITKHTMLQVWPVRQPRPVTEKLPANHPLLTGQRVLDSLFPCVQGGTTAIPGAFGCGKTVISQALSKYSNSDVIIYVGCGERGKEMSEVL 
RDF PEL S VE I DG VT E S I MKRT AL V ANT S NMP V AARE A SIYTGITLSE Y FRDMG YN V SMMAD S TS RW AE AL RE I S GRL AEMP A D S G Y P A YLGARL A 
SFYERAGRVKCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSINWLISYSKYMRALDDFYDKNFPEFVPLRTK 
VKEILQEEEDLSEIVQLVGKASLAETDKITLEVAKLLKDDFLQQNSYSSYDRFCPFYKTVGMLRNIIDFYDMARHSVESTAQSENKITWNVIREA 
MGNIMYQLSSMKFKDPVKDGEAKIKADFEQLHEDLQQAFRNLED* 
{SEQ ID NO: 1191) 

Name: vacuolar ATPase subunit A 
Classification: enzyme 



Celera Sequence No. : 142000013384768 

GAAATGAGTTTTGTTTTGTTTTGTTTCTTCCGAGTGTTTGTGTGTTTATGTGTGTGGGGAGGGGGTGTATTCTGTGTGCGGCGTGTTTTGAGTGA 
GAGAGATAGAGAGAGAGCGCGAGTTGCTGCGGGTCGACGGTCGAAGGGGGTGGTGGTGGAGTTGGTGGTGGGGGGCTTGGTTTATACGACAAGTT 
TTGATTTATTATTAAATTTAAATACCTATGTATATAAATTACGTTTCTGTTTATTATAATTATTGTTGTTTCCATCTTAATCATTTTGTTTTCTT 
TCGCTTTTTATTTTTCATATTTTTTTTTTGTTTTTTGTATGTAGTGGGCAACTGCGGCATTTGTAAATTTCTTTATATTTAAATGTTCCTTTTGA 
CTGTGTAAAAATATCGATTTCCATCTGATCTTGTTTTGTAGTTACTATTCATGTATGCATTTGCTTTTCGTTTTAGTTAAATTACGTTAACGCTG 
CATTCTGAACTCTCTGTTGCTGCTGATTCTGCTGCTTGTTCTTCGGATTCTTGATCGGATTCTTCTGATTCTTCATCTTGGCGCGCTTTCTATGC 
TATTGAAATGTTGTTTGTGGTGCTGCTTACTGCCTTCTGGATCGGGATCCTATATTCCGCTCTGATCTGATCGGTTATGTTCATGTTATGTCGGG 
TTGTGTGTGTCTGTGTTGTGTGTATGAGAGTGGAGTGTGTTGGGTGTTCGCATCTCGTTATATGTTGTATATATATATATATATATGTGTATATA 
TATATATATATAATATATTATATGTATGTATAGTTAGTTGCTTAATAGTAGAATTTCTTGAATGTTGTTGTGGTAAAGTGTAGGTAATGTTGCTC 
TATGTCTCTATGTCGACTCAACTGCACTACATACATATGTCTTTCCACCGGATACGCGGCGAACCGAAACGTGCCGAGCCGAACCGATCCCCGTT 
TCTGACCCGCATCCGCATCCGTATCCGTATTTCGTTTCCTCACCTCCGCCTCATTCGCTGCGCAGTTTCTTTAGATCCGGTGACTTCTCCCCCAC 
GCCCACACCCACTCCGCCGGCGTTCTTCAGCTCCATATCGCAGTGCAGGGCAGTACGCTTCTGTCCAATGGCTGGTGCCGAGTCTGCACCCGCTG 
CTACCGGTGAAACCGCATCCACCGCCTTGTTCGCCGCCAGCTGCTGCAGTTTACTGTCCCTCACAATGCCGTTGCTGCTATTGCTCTCGTCAGCC 
TCTAGCTCCGCCTCATTATCCTCCTCCTCGTCCAGCTCATCTTCCTCCTCGTCCTCCGCCTCATCCTCATCCTCGTCGCCCAGGACGCCGTTCCC 
GTTACCATTTCCATTGGCTGGTGCTGCAGTCGCTTTGCTGGCTGGCTGTGGTGCTGCTGTAACCGCTTCCTCTTCTGCCAAAGGAGCAGGCTGAC 
CGTTGTTGGTGATCGAATTGTTAGTTTTGTGGGGCTGGCTCGGCGACTGCTTTTGATTCTGATTCTGGCTCGAGTGCTGTGTGGGAGTCAGGATG 
GGCAGGCTGTTGGTCTTGGGCGGCGCCTCCTGATTGCTTTCAGAGTCCGAGGCGAGCGGTGTTCCACTCGAACCGATGCCAATGCCAATGTCTAG 
GGCGATGGGCTCACCGTCCTCTGTGGAAGTGGTGGCCGTTGTCTGTGTGGTGCCATCGTTGTCATCGATGATCGGCTCCTGGGAGGTGGAATCGT 
TCGAAACAAAGTTGGAGTCACCGTCCTGCAGGGCGGCGCCAACGGCAGCTGCGTTGTTCTTGGCCTCGCCCGACTGTAGAGGCGTCAGAGCAGCC 
GGGTTGGTGTTCTCATTCTCTGTCTCCGATTCTGTAGCAGTGTTGCGACCGCCTGGGGGATAGAAAGCATTGGATTAGCCATCAATGGTCAATTT 
GTTAACTCGGCGAACTGCGGTCAGAGTAATCAAACAAAGAAACTCTGTGTTTGAAATTATTAGACATCCTAACACTTTTGAAAAGCTGCCAACAA 
GTATAAGCTACTGATAATATGTTAAAAGTTAACTATTTTACAGTAGTAACTTTACAATTCCTCCTTTTTATTTAACATGACGCCACGAGTAACGA 
ATAAATTGTAAATTGTAAATTGACCACTAGGCTACTACAAACATAATGTTTAATGGAAAATTCCGGATAGGAAAAATACTTGGTTTCTGGAAAAG 
ACCACTCGGTTTGAGTGTTTTGTTGCAAGGGTGTATACTCAAGCCGCAGTTATCGGATCATCTTAGCTGCTTACCTGCTCCTTGGCTTTCGTCGT 
CCAAATCAACACCCACACCATCGTTGGTATCGTCGCCATCGAGCACCTTGCTGCTGTTGCCGTTTGTGCTGAGTGAACCGGTGCCAGTGGAGGCG 
CCGGATCCAGGCGGCGGCATATCGTCGAGCATCTCGGGACTGAGATCATTGGCCACATAGTGTGACCAGAGGGCCAGCTCCACGCGATGAGGCGA 
CCAGTGCGGCGTATCCCCGCCCACCTCCGCATTGAGGCGCTCCACGGTGGCCTGAATGTGATTGACGAAGTTGAGGTACTCCTTGGTGGTGTAAT 
CGATGCCCTCGATCTCTGGTATGGCCATCAGGCACTCGTCGGCCATGAATGGTGCCGAATCGGGAGCTGCGGCTGCCAGCAGTGCACTGGCCATT 
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GTGGTGCCAACGCCCTTGAGGTTCGATAAAGCTGTGATCGCCTGCTCCAGATTGGGCAGCTTGCGGAAGGCCTTCTTTGTCTCCTGGATGACGGC 
GCGCGGTGTGTTGACCTTGACCAGGTAGGATAGCTGCGGATAGAATTTGCCGCGCGACTGCTTCCACTTCATCGACTGGACGAGCTCATCGTATA 
CCATATGCGCGTCCTTGCCGCGTGCCTTAATCAATTTGGGCAGTTCATTCTGATACCACTGATCCAGGCGGATCAGCTCTTGCGGCTTCTTGCAG 
CGCTTCTCGGCCTTTAGCTTAAGAACCTGGGGATAGAGCTGGTAGCAGTACTCGAACTGTTTGGTGCTCCCGGTCTCGAAGAACGAGACCGTCGC 
CTTGCCGTTGGACATCTTCTGCGGATGCGGATGTGTCGTTGTCGCCTGCTAATGATTGGTGGTATCGCTGTATCTCCTTATCGTGGTATCTATCG 
CCTTGGGGCCCAGGACGTTGGCTGATCCGCAAGTGACCGGCTGGCTATATATGCGGCAATATATGTACTTTGGTGTAAGCTACTATACGCCAAAT 
CAATTTATTGCTTACTGTTTAGTCTCCTGCTTTTCCTGTCCGTTCCGAGCTGCTCCTGTCAACGTACTGCTATTCGTGCGTTGAGAGTTCCCGGC 
GAGAAAGAAGCGAGTACGTGAGAGACAGAGAGAATGCGAGTGCGAGAGAGGGAGCACGAAGAAGCAGCAACGCAAAGGAGAGTGGATGAAAAGAG 
AAGAAAAGAGGCACACAATTAGGTTTACAGTTACGATTACATATGCAAGGGGGATCGGTGCGGTGGCATAGGGAGGTAGTTCGATTGATTACTGT 
TAATGCTGCTGCTGCTGCGGCCGTGTCTCTGTTGTTTGTTGTATTTACTCAATCCGATAATCGCTCACTATTTGCCAAAAATCTGCTCGCTTCCG 
CCGCCGTTTGCACAGCACACAAATCGAATTTTTACAGTGCTGTCGTCGGGTTCTTTTTAGTCCGCCATGCACTCACACAGTCAAACTGACAGGCA 
CACGGATGGACACACACACACTTACTCAGCACACTCGCTCTCGCACACCCAAAGAGAGCCAGGCGCTCATACACAGGCAAAAACACGCACACAAC 
ACACACACTGACTTAAACACGCACACACTTATATACTGAGGCAACAACAGCAGCAGCAGAAAATTTCTTTTTCATCTAGGTTTTCAAAAAAAATA 
CGAACGCGAGCATTAAATAAACGAATGCTACTATTTTTCGCTGTTGTTGTTGTTTGTGCTTTTTTCGTACCTCTTGTCTTTTTGGCAAACGGTGA 
GAACAGCTTTTCACTAGCGCACTGCACGCACACAGATGAGTGGGTGAACAACAAATACCTTTTAGTTAGTCGCCGCCAGTCTTCCTTTCCGTCCA 
CATCCAAACAACTACTCCAATGCTTTGCTGTTTTTCTGTGTTGCCTCTGCTTTGCTTTGCTTCGCTTCGCCTCTGTGCTTGTGCTGTGCTGCGTG 
CTTCGTGCTGTGCCCGTGCTGTGCTGCTTCTTCTTCTTCCTCTGCTGTTTCAGTTTCTGCTTTCTGCCTGCAGGGGCGCCGTTGACAACTGACAA 
ATAAAAAGGGGTTACTCTGTTCTCGAAATGAGCACGCCACCGAAGGCACAACAAAAAATTAAAAAGGATCGGAGCGGAGCGATGCAGCTGTGCGA 
TGTGCTGCTAACTCGTGTGACGGGAATTGGGAATTGGGATTGCGAATGGGAATGGGAGGGAATACGAGAAGCGAATATGGGAATGACGAACGACG 
ACGAACCGGGAACAGGAGTTGTCGTTCTATCTCGCCCGCACTCGGTGCCCGAATGTATCTGTATCTCTCTGGCACCCTCGGCGTTCCATTTCATT 
TTGCCGGCGACGTCCGTGCTCTCTGCCGCCGCTCTGCCATCGCCTCTGCCGCTGCGTTTATGCGCAGTGCGTCATTACACGAATACCTGTTTAAA 
GCGGAGGCATAAACGCGGCTTTGTCGATC 
(SEQ ID NO: 1192) 

Exon: 3684.. 3513 
Exon: 3299.. 3246 
Exon: 3179.. 2260 
Exon: 1857.. 1001 

Start ATG: 3055 (Reverse strand: CAT) 
Transcript No. : CT37627 

GAGTGCATGGCGGACTAAAAAGAACCCGACGACAGCACTGTAAAAATTCGATTTGTGTGCTGTGCAAACGGCGGCGGAAGCGAGCAGATTTTTGG 
CAAATAGTGAGCGATTATCGGATTGAGTAAATACAACAAACAACAGAGACACGGCCGCAGCAGCAGCAGCATTAACACAGTACGTTGACAGGAGC 
AGCTCGGAACGGACAGGAAAAGCAGGAGACTAAACACCAGCCGGTCACTTGCGGATCAGCCAACGTCCTGGGCCCCAAGGCGATAGATACCACGA 
TAAGGAGATACAGCGATACCACCAATCATTAGCAGGCGACAACGACACATCCGCATCCGCAGAAGATGTCCAACGGCAAGGCGACGGTCTCGTTC 
TTCGAGACCGGGAGCACCAAACAGTTCGAGTACTGCTACCAGCTCTATCCCCAGGTTCTTAAGCTAAAGGCCGAGAAGCGCTGCAAGAAGCCGCA 
AGAGCTGATCCGCCTGGATCAGTGGTATCAGAATGAACTGCCCAAATTGATTAAGGCACGCGGCAAGGACGCGCATATGGTATACGATGAGCTCG 
TCCAGTCGATGAAGTGGAAGCAGTCGCGCGGCAAATTCTATCCGCAGCTATCCTACCTGGTCAAGGTCAACACACCGCGCGCCGTCATCCAGGAG 
ACAAAGAAGGCCTTCCGCAAGCTGCCCAATCTGGAGCAGGCGATCACAGCTTTATCGAACCTCAAGGGCGTTGGCACCACAATGGCCAGTGCACT 
GCTGGCAGCCGCAGCTCCCGATTCGGCACCATTCATGGCCGACGAGTGCCTGATGGCCATACCAGAGATCGAGGGCATCGATTACACCACCAAGG 
AGTACCTCAACTTCGTCAATCACATTCAGGCCACCGTGGAGCGCCTCAATGCGGAGGTGGGCGGGGATACGCCGCACTGGTCGCCTCATCGCGTG 
GAGCTGGCCCTCTGGTCACACTATGTGGCCAATGATCTCAGTCCCGAGATGCTCGACGATATGCCGCCGCCTGGATCCGGCGCCTCCACTGGCAC 
CGGTTCACTCAGCACAAACGGCAACAGCAGCAAGGTGCTCGATGGCGACGATACCAACGATGGTGTGGGTGTTGATTTGGACGACGAAAGCCAAG 
GAGCAGGCGGTCGCAACACTGCTACAGAATCGGAGACAGAGAATGAGAACACCAACCCGGCTGCTCTGACGCCTCTACAGTCGGGCGAGGCCAAG 
AACAACGCAGCTGCCGTTGGCGCCGCCCTGCAGGACGGTGACTCCAACTTTGTTTCGAACGATTCCACCTCCCAGGAGCCGATCATCGATGACAA 
CGATGGCACCACACAGACAACGGCCACCACTTCCACAGAGGACGGTGAGCCCATCGCCCTAGACATTGGCATTGGCATCGGTTCGAGTGGAACAC 
CGCTCGCCTCGGACTCTGAAAGCAATCAGGAGGCGCCGCCCAAGACCAACAGCCTGCCCATCCTGACTCCCACACAGCACTCGAGCCAGAATCAG 
AATCAAAAGCAGTCGCCGAGCCAGCCCCACAAAACTAACAATTCGATCACCAACAACGGTCAGCCTGCTCCTTTGGCAGAAGAGGAAGCGGTTAC 
AGCAGCACCACAGCCAGCCAGCAAAGCGACTGCAGCACCAGCCAATGGAAATGGTAACGGGAACGGCGTCCTGGGCGACGAGGATGAGGATGAGG 
C GG AGG AC GAGG AGG AAG AT GAG C T GG AC G AGG AG G AGG AT AAT G AGG C GG AGC T AG AGGC T G AC GAG AG C AAT AG C AGC AAC GG C ATT GTG AGG 
GACAGTAAACTGCAGCAGCTGGCGGCGAACAAGGCGGTGGATGCGGTTTCACCGGTAGCAGCGGGTGCAGACTCGGCACCAGCCATTGGACAGAA 
GCGTACTGCCCTGCACTGCGATATGGAGCTGAAGAACGCCGGCGGAGTGGGTGTGGGCGTGGGGGAGAAGTCACCGGATCTAAAGAAACTGCGCA 
GCGAATGA 

(SEQ ID NO: 1193) 

Start ATG: 351 (Reverse strand: CAT) 

MSNGKATVSFFETGSTKQFEYCYQLYPQVLKLKAEKRCKKPQELIRLDQWYQKELPKLIKARGKDAHIWyDELVQSMKWKQSRGKFYPQLSYLVK 
VNTPRAVIQETKKAFRKLPNLEQAITALSNLKGVGTTMASALLAAAAPDSAPFMADECLMAIPEIEGIDYTTKEYLNFVNHIQATVERLNAEVGG 
DTPHWSPHRVELALWSHYVANDLSPEMLDDMPPPGSGASTGTGSLSTNGNSSKVLDGDDTNDGVGVDLDDESQGAGGRNTATESETENENTNPAA 
LTPLQSGEAKNNAAAVGAALQDGDSNFVSNDSTSQEPIIDDHDGTTQTTATTSTEDGEPIALDIGIGIGSSGTPLASDSESNQEAPPKTNSLPIL 
TPTQHSSQNQNQKQSPSQPHKTNNSITNKGQPAPLAEEEAVTAAPQPASKATAAPANGNGNGNGVLGDEDEDEAEDEEEDELDEEEDNEAELEAD 
ESNSSNGI VRD S K LQQL AAN K A VDA V S P VAAG AD SAP A I GQKRT ALHC DME LKN A GG VG VG VGEK S P DLKKL RS E * 
(SEQ ID NO: 1194) 



Celera Sequence No. : 142000013384589 

GTGGCACCATCGAGCCCCTCAAAGAGGAGCACATTTTCGTGGTCCTCGACGCCGCTACTGCCGTGTCCACCGTCCGTGCCGCGACCCGTTAATGG 
CGGCGAATGCGGTCGATGATCCAAATGGATGTCGTCGGCGTGGGCATGGCCATCCTCCTCCGCGACGACCATGTTGCAGGAGGAGGAGGCGGTTA 
GGGGCAACGCCCACACACTGCGCGCCGCCCCCGGACTTGGATGCACCTTCGAGGCCATGTAGAACTCGAAGTCCACGCCCAGACGGTCCGCCGTT 
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CCACCGTTGCCTTGATGCAGGGGGTTTTCCAGTAGGTGTGCAATCTTGTCGCCTTTTTCCGTTACGCTCCGACCGAAATTTGTACTCTGTTTGGC 

CAATCTTAGGCGTGGAATGCTAGGTAAACACGACTGCACTCGATGGCGAGGAAATTATGTCAGCACTGTGAGAAAATCAATATTTGTCCCGAATC 

GCAGGACTCGAATTGGGGCCACTTTCGAACTCGACTTGGCGCCAAACAAATCAACAGCTGACCGTTATTTTCGCAGTTGGGCGCGCAGTGTTGCC 

AACACATACTGGTAATTAAGACTTTTCTCAGCGAGGGGTGATTTAAGATTTAAGTCAAGAAGTATTAAAGAAAAAATATATATATTTAATTTGTT 

GGAAGGTCAATTGTTGAAACCGAGGTATCATTTAGCTTAACTTAAATATACTAGAAATCTCGTGGCATAAGCAAACTAAAAAAAAAAGGTTTATC 

AACTGACTTTTATATTAGTTTTAACATTATTGGGAAACATTTGTTTTCCGCCGCTAGAAACCTTTAAAATGTATACTACTATATTGCTTGGTACG 

AATGTTGCATTTTGAACTGATGCTAGCATTGCATTTTGTAAGGTTCCCATGTCTCTGTTATTTTGCCACCTGGTAACACTGTGCCCCTTTTTGAT 

AAATTTGTTTCGTAAAAAATGGGCGACAACGTGGTGACGGAACTAAATAACTTGGATTTGAACAATCAAAAGGCATGTTAAACTCTTTTCGAACT 

GCATTCGAGTGTATGAGAAGTGCTAGTTAGCCGAATTATGTTGAAGTTGGCGCCGCAGGGCGGCAGATATTATAGCCAAATAAGCCGCATTTGCC 

AGTGGATTTTGCCGAAAATTCAGACGCAGTCGAAGCCACCAAAACAACAACAACAACAGCAACCACAATTGAAAGAACAACAACAACACAGCTAC 

GCAACTTTGGTTGGTGTTTTAAGCTAAAAAGAATTGACCTAAACTTCAATTTAAACGCCTGTTACTGCTGGCACGTGTAAATGTTATGCTTGTCT 

GTGTGAGTGTATGCGATTCTCCAATGAATCAGTGCCGCCGGCTGTGACGTCACTAACAACCAACCAACTTCATGAGTGTCTGTTTTTTTTTGCTA 

CTCTATTGCATCAGCCATAAAACGACGAAGATACGCTCAAAGACGGCTCGATAAAACAGAGATAGGAGCACCGAAAACATAGCACCGAAAAGTGC 

TCATTGCCATGTCCCCAATTGTTTAAACAATAAGAAAACATCAGTCGAAAGTAGATACCGAAAAAGAGCATTTGTCTAATTGCATACGTAGCACA 

CTTGTGGCCACTTGAATATCTCCGGTTTATATTGATTTTTAACCTATTTCCGATGGATTTGCAGGCTGCCAAGATGAAGAAGGAGAAAAAGGAGA 

AGCCATCCGGCGGTGGAGATACTCGCAAGGGTAAGTAATTCTAACTAAGAATTAAAAACACAAAACGCACTTCCGTCTGTAAGATATGTAGCATA 

TCTTTAAGATTAGCTGCCTTCTTCTCTCTAATACCTGAAATTCCACTGATCTCTGCTATATTGATTTCAGAACTCAGTCCCTTGCCCAAGTACAT 

CGAGGAGCGCAATGTCTTCTGGGAGAAGTGCAAGGCGGAATACGAGGCCGAACTGGCGGCCAAGAAGCGTGAGCCCATCAAGGTGACACTGCCCG 

ATGGCAAGCAGGTGGACGCCACCTCCTGGGAGACCACTCCGTACGAAGTGGCCCGCGGCATTAGTCAGGGTCTCGCTGACAACACCGTCATCTCC 

AAGGTGAACGGCGAGGTGTGGGATTTGGACCGTGTGCTCGAGGGAAACTGCACCCTGCAGCTTCTCAAGTTCGACGACCCTGAGGCGCAGGCCGT 

CTTCTGGCACAGCTCTGCCCACATCATGGGTGAGGCCATGGAGCGCATCTATGGTGGACACCTGTGCTACGGTCCGCCCATTGAAAACGGCTTCT 

ACTACGACATGCATCTGGAGGGCGAGGGTGTAAGTAATTTGTGATGTTCAGTAAGAATTAAATTAAATTGAACGCCCAATTAGATCTCCACCAAC 

GACTACGGCGCCATGGAGGGTCTGGTTAAGCAAATTGTCAAGGAGAAGCAGAACTTCGAACGTCTGGAGATGAAGAAGTCCGACCTGCTGGAGAT 

GTTCAAATACAATGAGTTCAAGGTGCGCATTCTCAACGAGAAGGTGACCACCGACCGCACCACTGTCTACAAGTGCGGTTCTTTGATCGATCTCT 

GCCGCGGACCTCATGTTCGTCACACCGGCAAGGTGAAGGCCCTGAAGATCACCAAGAACTCCTCTACCTACTGGGAGGGTAAGGCTGACGCTGAG 

ACACTGCAACGCGTCTACGGCATCTCGTTCCCCGACCCGAAGCAGCTCAAGGAGTGGGAGAAACTTCAGGAGGAGGCGGCTAAGCGTGATCACCG 

CAAAATCGGTCGCGAGCAGGAGCTGTTCTTCTTCCATGAACTTTCGCCTGGCTCATGCTTTTTCCAGCCGCGAGGAGCGCACATCTACAACACCC 

TGATGGGCTTCATCAAGGCGGAATATAGAAAGCGTGGTTTCCAAGAAGTCATTTCCCCCAACATCTACAACGCTAAGTTGTGGATGACCTCCGGC 

CACTGGCAGCACTATGCCGAGAACATGTTCTCCTTCGAGGCCGAGAAGGAGAAGTTCGCCCTCAAGCCCATGAATTGTCCAGGACATTGTCTCAT 

CTTCGATAACCGCAATCGATCCTGGCGAGAGCTGCCACTGCGAATGGCCGACTTCGGTGTGCTTCATCGCAACGAGCTCTCCGGTGCTTTAACCG 

GATTAACTCGCGTGCGTCGCTTCCAGCAGGACGATGCGCACATCTTCTGTGCCCCCGAGCAGATTAAAAGCGAAATGAAGGGCTGCCTGGAGTTC 

CTGAAGTATGTGTATACCATTTTCGGATTCTCCTTCCAGCTGGTGCTGTCCACACGCCCGGATAATTACCTTGGCGAGTTGGAACAGTGGAATGA 

TGCGGAAAAGGCGCTGGCTGAATCACTCAACGAGTTTGGAATGCCGTGGAAGGAGAACCCCGGCGATGGCGCCTTCTACGGACCGAAGATTGATA 

TTACCATTATGGATGCACTGAAGCGTGCCCATCAGTGCGCCACCATCCAGTTGGACTTCCAGCTGCCAATTCGTTTTAATCTCAGCTACATCGCC 

GACGATGGCGAGAAGAAGAGGCCGGTGATCATCCATCGCGCTATTCTTGGCTCAGTGGAGCGGATGATTGCCATTCTCACGGAGAACTTTGCTGG 

CAAGTGGCCCTTCTGGCTATCACCGCGCCAGGTGATGGTCGTTCCAGTGGGTCCTGCCTATGATCAGTACGCCCAGTCAGTGCGGGATCAGTTGC 

ACGATGCTGGATTCATGAGCGAGGCGGATTGCGATGCCGGCGACACGATGAACAAGAAGATACGCAATGCTCAGTTGGCCCAGTTCAACTTCATT 

CTGGTGGTGGGCGACAAGGAGCGCTCCTCGAATACCGTTAACGTGCGCACTCGGGACAACAAGGTGCACGGCGAGGTCTCCGTGGCGGAACTCAT 

CACAAAGCTGCAAAAGATTCGCGACGAATTCATCGCCAACGAGGACAGCTTCTAAGACTCTATTTCGAACTATTTATCAAAAGTATACTTAACAT 

TTGCATAGCAAAGCAAGTACGCATTTTTGTTCTTTCTATGTATTTACGCTTACGAAACCGGAATTCACAAATAAAAAACCAAATCCAAACACATT 

TTAAGGTGTTTCAAATAGAAGAGAAGAGCATGCAATCAAATAAAAGCAATTCGATTCAATATACGATTATTAAGCTTCCCCAACTAAAAATCGTA 

TTTTTCTGATCCACCTTAAGCCGCCTATTTCACTCCAGTTTCTTGACGAATCAGTGGAATTTGTCAGCAATTGAGGACTAGTCATGCCGACACAG 

AGGTGCGCTGTAGTCCATTCAGGAACGATTTGGAATGATAAGATAAGTCGGCCCACATTTGGCGGAAAACTTTTTGGGAAAGGTGTGCGTGCACC 

CACAACGTATGAGTCATGTTTTGAATTTATTTTTTTGTATTAATTTATTGCTAGTATCTTGTATACTTTTCATTTTCATAATTTGATTTATTAAA 

AATTGCATCAATAGTTTATGTTTTTATTCAATTATCAATTTTGTAAGCTTTTGCTTTTCGCGTAAAAAATGCACAAAGTTCTCTCGCTAATCGAA 

GCAAAAATCGTTAATAATATTGTTAAGTGTTTGCTCTGCTCGTGCTCTGCTTATCTGTTTCAGTTTTAGTCATAAAAATACATATTTTATATTTG 

TGTATATTTATTCATAAGATGCACGTGGCCAAGTCCTATGCCATCCACAGTTCCAGTTAGCAGTCGCGTTCGCTGTGCTAAAAAAAAAACCGAAA 

TCCTAACCAAAACTGGCGAAACAAAAGCCGCAAACAGAACAGAACGGGGGCTAAAATTGGAAACAAAATGCCTGGCTGGTGGAATCATTGAACAC 

AGACTGGCTGCGTGGTATCTACGACTGCTGGAGATGCCGAGAGCTGGAATGTCGGAGGATCCTGGGACCGCCTGGCCTGACATAAGTACATGGTC 

TTGGGTTTTT 

(SEQ ID NO: 1195) 

Exon: 1001.. 1244 
Exon: 1680.. 1740 
Exon: 1876.. 2309 
Exon: 2364.. 3950 
Start ATG: 1689 

Transcript No. : CT37701 

CTTGGATTTGAACAATCAAAAGGCATGTTAAACTCTTTTCGAACTGCATTCGAGTGTATGAGAAGTGCTAGTTAGCCGAATTATGTTGAAGTTGG 
CGCCGCAGGGCGGCAGATATTATAGCCAAATAAGCCGCATTTGCCAGTGGATTTTGCCGAAAATTCAGACGCAGTCGAAGCCACCAAAACAACAA 
C AAC AAC AGC AAC C AC AAT T G AAAG AAC AAC AAC AAC AC AG CT AC GC AAC T T T GG C T GCC AAG AT GAAG AAGG AG AAAAAGG AG AAGC C AT C C G G 
CGGTGGAGATACTCGCAAGGAACTCAGTCCCTTGCCCAAGTACATCGAGGAGCGCAATGTCTTCTGGGAGAAGTGCAAGGCGGAATACGAGGCCG 
AACTGGCGGCCAAGAAGCGTGAGCCCATCAAGGTGACACTGCCCGATGGCAAGCAGGTGGACGCCACCTCCTGGGAGACCACTCCGTACGAAGTG 
GCCCGCGGCATTAGTCAGGGTCTCGCTGACAACACCGTCATCTCCAAGGTGAACGGCGAGGTGTGGGATTTGGACCGTGTGCTCGAGGGAAACTG 
CACCCTGCAGCTTCTCAAGTTCGACGACCCTGAGGCGCAGGCCGTCTTCTGGCACAGCTCTGCCCACATCATGGGTGAGGCCATGGAGCGCATCT 
ATGGTGGACACCTGTGCTACGGTCCGCCCATTGAAAACGGCTTCTACTACGACATGCATCTGGAGGGCGAGGGTATCTCCACCAACGACTACGGC 
GCCATGGAGGGTCTGGTTAAGCAAATTGTCAAGGAGAAGCAGAACTTCGAACGTCTGGAGATGAAGAAGTCCGACCTGCTGGAGATGTTCAAATA 
CAATGAGTTCAAGGTGCGCATTCTCAACGAGAAGGTGACCACCGACCGCACCACTGTCTACAAGTGCGGTTCTTTGATCGATCTCTGCCGCGGAC 
CTCATGTTCGTCACACCGGCAAGGTGAAGGCCCTGAAGATCACCAAGAACTCCTCTACCTACTGGGAGGGTAAGGCTGACGCTGAGACACTGCAA 
CGCGTCTACGGCATCTCGTTCCCCGACCCGAAGCAGCTCAAGGAGTGGGAGAAACTTCAGGAGGAGGCGGCTAAGCGTGATCACCGCAAAATCGG 
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TCGCGAGCAGGAGCTGTTCTTCTTCCATGAACTTTCGCCTGGCTCATGCTTTTTCCAGCCGCGAGGAGCGCACATCTACAACACCCTGATGGGCT 
TCATCAAGGCGGAATATAGAAAGCGTGGTTTCCAAGAAGTCATTTCCCCCAACATCTACAACGCTAAGTTGTGGATGACCTCCGGCCACTGGCAG 
CACTATGCCGAGAACATGTTCTCCTTCGAGGCCGAGAAGGAGAAGTTCGCCCTCAAGCCCATGAATTGTCCAGGACATTGTCTCATCTTCGATAA 
CCGCAATCGATCCTGGCGAGAGCTGCCACTGCGAATGGCCGACTTCGGTGTGCTTCATCGCAACGAGCTCTCCGGTGCTTTAACCGGATTAACTC 
GCGTGCGTCGCTTCCAGCAGGACGATGCGCACATCTTCTGTGCCCCCGAGCAGATTAAAAGCGAAATGAAGGGCTGCCTGGAGTTCCTGAAGTAT 
GTGTATACCATTTTCGGATTCTCCTTCCAGCTGGTGCTGTCCACACGCCCGGATAATTACCTTGGCGAGTTGGAACAGTGGAATGATGCGGAAAA 
GGCGCTGGCTGAATCACTCAACGAGTTTGGAATGCCGTGGAAGGAGAACCCCGGCGATGGCGCCTTCTACGGACCGAAGATTGATATTACCATTA 
TGGATGCACTGAAGCGTGCCCATCAGTGCGCCACCATCCAGTTGGACTTCCAGCTGCCAATTCGTTTTAATCTCAGCTACATCGCCGACGATGGC 
GAGAAGAAGAGGCCGGTGATCATCCATCGCGCTATTCTTGGCTCAGTGGAGCGGATGATTGCCATTCTCACGGAGAACTTTGCTGGCAAGTGGCC 
CTTCTGGCTATCACCGCGCCAGGTGATGGTCGTTCCAGTGGGTCCTGCCTATGATCAGTACGCCCAGTCAGTGCGGGATCAGTTGCACGATGCTG 
GATTCATGAGCGAGGCGGATTGCGATGCCGGCGACACGATGAACAAGAAGATACGCAATGCTCAGTTGGCCCAGTTCAACTTCATTCTGGTGGTG 
GGC GAC AAG G AGC GC T C C T C GAAT AC C GT T AAC GTGC G C AC T C GG GAC AAC AAGG T GC ACG GCG AGG T C T C C GT GGCGG AAC T CAT C AC AAAGC T 
GCAAAAGATTCGCGACGAATTCATCGCCAACGAGGACAGCTTCTAA 
(SEQ ID WO: 1196) 

Start ATG: 254 

MKKEKKEKPSGGGDTRKELSPLPKYIEERNVFWEKCKAEYEAELAAKKREPIKVTLPDGKQVDATSWETTPYEVARGISQGLADNTVISKVNGEV 
WDLDRVLEGNCTLQLLKFDDPEAQAVFWHSSAHIMGEAMERIYGGHLCYGPPIENGFYYDMHLEGEGISTNDYGAMEGLVKQIVKEKQNFERLEM 
KKSDLLEMFKYNEFKVRILNEKVTTDRTTVYKCGSLIDLCRGPHVRHTGKVKALKITKNSSTYWEGKADAETLQRVYGISFPDPKQLKEWEKLQE 
EAAKRDHRKIGREQELFFFHELSPGSCFFQPRGAHIYNTLMGFIKAEYRKRGFQEVISPNIYNAKLWMTSGHWQHYAENMFSFEAEKEKFALKPM 
NCPGHCLIFDNRNRSWP^LPLRMADFGVLHRNELSGALTGLTRVRRFQQDDAHIFCAPEQIKSEMKGCLEFLKYVYTIFGFSFQLVLSTRPDNYL 
GELEQWNDAEKALAESLNEFGMPWKENPGDGAFYGPKIDITIMDALKRAHQCATIQLDFQLPIRFNLSYIADDGEKKRPVIIHRAILGSVERMIA 
ILTENFAGKWPFWLSPRQVMVVPVGPAYDQYAQSVRDQLHDAGFMSEADCDAGDTMNKKIRNAQLAQFNFILVVGDKERSSNTVNVRTRDNKVHG 

EVSVAELITKLQKIRDEFIANEDSF* 
(SEQ ID NO: 1197) 

Name: threonyl-tRNA synthetase 
Classification: enzyme 



Celera Sequence No. : 142000013384832 

GTAAAAAAGCGTAGCAAACTAGTTGGGCAACGGAAAAGGATGTGGCTGCTGCGATGGCGATGGCGATGGCCAATGGCAATTGGAATTGCAAGGAC 
AACGCCAACGGCTGCTGCTAGGCACTGCTATGCCACGAAATCGAGCGGCCGGTTGAAGCCACCCTTGCGATTCATGTACTGGCGGTACTTTCGCT 
TGAGGATTACATGCACTTCTCCGACATCGTTGCCCTCGACCTTCCTGTTCTTGGTGGTGTCGAACGTGCAGAATCCCATTGTCTTCAGCATCTCG 
ACCTCTTCCGGCGATTTACCCTCCAGGTCCGCCTCGTTGATCTTGGGCCGGTCGGCAGCTGTGGCCGCCGCATTGTTCCTCGTCGTCCTTCGCGA 
CGTCGATGGTCCGGAGCTGCTGTTGCCGCCGCTGCTTCCGCCGCCGCCTCCTCCTCCTCCTCCTCCGGCGACCCTCGACCGCGATCTGCAAATTA 

T GAAT GTGTGTGCATAGTGAGTGGTGAGTGTT C TAT AG AT T T C G T T T G AGT GAC AGG AC C GAC C AG T T G AT T G CAT T CG AAT T GC AAC GAG C C T G 
GGAAATAACTGGGGCGACCTTGGGCGTTGTGCACCCGATTGTGCAATATGAAGTTTTAATTACGCGAAATGCGACACTTCTCAACCGGTCGGTTG 
GCTTCATTAAAGACCATATTTACGTCTCGATTTTGGCAATCATCTGAAGACCCAAAATAACCGGCTTGTATATGCATATGCTAGAGAATTTCATT 
CTAAGATTTAAAATCAAGCTCAAGATCCCTCTAAAATCCATTCTGAATCCATTTATATACCTAACAGACAATATTCCTCTATTCCTCTAGCTTAT 
ATCTCAAAAACTTCCATTGGAAAATAGCTAATTGATGCTTGCAGCTCCCAGACTTTGTTTTAGTTTCGTGTTTTGTATTTCTTAAATGAAATGTA 
GAAGATAACAGATAATCCTTATATCAAGTAGAAAGTGGGGAAAGTTTCGCTTAGGACATGTCCTCGATGAGGGCCGCACGCAGCTTGGTCGTCAC 
TGTGGCGGGCTGTGCCTGGTTAAGCTTGAAGACGCTCTCGATCACGGAGATCTCTGTGGCCGTGGTGTTCATGCCCACGAAGGAGAGCCAGTCGT 
TGACGACCATGCCGGCGGCGAGTACTTCGCTGCCCCGGTTCACTGTTCCGGCCACGAGGGGAACCTGCAGCAGGGACGACAGTTCGTCCTGGTCC 
TGAATGCTCGTCTTGGGATGCACCATGCCGCCCTGGTTGCTCAGCACGGCGTAAGAGCCCACCAGTGAGTTGTCGGCAATGGTCTGGCGGAAGAC 
CTCTACTTTGAGCACGTCCGCGATGATCTCCTCGGTCTCCTTGTCCAGATCCGGGTGCACCAGGGCCACATAATCATTGCAGGCGATAACGTTGC 
CCAGCGCGGACAGGCGCTCCTCCACACGATAAATCTTCACGGCGTCTGGCAGGCTGTTACGCAGGTGTTGCAGCTCCTCGTCGGTGGTGGAGTTG 
GGCACCAGCAGGCCGTTGCGGTTGCCCACGGTGAGGCGGCCGATGATCCGGCAGCCGCCCACATTCGCATGCACCACCGGGATGGTGTCGCCCAG 
CTCCGCCTCGAAGGCGCTGTAGAAGGTCTCGGATCCACCGATGGCCACCAGGCAGTATGTGTTGGTTAGTTTAGTGAAGACGCCGATGTCGTCGT 
TGTTCTCGAATTGGACGCGTAGAGCCATTTTCTCTGTATTTACGTGGATCCTCTTTTCCTGCTGGTTACTCTCTTTTCGGATTTACTGGCCTGAC 
TGAGCCTGCTGCCGGATGAGTTCTGGATTCTGTTTATATGTCCGGAAATCAATTTAAATGTCGTGATGAAATTCGCACAATTTCGCTTCACGTGG 
TAATTTTTTTTTGATGTTGGGACGCCAGAGCTGCATAGCCATCGATTGGCTGCCTATCGAAGTGTGGGTCTATCGATACTCGTGGGTAAATAAAC 
AAGTTCTGAACGGCGATTTCGGGGGTTTGAGGGGTCAATTGTCCCCTGTGTTGGAATGTGTTCCTAAATCTACACAAACACTCCCTAAGCTTATC 
CTAAACTTATAAATATTGGTTGCTATTTAAACCCCATTTCACGGTTATCCAGCACGCCCCTGAACTGTGACCCACATCCCCGATTTTAGTGACTA 
GTTTTATACTTATCGTGGTTGGCATTTGGTACACTACACTTTCTTATTCACCTAGATCGCCGACTCCGCGCACGGTCGCGCTCCCGTTCCCGCTC 
CCGATCTCGGCTGCGACTGCGGTCGCGATCCCGTTCCCGGTCGCGGCGACCGGCGCCTCCAGATCCGGATCCTCCGGCTGGCGCTACTCCGATGG 
CCTCCCGCTCCCGGGAACGGCGCTCCCTGCGCTTCTTCCGCTCGGATCGCTCCTTCTCGCGTCTCCTGTGCGCCGGCGAGGCGGGACTTCGGCTG 
CGGCCCATGGCTCCGTTTCGGTCTCTGCTTGTGTGGCGGTTTGAGTGATTGATGATGGTTTGCCTGCAGTGGATTTGAGCGGATGTTAGACTCCA 
TGTACTCCAATTGCACCGCACGCATCCCGCTTACCGTTGCACTAGGCTGTGTCTAGCATTATTTGGGCGACACCGGCGGTGGGTAAAAACGCGCT 
AACTTAAAACTAAAACCAAATAAACAACGCGCGTTATCTCGCCGCGACGCCGAAACGGAATGAGGAATGCAGAATGCGGATGATGATGCGAGGGA 
TGCGATGCAAGGGCAAAACATGGCCACCTATCGGATATCGATTGATGGTCGATTTTATCTTCGATAAAAATGTATCCTATGTATATCGCATACCG 

AATGTTTAAATCGATCTGTTTCCTTTTGAATATGAACTAT 
(SEQ ID NO: 1198) 

Exon: 1890 . .1001 

Start ATG: 1738 (Reverse strand: CAT) 
Transcript No. : CT38864 

CGAAATTGTGCGAATTTCATCACGACATTTAAATTGATTTCCGGACATATAAACAGAATCCAGAACTCATCCGGCAGCAGGCTCAGTCAGGCCAG 
TAAATCCGAAAAGAGAGTAACCAGCAGGAAAAGAGGATCCACGTAAATACAGAGAAAATGGCTCTACGCGTCCAATTCGAGAACAACGACGACAT 
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CGGCGTCTTCACTAAACTAACCAACACATACTGCCTGGTGGCCATCGGTGGATCCGAGACCTTCTACAGCGCCTTCGAGGCGGAGCTGGGCGACA 

CCATCCCGGTGGTGCATGCGAATGTGGGCGGCTGCCGGATCATCGGCCGCCTCACCGTGGGCAACCGCAACGGCCTGCTGGTGCCCAACTCCACC 

ACCGACGAGGAGCTGCAACACCTGCGTAACAGCCTGCCAGACGCCGTGAAGATTTATCGTGTGGAGGAGCGCCTGTCCGCGCTGGGCAACGTTAT 

CGCCTGCAATGATTATGTGGCCCTGGTGCACCCGGATCTGGACAAGGAGACCGAGGAGATCATCGCGGACGTGCTCAAAGTAGAGGTCTTCCGCC 

AGACCATTGCCGACAACTCACTGGTGGGCTCTTACGCCGTGCTGAGCAACCAGGGCGGCATGGTGCATCCCAAGACGAGCATTCAGGACCAGGAC 

GAACTGTCGTCCCTGCTGCAGGTTCCCCTCGTGGCCGGAACAGTGAACCGGGGCAGCGAAGTACTCGCCGCCGGCATGGTCGTCAACGACTGGCT 

CTCCTTCGTGGGCATGAACACCACGGCCACAGAGATCTCCGTGATCGAGAGCGTCTTCAAGCTTAACCAGGCACAGCCCGCCACAGTGACGACCA 

AGCTGCGTGCGGCCCTCATCGAGGACATGTCCTAA 

(SEQ ID NO: 1199) 

Start ATG: 153 (Reverse strand: CAT) 

MALRVQFENNDDIGVFTKLTNTYCLVAIGGSETFYSAFEAELGDTIPVVHANVGGCRIIGRLTVGNRNGLLVPNSTTDEELQHLRNSLPDAVKIY 
RVEERLSALGNVIACNDYVALVHPDLDKETEEIIADVLKVEVFRQTIADNSLVGSYAVLSNQGGMVHPKTSIQDQDELSSLLQVPLVAGTVNRGS 
EVLAAGMVVNDWLSFVGMNTTATEISVIESVFKLNQAQPATVTTKLRAALIEDMS* 
(SEQ ID NO: 1200) 

Name: EIF-6 family protein 
Classification: translation factor 



Celera Sequence No. : 142000013384665 

ATCTCATCTCATATGAAAGCAATATCATATATGTATGTTAATATTAAATAGATATTAAACATAAAAATCCATCCCACCCACATTCCGCTGAAAAA 
TGAAAACACTCGCTTTTGCCGAAGTGGAATTTTATTGTAATTAAATTTGTTGGGTGTACAAAGGTATGTTTTAAGGACTTTCTCTCTGCATTTAT 
TACAATTTCTTCAAATGGTTTTAATTGCTGGGCTTTTGCTTCGAAGACGGGCAATCAAGTTTCAAGAATCCTATTTTCTTTAATCATACAAATTA 
TGTTTTATTTCGCTTTAATATATAAGGATACATATATATATCTGCTATATATAGAAGTGTGTTGACTTGCGTATATTTCGTGAAAAGTAAAAGTT 
TTCTTCTTCGCAATTTCCCTTGAACTAACATTCCAATTCGGGTTTTGGTTATGTTTTTTTTTTCTGTTTTACTCTCATTATTTTGTTTGTTTGTT 
TTTTTTAAATAATTTTGCATTTTCTGCATACATGTGGTTTTGTGTTTTGCAATTTTTTCAATGATTTTTCCTTGGGTTTTTATTTGTTTTTAGTT 
TTCGATTTGTTTAATAAAAAAATATGTATTTCACTGCCCAGAGCGAAAGCCATTTCTTACTTCCGCTTGCGTTAATATATCATTTTATGTATGTA 
TGGTGTAAAAACATCATTTACTGGTAAAACTCTCGATATCCTTTGTAACAGAAATCGGAACGCAAATGAAATCGAGGCGACTTACCAGCTCCTTC 
TCATTAAATACATATATGTTCTTGCAAATGTTTATAAATGTATGCATGTTTTTCCTTCTCTCGGATAATAGTGTAATCATAAAAATCCTGTCTGC 
TTTTTTGTTGTTTATTAATATGCTTCATAAGTTGTTTACGCTTTAGCGCTTTTCATGTGTTTCTTGTATTTCTCATATATGTTGTGGTTGTTTAA 
TATGTGTGTTGCTGTGTGTGTGTTTTTGTGTTTATATATATTATTTTTTTTTTGCTGTTTTTACATAGGTTTTGCCATTTTTTTGTAGTTTTTTC 
GATTTATGTTTTGTTTTCATTTGTCAACATATTTTTTCAGCTGTTGGTATTCTGTATTTTCTCTCTTTTTTCAAAGTAGCAAAAACTTGACTCTC 
ATATTCTTGCGGTTTTCTTTTTCTTTCTTTCTTTCTATTCTTCTGTTCTCTGCACAGTTTGCTTTCCATTTTTTCGTTTTTATTTTAACTCTTTT 
GCAATTTTTTTGTTTTTTCTTTGCTTTTTTAAAAGTGTATGGTTGTCGCGTTTCTTTTTCTAAACAAAGCTTTCTTAAAGTTTGAATTTGAGAAA 
AAAGAATTGAAA71AAATTACATATCGCTTTCGTGTTTGTTGTTGTTTGAAAGTGTGGATGGAAAAAGTGTACGAAAAACTCAATATGTATGTAAC 
ACTAGAAACTTCTTTAAAATTTTTAAAACACAATTTCAATACTTGCAAGAAGCGGTTTACATGTATGCTTTGCTTTAATCTGTTTATTTTGCAAT 
GGTTTTTTGGTGTTTTCATTATACGCATACAAAATAATATCGATCCAGTTAAAGCAATAACGAAATTTTTTTCATTTCACATTTTTCCGTTTTTT 
TTTTTAATTAACATGTTTATGTAAAAAAAAAGAATTTATAAGTGGACTGCAGAATTTGATTGCGTGACTTTCTTCTTTTGCTTCGTGTACTTGCG 
TAAATGCGTTCGAAATTAAATGATTGAAATTGCTTTTGTTGATGTTTCTGGGATTAGAAAAGAAAAAGCCGTAGATTTTTCGCTTTGTTTTTCTG 
TGTTAATTGTGTGTCCTTGGCAATCCTCCCCTGCGCAAAGCATCAGTGAATGCGAATTTTTGTTGTTCATGTATTGAAAGATTAGTGGAAAACTG 
CAAGGACAAAAGAGAGGCGAGTTTAGACAGCTTGAACTTTTCGGAAAATTTGTAGACTACTTACTTTTTGGGTGTAAAGTGTTCTTGTAAAATGT 
TCATACGCTTGAGCTGTTCGAGCTTAGTTGGCGGGCTCGGTGACGCGGTGCACCTGAAGCACGGCCACAGCCTCCTCCACCTTGGCCTTAAGGGC 
CTCCTGATCCTCGATCATGTGCAAGAGCTCAGAGTTTTCGATCTCCAACAACATGCCGGTGATCTTACCAGCCAAATTGGCGTGCATATGCTCAA 
TCATCGGATACAGACGCTCGCCCAGGATCTGTTTCTGCTCCTGCGGCTTGGCGTTGGCCAGCAACGAGGCAATGAGCTTCTCAGAATCTGCAAGA 
AAAATAGCAATGGTGCCATTAGTGACAAATTGTGCATATTTTCAGTCGACCTAAAACTTACTCTTTCCTTGCAATTGCTGTGGAATTGGCTGGGT 
CTGATGCAACTGTGGCACTGGAGGATTGCGCATGTTTGAGGTGTACTTATAGTTGGAAGTACGCTGCTGGGCACCGCCAGCGATTTGGGCTCCGG 
GGATCTGCATGTTGGGTGCTGCAGTTTGCTGGCCAGTAATAGCGCGTGCTCCAGTGTTGCGCATGTTGTTAGCCGCAGCAGCGGCGGCATGTGTG 
CCCTGCACCTGAGGCTGTGCCCCGCGAGCGCCAGCAGCAGCCGAACGGAACTGAGTGGGCACCGCACCGGCTGTGCCCTGGAAGCCACCAGCAGC 
GGCAGCTCCGGCCTGGACACCCTGTATGGCTGCTGGGGGACGCACCTGGGGCACCCAGCGGGGTGTGTTTCTCATCTGAGTGGCCACCTGGGAAC 
CGAAGAAACGCTGATTTGACGGAAGGGTCGGCACAAAGAAGCCGCTAGCCGCGTTGGGCTGGTAGATCTGTCCCAGCTGCTGCATACGCATGCCG 
GTCATGTGACGCATGTACTGCGAGGCGAGATGAGCCTTGCGCTCCTCCTTCCTTTGGGCCAAAGCAACGTACAACGGCTTACTGCCGACGACGCG 
ACCGTTCAGCTCGGTGACGGCGCAGGTAGCCTCGCTTGCGGCATTGAAGCACACGAAGCCGAATCCCTTGGAGCGACCCTCCTCATCGGTCATGA 
CCTTGGCCGATGTGATGTTGCCGTATGGAGAGAACGCGATGCGCAGACGATCGTCATCGATTGTGTCGTCCAGGTTCTTAACGTACAAGTTAACG 
CCGAACACAGACTCGTGACGCTTCTGCTTCAGTTCCTCGAACTTGCGCTTCAGCTCCTGCTGGCGTTCGGCCTTCTTCTGGGCACGAGCCACGTA 
CAGGGACTTGCCCTCTCCCATGTCCTTGCCATTGAGAGCCTGAACGGCCGCCTCAGCAGCCTCTGTTGTCTCGAATGCAACGAATCCGAAGCCCT 
TGCTCTTGCCATCCTCCTTGGACATGACCTTGTAGCTGGTTATCTTGCCGTAGGGCTCAAAGAATTCCTTCAGTTTTTCATCGTCGAAATCCTCA 
GTGAAGTTCTTCACATATACATTGGTGAAGAGCTTAGCCTTCTCGCCCAGCTCCTTCTCGCGCTCCTTGCGCGGGATGAACTTACCCACGTAGAC 
CTTCTTGCCGTTGAGCAACATGCCATTGACTTTGTCGATGGACGTGTTTGCGGCCTCCTCGGTCTCGAAGTGGACGAATCCATAGCCCTTTGAGT 
TGCCCTTTTCATCGGTGGCTACCTTGCAGCTTAAAATGTTGCCGAAGGCGGAGAAAGTGTCGTAGATTGCCTTGTTGTCGATGGCCCTATCTAAG 
TTCTTGATGAACACGTTGCCCACACCTGAGCGACGAAGAGAAGGATCACGCTGAGACCACATAATGCGAATGGGCTTGTTGCGAACCAGGTCAAA 
GTTCATGGTGTCCAAAGCACGCTCAGCTGTTGATGAAAATCGGGAAAACATTTGGTTACTACCGACACGATTTGGGTATATCAAGATCATTATAA 
GTTGAAGGGGCTTAAATAACAATGTGACTCTTCAACTTCTGCACTGGCCATGAGCAAAAGCAGCGCAGTGCATGGTAAGTCAAGTCTGATCTTTT 
ATCTGGCAACTATGAATCTTAGTCGTGTCAAAAAGGGTTGAATCTGACCAACCAATGGGTAGGAGTTGCTAGATGTCTAAATACAAGGACAATCT 
TATCAGTCTAATCATGTGATACATTGAGTGTTTGTTATGCAGATGGTATGCAAAACCCATCAGCATGGTAAGTCGCATTGCTTTCTCTGGCAACT 
ATGAATCTTAGTCGTGTCAAAATGGGTTAAATCTGACCAACCAATGGGTAGAAGTTGCTAGAAGCTTAGATAAGAAAGGGCATAATCCTTCGAGT 
CTAATCATATGATGTATTATTTAGTGTTTGTTATGCTTCTACTTCTGTGGATGGCATAGATCAGCCATCTACATGGTAAGTTGCATTGCTTTCTC 
TGGCAACATTGAATCTTAGTCGTGTCAAAAAGAGATAAATCTGACCAACCAATGGGTAGGAGTTGCGAGATATGAATCGAAAAAAGAAAAACATT 
TAGGGTGTTACTAGTCTAGAACGAGATGGTATCATTTTTGAAAGGTCTGTTTTTCCAACCTAACCTrAACAGCTAGAGTTCAAGAAACTTGCTGG 
AATGCGACAGACTTTCACACGCAGCGTAATCATCACCAACCGCCAGCTTTGTGCCATTGTGCGCTTAATTGAATAAACAAATGTAAGCGAACAAC 
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GGGAATCGAAGAAGCACCAACTATGCGCAGTGAAAGTTTTGTTTTCGTTTTTTGTAAAAACAAGATTTTGATGGTTAAAAGftGAGGCCGAAAftGC 
AGTTCACTGACATGTCAAGAATTTGGAAGTGTTGATGGATGGAATGCGGAGAGTATGGAAGATATGAGAAAGTGAAAGCAAAACAAAAGTTGGAG 
GCAGTACGAAAACTAAAGAGCGAACGTGCGGAAACTCAGTTAATGGAATATAACCTAAAATAGATGAAGGCCAAAAAGAAATGGTCCAAACAACA 

atgaacacmott™^ 

A^GGAAACAATGAAACCACCAATGGGCGGGGCTGCGATAAGCAGTTTGGGCAGTATGGAACCATATTATTCCATCATAATTGCAATTTTGTGCGA 
AATTGAGACGGAAATTTCTAGGCGAACAGTTGATGAGTGCCACGAGCCAGCTGTTTAGCCCCCATCATCCCCCCGCACAAATCGTATACATCCAG 
AATAACTGGAACTGATATAATCATGTGTGTGTGTGTACACACTGGCGGGAATTGAGTGCGTGTGTGTGTGCGCGACGAGATTAACGTAAGCAGTG 
CACTGTGGAACAAGTTATGAAACACTTGGAAGCCGTTTATCTGGTCTCCTATGAAAGACATCTTTCAATAAAGTCCAGTTTGGACTTGCGTTTAG 

ccccactgScatgcgctagtgtgcgtgcgtgtgtgagtaagcgagtgcaaaactgcacacgtgtttt 

ACTTTTGCGGGCAGTAAGACTAAAAAAATTACACTACTTTTGCGGGGATTACTCACCATCGGCTGGCTGCTGGAAGTTGACGTAGGCATAGCCCA 

acgagcgacgggtaatcacatcgcggcagacacgaatggacagcactggaccagcagacgagaacttgtcaaaaagtcccgattcgttgacgtcc 

TGTGGGAGATCAC^ACGTATAGAGAAGCCATATTTGGTGATCAGGTCTACCTGC^^ 

TTGGGTTTTTTCTTAATCTTTACGGACCGGATGGATTCGACCAGGCAGGCGGTGAAATAGAAAGAAAAAAAGCGGCGTGGCTTTCGGTCTTCTAT 
TCGTCTCTCTGTCTGTGTGTCTGGGTGGCAACACTGGTACATGTGACAGTCGGTAGACATGGCAACGTTGGCATTTGCTTTGTTGTTGCTCTTTG 
CTCTCCCCTTTTATTTCTCTTCGTGTTGGAAAATTTCCACACAGACACAAAGATAGACACGCGTTTGCCTGATCGAATTTTGGGGAATTCTTTGA 
TTTTCTCGAAAATATGCGACAACTAGTTCCTAGACCACTATTTTGCGGGCAATCTGGCGTTTATAGACGCCATCTTTGAGCGAAGAACGACCGCA 
AACAAAGCAAATTTCGATGGGAAAGGAAAATTTCCCTATTCGCTTTGTCGTGCGTTGCGCCCTCTCACTCTCTACACCATTGTAACCAATAGCTT 
TACACTGTACTTACTTCTCAAACACGCTCGATTTTTTTTGCTACTTTTTACAATTTTTTTTGTTACGTTTTTGTGTTTTAATATTTTTTTTTATT 
TTTTATATGTTTTTTCTTCACAAAAAATATGTGATAAGCCGTGTGCTGAAAGCGAATCCACGACCGCTCACGTAAACGGACTCTACACGACTCAT 
AGAGAGGAGATTCGTTAGCGATGACCCCACAAAGTATCGATTAAATCCCACTGCACAATGCTGCCACATAGGGTTAGAGACAAATAAATGTCACT 
AAAAAATCCGTGAGGACAATTTACTAATTACGAAGCATTATTTTGAATTTCAAAAAAGCCAGCCATCACACATAAAAAATTTTTATATTTAGCTA 
ATTCCCGAAACATGAAATACTTGTGCATTTTCAGGGGTTCTTTGGTAACTCTGCTTCATTTATGTTCTGATATTTTCGCCAATGTCCTTGAATAC 
TTTGTCTCATAAAACACAATCCAAAATTAGGAATGCAGTTTTTTTTTCTTTTAAAAATGAGCGTAGACTTTTAGAAAATGATTTTTTGGCACTGC 
ACAGATATTAGCGTATGCTCCGAAAACCGAATCCCATCTTTGTAATTCCCCACTGTGCGAACTTACCTAAATCAGAGAATAAAGCGATGAGGCTG 
CAAGTCGACACTCCGAGAAGTGCTTGAGTACTTCTTTTCGTTCTCGATGGAAATCGGCACTACAGAGATTGAAGAATTTTTTAAGTTTTTTTTTT 
GTGTTTTTGTTTTTACGAAATAGACCACATTCAAGGATGGCTGAGGATAATTTTTTTTTTCTTTTTCCCGTTTTGAACGTCTGGTGCATGTGCCC 
TATCTGGCGATCGAGTACCTAATGAAGCGGTGTGAGGCGTCGGTGCCTGACAAGCTTCAAAGCGAAGTGTCACTTTGGGCGCCTACTGGCGTGTC 
CTGAGTAAAATCTTAAAAGATTTGACATTTCGGAAACATTTATTTGGGCATTATATACTTCATTAAACATATTTCGATCTGGTTTGGCCAAACAT 
TTACTTGGCTTGAGAATTAGTTTGCTTTCAACTCAATTTTGATTTTTCGTAGCATTTAATAGTTAAACATAAAACGTACTGGCTTAATTCATTAT 

TGTGTAGTTTTGCCTAGACTATGGGTGAAAATTATTG 
(SEQ ID NO: 1201) 

Exon: 6447.-6285 
Exon: 5751.. 5567 
Exon: 3826.-2342 
Exon: 2272.. 1965 
Exon: 1897.. 1001 

Start ATG: 5732 (Reverse strand: CAT) 

Transcript No * CT39086 

AGTCCGTTTACGTGAGCGGTCGTGGATTCGCTTT 

ATTAAAACACAAAAACGTAACAAAAAAAATTGTAAA 

TCTATACGTCGGTGATCTCCCACAGGACGTCAACGAATCGGGACTTTTTGACAAGTTCTCGTCTGCTGGTCCAGTGCTGTCCATTCGTGTCTGCC 
GCGATGTGATTACCCGTCGCTCGTTGGGCTATGCCTACGTCAACTTCCAGCAGCCAGCCGATGCTGAGCGTGCTTTGGACACCATGAACTTTGAC 
CTGGTTCGCAACAAGCCCATTCGCATTATGTGGTCTCAGCGTGATCCTTCTCTTCGTCGCTCAGGTGTGGGCAACGTGTTCATCAAGAACTTAGA 
TAGGGCCATCGACAACAAGGCAATCTACGACACTTTCTCCGCCTTCGGCAACATTTTAAGCTGCAAGGTAGCCACCGATGAAAAGGGCAACTCAA 
AGGGCTATGGATTCGTCCACTTCGAGACCGAGGAGGCCGCAAACACGTCCATCGACAAAGTCAATGGCATGTTGCTCAACGGCAAGAAGGTCTAC 
GTGGGTAAGTTCATCCCGCGCAAGGAGCGCGAGAAGGAGCTGGGCGAGAAGGCTAAGCTCTTCACCAATGTATATGTGAAGAACTTCACTGAGGA 
TTTCGACGATGAAAAACTGAAGGAATTCTTTGAGCCCTACGGCAAGATAACCAGCTACAAGGTCATGTCCAAGGAGGATGGCAAGAGCAAGGGCT 
TCGGATTCGTTGCATTCGAGACAACAGAGGCTGCTGAGGCGGCCGTTCAGGCTCTCAATGGCAAGGACATGGGAGAGGGCAAGTCCCTGTACGTG 
GCTCGTGCCCAGAAGAAGGCCGAACGCCAGCAGGAGCTGAAGCGCAAGTTCGAGGAACTGAAGCAGAAGCGTCACGAGTCTGTGTTCGGCGTTAA 
CTTGTACGTTAAGAACCTGGACGACACAATCGATGACGATCGTCTGCGCATCGCGTTCTCTCCATACGGCAACATCACATCGGCCAAGGTCATGA 
CCGATGAGGAGGGTCGCTCCAAGGGATTCGGCTTCGTGTGCTTCAATGCCGCAAGCGAGGCTACCTGCGCCGTCACCGAGCTGAACGGTCGCGTC 
GTCGGCAGTAAGCCGTTGTACGTTGCTTTGGCCCAAAGGAAGGAGGAGCGCAAGGCTCATCTCGCCTCGCAGTACATGCGTCACATGACCGGCAT 
GCGTATGCAGCAGCTGGGACAGATCTACCAGCCCAACGCGGCTAGCGGCTTCTTTGTGCCGACCCTTCCGTCAAATCAGCGTTTCTTCGGTTCCC 
AGGTGGCCACTCAGATGAGAAACACACCCCGCTGGGTGCCCCAGGTGCGTCCCCCAGCAGCCATACAGGGTGTCCAGGCCGGAGCTGCCGCTGCT 
GGTGGCTTCCAGGGCACAGCCGGTGCGGTGCCCACTCAGTTCCGTTCGGCTGCTGCTGGCGCTCGCGGGGCACAGCCTCAGGTGCAGGGCACACA 
TGCCGCCGCTGCTGCGGCTAACAACATGCGCAACACTGGAGCACGCGCTATTACTGGCCAGCAAACTGCAGCACCCAACATGCAGATCCCCGGAG 
CCCAAATCGCTGGCGGTGCCCAGCAGCGTACTTCCAACTATAAGTACACCTCAAACATGCGCAATCCTCCAGTGCCACAGTTGCATCAGACCCAG 
CCAATTCCACAGCAATTGCAAGGAAAGAATTCTGAGAAGCTCATTGCCTCGTTGCTGGCCAACGCCAAGCCGCAGGAGCAGAAACAGATCCTGGG 
CGAGCGTCTGTATCCGATGATTGAGCATATGCACGCCAATTTGGCTGGTAAGATCACCGGCATGTTGTTGGAGATCGAAAACTCTGAGCTCTTGC 
ACATGATCGAGGATCAGGAGGCCCTTAAGGCCAAGGTGGAGGAGGCTGTGGCCGTGCTTCAGGTGCACCGCGTCACCGAGCCCGCCAACTAAGCT 

C G AAC AGC T C AAGC GT AT G AAC AT T T T AC AAG AAC AC T T T AC AC C C AAAAAT T T T C C AC T AAT C T T T C AAT AC AT G AAC AA C AAAAATT C GC AT T 
CACTGATGCTTTGCGCAGGGGAGGATTGCCAAGGACACACAATTAACACAGAAAAACAAAGCGAAAAATCTACGGCTTTTTCTTTTCTAATCCCA 
GAAACATCAACAAAAGCAATTTCAATCATTTAATTTCGAACGCATTTACGCAAGTACACGAAGCAAAAGAAGAAAGTCACGCAATCAAATTCTGC 
AGTCCACTTATAAATTCTTTTTTTTTACATAAACATGTTAATTAAAAAAAAAAACGGAAAAATGTGAAATGAAAAAAATTTCGTTATTGCTTTAA 
CTGGATCGATATTATTTTGTATGCGTATAATGAAAACACCAAAAAACCATTGCAAAATAAACAGATTAAAGCAAAGCATACATGTAAACCGCTTC 
TTGCAAGTATTGAAATTGTGTTTTAAAAATTTTAAAGAAGTTTCTAGTGTTACATACATATTGAGTTTTTCGTACACTTTTTCCATCCACACTTT 
CAAACAACAACAAACACGAAAGCGATATGTAATTTTTTTCAATTCTTTTTTCTCAAATTCAAACTTTAAGAAAGCTTTGTTTAGAAAAAGAAACG 
CGACAACCATACACTTTTAAAAAAGCAAAGAAAAAACAAAAAAATTGCAAAAGAGTTAAAATAAAAACGAAAAAATGGAAA 
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GAACAGAAGAATAGAAAGAAAGAAAGAAAAAGAAAACCGCAAGAATATGAGAGTCAAGTTTTTGCTACTTTGAAAAAAGAGAGAAAATACAGAAT 
AC C AAC AGCT GAAAAAAT ATGTTGACAAAT GAAAAC AAAAC AT AAATCGAAAAAACTAC AAAAAAATGGC AAAACCT ATGTAAAAACAGCAAA 
(SEQ ID NO; 1202) 

Start ATG: 183 (Reverse strand: CAT) 

MASLYVGDLPQDVNESGLFDKFSSAGPVLSIRVCRDVITRRSLGYAYVNFQQPADAERALDTMNFDLVRNKPIRIMWSQRDPSLRRSGVGNVFIK 

NLDRAIDKKAIYDTFSAFGNILSCKVATDEKGNSKGYGFVHFETEEAANTSIDKVNGMLLNGKKVYVGKFIPRKEREKELGEKAKLFTNVYVKNF 

TEDFDDEKLKEFFEPYGKITSYKVMSKEDGKSKGFGFVAFETTEAAEAAVQALNGKDMGEGKSLYVARAQKKAERQQELKRKFEELKQKRHESVF 

GVNLYVKNLDDTIDDDRLRIAFSPYGNITSAKVMTDEEGRSKGFGFVCFNAASEATCAVTELKGRVVGSKPLYVALAQRKEERKAHLASQYMRHM 

TGMRMQQLGQIYQPNAASGFFVPTLPSNQRFFGSQVATQMRNTPRWVPQVRPPAAIQGVQAGAAAAGGFQGTAGAVPTQFRSAAAGARGAQPQVQ 

GTHAAAAAANNMRNTGARAITGQQTAAPWMQIPGAQIAGGAQQRTSNYKYTSNMRNPPVPQLHQTQPIPQQLQGKNSEKLIASLLANAKPQEQKQ 

ILGERLYPMI EHMH AN L AGK I T GML LE IENSELLHMI EDQE AL KAK VE E A VA VLQ VH R VT E PAN * 

(SEQ ID NO: 1203) 

Classification: RNA_binding 
Gene Symbol: pAbp 
FlyBase ID; FBgn0003031 



Celera Sequence No. : 142000013384665 

ATCTCATCTCATATGAAAGCAATATCATATATGTATGTTAATATTAAATAGATATTAAACATAAAAATCCATCCCACCCACATTCCGCTGAAAAA 
TGAAAACACTCGCTTTTGCCGAAGTGGAATTTTATTGTAATTAAATTTGTTGGGTGTACAAAGGTATGTTTTAAGGACTTTCTCTCTGCATTTAT 
TACAATTTCTTCAAATGGTTTTAATTGCTGGGCTTTTGCTTCGAAGACGGGCAATCAAGTTTCAAGAATCCTATTTTCTTTAATCATACAAATTA 
TGTTTTATTTCGCTTTAATATATAAGGATACATATATATATCTGCTATATATAGAAGTGTGTTGACTTGCGTATATTTCGTGAAAAGTAAAAGTT 
TTCTTCTTCGCAATTTCCCTTGAACTAACATTCCAATTCGGGTTTTGGTTATGTTTTTTTTTTCTGTTTTACTCTCATTATTTTGTTTGTTTGTT 
TTTTTTAAATAATTTTGCATTTTCTGCATACATGTGGTTTTGTGTTTTGCAATTTTTTCAATGATTTTTCCTTGGGTTTTTATTTGTTTTTAGTT 
TTCGATTTGTTTAATAAAAAAATATGTATTTCACTGCCCAGAGCGAAAGCCATTTCTTACTTCCGCTTGCGTTAATATATCATTTTATGTATGTA 
TGGTGTAAAAACATCATTTACTGGTAAAACTCTCGATATCCTTTGTAACAGAAATCGGAACGCAAATGAAATCGAGGCGACTTACCAGCTCCTTC 
TCATTAAATACATATATGTTCTTGCAAATGTTTATAAATGTATGCATGTTTTTCCTTCTCTCGGATAATAGTGTAATCATAAAAATCCTGTCTGC 
TTTTTTGTTGTTTATTAATATGCTTCATAAGTTGTTTACGCTTTAGCGCTTTTCATGTGTTTCTTGTATTTCTCATATATGTTGTGGTTGTTTAA 
TATGTGTGTTGCTGTGTGTGTGTTTTTGTGTTTATATATATTATTTTTTTTTTGCTGTTTTTACATAGGTTTTGCCATTTTTTTGTAGTTTTTTC 
GATTTATGTTTTGTTTTCATTTGTCAACATATTTTTTCAGCTGTTGGTATTCTGTATTTTCTCTCTTTTTTCAAAGTAGCAAAAACTTGACTCTC 
ATATTCTTGCGGTTTTCTTTTTCTTTCTTTCTTTCTATTCTTCTGTTCTCTGCACAGTTTGCTTTCCATTTTTTCGTTTTTATTTTAACTCTTTT 
GCAATTTTTTTGTTTTTTCTTTGCTTTTTTAAAAGTGTATGGTTGTCGCGTTTCTTTTTCTAAACAAAGCTTTCTTAAAGTTTGAATTTGAGAAA 
AAAGAATTGAAAAAAATTACATATCGCTTTCGTGTTTGTTGTTGTTTGAAAGTGTGGATGGAAAAAGTGTACGAAAAACTCAATATGTATGTAAC 
ACTAGAAACTTCTTTAAAATTTTTAAAACACAATTTCAATACTTGCAAGAAGCGGTTTACATGTATGCTTTGCTTTAATCTGTTTATTTTGCAAT 
GGTTTTTTGGTGTTTTCATTATACGCATACAAAATAATATCGATCCAGTTAAAGCAATAACGAAATTTTTTTCATTTCACATTTTTCCGTTTTTT 
TTTTTAATTAACATGTTTATGTAAAAAAAAAGAATTTATAAGTGGACTGCAGAATTTGATTGCGTGACTTTCTTCTTTTGCTTCGTGTACTTGCG 
TAAATGCGTTCGAAATTAAATGATTGAAATTGCTTTTGTTGATGTTTCTGGGATTAGAAAAGAAAAAGCCGTAGATTTTTCGCTTTGTTTTTCTG 
TGTTAATTGTGTGTCCTTGGCAATCCTCCCCTGCGCAAAGCATCAGTGAATGCGAATTTTTGTTGTTCATGTATTGAAAGATTAGTGGAAAACTG 
CAAGGACAAAAGAGAGGCGAGTTTAGACAGCTTGAACTTTTCGGAAAATTTGTAGACTACTTACTTTTTGGGTGTAAAGTGTTCTTGTAAAATGT 
TCATACGCTTGAGCTGTTCGAGCTTAGTTGGCGGGCTCGGTGACGCGGTGCACCTGAAGCACGGCCACAGCCTCCTCCACCTTGGCCTTAAGGGC 
CTCCTGATCCTCGATCATGTGCAAGAGCTCAGAGTTTTCGATCTCCAACAACATGCCGGTGATCTTACCAGCCAAATTGGCGTGCATATGCTCAA 
TCATCGGATACAGACGCTCGCCCAGGATCTGTTTCTGCTCCTGCGGCTTGGCGTTGGCCAGCAACGAGGCAATGAGCTTCTCAGAATCTGCAAGA 
AAAATAGCAATGGTGCCATTAGTGACAAATTGTGCATATTTTCAGTCGACCTAAAACTTACTCTTTCCTTGCAATTGCTGTGGAATTGGCTGGGT 
CTGATGCAACTGTGGCACTGGAGGATTGCGCATGTTTGAGGTGTACTTATAGTTGGAAGTACGCTGCTGGGCACCGCCAGCGATTTGGGCTCCGG 
GGATCTGCATGTTGGGTGCTGCAGTTTGCTGGCCAGTAATAGCGCGTGCTCCAGTGTTGCGCATGTTGTTAGCCGCAGCAGCGGCGGCATGTGTG 
CCCTGCACCTGAGGCTGTGCCCCGCGAGCGCCAGCAGCAGCCGAACGGAACTGAGTGGGCACCGCACCGGCTGTGCCCTGGAAGCCACCAGCAGC 
GGCAGCTCCGGCCTGGACACCCTGTATGGCTGCTGGGGGACGCACCTGGGGCACCCAGCGGGGTGTGTTTCTCATCTGAGTGGCCACCTGGGAAC 
CGAAGAAACGCTGATTTGACGGAAGGGTCGGCACAAAGAAGCCGCTAGCCGCGTTGGGCTGGTAGATCTGTCCCAGCTGCTGCATACGCATGCCG 
GTCATGTGACGCATGTACTGCGAGGCGAGATGAGCCTTGCGCTCCTCCTTCCTTTGGGCCAAAGCAACGTACAACGGCTTACTGCCGACGACGCG 
ACCGTTCAGCTCGGTGACGGCGCAGGTAGCCTCGCTTGCGGCATTGAAGCACACGAAGCCGAATCCCTTGGAGCGACCCTCCTCATCGGTCATGA 
CC T T GGCC GAT GT G AT G T T G C C GT AT GG AGAG AAC GC G AT GC GC AG AC GAT C GT C AT C GAT T GT GT C GT C CAG GT T CT T AAC GT AC AAGT T AAC G 
CCGAACACAGACTCGTGACGCTTCTGCTTCAGTTCCTCGAACTTGCGCTTCAGCTCCTGCTGGCGTTCGGCCTTCTTCTGGGCACGAGCCACGTA 
CAGGGACTTGCCCTCTCCCATGTCCTTGCCATTGAGAGCCTGAACGGCCGCCTCAGCAGCCTCTGTTGTCTCGAATGCAACGAATCCGAAGCCCT 
TGCTCTTGCCATCCTCCTTGGACATGACCTTGTAGCTGGTTATCTTGCCGTAGGGCTCAAAGAATTCCTTCAGTTTTTCATCGTCGAAATCCTCA 
GTGAAGTTCTTCACATATACATTGGTGAAGAGCTTAGCCTTCTCGCCCAGCTCCTTCTCGCGCTCCTTGCGCGGGATGAACTTACCCACGTAGAC 
CTTCTTGCCGTTGAGCAACATGCCATTGACTTTGTCGATGGACGTGTTTGCGGCCTCCTCGGTCTCGAAGTGGACGAATCCATAGCCCTTTGAGT 
TGCCCTTTTCATCGGTGGCTACCTTGCAGCTTAAAATGTTGCCGAAGGCGGAGAAAGTGTCGTAGATTGCCTTGTTGTCGATGGCCCTATCTAAG 
TTCTTGATGAACACGTTGCCCACACCTGAGCGACGAAGAGAAGGATCACGCTGAGACCACATAATGCGAATGGGCTTGTTGCGAACCAGGTCAAA 
GTTCATGGTGTCCAAAGCACGCTCAGCTGTTGATGAAAATCGGGAAAACATTTGGTTACTACCGACACGATTTGGGTATATCAAGATCATTATAA 
GTTGAAGGGGCTTAAATAACAATGTGACTCTTCAACTTCTGCACTGGCCATGAGCAAAAGCAGCGCAGTGCATGGTAAGTCAAGTCTGATCTTTT 
ATCTGGCAACTATGAATCTTAGTCGTGTCAAAAAGGGTTGAATCTGACCAACCAATGGGTAGGAGTTGCTAGATGTCTAAATACAAGGACAATCT 
TATCAGTCTAATCATGTGATACATTGAGTGTTTGTTATGCAGATGGTATGCAAAACCCATCAGCATGGTAAGTCGCATTGCTTTCTCTGGCAACT 
ATGAATCTTAGTCGTGTCAAAATGGGTTAAATCTGACCAACCAATGGGTAGAAGTTGCTAGAAGCTTAGATAAGAAAGGGCATAATCCTTCGAGT 
CTAATCATATGATGTATTATTTAGTGTTTGTTATGCTTCTACTTCTGTGGATGGCATAGATCAGCCATCTACATGGTAAGTTGCATTGCTTTCTC 
TGGCAACATTGAATCTTAGTCGTGTCAAAAAGAGATAAATCTGACCAACCAATGGGTAGGAGTTGCGAGATATGAATCGAAAAAAGAAAAACATT 
TAGGGTGTTACTAGTCTAGAACGAGATGGTATCATTTTTGAAAGGTCTGTTTTTCCAACCTAACCTTAACAGCTAGAGTTCAAGAAACTTGCTGG 
AATGCGACAGACTTTCACACGCAGCGTAATCATCACCAACCGCCAGCTTTGTGCCATTGTGCGCTTAATTGAATAAACAAATGTAAGCGAACAAC 
GGGAATCGAAGAAGCACCAACTATGCGCAGTGAAAGTTTTGTTTTCGTTTTTTGTAAAAACAAGATTTTGATGGTTAAAAGAGAGGCCGAAAAGC 
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AGTTCACTGACATGTCAAGAATTTGGAAGTGTTGATGGATGGAATGCGGAGAGTATGGAAGATATGAGAAAGTGAAAGCAAAACAAAAGTTGGAG 
GCAGTACGAAAACTAAAGAGCGAACGTGCGGAAACTCAGTTAATGGAATATAACCTAAAATAGATGAAGGCCAAAAAGAAATGGTCCAAACAACA 
ATGAACACATGTTTTTGCGCGTCGGCGTGTGCGTGCGCGCAAGTTTGCCGCTTGGAAACAACACACATCACACGAATAAAACGAGAAGAACGGGC 
AAGGAAACAATGAAACCACCAATGGGCGGGGCTGCGATAAGCAGTTTGGGCAGTATGGAACCATATTATTCCATCATAATTGCAATTTTGTGCGA 
AATTGAGACGGAAATTTCTAGGCGAACAGTTGATGAGTGCCACGAGCCAGCTGTTTAGCCCCCATCATCCCCCCGCACAAATCGTATACATCCAG 
AATAACTGGAACTGATATAATCATGTGTGTGTGTGTACACACTGGCGGGAATTGAGTGCGTGTGTGTGTGCGCGACGAGATTAACGTAAGCAGTG 
CACTGTGGAACAAGTTATGAAACACTTGGAAGCCGTTTATCTGGTCTCCTATGAAAGACATCTTTCAATAAAGTCCAGTTTGGACTTGCGTTTAG 
CCCCACTGTATGACATGCGCTAGTGTGCGTGCGTGTGTGAGTAAGCGAGTGCAAAACTGCACACGTGTTTTCTTTGTTTCGCGCGGAAGGGGGCT 
ACTTTTGCGGGCAGTAAGACTAAAAAAATTACACTACTTTTGCGGGGATTACTCACCATCGGCTGGCTGCTGGAAGTTGACGTAGGCATAGCCCA 
ACGAGCGACGGGTAATCACATCGCGGCAGACACGAATGGACAGCACTGGACCAGCAGACGAGAACTTGTCAAAAAGTCCCGATTCGTTGACGTCC 
TGTGGGAGATCACCGACGTATAGAGAAGCCATATTTGGTGATCAGGTCTACCTGCAATTTTTAACAATTAATTTCCACTTTTTTGACACTGAATT 
TTGGGTTTTTTCTTAATCTTTACGGACCGGATGGATTCGACCAGGCAGGCGGTGAAATAGAAAGAAAAAAAGCGGCGTGGCTTTCGGTCTTCTAT 
TCGTCTCTCTGTCTGTGTGTCTGGGTGGCAACACTGGTACATGTGACAGTCGGTAGACATGGCAACGTTGGCATTTGCTTTGTTGTTGCTCTTTG 
CTCTCCCCTTTTATTTCTCTTCGTGTTGGAAAATTTCCACACAGACACAAAGATAGACACGCGTTTGCCTGATCGAATTTTGGGGAATTCTTTGA 
TTTTCTCGAAAATATGCGACAACTAGTTCCTAGACCACTATTTTGCGGGCAATCTGGCGTTTATAGACGCCATCTTTGAGCGAAGAACGACCGCA 
AACAAAGCAAATTTCGATGGGAAAGGAAAATTTCCCTATTCGCTTTGTCGTGCGTTGCGCCCTCTCACTCTCTACACCATTGTAACCAATAGCTT 
TACACTGTACTTACTTCTCAAACACGCTCGATTTTTTTTGCTACTTTTTACAATTTTTTTTGTTACGTTTTTGTGTTTTAATATTTTTTTTTATT 
TTTTATATGTTTTTTCTTCACAAAAAATATGTGATAAGCCGTGTGCTGAAAGCGAATCCACGACCGCTCACGTAAACGGACTCTACACGACTCAT 
AGAGAGGAGATTCGTTAGCGATGACCCCACAAAGTATCGATTAAATCCCACTGCACAATGCTGCCACATAGGGTTAGAGACAAATAAATGTCACT 
AAAAAATCCGTGAGGACAATTTACTAATTACGAAGCATTATTTTGAATTTCAAAAAAGCCAGCCATCACACATAAAAAATTTTTATATTTAGCTA 
ATTCCCGAAACATGAAATACTTGTGCATTTTCAGGGGTTCTTTGGTAACTCTGCTTCATTTATGTTCTGATATTTTCGCCAATGTCCTTGAATAC 
TTTGTCTCATAAAACACAATCCAAAATTAGGAATGCAGTTTTTTTTTCTTTTAAAAATGAGCGTAGACTTTTAGAAAATGATTTTTTGGCACTGC 
ACAGATATTAGCGTATGCTCCGAAAACCGAATCCCATCTTTGTAATTCCCCACTGTGCGAACTTACCTAAATCAGAGAATAAAGCGATGAGGCTG 
CAAGTCGACACTCCGAGAAGTGCTTGAGTACTTCTTTTCGTTCTCGATGGAAATCGGCACTACAGAGATTGAAGAATTTTTTAAGTTTTTTTTTT 
GTGTTTTTGTTTTTACGAAATAGACCACATTCAAGGATGGCTGAGGATAATTTTTTTTTTCTTTTTCCCGTTTTGAACGTCTGGTGCATGTGCCC 
TATCTGGCGATCGAGTACCTAATGAAGCGGTGTGAGGCGTCGGTGCCTGACAAGCTTCAAAGCGAAGTGTCACTTTGGGCGCCTACTGGCGTGTC 
CTGAGTAAAATCTTAAAAGATTTGACATTTCGGAAACATTTATTTGGGCATTATATACTTCATTAAACATATTTCGATCTGGTTTGGCCAAACAT 
TTACTTGGCTTGAGAATTAGTTTGCTTTCAACTCAATTTTGATTTTTCGTAGCATTTAATAGTTAAACATAAAACGTACTGGCTTAATTCATTAT 
TGTGTAGTTTTGCCTAGACTATGGGTGAAAATTATTGAACTAGTCCTACATTTTATTAAATTTGTGTGTATGTATGTATATCAATTGTTTTTTTT 
TTGTATGCATGACATGATTTGTGAGTTTTCCCATTTGAAGGTATATTTTTCAAAAACATACATGGGCGTGGCGTATCGCTAAAGTACAGTACTGA 
ACTTAACTGAATTGTTCTCATTATTTAATACGTTTTCATTCATCAGCGAATTTTACATAGGCTTCCAGATTAATCTTGTAATACTTTTACCAATT 
TTGTGAAAACTTATTTATGTTCTGCTGCAACACTTAACTGCCATTGCACAATTATAACCTGTATTCAGTTGCTTTTTGCTTTTTTTTCGTTTTTT 
TTTTTTACAATTTAACATTTTATCGCTTTAATGCTGAACCAATATTTTGAACAATCTTTTTCTATTTTGGCAGTGGGTTGGTAATAAACTTTTAA 
AC AT T AAC AAAT AAAG AGGC CC GT AGT T GGC AGC GT T C AT T AAT AC AT AAT AACG AAAAGAAAT G AT AAT GG AAATG AAG AAT AT AAAATT C ATA 
ATGAGTCGGTATGCCAAAGGGTTCAATTTCCAAGCAATCCAATCGCCGTTGTCGCTGTATCTCCGCCCCACTCCACCAGGTACTGGACCTGTCCA 
TTTCCCATGCGCCGCTTGGCCCGAATGGCAAAGGGCTCACCAGCATCTATGCGATTC 
(SEQ ID NO: 1204) 

Exon: 7132.. 6907 
Exon: 5751.. 5567 
Exon: 3826.. 2342 
Exon: 2272. .1965 
Exon: 1897.. 1001 

Start ATG: 5732 (Reverse strand: CAT) 
Transcript No. : CT39088 

CCAGATAGGGCACATGCACCAGACGTTCAAAACGGGAAAAAGAAAAAAAAAATTATCCTCAGCCATCCTTGAATGTGGTCTATTTCGTAAAAACA 
AAAACACAAAAAAAAAACTTAAAAAATTCTTCAATCTCTGTAGTGCCGATTTCCATCGAGAACGAAAAGAAGTACTCAAGCACTTCTCGGAGTGT 
CGACTTGCAGCCTCATCGCTTTATTCTCTGATTTAGGTAGACCTGATCACCAAATATGGCTTCTCTATACGTCGGTGATCTCCCACAGGACGTCA 
ACGAATCGGGACTTTTTGACAAGTTCTCGTCTGCTGGTCCAGTGCTGTCCATTCGTGTCTGCCGCGATGTGATTACCCGTCGCTCGTTGGGCTAT 
GCCTACGTCAACTTCCAGCAGCCAGCCGATGCTGAGCGTGCTTTGGACACCATGAACTTTGACCTGGTTCGCAACAAGCCCATTCGCATTATGTG 
GTCTCAGCGTGATCCTTCTCTTCGTCGCTCAGGTGTGGGCAACGTGTTCATCAAGAACTTAGATAGGGCCATCGACAACAAGGCAATCTACGACA 
CTTTCTCCGCCTTCGGCAACATTTTAAGCTGCAAGGTAGCCACCGATGAAAAGGGCAACTCAAAGGGCTATGGATTCGTCCACTTCGAGACCGAG 
GAGGCCGCAAACACGTCCATCGACAAAGTCAATGGCATGTTGCTCAACGGCAAGAAGGTCTACGTGGGTAAGTTCATCCCGCGCAAGGAGCGCGA 
GAAGGAGCTGGGCGAGAAGGCTAAGCTCTTCACCAATGTATATGTGAAGAACTTCACTGAGGATTTCGACGATGAAAAACTGAAGGAATTCTTTG 
AGCCCTACGGCAAGATAACCAGCTACAAGGTCATGTCCAAGGAGGATGGCAAGAGCAAGGGCTTCGGATTCGTTGCATTCGAGACAACAGAGGCT 
GCTGAGGCGGCCGTTCAGGCTCTCAATGGCAAGGACATGGGAGAGGGCAAGTCCCTGTACGTGGCTCGTGCCCAGAAGAAGGCCGAACGCCAGCA 
GGAGCTGAAGCGCAAGTTCGAGGAACTGAAGCAGAAGCGTCACGAGTCTGTGTTCGGCGTTAACTTGTACGTTAAGAACCTGGACGACACAATCG 
ATGACGATCGTCTGCGCATCGCGTTCTCTCCATACGGCAACATCACATCGGCCAAGGTCATGACCGATGAGGAGGGTCGCTCCAAGGGATTCGGC 
TTCGTGTGCTTCAATGCCGCAAGCGAGGCTACCTGCGCCGTCACCGAGCTGAACGGTCGCGTCGTCGGCAGTAAGCCGTTGTACGTTGCTTTGGC 
CCAAAGGAAGGAGGAGCGCAAGGCTCATCTCGCCTCGCAGTACATGCGTCACATGACCGGCATGCGTATGCAGCAGCTGGGACAGATCTACCAGC 
CCAACGCGGCTAGCGGCTTCTTTGTGCCGACCCTTCCGTCAAATCAGCGTTTCTTCGGTTCCCAGGTGGCCACTCAGATGAGAAACACACCCCGC 
TGGGTGCCCCAGGTGCGTCCCCCAGCAGCCATACAGGGTGTCCAGGCCGGAGCTGCCGCTGCTGGTGGCTTCCAGGGCACAGCCGGTGCGGTGCC 
CACTCAGTTCCGTTCGGCTGCTGCTGGCGCTCGCGGGGCACAGCCTCAGGTGCAGGGCACACATGCCGCCGCTGCTGCGGCTAACAACATGCGCA 
ACACTGGAGCACGCGCTATTACTGGCCAGCAAACTGCAGCACCCAACATGCAGATCCCCGGAGCCCAAATCGCTGGCGGTGCCCAGCAGCGTACT 
TCCAACTATAAGTACACCTCAAACATGCGCAATCCTCCAGTGCCACAGTTGCATCAGACCCAGCCAATTCCACAGCAATTGCAAGGAAAGAATTC 
TGAGAAGCTCATTGCCTCGTTGCTGGCCAACGCCAAGCCGCAGGAGCAGAAACAGATCCTGGGCGAGCGTCTGTATCCGATGATTGAGCATATGC 
ACGCCAATTTGGCTGGTAAGATCACCGGCATGTTGTTGGAGATCGAAAACTCTGAGCTCTTGCACATGATCGAGGATCAGGAGGCCCTTAAGGCC 
AAGGTGGAGGAGGCTGTGGCCGTGCTTCAGGTGCACCGCGTCACCGAGCCCGCCAACTAAGCTCGAACAGCTCAAGCGTATGAACATTTTACAAG 
AACACTTTACACCCAAAAATTTTCCACTAATCTTTCAATACATGAACAACAAAAATTCGCATTCACTGATGCTTTGCGCAGGGGAGGATTGCCAA 
GGACACACAATTAACACAGAAAAACAAAGCGAAAAATCTACGGCTTTTTCTTTTCTAATCCCAGAAACATCAACAAAAGCAATTTCAATCATTTA 
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ATTTCGAACGCATTTACGCAAGTACACGAAGCAAAAGAAGAAAGTCACGCAATCAAATTCTGCAGTCCACTTATAAATTCTTTTTTTTTACATAA 
ACATGTTAATTAAAAAAAAAAACGGAAAAATGTGAAATGAAAAAAATTTCGTTATTGCTTTAACTGGATCGATATTATTTTGTATGCGTATAATG 
AAAACACCAAAAAACCATTGCAAAATAAACAGATTAAAGCAAAGCATACATGTAAACCGCTTCTTGCAAGTATTGAAATTGTGTTTTAAAAATTT 
TAAAGAAGTTTCTAGTGTTACATACATATTGAGTTTTTCGTACACTTTTTCCATCCACACTTTCAAACAACAACAAACACGAAAGCGATATGTAA 
TTTTTTTCAATTCTTTTTTCTCAAATTCAAACTTTAAGAAAGCTTTGTTTAGAAAAAGAAACGCGACAACCATACACTTTTAAAAAAGCAAAGAA 
AAAACAAAAAAATTGCAAAAGAGTTAAAATAAAAACGAAAAAATGGAAAGCAAACTGTGCAGAGAACAGAAGAATAGAAAGAAAGAAAGAAAAAG 
AAAACCGCAAGAATATGAGAGTCAAGTTTTTGCTACTrTGAAAAAAGAGAGAAAATACAGAATACCAACAGCTGAAAAAATATGTTGACAAATGA 
AAACAAAACATAAATCGAAAAAACTACAAAAAAATGGCAAAACCTATGTAAAAACAGCAAA 
(SEQ ID NO: 1205) 

Start ATG: 24 6 (Reverse strand: CAT) 

MASLYVGDLPQDVNESGLFDKFSSAGPVLSIRVCRDVITRRSLGYAYVNFQQPADAERALDTMNFDLVRNKPIRIMWSQRDPSLRRSGVGNVFIK 

NLDRAIDNKAIYDTFSAFGNILSCKVATDEKGNSKGYGFVHFETEEAANTSIDKVNGMLLNGKKVYVGKFIPRKEREKELGEKAKLFTNVYVKNF 

TEDFDDEKLKEFFEPYGKITSYKVMSKEDGKSKGFGFVAFETTEAAEAAVQALNGKDMGEGKSLYVARAQKKAERQQELKRKFEELKQKRHESVF 

GVNLYVKNLDDTIDDDRLRIAFSPYGNITSAKVMTDEEGRSKGFGFVCFNAASEATCAVTELNGRVVGSKPLYVALAQRKEERKAHLASQYMRHM 

TGMRMQQLGQIYQPNAASGFFVPTLPSNQRFFGSQVATQMRNTPRWVPQVRPPAAIQGVQAGAAAAGGFQGTAGAVPTQFRSAAAGARGAQPQVQ 

GTHAAAAAAWNMRNTGARAITGQQTAAPNMQIPGAQIAGGAQQRTSNYKYTSNMRNPPVPQLHQTQPIPQQLQGKNSEKLIASLLANAKPQEQKQ 

ILGERLYPMIEHMHANLAGKITGMLLEIENSELLHMIEDQEALKAKVEEAVAVLQVHRVTEPAN* 

(SEQ ID NO: 1206) 

Classification: RNA_binding 
Gene Symbol: pAbp 
FlyBase ID: FBgn0003031 

Celera Sequence No. : 142000013384010 

TTTTGGTTACTTGAAACGCTAAGACTGAAATGGCTTGTTATGGAGAAGAAAACACGTCCTGCAATACATTATATAATAAAAAGAACCCAGCAATT 
TTCGCATCACCACATTACAGACAACAAAGTACTTTGTCCTCATCGTTCTGTGTGATCCAGATCCCCGACTATATATGATATATACACCTAGCTAG 
ATTTAAGCGATTTGTACATACAGAAGCAAAAGCAAAACAATATACCAGAAAATTGCCTGTTAAATACATTTATATACACATATGAATTTAGTTAT 
ATCACGATGACTAGTTTTTAGTTCTTAGAGGTTATTTAGTTAGTAAGGAGCCCAACTGACGTAACGTGAATAAATCAACTATATATGTGCCCATA 
TATAAAGCGTACTATTTAATTATTTAGTTTTTACTGGACTTATGACACAACGAGCCACTGTAATGTCTTATTCAGCTCAATGCTCATAAACAAAA 
TAAATATTTTTGGAAAAAATACTGTAGTTCTTCTTATTTAGCCAAGACGTCCAAACCTTTTTTTAATAAATTTAATTGAGATGAAGATTTGCATT 
CGAATTCTATAGAAGCATCAAACAAATATTACAACAGTATCGAGTGGGTTATTAAAAATTAAATATGTAAAAATCGCTCATTGTGTTAAAATATT 
AATTATAATAGATTAGCTCAAACATCTTAAATCGACATTTTGTGCAACACATAAAAATTCTCGCAGACAATTGGCAGCTGATTACGATTAAACAA 
AACCGAAATTAGCCGTTTTCCTTTAAAGCAATCAGCATTCGTGATCTCTTGTTTGGTTACCTCACAATGCTAGTTAGTTATAGCGCCATAAAACC 
CCATTTTCAATGTCAGGGAAATCATAAGGTCATAAGGTTATCCCCACGATCGTTCTAATGTGATCACAGCCACAATCTATAATAACCAGTTCCCA 
ATAGATAGACAAGATTAGAAAACGCAGCTTGGTAGAGTTTTTTATATTTTATAATTTAATTTATTCAGAGTGCTTTCTTGGGTATCGGGTCAGAC 
AATTGAAACAGCATTAATATCATATTCGCATGTGTGGCCTGTGGGGGCAGCGGAGAGGAGATAACAAGGGGCTGGCCATCAAAGGTACAACATAC 
GGACATACTTTCCAGCGAAAGCTAGACACACATAGGATTAAATATAGCACATAGTATAACAGAATATATATGGAATTGAATACTTTGAACTTGAG 
TGTAGCTTAGTTTCGTACGATAGAATGTTTGGCTTTTAAAGTTAGAAAGATGTGTTTATAAAAAAGTGTGTATAAGTGCTGTGTTAGAGAAATGC 
GTAAAGTGTTGTTTTGGAATGTACATGTGCAATGGCACACACGCAACATTAATGTATTCTTGCTGAGAAACAAGTATAAAGCATTTTCGCTATAC 
TGAAACCGAAATTCGTTAGAAAGTAAAAGTATGTTGCTCCTGTGTCTGCCCACAATCGCTCCAACTAGTGAACTCTGTCCTTAAGGTAACCAATT 
GGACTAGCCCACTTAATGCTGCTTGGCCTTCAGGCTAGCTGGCATCTCTGGTGGTGGTGGGCGGGGAATGCCCAGGGCGACCTTCACACCATCGT 
AGATGAACCACTGCAGAGCGGTCAGGGTACCGATCATGATAATACGAGGAGTCAATCCGTTCCACATGCCACTGAAGCCCAGCGACTTGGCCACG 
CTGATGGCGCTAGCTCCCTTGGCCTGGTTCAGCTTGGACACCACCACATCAGCGGGATGCGACACCACGGCACAGAACACACCGGCGATGTAGCC 
AGCGGCGAAGGTCACGATCAGCTGCTCGCCCTTGGTGCAGTCGGCACGAGGCTTGGGCACCACATACCTGGGAAAAGTTGCATCATTAGTATTGA 
TTTAACGGGGTTCGTTGGGGCTTGACAACCCGAACTGTGAACGGTATTGCTCTAGTTTTTACATTTGAACTTTCAGTGGCTACCTGCTAATTTGC 
ACTTCAAAAGGAATGCTTTGCTTTTTTTATTTTATCCATAGTAACAAACTAGTTCTGAAATAATATGATTTTCCTATATTTGAACAAGATAAGCA 
TTCGTCGTTGGTTTTTATTCTGGCGTGCCATTTATTAGATACGAATGGCATTCTTATCTGCTGACGCGCTAAAAGTTTATTTTATTATGAATACT 
TTCAGTTTTGATCAATTAGCATATCACTTTTGAAATTTCACAGATTAACAGTCTACCTTTAAACTATTGTAATAGCAAAACGCTAATACGAATAC 
AGATATTGTGCACAGCACTGGTAAGTTTTTCCGCTGTAGAAACTAGAGGATACCTAAAGAGGAAATGTTACTTACTTGTAGAGCAGCTCCACGGT 
GCGCTCGAAGCAAGCGAACTTCATCATGGTGTATGGGATCTGTCGCATCCACAGGGGAACCAGACCCTTGTAGAAGGCATTGACGCCCTCCTCCT 
TTAGCATCTTGGGCACTGCCTCGCGGAAGTTGTTGGCGTATCCGGGAATAGTCTGGATCTTCACCTTGGCGGCCTCGAACGGCGCCAGAGCGATA 
TCGGCGAAGAACTCGGCCGAAGCGGAAGCAGCCAGATACAAGGAGGTGCGGTACAGGTAGGCGTTCTCCTCGCCAATGATTTCGGCGTACTTCAC 
CTTGAACAACTCGTACAGACCGAACTTGCACAGACCCTGAAAGATACAGAAACGCGGATTAGCCTGCTGTTTCTCAGAAGTTATTTTTCAGTTTG 
CAGTAGTATCCAAGATTTTATCTTCGGTATCTGTTTCTCCCGTAACCATAAAGTTGTCAAAGGTGTGTTGTTATTCTGAGGTATACTAAGATATC 
AGCCTAGGAATCAAACTTCCTAATGTTCGATTCGAATCACTTTTATCTGCTTCAAGCCCGGAACCGATGTAATTAAGGTCAGCCGACCTGGTCAC 
GAGCATATAATTCGCCTCCAGACGATTATGCAACCCATAAAGTGAAGCCCAATCTCACCTGTGCCGAGTAGCCGAGCAGAGTGGGGAACCAGCCC 
TTAGCCAGTCCGCGGGCGCCCTCCTCCGCCACGGTGACCTTGAATCCGTGCACCAGATTCTTGTACTTGGCCTGGTCGACCTGCAGACGGCACTT 
CACCAGATCCAGTGGGACCACGAAGGTGTGGGTGGTGCCGCAGCTGAGAATGCCACCAATTCCGCACAGAGCGAAGTACTTGGTGCTGCCGAATT 
CGCACGAATCCTGCTGGTTGGCCACCGGGGTAGCGGCGGCGGCGATCTGGCGACCCTCAACCGGCTGGGGTTCCACCACTGGGGCGGCGGCATCG 
CATCTTGCCATCGACATGGGGGTGCGGAATGGCGAGTTCCGGGCAGTTTCAAAGAAGCTGGAGAACATCTTGATTCTGTTCTTGTTCTGGGGTCG 
GCTGAAAGTAGAGAGACCATCCATTAGTTACGTATATATGTACCTCGGAGGCGACATGTGCTTTATCTAATTATATATATAGCTGGGTAATCAGC 
TGCTTTTCGCTTTTGTAGGTTACATAACGGCGTGCCTCGAATTCCGAGCACCCGGACTTACGACACCAAATTTACCCGAAAAGCGAAGGATTTTA 
AGCAACCACGTAAATTTAGTGTATGTTAAGGGAGCGATAGAGCACCGCTCGGCTACTTTGATTTCGATTTTACAAAACTTATGCAATTTTCGCTC 
GGAATTTTCGCACTTACTATTCGTCAGCAACGTTAGCGGCTCGTTGGCGAGGATTCAAAGAATGAGGGGGGGCTCACGGAGGTGCAGCGATTTCT 
CACGAATACCAAAACGCCTTAATGGTTTTGCCGTGGAGAGGTGTAACTAGCGGATTGAGGATAGTTGGCCACACTCATGTCAAAATCTTCGGCTA 
TGGACTGCAACCTTGGTGCAAAACTTTGGTGGCGCCTGAGCGTGGACTCGAACCATCTGCATACGAGGTATCAGGTGAGATTATTTTAATGCCAT 
TGCCTTCGTTGTGAATAAAATATTAATTATTAATATTAATTTCATATTGATAACATTAGTTGTAAATATTTTGAAAAAATAATAAATTCATTTGT 
AAAATCCTCATATTTTTGCATAGTATTTGTGATTTGTTTTCTATTTTTACACAATAAAACAAATTTTAATCTAAAGCTTTATTGTATTTAGAACT 
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CTCTGTAAAATATATAACAATTTTCAATTATATGTACATTTTAAATTTATAAGTTATAGCCCCATGGTAAAACTGGATAATTTTATGAGTAAATA 
TGTGAAAATAAAGTTCTGCCATGATTTCACATGGGGTTGGCCCAATCCTTCACTTTTTACGCACGGACAGGAAATTAGGCCTTTGTATTCCATTT 
TATATTTCACTCTGCAAGTGAAGCAAAATTCACCCTGGTGATCTTCAGTTAATAAGATTTGACAACCGATTTCTGGTGACCAAACGCAGCAGACT 
TTGGCATTCATAGTATCTGTAAAATTAAAAAATTAGTCTTTTATTACTATTCAAATGCTTCTTATACCGATATGCAGTCCGGGTGGAACGCAAAC 
ACCCTCTTTATTGGTTGTTTTTGGGAATTCACCTTGTAGACGACCCAATCGCCTATAAAAGTTTCCTTCATAACAACTGAATCCTGTCTCTTTAA 

TCCGACACATTTCCGTATTTTCCTTCCTACAATCTAAGCATCTC 
(SEQ ID NO: 1207) 

Exon: 3699.. 3591 
Exon: 3421.. 3004 
Exon: 2696. .2356 
Exon: 1872.. 1001 

Start ATG: 3393 (Reverse strand: CAT) 
aaattgcatLgttttgtaaa^ 

CCTTCGCTTTTCGGCCGACCCCAGAACAAGAACAGAATCAAGATGTTCTCCAGCTTCTTTGAAACTGCCCGGAACTCGCCATTCCGCACCCCCAT 
GTCGATGGCAAGATGCGATGCCGCCGCCCCAGTGGTGGAACCCCAGCCGGTTGAGGGTCGCCAGATCGCCGCCGCCGCTACCCCGGTGGCCAACC 
AGCAGGATTCGTGCGAATTCGGCAGCACCAAGTACTTCGCTCTGTGCGGAATTGGTGGCATTCTCAGCTGCGGCACCACCCACACCTTCGTGGTC 
CCACTGGATCTGGTGAAGTGCCGTCTGCAGGTCGACCAGGCCAAGTACAAGAATCTGGTGCACGGATTCAAGGTCACCGTGGCGGAGGAGGGCGC 
CCGCGGACTGGCTAAGGGCTGGTTCCCCACTCTGCTCGGCTACTCGGCACAGGGTCTGTGCAAGTTCGGTCTGTACGAGTTGTTCAAGGTGAAGT 
ACGCCGAAATCATTGGCGAGGAGAACGCCTACCTGTACCGCACCTCCTTGTATCTGGCTGCTTCCGCTTCGGCCGAGTTCTTCGCCGATATCGCT 
CTGGCGCCGTTCGAGGCCGCCAAGGTGAAGATCCAGACTATTCCCGGATACGCCAACAACTTCCGCGAGGCAGTGCCCAAGATGCTAAAGGAGGA 
GGGCGTCAATGCCTTCTACAAGGGTCTGGTTCCCCTGTGGATGCGACAGATCCCATACACCATGATGAAGTTCGCTTGCTTCGAGCGCACCGTGG 
AGCTGCTCTACAAGTATGTGGTGCCCAAGCCTCGTGCCGACTGCACCAAGGGCGAGCAGCTGATCGTGACCTTCGCCGCTGGCTACATCGCCGGT 
GTGTTCTGTGCCGTGGTGTCGCATCCCGCTGATGTGGTGGTGTCCAAGCTGAACCAGGCCAAGGGAGCTAGCGCCATCAGCGTGGCCAAGTCGCT 
GGGCTTCAGTGGCATGTGGAACGGATTGACTCCTCGTATTATCATGATCGGTACCCTGACCGCTCTGCAGTGGTTCATCTACGATGGTGTGAAGG 
TCGCCCTGGGCATTCCCCGCCCACCACCACCAGAGATGCCAGCTAGCCTGAAGGCCAAGCAGCATTAAGTGGGCTAGTCCAATTGGTTACCTTAA 
GGACAGAGTTCACTAGTTGGAGCGATTGTGGGCAGACACAGGAGCAACATACTTTTACTTTCTAACGAATTTCGGTTTCAGTATAGCGAAAATGC 
TTTATACTTGTTTCTCAGCAAGAATACATTAATGTTGCGTGTGTGCCATTGCACATGTACATTCCAAAACAACACTTTACGCATTTCTCTAACAC 
AGCACTTATACACACTTTTTTATAAACACATCTTTCTAACTTTAAAAGCCAAACATTCTATCGTACGAAACTAAGCTACACTCAAGTTCAAAGTA 
TTCAATTCCATATATATTCTGTTATACTATGTGCTATATTTAATCCTATGTGTGTCTAGCTTTCGCTGGAAAGTATGTCCGTATGTTGTACCTTT 
GATGGCCAGCCCCTTGTTATCTCCTCTCCGCTGCCCCCACAGGCCACACATGCGAATATGATATTAATGCTGTTTCAATTGTCTGACCCGATACC 

C AAG AAAGC AC T C TG AAT AAAT T AAAT TAT 
(SEQ ID NO: 1208) 

Start ATG: 138 (Reverse strand: CAT) 

MFSSFFETARNSPFRTPMSMARCDAAAPVVEPQPVEGRQIAAAATPVANQQDSCEFGSTKYFALCGIGGILSCGTTHTFVVPLDLVKCRLQVDQA 
KYKNLVHGFKVTVAEEGARGLAKGWFPTLLGYSAQGLCKFGLYELFKVKYAEIIGEENAYLYRTSLYLAASASAEFFADIALAPFEAAKVKIQTI 
PGYANNFREAVPKMLKEEGVNAFYKGLVPLWMRQIPYTMMKFACFERTVELLYKYVVPKPRADCTKGEQLIVTFAAGYIAGVFCAVVSHPADVVV 
SKLMQAKGASAISVAKSLGFSGMWNGLTPRIIMIGTLTALQWFIYDGVKVALGIPRPPPPEMPASLKAKQH* 

( SEQ ID NO: 1209) 

Name: mitochondrial carrier protein; phosphate carrier 
Classification: transporter 
Gene Symbol: Mpcp 
FlyBase ID: FBgn0026409 

Celera Sequence No. : 142000013384645 

ACTCATTGATTCATTCTTATGTTTGTTATTTTGTTATTAAATTCCAATTGATGATGCCAGTGTCTAAATAAAAACGAAGACCTAATTTTCAACCA 
AAAATCAGGTTTTCTGGGCTTGTCCGTATTAAACATTTTTGATAATTATTTATAATATATTGTTATTTTTTGCTGGCTTCAAAACATCAACAAAT 
TGTGTATGATAAATGACGATGACGTTGTTGTTGATGGCGATCGCGCTGTGGGAATCAGATGAAATGTCAATTAAGTTTGCATTTATATCTTTGAT 
TGAAGCTCTAAGATTTGCTTTGCCATTTTATGCGGAAGTTAAGCTCGGTCAGTTCTTGAAAAAGTTCCCATGGGAGCAAGTTATTTGTTTTTTGA 
CAATAAGAAGCGTTTATTTAAAATGGCTTATTATGGCCTCAATGATTCCTAATAGGCGTAACTTACTGGTTATAAACTTGTTTAATATCGTTTCA 
TTTTAAACCTGGATGACAGAAAATGAATTTCGGTTCAAATACAAATATTACAATAGAAAGTATGATATTTAGCTCCGATAAACATTTAACAATTT 
TAGTATTTTAATGGTGAAATGGTAAATTCCGCGCATGAAAAGAGATAGAAGAGATGACATATATGGAAACCTTAAAAACGTAACGGTTGCTTGGG 
TTTTATAACAATCAGTCAGTGACAGGCATTTCCAGAGTTGCCCTGTTCAACAATCGATAGCTGCCTTCGGTCGCTCGTCACTAACTTTCCTTTAG 
GCAACGGCCACACTGTCTGGCCACCAAAATCCCAAACTTAATTAAAGAATTAAATAATTCGAATAATAATTAAGCCCAGTAACCTACGCAGCTTG 
AGTGCGTAACCGATATCTAGTATACATTTCGATACATCGAAATCATGGTAGTGTTGGAGACGGAGAAGGTAAGGCGAGAAACGGCGAGCCGCATG 
GTTTCGATTTCGGCTGAGCCGTGGCAGGAACAACAAAAACAGGGTTGTTGCACAAGAGGGGAGGCGATAGTCGAGCGGAAAAGTGTGCAGTTGGC 
GTGGCTACATCATCATTGTGTTCACCGATTATTTTTTGCACAATTGCTTAATATTAATTGTACTTGCACGCTATTGTCTACGTCATAGCTATCGC 
TATCTCTGTCTGTCTCTATCAAGCTATCTCTCTTTCGCGGTCACTCGTTCTCTTTTTCTCTCCTTTCGCATTTGCATACGCATACCACACGTTTT 
CAGTGTTCTCGCTCTCTCTCTCTTGTCAAGACATCGCGCGCGTGTGTGTGGGTGTGTTCTTCTTTAGCTACATATACATAAATAGGAGAGCGGAG 
AGACAAATATGGAAAGAATGGAAAAAGAGTGAATTACTGCAATTAACCAGTCGCGAACAGTTAAATCATATTTTTGTCGGCCATTGCAGTAAATA 
AACCGTTGGCTTTCCCTCTTCACTTTCCACCTCCTTTCTTTGACGTTAATTTTTTTCAGTTAATCGCGCCGCTGCTTTGAACTCGAACACGAATT 
TTAGCCGCAACATAAAATAAAATCAAGTAACTCTTTTAACTCAAATATAAAACAACAATTCCAATTCTTTCAACAGGCAATCTGTGTTTTTATGT 
CAGATACGTGCGCGTGTGTGTGTGTGCTGTAATTCCATCGCCCCTTTCGATTCCGAGTTCGTTAGGAACAGCATTAGTTCGCCTATTTTAGTAGT 
AGCCTAGTCCGATTTTAAGTGAAACAGGATACTCCAACACCATATACTCAATAATTAGTTACAACACCCACTCAACCATACAGCAACAACAAGTT 
TAACGAGTTTTTTGTATTATCATTACTTAGTTTTTTGGTTAATAATACACAAGTGAAGAGCGAACTGCAGGGGAGCGAGATATCACGAAACAATC 



FIGURE SHEET 649 



CAAAATCCACACACACTCAAACAGAAATCAAAAGCTTCGCTCTCTCGCACACACACGCACCAACCAACTATCAACTATCACAAACACCGCGACAG 
AGAGAGAGCGGCAAGTGAATCACGGCGAATCGAAACCGATCCGAACCCACTCCGGAGCCGAAAAAGAACTGATCCTACCATCAAACGCATCCAAT 
AAACACGGCCGCCAACATGCAGAGCGACTTTCACAGAATGAAGAACTTTGCCAATCCCAAGTCCATGTTCAAAGTAATACTCTCAGTGCGCCTGT 
CGCTAAGCCAAGCTAATCTAATCTTCTGATTCCCCTTCCCATCCATTGCCATCTTCTCCCGCAGACCAGCGCCCCCAGCACCGAGCAGGGTCGTC 
CGGAACCACCAACTTCGGCTGCAGCGCCCGCCGAGGCTAAGGATGTCAAGCCCAAGGAGGACCCACAGGAGACTGGTGAACCAGCAGGCAACACT 
GCAACCACTACTGCTCCTGCCGGCGACGATGCTGTGCGCACCGAGCATTTATACAAACACCCGCTCATGAATGTCTGGACGCTGTGGTACCTTGA 
AAACGATCGGTCCAAGTCCTGGGAGGACATGCAAAACGAGATCACCAGCTTCGATACCGTCGAGGACTTCTGGAGCCTATACAACCACATCAAGC 
CCCCATCAGAGATCAAGCTGGGTAGTGACTACTCGCTATTCAAGAAGAACATTCGGTGGGTTTGCTGTTTATTGCAATTCTGCCAAGATAACCTT 
TACTAACTGATATCTCATTTGCAGTCCCATGTGGGAGGATGCAGCCAACAAACAGGGCGGTCGTTGGGTCATTACCCTTAACAAAAGCTCCAAGA 
CCGATCTGGATAACCTATGGCTCGATGTGGTAAGTGCACAAAGAAACGAGTGGTGAGCGGATGGTCTATTTATAGTGAATGTACATTCTTGAAAT 
GCAAAAATATAGAAATAGGTGTATGATTTTGCAATATAAATTATAACTTATAGAAAATATCAGCTAAAAATACGCTAGTGCTAGCTTTTGTCTTA 
GGAACATTCAATAGTGAGCTTATATCATAAATATTTTTCGCATATGAGTAACTACAACTGTTTTGCCTTCCAGCTGCTCTGCCTGATTGGTGAGG 
CCTTCGATCACTCTGATCAGATCTGCGGCGCTGTTATAAACATTCGCGGCAAGAGCAACAAGATATGTAAGTTTTCACGCACACCCAACTTCAGC 
GGAATTCCTTTGTTTAACGATTAATGCTTTCCAGCCATCTGGACTGCCGACGGAAACAACGAGGAAGCTGCCCTTGAGATTGGTCACAAGCTGCG 
CGATGCCCTGCGTCTGGGACGCAACAACTCGCTGCAGTATCAGTTGCACAAGGACACGATGGTCAAGCAGGGCTCCAACGTGAAATCGATCTACA 
CTTTGTAGGCGGCTAATAACTGGCCGCTCCTTATTCGGTCCGATCCCACACTGATTATTTTGTCTTTCATTTATTTATCGTTATAAGCAACAGTA 
GCGATTAATCGTGACTATTGTCTAAGACCCGCGTAACGAAACCGAAACGGAACCCCCTTTGTTATCAAAAATCGGCATAATATAAAATCTATCCG 
CTTTTTGTAGTCACTGTCAATAATGGATTAGACGGAAAAGTATATTAATAAAAACCTACATTAAAACCGGATACTTTTGATCTTTGATTTTGGAG 
ATATTTTAAAATAGAAAAAAGTTAACCTATATTGTTTTTGGGATAATTACTTGTTGTATTACTAAAACGAGCTTGTATCCTAATATGGGCACCTA 
AAGCAAAACTTCCAAATTTGGAATTCAATTTGTTGCTAAAAATACGTTAGCAATTTCTAGGGTGGCCAAAAAAAAATGTGTATCGATATATCGAT 
AGGTTTCCTAAATAGACATTAAAATAATTTTAGCTTTTTACTTATATTGTTAGAGTAAATACGTTTGTATTTAAAAAATAAGTCAGTATTTTTTT 
TTTATTAACCTATTATTGAATTTTATATGAGTGAGAGCTTTTCCATTCAGCCAAGAACGGTCACACTGCATGTGACATGTTTCAATTGTGTAGGC 
GCAGTCACACTGGTCGCGTCGCTCGTAGATATCGTGTAAGGATATGTCTCCGTGTTTAGCTAAATAAAATACATAAAATATATGTGCAATTCTAA 
C AAGC AT AGC CAT T T AGCGTAG AGT AGT GG AAC GAAC AGC TC G T AC CGGGC C AGT AAC AAC AAATT AAC AAGC AAAC GC AC T AAAGT T T GT C GC G 
GAATAAAGATATAATAATAGACCAGCTAACCAACACAAACAAACACACACACGGCCAGAAGTTCGTTGGCGTTGAAAGGGAAAAATGAAACGAAA 
T G AAAG GAG AAT AAC AGT GC AAT AG AT T CGGTT T AC ATT GTGC AAAAAATC G AAGT AGCC C CT GCC AAC AGG GT CGC TAAAAAT AAAGAAGGC G A 
CAGCCTCTGTTTATCGTTGGGAATAAACAAATATATGGCCAGCGGAGGAGAATCATAACAAAGGCGCAAGACTAGCCAGCATAAAAGACAAAGGA 
GCTCTCCTCCCCCGCGTCCGCCACCCCCCTCCCTTTTTGGGGTTGAAAACCCCCCACGCGTTGCGGTGCAGCATCGCCCCGACTATGCGATACCC 
TGCAATCGAGTTTATAGTGCTATAG 
(SEQ ID NO: 1210} 

Exon: 1001.. 1104 

Ex on : 1836,. 2163 

Exon: 2250.. 2620 

Exon: 2685.. 2784 

Exon: 3019.. 3106 

Exon: 3170. .3585 

Start ATG: 2107 

Transcript No. : CT39424 

CACAAGAGGGGAGGCGATAGTCGAGCGGAAAAGTGTGCAGTTGGCGTGGCTACATCATCATTGTGTTCACCGATTATTTTTTGCACAATTGCTTA 
ATATTAATTTTTTTTGGTTAATAATACACAAGTGAAGAGCGAACTGCAGGGGAGCGAGATATCACGAAACAATCCAAAATCCACACACACTCAAA 
CAGAAATCAAAAGCTTCGCTCTCTCGCACACACACGCACCAACCAACTATCAACTATCACAAACACCGCGACAGAGAGAGAGCGGCAAGTGAATC 

acggcgaatcgaaaccgatccgaacccactccggagccgaaaaagaactgatcctaccatcaaacgcatccaataaacacggccgccaacatgca 

GAGCGACTTTCACAGAATGAAGAACTTTGCCAATCCCAAGTCCATGTTCAAAACCAGCGCCCCCAGCACCGAGCAGGGTCGTCCGGAACCACCAA 

CTTCGGCTGCAGCGCCCGCCGAGGCTAAGGATGTCAAGCCCAAGGAGGACCCACAGGAGACTGGTGAACCAGCAGGCAACACTGCAACCACTACT 

GCTCCTGCCGGCGACGATGCTGTGCGCACCGAGCATTTATACAAACACCCGCTCATGAATGTCTGGACGCTGTGGTACCTTGAAAACGATCGGTC 

CAAGTCCTGGGAGGACATGCAAAACGAGATCACCAGCTTCGATACCGTCGAGGACTTCTGGAGCCTATACAACCACATCAAGCCCCCATCAGAGA 

TCAAGCTGGGTAGTGACTACTCGCTATTCAAGAAGAACATTCGTCCCATGTGGGAGGATGCAGCCAACAAACAGGGCGGTCGTTGGGTCATTACC 

CTTAACAAAAGCTCCAAGACCGATCTGGATAACCTATGGCTCGATGTGCTGCTCTGCCTGATTGGTGAGGCCTTCGATCACTCTGATCAGATCTG 

CGGCGCTGTTATAAACATTCGCGGCAAGAGCAACAAGATATCCATCTGGACTGCCGACGGAAACAACGAGGAAGCTGCCCTTGAGATTGGTCACA 

AGCTGCGCGATGCCCTGCGTCTGGGACGCAACAACTCGCTGCAGTATCAGTTGCACAAGGACACGATGGTCAAGCAGGGCTCCAACGTGAAATCG 

ATCTACACTTTGTAGGCGGCTAATAACTGGCCGCTCCTTATTCGGTCCGATCCCACACTGATTATTTTGTCTTTCATTTATTTATCGTTATAAGC 

AACAGTAGCGATTAATCGTGACTATTGTCTAAGACCCGCGTAACGAAACCGAAACGGAACCCCCTTTGTTATCAAAAATCGGCATAATATAAAAT 

CTATCCGCTTTTTGTAGTCACTGTCAATAATGGATTAGACGGAAAAGTATATTAATAAAAACCTACATTAAAACCGG 

(SEQ ID NO: 1211) 

Start ATG: 376 

MQSDFHRMKNFANPKSMFKTSAPSTEQGRPEPPTSA7AAPAEAKDVKPKEDPQETGEPAGNTATTTAPAGDDAVRTEHLYKHPLMNWTLWYLEND 
RSKSWEDMQNEITSFDTVEDFWSLYNHIKPPSEIKLGSDYSLFKKNIRPI^EDAANKQGGRWVITLNKSSKTDLDNLWLDVLLCLIGEAFDHSDQ 
ICGAVINIRGKSNKISIWTADGNNEEAALEIGHKLRDALRLGRNNSLQYQLHKDTMVKQGSNVKSIYTL* 
(SEQ ID NO: 1212) 

Name: eukaryotic initiation factor 4E 
Gene Symbol: eIF-4E 
FlyBase ID: FBgn0015218 



Celera Sequence No. : 142000013384822 
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TTCCCAATTCCTTCGAGGTGATGGCACCTTCGTTATCCTTATCCAATAAGCTATAGATATTCTTGAGCAGGTCCTGCTCTTCAACAGACAATTCA 
TCCATTATGCTTTATATTATGGGAAGGAGAATATTATTATTATGCTAGTAGATATACTTTTTCGATAGGATTGATACTCAGTTGAGGTCTGATAA 
GCTTTGAAGGAGTTCTACGATTGAAAATTGTTATTCAAAACATTCAATATAAATATTTTTGAAAAAAGCCAACTAAGAGTTAGCCTTCGGTCTTT 
AACAGATATTAAACAATCTGCGTGCTCCATGAAGTTCATAAACTTTTACTGAAGATAAAGGCATTAAAAGAGCATTCTCACGCACAGTTTGGCAA 
ACAAATCACGTGCTTTCCTGGACCCAAATGATATCATCTGGCGGAGAAACTCTTTGGCTTTACGCGTATTATTTTATAGTTGAACCAGAAATTGA 
TGTCATCGGTGGCGGTAAAAAATGGGCGCCCAATCCGCCCAATGAGGAAATTCCAGGAAGCCGGGCAACTCCAACTGTCGGTGATGACAATTTGC 
GTGGCATTTTAATTGAACTTTCTACGTGTTCGAGTGGTGTACTTTGATTTGATCCCCATCCACCGATTTTTCGTCGATTTGAGTCAATGTTAAAG 
GCGTTGCCACAAACCGCAAACTGTCGTGTCTAGGAGGTTGGGGAGGGGCTTGGTGGCTGGGGCCTCGAAACGTGCCTGAAGTATTATTACCGCCG 
TCTGTGATTGATGGCCTCATGCGAGTGAGTCAGTGGATTTTTATAACTCTACGCAGATCCCTTCCTTTCAATCAAGCGGCAAAAAATCTGGACAT 
CGATCTCGCCGATATTAGCTAACTTCATTTCAACTTAACTGCCCGGAGTGCTTTGAATAAAGTGGATTTGAGGCGATGCCTTGGGATTCTTGGCC 
AGATTAAGTCGATTTTTTTTTTTTACTTCAATGAAAAAAATGGCACGCAATTGTTTTTCATTATTAATGCATACTTTATTTGAAATCAACTAACT 
CGCTATGGTTTGAATAAAATCATTTAAAATTGAAGGAAGACCATGGAATTCTATGGGTTGCTTATATTAACTACTGCAAAACTCTTAAAATTTAA 
GCCTAATTACATAAGGTTCTACTGGAATGTTGCTTGTAGTTGGATTTTTGTTTATGCTCTGCTTTACACTAACTTTTACTTTCAACCATTTTTTT 
CCCTACAGCAAAATCGATTAACGATTTTGTTTACTCATTAAGATTAGCTTACCGGATAAGGAAACATAAATAAAAAATTGGAATTCACTCAGCAA 
ACAGCAACACTTAGTGTTGGGCCACACATTAAAAACTAATCGATAGACTCTCGTACTAAAGGCTTAAACATTTAAATCGGTAGATTTAAGCTAAA 
ATCCTAACGCATAAGAGCTCTATAAAATGCAACAACGGAGTTTGTTTCGCTTACAATTATCGCTTAACTGATTTTATTACTTAGTTTAGGTCTAT 
TGCCTTGCCTAGACATTGTACTAAAAAGTATTAAATTGTAGATCTAAATTTATTTTAATGTACTTGTGGGTCTAAAACGATAAATCCTCGTTTAA 
CCATTGGGAACAATAACTGTTTGTCATTATGATATTCAACTGTTCATAAGACCTTCTAAATGAAATACCAGAAAAAACAACCGAGATATATATTT 
ATTAGGAATACTACTACTTTTTGATATTTCAAATTCGTAAGAGCTTTGATAAATAGTTTTCCAAATTGTGTTGTTTTTCTATTCCGATTTTACCT 
CCATTAATGGATTGTTTTCTAGTTCTTTCTCGGTTTACTTCTAACGCCTCATTATGTCCTCAATGCTGAACCCTGTTCTTCTCGGTGGTGCAGGC 
GGTGGTGCTATGGGCGGTGCTGCCACTGGTGGTCCCTTTTCGGTGGTGGCTCCACTTGTTTCCGTACTCGGTGGTTTTCCCAGAAGCTGTCTGGG 
GGGATGTCGCGACATATTCAAAGCATATATGCTGGGACTATCCACAAAGGTCCTTTGACCCAATCCAGGATGGTGGTGCGGATCCATCAAGTCCA 
AGGTCGGTGCCGGAAGAATCCTCTTTCGCATGGGTATCGATCCCAGGAAGCTGGGCATAGCTTCATGCGGTGAATGCAAAGGTTGATTGGGTGGT 

T T AC C C GACT C C GGCT GGGT T AT T AT AGT G GGTGGTGT AT GT GGT GG C GGT GG AAC AC C AGC C T T CGAAAGGT T C AAGC TCTTGTGGT GC AAG T C 

GCGTCGAACGTGCAGAATGGGCAAATAGGGATCCCCGTTGGGTGGGAGTAAGTGTGGATGTCCTGGGTGCTGGTGGGTTGGCGGAGGAGCTCCAG 

GTGGTGCAGGCGGCGGTCCAGCTGCCCCAGGTGGTACAGAATATCGCACAAAGGAAGGTGGAACGGGTAGGCTGGCAATGCCAGAGTTCCCGGGT 

TCCTCCTGATGTTTCTCGCCATGTGGCCGCATTGTGTAGGTTTCACCACCCTCATGATGACAGTGGGTGACTCCCTGGCGTTCCGGCTTGGTGCC 

CACTGACGACGATGATTGCGATGTGGCGGCAGCCTTACTCGCCTGGAAACGCGCCACTAGTGCCACTTCTGCCTGCTCCTCGTCCTCGGGATCAA 

AGTCGTCGTCCTCTTCCCTGGGCAAATCGTGTACTCTTGGCAGATCCTGCAGTTCCTCTTCCTCGTCGAAGTCATCGAACTCCTCTTCGCTTTCC 

TCATGATCGATGGCTGCCTTGCGCTTGAGTCTCATTCTTTCGGACTCAGATTTCTCACCCGATCCTCCAGACTCGCCCAAATTGTGAACTGGCGA 

CTGGCGAGGCGAGTCCACTTCCAGCAACGAATCCTCCTCGTCGCCACTTAAATTTCTGGCCTCATCCTCTTCGCCCACATCCACATAGTCCCCGG 

AGTTGACCTCACCCACTGCATGGGTCAAGGCATGGCGTCGTAGATCGCAGTTACGGCGGAAAACTTTGCCGCATGAAGAGCACTCGTAGGGCTTG 

TGATCCGTGTGGGTGAGCAGATGGGTCTTCAGGTTGGAGCGCTGATTGAATGATCGACTGCAGACGGGGCACTTGTGGGGTGATTCCTCCATGTG 

CAGGATCTTGTGGACAGCCAAGGTTCTCGATTGGCAGAATCCCTTGCCGCATTCCGTGCATTTGAAGGGTTTCTCCTTGGAGTGAATATACCTGG 

GCGAATACAAGTGGAGAAACAATTAGACTTTATTGTTATAGCGTGGGCGTTTTCCTGGTAGCTGCAATTGGTAGCCCAGGTTGACAGTTATCTCT 

GCTCAAATGCAGCCTTAACTTGAACCGATAAAGCTCACTGATATACAGCTTTACAATACTCTAACTAAGGATATAACTGGCTAGAGCAGACAATA 

GAATAAATAATATAATACTATAGCATATAATACCTATCTTTCTCTTTCAAAATGAAACCTAACCTGGATTTGTTTTAACCTTAAACCCAATTCTT 

TAAAGTTTTTCATTAAAGAAATCCTATGATTTTGTATCCTCTTAACTGATAGGGTATTTAACTTTCGTGTATGCGATCTGGCTGCATTAATTTTT 

TATGTTTATGGCGGCCATCTTGGCAATGGCGTCCACTTGCTGTTGGATAGCAGCAGCCACAGTTAAAAGTTAAAGTCCCCATCTTCGACGGAGCC 

ACAGATTCAGCATCGCATCGGAGGCAGCAGCTGCTGGTGGAAAAAAAAAAATAAATGTATAAAAACCCAAAACAGGTTCTGCCTCCGAAGCACCG 

AGATTCTTCTCCCATGCTCCGTGCGTTTCTGGTAAAAAAAAAATACAAAAAACTTAAGAAGAATTAGCCCCGCGGCAAAGCAAACAAACGACAAA 

TGGACCGATACCGAACGATCAAAAGACAGAACGCTCGGACAGGCCGACACATTGAAGCCGGGATGCTGGACCTGCTGCCTGGCATCGGCATGGGC 

ATCGGCACCGGGATACACAGACCCGATATATAGCCCCCAACCCCTTGGCCATCCCTCAATTTTAATGCTGGTGGAGGAGCCCCAAAGGAAAACCA 

ACAGAGTTAATTAACTTTGTGTTGTACTCTGCGGCCGAGTTGAAGTTTTACTGGCCGTCAGAGGAGCTCTGATTTTGGGTCCGCTTTATGGGGAC 

TTACTTATTTGAGTTGAGGTAGGAAAATTCGTCGTAAAGTTTAAACATTTCCTGGAAAATTCCTTATGGTAAAAGGGGTCTTTTGAAGCGATTGT 

CTGATATATCTAAAATCAATAAAACTCCATCTACGAGTAGTGGTCGCCTTTCCCTTTTGGCCGAGATTACTGTCACACCTGTTCAGAATGTGTGT 

ATAAACTGCCTAGTAAGCAATAAATTACATTGATCTAATTATCCTTTGCCTTTACAGGGTATTCCATTATTCGCCATTCGTTCACACTTTGAGCA 

CAAAGAATAACTCCAGGCCTGATGGCAATTACCCGAGCAGACACTGCCCCCTTTTGTGATTACTTGTACCCCAACCTCGACTCACCTGTGATCCC 

TCAGGTGATCCTGTCGCCGGAACGCCTTGCCGCAGATATCACAGGAATAGGGCCGCTCGTCCGTGTGTGTCCTCTCGTGGATGAGCAGATTGTAC 

GACTTGGTGAATTGTCGATTGCAGAACTTGCAGATGAACTGTTTTTTAGGCCGCGATGCCCGACCAGGAGCACGTAGCAGGCGCCTATCGAGTCC 

GGCATGCATTCCAGGACCCAGGCCAGGTGGAAATAGAGCTGCGGCGGCGGCTCCTGGTGGAAATGGAAAACTTCCGGCTCCCGTGGGCGATGGAC 

CCGGTCCGGTGAGATATGGCGGAGGTGGCACGCCAGCTGGTCCGTACAACACTCCAGGCGGTGGCACACCAGGACCGCCGGGTCCTCCAGGACCT 

CGTCCTAACATTGCCGCCGCCGCAGCCGCCTCCCTTAGGGCCAACTCCTTTCGCTTGTCCGCCATCATGGCAATGATTTCGTAATTGGAATTACT 

ACCCGGGCGCACCTTTCCGGTGGGCAGAACCGGCACAAAGGCAGAGATCTCGCGATGATGCGATGGTGGTGGCGGTGGCGGCGGACCATGCAGTC 

CGGCCAAATGCTGCTGATAGGCCAGCTGTGTTTCATACGCTGAGATCCGGCTGGCCGATGAGTTGCGCGAATCGCCGTTGCCATTCTGGGAGCCA 

CCATCCGATCCTCCGGATCCGCCGTTCCTATCCGCCGCACCAGGAAGACCGCCACCACCCACCGCCGCCGCTGCCGCCGCCATCGAGTTCATGAA 

CTGATCCATCCGGGAAGGTCGCAGCATGGGTATCGCACCGCCTCCTCCGCCGGAGATCTCGCTGGATGAGCTTTCCGTGGGCATTATATTGGAGG 

GGCTCTTCACATTCCGTTGGAACTGCCGGAGATAGGGATAGAACAGAAAAAGCCGAAAATAAAATAAGGGTAGTGGTAAATAACAATTAGTACAC 

GATTGTGCTGGGTTAATTGGGCTCTCGTTTTTTTTTCGAGGCTTATCTTAATTGCGAACCGAGTGCAATAAACAGATATTTCCACGTTGTATTCT 

TTATTGCTCCTAGGGTAGTACTGGGTATGATAAAAACTGACGCAAACTGGTTTAAAATAGATATATTACTATTAACTTAGTTTCTTCTTGGTGGG 

AAGTTTCCTTGTAATTGAAATATTCATGATCTTGGTTTGGCAAGGCACAGGAATATATACAATATATACTATAGAATATTTACTCTTTTTATCTC 

AACCGGTTGTAACTCACTTTTTAGATGTGAAATTCTTCAAATTTGCTTTTTGCACTTCTTATCACAAAATTAAACAGCTTTATGGGCAATTAACT 

TGCAAATAGCCAAGGATTATCACATTAGCTGATCTTTCAGAATTTATCCACGTGAACGTGATTTAGTAGTCACAAAAAAAAAAATAATGAAGAAA 

ATATTTTTCTCGAAAATAACCCATACGATTCTGAGAACGTTTTGGTATCACAGTACGTATATTTTCACAGGGTAACTAAGCAGTTTATCACATTA 

TGGCCCTCGTCTATGGGTTTGTTGCTGTGTTTTTTCCATAGGAATTTTATTATTATTGTTGGTTTATTATTATCAGCGGCGGACTGCCGACGACG 

GCGAGTGAAACTAAACTGAAAACGAGTCGCAGTTTGGCACCGCGATCGGTTGCTTCCAGTGCGCAGGTTAGATAAGATTGTGCGTCTAAAAAAAC 

CTGCCTCCGAACCTCTGAGACTGAGTAAGCAGCACAGTAGTAACGATATAAACACACACACACACACACAGTTATATTATATATGTGCCTGTGAT 

ATATGGCATATCGGCAGATACCCCGGCACACTCAGATACAGATACAAAGATACAGATACAGAGGCAGGCAGCCAAGTTGCCGTCGGTTTTGGTAA 

ATGACTCGAACCGAACTCGACCGACTGCTTACTGGATTTACGCAACAGGAGCCTCGACTTCTGAGTGCCTCGAAAGAAGAAGGGGCAGAGGCAGG 

GGCAGGGGCAGGGGATCGGGTCCCGGAACGTGATCATCACGATAATAACAAAATAATGCACATAATAATGATCACAATGATGGCCGGACCATAAA 

GTTTGTGGCCCCATATTGCAAACGCTTCTGGCTGCCATCTTCCCAACGCCTTGCACAGCAAAAAAAATGGCACCAATTTTGTACCCAATTCTATG 
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GAATTCTATTTTAAGATCTATCTTTTACGATAAGATTATATTTGTTATAATATTAAAGAATTTATAGTAATATGCTTTAAAATTTGTGGCTCGCA 
AAGAGTGCGCC 
(SEQ ID NO: 1213) 

Exon: 5851.. 5813 
Exon: 5437.. 4551 
Exon: 3226. .1001 

Start ATG: 5404 (Reverse strand: CAT) 
Transcript No, : CT40018 

AGTGCAAAAAGCAAATTTGAAGAATTTCACATCTAAAAATTCCAACGGAATGTGAAGAGCCCCTCCAATATAATGCCCACGGAAAGCTCATCCAG 

CGAGATCTCCGGCGGAGGAGGCGGTGCGATACCCATGCTGCGACCTTCCCGGATGGATCAGTTCATGAACTCGATGGCGGCGGCAGCGGCGGCGG 

TGGGTGGTGGCGGTCTTCCTGGTGCGGCGGATAGGAACGGCGGATCCGGAGGATCGGATGGTGGCTCCCAGAATGGCAACGGCGATTCGCGCAAC 

TCATCGGCCAGCCGGATCTCAGCGTATGAAACACAGCTGGCCTATCAGCAGCATTTGGCCGGACTGCATGGTCCGCCGCCACCGCCACCACCATC 

GCATCATCGCGAGATCTCTGCCTTTGTGCCGGTTCTGCCCACCGGAAAGGTGCGCCCGGGTAGTAATTCCAATTACGAAATCATTGCCATGATGG 

CGGACAAGCGAAAGGAGTTGGCCCTAAGGGAGGCGGCTGCGGCGGCGGCAATGTTAGGACGAGGTCCTGGAGGACCCGGCGGTCCTGGTGTGCCA 

CCGCCTGGAGTGTTGTACGGACCAGCTGGCGTGCCACCTCCGCCATATCTCACCGGACCGGGTCCATCGCCCACGGGAGCCGGAAGTTTTCCATT 

TCCACCAGGAGCCGCCGCCGCAGCTCTATTTCCACCTGGCCTGGGTCCTGGAATGCATGCCGGACTCGATAGGCGCCTGCTACGTGCTCCTGGTC 

GGGCATCGCGGCCTAAAAAACAGTTCATCTGCAAGTTCTGCAATCGACAATTCACCAAGTCGTACAATCTGCTCATCCACGAGAGGACACACACG 

GACGAGCGGCCCTATTCCTGTGATATCTGCGGCAAGGCGTTCCGGCGACAGGATCACCTGAGGGATCACAGGTATATTCACTCCAAGGAGAAACC 

CTTCAAATGCACGGAATGCGGCAAGGGATTCTGCCAATCGAGAACCTTGGCTGTCCACAAGATCCTGCACATGGAGGAATCACCCCACAAGTGCC 

CCGTCTGCAGTCGATCATTCAATCAGCGCTCCAACCTGAAGACCCATCTGCTCACCCACACGGATCACAAGCCCTACGAGTGCTCTTCATGCGGC 

AAAGTTTTCCGCCGTAACTGCGATCTACGACGCCATGCCTTGACCCATGCAGTGGGTGAGGTCAACTCCGGGGACTATGTGGATGTGGGCGAAGA 

GGATGAGGCCAGAAATTTAAGTGGCGACGAGGAGGATTCGTTGCTGGAAGTGGACTCGCCTCGCCAGTCGCCAGTTCACAATTTGGGCGAGTCTG 

GAGGATCGGGTGAGAAATCTGAGTCCGAAAGAATGAGACTCAAGCGCAAGGCAGCCATCGATCATGAGGAAAGCGAAGAGGAGTTCGATGACTTC 

GACGAGGAAGAGGAACTGCAGGATCTGCCAAGAGTACACGATTTGCCCAGGGAAGAGGACGACGACTTTGATCCCGAGGACGAGGAGCAGGCAGA 

AGTGGCACTAGTGGCGCGTTTCCAGGCGAGTAAGGCTGCCGCCACATCGCAATCATCGTCGTCAGTGGGCACCAAGCCGGAACGCCAGGGAGTCA 

CCCACTGTCATCATGAGGGTGGTGAAACCTACACAATGCGGCCACATGGCGAGAAACATCAGGAGGAACCCGGGZUVCTCTGGCATTGCCAGCCTA 

CCCGTTCCACCTTCCTTTGTGCGATATTCTGTACCACCTGGGGCAGCTGGACCGCCGCCTGCACCACCTGGAGCTCCTCCGCCAACCCACCAGCA 

CCCAGGACATCCACACTTACTCCCACCCAACGGGGATCCCTATTTGCCCATTCTGCACGTTCGACGCGACTTGCACCACAAGAGCTTGAACCTTT 

CGAAGGCTGGTGTTCCACCGCCACCACATACACCACCCACTATAATAACCCAGCCGGAGTCGGGTAAACCACCCAATCAACCTTTGCATTCACCG 

CATGAAGCTATGCCCAGCTTCCTGGGATCGATACCCATGCGAAAGAGGATTCTTCCGGCACCGACCTTGGACTTGATGGATCCGCACCACCATCC 

T GG AT T GGGT C AAAGG ACCT T T GT GG AT AGT C CC AGC AT AT AT GCT T T G AAT AT G T CGC G AC ATC C C C CC AG AC AGC T T C TGG GAAAAC C ACC GA 

GTACGGAAACAAGTGGAGCCACCACCGAAAAGGGACCACCAGTGGCAGCACCGCCCATAGCACCACCGCCTGCACCACCGAGAAGAACAGGGTTC 

AGC AT T GAGG AC AT AAT G AGGC GT T AGAAGT AAACC GAG AAAGAAC T AG AAAAC AATCC AT T AAT GG AGG T AAAAT CGG AAT AG AAAAAC AAC AC 

AATTTGGAAAACTATTTATCAAAGCTCTTACGAATTTGAAATATCAAAAAGTAGTAGTATTCCTAATAAATATATATCTCGGTTGTTTTTTCTGG 

TATTTCATTTAGAAGGTCTTATGAACAGTTGAATATCATAATGACAAACAGTTATTGTTCCCAATGGTTAAACGAGGATTTATCGTTTTAGACCC 

ACAAGTACATTAAAATAAATTTAGATCTACAATTTAATACTTTTTAGTACAATGTCTAGGCAAGGCAATAGACCTAAACTAAGTAATAAAATCAG 

TTAAGCGATAATTGTAAGCGAAACAAACTCCGTTGTTGCATTTTATAGAGCTCTTATGCGTTAGGATTTTAGCTTAAATCTACCGATTTAAATGT 

TTAAGCCTTTAGTACGAGAGTCTATCGATTAGTTTTTAATGTGTGGCCCAACACTAAGTGTTGCTGTTTGCTGAGTGAATTCCAATTTTTTATTT 

ATGTTTCCTTATCCGGTAAGCTAATCTTAATGAGTAAACAZIAATCGTTAATCGATTTTGCTGTAGGGAAAAAAATGGTTGAAAGTAAAAGTTAGT 

GTAAAGCAGAGCATAAACAAAAATCCAACTACAAGCAACATTCCAGTAGAACCTTATGTAATTAGGCTTAAATTTTAAGAGTTTTGCAGTAGTTA 

ATATAAGCAACCCATAGAATTCCATGGTCTTCCTTCAATTTTAAATGATTTTATTCAAACCATAGCGAGTTAGTTGATTTCAAATAAAGTATGCA 

TTAATAATGAAAAACAA 

(SEQ ID NO: 1214) 

Start ATG: 73 (Reverse strand: CAT) 

MPTESSSSEISGGGGGAIPMLRPSRMDQFMNSMAAAAAAVGGGGLPGAADRNGGSGGSDGGSQNGNGDSRNSSASRISAYETQLAYQQHLAGLHG 

PPPPPPPSHHREISAFVPVLPTGKVRPGSNSNYEIIAMMADKRKELALREAAAAAAMLGRGPGGPGGPGVPPPGVLYGPAGVPPPPYLTGPGPSP 

TGAGSFPFPPGAAAAALFPPGLGPGMHAGLDRRLLRAPGRASRPKKQFICKFCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHR 

YIHSKEKPFKCTECGKGFCQSRTLAVHKILHMEESPHKCPVCSRSFNQRSNLKTHLLTHTDHKPYECSSCGKVFRRNCDLRRHALTHAVGEVNSG 

DYVDVGEEDEARNLSGDEEDSLLEVDSPRQSPVHNLGESGGSGEKSESERMRLKRKAAIDHEESEEEFDDFDEEEELQDLPRVHDLPREEDDDFD 

PEDEEQAEVALVARFQASKAAATSQSSSSVGTKPERQGVTHCHHEGGETYTMRPHGEKHQEEPGNSGIASLPVPPSFVRYSVPPGAAGPPPAPPG 

APPPTHQHPGHPHLLPPNGDPYLPILHVRRDLHHKSLNLSKAGVPPPPHTPPTIITQPESGKPPNQPLHSPHEAMPSFLGSIPMRKRILPAPTLD 

LMDPHHHPGLGQRTFVDSPSIYALNMSRHPPRQLLGKPPSTETSGATTEKGPPVAAPPIAPPPAPPRRTGFSIEDIMRR* 

(SEQ ID NO: 1215) 

Name: brother of odd with entrails limited 
Classification : transcription_f actor 
Gene Symbol : bowl 
FlyBase ID: FBgn0004893 



Celera Sequence No. : 142000013384695 

CACCTCGTTGATAATTCACCAGAAGCTGAAACCGCTCGAAAATATGATCATCCTAGACATCACTTGCGACAGGGAGTTCGTCATGGACTACATTG 
TCACGGTGATCCTTACAGCATTGTCTCTTGTGCAGTACACCATATCAACTGGAGGAAATATAAGCGAGTGCGTGACCCATAAATAGCAACCCATC 
TGGACCACACTATTAGGAGGATGAGATCATTATATAACTGTGGAGCTGGACACAGTTATCGTATAATTGCGTGGGTCAAAAGAACTTCCTTAGCG 
CCTACAAAAGTGTATCTTTTGAAAGCAGAACTCAGAGCTCGTAATTATAAAGCACTAAATGCTTACTATTTAAATGCTACAAATGGTTTTATTGT 
TTTTATACAAAATTAAGGTTTATTGACTGTAGTAGTAAACTTGTCAGATACGTGGTTTTACGCACTGCGATTCAACAACACAGGATTTCCCGTTT 
GCATAGAGAATTATGAGAATTCGTTCGTTTCAAGAAAATCCCGTGACTCTAATGAGAACAAGAAAATGCTGTCACTGTCGATTTATCCCTTTAGT 
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TTCGATTTCACGTGATGCTTAAGGTCCTGTTCTCGTTTCCTCGGCAGACAAGAGATGCTCAATTGTGCATAAGATTTGCCACACTGGAGCCCAGT 
TTTATTTAGCTTACATATAGTTTAAATTTTATTTAGTAAATATGACACATGTTCCTATCTAACATATTACCTGAACTAACTATACGATTTACTTG 
TAAAGTAACAATCAACCTAAATTATTGTAGTTAAGGTCCACGTAACAGGTAGGATATATGAATCAGGATATCATGTTCCTCGACTAGCAAATCCT 
TTAATTGTTCTTATTACCTAGTACAGGCCATTGAATCTGGTATATCCTATTATACTAAGAGTGCGTACTTCAATAACCACGGCTGTTGTGAAATT 
TAAGTGGCTTTAATTGCGACTAGCCCAATCACACTATTCCAGTAGCCCATTTACAGCGCAATTTTCGCCCACTTCTCGGACTGCGCCCACAACCA 
CTGGGCCATCTGGTCGTCGAGGGCAGCGGGAGCTACCGGAGCCAGAGCGCAATCGCTGAAGTACTGACCCGACACTCTTTCGAGATCAGGATCCA 
AGGCAGCATACAAGGTGGTCTGGGCCCCGTTCTTAGGCGTCTTCATCACGGCCCAAAGGAGCGGCCTCAAAATGGTTCTAAATGAAAACCAAAAA 
TTAGTAAGTGTATAAAAAGAGAATTGAAAGAGGACACACTCTACCACATACTGAGCAAACTTTGTTTGGAAGAAGATCATGTTCCTGGCTATCTC 
CGTGTCTGCTATTCCAGGATTAAGGGCGTTGACCGTCACGCCCGTTCCCTCCAACCGCTTGGCCAGTTCGCGGGTGAAGAGTATGTTGGCCAGCT 
TACTTTGGCAGTAGGCTACTCCCTCGTCGTAAAAATCAGAGCTGTTGATGTCATCCACCTTAATCTGACCCCGTTCGTGGGCCCTGCTGGCGACC 
ACCACCACCCGACTAGGAGCCGACCTCTCCAGCACGCCGAGCAGTAGGTTGGTTAGCAGGAAGTGTCCGATGTGGTTTACACCCAGGTGCATTTC 
GAATCCCTCTTTCGTCAATCTATGCGGTTCCCAAAACACACCAGCATTGTTGATGAGGATGTGCAGCACCCTCTGCTCTTTCTTGAAGCTGTTTG 
TGAATGAATTGTTAGTTCAAGAGAAAGACACTACGGAGTGGCATGACTCACTTCTCTGCAAACTTTCGAATGGAATCCAGGGACGACAAGTCGCA 
CTCCCGTGAGAAGACATTGGAGTTTCCGGTCTCCTTTACGATTTCCCTGCGCGCTCTCTCCACCTTTTCTTTGTTGCGGCATGCCATGTAAACGG 
TGGCTCCTCTCCTGGCCAACTCCATCACGGTCTCCTTGCCAAGACCCGTATTTCCGCCAGTAACGATGGCCACTTTTCCCGTCTCATCGGTCTGC 
TTCCTAAACTTTCCGCCCTGCATATACAGCCTGATGTGTGTTAAAACAATGCATGTTAATCGGTGATAACTACTTTGTCGTCACTTTTATACGTC 
GTCGATAACGCTTCTCAGCGTTACATTTTAAATTGGCGTCTCACTTACCTCACGCAGAAGGCGATAATGCCGACAATCCCATGGGCCAAGAAGAC 
GGGTGCTATGCTTTGCAGAAACCTTATAACAATTTCCATACTGGCTAAGCGGTCCGCTCCTTCTTGTGGTCTATTCCAAAGTGCCACCGCGGGCG 
AAATTGTGCGGGTTTTTAAGAGATCGCCGCAGCGATTACCGGACGACAACTGATAAAATTGATAGTACGGGCTTCTATAAGTGTTCTTTAATCAG 
AGAACCGCTATTCGAACACATTCAAAATTGCTTATAGCGACAAGAAGCCGAAACCGTCAAACTTTACAAAGAGCGAAAATGTTGCATACTGCCGA 
AATGAACCTAAACTTTTGGACCGAAGGCTCGGGTTAAAATGTAATGGATATACACGTCTATTTTAAACAAATGATGTAGAAATTGCCCCTAACCG 
GTTTCGGGGTACGTAAGTTTGAACGCACACAAGCCCTGAGCTTTCAACTCTTATTAGTCGGCCACGTTAAAATAACTAAAACAAGACAAGTGGAA 
TTGTGTAGACTGTAAGAAACTGAAAATTTATGCGAAGTAACAGAGGGATTGTCAGAAACATATTTCACAATACGAACTATTGTATTTTAATCATC 
GTTGGCTTCGTCCACTTTTCACTGACTGCCCAAAGCCACTTTGCCGTTTGGGTGTCGGTGGCAGCCGGCGCCATTTCCTTCAGTTTGCAGTCGCT 
GAAGTATTGTCCTGTCACCTTTTCTAGCTCGGGATCCAGAGCAACATACAACGAGGTCTGAGCACCATTTCTCGGAGTCTTAACAAAGGGCCAAA 
ACAAAGGCTTAACAAAGAGXCTGAAAAGATAAGTATTTCGTAAAAAAGGTTTTAATGGCATTACTTCTCTACAAACCCAGCAAAAAAATTATTGA 
AGAATCCCATGTGCCTGATTATTTCTGTGTCCACCACACCTGGATGCAGGGCATTCGCCGTTACGTTGGTGCCCTCTAATCGTTTGGCCAACTCC 
CTTGTGAAGAGTACATTGGCCAACTTGCTCTGGCTGTAGGCCTTGCCTTCATCGTAGGATTTGTCACTGTTCAGATCGCCAGTGTTTATTTCTCC 
TCGAGTGTGTGCCAAACTGGATACATTTACAATCCGACTGGGTGAGGATTTCTAGGAATAAAATGGTTATTAATCAGATTTAAGATCTAGTAAAG 
CTGTGTGGTACCTTGAGTAGATCCAACAGTAAATTGGTAAGCAGGAAATGACCCATGTGATTAACCCCCAGCTGAAGTTCGATGCCATCTGAGGT 
GAGAGATCTGGGACAGCGCATAACACCGGCGTTGTTGATCAGTACGTGCAGGTGCTCCTGTTCCCTCTTGAAGCTGCGGAGATGGATTGCGGAAT 
AAACACAACTTAGGCCTTAGGTAGACTTACGCGGCAACAAAGTGACGGATGGACTCCTGGGAAGCCAGGTCACATTGCCGGCAGTACACATATTT 
GTTCTTCGTTTCCAGCACGATTTCCTCGCGAGCCTGCACGCGAGAAGTATTTAGTCGTGTTTTAGGAAATTATCTCAATAAAGATCGTACCTCCT 
CACATTTCTTTAGGTTTCTGCATGCCATGTAGACGGTGCCACCTCGCTTCGCAATCTCTCTCACCGTCTCCTTGCCGATCCCCGTGTTGGCACCG 
GTGACAATAAACACCTTGCCGGTCTCATTGGTTTCTTTGGTGAACTGACCGCCTTGCATAAGATCCCTGTGAGGCGGTATAAAATGTGTAGTCAA 
AACAAGGGAATTCTGTGTGAGAAGCAGATCACTCACTTCACGAAGAAGGCCAGTCCCGTCGTCGTGCCGGTGAAGCTCAGCCAGAACGCAGTGCG 
GCTCTTGAGAAACGCAAATAGTGACATTTTAGGAACTCAAAAAATATACAAATTAGCTAATAAACGAAAACAGCTGTGTTTACCCTGTTAAGGCT 
AACAGCGCTCGTTAACAGAGTCCTATCGATATTTTCGGTCATCTGTGACGGAGCTTTCCCTTTGTTGTTGTTCGTGGGCTGCTGCTTAGAACAAA 
CTTTACACGTAAATAATCTAATTGCGTTAATTAACTGGCTTGAGCTAAACAACGACGTGCGATTTGATCGAGGGAAGGAACGTAGTTGAGATCTG 
GGCCAGGAAATCCCGACTAGGCAAGCAGGAAAACAATGCGTCGCAATGCGTAAGTCAGCGCCAACTCGACTGTTGCTTTTTCCAAGAGATCCGAA 
TGTGATCGGAGTTCTCCCCTCATCTTCAGGCGCCAACAACAACACAATCAGGAAGAAATCGGCCCCGACGACGTGGAAGACATCTTTGCCCGTCC 
CGCCTACGATGACGACGAGTTCGAAGATCTGCTCCCTCTGGCCAGGAGGCCCAATGGCCACGCCCGACTCGGCCGCTACGAGAACACTCTGGAGG 
TAAAGGTCCAAGAGGGAGACACACTCCAGGCGCTGGCCCTGCGCTTTCACTCCTCTGTGGCGGACATCAAGCGCCTGAACAAGATCGACCGCGAA 
AATGAAATCCACGCACATCGGGTTATTCGCATTCCGGTTACAGTGCACAACGTCCTGCTGGGCAACGGGGGCCTCGAAGCCCTTCCTGCGGTCCA 
CCGCAGCGGGAACAACAGTCCTAGACACAATATTGAGCGGGAACCAGCTCCCGAACGCAATCCCCTCGAGGATGCGCGGCAAATGCTGGATGAGC 
GACTCCTGGTGGCGGCGGTAAATGCTTCAGGCGCCGTGGATCACGAGAAACCATCGACGAGCAGAGCTGCTGGACAATTCTACGAGGGAGCCCAG 
GGTGCGCCGAATGACGAAGGTGAGTACAGAAGATGCAGTTGGGAATGAAAGCCCACAGTTCGCCGCGGTTTTAAAAATTTCTGAAATACTTTCA 
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Start ATG: 4017 (Reverse strand: CAT) 
Transcript No. : CT40158 

TTTTTTGAGTTCCTAAAATGTCACTATTTGCGTTTCTCAAGAGCCGCACTGCGTTCTGGCTGAGCTTCACCGGCACGACGACGGGACTGGCCTTC 
TTCGTGAAGGATCTTATGCAAGGCGGTCAGTTCACCAAAGAAACCAATGAGACCGGCAAGGTGTTTATTGTCACCGGTGCCAACACGGGGATCGG 
CAAGGAGACGGTGAGAGAGATTGCGAAGCGAGGTGGCACCGTCTACATGGCATGCAGAAACCTAAAGAAATGTGAGGAGGCTCGCGAGGAAATCG 
TGCTGGAAACGAAGAACAAATATGTGTACTGCCGGCAATGTGACCTGGCTTCCCAGGAGTCCATCCGTCACTTTGTTGCCGCCTTCAAGAGGGAA 
CAGGAGCACCTGCACGTACTGATCAACAACGCCGGTGTTATGCGCTGTCCCAGATCTCTCACCTCAGATGGCATCGAACTTCAGCTGGGGGTTAA 
TCACATGGGTCATTTCCTGCTTACCAATTTACTGTTGGATCTACTCAAGAAATCCTCACCCAGTCGGATTGTAAATGTATCCAGTTTGGCACACA 
CTCGAGGAGAAATAAACACTGGCGATCTGAACAGTGACAAATCCTACGATGAAGGCAAGGCCTACAGCCAGAGCAAGTTGGCCAATGTACTCTTC 
ACAAGGGAGTTGGCCAAACGATTAGAGGGCACCAACACTCCGAGAAATGGTGCTCAGACCTCGTTGTATGTTGCTCTGGATCCCGAGCTAGAAAA 
GGTGACAGGACAATACTTCAGCGACTGCAAACTGAAGGAAATGGCGCCGGCTGCCACCGACACCCAAACGGCAAAGTGGCTTTGGGCAGGCGGAA 
AGTTTAGGAAGCAGACCGATGAGACGGGAAAAGTGGCCATCGTTACTGGCGGAAATACGGGTCTTGGCAAGGAGACCGTGATGGAGTTGGCCAGG 
AGAGGAGCCACCGTTTACATGGCATGCCGCAACAAAGAAAAGGTGGAGAGAGCGCGCAGGGAAATCGTAAAGGAGACCGGAAACTCCAATGTCTT 
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CTCACGGGAGTGCGACTTGTCGTCCCTGGATTCCATTCGAAAGTTTGCAGAGAACTTCAAGAAAGAGCAGAGGGTGCTGCACATCCTCATCAACA 

ATGCTGGTGTGTTTTGGGAACCGCATAGATTGACGAAAGAGGGATTCGAAATGCACCTGGGTGTAAACCACATCGGACACTTCCTGCTAACCAAC 

CTACTGCTCGGCGTGCTGGAGAGGTCGGCTCCTAGTCGGGTGGTGGTGGTCGCCAGCAGGGCCCACGAACGGGGTCAGATTAAGGTGGATGACAT 

CAACAGCTCTGATTTTTACGACGAGGGAGTAGCCTACTGCCAAAGTAAGCTGGCCAACATACTCTTCACCCGCGAACTGGCCAAGCGGTTGGAGG 

GAACGGGCGTGACGGTCAACGCCCTTAATCCTGGAATAGCAGACACGGAGATAGCCAGGAACATGATCTTCTTCCAAACAAAGTTTGCTCAGTAT 

GTGGTAGAGTGTGTCCTCTTTCAATTCTCTTTTTATACACTTACTAATTTTTGGTTTTCATTTAGAACCATTTTGAGGCCGCTCCTTTGGGCCGT 

GATGAAGACGCCTAAGAACGGGGCCCAGACCACCTTGTATGCTGCCTTGGATCCTGATCTCGAAAGAGTGTCGGGTCAGTACTTCAGCGATTGCG 

CTCTGGCTCCGGTAGCTCCCGCTGCCCTCGACGACCAGATGGCCCAGTGGTTGTGGGCGCAGTCCGAGAAGTGGGCGAAAATTGCGCTGTAA 

(SEQ ID NO: 1217) 

Start ATG: 18 (Reverse strand: CAT) 

MSLFAFLKSRTAFWLSFTGTTTGLAFFVKDLMQGGQFTKETNETGKVFIVTGANTGIGKETVREIAKRGGTVYMACRNLKKCEEAREEIVLETKN 

KYVYCRQCDLASQESIRHFVAAFKREQEHLHVLINNAGVMRCPRSLTSDGIELQLGVNHMGHFLLTNLLLDLLKKSSPSRIVNVSSLAHTRGEIN 

TGDLNSDKSYDEGKAYSQSKLAWVLFTRELAKRLEGTNTPRNGAQTSLYVALDPELEKVTGQYFSDCKLKEMAPAATDTQTAKWLWAGGKFRKQT 

DETGKVAIVTGGNTGLGKETVMELARRGATVYMACRNKEKVERARREIVKETGNSNVFSRECDLSSLDSIRKFAENFKKEQRVLHILINNAGVFW 

EPHRLTKEGFEMHLGVNHIGHFLLTNLLLGVLERSAPSRVVVVASRAHERGQIKVDDINSSDFYDEGVAYCQSKLANILFTRELAKRLEGTGVTV 

NALNPGIADTEIARNMIFFQTKFAQYWECVLFQFSFYTLTNFWFSFRTILRPLLWAVMKTPKNGAQTTLYAALDPDLERVSGQYFSDCALAPVA 

PAALDDQMAQWLWAQS EKWAKI AL * 

(SEQ ID NO: 1218) 

Classification: enzyme 



Celera Sequence No. : 142000013384703 

TTTGTTGTCTGCATTTACCTTAAATGAATTTATTAACTAAGGCGATATTGTGCCTCTCGCTCAATCTCGCAAAACGATGCTTTGTCTGCTGCTTC 

TCTGTGCGTGTGTCCGTCTCTTTCCACGGCTTTTCGTCGAGACTCACATATACAAACGGCGACACACTGGATTGGAGCTGGACTCCGTTTTGGAT 

GGCTTGGCTTTGGCTGCGTACCCAAAATAAATCGCGCTTGCAGTCCCAAATCATTTTAGTTCTTGACGCCTTTGTTCCACAAAAACTTCCAACCG 

CCGTCCATGGCTCGTGCTTTCTCTTTCGCTCGCGAGACGCTCCAAAAGTCGCGAGTATGGGAAGAGAATGTATCCCTCGCTCTTTCACGCCTAGA 

AGCGGCTGTAGCACACACAAGCACACGTAAACCCAGCCGTATGCATATACATATATTCCCTCTACTAGCGAGACGTGTGTACGTGCGTATGTATG 

TATGCTTTTTTCACTTCTTTCTCCTCTGCGTTCTTTGCACCAAGTGTCGCTGTTGCTATTTGCTACCGCCGCTCCTCTTGACGAATAAGTCGGGA 

CTTGTATCAGCTCTAATTCGGACTTAGTTTACAAAATGCGGCTGCTTTTTTTCCGCGATACTCGGATTAGTGATGTAAAAAGACGTTGATGCTTC 

GACCACTGAAGTCAGTGCGATATGTATCGATAATTGAATGCCATCGATTGCTACGATTAAAACTGCCACCGCAGTTAGCGGAGACGCTCACCATT 

CTCTATTCGCCATTTGAAAGTATCGGTGTAATTTGTTTTCAGAGAAATTAATTTCCGTTTACTGTGCAATTCGGTGTGAAAGTGTTCAGATTTAT 

CAATGCGTATTCTGCTTTCGACTTCGCCACCAATCTGTGCTGCAAGTTACCATTACCAGGTCCACCTGGTTCCCGCCAGTTTTCTTTCATTGTGG 

CT AGT T GT T GT T CGT GC C TT C G AT A AAG AC GT T T AG AGGT GTTTTTAGAGTTT C GC C AT C T G GT C AC T AT AGC CGT T T C GT T TT T TAC GT G AGT A 

TTGTGAATTTGGTGTGTTGATTTATATCTCAGTTGGAGCCTGCGTGGAAATAGTGTCAGTACGTTTAAAGGCATCATCGTAAGGTTAGTCGGAAA 

ACAAATAAAAAGCCTACAAATAGCAAGCAAGAACGCTTTCGCAACCACAAGAAAAACGAACATACGCACGCATCCATCTGTGGTAGAGTCGGCTT 

GTGTGGGGGTGAGTGGATGTGTGGGTGAGCATTTCTGGAAAGTGCCGTTCTCTATGTTTCGATTCTACTTATCCCTTCATCACCATTTCATCGTT 

GAATCATCACGAAAGAAACATCGAAAGTGCACAATCGTTTGTTATCTTTGTACGAAAACAACGGTGATTTCCACACAGGCATAACCTGCAAGAGT 

AATACGATACAAATTTTGATCTGATATGACCTCATTTTAGTCGGTTTGAATTTTAAAGAGAGCGACTCTGCAAACTGCAGGGATATGAACATACA 

AAATTCACAAATGATTGTTTGCATCCTAGTTGCTTATATTCATTTACACGCATGCTTATATGCTTACATCTGTTATCAATAATTGACGACTTGGT 

GCATGCGAAGCAAACGCATTGAAATGACTATTGGGATCCACACACTGTTAAAAAACACACTGATTGCACTGCCACACATTTCTCAACTAGAACCT 

AATGTTTTATACTGAACTTAACTGCAGGAAAGCCCAAAATGCCCAGCATCAAGTTGCAATCTTCGGATGAGGAGATCTTTGACACGGATATCCAG 

ATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAGGACGATGAGAATGCCATTGTGCCGTTGCCCAATGTGAATTC 

GACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCAGCCAACGGAGGATGATGAGAGCAAGGAGAAGCGCACAGACG 

ACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGCACACTGTTTGAGCTGATATTGGCAGCGAACTATCTGGACATTAAGGGCCTT 

CTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAAGACTCCCGAGGAAATACGCAAGACCTTCAACATTAAGAAGGACTTTTCGCC 

CGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAGCGCGGCATTTCGCGGGACCAACATTAAGTTGAAACAGCTAG 

GGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACTTTCATTTACATTTTTTTTTATTTTTAACCCCAGCAGAGACT 

CGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATTTTTTGTCGTAAATATATCTCATATACATACATATGCGAGAT 

TGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACTTTAATAAATATAACTACCCAACACAAAGACACAGCCTATTTTGTGGGCTTC 

CCGGAGGGGTTAGGGTCCAAGGACTTCTCAAATTAAGAAGTGCTCTTTATTCGTTATCTTGTGTTTAGAAACGTTTGCAAGCGTTTGAAATGGAT 

ACGCATGCCATTTATGGTCCAGTCGGTAAAATTTTACGTGTTCCCAAGAAGGTACATATATATCTTAGATTATCTTATCTATAGAAACTACATTC 

ATAAGATCACGCTTTCGGCGAATAGCGAGTAATACTGTAGCTGCCACTTTGCCACTTAGTGTGCAATTGAAATGCGCGCCAACGCATGTTCTCAT 

ATGCGGAGCGTTTAGCAGCCCTTAAAAAGCTTTTCATGTTGCTTTTAAAAAGCTACGACGGGTGTGGAAATTAGATTTGTGACGCTTTATATATC 

TTTTTTTTATATCGTCTGGGGTACAGCATGGTTTTAGGATTTTAGAAGAAAGCAGTAATTTTGTAAGAATCGAAGAACGGGGGGATATTTATAAT 

TTTCATATTAACCTTGTCTTTGTTTGCTTTTGAAATTATGAATACTTCGAAAGATAATTATCTTGTGCGCATTCCATTCTGGCTTAGACTGGGTT 

GAGCTAAAAATGATTTGAAATACAAATTTTTAGGTGTCCGCTGATATATAAACAGTATGATGTTTTTGCACTTTTTTCCCTGAAGTTTTCCTTGT 

TGTCTCGCTTGCCAATGCTGAACAAAGTAGCCAAAGGACCTTTACTTGAAGCTTTGAGAGCGGAAAATATATTCGGGGACAATTGCTTTAAGTGT 

GCACTAGGCTTGGCTAGATGCCCGTGCCAAACTTTCTTTGGAATTGATCGCTGGCCTTAGCCAAAAGTGGAAAAAAGAAAGGATAGGATAGTGCC 

GCACGTATTAAGGGTCTTCTTACAAAAGGTAACTGATATTAATTCATTTGCATACTAAGCAAGCTAACTCATCTGTTTATTGCAAACAAGCGCCA 

ATTGAAGTTTTATCGAAGCTCAG 

(SEQ ID NO: 1219) 

Exon: 1001.. 1128 
Exon: 1738.. 2538 
Start ATG: 1749 

Transcript No. : CT40292 
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TTTCGCCATCTGGTCACTATAGCCGTTTCGTTTTTTACGTGAGTATTGTGAATTTGGTGTGTTGATTTATATCTCAGTTGGAGCCTGCGTGGAAA 
TAGTGTCAGTACGTTTAAAGGCATCATCGTAAGGAAAGCCCAAAATGCCCAGCATCAAGTTGCAATCTTCGGATGAGGAGATCTTTGACACGGAT 
ATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAGGACGATGAGAATGCCATTGTGCCGTTGCCCAATGT 
GAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCAGCCAACGGAGGATGATGAGAGCAAGGAGAAGCGCA 
CAGACGACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGCACACTGTTTGAGCTGATATTGGCAGCGAACTATCTGGACATTAAG 
GGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAAGACTCCCGAGGAAATACGCAAGACCTTCAACATTAAGAAGGACTT 
TTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAGCGCGGCATTTCGCGGGACCAACATTAAGTTGAAAC 
AGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACTTTCATTTACATTTTTTTTTATTTTTAACCCCAGCA 
GAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATTTTTTGTCGTAAATATATCTCATATACATACATATG 
CGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACTTTAATAAATATAACTACCCAACAC 
(SEQ ID NO: 1220) 

Start ATG: 140 

MPSIKLQSSDEEIFDTDIQIAKCSGTIKTMLEDCGMEDDENAIVPLPNVNSTILRKVLTWAHYHKDDPQPTEDDESKEKRTDDIISWDADFLKVD 

QGTLFELILAANYLDIKGLLELTCKTVANMIKGKTPEEIRKTFNIKKDFSPAEEEQVRKENEWCEEK* 

(SEQ ID NO; 1221) 

Classification : cell_cycle_regulator 
Gene Symbol: skpA 
FlyBase ID: FBgn0025637 



Celera Sequence No. : 142000013384703 

GAAGAGAATGTATCCCTCGCTCTTTCACGCCTAGAAGCGGCTGTAGCACACACAAGCACACGTAAACCCAGCCGTATGCATATACATATATTCCC 
TCTACTAGCGAGACGTGTGTACGTGCGTATGTATGTATGCTTTTTTCACTTCTTTCTCCTCTGCGTTCTTTGCACCAAGTGTCGCTGTTGCTATT 
TGCTACCGCCGCTCCTCTTGACGAATAAGTCGGGACTTGTATCAGCTCTAATTCGGACTTAGTTTACAAAATGCGGCTGCTTTTTTTCCGCGATA 
CTCGGATTAGTGATGTAAAAAGACGTTGATGCTTCGACCACTGAAGTCAGTGCGATATGTATCGATAATTGAATGCCATCGATTGCTACGATTAA 
AACTGCCACCGCAGTTAGCGGAGACGCTCACCATTCTCTATTCGCCATTTGAAAGTATCGGTGTAATTTGTTTTCAGAGAAATTAATTTCCGTTT 
ACTGTGCAATTCGGTGTGAAAGTGTTCAGATTTATCAATGCGTATTCTGCTTTCGACTTCGCCACCAATCTGTGCTGCAAGTTACCATTACCAGG 
TCCACCTGGTTCCCGCCAGTTTTCTTTCATTGTGGCTAGTTGTTGTTCGTGCCTTCGATAAAGACGTTTAGAGGTGTTTTTAGAGTTTCGCCATC 
TGGTCACTATAGCCGTTTCGTTTTTTACGTGAGTATTGTGAATTTGGTGTGTTGATTTATATCTCAGTTGGAGCCTGCGTGGAAATAGTGTCAGT 
ACGTTTAAAGGCATCATCGTAAGGTTAGTCGGAAAACAAATAAAAAGCCTACAAATAGCAAGCAAGAACGCTTTCGCAACCACAAGAAAAACGAA 
CATACGCACGCATCCATCTGTGGTAGAGTCGGCTTGTGTGGGGGTGAGTGGATGTGTGGGTGAGCATTTCTGGAAAGTGCCGTTCTCTATGTTTC 
GATTCTACTTATCCCTTCATCACCATTTCATCGTTGAATCATCACGAAAGAAACATCGAAAGTGCACAATCGTTTGTTATCTTTGTACGA7AAACA 
ACGGTGATTTCCACACAGGCATAACCTGCAAGAGTAATACGATACAAATTTTGATCTGATATGACCTCATTTTAGTCGGTTTGAATTTTAAAGAG 
AGCGACTCTGCAAACTGCAGGGATATGAACATACAAAATTCACAAATGATTGTTTGCATCCTAGTTGCTTATATTCATTTACACGCATGCTTATA 
TGCTTACATCTGTTATCAATAATTGACGACTTGGTGCATGCGAAGCAAACGCATTGAAATGACTATTGGGATCCACACACTGTTAAAAAACACAC 
TGATTGCACTGCCACACATTTCTCAACTAGAACCTAATGTTTTATACTGAACTTAACTGCAGGAAAGCCCAAAATGCCCAGCATCAAGTTGCAAT 
CTTCGGATGAGGAGATCTTTGACACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAGGACGAT 
GAGAATGCCATTGTGCCGTTGCCCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCAGCCAAC 
GGAGGATGATGAGAGCAAGGAGAAGCGCACAGACGACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGCACACTGTTTGAGCTGA 
TATTGGCAGCGAACTATCTGGACATTAAGGGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAAGACTCCCGAGGAAATA 
CGCAAGACCTTCAACATTAAGAAGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAGCGCGGC 
ATTTCGCGGGACCAACATTAAGTTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACTTTCATT 
TACATTTTTTTTTATTTTTAACCCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATTTTTTGT 
CGTAAATATATCTCATATACATACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACTTTAATAAATATAACTA 
CCCAACACAAAGACACAGCCTATTTTGTGGGCTTCCCGGAGGGGTTAGGGTCCAAGGACTTCTCAAATTAAGAAGTGCTCTTTATTCGTTATCTT 
GTGTTTAGAAACGTTTGCAAGCGTTTGAAATGGATACGCATGCCATTTATGGTCCAGTCGGTAAAATTTTACGTGTTCCCAAGAAGGTACATATA 
TATCTTAGATTATCTTATCTATAGAAACTACATTCATAAGATCACGCTTTCGGCGAATAGCGAGTAATACTGTAGCTGCCACTTTGCCACTTAGT 
GTGCAATTGAAATGCGCGCCAACGCATGTTCTCATATGCGGAGCGTTTAGCAGCCCTTAAAAAGCTTTTCATGTTGCTTTTAAAAAGCTACGACG 
GGTGTGGAAATTAGATTTGTGACGCTTTATATATCTTTTTTTTATATCGTCTGGGGTACAGCATGGTTTTAGGATTTTAGAAGAAAGCAGTAATT 
TTGTAAGAATCGAAGAACGGGGGGATATTTATAATTTTCATATTAACCTTGTCTTTGTTTGCTTTTGAAATTATGAATACTTCGAAAGATAATTA 
TCTTGTGCGCATTCCATTCTGGCTTAGACTGGGTTGAGCTAAAAATGATTTGAAATACAAATTTTTAGGTGTCCGCTGATATATAAACAGTATGA 
TGTTTTTGCACTTTTTTCCCTGAAGTTTTCCTTGTTGTCTCGCTTGCCAATGCTGAACAAAGTAGCCAAAGGACCTTTACTTGAAGCTTTGAGAG 
CGGAAAATATATTCGGGGACAATTGCTTTAAGTGTGCACTAGGCTTGGCTAGATGCCCGTGCCAAACTTTCTTTGGAATTGATCGCTGGCCTTAG 
CCAAAAGTGGAAAAAAGAAAGGATAGGATAGTGCCGCACGTATTAAGGGTCTTCTTACAAAAGGTAACTGATATTAATTCATTTGCATACTAAGC 
AAGCTAACTCATCTGTTTATTGCAAACAAGCGCCAATTGAAGTTTTATCGAAGCTCAG 
{SEQ ID NO: 1222) 

Exon: 1001. .1078 
Exon: 1393.. 2193 
Start ATG: 1404 

Transcript No. : CT40294 

AAACATCGAAAGTGCACAATCGTTTGTTATCTTTGTACGAAAACAACGGTGATTTCCACACAGGCATAACCTGCAAGAGAAAGCCCAAAATGCCC 
AGCATCAAGTTGCAATCTTCGGATGAGGAGATCTTTGACACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTG 
CGGCATGGAGGACGATGAGAATGCCATTGTGCCGTTGCCCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGG 
ACGACCCCCAGCCAACGGAGGATGATGAGAGCAAGGAGAAGCGCACAGACGACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGC 
ACACTGTTTGAGCTGATATTGGCAGCGAACTATCTGGACATTAAGGGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAA 
GACTCCCGAGGAAATACGCAAGACCTTCAACATTAAGAAGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGG 
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AGAAGTAAAGCGCGGCATTTCGCGGGACCAACATTAAGTTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTT 
AGTTGCTACTTTCATTTACATTTTTTTTTATTTTTAACCCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGC 
GGGGTTCATTTTTTGTCGTAAATATATCTCATATACATACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACT 
TTAATAAATATAACTACCCAACAC 
{SEQ ID NO: 1223) 

Start ATG: 90 

MPSIKLQSSDEEIFDTDIQIAKCSGTIKTMLEDCGMEDDENAIVPLPNVWSTILRKVLTWAHYHKDDPQPTEDDESKEKRTDDIISWDADFLKVD 

QGTLFELILAANYLDIKGLLELTCKTVANMIKGKTPEEIRKTFNIKKDFSPAEEEQVRKENEWCEEK* 

(SEQ ID NO: 1224) 

Classification: cell_cycle_regulator 
Gene Symbol: skpA 
FlyBase ID: FBgn0025637 



Celera Sequence No. : 142000013384566 

TTGTAAGGCGTATCACTTGATTCCGCACTCTACTCAACTCTCCAAGGACCTCTGTAATCTGATAGGCTTACTGAACCAACAGAGAGTAGAAACAA 
AGTTTTGCTTTATTGTGATTTTACGAAATAACCTACAACTTCACCATTCCCAAAAACACAATTTCTTTTTCCTTAAACATTCCAGTTTTAAACAC 
CTTTTTTACAGTTTTTTCGTTGTAAGTTGGCCAAAATATGTCACAAGGCTGAAAAACGCACTTTTTTAGTCAGCTGGGGTCCTCAGCTAAAGACC 
CATATCAATATTTGCTTTAATTTGGAAAAATAATTTTTCGGCCAAATTTCGCATTTTTTGTAAGGGGTAACATCATAAAAAATCGAAAAAATTTT 
CAAACCAAAAATTTCTCTAAATGAGTTACCCGTCGTTTAGGAATATTTTTACAAGTTCAACGAGGTATTCCACTCCACAATTGGACGTGTCTTTC 
CCAAGATATAGCACTTTTTCTGTGAAATCCCCATCGAAAATTATCAACAAGAATAAAATCACCCTCGTTTTCCATCTGAAAAAACTCCCTCAAAA 
ACAAACGCATATACAGTTTATTTCGTTACATATATATTGTTATTTATTTAAGAACTTGCTCGTCTAATTGACAGAAAAATATAAAGATATAGGAT 
GTATTGAATGGGTTGTGCTGAGTGTCTTACTATGCGAATGAGCGATTCCACTTAAGCCTAGGCGAAATCTCGGCGATTGCATAATGTGGAGCCCC 
CTCCCGAGGTGTACGGAAATCACAAGTGGTTGATAAGAAAGAACAGACGTGGAATTTTGCTGTGCTGGTTTGAGGATCGCAGTCGAACGGATAAT 
TTATTGGCCAGTCGCCGACTTGGCTGGTGATGATTTAATAGCGGACCCTCGGGAAGGCTCTCTGGGCTGACAGATAACAGATAACTCGACTGGCT 
GATAATTTTGATGCGTGACTGGGCGCTGCGCCTTCTCTATCTCCACCTCCTTACAGGGTCTTGTCGAACAGGAACTCGCCCAGTTCGCCGTTGGT 
GTCCATCATCTTCTTGAGAGTGGTCAGCTTGCCGGCGAGCTCGCGCTGTCCGTGGAGCTGCTCCTCCAGATAGACACCGGTCAGGTAGTCCACCA 
GGTGGTAGTGGTTGTAGGGCTTGTTCTCGCAGGTCTGGATCAGCTTGCGGATGGACTTGGTCACCTTGATCTCCAGGTCGAGGGCGTCGGACAGG 
GCGGCGGCACCATCGGTCCACTCCTGCTTGGCCACAGTCTGTGGGGGATTCGCCAGGATTAGCTGGGCTTTTGGGTTTTCTAGGCGAGTCGTCGA 
CTTACCGGCACATTGATCAGATCGCTGACTCCCTCGGTCAGTTGACCGCGCATGGACAGGTACTCCACCAGCTTGGATCCGTGCTCACGTTCCTC 
CTTGGCGGCCTTGAAGAAGTGCTCGGCGAATCCAGGGCGGTTGACGGTGTCGCGGGAGAAGTAGGCGCCCATGGCCAAGTACTGGTAGGAGGCGT 
TGATCTCCTCCTGGATCTGATTGCGCATGCCCTTTATGCAGGCATCCTTCATGTCCACCCAGTCCTTGGTAATCTCAGGAACAGCCAGGGAGCCT 
GCAATGGGATAAGGTTGGGTTAAGTCGGGGGACCACGATAACAATATA7GGCTAGTATTCAGCTTTAAAAAATTTATTTTTGTAATCAGAACTTA 
GGTAAATTTTAATATTTTAAAAACCTTTGCACTCTTGGTCTTGACCTTCATGGCTTAGGTTTTGTTTTTCTTTTAGTTGATGCTCTTCTCGCTCC 
CTTTGAAGCTCTCTCACGAAACTCTTGCTTTCTTGCTTGGCTGAGGGGTAGGGCGTTGTTGGTTATTGCATTTAAGAGGGCGTAATTGCATAAAC 
CTTTGAAATTGCGCTTGAAATGCCGCTATTCTAGGGCACAAAAACCGTGTGCAAAACTGTAATGGCCTACTCTCCCACTCTCTCTCTCTGTTTTT 
GTAACTGCATTGGTAACGTGCGCAGCTCTCGCCGCCATAGTAACGGCATTTGTTTTCGTGTAGTGCCGTGTGCAAAAAGCATGGTTCACCATGCT 
AGCTCACTCGGCCCGTAAAAGCCGTGTGCAAAGAGCGCCTTGAAATGTAACAACAAAAAGTCAAGCTTATTTATACCGCAATATTCATGGAAATG 
AAAGCGAAATTCGCATTATTTAATTATGAAAATTTATGTATAAAAACTGGTTATCATTTCCCCAAAGCGCAACGGAAGTATCAATAACTTTTAAA 
TCAACTGGTTTGGTAACTTCACTTCATTGACGAAAGAGAGTGAAAGAGCGTCAAAGAGAAACGGCGCATGCTCTTTTCAACGCATTACAAATGTT 
TTGCTTCTCTTTGCGTATTGCGTCGAAATTTATTGTTTTTTTTCCGATGACCTGCCTTTGTTTTGTTTGAAATTCTGATGAATAAAACGTAACAT 
TAGTACTTACACTTGAAATCTCCATAGGCCTGGGCCACCACGGCCAACAGGAGCAGGCTAGCAATTAGTTTCACCATCTTTGATCGTCGAACGTA 
GTCTTTACACAAAATCACAGAACAAAAATTTGTTTTTCACTTTAGTTTGATTTGCTTGAAATGCACAAATGAGAATGAGGCGACTCGAGTTGAGC 
TACAATGTTAAATAGGCGTTGAAATTGAAAATCTAGCTGCCTTTACACACACTGGCGCAGAAGGCGTCGCGTGCTTCAATTTGATGGGCTTGATT 
GGGATTATTTTCGACTTCTCGAATGCTTTCCAGTACCTTCTGCTGACAGCTTTAATCTGATTTTTCAGTGGAGCTAAGGCCTTTCTACGTTGTGT 
TTTAGCGAACTCGAAAGCGAACTGTCGCTCCGAACGGCGCACAAAACACTTTTAAGCGCGCGCGCTCTCGGTTTTTCTTCGTGTGTGTGTGAGTG 
ATGGGAGAGTGTGCACTGTTATCTTTTCAACATGTCGCCCGTTCTTTGTTAACCGTTTAAATGGCTGATAATTGAATATTCCCGATAATTGCCGC 
ACTCAATGTGATACACACGTCCTCAATATGGGTATACTTTGTATAAAGTCGGGTTTCCTAGTCAGATAATCCCCACTTACTAAGAAGTAAATTTA 
TCAAATGCAATATGGCCGCTCTTTTGAGATGCGCATCAGCTGTTCTTTGTTTTTGTTTTGGCCGTGGGAGAGCACCACGTCATGTGTTGCCAGGC 
GGTTTAATTATTGCTGACAATTTAATTCACATTTTGTCAAGTTGATTTGCACGTTCGAAGGGGGGATCAGTCACCCACTTTGATAACTGCTAATA 
TGGCGACGGCTCTTTTTTACGTGTCGCTTGCAGAGCTCCAAGAATGACACTTGATAAGAAGGATACATGGTGGGAAAACTTACAAATTACATAAT 
TTGTTTGTTGATTGGGGAAACCCTACATTATCTCCGAACCCAGCAATAATATGTATTGTAAATAGCAACAGTCTCAAGCTCGATGCTGCTGTGAA 
TTACCAATATATCAAAACTTAAAACATTATCTTAGGAGGTAGATGGTTTATCTGCAGTTTATTTTCAATTCTCATCGTTAGCTGATAAAACTCTC 
AGGAAACCACTTGAAATCCAAAGTGCAGCGTCCATATTTTGCACACGCTGACAACCCTGTCGTGAGTCACCTTCCAGCAGGCGCAGCAACAACAG 
CACTGGCAGCGGCCATATTTGAATGACACCGAAATCCCGCACTGAGCGCAAAACAACACATGCCCATGCACGCTCTCCAAAACCACACACACACA 
GCCATATCGATCGCATAGAAAGAGACGTACGCGACGCTATAAACAGCAGTTGATTTTCAGAGTTTCACCAAGAAGCACATAAAGCGATTGAATTA 
TTAATAAAAGCACAGGCACTGGAAGTGAGTTTTAATATTACTAACTAACTACTTTGTGAGTGTTTAGTTACGAAAACTAGAAAATAATATTCTGG 
AAACTGTGTTTTTAAACCGACCACCATATAAATTTAAAAACCAAAACATTCCATTGGTGGTGGTTTTGTTGTTAAAAAAGAGCACAGAAAATTCA 
AG AAAACGCT ACGACGC AAAACGAAAAAACTCGCCC AAAATGAAAAC AAAAGT GT C 
(SEQ ID NO: 1225) 

Exon: 3141.. 2792 
Exon: 2659. .2481 
Exon: 1613.. 1336 
Exon: 1273.. 1001 

Start ATG: 2547 (Reverse strand: CAT) 
Transcript No, : CT40862 
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ATTTGATAAATTTACTTCTTAGTAAGTGGGGATTATCTGACTAGGAAACCCGACTTTATACAAAGTATACCCATATTGAGGACGTGTGTATCACA 
TTGAGTGCGGCAATTATCGGGAATATTCAATTATCAGCCATTTAAACGGTTAACAAAGAACGGGCGACATGTTGAAAAGATAACAGTGCACACTC 
TCCCATCACTCACACACACACGAAGAAAAACCGAGAGCGCGCGCGCTTAAAAGTGTTTTGTGCGCCGTTCGGAGCGACAGTTCGCTTTCGAGTTC 
GCTAAAACACAACGTAGAAAGGCCTTAGCTCCACTGAAAAATCAGATTAAAGCTGTCAGCAGAAGCTCAACTCGAGTCGCCTCATTCTCATTTGT 
GCATTTCAAGCAAATCAAACTAAAGTGAAAAACAAATTTTTGTTCTGTGATTTTGTGTAAAGACTACGTTCGACGATCAAAGATGGTGAAACTAA 
TTGCTAGCCTGCTCCTGTTGGCCGTGGTGGCCCAGGCCTATGGAGATTTCAAGTGCTCCCTGGCTGTTCCTGAGATTACCAAGGACTGGGTGGAC 
ATGAAGGATGCCTGCATAAAGGGCATGCGCAATCAGATCCAGGAGGAGATCAACGCCTCCTACCAGTACTTGGCCATGGGCGCCTACTTCTCCCG 
CGACACCGTCAACCGCCCTGGATTCGCCGAGCACTTCTTCAAGGCCGCCAAGGAGGAACGTGAGCACGGATCCAAGCTGGTGGAGTACCTGTCCA 
TGCGCGGTCAACTGACCGAGGGAGTCAGCGATCTGATCAATGTGCCGACTGTGGCCAAGCAGGAGTGGACCGATGGTGCCGCCGCCCTGTCCGAC 
GCCCTCGACCTGGAGATCAAGGTGACCAAGTCCATCCGCAAGCTGATCCAGACCTGCGAGAACAAGCCCTACAACCACTACCACCTGGTGGACTA 
CCTGACCGGTGTCTATCTGGAGGAGCAGCTCCACGGACAGCGCGAGCTCGCCGGCAAGCTGACCACTCTCAAGAAGATGATGGACACCAACGGCG 
AACTGGGCGAGTTCCTGTTCGACAAGACCCTGTAA 
(SEQ ID NO: 1226) 

Start ATG: 4 63 (Reverse strand: CAT) 

MVKLIASLLLLAVVAQAYGDFKCSLAVPEITKDWVDMKDACIKGMRNQIQEEINASYQYLAMGAYFSRDTVNRPGFAEHFFKAAKEEREHGSKLV 
EYLSMRGQLTEGVSDLINVPTVAKQEWTDGAAALSDALDLEIKVTKSIRKLIQTCENKPYNHYHLVDYLTGVYLEEQLHGQRELAGKLTTLKKMM 
DTNGELGEFLFDKTL* 
{SEQ ID NO: 1227) 

Name: FERRITIN RELATED PROTEIN 
Classification : ligandjoinding_or_carrier 



Celera Sequence No. : 142000013384566 

TTGTAAGGCGTATCACTTGATTCCGCACTCTACTCAACTCTCCAAGGACCTCTGTAATCTGATAGGCTTACTGAACCAACAGAGAGTAGAAACAA 
AGTTTTGCTTTATTGTGATTTTACGAAATAACCTACAACTTCACCATTCCCAAAAACACAATTTCTTTTTCCTTAAACATTCCAGTTTTAAACAC 
CTTTTTTACAGTTTTTTCGTTGTAAGTTGGCCAAAATATGTCACAAGGCTGAAAAACGCACTTTTTTAGTCAGCTGGGGTCCTCAGCTAAAGACC 
CATATCAATATTTGCTTTAATTTGGAAAAATAATTTTTCGGCCAAATTTCGCATTTTTTGTAAGGGGTAACATCATAAAAAATCGAAAAAATTTT 
CAAACCAAAAATTTCTCTAAATGAGTTACCCGTCGTTTAGGAATATTTTTACAAGTTCAACGAGGTATTCCACTCCACAATTGGACGTGTCTTTC 
CCAAGATATAGCACTTTTTCTGTGAAATCCCCATCGAAAATTATCAACAAGAATAAAATCACCCTCGTTTTCCATCTGAAAAAACTCCCTCAAAA 
ACAAACGCATATACAGTTTATTTCGTTACATATATATTGTTATTTATTTAAGAACTTGCTCGTCTAATTGACAGAAAAATATAAAGATATAGGAT 
GTATTGAATGGGTTGTGCTGAGTGTCTTACTATGCGAATGAGCGATTCCACTTAAGCCTAGGCGAAATCTCGGCGATTGCATAATGTGGAGCCCC 
CTCCCGAGGTGTACGGAAATCACAAGTGGTTGATAAGAAAGAACAGACGTGGAATTTTGCTGTGCTGGTTTGAGGATCGCAGTCGAACGGATAAT 
TTATTGGCCAGTCGCCGACTTGGCTGGTGATGATTTAATAGCGGACCCTCGGGAAGGCTCTCTGGGCTGACAGATAACAGATAACTCGACTGGCT 
GATAATTTTGATGCGTGACTGGGCGCTGCGCCTTCTCTATCTCCACCTCCTTACAGGGTCTTGTCGAACAGGAACTCGCCCAGTTCGCCGTTGGT 
GTCCATCATCTTCTTGAGAGTGGTCAGCTTGCCGGCGAGCTCGCGCTGTCCGTGGAGCTGCTCCTCCAGATAGACACCGGTCAGGTAGTCCACCA 
GGTGGTAGTGGTTGTAGGGCTTGTTCTCGCAGGTCTGGATCAGCTTGCGGATGGACTTGGTCACCTTGATCTCCAGGTCGAGGGCGTCGGACAGG 
GCGGCGGCACCATCGGTCCACTCCTGCTTGGCCACAGTCTGTGGGGGATTCGCCAGGATTAGCTGGGCTTTTGGGTTTTCTAGGCGAGTCGTCGA 
CTTACCGGCACATTGATCAGATCGCTGACTCCCTCGGTCAGTTGACCGCGCATGGACAGGTACTCCACCAGCTTGGATCCGTGCTCACGTTCCTC 
CTTGGCGGCCTTGAAGAAGTGCTCGGCGAATCCAGGGCGGTTGACGGTGTCGCGGGAGAAGTAGGCGCCCATGGCCAAGTACTGGTAGGAGGCGT 
TGATCTCCTCCTGGATCTGATTGCGCATGCCCTTTATGCAGGCATCCTTCATGTCCACCCAGTCCTTGGTAATCTCAGGAACAGCCAGGGAGCCT 
GCAATGGGATAAGGTTGGGTTAAGTCGGGGGACCACGATAACAATATATGGCTAGTATTCAGCTTTAAAAAATTTATTTTTGTAATCAGAACTTA 
GGTAAATTTTAATATTTTAAAAACCTTTGCACTCTTGGTCTTGACCTTCATGGCTTAGGTTTTGTTTTTCTTTTAGTTGATGCTCTTCTCGCTCC 
CTTTGAAGCTCTCTCACGAAACTCTTGCTTTCTTGCTTGGCTGAGGGGTAGGGCGTTGTTGGTTATTGCATTTAAGAGGGCGTAATTGCATAAAC 
CTTTGAAATTGCGCTTGAAATGCCGCTATTCTAGGGCACAAAAACCGTGTGCAAAACTGTAATGGCCTACTCTCCCACTCTCTCTCTCTGTTTTT 
GTAACTGCATTGGTAACGTGCGCAGCTCTCGCCGCCATAGTAACGGCATTTGTTTTCGTGTAGTGCCGTGTGCAAAAAGCATGGTTCACCATGCT 
AGCTCACTCGGCCCGTAAAAGCCGTGTGCAAAGAGCGCCTTGAAATGTAACAACAAAAAGTCAAGCTTATTTATACCGCAATATTCATGGAAATG 
AAAGCGAAATTCGCATTATTTAATTATGAAAATTTATGTATAAAAACTGGTTATCATTTCCCCAAAGCGCAACGGAAGTATCAATAACTTTTAAA 
TCAACTGGTTTGGTAACTTCACTTCATTGACGAAAGAGAGTGAAAGAGCGTCAAAGAGAAACGGCGCATGCTCTTTTCAACGCATTACAAATGTT 
TTGCTTCTCTTTGCGTATTGCGTCGAAATTTATTGTTTTTTTTCCGATGACCTGCCTTTGTTTTGTTTGAAATTCTGATGAATAAAACGTAACAT 
TAGTACTTACACTTGAAATCTCCATAGGCCTGGGCCACCACGGCCAACAGGAGCAGGCTAGCAATTAGTTTCACCATCTTTGATCGTCGAACGTA 
GTCTTTACACAAAATCACAGAACAAAAATTTGTTTTTCACTTTAGTTTGATTTGCTTGAAATGCACAAATGAGAATGAGGCGACTCGAGTTGAGC 
TACAATGTTAAATAGGCGTTGAAATTGAAAATCTAGCTGCCTTTACACACACTGGCGCAGAAGGCGTCGCCTGCTTCAATTTGATGGGCTTGATT 
GGGATTATTTTCGACTTCTCGAATGCTTTCCAGTACCTTCTGCTGACAGCTTTAATCTGATTTTTCAGTGGAGCTAAGGCCTTTCTACGTTGTGT 
TTTAGCGAACTCGAAAGCGAACTGTCGCTCCGAACGGCGCACAAAACACTTTTAAGCGCGCGCGCTCTCGGTTTTTCTTCGTGTGTGTGTGAGTG 
ATGGGAGAGTGTGCACTGTTATCTTTTCAACATGTCGCCCGTTCTTTGTTAACCGTTTAAATGGCTGATAATTGAATATTCCCGATAATTGCCGC 
ACTCAATGTGATACACACGTCCTCAATATGGGTATACTTTGTATAAAGTCGGGTTTCCTAGTCAGATAATCCCCACTTACTAAGAAGTAAATTTA 
TCAAATGCAATATGGCCGCTCTTTTGAGATGCGCATCAGCTGTTCTTTGTTTTTGTTTTGGCCGTGGGAGAGCACCACGTCATGTGTTGCCAGGC 
GGTTTAATTATTGCTGACAATTTAATTCACATTTTGTCAAGTTGATTTGCACGTTCGAAGGGGGGATCAGTCACCCACTTTGATAACTGCTAATA 
TGGCGACGGCTCTTTTTTACGTGTCGCTTGCAGAGCTCCAAGAATGACACTTGATAAGAAGGATACATGGTGGGAAAACTTACAAATTACATAAT 
TTGTTTGTTGATTGGGGAAACCCTACATTATCTCCGAACCCAGCAATAATATGTATTGTAAATAGCAACAGTCTCAAGCTCGATGCTGCTGTGAA 
TTACCAATATATCAAAACTTAAAACATTATCTTAGGAGGTAGATGGTTTATCTGCAGTTTATTTTCAATTCTCATCGTTAGCTGATAAAACTCTC 
AGGAAACCACTTGAAATCCAAAGTGCAGCGTCCATATTTTGCACACGCTGACAACCCTGTCGTGAGTCACCTTCCAGCAGGCGCAGCAACAACAG 
CACTGGCAGCGGCCATATTTGAATGACACCGAAATCCCGCACTGAGCGCAAAACAACACATGCCCATGCACGCTCTCCAAAACCACACACACACA 
GCCATATCGATCGCATAGAAAGAGACGTACGCGACGCTATAAACAGCAGTTGATTTTCAGAGTTTCACCAAGAAGCACATAAAGCGATTGAATTA 
TTAATAAAAGCACAGGCACTGGAAGTGAGTTTTAATATTACTAACTAACTACTTTGTGAGTGTTTAGTTACGAAAACTAGAAAATAATATTCTGG 
AAACTGTGTTTTTAAACCGACCACCATATAAATTTAAAAACCAAAACATTCCATTGGTGGTGGTTTTGTTGTTAAAAAAGAGCACAGAAAATTCA 
AGAAAACGCTACGACGCAAAACGAAAAAACTCGCCCAAAATGAAAACAAAAGTGTC 
(SEQ ID NO: 1228) 
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Exon: 3141.. 2792 
Exon: 2619.. 2481 
Exon: 1613. .1336 
Exon: 1273.. 1001 

Start ATG: 2547 (Reverse strand: CAT) 
Transcript No. : CT408 64 

ATTTGATAAATTTACTTCTTAGTAAGTGGGGATTATCTGACTAGGAAACCCGACTTTATACAAAGTATACCCATATTGAGGACGTGTGTATCACA 

TTGAGTGCGGCAATTATCGGGAATATTCAATTATCAGCCATTTAAACGGTTAACAAAGAACGGGCGACATGTTGAAAAGATAACAGTGCACACTC 

TCCCATCACTCACACACACACGAAGAAAAACCGAGAGCGCGCGCGCTTAAAAGTGTTTTGTGCGCCGTTCGGAGCGACAGTTCGCTTTCGAGTTC 

GCTAAAACACMCGTAGAMGGCCTTAGCTCCACTGAAAAATCAGATTAAAGCTGTCAGCAGAAGCAAATCAAACTAAAGTGAAAAACAAATTTT 

TGTTCTGTGATTTTGTGTAAAGACTACGTTCGACGATCAAAGATGGTGAAACTAATTGCTAGCCTGCTCCTGTTGGCCGTGGTGGCCCAGGCCTA 

TGGAGATTTCAAGTGCTCCCTGGCTGTTCCTGAGATTACCAAGGACTGGGTGGACATGAAGGATGCCTGCATAAAGGGCATGCGCAATCAGATCC 

AGGAGGAGATCAACGCCTCCTACCAGTACTTGGCCATGGGCGCCTACTTCTCCCGCGACACCGTCAACCGCCCTGGATTCGCCGAGCACTTCTTC 

AAGGCCGCCAAGGAGGAACGTGAGCACGGATCCAAGCTGGTGGAGTACCTGTCCATGCGCGGTCAACTGACCGAGGGAGTCAGCGATCTGATCAA 

TGTGCCGACTGTGGCCAAGCAGGAGTGGACCGATGGTGCCGCCGCCCTGTCCGACGCCCTCGACCTGGAGATCAAGGTGACCAAGTCCATCCGCA 

AGCTGATCCAGACCTGCGAGAACAAGCCCTACAACCACTACCACCTGGTGGACTACCTGACCGGTGTCTATCTGGAGGAGCAGCTCCACGGACAG 

CGCGAGCTCGCCGGCAAGCTGACCACTCTCAAGAAGATGATGGACACCAACGGCGAACTGGGCGAGTTCCTGTTCGACAAGACCCTGTAA 

(SEQ ID NO: 1229) 

Start ATG: 423 (Reverse strand: CAT) 

MVKLIASLLLLAVVAQAYGDFKCSLAVPEITKDWVDMKDACIKGMRNQIQEEINASYQYLAMGAYFSRDTVNRPGFAEHFFKAAKEEREHGSKLV 
EYLSMRGQLTEGVSDLINVPTVAKQEWTDGAAALSDALDLEIKVTKSIRKLIQTCENKPYNHYHLVDYLTGVYLEEQLHGQRELAGKLTTLKKMM 
DTNGELGEFLFDKTL* 
(SEQ ID NO: 1230) 

Name: ferritin subunit 1 
Classification: li gand_b inding_or_ca r r i e r 
Gene Symbol: FerlHCH 
FlyBase ID: FBgn0015222 



Celera Sequence No. : 142000013384566 

TTGTAAGGCGTATCACTTGATTCCGCACTCTACTCAACTCTCCAAGGACCTCTGTAATCTGATAGGCTTACTGAACCAACAGAGAGTAGAAACAA 
AGTTTTGCTTTATTGTGATTTTACGAAATAACCTACAACTTCACCATTCCCAAAAACACAATTTCTTTTTCCTTAAACATTCCAGTTTTAAACAC 
CTTTTTTACAGTTTTTTCGTTGTAAGTTGGCCAAAATATGTCACAAGGCTGAAAAACGCACTTTTTTAGTCAGCTGGGGTCCTCAGCTAAAGACC 
CATATCAATATTTGCTTTAATTTGGAAAAATAATTTTTCGGCCAAATTTCGCATTTTTTGTAAGGGGTAACATCATAAAAAATCGAAAAAATTTT 
CAAACCAAAAATTTCTCTAAATGAGTTACCCGTCGTTTAGGAATATTTTTACAAGTTCAACGAGGTATTCCACTCCACAATTGGACGTGTCTTTC 
CCAAGATATAGCACTTTTTCTGTGAAATCCCCATCGAAAATTATCT^ACAAGAATAAAATCACCCTCGTTTTCCATCTGAAAAAACTCCCTCAAAA 
ACAAACGCATATACAGTTTATTTCGTTACATATATATTGTTATTTATTTAAGAACTTGCTCGTCTAATTGACAGAAAAATATAAAGATATAGGAT 
GTATTGAATGGGTTGTGCTGAGTGTCTTACTATGCGAATGAGCGATTCCACTTAAGCCTAGGCGAAATCTCGGCGATTGCATAATGTGGAGCCCC 
CTCCCGAGGTGTACGGAAATCACAAGTGGTTGATAAGAAAGAACAGACGTGGAATTTTGCTGTGCTGGTTTGAGGATCGCAGTCGAACGGATAAT 
TTATTGGCCAGTCGCCGACTTGGCTGGTGATGATTTAATAGCGGACCCTCGGGAAGGCTCTCTGGGCTGACAGATAACAGATAACTCGACTGGCT 
GATAATTTTGATGCGTGACTGGGCGCTGCGCCTTCTCTATCTCCACCTCCTTACAGGGTCTTGTCGAACAGGAACTCGCCCAGTTCGCCGTTGGT 
GTCCATCATCTTCTTGAGAGTGGTCAGCTTGCCGGCGAGCTCGCGCTGTCCGTGGAGCTGCTCCTCCAGATAGACACCGGTCAGGTAGTCCACCA 
GGTGGTAGTGGTTGTAGGGCTTGTTCTCGCAGGTCTGGATCAGCTTGCGGATGGACTTGGTCACCTTGATCTCCAGGTCGAGGGCGTCGGACAGG 
GCGGCGGCACCATCGGTCCACTCCTGCTTGGCCACAGTCTGTGGGGGATTCGCCAGGATTAGCTGGGCTTTTGGGTTTTCTAGGCGAGTCGTCGA 
CTTACCGGCACATTGATCAGATCGCTGACTCCCTCGGTCAGTTGACCGCGCATGGACAGGTACTCCACCAGCTTGGATCCGTGCTCACGTTCCTC 
CTTGGCGGCCTTGAAGAAGTGCTCGGCGAATCCAGGGCGGTTGACGGTGTCGCGGGAGAAGTAGGCGCCCATGGCCAAGTACTGGTAGGAGGCGT 
TGATCTCCTCCTGGATCTGATTGCGCATGCCCTTTATGCAGGCATCCTTCATGTCCACCCAGTCCTTGGTAATCTCAGGAACAGCCAGGGAGCCT 
GCAATGGGATAAGGTTGGGTTAAGTCGGGGGACCACGATAACAATATATGGCTAGTATTCAGCTTTAAAAAATTTATTTTTGTAATCAGAACTTA 
GGTAAATTTTAATATTTTAAAAACCTTTGCACTCTTGGTCTTGACCTTCATGGCTTAGGTTTTGTTTTTCTTTTAGTTGATGCTCTTCTCGCTCC 
CTTTGAAGCTCTCTCACGAAACTCTTGCTTTCTTGCTTGGCTGAGGGGTAGGGCGTTGTTGGTTATTGCATTTAAGAGGGCGTAATTGCATAAAC 
CTTTGAAATTGCGCTTGAAATGCCGCTATTCTAGGGCACAAAAACCGTGTGCAAAACTGTAATGGCCTACTCTCCCACTCTCTCTCTCTGTTTTT 
GTAACTGCATTGGTAACGTGCGCAGCTCTCGCCGCCATAGTAACGGCATTTGTTTTCGTGTAGTGCCGTGTGCAAAAAGCATGGTTCACCATGCT 
AGCTCACTCGGCCCGTAAAAGCCGTGTGCAAAGAGCGCCTTGAAATGTAACAACAAAAAGTCAAGCTTATTTATACCGCAATATTCATGGAAATG 
AAAGCGAAATTCGCATTATTTAATTATGAAAATTTATGTATAAAAACTGGTTATCATTTCCCCAAAGCGCAACGGAAGTATCAATAACTTTTAAA 
TCAACTGGTTTGGTZ^ACTTCACTTCATTGACGAAAGAGAGTGAAAGAGCGTCAAAGAGAAACGGCGCATGCTCTTTTCAACGCATTACAAATGTT 
TTGCTTCTCTTTGCGTATTGCGTCGAAATTTATTGTTTTTTTTCCGATGACCTGCCTTTGTTTTGTTTGAAATTCTGATGAATAAAACGTAACAT 
TAGTACTTACACTTGAAATCTCCATAGGCCTGGGCCACCACGGCCAACAGGAGCAGGCTAGCAATTAGTTTCACCATCTTTGATCGTCGAACGTA 
GTCTTTACACAAAATCACAGAACAAAAATTTGTTTTTCACTTTAGTTTGATTTGCTTGAAATGCACAAATGAGAATGAGGCGACTCGAGTTGAGC 
TACAATGTTAAATAGGCGTTGAAATTGAAAATCTAGCTGCCTTTACACACACTGGCGCAGAAGGCGTCGCCTGCTTCAATTTGATGGGCTTGATT 
GGGATTATTTTCGACTTCTCGAATGCTTTCCAGTACCTTCTGCTGACAGCTTTAATCTGATTTTTCAGTGGAGCTAAGGCCTTTCTACGTTGTGT 
TTTAGCGAACTCGAAAGCGAACTGTCGCTCCGAACGGCGCACAAAACACTTTTAAGCGCGCGCGCTCTCGGTTTTTCTTCGTGTGTGTGTGAGTG 
ATGGGAGAGTGTGCACTGTTATCTTTTCAACATGTCGCCCGTTCTTTGTTAACCGTTTAAATGGCTGATAATTGAATATTCCCGATAATTGCCGC 
ACTCAATGTGATACACACGTCCTCAATATGGGTATACTTTGTATAAAGTCGGGTTTCCTAGTCAGATAATCCCCACTTACTAAGAAGTAAATTTA 
TCAAATGCAATATGGCCGCTCTTTTGAGATGCGCATCAGCTGTTCTTTGTTTTTGTTTTGGCCGTGGGAGAGCACCACGTCATGTGTTGCCAGGC 
GGTTTAATTATTGCTGACAATTTAATTCACATTTTGTCAAGTTGATTTGCACGTTCGAAGGGGGGATCAGTCACCCACTTTGATAACTGCTAATA 
TGGCGACGGCTCTTTTTTACGTGTCGCTTGCAGAGCTCCAAGAATGACACTTGATAAGAAGGATACATGGTGGGAAAACTTACAAATTACATAAT 
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TTGTTTGTTGATTGGGGAAACCCTACATTATCTCCGAACCCAGCAATAATATGTATTGTAAATAGCAACAGTCTCAAGCTCGATGCTGCTGTGAA 

TTACCAATATATCAAAACTTAAAACATTATCTTAGGAGGTAGATGGTTTATCTGCAGTTTATTTTCAATTCTCATCGTTAGCTGATAAAACTCTC 

AGGAAACCACTTGAAATCCAAAGTGCAGCGTCCATATTTTGCACACGCTGACAACCCTGTCGTGAGTCACCTTCCAGCAGGCGCAGCAACAACAG 

CACTGGCAGCGGCCATATTTGAATGACACCGAAATCCCGCACTGAGCGCAAAACAACACATGCCCATGCACGCTCTCCAAAACCACACACACACA 

GCCATATCGATCGCATAGAAAGAGACGTACGCGACGCTATAAACAGCAGTTGATTTTCAGAGTTTCACCAAGAAGCACATAAAGCGATTGAATTA 

TTAATAAAAGCACAGGCACTGGAAGTGAGTTTTAATATTACTAACTAACTACTTTGTGAGTGTTTAGTTACGAAAACTAGAAAATAATATTCTGG 

AAACTGTGTTTTTAAACCGACCACCATATAAATTTAAAAACCAAAACATTCCATTGGTGGTGGTTTTGTTGTTAAAAAAGAGCACAGAAAATTCA 

AG AAAACG CT ACG ACGC AAAAC G AAAAAAC T C GC CC AAAAT GAAAAC AAAAGT GT C 

(SEQ ID NO: 1231) 

Exon: 3141.. 2792 
Exon: 2567. .2481 
Exon; 1613.. 1336 
Exon: 1273.. 1001 

Start ATG: 2547 (Reverse strand: CAT) 
Transcript No. : CT40866 

ATTTGATAAATTTACTTCTTAGTAAGTGGGGATTATCTGACTAGGAAACCCGACTTTATACAAAGTATACCCATATTGAGGACGTGTGTATCACA 
TTGAGTGCGGCAATTATCGGGAATATTCAATTATCAGCCATTTAAACGGTTAACAAAGAACGGGCGACATGTTGAAAAGATAACAGTGCACACTC 
TCCCATCACTCACACACACACGAAGAAAAACCGAGAGCGCGCGCGCTTAAAAGTGTTTTGTGCGCCGTTCGGAGCGACAGTTCGCTTTCGAGTTC 
GCTAAAACACAACGTAGAAAGGCCTTAGCTCCACTGAAAAATCAGATTAAAGCTGTCAGCAGAAGACTACGTTCGACGATCAAAGATGGTGAAAC 
TAATTGCTAGCCTGCTCCTGTTGGCCGTGGTGGCCCAGGCCTATGGAGATTTCAAGTGCTCCCTGGCTGTTCCTGAGATTACCAAGGACTGGGTG 
GACATGAAGGATGCCTGCATAAAGGGCATGCGCAATCAGATCCAGGAGGAGATCAACGCCTCCTACCAGTACTTGGCCATGGGCGCCTACTTCTC 
CCGCGACACCGTCAACCGCCCTGGATTCGCCGAGCACTTCTTCAAGGCCGCCAAGGAGGAACGTGAGCACGGATCCAAGCTGGTGGAGTACCTGT 
CCATGCGCGGTCT^ACTGACCGAGGGAGTCAGCGATCTGATCAATGTGCCGACTGTGGCCAAGCAGGAGTGGACCGATGGTGCCGCCGCCCTGTCC 
GACGCCCTCGACCTGGAGATCAAGGTGACCAAGTCCATCCGCAAGCTGATCCAGACCTGCGAGAACAAGCCCTACAACCACTACCACCTGGTGGA 
CTACCTGACCGGTGTCTATCTGGAGGAGCAGCTCCACGGACAGCGCGAGCTCGCCGGCAAGCTGACCACTCTCAAGAAGATGATGGACACCAACG 
GCGAACTGGGCGAGTTCCTGTTCGACAAGACCCTGTAA 
(SEQ ID NO: 1232) 

Start ATG: 371 (Reverse strand; CAT) 

MVKLIASLLLLAVVAQAYGDFKCSLAVPEITKDWVDMKDACIKGMRNQIQEEINASYQYLAMGAYFSRDTVNRPGFAEHFFKAAKEEREHGSKLV 
EYLSMRGQLTEGVSDLINVPTVAKQEWTDGAAALSDALDLEIKVTKSIRKLIQTCENKPYNHYHLVDYLTGVYLEEQLHGQRELAGKLTTLKKMM 
DTNGELGEFLFDKTL* 
(SEQ ID NO: 1233) 

Name: ferritin subunit 1 

Classification: ligand_binding_or_carrier 
Gene Symbol: FerlHCH 
FlyBase ID: FBgn0015222 



Celera Sequence No. : 142000013383837 

CGCGCATCCAGCGGGCCACCACACTCACCTCGTCCATGTCCCTGCCGGCGGAGTGCCTGGTTCTCTCCGTATCCATGCCCTCGCTCAACTCGGAG 
CCCAAGCGTCCCAAGAGCGCGGGACCCAAGAAGAAGCCCGGCCAGGCCGTCCACTCGAAGCCATTCCGCCTATATTGAGCCTGTCGAGGAAGGAG 
CTGCTCCACAAGTCGCACGTGACGTCACCCAATCGAAGCACTCAGAGATTTGTGCACAAGTTGATATCGAGGCACGCACTTCTAAAACATGCATA 
TGCAGAATACATTTTATCAAACCGTTGATTGTTTAGAACCGAAATGGAAACGAAACTTGGCATTTACAACATAACAATAAACGACTAAATGAGAG 
GCAAAGTTAAGCACAAAATACAGCTGTGGTTTGTTTGTTTGTTTCTTTATATAATTACAGTATATATTTTCGTAAGTATTAAATCTGTTTCACGG 
CTACTACATCGCTGGAGCCAAACGGCCTTCGTTTACATTTCATTTACATAAGACTTGCGTTTGATTTTCTATATATAATTAGTACAAAAATTGTC 
CTGGGATTTCGTTACGTCTCATTGCGTCTTTATCATCGTGCTCATTGTGTGTTTGGAAAACTGTGTGCTTTTCTTGGTTTCTTTAGGTTTTGTAT 
TCGTTTTTGATTCTTGTTCCTTTCGATTATTCGAGTCGCGTCACATTTAAATGCATTTAGTTGCCTTTTTCCATTTACATCATCCTTTTGCTCTA 
CTTAAGCCATAAAAGTTGTTTATACAATTCGTTTTTAAAGTTAGGGTAATAAATAGGTCATTATAAAATAATAAACAACAGTTGTTACATTTTCA 
AAAGACATCGTCAAATGAGCTTCAAGTGATTATGATTATGATTCGAGGGTCGAGTTAGAATTAAAAAAATGCGCATGCTTGAATATCATAAAAAG 
TGGCTTCTCTAAGAGCTCGTCGTCAAGTGAATTGGCCTGGTTTCAGATTTGGTTAGCTGCGGATTGGCTGCCTTAGCTCATGGCTATATATAACT 
ATAATACATTTAATGGGAAACTAGGAAATGCATAGATGCTCGTGATATGCGAAACCAATGTTTGCTGCGGTTGTTATAATTTTCTCTTTCTGCCA 
GATACTATTCGTACAATACATTCGATTCAATTGACTAGATTTCTACATATGTAGTGCATTAGATTTATGATTTATCTGTGGTACCGATTTTGTTA 
ACAATGTTTTTTTTAGTTGTTGCGGGAGGGAATCGTGGTTTGCCAAATGTGTGACAGCAACGATCCGCAGAAAAAAACAGCCAAACAAAGAGAAA 
TTTCATACGAATTTTGATGGCAGTACATTTCGATTCACAGAATTTCGTACAATATTTATCGTTTGTTTAGAGTTTCTTTCCGGGTTCGGTGCCTT 
TTACGAGAGTACGGAGAATTTTATAAATAAATACATGTAATTGATTTGTCGATGAGGGCGCTGCGTGAGTGTCAATTTGTTGTGGCCATTGTGCC 
GGTGTACGTGTCGCTTCCGAGGAATCTCTATACTTCGCCAAAGCCAATAACATTCTACCTTATTTACATTATTAAACACTTTCGCTGGCGTCGCT 
TTAGCGGCTCTGGTCTGAGCACCGCGCGCACCGCTTCCTCGAACACCTGAATTTGCAATCGGAGTGGGAAGGGAAATTTAATGATGAATAAGGGG 
TAGGTGGAGGGTGTGCCGGGCACCGATGACTTACCGGCTTGAGACCGCGCTGCGTCAAGGCGGAGCACTCCATGTATTTCACAGCGCGTATCTTG 
TTTGCCAGCTTCTGGCCCTGCTCGCGCTTCAGCGGCGTCAGTCCCTGCTCTGCCAGGCCGCTGAGTGTCTCTCGATCTTCGCGCAAATCGATTTT 
GGTGCCTGGAATACGCCCAAAGATATATGTATGATTCTCGTTTTCTGATTGCAGTTTTCATGTCGCTTGGCAACTTACCAACTAGAATGATGGGC 
GCATCGGGACAGTGGTGCTTTATCTCCGGATACCATTTCGAGGTGACGTTCTCAAAGGACGAGGGACTCGCCACGCTGTAGCATATCAGGAAAAC 
GTCTGTCTGCGGGTAGGATAGCGGTCTCAGGCGGTCGTAGTCCTCCTGACCCGCCGTATCCCACAGTCCCAGCGAGACCTGTATTGTGTCCACTT 
GCATGGGCGCCGAGTAGTTGTCGAAGCTGTGAAAGATATCATAGTAAAAGTTAGATATCGATCTGCGGTTTGCGGCTATCGAGGCACAAACTTAC 
ACTGTGGGCACATATTCGCCGGGAAAGCAGTCTGTCGTGTAGGAGATTAGCATGCAGGTCTTTCCGACGGTGCCGTCACCAACGACTACACACTT 
TATGGGCCTTCCGGTTGACATCGTGAGTGTGCTGAGTGTGCCTGTGCCGATAGCTGTGTACTGTCTGTGTGCTGTGGGCAAAATAAGATGGAAGA 
TGAATGTCTTCGCCAGCTTAATTCAATTAATTAAGTTCTATTCGAGGCAATAAACCCGCCAGTGCCAACTGACCGCAAATTATTTCACAGTCGTA 
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CTTCCTTAATAAAAATTCTTTAAAATAGAAACAAATGCTGTAGAAATACATATATGACACAATAAATTATCCTATTCG 

AAATAAATTAACTACCACATCGTTTTTGGCGCGCAACTACATATATATAAGATAAACAATTGCTTAAATGGAATTTGGCAACTTTTAAAATTTGA 
ATATCTAGACAAATTGAATCAGGATACTTGCTTCCTGAGTCGCAAACTATTTATATTGTATTAATGGAAGTTATACTTATTGGAGTATGCTACAT 
CGTAAAGTGGGAATGTCCAAGCTGTGGAAGTACTACTGTACAGAGTATAATGGCGAAGGTCTACGGTATATCTCTGGGATGATCAAAGGCAGGGC 
TCTCGAAAGGTGTGTCCCGGCCGCAAAAAGAAAGCAACCGCAGCGGCGGAGGCCCATAAAGCCTAATAAAACGAACGAGGGTCTCATTTCGGTTG 
CGGTGGCACTTTGTTTTCGCAAATAAAATTTTCAAAAATAGATCATGATGGGCGACACAGACGGGGAAAAACAAAAAGCATAAATGGGAGCAGCG 
GACCAAATGGCGACCGCCCCTCCAGTGGCGCCCCTTCGGCGGTGGGCGGGGTCGCCCGAGTGTTGTGGTGACGCGGAAGCCACCGGCAGACGCAC 
CGCATCGAGAAAAAGAAACACAAAATGGGAAAATGAACCGCCCGAAAATAGACACAATTAAGTTCTCATAACCGATTCAATTAAAGTTCAACACA 
CAGAGGCGGCACATATTGCGCCACATGCAACGGGGCTGGGCGTGGCCGCGGGGGCGGGTGCGGTAGCAATGAAAATCTTTTGTTGTGGCACAACA 
CCGAATGGGAATACAATATTATTTTACCGTTTTCACCACCACCTATCCTTTTCCTTTGTCTGTCGGCAGGCGGCGACTGTTTTGTACTTGGCTTA 
TATCACTGTGCGTTTTCCGGCTACCGATATGCGTTCGAGGCGATGGTCCATGCGGCGCATTTAAATAATTGATTCGCACTATTTCGCAGCGGCTT 
TTACTTATAAATATCTGTTTGAGGGACAAAACGCAGCGTCCCCGAATAATTTTCGTGTTTTTCCTTTTTGTTTCCACTCCACACAAACACACACC 
CACCTTTTTGACAGCGCAGCGACGCTCTCTCTCTCGCCGTTGACAGTGGTGCCAACTTAGGGCCAGAAAATAGCTGACGGTAAATATAAAAAAAT 
C ACT T AAAT GT AT GGT AC C AAT AACT AAACTT AAT AAGC AT A TC A T CC AGT A TAAAGTC GTTTTTCTTTTT CC AAAACT AAAATAAAC T T TT T T T 
TACAATTGGCACTTAAAAATCAGCTAGACTTGGCGTGAAAAAAGAGAGCGGGTAGTCCAGCTCCGCTCTCTCGAAACTGACAAGCAGTGCTGCAA 
AAGGTGCAGGGCTCAGGCGGATTTGGCATTATACAATACTAATAGGCATGGTTGTGGGAGAGGGGTTGCAACCAATTTTCACATTGTAAACAAAA 
TTATTTATTTAATTCTATTTCGAGCAGAATATTGTAATAATTACCTGTTAATTAAAAAATTATCATTTTAAAATTATCTTATGTAAGTTCAATAC 
CCTTTAATTGCTATCGACTACGCCCATGCTGCCTATGTCCACTGTTGTTGTTTCGAGCGTCGTAATGAGCAGATTTGTTTATAAACTTTTACAGA 
GTGCAAACCAAACTAGTTCGGGCTGCAGACGCATTTCTTTGGCTACTGTAAGAAATGCGGAAAGCGAATCTCAAGAGGGTGCCCCTGCCCGCACA 
GTTCTGGTGGAGAAGGACTCGCACATAACACTTATTGGCCTAAATCGCGAGCAGCAGCGTAATTCCATTGATGCCAACACCGCGGAGCAGCTGAC 
CGAAGCCATTAGCCAATTCGAAGCGGATGATACTTCTCCAGTGGGTGTGCTCTACGGCATAGGTGGCTCCTTTTGTGCAGGCTATGATCTGGAGG 
AGCTAGAGGCGGAGGCGCAACGCGGCAGCCTCAACTTTCTGTTGCGTCACGAGGGCTCCGTCGGACCAACTAGGCGGCATCTCCGCAAGCCACTT 
GTTTGCGGCATCAGCGGTTTCTGTGTG 
(SEQ ID NO: 1234) 



Exon: 3682.. 3448 
Exon: 2446. .2281 
Exon: 2211.. 1979 
Exon: 1905. .1745 
Exon: 1661.. 1001 

Start ATG: 2396 (Reverse strand: CAT) 



Transcript No. : CT40870 

AACAAAAAGGAAAAACACGAAAATTATTCGGGGACGCTGCGTTTTGTCCCTCAAACAGATATTTATAAGTAAAAGCCGCTGCGAAATAGTGCGAA 
TCAATTATTTAAATGCGCCGCATGGACCATCGCCTCGAACGCATATCGGTAGCCGGAAAACGCACAGTGATATAAGCCAAGTACAAAACAGTCGC 
CGCCTGCCGACAGACAAAGGAAAAGGATAGGTGGTGGTGAAAACGCACACAGACAGTACACAGCTATCGGCACAGGCACACTCAGCACACTCACG 
ATGTCAACCGGAAGGCCCATAAAGTGTGTAGTCGTTGGTGACGGCACCGTCGGAAAGACCTGCATGCTAATCTCCTACACGACAGACTGCTTTCC 
CGGCGAATATGTGCCCACAGTCTTCGACAACTACTCGGCGCCCATGCAAGTGGACACAATACAGGTCTCGCTGGGACTGTGGGATACGGCGGGTC 
AGGAGGACTACGACCGCCTGAGACCGCTATCCTACCCGCAGACAGACGTTTTCCTGATATGCTACAGCGTGGCGAGTCCCTCGTCCTTTGAGAAC 
GTCACCTCGAAATGGTATCCGGAGATAAAGCACCACTGTCCCGATGCGCCCATCATTCTAGTTGGCACCAAAATCGATTTGCGCGAAGATCGAGA 
GACACTCAGCGGCCTGGCAGAGCAGGGACTGACGCCGCTGAAGCGCGAGCAGGGCCAGAAGCTGGCAAACAAGATACGCGCTGTGAAATACATGG 
AGTGCTCCGCCTTGACGCAGCGCGGTCTCAAGCCGGTGTTCGAGGAAGCGGTGCGCGCGGTGCTCAGACCAGAGCCGCTAAAGCGACGCCAGCGA 
AAGTGTTTAATAATGTAAATAAGGTAGAATGTTATTGGCTTTGGCGAAGTATAGAGATTCCTCGGAAGCGACACGTACACCGGCACAATGGCCAC 
AACAAATTGACACTCACGCAGCGCCCTCATCGACAAATCAATTACATGTATTTATTTATAAAATTCTCCGTACTCTCGTAAAAGGCACCGAACCC 
GGAAAGAAACTCTAAACAAACGATAAATATTGTACGAAATTCTGTGAATCGAAATGTACTGCCATCAAAATTCGTATGAAATTTCTCTTTGTTTG 
GCTGTTTTTTTCTGCGGATCGTTGCTGTCACACATTTGGCAAACCACGATTCCCTCCCGCAACAACTAAAAAAAACATTGTTAACAAAATCGGTA 
CCACAGATAAATCATAAATCTAATGCACTACATATGTAGAAATCTAGTCAATTGAATCGAATGTATTGTACGAATAGTATCTGGCAGAAAGAGAA 
AATTATAACAACCGCAGCAAACATTGGTTTCGCATATCACGAGCATCTATGCATTTCCTAGTTTCCCATTAAATGTATTATAGTTATATATAGCC 
ATGAGCTAAGGCAGCCAATCCGCAGCTAACC 
(SEQ ID NO: 1235) 



Start ATG: 28 6 (Reverse strand: CAT) 



MSTGRPIKCVVVGDGTVGKTCMLISYTTDCFPGEYVPTVFDNYSAPMQVDTIQVSLGLWDTAGQEDYDRLRPLSYPQTDVFLICYSVASPSSFEN 
VTSKWYPEIKHHCPDAPIILVGTKIDLREDRETLSGLAEQGLTPLKREQGQKLANKIRAVKYMECSALTQRGLKPVFEEAVRAVLRPEPLKRRQR 
KCLIM* 

(SEQ ID NO: 1236) 



Classification: enzyme 



Celera Sequence No. : 142000013384543 

CATTTCCAGTACTGCAGATGACCGCGCTTCTTGTTTCGGAAGAAGCGGAAGCCATTGATCACCAGCAACTTGCCGCCGTTCTTCTTGGAGTCCGC 
GAAGTCAATGTGAAATGCTGTGAATGAAATAGAAAGTCTGGATTAGTTTGGGAATTCGTGAACAAGTGCTACGTCAAAAGCGTAGGTCTATTCTT 
ATGGAAACAGATGTAGTTCGTCGTAAATTTCGTTTAAAATTAAAAGCAGTCATTTATTTAAAAGTGGATTGTTAGATGTCTTATGGTTTTGACAA 
TGTGCACTTAAATGTAAGCCTATGACGCATCCAGAGATAGGGCCTTCTCCCTTTTTATGAGCACAGGCAGGTTTACGTTATTGCTAGGGAATTTC 
CGGCTGTGATCATGCTCGTAGGTGCATTTGATGATTCGCTCCTGTGAGTTTTTGTCCACCGCGGTCACGACACGAGCCCTGCAGACCGTCACATC 
CTGTTGGTGGGACAAAGGAATACGTCGTTACTTTAGTTTATAACAAGGAGTCTCTGTGCTTCTTGAAATTCTCTTTTTTAGAAATCTATTAATTA 
TAAACTCTTGTTTCTTTAACTATGCATATTAAACGTATACATATATTTAATTAAAAAATCAAGGAATATACTATAATATTCTTAGCACAACACGA 
AAAATGATGAGCAGACTTTATCTTAATGATTTCTCTTATACAAAACCTGAATTCTATGATTTTCAGCAAAATCTTGTAGTTTAACTTACCTTTTT 
CGAGCATATCCAATAAGTACGAGAGGCGGAACATCGGTTCTTAACGAACTTCTCGTTTTGGAAGAGCAGCACGGTTCTTCCCCGCTGTCCAGTGC 
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CGAAGACAGCCAGTTCTAAAAGGATTAAGCAGATAATTAGAATAACCATAAACTTGGGAGTGTCTTATCACTTGTGTATAACTACTAAACTAGGT 
TTTAAACGAAGAATTGTAAGTTTATTGGTAATACAGCAATGTATACACGTCTACAAATGGTTGTGCACGCCTGAAATGGCTACATATGTGGGTTT 
GTTTTTGTGGGTAATCATGTAGACGTTGCAGCTTAGTCGCTTTTTGGCACACCCCCAATATATGATATCGTTGTAGTTGGCTCGTCGATCATAAC 
GATATCCGTCGTAGACCAGAACATCGCCGCTCCTGGTCCGATCATACCGGATGCGATCCCAGTCGTCCAGGGTGCGGACGCACTTCTGATCGGGC 
GCCAGTCCACAGCGAACATTCTGTGGCCGTTTACCCTTGGTTTTGGGCAAGAAGCTCTCGTTACCAAAGTCAGCCAGGTTGCTAGGATTGAGTTC 
ACCTAAATCTGAAAAATAAAACAAATTAGTGTGAATGCAAAATGTTATACATATTTAAAAAAATTAAAAGAAAAACAACACCGCATTGCGGAATA 
AAGATTTATAAAAAAAATATATATTTAGGTACAGTAAAATAAAACAAGAGGTTGGTCTATTCATAAACATAAGTAAAGTTTTAAACAAGTCAATT 
GCAACACACTCACCTTACCTTAATAAATACTATCTTCTGCAAAATAATCCTAAGCCTGTTTATATATCAAGCTTAAGATCTGACAAATTAAAAGG 
TGTTATTGCGCAAAACTAGAAATGGATTTCGGTTCAATTTCGTTTACTACATCATTCTGTACAAAAAGCTGCCTAATCTATACAGCTATGTATGT 
GTCCCGATTCAGACTTCTGAATGCTCTTTGATCTATGGATGATTATGCACGCCACTGACTGCGTAGCGGGATTGGCCATTCGGCAAGATGTGAGT 
GATGACGCGGGAGCGGCATTGCTTGCGCCACTTGCTGCACTTCCAGTAGACCTTGTCCTTATAGGCGGCATTCCGCAGGTAACGATGCTCCCCAC 
AGTGCAGAACGACATTCCCGGCCGGAGTGCGAACGTAGTGCACGCCCTCCATGGATTTGGACAAACGAACGCGCTTCATCAGCTGGTGTTGATCG 
GGTTTTACGATGGGATCCGTTAACACATCAAATGTCACTAAAACATAGAAAAACAGGCAGAATTAGTTAAAGGCAACGCAATCATTCACACACAT 
GCACATACAGTTAGTTTCTTTAAATCATATAGGACATTTCTGCTTAAATAAACAAGACAAAGAGTTTATTTTAGATTCACATATAAATAGTCTAC 

T T C T AGT T C GC AAGC T AT GGC T AG ATT T T G AGT C T C G AT TC AT T GC AG GT T GT T AAG GT C CT C GAT GC T T AAG T T G AGC GT CC T T AGC T C C AGT G 
TTAAATCCTCGTCACCCTCCTCCTTTATGAAGCCCGTGGATACAGACTTTTCCGACTTGCTTCGCGCTTTCCCCACAACGTTTGTTACTGAAAGC 
CGCTTTTTTCGCTCCGTGTTCAGGCAGGTGTGATGGCACGATTCCAGTACTATTTTGTGCACACCTTCAACTACATTGGTGACCACCCGAGCATT 

GC AT TTCGTGCTTCCCT GG AAG AAAGAT AGG AAC T AGC T AAT T T C T GT C T G AAAC T GT C AC AAG AGT C AT AAT T T C AT AG GTC GC T T AGT T T TC T 
TTGGGGTTAATTAAATTTTATCTCACCTTTTTGGAGCATATCCAGTAGGTCTTCAGGTTACTGCGACGGTTTCGCACGTAGTTGTGGCCATTAAA 
GGCCAGCAGTTTGCAGCCGCGCTGTCCCCGAATAAAAGTGAGTAGGCTGTGCTTGATGGCGCCCTTCATGTCATCCAGATCATCGCATACTGTAA 
GAAACAAATTAGCTTTGAGTTAGCTTAAGTGTAAAGTCTACTACTGCAAAAACTGTTAAAGTGTAAATTTTACAAAATACGGTTTTTCGATTTTC 
AAGATTTGTATTCTACAATAAATAAGCTATAAATTGATTCTATACAATGATCCAAAGTCGGCCGGTTTTAATATGATTCTTTAGTAGGGCGGATG 
GTTATGTTGTCCGCATTTGATAGTCACATCGTGGTCTTGCGCGGTGACTACACGGGCCTTGCATTTGTGAACACCCGCTCGAGCGCAAGACCAAT 
AAGTTTTCAATCCCGAGGTCTTGTTTCGAGTGAAGGAGAACTCTCCGATCTTAAGTTTTCGGCTCCCTCGAAAACCGGTGATGTACGAGTAGTAC 
GCAACAGAACCTGTGAATAGAAAATATACACACTTGGTTAGTACAAACTGGTGGAGGAGTGCGACGCAGATCTAAAGCTAGAGGTCAAAGAGACG 
CATGAAGAGCGAGAAACTTAGCTAACTTGATTTTTATTAAGTCAGACGAGTGTTGAGGTGATGCAATATTTTACACTTGGACTTAAGACTAGTGC 
GCTATGTCGTTGTTGCCGGCGCTGAAGTGGCGGCCGTTTCCAATGGGTCCTGACGACGCTCCGCCCGGGCTGCCGCCTCGGCTTCCGCCCGCTGT 
CGCTTCAGGGAGCCGTACTTTCGCGCCTTAGTGATAACTTCATGGTTGTGCTCGCTGTTCAGGATGGTGAGTGTATCCAATTTGGTCTTGAGCCG 
CGCACGACATCGCAACACTGCGAACTAAAAGTAGAACCACAATAAATACATTTTTGGTGTAGATTTGGCTTAAACTGTGAAAGGAATGCAATTAA 

C T T T AC G AAAT AAC T AACAATT AAT T AC AG C T GT C T T C AAC T C AGAAAAGT AGGC T C T ACT T AT C TTGCTCTC GAC T AAG AT T AAGT AGT AAC T A 

GATTAGTTATATTTTCTCAATATTTTAAGGAGAATTTAAAACAAAAACGTTTTTTCGAATGTATAGTTTTATCCAATTACCTGGCTGCAGCGCCA 

ATAGGTGACGTCGTCCCGGCAGATGTGACGATTGAAGGTGTGTTGCTTGAGCATGATCTGCTTGGTGCCGTACTTGCTGACGATGAAGAAGGGAG 

AGTTGCTGTTGACCACTCCGGCGGACGAGGCGTTGGCCAGCTGCATAGTGGCCTCGCCCACATCCACGCTGATCTGGTAGTCGAATTTGGCTACA 

ATAAGAAGAGTTTTCCTCGTTAGTTGTGCGTGATGATTGGATTAGGGGGAATTATTAGTTCAGAAAAGGAAATAAAACACTTGGGATTGCTTTAA 

TGGTAAAATGGTATTCGGTATATTGTACATTGCGGATTCGGCCTATGACTACATACACATCCTACATCCTAATAGCACTGCAGACAGTGCTGGGC 

GTGCTCATGATGCCTGGTGAGTCGAGGTCGTCGGCTCAGCTCCACGAGGACGTAGTAGGTCGGGCAGTTCCTGGTCAAGCAATGGAAGTAGGTGC 

GGTACTGCATGGCGCGGGTTTTTTTGTAAGACTTCCCGTTGACCACCAGGAACTCGCCCTGAGCGGTGCGCTTCAAGAGGACGCCCTTGTTCTGG 

TCTTTTGCTGAAGAGGATGAAGACGAAGATGAAGATGGTATGATGATGAAATGACTGCTGCCACATGAGTTTAATGAGCTTATGGTAGGTAATGC 

ATGCAACTGGGTATAATCTTTTGGACAAATGACATTCTTTCGGGTTTATGTAATCTCATTTAATATTGCGGATAAAAAGCATCCTTGGGCATTAG 

TTTAGCGGGCACCTTGCCACCCAAAAAGCGCGGCGGATGATTGTGGGCATTCTTCCCCAGCTTGATCCCTCCGTCGAACTTGGTGATGGCCCTCG 

CGGAGCACTTGTTCTCGTTCAAACGCTTGCCATTTCCGTCGGAACAGATCCAATTGACCGTGGCCCGGAAACTGGCCTCCTTCTTGTACATGTAT 

CCATTGAATACCAGGTTAAAACCCTGCTTCTGGTTTTTCAGGAAGAAGTACTCATGCTTATCAATCCACTGGCCACGCGGAATATCCCCGATGAA 

CTGACCTGTAATGGAGGATCCAAGGAAACTGACTTTATTACCCGGGTGGTTTGAAAATAAAAAGGTATAGCCCAGAGAAAAACACATGGAAGTTT 

GAGGTGCTGAGTAAACTATTTATTGCAGGCAATTCCCGATACACTCTTACTATTTATAGCCGCTTATTTTTATCATTTGTAGAAGGCTGTAGCTA 

GTTTGCCTTTAACTGGCTACAAGTCGATTGATAGTCTAATATTCTAGTGTTTGGCTAGACATCCCTTTTGTGGTCAAGAAGGGTTGATTGGTAGC 

AAGGTGGACTCACCTGGGGCATAAAGCACTTTCCACATTAATTCAACAACTACCAGAGCAGAAAAATCACTAAATATCACTAAAAACACTAGTTC 

ACATTAAATTCAAATAGAACATCATCCTTTTTGGTACTTTTTTGGAGGGATTCCTTTTTGAGACACCAACAACAACAAAGATGACAATCGGTGCA 

CACACACAGACGCACATACATACGCGCAGGGCCACACGCACTTACTCGCTGCATTTTCGCAGGATAATTCACATATTGTAAACTTCGGACATTAA 

TATGCATGCTGCAGTTAGCCACATTTAAGCCACAACACCTTTTGTTAACTATAGACTTGCAATTTGTTTGACAAAAAACAAAAACTCAGCGAACA 

CAACACGGCGAAAAAATAACAACTACAACAAAAATGGCGTGCCAACAGCAGCACAAGCAGTGCTGTTAGCGAAGCTATCGGTTATCGGCTGTTAA 

CTGTGTTTGAATCGTAACGTATTTTATTTTTACATATCTATATTTTACTGATATTCTTGCTATATTTTTTTTATTTTCAGAATTCGCATTGTCTT 

ATATCTGAAAAATTCCATAAATGTAAGCTCATAGCTCAACAATCGATGGTGTATCAAGGCAAGAAGAAGCACTATTACTATGAGAACATGTATTC 

TAATATTACAGAAGCGATCCCTTACACATTTTTTCCTGCTAATCCCTTCGACTGATAAATAAATATAAATGATTTTGCAAAATAACATATTTAGC 

CAAATAGAAAAGCACTAATCGATAATGTTAGGTGCCAAACATCGATAACATAGTGAGTGCCAGCTCTAAAGTACAGTGCTCTTCTTCTTTAAACC 

GTGTTTTTCGTGCATTAAATATTTTATTTTATATATTTGTACATTACTCCCTGCTTGTAAACCAATAAAATACACAAGAACCAGCAAAAGACCTT 

CCACAGTTGTTTTATTTTAATTTTTTCGCCCGAAAGGCCCAATCTTACCCAGATGTTCTAACCTGTTGCGTTTGATTCTCCAATTGACAGCCGCC 

ACATCCGCATCTGCAACTAAGATTCCACCACGTAAACGCGGCAGACCGAAAACAAAGGTTGAGGATCAGACGCCCAAGCCGAAGCTCCTGGAAAA 

ACTACAGGCGGCTACCCTGAACGAGGAGGCCAGCGAGCCTGCGGTGTACGCCTCCACCACGAAGGGTGAGTCCGAATCCTGGCTAGAAGCCAAAA 

TCCTGATCCATGATAACCAAGCTAATTCACGCTCCATTCGCAGGCGGTGTCAAGCTAATCTTCAATGGTCACCTGTTCAAGTTCTCGTTCCGCAA 

GGCGGACTATTCCGTGTTTCAGTGCTGCTACCGTGAGCACGGCGAGGAGTGCAAGGTGCGGGTGGTATGCGACCAGAAAAGAGTCTTTCCTTACG 

AGGGCGAGCACGTGCACTTCATGCAGGCCAGCGACAAAAGCTGCCTACCCTCTCAATTCATGCCTGGCGAATCGGGAGTCATATCCTCGCTGTCG 

C C C AGC AAGG AG C T G C T GAT G AAG AAT AC C AC C AAGC T C G AGG AAGC C GAC G AC AAGG AG G A T G AAG AT T T C G AGG AGT T T GAG ATT C AG GAG AT 

C G A C G A G AT T G AAT T GG AC GAAC C G GAG AAG ACG C CC GC CAAGG AGG AAG AAGT G G AC C C C A AT GAT T T C C G T G AAAAAAT C AAG C G ACG AC T GC 

AAAAAGCACTGCAGAACAAGAAGAAGTGACACGGAATATGCCTTAGATCTTCAAACTACACCTTTTACTATATAAACTTAACTAAGATCGAGAAG 

GCTCAAGAAGGCCCAACAGTACAAGATCTGAGCTGCCTAAGGGCTTCACATTATACTTTCTATATCCATTTGGGTCTTAAATAGCCCTAATAATT 

CT AAAT AATT AAGT TTTTCGT AAT AAAACGATGTCAATAAATCGGACCTTAAACCATTTC ACT AGCT AGT AAGATCTAACTT AAT ACGTAGCTGT 

ACTCAAAACCCCATTTAAATGATAAATAACATAAGCCCTTTCATGTCTTTCAGTGACCCAACACGTCCGCAACTGTGGTCCTCAGATGTTCCTGA 

TCAGCCGCAAGGGCGGCACTCTGTTGACCATCAACAACTTTGTGTACCGCTCCAATCTTAAGTTCTTCGGCAAGAGCAATAATATCCTGTACTGG 

GAGTGCGTCCAAAACCGATCGGTTAAGTGCCGCAGTCGCTTAAAGACCATCGGCGATGATCTCTATGTAACCAATGGTTCGTAAAGCGAAGTAAG 

CAAATGCAAAGGCCATTTAATGGCTTCTTTCCATCTTCAGATGTGCACAATCACATGGGCGACAACAAGCGTATTGAGGCGGCCAAGGCGGCTGG 

GATGCTGATCCACAAGAAGTTGAGTTCCCTCACAGCCGCCGACAAAATCCAGGGTTCCTGGAAAATGGACACCGAGGGCAACCCAGACCATCTGC 
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CCAAGATGTAGCCGCTTCCTATACATTCCCAACCCAAACGTTCGTGTTCTGTTAGGGCTAAGCTAATGTATTCTACTATTGTTCTTGAGCGTATA 

TCATTGATTTAGACTGAGTGTCCCGAGCCTAGTTAGGATGGGTAAGTAGCCTGTTCATCTGGGATCTCAAAGTACCATTTTTGTAATTCTTAAAG 

TCCCGGGTCATATTATTTATTCTGTTCTCATTGCTTAGACGCTTTTGGCCACGGCACTGACTTAAACAATGGCTTGTGTTACGTACAAAATATGA 

AC T C T T AAT AAAAT C AGAT GAACAT C T AAAC GAAT T GG AAT TAT AT GT AT GAT C T CAT C AT AAT AT AGAAG AG A T TC T T T C AT GC GT T C TAAAT A 

GTGTTTATTTTGTGCACTAAACGTATTTGCGTTAAGAGATCTTATCCCGTTTTTTGAAAGTTTTATGAATTAATCAATTCTCGTGGGTCCATATA 

AATTTAAGTTTAGAGTTTAGCTATGCATGAGTATTTTGATATTTCACGTTGCGTTATTATGTTTCAGAAACTATCTGACTTTTTCAAGATATCCA 

TATAAACTATATGCTAATTTTGGTGTACTTCTAAATAATCAATAATATTCTTGTTCTCTATATATCTCGTTCGCTGGTAAATATCTACGAACAAT 

TAGAAGAAGCTGTGTCTATTACCTATGCAGTGCAGGGGTTGGAATGTACTTGACGATGAGAACAGGACCCTTTTTGGCCCTAGTCGGTAAAGACC 

CAAGGCATCTTTAGGCTTTGTTGCTGACTCCTGACATTTTTGTGGTTGTGCGATGAGCACGTGCTGCGGTATTTCACCTCGGAGTCCACCATGAA 

CGTAGTGGCTCGGGCCTTGCAGCCCTTGTGATGATACATGCAACGCCAGGATCCGATAAGGTTGTCCTTGCCCAAAATACGGTGGACGATAAAGT 

GCTTGTTGGCAATGATTAGCCTCGGACGTCTATGAGCAGCAAGACTGAACTCCGCAGGCCCGTCGAAGTTCAGAAACTTTTTTGAGTAGCGTTTG 

CGATCTGAAAGAAACAAAAAGCATCAAGTCAGTAAATTGTCTTGGTAATGTCCACAAAACTATTGTGTAACTAGCGTATTTGACTTAAACAGGAT 

GGCAGAGTAGATGTTTCCATTATCTTTATTTCGGTTTGTAAATTGCTTAAGCTAATCAGGACCTATACAATTTTGCACTTAGCTTAGGTTTAGGT 

GGCTCGATTTGCAGCATGCAATCTTTGGAATATTTCAAGTACTTTGCAGTAGCTTCCAATTCCACCGTGATGTTATTTATACAAACATGATATAT 

GCCTGTCCTTAAAATTCTGCAAAGACTCCGCTGCCCGCTAAAAATCGCTCACCTTAGGTACACTCTACATCGACATTAGTTCAACAGATAATAAA 

AACCACAGATTTTGGTGAGCCCACAACCGACGGGGCTAGTAATCTTTGCTAATGAAGACGGCACCGTGCGCCTGGTGCTGCTGCATCAGCGCCAT 

CATTTGCGGTGTTTGAACGTGTGGCAGGGAGATGGGCATCGCCAAAGACATGGAGACGGGCACAGGTACCGTCACCGGCATGCCGAGGTGGGTGG 

GCCCTGTGGCGTACGCCGCGGTCGGAGGCATCTGCGCAACTTGGGTCATCATTTCCATCGCAGGAGTGGGAGTCGCCGAGGTGAGCACATGGTTC 

GGGTTGGGTGGTGAACTTTTAACCGCTGGCACAGTGCCGTTGGCCTTGGCCTGGCAGTAAAGTTGACACAGGGGCAGCAGCTCCTTGTGCCGCCT 

CTTGATATGGCTGAGAAAGTTCCCCGTGGACTTGGTGGTGCCGGAGATCAGTCCGTGGCATGTTGTGCAGGTGGCCTGGATAGCACCTGGCGTCT 

TCTGGTTCGGCTGGATGTTCTTGTAGAACTTACCGTCCAGTAGCTTGGGCAGGGTAAAGGGTTTCCGGAACGATTCCTCGTCGCTTAAGTAGGTG 

ACGGCAAGGGCGGCCACAGCAGCATCCGATGAGTGCCCAGGCGTCGGCGAAGGCTCCATTTTGATCTTGACTGCAAAATGAAAAGATTAACAGAT 

TAGTTGGGATTTAAGAATGCTTAACAACTATATAGTAGGTACAGAACGAGAACACTGATTACGTATTATGTGCACATTTAATTTAATTTATTTAT 

GTTAAGGTAGAGCTGAAATCGATGAGTATATTATATTAACTGCTGGTCCTCCGGAGCTTACTGCAGCTCCGGAAATTCGTCCAGCAGGTCAGTCG 

TGAGATCCATCGAGCTGACCTCCGGAGCGTGGGTGTGCGGAGTGCCAGCGATGCGCAGGGTGACCTCCTGCACTTTGGAGGTCACCAAAGTGGCC 

CGGCAGCTGTGTTTCACGTAGCGTGAGCACCGCCAGAAGACGTTCTTCAGGCTCTGACGATTCACGACATACCGATAACCCTCGTAGACCAGCAT 

CACCCTGCCTCGTTGCGTCGTGGTGTACTGGATACTGGCACCGGTGGTTTTGCCCTCGAAGAGCGCCTTCAGCGACTGGATGACACCTGGAAAGG 

GAGGTAGATGTGTTTTGACAGACATTAGTTGAGGTTATTTTGGGAGATCCAGGTGAGACCACTAACTAGTGCACAATAGGTTTAAAGGGAAGTTC 

TAGTACAGGCATTTATTTCGGTTTACGAATGCAACAATCAATTTAAACTATTTCCCCTCCTTTATAGGCATAAGTTTAAGAGTAGACAGCTTGTG 

GTCAATGAACTCCTGTACTGCGGACGGGGTCAGCTGGTTGCCGCGCAGGGCAAACACTTTACACACATGGGCTGAAAAGATGGAAGACACACAGG 

TTAGAACGACTAATCGTGCAACGGGCGCAGCCTCCCTTATGTACTTATGATGTCCTGGATCAGGTCCTTGTCCGTGGTCTCCTCGTGCTGGGCGA 

TGCGCACCAGCTTGTCTTTACCAATCACCAGATCCATCACACGGCACGTATATATCATCGGCATGGGGATATACGCCTTGGCCAGAGCCACCTTG 

GACACAAAGACCCGGCTGCCATCGAACAAGGGGATAGCGTCAGCATCTCTAGGGGACTTCTGCACGCTATCAGTTACTGGTTTCCTTGGATGCTC 

CGCGGGCTTCGGGATGGCAGTGTCCTTGCTTGGGCCATCTGGAACGGCACACGATGGCAAAAACGATTAAAACACTTTAGCTTAGGAGGTCAACG 

GAAAACATAAACAAAATAAAACACGGTTATTGCGGTTTTCTTTTTTACAAAGTCTTTATTTCATTTCGATTAGGAAGCTTTCGCCTACACTTTTA 

GCGCAATTCTGGCGTCAAAACAGTTTTCCCATTATTTTTCACACTCGTTGCCCAATTTGACATATTAGAACACAAATTATTTATTGAAGTAAGTC 

CACGCGAGTCACAGTAGACAAGGAAATGGCAGCAACAATGCCAGGGCTGCACTCGATGCACAGCGAGTCCTCGGTAAAAGGGCGAGCAAAATGCT 

ACGTTGTTTGATCTAAAGTTTTTTTAATTATGAAAATGACATACCTTCTTTAATATTACTATACTCAAGTAAACATATTACTAAATTATAGCAAA 

ATATGCATATTGGTATTGGTACTAATAAATACGTCATGCAAACAGCTCGTTCCGTGTTAGAAAACGATTTACCGGTTTTAAGATGGTCTATAATT 

TTTCAGTAGTTTTGCGCATACCTATAGCTACTGTACTCCCGAACATTGCAGCCCTGGGTGCAAGCGCTTGATCTGGCAACTCTGTTAAAATGTCG 

CCCACAGTTGTGAAAAAATGAAGTGATTTCCGAAAATTGTTGCTAAATTACTAAACAAAAGGTGAAATTAAGATTTGTAATGGCCCCGAAAAGTG 

AATAGTGTTGGTGGAGTCCTAGTTGTGCATGAAAGAACCGCCTTCCAAAGAAGGGAGCCCAATGGCGAGTAATGAAGATGAATAAATGCCAACGA 

CGCCTGGAATTTCAATCAAATGTGTACAAATGTTAAAACGAAATTGCAAAGCGACGCCATGCGCGCAATGTTAATAAAGTACAGAAACTAGAGTG 

AAGCAGGGATATATTACTTGGCGGATCCTCTTGGGCAATCAATCAGTCAGAGCGTTCTAATCAATACCCCTTCCTCCCACTTCAATTTGTGTCTT 

GCAGAACAGGAGGATGACTTCAAGTTGCACCTGCCTCTCCTCGTGACAAGAAGAAAGAAGACGCCCGGTGGCTCCCGCAAGCAGAGTTTTGACCA 

CCTGGAGGTGAGCTTCACCAGAAGCAATAGGGGTAACAACTTGCTGACCATCGATGGGAAGCCCTTCACCCTCAATCGTCGGATCAAGGATGTGT 

GCTACTGGGAGTGCGTCAAGCTGCGCTGCAAGTACATCAAGTGCTCCGCCCGTGTGGTGACCAAATCCAATCGGATCTCGGCACTGAGCGGGCTC 

CACAACCATCCCTAAGTTACGAACTACCTGTGCCATCTCATTAGTCTAATAAAGGTGTTTACGTTGAATCAGTTTATATCTTCCGCCTCTCCTAT 

CGTTGTTGATTCTATAGTGTCCTTGCGCAACTCAGGACGCCCTCTGACTAATCTCTTTCAATTCTTGCAGCATGCTACCAACTGGTGCCCAATCG 

CCGCGGTGGGAAGAACCTCATCTTCCAGGGGCACATGTACTCCGTGGAGCGCAAGTACCGAAACAGCATCAACTGGGTATGTTCCAAGAACAGCA 

ACAGCGTGCTTCGCTGCCCGGCGAGGTGCGTCACAAATCCGGAGAGTGGCAATGGCATCAAGCTGAGCCACCGACGCCACAATCATCCGGCGGAT 

GCCTTTAAGCCCCACAAGCGCTGTCGAAAGCGTCCCGGCGATCGAAAGTAACAAGTAACCATGTTTCAGGATGCAAAGCATGCAGGCTTCGGCAA 

TTGTTAGTCACACTAAATATAAATGTTTCTCATCGTTGTCGAATTGTAACATTTTGGTGTAATGGAATCACTGTTCATTTGGTTTTGGGCTTTCA 

CATATCCGAATTACTTTACTAATACTCACTCATCAGCACGTTTCCCATAAACTAATTATAGTTTTAAGCTGTAGTTAATTCCAAACGACTTATGA 

CTTACTATGAAAATCTCCTAACAGATTTTCGCCACAGTTGTGTTTGCCGACGGAACGGAGTTACGATAATCGATGGCTACCGCTTCGTGATCGGT 

ACTACAAATCTGCGTCGCACCTATCTGAAGTGCGCCAACTTTCGCAGCAACTGCCGGGCCCGGGCCATTCTTAATACGGATACGAACAAAGTGCG 

GATGAGGCACGACCGCCATAATCACTCCCGTACGGATGCTGTGAGCGGACGTATCTCTAAGAAGTAGCAGGTACCGAGCTGTCTAGAGGAATAAA 

CATGCTGTTTCCGTTTAAAAATGTTTAACTGCTTTTCACTACCAACGAAAAACCATAATAACCTTACATCCTCACCCATAATTTGAAAGACCTGT 

CCTAATCTAACTAAATTCGCCAATCTCTTGCAGATAACTCGAAAATAGCGCAGTATGTGCGCTCCAACCGAGGAACCGATCTCGTTTACCACGAG 

GGAAACACCTACACACCCAATGAAAAGCTGAGGGAGGGACAAAAGAGTCGAGACTGGAAGTGCTCAATGTACCATAAGGCCAAGTGCCGGGCCCG 

CCTGGTTACTCGCATTACCGGTGGAGGGGATATTATACACGTGACCAGCAACCTGCACACACATCCCACAATGTACACCACGCAGAAAACGGACA 

TTAGCGTGGTTGACCAGAAACTGTGCCTCGAAAGGAATCC ACT CTGCGTTAACACTCGATCGATT AAAAT GTAGATCAAGTAAAGACCACCGACT 

GATTACTCGTTACGATTAGCATGATCTTCAGTATAAATAACGTTTTAGATTTAAAAAGTATTAAAAAATGCTCATATACATTCAATAATTTTTAA 

GTAATTTCATTTATTTATAATAAGCATATTTTCTTAAGTTCTTTCACATTTGATTTGTTTACTACGACGAATTCCATTTTTAATTCAAAACCGTT 

TTATTTACAGGACATCTTTCGACATTGCGCCATCTCCCAGTTGAGGCCATATTTGATGCGGACGGCAAACAGATGGACTTCATACCCAACATTCG 

CGTCATCCGAAGTCAGCGGAAGACCATAAAGCTGATGTTTAAAAAATATGCCTACTCGAAGACCAACGAGCACGATACCACAACCTACTGGCACT 

GCCGAAGTCGTCGCAATGGAAGGCCTGCGTGCAAGGCCCGGTTTTCCACCAAGAAGCTCAAGAACGGTAGCTACAAAGTATATCTGACCCAGCCG 

GAGCACAATCATCCCCCAAAGAAGCGTCGCCTATAATCAGAATAACTATATTTTAAGAAGAATGATATATTAAAGAGCCATGTCTTGGCAAAATG 

ACTTGCACCGACTCTGAATCTCTACTCTACTACTCTACTCTCAAATAGCCCAGCAAATGACCATTTACTAATTACAATGTGTTCCTTTCTCCTTC 

CACAGAAAGCAGC GTGGCGATCT ACT CGGCAACCTCCC GGGGAC GAAT GCAACTGATTTACGGCGGACAGCCATTCATCTTCGAGAAGACGCTGA 

AACTGTCTTCGGGAGAGGAGAAGCGCTTCTGGCGCTGCAACCAATGGTGGAACCAAAAGTGTCGCTCGCGCGTCTTCACCATCAACGACGTTGTC 
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TGTCCGCTTAACCGATTCCACACGCACGAGGAGATCGTGCGGCGAAAGAAGCGAGTGCGCCGCGTGCCGCCGGTGGAGACGATTGCCAAGGTGGT 
TGCCACAACACCCAGGCATCCGCAGCATCAACAGACAACCCAGCAGCAACAAGAGATTCAGCTGACCAGCGATGCAATTGCTGGAGCAATACTGG 
ATGATGAATCCCCAGCTACAATCGATGTCAGCGAGCTGGGGATGCACCTGAAATACGAAGAAATCGTAGCGGATGTCACGGGCATTGTGGGCGGC 
ACGCGAGTGGTCAGTCGCCGAAAGTGAAGATTGCTTTTTATTTTCTGACGTAAGATCGCAGATTCAGATTCCTTCTTTGTAAAAACTCATGCTTT 

G AGT T TTCATTTTT G AAC T T GC C AC GT C G AAAACC AAG AGC C G AAC AGT T T T G AT GAAT T T AAAAAG G C T AAG C ACAACC AC T AGC C AAT AAAT A 
CAATTTTTATTATTATTTTAAGGACATATTTTTAAAACTCTTTTTATTAAGCGTAATATTCCAAGGTTTCCTCCCCCATTTGTAAAATTTGATAA 
ATACCCCATGATTTCCTTTGACATTTTGGAGTTGTTTAGAATAAAATCTGTTCCTTCTGTTATGCCTTTGTTCAAACAAAATTGACCTAGCCTAC 
TCCCATAATTATTAATACAAAACTGCGTTTAAGAAGGCGGACAAGCTAGCTATTATGATACAGCTTATAAACAATTTATTATGTGTACATTTAAT 
AAACGTAAGACCCAAACAGTGCAATTGAATCGTTGTAGAACCTCCGAACTTTCTCCACAACCCACTTGAGATCCTTCCTTCAAGCGAATATCACC 
GACTTACTAACGCACCGGGTCACATATCTATTCTCTTTTAAGGTGACGCCACCCAGTTCTTCTTCACCAAGGGTCAGCGCGAATCGGTCAAACTC 
AACTACTGCGGTCATAGCTACGTCAAGTTCATGGAGAATGGACGCGGCACCAAGTGGATTTGTGCCACGCGCTCCACCACCAAGTGCCGTGCTCG 
CATCCGTACCACCAAAAACAACTACCTGGAGGTACTGTATGCGTCCCACAATCACGGTTTTCCGCCGCAGAAGAAGGATCGGGGTCGAGCCAGCC 
AAAGGATGTGATCCAAAATGTAACTGAGAGTGTGACTGTGACTAAAGCCAACTTCCCCTACAATACAATTTCAATAGTTTACAAAACACTTGAAG 
CCTCCTCTAAAACAGCTTGACTTTGAAACTATATATTGAAAAAAAGGCTGAGACAGCGCCAATCAATTTCCCAGCCAACCCAATCCTAGTGGTCA 
ATCTGTCTAATGGATGTAGCCATATTTCCTTCCATTAACCAAGCTCAAAGTTTGCCAGATGCTTAGATTGGCTAACACTATTTTCGTTTTCAATA 
CAAAAGAACGCCTATATGATCTTCCTAACTCGAAATCTGCGAGCGAGAAATATACTTACATGAACAGACTTTAAAAAAAAACGAGTTCACCATAA 
TGAGGGCCCAAAAAGTGTGCTAAAGAAACATTTTATTCTTCATGGTTAAATTCTCTTACATACTTAAAGTGAATATTGTTAAATAGGAAATGAAA 

TT AAAT T T T T CG AGAG GT T AC G AC AC AC C ACGAAGT G C AG AAAT C AAAAT AAT AT AGAT C T T T T C T AAC TTGTGTCTTT GAAT T TG AAC T TAT AC 

CTTGATTAAGTCGACCACTTTTTTTTTGGGTAATATCATCCTTTCCTCATCCACTCAAGAGTGGAGTTCCTCGAAGCAGGGCTCGCACATCCAGT 

TGCTTCCAGGACATGTGTTGCACAGCGTATTGATCCTCCTCAGGCCCGTCCTGTTCTTTTTTAGGCAGTTCACGCACTTTCCCTTCATGCGAGGC 

AATCCCACATACACAAAGTTGTGCTCGTTGGATGAGGTGCGGATGATGTTGTCCGGGTTCTTTAGGTTCAGCCTTCCAGCGGGCTCCAGTTTTAT 

GATTGTGGCTAAAAATTTGTCATCCCGCACCTGCCGCACTTTTTGAATCGGTGCTATATGTCGTGCAGCCTTAAAAATCAAACCTGTAAGTTTAA 

GAGGAGGAAAGATATATTATATGGTAATGATTTTCTTATGAATTTAAAGGGTATTAGAATAAAAATATACCATTAAATAGTTTAACTAAACTATC 

GTTTGCTTGTTTCATTTAATAAATTATTACTCTTCGCTTATTTGTACACTGTTTTGTTCATTAATGCATTAAACTTTAATATTCAAGCCGTATGT 

TTTCCCAATTGAGCGTCAGTCTTGCAAACGGTGTACAAATGTCACATTTAAATTTAAATATAAAAGATTTAGATTAGAGGAAAGTATCAAACTAA 

TTTTATAGATTATAATGGTTCTTAAAATTTAATAATCCTCTTTATGAGGACGAGGTGGGCAATCGAGCACTCTGGGAACAAATGTCTTTAGTCGT 

TCCGGCGGCTTGTGATTGTGAAATCCTCGGATTGATTTTATTGACTTTATGTGAGTGGTCAGCGCCGAAGTGCAGTGCATCTTTTTGTACATGGA 

GCAGCGCCAGAAGGCCAGACATTTCCCGGGATTCCTGGAGTAACAGCTAAACGTGAATCCCTCGTGGACTAATTTTCCTCTATCGTCGTAGGTGA 

GAACTGAAAGAGAAGTATTCAATTAGTCAATGGAGACAGGAACTAGGCCTACTCTTCGATACCTCGCCTCCTAATTGATCAGAACGAAGCTCGGT 

TACTTTCATGAGTTCAGATCTTTATTTCACATGATCACATAGCCGTCAAGGGAATGGAACAATTGGATTTGCAAAGCATATGCGAACTTAAAAAA 

ACCTTCATTGGTTAATCGAACTCAGGGGTCTTACAGCTATATTCACAATCTTATTCATTAAACTTCAAATTAAACTCGATTCGACACTTCAAAAT 

GCATATTCGGGTAACTAGGATGAAAATATCGAAAATAAGTGTACATAGTCCATTTCCTAGATATGTACTTTTAATAGCTGTTACTTAGTAACTAC 

ATTGCACTGCCAAGACCAGTTGGATAGCCAATCTATTCGTGCATGAGCATCATGGAGGCGTGCATCAGTTCTTGATCTGTCAACTGCAATGTGGC 

CGCATCGCTAGTCAACTGGTGTTCATCGATGAACTCTTGCTTCTGCTCCTGAAGCTGCAGCTGTGTGAAGCTGTGCGTCCTGGAGAGCTTTCTTC 

CTGTGCCGATCGCTGCCATTTTGTTACGCTGAATGATCTTGTCTATCTTCTCCGTGTGCGGCTGGTGGTTATGTAGTCCGCCTGTCACACTCTTA 

AGGCGTCCGTTCTCCAGGACGCAGCGAGATCGACAGCGCATTTCATGCGCCTTTTTGGAGTAGTTGTGGCAGCGCCAATACTGTTTATTGCTCTT 

GCGACTGTGGCAGTTGTACATGTAGCCATTTAACACTAGGCACGGCGTCGACCAGGGGCTCTCTATAAAAACCAGCTCGTCCTCTGAAAAGAAGA 

AGTTTTTATCGTTAGTTCAGTTGTCTACACTTAAGATATTTGGTTAATAACAATTGGGTTACCCAGTCCACAATGAGCTCCATCTTTTGTTTTGG 

TCTTAATTTATTAGGAGCTAAGTATGAAATACGTTTACAGAACTCAGACAAGTCGCTTAAAATCTACTTTCGAGTGGTCGGGTCGATTTCTTTGG 

GCAAAATGGTCTTGAGCCTTGTGGAATTGTGAATTTTTGCATGACTTACTAACATATAAAGATGTATTACTATTAAAATAAATATGTATTAATTA 

TTACAGAGCTGGAAAGTTTTAGGTTTTAAAAATAAGCCAAAATAATACCAGCTATGGCTTTCTTATCTTTATAAAGACTTTCCAACTGATTGACT 

GT AGAT TGGTTAAAAACTCCCAAGTGAAATCGACTCCTTCCGTGGTTCGAGAT AAAT AATTAGCTTAGGTGTTAGCTGATAACTCCTAGGGGTGC 

TCGCTCTTCAGCAGGGATCGATGCTTGGCCTCTTCTGTCCGCCACCTGCCCACCCGTTGCCGATCCCGGATCCGCTCCTCTGCATGCCGCTCCTC 

CTCATCTAACTCCTCCTCAGCGGCGTCCACAAGGGCCTGCATCTCGATCTCGGTAGCCTCGGCAGCGGGCAGGAACAACTTGCAGTCCTTGCCGT 

CCTTGGCAGTCACAATGATGTTGCTACTGCCCAGATACTGTGATATCTTTGGGTTCGACGTGCAGATCTCGATGTACTCCTCCAGTTCCTGCTCC 

ACCTTATACAGTTGTCGCTGGCCAATCCTCGAGGCGTGGGACTCGTGATTGTGCTGACGTCGTGCGTCGATGAAGTGTCCGTCGCGGGTGATGAC 

CACCGCCTTGCAGCGCAGCTTGAGCACATCCGCACAGCGCCAGGTCGTCTGTCCATTGGCCTGGGTGAGCTTCTTGTTGTAGATGTAGTTGCGGA 

ACACCAGCTGGGCGTTCTTCTTCTGGCTGCGAATGAAACTGATCTCCGTATCTGCGAGAGTCGAGGAAAACCGAAAGCATGGGGTTTAGTGGGCA 

TAATATAATTGACAGATGATCTAGTTGCTATTCTAGTGAATGTACATATATGGCTTAAGCGTTAAGTTAGGATCGGCGAAATAAAACAACGTTTA 

GTTTAGACTTTGGCTTTAAGCAGTTTATTCGCAATCAGTGAACCGGAGGACCCTTGGGTTTGGTGGTAGAAAACTTACCAAAATGCGGCGGGATG 

GCATTTTACAACTTACGGTAGCATTTGCATCGCTTGGTTTTCAAATAAGCACTATCACAGATTATCAAAAGGGTGGAAATAGGGAGGCGTTGCCC 

CCCTTACCAGCCCCATCACTTT AAAT ATT AAAT AAACACCACACCAAAC AC ACCGTC AC AAGCTGGAAATAATT AAT TGTGAAC AAAT AGGCCGT 

GTAATTCGGATGCAATAAAATAACAAAACAAATTGGCTTGCATTTTTTTAATGAATTTGCACGGCGTTGCCACACAGCCGAAGCACCAGGGCTGC 

ACATTCAATTAGGGTTTCCGGAAGTGGCAACACCGTCCTCAATTTCCACAAAATGGCAACTCCGCATTAATTCAAATTTTAATTCTTTATTTTTT 

CATACTACAAATATATATATAATATATTCTGTGGATACTTTCTAAATATCCGGACCAGGCGCGCAAAATATAAATTACACGATTCAAACGTTAAT 

ATAAATTTTCTCAAATACGCTTAACAATAAAAAAACGAAAATTACTACTACAAATAAAGGAAATATAAGTCGGAAATGCTTCAAGGCGGTCCAAC 

CTTTTGTTCCAAGGTATATAAACTATCGCCTATATGAATTAAAACGGTTTATGATTATCTCTTACCAATTAGGTAAGAGGTCTAGTATATTCTGT 

TTTTTTTGTTTAAGGTTACCGCGACATCCGTAAATATCGTGAAATTCTTCAGACACCCTTTAATTCGGAACCATATAAAAAAAACACGCAAAATG 

TTCCATACAAAAAATTTGGAATGAAAAAACAAAATCAGCACGCAAAAAGTAAAACAGAATCGACGCTGCATGAAGAAATAAATTTCCTCTCCACT 

GATAACGTTAAAAGAGTCTCCCTATACGTATTATTAAAATCTTTCCTTTTTTTTTTACGTGACCACTAAAACGATTAGAAATGAACCTTTACTAT 

AACACAGCGTTATAAATACAATTTTTTTTTTAAGACTCGGCAATCAATGCCGTTGCTGGATTTGCATCAGCAAGCTTTTACGTTGCTTAAACAAA 

TCCAGACAACATCTTTAAGTACGAGTTTGAAAACATAAATAACAACAAACCTCAGTAGCCTTAGCATCACCAGAGGATCATTATTAAACAGAAAT 

GCGCATGCCCAAATAATAACCCCCTGATCATGTAAGATTGAAAACTCCAATTACTCGCTTTGCTCAGTCAAATAAACAATTACAAATGCCTGCAA 

ATTTTGGAAAAATATATGGATAGGTGTGGTTTTAAGAGTGCCGGTATTACCTTGAGTAGTTGCAGCTCTTCACGGGTTCGGCTTCACGTTGACAT 

CAGATATCCATCGTAGCAGCTTCTTAAAATTTCATCACTCTACTTACCTTTTAACTGAATAAGTGTAAAACCGCGCGATCTCCGTCCAGCATTTA 

ACAAATATGCATCGGACAGCAGATCTGCCTAGTGCTCTTTGTCGATTCGAAATAGAACACAGGCCCGTAAACTACTTATTAACTAATGGCCAGCG 

AAGTATAGCTTAAGTTTGTTCAAAACAAAATTAAACGCACTTTGCCGAACTCTAAAAGGACATGATCCGCTTGCGACGCTAATAGATTTGGAAAG 

TGTTCAAGTTTGGAATACATATGTGATCTTTTTTTTAACGTGTTTGCATTTCGCAAGCGGACTTAATGTCCTTTAAATATTGCTTATATCTATTG 

GCCTCTGGCCATGTATGCTTTGCAATTCAACCAACCTTGAGCCTCTGTGTTGCCCTGATCCCCGCTGGTGTCGACAAACGATGATTCACTGTAGT 

TCTGGGACTGTTGCTTGACCACAGCCTTGGAAGTGGTCGCCGTCACTCCGCCACCGCTCGTATTTGCCGCAGTCTGGTTGCCAGCATCCTCGTTT 

TCAGTGAAGTAGCTGTCGTCGTAGCGCATGTCTCCGTAAGTATCGTCCTCCACATAGGTGCCCTCGGCATCGCCGGCGGCGTCGCCATGGTCCTC 
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AGAGTAGTCAGGTTCAGATTTCGTGGGCAATTCCATGGGGAGATCAATGTACTCGGCTTCCTCCTGTCGAACCTGCTGCGGCGATTGCTGGTGGA 
GCTTTGCTTCTGCCGCAGAGACCACAGAGGTTTGCACGGTGATCTGTTGCGGCACAAGTTGGGCGGTTGTCGTGGCTCCAGTTTCAGTGGTCGAG 
TCGAGAGGGTCCATAACATTTGCGGAGGTGCTTGTCTTGGAGCGCTTTACGCCGGCGCTCGAGGAGGCGGGGGTCAGGGAGGAGCGCTGGGCAGA 
GCGTTTGTTTGTCGACACCAGCGTCGCCGTCGTCGTGGTTCCCGTCTGCAGCTGCTGCGATTGGATCTGCTGCGCCGCCTGCTGCGGTTGCTGCT 
GTTGTACAATGGTGGCTTGGGGTGCGGCCGTGGTCTGGATGACGATCTGGGTGGTGCTCTGTTTCTCGTCGCCCAGGCCGTCGTCCACGGTCTCG 
ATCTTGTAGCGGGCGGAGGCGCGAGGCTGCTGGCGCTGCACCCGCTGAGCCGGTATTTGCTGTTGTTGTACATGAGGTGCTGCTGGCGGCGGGCT 
GGATTCCTGCGGCGGCTGAGGCGCAGGATCGTTCTGAAAAAAAAATATTGGTTTTTTAATTATTAACATATATTTTAGGAATAAGTGAAAAGATA 
TTTTGTATGTTCTAGCACCTCTTTAAAAATGAACCACGATCAAAACTTAATAAACTTTAAAAGTATCTGCATTTAATAAAATGGACGATTTCAAA 

AC TAT AT G T T G AAAA G AAT T AT T C C GT TAG AAT G GAAT T TG C AAAG AAAT AAG AAAT C T GGC C ATT AAT T AGT AAAGT AC GCC T T C C T T G AAT T G 
GGAAACAATAATACATTTTCTAGTTCCGATGCCTGTTTCTTTAGGTACAGCTGCACTAAATTTAATTTTGCAAATACGTTAATTTTAGTCGAAAA 
CATTAAGAGATATTTCCCAATTTAAACATTAGTTTAAAAGCAAAAAATGTTTATGCTTTGGCATCGAATTCGCTGCGAGACAAAAACAACCACTC 
CAATTTGCCGCTGCCGTGCGAGCCAGAGCGAGACAAACGCTTCGCGTATAGCCCCTGCAGTTGTCTCTCTCTGGCACGCGTGCCTTTTGGCATTT 
GGTTTGCGAGAAATTTCTCGGTACAGTCAAACCATGCTCTTATACAGACGCCAGCCATCGCGTCTGTATGTATTGGGCGCGAAAGGTTGCGTTGT 
AGTTGCTCGATTCGATATACCAGTATATCACGCACACAAATGTATACATATACTCACATCCGTTAGCCCCTTGATTTGCAGCGATTCCGCGGTGC 
TAATAAACGCGGGCAGGGCGTCCTGCTTCACGTTGACCTCGCCGCAGTACATGAATTGGATGAGGTCCTTCAGCGCCGAGTGGCTGACGTTGTTC 
AGGAATACTGCAAGATAGGCGAAGAGAGTATAAGGTAAAAAATCACGTAGAGCTACAGATCTGAGCTGAGGGGCACAAACTCACCGATAGCGTGG 
GTGTTCGACGGCATCTGAGTGAACATCTTGCGGAAGAAGGGCGAGCAGACGGATAACACCAATCGGTGGGCCTTCACTATTTGGCCCTCGGCGGC 
CAGCGAGACGTCCACCAGGTCGCCGCGGCATAGCGACTCGTGGAAGCCGGCCGACAAATTCGTGTTGAAGTTGTTCCAGCACAAGCTGAATTGCT 
CGTCGTCCGCCATCTTGGACGAAAATCAATACTTTTCTGTATCTATGCGTTTGTTGGCTGCAGGCGAAAAAGTGCAATGCGCGTTTAAAACTCAA 
TGCAGCGAGTGCTGAAAAATGGCAAAGGCACACACAAAATAGAATGCACTGCAGCCAAAAAAAACTGGGAAATTTTCACGCGTCTGTCTGTGTGT 
GCGTGTGTGTAAATTACATTTATCTCGCTTAATGACCGCAACCTACTGTGTATATTCACTTGGGGCGCCATTACTCACCTCTTGACTAAATATTT 
TTCCAATAAAATAACTAACTAAAGCCAGAGTTTTTTGCTAGCGCGGCCGACACGCAGAACGCGTCCGAGTTCTTTGACGTGAAAATTAGAAGTGT 
TACCAGGCGACGAGATTGCTGGCGATGGCAAAACAAAAAGTGACAATATATCGATAGCTTAAAATATTTATATCGAAATACTTTTATGAATATTG 
ATAGTTGATGTGCTTTGATATTTTTTTTTTAAATGTTAGTACGAAAATCACATAATTAAGTTCATAACTTTTTGAAAACTTTTACATACTTTAGT 
TTACATATGCTGTAGGATTTTAGTTTGCGGGTGTGTTAAAAAAGGTTGTAGGACTAAGAAAATATCGGCCTAAGTATACGATGTATTTATGTTAT 
GACGAAATTTCTTATTTCACAACAGTTTTGTTGTCTAGCTTAATGTTTAATTTATTTTAAATGAAATAAAACAAAATACAAATTAACGAGTTTAA 
AGACTAGCAGTCAAACTAGCACAACTAGAATAAAATTCATAAGATAAACAGAAAAATTTTAACTTTTTTAAATTCTCCTATTTTTTAAACTGGGC 
CTTCGATTTTTTACGAATGGAGGATTATTTTCTAGCGGAATTCATTATAACTCTGACAAACTCTTATGGAGCGCATCGTCACTACGTAAACCCTA 
CACTCTGAATGCACAAGGGCACTATATTTTTAAAAAAAAAGTGACTTAATAGGATTATAAGAAAGGCAAATATTCGCTAAAAGTTCTTAAACATA 
CCAAACAAAATATGTATGTACATAATATGTATATCTGTGTACTGTATGTACATATGCACTACATATGCAATTATATACATATGTGAACACGTTTT 
TGGTGCAATGCCAAGATTCGAAATGAAGGTATTCATCCCATAAAGGCGAAATATTATTAAAAATGTTGCTCTATTTAGACAACTTCACATATCCA 
TGTTATTCATCTCTATTAAATCATAACAGTTTCAAGTGAAATAAACAATAATATATGAGAGCGTAACGTTTAAACCAAACATATTCCCCACGATT 

TATTTATTTGTTAGCTTAGTACACTCTTAAA 
(SEQ ID NO: 1237) 

Exon: 23161.. 23069 
Exon: 22857. .22600 
Exon: 22522. .22383 
Exon: 21693. .20841 
Exon: 1338.. 1001 

Start ATG: 22813 (Reverse strand: CAT) 
Transcript No. : CT40910 

CAAAGAACTCGGACGCGTTCTGCGTGTCGGCCGCGCTAGCAAAAAACTCTGGCTTTAGTTAGTTATTTTATTGGAAAAATATTTAGTCAAGAGCC 
AACAAACGCATAGATACAGAAAAGTATTGATTTTCGTCCAAGATGGCGGACGACGAGCAATTCAGCTTGTGCTGGAACAACTTCAACACGAATTT 
GTCGGCCGGCTTCCACGAGTCGCTATGCCGCGGCGACCTGGTGGACGTCTCGCTGGCCGCCGAGGGCCAAATAGTGAAGGCCCACCGATTGGTGT 
TATCCGTCTGCTCGCCCTTCTTCCGCAAGATGTTCACTCAGATGCCGTCGAACACCCACGCTATCGTATTCCTGAACAACGTCAGCCACTCGGCG 
CTGAAGGACCTCATCCAATTCATGTACTGCGGCGAGGTCAACGTGAAGCAGGACGCCCTGCCCGCGTTTATTAGCACCGCGGAATCGCTGCAAAT 
CAAGGGGCTAACGGATAACGATCCTGCGCCTCAGCCGCCGCAGGAATCCAGCCCGCCGCCAGCAGCACCTCATGTACAACAACAGCAAATACCGG 
CTCAGCGGGTGCAGCGCCAGCAGCCTCGCGCCTCCGCCCGCTACAAGATCGAGACCGTGGACGACGGCCTGGGCGACGAGAAACAGAGCACCACC 
CAGATCGTCATCCAGACCACGGCCGCACCCCAAGCCACCATTGTACAACAGCAGCAACCGCAGCAGGCGGCGCAGCAGATCCAATCGCAGCAGCT 
GCAGACGGGAACCACGACGACGGCGACGCTGGTGTCGACAAACAAACGCTCTGCCCAGCGCTCCTCCCTGACCCCCGCCTCCTCGAGCGCCGGCG 
TAAAGCGCTCCAAGACAAGCACCTCCGCAAATGTTATGGACCCTCTCGACTCGACCACTGAAACTGGAGCCACGACAACCGCCCAACTTGTGCCG 
CAACAGATCACCGTGCAAACCTCTGTGGTCTCTGCGGCAGAAGCAAAGCTCCACCAGCAATCGCCGCAGCAGGTTCGACAGGAGGAAGCCGAGTA 
CATTGATCTCCCCATGGAATTGCCCACGAAATCTGAACCTGACTACTCTGAGGACCATGGCGACGCCGCCGGCGATGCCGAGGGCACCTATGTGG 
AGGACGATACTTACGGAGACATGCGCTACGACGACAGCTACTTCACTGAAAACGAGGATGCTGGCAACCAGACTGCGGCAAATACGAGCGGTGGC 
GGAGTGACGGCGACCACTTCCAAGGCTGTGGTCAAGCAACAGTCCCAGAACTACAGTGAATCATCGTTTGTCGACACCAGCGGGGATCAGGGCAA 
CACAGAGGCTCAAGATTTAGGTGAACTCAATCCTAGCAACCTGGCTGACTTTGGTAACGAGAGCTTCTTGCCCAAAACCAAGGGTAAACGGCCAC 
AGAATGTTCGCTGTGGACTGGCGCCCGATCAGAAGTGCGTCCGCACCCTGGACGACTGGGATCGCATCCGGTATGATCGGACCAGGAGCGGCGAT 
GTTCTGGTCTACGACGGATATCGTTATGATCGACGAGCCAACTACAACGATATCATATATTGGGGGTGTGCCAAAAAGCGACTAAGCTGCAACGT 
CTACATGATTACCCACAAAAACAAACCCACATATGTAGCCATTTCAGGCGTGCACAACCATTTGTAG 
(SEQ ID WO: 1238) 

Start ATG: 138 (Reverse strand: CAT) 

MADDEQFSLCWKNFNTNLSAGFHESLCRGDLVDVSLAAEGQIVKAHRLVLSVCSPFFRKMFTQMPSNTHAIVFLNNVSHSALKDLIQFMYCGEVN 
VKQDALPAFISTAESLQIKGLTDNDPAPQPPQESSPPPAAPHVQQQQIPAQRVQRQQPRASARYKIETVDDGLGDEKQSTTQIVIQTTAAPQATI 
VQQQQPQQAAQQIQSQQLQTGTTTTATLVSTNKRSAQRSSLTPASSSAGVKRSKTSTSANVMDPLDSTTETGATTTAQLVPQQITVQTSVVSAAE 
AKLHQQSPQQVRQEEAEYIDLPMELPTKSEPDYSEDHGDAAGDAEGTYVEDDTYGDMRYDDSYFTENEDAGNQTAANTSGGGVTATTSKAVVKQQ 
SQNYSESSFVDTSGDQGNTEAQDLGELWPSNLADFGNESFLPKTKGKRPQKVRCGLAPDQKCVRTLDDWDRIRYDRTRSGDVLVYDGYRYDRRAN 
YNDI I YWGCAKKRLSCNVYMITHKNKPT YVAI SGVHNHL* 
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(SEQ ID NO: 1239) 



Classification: transcription_f actor 



Celera Sequence No. : 142000013384688 

ATTTATTTTTATTTACTTGCTTTCCACTaCAACGAAATACATTCCATAGATGATTTTATTAAGCTTTAATGAAACAAACTMTGACGACAAACTA 

CAAACATTCACAAGAACAGATCTAACTAAATCCATGCAGAAATGTGCGGCCAGTAAAACTGTGCAACAAAAAAAAGAAAAACAATTATTTTTAGA 

GCTTTGATTAAAACTTTATTACAAACATAGAACCTGAATGGGAAGACGAAAAATCTCGAATGCTTACGTATAAAATAATTTCTTTAAGAAACTAA 

AATTCTTAAACAAAAAAAACATTGGTTATATAATTGCGGTGTTGGGCTAATTTAGCGACTCTTTATACAATGAAGAATAAACGAAACAATTTCTA 

AAATTTTAAAATCGAAACATTATGGCAAGAGTGGAAAATTCCAAATTTTCTAAAAAACAATATTGTTGAACAATAGGAGAACATAAACCTTTATG 

AATAGAGTATAATTTTTAATAAATAAAACACTTAATAAGACAACAAATACTCATTAAACAAACAGATTTAGCATACCATTATATATATAATATAT 

BTATTTAATTTTTCTATGAATTGTTTGTAAAACTTTTTTGTAAATTGATGAAAAACCCCAAATTCAATGTGAAAAATATACAAATTCCATGTAAT 

AAAACGCAAAAGAAACGAAAGTTGTTTGCAATAGTGAGTTGATAAATATTTTATTGATGTCTACGACTATAAGTTTTATTTTCTACACATACAGA 

TATTATCATTTTATTTACTGCTTGTAAGTATTGATTTTAATTTTTTGTTATAATATTACACTATTTTTATTTCATTTTTGGAAACATTTTATTTC 

GAAACACACAAATTATTGAAATATCAAAACTGGTTAGGCAATCATCGATTATAGGTACACATAAAAAAAATCTAATTTATAGTTTGAAACTTGCA 

TTTAATATATGATAACTAATGAAAGCAACCCAATTACCTGACATTTTCAGATGAAGCTGGCCAGAACGAAGGTGGTGAGTCCAGGATTCGAGTCC 

GTAATTGGCTAATGCTGGCTGACAAATCCATCATTGGAAAATCCTCAGACGAGCCCTCAGGTATTTTACACAACTACACTGACTACAGTTTTCTT 

GTTTTATGCATAATTTGATTGATTTTGTCATTTACATATTATGGTTTTAACCTTTTTTTTGAACTTTCCAATAACTTTGTTTTCTTTTTACAGTC 

TTACATATTGTTTTGTTGCTGAGCACACACAGACACATCATAAGTTTCTTACTCATTATCCAATCATTTATTGACAAGATTTACTAACTAAAACC 

ATTCCCAATTTCTATTTCATGACTAACCAAACAAACAAAAAAAAAAAAATACCGACAAACAACTAACCTTCAGACAAACTAACTCAGTCCAAAAA 

ATCACTCATAAGCGATGCTAAAACCACTAACAAAACGTCGACGCCGATCCGACCAAAAGTTTCGACGACGACCACATCCACGTCCACGGCGGCGG 

CTGCAGCAGCAGCAGCCACAATTGCAGCCAAGCAGGCGGCAGCTGCTATTGCCAGCAGCAACATCAACAACAATAACAGCAGTTTGACGCAAACG 

GTCACTCAAACTGTGACTCGTATCGGGAGCATTGGACGCACAACCATTGCCTGCATCACGCCGGCGAACAATGGTAACAAGAGCTCGTCGAGCAA 

CTGCAATGTGGATGCCGCATCGGCCGCTGCACTGGCGGCTGCCGGCGTGGAGCTGGACAGCATCGATGACACCATGACCGAGGTGATTGTAAAGA 

TCGAGAATCCAGAGAGTATGCCGCTCAACGACGATGAAGACGATGCCGTTTGTAATGAGGCTATTGAGGATGAGAATACCTTCGACTATGACCTG 

AAGCTGGGCAGTCCGTTGTCGTGGACCTACGATGCGGTGAAGATTGAGAATGAGGAGTTCGAGGATAGCTATCTGATGGATAACGACGATGATGA 

TGATGATCTGCTGACCACAGCTGCGGCGACTCAAAAGCATGCCAAGCAATCGAATGAGAAGCAAATGGCCGGCTCTATGGTGGCTGGAGCTGGTA 

GTGGAGGTGCCGTCAAGAAGATCGTGCTTAGTGCCCAGCAGCAACAGCAATTGCTCGAACAGCAGCAGCATCTGCAGCACTTGCAGCTACAGCCC 

ACCAGTCAGTCGCTGCAGATTAAGTTGCCCGCCATACCAGCCACCATTACGACGATTTCGGCACCCAAGCAGATGATGTCAGGTGCGGGAACCAG 

TGGCTCCCTAACCCCGAACAACAATTGCACGCTGATGAGCAACAAGCTGGGACTGCCCGTGAAGGGACAGAATCTCGATCTGCACTGGTCGCACT 

CGGACGACAATCGTTATCGCGTCCTGGTCCAGAATAAGCGCACACGCAAGGAGTCGCTGGAGCACTCCGCCGATATGATCTACAATGCGGATATT 

GAGAAGCCGTGGGTGTGCCGCAACTGCAATCGCACTTACAAGTGGAAGAACAGTCTTAAGTGTCATTTGAAGAACGAGTGCGGCTTGCCACCACG 

CTACTTCTGCAGCAAGATGTGCGGCTATGCCACCAATGTCCACAGCAATCTGAAGCGTCACCTGAACACCAAGTGTCGTGATCGCGAGAAGGATG 

CCGACGACGAGAAGAAACCCGGATCGGCCAGTGGCAACATGCCGGTGGTTGTGGGCGTTGGCAATGGGACCGCCGTTCCGGTCAGCAGCAGTAAT 

AACAACAACAACGGCGGCGGCAGCAGCACCAGTAGCACCTACACCTTGGTGTTCCAGAACGATAGCGCTTAGGAGAGACTTGGACTTGTACAAAA 

CGCAGGCAGCGGCTAAGATGGAAACGGCTTTGGAAACGAACATTCTGCTAGTGCAAATCTTGGTAAATCCATCGAGTTGAGTTTTATGGGCCATT 

CGGAACTGTGGACCCTTCACAAATTCGTTATTTTCGTTGAACGAAGCCGGTGATGCCGTGGAAACTAGACGTAGTTTGCAATACATTGAACAAAA 

AGACATGCAAAACAAATTATATTGTATTGCTTTAGCAAATTCTCACAACAAATTTAAACCTAATCTTTATAAATATGTGTAAATTTTTTTCCCAG 

AAAAAAAGAAAGAAAAAAAAAAAAACGAAACCAAACCAAACAGAAAATATATAAAACTAGTCATCAGCTAAATTTGTTTTAGGTGTGCTACAAAC 

ACT ACAAGAAACAAAAATAAT AAAAAAAAGAT GGGAAAATTT TGT AAT AAT ATT AT AATTTAAAT AAAAAAAACCT TTAAAC TAT ATGCAT GGC A 
TGAAGATAAAGCAAATGCCCCTCCACAGATACTTAAAGATAACAAAAATAATGATAAACTTTTTGATTTCGTATTATGTTTGCATTCGCAATAAT 
TTTTATAATTTGTTAGAAACATTTTTCTGGTCACTAAGTTATGCTTATATAATAATAATTATTAACTAAAAATGATTATCTAGTCAAAACGAAAC 
GATAATGAGATATAATATTTTCAATTTTAGACAAAAATTAAACCTTATTGATAAAGACAATGTTTATAAACATTAATTCGAAATATCATCAGCGC 
ATGTATGCTTTGAAAATTGTTAGATTTAGTGAAAAAAAAAAAAACACGAACCACGACGAACGGCACTGAAACGCAGAAGTTGTACCATCAAATTA 
CCTTACCTTACCAAATTGTAATGTAAACGAAAAAAAAATGACAAATACCTCTATGAAGTTGCAAGAGAGAATACCATTATATTGATTGAGTACGA 

TTACCACTATGTATTTCCAATAAACTG 
(SEQ ID NO: 1240) 

Exon: 1001.. 1024 
Exon: 1418.. 2827 
Start ATG: 1001 

Transcript No * CT41779 

ATGAAGCTGGCCAGAACGAAGGTGTCCAAAAAATCACTCATAAGCGATGCTAiW^CCACTAACAAAACGTCGACGCCGATCCGACCAAAAGTTTC 
GACGACGACCACATCCACGTCCACGGCGGCGGCTGCAGCAGCAGCAGCCACAATTGCAGCCAAGCAGGCGGCAGCTGCTATTGCCAGCAGCAACA 
TCAACAACAATAACAGCAGTTTGACGCAAACGGTCACTCAAACTGTGACTCGTATCGGGAGCATTGGACGCACAACCATTGCCTGCATCACGCCG 
GCGAACAATGGTAACAAGAGCTCGTCGAGCAACTGCAATGTGGATGCCGCATCGGCCGCTGCACTGGCGGCTGCCGGCGTGGAGCTGGACAGCAT 
CG AT G AC AC CAT G AC C G AGG T GAT T GT AAA GAT C GAG AAT C C AG AG AG T ATGC CG C T C AAC GAC GAT GAAG AC GAT GC CGT T T GT AAT G AGGCT A 
T T GAG GAT GAG AAT AC C T T C G AC TAT G ACC T G AAGCTGG GC AGT C C GT T GT C GTGG AC C T AC GAT GC GGT G AAGAT T GAG AAT GAG GAG T T C GAG 
G AT AGC T AT C T GAT GG AT AAC GAC GAT GAT GAT GAT GAT CT GC T GAC C AC AGCT GC GGC GAC T C AAAAGC AT GC C AAGC AATC G AAT G AG AAGC A 
AATGGCCGGCTCTATGGTGGCTGGAGCTGGTAGTGGAGGTGCCGTCAAGAAGATCGTGCTTAGTGCCCAGCAGCAACAGCAATTGCTCGAACAGC 
AGCAGCATCTGCAGCACTTGCAGCTACAGCCCACCAGTCAGTCGCTGCAGATTAAGTTGCCCGCCATACCAGCCACCATTACGACGATTTCGGCA 
CCCAAGCAGATGATGTCAGGTGCGGGAACCAGTGGCTCCCTAACCCCGAACAACAATTGCACGCTGATGAGCAACAAGCTGGGACTGCCCGTGAA 
GGGACAGAATCTCGATCTGCACTGGTCGCACTCGGACGACAATCGTTATCGCGTCCTGGTCCAGAATAAGCGCACACGCAAGGAGTCGCTGGAGC 
ACTCCGCCGATATGATCTACAATGCGGATATTGAGAAGCCGTGGGTGTGCCGCAACTGCAATCGCACTTACAAGTGGAAGAACAGTCTTAAGTGT 
CATTTGAAGAACGAGTGCGGCTTGCCACCACGCTACTTCTGCAGCAAGATGTGCGGCTATGCCACCAATGTCCACAGCAATCTGAAGCGTCACCT 
GAACACCAAGTGTCGTGATCGCGAGAAGGATGCCGACGACGAGAAGAAACCCGGATCGGCCAGTGGCAACATGCCGGTGGTTGTGGGCGTTGGCA 
ATGGGACCGCCGTTCCGGTCAGCAGCAGTAATAACAACAACAACGGCGGCGGCAGCAGCACCAGTAGCACCTACACCTTGGTGTTCCAGAACGAT 



AGCGCTTAG 
(SEQ ID NO: 1241) 
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Start ATG: 1 



MKLARTKVSKKSLISDAKTTNKTSTPIRPKVSTTTTSTSTAAAAAAAATIAAKQAAAAIASSNIKNNNSSLTQTVTQTVTRIGSIGRTTIACITP 

ANMGNKSSSSNCNVDAASAAALAMGVELDSIDDTMTEVIVKIENPESMPLNDDEDDAVCNEAIEDENTFDYDLKLGSPLSWTYDAVKIENEEFE 

DSYLMDNDDDDDDLLTTAAATQKHAKQSNEKQMAGSMVAGAGSGGAVKKIVLSAQQQQQLLEQQQHLQHLQLQPTSQSLQIKLPAIPATITTISA 

PKQmSGAGTSGSLTPNNNCTLMSNKLGLPVKGQKLDLHWSHSDDNRYRVLVQNKRTRKESLEHSADMIYNADIEKPWVCRNCNRTYKWKN 

HLKNECGLPPRYFCSKMCGYATNVHSNLKRHLNTKCRDREKDADDEKKPGSASGNMPVVVGVGNGTAVPVSSSNNNNNGGGSSTSSTYTLVFQND 

SA* 

(SEQ ID NO: 1242) 



Celera Sequence No. : 142000013384534 

ATGTGGCCGCTCTGTTGACTATGCTGTCCGCCAGTCTCTCCAGGTCGTCCAGGCTCTGCTCGTTGATTTTCCGCTCGAAGTCGGCCATAAACTCA 
TTGGTCATACGCTCAGCTAGGGGCTCCGAATAGTGCTTGCGGAATCCCCGCAGGATGCTGTCCCTTAAACAGGGCTTTATCACCACCAGATTGAG 
TTTGGTGCCCTCCTTGATGTTGGGATATGACGCAATTGTCTGTTCATTGTTAAGCGGTCGCCCCAACAGCAGTAGCTTTTGATTTGTAGCCGATA 
TCTGCAGTTCCGCCTCGATCTGGTGTTTCACCTCCAGAATCGTCGATGTCGGCGCCACCTGTGTGAAATATTACCATATTAGCGACCTCACTTGA 
TTTTGGCAAGGTTCATTCTTACCTCGATGGTGCAGTCCTTGCCCTTCAGCACCTTGATTGTTATCTGCATAGCAATTGCCGTAAAATTTATTAAA 
AAATTAAAAGTCAAACTCGAGAAGTCGAGCGATATAAACAATCTGTCTTTGGCGTTGCCACCTGTATTGCCACCAAGTGTCATTAGGCCAAGCGG 
CCCTGGTTTGTAAAACAAGTGCCGCAATATTTGCTAATATATAATTTACAAAAATATTTCCTTATATTAATATATAGTACATATATATGATTCTT 
GTAAATTGCCAAATGTTATTTAAAATTATTAATATTTTAATGCAGCTAGCACAAATTATGTACAAGCTATCGAAAACCACTCGAAGTTATTATTT 
TACAAAATTAAAGAATAATGTGAGATCAACAATTAAATAACAGTTTGTAAACATTCGGTTTATTATAATTAATAATGATCTTTAAGTCTTTCGCA 
TTTGGCAATAAGACAGTAGAGGAGAGTAAGGTATAAAGATAGTAAATATAAATAAGTAAATAGCATAAGCAAAAGCTCGTTTTAGCTTTTAAACG 
GAGTCGGTAAAATCTTCCTAAAAGTAACTATCGAGGGTCGGCTGGGTCAGGCAACGCCGCCGCACGCCTGCAGCCCTGGTAACTCTATTTTCTAT 
TGAGCACCGACAACGTTGCGTGTATAAGACAGTTTACATAAATTATTATTTACAATTGCACAGAGCGTTGATGTTGTGCGTTCTAAGCGAAAAGG 
TGAACTTGACCCCGGTGCCGATAGACCGCCGAGCTATTGGGTGTGAAATTCGCGAGCGAGCCTTGTGGAATTCGCACGCGAAAATCGAAGCGAAT 
CGCAGCGATCGAAGAAATTGCAGCGCAAATAATTCGTGTAGCCTCGACGACTGAATTTTAACGCGAGAGCGGGGTGTGCGTGTGTGTGTGTGCGC 
TGTGCTGTGATTCTGTGTGCGTGTGCAAGCGTTTTTGTTAGAATTCGCATTTTACGCAAAGAACTAGCTTTTATCAGCAGTGAACTTGATGTTGG 
CGAACCTATATCGCAGCAGGGCCAAATTGAACAAGTTCTTGCAGCTTCTGGCGGTAAATTGGCGAAATAAGACCCATTAAAAACCCCCCTCTGAA 
AATATAACCCACAAATTAACCTCGCCCCTTGACATTCAAAGTTATATTTGTTTCCCCACTTATATATGTGCACATATATATATATAAATATATAT 
ATATTTAATAATAATTGTGTGTTTGAGTCTTGGAAGTGCAACTACAACTATTAATCGTGCCTAGTACACATTTTACTTTTACGCTCTCGACTGTC 
TCCGTCTCTCTCCCTCTCTGTTTCACCCCTTCCTTGCAACAACAAAACTTATGAGTACTTGCTGGGTGCCCCAACAATACTTGTGTTGATGATTG 
CCCATCCTTGCACCTCCAAAAAGAAATGAAAAAAAAATTTGAAAAACACCAACACCACCCAAAAGCGCTCTTCCTTCCCCATTTCTATTGGCATG 
TCTGGCAACCCTGGCAATTGGTCGACGGCCATTGCTTCTCAACTTGTTGCTCGTCTCGCTTGCTCTCCTCTCTCTCTCTACTCTCTTTTCAAGTA 
CGCTCTCTCTGCGCCAGTGTTTTGGCATTTTTGTTACTTATTTTCGTTTCGAGTATAGTTTTATGCGAAATGAACAATGCCAAACAATGAAACGC 
CTGAAGACCTGCATAAAGTGTTGATGCTAACATTTAAATGGGAATCCGAAATTCCACATCACCGACAACACCTGTCCACTTGGAAGTTACAGCAC 
TAATCGGAGGAACGCTGCATGAACGTGTTTTCGGTACCGTTTAGAACCGGATTTTCTATGCTAGAGAATGAGAATATGATCGCGGTTCTGTGGGG 
ACATCTTGTGACAACTTAGGAACTCCGATACGAAGCGGTTTTCCGCCATTAAGGCATTTCCAATTGGAAAGCGCGCAGCGAGAGATTCTTAAAGG 
TTGCTCCGTTTCCCATTGGTAATGTGTTGTTTACAGTTATGGTTAAGTTAATAGCAGCGTGTTTATGTAATATATAGATAATTCGAGTTATGCAC 
GTTCCACTGTACAGTGATCAAAGGGAAAAGAAACAGTATTGGAATTTCCATGAAGTTGCATAACAAATGAGGAATAGTAGTGATGAGAAGATTCG 
TGGTAGATTTCAGTAAACCCTTAATTTACGTTGAGCGCCCCTCGAATGCGAATGTTAATGCGTGTGTGTGTGTGTGTGTGTCTGCTTGAGTGTGT 
TGTTTTGTGGCTTAATTGCACGGTGTGTTGTGATGCTATTGTTATTTTTCCTTTAGTTATTGCTACGTTAATTTTTTTCTTTCGACCCTCCACAG 
TGCAAATATAATTTTCAGATAGCAATCAACTGAGAGCGGTCCAGCCCGGCCTGTTGTATTTGTTTGTTTGTCCGGTTGTTTACGTTTTTTTCTCT 
CGTTCTCGCTGCTTTTCATACGTATTTTATATTTTGTAAAAATAAAAAATCGTTATAAAAAACCAAAAACACACATAGAGGAGAGCACATCCAAA 
GAGCCAAAAAGAGTCAGAACCGATTGCGTATTATGGATAACGAGGCGTTAGATCCAACAATAAAAAGCAGTACGTCGGCAGCGACGCCAACAGCA 
GCAGCATCAGAAACGACAACAACAGCAGCATCTTCAGTTGTGGAAACCACAACAACAATCGCGGCGGCCACAGCCTCAGCGGCAGAATCAAAAAA 
CGAAACAACGGCCACAAACAACAATAGCAACACAAGCGGCAGCATCAGCAGTAGTAGCAGCAACAATATAGTCATACCGGCATCGGCCACTAACG 
GTATCAAAGAGAGTAACAGTAACTTAAGTACAACAACAACAGCAGCAGCAGTAGCGGCAGCAACAACTGTAGAAGGAGTAGCTCCAGCAATAACG 
TCCACAATCGTGGTGACCGGCGGCACTCCTCCATTGAGCAGTTTGGTGAGTAGCTCTCCCCCAAAACACTAGCCCTCCCCACCACACTCCCTCAC 
TCACTCTCTTTCTCTTGCTGTCTTTAAGCCACAGTCTCTCTCTGTTTCTGTTACGATCTCTCCCTGTCTTTCACTTGTCGCCCTATTTGCATGCC 
TACCGTTATTTTGGCGTAACGTAAACCTCACTCGTAATGCTTACACGCACACTGTGCATTTGTGTGTGTGCCTAAGCCTTATCTTATCTAATCAG 
TGCTAATGATGTGTGTTATTGTATCGCTGCTTCACCTTACCCCGCACGATTCAAGTTAGATTGACCAACACCAATTAGATTCACAATGTTGCCAA 
TTGTTAAATGAGTGTATGTCGTTCGGATAACAAATCGAATCGTAATTGTATCGCAAGTGCTTGACAAACCGATATTCACTTTCCCCCTTTTCCTC 
ATCTCCAATTTGACCAATTCGTTAGTCCTTAGAGAGATAAACTTATTAGATTTTAATTTTTTTAATTTTAAAAAATGATTTTTTAATTAATTTCA 
TAAGTTTTTTTATACGAAATATGATGCATTTAATTATTTCTTTAAATAGAGGAAATACTCTTTATTTAACAAACGATATAAATAGCATTTAATGA 
CCTTGTAAGTGGTAACAGTAATCTAGCAATCTTTATTGCTTTTCAATTTTGGTTTACAAATCGTTATCGTTTCAATTATATTACCTTGGGTTCTA 
CTTTCTGATAAGACTTTTGCGATTGAATGAATAGATTGCCCTAAATCCTAGATTTTATACGATTTCCCGAAAGTTTTGTGTCAACAAAAGACGGA 
GTTGATTATACAAATCTTTATACTCACATGCCTGTTATTAACTAGAAGAACTTTTAATGAAAATTTTTTACATTTTTGTCTCTCCAACGACAAAA 
ATATTGAACATTTGCAATAAAACATAAAGTATAAGGCGAGCACTTTGCTTTTTGAAACTGCCAAAAAATTGCCCAATGAAATGCTTTAAAAAATA 
CATAAATGTTAGTGCCTGGAAAATATATACATATATGCCACCACCTGATTCCCTGGTAGATGCCAGCAATATAATCCTATACGTGTATGGAGAGC 
CTACGGAACTCCAGCTTCCTGATCCCAGAGCACATTTCAATGTTCGGCATTACTAAACTCAGACTAAAACACACCCACAGGCAGCTCAACTTAGG 
CGTCAAGGATTCGCACAGTCCAAGGTAAAAGCGATGGCCGGCCAAAATGGCAGCGAGGGACCATCCCGAACTGAACTGCTTTGCTCTGATTTTGC 
AAGTGCTACGTTTTGAGTATGCTTTTTATTTGACCGCATATTGCTTTGCATTCGTTGTTAAAATTATTTTCATTTACTAAGCATAAGCCAAACCG 
TTGACATGGGAAAAGATAATCAGATAAGTGTCATTTATGCCCCTTTGCAGGCTAACAAACTCAAGGACAACACACCACCGTACGATGCACCGCCG 
CCCACGCCGATCAGCAAGGTCCTGAACATCACCGGCACCCCGATCGTTCGCAAGGAGAAGCGCCAGACCAGCGCCCGGTACAATGCCTCCAAGAA 
CTGCGAACTGACGGCCCTCATTCCGCTAAACGAGAGTGAGTAGCCTTGGAGTAGATCCTACTACTTAGTATATTACTTTATAATACATGTACTGC 
CAGCTGTTCAGATCGATATCTCTTAAGCCTGCAGATCGGTTGCATGGGGTAGAACTTGGTCGTAGCCATGTCGTAGCATACATTTTAGGGCAGGG 
TTCCGTTTAGGCGATATAAAATTACCTTTCTTACTGTCTACCAACATCGTCAATTAAGTTTTTATGCAACTATACACTCAAAACAGATTGTATAA 
ATCGTAGTAATTCCCATTAACGATGCATAATTACATATATTATGTCAATTAACTCCGGTAAATTTCCATGATTATAAGTTCAGCTTTAACTTACA 
AGAGGCATTCTCTGTAAAGTGTACCTACGACCTGCAAGCTTGTTGGGAATATTGATTGGCCGGCGCTTGATTAATTTCGCTCAGCTGTTGTAATT 
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TGTTTATTATTTGTTAAAGTTAAAGGTCAAGGGAGTGTCGTCTGATTAGGTGGGTGGCAACACTGACGCACGGCCACTCACACATGTGAACCAAG 

CAGCCGCATCAATTCACACCTCTCTCCACCTCTCTCACTCACCGGAATTTCAACACATGCGCTCTCTCCGCAGCCTCTCATTACACTGGGAGAAA 

GCATCGTTAGAGGATTTCTCAGACCGTCTTAAATTTCATATATACACCGGCATTTCTTTATGACACAAATGTATTCGATTGATTTAAACTTGTGT 

ATACAAATAATTAGAAAAACTGTATCCTGCAGGGAATTAATTTAAAAAAAAGCAAATTCAATATTTTTGAAAATCATATATGTACATATCTGCGC 

GTAATGATTATGAAATTGCTACACTTTTTTTTTCTGTGCAGCATGAATGTCATCGCATTGCGTTATTCACATACCTTTCTGCTAATTGCTTGGGC 

GGTTTCGACTCACAAATCGAATAAACAAATTTCGCTTGTTCAATACTATTTTATTTTATTAGGTTTTATTGTTATTGTGGCTCGCTCGCCCAC 

(SEQ ID NO: 1243) 

Exon: 1001.. 1139 
Exon: 2756. .3370 
Exon: 4801.. 4983 
Start ATG: 2978 

Transcript No. : CT42545 

GCAACGCCGCCGCACGCCTGCAGCCCTGGTAACTCTATTTTCTATTGAGCACCGACAACGTTGCGTGTATAAGACAGTTTACATAAATTATTATT 
TACAATTGCACAGAGCGTTGATGTTGTGCGTTCTAAGCGAAAAGTGCAAATATAATTTTCAGATAGCAATCAACTGAGAGCGGTCCAGCCCGGCC 
TGTTGTATTTGTTTGTTTGTCCGGTTGTTTACGTTTTTTTCTCTCGTTCTCGCTGCTTTTCATACGTATTTTATATTTTGTAAAAATAAAAAATC 
GTTATAAAAAACCAAAAACACACATAGAGGAGAGCACATCCAAAGAGCCAAAAAGAGTCAGAACCGATTGCGTATTATGGATAACGAGGCGTTAG 
ATCCAACAATAAAAAGCAGTACGTCGGCAGCGACGCCAACAGCAGCAGCATCAGAAACGACAACAACAGCAGCATCTTCAGTTGTGGAAACCACA 
AC AACAAT C GC GGC G GC C AC AGCCT C AGCG GC AGAAT C AAAAAAC G AAAC AACGGC C AC AAAC AAC AAT AGC AAC AC AAGC GGCAGC ATC AGC AG 
TAGTAGCAGCAACAATATAGTCATACCGGCATCGGCCACTAACGGTATCAAAGAGAGTAACAGTAACTTAAGTACAACAACAACAGCAGCAGCAG 
TAGCGGCAGCAACAACTGTAGAAGGAGTAGCTCCAGCAATAACGTCCACAATCGTGGTGACCGGCGGCACTCCTCCATTGAGCAGTTTGGCTAAC 
AAACTCAAGGACAACACACCACCGTACGATGCACCGCCGCCCACGCCGATCAGCAAGGTCCTGAACATCACCGGCACCCCGATCGTTCGCAAGGA 
GAAGCGCCAGACCAGCGCCCGGTACAATGCCTCCAAGAACTGCGAACTGACGGCCCTCATTCCGCTAAACGAGAGTGAGTAG 
(SEQ ID NO: 1244) 

Start ATG: 362 

MDNEALDPTIKSSTSAATPTAAASETTTTAASSWETTTTIAAATASAAESKNETTATNNNSNTSGSISSSSSNNIVIPASATNGIKESNSNLST 
TTTAAAVAAATTVEGVAPAITSTIVVTGGTPPLSSLANKLKDNTPPYDAPPPTPISKVLNITGTPIVRKEKRQTSARYNASKNCELTALIPLNES 
E* 

(SEQ ID NO: 1245) 
Name : 



Celera Sequence No. : 142000013384684 

ACCCAGGCCGGAGGCTCCACGCATCCCGCTGCCAGTGCCAATTGCCACGGTTACGGGAAGCCGGACGCTTATCCCACTTTCCGGATACGATTGCC 
TAGCAGATCTGCCGTCCGTACATATTGAACAGACCTTTGAGCTTAACGATGGTAAGTACAAGCCCAAAAATTAGCTAAAATCGCTTCTAATAGTT 
CGTCTTCCCTTTGCAGCCCTAACAGGCGTCTCCTCGGAGAATCGATATGTGGTGCGATCTCCACTGGGTGATGCCATTTTCGCGGCCAACGAGAG 
TTCCACAGAAAAAAATCGACTACTTTGGGGAGCTGGTCGACCCTTTCAGATGCACCTGCTGGATAAAACGCACCAGGAAGCTCTGGTTTTTCGCA 
AAAAACTAGCGATGGGATCCATGTGCTGCCAGGCAAAAAGTCTAGAAATTTGGATACCACCTGGTAATTTGCTGGGTAAAGTGGTGCAGTCGCCC 
ACCTTCATGCAGCCGGAGTTCTTTATCGAAGACGAAAGCACAGGACAACTAACATTCTGTGTAGAGGGTCCTGTCGGTTTAGGATTCTGCTGCTT 
CAGTTTGCCCAAAGATTGTTATTTTAAGGTTGGTAGTAGCTTTAACATTCGGAACAGAATATTCAATTATAGTGTTTTTGTTCCCTTACAGATCC 
ATTCGGGAGGCAATATGAGGGCTTCCATAGACCACAAATGGCTCGCCAGCAAGTCTCAGTACACCACAAATATATACTTTAGTGACGCCAAGCTA 
ACCGCCAAGGAGCGGGCTTTGATACTGGGATCAGCCTTTCTACTGGTACTGTGCTCCTAGTTTATATACATATATCATTTGCCACAATAATGTTT 
TGTATTCGTTTTTCAGGAATATCTATTCTTCCAAACCCGCTTTTGACGTGATTAATCAATAAAAAAAAACATTGTATATTCACATTGATTTCTTT 
ATTTTTTGTCATTTTTTTGTATAAGATTTTATGTTTGTTTGTGTTTTGTTTTTTTTTTTTCTCAGTTTTTTGATATATTTTACAATAATGTAATA 
GATCACTTCATTTTCTTGTTGTTTAACCACATACACACATTCAAATATATAGATATATATCTATGTGTATATATAGATTTTGCTGCAGCGCTTGC 
CATAGCAGTTTGTACAATATTTCATTTATATCTATCAACTTTGTGTAGTTTGTTTATCTAACCTACCGCTTAATGCATATATGACAACATATTGA 
TTATTTACAGTTAGCGTTTTTTTTCATCATTTCGTTTCTCTTCTGTCAAATACAAAATTAAAAGAAACAATTAAATGAGTCTCGTGGTCTCGTTG 
TTTAAAGAATTAAAACAAAACTAGGGGCGCAAGTCATTTCTCATTCACCGTTAATAGTTGCGCTTCAAAACTTGGTGTGATAACAATTTGAAATA 
AGGAAGAACTACGTGATTTTCAAAGGGCAAAACTTGGATTTGGACATAGAAAACCAATATACTTAGTGTGCGGGTGTGTTCTGGTTCGGTTTGGA 
GGTTCATTTTCGGGGCTAGGCCTTAAAAATAACACACTTCTCTCGCTCTCGGATCTTTACATTTGTAGATAGACTCACAGCACTTCAGGAGTTTT 
TGCATTTATGCATTTCGAAAACAAATTATGGTTAAGTGTATACAATATAGATAGATTTATAAGTGTGCTCGTTGGTTGGCTTATAACTAGGGCTA 
GCTGATGACGATTCCTTAACCTAAATACTTAAGAGGCAGATCGAAACGATCTGTGTGGGGTGACTGCCTTTTAAAACCGTAAAAGGCTTTGTATA 
TGATTATTTTAGCGCAATTCGCCTAGAGTGTTTACACTGGAATAGCTGACTCGCTTATCTGGCTCCCCTATCGTTTGATTACTACTCGCGATTAG 
CCCAATACTAGTTGGGCCCATCATCGAAGAGAGGTGGGTCTTATAGGCTAGCTCTAGAGTAAGTTGTGTGTGTGCCCATATGCGATATGCTTAGA 
CATTTGATATGCGCTCGTTGGAGGTGGTTTGGTCATGGCGGCCCTCGATCCTGGAATTGTAGATGCCGTACTGACTGCTGTCCAGGTGGTAGACC 
GCGGGATACTGATAACCTCAGCTGGGCTGCAGCATGTACATGTACTGTCAGTTGCTTGCCGCTGCTGAAGCAGCCACGGGAACGGCCACGGGAAC 
AGGCAGCTCCTGGCCAGAACGAGCCCGCTTCTCGCCACTGCTTTCCGGTGGATTATCGCTCTCCTCGCCCGTACTGCTGAAACTGCCCGGCAAAC 
GACGCTTCAGGCTATGGTTTAGGCCATTGGCCACCGTTTCATCCTCGGACATCGAGCTGGCTCGCTTGGGTGTGTTGCCAGCACTGTCGTTGTCC 
AGCGAAGCTATAGCCGAGGCAATTGCGGCACTAAAGCCATCCTCGGACGCACTATCCTGCGAGACGGGACCTGCTGCATCCTGGGCCAGCTGCAT 
CATGCCAGAAGGTGTGGTATAGCTACTGAGCTTACCAGGATCCTTCTCCTGCAGTTTCTGGGCCAGCAACTTGAGTGATATCAGCGAATTTTCAC 
AGTCGCTTGTATCGATTTCAGTGAGACTAATTCCGGCACGTTCAATTTCAGCATAGGTCCCGAAGGGCAACTTCTGCAGACACTCTAGCTTCTGA 
ACATGCGTCTTGGAGCGCAAATGCTTGGCCAAATGCCCATGGATGCGGAAGGCGATCTTGCAGTCGGTGCACTCGAAGGGTCGCGGATTTGAGGT 
GGTTGGAACACCAAAAGTCGACGTCATCATAACCTTGTTTTTGTGCGCTTCCGATCGCTCGTGCTTCTGCAGATGTCCTTGCGTGCGGAAGGATA 
TGCTGCATGGCTCGCACTTAAACTTTCGCTCCATGTAATGAATATTCATGTGCAGGGTCAGCTGATGTTGCCGCTGAAACGTCTTGGAGCACATG 
GTGCAGGTGGGACGTGCATCACCTGCAATGGGAGCTGGCGGCACTCCTCTGCCATACGATACATGCTGGAGATCGCCGCTCTTGCTAGTGGGAGT 
AGAGCTTTTGAAGCCATCTTCACCCACAATCACAGTACGCTGCGCGGGCTGAGTTGAGTTGGCAGTGGATGAGGGTGCCACAGGTGCAGCTGTTG 
TTGTGCTTACTGAAGTAGCAGTAACAGACGATGTTGGTGGTGCAGTCGGATTTCCAAGTACTCCAGTAAATTCATTGACATCCGGTTGCTGCACT 
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GCAGCAGGCAGATTGGCATTGTGTGGCTCTGCGATATTAGGCTCCTCGTTTTCTTCCTGGGCAATCTGAGCAAGAATAACCGGGGCCTGTTCAGG 
TGCACTCCGAGGTGACTCAATGGCCTCTGGGGTCTTTACTTCGTGCACTTCGATGCTAGGTACCTCATTTACGCTCATAACGGGCCTGCTGTTGG 
CGGTAAAGAGAGTGCTACTACTGCTCATTACAGGAGACGCCGCTGTGTATAATGTGTTGATCAGATAGCTGCGGCTAAACTGACTCTGCTTTATC 
TTGGCTTGATGCAGCGCATGCTCCTTAATGTAGGTCTGTAGTTGAACATCCTCGCCCTCGGCCTCGTGCTTGGCCTGCTCTTCCATTTCGGCAGA 
TATTCGCACTTTGTCCGAAACCGAGATTAGAGTCTTCATGAAACCGCTTTCGTTGCCGGAGGTTCCAAAGATAACATCGTGCATCTGTTGGGCCT 
GCGATTTGGGAAGATCATGAGCGGGCACCGAAACAGGCGAGGGGGATATTAATAAGATGGAGCCGCGTGGCTTAGTGAGATCCATGGGAGCACTC 
TCCTCGTTGGCCACACTCCGTTTCGGTTTGGACTCCTCCGGATTGGAGTAATACTGTTCCTGCTTTTGGTAGTCGAAAGGCGGCTTCGGCGATGT 
AAAGGAGCACACTAATCTTTTAGGTTGCGAACCCGAAGAGCCAATGCTGTTTGCAGGACTGGCCGCTGGTAGGGGTGTATCCTGCAACTGCGGAT 
AGGGCGATACGCTCTGCGGTATGGGCGGCGTCATTGATAACGAGAGTAGGCCTCTGGCTGCTTCGTGCTCTTTCTGTCGCGACTTGGACTCATCC 
GTATCTGTGAAGAGATTATAATGAATCAGCTATTTGAACATATAAGTGTATATGATTGCTTTCTTACCACTACTTTCAGATTCATTATCACTGTA 
GTCATCGGAATCCGATTCCCCAGCCATCGATGATGTGCGTCCTCCTGCCGAAGTGGATGACTGCTGGTCAAAGTCCATATCTACATCCAAGAACT 
CGCTGTCTGGCGGCATTGGTCCCGGATTTATTCCCAGTTCAATGCACTTCTTAAAGTGAGTTTTCGATTGCATATGCTTGGTCAGATTTCCCTTG 
GTCTTAAAACTGTGGAATGGAACAAAGTTTATAATAAGTTATCTCTTAAGTACGGAAAAAATGCAATGACTCACCTGAAGTTGCAATGGCTGCAT 
GTGAATGGCCTCACGTCCGTGTGAGTGCGAATGTGCTTCTTGAGCATCGACGGCTTCTTGCAACGAATTCCGCACTCGGAGCAAACATATCTTCC 
CCTTCCGCGGCCACGGATGTACGTGTAATCCTCATGAGACTCGTAGCCACCCACCAGGGGCTGCCCGGATGGACTTGTCTCCAACTTTTTGGCCT 
GTGCCTCCTCATTCGCCTTGGCCGCCACATTTGCCTCACTGAGAGCGGCAATCGTGACAGTGGGTGGGGTCTTTAGGGGTCCTTGATAGAAAGCA 
CTGCTAGTGGAGGACGTCGAGGGGGAGGATACGATTTGCTGGGAAGCGACTAAGTTGTAGCTAAGTTTTCCCGATCCAGCCATGGCGGTTGAGCT 
GCCATTTCCTAACATCCTTTGACGTGAGTCGTAGAGTGCCATAACTTGCTTTAGTTTGAATCCAAGTGGATGTGGCTGGTTCTCCTCGCAAACCT 
GCCACACACTATACATCGATAGTTTGTGCATTCCTGCCACATACGAAGGCTGTGGACAGTTCACCGTACAGTAAAACATCTTTGTGCTGCATTTG 
AGGCCCAGATATGTGTAGGCGTGTCCATTGAGATACAGTTGCACACAGGATTTGGACGGCCTGGGGGTCTCCGGCGAGATCAGCGGCAATGTATT 
GGCTGTCGGTGTAATGCCATGATGGCGTTTAGGAGTAAAAGTACCGGGTTTAAGTGGCAAACTAGTAGGCCGCAGAAACTTGGACTTCGCCGATG 
CTGAGGCACTGGCTACGGGACTGGTCTCTTCTGGAGCGTTTCCGTTGGGAGTATCAACATGTAGCGGTGTTAGGGGCAACTCTTGCTGTTTCCCA 
GAAATCACATTTTCCAAATTAAAGTGTCGAACTTCAGGTTCAGTGGTCTTCACATTATCAGCCATGCGTAGAAAGTTAAATGGTGCTTTCTTCGG 
TGCCTCCTTGCTGTTCCACTGCAGCGCTTGCCGCTGGGCAGGAGTTGGTGGCTGTCGACTGAACTCGCTTGGCACATTCTGAACTCCACCGAATG 
GCGATAGGGCCTTCAGTTCACCGGATCCTCCTCCCAACAGCAGCGGACTGGGTTGCTTTCGATTTGGACTGCGACCTCGACTAGAGGGAATGGGT 
AGCTGCAGCTGTTGTGGCGGAGGCGGCAGTAGCGGCATTTGTGGTGTCAGACTACCGGGTCCTGGCATTCCAGGCACATGAGGTATTGGTGTTAC 

T G GAGT GGT T T T AT CTC C AC C GC T C AT CGGT GGC AGC GT T AGC GG AT T G AAGGC AGT T AT T G AGT T AAT GGGC GGAAAT T GAAAGT GC T GC GG AC 

CTGAGCTAACGCCGAGAAGTCCACTGCCACTGACCAGTGCCATCAGCTTGCGCTCCTTACTGGATGATCCTGTTTCGTTGTTGACTTTGACCGAA 

ACATCCTGGAAGCTTCCGCCGGACAGCAACGGTGTTCTAATCATCTTGGGGCTGTTGTCCAAGTCTGGGCACTCCGAAATGGGAATCATGCTGCC 

TCCCGAAGAGGTGAATCTGCGCAGTCGCTCCTCCTTGGAGGACTGCAATTCCGGCACACTATTACTGCTATTCTGATTATTAAAGGAAAAGTTGA 

AACTTGTCGGTTGCTGAGAGGAGGATGCTACAACCAAATCGCTAGTCTTGATGCGTTTGGCAGGCTGAGCTTCAATCACAAATTGCCGATCCTCG 

TGCATTTTGGTTATGACCACAGAATCCTGGGCCTTGCGTGGTGTACTTTGCTGAGCATAGTCCACTAAGGGAGTTTTTCCCAACAATGGACCAGG 

CGATGGTAGAGGCGCATCCACAAAGCGCAGTGCTTCAATCTGGGGGGCAGGTGCTGATGATGGAGCAGTGACCGTAGCAACAGCAGGAGCTGGGG 

CAGAAGCGGTTACAGTGGGAGCTGTTACAGTTGATGTCCTTGCTGGCGTCTGTGCTGCACTTAGGCGACTTAATACCAAACTGCTCCTTTTGGTC 

TTCAACTGCGACCAGGCCAACTTGGCCAGAGATAAGGGATTCTTCGAGTTGGCCATGTGGCTCTTCAGCTCCGGTAGACTCAGCGAGATCGAGTT 

GGAGCGGAATGGCGAAGATGCTGGACTCATTGGGGCACTTGAGCTCGCATCCTGACAGTAAGTGCTGTTGTGCAGCTTCAAATCGTCCGCACTGC 

GGAATTCGCTTGCACAGATAGGACACGAATAAACAGCATCATCGCCTTCGCCAATCGGCACGGCCAATCCTCGAGCATTAAGCAGTAGGTTCTTA 

ATTATCGAGTTATTAACCTGAGCAGGTGGTGGAGCAGTTTTGGAGCTACTAGTCGGATTGGTAGAAGTCGTTGGAGTCGGTGCCAGAGAACTGGT 

GGACAAGGAACTGACGGGCACAGTGACCAGAGTTGGTTTAGGCTTCGCTGATAGATTCAGAGGCATAACAGGCACTGCCGTCGTCGGTTGCTCAA 

TTTTCGTCACTGGTGTTGCAGGATCCTGGCGATATTGCATTGTTTGCATGGGCAACGGGGACGGCGGTGGAGGAAGTTGCTGCTGTTGCTTCACC 

GGCGTAGTCATCACTACTCCAGGCTTATAGGCATTCCTTTTAACCACCCCATTTGGTGGTTCGAGCTGTTGCTGTTGGCATTCCTGTTGCTGTTG 

GATGTACGTATAGTAGTTCTGTTGCTCGAGCTGCTGCTGCTGCTGCTGGGTTATCTGCTGCTGTTGCTGTTGCTGAAGCTGATGTTGCTGTTTCT 

GGAAGATGGCCTGTGCCAAATTCACTTTAGTCTCGCGCTCCGGCATCTGTGCGTTGGTGTTGTTTTGACTATTGTTTGCATGCAATGCCGACGCG 

TTGCTCCCGTGCTGCGAAGCATTATTGCTGTTGGCATTGTTGAAACTGCGGGAACGACTAAGCTGTTTGGGGTACTTCTTCTGCAACAATATCTC 

CTTGTTTTCAACTATGGCTTCATTTTGAGAGATCAACTTGGAAATGTGTTCCTGCACTTTCGCCACATTCTGAAAAGGATAACACAAATTTTAGT 

TTAGTTTTTCTGTAAAAATAAACTTTTAGTGCAACACTCACTACTTGCTTGCCACCACTTGAGCTGGCAAAAGTGCTGAGATTGGCGGCGGGCGT 

GGGAGTGGCTGGTGCTGTTGCTACACTGGAAATCTGGCTATTGGCACGCACCAACTGTAAACAATAAAGGTATATAGATTATAATTAATCTCCTC 

GTTCAAAAACAATCGTTCTTAACCAACCTGTTGCGGCGGGAGACTCGGAGGTTGTTGCATTGGTGGTGTCGCCTTGACCAACGGTGGGTGTTGCT 

GCTGCTGTTGTTGCTGCTGCTGTGCCAGGTGGATTTGGAATTGCTGAATGGCTAACGCATGCTGGTGAATGGCAGCCGCTGTATCTTGGTTGTAG 

TATGAAGGTTGCCCCAGGTAATACACCCCGCCAGGTGGCATGGCAGCTAGCAATGATTGCTGGGCCAAAAGGGATTGCTGCTGCTGCAATTGCAA 

CTGGATCTGATGCGAGTTAATGTGGTGCTTCATCTTCATTTGCGTGCTTGGCGAAAGCGTGGGATGCGCCAGTGACGGCTGCTGAACCAGAGGAA 

GCTGAATGGACAGCTTTTGCTGCGCCAATTGGTGTTGCTGTTGCTGTTGCTGTATCTGCAGTTGCTGCTGCTGCTGTTGCAGCTCCTTACTGCTA 

CATGAGTACAACGAGGCATTGTGGAACTTGGGCTTATATGGCTTATAGTCGATCGACTGGCGATGCTGGGCAGTGGCCGAGTGGTGATTGTCCGC 

TGTGGTGGGTGGCAATGTTCCTGCTGCCGGCGAGCCCAGAGGCAATTGCTGCATGTAAGTGGGCAGCGGAGGCTGTTGAATGTATGCGCTCTTGG 

CCGCAGACAGTGTGGAAGGTGAGGGTGTGGGCGAATTGATTGGCTCCTCGTAAGGCGAACCAGCGCGACTCGGCTGCAAAAACGTAAGGAGTTAC 

ACATGTTTGCGATAACAATCACAGTTATATTTGACTTACCAACTCTGAACTACTGTTGCTCAGCTCTGCATCCTGATCGGATAAACCATCGTCCG 

CATCGGCTGGCACTTCCAGGCCCCGCGCTCTTGCCGCATGCGATCTCGAGCGGCAATGTTTGTACAAATTACTCTTCGTCTTAAACGCAATGCCG 

CACGTGTCGCACGGATACGGCCGTTCGTTCGTGTGGGCCCGTATGTGCTTCTCCAGCACCGAGGGCTTGGCACAGATCAAGTTACAGTACTGACA 

AACGTATCGTCCGGACTTCTCGCATGCAGGATTTGGCGCTGCTGCTCCGCCTGGAATACCAGGACTCACTGTTGTTGGCTGCGTCAATTGCTGAT 

GCTGGGCAAATGAAGCCGGTGACAGTGTGGCATTTGCTGCATGGTTACTGTTGCTGTTGCTGGTGCTGTTCGTGTTACTGTTGCTGTTGGCTAGA 

GGCGTTCGCGTTCTTCCCGCCGCTTTGTAGCCACTATTGCTGATGATGCTCTCCTCATGACCAATGGCATTGACATGATTGCTATTTACAAAATT 

GGCACTGAATTCATGTATTGCTACTGAAGGCGGCGGCGTCTGTTGCTGCTGCTGCTGCTGCTGCACTTGCTGGTGGCTTTGTTGTTGGGTCATCA 

GATGGTGACCGTTGGCCAGCGGCGTTGTTGGTGGCGGTGAAAGCGAACTGTTTGAGGAGAGCGATGTGGTGCGAACGGCATCGCCACTTGCTTGC 

AGTGCTCCGCCGCCATTCGTCGCACTCCAGCTGTCCACCTCGGTGGTGCTGGCCAAACGTTTGAACTTCTTGTGCAGATAGCGTGAAATATTTGA 

CCGATCCCCCAACGGAGCCTCATCGGTGGCGGGTGTGGCATTTACGGTGGTCTGCTGGCTGTTGATGATGCTCGAGTTGCTCATCGTGTTGGCCA 

CATTGGGAGCCACTGGCTCCTGGTTGCTGCTGACTGCTGCCATCTGTTGTAGCGTCATTTTGCTATCACTTAGACTTGCTTTATCTCTACTCTTA 

AATTCATAATTATTCTCACTAATACTATTATCACTGTGGTGGTTGTTGTTGTTATTATTACTTGTGTTGTTATTGTTATTATTATTATTGTTGTT 

GCTGATGATGTGGTTGCTGTCAACAGTCGGCAGCGGCACCTCGCCTAGCTTATCTCGTGTGCTTGTGCTGGGTTTGCTGCTGCTGTTGCTGTTGC 

CACTAGCAGCATCTGTTGTTGCTGCTGCTGTTGCTGCCGCTTTAGAAGCTGTTAATGTTGCATCTGTTGCTGTTGCTGTTGCAGATGCTGATGCT 

GATGCTGTTGCCTCTGTTGCTGCTGCCAATGCAATTGCTGCAGCGATGTTGGCCTTGTTGGTTGATCTCTGAGTGACTATGGTTGCTGTTGCAGT 

TTCCCGATATGTCCGCTTCTTGGTTGGCGTTGCTGCTGTTGTTACTGTTGTGGTTGCTGCTGCTGCTGCTGCTGCTGCCGCTGCTGCTGTGGCAC 
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GTCGCGTATTGTTCACTTTATTGGTCTTGTACTGTTTGCCTGCGGTTTTGTAATGCTTGGCCGTTGTGGTGGTTGCATTTTCCATATTATTATCG 
AATTTATTATCCTGTGACGTATTTGCAGTGGGCTCATCTATACCGAAACGAGAAGATAGAGGGAGAGGGAGAGAGAAAACCGAAATTAATTCAAT 
GTGTTTCGCATAATTTAATTTAAATAGCAATGCTAATTGATACATTCATGATAAGCCCATTTATATACACTGATATCCCTGCTAGGTAATCCAGT 
AGAAATCAACCACATCTGACTTTGTGGATTTGCAAATCTTCGGACGATTAAATATTTAATATTAGTTTCTAATACACCCTTGTCCCCTATAAGTC 
CTCACATGATTAGTGAGAGGTTTGGTTTTATCTTTTATATGTTAATTGCGCTGTTATGTTACTGTTACTGCATTGTATTGATTCATCGCTTCTAA 
ATAAATAAATATATAAAAAAAAAAAACCCTTGTCCCCTGGCCAATAAACATCTGCGGTTTGCATTTGGGTTTCCCAAGACGAGTTCCCTGCCCAT 
ATAGTACACTTTTATTTCGAGCCCCATTATCAGCACTAATTCGGGCTCAATCTTATCAACGCGTAGCCGAGAATGCCAGATACGGAGCACGCTAA 
TTCTCTCCATGGCGATCAATCCGTCAGGTGAATCACAAGCTGGCGAATTCCCGTGCTCTATTGAAAATCAATTTCCCACTCTGTGAACGTGTCAA 
CATCGATAAAGCACAGTGGCGAAAATATACGAACACACACAAATGTATATTCGCCTTGTTTCTTTTTATCAACTTTCTATTTTGAATTCCCAACT 
TAGCTGGGGTTCAGGCCCCACCTGCCGAAAACCAGACATGCGACCAAAAGATACAAAAATCCAAGAAAAGTGACAAGTGAAGTGTAATGACTGGT 
GTCGCTGGCATTTCGTTTGTTGTTGGGGGCTTCCTTATCACCGCTCGTGCACACATGTGTGACAACAGTGCGTATCAGTACTGTAAAGCTGCTGG 
CAGGTGTGTGATTAGCAATGGGATTATAAAGCGTTGCCAAATATGAAGAGTGAGTATTTGAGAACTCCATTTGCTCGCATTAATTAAGGCTTCAG 
TTTGAAAAACAATTTAGGCAGTCTTTGGTAAATTCTTTGTTTTGTGCATGCCAAACGAAACGAAAAGCTGCAAAAATCTTTTAAAAATTTTTCAC 
TTTTGCTGCTATTTATGTGAGCCCCACTGTCTGCATAGATGGTATTATATGACTCTTGGCCGGCTGGGCTGTCGGGCGATTAGTAAGCCGGGGCA 
GTTCTCAAATCACAATCGCACACAGACTGAACACAAAAAACGCTCACTGGCAACCGAGCTGGGTCGCGTTAATTGACTCGACTGCGGCGACTGTT 
TAGCATTCAGCGTTATTATTATACACATTATACGGCATTATGCGATCGCAGCGACTCTCGTCGCGTACCAATGTTTACTCAATTTGTTATCAGCA 
CGGCTGGCTAATAAGGCTTTCAAACCGTGCGACCGCAAAAAAAAAAATAAATAAATAAAAAAAAAATGAAAAAAATTGCCCAAAAGTTCCATTGA 
TAATGCCAAGCTCTACATGCACAGGCGGTAAGGTAAGTATTTTCGGTACGTGTGTTTGCCAGATTCATTATCAGTTATCAGTGGGGACTATCGTG 
CGCGCCTGAGTCAGAAACAGAGCACCCGATAAGCGAGCCCCGTTGTAAGCCACCAAGTGATTGCCAATTTGCCGGAGTTCAGACGCGTTGACGTC 
AGTTATCCGATACGTAGTACTATTCGTTCGCAATCGTAAGGGGCTATATCAGATTGTCAACTCATTGGGCAATTAAGTGGCATTCGCTGGCGATT 
AGCTGTTCGAGGATACAAATCCTAATTACATAATCATGTGTAGTAAATATGTGTATGCACTCGACTTATTAATAAAACGAGTGAGTGGGTCAATT 
ACAAGGCGCCAACTGCGCCACAATTGAAATGCATACATATGTGTGGCGATTACACGGAATGTGTTGTTTCTTGCTAGTGAAAACAAAAGCGGAGC 
AG AC AGT G T GT GTG AGC AT T T T G AC T AAAT AAGT AG AAT AAAGT T T AAT AAC AC AACGC C C AGAATC T GT CT T T C G AAAT AAT AT T T AC AT T T GT 
GTTCATTCGCTGTGCGAGAAACTATTGAAGTTCAACTGTACATAATGACAAAGTGGGCTATACATTCATTATACAGCAGAAACTGGCGAAACTAA 
T GC CAT C AT CT AC AGTC AC GAC AGT AT AAT AAAGT C AAGC GGG AAT AAAG AAG AT AAGC T AAT C AG AAGT AGGC GC T T AT AAAAAC GC CAG C AAT 
T AAAAC C AGAAT AC T CAT AT GC AAG AT ACCC C GC C ACC C AAT GTG C AAAC AAT T AGC GC AAAGT GC AC AGT G ACGT T T AAAG AT C C C AAGT T C GT 
TTATCCACCATTTAGTTTATTGTTAGCCACGTAAATCCAATTGTCAGAGTCATCAATATCAATACGACGTTTTCATTATCACGAACTTTTGTTTA 

ACG AT CGGGGGTCT T CT AAT T AC G CC T GT T T GCC AG C AT T T T G AGG C GT GAAAAC GGC AAG T G AAC GC GAAAC AC AT G GG AACGT AT T T C AAT AT 
ATGTGAACAAACTGGAACTGCATTTGCATTGGCATTTTCATTTGCATGCACATTTTTATTGAGCCAAGCGAAATACAATTGCCCCTGTTCAAAGT 
GCACATAAATAAAGCGCCACGGCAAATAATCGAATAGGGTTCGGTACACTCTTACAAATATCTAAAAATTGGTTGCAATTAGTAGTTCTTCTAAC 
CAGTAAATAAACACACAATCTTAGATTTTAAACACCAACTTCCGCGCCAAACTTCATGAATATCTAAGATACATAAATAAATCACTTCTTATTAA 
ATCATATTTACATTTGTTCAGAGAGTTTGTTTTTCGAAGCATATGGAAATGCTGGCATTGTTTCTGTGTAGAAAAGCTTGATAAGTTTGCAGCTC 
ATATTGAGCACACGCTATCTCGCCTACGTCACAAGTGCATTTCGCGTTTGACTGAGTGACTGCCTGAATTCGAAATTATAATAAAACCAAAAAAA 
AAACGAGAATACATATAAAAACACCCACACCAGCGCAGCGAAATAATGAAAAATAAATAACAAAATCAACGAGGTGCGGCGGCAACGGCGCCACA 
AAGTCCACACATCTGTCTGACGGTCAGTCTGTCAGTCCGTCACTCTGTCAATCGGTCATTCGCTCAATCCGCGCGATGACGACTACGTCATCGCA 
GAGCAAAAAGCACTAAGCACTCGCCCACTTGGCAATTGTTTATAATTTATTAGTAAGAAGCGTTTTACAGCGGTTGAGCAGAACGAACGAAAAAT 
GCATATGTAATAGTATAATACGTCAAAAAAAATTCGCGGAAAAGATTCCGAAATTCCAGCGAACGGCCAAAATGCGAAATGCCCGACAAAAATGA 

C GAAAAC GGC AC AAAC G GT AAAG AG AAAC AT AC C C AAG T T T AAT T GT G ACC AC AC GC AT TTTTCCGCCT G T AC C GC T CAT TTGCATAGTTGGTGT 
TGTTGTTCGTTTGGTTTGAAGCGCAACAATTGGAAGTGCACGGCGAGAAAAATCTGGCGGCGCGAATAGCAAGAAATCGAAAAGGCGACAATCGA 
CAATCGACGACAACCGACGCGCCT GAAAC ATTTAAGGCAACTAGACGAGGCG AC CAGCGAAATTC ATT CAGAGTGTGCTCAGCGTGTGTGCTTGT 
GTGTGCATGTGTGTGGTGTGTGCGAAAAGAAAACGAAGAAGCGACCGAACGAAGC AAAAAC AACAAGTAAGAAGCCCAAAGTCGC ACT AGT CGAC 
TACAATGTACCCTTGAGTGGCAGTAAAAGTTATTCTTACTACAAATTACGTATAATTAGTAGGAATAGGCTAAACGTTTTGGCATTCTGCGATTT 
TGAGTTGAGCATAACGAAAATATCAGTTCAGAAAATAAGTATATATTAGGAAAGTTGAAGTCTTCATGCCCGAAAAGGTTTTATAGGGCAGTCTA 
TATCCCAGAATATTCGGTAGTAGCTCCAATAGATCATAAGCTTATTAATTTTTCTCATATTGAAAATACATCTGTATTCAAATAAAGATAGGTGG 
AAAGATTTCTTGCTATAGCGTTTTTTGGTAAATCTATTAATTGGTATAATATTAAGAAGTTGTTTTGTAATATCTTTACTTATTTTAAAAATGCC 
ATATACCCATGTTTTCATTCAGTTACTGAGTGTTAAAGAGGCGCGACGTCAGCGCAGGCAGCAAAAAAAGTACAATAATCAAATAATTGCGGCAG 
CTTCGTCGTCGCCCCCCGCACACTCACACACAAACACACACATGTGCTCTCAATCTCCTCTCCACTCATTCGCAATTGTCAATCAAATGTAAGCA 
CAGGGTCGCCACTTCTTTTTGTTCAGCTTTTGATTTTGCGGCAGCCACTGAGAAATGATCAAATCAAGCAAAAATTAAGACTCAAAAAAAAGTAT 
AAAAATTAAACAACAACAGGTGACGCGAAGTACGAAGCGAAATGAAACGAAACGATTCGAAATGAAAGACTGTCGCCAGCCAGTCAGACGTTGCT 
GTTGTTGCTGCAGTTGTTGTTATTTCCCCCTATTGTTGATTACGTACCCCATACACACACACTCACACACAAGTGCGCTCCGTAAAACACGCGCA 
CGCACACCAACAAACGACCCCATCGGCGACCACATTCCCCCGCACAAATATGCGAAATTTTGCTGACTAAGGAAGGTCTGGGGAGCACCAGGCAG 
AA AAAAAC AGC AAAAAGCAAAAAG AGGC TAAAAAGC AC AAC GCG AT AACAACAAAAGC AT TT AAAAC GT AG AAGC AC AA 

AAGTGATAATAAAATAATAATTGCTACAGCGCACACCGATCGCGCGCTCGAGAGAGAGAGAGCAGGCCAGAGTTGCCATATTAGCTGCGAGAGAG 
AAACCCCATCAGCAACCCCCACTCCATCCCCAATCTACTAATTCAGCACGATCTGCCCAACTGTGTATCACCTTAACTTGTTTATAGCTGACCTT 
TTTTTTTTGTAAGTCAGTTAGGACTATCGTAAGAGATATAATTTAGAGCATATAATGTCATGCTTTAAGAAGGAAGTCTGGTTTTGTACAATTCG 
AACGCTGAATATACACACATTAACAGTAAGCTTTTATTTACTAGACGGCATCGCTTTAATTATCTACAGGTATTCTTAGCTTTTTTTAGATCTGG 
CTACTTTTCCATGGCTGCATCACTGTCTGCCATGTGGCGACGAAGCCCCACAGCCAGAGCCAGTGCTCCCCCGCTCCCCGCTCCATACCTCTAGC 
GGTTCGCTCGCGAACATGACTAAGCCAAACAACTTGACAACACTGTCGGGAATTGTTGCTGCTTTTGAGTCAATTCATAAGAAACGAGGAAACCC 
GCCGCCATATCGTTGTTGCTGCCGTATTTATTTACGTACATATGCCTGTGTATATATGCATGTGAGATTGTTGGTGTGCTGGCATGAGCATGAAT 
CAAGTTAAAAATTTTAATAATTTTGCCCGATGTTGTGGTTGAAAACGAGTTTAGCTTGTTGAGCTTGAGTGTGTGTGTATGTATTTCAGTGCGGC 
TACTTGTTTTCTTCAGTCCTCTTCTTCTTCTTCCTAGTCGTCGTCTTCTCTTCTTCTTGCTCCAGCTTGGACACTGCGATAATATTTTACACTCA 
TGCACACATACATTTACACTACGACGAACAGAAGAGCTTGATAAGCGCTTTGTGCGCCATCTCCCTCTTTGGTAATTGAAAGGCAAGGGGAAAGG 
GTAGGCGGCTGTGACGTACCCGTGTCTCACTGTTATTGTTGTTGCCCACTCAGCCGTGCTGAAGTGTTATAGGAAATCAAATTAAGGCGATTAAA 
GAGCGAGTGCATGCGCTCTCGAACGAAAAAAAAGAGAGAGAGAGTGACGATCGCTCTTGACGGAAGATAGAAGATGACTACTTAAGTGGGGCATT 
CGTGAAGATTGATACCCTACGATGATAATTACTGTTTGCCTACAAAAAGCTTTTTTACTTTGATGTCTTGTCAACTTTAACATACAGATGGAATA 
GTTATGCTTTATCTTTTTGACCAAAACATACTTTGAAACAAAAAGGAAAGTTTATAAGATGTGTTTCTTTTTAAATTTTTTCCTTTTAAATTGCA 
CACAAACGACGTACATATGAGAAGCAAATGTTTAATATTAAACTGTTTAATGAATTTAACTATTTATGACCCATAAATAGTCAACCCATAGTGTC 
CAAGAACTGATAAGAACAAAGACATCTTGCCTTTGAGCAGAACTTTGCCATTTGCTGATAAGTTTGAAAGCGATGCGAGCCAAGTCGGTATACAG 
TTTCATTTATTTATTATTTATTTAAAATGGGTTTCCAATGCAGAATATTGAAATAAACAAGCACAACGCCAAACGGTTGATATATTATGGGGAAT 
TGCGAAAACAGTTCTCGAAAAGTGCTCTGACGTCGATCACTTGACTAATCTGGCGTTCTTTTCGAGAAATAAATTTCGATTACGGAGGCTTATCA 
ATCGACGGGCCGTGGACTCTTTTGTGGAAAGTGAAAGGTCAAGGTACGCTTACGGAAATTGGTAAATAGATAAGACATTTCGATGGAATATTTGT 
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TGGGTCTATAAGCCTCTTTGCAGGAGATCTATTTCGTTTATCTATCGTCGAATGAGTAAGCAATATATTAAGCGATTTGGCGTACACAACCAAAT 
TCTCGGCACATTATACACTTTTCACATTAGCACGGAAATTCCCAAATTTTCTTTGACGTCAGCAGAGAAAATTTGTAAAATTTCTCGAGAGTAAA 
AATCATGTTCTTTTCGTACTGAAATGTTTTCCGATTGCCAAGGTCTCTGGGTTTACATGTATGTAGGTGAAAATCTGTATAAACAAAATTGATTT 
TTGGCACAGCAAAACAAAACAGTTTAAACAATATAATTTAGCACTTTTTAGACATTCCTCTATAGATTTATGGACTATTAAAATATATCTAATTT 
AATAAAGGTGCCTAAAGGAAAAGCTTAAAAACTTAGCAAAAACGAATAAATTTTCCAGATTCATAATCATATATTTTCGAATATTATTATGATTA 
TTGCTTGGGGGCACGATAATCGTAAATATGATATGAGTTCAAAGTAAATATCGAATTATTTCCTTCTCCGTATTGCGAAATTGGTAAGAGAACCC 
CTTTAAATATAGGAATTCGTGGCCAAAAGTGTTAAATAATTGCCAGGCCAACAATTTTTTGTTTACACACAAAGCGAAGCGAAACACAGCAGAAG 
ATCAAGAAAAGCAGACAGAAATCGTTCGATAATTTGTGTTTATTTTTATTTTCCCACTGTCTTTGACTTTTGACTTTGCTCTGCTGGCTGATAGA 
ATGTGATTTTCTCCATTTTCTTGATTTTCTTGATTTTCAATTTTGTTTTGGTGCCGCCGCCTCGCTAGAAAAGTATTTTTAAATGCTCCAACTAC 
GTAAGACGACATTCAAGTGTTTTATTTTAGACAGTAGACGGTATAGAGCCAACTAGTCGGCTATATATACACACATTGGCGTCTATATATCTGCA 
TAAATATTTTTAGTTGCCTCTATTGGCCTTTTGCGTTAGCACATTTTTGTTAAATATTATCTATTTGTATTTGCCTTTATGACAGCAAAATGGAA 
AACTCGAAACACTTCGATTACGCAATTTGGCGCAGCCCATAACGATCATAATTCAATTTCGCCCCCCCATAAAACGAATCTTGAAACATTCTCGA 
TATTTCAAACTTTATGGCGCCGCAAAAGCAAAGGCTGGTGCTTTTTTTTTATCTAATGTCACGTTATTTCGTATATTTCTTATTTCGTGGTCTAA 
TTAAATTAAACTGTTTATAAATACTTCTGCTCTATCAAAGGCCACCAAAGCAAACGATTAAAGAAAAGAATCTCCAAATCGTGCATTTGAGAGCA 
AACCAAAAATGTTTCCCAGCGCCAAGCCATCAAAATAAAAATTTTCACAAATTGTGTGCGTTGTTCTTTCTTGAAATTTTTTTTCTTTTCTTTTC 
GTTTTTTTCTTTCTTCTTCTTTCTGCTTGTCTTGGCGTTTTGGCATTTGAAGGACGGAAATACGTTGGCGGCGTTGTGAATTTTTGCATATTAGC 
AAAAAGTGAGAAAAAAGAAATAGCCATAAATACACAAAAATTTTGATAAATTCAAAGCGCTTACGTAGGCAAACTCATGGGGCTATGCGAATGAT 
TCGCTTCGATTCAATTCGATTCGATTCGATTCGAATCGAGCCGATGAAATGTATGCACAGGCATGCGCATTGGCCGTGAAAAAAAATGGTAACAG 
ATTTCCAGACGCGCACCGAAAGTATCTGTATCTGTATCGGTATCTGTGTCTGTGGCTGTGGCTGTGAGTGCGGAGTATCTGTTGCTCTGTAGCTC 

T GT T GT C AAGGT C AT G AC T GCT C T T T AGG GGT TGCCCCGTC G AT AAAC AT GT GT C GT C ATT C CC C AGG CGG C AG GGC AT T C C T C AAAACT C AC AA 
GCGGCACACAGGCAGCAAAGGCGGAACGTATTATAACTGTTGCATGTTTAATTGTTCGCAAAACACTTGAGAGCAGCAGTCTTAAGACCCTGAGA 
TATGTACGTGTACAGTGAAACTCCGCCATGGGAAACATCAAGCTGAAGTACTGACAATCTGATTTTTTAAGTGATTGTATTTTTAATATATAGTT 
TAAAAAGATCGATAAATGCTTAATTCGACTCCTAATTTTTTCACGCTACTTTATATCCTAGTTTTAGATACATCCCGCAGAATCTTTAGTTTCTA 
AAGATTTCAAAGAGCAAATGTTTAAGTTCGGGTCGTAACTTGCAACAAATTGGTGTCATGCTAGAAGTTTCCACTGTGGCAGCAGGAACAGCAAC 
AGCAACAGCGATTTCGGCTGAACGCCCCGTGGTGCAGTCTTAAGTGGATTTCCACCTTAAGTTGCATTAGTATCGATGTTGTTGCTGCTCACGTT 
GCACATCGATGTGGTTGTTGTTGCTGCTGCTGCCGCCGCTGCTGGCGTGTCAATGACGTTGCAAGCCTTTTGTTTGTCTTTCTACCTAGCCAGGT 
ATCCTGGCGCCGGCTTAATTTACGAAAAGGTAAAGGTAAACGGCAGGTAAAACGCCTCAAAGTAGTTCAAGTCCAATCAGCATGCTCTCCCGCTC 
CCGCACGCACACCAACCAATGCACCTTTTTACCCACACCAATGCAGAGATTATCCATCCACTCACCCGCACACGTTGCTCTTCTTGTTGCTGTAG 
AAACTAACACTAATACTAATCTGAGAGATATTGCGGCCCGGCCGCGCGGGGTTGCCCAGATTTAAGTGTTGTTTTTGTGTTGCTTCTCCTTCGGT 
ATTGTATTCTCTTGTGAGTGGGACGGGGGGTTGGGGGGATCGCTCTTTCGACTTTTGTTTATTTCGAATTTTCACAGTTTTTCTTGGAATTCCTT 
CCACACTCGCTTGCTCACTCGCACGCACTTACTCACTCGCTCTCTTTTCTATTTCCTTTACTTTTTAAGAGAGATACTTTGAAATTTGCACTTTG 
ATTCCGTGCGTGTTTTCGTATACTATTTTTGCACCGTAGCTAATTAGCATATTATGACATTCATTTCGCTGTAACCTTTTCCTCTTCGTGGCTCT 
TGCCGTGCACTCTTTTGCGCGCTATGATCTAATTTGATTAATACAAAACTTGTTGTGGCGGCTGCACTCGCACTTCTCGCCCACACACACTATTA 
TACACAAACATGCATGCACACACAAGTATGCTGAGCAGAAACATACGCGCGATTGTTCCGTTCTTTTGTGGTTTCTTTTCTGCCTTTTGTTTATG 

TTGTGCTGCGTTGTTGTTTATTCC GACT T GT T C G AAT T AAT TCATCGCACCGCC G AAAAAT T AAT T AAT AGCC AC GC C GC AC CG AAC AAG C G AAC 
AACCGCACCGAACGACGAAAGATCGAAGAATGGCGCGCCTCGACTGTTTTTTTCGCCGTCGACGCTGGCAGCGACGCGGGCAGCGCAGCAGGCAG 
CGACGCTTAACCCTTGTGTAGACTTTCGAAGAGCTGCAAAGTGCAGGTGCCAACTCTCAGTTGATCTTGATCATAAAGTTATAGATCTAACGCGG 
TTATCATGATTTAGTTTATTTAATTTTCGTATCTATATACGCATTGCTTAAAAGTTTAGTTTTATTATTAAAATATGATATTTTTACATTCAGAT 
GAGATGTTTTTCCAGTTTTATTATTGAAGCTAAGACGATATGTTATCAACACCATTTAAGACATTCCCTATTACCATTTATTTAATCTTTATGAC 
ATTGCCAGGAGTGGAAATACTAATTATTATTTTATCCGTACCCTAACCCTGAACCAGTCGA 
(SEQ ID NO: 1246) 

Exon: 19486.. 19351 
Exon: 10107.. 8495 
Exon: 8433.. 7723 
Exon: 7654.. 7547 
Exon: 7479.. 4445 
Exon: 4379.. 4153 
Exon: 4089.. 1001 

Start ATG: 10060 (Reverse strand: CAT) 
Transcript No. : CT42591 

GAGAATACAATACCGAAGGAGAAGCAACACAAAAACAACACTTAAATCTGGGCAACCCCGCGCGGCCGGGCCGCAATATCTCTCAGATTAGTATT 
AGTGTTAGTTTCTACAGCAACAAGAAGAGCAACGTGTGCGGATGAGCCCACTGCAAATACGTCACAGGATAATAAATTCGATAATAATATGGAAA 
ATGCAACCACCACAACGGCCAAGCATTACAAAACCGCAGGCAAACAGTACAAGACCAATAAAGTGAACAATACGCGACGTGCCACAGCAGCAGCG 
GCAGCAGCAGCAGCAGCAGCAGCAACCACAACAGTAACAACAGCAGCAACGCCAACCAAGAAGCGGACATATCGGGAAACTGCAACAGCAACCAT 
AGTCACTCAGAGATCAACCAACAAGGCCAACATCGCTGCAGCAATTGCATTGGCAGCAGCAACAGAGGCAACAGCATCAGCATCAGCATCTGCAA 
CAGCAACAGCAACAGATGCAACATTAACAGCTTCTAAAGCGGCAGCAACAGCAGCAGCAACAACAGATGCTGCTAGTGGCAACAGCAACAGCAGC 
AGCAAACCCAGCACAAGCACACGAGATAAGCTAGGCGAGGTGCCGCTGCCGACTGTTGACAGCAACCACATCATCAGCAACAACAATAATAATAA 
TAACAATAACAACACAAGTAATAATAACAACAACAACCACCACAGTGATAATAGTATTAGTGAGAATAATTATGAATTTAAGAGTAGAGATAAAG 
CAAGTCTAAGTGATAGCAAAATGACGCTACAACAGATGGCAGCAGTCAGCAGCAACCAGGAGCCAGTGGCTCCCAATGTGGCCAACACGATGAGC 
AACTCGAGCATCATCAACAGCCAGCAGACCACCGTAAATGCCACACCCGCCACCGATGAGGCTCCGTTGGGGGATCGGTCAAATATTTCACGCTA 
TCTGCACAAGAAGTTCAAACGTTTGGCCAGCACCACCGAGGTGGACAGCTGGAGTGCGACGAATGGCGGCGGAGCACTGCAAGCAAGTGGCGATG 
CCGTTCGCACCACATCGCTCTCCTCAAACAGTTCGCTTTCACCGCCACCAACAACGCCGCTGGCCAACGGTCACCATCTGATGACCCAACAACAA 
AGCCACCAGCAAGTGCAGCAGCAGCAGCAGCAGCAACAGACGCCGCCGCCTTCAGTAGCAATACATGAATTCAGTGCCAATTTTGTAAATAGCAA 
TCATGTCAATGCCATTGGTCATGAGGAGAGCATCATCAGCAATAGTGGCTACAAAGCGGCGGGAAGAACGCGAACGCCTCTAGCCAACAGCAACA 
GTAACACGAACAGCACCAGCAACAGCAACAGTAACCATGCAGCAAATGCCACACTGTCACCGGCTTCATTTGCCCAGCATCAGCAATTGACGCAG 
CCAACAACAGTGAGTCCTGGTATTCCAGGCGGAGCAGCAGCGCCAAATCCTGCATGCGAGAAGTCCGGACGATACGTTTGTCAGTACTGTAACTT 
GATCTGTGCCAAGCCCTCGGTGCTGGAGAAGCACATACGGGCCCACACGAACGAACGGCCGTATCCGTGCGACACGTGCGGCATTGCGTTTAAGA 
CGAAGAGTAATTTGTACAAACATTGCCGCTCGAGATCGCATGCGGCAAGAGCGCGGGGCCTGGAAGTGCCAGCCGATGCGGACGATGGTTTATCC 
GATCAGGATGCAGAGCTGAGCAACAGTAGTTCAGAGTTGCCGAGTCGCGCTGGTTCGCCTTACGAGGAGCCAATCAATTCGCCCACACCCTCACC 
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TTCCACACTGTCTGCGGCCAAGAGCGCATACATTCAACAGCCTCCGCTGCCCACTTACATGCAGCAATTGCCTCTGGGCTCGCCGGCAGCAGGAA 
CATTGCCACCCACCACAGCGGACAATCACCACTCGGCCACTGCCCAGCATCGCCAGTCGATCGACTATAAGCCATATAAGCCCAAGTTCCACAAT 
GCCTCGTTGTACTCATGTAGCAGTAAGGAGCTGCAACAGCAGCAGCAGCAACTGCAGATACAGCAACAGCAACAGCAACACCAATTGGCGCAGCA 
AAAGCTGTCCATTCAGCTTCCTCTGGTTCAGCAGCCGTCACTGGCGCATCCCACGCTTTCGCCAAGCACGCAAATGAAGATGAAGCACCACATTA 
ACTCGCATCAGATCCAGTTGCAATTGCAGCAGCAGCAATCCCTTTTGGCCCAGCAATCATTGCTAGCTGCCATGCCACCTGGCGGGGTGTATTAC 
CTGGGGCAACCTTCATACTACAACCAAGATACAGCGGCTGCCATTCACCAGCATGCGTTAGCCATTCAGCAATTCCAAATCCACCTGGCACAGCA 
GCAGCAACAACAGCAGCAGCAACACCCACCGTTGGTCAAGGCGACACCACCAATGCAACAACCTCCGAGTCTCCCGCCGCAACAGTTGGTGCGTG 
CCAATAGCCAGATTTCCAGTGTAGCAACAGCACCAGCCACTCCCACGCCCGCCGCCAATCTCAGCACTTTTGCCAGCTCAAGTGGTGGCAAGCAA 
GT AAAT G T GGC G AAAGT GC AGG AAC AC AT T T CC AAG T T GAT C T CT C AAAAT G AAGC C AT AGT T G AAAAC AAGG AGAT AT T GT T GC AG AAG AAGT A 
CCCCAAACAGCTTAGTCGTTCCCGCAGTTTCAACAATGCCAACAGCAATAATGCTTCGCAGCACGGGAGCAACGCGTCGGCATTGCATGCAAACA 
ATAGTCAAAACAACACCAACGCACAGATGCCGGAGCGCGAGACTAAAGTGAATTTGGCACAGGCCATCTTCCAGAAACAGCAACATCAGCTTCAG 
CAACAGCAACAGCAGCAGATAACCCAGCAGCAGCAGCAGCAGCTCGAGCAACAGAACTACTATACGTACATCCAACAGCAACAGGAATGCCAACA 
GCAACAGCTCGAACCACCAAATGGGGTGGTTAAAAGGAATGCCTATAAGCCTGGAGTAGTGATGACTACGCCGGTGAAGCAACAGCAGCAACTTC 
CTCCACCGCCGTCCCCGTTGCCCATGCAAACAATGCAATATCGCCAGGATCCTGCAACACCAGTGACGAAAATTGAGCAACCGACGACGGCAGTG 
CCTGTTATGCCTCTGAATCTATCAGCGAAGCCTAAACCAACTCTGGTCACTGTGCCCGTCAGTTCCTTGTCCACCAGTTCTCTGGCACCGACTCC 
AACGACTTCTACCAATCCGACTAGTAGCTCCAAAACTGCTCCACCACCTGCTCAGGTTAATAACTCGATAATTAAGAACCTACTGCTTAATGCTC 
GAGGATTGGCCGTGCCGATTGGCGAAGGCGATGATGCTGTTTATTCGTGTCCTATCTGTGCAAGCGAATTCCGCAGTGCGGACGATTTGAAGCTG 
CACAACAGCACTTACTGTCAGGATGCGAGCTCAAGTGCCCCAATGAGTCCAGCATCTTCGCCATTCCGCTCCAACTCGATCTCGCTGAGTCTACC 

GG AGCT G AAG AGC C AC AT GGCC AACT C GAAG AAT CC C TTATCTCTGGC CAAGT T GG CC T GGT CGC AGT T G AAG AC C AAAAG GAG C AGT T T GGT AT 
TAAGTCGCCTAAGTGCAGCACAGACGCCAGCAAGGACATCAACTGTAACAGCTCCCACTGTAACCGCTTCTGCCCCAGCTCCTGCTGTTGCTACG 
GTCACTGCTCCATCATCAGCACCTGCCCCCCAGATTGAAGCACTGCGCTTTGTGGATGCGCCTCTACCATCGCCTGGTCCATTGTTGGGAAAAAC 
TCCCTTAGTGGACTATGCTCAGCAAAGTACACCACGCAAGGCCCAGGATTCTGTGGTCATAACCAAAATGCACGAGGATCGGCAATTTGTGATTG 
AAGCTCAGCCTGCCAAACGCATCAAGACTAGCGATTTGGTTGTAGCATCCTCCTCTCAGCAACCGACAAGTTTCAACTTTTCCTTTAATAATCAG 
AATAGCAGTAATAGTGTGCCGGAATTGCAGTCCTCCAAGGAGGAGCGACTGCGCAGATTCACCTCTTCGGGAGGCAGCATGATTCCCATTTCGGA 
GTGCCCAGACTTGGACAACAGCCCCAAGATGATTAGAACACCGTTGCTGTCCGGCGGAAGCTTCCAGGATGTTTCGGTCAAAGTCAACAACGAAA 
CAGGATCATCCAGTAAGGAGCGCAAGCTGATGGCACTGGTCAGTGGCAGTGGACTTCTCGGCGTTAGCTCAGGTCCGCAGCACTTTCAATTTCCG 
C CC AT T AAC T C AAT AAC T GC C T T C AATC C GC T AAC GC T G C C AC C GAT G AGC GGT GGAG AT AAAAC C ACT C C AGT AAC ACC AAT AC C T C AT GT G CC 
TGGAATGCCAGGACCCGGTAGTCTGACACCACAAATGCCGCTACTGCCGCCTCCGCCACAACAGCTGCAGCTACCCATTCCCTCTAGTCGAGGTC 
GCAGTCCAAATCGAAAGCAACCCAGTCCGCTGCTGTTGGGAGGAGGATCCGGTGAACTGAAGGCCCTATCGCCATTCGGTGGAGTTCAGAATGTG 
C C AAGC GAGT T C AGT C G AC AGC C AC C AAC TC C T GC C C AGC GGC AAGC GC T GC AGTGG AAC AGC AAGG AGGC AC C G AAG AAAGC AC C AT T T AAC TT 
TCTACGCATGGCTGATAATGTGAAGACCACTGAACCTGAAGTTCGACACTTTAATTTGGAAAATGTGATTTCTGGGAAACAGCAAGAGTTGCCCC 
TAACACCGCTACATGTTGATACTCCCAACGGAAACGCTCCAGAAGAGACCAGTCCCGTAGCCAGTGCCTCAGCATCGGCGAAGTCCAAGTTTCTG 
CGGCCTACTAGTTTGCCACTTAAACCCGGTACTTTTACTCCTAAACGCCATCATGGCATTACACCGACAGCCAATACATTGCCGCTGATCTCGCC 
GGAGACCCCCAGGCCGTCCAAATCCTGTGTGCAACTGTATCTCAATGGACACGCCTACACATATCTGGGCCTCAAATGCAGCACAAAGATGTTTT 
ACTGTACGGTGAACTGTCCACAGCCTTCGTATGTGGCAGGAATGCACAAACTATCGATGTATAGTGTGTGGCAGGTTTGCGAGGAGAACCAGCCA 

C AT C C ACT T GG AT T C AAAC T AAAG C AAG T T AT GG C AC T C T AC G AC T C ACGT C AAAG GAT GT T AGG AAAT GG C AG C T C AAC C GCC AT G GC T G G AT C 

GGGAAAACTTAGCTACAACTTAGTCGCTTCCCAGCAAATCGTATCCTCCCCCTCGACGTCCTCCACTAGCAGTGCTTTCTATCAAGGACCCCTAA 

AGACCCCACCCACTGTCACGATTGCCGCTCTCAGTGAGGCAAATGTGGCGGCCAAGGCGAATGAGGAGGCACAGGCCAAAAAGTTGGAGACAAGT 

CCATCCGGGCAGCCCCTGGTGGGTGGCTACGAGTCTCATGAGGATTACACGTACATCCGTGGCCGCGGAAGGGGAAGATATGTTTGCTCCGAGTG 

CGGAATTCGTTGCAAGAAGCCGTCGATGCTCAAGAAGCACATTCGCACTCACACGGACGTGAGGCCATTCACATGCAGCCATTGCAACTTCAGTT 

TTAAGACCAAGGGAAATCTGACCAAGCATATGCAATCGAAAACTCACTTTAAGAAGTGCATTGAACTGGGAATAAATCCGGGACCAATGCCGCCA 

GACAGCGAGTTCTTGGATGTAGATATGGACTTTGACCAGCAGTCATCCACTTCGGCAGGAGGACGCACATCATCGATGGCTGGGGAATCGGATTC 

CGATGACTACAGTGATAATGAATCTGAAAGTAGTGATACGGATGAGTCCAAGTCGCGACAGAAAGAGCACGAAGCAGCCAGAGGCCTACTCTCGT 

TATCAATGACGCCGCCCATACCGCAGAGCGTATCGCCCTATCCGCAGTTGCAGGATACACCCCTACCAGCGGCCAGTCCTGCAAACAGCATTGGC 

TCTTCGGGTTCGCAACCTAAAAGATTAGTGTGCTCCTTTACATCGCCGAAGCCGCCTTTCGACTACCAAAAGCAGGAACAGTATTACTCCAATCC 

GGAGGAGTCCAAACCGAAACGGAGTGTGGCCAACGAGGAGAGTGCTCCCATGGATCTCACTAAGCCACGCGGCTCCATCTTATTAATATCCCCCT 

CGCCTGTTTCGGTGCCCGCTCATGATCTTCCCAAATCGCAGGCCCAACAGATGCACGATGTTATCTTTGGAACCTCCGGCAACGAAAGCGGTTTC 

ATGAAGACTCTAATCTCGGTTTCGGACAAAGTGCGAATATCTGCCGAAATGGAAGAGCAGGCCAAGCACGAGGCCGAGGGCGAGGATGTTCAACT 

ACAGACCTACATTAAGGAGCATGCGCTGCATCAAGCCAAGATAAAGCAGAGTCAGTTTAGCCGCAGCTATCTGATCAACACATTATACACAGCGG 

CGTCTCCTGTAATGAGCAGTAGTAGCACTCTCTTTACCGCCAACAGCAGGCCCGTTATGAGCGTAAATGAGGTACCTAGCATCGAAGTGCACGAA 

GTAAAGACCCCAGAGGCCATTGAGTCACCTCGGAGTGCACCTGAACAGGCCCCGGTTATTCTTGCTCAGATTGCCCAGGAAGAAAACGAGGAGCC 

TAATATCGCAGAGCCACACAATGCCAATCTGCCTGCTGCAGTGCAGCAACCGGATGTCAATGAATTTACTGGAGTACTTGGAAATCCGACTGCAC 

CACCAACATCGTCTGTTACTGCTACTTCAGTAAGCACAACAACAGCTGCACCTGTGGCACCCTCATCCACTGCCAACTCAACTCAGCCCGCGCAG 

CGTACTGTGATTGTGGGTGAAGATGGCTTCAAAAGCTCTACTCCCACTAGCAAGAGCGGCGATCTCCAGCATGTATCGTATGGCAGAGGAGTGCC 

GCCAGCTCCCATTGCAGGTGATGCACGTCCCACCTGCACCATGTGCTCCAAGACGTTTCAGCGGCAACATCAGCTGACCCTGCACATGAATATTC 

ATTACATGGAGCGAAAGTTTAAGTGCGAGCCATGCAGCATATCCTTCCGCACGCAAGGACATCTGCAGAAGCACGAGCGATCGGAAGCGCACAAA 

AACAAGGTTATGATGACGTCGACTTTTGGTGTTCCAACCACCTCAAATCCGCGACCCTTCGAGTGCACCGACTGCAAGATCGCCTTCCGCATCCA 

TGGGCATTTGGCCAAGCATTTGCGCTCCAAGACGCATGTTCAGAAGCTAGAGTGTCTGCAGAAGTTGCCCTTCGGGACCTATGCTGAAATTGAAC 

GT GC C GG AAT TAG T C T C AC T GAAAT C GAT AC AAGC G AC T GT G AAAAT T C GC T GAT AT C AC T CAAGT TGC TGGC C C AGAAACT G C AG GAG AAGGAT 

CCTGGTAAGCTCAGTAGCTATACCACACCTTCTGGCATGATGCAGCTGGCCCAGGATGCAGCAGGTCCCGTCTCGCAGGATAGTGCGTCCGAGGA 

TGGCTTTAGTGCCGCAATTGCCTCGGCTATAGCTTCGCTGGACAACGACAGTGCTGGCAACACACCCAAGCGAGCCAGCTCGATGTCCGAGGATG 

AAAC GGTGGCCAATGGCCT AAACC AT AGCCTGAAGCGTCGTTT GCC GGGC AGTTTCAGCAGTACGGGCGAGGAGAGCGATAATCC ACC GGAAA GC 

AGTGGCGAGAAGCGGGCTCGTTCTGGCCAGGAGCTGCCTGTTCCCGTGGCCGTTCCCGTGGCTGCTTCAGCAGCGGCAAGCAACTGACAGTACAT 

GTACATGCTGCAGCCCAGCTGAGGTTATCAGTATCCCGCGGTCTACCACCTGGACAGCAGTCAGTACGGCATCTACAATTCCAGGATCGAGGGCC 

GCCATGACCAAACCACCTCCAACGAGCGCATATCAAATGTCTAAGCATATCGCATATGGGCACACACACAACTTACTCTAGAGCTAGCCTATAAG 

ACCCACCTCTCTTCGATGATGGGCCCAACTAGTATTGGGCTAATCGCGAGTAGTAATCAAACGATAGGGGAGCCAGATAAGCGAGTCAGCTATTC 

CAGTGTAAACACTCTAGGCGAATTGCGCTAAAATAATCATATACAAAGCCTTTTACGGTTTTAAAAGGCAGTCACCCCACACAGATCGTTTCGAT 

CTGCCTCTTAAGTATTTAGGTTAAGGAATCGTCATCAGCTAGCCCTAGTTATAAGCCAACCAACGAGCACACTTATAAATCTATCTATATTGTAT 

ACACTTAACCATAATTTGTTTTCGAAATGCATAAATGCAAAAACTCCTGAAGTGCTGTGAGTCTATCTACAAATGTAAAGATCCGAGAGCGAGAG 

AAGTGTGTTATTTTTAAGGCCTAGCCCCGAAAATGAACCTCCAAACCGAACCAGAACACACCCGCACACTAAGTATATTGGTTTTCTATGTCCAA 

ATCCAAGTTTTGCCCTTTGAAAATCACGTAGTTCTTCCTTATTTCAAATTGTTATCACACCAAGTTTTGAAGCGCAACTATTAACGGTGAATGAG 
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AAATGACTTGCGCCCCTAGTTTTGTTTTAATTCTTTAAACAACGAGACCACGAGACTCATTTAATTGTTTCTTTTAATTTTGTATTTGACAGAAG 
AGAAACGAAATGATGAAAAAAAACGCTAACTGTAAATAATCAATATGTTGTCATATATGCATTAAGCGGTAGGTTAGATAAACAAACTACACAAA 
GTTGATAGATATAAATGAAATATTGTACAAACTGCTATGGCAAGCGCTGCAGCAAAATCTATATATACACATAGATATATATCTATATATTTGAA 
TGTGTGTATGTGGTTAAACAACAAGAAAATGAAGTGATCTATTACATTATTGTAAAATATATCAAAAAACTGAGAAAAAAAAAA 

(SEQ ID NO: 1247) 

Start ATG: 184 (Reverse strand: CAT) 

MENATTTTAKHYKTAGKQYKTNKVNNTRRATAAAAAAAAAAATTTVTTAATPTKKRTYRETATATIVTQRSTNKANIAAAIALAAATEATASASA 
SATATATDATLTASKAAATAAATTDAASGNSNSSSKPSTSTRDKLGEVPLPTVDSNHIISNNNNNNNNNNTSNNNNNNHHSDNSISENNYEFKSR 
DKASLSDSKMTLQQMAAVSSNQEPVAPNVANTMSNSSIINSQQTTVNATPATDEAPLGDRSNISRYLHKKFKRLASTTEVDSWSATNGGGALQAS 
GDAVRTTSLSSNSSLSPPPTTPLANGHHLMTQQQSHQQVQQQQQQQQTPPPSVAIHEFSANFVNSNHVNAIGHEESIISNSGYKAAGRTRTPLAN 
SNSNTNSTSNSNSNHAANATLSPASFAQHQQLTQPTTVSPGIPGGAAAPNPACEKSGRYVCQYCNLICAKPSVLEKHIRAHTNERPYPCDTCGIA 
FKTKSNLYKHCRSRSHAARARGLEVPADADDGLSDQDAELSNSSSELPSRAGSPYEEPINSPTPSPSTLSAAKSAYIQQPPLPTYMQQLPLGSPA 
AGTLPPTTADNHHSATAQHRQSIDYKPYKPKFHNASLYSCSSKELQQQQQQLQIQQQQQQHQLAQQKLSIQLPLVQQPSLAHPTLSPSTQMKMKH 
HINSHQIQLQLQQQQSLLAQQSLLAAMPPGGVYYLGQPSYYNQDTAAAIHQHALAIQQFQIHLAQQQQQQQQQHPPLVKATPPMQQPPSLPPQQL 
VRANSQISSVATAPATPTPAANLSTFASSSGGKQVNVAKVQEHISKLISQNEAIVENKEILLQKKYPKQLSRSRSFNNANSNNASQHGSNASALH 
ANNSQNNTNAQMPERETKVNLAQAIFQKQQHQLQQQQQQQITQQQQQQLEQQNYYTYIQQQQECQQQQLEPPNGVVKRNAYKPGVVMTTPVKQQQ 
QLPPPPSPLPMQTMQYRQDPATPVTKIEQPTTAVPVMPLNLSAKPKPTLVTVPVSSLSTSSLAPTPTTSTNPTSSSKTAPPPAQVNNSIIKNLLL 
NARGLAVPIGEGDDAVYSCPICASEFRSADDLKLHNSTYCQDASSSAPMSPASSPFRSNSISLSLPELKSHMANSKNPLSLAKLAWSQLKTKRSS 
LVLSRLSAAQTPARTSTVTAPTVTASAPAPAVATVTAPSSAPAPQIEALRFVDAPLPSPGPLLGKTPLVDYAQQSTPRKAQDSVVITKMHEDRQF 
VIEAQPAKRIKTSDLVVASSSQQPTSFNFSFNNQNSSNSVPELQSSKEERLRRFTSSGGSMIPISECPDLDNSPKMIRTPLLSGGSFQDVSVKVN 
NETGSSSKERKLMALVSGSGLLGVSSGPQHFQFPPINSITAFNPLTLPPMSGGDKTTPVTPIPHVPGMPGPGSLTPQMPLLPPPPQQLQLPIPSS 
RGRSPNRKQPSPLLLGGGSGELKALSPFGGVQNVPSEFSRQPPTPAQRQALQWNSKEAPKKAPFNFLRMADNVKTTEPEVRHFNLENVISGKQQE 
LPLTPLHVDTPNGNAPEETSPVASASASAKSKFLRPTSLPLKPGTFTPKRHHGITPTANTLPLISPETPRPSKSCVQLYLNGHAYTYLGLKCSTK 
MFYCTVNCPQPSYVAGMHKLSMYSVWQVCEENQPHPLGFKLKQVMALYDSRQRMLGNGSSTAMAGSGKLSYNLVASQQIVSSPSTSSTSSAFYQG 
PLKTPPTVTIAALSEANVAAKANEEAQAKKLETSPSGQPLVGGYESHEDYTYIRGRGRGRYVCSECGIRCKKPSMLKKHIRTHTDVRPFTCSHCN 
FSFKTKGNLTKHMQSKTHFKKCIELGINPGPMPPDSEFLDVDMDFDQQSSTSAGGRTSSMAGESDSDDYSDNESESSDTDESKSRQKEHEAARGL 
LSLSMTPPIPQSVSPYPQLQDTPLPAASPANSIGSSGSQPKRLVCSFTSPKPPFDYQKQEQYYSNPEESKPKRSVANEESAPMDLTKPRGSILLI 
SPSPVSVPAHDLPKSQAQQMHDVIFGTSGNESGFMKTLISVSDKVRISAEMEEQAKHEAEGEDVQLQTYIKEHALHQAKIKQSQFSRSYLINTLY 
TAASPVMSSSSTLFTAWSRPVMSVNEVPSIEVHEVKTPEAIESPRSAPEQAPVILAQIAQEENEEPNIAEPHNANLPAAVQQPDVNEFTGVLGNP 
TAPPTSSVTATSVSTTTAAPVAPSSTANSTQPAQRTVIVGEDGFKSSTPTSKSGDLQHVSYGRGVPPAPIAGDARPTCTMCSKTFQRQHQLTLHM 
NIHYMERKFKCEPCSISFRTQGHLQKHERSEAHKNKVMMTSTFGVPTTSNPRPFECTDCKIAFRIHGHLAKHLRSKTHVQKLECLQKLPFGTYAE 
IERAGISLTEIDTSDCENSLISLKLLAQKLQEKDPGKLSSYTTPSGMMQLAQDAAGPVSQDSASEDGFSAAIASAIASLDNDSAGNTPKRASSMS 
EDETVANGLNHSLKRRLPGSFSSTGEESDNPPESSGEKRARSGQELPVPVAVPVAASAAASN* 
(SEQ ID NO: 1248) 

Name : schnurri 

Classification: transcription_f actor 
Gene Symbol: shn 
FlyBase ID: FBgn0003396 



Celera Sequence No. : 142000013384567 

TTTACGTGTTCTCATAAGATATAGTTTATATTAATGTATATTCATATTTTTACCCATTCATTATCTCTTGGCGCATTTAGAATTTAAGTTTTTTT 
TTTGCATTTTCCTTTTTTATTTTACGGTTTGGTATTACTTATAAACAAATTTCATATTATAGCTATAGTTTATTAGCTAGCGTTGTGTAACTTTC 
CTCCGCAGGGCCATTACTTACATTGTTAAATCGAAACGTTAATGCAATTGTTTTGTTGTGTATGTCTAACGCAATTTATATGCTCTCTATCTTAG 
TTACATGATTTTGTTGCGTAAACAAAGGGCTGCCCCAACTCCCAGCTCCCACATCTCGCATCCCATTTGCCCACCGCTCGGGAAATGCGCGCTTA 
AGTGTTTAAAAATCGGAAAGATATATATGAACACAGGCAGCAAAGTGCAGACCACGAAAGTATATATATATATATATATATATATATATATATAT 
ATATATATATATATATATATATATATATATATATATAAATATATATATATAGATAGGGGAGTAGGCGGTACAGAAGGTTACATAATATAACATAG 
GAGGGCCTTATAAGTCAGAGCGTTGCTTAAAGCTAGAGAACTGCACGTGGTAATCGCAATCATTTTACCATCAACTTAGGTTACAATGTGGGTGC 
CTTGCTGTGTGGCATATGCTTTTGTATGGATTGGATTCGCTAGAGCTAGAGCCTATAGTCTAGGGTTAGGGACACTCGCTGAACGCGGTGTCCTT 
TGTTTCTTTTACACAAATATTACACTTACAATCGAGTTATTATGTGTTCATGATCGGTTCTGCATGTTCTGCCTGATTTGGTCTGGGACTTCAAG 
TTAACAAAGAAAAAGTTGAGCCGGCCAAGAGCAAATAGTCAAATGCAAAACAGAATTAAATGCCTGCGTGTCCTGCTAACCCTCCTCCCAAACCC 
TTCCGCGGACCCCTAATCAACCGCCGATCTATTGGCCAATCTTCAGGCAGGGATCTCAAATCAATTGAACAGTGCGGTTTGTTGCAGTTTCTTGC 
AGCCAGGTTGCAATCGTTTATGTTCTAATAGTATAAATTAAATTAATCATAGGCTAGTTAATTAATAATAAGCGCGCTCAACGTTCTAACATCTG 
GGAGCAGAAACCAAAATGTCAAAAAACAAACGAAAATGTACACACAAAAAAAGAAAAGCGCTGCGCTTTGCTATGCTCTGCTCTTCTTCTCTCAC 
TGCTCTTGTCCTCCGTTTCTGGCAGGGACCTGGTCATCCTGCCGCTTGGATACATTCAACGTGTTTTACCGCATATATAAAGCAAGTATTATATA 
TATTAGGTATATATAACTATATATACTACATGTATAGTTTGTTGAGTTCTGTTGTTGTTTGTCTACTGTGTCGTGTCCAGCGTCGCAACCTAAAC 
GATAAAATATTATTTGGCGTTCCCCACGCCACCGCCCCCTCCGCCGCTCGGCATACCGCCTACGCCCACTCCACCGCCGCCACCACCGCCTCCCT 
GCCGCATCCGTATCGAAGGCCGACGCTCCGGCGTGTCGTACGGCGGTCTCGACGAGTTGGTGGCCGCCATCGAGTCGTCCATGTTGAGTCCGTTG 
CCGCCGATCAGTTGGCTGCCGTTACCGCCTCCATTGCCGCCCAGCATCGCCAGCGTTTGGCCGCCCTGGAGTTGCTGTTGCGCTGCACTGCCTCC 
TCCGCCACCCAGCGCCGCATTGACCTTGTGCTGCTGTTGTTGATGCGAGGAGGCGGCTGCCGCTGCTGCCACCGCCGCTGCTGCGGCCACCGCAT 
CATGACCCGCCTGGCCGTCCGCTCCGTCCTGCGCCTGCTGAATCGCCTGGGGATTGCGAGGTCCGCCAAACTTAAGCAGGTTCCAGTCAAACACA 
TAGTCGTAAGTGAAGCCCAAACGGTGGAACAAGTTGCGGAAGAGTTTGCGCAGGTGGCAGTAATCGGGACGCTGGTCGAAATGCATCTGGCGACA 
GAAGTTCAGATAGTTGACGAACTCGCTGGGGAAGCCCTTGCACAGCACCACAATCGAGGTGGACAGCTTCTTCTCCGAGATCCTCTCGTACTTTT 
GCCTCTTGTTGGCTGCCTTTAAGCCCTGCCAGGGCAAGGCGCCCAGATTGAAGTACATTAGGACGTAGCCCAGGGACTCCAGGTCGTCACGACGC 
GATTGCTCAATGCCCAAATGTGTGTTGATGGAGGCATAGCGGGCAGTGCCCGTGAGGTTCTTGTTTTCCCGATAGGGAATGTGCTTCAGCGACCG 
GGCATCGCGGAATTTCTTGGCCAGGCCAAAGTCAATGATGTACACCAGGTTGCCCTTTTTGCCAAGACCCATGAGGAAGTTATCCGGCTTGATGT 
CGCGATGGATGAAGTCCCGCGAGTGTATGTAATCGATGCGGGAGATCATCTGGTCCGCCAGCAGCAGAACCGTCTTCAACGAAAAGCGGCGTGAA 
CAAAAGTTGAAGAGGTCCTCCAGCGAGGGTCCGAGTAGCTCCATCACCATCACATTGTAGTCGCCCTCGCTGCCGCACCAGATTATGCGGGGTAT 
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GCCTATGCCACCCTGCATCGTCTTGTAGAACTTTGACTCGATGTGCAGCTGGGGGTGTTTGGTGCGGATGCACTCCAGCTTGATGGCCACCTCCT 
CGCCAGTGTTGATCGTGGTGCCCAGGTAGATGTCGCCGAACGATCCCGATCCTATCTTGCGGCCCAGGCGATATTTGTTACCCACGCGCAGCTCC 
ATTTTGTTACGTCTGTTTCTGTGGGTTTCTGTGTGGTAGAGTTCGGACTCCTCGGGTTCCTCGGATTTGCGTTGCCTTTCTGCGGGACCGGAGAA 
TTGAAATTTAGTTTTAATTTTACAACACATTGCCTCGAAAAGAAGGCTGTACGCTTTTCTTACTCATAATACTACACGCTACACGAATTGATCGA 

T C C C AGTT C AG AT T AG C T GCGC T T AAAAT AT TAT AC AG AT T T AAAT AT CT AC GC C AAG GAC T AAAAC T T T T AC T T T GT AAT C GC G AAAGG AT AT T 

GAAAGGTTGCTACGATGCAGGCAAATTGTTGAATTCGATCAGAATGTAAATATTCGTAGTATTCTTAGTTGCTTTGACTAATCATAGAAATAGCA 

GCGGCGTCACAGGTTTTTACGAGTAAATATGAATCAAGTGACTTCGAAAATTCGGCTATTGACCACTAGCAACAATGTGTGATATTATTAGAAGA 

GATTATTTGCAAGCAACAACACAATATC GAACGCCTTGTTAGCGATAGTGAT AAAAT ATTCGTCGGATTTTGTTTGCT AC AACTTTTGATTCTTC 

TGACACTATATTGTGACACGTGATGACAACCCTGCTGTTTTGTACAGTTTGCTAAAAGTCTAATTTTGAGAGCTAACGGAGCTTCGTGAGTATTT 

TCCAACACTTCTCCACTTTTCTCACCTGGCACTGTCCTTTCGTCTGAGCTCCTTTCTTCTTCTTGTTCCGATTTGTTCAAGGCAGTCGCTTATGG 

CTTTGTATTCGAAGGGTTCTGTAATAGGAGGTACTAAGTTAATTTTTTTTTAGATCCTCAACATGGTTAATCGAAATTCGACCCGTCTCGCGATC 

ACTTATATATAGAGAGTAATCCCCTGTTTTTATCGCATCTGCAGCGTAAGCCCGAAAAACGTGCAGAATTCGCTTTTGTTTGTGGTGATAAGGAT 

TGTGGCACACAGACATGAAAACAACGCACGGCAACAATCGATATCGAAAATAGAAAACCGTTAGCTAAGAATAAAACAAGAGAGAGAGAGCGGGA 

GAGAAGTAAGCTTGGCTTACGTTGACAACACAAAAATAGCAGAGAAAAGCGTACCAAAAGCAATTCGAAATGCAAATGCACAGTGGGCAAAAATG 

ATAAGAACCCGAACCTCGCACAAAGTAAAAGTTTATCTCGTATATTTATATTTAAGAGTGAGTGCGTCATTGCATTCGGCTGTGTGTATTGATTA 

CGAATGGGGAAAACAACAGATTTAATAAGTGATGAGTAACGATAATTTATATGCACAGAGCAAAATAGCACTACTTTTTGACATTCGTCACATTA 

TTGTTTTAGTCATTTTAATATTATATTGTACTATTATAAAATAAAGCAAATGTGCCGTAGTGCTTTTACATCCTTGAATAGATATAATTCTTTTT 

T TC AAGT GT AT TTTTTACGT GAC GAC AC C AG AAAC AT G GAGGT GAAT C AGC AAT CAGG AAAAG AC AAC GAAG AG AT T AAC AACAAAT AG AT AAAA 

GAGACGTGTGTTTGTGTATGCGTGGAGTAAACAATCGGCGACAAAAGGGAGAGAAGT AAAAT AAAACGCAGCGACTGTGAAAATGCCACTT AAAT 

GACACTTTCAGTGGTGGAAAGCACAAACAAATAGCGAAGATAAAGCGCATGGAAGTCAGAAAACGAAGCAACAAGCCGAAAGTTTTCACTTGCCG 

GTCAAATATCACAAAAATTTAGTTTAATTGTTTTCGGCGGCTCTCACTCACTAACCCCTTCCCACTCCTCTTTCCCACATTCTGTGCGCTTTCTT 

TTACCGCTCACGCATTTCGATCGCTCTCTCTCTTTCTCTCTCTCTCTCTCTCCTCCTTTTCCCGACTCTTTATCACTCACTCTGCGGTTCTAACC 

TCAAAACCTGCCTGTGTTTGTTGTTGTTTTCGCCTTGGCTTTGATTTATTTAACCGAAATGTACCGTTTTTTATTTGTATCTCCCATGTTCTACA 

CAGAAAGAAATCACTGTATAAGCACTGTAAGTCAAATTGAAATGTGGTATTAATTTTAGGAGCTTTAATTTGTTTACTGGAAGGCTAGCAGGGAA 

AACGCGGTCTTTCCTATTGGTAACTAAATATTTCCCAGGGTATCCAGCAGTTGTTTCGGACCAGCGGGTAAACACGAATTGATTACAATTGCAAA 

ACAGCTGCACGTTGCGTTGGCGGCTGCCGAAAAGCTTCCGCAAAAAAACGCACTGGAGAACAAAGAAGATACAAAAAACAGTTGTTCCAGCAGGC 

GCCACAAAAGTAAAAAGCTTAAGTAAACAGGGAAGAGCAAATGGAAAAACCAAAAACCAGAGCAAAACCAACGGGGAAACCTCAGAAGGGGAAGG 

AAACTTGCAATTAATTGTCAAACGCTAATGGCCTCCGAAAACTAACGGCACTCCACGAATGCACTGCGAATAACGGGGCTGATAAGGAGCAGCCG 

ATAGCGGGTGGAGAAGGGAAACTTGAACGCACAGTGGAACAAATTCCGCACTTACCGTGGCTTGTTTTTCTTATATTCCTTGTTTCCTATTCCGC 

TTTGCCCACTTCCTCTGCTGGCGACGCTGCTTTTCCTTCTTCTAATTCGACTGAAATCGAAATCGAATTTTTTAATTGGATTGATTCTGCCACAA 

TAATTTTCATTTTCTTTGTTTTAGAGCCCAGACGTTGTTTTTGTTGTAGTTGCCACAGTCGCACACAAACACACGCACACAGTCGCACTTTTTGG 

GGC C AC ACG C AAGC GC AT T T T AT GC AAT AAT T T T AAT AAT AC T GC AT T GCT GC C AT A T T T C AGTT AT T T T T AT C GT T T AT C T T C G AGT CGGT T AT 

GCTGTTCTTCTCTGATTTTATGGCCACATTGATTAGCATCTTGTTGCCAATCAACTCGAGAAAATTAAACGAAACCAAACGGAACTCGGACTCTT 

ATTAAAAAAACCAAACTTATGCGAGTCGTTTGCCCTGGCAATAGATTCGTGCACACTAGTCGTCCCGCCCGCACTCGCACGGTTTCTACCGCGAG 

CACGTTGCTATCCCTCCTGCTTCAGCATTTTCACACCAACACAAAGACGAGGGCACTCACTCACACACTCGCACTCTCAATGAGCACTGAAATGC 

TGTTGGGAAATTCGAGTTTTCCTGCGCGTGGTAACCGCCGCTTTTTCGCTCGCAAACGGGTCTGTCGATCCGAAACGCTACTGACCACTTACGTT 

TTGCATCAACGCTGCGCTTTCGTTTCAATTTTTCTGGCTATTTCACATAGATTTCGACGATTTTTTACAAATTTACGCCATCTTGATATTCGCAG 

TGTGACCGCGCCGATAATGAAGAAAATTACCATCTGGTATTTACTCCTCAGCGAATGTCTAGAAGATATGTAATTGTGAAAAACTGTGATTCATA 

T AC T AAGT T AAG AT AGT C C AAAGT TGGTT T C GAAGT AC CAC AT T AAG AAAAT AG C GC AAG AGT AT T GG AAAAT C AAAAG AAAAAC G AAAT TAT AT 
CCTTGCAGCTAGCCATAAGTTCAAAGTTCGAGTAACTGGAAAGCTGTGAAATAGCCAAGTTGACAACGCTGTTTTTGTTATATGGGCACACTAGT 
TCTTACATAGACCCAGTGCTGCCAGATGGACGAAAAATATTTTTGAGCTGCGCCACCTAGCGGATGAGTTGGCGAAAGTGGTATCCAGGTGGCGC 
CCGAAACAAAAGGTGGCGCCTTATTTCAGAAATGCAATTGAAAATATCTAAAGTTATTGAAAGTTTTTGAATTATGCATTCAAAGGCCCATCTGC 
TTAATTTTATTTTCAAAGAGAATACAATAAACAATACAAAAAAAAAAAAATCATTCTGAAATGATATTTATACTAATATACAAATTTGATAACTC 
AATAATATGTAAATAAAATTTATGTCTTAATTTGCAATGTTCTTAAGAAAAGTTGTTAACTCTTTTTACAGGCTTGTGCTCCAAATTCTCGTGTA 
AATAGAAATTGAAAATGGATTTCATTGGCATGTCTTGCAGTTATAGGTAGCATGCCAATTTTGATGTTACACAGATGTATCGAAATACCTGAAAC 
GTATACTTTTATTGCAGCAATACGAATAAAGCAAAACACGTGTGAATTCCATTAGTCCGCCCAACCGCTTTCCCCATTTACCCTCACATCGTGCA 
TTTTCCGGCTGTGTAACCCAAGAAACGATGACATAAAAAAAGGAAGCAGATCCAGTTATGTATGCATATATGAGAAAATGTGTTTTTTTTTTTTT 

GGGAGTATGTGCCAAGGTGTGTTTCTTTTT 
(SEQ ID NO: 1249) 

Exon: 6060 . .5983 
Exon: 3533.. 3446 
Exon: 2834.. 1001 

Start ATG: 2757 (Reverse strand: CAT) 
Transcript No. : CT42623 

T AAAT T T G T AAAAAAT C GT C G AAA T C T AT GT G AAAT AGC C AG AAAAAT T G AAAC GAA AGC G C AG C GT T GAT GC AAAAC AAC C C T T C GAAT AC AAA 
GCCATAAGCGACTGCCTTGAACAAATCGGAACAAGAAGAAGAAAGGAGCTCAGACGAAAGGACAGTGCCAGAAAGGCAACGCAAATCCGAGGAAC 
CCGAGGAGTCCGAACTCTACCACACAGAAACCCACAGAAACAGACGTAACAAAATGGAGCTGCGCGTGGGTAACAAATATCGCCTGGGCCGCAAG 
ATAGGATCGGGATCGTTCGGCGACATCTACCTGGGCACCACGATCAACACTGGCGAGGAGGTGGCCATCAAGCTGGAGTGCATCCGCACCAAACA 
CCCCCAGCTGCACATCGAGTCAAAGTTCTACAAGACGATGCAGGGTGGCATAGGCATACCCCGCATAATCTGGTGCGGCAGCGAGGGCGACTACA 
ATGTGATGGTGATGGAGCTACTCGGACCCTCGCTGGAGGACCTCTTCAACTTTTGTTCACGCCGCTTTTCGTTGAAGACGGTTCTGCTGCTGGCG 
GACCAGATGATCTCCCGCATCGATTACATACACTCGCGGGACTTCATCCATCGCGACATCAAGCCGGATAACTTCCTCATGGGTCTTGGCAAAAA 
GGGCAACCTGGTGTACATCATTGACTTTGGCCTGGCCAAGAAATTCCGCGATGCCCGGTCGCTGAAGCACATTCCCTATCGGGAAAACAAGAACC 
TCACGGGCACTGCCCGCTATGCCTCCATCAACACACATTTGGGCATTGAGCAATCGCGTCGTGACGACCTGGAGTCCCTGGGCTACGTCCTAATG 
TACTTCAATCTGGGCGCCTTGCCCTGGCAGGGCTTAAAGGCAGCCAACAAGAGGCAAAAGTACGAGAGGATCTCGGAGAAGAAGCTGTCCACCTC 
GATTGTGGTGCTGTGCAAGGGCTTCCCCAGCGAGTTCGTCAACTATCTGAACTTCTGTCGCCAGATGCATTTCGACCAGCGTCCCGATTACTGCC 
ACCTGCGCAAACTCTTCCGCAACTTGTTCCACCGTTTGGGCTTCACTTACGACTATGTGTTTGACTGGAACCTGCTTAAGTTTGGCGGACCTCGC 
AATCCCCAGGCGATTCAGCAGGCGCAGGACGGAGCGGACGGCCAGGCGGGTCATGATGCGGTGGCCGCAGCAGCGGCGGTGGCAGCAGCGGCAGC 
CGCCTCCTCGCATCAACAACAGCAGCACAAGGTCAATGCGGCGCTGGGTGGCGGAGGAGGCAGTGCAGCGCAACAGCAACTCCAGGGCGGCCAAA 
CGCTGGCGATGCTGGGCGGCAATGGAGGCGGTAACGGCAGCCAACTGATCGGCGGCAACGGACTCAACATGGACGACTCGATGGCGGCCACCAAC 
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TCGTCGAGACCGCCGTACGACACGCCGGAGCGTCGGCCTTCGATACGGATGCGGCAGGGAGGCGGTGGTGGCGGCGGTGGAGTGGGCGTAGGCGG 
TATGCCGAGCGGCGGAGGGGGCGGTGGCGTGGGGAACGCCAAATAATATTTTATCGTTTAGGTTGCGACGCTGGACACGACACAGTAGACAAACA 
ACAACAGAACTCAACAAACTATACATGTAGTATATATAGTTATATATACCTAATATATATAATACTTGCTTTATATATGCGGTAAAACACGTTGA 

AT GT AT C C AAG CGGC AG GAT G AC C AGG T C C C T GC C AG AAAC GG A GG A C AAG AGC AGT G AG AG AAG AAG AGC AGAG C AT AGC AAAGC G C AGC G C T T 
TTCTTTTTTTGTGTGTACATTTTCGTTTGTTTTTTGACATTTTGGTTTCTGCTCCCAGATGTTAGAACGTTGAGCGCGCTTATTATTAATTAACT 
AGCCTATGATTAATTTAATTTATACTATTAGAACATAAACGATTGCAACCTGGCTGCAAGAAACTGCAACAAACCGCACTGTTCAATTGATTTGA 

GATCC 

{SEQ ID NO: 1250) 

Start ATG: 244 (Reverse strand: CAT) 

MELRVGNKYRLGRKIGSGSFGDIYLGTTINTGEEVAIKLECIRTKHPQLHIESKFYKTMQGGIGIPRIIWCGSEGDYNVMVMELLGPSLEDLFNF 
CSRRFSLKTVLLLADQMISRIDYIHSRDFIHRDIKPDNFLMGLGKKGNLVYIIDFGLAKKFRDARSLKHIPYRENKNLTGTARYASINTHLGIEQ 
SRRDDLESLGYVLMYFNLGALPWQGLKAANKRQKYERISEKKLSTSIVVLCKGFPSEFVNYLNFCRQMHFDQRPDYCHLRKLFRNLFHRLGFTYD 
YVFDWNLLKFGGPRNPQAIQQAQDGADGQAGHDAVAAAAAVAAAAAASSHQQQQHKVNAALGGGGGSAAQQQLQGGQTLAMLGGNGGGNGSQLIG 
GNGLNMDD SMAATNS S RP P YDT PERRP S I RMRQGGGGGGGGVGVGGMPS GGGGGGVGNAK * 
(SEQ ID NO: 1251) 

Name: DOUBLE-TIME PROTEIN KINASE (EC 2.7.1.-) 

Classification: protein_kinase 
Celera Sequence No. : 142000013384567 

TTTACGTGTTCTCATAAGATATAGTTTATATTAATGTATATTCATATTTTTACCCATTCATTATCTCTTGGCGCATTTAGAATTTAAGTTTTTTT 
TTTGCATTTTCCTTTTTTATTTTACGGTTTGGTATTACTTATAAACAAATTTCATATTATAGCTATAGTTTATTAGCTAGCGTTGTGTAACTTTC 
CTCCGCAGGGCCATTACTTACATTGTTAAATCGA7AACGTTAATGCAATTGTTTTGTTGTGTATGTCTAACGCAATTTATATGCTCTCTATCTTAG 
TTACATGATTTTGTTGCGTAAACAAAGGGCTGCCCC7AACTCCCAGCTCCCACATCTCGCATCCCATTTGCCCACCGCTCGGGAAATGCGCGCTTA 
AGTGTTTAAAAATCGGAAAGATATATATGAACACAGGCAGC7AAAGTGCAGACCACGAAAGTATATATATATATATATATATATATATATATATAT 
ATATATATATATATATATATATATATATATATATATAAATATATATATATAGATAGGGGAGTAGGCGGTACAGAAGGTTACATAATATAACATAG 
GAGGGCCTTATAAGTCAGAGCGTTGCTTAAAGCTAGAGAACTGCACGTGGTAATCGCAATCATTTTACCATCAACTTAGGTTACAATGTGGGTGC 
CTTGCTGTGTGGCATATGCTT T TG T AT GG AT T GGAT T C GCT AG AGC T AG AGC C T AT AGT C T AGGGT T AG GG AC ACT C GCT G AAC GC G GT GT C CT T 
TGTTTCTTTTACACAAATATTACACTTACAATCGAGTTATTATGTGTTCATGATCGGTTCTGCATGTTCTGCCTGATTTGGTCTGGGACTTCAAG 

TTAACAAAGAAAAAGTTGAGCCGGCCAAGAGCAAATAGTCAAATGCAAAA^ 

TTCCGCGGACCCCTAATCAACCGCCGATCTATTGGCCAATCTTCAGGCAGGGATCTCAAATC7AATTGAACAGTGCGGTTTGTTGCAGTTTCTTGC 
AGCCAGGTTGC7AATCGTTTATGTTCTAATAGTATAAATTAAATTAATCATAGGCTAGTTAATTAATAATAAGCGCGCTCAACGTTCTAACATCTG 
GGAGCAGAAACC7AAAATGTCAAAAAACA7AACGAAAATGTACACACAAAAA7AAGAAAAGCGCTGCGCTTTGCTATGCTCTGCTCTTCTTCTCTCAC 
TGCTCTTGTCCTCCGTTTCTGGCAGGGACCTGGTCATCCTGCCGCTTGGATACATTCAACGTGTTTTACCGCATATAT7AAAGCAAGTATTATATA 
TATTAGGTATATATAACTATATATACTACATGTATAGTTTGTTGAGTTCTGTTGTTGTTTGTCTACTGTGTCGTGTCCAGCGTCGCAACCTAAAC 
GATAAAATATTATTTGGCGTTCCCCACGCCACCGCCCCCTCCGCCGCTCGGCATACCGCCTACGCCCACTCCACCGCCGCCACCACCGCCTCCCT 
GCCGCATCCGTATCG7AAGGCCGACGCTCCGGCGTGTCGTACGGCGGTCTCGACGAGTTGGTGGCCGCCATCGAGTCGTCCATGTTGAGTCCGTTG 
CCGCCGATCAGTTGGCTGCCGTTACCGCCTCCATTGCCGCCCAGCATCGCCAGCGTTTGGCCGCCCTGGAGTTGCTGTTGCGCTGCACTGCCTCC 
TCCGCCACCCAGCGCCGCATTGACCTTGTGCTGCTGTTGTTGATGCGAGGAGGCGGCTGCCGCTGCTGCCACCGCCGCTGCTGCGGCCACCGCAT 
CATGACCCGCCTGGCCGTCCGCTCCGTCCTGCGCCTGCTGAATCGCCTGGGGATTGCGAGGTCCGCCAAACTTAAGCAGGTTCCAGTCAAACACA 
TAGTCGTAAGTGAAGCCCAAACGGTGGAACAAGTTGCGGAAGAGTTTGCGCAGGTGGCAGTAATCGGGACGCTGGTCGAAATGCATCTGGCGACA 
GAAGTTCAGATAGTTGACGAACTCGCTGGGGAAGCCCTTGCACAGCACCACAATCGAGGTGGACAGCTTCTTCTCCGAGATCCTCTCGTACTTTT 
GCCTCTTGTTGGCTGCCTTTAAGCCCTGCCAGGGCAAGGCGCCCAGATTGAAGTACATTAGGACGTAGCCCAGGGACTCCAGGTCGTCACGACGC 
GATTGCTCAATGCCCAAATGTGTGTTGATGGAGGCATAGCGGGCAGTGCCCGTGAGGTTCTTGTTTTCCCGATAGGGAATGTGCTTCAGCGACCG 
GGCATCGCGGAATTTCTTGGCCAGGCCAAAGTCAATGATGTACACCAGGTTGCCCTTTTTGCCAAGACCCATGAGGAAGTTATCCGGCTTGATGT 
CGCGATGGATGAAGTCCCGCGAGTGTATGTAATCGATGCGGGAGATCATCTGGTCCGCCAGCAGCAGAACCGTCTTCAACGA7AAAGCGGCGTGAA 
CAAAAGTTGAAGAGGTCCTCCAGCGAGGGTCCGAGTAGCTCCATCACCATCACATTGTAGTCGCCCTCGCTGCCGCACCAGATTATGCGGGGTAT 
GCCTATGCCACCCTGCATCGTCTTGTAGAACTTTGACTCGATGTGCAGCTGGGGGTGTTTGGTGCGGATGCACTCCAGCTTGATGGCCACCTCCT 
CGCCAGTGTTGATCGTGGTGCCCAGGTAGATGTCGCCGAACGATCCCGATCCTATCTTGCGGCCCAGGCGATATTTGTTACCCACGCGCAGCTCC 
ATTTTGTTACGTCTGTTTCTGTGGGTTTCTGTGTGGTAGAGTTCGGACTCCTCGGGTTCCTCGGATTTGCGTTGCCTTTCTGCGGGACCGGAG7AA 
TTG7AAATTTAGTTTTAATTTTAC7AACACATTGCCTCG7A7A7AAG7AAGGCTGTACGCTTTTCTTACTCATAATACTACACGCTACACG7AATTGATCGA 
TCCCAGTTCAGATTAGCTGCGCTTAAAATATTATACAGATTTA7AATATCTACGCCAAGGACT7A7A7AACTTTTACTTTGTAATCGCGA7AAGGATATT 
GAAAGGTTGCTACGATGCAGGC7AAATTGTTG7AATTCGATCAGAATGTA7AATATTCGTAGTATTCTTAGTTGCTTTGACTAATCATAGAAATAGCA 
GCGGCGTCACAGGTTTTTACGAGTAAATATGAATCAAGTGACTTCGAAAATTCGGCTATTGACCACTAGCAACAATGTGTGATATTATTAGAAGA 
GATTATTTGCAAGCAACAACACAATATCGAACGCCTTGTTAGCGATAGTGATAAAATATTCGTCGGATTTTGTTTGCTACAACTTTTGATTCTTC 
TGACACTATATTGTGACACGTGATGACAACCCTGCTGTTTTGTACAGTTTGCTAA7AAGTCTAATTTTGAGAGCTAACGGAGCTTCGTGAGTATTT 
TCCAACACTTCTCCACTTTTCTCACCTGGCACTGTCCTTTCGTCTGAGCTCCTTTCTTCTTCTTGTTCCGATTTGTTCAAGGCAGTCGCTTATGG 
CTTTGTATTCGAAGGGTTCTGTAATAGGAGGTACTAAGTT7AATTTTTTTTTAGATCCTCAACATGGTTAATCGAAATTCGACCCGTCTCGCGATC 
ACTTATATATAGAGAGTAATCCCCTGTTTTTATCGCATCTGCAGCGTAAGCCCGAAAAACGTGCAGAATTCGCTTTTGTTTGTGGTGATAAGGAT 
TGTGGCACACAGACATG7AAAACAACGCACGGCAACAATCGATATCGAAAATAGAAAACCGTTAGCTAAGAATAAAACAAGAGAGAGAGAGCGGGA 
GAGAAGTAAGCTTGGCTTACGTTGACAACACAAAAATAGCAGAGAAAAGCGTACCAAAAGCAATTCGAA 

ATAAGAACCCGAACCTCGCACAAAGTAAAAGTTTATCTCGTATATTTATATTTAAGAGTGAGTGCGTCATTGCATTCGGCTGTGTGTATTGATTA 
CGAATGGGGAA7AACAACAGATTT7AATAAGTGATGAGTAACGATAATTTATATGCACAGAGC7AAAATAGCACTACTTTTTGACATTCGTCACATTA 
TTGTTTTAGTCATTTTAATATTATATTGTACTATTATAAAATAAAGCAAATGTGCCGTAGTGCTTTTACATCCTTGAATAGATATAATTCTTTTT 
TTCAAGTGTATTTTTTACGTGACGACACCAGAAACATGGAGGTGAATCAGCAATCAGGA7A7AAGAC7AACGAAGAGATTAACAACA7AATAGAT7AAAA 
GAGACGTGTGTTTGTGTATGCGTGGAGTAAACAATCGGCGACAAAAGGGAGAGAAGTAAAATAAAACGCAGCGACTGTGAAAATGCCACTTAAAT 
GACACTTTCAGTGGTGGAAAGCACAAACA7AATAGCGAAGATAAAGCGCATGGAAGTCAGAA7AACGAAGCAACAAGCCGAAAGTTTTCACTTGCCG 
GTCAAATATCACAAAAATTTAGTTTAATTGTTTTCGGCGGCTCTCACTCACT7AACCCCTTCCCACTCCTCTTTCCCACATTCTGTGCGCTTTCTT 
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TTACCGCTCACGCATTTCGATCGCTCTCTCTCTTTCTCTCTCTCTCTCTCTCCTCCTTTTCCCGACTCTTTATCACTCACTCTGCGGTTCTAACC 
TCAAAACCTGCCTGTGTTTGTTGTTGTTTTCGCCTTGGCTTTGATTTATTTAACCGAAATGTACCGTTTTTTATTTGTATCTCCCATGTTCTACA 
CAGAAAGAAATCACTGTATAAGCACTGTAAGTCAAATTGAAATGTGGTATTAATTTTAGGAGCTTTAATTTGTTTACTGGAAGGCTAGCAGGGAA 
AACGCGGTCTTTCCTATTGGTAACTAAATATTTCCCAGGGTATCCAGCAGTTGTTTCGGACCAGCGGGTAAACACGAATTGATTACAATTGCAAA 
ACAGCTGCACGTTGCGTTGGCGGCTGCCGAAAAGCTTCCGCAAAAAAACGCACTGGAGAACAAAGAAGATACAAAAAACAGTTGTTCCAGCAGGC 

G C C AC AAAAGT AAAAAGC T T AAGT AAAC AGG G AAG AGC AAAT G G AAAAAC C AAAAAC C AG AGC AAAAC C AAC GGG GAAAC C T C AG AAGGGG AAG G 
AAACTTGCAATTAATTGTCAAACGCTAATGGCCTCCGAAAACTAACGGCACTCCACGAATGCACTGCGAATAACGGGGCTGATAAGGAGCAGCCG 
ATAGCGGGTGGAGAAGGGAAACTTGAACGCACAGTGGAACAAATTCCGCACTTACCGTGGCTTGTTTTTCTTATATTCCTTGTTTCCTATTCCGC 
TTTGCCCACTTCCTCTGCTGGCGACGCTGCTTTTCCTTCTTCTAATTCGACTGAAATCGAAATCGAATTTTTTAATTGGATTGATTCTGCCACAA 
TAATTTTCATTTTCTTTGTTTTAGAGCCCAGACGTTGTTTTTGTTGTAGTTGCCACAGTCGCACACAAACACACGCACACAGTCGCACTTTTTGG 
GGCCACACGCAAGCGCATTTTATGCAATAATTTTAATAATACTGCATTGCTGCCATATTTCAGTTATTTTTATCGTTTATCTTCGAGTCGGTTAT 
GCTGTTCTTCTCTGATTTTATGGCCACATTGATTAGCATCTTGTTGCCAATCAACTCGAGAAAATTAAACGAAACCAAACGGAACTCGGACTCTT 
ATTAAAAAAACCAAACTTATGCGAGTCGTTTGCCCTGGCAATAGATTCGTGCACACTAGTCGTCCCGCCCGCACTCGCACGGTTTCTACCGCGAG 
CACGTTGCTATCCCTCCTGCTTCAGCATTTTCACACCAACACAAAGACGAGGGCACTCACTCACACACTCGCACTCTCAATGAGCACTGAAATGC 
TGTTGGGAAATTCGAGTTTTCCTGCGCGTGGTAACCGCCGCTTTTTCGCTCGCAAACGGGTCTGTCGATCCGAAACGCTACTGACCACTTACGTT 
TTGCATCAACGCTGCGCTTTCGTTTCAATTTTTCTGGCTATTTCACATAGATTTCGACGATTTTTTACAAATTTACGCCATCTTGATATTCGCAG 
T G T G ACC GCGC CGAT AAT GAAG AAAAT T AC C AT C TGGTATTTACTCCT C AG C G AAT G T C T AG AAGAT AT GT AAT T GT G AAAAAC T GT G AT T CAT A 
TACT AAGT T AAG AT AGT C C AAAGT T GGT T T C GAAGT AC C AC AT T AAG AAAAT AGCGC AAG AGT ATT GG AAAAT CAAAAG AAAAAC GAAAT T AT AT 
CCTTGCAGCTAGCCATAAGTTCAAAGTTCGAGTAACTGGAAAGCTGTGAAATAGCCAAGTTGACAACGCTGTTTTTGTTATATGGGCACACTAGT 
TCTTACATAGACCCAGTGCTGCCAGATGGACGAAAAATATTTTTGAGCTGCGCCACCTAGCGGATGAGTTGGCGAAAGTGGTATCCAGGTGGCGC 
CCGAAACAAAAGGTGGCGCCTTATTTCAGAAATGCAATTGAAAATATCTAAAGTTATTGAAAGTTTTTGAATTATGCATTCAAAGGCCCATCTGC 
TTAATTTTATTTTCAAAGAGAATACAATAAACAATACAAAAAAAAAAAAATCATTCTGAAATGATATTTATACTAATATACAAATTTGATAACTC 
AATAATATGTAAATAAAATTTATGTCTTA 
(SEQ ID NO: 1252) 

Exon: 5679.. 5281 
Exon: 3533.-3446 
Exon: 2834.. 1001 

Start ATG: 2757 (Reverse strand: CAT) 
Transcript No. : CT42625 

T T T CGT T T AAT T T T CT C GAGT T G AT T GGC AAC AAG AT GC T AAT C AAT GT GGC CAT AAAAT C AG AGAAGAAC AGC AT AAC C G AC T CG AAG AT AAAC 
GATAAAAATAACT GAAATATGGC AGC AATGCAGTATTATT AAAAT TAT TGC AT AAAATGCGCTTGCGTGTGGCCCCAAAAAGTGCGACTGTGTGC 
GTGTGTTTGTGTGCGACTGTGGCAACTACAACAAAAACAACGTCTGGGCTCT AAAAC AAAGAAAATGAAAATT ATT GTGGCAGAATCAATCC AAT 
TAAAAAATTCGATTTCGATTTCAGTCGAATTAGAAGAAGGAAAAGCAGCGTCGCCAGCAGAGGAAGTGGGCAAAGCGGAATAGGAAACAAGGAAT 
ATAAGAAAAACAAGCCACGAACCCTTCGAATACAAAGCCATAAGCGACTGCCTTGAACAAATCGGAACAAGAAGAAGAAAGGAGCTCAGACGAAA 
GGACAGTGCCAGAAAGGCAACGCAAATCCGAGGAACCCGAGGAGTCCGAACTCTACCACACAGAAACCCACAGAAACAGACGTAACAAAATGGAG 
CTGCGCGTGGGTAACAAATATCGCCTGGGCCGCAAGATAGGATCGGGATCGTTCGGCGACATCTACCTGGGCACCACGATCAACACTGGCGAGGA 
GGTGGCCATCAAGCTGGAGTGCATCCGCACCAAACACCCCCAGCTGCACATCGAGTCAAAGTTCTACAAGACGATGCAGGGTGGCATAGGCATAC 
C C C G C AT AATC T GG T GC GGC AGC GAG GGC GAC T AC AAT G T GAT GGT GAT GG AGC T AC T C GG ACC C T C GC T GG AGG ACCT C T T C AAC T TT T G T T C A 
CGCCGCTTTTCGTTGAAGACGGTTCTGCTGCTGGCGGACCAGATGATCTCCCGCATCGATTACATACACTCGCGGGACTTCATCCATCGCGACAT 
CAAGCCGGATAACTTCCTCATGGGTCTTGGCAAAAAGGGCAACCTGGTGTACATCATTGACTTTGGCCTGGCCAAGAAATTCCGCGATGCCCGGT 
CGCTGAAGCACATTCCCTATCGGGAAAACAAGAACCTCACGGGCACTGCCCGCTATGCCTCCATCAACACACATTTGGGCATTGAGCAATCGCGT 
CGTGACGACCTGGAGTCCCTGGGCTACGTCCTAATGTACTTCAATCTGGGCGCCTTGCCCTGGCAGGGCTTAAAGGCAGCCAACAAGAGGCAAAA 
GTACGAGAGGATCTCGGAGAAGAAGCTGTCCACCTCGATTGTGGTGCTGTGCAAGGGCTTCCCCAGCGAGTTCGTCAACTATCTGAACTTCTGTC 
GCCAGATGCATTTCGACCAGCGTCCCGATTACTGCCACCTGCGCAAACTCTTCCGCAACTTGTTCCACCGTTTGGGCTTCACTTACGACTATGTG 
TTTGACTGGAACCTGCTTAAGTTTGGCGGACCTCGCAATCCCCAGGCGATTCAGCAGGCGCAGGACGGAGCGGACGGCCAGGCGGGTCATGATGC 
GGTGGCCGCAGCAGCGGCGGTGGCAGCAGCGGCAGCCGCCTCCTCGCATCAACAACAGCAGCACAAGGTCAATGCGGCGCTGGGTGGCGGAGGAG 
GCAGTGCAGCGCAACAGCAACTCCAGGGCGGCCAAACGCTGGCGATGCTGGGCGGCAATGGAGGCGGTAACGGCAGCCAACTGATCGGCGGCAAC 
GGACTCAACATGGACGACTCGATGGCGGCCACCAACTCGTCGAGACCGCCGTACGACACGCCGGAGCGTCGGCCTTCGATACGGATGCGGCAGGG 
AGGCGGTGGTGGCGGCGGTGGAGTGGGCGTAGGCGGTATGCCGAGCGGCGGAGGGGGCGGTGGCGTGGGGAACGCCAAATAATATTTTATCGTTT 
AGGTTGCGACGCTGGACACGACACAGTAGACAAACAACAACAGAACTCAACAAACTATACATGTAGTATATATAGTTATATATACCTAATATATA 
TAATACTTGCTTTATATATGCGGTAAAACACGTTGAATGTATCCAAGCGGCAGGATGACCAGGTCCCTGCCAGAAACGGAGGACAAGAGCAGTGA 
GAGAAGAAGAGCAGAGCATAGCAAAGCGCAGCGCTTTTCTTTTTTTGTGTGTACATTTTCGTTTGTTTTTTGACATTTTGGTTTCTGCTCCCAGA 
TGTTAGAACGTTGAGCGCGCTTATTATTAATTAACTAGCCTATGATTAATTTAATTTATACTATTAGAACATAAACGATTGCAACCTGGCTGCAA 

GAAAC TGC AAC AAACCGC ACT GTTCAATTGATTTGAGATCC 
(SEQ ID NO: 1253) 

Start ATG: 565 (Reverse strand: CAT) 

MELRVGNKYRLGRKIGSGSFGDIYLGTTINTGEEVAIKLECIRTKHPQLHIESKFYKTMQGGIGIPRIIWCGSEGDYNVMVMELLGPSLEDLFNF 
CSRRFSLKTVLLLADQMISRIDYIHSRDFIHRDIKPDNFLMGLGKKGNLVYIIDFGLAKKFRDARSLKHIPYRENKNLTGTARYASINTHLGIEQ 
SRRDDLESLGYVLMYFNLGALPWQGLKAANKRQKYERISEKKLSTSI VVLCKGFPSEFVNYLNFCRQMHFDQRPDYCHLRKLFRNLFHRLGFTYD 
YVFDWNLLKFGGPRNPQAIQQAQDGADGQAGHDAVAAAAAVAAAAAASSHQQQQHKVNAALGGGGGSAAQQQLQGGQTLAMLGGNGGGNGSQLIG 
GNGLNMDDSMAATNSSRPPYDTPERRPSIRMRQGGGGGGGGVGVGGMPSGGGGGGVGNAK* 
(SEQ ID NO: 1254) 

Name: DOUBLE-TIME PROTEIN KINASE (EC 2.7.1.-) 
Classification: protein_kinase 
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Celera Sequence No. : 142000013384244 

TGGACTAAACTCGCAGATTCCTGTAACTTGGAAGCACCACGAGTTTGCTTTAGAAGATGCCTTTCCGTTGAACTAAAACTAGACGAGTAAAGCGT 

ATCTCAAACATTGTATCTTCCGAACCAAAATAGTGCTGCATACTTTTCGGGATATGTCTTATTCAATGTAAGTAATACAAAAAGAAAAAAGGTAA 

CTTCTTAAAACTTGTAATATTTGTCACAAATGAATGACGACTAGATTTGGCCATGTCTGTAACAATAAACGTACAAAATATTCGAAGTTGCAGAA 

ATGGGGATCCAGCATTCGGATTTGGTTTTTTGTCAAACTTTTAAATATAGAATCGGACCACATCTTTTTAAGACCTTAAAAAACTGATTTGAAGT 

TAGACCCCTCATAATGCCAACTAAATTTGTTAAACAAAAATATGCCCCTATATGCCAATCAAATTTAAATACAATGCTTTCTCAGCGTTTCAGTT 

AAATACTTTGGGTTGTTTAATGGTTTTATACGTATACATGTGCTAAGTCCATGTATCTTATATTTCGTAAATCCATAGATTTTGTTCGACGAAAC 

TTTGCTAAATGATATTGATCTTTCAAATATAACTTGTTAATTTCCTTTCATCGCCGTGGCAAATCCTTCGAGAGTGAGTACTACTTGTTAAACGC 

ATATATGAGGAAAATTTATTGATGAATAATGCGAACATGCAAACGGAACACATCCCCGAATTCACACAAAGCATCTCTGTGGTATCTCTTTTGGT 

TTTTTATTTGTTGCATCAACTCCGCGGAACTCCAGCTACCTCACTGCAGGTCGTAGTAGTCGACGTTATTGGCGGTCAGATGTTCTCTATTAATT 

TGGAGCGTGCTGCTGTTTCCACTTCAAACTTTTATTAACATATATAGTATGGGTTTGCCGGTATATTGCAAACGGGAACTAAGTGTAGATTCGTG 

AATGATATCAGACTGGACTGCAAACGTTTAGTTTCTAGCATGTTTTTTTTACTCTCTCCTTTATTTGTTGGAATTTTTGAGTTTTATATAGGTGG 

AGT T T T C AAAAT GAC AT T AAT T C AT AGG GGT G AGT T TT AT AT AT AGT TC G AC T GGCT GTT G C AG GT GCTGGTTTC G AATT T T T GG AT GG GG AT GC 

TAAAGGGCGGCTCCGTTTGTGTTCATCCTCATTTAGCCATCAACCAAGTTCTTACTCGCCATTCTTTTGTTCGTCCTCAGATTCGGATTCTGAAA 

GAGAATAGAATACAAAATGTTAGTATGGGGGCTTCAATTGGTACAAACTGAAAAGATGTGCAGCTCCGGTTATGCTGTCCAAATGATTATTGGCC 

AAATTTAACTATGGAAAGTTATCTGGACACCCAACCATTCAAATGCTTTGGCACAACTCCCCGCTTGGCTCAATCGAATGAAGCAACCATTCGGA 

GCTTTTATGAGGGGATTGTGCATGCGGACTAGGACAAATGCAGCTGTCCAAACGATTATTGGCCAAATTAACCACGGCATTGGGTTATATGGGCA 

CCCAATCATTCTTTCGCTTTTGCATACTGTACATGAACCCTTTAAAACTGGTTCCTAATAATGGGAACACTTTAAAAGCTTGGTAAGAAGCATGC 

TCGGTGAAGTCAGCTATTTGTGTCCAAGTTTTCATTGGCCTTTGATGCTCGGATTTTGTGAGGCAACAAGGCAAGCAAAAACTCAAATGCTCTTG 

CAATTTTGTACAATGAAAATGTCATCCAGGACAAGACCATTCTCAAAAAGTGCACTGTGTAAAGTTTCGAAGGAAGCAAAATTTGAATATTTATA 

TTGTCCAGGTTTTCATTGGCCTACGTTGTTCAGTGAATGGAAATCAACGAGGCAAGCAAAAAACCCAAATTTCTCTTGCAATTACGTACCAGAAT 

C AAGC T AT T T AAG AC AAGAC C ATT CT C AAAAAT T GC ACGCT G AT AACAT T T AT C AAAAC AT AC T AAC CT T C T T C GGCT GAC T G AAGC C AT T C GAC 

AAACTTTCGCATTTGTTCAAGGAAATGCATTTTGCCCTTGTTTGAGTGGCCCTCCTTGTACCAGCGCAAGATGATCTCCTCGGACAGGACCTCGG 

T C T T GT AG AAC AAC AAAAT G AT TT T CT G GAAGGC C T T CAT G AAG T T CAT GTT CT CGT AGC AG AAT T C CT G C AC C T T C AAG AT TAG GGC C AAT TC C 

GAACGGTCGGTCGAGGCGAAAGCCTGCAACAGTGGGCAGTAGTTCTTCAGATGACGAACAGCCTGATCGGTAACCAGCTCCTCCTTCTTGTTCCA 

TTCGCCCAATGACATGATGGTGGACCAAATCTGGTAAAGATCATACATGAGATTTAGTAAGGAAATCATCCAGTTGGATTATTTACGCATCGACT 

TACAATAACAATTATTTCATGATCGGGAATGTTGGTGCGCTGCGAGAAGTCCTTAATGTCTGAGGTTATCTCATTATATGGCTTCTCATCGTTGA 

TATCATCGATGAGCGCCTGCTGCAACTCACGCTTCGCCTCCTGGCTAGCCTGAGCCTTATGTAGCTTGACGATCTCGTTAAGTTCTTTGTCAAGA 

AAAAC T T GT T T AAAAT AT T CC T C AGT AC GT T T GT T C GGT GGG AAAAAGT C C AT AAGT C T GG AAAAAC GT G AAAAAGT GG AACC AT T AT T AAT AAT 

TT AAGT T AAC GC AT T C AAT AAT C AACCT T AAAT GAC GT ATT G AGGT GT AC G AAAT T AAAT C AGC AC C C AAAT G AAGC GAAAGGT CAT T T GC G AAT 

TGGGT AAAAC T CC GT C GAAC T T T C T TAC TC T AGGAAT AT C AATT AAAG AT GC AC AAC T GG AT CAT T G AGT CAT C T T C C T GAG AAC AT T C C GGG GC 

TAAAAACAAATCACAACTAAAAACTTACTTGCTTTCCAGTCCACCCTTCTTAAGCGCCTGAATCAGATATGCAATGCCCTTCTCTTGCTTGAACG 

TCTGGAACAGCTCTAGTAGAAACTCCAATGCAATGCCATCCTTGATTAGGTGCTCGTTGTTTAGTACCAGTAAAACATTTGGCGGTACAGAACCG 

TTGACTGTTGATCAAAAATAGGGTTAAATATAGGGTTAGTCAAATGTTTTCGCAAAACAAGTTTAGACGGCCGGCTAGCAATCCTGTCCTTGGGC 

TCGTGGCCCTTAGATGTGGTTAGTTCATCCAAGAGTCAGTATACCAAGTGAAGGATTACTTTAGCCCCTCAGCTGCAAGTTGAACAAGTGTTTAT 

CGAAACAACTAATTGAAAGACCGAATGGGCAGACGCAAGCAATGCACAGGAGGAACAGGAACATAAATGTTAAGACTTACCTAACCAGAGCGCTG 

TCATACGCGCAAGCTTGATGCGTTCACTAGGAGTAAAGCCCTTTACGAATAATAGAACCTTTCCCATCTCCTCTTCGAACATTTTCTCAAGGTAC 

TTGTACCGTCTGATGAGCTTGACAAATACCTGGAAAAGGTAAAGTGTGATTGCCCCTGTTAGCGGATGGTGAAATTTTACTGTGTTCTCACGCAG 

GAGGCTTTTGTATTTGTTGTTGC AAC ATTAGAAATCTTTAGATTGCTGGCGGT AGT AAT TTT ACAAAAT AAAG AC AAGT AAAAT GTGGCTTTTT A 

TTTTGCATATTTTGACACCGATTGAGTTGTTGAGATGCATAACCGGTTCTAATAATATGCACTTTGATTCAATAGCGTGACCTTCGCAGTGGTTG 

CATGCATATAGTATATATATATAGTGTTGAATAAAGAGTAGTTGTTGACACATAAGAAATCTTCAGAACGTTAGTGTTTAGGACAAACCAAAGAC 

AAGTAAATGTGTTGATTGTTATTGCATATTCTTGACACCGATTACATTTTTGAAACTTTATAGTCAGCTCTAATGATATGCAGCCACTTCACGAA 

TTTCATACTTTTAAGTGGATCCGAAGAAAATCATTTAGTAGTAAATAAACGTTTTAGCTTGGAAAACAACAAGAGCTCAACTTGTGTTGCAATCA 

AACCAATTTTCGTTTACTCATAATCATAATTTAAGTATGACCTGCTAAAAGATTACAAACCTGCTCATGGTTGCGCATGCTCTCCATGCTCTCGG 

GCGCATCGAAGATGCAGTAGCTTGTGCGCGGCTTTTCGCCATCCTGAGATATAGAACCACCAGGCACTGCAATTGAAACGAGGATGGAAAAGGGT 

CAGTGAAGAGACTAAACACATTACTTGCAGCGGAAAACTTACCTAGTAGACCTCCTGCAATCAATATGTCAAAGAGAACTTCACCGTAGCGACGA 

TAGTCGAGCTTGTTGCCCGCGCTGTCTAGATATTTAGAGATTTGTTCCAGGTCGCCTTCAGTTTTTTCGAGACCAGCAATGACCGCATCACGGAA 

TCCCGTTGGGTCATATTTTTCTCTTTCATCTCTTTTTCTGGTCTTGATGCGTTGACCCGATAGCACTGGTCTTTCAGTTTTTTGACTCATACAAT 

AAATTGCTGCAAAGGTAAATGTGGATCAGAGATGGAAGCGGTTTTGGTTTGCCCCTGTAAGTTTTTATTTCTGACCAGACTAAATTATCATGGCC 

ACTAGATTCATTTTTAGATTTATCTTAATTTTAGTGTAAGATGACTAAGGTGGTCTCCAAGCGGTGGAAAACCGTTCCCTGTATATGTATTCTCG 

TTGCGCTCATAATCAAACTACAACATTTTCACCTATGACCGGCGTTTAAGTGGAAACTCTAGTGAAAACTGAGCCTAAGAAAGCCTTAACCTAAA 

AATGCCAGCGTACATGTGCTTTAAACACACACATTCAAGAGCGGAGAACGGGCACACTTCACATCTCTTTTTAATTATCTCTCTTTTTCACACAC 

ACATACCCACTGTGCATTTTACAGTTTGATAATGTCGACAAAAGAAATAAAAAGAATTGAGAGGTGAGGCGACCAAGAAATCGACATAGAAACGC 

AGAATAAAGTGAAATTTTGACGTGGTTACTTGTGTTAGGTAGCGCGCTGGGGAAATGAGGTCATTGTGCATGTAGTGCACATAAAAACGCGCCGA 

TGACTGTGTGCTAGTTGGCCGGGAAAATTGTCATAGCAGCGAAAACGCAAAAATTAAATTATCTAGGCTATCAAGTACACATTGGTACATATAAG 

GAACGGCAACTTACGCTTTCACCCAAGGATATTAAGTCTGGCCCTGTGTTTGCCGCTTCCGCGTGTCCTTGCAATCAATTGAAATCAGTCGTGTG 

CGAGTGT AGT GTGT AT GTGTGTGTGTGTGTGCCTGTTGCTT TTT C AC AACTAACGAACGTGCGAGAATATGTAATTT AATT T AACCT CACGAAGC 

GCTGGCTGGTTAATGCTTCTCACTCGAGTCGACGAAAAACAAAGAGACAGATGGCAGATCCCTTCTTTTCTTACCTCTCTTTTCTTATGAGCGAG 

CTTTTGCACGGTAAAATTGACACGCACGCACACACACAACAACAAAGAAAAGGGGAGAGCAAATAGAAAATATAATTTTTTGTAACGGAATTTAC 

TTTTATGCCTTCAAAATAAACGTTAACAATTTTGCTCAGACTACGTCCGGAATTCTATTTTTAACAGCTATCACAATCGGTTACTAGGTATATGT 

GTTGTTTGCCTCGAATCCAGCGCCCCTCTGCTCGCACCACTTTGTTAAACGAAATTCGGAGAGCGAATGAGACGACACACGAGCGCACGAACGAA 

GCGGACGAGCCGAGAACGAAGACATGTGGTTCGGCGCCATTTTGAAGCGATACACTGACATGGCGAAAGCGGAATGATGGGCACAGGGAAAACGA 

AAATTAGGGGGAAACGGTGAGAAAAAATGTGGGATGGGGGTTCAAAGTGTGTATATTGCTGTCGCGCTTATTAAATAATGAGCAAGGTACAGTCG 

AACAAAGGCCGACGGCCACCAAATGATTTTTTGCATGAATATATGACTGCCCGATGACTAATCTGGCCTTAGGTACGCACGCAGTTCGTACATAA 

CCCCTTTTCCAACGAAAGCGCAGACAAAGGATTGGTCAAGGCCAGAGAAATGACCAGGACCGCGATTTTACTCGCCCTATTTGTCTTTCTGTGTA 

CTTAGGTCTACTCTAGCGCTATTTTTCACTCCTATACTTACGATTAAAACAAAAAAATCTGTTTTGCTGCGAATAAAAGTAACTTTTTGCGTAGC 

TGTTGGGCTGCTTCTTCTGGATGACTTGCGGTATGACCGTGGCCTGCTTCGTACGCAATGGTACAGAGCTCACTAAGGTGTCCAAAAGGTGGCAA 

CATTTTGGTGAAGCTGTTGGCTGGCCCCACTAACCGGACCTGAAAATGGTGCAGCGCTGCCGGACATTTAACGGAATCAAAGATTCCACAAAAGT 

CATGTTTGCATTAACAAAATTCATATTTAACCTTAACAAATTCAACAAATTAAGTTAAGTCCATTTTAGTAGCGTTGCGAGGAGTTTATTTTCGA 

AAATACGCCTTTGTATATATTATATATTTTTCCTCCACTTTAAACAAAATCGAAAGGTGTCCATTTGTCAGCTATTTAAACAATATGGCATCGCC 
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GCCTTACTTGCTCAGTAAGAAAATTTAACTGTTTTTTTTTTGCGCACCTGAAAGCGGCGATAGACATTGTTTACTTTTAGTTTATATTTACATTA 
TCCA 

(SEQ ID NO: 1255) 

Exon: 5654. .5595 
Exon: 5415.. 5145 
Exon: 4471. .4223 
Exon: 4151.. 4051 
Exon: 3449.. 3311 
Exon: 3044.. 2879 
Exon: 2622.. 2379 
Exon: 2310.. 1968 
Exon: 1229.. 1001 

Start ATG: 4460 (Reverse strand: CAT) 
Transcript No. : CT42831 

TTTAACAAAGTGGTGCGAGCAGAGGGGCGCTGGATTCGAGGCAAACAACACATATACCTACTCGCTCATAAGAAAAGAGAGGTAAGAAAAGAAGG 
GATCTGCCATCTGTCTCTTTGTTTTTCGTCGACTCGAGTGAGAAGCATTAACCAGCCAGCGCTTCGTGAGGTTAAATTAAATTACATATTCTCGC 
ACGTTCGTTAGTTGTGAAAAAGCAACAGGCACACACACACACACATACACACTACACTCGCACACGACTGATTTCAATTGATTGCAAGGACACGC 
GGAAGCGGCAAACACAGGGCCAGACTTAATATCCTTGGGTGAAAGCCAATTTATTGTATGAGTCAAAAAACTGAAAGACCAGTGCTATCGGGTCA 
ACGCATCAAGACCAGAAAAAGAGATGAAAGAGAAAAATATGACCCAACGGGATTCCGTGATGCGGTCATTGCTGGTCTCGAAAAAACTGAAGGCG 
ACCTGGAACAAATCTCTAAATATCTAGACAGCGCGGGCAACAAGCTCGACTATCGTCGCTACGGTGAAGTTCTCTTTGACATATTGATTGCAGGA 
GGTCTACTAGTGCCTGGTGGTTCTATATCTCAGGATGGCGAAAAGCCGCGCACAAGCTACTGCATCTTCGATGCGCCCGAGAGCATGGAGAGCAT 
GCGCAACCATGAGCAGGTATTTGTCAAGCTCATCAGACGGTACAAGTACCTTGAGAAAATGTTCGAAGAGGAGATGGGAAAGGTTCTATTATTCG 
TAAAGGGCTTTACTCCTAGTGAACGCATCAAGCTTGCGCGTATGACAGCGCTCTGGTTAGTCAACGGTTCTGTACCGCCAAATGTTTTACTGGTA 
CTAAACAACGAGCACCTAATCAAGGATGGCATTGCATTGGAGTTTCTACTAGAGCTGTTCCAGACGTTCAAGCAAGAGAAGGGCATTGCATATCT 
GATTCAGGCGCTTAAGAAGGGTGGACTGGAAAGCAAACTTATGGACTTTTTCCCACCGAACAAACGTACTGAGGAATATTTTAAACAAGTTTTTC 
TTGACAAAGAACTTAACGAGATCGTCAAGCTACATAAGGCTCAGGCTAGCCAGGAGGCGAAGCGTGAGTTGCAGCAGGCGCTCATCGATGATATC 
AACGATGAGAAGCCATATAATGAGATAACCTCAGACATTAAGGACTTCTCGCAGCGCACCAACATTCCCGATCATGAAATAATTGTTATTATTTG 
GTCCACCATCATGTCATTGGGCGAATGGAACAAGAAGGAGGAGCTGGTTACCGATCAGGCTGTTCGTCATCTGAAGAACTACTGCCCACTGTTGC 
AG GC TTTCGCCTC GAC C GAC CGT T C GG AAT T GGC C C T AAT C T T GAAGGT GC AGG AAT T CTGCT ACG AG AACAT G AAC T T C AT G AAG GC C T T C C AG 
AAAATCATTTTGTTGTTCTACAAGACCGAGGTCCTGTCCGAGGAGATCATCTTGCGCTGGTACAAGGAGGGCCACTCAAACAAGGGCAAAATGCA 
T T T C C T T G AAC AAAT GC GAAAGT T T GT C G AAT GGC T T C AG T C AGC CG AAG AAG AAT C C G AAT CT G AGG AC G AAC AAAAGAAT GG CG AGT AAG AAC 
TTGGTTGATGGCTAAATGAGGATGAACACAAACGGAGCCGCCCTTTAGCATCCCCATCCAAAAATTCGAAACCAGCACCTGCAACAGCCAGTCGA 
ACTATATATAAAACTCACCCCTATGAATTAATGTCATTTTGAAAACTCCACCTATATAAAACTCAAAAATTCCAACAAATAAAGGAGAGAGT 
(SEQ ID NO: 1256) 

Start ATG: 343 (Reverse strand: CAT) 

MSQKTERPVLSGQRIKTRKRDEREKYDPTGFRDAVIAGLEKTEGDLEQISKYLDSAGNKLDYRRYGEVLFDILIAGGLLVPGGSISQDGEKPRTS 
YCIFDAPESMESMRNHEQVFVKLIRRYKYLEKMFEEEMGKVLLFVKGFTPSERIKLARMTALWLVNGSVPPNVLLVLNNEHLIKDGIALEFLLEL 
FQTFKQEKGIAYLIQALKKGGLESKLMDFFPPNKRTEEYFKQVFLDKELNEIVKLHKAQASQEAKRELQQALIDDINDEKPYNEITSDIKDFSQR 
TNIPDHEIIVIIWSTIMSLGEWNKKEELVTDQAVRHLKNYCPLLQAFASTDRSELALILKVQEFCYENMNFMKAFQKIILLFYKTEVLSEEIILR 
WYKEGHSNKGKMHFLEQMRKFVEWLQSAEEESESEDEQKNGE* 
(SEQ ID NO: 1257) 



Celera Sequence No. : 142000013385192 

TGCTTCGACGACCTCCTGGTTGACTACCAGACCAGGTACAAAATCGAAAAGATCAAGGTAATGGGTTGGACCTATATGGTGGCCTGTGGCCTCGA 
AACGGACCACTACACGGACTTCTCCATCGACATTCCGGTCAAGCAACCGGAAGCTGATTCCGAAATTAGACGAGGATCCAGCGGTGACTATTTCC 
CACGACTAGATTAGAGTACAATTGTAAAAGGTACATCTATTCAGTTCTGACTGTCCATTTTGGAAGTACCGAGGACGATGAGATGAGCGGCGATA 
ATGTCAGCCAGCCCTACGCCCAAGTGCAAGATGTCGCCGTCCTGGTGATGACCGAGTTTGCCCTGGACTTACTTCGCATCATGCACGACATTCGG 
TACAACTACGTGTTCTCGGAGTACGATACATTCTTAACGGGGAGCTTGAAGATAGGTGGGCTGCATTACCTTCACCGTTATCCTAATCCATCCAA 
TGATATCTAAGGCATCTCCCATGGATCCGTGATGGCCGGCGTTGTTGGACTCTCGAAACCACACTATGATATCTGGGGTCATACCGTCAATATGG 
CCTCCCGGATGTCCTCCACTGGATTGCTGGACAACATCCAAGTAACTCGGCACACGGCAAAGGTGCTGCGACAATTTAACATCCGCTGCAACTAC 
CGAGGGCACACGGAGGTAAAGGGAGTGGGCAAGGTGCCCACCTACCTGGTGGTCGTAGACCCGGATCTCACATTCCAAGATCACGACCAAGTGAG 
CATGAGTATGAAGGACTCAAAAAGCTGGGTCATCGACACTCTTTCACTGTCATTCACCCCGAGCTTCGTGCCCCCAAAGTCTTCCACTTCGGATG 
AAGAGGAGGCTGAAGGAGAGCTCGAGGAAAATGTATCTGAAGGCGATGAACTGGAGGTCTCGATACATGAGCGCCAAAAGGGAGGCATCTTTCTG 
GCCGGGAACCTTCATTAATTTAAATTTATTTTAATAGATATATATATATATATATATTTATTTAAGCATATATAACACACATATATAGATGTATT 
CGTACAGTTCAAAGCAAATGTTTTTCTTGAGGTTAAATACGTTATGATTTGAGCACTCGCTGAAATCATTGCACAACTGGTGACCAAAGTATACG 
TATGTTCATAATTCAGAGTCATACATAGTCGTTAGGTTATGTACATGCTACAGATAATACACTCATCTTGACGAATCTCCATCTCCATCACTAAG 
AAAGCACCATCCGTGTGATCAGGTTCGGTTTCGTTTGGTTTTGTGCACAGTTTTTAGCTTAAGTAAGCATTGCAGTTGATTTGGGGCATGGGCAT 
AGCCCTTACTCGTTGACAGAACTGGAGTTCAGCAGAGCCGTGTCCTCGGGAATGGGCTTCCGCTTGTAGGAGGGCTCAGTGGCTAGGTAGACAAC 
CAGCGCCCCGAAGACCACGTACATTACTCCAATCACATTGGCCAACACACCAGCCGGTGGAAGAGTGGAGTAAGCCGGAGTTTTGCTAGAAATGT 
CGTCATGGATTAGTCAAGTGATCGATCTAGAATGCATATACACCTACATGGCGAAGATGGCCTTCTCGGTTATGCCCATCAGGGCACTAGCAATA 
GCCAGAACGAATCCAAAGAGTCCGAAGTAGATGTGCAGCGGCATCATGGCGATCCTGTAGTTCTCCCTCAACCCCGGCGCCAAAAAGGCCACAAA 
TCCGGCCACATACTGCAGCGAGAAGACGATCACCGCGGAGAGCCCCAGCCAAGAGTGCAGGGAGTACATGTTGGGAATGGGCGGATTGGCCAAGT 
TGTGGGAATCGAAGACCGTCTTCAGGGCAATCACCGTCAGAATGAATGCACCCATGTGGATGCCGGCGTGGGTAAGCTTCAGAGTCTTCTTGCGC 
GTGGTGCGGAAACCACGATAGATCAGAATGGCTGTGGGATTGAATTGAGTTGGATTACCATTTCGAAGCTAGTCAATGGGAGTGGCGTGGCACCT 
CTTCCGCTGCATTTACTCACAGTTGCCATACAGGTAAATGAAGCCAATGGTCATGAACAGCGGATGCCAGTTGAACTCCACGCCGGGATTTGAGG 
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TGCCGGCCAGACCGCCGAAGTGCTGGCCAATCCAGGTGGCCACCAGGACGATCATGGTCAGGCCGCACAGCTGGGTGAGCACATAGAGCACCTTA 

AAGTTGATCAGCGCTGGATCCATGTTGGCCTTGCCGCTGCCTGAAAAATCCAATAAAAGCTGGGGATGAGCTAAAAATAGAGGCCATTTTACGCA 

CAGCACGCATTGGCGATCTAATCGCATAACTCATTCAAGGGTAGAGATATCGGGGCATCGAAAGGGGAAGGGGATCTCAAGCACTCGTTTATATT 

TTCCATACTTTTGGTTTATAGCCTCTGCTACGTAAACATGATTCAATTACATGTTCCACCTGCCCGCAGATTAAAATAACCCCCAAGCACTTGCT 

GATAACATGCGACCAGAAGTTTAGGGAGCCAATAGGCTTCCCAAAAACCCACTTTAACAGTTGTACATCTGATCCGCTTCGATGGCAGCTGCACT 

CGTTGGGTAAACAAACAGTTCCTTTGCCGCCAGCCATCCGATACTGCGATAAACAAGATCTCATCATATCGACTGGGCCCTGAGTTAATGGGAAC 

GGCCTGTTCCCAGGCACTGGGACACACCCACGACCCCTTTCGCCGGCCACTCACCAGGCGTAATTTGGCTAATTCATGCGTAAACATTTGCGCTT 

GTTTACCCAACATTAATTTTCAACGCCAGAGCGTTCTGCCCACTTTATCTGTCGCCTTATCAGTATGTTCTCCAGATAGTCCGAACATCTGATAT 

TAAATCGCTTTAAATAGCATCGTACAAGTTCCTCGTGCTTGATGTTGCAGCATGATACATTAAATAAATTAAAGTTCGATTCATTTTACGGAACG 

TACGGGTAGTGAAAGTAGCCGTGTTCATAGTTTGGTATAGTATAGATTGGACTTGACTAAGATAGTTTAAAAGGCTAGTAAACTTGTAGGTGCAG 

ATGCACTCGATAAGGGCCGCTATCAGCCGCCCTAATTGGCGTTTGGTATGCAAATGGTGTTTTAATTTCGACTCGGGCGAATAATAAGCAAATGA 

GCCAAACTGCCAAGTGTTGGCTTGTAAATTGGCTGATAAGTGCGCCGGAGACACTTGTAGCTGGCAATCGCAATTATAAATGCTGGTAACTGGCA 

GCAGCCACGCGTCCCCGGTTACTCAATGGGCTCCAGTTGGAGTGCTCCATTTCGACGCAGCTCTGGGTTCCGAGTGAAATTAATTGGCACATTAG 

CTGTGACTATGACTGTTAATTATTAGGGTTAAGTGGGCGTGCCAAAGACGCTGACCAGCAAAGCGTTGCAGCAGATCGTGCGCAAAATTTGAAAT 

AAACAATCCACATAGGCGCACCAAAAGACCGAAAGTAAAACCCGGGCAGATGACTTTGGATGAATTTGGAAAAGCGTACAAGTCCGAAGACAGCG 

GAGCAAAATAAATAAAGTCTGGAAAACCAAAGTGAAATTCAGCTGGTTACTGGTTACTGGTTACTAGTGGCTGCTTGCTGATCGCGGATCGTGAT 

GCCGATGGTGATGGTGAAAACCGCGGCGGGGAAAGGGGACGGCTGCAGTCTGTCTTTCGGTGCGTTTTGCTTTAACTTTTACTATAACTAAACAC 

TCGGCTTGTTGGTTTATTTTACTTGGCACTAATCAAACTTGCAGCCAAAATCCAATGCCATGCACAGTCTACCTGTTGGTGGTGAAGTTGCCGCC 

GCCGTTGGAGTGGCCAACTCAATGCCAGCGATGGTTGTTGCAGGTTGCACTTGTTCTGCTGTCGCAGGTGTTGCAGTTGTTGCTGGTGTCGCTGC 

CGCTGGCTTTTCCCCATTAGTCACCGCGATCACTGTGGTCGCCGTGGAGATGGTGGTCGGAACGGCGGCCTGTGGCTGCTCCGGAATCGCTGGCT 

CCACCTGGACAACTCCGCTCTCCGGCTTGGTCGCCTGCTCCACGTCCGCGTCCTTCGACATGGTCTGTGCACTTTAAATCAATAATTCACCAATG 

GTTCAACCACAAAAAGACATGCAACGTATAATGGACGGTGTTAATTGCCCTAAGACACAGCACAAAAACACTGGTAAATGCTAGTAACTAACTAC 

TAGTAACTAATCCGCACGACACCCCGTTAATACAACCGAACACTTCGGCGTGCAATGCGCAACTGAATTTAAAGTTTTCCAGCCGCCGCCCGGCG 

ATGGTTAGGAAAAGCTTTTTCCGCTCGCAAATCTCTCGCATGCGACATCGCACACACATATTGCTCATTTCGGTGTGGATAATTGTTTGGTTCTC 

GATGCGGATAGGATTCCAGTGGGGTTGGGTTTGCTTTTGACTTTGCCTTTGGCTTTGGGTGCCAACGAATGCTGGTTAGTATGCCGGACCAGGTC 

GATCGCTTCTGGTTGGGAAGGGGGTGCATTGATGATTATTTGCATTTCGCATTCCTCGGATCTATTCAAACCATTGTCACTGCGCGGCATTTCAA 

TTGCGACTTTGACTTTGATGTTCTGTGCATAGTAATGGGATGGTGCAGTGGAACTGCTTCAATTGGAACTACTTTTTGCCAGAAGGAGTGGAATC 

CAAAACGTAAAAGGGTTTACAAAAAGTTTTTGGGTAAAATTTGTTAGTAACAACTATTTCAATATTGTTCTTAGTTTCGAAAGTACATCTACAGT 

AAAGTGAAACCTTTAAGGACAACAAGTAGCTTTCCCTCTTAAAGAGCTTAAGTTCATGCTGTGCAATAATGACTGTATGTGCACTTCAACTCACC 

CAGTGAGAACAACATGGCGTATACTTGATTGATGCTTCACTCTGCTCAGATAATGCCTGTTTCAATACCGCCTGCTCTTATCTTTAGCAGCAATT 

AGTGACTTTAAAGCGATGACGATGGGTTTTTCGCGATTACTTCTGATTTTCGTGGCTACTGCCACGCACTTTACTCTACTTGTTGAACAAACTGC 

TTAAAATGCAGGAAGCTGGGCCTTCCATATGGAGGAAAAGCCGTGGAAATTCTGCACAGGCTGCGTGGGCGTGAAAGCCCTAAAGCCCCGCCCAC 

TCTGCCGCACTTGGCCATTTCATTTGGCATTTCCACGAGCTTTTCGTTGGAAGCTCAGAACTGTAAACGCCGAAGCACACATGCAAACAAACTTT 

CGGGCTCACACCACTGTAGAAAAACCACTTGTTGGATATTATATTAATGGAAAGCCACAACCAAATTATAACTGTTTGTAAACTACATTTAAAGA 

CGTAGTTGAAATAGAAACAGGTAAACTTACAGATTTGAAATGAATCAGTAATCCAATAATGTGTTTTGTTTGGAATATTTCCAAAATGTCTTCAA 

CGGAAGAGGCAAGAACACAAACAGAAACGGCACAAACACAAAGAGATAATTTGGCAGCATAAGAGAGCGGAATGCTGTGCGAATTTACTACGTCA 

GCATTATTATTAACAAGTTAAACTTAAAGCGGCACATTCATTGATACTTAATTTAGTACAGCAAGTCAAGGACACGGTCAAACTTTCACTATCAA 

AGGGTTTTAAACTGTAACTCGATCTGTTAACCGCTTCGATTTTCTGTTTACTCATTTGGAAAAAAGCGGGACAACCAGGGATGCGAGAATCATAC 

TGTAGCTGCGTGTGCTGAAAAAATAGTGTGCATTTAATATACGTTCACATGATGATAGAAAGTACTGACTGTGTGAGTTTTAATTTTAAAAATTT 

ACAAACTTTATATTCATATTTAACTAGAGCTTGGAAAACATTTACTTCGAGACCCTGGATTCCCTTATGCCGCCAGTGTGGCCCATTGAGCACGA 

CTGAAACACGCTAACGCCTTTAGCTCGGCAGCACTGGCAGCTCCAACAAAAAGCTTCAATTTAAAATAATATTTAGGTAGTCCCACAATAAATGC 

TAACAAAATGCT 

{ SEQ ID NO: 1258) 

Exon: 4997.. 4845 
Exon: 2225.. 2016 
Exon : 1931.. 1568 
Exon: 1510.. 1001 

Start ATG: 4860 {Reverse strand: CAT) 
Transcript No. : CT42934 

AGCCACGAAAATCAGAAGTAATCGCGAAAAACCCATCGTCATCGCTTTAAAGTCACTAATTGCTGCTAAAGATAAGAGCAGGCGGTATTGAAACA 
GGCATTATCTGAGCAGAGTGAAGCATCAATCAAGTATACGCCATGTTGTTCTCACTGGGCAGCGGCAAGGCCAACATGGATCCAGCGCTGATCAA 
CTTTAAGGTGCTCTATGTGCTCACCCAGCTGTGCGGCCTGACCATGATCGTCCTGGTGGCCACCTGGATTGGCCAGCACTTCGGCGGTCTGGCCG 
GCACCTCAAATCCCGGCGTGGAGTTCAACTGGCATCCGCTGTTCATGACCATTGGCTTCATTTACCTGTATGGCAACTCCATTCTGATCTATCGT 
GGTTTCCGCACCACGCGCAAGAAGACTCTGAAGCTTACCCACGCCGGCATCCACATGGGTGCATTCATTCTGACGGTGATTGCCCTGAAGACGGT 
CTTCGATTCCCACAACTTGGCCAATCCGCCCATTCCCAACATGTACTCCCTGCACTCTTGGCTGGGGCTCTCCGCGGTGATCGTCTTCTCGCTGC 
AGTATGTGGCCGGATTTGTGGCCTTTTTGGCGCCGGGGTTGAGGGAGAACTACAGGATCGCCATGATGCCGCTGCACATCTACTTCGGACTCTTT 
GGATTCGTTCTGGCTATTGCTAGTGCCCTGATGGGCATAACCGAGAAGGCCATCTTCGCCATCAAAACTCCGGCTTACTCCACTCTTCCACCGGC 
TGGTGTGTTGGCCAATGTGATTGGAGTAATGTACGTGGTCTTCGGGGCGCTGGTTGTCTACCTAGCCACTGAGCCCTCCTACAAGCGGAAGCCCA 
TTCCCGAGGACACGGCTCTGCTGAACTCCAGTTCTGTCAACGAGTAAGGGCTATGCCCATGCCCCAAATCAACTGCAATGCTTACTTAAGCTAAA 
AACTGTGCACAAAACCAAACGAAACCGAACCTGATCACACGGATGGTGCTTTCTTAGTGATGGAGATGGAGATTCGTCAAGATGAGTGTATTATC 
TGTAGCATGTACATAACCTAACGACTATGTATGACTCTGAATTATGAACATACGTATACTTTGGTCACCAGTTGTGCAATGATTTCAGCGAGTGC 
TCAAATCATAACGTATTTAACCTCAAGAAAAACATTTGCTTTGAACTGTACGAATACATCTATATATGTGTGTTATATATGCTTAAATAAATATA 
TA 

(SEQ ID NO: 1259) 

Start ATG: 138 (Reverse strand: CAT) 
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MLFSLGSGKANMDPALINFKVLYVLTQLCGLTMIVLVATWIGQHFGGLAGTSNPGVEFNWHPLFMTIGFIYLYGNSILIYRGFRTTRKKTLKLTH 
AGIHMGAFILTVIALKTVFDSHNLANPPIPNMYSLHSWLGLSAVIVFSLQYVAGFVAFLAPGLRENYRIAMMPLHIYFGLFGFVLAIASALMGIT 
EKAIFAIKTPAYSTLPPAGVLANVIGVMYVVFGALVVYLATEPSYKRKPIPEDTALLNSSSVKE* 
(SEQ ID NO: 1260) 



Celera Sequence No. : 142000013384600 

TCTTTTTAGTAGTACATTTAATGATCACCTAACAGGCAACAAAAATCCTAAAAGGCACTCACTAATAGATTCCCTTCTCGCCAACAGGGATGACC 
GCCTTCGTACTGTGCATTCTGGCCATGTCCTCGGGACTGGCTGCACTTTTCTCCAGCCGCATCAAGAAGCTAATCACACCATTGCTGAACAAGAC 
GTTCCACAACTTCCTGGGATTCGCGTGCTTTGTGATCGCACTGGTGACCCAGTACTATGGCTACCAGACGGGCTACTTTAAGAGCCGAAGTGAGA 
CGGATTTCCAAATCCTGATGAAGTGCCTCACCCTCATATCGTTGGTCCTGTCGAGCTACGGACCGATGAAGGCACTCTATCAGAAATGCAAAAAT 
ATATCGCAACAGTTTTAGATAGGATGAAAAGAATCACTCGAATGTATGTAGATAAATAAATGTAATTTTTTCTAAAACCCCATTTTATCGACCTA 
TCCAACTATACCAACAACGATTCAAAATTTTATACCTGTTTATTGGTACAACGTGTGATCCATAAACATGACTAAGTATTTTTACAATGCCTCAT 
TAGGGAAAACCCGGTTAACCAGCGGTCAGACTTTCATCGTCCGGTACTCCCCCCTGTAAAAGTTATATAACCACCCTTTTGATTGATTTGATTTG 
ATAAGGTCTGAAAAAGTAATGAATTGACTATTAGATACACTGCAATTTGCATTTAAGTTTAACCGCTCACTCAAAAAGCCAGCATTTGAATTAAT 
CAATCAAAATTATGATCATTTATAATTTAAGTTATACTATTATTCATTTTCCTATAACAGCATTGGGTAGCATGCATCATTTTAAAATCCAAAAC 
ATGACCGGGTTTTTTATCTGATCGGTTTGTGACACTTGGAGGCCGCATAATTGATAGTAACTGTCCCAAATAAGTCCAAACTTTTGAAGCCCCGT 
AAAAAGAAGATCTTGACTTCCGGCACCTTAAATTAAAAAACCGTGTATTTTGCATGGTGCAGTTTCGTATGTTTATTTAGAAAATCCATATCATT 
TCTTCTTAAATCTCTCGCAATAAATAATACTTGGCGCGTGTGTCGCGTTGTGTGTGTGATACGTATATCAGTTAGCAGTAATAATAAATAGCTAG 
CGTTAGGTATTAAAATTCATAGTCAATTGGGTAATACAAGGCTGTCTGGCTTTAAAACCCCCTTTCTGGGTAACTTTTGGTAAGAGAAAGTCAAG 
ATCTCTATATAGAATCGGGTGAACAGATCTTTAGCATGCGGTAGGTAAATAGGAGTTATCAGCTATCGACTAAAGCCCAGTTAAGAATTGTGTGT 
TGGGCAAGGCATATTCGAAATTCGACTTTCTATATCTTTTCATAGTTTCCATAGTTCGCTCCGCTCACAGCTTGTTGCTGCGTCGAATGGCGAAG 
GACATTAGCGCGTGCTCCAGCCGCATGTGGTAGGGTAGCTTGCGCTCCAGAATGACTGGCTCCTGGCCATCTGGACAGCACCGGTTGCGGCAGCA 
CTCGCACTGGCCGTCGTCATCGTCGTATTCCTGCTGCAGTTCCTGCTCCAGTTCGTAGGCCTTCTTGGCAGCCCGCTGCTCGAAGCGCAAATAGA 
AGGC C AGGC AGAC AAAC AG C AT AAAT AC C AGG GC AC AC AT GGCT C C AC T GG C AAT GCTCGCCTTCTC C AAC AT C GG AC GC G GCC T C T TGGGC AGG 
TAAGTATTGTCGATCTCCTTGTATTCGCATCGCTGTCCCATAAAGCCAATCGCGCACTCGCAGCTGTAAACCGGTAGATCGGCTATCTTCACCGC 
AAAGCAATGGGCATCGTTCAAACAGTACCAGGCATCGAAGGTTTCCGGACATTTGTATGTGGGGAATGTAATATTGGGCCTGGGCGTGGTCGTCG 
TGGTGGTGCGCATTGTGGTGCTACTAGTGACTGGAGCCTGGGTCGGCGGCAATGCCGTGCCGGACATGGACGAGGAGATCGAAGAGCGCGGCTTG 
GGCACCGTACGGCTGGAGCAGGCCTCGACGTGCCACGGATGTGCCAGACAGCCGATCAGTACCAGGGCAACCAGCCCATGTTGTACACTCATTGT 
GGAATGCATTACAATTTGTAGCTTTTTAGCGTAGCGTATCGGCTTGTTCGTTTTTTGGCGTGCAACGTTTGGTTGTTTAAAAGCTTAAGAATATT 
AAAACTAGTGACTATCACATAAATAAATTAGAATATTCGTTGTCCGCTGCTAGGGCGTCGTCAAATCTCTTTGGAATCGGGTTTCTCTACATAGG 
TATGCTGCTGGTGGAACGTAAACGTTGAGTTAGGGGTGATGTGTGTATCTGTGGGGAAGATGGAAAGACATTAGTGGCCAGTAGTCTCCAAGCAC 
CATCGATGGTTTTTAACTCTAAAACTGTGTACTCAACAGAAGGGGGAAAATCAAACTGCGTAAAATCATAATTATATATGTATGTATTTATAAAA 
G TAT AAAC TAT AAT AG AT C GCT T T G AAT AGT AGT T AT GT T AC AAT C AG C AAAT TTTACTCCTTTCTTCACATC C AAT TCACTTTCG GTT T AAAAT 
T T CAT T T AC GT T GT C T C C C GTT G AT T AT C C T AGC TCAT C AAAGC T GGC T GT GAT AT GC GTT GGG T AT C CAT AT C AC AAC C AC T T GAT GAGGC T T A 
GATAATCAACATCGTACAGGTAAGGAACAGTTTCCTTAACTTTATGAATTAGTAACATGGTTAGTAACAATATTGCACCGCATGTGTGTAAGTCA 
GTTCGATAGGGCCCCCTAAGTCTGAGGTGCCAGTCCACCAGTTGAGCCATTTTTCGGTGCTGTGGCATGCATCTGTCCCCCGTTGTTCCGAATTC 
AGATCTCTGTTAGCCGCGTTTATTTTCCGATTGCATTTCACTACAAATATTATACAGATTTGAGGCGGACATAGTGTGTTTTATTTATGATGGGC 
GTGTGAATTATGTGGCGGGAGGTATTATTATGATTTGATGTACGATACATTACGCTAAGCTCATCAAAGCTGGCTGTGATATGCCGCGCAAAGCG 
GAAATGCATTTCACAACCACTTTGAGAGCCCAGCATGAATGCGCCAATATAGTTTTATTGATTGAGGTTTCGGGATACTTGACTGCGGGAGTTCA 
GAATTTTCAATTTTGGTTTTCATGCTCATGTAATGATATAAACAAAATCAGACACTTGTTACCGTTTTTATACTTTTTGTGTTCAATTATGCTTT 
TTTTTTGGAATTTTCAAATTGTTTTTTGATTGATTTTTTTTTTGCCTTGAGCATCGCGTCGCTCCTCAAAGCTGGCTGTGATATGAATAGGCATA 
AAAAG G C AAAC AT T T C AC AACC A C T T T G AAG AAT GA C AC AAT T C T C CT GT T T C GAT T C GAT GAG AGC C GAG GT G C AG AT T C AG AT T C AG AT T C T T 
CTAGGGGTCTTTTTGCAACGTTCTCACGGCCGTCGGCTTTGAGCCTTGTACTTTTTTGCAGCCACGGCACAATAAACAAATATGAAAAAAGTACG 
T AAAT AAAT AAAT G C G AC G A CG AG AC T AC AAC AC GAT GAT AAT G AC GAT C AT G AC G AC G AC GAC GAT AAT GAT GAT TAT GAT GAT GAGA C T A C T G 
GTACTGGTGATAGATAATACGCAAATGTTGATTGCAATGAATGGCGACAGAGCGAGCGCAAGTGAATGGGACAATTTCGTACATATCTACGCTAT 
AGACTGGCCATTGTTTCAACAACAACAACAAAATTCAACAGACACAAACGCACACAACACCGAAGAACAACACTGAAAGAGGGCAAAAAGCAATG 
CACCCCAAATAATTATAATAAAAATGTAAACATAATTTTGGCGCCAGCCTTTGGCAATTTGACGCCAAATTTGTTTTTAGATGATTTCTGAGTTC 
AAGTTCGGGAATTCAAACTGATAAGAAATGCAAACGATGAATAACAAAACGGAAGAACCGAATTCAGTTGATCCTCCAGAAGATCATTTAAATAA 
TCGATTTTATCGAAAAATTTTAAATGCATTACTGACGAAGTTCAATCACTTTAAAATGTGAGCAAAACAAAAACTTTTTATATGAAAAGACGAAT 
GCTGTTCACCATTTCGAGCGCCCACTGTGGTGCAATGCAAAAACAGTTATACGTTGGGTGTGGAGACTTATTTATGTATTAGAATGCTTTTACGG 
TTAACAATTGCTGCACTCTCTACCCTCTCTTTCTCGCTCACGCGCGGCAAGCGGGCAGCTTATTAAAATTTCCGTTATAAAAATTAAATCCACCG 
CCGCTGCTTTTATGTTCATAAATACATGTGATACGTCGCATGCATGTACACATGCACAAATGTATGTATATACACATGTATGTGCGTTTGTGTGC 
GAGAGGAAGTCGAAGTAAGGTAATGAGCGTAAAGTCAAGTTGAATAGCCATATAAAATGCGCTGAACTTGCGAGCGTGTAGGTCAAGATCAGCGG 
CGGCCGTTTATAATATTTTGTTGGTTTTCGCAAATAGCAATTGACGAAAGTTCGATGATCTTGTAGTACATTTCAATAATGGGGGAATGAGCCAA 
C T AGC AC G AT AGC AAC AAC AGC AAT AT AT AC AAGC AT C AC AAT AAT T AC AAC AGC AAC AAC AAC AAC G AT C AAC TAT AT TAG AGT GTGG AC GGC A 
AAAAGTGTGCATACATACATGACAACAAAAACACCCGCTAATAAAAAATACTGTTGTGGCTGCTTCTTCTTCTGCTCTTGCGTCTTTTACAACTT 
CTTCGGCAAGCCAGTTGTTGTTGCTGTAATTTTCATGTTGCCTTGCTCTCTTTCATTCTTGGGCGTCGCTGCGAGTTCGTTGGCTCTTCTCGTTT 
CTGTATGTGTGTGTGTTTAGAAAACTGTAAGGAAAGCAAAGTGTGTGAAAAATATGGTTAAGGTAAAAAATGCCAATACAGGAACAACAACACAG 
CTCGCTCTCTTTTCTCTGCCTGCAGTTTCGAAAAAAAATTTACCGGTTTTTATGTTTTCGCAAAAAAGCTCAAGAAACAAGTCCTACAAGTTTTA 
AATCGAGAGAGGAGAGAGCAGAAAAGGCGTGAGAGCCCAAAGAATGCGCATTTAGCAGAGACAACAACCTTTTGAGGTGTGTTGCGTATACGTGC 
ATATTCGCACACACACACCACCAACGCTTCAGCTCTCGCTCTCTTTTTCTCGCTTCGCCATCTCACTCTTACTCCATGCCTTTCAGGGGATAAAT 
TATGTATGTATATATGTACCATGCGTGTGTGTGTGCGCGGGAATGTTGTGAGGAATTCTAAGAATTTTGCACAAGCGACAAATGAAACGGGGCGC 
GGACCACACATACATATATGTATATTCCAAAACCAAACAAAAGTTTTAATAGTTCAAAAAGAAAATTGATTATGCGTCGGTATGCGAGAAGC GCC 
AACAGCCGAATGCGAAATATATCACACCAACACAAATACAGTGAAAGCAGAAAAGACGCAGAAGATGGGAAAATTCGAAGACACCGCGTGTAGGC 
GTTAAAACCTATTTTTTTTGCCCAACACCAAGCTAAATGAAACACTGCTCACTAAAACACCAATTCACTATGCTTTATCTTCACAAATTACCAAC 
TGTTTGCATTTAAACTTGGACAACATTAGGGGGCAGAACGCCGACAACGCACACCGCGCACACGAAAAAACACAACCGCACCGCCAAAGATTCGA 
AAACCGAATGCCGAACAACAACAAAATTGGTTCTGAAGATGGAGGTGTTTTTCAGCAGAGCGAGATTCCCTGGCTCACGCTGAGCGGGAGAGAAC 
GAGAGCGACAAAGTCTCTCCTCTAAAAATGCGGCATAATAACAACAAAGTGGCGAGCGGCTTTCGAGCTCATGCTGTATGTTGTTGTATACCCGT 
TACTTGGAGAGTTAATTGGGTATACGGTGATGGGCTCTTAGGCAAAGAATTTTTCCAAAATACACTTTGCATAATTACTCAAATAAATTGCCTCA 
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GAAGCTACTTTACTCGTTTAAATCTTAAATTAATATTATTCCATTAATGTCAAGCCACACAATACAAAATTCAATAGGAATCACTCACCTATAGC 

TGTCATTGCAAACAAATGATATTACCTATTGGACTAAAGGGTATATCTGAGTAGTTACGCTTCGTAAAAAGGGCTCTCTCTTGTTTTGGTTTTTG 

TTTTGGTGTTGCCAGAAGATTTCGCACGATGTAGGATCAACATGTTATTGACACGACGTCGCCGACGCCGCCACTTTTAAGCTTGGCGGAGCAAA 

AGGCACACAAACAAGAAGCGACGCAGAGCGGCGGTGCAAAAGCACAAAAAAAAAGAGGAAAAATGCGCTCGAGAGCGGAGGGCGTGGCGGTGCGC 

AGGGGCGGGGCAGGTAGCCGGACAAACAGGACGAACAGGATGAAACCCAGACGAGTTCGGCCCCACTGCAGATTTCATGGAACTGGCCGAAGGTT 

GAGGATGCGGATGCGATGCGCAGGCGCGCAAGTAAACAACATTCCATTCCTCGGGAAATTTCTTCTGGCCGCTCGAGAAGCGCGCCAAAAATAAA 

GCTCCGACCGATGTCGTTGCCATAAATATTGCTATTTTGACTTTCCAGATTCTCGCGATTCAACGCCCCTTGAAATGGGAGCAACTTGCTGGAGA 

T TC GC GT AT C AG GGC T GG AAG AGG GT TT C T C C AAT AAC T GCAC 

(SEQ ID NO: 1261) 

Exon: 5693. .5602 
Exon: 2328. .1001 

Start ATG: 2087 (Reverse strand: CAT) 
Transcript No. : CT43122 

TTTGGCGGTGCGGTTGTGTTTTTTCGTGTGCGCGGTGTGCGTTGTCGGCGTTCTGCCCCCTAATGTTGTCCAAGTTTAAATGCAAACAGTTGATA 

CACACATCACCCCTAACTCAACGTTTACGTTCCACCAGCAGCATACCTATGTAGAGAAACCCGATTCCAAAGAGATTTGACGACGCCCTAGCAGC 

GGACAACGAATATTCTAATTTATTTATGTGATAGTCACTAGTTTTAATATTCTTAAGCTTTTAAACAACCAAACGTTGCACGCCAAAAAACGAAC 

AAGCCGATACGCTACGCTAAAAAGCTACAAATTGTAATGCATTCCACAATGAGTGTACAACATGGGCTGGTTGCCCTGGTACTGATCGGCTGTCT 

GGCACATCCGTGGCACGTCGAGGCCTGCTCCAGCCGTACGGTGCCCAAGCCGCGCTCTTCGATCTCCTCGTCCATGTCCGGCACGGCATTGCCGC 

C G AC C C AGGC T C C AGT C ACT AGT AGC AC C AC AAT GC GC AC C AC C AC GAC GACC ACG CC C AGGC C C AAT AT T AC AT T C C C C AC AT AC AAAT GT C CG 

GAAACCTTCGATGCCTGGTACTGTTTGAACGATGCCCATTGCTTTGCGGTGAAGATAGCCGATCTACCGGTTTACAGCTGCGAGTGCGCGATTGG 

CTTTATGGGACAGCGATGCGAATACAAGGAGATCGACAATACTTACCTGCCCAAGAGGCCGCGTCCGATGTTGGAGAAGGCGAGCATTGCCAGTG 

GAGCCATGTGTGCCCTGGTATTTATGCTGTTTGTCTGCCTGGCCTTCTATTTGCGCTTCGAGCAGCGGGCTGCCAAGAAGGCCTACGAACTGGAG 

CAGGAACTGCAGCAGGAATACGACGATGACGACGGCCAGTGCGAGTGCTGCCGCAACCGGTGCTGTCCAGATGGCCAGGAGCCAGTCATTCTGGA 

GCGCAAGCTACCCTACCACATGCGGCTGGAGCACGCGCTAATGTCCTTCGCCATTCGACGCAGCAACAAGCTGTGAGCGGAGCGAACTATGGAAA 

CT AT G AAAAG AT AT AG AAAG T C G AAT TT C G AAT AT GC CT T GCC C AAC AC AC AAT TC T T AAC T GGGC T T T AGT C G AT A GC TGAT AAC T C C TAT T T A 

CCTACCGCATGCTAAAGATCTGTTCACCCGATTCTATATAGAGATCTTGACTTTCTCTTACCAAAAGTTACCCAGAAAGGGGGTTTTAAAGCCAG 

ACAGCCTTGTATTACCCAATTGACTATGAATTTTAATACCTAACGCTAGCTATTTATTATTACTGCTAACTGATATACGTATCACACACACAACG 

CGACACACGCGCCAAGTATTATTTATTGCGAGAGATTTAAGAAGAAATGATATGGATTTTCTAAATAAACATACGAAACTGCACCATGCA 

{ SEQ ID NO: 1262) 

Start ATG: 334 (Reverse strand: CAT) 

MSVQHGLVALVLIGCLAHPWHVEACSSRTVPKPRSSISSSMSGTALPPTQAPVTSSTTMRTTTTTTPRPNITFPTYKCPETFDAWYCLNDAHCFA 
VKIADLPVYSCECAIGFMGQRCEYKEIDNTYLPKRPRPMLEKASIASGAMCALVFMLFVCLAFYLRFEQRAAKKAYELEQELQQEYDDDDGQCEC 
CRNRCCPDGQEPVILERKLPYHMRLEHALMSFAIRRSNKL* 
(SEQ ID NO: 1263) 



Celera Sequence No. : 142000013384827 

GTTACGGTGCTTTTTTGCTCGTTTCTTTTGTGCACTCCTTCTATGCGGTGCCAAATATGAGGAGGTACCTGAAGCTTGGTTGATTTTGTGAGTGT 
CTTTTTCCACAGCATATTTTTGTGTATTGGCTACTTTAAAAATTCTATTATATCGTTGTGCTCGTAATGTTGTCCATTCCTTTTAAATATTTTCG 
AATTAAATATACCATAGATGTTTCCCTATTACCATAATAGTTTAATAAATATACGTATATGAAATTTTTTTTATCCTATGTTAGTATAAACATGT 
TATTCAACAAATAAAAGTAGGTCAGAGGTAATCGCCATCCAGCTCCAGAAAGAGGGGGGAGCTGATAAGATTGCGTTTATGTACCTAATGGGTTG 
GGTAAGTGGCTTGGTTTCACAAACACCCTTATTTCAGGTACAAAATAAGTATGTCATAGACGGCGTAAAATTGATAAGATTTTAAAGTCAAAAGA 
AAATGAGTTTATTCGCCTGTAACTGCGTTAGAAATCCAAATACACCCGCAATATTATATGTATAGAAAGAAAAAGTTTGATGCAACAACAAATTT 
AAAGTTGTTTGGTATTTTGTAATGGAAAGTTTGTGTCACTAAATATGGTTTTACTTGAACGCAGATTGGACATGGTCACACTGCCCACAAAATCA 
CGACATTGCGTTATAAATGAAGCACATCTAGTTTTGTGTTGAAAATGTGGTTCACAATTTATATAGCAGGTTATGGTAAAACATCGATCGTTCCG 
AAAAATTCCGCTCGTTGGCACATTTCCTAAATTTTTCGTGCGACAGCGGTGATTAATTACAAAAATTATTATTATAATTATATAAAAGTTGGCGA 
CCATATCAAAAAGGTTGCTATAAAAACAAATGACAATTTATAGCGGAGTTTCACTCCCAAAGCGATGTTCTCCACCCATTCCAGTCAGTCATGGA 
AAATATACCTTCTAAAAAATATATGATTTTTTCGATAGAGATACACGGGCACACTAGACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGC 
GCATATCGTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATACACGGTTAAGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGG 
TTGACACGTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAACAGTGCGGGCGTTTCTCAACATAAGAATACATACAAACTTATGCGT 
GATGGTGAGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAACCGTACAAACGGAAATACATAAAGGTAAAGGGCTCAACAACAGTA 
TTACTGATTAGGCAATTGATTTATATTTCAAACTTCTTAAATAAAGTGTCCACAAATATGTATGTTCTTAGCATAGAGGCGTAGTCGGTGAGTCA 
GCCACGCCAACAGTTAACAGTCGTCATTTTGCTGTCTGTCTAACACGCAATTTGTGTGTGTATATGTGAATAAACAGATACCGGTATGCAATCGC 
AAAGCTGGTCAACTGTAAAGATCGGTATGTACTCAAAATTAGGTCATCACCTTGTTATGCAGATACATTTGGGAAACAGTTGACTTACTTGGTAT 
TGGTTATCAGCCGAAAAGCAAATGGTGATACGGTTTTGCACCTCAGCTTACTCACGGATACATACCTACGTACATAAATGTATGTATACGGTATA 
CAGTCTTAGAAAAAATATTAGTACCGACCAGCTAGCGTTTGCCTAAAACGCTTTTACAAAAATAAACGTATATGTTTTAAGTTGTTTTTTGTCAT 
AATTAATACATTTATTATTATTTATTTATGAATTTCAGAGCTTTAATAGTAACTAAATTGTTTAAACAAACTTAAAAAAAGATTTCCAAATAAAA 
AAATATTTCGTGTGCGTGACGGAATTGCTAGTTCATTAAAGGGAGGTACATATGTACATACATACACATTGCAAAGAGTACAGATCGCTCATCTT 
GCATCTCATTTTTCCTAATGTACATATACATATGTAC71AACATGTTTATTAAACTGCCTACATGCAAATCTTTGAAGTTTTAAGAAAATAAGCAC 
AACTAGTTTGGAAAATCTTGGGAGCCAAGATCTATCTGTCCACTGAATTTCCTATGACCAAGAATGTAAAACCCTTTTTGTTAAACAAATTTTGA 
TTAAAGTATACCAGAATAATATTAAACGTATCGTCCGTATCTATAAAAGGTATTAATAGTATTACCTAAATTTAATTTCTCGGTTTTATTCATAA 
GAGTGAAAAAATATACAAGTTCCTTTTTAATTTTTTACTTATTGAAAATGTGATTAATTTTTATTTATTATTTTCAGTTTAGAATAAATACGGCC 
ACAATTAGTATACATATTTATTCATCGAAACCAACCCACCACAAGAGCATAACCAATTATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCA 
GTGCTGACAGACACCGCCTACAGAAGGACCTTTTTGCTTATCGTAAGGTACAACAAACAATTTTCTCTTTCCATTCTATTTTATTTTTTTTTTTT 
CGTATATGTAGCCGATTGGTATATTTTGTGGGGTGTCATTTATTTTGATCCTGGGTGGTAAATACATTTTTATGGAAGTTAGAAGAAAGCTTATA 
ATTTATTAAGAATAAATAAATACCTCCTTTTCAATCTTTTGTTATGTCTTCTCCTATGTGCATACTTTTGGGCATTAACATTTAGGTAAATTTTT 
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TCCACCGCAACTACAAGTAAACCGCATACACACACAGTATTTTGTAGACTCGCGCCACAAAGCACCGCATAAAAAAGGTTACGGTGCTTTTTTGC 
TCGTTTCTTTTGTGCACTCCTTCTATGCGGTGCCAAATATGAGGAGGTACCTGAAGCTTTGTTGATTTTGTGAGTGTCTTTTTCCACCGCCTGTT 
TTTGGCTACTTTAAAAATTCTATTATATCGTTGTGCTCGTAATGTTGTCCATTCCTTTTAAATATTTTCGAATTAAATATACCATAGATGTTTCC 
CTATTACCATAATAGTTTAATAAATATACGTATATGAAATTTTATAAGAATTTTTTTTATCCCATGTTAGTATAAACATGTTATTAAACAAATAA 
AAGTAGGTCAGAGGTAATCGCCATCCAGCTCCAGAAAGAGGGGGGAGCTGATAAGATTGCTTTTATGTGCTTAATGGGTTGGGTAAATGGCTTGG 
TTTCACAAACACCCTTATTTCAGGCACAAAATAAGTATGTCATAGACGGCGTAAAATTGATAAGATTTTAAAGTCAAAAGAAAAAGAATTTATTC 
GCCTGTAACTGCGTTAGAAATCCAAATACACCTGCAATATGAATCAAACAAAATCAACTCATAACGAGATCTGATAGCAACTTCTTAAAACTTTT 
TTTGTATAAAAATACAATTTATACAATTAGATAAGATTTACCAAACCTTTTTTCGATCAATGTAAATATAGTATGATAGTAGGGACTGCTGTCTA 
CAGTGATGTTTATAAGAGACATACTTAAGAGCAGCATTTAAAGCTTTGGGCGCTTTTTGAACTTAATTTAAATTCAAACCTCAATATTTATTGAG 
GTTTTTATTCCCGTTAATTGTAAAGAAAAATGCTGTATTAGATTCGGTAAAAGTATGTAAATGGTTTAAGAAAGCGTTCCCGACTCTATAAAGTG 
GCAAGTAATCTGACGATTTACCAGATCTTGGGAACTGAAAAAACTAGAATGTTTTGGAAAACCCTTTTACAATTAAGAAATGAACCAAACAATTT 
TGAATTATTAAGAATCTGTAATTGCTTTTTGAAACTTTTTAATATTTTTTTCTTTTAGACGGCACAGCATGGCTTTCCTCATAAGCCTTCGGCTC 
TTGCGTATGATCCAGTTTTGAAACTTATGGCAATAGGGACGCAAACAGGGGCTTTAAAAGTTTTCGGTCAACCCGGAGTTGAATTGTACGGTCAG 
CAT AC T T T GT T AAAC AAT T CAGC AT C GG AGC T T AAT GT AC AAT T AC T T G AAT GGGT GT AT GGAAC T GGTC GC AT AC T T T CG T T GAC G GC AGC GAA 
TCAATTAATTCTATGGGAGCCAGTTGGAGCAACGTTGCTGCCAATCAAAACACTACCGTTTGACGGCAAACTTAAAAAAGTTTCATCGCTGTGCT 
GTTCTCTCAGTAAGGATCTGCTATGGATTGGAACAGAAGGTGGAAACATCTATCAACTGGATTTACATACATTTACCATTAAGGAGCCTGTAATT 
TACCATGACGTTGTGCTAGAGCAGGTGCCACCAGCCTACAAGCTAAATCCTGGTGCAATTGAGTCAATCCGCCAACTTCCAAACTCCCCTAGCAA 
ACTTCTAGTTGCATACAATCGCGGCCTTTGTGTTTTGTGGGATTTTGAAAGCGCATCTGTCCAGCGAGCATACATAGCCCCTGGACATGGACAGA 
GCGTTGGTCTTACAGTGAACTTCGAAGGATCTGAATTTACCTGGTACCACGCTGATGGTTCATACGCCACTTGGAGCATAGATAACCCAGAACCG 
CCGTCGAATGTTAATTATGTGCCTTATGGACCTGATCCATGCAAAAGCATAAATCGACTGTACAAAGGCAAGCGAAGGTTAATATAACTTAAACA 
CAAAGTTTTTTAGATGTGTACTAATTTCTCGTTCTTTTAGATCCAACGATGTAATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTGAT 
CACAATTGTGTGTCCGTTCACGCCAGCGATGGACACAAAGTGTGTCTTGACTTTACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAATAA 
TAGAGATGTCGCTGAAGTTCTTGTTGTACTACTTGAAGAGGAACTCTGCGCTTACGATCTTACTGACCCTAATATTTGTGCTATCAAAGCGCCAT 
ATCTTCACTCTGTCCATGCATCAGCTGTCACTTGCAATTACCTTGCTTCTGAAGTCGTACAGTCGGTATATGAAAGTATTTTAAGAGCTGGAGAT 
GAACAAGACATTGACTATAGCAATATTAGCTGGCCTATCACTGGCGGTACTCTCCCGGATAACTTAGAAGAATCTGTAGAAGAGGACGCGACTAA 
G C T T TAT G AG AT TT T GT T AAC T GGT C AC G AAG AT GG T TC T GT T AAAT T T T GGG AC T GCAC T GGAGTGT T GCT T AAAC C AAT T TAT AAT T T T AAAA 
CTAGCAGCATTTTTGGAAGTGAGTCAGACTTCCGAGATGACGCAGCTGCAGATATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTCGG 
AAATCAGGACTTTTTGATCCTTATTCAGATGACCCTCGTTTAGCAGTGAAGAAAATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGTGG 
CACAGCGGGCCAAATAGTTATAGCCGACTTCATAGACTTACCCGAAAAAGTGTCTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGATG 
GATTTGTGTGGAAGGGTCACGATCAGTTAAACGTGCGATCGAACTTATTAGACGGAGAAGCAATTCCTACGACGGAACGTGGTGTAAATATATCG 
GGAGTACTGCAAGTTTTGCCGCCAGCCAGCATAACATGCATGGCACTCGAAGCAAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTAGT 
TCTCTTTGACTTCAAAAACTTTGTTCCAGTATTTCATCGCTGCACTTTAAACCCAAATGATCTTACTGGAGCAGGAGAGCAGCTGTCTCGTCGAA 
AGTCTTTTAAGAAATCATTGAGGGAGTCATTTAGAAAGCTTCGCAAGGGTCGATCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGGTT 
AGT AAGATAATAT AT ACTCAACTAGTCTTTTATATT AAAT TCAC AAT CCTACAGCT GGAAGC AAGACCCGTCGAGAGGC AAAT AGAGGCTCGTTG 
TGCAGATGACGGGCTAGGATCCATGGTGCGATGTTTACTATTTGCCAAAACTTATGTTACTAATGGTAATCAACATTTTGTACTGTTATTAGTTT 
TGATGTAACGATAAAAAAAAAAAAAGGTAACGGAAATAGTCTCTGAACCACGGCCTCGCGAGAATTGAACCCAAAAGCTAAAAACATACAGTGCT 
TTTCATTTTGGGACGCCCAGAGCTTTAATTATATTTTTGTGCAACAAATTTTTATGCATTGGATATATTGATCAAAGTTTTAAAGGTATATGTAT 
T TAT GAT C AGT T T AAAC AGC C G A C T AC AT T T GAC TAT GT C CGT C C T T AGAAAG AAAT T C AG T C AT T AC AAT AT T C C T T GCT AG T C T AT C GAT AT G 
CTATAAGTGTTAAAAGTGCTTTTGCATTTACTCTGGTTTGACAATATATTTTATTGATATTGATTTCGTTTCATTTACTAGTTGGAAAACAGTTC 
TTAAACTAATTAAATCATAGTCTTTTTGTTTTCGCGATTTCGATCTATATGGCAAATTTTGTTTTTAAAATTTCACCTATCCTCCTACTATTTCT 
GTAATGGGTAACTCGACTGTAGCGTCTATTCTTTTGGTGTTCTGTATATGTACATCTAATTTGTTGTATAATTTTTACATTTTCAGTCAACATAA 
CGTCGCCAACTTTGTGGTCAGCAACAAATGCCAGTACAGTCTCGGTTTTCCTTCTGCATTTGCCACCAGCGCAGACCGCGGCAACTGCCGTCCCG 
TCGGCAAGTGGCAATGCACCACCACACATGCCCCGCCGAATTTCTGCGCAGCTTGCTAAAGAAATACAATTAAAACATCGTGCTCCTGTGGTGGG 
TATTTCTATTTTTGATCAGGCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTGAAAACGGGAGTCCACCGCATCGTGTACTTATTGCTTCCGAGG 
AACAGTTCAAGGTGTTTTCACTTCCGCAACTAAAGCCGATTAACAAATATAAGCTTACCGCTAACGAAGGTGCTCGGATTCGCCGCATCCATTTT 
GGTTCGTTTAGTTGTCGCATATCCCCGGAAACACTGCAGAGTATGCACGGTTGTAGCCCAACTAAGTCCACGCGTTCACATGGCGATGGAGAGGC 
GGATCCTAATATCAGTGGAAGCTTGGCTGTAAGTCGTGGAGATGTATATAACGAAACAGCATTGATATGTTTAACGAATATGGGCGATATCATGG 
TTTTATCAGTACCTGAATTAAAAAGACAGCTGAATGCCGCAGCAGTGCGACGGGAAGACATTAAGTAAGTCACGCATAGTGCTATTATTTTAATT 
TTTTGGACCTAATACTGTATTTTAACTTTTCAGTGGAGTTTCGTCACTTTGCTTTACAAACTCTGGAGAAGCACTGTATATGATGTCTTCTTCTG 
AACTGCAGCGTATTGCTTTAGCCACGTCCAGAGTCGTGCAACCCACTGGCGTTGTTCCAGTAGAACCATTAGAAAATGAAGAGTCTGTGTTGGAA 
GAAAATGATGCAGAGAATAATAAGGAAACCTACGCATGTGATGAAGTTGTGAATACATATGAAATTAAAAATCCATCAGGCATTTCAATATGCAC 
AAGGCCTGCAGAGGAAAACGTTGGAAGAAATAGTGTTCAGCAAGTTAATGGAGTCAACATTTCAAATTCACCTAATCAAGCTAACGAGACTATCA 
GCAGCTCTATTGGCGATATTACCGTTGACTCGGTGCGCGACCATTTAAATATGACGACCACCACTTTGTGTTCTATTAATACAGAGGAAACCATT 
GGTGAGTAATCATTTATTTACTAACATTATTCCACCTATTTTAATTTGCTGTTTTGTACGTATCTACATTTAAATGTTTTCGTTTCGACATTAGT 
TACTTTCGGCAACATAATTCTTATAAAAATTTACTTTTTAAAAACTTGATTTTAAGTATACGCACCAAACAACTTTAAATATATACTTTTATCTT 
ATTTTAAAAGCACGGGCCCACATTATATCTGTACATAACTTTTTTTTTCTGTCATCTCTATGAAAATGTAGGTCGCCTATCTGTACTTAGCACGC 
AAACCAACAAAGCCAGTACTACCGTAAACATGAGTGAAATTCCAAATATTAATATTTCTAATTTAGAGGACTTGGAATCGAAAAGGTGTGCTTCT 
ACAAGAAACATTGACATAAGCTTTAACCTACTAATTTTTTTTTTTTACCTAACAGAAATACGACGGAAACGAGTACTAGTTCTGTTGTAATTAAA 
T C T AT AAT T AC AAAC AT T T C T C ATG AAAAAAC GAAC GG AGAC AAC AAAAT AGG AAC G C C AAAAAC AG C GC C T G AAG AAAGC C AAT T T T AAC AT T G 
ACAGAAGCCGTAACCTACTAATTATTTTATACCTTTACAGAAATACGACGCAAACGAGTACTAGTCCTGATGTAATTTAATCTATAATTACAAAC 
ATGTCTCACTAAAAATCGAACGGGGACAGCAAAATAGGAACGCAAAAAAATTAAATTAAATTAAACGCGATTTACATACAAACAGAAATGACAGA 
ATGATAATATAAAATATTCATTTTTTATTTGGCTTAAGCGATGTTGTTGTTCCAAAACCATATAATTATTTATGATTTTATGTAATGTTTTCATG 
TATTTTCGCGTAGTGACTTTATACCCTTTACCCGTATGATGAAACGAAACATGATATTTGTAGAAAGGTACAAAGGATAAAAGTTGACATTTAAG 
GCCGATTGTAAAATATTAAAACGCAACCTGGATAACATGCCGATTAATTACTTCTGTTCGTCCGTATAAACACAAATACCGATAAATTGGTCCGT 
CCCCATTTAATAAATTGTATATTAAAATGGTCTTTATTTTGGATAAATAATTCAAATTATATAGGAACATTTGGGTTTTAAGGGATAGAAACAGC 
GCTGGTGTCTTTCATTTCCGTTCGTATGACCGTTCAAAGATTATAAAAAAATTGTCGTGCCCACATTTTTTAAATTTCTTTTATTTTCGGTTAAA 
ATATTGTGATACAAAAAAATGCCCTTACAAAATTAGGACAACAACAATTTGCTTTATTATGATGAACAAATTAGGAGTACAAAAAATAGCCTGAT 
GCAAGGATGGGGAGAAAAAAGACCTTGAAGTTCAAAAGAATTTGTTGTATCAAATCCTCAATAAGAAAACTACTTAAAATTAAAAGCGTGTTTCA 
AATTAATTGTAGAATATTTCGTAAAATGAAATTGCAAACTTTTTTAGCACGAAATTACATGTATTTAATTCTTTGGCTAATACGTCAGTTAAAAA 
AGGCTACATTTTGATGATACTCACAAATGTAATTTTTGCATTCTACACAGGTGCAATTAAAAAATTATGGTTTTTATGAGTTTTAGAGGGAAGAA 
AAT A T GT C AAT AT G AG AAG T AA GTT AAT GAG C C T T C AAGT AC T T T AGT AC AT AA T GT AAT T T AAT TT TAG AAA TCCTACGGCTT TAT AT T C AAT T 



FIGURE SHEET 681 



GTAAAGTATTTTAAAATTCCTGACGATTTGTAGTAATTGTTTTGTTCTTAGCTTTTTTGCATTCTCCTCCGTTACTCCGTTACTCGTATAGTAAA 
AGGTATAGTATATACAATAGTATTGTAGATTCGTTGAGAAGTATGTTACAGGCAGAAGGAAGCTTTTCCGACCATATAAAGTATATAATTGAGTA 
TTTAGCTTAGGATTATTAATAAAACATGTATTTTATTTAACAAACTGGCTTTACAATATTTTGTGGAAGAACACCGATCTATTTAGTTGTAGGTC 
TCAAACTGAAAACTCCTAATTTTGACTTATTTAAAGATCATACCTCGGTTTTGAGCCTTAGAGGTTTGTATGAGTGATTTTCTAGAATGATCTTT 
GTACTTCGTTACAATAAATTGTTTTCTTCTACTCTAATATAGTTGGAATCGAGTAATCCCTAGATAAGAGTTAGGCTTAAAATAGTAAATTACAA 
ATATTGAATTGGACATTACAATTGCATATTCTCAGCTTGCTGTATTGTAAAAATTAATTATTAACTCGAAGAGATTCGGCAATTAATACGTCCCT 
ATTCGCATTGTTTTTACTTGGGGAAATTTTATTGAAAAACTAGATTCTCTGTTACAATCAGATATGAGAACAAGTTATTGCACCAGCACATCTCT 
CGGCCAGCTCAATTCAATTAAATAGCTAAGCACTTGTTGCGAATTCATGCTTGGAGTGTTTACGTAAGATTTGTAAAAATAACTAAACTGGATTC 
TCCACATTCTTGCGTAGCTGTGTGATTCTACTGTCACAACTCTCCGTCGACGATTTCTAGAGCCATCTTCTTGCGGAAGTACAACATTTCCGACT 
GGGTAAAGGTAATTGGCACATCACACGTTATATACTCGGCGAACATGCAGCTGAAGATACCGCAATCGCTGCCATCTAACTGTCGTGGTATATTC 
TGCACGCTCTCAATAACAAAATCGCTGGTATCAAACTGCTTTTTGGGCTTGAATATTGACTCTTCGCGTAGATATTTCTCTAGAGCGTCCAGCAC 
TGGTCGGTTTGGCTTTCCCTTTGAGTCATAATACCGGATTGTCTTGTTCCGCAAGTGTATGATGGCCATGCACCAGTGGACGCCGTTGCAGTGCA 
CTGGTACCGGGATTATGTCCTTGCTGAACAAGTCCACTTTGCGAGTCCAGCGCTTAATGCCTGCATGCCCAGCTTGCAGGAGGCGGGGCACGAA 
(SEQ ID NO: 1264) 

Exon: 1001.. 1309 

Exon: 2358.. 2517 

Exon: 3859.. 4637 

Exon: 4696.. 5887 

Exon: 5945.. 6050 

Exon: 6642.. 7284 

Exon: 7349.. 7791 

Exon: 8052.. 8160 

Exon: 8226.. 9734 

Start ATG: 2434 

Transcript No. : CT43225 

ACACTAGACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGCGCATATCGTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATAC 
ACGGTTAAGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGGTTGACACGTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAAC 
AGTGCGGGCGTTTCTCAACATAAGAATACATACAAACTTATGCGTGATGGTGAGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAA 
CCGTACAAACGGAAATACATAAAGTTTAGAATAAATACGGCCACAATTAGTATACATATTTATTCATCGAAACCAACCCACCACAAGAGCATAAC 
CAATTATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGACACCGCCTACAGAAGGACCTTTTTGCTTATCGTAAGACGGCA 
CAGCATGGCTTTCCTCATAAGCCTTCGGCTCTTGCGTATGATCCAGTTTTGAAACTTATGGCAATAGGGACGCAAACAGGGGCTTTAAAAGTTTT 
CGGTCAACCCGGAGTTGAATTGTACGGTCAGCATACTTTGTTAAACAATTCAGCATCGGAGCTTAATGTACAATTACTTGAATGGGTGTATGGAA 
CTGGTCGCATACTTTCGTTGACGGCAGCGAATCAATTAATTCTATGGGAGCCAGTTGGAGCAACGTTGCTGCCAATCAAAACACTACCGTTTGAC 
GGCAAACTTAAAAAAGTTTCATCGCTGTGCTGTTCTCTCAGTAAGGATCTGCTATGGATTGGAACAGAAGGTGGAAACATCTATCAACTGGATTT 
ACATACATTTACCATTAAGGAGCCTGTAATTTACCATGACGTTGTGCTAGAGCAGGTGCCACCAGCCTACAAGCTAAATCCTGGTGCAATTGAGT 
CAATCCGCCAACTTCCAAACTCCCCTAGCAAACTTCTAGTTGCATACAATCGCGGCCTTTGTGTTTTGTGGGATTTTGAAAGCGCATCTGTCCAG 
CGAGCATACATAGCCCCTGGACATGGACAGAGCGTTGGTCTTACAGTGAACTTCGAAGGATCTGAATTTACCTGGTACCACGCTGATGGTTCATA 
C G C C AC T T G GAG C AT AG AT AAC C C AG AAC C GC C G TC G AAT GT T AAT T ATGT G C C T T AT GG AC CT GAT C CAT G C AAAAGC AT AAAT C GAC T GT AC A 
AAGGCAAGCGAAGATCCAACGATGTAATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTGATCACAATTGTGTGTCCGTTCACGCCAGC 
GATGGACACAAAGTGTGTCTTGACTTTACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAATAATAGAGATGTCGCTGAAGTTCTTGTTGT 
ACTACTTGAAGAGGAACTCTGCGCTTACGATCTTACTGACCCTAATATTTGTGCTATCAAAGCGCCATATCTTCACTCTGTCCATGCATCAGCTG 
TCACTTGCAATTACCTTGCTTCTGAAGTCGTACAGTCGGTATATGAAAGTATTTTAAGAGCTGGAGATGAACAAGACATTGACTATAGCAATATT 
AGCTGGCCTATCACTGGCGGTACTCTCCCGGATAACTTAGAAGAATCTGTAGAAGAGGACGCGACTAAGCTTTATGAGATTTTGTTAACTGGTCA 
CGAAGATGGTTCTGTTAAATTTTGGGACTGCACTGGAGTGTTGCTTAAACCAATTTATAATTTTAAAACTAGCAGCATTTTTGGAAGTGAGTCAG 
ACTTCCGAGATGACGCAGCTGCAGATATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTCGGAAATCAGGACTTTTTGATCCTTATTCA 
GATGACCCTCGTTTAGCAGTGAAGAAAATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGTGGCACAGCGGGCCAAATAGTTATAGCCGA 
CTTCATAGACTTACCCGAAAAAGTGTCTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGATGGATTTGTGTGGAAGGGTCACGATCAGT 
TAAACGTGCGATCGAACTTATTAGACGGAGAAGCAATTCCTACGACGGAACGTGGTGTAAATATATCGGGAGTACTGCAAGTTTTGCCGCCAGCC 
AGCATAACATGCATGGCACTCGAAGCAAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTAGTTCTCTTTGACTTCAAAAACTTTGTTCC 
AGTATTTCATCGCTGCACTTTAAACCCAAATGATCTTACTGGAGCAGGAGAGCAGCTGTCTCGTCGAAAGTCTTTTAAGAAATCATTGAGGGAGT 
CATTTAGAAAGCTTCGCAAGGGTCGATCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGCTGGAAGCAAGACCCGTCGAGAGGCAAATA 
GAGGCTCGTTGTGCAGATGACGGGCTAGGATCCATGGTGCGATGTTTACTATTTGCCAAAACTTATGTTACTAATGTCAACATAACGTCGCCAAC 
TTTGTGGTCAGCAACAAATGCCAGTACAGTCTCGGTTTTCCTTCTGCATTTGCCACCAGCGCAGACCGCGGCAACTGCCGTCCCGTCGGCAAGTG 
GCAATGCACCACCACACATGCCCCGCCGAATTTCTGCGCAGCTTGCTAAAGAAATACAATTAAAACATCGTGCTCCTGTGGTGGGTATTTCTATT 
TTTGATCAGGCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTGAAAACGGGAGTCCACCGCATCGTGTACTTATTGCTTCCGAGGAACAGTTCAA 
GGTGTTTTCACTTCCGCAACTAAAGCCGATTAACAAATATAAGCTTACCGCTAACGAAGGTGCTCGGATTCGCCGCATCCATTTTGGTTCGTTTA 
GTTGTCGCATATCCCCGGAAACACTGCAGAGTATGCACGGTTGTAGCCCAACTAAGTCCACGCGTTCACATGGCGATGGAGAGGCGGATCCTAAT 
ATCAGTGGAAGCTTGGCTGTAAGTCGTGGAGATGTATATAACGAAACAGCATTGATATGTTTAACGAATATGGGCGATATCATGGTTTTATCAGT 
ACCTGAATTAAAAAGACAGCTGAATGCCGCAGCAGTGCGACGGGAAGACATTAATGGAGTTTCGTCACTTTGCTTTACAAACTCTGGAGAAGCAC 
TGTATATGATGTCTTCTTCTGAACTGCAGCGTATTGCTTTAGCCACGTCCAGAGTCGTGCAACCCACTGGCGTTGTTCCAGTAGAACCATTAGAA 
AATGAAGAGTCTGTGTTGGAAGAAAATGATGCAGAGAATAATAAGGAAACCTACGCATGTGATGAAGTTGTGAATACATATGAAATTAAAAATCC 
ATCAGGCATTTCAATATGCACAAGGCCTGCAGAGGAAAACGTTGGAAGAAATAGTGTTCAGCAAGTTAATGGAGTCAACATTTCAAATTCACCTA 
ATCAAGCTAACGAGACTATCAGCAGCTCTATTGGCGATATTACCGTTGACTCGGTGCGCGACCATTTAAATATGACGACCACCACTTTGTGTTCT 
ATTAATACAGAGGAAACCATTGGTCGCCTATCTGTACTTAGCACGCAAACCAACAAAGCCAGTACTACCGTAAACATGAGTGAAATTCCAAATAT 
TAATATTTCTAATTTAGAGGACTTGGAATCGAAAAGAAATACGACGGAAACGAGTACTAGTTCTGTTGTAATTAAATCTATAATTACAAACATTT 
CTCATGAAAAAACGAACGGAGACAACAAAATAGGAACGCCAAAAACAGCGCCTGAAGAAAGCCAATTTTAACATTGACAGAAGCCGTAACCTACT 
AATTATTTTATACCTTTACAGAAATACGACGCAAACGAGTACTAGTCCTGATGTAATTTAATCTATAATTACAAACATGTCTCACTAAAAATCGA 
ACGGGGACAGCAAAATAGGAACGCAAAAAAATTAAATTAAATTAAACGCGATTTACATACAAACAGAAATGACAGAATGATAATATAAAATATTC 
ATTTTTTATTTGGCTTAAGCGATGTTGTTGTTCCAAAACCATATAATTATTTATGATTTTATGTAATGTTTTCATGTATTTTCGCGTAGTGACTT 
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TATACCCTTTACCCGTATGATGAAACGAAACATGATATTTGTAGAAAGGTACAAAGGATAAAAGTTGACATTTAAGGCCGATTGTAAAATATTAA 
AACGCAACCTGGATAACATGCCGATTAATTACTTCTGTTCGTCCGTATAAACACAAATACCGATAAATTGGTCCGTCCCCATTTAATAAATTGTA 
TATTAAAATGGTCTTTATTTTGGATAAATAATTCAAATTATATAGGAACATTTGGGTTTTAAGGGATAGAAACAGCGCTGGTGTCTTTCATTTCC 
GTTCGTATGACCGTTCAAAGATTATAAAAAAATTGTCGTGCCCACATTTTTTAAATTTCTTTTATTTTCGGTTAAAATATTGTGATACAAAAAAA 
TGCCCTTACAAAATTAGGACAACAACAATTTGCTTTATTATGATGAACAAATTAGGAGTACAAAAAATAGCCTGATGCAAGGATGGGGAGAAAAA 
AGACCTTGAAGTTCAAAAGAATTTGTTGTATCAAATCCTCAATAAGAAAACTACTTAAAATTAAAAGCGTGTTTCAAATTAATTGTAGAATATTT 
CGTAAAATGAAATTGCAAACTTTTTTAGCACGAAATTACATGTATTTAATTCTTTGGCTAATACGTCAGTTAAAAAAGGCTACATTTTGATGATA 
CTCACAAATGTAATTTTTGCATTCTACACAGGTGCAATTAAAAAATTATGGTTTTTATGAGTTTTAGAGGGAAGAAAATATGTCAATATGAGAAG 
TAAGTTAATGAGCCTTCAAGTACTTTAGTACATAATGTAATTTAATTTTAGAAATCCTACGGCTTTATATTCAATTGTAAAGTATTTTAAAATTC 
CTGACGATTTGTAGTAATTGTTTTGTTCTTAGCTTTTTTGCATTCTCCTCCGTTACTCCGTTACTCGTATAGTAAAAGGTATAGTATATACAATA 
GTATTGTAGATTCGTTGAGAAGTATGTTACAGGCAGAAGGAAGCTTTTCCGACCATATAAAGTATATAATTGAGTATTTAGCTTAGGATTATTAA 
TAAAACATGTATTTTATTTAACAAA 
(SEQ ID NO: 1265) 

Start ATG: 386 

MLKFIRGKGQQPSADRHRLQKDLFAYRKTAQHGFPHKPSALAYDPVLKLMAIGTQTGALKVFGQPGVELYGQHTLLNNSASELNVQLLEWVYGTG 

RILSLTAANQLILWEPVGATLLPIKTLPFDGKLKKVSSLCCSLSKDLLWIGTEGGNIYQLDLHTFTIKEPVIYHDVVLEQVPPAYKLNPGAIESI 

RQLPNSPSKLLVAYNRGLCVLWDFESASVQPJ^YIAPGHGQSVGLTVNFEGSEFTWYHADGSYATWSIDNPEPPSNVNYVPYGPDPCKSINRLYKG 

KRRSNDVIVFSGGMPRSAYGDHNCVSVHAS DGHKVCLDFTSKVIDFFVTFENNRDVAEVLVVLLEEELCAYDLTDPNICAIKAPYLHSVHASAVT 

CNYLASEVVQSVYESILRAGDEQDIDYSNISWPITGGTLPDNLEESVEEDATKLYEILLTGHEDGSVKFWDCTGVLLKPIYNFKTSSIFGSESDF 

RDDAAADMSAEQVDEGEPPFRKSGLFDPYSDDPRLAVKKIAFCPKTGQLIVGGTAGQIVIADFIDLPEKVSLKYISMNLVSDRDGFVWKGHDQLN 

VRSNLLDGEAIPTTERGVNISGVLQVLPPASITCMALEASWGLVSGGTAHGLVLFDFKNFVPVFHRCTLNPNDLTGAGEQLSRRKSFKKSLRESF 

RKLRKGRSTRTNQSNQVPTTLEARPVERQIEARCADDGLGSMVRCLLFAKTYVTNVNITSPTLWSATNASTVSVFLLHLPPAQTAATAVPSASGN 

APPHMPRRISAQLAKEIQLKHRAPVVGISIFDQAGSPVDQLNAGENGSPPHRVLIASEEQFKVFSLPQLKPINKYKLTANEGARIRRIHFGSFSC 

RISPETLQSMHGCSPTKSTRSHGDGEADPNISGSLAVSRGDVYNETALICLTNMGDIMVLSVPELKRQLNAAAVRREDINGVSSLCFTNSGEALY 

MMSSSELQRIALATSRVVQPTGVVPVEPLENEESVLEENDAENNKETYACDEVVNTYEIKNPSGISICTRPAEENVGRNSVQQVNGVNISNSPNQ 

ANETISSSIGDITVDSVRDHLNMTTTTLCSINTEETIGRLSVLSTQTNKASTTVNMSEIPNINISNLEDLESKRNTTETSTSSVVIKSIITNISH 

EKTNGDNKIGTPKTAPEESQF* 

(SEQ ID NO: 1266) 



Celera Sequence No. : 142000013384827 

CGTTTCTTTTGTGCACTCCTTCTATGCGGTGCCAAATATGAGGAGGTACCTGAAGCTTGGTTGATTTTGTGAGTGTCTTTTTCCACAGCATATTT 
TTGTGTATTGGCTACTTTAAAAATTCTATTATATCGTTGTGCTCGTAATGTTGTCCATTCCTTTTAAATATTTTCGAATTAAATATACCATAGAT 
GTTTCCCTATTACCATAATAGTTTAATAAATATACGTATATGAAATTTTTTTTATCCTATGTTAGTATAAACATGTTATTCAACAAATAAAAGTA 
GGTCAGAGGTAATCGCCATCCAGCTCCAGAAAGAGGGGGGAGCTGATAAGATTGCGTTTATGTACCTAATGGGTTGGGTAAGTGGCTTGGTTTCA 
CAAACACCCTTATTTCAGGTACAAAATAAGTATGTCATAGACGGCGTAAAATTGATAAGATTTTAAAGTCAAAAGAAAATGAGTTTATTCGCCTG 
TAACTGCGTTAGAAATCCAAATACACCCGCAATATTATATGTATAGAAAGAAAAAGTTTGATGCAACAACAAATTTAAAGTTGTTTGGTATTTTG 
TAATGGAAAGTTTGTGTCACTAAATATGGTTTTACTTGAACGCAGATTGGACATGGTCACACTGCCCACAAAATCACGACATTGCGTTATAAATG 
AAGCACATCTAGTTTTGTGTTGAAAATGTGGTTCACAATTTATATAGCAGGTTATGGTAAAACATCGATCGTTCCGAAAAATTCCGCTCGTTGGC 
ACATTTCCTAAATTTTTCGTGCGACAGCGGTGATTAATTACAAAAATTATTATTATAATTATATAAAAGTTGGCGACCATATCAAAAAGGTTGCT 
ATAAAAACAAATGACAATTTATAGCGGAGTTTCACTCCCAAAGCGATGTTCTCCACCCATTCCAGTCAGTCATGGAAAATATACCTTCTAAAAAA 
TATATGATTTTTTCGATAGAGATACACGGGCACACTAGACCCACCGATCGCGACTTTGTGTGTGTTAAAATGACGCGCATATCGTTTTGGGCTAA 
TAATATTGTTCACGTTTTATAAAACAGATACACGGTTAAGCTCTGACCGGTCGTATTATTATCTGTGTGTTCTTGGTTGACACGTTTATTTCATT 
TATTCTCATTTAAAGCAACGATGTGAAGAACAGTGCGGGCGTTTCTCAACATAAGAATACATACAAACTTATGCGTGATGGTGAGCAAATAAAAT 
AGTTCAAAAAATTACTCGGCCACCGCCTTAACCGTACAAACGGAAATACATAAAGGTAAAGGGCTCAACAACAGTATTACTGATTAGGCAATTGA 
TTTATATTTCAAACTTCTTAAATAAAGTGTCCACAAATATGTATGTTCTTAGCATAGAGGCGTAGTCGGTGAGTCAGCCACGCCAACAGTTAACA 
GTCGTCATTTTGCTGTCTGTCTAACACGCAATTTGTGTGTGTATATGTGAATAAACAGATACCGGTATGCAATCGCAAAGCTGGTCAACTGTAAA 
GATCGGTATGTACTCAAAATTAGGTCATCACCTTGTTATGCAGATACATTTGGGAAACAGTTGACTTACTTGGTATTGGTTATCAGCCGAAAAGC 
AAATGGTGATACGGTTTTGCACCTCAGCTTACTCACGGATACATACCTACGTACATAAATGTATGTATACGGTATACAGTCTTAGAAAAAATATT 
AGTACCGACCAGCTAGCGTTTGCCTAAAACGCTTTTACAAAAATAAACGTATATGTTTTAAGTTGTTTTTTGTCATAATTAATACATTTATTATT 
ATTTATTTATGAATTTCAGAGCTTTAATAGTAACTAAATTGTTTAAACAAACTTAAAAAAAGATTTCCAAATAAAAAAATATTTCGTGTGCGTGA 
CGGAATTGCTAGTTCATTAAAGGGAGGTACATATGTACATACATACACATTGCAAAGAGTACAGATCGCTCATCTTGCATCTCATTTTTCCTAAT 
GTACATATACATATGTACAAACATGTTTATTAAACTGCCTACATGCAAATCTTTGAAGTTTTAAGAAAATAAGCACAACTAGTTTGGAAAATCTT 
GGGAGCCAAGATCTATCTGTCCACTGAATTTCCTATGACCAAGAATGTAAAACCCTTTTTGTTAAACAAATTTTGATTAAAGTATACCAGAATAA 
TATTAAACGTATCGTCCGTATCTATAAAAGGTATTAATAGTATTACCTAAATTTAATTTCTCGGTTTTATTCATAAGAGTGAAAAAATATACAAG 
TTCCTTTTTAATTTTTTACTTATTGAAAATGTGATTAATTTTTATTTATTATTTTCAGTTTAGAATAAATACGGCCACAATTAGTATACATATTT 
ATTCATCGAAACCAACCCACCACAAGAGCATAACCAATTATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGACACCGCCT 
ACAGAAGGACCTTTTTGCTTATCGTAAGGTACAACAAACAATTTTCTCTTTCCATTCTATTTTATTTTTTTTTTTTCGTATATGTAGCCGATTGG 
TATATTTTGTGGGGTGTCATTTATTTTGATCCTGGGTGGTAAATACATTTTTATGGAAGTTAGAAGAAAGCTTATAATTTATTAAGAATAAATAA 
ATACCTCCTTTTCAATCTTTTGTTATGTCTTCTCCTATGTGCATACTTTTGGGCATTAACATTTAGGTAAATTTTTTCCACCGCAACTACAAGTA 
AACCGCATACACACACAGTATTTTGTAGACTCGCGCCACAAAGCACCGCATAAAAAAGGTTACGGTGCTTTTTTGCTCGTTTCTTTTGTGCACTC 
CTTCTATGCGGTGCCAAATATGAGGAGGTACCTGAAGCTTTGTTGATTTTGTGAGTGTCTTTTTCCACCGCCTGTTTTTGGCTACTTTAAAAATT 
CTATTATATCGTTGTGCTCGTAATGTTGTCCATTCCTTTTAAATATTTTCGAATTAAATATACCATAGATGTTTCCCTATTACCATAATAGTTTA 
ATAAATATACGTATATGAAATTTTATAAGAATTTTTTTTATCCCATGTTAGTATAAACATGTTATTAAACAAATAAAAGTAGGTCAGAGGTAATC 
GCCATCCAGCTCCAGAAAGAGGGGGGAGCTGATAAGATTGCTTTTATGTGCTTAATGGGTTGGGTAAATGGCTTGGTTTCACAAACACCCTTATT 
TCAGGCACAAAATAAGTATGTCATAGACGGCGTAAAATTGATAAGATTTTAAAGTCAAAAGAAAAAGAATTTATTCGCCTGTAACTGCGTTAGAA 
ATCCAAATACACCTGCAATATGAATCAAACAAAATCAACTCATAACGAGATCTGATAGCAACTTCTTAAAACTTTTTTTGTATAAAAATACAATT 
TATACAATTAGATAAGATTTACCAAACCTTTTTTCGATCAATGTAAATATAGTATGATAGTAGGGACTGCTGTCTACAGTGATGTTTATAAGAGA 
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CATACTTAAGAGCAGCATTTAAAGCTTTGGGCGCTTTTTGAACTTAATTTAAATTCAAACCTCAATATTTATTGAGGTTTTTATTCCCGTTAATT 
GTAAAGAAAAATGCTGTATTAGATTCGGTAAAAGTATGTAAATGGTTTAAGAAAGCGTTCCCGACTCTATAAAGTGGCAAGTAATCTGACGATTT 
AC C AG AT C T T G GGAACT G AAAAAAC T AG AAT GT T T T GG AAAA C CC T T T T AC AAT TAAGAAAT G AAC C AAAC AAT T T T GAAT T AT T AAG AAT C T GT 
AATTGCTTTTTGAAACTTTTTAATATTTTTTTCTTTTAGACGGCACAGCATGGCTTTCCTCATAAGCCTTCGGCTCTTGCGTATGATCCAGTTTT 
GAAACTTATGGCAATAGGGACGCAAACAGGGGCTTTAAAAGTTTTCGGTCAACCCGGAGTTGAATTGTACGGTCAGCATACTTTGTTAAACAATT 
CAGCATCGGAGCTTAATGTACAATTACTTGAATGGGTGTATGGAACTGGTCGCATACTTTCGTTGACGGCAGCGAATCAATTAATTCTATGGGAG 
CCAGTTGGAGCAACGTTGCTGCCAATCAAAACACTACCGTTTGACGGCAAACTTAAAAAAGTTTCATCGCTGTGCTGTTCTCTCAGTAAGGATCT 
GCTATGGATTGGAACAGAAGGTGGAAACATCTATCAACTGGATTTACATACATTTACCATTAAGGAGCCTGTAATTTACCATGACGTTGTGCTAG 
AGCAGGTGCCACCAGCCTACAAGCTAAATCCTGGTGCAATTGAGTCAATCCGCCAACTTCCAAACTCCCCTAGCAAACTTCTAGTTGCATACAAT 
CGCGGCCTTTGTGTTTTGTGGGATTTTGAAAGCGCATCTGTCCAGCGAGCATACATAGCCCCTGGACATGGACAGAGCGTTGGTCTTACAGTGAA 
CTTCGAAGGATCTGAATTTACCTGGTACCACGCTGATGGTTCATACGCCACTTGGAGCATAGATAACCCAGAACCGCCGTCGAATGTTAATTATG 
TGCCTTATGGACCTGATCCATGCAAAAGCATAAATCGACTGTACAAAGGCAAGCGAAGGTTAATATAACTTAAACACAAAGTTTTTTAGATGTGT 
ACTAATTTCTCGTTCTTTTAGATCCAACGATGTAATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTGATCACAATTGTGTGTCCGTTC 
ACGCCAGCGATGGACACAAAGTGTGTCTTGACTTTACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAATAATAGAGATGTCGCTGAAGTT 
CTTGTTGTACTACTTGAAGAGGAACTCTGCGCTTACGATCTTACTGACCCTAATATTTGTGCTATCAAAGCGCCATATCTTCACTCTGTCCATGC 
ATCAGCTGTCACTTGCAATTACCTTGCTTCTGAAGTCGTACAGTCGGTATATGAAAGTATTTTAAGAGCTGGAGATGAACAAGACATTGACTATA 
GCAATATTAGCTGGCCTATCACTGGCGGTACTCTCCCGGATAACTTAGAAGAATCTGTAGAAGAGGACGCGACTAAGCTTTATGAGATTTTGTTA 
ACTGGTCACGAAGATGGTTCTGTTAAATTTTGGGACTGCACTGGAGTGTTGCTTAAACCAATTTATAATTTTAAAACTAGCAGCATTTTTGGAAG 
TGAGTCAGACTTCCGAGATGACGCAGCTGCAGATATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTCGGAAATCAGGACTTTTTGATC 
CTTATTCAGATGACCCTCGTTTAGCAGTGAAGAAAATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGTGGCACAGCGGGCCAAATAGTT 
ATAGCCGACTTCATAGACTTACCCGAAAAAGTGTCTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGATGGATTTGTGTGGAAGGGTCA 
C GAT C AGT T AAACGT GC GAT C GAAC T T AT T AG AC GG AGAAGC AAT T CC T AC G ACGG AAC GT G GT GTAAAT AT AT C G GGAGT A CT GC AAGT T T T GC 
CGCCAGCCAGCATAACATGCATGGCACTCGAAGCAAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTAGTTCTCTTTGACTTCAAAAAC 
T T T GT T C C AGT ATT T C AT C GC T GC AC TT T AAAC C C AAAT GAT C T T AC T GG AGCAGGAG AGC AGC T GT C T C GT CGAAAGT CT T T TAAGAAAT C AT T 
GAGGGAGTCATTTAGAAAGCTTCGCAAGGGTCGATCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGGTTAGTAAGATAATATATACTC 
AACTAGTCTTTTATATTAAATTCACAATCCTACAGCTGGAAGCAAGACCCGTCGAGAGGCAAATAGAGGCTCGTTGTGCAGATGACGGGCTAGGA 
TCCATGGTGCGATGTTTACTATTTGCCAAAACTTATGTTACTAATGGTAATCAACATTTTGTACTGTTATTAGTTTTGATGTAACGATAAAAAAA 
AAAAAAGGT AAC GG AAAT AGT C T C T GAAC C AC GGC C T CGC GAG AAT T G AAC CC AAAAG CT AAAAAC AT AC AG T GC T T T T CAT T T T GGG AC GC C C A 
G AGC T T T AAT TAT AT T T T T GT GC AAC AAAT T T T T AT GC AT TGG AT AT AT T GAT C AAAG T T T T AAA GGT AT AT GT AT T T ATG AT C AGT T T AAAC AG 
CCGACTACATTTGACTATGTCCGTCCTTAGAAAGAAATTCAGTCATTACAATATTCCTTGCTAGTCTATCGATATGCTATAAGTGTTAAAAGTGC 
TTTTGCATTTACTCTGGTTTGACAATATATTTTATTGATATTGATTTCGTTTCATTTACTAGTTGGAAAACAGTTCTTAAACTAATTAAATCATA 
GTCTTTTTGTTTTCGCGATTTCGATCTATATGGCAAATTTTGTTTTTAAAATTTCACCTATCCTCCTACTATTTCTGTAATGGGTAACTCGACTG 
TAGCGTCTATTCTTTTGGTGTTCTGTATATGTACATCTAATTTGTTGTATAATTTTTACATTTTCAGTCAACATAACGTCGCCAACTTTGTGGTC 
AGCAACAAATGCCAGTACAGTCTCGGTTTTCCTTCTGCATTTGCCACCAGCGCAGACCGCGGCAACTGCCGTCCCGTCGGCAAGTGGCAATGCAC 
CACCACACATGCCCCGCCGAATTTCTGCGCAGCTTGCTAAAGAAATACAATTAAAACATCGTGCTCCTGTGGTGGGTATTTCTATTTTTGATCAG 
GCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTGAAAACGGGAGTCCACCGCATCGTGTACTTATTGCTTCCGAGGAACAGTTCAAGGTGTTTTC 
ACTTCCGCAACTAAAGCCGATTAACAAATATAAGCTTACCGCTAACGAAGGTGCTCGGATTCGCCGCATCCATTTTGGTTCGTTTAGTTGTCGCA 
TATCCCCGGAAACACTGCAGAGTATGCACGGTTGTAGCCCAACTAAGTCCACGCGTTCACATGGCGATGGAGAGGCGGATCCTAATATCAGTGGA 
AGCTTGGCTGTAAGTCGTGGAGATGTATATAACGAAACAGCATTGATATGTTTAACGAATATGGGCGATATCATGGTTTTATCAGTACCTGAATT 
AAAAAGACAGCTGAATGCCGCAGCAGTGCGACGGGAAGACATTAAGTAAGTCACGCATAGTGCTATTATTTTAATTTTTTGGACCTAATACTGTA 
TTTTAACTTTTCAGTGGAGTTTCGTCACTTTGCTTTACAAACTCTGGAGAAGCACTGTATATGATGTCTTCTTCTGAACTGCAGCGTATTGCTTT 
AGCCACGTCCAGAGTCGTGCAACCCACTGGCGTTGTTCCAGTAGAACCATTAGAAAATGAAGAGTCTGTGTTGGAAGAAAATGATGCAGAGAATA 
ATAAGGAAACCTACGCATGTGATGAAGTTGTGAATACATATGAAATTAAAAATCCATCAGGCATTTCAATATGCACAAGGCCTGCAGAGGAAAAC 
GTTGGAAGAAATAGTGTTCAGCAAGTTAATGGAGTCAACATTTCAAATTCACCTAATCAAGCTAACGAGACTATCAGCAGCTCTATTGGCGATAT 
TACCGTTGACTCGGTGCGCGACCATTTAAATATGACGACCACCACTTTGTGTTCTATTAATACAGAGGAAACCATTGGTGAGTAATCATTTATTT 
ACTAACATTATTCCACCTATTTTAATTTGCTGTTTTGTACGTATCTACATTTAAATGTTTTCGTTTCGACATTAGTTACTTTCGGCAACATAATT 
CTTATAAAAATTTACTTTTTAAAAACTTGATTTTAAGTATACGCACCAAACAACTTTAAATATATACTTTTATCTTATTTTAAAAGCACGGGCCC 
ACATTATATCTGTACATAACTTTTTTTTTCTGTCATCTCTATGAAAATGTAGGTCGCCTATCTGTACTTAGCACGCAAACCAACAAAGCCAGTAC 
TACCGTAAACATGAGTGAAATTCCAAATATTAATATTTCTAATTTAGAGGACTTGGAATCGAAAAGGTGTGCTTCTACAAGAAACATTGACATAA 
GCTTTAACCTACTAATTTTTTTTTTTTACCTAACAGAAATACGACGGAAACGAGTACTAGTTCTGTTGTAATTAAATCTATAATTACAAACATTT 
CT CAT GAAAAAACGAACGGAGACAACAAAATAGGAACGCC AAAAAC AGCGCCTGAAGAAAGCCAATTTT AAC ATT GACAGAAGCCGTAACCT ACT 
AATTATTTTATACCTTTACAGAAATACGACGCAAACGAGTACTAGTCCTGATGTAATTTAATCTATAATTACAAACATGTCTCACTAAAAATCGA 
ACGGGGACAGCAAAATAGGAACGCAAAAAAATTAAATTAAATTAAACGCGATTTACATACAAACAGAAATGACAGAATGATAATATAAAATATTC 
ATTTTTTATTTGGCTTAAGCGATGTTGTTGTTCCAAAACCATATAATTATTTATGATTTTATGTAATGTTTTCATGTATTTTCGCGTAGTGACTT 
TATACCCTTTACCCGTATGATGAAACGAAACATGATATTTGTAGAAAGGTACAAAGGATAAAAGTTGACATTTAAGGCCGATTGTAAAATATTAA 
AACGCAACCTGGATAACATGCCGATTAATTACTTCTGTTCGTCCGTATAAACACAAATACCGATAAATTGGTCCGTCCCCATTTAATAAATTGTA 
TAT TAAAAT GGTCTTT ATT TTGGAT AAAT AATTCAAATTATATAGGAACATTTGGGTTTTAAGGGATAGAAACAGCGCTGGTGTCTTTC ATT TCC 
GTTCGTATGACCGTTCAAAGATTATAAAAAAATTGTCGTGCCCACATTTTTTAAATTTCTTTTATTTTCGGTTAAAATATTGTGATACAAAAAAA 
TGCCCTTACAAAATTAGGACAACAACAATTTGCTTTATTATGATGAACAAATTAGGAGTACAAAAAATAGCCTGATGCAAGGATGGGGAGAAAAA 
AGACCTTGAAGTTCAAAAGAATTTGTTGTATCAAATCCTCAATAAGAAAACTACTTAAAATTAAAAGCGTGTTTCAAATTAATTGTAGAATATTT 
CGTAAAATGAAATTGCAAACTTTTTTAGCACGAAATTACATGTATTTAATTCTTTGGCTAATACGTCAGTTAAAAAAGGCTACATTTTGATGATA 
CTCACAAATGTAATTTTTGCATTCTACACAGGTGCAATTAAAAAATTATGGTTTTTATGAGTTTTAGAGGGAAGAAAATATGTCAATATGAGAAG 
TAAGTTAATGAGCCTTCAAGTACTTTAGTACATAATGTAATTTAATTTTAGAAATCCTACGGCTTTATATTCAATTGTAAAGTATTTTAAAATTC 
CTGACGATTTGTAGTAATTGTTTTGTTCTTAGCTTTTTTGCATTCTCCTCCGTTACTCCGTTACTCGTATAGTAAAAGGTATAGTATATACAATA 
GTATTGTAGATTCGTTGAGAAGTATGTTACAGGCAGAAGGAAGCTTTTCCGACCATATAAAGTATATAATTGAGTATTTAGCTTAGGATTATTAA 
TAAAACATGTATTTTATTTAACAAACTGGCTTTACAATATTTTGTGGAAGAACACCGATCTATTTAGTTGTAGGTCTCAAACTGAAAACTCCTAA 
TTTTGACTTATTTAAAGATCATACCTCGGTTTTGAGCCTTAGAGGTTTGTATGAGTGATTTTCTAGAATGATCTTTGTACTTCGTTACAATAAAT 
TGTTTTCTTCTACTCTAATATAGTTGGAATCGAGTAATCCCTAGATAAGAGTTAGGCTTAAAATAGTAAATTACAAATATTGAATTGGACATTAC 
AATTGCATATTCTCAGCTTGCTGTATTGTAAAAATTAATTATTAACTCGAAGAGATTCGGCAATTAATACGTCCCTATTCGCATTGTTTTTACTT 
GGGGAAATTTTATTGAAAAACTAGATTCTCTGTTACAATCAGATATGAGAACAAGTTATTGCACCAGCACATCTCTCGGCCAGCTCAATTCAATT 
AAATAGCTAAGCACTTGTTGCGAATTCATGCTTGGAGTGTTTACGTAAGATTTGTAAAAATAACTAAACTGGATTCTCCACATTCTTGCGTAGCT 
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GTGTGATTCTACTGTCACAACTCTCCGTCGACGATTTCTAGAGCCATCTTCTTGCGGAAGTACAACATTTCCGACTGGGTAAAGGTAATTGGCAC 

ATCACACGTTATATACTCGGCGAACATGCAGCTGAAGATACCGCAATCGCTGCCATCTAACTGTCGTGGTATATTCTGCACGCTCTCAATAACAA 

AATCGCTGGTATCAAACTGCTTTTTGGGCTTGAATATTGACTCTTCGCGTAGATATTTCTCTAGAGCGTCCAGCACTGGTCGGTTTGGCTTTCCC 

TTTGAGTCATAATACCGGATTGTCTTGTTCCGCAAGTGTATGATGGCCATGCACCAGTGGACGCCGTTGCAGTGCACTGGTACCGGGATTATGTC 

CTTGCTGAACAAGTCCACTTTGCGAGTCCAGCGCTTAATGCCTGCATGCCCAGCTTGCAGGAGGCGGGGCACGAA 

(SEQ ID NO: 1267) 

Exon; 1001.. 1290 

Exon: 1383.. 1525 

Exon: 2339.. 2498 

Exon: 3840.. 4618 

Exon: 4677.. 5868 

Exon: 5926.. 6031 

Exon: 6623.. 7265 

Exon: 7330.. 7772 

Exon: 8033.. 8141 

Exon: 8207.. 9715 

Start ATG: 2415 

Transcript No. : CT43227 

CGACTTTGTGTGTGTTAAAATGACGCGCATATCGTTTTGGGCTAATAATATTGTTCACGTTTTATAAAACAGATACACGGTTAAGCTCTGACCGG 
TCGTATTATTATCTGTGTGTTCTTGGTTGACACGTTTATTTCATTTATTCTCATTTAAAGCAACGATGTGAAGAACAGTGCGGGCGTTTCTCAAC 
ATAAGAATACATACAAACTTATGCGTGATGGTGAGCAAATAAAATAGTTCAAAAAATTACTCGGCCACCGCCTTAACCGTACAAACGGAAATACA 
TAAAGCATAGAGGCGTAGTCGGTGAGTCAGCCACGCCAACAGTTAACAGTCGTCATTTTGCTGTCTGTCTAACACGCAATTTGTGTGTGTATATG 
TGAATAAACAGATACCGGTATGCAATCGCAAAGCTGGTCAACTGTAAAGATCGTTTAGAATAAATACGGCCACAATTAGTATACATATTTATTCA 
TCGAAACCAACCCACCACAAGAGCATAACCAATTATGTTAAAGTTTATCAGAGGAAAAGGGCAGCAGCCCAGTGCTGACAGACACCGCCTACAGA 
AGGACCTTTTTGCTTATCGTAAGACGGCACAGCATGGCTTTCCTCATAAGCCTTCGGCTCTTGCGTATGATCCAGTTTTGAAACTTATGGCAATA 
GGGACGCAAACAGGGGCTTTAAAAGTTTTCGGTCAACCCGGAGTTGAATTGTACGGTCAGCATACTTTGTTAAACAATTCAGCATCGGAGCTTAA 
T GT AC AAT T AC T TGAAT GGGT GT AT GGAAC T GGT C GC AT ACT T T C GT T GAC GGC AGC G AAT C AAT T AATT C T AT GGG AGCC AGT T GG AGC AAC GT 
TGCTGCCAATCAAAACACTACCGTTTGACGGCAAACTTAAAAAAGTTTCATCGCTGTGCTGTTCTCTCAGTAAGGATCTGCTATGGATTGGAACA 
GAAGGTGGAAACATCTATCAACTGGATTTACATACATTTACCATTAAGGAGCCTGTAATTTACCATGACGTTGTGCTAGAGCAGGTGCCACCAGC 
CTACAAGCTAAATCCTGGTGCAATTGAGTCAATCCGCCAACTTCCAAACTCCCCTAGCAAACTTCTAGTTGCATACAATCGCGGCCTTTGTGTTT 
TGTGGGATTTTGAAAGCGCATCTGTCCAGCGAGCATACATAGCCCCTGGACATGGACAGAGCGTTGGTCTTACAGTGAACTTCGAAGGATCTGAA 
TTTACCTGGTACCACGCTGATGGTTCATACGCCACTTGGAGCATAGATAACCCAGAACCGCCGTCGAATGTTAATTATGTGCCTTATGGACCTGA 
TCCATGCAAAAGCATAAATCGACTGTACAAAGGCAAGCGAAGATCCAACGATGTAATTGTTTTTTCCGGCGGCATGCCACGGTCAGCATATGGTG 
ATCACAATTGTGTGTCCGTTCACGCCAGCGATGGACACAAAGTGTGTCTTGACTTTACGTCTAAAGTGATTGACTTTTTTGTGACCTTTGAAAAT 
AAT AG AG AT GT C GC T G AAGT T C T T GT TGT AC T AC T T G AAG AGG AAC TCTGCGCTTACGATCTTACTGACC C T AAT AT T T GT GC T AT C AAA GC G CC 
AT AT C T T C AC T C TG T C C AT G CAT C AGCT GT C AC T T GC AAT T AC C T T GC TT C TG AAGT C GT AC AGT CGG T AT AT G AAAGT AT T T T AAGAGCT GG AG 
ATGAACAAGACATTGACTATAGCAATATTAGCTGGCCTATCACTGGCGGTACTCTCCCGGATAACTTAGAAGAATCTGTAGAAGAGGACGCGACT 
AAGCTTTATGAGATTTTGTTAACTGGTCACGAAGATGGTTCTGTTAAATTTTGGGACTGCACTGGAGTGTTGCTTAAACCAATTTATAATTTTAA 
AACTAGCAGCATTTTTGGAAGTGAGTCAGACTTCCGAGATGACGCAGCTGCAGATATGAGTGCGGAACAAGTCGATGAAGGAGAACCGCCATTTC 
GGAAATCAGGACTTTTTGATCCTTATTCAGATGACCCTCGTTTAGCAGTGAAGAAAATAGCATTCTGCCCAAAAACCGGACAACTTATTGTTGGT 
GGCACAGCGGGCCAAATAGTTATAGCCGACTTCATAGACTTACCCGAAAAAGTGTCTTTAAAATACATTTCAATGAATTTGGTCAGCGATCGTGA 
TGGATTTGTGTGGAAGGGTCACGATCAGTTAAACGTGCGATCGAACTTATTAGACGGAGAAGCAATTCCTACGACGGAACGTGGTGTAAATATAT 
CGGGAGTACTGCAAGTTTTGCCGCCAGCCAGCATAACATGCATGGCACTCGAAGCAAGCTGGGGCCTAGTATCTGGTGGGACTGCGCACGGCTTA 
GTTCTCTTTGACTTCAAAAACTTTGTTCCAGTATTTCATCGCTGCACTTTAAACCCAAATGATCTTACTGGAGCAGGAGAGCAGCTGTCTCGTCG 
AAAGTCTTTTAAGAAATCATTGAGGGAGTCATTTAGAAAGCTTCGCAAGGGTCGATCGACCAGGACCAACCAGAGCAATCAAGTACCAACAACGC 
TGGAAGCAAGACCCGTCGAGAGGCAAATAGAGGCTCGTTGTGCAGATGACGGGCTAGGATCCATGGTGCGATGTTTACTATTTGCCAAAACTTAT 
GTTACTAATGTCAACATAACGTCGCCAACTTTGTGGTCAGCAACAAATGCCAGTACAGTCTCGGTTTTCCTTCTGCATTTGCCACCAGCGCAGAC 
CGCGGCAACTGCCGTCCCGTCGGCAAGTGGCAATGCACCACCACACATGCCCCGCCGAATTTCTGCGCAGCTTGCTAAAGAAATACAATTAAAAC 
ATCGTGCTCCTGTGGTGGGTATTTCTATTTTTGATCAGGCGGGTAGCCCTGTCGATCAGCTGAACGCCGGTGAAAACGGGAGTCCACCGCATCGT 
GTACTTATTGCTTCCGAGGAACAGTTCAAGGTGTTTTCACTTCCGCAACTAAAGCCGATTAACAAATATAAGCTTACCGCTAACGAAGGTGCTCG 
GATTCGCCGCATCCATTTTGGTTCGTTTAGTTGTCGCATATCCCCGGAAACACTGCAGAGTATGCACGGTTGTAGCCCAACTAAGTCCACGCGTT 
CACATGGCGATGGAGAGGCGGATCCTAATATCAGTGGAAGCTTGGCTGTAAGTCGTGGAGATGTATATAACGAAACAGCATTGATATGTTTAACG 
AATATGGGCGATATCATGGTTTTATCAGTACCTGAATTAAAAAGACAGCTGAATGCCGCAGCAGTGCGACGGGAAGACATTAATGGAGTTTCGTC 
ACTTTGCTTTACAAACTCTGGAGAAGCACTGTATATGATGTCTTCTTCTGAACTGCAGCGTATTGCTTTAGCCACGTCCAGAGTCGTGCAACCCA 
CTGGCGTTGTTCCAGTAGAACCATTAGAAAATGAAGAGTCTGTGTTGGAAGAAAATGATGCAGAGAATAATAAGGAAACCTACGCATGTGATGAA 
GTTGTGAATACATATGAAATTAAAAATCCATCAGGCATTTCAATATGCACAAGGCCTGCAGAGGAAAACGTTGGAAGAAATAGTGTTCAGCAAGT 
TAATGGAGTCAACATTTCAAATTCACCTAATCAAGCTAACGAGACTATCAGCAGCTCTATTGGCGATATTACCGTTGACTCGGTGCGCGACCATT 
TAAATATGACGACCACCACTTTGTGTTCTATTAATACAGAGGAAACCATTGGTCGCCTATCTGTACTTAGCACGCAAACCAACAAAGCCAGTACT 
ACCGTAAACATGAGTGAAATTCCAAATATTAATATTTCTAATTTAGAGGACTTGGAATCGAAAAGAAATACGACGGAAACGAGTACTAGTTCTGT 
T GT AAT T AAAT C TAT AAT T AC AAAC AT T T C T CAT G AAAAAAC GAAC G GAG AC AAC AAAAT AGG AACGC C AAAAAC A GC G C C T G AA GAAAGCC AA T 
TTTAACATTGACAGAAGCCGTAACCTACTAATTATTTTATACCTTTACAGAAATACGACGCAAACGAGTACTAGTCCTGATGTAATTTAATCTAT 
AATTACAAACATGTCTCACTAAAAATCGAACGGGGACAGCAAAATAGGAACGCAAAAAAATTAAATTAAATTAAACGCGATTTACATACAAACAG 
AAATGACAGAATGATAATATAAAATATTCATTTTTTATTTGGCTTAAGCGATGTTGTTGTTCCAAAACCATATAATTATTTATGATTTTATGTAA 
TGTTTTCATGTATTTTCGCGTAGTGACTTTATACCCTTTACCCGTATGATGAAACGAAACATGATATTTGTAGAAAGGTACAAAGGATAAAAGTT 
GACATTTAAGGCCGATTGTAAAATATTAAAACGCAACCTGGATAACATGCCGATTAATTACTTCTGTTCGTCCGTATAAACACAAATACCGATAA 
ATTGGTCCGTCCCCATTTAATAAATTGTATATTAAAATGGTCTTTATTTTGGATAAATAATTCAAATTATATAGGAACATTTGGGTTTTAAGGGA 
TAGAAACAGCGCTGGTGTCTTTCATTTCCGTTCGTATGACCGTTCAAAGATTATAAAAAAATTGTCGTGCCCACATTTTTTAAATTTCTTTTATT 
TTCGGTTAAAATATTGTGATACAAAAAAATGCCCTTACAAAATTAGGACAACAACAATTTGCTTTATTATGATGAACAAATTAGGAGTACAAAAA 
ATAGCCTGATGCAAGGAT GGGGAGAAAAAAGACC TTGAAGTTCAAAAGAATTTGTTGTATC AAAT CCTC AAT AAGAAAACTACTT AAAAT TAAAA 
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GCGTGTTTCAAATTAATTGTAGAATATTTCGTAAAATGAAATTGCAAACTTTTTTAGCACGAAATTACATGTATTTAATTCTTTGGCTAATACGT 
CAGTTAAAAAAGGCTACATTTTGATGATACTCACAAATGTAATTTTTGCATTCTACACAGGTGCAATTAAAAAATTATGGTTTTTATGAGTTTTA 
GAGGGAAGAAAATATGTCAATATGAGAAGTAAGTTAATGAGCCTTCAAGTACTTTAGTACATAATGTAATTTAATTTTAGAAATCCTACGGCTTT 
ATATTCAATTGTAAAGTATTTTAAAATTCCTGACGATTTGTAGTAATTGTTTTGTTCTTAGCTTTTTTGCATTCTCCTCCGTTACTCCGTTACTC 
GTATAGTAAAAGGTATAGTATATACAATAGTATTGTAGATTCGTTGAGAAGTATGTTACAGGCAGAAGGAAGCTTTTCCGACCATATAAAGTATA 
TAATTGAGTATTTAGCTTAGGATTATTAATAAAACATGTATTTTATTTAACAAA 

(SEQ ID NO: 1268) 
Start ATG: 510 

MLKFIRGKGQQPSADRHRLQKDLFAYRKTAQEGFPHKPSALAYDPVLKLMAIGTQTGALKVFGQPGVELYGQHTLLNNSASELNVQLLEWVYGTG 
RILSLTAANQLILWEPVGATLLPIKTLPFDGKLKKVSSLCCSLSKDLLWIGTEGGNIYQLDLHTFTIKEPVIYHDVVLEQVPPAYKLNPGAIESI 
RQLPNSPSKLLVAYKRGLCVLWDFESASVQRAYIAPGHGQSVGLTVNFEGSEFTWYHADGSYATWSIDNPEPPSNVNYVPYGPDPCKSINRLYKG 
KRRSNDVIVFSGGMPRSAYGDHNCVSVHASDGHKVCLDFTSKVIDFFVTFENKRDVAEVLVVLLEEELCAYDLTDPNICAIKAPYLHSVHASAVT 
CNYLASEVVQSVYESILRAGDEQDIDYSNISWPITGGTLPDNLEESVEEDATKLYEILLTGHEDGSVKFWDCTGVLLKPIYNFKTSSIFGSESDF 
RDDAAADMSAEQVDEGEPPFRKSGLFDPYSDDPRLAVKKIAFCPKTGQLIVGGTAGQIVIADFIDLPEKVSLKYISMNLVSDRDGFVWKGHDQLN 
VRSNLLDGEAIPTTERGVNISGVLQVLPPASITCMALEASWGLVSGGTAHGLVLFDFKNFVPVFHRCTLNPNDLTGAGEQLSRRKSFKKSLRESF 
RKLRKGRSTRTNQSNQVPTTLEARPVERQIEARCADDGLGSMVRCLLFAKTYVTNVNITSPTLWSATNASTVSVFLLHLPPAQTAATAVPSASGN 
APPHMPRRISAQLAKEIQLKHRAPVVGISIFDQAGSPVDQLNAGENGSPPHRVLIASEEQFKVFSLPQLKPINKYKLTANEGARIRRIHFGSFSC 
RISPETLQSMHGCSPTKSTRSHGDGEADPNISGSLAVSRGDVYNETALICLTNMGDIMVLSVPELKRQLNAAAVRREDINGVSSLCFTNSGEALY 
MMSSSELQRIALATSRVVQPTGVVPVEPLENEESVLEENDAENNKETYACDEVVNTYEIKNPSGISICTRPAEENVGRNSVQQVNGVNISNSPNQ 
ANETISSSIGDITVDSVRDHLNMTTTTLCSINTEETIGRLSVLSTQTNKASTTVNMSEIPNINISNLEDLESKRNTTETSTSSWIKSIITNISH 
EKTNGDNKIGTPKTAPEESQF* 
(SEQ ID NO: 1269) 



Celera Sequence No. : 142000013384703 

TTAACTAAGGCGATATTGTGCCTCTCGCTCAATCTCGCAAAACGATGCTTTGTCTGCTGCTTCTCTGTGCGTGTGTCCGTCTCTTTCCACGGCTT 

TTCGTCGAGACTCACATATACAAACGGCGACACACTGGATTGGAGCTGGACTCCGTTTTGGATGGCTTGGCTTTGGCTGCGTACCCAAAATAAAT 

CGCGCTTGCAGTCCCAAATCATTTTAGTTCTTGACGCCTTTGTTCCACAAAAACTTCCAACCGCCGTCCATGGCTCGTGCTTTCTCTTTCGCTCG 

C GAG AC G C T CC AAAAGT C G C GAG TAT GGG AAG AG AAT GT AT CCCTCGCTCTTT C AC GC C T AGAAGCG GC T GT AGC AC AC AC AAGC AC AC GT AAAC 

CCAGCCGTATGCATATACATATATTCCCTCTACTAGCGAGACGTGTGTACGTGCGTATGTATGTATGCTTTTTTCACTTCTTTCTCCTCTGCGTT 

CTTTGCACCAAGTGTCGCTGTTGCTATTTGCTACCGCCGCTCCTCTTGACGAATAAGTCGGGACTTGTATCAGCTCTAATTCGGACTTAGTTTAC 

AAAAT GCGGCTGCTTTTTTTCCGC GAT AC T C GG AT T AGT G AT GT AAAAAG ACGT T GAT GCT T CG ACC AC T G AAGT C AGT GC G AT AT GT AT C GAT A 

ATTGAATGCCATCGATTGCTACGATTAAAACTGCCACCGCAGTTAGCGGAGACGCTCACCATTCTCTATTCGCCATTTGAAAGTATCGGTGTAAT 

TTGTTTTCAGAGAAATTAATTTCCGTTTACTGTGCAATTCGGTGTGAAAGTGTTCAGATTTATCAATGCGTATTCTGCTTTCGACTTCGCCACCA 

ATCTGTGCTGCAAGTTACCATTACCAGGTCCACCTGGTTCCCGCCAGTTTTCTTTCATTGTGGCTAGTTGTTGTTCGTGCCTTCGATAAAGACGT 

TTAGAGGTGTTTTTAGAGTTTCGCCATCTGGTCACTATAGCCGTTTCGTTTTTTACGTGAGTATTGTGAATTTGGTGTGTTGATTTATATCTCAG 

TTGGAGCCTGCGTGGAAATAGTGTCAGTACGTTTAAAGGCATCATCGTAAGGTTAGTCGGAAAACAAATAAAAAGCCTACAAATAGCAAGCAAGA 

ACGCTTTCGCAACCACAAGAAAAACGAACATACGCACGCATCCATCTGTGGTAGAGTCGGCTTGTGTGGGGGTGAGTGGATGTGTGGGTGAGCAT 

TTCTGGAAAGTGCCGTTCTCTATGTTTCGATTCTACTTATCCCTTCATCACCATTTCATCGTTGAATCATCACGAAAGAAACATCGAAAGTGCAC 

AATCGTTTGTTATCTTTGTACGAAAACAACGGTGATTTCCACACAGGCATAACCTGCAAGAGTAATACGATACAAATTTTGATCTGATATGACCT 

CATTTTAGTCGGTTTGAATTTTAAAGAGAGCGACTCTGCAAACTGCAGGGATATGAACATACAAAATTCACAAATGATTGTTTGCATCCTAGTTG 

CTTATATTCATTTACACGCATGCTTATATGCTTACATCTGTTATCAATAATTGACGACTTGGTGCATGCGAAGCAAACGCATTGAAATGACTATT 

GGGATCCACACACTGTTAAAAAACACACTGATTGCACTGCCACACATTTCTCAACTAGAACCTAATGTTTTATACTGAACTTAACTGCAGGAAAG 

CCCAAAATGCCCAGCATCAAGTTGCAATCTTCGGATGAGGAGATCTTTGACACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCAT 

GCTGGAGGACTGCGGCATGGAGGACGATGAGAATGCCATTGTGCCGTTGCCCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTC 

ACTACCACAAGGACGACCCCCAGCCAACGGAGGATGATGAGAGCAAGGAGAAGCGCACAGACGACATTATCTCATGGGATGCAGATTTCCTAAAA 

GTCGACCAGGGCACACTGTTTGAGCTGATATTGGCAGCGAACTATCTGGACATTAAGGGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACAT 

GATTAAGGGAAAGACTCCCGAGGAAATACGCAAGACCTTCAACATTAAGAAGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACG 

AGTGGTGCGAGGAGAAGTAAAGCGCGGCATTTCGCGGGACCAACATTAAGTTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATT 

GCAACTTTACTTAGTTGCTACTTTCATTTACATTTTTTTTTATTTTTAACCCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCT 

GATTTGATTCGCGGGGTTCATTTTTTGTCGTAAATATATCTCATATACATACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACA 

CTTGATTTCACTTTAATAAATATAACTACCCAACACAAAGACACAGCCTATTTTGTGGGCTTCCCGGAGGGGTTAGGGTCCAAGGACTTCTCAAA 

TTAAGAAGTGCTCTTTATTCGTTATCTTGTGTTTAGAAACGTTTGCAAGCGTTTGAAATGGATACGCATGCCATTTATGGTCCAGTCGGTAAAAT 

TTTACGTGTTCCCAAGAAGGTACATATATATCTTAGATTATCTTATCTATAGAAACTACATTCATAAGATCACGCTTTCGGCGAATAGCGAGTAA 

TACTGTAGCTGCCACTTTGCCACTTAGTGTGCAATTGAAATGCGCGCCAACGCATGTTCTCATATGCGGAGCGTTTAGCAGCCCTTAAAAAGCTT 

TTCATGTTGCTTTTAAAAAGCTACGACGGGTGTGGAAATTAGATTTGTGACGCTTTATATATCTTTTTTTTATATCGTCTGGGGTACAGCATGGT 

TTTAGGATTTTAGAAGAAAGCAGTAATTTTGTAAGAATCGAAGAACGGGGGGATATTTATAATTTTCATATTAACCTTGTCTTTGTTTGCTTTTG 

AAATTATGAATACTTCGAAAGATAATTATCTTGTGCGCATTCCATTCTGGCTTAGACTGGGTTGAGCTAAAAATGATTTGAAATACAAATTTTTA 

GGTGTCCGCTGATATATAAACAGTATGATGTTTTTGCACTTTTTTCCCTGAAGTTTTCCTTGTTGTCTCGCTTGCCAATGCTGAACAAAGTAGCC 

AAAGGACCTTTACTTGAAGCTTTGAGAGCGGAAAATATATTCGGGGACAATTGCTTTAAGTGTGCACTAGGCTTGGCTAGATGCCCGTGCCAAAC 

TTTCTTTGGAATTGATCGCTGGCCTTAGCCAAAAGTGGAAAAAAGAAAGGATAGGATAGTGCCGCACGTATTAAGGGTCTTCTTACAAAAGGTAA 

CTGATATTAATTCATTTGCATACTAAGCAAGCTAACTCATCTGTTT 

{ SEQ ID NO: 1270) 

Exon: 1001.. 2466 
Start ATG: 1717 

Transcript No. : CT43239 
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TTTTACGTGAGTATTGTGAATTTGGTGTGTTGATTTATATCTCAGTTGGAGCCTGCGTGGAAATAGTGTCAGTACGTTTAAAGGCATCATCGTAA 
GGTTAGTCGGAAAACAAATAAAAAGCCTACAAATAGCAAGCAAGAACGCTTTCGCAACCACAAGAAAAACGAACATACGCACGCATCCATCTGTG 
GTAGAGTCGGCTTGTGTGGGGGTGAGTGGATGTGTGGGTGAGCATTTCTGGAAAGTGCCGTTCTCTATGTTTCGATTCTACTTATCCCTTCATCA 
C CAT TTCATCGTT G AAT CAT C AC G AAAGAAA C AT C G AAAGT GC AC AAT C GT TTGTTATCTTT G T ACG AAAAC AAC GGT GAT T T C C AC AC AGGC AT 
AACCTGCAAGAGTAATACGATACAAATTTTGATCTGATATGACCTCATTTTAGTCGGTTTGAATTTTAAAGAGAGCGACTCTGCAAACTGCAGGG 
ATATGAACATACAAAATTCACAAATGATTGTTTGCATCCTAGTTGCTTATATTCATTTACACGCATGCTTATATGCTTACATCTGTTATCAATAA 
TTGACGACTTGGTGCATGCGAAGCAAACGCATTGAAATGACTATTGGGATCCACACACTGTTAAAAAACACACTGATTGCACTGCCACACATTTC 
TCAACTAGAACCTAATGTTTTATACTGAACTTAACTGCAGGAAAGCCCAAAATGCCCAGCATCAAGTTGCAATCTTCGGATGAGGAGATCTTTGA 
CACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAGGACGATGAGAATGCCATTGTGCCGTTGC 
CCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCAGCCAACGGAGGATGATGAGAGCAAGGAG 
AAGCGCACAGACGACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGCACACTGTTTGAGCTGATATTGGCAGCGAACTATCTGGA 
C AT T AAGG GCC T T C T GG AGC T C ACC T G C AAG AC T GT T GC AAAC AT G AT T AAGGGAAAG AC T C CC G AGG AAAT AC G C AAGAC C T T C AAC AT T AAGA 
AGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAGCGCGGCATTTCGCGGGACCAACATTAAG 
TTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACTTTCATTTACATTTTTTTTTATTTTTAAC 
CCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATTTTTTGTCGTAAATATATCTCATATACAT 
ACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGT 
(SEQ ID NO: 1271) 

Start ATG: 717 

MPSIKLQSSDEEIFDTDIQIAKCSGTIKTMLEDCGMEDDENAIVPLPNVNSTILRKVLTWAHYHKDDPQPTEDDESKEKRTDDIISWDADFLKVD 

QGTLFELILAANYLDIKGLLELTCKTVANMIKGKTPEEIRKTFNIKKDFSPAEEEQVRKENEWCEEK* 

(SEQ ID NO: 1272) 



Celera Sequence No. : 142000013384225 

CATAGATCACCCTGCCCGAGTCGCAGGATACCACGAACAGGAAACCATCGGCCGCCTCCAGGATAAGATGCTTAAGCTCCTGGTCGGTCAGGAAG 
GAGGGCTTGTAAGTGCCATCGCTGCTGGTATTTCCTGTGCCACGCAGTGCCTTCATGTGAGCCACTGCCATGCGGAGAATGGTGAGCTTGTCCGG 
TTTGCGGGCCAGGGCACTGCAGGTGGGCACCATGTCCGAGAGTTCCGTGATGTAGGCCGTCATCTTGTTGCGGCGACGTCGTTCAATCTCGCAAT 
GGTTCTCGCGGCTGGCGAACCGCTCCTTGTCCTGTATGTTCGCCTCGTCCATGCTTGGAATGCGTAATTAGAAAACGTTCGTCGTAATTTGGGTC 
GCCTGTGCTCTGTGGAAAAAGGTTCGTGGAATTGAGAGAGGGGCGGGGTAGGGGTAGGGGTTAGTATTTTGTCTACGTCGTGCTTGTGTGTGTAC 
ACTTGCTGGCTGGCCAGTAACCTTATTTCCCTCGCTTTTTTTACCCTTCTACTCACCAATGGGGGGTGTACAGAGGTACTTGGTTTATCCAGGCA 
ATTTTATTCGGCTGGTCTTATGCGCTTTTACCATTAATTCTGTACTTTTCTTCTACACTACACGATAAAATCTCTGTTTACTGCGCGAAGTGTGA 
CCGTAGTGTGGACTAACAAATATGCCGATGCAGTGTATAGCAAATCGATAACAATGCGCTAGGTGGCAACACAGGTGCGTTGAATCAAGACCAAG 
TGGCATTACTATAAAACTAGAACGTTACATTTTAAAATTTTCATCCTAAAAAATAAACACAAATATACAATAAAATTTTAATTTAATATTGACTT 
TTTTAAGGCCCAGCTATCTGTTGGCTAATTAAGTAAATCAATTTACCGCTTGTCGTGTGGTGAAAATGTTTTTATCTTTCAAAAACATACAAGAT 
CGGCGTCCATTTAAAAAATCGCTTTTCCCAGAAAATAAATAAAACCATCATTGTTTCTGATATTGTTGTCAGTATTTAATGGTATTACAGCATTG 
CGTTTTCTTACATATATCAAATATAATTTCATAAATACATTTAAATGTTTAGTCATAACTAGGCGCAGTATCGTAATTTTACGAACTATGATTCT 
TTAAAAAATTAAAAGTTTATTATTAGTGGCAATAACATGTCTTCATTTACTCCTATCCGGATGCATTCTTATAACAAAATAGTAAGCTTGCGCTT 
TATTCCATTTCGCATATTTCTTTGTGTTCTTTGCTTATTACATTTTTTCAGTTTTAATACAATCTTAGGTAAAGTTTTATCCGCTTTGCTTGTCC 
TTCATGTTAAATCTGCTTGGTATTGTTAATTAATTATTGGCAATTTTCTAACTGCATGGTTTATTTAGATGGGCATTCCATTTTCCATATTTCAT 
TCAAGCTGTTGGTACAACATACAAGTCTTTTGTTCTATGTTTATAGTTAAATATATTTAAGTCTCCATTAAAATACGTGCACTACTTAAGTGTAA 
TTTAAAATGCGGCTTCATTAAAATAGCTTAGCGTGCTTAATACGTCTTTTTTGTGGTGTCTTTGTTGGATGAAGTCCGGTTTAGATTTAACTTCA 
GCATAATTCGTTGACTAGAAATTAACTTAAGTACTTAGGGCTAGACTAATATTAACTTTAAACAAACACGCTCTTGCAGTTTTCTATTTTGGTTT 
ACGTGCATAAGTGTTTTCTGATCAATCTTTGTTCATTAATTTGTGCTGCAAACTTATTTGAGTTGCGTCGATATCCATTTCCTTGCTGGAAAGTA 
GTCAATATTTGCTTCTTGTGATTTCATATCAAATTAAGGTCATTCGTAGTTGCTGCAAGCTTAAACATAACTTTTCAAATCCGTTAACAATTTTA 
TTTTGCCTTTCAAAACATTTAGCACCAAAATGATTAATCGCCAAATAAAGAATCCTTGTAGTTTGTGTTTGTTAATTTTGCACATCGTTTGGAAT 
TTAGACATAGAGAAAAACTAAGTAAATTTGCAACAACTTTCAATTAATTCCAATTTAAATTCAAAATACTATAAACTCCTATACAATACTTTGGT 
TTTTCAGTATACAACGCACAAATGGGCGATTCAAGGTTTGAAAAAAATTGAAAAAACGAAGATATCTTTGTTGGATTGAACAATACATAGTTTTG 
GCTCTGGGGCGCACTTACTTATAACATCACAAAAAAATT AAAC ATACGTGTGATAAATTAGTTGATAGTTTCATACACTAATCTAGCTGCGT AAA 
CTATCTAATCTTTAGTCTGCAAAAATAATGCCTATCTCTCCAATATAATATTCTTTGCGCTCAATTGCGTTTAAAATGCAACAACTTGTGTGTAT 
GGTATAATGTGTAGCTTTAGTAGGTTTGATTGCGAGTAAATGTTGCTTGGCTAGGAAACGCCCGCGACCTTTCTACGTGGTGTAGGTGCGGATGA 
CGTCCCGGATCACCGCCCGGCACAGTGGGCACTGGCCACCACCCACTCCGCGCCACTGCTCGATGGCGCAGTCGTAGCACATGCACATGTGTCCG 
CACATGTACAGCACCGAGTCGATGGGATTCTCGTAGCAGATGGTGCATTCGGCACTCGAATCGGTGCTCTGCTGGTCGCTCAGGCTGTCCTTCCA 
CTTGTTGGCCGCATTATTTGTGCTATTGGCAATGGGCTGCTGATAAAGTAGGCATTAGTATATGTCTAGGGAGTATTTCACAAAGCTTTGACACT 
CACCTCGATGTACTGGCTGCTGGTGGTGCTGATAAGGGTGCCGCTGGAGCAGGCTCCGGCCAGAACACCGCTGGAGGTTACCGTGGCCGTTGGCC 
TGGCCGCGAGTTGGCCGTTTATATCGTGCGAGCTGCTAGCAGGTGGCAGGTTGACCACCAGAACGGTGCCACCACCATTCGGCTGGATCTGGAGA 
ATATCTCCGTTGGAGGGCATGCTGATCATGCGGCTGCCATTGACTCCGCCAGCTGAGGCCAAAGTGCTCGTACTCTGGGCTACGCTCAGCTGCGA 
GGGATGATGCAACAGCTTGGCAGTGGCTCCGGCATTCAGACTGCTCATCGACTCGGTCATGGGCAGCATGCGGGACGTGGAGGCAGCGTTGCAGG 
CGGACGAGCTGCTGGCATTCACATTGACCTGCGGCTGCGATGGATACGCAACCATGTTGGGTAACTGCTGGCGGAACATCCTCAGAGACTGCGTC 
GAGCCGTAGACATCCAGGAAGGCCCACAGCTGGAGGGACTGGTCCACATGCATCACCACCACAGCTGGTCCGTTGTTCTTGCTGATGCTCACTTC 
ACCGTTGGGCGCCACGAAGAAGGCAATCTCATCGCCGCGCTGAGGAGCAGCTGCAATATCCTTGCTCACCACCCAGTACTCGGGGCGATCGAGCA 
GGAAATCCGAATCATTGGGCAAATCATTGGGCTGCAACATGGCCGGATTGCAGGAGGTCAGACCCAAAGCCAAGGCACCCACGTACATCTGTTCC 
GTTTTCAGGACCTGCACAATCAGCTTCTCACCAATGCGAATGGGACGGGCAGTGAACACATATCCCTGGCAGAAGTCAGATTCCGTACGAGAAGC 
CACAAAGCGATCCTGCGAGAGCCGCACATTCCGGCCCTTGGTGTTGTGGAAGGGCACTGGAATCAGTCGTCCGTTGGCATTGTAGCGCAGAGGAG 
GCAGTCCATTGGCCAAATCATGAGCAATGGCCGCCTGTTCTACGCTAGCCACGCTTCCACCGTTTCCTGCCAAGTGCAGGGATTGCAGGGACGGC 
ATCACATGGCGCTCCAGATCGTGCTCGATGGCCGCAGTGTGTCCTGGCAGGGATCTCCTCGACTGCTGATGCGGGTGATGGGAATTCAGGGCGGA 
GGAGGCATTGGCAGCTGGCTGGGGCATTTGCTGTTGCTGGGCGGGCACAGTGGCCATTGGTATAGCCGCTGGCTGCTGCTGGTACATGTAAATGC 
GAGAATCGAGAAACTCAATGCCCGTGCAGTTGCCATAGATATCGATCACCGTCCACAGCAGGCTTCTCGTATCGATGCCGGTCAGTATCACACCC 
TTCTCCTCGTTGTTTATACCATAGATCACATCGCCAGCTCCATTGACATAGTAGTAAAGTATGTTGTCCTTCTCGCAGTACTGCTCGTGCAGAGC 
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CTTGGCCCAGA?\GCCAGGCCTGTTGGTCAGATCCGGACAGGCGTACTTGGGCAGCGTTCCCTCCAGCGTTACAGGATCGTTGCTGGTGAATCCAA 
AGCGAATTCCACCGTTCCAGTTGTTCGAGATCTCAGCAAACTTCACGCAGATCCTCTCGTTGATGCGAACCGGACGCGCTGAAAAGGTGATGGCC 
CrGCAGAAGCTCTCAAAGCGACGGGCCAGGGTTCCGTCCCTTGAAATTCGGATGTTGTCGCCATGTACGGAGTGGAATTGCAGCGGAGGCAGGTT 
GTTGGGGCCAGGACATGAAGAGGGTGAGCGGCGAACTGAAATGGTATATGTGAAAGTTGAGTTAGTTAGGATTTTTTTATATAGTATTATATTTC 
ATTTAAATTTCTTGCAATACGAACATGATTTAAGCTGTTAGTTTTATCCCACAAATCATTTGGAAACCTTGAATATACTTTCGCATTCACGAAAT 
GCTGCCTTCCCACAATCCATTTATCTATTAGTTACATCTTCTGCCACATTGTTTTAAAATCTTTTTCCCAACGCCAGTTCCCCGTATCAAATTAC 
GCATCTTCAGAAAGATGTCTCCAGGTAGCCGAACATTCAACCATGTGCTAGCGATCGAATTACGCAGAAGCCCAAACTGATCTTCCTGGGTGAGT 
CAATAACCCCCAACCCAAGACCCAAATTTAGTTGGTATTCAAGCACAGGGCACGTGACTGTATTGAAGACTATTGAACCATTGGGTTTTGGGGAT 
TTGTTGCCTTGGGAAAAAACGCGGACTCAAAAGGATTTACCATGGTCATCGCCAGCTGCTGGCCCATTTTCTTGGAAAATCAAAGTCACATACGT 
GTGAAGTACAAGAATTACTTGAGGAATAGATAAATATTGGTGTAGGTTAGGCTAGAGAATTCTTACAATTTTCAAGCGATTTCAAATTCCTAGAT 
ATATATTTGAATTGTTATTTTTGTTACTAACTCGTTATACTGGATAGGCGTTTTTTTATTACCAATGAATAGCTTGTTGCTATCGGGCTTATGGT 
AAATTCTTTGCAATTATATAGGGAAGCACTTCAATGGCAGAGGAATACTTTAGGAATCCCTTAAACATATTCCCGAAATTGACTTAATTATTGCA 
CCTTGCAAATCTACGTATATCAGCAGCTGCTGACAATTACCACAAAAGACTAAGTCAGGGCAGAAAAACCTATAACCGATCCATATGGCCATACA 
TATAAATAATTACTTGAGGACGAGGAGTCTTTGGATGGCGACAGGTTTCTGCTCAGGTCGCTGGAATATGCCCAACATCAGACGATTCCGGCGGC 
ATGTCAACAGTTGCAGTAAAAAAATATATAAACTCAGTTTATAAGAAAGCCATGGCGAAGGACCTGAAGCAGATGAGGCTGATGTTGCTGGTGAT 
TACCATGTTGATATGCGGCGCGTGGCACTGAAGATCAATGTGTTTGCTTTGGAGCGGCATGTGGCAGCTGTCCTTTTTCCCTTCTTCTCGATTCC 
TTTTGCAGCCTTTGTTGTGCGGAGGAAGAAATAGGAAGGTCAGAAGAAAGCAGCTGTACGGTTGGAACAGGTAAAACAACCCCTATGGAGAGCGA 
GAGTCTGTCCCCGTTTTTCGAGGGTAGTTTTCAACGTTTTCGATCGCTTAGGGGTTGCTGCAGTTTCACAGTCTGCAGCTGTTCAAATTTCGAAA 
AAAAAATACTGCAAGACAGATACAAGATACAAATGCTTGGGTATCTGAAAGATTCCCCTGGCTTGGCAGTGTCTTTGGGGTTGAAATTTAATCCT 
AAGCCAGGGCCAATGACTTGCACACGAAATCTGTACTGTCTGCTTCCCGAAAAAATCCAATCCTCAAGTGCTTCCGCTGCACGAAGTGGAAAGGG 
AGCACCAGCTTCCGGTGGCTCCCATTCTAAAGGAAACATCGCGCCTTTGGAGATTCGCCATGGAGGAGGAGGAGAACCAAGGTGGAGTAGAAAGC 
CGGCCCAAAGACTTCTACTACGAGTGCAATACCCCAACCATCCATATTTATGAAGTGACCGCGACGAGGAAGTAGGAAATCACGTACTACGCACA 
TACGTGTACTGGATTGGGGCCTATTAGCTGCAGGATGTTGTCGTCTCACTAGCCTGGCCAACTGGCGACGCCTACTTTTTATCGGCCCAGACAGT 
CGTCAGTCAATGGATGGAATTGCAGCGGCGATCACTAGCATTCGCTGGCATCAATGAATGAATGAATGAGTTAAATACGCGCCTCGCATTTTTCC 
TTAAACTCGCAGTCCCAGAAACGGTGCTTCATAGACGCGTTGATACTTTCAATTAGCCAGGCCACAATTAATTAAGTGCAGCGAGGCTTTAATGA 
TACCCTCCACCCCATGATAACTAATTGCATTAAATGTATAAACGATAGACTTGTCGGTTGGGCAGCAAACAGTGGCGGCTCGAAAGCGTTCAATA 
CGCTTTCAGAGTAGGAAATTATTAAATGAATACCTTACGGTAGGGCTGGCTTTCCGGCACTCTTAACTATGATTAAGGATACATZ^AAGATAATAC 
TTGTGAAACAAGCCAGTAGCGATGAAGACCAAGAAATTATCCATTTCCCAGGCCATCCAATCATCATCCAGTTTCGACACAATATTTGCCAAATA 
ACAACAGAAGATGAAAAGTTCAAACTTTCGGTGGTGGTCATGGTGCGTTTGGAGTTTGCCGCTCTGCCGCAGCAAAGGTTCCCAAACGCCGATTC 
CC G AAC T GGC AAAAC G AAG AC C C T GG C ACAT CC G AT C G AC T G C AAAC G AT C T GGC GGT GG CTG C TGC AGT TGC AGGCAAA TTTTCGTTGCTTTGC 
CAAAATGGAAAACCTGATTTACCGTTCGGCGCGGATCGGGAACGTCTTTGGATTCGCCACCAGCAAGCGAAGCAAAACATTTCCCACGCTGAGCA 
CT GGGC AAAAAT C AAAC AC AAACT CCG T AG AG AC AAT TC C C AC GAC T CC AAAT T T GT ATAT AT AGG AGAT AT AT AGT AAATT TGCGG C TGC CGAA 
CATCGAAAACCTGTAACAAGCCGCGGTAATTGCGAGAAGAGAGTGCGCAGTTCGCAGTTCGGATCTTCCCGAATCTTTTGGCGAACTAGAGAAGA 
AGCTGCAGCTGGGTGATCGAGCGGGTGGAAAATGTGAAAATCATGCAGGTGTCATAAAACAAAACCCCGCTTCAATGGAAAAAAAAGGAAACCAA 
AACCAGTGAGGCAATTGTGCGGCCATCGTTAGGCGAAATCATTTTAACGATCTGATGAAGATCTCCAAACATTCAAAATCCCATTGAGCGCAGAC 
ATCATTAGATTTCGATTGGCACTCGTGAGTCGAGTTTGGCGATCATTAAAGGCAGATTACGATGATGTCTGAATGAAGGCTAACAAGGGGATACG 
ATGTGAAGCTAAGCATCAACTTATTGAAACGATCGGTTCTAGCAAATCGCATCAAAGTTAATTAGTTCGGTAGCATTCAGGCTGTGGGAAATCAT 
TGGATCGCTTTAATATCAAAATCTTTACTATAAAATAAACTAATTTTCAAAGTATGGGTATACAAAGCATTATCTTGACATAAACAACCAACCAT 
TGCTGACTGCAACATTGTTCCCGCTGCCACCCACTTGAACTCCCTACCCGTTTGCACCGATTGAGGTCACAGGTCACGTGCTGCCCCGCCCCCTG 
CGCCGTGCCTGTTGTTGTCAGCCCACGTGCGTGTGTCAGAAGACGTGTGCGGACAATTAACACGCCCCAGGCCGTTTAATTGCACCGGAAGTGTC 
CCATAAAAGCCGGGGCTTTCGTTCAAATGCCGGCTTGGCATGCCGAGGTGGAGCAATGGATCATTTATACAAATTGTATCAATTGGTCAAGCATT 
TGGCATAAATTATCGGCATTTTGCAGCGCCGCAATCAGTCAGATCGAGTGGGTGAATGAGCCGGATTGAGTTGGACCTAAGCATTTTGACTTCCA 
CGGGCTTGTCAATGGATCGACCCACGCCGATAAATAACTTATAAGCGTATCTATGGCCAGCCACTCCGCATAGTGTGCTTATTTTTCTACATTTT 
CCTTTTCGACATATCAGTGAAATTAGTTGAGGAAACAGTGTGTGGGTCGTTTGTTAGTTAGTCAGAGACTTGATCTGTGGGCAGAAAAGCCAAGC 
GTTGGTCGCAGCTTAAGAGCTGAACAATTGTCCATCGAATGGCTTGTAAAACAATTCTCACAGCAATAAAAGCAATGCTAAATTCTTGTCTTAAA 
TAT ACGTAAATGAGCCTAATGAATGTTAATT AAT CGCTCGC AAAC AAAC AATTTGCATGGGGCGGGAAAATATGAAAGTTTTCCTCAC AAC AGAT 
GAAATTTTGTTTTGTTTTGTTTGTCGGCGTTTTTAAGTTTTCCTTTCTAACCACAAAATACCAAGCATTTCATCCCAATCAATTTTCCAGTTTGT 
TTTTTTTTTTTTTTTTTGATTTGGTGCTAAATGCACTTAGCAAAGAAGTTCGACTGAATCGAAAAACCCAACCATTGAGTTGATGCGAAAACCAC 
TTCCGTTCTGCTTGAGCCGCTGACCAGTACAGCGGAAATTGCACTGAGATCCACTCATTGCGAGCTGCTGGAGCGGGAAACTCTCTCTTGCGGTG 
TGGATTCCCCCACCGGTTTGGCCCCTCCTGGCAGCGCATGTAACTCTAGAAGCTTAGCAAGAATGGACGCAACATAGACTCTCTGCAAATATAAT 
AAAAATAATTACACAACGGAAAAGGGGCCGACTGGTGGCGGCGGCTGGCGGCGTTGACAGCAATGAAACCAGAGGTCAAACTATGCCAAGTTGCT 
GCAACCAGTTGTTATTGTGTCTGAGCAGCTTTACTTCTTCCCAAGTACATTGCACTTTTCATGCGATTAAGCGAAATCTGATACTGCACGCCATG 
TGAACTAAAAATCAAAGAAATCACAGCAGCAGCAACTCTAAGTATATCAACCATTTAGATAAAGACTCTTCAATGAAACTATCGATAATACAGAA 
AAAAGTGACTAATCAATGAAAGTGCATCAGGAAATTTCCCACAGGAAGTTCGAAGTAATCAGGGAAGACCATAAAGATAATCGGTTATCTTACAT 
GAAGAAATTAGTGTTCAAAATGCTTGGTTTTGCACTTGCTTAAGAAATCCACCTTACATATTGATCACACCTCTCTCAATGGGTTCAATGACTCC 
C AATGAC TTTTCTCCCCGTTC G GC AT AAAT T T T CC G AC T T T C CAC GC AC AT AC AC AC AAAT TAG T T AT AAAC AC C C GG AT G AGG AG GC G C AGCT C 
CCCTTCTAATATAATTAGCCACCGAAAGGGAGAGATATAGAAAAGCATATATATAGAGAGAGAGAGACGGAGAAGCGCTGAAGGGTGGGTGGCCA 
AATCAAATTTACATGTTTATTTTTACAAGCCTACAGAAAGCAACTCAGTGGAATGGGAGCACCTCTCTATCGATGCACATGATATATCCGCGTAT 
ATGTATATAGTATAAATATCTATGGTTGTTTTTTTTTTATTTGGGCACTGAAGCATGCAGGAGGCCCCAGCCAAGAAGAACTCAACTTTTTCAGC 
TCCGAATAGACATAAAACCGATCTAAATATACACGCGTAGATATAAGGAAGCAAAAGGGAACCATAAAACGCACAATAAACAACATTGTCTCTTA 
AAGTGTTGACGGTTTCTGCAGAATAAATCGTATGGATGAAATATGAAATGAAATAGTTGGACTTCTCTTTAGTTGAAGCAACTTAATTATTAAGG 
AAGCAATTAATACCGCGTTTCGTTGCAGTACTGTACTTTCGAGAGCTCAGATACGATAAGCCCGGCTTTCTTCCGCTACGTTTTGTTGCGCTTTG 
TAGGCTTGCTGTTTATCGAAGCGTTGCAATTACTGCAAAGGGCCAGTGGTGGGGTCTACATAAATGTACAGAAAAACATATATACGAGTACAATA 
TAGAAAAGTTGATAGTTACTACCATACAACTTGCATTTCCTCGCACTTGACTGCACAAGACACCCACCCCCGCGATTCTCTACAGTCTGCCAACT 
CTTTGTGTAATTAAATCACTCGAGCGTTGAGGGGGAGCCCACAATAACAGCTTGAAGTTGCGCTGAGTGGAATGGAATGGACTCGAAGAGTCGTC 
TTGTGATAGGAACACATCATTATTACATTATAACGCTGTTATTACAATGGCTGACACAGACAATTCAAGTGTTTTGGACTGCGGAATTCGATAGT 
TTTTGATGCCACCGGTTGCATTGTTATCGGCCATCTCATTAAAACTTAGTGTTATAGCCAAACCATGCAAATTTCGAAGATTTTAACATATTAAA 
TACCCCGAACAGATGCAAAGCTAAAACCAATCTAGTTTTGGTTCCTCCCAAAATAGGTGTACATATTAATTACATTAAAGTTCTGCCCGAATTTC 
GGGCCACAAACGCTAAAGGCCAAAAACCAAAATCCAAAAAACTCGATCTGGCCAAGTTGCTAGCGATGGATAGCCGAACAGCTGCTGCTGCTGGA 
C AT AGAGGCAGGCATAGATT AAAT AC CCGTAGGTGT TCGAACCG AGT AGAT AT ACCGACGGTGATTTATGCCTAGCAACCAGCAC AT CGCCCGCC 
CTAACCCGTCTGGAAAACACGCCCTCCCGAAAGGCCTACAAAATTGACCCAAAGATGCCTTTGGATATCTTTTCGTCTCACTTTGCGTGAGTTAA 
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TTGCAGCTGATTTCTTGGCCAGGTATATACTATGTGCTATGTGGTATTGCTATATTATATGTTTCTACCTGTAGCTACCTGAACTCCACCTTTTG 
TCTTGCGTCAGTTGCAACACCCTGCGGTGTAGCGTATCTACTTAGATTTTTTCTTTATCTATGGGGTGGTGTGCGTTTTTATCTTTCGGCACAGA 
CAAAATGTTAACAATTTTGTACTACACATTTGGGTCGTGAGGAGAACTAAGGCGATACAGACAATGGTTATAATATACTTTTCATTTTCAATTTT 
CATTTAGTTATACACTAGGTTAGAATTGTGAGATAAATCGAAAAGATTTGTTAACCGATGTTCACTGTACTCGTCTTCAAGCTACCAACTTAGTT 
CAATAACTGATAAGCGAAAGATGTGGGAGGCACTAAATAGGTCACAAGATAGCTAGCTCTTTAGAAAAAAGCTAGCAAAGTTAGCACAAGCATCT 
TTGTGTATATCCCCAATAGTTAACAACTGGTATTTATAGTTAATCAGTTGCAAAACGTTTACACTTGTAGAGATAACATTTCGCAGTGTCCGTTA 
TCAAAGCGCGAAATTAAGCAGTGCTTCATATCAGCTATATGAACTATACTATACTACAATAACTATACGCAAACTTCAGAATGACACATCGAGAT 
GAT G AC CT CAT GAGGGG GAG AAAAAT AAGT GAAAAAAT AAC G AAGGGG AT GC AT G AAAGT GG AAAAAT AT AG C AAC AGA T GT AT GT AGC GAAAAT 
GCTCAGCATACAACTGTGCTCAACAGAAAATGGCTTACTGGGAAATCAACAATAATATGTTAATATTGAAGCAATATTAGCAAAATGTCTCTTAA 
ACATTATGTTTACAGAAATTTGTTTTTAAGCGGCGACTTACATATTTTGAACTTAATAGAATAACATTTTAGGTCTCAGAAGTTAATGTATGTAT 
ATTTATGGTGACCAACATTGTACTTAGATATGTACTTATATGTACAAACTTCCTTGGGGGGTGCGTACCGTCATTAACAATTGACTATTTACTTT 
TGGCCAACAATCGAAGGTCGGGCGACAATTTGTATTTGTGTGTCCCATAATCGGGGTCTCAAAGACGAGTTGGCAGTGCTCAGTTGCAAGCACAC 
GCACGGCCAAGCAAACCATTTCGAAAAGTAAACCGTGAGAAGCTAAGTAAAAGCCAAGAGCAGATAAACTTTTTAGCCAAAGCAAACGTGGCATG 
CCAACACACAATAATAATAACAACAACT^AACAAGGTACGAAATGTAATAGAAAATGCAGGTACAAAAAAGCAAAAGCAGCTGACGCCGAGCAAAA 
CAACAACAAAGCAATTTCATTGATTACCGATAGGCAACGCACGTTCAAAACAAACGGACTGGAGCCGCCGCATAAAACATAAACCATTGCGGGCT 
TAAGCGCTCAGCAGAAAAATAAAAACTCCAAACACACTCAAGCTTCAAAGCCACGCCTCTTCCTCTTCTCATCCCGCTAGTCCATGGCTCTTCTC 
TTCTTCTCCTGCCGTGGATCGTTTGCTTTTCTGTTGTTGCTACCTTTGTTTTTGCTTTTGCCGATGGCGGCAACGGCGCTTCGCGTACAATTCCC 
CGAAAGGCAACAGCAACAACAACAAGCAATACTGAATGAGGCAGATGCAAAAACAGTGGTGGCGGGAAAAATGGAATCTCTTAAATACAGTACAA 
TATTTTGCATAGCCACTTTTGACAAATT^AAACTATTCCAGTGCTGAAGGACTAAATTAATAACTTTTTTTTATTATTCATATTGCGTCGTTCGTT 
GAT ATTCGACTGCTAGTTGTGCGACCACAATTGTTCGTAGGAGGCCAGT AAAAAT AAAAAAAAAAGGGGT AG AAGCTTAGCCT AT GCGTCACTGG 
CTTTGCTTTGCTTTTGCTTCGCCTGCTTTGGCCAAAAAAGAGAGCATTGAAGCCGAGAATGCAATTGAGCCAAAGTCAAAAAAAAAAGGTGATTT 
GTTGTTTTTACTTCTTTTTTTTTGTCTTTCCTTAGCTCAGCTGCTTGCAACCAAAACAAAAACAGCGCAACAAATTCCCCAGCCCAGATAATCGT 
TATGATAAACATAATCGCTCCTTGGGAAAATTGCTTTTGGTGGTATTTTCTTCAACTCAAGATCCCTCCGATCTGGTCTTCTCGATCAATTTGCG 
GTTACTTGGGTGCGATTGAAAAATTCAGTTTTCGAGAACTGACATTTCCGGCGAAAAAGTCGGAGTGTCGATGTTAAATTCTGCCGAAAAAGCCT 
TGAGAGAATTTCACATTTAAGATTTTTTCGCGCTTCTTTTGTGTCTATCTAGTCGGAAAAAAACAATTACTATTATAATCGCAAACTGGGCGTGT 
TCTGGGTGAGCATTTATTCATTAGCTTTTTGTTTTAAACTAACTCAAACTTTTTCGGTTTTTGCTCTTATCTTTCGTTATTTGTTATTGTTCTTG 
AGACGTAAAATGCTAATGTATGCATAAATGATTGCGGCTGGCTGTGTTTCTCGCTTATTTATCTTGCCGCTTTAATTAACCGAATGAATGAAAGA 
GC GATAGGTCAGACCACATGCTAATATTATGTGGGAATATCGCTGGAGATAAAAATACGATTCCCAGATCGATGAATAATGTATTTAAGGCATTT 
T C GC AC G C GGT AAT T AC ATT T GC T T CAAGACT T T AAT GGT C AG AT AT C ACG AAG GAAGT T T T AT ATTGTT AAAATGT AC T AC A T C T AC T T T AGAA 
CCCAATCACACTGATTTTAAAAATTATTCCTGGATATTCCGAGAATTGTACGTCCTTTGAAAAACGACTTGGAAATTAACTGAATATTCTTGTTA 
ATATATCCATCGATATCCTTAGCCCGATTCTTAGCCTGGTACTAATGCGACTACTAACCCAATCCAAAGCGCTTCTTGATCTTCTTGAGCAAGTG 
CATTTTCTTCTTGCTAGAGAGCTTGTCCTTGGTGCTCATGAGCAGCTCGTTGGACTCCATGGCCAGGACCTGGGCGGCCTGCTGGGCGGGAGTGG 
GCATGGCGTTGTGCAGCTGCTGCATCTGTTGCAGATGCTGCAAGTGCTGCTGATTCTGGTGGTGCTGCTGCTGCGGATGGAGAGTTAGGTGCGGA 
TGGCTGCCCTGCATATAGTTGGCTGGTATATCCGATAGACCCATGGTAGTGGATTATTTCTTATTTCGTTATTTAAATTAAAACAGACGAACTAA 
ACGAACACGCGTAATGCTAATGAAAATGCACAAGCTGCTTGCGTATTTTATTTGGCATTAAATTAAGCGATAGAGTTCTTCTTCGAATTTGAATG 
TTTTCCGTTAAAACATTTTGAATTTGGTTGGTTGTTTGTCTGCGCGCAACGGGAACAAACGAAACGAAACGATACGAAACACAAAGACCGACGAC 
GACAACTTCGAGACAACCTTTCGCTGAGTTTTTTTTCAACAAATAAACGGTGGTGCAGCCTTTTATAGCTTCGCACCAGCGCGACGTCGCTGACG 
CAGCTCAGCTGGAGAGTAACGGATAGTACTGGCGCGCAGAAATTCTCCCGTCAGATTTGGCTTTTTTTAGTATATGTTTTCAAGATATTTAGAAG 
TTTAGGTATGCCAGTAAGACATATTTGGAAATTTAAGTTTTTTATAATTACTATTAACGTTTAGTGACAATTTGGGCTGTAGATTTGAATAACAT 
TTCCTGTTTTACAATTTAATTAAAATCAAGAAACGGTAACTAAATTACATTAAAAATGCTTCAATTTATTTAAACTTATAATTTGTAAGCAATCT 
ATTCAAATAATTTAAAATTTATCCATTTTCTGGCTAAGGGCTGTCAGCCCTGAGCGTGGATGGAGCGAAAAAGAGAGCTCAAGAGAGAAAATTAT 
GAGACGCCATCGCTCTCTTTCTCTCTCTGGCAGCTGTTTCTTATTTACTATGGTTTCGCTTTGTGCCTGCTATTCGCGTGTATAGAGCCTCCCTT 
ATAATTTACCTTTATGCCTCTAACTCTGCACCTTGATCAGGGATTCCTCCACATCTTCTCCTGCTGTTGCTCTTCCTGCTTCTTGTTGTTGTTGC 
TCTAGGCCTGCATTTGGGATTTTGGCCATAGTGCTCTGATCTTCACTAACAGTTATTCTCCTCACCTTAGTGTGTGTGTAGTTCAGCCAGGTTCA 
AAGCAATTCTTTTTCTTTTTCGCAGGGTTTACAAGGTCTGGCTCATATAACTAGAAAACTATTTCAATTGCCCTAATGCTTTTGATCGTGAACTT 
TGTGGGCCAAATATCACACGTACATACATATGTGTGTATGTATGTACTCAGTTGGAGTATGTATGGAGTAATTCCATGGGAATTAAAAAAAATTC 
CCAAACGTTTCTTGATAATAATAATAAATTTCTTCTTTTCCCAATTGTTTTCGGCACATTTAATTGTCCAT 
(SEQ ID NO: 1273} 

Exon: 14366.. 13834 
Exon: 4500.. 2759 
Exon: 2699. .1001 

Start ATG: 14104 {Reverse strand: CAT) 
Transcript No. : CT32118 

AAAGGTTGTCTCGAAGTTGTCGTCGTCGGTCTTTGTGTTTCGTATCGTTTCGTTTCGTTTGTTCCCGTTGCGCGCAGACAAACAACCAACCAAAT 
TCAAAATGTTTTAACGGAAAACATTCAAATTCGAAGAAGAACTCTATCGCTTAATTTAATGCCAAATAAAATACGCAAGCAGCTTGTGCATTTTC 
ATTAGCATTACGCGTGTTCGTTTAGTTCGTCTGTTTTAATTTAAATAACGAAATAAGAAATAATCCACTACCATGGGTCTATCGGATATACCAGC 
CAACTATATGCAGGGCAGCCATCCGCACCTAACTCTCCATCCGCAGCAGCAGCACCACCAGAATCAGCAGCACTTGCAGCATCTGCAACAGATGC 
AGCAGCTGCACAACGCCATGCCCACTCCCGCCCAGCAGGCCGCCCAGGTCCTGGCCATGGAGTCCAACGAGCTGCTCATGAGCACCAAGGACAAG 
CTCTCTAGCAAGAAGAAAATGCACTTGCTCAAGAAGATCAAGAAGCGCTTTGGATTGGTTCGCCGCTCACCCTCTTCATGTCCTGGCCCCAACAA 
CCTGCCTCCGCTGCAATTCCACTCCGTACATGGCGACAACATCCGAATTTCAAGGGACGGAACCCTGGCCCGTCGCTTTGAGAGCTTCTGCAGGG 
CCATCACCTTTTCAGCGCGTCCGGTTCGCATCAACGAGAGGATCTGCGTGAAGTTTGCTGAGATCTCGAACAACTGGAACGGTGGAATTCGCTTT 
GGATTCACCAGCAACGATCCTGTAACGCTGGAGGGAACGCTGCCCAAGTACGCCTGTCCGGATCTGACCAACAGGCCTGGCTTCTGGGCCAAGGC 
TCTGCACGAGCAGTACTGCGAGAAGGACAACATACTTTACTACTATGTCAATGGAGCTGGCGATGTGATCTATGGTATAAACAACGAGGAGAAGG 
GTGTGATACTGACCGGCATCGATACGAGAAGCCTGCTGTGGACGGTGATCGATATCTATGGCAACTGCACGGGCATTGAGTTTCTCGATTCTCGC 
ATTTACATGTACCAGCAGCAGCCAGCGGCTATACCAATGGCCACTGTGCCCGCCCAGCAACAGCAAATGCCCCAGCCAGCTGCCAATGCCTCCTC 
CGCCCTGAATTCCCATCACCCGCATCAGCAGTCGAGGAGATCCCTGCCAGGACACACTGCGGCCATCGAGCACGATCTGGAGCGCCATGTGATGC 
CGTCCCTGCAATCCCTGCACTTGGCAGGAAACGGTGGAAGCGTGGCTAGCGTAGAACAGGCGGCCATTGCTCATGATTTGGCCAATGGACTGCCT 
CCTCTGCGCTACAATGCCAACGGACGACTGATTCCAGTGCCCTTCCACAACACCAAGGGCCGGAATGTGCGGCTCTCGCAGGATCGCTTTGTGGC 
TTCTCGTACGGAATCTGACTTCTGCCAGGGATATGTGTTCACTGCCCGTCCCATTCGCATTGGTGAGAAGCTGATTGTGCAGGTCCTGAAAACGG 
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AACAGATGTACGTGGGTGCCTTGGCTTTGGGTCTGACCTCCTGCAATCCGGCCATGTTGCAGCCCAATGATTTGCCCAATGATTCGGATTTCCTG 
CTCGATCGCCCCGAGTACTGGGTGGTGAGCAAGGATATTGCAGCTGCTCCTCAGCGCGGCGATGAGATTGCCTTCTTCGTGGCGCCCAACGGTGA 
AGTGAGCATCAGCAAGAACAACGGACCAGCTGTGGTGGTGATGCATGTGGACCAGTCCCTCCAGCTGTGGGCCTTCCTGGATGTCTACGGCTCGA 
CGCAGTCTCTGAGGATGTTCCGCCAGCAGTTACCCAACATGGTTGCGTATCCATCGCAGCCGCAGGTCAATGTGAATGCCAGCAGCTCGTCCGCC 
TGCAACGCTGCCTCCACGTCCCGCATGCTGCCCATGACCGAGTCGATGAGCAGTCTGAATGCCGGAGCCACTGCCAAGCTGTTGCATCATCCCTC 
GCAGCTGAGCGTAGCCCAGAGTACGAGCACTTTGGCCTCAGCTGGCGGAGTCAATGGCAGCCGCATGATCAGCATGCCCTCCAACGGAGATATTC 
TCCAGATCCAGCCGAATGGTGGTGGCACCGTTCTGGTGGTCAACCTGCCACCTGCTAGCAGCTCGCACGATATAAACGGCCAACTCGCGGCCAGG 
CCAACGGCCACGGTAACCTCCAGCGGTGTTCTGGCCGGAGCCTGCTCCAGCGGCACCCTTATCAGCACCACCAGCAGCCAGTACATCGAGCAGCC 
CATTGCCAATAGCACAAATAATGCGGCCAACAAGTGGAAGGACAGCCTGAGCGACCAGCAGAGCACCGATTCGAGTGCCGAATGCACCATCTGCT 
ACGAGAATCCCATCGACTCGGTGCTGTACATGTGCGGACACATGTGCATGTGCTACGACTGCGCCATCGAGCAGTGGCGCGGAGTGGGTGGTGGC 
CAGTGCCCACTGTGCCGGGCGGTGATCCGGGACGTCATCCGCACCTACACCACGTAGAAAGGTCGCGGGCGTTTCCTAGCCAAGCAACATTTACT 
CGCAATCAAACCTACTAAAGCTACACATTATACCATACACACAAGTTGTTGCATTTTAAACGCAATTGAGCGCAAAGAATATTATATTGGAGAGA 
TAGGCATTATTTTTGCAGACTAAAGATTAGATAGTTTACGCAGCTAGATTAGTGTATGAAACTATCAACTAATTTATCACACGTATGTTTAATTT 
TTTTGTGATGTTATAAGTAAGTGCGCCCCAGAGCCAAAACTATGTATTGTTCAATCCAACAAAGATATCTTCGTTTTTTCAATTTTTTTCAAACC 
TTGAATCGCCCATTTGTGCGTTGTATACTGAAAAACCAAAGTATTGTATAGGAGTTTATAGTATTTTGAATTTAAATTGGAATTAATTGAAAGTT 
GTTGCAAATTTACTTAGTTTTTCTCTATGTCTAAATTCCAAACGATGTGCAAAATTAACAAACACAAACTACAAGGATTCTTTATTTGGCGATTA 
ATCATTTTGGTGCTAAATGTTTTGAAAGGCAAAATAAAATTGTTAACGGATTTGAAAAGTTATGTTTAAGCTTGCAGCAACTACGAATGACCTTA 
ATTTGATATGAAATCACAAGAAGCAAATATTGACTACTTTCCAGCAAGGAAATGGATATCGACGCAACTCAAATAAGTTTGCAGCACAAATTAAT 
GAACAAAGATTGATCAGAAAACACTTATGCACGTAAACCAAAATAGAAAACTGCAAGAGCGTGTTTGTTTAAAGTTAATATTAGTCTAGCCCTAA 
GTACTTAAGTTAATTTCTAGTCAACGAATTATGCTGAAGTTAAATCTAAACCGGACTTCATCCAACAAAGACACCACAAAAAAGACGTATTAAGC 
ACGCTAAGCTATTTTAATGAAGCCGCATTTTAAATTACACTTAAGTAGTGCACGTATTTTAATGGAGACTTAAATATATTTAACTATAAACATAG 
AAC AAAAG ACT T GT AT GT T GT AC C AAC AGCT T GAAT G AAATAT GG AAAAT GG AAT GC C C AT CT AAAT AAACC ATGC AG T T AG AAAA T TGC C AATA 
ATTAATTAACAATACCAAGCAGATTTAACATGAAGGACAAGCAAAGCGGATAAAACTTTACCTAAGATTGTATTAAAACTGAAAAAATGTAATAA 
GCAAAGAACACAAAGAAATATGCGAAATGGAATAAAGCGCAAGCTTACTATTTTGTTATAAGAATGCATCCGGATAGGAGTAAATGAAGACATGT 
TATTGCCACTAATAATAAACTTTTAATTTTTTAAAGAATCATAGTTCGTAAAATTACGATACTGCGCCTAGTTATGACTAAACATTTAAATGTAT 
TTATGAAATTATATTTGATATATGTAAGAAAACGCAATGCTGTAATACCATTAAATACTGACAACAATATCAGAAACAA 
(SEQ ID NO: 1274) 

Start ATG: 263 (Reverse strand: CAT) 

MGLSDIPANYMQGSHPHLTLHPQQQHHQNQQHLQHLQQMQQLHKAMPTPAQQAAQVLAMESNELLMSTKDKLSSKKKMHLLKKIKKRFGLVRRSP 
SSCPGPNNLPPLQFHSVHGDNIRISRDGTLARRFESFCRAITFSARPVRIWERICVKFAEISNNWWGGIRFGFTSNDPVTLEGTLPKYACPDLTK 
RPGFWAKALHEQYCEKDNILYYYVNGAGDVIYGINNEEKGVI^ 

QPAANASSALNSHHPHQQSRRSLPGHTAAIEHDLERHVMPSLQSLHLAGNGGSVASVEQAAIAHDLANGLPPLRYNANGRLIPVPFHNTKGRNVR 
LSQDRFVASRTESDFCQGYVFTARPIRIGEKLIVQVLKTEQMYVGALALGLTSCNPAMLQPNDLPNDSDFLLDRPEYWVVSKDIAAAPQRGDEIA 
FFVAPNGEVSISKNNGPAVVVMHVDQSLQLWAFLDVYGSTQSLPJXIFRQ 

AKLLHHPSQLSVAQSTSTLASAGGVNGSRMISMPSNGDILQIQPNGGGTVLVVNLPPASSSHDIMGQLAARPTATVTSSGVLAGACSSGTLISTT 
SSQYIEQPIAMSTNNAANKWKDSLSDQQSTDSSAECTICYENPIDSVLYMCGHMCMCYDCAIEQWRGVGGGQCPLCRAVIRDVIRTYTT* 
(SEQ ID NO: 1275) 

Name: neuralized 
Classification: DNAJoinding 
Gene Symbol : neur 
FlyBase ID: FBgn0002932 



Celera Sequence No. : 142000013384626 

CAAGAATCAGGCCGTACACAGAAAAAAAATAAGCGTTCACCTAAATAATATTTCTGTTTTTAGGAAATTTTAAAACCGCGTTATCACTTGCAACT 
TCTTTATATTAATCTATAATTACCCAATAGCCATTATTATTCTCAAAATATTAAAAACTACGAAGGTACTACACTTCTTTGCGCTTGCTAAGTTA 
TGTTCTTTTCAGTGTATCTGGCATATTCTTTGCTTGGCCGACGTGTGTCTGGAATGACTAATTGAATAGTGCATGGGCTCGTATATCTCGCAGAT 
ACGTATCCGCCAGATACGTTAGGGAAGGCGTGTGCCCGGCCTAATGGCTTAGCTTAGCAACATGTTGCAATTAGCGATCTTCGTTTGGCAACATT 
CGCTGCCAAAAGCTGGCAACATGCACAGAGAAACATAAACACACCAAGATTTTTTCACAAAGAGAGTGCTTCATAATTTTCTCTTTATCGAACTG 
CGCTCGCCGGTTGCGAAGACGATCACAATTGGAATCGCGCGGCTCGCAACATGTTGCCTTCAAACCGGTCTGGCCATGAACTTGGTGAAAGAAGC 
TACCATGGCGAAGCTCCAGTTGGGAGAAAAGTCTGCGCCGCGTTTTCCGCTTGGAAATTTTTAAAAGACGCTGCCATCCAGCGGCCTGGAACATT 
TGTAATCCGACGTTAGAAGTCTGCGGATCGTGGCCAAAAATCGGAGTGTCCCAGCCAGCTGTCATTCTGACTCCTCCTGACTCTGCTCTGAACTT 
CGCGTGTGTGTCTGTCTGAGTGTCCGAAGATTTGGAGTTACTGTTAGTTGACACTAATACGAGAGCGAGTGAAATACGAGAAATACAAGAAAAAA 
CCGAACGTGAATCGCGCGAGTGCAAAAAGTAAAGAAAATAGAAAGAAGAAGAAAACCCCAAACAAACAAACAAACAA 

ATCGATATAAGAAAACCTGAAGAAAACGAAACTCGGCAATCGTGGTCAAAATGGTGCGGTTAAGTGCAGCCTGGAGCTGCTGTTTTTTGGCCCTG 
ATTGCGTTGGTATCCACTGTCGCCTCCTCCGCTGCGGATTATCCGGGTGTCATCGAACTGGTGGGTTATAGCAATCGCCGCGCACGGACTTACTT 
CCAAGAGCAACCCCGCTCCAATCCCAATCCCAAGCCGGGTTTGACTTCGGCCAAAACCCGGGCGGATATTGAGACCGTGAAGCGGAATATCCGTA 
AATACGATCGTTTGCTAGAGTTGGATACGAAAAATCTGCCAGAGCAGCAAAAGGATATGCTCGCCAGGATTGTGGAGCGAGTGGCCAGGCTCAAG 
GAATCCCTGGACATTGAGGAGCAGCAATTTAACCAGCAGGAAGCTACCTCCTCCAGGAGCAGTAGTAGTACCACCACTAGTGCATCCAGTACTCA 
AGAATCGCAAACAGAAGCGGAAACGGAGCTGGATCAGGAGCCGATCGAGCAGAGCACACAGCTGCCCACTGGAGCCACCGAAGCTGAAACGGATC 
CCACCCTAATCCTAGACGAAACTACGCTGCTGGAGCTGCAGAACACGATGAACGAGCAGGCTGCCATGAGCACCATAATGAAGGAAATGCCCACG 
ACCACGGATATGAGTCAGCAGGAAGCGGAATCGGAGGTCCCCGACGAGGAGGGAGAGACCACCTCACCAAATCTGACCACCCAATCGGAGGCAGC 
GACGACCAGTGATGAATCCCTCACCGATTCCACTGATCCCGATGATCAATTCGTAGCTGCCAGATCGAATAACAATATAGCGGCCATAACGGCAG 
CGGATGATGCCGATGGCACCACCACAACCACCGCCAATACGGGCAGTCAGCAGCGCACGTTGACCACCATGGGAAAACTTAAGAATCGGGCCACC 
AAGCGACCCTTCGCCCACAAGGTCACCGCCGCCCTGAAACGAGGAGGTCAGCGTCCGAGTAGCGTGAGTGCGGCAGCTGCGAGTGCGGCAGCGGC 
CTCAAAACCCAGTAGCAGCACCACCACCGCCGCCGCCGCCAGCAGCACCACCCAGAAACAGAAATTGGCCACCAACAGTTTCCAGCAGAAGCAGC 
AGCAAGCGAAGCCCCTGCAAAAGGTCACACCTTCGATTGGCGGTGGTGGATCATCCAACCCGTCCTCGGCATCCATTCCCTCGGCTACCACGAAT 
TCCGCAGCTCCCTCCTTGCCTATCGAGCAATTGTATACCCGTACTCCCCAGGCCAATATCCCATTGGCAGCAGCAGCAGCAGCAGCAACCTCATC 
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CGCCTCAGCAGCGGCAGCAGCAGGATCAGCAGGATCGGGAGCCACACCAGCCACCTACAGCAGCCAGCATGACAGCATGGTATACGATGGACATG 
TCAATGCGTCGGCCTTGATCGACAGCCTGAACACAAACGCTCGGGAGGAATACTACCAGCAGAAACTAGGAGCAGGCAACAAGGATAACAAGGTA 
AGGCATTTACCACTTACCTATACTTTTGGAAACTTATCTTTTTACTATTCGCTAAAAGCTAAAAGACCATTTAAACGCTAGTGAGATATATTATT 
TAAAATTTCTAATAAAGAGAGATCTTTCAAATAAGACCATAATTAAGGTAAAATTTTTGTGCACTATAAATATTTATAATATTAAACCCTTATTG 
AAAGGGTTAATTTGTCATCGTTTGCAAAACTAGGCAGCCATATTGATAGTAACAATACATTCTTTTTACTATAAACTGAAAATCATCCAATAAAT 
AGCTCAATAACAGGGCCCAAGGTAGCTGACATATTGACTCCTTTCCAAAATACTGTCCCATCTTTGGCTTAATTAGCAATGACCACCAAAAAGTA 
ACAAAGCTCGAAGGTTAAATGTAGAAAGGGGAAAGGCGGCGAACGCCTGTCAAGCTAGATACATATTTCCCAAGACAATGGGCTAAGTCGCCGGG 
ACAGAGACATCTCTGGACAAAAAAAAAAAGACGGAAACCGTTTCGGGCTATGTAAACAAAAACTGGAAGAACCGAAGTAAACCCCGTTGAATGTC 
AAATGCAGAATTATTGGAACAACTTGACTCTCAAACTATAACAACAAAGCAACAAGCGATGACAACGACCATGACAACACCAACAACACGAAGCT 
ATCGAAGGCTTAGTAGGAGCCACAAAAAGTTCAAAGCCAAACAGAGAGAAAAATTGTGCTTACTAGAAAATATATGAAAGTGTTATAAATTAGTT 
ACAAGCTAATTTTTATCTTCTTCGTCATTTTATGCTTTTGTATTCAAATTATTCCAATTGGAAATATCGATCGGCAATTGAATTTTATTTGAAGC 
ACCTCTCTCTGCGCTTCTCAAAGTATGTAGTGTGTTGCTCGGCTTATCTTCTTTAGCTTTTGATCTTATATAACACAACTAAAACATATGTTTCT 
AATCTATGGATCTGAAAGCCACTAGATTTAGATATTTAGTGTCGTTCTTTGGCTGGTTACAAGATAACATTTTTTGTAACTGTAAGGGGGTACTG 
TCTTGGATAGTTGGCCAAAGCTGGCCACTTGTTATA 
(SEQ ID NO: 1276) 

Exon: 1001.. 2551 
Start ATG: 1001 

Transcript No. : CT32253 

ATGGTGCGGTTAAGTGCAGCCTGGAGCTGCTGTTTTTTGGCCCTGATTGCGTTGGTATCCACTGTCGCCTCCTCCGCTGCGGATTATCCGGGTGT 

CATCGAACTGGTGGGTTATAGCAATCGCCGCGCACGGACTTACTTCCAAGAGCAACCCCGCTCCAATCCCAATCCCAAGCCGGGTTTGACTTCGG 

CCAAAACCCGGGCGGATATTGAGACCGTGAAGCGGAATATCCGTAAATACGATCGTTTGCTAGAGTTGGATACGAAAAATCTGCCAGAGCAGCAA 

AAGGATATGCTCGCCAGGATTGTGGAGCGAGTGGCCAGGCTCAAGGAATCCCTGGACATTGAGGAGCAGCAATTTAACCAGCAGGAAGCTACCTC 

CTCCAGGAGCAGTAGTAGTACCACCACTAGTGCATCCAGTACTCAAGAATCGCAAACAGAAGCGGAAACGGAGCTGGATCAGGAGCCGATCGAGC 

AGAGCACACAGCTGCCCACTGGAGCCACCGAAGCTGAAACGGATCCCACCCTAATCCTAGACGAAACTACGCTGCTGGAGCTGCAGAACACGATG 

AACGAGCAGGCTGCCATGAGCACCATAATGAAGGAAATGCCCACGACCACGGATATGAGTCAGCAGGAAGCGGAATCGGAGGTCCCCGACGAGGA 

GGGAGAGACCACCTCACCAAATCTGACCACCCAATCGGAGGCAGCGACGACCAGTGATGAATCCCTCACCGATTCCACTGATCCCGATGATCAAT 

T C GT AG C T GC C AGATC G AAT AAC AAT AT AGC GGC C AT AACGGC A GCGGAT GAT GCC GAT GGC AC CACC AC AACC AC CGCCAAT AC GGGCAGTCAG 

CAGCGCACGTTGACCACCATGGGAAAACTTAAGAATCGGGCCACCAAGCGACCCTTCGCCCACAAGGTCACCGCCGCCCTGAAACGAGGAGGTCA 

GCGTCCGAGTAGCGTGAGTGCGGCAGCTGCGAGTGCGGCAGCGGCCTCAAAACCCAGTAGCAGCACCACCACCGCCGCCGCCGCCAGCAGCACCA 

C C C AG AAAC AG AAATT GG C CACC AAC AGT T T C CAGC AG AAGC AGC AGC AAGC GAAGCC C CTGC AAAAGGTC AC ACC T TC GAT TGGC GGT GGT GG A 

TCATCCAACCCGTCCTCGGCATCCATTCCCTCGGCTACCACGAATTCCGCAGCTCCCTCCTTGCCTATCGAGCAATTGTATACCCGTACTCCCCA 

GGCCAATATCCCATTGGCAGCAGCAGCAGCAGCAGCAACCTCATCCGCCTCAGCAGCGGCAGCAGCAGGATCAGCAGGATCGGGAGCCACACCAG 

CCACCTACAGCAGCCAGCATGACAGCATGGTATACGATGGACATGTCAATGCGTCGGCCTTGATCGACAGCCTGAACACAAACGCTCGGGAGGAA 

TACTACCAGCAGAAACTAGGAGCAGGCAACAAGGATAACAAGGTAAGGCATTTACCACTTACCTATACTTTTGGAAACTTATCTTTTTACTATTC 

GCTAAAAGCTAAAAGACCATTTAAACGCTAG 

(SEQ ID NO: 1277) 

Start ATG: 1 

MVRLSAAWSCCFLALIALVSTVASSAADYPGVIELVGYSNRRARTYFQEQPRSNPNPKPGLTSAKTRADIETVKRNIRKYDRLLELDTKNLPEQQ 

KDMLARIVERVARLKESLDIEEQQFNQQEATSSRSSSSTTTSASSTQESQTEAETELDQEPIEQSTQLPTGATEAETDPTLILDETTLLELQNTM 

NEQAAMSTIMKEMPTTTDMSQQEAESEVPDEEGETTSPNLTTQSEAATTSDESLTDSTDPDDQFVAARSNNNIAAITAADDADGTTTTTANTGSQ 

QRTLTTMGKLKNRATKRPFAHKVTAALKRGGQRPSSVSAAAASAAAASKPSSSTTTAAAASSTTQKQKLATNSFQQKQQQAKPLQKVTPSIGGGG 

SSNPSSASIPSATTNSAAPSLPIEQLYTRTPQANIPLAAAAAAATSSASAAAAAGSAGSGATPATYSSQHDSMVYDGHVNASALIDSLNTNAREE 

Y YQQKLGAGNKDNKVRHLPLT YT FGNLS FYYSLKAKRP FKR* 

(SEQ ID NO: 1278) 



Celera Sequence No. : 142000013383859 

TGCCTATGGACTTTAATTTGATTTTCCCAATAGAATTATTAGATTTTCTTGACTTTTGCTGCGGGGACGGCACTCGTTGCTCTGCGGATTCTGTA 
AACGAGCAGATTGGAGATTTGTCTATCAATTTGGGTCACTAACTTGGCCTTTCGCGTTTGCTGTTTGGATCTTCCACAAAGTGCTCTTTAAAAGT 
TGGCTGACTTCATTAACTTAAACTTCGAATAAAATAGAAAAAATAAAGTATATAAAAAAAATTTAAGTATAACAGAATTTATATAAAGATCGGTG 
GTAGCTGTTAAGTTTATTGCAAATATATGTGAGTGTAAAAATTGATACAAAAGATATAAGCACTCACCGCGGTCTTAAGCTACTAAGTCATCAAT 
ATACCTAAAGCATATGTTGATAACGATTTGAACTAATGGTAAAGTATACCTTAGAAATAAATTTACGACTATGGTTGCGTATCAGCCAACTTCAC 
ATATTAAATTCACTTTGTGAAGGCTCCCAGCAATCGCTCCAGGCTGATTGAACATTTCACTTAACATATATTGCTTGCTTTTTTGTGATTGCAAT 
GTTTCGCAGGTCTAACTATTACTTTTTGAAATTAATTACGTATTTCAATTCTTTAGGTAGTTAAGTACAGTTTATGTTCGATGTTCGACTTAGCT 
GTTGTTTGTACAATAGTTCGTGAGTTTAACAATTTATACAGAATACTTATTGGCTGAGTCGTTAGCTGATTGTTTTTTTTTTTTACAAAATTTGA 
ATTTTTGGTATATGGTAGGTATGTGCATTGCGGATATTTTCAATTCAATTGCTCCTTTGCGTTAATTTGCTTTCTGAGTGATATGCGAACATTTT 
TCGATACATTGTCGGCTGTTTAGCAAATAAAGAGAGACGCGTCGCTCGGTGAACTGTTGCATGTGTTTCTTGTGCGCTTTTCTCAGCTACAGTAC 
ATGTTTTGGGTTATATATGTACATTTTGAGTTTTGTGTGAACGAGGGCCCCTAAATCATGTGCGACATCATGCAACGACTCACATCACGGTTGGC 
GCGATGCAAATGACCTTGATAATGCTGATGAATGCGCTGATATTGATGTGGCTGCTGTGGATGTTGCTGTTGCTCTTGATGTTGCTGCTGCTGCG 
CTGGAAAACTCGTCTCCGATTCCATCATCATCATGCCGCCGGTGGCCTGCATCTCGTAATCCATCAAAGAGGCCTGCGTTCCAATTATTTGTGTG 
GCAGGCAAGGAAAAGAAATCTGCAAGGCAATAAACCGATCATTAAGTTACATTAACTTATATAAAATAATTTACAATTTATTATAATTGTATGTA 
CAAACTGAAGTCTATTCCGATTTTTCTCAGTGCAACGGCAATGCAACGCAACACGTTGACACTATCGATGCTTGGCCCAATTACCGATCGTTTAT 
TTATAACCAATTCGGAGCAGAGCAGTGTGTTGTCATTTCTTATCTACGAGTTAGTTTCTTTTTTTCTGCTTTTTGTTTGTGCTGATAACCTTATC 
AACGCAATCGGAAACTATAAGGTAACGAAGGCAAAACGGAGTGCACAGATTAACAACAACTAATTAAGTGTAGTCATAGTTGCAAAAATACATAT 
GTATATTTATTCCAATAGAAAACATTTAAAATTGTATTTCTATTATTTTGTACCCCTTTTAAAAATATTAATATAGATCAAAAAAATTGAGATAT 
ATATTTAAGTAAATCGTGGACAAAACTAAATGAAATATTCGTACATATCATTGAAATCACTTAAAAACATTTAAGAAGCGTTTATACCCTCAGAA 
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AGAGTATAGATTATTGATTCACTCCCGCAGATATACACATTTCTTTTACCGGCTGTATATATTTACGCCCACACTGCCTTCCCCCTTTGTTTTCT 
TCTATTGTTGGAACACTTTTGCGAATGCAGATTGATTAAACTGGCGGTGTAAGAAGAATCGTAACACTCTGTTGACCCAGTGTACATAGTCATCT 
CCGCAGTACCTCCAGTTAATCCCCACTACTTGTGTGTTTTGTTTAGTAGGCCAAACACACCGATATACATATATGCATATGTATGTCTATAAAGA 
CTGAGTCAAGCCGGTTTTTCTTTGCTAAATTTAAGCGGCTGTCAAGTGCTATCGTTTCGAGTCCGGTACCCTCTGTATGCCACAGATAGTCCGGA 

TCCCACCGAATAACTGAAC 
(SEQ ID NO: 1279) 

Exon: 1204.. 1001 

Start ATG: 1204 (Reverse strand: CAT) 
Transcript No. : CT32502 

ATGGATTACGAGATGCAGGCCACCGGCGGCATGATGATGATGGAATCGGAGACGAGTTTTCCAGCGCAGCAGCAGCAACATCAAGAGCAACAGCA 
ACATCCACAGCAGCCACATCAATATCAGCGCATTCATCAGCATTATCAAGGTCATTTGCATCGCGCCAACCGTGATGTGAGTCGTTGCATGATGT 

CGCACATGATTTAG 
(SEQ ID NO: 1280) 

Start ATG: 1 {Reverse strand: CAT) 

MDYEMQATGGMMMMESETSFPAQQQQHQEQQQHPQQPHQYQRIHQHYQGHLHRANRDVSRCMMSHMI^ 
(SEQ ID NO: 1281) 



Celera Sequence No. : 142000013384334 

AAGTAAAGCCATCTACTTTTAGAACTTTTTAAAGGTAAGCCATTAAAAATTATTTAACATATTTTCGTGTCTTAAAATATGGGTTGAGTATGCAT 
GCCTAAACTTTCTATATGACTGTGAATAAATAGTGACATTTTTGCAAATCACAATCACAACTTAGAAGTGCGGATATGAGAATATTGTAAGTATA 
CATATGTATACCAAACCGAAAAAGCACGCTTTAGTCGAGTTCTTCGACTATCACATACCCATTACATGGCTAGCAATGGAAAAACAAAAACAAAT 
TTGTGAAGAAATTAATTCAAAATTGTGGACATGAAAGCTTTTTGCAGTTTGTTGCCGATACCAGTGGACGTGACCAAAATGTTCTCGTTTTGAGA 

AGT T C GAT T T T AAAAT CT AGCC AAG AT CT AAT CAT T CTC AGC AT ACT C AAAT C GAG AT C AAACC ATT CT C AG AT T G C AAAT GAT TTTGCTTT CAT 

TTGTCTTCCTCTCTTTTTTCAGACTCTTCGACAATCTTACACCCAATACTTCTTCCGAAAAATAAAAAATATAAACGTTGAGATCATTTTCGTCT 

CGAAACCAAGAAGGTTTATTTATTCACTGAGTACGCTGAGCGCAACCGACCTCTATTTTTTTGTTCTCAATTTTTCCGAGTGCAGGAAATGCTTA 

CCTCCCCGCCCGTTTCGTACTTTTTATCTAGCCGATAATACTCTTTTAGTCTACAAGAAAAGGGTATAAATATATAAATACAAATCTCGCACTAT 

TTGTCTGTATTAAATTATAAATTATAAAAATGTTTGGATATATTTCATACATATCGTTCGAGGGATAGCTTTTTATTATTGACTCATTTTATCTA 

ATGAGTCATATCCCTTTAGGTTTGCAATATATTTTTTTTAGCATGCATAGTGATAGGCTTTTAAAACTTTGTTGCTTAAATTGCGTTATTTATTA 

AAGTAATAATTTGTAAATTATTGTGGAAATAAAGCCCGCTTGTTGTTTTTTTAACTTAATATCTTTAGATTAGACTCACCGGCCATAAGTTCGCT 

GTTTTGCTGATCCTGCAGCATTGGGATCACCGGCTCGCCGCCAGGTGGCGCATCTGGCATATCATCGGCCGTATGATCGTGACCGTGGGCGGTGG 

GGGCGTGGCCCTGGGTGGGCGGTGGCTCGGCAAAGAGCATCTCGAAGCGCAGCCGGGTCTTGATTGGACAGACCCTCAGCTGCTGTTGCTGTTGC 

ACTTGTTCGAGCGGGCCACCGATCTCCAGGTGACGATATAGAGCATAAATTTCGGTTTTATGCTCGAAATGGCTCCAAGTGCCGTTATGGGCGGC 

GTATTTACCGTTGGCGTGATGGCAGCCTGCTTCTCTGACGTCACAGCAGCCGGCTGATTCAACAGCGACAATTTCGCTTTCGACGGCACCAAACT 

GGCGTAAATAATCTAAAAATAAGACGAGGTTTTTTTTCCTCTGTACACGGGGAGGCTGATGCCGCTTCTTCTTCTTGTTATTTGCCCTTTGCTTC 

TGCTGAAGCTGCTGCTTCTTCTTGCTATGTTCCCTTTCCTCTTGCTTTTTCTTCTTATCCTCCTCGCCTGCTAGATTCTCCTTCTTCTTACTACT 

CTTATCCTTTACCCCTTGCTTGTGACAGAGCTCCTCCTCTTTCCGTTCCGCGTGCAGTTGTTGTTGTTGTTGCTGCTGCTGCTTGTTTGACCTTT 

TCTTTTGGCCAAACTTTCTGCCCCCGTCGACGTCACAGCTGATCGATTTGCTGCACTCGCTCGGCGATTTATCCGCCTCCTGGCTTTGCCTGCTT 

TTTCGATTTTCCCAATTCACATTCGTCGTCGGCCGTGGGAATTCTGCTATCATAATTTGTATGCCACTTATGCCATAATTATCACACTCACTCTC 

ACTCACACGCACAAAAACGCGAAAAGAGAGGCGGGCGCGCGCTTGTGTTGTTGTTGATATATTCTTTTTTCACCGCTTAACCAAGCTGAAAAGTT 

GAAAATCTTTACACACACCAAAAACACACACACACACAGACTCAGAAAACTTGGGAAATCGAATTTTCCGTACTATGATTTTCCGGCCATAAATC 

TGCACGGGCGTAAGTTTCGGAGGAAACTGCTGCAGAAAAGCTCCTGGTCGCAAAGGAAGAAAAATCGGACGGGCGAAGCGTTGCCAGACGATGAC 

AGCGATTTCATTTGTTTTGATTTAGTTGCCGGAGCGATGTCACCTCAATAAGATACCGAAAAATACCAAAGGTATTTCGAGCTGCCAATCTTTTT 

TCCGACATCAAACTTTTCTAGCCATTTGACTTTTTTTTCAAAAGTATTTATAAATTTTTTTATATTTAAATGCACGAAATAAATACATATATCAA 

TGTCTATATAATATTTGGAAATATAAGTTTCCTTCGTTGTCATTGGGTCTTTCGACTTCTCTTTATTGACGCATCGCCTTAAACAATTCCAACTA 

AAAATTTCCAATCAACACAGACAGCTACTTCGCTCCGCTTAGGCAAACAGACGCAGTTCATCGCGCGCCTCGATGCACTTGGGCGGCGAGGGAGT 

CTCGAACTCGCCCAGCTCGGCGTCGTAGTAGAGGGCGGCGCGGTAGGGACGCAGCTCCCATGGCACCTGCTGCTCCATTGTGGTCAGCGTGACGC 

CCAAATCCTCAAGCGTGGGGACACCAGTCAGAACCTTGTCGGTGACAGCCTCGCGTTCGATGCGAGCGGGATGCAGACCGCCAATCGGGGTGCCG 

GGGCAGATGAAGCTGTTCAGCTTGGCTTTCAGCAGGAAGGTGGGGTCCCAGCGCATGTCGTAGCGCATGTAGCCCCAGCGCTTCTGGTCCTTGCG 

CATCAGGCGGTGGAACCAGTCGACCAGCTCGCTCAACTGATAGCGTTTAGGTCTGCAGAGAATAAAGGAAATTAAGAAAATGTACATGGTACAAA 

TACTTGGACTCGGAGATTGACTTACCCCACAGCCTGGTAGATGCGGCCAGCGCTGTCCGGATCCTTGGCGGCGTTGATGATGGCCTGGGCCACGT 

CCGAAACGTAGACGGGCTGCTTGACGGTCTTCTCTCCCTTGTGCCACAGCGGCATCGAGCGGAACTGACGGCGCCAGATGTGGGCGTAGTAGCGC 

AAGAAGCGATCCTCGGATCCGTAG 
(SEQ ID NO: 1282) 

Exon: 2159.. 1001 

Start ATG: 1858 (Reverse strand: CAT) 
Transcript No. : CT32503 

TCCGATTTTTCTTCCTTTGCGACCAGGAGCTTTTCTGCAGCAGTTTCCTCCGAAACTTACGCCCGTGCAGATTTATGGCCGGAAAATCATAGTAC 
GGAAAATTCGATTTCCCAAGTTTTCTGAGTCTGTGTGTGTGTGTGTTTTTGGTGTGTGTAAAGATTTTCAACTTTTCAGCTTGGTTAAGCGGTGA 
AAAAAGAATATATCAACAACAACACAAGCGCGCGCCCGCCTCTCTTTTCGCGTTTTTGTGCGTGTGAGTGAGAGTGAGTGTGATAATTATGGCAT 
AAGTGGCATACAAATTATGATAGCAGAATTCCCACGGCCGACGACGAATGTGAATTGGGAAAATCGAAAAAGCAGGCAAAGCCAGGAGGCGGATA 
AATCGCCGAGCGAGTGCAGCAAATCGATCAGCTGTGACGTCGACGGGGGCAGAAAGTTTGGCCAAAAGAAAAGGTCAAACAAGCAGCAGCAGCAA 
CAACAACAACAACTGCACGCGGAACGGAAAGAGGAGGAGCTCTGTCACAAGCAAGGGGTAAAGGATAAGAGTAGTAAGAAGAAGGAGAATCTAGC 
AGGCGAGGAGGATAAGAAGAAAAAGCAAGAGGAAAGGGAACATAGCAAGAAGAAGCAGCAGCTTCAGCAGAAGCAAAGGGCAAATAACAAGAAGA 
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AGAAGCGGCATCAGCCTCCCCGTGTACAGAGGAAAAAAAACCTCGTCTTATTTTTAGATTATTTACGCCAGTTTGGTGCCGTCGAAAGCGAAATT 
GTCGCTGTTGAATCAGCCGGCTGCTGTGACGTCAGAGAAGCAGGCTGCCATCACGCCAACGGTAAATACGCCGCCCATAACGGCACTTGGAGCCA 
TTTCGAGCATAAAACCGAAATTTATGCTCTATATCGTCACCTGGAGATCGGTGGCCCGCTCGAACAAGTGCAACAGCAACAGCAGCTGAGGGTCT 
GTCCAATCAAGACCCGGCTGCGCTTCGAGATGCTCTTTGCCGAGCCACCGCCCACCCAGGGCCACGCCCCCACCGCCCACGGTCACGATCATACG 
GCCGATGATATGCCAGATGCGCCACCTGGCGGCGAGCCGGTGATCCCAATGCTGCAGGATCAGCAAAACAGCGAACTTATGGCCGGTGAGTCTAA 

TCTAAAGATATTAAGTTAA 
(SEQ ID NO: 1283) 

Start ATG: 302 {Reverse strand: CAT) 

MIAEFPRPTTNVNWENRKSRQSQEADKSPSECSKSISCDVDGGRKFGQKKRSNKQQQQQQQQLHAERKEEELCHKQGVKDKSSKKKENLAGEEDK 
KKKQEEREHSKKKQQLQQKQRANNKKKKRHQPPRVQRKKNLVLFLDYLRQFGAVESEIVAVESAGCCDVREAGCHHANGKYAAHNGTWSHFEHKT 
EIYALYRHLEIGGPLEQVQQQQQLRVCPIKTRLRFEMLFAEPPPTQGHAPTAHGHDHTADDMPDAPPGGEPVIPMLQDQQNSELMAGESNLKILS 

(SEQ ID NO: 1284) 



Celera Sequence No. : 142000013384885 

G C ACT T TAC AAAAC AAC AAC CAAGT T AAC GT GAT AT T AAAG AAAT C AC T GG AGGGAC CC C AGGAC AT C G AGT T G G AGT T AAGT AT G AC AGT G T AC 
ACAAACGGAATGCCTCGTGGAAAAAGTGTAGCAAAATTATTTCTGTTTGTTTCTCAACATACATTTTAAAAATTATAATATTTCATTTGCTGCGA 
ACATGTATACCTTTAGATTCCCTACAAACCTATACAGTAGACACAAATAAAAATTATGCAGAAAAAGTACACACATACCTGCTCTGCTGATAAAT 
TTTACATTTTTAAATCATTTTAAATAAATAAAATCATCATAAATAATAGCTTATGTTCAAAGTTGTTGAATTGTTGAGTAACAAGTTTGTGATTA 
ATTTATTTTTCAAGATACACTTAAGTAAAAGTATTTTAATGTCCAAAAAACTCACGGGATTTGTACTGCTTGTGAGGACTTGGCCGCAGAACACA 
CTTCACCTCGTATATTCCATGTGCATGGATTATATAAATGCATGTGATATTCACACATTCATGCGGTTAAAAAAACTTACTCTGCCTCCTCACCG 
ACACTATTGGTAATAAZW^ATAACGTAAGTTATTTTAACATTTAATAAAGATTTTAACACTAAAAACAAGAGAGAATTCTTTTTTGCTAGTTCTT 
CGGCTATATATACCCGATATTGAAGACTACTAAAGATTACAAGGCCAATTGAAAGTAGAACAAAAAATCTAAAAAAATTTTAAACATGTGGGCGT 
TACAGCTTTTGGCGGATTGAGTTTGCGTGGTAACGTGTCGAGCATTTTTATACCCGTTAAAGGGTATACTAGATTTGTTGAAAAGTATGTAACAG 
GCAAAAACGCTTCCGACCTCAAAAAGTATATATATTCTTGATCAGGATCAATAGCCGAGTCGATCTGACTATGTCTGTCTGTCCGTATGAACACC 
TCGAATTCTGGACTATTAAAGCTAAAATGTTGAGATGAAGCATACAGATTCTAGGAACATAGACAGCACAAGTTTGTTGACCCATGTTGCCATTC 
C TG AT C AGG AT C AAT AGC CGAG AC G ATC T GGT C AT GT CCGTCTGTCCGTAT G AACGT C AC G AT T ACAGG AACT AT AAAAGC T AG AAG GAT GAG AT 
TCAGCATACAGAATCTAGAAAAGACGCAGCGCAGCGTTGACCCATGTTGCCACGCCCACTCTAACGCCTACAAACCGCGCAAAACTGCCACGCCC 
GCACTTTTGAAAAAGTGTGTTGATATTCTTTAAATATTTTGATTAGTCTTGTAATTTTCTATCGATTTCCCAAAAAACTTTTTGCCACGGCCACT 
CTAACGCCCTTCAGCCGCCCAAAAATGCTTTGTCGCCCGTTATCAACGTGGTAGCTAGTGGTTTGCTTAGGACCTCTCGTGTTTAGAAGCATTCT 
CGGCAGAGAACCAGTAGTAGCGCTGGCCTTTTTGTGGGCGTACTCTAGGTGCTCTCGGAACGTGGTATCAGGAGAAGCACTAACTAACTATTGAT 
CTAAAGATACACATTATTTTAACGTTCCAGGAAGATAGGTCTTGCGTGATAATCCACACGAACGACATGAATGAAATCATATGATGCTGCAAAGT 
ATATGTCATAGTTTTAATACTGCGTTGGTGATGCGTCGGTGATTAAGGATTCGACGTCAAATTAGTTTGGGCAGCTTGTAGAGTTGGTGCCTTAG 
AGGTAGGTTCTGTTAGCCTCGAATTATGGACGTTTGGAAATATCAAATTTTGGAAGGTGGTCGAAGGGCCAAATAAGGACCTCTTTGCGGGTGAT 
AAAAAAACTTCTGGCGTTCTGGCGTTACTCGGAGATGTATCCGGCTTTTGGGGTCTTTGTATATTGAACAGCCTTTATATGGTCAGAAGCGCTTT 
CTTCCACCTGTTACATACTTTTCAACGAACCTATTGTAGTATTCCCTTTTACTCTACGAGTAACGGGTATAAAAAATATAAGGCCCAAAGGGTTT 
TCTGTAGCACCTACCGTTTAGTTACACAATTCATAAAAAAAACCAAATCGATTTTCAAATTCATAATTTTTTTGTATAAGAATCATACGAAAAAA 
CGGTAAAAATTTTATTATGTCCTTCGTACCCGTTTATTTTTATTTTACTTAATAAGA 
(SEQ ID NO: 1285) 

Exon: 1147.. 1001 

Start ATG: 1147 {Reverse strand: CAT) 
Transcript No. : CT32593 

ATGCTGAATCTCATCCTTCTAGCTTTTATAGTTCCTGTAATCGTGACGTTCATACGGACAGACGGACATGACCAGATCGTCTCGGCTATTGATCC 
TGATCAGGAATGGCAACATGGGTCAACAAACTTGTGCTGTCTATGTTCCTAG 
{SEQ ID NO: 1286) 

Start ATG: 1 {Reverse strand: CAT) 

MLNLILLAFIVPVIVTFIRTDGHDQIVSAIDPDQEWQHGSTNLCCLCS* 
(SEQ ID NO: 1287) 



Celera Sequence No. : 142000013384703 

CGGGTGTTTACAATAAATTTCGCTGGGTGGATGTAGCTTTGTAATTTGAGTTCGCC7AATTTTTTTTGTTGTTAACAGCATTTCGAAGCGGTTGCC 
AACCCTAGACGCTGGCTGGTTGCATGGTAACGAAAAATGGCGGCGCCTGTTACACACACTCACACACACATGCACGCACAGTCTGCTGTGTTTAT 
GCACTTCATGCTGTTCCGATTTTTTTTTTGCACTTTTTCGATTTGTTGTCTGCATTTACCTTA7AATGAATTTATTAACTAAGGCGATATTGTGCC 
TCTCGCTCAATCTCGCAAAACGATGCTTTGTCTGCTGCTTCTCTGTGCGTGTGTCCGTCTCTTTCCACGGCTTTTCGTCGAGACTCACATATACA 
AACGGCGACACACTGGATTGGAGCTGGACTCCGTTTTGGATGGCTTGGCTTTGGCTGCGTACCCAAAATAAATCGCGCTTGCAGTCCCAAATCAT 
TTTAGTTCTTGACGCCTTTGTTCCACAAAAACTTCCAACCGCCGTCCATGGCTCGTGCTTTCTCTTTCGCTCGCGAGACGCTCCAAAAGTCGCGA 
GTATGGGAAGAGAATGTATCCCTCGCTCTTTCACGCCTAGAAGCGGCTGTAGCACACACAAGCACACGTAAACCCAGCCGTATGCATATACATAT 
ATTCCCTCTACTAGCGAGACGTGTGTACGTGCGTATGTATGTATGCTTTTTTCACTTCTTTCTCCTCTGCGTTCTTTGCACCAAGTGTCGCTGTT 
GCTATTTGCTACCGCCGCTCCTCTTGACGAATAAGTCGGGACTTGTATCAGCTCTAATTCGGACTTAGTTTACAAAATGCGGCTGCTTTTTTTCC 
GCGATACTCGGATTAGTGATGTAAAAAGACGTTGATGCTTCGACCACTGAAGTCAGTGCGATATGTATCGATAATTGAATGCCATCGATTGCTAC 
GATTAAAACTGCCACCGCAGTTAGCGGAGACGCTCACCATTCTCTATTCGCCATTTGAAAGTATCGGTGTAATTTGTTTTCAGAGAAATTAATTT 
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CCGTTTACTGTGCAATTCGGTGTGAAAGTGTTCAGATTTATCAATGCGTATTCTGCTTTCGACTTCGCCACCAATCTGTGCTGCAAGTTACCATT 

ACCAGGTCCACCTGGTTCCCGCCAGTTTTCTTTCATTGTGGCTAGTTGTTGTTCGTGCCTTCGATAAAGACGTTTAGAGGTGTTTTTAGAGTTTC 

GCCATCTGGTCACTATAGCCGTTTCGTTTTTTACGTGAGTATTGTGAATTTGGTGTGTTGATTTATATCTCAGTTGGAGCCTGCGTGGAAATAGT 

GTCAGTACGTTTAAAGGCATCATCGTAAGGTTAGTCGGAAAACAAATAAAAAGCCTACAAATAGCAAGCAAGAACGCTTTCGCAACCACAAGAAA 

AACGAACATACGCACGCATCCATCTGTGGTAGAGTCGGCTTGTGTGGGGGTGAGTGGATGTGTGGGTGAGCATTTCTGGAAAGTGCCGTTCTCTA 

TGTTTCGATTCTACTTATCCCTTCATCACCATTTCATCGTTGAATCATCACGAAAGAAACATCGAAAGTGCACAATCGTTTGTTATCTTTGTACG 

AAAACAACGGTGATTTCCACACAGGCATAACCTGCAAGAGTAATACGATACAAATTTTGATCTGATATGACCTCATTTTAGTCGGTTTGAATTTT 

AAAGAGAGCGACTCTGCAAACTGCAGGGATATGAACATACAAAATTCACAAATGATTGTTTGCATCCTAGTTGCTTATATTCATTTACACGCATG 

CTTATATGCTTACATCTGTTATCAATAATTGACGACTTGGTGCATGCGAAGCAAACGCATTGAAATGACTATTGGGATCCACACACTGTTAAAAA 

AC AC ACT G AT T GC ACT G C C AC AC AT TT C TC AAC T AG AAC CT AAT GT T T TAT AC T G AAC T T AAC T G C AGG AAAGC C C AAAAT G C C C AG CAT C A AGT 

TGCAATCTTCGGATGAGGAGATCTTTGACACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAG 

GACGATGAGAATGCCATTGTGCCGTTGCCCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCA 

GCCAACGGAGGATGATGAGAGCAAGGAGAAGCGCACAGACGACATTATCTCATGGGATGCAGATTTCCTAAAAGTCGACCAGGGCACACTGTTTG 

AGCTGATATTGGCAGCGAACTATCTGGACATTAAGGGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAAGACTCCCGAG 

GAAATACGCAAGACCTTCAACATTAAGAAGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAG 

CGCGGCATTTCGCGGGACCAACATTAAGTTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACT 

TTCATTTACATTTTTTTTTATTTTTAACCCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATT 

TTTTGTCGTAAATATATCTCATATACATACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACTTTAATAAATA 

TAACTACCCAACACAAAGACACAGCCTATTTTGTGGGCTTCCCGGAGGGGTTAGGGTCCAAGGACTTCTCAAATTAAGAAGTGCTCTTTATTCGT 

TATCTTGTGTTTAGAAACGTTTGCAAGCGTTTGAAATGGATACGCATGCCATTTATGGTCCAGTCGGTAAAATTTTACGTGTTCCCAAGAAGGTA 

CATATATATCTTAGATTATCTTATCTATAGAAACTACATTCATAAGATCACGCTTTCGGCGAATAGCGAGTAATACTGTAGCTGCCACTTTGCCA 

CTTAGTGTGCAATTGAAATGCGCGCCAACGCATGTTCTCATATGCGGAGCGTTTAGCAGCCCTTAAAAAGCTTTTCATGTTGCTTTTAAAAAGCT 

ACGACGGGTGTGGAAATTAGATTTGTGACGCTTTATATATCTTTTTTTTATATCGTCTGGGGTACAGCATGGTTTTAGGATTTTAGAAGAAAGCA 

GTAATTTTGTAAGAATCGAAGAACGGGGGGATATTTATAATTTTCATATTAACCTTGTCTTTGTTTGCTTTTGAAATTATGAATACTTCGAAAGA 

TAATTATCTTGTGCGCATTCCATTCTGGCTTAGACTGGGTTGAGCTAAAAATGATTTGAAATACAAATTTTTAGGTGTCCGCTGATATATAAACA 

GTATGATGTTTTTGCACTTTTTTCCCTGAAGTTTTCCTTGTTGTCTCGCTTGCCAATGCTGAACAAAGTAGCCAAAGGACCTTTACTTGAAGCTT 

TGAGAGCGGAAAATATATTCGGGGACAATTGCTTTAAGTGTGCACTAGGCTTGGCTAGATGCCCGTGCCAAACTTTCTTTGGAATTGATCGCTGG 

CCTTAGCCAAAAGTGGAAAAAAGAAAGGATAGGATAGTGCCGCACGTATTAAGGGTCTTCTTACAAAAGGTAACTGATATTAATTCATTTGCATA 

CTAAGCAAGCTAACTCATCTGTTTATTGCAAACAAGCGCCAATTGAAGTTTTATCGAAGCTCAG 

{SEQ ID NO: 1288) 

Exon: 1001.. 1269 
Exon: 1980. .2769 
Start ATG: 1980 

Transcript Ho. : CT32694 

CCATTTGAAAGTATCGGTGTAATTTGTTTTCAGAGAAATTAATTTCCGTTTACTGTGCAATTCGGTGTGAAAGTGTTCAGATTTATCAATGCGTA 
TTCTGCTTTCGACTTCGCCACCAATCTGTGCTGCAAGTTACCATTACCAGGTCCACCTGGTTCCCGCCAGTTTTCTTTCATTGTGGCTAGTTGTT 
GTTCGTGCCTTCGATAAAGACGTTTAGAGGTGTTTTTAGAGTTTCGCCATCTGGTCACTATAGCCGTTTCGTTTTTTACATGCCCAGCATCAAGT 
TGCAATCTTCGGATGAGGAGATCTTTGACACGGATATCCAGATCGCCAAGTGCTCCGGCACTATCAAGACCATGCTGGAGGACTGCGGCATGGAG 
GACGATGAGAATGCCATTGTGCCGTTGCCCAATGTGAATTCGACGATTCTTCGCAAGGTGCTTACCTGGGCTCACTACCACAAGGACGACCCCCA 

GC C AAC GG A GG AT GAT G AG AGC AAG GAG AAGC G C AC AGAC G AC AT T AT C T C AT GG G ATG C AG AT T T C C T AAAAGT CG AC C AGGG C AC AC T GT T TG 
AGCTGATATTGGCAGCGAACTATCTGGACATTAAGGGCCTTCTGGAGCTCACCTGCAAGACTGTTGCAAACATGATTAAGGGAAAGACTCCCGAG 
GAAATACGCAAGACCTTCAACATTAAGAAGGACTTTTCGCCCGCCGAGGAGGAGCAGGTGCGCAAGGAGAACGAGTGGTGCGAGGAGAAGTAAAG 
CGCGGCATTTCGCGGGACCAACATTAAGTTGAAACAGCTAGGGGATTCGGGAACGAATTGGATTTGCAGCATTGCAACTTTACTTAGTTGCTACT 
TTCATTTACATTTTTTTTTATTTTTAACCCCAGCAGAGACTCGATTTAAATTGTGTATAAATGATCTGTTGCTGATTTGATTCGCGGGGTTCATT 
TTTTGTCGTAAATATATCTCATATACATACATATGCGAGATTGTAACACTCTCTTTAACCTATTGGAGTAACACTTGATTTCACTTTAATAAATA 

TAACTACCCAACAC 
(SEQ ID NO: 1289) 

Start ATG: 270 

MPSIKLQSSDEEIFDTDIQIAKCSGTIKTMLEDCGMEDDENAIVPLPNVNSTILRKVLTWAHYHKDDPQPTEDDESKEKRTDDIISWDADFLKVD 

QGTLFELILAANYLDIKGLLELTCKTVANMIKGKTPEEIRKTFNIKKDFSPAEEEQVRKENEWCEEK* 

(SEQ ID NO: 1290) 

Classification: cell_cycle_regulator 
Gene Symbol: skpA 
FlyBase ID: FBgn0025637 



Celera Sequence No. : 142000013384583 

CTTTTACAGCAAGTCGAACTGGGAGCTGAAGTCTACCAGCAATTACTTGCTATTTACCTCTATCAAAACAAACTGTAAGTGATTGATATAAAAGA 
AATACATATTTTAATCTTGTGCTACAGGGCTGATGCCAAGTTGTTGTGGATGAGAGTTCCAGCTAACCTAAGGGATGACAAGGAACTGATCCAGC 
TGAACCTGCTCAATATTGCCCTGCAGAACAATAACTATGCCGATTTCTTCAAACACATCAAGTATGAGTGGTCGGAAAGAGTAAAGTCACCTGTG 
GAGGATCTTCTAAGTATGGCTATGCCTAAAAGCTTACTAATTGGATATAAAAACTTATAATGTATCTACTTAGATAAGCAACGCGAAGAGCTGTT 
CAAACTAATGGGCAGCGCATATATGTCCATTTACCAGCACAATCTACTGGAATTGTCACTAATGTCGGAGGACGAACTGAAACACGCCTGTGCGG 
CCTTAAATTGGACTGAGGAACTGGACGGTGACCGGGTAATCCTGAAGCCCAAGGTTCAGGAAGCACCGCCAGCTCGTGGAAACGATGACCAGTTG 
CTTAAGTTGACCGAGTTTGTAACCTTCCTAGAGAATTAAGCCTCAATAAATTTCAATACATTTAATTGGATTTTTGTATACATTTGAAGAAAAGT 
TATGGTCACACAGCCTGTGTCAATCTAAAAAACACTCGATTAATATCGTTATTAGACATATTTATAAATATTAGCTAAATCACCGAATTGGCGCC 
ATTGGGTTTTGTACGAATTAGCTTTTGGAAGTATACCTAAAAATTAGGAACTAGAAACTAAGGCTACAGAATTATAATTTTCCATGTAATCTATG 
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TAGATATATAATCATATTCTTGGCCTAATTTAATAAGTGTGATTTTAATATTCTTCAGAATTCAGTTAAATAAAATATATTTCAGAAAATCTATC 
GACGCGCGGAAACGCCGATAGTATCGAATGATAACCGATATGCGTTCCACCTCTACCGTTGTGTCACTTGAAGAAAAAAAATATTAATTTGCACC 
AACGACGTACGTTTTTCGTTCGAAAAGAACAGCAATCAGCGGATAATTGTATGGATTCTTAGAAAATTTGCAGCGACGAAATATAGTACCTTGCG 
AAATTATTTTGTAACCTTGAAGAAGTAAACTGCGAATCGCCAGAGGGCCAAATAGAAATCTCGTCGTTTGCCTGCAGAATAGAAGACCCGATTTT 
CAGTGCGCTTGTGTTTGTGTGTGAGCAGCAGAAACAGCAGAAAAAATGGCAGCAAGCGACAAATCGGTCGACGATTCACTATATCCCATTGCGGT 
TCTAATCGATGAACTGAAAAACGAGGACGTTCAGGTTTGTCATGTCCTAAAACCTGAAATCCAATTGTATCTCGCGAAGAATGTGCAGAAATATT 
CTAGTGATTAAATTTTGGGGTAGCGTGCCCACACTCCCTGCTTTATCTGCGATTGTATTTGTGCAAAAATGCGTCACGGTGTGTGTGTGCGTGTG 
TGCATGGGCATTTGTGTTGGTGCGAGTGTGTGCATCGAAATTATACCGAATTTCGATATACACAAGATTAGCGTTTTGCGAAAAAAATTACTATT 
ATCAATTAATTGCAATTATGGGTATACGTCTTTATCCGCTTTTTATTTGTAGATTTTGTTATCCTTCATTTATGTATGTATAAGTATAAGTTTTT 
ACGCATTTGCTATTTGCAAAAACAATTTACAACTGCGCGTGTTAAAAATAGATCAACTTCAATCAATAGTAATACCATAACTGATTTTAAAATGA 

G AT AGT AAG AT G AAAT AT T T T C GT T AC AT C G AAT AAT C T AAT C AC T T AT C T T ATC T AAAAAAG AT AT CT T AAGT AAAT AT AT AT GCT ACGT T T T T 
TTTTGAAAATAAAATGTAGGGTACAGGTTGACCACCGCTTTTGTAAAAAAAAACCAATAGCCCTATCAATAATAACCTCAAAATCGTAGAATTAG 
TTACCACTTATCCTAAATTTGAATACCCTATTAAAATAATCATCTTTAATATTTTCCCTTACCAAAACAGCTTCGGTTGAACTCCATCAAGAAAC 
TGTCCACCATTGCACTCGCTTTGGGCGAGGAGCGCACACGGTCCGAGTTGATTCCCTTCCTCACCGAGACCATATACGATGAGGACGAGGTACTG 
CTGGCCCTGGCCGACCAACTGGGCAACTTTACTAGTGAGCAAAGATATAATTTCTAATGAAAACATTTCCAACCATGTTCCGCCGTTACTTTCCA 
GGTCTCGTTGGTGGGCCAGAGTTTGCCATGTACTTGATTCCGCCCCTCGAGAGTTTGGCCACCGTAGAGGAAACCGTGGTGCGAGACAAGGCTGT 
GGAATCTCTACGCACCGTGGCCGCTGAGCACAGCGCCCAGGATTTGGAGATCCATGTGGTGCCGACACTGCAGCGATTGGTTTCCGGTGACTGGT 
TCACCTCACGCACCTCTGCCTGCGGCCTCTTCTCGGTCTGCTATCCACGCGTCACACAGCCAGTGAAGGCCGAGCTGCGCGCCAACTTCCGAAAG 
CTCTGCCAGGATGAGACACCCATGGTGCGCCGTGCAGCGGCCAACAAGCTGGGCGAGTTTGCCAAGGTCGTTGAGACGGAGTATCTGAAGTCCGA 

T T T G AT T C C C AAC T TT GTC C AGC T GGCAC AGGAT GAT C AG GT AAGC ACC T GAT T AAG AC C AAT T T AAC AAC AC TC C AAAAT AT C AAT CCT T AT C C 
TTAATTTCCAGGACTCTGTCCGTCTGCTGGCTGTAGAGGCATGCGTAAGCATTGCCCAGCTGCTGCCTCAGGATGATGTAGAGCACGTAAGTTGA 
TCAGCGCTGAAATTGTTAATACCTGTTGGCTTACAAAAAATATTCCATAGCTGGTTCTGCCCACGCTGCGCCAGTGCGCCAGCGACTCTTCCTGG 
AGGGTGCGTTACATGGTGGCCGAGAAGTTTGTTGATCTGCAAAAGGCTGTGGGCCCAGAGATTACTAGGGTGGACTTGGTGCCTGCCTTCCAGTA 
CTTGCTCAAGGATGCCGAGGCCGAGGTTCGCGCTGCAGTGGCCACCAAGGTGAAGGACTTCTGCGCCAATCTGGACAAGGTCAACCAGGTGCAAA 
TCATCCTTAGTTCCATTTTGCCCTATGTCCGCGATCTTGTCTCGGACCCCAATCCTCATGTGAAGTCAGCTCTGGCCTCAGTGATCATGGGCTTG 
AGTCCCATGCTGGGCGCCTATCAGACTGTGGAGCAATTGCTCCCCCTGTTCCTTATTCAACTCAAGGATGAGTGCCCAGAAGTGCGCCTAAACAT 
CATCTCAAACCTGGATTGCGTTAACGACGTCATCGGTATCCAGCAACTGTCACAGTCGCTTCTGCCCGCCATCGTCGAGCTGGCCGAGGACTCCA 
AGTGGCGTGTGCGTCTAGCCATCATCGAGTACATGCCTGCTCTGGCCGGTCAGTTGGGTCAGGAATTCTTTGACCAAAAACTGCGCGGTCTCTGC 
ATGGGATGGCTCAACGATCACGTGTACGCCATTCGTGAGGCAGCCACCCTCAACATGAAGAAGCTCGTCGAGCAGTTCGGAGCTCCCTGGGCCGA 
ACAGGCCATAATTCCAATGATTCTGGTTATGTCGCGCAACAAGAACTATTTGCACAGTAAGTTATCGAATTGAAATTGTAATGATCAATATATGC 
TTATGGTTCTTATGATTTTTAGGAATGACTTGCTTGTTCTGCCTGAATGTTTTGGCAGAGGTCTGCGGCACAGATATCACCACCAAGTTGCTGCT 
GCCCACAGTTCTCCTGCTTGCCGCTGATCCCGTTGCCAATGTTCGTTTCAACGTGGCAAAGACCCTGCAGAAGATCTCGCCCTTCCTGGAGGCCA 
GCGTCATTGATGCCCAAGTAAAGCCCACACTCGACAAACTGAACACAGACACAGATGTGGATGTCAAGCATTTTGCTGCACAGGCCATTGCCGGC 
ATAGCTGCAGGTAAGCAATAAACAGTCTAATATAAACAAGGTTAGCGTTTGACAGGGATAGAAAGGGTCTAACTTATAGAACAATATATAACATT 
ATTCATTATCACAAGTTTTAAGGCATGCAGGGAATCTGAGAGTTACACAAATTCATAAATTGTCAGTTAGTAGTAAGACTTTAATTTTAATTAAA 

ATC C T G C C AT AG AAAT GG AAC T G AAT GT GT T TAG AAC G GC AAT AC C G C C G T GCATG GGC GC C AAAAT T G AG G C GGC AAT G G A CT C AAAGC T GAT A 
GTGGAACCGCCCCGACAGCCGGAGGCAGCTGACAATGACATTTTGCAGGAGATGAATGGCAAGGCAGCGGAGCCGCCCACGCTTGACTCGGAATG 
TTAGAGGATGAGTAGCCAACCACATGACGTGTGATGTGCCTCCCGCTGTCTCCAGCCAGTTCCACAAGCCAATGTGTTTAATCCTCCCCGTCATA 
ATCGTATTCACATTTGATTGTTTGTTTGTTCAAGATAACCAGGCTAGATGTCCGATCTGATTAAAGTGCGAGTGCAGTGGATCAAATATTTAACC 
GATACTAATGTTGTGCCATATTTTTTTGTCTCTTTCCCGGTATCATCGCCATCACCACGTGGTTATCTCACCATTCCCCACTCTGTGTTCGTCTG 
CAATAATGTGTTTTGTATATTTGTTTATAGCGTAATCAACCTTCTTGCATCTTTTTTTATTCTATTTTTAAAACTGTATGTATTTAACGCCGGCA 
ACAAAACATATTATGTAGCAACAACACTTAAGAACGGAAAATTGTAGTGGATCCCGGAAGGTGTTAAACGCTGCAGCCGATGCTGGATGAGGGAT 
GAATGGAGCCAGGAGTCAGAGGAAGGAGAGAATCGGAGAATCAGCATGGGAGGAGACCCACTAAACGCGACACACAACTAAACATTACTACTTAC 
ATACTTATATTATATATAAATTACCATAGGCATAACGATCATTATTAAAAACATTTGTAGAGTTTGGAACATTTGATCGCATCTGAATGCAAAAA 
TAAAATTGTAGTGAAACAATATCACACGATCTATTGAAAAGTTTAATGTGGAGCAAAATTGAATATTTAACTCCCAAAATTGTATACCAATTTCT 
AGGTTTCAGTTGTCAGTTCAAAGTTTGGAAATGTAATATTTTCCTTTTTATAGTTTCGTTCAAATTAAGCTTTATATACATTCTGATTATTTGGC 
AGGTGCATTGATTATTTATAATTTGTGAGAGGTATGGCTATGAAAAGCATAGATTATACAAACAAAATACTTGTATTTTTACATTTTAAAAATAC 
TGTTAATTTATTTTGAAATCCTAAAATCATTCCAGCAGTAAACGATTTTGATAGCAAGCTAGACTGATATCGATAGGCGCCAAACAGACACCTGG 
CGATAGTTGGGGCTTCTTTTTGGTTTTTTAATTGGATGATGGTGGTGGTGGTGGTGGTGGTGGCGAGCAGTTTCTAGCGAGAGATCAGTTGGACT 
TTTGCGACGAAGCGATACGCTTGTTTGTTTTGCCGTTCCGCTGCGCGTTGGCCAATGTTTGCCTAACTCTCGAATCGAATTCCGAAACCGAGTGT 
GGACGCGTAAGTGCCATTGAATTATTCCGTCTGACGCTCGAGCGCCATCGAATCGGATGCGGCGTGGAATGTGGTGGAGTCAATGGGGGGATCAG 
ATGCAAATTTGATTGGGCAAATGGGTGGAACCAATCAATTGTGTGTATACAATAAAACGAGGTGTTTGTGTGTGCCCAACCAAACACAGCCAAAT 
GCGGAACATTATCAGTTTACCCACGGCTGTCGCAATCTCCGCGTCCGTTTACCGTCCACTGACCTACGATCCACCTGCCCCCGTGGACGCAACAA 
GTGTCCGAATATTTATAGTCTATGGGCTCTATGCATTAATTACTCGCATATGTCAAAAACGCTGAGCTGTTATTACAATTAGTGCTTCATTATTT 
AGCCCGATTTCCCCAAGTGGCGCGTTTCGCTAAGCGTTATTATTTTTACCTTCCGCTAAACACTTGGCGC 
( SEQ ID NO: 1291) 

Exon: 1001.. 1364 

Exon: 2066.. 2219 

Exon: 2282. .2700 

Exon: 2767.. 2841 

Exon: 2901. .3666 

Exon: 3728.. 4000 

Exon: 4686.. 5055 

Start ATG: 1281 

Transcript No. : CT32725 

CTCTACCGTTGTGTCACTTGAAGAAAAAAAATATTAATTTGCACCAACGACGTACGTTTTTCGTTCGAAAAGAACAGCAATCAGCGGATAATTGT 
ATGGATTCTTAGAAAATTTGCAGCGACGAAATATAGTACCTTGCGAAATTATTTTGTAACCTTGAAGAAGTAAACTGCGAATCGCCAGAGGGCCA 
AATAGAAATCTCGTCGTTTGCCTGCAGAATAGAAGACCCGATTTTCAGTGCGCTTGTGTTTGTGTGTGAGCAGCAGAAACAGCAGAAAAAATGGC 
AGCAAGCGACAAATCGGTCGACGATTCACTATATCCCATTGCGGTTCTAATCGATGAACTGAAAAACGAGGACGTTCAGCTTCGGTTGAACTCCA 
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T C AAG AAAC T GTCCACCATTG C AC T CGC T T T G GGC G AGG AGC GC AC AC GGT C C G AGT T GAT TCCCTTCCT C AC C GAG AC CAT AT AC GAT GAG G AC 
GAGGTACTGCTGGCCCTGGCCGACCAACTGGGCAACTTTACTAGTCTCGTTGGTGGGCCAGAGTTTGCCATGTACTTGATTCCGCCCCTCGAGAG 
TTTGGCCACCGTAGAGGAAACCGTGGTGCGAGACAAGGCTGTGGAATCTCTACGCACCGTGGCCGCTGAGCACAGCGCCCAGGATTTGGAGATCC 
ATGTGGTGCCGACACTGCAGCGATTGGTTTCCGGTGACTGGTTCACCTCACGCACCTCTGCCTGCGGCCTCTTCTCGGTCTGCTATCCACGCGTC 
ACACAGCCAGTGAAGGCCGAGCTGCGCGCCAACTTCCGAAAGCTCTGCCAGGATGAGACACCCATGGTGCGCCGTGCAGCGGCCAACAAGCTGGG 
CGAGTTTGCCAAGGTCGTTGAGACGGAGTATCTGAAGTCCGATTTGATTCCCAACTTTGTCCAGCTGGCACAGGATGATCAGGACTCTGTCCGTC 
TGCTGGCTGTAGAGGCATGCGTAAGCATTGCCCAGCTGCTGCCTCAGGATGATGTAGAGCACCTGGTTCTGCCCACGCTGCGCCAGTGCGCCAGC 
GACTCTTCCTGGAGGGTGCGTTACATGGTGGCCGAGAAGTTTGTTGATCTGCAAAAGGCTGTGGGCCCAGAGATTACTAGGGTGGACTTGGTGCC 
TGCCTTCCAGTACTTGCTCAAGGATGCCGAGGCCGAGGTTCGCGCTGCAGTGGCCACCAAGGTGAAGGACTTCTGCGCCAATCTGGACAAGGTCA 
ACCAGGTGCAAATCATCCTTAGTTCCATTTTGCCCTATGTCCGCGATCTTGTCTCGGACCCCAATCCTCATGTGAAGTCAGCTCTGGCCTCAGTG 
ATCATGGGCTTGAGTCCCATGCTGGGCGCCTATCAGACTGTGGAGCAATTGCTCCCCCTGTTCCTTATTCAACTCAAGGATGAGTGCCCAGAAGT 
GCGCCTAAACATCATCTCAAACCTGGATTGCGTTAACGACGTCATCGGTATCCAGCAACTGTCACAGTCGCTTCTGCCCGCCATCGTCGAGCTGG 
CCGAGGACTCCAAGTGGCGTGTGCGTCTAGCCATCATCGAGTACATGCCTGCTCTGGCCGGTCAGTTGGGTCAGGAATTCTTTGACCAAAAACTG 
CGCGGTCTCTGCATGGGATGGCTCAACGATCACGTGTACGCCATTCGTGAGGCAGCCACCCTCAACATGAAGAAGCTCGTCGAGCAGTTCGGAGC 
TCCCTGGGCCGAACAGGCCATAATTCCAATGATTCTGGTTATGTCGCGCAACAAGAACTATTTGCACAGAATGACTTGCTTGTTCTGCCTGAATG 
TTTTGGCAGAGGTCTGCGGCACAGATATCACCACCAAGTTGCTGCTGCCCACAGTTCTCCTGCTTGCCGCTGATCCCGTTGCCAATGTTCGTTTC 
AACGTGGCAAAGACCCTGCAGAAGATCTCGCCCTTCCTGGAGGCCAGCGTCATTGATGCCCAAGTAAAGCCCACACTCGACAAACTGAACACAGA 
CACAGATGTGGATGTCAAGCATTTTGCTGCACAGGCCATTGCCGGCATAGCTGCAGCGTAATCAACCTTCTTGCATCTTTTTTTATTCTATTTTT 

AAAAC T GT ATG T AT T T AAC G CC G GC AAC AAAAC AT AT T AT GT AGC AAC AAC AC T T AAG AAC G GAAAAT T GT AGT GG AT C C C GG AAGGTGT T AAAC 
GCTGCAGCCGATGCTGGATGAGGGATGAATGGAGCCAGGAGTCAGAGGAAGGAGAGAATCGGAGAATCAGCATGGGAGGAGACCCACTAAACGCG 
ACACACAACTAAACATTACTACTTACATACTTATATTATATATAAATTACCATAGGCATAACGATCATTATTAAAAACATTTGTAGAGTTTGGAA 

CAT TTGATCGCATCT GAAT GC AAAAAT AAAAT T GT AGTGAAAC AAT 
{SEQ ID NO: 1292) 

Start ATG: 281 

MAASDKSVDDSLYPIAVLIDELKNEDVQLRLNSIKKLSTIALALGEERTRSELIPFLTETIYDEDEVLLALADQLGNFTSLVGGPEFAMYLIPPL 
ESLATVEETVVRDKAVESLRTVAAEHSAQDLEIHVVPTLQRLVSGDWFTSRTSACGLFSVCYPRVTQPVKAELRANFRKLCQDETPMVRRAAANK 
LGEFAKVVETEYLKSDLIPNFVQLAQDDQDSVRLLAVEACVSIAQLLPQDDVEHLVLPTLRQCASDSSWRVRYMVAEKFVDLQKAVGPEITRVDL 
VPAFQYLLKDAEAEVRAAVATKVKDFCANLDKVNQVQIILSSILPYVRDLVSDPNPHVKSALASVIMGLSPMLGAYQTVEQLLPLFLIQLKDECP 
EVRLNIISNLDCVNDVIGIQQLSQSLLPAIVELAEDSKWRVRI^IIEWALAGQLGQSFFDQKLRGLCMGWLNDHVYAIREAATLNMKKLVEQF 
GAPWAEQAIIPMILVMSRNKNYLHRMTCLFCLNVLAEVCGTDITTKLLLPTVLLLAADPVANVRFNVAKTLQKISPFLEASVIDAQVKPTLDKLN 

TDTDVDVKHFAAQAIAGIAAA* 
(SEQ ID NO: 1293) 

Name: Protein Phosphatase 2A 
Classification: protein_phosphatase 
FlyBase ID: FBgn000577 6 



Celera Sequence No. : 142000013384583 

TTTTTTATATTTAAAATTAAGTCCACTTGCTTAATTTACATTAATTGCACTCTAGAACCCTCCAGATATATGTAAAAAATCACAAATTTTAAAAA 
ATGCATTTAGAATAAGCTCTTAAATGCGCCTCTATTTGAGCTAGTTAACAATGTGTAACAATTAAAAGTCCATTCGCTGTGGCAAGTGTTCCTCC 
TCCACCTGGTGCTCGATCTCCCTCATGCGTAACATAAGTTTAAAGTTCTGCTTGTCCTCCTGGCCAGTAAGGAACATGGTTAGCAGGGCCTGCAG 
GCCGATTAAGGTTCCTAAGGCTGGGACAATGGGGCGTGTAAATTTCCGCCACAACAGAAACACTTCTCTGGGATTTTCCGTAATCGAGGGAATAC 
GGTACGTGTAGCAGCGGGTTGCGAACAGAAATGCGGCGAATGGGGCCAGAATTGTGGGATATACGATGCCCAGACTTGTCTGAAAGGCAGCACTG 
CGCATCTGGATGCACACCGGGCACTCGCCCAGCGGATTTAGTAGGATGGGCCGCTGAATAAAGAACTTGTGCGCCAGCATGGTAAATATGGCGGG 
AACGGCGACAATCGGGAGGTAAGTGGACAATCGACCATGTCCTCCAAGTCGCAGCTTGGTGCGATAATGATTATTTATGTAGGCGCCCGTTCCCG 
CAGCCAAAGCACTCAGAATTCCGGGCGTGTAGCGCAACGACCATCTACAAGGCAATATTTTGATCGCTTGTTAGGTAAGCGGCAACTTTAGAAGG 
CTTCATGCACTCACACTTCGCCTATTTTGTCCCAGGAAGTTATAATCTTCCATTGGTATTTGAGGGCCTGATCCTCTGTTATGACCACCGCGTCC 
CTGGGCAGTTCGTCGGGCTTGGCGCGTGATAATGCCATTTTTCATTAATTTATCAAAATATTTGGTTCTCAGAAAACAGCTGCTTATGGGTCAGT 
GTTGTATAAGTCCCGAACCGGTGTGAGAATGTTGAATACAGTGGGTATCGGAAAAAGGTCTCGGAAAGTAACGCGAAATTTACAGATGATGACAT 
ATCAGTGCTGTTAAATGGTCAAATCAGTGTTGCGTGCTGTTTTATATCAGAGGCGAATTGAAGTGTACTACCATAATTCTTGCGAGTCTGCTTAA 
AATGCCTCACAAACGTAGGAACCGAGTACATGCGAACCAAAGGAATTTCACAGCGCGACGCGTTTCTGTGGCCAAACTGTCGTCTGCACCGCCGA 
ATGTGCTCGTCGAATCCTGCAACGGTTCTCATGCGGATAACAGTTCGCCGGACAGTCCGAAAGCCAAAGAAGGTGCAATGACCAACGGGAAAAAC 
CCGGAGATCAAGTTAGCCGTGCCCAAATGCAACACCACCGTGCGCAAGCAGAACGAGGCCCATATCATATCCAACATTAAGCTGGATATGAACCT 
TAATAAGGATTCTAACTTTATCGCCCCACAGGTTAATTAAGGTTTTCAATTATGTATACTGTTTTATAATAACATATTCTGTAACTACCCCAAAG 
ATCACCAAAAAGATGAACTTCGCTCCGAATCGTGTTGTATTCGGGGTCCTGGTGCCTCTCAATGTCAACGATTCTGTGCTGGTGCCGCACAATAG 
AAAAC CGG AT GC C T T AAAG C AC AC AT C G AAAG AAT C GG AAT C AT GC AA GGC C AT AAG C G AG C C C C AG C T AG C CG AC T AT GC G G AG AAAGT AG AT C 
CCATGATCATGGCTGTTAAGGAACCAGTACTCCACTTGGATTGGGAACCAGGACCCTTCGATTTTTTCGGCGCATATAGGAGAACATACAATTAG 
AGTTACATTTTATTTATAAATATATCCGAGAATATCAGAATGTTTGTAGCTTTGTTAACCGATGAAGTTCATGACCACCAGATGAAGGATCACAT 
GAGCGAAGATGAAGTCGGCGAAACCGCGTTCCGGCGAAATGCCGGCAAACACGCTCTTGTTCTGGGGATTGGCCTGCAGGCGTAGGCACACTGCC 
AGGACGAAACAGCTGACGGTGCTGATGAAACCAGATAGGAAGGAGTTGAACGGGAAGGTTCCAACCAGGCAGCAGTAAACAAACTGGATGATGCC 
GGTGAGCAGAATGTAGCCGAGGTAGATGTCCACCAGCTTCAGTTTCTTGGGCGTATTCTGCACGTAGTCGTTGTAGAACTTGGAAATGACGCTCG 
ACAGCTCCACCATTTTGCTATTTTATGTGATATTTGTCCGGATATTTAAGGATAAAGGCGCTTTTTAACAAATTAATCGCAGACACGTCACAAAT 
TGGGAGAGAACTCAAAAGTAGGACCGTTCGTCTAGTTTGAAAATAATACTGATAGCTTTATCGATGAAGGCGCAAGTACAGTGGGCACTCAATAC 
CTTGAAGTTTAATAAAGAATAGGTTTATATATTAAAAAATTTTTGTGTTTTAGTTAAAACTAAAACATAACAAATCTTATAGATTAATGACCGCT 
ATCGATTCTTTTTTAATGTTGCATATGTTTGAGAGGTAATGTTATAATTTAATTTATATAAAAAGAAACACAAATTCTTTTTCTAACCCGTTAAT 
AAAGTTTTCTATCCGATTTTCACTGTACCGCCAAATCAGCGAATTAATCGATAGATACGGAAAGTCATCCTCGATTGTTGTGTCTTGCTTTGCTT 
CTTCAGTGCTTGTTTTGGGGTGAGAGGCAATTGTTATTGCGAGTTGAGTTCGAAGAAATTAGTAACGTATGTATTTAAAACACCACAAAAGGGTA 
TGATGTGCGTTGCAACGTTACAATAAACATTTGCTATCCAATGTGCAACA 
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(SEQ ID NO: 1294) 



Exon: 1001.. 1456 
Exon: 1521.. 1805 
Start ATG: 1142 

Transcript No * CT32729 

G AAAAAGGT C T C GG AAAG T AACGC G AAAT T T AC AGAT GAT G AC AT AT C AGT GC T GT TAAAT GG T C AAAT C AG TGTTGCGTGCTGTTT TAT AT C AG 
AGGCGAATTGAAGTGTACTACCATAATTCTTGCGAGTCTGCTTAAAATGCCTCACAAACGTAGGAACCGAGTACATGCGAACCAAAGGAATTTCA 
CAGCGCGACGCGTTTCTGTGGCCAAACTGTCGTCTGCACCGCCGAATGTGCTCGTCGAATCCTGCAACGGTTCTCATGCGGATAACAGTTCGCCG 
GACAGTCCGAAAGCCAAAGAAGGTGCAATGACCAACGGGAAAAACCCGGAGATCAAGTTAGCCGTGCCCAAATGCAACACCACCGTGCGCAAGCA 
GAACGAGGCCCATATCATATCCAACATTAAGCTGGATATGAACCTTAATAAGGATTCTAACTTTATCGCCCCACAGATCACCAAAAAGATGAACT 
TCGCTCCGAATCGTGTTGTATTCGGGGTCCTGGTGCCTCTCAATGTCAACGATTCTGTGCTGGTGCCGCACAATAGAAAACCGGATGCCTTAAAG 
CACACATCGAAAGAATCGGAATCATGCAAGGCCATAAGCGAGCCCCAGCTAGCCGACTATGCGGAGAAAGTAGATCCCATGATCATGGCTGTTAA 
GGAACCAGTACTCCACTTGGATTGGGAACCAGGACCCTTCGATTTTTTCGGCGCATATAGGAGAACATACAATTAG 

{SEQ ID NO: 1295) 
Start ATG: 142 

MPHKRRNRVHANQRNFTARRVSVAKLSSAPPNVLV 

NKDSNFIAPQITKKMNFAPNRVVFGVLVPLNVNDSVLVPHNRKPDALKHTSKESESCKAISEPQLADYAEKVDPMIMAVKEPVLHLDWEPGPFDF 

FGAYRRTYN* 

(SEQ ID NO: 1296} 



Celera Sequence No. : 142000013384301 

ATGGTAGTTGAAATGCATATAGCTCAAGAAATGAATACCAATAAAATCTCCAGCAACAGATTTATGGAAATCCTAGCGCTTTGAATCAAATAAAA 

AATGTGCAACTGCGTACAAACTTCTTAGATTGTAAATACCCTTCGCAAGAAAGTAACATGTTAATAATAATAAATTCTTGTAATCCGCCACACGA 

TCTGAATACTAATGTTTACCAAGTAATTGGTAATAACATTCTGAATAAGGAAATCACTGGCATTTACTTGTCTACCATTAATTTATGGTATTGTC 

GAACGCTTATGTTTGGTGAAAATATATTAAAATGCCCGCTTAGAATTTATGGCCAGTGTAATGGTTTCCACAGAAGCACTAATAAAATGTCGAAA 

TTTATTCATTTTCACGATGTTGCCAAATTTATAGTGACTCCTATGAAATTTTCTTGCCGTGGATTCCTAGAAGTGAGTGGATACATTCTTAAATT 

TAATTTTAATTTAATTTAGACATTGCCAAGAATAAAAACTTTTCTGATTTGCTCAAATATTAAAAATTGATTCAAGTATTTTTTTAATTCATTTA 

AATGATCTGAATAAAAAGTAAAGTGATTTACAGTGCTAAGCTAGCACCATGTTTTATCAATAAATTCCAGAATGACTAATTCATTCGCTATTCGA 

ATTATTATTTAGGGAATTTAAAAAAGTCTCTCAAACAAAAGGTTTGTTTGTGAATTTTCTTATCAATGGAATTCCGTTCTTGCTGAATCAGTGAA 

GAGCCCATTTCAGATTTACGCTATTCATAATTAGAAATTTCATATATTTTATATGTATCCTTTTCTTTTCTTCAAAATCAATTTTGATTCGTAAA 

GATAAAGTTTTCTGTATGCTAACATAAAAAACATTAATGCATAAAATAGGCCCTAACGAAATATACATTTTTCTGTAGCTGTTGCTAACTTCGCG 

ACCCCCATAGTGTTGGGTGGCCCCAGCGGCAGGCCCCACCTTCTTGCCACTCACCTGTGCAGTCGCTCCGTTTGCTCGGCTTCGTTTTGCAGATC 

ATTGATGGCGAACAGCTCCATCTCGCCGCTGCGCCCGAATATCCCATTGGGGATCTTGCGCAGGAACTTCTTGAGCACGCTGGCGATGGTGAACA 

CCGAATAGTTGTCCACGTTGACGAGCCGACCGGCCTGCAGGAAGTGGATGAGCTTCTTCATTGCGCCCTGGTGGCCGGGTGCGCGGAATACATCG 

CGGCGGTTCGGTGACTCCTTGTTCAGTTTGAGGATCAGGACGAGCAGGGGCCCGGGAATGTTGTTGTTGTGCTTGCACACCTCCTCCAGCGGCAC 

ACCGAATTTGACCTTTTCGAGGCGGTACGGGTAGGCGGGAGCGGTGGCTGGGTGGCCGCAGGAGGTCACCACGTTGCCCAAATCCGCGGCGCGCC 

TATGGACGCGTTCTGCCCAGCTGGACATGGTTTTTCACGCGATTTTCCTGCCTCCCTGCCTTCCTGCCTGTGTTTGTGTGCTTTTGCGTGCGCGT 

GTGTGCGTGTATGTATGTGCTGGCGAGTTTTTCAGCGAGCAGAGGAAAATGTGCTCGGCAGATTTTATGGTTTTATGCTGGCCACAAACATGTTC 

ACTTGACTGGAATGTTAATTTGTTCGCCTGCCCAATATGCGGAAAATATCTTTCAGATTTTCACTATTTATTATTTTTTATTATTTTTTACTTTA 

TTTTCGACACCTACTCTCGCGTCGCAGCCCCAGATTTTAGCCCAAAATGAGCAGCTTTGAAAAGCCAGCTTTGATTAAAAAATCGCCACCACTCC 

TGCTGTATGCATTTTTCTCTTCTCTTCTTCGCGTCTCTCTTTTACACTGCGTTTCTCCGTTTCTTTTGCGCTTTTCTTCTTTTTGGCCAACGCCT 

CGGAGCTGCTATGCCTTTTGCGCCCCTTCGCTTACTGAATTAAACGCTCGCCTATCGCATTTCGCTTGGATTTGTTGAGAACGCGGCGTTCGGCG 

TCACTTTCGGTTAGTTTAGCACTAGTTTAGTTGTGTTATTTCGGGGAGTGTTTAATTATTTTGCTGCGCTCGCCCGCACGCTCGACTTTTTTCGC 

TGTATATTGAAAACGGAGTGTGGACATCAAGCGGTAGCTGCAGAATGCCTGAGGCTCAGCGCTTCGCTGCGAAATACCAACGGAATATACCAAGA 

AATTATACTGCACAAGTACTGTGATAGTAGGGGTGTTCCAGCATTCGTTAAGTTTCAAAATGCCAAAGCCAAAGATAAAAAGATATAAAAAAAAT 

TAACGCACTTATGCAATACCAAAAACGAATAAATACATAATGTTACTACTTTTATCAGGGAACTTGGCTTAACAAAGTTTATACAAATGCACAAT 

GAAATTAAATCTACTTAACTGCTAACTGTTATTATTAGTTATTTCATACGATATGCAGGAGGCAGAAAGTTGCTTATTAAACATATATTTTCATA 

AATATTTCACGAAAGTATGAGAAAATTTCAAAATCGTTTGATAATATACTAATACGCTATTTAGCCACATATATAAGTCTCTAATGTATTTCTTA 

AATAAATCATTTGTAAAGCTATAAACTAAATTCTTTTATAAAATCTACTCCCAAAAAAAACAGCAGCATCAAACCTTAGCGTCTTAATATTAATT 

CAAATGAACACTCATCGCTTGCCGATTCAAAATCGTAAGGTTTTCTTTTAAATTCAAAACTCGCTGGAATGCCGAGACTTGTGACGTCACACGCC 

ATCTTCATGCTGCGCGGCGAGAAACATGCGCAAACGTCACCTTCTCAGCGGCAGACTCAGGTTGTCCATCCCTGGTTCAAGTAATGTCACCCTGG 

TTTTTGTTTTGTAAAGTGAAATTGTTGCAAGTCAAACAACAATAATAAAAATCATTAAAAGCGCGTACAAGAAGTCAAAAAGAATTGCCCACACA 

AGTACGTATTGACGGGCTGTCAGCTTTGCTATATGGAAATCAATTTTCCACTCAACTTTTACTTGGAAATCGGTATTTAATTTTGTGTTGTCTGC 

TGTGAAAGTAACAACAACAAAATCAAGAGTAACGTCAAGAGCAAAAATAGTTTTCCCCATTTTCAAAATTGACAAATATAG 

(SEQ ID NO: 1297) 
Exon: 2121.. 1001 

Start ATG: 1453 (Reverse strand: CAT) 
Transcript No. : CT32814 

CTTGATGTCCACACTCCGTTTTCAATATACAGCGAAAAAAGTCGAGCGTGCGGGCGAGCGCAGCAAAATAATTAAACACTCCCCGAAATAACACA 
ACTAAACTAGTGCTAAACTAACCGAAAGTGACGCCGAACGCCGCGTTCTCAACAAATCCAAGCGAAATGCGATAGGCGAGCGTTTAATTCAGTAA 
GCGAAGGGGCGCAAAAGGCATAGCAGCTCCGAGGCGTTGGCCAAAAAGAAGAAAAGCGCAAAAGAAACGGAGAAACGCAGTGTAAAAGAGAGACG 
CGAAGAAGAGAAGAGAAAAATGCATACAGCAGGAGTGGTGGCGATTTTTTAATCAAAGCTGGCTTTTCAAAGCTGCTCATTTTGGGCTAAAATCT 
GGGGCTGCGACGCGAGAGTAGGTGTCGAAAATAAAGTAAAAAATAATAAAAAATAATAAATAGTGAAAATCTGAAAGATATTTTCCGCATATTGG 
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GCAGGCGAACAAATTAACATTCCAGTCAAGTGAACATGTTTGTGGCCAGCATAAAACCATAAAATCTGCCGAGCACATTTTCCTCTGCTCGCTGA 
AAAACTCGCCAGCACATACATACACGCACACACGCGCACGCAAAAGCACACAAACACAGGCAGGAAGGCAGGGAGGCAGGAAAATCGCGTGAAAA 
ACCATGTCCAGCTGGGCAGAACGCGTCCATAGGCGCGCCGCGGATTTGGGCAACGTGGTGACCTCCTGCGGCCACCCAGCCACCGCTCCCGCCTA 
CCCGTACCGCCTCGAAAAGGTCAAATTCGGTGTGCCGCTGGAGGAGGTGTGCAAGCACAACAACAACATTCCCGGGCCCCTGCTCGTCCTGATCC 
TCAAACTGAACAAGGAGTCACCGAACCGCCGCGATGTATTCCGCGCACCCGGCCACCAGGGCGCAATGAAGAAGCTCATCCACTTCCTGCAGGCC 
GGTCGGCTCGTCAACGTGGACAACTATTCGGTGTTCACCATCGCCAGCGTGCTCAAGAAGTTCCTGCGCAAGATCCCCAATGGGATATTCGGGCG 
CAGCGGCGAGATGGAGCTGTTCGCCATCAATGATCTGCAAAACGAAGCCGAGCAAACGGAGCGACTGCACAGGTGA 

(SEQ ID NO: 1298) 

Start ATG: 669 (Reverse strand: CAT) 

MSSWAERVHRRAADLGNVVTSCGHPATAPAYPYRLEKVKFGVPLEEVCKHN^ 
RLVNVDNYSVFTIASVLKKFLRKIPNGIFGRSGEMELFAINDLQNEAEQTERLHR* 
(SEQ ID NO: 1299) 



Celera Sequence No. : 142000013384429 

AACCCTTATTTTAGCTTTAAAGTCAAAAATTCAACTTATTTCACAGTGTGTAAACAGTTTCTTGACAAACTGTATGTACATAACCTTAAATGTAA 
AGATATATTGATGTCAAATGTTCAATTTCGCTTCATCTTTCAGATTTTTAATAAGTTTATAATTGAGTTAAAATGTCGGATCTGGTATAAATTAA 
GATTAAGATTTACAGATTTGAGCAGATTTTAATTCTTAATTGTTTGATGCTTCTGCCTAAAGCACAACTGTTACATTTCATAGCAAAATTCATTT 
TTTTAAATATAAAGGAATTTTAATTTAGTTATGTTAAGGGGAAATTCAATAACTGTTATATACAGTAGAATTCACCTTTAAGTATAATACACACA 
TACATAGGTTCTATACAACCATGGCAGTTGAATTTCATCATGGCTGCCACGAATTCGGGAATTCCCAGGCATTTCGAAAGTTTATGGAGCATTTG 
ATTGCAAATGGTAAATCCATATGTACAAAACATATGTATGTACATGCATGCATACATCTATTTTTTGCCTACGCCGATATATCGCACTTTGCACT 
TACTTTGTTGTTGTTGCTGATGCGGTTGTTGTTGTTGCTGCTGCTGTGGTTGTTGCAGCTGCTGTGGTTGTTGCAGCTGCTGTGGTTGTTGCAGC 
TGCTGTGGTTGTTGCTGCCGTTGTTGTTGTTGCTTTTGCTGCTGCTGGGGCTGTGGTGCTGCAGCGCTATCCTGTCCTCCTTTCTTGTTCTCTGG 
GGTGGTAATTGGAAAGTGCATATATTTGAATAGCTGTATGGTCCTGCAGGTGGGACAACTCACTATCTTTTTTTCCCATGATTTACTTTTGAGGG 
TTTCGAGTTAACTGACGACTTTTAATAGACGATTTTTTAATGCGTCACAGTCTTTTTTAGTGATGGGAGGCCTTAGGGGTGGGCGATAAGTTATT 
GAGCGGCACAGCTAAAGTGTTTTCTGTTATCGATTATTGATAGGAACATACGGACTAGGCGGGCCATTCAAGTGGGGACTCTATTTATATTTGTA 
TATTTTTATACATGTTTTGTATACATGTTGCTTGTTGAGTCAAATGTATGCTATATTCGTTGAAAAGCATGTATCCGGCAAAAGAAACCGATTTC 
GACCTAATAAATTATATATATTATAATATATTATATATATTATATATATAAGGACCAATAGCCGAGTCGATCTAGCCATGTCCGTCTTTCCGTCC 
ATCTGTCAGTCCGTACGAACGTCGAGATCTCAGGATGAATAAGAAGCTAGAAAATTGAGATTACGGAAAGCCCCCAAAACTTCCCACGCACATTT 
TCAAAATGTTTTGACATTTTTCATATTTTTATTAGTCTTGTAAATTTCCATCGATCTGCCAAAAAACTTTAAATTGCCACGCCCACAGTTATGCA 
AAATGTGATATTTTTCATATTTTGATTAGTCTTGTAAATTTCTATTGATCTGCCAAACTACTATTACTCCCACAAACCGCCGAAAAGCCACGCTT 
ACAATTTTTAACAATTTTAATTTTTTTTCTTATTTTATTCACCAATATCTAATTTGCGTTCGCATTTCCAATAGTTGAGTAACTGGCATCTGATT 
ATC GGG G AAGT T T AAAAT AGG AAAAAAAT T T T AAT T T AC T AGT TT T t T T T T T T AAAT TTTATTTT GG AAG AAGAC AT C AG T AAG AAT C AAT AAAT 
TTAATCACCACTGCTAGAAAGCATTATTTTTATTTTTCCGCCTTTAAGTGAAAAATAGTTTTACATTCCAG 

{SEQ ID NO: 1300) 
Exon: 781.. 533 

Start ATG: 781 (Reverse strand: CAT) 
Transcript No. : CT32815 

ATGCACTTTCCAATTACCACCCCAGAGAACAAGAAAGGAGGACAGGATAGCGCTGCAGCACCACAGCCCCAGCAGCAGCAAAAGCAACAACAACA 
ACGGCAGCAACAACCACAGCAGCTGCAACAACCACAGCAGCTGCAACAACCACAGCAGCTGCAACAACCACAGCAGCAGCAACAACAACAACCGC 
ATCAGCAACAACAACAAAGTAAGTGCAAAGTGCGATATATCGGCGTAGGCAAAAAATAG 
{SEQ ID NO: 1301) 

Start ATG: 1 (Reverse strand: CAT) 

MHFPITTPENKKGGQDSAAAPQPQQQQKQQQQRQQQPQQLQQPQQLQQPQQLQQPQQQQQQQPHQQQQQSKCKVRYIGVGKK* 
( SEQ ID NO: 1302) 



Celera Sequence No. : 142000013384829 

TTATATTCTTACATCAACGAGACTGCTTTGGTAGTTTGTAGTCTGCACTGATTCATCAGCTTTTGAAGATTTCTTTCTGTGTACATTATGCGTCG 
GTCGATGCACAAGCAAATGTCACATCAATTAGCCGCACCGAGGGCCACAAGATGTGGCGATTGCAGCTGGCTTGGCTAAGTTGGCTAACTTTAGC 
TAGCTGGTTAGCAGACTCATAAAACATCAAATTAGCTTTGACTAGGTTGGCTAATTTGAAAAGTAAGAGGCAGTTGGGTGAATTGGCCATCTCCA 
GTTGTGGTTGCGCCGTGGTTGAGTGGTTCAATGGCCGCCCGGCCACCCACACGCCAGCTGTTCGCTGACCTTCGCCGCCTGACGTTGACTTTCCT 
TGATTTCTACACGTATCAGTCGGAAAATGTCGAGCAAATATTGCGATACCAGCTGTGGGTAAAAACAAAAGTCACAGCAGCAGAGCAAATATTAT 
ATAAGCACACACATCATACTGTGTGCTGAGCGTGTTTTCAGTTGTTTTTTATTACTGTCTGATAATCACCTCTGATCCGTTTGTTGAACCGATAA 
GAAAAGATCCATTCGCCTATGGGTAAGTGTCGACCAATTGATAAGCACTTATTTATAGACATTAACTACTCTATAAATAATGTCATCAGTCATTC 
ATCATAACTCCTCCGCAATTAAACAGAGTGAAATGGGAAGCAATCACTGTCTCGGAACTGTAATCTGGAGTAGTACAATAAGATCAAAGCCAAGA 
AAGTAGGAAATCGATAGTTAACCATTGGCCATGCTGTTCGTTCAAGTTACTGTGGCAGCGGCAACAGCAACACTTCAAAATTTGCTGCAATTTGC 
TTTAATTATACGCTCCGTTGGACGAAAAAAACAACAGCAACGAACAAATAGGGAATGTCGATCGAGATTGCAGACAAATTTAGGCCATACAAAGC 
TCTACTTTGTTTTCGCTTTTCCCGAAAAATTAACCCACAATTTTCCCCCAATTGCAGAAACACCAACATAGTATGCGTCTCTTCAAAACGCGCAA 
ATCCACGGATACCTACAGCACACTAGCCGCGCAGCAACAGCAACAGCAGCAGCAGCAACAACAACATCAAGCGGAAGGCAGCAACATTTCCCACA 
GCAGCAACAGCAGCAGCAACAAGAGTCACACACCGGCAACATGCAGCAACAGACTGAACAAGAGCATTGTGAGCAGCACCAGCATATCGTCATCG 
CTGCCTGATCTGCATGACAAGTCGCCCGTCATGATCCTCAGCTGCACCACCCTGGCCAGCAATGGAGCCACCGCCACGGCAGCGGTCACAGCAAC 
AGCCACCGGCACAGCAGCAACATCTGGCGGCTCGCTGCAGCAGCAACAACAGCAGCATCTGCAACACCAGCAGCAGCAGCAGCCGTTACGCACGG 
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CCACGCCCACGTGTCTGCTGAGTGGCCGTCAGACGCCATCGGCCATATCGGTGATGTCGCTCCAAGAGGCCACCAGTCTGCACCGCCAGCAACAG 
CAGCCACACCAGCCACCCACCATCTAC 

CTATGGCAGCACCAGCAGCATCGCCAACCTGCAACAGCAGCAGCAGCAGCATGCCGCCCAGTACCAGCAGTATGTTGCACAGCGGCTGCACGCCG 
CTTCCAGCAGTTGTTTGTACGAGAAGGGGTCGAATGCCAGCGGTGGGGCGAGCAGCAACAAAAGCAGTCTATCCCTGACCCCAAATGGTAAGTAC 
TTTACTATAAAGTGAAGTAGAATACTCATATTACTAATGGGGTAGCCCAGGAGTTGTGCTTACATTAATTTTTATTATTGGCTGATAATATCATG 
GGTAAAAACTTGCAATCATTTAGCTGCCAGTGGCAGTTACTTTCCGTATCCGTATTCACCGATTTCTCAAAGTAAGTCATACTCTTCAGTACACA 
AAAAGCTTTTGCTACATTGTTGGGCGCAGTGAACAGGACATGTGATGCACCTTGTGATAACATGGTGATGAAGAAATGGCTCTTATCTTCGCAGA 
CATTCACGAGCCTCAGCCAGTGATACTTGTTTTCCTCCTCAACCTCCCAAATGCTGCAGTTACTATTGGCGCCCAATTCCAGTTGATTTCCAGCA 
GTTATATAGAACAATTCCTTCTTGTGTTCGAGACACAAGCGACGAGTTTTCACCAGAAGCCTCAGAGGTCTTGGTAAACTCTGCAACATGTTTTG 
GAGATCATTGTTCCACTGTTCTGGATGATTACTCTTGAGGTTCCTCAATTCCCGTGCAAAGTTAGAGAGAAGAATCTCGTTTAAAAGACGAAAAC 

GTCGTAGATGCCAATGAAAGCTCATCCAAAGCTTC 

TCTAAAATATCCGACCAGGGAATATTCCATGCCACGCGCTGTTGCTGCTCTTCATTCAGTGATAAACTGATGTCCTTACTCCACTTATCGAAATC 

CAAAGGGCTTGCATTTAATTGTGTAGGCACAAGTCCAATCAG^^ 
G^CAAAAGTATTATGAGAGCAATGTAAACGACCATTGCGATGGAAAATCTTA 

TGTGGGATTTCCGGGTATTTGGCGCTTATCGCTGGTTTTTTGATAATTCCTTGATATTCAGTGGGGCCAACGCCCTGGTTTTCATTTCCTCAAAC 

CCAACTGATACCAATTCCTGTTCCCACAGGTCACTTGCCCGACTACAAGTTGGTGACAGCGATGCCAGTTGTTGTCCTGGACGATGAACACAAAT 

CCAATTCATTGCCGGCCACTGAAGCGAGTCGCAACAGCAACAGCAGCAGCAACATGAACGGCAGCAGCAACAGCAACAGCCTTGACGTCAGCAAC 

AGCAACTCGCATTCGGGGAGCTCCACTTCTTTGGCCAGCACCACGAGAAATGGTAAGTACTTTGGCCAAAAATAGAAGTGAAATCCGGGCAAATG 

CCTGGAAATTGATTTTCCCTTCAAGTGGCGGCGCCTCGAGGGTTTTTATTCCCCGTGGAGTATATTTTATTTTGGGGCACATTCAACCGAGAGTG 

TTGGGTCAATGGAGAGGGGGAAAATGAGCCCATAAATTATAGAGCGGCAGAGGGTGTGTTGTCAGCCAGGTATTAAGAGCTCAATTTGGAGCACA 

TTTTAATTGAAACAATTAATTTTCCATGGCTAAACATAGGAAACAGCACTCCCAGTGCTTCCGTGTTTAGCAAAGGCAAAATCAGCGCCAATCAG 

AGTTGCCAATTGTCATCGATACGATGCGTATTGTGTCGCATGGCACATACTCTACTCCAAAATACCGTATTGACCCGATTTCGCATCAATGCGCG 

CCGCCAACATCGGCGCTGTAATGCGATCCCATGTCTGCAACTGGAATTTGTATTGTTTTCAGCCGGGCACGAGTTGGTGCTGCTGTGGTCTCGAT 

TTTGAAGAGGAGCCCACCCTAAAAGCCTAAACTACACAGGGAGAAACCCTACTTGGGAACTTGGGAATAAGTCATAAATCTGCT 

T AT T AT AAAC ACCT T T C T T T AAAAT GAT TAT AC AT G AC AC AG AT T T T T C T AAT ATT AT C AG AAAT AAATGT T GC T T C CAT AC TAT T C T T T C AGT G 

TAGAATATCTCCCAGCCAACAGCCGACCAAGTGTGACAAAGCTGGGCATTATATGTCATTCAATGC^ 

GAGTTCAGTTTAGTTGTAATGATTTGCACAAAGAGCACTTTATTTACTTTTATTTTTGCCCGGTTATGTGATATATGATTGAGGTTTTCGGTTGT 
ATAGATGGGTGGAGTTGTGAGTAATAGATATGGGTATTAGAGCGGCTAGTTAGATAAACGAGGAATTGAGGGCAGGAAATGGCTGATGATGTGAT 

T G GC AAAT CAAAACATG AG AAAAT AT G AGT 
(SEQ ID NO: 1303) 

Exon: 1001.. 1797 
Exon: 2880.. 3115 
Start ATG: 1023 

ATTGCAGAAAC^ 

AGCAGCAACAACAACATCAAGCGGAAGGCAGCAACATTTCCCACAGCAGCAACAGCAGCAGCAACAAGAGTCACACACCGGCAACATGCAGCAAC 
AGACTGAACAAGAGCATTGTGAGCAGCACCAGCATATCGTCATCGCTGCCTGATCTGCATGACAAGTCGCCCGTCATGATCCTCAGCTGCACCAC 
CCTGGCCAGCAATGGAGCCACCGCCACGGCAGCGGTCACAGCAACAGCCACCGGCACAGCAGCAACATCTGGCGGCTCGCTGCAGCAGCAACAAC 
AGCAGCATCTGCAACACCAGCAGCAGCAGCAGCCGTTACGCACGGCCACGCCCACGTGTCTGCTGAGTGGCCGTCAGACGCCATCGGCCATATCG 
GTGATGTCGCTCCAAGAGGCCACCAGTCTGCACCGCCAGCAACAGCAGCCACACCAGCCACCCACCATCTACGTGCCGGTGCCTACGAAACTTGG 
CAACAATGTCAACACTGGCAACAGCTCGGCCACTCTGCTGCTCAGCTATGGCAGCACCAGCAGCATCGCCAACCTGCAACAGCAGCAGCAGCAGC 
ATGCCGCCCAGTACCAGCAGTATGTTGCACAGCGGCTGCACGCCGCTTCCAGCAGTTGTTTGTACGAGAAGGGGTCGAATGCCAGCGGTGGGGCG 
AGCAGCAACAAAAGCAGTCTATCCCTGACCCCAAATGGTCACTTGCCCGACTACAAGTTGGTGACAGCGATGCCAGTTGTTGTCCTGGACGATGA 
ACACAAATCCAATTCATTGCCGGCCACTGAAGCGAGTCGCAACAGCAACAGCAGCAGCAACATGAACGGCAGCAGCAACAGCAACAGCCTTGACG 
TCAGCAACAGCAACTCGCATTCGGGGAGCTCCACTTCTTTGGCCAGCACCACGAGAAATGGTAAGTACTTTGGCCAAAAATAG 

(SEQ ID NO: 1304) 
Start ATG: 23 

MRLFKTRKSTDTYSTLAAQQQQQQQQQQQHQAEGSNISHSSNSSSNKSHTPATCSNRLNKSIVSSTSISSSLPDLHDKSPVMILSCTTLASNGAT 
ATAAVTATATGTAATSGGSLQQQQQQHLQHQQQQQPLRTATPTCLLSGRQTPSAISVMSLQEATSLHRQQQQPHQPPTIYVPVPTKLGNNVNTGN 
SSATLLLSYGSTSSIANLQQQQQQHAAQYQQYVAQRLHAASSSCLYEKGSNASGGASSNKSSLSLTPNGHLPDYKLVTAMPVVVLDDEHKSNSLP 
ATEASRNSNSSSNMNGSSNSNSLDVSNSNSHSGSSTSLASTTRNGKYFGQK* 
(SEQ ID NO: 1305) 



Celera Sequence No. : 14200001J Jb4by J 

AGTGTTACGGATACTGGGAAATGCAATTACGGGATAGTGGATTTGGGGTGGTGTGCTTGGTTGCTAGTGCTTGTGTGGCTTCGCCGAAGGGCTTG 
GTCTTTGATCTTTGGCTTTGGCTTATCTTATCGTATGATTCTGGGAATACTGGTTTGGATGGATCTTAATGGCTGGCTGGAAATAACAAGTTGGT 
TATATGTAGTTTAGATATCTCATTCTTTGTACAGGCTTCCCCATACGGGCACAACATCGAACAGGCGACACAAAGCTATAACATATGTGTACATT 
TTCGCGATTCTAGACAGAGATAGTTGGTAGATTGGTGTAGTTTAGTCAGTGTTTAGTACTAGGCGGGAAGTTAGGATAGCTAGATAGTGAGTCAC 
ACTACGCATTAACATATGTGTTATTTTGTGGCACATGTAAAGAACCGAACGCAGGCATTGGTTTCATCGATCGCATTGCTGATTAACTTTCCAGC 
CGAACTGGAAGCCTTGACGCGAACAAGTGCAAGGTTATTAACTGACCCCAAGTGGGCGTGGTGGTGGGCGGTGGGCGCTACTTGTGCCAGCACAT 
GGAGAACCCGAGTTGTGTGCAAAAAACACTGCTCAACTCGTTGGGAACACCGAAAATTAGTTTTTTATGCAGAAATAGTGAAGGAGAACACACAC 
ACAAACACACGAAGGCGTACTGCGCCGTGTGGTTAGCAGAGGGATTATAGTGGAAAAGTGAAGTTTTCGACGGATGCACACACAACACATAACTA 
TATTTGCACACTATTTGTTAATTACCAGCTCCGGCTTATATGTTTTTTTTTTTTTTAAATCTCGAACTCTACGGGCTAAAATCGAGCAAAGTTGA 
CGAATAAAAGTGGATACACACCCAAACTAGTGATGTGCCGCGCGGCGATAGTTCGGGTGCGTGAGCGCACCCGCATACCATCGCTAATATTCGCG 
CAGAAAAACATACAATACGTATCAGTTTTTAGTTCCATGCGCTCTGCAAAATGATGGCCAAGGACAAGGGTCTGGCTGTGGCCGGAACGGCGGCC 
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CCCTTACCGGCCGCCCTCGACCACGAGTGGTCCGCCGAGGAAGAGCTGCAGCTGTTCCATGCAATGGAAGGCCTGCGCCCGGTGGGCATCAACAA 
GCACTTCTACATGTCCTGCATTGTCCAGCGTCTGTCCAAGTCGCTGAACCGTGAAATGCCCAGCGAGCTGATCTGGCGCCACCTGGGCACCATGT 
ACAAGCTGAAGGAACTGGACGATCTGGAGTCGCTGCCCTTTCCCAACGAGGAGCGCGAGTTCAGCCTGCCGGAGCAGGATTACGGAACGCTACAG 
ACCAAAAAGACGGTGGAAGTGCATGCCAACGAGGAGGCTGCAGAGGCACAGTCGGCCCCTCCAACAGCAGAGACCAAT 

agttatcatcaggattcattcacttgtaatgttgttttacagctccacctgcaaccaaaacgccggtaccgacgaattccaccaaggacggcgac 
aaaaIgcccgtcgccaagccccaggagcagctgcccaagc 

ta^cgccgccggtgcagagcaacaagcgccgtcgcatataggtgacccccgcccgaaactgacgattgttgatccgttttagctcattagttt 
tactgtactctaataaaaggatttattcttcgaaatcgttgtcctcctcgttgga 

atgttttccaggagcgatttgagaaggcgtcggttgagatggtcggtctgcgtcagttcccggactgcctggggcggagcattctcgtctgccgg 
tgcagggctgttctcagctggagtcggagtctctgccatggctttgatgtgctccaaaacaggcatttttcccccgagaaatttcagcttttgct 
atgacacacagtccgctgggtgactcgtgcacttggaaacttggctaagtgtagggtacactcgataaagtatcgataagttggcgttttacgtc 

agccctagttccacccctaagtcgatcgacttccaattgga 

aaatgtttgttcccgaaacggaggatatgctgcccagactagcgcccaggccgagtgcagcagtgcccatgggccacaccaatgagattataggg 
cccaccgtgccggaggtatccatattgtttggccagccgccgcaggatccgcagatgcagccgcagcagccaaatcatgcacatcaggagtcatc 

ccctcgtgSgcacccaccttcgcgtcctggaagaagcaaatg 

ccgctcccagcagcaatagtccgtgctcctcgaaggaataccaagtaatgccccgggatcgcaactatgcaagcgatgcgcagcagac 
catatgaactgcaatgctggtggctataatgccgagcccaagcaacaggcatatcaatctaggcgcttgtccagctcctccaacgcagatgtc 

(SEQ ID NO: 1306) 

Exon: 1001.. 1415 
Exon: 1468.. 1658 
Start ATG: 1001 

mgatggccLggac^ 

GCTGTTCCATGCAATGGAAGGCCTGCGCCCGGTGGGCATCAACAAGCACTTCTACATGTCCTGCATTGTCCAGCGTCTGTCCAAGTCGCTGAACC 

gtgaaatgcccagcgagctgatctggcgccacctgggcaccatgtacaagctgaaggaactggacgatctggagtcgctgccctttccc 

GAGCGCGAGTTCAGCCTGCCGGAGCAGGATTACGGAACGCTACAGACCAAAAAGACGGTGGAAGTGCATGCCAACGAGGAGGCTGCAGAGGCACA 

gtcggcccctccaacagcagagaccaatggcaaagctccacctgcaaccaaaacgccggtac 

ccgtcgccaagccccaggagcagctgcccaagcgtccggccaagcgcacacgcggctccatgtccaacgagtccatcagcccctccactacgccg 
ccgccggtgcagagcaacaagcgccgtcgcatatag 
(SEQ ID NO: 1307) 



Start ATG: 1 

IMAKDKGLAVAGTAAPLPAALDHEWSAEEELQLFHAMEGLRP^ 
EREFSLPEQDYGT L QT KK T VE VH AN EE AAE 
PPVQSNKRRRI* 
(SEQ ID NO: 1308) 



Zj J_J XS \s _LJ .L jt X nil J_J Vui J— 1 1 \ J- •* ^j-i-i^*"*-*- " — - 

^EAQSAPPTAETNGKAPPATKTPVPTNSTKDGDKKPVAKPQEQLPKRPAKRTRGSMSNESISPSTTP 



Classification: hypothetical 

Celera Sequence No. : 142000013384547 

GCACTGTATGGAGTTGCAGTTGCACTCTCACTCTATAAGATGGCCATTATGACGACGATGATGATGACTATGATGATGGCTCAAAAAAAACAGGA 
CATTGTGAGGTGCGGGGCGCTGCAAAAAGGATAATACAACTTTCTCGCTCTATTTGCCACGCACACACATGTTGTTACAGTGCCGTGATTTTTGT 
TGCATCAAATTGGAATTTTTTCCTTAAATATGGTAAACATCGATGGGGCAGAGATTTATGTATAGTGCAGCGGCGCGTTGATGAAAATGACTCGC 
CGCTGCATTGACCGTGTTTACTGTTTGTCAGACAGGGCAGCAATTTACGCCCACATGCZVAAAAACATTAATTTGCATGCAAGTCZVATTATCAGGA 

GGCAGCAAACAGACAGGCGCTAGTTACCAGTTTTCAGTATTCAGT^ 

CATTGGTGTTATTATTCCGCCATAAGGATGCAAGGACTAATGGAGATACTCTTGAAATGGGTCATGATATTCCAAAAGAATCGCATATGGCGCAG 
AAAGATTTATTTTGTTTTTCATTAAAGTACCATACAAACCATTAATTTAGCTCGCTACTCTTATAGAGCATCTGCAGGTTGTTGAGCTCAACGTG 
GACACTCAATGGCGAGGACTTTCAAACTGCCAGGCCAGCACATTTCTAAATGTTTTTATTAATTAAAAATATTGCAAATTAAACACATTTCCAAA 
TTGCGTTTTGCATTTTTCGTATAAATACGCGGTCGATGGAAAAAATGCAAATATAATTTACAAAGCATACATTTCACATTTTATTCTTATTTTAT 
GTTCATTGCATTTTGGTTTGCTCTATCGCCGCTTTeGCAGCCATCAGTTCTTAAACTTGCCTCAAAGCTCTTAAACACGATTACTACCTGCTAAT 
TAGCTTAGAATACCTATGTTTTGGGACCAGTCTTGGGTGCATGATGGGGTCTACCTGCTTTTGATGGTCACCTCGCACTTGGCAGCCAGATTGCG 
GAGGGTGAGGAACTGGCCGGATGCCCTAATCTCCGGCCAGCTGCACCGGAGCGAGTGAGCAGGTCCCAGGAATCGGTGGCCCGGAGTCTCGATGA 
CAGTCTCGAAAGTGGGTCCATGTGGCAGGTGGTCTAGGTGCCACAGGAAGACGGCGTGCTCATCGCCGCCGCCGATCGCAATGACAGCCAAGAAA 
AATGTGTGCACTCTGACGTGGCCAAAAGTGGAGGACTCCTCGCTCAGCCGCAGGCAGGCCACGCCCTCCAGGGGCAGCTGGGGGCGGACGTGCAA 
GCGTTCGCCGAACGTGCCGTAGATTTGCAACACTGCCAGCCGGTGGTCGCCGCTCAAATGTCGCAACAACTGCCATTCCCGCCGACCTATTGA.ee 
CCGCCTCCAGTGGCAGCTCCTGATTTACTCCGGCTGCCGGCGCCTGTTTATGGCTCCCCGCTGATAAGGATGCACCTCCCGAGTTCTCAACCACC 
GCCAGCGCCTGCATCGCGACTGTTGACAATTTGTTGGCCACTGGCTGGCCAAGTTGGTAACTGTGGCTGCTGCATGATTTCCCAGTTGGGCAATG 
ATAATTTTGATACTCGCTCTCGCTGCTTGCTGACTTCTCATGCCCCAGCTGCTGCCGCTCCTGCTGCTGCTGTTGCTCCTGCACCTGGACACCAT 
TATTTATGGTGGCAACTTCATCGCAGCGCAATCCGTCTTGATCTGGCCCAGTAATCCGCCAACTTTTCTTACTCAACTTTAACTTTGGTTTAACG 
AGCATATTGTTGTGACCTTCCTCCGAAAAGTGCTCGACAGCGGCCTCCTGATGCTGCACTTTTATCAGCGTGCAACGTGGCAGGACAGCCTCCTG 
CACCGCCTGCTCCTGCTCCTGCCCCTGCTGACTCTTTCTTCTGATTTCCCGAGGACTTTGGTCCACCAAAACTGTTTCGAGTTGCCTGGAGATTT 
GCTTGCCGCACTTGGACTTGCCGGAACTCGTCTTCGCCAGCGCCATTTTCGGTTGATTATGATATTCATTTGTGCACTTGTTGACGCTGCTCAGC 
TTTCCATGGCCCTGGGAGGCAGCCACCTTGTTGGTTTTCCCTGAAAAGAGGCACCGAAACGGAGGACCATTACAAAAAGTACGAAATGTAAATAT 
AATAAATACAACTGGGACAACTGGATGGATATGCACGCAAGTTGTCCATTTGCGCTCGTTTGTGTTTTGCCCCCAAATCCCTTGACACAAACTCC 
CTTGAACTTCTTCAAGTGGCAAATGAAAATAGAGCAGAAACTAAGTAAATATCCGCGTACCTGCATCTGTATAAAAATGGTTGAACTATGGCTGA 
ACTATGGGTGAATCCGGGCGAATCATAAAGAATGCACACTTAATGTAGACGCAATAAAAACGCACTAAGCACGGCATAATCTGGGAAAAAACAAA 
GCGAAGGACCCTGAAGTGGGTAAAAGTGGCAAAAAATGAAGACCAGCAGGGAAAAACAAATGCAGACAACGGCGGAAAATTCTGTTGCTGCTTGT 
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TACCTCCGCTGTAAACTTCACTCCAAGTTAACTTGGTCGACACAGATTGAAAATGGCAACAAGGGCGGCTAAACTTTAAACGAAGCCGCCCAGTG 

GC AAAG C AAGAAAAGC AAC GC CAAC AAC AAT CT T C C G G C AAG AT G GC AAC G C AGC T AAGC G C G AAAA AC T T T AGC AAT T T ATG T T AC AT AAT T AT 
GCCATAAAAAGAACAACAAGCGAAGATGAGCCATTGCGCTCAATTTCAAGCACTCTTGACTCGGCTCCGCTTTCTCGTCACATTTT^ 
TGCGCCGCATCTCGGTCGAGATTTAAGTGAAACATTGACACTTGCCGCTGTCGAAAAGTTATGGCCCACACAAATGAGGATTCCCCGCTCCCAGC 
CAGGATGTTGCCATTACCGCAGGATGTATTACAGCGGGGCGGGCACAAAGGTTATTTCCACAGGGTTTTGCATTACATGAAATAGAATACCTTTC 

G 

(SEQ ID NO: 1309) 
Exon: 2041.. 1001 

Start ATG: 2041 (Reverse strand: CAT) 

I?gg?gctggcgaagacgIot 

AATCAGAAGAAAGAGTCAGCAGGGGCAGGAGCAGGAGCAGGCGGTGCAGGAGGCTGTCCTGCCACGTTGCACGCTGATAAAAGTGCAGCATCAGG 
AGGCCGCTGTCGAGCACTTTTCGGAGGAAGGTCACAACAATATGCTCGTTAAACCAAAGTTAAAGTTGAGTAAGAAAAGTTGGCGGATTACTGGG 
CCAGATCAAGACGGATTGCGCTGCGATGAAGTTGCCACCATAAATAATGGTGTCCAGGTGCAGGAGCAACAGCAGCAGCAGGAGCGGCAGCAGCT 

GGG GC AT G AGAAGT C AGC AAG C AGC GAG AGC GAG TAT C AAAAT T AT C ATT GCC C AAC T GGG AAAT C AT GC AGC AGC C AC AG T T AC C AACT T 

AGCCAGTGGCCAACAAATTGTCAACAGTCGCGATGCAGGCGCTGGCGGTGGTTGAGAACTCGGGAGGTGCATCCTTATCAGCGGGGAGCCATAAA 

CAGGCGCCGGCAGCCGGAGTAAATCAGGAGCTGCCACTGGAGGCGGGGTCAATAGGTCGGCGGGAATGGCAGTTGTTGCGACATTTGAGCGGCGA 

CCACCGGCTGGCAGTGTTGCAAATCTACGGCACGTTCGGCGAACGCTTGCACGTCCGCCCCCAGCTGCCCCTGGAGGGCGTGGCCTGCCTGCGGC 

TGAGCGAGGAGTCCTCCACTTTTGGCCACGTCAGAGTGCACACATTTTTCTTGGCTGTCATTGCGATCGGCGGCGGCGATGAGCACGCCGTCTTC 

CTGTGGCACCTAGACCACCTGCCACATGGACCCACTTTCGAGACTGTCATCGAGACTCCGGGCCACCGATTCCTGGGACCTGCTCACTCGCTCCG 

GTGCAGCTGGCCGGAGATTAGGGCATCCGGCCAGTTCCTCACCCTCCGCAATCTGGCTGCCAAGTGCGAGGTGACCATCAAAAGCAGGTAG 

(SEQ ID NO: 1310) 

Start ATG: 1 (Reverse strand: CAT) 

MALAKTSSGKSKCGKQISRQLETVLVDQSPREIRRKSQQGQEQEQAVQEAVLPRCTLIKVQHQEAAVEHFSEEGHNNMLVKPKLKLSKKSWRITG 
PDQDGLRCDEVATINNGVQVQEQQQQQERQQLGHEKSASSESEYQNYHCPTGKSCSSHSYQLGQPVANKLSTVAMQALAVVENSGGASLSAGSHK 

QAPAAGVNQELPLEAGSIGRREWQLLRHLSGDHRLAVLQIYGTFGERLHV 
LWHLDHLPHGPTFETVIETPGHRFLGPAHSLRCSWPEIRASGQFLTLRNLAAKCEVTIKSR* 

(SEQ ID NO: 1311) 



Celera Sequence No. : 142000013384809 

CTTTCATTTCTTTGGATCGGACAGCGTGTGCAACGTGTCACACACGCACAGAGTCTATACGCGTCTCACTCGCACTTGTCAGAGTTGCACTAGAG 
CTGCTTTGGTAGGCGCTGCCGCGTTTGCCACGCCGATAGGAGGATAGTACCTCTTAGTTTGGACGAACTTTCTTTTCAAGGAGTCTGCGCACTGA 
AATCCTAGCCACGAGTTTCATCCCAGAGCCGCAGACCAATCCGACTTACCTGCTTTACAATGCCGGACAACGACTTTGTAGGCGCTGGCGGGGCG 
TCTTTAGCTGCTCCGGCAGCTGTCGCGGTGTTCGCTGCTGTTGCCATTTTTTTATTGAATGCGGGGGAGATACTAGCAGAGGGCCGTGGCACTAG 
GCTTCAATAGCGGAATATTGCGTTGATTGCGTTTCGGAGCTAAACACCGCTTTCAACCTAATGAAGTTTGACCAAGAGAAGAACCGCTCTCAAAA 
TCTAAAAAATTTTCACTGAAGTGACACTGCTAGCTAGAGCTGAGAAGTCAGCGCGATAGCATCGATATTTTCGTGACACGCTTGTCATCCGATAG 
GTAGTGCTAAACATTTTTCTTAATTGCCAACTTAATTTACAAAAAATATGAATGACGGTAAAATTGAGAGGGATTTGGGATCCTTTATATTGGGA 
ATTTTTATATATTCATATTTATTTTAATTGTATTTATGCGTAATTTATCAGACGAACTTTGGAATTTCCAATAATTCTAGATTTTTTTGTTTCAT 
TTCAAAACTAATTTTAATTCGCTTAACGAGTGCATATCTATATTATGTTTACTTTAAATTTATCTGTGGGGGCAGCCTTAGATGTTCACATTGCA 
GGCCATAGTGTCGATAGATCCAGGCGCCAAAAGAGTATCGCTATCGATAAATGGTCGCCAGCTGTTCGGCGAATGTTACGTGATTTCCTATTTGT 
AACATTGCTAAAAATCGAGTAAAATGAATCAAGCCGATGTAACAAAGCTCATGTCACAGCTGCGTATAGCGGTCAGGCCCAACAAGCGCCATCTG 
AAGAATGTCGATGGCCCGGAAGGTCGGCTGCTGAAGCTGCGTAAGACCGTCACTGCGTTGGTGAAGCACGAGCGTATTGAACTGTTCTACAACCG 
TGCCGATGAAGCTCGTGGCTACGCCGAACTGGTAAGAGACCACCGGCGATACGCCAAAGGCTGGAATCCAAATAAAGTTTCCTCCCCTCGCAGCT 
CATTTCCAATGCCATACGCCATGGGGACCGACACCAAGCCACCATGGAACTGGCGGACTATTGGCTGCTGGAAAAGCAACTGGTTCATAAGCTCT 
TCAAGGTGCTGGTACCGCGCTATGAGACTTACAATGTGTCCTACACTCGCATGTACAAAGCCCCTCGGGAGTATCCTGGCATTTACTACAGGCGG 
TCTGTGCTGGAGCTTCGGGGCAATCCCTACCCTTCGCTGGCGGCGGATCATTCTCAGAACCGGAATCTGCTGCACAATGTGCTTCTGGACGAGGC 
GAGAAAGGAGTTTAGGCGTCAAAAACTGTCGGAGTTAGTTCACTAATAGCTCTTTTCACTAACCGCCTGTTTATTTAGTAGATTAGCAAGCAGTT 
TGTCGCTTAGAATTCACGTAAGACTTAAAAGCATCTTTATTATAAATGTATGTATACACAAAACGAATTGTTTGAATTAAGAGTTGAACTTATAC 
AAACGACTAAAGTCTACTGACATTGCAGTGATTGAAGTAATCCGGTTTCCAATCTACCGAGTTAGAGAATTAGCTTTAGAATGATTCAATTTCTG 
AACGGATAACTTACCCTTTCCCGTAAGGCAGGCATCCAGCAACATTTCACAAATGCTGTAGAAGAATTGGTGCTCATTTGTCGACGATGCGCAGA 
CAGGCAAAAACGGATCACAATTGGTAATAGCACTACAACCCGAGTTGCTGCTCAATACTGGCAGACATACGGACAGCGCGAGGAACATTATGATG 
TTGAAGATCAGGAACCAAGCCAAATAGATATTCATTTTGGATGAATTCGCAGAATTATCATTCACTAGCGTATTAAATGACATCGTCACCTAAAA 
ATTATTTCATAGCAGCAAACAGATTGGGAGCGAATCTTTTTTCGCAGATCAATCTTTATATGTATATATTTTATTTATTTTGAAATATGATCCGT 
TAAAGTTCCTCTTTACGTTCCATTCGGTCACAGAATGTAAATCCGAATTTGGCTGCCGACATGAACAGCCATCTAACAACTGATAAGCACTGGGG 
GACCTACGCGCTGATGCATACTAATTCGCTCATGTTCCTGAACCCCAAAGTTACCATGGAAACGAGCTCGGAACACTGACCAATGAATCAATGAT 
AACTTCACCTCTATTTTTCACCGGAAGCTTTTGTTCGTTTCCTCCCAGACTTCGGTATGAATGGGGTTGCTCAACTGACTGGAACCCAGCGTTCT 
TTGTAGTTTCGCGTACATTTAATCCGGCCGGCCCCGATATGTTTGTAAACTAGTGCAACTTTGCGGATGAAATGGTGTTGGAGAGCATGTTTGAT 

A 

(SEQ ID NO: 1312) 

Exon: 1001.. 1171 
Exon: 1234.. 1566 
Start ATG: 1001 

Transcript No. : CT33407 
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ATGTCACAGCTGCGTATAGCGGTCAGGCCCAACAAGCGCCATCTGAAGAATGTCGATGGCCCGGAAGGTCGGCTGCTGAAGCTGCGTAAGACCGT 

CACTGCGTTGGTGAAGCACGAGCGTATTGAACTGTTCTACAACCGTGCCGATGAAGCTCGTGGCTACGCCGAACTGCTCATTTCCAATGCCATAC 

GCCATGGGGACCGACACCAAGCCACCATGGAACTGGCGGACTATTGGCTGCTGGAAAAGCAACTGGTTCATAAGCTCTTCAAGGTGCTGGTACCG 

CGCTATGAGACTTACAATGTGTCCTACACTCGCATGTACAAAGCCCCTCGGGAGTATCCTGGCATTTACTACAGGCGGTCTGTGCTGGAGCTTCG 

GGGCAATCCCTACCCTTCGCTGGCGGCGGATCATTCTCAGAACCGGAATCTGCTGCACAATGTGCTTCTGGACGAGGCGAGAAAGGAGTTTAGGC 

GTCAAAAACTGTCGGAGTTAGTTCACTAA 

(SEQ ID NO: 1313) 

Start ATG: 1 

MSQLRIAVRPNKRHLKNVDGPEGRLLKLRKTVTALVKHERIELFYNRADEARGYAELLISNAIRHGDRHQATMELADYWLLEKQLVHKLFKVLVP 

RYETYNVSYTRMYKAPREYPGIYYRRSVLELRGNPYPSLAADHSQNRKLLHNVLLDEARKEFRRQKLSELVH* 

(SEQ ID NO: 1314) 



Celera Sequence No. : 142000013384809 

GC A CAT AC A C AC AAGT GCACGC ACT GC AGT C GAT TTTGTTTTTTT T GC AT AAAAC T T T AT T T CAT GTATGCCACTTGTACATTGTTTGT C ACC AA 
TTTGCGTATTTTGTACATCAAATTTTATTTTACTTTTCGTTGCCTTCCTTATTATTTTCCTAATCTTCACATGCACTTTTTTAGCGACGCCAACA 
CACACATTTGCACACGCACACACATGGCGAGGATGAGAGAGGAGAAAGAGAGCAAAACACCGTTTTCGTTGCTTCCTTTTCGCTTATTCCTCTTT 
AGCCGCTTCTTTTCGATTCTGGCTCTTTTTGGCACTTTTAGCCCCAAGTTGCTAGCAATTCACTTAGCGTACGAGCACTAACAATTAAAACGAAT 
AAGACTCGAAAAATACGATGGAAACGCGAACATATAAACAAAAATCTACATTGGTTTTATTGCAGCGCTACGCAGGTGTGTTTTTACGCGCTCAT 
CGCGCAATTAAATTAGAACGCTGCAGATTTACTTAGTTCACTCCAGTCTGCCCAATGCGCGCCGCACACCTCGGAGCCGCAAATTATAAACACGA 
CTGTCTTCGATTTAACGGGCTGGCCTATCGGTCCTATCGATGACTCGATAGTGCGAGCTGGAGTGTGACCATTTCTTGGTAAAAGCAAAATCGTG 
AAG AGT AAGT GT GC GAT ACT AT C G AAC T GT C AT AT ACTC AAC C AAAT AACAT C T G AAAT C T GT T T C T C AC T AAAACC GAAAT T TC C AT C AG GGT T 
AGGAAAATATAGTTTACGCACATCAAGTTGCATAGGTCAATCCTACGTAAAAAAGGCTCGATATAGGTAAGGTGGACCTCAGCCTGAACAGGGCC 
TAATGCAAATACATTCGATAAATAGATGTTATCGATAACCATTTGGTATATACCAGTAAATGCTTTGTTTTGGTTTTCATTCAGAAAATTGACAT 
ACATTTCTTAGTCTGCCTAAAGTTCCTTGGATTGAAAAGAGCAGTGCAGAATGTCGCACCGCCACCAGGACATTCCCATCCGCCCACGTTACGGT 
TATGTGGCCGAGGAGAGTGGCCCATCGGAAGGGGCAGCCAGGAGTTCTGGATCCGGAGCCACGCCCGCCTCCTCCTACACGGAAATTCCGTTCCT 
GGCTCAGTATCCGGGAGCGGTGCCAGAGCACGTGCTCCAGGATCTGACGCCCACAGCCACTGGCCATGCTGCGATTCCCGGAAGCACAAAGACAA 
GACCCAACGGCGAGAAATACCTTAACCTGGATTCGGGCGAGGATCCCGACGAGGAAGACGATCCCTTGGAGGAGGAGGACAACAGCAACTCGAAT 
AGCAACTCTAAGGGCAGTTCCGGCGACATCGAAAGGCACAGACTGAAGGCAGAGAGCAGTGGGTTCTAGGTTATCTACCCATAGTTCCTGACTAA 
AGCATCTTCCTTCTTTCAGCTGTGCCCTACGAGCTGGACGGCGAAACCAGTGACTCCGACGGGATGCGCCACTTTGTGGCCCACGATCTAGAGGC 
CAAGTTGCGCGAGCGCGTGGCGGCTTCGGCCTACTCCTCGGAGCACACCACGCCCCTGAGCTCCATGGGTCCCATTGGAGCCACTTCCGGCAACG 
GTCTGCTCACGCGGCGGTTCCTGCAGTCGCGCAACATTCCCGAAGTGGACGGCAGCGTCTTGAGCGACATCGAACTGGAGGCTCAGTACTTGGCC 
AGCTCGGTGGACAATCTGATGGAAAACCTGGGCAACCTGCTGCACTCAATCTCCTCCATAACGGCCGACAACGTGGAGGTCCATCGCAACGCGGT 
CAACAAGCTGACGGACACTTTGGACGCGAATATCAAGTGCCAGTACCAATTGCTGGCCAAGGCGGAGGAAATCACCAAGTCGATGAAACCCACAG 
AGCAGCTGGGTCAGCGCATGTAAGTGGGTCAATTCAAGACAGATCCCTTTAGTGTAATGGACTTTCTCTTCCGCAGAAGGCAGATCAAGCGGCTG 
GTGGACATGTTGGACAGCACCATGTAAAATAGGGCTACGCTATCCTGTGATCAACCAATCAAAGAGATCTTGTCTTTCTAGTCATTCCGTAAATA 
CATATAATCAGTGTTATAAAATCCATATGGAAAGATACCTTTCGCTTGTTTAAATCGTACTTATTCTAGCAGTTCCTCCAAAAAATCCACCTCCT 
TGGCTCACAGCTTCTCCACCTTGGCCTTGATGATCCTTAGGGGATCCACGGGGCGATCGTTCTTGTCGGTTTCCACCATGCCGATCCGCTTGACC 
ACCTCCATGCCCGTGTAGACCCTGCCGAAGATCGTGTGCTTGCCATCCAGCCACTGCGTGGGCGCCAGCGTGATGAAGAACTGGGACCCGTTGGT 
GTCCGGTCCAGAGTTGGCCATCGAGAGGATGCCCGCGCCCGTGTGCCTCAAGTCGCCGTGCAGCTCGTCCGCGAACTCCGATCCGTAGATGGAGG 
CGCCTCCGCGGCCAGTGCCCGTGGGATCGCCGCCCTGGATCATGAAGTCCCTGATGATCCGGTGGAACACCACGTTGTTGTAGTAGCCCCTTCTC 
GAGAGTTCGGCAAAGTTGCGGCAGGTGTTGGGCGCGTGTTTCCAGTACAGCTCCACGGTGATTTCTCCCATGCTGCGCGGGGCACAAATGCCACA 
AATAATGTGATAAACAAATGCTCAGTCTTCAGTTTCGGCGATTATCGAGCATTCAAGCTCCGGACTCACCTCGTTTCCAGGGTCACGAAGTGCGG 
CTGCCACGCCTTGTCGGGAATGCCACCAGCATTGTTGGGATCGCTTAGGGACAGCATAACTATATTTGTTGTTAAATTAGCGAGACTTCAGTAAA 
ATTGTTAAAACCGGCAGAGTCAGAGTGACCAGCTTAAGCGTAGTCACTTAGGCCGCCTGGCAACCCTGAACTGCTGCGGTATACTTCTGGTATAT 
ATATAGTATAGTATATAGACGCACCTTGGTATTTTTTAGGGTTTTTGTGAGCACACGCAAGGGTATTTTTCAATTAGTTCCATTTCAAAATTCGA 
CGAAAAAATCGTGGAAAATATTTAGCGGACGGAATTTTTCGTCGGGCTCGCGAGAAAA 
(SEQ ID NO: 1315) 

Exon: 1001.. 1388 
Exon: 1445.. 1919 
Exon: 1977.. 2098 
Start ATG: 1001 

Transcript No. : CT33418 

ATGTCGCACCGCCACCAGGACATTCCCATCCGCCCACGTTACGGTTATGTGGCCGAGGAGAGTGGCCCATCGGAAGGGGCAGCCAGGAGTTCTGG 

ATCCGGAGCCACGCCCGCCTCCTCCTACACGGAAATTCCGTTCCTGGCTCAGTATCCGGGAGCGGTGCCAGAGCACGTGCTCCAGGATCTGACGC 

CCACAGCCACTGGCCATGCTGCGATTCCCGGAAGCACAAAGACAAGACCCAACGGCGAGAAATACCTTAACCTGGATTCGGGCGAGGATCCCGAC 

GAGGAAGACGATCCCTTGGAGGAGGAGGACAACAGCAACTCGAATAGCAACTCTAAGGGCAGTTCCGGCGACATCGAAAGGCACAGACTGAAGGC 

AGAGAGCACTGTGCCCTACGAGCTGGACGGCGAAACCAGTGACTCCGACGGGATGCGCCACTTTGTGGCCCACGATCTAGAGGCCAAGTTGCGCG 

AGCGCGTGGCGGCTTCGGCCTACTCCTCGGAGCACACCACGCCCCTGAGCTCCATGGGTCCCATTGGAGCCACTTCCGGCAACGGTCTGCTCACG 

CGGCGGTTCCTGCAGTCGCGCAACATTCCCGAAGTGGACGGCAGCGTCTTGAGCGACATCGAACTGGAGGCTCAGTACTTGGCCAGCTCGGTGGA 

CAATCTGATGGAAAACCTGGGCAACCTGCTGCACTCAATCTCCTCCATAACGGCCGACAACGTGGAGGTCCATCGCAACGCGGTCAACAAGCTGA 

CGGACACTTTGGACGCGAATATCAAGTGCCAGTACCAATTGCTGGCCAAGGCGGAGGAAATCACCAAGTCGATGAAACCCACAGAGCAGCTGGGT 

CAGCGCATAAGGCAGATCAAGCGGCTGGTGGACATGTTGGACAGCACCATGTAAAATAGGGCTACGCTATCCTGTGATCAACCAATCAAAGAGAT 

CTTGTCTTTCTAGTCATTCCGTAAATACATATAAT 

(SEQ ID NO: 1316) 
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Start ATG: 1 



MSHRHQDIPIRPRYGYVAEESGPSEGAARSSGSGATPASSYTEIPFLAQYPGAVPEHVLQDLTPTATGHAAIPGSTKTRPNGEKYLNLDSGEDPD 

EEDDPLEEEDNSNSNSNSKGSSGDIERHRLKAESTVPYELDGETSDSDGMRHFVAHDLEAKLRERVAASAYSSEHTTPLSSMGPIGATSGNGLLT 

RRFLQSRMIPEVDGSVLSDIELEAQYLASSVDNLMENLGNLLHSISSITADNVEVHRKAVNKLTDTLDAKIKCQYQLLAKAEEITKSMKPTEQLG 

QRIRQIKRLVDMLDSTM* 

(SEQ ID NO: 1317} 



Celera Sequence No. : 142000013384809 

TATGGCGAAACTTTGCCGCAACCGGAACGGATGGCGATGTAGCAATGGCGATGTGGCGTATCCAACGCTCCATAGACCCGGTTGTGTAATCGTAG 
CGCCCTCAATCGACACTGCGACAAAAGTTTAATACACTTTCTACATATTTCCGATATGTCTTGACTTAGCAACTTCATAACTTATAATGTAGCTA 
ACAGGTATATATATTTATCTTAAATACTATATAACAAAACCAACAAGCGATTACGTTTCTTTCAGTGCACTTACATATATGCTCTGCAATCTGTA 
GGTGCGAGTGTGATTGTGAGTGTCCACCTTCTGGCACCGCAAGGTCCCTCACCGGGATGCTGCGCTGCTGGCATCCTCTTCCCCCATATATCCTC 
AGCCCTTCGGACTTCGCCCTCCGCAGCCCACCTCCGGGCTGTACTCCCATCTAGAATTGATGTAGCACCTACGGTGCGCCCTCTCCATCTGCCCA 
CTTCCGGAGCTTCTGTTCCACGCACGTTCAACTTCAACTTCGTTGCAGGTCATTCGGACTCGTTCAGTCTCATCCCACATCCCACCATCGCGTAA 
AGTTTTAGAGGACGAACGTAAAGTTTTAGCGCTGATTGTTAGTCTCGCTGGGGCGAGTGGAAAGTTGCCTTATAGCTTGGAAACCTGGTCAAAGA 
CCCAGATAGCCGTGCCATGAGAACAGATATATGGGTGCATCACCTTGACAATGTCAATCTTTGTTACTATTCAAGTGAACGGTGGGTAAACGTAT 
TCGTTTTGAAACATGTTATrCAAGAACAATTACATAATATTTATATTCATTTTATATGCGTGTTTGAAGCTTGTATTTGATTTACAAATAGTTAG 
TTTACGCTTGCTTGTTTGTTTTAAATTAAATACACATTTTCAGTGAAATAACGCAGTTTTCATAAAGCCTATCTTATACCCTGCAGAGGGTACTC 
AAAGATCTAAACATACTTATAACCTAAGGCACATGTTTTTGGTTTGGTACTTATTCTAAGGATTTGTCCATTTCCTCACTCTTTTTCGCAGCGGA 
CTCAAAGATCTTCTCCAGCTGCGTGAGCAGGCCAATGGCTCGATAGTTCTCCTCCATTAGGACGAATTCCTCCAGGGGCTTCAGTAGACGCAGTG 
CGTCCTTGTAGCTTTCTACGCAGCGCGGCAGATGTGCCTCATCGCTGGTCTCTGAATAATGACCGGGACTATTGGCTGGAAGTAGTTCCGTATCC 
TCAGGCAATGACTCATTGAGCGATTGAATCTTCTGAATCTTCAAGCCACATGGTATTGGGCCGCCGTTCTCGTCTGATTTTCGTTTAGTAGCCCT 
TTCTGGCAAATCCTCATTATCATTATGCTCATCGTCATTATCCTCATCATCCTTTATTTCTATGAGCTCAGGTGGCACTTCAATGGCCACGCAAT 
CATCGTCTTCTTCGTGCTCAAATTCTTCGGGTTTTGGTTCCACCTGTGGGCATTGGTCCACCTCTGGCTTGTCGCTCTGGCTCAGTTCTTCCTGC 
GACTTCTTGTGGTCCAGCGTGATCTTATGTTCATTCCGGCCGCAATAACTAACTATATCTCTTAGGTCCAGGGAAAGCGGCAAGCGGCGGTTCGT 
CACCTGAACTCCCACCGTGAAATTAAAGTTATAGCGACTCTTAAAGTGCTTCAACCACACTTTGTCGGGAAAGAAGGAGTCGATTTTGAGGATGT 
CCCTTAAACTAATGGCCACAAAGCGCAAATTGTCTAGGTCAAGACGACCCTGGCGAACCAACTGTGATCGCTGGATGTACTCATAAACGCATTTC 
TCCAGGAACGCTAACTTTCGCTTTCTACGCAGGTGCACTTCATCCGAGGTGGCTTGATTTTGGCTAGAGGTGGTGACCAATGTCTTTGTACGATA 
AAGAGGTCGATCACTGCTAGTGAGCGCAGGTTCCGGCGATCTAGGCGCTAAGCTACAAGACATCGAGGTGTGTCGTCCACAG7ACGACATAATGT 
CCCTCAAGTCCATAGATCGTGGTGGTCGGCGGACTATCGTGATCTGACGATTCGACAGTGTTATATTATAGGCGGCCTTGAAGTGATTGATCCAC 
GGTTTGTTCGGCACAAAACCATCGAGTTGCATGAGATCCCGGAACTCGATCGCACGGTTGCGGATCATCTCCTCGTTCACATCGCTGTGGAGGTA 
AAACTGGGCCCTCTGGATGTACTCGTACAGGCAGTGGCCCAGAAAGTTCAGCTTGCGCCGGCGCAGCTCGGCATCCTTATTGGCCGGCTGGCTGA 
GCCGATTGGTGGCCTTGTGGAACTCCAGGATGGCTTCCCGATTGGAGACTATGCTCTTGATTTGCTCCCAGCTGCAATTAAAGTTCCTCGCAATC 
TTACTGTACATGGGCACAATGTCGTACTGTCGGATGGCAGCCACACGCTCCTCCAAAGTCAGCACGTTCCGTATGCGATGCGATCCATTCGAGTT 
TTTGTGGGTGTCATCATTCTGAAATTGAATCATGGTTTATTTCTTTATAAAATCCGATTTCTAGGCTTAGTTACATGTATGTTATTGATACAGAA 
CTCAAATTATCTGTATGCGTGTATATCATGAATGCCTTATACATATTTATGTATGTATATTTACAAAACATGCCCTCTTCTTATTCGCAAATGGG 
TCTCGCATATTTTAGCTCAGAGATAACAAATTTCAAAAGATTTTACTAAAACGAAGCATTGCACTCTCACTTACTCACTACGAAAGTATACTTAC 
GTCAGAGTACATTTCCACTCCCGCTCTCACACTTTCAGGTTCTCCACTCTCTCTTAGCGTGCTCGCACTCCCAGTCTGGCCCCCCCTCCTCACGT 
GACCAATATAATTCTGTTCCTTTAACAATGAAATTAAAACGAAATAGTTATTATACAGAGAAGAGCTAGAAATCTTGTTTACATACCTCAGTCGG 
AAAGCTTAGAGCATAAAGTACACTTTGTTGAAAATATACAGCAGTTTTTTAGGCACATTAAACAGAAAATATGTGCGGTCGCACGTGTTTAAAGT 
TGCCAGAGCGAGTGCAACTCAATGACTGAGCATGCGGTCCGAACATTGCAGAGATCTGGACTCTCCTCCGTTTGGCGGAGCGGCAAAAACCGTAT 
CTTTGCGAGAGACATTCCGCTCTTTCTCTCGCGATCTAAACCGACTTCCCACGTTCCGATTCCGATCGTGGTGGTATTCGTTCGCGGTATGGGAT 
TTATTTTCGGATCGAGAGTAAATATTCTCTCGCTAAAGCAGCTATAACTGGTTTAACATGCCGAAAAACACAAATCGCATTACCACCTAATCTGA 
GCTGAAACTGGTTTGATATGTACACATACATATATATAAAGCTATTTTTGCTCAAATTATATCTAGTGCTTAATTTAACGTTTACCACATTTAAG 
TGGGACCCTGTTGGTTATTCCACTCTATAAACTCTTTTCGCATATTACAATTATTTACACATTTTGGGTCTTGACTCTCAAAGGCAGGACACACG 
AACATTTTAATTGTAGATTACATATAAAAACTAAACTTTATTTTTGTGGCGTCAATAAGATCATAGATCGAAATCCTCGTTCTGAGGCGGATTTA 
AAACTGATTTTTCAACTTCAGTAAGTAGTCCAATGGCCCGATAGTTATCATTTAACATGGCGTACTCCTGTAAGACCTTCACGTAGGCCAAGGCC 
ATATTTTTACAGTCGATGGATTTCAGTGAATTTTGTAAGGAAGCGACTACTTTGGGGTCCAC 
(SEQ ID NO: 1318) 

Exon: 2767. .2756 
Exon: 2488.. 1001 

Start ATG: 27 67 (Reverse strand: CAT) 
Transcript No. : CT33430 

ATGTACTCTGACAATGATGACACCCACAAAAACTCGAATGGATCGCATCGCATACGGAACGTGCTGACTTTGGAGGAGCGTGTGGCTGCCATCCG 
ACAGTACGACATTGTGCCCATGTACAGTAAGATTGCGAGGAACTTTAATTGCAGCTGGGAGCAAATCAAGAGCATAGTCTCCAATCGGGAAGCCA 
TCCTGGAGTTCCACAAGGCCACCAATCGGCTCAGCCAGCCGGCCAATAAGGATGCCGAGCTGCGCCGGCGCAAGCTGAACTTTCTGGGCCACTGC 
CTGTACGAGTACATCCAGAGGGCCCAGTTTTACCTCCACAGCGATGTGAACGAGGAGATGATCCGCAACCGTGCGATCGAGTTCCGGGATCTCAT 
GCAACTCGATGGTTTTGTGCCGAACAAACCGTGGATCAATCACTTCAAGGCCGCCTATAATATAACACTGTCGAATCGTCAGATCACGATAGTCC 
GCCGACCACCACGATCTATGGACTTGAGGGACATTATGTCGTACTGTGGACGACACACCTCGATGTCTTGTAGCTTAGCGCCTAGATCGCCGGAA 
CCTGCGCTCACTAGCAGTGATCGACCTCTTTATCGTACAAAGACATTGGTCACCACCTCTAGCCAAAATCAAGCCACCTCGGATGAAGTGCACCT 
GCGTAGAAAGCGAAAGTTAGCGTTCCTGGAGAAATGCGTTTATGAGTACATCCAGCGATCACAGTTGGTTCGCCAGGGTCGTCTTGACCTAGACA 
ATTTGCGCTTTGTGGCCATTAGTTTAAGGGACATCCTCAAAATCGACTCCTTCTTTCCCGACAAAGTGTGGTTGAAGCACTTTAAGAGTCGCTAT 
AACTTTAATTTCACGGTGGGAGTTCAGGTGACGAACCGCCGCTTGCCGCTTTCCCTGGACCTAAGAGATATAGTTAGTTATTGCGGCCGGAATGA 
ACATAAGATCACGCTGGACCACAAGAAGTCGCAGGAAGAACTGAGCCAGAGCGACAAGCCAGAGGTGGACCAATGCCCACAGGTGGAACCAAAAC 
CCGAAGAATTTGAGCACGAAGAAGACGATGATTGCGTGGCCATTGAAGTGCCACCTGAGCTCATAGAAATAAAGGATGATGAGGATAATGACGAT 
GAGCATAATGATAATGAGGATTTGCCAGAAAGGGCTACTAAACGAAAATCAGACGAGAACGGCGGCCCAATACCATGTGGCTTGAAGATTCAGAA 
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GATTCAATCGCTCAATGAGTCATTGCCTGAGGRTACGGAACTACTTCCAGCCAATAGTCCCGGTCATTRTTCAGAGACCAGCGATGAGGCACATC 
TGCCGCGCTGCGTAGAAAGCTACAAGGACGCAC^ 

CTGCTCACGCAGCTGGAGAAGATCTTTGAGTCCGCTGCGAAAAAGAGTGAGGAAATGGACAAATCCTTAGAATAA 
(SEQ ID NO: 1319) 

Start ATG: 1 (Reverse strand: CAT) 

MYSDNDDTHKNSNGSHRIRNVLTLEERVAAIRQYDIVPMYSKIARNFNCSWEQIKSIVSNREAILEFHKATNRLSQPANKDAELRRRKLNFLGHC 
LYEYIO^QTCLHSDVNEEMIRNRA^ 

paltssd^pSrtkt^ttssqnqatsdevhl^ slrdilkidsffpdkvwlkhfksry 

NFNFTVGVQVTNRRLPLSLDLRDIVSYCGRNEHKITLDHKKSQEELSQSDKPEVDQCPQVEPKPEEFEHEEDDDCVAIEVPPELIEIKDDEDNDD 
EHNDNEDLPERATKRKSDENGGPIPCGLKIQKIQSLNESLPEDTELLPANSPGHYSETSDEAHLPRCVESYKDALRLLKPLEEFVLMEENYRAIG 

LLTQLEKIFESAAKKSEEMDKSLE* 
(SEQ ID NO: 1320) 



r<=lera Seauence Ho. : 142000013384692 

CACTTAGACACTTCGCCCATCGTTCTGGCCTGTCAGCAGAATCGCCAGCCAAGACGACACTCGTGTATCGAGTGCAATAATCATCACTGCTCCCC 

CACGAGTGCCGAATTTCGAATGCCGCCTTTTCTGTGGACCTCAGCACTTTGACATTGCCCGTTTCCCTTCCAGGGCGTTGCTCGCCAAACGTGGC 

TAATCTGCGCCCTTCGATCTCCAAGGCATTTGCTGAAAATTTATTAAGCTCTGCCAAAATGCATGGGCAGGGTGCTCAAAGTAGTTGAACCAGTT 

GAAATGGCTGAAATGCTACTCGAAATAGAGGATAATAGAATAAATCTTTAAGTTCATAAGCTTTGATTAGGCATACTTATATAAAGTGCTAAGTA 

TTAATCACATTCCAATCATAAATCATTTAAATAACTCGAATTAGGCTTTTACTTAGCMTTGTATATATCGMTTTCCCAAGTTGCATCTTTCAA 

TTGGTTCAAATTGGCATCGTTGAAAGTCTTGAAAACTTTACGCCCAAAAGAATTGATTTGTTTTCATTTCATTTTTGCTGTGTTCCTCAAGAAAT 

CATTGAGCCGGGCTCCACTTGCCGAAACTTTCTGATAACTCGTAACTTGTTACTCAGCGCACACAGCACTAATGAAGACGGCACACACACAACAA 

ACACAATAAGGGGTGTCTTGTTGGCTGTTTGTTTTCCACTGGCCAAGAGCAGAGCACCGGTCACCAGGGAGCAGCAGGCTGATTCAAGTGTTAAA 

ATCAATGGGATTCACAGCTGAGTGCAGCTGAAGCCCCTAAGGGGGCGGTCAAGACGTTCCTCGATTTAATTGAATAATTGTGAGGCAGGCCACGT 

TCGCCGAATTCCTGGTAAATCAGCGAAAATTCCGTAACTAAACAGAAATAGAAGCCAGTCGTCCAGAGAGAGAGAGAGAGCGCCCCAAAATAGGC 

AACAAAGATGGGGCACAAAAGCCCGGGAAATCTTCAGCCAGGAGACTGCGTCAGGCTTTGTTTTTCGGTCCCGATGCAAATGCAGCTGCAGCTGC 

AACACCTGCCGAAAACCCCGCCAGCAATTTATGCGCCGCATTTGAAATTCGGTGATACCAAATCCTCGAGACTTACCGTTCCCAAGTGGCACATT 

CGTTTGCGTTGTTGTCGATTGTTGCTGTTGCTGCTGCTGCTGCTCGCTTGTCACCGCGTTGGACAATTTATTTGATGTTGTCGCTGTGGCCGGAA 

CTGGCACCGCTGTGGTTGTGCTCGTCGTTGCCTCCAGCTTGCTTGCTCCATTAATGGCCACCGTGGTGGCGCTTTTTCCCAGCCTTAGTATGGTG 

GTGGCCGTAGATGATGTTGCTCCGGCTGATGAGGCTCCCCCGCTAGTGGCTGTTGTGGCTGCTGCCGCCTTGAGCTTGTGCAGCGCTCCTGCCAC 

GCCTTTGGCCGCCTTCGTGGTGCGCGTCACGCTTAGAATACTTGAGGTGGCAAGACTGTTGCCATTTGGTGGCGAAGTGGCCATGGAAGTGTTGG 

TTGCAGCGGTCGACGCCGTGGAGTTGCGTATGACCAGTATGGATTGAGACTTCTCCAGACGCTTCAGCGTTTGCAGCACAAAGGGTGACTTGCTG 

TCGCAGTTGCTCTCGGAACTCAAGCTGCGTGTGGCCTGTAAAAGTGATGGAACAAGTTGGTAAAATACAACACAAATGTCACTGCACATCTCTTA 

CATGTTTTAGCATAAAGGGTCTCTCCACCATCTTGGTTTGCGTGGCGTTGGCCTGCTTGGAACGCTCTTGGGCCTTTAGAACGCTCTCCAACTGA 

CTGGCATTGATGGGGGTTTGCAGAAGGATCGTGTTGCTACTGGCCTCCTTGTTGTTGCTGTTGGCCACAATGGAAATGGTGGAGCTGCTGCTGCT 

GCTGCTGCTGCCGGCTCCACTCGAGCCCTGCACAATGGTCTTTCTAGCCACCGTTGTGGCCGTGGGCAGCTGGTTGAGCGTAATGGCCGTGGAGG 

AGTGACTTTTCTCCTGCTGCTGCTGCTGTTGTTGTTCGCTGAGCAGGCTAAGAATGTCCTGCTTGTTCTGGATGATGATCTGGCCATTTTCGTTG 

CGTGTGCCGCGCAGCAGGATGATGTTGCCGCCACTCGTGGCCTGATGCTGCTGCTGGTGTTGCTGCTTCTGCTGTTGCATGGCCCCGATCTTGAG 

CTTGGCCGCTCGCAGAGTCGTCGTTGTACTGGTTGTTGCTGCTGGCTTTGCTGATTTCGCTTCATTGCTGCTTAGCTTGCCACTGCTCTGCGATC 

GGATCAAGACATTTTTGGAGGTGATTTGGCCTAGAAAGGAATAACAAAAGTAAATCAGTCTTAAATAATCAATTCAATGTACATACAATATATAA 

AAGGTATTTATCCAGGTAAAACAACAAAATTAAAAATGTACATCCTAGAACGTCTGGATGTATAAAAAGAGTAGATCTTATTTATTTATTTGTAA 

GTTATCTAAACAGCGTGTTGGTTTCATTTTTGGTGTAAGTGCTCCTGTAACATTTTGTTCTCGATTTCTCGAATTGAGCAATTTCGAGTGGCCCC 

AAAAAACGGACAGCCCCATTGTATGCAAATTCAATGGAAACTTCATAAGCGGCGCAAGATTTATGACTCCATCTTGATCGAATTTGGAAAGCCTT 

GCATTAGAAGCGCATGTGACAGGGGAGGCAGGAGGCGTGGCTGCCATTGTTGCACTTTCCCAGGGTCCGTAAAAATGAACTGAAAGGGACGTCGA 

GCATTACGTATACGCGACGTATGCCTAGAGCGGTCAGTGGCTCGGACACTCTATTTTGTTGGCCAACAAATGGGGAGGATGGGCGGTAGCCAAAC 

GCTAAATCAAGCGAAAGACGACATGTTTATGGCAGGGACGAAGGACTACTTAGCCGACGAGCCCCAGCACAAAACGCAATAAAAGTGCAACAAAC 

AAACAGCGAGTTTTCAGCCAAGAAAGGAGGAGAAAGTAATGGGGTAGGTTCACGGGTGTAGGCAAATGGGGGTCTCTGCCCTGGCATGCGTTGCC 

AAATATAGACTCTCAAGGCTGAAGGCGACACGGAACGTAAAGTGCGCGCCGGAAACATTTCGAAATGTAGCCTAAACACTTGTCCGGCTCCCCTG 

CTTTTCCCGCCTCCCCCCGCCGCCGGCATCCAGCACCCTTTCCTGGGTATTTCCACACCCCCAAAAACCATGTGTGTTTGTTTTCGCAAAAACGT 

GCGGGTCATAATTAGCCGTTGACGCCGCCTTCAGTCGGCTGTGCTTTGGGGAAAAGCGGAAAAGCAGAAAAGCGGGCAGGGGAAAGCGCCACATG 

CGACCATCTGACCCCAAGTGGACAAGTGTCTGCGCCCCCTTCGCCTTCCTTGGAGTGTCTGTGCAAAAAGGTCGCCTGCCGCCAGGCAAAATAAA 

CCGCCTGAATTGGACCAGGATCGGAATAGAATAGGTGGGGCCACTGGCTGTTGGCTGGCCATCCATTCGAGCGCACA 

(SEQ ID NO: 1321) 

Exon: 2497.. 2482 
Exon: 2310.. 1712 
Exon: 1650.. 1122 
Exon: 1050.. 1001 

Start ATG: 2497 (Reverse strand: CAT) 
atgaaaccaacacgctgS 

GCCAGCAGCAACAACCAGTACAACGACGACTCTGCGAGCGGCCAAGCTCAAGATCGGGGCCATGCAACAGCAGAAGCAGCAACACCAGCAGCAGC 
ATCAGGCCACGAGTGGCGGCAACATCATCCTGCTGCGCGGCACACGCAACGAAAATGGCCAGATCATCATCCAGAACAAGCAGGACATTCTTAGC 
CTGCTCAGCGAACAACAACAGCAGCAGCAGCAGGAGAAAAGTCACTCCTCCACGGCCATTACGCTCAACCAGCTGCCCACGGCCACAACGGTGGC 
TAGAAAGACCATTGTGCAGGGCTCGAGTGGAGCCGGCAGCAGCAGCAGCAGCAGCAGCTCCACCATTTCCATTGTGGCCAACAGCAACAACAAGG 
AGGCCAGTAGCAACACGATCCTTCTGCAAACCCCCATCAATGCCAGTCAGTTGGAGAGCGTTCTAAAGGCCCAAGAGCGTTCCAAGCAGGCCAAC 
GCCACGCAAACCAAGATGGTGGAGAGACCCTTTATGCTAAAACATGCCACACGCAGCTTGAGTTCCGAGAGCAACTGCGACAGCAAGTCACCCTT 
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TGTGCTGCAAACGCTGAAGCGTCTGGAGAAGTCTCAATCCATACTGGTCATACGCAACTCCACGGCGTCGACCGCTGCAACCAACACTTCCATGG 
CCACTTCGCCACCAAATGGCAACAGTCTTGCCACCTCAAGTATTCTAAGCGTGACGCGCACCACGAAGGCGGCCAAAGGCGTGGCAGGAGCGCTG 
CACAAGCTCAAGGCGGCAGCAGCCACAACAGCCACTAGCGGGGGAGCCTCATCAGCCGGAGCAACATCATCTACGGCCACCACCATACTAAGGCT 
GGGAAAAAGCGCCACCACGGTGGCCATTAATGGAGCAAGCAAGCTGGAGGCAACGACGAGCACAACCACAGCGGTGCCAGTTCCGGCCACAGCGA 
CAACATCAAATAAATTGTCCAACGCGGTGACAAGCGAGCAGCAGCAGCAGCAACAGCAACAATCGACAACZVACGCAAACGAATGTGCCACTTGGG 

AACGGTGTTGCAGCTGCAGCTGCATTTGCATCGGGACCGAAAAACAAAGCCTGA 

(SEQ ID NO: 1322) 

Start ATG: 1 (Reverse strand: CAT) 
MKPTRCOITSKNVLIRSQSSGKLSSNEAKSAK^ 

LLSEQQQQQQQEKSHSSTAITLNQLPTATTVARKTIVQGSSGAGSSSSSSSSTISIVANSNNKEASSNTILLQTPINASQLESVLKAQERSKQAN 
ATQTKMVERPFMLKHATRSLSSESKCDSKSPFVLQTLKRLEKSQSILVIRNSTASTAATNT 

HKLKAAAATTATSGGASSAGATSSTATTILRLGKSATTVAINGASKLEATTSTTTAVPVPATATTSNKLSNAVTSEQQQQQQQQSTTTQTNVPLG 

NGVAAAAAFASGPKNKA* 
(SEQ ID NO: 1323) 



Celera Sequence No. : 142000013384576 

CGATGGACGAGAACGTGCTGTTGCTTTACTTCGGCCAACTAAAACTGCTGGCAGGTGAGTAAAGTTGTTTTATACATACTTCAAGTAGATATGCT 
AAGATATATATACTTCTTGTAGAAAAGACCAGAGACCAAGCATTCATCAACAATCCCGACGCCCTGATTGTGGTTCAGGATATGCTCTTCATCAT 
GCGCCGCATTTGCTATTGCACCAAGGGAATGGGTAAAGAGAAGAACTCTCGTGGCGAGGGCGACAATATCGATGAAGGTGAAGATGCGTCATCGG 

AGC C AG C AAC T CC AC C AACT AG AC C AG AAGC AGG C G AGGC AAC AAGC AG AGG AAG AGGGC G AAAGG CGG AC AT GT C CGAGGAGC C AGT AT GT AAA 
TCTAACTGATTAACTTTTCCCCCTTGAGCTAAATGTTCAAATTCCTTTGCAGCTCAAGCAACTGGAACGCTGTCTTCGCTTTGTGGAAGAAACCC 
ATCGCAACATAGT GCCCGTCATGAGT CCCGAGCTTAGTCACAACTT TGAATCCTTTTGCCGAGCGATGGCCATGAG ATTCCCC AACT ACATT GAG 
TTCGCCCAGCCCGCGCAATTTTGGCGGCAGTTTCGATCATCGGCGAAAGCTCCCAGAGCGAGCAGAAGCAAGCGACGACGTATGGAGGAGGGCGA 
AGTCGAGTCCCATGAGGAGCACGAGTCGGAGAGCGACAGTGACAGTGAGCTGCCGCTGGACAGGCACAAGACCACGCCGACTGCCTCCCGTTTGT 

C C C AAAGGAAAAC C AGC T GC G AT AT GC TT GT C AC CG AGC T GAT T T T CC AGC C AC C C GAT T GGT AGT T T C AAT T T TT GCGACC T T GC AAACT G AAT 
TTCTCAATTATTTAGTTTTACACAAATCGAATATTGTATTATTTTGCATTATAAAATGTACATATGTATGTATCTATAAGCCAAAAATGGAATCA 
AGTTATTCTTGACCTTGAAACTATTTAATAAGCCATTTTGACGTTTTGAGTTAATAATAATCCTTGCTGTACTCGCATATTTTCTCCAAAATGTT 
CTTGTAGCCGCGCATAAACTCTTCGAGCACCAGCTGGCACACTTGGAGCTCGTTCAGGCCCAACTGTTCGGCTTCCGTGGCCTGGATGAGCATGC 

AC T C C AGC AG AGC CGCGTTCACCGGCT GC AC G AAG AGAT T C C GAT C CC T GC G AT CGT C GT G C C GC AC C GG AAT G AT C AG AT T G AC G GGC AT CGG A 
GGTTCCGCCGGTTGCTTCACTGCCCTGGCTGGAGTGGCGGCAGTCGCTATGGGTCGTGAGATGGGTATGGGGTTAATGTGACTACTAGCCATCGG 
TGGTTTGCCAGCCACTAGGGTATACTTTGGCGGCGTCATTAGCGCCTTGCCGGGCAACGCCAAAGTGGATGCCGCCACCTGGAGCGCCTTGCCCT 
GGACATCGATGCCCGCCTTGGCAGCTGCGGCTGCTGCCTTGGCGCTCATCGCAGCCGCCTTCTGCCTTTTCACAGCGCTGGAATTCTTGGCCACG 
GAGTTGGGGTCCACCAACTCGCGCACCTCTGTGTTCCTGCAGCCTGCATTACAGGGTATATGAGTGCCGTAGGGTATATTTATGCTTATAGAAAT 
GAAATCATACCAACGCATCGACAGAACTTGGTGCAGATGGCCATGGACTGATAGCAGTCGCAGTAGTTCTTGATGCACTGCGACCGCTTGCAGCA 
GCATCCCTTCACGCCGCCTGCTCCTTGACCCTTGCCCTTCTTGCCATCCGCTTTATCCACACTTCTTTTCTTGGGCGATGGCATGTTGCTGCCTT 
AATCTCTTCCAGAAACGAACTGATGACAGCTCCAACAAAAAAAATCAGTGTTGAAAGGATGGCAACCCTTATACATATAAAAATATATATGTATA 
AATAAAAC AGC AAT ATTGTGGGCACT AAACT AT TAAAGCTT AGAT AAAGTATTTATATGT CGT ACTGGTAATAAAGTTAAAATTTGTTAAGTACC 
AATCGTTTAACAAGTGTCTAGTTTATATCGTTATAACATAGATAATTTCCTTTGGTGTCTACATAATATGGCATGGCATTAAGTTATGATATTGT 
TCTCCTCTACTCGTTTAGTCTACGATAATTGAAAGCTATTTTATATTATAGATGTAATAAAAAGAGTTACATTATTAATTGATATCAATTGTAAT 
TGCTTTCCTTTCATATATATTCTGTGTCGAGTGATTTGGCAGCGAAACCAACTCAAACCTCTTCTCAAAATTCTCTTTTTTTGATTGGCGTTTAG 
TACTTAAATTACTTATGTAATTGGAATGAAGTGCGTTGCATTTCCCGCCAAAAAACTCGACATTTGGGCGCGCGTTTTCAAAAATATCACTATCA 
AAACGATATATTTATGCAGTGTGTAAATGTGTGCAGCAGAAGACTTCTTAACAGTTGGAAAGTAATTGAGTGGCCTGCAATCGTTGCTATTGCTT 
CTAAAGTAACTAGGATTATCTCCGCTGCGCTCCTAACTAGTCCCACTTGGTGCGCTCCTAGCATATGGTTCCATTGAGCAGCTCGTAACAGTTGG 
GCGGCGATGCTGGCGAGGCTGCTGCCGCCGCCTCCTGTCGCGTCTTCAGAAAGTTGCCCACCCAATGGGCCGTGAGATCCAGATATTTCGTTCTC 
GAGCCCGGATTGTAGTCGAGGAAGCCGGCGCGTGGCATCACCTCCTCGATCTGGCTGCCATCCGGCCAGTTGTTGTAGCTGGCTATGTTGATGAA 

GCCCACGTGGTTGAGAATGGCTGTCCGCCAGGCCACGCC 
(SEQ ID NO: 1324) 

Exon: 1794.. 1626 
Exon: 1563.. 1001 

Start ATG: 1794 (Reverse strand: CAT) 
Transcript No * CT33573 

ATGCCATCGCCCAAGAAAAGAAGTGTGGATAAAGCGGATGGCAAGAAGGGCAAGGGTCAAGGAGCAGGCGGCGTGAAGGGATGCTGCTGCAAGCG 
GTCGCAGTGCATCAAGAACTACTGCGACTGCTATCAGTCCATGGCCATCTGCACCAAGTTCTGTCGATGCGTTGGCTGCAGGAACACAGAGGTGC 
GCGAGTTGGTGGACCCCAACTCCGTGGCCAAGAATTCCAGCGCTGTGAAAAGGCAGAAGGCGGCTGCGATGAGCGCCAAGGCAGCAGCCGCAGCT 
GCCAAGGCGGGCATCGATGTCCAGGGCAAGGCGCTCCAGGTGGCGGCATCCACTTTGGCGTTGCCCGGCAAGGCGCTAATGACGCCGCCAAAGTA 
TACCCTAGTGGCTGGCAAACCACCGATGGCTAGTAGTCACATTAACCCCATACCCATCTCACGACCCATAGCGACTGCCGCCACTCCAGCCAGGG 
CAGTGAAGCAACCGGCGGAACCTCCGATGCCCGTCAATCTGATCATTCCGGTGCGGCACGACGATCGCAGGGATCGGAATCTCTTCGTGCAGCCG 
GTGAACGCGGCTCTGCTGGAGTGCATGCTCATCCAGGCCACGGAAGCCGAACAGTTGGGCCTGAACGAGCTCCAAGTGTGCCAGCTGGTGCTCGA 
AGAGTTTATGCGCGGCTACAAGAACATTTTGGAGAAAATATGCGAGTACAGCAAGGATTATTATTAA 
(SEQ ID NO: 1325) 

Start ATG: 1 (Reverse strand: CAT) 
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MPSPKKRSVDKADGKKGKGQGAGGVKGCCCKRSQCIKKYCDCYQSMAICTKFCRCVGCRNTEVRELVDPNSVAKNSSAVKRQKAAAMSAKAAAAA 
AKAGIDVQGKALQVAASTLALPGKALMTPPKYTLVAGKPPMASSHINPIPISRPIATAATPARAVKQPAEPPMPVNLIIPVRHDDRRDRNLFVQP 
VNAALLECMLIQATEAEQLGLHELQVCQLVLEEFMRGYKNILEKICEYSKDYY* 
(SEQ ID NO: 1326) 



Celera Sequence No. : 142000013384574 

CTCTGAGAATGGAGCCAGGGGAGCGGAGAAGCCCCGCTCGGCATTCATAAAGTCGGTGGTCAGTCCACTGATGCCCTGCTTCACCTCCTTGCCAG 
CGTCGCGCAGAAAGTCCTCGAAATCATCAGTCTTCTGTGCAAACTTGGTGTCCAAATCGAGTTTGGCATCCTTAGCGTCGGCGGCCAGGCGTTGG 
ACAAACTCCTGCGCATCGCCAAAGTCCAACTTCTTTTGCTCGTTGGCGAAGTAGTGCTCCGTGTGGGCATTATCGTCCTGGGCACCACCACTCAC 
ATTGCCAGTATCGCCACCAAGTGCAGAGTCTTCGTAGGCACGTTGCGCCTGCTTGAAGATGTCCTCGTTGGTGGCAGCGGCCTCCTTTTTCAGTT 
CGTCGAAGTTCCGATCCATATTCTACAGTGTTTTGATTTCAAAATATAACACTTGTGATCAATAAGAAATTTTCACAGATGTGTTTGAATATCCT 
GGCCTTCTGATTGGAACTTCGCACACACACACACGTACGCACGCACACGACACACCCACACACACACGCACAGGCACCGAGCTACAGCTGATTTT 
GCCGGTTGAAATTAATATGATTTTTTTTAGCCGACTTAAGTGCCTTCTGGTGACTAAAAGGCCGCAATTCGCAGTGTTTTGTGCTCGGGAACGTT 
GAATTTAATTTGAACCACAGCGACTCTTTTACAGGCGCCGTCGATGCAGACGTTTGAAAAGTTTCAATGGACCTGCGCTAGAGGTGGCACAGCGG 
T C GAT AC TAT C G AT GAC GC T C AC AGC AT AAT T GC CT AAAAATGG AC AAAT ACC T T GAAC AGC C AT GT T T C G AAC C AAAGTTT T GATGTTGCCGGC 
ATCTTTAGCCATGACATTAACCCAATGGCTCTGGCAAATTGGCTAAAATTCACGACAGCTAAAATTTGTTGTATGTATTTCCAGAAACTTCGCGC 
AATTCAGTTATGCCTAATGTTTGATTTATTTGTAAACATAACAACCATGCTCAGGCCGTCCACTTGAGGCAGCGCGTGTCCTGGTGCGTTTGCAG 
GCATCTAACGCAACAGTATCGCGTTCCGCAGGCGGTGCACGAGTACAACGAGAAGTTGCCGCAAACGGCGCAGAAATGCCGCAGTGGCTTTTGTG 
GAGCCGGCGCTGCGGCGCTCTCGTAGTTGGGCTGCTTGTCCTTGTCCTCCTCGAGCAGTTGCTGGAAGTTCTTGCGATACTTGACGAGAAAGTAT 
TCGGCGCCCTTGCGCTTGCCTTTCTTCTCCTTGCCGGTCTTCAGGCTGTCCTGGAATTTGGGCAACTTCTTGGACATGACAAGGTCGGCGTGTGG 
GTCATCGTGGTAGTTGTCCTGCTCCAGAGCCTCCAAGGCTTTGCGGGCACGCCGCTGGCGGGCTGTGGAGTCCAGGACACGCTTCTTTTCAGCGT 
CCTTGATGCGGTTGGATTCGCGACCCGTCATGGTTGTTTTTAGGTTTTTACTTTTGGCTATTGGATTTTATGGGTTGTGTAGGTTCTAAGCGTCC 
T T GT C GT GC AGC GG AAAGC T T AT CC G AT C CAT C AAGT C GT C C AC T C GCGC CT GC AAAAC TT CC AC AAT GT C CGT GG AG AT AAAC AAAT GGT T C T C 
GTCCAGATCCTGGATAATGAATTTGCGTCCCAGAGCCAGTTTTTCGTCCAAGTGCAGCAGGAATTGTTTCATAGCAGGATCACTGCGGGGATTGG 
GTTGAATATAGCGAAACCAACGCATACCAAACGTATACCACTTACCACTCCACCAGCACTCCTTTCATAACATTTACCATGTTGCCAGCATTTGT 
TTGCCGAAATTAATAAACAATCTCACCAATTCTCAGAGCTGAATGACATATTCACAGTGAAACGTAGGCTAGAGGTGGCAGTCTTTACTATCGAT 
AT T T T T T T GAC T T T AC T T GAT AATT T AAT C AAT AATT T C AGC AT GGC AAT AT AAT ATT T AC AAT C AAC G AAAT AT AAC AC AT T GCT T T AAAC C AG 
CACCCGATATTAACTATATTATGTTAGTTTCACCGTCGGAAGTTTCGTACTTTAAGAAATATAGCTATTTTGCCAGCCCTATCAGTGCTACTCAC 
TACGAAAAGTAGACAGAAGTAGCGGGGTATTTGCCAGTCACACGGACAAACACCAAGGCAAAAGTGTGAGGAAATATTGAATTCCAAATTAAATT 
CCTAATTAAATCAGACCATAAGCACGCAATGGAAAGGCTGACCGGAAGGAATGTGGCACTACTGGTGCTGTGCCTGTGCGCCGGATACGCCCTGG 
TGTTCGCCGAGGGCGAGAAGGAGATACCGGTTACCAAATTCGGACAGAACATAGCGCCGACGATGACTTTCCTTTACTGGTGCGTTATATGCGGG 
GCAATTAACATAATTGATAACAATGACCTCTGTGTTTCTCACCATTGATTCGCCAAATAGCTACTCCTGCGGCTATCGCAA 
(SEQ ID NO: 1327) 

Exon: 1456.. 1001 

Start ATG: 1456 (Reverse strand: CAT) 
Transcript No. : CT33597 

ATGACGGGTCGCGAATCCAACCGCATCAAGGACGCTGAAAAGAAGCGTGTCCTGGACTCCACAGCCCGCCAGCGGCGTGCCCGCAAAGCCTTGGA 
GGCTCTGGAGCAGGACAACTACCACGATGACCCACACGCCGACCTTGTCATGTCCAAGAAGTTGCCCAAATTCCAGGACAGCCTGAAGACCGGCA 
AGGAGAAGAAAGGCAAGCGCAAGGGCGCCGAATACTTTCTCGTCAAGTATCGCAAGAACTTCCAGCAACTGCTCGAGGAGGACAAGGACAAGCAG 
CCCAACTACGAGAGCGCCGCAGCGCCGGCTCCACAAAAGCCACTGCGGCATTTCTGCGCCGTTTGCGGCAACTTCTCGTTGTACTCGTGCACCGC 
CTGCGGAACGCGATACTGTTGCGTTAGATGCCTGCAAACGCACCAGGACACGCGCTGCCTCAAGTGGACGGCCTGA 
(SEQ ID NO: 1328) 

Start ATG: 1 (Reverse strand: CAT) 

MTGRESNRIKDAEKKRVLDSTARQRRARKALEALEQDNYHDDPHADLVMSKKLPKFQDSLKTGKEKKGKRKGAEYFLVKYRKNFQQLLEEDKDKQ 

PNYESAAAPAPQKPLRHFCAVCGNFSLYSCTACGTRYCCVRCLQTHQDTRCLKWTA* 

(SEQ ID NO: 1329) 



Celera Sequence No. : 142000013384 645 

TTATCTGATTTGTCACACCTGCCTGTTGGAACCGCATACTCATATAAGGGTTAATCTTATCAGAGGCCCACCGTTGTTGCCACCGCCTTTCAAAT 
CAAATCAGACCGCTTTGCCATAGTAATGGTCTGGCAACAACAAATGAGTATCGCTACATGGAAATGTGCGCGTGTGTGTGAGAGACCAACAACAA 
AACAAG7AAAACAAGCAAACGCACGAAGAAAGTTGTGAAAATTCAATTTTCCAATGGCCCAGTCTGATTGGGACTGCCTTGAAGCGTCCTCT7AATC 
AGATAACTATCAATGTAAATGCGTAAATATCAACGCATATGCGTTGTATTTGTGGAAAATATTACGCTTTAACCACATATACACACACCCCCGCT 
TCCGGTGGGTGTTTTTTTTACGCTGGAGGGGGGAGGGTGGTTTACTCACCTATGCGCTGGCCAACTGGAGTGGAGAAAACATTTCCCAAGGCGCC 
AACGTTGAAAAATGAAGCCATCGTGGAGAAGTTTCTTATTAATGCTCGCCGTTGGGTTGCAGTGCGGTTAAAATTGTTCTTTATTTACTTTTTGT 
TCACCTGGATGAAATGTTTATGCACTTGTCTACAGCAGAGTGACCATGGCTGGCCTCAGCGAAAAATACCCACAAAGA7AATGCATTACAATAAGT 
GGGTAAATAGGAATAGGTTTTTTTCCAGGGCTGCAGCATTTTATTTCATTTTATTTAAAAGAATTATCCACAGTTTTTGAGATAATGACATAGTT 
ATACCCGTTACTCGTAGAGTAAAAGGGTTTATTAGATTCGTTGAAAAGTATGTAACAGGCAGACCATATAAAGTATATATATTCTTGATCAGGAT 
CAGTAGCCGAGTCGATCTGGCGATGTCCGCCTGTCTGTACGTTTGAACGTCGAGATCTCAGGAACTATTTAAGCTAGAATGTTTAGATTAAGCGT 
GCAGATTCCAGAGACATAGACGCAGCCCAAGATTGTTGACCCATGTTTCCATGACCACTCCAACGCCCACATTAGATTGTTCTTAGTTTTATTGT 
TCACCTTGCACCTTGCACCTTTTGGATTGTAATCACACCCACATCTTTAAACAATTTTAATTTTTTCTCATTTTATTCGCAATCCGACTAGCTGA 
GTAACGGGTACCTCATAGTCATGGTTGTTCACAGAGCGTGTTATGTATCTCAAAAACTTTGTAATTTCAAAGAGTTATTTCACAATTCATTCCAC 
AGATTTCGTGGATAATGAATTTTCGATTGTTCAACCTTGCACCTGTCTCAAAAATGAGCTTTCCGTCAGCGGTCCTTCTGACGGTACCGCTGGCG 
GTCCCACTGATGGTTCCGCTGATGGTCCCGTTGATAGCCCCACGGACGGTCCCCTTGACGGTTCCGCTGACGGTCCCTTCGATGGTTCCGCTGCC 
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GGTCGTGCTGACGGCCCCGTCGACGGTCCGGCTGATGGTCCCTTCGACGGTCCCGCTGAGGGTTCCGCTGAGGATCCCGCTGAGAGTCCCACTGA 

ATGGCATTGGAACGAACTCGTTGATGACCAATCCAATGACGGTCGCGCTGACGGTCCCGCTGACGGTAGCGCTGACGGTCTGGCTGAGGGTCCCG 

CTGAAGGCTCCTCTAGCGTTCCCGCTGACTTTCCCGCTGACGTTCCCGCTGACGTTCCCGCTGACGTTCGCGCTGACGCTAACGCTCCTGCTGAC 

GGGCACGTCGTCGGTCTCGCTGTCAGTGATCTCCATGTCGACGGCCCAGATTTCCGAGGACAGTTCATAGGATTCGTCCTCTGAAAGAAGAGAAC 

GTAGGGTATATTAGTGAGTGAGGCTTTGAGACTTATATTTTGATAGTCCCGACTTGGTTAGTCGGCTAAACTTCTCTGTTCCAAGACCCATAATC 

TTAAGCCCATTTAGATCTAACTATAAGTTGCATGATTCTTACAGAATATTGTGTTCTGACTATAATAGTCATTATCCTATTATCTGTAATCTTTG 

CTCTCTGCCGCTCTTAAAGCAATCAATCAAATCAATCGGTGCATTATTCACAAATACGACGCAAAATTTGTTTTTGGTTAGATTAAGGAATATAT 

GAGTTAGCGGTTCCAACAGGCTGATAGGACAAATCATATGCCTCGCACGTAGGGTATAAACTGAGAAGAAACGTGTTTTTCCCCGGAATTCTTTG 

CGACACATACAGTTTGTCAAGAAACTGTTTACACACTGTGAAATAAGTTGAATTTTTGACTTTAAAGCTAAAAATAAAGGGTTTTTTGCTTAATT 

AAACGCAATTTTTTTATAAAATATAATTAAACAATATTTATTTTACTTATAAATCAAAAAACAAATTAAAAATATTAAATATACAAGAAAATAAA 

CAACAAATTCCAAGTTTACACACTTTTGAGACTGTCAAGAAACTCTTTACACATTTTGTTTTCCTACTTTGTTTTGTTCTTTTTCTTAGAACGAA 

CTCGATTTTTCCGTTATTTTTGGCTTTGCATTGCTTTTTACAACGCTTCCTATTGAAATTTTGTTGACTTTGCTTGTGAAATTTTGCTGATCA2^A 

CGTGCTTAAAGCGAATTATTAAATTTAATAAAATGCCTGGAAAGAGATTAACTTTTGAAGTTACCCAATTAATAAACTATAACCACCAGTTGGGA 

AAATCTTTTCCAGAATTAGTATAAATGTTTCCTGTATCCCGTAAGACCGTCTACAATGTTTTAAAAGGCTCGGAAAAAGAGGGCAGGCTTGAACC 

TAAGAGCGGTGGTGGGCGGAAAATTAAAATTAAAAAGCG 

(SEQ ID NO: 1330) 

Exon: 1001.. 1028 
Exon: 1238.. 1794 
Start ATG: 1001 

Transcript No. : CT33785 

ATGACCACTCCAACGCCCACATTAGATTATTTCGTGGATAATGAATTTTCGATTGTTCAACCTTGCACCTGTCTCAAAAATGAGCTTTCCGTCAG 

CGGTCCTTCTGACGGTACCGCTGGCGGTCCCACTGATGGTTCCGCTGATGGTCCCGTTGATAGCCCCACGGACGGTCCCCTTGACGGTTCCGCTG 

ACGGTCCCTTCGATGGTTCCGCTGCCGGTCGTGCTGACGGCCCCGTCGACGGTCCGGCTGATGGTCCCTTCGACGGTCCCGCTGAGGGTTCCGCT 

GAGGATCCCGCTGAGAGTCCCACTGAATGGCATTGGAACGAACTCGTTGATGACCAATCCAATGACGGTCGCGCTGACGGTCCCGCTGACGGTAG 

CGCTGACGGTCTGGCTGAGGGTCCCGCTGAAGGCTCCTCTAGCGTTCCCGCTGACTTTCCCGCTGACGTTCCCGCTGACGTTCCCGCTGACGTTC 

GCGCTGACGCTAACGCTCCTGCTGACGGGCACGTCGTCGGTCTCGCTGTCAGTGATCTCCATGTCGACGGCCCAGATTTCCGAGGACAGTTCATA 

GGATTCGTCCTCTGA 

(SEQ ID NO: 1331) 

Start ATG: 1 

MTTPTPTLDYFVDNEFSIVQPCTCLKNELSVSGPSDGTAGGPTDGSADGPVDSPTDGPLDGSADGPFDGSAAGRADGPVDGPADGPFDGPAEGSA 
EDPAESPTEWHWNELVDDQSNDGRADGPADGSADGLAEGPAEGSSSVPADFPADVPADVPADVRADANAPADGHWGLAVSDLHVDGPDFRGQFI 
GFVL* 

(SEQ ID WO: 1332) 



Celera Sequence No. : 142000013384184 

AAGTTAGAACCTTCGCGTCTACTAACAGATTACTTTTCGCCCCGATATCCACTAACCTGTGGCTGCCCCAGTCGAAACTTGGGGTCTGGGCGGTG 
GCGCCCATTATCCTGCTGCATAATAATGCTAATTCCTAAAGTTTGTCTATCCCGGGAGCTGTATTGTTTTACAGCTTTAGTGTT7VAACCTACAGA 
TTCTCAGAGAACAGATGGCTACTCATTCGTTGACTAGAAGGGAGGGCGGAGGGCCTTTGCTGAGAGACTCTCAAAATTGAAAGTTTTACCTGGTT 
CGTCGTAAAACTTTTTTTGTTTTGAGTTCAAATTAATGTTTTATAAATGAAATAATAAATAACTAAATAATAATAATAAATAAATAATAAATAAA 
TAAGCTTATATAATAAGATCCATTTTAATTGTTTTTTCATAAAACAAAATGAGTGATTTTTTTTATAATTACAAATGAAACACTTGAATTACCGA 
AACAAAAACGCTTAACTGTGCCATCACTAATACTTATTGGCAATAAATAAGTGCGATTTTATGACAACAAAGACGTATCGATAAGATGCATGTGC 
AAAATGCGGTGTGTGTATTAACTTAATACTTGGCCTAAGGCTTATCGATAAGGATATCGATACTAAAAAGTCCGGCACCATTGTCGCTATCGAAT 
GCCCAAGTATACAAGACAAGCTATGAGAACGAATATAAGGTGGTCAGTTTTAAGTAATTCAAAAAATTCTAAGTCAAAATGTGTCGCAAGTGTAG 
TTACTTGATTTAATCGCGZ^ATTTACTCGTGGATTATGTCTTAAGATAATATCGATATATTGCCGGCGTCATCGATTGCTGACACATATCGAAAG 
GTCCGGCCACACTGCTCACCGTCTTCTTCTTGTCATCCCTTCCGTTTCCCTTTCTTTTCGTCTTCACGTTTCCGGTGAGTGCGTGTGCTCCGCGA 
ATTTTTAAGCAAAACTCCCTATTAAATACCTCCAATTCTCTTTTTTACTTGCAGCGGGAGTATTACGATACGTTCGCGTTCGTGTAACTTTGTGT 
TTGCAAATCAATCCAGGTAATTGTCGCCGAAAACCCCCATTTTCAAGCGGCCGCACCGGGAAAATATGCGCCCAATGTTGTTCCCGAATGCATGT 
GTACATTTTTTTTTTGTGTGGACAAAGAAGCGAGTCTTTCAGACTCAATAAACAATGTGTGAATTGGATACCACATGGATGATCTGTAGATATAT 
CCTGGCAATGAACCGAATCTAATTGCTATTTTTTTTTTTTGTTGTTTGTGCAGAATGTTTATGCCAAAGGCCCATCGTGTCGCGATCTACGAGTA 
CCTCTTCAAGGAGGGCGTGATCGTGGCCAAGAAGGATTTCCATGCCCAGAAGCACCCGGAACTGGAGTCGATCCCCAACTTGCACGTGATCAAGG 
CGATGCAGTCGCTCCACTCGCGCGGACTGGTTAAGGAGCAGTTCGCCTGGCGCCACTACTACTGGTACCTCACCAACGAGGGAATCGAGGAGCTG 
CGCAGCTACCTCCACCTGCCGCCCGAGATCGTGCCCTCGACGCTGAAGCGCCCTGCCCGCTCGGAGACCGTCCGTCCCCGTCCCGCCGTTGGCGG 
CCCACGCGGACCCGGTGACGCCTCCAAGACCGGAGAGGATCGCTCCGCCTACCGACGTGCTCCCGGCGGCAGCGGAGTGGACAAGAAGGGCGATG 
TTGGACCCGGTGCCGGAGAGGTCGAGTTCCGTGGCGGATTCGGACGCGGATCGCGCAACTAAACAACCCAGGCGCTTCAGTTTTTTTTTCTAATG 
TATAAAAAATTACTCAGTAAAATGTTCAACTGAAGATCCATACCCTTAACAATAGAAGAAACCAACATCTGAAAAATGTGTGTGTGTTTTATGAT 
ATGACGATCAATTATAAAGCAAACGCAAGTGAAATTCCGTGCTCCCCTCTATTTTTTTTTTTTGTTTTGTTTTTGTTGTTTGGTCCAGTGATTGT 
GGTGTGCAAATCGGACGTGGATTCAGATCCCAAGCGTGGCTAAATAGTATAGTAAACCCGATGCCCGGCCAAACTGAAACTGCTAACCCAACACA 
TTTTTGCAAGAAGCTCTTCAGATAACTTTAGAAGTAGGATTTTGCAGATAATTCAGTTTATTTTGCGAGGTTCTGTCGAGAGGAAAGAAGGAATT 
CTAGCGTAAATACGTTTTTTTGAATACAATACATGCATTCAATTTTGGTAATTTCAAAAGATTGTTGTATATAGATTTGGATATTTGTCGATTTG 
TAGCTCTTTATTATAAAAATATGTGCAGAAATTCATCGCATTCGTTTATAGCTGTGTGCCAAAACATAGTAGACTTGGGTTGGAATAAACCAGTT 
GGCTCGAAATACTACATTCAAATATTTAAATAACATTAGAATCCAATTGCTCATAACAAAAACGGCATGAATTGCAAGGACATAGATTAAATTTG 
ATTTACCATCAAAGGAAATGGTATATTTTCTAAACGATAAGCCAAAATCTTCGTATGTGGAGGAACGTAACTTAAATGCTCAATTTCGTTTTATT 
TGTTCACGCTACTTAAACGTTCACAATTAGTTAAATAACTAGTTTACTTAAATAGTCTTCTTTTGCTCACGAGCATCCTTTTGTTGTTCTGTAGC 
TGCCTTTTACTTATATATTAATTCTAATTTTATAAATACACTAGTTTCACATTTCCCAAAGTTAGGTTTTAGTTAAATCTTATGTGTCTACTAAT 
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ATTATTTAAAAGCTTATGTGCTTCTCCATTTAACGTTGATTACTTTCAACTAATATTTTATGTAAACAAGTAAGTATGCTGCTATGCTCGGACCA 
TAAAGAATTCGAATCGCTTAGATTATTTATTTAGGCCGACCAAAAGAACTACAACAATATACGACGAATGGCATTATTCTAGGT 

(SEQ ID NO: 1333) 

Exon: 1001.. 1934 
Start ATG: 1296 

gcagcggga^ 

CCGCACCGGGAAAATATGCGCCCAATG 

aacaatgtgtgaattggataccacatggatgatctgtagatatatcctggcaatgaaccgaatctaattgctattttttttttttgttgtttgtg 

CAGAATGTTTATGCCAAAGGCCCATCGTGTCGCGATCTACGAGTACCTCTTCAAGGAGGGCGTGATCGTGGCCAAGAAGGATTTCCATGCCCAGA 

agcacccggaactggagtcgatccccaacttgcacgtgatcaaggcgatgcagtcgctccactcgcgcggactggttaaggagcagttcgcctgg 

CGCCACTAC^CTGGTACCTCACCAACGAGGGAATCGAGGAG 

ccctgcccgctcggagaccgtccgtccccgtcccgccgttggcggcccacgcggacccggtgacgcctccaagaccggagaggatcgctccgcct 

ACCGACGTGCTCCCGGCGGCAGCGGAGTGGACAAGAAGGGCGATGTTGGACCCGGTGCCGGAGAGGTCGAGTTCCGTGGCGGATTCGGACGCGGA 
TCGCGCAACTAAACAACCCAGGCGCTTCAGTTTTTTTTTCTAATGT^^ 

AATAGAAGAAACCAACATCTGAAAAATGTGTGTGTGTTTTATGATATGACGATCAATTATAAAGCAAACGCAAGTGAAA 
{SEQ ID NO: 1334) 

Start ATG: 296 

MPKAHRVAIYEYLFKEGVIVAKKDFHAQKHP^ 

SETVRPRPAVGGPRGPGDASKTGEDRSAYRRAPGGSGVDKKGDVGPGAGEVEFRGGFGRGSRN* 
{SEQ ID NO: 1335) 

Name: ribosomal protein SlO-like 
Classification: ribosomal_protein 

Celera Sequence Wo. : 142000013384111 

CTGGGCCGATATCGTTTCCTGCAGCTTCCGGCGAAAGGCAAACTTGCGGTTACGCTGATCCTCCAGCAATTTGGTCAAGTTACTGTCATCGAAAG 
GCATCTTGGCATTCATCATGATGAACAGAATGACTCCCAAGGACCAGGCGTCCGCTAGCTTTGGATCATATGGTCGTCCACAAACTACTTCAGGT 
GCAGCATAAGCAGCTGATCCGCAGTATGTCTCAGATTTCATCTCCCGGCCATTATCGTCGCGACAATAGCGAGCAAATCCGAAATCGGCCAGCTT 
GATGTTGAGCCGCTTGGAGAGAAGAATGTTCTCGCACTTGAGATCCCGATGGGCAATGTCCAGGTTGTGCAGATACTTCAGTGCCTTGGACATCT 
GAAAGAACCAGATCTTCGATTGCTTCTCATCGATGGGACCCGACCTCTTGATGTGCGACAACAAATCCCCATTCTCTGCGTAACGCATAAAGATG 
AAGATCTTGGGACCGCGCTGAAGGATGCTATGGATCTGAATAATGTTGGAATGATCAATCTTTGTCAGGATCTCCAGTTCGCGCGGAAAGAACTT 
GTTCACGAAATCAGTGGGCGCCTTGGCTTTGTCAATGATTTTACAGGCCAAATGTACTCCATGTCCATGATCATCGGCATAACCGGCGGTTATAA 
CAGTGGCATAAGACCCCTCGCCGATCTTGTGACCCACATTGTAGCCCCGCTGTGCCAGTGCATCCACATCCGAACTTCGAGTACCCAACTGACGG 
ATCCCAGCGCTAGAATTGTATTTGGACATGGCTCTTGTTTCCTCGGCGTGCGTGTAGTGTAGTTTTCCGAGAGTTTGCAACTAGGTGATTTGGTA 
ATGGCATCAAGCGAATGATTCTTTTCTAAACAGCGGACATAATTACCCTTAAGTTGGTTGATTTCACAAACAATAATTTTATGCCACTTGTGTAT 
TTTACTTACTAAATTCGAATCCAGAAATTTTGACAAAACCTACAAACGTTATGAGGTGTGAAAGAAATATAAAAAAGGGTAATGCACAGCCTTCT 
ACCATAACTGACAAATCTGAACCAAAGTATAAGGTTACTACGTCCCAAATATTATTATGCTAGCATGCGAAATGCTAATAAGAGGGCATAAAAAT 
AATATATAATATATGGATAAGGAAAAACTGTTTAAAGTTGAGTATTTGTTTTGAGACATTTTAAAAGCCTTAAATTGATAAATTAAGTAAAAAAA 
AAAAACTGTGTTCTGAAAGCAACTAAAGTTTTAATATTTCTATATTACAGTTGCCATTCTTGTTTCCTCGAGCCAAGCCACTGGCTCTGCGTCCC 
AACCAGCTCCAATCAATAGAAATCCCCGAGGCAGCCACAACAGCAGTGGCATTGGCGACTCTGAGCTAAGTGCCTCAGAGCGAAGTCGACGATTG 
ATACCCTACATGGCCTTCTATCTGCCGGCGCCCGAGTTGCCCATTAATCAGCAGTATGCCACCAAGCACTCGTCGTCGGCTGGCGGAGCACAGCT 

GCAGGCACCGCTGCCAGGTCCTG GAAG GAT AC C AGT GC C GGTGGCCTACATGCC G C AG C AG AAG C AC CAT C C CC AC C AC C AT GGAC C AC C GC AT C 
CGCCACCAGGGCCGCATTATCAGTCGGGAGCAGGCGAGGTCTATCACTCCCTTGCGATTGGAGGCCACCAGCATCAGCATGCGCCCCCACATCCG 
CCACCTGGTCATGTCAAGGAGCAGCAGGCGATCACCAATTCGGGTGCCACGTATATTGCATACAAGCCATTAAACCCGACGCATAAGGCAAGTAT 
TTGAATAATAAGTGTTTTTAAAAAGGCTATGTCGCCTTGGGAAAGAATATATATCAAATGGAGCTAAATTAGTGCTGAATCGGTTTAGATTGCTT 
CCTTCTAAAGTCCATCAATCGTGAAATACCAGCAATATTGCGAATATTGGAATATGAAACGTCGAGGGTATTGAAAAAGCACTTGGATACCTATC 
CACTCTCCTTCATTACAATTGATTTCATGGGTCATAAATAAGCAGAAATAATCACGTAATTCATAATTTACACGCACGCAGCACAAGACAGTGCA 
GCATTTTCCGCCACTACCGCCGTCGCCCAAGGAGCAAGTGCAGCAGCAGCATCCACAGCAGCACCACCACCTTCTGCACCACCAGCAGCAGCAGC 
AGCAGCAGCAGCAGCAACCACAGCAATACGAGATCCTATCATCGAGCTTAGTGGCATCCCAGGCGCAGCGACAGCGGGGCAAACAATCAAAAATC 
AATTTAACACCATTTACAGCGCAGAACACATTGCCGGGACAGTTTATACCCATTATTTATACGCCAGTGAGCAGCAGCAAACCGGGCTCCCCCGG 
CGATGCCATCAATTACAATAACATCCAAAGCAACCTGAAGCAGCCAGAAATTGTCTATAAAAATTCCCATCAAACACAAATCGTTACGGATTATG 
CAGCCGGAGCGGCTGCCTCCGCCGAACAGCCACCACTATCGTCATCTGGCGAGGAGGTGGATGAGACGAATGGCGAGAAGATGGTGGAGCAGCAG 
CAGGAGGACCATAGAGCCCAGGAGGAGGCGGTGGAGCAGCAGCAGCAGCAGCCGCAACTACACCAGCAAGTGCAAAACAATCCAAGGTAAGATTT 
CTTGTCACTCTTGTGACTCTCATGCCCAAACGGAAAACAGAGCGCTGGCGCGAAAGCCCAAAAGTAAGCCAGGTCTGATCTTTGATCTCATGGAA 
TATCCAAAGCCTCGTCTGCTTAATAGGCAGTTCTCACAGCCACGCCCACTTATATATCGTGAGGTAGTTGCAATATCGCATGTAGAATACAAATT 
AAAATGTTGCCACTATGGCTGTCTTTAAAACAGTTTCATGTCATCCATACATTTTTAACAACTTTACACAACGAACCTTTAATTGGTCTTAAAAA 
AAAAATGCTATACAGATTACTTTTAAGACACAACCCACTTAATTAACTTACGTTACGAAAGGTTGCTAAACTTATTGTTTCGCCATCCAAAGATG 
TCAATAATTAATTGGGAACAGCCAGTTATCAACAAACTGCAGTCCAATTCAATATTCGATAATGCAACAGCGACTTTAATATGCTTCTGTTTAAA 
ACGCATGCAATTGGGATTTCTGAGTCTGAGACGGATACTGCTGTTGCAACGCTATGCCCTTGGGCAGAAAAATCAATTGCATTTCATGCACCAGC 
CCAGTCCACTCCAATGTCCGGCTGCGACACAGACCGCGTCCGCTCTTCTCCATCTCTGAATCCGGTCGTCCACATCGTCACCATCTTTGCAACTG 
CACTGGTGACGTAAGATCCATGACGACAACGATTGCGACTCCAATTGCCGTTCGATGTACAGCAGGAGGCACAGCCAGTGGGCAACAAAAAGGTG 
GGCTGAGGAGACTTGACCTGGCCGGCCAGTGTCTCATGTATGTATGTACATAACTCCTCTCCACATTCTCATGTAGTAATTGTCAAATGGCAAAT 
TCATGTCATAAGTAGCCAGCGGGCTCAGGTCAAAAATTTGTCATACAATGCTTATCAGATGAGGAGTGGGAAATGGAAAAACATGCCAATCAGCG 
AAGATTTTTACAACCACCACACTTCAGTTGAACACATTTCGATGCTGCACCTTTGTTTTTCGGTTAAAATCATTCAATAAGTTACACTTTAAGGT 
GCAAACGAAATAAATTATGCTTAATAGGCATATTACGATTTCAAGAGTAGGGAAAGTCAGGATATAAAGGAATACAATTTGTCAAATTATTTGGC 
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TGTTATCTACTAGTCATGGATAGATAATCAGATACAAATTAAAACTTCTTAAGGAATAATGAATAAATTTGAAAATATCACAATGAAATAGGGCA 

ACCTAACGTACACTCTCATGACATTTATGCTTGAAACCGAAATCATAACCGCTTTGCATGATTAACCTCCAACAGCATCGGCTACGCTTCTGTGT 

CGGTTCCCGATTCGAGCGCAATTACCGCGACCACATCCTCCAAAAACGCATTGATAATACATCAGAAATCACCGGCTTTCGGGCACGTTAGCAAA 

TACGAAAGCTACTTCAATATGGCCCCGCCCTCGACTAGTCTTGGTCATTCTCCTCATCAGCAGCATTCACACATGACGCCGCAGGAGAAATATGT 

GCTTTACTTACAACAGCGCATGAAACACCAAAAACATCAGCAACATCTGCAGCAGCAGGAGCAGCAACAGCAACAGCATTACGACTTTTCACATC 

CGGTGGTGCCGGCACAAAGCAATACATTGGCTGGTAGCCACCATTCGCAACACACCCATCTGCATCCCCATCATCCCCAAACCAAGACCCATCAT 

CCAAACAAGTACGATGCCCTCTACGTAAGACCGGATGCGACTGCTGGCCAACAGGTGCTGCATATCCCGCTGCGCGCACTCATGGCCCACGTCCA 

AGAGCAGTCGCACTCCCAGTCGCAATTACAAGCGGAGTCGCAGGCACAGTCGCCACGTCACGATTTCGCCAAGGATGAGCCACTTGGGTAAGTTC 

TTTGCTTGGGAGGAGGAGCAAGGATCGTTGAGAGGAAACAAGTTGCTTAGGTCTTACCAACGCAATAATGATTCTCCCTTCTCAAGAAAAAAGTA 

TTCTACGAGTTTATTTAATAAGAAAAACTATCTAAATTGATAGTGATGGTCTAAAGATAATCCACATTACTAAGATATATAAAACTGCTTAGTAA 

GTACACAAAGTACTCAATACGTCCTTCAAAAACTAATCATTTCTAAGTAACAAGTGCATCATGAAATCATAATTTAGCCTCCCAAAAGACCTTAC 

TTTAGATCTGAACATTTCTTTTAGTTCCCCAGCACCACCGCAGTTTATAGACTATCTAATTGATGGGCCAAGACAATCGCTGGAGGAAGAGCAGC 

AGCACTCCCAGCATGAGCGACCATCGGCTCATCAGCCGCCCCAGCATGCCTACATCCTGGTGACCACGACAGCTCCACATATCGAGCATCAGCCT 

GCACCGCAAATGCATCCAGAGATTTCCACACCGCGTCCACCGATTCTCTATAACCAACAGCCCACACTTCGATTACGTCCCGTGCATACTTACAA 

GGCCACCAGGCGTCCCATTTACGTGACCACACCCGCACCCGCAGTGTCGCCGACTTCAAGTCACACGGTAGAGATTACGCCCAAATTTGTTTACG 

TTTCGGGGAAGCCTCCTACTGGCCAAAACCCAGAGTTACCGCTTAGAGAACCATCTACGGTAACGCCTATTAAACTTAAGCCCGTTTACAAATAC 

GCCCATGAGCGCCCTCCATTAACGCCTGCATACGTACCAGAGGATCAGGATCAGCTACCAGATATTCGCACCTCATCACTGGCCGAAATCCTTCA 

CAAGCTTCAAGCGAGCAATCATCTTCCGCAAACCCTAACACCCGACAACATCGACAACTCCATCAAAACATTGATCCGCATACTGCAAAATCTAA 

AGCAAACCCAAACCATTGTGGCTAATCCTCCGCAGCATCACGACGAGCACAAGTCCAGTGCCCAGGATTATGATTACAATACGGGCAGCGAAGAG 

GAACACCATTCTCAATCAGAGGAAGCGATTGCCAGACCCAAGGGACCAAGTGAGTAGCCTTTACTTTATTATGTTTCCCTATTTTATTTCCATTT 

GAATAGATAAACATCCAGGACCAAGCACTGGAAGACCTGGCATCGATTATCCGAACTATGCGGAGATCCCTCAAACCAGTTTTGAGTGCACCAAG 

CAACGCTACAAGGGATTCTTTGGCGATCCGGAGACCAATTGCCAGGTCTGGCATTACTGTGATCTCAATGGAGGAAAGGCTTCGTTCCTTTGCCC 

CAAT GGAACCAT ATTT AGT CAGGT AAATGGGCGAACTTAT GTT ATAC ACATGTTAAGAATT AAAC AATTAAAAAC AT AT ATAAAAAC AGATT GCC 

TTAACTTGCGATTGGTGGTTCAATGTCAAATGTTCAACAACCGCACAACTTTATGTGCTGAACGAACGACTTTACAAGTACATTCTGCCTTTTAA 

TCCAAAGTTCCCCGAGGACTACAACGGACCTATTGTGGATAAGTAAGACACTCTTATATGATTCGAAAGTATTAATTAACAAACGTGTCATTTAC 

TTAAGGTATCTGGCAATGAAATTCCAGGAAATGGAGGAGAAGATGCGCCTGGAGAAACAACGCAAGGCGGCCCAAGAGGCACAGAAGCCCGAAGA 

AGC ACC AT CT AC T C T AC CTGCGTTGCC C AAAAAT C AC AAGCC AG AAC C G AAAC AT GGT AGC GGAAT AAAT GCC C AGGTCT AT GAG C AAAGCT CT G 

AGAAAAACTTGCTGATCGATGATGAGATAGATGATATTTCGGAGCGTGGGACCTACGATACGTACGACCAGACCGCACCCACAACTATAATGGCA 

CCCACGAGCACCCAGGACACGCAGTCCTTTGTCCTTAAACCAATCGTAGTTTCCTCCACTCCACAACCGCTGCCAGAGATTGATTTCGAAGTTGA 

GCCAACAGCTCCAGGCAGCAGCGAAGAGGAGGATCACCTGCAAAGTCTAAGGGAAACTAAGGAAGCTGAGAAGAAAACGCGTGAATCCACAAAGG 

TGAGCGTGGAAAAGCTGGAAGTAATCGAGATCAAAACCGATGGAAACACCGGACAACTAATGCCCATCAAGAGCAGTAGCAAATGAGCATTTTGT 

GGAGAGGATGAATAGCTCTTGCCATATTTGTAATGAGAAATTTGTTTGCGACTTGGTTGTATAATCCCTAAATTATTCTTAGCGAGCAAGTACTG 

TAAATAGCTAGCTGTAAATGCAACTGTATACATACCTATAGCTATAGTGAAATAAAATCTGACAAGTAAGTAAAAACGATCATAGCACTTACCTG 

TTTGCCATTGAGGACATGAATATGAGCCATGTTGAGTGAGTTTAAAGTCGTGGGAATCTCGGAAGCATAGCTTTGATTCTGTGGAAGAAATCAAT 

TAATATTAGGTGGATTATTATTATTAGATTAATCACGATATTTCAAGTTGCTAGTGCGCAACGTAATGGATTTATAAGTACAATGCCACAATTTA 

TGTGTACCTCATAGATATGTAAATTCTAAAGCATGTACGTTTTGTAAGCTAGCATTTGAAACCATATTGTAGATCTCAAAACTAACGTGCTTAAA 

ATCTTATTCAGAACATTTAAATAAGCAATTACATATACAAATATGGCAGGGAAATGTTTCAATATTTCAAAAATTATACTAACAATCTCGATTAG 

CTTCCGTATGCGATGCCCGCTATGATGCACGGTGGAGCAAATTTGGCAGTGGGATCGGTAACACTTCATACAGTACTTGTCTTTGGGCATTCGGT 

GATGGATGCAAAGGTTCTGACTCGGCAATGTGAAGTACTGATCCCCCACTCTGAGTTTCTTGCTGAAACTACTGTCCTGTTCCGTAGAATTGCGA 

AAAATATGCGACTTTAGTACGCGACTACGTTTGTGGACAGCATCAAAGCAACGGCGGCAGTAGAAGGCATTGCATTGCCTGCACTCTCCGCTGGC 

AGTGGCATTCCCGCACTCGCTGCACTTGACGGTCTGGACAATCTCGCTGATCGAACTGGGGCTCAGCGATTGGTTTGCCGAGTCCTTGGCAAATG 

GCAGAAGGTAGCTGAGGCTACTGGTCCTACCCAGCACGTGG 

(SEQ ID NO: 1336) 
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ATG: 


1001 



Transcript No. : CT33934 

ATGAGGTGTGAAAGAAATATAAAAAAGGTTGCCATTCTTGTTTCCTCGAGCCAAGCCACTGGCTCTGCGTCCCAACCAGCTCCAATCAATAGAAA 
TCCCCGAGGCAGCCACAACAGCAGTGGCATTGGCGACTCTGAGCTAAGTGCCTCAGAGCGAAGTCGACGATTGATACCCTACATGGCCTTCTATC 
TGCCGGCGCCCGAGTTGCCCATTAATCAGCAGTATGCCACCAAGCACTCGTCGTCGGCTGGCGGAGCACAGCTGCAGGCACCGCTGCCAGGTCCT 
GGAAGGATACCAGTGCCGGTGGCCTACATGCCGCAGCAGAAGCACCATCCCCACCACCATGGACCACCGCATCCGCCACCAGGGCCGCATTATCA 
GTCGGGAGCAGGCGAGGTCTATCACTCCCTTGCGATTGGAGGCCACCAGCATCAGCATGCGCCCCCACATCCGCCACCTGGTCATGTCAAGGAGC 
AGCAGGCGATCACCAATTCGGGTGCCACGTATATTGCATACAAGCCATTAAACCCGACGCATAAGGCAAATTGCTTCCTTCTAAAGTCCATCAAT 
CGTGAAATACCAGCAATATTGCGAATATTGGAATATGAAACGTCGAGGCACAAGACAGTGCAGCATTTTCCGCCACTACCGCCGTCGCCCAAGGA 
GCAAGTGCAGCAGCAGCATCCACAGCAGCACCACCACCTTCTGCACCACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACCACAGCAATACGAGA 
TCCTATCATCGAGCTTAGTGGCATCCCAGGCGCAGCGACAGCGGGGCAAACAATCAAAAATCAATTTAACACCATTTACAGCGCAGAACACATTG 
CCGGGACAGTTTATACCCATTATTTATACGCCAGTGAGCAGCAGCAAACCGGGCTCCCCCGGCGATGCCATCAATTACAATAACATCCAAAGCAA 
CCTGAAGCAGCCAGAAATTGTCTATAAAAATTCCCATCAAACACAAATCGTTACGGATTATGCAGCCGGAGCGGCTGCCTCCGCCGAACAGCCAC 
CACTATCGTCATCTGGCGAGGAGGTGGATGAGACGAATGGCGAGAAGATGGTGGAGCAGCAGCAGGAGGACCATAGAGCCCAGGAGGAGGCGGTG 
GAGCAGCAGCAGCAGCAGCCGCAACTACACCAGCAAGTGCAAAACAATCCAAGCCCAGTCCACTCCAATGTCCGGCTGCGACACAGACCGCGTCC 
GCTCTTCTCCATCTCTGAATCCGGTCGTCCACATCGTCACCATCTTTGCAACTGCACTGGTGACGTAAGATCCATGACGACAACGATTGCGACTC 
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CAATTGCCGTTCGATGTACAGCAGGAGGCACAGCCAGTGGGCAACAAAAAGGTGGGCTGAGGAGACTTGACCTGGCCGGCCAGTGTCTCATCATC 
GGCTACGCTTCTGTGTCGGTTCCCGATTCGAGCGCAATTACCGCGACCACATCCTCCAAAAACGCATTGATAATACATCAGAAATCACCGGCTTT 
CGGGCACGTTAGCAAATACGAAAGCTACTTCAATATGGCCCCGCCCTCGACTAGTCTTGGTCATTCTCCTCATCAGCAGCATTCACACATGACGC 
CGCAGGAGAAATATGTGCTTTACTTACAACAGCGCATGAAACACCAAAAACATCAGCAACATCTGCAGCAGCAGGAGCAGCAACAGCAACAGCAT 
TACGACTTTTCACATCCGGTGGTGCCGGCACAAAGCAATACATTGGCTGGTAGCCACCATTCGCAACACACCCATCTGCATCCCCATCATCCCCA 
AACCAAGACCCATCATCCAAACAAGTACGATGCCCTCTACGTAAGACCGGATGCGACTGCTGGCCAACAGGTGCTGCATATCCCGCTGCGCGCAC 
TCATGGCCCACGTCCAAGAGCAGTCGCACTCCCAGTCGCAATTACAAGCGGAGTCGCAGGCACAGTCGCCACGTCACGATTTCGCCT^AGGATGAG 
CCACTTGGTTCCCCAGCACCACCGCAGTTTATAGACTATCTAATTGATGGGCCAAGACAATCGCTGGAGGAAGAGCAGCAGCACTCCCAGCATGA 
GCGACCATCGGCTCATCAGCCGCCCCAGCATGCCTACATCCTGGTGACCACGACAGCTCCACATATCGAGCATCAGCCTGCACCGCAAATGCATC 
CAGAGATTTCCACACCGCGTCCACCGATTCTCTATAACCAACAGCCCACACTTCGATTACGTCCCGTGCATACTTACAAGGCCACCAGGCGTCCC 
ATTTACGTGACCACACCCGCACCCGCAGTGTCGCCGACTTCAAGTCACACGGTAGAGATTACGCCCAAATTTGTTTACGTTTCGGGGAAGCCTCC 
TACTGGCCAAAACCCAGAGTTACCGCTTAGAGAACCATCTACGGTAACGCCTATTAAACTTAAGCCCGTTTACAAATACGCCCATGAGCGCCCTC 
CATTAACGCCTGCATACGTACCAGAGGATCAGGATCAGCTACCAGATATTCGCACCTCATCACTGGCCGAAATCCTTCACAAGCTTCAAGCGAGC 
AAT CAT C T T CC GC AAAC CCT AAC AC C CG AC AAC AT CG AC AACT C C AT C AAAAC AT T GAT CCGC AT ACT GC AAAAT CT AAAGC AAAC C C AAAC C AT 
TGTGGCTAATCCTCCGCAGCATCACGACGAGCACAAGTCCAGTGCCCAGGATTATGATTACAATACGGGCAGCGAAGAGGAACACCATTCTCAAT 
CAGAGGAAGCGATTGCCAGACCCAAGGGACCAAATAAACATCCAGGACCAAGCACTGGAAGACCTGGCATCGATTATCCGAACTATGCGGAGATC 
CCTCAAACCAGTTTTGAGTGCACCAAGCAACGCTACAAGGGATTCTTTGGCGATCCGGAGACCAATTGCCAGGTCTGGCATTACTGTGATCTCAA 
TGGAGGAAAGGCTTCGTTCCTTTGCCCCAATGGAACCATATTTAGTCAGTTCCCCGAGGACTACAACGGACCTATTGTGGATAAGTATCTGGCAA 
TGAAATTCCAGGAAATGGAGGAGAAGATGCGCCTGGAGAAACAACGCAAGGCGGCCCAAGAGGCACAGAAGCCCGAAGAAGCACCATCTACTCTA 
CCTGCGTTGCCCAAAAATCACAAGCCAGAACCGAAACATGGTAGCGGAATAAATGCCCAGGTCTATGAGCAAAGCTCTGAGAAAAACTTGCTGAT 
CGATGATGAGATAGATGATATTTCGGAGCGTGGGACCTACGATACGTACGACCAGACCGCACCCACAACTATAATGGCACCCACGAGCACCCAGG 
ACACGCAGTCCTTTGTCCTTAAACCAATCGTAGTTTCCTCCACTCCACAACCGCTGCCAGAGATTGATTTCGAAGTTGAGCCAACAGCTCCAGGC 
AGCAGCGAAGAGGAGGATCACCTGCAAAGTCTAAGGGAAACTAAGGAAGCTGAGAAGAAAACGCGTGAATCCACAAAGGTGAGCGTGGAAAAGCT 
GGAAGTAATCGAGATCAAAACCGATGGAAACACCGGACAACTAATGCCCATCAAGAGCAGTAGCAAATGA 
(SEQ ID NO: 1337) 

Start ATG: 1 

MRCERNIKKVAILVSSSQATGSASQPAPINRNPRGSHNSSGIGDSELSASERSRRLIPYMAFYLPAPELPINQQYATKHSSSAGGAQLQAPLPGP 

GRIPVPVAYMPQQKHHPHHHGPPHPPPGPHYQSGAGEVYHSLAIGGHQHQHAPPHPPPGHVKEQQAITNSGATYIAYKPLNPTHKANCFLLKSIN 

REIPAILRILEYETSRHKTVQHFPPLPPSPKEQVQQQHPQQHHHLLHHQQQQQQQQQQPQQYEILSSSLVASQAQRQRGKQSKINLTPFTAQNTL 

PGQFIPIIYTPVSSSKPGSPGDAINYNNIQSNLKQPEIVYKNSHQTQIVTDYAAGAAASAEQPPLSSSGEEVDETNGEKMVEQQQEDHRAQEEAV 

EQQQQQPQLHQQVQNNPSPVHSNVRLRHRPRPLFSISESGRPHRHHLCNCTGDVRSMTTTIATPIAVRCTAGGTASGQQKGGLRRLDLAGQCLII 

GYASVSVPDSSAITATTSSKNALIIHQKSPAFGHVSKYESYFNMAPPSTSLGHSPHQQHSHMTPQEKYVLYLQQRMKHQKHQQHLQQQEQQQQQH 

YDFSHPVVPAQSNTLAGSHHSQHTHLHPHHPQTKTHHPNKYDALYVRPDATAGQQVLHIPLRALMAHVQEQSHSQSQLQAESQAQSPRHDFAKDE 

PLGSPAPPQFIDYLIDGPRQSLEEEQQHSQHERPSAHQPPQHAYILVTTTAPHIEHQPAPQMHPEISTPRPPILYNQQPTLRLRPVHTYKATRRP 

IYVTTPAPAVSPTSSHTVEITPKFVYVSGKPPTGQNPELPLREPSTVTPIKLKPVYKYAHERPPLTPAYVPEDQDQLPDIRTSSLAEILHKLQAS 

NHLPQTLTPDNIDNSIKTLIRILQNLKQTQTIVANPPQHHDEHKSSAQDYDYNTGSEEEHHSQSEEAIARPKGPNKHPGPSTGRPGIDYPNYAEI 

PQTSFECTKQRYKGFFGDPETNCQVWHYCDLNGGKASFLCPNGTIFSQFPEDYNGPIVDKYLAMKFQEMEEKMRLEKQRKAAQEAQKPEEAPSTL 

PALPKNHKPEPKHGSGINAQVYEQSSEKNLLIDDEIDDISERGTYDTYDQTAPTTIMAPTSTQDTQSFVLKPIVVSSTPQPLPEIDFEVEPTAPG 

SSEEEDHLQSLRETKEAEKKTRESTKVSVEKLEVIEIKTDGNTGQLMPIKSSSK* 

(SEQ ID NO: 1338) 



Celera Sequence No. : 142000013383898 

GAATCTGCAACAGGATACAGCAATGCATTCACGTAGTTTAAAAACAGAACAAATTTTCGGACTGATTTAGATGAATTTTCATTTGGCTAAATATG 
TAAACTAAAAGATAATTTAAAATAACCCAAATCATAAGCATCTGGAGCAATGACCTAGTGAAATGTGAAGATGCAGCAGATTAAATTTGATACAG 
TAAAAAAGTACTTATAGGAATGTCAATTTTATCGCGCAACAGAATGTCGTACAATAACATTTAGAAGCCATATGAAGAACAAAGAAATCAAGATA 
TACGTATTAATTATAGTATACTGAAAGTGTCATAGCAATTAGACGTTGGTGATTCCAAAAACACATAAGTGCATGTCTGTCGCTCAAATAGGTTT 
CTTTTTAGATATGTACGATATTATTTGCATACCTAGTATATAATTGTATTGTATAAAGTATTAATTATGATATTCCTTCCCAGTTGTGGTGGCTA 
GCATTTGCTCAATTAGTTCTTTAGCTCTCTACTGCGCACTCTTACGCACCGGTGTACCTACTATTTTAAGAACGAGAAACTATTTTTATGATCAA 
TCCAGTTTTTTACACACACTTGTTATGTTTACACAAAAGCTCACTGACACAAAAATGGCTGGGTAAATTGGAATGATATCCAATTTTGAAATATT 
TCGAAATATTTCCCAGCTGCACATCACAAACGAACAAAATCCTACAGAATCTTTCCTAATCTCTGTAACCAAATGGTTGTAAGAAATTCTCACTA 
TATCAATTTAGTTTACAATTCCCTCCCTATAATGTGGTACTATTTTTGTATATTTGAGTTTACAAAATTAGAGAAATTTTTAGAACTGATGTATA 
TTAGAAATTAATCAATTCGTCCCGTTTTGCAATATATTTATACCCTATTTTATAATTTGTATTTGTTTTTTTTTGCAATAAAGAAGATATGATAT 
ATAAACGAATGACTTTGTCTTTTATTCCCAGCTCAATAAATACAAGCACTTCACTACATTGTGCAAACGATTAATTTTTATTGATTAAATGTTTA 
GGATGTGTTAAATATAAATTTATGCTGTATGTTTATATATACGCAGCAATCGCTTAAAGTTCTGCAATTTATTTGGTTAGAAACAGCGATTGCAT 
TTTAATTAGAGCTCGATGTTGAAGGCAGGGCAGCGAGGATTTCTTCGATGAAGTTGTCCGGGTGCTCGATGAGATGCAGGAGACCTAAGATGTTG 
AACAATTTAAATTAAATGACGAAATTCAAATGGAGCTGCCACGGACACTCA€CTGCAACATCACAGAATTCGACAAAGGAAGTGCAGGGCAGGAA 
CTCCTCCGTTAGGGTCTGCAGGATGCTATCCGTGGACTTGAGCAAAGCCTCATCGTTCTTACGAAGATAGTCGAACAGGGAAATCGGCCAATCCT 
CCTCCTTGGGCGTGCTTACCGTCTTGAACAGATTGTCGTAGATCAAGTCCACCAATGCCCACATCATAAGGAAGGGCTTGTAAACAGTGTAATCC 
TTGACCTGGCGGTCTGCAGGAGCCAACAAGGCTTTGCACAGCACGGCGGGTGAGACGTGGCGACCCTGGGCCACTGCAATCATCCTCTTCAGATG 
TGCTAGCTTATGCTTATTCCATAGCTCTCGGCTGTGCAGTGACAGCGAAATGGTTGCCATGAAAAGCGGACCATCCACCTCATATCCGCATTCCA 
ACCACTTCTCGCTGGGCGGCGCCTGCAAATACGTCAGCAGAGTCTTGCTGTCTCTAGCCGCTGATCGCGACGAAAGCAACAGATAGACGGAGTAG 
AAGAGCAGGTAGGGTACGAAGTGCATGTTCGATTGCGGACCACCTCCGCCGGCATCGTCATGGAAGCTGCGTTCCCAAGCAAATCGTACCAACAG 
CAGCTTCAAGTCATGCACACTGCTGGTGTAGGAGATGTCACAGCGCTGGGTACTCTCCTGCATATAACTGCTGTGGCGGGTCATGCAAGCGGAGA 
AGGCCGCCTCTCCAACGGCGGGTCCCCAAAGGGGCAGCAGTCCATTACATCTGGTGTTGGCATTTTGCAAACTAGCACGCTCCCATTCATCCCTG 
CCACGTGTCAATCTTATGGCTGATGTGTGGCAGTCCACATGGACCACATTGAAGTGTGTGACGGTGGTATAGCCAATGGTTTTTCTCGACTTCAG 
CTCAAACTCCTCGACATTGCAGCGCTTGGTGAAGGTGTAGATACCAAGCACCTTGTCTGGCTGGCAGGCGTAGCCTTCGCGACAAATAAAGCAAG 
TTAGTCCTGTTTCGTCGCGCAGTTTTTCGATCTTCTGCAGGATGGAGCCCTTGGCCGTCACCTGGCCCTTTTCATTCGTGCGCATTCCCAATGCA 



FIGURE SHEET 710 



TCCAACTGTTTCTCGCGCGTGGCCATTGCCAGCCGCTTCTTCTCTGCCCTTGTGAAGTCACGCACTTGCTGCACCCTCGCTGCTGTCGCCGAATC 

GGTGGAGAGAGCTTCCAAAAGGTTCTCGGCCAGACTGCCCACATGCTCATCGCTGGATACTTGTTCCAGGCGATGAATAATGGGTATAATGTCCT 

TGCTAATAGCCACCTGAGTTGCCTCATGATGATTTGAGAGGCCGGTGAGGAAACGAAGAATATACTTCAAGCTTGGCCGCGAGATGAATTCTTTC 

AGCTCATCGGAGTCGGTTCGCAGGAGAGTGGGCTTGACACAAGGCGCATGCTCGGTGATGTAGGCTAGCGATCGCTCAACGATGCCGAGACTTAC 

TATGTAGTCCTTGAGAGTGCCGCCAATGCAGTTGTGCTCAATTTGGTTAGTCAGCACGCAGAAGAGTTCCAGCTTGAACTCTTCTTCAGGTGTGC 

GCTCGTTGTCAAAGCGATTAAAGTTCAAAGTGTCCTGAAAATATAACAATATTAATCCTCGATCGTATAAGCAAGAAGTAACACCTTACCTTGAA 

ATGCTCGCAAAGTAGAGCCATCTTCACCTCGTTGCCATAGACCAAAGCGGCTAGGACACGAATAAGATGGCGGAGCACCGAAGGATTGTTTCTAA 

CATTGGGACAGTCTGTGCACGATATTAGTGCGCTTACATACTCCGGTCCACCGAATGTCAAGGAGAACTGCAGGAACGAGTCCAACGTATCGCTG 

GCCGCCTTCGATAGTATTGTTTCCATGATCTCCAATAACTGCTCGGTAACCGCCGATTGAATCGAATCGTTCTCCGACTGGAGACACATTTGTAG 

CACCTTAAGCAGCGTGTTGATGGCTCCGATCTCGGGTTGGCAAAGCACCTCTTGGCAACGGCGCACTTTCACACAAATAAGGAATAGCTTAAGCA 

GTACCTGGATAAGTTCTCGGTTGCGCGATATGCGCTGCAGGCTGCCAATTCTTTCCAACATTACGCGTAGGCCGTTGCAATCAGCCAGCACATTG 

GCCATGCGATACAGCTGCTCGGTGTCCACCTGCTCCTGGCTTTTGTTGTTCAGCGTTTCCACAAATTCCTCTGTGGCGTCTCCCAGGAGACCGCG 

CATGCGATATACAATTCTCATGGCATCTCGGTCGCCTCCTTCAGCTAACCAAACTTTCTTGTAAACATCCTTAACGGGCAGATCCAAGCTTATTA 

TTTTGTTGTT GAC C AGC AAT T C C ATGC C GT TAT C AT CT TC C AAC AGG GC GAT T AGT T C AC AAT C C GT AC AAAT CTTATTCTT C AC AT CC C GC AT C 

AATGGACCCAATCCCATTTCACTCGAGGGGTAAGGATTACCCAGCATGCGACCTTGCAGAAAGTCCTCCTGCTGGGGATCTTTCTCTAATGTCAT 

ATAGAACTCGCTTTCGTCATGCTCCTCGGGATGAATAATGGAATATAAGCGCTCGAAGACAAACACCGGTGTTTTAATGTCATTCATGCGGGTCT 

TCTCTACGGTGTCAATAAGGATTTCCATAAAGGCACGTGTCTCCTCCTCGGTTCCGCTGGTCATATCCTCTAGCATCTCTAACAGCTTCTCCTGT 

GCGTCGTCAATGAGTCGCGTACGCTGCACGACTAAGCTTCTTAGGGCAAGATAGCTTTCCAGAACTGGTCCTAGCATGCGAGTCTTGTAGGTTCT 

GCGGATATTTGGACACTCTAGGAGTAGCCACAACAATTCCACATACTGGCGCAGTGCATATCCTAAGGATAAATCTGAAGAAAGCGAGTACTCCT 

CCATGCGGTGCACCTTGCTGATTTCAATAGCCAGAAGCTGTGAGATTTGACTTAATACTCCTTTCAGGACCAAATACTGCAGCCAGGGTTGCTCT 

GTGGCCATGTTCTTGTACAGCATTAAGTACTCTGCACTGGCTTCCCCTGCTTCGCCTACATGTTTCAAGAACGATGTCAGCATATTCAGAGCCTG 

TTGCCGCCGTTCGTTAGTTCGGCTCAAGCTGCTCAGCAAGGAGCAGGCCAGCTGGCGACCGAATCTGGAGTTTGCATTAAATAGCAACGGTTGTA 

TCCACTTGGCGTTGAATACAAGAGGTTTAATCACTCGTTGCTTGTCCAGCACACGTTCCCGCCAGCGTTTTCCGTACTTTTCGCTAAGGAATGCG 

ACTCGCACCTCCCTCCTGCTCTTCGGCGGTGCATCTGATCCGCCGGATGCCGCCACATTCTGATCCTTGGCATTCTTTAGCTTAGCCTGGTTGTT 

TGACGGCATCCGCTTGCTCCACGCTGCGTATTCATGATTACGGTCTGAGTTCAGCCAGGCCCTATAGTCCACCGTGTTGCCCTCAAACATCTTCA 

TGCTACACAAATCTGTAGTGGCCACCTTCTGATTGGGCTTGCTGCCGGGCAGAACCGGGCAGATGAGATTCTGCATGATGCGCAGACAGGGGTGT 

AGTACTGCTGTTACCGGACCACGAGGCAAACTGTAGAAAGAATTTAAATTATATCACATTTATATTATTTTCTAATATAAAACTTACCGGCAGTT 

GCTAATGAACAGCTCAAAGATGACCTTTAGCTTGTATTCCCAGCAGATGTCGTCCTGGCCCAAGAGCACCTCCAAAAGTGTCATTTCTTGGTGCA 

CAGCTGCCTCTAAACTTATCAGAGGAATCGATCCCATCAGAGCATTCTTCACCCTTGTGGTCACCAACTGCAACAGATGCATGCATGCCTCGGCA 

TTATCCTTAGTCAGTACGACCAACAGGTTGCGCACTTCCTCCTGAATTAGGGGAGTGCCTCGTCGCAGGTTGTGCTCAGCCAGTTCACTGACGAG 

TCCCTGACTATACAAGCACACGCGACAATCGTAATTGTAAGCCATGGCTCGCAGAAGCGTCAGACACTGCTCGGTAGATGCCAGTGCACAGCCGT 

AGCAGCGATTTGTGGTGGGATTTTCGGCACCGGTTGTTGAGCCGGGATTAACCGGCGGCTGGTCCTGCTGTTGACGATCGTAGGCCACTAGTTCG 

CTAAAATTAAAATTATTATTAGCCAAAAATTATAATTTCTAAATCAGTGATAGTACCTTCGACATGCCTTGACCTTCTGAACGATCTTGCTTAGC 

TCCTCGAAACTGGTGCGCGATTCCACACAGTATTTCTGGGCGAGCAGCTGTATAATCTTGTTTACTTGAGAAGTGCTGTGCACGCTGGCCATGTT 

GTCCTCGACCATTCGATCACTGCTACGATGCTCGGCCACCTTTTGAATAAGAAGCTCCAGCATTTGCTTGTTGGTCAGCAGCTGGCGGTAATTTC 

TATCGGCTCGTTCTAGGGAGCTGTGAATCAGTGAAACAGTCTTGACCCTATCCTCTGCCGATTCTATAGGATCAACAGCGCAGCAAACACGGGCA 

TACATGCTGAAATCAAATTTGGCATATTTGCAGAAGCCACAGGAGTGGCACAGGAACGGGTCCTTCTCATCGTAGTTAATGGCCCGACATTTGTG 

GCACTGGAACACGTTCTCACCGCAATTTCCGCAGACTCCCGGATAAGCCGGCACTGCTGCACTACAACGTGGACATTGTAAATTCTCACTGGATC 

CACTCACAGTTTCAAAGAAATCAGCAAACTCGATCATCAAATTGCATGCGGTGATTGGTAGTGGAAAATCGATCTTCAGTTCGGTCTGTGCCGAT 

TGCAGGGATACTGAGCGAGCTTTGTGCCACAGAGCTGGGCGATTTTTTAGCTCCACTACCGCCTGGACACTGCGGTTGTTGTAGTAAACATTAAT 

GGTTCTCACCATCTTTGTCCGCTTTAAGTCGGCAATCCGCACGATCAACTTGGAAATAGTATGGCATTGGACTAGCTTATAAATCATTGTGGTTG 

TGGTGTACTTAGAGTCTGATTTAACTGATGGCAACTTTATATTAGCCATCGGCACCTCTGGATTGTTGCACACTAAACAAGGTTCCGATTCTAGG 

TAGTAACCATTTACTTGAAGTATAGACTCCAAGGTGGTGTATATTGGGGCATTTGGATGCTTGCAAAGCAGTTCGTTCTGCTGCCTGAGAACATC 

CACAGCGCGGGAAACGAACTCTGGCAGTCGTTCTGTTATACTTCTGGTACTTAGGGTTAGATACCCCAATAGGTCGACAAACTGCGCTGTCCTAC 

GACCGTAGGTGGGCACCAATGGCCACAGGTTCCAAAATATGTTGAGAAGTTTCTCACGCTGCCGTTCATTGGCATGCTCAAAAAGTCCGGTCATA 

AATGAGTGGGCCTGCCATCTCAGCTGAGTAATGTTGTTCTCCAACAGGAATATTCGCACAAATCTGATTATCAGCGCATCGCATGCGTATCTAAA 

GATCTGGTGCACAAATGTACCACACTGGGCTGGATCAAACTTGGAGTAGTAGGCGTCTGTATCCTCGCTCTTCTCACGATCTGTTCGAAGTTTTC 

CGGAGGCAGACGAGGTCGATGGCTGCGGCTGGGCCTGCTTAGATCCACTAGGAGGGGGTAAGTTGCAAACGGCCGCCTGTAGCAATTGTATAATA 

ATAGGCGACACGCCATCATCCAACTGATAACAGGCGATTTCCATCAGCATGGCCAACGCATCTTCATGGACGACGCAGAACTTCTGCCAGTTGCC 

CGTGCGCATTTGGCTTATCTCCTGGCAAGTACGTAGGTGTTCCGTCAACTCTACCAGTGCATCGTATGAAAGCATAGGAGGATTAGCGTTGTAAG 

GAGCACCAGTCTGAATGCAAAAATTAAAGAATATTAATATTTAAGATAATAGTTACAATATACCAAATCACCTTGCTGCTGACAATGTTGCACAC 

CCGCTTGACCACCCGGAAATGAGCGTCGAGGGAGTGACCGTCGCGATACATGCGGAACTTTTCCTTGCTGCCGCAAATGTACATCAGGAGCTTTC 

GCACTTGCCGACGAAGCAGTGTGTTCAAATTTAGCATCATGTACTCACAAAGTGTAAAGGTCATCGCTTCGCAGAATCCACTATCGTAGTTATCC 

GGATGGGCGCTACTTAAGCGCAGAATTTGATATGGCAATCGCACGATAATTTCAGTTAGGAGTCCATCGTAGGACTCGAAAATATCCGGGTAGTT 

TTTCAGATAGTTGTGCGGTATAAGTATGGATATGTCAGGTTTAGGACCATGTTTAATGGGCTTCAGTAGCTGAACTCCGGGACCTGAAGCTCCGC 

CACTGGCTTGTGCCGACTGTGAGGCTAATCCGCCACCGGCTGCATAAGGATCTCCTAGCACATTCTTCCAGTGCACCAGTAGCGTGTTTAGTGCT 

TCTAGGGCGTAAACGAATGCTCCACTCTTACTGAGAACATTTGCTGTGGTACTGGCCACAAAGGAGCTATTGTCCTTAAATTGATGCGGCGGCGA 

TGTTGTTCCAGTTTGCTTTGACTTTGTTTTGCCCATCAGAACTCCGAACAGGGAAAGCAATGCGAGGCGCACCTCATTGCCAGGACACTTTGTCT 

CCATCTGCAATTTAAATGTTTAGATATATTTTATAGAATAATTTTTTATTTACCCTTGCGGCAGCTCCTCGCTTTCCAATTTCCACATAGTCCAC 

TAGAGCAGTAAGCAGATCGTGAAGCACTTGTTGATCCCTTTCGTTGTTTCCATTGAGATCGGTAGTCAACATGAGTATAACCTGGATCAGTGGGA 

TTGCCTGCAGGCCATTGCACAAGTCGAATGTTCCTAGATGTTGGATAATGCTCTCCAGCACAGCAATTCTATAATGGAACATAATTGATATACCC 

TAACAGCATATGATATATTTTTACACTTACCTAAGGTCATGTAGCTTGCTGAGCTTATCGTCCTCGTTTGCTTCATTATCCTCACTGCTGCTCAT 

GGCACTCAAAACAGCAACACCACCGGAGCCAGCAACGCCACTGGGACCTGGCTGCTCTTGATCGTCCTTGGTGGATCCCTGACGCGGCGGCGAGG 

CATTCGCGTGGTCGCCAAAGTTGCTGCTCCTGGCGGATGTTCCACCAATACTCTCTACTCCACTGCCTCCACTCTCTGATCCACCGCTGCCAGCA 

GGCTCTGCTGGAGAAGTACGCAATGTTGATCCATCGGTTGCCACATTGGAACCTTCATCATCATCGGATCCACCTAGGGAGACATTGGCAGCTGC 

GTTCACAGCTGCTGCCATGGCAGTAGCTTGACGGATGCCCTGAAGATTGGCCAATCCTTGCTGAAGAGACTGCAAGGCGTTTGCATCCCCACCGT 

GCTGTTGGAGACTCAATGCGATGGCAATATCCATGATGGCCTCATCGTCGGCATCTTCGGGTAGGCCGAGTAAGCGAGGGAAACCGCCACCCATG 

AATGCCTCCAGATTGAGCATTTGCTGCTGACCGACTCCTCCTACTCCAACTGCAGCACCTTCCGCGTCCGCATTCGGTTCGGGTAATCCACCTCC 

TCCAACTGACCCATCAACAACAACGCCAGCTGCTTCCTCGATGATATCACTTGCAGCTGAAGAAGCCGCTGCTTGCAGTGCCTGCATGGTAGACG 

TTGAGGGAGTCGGTGTTGAACATGCTGGGGGTGTAACTATGGCCACCTTTCTGCGCTTCATTCTTGGACTAAGCAAAAGGATCAGGGCCTCTTTG 

GCGCTGTGGCTAATCACGCTAGCATCATGCTTCAGCAAATCAATGAAGTACTTGGTCATGCGACCAACCTGTCCGGGATCGGCCAAGGCAAATCC 
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GTACATTATCTCTACCAGACAATGGACAATGGTCTCCGGCTGGCACAAGCCTCGCTTCACAATATTCACGGGTTCGTCTAGCGTCGGTGTAACCT 

TATGCAACTCCAAGACAATTCCCATGAGGGTTGGAACAATATCATAGTTGTTCTCCACGCAGATCTTCGCTAGCGATTGGGGTCGTGCGTTGGCA 

ATGCCACGGACCATCAGAACTAGGCGGTAGAAAGCCTCAGGGTCGATCTCCCGCAGAGTTTCAAGTTTTTCAAGCTTCGGCTGCATTGACTGTAG 

TTCTCTATTTACAAACTGAAGAAGCACTCCATCCTTGTAGTTATGGTACAATGCTCTAGTTCCGTATAGAGTGGCCAGCACGTAGCGCGCCTGGG 

TCTGAACAGGATTGGGCGTTGGCAGTAGCAATAGGGCACTCGCTGTTTTAACTGCCTTTTGACGCATAGAAGCAGGTACTCCAGATCCCAACAAA 

TGCAGTGCACAATCCATTACTTCCAAAAGGTGATTAGCCATTCGATCCAGCTGAGTAATGCAATTGAATCCTTCTCCAAAGTTAGCGCTACTTGC 

CTGGGAGGAAGATACTGCTGGCGCACTGGATCCTGGGACAGTAACATCCTCGCTTTCGTCGGGCCAGCCCACCAGCTCTTTCGATTTTCCATATA 

CCTCAATGCAATCCAGTAATGTCACATGCTCCGGATCTGGAGCTTTAGCAAACACGACTTTCAATAACTTGTCGGACTGGAGCATTTCTTCCCGG 

GTCAGGGGGATATCGAACCAACGTGCCCTCCTGACCGGTGTGGGTATAGTACGTCCCAGAATAGTCACAGATTGGGGAGCACGCTGTACGTCCTG 

AGTTCCAATTAACACTCTTATTCCCACGATTACAACATTGGGATCATTATTGATCACCTCCAGAGTAAAGCCATTCGATCGGGTGCTGGCCACAA 

ACATTCCCGTTGAGAACAGACGCGTCTTTAGTTTTTGTTTGTTGTAAATCTGGAGCAGATCGTTGCCTCCAAATTCCACATCGGCTAGCATGTTG 

CAGTGCTCGAAAAAGTCAATTGGGAAAACGGGCTGACCTCCAGCTGTGGGTTGCTTCTGTTGTTGGCTTGCCTTTCTGCTGGTCATCTTTCCAGA 

TGCTGTATTTGATTTGGACTTTGAAGTCGAACCAGATCCACCTCCCTTAGCCATCAAGGTGGACGAGTACAACTGATTACCAAATGGCTGAACCT 

GTGGCGATAGCCAGAAGCTAGTGTATTCTGGTTGAGCGGAAAAGATCCTCAAGCTTCCATCCTCGCAGAGGAGCAAAAGCGTAGTTTTTTCCACT 

CCGGCAACCGCATGACGAATGCCAACAACGTCCATGATTCGTGACTTGGCCGATTGTGCCTTGATCTCTTGCAGGTAAATCCGTTCAGGGGTTAT 

CATCAATATAATCGGATTATTCGATGTCTGCATGCTGGCGTAAACCAATCCTGGATGCCCTGTAACCTCTGTCCACTGCACGAGAGGCTGTAGTG 

GTCCCTTCGAAGCTGACTTACTAGCCGAATTTGTGTCCAGGTGATATATTCCCTTCACGCCTTCACTTACATTGGTGAGTGGTGAAAAGAAGCTC 

CTTCCGCAAGAATAACTGTAGAAGAGCAGCTGTAGGGTATGCGAATAGTAGATGGATACTCCACCGCCTCCGATATGACCGGCAATATCCTTGAT 

ATGTTGGTGGCTCAGCTCCAAAGTGTTGGTTACATAGAAATCTCCATGAACCGCTAGACTCTGTTGATCCAACTGCTGAGTATAGATGTATCCGC 

TGCTTGCAAAGGTCAGCATATTGTAGTTGCCATCCTGGTACACAAACGTGCAATCTCTAATTTTGCCAACGGCCACTAGGTAATAGTACTTTGGG 

CTATATGTATCCACTGCCAAATCGTAAATCTTTACATAATCCGAAGTTACAACGGCAAGAAGAGTCTGTGAGCCAGGCAGCCACACAGCTTTCTT 

GATGTAGTTTCCATTTTCCAGTTGGGGGTTCACTACTATGTGTTCGTTAGTACTGCCACTGCTTGAGAAGGTGAGAATGTGACACTCTTTTAGGC 

CACAAACTGCAAGGCAATCCTCATTGCACGGATTAGCAGCTAGGGAGATAACCGTGCAGGCAATGGGTGCAGAACTCAATTGCGTTAGAGTTAAC 

TTTCTCTTGGAGGCATCTGCTTGCTTCAACAGTGCCGACAGCTGGAGGATGGTCACCTTTCCCTTCTCATGGGATACAGCCAACTGTTGACGACG 

T C CAT G T GGC G AGG AC AAAC AGC AGAAGGC AAC ACGGC GG AC AAC T CC GG AT GAT AGT AGGT GCT T T AT AGT CTGACCTTGGTCG C C AGAAT AGT 

T C AT T C GAAC AT T C T C AAAAG CT C CCTCTTGT GAG CC C AAAGT T GC AAAC AT C AGT T GAT C GGTT AC C T GG AAGCT C TGT T C AAGC AGG T GG AGT 

CTCTCCAAGGCCGCAGTTGCTCTCTTGTGGCAGCCGACAATGGAGTATAGCATACAATTTTCGTTTATTGATGGTAGCAGCAGGTCAAAATACTC 

TAGGATGCATCTTACGACCAGCAGCCACTGATCCTGGTGCTGAAGTGTCTCTCTGTAAGGTTCCAACAGTTTGGTAAGTACTGCGATCCGTTCAT 

TAGTCAATGAATCCTTTCGCGTAAGTACTGGCTGGCCAACTGGTAAGCTGCTCCTCTTCTTTCCAGCGAGTAAGCTCATGTGGGACGGTAGATAG 

CTGCCAATTCCAGCTGAATCTCTAACCTCATTGCTGCCGATACGCCTACTCAATGCCTGACAGGATCCGTCCTCCTTGGCTCCGCAGTCACAGAA 

GAAATTTCCATATTTGGCGTAACTGACATCATGTCCCTTATGGCAAACCCTTGCGCACACAGAACAGACGCCAACTGTGTTTATCATGTTGCAAG 

TATGACAATGATACCAATGTTGGTTCATGAACTCCTTTTGGGTCTGCGAGAAGGTACACAGCTTGTTGTTCAACGTGTCCTCGTCGGAATCGTCA 

AGCAACGAGTCTTGCTCACCTTCCTCGGCTAGCTCGTCATATAGCTCATCAATATCGGTCTGTAGTTCATCATCCCACGGCGGGTTATGTTGCCT 

ACTGCCTCGGAAGCCAATGCTCTGAAGAAGTTCGCTTAAATAGCGCAACTGGGCACTTACGTTCTCTGCATGTTTCAAGTGTTTGTCCGGTAATT 

GCAATTTGGCCAGTTCCAGCCAAAGGAGGGTGGTATTGAACAAGGCGACATGGCCGCGTGCTGGACTGGCGTCAGCTAGAGTAATCATTATCTGT 

AGTACATCTGCAAAATCGCAGCCTTCCTGGGGTGGTGAAATTAATGTTTGTCCGAGCTGAAGCAAAGCTTGGAACAGTGGTGGAGCTATCTGCGA 

TGCTTTGCTTTGATCTTTCACCAATGACTTCACGAAGCTTAAGAGCAACGCGCCATTTCGTTCGCTATCACTATGATCGCAAGCCTGTGAAGATT 

CTTCCGACGGATTTGGCTGCGCAGCCTGCCACAAACTCAGAATCTGCGAGGTATTGGCTGCAGCTCCTCCGGTTCCTGGGTTACTTTCTCCGCCT 

CCCGCTGCAGTTCCGCAATCGTATTCAATTTCCATGGGATCCAACTGGTCTGAGATACTTGGCATAGCCGAAACGGTGGCCATATTTGTTGGCGT 

CTGAGCATTGCTGCTGGAGCTACTCGACGGCTGCAAAATGCTGGTGAGCATCCGGCAGGTGGTGTCTAATGCCTCGGAGGTGCTTACATCGGTAC 

TCACATTAGGGCCGTAGATAATGTGGCCCAACCATTGCTTTAAACGAGCTACGTCAACTGTGGCCAGTTCCGAGAAACACTCAACCAGATCTTCA 

TGCTGGAACTGTGAATCCGCTTGTAAGGATAGCTGGAAAAGTCGGTTAACAAATTGCAGCATCTTGCGCGCATAGCTAACAGGCAGACTGGTGCC 

AGTGAACGATAGCAAAAGTGTGGTAAGACTCCCAGAGCGCTGATCTTTGAAAAACTGGCGCATGGCCTGAACTGCCACCGGCTTTTCCAGCATTT 

TGAAC AT AGTTTT AAGC ACGGC AAACAGCATGGACTCGTT AAAC AT AGC ACGATCCTGATCAAGAGGAGCTCCGCCACTAGGCCCCGCAGACTGG 

GGCTGACGAAGATCGGCAGCAAACTCTTTGAGCATGCGATAGCAATATCCAACCATGTGCTCGTAAAGCAGCAGCTTTTGTGCCTCTTGTGTCAG 

ACCACTGAGGATCTCGCGCAGACAATGCTCGCTGTGTCCTTGCAAGTAATCCGTTCGCATGTCCAGCATTGAGGAGATAAGCAATTGCTTTAGCT 

GAGTCTCTACTTCTGCGTTCGCGGTGCGATGTGGTAGCAGCAACTCAAACAATTGCAGGATAGACTTTAAGCAGTGTTTCAGAGTGTGCAGGGTA 

GAGTAATTGGATCCTCGTGTCAGCTCAATAATGTGAGCATCAATAACGTTCAAAATATATGGTTCCACCGGATGTTGCTTTCCATTGAATGCAAA 

ATCATTGACCATTAGGCACTTTGGCCATTCGTTTGCTATTAGTTGTTTGTCCTCGTTGGGCAGGAGTTTTAGAAAAGCTGCATGGAGTTCTAACA 

TAATTGCTGGCTTATCGCTTACAATGGGCACAATCAAATTCAGTGCCTTCTCCGGTGTTGAACTTGACAACTGCTTTAGCATTTGGCCACGCACG 

ATCACCGAAAGCAATTGGTAAGAATTCATGAGCTTTACCATTAGCTCGTTTACAAGTTGATTATACTTTTGCAAATCACTTGACAGCTTACCGTC 

GTCGTCTTGTTTCAATTTTTGATCATCAATTGATTCGTTGGAAGAATAAAGTTCCGGATCGTTTGACGATGAAGTTAAGGATGTACCAGCGGACT 

GCTCGCCAGCGGCATCTTCGCTCTCTTTTCGCGGCCACTTGAGGCCCTCCTTGTTTGCCCAGGCTACCGCCTGCATGTGCGCCACCAAAGAATCA 

AAAAGGATGAGATCGGAGAGGCGGGGAGTTCCTGCAAAATGAATATTTTCAATACATTGTGGTTTGTTTTTAACCATATTATAAGTTTTTTAGTA 

CTTACCATTTCCATTCATTTGACTATTGAACGCCTCCACACCAAGAATTGTTTGCTTGAGACTATATTTTATATCGGCACCTATGTCGCCCACAT 

GAGTCCACAACGTCTCAGCGTACTGGGTATACATTCCCTGCTTGATTAGGCTGCTAACGACTAGGTAGTGAGAACTGGCCAGGGGTAGACGTATA 

CTCCACATCAGTGAGTGCAGATTGGGTGATCGCTCTGCCTTCAGCGACTCTACGTGAGAAGTAGAGGGCGGCAAACCCAAATTCAGCTTCCAGGC 

AATCGTAAAGCAATACTGTATAGCGCAAAGATTATGCATGCTCGCTTCCTTGTCCTTTTCCTTGGAATACATTGACATGTCGGTGATGGATAGTA 

TGTCAGTCAATGCATCCACCAGTAGCGGATATTTAAGCTTATCCGGTGTGCAGAGAATAAAGTTCCAAGCCAAACCATCCAGCTTAAATTCCTGA 

GCCCAGTAGGATATCTCGGGATCACCCATTACCAGAGAGTAAAAAGTTGGGCGAATTGCTAAAAAAAAAATCAAGATGGTTGAGTATCGCTGGAA 

TTGGAGATTAAATTCTCCACTTACATGAGCCATATTCGCCGGCCGACTTGCGGAAATTCTCTTCAATATCATGGCGGCAATAGGAATTCAGTTTG 

GTTCGGCAGTTTAGCAGATCCGTGATCTTTTGGGTGTCCGGCATGCTGCTCACACTAAAAAGATTCCAGCGCACCTGGTCTAGCAATTCCGGAGG 

CGCATTGTACAAGTGCTTCATAAGATACTCCAAAAAGAGCAGCAAGCGAGACAGCAGCATAATTTGGCTGTTTTTTACGGGTCTATCCTCGCTGA 

T T C C AC G AC AAAC T T C T GC G C AAC GG AT AAC AC C GC C AGCC GT T AAAAG C AG AAT G GAT TTCTTTTGCAT C AG AT T G AGC G AAT G AAAT AG G AA A 
AGCAGCAGTTGGGCGTGCTCAATGTTAAGGTCTTCATAGTCAGAATCACTGGCGGTATTCGAAGAACAAACTCCTTCCACCAACGTGTTAATAAG 
GCGGTGCCACACGGACAGGCAGGCGGCTTCCTTCTCTTGTGTTGGCTTCAGCAAAAGAATTTGCACCAGTACAGACAGAGTACTCGGATAAACTT 
GTAAAGGCCAGGTGTTTGTATCAGTAGACCATGGCGATATAGACAAGTGCTGCAGCAGATTTGACTGCAATTGCTCGGACAGCAATGAAGTGGAA 
ATCAAATTGTGAATATATCGGCCTGCAGCTCCCGAAAATCGAATCATCGACGTCTGCCACTTGGTAGAAGCTCCTGCAGAGGTTCCTGCTCCTGC 
ACTGGCAGCTGGGGCATTTCCCGACCCCTGATCGCTCTCATTTCGCATCACGTCGCGGTCAAAATCTTTGATTATATTTGCCATTAACAGCATTT 
GCTGATCACTAACGCCTGCCTTTACATAGCGCTGCATGTAGGCATGTCGATTGGCCAAAAATTGATCCAGGAAACAGAATATGTCTGCGGCCAAG 
TCTAAATACTCATGTGGCTCATCTAGCTTGGGGACAAGGGCGCTCTTCTGTTCAGCACGCTCTTGTGTGGATGTGTTTGCATTATCATCATGGGT 
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TTCTGGAGAAAGAGTTTCCTTAAACCAGTGGCCCAGGTAACTCTCACTATCGTCCTCCTCGGAGTTGTCCGAACAGGAGAGAGAATCGTTGAAAT 
AAGTCGTAGAATCTGAGTAGCTTATGGGTTCGCATTCCGTACGATCCTTGGCGGAAGCGCGCTTTAAGGTGCATGCCTTGCGATAAGCCACAGTG 
GCTAGAAAAGTGAAAAGTTGATGCAGAGTGGCTGTATTCAGCACGCGGACAATCCGCTCCAGCGAACTGTAGTTCTGCAAAATATCGAACTGTGC 
AGGCTCCGGGAGCTCTGTGGATTCGAGACTCTGTCCGCATGCGGCTTCAACCTTTAGATCTTCTATTAGCTCTGAGAGCAATTTAATGCTGTGGT 
TAGCAATAGCTGCATTGAGTTCTCCAAATCCTTGCTGTACCTTGAAAAGGTTGACTCTGGCAATAGGTGTAGTAGGTGCGGTTCCAGCCTCACTA 
CCAGACTTGGTCGCTTTTGACGCCGACTTCGATGCCGAATCCGAGCTAGATCCACCGCTTCCAGAGGCGCCCATTGCTCCTTGTATACCGCTAAT 
TAGCAGCCAGCTGCCATAGCAGAGATGGTTCTGATATACATGAATCCTGGCCGAAGTTTTGAACATGTGACCAATATTGGAGTAGATCTCTAAAG 
CTTTGTTCACTATGCTACAGGCCTGATCCTCAAAGTCGTCATGGTCTTTGTTGGATTGCCCACCACTAGAACCTCCTACTCCGCTTCCTTGACTA 
GTGGATGCAGTCTGTTTCTGAGAGCTGCTGGCCTGGCTCATACCGAGAACACTGGTGGCTATTGAGGTCAACACAGCATTGTATAGCGCAGCAAT 
ACCACACATTGATAGTCGTTCAATTCTACTGGGAGTCAGTGGCTGCATCTCGATCAGGGCTTGGGCCTGACTCAAAATAGTCAGATCAGTAAGCA 
TGTGATTCATGCTCGATTTCACTGCAGTGGCCTCCACTTGGGTCAGTGGCAAATAAAGCGATTTACCGCCGTTGATGGTCTCCATATATTTGGAT 
CGGTATCGATTGAGTATGGGCAGGCTCACACACACATCCAGCAGGGTATCCACTCCGTTTTGCTCCTGGAAGTACTCCACATTGGCCGCCAGGAT 

T T GT T C C T G AAAG AAGT AAG C T T T T AAT AT T GC G T AT CT C GT T T T T AAG AAAAT AT AAAT T T G C T C AT AGC T AAT AGT T AT AC AC AACT T C T AAA 
TTGATCGAGTAGGATAAATATTTTCATTAGATAGAATATTGCATTTTAGTTTAGTTGTGTAATATATCCGTACCTACTCAATTCAAGAGTAATTT 
ATAATAATAACAGACAAATATGACCGATCTTAGGGAAGCTATTAAAACAGGAGTTTAAGTTATTTGCTCACAGTGCAGTCGGTGACCTCCTCGCT 
GAGTGGAACCCAAGTCATCTTGCCGGGCTCTAAAGGAGCAATCAGTTGCTGCAGAATAACCGAGGACAGGTCCGACCGGGATCGCTTAATCTCCG 
ACTTGGGCGACTCCTTGTCGTTGCTGGCCCCACTTCCGGAGGAACTGCTGCTGGGCATAATGCTCTTGGTGGTCGACTCGGGATGTTTGCAGCTT 
TTAAGAACATTAGCCACCGCATTGTAGTCGAACATGGATGACGACGCTGATGCTGGGCACTCCCGGCCCTCGCAGAGCATCTTGAGGGCGCCCAG 
GAGCCACTTGGTCTCGTAGACGGACACCCTGGGCAGTCTATAGATAATAAACCGGATAAGCACGGCGGTGGCATCGTGCAACTGACAAATCTGCG 

T CT GGC AAAC T G AAAAG C AC C AAG AT GAT AAAT G T AGCGAAC G CG AAT T GGC GC AAGTG AAT C AC T C GAT T CT T GGC C AAAC AT TTCTTTGTTTT 
GTTTTCATTTGGCGAAAAATCATTAGGGTGCACTTACTTGTCTTGATTTGCATAAGTTTGTCCGCCGCCAGGGCCGCAAAGGCCGTGTAAAACTG 

C GT G AAAT T GC T T T C CT CCT C GAAAAAG T C AT GT T C GC T AT GC AAAT AT C GG AAT AG AGC GT T GT T AGT T CTC G AT AT T AAAG AATT T T C AAAGC 
TTTTTGCTATTTCGTCACCGGAGCGGCCATATTTAATTTTGCAAGTCACTTTTTGCACTAGCAGAAAATTTTTCACTACGCCATTCGGTAGGCGA 
AAAGTTCTAGATGATCGTTTGTACTTACCATCGCGTGATAGCTTTGAGCAAATTAACAACTTCGTTCTTGTTTAAGGCTCCTGTACGGTTAAGTA 
TCAGGGCTTTAACAACGGAATTCCAATCCGTTCCGCCGCTGTGGGCCGACATTATTGTTTAAATAATCTCGTCAATTTGCCCGATAGTACTGAGT 
ACTCGGCCTACTGTGTGGCCAGATTTCATTAACGGTAAAATGCCGTGCGAGCAGGGATACCACACAGCTTTAATCATTATTCCAATCAGCTGTTT 
TAAAAATACATAAAAAAGTAGTGAATAACCAATATATGAAAAAAACTGTACACTGCACAAAATATATATATATATATATATATAACTATAATATT 

T T AAAC GT AT T T GG AC AAGC T AAT AAAT GT T TTGTTTATTCGTCT AAAAT AT GT AAAGC AAAC T AGT G AT GC T AGC T T T AAAAT AAT GAT TAT T T 
AATTATTAAATATTAGCGACATATGTATATTTTTTTTGTTTAAATTATCTTCTAAACATCTTAATCGGCTGCAGGCTC 

(SEQ ID NO: 1339) 

Exon: 18222.. 18079 
Exon: 17897.. 17803 
Exon: 17679.. 17267 
Exon: 17011.. 14845 
Exon: 14783.. 14256 
Exon: 14186.. 8201 
Exon: 8143.. 7939 
Exon: 7889.. 7197 
Exon: 7135.. 5472 
Exon: 5415.. 4933 
Exon: 4875.. 2940 
Exon: 2884.. 1288 
Exon: 1224.. 1001 

Start ATG: 18197 (Reverse strand: CAT) 
Transcript No. : CT34171 

AATTGACGAGATTATTTAAACAATAATGTCGGCCCACAGCGGCGGAACGGATTGGAATTCCGTTGTTAAAGCCCTGATACTTAACCGTACAGGAG 
CCTTAAACAAGAACGAAGTTGTTAATTTGCTCAAAGCTATCACGCGATGCGAACATGACTTTTTCGAGGAGGAAAGCAATTTCACGCAGTTTTAC 
ACGGCCTTTGCGGCCCTGGCGGCGGACAAACTTATGCAAATCAAGACAATTTGCCAGACGCAGATTTGTCAGTTGCACGATGCCACCGCCGTGCT 
TATCCGGTTTATTATCTATAGACTGCCCAGGGTGTCCGTCTACGAGACCAAGTGGCTCCTGGGCGCCCTCAAGATGCTCTGCGAGGGCCGGGAGT 
GCCCAGCATCAGCGTCGTCATCCATGTTCGACTACAATGCGGTGGCTAATGTTCTTAAAAGCTGCAAACATCCCGAGTCGACCACCAAGAGCATT 
ATGCCCAGCAGCAGTTCCTCCGGAAGTGGGGCCAGCAACGACAAGGAGTCGCCCAAGTCGGAGATTAAGCGATCCCGGTCGGACCTGTCCTCGGT 
TATTCTGCAGCAACTGATTGCTCCTTTAGAGCCCGGCAAGATGACTTGGGTTCCACTCAGCGAGGAGGTCACCGACTGCACTGAACAAATCCTGG 
CGGCCAATGTGGAGTACTTCCAGGAGCAAAACGGAGTGGATACCCTGCTGGATGTGTGTGTGAGCCTGCCCATACTCAATCGATACCGATCCAAA 
TATATGGAGACCATCAACGGCGGTAAATCGCTTTATTTGCCACTGACCCAAGTGGAGGCCACTGCAGTGAAATCGAGCATGAATCACATGCTTAC 
TGATCTGACTATTTTGAGTCAGGCCCAAGCCCTGATCGAGATGCAGCCACTGACTCCCAGTAGAATTGAACGACTATCAATGTGTGGTATTGCTG 
CGCTATACAATGCTGTGTTGACCTCAATAGCCACCAGTGTTCTCGGTATGAGCCAGGCCAGCAGCTCTCAGAAACAGACTGCATCCACTAGTCAA 
GGAAGCGGAGTAGGAGGTTCTAGTGGTGGGCAATCCAACAAAGACCATGACGACTTTGAGGATCAGGCCTGTAGCATAGTGAACAAAGCTTTAGA 
GATCTACTCCAATATTGGTCACATGTTCAAAACTTCGGCCAGGATTCATGTATATCAGAACCATCTCTGCTATGGCAGCTGGCTGCTAATTAGCG 
GTATACAAGGAGCAATGGGCGCCTCTGGAAGCGGTGGATCTAGCTCGGATTCGGCATCGAAGTCGGCGTCAAAAGCGACCAAGTCTGGTAGTGAG 
GCTGGAACCGCACCTACTACACCTATTGCCAGAGTCAACCTTTTCAAGGTACAGCAAGGATTTGGAGAACTCAATGCAGCTATTGCTAACCACAG 
CATTAAATTGCTCTCAGAGCTAATAGAAGATCTAAAGGTTGAAGCCGCATGCGGACAGAGTCTCGAATCCACAGAGCTCCCGGAGCCTGCACAGT 
TCGATATTTTGCAGAACTACAGTTCGCTGGAGCGGATTGTCCGCGTGCTGAATACAGCCACTCTGCATCAACTTTTCACTTTTCTAGCCACTGTG 
GCTTATCGCAAGGCATGCACCTT AAAGC GCGCTTCCGCCAAGGATCGTACGGAATGCGAACCCATAAGCTACTCAGATTCTACGACTTATTTCAA 
CGATTCTCTCTCCTGTTCGGACAACTCCGAGGAGGACGATAGTGAGAGTTACCTGGGCCACTGGTTTAAGGAAACTCTTTCTCCAGAAACCCATG 

AT GAT AAT G C AAAC AC AT C C AC AC AAG AGC GT GC T G AAC AG AAG AGC GC C CTTGTCCC C AAGC TAG AT G AGC C AC AT G AGT AT T T AG A C T T GGC C 
GCAGACATATTCTGTTTCCTGGATCAATTTTTGGCCAATCGACATGCCTACATGCAGCGCTATGTAAAGGCAGGCGTTAGTGATCAGCAAATGCT 
GT T AAT GGC AAAT ATAATCAAAGATTTTGACCGCGACGTGATGCGAAATGAGAGCGATCAGGGGTCGGGAAATGCCCCAGCTGCCAGTGCAGGAG 
CAGGAACCTCTGCAGGAGCTTCTACCAAGTGGCAGACGTCGATGATTCGATTTTCGGGAGCTGCAGGCCGATATATTCACAATTTGATTTCCACT 
TCATTGCTGTCCGAGCAATTGCAGTCAAATCTGCTGCAGCACTTGTCTATATCGCCATGGTCTACTGATACAAACACCTGGCCTTTACAAGTTTA 
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TCCGAGTACTCTGTCTGTACTGGTGCAAATTCTTTTGCTGAAGCCAACACAAGAGAAGGAAGCCGCCTGCCTGTCCGTGTGGCACCGCCTTATTA 
ACACGTTGGTGGAAGGAGTTTGTTCTTCGAATACCGCCAGTGATTCTGACTATGAAGACCTTAACATTGAGCACGCCCAACTGCTGCTTTTCCTA 
TTTCATTCGCTCAATCTGATGCAAAAGAAATCCATTCTGCTTTTAACGGCTGGCGGTGTTATCCGTTGCGCAGAAGTTTGTCGTGGAATCAGCGA 
GGATAGACCCGTAAAAAACAGCCAAATTATGCTGCTGTCTCGCTTGCTGCTCTTTTTGGAGTATCTTATGAAGCACTTGTACAATGCGCCTCCGG 
AATTGCTAGACCAGGTGCGCTGGAATCTTTTTAGTGTGAGCAGCATGCCGGACACCCAAAAGATCACGGATCTGCTAAACTGCCGAACCAAACTG 
AATTCCTATTGCCGCCATGATATTGAAGAGAATTTCCGCAAGTCGGCCGGCGAATATGGCTCATCAATTCGCCCAACTTTTTACTCTCTGGTAAT 
GGGTGATCCCGAGATATCCTACTGGGCTCAGGAATTTAAGCTGGATGGTTTGGCTTGGAACTTTATTCTCTGCACACCGGATAAGCTTAAATATC 
CGCTACTGGTGGATGCATTGACTGACATACTATCCATCACCGACATGTCAATGTATTCCAAGGAAAAGGACAAGGAAGCGAGCATGCATAATCTT 
TGCGCTATACAGTATTGCTTTACGATTGCCTGGAAGCTGAATTTGGGTTTGCCGCCCTCTACTTCTCACGTAGAGTCGCTGAAGGCAGAGCGATC 
ACCCAATCTGCACTCACTGATGTGGAGTATACGTCTACCCCTGGCCAGTTCTCACTACCTAGTCGTTAGCAGCCTAATCAAGCAGGGAATGTATA 
CCCAGTACGCTGAGACGTTGTGGACTCATGTGGGCGACATAGGTGCCGATATAAAATATAGTCTCAAGCAAACAATTCTTGGTGTGGAGGCGTTC 

AAT AGT C AAAT G AAT GG AAAT GG AACT C CCCGCCTCTCC G AT C TC AT C C T T T T T G AT TC T TTGGTGGCG C AC AT GC AGG C GGT AGCCT GGGC AAA 
CAAGGAGGGCCTCAAGTGGCCGCGAAAAGAGAGCGAAGATGCCGCTGGCGAGCAGTCCGCTGGTACATCCTTAACTTCATCGTCAAACGATCCGG 
AACTTTATTCTTCCAACGAATCAATTGATGATCAAAAATTGAAACAAGACGACGACGGTAAGCTGTCAAGTGATTTGCAAAAGTATAATCAACTT 
GTAAACGAGCTAATGGTAAAGCTCATGAATTCTTACCAATTGCTTTCGGTGATCGTGCGTGGCCAAATGCTAAAGCAGTTGTCAAGTTCAACACC 
GGAGAAGGCACTGAATTTGATTGTGCCCATTGTAAGCGATAAGCCAGCAATTATGTTAGAACTCCATGCAGCTTTTCTAAAACTCCTGCCCAACG 

AGG AC AAAC AAC T AAT AGC AAAC G AAT GGCC AAAGT GC CT AAT GGT C AAT GAT T T T GC AT TC AAT GG AAAG C AACAT CC GGT GG AAC CAT AT AT T 
TTGAACGTTATTGATGCTCACATTATTGAGCTGACACGAGGATCCAATTACTCTACCCTGCACACTCTGAAACACTGCTTAAAGTCTATCCTGCA 
ATTGTTTGAGTTGCTGCTACCACATCGCACCGCGAACGCAGAAGTAGAGACTCAGCTAAAGCAATTGCTTATCTCCTCAATGCTGGACATGCGAA 
CGGATTACTTGCAAGGACACAGCGAGCATTGTCTGCGCGAGATCCTCAGTGGTCTGACACAAGAGGCACAAAAGCTGCTGCTTTACGAGCACATG 
GTTGGATATTGCTATCGCATGCTCAAAGAGTTTGCTGCCGATCTTCGTCAGCCCCAGTCTGCGGGGCCTAGTGGCGGAGCTCCTCTTGATCAGGA 
TCGTGCTATGTTTAACGAGTCCATGCTGTTTGCCGTGCTTAAAACTATGTTCAAAATGCTGGAAAAGCCGGTGGCAGTTCAGGCCATGCGCCAGT 
TTTTCAAAGATCAGCGCTCTGGGAGTCTTACCACACTTTTGCTATCGTTCACTGGCACCAGTCTGCCTGTTAGCTATGCGCGCAAGATGCTGCAA 
TTTGTTAACCGACTTTTCCAGCTATCCTTACAAGCGGATTCACAGTTCCAGCATGAAGATCTGGTTGAGTGTTTCTCGGAACTGGCCACAGTTGA 
CGTAGCTCGTTTAAAGCAATGGTTGGGCCACATTATCTACGGCCCTAATGTGAGTACCGATGTAAGCACCTCCGAGGCATTAGACACCACCTGCC 
GGATGCTCACCAGCATTTTGCAGCCGTCGAGTAGCTCCAGCAGCAATGCTCAGACGCCAACAAATATGGCCACCGTTTCGGCTATGCCAAGTATC 
TCAGACCAGTTGGATCCCATGGAAATTGAATACGATTGCGGAACTGCAGCGGGAGGCGGAGAAAGTAACCCAGGAACCGGAGGAGCTGCAGCCAA 
TACCTCGCAGATTCTGAGTTTGTGGCAGGCTGCGCAGCCAAATCCGTCGGAAGAATCTTCACAGGCTTGCGATCATAGTGATAGCGAACGAAATG 
GCGCGTTGCTCTTAAGCTTCGTGAAGTCATTGGTGAAAGATCAAAGCAAAGCATCGCAGATAGCTCCACCACTGTTCCAAGCTTTGCTTCAGCTC 
GGACAAACATTAATTTCACCACCCCAGGAAGGCTGCGATTTTGCAGATGTACTACAGATAATGATTACTCTAGCTGACGCCAGTCCAGCACGCGG 
2 ; ni CCATGTCGCCTTGTTCAATACCACCCTCCTTTGGCTGGAACTGGCCAAATTGCAATTACCGGACAAACACTTGAAACATGCAGAGAACGTAAGTG 
t I CCCAGTTGCGCTATTTAAGCGAACTTCTTCAGAGCATTGGCTTCCGAGGCAGTAGGCAACATAACCCGCCGTGGGATGATGAACTACAGACCGAT 
V * ATTGATGAGCTATATGACGAGCTAGCCGAGGAAGGTGAGCAAGACTCGTTGCTTGACGATTCCGACGAGGACACGTTGAACAACAAGCTGTGTAC 
Q CTTCTCGCAGACCCAAAAGGAGTTCATGAACCAACATTGGTATCATTGTCATACTTGCAACATGATAAACACAGTTGGCGTCTGTTCTGTGTGCG 
S SB s CAAGGGTTTGCCATAAGGGACATGATGTCAGTTACGCCAAATATGGAAATTTCTTCTGTGACTGCGGAGCCAAGGAGGACGGATCCTGTCAGGCA 
TTGAGTAGGCGTATCGGCAGCAATGAGGTTAGAGATTCAGCTGGAATTGGCAGCTATCTACCGTCCCACATGAGCTTACTCGCTGGAAAGAAGAG 
GAGCAGCTTACCAGTTGGCCAGCCAGTACTTACGCGAAAGGATTCATTGACTAATGAACGGATCGCAGTACTTACCAAACTGTTGGAACCTTACA 
i a a GAGAGACACTTCAGCACCAGGATCAGTGGCTGCTGGTCGTAAGATGCATCCTAGAGTATTTTGACCTGCTGCTACCATCAATAAACGAAAATTGT 
ATGCTATACTCCATTGTCGGCTGCCACAAGAGAGCAACTGCGGCCTTGGAGAGACTCCACCTGCTTGAACAGAGCTTCCAGGTAACCGATCAACT 
Vi GATGTTTGCAACTTTGGGCTCACAAGAGGGAGCTTTTGAGAATGTTCGAATGAACTATTCTGGCGACCAAGGTCAGACTATAAAGCACCTACTAT 
P CATCCGGAGTTGTCCGCCGTGTTGCCTTCTGCTGTTTGTCCTCGCCACATGGACGTCGTCAACAGTTGGCTGTATCCCATGAGAAGGGAAAGGTG 
^5 ACCATCCTCCAGCTGTCGGCACTGTTGAAGCAAGCAGATGCCTCCAAGAGAAAGTTAACTCTAACGCAATTGAGTTCTGCACCCATTGCCTGCAC 
GGTTATCTCCCTAGCTGCTAATCCGTGCAATGAGGATTGCCTTGCAGTTTGTGGCCTAAAAGAGTGTCACATTCTCACCTTCTCAAGCAGTGGCA 
^ GTACTAACGAACACATAGTAGTGAACCCCCAACTGGAAAATGGAAACTACATCAAGAAAGCTGTGTGGCTGCCTGGCTCACAGACTCTTCTTGCC 
rfl GTTGTAACTTCGGATTATGTAAAGATTTACGATTTGGCAGTGGATACATATAGCCCAAAGTACTATTACCTAGTGGCCGTTGGCAAAATTAGAGA 
TTGCACGTTTGTGTACCAGGATGGCAACTACAATATGCTGACCTTTGCAAGCAGCGGATACATCTATACTCAGCAGTTGGATCAACAGAGTCTAG 
CGGTTCATGGAGATTTCTATGTAACCAACACTTTGGAGCTGAGCCACCAACATATCAAGGATATTGCCGGTCATATCGGAGGCGGTGGAGTATCC 
O ATCTACTATTCGCATACCCTACAGCTGCTCTTCTACAGTTATTCTTGCGGAAGGAGCTTCTTTTCACCACTCACCAATGTAAGTGAAGGCGTGAA 
GGGAATATATCACCTGGACACAAATTCGGCTAGTAAGTCAGCTTCGAAGGGACCACTACAGCCTCTCGTGCAGTGGACAGAGGTTACAGGGCATC 
CAGGATTGGTTTACGCCAGCATGCAGACATCGAATAATCCGATTATATTGATGATAACCCCTGAACGGATTTACCTGCAAGAGATCAAGGCACAA 
TCGGCCAAGTCACGAATCATGGACGTTGTTGGCATTCGTCATGCGGTTGCCGGAGTGGAAAAAACTACGCTTTTGCTCCTCTGCGAGGATGGAAG 
CTTGAGGATCTTTTCCGCTCAACCAGAATACACTAGCTTCTGGCTATCGCCACAGGTTCAGCCATTTGGTAATCAGTTGTACTCGTCCACCTTGA 
TGGCTAAGGGAGGTGGATCTGGTTCGACTTCAAAGTCCAAATCAAATACAGCATCTGGAAAGATGACCAGCAGAAAGGCAAGCCAACAACAGAAG 
CAACCCACAGCTGGAGGTCAGCCCGTTTTCCCAATTGACTTTTTCGAGCACTGCAACATGCTAGCCGATGTGGAATTTGGAGGCAACGATCTGCT 
CCAGATTTACAACAAACAAAAACTAAAGACGCGTCTGTTCTCAACGGGAATGTTTGTGGCCAGCACCCGATCGAATGGCTTTACTCTGGAGGTGA 
TCAATAATGATCCCAATGTTGTAATCGTGGGAATAAGAGTGTTAATTGGAACTCAGGACGTACAGCGTGCTCCCCAATCTGTGACTATTCTGGGA 
CGTACTATACCCACACCGGTCAGGAGGGCACGTTGGTTCGATATCCCCCTGACCCGGGAAGAAATGCTCCAGTCCGACAAGTTATTGAAAGTCGT 
GTTTGCTAAAGCTCCAGATCCGGAGCATGTGACATTACTGGATTGCATTGAGGTATATGGAAAATCGAAAGAGCTGGTGGGCTGGCCCGACGAAA 
GCGAGGATGTTACTGTCCCAGGATCCAGTGCGCCAGCAGTATCTTCCTCCCAGGCAAGTAGCGCTAACTTTGGAGAAGGATTCAATTGCATTACT 
CAGCTGGATCGAATGGCTAATCACCTTTTGGAAGTAATGGATTGTGCACTGCATTTGTTGGGATCTGGAGTACCTGCTTCTATGCGTCAAAAGGC 
AGTTAAAACAGCGAGTGCCCTATTGCTACTGCCAACGCCCAATCCTGTTCAGACCCAGGCGCGCTACGTGCTGGCCACTCTATACGGAACTAGAG 
CATTGTACCATAACTACAAGGATGGAGTGCTTCTTCAGTTTGTAAATAGAGAACTACAGTCAATGCAGCCGAAGCTTGAAAAACTTGAAACTCTG 
CGGGAGATCGACCCTGAGGCTTTCTACCGCCTAGTTCTGATGGTCCGTGGCATTGCCAACGCACGACCCCAATCGCTAGCGAAGATCTGCGTGGA 
GAACAACTATGATATTGTTCCAACCCTCATGGGAATTGTCTTGGAGTTGCATAAGGTTACACCGACGCTAGACGAACCCGTGAATATTGTGAAGC 
GAGGCTTGTGCCAGCCGGAGACCATTGTCCATTGTCTGGTAGAGATAATGTACGGATTTGCCTTGGCCGATCCCGGACAGGTTGGTCGCATGACC 
AAGTACTTCATTGATTTGCTGAAGCATGATGCTAGCGTGATTAGCCACAGCGCCAAAGAGGCCCTGATCCTTTTGCTTAGTCCAAGAATGAAGCG 
CAGAAAGGTGGCCATAGTTACACCCCCAGCATGTTCAACACCGACTCCCTCAACGTCTACCATGCAGGCACTGCAAGCAGCGGCTTCTTCAGCTG 
CAAGTGATATCATCGAGGAAGCAGCTGGCGTTGTTGTTGATGGGTCAGTTGGAGGAGGTGGATTACCCGAACCGAATGCGGACGCGGAAGGTGCT 
GCAGTTGGAGTAGGAGGAGTCGGTCAGCAGCAAATGCTCAATCTGGAGGCATTCATGGGTGGCGGTTTCCCTCGCTTACTCGGCCTACCCGAAGA 
TGCCGACGATGAGGCCATCATGGATATTGCCATCGCATTGAGTCTCCAACAGCACGGTGGGGATGCAAACGCCTTGCAGTCTCTTCAGCAAGGAT 
T GGCC AATCTTCAGGGCATCCGTCAAGCTACTGCCATGGC AGC AGCTGTGAACGCAGCTGCCAATGTCTCCCTAGGTGGATCCGATGAT GAT GAA 

FIGURE SHEET 714 



GGTTCCAATGTGGCAACCGATGGATCAACATTGCGTACTTCTCCAGCAGAGCCTGCTGGCAGCGGTGGATCAGAGAGTGGAGGCAGTGGAGTAGA 
GAGTATTGGTGGAACATCCGCCAGGAGCAGCAACTTTGGCGACCACGCGAATGCCTCGCCGCCGCGTCAGGGATCCACCAAGGACGATCAAGAGC 
AGCCAGGTCCCAGTGGCGTTGCTGGCTCCGGTGGTGTTGCTGTTTTGAGTGCCATGAGCAGCAGTGAGGATAATGAAGCAAACGAGGACGATAAG 
CTCAGCAAGCTACATGACCTTAGAATTGCTGTGCTGGAGAGCATTATCCAACATCTAGGAACATTCGACTTGTGCAATGGCCTGCAGGCAATCCC 
ACT GATCCAGGTTATACTCATGTTGACTACCGATCTCAATGGAAACAACGAAAGGGATCAACAAGTGCTTCACGATCTGCT TACT GCTCTAGTGG 
ACTATGTGGAAATTGGAAAGCGAGGAGCTGCCGCAAGGATGGAGACAAAGTGTCCTGGCAATGAGGTGCGCCTCGCATTGCTTTCCCTGTTCGGA 
GTTCTGATGGGCAAAACAAAGTCAAAGCAAACTGGAACAACATCGCCGCCGCATCAATTTAAGGACAATAGCTCCTTTGTGGCCAGTACCACAGC 
AAATGTTCTCAGTAAGAGTGGAGCATTCGTTTACGCCCTAGAAGCACTAAACACGCTACTGGTGCACTGGAAGAATGTGCTAGGAGATCCTTATG 
CAGCCGGTGGCGGATTAGCCTCACAGTCGGCACAAGCCAGTGGCGGAGCTTCAGGTCCCGGAGTTCAGCTACTGAAGCCCATTAAACATGGTCCT 

AAAC C T G AC AT ATC CAT ACT TAT AC C GC AC AAC T AT C T G AAAAAC T AC C C G GAT AT T TT C G AGT C C T ACG AT G GAC T C C T AACT G AAAT T AT C GT 

GCGATTGCCATATCAAATTCTGCGCTTAAGTAGCGCCCATCCGGATAACTACGATAGTGGATTCTGCGAAGCGATGACCTTTACACTTTGTGAGT 

ACATGATGCTAAATTTGAACACACTGCTTCGTCGGCAAGTGCGAAAGCTCCTGATGTACATTTGCGGCAGCAAGGAAAAGTTCCGCATGTATCGC 

GACGGTCACTCCCTCGACGCTCATTTCCGGGTGGTCAAGCGGGTGTGCAACATTGTCAGCAGCAAGACTGGTGCTCCTTACAACGCTAATCCTCC 

TATGCTTTCATACGATGCACTGGTAGAGTTGACGGAACACCTACGTACTTGCCAGGAGATAAGCCAAATGCGCACGGGCAACTGGCAGAAGTTCT 

GCGTCGTCCATGAAGATGCGTTGGCCATGCTGATGGAAATCGCCTGTTATCAGTTGGATGATGGCGTGTCGCCTATTATTATACAATTGCTACAG 

GCGGCCGTTTGCAACTTACCCCCTCCTAGTGGATCTAAGCAGGCCCAGCCGCAGCCATCGACCTCGTCTGCCTCCGGAAAACTTCGAACAGATCG 

TGAGAAGAGCGAGGATACAGACGCCTACTACTCCAAGTTTGATCCAGCCCAGTGTGGTACATTTGTGCACCAGATCTTTAGATACGCATGCGATG 

CGCTGATAATCAGATTTGTGCGAATATTCCTGTTGGAGAACAACATTACTCAGCTGAGATGGCAGGCCCACTCATTTATGACCGGACTTTTTGAG 

CATGCCAATGAACGGCAGCGTGAGAAACTTCTCAACATATTTTGGAACCTGTGGCCATTGGTGCCCACCTACGGTCGTAGGACAGCGCAGTTTGT 

CGACCTATTGGGGTATCTAACCCTAAGTACCAGAAGTATAACAGAACGACTGCCAGAGTTCGTTTCCCGCGCTGTGGATGTTCTCAGGCAGCAGA 

ACGAACTGCTTTGCAAGCATCCAAATGCCCCAATATACACCACCTTGGAGTCTATACTTCAAGTAAATGGTTACTACCTAGAATCGGAACCTTGT 

TTAGTGTGCAACAATCCAGAGGTGCCGATGGCTAATATAAAGTTGCCATCAGTTAAATCAGACTCTAAGTACACCACAACCACAATGATTTATAA 

GCTAGTCCAATGCCATACTATTTCCAAGTTGATCGTGCGGATTGCCGACTTAAAGCGGACAAAGATGGTGAGAACCATTAATGTTTACTACAACA 

ACCGCAGTGTCCAGGCGGTAGTGGAGCTAAAAAATCGCCCAGCTCTGTGGCACAAAGCTCGCTCAGTATCCCTGCAATCGGCACAGACCGAACTG 

AAGATCGATTTTCCACTACCAATCACCGCATGCAATTTGATGATCGAGTTTGCTGATTTCTTTGAAACTGTGAGTGGATCCAGTGAGAATTTACA 

ATGTCCACGTTGTAGTGCAGCAGTGCCGGCTTATCCGGGAGTCTGCGGAAATTGCGGTGAGAACGTGTTCCAGTGCCACAAATGTCGGGCCATTA 

ACTACGATGAGAAGGACCCGTTCCTGTGCCACTCCTGTGGCTTCTGCAAATATGCCAAATTTGATTTCAGCATGTATGCCCGTGTTTGCTGCGCT 

GTTGATCCTATAGAATCGGCAGAGGATAGGGTCAAGACTGTTTCACTGATTCACAGCTCCCTAGAACGAGCCGATAGAAATTACCGCCAGCTGCT 

GACCAACAAGCAAATGCTGGAGCTTCTTATTCAAAAGGTGGCCGAGCATCGTAGCAGTGATCGAATGGTCGAGGACAACATGGCCAGCGTGCACA 

GCACTTCTCAAGTAAACAAGATTATACAGCTGCTCGCCCAGAAATACTGTGTGGAATCGCGCACCAGTTTCGAGGAGCTAAGCAAGATCGTTCAG 

AAGGTCAAGGCATGTCGAAGCGAACTAGTGGCCTACGATCGTCAACAGCAGGACCAGCCGCCGGTTAATCCCGGCTCAACAACCGGTGCCGAAAA 

TCCCACCACAAATCGCTGCTACGGCTGTGCACTGGCATCTACCGAGCAGTGTCTGACGCTTCTGCGAGCCATGGCTTACAATTACGATTGTCGCG 

TGTGCTTGTATAGTCAGGGACTCGTCAGTGAACTGGCTGAGCACAACCTGCGACGAGGCACTCCCCTAATTCAGGAGGAAGTGCGCAACCTGTTG 

GTCGTACTGACTAAGGATAATGCCGAGGCATGCATGCATCTGTTGCAGTTGGTGACCACAAGGGTGAAGAATGCTCTGATGGGATCGATTCCTCT 

GATAAGTTTAGAGGCAGCTGTGCACCAAGAAATGACACTTTTGGAGGTGCTCTTGGGCCAGGACGACATCTGCTGGGAATACAAGCTAAAGGTCA 

TCTTTGAGCTGTTCATTAGCAACTGCCGTTTGCCTCGTGGTCCGGTAACAGCAGTACTACACCCCTGTCTGCGCATCATGCAGAATCTCATCTGC 

CCGGTTCTGCCCGGCAGCAAGCCCAATCAGAAGGTGGCCACTACAGATTTGTGTAGCATGAAGATGTTTGAGGGCAACACGGTGGACTATAGGGC 

CTGGCTGAACTCAGACCGTAATCATGAATACGCAGCGTGGAGCAAGCGGATGCCGTCAAACAACCAGGCTAAGCTAAAGAATGCCAAGGATCAGA 

ATGTGGCGGCATCCGGCGGATCAGATGCACCGCCGAAGAGCAGGAGGGAGGTGCGAGTCGCATTCCTTAGCGAAAAGTACGGAAAACGCTGGCGG 

GAACGTGTGCTGGACAAGCAACGAGTGATTAAACCTCTTGTATTCAACGCCAAGTGGATACAACCGTTGCTATTTAATGCAAACTCCAGATTCGG 

TCGCCAGCTGGCCTGCTCCTTGCTGAGCAGCTTGAGCCGAACTAACGAACGGCGGCAACAGGCTCTGAATATGCTGACATCGTTCTTGAAACATG 

TAGGCGAAGCAGGGGAAGCCAGTGCAGAGTACTTAATGCTGTACAAGAACATGGCCACAGAGCAACCCTGGCTGCAGTATTTGGTCCTGAAAGGA 

GTATTAAGTCAAATCTCACAGCTTCTGGCTATTGAAATCAGCAAGGTGCACCGCATGGAGGAGTACTCGCTTTCTTCAGATTTATCCTTAGGATA 

TGCACTGCGCCAGTATGTGGAATTGTTGTGGCTACTCCTAGAGTGTCCAAATATCCGCAGAACCTACAAGACTCGCATGCTAGGACCAGTTCTGG 

AAAGCTATCTTGCCCTAAGAAGCTTAGTCGTGCAGCGTACGCGACTCATTGACGACGCACAGGAGAAGCTGTTAGAGATGCTAGAGGATATGACC 

AGCGGAACCGAGGAGGAGACACGTGCCTTTATGGAAATCCTTATTGACACCGTAGAGAAGACCCGCATGAATGACATTAAAACACCGGTGTTTGT 

C T T C GAG CGCTTATATT C CAT TAT T CAT C C C GAG GAGC AT GAC G AAAGC G AGT T C TAT AT G AC ATT AG AGAAAG AT C C CC AG C AG G AGG AC T T T C 

TGCAAGGTCGCATGCTGGGTAATCCTTACCCCTCGAGTGAAATGGGATTGGGTCCATTGATGCGGGATGTGAAGAATAAGATTTGTACGGATTGT 

GAACTAATCGCCCTGTTGGAAGATGATAACGGCATGGAATTGCTGGTCAACAACAAAATAATAAGCTTGGATCTGCCCGTTAAGGATGTTTACAA 

GAAAGTTTGGTTAGCTGAAGGAGGCGACCGAGATGCCATGAGAATTGTATATCGCATGCGCGGTCTCCTGGGAGACGCCACAGAGGAATTTGTGG 

AAACGCTGAACAACAAAAGCCAGGAGCAGGTGGACACCGAGCAGCTGTATCGCATGGCCAATGTGCTGGCTGATTGCAACGGCCTACGCGTAATG 

TTGGAAAGAATTGGCAGCCTGCAGCGCATATCGCGCAACCGAGAACTTATCCAGGTACTGCTTAAGCTATTCCTTATTTGTGTGAAAGTGCGCCG 

TTGCCAAGAGGTGCTTTGCCAACCCGAGATCGGAGCCATCAACACGCTGCTTAAGGTGCTACAAATGTGTCTCCAGTCGGAGAACGATTCGATTC 

AATCGGCGGTTACCGAGCAGTTATTGGAGATCATGGAAACAATACTATCGAAGGCGGCCAGCGATACGTTGGACTCGTTCCTGCAGTTCTCCTTG 

ACATTCGGTGGACCGGAGTATGTAAGCGCACTAATATCGTGCACAGACTGTCCCAATGTTAGAAACAATCCTTCGGTGCTCCGCCATCTTATTCG 

TGTCCTAGCCGCTTTGGTCTATGGCAACGAGGTGAAGATGGCTCTACTTTGCGAGCATTTCAAGGACACTTTGAACTTTAATCGCTTTGACAACG 

AGCGCACACCTGAAGAAGAGTTCAAGCTGGAACTCTTCTGCGTGCTGACTAACCAAATTGAGCACAACTGCATTGGCGGCACTCTCAAGGACTAC 

ATAGTAAGTCTCGGCATCGTTGAGCGATCGCTAGCCTACATCACCGAGCATGCGCCTTGTGTCAAGCCCACTCTCCTGCGAACCGACTCCGATGA 

GCTGAAAGAATTCATCTCGCGGCCAAGCTTGAAGTATATTCTTCGTTTCCTCACCGGCCTCTCAAATCATCATGAGGCAACTCAGGTGGCTATTA 

GCAAGGACATTATACCCATTATTCATCGCCTGGAACAAGTATCCAGCGATGAGCATGTGGGCAGTCTGGCCGAGAACCTTTTGGAAGCTCTCTCC 

ACCGATTCGGCGACAGCAGCGAGGGTGCAGCAAGTGCGTGACTTCACAAGGGCAGAGAAGAAGCGGCTGGCAATGGCCACGCGCGAGAAACAGTT 

GGATGCATTGGGAATGCGCACGAATGAAAAGGGCCAGGTGACGGCCAAGGGCTCCATCCTGCAGAAGATCGAAAAACTGCGCGACGAAACAGGAC 

TAACTTGCTTTATTTGTCGCGAAGGCTACGCCTGCCAGCCAGACAAGGTGCTTGGTATCTACACCTTCACCAAGCGCTGCAATGTCGAGGAGTTT 

GAGCTGAAGTCGAGAAAAACCATTGGCTATACCACCGTCACACACTTCAATGTGGTCCATGTGGACTGCCACACATCAGCCATAAGATTGACACG 

TGGCAGGGATGAATGGGAGCGTGCTAGTTTGCAAAATGCCAACACCAGATGTAATGGACTGCTGCCCCTTTGGGGACCCGCCGTTGGAGAGGCGG 

CCTTCTCCGCTTGCATGACCCGCCACAGCAGTTATATGCAGGAGAGTACCCAGCGCTGTGACATCTCCTACACCAGCAGTGTGCATGACTTGAAG 

CTGCTGTTGGTACGATTTGCTTGGGAACGCAGCTTCCATGACGATGCCGGCGGAGGTGGTCCGCAATCGAACATGCACTTCGTACCCTACCTGCT 

CTTCTACTCCGTCTATCTGTTGCTTTCGTCGCGATCAGCGGCTAGAGACAGCAAGACTCTGCTGACGTATTTGCAGGCGCCGCCCAGCGAGAAGT 

GGTTGGAATGCGGATATGAGGTGGATGGTCCGCTTTTCATGGCAACCATTTCGCTGTCACTGCACAGCCGAGAGCTATGGAATAAGCATAAGCTA 

GCACATCTGAAGAGGATGATTGCAGTGGCCCAGGGTCGCCACGTCTCACCCGCCGTGCTGTGCAAAGCCTTGTTGGCTCCTGCAGACCGCCAGGT 

CAAGGATTACACTGTTTACAAGCCCTTCCTTATGATGTGGGCATTGGTGGACTTGATCTACGACAATCTGTTCAAGACGGTAAGCACGCCCAAGG 
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AGGAGGATTGGCCGATTTCCCTGTTCGACTATCTTCGTAAGAACGATGAGGCTTTGCTCAAGTCCACGGATAGCATCCTGCAGACCCTAACGGAG 
GAGTTCCTGCCCTGCACTTCCTTTGTCGAATTCTGTGATGTTGCAGGTCTCCTGCATCTCATCGAGCACCCGGACAACTTCATCGAAGAAATCCT 
CGCTGCCCTGCCTTCAACATCGAGCTCTAATTAAAATGCAATCGCTGTTTCTAACCAAATAAATTGCAGAACTTTAAGCGATTGCTGCGTATATA 
TAAACATACAGCATAAATTTATATTTAACACATCCTAAACATTTAATCAATAAAAATTAATCGTTTGCACAATGTAGTGA 

(SEQ ID NO: 1340) 

Start ATG: 26 {Reverse strand: CAT) 
MSAHSGGTDWNSVVKALILNRTGALNKNEVW 

PRVSVYETKWLLGALKMLCEGRECPASASSSMFDYNAVANVLKSCKHPESTTKSIMPSSSSSGSGASNDKESPKSEIKRSRSDLSSVILQQLIAP 
LEPGKMTWVPLSEEVTDCTEQILAANVEYFQEQNGVDTLLDVCVSLPILNRYRSKYMETINGGKSLYLPLTQVEATAVKSSMNHMLTDLTILSQA 
QALIEMQPLTPSRIERLSMCGIAALYNAVLTSIATSVLGMSQASSSQKQTASTSQGSGVGGSSGGQSNKDHDDFEDQACSIVNKALEIYSNIGHM 
FKTSARIHVYQNHLCYGSWLLISGIQGAMGASGSGGSSSDSASKSASKATKSGSEAGTAPTTPIARVNLFKVQQGFGELNAAIANHSIKLLSELI 
EDLKVEAACGQSLESTELPEPAQFDILQNYSSLERIVRVLNTATLHQLFTFLATVAYRKACTLKRASAKDRTECEPISYSDSTTYFNDSLSCSDN 
SEEDDSESYLGHWFKETLSPETHDDNANTSTQERAEQKSALVPKLDEPHEYLDLAADIFCFLDQFLANRHAYMQRYVKAGVSDQQMLLMANIIKD 
FDRDVMRNESDQGSGNAPAASAGAGTSAGASTKWQTSMIRFSGAAGRYIHNLISTSLLSEQLQSNLLQHLSISPWSTDTNTWPLQVYPSTLSVLV 
OILLLKPTQEKEAACLSVWHRLINTLVEGVCSSNTASDSDYEDLNIEHAQLLLFLFHSLNLMQKKSILLLTAGGVIRCAEVCRGISEDRPVKNSQ 

TMLLSRLLLFLEYLMKHLYN^ 

AOEFKLDGLAWNFILCTPDKLKYPLLVDALTDILSITDMSMYSKEKDKEASMHKLCAIQYCFTIAWKLNLGLPPSTSHVESLKAERSPNLHSLMW 
SIRLPLASSHYLVVSSLIKQGMYTQYAETLWTHVGDIGADIKYSLKQTILGVEAFNSQMNGNGTPRLSDLILFDSLVAHMQAVAWANKEGLKWPR 
KESEDAAGEQSAGTSLTSSSNDPELYSSNESIDDQKLKQDDDGKLSSDLQKYNQLVNELMVKLMNSYQLLSVIVRGQMLKQLSSSTPEKALNLIV 
PIVSDKPAIMLELHAAFLKLLPNEDKQLIANEWPKCLMVNDFAFNGKQHPVEPYILNVIDAHIIELTRGSNYSTLHTLKHCLKSILQLFELLLPH 
RTANAEVETQLKQLLISSMLDMRTDYLQGHSEHCLREILSGLTQEAQKLLLYEHMVGYCYRMLKEFAADLRQPQSAGPSGGAPLDQDPJ^FNESM 
LFAVLKTMFKMLEKPVAVQAMRQFFKDQRSGSLTTLLLSFTGTSLPVSYARKMLQFVNRLFQLSLQADSQFQHEDLVECFSELATVDVARLKQWL 
GHIIYGPNVSTDVSTSEALDTTCRMLTSILQPSSSSSSNAQTPTNMATVSAMPSISDQLDPMEIEYDCGTAAGGGESNPGTGGAAANTSQILSLW 
OAAOPNPSEESSQACDHSDSERNGALLLSFVKSLVKDQSKASQIAPPLFQALLQLGQTLISPPQEGCDFADVLQIMITLADASPARGHVALFNTT 
LLWLELAKLQLPDKHLKHAENVSAQLRYLSELLQSIGFRGSRQHWPPWDDELQTDIDELYDELAEEGEQDSLLDDSDEDTLNNKLCTFSQTQKEF 

MN OHW Y HCH T CNM I NT VG VC S VC ARVCH KGH D V S Y AK YGN F FC DC GAKE D G S C Q AL S RR I GS N E VRD S AG I G S YL P S HM S L L AGKKRS S L P VGQ P 

VLTRKDSLTNERIAVLTKLLEPYRETLQHQDQWLLVVRCILEYFDLLLPSINENCMLYSIVGCHKRATAALERLHLLEQSFQVTDQLMFATLGSQ 

EGAFENVRMNYSGDQGQTIKHLLSSGVVRRVAFCCLSSPHGRRQQLAVSHEKGKVTILQLSALLKQADASKRKLTLTQLSSAPIACTVISLAANP 

CNEDCLAVCGLKECHILTFSSSGSTNEHIVVNPQLENGNYIKKAVWLPGSQTLLAVVTSDYVKIYDLAVDTYSPKYYYLVAVGKIRDCTFVYQDG 

NYNMLTFASSGYIYTQQLDQQSLAVHGDFYVTNTLELSHQHIKDIAGHIGGGGVSIYYSHTLQLLFYSYSCGRSFFSPLTNVSEGVKGIYHLDTN 

SASKSASKGPLQPLVQWTEVTGHPGLVYASMQTSNNPIILMITPERIYLQEIKAQSAKSRIMDWGIRHAVAGVEKTTLLLLCEDGSLRIFSAQP 

EYTSFWLSPQVQPFGNQLYSSTLI^KGGGSGSTSKSKSNTASGKMTSRKASQQQKQPTAGGQPVFPIDFFEHCNiyiLADVEFGGNDLLQIYNKQKL 

KTRLFSTGMFVASTRSNGFTLEVINNDPNVVIVGIRVLIGTQDVQRAPQSVTILGRTIPTPVRPARWFDIPLTREEMLQSDKLLKVVFAKAPDPE 

HVTLLDCIEVYGKSKELVGWPDESEDVTVPGSSAPAVSSSQAS SANFGEGFNCITQLDRMANHLLEVMDCALHLLGSGVPASMRQKAVKTASALL 

LLPTPNPVQTQARYVLATLYGTRALYHNYKDGVLLQFVNRELQSMQPKLEKLETLREIDPEAFYRLVLMVRGIANARPQSLAKICVENNYDIVPT 

LMGIVLELHKVTPTLDEPVNIVKRGLCQPETIVHCLVEIMYGFALADPGQVGRMTKYFIDLLKHDASVISHSAKEALILLLSPRMKRRKVAIVTP 

PACSTPTPSTSTMQALQAAASSAASDIIEEAAGVVVDGSVGGGGLPEPNADAEGAAVGVGGVGQQQMLNLEAFMGGGFPRLLGLPEDADDEAIMD 

IAIALSLQQHGGDANALQSLQQGLANLQGIRQATAMAAAVNAAANVSLGGSDDDEGSNVATDGSTLRTSPAEPAGSGGSESGGSGVESIGGTSAR 

SSNFGDHANASPPRQGSTKDDQEQPGPSGVAGSGGVAVLSAMSSSEDNEANEDDKLSKLHDLRIAVLESIIQHLGTFDLCNGLQAIPLIQVILML 

TTDLNGNNERDQQVLHDLLTALVDYVEIGKRGAAAPJyiETKCPGNEVRLALLSLFGVLMGKTKSKQTGTTSPPHQFKDNSSFVASTTANVLSKSGA 

FVYALEALNTLLVHWKNVLGDPYAAGGGLASQSAQASGGASGPGVQLLKPIKHGPKPDISILIPHNYLKNYPDIFESYDGLLTEIIVRLPYQILR 

LSSAHPDNYDSGFCEAMTFTLCEYmLNLNTLLRRQVRKLLMYICGSKEKFRMYRDGHSLDAHFRVVKRVCNIVSSKTGAPYNANPPMLSYDALV 

ELTEHLRTCQEISQMRTGNWQKFCVVHEDALAMLMEIACYQLDDGVSPIIIQLLQAAVCNLPPPSGSKQAQPQPSTSSASGKLRTDREKSEDTDA 

YYSKFDPAQCGTFVHQIFRYACDALIIRFVRIFLLENNITQLRWQAHSFMTGLFEHANERQREKLLNIFWNLWPLVPTYGRRTAQFVDLLGYLTL 

STRSITERLPEFVSRAVDVLRQQNELLCKHPNAPIYTTLESILQVNGYYLESEPCLVCNNPEVPMANIKLPSVKSDSKYTTTTMIYKLVQCHTIS 

KLIVRIADLKRTKMVRTINVYYNNRSVQAVVELKNRPALWHKARSVSLQSAQTELKIDFPLPITACNLMIEFADFFETVSGSSENLQCPRCSAAV 

PAYPGVCGNCGENVFQCHKCRAINYDEKDPFLCHSCGFCKYAKFDFSMYARVCCAVDPIESAEDRVKTVSLIHSSLERADRNYRQLLTNKQMLEL 

LIQKVAEHRSSDP^VEDNMASVHSTSQVNKIIQLLAQKYCVESRTSFEELSKIVQKVKACRSELVAYDRQQQDQPPVNPGSTTGAENPTTNRCYG 

CALASTEQCLTLLPAMAYNYDCRVCLYSQGLVSELAEHNLRRGTPLIQEEVRNLLVVLTKDNAEACMHLLQLVTTRVKNALMGSIPLISLEAAVH 

QEMTLLEVLLGQDDICWEYKLKVIFELFISNCRLPRGPVTAVLHPCLRIMQNLICPVLPGSKPNQKVATTDLCSMKMFEGNTVDYRAWLNSDRNH 

EYAAWSKRMPSNNQAKLKNAKDQNVAASGGSDAPPKSRREVRVAFLSEKYGKRWRERVLDKQRVIKPLVFNAKWIQPLLFNANSRFGRQLACSLL 

SSLSRTNERRQQALNMLTSFLKHVGEAGEASAEYLMLYKNMATEQPWLQYLVLKGVLSQISQLLAIEISKVHRMEEYSLSSDLSLGYALRQYVEL 

LWLLLECPNIRRTYKTPJXILGPVLESYLALRSLVVQRTRLIDDAQEKLLEMLEDMTSGTEEETRAFMEILIDTVEKTRMNDIKTPVFVFERLYSII 

HPEEHDESEFYMTLEKDPQQEDFLQGRMLGNPYPSSEMGLGPLMRDVKNKICTDCELIALLEDDNGMELLVNNKIISLDLPVKDVYKKVWLAEGG 

DRDAMRIVYP^RGLLGDATEEFVETLNNKSQEQVDTEQLYRtdANVLADCNGLRVMLERIGSLQRISRNRELIQVLLKLFLICVKVRRCQEVLCQP 

EIGAINTLLKVLQMCLQSENDSIQSAVTEQLLEIMETILSKAASDTLDSFLQFSLTFGGPEYVSALISCTDCPNVRNNPSVLRHLIRVLAALVYG 

NEVKMALLCEHFKDTLNFNRFDNERTPEEEFKLELFCVLTNQIEHNCIGGTLKDYIVSLGIVERSLAYITEHAPCVKPTLLRTDSDELKEFISRP 

SLKYILRFLTGLSNHHEATQVAISKDIIPIIHRLEQVSSDEHVGSLAENLLEALSTDSATAARVQQVRDFTRAEKKRLAMATREKQLDALGMRTN 

EKGQVTAKGSILQKIEKLRDETGLTCFICREGYACQPDKVLGIYTFTKRCNVEEFELKSRKTIGYTTVTHFNVVHVDCHTSAIRLTRGRDEWERA 

SLQNANTRCNGLLPLWGPAVGEAAFSACMTRHSSYMQESTQRCDISYTSSVHDLKLLLVRFAWERSFHDDAGGGGPQSNMHFVPYLLFYSVYLLL 

SSRSAARDSKTLLTYLQAPPSEKWLECGYEVDGPLFMATISLSLHSRELWNKHKLAHLKRMIAVAQGRHVSPAVLCKALLAPADRQVKDYTVYKP 

FLMMWALVDLIYDNLFKTVSTPKEEDWPISLFDYLRKNDEALLKSTDSILQTLTEEFLPCTSFVEFCDVAGLLHLIEHPDNFIEEILAALPSTSS 

SN* 

{SEQ ID NO: 1341) 



Celera Sequence No. : 142000013384665 
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TAGTTGAGGTGAGCGCTGGTTTAAATTAGCACAAAAAAGGCGGCTGGCATTATACGGTTTTGAGTTCTTGCAACAGCGAATAATTGTGGTTTTGA 
TGTGAATGCTTTCTAAGAGTTTTATCTAAATCGATTTTTGCTGACAGCTTAGCTTTTGCCAAAGGAATCTGTGGGATTTGTATTTAACCAAAATA 
TTTCTGACCTTAAGGATGTACTCAAAAGTGGCAGGGTATCCTTTCTACTGAGCACAACTGCAATGACTTATTTTTTCCTCTTCAAGGAATCCACA 
GTCATATTTCGGTGGGTATTCCTTTTATATTGCTGCTTTTTTGGTGCATGCAAGACGCAGATTACGTAATCCCTGGCTCAGAGTGCACATGATCC 
TTGCAGCTCCTGATCTCGTTCAATCTGACAAAGGGGATAAAGCGCAGTCAACTGGAGGAATCAGCTTAAGCGCTTACGTTTGTGCTCACGCTTAC 

GT C AT GC GAT GTTGTCATTGACTTTTAGTT GAT AT GC AAC C AGC AAG C C AG AG AGC C C AGAT T AGT C GG AC T GC AG AC T GCC GGG GT GG AC T C T G 
CAGATGGAATAGGATCGGATAAGCAGGAGGAGGCGCAGGAGAAGCAAATGCAAGTGGTACGGGCGGAGAAGAGCAAGCTTTACGGCTTGGTTCCC 
CTTCTGTCGGCTCCAATCATATGCGGCCACATTTCCACGGAATCTGAACTCATTGAATGCGCGCTACACGCCAATGCCTCTAATTTTCCATACTT 
TCGCTATATAGTGATGACTTGGATACATGAATCTTTTTTAACGTACTTATCACATAATATTTTTGAAACAATTTAAAGGCTCTACAAATAGATAC 
ATTTGGCAAGCTTACTTTCACAGTATATGCAAATTGAATTAACTAAATTTTCAAATTTTGGTCTTTGAATTGGTTTTAATACTTTGGAATTACTT 
GTTGCAAAATATACAGATTGAAGTAAATGTTACCTTTCTCATCACTGGTCTTACCTTTGGTTGATGTTGCTGCTGCGGCTGCGACGCTCCAGCAA 
CAATTGCTCCTGCTCCTCCATCGGCTGCAGCCAGTCGATGTTGCTGTTGTCCTTGATGATGCTGTTGCTGTTGCCATTGCTGTTGCTATTACTAT 
TGCTGTTGTCCTGGTCTTTGTTCTGGTCCTGGGTAACAGCAAACCTGCCCATACCTTGAACGGCCGCATTTAAGTCGGACAATTGCAGACGGAAG 
TCGTTGCGAGCATTCTCTGGATCATCATCAAACTGTCCCAGTCCCATTTCCATTCCCAATCCCTGTTGCTGCTGTTCGAGTTGCTGCTGTTGATG 
TTGCTGTTGTTGCTGCTGCTCATGTTGCTGCTGCTGCTGCCGTTGTAGATGTTGCTGCCTGTCGTATATGCTGTGGATGGGATTTCCGTAGAGTT 
TTTGCTGCTGTTGCTGCTGCCATTTGGCCGGTGCTTTGTTGTAGTTATAGTTGTCATTGTTGTCGTCCTCGTTGCTATCTGCAATGCCATCAACG 
TCCACATCCCCGTGCTCCCACTTGTAGTTGTAATCGTTGTAGTTGTTGTCCGTGGCGCTTCTATAGCCATATCTATTGTTTTTGAGCCCCGGCGT 
CGATTCCATCGTCTCCTCGCAGGCCATGGCGAGCGTTCGTCTGGAAGAGCCACTACCAAAACAGCGAGCAGCTAGAATGACGGCCAGGAGCAGCT 
GCAAATGTGGCAGAAACAAAGAGATTTGCTCGATAAGAAGGGTGCGGCACAAGGCAACGGAAATGTTAAACAATTGTTGTGTGGCTAATAAACTG 
CAAAGTGTCAACAGTTGTCGAAGGGGAAAGTGGGAGGGGGCTTGATATGCAAATACCCCTCAATTCGAATGGCAAACAATGCATGTGGATTAATA 
TTTATCACATGCGATTAGGCATCGCAATGAATAGGTTTATAAAAATATTTGTTGGGTATTTGGGCATCAAAGGCTTATTCAAACCCCTGGATTAG 
TGGGTAAGTCAATGAGATTCTTATAAAGGGTTTATAGCAAGCTATATGCATTATGGTTATTCATTGTAGTCTTAGTTGTTATTAATACCTTATCA 
GTT G AAT G AAT T C CT T AGGT ACT AT AC T T TG AC AAT GAT ACC AAT TAT C T T G AT AAAT AAC T C T T T AATT T CT T GAAAAAT AAAGT T C T T C C C C A 
AT TTCGCATTTTATTGTTTT G C AC GT G ACGT AG AT AT T T TAT T CT T AT C AAC AT G AAG AT AC T T AAGG AAAGC C T T GC GAT AGG AAT C AAAT GAT 
AATCAATACGCTCTGCTGATAGGCATGGTTATCAGTTGTACAAATATTACAACATGTCTAAAATTACCACAGTAATTACGAAACACGATTGTTGC 
TACGTTATAAAATTCCTAGAAAATCCATCGATAACTTTTTAATTTCATTACAACTACGAATTTAGTTTTTATTCCTAATTTTTTTTGGAAGCTCT 
GCAGTCTACTACTGCCAACCGAAAATCGACCGCTAATTCAACTAAGAACTCAACAAAATTCCGATTGACGAAACCGAAAACTTAAGCCTAAACAG 
CCTCCAAGTTCCCACTGAGCCGGCTGAAGGGGAAAGTTGAATTCCCTGCTGGATGTTGACAATCAATGTGCTTGGGG 

(SEQ ID NO: 1342} 
Exon: 1642.. 1001 

Start ATG: 1642 (Reverse strand: CAT) 
Transcript No * CT34209 

ATGGCCTGCGAGGAGACGATGGAATCGACGCCGGGGCTCAAAAACAATAGATATGGCTATAGAAGCGCCACGGACAACAACTACAACGATTACAA 
CTACAAGTGGGAGCACGGGGATGTGGACGTTGATGGCATTGCAGATAGCAACGAGGACGACAACAATGACAACTATAACTACAACAAAGCACCGG 
CCAAATGGCAGCAGCAACAGCAGCAAAAACTCTACGGAAATCCCATCCACAGCATATACGACAGGCAGCAACATCTACAACGGCAGCAGCAGCAG 

C AAC AT GAG C AG C AGC AAC AAC AGCAAC AT C AAC AGC AGC AAC T C G AAC AGC AGC AAC AG GG AT T G GG AAT GG AAAT GG GAC T GG GAC AGT TT G A 

TGATGATCCAGAGAATGCTCGCAACGACTTCCGTCTGCAATTGTCCGACTTAAATGCGGCCGTTCAAGGTATGGGCAGGTTTGCTGTTACCCAGG 

ACCAGAACAAAGACCAGGACAACAGCAATAGTAATAGCAACAGCAATGGCAACAGCAACAGCATCATCAAGGACAACAGCAACATCGACTGGCTG 

CAGCCGATGGAGGAGCAGGAGCAATTGTTGCTGGAGCGTCGCAGCCGCAGCAGCAACATCAACCAAAGGTAA 

{ SEQ ID NO: 1343) 

Start ATG: 1 (Reverse strand: CAT) 

MACEETMESTPGLKNNRYGYRSATDNNYNDYNYKWEHGDVDVDGIADSNEDDNNDNYNYNKAPAKWQQQQQQKLYGNPIHSIYDRQQHLQRQQQQ 
QHEQQQQQQHQQQQLEQQQQGLGMEMGLGQFDDDPENARNDFRLQLSDLNAAVQGMGRFAVTQDQNKDQDNSNSNSNSNGNSNSIIKDNSNIDWL 

QPMEEQEQLLLERRSRSSNINQR* 
(SEQ ID NO: 1344) 



Celera Sequence No. : 142000013384562 

CTGCACCAGACTGGGCTCATGCGACCAGAGCTTGTCCCCAAAGAGGTGGAGCATCTTGATGGCCACACCGTGACCTCCGCACATGTACAGTTCAT 
CGCTGGAGCGAGCCAACTGGCCCACGCCCACCGCAGATTTGTTGTTTGTGAGGTTTACGGCCACTGAAAAAGACAATGAGATCAATACTCGTTAC 
CTGGGGAATCCCAAGCAGATACCCACTCAACTGTCCCTTTTTGAAGTGCCCGTACATGCTCAGTCCCACGCCCAGGGGCACCACTCCGGGCAGCA 
TGAGATCCGCTCCGTTCGTCAGCTTGGGTAACACACCCTCGTGGGTGGTGAAATAGGGCAGGATGTCCGGAACGATCCACAGGGTGTACAGGGTG 
GGCACAAGTTGACCGCCGTCCAGCTCGAAGAACAAAGGCAACTTGTCCACGCAGTAGACCATGCTCTGAACACCGCCGTGCGTTAGGATCTTCAC 
CTGGGTCACCGCCGCCTTCGCCGGCACCAGCTGATCCACGGACACATGGGGGAACGCCGCCTCCACGCGCTGCCGGAACTTCTTGCTATCGGATC 
CCTTGAGGGCAGCACTGCTCTTGGGTCTGTACGGCTTGAGAAACATCTGTGTTTGGTGGTGTGCACTTGAGGAGTTCGTGAAATGGTTTGGAGTG 
GCACATGCACAGTGAAAACAAACAACTGGCAAACGGGGGTGCTGCTTTAGCTATTTTCTCGCCAGTTTGTATATTTAAAAAAATATCAATGGGAT 
TCGATATTCTAAAACCAAGTATAAGAAACATAAGAATAAGATGCAGAAAAAGAAATTAACATAAAATAAAGAAGTGAGTCCTGTCTTTTCTTTTC 
ACAATAACTCTACCCTAAATCAAAATTAGTTAAGAGCAGACGTTATTACAAGGACAGCACGCGCAGTAAAAATGCCGGCGGTCCGGAAAATTTCG 
AATCACCAAGGACTTCCTCAGGTAGCAAACGACAGAAAACCCAGGATCTTGTAAGACGCCATTTTATTCAACAAAACATCTGCGTAACAATTCCC 
CGTACACGGCCTTGTGACTCGAAGATGCGTGTTTGATTCATAGTTTTAACTTTGTACTTGAAATAAAAATTAGTAGTTAAACCCCCGATATTTGA 
AAAACATAGCCCGGCCCGGTAACAAGATTCGATCCTAGTCAGAATCCGAGTCGGAATTACTGTCCTTGGCCGTGTCGCGCGCTTGTGTAGGTGGA 
TCATAGATGTGGAGGCTTTTGCGCCAGATCTTCACGAGCCCATCGAAGGCGCGGCGGCTGTACTTTCCATACTTGTTCGGGGTGCGCGGATGATC 
GCGAGTCCGCTCGTCCTTTGGCACCATTTCCACATAGCGCTCGTAGGCGGCAGTATTCTTTCCGTAATCTATTTGCTTCTGCCTGCGACTTAGGA 
TGGCGGGATCGGCTTCGGTGTAGGAGGATGACGATGAGGTGGAGGACGTGTAGCTGTTGTGGCGCACCCGTTTCTCCTCCTTGTGGGCGCGGAAG 
TTGCCAGACTTTTTTGAGTTCCTCGAATGGGGATTGTTGCCGCCTTTGGGCGCAGCTGCATCTCCGTTGGCCGAGGAGTTGCTGCTGTTTGGACT 
GTTGGAGCGACTGTTCTCGTTGCTGGGCGGCGTGAACGAGTGTCGGCGCTTGTAGGGCGTTTTTAGCTTCTCCTCCTTGACCAGGCGCTCGAACT 
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TTACCTCGTTGACGCCATCCAGGAACTCAAATGCCATCTCCCGGCCCTCGCCATCTTTGGACTCCAGATTACCGTCGGCTGTCTCCTTTTTGGCC 
AAGGAACCGCAGCTGGCAGCGCTGCTATTCAAGGAACTGGAGGCCTGCACGTGCAAACAAGAGAATTAATTGTGTTCCCGATCGTAGCAGCCAAC 
CTTCACTTACCTCGTCACTGTGGCCAAACTCCGCGCGGACCTCCTGGGCCCAGCTCTGCATGGTGGGCTTCACATCGATCGAAATGGAGGATGCT 
GAGGAATTCAGGCTGCCGGAGCCTTTCTGCGGTGTGTTCTCCACGGACATGTGCTGATCCTCGCAGAGCATGTTGTGCAATTATCTTTCGTTTGT 
TTACTGTGCTGTGTATGATGCGGTCCGCTGCTCCAGGATGTTGTCGCGGCGGTGTTTTGCGTTTTCTGCGCAAACCTTCGCGCGTTTTTTGTTGT 
AGATGCGATGCTCAGTGTGCCCAGAAGTGGAGGTGGTAAGAAGGCGGCAAAAAGGTACCACAACACAGAACAACCTGGTCACTCACAACTAATTT 

C AAAC T AT GC T C C AAAT GT AT T AC AT T T C AT TT C AAT T T T G T T AAAAT T T GAT GT C C AAT AC AT AT T T AAAC AT AC AG AAAAT T AAC AGC G AT T T 
TTAAATTGTCTGAGGGAGGCTCAAACAATATCGATGCATCGGTACACATCGATAGCAAGACATCGAGCAACCAACCATTACCAACACTGTCTGCT 
TACCGAGTGATTTAGCATTTTTTCGGCACAAAAGAAAACTGAAAATTAAAACGGTGCAATTGTTGGTTAGTACGCAAAAGAACGATTTGTTTTCA 
TAGCAAACGTAAAGTTTTCGAGCCCAAGTTGAACATACGTCACTTTAAAATCCGTAGAACATATTTTTGACTGGAAACAACGCAGCTGGAAAATG 
AGTCTTACGTCCGCCGCGGGGATAATTTCCCTACTCGATGAGCCAATGCCGGATCTTAAGGTGTTCGCCCTCAAAAAGCTGGACAACATCGTGGA 
TGAGTTCTGGCCGGAGATCTCGGAGTCCATCGAGAAGATCGAGATGCTGCACGAGGATCGCAGCTTCCCCGAAAACAAGCTAGCTGGCATGGTGG 
CCTCCAAGGTGTTCTATCACCTGGGCTCCTTCGAGGACGCTCTGACGTACGCCCTGGGCGCTGGTGATCTGTTCGACGTGAATGCCCGCAATGAG 
TACACGGAGACCATCATTGCCAAGTGCATCGACTTCTACATTGCCCAGCGGGTGGAGTTCATCGAGAACCCTAAGGAGGCGAGTGTCGTGGACGA 
GCGGCTGGAGGGCATTGTGAACAGGATGATTCAGCGCTGTCTGGACGACAACCAGTTCCGCCAGGCTCTGGGCATCGCTCTGGAGACCCGTCGCA 

TGGACACCTTCAAGGACGCTATCATGAAG 
{SEQ ID NO: 1345) 

Exon: 2164.. 1911 
Exon: 1849.. 1001 

Start ATG: 2066 {Reverse strand: CAT) 
Transcript No. : CT34229 

TTTGCGCAGAAAACGCAAAACACCGCCGCGACAACATCCTGGAGCAGCGGACCGCATCATACACAGCACAGTAAACAAACGAAAGATAATTGCAC 
AACATGCTCTGCGAGGATCAGCACATGTCCGTGGAGAACACACCGCAGAAAGGCTCCGGCAGCCTGAATTCCTCAGCATCCTCCATTTCGATCGA 
TGTGAAGCCCACCATGCAGAGCTGGGCCCAGGAGGTCCGCGCGGAGTTTGGCCACAGTGACGAGGCCTCCAGTTCCTTGAATAGCAGCGCTGCCA 
GCTGCGGTTCCTTGGCCAAAAAGGAGACAGCCGACGGTAATCTGGAGTCCAAAGATGGCGAGGGCCGGGAGATGGCATTTGAGTTCCTGGATGGC 

GT C AAC G AGGT AAAGTT C G AGC G C CT GGT CAAGGAGG AG AAGC TAAAAAC GC C C T AC AAGC G C C G AC AC T C GT T C AC GC C GC CC AG C AAC G AG AA 
CAGTCGCTCCAACAGTCCAAACAGCAGCAACTCCTCGGCCAACGGAGATGCAGCTGCGCCCAAAGGCGGCAACAATCCCCATTCGAGGAACTCAA 
AAAAGTCTGGCAACTTCCGCGCCCACAAGGAGGAGAAACGGGTGCGCCACAACAGCTACACGTCCTCCACCTCATCGTCATCCTCCTACACCGAA 
GCCGATCCCGCCATCCTAAGTCGCAGGCAGAAGCAAATAGATTACGGAAAGAATACTGCCGCCTACGAGCGCTATGTGGAAATGGTGCCAAAGGA 
CGAGCGGACTCGCGATCATCCGCGCACCCCGAACAAGTATGGAAAGTACAGCCGCCGCGCCTTCGATGGGCTCGTGAAGATCTGGCGCAAAAGCC 
TCCACATCTATGATCCACCTACACAAGCGCGCGACACGGCCAAGGACAGTAATTCCGACTCGGATTCTGACTAGGATCGAATCTTGTTACCGGGC 
CGGGCTATGTTTTTCAAATATCGGGGGTTTAACTACTAATTTTTATTTCAAGTACAAAGTTAAAACTATGAATCAAACACGCATCTTCGAGTCAC 
AAGGCCGTGTACGGGGAATTGTTACGCAGATGTTTTGTTGAATAAAATGGCGTCTTAC 
(SEQ ID NO: 1346) 

Start ATG: 99 (Reverse strand: CAT) 

MLCEDQHMSVENTPQKGSGSLNSSASSISIDVKPTMQSWAQEVRAEFGHSDEASSSLNSSAASCGSLAKKETADGNLESKDGEGREMAFEFLDGV 
NEVKFERLVKEEKLKTPYKRRHSFTPPSNENSRSNSPNSSNSSANGDAAAPKGGNNPHSRWSKKSGKFRAHKEEKRVRHNSYTSSTSSSSSYTEA 
DPAILSRRQKQIDYGKNTAAYERYVEMVPKDERTRDHPRTPNKYGKYSRRAFDGLVKIWRKSLHIYDPPTQARDTAKDSNSDSDSD* 

(SEQ ID NO: 1347} 



Celera Sequence No. : 142000013384851 

CGAAACTATTAAGAAATCACAGCCTATCTAATTACCCATAAAAAGTTTCATTAATTTATTACGAATAAATTATGCCGCTAAGCTACATAAATCCG 
AAAACGCGAAATGAACTTGCATCTTTTGATCTTAGAAAACGGTGTATGCCCAACAACCACACAGCAACATCAAATACAGATTAAAAACACAAAAG 
GGACTAACCATAAGATCGATCCACTTCAAAGCGAGCATTAAATTAGTAAATTGTTGCAATCAAAAAACAAAATAGACTGTAAGTCCCACACTCTA 
GGTATGCAATTGCAATACGTTTTGTATGCGTAATAGCCATTTTTGTGCAAACATTACCTATGTAATTAATATTATACAAATACAAAAACAAAAAT 
ATGTAGAAAATTAGGTGGAAATGGAGTGGAGCATTTTTTCAATAAAATCACTAAAATTAACTACTACAGCCTTAATTTATCTAGCCAGCCAACCA 
TTTCAATTTCGGAGAAATCGCATTTTCCATTCACAAGATTAAGGGTCATATGTTGCGTTAATTATAATTTTATTAACATTATTGTTGGTTTAATT 
TGAATGTGTTAAATTATAAACGTGTGCCGTCGCATTTGCTTAAAAATTTATGAAAATAGTAAAGTTAGAACTTAGAACTAAAGCTTATGCTAAGA 
ACTACGTGTGACAAGCTGGCCGTCGAAGAGGGAGCAAAAACAAAATATTAAAACATAGATGTAGTGGAAGGCAATTTTACAATTCAATGTGTCAA 
AAAACGTCACGTAATGAAATCAAACCAGCTCCAAGAGTTGATATGGCCTTTGAACTAAAAAACATACAACTTCGGGCGCTGTAGGTCCTTACGGT 
TAGAGTAGGAAATGTGGATGGACAGGTGCTGCAGTCGCGGTTTGCTTGGAAGTGAGTTAAGGCCAATCAGCCGACGGCGACACATTGGGTAGATT 
ATAAGTGTATGTTAGAGATTTTAAAACACTTAAGACTATGACCGTGAGGCTTAGTTACGTCGCGTTCGACGACGAGCATTTCCTGCAAAGGCGTT 
GTGCGCCCCCGGATGCTGTCCGCTGTACAAGTTGTCCAAGAACTCATGCAATGTAGCCTCGCTGCAAAGTGAGATTGTTTTATTTTATGTTTTGC 
AGATTGCTCGAAGAAGTTCGCCCTTACCTAACGCCGCTCACTCCCCCACCAGGATGATGTACACCCCGATCGTGATGTGGCTGCTGCGGGTGCTG 
GTGGTGTTGTTGTTGTTGCTGCTGCTGCTGCTGCTGTTGTTGTTGCTGCTGCTGCTGAGCACCGCGCACGATGCGGTACTGCACCATGTGCGAGT 
TTGGTTCCAAATGTGACAGAGCTCCTTTTTGGCAGTTAGCCCACTCGGTTATGTCATAGACCTTGCCGTCCATTAGCGCCAAGTATTTCCACCGC 
AAGCCCATCATGCTTGTCTCGGCCCAAATGTCACCCTATTGCGAAAACAAAAACACATTTTTCAAAGTATGAGTATATCCAACTGTGGATTTCTG 
CACCCTTACATCTTTGGCGGAGTGCCGTATTTTGCAGGAAGCACACTCACGCGCCGCATAGTGAGGGCGTTCGGTTAGCTTGCGAGGATGACGCT 
GGGCACAAGTGCTGCACCTGTATTAAGCGGTAACCAGAACGATTAGTAATTTGCTATCTATAGAGACGCGTTTGGTACATACGCACCTTATGGTA 
TTGGCCGCCTCAGCCATTTTGGTTTGAAGTTGGGAAAGCAGGTCGTGCAACTCCGTCCACGCCTTCTCGGTGTGCAGAGTTTCGGCAATGCTTTG 
GTCGTAAATTAGTCGATTCTCCTGCAAAAGGAAACTTTTAAGATAGCCATCGATCATTTAAAGTGACAATGCATTACCGGTTCTCCGATTAACTC 
AAAGGCGCGCTGCAGCACCTTAAAAGCCTCCTCAGCACCGGCTTGTTTGTTTTTGTCGGGATGTACAAGCACGGCAATCTTTTTGTAGTGCTTGC 
GGATCTGCTCCTGGGAGCTGTCTGGGGGAACACCAAGTATGCTAGAACAAGAAGTAAGAACTGTTGCACACGTTCATTAATAGTTGAGCAGAGCT 
CACCTGTATGCATCCTTTCCCTTGCAATTAAGCAGAGAGTACATTGCTTCTTCACCCGTTTGTGGCAGACGACCAGTTTTGAATTGCTCGGAAGT 
GGTTTCTGGCGGTTTGCGTTTGTAGAAAAGTCGCCATACGGCCAGGCGCGAGTTCTTAGCAAAGCGAGCGTCGAATCGCCGCCAATAACTTCGCC 
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ACCATATGCTTGGCTTGCCAGAGTTCTGCTTCAGTTCCTTTTGCAGCTGCCGCGCATAAGCCAATGCAGCCACATAGGCCAACGTTATCCGCTCG 
TATCCCATGCTAAAGCCCAGAACCACGATATCGTACACCAAATAAAACAGCCACGTTAAAAGTTTCAGGGTGTAAATCCACCAGGTGTGGATAAT 
ACGTTTGGCCAACTCGTAGTTGCGGTTACTGGAAAAGCTGTTGGCATGGGTGGGCGAGTTTGTCGACGAATAGGCATTGCTTCGATTGCGCTTCG 
ATGAGTAGGAGTAGCCAGAACTGTTGTTCGGGTTAAGATTCGAGGAGGAATACCGTTTGGCCGAAGAGGAACTAACATTGTTATTGCTGGAGTAA 
7TTACGCTGCTACTGCTACTGCTGCTTCCCGCATGTGTGTTTGCGCCAGTGTGATTCGAGTGACTCGCGTTGGAGTTGATTTTGTTACTGCCTGA 
T T T AC GT G AT G AAGT GC T AG AAT TCGTTGCCG GAG T AAAAT T G AG AAC AT TGCTGTGGTTCT GC AAG CTGCTGTTGGCG GAC AC C GG AGT GGT C T 
TATGCTCCGAAAACTTAGGGGCGTTGCTATTTACGTTCTTCGGTTGCAGTGGCCGCTTCTTTTCCGGATTATTTAGTTTGTTTGATTTAATCTGC 
GGCTGTAGATTTGGCTGTGCTAGAGACTGAGACTGTTGAACCTTGAGCGGCGCGGTCTGGCTGACTTTACTTCGCTTCTTGCCTCGGCTGGCATG 
CGCATGACCAACACCGGGCACAAACGAACCCGTCTCCTTGGCATCGTGCAGCGGTGAACGTTTTGACGTCCGCATCTCGCGATTTCCATATGCAT 
CCTCCTTTTCCGGACGCCTCTGCGATTGTACACTGTAGTCCCCACCGTTGTCTGCGCGCATATCTCGCCCTGCAGCACTCGGTTTTGTGCCTTTG 
CTGAGCACATCGGAGTACGACATGGGCATGGGCTTCACCTCGGCCACGATTCGCTTTTTGCCTTCGGGTTGGCCAGGCACGGCGACGCTCGACTT 
GTTGATGGCAGTTGGTGAAACGGTGGCAGTGGCCAAAGGCATTTTCAGAGACTTGGCTGGCTTGGAAGCCGCTGGCAGTGTATTGACGGATTGCG 
ACAGTGGCTCAGGCTGCTGCAGGTTCTGATTAGGAACCTGTGGCTGTTGGGCTATCGAAGTCTCGAGAACGACGTCCATCTCGTTGGCCTCGCCA 
TCAAATTGAATGTAGCTTCCACCAGTCAAATTGGGCGACCAGTTGTTTACAAGGTTGTTAATGAGAAGGTTAGCACTGCTCGACGAGTGATTGAC 
TTGGGACACTGTTGGGGCCTGAGTCTGCGTCTGTTGGGGCTGTCGATGCTGGGCGTCTTGCTGCTGTGCATGTGGTAAAGTCAAAGGATTTGTGG 
CAAACAGTTCATACAGTGGAGTCGGCTGCGAAGGTGTCGGCGGAGCGGCCTGGTGTTGGTGCTGCAGCTGATGGTGATTAATCGAGGCCATGTTG 
ATATGCGTGAAGAGTGGGGTGCCGGCTTCGCTGACCTGGTTGTCCGGGCAGTGGTACACTTTGAGAAAGTTGCCGGGTCCCACAGGCAGCACGGA 
GTAGGTCTCTGGCTGGCCTGGCTGACCGGAGTAGGACAGATTGAAGAGATACTCATTGTACATGTGGGGCTGCTGCTGCTGAAGATGGGGCGCAA 
AT GGG GGG C CG GC T AGGT GC GGT GGT G GAG GT GGC C AC ATC AG C GG GAC AGGT GC T T GT CC AT GC T GCT GCT GGT GG AT C AG AT AGT TTT GGG T T 
TGATGTTGATGCTGCTGCTGTTGCTGTTGCTGCTGTTGGTGGGCCAGGTAGTTGCCCACTTCGTCAGTGTGGGGCAGGACGCGAAAGGGTGCAGA 
CTCAGTGCTGCCCCCTCCAATCTCGTCGTCCTGACGAGTCGCCATGTCCTGCGATGGGGCCTATGCTTTTTGCGCACCAGCTCCGCTCGCCTATT 
GGAATTTCAAGCGTGCTCCATGTATTCACCGCAGCGAGTCTGTCACGGATGTGCTGCAATCGCAGGTGACAGGTAGTTGGTTTTCTGTGGCCCTA 
GCCCACTTTTTCACTCCAAACTCTAAATAATTCGCAGAGCGTCCGTGTGTAAGAGTTTTATATCTTTTTATTGCAAACGTCAAATGTAGTGTTGC 
CAGATGGGTGGAATTCCATTTTCCTAACTGCCCTCGACCAACCCTGGTAGTAACCTATCGAAAGATTTCCCTCCAACGTATTTTCCGAATTGTTG 
TTAAACTGTTGGATTATTAATCGAAGTTTCCTCAAATAAAAAAACAAATTTCGCCAACTTTTCCATTGCCACCAATCAGCCGTTTTTGAAGATAC 
ATATATTGATGAACCCGCTGATCGTGCAGCCCTGGTTAATATTTGTTGTTTTAAACGTTGCAGTCAAATGATTTTTAAACAATTAAAAAAATGCC 
TTTTGTCCGACCGCTGGTGACCGTGGTGCGGGCCGGGCGGCATAGCTATTCAGCAGGATTACAGCTACAGCAACGTCTTGCAAGGTCCAACCAGA 
TCCTGAATCCGCCAGCGGAGTTCCGAAACTACCTGGTGCTTCAGGAGCACGACCCGGTCTACACAGTCGGACTAAGGACAAAGGACTATACGGCG 
CAGGATGAAGACCGACTCCGCCGGCTAGGAGCAGACTTCCATCGCACAGATCGCGGTGGCTTGATCACATTTCACGGCCCCGGCCAGTTGGTGGC 
CTATCCCATTCTGCACCTGGGTCAGTTCGTGCCGAGCATCCGCTGGTACGTGGCGACGTTGGAGCGGATGGTTGTCGAGGCCTGCCACCAGATGG 
GCATTTCCAGTGCCAAGGCGACCAAGGACACCGGTATCTGGGTGGGTGACAACAAGATCTGTGCCATCGGTATCCACGGCTCGCGTTACGTCACC 
ACGCATGGAATCGGCCTCAATTGCTGCACGGATCTGCAGTGGTTTGAGCACATTGTGCCCTGCGGAATCGAAGGCAAGGGGGTCACCTCGCTCAG 
CAAAGAGCTGGACAGACATTTTCCTGTCGAGGAGGCCAGTGGCGCACTCCTCAATAGTTTCGCTAAAGTTTTCGAATGCCGCCTTCAGGAACACG 
CC AAAAAACCAGC AAGT T CT GCTGAAATCGGA 
(SEQ ID NO: 1348) 

Exon: 4447.. 2094 
Exon: 2036.. 1883 
Exon: 1826. .1702 
Exon: 1632.. 1530 
Exon: 1460.. 1168 
Exon: 1106.. 1001 

Start ATG: 4225 {Reverse strand: CAT) 
Transcript No. : CT34422 

TTTGCAATAAAAAGATATAAAACTCTTACACACGGACGCTCTGCGAATTATTTAGAGTTTGGAGTGAAAAAGTGGGCTAGGGCCACAGAAAACCA 
ACTACCTGTCACCTGCGATTGCAGCACATCCGTGACAGACTCGCTGCGGTGAATACATGGAGCACGCTTGAAATTCCAATAGGCGAGCGGAGCTG 
GTGCGCAAAAAGCATAGGCCCCATCGCAGGACATGGCGACTCGTCAGGACGACGAGATTGGAGGGGGCAGCACTGAGTCTGCACCCTTTCGCGTC 
CTGCCCCACACTGACGAAGTGGGCAACTACCTGGCCCACCAACAGCAGCAACAGCAACAGCAGCAGCATCAACATCAAACCCAAAACTATCTGAT 
CCACCAGCAGCAGCATGGACAAGCACCTGTCCCGCTGATGTGGCCACCTCCACCACCGCACCTAGCCGGCCCCCCATTTGCGCCCCATCTTCAGC 
AGCAGCAGCCCCACATGTACAATGAGTATCTCTTCAATCTGTCCTACTCCGGTCAGCCAGGCCAGCCAGAGACCTACTCCGTGCTGCCTGTGGGA 
CCCGGCAACTTTCTCAAAGTGTACCACTGCCCGGACAACCAGGTCAGCGAAGCCGGCACCCCACTCTTCACGCATATCAACATGGCCTCGATTAA 
TCACCATCAGCTGCAGCACCAACACCAGGCCGCTCCGCCGACACCTTCGCAGCCGACTCCACTGTATGAACTGTTTGCCACAAATCCTTTGACTT 
TACCACATGCACAGCAGCAAGACGCCCAGCATCGACAGCCCCAACAGACGCAGACTCAGGCCCCAACAGTGTCCCAAGTCAATCACTCGTCGAGC 
AGTGCTAACCTTCTCATTAACAACCTTGTAAACAACTGGTCGCCCAATTTGACTGGTGGAAGCTACATTCAATTTGATGGCGAGGCCAACGAGAT 
GGACGTCGTTCTCGAGACTTCGATAGCCCAACAGCCACAGGTTCCT21ATCAGAACCTGCAGCAGCCTGAGCCACTGTCGCAATCCGTCAATACAC 
TGCCAGCGGCTTCCAAGCCAGCCAAGTCTCTGAAAATGCCTTTGGCCACTGCCACCGTTTCACCAACTGCCATCAACAAGTCGAGCGTCGCCGTG 
CCTGGCCAACCCGAAGGCAAAAAGCGAATCGTGGCCGAGGTGAAGCCCATGCCCATGTCGTACTCCGATGTGCTCAGCAAAGGCACAAAACCGAG 
TGCTGCAGGGCGAGATATGCGCGCAGACAACGGTGGGGACTACAGTGTACAATCGCAGAGGCGTCCGGAAAAGGAGGATGCATATGGAAATCGCG 
AGATGCGGACGTCAAAACGTTCACCGCTGCACGATGCCAAGGAGACGGGTTCGTTTGTGCCCGGTGTTGGTCATGCGCATGCCAGCCGAGGCAAG 
AAGCGAAGTAAAGTCAGCCAGACCGCGCCGCTCAAGGTTCAACAGTCTCAGTCTCTAGCACAGCCAAATCTACAGCCGCAGATTAAATCAAACAA 
ACTAAATAATCCGGAAAAGAAGCGGCCACTGCAACCGAAGAACGTAAATAGCAACGCCCCTAAGTTTTCGGAGCATAAGACCACTCCGGTGTCCG 
CCAACAGCAGCTTGCAGAACCACAGCAATGTTCTCAATTTTACTCCGGCAACGAATTCTAGCACTTCATCACGTAAATCAGGCAGTAACAAAATC 
AACTCCAACGCGAGTCACTCGAATCACACTGGCGCAAACACACATGCGGGAAGCAGCAGTAGCAGTAGCAGCGTAAATTACTCCAGCAATAACAA 
TGTTAGTTCCTCTTCGGCCAAACGGTATTCCTCCTCGAATCTTAACCCGAACAACAGTTCTGGCTACTCCTACTCATCGAAGCGCAATCGAAGCA 
ATGCCTATTCGTCGACAAACTCGCCCACCCATGCCAACAGCTTTTCCAGTAACCGCAACTACGAGTTGGCCAAACGTATTATCCACACCTGGTGG 
ATTTACACCCTGAAACTTTTAACGTGGCTGTTTTATTTGGTGTACGATATCGTGGTTCTGGGCTTTAGCATGGGATACGAGCGGATAACGTTGGC 
CTATGTGGCTGCATTGGCTTATGCGCGGCAGCTGCAAAAGGAACTGAAGCAGAACTCTGGCAAGCCAAGCATATGGTGGCGAAGTTATTGGCGGC 
GATTCGACGCTCGCTTTGCTAAGAACTCGCGCCTGGCCGTATGGCGACTTTTCTACAAACGCAAACCGCCAGAAACCACTTCCGAGCAATTCAAA 
ACTGGTCGTCTGCCACAAACGGGTGAAGAAGCAATGTACTCTCTGCTTAATTGCAAGGGAAAGGATGCATACAGCATACTTGGTGTTCCCCCAGA 
CAGCTCCCAGGAGCAGATCCGCAAGCACTACAAAAAGATTGCCGTGCTTGTACATCCCGACAAAAACAAACAAGCCGGTGCTGAGGAGGCTTTTA 
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AGGTGCTGCAGCGCGCCTTTGAGTTAATCGGAGAACCGGAGAATCGACTAATTTACGACCAAAGCATTGCCGAAACTCTGCACACCGAGAAGGCG 
TGGACGGAGTTGCACGACCTGCTTTCCCAACTTCAAACCAAAATGGCTGAGGCGGCCAATACCATAAGGTGCAGCACTTGTGCCCAGCGTCATCC 
TCGCAAGCTAACCGAACGCCCTCACTATGCGGCGCGTGAGTGTGCTTCCTGCAAAATACGGCACTCCGCCAAAGATGGTGACATTTGGGCCGAGA 
CAAGCATGATGGGCTTGCGGTGGAAATACTTGGCGCTAATGGACGGCAAGGTCTATGACATAACCGAGTGGGCTAACTGCCAAAAAGGAGCTCTG 
TCACATTTGGAACCAAACTCGCACATGGTGCAGTACCGCATCGTGCGCGGTGCTCAGCAGCAGCAGCAACAACAACAGCAGCAGCAGCAGCAGCA 
ACAACAACAACACCACCAGCACCCGCAGCAGCCACATCACGATCGGGGTGTACATCATCCTGGTGGGGGAGTGAGCGGCGTTAGCGAGGCTACAT 
TGCATGAGTTCTTGGACAACTTGTACAGCGGACAGCATCCGGGGGCGCACAACGCCTTTGCAGGAAATGCTCGTCGTCGAACGCGACGTAACTAA 
{SEQ ID NO: 1349} 

Start ATG: 223 (Reverse strand: CAT) 

MATRQDDEIGGGSTESAPFRVLPHTDEVGNYLAHQQQQQQQQQHQHQTQNYLIHQQQHGQAPVPLI^PPPPPHLAGPPFAPHLQQQQPHMYNEyL 
FNLSYSGQPGQPETYSVLPVGPGNFLKVYHCPDNQVSEAGTPLFTHINMASINHHQLQHQHQAAPPTPSQPTPLYELFATNPLTLPHAQQQDAQH 
RQPQQTQTQAPTVSQVNHSSSSANLLINNLVNNWSPNLTGGSYIQFDGEAKEMDVVLETSIAQQPQVPNQNLQQPEPLSQSVNTLPAASKPAKSL 
KMPLATATVSPTAINKSSVAVPGQPEGKKRIVAEVKPMPMSYSDVLSKGTKPSAAGRDMRADNGGDYSVQSQRRPEKEDAYGNREMRTSKRSPLH 
DAKETGSFVPGVGHAHASRGKKRSKVSQTAPLKVQQSQSLAQPNLQPQIKSNKLNNPEKKRPLQPKNVNSNAPKFSEHKTTPVSANSSLQNHSNV 
LNFTPATNSSTSSRKSGSNKINSNASHSNHTGANTHAGSSSSSSSVNYSSNNKVSSSSAKRYSSSNLNPNNSSGYSYSSKRNRSNAYSSTNSPTH 
AWSFSSNRNYELAKRIIHTWWIYTLKLLTWLFYLVYDIVVLGFSMGYERITLAYVAALAYARQLQKELKQNSGKPSIWWRSYWRRFDARFAKNSR 
LAVWRLFYKRKPPETTSEQFKTGRLPQTGEEAMYSLLNCKGKDAYSILGVPPDSSQEQIRKHYKKIAVLVHPDKNKQAGAEEAFKVLQRAFELIG 
EPENRLIYDQSIAETLHTEKAWTELHDLLSQLQ 

ALMDGKVYDITEWANCQKGALSHLEPNSHMVQYRIVRGAQQQQQQQQQQQQQQQQQHHQHPQQPHHDRGVHHPGGGVSGVSEATLHEFLDNLYSG 
QHPGAHNAFAGNARRRTRRN* 
(SEQ ID NO: 1350) 

Classification: hypothetical 



Celera Sequence No. : 142000013384 693 

TCAAAACAACGATCTCTTGCTGTTGAACCGCCACCATGACACATACAAGAATCTTACCGCGAAGCTCATGCAGTCCCTGTATATCCTGAGACGCC 
ACTACGAGTTCTCGTATATGCTGAAAGTGGACGACGATACCTATGTTAAGCTGGACAGTCTAGTTAATACGCTGGTTTCCTACGATCGCAAGTTG 
CTGCGCAAGAGGTCGGAGTATAGGGACCATGTGTTGCCTCAGCTTTACTGGGGCTACTTTAACGGCCGATCCACAATTAAAACGAAGGGCCAGTG 
GAAGGAGTCTAGTTACTACCTTAGCAAAAACTACCTGCCGTATGCACTGGGCGGGGGATACGTTCTGTCCCGAAGTCTGTGTGACTACATTGTCA 
ATAACTCCCAACTGCTATCGCACTATGGGTCCGAGGATGTCTCCGTTGGCACCTGGCTCGCCCCACTGCGCCACGTCTATCGGTGGCACGATCCC 
AGATTCGACACCTCCTACGCACCGCGCAAGTGTCGGTCCTATCACATGGTGCTGCACAAACGCAATGGGCAGATGATGAGGGACATTCATGACGG 
TGAACTCTGCAGTGGCATTGGTTCGAGTATATTGTCCGATTACTATTACGATTGGACGAGAACAGCGGACAAATGTTGCGACAGCTTAGTAGCGT 
AGTCGTGTGGACAGGCTAACCAGCGCTCAACTGGCCATTCTATAGGATTTTCCTTTCCTTCACGACTAGCTACCACGACTCAGCCGGCTGTGCAG 
AGCTCCTCGCTATGCTTTGTATTTGCAGCCGAGCAGCATCCTCCACTGCGATGGCCAAAAACCAACACTCGCTTTGCGGCTAGTGGGGCACGTTT 
GCATGCGTAGAAGTAGTTTGAAATGTCGAGACTGCAATGGCAGGTAATAAAAAAGTATTAAAGATGTACATTTTACAATACTTTACTCCATTATG 
ACATAATTCGGGATGCGCCACAAGACAAGGTAAACAATTAGAATGGACACTTAGAAGAACTTAACGCCGTGGCTGCCGTCCATCTCGATCAGCTC 
ACACCTGCCCTTTTCCTCCAGCAACCGGAGGGCACTGAGCAGCACCGCCTCGTCCACGCCGTAAAAGTCCAGGTGAGAGGTGTTCTCGCCGGAGG 
CTATTTCGTACAGCGTGCAGATGGTGTTCGTCTGGCCCGTTTCCTGCACCCAGTCGTACACCATGTTGCCGTACTCCTCGAGGGTGAACCAGTAC 
ACCTGCCACTCCTGGCGTCGCTTGTCCAGCGGATTCGCATGCCCGCTCCGCTCCAGCTCGCCCAGAATGGCCAGGACGAGTTCCGGCGAGAGGCG 
GCGCTTCAGCGCTTCGTTGTGGAACAGCGGCGAGTTCTGGTCCCCAATGCTGAGAGTAAACCTATTCGTGTGCCTGAGGTATTTGAGAAAGAGAT 
CTGTCCAGACCTTCAGCTGCTGCTGTCTGGTTTCTTCGTGGGGCTGTAGTCTGAAACCCGGGTGTTAGTCGAGTGATGGAGCATCTCCTGGTCGG 
AACTCACGTAAAGAAGGGTGGGAAGGTGTACTCCCAGGGCCACTGGAACTCCGCCATGCTGATTAACTACGATCTAAACATTTGTTTTTCCAATG 
TAAGTTATTACTAATGGCCTAGAGTGACCATATGCGCTAAATAGCAAAATACCTTGCTATTCAAAAACCATTCAAAAGTAAAAAAATTCGTTACT 
TTTTAGTTTTATTATTGTATCCTTATATTTAATATTCGCATTAATAAAAAAAGTTACGAGTTTCCTTAACTTTCTTTTGCATAGCACTTAGCTTA 
CGAGGAATTTAAAGCCATTTTTGAAAGCAAACTATTTAAATTAAACACCCGTTTTAAATTTTGAACCCAATATCCCAAGAAAACACTTATTTGTC 
ATTTACAGTAAAATGCAATCCATTTCTTGTATTTTGTCTTAAATAGAAAAAATTGTGTACATTAACATTTTCATCGGGCTTTTTATGAACTTTGA 
TTTGGGCGTCTGGTTTTCAGTTTCGGTTTCTGCTTTATTCAAAATCTCTGATTTATCAATTCAATGCCTAGGTCTAGAATCAATTTCGCGAGAAT 
GTGGCGACAAAAGAGGGGGATTGCAATGGCTTGCATCGTCAATATACACGCACATATATACTCGTATCTGATAAATCTGCTTTATTATTTATAAG 
TAAAGTTCTAGGAGGCGCAACACTATACATTCAATATCTGCTATTCTGTATTCCTCTTTCAAACACGCGATTTCCGAACCAATTTAGACACTTTA 
TGTGTGGCATGCTGCGGGTGCTTAGTTACTAACTTTATCGATTGTGTGCTAATGTTTCGTTCAAGTTTAACTTGCTGATCGGATCGGATCGAATC 
GGATCAGATCAAATGGGATCGAGCTGGGCCTGCTGCCCCCACGGAGCTTCCTTGGCTTGGTATGTACGACATATATGGTAATCATCTTGTTTGCA 
CTATATTTTATTTGGTTTTTGTACACATGCGTGAAATTCTAGGGAGAATGCCATTCGGGGAGTAGGAATAGAGTACAGTTTAGTGTTAGGATTGG 
ATTTCAGCTGGC 
(SEQ ID NO: 1351) 

Exon: 15*77.. 1528 
Exon: 1475. . 1001 

Start ATG: 1577 (Reverse strand: CAT) 
Transcript No. : CT34551 

ATGGCGGAGTTCCAGTGGCCCTGGGAGTACACCTTCCCACCCTTCTTTACACTACAGCCCCACGAAGAAACCAGACAGCAGCAGCTGAAGGTCTG 

GACAGATCTCTTTCTCAAATACCTCAGGCACACGAATAGGTTTACTCTCAGCATTGGGGACCAGAACTCGCCGCTGTTCCACAACGAAGCGCTGA 

AGCGCCGCCTCTCGCCGGAACTCGTCCTGGCCATTCTGGGCGAGCTGGAGCGGAGCGGGCATGCGAATCCGCTGGACAAGCGACGCCAGGAGTGG 

CAGGTGTACTGGTTCACCCTCGAGGAGTACGGCAACATGGTGTACGACTGGGTGCAGGAAACGGGCCAGACGAACACCATCTGCACGCTGTACGA 

AATAGCCTCCGGCGAGAACACCTCTCACCTGGACTTTTACGGCGTGGACGAGGCGGTGCTGCTCAGTGCCCTCCGGTTGCTGGAGGAAAAGGGCA 

GGTGTGAGCTGATCGAGATGGACGGCAGCCACGGCGTTAAGTTCTTCTAA 

(SEQ ID NO: 1352) 
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Start ATG: 1 (Reverse strand: CAT) 



MAEFQWPWEYTFPPFFTLQPHEETRQQQLKVWTDLFLKYLRHTNRFTLSIGDQNSPLFHNEALKRRLSPELVLAILGELERSGHANPLDKRRQEW 

QVYWFTLEEYGNMVYDWVQETGQTNTICTLYEIASGENTSHLDFYGVDEAVLLSALRLLEEKGRCELIEMDGSHGVKFF* 

(SEQ ID NO: 1353) 

Classification: hypothetical 



Celera Sequence No. : 142000013384523 

AATATAAAAGGCATTATGCCGAAATGAGTTTCCATACGATTTCTCTTTTAATGATTAATATGTTTTCCTCGATATTAAAATGCATAACTACGATT 
AAGTATGTGCTTGCGTTCTTAAAAAACAGTATGGTTTTCAGGGTAATTTTCTGTGTTATATGCATGTAGGTAACCATTTTTTTGATATATCTAAC 
AATTTGTACTTGGTTTTTAGCTCCACTAAGGAGTGTCTTGTTTTTTATTCAAACCATCCCTTATCCCCCATCTCTAAGTGACACAGCAAAGTTCA 
AAGCCCGCCTGCACCCATGTTTTTGAGCAGCGAGTTCGAGTTCCATTGACCAAGCGGTATATAATTGTTGGCCACAAAAGCGAAAACAGCTTTTC 
GGCCAAATAATAGTTGGAGAATAGAAAAAAAACTACAATCGACGCCCCCGCCAAGCGGAACAGAGCCACACATCCACAACCACAACCACATCCGT 
ATCCATGAAGTGCTCCACACATCGAAAACTACTTCTACATACATCTGGAGCCACATTAACACACTCTGACTACGCAACTGTGCGAAAATCGCTTT 
AGACGCTAACTGGAAACCGAGAAGCAGTTGGCTATGAGTGGGGCGCTTAATATGGATACAGTTAATTGACCTTGATTGAGTTTATTTCGGTTAAG 
GCAGCTGTCTGCGAGCCAAATGTCTCCATTCTGTTTGGTGTGAGTGTGTATCTGTGAGTGGGCTTCATCAACATCATTATCATTATGGAAGGTCT 
GCGGGAGGAAGTGGCACATTCCCGAGATTCCTGCCACAGCCTAATCAGCATAAAATAGTTCAGGGTGCAACTGCCTGCTTTCCAAAGTTTGAGAA 
AATATGACGAACTTCTTGCTACTATACTAATATTATTATACCGTTTCTCAAAATATATGTTGTAATTTAAATTAAATTGGTATACTTGTATACTT 
TTCATTTTTCAGTGTAGACCTCGCCTTAAGGTTATCTGTCTCCTTCTCGTCTAAGGATCAGATTCCCCCAGAACTTTTATGCCTGCAAATGCCCA 
ACTCCTTACACTTGGAGTGAGGAATTTTTCATTAGGAAGGAAGCAATTTCATGGCCAGTTATTTGAGCCCTCCGGCTGACGCCTGTGCGTGTCGC 
AC T AGAAACGC AC C AC C AC C CAT GCC C AACC C GC C CC C AC ACT G AAGGT GTG GC C AC C AT CC C CGC C CC G AGGT AT G C AAAAGGT AAAC ATT GGG 
CATTGAGCAACTCACCAGCTGACATGTTGAGACGTTTGCGCGCATCGCGTTTGGAGTAGAGGAACTGGAAGCCCGCGCCTAGTTTTTTGCCCGTC 
GAACCACTGCTGGCACCCACACCAACACCACCACTTGAGCCACCTCCTGCCCCACTGACGGCCGCTGCTCGAGCTGCCAATCCTGGCGAGCGGTA 
TCTCGCACTCTTGGCACTGCCAGTGGCCGAAACGGTGGACAAATTAGAGGGCGTGTGGTTGAGGGCGCTCTGAGACTTTGTCTTGGGGGTGCTGG 
TTATGGGAGCACCAACACCGGTGGTGGGTAACCGGGGTAGTGAATTCGTGCTAGCCACCACTTGGTTGGGATTCAGACTGCGCGGCAACGTGGAT 
CTGGCGCCGCCAAACTGATGCGAAGCTGCCTCATCGCTGGAGTCCGTCTCGTAAATGGCATCCGGTACACTGGAACGCCGCACATCCTTCGTAGA 
ATCACCGCCAATCGAGGATCCCGCGGAGGAGAGCCACGGCTTGGCCGGCGAGGACTCCAACTGGTGGTGATGGGCATCGCCGCCTCCAATGCTGC 
CGCAGGATCGGACGCGATCCAATTGCTCGCGGCTCAGGCTCTGCTGGTGCTGGCTGAAGAGAAGGTCGCCTGCCGATCGGCGCTTTGCCACAGCC 
AAAGCGGACGCAGACGACGAGTTATCCCTGTCCAGTTGGCTCAGCTGGCTGAGCCTCGTTCTGCTGTCGCCATTAGCCAGTAATGCCGACTCAAA 
ACTGCCGCGCCACGAGAATCGCTCGTAGTCGCTGGTCACGCCGCACTCCTCCCCGGCGTTGTCCGAGTGTCCGGGTCCGCCGGATCCTCCCGCTC 
CCGATCCCACGCCGCCGGCTGGATAATGCACCGTGGCATGGTGGTGCAGGTGATGCGAATGCGGCGTAGTGCCGTCCAGCAGTAGATCATGGGAT 
TTGCTTCGCAGCTTTAGCTGCTTGATGGGCATGGCTTTCAGTTGCTCCGCCACCTCATCTTCCTCATCCTCCGTCGTGGTGGCCGAGATCTTGGC 
CTGCAGTCTTCGATGATGCCGCTGTCCTTGCGGGGATGGAGTGGGTGTTTCATTGCCACTGCTAACTTCGTGCATCAGAGCTACCAATCTGGTAC 
CAATATGCTGCATCACCTTGGCAAACAGTTGCGGGGATCTCTCCGTGGACGGTTCTGCTTCCTCCACCGCTTTGGTGGCTTCTGTTTCCGGTTCT 
CCACCCGCCGCCACTTTGCTAAAGGAAGCTACCAATTTCTGATACACCAATGCTTCCATAGGCACACTTTGGCTCTCGTCCAGCAGTCCACCATC 
ACCACGATCACCCTCCTCGTCAATTTCAAACTTGTTGATGATCTGCTGACAGCTGGCGGAGATCTGAGCCTTCAGTTGGCCCTTCCTTTCACCGC 
CCTCCAGATCGGAGCTCGTGTAGTCCGAAGAATCGTAGGAATCAGACCATGTATCCTCACTGCTCAGGTACTCCACTATTTCGCGCACCTGCTCA 
TCATTGATGCCCGGCACGGTTTTCATCAGTCGGGTCAGCTCATTCTCAAAGTAGGCCAACTGAGCTGGCCGGGATTTGGCCAATCTGCGCTGGGA 
ACTGGGCGTGGTGTTGACATTGGTGGTGCTGGGCCCTTGATTGGGCGTGGAGGGCTTGCTCTGCTGTGCATTGGCGGTGGCGACAATGAGGTTCT 
TAAAGAACTGCCGCATCTTGCTCATCATCAGGTTGGTCTCCTCGTCGGAGCTCCAATTGTTGAAGCTATCCTTTCGCACCAGATTGGGCAGGGCA 
AGATTTTTCAGTTGGCTGGAAAGTGTGC TGGCTACAGCTGC AGCTGCTGAAGTGCTTGGTTGATCCATCCTGGCCAGACTAATCTGACTATGGAC 
CGGTTCCCGATCCTCACTGATCTGCTCCAGAGCTGGCGTTGGGCTCTTGGGTGGCTCACAATCTGGTGCCGCCTTCAGGAGATCATCGTTGCTGC 
CCACAAAGCCACTGTCCCGGCTGTGGTGATGCTGCTGTTGTTGCTTTTTCGCCTGCGACTGCATCTGCACTTGACCACTGCTGCCCGGGGTGGGA 
GTCCGGCACTCGGAAGGCTCTAGATCGTGGAGCTTCTTCTCGTCCAGAGAGTCGTGACGCTTGCGCAGTTGCTTCTTGCGTTTAATGGTGTCCGA 
GGAGGAACCATCCGAAGCTCCGGCATTGGCCAAATGAATCACAGTGTCGTCACAGCCCGCCTCCGATTCCGAGGTGTCCTCTACAGTGGCTGCAG 
CTGCTCCGCCTGCAGAAGTAACTGTGGCATCCAGGGTGTCTGATTCTGGCAGTAGCAAATTATCCAACGAGGAGTTGTGACTTCTAGGTGTTCCC 
CTGGCTCTTACCAGTCGCTTTCTCCGCTGACTGGGACTCGGTTGTCCTTCACTATCACTGCCTACACTGCCACCACTTTCATCGCTCTCCGCCTG 
CTTTTCTGCACCACTGAATCCCATGAAGCCGGAGAGGAAGTACTTTTCCAGACGGGAAGAGGCCAGATCGGAAGCTCCACTTGTATCCAACAGCT 
GATCCTCATGCGGTAGGGAGTCAAGACTCTCAGAGCACTCGGAACTCACTTCGGATGCAGTGTCATCTCCGCGTGTATCAATAGAACTCATTACT 
CGACCATCTCCCAGCCCAAAGAAGAAGTACTTCTCTAGTTCAGAGGCACTTAGTCGATGGTGGTGCTGCAAGGTGCGCTGCCTTCTCCGTTCGGA 
TAGATCCTCATCCTCCAGCTCGACGTCTTCATTGTCCGCCCGCATGGAGGCCACCTCGCTCTCCTCACAACTCTCCTCGACGATGGTGTGCAGGG 
TGAAGTCGATGGCAGATCTATTGTATCCCCTTAGGTCCTCCTCTCGATCCATATATCCATGTCGCCGGGCTCCTTCACCTTCTCCCTCCGCATCT 
TCGGAGTCTGTGGCCGTGGGTGTGGTGGTAGCATTGCTGAGATCTTCGTCCTCATCCTCGCCGCGCTGACTCAGTTCCTCCACCCAGGAGTCGTC 
ATCAGCCAAGCCGTCCTCTACGCTGAGTTCCGGCACAGCAGGCGGTTCGTTTAGCGAGTCCACACTGTACGCTTCGCGATCGCTGCTCACGAGTT 
CCGTTGGTTCCATGGCGCGAAATGAGTCACTCGCAAAGTTCTGGCCAACAGTGTGGCTGTTGTTGCTACTGGTCGGGTCTTTGTGTCTGATGTTG 
CTGAAGTTGCTGCTGCTGTTTGCCATTTCGTAATCGCCAGCTGCACTTTCATTGTCTTTGTTGATCATGACCAAACTGCGATACTTGGCTGAAAG 
TGAATTTTTGGGCGAAGCCAGTTCGCTGGGCGATGTTGCAGGTGTGTCAGATGTTGCTGCTGCTGCTGTTGCTGCAACTGCTGCTGATGCACTTG 
GAGTGTTTTGCTGATGACTACCAAATATTTTGTCGAAGTCACACTCGTTTTCACCACTCACATTTGAATTCACACTCACTACGTCACTAATATCG 
CAGCCGACAACCACACTCGAGCCGCTATTTCTCGTCTTTTGTGCTGCGAAAGCGTCATTATTATTATGGCCAAGTTCGCCGTCGTTGTTGTTAGT 
GTTTTCGTTATTGCTATTATCGTTTCTGTGTGTGTTGTTGTTATTGCTGCCGAGAAGCGCGGCACGAGAAGCGACCGCAACGTTTAACATTTCAT 
TTTCGACTGGCGACTCCGCCGACTCCGATTTCGCTCCACGGCGATATCGTTGCTCGCGAGCGTCGGCAGGTAGGCGAGCACTCTCTTCTACCTGC 
TGGAGCGGCTGCTGCTGCTGCTGCTCAGGTGCCAACAAACTCGCCGAAGTTGCCGAGAGTGAATCCAGCGAGTAGTCACTGTTGTCGTTTATTTT 
GCATGCTTGGCTGCATGTGACGACCCTCTCCTCGGTTTGCTCAGCTGTTCGGCCATTCCGCTCATCCCGCAGTTTGCCATCAGCTCTGTTTGCTT 
CTACCTCAACCACTGCCTCTACCTGCCGTAAGCAGGTGAGACTCAGCGGACCAACGTCATTAGAACTTAGCACGGTCGAGGGGCTTTTCCGAGAA 
GAAGGGCTATTGCATTCGTCGGGTTTCCCGGCACTACCTGGCTGCGTTGTGCAGTGGGTGGCAGCCACATCAGTGCCCGTGGTGGAGAGCTGCAG 
ATCCTCATTGATTTCGGCCACTCCCTGGCGCAAGCGTTGGAGCTTCAACTCCAGCTGAGAGTTCTGCTTGTGCAGCGCCAGCAATCGATTGACAT 
CGTTCGAGATGCTGTGCAGCTCATCGCAGGCGGCCAGGGCATCGCTGGCGTCTATCTCATGCGCTTGGCGGCGCTCTGATTGCTCGCCATCGAAC 
TCGACGTCTGGCAGCGATCGCATGTGGTGCAGCACCACGCCGGCGGCGGCGTCGTCCGTCTGTCCGCCCACCACTACAATGCAGTTGGCCAGCTC 
ATCGGTTGAGTTGGCAACACGGCGTTCACTTTCATCATCGTTAATGTTGTCTTCGTCACTTGATTGGCAATCACGTGCAGCTGTCGGCGGCTCCT 
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CCTCCGCCTTCTTGGGCTCCTCCTCTAGGGGTTTCACGTAGTTGAGCACAATTCCACAATCGGTGCGCTCGGTGTCTGTATCATTGTCAGAGGAA 
TCCTCCAAATTGGCCTCCGAGGAAGACAGAGAATCGCCCTCGGTGGAGGAACTGCGTCTTACCCGGCCGCCGGGCTGTCGATTTACGACGAACAC 
CTTGTTGAACCTCCTTCTCTTGGGCCTGCGATCGCGCACAGTTCCCGTATCATCGGAGTCACACGATTCCGAGCTGGAGCTGGAGTTTGAGCTGC 
TGCTGGAGCTGGAATCGCTGGTGGTGCTGTGGTGTCGTTGCTCCACTGCATCCACTTCTCCTAGTTGCTGCTCCTCCATCATTCCCGCAACGTTG 
TAGCTTCTTTGCGTGGTCCTACGCGGCTTTTGCTCCGCCTGCTCCTCCAACTCACGCTCCCTATCCTGATCTCCATCCTGGCTGCTGTTGCTGCT 
GGCCTCATCTGGACTACTGGCCGCATAGCTTTCGCTCTGCAGATGGGCCGCCGTCGATGGTGATGCCTCATCGGACATCTTGATGTTCTCTATAA 
GTCGATTGGAGGACAAATTCTTGGCAGTCTCCGCCGTCGCTCGATGCCGATCTATCTGCTCTGGGTGCTGGGCTCTGGGTTTGGGCTTGGGAATG 
GGACTGGCGGCACTGGCACTAGCACTTTCAGCCACATTTCCAACAGCATTTTCGACTGCAAGTCCGCCGAAACGAGTTATTACCGCAGGAGCCGC 
AGCTCTAGTTGCTGTTTGTATTTGTGCTGTTGCTGCGGCTGCCGCTGCTGGTACTTGATTTTTATGTATATTCGTTTCGTCTGCTGCTTCCACTT 
CTTCGTGGCTGTTTTTACTGGCCAGGTAGGTTTGTGTTGTTGCCGCTGACTGGGTTTCGCATAGATTTCGGCTCTGGTATTTGCTGGCCACTCTC 
AGAGCGGTTCCCTTTGTGGATGTCGCTGGTGTTGCTGCTGGCTCTGCTTTTGTTGATATTGCTGCTGTCGCGGCACTCTTATCTACACTGGCAAC 
ATGTGCCTGAACGCACTGAGGCACTGGCAATGTGCCGGTTTTATCATAAACATTACACAATCTACCCGTTGTGGGCGTTGCGTTGTTGCTGTTGC 
TGTTGGCGTTGTCGTCGCTCTCACTGTTGCTGCTGCCGCGACGCGCGTTGCTCGTTTCTAGTTGCTCACCCGTCGGCGAACGCGAAGCTACTAAA 
CTTGTTGGCTTTGGCTTGAAAGCACTGCTGGCCATTTGTTGATGTTGCTCTTGGTATTGTTGCTGCTGCTGCTGCATCGAGTTGGCAGCTCGCAT 
TGGACTGGCCAGCGTTTCCAGGCTCCTACCCGCCGTCCCATAGCCCGAGGTGCTCGACGACGACGAGGAGGATATGGTGGAGCAGAGCGACTGCG 
TCATCAGTGATCCCAGCGAGGCACTCAGCTCCGGTGCCGAGGACTTGAAGCCACCCAGTCCGCGCAAGCGCTCCGGAATTGGCGGCGGAGTGGTA 
GGCAGTTGTTGGTAGCTCTTGCTGGTCACCGGCGGCGTTGCATCGAACATGTTGCCACTGGCCGCTGCAAATTGCTGCGAATAGGCGGCAACTAT 
GGCGGCAAATTGCTGCTGGTGACCGGCCGCCTTCCATAGCTGCTGCAGACTGGCAGCCAGCTGCTCGTCCTGTTCCAGTAGTGGATTGCCTGACA 
CTTCTGGCCAGTTGGGATAGGCGGCCATGGGCACTGGTATGGGCACCGGGACGGGAATGGGCACGTACTGCGGCTGCTGCTGCTGCTGGCCGGGA 
TTGCAGTTGCATCGGTTGTGGCTGTGGTGGTGATGATTCTTTAAGAATCGCGTCTTAATCCGGGTCTTCATGCTCAGCGGATGTGGCGGTGGTGG 
CACGGGTGGTGGTGTTTGATCGGAGCTGGTCTGCGAACTGCTTTCGGTGTTCAGCACAAACTCCTCGTCGGTGCCGGTGTCACTCTCCCTGATCC 
TGGGACAATAGATATTCCTGGGCTGTTGGGGTCGCCTTTGCACCTGGGGAGAATCGCTATCGCTCGACTGACGCCTTACACGCGGCGGTGGAGTG 
GGCGTGGCTCGGGCTGTAGTATAGGCAATCATGGTAGTTGTCGTGGTCGTCGTGGTGGAGATCATGGAGGGCTCAAAGTAATCCCAATCGCAGTC 
TTCCACATTGCTCACAATGCTGTGACAGTGGTGCGTGGAGCAGGATGAATCCGTGTCATCCTCCTCCCCGTCCTCCTCCTCCTCTTCTTCCTCTT 
CGCCGTGACAGGACGACGATGAATCCGAGTTGAGAAAGACGAGCTTGAACTCGTCCGCTTCAACGCTATATTGCGACTGCTGCTGCTGTCGTCCG 
TGCTGAGTGGAACACCTGAAGTCTAGCTCCGATGACGACGAGGTTTCCACGGCATCGCCCCCACCTCCACATCCACCGATATCAACGGAAACCAC 
GGACAGCTGCTGCCCTATGCCTGTGGAGTAGCAGGGATGAAGTTTCGGGAACATACGTAAGTTCGGAGACAACCACTTTGTCGTTGACCAACACC 
ACGAAGCTGTTCGCAAATACTGGTTATTAGCACCAAACAAATACACACAGGTGAAATACTGGTTAGCCGGAGGGTGAGTTTTTCCACACCAAAGT 
TAAAGGTTGCAGATGGGCTAAGCACATGGACGGGTGGTGGTTGAACAAAAACTTACGGCTAGCATGGGTTAAGCTTGGATGGTTACACCGAAATA 

C C AC AG AT AT T AAT AG AT AG AAAAG GC T G GG AAGG AGTGT T AGT C AT T TT AT C AAGC C C AT T T T AT AG AT T T C AAT GT T AAT GGGT AT GAAC AAG 
TTTGTTAGCTGTCTAGTTACACGCCTTGTTAAAAAATGATTTCTACTTCAAATCGATATTAAGTTCTTGCACTGATATAAACAACATATTTAATA 
ACCCAGCACCAGGTTGAATAACTTCCCTATAAATATGTTCTGTTTATAATTGAGCGATAGAATGTTCAATAACTAAACTCAACATATACAGAACA 

C AAT GAT GAGT G AT C GT AAT GT C T AT CC AG T T C C AAT TT C AT AC AAAC T T G AAACT T AAAAT T T AAAT GG AT T C CAT AT T AC TTTTATTTTGACA 

ACACTAAAAGTTTCTCCGAAAAAATGCACTTCTTCCCAATAAGTTCAGAGCTGGGAAAAATCTTTAGTGCTGCAACGACCTTGAGTAAGTGAACT 

TCATCCTGACTAAAATTAAATATTCCGAAAATCATTTCCCCTGCTGTTCAAGTACCCATCCTGTCAGGAAGCAGTTCAAAGACCCTTGATTGATT 

ATGGTTGTTTGCCACAAGTTTTTCTGCCCAAGACAAGTACTAGCGAAAATTTGCCAAAAAGCTTTTAAAGTTGCCAACATCGCCGTCCTTCAAAC 

CCTTCTGCTTGGTTGGTTGCCCTAACAAAAACTGCAACAGCGGCGGATGCACATAGCGTCAATCCCTCGTAGCCACAAAGTAAACATTTGATCGC 

TGCCCACCCTGCTGTATGAGACACTGCGAGAAATATTTCAGAGAATTTTTAAAACTTCTTCTACCCACTCTTGCAACTAATAGTACAGGTGTTCA 

GTTTTTGGACACTTTAATATTAAGTAAACTCATTTTTAAGTCAAATATATTGTTCTGAGTGCTCAGAAGTTCTTGTCTAAACGCAAACTGTGCAC 

CGTTTTAGTCATTCACAGAGCAAAACCCACATAGGAGCACAAAAGGTTCAGGAGATAAGACTAATGCCACAGTTCTGCCGCTCCACGTGGTCCAA 

AAATCCGACGGGTGACAGCGTGGCTTTGGTGTGTACGTGGGCACCGACAGAACAAGGTACCAAATTCTGGAATCCTTCGAGGCGACAGTCCACCG 

CATCTCGAATTCGCACATAGACACAGGCTGCGTTGGTGGTCACAAAACCGAACGTACACCCACAAATCGCCAAATTGAAACATAACAAAAGGCGG 

CAAAGGCGTTTGGATACCCTGCCTTCCGCCCGGAAAATCCCCGCCTGAAAACTCTTGAAGTGCGGGGGAAGTGGTGTGGGTGGGGCGGTGGGAAA 

TCGATTTATGAGTTGCCAAAAAATTATCCCGCCAGCCCACGGCTAGGTTTTTGTATTTTGTTTCGGTCCCGTGCCACAAAATCTTTTGAAACCCG 

CCAACTTTGCTCTT AAAT TTGGTTCTTCGGTCGTTTGTGGTCATGCCGCC AAT AGCTTCGGGAAAAATTTAATTCCAATGGTGAATGGAAAGTCT 

GCCGTGAAGAAGGCGAAAATTGATGGAATACTTTTAGGAACTCGCAAGAGAAGTGAGCGAGAAAAAGTGTTGCACTCTTTTTGGAAACGCTTCTT 

AAAAAATTGATGAGGGAGATGGTTTCCGAAAAGAACCTGAATGTAAAATAGGACAGTTGCCAAGTACATCGATTATTTGAAACTCTGATTCTCTG 

CTACAGATTTGATGATTTTACTTAGCTTAACGTGTATGTATGTATTTTAATTTGAATGCATAATCCCTTAAGCTAAAACTATTTTAAAGCACTTT 

AGTCCATTAGCACAACTAGTTTTCTATGATTAAATTGATAGGATTAGTTTTTTAGGTTAGTTTCGCCCAATTAAAGTGAATAAAGTCCAGGGCAA 

CTTCTGGTTAAGCACTAATTTTTAGACACTGACAACCAAAGAAGCGCCCACCTTCATTCCCAGAAGTGGAGCTTAATTATTTGAGCCAAATTAGA 

CTGAAGGCGAAGCCCTTCACAAAGAGTTTGTTAGTCACCCAACTACCCTCTATCCGCATCAGCATCAAAAGTAGCATTACCATCACCAAGTTAAT 

TACGCTCCAAACATACATTTAGCTACACAAAGACAACTACAGGGTATGCACTGAGACAAAGTCTTGGAAAATAAAATTCGCAGAAATTATACTTT 

AAAACATCATTTTAGTAATGTGACTGAACTTTTTGATAAATAGTTCTCGTAAAAAGGATAGTTTATTTTTGACTTTCCATTGACTTTTCTCTCAG 

TGTACCGAGGCTAAGGACACTTCTTATCACTAACCTTCTTCGGCGGCCTTCTCCTTCTCCTTGCGACGCTGCAGCACGACCTTCTTGAAGACAAC 

CGCCGCGGCCTCCACTGCATTCTGCAGCTCCTGTTCGCCCAAAGTGCTTGTGTGGGTGGCCTCCTAATGAAAAATGAAATGGGGGAGCAGGATGA 

AAAGGTTTCACGCTTAAGCCTACTGGTCTCCGGACGCTTGATGTTCTACTCACCGCCGAGCTGTTGGGATCGCCGCCCTCCTTGTTCGCCGAGGA 

CTCGAGGGACTTGCCATTGACCGGCTGCTCCGCTCCAGATCCTTGGCCGGATTCCCCGTGGGGCTTCTCCTCCTCTTGCCCTGTGGATGCTACTG 

AGGCGACAGGAATTTCTGTTTTTTTCAGGCGCATGGCTAAATCAGAGTTTCTATGCATAAGCTGTTGATGGAAGGAGTAAATGTGCTGGGGCGTG 

CTACCTGTGGGGGCGGTTAACAGGAGGGGGTGCAAAGCGTTCATCAATCGGGTGCCGAAACAAGCGCAGCTACTCAAGGTTCTACCAGGGACCGC 

AAAAATTAATTAATTAATTAATTCATTCATTAGCCGTAAATACATAGTTACTTATGCGCAGGCAATCACTACTTTGGGAAGCTGGGGTGGGGATA 

CTTCACTTGTATTAGGTGTAAAAATTTAAGTTTCTAATCACTGAATGGTTTATTGCTTGAACTTATATTTGTTAAGGAAAGCGATTTAAATTTTA 

CTAAGGGGTGTAATATTGGCTTTATATTCTTTTTAATAGAATAATAGTATAACTTTTGATAAAACAGACATTGAACATGCAATTTATCAAATAAA 

GAGCACAACAATAATCACTTGAAGCTAGATTTGTTTTTTTTTTGTAGTTTTGCTTAAGCGAGGGACAAAGTTTGAATAGTTTATTTTGTAGTGTG 

TAATGTATATTTTCTACAAAATATATTTGTACAGATAGCTATCTACATAATATTGCTTTACCATCTTTGGTTTATTGTGTATATTTGATATGTTT 

CTAATGAC AGT GAGCG ATT ATT TTATGGCTTAAAACTTGTGAATGCAAATGATAGGTGCTTGTGAAATAATTGGT AAT AAT TGCGGTCCCT GGGT 

TCTACGGATGCGTGATCGCAGATTGCAACCATAGCTACTATCAAAAATGACGTATCCTCGGTACTTACCCTCCTCAGCTTCGCCGATTCCTGTGC 

GACTGTTGCGGCGCCTCGCCTGCGGAATCTTCGAGTTGCCACGCGTTTTATCCAGGATATCAGCCAGAGGACGAGGGCTGGGTATTTTCGATATT 

TCCGTTTTCTTTGTGGGCTGATCAGCGAAGTATTAATTAATGTATTTTCACCTAGATCAAGGTTTGAGGTCAACTCACCTGAGTGTCCTTTTCGG 

GATGCGAGAAGGATCGTCGTGTGACGGGCTTGGGCGTGGTCTTCGCTCCAGAATCCTCGGAAACAGAGCGCATCTGCTCCGGTTTTCTTCTCTCG 

GCAGCATGGCTATCGGTGGCACTTCCACTGGCTGCACCTTCATTGCCCGAACCTTTGCCATTGGCATCCGAGACCTGTGAGAGTTGGGCCAGTTC 

CTCCTGCTGCATCTGCAGTTCCCTGGATTGTCGGCGCCTAATGCGATTGTGCATTAGAGGAGTACGGGGTAGTGCTGCTCCGCCATAGAAGCCTT 
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GTGGGGTGCCCATGTCCGGATAATAGCCGGGCACACCACCTGAACCGCCACTGGCGCCTAGTCCCACGCCAGCTCCTCCCCCGTAACCCGAATCA 
GCAAGGACATTCTCGGCACTGTAAATCGGTCTTTGGGAGGAGTTCTGGTAGCTTCCATCATCCGAACCATTGTTGTAGATGATACGTGGCTCACT 
GCTAAATCCGGCATTATAGTTTCGTTCGTACTGCACCTCGTCGGACTCCAAGAAACTGGTCGCACTGTCGCCAAAGTAATCCACATATTCCGGGC 
GATTATCCAGCTCCTCAGTGACATCGGGAAACTGATGGAACTGCTGCTTGGCCTCCCCGAAAGTTCTTTCCTCCTTCAGGCAATCCTGGCGCACC 
AGGCGACTGGGCGATCGACCACCAGGCCGCTGCACAGCCGTTGGGATCTCCGGCAGCAATCTTCTGTTCTCCGGGTGTCCCTTGGTGTTCGAGTA 
CTCCTCGTTGCTGTGGCTGCGCACCTTGATCATCTTGGAATGCGGATCACGGGAGTGATGGATGTTTACACTCTGACTACTGGAGTACTGCTCAC 
CACCGCCGACGACCCCGGTGCTGGCCACACTGCCTCCGCTTCCCAGTTCGGTGCCTCCCACGGCGGGCACATGTGGCTGCGAGAGAAATTGCCGG 
GGATTCGGAACACTTGGGAAAAGGAAATTCTGTTTGGCCCTCGGCGAAATAGGCATGAATTTTGCCGGAGGAGTAGAGTGCACGGTGAGTGTGTT 
TCCACTGCTAAACGCTTCCGGATTTGGCAGAGTTTGCTGACGCATGAATTCCGGAGAACGTCGGAATTGCGGGGAAGTCTGTCGCTTTGGCGGAG 
ATGTGGGCAGTAGCTTGACATGCTGCTCGGGATTTGGTAGAGTTGCCTGCCGCGTCATTCTCGGCTTGGGCATGGGTCCCACATCCTGCTCGTCG 
CTGGCCAAACTGGACGTTTGTGATTCGTAATTCAGTGGTGGCGGAGCCACTGTCGGCGGAGGTGCTCCCGTGCTGGGCAGCAATTTGGTGCGATG 
ATAGGTGCTGGGCGGCTCTGGGAACGAGGTCTGCCTCATGAAGGGCGGATAGCGCTGCTGCTCACCCGGCGATCTTTCCGGGGAGCGACTGTACA 
GCCGGTTCGGGGATGTGGACATATATATAGCCGGAGTTGGTTGAGCTGGCAGCGTGGATTGGCGGCGCACCTGAAGCCGATGATGCGTGGGTGTC 
ACCTCCAGTTGGGGCTGCTGTTGCTGCGGCTGCTGCTGCTGTACGTAGTTCTCGCAGTCACCCACCGTCACACCAACTCCCACTCCCATTCCGGG 
CATGGTGTTCATGTGGATGACGTCGTCGCTGTTGTAGGCGCTCGGATAGAGCGGACTCTTCTGCAGTTGATGCGGCGATGTGGGCGCCGTTAGAT 
TCACGGAATTGTAATTGGCCAGATTGCCACCACTGCCGCTGAGAGCCGGGTCGTTGTTCAGGCTACTCGGAAGCGTCGACTGACGCAGGACCCTC 
CGTCCCAGCTGCCACTTGCTACTGGTTGTGGGCGTGGCAGTGCCACTTCCGTAATTGATGCCGTGCGTGGGTCGCGGCTGTGGGGGCAGGTGATG 
CTGCTGGTGGTGCTGGTGCAGGGTGGGGTAGCTGCCCGCTGATTGGGCCTCAATCAAATGCTGCTGCTGCGGATATAGGTTGTCCGCCACCGGCT 
GTTGTGAGTTGGGATTGTTCGGATCGTCCTCGGGATCTGTCGGGATAAAGGATTTTCAATAATCCGTTCTAACAAATTGCAATATTTTTTTTTTT 
AAAAAATCTTGGAAAAACTATTAAGCACTTAAAACTTATTTTAAATGCATGACTTTTGCTGTTCGCAGTCTGTTTAAAAAATGTACTTACGTAGT 
TGAATAATCCTTGAAGGATTCTTTCAAACATGCATATACTCAATCAAAGTCTCTTAGCCATATAATGAAAGTAAAATCCAGAGTACTTACTCGAT 
ATGGTCCAAGAACGCCGGATGCTCTTCCTCAAGACGTCGCTGGTGGTCAGATGGTCGACGGCACTGCAGGATCCCAGGGCATAGGCGTCCTTCAG 
CCACTCGCCGCTGGCCGACTGTATCTCCCTGCAAGAGCAACAACATTGCCACTTTAGAAACCAGAAAACCTCGCCAAACATTTTCCCAACTTTCG 
TTGGAGACGTCAAACAGTCAACAAGTAAATCGGCGACTCAGCGGTATCACGTCTGTCTGTCCTCGCGACATTTGGCAGACTGGCTGGCTGACTTA 

C T C AAACG AAC AC T GAAT GC C GGAT T GT C GGT GT C AC AGC AGC AGC AAC AGC AAC AT GG AAC AT G AAAC AC C AAC AC AC AC C AT G AC T T T AAG CC 
AAGGTGTGAGCCAACGGAAGAGTTAAAGCCAGCACAGACTGCCTTGGGTTCGCACAAACAGACAGAAATGTATTTTTGGGGAAAAATTTGACGGC 

G AAAC C AC AAC G GAAAAGT T T G GGGC T C AG AT AC C AAAGAAAT AAT T AC AT AT T T AAT GAAT AAG AC AAAC T T C AAC C GGAAGT T GGT CC TGC T T 
ATGTTTAACTTTAATCTCACGTTTGGGTTTTAAATAATATTAAAAGTTAGCTTCATATGGCCATACATTGCATTACACTTCCATTTAATTTGAAT 
AGCCATACTAAGAACTGGAAACGTATTTATAATTACCAAGAACTATATGATTGGAAAAGCATTTGTACAATTTAATTAAGTTCCTGGGAAAGATA 
AACTCTTTATTAAATTTCTGAAAACTTGATTATTTTAGCGGGTGATATAACACCCACTTATGTATTAAGCAAATAATAATTTATATATTACTAGA 
AAAGCTTTTTGGTAACGCACAATGGGGTTTCGAGAAAAAAGCGTTATTAATAACCCATTTGTACGTATATACAATTTAATATAATCAATGCGTGG 

T TGC C CAG AT AC AAT AAAAAAT AAG AAT T C AAT G T C T GGC C AG AAAT T C AT G AAAT AAT T GT AT TTCTTCGGC C AGAC ACT G TGCTTTTGTGCCT 
GACTTGGGGGTGCAGTCGAAGCAATTGGATAAATTCCTGCTGAGAGGCAGACGCAAATGCAGACGCCCTCCATCAAAATAAAAAAGGGGCTGGGT 

CCATCGTGGTAGTCATCATGTGGCATTCATAATTAT 
(SEQ ID NO: 1354) 

Exon: 14711.. 14531 
Exon: 14286. .12762 
Exon: 12665.. 12239 
Exon: 12177.. 12039 
Exon: 11309. .11074 
Exon: 10988.. 10865 
Exon: 8285. .1251 
Exon: 1032.. 1001 

Start ATG: 14711 (Reverse strand: CAT) 

atgtttggcLggttttS 

TGCCCTGGGATCCTGCAGTGCCGTCGACCATCTGACCACCAGCGACGTCTTGAGGAAGAGCATCCGGCGTTCTTGGACCATATCGAATCCCGAGG 
ACGATCCGAACAATCCCAACTCACAACAGCCGGTGGCGGACAACCTATATCCGCAGCAGCAGCATTTGATTGAGGCCCAATCAGCGGGCAGCTAC 
CCCACCCTGCACCAGCACCACCAGCAGCATCACCTGCCCCCACAGCCGCGACCCACGCACGGCATCAATTACGGAAGTGGCACTGCCACGCCCAC 
AACCAGTAGCAAGTGGCAGCTGGGACGGAGGGTCCTGCGTCAGTCGACGCTTCCGAGTAGCCTGAACAACGACCCGGCTCTCAGCGGCAGTGGTG 
GCAATCTGGCCAATTACAATTCCGTGAATCTAACGGCGCCCACATCGCCGCATCAACTGCAGAAGAGTCCGCTCTATCCGAGCGCCTACAACAGC 
GACGACGTCATCCACATGAACACCATGCCCGGAATGGGAGTGGGAGTTGGTGTGACGGTGGGTGACTGCGAGAACTACGTACAGCAGCAGCAGCC 
GCAGCAACAGCAGCCCCAACTGGAGGTGACACCCACGCATCATCGGCTTCAGGTGCGCCGCCAATCCACGCTGCCAGCTCAACCAACTCCGGCTA 
TATATATGTCCACATCCCCGAACCGGCTGTACAGTCGCTCCCCGGAAAGATCGCCGGGTGAGCAGCAGCGCTATCCGCCCTTCATGAGGCAGACC 
TCGTTCCCAGAGCCGCCCAGCACCTATCATCGCACCAAATTGCTGCCCAGCACGGGAGCACCTCCGCCGACAGTGGCTCCGCCACCACTGAATTA 
CGAATCACAAACGTCCAGTTTGGCCAGCGACGAGCAGGATGTGGGACCCATGCCCAAGCCGAGAATGACGCGGCAGGCAACTCTACCAAATCCCG 
AGCAGCATGTCAAGCTACTGCCCACATCTCCGCCAAAGCGACAGACTTCCCCGCAATTCCGACGTTCTCCGGAATTCATGCGTCAGCAAACTCTG 
CCAAATCCGGAAGCGTTTAGCAGTGGAAACACACTCACCGTGCACTCTACTCCTCCGGCAAAATTCATGCCTATTTCGCCGAGGGCCAAACAGAA 
TTTCCTTTTCCCAAGTGTTCCGAATCCCCGGCAATTTCTCTCGCAGCCACATGTGCCCGCCGTGGGAGGCACCGAACTGGGAAGCGGAGGCAGTG 
TGGCCAGCACCGGGGTCGTCGGCGGTGGTGAGCAGTACTCCAGTAGTCAGAGTGTAAACATCCATCACTCCCGTGATCCGCATTCCAAGATGATC 
AAGGTGCGCAGCCACAGCAACGAGGAGTACTCGAACACCAAGGGACACCCGGAGAACAGAAGATTGCTGCCGGAGATCCCAACGGCTGTGCAGCG 
GCCTGGTGGTCGATCGCCCAGTCGCCTGGTGCGCCAGGATTGCCTGAAGGAGGAAAGAACTTTCGGGGAGGCCAAGCAGCAGTTCCATCAGTTTC 
CCGATGTCACTGAGGAGCTGGATAATCGCCCGGAATATGTGGATTACTTTGGCGACAGTGCGACCAGTTTCTTGGAGTCCGACGAGGTGCAACCG 
ATTTACAGTGCCGAGAATGTCCTTGCTGATTCGGGTTACGGGGGAGGAGCTGGCGTGGGACTAGGCGCCAGTGGCGGTTCAGGTGGTGTGCCCGG 
CTATTATCCGGACATGGGCACCCCACAAGGCTTCTATGGCGGAGCAGCACTACCCCGTACTCCTCTAATGCACAATCGCATTAGGCGCCGACAAT 
CCAGGGAACTGCAGATGCAGCAGGAGGAACTGGCCCAACTCTCACAGGTCTCGGATGCCAATGGCAAAGGTTCGGGCAATGAAGGTGCAGCCAGT 
GGAAGTGCCACCGATAGCCATGCTGCCGAGAGAAGAAAACCGGAGCAGATGCGCTCTGTTTCCGAGGATTCTGGAGCGAAGACCACGCCCAAGCC 
CGTCACACGACGATCCTTCTCGCATCCCGAAAAGGACACTCAGCCCACAAAGAAAACGGAAATATCGAAAATACCCAGCCCTCGTCCTCTGGCTG 
ATATCCTGGATAAAACGCGTGGCAACTCGAAGATTCCGCAGGCGAGGCGCCGCAACAGTCGCACAGGAATCGGCGAAGCTGAGGAGGGTAGCACG 
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CCCCAGCACATTTACTCCTTCCATCAACAGCTTATGCATAGAAACTCTGATTTAGCCATGCGCCTGAAAAAAACAGAAATTCCTGTCGCCTCAGT 
AGCATCCACAGGGCAAGAGGAGGAGAAGCCCCACGGGGAATCCGGCCAAGGATCTGGAGCGGAGCAGCCGGTCAATGGCAAGTCCCTCGAGTCCT 
CGGCGAACAAGGAGGGCGGCGATCCCAACAGCTCGGCGGAGGCCACCCACACAAGCACTTTGGGCGAACAGGAGCTGCAGAATGCAGTGGAGGCC 
GCGGCGGTTGTCTTCAAGAAGGTCGTGCTGCAGCGTCGCAAGGAGAAGGAGAAGGCCGCCGAAGAAGGCATAGGGCAGCAGCTGTCCGTGGTTTC 
CGTTGATATCGGTGGATGTGGAGGTGGGGGCGATGCCGTGGAAACCTCGTCGTCATCGGAGCTAGACTTCAGGTGTTCCACTCAGCACGGACGAC 
AGCAGCAGCAGTCGCAATATAGCGTTGAAGCGGACGAGTTCAAGCTCGTCTTTCTCAACTCGGATTCATCGTCGTCCTGTCACGGCGAAGAGGAA 
G AAG AGG AGGAGGAGG AC GGGG AGG AG G ATGAC AC GG A T TC ATC C TGC TCCACGCACCACTGT C AC AGC AT T GT G AGC AAT GT GG AAG AC T GC G A 
TTGGGATTACTTTGAGCCCTCCATGATCTCCACCACGACGACCACGACAACTACCATGATTGCCTATACTACAGCCCGAGCCACGCCCACTCCAC 
CGCCGCGTGTAAGGCGTCAGTCGAGCGATAGCGATTCTCCCCAGGTGCAAAGGCGACCCCAACAGCCCAGGAATATCTATTGTCCCAGGATCAGG 
GAGAGTGACACCGGCACCGACGAGGAGTTTGTGCTGAACACCGAAAGCAGTTCGCAGACCAGCTCCGATCAAACACCACCACCCGTGCCACCACC 
GCCACATCCGCTGAGCATGAAGACCCGGATTAAGACGCGATTCTTAAAGAATCATCACCACCACAGCCACAACCGATGCAACTGCAATCCCGGCC 
AGCAGCAGCAGCAGCCGCAGTACGTGCCCATTCCCGTCCCGGTGCCCATACCAGTGCCCATGGCCGCCTATCCCAACTGGCCAGAAGTGTCAGGC 
AATCCACTACTGGAACAGGACGAGCAGCTGGCTGCCAGTCTGCAGCAGCTATGGAAGGCGGCCGGTCACCAGCAGCAATTTGCCGCCATAGTTGC 
CGCCTATTCGCAGCAATTTGCAGCGGCCAGTGGCAACATGTTCGATGCAACGCCGCCGGTGACCAGCAAGAGCTACCAACAACTGCCTACCACTC 
CGCCGCCAATTCCGGAGCGCTTGCGCGGACTGGGTGGCTTCAAGTCCTCGGCACCGGAGCTGAGTGCCTCGCTGGGATCACTGATGACGCAGTCG 
CTCTGCTCCACCATATCCTCCTCGTCGTCGTCGAGCACCTCGGGCTATGGGACGGCGGGTAGGAGCCTGGAAACGCTGGCCAGTCCAATGCGAGC 
TGCCAACTCGATGCAGCAGCAGCAGCAACAATACCAAGAGCAACATCAACAAATGGCCAGCAGTGCTTTCAAGCCAAAGCCAACAAGTTTAGTAG 
CTTCGCGTTCGCCGACGGGTGAGCAACTAGAAACGAGCAACGCGCGTCGCGGCAGCAGCAACAGTGAGAGCGACGACAACGCCAACAGCAACAGC 
AACAACGCAACGCCCACAACGGGTAGATTGTGTAATGTTTATGATAAAACCGGCACATTGCCAGTGCCTCAGTGCGTTCAGGCACATGTTGCCAG 
TGTAGATAAGAGTGCCGCGACAGCAGCAATATCAACAAAAGCAGAGCCAGCAGCAACACCAGCGACATCCACAAAGGGAACCGCTCTGAGAGTGG 
CCAGCAAATACCAGAGCCGAAATCTATGCGAAACCCAGTCAGCGGCAACAACACAAACCTACCTGGCCAGTAAAAACAGCCACGAAGAAGTGGAA 
GCAGCAGACGAAACGAATATACATAAAAATCAAGTACCAGCAGCGGCAGCCGCAGCAACAGCACAAATACAAACAGCAACTAGAGCTGCGGCTCC 
TGCGGTAATAACTCGTTTCGGCGGACTTGCAGTCGAAAATGCTGTTGGAAATGTGGCTGAAAGTGCTAGTGCCAGTGCCGCCAGTCCCATTCCCA 
AGCCCAAACCCAGAGCCCAGCACCCAGAGCAGATAGATCGGCATCGAGCGACGGCGGAGACTGCCAAGAATTTGTCCTCCAATCGACTTATAGAG 
AACATCAAGATGTCCGATGAGGCATCACCATCGACGGCGGCCCATCTGCAGAGCGAAAGCTATGCGGCCAGTAGTCCAGATGAGGCCAGCAGCAA 
CAGCAGCCAGGATGGAGATCAGGATAGGGAGCGTGAGTTGGAGGAGCAGGCGGAGCAAAAGCCGCGTAGGACCACGCAAAGAAGCTACAACGTTG 
CGGGAATGATGGAGGAGCAGCAACTAGGAGAAGTGGATGCAGTGGAGCAACGACACCACAGCACCACCAGCGATTCCAGCTCCAGCAGCAGCTCA 
AACTCCAGCTCCAGCTCGGAATCGTGTGACTCCGATGATACGGGAACTGTGCGCGATCGCAGGCCCAAGAGAAGGAGGTTCAACAAGGTGTTCGT 
CGTAAATCGACAGCCCGGCGGCCGGGTAAGACGCAGTTCCTCCACCGAGGGCGATTCTCTGTCTTCCTCGGAGGCCAATTTGGAGGATTCCTCTG 
ACAATGATACAGACACCGAGCGCACCGATTGTGGAATTGTGCTCAACTACGTGZ^AACCCCTAGAGGAGGAGCCCAAGAAGGCGGAGGAGGAGCCG 
CCGACAGCTGCACGTGATTGCCAATCAAGTGACGAAGACAACATTAACGATGATGAAAGTGAACGCCGTGTTGCCAACTCAACCGATGAGCTGGC 
CAACTGCATTGTAGTGGTGGGCGGACAGACGGACGACGCCGCCGCCGGCGTGGTGCTGCACCACATGCGATCGCTGCCAGACGTCGAGTTCGATG 
GCGAGCAATCAGAGCGCCGCCAAGCGCATGAGATAGACGCCAGCGATGCCCTGGCCGCCTGCGATGAGCTGCACAGCATCTCGAACGATGTCAAT 
CGATTGCTGGCGCTGCACAAGCAGAACTCTCAGCTGGAGTTGAAGCTCCAACGCTTGCGCCAGGGAGTGGCCGAAATCAATGAGGATCTGCAGCT 
CTCCACCACGGGCACTGATGTGGCTGCCACCCACTGCACAACGCAGCCAGGTAGTGCCGGGAAACCCGACGAATGCAATAGCCCTTCTTCTCGGA 
AAAGCCCCTCGACCGTGCTAAGTTCTAATGACGTTGGTCCGCTGAGTCTCACCTGCTTACGGCAGGTAGAGGCAGTGGTTGAGGTAGAAGCAAAC 
AGAGCTGATGGCAAACTGCGGGATGAGCGGAATGGCCGAACAGCTGAGCAAACCGAGGAGAGGGTCGTCACATGCAGCCAAGCATGCAAAATAAA 
CGACAACAGTGACTACTCGCTGGATTCACTCTCGGCAACTTCGGCGAGTTTGTTGGCACCTGAGCAGCAGCAGCAGCAGCCGCTCCAGCAGGTAG 
AAGAGAGTGCTCGCCTACCTGCCGACGCTCGCGAGCAACGATATCGCCGTGGAGCGAAATCGGAGTCGGCGGAGTCGCCAGTCGAAAATGAAATG 
TTAAACGTTGCGGTCGCTTCTCGTGCCGCGCTTCTCGGCAGCAATAACAACAACACACACAGAAACGATAATAGCAATAACGAAAACACTAACAA 
CAACGACGGCGAACTTGGCCATAATAATAATGACGCTTTCGCAGCACAAAAGACGAGAAATAGCGGCTCGAGTGTGGTTGTCGGCTGCGATATTA 
GTGACGTAGTGAGTGTGAATTCAAATGTGAGTGGTGAAAACGAGTGTGACTTCGACAAAATATTTGGTAGTCATCAGCAAAACACTCCAAGTGCA 
TCAGCAGCAGTTGCAGCAACAGCAGCAGCAGCAACATCTGACACACCTGCAACATCGCCCAGCGAACTGGCTTCGCCCAAAAATTCACTTTCAGC 
CAAGTATCGCAGTTTGGTCATGATCAACAAAGACAATGAAAGTGCAGCTGGCGATTACGAAATGGCAAACAGCAGCAGCAACTTCAGCAACATCA 
GACACAAAGACCCGACCAGTAGCAACAACAGCCACACTGTTGGCCAGAACTTTGCGAGTGACTCATTTCGCGCCATGGAACCAACGGAACTCGTG 
AGCAGCGATCGCGAAGCGTACAGTGTGGACTCGCTAAACGAACCGCCTGCTGTGCCGGAACTCAGCGTAGAGGACGGCTTGGCTGATGACGACTC 
CTGGGTGGAGGAACTGAGTCAGCGCGGCGAGGATGAGGACGAAGATCTCAGCAATGCTACCACCACACCCACGGCCACAGACTCCGAAGATGCGG 
AGGGAGAAGGTGAAGGAGCCCGGCGACATGGATATATGGATCGAGAGGAGGACCTAAGGGGATACAATAGATCTGCCATCGACTTCACCCTGCAC 
ACCATCGTCGAGGAGAGTTGTGAGGAGAGCGAGGTGGCCTCCATGCGGGCGGACAATGAAGACGTCGAGCTGGAGGATGAGGATCTATCCGAACG 
GAGAAGGCAGCGCACCTTGCAGCACCACCATCGACTAAGTGCCTCTGAACTAGAGAAGTACTTCTTCTTTGGGCTGGGAGATGGTCGAGTAATGA 
GTTCTATTGATACACGCGGAGATGACACTGCATCCGAAGTGAGTTCCGAGTGCTCTGAGAGTCTTGACTCCCTACCGCATGAGGATCAGCTGTTG 
GATACAAGTGGAGCTTCCGATCTGGCCTCTTCCCGTCTGGAAAAGTACTTCCTCTCCGGCTTCATGGGATTCAGTGGTGCAGAAAAGCAGGCGGA 
GAGCGATGAAAGTGGTGGCAGTGTAGGCAGTGATAGTGAAGGACAACCGAGTCCCAGTCAGCGGAGAAAGCGACTGGTAAGAGCCAGGGGAACAC 
CTAGAAGTCACAACTCCTCGTTGGATAATTTGCTACTGCCAGAATCAGACACCCTGGATGCCACAGTTACTTCTGCAGGCGGAGCAGCTGCAGCC 
ACTGTAGAGGACACCTCGGAATCGGAGGCGGGCTGTGACGACACTGTGATTCATTTGGCCAATGCCGGAGCTTCGGATGGTTCCTCCTCGGACAC 
CATTAAACGCAAGAAGCAACTGCGCAAGCGTCACGACTCTCTGGACGAGAAGAAGCTCCACGATCTAGAGCCTTCCGAGTGCCGGACTCCCACCC 
CGGGCAGCAGTGGTCAAGTGCAGATGCAGTCGCAGGCGAAAAAGCAACAACAGCAGCATCACCACAGCCGGGACAGTGGCTTTGTGGGCAGCAAC 
GATGATCTCCTGAAGGCGGCACCAGATTGTGAGCCACCCAAGAGCCCAACGCCAGCTCTGGAGCAGATCAGTGAGGATCGGGAACCGGTCCATAG 
TCAGATTAGTCTGGCCAGGATGGATCAACCAAGCACTTCAGCAGCTGCAGCTGTAGCCAGCACACTTTCCAGCCAACTGAAAAATCTTGCCCTGC 
CCAATCTGGTGCGAAAGGATAGCTTCAACAATTGGAGCTCCGACGAGGAGACCAACCTGATGATGAGCAAGATGCGGCAGTTCTTTAAGAACCTC 
ATTGTCGCCACCGCCAATGCACAGCAGAGCAAGCCCTCCACGCCCAATCAAGGGCCCAGCACCACCAATGTCAACACCACGCCCAGTTCCCAGCG 
CAGATTGGCCAAATCCCGGCCAGCTCAGTTGGCCTACTTTGAGAATGAGCTGACCCGACTGATGAAAACCGTGCCGGGCATCAATGATGAGCAGG 
TGCGCGAAATAGTGGAGTACCTGAGCAGTGAGGATACATGGTCTGATTCCTACGATTCTTCGGACTACACGAGCTCCGATCTGGAGGGCGGTGAA 
AGGAAGGGCCAACTGAAGGCTCAGATCTCCGCCAGCTGTCAGCAGATCATCAACAAGTTTGAAATTGACGAGGAGGGTGATCGTGGTGATGGTGG 
ACTGCTGGACGAGAGCCAAAGTGTGCCTATGGAAGCATTGGTGTATCAGAAATTGGTAGCTTCCTTTAGCAAAGTGGCGGCGGGTGGAGAACCGG 
AAACAGAAGCCACCAAAGCGGTGGAGGAAGCAGAACCGTCCACGGAGAGATCCCCGCAACTGTTTGCCAAGGTGATGCAGCATATTGGTACCAGA 
TTGGTAGCTCTGATGCACGAAGTTAGCAGTGGCAATGAAACACCCACTCCATCCCCGCAAGGACAGCGGCATCATCGAAGACTGCAGGCCAAGAT 
CTCGGCCACCACGACGGAGGATGAGGAAGATGAGGTGGCGGAGCAACTGAAAGCCATGCCCATCAAGCAGCTAAAGCTGCGAAGCAAATCCCATG 
ATCTACTGCTGGACGGCACTACGCCGCATTCGCATCACCTGCACCACCATGCCACGGTGCATTATCCAGCCGGCGGCGTGGGATCGGGAGCGGGA 
GGATCCGGCGGACCCGGACACTCGGACAACGCCGGGGAGGAGTGCGGCGTGACCAGCGACTACGAGCGATTCTCGTGGCGCGGCAGTTTTGAGTC 
GGCATTACTGGCTAATGGCGACAGCAGAACGAGGCTCAGCCAGCTGAGCCAACTGGACAGGGATAACTCGTCGTCTGCGTCCGCTTTGGCTGTGG 
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CAAAGCGCCGATCGGCAGGCGACCTTCTCTTCAGCCAGCACCAGCAGAGCCTGAGCCGCGAGCAATTGGATCGCGTCCGATCCTGCGGCAGCATT 
GGAGGCGGCGATGCCCATCACCACCAGTTGGAGTCCTCGCCGGCCAAGCCGTGGCTCTCCTCCGCGGGATCCTCGATTGGCGGTGATTCTACGAA 
GGATGTGCGGCGTTCCAGTGTACCGGATGCCATTTACGAGACGGACTCCAGCGATGAGGCAGCTTCGCATCAGTTTGGCGGCGCCAGATCCACGT 
TGCCGCGCAGTCTGAATCCCAACCAAGTGGTGGCTAGCACGAATTCACTACCCCGGTTACCCACCACCGGTGTTGGTGCTCCCATAACCAGCACC 
CCCAAGACAAAGTCTCAGAGCGCCCTCAACCACACGCCCTCTAATTTGTCCACCGTTTCGGCCACTGGCAGTGCCAAGAGTGCGAGATACCGCTC 
GCCAGGATTGGCAGCTCGAGCAGCGGCCGTCAGTGGGGCAGGAGGTGGCTCAAGTGGTGGTGTTGGTGTGGGTGCCAGCAGTGGTTCGACGGGCA 
AAAAACTAGGCGCGGGCTTCCAGTTCCTCTACTCCAAACGCGATGCGCGCAAACGTCTCAACATGTCAGCTGGCATAAAAGTTCTGGGGGAATCT 
GATCCTTAG 
(SEQ ID NO: 1355) 

Start ATG: 1 (Reverse strand: CAT) 

MFGEVFWFLKWQCCCSCREIQSASGEWLKDAYALGSCSAVDHLTTSDVLRKSIRRSWTISNPEDDPNNPNSQQPVADNLYPQQQHLIEAQSAGSY 
PTLHQHHQQHHLPPQPRPTHGINYGSGTATPTTSSKWQLGRRVLRQSTLPSSLNNDPALSGSGGNLANYNSVNLTAPTSPHQLQKSPLYPSAYNS 
DDVIHMNTMPGMGVGVGVTVGDCENYVQQQQPQQQQPQLEVTPTHHRLQVRRQSTLPAQPTPAIYMSTSPNRLYSRSPERSPGEQQRYPPFMRQT 
SFPEPPSTYHRTKLLPSTGAPPPTVAPPPLNYESQTSSLASDEQDVGPMPKPRMTRQATLPNPEQHVKLLPTSPPKRQTSPQFRRSPEFMRQQTL 
PNPEAFSSGNTLTVHSTPPAKFMPISPRAKQKFLFPSVPNPRQFLSQPHVPAVGGTELGSGGSVASTGVVGGGEQYSSSQSVNIHHSRDPHSKMI 
KVRSHSNEEYSNTKGHPENRRLLPEIPTAVQRPGGRSPSRLVRQDCLKEERTFGEAKQQFHQFPDVTEELDNRPEYVDYFGDSATSFLESDEVQP 
IYSAENVLADSGYGGGAGVGLGASGGSGGVPGYYPDMGTPQGFYGGAALPRTPLMHNRIRRRQSRELQMQQEELAQLSQVSDANGKGSGNEGAAS 
GSATDSHAAERRKPEQMRSVSEDSGAKTTPKPVTRRSFSHPEKDTQPTKKTEISKIPSPRPLADILDKTRGNSKIPQARRRNSRTGIGEAEEGST 
PQHIYSFHQQLMHRNSDLAMRLKKTEIPVASVASTGQEEEKPHGESGQGSGAEQPVNGKSLESSANKEGGDPNSSAEATHTSTLGEQELQNAVEA 
AAVVFKKVVLQRRKEKEKAAEEGIGQQLSVVSVDIGGCGGGGDAVETSSSSELDFRCSTQHGRQQQQSQYSVEADEFKLVFLNSDSSSSCHGEEE 
EEEEEDGEEDDTDSSCSTHHCHSIVSNVEDCDWDYFEPSMISTTTTTTTTMIAYTTARATPTPPPRVRRQSSDSDSPQVQRRPQQPRNIYCPRIR 
ESDTGTDEEFVLNTESSSQTSSDQTPPPVPPPPHPLSMKTRIKTRFLKNHHHHSHNRCNCNPGQQQQQPQYVPIPVPVPIPVPMAAYPNWPEVSG 
NPLLEQDEQLAASLQQLWKAAGHQQQFAAIVAAYSQQFAAASGNMFDATPPVTSKSYQQLPTTPPPIPERLRGLGGFKSSAPELSASLGSLMTQS 
LCSTISSSSSSSTSGYGTAGRSLETLASPMRAANSMQQQQQQYQEQHQQMASSAFKPKPTSLVASRSPTGEQLETSNARRGSSNSESDDNANSNS 
NNATPTTGRLCNVYDKTGTLPVPQCVQAHVASVDKSAATAAISTKAEPAATPATSTKGTALRVASKYQSRNLCETQSAATTQTYLASKNSHEEVE 
AADETN I HKNQVPAAAAAATAQI QTATRAAAPAVITRFGGLAVENAVGNVAESASAS AASPI PKPKPRAQHPEQI DRHRATAETAKNLSSNRLI E 
NIKMSDEASPSTAAHLQSESYAASSPDEASSNSSQDGDQDRERELEEQAEQKPRRTTQRSYNVAGMMEEQQLGEVDAVEQRHHSTTSDSSSSSSS 
NSSSSSESCDSDDTGTVRDRRPKRRRFNKVFVVNRQPGGRVRRSSSTEGDSLSSSEANLEDSSDNDTDTERTDCGIVLNYVKPLEEEPKKAEEEP 
PTAARDCQSSDEDNINDDESERRVANSTDELANCIVVVGGQTDDAAAGVVLHHMRSLPDVEFDGEQSERRQAHEIDASDALAACDELHSISNDVN 
RLLALHKQNSQLELKLQRLRQGVAEINEDLQLSTTGTDVAATHCTTQPGSAGKPDECNSPSSRKSPSTVLSSNDVGPLSLTCLRQVEAVVEVEAN 
RADGKLRDERNGRTAEQTEERVVTCSQACKINDNSDYSLDSLSATSASLLAPEQQQQQPLQQVEESARLPADAREQRYRRGAKSESAESPVENEM 
LNVAVASRAALLGSNNNNTHRNDNSNNENTNNNDGELGHHNNDAFAAQKTRNSGSSVVVGCDISDVVSVNSNVSGENECDFDKIFGSHQQNTPSA 
SAAVAATAAAATSDTPATSPSELASPKNSLSAKYRSLVMINKDNESAAGDYE^NSSSNFSNIRHKDPTSSNNSHTVGQNFASDSFRAMEPTEL^ 
SSDREAYSVDSLNEPPAVPELSVEDGLADDDSWVEELSQRGEDEDEDLSNATTTPTATDSEDAEGEGEGARRHGYMDREEDLRGYKRSAIDFTLH 
TIVEESCEESEVASMRADNEDVELEDEDiSERRRQRTLQHHHRLSASELEKYFFFGLGDGRVMSSIDTRGDDTASEVSSECSESLDSLPHEDQLL 
DTSGASDLASSRLEKYFLSGFMGFSGAEKQAESDESGGSVGSDSEGQPSPSQRRKRLVRARGTPRSHNSSLDNLLLPESDTLDATVTSAGGAAAA 
TVEDTSESEAGCDDTVIHLANAGASDGSSSDTIKRKKQLRKRHDSLDEKKLHDLEPSECRTPTPGSSGQVQMQSQAKKQQQQHHHSRDSGFVGSN 
DDLLKAAPDCEPPKSPTPALEQISEDREPVHSQISLARMDQPSTSAAAAVASTLSSQLKNLALPNLVRKDSFNNWSSDEETNLMMSKMRQFFKNL 
IVATANAQQSKPSTPNQGPSTTNVKTTPSSQRRLAKSRPAQLAYFENELTRLMKTVPGINDEQVREIVEYLSSEDTWSDSYDSSDYTSSDLEGGE 
RKGQLKAQISASCQQIINKFEIDEEGDRGDGGLLDESQSVPMEALVYQKLVASFSKVAAGGEPETEATKAVEEAEPSTERSPQLFAKVMQHIGTR 
LVALMHEVSSGNETPTPSPQGQRHHRRLQAKISATTTEDEEDEVAEQLKAMPIKQLKLRSKSHDLLLDGTTPHSHHLHHHATVHYPAGGVGSGAG 
GSGGPGHSDNAGEECGVTSDYERFSWRGSFESALLANGDSRTRLSQLSQLDRDNSSSASALAVAKRRSAGDLLFSQHQQSLSREQLDRVRSCGSI 
GGGDAHHHQLESSPAKPWLSSAGSSIGGDSTKDVRRSSVPDAIYETDSSDEAASHQFGGARSTLPRSLNPWQVVASTNSLPRLPTTGVGAPITST 
PKTKSQSALNHTPSMLSTVSATGSAKSARYRSPGLAARAAAVSGAGGGSSGGVGVGASSGSTGKKLGAGFQFLYSKRDARKRLNMSAGIKVLGES 
DP* 

(SEQ ID NO: 1356) 



Celera Sequence No. : 142000013383764 

GTTAACGCAAGTTCCACTTGCATCAGGGTTCTCGGCACTCGCTGCCGCATACCTCACCTTGATATGACAACCAACATGCTGTGGTTCTCCAAATA 
ATAGTCAAGATAGTTAAGAGGGTTCGAGTTTACGATTAGCTTTATTTTAAGATTCAGTTCGCGATCTGCTAGCGGTTCCGTTATGTGTCCGTTCG 
AGACTGGGACCTCTTCTCCCGCCCCGCTGCTTTTATATCCGCTTCCCCCGTTGACCCCTCACCCTGAGTGCGACCAACGGCACTTGTTCCGAATG 
CACACATACACAAACATGTTCCTAGACAGGCCCCGAGTCGCCTGCTGACGGGGCCCGAAAACACGAGAGCCGAGCGCTGGTGAGTCCTGACGAAG 
ATCGCAAGAAGAGGGTTCGTAACTTACACGAACATGTTACTAGACAGGCCCCGAGTCGCCTGCTGACGGGGCCCGAAAACACGAGAGCCGAGCGC 
TGGTGAGTCCTGACGAAGAGCGCAAGAAGAGGGTTCTTAACTTATATGTATATATTTGCATTACCTACATATATATGTATATATTAAGAAGCAAT 
TTTAATTACTTAAAAAAAAAAACGCTTGCAAAATCGGTTCTTGCATGTATAAGCTATGAAAGTTAGAAAAAAGTATACCACGCCGTATATGGTCA 
TGAAGCTGGCTCCAAGAAAGAGGCCAATGGCTCCACCAAACGACATTATCAACTGATCCGTGCTGAATACAACCCGCCGATTTATGCCCATCCTC 
GAAATTTGCAGGTTGATGATCAGCGTCCGCTCGAACCTCTCGCCAGAGTAGTTGTCATCGCCCTGAAGAAAGTAAATTCTAAATTGCAATAATTG 
GGGGCTTAAGGATACAACATTCTTTCACCCACCCCAGTCTGGTCAGACACTTTGAAGATGGTAAAGGTTTCCTCGCGACAAGACGGATAGCAATC 
GCATGGTCTTTCAAGCCACTTGGACCGGGCTAGGCAGAGCATTCCGGATACTGTGCAGATTGGAACTTCGGGAGCTGCTACGTAGAAATAGGGCT 
TGCAGTTGCACAGGCTCAGGGCCCATTTGATTCGGCACTCAAGTCGGCAAAGACTTGGGTGGTAGGGCTAAAAGGACATGTTGCTGTTTTGATTG 
GACGCAATTGGTACGGGTAAAGGTCTAAAACTCACACTGTAATACTGCAGATTGTTTTCGTCACTGAAGCGACACTTGCGATATGCTACCGGTAG 
ATTTCGCACCTCGCTTTCCGCCGATGTGGTATGTAGCATAATATCGAGATCCTTGGATTCGATGTCTTTAGCATCAAAAGAGGGAGTTCGCATGT 
CGTCGGGTAGCATCATTTCTTCGACATCGTGAAGGTACTGTAAAGTTGGATTCGCTTAACAGTGTATTTCAAAAAAGAATGATAAATGCTGCCTA 
CCATAAATATGGGGGCAACAATGCTATTTATAGGCTTCAGAGATGTGATGCAATTACTCAAATAGCCGCACTGTTTCATGGGTTCCATATCCTGT 
GTTTCTCTTAGATAAAAGCGTTAAGTAAAAATGTAAATGTAAATTCTCTTGATGGGCACAAAAAATATGTAAAGCAAAAAGCGAAATTAAAGAAA 
GAATAGCTTATTTATTTTATTATTTAATAATTTATTTTTTTTGTTTTATATTTAATGTTATTTAGGCGCAGACTTACAGTTTGCCGTAGGGATTT 
CCGTATCGGTTTAACTGGCTGGATGTTTGGCACAGACCCACCTCAGTAATGATTGGCGCCAGTTCTGGCGGGATTTTAACCGCTATTACCTTTTT 
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TTGCAGCGTGAGCAAAATGTTCAAGTAATTCACGTTGTCCAAAGTGGTATCATTCTCGAAGGGTTCCAGCGTCTCCAGAGTGGAGTAACGAAGGC 
CGTTGATTGCAAGCAGAAACTTTCTGTAGTACACAGCTTTTTCTTTGTCTTCGCTGGGATCCACTCCCCAAGTTCTTGATATTTTATAATATATT 
AGTGCTTTCAATTTACCAATGTAAATCGAAACACTCACTGATTTATAAGCCTTGGGATCTCCGTTTCGTCGTGGTACCTAGGGCACAGTGTTATC 
CCCACGAAGGGGAAGATGCGCAGCCCAAAGAGATATTCGTACTCGTAGACAATAGGATTGTCCCGGAACCGGTACCATTGGTTAAGGCAGCTGCT 
GATGGCGTAGTAGGCCGAGCCCAGGATTATGAATGTCCAAAAGAGGCGCTCGGCCCAGTGTAGATCCCTGCGGGCGATGTAGGGCATGCCGTGGA 
TGGAGGACTCCTGGCACAGCTCGGCGAGCCAGTCCGGTTGCTTCGGCCGACTCATTTTGGATTCCATGATGAACTGAACAGAAGTTCAGAAGTTC 
GCATTCAACTTTAGCGACCCGTGTAATGACACACAACAACACCACACCCTCCCCCCCCTCACCACCATTATTCACCTTGTTTGATGTGCTCCAAT 
TATAGACTTTAATGGTGTTTCGCTGTATGTTCAGCATGCATATACAATTGTCGTGATTAGACCTGTCCTGCAGTTCCACGGAACTCTATGCAAAT 
GGGAACTGGGCGCCTTACATTGCGCTTATCGAAGGCCTCTATTTGGACCACCAGCTTGTAGCGCCCATCCAGATTCGGAACATTCAAGCTGGTCT 
TCAAGTCCAGTTCGTAGTCTTTGTACTTTAAAACGGCCTACAATATTGTGTTTATGAGAAATGACCAGTGACCAAACTAGGAGGTACATTTTCTT 
ACCCCTGGTGTGGTCAAGCACTTTTCCTTAATCTCGTCGGAGTTGCTAATATACTTTGTCCAGAAATTATACCAGTATTGGTTCTTATCAAACAT 
GTTTTTGCAAAAGTTATTGGAGCTGGCCGTAAACATAGTCTTCTGCCAAGTGCCCTTATCCAGGCGATAGACTTGACCAAATACCTGCGCGATAT 
TCAAATAGTTTATTAGGAAAAGTTTCCTTAGGGACTTGCCTTTAATGTGTCTCCAGGCTGGACATCTTTCCAGACCACTTTCTGCTCTCCTTCAA 
TTCGAACTTTCGAACCAAGGTTATGTACTTTAAGGGAGGAAGTATCCACCATCGCTTCAAAACCAGAAGGATTTCCGGGCTGATTCTCGCATGGT 
GCTAACAAGCCATCTTTGTCCAATGTTAACTGGTACTCAGTCGCTCCAGCCAAACAGATGACCACTTGGAGCACCAGGATTGTAAAAGTAGCCTT 
CATATTAACAAGCTTTCCAGTATGAGGATGGGTACATAGTGCTAAAAACTGCACGTTATATACAGCTTAGGGGTCGTCTATGATTCTAATTGCAC 
ACAGTATAATATGATTATGATATGATAAAATCATTGTCTTTAATAATTTAAAAAGGGCGTATCTCTTTTGGCACGAAAATAAGTATGTTCGATTA 
AAAATTCGCGTGACTGTTTTTCTAAACTAAATATTAGACGAATGTCAAGTTCATAACCCCATTTACCCCATATATTTTCCGGAGTTTTGGCAGAC 
AGCAAATTCATTTACCTGCCCGGCTAGCTCAGTCGGTAGAGCATGAGACTCTTAATCTCAGGGTCGTGGGTTCGAGCCCCACGTTGGGCGACCAA 
TATTTTGTAGTAAATATTCAAATTTTAATAGTTAATTGATTACCAGAGAAAATCAATTGTCGAGACTCTCGACTATCGGATATCTTTTAATTAAC 

T AATT AGGC T T AAAGC AT AT GC G AAC AT GG AT GG AT GT AAT T AT AT AT AAAAAT GAT T T T T AT AC CC GT T AC T C GT AAAT T AAATT GGT T G AGG A 
GTATGTAACAGGCATGAGGAAGCGTTTGCGACTATATAAAGTAGACATATTCTTAATCAGGATCAATAGCCTAGTCGATCTAGCCATGCCCGTCT 
GTCCGTATGAGCGTCGAGATCTATAAAAGCTAGAAGGTTGAGATTCAGCATACAGATTCTAGAGACTTGAGACAAAGACGCAGGGCAATTTTGTT 
GACCCATGCTGCCACGCTCACTGTAACGCCCTAAAACCGCCAAAATCGGCTGCGCCCACATATATTTGAAAAATTGTTGGACATTTTTTCATAAT 
GTTATTAGTTGCTTAAATTTCTATAGATTTGCTAAAAAAAAACTTTTTGCCACGTCCACTCTAACGTTCTAAACCTTTCCAATTTTTCTATATCT 

AT C AAT AT C C C AG AAAAAT TAT GAAAT T T CGCGT TC GC AT T C AC ACT AGCT G A 
(SEQ ID NO: 1357) 

Exon: 3233.. 3090 
Exon: 2934.. 2758 
Exon: 2697.. 2584 
Exon: 2353. .2034 
Exon: 1974.. 1693 
Exon: 1526.. 1427 
Exon: 1367.. 1176 
Exon: 1112.. 888 
Exon: 822.-649 
Exon: 484.. 404 

Start ATG: 3233 (Reverse strand: CAT) 
Transcript Ho. : CT35185 

ATGAAGGCTACTTTTACAATCCTGGTGCTCCAAGTGGTCATCTGTTTGGCTGGAGCGACTGAGTACCAGTTAACATTGGACAAAGATGGCTTGTT 
AGCACCATGCGAGAATCAGCCCGGAAATCCTTCTGGTTTTGAAGCGATGGTATTTGGTCAAGTCTATCGCCTGGATAAGGGCACTTGGCAGAAGA 
CTATGTTTACGGCCAGCTCCAATAACTTTTGCAAAAACATGTTTGATAAGAACCAATACTGGTATAATTTCTGGACAAAGTATATTAGCAACTCC 
GACGAGATTAAGGAAAAGTGCTTGACCACACCAGGGGCCGTTTTAAAGTACAAAGACTACGAACTGGACTTGAAGACCAGCTTGAATGTTCCGAA 
TCTGGATGGGCGCTACAAGCTGGTGGTCCAAATAGAGGCCTTCGATAAGCGCAATTTCATCATGGAATCCAAAATGAGTCGGCCGAAGCAACCGG 
ACTGGCTCGCCGAGCTGTGCCAGGAGTCCTCCATCCACGGCATGCCCTACATCGCCCGCAGGGATCTACACTGGGCCGAGCGCCTCTTTTGGACA 
TTCATAATCCTGGGCTCGGCCTACTACGCCATCAGCAGCTGCCTTAACCAATGGTACCGGTTCCGGGACAATCCTATTGTCTACGAGTACGAATA 
TCTCTTTGGGCTGCGCATCTTCCCCTTCGTGGGGATAACACTGTGCCCTAGGTACCACGACGAAACGGAGATCCCAAGGCTTATAAATCAAACTT 
GGGGAGTGGATCCCAGCGAAGACAAAGAAAAAGCTGTGTACTACAGAAAGTTTCTGCTTGCAATCAACGGCCTTCGTTACTCCACTCTGGAGACG 
CTGGAACCCTTCGAGAATGATACCACTTTGGACAACGTGAATTACTTGAACATTTTGCTCACGCTGCAAAAAAAGGTAATAGCGGTTAAAATCCC 
GCCAGAACTGGCGCCAATCATTACTGAGGTGGGTCTGTGCCAAACATCCAGCCAGTTAAACCGATACGGAAATCCCTACGGCAAACTAGAAACAC 
AGGATATGGAACCCATGAAACAGTGCGGCTATTTGAGTAATTGCATCACATCTCTGAAGCCTATAAATAGCATTGTTGCCCCCATATTTATGTAC 
CTTCACGATGTCGAAGAAATGATGCTACCCGACGACATGCGAACTCCCTCTTTTGATGCTAAAGACATCGAATCCAAGGATCTCGATATTATGCT 
ACATACCACATCGGCGGAAAGCGAGGTGCGAAATCTACCGGTAGCATATCGCAAGTGTCGCTTCAGTGACGAAAACAATCTGCAGTATTACAGTC 
CCTACCACCCAAGTCTTTGCCGACTTGAGTGCCGAATCAAATGGGCCCTGAGCCTGTGCAACTGCAAGCCCTATTTCTACGTAGCAGCTCCCGAA 
GTTCCAATCTGCACAGTATCCGGAATGCTCTGCCTAGCCCGGTCCAAGTGGCTTGAAAGACCATGCGATTGCTATCCGTCTTGTCGCGAGGAAAC 
CTTTACCATCTTCAAAGTGTCTGACCAGACTGGGGGCGATGACAACTACTCTGGCGAGAGGTTCGAGCGGACGCTGATCATCAACCTGCAAATTT 
CGAGGATGGGCATAAATCGGCGGGTTGTATTCAGCACGGATCAGTTGATAATGTCGTTTGGTGGAGCCATTGGCCTCTTTCTTGGAGCCAGCTTC 
ATGACCATATACGGCGTGGACTCACCAGCGCTCGGCTCTCGTGTTTTCGGGCCCCGTCAGCAGGCGACTCGGGGCCTGTCTAGTAACATGTTCGT 

GTAA 

(SEQ ID NO: 1358) 

Start ATG: 1 (Reverse strand: CAT) 

MKATFTILVLQVVICLAGATEYQLTLDKDGLLAPCEWQPGNPSGFEAMVFGQVYRLDKGTWQKTMFTASSNNFCKNMFDKNQYWYNFWTKYISNS 
DEIKEKCLTTPGAVLKYKDYELDLKTSLNVPNLDGRYKLVVQIEAFDKRNFIMESKMSRPKQPDWLAELCQESSIHGMPYIARRDLHWAERLFWT 
FIILGSAYYAISSCLNQWYRFRDNPIVYEYEYLFGLRIFPFVGITLCPRYHDETEIPRLINQTWGVDPSEDKEKAVYYRKFLLAINGLRYSTLET 
LEPFENDTTLDNVNYLNILLTLQKKVIAVKIPPELAPIITEVGLCQTSSQLNRYGNPYGKLETQDMEPMKQCGYLSNCITSLKPINSIVAPIFMY 
LHDVEEMMLPDDMRTPSFDAKDIESKDLDIMLHTTSAESEVRNLPVAYRKCRFSDENNLQYYSPYHPSLCRLECRIKWALSLCNCKPYFYVAAPE 
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VPICTVSGMLCLARSKWLERPCDCYPSCREETFTIFKVSDQTGGDDNYSGERFERTLIINLQISRMGINRRVVFSTDQLIMSFGGAIGLFLGASF 

MT I YGVDS PALGS RVFGPRQQATRGLS SNMFV* 
(SEQ ID NO: 1359) 

Name: sodium channel, putative 
Classification: ion_channel 

Celera Sequence No. : 142000013384333 

AATAACAACAAATATAAGCATTTTGAGCACTAGCTGAAACGACAACAACAACAGCAACAACAAAACGCATGCTCACCCAATTGCTTTGACACAC^ 
AACCCACACATAGAAATTAAAACAAAACGCCCGCGGCAGCAGCGCCAGCAATAACAACACTGCAACAGCAACGTGGAGAGAGAGCACCATACCGC 
AAGAGCGAAAGAGAGAGAGAGAGTGAGCTAGCTACGGAGAGTGGCTGCTGCTGCCGGGAGTGACACTTCCGCGCCACACGTGCAAGCGAGACGAA 
GCGAGTTGCCCCGAAGGCACACCGTTGCGGCCAAAAACAACCACCATGATGGTGCCGAGAGCGAGAGAGAGCGAGAGCTGGCTCAGCGCTCAGTG 
GCGATAGACGAGTGCGAGAATGTGACAGCACGAGCAGCAGCAGCGCTGCCGCTGCCCTCTGCCCGCGCCTCTGCCCGCAGCCAAGTGGATTCGTT 

T T GT AC GT T GT T AG AC TCTCTCGCCGCGTT T TGG AAT T C AGT C G AC AAT C G AC G AT AAT T AC CGGT GGT T GT ATC AC AT C AC AT C AC GC G C AGC A 
GCAGCAGCACTACGAACGAAAGAAGAAGAAGCAGAAGAGCCAGCCGCAGTGGCAGCAGAGAAACTCGCTCGTTTTCGGATCTATATATATATATA 
TGCATATAGATTTGTATATATAGAGGGATATATATATATATATATTTGCACGCAGCATTGGCGAATTTTCGGCCGCCTTTAATTTGTTTGTGTTT 
TTTCCGTGTTCTCCTCCGCGTGCTCGCAATTACGTGCGCGGCAGCAAAAGACTATAATTATAAAACTAAAGTGAAAAGCGTGCCATAATTACACG 
AGCGAATATCAATAAGTGCAAGAGCAATGTGCGAAAATTGTTAACGAACGCAGTAGCAACCAGAAGCAGTAAAACGCAGCAGCAATCGCAGCAGT 
AGGATCCAAACAAATCTCAGCGCAACATTAAAATCGCAAGCGTTTTCAACATGATCGATGCGGCCTGCAATTACTTAAATCCCTATGCGCAACAG 
CATCAGGCGCAGCAGCAGCAACATGCGCAACATCAGCAACACGCGCAACAACAGCAACATCATCTGCACATGCAGCAGGCGCAACATCATCTGCA 
CTTGTCGCATCAGCAGGCGCAACAGCAGCATATGCAACACTTGACACAGCAACAGCAGCAGCAGCAACAACAACAGCAGCAACAGCAGCAGCAAC 
AGCAGCAACAACAACAGCCGCAACAGCAGCAACATGACTTCCTCAGCGCCGCCGCCCTGCTGTCCGCCCCACCATCCCTATCCGGCTCGAGTTCC 
GGCTCCAGCTCCGGCAGTTCTCCGCTATATGGAAAGCCGCCAATGAAGCTGGAACTGCCGTATCCACAGGCCTCATCCACGGGCACCGCATCGCC 
CAACTCCAGCATACAATCGGCGCCATCATCGGCCTCCGTGTCGCCCAGCATCTTCCCATCACCTGCCCAGTCATTCGCCTCCATTTCGGCTAGTC 
CATCCACGCCCACCACGACCTTGGCGCCACCAACAACGGCGGCAGCGGGCGCCCTAGCGGGATCACCCACATCCTCCTCACCATCCTCGTCAGCA 
GCATCGGCTGCAGCGGCGGCAGCAGCAGCTGCGGCGGCAGCAGCGGACCTTGGAGCAGCGGCGGTCGCCAGTGCCGCCTACGGCTGGAATACCGC 
CTACTCCGGATTGGGGCCAGCGAGGTAAATATCCCTATATATAGGATTTACAGAAGTTGAGTTGGGAATCATTCGTAAGAAAGAAACTTTAAAAT 
GCTAGTGATCAATAAATCAAAGCTTATAGATCATCAAATTTGTGGTTATTAGGGATACACTAATTTCGATCTGAATTAGATCTCCTTTTAATAGA 

AT AAC T AC AG GC T GAT T GT AT T T C G ATC AT G AGC G AGAT AAAGAC AT AT AT CT T TT GC AAAT T AT AC AGT T T AAT AC AT AGG AAT AT AT T AAT C A 
AATTATCCCGAAGATTGAAATGGATTAGCATACAAATAATCTCATAAACTTATAAAATGATAGGTACCACTTGAATACATTACAATTTAGTATTC 
CGATAGCAGGTGTACCCATTAAGTAAATGATTAAAAAAAACCAAATTAACCATTCTCTGTCTTCATTTCTTTTCTCCATCTAGAAGTCAGTTTCC 
ATATGCCCAGTACGCCTCCGATTATTACGGCAATGCGGTGGGCATGTCCTCCTCGGCCGCTTGGTTCTCGCATCAGGAGCGCCTCTATCAGCCAT 
GGAGTTCACAGAGCTATCCGGGCTTCAATTTCGACGACATCGCCTTCCAGACGCAATTGCAACGCCGCTCCGTTCGCTGCACGTGTCCCAATTGC 
ACCAACGAGATGAGCGGCCTGCCACCGATTGTGGGTCCGGATGAGCGCGGACGCAAGCAGCATATCTGCCACATTCCGGGCTGCGAACGGCTATA 
CGGTAAGGCGTCGCATCTGAAGACCCACCTGCGCTGGCACACCGGCGAGCGGCCCTTCCTGTGCCTCACGTGCGGCAAGCGTTTCTCCCGATCGG 
ATGAGCTGCAGCGGCACGGACGTACGCACACCAATTACCGCCCGTATGCCTGCCCCATTTGCTCCAAGAAGTTTTCGCGCAGCGATCACCTCAGC 
AAGCACAAGAAGACCCATTTCAAGGACAAGAAGAGCAAGAAGGTATTGGCCGCCGAGGCCAAGGAGCAGGCGGCTGCGGCGATAAAGCTGGAAAA 
GAAGGAGAAGAAGTCGGGTAAGCCACTGACGCCACCCGTGGAATTCAAGCAGGAGCAGCCGGATACGACACCGCTGGTCAACTATGCGCCCTATG 
CCAATCTCTATCAGCATTCCACATCCGCCGGATCGAGTGTGAATCCTCCGCCGCCACCGCCGCCGCTCTTCCAGCAGCAGATGACGACGACAACA 
TCATCGGCGGCGGCCAGCTTTGTGGAGCAGCCGTGGAGCAGTAGCAGCAGCCGTGCCATACAACCAGCCACTACCTCAGCCAGTTCATCCTCATC 
CTCCAGCGCCAGTTCCCCAGCGGCAGCGGTCGTTTCTGCCATTGGATCTGCATCTTCCCCAGCCGCATCAGCCACGGCGCTGGCCCAGCATCATT 
ATGCCGCACTTGCCATGCAAAGTGAGTCCCAGCTGGCGGCGGAATATGGGCTAACCATGAGCGGATTGGCCAGCGGAGCTAGCCAGGATTCCAGC 
TCCAGTTGCCACATGAAGTCCGAGTATGCGGCCAGTTATCCGGCTGATTTTGGTGCGGGAACAGCCAGTTACGGTTATCCGCATCCACATCCGCA 
CCATCACAATGCGTGGGCAGCGGCCTATCATCCACATGCCACCGCCTAGGATGTCCCCGATCCTTCCCAAAATTGTTACAGTTGAAAGTTGATGA 
AACAAATGCCATAATGTCCCAAAACCCAGAACGGCAACAAAGCAAGCGTAGAACGAAAATCTAGTGTGTGTTTTTTTTTCAGTGTTTATTGTTGA 
TATAAAAATATAATGTGAAAATCAAAAAAAAATTTGAATTCACAAAATACCGTATAAGGCATTATATAATATTATATAAATATATATATATATAT 
ATACATACACATATATTAACTATACATGTAGAGACTAGAACTTGAATTTTTTTCATTTTTTTTCCCCCAATCTAGTAGCTAAGGCATTTTTTTTT 
GCATGTCTGTACATACAAACATATATAGCGTTCTTTTTAAGCCATACCATATAGAATGAGAGAAACCATCTAGTTGTGTACCATAATAATTGTTA 
ATTGTCTTTAAGATAAATTATATACAACAACAAAAAGTGAAAAATCCAAGGCGACAAAATCCCCACACTGAAA 

AAAAACGAACAAAATAATATCGGAAAACAAAGCGAATCCAGTTTTTTATTATTCTCTCCGATCCCGACGCACCCTTGTTAATGGGTTCCTGGCTT 
GGTTAGATTTCAGGGTTGTTTGATCTCCGCCAGGGGTTGCTTTCTTGCATATAAATCGAGGGATCCAGGAAGCTTCCCGACTTCCGGCGAACTTG 
ATCTGCTTTTGGGTGCTGACCTTTGGCGGTATTTGGATAATAATAGAAACTTTTTCAATAGCAATCATTATAGCTGTCTGCATTTCTAGTTATAA 
TTTCATCAACGTCGGCCATCGCTTTAAAATTGCAAAAACATTGTTAGCTTGGGAGATGTACATCATTTATAGATTGCTTTACTTTAAAATGTGCA 
TAAATAAAGGTACGAATGTGATATGTATATATCCTTAAATACAATGATTTATTTAGTCAAGATACACATAGTTGTAGAATTTCATAACAATCGGA 

TGTG 

(SEQ ID NO: 1360) 

Exon: 1001.. 1734 
Exon: 2174. .3374 
Start ATG: 1001 

ATGATCGAT^GG^ 

ACAGCAACATCATCTGCACATGCAGCAGGCGCAACATCATCTGCACTTGTCGCATCAGCAGGCGCAACAGCAGCATATGCAACACTTGACACAGC 
AACAGCAGCAGCAGCAACAACAACAGCAGCAACAGCAGCAGCAACAGCAGCAACAACAACAGCCGCAACAGCAGCAACATGACTTCCTCAGCGCC 
GCCGCCCTGCTGTCCGCCCCACCATCCCTATCCGGCTCGAGTTCCGGCTCCAGCTCCGGCAGTTCTCCGCTATATGGAAAGCCGCCAATGAAGCT 
GGAACTGCCGTATCCACAGGCCTCATCCACGGGCACCGCATCGCCCAACTCCAGCATACAATCGGCGCCATCATCGGCCTCCGTGTCGCCCAGCA 
TCTTCCCATCACCTGCCCAGTCATTCGCCTCCATTTCGGCTAGTCCATCCACGCCCACCACGACCTTGGCGCCACCAACAACGGCGGCAGCGGGC 
GCCCTAGCGGGATCACCCACATCCTCCTCACCATCCTCGTCAGCAGCATCGGCTGCAGCGGCGGCAGCAGCAGCTGCGGCGGCAGCAGCGGACCT 
TGGAGCAGCGGCGGTCGCCAGTGCCGCCTACGGCTGGAATACCGCCTACTCCGGATTGGGGCCAGCGAGAAGTCAGTTTCCATATGCCCAGTACG 
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CCTCCGATTATTACGGCAATGCGGTGGGCATGTCCTCCTCGGCCGCTTGGTTCTCGCATCAGGAGCGCCTCTATCAGCCATGGAGTTCACAGAGC 
TATCCGGGCTTCAATTTCGACGACATCGCCTTCCAGACGCAATTGCAACGCCGCTCCGTTCGCTGCACGTGTCCCAATTGCACCAACGAGATGAG 
CGGCCTGCCACCGATTGTGGGTCCGGATGAGCGCGGACGCAAGCAGCATATCTGCCACATTCCGGGCTGCGAACGGCTATACGGTAAGGCGTCGC 
ATCTGAAGACCCACCTGCGCTGGCACACCGGCGAGCGGCCCTTCCTGTGCCTCACGTGCGGCAAGCGTTTCTCCCGATCGGATGAGCTGCAGCGG 
CACGGACGTACGCACACCAATTACCGCCCGTATGCCTGCCCCATTTGCTCCAAGAAGTTTTCGCGCAGCGATCACCTCAGCAAGCACAAGAAGAC 
CCATTTCAAGGACAAGAAGAGCAAGAAGGTATTGGCCGCCGAGGCCAAGGAGCAGGCGGCTGCGGCGATAAAGCTGGAAAAGAAGGAGAAGAAGT 
CGGGTAAGCCACTGACGCCACCCGTGGAATTCAAGCAGGAGCAGCCGGATACGACACCGCTGGTCAACTATGCGCCCTATGCCAATCTCTATCAG 
CATTCCACATCCGCCGGATCGAGTGTGAATCCTCCGCCGCCACCGCCGCCGCTCTTCCAGCAGCAGATGACGACGACAACATCATCGGCGGCGGC 
CAGCTTTGTGGAGCAGCCGTGGAGCAGTAGCAGCAGCCGTGCCATACAACCAGCCACTACCTCAGCCAGTTCATCCTCATCCTCCAGCGCCAGTT 
CCCCAGCGGCAGCGGTCGTTTCTGCCATTGGATCTGCATCTTCCCCAGCCGCATCAGCCACGGCGCTGGCCCAGCATCATTATGCCGCACTTGCC 
ATGCAAAGTGAGTCCCAGCTGGCGGCGGAATATGGGCTAACCATGAGCGGATTGGCCAGCGGAGCTAGCCAGGATTCCAGCTCCAGTTGCCACAT 
GAAGTCCGAGTATGCGGCCAGTTATCCGGCTGATTTTGGTGCGGGAACAGCCAGTTACGGTTATCCGCATCCACATCCGCACCATCACAATGCGT 

GGGCAGCGGCCTATCATCCACATGCCACCGCCTAG 
(SEQ ID NO: 1361) 

Start ATG: 1 

MIDAACNYLNPYAQQHQAQQQQHAQHQQHAQQQQHHLHMQQAQHHLHLSHQQAQQQHMQHLTQQQQQQQQQQQQQQQQQQQQQQPQQQQHDFLSA 
AALLSAPPSLSGSSSGSSSGSSPLYGKPPMKLELPYPQASSTGTASPNSSIQSAPSSASVSPSIFPSPAQSFASISASPSTPTTTLAPPTTAAAG 
ALAGSPTSSSPSSSAASAAAAAAAAAAAAADLGAAAVASAAYGWNTAYSGLGPARSQFPYAQYASDYYGNAVGMSSSAAWFSHQERLYQPWSSQS 
YPGFNFDDIAFQTQLQRRSVRCTCPNCTNEMSGLPPIVGPDERGRKQHICHIPGCERLYGKASHLKTHLRWHTGERPFLCLTCGKRFSRSDELQR 
HGRTHTNYRPYACPICSKKFSRSDHLSKHKKTHFKDKKSKKVLAAEAKEQAAAAIKLEKKEKKSGKPLTPPVEFKQEQPDTTPLVNYAPYANLYQ 
HSTSAGSSVNPPPPPPPLFQQQMTTTTSSAAASFVEQPWSSSSSRAIQPATTSASSSSSSSASSPAAAVVSAIGSASSPAASATALAQHHYAALA 
MQSESQLAAEYGLTMSGLASGASQDSSSSCHMKSEYAASYPADFGAGTASYGYPHPHPHHHNAWAAAYHPHATA* 
(SEQ ID NO: 1362) 

Name: buttonhead 

Classification: transcription_f actor 
Gene Symbol: btd 
FlyBase ID: FBgn0000233 



Celera Sequence No. : 142000013383980 

AAAGCAATTCCACACCCCATTCCAGTGCCCAGAAAGCCAAGGCATTACATAAAGGCGGTAAATATATATGACTAAATAAACCAAAGTCCCAGAAT 

CAAAAGGGGCAAGGGGGGGGGGACAAAACAGCTGTCAAAGATTCCTCAGTTGAGAGTTAGGATTTGTGCTGACCTACGGCAATTAACCGTCGTCA 

TTTGCCATCGCAAAGGAGGAAGCAGCCATTTCTTCCGGTTCGGTGGCGTATACGTAACGTTTGCTGGCCGTAAGGATAGCTCAATCGCCAGGACA 

CTGACTTATGCTGGCTCCAGTTAGACGCACAGATGACTACGCATTTATTTCTCGCCATCCTTCGTCGTCCCTGCGAACTTTTCAGAACTTGTTTT 

TTGCACCCAGTGTGCTTATTTTTATTTTATTTTTTTTTTTTTTTGTAGCCAGAGTTTGGAGCTGGCGCCGTCTACTGTCTGTTGATTCAGATTCG 

GTTATAGATTCTGGATTCATTTACGGTTTCTGTGTCTGACTTTCGGGCATTGACTTGGCGGCAACTTGGCTCTGTTTCTGACCCCAAGCCAAAAA 

CGTCGCATCTCCCTGCCATCTTTTGGCATTTTCCGCTGTTGCGAACATTACGCATTCGCCCTGTGGGACGGCTAAATCTCGGTGAGCAAGCTAAT 

TATGCCCATGTGGCACTCGCTTATAATTCCTATCAATTTAATTTTGTCGAATGTATGAGAGGTGTTACACCAGTCTATTCAGATTTCAATCGACT 

TGACTTGATTAAAGCATTTGTAAACCATTTCAAAATTTGGAAAGCAAATTAAAACAATTACGAATTCATTTTAAATGGCAAAGGTTTTAAATAAG 

TTCAAGCTTTGTCAAATTTCGCTACGTTGCAACTAGTGTCACATGGGCATAAAATTAAATTTGCAACTTTGGTGCTTGAGGCGAGTAGAATCAAC 

AAGGCGAAAAAGCATTGAGAAGCGATAAAGCAAAGCGGTATTCAAACTAATTAAAATACTATGTATTTGCAGGTATTCTTAATCGTTTTATTCAA 

ATCGTTACAAGCAAGAAAATCAAACGAATTGCAGCAAATGGCGTAAATCAAATGGTAATAGAAGCACTTGTTTTGCCGCATAATTTTCATAGATA 

TTTTGATATTTGTTTTTTTTTTTATCATTTTTTGTTTCGTATAGGTGAGGCTTAAGAGCGGTGGGTGCTTTTGGATTTTTGGTGATGGGTACTTC 

TCATAGGGGTTGTCCTTCTTCTTGCTGGCCGCAGGTGTGCTCTTCCTGGGTCGATTCATCATTGGATCGTCCATGGTTCGCGATGATTGCTGTCT 

CGTGTGCGCCGGCGGTCCAGGTGGCTTCTTTTTCTTCTTTTTCCCAGCTTCAGGTTGCAATGGATTGAAGGCGAACTGCAGATGTTGCTGCTGCT 

GCGTCTGTTGCAGATGTTGCTGCTGTTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTCGCGGCGCTCGAGGAAGGCGGCATTGGCCGAGGCTACC 

GTTGCTATTCTCTGCTGCTGCTCCAGATCCTCTAGTTCCTTCTCCCGCTTCTTGGTCATTTGCACCAACTGCGAACTGCGCCGCCTGAGGGCGGC 

TGCATAGGGTGTGGGCGTTGCCACCGGCGGCACCATGTCTCCCACACCGAATAGCCGCCTGATGCGCCGCCGACTGTTGATGAAGCCCAGCCCTA 

GCTGCACGCCCAATCCCAAACCCGCACCCGCCGGCGCTGTCGTCTGGCTGAGGGAACTGCTCGCGCCACCCTCACTGCCGGCGCCCAGCATCTGG 

CTGAAGTTACTGTCCAGATCGCAACTGCTGGCCGAGTCGGCGCCACAGGGAAAGAGATATGTATTCAGGGTGCTTAGGGTTGTGGATGTGGACGT 

GCCAGTTCCGGTCATGGTTCCAATGCGACTAATGGCGCCCGTATTGCTGCCACCCAGCGGATTTTGCTGTTGCGAGTCGAGAGATGCTGATCCTG 

CTGCTGCTCCTGCTGCCGATGGCTGATGTTGCAGGTTAAGCGGGAGACTCAGTGGCAGTGGCAGCATAACTGCCGTGGAACCACTCACGCTGCCG 

GTGGTGCTGCTGTCGTAGCTGTGCGGATTCACATAAATGTGGTTGTTGTTGGTGGCGGTTGATGATGCAGTTGCTCCAGCTGTAGTGCTTCCTCC 

TGCTGCCGTGCTACCAGCTGATGCAGTGCTGCTCGTGTTGACCACCAATGTGCTGTTGCTGCTCTCATGGTGATTGTTGTTGCTGGCTTGGTTAG 

CGTTGTTGCTGCTGCTGCTGCAGCTGGTGGTGTTGTTGTTGATGCTGCTGCTGTTCGTGTTGCTCAGGTTGACCCTGTAAAAGCTGGTGCTGCTG 

GTGGCAATATTGTTGTTGTTGCTGCTGCTGCTGGAGCGTGCAGGCGCCGTTGCAGGTCGCGATGCCTGTGATCCCGATCCCGATCCTGATCCGGA 

TCCAGGTGCACTAGCCTCCGGAGGGGCAGTGGTTGTTGCTGCTGCAGCTGGTGTTGCAACTGATGTGGCAGGCGTCGCCTGCTGCTGCTGCTGTT 

GCGAATGATCGTTGCCATTGTTGCTGCTCATTGCTGTGGTTGTTTGACGCCTTTTTGCGTTAAGTTCTCGATTTTAAATATACTTCATTAATGTT 

TATAGCTCATTTTTAATAACCCAATCATTATATTACATTTACTTCAATTATATGTGCTCGATAGCTCATTTACTTTTTAATAATGATAGTGAACA 

CTTATTAACATTTTTAAGGGAGGTACTTCGTTTCACTAGTAAGCTATTTTGAGAAAAGTTTTATGTTCTCGTGGTGTAATATCACAAACTAGTGA 

AATCTTTAAGATTTAATGAAATATAACATCTTTGGTTTTCTAAATTAAACTTAATAATTTATTTTTATTTCGTTAGATTTTAAGTTCACTTCATA 

ACTTTTTCTTCGGTAAATTCTAGTTACGGTGCAGTGGCTATTAGTTTATTTGTTTTTAACCGTTTAACAGTGTTTAACTTTACATTTATTGATGT 

TATTAAATTGCTTCTAGGTGTTTATTGTGCATAGATTTGTTTGATTTAGTTATATATACATTTATGCTAGTTCGTTTGCTATGACTCCTTTAAAG 

ACCCTCCTTTTACATATTACAATTCATGTATCTTCTTTTGGATGGCATCCAAATGGAGTTGAATGCACTGCTCCGATTGCTGGTAAATGGCAAAT 

GGCGACTACTAAACGGTTATGGCCAGACGTTAGCTAATCGCTCAGTTTTCCCGGACCTTTCACGCACTTTTCTCTCGCGGCACTCGAACAAAAAT 

CACAGACAGTGGGGTCAGGAGGTCAATGAAGCTTATCACTGGCACGCACATTTACGCAGCACCGCAGGACACTTGCAAAAAAAAAATTACTTAAA 

AAAATGGCTGTTTTTACGCGGCATTTTTTAAATGATTTTTATTAATTTTTATTTGATTTAGACAGACAGAGTTCTGTTATCAGTCAGAGCCACGT 

GCTTAACGCGCCGCGTGTCACAATTGTATGAACTTTGTGCCGCGTTGTTGACAATGCCAAAAGCCAACAGCAGCAGCAGTT 
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(SEQ ID MO: 1363) 



Exon: 2596.. 1001 

Start ATG: 2596 {Reverse strand: CAT) 
Transcript No. : CT35411 

ATGAGCAGCAACAATGGCAACGATCATTCGCAACAGCAGCAGCAGCAGGCGACGCCTGCCACATCAGTTGCAACACCAGCTGCAGCAGCAACAAC 
CACTGCCCCTCCGGAGGCTAGTGCACCTGGATCCGGATCAGGATCGGGATCGGGATCACAGGCATCGCGACCTGCAACGGCGCCTGCACGCTCCA 
GCAGCAGCAGCAACAACAACAATATTGCCACCAGCAGCACCAGCTTTTACAGGGTCAACCTGAGCAACACGAACAGCAGCAGCATCAACAACAAC 
ACCACCAGCTGCAGCAGCAGCAGCAACAACGCTAACCAAGCCAGCAACAACAATCACCATGAGAGCAGCAACAGCACATTGGTGGTCAACACGAG 
CAGCACTGCATCAGCTGGTAGCACGGCAGCAGGAGGAAGCACTACAGCTGGAGCAACTGCATCATCAACCGCCACCAACAACAACCACATTTATG 
TGAATCCGCACAGCTACGACAGCAGCACCACCGGCAGCGTGAGTGGTTCCACGGCAGTTATGCTGCCACTGCCACTGAGTCTCCCGCTTAACCTG 
CAACATCAGCCATCGGCAGCAGGAGCAGCAGCAGGATCAGCATCTCTCGACTCGCAACAGCAAAATCCGCTGGGTGGCAGCAATACGGGCGCCAT 

TAG T C G C AT T GG AACC AT GAC C GG AAC TGGCACGTC C AC AT CC AC AAC C C T AAGC ACCC T G AAT AC AT AT CT CTTTCCCTGT GGCGC C GAC T C G G 
CCAGCAGTTGCGATCTGGACAGTAACTTCAGCCAGATGCTGGGCGCCGGCAGTGAGGGTGGCGCGAGCAGTTCCCTCAGCCAGACGACAGCGCCG 
GCGGGTGCGGGTTTGGGATTGGGCGTGCAGCTAGGGCTGGGCTTCATCAACAGTCGGCGGCGCATCAGGCGGCTATTCGGTGTGGGAGACATGGT 
GCCGCCGGTGGCAACGCCCACACCCTATGCAGCCGCCCTCAGGCGGCGCAGTTCGCAGTTGGTGCAAATGACCAAGAAGCGGGAGAAGGAACTAG 
AGGATCTGGAGCAGCAGCAGAGAATAGCAACGGTAGCCTCGGCCAATGCCGCCTTCCTCGAGCGCCGCGAGCAGCAGCAGCAGCAGCAGCAGCAG 
CAGCAACAGCAGCAACATCTGCAACAGACGCAGCAGCAGCAACATCTGCAGTTCGCCTTCAATCCATTGCAACCTGAAGCTGGGAAAAAGAAGAA 
AAAGAAGCCACCTGGACCGCCGGCGCACACGAGACAGCAATCATCGCGAACCATGGACGATCCAATGATGAATCGACCCAGGAAGAGCACACCTG 
CGGCCAGCAAGAAGAAGGACAACCCCTATGAGAAGTACCCATCACCAAAAATCCAAAAGCACCCACCGCTCTTAAGCCTCACCTATACGAAACAA 
AAAATGATAAAAAAAAAAACAAATATCAAAATATCTATGAAAATTATGCGGCAAAACAAGTGCTTCTATTACCATTTGATTTACGCCATTTGCTG 
CAATTCGTTTGATTTTCTTGCTTGTAACGATTTGAATAAAACGATTAAGAATACCTGCAAATACATAGTATTTTAA 
(SEQ ID NO: 1364) 

Start ATG: 1 (Reverse strand: CAT) 

MSSNNGNDHSQQQQQQATPATSVATPAAAATTTAPPEASAPGSGSGSGSGSQASRPATAPARSSSSSNNNNIATSSTSFYRVNLSNTNSSSINNN 
TTSCSSSSNNANQASNNNHHESSNSTLVVNTSSTASAGSTAAGGSTTAGATASSTATNNNHIYVNPHSYDSSTTGSVSGSTAVMLPLPLSLPLNL 
QHQPSAAGAAAGSASLDSQQQNPLGGSNTGAISRIGTMTGTGTSTSTTLSTLNTYLFPCGADSASSCDLDSNFSQMLGAGSEGGASSSLSQTTAP 
AGAGLGLGVQLGLGFINSRRRIRRLFGVGDMVPPVATPTPYAAALRRRSSQLVQMTKKREKELEDLEQQQRIATVASANAAFLERREQQQQQQQQ 

QQQQQHLQQTQQQQHLQFAFNPLQPEAGKKKKKKPP 

KMIKKKTNIKISMKIMRQNKCFYYHLIYAICCNSFDFLACNDLNKTIKNTCKYIVF* 
(SEQ ID NO: 1365) 

Classification: receptor 



Celera Sequence No. : 142000013385196 

GCATATTTAATGACCCCAAAGTTAAGTCTCACATGTTCGCACCTGGAATCTGCGAGGCGCGACCGAGACCAAACCGGCGGTGGATTGGATAAGGC 

ATGACATAGCCGATGAGCAGCAAGAGCGGAAAAACCAGAACGGCCGAAAGTGGGTTGGCTTGCATTTCGAGCCGTTTACTTCCGCATCTTCCTGT 

CCCGCGAGCGGTGATCTTTCTTAATGAACGTGATACGGGCAAATTGGCCAAGCAGAATCAGGCCAAACAGCTCGGCGGTAGTTTAAGTTTGTCCC 

GAAAACCACGTAGAAAAGCGAGGAAAGCTTAACAGTGGACCTGTAGCCATTGGTCATAAAGCATCGCAATTAAAGTACTTAGGAAATCAAAGGCC 

CATTGATGAGTATGGAAATATGTTATTTGTTGCATTGTATTTCGAGTTAGCAGATTTTTCATAAAGTGGTTTTCCTCTTGGCATTCAAAAATAAC 

AGTCAAAGTTTACAATAAGTAAATAAAGTAATTGTGAGCATACTCCATCAACTGATCAGCAGAGTTAGAAACTAATTAAAAAGAGTGACCACAAG 

AAAGTTGCTCCAAATCGTAAATAGACAACAATTGCGTTGATTACTTGATTCTGCGAAGTCATCGCCCTGAACAGTTTATATCGATTCGCATCCAA 

ATCGTCCAGCTTTAAGTCCAGTTAAGGCTGGAATTTGAATCTAAGCTGAATCTAACAATATATGCACAAAATTAAGTCTGCTATCTGAAAGTGAG 

TAACAGTTGTGCTCTGTTAAGCCCAATTTATTGCCCTGCATCATGATGAATCAGTTGCTGGAGCATCCCAAATATTTGTTGAGTATTCTTTCGCC 

TTTAGCGGTAGCTTAAGCTTAGGTTGTGGTTTGAACTTTGTACAATGTTTAGGGTTAAATATTACGGCATAAGTTAGCTTTTAACATTAGAGTTT 

AGGCGTTAGCCCGCCATAGGTTGCCTCCGTCCCTGGCTATCTCCGCAGGCTTAGAGCGGCTCCAGGGCCGCCGGTCCATGTGGACTGCCCTCCAG 

TTTGCGCCAGACCACGTTGCACATCTCGCGGACCAGCGCCTTCTCGCCGTCATCCATCTGGTCGAGTGCGCTTCCAGCTAGGTTGCCAAGCCGAT 

CGCGCTCCAGCTGCTCCCGCACCTCGAGCACCTGGAGGGCTCGAACCACCGCCTCTGGTCGTCCGGCAATCGGCTCCGGCAGTTGGATAGCAGCT 

GCTGCCGCCGCTGCAGCGGCCACGATGTCTTGCTGCTGCTGTTGATGTTGCTGCTGCTGCTGCTGCTGCTGGTGGGGCGATGCTGCTGATTTGGC 

TGGAGTGGCTGCAGATGTGACCGTTGGCGTGGGCGTGGCCTGCGGCGTGGGCGTCGTCGGCTGTTGTAGATGGGCCGTGGGCGTGGTGCGCTCCT 

CCTCCAGTTGCTGGATGCGGCGCTCCAGTGAGCGCACATACTGAAGCGTCTGGTCGTTCAGCGTGTGGGCATGACGTAGATCCTGTTGGGAATCA 

AATTGAAATCGCCATTAGGTCAAGTTCTTATTGCTGTTCGTTTTTTTTTTTTGCTCACCTGGGGTAGTTGCCGCTGCTCTGCATAGTTCCTCAAG 

GCGGTGATGGTCTCATCTGCATTCGTGCTGGAGCTGGAACCATCGCCGTCATCTCGAAGCGCATTCCTAGTGGCTGCTCCGCAACCGGCGCACAG 

CGCATTTGCCGCCACATGGGCACGCTCCTCGTCCAAGTAGAGGCAGAGCTCCTTGAGCTCTAGATTATCGGTGATTAGCTCTTGCTGCTTATCAT 

CCAGCTGACGTAGTTTGTTCTGCAAAGAGTATGTCAAGTCAATTAGTATTCAAAGATCTGGAGTTTCGTTCGGGGCTGGACCCACCTGGTACGCG 

GCCACCTCCTGCCGCATCACGCTGGCCGTGTATCGCCCAAATCTTTGCCATTCCCGCGCCAGCTTGCGTCCCTTCTGCCGGTCGTCGTCCAGGAA 

GCAGCACAGATCCCGCAACTCCTGGTTGTCGTCCAGCAGGCGCTGATTCTGCTCCTTCAGCGCACGCAGTTCGTTGACCAACGACTGCAACTGCC 

GGTTCTGCTCGGCGGTCAGGCGACCTCCGCCATTGGCCGTGGGTATTCCCGTGCCGCTGGACGTGACCGAGGCATTGGGCGATGGGTGTTCCTGG 

TCTGGCTTGCGCACAAACTTCAGCATATCCTTCGCTCCCGGCTGGTGTGCTGGTAGATATCGCGACTGTTGCTGCTGTTGTTGTTGTTGCTGCTG 

CTTGATGGAGTCGGGAATGTAGATGCTAGATAGCTGGGGGATATCCGGCGGATACTTGAGTGGCAAGTGCGGCTTTAGGATCCCGGTGGCCGGCT 

GCGGCGGTGGCTGGTATCTCGGCGGGATGGAGTGGGCGGGTGCCGGTGCCTTCAGCTGCGGGCTGCGCAGAGATTGCTGCTGATTTTGCAACTGG 

TAGTGGAAGCTGCGGCGCTGCTGGCCGGCTAGATTGGGCAATTCTTTAGTCGATTGCTGCTGGATGGGCAGGCGGGAAGTCAATGCCTGCTGCTG 

CGTGGAGTGCGGCTGGTGTATCGTGGCCACCGTCTGCAATTGATGTGATTGTTGCTGCTGCTGCTGCTGTTTTTGGGCTGCCGACTGAGCTGTTG 

GCGTTGGCGTTCCGCTTTGTTGCTGCGATGAGTGTGGCTGGTGGGGATTCGTTGCTCCTCCTGTTGCTGGCAGCAATGAGCGACGCTGGTGGATT 

TGTTGTTGTTGGTGCTGCTGCTGCTGCTGCTGCTGTAGTTTGTTAATTGCCTGCGTTGCTGCCTGACTTTTGCCTACTATTTTGTCAACGCTATT 

TATTTTTATCAAATTATGGCCCAAAGTGGTTGCTTCGTTTGTTGCTGCTGCTGCCGCGTTGTTAGTTAATGACTTCTGTTGCTGTTGCTGCTGCT 

GTTGATTGCCGGACATTTTCGTGTTGGTTCCTTGTTCGCCTTGTGTCGTTTCGGTGTTCCGTCCGTCCTCTGCCCGTCCCTCGCCCTGATTTCCC 
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GGTGTCTATGACCTCGCAGTCATATTGTCAAGTCACTGCAAGATAGCACAAAAAAAAATGGGCAGGGCATTAGCGAAAATCTTAAGTGGAAAAAG 

CTATAGGCCAGCCATAAATCAAGCCATTGCTCGATGTAATTTATGCGACAGCCGAGGGCACGCCGCCGGAGCAGCATAATTAAACCTCCATTAGA 

CTGCCGCGAGACGCTGATAGAAAGTGAAATGAGCATTTATCGCACCACTGGAGCCATGACATGGCACACGAGGCGGGCACACACGTCCACATCCA 

CATCCACATCCACATCCAAGTCCACGTCCTCGCCACGGAGTGCTCAACAAAGTGCAGCGAAGCGTCGCCAAAATGTCGCCAGTACACGGAGAAAA 

AAGGTCCCCCCAACTAATATTATATATAATGATTTTTCATATGACTCATGGAATTTTTATGTTTTTGTTTTTAGGACACGCCATCATAGAAATTC 

TTAATCAACACTCTGACATACAATAACTTGTTTAGCCTTGGAGGCTCTTATTTCTTTCGGTGTGGCAAAGGGAACATGTCGTCGGTCTCCGCTTT 

CTGGCTGGGATCGGATGGGCTGGCGCTGAAGCGTCTAATCACCAGATGTGCGTCAACTAAACGGAGGCTTCCGCTTCTGGTTTGGAAACCGAAAG 

CCAGCCCGATGCACTGAATCAACACATATCACCGCATTTGTATCTGGAAAGTGAAATTGTTCATGCTTTGATGGATGGCGGCTCTCTGGGGCAGG 

AAAATTGAAATGTTGCAGAGTTCGGAGATTGACGTGATTATGGTAGTTAGGTAGAGCAATTTGTGGTAATAACAAGATTGGTATTGCAACAGAAC 

GTTTCACCAAAATATGTTTGGCATAATATATTTATATAATAGGGTGCTAGTCATATCAATGATCCA 

(SEQ ID NO: 1366) 

Exon: 2961. .1891 
Exon: 1824.. 1579 
Exon: 1507.. 1001 

Start ATG: 2961 (Reverse strand: CAT) 
Transcript No. : CT35453 

ATGTCCGGCAATCAACAGCAGCAGCAACAGCAACAGAAGTCATTAACTAACAACGCGGCAGCAGCAGCAACAAACGAAGCAACCACTTTGGGCCA 
TAATTTGATAAAAATAAATAGCGTTGACAAAATAGTAGGCAAAAGTCAGGCAGCAACGCAGGCAATTAACAAACTACAGCAGCAGCAGCAGCAGC 
AGCACCAACAACAACAAATCCACCAGCGTCGCTCATTGCTGCCAGCAACAGGAGGAGCAACGAATCCCCACCAGCCACACTCATCGCAGCAACAA 
AGCGGAACGCCAACGCCAACAGCTCAGTCGGCAGCCCAAAAACAGCAGCAGCAGCAGCAACAATCACATCAATTGCAGACGGTGGCCACGATACA 
CCAGCCGCACTCCACGCAGCAGCAGGCATTGACTTCCCGCCTGCCCATCCAGCAGCAATCGACTAAAGAATTGCCCAATCTAGCCGGCCAGCAGC 
GCCGCAGCTTCCACTACCAGTTGCAAAATCAGCAGCAATCTCTGCGCAGCCCGCAGCTGAAGGCACCGGCACCCGCCCACTCCATCCCGCCGAGA 
TACCAGCCACCGCCGCAGCCGGCCACCGGGATCCTAAAGCCGCACTTGCCACTCAAGTATCCGCCGGATATCCCCCAGCTATCTAGCATCTACAT 
TCCCGACTCCATCAAGCAGCAGCAACAACAACAACAGCAGCAACAGTCGCGATATCTACCAGCACACCAGCCGGGAGCGAAGGATATGCTGAAGT 
TTGTGCGCAAGCCAGACCAGGAACACCCATCGCCCAATGCCTCGGTCACGTCCAGCGGCACGGGAATACCCACGGCCAATGGCGGAGGTCGCCTG 
ACCGCCGAGCAGAACCGGCAGTTGCAGTCGTTGGTCAACGAACTGCGTGCGCTGAAGGAGCAGAATCAGCGCCTGCTGGACGACAACCAGGAGTT 
GCGGGATCTGTGCTGCTTCCTGGACGACGACCGGCAGAAGGGACGCAAGCTGGCGCGGGAATGGCAAAGATTTGGGCGATACACGGCCAGCGTGA 
TGCGGCAGGAGGTGGCCGCGTACCAGAACAAACTACGTCAGCTGGATGATAAGCAGCAAGAGCTAATCACCGATAATCTAGAGCTCAAGGAGCTC 
TGCCTCTACTTGGACGAGGAGCGTGCCCATGTGGCGGCAAATGCGCTGTGCGCCGGTTGCGGAGCAGCCACTAGGAATGCGCTTCGAGATGACGG 
CGATGGTTCCAGCTCCAGCACGAATGCAGATGAGACCATCACCGCCTTGAGGAACTATGCAGAGCAGCGGCAACTACCCCAGGATCTACGTCATG 
CCCACACGCTGAACGACCAGACGCTTCAGTATGTGCGCTCACTGGAGCGCCGCATCCAGCAACTGGAGGAGGAGCGCACCACGCCCACGGCCCAT 
CTACAACAGCCGACGACGCCCACGCCGCAGGCCACGCCCACGCCAACGGTCACATCTGCAGCCACTCCAGCCAAATCAGCAGCATCGCCCCACCA 
GCAGCAGCAGCAGCAGCAGCAACATCAACAGCAGCAGCAAGACATCGTGGCCGCTGCAGCGGCGGCAGCAGCTGCTATCCAACTGCCGGAGCCGA 
TTGCCGGACGACCAGAGGCGGTGGTTCGAGCCCTCCAGGTGCTCGAGGTGCGGGAGCAGCTGGAGCGCGATCGGCTTGGCAACCTAGCTGGAAGC 
GCACTCGACCAGATGGATGACGGCGAGAAGGCGCTGGTCCGCGAGATGTGCAACGTGGTCTGGCGCAAACTGGAGGGCAGTCCACATGGACCGGC 

GGCCCTGGAGCCGCTCTAA 
(SEQ ID NO: 1367) 

Start ATG: 1 (Reverse strand: CAT) 

MSGNQQQQQQQQKSLTNNAAAAATNEATTLGHNLIKINSVDKIVGKSQAATQAINKLQQQQQQQHQQQQIHQRRSLLPATGGATNPHQPHSSQQQ 
SGTPTPTAQSAAQKQQQQQQQSHQLQTVATIHQPHSTQQQALTSRLPIQQQSTKELPNLAGQQRRS FHYQLQNQQQSLRSPQLKAPAPAHSIPPR 
YQPPPQPATGILKPHLPLKYPPDIPQLSSIYIPDSIKQQQQQQQQQQSRYLPAHQPGAKDMLKFVRKPDQEHPSPNASVTSSGTGIPTANGGGRL 
TAEQNRQLQSLVNELRALKEQNQRLLDDNQELRDLCCFLDDDRQKGRKLAREWQRFGRYTASVMRQEVAAYQNKLRQLDDKQQELITDNLELKEL 
CLYLDEERAHVAANALCAGCGAATRNALRDDGDGSSSSTNADETITALRNYAEQRQLPQDLRHAHTLNDQTLQYVRSLERRI QQLEEERTTPTAH 
LQQPTTPTPQATPTPTVTSAATPAKSAASPHQQQQQQQQHQQQQQDIVAAAAAAAAAIQLPEPIAGRPEAVVRALQVLEVREQLERDRLGNLAGS 

ALDQMDDGEKALVREMCNVVWRKLEGSPHGPAALEPL* 
(SEQ ID NO: 1368) 

Name: putative delta antigen interacting protein A ortholog 
Classification: nucleic_acid_binding 



Celera Sequence No. : 142000013384668 

GGGTGAGCTGAAACAGCAAATTAGCTGACTCGCAGAAGAGCCAGAAGTTCAGGGCGACAGCCACCACCAGCAGGATAGATTCCTGTGATAAGGAG 
ATCCCGATGGCCTCCACGGCCTGTCCCACGTGACGCACGACAACCGGATATAGTGGATAGAAGGCCAACGTGTTTTCGTAGCTGTACAGATTCTC 
GGCTATATGCAGAAAGTATTCTCCATCCCAGTGCCGTAATCCTCCCAAGCAGCGCTTAATCACCTTGTCTATCCAGGAGGCATTCTGATCTGACG 
AAACCGGCATCCGGAATACATCTGGTTTGTGCTCTGGCAGGGCTCCGTTGGCGACCAATTGTACCAGGAGGACGATAAGGCGACTGGCCAGCGCC 
AGTTTAGTCACTTTCTCTGTCATTTCAGATTTCTAGGTCTTGGGTGGAGTCGATTATCATTCGGCGGGCATGTGATTCATATCAGGTCGAGGGCA 
TACCTCACAGCTAAAACGATGTGGACGGATCTTCTGGCCTGGACACTGCTGCGGAGGGAAAAAGGCAAATCAGCTCCTCGGTTGCTTCTTGAACT 
GCCTTCTCTTTTGGTGACATTTACTTTGTGAAGCGGTTTTATTATTATATATTATTTTTGGGCAGAAGCCCGGAAAAGCAGAGCATGGTGTGACC 
AGAGGGTTTGAGGAAGCATTAGACACTTTCGATATATTATCGATACGAGAATAATATATTTAATAAACAGAATTAAATTAATATGTAAACTTAAT 
TCTCGTTGCTCTCGAACTATTAATTTAGTGTACAATACAGCCTTATAATTTAATTTAAAAATAAACAGTTAATACGTCACGTGAATTGCAATCGA 
TACTATCAAACTAATATCGTTGATTGCCCCACCTCAAATCGATATGGCACATTATAGTGGCGGCATTTCCTAAATAAACTGCCGCTTTTATTTGG 
CAGTTGGGTTGAATTTAAGCTAAAATTTCACTCATAAATGAAGTGTCGCAATGCCGAGCAGTGCAAAGAAGTCGGACTCTGCAAAGAGCAATAGC 
TCGACGCCACGAAGACCGCGAAAGCGCAAACGAGTGGATCGCCAGAGCCTGGACTCGCAGGTCGATGAGGATGCCAAGGAGATCGATCGCCGGCT 
ACGGAAAGTGGCCAAGAAGAACTCCATCAGTGCCGAGGACATGCAAAAGGTGGTCCGGAAGGTGGTGCGCAACGATCATGTTCTGGCATTAGTCA 
CTTTGAAGGCGGAGGACGAGCTGGCGCGGGAGAAGTTGGAGTCGGCGGAGCAGCAAAAGCGGGGTGCCATCATCATAACGGACACTCCATCTGTG 
CCCAAGCTAACGCGAGCGAAAGCACGGGAACTGAACTGCACGCCTGGGATCTCGCTGCCCCAGCTCAATGAAACTTCCACCGACAATAATGGTAT 
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AGAGGCTCTAATACGTGAGGATTTGCACTCTGATGAGGAGGACGAGGAGTATACCTTCAAGGAGGAGGACTTTCATGTGAGTATAAACATTGAGG 
CCAATTGATCGTTGTTCAATAATTGTTATTATTCCTTTATTATCAGTCCGATGATGATCCTAACACTACAGCATCGGATTTCGACTCCAATCCCT 
GCACGCCTCAAACTCCTCTTACTGCCAATGAGGAATCTCCTATCAAATTCTCTGATGACGGTTGTTTCAAGGTGCCGTTGGACAGAAATTTTCAG 
GATAAAGAGGATCTTAGGATTGCCACTAGAACGCGCTCCAAATTTTGTTTGCAGCAAACTACCATTGAGGATTTGCAGTCGGAATTTGTCCCGCC 
GGACGTGGAGCCTAGCGATGTGGTGGAGAACGAAATGACCGCCAACGATGCGGATTGGATGCAGTTCCTCAATGACTTTACAAAGCCTTTAAGTA 
TGTAGTATCAATGGATGCTTTTATGCTCTCCCTAACTTTAATATCTTACAGACAATACGGTCCAGGGCGACGATGATGATCCTATTAACGATCCA 
GAATATGTGGCCGCTGATCACTTGGAATCGGAATATGAGAACGCCGAGGAGCTGCGCGATGTAAATATATCTAAAAAAGAGTTAACTGACCTGGT 
CTCTGAGCTGTTTGCTGGTTTGCTGGAAGAAGGAGTCTCTCTGGAATCGATTGAATTGGAAACACCTCAAAAATTCGTTGAAAGTGCCAACGTTT 
CGATGATTCCCATTGATCAAGTGTCCCGTAACATCGATTTTGATGGCCAGCAGATAAATGCTAATCAGCAGGAGCTGTCCGTTTTTCAACCTAAT 
CTAATGCCGCCTCCTTGCGAAGGGAACGAAATGGGCATGGGTGTGCCTGTCCCTGTGGCTGTGCTGCAGGAGGAAACTGGTGGATCCGCTTGCAA 
TGTGGTAAACATACCTTCCGAGGCGCTGACTGTTCCCCCCGAACTAATAGCTGTTCCAATACCTGGGCAACCTAATTGCTTCCAAATCGCCAAAG 
TTGTAGGCAATGATTCAACTGTTGGTCTTCCATTAACTGAAGAGACACTGCAGAACTTTTCTAGCTTGCATCCAATGGAACCCCCCGTGGTAGCT 
GCAAGCAATGGAGATCGATATAAAAAACCATTTGATACGAACTTCACCTGGGAATACATCTCGGTTAGGAAACATATATATTCGGAGTATGAGGA 
AAAGTTTGAGAGCCTGCGAAACGTAGAGCCAGTAAACCTCAATCATCTCGCCAGGAGCAGGGGATTTACCCAGTCTCAGCATGATATGCTTCAGC 
AGCAGCTTCGCATTCACACGCAGTTGCTCAGTCAAAGCTATCTGCAAACCTACTCCCATCCTACGTTGTACCCCATGGCCAAGAAGCCCAGAGAA 

AT GTT GG AAG AAT T GC AC AAAAT G GC AAC T AAAG AT C C C AGT T T T AAT T G C T G GAAT CT GC AGGAG GC T GT T C AG C T GGT T AGAAAATGGG AAC A 

TGATCTGTCCAGTGATGAGTTTAAGGAGGAGAACGAGAAGATGATGGACTTCATCAATAAGGAAACGGATCTGACGTAAGTTGGTGAAACTTTAA 

ACAAAACATGCTGCTGTACAACTTGTATTTACTATTTTAGTGATGGCCACACGCGCCAGGTTCCCAGGCTGGCGCCTCGTATTATGGATCTCATG 

CTGGATAGCCGGGTGTTTATGTATCCGCAGTACCTACCCCGTATGGCGTTTCAACCGAAACTGACTCAAACCACCGCTTACGCACCATCCGAATA 

TCAGCTTATTGCCATTGGCCTAGAGAAGCACTTGGCATTAATGGTAAAAGCCAAGAGACAACTACGCAAAGGTGCAGATCCCATTCGCGTTGCCT 

GTAAGCGAATGATTAAGGATACGATTTGTGGAAAGACTGCCCGCCGAATTTTCTTGAAAATCGCGGAGCTACGCAACTCAAAACAGTACAATCCA 

GTGAAGTATTTCTTTGAACATCGTCGAGCACCGCCCGCTAACGCCATCCTTTGGGGATTCGCTGGTGGGCTGGTGAGAACTCCGCGGGAACGACA 

CATGGAGCTTCCCAGTGGCTGGCAATACTACATTGAGGTAAGGAATACAATGGGTTATTATCGGATACATTGTTTAACATTTAATAAGCATCAAA 

AAGGGGGAAAATCTCAAAGGGTAAATAAGATGGGATTAGGAAAACTAGCTTAGTTGGTGGCCCACTCGATCTCTGGGTATGAGGCGCAGCTCGGA 

TCCGTGCT T T AGG AAC AC AT T AC C AGC AGT C T GC T GT GGG CT G AT GC C AAT T C CAT C GT C C GT GATGAT GGC ACT C GT C T C C GC C GG AAC CT T G A 

ACTCTTCAGAGGCGAATTTCAGCACAGCCGTAAAGGGAGTTCCCTCCGGAACACTAAGCACCTTAAAGGGCAGCTTGGGGTCCGAAGTTAGCGTG 

ATTTTGAAGGTTACTTTGCTCATAGTGGAATGTGCAGACGTGGTCAGTTTTGATACTCTATGGGAATAAACGGGTTTCCGATGTTTTCCACAGGC 

AGAGTGTTTGAGCCCTGGTGGTTAAAGTTTAAACAAGTATAAATTAGTTGGTTTGAAAATAGTTTTAAGGGGTATTTTATTTAAAAACAATAAAA 

TATGTTGACGCTCTTACTTTAGTGCGGTCAATATTACGATAGGCAAGGCAGCGAGTTATCGTCTGTGATGAGTAATATTTGCATAAATGTACCTA 

TGTCCGTACCTAATTAGGAATACATGTTTAGAACTTTATTAATTTTTCTCTTAAATTTTAAGTAATATACTTGCTAGTGATTGTTGGCAGAATTA 

TTTTACGAAATAAATCACTGGTCATCGGTGGCTAGGTACCATCGACCAAAGTTATCGATATCAATATACATAGCTTGTGCCATCGCTGTGTTTAT 

AAATCGCGGACAATTTTCAATCTATTATTTTTCAGAAAACTTGGGATCAAAAGCGCACACGTCGCAATTTTAAAGCATCGAAAGCATCGTCGGTG 

GCTAGTGCTTCTTATTTGGAGTTTGTACGCGAGGCGGTCGGCGAGGACCTGCTGCTGCCAGAGACTTTTGGTCAGGCTGCGGTCCCCAATGGCAG 

TGCTGCTGGTTCGGAACCGCCCAAGGAGAGCAAACCGAAGAAAAGGCCAGCGCAAAGGAGACCCCGCCCTGCGGAAAACCCACCACAGCTTACCA 

TCAATGTGAATTATGTTTTTGGCGGGAATACGAGCGCGGGAAACATACCCGGGGTCTCTGGGGTTTCGGTGCCCTTGGTGGCCGGCGGAAATCTG 

GCCGAGAACGTACTGAACATCAATCGCAGCTTATACACCGCCGCCGAGTGCAGTAGTCTGCTGGCCTTGCCGCCTGCTCCTCCACCATTGGATGA 

AAATGCTCCGCCAGCCATCAACTTCACAGTAGACAAGTTCACTAACACACTACAAATCTGCGAATTGGAGGAATCCAGCAAGGCCGTACAGCCGG 

ATCTGTTGCCTGCGCAGGATACCCCTTCCGTCCCAGGCAGAAACGCCAAAAGGGCGCGATACCAAATCTTTAAGCCAAAGGTTCAATCCGCCAAT 

AAATCGCGAAAAGGGTCAACACCAACCCGCAGTCGATACCATATATTGCGCGAACGTTTGCGCCATAAACTAAGCCAGCGCTGTCATTCCCTAAT 

CACCACCTATGAATCCTACCTGCGTCATCACATCGAAAAATATAAAGACTGCGAGTTGGTACAGGTCGTGCACAATCGTTTTTTCGCCCTCGAAC 

TGTACACTCAACTTCTGGTCGATCTGAGGATGTTCTGCAGCAATACAAACTCTAAAACGGCAGCGCAGCCAAAGAGATCTAGGAGTGATGATGAG 

GCGGCGAAGCTGAGAAGAGCCAACCGGCAGGAGGAGATGCTTCGCCAAATGCTCCAGCCGGATACTACAGAAGAGCGCAATCGCAAAGATGCCAG 

TAAGTTTTAGTTCAATCCCATTTAATGAAATTCAACTTGAATAATTTTATTATTTTGCACGTTCAGTTTTTGCCATGAATTTTTATGAAAAGGTA 

GAGGAGGCCTTGCTGAGTGCCAATCGCCTAGATGATTGCAAAAAATTCAACAGTATTTTGCAAAACTTCGACCCCAGGCAGGAGAAAGTTTCATC 

TTTGTATTTGGTATGTATCACTTAAGTGTCTTACTCTTATTGCATTCTAATTTTTTTTTGGGGTTCTTAGAAAGTGGAAGAAATTTTGCTACCCA 

CTCATCCCGAACTGGCTGAAGTATTCCTCAACTTTTTGCTACCTGCCGAGGCCGCTGAGATGGGAAAATTCTTTGAGCACTTTATGATATGCAAT 

GCTACCAACTTTATCAACAAACTTAACATTTACTTCAGTAAGCAGCCAGCTCAAATCAGGAAGATTTATGCCTGTCTAAACGAGTTGGCCGAAAT 

GCCTAATGTCAGCATGAAAAAAGTTGAAAATAAGATAATGCCCTTGCTCAAGGGCAATCAATTTCTGATCGATTGGTTTGTGCAACAATTCCCAC 

AGGGAAAACCACCAAAAAGGTAAGAATGGTAAACAAAATACTCGAGATTGATTTAATGTAACATTGTCTTTTATTTCCAGAATCCTTTCTCCTGT 

AGAGACCATAAATTTGCATGACGTTCAAAACAATACAACGGGAGAATATACGGAAACTATTCAAGAGTTTACAGATGTACCTACGCCAAGTAAAC 

CAGGTTGTCACTTAAAGTATATCAATGGTCGCATATTTTATGGAGCCAAGATCCTATTGCCTGCCAAACTGTCTTTTATGGCTGCCAGTATTTAC 

AATGAACAATCTCATGACGTGCCACCCGTCTCCAGTGACTCGATAAACTGCGCCCATGGTATTCGTGCCCAGGGAGAAAAACTTCTTAGTATGGC 

TTCCAATACAGATAGCGACGATGGCGCGGATCGGAGTGAGGAAGATGATGCCAGCCGCGCCTTGATTATTGATACCACGAGTGATGAGCAGAACA 

GCTGCTCATCTATTGAGATGTGCGATGAGACAGGCTTAAGAGCACATGCAATCAGATTGAATCCCGGCTTGTATACCAACTCAAGCTTTAACAAT 

GCAGCGACCAGTACGCCCAATCTGGGCAACTCTCACCAATCCCATCATCCGCACGCCAAAAAAACTGCCATAAACTTTGGTGAAATATCCCCACG 

CAAGCAATCGGCATTTAACAACTCGGGATTGAGTCCCTTATCGGCGGCCCCCAACAGTTCCCCTCAAGTCGACAAACGAAAGTCACCTACTAAGA 

AGATCCGATCTCCTCTTAATCAAACTCAGAGCAGACAACGTGCCAATAATCAGCCAATGGTTGTTATCCATGAGTCTCAGCCTTCATCTTCGCCA 

GCCATTGAGTGTGCAAAGCGCTTGCGCACGCTGATCGATCAGGAGAGCGTGCGGGATGCAGCAGATACAAAGGCGGGCATTCGAATTATTGCGCA 

AGCCAAGCAGCAACCTCCAAACCAATGCAGCTCTGAAGCTATTACGAAAGTCGAGGTAGTCAGCCTGGATGAAAGTGCCGAGAATATGCCTTTCT 

GTACAAATCTCAAAATGGACACAGATCAATCGGAGGAGGAGCTGGAGCCACTGCAATTAAGTGCTGGCAACACACCACAGCACGTGATCACAACA 

CAATTCGATAGCGATGAGGACTGCAAGCTGGATGTTGTGGCCACGCTGGCGAACTGCACCACCACCGAAGATAGATCGGAATCGGCAGCGGCATT 

ACCTGCCACCACATCGCACTCTGAGTCTGGTAGTTCCCACTCGGCACCTTGGACTCGTGAAGAGGACAAAGTAATCCTGATCGAAATGAAAATAG 

GGGCACGGGATCGAGAGCAGCTCATTCGGCGTATGCGCGCCAAGCTAAAGAATCGCAGTATAGAAGAACTACGTAGTAGGCATCAGTTTCTTATG 

GACTTTCTGTCCAAATTGCAGGGAAAGTGAGGTTTCTCTAAGATGGTCTTTTGTGTATTATAATCCAAGCTGAAATAGGATACTAATAATAACTG 

TATATAAACGAAATGGTTTACCCCATACCTACAAACCAAACTTTTTGAAGGGAATTCTTTGTTGTACTGATTACCTATTTAAGCATACAAAAATT 

CCCTTCAGAATACTTAGTTTTATTATTAAAAATTTGAATATTTTGAAATCTACTGTACGTTGAAATTGACTATTAATAGATATATTGAAACCATG 

AATTTAATATTTTTGTGTCCGTCTAAGTTTACTTTTCTATATGATAAGAAGGTCGCATTGTTTGAATTTGATTATTCTATTCTATAGTTTTGGTT 

AACGAGCACATTGTTTTCCATTGTCTTAAAGATTGTAAATATATAAAAATATAACATTTTTTAAAATCTCTAAAATGTATCTGATATTGTCAAAC 

CAGCTTGTTCGGGAACTTCAAACGTATTCGCATTTGAGAACTTAAACAAACCTGGAATTGTTCCTTTAAGAAGTACGACGTAAGACTAAAATATG 

GAGCTCTCGTTGGATGAACCGCAATGCGCATCCTTCTTTTGCATCCTGGGTGCCGTGTGCGCCATTGTCTTTTCGACATTGGGAGCCGCCTACGG 

AACAGCGAAGGCTTCTGTGGGAATCTCTTCGATGTCAATCAAGCATCCGCAGCTGATCATGAAGGCGATTGTTCCAGTGGTTATGGCTGGCATTA 
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TAGCCATTTATGGACTGGTGATCGCGGTCCTGCTTGCTGGATCACTTAGCAGCCCCTATAGCGCCTACAAGGGTTTCCTAAACCTCAGTGCTGGA 
CTGGCGGTGGGAGTCTCTGGGATGGGGGCTGGAATTGCTATTGGCGTGGTGGGCGAAGCTGGAGTCCGTGCATCTGCCCAGCAGCCAAAACTCTT 
TGTGGCCATCATTTTAATATTGATATTTGCCGAGGTCTTGGGTCTGTATGGTCTCATAGTGGCCATTTATTTGTTTTCCA 
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ATGCCGAGCAGTGCAAAGAAGTCGGACTCTGCAAAGAGCAATAGCTCGACGCCACGAAGACCGCGAAAGCGCAAACGAGTGGATCGCCAGAGCCT 
GGACTCGCAGGTCGATGAGGATGCCAAGGAGATCGATCGCCGGCTACGGAAAGTGGCCAAGAAGAACTCCATCAGTGCCGAGGACATGCAAAAGG 
TGGTCCGGAAGGTGGTGCGCAACGATCATGTTCTGGCATTAGTCACTTTGAAGGCGGAGGACGAGCTGGCGCGGGAGAAGTTGGAGTCGGCGGAG 
CAGCAAAAGCGGGGTGCCATCATCATAACGGACACTCCATCTGTGCCCAAGCTAACGCGAGCGAAAGCACGGGAACTGAACTGCACGCCTGGGAT 
CTCGCTGCCCCAGCTCAATGAAACTTCCACCGACAATAATGGTATAGAGGCTCTAATACGTGAGGATTTGCACTCTGATGAGGAGGACGAGGAGT 
AT AC C T T C AAGG AGG AGG AC T T T CAT T C C GAT GAT G ATC C T AAC ACT AC AGC AT C GG AT T T CG AC TC C AAT CCCTGCACGCCT C AAAC T C CT C T T 
ACTGCCAATGAGGAATCTCCTATCAAATTCTCTGATGACGGTTGTTTCAAGGTGCCGTTGGACAGAAATTTTCAGGATAAAGAGGATCTTAGGAT 
TGCCACTAGAACGCGCTCCAAATTTTGTTTGCAGCAAACTACCATTGAGGATTTGCAGTCGGAATTTGTCCCGCCGGACGTGGAGCCTAGCGATG 
TGGTGGAGAACGAAATGACCGCCAACGATGCGGATTGGATGCAGTTCCTCAATGACTTTACAAAGCCTTTAAACAATACGGTCCAGGGCGACGAT 
GATGATCCTATTAACGATCCAGAATATGTGGCCGCTGATCACTTGGAATCGGAATATGAGAACGCCGAGGAGCTGCGCGATGTAAATATATCTAA 
AAAAGAGTTAACTGACCTGGTCTCTGAGCTGTTTGCTGGTTTGCTGGAAGAAGGAGTCTCTCTGGAATCGATTGAATTGGAAACACCTCAAAAAT 
TCGTTGAAAGTGCCAACGTTTCGATGATTCCCATTGATCAAGTGTCCCGTAACATCGATTTTGATGGCCAGCAGATAAATGCTAATCAGCAGGAG 
CTGTCCGTTTTTCAACCTAATCTAATGCCGCCTCCTTGCGAAGGGAACGAAATGGGCATGGGTGTGCCTGTCCCTGTGGCTGTGCTGCAGGAGGA 
AACTGGTGGATCCGCTTGCAATGTGGTAAACATACCTTCCGAGGCGCTGACTGTTCCCCCCGAACTAATAGCTGTTCCAATACCTGGGCAACCTA 
ATTGCTTCCAAATCGCCAAAGTTGTAGGCAATGATTCAACTGTTGGTCTTCCATTAACTGAAGAGACACTGCAGAACTTTTCTAGCTTGCATCCA 
ATGGAACCCCCCGTGGTAGCTGCAAGCAATGGAGATCGATATAAAAAACCATTTGATACGAACTTCACCTGGGAATACATCTCGGTTAGGAAACA 
TATATATTCGGAGTATGAGGAAAAGTTTGAGAGCCTGCGAAACGTAGAGCCAGTAAACCTCAATCATCTCGCCAGGAGCAGGGGATTTACCCAGT 
CTCAGCATGATATGCTTCAGCAGCAGCTTCGCATTCACACGCAGTTGCTCAGTCAAAGCTATCTGCAAACCTACTCCCATCCTACGTTGTACCCC 
ATGGCCAAGAAGCCCAGAGAAATGTTGGAAGAATTGCACAAAATGGCAACTAAAGATCCCAGTTTTAATTGCTGGAATCTGCAGGAGGCTGTTCA 
GCTGGTTAGAAAATGGGAACATGATCTGTCCAGTGATGAGTTTAAGGAGGAGAACGAGAAGATGATGGACTTCATCAATAAGGAAACGGATCTGA 
CTGATGGCCACACGCGCCAGGTTCCCAGGCTGGCGCCTCGTATTATGGATCTCATGCTGGATAGCCGGGTGTTTATGTATCCGCAGTACCTACCC 
CGTATGGCGTTTCAACCGAAACTGACTCAAACCACCGCTTACGCACCATCCGAATATCAGCTTATTGCCATTGGCCTAGAGAAGCACTTGGCATT 

AAT GGT AAAAGC C AAG AGAC AAC T AC GC AAAG GT G C AG AT C C C AT TCGCGTTGCCT GT AAGC G AAT GAT T AAG GAT AC GAT T T GT GG AAAG AC T G 
CCCGCCGAATTTTCTTGAAAATCGCGGAGCTACGCAACTCAAAACAGTACAATCCAGTGAAGTATTTCTTTGAACATCGTCGAGCACCGCCCGCT 
AACGCCATCCTTTGGGGATTCGCTGGTGGGCTGGTGAGAACTCCGCGGGAACGACACATGGAGCTTCCCAGTGGCTGGCAATACTACATTGAGAA 
AACTTGGGATCAAAAGCGCACACGTCGCAATTTTAAAGCATCGAAAGCATCGTCGGTGGCTAGTGCTTCTTATTTGGAGTTTGTACGCGAGGCGG 
TCGGCGAGGACCTGCTGCTGCCAGAGACTTTTGGTCAGGCTGCGGTCCCCAATGGCAGTGCTGCTGGTTCGGAACCGCCCAAGGAGAGCAAACCG 
AAGAAAAGGCCAGCGCAAAGGAGACCCCGCCCTGCGGAAAACCCACCACAGCTTACCATCAATGTGAATTATGTTTTTGGCGGGAATACGAGCGC 
GGGAAACATACCCGGGGTCTCTGGGGTTTCGGTGCCCTTGGTGGCCGGCGGAAATCTGGCCGAGAACGTACTGAACATCAATCGCAGCTTATACA 
CCGCCGCCGAGTGCAGTAGTCTGCTGGCCTTGCCGCCTGCTCCTCCACCATTGGATGAAAATGCTCCGCCAGCCATCAACTTCACAGTAGACAAG 
TTCACTAACACACTACAAATCTGCGAATTGGAGGAATCCAGCAAGGCCGTACAGCCGGATCTGTTGCCTGCGCAGGATACCCCTTCCGTCCCAGG 
CAGAAACGCCAAAAGGGCGCGATACCAAATCTTTAAGCCAAAGGTTCAATCCGCCAATAAATCGCGAAAAGGGTCAACACCAACCCGCAGTCGAT 
ACCATATATTGCGCGAACGTTTGCGCCATAAACTAAGCCAGCGCTGTCATTCCCTAATCACCACCTATGAATCCTACCTGCGTCATCACATCGAA 
AAATATAAAGACTGCGAGTTGGTACAGGTCGTGCACAATCGTTTTTTCGCCCTCGAACTGTACACTCAACTTCTGGTCGATCTGAGGATGTTCTG 
CAGCAATACAAACTCTAAAACGGCAGCGCAGCCAAAGAGATCTAGGAGTGATGATGAGGCGGCGAAGCTGAGAAGAGCCAACCGGCAGGAGGAGA 
TGCTTCGCCAAATGCTCCAGCCGGATACTACAGAAGAGCGCAATCGCAAAGATGCCATTTTTGCCATGAATTTTTATGAAAAGGTAGAGGAGGCC 

TTGCTGAGT GCC AAT C GC C TAG AT GAT TGC AAAAAAT T C AAC AG TAT T T T G C AAAAC T T CG AC CC C AGG C AGG AG AAAG T T T CAT CTTTGTATTT 
GAAAGTGGAAGAAATTTTGCTACCCACTCATCCCGAACTGGCTGAAGTATTCCTCAACTTTTTGCTACCTGCCGAGGCCGCTGAGATGGGAAAAT 
TCTTTGAGCACTTTATGATATGCAATGCTACCAACTTTATCAACAAACTTAACATTTACTTCAGTAAGCAGCCAGCTCAAATCAGGAAGATTTAT 
GCCTGTCTAAACGAGTTGGCCGAAATGCCTAATGTCAGCATGAAAAAAGTTGAAAATAAGATAATGCCCTTGCTCAAGGGCAATCAATTTCTGAT 
CGATTGGTTTGTGCAACAATTCCCACAGGGAAAACCACCAAAAAGAATCCTTTCTCCTGTAGAGACCATAAATTTGCATGACGTTCAAAACAATA 
CAACGGGAGAATATACGGAAACTATTCAAGAGTTTACAGATGTACCTACGCCAAGTAAACCAGGTTGTCACTTAAAGTATATCAATGGTCGCATA 
TTTTATGGAGCCAAGATCCTATTGCCTGCCAAACTGTCTTTTATGGCTGCCAGTATTTACAATGAACAATCTCATGACGTGCCACCCGTCTCCAG 
TGACTCGATAAACTGCGCCCATGGTATTCGTGCCCAGGGAGAAAAACTTCTTAGTATGGCTTCCAATACAGATAGCGACGATGGCGCGGATCGGA 
GTGAGGAAGATGATGCCAGCCGCGCCTTGATTATTGATACCACGAGTGATGAGCAGAACAGCTGCTCATCTATTGAGATGTGCGATGAGACAGGC 
TTAAGAGCACATGCAATCAGATTGAATCCCGGCTTGTATACCAACTCAAGCTTTAACAATGCAGCGACCAGTACGCCCAATCTGGGCAACTCTCA 
CCAATCCCATCATCCGCACGCCAAAAAAACTGCCATAAACTTTGGTGAAATATCCCCACGCAAGCAATCGGCATTTAACAACTCGGGATTGAGTC 
CCTTATCGGCGGCCCCCAACAGTTCCCCTCAAGTCGACAAACGAAAGTCACCTACTAAGAAGATCCGATCTCCTCTTAATCAAACTCAGAGCAGA 
CAACGTGCCAATAATCAGCCAATGGTTGTTATCCATGAGTCTCAGCCTTCATCTTCGCCAGCCATTGAGTGTGCAAAGCGCTTGCGCACGCTGAT 
CGATCAGGAGAGCGTGCGGGATGCAGCAGATACAAAGGCGGGCATTCGAATTATTGCGCAAGCCAAGCAGCAACCTCCAAACCAATGCAGCTCTG 
AAGCTATTACGAAAGTCGAGGTAGTCAGCCTGGATGAAAGTGCCGAGAATATGCCTTTCTGTACAAATCTCAAAATGGACACAGATCAATCGGAG 
GAGGAGCTGGAGCCACTGCAATTAAGTGCTGGCAACACACCACAGCACGTGATCACAACACAATTCGATAGCGATGAGGACTGCAAGCTGGATGT 
TGTGGCCACGCTGGCGAACTGCACCACCACCGAAGATAGATCGGAATCGGCAGCGGCATTACCTGCCACCACATCGCACTCTGAGTCTGGTAGTT 
CCCACTCGGCACCTTGGACTCGTGAAGAGGACAAAGTAATCCTGATCGAAATGAAAATAGGGGCACGGGATCGAGAGCAGCTCATTCGGCGTATG 
CGCGCCAAGCTAAAGAATCGCAGTATAGAAGAACTACGTAGTAGGCATCAGTTTCTTATGGACTTTCTGTCCAAATTGCAGGGAAAGTGA 
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{SEQ ID NO: 1370) 
Start ATG : 1 

MPSSAKKSDSAKSNSSTPRRPRKRKRVDRQSLDSQVDEDAKEIDRRLRKVAKKNSISAEDMQKVVRKVVRNDHVLALVTLKAEDELAREKLESAE 
QQKRGAIIITDTPSVPKLTRAKARELNCTPGISLPQLNETSTDNNGIEALIREDLHSDEEDEEYTFKEEDFHSDDDPNTTASDFDSNPCTPQTPL 
TANEESPIKFSDDGCFKVPLDRNFQDKEDLRIATRTRSKFCLQQTTIEDLQSEFVPPDVEPSDVVENEMTANDADWMQFLNDFTKPLNNTVQGDD 
DDPINDPEYVAADHLESEYENAEELRDVNISKKELTDLVSELFAGLLEEGVSLESIELETPQKFVESANVSMIPIDQVSRNIDFDGQQINANQQE 
LSVFQPWLMPPPCEGNEMGMGVPVPVAVLQEETGGSACNVVNIPSEALTVPPELIAVPIPGQPNCFQIAKVVGNDSTVGLPLTEETLQNFSSLHP 
MEPPVVAASNGDRYKKPFDTNFTWEYISVRKHIYSEYEEKFESLRNVEPVNLNHLARSRGFTQSQHDMLQQQLRIHTQLLSQSYLQTYSHPTLYP 
MAKKPREMLEELHKMATKDPSFHCWNLQEAVQLVRKWEHDLSSDEFKEENEKMMDFINKETDLTDGHTRQVPRLAPRIM 

RMAFQPKLTQTTAYAPSEYQLIAIGLEKHLALMVKAKRQLRKGADPIRVACKRMIKDTICGKTARRI FLKIAELRNSKQYNPVKYFFEHRRAPPA 
NAILWGFAGGLVRTPRERHMELPSGWQYYIEKTWDQKRTRRNFKASKASSVASASYLEFVREAVGEDLLLPETFGQAAVPNGSAAGSEPPKESKP 
KKRPAQRRPRPAEWPPQLTINVNYVFGGNTSAGNIPGVSGVSVPLVAGGNLAENVLNINRSLYTAAECSSLLALPPAPPPLDENAPPAINFTVDK 
FTNTLQICELEESSKAVQPDLLPAQDTPSVPGRNAKRARYQIFKPKVQSANKSRKGSTPTRSRYHILRERLRHKLSQRCHSLITTYESYLRHHIE 
KYKDCELVQVVHNRFFALELYTQLLVDLRMFCSHTNSKTAAQPKRSRSDDEAAKLRRANRQEEMLRQMLQPDTTEERNRKDAIFAMNFYEKVEEA 
LLSANRLDDCKKFNSILQNFDPRQEKVSSLYLKVEEILLPTHPELAEVFLNFLLPAEAAEMGKFFEHFMICNATNFINKLNIYFSKQPAQIRKIY 
ACLNELAEMPNVSMKKVENKIMPLLKGNQFLIDWFVQQFPQGKPPKRILSPVETINLHDVQNNTTGEYTETIQEFTDVPTPSKPGCHLKYINGRI 
FYGAKILLPAKLSFMAASIYNEQSHDVPPVSSDSINCAHGIRAQGEKLLSMASNTDSDDGADRSEEDDASRALIIDTTSDEQNSCSSIEMCDETG 
LRAHAIRLNPGLYTNSSFNNAATSTPNLGNSHQSHHPHAKKTAINFGEISPRKQSAFNNSGLSPLSAAPNSSPQVDKRKSPTKKIRSPLNQTQSR 
QRANNQPMVVIHESQPSSSPAIECAKRLRTLIDQESVRDAADTKAGIRIIAQAKQQPPNQCSSEAITKVEVVSLDESAENMPFCTNLKMDTDQSE 
EELEPLQLSAGNTPQHVITTQFDSDEDCKLDVVATLANCTTTEDRSESAAALPATTSHSESGSSHSAPWTREEDKVILIEMKIGARDREQLIRRM 

RAKLKNRSIEELRSRHQFLMDFLSKLQGK* 
(SEQ ID NO: 1371) 

Name: putative DNA binding protein 
Classification: DNA_binding 



Celera Sequence No. : 142000013384591 

TGTTATCAATTCACTAATTACTAAGCAAACCAATGCCAAGCGTATCCTAAAATTAATGTTCTTTGCTCTACATTCACATACTCCTTGTTCGGCGC 

CGTTTATCTGCTATTAAAAAAACTGCCAATTCTATTTGCCATCTTTGTGCTCAATAATCCCAAGACCTTGAAAAGAGCCCTTCCATGCCAATTAA 

ATCAACTATTAATTGTTTATCCCTTCTGCCTCGGATGATAGATGATGCACGAACTGGCTTAACACCTGCACCGATATATAAAATTTGTGTGGATA 

AATGGTTAACTTAGCTAATTAAAAAAGTTCCTAACTTAAATCATTGTAGGCAGTATGTCATTATGTTTTCTCGGTTTTCTCTTGCTGTGTGGATT 

TGAAAATAATTGCTGACAGTGTGTTAATTTGTTATTTGAACACAATTGCTTGATTATGTCATAAAATACATAGATAACTGAGTTGCAAATTAAAT 

ACTGTGAGTGGGTTTCGTTAAATGTAAATATCATGTTCAACTTAGTCCATTAAAACTTTTTAAACTGTTGGAACGACTTGCAGCAAAGATTACCA 

TCCAAAAAAAATTCCGAATACACTCAACGGTAATCCTTCGCATTAATGGAACAATTCTAATATTCTAAAACTTTTGCGTGCTATATTTGCATGCA 

TGCTATGCGGCACACTGAGAAGCGTCTACACATTTAAAAAGGATTGGCAAAGCTTTTACAAAATTACGAAACTACGGTTACAAACTACTTTGCCA 

AAAAAAAAATTAGCAATACCATGTGCAATACTGATTTGAGCCCGATCAATCGATTTGGCCTGGCAGGTATATTCGAAAACTAGACGTCTCGAAAG 

CAAATCTAAAAACTAACCACTGTGCTCATGCCAAATGGCGCTCCTCCTCGCAGATGGCACTGCGCGTCGCCTCGCACTGGGCACAGTGCTTCTCC 

AGAAACTTCATGGACAAGTGGAACGTATACCGGTGCCGGGTGCAACGCCTTTACATCTTGTACTCGACGATTTTGTTGGGATTGAGGAACTGCTC 

CCGTTGCGACGCATCCAAATGGGCTACGGTGCGCGTCAGTGCCGAGCCCGAACCCGATCCCGATCCCGATCCCTGTCCCTGACCCTGACCCTGAC 

CCGATCCTGTTCCAGTTGCTGATCCCGAGGAGGACTTGTGCTTGCCATCGCCGCCGCCGCCACTGGTGCGTCCAGTATCCGTGCCAATGGGCTGG 

TAAACGCCCTCTGCACTGATCACACCCACATTGACGCCGGCGACCACTGGCTGTTGGAGATATCCGGAATACTGTGGCGGCAGCTGGTAATTGCC 

CGGCGGCTGCTGCTGCTGCTGTTGGTGGCTGCCCGCCCGCGGCTGCTCCGTGTGGGCAATGGGATACTGCTGCTGATGGGCGTAGTCCTGGCTGC 

TGCCCACAGCGCCAGCGCCAGCCACCGAGCCGCCAGCTGCTCCTCCGCCACCGGCTGATCCTCCAGCACCGCCACCGCCCATCCGGGAACCGGAA 

TCATCGTTCAGGCCCTCACCCATTTCACCATCCCGATTGCGCCGTCCGCGGATGCACAGTTTGGTGAAGAGCCAGACTCGTCCGCGGTAAAGGAG 

GTAACAACAGAAGTGAACGAAGAACGTAAAGATGATCGCACAGGCCGTCACTATGATCGCCTTGCCCGGCTTGGTCCAGTCCATCATGGGATAGA 

TAGCAGTTTGATTTTTCCTGCAAACACAATCGTCATTATCATCCAAATTTGATCAATCATTTTCAGTGCGTATGAAAATCACCTGTCTGTTCCGC 

CGGCCAAGAAATAGATTCCCGTGAATATGGCGTAGGCCAGGCCAATGCCAGTGGTAAAGTAGGCGTGGCTCATCTTTATCGGATGCCCCACAATG 

GCCAAATCGATGAGCATTATAATCGTGTTGAGCACATGGACCATTATGTTTAGCGCATCGATCTTATGGATTTCTGAAAGATAAAAGATGTATAA 

AAGATATGTTCTCTAATTAAATGGTTTCCCTTTTCTAAAGAACTCGATTAAAATGCACTTCGTTCTATTCATATATTTGGCCCTGCAGGGGAGGC 

TTTTTCACAGGTGTCCAATTTAGGAAATCCCTAATCCATGAACATCTCCCACTGGCTCTTAACGGATCTGGACAAAAACTTTTGATTTGCCGTCT 

GGCCAGCGTAGTAGAACTCAAAACCAACGTCATTGGCGGAGTCAAGAAAGATGTAGCAGGCGAGAAAGTCGAACAGGAGCATCGTGGAGCACAGC 

GGCCTATCCAGAATTGGCGGTAAATTCTCCCAATAGCCGGATATGACCTGCGCCTGGAACACAATCAATGGGCATATAACTTGCCAGAGCACAAA 

GTGGGTGGCTACTTCATAGGAGAATATCTGATCTGAAACGTGTACACAAACGGCGGCCATTAGGACGGAGGAGAGCACAACGAATATGAACCAAG 

TGAGGAACAAGTAGGCCATTGCAGGGCATTCGGGCAATAGATATGAGCCGACCAGCAGGAGCAGCGCCATCATGACAAGAATCTCCATGCTCAGC 

AGCAGGACGTGTCCCGTCCGCTGGTTTGAAAGGATACACACCGTTCCCAGTGCCAGACAGGCTGCTATGCAGCAGCTGACTATCCAGTTGAAGGG 

GAACTTCCGGCGCAGCCAAGTGACGTTGACATAGACGCACAGGGTGACCAATCCGATGACAAAGAAGCAATAGGTAGCCACTACATCCGGAACCA 

GGAGGTATCGTCTGAACAGGCCCGGCAAGATGAGCAGCCCAATGGCCAGGACAATGAACAGCGCCGCAATGCCGTAGATCCACTGGGCCACCTTG 

AGACGAAGGCGCACGACGCCATCGACTGGCTGATCATTCATCGATGTGGGGGATTTCCTCGTTTTCTGCTTGTTTCCCGGCGCCCGTGTCAAGTG 

TCAAAGGCGACAGTTCGGCCTGACAGCCAGTATGTTAAGTGTAACCAGATCGGCATCAGCCCCAAATTTCAGCGTGCGATCCGTTGTAACCTATA 

GCTCCACGAGCCACTTCGAGGCAAATCATAGTGTGAGTGATAGTCCGTTCTCGGAAGTGGATCGTTTTGTGGAGGCTGCCACCAGCATCAACCGT 

TTTCCAAAGAGGCGGAAACTCTCCTCCTGCATTTTCCGAACCTACGTCTGCCTGCTGTTGGTCATCCAGGTCATCCTGGCATCGCTGCAGTGGCT 

CGGATCCACGTACCGCTGGAAACCGCAGCTGAGTGCGGCCAACCGCGGCCTGTCCGTCCTGCTACTCCTGATTTCGTGGATCAACCTGACGCTGG 

CCTTCATGGGCTTCCGGCGGCTGCAGTTCACCTTCCCGCTCAATTGGATTGTCTTTGGGTGCATCTTCGAGAGCCTCACGCTGCTGGTGGTATGT 

CTGCACATTCTGGAACAGGACCTAACGTGGCCGTTTGTGCTAATTGGCATTGGCGTGCTGGTTGTCTATACGCTGCTGGGGCTGTGGGTACCTGG 

ATTCCTCACAGCCAACTTGTGGATCCTAATCCTGGCAAGCATTGTGGTCTTCCTGGTCTCGACAGTGGCCTTGAGCGTTCGCCTGCAGATGCGAT 

ACTACGTGCCCGCTAGCGTCTGTTTGGTTTTCTTCGGACCCTGGACCATGTACAACTCCCAGAAGTTGTTCCTGCACCACAAGCGGTCTGGTTAC 

GTAGCTCACCAGTATTTGGAGGCGTCCGCCAAGATGTTCATCAATTATGCCTTCACTGTGAGTGGCATTATATTTGCCCACCGCTTCTCGGTGGA 

CACATTGGATGCCCACAGTTAGGATATGGTGATTTCTTGGCTGCCATTTTTGTGGTTTAGATTTTTTATAAGTCTTTCTAAGTGCTGTGCA 
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{SEQ ID NO: 1372) 



Exon: 2891.. 2210 
Exon: 1973.. 1793 
Exon: 1727. .1001 

Start ATG : 2891 (Reverse strand: CAT) 
Transcript No. : CT35813 

ATGAATGATCAGCCAGTCGATGGCGTCGTGCGCCTTCGTCTCAAGGTGGCCCAGTGGATCTACGGCATTGCGGCGCTGTTCATTGTCCTGGCCAT 
TGGGCTGCTCATCTTGCCGGGCCTGTTCAGACGATACCTCCTGGTTCCGGATGTAGTGGCTACCTATTGCTTCTTTGTCATCGGATTGGTCACCC 
TGTGCGTCTATGTCAACGTCACTTGGCTGCGCCGGAAGTTCCCCTTCAACTGGATAGTCAGCTGCTGCATAGCAGCCTGTCTGGCACTGGGAACG 
GTGTGTATCCTTTCAAACCAGCGGACGGGACACGTCCTGCTGCTGAGCATGGAGATTCTTGTCATGATGGCGCTGCTCCTGCTGGTCGGCTCATA 
TCTATTGCCCGAATGCCCTGCAATGGCCTACTTGTTCCTCACTTGGTTCATATTCGTTGTGCTCTCCTCCGTCCTAATGGCCGCCGTTTGTGTAC 
ACGTTTCAGATCAGATATTCTCCTATGAAGTAGCCACCCACTTTGTGCTCTGGCAAGTTATATGCCCATTGATTGTGTTCCAGGCGCAGGTCATA 
TCCGGCTATTGGGAGAATTTACCGCCAATTCTGGATAGGCCGCTGTGCTCCACGATGCTCCTGTTCGACTTTCTCGCCTGCTACATCTTTCTTGA 
CTCCGCCAATGACGTTGAAATCCATAAGATCGATGCGCTAAACATAATGGTCCATGTGCTCAACACGATTATAATGCTCATCGATTTGGCCATTG 
TGGGGCATCCGATAAAGATGAGCCACGCCTACTTTACCACTGGCATTGGCCTGGCCTACGCCATATTCACGGGAATCTATTTCTTGGCCGGCGGA 
ACAGACAGGAAAAATCAAACTGCTATCTATCCCATGATGGACTGGACCAAGCCGGGCAAGGCGATCATAGTGACGGCCTGTGCGATCATCTTTAC 
GTTCTTCGTTCACTTCTGTTGTTACCTCCTTTACCGCGGACGAGTCTGGCTCTTCACCAAACTGTGCATCCGCGGACGGCGCAATCGGGATGGTG 
AAATGGGTGAGGGCCTGAACGATGATTCCGGTTCCCGGATGGGCGGTGGCGGTGCTGGAGGATCAGCCGGTGGCGGAGGAGCAGCTGGCGGCTCG 
GTGGCTGGCGCTGGCGCTGTGGGCAGCAGCCAGGACTACGCCCATCAGCAGCAGTATCCCATTGCCCACACGGAGCAGCCGCGGGCGGGCAGCCA 
CCAACAGCAGCAGCAGCAGCCGCCGGGCAATTACCAGCTGCCGCCACAGTATTCCGGATATCTCCAACAGCCAGTGGTCGCCGGCGTCAATGTGG 
GTGTGATCAGTGCAGAGGGCGTTTACCAGCCCATTGGCACGGATACTGGACGCACCAGTGGCGGCGGCGGCGATGGCAAGCACAAGTCCTCCTCG 
GGATCAGCAACTGGAACAGGATCGGGTCAGGGTCAGGGTCAGGGACAGGGATCGGGATCGGGATCGGGTTCGGGCTCGGCACTGACGCGCACCGT 
AGCCCATTTGGATGCGTCGCAACGGGAGCAGTTCCTCAATCCCAACAAAATCGTCGAGTACAAGATGTAA 
{ SEQ ID NO: 1373) 

Start ATG: 1 (Reverse strand: CAT) 

MN DQ P V DGV VRL RL K VAQW I Y G I AAL FI VL A IGLLILPGL FRRY LL V P D V V AT Y C FF V I GL VT L C V Y VN V TWL RRKF P FNW I V S C C I AAC L ALG T 
VCILSNQRTGHVLLLSMEILVMMALLLLVGSYLLPECPAMAYLFLTWFIFVVLSSVLMAAVCVHVSDQI 

SGYWENLPPILDRPLCSTMLLFDFLACYIFLDSANDVEIHKIDALNIMVHVLNTIIMLIDLAIVGHPIKMSHAYFTTGIGLAYAIFTGIYFLAGG 
TDRKNQTAIYPI^DWTKPGKAIIVTACAIIFTFFVHFCCYLLYRGRVWLFTKLCIRGRRNRDGEMGEGLNDDSGSRMGGGGAGGSAGGGGAAGGS 
VAGAGAVGSSQDYAHQQQYPIAHTEQPRAGSHQQQQQQPPGNYQLPPQYSGYLQQPVVAGVNVGVISAEGVYQPIGTDTGRTSGGGGDGKHKSSS 
GSATGTGSGQGQGQGQGSGSGSGSGSALTRTVAHLDASQREQFLNPNKIVEYKM* 
(SEQ ID NO: 1374) 

Classification: receptor 



Celera Sequence No. : 142000013384768 

GAAATGAGTTTTGTTTTGTTTTGTTTCTTCCGAGTGTTTGTGTGTTTATGTGTGTGGGGAGGGGGTGTATTCTGTGTGCGGCGTGTTTTGAGTGA 
GAGAGATAGAGAGAGAGCGCGAGTTGCTGCGGGTCGACGGTCGAAGGGGGTGGTGGTGGAGTTGGTGGTGGGGGGCTTGGTTTATACGACAAGTT 
TTGATTTATTATTAAATTTAAATACCTATGTATATAAATTACGTTTCTGTTTATTATAATTATTGTTGTTTCCATCTTAATCATTTTGTTTTCTT 
TCGCTTTTTATTTTTCATATTTTTTTTTTGTTTTTTGTATGTAGTGGGCAACTGCGGCATTTGTAAATTTCTTTATATTTAAATGTTCCTTTTGA 
CTGTGTAAAAATATCGATTTCCATCTGATCTTGTTTTGTAGTTACTATTCATGTATGCATTTGCTTTTCGTTTTAGTTAAATTACGTTAACGCTG 
CATTCTGAACTCTCTGTTGCTGCTGATTCTGCTGCTTGTTCTTCGGATTCTTGATCGGATTCTTCTGATTCTTCATCTTGGCGCGCTTTCTATGC 
TATTGAAATGTTGTTTGTGGTGCTGCTTACTGCCTTCTGGATCGGGATCCTATATTCCGCTCTGATCTGATCGGTTATGTTCATGTTATGTCGGG 
TTGTGTGTGTCTGTGTTGTGTGTATGAGAGTGGAGTGTGTTGGGTGTTCGCATCTCGTTATATGTTGTATATATATATATATATATGTGTATATA 
TATATATATATAATATATTATATGTATGTATAGTTAGTTGCTTAATAGTAGAATTTCTTGAATGTTGTTGTGGTAAAGTGTAGGTAATGTTGCTC 
TATGTCTCTATGTCGACTCAACTGCACTACATACATATGTCTTTCCACCGGATACGCGGCGAACCGAAACGTGCCGAGCCGAACCGATCCCCGTT 
TCTGACCCGCATCCGCATCCGTATCCGTATTTCGTTTCCTCACCTCCGCCTCATTCGCTGCGCAGTTTCTTTAGATCCGGTGACTTCTCCCCCAC 
GCCCACACCCACTCCGCCGGCGTTCTTCAGCTCCATATCGCAGTGCAGGGCAGTACGCTTCTGTCCAATGGCTGGTGCCGAGTCTGCACCCGCTG 
CTACCGGTGAAACCGCATCCACCGCCTTGTTCGCCGCCAGCTGCTGCAGTTTACTGTCCCTCACAATGCCGTTGCTGCTATTGCTCTCGTCAGCC 
TCTAGCTCCGCCTCATTATCCTCCTCCTCGTCCAGCTCATCTTCCTCCTCGTCCTCCGCCTCATCCTCATCCTCGTCGCCCAGGACGCCGTTCCC 
GTTACCATTTCCATTGGCTGGTGCTGCAGTCGCTTTGCTGGCTGGCTGTGGTGCTGCTGTAACCGCTTCCTCTTCTGCCAAAGGAGCAGGCTGAC 
CGTTGTTGGTGATCGAATTGTTAGTTTTGTGGGGCTGGCTCGGCGACTGCTTTTGATTCTGATTCTGGCTCGAGTGCTGTGTGGGAGTCAGGATG 
GGCAGGCTGTTGGTCTTGGGCGGCGCCTCCTGATTGCTTTCAGAGTCCGAGGCGAGCGGTGTTCCACTCGAACCGATGCCAATGCCAATGTCTAG 
GGCGATGGGCTCACCGTCCTCTGTGGAAGTGGTGGCCGTTGTCTGTGTGGTGCCATCGTTGTCATCGATGATCGGCTCCTGGGAGGTGGAATCGT 
TCGAAACAAAGTTGGAGTCACCGTCCTGCAGGGCGGCGCCAACGGCAGCTGCGTTGTTCTTGGCCTCGCCCGACTGTAGAGGCGTCAGAGCAGCC 
GGGTTGGTGTTCTCATTCTCTGTCTCCGATTCTGTAGCAGTGTTGCGACCGCCTGGGGGATAGAAAGCATTGGATTAGCCATCAATGGTCAATTT 
GTTAACTCGGCGAACTGCGGTCAGAGTAATCAAACAAAGAAACTCTGTGTTTGAAATTATTAGACATCCTAACACTTTTGAAAAGCTGCCAACAA 
GTATAAGCTACTGATAATATGTTAAAAGTTAACTATTTTACAGTAGTAACTTTACAATTCCTCCTTTTTATTTAACATGACGCCACGAGTAACGA 
ATAAATTGTAAATTGTAAATTGACCACTAGGCTACTACAAACATAATGTTTAATGGAAAATTCCGGATAGGAAAAATACTTGGTTTCTGGAAAAG 
ACCACTCGGTTTGAGTGTTTTGTTGCAAGGGTGTATACTCAAGCCGCAGTTATCGGATCATCTTAGCTGCTTACCTGCTCCTTGGCTTTCGTCGT 
CCAAATCAACACCCACACCATCGTTGGTATCGTCGCCATCGAGCACCTTGCTGCTGTTGCCGTTTGTGCTGAGTGAACCGGTGCCAGTGGAGGCG 
CCGGATCCAGGCGGCGGCATATCGTCGAGCATCTCGGGACTGAGATCATTGGCCACATAGTGTGACCAGAGGGCCAGCTCCACGCGATGAGGCGA 
CCAGTGCGGCGTATCCCCGCCCACCTCCGCATTGAGGCGCTCCACGGTGGCCTGAATGTGATTGACGAAGTTGAGGTACTCCTTGGTGGTGTAAT 
CGATGCCCTCGATCTCTGGTATGGCCATCAGGCACTCGTCGGCCATGAATGGTGCCGAATCGGGAGCTGCGGCTGCCAGCAGTGCACTGGCCATT 
GTGGTGCCAACGCCCTTGAGGTTCGATAAAGCTGTGATCGCCTGCTCCAGATTGGGCAGCTTGCGGAAGGCCTTCTTTGTCTCCTGGATGACGGC 
GCGCGGTGTGTTGACCTTGACCAGGTAGGATAGCTGCGGATAGAATTTGCCGCGCGACTGCTTCCACTTCATCGACTGGACGAGCTCATCGTATA 
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CCATATGCGCGTCCTTGCCGCGTGCCTTAATCAATTTGGGCAGTTCATTCTGATACCACTGATCCAGGCGGATCAGCTCTTGCGGCTTCTTGCAG 
CGCTTCTCGGCCTTTAGCTTAAGAACCTGGGGATAGAGCTGGTAGCAGTACTCGAACTGTTTGGTGCTCCCGGTCTCGAAGAACGAGACCGTCGC 
CTTGCCGTTGGACATCTTCTGCGGATGCGGATGTGTCGTTGTCGCCTGCTAATGATTGGTGGTATCGCTGTATCTCCTTATCGTGGTATCTATCG 
CCTTGGGGCCCAGGACGTTGGCTGATCCGCAAGTGACCGGCTGGCTATATATGCGGCAATATATGTACTTTGGTGTAAGCTACTATACGCCAAAT 
CAATTTATTGCTTACTGTTTAGTCTCCTGCTTTTCCTGTCCGTTCCGAGCTGCTCCTGTCAACGTACTGCTATTCGTGCGTTGAGAGTTCCCGGC 
GAGAAAGAAGCGAGTACGTGAGAGACAGAGAGAATGCGAGTGCGAGAGAGGGAGCACGAAGAAGCAGCAACGCAAAGGAGAGTGGATGAAAAGAG 
AAGAAAAGAGGCACACAATTAGGTTTACAGTTACGATTACATATGCAAGGGGGATCGGTGCGGTGGCATAGGGAGGTAGTTCGATTGATTACTGT 
TAATGCTGCTGCTGCTGCGGCCGTGTCTCTGTTGTTTGTTGTATTTACTCAATCCGATAATCGCTCACTATTTGCCAAAAATCTGCTCGCTTCCG 
CCGCCGTTTGCACAGCACACAAATCGAATTTTTACAGTGCTGTCGTCGGGTTCTTTTTAGTCCGCCATGCACTCACACAGTCAAACTGACAGGCA 

C AC GG AT GG AC AC AC A C AC AC T T AC T C AG C AC ACT C GC T C T C GC AC AC C C AAAG AGAG C C AGGC GC T C AT AC AC AGGC AAAAAC AC GC AC AC AAC 

ACACACACTGACTTAAACACGCACACACTTATATACTGAGGCAACAACAGCAGCAGCAGAAAATTTCTTTTTCATCTAGGTTTTCAAAAAAAATA 

CGAACGCGAGCATTAAATAAACGAATGCTACTATTTTTCGCTGTTGTTGTTGTTTGTGCTTTTTTCGTACCTCTTGTCTTTTTGGCAAACGGTGA 

GAACAGCTTTTCACTAGCGCACTGCACGCACACAGATGAGTGGGTGAACAACAAATACCTTTTAGTTAGTCGCCGCCAGTCTTCCTTTCCGTCCA 

CATCCAAACAACTACTCCAATGCTTTGCTGTTTTTCTGTGTTGCCTCTGCTTTGCTTTGCTTCGCTTCGCCTCTGTGCTTGTGCTGTGCTGCGTG 

CTTCGTGCTGTGCCCGTGCTGTGCTGCTTCTTCTTCTTCCTCTGCTGTTTCAGTTTCTGCTTTCTGCCTGCAGGGGCGCCGTTGACAACTGACAA 

ATAAAAAGGGGTTACTCTGTTCTCGAAATGAGCACGCCACCGAAGGCACAACAAAAAATTAAAAAGGATCGGAGCGGAGCGATGCAGCTGTGCGA 

TGTGCTGCTAACTCGTGTGACGGGAATTGGGAATTGGGATTGCGAATGGGAATGGGAGGGAATACGAGAAGCGAATATGGGAATGACGAACGACG 

ACGAACCGGGAACAGGAGTTGTCGTTCTATCTCGCCCGCACTCGGTGCCCGAATGTATCTGTATCTCTCTGGCACCCTCGGCGTTCCATTTCATT 

TTGCCGGCGACGTCCGTGCTCTCTGCCGCCGCTCTGCCATCGCCTCTGCCGCTGCGTTTATGCGCAGTGCGTCATTACACGAATACCTGTTTAAA 

GCGGAGGCATAAACGCGGCTTTGXCGATCAACATCCATGTTACCGAAACCCATGTTCGAGTTATAGAAACTTCGAGTAACAAATATCTATATTCT 

AGCAATGTTTACCCTTATTAGCTTAGTTAGTGCAATTATGTTTCATGTTTTTACAAACAGCAGAATGTTTTTAACAATAGTTATCTCTATATTCA 

GGTCCTTCAATTAAATCAAAAAGGAATGGGTTTTGTGAGTGGGTTGGGCTCCAGTGAGACTTGTTAATAATTTAAGAATTCGCGATATTCTAGTG 

TAAACAGCGCGTCACTGCATGTTCGCTGCTTTTGTTGTGCCAGTTGCCAGCGACAGCGGCGTCGT 

(SEQ ID NO: 1375) 

Exon: 4005.. 3966 
Exon: 3299.. 2260 
Exon: 1857.. 1001 

Start ATG: 3055 (Reverse strand: CAT) 
Transcript No. : CT43088 

AGTGAAAAGCTGTTCTCACCGTTTGCCAAAAAGACAAGAGCAGTACGTTGACAGGAGCAGCTCGGAACGGACAGGAAAAGCAGGAGACTAAACAG 
TAAGCAATAAATTGATTTGGCGTATAGTAGCTTACACCAAAGTACATATATTGCCGCATATATAGCCAGCCGGTCACTTGCGGATCAGCCAACGT 
CCTGGGCCCCAAGGCGATAGATACCACGATAAGGAGATACAGCGATACCACCAATCATTAGCAGGCGACAACGACACATCCGCATCCGCAGAAGA 
TGTCCAACGGCAAGGCGACGGTCTCGTTCTTCGAGACCGGGAGCACCAAACAGTTCGAGTACTGCTACCAGCTCTATCCCCAGGTTCTTAAGCTA 
AAGGCCGAGAAGCGCTGCAAGAAGCCGCAAGAGCTGATCCGCCTGGATCAGTGGTATCAGAATGAACTGCCCAAATTGATTAAGGCACGCGGCAA 
GGACGCGCATATGGTATACGATGAGCTCGTCCAGTCGATGAAGTGGAAGCAGTCGCGCGGCAAATTCTATCCGCAGCTATCCTACCTGGTCAAGG 
TCAACACACCGCGCGCCGTCATCCAGGAGACAAAGAAGGCCTTCCGCAAGCTGCCCAATCTGGAGCAGGCGATCACAGCTTTATCGAACCTCAAG 
GGCGTTGGCACCACAATGGCCAGTGCACTGCTGGCAGCCGCAGCTCCCGATTCGGCACCATTCATGGCCGACGAGTGCCTGATGGCCATACCAGA 
GATCGAGGGCATCGATTACACCACCAAGGAGTACCTCAACTTCGTCAATCACATTCAGGCCACCGTGGAGCGCCTCAATGCGGAGGTGGGCGGGG 
ATACGCCGCACTGGTCGCCTCATCGCGTGGAGCTGGCCCTCTGGTCACACTATGTGGCCAATGATCTCAGTCCCGAGATGCTCGACGATATGCCG 
CCGCCTGGATCCGGCGCCTCCACTGGCACCGGTTCACTCAGCACAAACGGCAACAGCAGCAAGGTGCTCGATGGCGACGATACCAACGATGGTGT 
GGGTGTTGATTTGGACGACGAAAGCCAAGGAGCAGGCGGTCGCAACACTGCTACAGAATCGGAGACAGAGAATGAGAACACCAACCCGGCTGCTC 
TGACGCCTCTACAGTCGGGCGAGGCCAAGAACAACGCAGCTGCCGTTGGCGCCGCCCTGCAGGACGGTGACTCCAACTTTGTTTCGAACGATTCC 
ACCTCCCAGGAGCCGATCATCGATGACAACGATGGCACCACACAGACAACGGCCACCACTTCCACAGAGGACGGTGAGCCCATCGCCCTAGACAT 
TGGCATTGGCATCGGTTCGAGTGGAACACCGCTCGCCTCGGACTCTGAAAGCAATCAGGAGGCGCCGCCCAAGACCAACAGCCTGCCCATCCTGA 
CTCCCACACAGCACTCGAGCCAGAATCAGAATCAAAAGCAGTCGCCGAGCCAGCCCCACAAAACTAACAATTCGATCACCAACAACGGTCAGCCT 
GCTCCTTTGGCAGAAGAGGAAGCGGTTACAGCAGCACCACAGCCAGCCAGCAAAGCGACTGCAGCACCAGCCAATGGAAATGGTAACGGGAACGG 
CGTCCTGGGCGACGAGGATGAGGATGAGGCGGAGGACGAGGAGGAAGATGAGCTGGACGAGGAGGAGGATAATGAGGCGGAGCTAGAGGCTGACG 
AGAGCAATAGCAGCAACGGCATTGTGAGGGACAGTAAACTGCAGCAGCTGGCGGCGAACAAGGCGGTGGATGCGGTTTCACCGGTAGCAGCGGGT 
GCAGACTCGGCACCAGCCATTGGACAGAAGCGTACTGCCCTGCACTGCGATATGGAGCTGAAGAACGCCGGCGGAGTGGGTGTGGGCGTGGGGGA 

GAAGT C AC C GG AT C T AAAG AAAC T GC G C AG C G AAT G A 
<SEQ ID NO: 1376) 

Start ATG: 285 {Reverse strand: CAT) 

MSNGKATVSFFETGSTKQFEYCYQLYPQVLKLKAEKRCKKPQELIRLDQWYQNELPKLIKARGKDAHMVYDELVQSMKWKQSRGKFYPQLSYLVK 

VNTPPAVIQETKKAFRKLPNLEQAITALSNLKGVGTTMASALLAAAAPDSAPFMADECLMAIPEIEGIDYTTKEYLNFVNHIQATVERLNAEVGG 

DTPHWSPHRVELALWSHYVANDLSPEMLDDMPPPGSGASTGTGSLSTNGNSSKVLDGDDTNDGVGVDLDDESQGAGGRNTATESETENENTNPAA 

LTPLQSGEAKNNAAAVGAALQDGDSNFVSNDSTSQEPIIDDNDGTTQTTATTSTEDGEPIALDIGIGIGSSGTPLASDSESNQEAPPKTNSLPIL 

TPTQHSSQNQNQKQSPSQPHKTNNSITNNGQPAPLAEEEAVTAAPQPASKATAAPANGNGNGNGVLGDEDEDEAEDEEEDELDEEEDNEAELEAD 

ESNSSNGIVRDSKLQQLAANKAVDAVSPVAAGADSAPAIGQKRTALHCDMELKNAGGVGVGVGEKSPDLKKLRSE* 

(SEQ ID NO: 1377) 



Celera Sequence No. : 142000013384528 

TGCAAGCGATGCAGCGCTCCTCTCCACTGGGATAACGACGAAGGGCATGTTCTCCGCGGAATCGCGGGCTCAGTGGTCCCTTTTCAAAGGGATAG 
TTGATAGTGGCCGGTTCCTTGAAGATGTGCGCCAGGGTGACGGCAAATCCACGTAAGAGTTCACCGAAGAACATGGTGGATGCAGCACGGTCCGT 
GATGTCGGCAAACTCCATGCTCAGTTCCTTGTTGTTTACATACACATAGCCCTTGGGAACCTCGACGATATCTTTGGGTTCTGCTCGCTGGGCGG 
CCAGCAGCCGGGCTCCATGGCTGCCAAACAGGCGCTGTCCTGTGGGGAGAACATTTTACCATTAAATAACATGCTTAAGTCGAAAAGTTATACTG 
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AATTAAGCGAGTTTAGACTTTTGTTTTGGTATGACATAAGCTTACATAACCTCACTCGCCGCGATTTCCACGAAAATGGGAAATTTCCCGGAAAA 
CAACGGATGTGAGGGGATGGGTCGCCACCTACCGTTGCGCGATGCGGTGAAAATTCGCATAGTTAGCGACATTTTTGCGACCAAATTAAACTAAA 
ATTGCCAAAGAAGTTGTTTTGCTATTGAATTTGCCCGAACAAATTAGAGCTGGGTTTCTTTCGATTACCCACATAAATTCAATGCTCAAATATCG 
TATTCAAATATATCGTAGTTTCTTAATGACATAAACCAAAAATAAAATATGAGATATAAAATATAAATAATGTAACAAAGAATTAAATATTCGTG 
TAAAAATMTAAATMTATTAACAAATMTCCAGAMGACTGTGAGAAAAAAAAGAAAACATTTTTTTTATA 

AAAGCTTCGATAATGGCACTATCGATATAAATTCGCACTTTTACACTTTTCAACATTGACTTTGAAAATCCCCACCACTACAATTTACAGTTTAA 

CTTTTCCCGCTACGCGGAATTGTGAAATTTTTCATTGTTTATGTCGGAAAATGGATCAGGAGGTTATGGATTTCTTTGACTTCTCGAAGGAGTTG 

AAGCGCGTGGCCGCCGCCCCGCAAGGCTACATCAGCAGCGATCCGAGGCTAATGGCCACGAAGTTCAAATCATCCGAAGTGCCCAACCGCGAGTT 

GATCGGCACCGACTATGTCAGCAAAATCGTCGCCAAGGAAAAGTGTCTCCTGAACGGGGTGAGAACTGGAAGCTTGCGTCAGCACTTCTGTTTGT 

TTATGTGTAATTCGATCCAGACGCTGCTCAACGAACAGCCTCAAGGCAAACGCATTCGAACGCTGGACGATTTGGACACCGACGATGAGGGGGAA 

GAGACCGAAATTCGCCGGGATGACGAGTACTACAAGAAGTTCAGATTCAATCTGAATCGCGACAAGAACTTGTCCATCTACGCCAAGCGCGAGGA 

AATATTGGCGGCCATCAATGCTCATCCGGTAGTCATTATTAAGGGCGAAACAGGATGTGGCAAGACCACGCAGGTGAGCAAAACTTAAATAAAAT 

CCTAACTTTTTTATTAAGTCTATATTTTAATAAAGCCAACGCAAGCGAATTATAATTAAACCTTTTATCCTCGACTTTTGAAATTCAAATCCAAA 

CTAGTCGAGAGTATTTAATATTATATAATAAATAATAATCTTCAATATACTTTTCAACTCTTTATATAAGAATAGCAAAGCCTAGGAAACTTATA 

TCCTATATAGTTATATATCCTATCCTACATACTTAAATACAATGAATATTTTATGTGCAACAGATAAACAAGATCACATCGCGCAAAAATGCTAG 

TTATTTAATTATGTAAAAGTTTTATAAACTAACTCAAGTATCCAAAATAGCTTTGAGTTCTTCTTCCGTGACCTCCTCGCTTTCACCCACTTCCA 

TTTCCTCGTCGTCACTAAACTCAATCATCTCATACTCTTGCTGCACGTCTGCTGGAAGTTCCCTCTGAAAAACGGTGGCCGAGGACATGCGGCGT 

TGATAAGTCTCGTTGACCTCCAGCTGCACTTCTAGTTCATCCGCCTGGTTTTGCGGCTCCCGTTCCTCGACGCCGTACATGGCCTTGTTCCGCAC 

CCTCTGGCGCACTTCCAACTGGTTGCCTGTAGATGTTGTGGTACAATTCTCATCTGGAGTACTCTTTCTAGCTCCTAGAACACTTTGGATATCGG 

CAAAGACTTCTCGCAAATGGCCGAATATTTGAGAGGGCGGCAAAGCGTGCTCATGACCAGCCGCCAATTGCTTCTTTTGGGCATTGTATGCCGCT 

TCCAGTTCACACAATGGCTGGCTTGCCTGGTCCAGTGCTCGCAGACCCTCCAGTTCGTATGTGAGACTGACGCCGTTCGCACGCTGTTGCTTCTG 

CATCAAGGCACTCTGCCTCTGTTCCTTGGCACCCGCTACGGTCTCCACACGGTCGTAGTCCACCAGATTGTCCAACCCCTGGCAATAGTCGAGCG 

CGGTGAAGCGGAAGGCCTGCTCCTGGGTCAGGCGCCACAGCATGTAGGCGATCTGATGAGTGTCCTTCCGCTCCGGACTATCCTTGCGCAGATCT 

AGAGCGTAGTCTGTTAGTTCTTGCCAAGTGCGCGGTGAGACCTCGATCTTAATAAAGTTGTGCGTGGGCTGCTTGTAGTAGATTACGTAGAGCAG 

AAAG AAAC C TCC G AT C C T T T GAG CGC G AG AT GC C GGG AAAAC GT C C CC GGT G GT G C GT C GGG AGC AC G AC AGT C G CTT GGC C AC AT G C AG GGC CG 

CCAGTGTGGTGGCTATCACCTCTGTGTGGTTCGTCTGGGCAGTGAAGAGGTGCTGCAGCTGCAGTTCTCGCCAGCACCGGCAGAACACCTCGAAC 

TCGCAGTTTTCGCCATCATTAACCAATCGCTGAAATCGTTGCACCAGCTCCCAGCAGTCGTCAAAGATATTCAGCTCCATTCTGAATCTTAATTA 

CGCGAAATATATGCGCATCTCTCATTCAAAAGAGCCTCAAAATATAAACAATAACATTCGCCGGGAATAAGCTTTTACAACACTAGCCGGGGAGT 

TCATCGATAAATATGTAGTGTTCATAACTGCTTAGCTATTTTCAAAAAAAAAGCGCAAAACTATAAAAACTGTTTGAAAAATGTGTAAAAAATAC 

AATTACATTTTTGTTCTGCTTTGTAAATATTTAGATATTTTCTTATAATTATTATTAGTATTCAATCAATTATTCTTACAGGTGCCCCAATACAT 

TTTGGATGAGGCCTATAAAAGCGGTAAATACTGCAACATTGTGGTGACACAACCCCGGCGCATAGCTGCCATCTCCATAGCGAATAGAGTTTGCC 

AGGAGCGCGAGTGGCAGCAGAACACGGTGTGCAGCTTCCAGGTGGGCCTGCACCGCCCGAACTCTTTGGAGGACACTCGTCTGTTGTACTGCACC 

ACCGGGGTGCTGCT G AAC AAT T T GAT T AAC AAC AAG ACG T T G AC C C AC T AC AC C C AC AT T GT GC T GG AC G AGGT GC AT G AAAG AG AC C AGAAC AT 

GGATTTTCTTCTGATCGTGGTGCGTCGCTTGCTGGCCACGAACTCGCGGCATGTAAAGATTATCCTGATGTCGGCCACCATCGATGCCAAGGAAC 

T GT CC GAC T ACT TT AC C AC C AC T AAC TC AAT T C C GC C AGT G AT C AC C AC C AAT CAT C G AC GT AAGCAC T C GAT C GAG AAGT T C T AC C GT G AC C AG 

CTCGGCAGCATTATTTGGAATGAGGAAGATGTGGGACACCAGCAGGTGCCAGAAATCAACAAACACGGTTACAGGGCAGCGGTAAAGATCATAGT 

GATTATTGACAACATGGAACGGAAGGCTGCAATCCAGTCGCGGCAGAGCTATGACGAGGCCCTGCGCTATGGAGCTGTGCTCATCTTTCTGCCAG 

GCATCTACGAAATCGACACGATGGCAGAAAATCTTACTTGTATGCTGGAAAACGAGTAAGCATTGTTATCCGTGAGAATTGCTCACTGTACAAAT 

TTTGAAGTAGCTAAAATTGTATCATTATAATTTCCAATCTTTATTGTTTAATTGCAGCCCCAACATTAAGGTCTCTATTGTTCGCTGTTTTTCGT 

T AAT GAC AC C GG AAAAC C AAC GGG AT GT CTT C AAT CCCCCGCCTCCCGGTTTCC GC AAG AT CAT T C T G AC C AC AAAC AT T GC T GAG AG C T C CATC 

ACGGTGCCAGATGTCTCATACGTTATTGACTTTTGTCTTGCCAAAGTAAAGGTAACCGATACGGCTAGCAGCTTCTCTTCGCTCCGCTTAACCTG 

GGCGTCCAAAGCCAATTGTCGTCAGCGTGCCGGCCGCGTTGGTCGTCTTCGTAGTGGACGCGTTTACCGTATGGTTAATAAACACTTCTATCAGA 

GGGAAATGCCCGAGTTTGGAATACCAGAAATGTTGCGCTTGCCCTTGCAAAACTCGGTGCTCAAGGCCAAAGTGCTGAATATGGGATCACCGGTG 

GAAATCTTGGCATTGGCTCTATCTCCGCCGAATCTGTCGGACATACACAATACAATATTGCTTCTGAAAGAGGTGGGTGCTCTGTACTTAACAGT 

AGACGGCATCTACGACCCGTTGGATGGCGATCTTACCTACTGGGGCACAATCATGTCTCGGCTGCCGTTGGACACACGTCAGTCCCGGCTTATCA 

TATTGGGTTATATTTTTAACATGCTCGAAGAAGCTATTATTATAGGTAAGCGGGCAACAGTAGTTAAGCTTTAGGATAACGGGGGGAATATTCTT 

TGTCTTGCATTAATGATTTTTTTTTTAAATATGTTGTAGCTGCTGGCTTATCGACGCCTGGCTTGTTTGCGCACGAAGGTGGTAGGTCTCAATTA 

GGCGATTCATTTTGGATGCACTATATCTTCTCCGATGGATCTGGCTCCGACTTGGTGGCCATTTGGCGCGTGTATCTGGTAAGTTATGCGATGCA 

TTACTAATAGCGTAATAACTTGTTTCTTCGGCAGACATATTTGAACATTGTGGAAAATGGTCACGACCAGGAGTCTGCTATCCGTTGGGCCAAAC 

GTTTCCACGTCTCCTTGCGGTCACTTAAGGAGATACACTTGTTGGTGCAGGAACTGCGTGTGCGATGCACGCATCTCGGTCTGATTCCGTTTCCT 

GTTAATCCTAACCAAATGATGGACGATCGTGAAAAAGCGATCATGCTCAAAGTGATCATTGCAGGTGCCTTCTACCCAAACTACTTTACGCGTTC 

CAAAGAGTCGTGTGCAGACACAGATCGCAATATATACCAGACAATATCGGGCCACGATCCCTGCCGCACCGTATATTTTACCAATTTTAAGCCTG 

CCTACATGGGCGAGTTGTACACTAGGCGCATTAAGGAGCTATTTCAGGAGGTCAGGATTCCGCCGGAAAACATGGACGTTACTTTTCAAGAGGGT 

TCGCAAAAGGTCTTTGTCACTTTCAAACAGGATGACTGGATTGAAGGTTCATCTAAGTATGTTCCTGTGTCGGGCAGAGTTCAAAGTGAGGTGTA 

CAAGGCAGTTATGATGCGACAAAATCGTGTCGAACGGCCAATCCACATAATGAAGTGAGTTATCAAGGCTTTTACTATCGATTATGAAGCTTATG 

TTTAATCATTTACAGTCCGAGTGCATTTATGAGCTACGTACAGCAGCGAGGGATTGGAGATGTGATTGAGGGCCGTTGGATTCCACCCACAAAGC 

CTTTAAACGTCGAGCTTCTTGCGCTGCCATCTGTCTTTGACAAAACTATTTCAGGGTCGATTACTTGCGTAAGCTGCTGCACCACACAAATCTAG 

AGAATTTCAAAGTATTTTTTTTATTTTTCCTATCTTACAGATCGTCAACTGTGGCAAGTTCTTTTTTCAACCCCAATCTTTTGAAGAGTGCATTC 

GAAATATGTCTGAAATTTTCAATGCTCCACAGCAACTAAGAAATTACGTAACTAACGCGAGTGCCATCGCAAAGGGCATGATGGTGTTGGCCAAA 

CGCGATAGTTATTTCCAGCGCGCCACAGTGATCCGGCCAGAGAATCAGAGCAATCGCCAACCAATGTTCTACGTTCGCTTCATCGACTATGGTAA 

TTGCACTTTGCTGCCCATGCAGCTAATGCGTTTGATGCCGAGAGAGCTAACTGAACAGTACGGCGACCTTCCGCCACGTGTTTTTGAGTGCAGGC 

TGGCAATGGTTCAGCCCTCTTCAGTGGTCAGTGGAAACAATAGGTGGTCCACCGCGGCTAATGATATGCTCAAGACAGTGGCCCAATGCGGTCTA 

ATCGATATTGAGGTGTATTCGCTTTTCAATAATGTAGCAGCTGTTCTCATTCATATGAGAGATGGCATAATCAACGATAAGCTAGTAGAGCTTAT 

GTTGTGCCGTCGATCCGACGAAGATTATATGTCTAGAAAAGATCATGATTTCCGTTTGCGACGTCAAGAATCTGCCCGGAACTTGAGCACGGCGC 

AGCGTCAACAGATCAATGAGGAATATCTGCGATCCTGTCAGCTGCCGCAAGACCACGATCTACCACCGCCTCCGCTGGAAAAGTGCAAGACCGTT 

GTAATGTTGAAAGGACCCAACAGTCCTTTGGAATGTACTATGCGATCCATTACTCGTGTGGGCTTAAGTAAACGGGTGAATATAGACCATTTGTC 

CGTTAATGCACTTCTCCTGGATGCTGATCCTCAAGATCACCACGATCATTTGATAGTGGCCCACGAAATCGCGGAGAGTAGAAACGGGCAGACAC 

TGACAGCCCGTGGAACCACCCTGATGCCCAATGTGCAGGGATTCGGTGCTCTAATGGTTATGCTATTTAGTCCCACTATGCAACTTAAATGCAAT 

AAAGAAGGCACATCATATGTCTCTGTTCTGGGAGGATTGGGTTGTGACCCGGACACAAACGAGCCCTACTTCGCAGAGCATGACGTGTTGATTAA 

TCTGGATGTCAACATTCTCGAAGATGATGTGATTTTGGTGAGCACGCATAGTACGAATATATACAGAATTGCATTAATTTTCTGATATTTATTTT 

AGATTAATCAAATTCGCTACTATATAGATAGTGTCTTCTTCAACTTTAAGGAAGAAAACAATCCAGCTGTTAGCGTAAATGAACGTGTTTCTATA 
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TACACTCAGCTGCGTAGCCTTATAAATAGGTAAGATTGCTGCGCTTAAAGTCATGCTAATTAAATGATTTCACTAAATTTGATTTCGCAGGCTCC 
TTTGCAAGGATCGTAGATATATCGAAAGAAATATGAGCAATGCGGATTTTGAATGGGAAACCAATCCCGAACTACCACTGCCGAATGAACCTTTT 
GGCAAGAGGGCCATATTTCCGATGCACTCACTTACTGAGCTCCAAGAGGAAGACACGGGTCGCTTGGTGCAACTGAGAGAGAACTGCAGCATGTT 
GCACAAATGGCGCAACTTGTAAGACTGGTGCTTGTTTACAAGAATATATTGCATTACTTATATTTTAACTTATATTTTTAGCGAAGGCACTTTAC 
CCCATATGACTTGCAAACTGTGCAATCAGTTGTTGGAATCGGTTCCCCAGCTCCGTTTGCATCTCCTGACCATATTGCATCGTGATCGTGAAAAG 
CAAATCGACTATTGCAACCAATAAGAATGGTTACATCGTTTAATATGAAAACAGCTGATTTTGAACTTTTTGATTTCTCATTTCTTTTCTTAAAG 
GACACACTTCACTTGTTTGGTTTAAAAAAATGACGTATTCATGGAATGAAAACAAGTTTTGAATTGAATCTAAACCTACAGAAATCTTCATTACG 
TTAAATATTTTCTATGTTATGTTGAATGATCTCTCTTCGTTCAATTACGAAGTGTAAAATGTGAAATATAATTTTTTGTTAATTACATCTTTAAA 
GATATCATTAAATACGAAGACAGCTATACTAACCAGTTCATTACAAGCAAAAATCCATAAACTGCTTGCTGTTTATAAAAATCAAAGCCCTGGGG 
CGTGCATGTTCTCCCTTTTTGGGCTCTTTTTTTTATTTTAGATTAAACTCACACATTATATTTATGCAACTACAACTACATCGGAGGAATTCTGC 
AGCGGCACAGGAGTTGGAGTGTTTGCCTCCAGCTCGCAGATCGTATAATTCTGCATGCGGTTTAGGGGCTTGCTGCACAGCCCACACATCTGGTG 
CTTATACACGCCGTATATCTCGAAATTCAGGATGTTGTACAGATCGAACTGGATGGTGGAGGACAGCTGCGTACACTCCACAATTAAAATATTTT 
GCTCGTTAAGTTCCACGCTGAAAGGGCATATGGCGTTTATAAAATTGTATATAAAAGCTAAAAAGCATTCTTACCGGAAGTCGTAGTCACTAGTC 
TCAATTGGCCATTGATATGGCTTTTTGCGCACGTTGTGCGGTGTTTGATCCTGGTTAAGATATATGAGCACTGGTTCATCCGTGGGCAGGCTGCT 
GTTGTCTGTGCGGGCGCGGAAGTTGAAGTAGTTTTGCCCGATATGGACTCGTAATCCATAGTTGGTGGACAGTCCTTGAACAGGCGAAGGCGTAA 
CAAACACTGCAATCTGTGGCAATGGCGAACAGTCAGCCACCAGCAGCGTGAGACAGTCGGGACGCACCGCAATCTCTCTGAGATTGATTAGGTAA 
TCATCGAAGGTATTGACTATGCCATTGATTACTGAGCAGGAGCGCTTAAGTCCTGAGTCGAAGGTAAGCTCGTGGGAAGCGTCGAACTTCGAGTC 
CAGCTTGTAGAAGAAGGGGATCTGATGGAAACGCCACTTCACTCCGTTGACCTCCACAGCCAGCCTGAACTGGTCCTGAGGAGTTTGCATTGATA 
TATTTATCTCCTGGCTCATATGCAGACTGTCCACCTTGTCGGCGGATAAGGCGGACACTAAGTGGTCCAGTCGATTCGACCACAGCTCGAACCAG 
GCGG 

(SEQ ID NO: 1378) 

Exon: 1001.. 1198 

Exon: 1256.. 1498 

Exon: 3217.. 3950 

Exon: 4048.. 4700 

Exon: 4790.. 4923 

Exon: 4975.-5564 

Exon: 5621.. 5768 

Exon: 5836. .6972 

Exon: 7033. .7154 

Exon: 7216.. 7428 

Exon: 7492.. 7934 

Start ATG: 1001 

Transcript No. : CT10340 

ATGGATCAGGAGGTTATGGATTTCTTTGACTTCTCGAAGGAGTTGAAGCGCGTGGCCGCCGCCCCGCAAGGCTACATCAGCAGCGATCCGAGGCT 
AATGGCCACGAAGTTCAAATCATCCGAAGTGCCCAACCGCGAGTTGATCGGCACCGACTATGTCAGCAAAATCGTCGCCAAGGAAAAGTGTCTCC 
T G AAC G GG AC GC TG C T CAAC G AAC AGCC T C AAGGC AAAC G C AT T C GAAC GC T GG AC GAT T T G G AC AC C G AC GAT G AGGGGG AAG AG ACC G AAATT 
CGCCGGGATGACGAGTACTACAAGAAGTTCAGATTCAATCTGAATCGCGACAAGAACTTGTCCATCTACGCCAAGCGCGAGGAAATATTGGCGGC 
CATCAATGCTCATCCGGTAGTCATTATTAAGGGCGAAACAGGATGTGGCAAGACCACGCAGGTGCCCCAATACATTTTGGATGAGGCCTATAAAA 
GCGGTAAATACTGCAACATTGTGGTGACACAACCCCGGCGCATAGCTGCCATCTCCATAGCGAATAGAGTTTGCCAGGAGCGCGAGTGGCAGCAG 
AACACGGTGTGCAGCTTCCAGGTGGGCCTGCACCGCCCGAACTCTTTGGAGGACACTCGTCTGTTGTACTGCACCACCGGGGTGCTGCTGAACAA 
TTTGATTAACAACAAGACGTTGACCCACTACACCCACATTGTGCTGGACGAGGTGCATGAAAGAGACCAGAACATGGATTTTCTTCTGATCGTGG 
TGCGTCGCTTGCTGGCCACGAACTCGCGGCATGTAAAGATTATCCTGATGTCGGCCACCATCGATGCCAAGGAACTGTCCGACTACTTTACCACC 
ACTAACTCAATTCCGCCAGTGATCACCACCAATCATCGACGTAAGCACTCGATCGAGAAGTTCTACCGTGACCAGCTCGGCAGCATTATTTGGAA 
TGAGGAAGATGTGGGAC ACC AGCAGGTGCCAGAAAT CAAC AAAC ACGGTT AC AGGGCAGCGGTAAAGAT CAT AGTGATTATTGACAACATGGAAC 
GGAAGGCTGCAATCCAGTCGCGGCAGAGCTATGACGAGGCCCTGCGCTATGGAGCTGTGCTCATCTTTCTGCCAGGCATCTACGAAATCGACACG 
ATGGCAGAAAATCTTACTTGTATGCTGGAAAACGACCCCAACATTAAGGTCTCTATTGTTCGCTGTTTTTCGTTAATGACACCGGAAAACCAACG 
GGATGTCTTCAATCCCCCGCCTCCCGGTTTCCGCAAGATCATTCTGACCACAAACATTGCTGAGAGCTCCATCACGGTGCCAGATGTCTCATACG 
TTATTGACTTTTGTCTTGCCAAAGTAAAGGTAACCGATACGGCTAGCAGCTTCTCTTCGCTCCGCTTAACCTGGGCGTCCAAAGCCAATTGTCGT 
CAGCGTGCCGGCCGCGTTGGTCGTCTTCGTAGTGGACGCGTTTACCGTATGGTTAATAAACACTTCTATCAGAGGGAAATGCCCGAGTTTGGAAT 
ACCAGAAATGTTGCGCTTGCCCTTGCAAAACTCGGTGCTCAAGGCCAAAGTGCTGAATATGGGATCACCGGTGGAAATCTTGGCATTGGCTCTAT 
CTCCGCCGAATCTGTCGGACATACACAATACAATATTGCTTCTGAAAGAGGTGGGTGCTCTGTACTTAACAGTAGACGGCATCTACGACCCGTTG 
GATGGCGATCTTACCTACTGGGGCACAATCATGTCTCGGCTGCCGTTGGACACACGTCAGTCCCGGCTTATCATATTGGGTTATATTTTTAACAT 
GCTCGAAGAAGCTATTATTATAGCTGCTGGCTTATCGACGCCTGGCTTGTTTGCGCACGAAGGTGGTAGGTCTCAATTAGGCGATTCATTTTGGA 
TGCACTATATCTTCTCCGATGGATCTGGCTCCGACTTGGTGGCCATTTGGCGCGTGTATCTGACATATTTGAACATTGTGGAAAATGGTCACGAC 
CAGGAGTCTGCTATCCGTTGGGCCAAACGTTTCCACGTCTCCTTGCGGTCACTTAAGGAGATACACTTGTTGGTGCAGGAACTGCGTGTGCGATG 
CACGCATCTCGGTCTGATTCCGTTTCCTGTTAATCCTAACCAAATGATGGACGATCGTGAAAAAGCGATCATGCTCAAAGTGATCATTGCAGGTG 
CCTTCTACCCAAACTACTTTACGCGTTCCAAAGAGTCGTGTGCAGACACAGATCGCAATATATACCAGACAATATCGGGCCACGATCCCTGCCGC 
ACCGTATATTTTACCAATTTTAAGCCTGCCTACATGGGCGAGTTGTACACTAGGCGCATTAAGGAGCTATTTCAGGAGGTCAGGATTCCGCCGGA 
AAACATGGACGTTACTTTTCAAGAGGGTTCGCAAAAGGTCTTTGTCACTTTCAAACAGGATGACTGGATTGAAGGTTCATCTAAGTATGTTCCTG 
TGTCGGGCAGAGTTCAAAGTGAGGTGTACAAGGCAGTTATGATGCGACAAAATCGTGTCGAACGGCCAATCCACATAATGAATCCGAGTGCATTT 
ATGAGCTACGTACAGCAGCGAGGGATTGGAGATGTGATTGAGGGCCGTTGGATTCCACCCACAAAGCCTTTAAACGTCGAGCTTCTTGCGCTGCC 
ATCTGTCTTTGACAAAACTATTTCAGGGTCGATTACTTGCATCGTCAACTGTGGCAAGTTCTTTTTTCAACCCCAATCTTTTGAAGAGTGCATTC 
GAAATATGTCTGAAATTTTCAATGCTCCACAGCAACTAAGAAATTACGTAACTAACGCGAGTGCCATCGCAAAGGGCATGATGGTGTTGGCCAAA 
CGCGATAGTTATTTCCAGCGCGCCACAGTGATCCGGCCAGAGAATCAGAGCAATCGCCAACCAATGTTCTACGTTCGCTTCATCGACTATGGTAA 
TTGCACTTTGCTGCCCATGCAGCTAATGCGTTTGATGCCGAGAGAGCTAACTGAACAGTACGGCGACCTTCCGCCACGTGTTTTTGAGTGCAGGC 
TGGCAATGGTTCAGCCCTCTTCAGTGGTCAGTGGAAACAATAGGTGGTCCACCGCGGCTAATGATATGCTCAAGACAGTGGCCCAATGCGGTCTA 
ATCGATATTGAGGTGTATTCGCTTTTCAATAATGTAGCAGCTGTTCTCATTCATATGAGAGATGGCATAATCAACGATAAGCTAGTAGAGCTTAT 
GTTGTGCCGTCGATCCGACGAAGATTATATGTCTAGAAAAGATCATGATTTCCGTTTGCGACGTCAAGAATCTGCCCGGAACTTGAGCACGGCGC 
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AGCGTCAACAGATCAATGAGGAATATCTGCGATCCTGTCAGCTGCCGCAAGACCACGATCTACCACCGCCTCCGCTGGAAAAGTGCAAGACCGTT 
GTAATGTTGAAAGGACCCAACAGTCCTTTGGAATGTACTATGCGATCCATTACTCGTGTGGGCTTAAGTAAACGGGTGAATATAGACCATTTGTC 
CGTTAATGCACTTCTCCTGGATGCTGATCCTCAAGATCACCACGATCATTTGATAGTGGCCCACGAAATCGCGGAGAGTAGAAACGGGCAGACAC 
TGACAGCCCGTGGAACCACCCTGATGCCCAATGTGCAGGGATTCGGTGCTCTAATGGTTATGCTATTTAGTCCCACTATGCAACTTAAATGCAAT 
AAAGAAGGCACATCATATGTCTCTGTTCTGGGAGGATTGGGTTGTGACCCGGACACAAACGAGCCCTACTTCGCAGAGCATGACGTGTTGATTAA 
TCTGGATGTCAACATTCTCGAAGATGATGTGATTTTGATTAATCAAATTCGCTACTATATAGATAGTGTCTTCTTCAACTTTAAGGAAGAAAACA 
ATCCAGCTGTTAGCGTAAATGAACGTGTTTCTATATACACTCAGCTGCGTAGCCTTATAAATAGGCTCCTTTGCAAGGATCGTAGATATATCGAA 
AGAAATATGAGCAATGCGGATTTTGAATGGGAAACCAATCCCGAACTACCACTGCCGAATGAACCTTTTGGCAAGAGGGCCATATTTCCGATGCA 
C T C AC TTACTGAGCT C C AAG AG GAAG AC AC GG GT CGC T T GG T GC AAC T GAG AG AG AAC T GC AGC AT G T T G C AC AAATGGC GC AAC T T CG AAG GC A 
CTTTACCCCATATGACTTGCAAACTGTGCAATCAGTTGTTGGAATCGGTTCCCCAGCTCCGTTTGCATCTCCTGACCATATTGCATCGTGATCGT 
GAAAAGCAAATCGACTATTGCAACCAATAAGAATGGTTACATCGTTTAATATGAAAACAGCTGATTTTGAACTTTTTGATTTCTCATTTCTTTTC 
T T AAAGGAC AC AC T T C ACT T GT T T GGT T T AAAAAAAT G AC GT AT T CAT GGAAT G AAAAC AAGT T T T G AAT T G AAT C T AAAC C T AC AG AAATC T T C 
ATTACGTTAAATATTTTCTATGTTATGTTGAATGATCTCTCTTCGTTCAATTACGAAGTGTAAAATGTGAAATATAATTTTTTGTTAATTACATC 
TTTAAAGATATCATTAAATACGAAGACAGCTATACTAACCAGTTCATTACAAGCA 
(SEQ ID NO: 1379) 

Start ATG: 1 

MDQEVMDFFDFSKELKRVAAAPQGYISSDPRLMATKFKSSEVPNRELIGTDYVSKIVAKEKCLLNGTLLNEQPQGKRIRTLDDLDTDDEGEETEI 
RRDDEYYKKFRFNLNRDKNLSIYAKREEILAAINAHPVVIIKGETGCGKTTQVPQYILDEAYKSGKYCKIVVTQPRRIAAISIANRVCQEREWQQ 
NTVCSFQVGLHRPNSLEDTRLLYCTTGVLLNNLINNKTLTHYTHIVLDEVHERDQNMDFLLIVVRRLLATNSRHVKIILMSATIDAKELSDYFTT 
TNSIPPVITTNHRRKHSIEKFYRDQLGSIIWNEEDVGHQQVPEINKHGYRAAVKIIVIIDNMERKAAIQSRQSYDEALRYGAVLIFLPGIYEIDT 
MAENLTCMLENDPNIKVSIVRCFSLMTPENQRDVFNPPPPGFRKIILTTNIAESSITVPDVSYVIDFCLAKVKVTDTASSFSSLRLTWASKANCR 
QPAGRVGRLRSGRVYRMVNKHFYQREMPEFGIPEMLRLPLQNSVLKAKVLNMGSPVEILALALSPPNLSDIHNTILLLKEVGALYLTVDGIYDPL 
DGDLTYWGTIMSRLPLDTRQSRLIILGYIFNMLEEAIIIAAGLSTPGLFAHEGGRSQLGDSF^TMHYIFSDGSGSDLVAIWRVYLTYLNIVENGHD 
QESAIRWAKRFHVSLRSLKEIHLLVQELRVRCTHLGLIPFPVNPNQMMDDREKAIMLKVIIAGAFYPNYFTRSKESCADTDRNIYQTISGHDPCR 
TVYFTNFKPAYMGELYTRRIKELFQEVRIPPENMDVTFQEGSQ 

MSYVQQRGIGDVIEGRWIPPTKPLNVELLALPSVFDKTISGSITCIVNCGKFFFQPQSFEECIP^MSE^ 

RDSYFQPATVIRPENQSNRQPMFYVRFIDYGNCTLLPMQLMPvLMPRELTEQYGDLPPRVFECRLAMVQPSSVVSGNNRWSTAANDMLKTVAQCGL 

IDIEVYSLFNNVAAVLIHMRDGIINDKLVELMLCRRSDEDYMSRKDHDFRLRRQESARNLSTAQRQQINEEYLRSCQLPQDHDLPPPPLEKCKTV 

VMLKGPNSPLECTMRSITRVGLSKRVNIDHLSVNALLLDADPQDHHDHLIVAHEIAESP^GQTLTARGTTLMPNVQGFGALMVMLFSPTMQLKCN 

KEGTSYVSVLGGLGCDPDTNEPYFAEHDVLINLDVNILEDDVILINQIRYYIDSVFFNFKEENNPAVSVNERVSIYTQLRSLINRLLCKDRRYIE 

RNMSNADFEWETNPELPLPNEPFGKRAIFPMHSLTELQEEDTGRLVQLRENCSMLHKWRNFEGTLPHMTCKLCNQLLESVPQLRLHLLTILHRDR 

EKQIDYCNQ* 

{SEQ ID NO: 1380) 

Name: DEAH-box helicase 
Classification: RNA_binding 
Gene Symbol: spn-E 
FlyBase ID: FBgn0003483 



Celera Sequence No. : 142000013384633 

TGTCCCGCTCCCGTATCCAAGCATTGTCGCTTGATAGTTCCCCCAGGAGCATGTTGCCCCCTGTGTGCTGGAGGAGCCTTCAGGATAATCTACTC 
CAGAAAGCAATTCGATCGCGCCATGTACGGACTAAGGGCACAGAGCAGCACTCTTTTGACATTGCAAGGAGTGCTGCAGCAGCTGGATGGACTGG 
TGCAGGTTAGCGAATGTCAACTGACTGGATTTCTCACCATGGAAGTGGGCATTTTTGTGGCCATAGTACCGTCCAGCAGTATCAAGAGACCGACT 
CACCTTCAATTGGAGGCCTGCGCCCGGGAGGCGGAGAAGATATCCTCCCTAATCAATGCCCAATCACCCAGAATTACAACCAATCTAGCTCTATC 
CTGTTTGACAGTATCGCATCTACTAGAACCCACGCCAAATGGAGCTACATCTTATGGTCCCCACGCCATTTGGATACTCCCCTTACTTTTGTTAG 
TTTCTAGAGTAATCATAGCTTAAGATAGTCCTAGAAGGAGTGTGTATGATCTGACTCAATCAAGTGAATTCTTGGTGCCAATTAACCCGTTATTC 
TAAGGAGATGGAAAGTGCCTTCAACTGTAAGGAATTGACCCTCTGTACCATGTGATATTAGCTAAATTAATTCTTAAGCTTTAATGTTCTTTTAT 
AGTTATTATTTAATTTATGCCAGGGTCCGTAAATCACCAAAACCCCCTTTTTGTTTTACCTAATTTAAATGTATTTAATAATTCAAATCATTGCT 
CCATGGTAATATTTAAATTTAAAATAATTATTTAAAATCTGGATCGTAAAAGTACAAAGAGATTGATGGACCAAAACTCGACACTGTGCCCTAAC 
CATGTGGAAACATTCTTAAATTTGGAAAACATTGTAATATAATTATTTTTTGGTCCCTGGCTAGGAGTTCGTTAATGACCAAAGCATTGTCCGCC 
TAGGAGTATCGCTAGTCTAATAAAAACCCCAGAAAAAAACACAGCAACTTAAGATGAAGCATGGCTTCCTTTATTGAACCACTCCATATAGATCC 
TGCCGAAGCTTTATTATAATAAATGTACGAAAAGGGTAGAAAGGGGAATCGCTTTAAGAAATCTCCATGGGCACCGCCTGGACGCGTTCGGTGAC 
GAGTAGGTCCTTAATGTTGGTGTGCAGCACGGTTGAGGTTCCGGTCTTGAAGCGGTAGTCCAACTGGCGCTTGCCCTTCTTGTTAGCCAGCTCAT 
AGTTGCGCAGATACTCAACACTGCCCTTGCGGATAACACACAGATCGATGTTGGATCCCGAACCCAGATCGTTAAACACTCCGGAGGCGATGGCA 
TCGCGCACCAGTTTCTTGCCCTCCTCTTCGGACAAGTCAGGTTTCCAGCGACTTTCGAAAACGGTCATGGCGGCCAGGGAACCCGATCCCATGGT 
GGCGTACGGCAGCTTATCCGAACTTCCATGTGGGTGAATGGAATAGATGTGGGGTCCAGTCTTATCCACTCCACCAAGCACCAGAGCGGCACTGA 
TATGACCCTGGTAGCGGAAAAGCATCTGCTTCAACATCGTATTGGCGGCCACAACGCGCACCTCACGATCTGTCTGCAAGCGATGCAGCTCCAGC 
TGGGAGGAGATGAGATCCGTAGTCATCTCAGTGTCCGCAGCGGTTCCAGCTCCACAGCAGCTGAAGGTAAATATTTTCGTTAGGTATTGGGAGAT 
AC T T G C AT GT T AG AAAG T AC AC AT AT T T T T AAG T T AGAGC AAC AT AAGC T AC AC C C T AAG AT C CT C AAC T T AAAAC A C GG GC C C C T T AAAATTGG 
AGACAAATTGATCTAAGCCGGCGAGGACAAGGAAATAGGCCCACATTTGAAATGCAGGAAAAAATACCGTAATCCCTCTTATAGTATTATTAGGT 
TCCCTGTGCACTTAAAGGCGCATACCGTCGGTGGACTTACTATATGTTTTTGGCCAGGTAATGGATCTTGGCGCAGTTCTTGTCCGAGACAATGG 
GTCCCTCGGTGGCTCGCGTATCGGCGCCCAGAATAACACCATCCTTGTAGATGATGCCAACGATTGTGGTTCCCGTCTTGGTGGTGGTCGGTGGC 
TTGAAGCCTCGATTCAAAAGAGTTGCATTGCTGAAAGCAAATAAAACACCAATTAATAAAAGCCGATGGGGTATTGCTGAACAACTTGGACGATT 
CTTACCGCTTGCAGTTGTCGAAATTGAAACCAGCGCGGGGCAGATCGCGTGCGTTATCCAAATCCATTATATATATTCTTCAATTCAACTATATT 
AGCAAGGAAAACTAAGGTTTTAATTCTGCCGAGCGCGATGACAAACAAAATTGTTAAATGCTTGCTAGAGCTGGAAGTACAATCGATATTGTATC 
GATATATATTCATGGACTCAACTAGAGCGATAGTATTGAAGAAATAAATTATTAATAATTTGACGTTATTAATAATTTGACTTATGTAACTTAAA 
TTATTCAATGTTGTATTATTTTAATATTTTAATGTTGCATTGTATAATTTTAAAACTTTTTAATGTTTGGAAATATTCCCCGCAGTATACATGCA 
ATAAGTGCTTAGGAAGGCCCCTTATCTTTTGTAAGCGAATAGTCATGTTTGAAAAATATCGATATACTGTTTCCACCTGACCATCTTGGCATAGT 
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ACCATCACTTTTGTGAACTGCATCCCTGAAGAGTATTTTGGTAATTTACCATGTCTGCGGCCACCAAATTACATAAAACAAGCTGGTGCGCCCTG 
CGGCAGCTTCAACAATACAGAACTAAGGGTAAGTAACACTAATTAGGAGTCCTTTTATTGACCAAAGATGGAGGAAGTGGTTTACATAATGCCGG 
CAGTCATTCACGTTCCGCGGAGCTCTTCGGCTAGTGATGGCAGTTCGTTGATGTCTTGGCGGAAAAATCGCTAGAATTTCTATAGATCCCAGTTA 
TTATTTTAGATAGTCAACCATAATTTAAATTCAATTTGTTGAAACTGTACTATAAAATATGTCTTTTAACTTATTAATTATCCCTACAGCCAACT 
TTAGTGGCAGTTCTGCATCGTTAGCCAAGCGGAATGACCTGTTCAACCAGGAGCAGCGCCGTCAACGGGATGCAGTGGGTCGAATCGATAAGATC 
GAGGTGCGGTATCTGGGACTTCCCGAGGATGTCACTCTGGTGATGAACGGCAACATCTCAACGCCCTTCAATTGCGCCCAGCATCTGAGTGAGGG 
GCACTGTAAGCGATCGGCACTGGCCTTGATCGACGGCAGTGTTCCCTGGGATATGCACAGACCCTTGCAGGAATCCTGTACTCTGCAGCTGCTTA 
ACTTCCATGTTTCCGAGCC 
(SEQ ID NO: 1381) 

Exon: 2344.. 2191 
Exon: 2120.. 1941 
Exon: 1675. .1001 

Start ATG: 2252 (Reverse strand: CAT) 
Transcript No. : CT10524 

CAAGCATTTAACAATTTTGTTTGTCATCGCGCTCGGCAGAATTAAAACCTTAGTTTTCCTTGCTAATATAGTTGAATTGAAGAATATATATAATG 
GATTTGGATAACGCACGCGATCTGCCCCGCGCTGGTTTCAATTTCGACAACTGCAAGCGCAATGCAACTCTTTTGAATCGAGGCTTCAAGCCACC 
GACCACCACCAAGACGGGAACCACAATCGTTGGCATCATCTACAAGGATGGTGTTATTCTGGGCGCCGATACGCGAGCCACCGAGGGACCCATTG 
TCTCGGACAAGAACTGCGCCAAGATCCATTACCTGGCCAAAAACATATACTGCTGTGGAGCTGGAACCGCTGCGGACACTGAGATGACTACGGAT 
CTCATCTCCTCCCAGCTGGAGCTGCATCGCTTGCAGACAGATCGTGAGGTGCGCGTTGTGGCCGCCAATACGATGTTGAAGCAGATGCTTTTCCG 
CTACCAGGGTCATATCAGTGCCGCTCTGGTGCTTGGTGGAGTGGATAAGACTGGACCCCACATCTATTCCATTCACCCACATGGAAGTTCGGATA 
AGCTGCCGTACGCCACCATGGGATCGGGTTCCCTGGCCGCCATGACCGTTTTCGAAAGTCGCTGGAAACCTGACTTGTCCGAAGAGGAGGGCAAG 
AAACTGGTGCGCGATGCCATCGCCTCCGGAGTGTTTAACGATCTGGGTTCGGGATCCAACATCGATCTGTGTGTTATCCGCAAGGGCAGTGTTGA 
GTATCTGCGCAACTATGAGCTGGCTAACAAGAAGGGCAAGCGCCAGTTGGACTACCGCTTCAAGACCGGAACCTCAACCGTGCTGCACACCAACA 
TTAAGGACCTACTCGTCACCGAACGCGTCCAGGCGGTGCCCATGGAGATTTCTTAAAGCGATTCCCCTTTCTACCCTTTTCGTACATTTATTATA 
ATAAAGCTTCGGCAGGATCTATATGGAGTGGTTCAATAAAGGAAGCCATGCTTCATCTT 
(SEQ ID NO: 1382) 

Start ATG: 93 (Reverse strand: CAT) 

MDLDNARDLPRAGFNFDNCKRNATLLNRGFKPPTTTKTGTTIVGIIYKDGVILGADTRATEGPIVSDKNCAKIHYLAKNIYCCGAGTAADTEMTT 
DLISSQLELHRLQTDREVRVVAANTMLKQMLFRYQGHISAALVLGGVDKTGPHIYSIHPHGSSDKLPYATMGSGSLAAMTVFESRWKPDLSEEEG 
KKLVRDAIASGVFNDLGSGSNIDLCVIRKGSVEYLRNYELANKKGKRQLDYRFKTGTSTVLHTNIKDLLVTERVQAVPMEIS* 
(SEQ ID NO: 1383) 

Name: 20S proteasome beta2 subunit 
Classification: endopeptidase 
Gene Symbol: Prosbeta2 
FlyBase ID: FBgn0023174 

Celera Sequence No. : 142000013384528 

AGATATATACTAAATGTGCATGTGTATATATAATATATATAAACATATTTGAATGATCGCAAAACGCTTTAATCAATGTTGCTTCTAGGTTTCTT 
TTTGATGCGCGCCTCGTCAATGAATAATTACGTCATATTCTTTTATTAGTTTCTGATACAGTTGGGATTATCAACTAGGTTACTCTCACGTACAC 
ACAAACACGGAACACTTGAGTAGCCTCATGGCCTGCTACTTGCTACTAGCTACTTCCTTCCGAGGTCGTGGCACTCGCTTTTCCACAGATCTTCC 
GACGCTTTTCACTTCGCGCACAGATTTCTCAATGTAGTATCAGTGAATATATGTATATATATTATATATAATATATATATCAACGCGACTGTATA 
TTATCAGATTTTTCAAAGAAGCCGACGCTTTTCCTCCTTTATCGCCAACGCGTCTAATTCCCGGATCACAGCTCGACACGTCCGCTCTGTTCGCG 
ATCCAAGCTCAAACTGAGCGGACATAACAGCCGGATGGTTGGCCTTGCTGGAGGTTTCGTTGATAAGAATCATGCTGCGAAGAGCAGCTTGGCAG 
CTGGGCTCTCTTACGATCTCTTGGGAAATGTCCTATAAATAGATTTTTGCCGCGTCCACTGGAACAACGTCTTACGGACCAGAAAACGGATTCTC 
AAATCTGATACATTCTGTTTAGAGGATTTTAAGATATCTTGCATAACATTATCTTACAATATTCCCAAGGACAATTTCGTATCCCTTCAAGACTT 
AATAATTTAATAAGGACTAGAAATACACAATATTGCGCTATTTATTTGGTACCAGACACATTTAAATTATATTAAATCTCGCGGATTGTCTATAA 
AAATCTACGTTATTTATTAGTATAAGAGTAGCATATAAGGTTACTTATAGACGAACTAAGACTTTGACGAGCTGCTCTCATACGTATAACAATAC 
ACCACAAATCACATGGAATCAGGAATATCAGAAACTACTCTGTATGTCACTTAAGGCTAGCGTGCTCCCCCATGAGCATGTCCTGCCCTGAAGTG 
GGCACCACCCGTTCCAGTACCACTTGGTCCTCCTGCTCCTCCTGCCCTGCGTTTCTCCGATCCCGATTCTGCACTAGCGTCTCCGTTATCAGAAC 
CACCACTCTGTTCTGCATCTAGATTCTTGAGTTGCTGTTGCCACATGTCCGCGTAGATGCCATCTTCGCGCAGCACAAGCTCTTCGTGTCGCCCA 
CGCTCCGCTATGCTTCCCTGCTGCAGGACCAGAATCTCGTCCGCATGGATGATGGTGGAAAGACGATGGGCAACAATGATGGTGGTACGATTTGC 
GCACACCCTGGCCAAAGCAGCCTGAATATTGCGTTCTGTGTGGGTGTCCAAAGCTGAGGTGGCTTCATCCAGGAGCACAATAATGGGCGCCTTAA 
GAAGCGTTCTGGCGATGGCTACGCGTTGCTTTTCACCTCCACTGAGCCGCAGACCTCGTTCGCCCACCTTTGTCTCATATTTCTCCGGAAAGCCG 
AGAATCCTTTCGTGAATGTCGGCGGCGCGAGCAGCCTCATAGACAGCTTCGTCCGAAGCGCCCAACTTGGCATATTCAATGTTGTAGAAGATGGT 
GTTGTTAAAGAGAACCGTATCCTGGGGCACCACTCCAATGGCCTTTCGCAGACTCTGCTGCTGCACAAGCTTGATGTTTTGTCCGTCAATCAGGA 
TAGCACCAGTCTGCACATCGTAAAAGCGGAAAAGCAGCCGCATGATGGTGCTCTTACCGGCTCCAGAGGGTCCAACAATCGCTACCGTCTTACCG 
GCCGGAACTGTGAAGCTCACATTGCGCAGCACTATCTTCTCCGGTGAATAGCCAAAGGTCACGTTTGAGAACTCAATGCCACCGCCGGCAGTTAG 
AAGGGGCGAACAGCCTGGGGCATCCACAATCTCTTCCTCCTCTTTCAGCAGATCGAACATGTTCTCCATGTCCACGAAGTTCTTCTGTATGGCTC 
GGTAATAGGTGCCGAACCAATTCAGTGGCATGTACAATTCCATCAGATAGGTGGAGAAGAGCACAAAATCGCCCACGGTGAGGGTTTGATGGTGA 
ACTACCAAGTATACACAGAGCAGCGATCCGGCCAGTAGGCCCAAGCACAAAATAATATTCTGGGCTGTGTTCAGCATGTTGAGTGTCAGCATTGA 
GAGGAACTCTTCCTTTTGGTACTTGAGAATGGCCTCCCTGTAGCAATCCACCTCGTAGTGCTCCGCTCCGTAGTACTTGACTGTTTCGAAATTGA 
GAAGGGAGTCCACGCTGCGAGCTCGCTGCTCATTGTCCGCCAGGTTCATTCTCCTCTGGTACTTGGTTCTCCATTCAGTGATTGCAATGGTAGAG 
GCTGTAAAACGGAAGGATATTTTGAAATACGAATGGAATTAATCGAATAATTTTTATCACTTACCTATATACAGGAACATGGTGAGGAAGACGAT 
TAGGCCAAACCACCAATTGAAGGCGTAAACAAAGTATGCCACGGCAACGAGTAGATCGAGAATGGTTGGTGCGATCGAGAAGACAATATAGTTGA 
GGAGGTTGTTGATCGAGTCCGTGCCGCGATCCATGACACGCAACACCTCTCCCGTTTTGCGTTGCAAGTGCCATCTAAGCGAAAGCTGGTGCAGA 



FIGURE 




TGCCGGAACAGCTCAATCTCGATCTCCCGGGTCGTGTACTGTTGAACCCGTATCCAGAGGAAGGTGCGAAGGTTATTAAATAATCCCATGCTACC 
AGTGCCGCCACCCTGGAGGAAGGACAATGCCACATAGATCAGCACAAAGTCCCAGCGAAAGACGATCGGTGCAATGGTAAGACTGTCAACTGTCG 
AAACAGGAGATTAGCAATGAATTACTTAACGAATTACGTTCATAATTCTTGACCTACCCAACTTCTTGCGATAAATCGGCAAAAACAGCTTTATG 
ACTCGACCGGCAAGCAGCAGTATAATGCAAACGATGACGGCTATCTGCAGAGCAATATTCTTTTTAGGCCAGAGGTACGGAAAGACGGTGCGCAG 
CTTGCGCCATCCGTTCCTGAAAGCAGATCCTGTCTGCACGTTTCCCTGCGATTCGTTGGCGGTATCATTGTCCAGCCGCTGATGTGGATTGTACG 
GGGCCATGATGCCAGGAGCTTTCAGACCCAAAACGAAGATGAGCAGGGAGCACAGGAACCGGGTGACAAACAAGCCCATCTCAATTTGATCCTTG 
TTCCTGCGAGTAGTAAAGGCGGTACTATAGATTATCCGTTTTACAGCCGTTTGCATTACTTACGTTTTTAGGTGGAACCACCAGTCCTCGTGGCG 
CAAATTTATGAAGGCCAGCGACTCGTTAATGAAAGCCAACGTCCAGAATAGCAATAGAACAAGTCCATGTCCTCGGGTGGGCATTGAGGGCAGCT 
GGTAGTAGCGCTCCTTGAGTATCAAACAGATGCTGAACGGGTAGGAGAAGCACACAACGCAAGTGGAGAAGATCTGCAGAAAAGATTTCGATCAG 
TAATTGTTTCTTCTTGATCATTTTCATACTTACCATATAGCCGTACACCGCACTATCTGGATAAATACGTGCATTCATCAGAAACCGCAATAGAG 
CCAAAACCGGGAGCAGGAGCAACAGAAAGAGTTGCATCGCGAACAGTCGTGACTTGGAGATTTGCGTGGGATCCGTAATCCTGGTGGCATACTTC 
CGGTACATCAGCAACTGAATGCTGCCGAAGAGCAGAAGGAATCCACCGTATACCGCCGGTCCCACGGTGTCCATGAAGCAATGCGAGATGCCATG 
CTGCGTCCACACCTCGCTGAGGGTCACGTTGGGCGGGCAGTACAGCATGGTGAAGTCGGTGTACCTGCGCGGCGACGAGGAAAGGCAATTGCGGA 
ACGATAAGCTCTGATAGAAACGTGCCAGATAAAGTGGAGCTGTCACTGGGAAACAGCTCGGTGAACACAATTGCTTTGTTTTATGGGCTGGCGGC 
AATTTCGCAATTATTCACCCAAATTATCAGACGCCCCCAAAGGGGAAATCACTATTTTTTTCCTCCGGGTAAGCGGAGTGCAGATTGTGAATTGA 
TAATTGGTGCCCTTGCAGCACGGCACCAAGAGATCCGAAATACCCTGGAGTCCGGCAACCGTGGAAACTGCAACTCATCCAGCCCATTGGATATG 
CTAAATTATGCAATGTTGCGTATGCTCGATTCGCCGGCGGGTCTAGTGCTCCCCTCTTACCCGCTATAAGAAGTCGCGATCTGGTGGAAACACCG 
CAATCAGTCGTGTGACAGGTGATAACTCAACTAATGGACAACTGGCGGGCAACTCCGACCCAATCACCACCAACTACGATCAACATAAATAATTT 
T G AC C C AAG GGTTCGGTATGGCCGTTTCTGCTT AC AC AC AAC AT AC C AC AT AC AC AT T GT AC AGT GG AAAC AG GT GAG AT T CAT T T AAAT GAAC A 
TAGCTTAATATATAACATTTTCGATTGCTATAATAAATTCAGGGAGAGTTTGTTTAAGTGATTTAATTTTAAGCTATCAAAGGTTTAAAGTAATT 
CTCTTATCAAATTGAAGGTAACAAATTTTCTGGAAACGGTCACGCTAGCTTTCAGCCCTGGTTTTCAGCTTGGCGCCCAGTTTTCAAACCATTTT 
CT TAT C AGAAT T T GC G AT T AC T T T AC C ATT C C T GGT T TT C AAC AC AC CAT C T T AT C AGAAAT T GT GAT T AT T T C AAT AT TC C T T GT T T T T AAAAC 
CCTATTTC G AC AT AC T GC G GGT AAAAT AT AC AG AT TAT AT AG AT T T CT T T ATT C GT C AAT T C T C AAAAT AAAT AT T C AATGT T CAT CT AAT AC T T 
TACATTCAAGAAATTTTCACAAATTGTCTACGCTTAACCTTAGGTAATAACTAAGCTAAGTTTTTGAGTTTGTAGAGAATGAATATTCAAAGCAC 
AAGGAACAAATTGATTGGGAGTAAGTTTCTATTTACATCTAGAGTTGCCTTTTTTGTTTTTATTTTTGTTTAATTTGAGGTTTGGTTAAATATAA 
AATTCGCTACTTATGAGTAAGGGGATGGGGCGATTAAAACCTAGGAAACATTTTAGAAGATTTCCCGTACGAATTTCGAATTTGATTCGGTCGAC 
T AT T G ACT AAC C AC T AAC T T GT T C AACT T GG GT T T AACT T AT T AT C AC AAAC AAGT AC AT T T AAT AT GC AT AT GAT T AC AGT G AAG AGGG AGT CG 
ACTCCCCCCCATACAGCTATGAGTTGATCTTGGCCAGCAGTGGCTGGGTGACGGCACTGCCCGCCACCGGTGGGCACTGCTGCTTGTAGCTCTTC 
ACCAGGAAGCCGTAGTGGCTGCTGTCCGCCGGAATGTTCTGGCCACTGCCGCTCCGTTGCGGA 
(SEQ ID NO: 1384) 

Exon: 4383.. 4238 
Exon: 3865.. 3549 
Exon: 3493.. 3294 
Exon: 3233.. 2908 
Exon: 2844.. 2440 
Exon: 2376. .1001 

Start ATG: 3848 (Reverse strand: CAT) 
Transcript No. : CT10897 

AACCCTTGGGTCAAAATTATTTATGTTGATCGTAGTTGGTGGTGATTGGGTCGGAGTTGCCCGCCAGTTGTCCATTAGTTGAGTTATCACCTGTC 
ACACGACTGATTGCGGTGTTTCCACCAGATCGCGACTTCTTATAGCGGGTAGGTACACCGACTTCACCATGCTGTACTGCCCGCCCAACGTGACC 
CTCAGCGAGGTGTGGACGCAGCATGGCATCTCGCATTGCTTCATGGACACCGTGGGACCGGCGGTATACGGTGGATTCCTTCTGCTCTTCGGCAG 
CATTCAGTTGCTGATGTACCGGAAGTATGCCACCAGGATTACGGATCCCACGCAAATCTCCAAGTCACGACTGTTCGCGATGCAACTCTTTCTGT 
TGCTCCTGCTCCCGGTTTTGGCTCTATTGCGGTTTCTGATGAATGCACGTATTTATCCAGATAGTGCGGTGTACGGCTATATGATCTTCTCCACT 
TGCGTTGTGTGCTTCTCCTACCCGTTCAGCATCTGTTTGATACTCAAGGAGCGCTACTACCAGCTGCCCTCAATGCCCACCCGAGGACATGGACT 
TGTTCTATTGCTATTCTGGACGTTGGCTTTCATTAACGAGTCGCTGGCCTTCATAAATTTGCGCCACGAGGACTGGTGGTTCCACCTAAAAACGA 
ACAAGGATCAAATTGAGATGGGCTTGTTTGTCACCCGGTTCCTGTGCTCCCTGCTCATCTTCGTTTTGGGTCTGAAAGCTCCTGGCATCATGGCC 
CCGTACAATCCACATCAGCGGCTGGACAATGATACCGCCAACGAATCGCAGGGAAACGTGCAGACAGGATCTGCTTTCAGGAACGGATGGCGCAA 
GCTGCGCACCGTCTTTCCGTACCTCTGGCCTAAAAAGAATATTGCTCTGCAGATAGCCGTCATCGTTTGCATTATACTGCTGCTTGCCGGTCGAG 
TCATAAAGCTGTTTTTGCCGATTTATCGCAAGAAGTTGGTTGACAGTCTTACCATTGCACCGATCGTCTTTCGCTGGGACTTTGTGCTGATCTAT 
GTGGCATTGTCCTTCCTCCAGGGTGGCGGCACTGGTAGCATGGGATTATTTAATAACCTTCGCACCTTCCTCTGGATACGGGTTCAACAGTACAC 
GACCCGGGAGATCGAGATTGAGCTGTTCCGGCATCTGCACCAGCTTTCGCTTAGATGGCACTTGCAACGCAAAACGGGAGAGGTGTTGCGTGTCA 
TGGATCGCGGCACGGACTCGATCAACAACCTCCTCAACTATATTGTCTTCTCGATCGCACCAACCATTCTCGATCTACTCGTTGCCGTGGCATAC 
TTTGTTTACGCCTTCAATTGGTGGTTTGGCCTAATCGTCTTCCTCACCATGTTCCTGTATATAGCCTCTACCATTGCAATCACTGAATGGAGAAC 
CAAGTACCAGAGGAGAATGAACCTGGCGGACAATGAGCAGCGAGCTCGCAGCGTGGACTCCCTTCTCAATTTCGAAACAGTCAAGTACTACGGAG 
CGGAGCACTACGAGGTGGATTGCTACAGGGAGGCCATTCTCAAGTACCAAAAGGAAGAGTTCCTCTCAATGCTGACACTCAACATGCTGAACACA 
GCCCAGAATATTATTTTGTGCTTGGGCCTACTGGCCGGATCGCTGCTCTGTGTATACTTGGTAGTTCACCATCAAACCCTCACCGTGGGCGATTT 
TGTGCTCTTCTCCACCTATCTGATGGAATTGTACATGCCACTGAATTGGTTCGGCACCTATTACCGAGCCATACAGAAGAACTTCGTGGACATGG 
AGAACATGTTCGATCTGCTGAAAGAGGAGGAAGAGATTGTGGATGCCCCAGGCTGTTCGCCCCTTCTAACTGCCGGCGGTGGCATTGAGTTCTCA 
AACGTGACCTTTGGCTATTCACCGGAGAAGATAGTGCTGCGCAATGTGAGCTTCACAGTTCCGGCCGGTAAGACGGTAGCGATTGTTGGACCCTC 
TGGAGCCGGTAAGAGCACCATCATGCGGCTGCTTTTCCGCTTTTACGATGTGCAGACTGGTGCTATCCTGATTGACGGACAAAACATCAAGCTTG 
TGCAGCAGCAGAGTCTGCGAAAGGCCATTGGAGTGGTGCCCCAGGATACGGTTCTCTTTAACAACACCATCTTCTACAACATTGAATATGCCAAG 
TTGGGCGCTTCGGACGAAGCTGTCT ATG AGGCTGCTCGCGCCGCCGAC ATT CACGAAAGGATTCT CGGCTTTCCGG AGAAAT ATGAGACAAAGGT 
GGGCGAACGAGGTCTGCGGCTCAGTGGAGGTGAAAAGCAACGCGTAGCCATCGCCAGAACGCTTCTTAAGGCGCCCATTATTGTGCTCCTGGATG 
AAGCCACCTCAGCTTTGGACACCCACACAGAACGCAATATTCAGGCTGCTTTGGCCAGGGTGTGCGCAAATCGTACCACCATCATTGTTGCCCAT 
CGTCTTTCCACCATCATCCATGCGGACGAGATTCTGGTCCTGCAGCAGGGAAGCATAGCGGAGCGTGGGCGACACGAAGAGCTTGTGCTGCGCGA 
AGATGGCATCTACGCGGACATGTGGCAACAGCAACTCAAGAATCTAGATGCAGAACAGAGTGGTGGTTCTGATAACGGAGACGCTAGTGCAGAAT 
CGGGATCGGAGAAACGCAGGGCAGGAGGAGCAGGAGGACCAAGTGGTACTGGAACGGGTGGTGCCCACTTCAGGGCAGGACATGCTCATGGGGGA 

GCACGCTAGCCTTAA 
{SEQ ID NO: 1385) 
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Start ATG: 164 {Reverse strand: CAT) 



MLYCPPNVTLSEVWTQHGISHCFMDTVGPAVYGGFLLLFGSIQLLMYRKYATRITDPTQISKSRLFAMQLFLLLLLPVLALLRFLMNARIYPDSA 
VYGYMIFSTCVVCFSYPFSICLILKERYYQLPSMPTRGHGLVLLLFWTLAFINESLAFINLRHEDWWFHLKTKKDQIEMGLFVTRFLCSLLIFVL 
GLKAPGIMAPYNPHQRLDNDTANESQGNVQTGSAFRNGWRKLRTVFPYLWPKKNIALQIAVIVCIILLLAGRVIKLFLPIYRKKLVDSLTIAPIV 
FRWDFVLIYVALSFLQGGGTGSMGLFNNLRTFLWIRVQQYTTREIEIELFRHLHQLSLRWHLQRKTGEVLRVMDRGTDSINNLLNYIVFSIAPTI 
LDLLVAVAYFVYAFNWWFGLIVFLTMFLYIASTIAITEWRTKYQRRMNLADNEQRARSVDSLLNFETVKYYGAEHYEVDCYREAILKYQKEEFLS 
MLTLNMLWTAQNIILCLGLLAGSLLCVYLVVHHQTLTVGDFVLFSTYLMELYMPLNWFGTYYRAIQKNFVDMENMFDLLKEEEEIVDAPGCSPLL 
TAGGGIEFSKVTFGYSPEKIVLRKVSFTVPAGKTVAIVGPSGAGKSTIMRLLFRFYDVQTGAILIDGQNIKLVQQQSLRKAIGVVPQDTVLFNNT 
IFYNIEYAKLGASDEAVYEAARAADIHERILGFPEKYETKVGERGLRLSGGEKQRVAIARTLLKAPIIVLLDEATSALDTHTERNIQAALARVCA 
NRTTIIVAHRLSTIIHADEILVLQQGSIAERGRHEELVLREDGI YADMWQQQLKNLDAEQSGGSDNGDASAESGSEKRRAGGAGGPSGTGTGGAH. 

FRAGHAHGGAR* 
(SEQ ID NO: 1386) 

Name: ATP-binding cassette transporter 
Classification: transporter 



Celera Sequence No. : 142000013384529 

GCTTTGAACTTCATCACGCGCAGGCTGCCCGGGAACTCGAGGGACAGCTGCTTGGTACACTTGGTGGCGATGTCAAAGCGAGCCGTATCCAATGC 

CGCGATGATCACCTGCTCCAGGATGAGATGCCGCTCGTTGCCGGTCTTGTGCACCTTGTCCTCTAGCACAGCAACCCACAGCTGGACCACCTCCT 

CGCTGTGGCGACCCGTCTCCTCGCGCCACGTTCTGAACTGATCCCGTACATCTGAAAGGATGATCATACGCTTATGATACGGCACTTTTCATTCA 

TGGCAATTGGGAGCAACCCACTGACGTGGTCTGCACCTGTCTGGAGCCGGGAAAAGCCGAAAACTTACCCGACCAGCTCATCTTCTCATAATCTA 

GCGACATATTCACTTAGGCGGACTTAAATATCTCATCAATTTAATTTAATTTCCCAATTGTAATCATTAACTAGCGATTTCCTTCTTGGCAAAAT 

AAAAGTGTGGCAGCGTAATGAGCCGTTAGATTATACCAAATAACACTAGCGTGGCGCTGCCAACCTGCTTTTTGAATATCGTAAAGTCTAGAAAT 

GAAGTCAGAGGTAATTTTTTAAAACTTTTTATTATAAAGAGACTTCTAAAGTTAAGTTTTTCTACATATACCTTGTTTCCGATATTTGCAAAGTT 

AATTAAACTGTTAATACTTCTGAAATACTTGTAGGTCCTTGAAAAGGATTCTTTAATGAATTATTTTACCATGCATTAATCTTATTTTGTCCAAA 

C T T T GT GC AT TGCTTATCCATTT T AAC AG AT T T CT AAGT C AAGT G C AC AT T TGAAAAGC GC G C C AAC G AAAAC AT G AAAC AT GATAGTTTTTCTT 

CCAGCGACGGTCACACTAACAAATCAAATAATTGTGGTACCGTCAACAAAAGCCAAGTCTCTCGTGCTCTCCGGGTATTAAAGCCGTGGAATCGG 

GCAGTGCCGCATGCGGTGCTGTGTTTAATTTCCCCAACTCCCAACAGATCCGGAAGCAGTTTGAACATGGGTCCGCCAAAGCAGCTGAAACGAAC 

CTTTCTTGGCTTCGACTTGAGCACGCAAAAGGTGAGAACTGGAACTGCCTACGTAGTATCTTTGCATCTGAGATGTGGAGGCGTCTCCAAATCGA 

CTACATCTGGGCCAATTGCGTGCGAATGGTGTTACTGGAAAAGGTCAAACGGTACTCGGGAACTTGAAACACGAACTACTCTTATCAGTTGCGGT 

GACCGGGGCTCGATTTTCGCTTATCTGCTAAAGACGCGAGCGCCAACCGATTTAACTATTATGGTTTGATTATTGAAATTCATGTTCGGAGGTAT 

TGGCCCCCGACATTATCTTAATAGAGCCGCAATAACTTTCACAATCCCTGAATGACGCTGGCGGTGACTCCTAGTTGCCCTCTATGTTTGATCGT 

TATTTTCGAGGTGTGTCTCCATATCGCTAATTGCTGCCCCATTTACTCCTCTCAGCTCAAAGCCGTCCTGCTGGGCTCCAGTTTGGAGGTGGTTG 

CCGCGGCTGAAGTGAAATTCGACACCGACTTGCCGGAGTTCCGGACCACCGGTGGAGCCAACGCTGGATCCATCAAAAATGAGTGAGTCTGTTTC 

ACCACTATCCATTACTCCAATTGAAGATTTTAATGGGATGATTGTTTGCGTAAAGCAAATAGGGGGATATATTTTACGTTTTTATATCTTGCGCT 

GTGGGGTATTTGTCCGATAATACTTAGGGCCCTATTTCATAGCAGTTGTTTTATTACACCACAATGGGTCTCTATCACTTCAAGTTTTTTTTAAC 

CTTCCTATAACAGGGACCATATATACAATGGGTACTATAAAATATATATGTGCTAACCTGACTCCACCTTTACTTGTTCTTCTTTAGATACTTTG 

TACAACCAGTGATGTGGGTGAAGGCCATGGACATTGTCCTGGACCGACTGGTTATGCAGCAAGCGGATCTTAGCACCGTAGCTGCCATCAGTGGC 

TCCGGACAGCAGCACGGATCGCTCTACTGGTCCAAACATGGAATCAACACGCTGCAAAACCTCGATTCCGAGAAGTTCCTTCACGCACAAATCGA 

CGACTCCGCCTTTGTGGTAAATCGTACTCCGATTTGGATGGACGCCACCACCACCAAACAGTGCCTAGAGATGGAGATGGCTGTTGGTGGCAAAC 

TTAACATGGTGGAACTGACGGGCTCGAAGTGCTACGAACGTTTTACGGGTCCCCAAATTAGAAAGATATATCAGCAACGCTGCCACGCCTACGAA 

GAGGCCAACCGAATCTCATTGGTCAGCAGTTTTATATCATCGCTATTTCTGGGCTCCGTCGCTCCAATTGACTTCAGTGATGGTTCCGGCATGAA 

TCTGCTGGACATCCGCAAGAAGAACTGGTCCAAAGAATGCCTAAATGTTTGTGCACCCGATTTGGACAAGCGCCTGGACATTCCGGTCAGTCCAA 

ATTCGATTTTAGGCAACGTCTCTCCGTACTTTGTGGAGCGCTTTAGTTTTTCGCCCGATTGCAAAGTGGCTGCTAGCACGGGGGACAATCCTTCG 

GCGCTGTCTGGAATGTTGGTGGGCAGCAGCTGGCTTACCATTTCCATGGGCACAAGTGACACCCTAATGATGAGCTTGAAGGAGCCGCTAAACTG 

GGAGGAGGGTCATATACTCTGCCATCCAACTGAAACGGAAGAATTCATGGGCCTTCTATGGTAAATTATTCCTGGGACATTATATCTTAAAATTA 

GTTTATTTAAATGTCTTTTTCAGTTTTAGAAATGCTTCATTGGTCCGTGAGGAAATGAACAAGAAGACAACCGGCGGCGATTGGGATAAATTCAA 

TGAGTATCTCGACTCCACGCCCCGTGGAAACTTTGGGAACATGGCCGTGCACTTTAACGACATGGAAATCATTCCCAAGGCGCAGGGTATTCTGC 

GATGGAACAGGGAAATGGATCCCAGTTCCCCGGACGCGGCTAGGGGTGTAATCAAGTAAGTCTGATTGAAAATTTAGCTGTTTTAGGTCCGACAT 

GAAAGCAATGACTCATAATCAGTTAACAGACATCTGTAAATAGTTATTTTTAGAAATGACTTTTTACCAACATGATCATGTTTTTTGTCGTTTAT 

ACTTTTTCTTAATGCCTATTTCCCTTCTACTGAATTCTTTAAAATGTATAAAATATTCCTGACCTCTTTCGCTTTATAGATTCAGCTCGCCCCAA 

ATCGAAATTCGAGCCCTAGTTGAAGGTCAGATGCTGCATCACCGGGCTGTGGCTGAAGACCTTGGCTACAAATTCGGACCCGAGACACAAATCCT 

TGTCACAGGAGGCGCATCTGTTAACAAATCAATCCTGCAAACAATTGCCGATGTGTTCAATGCTCCCGTCCATATCCAGGTTAGTTCATACCAAC 

TGTTTCTCAAGTGCAAGCAATTTAAGCAACATGTTTCTTGCAGGATGAAGGATTTGAGGCTGCCCTACTGGGAGCCGCTTATCGATCCGCCTACG 

CTCTGTACCTTCAAGAGGCAGGTGATGGGGTGACGCCTCTCTGTTACCGAGATTATATCCTAAGTCTGACAAGTAACAAACTGAGCCTTGTATGC 

GAACCACACAAGGACAGCGAGGAAATCTACGCTCCAATGCTACAACGATATCGCGAAATGGCGCGCGTCCTGTCCAATCCTAACACATAAACTCT 

CCGAAAGCCCAACGTAACATTCCAGTTTATACTCCGTACATTCACAAACTGATTCCTTCTTAAACGTCTCCTGTAATTGACAGAACTTAAGTGCA 

ATTAACAAATTCTGTGATTGACGATTTATTACAAATTTATGCGAAACCATATACAAATAAATTTTTATATATGGCCCACTCCTTATTGTGTCTGG 

TATGCCTTTAATCTTTGATCCGCCTACTTTTTGCGTCCACCACGCTCCTTGACCTGTAAATGCACCGTGGTTCCGTTTACCAGATTGTAAAAAGC 

CATGGTATTGCTGTCTTTGAAGAACATTCCCTGAAGGATAGATGTTATTGAAATAATAGTTTTCGAAACATTAGTTGTATTTCTTACCTCATAAA 

ATATCTTCTGCTTGGCCGGTGGCATGCCAGTTTCGTCCTGCAGCTTCGTCTTCAGATTGGCAATGGGTTCAGAAAGTGCCATGGTCACGGCTATC 

ATCTGTCCATTAAGCTTCCACTCTGATTTGTCGGAATTGGGTACCAAAACCTGAATGGTCACAGGGCTCTGAAATAATTTAAGATGGGTTATATA 

CCAATAGATAGCCAGATATCAAAATCAAACTTACCTTGTGGGTGGCAATAAAGTCTGCTTCTGGAATCAGGTTGTCCTCGCTTCGAATCTTTTTA 

GAAGGCGGCTCCTCCTCCATTATATCAACAGGTGGAGCAGGTGCAATTTGCTGAGGGATTCCACCAGGCTGCATGTTCATGTAGCCTCCACCACC 

AGCTCCAAAAGGCGGCTCTGCAAAATAAAAACGCTAAGCCCTCGACGAGATTTGCATACTATTTGTTACTTACGCATCATGGTTGGACGCAGTTG 

CATCAGCTGCATCACCGGTTGAGGCTGCTGCGGCGGGTGATGGTGGTGCGACTGGTGAGGTGGCGCCTGCTGATGCGGTGGATGGTGATGATGAG 

GATTATGATGCCCGTGGTGACCACCTCCGCCACCTTGGTGGTGTTGACTCTGGCTGGAGTGACTGTGCTGCGACTTGGTTGACGGCTGCGGCGGA 
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GCTGAAAGCGTCGCCTTGTTGCCCACCGGTTTGGGGCCGATTTTCTCCTTCTCCTCGTCGGGCAGCAGTCCCTTAACCTTGTGGATTTGATGAAT 
TTGTTCCTCCAGAGTGATATTTGCACGTGCTG 
(SEQ ID NO: 1387) 

Exon: 1001.. 1076 
Exon: 1481. .1602 
Exon: 1893.. 2720 
Exon: 2779.. 3000 
Exon: 3215.. 3404 
Exon: 3464.. 3877 
Start ATG: 1017 

Transcript No. : CT11741 

CGGAAGCAGTTTGAACATGGGTCCGCCAAAGCAGCTGAAACGAACCTTTCTTGGCTTCGACTTGAGCACGCAAAAGCTCAAAGCCGTCCTGCTGG 
GCTCCAGTTTGGAGGTGGTTGCCGCGGCTGAAGTGAAATTCGACACCGACTTGCCGGAGTTCCGGACCACCGGTGGAGCCAACGCTGGATCCATC 
AAAAATGAATACTTTGTACAACCAGTGATGTGGGTGAAGGCCATGGACATTGTCCTGGACCGACTGGTTATGCAGCAAGCGGATCTTAGCACCGT 
AGCTGCCATCAGTGGCTCCGGACAGCAGCACGGATCGCTCTACTGGTCCAAACATGGAATCAACACGCTGCAAAACCTCGATTCCGAGAAGTTCC 
TTCACGCACAAATCGACGACTCCGCCTTTGTGGTAAATCGTACTCCGATTTGGATGGACGCCACCACCACCAAACAGTGCCTAGAGATGGAGATG 
GCTGTTGGTGGCAAACTTAACATGGTGGAACTGACGGGCTCGAAGTGCTACGAACGTTTTACGGGTCCCCAAATTAGAAAGATATATCAGCAACG 
CTGCCACGCCTACGAAGAGGCCAACCGAATCTCATTGGTCAGCAGTTTTATATCATCGCTATTTCTGGGCTCCGTCGCTCCAATTGACTTCAGTG 
ATGGTTCCGGCATGAATCTGCTGGACATCCGCAAGAAGAACTGGTCCAAAGAATGCCTAAATGTTTGTGCACCCGATTTGGACAAGCGCCTGGAC 
ATTCCGGTCAGTCCAAATTCGATTTTAGGCAACGTCTCTCCGTACTTTGTGGAGCGCTTTAGTTTTTCGCCCGATTGCAAAGTGGCTGCTAGCAC 
GGGGGACAATCCTTCGGCGCTGTCTGGAATGTTGGTGGGCAGCAGCTGGCTTACCATTTCCATGGGCACAAGTGACACCCTAATGATGAGCTTGA 
AGGAGCCGCTAAACTGGGAGGAGGGTCATATACTCTGCCATCCAACTGAAACGGAAGAATTCATGGGCCTTCTATGTTTTAGAAATGCTTCATTG 
G T CC GT G AGG AAAT GAAC AAG AAG AC AAC C GGC GGC GAT T GGG AT AAATT C AAT G AGT AT C T CG ACT C C AC GC C CC GT GG AAAC T T T GGG AAC AT 
GGCCGTGCACTTTAACGACATGGAAATCATTCCCAAGGCGCAGGGTATTCTGCGATGGAACAGGGAAATGGATCCCAGTTCCCCGGACGCGGCTA 
GGGGTGTAATCAAATTCAGCTCGCCCCAAATCGAAATTCGAGCCCTAGTTGAAGGTCAGATGCTGCATCACCGGGCTGTGGCTGAAGACCTTGGC 
TACAAATTCGGACCCGAGACACAAATCCTTGTCACAGGAGGCGCATCTGTTAACAAATCAATCCTGCAAACAATTGCCGATGTGTTCAATGCTCC 
CGTCCATATCCAGGATGAAGGATTTGAGGCTGCCCTACTGGGAGCCGCTTATCGATCCGCCTACGCTCTGTACCTTCAAGAGGCAGGTGATGGGG 
TGACGCCTCTCTGTTACCGAGATTATATCCTAAGTCTGACAAGTAACAAACTGAGCCTTGTATGCGAACCACACAAGGACAGCGAGGAAATCTAC 
GCTCCAATGCTACAACGATATCGCGAAATGGCGCGCGTCCTGTCCAATCCTAACACATAAACTCTCCGAAAGCCCAACGTAACATTCCAGTTTAT 
ACTCCGTACATTCACAAACTGATTCCTTCTTAAACGTCTCCTGTAATTGACAGAACTTAAGTGCAATTAACAAATTCTGTGATTGACGATTTATT 
ACAAATTTATGCGAAACCATATACAAATAAATTTTTATATATGGCCC 
(SEQ ID NO: 1388) 

Start ATG: 17 

MGPPKQLKRTFLGFDLSTQKLKAVLLGSSL 

SGQQHGSLYWSKHGINTLQNLDSEKFLHAQIDDSAFVVNRTPIWMDATTTKQCLEMEMAVGGKLNMVELTGSKCYERFTGPQIRKIYQQRCHAYE 
EAKRISLVSSFISSLFLGSVAPIDFSDGSGMNLLDIRKKNWSKECLNVCAPDLDKRLDIPVSPNSILGNVSPYFVERFSFSPDCKVAASTGDNPS 
ALSGMLVGSSWLTISMGTSDTLMMSLKEPLNWEEGH^ 

DMEIIPKAQGILRWNREMDPSSPDAARGVIKFSSPQIEIRALVEGQMLHHRAVAEDLGYKFGPETQILVTGGASVNKSILQTIADVFNAPVHIQD 
EGFEAALLGAAYRSAYALYLQEAGDGVTPLCYRDYILSLTSNKLSLVCEPHKDSEEIYAPMLQRYREMARVLSNPNT* 
(SEQ ID NO: 1389) 

Classification: enzyme 



Celera Sequence No. : 142000013384516 

CATCAGGTTCTGCACATTCTGGTTGTCGATGCGCACTTGGCCGCTGTAAACATAGTTGATCAGACTCTCCAGGGCACTTGGCTCTATGGCGCTCT 
CCTTCATGGTGATCTCCTTGATCCGACTCTCTGCCATGTTGTTGGTGAACATCGCATAGAAATACGGGATGGTGGCCGCCAGCACAACGCGGTGC 
GCTGAGAAGGTCTGATCTTCAACCTGCAATAATTGGGATGGCATTTTATATTTAGACGCTAGATAATGAACGTATCTCTGAGCTGCGGGTGATTG 
AATGCCCCAGTCTGCACGCAAACAACCTTTAGGGTGACATCGCACAGCTTGCCCATCCTGCGGATTTCCTTGAACAGGGGGAAACTCTCGCCGAA 
TAGCTTATCATGCTTGTAAACAAAGTATTGATCCTCCACTTGGCTATTATCACTCAAATCCATCACTTTAACGGATCAAGAAAATGATAAACAGT 
AGAGATGCCAAATGTTGCTTGTACTGCAGAGCTAGTGGTGGTCAGTGGTGAGCGGTCACATTAATCGATAGTTGAGGTTCCTTCGATAGTTGAAC 
AACACCAATAGTAGTGAAACAAATTTATTGAATCATATTGTTTAATTTTAAAGGTAACGTTCTTGGACCAAAATGTAATTTCATAATACTTATTA 
AAAAGTTGAGCCCTTTTTATGCAACAGTGATAATTAAATCCTTAAAAAGCACATATTTTTAAGACAATATTATTCTTTCATTTAATTTTAATTAG 
TAATTTTTCCGTTTTTGCGTCACCTTCTTAAAAAAAGACGCTCATCACTAATTGGGCGGTGATGACAGCCCTATCAGCTGTTAGTGTGGCTGGAG 
TGTTTTCTTCGCTCGGATTGATTAAGGGTCCAAAAACATTTTCGAGTAATCTGGGTCGCTTGCTAATTGGAAAACCACTGATCTGAGTTAAGACA 
TAAATGTTTGGACCGCTATTAAATGCAAGGATACGAAAATAGCAAGTAACATGGAAGCCGCATCACCGAGGGAAACCCAGGTGAGTAAAAAGGTG 

T C G C T AG AAT TG AT GAT G ACGT C TCATCTTGTTATTTG CAT T T GC AG AAG AACC C G AAT C C G AT C G AGGC GC T GAG C AAAG AT G AAAT CAT C AG C 
AAATACAAAGGCCTGCTCAACATCGCTAAAAAGGCGAAACAGGCAAAAGATGGTAAGTTTGATGGAGAGTTTCAGGGAATCTTCCAGAATAATGC 
AAGCCACTCTCTTTCAGAACTCACCGAGGAAAATCACCGACTCAAGGATGCCCTTAAACGGGCGGCGGAGAAGCAGAGCAGCCTGCCAGCCATGC 
AGGAAATGGTCCAGGACTTCACCGATAAGAACCTCATCCTCACCGAGGAAGTGAATAATCTGAAACGAAAGACCAAAGAGGATGCCGATCGACTG 
ACGCAATTCGAGATCGAGAACGAGAGCCTAAAGCGCCAACTTGGGCGACTGAGTGATGAAAACGATGCCCTCCTTGCCAATGTGGATCGCATGGA 
AAAGGCTATGCAACAGGTCAATGCTCTGGGTAACGAGCAGCGCAAGAATCTGGAACTTTTGGAAGTGGACATTGCCAAAATTAAGGAAGCGGAGG 
CGGAAAACGCTAGTCTTCGCCAGCAAGTCGCCACGATGGAAGAAGAGTCGAGTGTGCTGCAGCAAAAGTACCAAAACATCAAGGAACTTAACTCT 
GAGCAACGCAAGAAGTTTAATTCTCTAAAGGACCGCTTCATTGATGTACACCGCAAGTTGAAGAACCTCAAGGAGTGCAAGTGCGTGCTGCTGGA 
AACGCAGCACGAGTACGCCGCATCCGTTTCCAAATGGCAGGTGGAGATTATCAAAGCTTCGCAACTGTTGTGCGCAAAGATGGCATCCCTGCAGG 
CCGAGAATGAAAAACTGAAGCTGAACAATGGAAAGTCAGACAATAATCCTCAAACAATTGACACTGGAATTGACCGCAAGCGCTTGCTGCAAAGA 
GTTCAGGAAATGGATCGGCTAGCCAAAATAGTGAAGCAGAAGCAAAAGAACCAAAGGAGCAATTTGAATGTGGAGTACCTACTTAAGAAGATAAC 
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CGCTCTCGAAGAGCTTGCAGTCATTATAAAGCAGCAGCATAGAATTGACAAGGAACAATTGATAAGTGTAACGAAGGAGCAGGAAAACACCAAAA 

ATCACGCCAGGAACCTTAATGTAAGCTTATTCCAGACCAAGCTTGATCAAATGCAAAATCTGGTCAAAGTCATTGCAAAGGAGCGTGATAATCAG 

CAGAGGAAGTTGCAGGAGTTGGAAGCCATCTGCATAGAACTGCGCCAACACAACGAGGATTTGCTTACGCGATACCACCTCAAGGAGCAAGAGCA 

TGGAGAGTTGCTAACGGAGATGCGGGAACTGAACGAGGCGCTCAAAGGACGCGGTGATGCCATTTCACGCCTTCAAGAACAGCACGAAGCGGAGG 

TCAAGCGACAACGAGACTTGGAGGCTCAACTATCTAATAGTCAGCAGGCGGCGCAGGAGAAGTTACAGAAGATCAAACAACTCCAGAGCCGTGTT 

GAGGAATTGGAGCAGGCGAATGCTGATGCCCAGAGTGATGTTCTGTCGACATCCACAATATCACGAGCCGAAGAACTAAGCCGCCTGCGTGAACT 

GGACGAAGGCTACGAGGAGAAGTACCATAAGCTCAGAGCCATAGCTGCCAAGCTGAAAAAGAAACTGCAGGAGCAGACGCAGCAGCTGAATGAAA 

TGGAACAGAGCGGTGCCCTAAAGGAGGAGTTGGAGGCAATCAAGTTGGCACAGGCCCAACTGCAACAGGATTTAAATGCAGCTCGAGCTGAAAAC 

CAGAAGCTAAAGTCCAAGGAGAAAGTAAAGCACTCCAGTGTCCTTAATTTGGAAATTGAAGCAGCTGAAAAATCCCTGAGCGAAGTTAGTGCGAA 

GCTGACAGCTAAGTCTAGCGAACTAGAAGCCGTTAAGGAATCACTGGCCAGCAAGGAAAACACCATCGTCCAGCTGCGCAAGGAGATCGCTATCC 

TAGAGGAGGCCAAAAACGGCGAAGCGGCCCACTCCCTAGAACTTAAGGAGCAGATTGATCGCATGCAGGTCCAGGTCAAAGACGCAGTTCACAGC 

AAGCAACAGGCGCTCACTCAGAACAAGGATTTGGAACATGGCGTTGAACAGGCCAAGCTCGAGGCGGAGCAATTGCGTCTCCAGCTGTCGGAGAG 

CGCCCAACAATACGAATCGAAGCTAAACACGGCCACCCAACAACTACTAAGCCAAACCCAGGAGCTGGAAATGCACCTGGCAGAGCAAAAGCGAT 

TGGAGACGGCGCTTCGAAATGCGGAACGAGCTCTGGAGGACCTGCGAGTAGAGTACACCGAATACAAGTTGAAGGCACAGTCTGTGCTGCGCAAA 

AACCAAAATAAGGGCTCCAACCGCGAACAGGAGTTGGAAGAAGAGTTGGTTGCGCTTCGAGAGAGTGAACGCAATCTGCGCGCCAGCAACGATGG 

AAGAGCTGCCCGCCTGGCCCAGTTGGATAGTCAAATAGAGGAATTGCGACAGGATAACACTGATCTGCAAAAGCGTAGCAAAGAGCTCGTCTCTC 

TAGTTGACGAACTGCGTCAGCAAAACGACCTGCTGTCGCTAGAAAACCAACGCCAGTTGCAATTCCAACACGACCTGATGCAGCAGCATCGCCAG 

CAAGTTGATGAGCTGGACGCCGGGCACCAGTTGCAATTAACGCAGGTGCAGGAGCAACTGGAGGAAGCGCAAAAGATGCAAGCAAACGTCTCTCA 

GCATACCACTGCATCCGCCGCGAGCGTGGACACCAGTCCAGAGCAGGCAAAGATCGATTACTTGCTTATGGACCACGAAACAGGCTTGGATGGCC 

ACGCTGGAGACGTCTCTCTGGCCCAACTGGCTGCTCAGCGAAAGATATCTACTGCCTCGAGACGTTCCCACGACTTCATGCCGCTTGACGAGCTG 

CTCAACACATCAATGAACCAAATAACCAGCGATACCGTGACTACCATCTCTAATTTCGGACGCAGCGTTTCGCAGCAGGAGGACGAGGAAGCGGA 

AATGGCCGCACGGGGAGACTTTTCCGTCCAAAGTGCCCAGCTTCAGGCCACAAAAGAACGCCTAAGCATCCAGGAAAGTCGTGTCAAGCACTTGA 

CTGCCCTGTTGGCCGAAAACGAACAGGATTTGGCTAAACTGACCCAAATGAACGACATGCTCAAGGAAGAATTGCGCCGCCAAGAGCGCTCCGAG 

GAACGTGAACAACACATGCACAACTCCGAGTACTTGAAGAACGTCTTCTTAAAGGTGAGATCAGGAACGCAATAACTATATCCACGGGCTTAATC 

TATTTACTTTATAAACTTGCAGTTCCTTACGTTAAACAATGTCGATGAGCGGCAGCGCTTGGTTCCCGTGCTAAATACCATTCTCCGCCTGAGTC 

GCAACGAAATGGAAATGCTGAACTGTGTGGCCAAGGGACAGAAAGGTT^ACTACTAGCAGAAAAATTTAAATTTTAGTATGGTTTCTTGTATTAAG 

ATATTGCTTTTAGTTTCTGCAGATGGTAGCAGTCGCAGCTGGACGGGATTCCTCACTGCGTGGAGTGGAGGAGGCAGTCCCAATAATAACAACTA 

GCGAACATCATAGGATATATAATATATAAATATTCGTCTTGTTGGATGCGGATAGTCGTCAGTCGTTGGGCAGTTGCATAGCATTCATTCTGTAA 

ATCTTGTACAATTTCCCTTTTATATATAATGTATTTTAAAATTGCTCGTTGCTGTTTGGCACATCGTACTTTGTACACTCTTGTAATTTGTCCAC 

ACCCGTTGTGTTGAGTGCCCTATACAGCACCACCGGCAATATGCGCGTTGTAGATTGAAGAAAGCCTTTGCATATATTTAAAAACAATGTTTTTC 

AATGGTTTGTTTAATTTGAAATTCTATTACCATATAGATCAGACTATAAATCAGATGCATTGTCTATAAATTAGTATTCCATTATTTGTATTGTA 

TGTACTATATATAAATAGGATCGGTATTTATGCTTACATTTACACTTTTAAACGGTGTATATTGTTTACACACAATATAAAATATCATTAACATA 

AATCTTACATAAATTTCATTAAAACCAAAAGTGAGTAATCAAGTTAAATAAAAATTCTGTCTTGGCGGAATACATATGCAGGTGAACGATTTGAA 

TCGAAATGCGCCAAAAGATTAAAGATCACAAGTCGAATAACTACTTAAATACTAATCAGTATTGTGAGCCCGGATCGGATCCTATGCCAATCTAT 

TGCCAAACGCCCGCATAGTTGCCCACCAAAGTCGCCGGCAGTGGTCCTCCTGCAGCGAAGAGGCGCATCCTAGTGCTGTCATAGCAGTGGGTGTA 

GATGGCGGGCACATACTTCACCTGCTGGAAGCCATCCATCTCATCCTCCGGATACTGAAATGTACAAAACAGTTAGGTGAATTGGTGTATTTTGT 

GGAACTGTAAC 

(SEQ ID NO: 1390) 

Exon: 1001.. 1030 
Exon: 1093.. 4329 
Exon: 4393. .4521 
Start ATG: 1001 

Transcript No. : CT11878 

AT GG AAGC C GC AT C AC C G AGG GAAAC C C AG AAG AAC C C G A AT C CGAT C G AGGC GC T G AGC AAAG AT G AAAT CAT C AGC AAAT AC AAAGGC CT GC T 
CAACATCGCTAAAAAGGCGAAACAGGCAAAAGATGGTAAGTTTGATGGAGAGTTTCAGGGAATCTTCCAGAATAATGCAAGCCACTCTCTTTCAG 
AACTCACCGAGGAAAATCACCGACTCAAGGATGCCCTTAAACGGGCGGCGGAGAAGCAGAGCAGCCTGCCAGCCATGCAGGAAATGGTCCAGGAC 
TTCACCGATAAGAACCTCATCCTCACCGAGGAAGTGAATAATCTGAAACGAAAGACCAAAGAGGATGCCGATCGACTGACGCAATTCGAGATCGA 
GAACGAGAGCCTAAAGCGCCAACTTGGGCGACTGAGTGATGAAAACGATGCCCTCCTTGCCAATGTGGATCGCATGGAAAAGGCTATGCAACAGG 
TCAATGCTCTGGGTAACGAGCAGCGCAAGAATCTGGAACTTTTGGAAGTGGACATTGCCAAAATTAAGGAAGCGGAGGCGGAAAACGCTAGTCTT 
CGCCAGCAAGTCGCCACGATGGAAGAAGAGTCGAGTGTGCTGCAGCAAAAGTACCAAAACATCAAGGAACTTAACTCTGAGCAACGCAAGAAGTT 
TAATTCTCTAAAGGACCGCTTCATTGATGTACACCGCAAGTTGAAGAACCTCAAGGAGTGCAAGTGCGTGCTGCTGGAAACGCAGCACGAGTACG 
CCGCATCCGTTTCCAAATGGCAGGTGGAGATTATCAAAGCTTCGCAACTGTTGTGCGCAAAGATGGCATCCCTGCAGGCCGAGAATGAAAAACTG 
AAGCTGAACAATGGAAAGTCAGACAATAATCCTCAAACAATTGACACTGGAATTGACCGCAAGCGCTTGCTGCAAAGAGTTCAGGAAATGGATCG 
GCTAGCCAAAATAGTGAAGCAGAAGCAAAAGAACCAAAGGAGCAATTTGAATGTGGAGTACCTACTTAAGAAGATAACCGCTCTCGAAGAGCTTG 
CAGT CAT TAT AAAGC AG C AGC AT AGAAT T G AC AAG GAAC AAT T G AT AAGT GT AAC G AAGG AGC AGG AAA AC AC C AAAAAT C AC GC C AGG AAC C T T 
AATGTAAGCTTATTCCAGACCAAGCTTGATCAAATGCAAAATCTGGTCAAAGTCATTGCAAAGGAGCGTGATAATCAGCAGAGGAAGTTGCAGGA 
GTTGGAAGCCATCTGCATAGAACTGCGCCAACACAACGAGGATTTGCTTACGCGATACCACCTCAAGGAGCAAGAGCATGGAGAGTTGCTAACGG 
AGATGCGGGAACTGAACGAGGCGCTCAAAGGACGCGGTGATGCCATTTCACGCCTTCAAGAACAGCACGAAGCGGAGGTCAAGCGACAACGAGAC 
TTGGAGGCTCAACTATCTAATAGTCAGCAGGCGGCGCAGGAGAAGTTACAGAAGATCAAACAACTCCAGAGCCGTGTTGAGGAATTGGAGCAGGC 
GAATGCTGATGCCCAGAGTGATGTTCTGTCGACATCCACAATATCACGAGCCGAAGAACTAAGCCGCCTGCGTGAACTGGACGAAGGCTACGAGG 
AGAAGTACCATAAGCTCAGAGCCATAGCTGCCAAGCTGAAAAAGAAACTGCAGGAGCAGACGCAGCAGCTGAATGAAATGGAACAGAGCGGTGCC 
CTAAAGGAGGAGTTGGAGGCAATCAAGTTGGCACAGGCCCAACTGCAACAGGATTTAAATGCAGCTCGAGCTGAAAACCAGAAGCTAAAGTCCAA 
GGAGAAAGTAAAGCACTCCAGTGTCCTTAATTTGGAAATTGAAGCAGCTGAAAAATCCCTGAGCGAAGTTAGTGCGAAGCTGACAGCTAAGTCTA 
GCGAACTAGAAGCCGTTAAGGAATCACTGGCCAGCZ^AGGAAAACACCATCGTCCAGCTGCGCAAGGAGATCGCTATCCTAGAGGAGGCCAAAAAC 
GGCGAAGCGGCCCACTCCCTAGAACTTAAGGAGCAGATTGATCGCATGCAGGTCCAGGTCAAAGACGCAGTTCACAGCAAGCAACAGGCGCTCAC 
TCAGAACAAGGATTTGGAACATGGCGTTGAACAGGCCAAGCTCGAGGCGGAGCAATTGCGTCTCCAGCTGTCGGAGAGCGCCCAACAATACGAAT 
CGAAGCTAAACACGGCCACCCAACAACTACTAAGCCAAACCCAGGAGCTGGAAATGCACCTGGCAGAGCAAAAGCGATTGGAGACGGCGCTTCGA 
AATGCGGAACGAGCTCTGGAGGACCTGCGAGTAGAGTACACCGAATACAAGTTGAAGGCACAGTCTGTGCTGCGCAAAAACCAAAATAAGGGCTC 
CAACCGCGAACAGGAGTTGGAAGAAGAGTTGGTTGCGCTTCGAGAGAGTGAACGCAATCTGCGCGCCAGCAACGATGGAAGAGCTGCCCGCCTGG 
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CCCAGTTGGATAGTCAAATAGAGGAATTGCGACAGGATAACACTGATCTGCAAAAGCGTAGCAAAGAGCTCGTCTCTCTAGTTGACGAACTGCGT 
CAGCAAAACGACCTGCTGTCGCTAGAAAACCAACGCCAGTTGCAATTCCAACACGACCTGATGCAGCAGCATCGCCAGCAAGTTGATGAGCTGGA 
CGCCGGGCACCAGTTGCAATTAACGCAGGTGCAGGAGCAACTGGAGGAAGCGCAAAAGATGCAAGCAAACGTCTCTCAGCATACCACTGCATCCG 
CCGCGAGCGTGGACACCAGTCCAGAGCAGGCAAAGATCGATTACTTGCTTATGGACCACGAAACAGGCTTGGATGGCCACGCTGGAGACGTCTCT 
CTGGCCCAACTGGCTGCTCAGCGAAAGATATCTACTGCCTCGAGACGTTCCCACGACTTCATGCCGCTTGACGAGCTGCTCAACACATCAATGAA 
CCAAATAACCAGCGATACCGTGACTACCATCTCTAATTTCGGACGCAGCGTTTCGCAGCAGGAGGACGAGGAAGCGGAAATGGCCGCACGGGGAG 
ACTTTTCCGTCCAAAGTGCCCAGCTTCAGGCCACAAAAGAACGCCTAAGCATCCAGGAAAGTCGTGTCAAGCACTTGACTGCCCTGTTGGCCGAA 
AACGAACAGGATTTGGCTAAACTGACCCAAATGAACGACATGCTCAAGGAAGAATTGCGCCGCCAAGAGCGCTCCGAGGAACGTGAACAACACAT 
GCACAACTCCGAGTACTTGAAGAACGTCTTCTTAAAGTTCCTTACGTTAAACAATGTCGATGAGCGGCAGCGCTTGGTTCCCGTGCTAAATACCA 
TTCTCCGCCTGAGTCGCAACGAAATGGAAATGCTGAACTGTGTGGCCAAGGGACAGAAAGGTAACTACTAG 
(SEQ ID NO: 1391) 

Start ATG: 1 

MEAASPRETQKNPNPIEALSKDEIISKYKGLLNIAKKAKQAKDGKFDGEFQGIFQNNASHSLSELTEENHRLKDALKRAAEKQSSLPAMQEMVQD 
FTDKNLILTEEVHNLKRKTKEDADRLTQFEIENESLKRQLGRLSDENDALLANVDRMEKAMQQVNALGKEQRKNLELLEVDIAKIKEAEAENASL 
RQQVATMEEESSVLQQKYQNIKELNSEQRKKFNSLKDRFIDVHRKLKNLKECKCVLLETQHEYAASVSKWQVEIIKASQLLCAKMASLQAENEKL 
KLNNGKSDNNPQTIDTGIDRKRLLQRVQEMDRLAKIVKQKQKNQRSNLNVEYLLKKITALEELAVIIKQQHRIDKEQLISVTKEQENTKNHARNL 
NVSLFQTKLDQMQNLVKVIAKERDNQQRKLQELEAICIELRQHKEDLLTRYHLKEQEHGELLTEMRELNEALKGRGDAISRLQEQHEAEVKRQRD 
LEAQLSNSQQAAQEKLQKIKQLQSRVEELEQANADAQSDVLSTSTISRAEELSRLRELDEGYEEKYHKLRAIAAKLKKKLQEQTQQLNEiyiEQSGA 
LKEELEAIKLAQAQLQQDLNAARAENQKLKSKEKVKHSSVLNLEIEAAEKSLSEVSAKLTAKSSELEAVKESLASKENTIVQLRKEIAILEEAKN 
GEAAHSLELKEQIDRMQVQVKDAVHSKQQALTQNKDLEHGVEQAKLEAEQLRLQLSESAQQYESKLNTATQQLLSQTQELEMHLAEQKRLETALR 
WAERALEDLRVEYTEYKLKAQSVLRKNQNKGSNREQELEEELVALRESERNLRASNDGRAARLAQLDSQIEELRQDNTDLQKRSKELVSLVDELR 
QQNDLLSLENQRQLQFQHDLMQQHRQQVDELDAGHQLQLTQVQEQLEEAQKMQANVSQHTTASAASVDTSPEQAKIDYLLMDHETGLDGHAGDVS 
LAQLAAQRKISTASRRSHDFMPLDELLNTSMNQITSDTVTTISNFGRSVSQQEDEEAEMAARGDFSVQSAQLQATKERLSIQESRVKHLTALLAE 
UEQDLAKLTQMNDMLKEELRRQERSEEREQHMHNSEYLKNVFLKFLTLNNVDERQRLVPVLNTILRLSRNEMEMLNCVAKGQKGWY* 
(SEQ ID NO: 1392) 



Celera Sequence No. : 142000013384367 

CAGATCTTTCGTTTAAACTTACGTGAATTAGGTTTAAAAATGCTCCAATTAATCGATGTCAGTGCTTTTATCACGAGTTTTGGCTTATTAAGTTC 
GGGCTCAAGCCGCTCGTTGCTTCCGTGCACGCGGCAGTTGTTGGCGTGGACGCCGGAGTTGCTGCTATTACAAGTGTTTAGCAACTTTGAACACT 
GCGATTTCGTGCTGCTGCCAACAAGTGATGCAGTTGTAAATGAATGTTGCAATTTCAAATAGCAATTGTTCTTGATTTTAAAGCACATGAGCACT 
AGACTGATTCGAATTGGACATTTGAATCTTTAACGGGCGGAGAATATAAAACACCGGCAAAATAAATAAAGAAAGGAGCCACAACAAACGGAAAG 
CCAGAGGCGTAGTCATTCTCTTCGCAAATCAGCGGGGGTTACCATTCGAGCAGCATACAACCCTCTCTGAGAGAGAATTGACATGCCACTGGATC 
TCACACATACAATGAAAGGCACACTCACTCAAAGTAGACAGCACTGCCAGACTTAATGCAATAGTATAGCAGTTGGTGTTGGCAGTGCCATTTGA 
TCGCTGGAGGTGCTGCCAACGCATTTACCAATTAATTAATGACCACCCCTAAATGTTGTAAATTATAAAAATTATTCAAGAATATTACGATTTTG 
GTTAAATTAATGTACAAAATAAAATATTTCATATTTGTATTGGCTGTCAAATGAAGTTATTTTATTTTAAGATAAATGCTAACAAATCTTAGAGA 
AAC TGGAAAACAAGGTTGAGCGGTTTTGAGAAT T AT GAAC AACGCGT AAAAAAC AT T AT T T AT AAACGT AAAAAAAT ATT AT AGT CTAGCCCGAC 
AGGTAAAAAATAAATTTATCCGGGGCCACCTTGGTCAACCCTCGTCAACGACCGTTCGGCCTGCCAACTCGCTTTTCCCTGCAACGCAAGGCGCC 
GCTCTCTGCTCACACTCTCACTTGGCTGTGTGTGTGTGCTCAATACTCGCTTTTAGTTTTCGCTGCAGTCGGTCGCCGTTTTTAGTCTCAGCTGC 
TGCGCGCGTGTGTTGTGATTGTGTGAATTGCGTTTGCGTTTGTAGCATAACACAAAACCGATAAATTCCTGGCGAAGCGAGCTCTCTCGATTGGA 
TTCTTGCAAAGTGAACTAGCGATGAGCGCCGCCTCTGAGGTGAACCTGATGTTGCTGCGGTCAGTGGAGAAGCAGACTGCGCTTTATGATCGTAC 
CGACGAAAACTATAGGAAGCGCCTGCCGAGTGAGAACGCCTGGGACATGGTTGCCTCCGAAGTTGGAGAGTCGGGTGAGTCGTGAGTGCGAGCGG 
GATAGCAATCGACGTTGCGTAGTTTTCTGCCAGTGCTGCACAGCGTCAACGCGCGCACAACAGTCAGTCGTCCCCGAATTATCGGCCGTCGATAG 
TTGTGTGCGAGTTGGCAGCGCCATGTATGCCATCACAAAGCAGCTGGCATCGATTAATTCGAATATAACGAGTCTCCTGTTGAATGCGTAGTCGT 
TTGGGATGAAATTGCTTCTTGCTGTGACATAATTTTTGTAAATAGCTGGAAAGTAATTTATATTAGTTATTTGGCAACTGCAATAATTGCATGGT 
AAAAACTAAACAAAGTTATTTCGTTTGTTGTTATTATTATTTTTTTAATTTAACATTTTATAATTTATAATTCTCCAACAAGCTTGTTATTTCAT 
AT T AAAAAAAT GT CT T AT TT GAAT AT T AC AT TAT T T T AG AAGG AT TT AAAAAT AAAT AAAT T AAAT AC AG AAAAACC AAAAT ATG A C AAAT AAT A 
ATATCTAGTCTATTCAGTAGCCAAATCCATTTCATTTAATTCTTATTAGCTTTATAAATTTTTTTTAAATGATTTGAATCCTTCTGCACGTTTTG 
GTAACACTGGTAACACTAATCGCCCATTAGCTGAGTAAAATTTAACCTTAGAAAATTGGTATATTGTATTATATTGGTATATTATGAGTGGGTAG 
TTCCATTTATCCATGCATCTCTGATTTTGTTGTCGGTGTATATCGATTGACCCATATATATTTCCCTGGAAAGATCTTTTGGCCAAACTTTTCGA 
GCAGTACAATATAAGTTGGCTCTTTTTCATTTGAGAACTGAAAAAATATTGCCAATGATGGGGGACTTCTTATTCGAGAATTCTTGCTGTAAGTT 
TTATTTATATATCGCAAAAGAAAGAAAAATCCTTTTTAGTGTACATATGTACATACACACGTGCATGTACATATTTAAGTATGATTCTCAATTTG 
ATTGCGATTATTTCCACGTGGTTTCTGACACCTTTTACTAAACTTTACACGATTCCTTAACACAACTATTTTTTAAGTAATGTGTTTGGATATCA 
GATAAAGGGTGTTTCAGCAATGTCTACAGGTTTTGTTAACTAACAAATTAAAACGTATCTTAGAAGTTTGGGCGTCGGAAACCTGCACGCCTTAT 
AAATATGTATGATATGTACATCCATGTACGTTCCCATGTATGTATGTCGGAAATCTGCATATATATATATTTATATAAATATATATGGTCATGTA 
TGGTCATGCAGATAAATGGACGCTTCTAATTTGTATACCCCTTACTCGTAGATTTAAAGGGTATACGTTGGAAGGGATTCGTTGCAAAGTATGTA 
TGCAACTGGTATGAAGGTCGATCTCAGGAACTATAGAAACAAAAAGGTTCAAATTAAACATATAGATTAAAGGGACTTAGACGCAGCGAAAGTTT 
GTTTTTTCATAATTTTTAATTAGTTTTGTAAATTTATATTGACTTTTTGAACAGTTTTTTTAAAATTTTCTCATTTGACTCCCCAATATCTATCT 
ATATGCCATATCTATATGCATTATAAAGCTTCCCACTTCCATTATTCATTCTCTTATTATTTGTTTTTGTGGTGATTTATTGCACTGTAAATTAA 
TTTTTTGCAAAAATGCTTTGTCTTAAAAATATAAAGTAATTTCCTACAATTTTTCTTGTATATGTTATAGCTGATATTATTTTATCACTTTAAAA 
AATTGTTATGTTTGGCTAGAAAAAAGCTTATTTTTTATGACCCGATTTTTATTTGAACTATCCTTATGCAAGGACAAGAATTGAAATGGCACCGA 
ACACAAAATTTCATGTGTCAGTTTTGTCAGATTTTGGCCGCGGTAATTGAAAATAACCTTCCGCAGCGTAGTTAGTTACATACTGTTACAAACCA 
TATTAATCGGCGAGTATAAAGGCAGATAACTAAGAAGCAGTTTTGGCACGTAGAAGCTAAGAATTTCTTGTCTATTACTCCAAATCAATTTTGCC 
ACTTCCTCCAATGGAAAGAGCCATACGAATTTGATCAATTAAAAAATAATATGGAAATATCTAATACCGAAGACCATATTTAGACCATTAAGACC 
ATTAATTAGCTAATCATTTTCCGACTGTTGGTATTGAAAATATAGTTTTTCGATTTTAATAAATGTAATCCAAAATTTATATATATTTCCAAGAT 
CTAGTGTATTCTTATTAAAAACAAGTAATGTGCCATCCGGGTTGTAGGACCCGTGATTCAGGGTTCGCGGTGCGTACTGGATGCCCCGATTTTGT 
GACTAGCAGTTACACACTGCAGTGCTTTTATTTTTATTTTTAAAAATTTACATACATGTATAGCTTAATTATTAAATACTGCTACCTTCAGGCAG 
CTAAAAAATTATATATAAACTGCAGTGCATGTACTCGCCACCGAATACTCATTTATACTCATTTAGATATTCATACATAGCATACTTTCAAGCGA 
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AACTTGCATTTACGGAACCATGTACAATGTTGGTTTTTTGAAATTZVAAAATAATTTAACTAGTGTCTCTTCCCATTATTAATAAGCATTCCTGAG 
CTCTTATTACCTGTTCATGTAAAATGCACACAAAGTTTTATGAGCTAAGATTAAAGTGGAATTTTCCGGGCAATCAGGGGGTGTTATGCTATTGA 
GAAGTAAAGGTTGGGCCGCTACCTGTTGCCTGATATATATTACGTATCATATCTTCTTACCTAGCCTTATCATTTCCCTCTTGTTCCCTCTTGCA 
GTGGAAAAATGCAAGCGACGCTGGCGTCAGCTGCGAAATGACTATACCCGCTGGTGCAACGCCGACGCCAACCGGCGACGAAATGGCCAGCGTCG 
TTTGGCTTATCCGCTGGCCGATGAGCTGCGCTTTTTGGACCGCCATCTAAACATAGCTGACGACATGGCTGCCGACGATGACCGCAGCGTATCAT 
CGGACAAGGACAGGGATAACAATAGAGACAGCGAGGGCGTGGATCATCATGCTCAGGCGTCATTAAAAGAACGCGCATCATCGACAAGCAAGTTA 
GTTAAGGAGGTCAAGCTGGCAAGTCAAGTTCGTAAGGAAAAATCGTCCCAAGACAAGAGAGAAAATTGGGAAAATCCTGGAGATAAACAGCGATC 
TAGGAAGAAAAGTGCTGAAGAGAAGCTGAATGATTTGGAGGAATCGGACGAACCAGAGAAGGTTCCTGAACTCGATAGTTTTCTGCAGTCTGATA 
ATGAAGACGACGAGTGCATGGACGAGGAGCACCTAGAAGATCTGGAAGGTTTCGATTTTGACTTGGAACAATCGAATCAGGAAAAGGAGTTAACG 
C C GG AG AAGT C T C T AGAG AAG AAT AAC AC CGAC T C AAC T GT T AGCC AGC C GG AGT T CT T T GT C AAGC AAAC C AGAG ATC C T C GT T T GC T AAT T AT 
TGGCAGAAAGAATTCGACGTCTTCGTTTGCTGAATCCAAGGCTTCAAAATCTATTTCCGGGGATCCTCTGGAAACGGCTATGGAAGATAGTGGCG 
ATGAATATGGTAGGGATGGTGAAAAACGCGTCACAGGCAGCGACACTACTTCTCCAACCCGACGTCCACGTCGCGCAGCCCGTGCATCAATAAGT 
TTCGCTGGCATAAGGGATTTACCGATCCAGAAACCAGGCCAAGCACAACGAATGACGCGATTGCAGCGCAGGAAATCTATGAGTTTAGCGGCTGT 
ACACAGTGTAAGGAGCTCCACATCGCCTGTGAAAATGACTCCAGTCCCCAGGTCTCAGCCGATCAACAAGCCGCCATCCGGGCCAGTCCAGATCA 
CGAAAATCAGTCATCGGGATGACATCTTCCCGAGACCTGCGGTGCCATCTGGAGTTGTGAATATAAATCAAAACCAACTCAAGACTCAACAACAG 
CAGAAGACGCTGATCTTTCCGACGACCGATGAAGCGATCGCTAACCCGCAGGGATCTGGTCCACCACAAAGGAGGTCCGTGGGTAGGCCGCCCAA 
GAAGCTACCTATGGTGCAGCGAATAGTTGCGCCGGAAACGCAGACCAAGCCACAGTCGCCCATAAACACCAAAGTCCTGACCATTGGCAAGCCCA 
CCAGCTCCGTCGCGAGTGGTAACATAAATAATATTGCTATCAAAAGCAACGTCGTTACTAGCATAAGCAGCAGCACCGC TACT ACT AAT ACC ATT 
ACCAAAGGTACCAGTAACAACAGCATCAAAACTACTGAAAACGTTAATGTAATAAGCTACAGTCCGAATTCATCTAGCACCAGCAGCAGCAGCAA 
GGCCACCACTGGTGCCTCTGGTGCGGTGATAACTAATATACCCACAACCACCACAGCTGGAACATCCGTTCCGGTCGGAAAGAGTATAACCCAAT 
TGAAAATGACTGAGCGAGGCACTCAAACCGGAGTCCAGAATCCTTTCTCCGATAATTACTTTCTCGAAATGATCAAGCCCCAGATGCAGGAAATG 
AACCCCCGTCAGAAGATGCACTTTAAGAAGAAAGTATTCCAAGCGCTGATGGAAACCTTTGATGATGCTACCGATTTCCCAACGTCTAAAGAACT 
GCAGCATTTTAATATAAACACTCCTTCAGGATTTGAGCACATTACAGATCCGGAACTTCGATTGGTCAGGGAGCTAGTCAGCATGGTGAGCGCGG 
CTAAGGTTACCCTAATTAGACCGCCAGGAGAAGCTACAGCTATAGCTACAGCTTCCCGCAGTGGAATTGCTCCTGAGGTACAGCGAGGACCCCGA 
ACAAATATGACACGCCAAGTCATTCAGAGGGTATACAAGCCAGGCACTGGACAAGAGATCGCCCCCACCAGCCCGGCGGGTGGACTAGATAAGAG 
GTTGTTCAGGTTAGCGGCTATGGGTGGTAAACCGAATGGCAACTTGAGTGCCACTCAGGAAATTTTGCGCAAGGATAGTGTTGACAGTAACCATA 
GTGTGGTCAAGGTGGCCAACCACGCCCCAACAGGCAGTCCCAAAGGAGCTGCTGTAGTGGCTGCTATAAGGCCGCAGGGTTCATTGATCAACAGT 
TTCTTTGGACAAGGCAATGGACCGACGACGGCCAAGGGAGCTGCCTCCGAGAATATCCGCGCCATGTCCAGGAGGTATTCCGTGTGCGGTAGCAG 
CAATCCACCCAATGCCCAAAACGCTAGCCAAGGTTCCGCCAATGGCAGCACCATAAACTCCAATGCTAGCGCTATGGAAGCGTCTATGCTTAAGC 
GGCGATTGATAGCAGCTGGGCATGGAATGGTGCCACCTACCCAACGCCCTCGGTATTCTGCCGTAGGTGCTAGTCAAATGACCACTGCATCGCAA 
GGCAGCAGTTTACTAGTGAGGAAGTCTGTGGGCTGCGTTCCTGCTTACCAAAAGCAAATCTCACCCACTGGCGGTGCCTCCCTGCTTCAGAAAAC 
TCCGCAGATTGCCAGTGTGCAAGGATCTGCCTTCAATGACTTTGCCCAGCCCAAGCCCGCTGCAACGGCCTCTGTAAACGGCGAAGAAAGTTCGC 
CTACCTCGGCGCTGAAGCGCAGCTTGGTGGTGGCTAATACCAAAACCTCCTTTCAGGACCTTCTTCAGCAGGCCAGTCAGACGGTGCAGAGAAAT 
AAAATTGAATCCTCGGAGACAGCAGCCACTATTGCTGCGGATGATTTTTCGCTGGACAACCTTAAGCGAGAGCCAGTGGATCCCGCGGAGGATCA 
C AAT GAC ATAC TC GGT GTT T AAGAC C AGGC C AAAG AAATT T CC G AGT AT T G AT T AAAG AGT AAAGAAT T AAG A C AG AT T C AT T AAAT C GT AAAT A 
ACACGTTGAAAAATTGTTAAGCTATCTAATGCACTTATAATGTTGCGTTAAACATTTTTTATAGTTCCCTATAATTTTAAGAATTTCGAATTTTT 
TTTTTTTGGAAAAAGTTTTTAACTTTTTCTATTTTGTTCATAATTTTTTAGACGATAGAAAACCGACAGTTTTGTTCAAAATTTGCACAAGAAAA 
ATGATTAATGAGAAATCGAATCTTAACTTTATTTCACATTTTCTAGTTATATATATCGAATAAAAATTGCCCTTATAAGTTAAACAAAAAGGACA 
ACTGAAAAAATATTTAAATATGTTTTTATGGAAATATATTTCTATACATAGTTAGGTCAAAGTCACTAGAATATTATTCTAATGTCTTTGGTTAG 
GTAGTGAAATATATTGCAGTACATTTATGCGAAACAACCATCACGAAAATTTGAAAATTAGTTTGGTTGGTGAACCAAACTGGAGGGTGGGTGCT 
CCAGACGTTGTTCTAGCAGAGACATAAGTAAAATTATGAAAATTAATTAATTCTAATAATTATGACAATATCTTTGGTTCGGGTACCGATTCTGG 
AGTATTTCCAAAGCTTCAGAGTATTTAGATACTGATCAATAATATTAAATTGTTTTTATCAATACATCAGTGAGTCGGTGGAATTAACAGTTCCA 
AAGCACACAAACTGATGCAATTACCACTATCAGGATATATACGTGGACCTAACTACGAAGTAACACATCATCGATAACAGAAGATTAGATATGAA 
AACGCTGACGCAGCTAAGAAAAGCCTAAACATAACAATTCAGTAATAACATAAAACCTTCTTGTATTTATAAATAGAATAAAAAAATAAATTTGC 
AAATTTGATCAAACGTAATTCCAACGTTATTGAAATACTTCCCGTTTTGGGTGAACCACCGGTAAAGATTTACAAGGTACCAGTACCATTCAATA 
TGCTGTTCTTTTTGCACATTTGGCTGGGAATCTAATCTGGAGAGGTCTACCGCGAATGGGCCGTAAATAAGGGAGTCAGTGCCCAGATAAGGCAG 
CTATATAACCTCTCTTCAAGTGGCCGCGAGGTTCAGTCGAATTCGAGACATCACCATGAAACTCTTTGCCATTTGTGCCCTCCTCATCCTTCCGC 
TGGTAAGCGGTGGAATCGTGCCACGCACACCTACTAGGCACTTTGTGGCCTACGAGATCCATTCTACCCATAATCTCCCCCAGATGGAAGGTAAG 
AGACAGTATTAGTTATACCAGATGGTACAATTCGCTGGATGGATTACGCTTAGC 
(SEQ ID WO: 1393) 

Exon: 1001.. 1309 
Exon: 4087.. 7414 
Start ATG: 1162 

Transcript No. : CT12739 

TTTTAGTTTTCGCTGCAGTCGGTCGCCGTTTTTAGTCTCAGCTGCTGCGCGCGTGTGTTGTGATTGTGTGAATTGCGTTTGCGTTTGTAGCATAA 
CACAAAACCGATAAATTCCTGGCGAAGCGAGCTCTCTCGATTGGATTCTTGCAAAGTGAACTAGCGATGAGCGCCGCCTCTGAGGTGAACCTGAT 
GTTGCTGCGGTCAGTGGAGAAGCAGACTGCGCTTTATGATCGTACCGACGAAAACTATAGGAAGCGCCTGCCGAGTGAGAACGCCTGGGACATGG 
TTGCCTCCGAAGTTGGAGAGTCGGTGGAAAAATGCAAGCGACGCTGGCGTCAGCTGCGAAATGACTATACCCGCTGGTGCAACGCCGACGCCAAC 
CGGCGACGAAATGGCCAGCGTCGTTTGGCTTATCCGCTGGCCGATGAGCTGCGCTTTTTGGACCGCCATCTAAACATAGCTGACGACATGGCTGC 
CGACGATGACCGCAGCGTATCATCGGACAAGGACAGGGATAACAATAGAGACAGCGAGGGCGTGGATCATCATGCTCAGGCGTCATTAAAAGAAC 
GCGCATCATCGACAAGCAAGTTAGTTAAGGAGGTCAAGCTGGCAAGTCAAGTTCGTAAGGAAAAATCGTCCCAAGACAAGAGAGAAAATTGGGAA 
AAT CC T GG AG AT AAAC AGCG AT C T AGG AAG AAAAGT GCT GAAGAG AAGCT G AAT G AT T T GG AGG AAT C GG AC G AACC AG AG AAG GT TC C T G AA C T 
CGATAGTTTTCTGCAGTCTGATAATGAAGACGACGAGTGCATGGACGAGGAGCACCTAGAAGATCTGGAAGGTTTCGATTTTGACTTGGAACAAT 
CGAATCAGGAAAAGGAGTTAACGCCGGAGAAGTCTCTAGAGAAGAATAACACCGACTCAACTGTTAGCCAGCCGGAGTTCTTTGTCAAGCAAACC 
AGAGATCCTCGTTTGCTAATTATTGGCAGAAAGAATTCGACGTCTTCGTTTGCTGAATCCAAGGCTTCAAAATCTATTTCCGGGGATCCTCTGGA 
AACGGCTATGGAAGATAGTGGCGATGAATATGGTAGGGATGGTGAAAAACGCGTCACAGGCAGCGACACTACTTCTCCAACCCGACGTCCACGTC 
GCGCAGCCCGTGCATCAATAAGTTTCGCTGGCATAAGGGATTTACCGATCCAGAAACCAGGCCAAGCACAACGAATGACGCGATTGCAGCGCAGG 
AAATCTATGAGTTTAGCGGCTGTACACAGTGTAAGGAGCTCCACATCGCCTGTGAAAATGACTCCAGTCCCCAGGTCTCAGCCGATCAACAAGCC 
GCCATCCGGGCCAGTCCAGATCACGAAAATCAGTCATCGGGATGACATCTTCCCGAGACCTGCGGTGCCATCTGGAGTTGTGAATATAAATCAAA 
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ACCAACTCAAGACTCAACAACAGCAGAAGACGCTGATCTTTCCGACGACCGATGAAGCGATCGCTAACCCGCAGGGATCTGGTCCACCACAAAGG 
AGGTCCGTGGGTAGGCCGCCCAAGAAGCTACCTATGGTGCAGCGAATAGTTGCGCCGGAAACGCAGACCAAGCCACAGTCGCCCATAAACACCAA 
AGTCCTGACCATTGGCAAGCCCACCAGCTCCGTCGCGAGTGGTAACATAAATAATATTGCTATCAAAAGCAACGTCGTTACTAGCATAAGCAGCA 
GCACCGCTACTACTAATACCATTACCAAAGGTACCAGTAACMCAGCATCAAAACTACTGAAAACGTTAATGTAATAAGCTACAGTCCGAATTCA 
TCTAGCACCAGCAGCAGCAGCAAGGCCACCACTGGTGCCTCTGGTGCGGTGATAACTAATATACCCACAACCACCACAGCTGGAACATCCGTTCC 
GGTCGGAAAGAGTATAACCCAATTGAAAATGACTGAGCGAGGCACTCAAACCGGAGTCCAGAATCCTTTCTCCGATAATTACTTTCTCGAAATGA 
TCAAGCCCCAGATGCAGGAAATGAACCCCCGTCAGAAGATGCACTTTAAGAAGAAAGTATTCCAAGCGCTGATGGAAACCTTTGATGATGCTACC 
GATTTCCCAACGTCTAAAGAACTGCAGCATTTTAATATAAACACTCCTTCAGGATTTGAGCACATTACAGATCCGGAACTTCGATTGGTCAGGGA 
GCTAGTCAGCATGGTGAGCGCGGCTAAGGTTACCCTAATTAGACCGCCAGGAGAAGCTACAGCTATAGCTACAGCTTCCCGCAGTGGAATTGCTC 
CTGAGGTACAGCGAGGACCCCGAACAAATATGACACGCCAAGTCATTCAGAGGGTATACAAGCCAGGCACTGGACAAGAGATCGCCCCCACCAGC 
CCGGCGGGTGGACTAGATAAGAGGTTGTTCAGGTTAGCGGCTATGGGTGGTAAACCGAATGGCAACTTGAGTGCCACTCAGGAAATTTTGCGCAA 
GGATAGTGTTGACAGTAACCATAGTGTGGTCAAGGTGGCCAACCACGCCCCAACAGGCAGTCCCAAAGGAGCTGCTGTAGTGGCTGCTATAAGGC 
CGCAGGGTTCATTGATCAACAGTTTCTTTGGACAAGGCAATGGACCGACGACGGCCAAGGGAGCTGCCTCCGAGAATATCCGCGCCATGTCCAGG 
AGGTATTCCGTGTGCGGTAGCAGCAATCCACCCAATGCCCAAAACGCTAGCCAAGGTTCCGCCAATGGCAGCACCATAAACTCCAATGCTAGCGC 
TATGGAAGCGTCTATGCTTAAGCGGCGATTGATAGCAGCTGGGCATGGAATGGTGCCACCTACCCAACGCCCTCGGTATTCTGCCGTAGGTGCTA 
GTCAAATGACCACTGCATCGCAAGGCAGCAGTTTACTAGTGAGGAAGTCTGTGGGCTGCGTTCCTGCTTACCAAAAGCAAATCTCACCCACTGGC 
GGTGCCTCCCTGCTTCAGAAAACTCCGCAGATTGCCAGTGTGCAAGGATCTGCCTTCAATGACTTTGCCCAGCCCAAGCCCGCTGCAACGGCCTC 
TGTAAACGGCGAAGAAAGTTCGCCTACCTCGGCGCTGAAGCGCAGCTTGGTGGTGGCTAATACCAAAACCTCCTTTCAGGACCTTCTTCAGCAGG 
CCAGTCAGACGGTGCAGAGAAATAAAATTGAATCCTCGGAGACAGCAGCCACTATTGCTGCGGATGATTTTTCGCTGGACAACCTTAAGCGAGAG 
CCAGTGGATCCCGCGGAGGATCACAATGACATACTCGGTGTTTAAGACCAGGCCAAAGAAATTTCCGAGTATTGATTAAAGAGTAAAGAATTAAG 
ACAGATTCATTAAATCGTAAATAACACGTTGAAAAATTGTTAAGCTATCTAATGCACTTATAATGTTGCGTTAAACATTTTTTATAGTTCCCTAT 
AATTTTAAGAATTTCGAATTTTTTTTTTTTGGAAAAAGTTTTTAACTTTTTCTATTTTGTTCATAATTTTTTAGACGATAGAAAACCGACAGTTT 
TGTTCAAAATTTGCACAAGAAAAATGATTAATGAGAAATCGAATCTTAACTTTATTTCACATTTTCTAGTTATATATATCGAATAAAAATTGCCC 
TTATAAGTTAAAC AAAAAGGACAACT G 
(SEQ ID HO: 1394) 

Start ATG: 162 

MSAASEVNLMLLRSVEKQTALYDRTDENYRKRLPSEWAWDMVASEVGESVEKCKRRWRQLRNDYTRWCNADANRRRNGQRRLAYPLADELRFLDR 
HLNIADDMAADDDRSVSSDKDRDNNRDSEGVDHHAQASLKERASSTSKLVKEVKLASQVRKEKSSQDKRENWENPGDKQRSRKKSAEEKLNDLEE 
SDEPEKVPELDSFLQSDNEDDECMDEEHLEDLEGFDFDLEQSNQEKELTPEKSLEKNNTDSTVSQPEFFVKQTRDPRLLIIGRKNSTSSFAESKA 
SKSISGDPLETAMEDSGDEYGRDGEKRVTGSDTTSPTRRPRRAARASISFAGIRDLPIQKPGQAQRMTRLQRRKSMSLAAVHSVRSSTSPVKMTP 
VPRSQPINKPPSGPVQITKISHRDDIFPRPAVPSGWHINQNQLKTQQQQKTLIFPTTDEAIANPQGSGPPQRRSVGRPPKKLPMVQRIVAPETQ 
TKPQSPINTKVLTIGKPTSSVASGNINNIAIKSNVVTSISSSTATTNTITKGTSNNSIKTTENVNVISYSPNSSSTSSSSKATTGASGAVITNIP 
TTTTAGTSVPVGKSITQLKMTERGTQTGVQNPFSDNYFLEMIKPQMQEMNPRQKMHFKKKVFQALMETFDDATDFPTSKELQHFNINTPSGFEHI 
TDPELRLVRELVSMVSAAKVTLIRPPGEATAIATASRSGIAPEVQRGPRTNMTRQVIQRVYKPGTGQEIAPTSPAGGLDKRLFRLAAMGGKPNGN 
LSATQEILRKDSVDSNHSVVKVANHAPTGSPKGAAVVAAIRPQGSLINSFFGQGNGPTTAKGAASENIRAMSRRYSVCGSSNPPNAQNASQGSAN 
GSTINSNASAMEASMLKRRLIAAGHGMVPPTQRPRYSAVGASQMTTASQGSSLLVRKSVGCVPAYQKQISPTGGASLLQKTPQIASVQGSAFNDF 
AQPKPAATASVNGEESSPTSALKRSLVVANTKTSFQDLLQQASQTVQRNKIESSETAATIAADDFSLDNLKREPVDPAEDHNDILGV* 
{ SEQ ID NO: 1395) 

Name : stonewall 

Classification: transcription_f actor 
Gene Symbol: stwl 
FlyBase ID: FBgn0003459 



Celera Sequence No. : 1420000133843 67 

TAAGGTTAATTTGTCATCCTTCCTTTAAAGCGATTATTAATGAAGGTTGTGTTTCTTAACCAGGTGGTGCCGCCCAGAAGACCAACCAGGTGCGC 
AATGGCGACGAAATCCTTAGCATCAATGGTGCCTCCACGTCGCGGATGACACGTGTGGATGCCTGGAACTATATGAAGCAACTGCCGCTGGGACC 
GGTCAAGATTTGTTTCGCCTAGATGAAATGGTCTAAATGAAATACCAACGATGGGGAAGAATGCGCCACAATTTTCATAGATTTAAGCCAATTAG 
CTTGGTTACCCCTTAACCGGTCTAACCTTGGGGGTCTTGCCAATTCGATGGATCAATGCTGCGAGCGATTTGTTTTGCATTTAAACGAACCTTTA 
ATTGTATGCCTAATTTGATAGATATTTAATTTAAAGTGCAATTGTTTAAACCCTTACAACGTTTTCATGCCCTGCTTACCTACCTATAACTGATC 
GATGATTGTACATATTACTAATCGAGACTAATTTACGTATTTAAACCTTTAATCAAATTGAAGCCAGCTATCGGGCAGCCCTATTTCCTTATACA 
CAACTTATTATTCAAAAAAACATAACGAAAAAAAAAACAATTATATTTTATGCCATAAATTATACGGGAACGTTATACATGTATATTATTTTAAT 
AATTATTATTTTATTTAAAGAACAACAAAAACAATGTTTACGTATACAAAAGTATTTGCTAAAAATAAACATACAACATATATGAAATTAACCTG 
AATGTAATTTTTTTAAATTGTCTGGATTGTTTTAATTTCCTTCACAAAAAAATTTTAGTTTTCACAAAAAAGTCTAGTTTTAAATTTAGAAAAAT 
TTATTTATTTAGTTTCAACTTCACATTTTTCTGTTTATACATTATGGGAACATTTCTAAAAACAACTTAAAATGTTATTTGTCCTATGTACTGTA 
CATAAATTTGATATCAGTGTATTAAAGCTGTTCAATGAGCAGTTCGCAAGCTACAAATCTTTGGATGCCTTGGACACCAGCTTTGGCAGCTGCAG 
CATAAGTCCCATGTGCTGTAGTGGATTCGCCGAGGAGAGACTCGAGGTAGTGGATAGCGTTCCATTGACATTCTCCCTGGGATGCGGCTGCGCTG 
GGTGGGTTGATGAGGCTGACTGCTCGGAATGGAATATCTCGCCAAGCAGGTCGTACACACGTCGCTTGAAGTACAGCTTCTGGTGGAAGTTCATG 
TGCTTGATTTCGGGACGCAGTCCCTGCAGGAATGCGTCATCGAAATCCATGATTGGCAATGCTTCAATACAGCGGGTTGATGTCTGAAATAAGAA 
AATAACAAATCATAAATAAACGCTGTACAAATTAAATTTTGCACTTAATAGATTGTTACTTTGGGGAATTCAAGCAATGGCACTCTATATAAAGT 
AATTTCTCCCACCTTGGCCTCCTTCTCACTGCTGTCGCTCTCTGTGCGCATTTCTGCCGGGACCGTGTCCTCGAAATCCATGATTGATGATGGTT 
CATTATTAAATTGTGTTGCCTGAAGAATTAAAGGAAACTGCATCATTAAAATTATAAATGCAAATTTCAGCTTTGAAATAAAAACGGGCACACAG 
TATAAAACTAAGCTAATTTGCACAGCTTTAACTTTTGACCAAAAATATAGCATACATCATTTTTTTACTTTAAAGTATAGTTCATAACGCTAAAC 
AAGGTTGTAATAATTTAAGGCCTCAGCAAAATGCATCTGAGTATTGCTCCCACCTCAACGTCCGTCGCCGATGCAGGGTCGGTGCTGTGACGGCT 
CTGCGCATGCTCCGAATTCAGAAAGCTCGGGGAGTGCACCATCTCGAGGATTGCCTCTACGGGAGTTTCGGGATCCCCTTCGTCGTGCTCCTCCT 
GTCCCTTGGGTCTTGACCTACGACCTCTGAGCGGAACGGGCACTCCGGGAGTGATGTGCTTTTGCAGGAACTTCAGCTCGCTGTTAAGATAGTAC 
GTATTCGCACCGTGATGCAACTTTATTGAGCGGGCATAGCTGGAGCGAATGTTGCGCCAACGTTCCTTGCAGCTTTTAACTGAAATGATCAATAG 
CAGTCAATAAGATTGTAAGATAATATTATATAATATTTATATAATATAAATTGTAAGATTTTATAAGAGATTATATTTTATTTTATATTATGTAT 
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CGACATAGTAAAATCGTGCTTATAAAAATATAAACATCTCCTAATGAAATGTAAATTTAATAATTAAAGTAZ^TTTAGTTAATATGCAATATGTC 
GAAATATAGATAAAAACCCACCCGAATTTCGCATCTCATTGGATATTTCCTTCCACGCATTTTTCACAGTCGATTTCCGCAAGTAGTTGTCATCG 
TGACGATCGTAAAGACAAGGATGTAGTTTCACCAAATGACATATTTTGGCATTGATGGCGTAAACGTTCGGTGGTGGAGCTATGGACCGACGGCG 
T GT AT T CAT T T CAT AT AT AAAAACT AAAT AT GC AAATT T AAT T AAT T AAT A T AA T T AAT AGA GT T AAAGT AGAAGAC CC GAAAT C G AAT CT G AAA 
CATGTGTCGCCTGACCGTGCAACAGCTGTTTGCCGCTACTTTGCTTCTCTGCTCCGCTTTCAACAACAGTTATTTTCTCTGAAACCAGATCTTTC 
GTTTAAACTTACGTGAATTAGGTTTAAAAATGCTCCAATTAATCGATGTCAGTGCTTTTATCACGAGTTTTGGCTTATTAAGTTCGGGCTCAAGC 
CGCTCGTTGCTTCCGTGCACGCGGCAGTTGTTGGCGTGGACGCCGGAGTTGCTGCTATTACAAGTGTTTAGCAACTTTGAACACTGCGATTTCGT 
GCTGCTGCCAACAAGTGATGCAGTTGTAAATGAATGTTGCAATTTCAAATAGCAATTGTTCTTGATTTTAAAGCACATGAGCACTAGACTGATTC 
GAATTGGACATTTGAATCTTTAACGGGCGGAGAATATAAAACACCGGCAAAATAAATAAAGAAAGGAGCCACAACAAACGGAAAGCCAGAGGCGT 
AGTCATTCTCTTCGCAAATCAGCGGGGGTTACCATTCGAGCAGCATACAACCCTCTCTGAGAGAGAATTGACATGCCACTGGATCTCACACATAC 
AATGAAAGGCACACTCACTCAAAGTAGACAGCACTGCCAGACTTAATGCAATAGTATAGCAGTTGGTGTTGGCAGTGCCATTTGATCGCTGGAGG 
TGCTGCCAACGCATTTACCAATTAATTAATGACCACCCCTAAATGTTGTAAATTATAAAAATTATTCAAGAATATTACGATTTTGGTTAAATTAA 
TGTACAAAATAAAATATTTCATATTTGTATTGGCTGTCAAATGAAGTTATTTTATTTTAAGATAAATGCTAACAAATCTTAGAGAAACTGGAAAA 
CAAGGTTGAGCGGTTTTGAGAATTATGAACAACGCGTAAAAAACATTATTTATAAACGTAAAAAAATATTATAGTCTAGCCCGACAGGTAAAAAA 
TAAATTTATCCGGGGCCACCTTGGTCAACCCTCGTCAACGACCGTTCGGCCTGCCAACTCGCTTTTCCCTGCAACGCAAGGCGCCGCTCTCTGCT 
CACACTCTCACTTGGCTGTGTGTGTGTGCTCAATACTCGCTTTTAGTTTTCGCTGCAGTCGGTCGCCGTTTTTAGTCTCAGCTGCTGCGCGCGTG 
TGTTGTGATTGTGTGAATTGCGTTTGCGTTTGTAGCATAACACAAAACCGATAAATTCCTGGCGAAGCGAGCTCTCTCGATTGGATTCTTGCAAA 
GTGAACTAGCGATGAGCGCCGCCTCTGAGGTGAACCTGATGTTGCTGCGGTCAGTGGAGAAGCAGACTGCGCTTTATGATCGTACCGACGAAAAC 
T AT AGG AAGCGCC T GC C GAGT G AG AACGC CTGGGAC ATG GT TGC CT C CGAAGT 
(SEQ ID NO: 1396) 

Exon: 2948. .2673 
Exon: 2493.. 2302 
Exon: 2074.. 1764 
Exon: 1539.. 1438 
Exon: 1318.. 1001 

Start ATG: 2479 (Reverse strand: CAT) 
Transcript No. : CT13005 

TTCGAATCAGTCTAGTGCTCATGTGCTTTAAAATCAAGAACAATTGCTATTTGAAATTGCAACATTCATTTACAACTGCATCACTTGTTGGCAGC 
AGCACGAAATCGCAGTGTTCAAAGTTGCTAAACACTTGTAATAGCAGCAACTCCGGCGTCCACGCCAACAACTGCCGCGTGCACGGAAGCAACGA 
GCGGCTTGAGCCCGAACTTAATAAGCCAAAACTCGTGATAAAAGCACTGACATCGATTAATTGGAGCATTTTTAAACCTAATTCACTTTTTATAT 
ATGAAATGAATACACGCCGTCGGTCCATAGCTCCACCACCGAACGTTTACGCCATCAATGCCAAAATATGTCATTTGGTGAAACTACATCCTTGT 
CTTTACGATCGTCACGATGACAACTACTTGCGGAAATCGACTGTGAAAAATGCGTGGAAGGAAATATCCAATGAGATGCGAAATTCGGTTAAAAG 
CTGCAAGGAACGTTGGCGCAACATTCGCTCCAGCTATGCCCGCTCAATAAAGTTGCATCACGGTGCGAATACGTACTATCTTAACAGCGAGCTGA 
AGT T C C T GC AAAAGC AC AT C AC T C CC GG AGT GCCCGTTCCGCT C AG AGGT CGT AGGT C AAG AC CC AAGGG AC AG GAG GAGCACG AC GAAGGGGA T 
CCCGAAACTCCCGTAGAGGCAATCCTCGAGATGGTGCACTCCCCGAGCTTTCTGAATTCGGAGCATGCGCAGAGCCGTCACAGCACCGACCCTGC 
ATCGGCGACGGACGTTGAGGCAACACAATTTAATAATGAACCATCATCAATCATGGATTTCGAGGACACGGTCCCGGCAGAAATGCGCACAGAGA 
GCGACAGCAGTGAGAAGGAGGCCAAGACATCAACCCGCTGTATTGAAGCATTGCCAATCATGGATTTCGATGACGCATTCCTGCAGGGACTGCGT 
CCCGAAATCAAGCACATGAACTTCCACCAGAAGCTGTACTTCAAGCGACGTGTGTACGACCTGCTTGGCGAGATATTCCATTCCGAGCAGTCAGC 
CTCATCAACCCACCCAGCGCAGCCGCATCCCAGGGAGAATGTCAATGGAACGCTATCCACTACCTCGAGTCTCTCCTCGGCGAATCCACTACAGC 
ACATGGGACTTATGCTGCAGCTGCCAAAGCTGGTGTCCAAGGCATCCAAAGATTTGTAG 
(SEQ ID NO: 1397) 

Start ATG: 291 (Reverse strand: CAT) 

MNTRRRSIAPPPNVYAINAKICHLVKLHPCLYDRHDDNYLRKSTVKNAWKEISNEMRNSVKSCKERWRNIRSSYARSIKLHHGANTYYLKSELKF 

LQKHITPGVPVPLRGRRSRPKGQEEHDEGDPETPVEAILEMVHSPSFLNSEHAQSRHSTDPASATDVEATQFNNEPSSIMDFEDTVPAEMRTESD 

SSEKEAKTSTRCIEALPIMDFDDAFLQGLRPEIKHMNFHQKLYFKRRVYDLLGEIFHSEQSASSTHPAQPHPREHVNGTLSTTSSLSSANPLQHM 

GLMLQLPKLVSKASKDL* 

(SEQ ID NO: 1398) 

Name: stwl-like 

Classification: transcription_f actor 



Celera Sequence No. : 142000013384637 

CTTGCCTACGAAGTGGGTCTGCTGGTCCAGAAACTGGCCGGCGGAGTTGGCCCCCCATTGACGAAGCCCAACAGCAGTGCCCACCTGTAGAGCGG 
AATGACGAACTAACCTCTCACGGGAATCGACAATTGACACAGATCCTGGATGGAGCAATAAATCTCTTTACTCTAAGGTGCTCAATAACTTAAAA 
AAAAAACAACTTAACTCTCAAACGCATTTATTCTGCAAATGTTCACTGTCTTTTGTGTTCTAAAGATGACATATTTTTTGCCATTATTTATAAAG 
TTCTTTTCAAGTTTATTTATACATATGTACATTGTGAAAGTTGCAATTTTATGTCCTTCGCTATTTTGTAATTGAAGAATTTAATGAAAATGCAA 
CCGCTACGTTGATTTTTTTGTTGAGCTGCTAGCGGTAACTATTGAAATTATAAAAGAATTGATAGTAGAGATTAATGAACCTAATCAAGTTTGTA 
TTGCTATTTTCTATAAAAGTATTTTTTAAACAATCTAAAGATCCTTTTACTTTAGGTCGAATATAAATCAGTTTTCACTTCATTTGGAATTTACT 
ATGCTGTACGCACAATTTATGGCAAATTTAAATCACTGTCCCGGCAAATAAATTAAAATTTTGCCAACTTTTATTTTTAAACTTAACATTTTAAT 
AAATGTTAATGGAGCTGGCAGAATTGAAACGCGCCATCTGGCGGTCAGTAGCGGAGACATGTAGGACAGTAATTGATAAGGACTACTGTTAACTG 
TTACAATGTAAGCAAATACTTCAAAACTGTTATACTTGTGTTAATTAAAGCATACGTTCAAAAGTCGAGTTGAGTTATTGAATGTTAACTGTTCG 
TAGTTCGAAAGTTATCTAATTTAGCTAACTAATTTAGCAATTGTTTATAAAAAAGTTTTACTTGAATATTATTAATACTCGGTATATCTATGCAA 
ATGGTATTAAAAGTATATTTTCGCATGCCGGATGGTTTACTCGTCTAAAACTCAGCCGGTCACACCGTCTGATTGCCGTGCTGTCACTTTGGCTG 
ATGCAATTGCAGTTTTTTTAGTTAAAACAAAATTAGTTTCCAGTCAACTAGTATTCAAACAGCCCAGACACAATGGACGCGGAAGACGTAAGTTC 
CCGGAACACCGGCTAGTGCGTTTCATACGCGGAAAACTTGCGTGGAAAATTCGCATATTATCGTACATGTGCGCCCTTGTTTGCGTATGTGTATG 
TGTGTATTTACACATTTGTGTATGAGAACTGGGCGAACATTGAGGTTATGTTTGCTTGATCCGGATTTAAACCATATTGAAATTCCAATGATTTT 
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CGCAGGGTTTTGATCCCACTTTGCTGAAAAAGAAGAAGAAGAAGAAGACCACGTTCGATCTGGACGCGGCATTGGGTCTGGAGGATGACACCAAG 
AAGGAGGATCCCCAGGATGAGGCATCGGCGGAAGGAGGTGCCGCCGCCGAGGAGGATAATCTAGATCTGGAGAGTTTCGGCAAGAAGAAGAAAAA 
GAAGAAGAAGCCCTTCAACATGGACGAAATCGAAGCGGCCATACCCAGCTTTGGCGGCGATGATGTGGCCGCCAGCGAGGAGCCCGAGGAGGAGG 
AGATCAATCTGGACATGGACTTCTCGATGGCCAAAAAGAAGAAGAAGAGCAAGAAGAAGGAACTGGACGAGCTGTTTGCGGACCAAGCGGACGAC 
GACAAGAGCGAAGACAAAGAGAACGGTACGGTGACGAATTGCAGAGCAGCCAGGGAAGCTTAGGGTTATAGAATCATCCCAACCAAGATTGGCAG 
CCATATAAGCTTGTTGTCACTTGACAAAGGCAAAGCTATACGATTCTTAAAGTGTATAACTTGGCTTTCGCCTTTTCCGAAGCATTAAAACTCAG 
TGTATGAAGTGAACTAGAAATGCTTAGCCAGTGAGTTAGCATGTAGCTGTTCCAACTTTAAATAGACTCCAGATTTCAAAGCTCTCACTTAGCAA 
ATAAGCTTATACCATACCCTTTTAAATAACGTTTTCATCCGTTTACAGACGAGGACAACAGCTCAACCTGGTTTGGCTCCGACCGCGATTACACA 
TACGACGAGTTGCTGAAGCGGGTCTTCGAAATCATTCTCGACAAGAATCCGGACATGGCCGCCGGCCGAAAGCCAAAGTTCGTGATGCGACCGCC 
GCAGGTGCTGCGCGTGGGAACCAAGAAGACCTCCTTTGCCAACTTCATGGACATTGCGAAAACGCTGCATCGCCTGCCCAAGCATCTGCTCGATT 
TCCTGCTGGCCGAGTTGGGTACCAGTGGCTCCATGGACGGCAACCAGCAGCTGATCATCAAGGGCCGTTTCCAGCCCAAACAGATCGAGAATGTG 
CTGCGTCGCTACATCAAGGAGTACGTCACCTGTCACACCTGCCGCTCCCCGGAAACGATATTGCAGAAGGACACGCGTCTCTTCTTCCTGCAGTG 
CGAATCCTGTGGCTCCCGCTGCTCGGTGGCCAGCATTAAGTCAGGTTTCCAGGCTGTCACCGGCAAACGTGCCGCCATACGAGCAAAAACAACCT 
AAATTCTCCTTATTTZVAAACTTTTTGAATTTTTATTACCAGTAAAATTTCAATACAAAAAACCAACGATACTATTTATTTGTTGTATTCTGCATT 
GTTTTATGCCTTCGTTGCCAGTCGGTTAGGTAGTCTACGAAGCCAATTTCCATATTCCGCGATCGGCCGCTGTCTATGAAGAACTGAAAAGCGTC 
CAGACCCATCTTGCCGCGGAACCTATTCGGACGACCGCGCAGGAATAGGCTGTGGCCCCTCGAGATGCGCGTGGGATCAATGTACTCCATGGAGA 
CCTGCAGCAGCCGCATCATGCTCACGGCAATGCTCTTGTAGGCCTCGGACGAGGCCAGCGAACTGGCTTCACTGTACAGGCTGCTGATCACGCGT 
CCATTGAAGGGTTTTTTGGTGATGCTGGCCTCCAGGGAGCCGGTGCCATCATCGATCATGAACTTCATGAACGACTCGTTGTGCGTTCCATTGCC 
AGTTACAAAGGCGTAGATGATGCAGGTGCAGAAGTGAATCGGATGCACGTCGTCTGTGATGAACACATTCTCGTTCTCCGGATCCTGAGTTGCGC 
GCTCAATCTGGGATATGGTCAGCGGCACGTTGTCGTGGACCTCCGGACCACATTTGCCCGTGGACTTTTCACTCTGTTGATTCTTGCGTATCACG 
AAGTTATCCAGCTGGCTTTCTATGTCCTCGAAACTCTGATTAAAATCCATTCTATTTGCTTAGTCTGCGATTTCAAAGGGGATTTCTTTATTGCA 
GTGCATTTTGCATTAGCGCCAAAAAAAAAAACAGTTGTGAGCATGGGCGTAGACTTCGTATTTTCTTACAAATAATATTAATTAAAATTAATTTT 
GTGAGCAATTTTCACACAATTGTATTATAAGTTAAAACCAGGGTCACATTAATTTGCAGAACCGCGCAATATTTTCTTTTTAACCCCCTTACAAA 
TTTTCAGTTGTTTTGACTACGCCCCTGCTAATTTTTACTTATTAAATTCAAAGTCTAAAAACATTGTCACCAGATAATACGAGTATACACTATAT 
GGACAAACGTAAAATCGT 
(SEQ ID NO: 1399) 

Exon: 1001. .1132 
Exon: 1336. .1735 
Exon: 2044. .2628 
Start ATG: 1118 

Transcript No. : CT13682 

CTCAGCCGGTCACACCGTCTGATTGCCGTGCTGTCACTTTGGCTGATGCAATTGCAGTTTTTTTAGTTAAAACAAAATTAGTTTCCAGTCAACTA 
GTATTCAAACAGCCCAGACACAATGGACGCGGAAGACGGTTTTGATCCCACTTTGCTGAAAAAGAAGAAGAAGAAGAAGACCACGTTCGATCTGG 
ACGCGGCATTGGGTCTGGAGGATGACACCAAGAAGGAGGATCCCCAGGATGAGGCATCGGCGGAAGGAGGTGCCGCCGCCGAGGAGGATAATCTA 
GATCTGGAGAGTTTCGGCAAGAAGAAGAAAAAGAAGAAGAAGCCCTTCAACATGGACGAAATCGAAGCGGCCATACCCAGCTTTGGCGGCGATGA 
TGTGGCCGCCAGCGAGGAGCCCGAGGAGGAGGAGATCAATCTGGACATGGACTTCTCGATGGCCAAAAAGAAGAAGAAGAGCAAGAAGAAGGAAC 
TGGACGAGCTGTTTGCGGACCAAGCGGACGACGACAAGAGCGAAGACAAAGAGAACGACGAGGACAACAGCTCAACCTGGTTTGGCTCCGACCGC 
GATTACACATACGACGAGTTGCTGAAGCGGGTCTTCGAAATCATTCTCGACAAGAATCCGGACATGGCCGCCGGCCGAAAGCCAAAGTTCGTGAT 
GCGACCGCCGCAGGTGCTGCGCGTGGGAACCAAGAAGACCTCCTTTGCCAACTTCATGGACATTGCGAAAACGCTGCATCGCCTGCCCAAGCATC 
TGCTCGATTTCCTGCTGGCCGAGTTGGGTACCAGTGGCTCCATGGACGGCAACCAGCAGCTGATCATCAAGGGCCGTTTCCAGCCCAAACAGATC 
GAGAATGTGCTGCGTCGCTACATCAAGGAGTACGTCACCTGTCACACCTGCCGCTCCCCGGAAACGATATTGCAGAAGGACACGCGTCTCTTCTT 
CCTGCAGTGCGAATCCTGTGGCTCCCGCTGCTCGGTGGCCAGCATTAAGTCAGGTTTCCAGGCTGTCACCGGCAAACGTGCCGCCATACGAGCAA 
AAACAACCTAAATTCTCCTTATTTAAAACTTTTTGAATTTTTATTACCAGTAAAATTTCAATACAAA2^ACC 
(SEQ ID NO: 1400) 

Start ATG: 118 

MDAEDGFDPTLLKKKKKKKTTFDLDAALGLEDDTKKEDPQDEASAEGGAAAEEDNLDLESFGKKKKKKKKPFNMDEIEAAIPSFGGDDVAASEEP 

EEEEINLDMDFSMAKKKKKSKKKELDELFADQADDDKSEDKENDEDNSSTWFGSDRDYTYDELLKRVFEIILDKNPDMAAGRKPKFVMRPPQVLR 

VGTKKTSFANFMDIAKTLHRLPKHLLDFLLAELGTSGSMDGNQQLIIKGRFQPKQIENVLRRYIKEYVTCHTCRSPETILQKDTRLFFLQCESCG 

SRCSVASIKSGFQAVTGKRAAIRAKTT* 

(SEQ ID NO: 1401) 

Name: elF2-beta 

Classification: translation_f actor 
Gene Symbol: eIF-2beta 
FlyBase ID: FBgn0004926 



Celera Sequence No. : 142000013384528 

GAATCTAAGTGGCATCAGAAATCTGATCACGACTACATAATGGCGGGATCTGGAGTTGCCAAGCAGATATTTGCCCTCGATTTCGAGATCTTTGG 
ACGAGTGCAAGGTATGCAATGCATTTTTCTGATTCGGCTTGTTAAACAGCGTATTAAATTTTGCAGGTGTGTTCTTCCGCAAAGTAAGTACTGGG 
AGGCCTATATATATATCTGGAAATCAACATTTAACTTATGTACATTTTTGGCTTAGCACACGTCGCATGAGGCTAAAAGATTGGGAGTTAGGGGC 
TGGTGCATGAATACCCGGGATGGGACCGTCAAGGGACAACTGGAGGCTCCTATGATGAATTTAATGGAAATGTAGGTACCGCTGCACTCCACTAA 
TTCAATCATTTAACCATCGCCCACTAATTTGAAGGAAACATTGGCTGGAGAACAACCGAATTCCCAACGCTAAGGTCTCAAAGGCTGAATTTTCG 
CAAATCCAGGAAATCGAGGACTATACGTTCACTTCCTTTGACATAAAACATTAAAAACGAATTATTAGCTCTTTCGTTGTCCATCCAATTCCGAA 
ACAAAGTCTAAATAAATTACCAGGGCTGCAACTTGCGTAAGCAGGCATATCTGCATTGCTATCGACTGAGTCGATAACATTCGATAGTATGAACA 
TAAGAAATATTACGTTTCGCAAAACTGAACCCAGTACTGAAGTACAACAATAAATAAATAATAAATTATTTAAAACAAGGCGGAAAATTATTCCG 
AGAATCGATAGTCGCTAGTCGAAAGTGCGCACCACTCCAGTATCTACTCGATATATTGCTATCACAGCCGATAGTTTGCAGCTCCCTGCCATAGA 
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CACTGGTCATCTCGCTCGCACACACAACAGTGGGCCGCGCCACCGCCATAGGCGGCGCACGTGCAAGAGCGAGACAACCAGCTGGGCGACAGCGC 
ACTTTGGTCTGGCACAGAAACGAGTACAGCTTCGATACACGTCGTTTTGCCAAAAGCGTAGCAGAGCAGTCTAAGCGATGAAAAAAATAAATGAG 
GAAAAAAGTCCGTGAAGCGAAAACCGTAATTCCTACAGTGCGAGTCACGCCATTGTGTTTGCAAAAAATATTTGCCAATTGTGCTACGGGCAATT 
TTTCGGAACGGCCGAATTGTGCGTATAATGAAAAAAAATACTACTTAATTTTTAAGCAATTTACAAATTCTTGCGTGCTGCCTGCCTGCTGTCGG 
TGTGTTCTTGCTTGCATGTGTTAGTGAGTACGATTACTTTCGCGCTGAAAACCTTTTTCCACTGCCCTTTCGCTGGAATCTGGGCGAATATAGAT 
AGGACACAAAGCACATATAACTGATTTTTACTGCAGAATTGATCGAGACATGTGGAATACAATAAATGAGATTGTGTAAATAACTGAATATAGTG 
GCGCATACAGTTGCACGTGTGTGTGTGCTAGCATGTGCGTGTGTGTGTGTAAAGGCGCGAGTTTCCGTTTGCCTAGGCTAATTGTAGTGGAAAAA 
CTCAATAAACGGAATTACATAAAAACATGGTGGCATGATCGGCAGTTTCCCGCCGGCAGAAGATGCCCAATCTTGGAAAATTGTAATTTTCGAAA 
GTAACCGCAATGATTTCGTCAGTATGTAAAGTAGTCTGTCAGTCCCTGCATATGTTTGTGTGTGTGTGTTCTATACAGTTTTTTTTATTGCGCAT 
CATGGTGTGCAATAGCGCGCTTTCTGACCTCTCTCTCGCTCTCATGGGTTCTCTTGTTGGCCAAAGCGACGCTGTGTGTGTGTTGTGTACGCATA 
TGTAGCATATGTACATAACACGCCGGAAAAGACAAAAAAAAAAAACCATTCCAAATTTTGAAATCTTACGGAGGCGGGCGGTTTAAACGAGATAA 
AAAATAAATACACCAATATTGTTCTTTTATATGATAACGCAGTAGTCTCGGACCGAATAGTATGCCTTTATTTTGTCGTCCTAACGCTGAATTGC 
CGAAAAATAATAGCCCGTTTTTGGAAAATTCTCGACAGGATTAGCATAAAGAGACGCCTTAGGAGTCGAACGAACCAAATGCCGCAGCAGCCGGC 
CAACGTATTAGCATCAGAACGATGGCCCCAGCGCTCACAGCCGAGCCACTAAGTCCAAAGGAGAAGCTAACCAGCACCGCCTCGCCCTCGGCCCG 
CAGCTCCTCGGAAAGCGGAGGAGTTGGATGTGCAGTTGGCTTGGTCAAGAAGCCGGCAACACCCACGCCGGCCACGGCCACCACGCGGATTACCC 
GCTCATCAGCGGCAGCGGCCTCAGTTACCGCTTCCCCGCATCTACAACAGCAGCGATCCTCGCTAGCAATCAATAACAAGCGAAAGTCGGGCAAC 
GCTAAAGCCAGCAATGAACCAATGGGGCTGAGCACAGCCCCGACTCCACACAGCACCACTTCACCAAATCCCTCGACGGATACTGCGGCAGCCTC 
CACCTCTGCCGTGGAGCACAACAATAACGACAGCAACAGTACCTCCGCCGGCAACAGCACCTCCGCCGACAACAGCTCCACGCCCAAGGACAACT 
CCACCGCTCAGCTACTGGCGGATCTAACCATCAACTTTGAGGAGACCATATCCGCGGAGATTTGCCTGAGGAAAACCCTACCCGATGTGAGTCTG 
GGGAAAGAGCCGGTTCCGCCTGCCTCGGTGCTGGCAGTGGCCACAAGTTCAACGAGTGCTGGCAGCGCTCTATCCTTATCTGCAGTGGATGAGGT 
GACCAAGATAGAAATAGATAATATAGAAGGTGGGTCTAAGGAATCACTCCAAATAAATAGTCTTGACTCAAAATTTCTCTCGCCCACTGTTTCAG 
AGCGTCTCAGCCAGCTAGATGGGAATGCCACTGGAAGGCCCGAGGAGCATGTGGCACCACCTTCTGTTCCGGTACCCGTGCAGGAGGCGCCAAAC 
ACGCCCACCAGGCCACAGGCGCCACCAAGTGCCCATCTTGAGTCTCCGCAGCCGGCGCGAACACAGCAGCAGTCCATTCAACAGATGACGCCGCA 
AAGCACCACCAGTTCCAACAATTTTCTCAAAGACGGAGCAGCTGGTGCTGTTCTAGAGGATCAGGATATCGAGGAGGTGCTTAAAGCGCTCAAAA 
CATTTGACGGTGGCCATGTTAATCCAGATACCATTTGCGAGTTTTTCGACGAAGTTTGGGAAGAGGCAGCACCGGCAGCGCCATTGCCGCCCGCG 
CCACCCGAAAATGTGGTGGATGTGCCGGGCGCGAAGCCCAGCTGCAGTGGCATCCTCGTTAGCGGCAGCAACATTTCCCAGCCGAATGTGATTAT 
TAAGCAGGAGCAGCGGCCCTGGCAGGAGAGTCATGCTGAGTTGGAGCAGCAGCAGCACGTTATCTCGCGCAGGATCGAGTTTTTGCTACGCAGGA 
TGCGCAAGCTCCAGGCGCGAGCGATGTGCCGTCACGCGAGCGAGGAGGTGGTTGGTATCATGGAGTGGTCTGCCCGCAGTTCCCATAAGGCTCCA 
GTTCCAGCGAGAAGTTCCACGCTGACTGAGCAGGAGGCCACTGTCCTATCGATTGTCTCAGGGCGTCCGAGCTCAACCTTCTGGGAGGAGCAGAA 
CAAGCATCCGCTGCCTGCTAGCCAGATGAGTAATGTGATCCGGCATATTGCGACGGCGGCAAAGCATCAGCAAATCTGCCACTCAGCCAGCGGAA 
GCTCTGCTACTCTGGCGCCATCCTCCAGCTGGTACAACAGCTGCAACAGCTCTGCGCTGCCGATGAAGCGTCCACGAAAGAATCAACTTGATGCT 
TCGGTTGCAACGGGTGCCACACCCACTGCACCTGCGCCAGGAGCGGTAACATCTAGACAGGCAGCTGGCACGACGGGATCCGCAACGGGTTCGAA 
TACACCGCGGGCGGATGATATCGTGCCCGGCTACGACACCTATGTGACTAATGAACTCACCCATGTGTCTGGCCTGCTGCACACGGAGCTTCGAG 
AAGTACAGAATGCCATCGACTCGGACGCCACGGAGTCAAGTTCTGGTGGAGAATCCGCTGACGAGATGGTAACCTACAACAACAACCAGCAGCTC 
TCGCTACCAATGTGAGTACTTCGTATTATTTATGTACATACTTAGGTTATCGGGAATATTTTACAAGTAAACTTTCTTCCATCTTTTTTTGAGGG 
GGGTTTTCAGCTTCTGTCATCTTTTTAAGATCATTCTTTTGCTATAGAATTTGTCCGATTAATGTTGGCTCGGTAGTTTTCTTTCAAAATTGTTT 
TGCTTTGACAGGCATTTTAAACATGATTTAACGTCGTCCTTAGCAATTGAAAAGTTGTAATAGGCTTTGATTAGAGTTGTTGTACCTAATTATCG 
CATTTACAATGCGATATTTCAACTATTTCAACAGTTTTTATTAAAATTGTTAATTAGCGATTATATATTTCTTTTAGCTTTTTTTTTGTGGTATT 
CTCTTTCCTAATGCTCTTGAAATTTGATTCTTTGCAGCACTCGACGCGCTGTCTGGCGATATTCGAGAGATCGTGCAGCCATTGCCCTGCGCTGG 
TCCTGGCTCTGCTCCCAGCTGACCGATTTGGAGATGAAGATCCGTCAGCACAGCGACCTTTACATGGATCTAACTCAGTCCAAGGGCGAGATCCA 
GCTGGAGTCCACAGCTAAGACCTCACCGTCGCTAGCACAGCCGGCAAATGGCATTAAGGAGGAGCCCGGATCCGACTACCTCTGCAGTCGGGCAC 
GTCCATTGGTGCTGTCCGAGTTCCGCAAACGCAAGCTCTTCCAGACCACGAACATGCACACGATTTCCAAGAAAGCCGCACGCCCCAGCAACATT 
AAGTGTGGCTGCCAGTGGCCGCAGGTGCCATGCACACTGTGCACGGGTCGGACAGACCCTACGGCGCCCAGGGATCTGGTCGAGACGATGATGCC 
GCAAAACCGGGTGGCATTGCTCGATGCTGGCTACCATCCAGTGCTCAGCTTCGCCAGCGGTGAGTGGGAGGTTACACTGAGTGGACCCATTTGGT 
TTTCGTCCATGATTATTTCTTATCATCTGTGATGGTTTAACATTATTTATTCTACACTTGAGATTTGAACTGAATTGATTAAAAAGTAAAACTTT 
AAAATGCAATACTTGCAAATAATTTTTATTCGAAACTAAAACATTAAAATCCTCGTATTCTTTACATTGGATTCCAAAATGAAAAGAAATAGTCG 
GATGATGACAGCTAAAGTGGAATTATGTTTGAATATATTCTTATCCCTATCATTCTTGTTTCCCAGATGTCAGTCAGTCGGTTCACTTGGAGGCC 
GTTGCCCGCCAGCCGGACTGGCAGTACCGCGTCATGCGCAGCCAGCCCAAGGCCATTGTCAAGGCCATGTGGAAGGCGGAGCGCGAGACGATAGC 
CTCAGGCGGTGTCGGCGGACAAGGTGGCAGCCGGCGGTCGGGCGATGCGGCGAAGCGGCGTTATATACGACGCAAGGAGCGCAACAACAACTCAA 
ATAAGGAGGCTGGGAGCGGAGCTAAAGCTGCTGCTGTCGCCGCAGGAGGAGCAGGCGGAAACGGAAGCAGTAGCGTCGGGGGAGGGGATAGCGGA 
AACGGTACTGGTACTGCTACTACTTCTTCTTCTACTACGCCTACTACAACGCCACTTGTGGCCAACAGTAAATTATTGTAAACATGACGAAATTC 
GATAATTGTTATTAACATAAAACAACAACAATCTTTTGCATCTCTTCTTCTTCTTTTTCTTTGCATCGATCAACCAAAACACGAAATTCCACCAA 
CACCATCAACAAACACCACCACCAACAACAACAACGAAAAACCCCACCAATTGTTTTATTTTGCCATTCCGTTCTTGTCTCCGTTTCGTCCCCGT 
TCCGCATCCAAATCACGGACTTGACGCAACATCAGCAGCGGCCAGGGGCAAGCGGCAACCACAACACCCAACTGGAGCGAACAGTTCCAGCTCGC 
TGTGGCCAGACTCTCGCCAACGCCAACGACAACGCCACCACAGTCCCTCCTCAACCTCGATCTCGGCACTGAGCGGCGCTAAGAAGGCACGCAAA 
TCCGCAACCAACAGCAGCAGCAGCAATTCCACACAGCAGCAGCAGAATCACCACGGCACCAAAATCAACAATCATCATCTCAATGGCTACGGCGA 
TCAGTGGGGGGAGCAAAGCAGGAGTCGTCGCAACTCTTCGCCCTCCAACAGTCATAAAAATGAGAGGTATGTTTCGGATTCAATAACTGATCCCA 
GGGATTCGCGATAAGTAGTCCTGCTTTTTTTTTATAAGATAGATTGTTATATATAATTTATATGTATGTTTCCCTTGCAGAACCTCTGAGCGTCG 
CGTTCGTCCCATCTACGACATCAATAACATCGTTATACCCTACAGTATGCTAGCCCAGTCGAAAATGGAGATAATTCCCTACAAGGAAATACCCA 
TACCCAAGTAAGTATTCAATTTTGCGCTAAGTCCTTATTTTATATATATATGTCGCATAATGTAAAATACCAAGATTTGTGTATATCTATTAAAA 
TCACCCATTCCATCTTTGCTTTCAGGTGGCGCATTGTAGACAGTGATAACGATAAGGGAAAACATTCGTCAGATGAGTCAATGGAGCGCAAACTG 
AGCAATGGCTGTGTCGCATCAAAGTCAGAGGAGCTAGCCAAACAACAGCTACCACCACCACCAGATCAAGATCAAGAGCAGCCGGCCCTGCAGCC 
TCCGAAGGTTAATGGTTTAAAGGAAAATCAAGCAGTCAAGCACAATAACAATAATAATACAAACAATAATAATAATAAAAATGGACTGGTAAATG 
GTAACGCCAAAAAGGATGAACTTAAGGCTGCAGAGGATCACGATACGCAATCAAAAGAAGACGACGATCCAAAGGAAAAGGTGGTCAAACCCAAG 
CTCAACGGAAACTTGATAAAGAATCCGATTGACAAAACTGCCGAAACGCAAGACAAAATTGCTCAGCCAGCACCAGAACCACCACTGCCAAAGCG 
ACCCAAGCTGGAAACTCCAACCTCCAACGAGGTCACCAAGGCTAATGGGCAGTTGGTGCAGCAACAGCTGGAATCCCGTGAGAGCGAGGAGGAGC 
CACAGCATGTCGACGAAGATCTTTCGGATGAAGCGTTTATTATGCGACATCAGCGAGCTCTTCTCGAGGAGCGACGCCGCTTCGAGACTTTCCTT 
AAGTTCCCCTGGAGCACCCGATCGCGTGCAAATAGGCGCGTCGATAGTCGTGCCGAATCTAGTGGCGCCAACACACCGGATCCCGCCTCACCAGC 
TCCTCATCTAGGCGGCCCTGGTCACGACAACGAAAGCATTCCCTCGCCGCTGGCGTACCCACTGGACGCTTTTAACGAAAGCGGTGAGCTGCTGA 
CTGGCGGACAGGCACGCCAAGCTCGTCGTCGGACTACGTCTAGCAAGCTTAAGGATCAGCTGGAACGGCGCAGCACCACGCCCGATCTGCGGGAG 
GTAAGTCGATGTCCAAGTTTCACATTTGTTGAACTCATATATAAAATAAATTCTGGTTGTGCCTATTACTTTTGCCGAATTTTGTTTTCAAACTG 
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CTTTGAATGTTCAAATACTGATTGTATAGAATAGTTTTCTAATTGAAACCCTTTTCCTTTTTGTATAGTCTTACACTGTGATGCCATCTCCCTTT 
GAGCCCTTGAACTTTCCGCTCTCCGACGAGGTCTACCAGCGATTGCTAGCCGAGACATATGCGCCGCCCAGGTCAATGACCGTCTCAAAGCGGGC 
CAAAAGCAAGTCGATATCCTCGAACTGTGATGGTGGGAACACCTTCACAGGAGGCAGCGGCAGTCGGCGCAGCAGCAAGTGCAAGATTAAGCCAA 
ATGGTCAGCTAAATGGCCAGCTCAATGGCCACCCTGCGGCGGCGAAGTTGAACGGAATAGATGCGGCAAATGAAGGTGGAATTTCTGAGCCGGAA 
GAGGAAGAGATGCTGCTAGAAGACGAGGATGACGACTATCCCAAGCATCATCTGGCGCCGCTGGATGACGAGGAGCCGTCGACACCAGATGTCGA 
GCATGAGCTTTACGACTCGGCGATCGATGCCTATCTCGCTGATCCGGAGGCGCTAGAGGAGGATATGGGCGAGGATCCCTTCGAGGACGACGATC 
CCAACGATCCGGAGTGGAAGACACGGACCGAGGGTGTTCGCAGCAGACGCATCTAAGACCAAAATCCAAAGTTTTCGTTATTTAAGACTTTTAAA 
CGATTTGTCAATTTTTAAAGCCACAACACACACACACCCACAAATCAGCCGGAGCAGACCAACAAATTTTAATTTGTTTCAAACGGTTTGGTTTG 
CACTCACAACAGCATTGCGTACTCTAAGCAATCGTCTAAGTTATTTAGTTCTACGAGCCCACAACAAAAAAAAAAAACACAGATACGCTACACAC 
AAGTACACAACAAAACAAACACGAAGCAGAAGCGAAAATCTTGAAACAATGTAAATACAAGCTAAACGCATAGAGCTAAGCAGTTACTGATAATT 
GTACACTAACCAACGATCAAGAAATCATCGAACGCACCCACCACGTCGCGTGGACGTGGTGGATGCGGTCGGTTGGAGCACTGGTCAGCGGTTAA 
GTAGGCACCACAACAAACCCACTTGGGCTTCAAGCCGAAGTGCGCTAATGATAAAAATATTAAGCTAATCTAAATATATATATATATACGAAAAC 
ATTTAAATATATACCATATGCATACATTAACGTGTAAATGCCGAAGAGAGCAGTTAGCTGAAGATTTGAAATGTATTCGCACCTGCACACAAACG 
ATATAACAACTAAGCTATAGAAGGCAATTATCTGTAAATTGATCAAGCTTAACCATTAGCTAACCAACCATTAGCCTTTAGCCCAGCAAGTAATC 
AGTCGGCGTTGGGGCCGTGTTTTTCTTATATTTTAAAAAGTTTATCAGCTGTAAATGTATAGAGCGGAATTCGAAAGGCGGTGTGTGAAAATGGA 
TCGAACGATAAATGTATGTATATGTATTTTAGTTGCTTAGGCGTTTTCATTTCTACCTTATAAACACGTTAGCGGCTAATTGTAGCAATTAATAC 
TATAAAACACAAAAAAAAAAACAAAAACAAAAAACAACAACCATAAGGAAACCTAGCGCGGCTATGTCCGTGAAAATGTTTATTCTAATTTCGTT 
TTTACTATCTACTGCGTATGTTTCACCGCAAAGCTAGAATCCAGTTTAATTCGTTAAAACTTGAAAAGAAATACAACAGAAATGTGAAGATTAAC 
TAAGGACCGCAAAAATACAGAATTATATATTCTTAAAAATGACACATGGAGCGCCAGAAGTGATATGCTGGGACCACCATCGCCAGATGGTGGAA 
TCTGGCCAACATGTGTGTCG 
(SEQ ID NO; 1402) 

Exon: 1001.. 1052 

Exon: 2034. .2784 

Exon: 2851.. 4096 

Exon: 4503.. 4999 

Exon: 5292.. 5672 

Exon: 5926.. 6241 

Exon: 6351.. 6467 

Exon: 6581.. 7505 

Exon: 7669. .8425 

Start ATG: 2112 

Transcript No. : CT14929 

C AAAAGC GT AGC AG AGC AGT CT AAGC G AT G AAAAAAAT AAAT G AGG AAAAAAGAT T AGC AT AAAG AG ACGC CT T AGG AGT C GAAC G AACC AAAT G 
CCGCAGCAGCCGGCCAACGTATTAGCATCAGAACGATGGCCCCAGCGCTCACAGCCGAGCCACTAAGTCCAAAGGAGAAGCTAACCAGCACCGCC 
TCGCCCTCGGCCCGCAGCTCCTCGGAAAGCGGAGGAGTTGGATGTGCAGTTGGCTTGGTCAAGAAGCCGGCAACACCCACGCCGGCCACGGCCAC 
CACGCGGATTACCCGCTCATCAGCGGCAGCGGCCTCAGTTACCGCTTCCCCGCATCTACAACAGCAGCGATCCTCGCTAGCAATCAATAACAAGC 
GAAAGTCGGGCAACGCTAAAGCCAGCAATGAACCAATGGGGCTGAGCACAGCCCCGACTCCACACAGCACCACTTCACCAAATCCCTCGACGGAT 
ACTGCGGCAGCCTCCACCTCTGCCGTGGAGCACAACAATAACGACAGCAACAGTACCTCCGCCGGCAACAGCACCTCCGCCGACAACAGCTCCAC 
GCCCAAGGACAACTCCACCGCTCAGCTACTGGCGGATCTAACCATCAACTTTGAGGAGACCATATCCGCGGAGATTTGCCTGAGGAAAACCCTAC 
CCGATGTGAGTCTGGGGAAAGAGCCGGTTCCGCCTGCCTCGGTGCTGGCAGTGGCCACAAGTTCAACGAGTGCTGGCAGCGCTCTATCCTTATCT 
GCAGTGGATGAGGTGACCAAGATAGAAATAGATAATATAGAAGAGCGTCTCAGCCAGCTAGATGGGAATGCCACTGGAAGGCCCGAGGAGCATGT 
GGCACCACCTTCTGTTCCGGTACCCGTGCAGGAGGCGCCAAACACGCCCACCAGGCCACAGGCGCCACCAAGTGCCCATCTTGAGTCTCCGCAGC 
C G GC GC GAAC AC AGCAGC AGT C C AT T C AAC AG AT G AC GCC GC AAAG C AC C AC C AG T T C C AAC AAT T T T CT C AAAG AC GG AGC AGCT GGT GC T GT T 
CTAGAGGATCAGGATATCGAGGAGGTGCTTAAAGCGCTCAAAACATTTGACGGTGGCCATGTTAATCCAGATACCATTTGCGAGTTTTTCGACGA 
AGTTTGGGAAGAGGCAGCACCGGCAGCGCCATTGCCGCCCGCGCCACCCGAAAATGTGGTGGATGTGCCGGGCGCGAAGCCCAGCTGCAGTGGCA 
TCCTCGTTAGCGGCAGCAACATTTCCCAGCCGAATGTGATTATTAAGCAGGAGCAGCGGCCCTGGCAGGAGAGTCATGCTGAGTTGGAGCAGCAG 
CAGCACGTTATCTCGCGCAGGATCGAGTTTTTGCTACGCAGGATGCGCAAGCTCCAGGCGCGAGCGATGTGCCGTCACGCGAGCGAGGAGGTGGT 
TGGTATCATGGAGTGGTCTGCCCGCAGTTCCCATAAGGCTCCAGTTCCAGCGAGAAGTTCCACGCTGACTGAGCAGGAGGCCACTGTCCTATCGA 
TTGTCTCAGGGCGTCCGAGCTCAACCTTCTGGGAGGAGCAGAACAAGCATCCGCTGCCTGCTAGCCAGATGAGTAATGTGATCCGGCATATTGCG 
ACGGCGGCAAAGCATCAGCAAATCTGCCACTCAGCCAGCGGAAGCTCTGCTACTCTGGCGCCATCCTCCAGCTGGTACAACAGCTGCAACAGCTC 
TGCGCTGCCGATGAAGCGTCCACGAAAGAATCAACTTGATGCTTCGGTTGCAACGGGTGCCACACCCACTGCACCTGCGCCAGGAGCGGTAACAT 
CTAGACAGGCAGCTGGCACGACGGGATCCGCAACGGGTTCGAATACACCGCGGGCGGATGATATCGTGCCCGGCTACGACACCTATGTGACTAAT 
GAACTCACCCATGTGTCTGGCCTGCTGCACACGGAGCTTCGAGAAGTACAGAATGCCATCGACTCGGACGCCACGGAGTCAAGTTCTGGTGGAGA 
ATCCGCTGACGAGATGGTAACCTACAACAACAACCAGCAGCTCTCGCTACCAATCACTCGACGCGCTGTCTGGCGATATTCGAGAGATCGTGCAG 
CCATTGCCCTGCGCTGGTCCTGGCTCTGCTCCCAGCTGACCGATTTGGAGATGAAGATCCGTCAGCACAGCGACCTTTACATGGATCTAACTCAG 
TCCAAGGGCGAGATCCAGCTGGAGTCCACAGCTAAGACCTCACCGTCGCTAGCACAGCCGGCAAATGGCATTAAGGAGGAGCCCGGATCCGACTA 
CCTCTGCAGTCGGGCACGTCCATTGGTGCTGTCCGAGTTCCGCAAACGCAAGCTCTTCCAGACCACGAACATGCACACGATTTCCAAGAAAGCCG 
CACGCCCCAGCAACATTAAGTGTGGCTGCCAGTGGCCGCAGGTGCCATGCACACTGTGCACGGGTCGGACAGACCCTACGGCGCCCAGGGATCTG 
GTCGAGACGATGATGCCGCAAAACCGGGTGGCATTGCTCGATGCTGGCTACCATCCAGTGCTCAGCTTCGCCAGCGATGTCAGTCAGTCGGTTCA 
CTTGGAGGCCGTTGCCCGCCAGCCGGACTGGCAGTACCGCGTCATGCGCAGCCAGCCCAAGGCCATTGTCAAGGCCATGTGGAAGGCGGAGCGCG 
AGACGATAGCCTCAGGCGGTGTCGGCGGACAAGGTGGCAGCCGGCGGTCGGGCGATGCGGCGAAGCGGCGTTATATACGACGCAAGGAGCGCAAC 
AACAACTCAAATAAGGAGGCTGGGAGCGGAGCTAAAGCTGCTGCTGTCGCCGCAGGAGGAGCAGGCGGAAACGGAAGCAGTAGCGTCGGGGGAGG 
GGATAGCGGAAACGGTACTGGTACTGCTACTACTTCTTCTTCTACTACGCCTACTACAACGCCACTTGTGGCCAACACAGCGGCCAGGGGCAAGC 
GGCAACCACAACACCCAACTGGAGCGAACAGTTCCAGCTCGCTGTGGCCAGACTCTCGCCAACGCCAACGACAACGCCACCACAGTCCCTCCTCA 
ACCTCGATCTCGGCACTGAGCGGCGCTAAGAAGGCACGCAAATCCGCAACCAACAGCAGCAGCAGCAATTCCACACAGCAGCAGCAGAATCACCA 
CGGCACCAAAATCAACAATCATCATCTCAATGGCTACGGCGATCAGTGGGGGGAGCAAAGCAGGAGTCGTCGCAACTCTTCGCCCTCCAACAGTC 
ATAAAAATGAGAGAACCTCTGAGCGTCGCGTTCGTCCCATCTACGACATCAATAACATCGTTATACCCTACAGTATGCTAGCCCAGTCGAAAATG 
GAGATAATTCCCTACAAGGAAATACCCATACCCAAGTGGCGCATTGTAGACAGTGATAACGATAAGGGAAAACATTCGTCAGATGAGTCAATGGA 
GCGCAAACTGAGCAATGGCTGTGTCGCATCAAAGTCAGAGGAGCTAGCCAAACAACAGCTACCACCACCACCAGATCAAGATCAAGAGCAGCCGG 
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CCCTGCAGCCTCCGAAGGTTAATGGTTTAAAGGAAAATCAAGCAGTCAAGCACAATAACAATAATAATACAAACAATAATAATAATAAAAATGGA 
CTGGTAAATGGTAACGCCAAAAAGGATGAACTTAAGGCTGCAGAGGATCACGATACGCAATCAAAAGAAGACGACGATCCAAAGGAAAAGGTGGT 
CAAACCCAAGCTCAACGGAAACTTGATAAAGAATCCGATTGACAAAACTGCCGAAACGCAAGACAAAATTGCTCAGCCAGCACCAGAACCACCAC 
TGCCAAAGCGACCCAAGCTGGAAACTCCAACCTCCAACGAGGTCACCAAGGCTAATGGGCAGTTGGTGCAGCAACAGCTGGAATCCCGTGAGAGC 
GAGGAGGAGCCACAGCATGTCGACGAAGATCTTTCGGATGAAGCGTTTATTATGCGACATCAGCGAGCTCTTCTCGAGGAGCGACGCCGCTTCGA 
GACTTTCCTTAAGTTCCCCTGGAGCACCCGATCGCGTGCAAATAGGCGCGTCGATAGTCGTGCCGAATCTAGTGGCGCCAACACACCGGATCCCG 
CCTCACCAGCTCCTCATCTAGGCGGCCCTGGTCACGACAACGAAAGCATTCCCTCGCCGCTGGCGTACCCACTGGACGCTTTTAACGAAAGCGGT 
GAGCTGCTGACTGGCGGACAGGCACGCCAAGCTCGTCGTCGGACTACGTCTAGCAAGCTTAAGGATCAGCTGGAACGGCGCAGCACCACGCCCGA 
TCTGCGGGAGTCTTACACTGTGATGCCATCTCCCTTTGAGCCCTTGAACTTTCCGCTCTCCGACGAGGTCTACCAGCGATTGCTAGCCGAGACAT 
ATGCGCCGCCCAGGTCAATGACCGTCTCAAAGCGGGCCAAAAGCAAGTCGATATCCTCGAACTGTGATGGTGGGAACACCTTCACAGGAGGCAGC 
GGCAGTCGGCGCAGCAGCAAGTGCAAGATTAAGCCAAATGGTCAGCTAAATGGCCAGCTCAATGGCCACCCTGCGGCGGCGAAGTTGAACGGAAT 
AGATGCGGCAAATGAAGGTGGAATTTCTGAGCCGGAAGAGGAAGAGATGCTGCTAGAAGACGAGGATGACGACTATCCCAAGCATCATCTGGCGC 
CGCTGGATGACGAGGAGCCGTCGACACCAGATGTCGAGCATGAGCTTTACGACTCGGCGATCGATGCCTATCTCGCTGATCCGGAGGCGCTAGAG 
GAGGATATGGGCGAGGATCCCTTCGAGGACGACGATCCCAACGATCCGGAGTGGAAGACACGGACCGAGGGTGTTCGCAGCAGACGCATCTAAGA 
CCAAAATCCAAAGTTTTCGTTATTTAAGACTTTTAAACGATTTGTCAATTTTTAAAGCCACAACACACACACACCCACAAATCAGCCGGAGCAGA 
CCAACAAATTTTAATTTGTTTCAAACGGTTTGGTTTGCACTCACAACAGCATTGCGTACTCTAAGCAATCGTCTAAGTTATTTAGTTCTACGAGC 
CCACAAC 

(SEQ ID NO: 1403) 
Start ATG: 131 

MAPALTAEPLSPKEKLTSTASPSARSSSESGGVGCAVGLVKKPATPTPATATTRITRSSAAAASVTASPHLQQQRSSLAINNKRKSGNAKASNEP 

MGLSTAPTPHSTTSPNPSTDTAAASTSAVEHNNNDSNSTSAGNSTSADNSSTPKDNSTAQLLADLTIWFEETISAEICLRKTLPDVSLGKEPVPP 

ASVLAVATSSTSAGSALSLSAVDEVTKIEIDNIEERLSQLDGNATGRPEEHVAPPSVPVPVQEAPNTPTRPQAPPSAHLESPQPARTQQQSIQQM 

TPQSTTSSNNFLKDGAAGAVLEDQDIEEVLKALKTFDGGHVNPDTICEFFDEVWEEAAPAAPLPPAPPENVVDVPGAKPSCSGILVSGSNISQPN 

VIIKQEQRPWQESHAELEQQQHVISPJIIEFLLRRMRKLQARAMCRHASEEVVGIMEWSARSSHKAPVPARSSTLTEQEATVLSIVSGRPSSTFWE 

EQNKHPLPASQMSNVIRHIATAAKHQQICHSASGSSATLAPSSSWYNSCNSSALPMKRPRKNQLDASVATGATPTAPAPGAVTSRQAAGTTGSAT 

GSNTPRADDIVPGYDTYVTNELTHVSGLLHTELREVQNAIDSDATESSSGGESADEMVTYNNNQQLSLPITRRAVWRYSRDRAAIALRWSWLCSQ 

LTDLEMKIRQHSDLYMDLTQSKGEIQLESTAKTSPSLAQPANGIKEEPGSDYLCSRARPLVLSEFRKRKLFQTTNMHTISKKAARPSNIKCGCQW 

PQVPCTLCTGRTDPTAPRDLVETmPQNRVALLDAGYHPVLSFASDVSQSVHLEAVARQPDWQYRVMRSQPKAIVKAMWKAERETIASGGVGGQG 

GSRRSGDAAKRRYIRRKERNNNSNKEAGSGAKAAAVAAGGAGGKGSSSVGGGDSGNGTGTATTSSSTTPTTTPLVANTAARGKRQPQHPTGANSS 

SSLWPDSRQRQRQRHHSPSSTSISALSGAKKARKSATNSSSSNSTQQQQNHHGTKINNHHLNGYGDQWGEQSRSRRNSSPSNSHKNERTSERRVR 

PIYDINNIVIPYSMLAQSKMEIIPYKEIPIPKWRIVDSDNDKGKHSSDESMERKLSNGCVASKSEELAKQQLPPPPDQDQEQPALQPPKVNGLKE 

NQAVKHNNNNNTWNNNNKNGLVNGMKKDELKMEDHDTQSKEDDDPKEKWKPKLNGNLIKNPIDKTAETQDKIAQPAPEPPLPKRPKLETPT^ 

NEVTKANGQLVQQQLESRESEEEPQHVDEDLSDEAFIMRHQRALLEERRRFETFLKFPWSTRSRAWRRVDSRAESSGANTPDPASPAPHLGGPGH 

DNESIPSPLAYPLDAFNESGELLTGGQARQARRRTTSSKLKDQLERRSTTPDLRESYTVMPSPFEPLNFPLSDEVYQRLLAETYAPPRSMTVSKR 

AKSKSISSNCDGGNTFTGGSGSRRSSKCKIKPNGQLNGQLNGHPAAAKLKGIDAANEGGISEPEEEEMLLEDEDDDYPKHHLAPLDDEEPSTPDV 

EHELYDSAIDAYLADPEALEEDMGEDPFEDDDPNDPEWKTRTEGVRSRRI* 

(SEQ ID NO: 1404) 



Celera Sequence No. : 142000013384528 

AATGGCGGGATCTGGAGTTGCCAAGCAGATATTTGCCCTCGATTTCGAGATCTTTGGACGAGTGCAAGGTATGCAATGCATTTTTCTGATTCGGC 
TTGTTAAACAGCGTATTAAATTTTGCAGGTGTGTTCTTCCGCAAAGTAAGTACTGGGAGGCCTATATATATATCTGGAAATCAACATTTAACTTA 
TGTACATTTTTGGCTTAGCACACGTCGCATGAGGCTAAAAGATTGGGAGTTAGGGGCTGGTGCATGAATACCCGGGATGGGACCGTCAAGGGACA 
AC TGGAGGCTC C T AT GAT GAAT T T AAT GG AAATGT AGGT AC CGC TGC AC T CC ACT AAT T C AA T C A T T T AAC CAT C GC C C ACT AAT T T G AAGG AAA 
CATTGGCTGGAGAACAACCGAATTCCCAACGCTAAGGTCTCAAAGGCTGAATTTTCGCAAATCCAGGAAATCGAGGACTATACGTTCACTTCCTT 
TGACATAAAACATTAAAAACGAATTATTAGCTCTTTCGTTGTCCATCCAATTCCGAAACAAAGTCTAAATAAATTACCAGGGCTGCAACTTGCGT 
AAGCAGGCATATCTGCATTGCTATCGACTGAGTCGATAACATTCGATAGTATGAACATAAGAAATATTACGTTTCGCAAAACTGAACCCAGTACT 
GAAGTACAACAATAAATAAATAATAAATTATTTAAAACAAGGCGGAAAATTATTCCGAGAATCGATAGTCGCTAGTCGAAAGTGCGCACCACTCC 
AGTATCTACTCGATATATTGCTATCACAGCCGATAGTTTGCAGCTCCCTGCCATAGACACTGGTCATCTCGCTCGCACACACAACAGTGGGCCGC 
GCCACCGCCATAGGCGGCGCACGTGCAAGAGCGAGACAACCAGCTGGGCGACAGCGCACTTTGGTCTGGCACAGAAACGAGTACAGCTTCGATAC 
ACGTCGTTTTGCCAAAAGCGTAGCAGAGCAGTCTAAGCGATGAAAAAAATAAATGAGGAAAAAAGTCCGTGAAGCGAAAACCGTAATTCCTACAG 
TGCGAGTCACGCCATTGTGTTTGCAAAAAATATTTGCCAATTGTGCTACGGGCAATTTTTCGGAACGGCCGAATTGTGCGTATAATGAAAAAAAA 
TACTACTTAATTTTTAAGCAATTTACAAATTCTTGCGTGCTGCCTGCCTGCTGTCGGTGTGTTCTTGCTTGCATGTGTTAGTGAGTACGATTACT 
TTCGCGCTGAAAACCTTTTTCCACTGCCCTTTCGCTGGAATCTGGGCGAATATAGATAGGACACAAAGCACATATAACTGATTTTTACTGCAGAA 
TTGATCGAGACATGTGGAATACAATAAATGAGATTGTGTAAATAACTGAATATAGTGGCGCATACAGTTGCACGTGTGTGTGTGCTAGCATGTGC 
GTGTGTGTGTGTAAAGGCGCGAGTTTCCGTTTGCCTAGGCTAATTGTAGTGGAZWVACTCAATAAACGGAATTACATAAAAACATGGTGGCATGA 
TCGGCAGTTTCCCGCCGGCAGAAGATGCCCAATCTTGGAAAATTGTAATTTTCGAAAGTAACCGCAATGATTTCGTCAGTATGTAAAGTAGTCTG 
TCAGTCCCTGCATATGTTTGTGTGTGTGTGTTCTATACAGTTTTTTTTATTGCGCATCATGGTGTGCAATAGCGCGCTTTCTGACCTCTCTCTCG 
CTCTCATGGGTTCTCTTGTTGGCCAAAGCGACGCTGTGTGTGTGTTGTGTACGCATATGTAGCATATGTACATAACACGCCGGAAAAGACAAAAA 
AAAAAAACCATTCCAAATTTTGAAATCTTACGGAGGCGGGCGGTTTAAACGAGATAAAAAATAAATACACCAATATTGTTCTTTTATATGATAAC 
GCAGTAGTCTCGGACCGAATAGTATGCCTTTATTTTGTCGTCCTAACGCTGAATTGCCGT^AAAATAATAGCCCGTTTTTGGAAAATTCTCGACAG 
GATTAGCATAAAGAGACGCCTTAGGAGTCGAACGAACCAAATGCCGCAGCAGCCGGCCAACGTATTAGCATCAGAACGATGGCCCCAGCGCTCAC 
AGCCGAGCCACTAAGTCCAAAGGAGAAGCTAACCAGCACCGCCTCGCCCTCGGCCCGCAGCTCCTCGGAAAGCGGAGGAGTTGGATGTGCAGTTG 
GCTTGGTCAAGAAGCCGGCAACACCCACGCCGGCCACGGCCACCACGCGGATTACCCGCTCATCAGCGGCAGCGGCCTCAGTTACCGCTTCCCCG 
CATCTACAACAGCAGCGATCCTCGCTAGCAATCAATAACAAGCGAAAGTCGGGCAACGCTAAAGCCAGCAATGAACCAATGGGGCTGAGCACAGC 
CCCGACTCCACACAGCACCACTTCACCAAATCCCTCGACGGATACTGCGGCAGCCTCCACCTCTGCCGTGGAGCACAACAATAACGACAGCAACA 
GTACCTCCGCCGGCAACAGCACCTCCGCCGACAACAGCTCCACGCCCAAGGACAACTCCACCGCTCAGCTACTGGCGGATCTAACCATCAACTTT 
GAGGAGACCATATCCGCGGAGATTTGCCTGAGGAAAACCCTACCCGATGTGAGTCTGGGGAAAGAGCCGGTTCCGCCTGCCTCGGTGCTGGCAGT 
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GGCCACAAGTTCAACGAGTGCTGGCAGCGCTCTATCCTTATCTGCAGTGGATGAGGTGACCAAGATAGAAATAGATAATATAGAAGGTGGGTCTA 
AGGAATCACTCCAAATAAATAGTCTTGACTCAAAATTTCTCTCGCCCACTGTTTCAGAGCGTCTCAGCCAGCTAGATGGGAATGCCACTGGAAGG 
CCCGAGGAGCATGTGGCACCACCTTCTGTTCCGGTACCCGTGCAGGAGGCGCCAAACACGCCCACCAGGCCACAGGCGCCACCAAGTGCCCATCT 
TGAGTCTCCGCAGCCGGCGCGAACACAGCAGCAGTCCATTCAACAGATGACGCCGCAAAGCACCACCAGTTCCAACAATTTTCTCAAAGACGGAG 
CAGCTGGTGCTGTTCTAGAGGATCAGGATATCGAGGAGGTGCTTAAAGCGCTCAAAACATTTGACGGTGGCCATGTTAATCCAGATACCATTTGC 
GAGTTTTTCGACGAAGTTTGGGAAGAGGCAGCACCGGCAGCGCCATTGCCGCCCGCGCCACCCGAAAATGTGGTGGATGTGCCGGGCGCGAAGCC 
CAGCTGCAGTGGCATCCTCGTTAGCGGCAGCAACATTTCCCAGCCGAATGTGATTATTAAGCAGGAGCAGCGGCCCTGGCAGGAGAGTCATGCTG 
AGTTGGAGCAGCAGCAGCACGTTATCTCGCGCAGGATCGAGTTTTTGCTACGCAGGATGCGCAAGCTCCAGGCGCGAGCGATGTGCCGTCACGCG 
AGCGAGGAGGTGGTTGGTATCATGGAGTGGTCTGCCCGCAGTTCCCATAAGGCTCCAGTTCCAGCGAGAAGTTCCACGCTGACTGAGCAGGAGGC 
CACTGTCCTATCGATTGTCTCAGGGCGTCCGAGCTCAACCTTCTGGGAGGAGCAGAACAAGCATCCGCTGCCTGCTAGCCAGATGAGTAATGTGA 
TCCGGCATATTGCGACGGCGGCAAAGCATCAGCAAATCTGCCACTCAGCCAGCGGAAGCTCTGCTACTCTGGCGCCATCCTCCAGCTGGTACAAC 
AGCTGCAACAGCTCTGCGCTGCCGATGAAGCGTCCACGAAAGAATCAACTTGATGCTTCGGTTGCAACGGGTGCCACACCCACTGCACCTGCGCC 
AGGAGCGGTAACATCTAGACAGGCAGCTGGCACGACGGGATCCGCAACGGGTTCGAATACACCGCGGGCGGATGATATCGTGCCCGGCTACGACA 
CCTATGTGACTAATGAACTCACCCATGTGTCTGGCCTGCTGCACACGGAGCTTCGAGAAGTACAGAATGCCATCGACTCGGACGCCACGGAGTCA 
AGTTCTGGTGGAGAATCCGCTGACGAGATGGTAACCTACAACAACAACCAGCAGCTCTCGCTACCAATGTGAGTACTTCGTATTATTTATGTACA 
TACTTAGGTTATCGGGAATATTTTACAAGTAAACTTTCTTCCATCTTTTTTTGAGGGGGGTTTTCAGCTTCTGTCATCTTTTTAAGATCATTCTT 
TTGCTATAGAATTTGTCCGATTAATGTTGGCTCGGTAGTTTTCTTTCAAAATTGTTTTGCTTTGACAGGCATTTTAAACATGATTTAACGTCGTC 
CTTAGCAATTGAAAAGTTGTAATAGGCTTTGATTAGAGTTGTTGTACCTAATTATCGCATTTACAATGCGATATTTCAACTATTTCAACAGTTTT 
TATTAAAATTGTTAATTAGCGATTATATATTTCTTTTAGCTTTTTTTTTGTGGTATTCTCTTTCCTAATGCTCTTGAAATTTGATTCTTTGCAGC 
ACTCGACGCGCTGTCTGGCGATATTCGAGAGATCGTGCAGCCATTGCCCTGCGCTGGTCCTGGCTCTGCTCCCAGCTGACCGATTTGGAGATGAA 
GATCCGTCAGCACAGCGACCTTTACATGGATCTAACTCAGTCCAAGGGCGAGATCCAGCTGGAGTCCACAGCTAAGACCTCACCGTCGCTAGCAC 
AGCCGGCAAATGGCATTAAGGAGGAGCCCGGATCCGACTACCTCTGCAGTCGGGCACGTCCATTGGTGCTGTCCGAGTTCCGCAAACGCAAGCTC 
TTCCAGACCACGAACATGCACACGATTTCCAAGAAAGCCGCACGCCCCAGCAACATTAAGTGTGGCTGCCAGTGGCCGCAGGTGCCATGCACACT 
GTGCACGGGTCGGACAGACCCTACGGCGCCCAGGGATCTGGTCGAGACGATGATGCCGCAAAACCGGGTGGCATTGCTCGATGCTGGCTACCATC 
CAGTGCTCAGCTTCGCCAGCGGTGAGTGGGAGGTTACACTGAGTGGACCCATTTGGTTTTCGTCCATGATTATTTCTTATCATCTGTGATGGTTT 
AACATT ATT T AT T C T AC AC T T GAG AT T T G AAC T G AAT T G AT T AAAAAGT AAAACT T T AAAAT G C AAT AC T T GC AAAT AATTTTT ATTCG AAAC T A 
AAACATTAAAATCCTCGTATTCTTTACATTGGATTCCAAAATGAAAAGAAATAGTCGGATGATGACAGCTAAAGTGGAATTATGTTTGAATATAT 
TCTTATCCCTATCATTCTTGTTTCCCAGATGTCAGTCAGTCGGTTCACTTGGAGGCCGTTGCCCGCCAGCCGGACTGGCAGTACCGCGTCATGCG 
CAGCCAGCCCAAGGCCATTGTCAAGGCCATGTGGAAGGCGGAGCGCGAGACGATAGCCTCAGGCGGTGTCGGCGGACAAGGTGGCAGCCGGCGGT 
CGGGCGATGCGGCGAAGCGGCGTTATATACGACGCAAGGAGCGCAACAACAACTCAAATAAGGAGGCTGGGAGCGGAGCTAAAGCTGCTGCTGTC 
GCCGCAGGAGGAGCAGGCGGAAACGGAAGCAGTAGCGTCGGGGGAGGGGATAGCGGAAACGGTACTGGTACTGCTACTACTTCTTCTTCTACTAC 
GCCTACTACAACGCCACTTGTGGCCAACAGTAAATTATTGTAAACATGACGAAATTCGATAATTGTTATTAACATAAAACAACAACAATCTTTTG 
CATCTCTTCTTCTTCTTTTTCTTTGCATCGATCAACCAAAACACGAAATTCCACCAACACCATCAACAAACACCACCACCAACAACAACAACGAA 
AAACCCCACCAATTGTTTTATTTTGCCATTCCGTTCTTGTCTCCGTTTCGTCCCCGTTCCGCATCCAAATCACGGACTTGACGCAACATCAGCAG 
CGGCCAGGGGCAAGCGGCAACCACAACACCCAACTGGAGCGAACAGTTCCAGCTCGCTGTGGCCAGACTCTCGCCAACGCCAACGACAACGCCAC 
CACAGTCCCTCCTCAACCTCGATCTCGGCACTGAGCGGCGCTAAGAAGGCACGCAAATCCGCAACCAACAGCAGCAGCAGCAATTCCACACAGCA 
GCAGCAGAATCACCACGGCACCAAAATCAACAATCATCATCTCAATGGCTACGGCGATCAGTGGGGGGAGCAAAGCAGGAGTCGTCGCAACTCTT 
CGCCCTCCAACAGTCATAAAAATGAGAGGTATGTTTCGGATTCAATAACTGATCCCAGGGATTCGCGATAAGTAGTCCTGCTTTTTTTTTATAAG 
ATAGATTGTTATATATAATTTATATGTATGTTTCCCTTGCAGAACCTCTGAGCGTCGCGTTCGTCCCATCTACGACATCAATAACATCGTTATAC 
CCTACAGTATGCTAGCCCAGTCGAAAATGGAGATAATTCCCTACAAGGAAATACCCATACCCAAGTAAGTATTCAATTTTGCGCTAAGTCCTTAT 
TTTATATATATATGTCGCATAATGTAAAATACCAAGATTTGTGTATATCTATTAAAATCACCCATTCCATCTTTGCTTTCAGGTGGCGCATTGTA 
GACAGTGATAACGATAAGGGAAAACATTCGTCAGATGAGTCAATGGAGCGCAAACTGAGCAATGGCTGTGTCGCATCAAAGTCAGAGGAGCTAGC 
CAAACAACAGCTACCACCACCACCAGATCAAGATCAAGAGCAGCCGGCCCTGCAGCCTCCGAAGGTTAATGGTTTAAAGGAAAATCAAGCAGTCA 
AGC AC AAT AAC AAT AA T AA T AC AAAC AAT AAT AAT AAT AAAAAT GG AC T GGT AAAT GGT AACGC C AAAAAGG AT G AAC T T AAGGC T GC AG AGG AT 
CACGATACGCAATCAAAAGAAGACGACGATCCAAAGGAAAAGGTGGTCAAACCCAAGCTCAACGGAAACTTGATAAAGAATCCGATTGACAAAAC 
TGCCGAAACGCAAGAC AAAAT TGCTCAGCCAGCACCAGAACCACCACTGCCAAAGCGACCCAAGCTGGAAACTCCAACCTCCAACGAGGTCACCA 
AGGCTAATGGGCAGTTGGTGCAGCAACAGCTGGAATCCCGTGAGAGCGAGGAGGAGCCACAGCATGTCGACGAAGATCTTTCGGATGAAGCGTTT 
ATTATGCGACATCAGCGAGCTCTTCTCGAGGAGCGACGCCGCTTCGAGACTTTCCTTAAGTTCCCCTGGAGCACCCGATCGCGTGCAAATAGGCG 
CGTCGATAGTCGTGCCGAATCTAGTGGCGCCAACACACCGGATCCCGCCTCACCAGCTCCTCATCTAGGCGGCCCTGGTCACGACAACGAAAGCA 
TTCCCTCGCCGCTGGCGTACCCACTGGACGCTTTTAACGAAAGCGGTGAGCTGCTGACTGGCGGACAGGCACGCCAAGCTCGTCGTCGGACTACG 
TCTAGCAAGCTTAAGGATCAGCTGGAACGGCGCAGCACCACGCCCGATCTGCGGGAGGTAAGTCGATGTCCAAGTTTCACATTTGTTGAACTCAT 
ATATAAAATAAATTCTGGTTGTGCCTATTACTTTTGCCGAATTTTGTTTTCAAACTGCTTTGAATGTTCAAATACTGATTGTATAGAATAGTTTT 
CTAATTGAAACCCTTTTCCTTTTTGTATAGTCTTACACTGTGATGCCATCTCCCTTTGAGCCCTTGAACTTTCCGCTCTCCGACGAGGTCTACCA 
GCGATTGCTAGCCGAGACATATGCGCCGCCCAGGTCAATGACCGTCTCAAAGCGGGCCAAAAGCAAGTCGATATCCTCGAACTGTGATGGTGGGA 
ACACCTTCACAGGAGGCAGCGGCAGTCGGCGCAGCAGCAAGTGCAAGATTAAGCCAAATGGTCAGCTAAATGGCCAGCTCAATGGCCACCCTGCG 
GCGGCGAAGTTGAACGGAATAGATGCGGCAAATGAAGGTGGAATTTCTGAGCCGGAAGAGGAAGAGATGCTGCTAGAAGACGAGGATGACGACTA 
TCCCAAGCATCATCTGGCGCCGCTGGATGACGAGGAGCCGTCGACACCAGATGTCGAGCATGAGCTTTACGACTCGGCGATCGATGCCTATCTCG 
CTGATCCGGAGGCGCTAGAGGAGGATATGGGCGAGGATCCCTTCGAGGACGACGATCCCAACGATCCGGAGTGGAAGACACGGACCGAGGGTGTT 
CGCAGCAGACGCATCTAAGACCAAAATCCAAAGTTTTCGTTATTTAAGACTTTTAAACGATTTGTCAATTTTTAAAGCCACAACACACACACACC 
CACAAATCAGCCGGAGCAGACCAACAAATTTTAATTTGTTTCAAACGGTTTGGTTTGCACTCACAACAGCATTGCGTACTCTAAGCAATCGTCTA 
AGTTATTTAGTTCTACGAGCCCACAACAAAAAAAAAAAACACAGATACGCTACACACAAGTACACAACAAAACAAACACGAAGCAGAAGCGAAAA 
TCTTGAAACAATGTAAATACAAGCTAAACGCATAGAGCTAAGCAGTTACTGATAATTGTACACTAACCAACGATCAAGAAATCATCGAACGCACC 
CACCACGTCGCGTGGACGTGGTGGATGCGGTCGGTTGGAGCACTGGTCAGCGGTTAAGTAGGCACCACAACAAACCCACTTGGGCTTCAAGCCGA 
AGTGCGCTAATGATAAAAATATTAAGCTAATCTAAATATATATATATATACGAAAACATTTAAATATATACCATATGCATACATTAACGTGTAAA 
TGCCGAAGAGAGCAGTTAGCTGAAGATTTGAAATGTATTCGCACCTGCACACAAACGATATAACAACTAAGCTATAGAAGGCAATTATCTGTAAA 
TTGATCAAGCTTAACCATTAGCTAACCAACCATTAGCCTTTAGCCCAGCAAGTAATCAGTCGGCGTTGGGGCCGTGTTTTTCTTATATTTTAAAA 
AGTTTATCAGCTGTAAATGTATAGAGCGGAATTCGAAAGGCGGTGTGTGAAAATGGATCGAACGATAAATGTATGTATATGTATTTTAGTTGCTT 
AGGCGTTTTCATTTCTACCTTATAAACACGTTAGCGGCTAATTGTAGCAATTAATACTATAAAACACAAAAAAAAAAACAAAAACAAAAAACAAC 
AACCATAAGGAAACCTAGCGCGGCTATGTCCGTGAAAATGTTTATTCTAATTTCGTTTTTACTATCTACTGCGTATGTTTCACCGCAAAGCTAGA 
ATCCAGTTTAATTCGTTAAAACTTGAAAAGAAATACAACAGAAATGTGAAGATTAACTAAGGACCGCAAAAATACAGAATTATATATTCTTAAAA 
ATGACACATGGAGCGCCAGAAGTGATATGCTGGGACCACCATCGCCAGATGGTGGAATCTGGCCAACATGTGTGTCG 
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Start 
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2074 



Transcript Wo. : CT14980 

AAATGAGGAAAAAAGTCCGTGAAGCGAAAACCGTAATTCCTACAGTGCGAGTCACGCCATTGTGTTTGCAAAAAATATTTGCCAATTGTGCTACG 
GGCAATTTTTCGGAACGGCCGAATTGATTAGCATAAAGAGACGCCTTAGGAGTCGAACGAACCAAATGCCGCAGCAGCCGGCCAACGTATTAGCA 
TCAGAACGATGGCCCCAGCGCTCACAGCCGAGCCACTAAGTCCAAAGGAGAAGCTAACCAGCACCGCCTCGCCCTCGGCCCGCAGCTCCTCGGAA 
AGCGGAGGAGTTGGATGTGCAGTTGGCTTGGTCAAGAAGCCGGCAACACCCACGCCGGCCACGGCCACCACGCGGATTACCCGCTCATCAGCGGC 
AGCGGCCTCAGTTACCGCTTCCCCGCATCTACAACAGCAGCGATCCTCGCTAGCAATCAATAACAAGCGAAAGTCGGGCAACGCTAAAGCCAGCA 
ATGAACCAATGGGGCTGAGCACAGCCCCGACTCCACACAGCACCACTTCACCAAATCCCTCGACGGATACTGCGGCAGCCTCCACCTCTGCCGTG 
GAGCACAACAATAACGACAGCAACAGTACCTCCGCCGGCAACAGCACCTCCGCCGACAACAGCTCCACGCCCAAGGACAACTCCACCGCTCAGCT 
ACTGGCGGATCTAACCATCAACTTTGAGGAGACCATATCCGCGGAGATTTGCCTGAGGAAAACCCTACCCGATGTGAGTCTGGGGAAAGAGCCGG 
TTCCGCCTGCCTCGGTGCTGGCAGTGGCCACAAGTTCAACGAGTGCTGGCAGCGCTCTATCCTTATCTGCAGTGGATGAGGTGACCAAGATAGAA 
ATAGATAATATAGAAGAGCGTCTCAGCCAGCTAGATGGGAATGCCACTGGAAGGCCCGAGGAGCATGTGGCACCACCTTCTGTTCCGGTACCCGT 
GCAGGAGGCGCCAAACACGCCCACCAGGCCACAGGCGCCACCAAGTGCCCATCTTGAGTCTCCGCAGCCGGCGCGAACACAGCAGCAGTCCATTC 
AACAGATGACGCCGCAAAGCACCACCAGTTCCAACAATTTTCTCAAAGACGGAGCAGCTGGTGCTGTTCTAGAGGATCAGGATATCGAGGAGGTG 
CTTAAAGCGCTCAAAACATTTGACGGTGGCCATGTTAATCCAGATACCATTTGCGAGTTTTTCGACGAAGTTTGGGAAGAGGCAGCACCGGCAGC 
GCCATTGCCGCCCGCGCCACCCGAAAATGTGGTGGATGTGCCGGGCGCGAAGCCCAGCTGCAGTGGCATCCTCGTTAGCGGCAGCAACATTTCCC 
AGCCGAATGTGATTATTAAGCAGGAGCAGCGGCCCTGGCAGGAGAGTCATGCTGAGTTGGAGCAGCAGCAGCACGTTATCTCGCGCAGGATCGAG 
TTTTTGCTACGCAGGATGCGCAAGCTCCAGGCGCGAGCGATGTGCCGTCACGCGAGCGAGGAGGTGGTTGGTATCATGGAGTGGTCTGCCCGCAG 
TTCCCATAAGGCTCCAGTTCCAGCGAGAAGTTCCACGCTGACTGAGCAGGAGGCCACTGTCCTATCGATTGTCTCAGGGCGTCCGAGCTCAACCT 
TCTGGGAGGAGCAGAACAAGCATCCGCTGCCTGCTAGCCAGATGAGTAATGTGATCCGGCATATTGCGACGGCGGCAAAGCATCAGCAAATCTGC 
CACTCAGCCAGCGGAAGCTCTGCTACTCTGGCGCCATCCTCCAGCTGGTACAACAGCTGCAACAGCTCTGCGCTGCCGATGAAGCGTCCACGAAA 
GAATCAACTTGATGCTTCGGTTGCAACGGGTGCCACACCCACTGCACCTGCGCCAGGAGCGGTAACATCTAGACAGGCAGCTGGCACGACGGGAT 
CCGCAACGGGTTCGAATACACCGCGGGCGGATGATATCGTGCCCGGCTACGACACCTATGTGACTAATGAACTCACCCATGTGTCTGGCCTGCTG 
CACACGGAGCTTCGAGAAGTACAGAATGCCATCGACTCGGACGCCACGGAGTCAAGTTCTGGTGGAGAATCCGCTGACGAGATGGTAACCTACAA 
CAACAACCAGCAGCTCTCGCTACCAATCACTCGACGCGGTGTCTGGCGATATTCGAGAGATCGTGCAGCCATTGCCCTGCGCTGGTCCTGGCTCT 
GCTCCCAGCTGACCGATTTGGAGATGAAGATCCGTCAGGACAGCGACCTTTACATGGATCTAACTCAGTCCAAGGGCGAGATCCAGCTGGAGTCC 
ACAGCTAAGACCTCACCGTCGCTAGCACAGCCGGCAAATGGCATTAAGGAGGAGCCCGGATCCGACTACCTCTGCAGTCGGGCACGTCCATTGGT 
GCTGTCCGAGTTCCGCAAACGCAAGCTCTTCCAGACCACGAACATGCACACGATTTCCAAGAAAGCCGCACGCCCCAGCAACATTAAGTGTGGCT 
GCCAGTGGCCGCAGGTGCCATGCACACTGTGCACGGGTCGGACAGACCCTACGGCGCCCAGGGATCTGGTCGAGACGATGATGCCGCAAAACCGG 
GTGGCATTGCTCGATGCTGGCTACCATCCAGTGCTCAGCTTCGCCAGCGATGTCAGTCAGTCGGTTCACTTGGAGGCCGTTGCCCGCCAGCCGGA 
CTGGCAGTACCGCGTCATGCGCAGCCAGCCCAAGGCCATTGTCAAGGCCATGTGGAAGGCGGAGCGCGAGACGATAGCCTCAGGCGGTGTCGGCG 
GACAAGGTGGCAGCCGGCGGTCGGGCGATGCGGCGAAGCGGCGTTATATACGACGCAAGGAGCGCAACAACAACTCAAATAAGGAGGCTGGGAGC 
GGAGCTAAAGCTGCTGCTGTCGCCGCAGGAGGAGCAGGCGGAAACGGAAGCAGTAGCGTCGGGGGAGGGGATAGCGGAAACGGTACTGGTACTGC 
TACTACTTCTTCTTCTACTACGCCTACTACAACGCCACTTGTGGCCAACACAGCGGCCAGGGGCAAGCGGCAACCACAACACCCAACTGGAGCGA 
ACAGTTCCAGCTCGCTGTGGCCAGACTCTCGCCAACGCCAACGACAACGCCACCACAGTCCCTCCTCAACCTCGATCTCGGCACTGAGCGGCGCT 
AAGAAGGCACGCAAATCCGCAACCAACAGCAGCAGCAGCAATTCCACACAGCAGCAGCAGAATCACCACGGCACCAAAATCAACAATCATCATCT 
CAATGGCTACGGCGATCAGTGGGGGGAGCAAAGCAGGAGTCGTCGCAACTCTTCGCCCTCCAACAGTCATAAAAATGAGAGAACCTCTGAGCGTC 
GCGTTCGTCCCATCTACGACATCAATAACATCGTTATACCCTACAGTATGCTAGCCCAGTCGAAAATGGAGATAATTCCCTACAAGGAAATACCC 
ATACCCAAGTGGCGCATTGTAGACAGTGATAACGATAAGGGAAAACATTCGTCAGATGAGTCAATGGAGCGCAAACTGAGCAATGGCTGTGTCGC 
ATCAAAGTCAGAGGAGCTAGCCAAACAACAGCTACCACCACCACCAGATCAAGATCAAGAGCAGCCGGCCCTGCAGCCTCCGAAGGTTAATGGTT 
T AAAGGAAAATCAAGCAGTC AAGCAC AATAACAATAATAATAC AAAC AAT AAT AAT AAT AAAAAT GGACT GGT AAAT GGT AAC GCC AAAAAGGAT 
GAACTTAAGGCTGCAGAGGATCACGATACGCAATCAAAAGAAGACGACGATCCAAAGGAAAAGGTGGTCAAACCCAAGCTCAACGGAAACTTGAT 
AAAGAATCCGATTGACAAAACTGCCGAAACGCAAGACAAAATTGCTCAGCCAGCACCAGAACCACCACTGCCAAAGCGACCCAAGCTGGAAACTC 
CAACCTCCAACGAGGTCACCAAGGCTAATGGGCAGTTGGTGCAGCAACAGCTGGAATCCCGTGAGAGCGAGGAGGAGCCACAGCATGTCGACGAA 
GATCTTTCGGATGAAGCGTTTATTATGCGACATCAGCGAGCTCTTCTCGAGGAGCGACGCCGCTTCGAGACTTTCCTTAAGTTCCCCTGGAGCAC 
CCGATCGCGTGCAAATAGGCGCGTCGATAGTCGTGCCGAATCTAGTGGCGCCAACACACCGGATCCCGCCTCACCAGCTCCTCATCTAGGCGGCC 
CTGGTCACGACAACGAAAGCATTCCCTCGCCGCTGGCGTACCCACTGGACGCTTTTAACGAAAGCGGTGAGCTGCTGACTGGCGGACAGGCACGC 
CAAGCTCGTCGTCGGACTACGTCTAGCAAGCTTAAGGATCAGCTGGAACGGCGCAGCACCACGCCCGATCTGCGGGAGTCTTACACTGTGATGCC 
ATCTCCCTTTGAGCCCTTGAACTTTCCGCTCTCCGACGAGGTCTACCAGCGATTGCTAGCCGAGACATATGCGCCGCCCAGGTCAATGACCGTCT 
CAAAGCGGGCCAAAAGCAAGTCGATATCCTCGAACTGTGATGGTGGGAACACCTTCACAGGAGGCAGCGGCAGTCGGCGCAGCAGCAAGTGCAAG 
ATTAAGCCAAATGGTCAGCTAAATGGCCAGCTCAATGGCCACCCTGCGGCGGCGAAGTTGAACGGAATAGATGCGGCAAATGAAGGTGGAATTTC 
TGAGCCGGAAGAGGAAGAGATGCTGCTAGAAGACGAGGATGACGACTATCCCAAGCATCATCTGGCGCCGCTGGATGACGAGGAGCCGTCGACAC 
CAGATGTCGAGCATGAGCTTTACGACTCGGCGATCGATGCCTATCTCGCTGATCCGGAGGCGCTAGAGGAGGATATGGGCGAGGATCCCTTCGAG 
GACGACGATCCCAACGATCCGGAGTGGAAGACACGGACCGAGGGTGTTCGCAGCAGACGCATCTAAGACCAAAATCCAAAGTTTTCGTTATTTAA 
G AC T T TT AAAC G A T T T GTC AA T T T T T AAAGC C AC AAC A C AC AC AC AC CC AC AAAT C AGC CGG AGC AG AC C AAC AAAT TT T AAT TT GTT T C AAAC G 
GTTTGGTTTGCACTCACAACAGCATTGCGTACTCTAAGCAATCGTCTAAGTTATTTAGTTCTACGAGCCCACAAC 
(SEQ ID NO: 1406) 

Start ATG: 199 
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MAPALTAEPLSPKEKLTSTASPSARSSSESGGVGCAVGLVKKPATPTPATATTRITRSSAAAASVTASPHLQQQRSSLAINNKRKSGNAKASWEP 
MGLSTAPTPHSTTSPNPSTDTAAASTSAVEHNNNDSNSTSAGNSTSADMSSTPKDNSTAQLLADLTINFEETISAEICLRKTLPDVSLGKEPVPP 
ASVLAVATSSTSAGSALSLSAVDEVTKIEIDNIEERLSQLDGNATGRPEEHVAPPSVPVPVQEAPNTPTRPQAPPSAHLESPQPARTQQQSIQQM 
TPQSTTSSNNFLKDGAAGAVLEDQDIEEVLKALKTFDGGHVHPDTICEFFDEVWEEAAPAAPLPPAPPENVVDVPGAKPSCSGILVSGSNISQPN 
VIIKQEQRPWQESHAELEQQQHVISRRIEFLLRRMRKLQARAMCRHASEEVVGIMEWSARSSHKAPVPARSSTLTEQEATVLSIVSGRPSSTFWE 
EQNKHPLPASQMSNVIRHIATAAKHQQICHSASGSSATLAPSSSWYNSCNSSALPMKRPRKNQLDASVATGATPTAPAPGAVTSRQAAGTTGSAT 
GSRTPRADDIVPGYDTYVTNELTHVSGLLHTELREVQNAIDSDATESSSGGESADEIWTyNNNQQLSLPITRRAVWRYSRDRAAIALRWSWLCSQ 
LTDLEMKIRQHSDLYMDLTQSKGEIQLESTAKTSPSLAQPANGIKEEPGSDYLCSRARPLVLSEFRKRKLFQTTKMHTISKKAARPSNIKCGCQW 
PQVPCTLCTGRTDPTAPRDLVETMMPQNRVALLDAGYHPVLSFASDVSQSVHLEAVARQPDWQYRVMRSQPKAIVKAMWKAERETIASGGVGGQG 
GSRRSGDAAKRRYIRRKERNNNSNKEAGSGAKAAAVAAGGAGGNGSSSVGGGDSGNGTGTATTSSSTTPTTTPLVANTAARGKRQPQHPTGAMSS 
SSLWPDSRQRQRQRHHSPSSTSISALSGAKKARKSATNSSSSNSTQQQQNHHGTKINNHHLNGYGDQWGEQSRSRRNSSPSNSHKNERTSERRVR 
PIYDINNIVIPYSMLAQSKMEIIPYKEIPIPKWRIVDSDNDKGKHSSDESMERKLSMGCVASKSEELAKQQLPPPPDQDQEQPALQPPKVNGLKE 
NQAVKHNNNNNTNNNNNKNGLVNGNAKKDELKAAEDHDTQSKEDDDPKEKVVKPKLNGNLIKNPIDKTAETQDKIAQPAPEPPLPKRPKLETPTS 
HEVTKANGQLVQQQLESRESEEEPQHVDEDLSDEAFIMRHQRALLEERRRFETFLKFPWSTRSRANRRVDSRAESSGANTPDPASPAPHLGGPGH 
DNESIPSPLAYPLDAFNESGELLTGGQARQARRRTTSSKLKDQLERRSTTPDLRESYTVMPSPFEPLNFPLSDEVYQRLLAETYAPPRSMTVSKR 
AKSKSISSNCDGGNTFTGGSGSRRSSKCKIKPNGQLNGQLNGHPAAAKLNGIDAANEGGISEPEEEEMLLEDEDDDYPKHHLAPLDDEEPSTPDV 
EHELYDSAIDAYLADPEALEEDMGEDPFEDDDPNDPEWKTRTEGVRSRRI* 
(SEQ ID NO: 1407) 



Celera Sequence No. : 142000012807541 

ACACTGCCATCGCTGCCATCCCGTCGCTTGGTGGGCGTGGCAGGGGCACTGGCCGCGCCTGATTTGTAGGCGGGATGATCCCGCTCAGCATCGCC 
GGCGGAGGAGAAGGAGGTGTCCAGTAGCGCGTCGCCCGACGAGGTCGGGGTGCCACGTGAATTAGTGCGCTTCTTCATCGCCTAACTACAAATTC 
GATTATCAAATGTAGGCGGGTGGCGTGATGTGGTTGTTGTTAGGTGGTTGGTGGCTCGGAATCGGTTTGGTGGCTAAGTACATGCGGGGACTGAA 
GGTTGCGCAACATTCGATTCGATTCCACCAAGACTTGCCACTCGGCTCTCCTCGCAGCGGTTTCTATGGCTCCCGGCTCGAGTGACCTCGATTGC 
TATGGCGTGGGCACGGCTCCTGGATCCTCGACTCGCGTTCGATCGGCGCACGTTTATTTTATCATTTGCCAGCTACCGTCCGTTGTTGTCAATTA 
ACATGCTGCGCGCGGCTTGGACCAATAAAAATCGACACTTGCCGGCGGTGGTGTTGCCGGAGGGTTGAAATGTGTGACTTGATAACAGGGGCGTT 
GCCAGATGAATAGCGGAACGTTGGGGGTGATCCAGGTATCTTATGATGACAAATATTTTGTATATTTTCAGATCGTATTCGAAAAGCAGTCATCA 
CCCTACTTTATAACAAGATTATATTGTCACTATTTTAGAGTTATATAGCGTTAGTTATATAACTTAGTTCTTTACCTTATTTTTTCTATATAATA 
ACTTTCTAGGCAGCATTTATGGCATTCCCGAGGATAAAATGATTTTTTGTTTACAAACAGTCAAATAATAATGAGGTCACGCATAACATTATTCA 
GATTGGATATTTCAACATAGTGGAATCCTGGAACACCCCTAATGCTATGACTGTAGCCCTGAACAGCTGATTGCACTTGTTGACTAAAACGCAAT 
CCACGACTTTGCATTGGTTTCGGAATAAATTGCTTTTAAATCGCTTTATAATGCTGGACAAGGTTAAAAATGTTATCGTAGTGCTATCAGGCAAG 
GGCGGAGTAGGAAAATCCACAGTGAGCACCCAATTATCGCTGGCACTGCGAAAAAACGGATTTAAGGTAAGCCAAACCATTTCACCACAAAAACA 
TTACCGATGAATTTTGAATGGTACTCATATGAACAATCTATTAGGTTGGTCTGCTCGATATAGATCTGTGTGGACCGAGTGTGCCATATCTATTG 
GGTCTGGAAGGACGGGATATCTTCCAGTGCGACGATGGATGGGTTCCTGTCTATACAGATGAATCCCAAACCTTGGCGGTGATGTCCATTGGATT 
TTTATTGAAGAACCGTGAGGATCCGGTCATTTGGCGTGGTCCCAAAAAGACCATGATGATCCGGCAGTTCCTCACCGACGTCAGGTGGGATGAGC 
TGGATTACCTAATTATCGATACGCCACCCGGCACTTCGGATGAGCACATCACCGTAATGGAGTGCCTGAAGGAGGTTGGTTGCCACGGAGCCATA 
ATTGTGACCACACCGCAGGAGGTGGCCCTGGACGATGTACGCAAGGAGATCACCTTCTGCAAGAAGACCGGCATCAATATCCTGGGCATCGTCGA 
GAATATGAGTGGGTAAGTTATTTGAGTTTGTTAAGCCGCAAGACAGACAACAAAATAGGATTCATTTTAATTTGTCCTCAAAAAATTGGAACTCC 
AATGCAATGCAACGAGAACGATATCCATCCCTGACTGAAAAGAGTTTTACAAGCTTCTGATAAGCATTTCTACTGTATTTGATGCTGACTAATTG 
AAAATGCAAATATACATAGATATTAATTGGACGCCTTTGCGTGTCAAGTTTACGGCAGGGCGACACCTGTGGACTTTAACCAGGTGATAATTATC 
TCATGTTTGACTAAATTATCTTATATTTGATTGTCACAGGTTCGTGTGCCCCCATTGCACCTCGTGCACCAATATATTCTCGTCAAACGGTGGCG 
TCTCACTGGCCACTTACGCCCAAGTACCACATCTAGGAACTCTGCCCATAGATCCCCGTGTTGGCATTTTGGCGGGCACCACTACGTCCGTCTTG 
GATGAGCTTCCGGATTCCACGACAGCGGAGGTGCTGACGCATATAGTTGAGAAGTTGAAAACGATGTTGGTCTCCTAAATAAATACCAACATAAC 
AATTTTGTAAATATGATAGCTTGTTTCATTATTCAGTTTATGAATATAACCATTTTCTTCAGGAAACGATTTAATTTGCCATGGTATTTAATTGG 
TAAACTTAAATAGGGACTCCCATTGTTAAACACATTTTAGCTTTTGTTTAGCTTTGGTTTCTTTTTCTTTATTTTTGTATGCTTTGGAATGAATG 
TAGTTGATGAGTTTTGGGTATAATAATTTAAAATTATATGAGTGCGAGCGTTTGATATGTAATCACTCTAATGCAACAATTCTGCGTGTGCAGTA 
ATTATAATTTACAATCAAACTAATACTTTCCAATTATAATTAATAACGTATTCCTTAACTGCTTAACTCCTACTGCTAAACCTAATCGCTTTATT 
TATTGATTTGAGCATAGTTTAAAGATCAAATTGTATGCTTATCTCTGGACTGACGACAAAGTATAGTTTCTTTTCCTAAGTAGTACTATCATGCA 
G AC T T C AT C C ACT GTT T TG GT AT AATTT T T AAG A T C AAAAA TC AGT C T GC A G G A GG AT G G AAT T C AC AC AG AG AAAAAAC GAACT GAT AAAC T G A 
GATGGCAACCTGGAACTATAATTTTCTCTCTGTGCAGGAATCACAACTTTTGCTCCGTCTTCGCTTTCTCCGTTTCATTCAGCTTTCGAACTGGA 
TTTAATGGTATGGCATGGTATTTTTTGTTTTTCGAATATCTCTTCACGATGGCATAAAGGGATATAAACACTATAAGTATAACAAAGCCCATTAT 
AACCACGCATTCCAAAGGCCAATCGGTTTGGATGCGCTCCTGGGGCACCTCTGCGTAGGGATACCTTATTTGCTGGATCTCCATACTGTTGAGAT 
CCTTAATCTGACTGGTCGATTCGGGCGCCACGACCGCCGGTTGATTCGTGATGGGATTATCGGTCACCACGGATAAGAAGTTAGTTGCATTGACT 
AAGATCACGGTGGGCTCATCAGTTGCTTTCGGT 
(SEQ ID NO: 1408) 

Exon: 1001.. 1111 
Exon: 1185.. 1627 
Exon: 1940.. 2168 
Start ATG: 1001 

Transcript No. : CT15597 

ATGCTGGACAAGGTTAAAAATGTTATCGTAGTGCTATCAGGCAAGGGCGGAGTAGGAAAATCCACAGTGAGCACCCAATTATCGCTGGCACTGCG 
AAAAAACGGATTTAAGGTTGGTCTGCTCGATATAGATCTGTGTGGACCGAGTGTGCCATATCTATTGGGTCTGGAAGGACGGGATATCTTCCAGT 
GCGACGATGGATGGGTTCCTGTCTATACAGATGAATCCCAAACCTTGGCGGTGATGTCCATTGGATTTTTATTGAAGAACCGTGAGGATCCGGTC 
ATTTGGCGTGGTCCCAAAAAGACCATGATGATCCGGCAGTTCCTCACCGACGTCAGGTGGGATGAGCTGGATTACCTAATTATCGATACGCCACC 
CGGCACTTCGGATGAGCACATCACCGTAATGGAGTGCCTGAAGGAGGTTGGTTGCCACGGAGCCATAATTGTGACCACACCGCAGGAGGTGGCCC 
TGGACGATGTACGCAAGGAGATCACCTTCTGCAAGAAGACCGGCATCAATATCCTGGGCATCGTCGAGAATATGAGTGGGTTCGTGTGCCCCCAT 
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TGCACCTCGTGCACCAATATATTCTCGTCAAACGGTGGCGTCTCACTGGCCACTTACGCCCAAGTACCACATCTAGGAACTCTGCCCATAGATCC 
CCGTGTTGGCATTTTGGCGGGCACCACTACGTCCGTCTTGGATGAGCTTCCGGATTCCACGACAGCGGAGGTGCTGACGCATATAGTTGAGAAGT 
TGAAAACGATGTTGGTCTCCTAA 
(SEQ ID NO: 1409) 

Start ATG: 1 

MLDKVKNVIVVLSGKGGVGKSTVSTQLSLALRKNGFKVGLLDIDLCGPSVPYLLGLEGRDIFQCDDGWVPVYTDESQTLAVMSIGFLLKNREDPV 
IWRGPKKTMMIRQFLTDVRWDELDYLIIDTPPGTSDEHITVMECLKEVGCHGAIIVTTPQEVALDDVRKEITFCKKTGINILGIVENMSGFVCPH 
CTSCTWIFSSNGGVSLATYAQVPHLGTLPIDPRVGILAGTTTSVLDELPDSTTAEVLTHIVEKLKTMLVS* 
(SEQ ID NO: 1410) 

Classification: hypothetical 



Celera Sequence No. : 142000013384010 

TTTTGGTTACTTGAAACGCTAAGACTGAAATGGCTTGTTATGGAGAAGAAAACACGTCCTGCAATACATTATATAATAAAAAGAACCCAGCAATT 
TTCGCATCACCACATTACAGACAACAAAGTACTTTGTCCTCATCGTTCTGTGTGATCCAGATCCCCGACTATATATGATATATACACCTAGCTAG 
ATT T AAGC G AT T T GT AC AT AC AG AAGC AAAAGC AAAAC AAT AT AC C AGAAAAT TG C C T GT T AAAT AC AT T TAT AT AC AC AT AT G AAT T T AGTT AT 
ATCACGATGACTAGTTTTTAGTTCTTAGAGGTTATTTAGTTAGTAAGGAGCCCAACTGACGTAACGTGAATAAATCAACTATATATGTGCCCATA 
TATAAAGCGTACTATTTAATTATTTAGTTTTTACTGGACTTATGACACAACGAGCCACTGTAATGTCTTATTCAGCTCAATGCTCATAAACAAAA 
TAAATATTTTTGGAAAAAATACTGTAGTTCTTCTTATTTAGCCAAGACGTCCAAACCTTTTTTTAATAAATTTAATTGAGATGAAGATTTGCATT 
CGAATTCTATAGAAGCATCAAACAAATATTACAACAGTATCGAGTGGGTTATTAAAAATTAAATATGTAAAAATCGCTCATTGTGTTAAAATATT 
AAT T AT AAT AG AT T AGCTC AAAC ATC TT AAA T C GAC A T TT T GT GC AAC ACAT AAAAAT T C T CGC AGAC AAT T GGC AGCT GAT T ACG AT T AAAC AA 
AACCGAAATTAGCCGTTTTCCTTTAAAGCAATCAGCATTCGTGATCTCTTGTTTGGTTACCTCACAATGCTAGTTAGTTATAGCGCCATAAAACC 
CCATTTTCAATGTCAGGGAAATCATAAGGTCATAAGGTTATCCCCACGATCGTTCTAATGTGATCACAGCCACAATCTATAATAACCAGTTCCCA 
ATAGATAGACAAGATTAGAAAACGCAGCTTGGTAGAGTTTTTTATATTTTATAATTTAATTTATTCAGAGTGCTTTCTTGGGTATCGGGTCAGAC 
AATTGAAACAGCATTAATATCATATTCGCATGTGTGGCCTGTGGGGGCAGCGGAGAGGAGATAACAAGGGGCTGGCCATCAAAGGTACAACATAC 
GGACATACTTTCCAGCGAAAGCTAGACACACATAGGATTAAATATAGCACATAGTATAACAGAATATATATGGAATTGAATACTTTGAACTTGAG 
TGTAGCTTAGTTTCGTACGATAGAATGTTTGGCTTTTAAAGTTAGAAAGATGTGTTTATAAAAAAGTGTGTATAAGTGCTGTGTTAGAGAAATGC 
GT AAAGT GTT G T T T TGG AAT GT AC AT GT GC AATGGC AC ACAC G C AAC AT T AAT GT AT T CT T GCT GAG AAAC AAGTATAAAGC ATT T TC GC T A T AC 
TGAAACCGAAATTCGTTAGAAAGTAAAAGTATGTTGCTCCTGTGTCTGCCCACAATCGCTCCAACTAGTGAACTCTGTCCTTAAGGTAACCAATT 
GGACTAGCCCACTTAATGCTGCTTGGCCTTCAGGCTAGCTGGCATCTCTGGTGGTGGTGGGCGGGGAATGCCCAGGGCGACCTTCACACCATCGT 
AGATGAACCACTGCAGAGCGGTCAGGGTACC GAT CATGATAATACGAGGAGTCAATCCGTTCC ACAT GCCACTGAAGCCCAGCGACTTGGCCACG 
CTGATGGCGCTAGCTCCCTTGGCCTGGTTCAGCTTGGACACCACCACATCAGCGGGATGCGACACCACGGCACAGAACACACCGGCGATGTAGCC 
AGCGGCGAAGGTCACGATCAGCTGCTCGCCCTTGGTGCAGTCGGCACGAGGCTTGGGCACCACATACCTGGGAAAAGTTGCATCATTAGTATTGA 
TTTAACGGGGTTCGTTGGGGCTTGACAACCCGAACTGTGAACGGTATTGCTCTAGTTTTTACATTTGAACTTTCAGTGGCTACCTGCTAATTTGC 
ACTTCAAAAGGAATGCTTTGCTTTTTTTATTTTATCCATAGTAACAAACTAGTTCTGAAATAATATGATTTTCCTATATTTGAACAAGATAAGCA 
TTCGTCGTTGGTTTTTATTCTGGCGTGCCATTTATTAGATACGAATGGCATTCTTATCTGCTGACGCGCTAAAAGTTTATTTTATTATGAATACT 
TTCAGTTTTGATCAATTAGCATATCACTTTTGAAATTTCACAGATTAACAGTCTACCTTTAAACTATTGTAATAGCAAAACGCTAATACGAATAC 
AGATATTGTGCACAGCACTGGTAAGTTTTTCCGCTGTAGAAACTAGAGGATACCTAAAGAGGAAATGTTACTTACTTGTAGAGCAGCTCCACGGT 
GCGCTCGAAGCAAGCGAACTTCATCATGGTGTATGGGATCTGTCGCATCCACAGGGGAACCAGACCCTTGTAGAAGGCATTGACGCCCTCCTCCT 
TTAGCATCTTGGGCACTGCCTCGCGGAAGTTGTTGGCGTATCCGGGAATAGTCTGGATCTTCACCTTGGCGGCCTCGAACGGCGCCAGAGCGATA 
TCGGCGAAGAACTCGGCCGAAGCGGAAGCAGCCAGATACAAGGAGGTGCGGTACAGGTAGGCGTTCTCCTCGCCAATGATTTCGGCGTACTTCAC 
CTTGAACAACTCGTACAGACCGAACTTGCACAGACCCTGAAAGATACAGAAACGCGGATTAGCCTGCTGTTTCTCAGAAGTTATTTTTCAGTTTG 
CAGTAGTATCCAAGATTTTATCTTCGGTATCTGTTTCTCCCGTAACCATAAAGTTGTCAAAGGTGTGTTGTTATTCTGAGGTATACTAAGATATC 
AGCCTAGGAATCAAACTTCCTAATGTTCGATTCGAATCACTTTTATCTGCTTCAAGCCCGGAACCGATGTAATTAAGGTCAGCCGACCTGGTCAC 
GAGCATATAATTCGCCTCCAGACGATTATGCAACCCATAAAGTGAAGCCCAATCTCACCTGTGCCGAGTAGCCGAGCAGAGTGGGGAACCAGCCC 
TTAGCCAGTCCGCGGGCGCCCTCCTCCGCCACGGTGACCTTGAATCCGTGCACCAGATTCTTGTACTTGGCCTGGTCGACCTGCAGACGGCACTT 
CACCAGATCCAGTGGGACCACGAAGGTGTGGGTGGTGCCGCAGCTGAGAATGCCACCAATTCCGCACAGAGCGAAGTACTTGGTGCTGCCGAATT 
CGCACGAATCCTGCTGGTTGGCCACCGGGGTAGCGGCGGCGGCGATCTGGCGACCCTCAACCGGCTGGGGTTCCACCACTGGGGCGGCGGCATCG 
CATCTTGCCATCGACATGGGGGTGCGGAATGGCGAGTTCCGGGCAGTTTCAAAGAAGCTGGAGAACATCTTGATTCTGTTCTTGTTCTGGGGTCG 
GCTGAAAGTAGAGAGACCATCCATTAGTTACGTATATATGTACCTCGGAGGCGACATGTGCTTTATCTAATTATATATATAGCTGGGTAATCAGC 
TGCTTTTCGCTTTTGTAGGTTACATAACGGCGTGCCTCGAATTCCGAGCACCCGGACTTACGACACCAAATTTACCCGAAAAGCGAAGGATTTTA 
AGCAACCACGTAAATTTAGTGTATGTTAAGGGAGCGATAGAGCACCGCTCGGCTACTTTGATTTCGATTTTACAAAACTTATGCAATTTTCGCTC 
GGAATTTTCGCACTTACTATTCGTCAGCAACGTTAGCGGCTCGTTGGCGAGGATTCAAAGAATGAGGGGGGGCTCACGGAGGTGCAGCGATTTCT 
CACGAATACCAAAACGCCTTAATGGTTTTGCCGTGGAGAGGTGTAACTAGCGGATTGAGGATAGTTGGCCACACTCATGTCAAAATCTTCGGCTA 
TGGACTGCAACCTTGGTGCAAAACTTTGGTGGCGCCTGAGCGTGGACTCGAACCATCTGCATACGAGGTATCAGGTGAGATTATTTTAATGCCAT 
TGCCTTCGTTGTGAATAAAATATTAATTATTAATATTAATTTCATATTGATAACATTAGTTGTAAATATTTTGAAAAAATAATAAATTCATTTGT 
AAAATCCTCATATTTTTGCATAGTATTTGTGATTTGTTTTCTATTTTTACACAATAAAACAAATTTTAATCTAAAGCTTTATTGTATTTAGAACT 
CTCTGTAAAATATATAACAATTTTCAATTATATGTACATTTTAAATTTATAAGTTATAGCCCCATGGTAAAACTGGATAATTTTATGAGTAAATA 
TGTGAAAATAAAGTTCTGCCATGATTTCACATGGGGTTGGCCCAATCCTTCACTTTTTACGCACGGACAGGAAATTAGGCCTTTGTATTCCATTT 
TATATTTCACTCTGCAAGTGAAGCAAAATTCACCCTGGTGATCTTCAGTTAATAAGATTTGACAACCGATTTCTGGTGACCAAACGCAGCAGACT 
TTGGCATTCATAGTATCTGTAAAATTAAAAAATTAGTCTTTTATTACTATTCAAATGCTTCTTATACCGATATGCAGTCCGGGTGGAACGCAAAC 
ACCCTCTTTATTGGTTGTTTTTGGGAATTCACCTTGTAGACGACCCAATCGCCTATAAAAGTTTCCTTCATAACAACTGAATCCTGTCTCTTTAA 
TCCGACACATTTCCGTATTTTCCTTCCTACAATCTAAGCATCTCCCATTTTCTATTATAGATACGTTTTTTTCGGCACAGGTGATCAACAGTCTC 
C C AC TG AG T AT AA T A A GAAAAT A T AT A 
(SEQ ID NO: 1411) 

Exon: 3777.. 3723 
Exon: 3421.. 3004 
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Exon: 2696.. 2356 
Exon: 1872.. 1001 

Start ATG: 3393 (Reverse strand: CAT) 
Transcript No. : CT15958 

CCCCCCCTCATTCTTTGAATCCTCGCCAACGAGCCGCTAACGTTGCTGACGAATACCGACCCCAGAACAAGAACAGAATCAAGATGTTCTCCAGC 
TTCTTTGAAACTGCCCGGAACTCGCCATTCCGCACCCCCATGTCGATGGCAAGATGCGATGCCGCCGCCCCAGTGGTGGAACCCCAGCCGGTTGA 
GGGTCGCCAGATCGCCGCCGCCGCTACCCCGGTGGCCAACCAGCAGGATTCGTGCGAATTCGGCAGCACCAAGTACTTCGCTCTGTGCGGAATTG 
GTGGCATTCTCAGCTGCGGCACCACCCACACCTTCGTGGTCCCACTGGATCTGGTGAAGTGCCGTCTGCAGGTCGACCAGGCCAAGTACAAGAAT 
CTGGTGCACGGATTCAAGGTCACCGTGGCGGAGGAGGGCGCCCGCGGACTGGCTAAGGGCTGGTTCCCCACTCTGCTCGGCTACTCGGCACAGGG 
TCTGTGCAAGTTCGGTCTGTACGAGTTGTTCAAGGTGAAGTACGCCGAAATCATTGGCGAGGAGAACGCCTACCTGTACCGCACCTCCTTGTATC 
TGGCTGCTTCCGCTTCGGCCGAGTTCTTCGCCGATATCGCTCTGGCGCCGTTCGAGGCCGCCAAGGTGAAGATCCAGACTATTCCCGGATACGCC 
AACAACTTCCGCGAGGCAGTGCCCAAGATGCTAAAGGAGGAGGGCGTCAATGCCTTCTACAAGGGTCTGGTTCCCCTGTGGATGCGACAGATCCC 
ATACACCATGATGAAGTTCGCTTGCTTCGAGCGCACCGTGGAGCTGCTCTACAAGTATGTGGTGCCCAAGCCTCGTGCCGACTGCACCAAGGGCG 
AGCAGCTGATCGTGACCTTCGCCGCTGGCTACATCGCCGGTGTGTTCTGTGCCGTGGTGTCGCATCCCGCTGATGTGGTGGTGTCCAAGCTGAAC 
CAGGCCAAGGGAGCTAGCGCCATCAGCGTGGCCAAGTCGCTGGGCTTCAGTGGCATGTGGAACGGATTGACTCCTCGTATTATCATGATCGGTAC 
CCTGACCGCTCTGCAGTGGTTCATCTACGATGGTGTGAAGGTCGCCCTGGGCATTCCCCGCCCACCACCACCAGAGATGCCAGCTAGCCTGAAGG 
CCAAGCAGCATTAAGTGGGCTAGTCCAATTGGTTACCTTAAGGACAGAGTTCACTAGTTGGAGCGATTGTGGGCAGACACAGGAGCAACATACTT 
TTACTTTCTAACGAATTTCGGTTTCAGTATAGCGAAAATGCTTTATACTTGTTTCTCAGCAAGAATACATTAATGTTGCGTGTGTGCCATTGCAC 
ATGTACATTCCAAAACAACACTTTACGCATTTCTCTAACACAGCACTTATACACACTTTTTTATAAACACATCTTTCTAACTTTAAAAGCCAAAC 
ATTCTATCGTACGAAACTAAGCTACACTCAAGTTCAAAGTATTCAATTCCATATATATTCTGTTATACTATGTGCTATATTTAATCCTATGTGTG 
TCTAGCTTTCGCTGGAAAGTATGTCCGTATGTTGTACCTTTGATGGCCAGCCCCTTGTTATCTCCTCTCCGCTGCCCCCACAGGCCACACATGCG 
AAT AT GAT AT T AAT GCT GT T T C AAT T GT C TG ACC C GAT AC C C AAGAAAGC AC TCTG AATAAATT AAAT TAT 
(SEQ ID NO: 1412) 

Start ATG: 84 (Reverse strand: CAT) 

MFSSFFETARNSPFRTPMSMARCDAAAPVVEPQPVEGRQIAAAATPVANQQDSCEFGSTKYFALCGIGGILSCGTTHTFVVPLDLVKCRLQVDQA 

KYKNLVHGFKVTVAEEGARGLAKGWFPTLLGYSAQGLCKFGLYELFKVKYAEIIGEENAYLYRTSLYLAASASAEFFADIALAPFEAAKVKIQTI 

PGYANNFREAVPKMLKEEGVNAFYKGLVPLWMRQIPYTMMKFACFERTVELLYKYVVPKPRADCTKGEQLIVTFAAGYIAGVFCAVVSHPADVVV 

SKLNQAKGASAISVAKSLGFSGMWNGLTPRIIMIGTLTALQWFIYDGVKVALGIPRPPPPEMPASLKAKQH* 

{SEQ ID NO: 1413) 

Name: mitochondrial carrier protein; phosphate transporter 
Classification: transporter 
Gene Symbol : Mpcp 
FlyBase ID: FBgn0026409 



Celera Sequence No. : 142000013385184 

CGACGGCCTCCTTGGTGGGCAGGGTTTTGAGGATAACCACCACATCGGCTACATTGTTGCCCGTCAGTATGGTGCCCTTCTTGAAGGAGCCAACC 
TGACGCACTTCCTCCAGTTGCTGTGGAATAATCAGAACAGTTTAAACATGCAAATCACAAGGAGTTGTGTAATGTACTCACACAAGTTGTCAGAT 
CCCCGGGTGCAACCACCAGATTGTCGAGAACAGCCTGCACCTTGGTCACTAGATTGCCAATAGCCGTCTGCTCGCTGGGAGTGGGACTCAGGTCC 
TGGTTACGCTTCAGCAGCGCCTGAAATGCAATTGTTTTCCCATTGCAGACACCTTGGATACGCAAGTAATTTATTGATTTACCGCAGTCAAAGCA 
GAATCGTCAACGGCGCCGGCGGATGGAACTTTGGGAAAGAAGACCTCCGCGAGGGTCAGATCGAAGGGGTGGCGCGGCACGAATGTCTTTTTGAA 
CGGCGGACGTATGCCGCCGCGCATGGGGCGGCCACCTCGGAGAGCTCCGCGAACCATATTCCTGTGGGGATGAATTTTTAATAATTTTCGCAACG 
CCGGAGCGTTCTGTTTTTTTTAACAGATAGGTAAACAGTGTGACCGAAGCTGGACCGGTAAGGAAACGACATCAAAGATGGGCTAAGCGCGTTTT 
CAAAGTATACCGCATAAAATATTTTTAGAGGGCAAACATTTTGAACATTTAAACATTTTACTACATTAGAATGCATAAATTGATTGAATTAAAAT 
ACGGAGACTGGTGTTTTTAATCTGCATAATTTTGTAGTTGTAGCTAGTGCAACAAGGTTTCTATCAGTTCCAGGAAAACGCTTTGTTTACAAGCA 
AAGTGCTGATTTTTATGTTGATGGTCTTACCCTTTTGTTGTACAAAATTCCGGTGCTAAAATCGGATTAAAAGCGCATGAAGTTATTCGTAAGTA 
GTTGAAAATCGCACTAGAGATGAATCCTAATGTTTATAGTTTTCAGAACTATGGATCTGGCGAACGTAGATCAACGCCCCGTGAAGGACGAACCA 
CCTCAAGAAGACACTTTCGAAGAAGAATTAATCTTCATTTCTAACAGCGACTTCGAAGAGCTTGAAAGCGAAATAAAGATTGAGAACTTCTGTAG 
TTATGGCAAAGATTTGGAGCCAGTTAAAGGCGTCCCGTCGAAGCTGAAGACGTGTAAATCCAAAATAGCAAAGAAGCGCCCCTTGCGAAAGCAAA 
CAGATACGTTTAAGTGTACCCAATGCCAAAAGACGTTTACAAGAAAGGAAAACCTCGAATCACACTTGCGACTTCACGCAGAAGAACGTCCGTTC 
GAGTGTTCCCACTGCTCCAAGAGCTTTGGACGCAGGACGCATTACAAGCGACACTTGCTCAAACACGAAAAGCGACCTCATAAGTGTTCCCACTG 
CTCAAAAACCTTTACCCAGAATTCATCGCTTAAACAACATCTGCATGAGCACACAGGAGAGCGCCCCTTTAAGTGTACCCAGTGCTCGACGTCGT 
TCGCGCGAAAATCTCATCTTCAGGTACACTTACGTACGCATAGTGAAGAACGACCCTTTGAGTGTACCCATTGCGAAAAGGCATTTAAAAACAAT 
TCACATCTCCAGGAACACTTGCGAACGCATCAGGAGGCACGGCCCTTTAAGTGTTCCCACTGCTCAAAGAGTTTTAAGCTAAGATCAATTCTACA 
AAAGCACTTGCTGACGCACGCCGAACGATCCTTTAAGTGTACCCAGTGCCCGAAGACTTTTCTGCAGAATGATAGTTTGCAAATTCATTTGCGAG 
TGCACGCGGGCGAAGACCCGTTCAAGTGTCCCCATTGCTCAGAGACCTTTGCAAGGAATTCGAGACTTCAACTACACTTACTTGAGCACGCTGGA 
AAAGAGCCCCTGAAGTGTTCCCAGTGCTCAGCCACCTTTGCCATGAGGTCACTTTATCGAGTCCACGTGCGTTTGCATACAAGAGAGCGACAGTA 
CAAGTGCGCCGAATGCTCGAAATCTTTCTTCAAGAAGTCACATCTTGTGGAGCATCAGCAAGTGCATACGGGAGAGCGGCCCTTTAAGTGTACCC 
ACTGTTTCAAGGACTTCAAATGTCGTACGCATCTTCGGGTGCATATGCTTGATCATATTGGAGAAAAGGTACCCAAGTGTTCCTACTGCTCGAAG 
GAGTTTAAACTAAGCTCGCAACTGCTGGTGCACCTGCAAGAGCATACTGGAAAAAACCAATTCGAGTGTCCCCACTGCTCGAAGTCCTATACAAC 
TAGCTCGACTCTTCACATGCATTTGCGAACGCACACGGGAGAATTGCCCTTCAAGTGTTCGCACTGCTCAAAATTGTTTGCGAGAAGCGCAGAAC 
ACCAGGAACACTTGCGAACGCATACGGGGGAAAAACCCTACAAGTGTTCACACTGCTCAACCCGATACACACAAAAGTCCAGTCTTGGGCGACAC 
CTGCGGCGAACCCTATGCGGCACTAAATCGGATTAAAACCAATTAGTTTAGAATTAACTTAGAAATAATGTAATAAGAAGTAAAGTAGGCGAAAA 
TAAATAAATGAGTTTAAATATTTTTTTAAGTTTAGAAATCGGTAGTGTGCCCATTTGTTTAGTCCCCCTGATCGACCTGTACCATATTTGGCATT 
CAGCATTCGGTCACACTGCCGCCCAATTGCTGTGTCATTTTTGTTTCTGTGCGAAATAAGTCGGCTTTTTCATTTTACACATCAAATCACTGCAT 
TTGCGGCCCAGGAAATATGGCTACCGAACGATACAGCTTTTCGTTGACCACGTTCAGGTGAGAGCATCGCTATAGCCGGAGCGGAGCATTCGCTT 
CGCCCCGCGCAACACAAACTGCCGCCATGACTGGGCAATAAGTACAATTTTCTACCGTTTGCAGTCCTTCCGGAAAACTCGTCCAACTGGAGTAT 
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GCATTGGCGGCCGTATCTGGCGGAGCTCCCTCCGTGGGCATTATAGGTTAGCATGCCGCACAGCCAGTCAGCCTGTCAGCCAGTACTGTGCATTC 
AACTAACACACTATTATTTATTTTTCTTGGCTGTAGCTTCCAACGGCGTCGTCATTGCCACAGAGAACAAACACAAGTCACCGCTGTATGAGCAG 
CACAGTGTACATCGCGTGGAGATGATCTACAACCACATCGGCATGGTGTACTCGGGAATGGGTCCGGACTACCGCCTGCTGGTCAAGCAGGCCCG 
CAAGATCGCCCAGACGTACTACCTGACCTACAAGGAGCCGATTCCAGTGTCACAGCTGGTGCAGCGCGTGGCCACGCTCATGCAGGAGTACACTC 
AGTCCGGGTAAGTGTGCGCGCTTTTCTCGACTTTGGGTTCCCATAGATAATCGTATGCAATCCCACCAACAGTGGCGTTCGTCCCTTTGGCGTTT 
CCCTACTGATCTGCGGCTGGGACAATGATCGTCCG'TATCTCTACCAATCCGATCCTTCGGGCGCCTACTTCGCCTGGAAGGCCACT 
(SEQ ID NO: 1414) 

Exon: 1001.. 2506 
Start ATG: 1001 

Transcript No. : CT16741 

ATGGATCTGGCGAACGTAGATCAACGCCCCGTGAAGGACGAACCACCTCAAGAAGACACTTTCGAAGAAGAATTAATCTTCATTTCTAACAGCGA 
CTTCGAAGAGCTTGAAAGCGAAATAAAGATTGAGAACTTCTGTAGTTATGGCAAAGATTTGGAGCCAGTTAAAGGCGTCCCGTCGAAGCTGAAGA 
CGTGTAAATCCAAAATAGCAAAGAAGCGCCCCTTGCGAAAGCAAACAGATACGTTTAAGTGTACCCAATGCCAAAAGACGTTTACAAGAAAGGAA 
AACCTCGAATCACACTTGCGACTTCACGCAGAAGAACGTCCGTTCGAGTGTTCCCACTGCTCCAAGAGCTTTGGACGCAGGACGCATTACAAGCG 
ACACTTGCTCAAACACGAAAAGCGACCTCATAAGTGTTCCCACTGCTCAAAAACCTTTACCCAGAATTCATCGCTTAAACAACATCTGCATGAGC 
ACACAGGAGAGCGCCCCTTTAAGTGTACCCAGTGCTCGACGTCGTTCGCGCGAAAATCTCATCTTCAGGTACACTTACGTACGCATAGTGAAGAA 
CGACCCTTTGAGTGTACCCATTGCGAAAAGGCATTTAAAAACAATTCACATCTCCAGGAACACTTGCGAACGCATCAGGAGGCACGGCCCTTTAA 
GTGTTCCCACTGCTCAAAGAGTTTTAAGCTAAGATCAATTCTACAAAAGCACTTGCTGACGCACGCCGAACGATCCTTTAAGTGTACCCAGTGCC 
CGAAGACTTTTCTGCAGAATGATAGTTTGCAAATTCATTTGCGAGTGCACGCGGGCGAAGACCCGTTCAAGTGTCCCCATTGCTCAGAGACCTTT 
GCAAGGAATTCGAGACTTCAACTACACTTACTTGAGCACGCTGGAAAAGAGCCCCTGAAGTGTTCCCAGTGCTCAGCCACCTTTGCCATGAGGTC 
ACTTTATCGAGTCCACGTGCGTTTGCATACAAGAGAGCGACAGTACAAGTGCGCCGAATGCTCGAAATCTTTCTTCAAGAAGTCACATCTTGTGG 
AGCATCAGCAAGTGCATACGGGAGAGCGGCCCTTTAAGTGTACCCACTGTTTCAAGGACTTCAAATGTCGTACGCATCTTCGGGTGCATATGCTT 
GATCATATTGGAGAAAAGGTACCCAAGTGTTCCTACTGCTCGAAGGAGTTTAAACTAAGCTCGCAACTGCTGGTGCACCTGCAAGAGCATACTGG 
AAAAAACCAATTCGAGTGTCCCCACTGCTCGAAGTCCTATACAACTAGCTCGACTCTTCACATGCATTTGCGAACGCACACGGGAGAATTGCCCT 
TCAAGTGTTCGCACTGCTCAAAATTGTTTGCGAGAAGCGCAGAACACCAGGAACACTTGCGAACGCATACGGGGGAAAAACCCTACAAGTGTTCA 
CACTGCTCAACCCGATACACACAAAAGTCCAGTCTTGGGCGACACCTGCGGCGAACCCTATGCGGCACTAAATCGGATTAA 
(SEQ ID NO: 1415} 

Start ATG: 1 

MDLANVDQRPVKDEPPQEDTFEEELIFISNSDFEELESEIKIENFCSyGKDLEPVKGVPSKLKTCKSKIAKKRPLRKQTDTFKCTQCQKTFTRKE 
NLESHLRLHAEERPFECSHCSKSFGRRTHYKRHLLKHEKRPHKCSHCSKTFTQNSSLKQHLHEHTGERPFKCTQCSTSFARKSHLQVHLRTHSEE 
RPFECTHCEKAFKNNSHLQEHLRTHQEARPFKCSHCSKSFKLRSILQKHLLTHAERSFKCTQCPKTFLQNDSLQIHLRVHAGEDPFKCPHCSETF 
ARNSRLQLHLLEHAGKEPLKCSQCSATFAMRSLYRVHVRLHTRERQYKCAECSKSFFKKSHLVEHQQVHTGERPFKCTHCFKDFKCRTHLRVHML 
DHIGEKVPKCSYCSKEFKLSSQLLVHLQEHTGKKQFECPHCSKSYTTSSTLHMHLRTHTGELPFKCSHCSKLFARSAEHQEHLRTHTGEKPYKCS 
HCSTRYTQKSSLGRHLRRTLCGTKSD* 
(SEQ ID NO: 1416) 

Name: zinc finger protein 
Classification: trans cription_f actor 



Celera Sequence No. : 142000013384536 

GTTTAATTCAAGGATTCCTCTTGTTTGGGCATTTCAGATATTGGCAGTTATCTAGTGTTGTTTGGTGTCTTTGAAAACTGTTCGAAAATGGAATA 
ACAAATAATCTTGGGCAATTCCCATTACAATATGAAGTTGTTATTCTTCGCAGCCAGATGGACTGTAATGCAATCGTAGATCACAGCTAACTCCA 
CTCGCAAGCGGTTCATAAATCATTCAAGCTAACCCATGAAAGGCAACTGTTTGCCAAGTCAATCCAGGCCAAGACCCCCCAGACTCTCCTCGTAT 
TCGTGGTTATCTTGGGATTGGATTGTTATACATTGACTTTTTGAAATGCAAATCACATGCAGACCACAGATGCATAGCCAATGAATGCAAATAGA 
CAAAGACCTGCATAGGCTCCTATTTACATACATGTTCTCAGAAAATCACATATTCAAACTGCTCCTATCTCCCACCTCTTGCCCTCCCCTGCAGG 
TCGGATGTTTGCGTTTAACTTTTTACATTAACTGTCATTTGGTTTATTTCACAATTTACAAAATGCAACTTAAACAATGGTCTTAGACGTGGTGC 
TGGCCATGCGTGGAGGAGAGTGTCGTCAACAGATGTCCTTTGTGTTGTTCGGCATCCTGGTTACATTTTACATACAGTGCAGTGGAAGGAACTTA 
ACTAACTTACTGCGGTTGCAACCGTTTCTCGGGTTCTTTAATTTGTTGTCGTGTGCTACACTAGTACTAAGCTTTGCTGCTGCGGTTGCCGTACA 
CTTGGTGTTCGCTTAGTTTGCCTTTGTTTCAGTTGACGCTGCTGTGGCTTCACATTGCTTTAATTTTGTTTCATTTTGTTTTCATGGTGTTAGTT 
TATATCGTAAATTTATTTATGCAATTTTATCTCAATTTCATTAATCTGTTTTGCTTTGTTTTGCTTTTCTTTGCCTTTTAGTTTTGTTTTCTTGT 
TTTGCCTGATTTTTACATACACATATTTTGTTTACAATTTTCTACTAGACTTAATCGGCATCCGCGCGCATACCCCGCACCTCGCCCACCTGCCG 
CGCCTTGTCCAGCTCCTCCTTCGTCTGGTCCTCGATGGCGCGAATGTCCTCCATTGTTAGACCGTACCAGCGATCGGTTGAACAGAAAACTTGGC 
TAAAAATAGAAATGAAAAGACGATCAATAAGGAGTTTCCATCCAGGGCATACTAAAAAGGTATTCATGCATATCGAACTTTATAGATTTTAAGCA 
GTTCAAGGATATACAATTTCAAGACTATTACCGATGGAAGTTTGTAAAGAGGCGACGCTCCGATTTCTGTATGAAATTCTCTACTCTTGTTTGCA 
GGCCGAACCATTTGAACTCGCACGTGACCAGCTTGTAGCAGGTCATGACAGGATTAACATGCTTTTTCCAGTCGGATCCCACCAGGGGACCGCGG 
CCCGTCTTCTTTGACTGGTAGCTGAAAGGATGATCGAAACGTAAATTTCTAATTCACATCACATTATTCAATATTAACGCTTACGTGGTTGGATC 
CTCATCGGGCTTGTAGTCCGCGGGCAGCACCGGATCGTTGGCAATGTCGATGTGCACTATCTCTCGCACTTTCAGCTTATCCGGCGTCAGCTCGT 
GCACCTGCAATATCGAAGAGCCATGGCGATATATCTTAGATCCATACATTTTATTAACTTATCGAACACAAAAAGCGTTTACTACATAGACATGA 
ACATGGATCGATGGCGGATAAGAACTACAAAGCCGAATCGAGCAGAGACAGCAAATGAAATACAAAACAAACGAAACTTACGTTGTCAACAGTGC 
CAAGGTCATTTTCTATGTGTTGCGAATAAATGTCTATTTTAAAGTTTTTATCCATATAACCAGGATTCTTAATCGATAACCAAAGGATTACAAAA 
CCAAACCAAAAAAGCCAAACAAAATGATGGTAGGATTTTGACAAAATCATATTGCAGGATGATCGAGTTAGTAGTTAGTTTTCACAAATAAATGT 
AGGGTTGAGATGTGATGATGTTATGGAGAGAAAAGAAGAAAACACACGTTAGTCATATCATGCTGTGGATTCTTGACTTACATTTTCTGAATCGC 
CCGCATCTCCGACGTGCAGAGTGTCGATGATTATTTTGAAAGCATCTTTCATAAACTTGGGGTTCTAGAAATTCAATATAATCACAGTAATCATC 
AGGTTGCCTCACATTGCAGTACATTGCTAATGCTAAACCAATTCATTTGTTGATTTGTTTCTAACCCCCATTTTGTTACAACTATTAAATTTAAG 
TGCTCTCCCCATTTACCGTGATAATCGTTCGACAATAGGGATAGGCATTCCATGCCTCCTCGTGGATCTCCAATGAGCCCTTGGGTGCCAATAGT 
CTTATGTAGGCTGGAACTTTTGATTGCAGATGGTAGATCTTATATGTATATTGACCGGAATTGTATTTGCCACCTGTAGGAATCGAAAAACTCTC 
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TGTTAGCCCGCAACTATTCATTACCAATCGGTATTTGCTTACCCAGCAGGGGAAAATCTTCGAAGGGTTCGTTTTTTAACACCTCGATGCCCTCG 
CCGCCACCCGTATTCTCCTTTGACGCCTCGGCCACCGAGAATAATTGTGCAACTTGATACTGTGGAGCAGAAAAAAAAAGGAGTTGGGAAAATTG 
TTATCCTCGTGACAAGGGGCTTTTTGCCCAGTTAACTCTCTTACCTCTTCCACAGTCAATGGCAAAGTCACACGGCTGCAACGAGAAAAAAAAAT 
GAATATGTCAGCGAATGTGTGTGTAAAGGCTACGTGATTTTAATTAAGTTTAATATAACCATTTTAATTAAATACCATGCCTGTGCATAAAAAGC 
TGTTGATACCTTTCGCGCTACTTCGGGTGTATGGAGTATATCGAGTTTAGTACGAGGCGGAATATACAAATTATTAGCAATAACCAGTGAAATGC 
AACCAGTTATTATTGCTCTATTTAAAGGAGCGGTATTCAGACTACGTTTTAGTAGTAAATATGCCAGTGTCTTTTGTGCAAATAAATTAAATATG 
CATTCAGAAACAATAAATGTCACCCACTCACTACATGCAAATAGTTTAGACTGCATTTCTTTTAGTTTACAACAGGTGTACTTAAATCTCGATGG 
TTTTTCTGTTTGTCAATAAATAATCCGCTTTGTAATAGGAACGCTTTCCTCTTAAGGCCAACTTTGAATTGTCTGTTCCACTTTAAGTTCTAAGT 
AGCAAGGCACACAAATTGCCTCAAATTACAAGCCCTCTACTCTGGTTTTACATTTTCTAAGGCAATCAACTCAAGCCCCAAAGTATGCAGCACCT 
ATTCAAACAAGGAATCTTGAGACTACTTGCAAGAATGACCATCGGTAATTTTGGCTAGAGCCAATCAATCGTGGCCAATTGATACTTGTTGAGAT 
AAGATCTCAGGTAATTTATATGAGCACTTAATACAAACTGTTGAAAAAGGTTAATATCACTGCACGATTTAAACGATATATTTTTAATTAAAAGC 
TTACACGAGTGGGCGGAGCATTGGCATTGAAAAGAAACAAAAACAAAGGGTTTCCACAGCATAAAAGCTTCACAAGCCTTTTTGCATTTTATTTG 
CATGCATATGAGTTTGAACACGCCAAACCGAAAACCCACCCACTTCCGGGCCACATACTCATCCTCTAATTACCAAAAAGGTTGCCAGGCAGCAA 
AAGAGTAGGAAAAAGCGGCGAAAAGTGTTCTGGGACCTCCATATACATAATTATGCCAGCCTTGAAACTTGTACGCATCGCTTGACTTTTGTTGC 
GGT T T T GGC GGGG AC AATTT AAT TGC AAAAAAT T AAT T T T TT A C AACG AAT ATG AAAT GCT C T T AAAGG AT GG AAAAAAGT CT AT T AAC T TTT AT 
AGCTTTGAGTTTAGAAACTTTACTTTAATAGAACAAAAGAGGATTAAAACCTGTAACCTATATTATCATCGCTTTATAATCTCCTGCCAATGTTT 
CCATTTTGAGAGTTAATGACGCAAAAATGTTCTATGCATTCCCAAACTTTTTTGATGGTTTTTTAATTTTTTTTTATAGAGAAATAGTTAGGCCA 
AAGTTCTTTAATTTTGATACAGCAAGCATAAAACTGTGATGAGTGAAAAAGAGGAGCATAAAAACCTCTGGTATTTCGCCTTGATCTCTGGGAAT 
TTACAACCCGTAGTCGCAATGTTGAAATTTAGCTACAAACTGTAAGCCAAGGAAAGTCAACAGTCAACGGCTTGGCAAATGAAATATTCCCCGGC 
AAATAAAGGCCAAATTGCTCCCCGGTGCGAGGGAAGCCTTGGAATGCATTCGCAAGGTGCGAACTGGAGCTCCTTGGCCATAAAGAACCTGAACC 
TCGTGCTGGCCCGGCTTTATGGGCTGTCAGCCGTTCTCGCCTCCCCACACATAATACACATAATACACGGCAAATGCCAAACAATGAGACACAAG 
AAAACACGACACGACTCGAGAACAATCAAAGTGATGTGGCGAAAGTGGGGGAAATTCGGGCCTTTGCTTGCCGGTTGTCTTGCAGACGCAATTAA 
CCTTTTGGCCCACTGTTGACACTCGTAATGATGTGTTTCCTAATTGCCAGTCGACTCGAGCAACTGGTGGGCGTGGCAGAGGGACAGAACCAGGC 
GCTTTAATGCTGCCTACTCAAAAGTTGGCCAACTACGCGCCGGCCCTGAAAAGGTTAAAGTTCTAAGAGCACTCTTTTTATGGCGGTAGGGGATC 
AAGTTCAACTTTCGGCCGAGTTCATCTGGGGGGTTATTCTCCAAACAAGGTGGATGATTTAGGAGTGTGTGCGTGTCGGGGGAATGGGGCTCCAA 
AAAGTTGGAACTATAAAGGGACTTAGGGAGAGGTAGAACATTGTATTGCGGTGTTTAATTGTTAATGAAGTTATTCGGAACTTTACTTAATCATT 
GCATAATCAAGAGGAACGATTTGAGCTCTTGTGGTGTGTGTGTGTTCAACGGGTTGGAATTAAAGTTTTCGCTTAAGTCATTTAAAACGGAAATT 
CTAAGGTAAATTGCTTATGCTTTGCATTTTTGGTCAATTAACAACAGCCCATCTATACTTATGTAAATAAATGCCTAAGTAATTCCAAATAAATA 
TTCGAAATGGTTACCATCCAATTTAATTACCCATTTATGTGCCAACGTATCATCAATTCATTTAATTATGCAAACAGTATAACTTTTCTTTTCAA 
CAAATTTCCACGCAGCATTTGTCGTTCAATCAAATCTGATTGTAAAACTTTAGCGCAAAAAATTGCCCATAAAAATTGGTCTTCTGCACAAACTT 
TTCCTGTTCCGCAAAAAAAGAAAGCATTTCATTTCTTGAAAGGCAAAAAAACTTGCTAAAGCTTAAAACCCAGAAGAGGGAAAACCATTAGAGGA 
CAAACCAGAACGTGTTTTTGTGACAAAATCGCTGTTCCAATAAGTTTAGTGGTAAATAAATTTCCCTGAAATTGTCGAAACAACATTCAACTGGG 
TTGAACAACATCAGCAAATGCGAGAGATCAAACGATATAGTCGCGTTTCTCCATGGGGTTTTTGGGTTTCCTATGAATCACAGCACAATCAGAGC 
ATTGAAGAAATTATGCAAATCAAAAATAGAAACATACAACAAATCCCCACACATGAATGGTGGTAGAGGGGGGCCAAAAAGTACAAAAGTACAAA 
AGTAAAGCAAAAACAAGCGTCTGAGAATATTGAAGAACTCTTAATGAGACCAAAGTTGCCGAAAGAAACACTCGGGAGACCCCTTGGCACTCAAG 
CCTCTTTTGTGGCCATGAACAAGGGAAATTGTTTGCTCTTGTTGGCATAATTGAAAATGGAAAGTGATTGTTTGGGGCTCTGTGGTCTTCGCATC 
CCAAAGAGGTGGTGTGTGTGTGTGTGTGTGGTGTGTGGTGTGTGAGTTGTGTGGGTGGGTCGAGGGGAAAGCTAGTGGCCAAGGCTCTTGTGGTG 
GCCATTATTGAGCGACGCCCACTGGCCATTGTCGCAATTGAAGGCCGAAGCCAAGTAATAATTGATTTACCCGCAATAAAGATGTCACACAGAGA 
TTGCTCAGTCGATTGTTCAGTGAATAAGGCAGCAACGAGTCATTCGAGCTTGAAATTAAAGCATTTGTGCTAAACCTTCGTCTAACTAACAATAC 
ATATAACAAAACGATATTATCCCTGTTAAGCAGTAGGTGTTTGCATTAACATAAGTAGATATTCTGATGTGTTTATAATTTAAGCCATTTACAAG 
CCATAAGTAATTAGCTATCTAAATAGCCAAACCATTTATATACTATTCTTAAGCGTAACTGATAACTGACAAAGCTCATATGGAAATCAAGTGAA 
ATTATAATATGGCCATGCACCATTGAAGAGAAGTTGTTTTTTAAATCAAGGCCGCATTTCGCGGTTTGCCCCCATTCGGATAATAGCATACTATG 
TATCTCCATATTTGGAGCCCAATTATGCTGGCGCTTAGTAGCCGGCACGCTCAATTGCCAAACATTTTCGAAACAACCGAAGCCCGAAACACTCG 
AAACGTGTAATGAGTACAGTGCCGATGTGAAGTAAAACGACGCATTGAAGTCGAAAGCCAAACACTGCGTGTTATCTTATCGCAGTACATAAAAA 
AGCGCACAAATGGCGGAACTAAATTGGAGCTCGTGCCATTTTCAAATTGATTGCTGGCTGGGCGAGCGGGGAGGGCGAAGGGAGAACTGTATCTG 
ACGGATACATGTGTGGTGTACAAAGAGGCCAATCAGTTTGGGGCAGTAAAAGTCACGCTACAGCTGGTTTATGACGAGCACCATAAAGGACCGAC 
ATTGAAAACAACAAGCTTTTGCTTTCAGAGATCAAAGTTGAGACGGTTGAAAGTTGAAAAAAGAAACCTGCACTCACACCAACTTTATTGTCGCA 
AAAACGTGGCCTATGGACGAGGGTAAATGAATTTGCTTGATGGTTGGCCGTACAGCAATAAGATCGATTTCATTAAATTCTAAGAATATATAATC 
TATTTGAACCATTGTTTATTTATTTAAACTTTCCAGCCAATTGCTGATATAATCACGTAAATATAGTTAACATTATTGGCTTGATTCCACAATAA 
ATTCTGTATTCTAAATTCACTAGCTATATTTACTATGTCAACAAATAAAGTGCAGCACTTTAAACATTTTGTGAACAATAAAACACAATTTCAGA 
TAAATCATGCCAAGTGCAGACTATCACAATTCGTTGTTGTTACCTTTTCAGCTGCTTTTTATTCATTTGGCCACTTTGCCCGCCGGCCATTCAAA 
ATTGTTTATCAGCCTGCAGCAACCTGTGATGCGGCACGCGTTGCCTTTATTTGCACCTTTGTTTTTCATTTAATGACGATGCTGAAATGTTCATT 
GCCAAATGGTTTACACGCCAACAGGCGGCAGACGAAAAAAAATGCGAAAATCTGGCAGGAAAAACATTTTGAAATGCAAATGCTACGGGCTTCGG 
ATTCGTGTTGGGATTACGGCGTCGAGAACAATTAAACATTAATGCCTCTCGTGAAGACTGCCCGTCAACTTATGCGTATTTAAATAAATAAAATA 
AAAAAAATTTTATTGTGCAGTAAAATGCAGTTGCCTTAAATCTTCCATATTCTCTGTTAAAAATTTCGAGCAATACAAAACGTTCCCAGAAATGT 
ATATTATTTAAAGACCGGCAGAAGTATATTTTCACCTATACTTCATGAGCTTTAGTTTACTTTTTGGTCGTTGCACACTTGAATACATATCGTAC 
ACCATATGATAAATATTAATGATATAAATACATTTTATGGGTAATCTAAATAGGAATTGTGTACATTTCTGTGAACGAACGCTGTAATGAACTAC 
CAAATGCCACGGATTTTTCGCTAGCTTTTAAAAATTTTCCATTTTAAAATTTTTCCATATCATTTCTTCTTTATGATTAATGATATACCTATGTT 
TTTGTTGCAATAATATTCTTAAATGATATGTAATTAAAAATAAGCTATACAATACAAGCTTAGCAACACAAAAGTAGAAATCGATTTCTGTGTAC 
TTTAAGCCAGAAACCGATCGTATGAGAATTAATAGCTTACAAAACCGATACACTTAACACCTATCAGTAAAAGGTGCTGTTTATATACAATTGAG 
TTTCCATTATAAGATTGATGCGAATGATTGCGCATACGAAAAATGATAAGACCGAGATTGTGTGTTTGAGCTTTCAGTTTCAATCTCCTGTATGC 
AAGCCGCTCGGATTTTCCTTGATTTTCCGACAGCTGTCAAAGTCAAAGCAATTATAAGCCGAGCCATTTGATGTCAGACTTTTGACAATTTCCGT 
TAATAAATTAAAAGTATCATTGCCAAAATGGGAATGTGAGGGTGGCAAACGTGAGCCATTTGCAAATGAGAAGAATGTATGAGAAGAATGTCACC 
TGGTGCTTTGCGACCGAAAAGGATTAAAGATGCACGATGAGTCAGTTTAGTGGGGTTAACGCAACGACCTTATCGCACTTAACAACTTGTTTGAT 
TAAAATCGCAGGGCCCAAAGCCGAAAAGAGAGTAATGGGAAAAGAGAGGCGCAAAAGAGAGTGACTCACTGAAAAAGTCGACAACGTGGCGTCGC 
CGAGCGCCTAAAAAGCATTATCTTTACGTATATAGTATAGATTATCGATGATTCAAAGGAGTTTCCAAAACTAATTAGCAGTGGCATTGGTTATA 
AGAAGTAAGTTTTGGCTTACGTAGCTCCAAAATGTGACTTAAACTCATTACGTCATCGCCACAGCTGCAAAATACCTGAGATAGAGCGAGATGGA 
AAGAGAGGGGGTGTGCGGAAAAGGGGGCGTCAGGGGGCTGATAAGAAAACGGTTCTGTTGTTGTTGCGGAAAATGCCCGCAAACTTAAATACAAC 
AAAAGCAACTACTGAGAGAACTGAGAGAGCAGCAACACACGCGACCAGAGAGAAATGGAGAGAGCGAATGTCACGAAGAGAGTGTAAAATCGATT 
TGTCACAAATAATTTGCGGCTTGCCCAGATTTTTTTTTGCGATTTAAACATCAAATAATCACAGGTAAATGGTAAAAACATACAGTTTTGTACTC 
ACAATTCTTTGATCTGCATCTTTGCCAATTATTATCCGGTATTTTCCTGCTTGATTCGCTGCGAGAGAGTACAAAATACAGAATTAGATTTAATG 
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CGAATTCGCTGGCGGATTTCAACTAAATTGTAGCATTTATTATTTCCAAATATAAATGCACTTGCACTCTACGCTTAACCTTCGCACGCACTTTC 
TATATTTAATTTACATATATATCTGCCAATAACACTTTGGTGTTTTTGTTTCGCTGCCAAATCGCTGCGCCCGCACTTTTTGTTTATGTTTGGCC 
AAGCCAACTTTAGTTTAGTTAGCCAATTGGGATTCTAATGCTTTTTCTCCAATTGTCGGCTAAACGCACGAGTCATATGCACACACACACTCTCT 
TGCAAATACACACGCACACACAGCGGGCGGAATATCAATAGCAATTTATGCGATTTGCCGCTGATTTATTACTTAAGTGATGACGTACAACTGGG 
GAGTGCGATTGGCCTGGTATCGGGATCGGTACGCGTTTGCAAATCGGCGGCAGTAATATACTAGTAATGAGTTTGTTATTTAGCTACTGCGTGTC 
CGGAGCGGAGCTAAAGTTCGATGTTCGTGCAAAACACTTCGATTCCGATAGGCGGATGCTATCGATTTCGGCGATGCCCCTTGGTCACACTTGGT 
GGTGGGCGCTGCCCGTCGCCGACTATCGATAGCACAAGCGGGTTATCTAGGTGTGCGCAGCTTGTTAGGGTGACTCATGCTGTTAAAATTATTAT 
AAAAAGTTAATGAATATAATATAGTTATAATAAAATTATATATAAATCTATAAGATCAAAGATCATCAGTTATCATTTATCATTTGATTATATGA 
AAAAC AAG AAC AG AAAC AAG AT T T AAT AGG T T TT T G AAATG TG AAAA TGTGGGTTACCCC C AAT T CT T AT T CG AAAT T AAAT AAC CT AAAG AAC A 
CCTACACAGATAGGTAATTTGCACATAAGCAAATTTTGTCTAGAATTCCAAATGGTAATAATAGTAGTAATAGATTCCAATTCTTATAAGCTTGG 
TGGCTCCGTCATTTCCTCCTTCTTGGCATCATTTCCCATTGCGTCTAGTGCGTCCTTCTCATCCTCTGGCGGTGGCGGCAATTCTAATCCTCCGA 
GCATACTTTCTGGTCGATTTGGAGATGGCGGTTCGGGAACGAGCTCTTCGGATTCGAGATCTATTTCGCGGGGTGCCTTACGAGTGGAGGGCCTT 
TCGAAATCCGGATTGCCGTCATCATCGGTGCGCATGCTCAGCGCGAACTTGATAATTCCCATGCCCTGGTAGTCATCGTCGATGGGTATGTTTCG 
GGCCCACACCTTGCAGCTGAAGAAGGTCTTCGCATTCTCGGGCATCTTGTTCATCTGGATGGCCACCACGCGCGTCACTCCTCTCATGTTCTCGG 
AGGCCTCGAAATATGGACCCGGATAGAAGACCATTTCGGGCTTGGGCCCATTGGTCACCTCGCAGGTGATCCAAATCTTGGGCGTGTTGCCCAGC 
TTGCCCACGGTGTCCTGAAGGCTCGCTGGCGCCTCCTTGGGCAAGTCATCTGGCGTGTCGTAGGTCTCCGGCACATAGCCAATA 
(SEQ ID NO: 1417) 

Exon: 9724.. 9589 
Exon: 9178.. 9123 
Exon: 2735.. 2705 
Exon: 2624.. 2513 
Exon: 2448. .2297 
Exon: 2154.. 2077 
Exon: 1619.. 1510 
Exon: 1446.. 1267 
Exon: 1139.. 1001 

Start ATG: 9139 (Reverse strand: CAT) 
Transcript No. : CT16783 

TTTTGCACGAACATCGAACTTTAGCTCCGCTCCGGACACGCAGTAGCTAAATAACAAACTCATTACTAGTATATTACTGCCGCCGATTTGCAAAC 
GCGTACCGATCCCGATACCAGGCCAATCGCACTCCCCAGTTCGAATCAAGCAGGAAAATACCGGATAATAATTGGCAAAGATGCAGATCAAAGAA 
T T C C GT GT GAC T T T GC CAT T GACTGT GG AAG AGT ATC AAGTTG C AC AAT TAT T C TC GGT GGC CG A GGCGT CAAAGG AGAAT ACGGGTGGCGGCGA 
GGGCATCGAGGTGTTAAAAAACGAACCCTTCGAAGATTTTCCCCTGCTGGGTGGCAAATACAATTCCGGTCAATATACATATAAGATCTACCATC 
TGCAATCAAAAGTTCCAGCCTACATAAGACTATTGGCACCCAAGGGCTCATTGGAGATCCACGAGGAGGCATGGAATGCCTATCCCTATTGTCGA 
ACGATTATCACGAACCCCAAGTTTATGAAAGATGCTTTCAAAATAATCATCGACACTCTGCACGTCGGAGATGCGGGCGATTCAGAAAATGTGCA 
CGAGCTGACGCCGGATAAGCTGAAAGTGCGAGAGATAGTGCACATCGACATTGCCAACGATCCGGTGCTGCCCGCGGACTACAAGCCCGATGAGG 
ATCCAACCACCTACCAGTCAAAGAAGACGGGCCGCGGTCCCCTGGTGGGATCCGACTGGAAAAAGCATGTTAATCCTGTCATGACCTGCTACAAG 
CTGGTCACGTGCGAGTTCAAATGGTTCGGCCTGCAAACAAGAGTAGAGAATTTCATACAGAAATCGGAGCGTCGCCTCTTTACAAACTTCCATCG 
CCAAGTTTTCTGTTCAACCGATCGCTGGTACGGTCTAACAATGGAGGACATTCGCGCCATCGAGGACCAGACGAAGGAGGAGCTGGACAAGGCGC 
GGCAGGTGGGCGAGGTGCGGGGTATGCGCGCGGATGCCGATTAA 
(SEQ ID NO: 1418) 

Start ATG: 176 (Reverse strand: CAT) 

MQIKEFRVTLPLTVEEYQVAQLFSVAEASKENTGGGEGIEVLKNEPFEDFPLLGGKYNSGQYTYKI YHLQSKVPAYIRLLAPKGSLEIHEEAWNA 
YPYCRTIITKPKFMKDAFKIIIDTLHVGDAGDSENVHELTPDKLKVREIVHIDIANDPVLPADYKPDEDPTTYQSKKTGRGPLVGSDWKKHVNPV 
MTCYKLVTCEFKWFGLQTRVENFIQKSERRLFTNFHRQVFCSTDRWYGLTMEDIRAIEDQTKEELDKARQVGEVRGMRADAD* 
(SEQ ID NO: 1419) 

Name: PHOSPHATIDYLINOSITOL TRANSFER B like protein 

Classification : ligand_binding_or_carrier 



Celera Sequence No. : 142000013384535 

AAGCCGATCATCACCATGGAGGATGCGATCAAGGCGGAATCGTTCCGCCTAGCCAAGAGCAGTTACTACAAGCAACAGATGAAGGTTTAGAGAGG 
GCGAATCCAGCTCTGGAGCAGAGGCTCTGGCAGCTTTTGCAGCGTTTATATAACATGAAATATATATACGCATTCCGATCAAAGCTGGGTTAACC 
AGATAGATAGATAGTAACGTTTAAATAGCGCCTGGCGCGTTCGATTTTAAAGAGATTTAGAGCGTTATCCCGTGCCTATAGATCTTATAGTATAG 
ACAACGAACGATCACTCAAATCCAAGTCAATAATTCAAGAATTTATGTCTGTTTCTGTGAAAGGGAAACTAATTTTGTTAAAGAAGACTTACAAT 
ATCGTAATACTTGTTCAATCGTCGTGGCCGATAGAAATATCTTACAATCCGAAAGTTGATGAATGGAATTGGTCTGCAACTGGTCGCCTTCATTT 
CGTAAAATGTTCGCTTGCGGCCGAAAAATTTCGATATATCTACAATTGATCTACAATCTTTACTAAATTTTGAAAAAGGAACACTTTGAATTTCG 
AACTGTCAATCGTATCATTAGAATTTAATCTAAATTTAAATCTTGCTAAAGGAAATAGCAAGGAACACTTTCGTCGTCGGCTACGCATTCATTGT 
AAAATTTTAAATTTTGACATTCCGCACTTTTTGATAGATAAGCGAAGAGTATTTTTATTACATGTATCGCAAGTATTCATTTCAACACACATATC 
TATATATATATATATATATATATATATATATATATATATATATGTTATATATTTATTCAATTTTGTTTACCATTGATCAATTTTTCACACATGAA 
ACAACCGCCAGCATTATATAATTTTTTTATTTTTTTAAAAAATGTGTACACATATTCTGAAAATGAAAAATTCAATGGCTCGAGTGCCAAATAAA 
GAAATGGTTACAATTTAAGGAAACAAATGTCCTTCTTGCGTTTGAAACAACTAATCCTTTTCGCCCTCGCGGCGTTTCTCGAAAAGGGCCAGGAA 
GATGCCATCGGTAAGATCACTGTCCGGCTGGCAATACAGGACGTTCTTGCCAATATTGGGATAGTCCTTGTCGCCCACATTGTGCCACTTGTTGC 
GCAAGGCCTTCTTGCAGCTGAGCAGCTTGAAGGATGGATTTAGCTGAAGACAACGCTGCACCACCTGCTCGTTTTCCTCCTTCCACAGCGAACAC 
GTGCAGTAGGCAATGCGTTTGACGTTCGGAAAGGCGCCCATCGCGTGCGACAGGATCTTAATTTGCAAACCCTGCAGCTTTTGCAGCCGCTTGTC 
CTCCTTCGGCTCGTCGCACACGGTCATGCGGTTTTGCATTCCGCTGCCAGAGCAACTAGGGTCCACCAGGATGTACTCTACGTCCGGAAAGCGTT 
CGGGTGCTGAAATTCAAACGTGTTACTTTTTTAAATATTCATTGTGAAATATATAAATATAAAACTCACTTAGGTTCAAGGCGTCTCCCAAAATG 
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GGTTTTACAATATCGCAACCAGCATCCTTTGTTATTTCGCACAGCGTATTGTAGCGTACGTGGTCCTGCTCCACGGAGTAGATGCATCCCTTGTT 

CTGCATCACATTGCATATGTGCACTGTCTTCATACCGGGAGCTGCGCACATATCCAGCACAGTGGCCCCAGACGGCGGAGCAAGTAGTTCGGCGG 

CCAAACAGGTGGCCTTGTTCTGCAGTATAAACCTCTTGCTGTGCACCAACGGATGGCGGACCCAGTAGTTGGACCACTTGGCAGGAAAAATGAGC 

ACACCCTCAACGTGCAGATCGGTCATGAACTCGTTCTCCGCCAGCGATTTGATGGCAGTCAGAAAATCCGCATAGCTGGCGTCCGCAGGCAGCTC 

CTTGCGCCGCCAGTCGCTTTTATGCAGGTAGTCCAAGGCCTCCGCCAAGCTGTATAGATTTGTGTTGATGCGCACATATCGAGGATTGGGTTCTG 

TGTGGGCAAACAAAAAGACGAGTTCAAAATTGTAGCCAAAGTAAGATTTATTAAGAGTTGTTAAGAGTCGTTACTGGTCCTTTAAACGATCGAAA 

TAATCTGGACTTGGCTGGCAGCACTTAAGTTACTGTTCTTTCTTTGGCTCCGCAGCTTCCGGCTCGTCGTCCTCATCCTGCTGCACGGGTTCCAA 

GGGTGGTGGGGGCTTCTCGCGGAAAGCCTTTAATACATACATGGCCACCACGAGGTTAACAACCACAACAGCCGTGAGAACTGACCAACAGTTGA 

CCGTGAAAAGGTCCAAATGGGGGAAGGACTCCTGCAGCCAGCTGCGGACGCCATAGAAGCCCAAGAAGGGCAGCGTGAACATGAGCACACTGTAG 

GCCAACAGCCAGAGGAAGACCTCCGCACTGGTGTCCTCCTGCTGCGTATGATGCTCCACGGGAGAATCAATGACAGCCGGGGAATCATCAGTGGT 

AGCTGGCACCACATCGACATGCTGTTGTTTCCTGTTCTTCTTGCCCATCTCAGCCGGCTAATACTGGTAATCGGTGTGATAACCTGCCTTTTCGC 

TGGACACCGAAATCTCTGATGGAATTTAGCAGTCGATCCTTGTAACTTCGCACCGTTTGCACGGGTTTGCTTTCGCCGTTCAGCTCCTTCCTGCC 

AAACAGCAGTTCGGTGACTAGAATCTTGGCCAAACTGGGATCAAAACTGGGATTGTCCCTGAGCAGTCCTGTCTCTTCGATGGCCTTTTCCAGTG 

CCACACGATTTTCGCTAAATTTTTTCAGAACCGTATGCAAAGAGCGTGTACGCTACAAGTGAAGTGCGAGAGGGAAAAGTTATACGCATATTGGT 

TTGATCTGCGGAGGAGAGGTCCAATCCACTTACGGCATGCTTCTCTGCAAATATTAGGGTCTTGATGCACTTCTGCTGCTCCAGTGCCGCCTTCA 

AAATCTTGGCAGTCGCCCTGTATTGGGTGGGAACTTTTATGGAATGCGGTTTTTTACTCATTGTCGTCACGTGCACGTGTATGTGTTGTTTGCGC 

TAGAATAGGTGCAACCCCAATCACGACTTTATCGATATATGCTTATCGAATGTCAGGGCTGGAGATAAGCATGTGGCGTTTGAAACAAATCATAT 

TGTTATTCATGTTTTGAAAATATCTACAATTAATGTAAAGGTTGTTCTAATTTTGCCAACGATAACTACAAAACATATATGTTCTATCCAATATA 

TTTACATACAGGGTATCAAATTTACAAATAAATTGGCAGTGTGGCCATACGTTGCGAGCAAAAATTACTGTACTTCAAGCATTGCCAACACAAGA 

ATATTTTCATGGTTAAAAATTTTGGCGCGATAAAGCCGCAGTTTAAAATGTCAAAGTGTGCGCGTTCTTGGAGGTTTACCAAAATTTAGAATCAT 

GCTTTTATTTATCGTGCATAAGTTGGTAAATTGGAATTAAATCAAGTTTTTTCCAATTTGTTTCTGGTGTGTCGTTTGTTTTGGGGTCTGTACGG 

CTGCTTTAATTAAAAAATATTAAAATGAAGCACGCTATTCTTTAAGGACTTTAAGGAAACGTACCTTTCTATGTTTGCCCAGGAGTTGATAGACG 

ATATCAAGGCTGAGGATCGCATCCTACAATGTCCTTGGTGCGAAGAACAGACGTTGAGTCCATCAGCCAGCACCCGGAAACTTTTGTATTGGCTT 

TTAACGCTTTCAATTCGACTTTCAGAAAAGAGTCTCCTCGAAAATCACGACTGATATCCATGTAAATGAATCGGTAGATACTTGCACTTTCTTTT 

TAGTTATTAAAACTGACACACTGTCACTTTGCAGGTTTTAACCGCTGGCGGAAATGAAGAAGGAATCCGGGAATATTCATTTCAGTCCATTGTGG 

TGCACAGCGGAACTCCCCTCAGGAACAATCTTAGCCCACTAAACATCAATAAACTACTAAAATAACTTTGACTGCTGCTTAATTTCAAATATACA 

TAAATTAATATTTTTTATTTTTAAGTTAAATTTTTTAATTCTTTTGGAG 

{SEQ ID NO: 1420) 

Exon: 3039.. 2884 
Exon: 2807. .2558 
Exon: 1992.. 1495 
Exon: 1431.. 1001 

Start ATG: 3006 (Reverse strand: CAT) 
Transcript No. : CT17524 

CGCAAACAACACATACACGTGCACGTGACGACAATGAGTAAAAAACCGCATTCCATAAAAGTTCCCACCCAATACAGGGCGACTGCCAAGATTTT 
GAAGGCGGCACTGGAGCAGCAGAAGTGCATCAAGACCCTAATATTTGCAGAGAAGCATGCCCGTACACGCTCTTTGCATACGGTTCTGAAAAAAT 
TTAGCGAAAATCGTGTGGCACTGGAAAAGGCCATCGAAGAGACAGGACTGCTCAGGGACAATCCCAGTTTTGATCCCAGTTTGGCCAAGATTCTA 
GTCACCGAACTGCTGTTTGGCAGGAAGGAGCTGAACGGCGAAAGCAAACCCGTGCAAACGGTGCGAAGTTACAAGGATCGACTGCTAAATTCCAT 
CAGAGATTTCGGTGTCCAGCGAAAAGAACCCAATCCTCGATATGTGCGCATCAACACAAATCTATACAGCTTGGCGGAGGCCTTGGACTACCTGC 
ATAAAAGCGACTGGCGGCGCAAGGAGCTGCCTGCGGACGCCAGCTATGCGGATTTTCTGACTGCCATCAAATCGCTGGCGGAGAACGAGTTCATG 
ACCGATCTGCACGTTGAGGGTGTGCTCATTTTTCCTGCCAAGTGGTCCAACTACTGGGTCCGCCATCCGTTGGTGCACAGCAAGAGGTTTATACT 
GCAGAACAAGGCCACCTGTTTGGCCGCCGAACTACTTGCTCCGCCGTCTGGGGCCACTGTGCTGGATATGTGCGCAGCTCCCGGTATGAAGACAG 
TGCACATATGCAATGTGATGCAGAACAAGGGATGCATCTACTCCGTGGAGCAGGACCACGTACGCTACAATACGCTGTGCGAAATAACAAAGGAT 
GCTGGTTGCGATATTGTAAAACCCATTTTGGGAGACGCCTTGAACCTAACACCCGAACGCTTTCCGGACGTAGAGTACATCCTGGTGGACCCTAG 
TTGCTCTGGCAGCGGAATGCAAAACCGCATGACCGTGTGCGACGAGCCGAAGGAGGACAAGCGGCTGCAAAAGCTGCAGGGTTTGCAAATTAAGA 
TCCTGTCGCACGCGATGGGCGCCTTTCCGAACGTCAAACGCATTGCCTACTGCACGTGTTCGCTGTGGAAGGAGGAAAACGAGCAGGTGGTGCAG 
CGTTGTCTTCAGCTAAATCCATCCTTCAAGCTGCTCAGCTGCAAGAAGGCCTTGCGCAACAAGTGGCACAATGTGGGCGACAAGGACTATCCCAA 
TATTGGCAAGAACGTCCTGTATTGCCAGCCGGACAGTGATCTTACCGATGGCATCTTCCTGGCCCTTTTCGAGAAACGCCGCGAGGGCGAAAAGG 
ATTAG 

(SEQ ID NO: 1421) 

Start ATG: 34 (Reverse strand: CAT) 

MSKKPHSIKVPTQYRATAKILKAALEQQKCIKTLIFAEKHARTRSLHTVLKKFSENRVALEKAIEETGLLRDNPSFDPSLAKILVTELLFGRKEL 

NGESKPVQTVRSYKDRLLNSIRDFGVQRKEPNPRYVRINTNLYSLAEALDYLHKSDWRRKELPADASYADFLTAIKSLAENEFMTDLHVEGVLIF 

PAKWSNyWRHPLVHSKRFILQNKATCLAAELLAPPSGATVLDMCAAPGMKTVHICNVMQNKGCIYSVEQDHVRYNTLCEITKDAGCDIVKPILG 

DALNLTPERFPDVEYILVDPSCSGSGMQNRMTVCDEPKEDKRLQKLQGLQIKILSHAMGAFPNVKRIAYCTCSLWKEENEQWQRCLQLNPSFKL 

LSCKKALRNKWHNVGDKDYPNIGKWVLYCQPDSDLTDGIFLALFEKRREGEKD* 

(SEQ ID NO: 1422) 

Classification: hypothetical 



Celera Sequence No. : 142000013383859 

TGTGTTCCAGTTCGCCGGTTAGCACATCCCAAATGCACACATTGTTGTCCGTGGAAGCTGAAAGTAACTGGAGTGGATTGGAAAATGTAGATGAT 
CAGGTGTACGGCGAGCGAGGGCCTACCTTGTGACCATTCCTGGTCCAACTGAGACTGCAAACGGGATGCACATGGGCCGAAATAATCTTGGCAAT 
GCCCCTCGTCAGAAAATCCCAGATTACAATGCGGCCATCATTGCATCCCACGGCCAAAAGGGTTCCGTATTTGTTAAAGGCACACGTCACGGCCA 
ACGAAATGCAATCCAGCGAGCCGTCGAACTCCTCTGGGTAGTTTTGTCCGAAAGATTCTTAATAAAGCTGGAGTTATTTACTACATAAGCATAAA 
AAATGGGCGGCAATCTCACCTAGTAGCTCCAAATTCATTTTTGTTTACTATCTTCTTCTTTTAAAATGGCAACTGACACTCACACCGAACGGCAA 
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TAACATATTATGTTATAAATGATAAGCATAACAGAAGACGAGTTTTTCGATAGTTAAGGTTATATTCTGTACAACTACTGAACATCAATTATTAT 
ATTATAATACGAATTTAAAAATCTACGTATCTACGTTGGTGTTTTTTGTGGTTTTATTAATAGAATAAGTTTAGTTAATTTTAGAGGTGAAATTA 
AATATTGAAATAGTTAAACATATAAAATAGATAAATTTGGTGAAGCGTAAGTAAATTTTAAACTGCATTAGTTATATCTTAATTCAAGTTCCCCA 
CTAATTTATGAGCGTTAGATGCGATAGTGTTATGTCTGGCGCAAGCGATTGTCTCGACGTTTCTTCGATATTTTAAGATATCGATAATGTCATCG 
GAGATTTGCCACCCCTATTTTTTGAGCTCTGTGCACGTTGTATGAGAAAAATTTATGCAGATGTATTCGCCAATTGTAACCAAGTCCGGATTGTG 
AAATTGCACGCAGACAGCGAGCATCATAATAAGCGCTCTAATTGGAGAAGGCAGAAGTTTCGAATTAAGCCATTTCGGAGACGTTTCGTCCGTGG 
CACGCGATTGTGTCAACGTCACCTTTGGAGCACTCGCCCTCTCTCGCTCGCTCCCGCTTTCGTTGCCAATTTTTTGCTTGCCCTCTCTCGGTCAC 
TCCCTTCGAAAGCTTAACGGTTAGAGTGTAATAATGAAATAACCCAGTTTCGAATTTCGTTCACAACAAAAGTGCGGGCCTTTCATGCCAAAATA 
ACATTTGGTTTCGAATTGTTTTTCAAATTCGAATCGAGGTTTTCCAGCTTTCCAGTTTGACAGCGAGAGAACGAGAGAGAGCGAGGGCGAATTAC 
GGTGTTCGCGCTCTGCTTGTGCTTTCCACTCCACTCCCCTTCTTAACTTCCCCCCTCCCAGCCTATTTACTCTGTGTGCATGTGTAAATGAATAC 
TTTAAAACGTTTTTAATCGTTGCAGTGTATTCATCGCCAGCCACGTTAAAAGGAAGAACGTGTATGTGAATACGACCATCAGAGGATCTTAGCGA 
AGGATTCAGGAGCCCAAATTCTTAATCCCCATCCCACACACACACATATACTCGCACGTAGGCACGCTCTCTGTGAGAGGAAGGGTAAATAAAAG 
AAGCAACGAGCAAAATGTCTGGCGAGGAAACGTCTGCCGATCGCAATGTCGAGATCTGGAAAATCAAGAAGCTCATCAAGAGCCTGGAAATGGCC 
CGCGGGTGAGTCTTATCAATGATATACATGTCCTTTAACGTCCCCCCCAACGACTGTTTTGCGTACGTGTGTGGGGTATACACAATGCATGGCAC 
GAAAACAACAACACTGGGGTGGAAGTGGGCGTACTGGACGTGGGTGTGGGTGGGGACTTGAGGGAGCGGCGAGTACAACCACCTTTCCTTGTTGT 
CGCCGTGTGAATCTCCTTTGCATTGTTTCCAATCTTGGCAAGATCCCCCGCCTCACTCTATCTCTCTTTTTTGCATACGAGCAACACGTTTTTGT 
CGAGTTTTTATCAGTTCAAACTAGACTGATTGAGAGACGGAGAGAGAGAGAGAGAGAGAGGAGTGAGTGAGTGAGTGAGTAATCGCCTCGTGGCT 
GTTGTGCGTATGCGTGCATGTGTGTGTGTGTTTTTCCCCCAAAATGGGCCAAGCTTTGTGGACCCCTCGCTCTCACTCATCCTGCCTCGCTCACT 
CCCTTGACCCGGTCCTTACCGCTTCACACCGCTTTAGAGTGGGTAACAAGGTCAGCAAATCGAGTGACCCCCACAGGAGCATATCTGCTATGTAT 
ATACATATATATGTATATTGTTGTCAATATGCTCCACAATTGGAGCTAACATTACACCTCTTCCAATTGGAGTTGGCCACTGTTGCCAATTGGGA 
TTCGCTTTGACAGATATCTTCCTATTGGTTTTCAGCCATATGCGAAATATATACACACATATGTATTTGTTTTTCTTTTTTTAAGTTCTGTGCTA 
TTTTTTGTTGTATTTTGCATTCCAACATTTTGTACAAATTACGATTGGCACTCCTCTGTATTATAACGAACGAGAAAATGATTTGCATCAGAGAA 
ACAAACGTTTGGAAGTTACAGAAACGTATTTGTTTCAATAAATTAATCAAAGTTTAAATTTAGTCCAATAACCGAATGTCAAGTTTAAAAGACAT 
ATGCCACGAAATTTCACGGCTAATTAAATAACTTTAAAGTAGGTTGCTATTATGTGCATCTGACGCGCAAATATACGATTGAATAGCACTGTTGC 
TATGGGAGAAAGAGCCATTTCCGCGACTTATGTTCTCCCTTGTTTGGGTTTCTTGCAGCAATGGAACCAGCATGATTTCTTTGATTATTCCGCCA 
AAGGATCAAATCTCGCGCGTCAGCAAGATGTTGGCCGATGAGTTTGGAACGGCGTCGAACATCAAGTCGCGTGTAAATCGGTTGTCCGTCCTCGG 
TGCCATTACGTCGGTACAGCACAGACTCAAATTATACACCAAAGGTAAGCTGAGCTAAGAAATTGTTCATAAACAATCGAACAAACGAACTCTAA 
ACCGTATCACCAACCGCTTCAATTCAGTGCCTCCCAACGGTTTGGTCATCTACTGCGGCACAATAGTCACAGAGGAGGGCAAGGAGAAGAAGGTG 
AACATAGACTTTGAGCCATTCAAGCCCATAAACACGTCGCTCTACCTCTGCGACAACAAGTTCCACACGGAGGCCCTCACTGCCCTGCTCGCCGA 
CGACAACAAATTTGGATTCATCGTGATGGATGGTAACGGAGCGCTATTCGGTACCCTTCAGGGCAACACGCGCGAGGTGCTCCACAAGTTCACCG 
TCGATCTGCCGAAGAAGCACGGTCGTGGTGGTCAGTCCGCCCTTCGTTTCGCCCGTCTGCGTATGGAGAAGCGCCACAACTACGTGCGGAAGGTC 
GCCGAGGTGGCCACCCAGCTCTTCATCACGAACGACAAGCCCAACATTGCCGGACTCATCCTGGCTGGTAGTGCGGATTTCAAGACTGAGCTTAG 
TCAGTCTGATATGTTCGATCCTGTAAGTGGGAGTTAACCTATTTCCATCGATGCCATATTAAGCCTTACACTTGAATCTTTTTTCAGCGTTTGCA 
ATCAAAAGTCATCAAGCTGGTGGACGTGTCGTATGGTGGGGAAAACGGTTTTAACCAGGCCATTGAACTGGCGGCCGAATCATTGCAGAACGTTA 
AATTCATACAGGAGAAGAAACTCATTGGTCGCTACTTTGATGAAATTTCTCAGGTGAGTTGCCGGATACGAGTTCTGAGGCATTTCGAAATTAAA 
AACCACATCCGTATGCTTTCTATATTTAAAAGGATACTGGCAAATACTGTTTTGGAGTGGAGGACACTTTGCGGGCACTGGAACTTGGCTCTGTA 
GAGACTCTCATTTGTTGGGAGAACCTGGATATTCAACGTTATGTTCTCAAGAATCATGCCAACTCGACGTCAACGACAGTATTACATTTGACGCC 
CGAGCAGGAAAAGGACAAGTCGCACTTCACTGACAAGGAGGTGAGAGCTAAACTGAGTAATGATACTAATCCGCACGCTAACTGGGAATTCAATC 
GTATAGAGCGGGGTAGAAATGGAGCTGATTGAGTCTCAGCCGCTGCTGGAATGGCTGGCAAACAACTACAAAATGTTCGGCGCCACACTGGAGAT 
TATCACGGATAAGTCCCAGGAAGGAAGTCAGTTCGTGCGAGGTTTCGGTGGAATCGGCGGTAAGTTTGAGCGGCCAAACAGCACGGCTGGCTAGT 
TGTAGATTGGTTATGTACTAAACAAATGCTGCAACTATATCCATTCCTCTTCCCTCATGCTGCTTTCACACGGTCGTCAACGCTATAATGTAAAT 
GTTAATTTGCAAACTGGTACCTCAATTCGATATGTAGGCATTTTGCGCTATAAAGTAGATTTCCAATCCCTACAGGCCGATGAGCCGCTGGAGGA 
TGTCGATCTCGATGACTATTAAGCCTGTTAAGGGTGGGCTGCGAATGGGTCTCGATTATGCGTTACATACATAATAAACCAGTACTTTTTGGGAA 
ACCTCGTAAAGCAAAACGTTCAAATTGATTAGGAACCAATTTACTTGCCACATAGTTTTACCTATGTTCAGCGTAGCCGTTCAATATTTATCACC 
AATAGGGTCCCTCTCATCAGTCGGATGGTGCATTCCCTTGTCGATCAGATCAGCGTGCATCTAGATATGTACCTATACGTTTTACCAAATTTTTT 
AATTAGTCGTGCACCAGGCGGAATCGCGTACTTTTAAGCAGCATATTAAATTCGTATTTCGTACTCGTAATTCGTAATTTGTATTTCAATGGTCA 
TGTAAAAGCGATGCGATAGATATTAGCGCATTCTCCGTTCATCTGGTTCGATTTCGACTGCGGTATTAGGAGCAAAAGATCTTCGGTTAACTGCC 
ACATGCCACTTTCGAACGTATCTGCATTTCCCCTATTCAAATTATAGCCGAGTGTTATAAGAAAGTATACAAATCTAATCCATAAAATTGTAATA 
AAAATTAAACCAGAATAAAAACACCACACAATGAGGATGGGGTATCCGTTTATATTCAGTCGCGAAGTACAACAAATATTTTATTTAGTGTATGA 
AAAGTTAAATGTGGATACAAAGGAGGTGAGAATCTGAGTTTCGGTTGAACTATCCTGTGTGTTGATCCTTTTGTTTTTTCCTCTTCATTTATTTT 
TCAATTTTCAATCAGCTTCTGTTCTTCTATTTTTTTTTTTTTTTTGTTTCTTATTTTAATTTAATTCAATTGCTGGGTGTGCCTCTGCAGGTATC 
TTACGCTACAAGGTGGATTTCCAGAGTATGCAGCTCGATGAATTGGACAATGATGGCTTCGATCTAGATGATTACTAGGAAATGTATCTGTAGAA 
ATGAATTTGCCACTACGTTCGACCGCAAACTGAACTGCAACTACAACTATATGAATCCAGGAAATCAAATAGAATATTTAAATAATATGAGAAAA 
CACAAACCTTCTTCCATTTTTACGTTGTTGGCGAAAAAAGTAAAATTTTAAATAATGCATATAGATACAAACGATAAAATGTTAAATCGAGGAAA 
ACATATTAAAATGCAACCGTTTTACGTGAACAAATTTGCAATTTGAATATATTTTCGAGGGACTAGAGCTCTTGTTATATAATTAATTGTTCCTC 
TATAAATCTAAATCTAAATCTACATTGACAATACGCCACGTTTCCTAAAGTTTTTGGGTACAAGGAGAATTGTATCTGTGAGTTTTAATCGTTCT 
ATTATAGCTTGAAAACGAAATTTGCACCGAAAGCTGATAAAATTCCGACTCTGCTATCTGCTGGATAAGTAAAATGCTGGTAGCCACTTGGAGTA 
CTACCAGTACAGCAATGAAAGTAAAGTGAAATGGGCATTTCATTTTCTACTTTCTATTTGCAGGCACACTGGAAACAGTTTACTGCGAGATTCCC 
ACTTCATTAGTTGGCAATGCACGTGGACGACAGCAAAAAGCACTCGTAGTACTTGTTTCGCCCTTGATTTGCATCTAAGAATCGCTCAGATACAT 
CTGGTGAGCTGAAACTCGTTTCTGAGCCGCAGTGTCGTGCAGGTATCCGTACTCGTTTGAATATTTGTAAGTAATCATTGCAGCGAGACTTCCAA 
AAAGTAAGCTCAGCTAAGTTCTGCGGTGTTGGGCATCCAATTAGTTGACCAAGTTTTCTGGTGGCATGCACATGAAACTTGTTTAATCGATAGGG 
TAAATATGCAAATTTCGATTGAATTTGACGATAATGCGTCGAGATGAAAAGAAAACGAATTGTGCAGGATTTTGGTTACGAGATAAGCCGAATCA 
GCTAATAAGGCACGTGCTTAATTGATATATATTGTATGTTTAAGTAACCTAAATATGCACGACTTTAATATCACTTTTATTTCGCTTTCGTAATA 
AAACGAGGTTCACTCTAAAGAGCACAAGCGTGGAGGTGTATGTGTGTTCTTACAGCCCCGATCATTTGTATTTCCATTTGAGTGCGAGCGAAGAC 
GAGGGTTTACAACTTGATGAGAATTTGATCAAGATA 
(SEQ ID NO: 1423} 

Exon: 1001.. 1715 

Exon: 2814.. 2989 

Exon: 3068.. 3537 

Exon: 3603. .3758 
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Exon: 3833.. 4030 
Exon: 4092. .4239 
Exon: 4408. .4485 
Exon: 5316.. 5591 
Start ATG: 1630 

Transcript No. : CT17 696 

GCAGAAGTTTCGAATTAAGCCATTTCGGAGACGTTTCGTCCGTGGCACGCGATTGTGTCAACGTCACCTTTGGAGCACTCGCCCTCTCTCGCTCG 
CTCCCGCTTTCGTTGCCAATTTTTTGCTTGCCCTCTCTCGGTCACTCCCTTCGAAAGCTTAACGGTTAGAGTGTAATAATGAAATAACCCAGTTT 
CGAATTTCGTTCACAACAAAAGTGCGGGCCTTTCATGCCAAAATAACATTTGGTTTCGAATTGTTTTTCAAATTCGAATCGAGGTTTTCCAGCTT 
TCCAGTTTGACAGCGAGAGAACGAGAGAGAGCGAGGGCGAATTACGGTGTTCGCGCTCTGCTTGTGCTTTCCACTCCACTCCCCTTCTTAACTTC 
CCCCCTCCCAGCCTATTTACTCTGTGTGCATGTGTAAATGAATACTTTAAAACGTTTTTAATCGTTGCAGTGTATTCATCGCCAGCCACGTTAAA 
AGGAAGAACGTGTATGTGAATACGACCATCAGAGGATCTTAGCGAAGGATTCAGGAGCCCAAATTCTTAATCCCCATCCCACACACACACATATA 
CTCGCACGTAGGCACGCTCTCTGTGAGAGGAAGGGTAAATAAAAGAAGCAACGAGCAAAATGTCTGGCGAGGAAACGTCTGCCGATCGCAATGTC 
GAGATCTGGAAAATCAAGAAGCTCATCAAGAGCCTGGAAATGGCCCGCGGCAATGGAACCAGCATGATTTCTTTGATTATTCCGCCAAAGGATCA 
AATCTCGCGCGTCAGCAAGATGTTGGCCGATGAGTTTGGAACGGCGTCGAACATCAAGTCGCGTGTAAATCGGTTGTCCGTCCTCGGTGCCATTA 
CGTCGGTACAGCACAGACTCAAATTATACACCAAAGTGCCTCCCAACGGTTTGGTCATCTACTGCGGCACAATAGTCACAGAGGAGGGCAAGGAG 
AAGAAGGTGAACATAGACTTTGAGCCATTCAAGCCCATAAACACGTCGCTCTACCTCTGCGACAACAAGTTCCACACGGAGGCCCTCACTGCCCT 
GCTCGCCGACGACAACAAATTTGGATTCATCGTGATGGATGGTAACGGAGCGCTATTCGGTACCCTTCAGGGCAACACGCGCGAGGTGCTCCACA 
AGTTCACCGTCGATCTGCCGAAGAAGCACGGTCGTGGTGGTCAGTCCGCCCTTCGTTTCGCCCGTCTGCGTATGGAGAAGCGCCACAACTACGTG 
CGGAAGGTCGCCGAGGTGGCCACCCAGCTCTTCATCACGAACGACAAGCCCAACATTGCCGGACTCATCCTGGCTGGTAGTGCGGATTTCAAGAC 
TGAGCTTAGTCAGTCTGATATGTTCGATCCTCGTTTGCAATCAAAAGTCATCAAGCTGGTGGACGTGTCGTATGGTGGGGAAAACGGTTTTAACC 
AGGCCATTGAACTGGCGGCCGAATCATTGCAGAACGTTAAATTCATACAGGAGAAGAAACTCATTGGTCGCTACTTTGATGAAATTTCTCAGGAT 
ACTGGCAAATACTGTTTTGGAGTGGAGGACACTTTGCGGGCACTGGAACTTGGCTCTGTAGAGACTCTCATTTGTTGGGAGAACCTGGATATTCA 
ACGTTATGTTCTCAAGAATCATGCCAACTCGACGTCAACGACAGTATTACATTTGACGCCCGAGCAGGAAAAGGACAAGTCGCACTTCACTGACA 
AGGAGAGCGGGGTAGAAATGGAGCTGATTGAGTCTCAGCCGCTGCTGGAATGGCTGGCAAACAACTACAAAATGTTCGGCGCCACACTGGAGATT 
ATCACGGATAAGTCCCAGGAAGGAAGTCAGTTCGTGCGAGGTTTCGGTGGAATCGGCGGCATTTTGCGCTATAAAGTAGATTTCCAATCCCTACA 
GGCCGATGAGCCGCTGGAGGATGTCGATCTCGATGACTATTGTATCTTACGCTACAAGGTGGATTTCCAGAGTATGCAGCTCGATGAATTGGACA 
ATGATGGCTTCGATCTAGATGATTACTAGGAAATGTATCTGTAGAAATGAATTTGCCACTACGTTCGACCGCAAACTGAACTGCAACTACAACTA 
TATGAATCCAGGAAATCAAATAGAATATTTAAATAATATGAGAAAACACAAACCTTCTTCCATTTTTACGTTGTTGGCGAAAAAAGTAAAATTTT 
AAATAATGCATATAGATACAAACGATAAAATG 
(SEQ ID NO: 1424) 

Start ATG: 630 

MSGEETSADRNVEIWKIKKLIKSLEMARGNGTSMISLIIPPKDQISRVSKMLADEFGTASNIKSRVNRLSVLGAITSVQHRLKLYTKVPPNGLVI 
YCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTEALTALLADDNKFGFIVMDGNGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRF 
ARLRMEKRHNYVRKVAEVATQLFITNDKFNIAGLILAGSADFKTELSQSDMFDPRLQSKVIKLVDVSYGGENGFNQAIELAAESLQKfVKFIQEKK 
LIGRYFDEISQDTGKYCFGVEDTLRALELGSVETLICWENLDIQRYVLKNHANSTSTTVLHLTPEQEKDKSHFTDKESGVEMELIESQPLLEWLA 
NNYKMFGATLEIITDKSQEGSQFVRGFGGIGGILRYKVDFQSLQADEPLEDVDLDDYCILRYKVDFQSMQLDELDNDGFDLDDY* 
(SEQ ID NO: 1425) 

Name: translational release factor 1 
Classification: translation factor 



Celera Sequence No. : 142000013384535 

TTTTCTCCCTCAAAACAGCAAGTTTATACTACTTATTAACCAAGGTAAAATTAAAATGTTTTACAACTGTCATCGGAAAATAAAGCATTCCATGT 
TTTAAAAACACCCAATCGAATGGATTATGTAAAAGTATTGGATTTTTTTTATTTTGCCCACTTAGTAGCAAGTGGCGCTTACGGATGCCCGATCC 
ATGGGGCAACCATAAACTTGGATGTCTGTGATTAGGGTAAGGAAACATAACACTTATTTCACAAGAATTCGCCAGCTGCTTAAATGGTACATGTA 
AAAATATTGAAGTATTTTTATTTTTCTAGACATTCTGAGTGGCGCTCGATGTGTACGTACATACAACTCTTTGGATTCTGTGTGTACAGTCAATT 
TACGATGCCCAAGCCGCCCTGAGCGAATTCCATATTTACAAGAGAAGGGGGTGGGGATTGGGAGGGGGGGGGGTGTCGGAAAGGGTTGGGGCCTT 
AAAATACCTAAACTCAACTACACTCAACAATTTGCTTATGGTCACTACTTATACGCCTTATACACCGGCGGTTGCACTTTAAGCTTCAACGTTAT 
AGATTTGGCCAGGACACCAGCGGAGGAACCACGATCCCGACGGCCAATCCCCGCACCGAACACTCCTTCCGTGCTAGCCGGAAGTGGATCGGCGG 
TGGGCGTGGCCGCAAGGAACGGCCAACCGGGGCGTCATTTGGGATATGAGCCGCGCCGGAGCCGGCAGTCTGCTGTGCCCGGAATTCGTGGGGGA 
ATCGTTCACGGGACGGACCAGCAGCCCCTGCCGGGCGGACGCAGTTGACTAGTGGCGGACAAAACTTTCAAAGCAATCGAAAGCGAAACCTTTGT 
ACACACTTAATAGATTCCTGCGGGTCGAGTTCGTGGTCAGTTCGTCGTCAATGCTCAGTGTCCAGTAATCTTTTCGTTTGTTTTTTGGATGAGAA 
AATGAGGGAAACTTAAACGGCTCGGTAAAAAAGAAAGGCGTGGCAGCGCGCTAAACAAATTGCTCTGAGCTGTTGGCGGACAGCTCCATCAGCCG 
CCAGGCAGCCTTGGGGTTGAGTTCGTTGCGGTCGCGGCACAGCACCCAGACGTAGTGAACCCGCATCACCTTCTCCGGATCTCCCTCGACGACCT 
GACTCTTCTGGTCCCGCACGCACATAATCTGCTGGGCTTGGAACGTGATAATCAGTACCGGGCCCTGCTCCATAACCTTGCCCATTGCCAGTTCA 
ATGTTCTCGATGTCGAGGATCTTGGAGTCCAGGTACACGCCGGCCTTCTTAGCCTCCTTAATGGGATTTGCGATAATATTGAAAGTGCTCTCAAA 
GCACCAATCCTTCAGAATCTCCAGATCCCCACGGACAATCGATTCCAGAATATTTGGAATGATGTCCGTTTCGCAATCGCGGAGGAAATCCTTCT 
GATCAAAGCTCGGATCGATTTTCACCAGCTCCGTCATCGTTTCCGAAAGCTCTGTCTTTGAAAAGAGTCCACCCATGACGTCGGACACTTTATCG 
GTCAGCAGGCGGGAGGCACGAATAACAGGGTTCTCGGACTCGTCGTATTTAACCTTCCAGTCAAGCACCTTGTTCACGTAGGTATTGTTGTTCTT 
GAAGTTCTCCCACGACTCGTAGAACTTGGAGTCCTTGTGCAGCTCCATACCTGAAAAGTAACATTACATTATGGCCATGACCTTTGATCTTATTG 
AAGGAAAAAGATGGAGACTCACCAGTGGCTTCCGTGTTGGGCTCCACAACGCGATCACTATCGCTCATGACGAGCTGGACGCGCTTTCGCAACTT 
GGCAGGAGCACGATAGACACGATTCTCTATGCTCGCGCTGTCCATCTCCTTCTTGATGGTCGTCGTCGTGTTAGAGATAGCCTGGAATGCGCTGG 
TCTGGCCCAGCTTGCCACTGGTGTCGGAGATCGTGTCGCTGACCCCCCTAGCCTTCTTCGAGAGCTCCTCGGTAACCTTTTTGGCCAGGTGGGAT 
TTGCTGGCGTCCTCCAGAACATCGCCTACGCGCTCCTTGATGGCGCCCAGCTGCTCCTTTAGCATGCTAGACGACTTTTGCGCCTCCGATTCGAC 
AATGTTAAACTTTTGCCGCGCCGATTTCAGGGCATCCGATTCTTCCAGCTTCTGGGCCTCCTCACGGAACTTTCGGATATTGTCCTTGATCTCCT 
TGTTCTTATCCATCTCCGCCTTCATGTTGTCGAAGAACTGCGAGAAGAAGCCGGCCCTGCGACCTGGCGCACTATAAAAGCGGGGCTGCGGAGTA 
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AAGTCATGACTTAGGCCGGTTGCGCCATGGGGTTTTATGAGCCAAACTCTAACCTGCTGCTGCTGCAGGTTCTGCGAGGCCTGCTGGCACGTGAA 

GAGACAGGCGCGATCCCGCACAAGGGCAGCTATTCTGTACTGCAAGGAGAAAAAATCATTGAAAGCTTCGACCGTTTTAACCTCGAAATATGCAC 

ATGTAAGGACGGATGTGAGCGAACGCCAGTGATGACCGGGATCAGAGGTAACCTACCATGGTGGGGATTAGGTGACCGTTCGCAGGTAGTTTGAT 

CGGAGCGAATGTTCGGGGGGTCTGGCGTCAGAGGCTCTAAACTTTATGTAATTCCTGCCGCGAAACACGCACGTATCAAGCAGTCAGCTGTTCTC 

TTCGTTCAGCGCGCGCCGGTGTTGC^AAACGAGCGCTCTTCGCCGGCGGTGGCTCGTGCGATAGTTCGTTTTGTCGGTAATCCGATGTTGCCGCG 

CCGATATCATGTGATGTTGTCACAGTGCGCGAAATTCGAATGGTGGTGTGCAGTGATTGTGTTGTGACGGCGAGTGGCGCGTGTGGGTGCTTAGT 

TTTGGGAGATGTTTTCGTATTTTTTTGTTGATAACTCAGGCTTTGTTGCTGTGTTGTAGTACTATTTTCCATTGCGCGGTGTCCAGCTTTTAATT 

AGTGGCACATATTCTTAGCAAGTAAAAATTATTTTGCATACTATTAAATTTCTTATAAATTATTTTCTAAAATTAAGTTTACCTTTTCAATTTTA 

CTAAAAATATCGATATATTTATTATCGCTGGAAAACTACATTATTCCACCTCTAAGCAAGAACCGTTAGTTGGCGCGTAGCTTTACCACAAAATT 

CCTGGAATTGCCGTACGCTTCGCAGTTGTTTCAAGTTGTCTAAGGGACATACGATTTTTTTTGCCTCTGCGTCACGATTTTAACCCAAAAGCGAG 

TTTAGTTACATGTACATTATTATTAGATAAAGAAGTATCGCGAATACTTCAGTTGAATAAACTGTGCTTGGTTTTTGGGTGAGGATTTGTGGAAA 

GTAGAGTGCGCGATAACCGTAACTTTCGACCCGGATTTTCGCCATGTTCCGCAGCAACTTGGAGGGCAGTGGCGCAGCAGCAGTAGGTGTTGCAA 

ATCCCCCCTCGTTGGCTCAGTCTGGAAAGATTTTCCAATTGCAGGATAACTTTTCTGCTTTTCACGCCAGAGGAGGGCTCAACATTCTGGGCCTG 

CAGGACATGTATTTGGATACCAGTGGGGCCAACTCGTCGGCCACTTTGAGTCCGCCCATTACGCCGGTGACCCCTGACCCGTCGACGTCTGCGCA 

GTCGACGCACTTCCCTTTTCTGGCCGACAGCGCAGCCACCGCCAATTCGCTCCTTATGCAGCGACAGTACCACTACCACTTGCTGCTCCAGCAGC 

AGCAGCAACTGGCCATGGCGCAGCACCAATTGGCGCTGGCTGCATCAGCGGCAGCGGCTAGTGCGAGTCACCAGCAAACGGACGAGATTGCGCGA 

TCCTTGAAAATCTTTGCGCAGGTGACGACGGGCGCAGCAGGTAAGAAGAAATGGCGAATATGGCGTCGTTTAAAACAGAATAGCCAAAAACAGTG 

CGCGGGAGCGAGATGACTATCTGTACAAGTACAGTTATGCCAAAGTCAATGGCCATGGGCTTAGTTAAAGGGGCAACGAAAGCACAGAGCAAAAC 

AGAAGTAAAATTGTGTAAATGTATAATTGCTGCAGAGGTCAGAGGATTTTCGCCATTTAGTGTAGTATATTTTTTATTTTACTATTTTGTATGTG 

CTTTAAAATATATCTTTGCCCAACACTTGTATCCCCTTCAAGAGGTTTTTCGAGCGCCCCTGAATTTTCGGCCATTGACCTTAGCATTTTCGCTA 

TTCCTTTTTGTTTTAACTTCGGAAAGCCATTTTGCGTCTTACATTTTCTAGAAGTTCTTTGCTTTTTCTCGTCTGTTCTTGTTTTTTTTTTGTAT 

GCAAAAACTAATCCGTATTTGCTTATTACATTTCTGAAATGCAATTGAAACGCGAATAATAAAAAAAATACGTTCAGGACAAGGTAATGAAAAGC 

TTGCTATTCCCAGAAAATGCGGCTGGCTCGATGCAGGATGTGATGCAGGAGTTCGCGACCAATGGCTATGCCAGCGATGATCTCGGTCGCATGTC 

CTACGGGAGTGCTCCGCCACAGGTGCAAATGCCACCGCAGCAGCAGCATCAGCAACAGCAGGGGCTGCACCTGCCACTGGGCCGCAATCCTGCCC 

AGCTGCAGACCAATGGCGGCAACTTAATGCCCATTCCACTCGCCACCCACTGGCTGAACAACTACCGGGAGCATCTGAACAACGTGTGGCGAAAC 

ATGTCGTATATACCAGCCGCTCCCAATACAATGGGTTTGCAGGCCCAAACAGCGGCCACTGTGTCCACCAATCTCGGCGTGGGAATGGGTCTGGG 

ATTGCCCGTGCAGGGCGAACAGCTGCGCGGAGCTTCCAATTCCAGTAACAATAATAACAACAACAACAAGGTGTACAAGCGTTACAACAGCAAGG 

CCAAAGAGGTGAGTGGATCTGATCCAGGATTCGCAGAATTAAAACTTGCAAAATATATTTGTTATCTTGTTTCTCCAACAGATCAGCCGCCACTG 

CGTCTTTTGTGAGAATAACAACGAACCAGAGGCGGTTATCAATAGCCACTCAGTGCGAGATAACTTTAACCGAGTGCTGTGCCCCAAACTACGCA 

CCTACGTGTGCCCCATCTGCGGGGCATCTGGGGACTCGGCGCACACGATTAAGTACTGCCCCAAGAAGCCGATCATCACCATGGAGGATGCGATC 

AAGGCGGAATCGTTCCGCCTAGCCAAGAGCAGTTACTACAAGCAACAGATGAAGGTTTAGAGAGGGCGAATCCAGCTCTGGAGCAGAGGCTCTGG 

CAGCTTTTGCAGCGTTTATATAACATGAAATATATATACGCATTCCGATCAAAGCTGGGTTAACCAGATAGATAGATAGTAACGTTTAAATAGCG 

CCTGGCGCGTTCGATTTTAAAGAGATTTAGAGCGTTATCCCGTGCCTATAGATCTTATAGTATAGACAACGAACGATCACTCAAATCCAAGTCAA 

TAATTCAAGAATTTATGTCTGTTTCTGTGAAAGGGAAACTAATTTTGTTAAAGAAGACTTACAATATCGTAATACTTGTTCAATCGTCGTGGCCG 

ATAGAAATATCTTACAATCCGAAAGTTGATGAATGGAATTGGTCTGCAACTGGTCGCCTTCATTTCGTAAAATGTTCGCTTGCGGCCGAAAAATT 

TCGATATATCTACAATTGATCTACAATCTTTACTAAATTTTGAAAAAGGAACACTTTGAATTTCGAACTGTCAATCGTATCATTAGAATTTAATC 

TAAATTTAAATCTTGCTAAAGGAAATAGCAAGGAACACTTTCGTCGTCGGCTACGCATTCATTGTAAAATTTTAAATTTTGACATTCCGCACTTT 

TTGATAGATAAGCGAAGAGTATTTTTATTACATGTATCGCAAGTATTCATTTCAACACACATATCTATATATATATATATATATATATATATATA 

TATATATATATATGTTATATATTTATTCAATTTTGTTTACCATTGATCAATTTTTCACACATGAAACAACCGCCAGCATTATATAATTTTTTTAT 

TTTTTTAAAAAATGTGTACACATATTCTGAAAATGAAAAATTCAATGGCTCGAGTGCCAAATAAAGAAATGGTTACAATTTAAGGAAACAAATGT 

CCTTCTTGCGTTTGAAACAACTAATCCTTTTCGCCCTCGCGGCGTTTCTCGAAAAGGGCCAGGAAGATGCCATCGGTAAGATCACTGTCCGGCTG 

GCAATACAGGACGTTCTTGCCAATATTGGGATAGTCCTTGTCGCCCACATTGTGCCACTTGTTGCGCAAGGCCTTCTTGCAGCTGAGCAGCTTGA 

AGGATGGATTTAGCTGAAGACAACGCTGCACCACCTGCTCGTTTTCCTCCTTCCACAGCGAACACGTGCAGTAGGCAATGCGTTTGACGTTCGGA 

AAGGCGCCCATCGCGTGCGACAGGATCTTAATTTGCAAACCCTGCAGCTTTTGCAGCCGCTTGTCCTCCTTCGGCTCGTCGCACACGGTCATGCG 

GTTTTGCATTCCGCTGCCAGAGCAACTAGGGTCCACCAGGATGTACTCTACGTCCGGAAAGCGTTCGGGTGCTGAAATTCAAACGTGTTACTTTT 

TTAAATATTCATTGTGAAATATATAAATATAAAACTCACTTAGGTTCAAGGCGTCTCCCAAAATGGGTTTTACAATATCGCAACCAGCATCCTTT 
GTTATTTCGCACAGCGTATTGTAGCG 

(SEQ ID NO: 1426) 

Exon; 5676. .5478 
Exon: 2414. .2334 
Exon: 2270.. 1733 
Exon: 1665.. 1001 

Start ATG: 5573 (Reverse strand: CAT) 
Transcript No. : CT17728 

ACAGTTCGAAATTCAAAGTGTTCCTTTTTCAAAATTTAGTAAAGATTGTAGATCAATTGTAGATATATCGAAATTTTTCGGCCGCAAGCGAACAT 
TTTACGAAATGAAGGCGACCAGTTGCAGACCAATTCCATTCATCAACTTTCGGATTGTAAGATATTTCTATCGGCCACGACGATTGAACAAGTAT 
TACGATATTTACAGAATAGCTGCCCTTGTGCGGGATCGCGCCTGTCTCTTCACGTGCCAGCAGGCCTCGCAGAACCTGCAGCAGCAGCAGCCCCG 
CTTTTATAGTGCGCCAGGTCGCAGGGCCGGCTTCTTCTCGCAGTTCTTCGACAACATGAAGGCGGAGATGGATAAGAACAAGGAGATCAAGGACA 
ATATCCGAAAGTTCCGTGAGGAGGCCCAGAAGCTGGAAGAATCGGATGCCCTGAAATCGGCGCGGCAAAAGTTTAACATTGTCGAATCGGAGGCG 
CAAAAGTCGTCTAGCATGCTAAAGGAGCAGCTGGGCGCCATCAAGGAGCGCGTAGGCGATGTTCTGGAGGACGCCAGCAAATCCCACCTGGCCAA 
AAAGGTTACCGAGGAGCTCTCGAAGAAGGCTAGGGGGGTCAGCGACACGATCTCCGACACCAGTGGCAAGCTGGGCCAGACCAGCGCATTCCAGG 
CTATCTCTAACACGACGACGACCATCAAGAAGGAGATGGACAGCGCGAGCATAGAGAATCGTGTCTATCGTGCTCCTGCCAAGTTGCGAAAGCGC 
GTCCAGCTCGTCATGAGCGATAGTGATCGCGTTGTGGAGCCCAACACGGAAGCCACTGGTATGGAGCTGCACAAGGACTCCAAGTTCTACGAGTC 
GTGGGAGAACTTCAAGAACAACAATACCTACGTGAACAAGGTGCTTGACTGGAAGGTTAAATACGACGAGTCCGAGAACCCTGTTATTCGTGCCT 
CCCGCCTGCTGACCGATAAAGTGTCCGACGTCATGGGTGGACTCTTTTCAAAGACAGAGCTTTCGGAAACGATGACGGAGCTGGTGAAAATCGAT 
CCGAGCTTTGATCAGAAGGATTTCCTCCGCGATTGCGAAACGGACATCATTCCAAATATTCTGGAATCGATTGTCCGTGGGGATCTGGAGATTCT 
GAAGGATTGGTGCTTTGAGAGCACTTTCAATATTATCGCAAATCCCATTAAGGAGGCTAAGAAGGCCGGCGTGTACCTGGACTCCAAGATCCTCG 
ACATCGAGAACATTGAACTGGCAATGGGCAAGGTTATGGAGCAGGGCCCGGTACTGATTATCACGTTCCAAGCCCAGCAGATTATGTGCGTGCGG 
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GACCAGAAGAGTCAGGTCGTCGAGGGAGATCCGGAGAAGGTGATGCGGGTTCACTACGTCTGGGTGCTGTGCCGCGACCGCAACGAACTCAACCC 

CAAGGCTGCCTGGCGGCTGATGGAGCTGTCCGCCAACAGCTCAGAGCAATTTGTTTAG 

(SEQ ID NO: 1427) 

Start ATG; 104 (Reverse strand: CAT) 

MKATSCRPIPFIKFRIVRYFYRPRRLNKYYDIYRIAALVRDRACLFTCQQASQNLQQQQPRFYSAPGRRAGFFSQFFDNMKAEiyiDKNKEIKDNIR 
KFREEAQKLEESDALKSARQKFNIVESEAQKSSSMLKEQLGAIKERVGDVLEDASKSHLAKKVTEELSKKARGVSDTISDTSGKLGQTSAFQAIS 
NTTTTIKKEMDSASIENRVYRAPAKLRKRVQLVMSDSDRVVEPNTEATGMELHKDSKFYESWENFKNNNTYVNKVLDWKVKYDESENPVIRASRL 
LTDKVSDVMGGLFSKTELSETMTELVKIDPSFDQKDFLRDCETDIIPNILESIVRGDLEILKDWCFESTFNIIANPIKEAKKAGVYLDSKILDIE 
NIELAMGKVMEQGPVLIITFQAQQIMCVRDQKSQVVEGDPEKVMRVHYVWVLCRDRNELNPKAAWRLMELSANSSEQFV* 
(SEQ ID NO: 1428) 

Name: probable mitochondrial inner membrane protein import receptor (translocase) 
Classification: receptor 



Celera Sequence No. : 142000013385184 

CCAGCGTCTGCGCGCCATCATCAACGGCATCAATCCCGACGTGCTCCAGGGTCATCAGCCAGTGGCTTCAAGTGAAGTAGTGGTCGCAACGGAGG 
CGCCACAGACCGTGGCCGAGTCCACTGAGGCGGCGGAGCAGCAGCAGCAGCAGCAGCAGCAGGAGAACGGCGAGACTGAGCTGGAGAAACCTAAC 
GAAAGTTCGGCGGACGTTGCGGAGATAACTGCCGCTGTGGAGGCCTGCCAGATTGAGGGGAACTCGGTGGTGACCACCGATGAAGCCGCACAGCC 
AAGCCCAACCGTCCAAGCTGAAGCCAGTGGCTCCAGTGAGTAGATCTCAGATGACAGCAGCACGTGCCAGCACACACCAAACGTAAAAACAAAAA 
CATCCCATTTAGAACGCATTCATTTAGAAGAAATGTTAAGTATTTTCCGGAATTTACATGTAAATAAAGCGAAATAAAGCATGAGCGCATGCAGC 
GTATTATGGATTTTTCAAATGTGTTTATTGAAATCAGTTTCATAATTCATACACGTTTTATATCATTTGTTTATTAATACAAAGGATTATTGATA 
ATACATATTAATAATAGTTAATTTTTAAAAAACACATCCTTAAAATGACGGGAAGCGGACGTTTGAATAGCTGATGAATGGTAAGGAGTTTTAGG 
GACAAACTAATATATTTCGCTGATTTTCTTGCAGATATTTCTCTGGCTGGCTGGCTGGCTGGTCCATTCGGTCGGACTGCATTGTGGACGAACAA 
AAAAAAAACTGATTCAAGGTCGAAACTTAGTGCTTATAATTCCTCTTTATACTTCTTGGCAAGCATTCGCCTCCACCTCCTCGGTTCAACGATTT 
CCCCTCTTCGATTTTTTGTTTCTTGTTTTGTTTTCACTTGCTCTATACTCTTAAGGCCAACGAGCACTACAGGGCCCCCACATCTGTATGCTTTG 
TGACCGACTGGCGTCCAGTCGCTCCTCTGCTGCTCCGTTTCCATCTAATCTGCATGGTTATGCGCTTGGTGCTCTGCTCGAACGGTCAGTTGATT 
AACCTAGTGTTTAAATTTTGGACAAAACCTATAAAAGCGCTGGTGTTCAAATGTGTGTACATGTGTGTGCGTGGGTGAATGGTTGGTCGGTCGGT 
AGATATACATATATATATATATATACGGGGGCTATTACAAATAAATATGTGGCTTTTAATCATGGTATTTGTCAATGTGTTTGTAATTGCGATTT 
AAATCTCAATGGGTGTGGGGTGGAAGGGGAAAACTGCAGTCATCTGCTGGAAGGGATGTCTCTGGGGATGTCCTTTGATCGTGGCGCTGCTATTC 
CATACCAAATTAATGGTGCAATTTCTTGTGGATTTCGACTCGCCCCGGCTGCCGATGATAAAAGTGACCAAAAACAGAGATAAGGTTGGGGTGAG 
AGTGTGATTCACTAGTTTGTTACTCTGCGTCAGCAGCCCATACTTACGTTTCATGCATCATTTGATGTGTTGTTTCAAAGGATTTTATTTCTGGT 
TGATTGTGTGGTTGGGTTTGTGGTTGTTTTTGTCGTGTTTATCTTTTGTATCTTGTTTGGATTTCTTATTGATCTGTTTTAGATAGAAAAAGACT 
GCTCGTGGTGCTGCAGTTGTGTGATATTACTTTTTACGCTTGTCGGCTGGCTTGAGGATTTGGAACGAGCACATCAGCGTATCGTCCACGGACAT 
CATGGCGCCGGCATTGTCGAACTCGCCGCAGTAGTTGGGCGCCGAGAACAGGGTGACCAGCATGCGTTTGGCAAAGAACTCGTACCCATCCTCGA 
CGACTTGATGGGCTCGGCAGATGAGATCGAACTCGTGCTTCTGCAAAAACTTGGCCACCACCTCGGCACCGAAGGTGAAGCTAACGCCGCGGTCG 
TTTTCGCCCCAGCCCATGGTATCCTTATCGGGATCGGACCACAGGAGATCGCACAGCAGTCCCTGGTCGGGCACATCGGTTGGCCGCATAATGCG 
ACGGATCTGCTCCATGGAGGTCAAATCGGGACTGAGACCACCGTGGCAGCAGAAGATCTTCTCGTCGACAATGGCCGCCACTGGCAGGCAGTTGA 
AGCAGTCCGTGAATGTCTTCCACAACTTGATGCTGTAGCGACGCTTGCATTCGTCGTAGAATCCGTATATGCGATTAATGCTGGCGCATTCGTGG 
TTGCCGCGCAGCAGGAAGAAATTCTCCGAGTATTTGATCTTGTAGGCGAGCAGCAGGCAGATCGTCTCCAGCGATTGCTTGCCGCGATCGACGTA 
GTCGCCGAGGAACAGGTAATTCGATTCCGGCGGAAAGCCGCCGTACTCGAACAGACGCAACAGATCGTAGTACTGTCCATGGATGTCGCCGCAGA 
TCTTCAACGGTGCCTCTAGCTCGAGCAGAATGGGCTGCGACAGGAAGATCTCGCGCGACTTCAAGCAAAGTCCCCGGATCTCGCCCTCCGAGAGC 
TGTACGTTTTTACCTGGCCGTGCCCCACGCACCTCGAGAAGTCGCGATATTATGCTGTCGATATTCATCACGTCGCCCATGTTTGCGTGCGAAAG 
TGTGGATCTGGTGCACTAGTGCTGCTGTTGGGTGTCGAGGTTACCGCGAACGATCGTTGGTAGCGCCGAAATAGCCTCGAAATCTTTAATTTCCG 
AAATCTATTTCTGCTAAATACTTTTCACAGCTAACAAGTATGAAAATTTCTTCGTATGCTGTTGCGTGAGTGCGGGCGGGATGGCGAGTGTGCTT 
CTAGCTGATGTTGCCACACTGCCGGAAGTCGCGGATATGGCCAGATAATAGACGCCATCTTATCTAATCGATAACATGCAAATATAGTGCATAAT 
CGATAACGAAGTTGCTGTAATATTACTCTTATTACATTAAGGACATTTCCAATCCATTGCAAGAGTTCCGCAATAGCTCTACTTTTATATTTTCA 
TTCGAAACGGGTCTTCTCGACCCTATAGTGGCTCGAAACTGCTGTTAGTAATCGATTAAACGCATGGCGTCCAGTTACTCTATTTCAGCGGTGTG 
GCAGCCTGATCAGGGGGACACTAGCCATCCCGCTTGCACTCAGGCACGCCTGGCATGACTTACTGTGGAAAAGAGATGGGAGACTGATGCATTTG 
GCGAGGCTATTCGATATACCGGTAACAAGGGGATATTTAAATACATTGGAAATTCAAATAAGCCAATAAGAATATTCGTTTACAGAACATAGGTT 
AATTCATTTTCCTTATTGGAATAACTTCCACATGAGTTATGATTTAAAATGCCAAATTTCCCTTAAATTCTCACAAAACTAAATAATATATTGAT 
AGATATATTAACAATCACACTATGTAAAAATTTTGTATGAAATGTGCTCATGTCTTGTTATCGATAATTCGATACTCTATGCATTTCCTGCCTAC 
GCCCGCTAATCATTTAATTAACTGTTAAATAGGCGGGTTTCAATTTAATCAAAATGCTGTTAAACGATATCATAAGCGGAAAATTAGTTGATTCC 
GTCTATATTATAGCGGAAATTGGCCAAAATCACCAAGGTTGTGTGGAAACAGCGAAGAAAATGATATGGGAGGCCAAAAAAGCCGGTTGCCACTG 
CGTCAAGTTTCAAAAATCGGATTTGCCAGCAAAGTTTACGAGATCCGCTCTAGATCGCGAATATATCAGTGATCACGCCTGGGGAAAAACGTACG 
GGGAGCACAAGGAGTACCTGGAGTTCTCCAAAGATCAGTATCTCCAGCTGCAGGCCCACTGCAAGGAGTTAA 
(SEQ ID NO: 1429) 

Exon: 2777.. 1473 
Exon: 1378. .1001 

Start ATG: 2550 (Reverse strand: CAT) 
Transcript No. : CT17842 

GCAGTGTGGCAACATCAGCTAGAAGCACACTCGCCATCCCGCCCGCACTCACGCAACAGCATACGAAGAAATTTTCATACTTGTTAGCTGTGAAA 
AGTATTTAGCAGAAATAGATTTCGGAAATTAAAGATTTCGAGGCTATTTCGGCGCTACCAACGATCGTTCGCGGTAACCTCGACACCCAACAGCA 
GCACTAGTGCACCAGATCCACACTTTCGCACGCAAACATGGGCGACGTGATGAATATCGACAGCATAATATCGCGACTTCTCGAGGTGCGTGGGG 
CACGGCCAGGTAAAAACGTACAGCTCTCGGAGGGCGAGATCCGGGGACTTTGCTTGAAGTCGCGCGAGATCTTCCTGTCGCAGCCCATTCTGCTC 
GAGCTAGAGGCACCGTTGAAGATCTGCGGCGACATCCATGGACAGTACTACGATCTGTTGCGTCTGTTCGAGTACGGCGGCTTTCCGCCGGAATC 
GAATTACCTGTTCCTCGGCGACTACGTCGATCGCGGCAAGCAATCGCTGGAGACGATCTGCCTGCTGCTCGCCTACAAGATCAAATACTCGGAGA 
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ATTTCTTCCTGCTGCGCGGCAACCACGAATGCGCCAGCA'TTAATCGCATATACGGATTCTACGACGAATGCAAGCGTCGCTACAGCATCAAGTTG 
TGGAAGACATTCACGGACTGCTTCAACTGCCTGCCAGTGGCGGCCATTGTCGACGAGAAGATCTTCTGCTGCCACGGTGGTCTCAGTCCCGATTT 
GACCTCCATGGAGCAGATCCGTCGCATTATGCGGCCAACCGATGTGCCCGACCAGGGACTGCTGTGCGATCTCCTGTGGTCCGATCCCGATAAGG 
ATACCATGGGCTGGGGCGAAAACGACCGCGGCGTTAGCTTCACCTTCGGTGCCGAGGTGGTGGCCAAGTTTTTGCAGAAGCACGAGTTCGATCTC 
ATCTGCCGAGCCCATCAAGTCGTCGAGGATGGGTACGAGTTCTTTGCCAAACGCATGCTGGTCACCCTGTTCTCGGCGCCCAACTACTGCGGCGA 
GTTCGACAATGCCGGCGCCATGATGTCCGTGGACGATACGCTGATGTGCTCGTTCCAAATCCTCAAGCCAGCCGACAAGCGTAAAAAGTAATATC 
ACACAACTGCAGCACCACGAGCAGTCTTTTTCTATCTAAAACAGATCAATAAGAAATCCA^ 

C ACAAACCCAACC ACAC AAT C AACCAGAAATAAAATCCTTTGAAAC AAC AC ATCAAAT GAT GCATGAAACCC GGGGCGAGTCGAAATCC ACAAG A 

AATTGCACCATTAATTTGGTATGGAATAGCAGCGCCACGATCAAAGGACATCCCCAGAGACATCCCTTCCAGCAGATGACTGCAGTTTTCCCCTT 

CCACCCCACACCCATTGAGATTTAAATCGCAATTACAAACACATTGACAAATACCATGATTAAAAGCCACATATTTATTTGTAATAGCCCCCGTA 

TATATATATATATATGTATATCTACCGACCGACCAACCATTCACCCACGCACACACATGTACACACATTTGAACACCAGCGCTTTTATAGGTTTT 

GTCCAAAATTTAAACACTAGGTTAATCAACTGACCGTTCGAGCAGAGCACCAAGCGCATAACCATGCA 

(SEQ ID NO: 1430) 

Start ATG: 228 (Reverse strand: CAT) 

MGDVMNIDSirSRLLEVRGARPGKNVQLSEGEIRGLCLKSREIFLSQPILLELEAPLKICGDIHGQYYDLLRLFEYGGFPPESNYLFLGDYVDRG 
KQSLETICLLLAYKIKYSENFFLLRGNHECASINRIYGFYDECKRRYSIKLWKTFTDCFHCLPVAAIVDEKIFCCHGGLSPDLTSMEQIRRIMRP 
TDVPDQGLLCDLLWSDPDKDTMGWGENDRGVSFTFGAEVVAKFLQKHEFDLICRAHQVVEDGYEFFAKRMLVTLFSAPNYCGEFDNAGAMMSVDD 
TLMCSFQI LKPADKRKK* 
(SEQ ID NO: 1431) 

Name: Serine-Threonine Phosphatase Alpha-2 
Clas si f ication : protein_j?hosphatas e 
Gene Symbol: Ppl-87B 
FlyBase ID: FBgn0004103 



Celera Sequence No. : 142000013383859 

CATCTGTGTTGGGATCGCTGATGAGCAGGACCTTTAGTCCATTTTCTAGTTGCAATCCGCGGTAGTCGCGTGTGTCCTGCAGCGACTTTTCAATG 
TTGTTCAGCCTATAAACGACGAACCATTGCCTTTCGACGACCTCTAAGATGCGTAACGTGCACTTACCTCAATATGGGCTCCATGCTGTCTGGCT 
TCCTAGTTGCGGACTTTTGCGAGCTTTCGGCTATTGTCATCTTGGGTTTTATAAGCGTGCCAATCGTTCGATGTGTTACTGAAATGCTGGCCCTC 
AACGCACTTGACCTGTGCGAAATAAATGCGATTTAGTGGTGCCCGTTAGCGGAAATTTGTTTAATTGCTGCCAACCGTCGACCAAAAACCGCTGT 
TAAGGCAAAAATCGATTTTCTGCACGTAAGATACATTGAGTTTGTGAAATACTCTTGACGACGCTACTCCGGCTTTACGTAGCGCGCACAAGAAA 
T C T T AG AGT G GGAGG AGT AGATT TC C AAGC C GGTC GGT T AC AC GT T C AAT C AAT T TT AAT C GC T TT T T GG AC CG AAAAAT AAATT C T AAC T AGT T 
TCGCAACAACTAAAATAGCAAACAATCATCTGTTCGCAGAGCTGCCAACTCACCGCGCACGCAGAACTAGTTTAATATGTTGCCAGATCGTTTAT 
TTTGTGAGTGGTTCCAGATTTGAAATTCCCAATTGGAGAAATTTTACAAAGTGGTAGGCCATATTTTCAAATACGAAAGACTTAATAAGTTTAAA 
GTGATGATTTTTAATAGTTCTAATGTAACTTTGGACTAAAGTAAGCTGCATATTCACCTTACCTAGGATAGATGGCGATTTCAAAATGTATTTCA 
AATGCTTGAAATTTTTCCAACGACCATACAATAAATATTGGGCAACCCTGATAACAGTCATTATAAAAATAAATATTCCGTGAATTCCCGTTCTC 
CGTCAATCTGGCAGTGCTCCGCGCCCAACCGCCGCCGTTTTCCAACACTGGCCGGTTGCCGTCAAAGCGCGTAGAAGAATTGCGGGCTGTAACTG 
CAATTTATTATTTTTTAATTTCATTTATAGTGCACCTGGTCAGCGAGCACTGGCTCATCCGCAGCCATCCGTCGGCAACCAGCTCGCAGCGATGT 
TAAACTTGTGCCAGCCGTAAAGATTTCGCGTGTTTGCGAACCACAAACTGCGTCGCCATCGGTGTGTGGGTGCAGAAATAAATGCAAATTGTTGT 
AAAATGCCGTGCTTTGGTGTGCGTGTTGATAAGTGCAATACGCAAAAGTAAGAACATTGAGGCAAATCCGTGCAATTCGTGCAATCCGTGGGATC 
AGGCGTTTTCAGAGGTGAGTGCTGCAGTGCTACGTCCGCGGAGATTTATGAGTTCGGGAATTCTGTGCTCTCCCCTTCGGCCAGCAGTCGAAGAA 
TTAGGTTCTAATCAGGTTTTAATTGTTTACCCACCAACATGCGTTTTCCTCGAGAAAAAGTGCAAGTGTTCAACGATTTTTCTCCCCTAAATGGG 
TCAAGTTTGCCGGAATGTCGAGTCCGAACTTCAAGATCCCCTCCTAAAAGTCCCTTCTGTGCCTCCTTGTGTGCGAGTCTGAATGTGTGTGTGGT 
TTGGTAACGGTATTTGTGGCCGTTTTGCAATGGCACGAGAAAATCAATAAGCTGGCAAAATTGAAGAAAGGAGACTTCCCAGACATGGCAAAACA 
GCGCAGAGTTTGTTAATAATGCTAATATGCTGGCGTGTTCTGCGAATTCTCGCGAATCAAAATGTCTAATTAAACCACAGGTTCACACCTCGCAT 
GAGAAATAAATTGGAACCAGTCCCTAAATAAACTTTGTTTTTATTTCCCAAGATTATGGTTGGTGCAAATATTTTACAATGAAAACTAAGAACAT 
ACCTTTTAAAAACTAAAAAATACGATCTTTCCATTTTCAAAATACCTGATATCTTCAAAAACTAACAGTTGTTTACCTTCACAACGTCTTCTTTC 
TTTGTTTCTTTGTTTTTGCTTCACTTTGCTTAAGACTGGGCACTTAAATAATATATTTTAAAAATArTTCTACAAGGAAATAATAAAAATAAAAA 
CGGTATTTTATAGTAGTTATATCAGACATTAATTTTTCAATATTTGAAGAATATTCTTCCCTCTTTAACCTTCATAAAGTCAATTCAAATGGCTC 
TAAAT AGATT G AAC ACATAAATT ATAGAAAATCAAAGC AAAAGT GGAACCGATTGCC AT GTAT TCCAGAGGT CAAC GAT CCACAATATTGACAGT 
AACTTAAAGTCTTCGCACGAACAAAAGACTTTCCTGTGGAGAACAACGATCGCTTCTGGCGACCACTTGAAGTACTTTCAATTGCATGGTTATTG 
TTTTGGGTTTAAAGCGCGTCCCAAAATAGACAGACTGCAGCTCGTCGAGAAGATCGGACAATGGATTGCCATGAAGCGTGGGCCACGCAGGAAGA 
AGTGTGCTCATTGCGACCGGAGGATTGAAGGCAGATATATTTTTAGTGCAGCCCAATTAAGTTCAATTAGAGGTTCAAAGAGAGAGAGCGAGCTT 
ATCGCTCCGTATCCGAGGTACTTACTCCAACTTCCTCCGCGACAAGTGGATCAAGTGGGGGTCAATGAATTGGCTGGTTTTATAGCCTCCGTTTG 
CGGGATCCAATAGATTTCTAAGTAAATGATACGAAGCTGGTATCTGATTTCACTATAGTTATTTTCACAGTGCGCAGCGAGTAAACAAACAAAAT 
TGCCTATTGGCGTAGATCAGTTATTCTGGCTTTACGTATAACTGAACGTAGAAGTGAATTTTCCCAGTGCGTAGATTATTTCAAACTATGTATTT 
CTATTTTGGGAAGCACTTAGGTATCTAGATTAAAATCAAATTAATCACGTGATGAAACCATTTTGCCTTCACACAAAACGTTAATGCAGACGACA 
CTTTGTGTAACTTAGTAATTAGCAGGTCGATTGTTAGTCGAGAATTTAGCATCTGCTTATCAGAACTGAAAAACATTAAGGCAGCTATTCTACGT 
AGTTCACTTATTTATATTAGAAGGGGTGTCAGCAGTTTATTTTATTTTCAATATTTTCATTTATGTATGTTTTCCTCAGGAGAGACCACATTTTA 
TTAAATCACACAAAAATCCAATACTATTTTCATTTCAGATTCCATAAGCTGGTTCTGCTATTTGTGCATGCCCAAAATGGACAACGAGCAGCCGC 
ATCAAGAGCTAGTGAAGTGCGTCCTGGTGGGGGACACGGCCGTGGGCAAGACGCGACTGATCTGCGCGAGGGCCTGCAACAAACACGTCTCGCTA 
TCTCAGCTCCTCTCTACGCACGTGCCGACCGTGTGGGCCATTGACCAGTATCGCATATATAAGGATGTGGGTTACCATTCTCTCAAGAGTCATAG 
TTATTGCATTTAACCGAATCCATATACATTAAAGGTACTCGAGCGATCATGGGAGGTGGTCGATGGAGTGAATGTCTCACTTCGCCTATGGGACA 
CATTTGGTGACCACGACAAGGATCGGCGCTTCGCATACGGCAGGTGTGTAGAGCTCATGTAGCAGCATTTTCGCCATACGAATCAATAAGATGCC 
TTCTTGACCTACAGATCCGATGTGGTGCTGCTGTGCTTTTCCATTGCCAGTCCGATTTCCCTGCGCAACTGCAAGATGATGTGGTATCCGGAAAT 
TCGCCGCTTTTGTCCGGATGTTCCCGTCATTCTAGTTGGCTGCAAGAACGACCTGCGCTACATGTATCGCGACGAGAACTATCTCTCGTATTTCG 
GCGAGAAGGGAACCTTTGTTCGGTATGCATAGCTAGCAGGTTTCAGCCGGTTTTGAGTGGCTTTCACTTTTTAATGTTGTTCTTTAATCAAATCA 
ATTCAATTATTCACCATAATTTAATTATATTTAGAACACACGAAAGATATAATTCAATGTTTAGCCCACTATAGC AAATT AATCTGTTTATATGT 
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TCAGAGATATTCTATTTAATTGGATTAAATTAAAGGCAATCATTACTATAATCTTTTTTTAMGAGCCGCGTTGAAGAGTGACCTGGTTATGCCG 
GATGAGGCGCGTGCCGTCGCCAAAGAGCTAGGAGTGGCGTACTACGAGACCAGTGTTTTCACATACTTTGGGGTGAACGAGGTGTTTGAAAACGC 
CATCCGATCCGCGTTGATTGCACGCCGACAGCAGCGCTTCTGGATGACAAACCTGAAGAAGGTGCAGAAGCCCCTGCTGCAGGCGCCGTTCCGGC 
CACCGAAGCCACCACCACCGGAGGTCACCGTCATGGTCGGCAACTATCGGCAGGACATCTACAACATGTTCCTGTCGCAGGCCTACACCGACCTC 
GTCCTGGTCGGGGCGGGTGGCACCAAGTTCGCCGTGCACAGGTTCATGCTAGCCGCCGCGTCCAGCATCTTCCAGCGCCTGCTTAGCACTGAGCT 
GACTGATATGGGCGGGCGGAGCAGCAGCGAATCTAGCATGGTCAGCTCAACATTCGGGGAGGCCACCATTGCGGACTTTAACGATGACACGGAGG 
CCCTGATCCGCTACGAATCACGCACACAACGGTGGGTGCTGCCTCCCGATTGCATTGAGCCCATTTGACTTTGAATACATTTTGCACCCACAGAA 
TGTGGGAACACTTGAAGCGCCGCTCCAGCTACCAGGCGTTGCCTCTCATGGAGTCTAAGCGGTCCAACGATTTGTACAGGGAGCTGCATCATCCG 
GTCCTGCAGAGCATTCGCCTGGTGCACGTGGAAAACCATCGGGGTGTAAATGGTCTGCAGACGATTGTCACCCTTAGCAAACTTATCTCTCCGCA 
AGCTCTGCACCAGTGCCTGAGATTTATTTATACCGGTACCATTGACAAGGATTGCGATAATATCGAGGTAGGAAATGTTTCAATGTATTCTGTGT 
TAAGTTTGACCTACTAATATATTGTTAATTCTCACAGGAAATTCGAGAAGCTGCCGATCTATTAGAACTGCCGCAACTGACTCAGCTACTTTCCA 
GGCCTCAAACCGTTATGGAGAACTCCAGCGATGAGCCAAATCCGCACATTTGCCTGGTGAGCGCATTGATGTCATGATATCTATTTTATGATCCT 
AACCATTTTCTGTTACATACATTAGCGCATCAAAGAAAGCATGGAACGGCATTGCATTGGTGACGGGTGCTTCAGTGATGTCACCTTTGAATTGG 
ACGATGGCTTAATGAAGGTACATACATATGTCTGTTGAATTACTATATGAAGCACAATCGTTTACACTTACCTTTGCTGCATTGCAGGCACACCG 
CGCTGTGTTGGTTGGTCGTTGTGATGTCATGCGCGCAATGTTGTTAGGCGACTTTCGCGAGGCTCATTCCAATGTGGTAGGAACGGGCACAATAC 
CCCTTAACTAAGGACAGTTGTCATAAATTTCCTTTACCTTAGATCGTATTCCCTGGCGTTACCATTTACACATTCCACAAGCTGCTGTGCTATTT 
GTACACAGACCAGATTCCGCCAATCTCGGCCGTCAAGTGCCTCAATCTGCTGGAGCTGGCCAATAGACTCTGCCTGCCGCGGCTGCTCAACCTGG 
T T GAAT GC CGC GT T AT TG AAG AT CT C AC T CT G AT T TC T C AG AAC GAAAC C AAT G AAAC GGT GG AT C AC TGT C T AAAGCTGCT C G AGC C AGTG AAG 
GTACACATAGAGAGATCAGTTAAAAATGATATACCGAAATTCAATACTTTTCTTTTTTTTTCAGCTGCACAACGCACATCAATTGGCCGAGTGGT 
GCATGTCATACCTGTGCGTCAATTACAACCTTATTTGTAAGTTTTCACTCAAGGGCCTAAAGGCGCTACACCAGGATAATCAGGAGTACTTGCGC 
GAGCACCGATGGCCTCCAGTCTGGTATCTGAAAGACTATGATTACTATCAGCGATGCCTGAACGAGCTGAACAAGGAGCTTAAGCTAAAGACTTC 
GAGGCGGGAATCGCCTAGCGATGACGAGGGCTGTCTGTGCTTTACGGGCGGTAAGTGTTGCTACATTTACGGAACTATCTGCTCCACATCCCACC 
ACACTTAACATCCCAGTCCAATACTCCAGTCGGAGGATCAGCTCAGGTTTTAGTTTCAGCTAGCTTTAGAGGACAATGTATGTATCATGACGGCG 
GTTTGTATTCGTAGCTAGGTGTCTCGTTATTGTTTTCCTTTCCTGTTGATTTCCTTGCTCATTTACAACGACAACGCCATCATCATAATATTAAT 
GTCCGAAATTTGGTCAAAGTATTCTCTTGCTGGCTGTTAGGTAAATCGAAGCGAAGCGCTGATGTGAGTGGAACCACGGATAACAGCAACGCAGA 
TAATCAGATCTTCAACAGCGCCGGCAACTCGCTCAACCACATTGATCTGGAGGCGGACATGGACCTGAATCTCTGACACCCAGCCCTAGGGCAAG 
GAAGGTGTGTTGATATTAATGTCAAAGTAAGGATTCTTCGATATTTATGATTTACTTCGATACTTTGATTCCAGATCACTGCGCTTCATTGTGAT 
CCTGCCGGTCCTGATCTGTTTTATCTGTACGATTTTAAGTATTTTGATACTTTTCCAAATCCATACATAACAAACCTCAAGGGCACGATTTAGAG 
T GGT CGGGG AG AT AAGGG T T GG AAAT GG AGGGGGC GT GGC GC GT C T G CT T T T G AC AT T ACGC AAT ATAT AG AGCG C AAAT T T GC C T TAGT GATT A 
ACGATTAATAAGTCCAACGTAGCTCGACCCAGAACAAGGGAGCATACTGCAGAGTGCTTCGGATTTCTTATATTTATACGCCTATATAAAACGCA 
AAGCATAGACGAGTATGTATGCACATGCACCTCGGCGCTGCGAAGCATTTCGCTGTATTGCCTTACGATATATCGGTGTACACTCAACTTGTTCT 
GCACGATCCCTCCCATGGAATGTGGACAGCCGAAGGACGCAGACATGTGTATATCCATATATTTGTACCTTTACCTAAGACATGTATGTATGTAT 
ATTTTTGAATTCCTACTTACGGAAACATTTGTCTAGCCATAGCTCTAAGCAATTCTCTACGATAAGGACGCATTTTGTACAGAATGATAGAATAA 
AAAAGCAATTACGAACAAATCTCACGTTTCTTCCTTGTAATAAAAACGCATTTTACAATGTAATAAATTTAGAATTATAACAGTTCACATGCCAT 
TTAAAAAAACACAACATTTATATCGCAGTTTTCACGGATTTTACACTGGATTGCGATACAGTACCATCCTCCACCTCAAGATCCTGCAGCATGGC 
AATCATCTCCTCGTCCGTAATCACAACCTCCGCCCTTAGGTTGTTATTAATATGTGGACTAAACGTAGCCGGCGCAGATTCTCCAGCCAGTTCAC 
GCAATTCCTGCTCAAGTTGATCATCTTCCTGGCTCGCGTTCTCCACAACGCTATTGGACATTATGTCCTGAACTTCCCGATGCTGGTCGAGCGTG 
TCCCGCACATCGGCCAATACTTCGTCGACATTGTCGTACTTCAATCCAGAATTCGAGAGAACCTTCTTCAAGGTATTTGATCCGATCTTGTAGGC 
ATCCAGAACAACGCCACTGTTCTGGGCCTCATCCACACTGGACAGAAGAGATTCGATGTTGTGCAGGGCAAGACTGCGACGTTCTAATTAGAGAA 
TTTTCAAAGAACATTATTCACAAAAGCCAGATAAGGGGTGTCTACAATAAAGATTACCATGATTCTTTTCCAGGAGATGCCTTTTTCGTAGATAG 
GTTTTTGCCATTTGCCGTTTGTTCTCCTTCATGTACTGGCGCACTTTGTCATCGTTCACCTTGATCTCCTCCTCTAGGTCCTCCAACTGCTTGAG 
TAACTGAGCTTGGGTATTCTGAAGATTGTGGACGGCCTGGTCTTCTTGGCTTATATTAAGATCGTCGCCAGGTTTTTCTATAACAAAAATTATAG 
TTTAATAATACATACTAACTACTTGAGAACCTTTTAAACTTGCTTGGTATTTTGATCAAGTGAATTTGGCGTATGCCTTTTTCTACTTTAAATTC 
TAGGCCCAAGATTTGACGAACATTTAAGGCCAACAGGCAAGCGCATAGATCTTTGTCTGAGTAAGTGGGAACTCCGTGAGTCTTACAAAGCGTTT 
TCAAAGCATTAAAGTGCAGTAGTTTTCCAGTGTTTTCCGACAGGACTTTTTCGGCGATGAGATCGCAAGTGTTCTGGAAAGTTAATTATGTAGGA 
TTCCTATAATCATTGGATGATATCACATACGTTTAAGGCATCTAGGTGAATCCATTCAATTAAAGCGGAGGCTTCCAGGTCCTCCGTCACCACGC 
TGTGCTTGATTTTCGACCAGCTCCATGAAAGGGGGCGCTTTACCAGGCTGTTGACGAGCCAACCACCCCAGGAATTGGCCGGATCGTACTCAAAT 
TCACTGCGAGAGCGGATCTGCTTCCTCTGCTGCATCTCGGATATCACGGTGTCCAGGCAGGCGGGCAGTTGGTCA 
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Transcript No. : CT17926 

GCCGGTTGCCGTCAAAGCGCGTAGAAGAATTGCGGGCTGTAACTGCAATTTATTATTTTTTAATTTCATTTATAGTGCACCTGGTCAGCGAGCAC 
TGGCTCATCCGCAGCCATCCGTCGGCAACCAGCTCGCAGCGATGTTAAACTTGTGCCAGCCGTAAAGATTTCGCGTGTTTGCGAACCACAAACTG 
CGTCGCCATCGGTGTGTGGGTGCAGAAATAAATGCAAATTGTTGTAAAATGCCGTGCTTTGGTGTGCGTGTTGATAAGTGCAATACGCAAAAGTA 
AGAACATTGAGGCAAATCCGTGCAATTCGTGCAATCCGTGGGATCAGGCGTTTTCAGAGATTCCATAAGCTGGTTCTGCTATTTGTGCATGCCCA 
AAATGGACAACGAGCAGCCGCATCAAGAGCTAGTGAAGTGCGTCCTGGTGGGGGACACGGCCGTGGGCAAGACGCGACTGATCTGCGCGAGGGCC 
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TGCAACAAACACGTCTCGCTATCTCAGCTCCTCTCTACGCACGTGCCGACCGTGTGGGCCATTGACCAGTATCGCATATATAAGGATGTACTCGA 
GCGATCATGGGAGGTGGTCGATGGAGTGAATGTCTCACTTCGCCTATGGGACACATTTGGTGACCACGACAAGGATCGGCGCTTCGCATACGGCA 
GATCCGATGTGGTGCTGCTGTGCTTTTCCATTGCCAGTCCGATTTCCCTGCGCAACTGCAAGATGATGTGGTATCCGGAAATTCGCCGCTTTTGT 
CCGGATGTTCCCGTCATTCTAGTTGGCTGCAAGAACGACCTGCGCTACATGTATCGCGACGAGAACTATCTCTCGTATTTCGGCGAGAAGGGAAC 
CTTTGTTCGAGCCGCGTTGAAGAGTGACCTGGTTATGCCGGATGAGGCGCGTGCCGTCGCCAAAGAGCTAGGAGTGGCGTACTACGAGACCAGTG 
TTTTCACATACTTTGGGGTGAACGAGGTGTTTGAAAACGCCATCCGATCCGCGTTGATTGCACGCCGACAGCAGCGCTTCTGGATGACAAACCTG 
AAGAAGGTGCAGAAGCCCCTGCTGCAGGCGCCGTTCCGGCCACCGAAGCCACCACCACCGGAGGTCACCGTCATGGTCGGCAACTATCGGCAGGA 
CATCTACAACATGTTCCTGTCGCAGGCCTACACCGACCTCGTCCTGGTCGGGGCGGGTGGCACCAAGTTCGCCGTGCACAGGTTCATGCTAGCCG 
CCGCGTCCAGCATCTTCCAGCGCCTGCTTAGCACTGAGCTGACTGATATGGGCGGGCGGAGCAGCAGCGAATCTAGCATGGTCAGCTCAACATTC 
GGGGAGGCCACCATTGCGGACTTTAACGATGACACGGAGGCCCTGATCCGCTACGAATCACGCACACAACGAATGTGGGAACACTTGAAGCGCCG 
CTCCAGCTACCAGGCGTTGCCTCTCATGGAGTCTAAGCGGTCCAACGATTTGTACAGGGAGCTGCATCATCCGGTCCTGCAGAGCATTCGCCTGG 
TGCACGTGGAAAACCATCGGGGTGTAAATGGTCTGCAGACGATTGTCACCCTTAGCAAACTTATCTCTCCGCAAGCTCTGCACCAGTGCCTGAGA 
TTTATTTATACCGGTACCATTGACAAGGATTGCGATAATATCGAGGAAATTCGAGAAGCTGCCGATCTATTAGAACTGCCGCAACTGACTCAGCT 
ACTTTCCAGGCCTCAAACCGTTATGGAGAACTCCAGCGATGAGCCAAATCCGCACATTTGCCTGCGCATCAAAGAAAGCATGGAACGGCATTGCA 
TTGGTGACGGGTGCTTCAGTGATGTCACCTTTGAATTGGACGATGGCTTAATGAAGGCACACCGCGCTGTGTTGGTTGGTCGTTGTGATGTCATG 
CGCGCAATGTTGTTAGGCGACTTTCGCGAGGCTCATTCCAATGTGATCGTATTCCCTGGCGTTACCATTTACACATTCCACAAGCTGCTGTGCTA 
TTTGTACACAGACCAGATTCCGCCAATCTCGGCCGTCAAGTGCCTCAATCTGCTGGAGCTGGCCAATAGACTCTGCCTGCCGCGGCTGCTCAACC 
TGGTTGAATGCCGCGTTATTGAAGATCTCACTCTGATTTCTCAGAACGAAACCAATGAAACGGTGGATCACTGTCTAAAGCTGCTCGAGCCAGTG 
AAGCTGCACAACGCACATCAATTGGCCGAGTGGTGCATGTCATACCTGTGCGTCAATTACAACCTTATTTGTAAGTTTTCACTCAAGGGCCTAAA 
GGCGCTACACCAGGATAATCAGGAGTACTTGCGCGAGCACCGATGGCCTCCAGTCTGGTATCTGAAAGACTATGATTACTATCAGCGATGCCTGA 
ACGAGCTGAACAAGGAGCTTAAGCTAAAGACTTCGAGGCGGGAATCGCCTAGCGATGACGAGGGCTGTCTGTGCTTTACGGGCGGTAAATCGAAG 
CGAAGCGCTGATGTGAGTGGAACCACGGATAACAGCAACGCAGATAATCAGATCTTCAACAGCGCCGGCAACTCGCTCAACCACATTGATCTGGA 
GGCGGACATGGACCTGAATCTCTGACACGTTTCTTCCTTGTAATAAAAACGCATTTTACAATGTAATAAATTTAGAATTATAACAGTTCACATGC 
CATTTAAAAAAACACAACATTTATATCGCAGTTTTCACGGATTTTACACTGGATTGCGATACAGTACCATCCTCCACCTCAAGATCCTGCAGCAT 
GGCAATCATCTCCTCGTCCGTAATCACAACCTCCGCCCTTAGGTTGTTATTAATATGTGGACTAAACGTAGCCGGCGCAGATTCTCCAGCCAGTT 
CACGCAATTCCTGCTCAAGTTGATCATCTTCCTGGCTCGCGTTCTCCACAACGCTATTGGACATTATGTCCTGAACTTCCCGATGCTGGTCGAGC 
GTGTCCCGCACATCGGCCAATACTTCGT 
(SEQ ID NO: 1433) 

Start ATG: 383 

MDNEQPHQELVKCVLVGDTAVGKTRLICARACNKHVSLSQLLSTHVPTWAIDQYRIYKDVLERSWEVVDGVNVSLRLWDTFGDHDKDRRFAYGR 
SDWLLCFSIASPISLRNCKMMWYPEIRRF^ 

FTYFGVNEVFENAIRSALIARRQQRFWMTNLKKVQKPLLQAPFRPPKPPPPEVTVMVGNYRQDIYNMFLSQAYTDLVLVGAGGTKFAVHRFMLAA 
ASSIFQRLLSTELTDMGGRSSSESSMVSSTFGEATIADFNDDTEALIRYESRTQRMWEHLKRRSSYQALPLMESKRSNDLYRELHHPVLQSIRLV 
HVENHRGVNGLQTIVTLSKLISPQALHQCLRFIYTGTIDKDCDNIEEIREAADLLELPQLTQLLSRPQTVMENSSDEPNPHICLRIKESMERHCI 
GDGCFSDVTFELDDGLMKAHRAVLVGRCDVMRAMLLGDFREAHSNVIVFPGVTIYTFHKLLCYLYTDQIPPISAVKCLNLLELANRLCLPRLLNL 
VECRVIEDLTLISQNETNETVDHCLKLLEPVKLHNAHQLAEWCMSYLCVNYNLICKFSLKGLKALHQDNQEYLREHRWPPVWYLKDYDYYQRCLN 
ELNKELKLKTSRRESPSDDEGCLCFTGGKSKRSADVSGTTDNSNADNQIFKSAGNSLNHIDLEADMDLNL* 
(SEQ ID NO; 1434) 

Classification: enzyme 



Celera Sequence No. : 142000013384535 

TAAACAAACAGACGTGATATAAATTTTTTCTAATAGACAATAACTTTTTACCCATCTTAAACAAATTTTACAGTAAAATAATTTTTTAGTGTATA 
CTTTTAGGATGAGACAGTTAATAATCTTGCACGCATTACTTATACGACGACCCTGGTTAACGTTATCCAATACCCAATGATAAAATATAGAATCT 
GTGGATTTACTATATCAGTATCTTTATTATACGAGTACATTAATTCAGTTTACAGTTTTACAAAATTATTTGTTTGCCTGACATGTTTTCGTTCT 
TCGATTTTAAATTTACGTTGTTTTAACATCTATTAAAAGTCGAAATTAATTTCGTTTGTGTCTGGGATGGACTGCGATTTCAATGTTGATAATAA 
TATATGTATATATATAATACTTATTCGTCATCGACCCAAGTCGGTCGGTAAAGTGCATTAAAATGATTGATTGGCAACCGGAAATGAATTTTAAA 
TTCATGATTTTCTAATGTTTGGCTTTACTTAGAACCCATTAACAAGACGATGCAATGAGGTTGACGTGTTATGATCAAATGGACGACTTAAATTT 
CATAACAATCTTCTTTATATGTACACATATCGTACTTTATCTACAACTGCCACGGAATTCGCACTTATCTTAATTAATTATTTAATTTTCTGTTC 
AACAGTGTCTAAGTTGGTAAAGTAAGATGCTTCAGTTGGAGCCATTATTTCCTAAAGAATTCTTGATAATATATTCAATTTACGGATATATTTTC 
CTATCATCAGTATAATGGAAACTACTAGCCAATTGAAAAATGATGGCTCAGTCTTCGCCTTGCTCTTGCAATATATGTTATAATTGAAGTATAAT 
TGCAGCCAAATTTAATTTACCAAATATATGTAAGTTCGATTGTAAAGGGCATACAGAAATAAATAAAACAAAAGATCATGTCCCATAATACTCTT 
TATAAAATCTGAATTTATCAGCTAATATATGAAATTGATCTGCTGGGTGTAATGTATACTTATGTATTTATTTGTAATGCAGATACCCACATATT 
TGTAACGTATGAATCTAAAACTAAAGTCATAAGTCGTGTTTAAGCCGAACGGGATTGGCCAATGATAAGTGTTCTTGCTTCTCAAATCGATCCGG 
GATCATCTGTACAATTTTCGAGGGAAAGGGGGCTTTAAGTCTTAAAACCTCAAGTCGGTAAGTCTATGTAATTATTTAAATTGGACCTTACGCGG 
TAATTAATCGTTACAGGTAAAATGTATACATTGCAATATGCCATTTTATGTATAAATGTGTGTATATTTGTTGTATTGTATTTTAAACATAATAT 
TTTTATACAGAGTTTAGACAAAATCAAATAAATAGCGCGATAAACTTAAAATTATATGTAAATGTATTCATAAAGAACAGCTATGCCATTTCATA 
GGAGTTTACCAGCGACAGATATAATTATTAAGAATAAATACTTTCCATGATTGGTTTTTGAATACTCGCTATTTAGCATAATGTATGTAGCGAAG 
TAAATTTGAAGGACTAGGAACCAGTACGACGTATGATTATGTTTATCTATATGCTTATAAATGGTAATTATATATAATTTTTCAACTTATACAAA 
CGAAATTGGGTTTCTTATTGTAAGAGCGTGCATAATTTTTAGTAGAACATTTTGGCTTACTTACAATTGCGCTCTATACACTTGACAATAAGGAA 
ATAACTATTTACTTACAAATTAAACCTTTAGGCATCGTAAACTAACCAACTTTTGTGGGCAAAATTATTTGCGATTATTTTTTGTATTTTTATCA 
AGTGGTACTGCCAGGTATAAAGGAATCGTGTATGGGGATAAAAATCTCAAAGAATCGAAGAAATAAGTTATTAAATATGAATCAGGTCGGGGTTT 
TCATCACAAGGTCCCCACCACAGCTAGCCGCCTTGCTGTTGCTTCTTTTTATCGAATATGGGAAGCTTCGACTTGACCGCACCCTCGCACACCAT 
CATCCTTTGCTGCGCCGGCTTCTTGTCCAGATCGAAGCACAGGCCATCCGCGTTCAGGTGCCGCTTGAAGTCCGAATAGCGACGCGACTCGTACG 
AAAGGTAGTTGGAACCCAGGGCAGAGGCAGCCACAGCAGCTGCTACCGTCCCGTTCCCATTGCGATGGTGCACCGAAGGCGGCGAGGAGTCCTGT 
CGCTTTCGCTCAAAATACTCCACTATCTTGGCAATCTGTGCGGATCTTGTACTGGGCAGTGCCGTGAAGGAGCGGTCGTCCCAGAGCAGGGAGTC 
AGAGCCGGCCAAGCTACTTATATCATCGGAGCTATCCGTGTATATGGACGAGGTGCGGTTCTTACAGTACAATAGTTGATTCGTTGCAAAAGGAC 
TACGCTGAAGCAGAGTATGTATCTCCGAGTCTAAAGCCAGACGATTGATCAGGCTCGCCGTAGCCGCCATTTGGGATTCGCTGCCATTCCCGCTG 



FIGURE SHEET 767 



AGCAACGCATGGTGGCAAGTGTATCGTCGCTGGTGTGTGCGCCGCGATGCCACAATGTCCGCGTCCAAGTCATCCATCGAGTCCAACGCTGTGGA 
TGATGAGGTATTTCCACCGATCACAGCGGGATCATTGTCGATCAGCAGCACCGACGATGAGGTCGAGGACGTATCGTTGAGCAAGTGGCTCTGCT 
GTTGTTGCTGGGCCGCAGCAGCGGCTGCAGTTACGGCGGCATTCGATGTCGAGGTGGCGTAGTAGCAGCAGCGACACAAACTCGCCAGTTTATCG 
CTAGAAGATAAAATCATAATCAGTAAAATCAGATGCCAAGAACACAAATACAGCAGGCCCTACTTCATGTACCAAAAACATCAGACAAACATTCG 
AATAGCTTTTGAAAATGCTTAATGGCTTTGTGATGAACATCGACTAATAAGATGATATTACACAATTAGCACGATCAAGAAAAAAGTGACTTATG 
ATTGTGATTTTTATGCTTTAGCTAAACAAGTCACTTACGTTGCTTGTGCGTTAGGTGCTCGATATTTATACCGTGGCTGTGTTTAACCTGGGGTA 
TGCCGAAATAATTATCATATATGCAACATTTTTCAAATTTAAGTTTGTATACGTATATTCACAGATCATTTAGCTAACATCTAATACATCCATTA 
TAAAAATCTATTTTGATTCTAAGCCATTTAAGATACAATATGGTCATTACAATTCAGTTAAATTGAAATCTATACAATTAAGAGATTTACTCGAA 
AGTCTTCTTCTAATACGGTACGATTTTACGAGGCAAGTTAAAAAAATATCTAATATCTACTCACTTGATTTCACTATCTGCGCTGACGCATGCAT 
CGAGCATAAGCTGCGTCTTCAATTTCTCATAGCAGCAGTTGCTACTGTTGCCCCCGCCGCCGCAACCAAACCCCAACGATCCAATCCCGTTCAGC 
TGCCGTCCGAAACTGTTGTTGGTTATCGCCGCATCGGACTCTTCGTTAAAGCACGAGAAGAAGCCACTTTCTATAGAGTCCCGCCTCGTTAGGTT 
TGTCTTCAAGGGAAGCTGCTCACACGAGTTCTTCAATTCGTTCAGATCGCAGGTGTTTTTCAGTTCGTCGAGCTCAAGATCAGTGCAGGATGAAA 
CGGACACCGGGGTTTCCACCAACCTGGTGGAGCAGCCAGACATGGGAGTGGGCAAAGCCTGCACCGCCGCTGTTGCACCGCTACTCGCTTGCTCT 
TTTGATTTTCTGGCGCTTTCAGTCAGATCCTCACACGAATTGGTCGCACCCACTGACGCAGTCAACGTGATTGTAGATGCGCTGGCCTTCAGTAG 
AGTTGGACTCTCGAATTCACCAGTACTGGTACTTTTCATGCTGGTGCCATTGGAGCCGCGAAAAAGCGGATGCGCATACAAGGAATTCGCGCGGT 
ATTTTCGAGCGGTTGGCACAGTGGACGGACGCACGGCGCTGTCATCGCTGCTTGATTCCGCATCTGCACAGTTGCCACCGCCATCGGGATCCTCA 
CCTTCCGAATTGCAATCATCGCAGTCATCCTCATCGGTTGGCAATGCTATTGATGGTGGTGGAGCACTGGTAACTGCTGGAGGAGCAGCATTGCT 
GATTCGCTGAGGAAGTCGAAGCTCGATGGCGCTGTATTTGCGCGTCATATGCGGCGACACTGGCAGGCTCTTTGGTTCACTGGGCGTTGCTGCCG 
AAGCCTTGTTACCCGTTGATGTTTCCTCCGTACTCGTTCCTCCGCCCTGAGCCCCCGCTTTGCTCTGCATTGCGGCCAGTTCTTTGAAGAACGGC 
GCTAGCATCTCGAAGCATGAGCTGAATAGATCTCCCACCCCTGCTGCATAGGTTTTCGGATTGGCGTCCAGGATTTTCTTGACACCACGCAGTTG 
CGCCAGCGCTTCAAACGGATTGGGTTTAGAGCCGTTGTGATCCTTTTCAGATGTCGGCTTGTACTGTGGATCGCCCACTTTGCGCAGTGTGAGTA 
CCGCTGGAACGGGCTTGGGAAGCGGTGGGGGATCTGGAATAGACGTGGCGGGCAACTGAACCGAGGAAAGCTGCCTTGGGGGCGTTGGTGAGTGT 
GTTGAGGAGCGCTGTCGCTGCATCTGATGACACCGCGCCTTGTCACTTGGCGACGTGTGCATGGGGAAGTGGATGATGTTCTCGCTTAGCGAACG 
CCGGTGCAACAAGATCTCGGCCTCCAACTTCTGAGTAGCCCGTCGCACTCTAGACTTCAGTATATGATCATCAAAGCCACTGCTTCCAGCAGATC 
CATTTGGCGTGCCCGGAAGCGCAGCCGAAGCAGACTTTTTACTTGGAAATCGGAAGCCGTCCTCGTCCACTTCCTGTAGATGGCGCGGTGTTAAA 
GTGCCAGCAGGAGCAGTACAAATAGTGGCCACCGTGGGTACTGATGCTCGGGGTGTCAACGTTACGTTCTCCGAATCGCTGCCTCCTGGTGAGCA 
GTTGATCAACTCCAGGGAACTGAGTGGACTTGCTCCGGCCGACGACTCGTGCTCGTATTTCTCCAGCAGCAAAGTCAACTTTTTGGTGGCATCGT 
GGAACGTGGGTCGGCTCTTGGCATCGTACTGTAAGTTATCGGATGGGTTACAACACATTCTTATATTGATTATAATTCCAAAACTCACCAGACAG 
CAGTAGAAGGCCAGACGTAAAAACACAGGAGGAGTGTCCATCGGACACAATTCTACAAATGCCAGGTAGTCCAGTCCAAACGACGCCGTGCGCGG 
CATCATATCCGGGTCCGCTTCGATGCGGGCTATGATCTCGCATTGAATGATGCCAAAGGAGAAGACATCGCTTGTCTGATCGTACCACTGGCCCT 
TCAGGCACTCCGGACTGACCCAATACGGTGAGCCGACTGTCTCAAGGCGTGATTTTCTGCTGTAAGCAAGCCCATTTTGGATTCGTTTAATTATA 
TTACACTTATGTTATGCTTAGACCAAGATATACTTACGATTTGACTGGTATCTTTGCGGCTAGACCAAAGTCACCCACTACAGCTTCGTACTGAT 
CGTTGGCCAGATTGCGAATAAGTACATTCTGAATGAAGAATCGGATTAGCATACTAATTATTGGAGTTCCTTGCAGTTAGCGCTCACCTTGCTCG 
TCAGATCCCTGTGAAAGATGCCCGCGTCGTGAACGTAGGACATGCCGCGCGCTATACCCAGCGCCAGTCTTATCTTCTGCGTGGCGCTGAGCACC 
ACCTCCTTGTTGGCCAGCAGCTGCTCCAGGGAGCCGCCGTTGATGTACTCGGTCAGGGCGTGCAGTTGGCCCTCCTGCACGCAAACGCCCATGAA 
GCTGAAGCGAAAGAGGAAGGAACACGCTTAGTTAAAGTTTGCCACTGCAGTGCAACTATTCGGTATGGCCCGAATAAGTTTCACTTTCAATCACA 
CAACTTGACGGGCACTGCAACTTCGCGGAGCCCAGATTTCTTATCGCCGGTCGACATGCAATCGGCCACCCCCCACTGATTGTTGGCATTGTTTC 
AGCCTTGATTCGATGCAAAATGAAATTGTAATTCGAATCCCCCGATGCGGCAGGAGCACGCAATAAAAAGAGAAAGTTTTCCCTTCAATGGACAC 
ACAGTTCCTTGGAATAGGGAGTTGGATTGGAATCTCCTAGGATTTAAAGAACAAAGAAACCACTTCCTAAGCACTCATCTCCGCTTCCTGCTCAG 
ACAATGTCAATGACTCCTAAGCCTAAGGAGCTGCTCGCTCACCAGCCGCCAACCACCGACTCACTCCTGGCGGCAGCACAATCTGAGGTCACCAG 
GCTCTGCAACCGGAACTGTAAGCTTATTAAGAAAATGGCAACGCATGCGTCTACCTTCTTGCTAACTCCACTCCTTCCTCAAAACTGTGTGCAGC 
CAGGTAAAAAAAAAGGTGAACTGCTGCCAGACAGGGCTGACGATGATGCTTCAAGGTGACAATGAACTTTAGCGGAACTATGAAAGCATGTCAGC 
GTGCACATCATACTATGTATACCCTTAAAATCAGGGTAGTTATATAAAACACATTTGGTTAATAATCATGCTGAGATGTTCTTGTAGAAATATAT 
AGAAATAATATATAATATAGCCGATTAGCGCATGTGACTGAAGTCCCGTGCAGGTGTTCATATCGATAAGGGTTCACTTCCATTCCGTCTTCTGT 
CAAATGGTCTAATCGGACTTGCAGGCTTTATCTGCGCACATGTTTCTTTTTGGCCAGACCCAGGTTATAGCCATGGCCCAAAAGAGTTGGCCAAA 
TGGTTCCAAACTCTGCGTCGTGTGCCAGATAAGCTTTATCGCCTATATAATATATATTCGCCACACTGCCCCGCACAAACTCAAGTGGCGTTTGA 
AAAGTTTGACTTACGTTTATGTGCGCGATTGGGCCATTAATTCTTCGCTCATAGTAAATATTTATTTAATTAGTTCAACCGCAGAGCAAAGTAAA 
TCGCTAATAAATTTAGCAAACGGTGTGTGGGTCGGAAATAATTGCTTTGAGCACGCAACCCGTCACCTCATCCACCGCAGAACAGACACCTTTCC 
GTCAGGCTGTCAATAAGGATACGACAACAGGGTCTCCAGGGTGCGACTGGGCGCGTGACGTCAGGGCACTGGCAATCAAATGTTATGGCCACCCC 
GACGCGTTTTAACAACCGAGTGGTCTCAACGCATTAACATACGAGTTGCAAAGGGGTTGGCCCAGAACACCAGTTGCATCGTTTGCCGACTGACA 
TTGGGAATCACAGCAGATAGAATCATCAGCGGTCTGCGGGGAAGCTTAATTGAAATCAGTCAGGTCACTGGCTTTCCGGCAGTGAATTCAGGTGA 
GGACACTACAGTGGAAACTGTACAAAATAACTTGTTTTTATATACGGCGTAGGATCTGACTATTTAAAACCATTCATGGTTTAAATAAATATTTA 
TTATTTCAAAAATTTGGATGGAATCGATTGTGGTATTTGTTCATGAGGGATATCTATACGCTTTTGTAACTCAATTAGTTTGTTCTCAGAAAATT 
CGATCTACAAAGGATCGACTGTATTACCATACACAATGCGCTATTACGCTGCATAAAAAATGGAAATGTCTTGGCCAAGAGGGCATGACCCTTTT 
ATATTTTCTTGATAGATATGCCACGCCAGTGTGTGGCAGACACATGTCAGCCCTTCAGACAGCCCAAGGGCTCGGCTCGACGCGGATCTGAAATG 
AAGTTATCACAAAGGCGAGACAAGCTCGATGATCCCAGGTGTTGACGGCGAAGGTAGGGGCTCCAAGGCCCGAGGGTTTTTGTCTGCCGCACATC 
CTCTCCAGCAGGAGAGACACACCTATGCCTCACACCCCCCCCCCCCACCAATCACTCTTCTCGCGACTCGTGACCGTGGGTGCAGCATGCCGCAC 
TTCCATTTGCATGACCATCGCAGGGATTTACGGTTCTAATTCAATTTATTTCACTGCGTGATACACGATTAGCGCCCCGAGTGAGTGTACCCCCC 
ATCCACTCCATGCCCTTTTCCTGCCCATTAGCGGCTTACATTAAGAGGATTTGCCTGCAGTTAACACCTTTAATGTACGCTCTAAGCCATTCTCA 
ACTCAACGTGGCTTTGAAGAAGAGATCTGTTGACGGAAAGGATTTAATTGAAAAGCACAGAAACTGGTTTCTGTTTTAAAATGAGAGTGAGGCTG 
CCAACATCGTTTGATTGCACGCAAAGTACTTTGTTAATTGGGCAGTTAATTTATAACGCATTCATCAGGCTGGGTGCAATGTTATTTTCGGAACG 
CAAAAAGATTAATTAATTATCAAAAGGGGCAAAGGCAGAGCCATTTGCAGTGGGCGTCGCCAGACATTTTGAGGCGGGTTCGTCGACATCAAGCA 
GCATTGCCTGCAAATGCAAAGCAGTCAGTAGCTGGCCAAAAGAGGAGGAGGTGGAGGTACCAAAGGCGAAGGTGTCTCCATTCCATTCCATTCTC 
TAGCAGGAAATGAATACACGTCGAGGGGCCCACTTGAGAAACTGCAGCTAAAGGATGCCGGGAGCAGGAAGCGAAACCTACTCAAGCGGTTATAA 
GCCTCTGAAACTGCCGAAGAATCATTGTTCTGGCAGCGTTTAGTTTTTCAATTTCTTTTTTTTTGTTTCGCCAGCATTTTCGCCAGTTTTCTACC 
GTTTTTACCCTGCAGCCAAAAGAGATGTGATGTGGTGTTAAATTGAAATGTACTGCTTCGCTAGGTGTTACAATATTTGTTACGAGCATATTACA 
CTGTTTTCCCGGGAAAACGCTGTACGTTCAACAGTCGGCAAATCGTCAGCCGGCATTTCTCTTGCAAGTTGCAATGAAAACCATAAATTGACACG 
TTTGCACGTTTCTCTCCCAAGACTGCGCCAATTGGTTTCCCAGACTTTTCACAGACGAAGTCGTAGTAGTCCAACTAAGTCCGCTAACTGCCGCT 
GGCCAGGCCAGCTTGGTTATTAGATAGACTAATGAAATGAAAAATCCCGCTGGAATTCTTTCACCCGTGTGTTGCAATAGCCGCCGCCGCCGCCG 
CCGAAGCATTTTATGTTGGCAAATCGAATTGGAGCATTGAAGCATTTTGCGCTTAATTGAAAATTATTACCACAACATCAGTGGCAGCAACAGAT 
ATTGCAACTACAGCAGCAGCACCAACAGCAGCAGCAACAACAGCAGCAGCGGCACGATCGCGGGGTTAAAAGGTTGACACAAAGTAAACAAAATT 
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CGCTTTTAGGCAAACAGTAAATAACCCAAGACCAGGCCCAATGCCCAACTTCCAAATGAGACGGTGGCACTGGAGCAGCAGCCGCAGCCGGGGGC 
GTGGCAATGGGCGTTATAAATGGTCCTAAACGCTGCCAACTGCAATTACCATTCGGCATTTGCTGCTGTCGGGTAGTAACAACAGAGCAACAGTC 
ACAGCAACAGCAACAGCATTGCGAAATGAAATGATTGAATTTGGTTTTCAATTAGAAAAATATGCAAAACGTGATGCGAAAATTATGCAGATTGC 
AGCTGCAGCAGCAGCAGCGGCGGCAACAGCAACAGCTACAGCAACAGCAGCAACATCCAACAGTGGTGGCCATAACTTCCTCTTCTTGTTTTCAC 
TGCTGACCGATTTTCAATCTAGTGGAAGCTAGTGCAGCCCAAAAAGAAAGCTCATATATCGCGGGGTGTTGCATAAATCTGAATATCTAATGCCC 
GCTTTTTGTTGTGAAAAATTGATTGAAGTGTATTGAAATTGATTGAAAGTTGATGACATAATTTCGTGCATACGAAAAACGCAGGCCAGAGACAA 
AAGACAATGGCCAAACGCAAACGCAAACGCAAATTGGAAGTGAATGTTGGCAGTCAGCCGGGCCAAAAGACGGGCGAGAGAGATGAATGTTTTAT 
GGGACATCGGGCCTGCCTGACACACGTTTATTTACCATTTTACGGCTCTATTTTGTTTAGCTTATTTTTTCATGCGGAAGTTGACAATTACACAA 
GTGCGATTTATGCAGACAAATATGAGACGATTTATCGCTGGGAGATAAACTAACTAAAAATCCAGCGGACGCACAGTAGTGAGTAATTGCTGTCA 
AGAAACTTGTTACAACACGGCATCAGCAGCACGCAACTTGAGCAAAATAATAATAAAAAACCAAAACAAAATGGAATGCAAGGCAGGCTCTCATT 
AAATTCCAATCTGTATCTGTATCTGTATCTGTATCTGTATCTGCATCTGTTGCCGAAAAAACGCTGGGTATTGTGCGATACACATTACTCATGCT 
CTTGCTCGTTTTTTTTTTCGGGTACATCAATCCTGCTGCTCTTCTTTTTATACTTTTACATGTGTTAGCACTTACAGCGACAACAACACAATGTA 
ATAATAATAATGCCGTGGGTTCAATGACTTGCAATTGCGTCGCATTGTCTTGTGTACTATACATATACATATTATTGTGGCTGAGACATAAAGCA 
TTACCAAAATAAATAAAAATAACAGCAATATGCATACCCCAGTTTTGGGTCAGTGAGTTTAGTTGTGCTTGGCATTGCCAAAAAGGCCAAAAAAC 
AAAAAAAAAACATCTCCCTTCCCACCAAAGTGGATGATGATCCATCGATAATTGCTGGCTCTATGGAATGCATAATTTCGTGTTTGCTGTAAACA 
C AC ACT AT T GG T T T AAAT TT AGC AC AC ACC T AC AT ACT C T AC GC TCC AT AAAT AGGC GGC A T T C GGAAGT GCT AAT G AAAAT C CG T G AT GC AAAT 
ACTGCAGATGCTGATTCAGATACAGATACACTGACGTCAATGCAATTGACGATCGAATCCAAATCGAACACACTGATCACACGGGCGTCTGAAAA 
ACAAGCTGCCTAATGAAATTGCGCGCGACAGTTTACCAGGCCATGTTGATCATCAATCAGAAATTAACCCAATTTAGCTTTCTTGCACTGCCTGC 
AATGGCAACGATTCGTGCCACACGGCAAAACACGGAGGCAGCCGAGCAGCTTCAATTACAGGCCATCGATCGACAATTGACAAACCCCAATCGAC 
GAGCAAACATGAAAAGCTGCGCTGATAAAAGTGCGCTGCAAGTAGTCAACCAGTCTGGCAGTCCCGTCGTCCAGGGTATCCATTAAGCAATAAAA 
AGAAATGTTTATATGGCCGCAATCATGACACGCTCTCTTTCGGCCTCATGTTGCTGCTGCTGCAGCTAATGTTGCAGCAGGTGTTGCTGCTGCAA 
CTTGTCAAAGTCTAAAAGTCTGGTAACTGTACAAACTACCCGGCGGCCAGAAATTAAGTACTCAATTTGGCCAGACACTAAATCCCCCATTAAAC 
TACTTACATCTACTATACACACATATAGAACTTAATTATATATGCACGGATTTCGGAATTCACTCAAACTGCTGTCGCGCTGCATGAATGGCCAA 
TTGAACCCATCTTGCACAATAAAATTCTCATTAAATTGTCAACAATTTTTTTTTTTTTTTCGCCCGTTCTCGGCAATCTTTTCATTAGCCCGACA 
ATGTGCCACCACCGATGAGGGCGCCCATTCAATTCAAGGCCAGCCATCTCGGTTTATTGGCAAGCATTTCATTTTGCTTCGTTTCATAAAAGGGG 
ACAGAAATGAGCCGCCAACGCTCGGCGACACACAAATCCAGCCAAGACAAATAATGGCCGAGAGACAACAATGTGCAGTGGCCCCTCGAAAAATG 
GAGGCTTGTGGCAAACACATCATCGCATCTATACTACGGTACCATACCATACCATACTATACTATACCATTGTATACTACCTATATGCAGGCATT 
CCGCCACCACAGACGACCTTCAAGTGACGCCGCGACTTTACTCATCGCAAGATGAGTGGTGGTGGTAGCTCGATCGTCTGACTGGCTATTCAATT 
AAGTGCAATCTGACCTTAGTGAAACCTCTTGCAACGCCAACGCCGGGTTTATATCAATAGCTGAGCTGACTGATTGCTGATTGCCCAAGTCGCTT 
TATTGTAGATCGGGGTCAAAAAGCTCAACTGCAACTTGTGCGGTAGAAAGGGTGAGAGCTCAGTGGTCCAATCCGGTGGGTTAGCACCTTAAGTC 
ATTAACCCGCGCGCATTTGCATTCGCATGCCAACAATTTGTCAACATTTGCCCGACAACTTATCACCTGCTTAGCATCGCAGCCAGGCGCAAAGA 
TCAAGTGCCGCCAAAGCAAAAGCTGAAACAGCAGCTCCAGCGGAATTTCCACCATTGTTTGTGGCCCGTAATGATCCGGGCTAAGAACTGTCACA 
TGTCTTTGCGCGGTCTGAGTCGCGTTAAATGAGCGTTTAAATGGCTGTGAGCCAAAGCGGCAGCTTGTTTACCATTCCCCCCCTGCTCGGTTGTT 
ACAAGTGGAGTAGTTGTTGTTTCACCAGTTGATATATAAATGTATGTATGTATATACCATAAAAAAAAGGGGGCAAAACAAAGTGTAGAGCACCC 
GGCGCCATCAGCGTCATCGCTGTGTTTACATTCAGAGAGAGCCACTCAATAATGTAAGTCAACTTGCAATGGGATTTGTGAGTCAAACGATGGGT 
AT GCT GC GC T AT C AAGAAT T ACGT AT T AAAT AT AAG GAG AT AT AAGG AG AT AT AAGG T ATC C AAT AAT ATGAAT AGT ATATT ATTGC T T AAT AAG 
AAAAAGATATCAAGGTTATGTGTGCATTTAATTTATGAAATAATAATGCGTAGATGCATGCGTGTCGATGCATACTACCGATGATAAGTAAACTA 
TCACGTTAATTACATTTTCTATTTGAATGACCATTTGAATTCATGTATAATTTTGGAGACCAGCCCCTAGAAACAAATACTACTACTACAATTAG 
CCACATTTTCGCAGTGCATTAGTGTTGCTGCTGGGGAGCAGAGTATGGAGCAAACTATCTGTCAAGTTGGTGGCAGCTTGGGAGCGGCGACAGCT 
ACAAGCCATGCCATGTGGGTGGCTGGGTGCCGTTGCTCCAGTGGGCCAATAGAGAGGGGAGGCGTGCCAAAGGCTGGGCGGGGAATGCCATTCGC 
AATTGCCCGACTCGCTGGCTGTTCGTTCTCATTGTGCGACATTCCGCATTACGGCGGCCGCGTGATAAGCCCATAAAAAATACACGCCGCCAACG 
GCAACCACTCAACTAACTATCCTCAGTCGAGTGTGTTGTGAGAGTTTCAGTTTCAGTTTCAGTTTTGTGGCCAATAAGTCAGCGATTCGGGCGTG 
GGTGTGCATACTATTCGATGCATTGCTGTAGTACATGGCGTGCTGCCGGCTAGAAAGGCGGTTTCAATTACCCAAAAGATTAGTTCTATTGACGC 
TCGACTGCATCGGCATTACACTGATAGTCGGAGGATGCAACGGTACAGCTAGTTCTGGCAGCCAATCAATAGGCTTCATTAGCCGGATCGGATCG 
GATCGGATCGTAGAAGGCCGCCATGGGAGGCGGCCCGCATGGACGCCCAGCGGTGTAGCTGTTAAAATTTGACAGTTAGCCGACAGCCGCGATCG 
TTAGGCATCATTAAACGCGCCCGCCAACAAGAGCAGCGCCTAACTAACTAACTAACTAACTAACTAACTGTCAGTCAACCCGGTCAGTCATTCGT 
ACAGTTGATCATTTGGACGACCTGGGAGCGGAGTGTTTAGCGATCCACGTCACTGATAAAATCACAAACGCGGATCGAGGATCGCGGATATGAGC 
GGGATTCAATCGGGCGGAACAGGCATAGGTACTAGTACCTAAATGCATACACTGAGACAAATAATAACGCTGTTAAATATATATAAAAGTACATA 
TTAAAATTGATTGAAATGCAGCGCTTACAGAATAAGCTATAAATAATTATAAAGCATCTCATTTGAAGTGCTTAAATAGCTTTAAGGTGTAAAAT 
AAAGTTTGAAAAATGAAAAATATTTAAATCCTGTTAATTCCAACATATTGTTTTTGATTGCAACTACTTGTTCTCGCAGTGTCGCATTAATTGGC 
CTCAGTGTAAATATATAAATAGCTTTGCTGCAGGGGATCCATAAAACAAAGAATTAAATGCGTGCGCCTGAATTTTAATTTCACTATCCAGTCGA 
CCGATGGCGCCACAGCTGCATTCCAGAGTTTTTAGAATTTTCATTTAGTGCCGACGAATAAAGAAAAGTTGGCAGCTGCATGGATTCGCTTTAGA 
CTGGCTGTGTTCCGCTGACCCAAATCCATCTCTCAACGAGCCCGGCGCACACAGCGAAATAAGTTGGCCAGCCAGGGCATTTAATCAACAGAATC 
CGTTGCATTTAATACCCCGACCGCGATTCGTTGCACGTTTCTCACTTAGCTCTGTGTCAGAAACAGTCGCTAATGACAGATGTCATAAAATCAGG 
TGGGCTCTACCAATACTAACTCTTGGAAAATCCCGAGTGAGGTCAGATCAAATCGCGTTGCCTTCTTAACGTTTAATCTTTCGGATTATACGAGT 
TTAAAGTTTGAATGCCTAAAAAACAAAAGAATGGCAGCTAATATAGAAGACATGTCTCTTGCCATTTTCGGGGTGTTAACGTGGTAAATGAAAGG 
GCGGGTCCACAACACACACACACACAGAGAGAAAGAGAAAAAGGTCTTAGCCAGAAATTGCATATTCATGGAGGCGAGCAGGAAAAGATGGGCGG 
TGCAAAATGGATTGGTCTGTCTGGTCTGGCATTTTGCGGAAAAAATCGCTAAAATAGAAAACGGCAACAGCAGCAGCAAAAAGTCCCTGAGAATT 
GCATACAATTGAAGTCGCTGCCATCAGACACTCATTGTTGCGTGCAGTCACGTGAGGATGAAAAACAAACTGAAAAAATAGTAGAAATATATTCG 
AAAAAACAGAAACAAAAAACAAACGCCAGCGAAGTGAGTAAGCTCTACGCAGCTTGGCAATTAAATCTAGTCAGGTCCGACAGACAACAATGTCA 
TGAGCCTTTGCCATATTGTTTCAGACTCTGCCACTCAGCCAGCCAATTACATCAGCAGTTTGTTTTTTTTTTTTCTGTGACCCGTGAGAGGGTCC 
CCCCCCCAAACTCTGCTAACCCCCCAAACCCACCCAACACACCTGGTCACTGCCAAGGTAATTACACAGCAGAGACTCTGCTGCAGCCTGAACTT 
GACCTTTGCGCGAGCATCGGAAATGTGTTGTGCGATTTTGGTCTTGAACCCGCTGCTCTGAACTCCAACTGGGGGTACTCTTTCAAAATGCCAAA 
TAGCAAATGGCCAGTCGAAAAATATAAACAAATCGGCGAAACCATAACAGACCAATCTCATTGCGGTAAACAGTAAATATTTGCACATTATTAAT 
TTGAAAGCACTCATGCTGCCATAAATACATTTGTGTGTAAAAATAATATCCATAATGATGTCAGTGGAGTGAGCGGCCCCAGAAGCACAGATGTC 
AAGAGTTACGACGACAGGCTGAAAGTGAATCATACCCCTCATATACTACATACATACATACATACATATATGATCATTTCCATTTTAGCATAGGT 
CAGTATTATTTTGATGATGCCAGCAGAAGCAGCCCCTAACCTAAGTGCTGAGAAATCATGTGTAAAATATGCCAAAAACTGAATTGAAACCGACC 
GAGAAGAACATGGGCATTATTTGCGTAGAATTTAAAATAAAGACCCCGCGCGATGAACCCATTAAATTGTTTACGTAATCCATTTAAATGAATGG 
TCAATTCGCGTCGAGGTGCGAACTGAATGCGTAAACACAATCGGGCCACATGATCATCAAAAGCTAAGGGGGGAGTAGTATATATGTATATATGT 
CAATATATGTGTGTGTAAATACATATTATCATCCAGAGTGGACAACAACAAGTAGCATTCATACATAAATAAGTAACGCAGTCAAGTTTGCAATT 
AAAATCAGTGAATGAGTCAGACCATCGAGTTTGGCATTGACCCATAAGATGCTTTGGCTTGCTCGCGATTAGAATGGGTATTTTTAGACGATAAG 
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CTGGCTGGACCACAAACTTCATATTATAGACCATATAAATATAATATTATAGACTTACCTCAATATATTCGCATGCGAAAGTTTATTCAACAGCT 
GCACCTCCCGCAGCATATTCGGTCGATTGGCCCGCAACTGATTCATCTTGAGCACCATCACCTGGCCGGTTGTACGATGTGTGACCTGCAAAATC 
GGAATCAGCAAACAGAAATCAGTATAAACATTGCCAAAATCAGAAACATGAAAATATATACATATATGTAGAAGCAGATCCATT^AATCGTTCTGC 
CAGTCAGACAATTCAAGTGCAAAATGTGAGGACACGACTAAACTAAAAACTGAACCAGGTACAAAAACACATGTTCGAATTCCAGGGTCAAGTTC 
GCGTGGAAAGTTGTTGGCGAGTGCAAGGCAAATAGTTCGAGAGCTGGTGGATAATTGCATCGAAACAAGTGAGACAACAAGCAAAGTGGTTTTCA 
TGGATTTGTGCAGACCAAATGAAAAATCAATAAACCCGTCTAACTGTCTGCGCTGGCAATTTTTGGTCCAATGCAATTTGATGGTCGGTGGTTTG 
CGCGCCAGAGAAGTCGTCTGCAATCACAATTAAAATTGTTAATTGAATTCGCCTTTTCCAGATGGGTTGGGTTGGTAATAGGTACATACGCCCAC 
C AAAAAGGGT T AAAT GT CAT T T GGT AT GT AGGT GCTATTCCT GT AC ATC T AC AC T AG AAGC AG AAAT T AAAT T AC T T AAT T GA GT T AGC AAAT T A 
ACCCCAGGCTAGACATCGTTTAAATCATCTTACAATATTCTAGACGACGTGTGAAAAGGTTAAAATATTTTAAAATAACCTAGAACTGTTGCCAT 
CTAAACTTGAAACCAAATAACAAAGAACATTAGTATCTTTATTTGAAGATTATACTTTAGATTTAAGAGCGACTTTTTCTCAGTTTCGAGCTGGT 
CGTTGGGCGTCTGAGCCGGGATTGCCACTTGGCGCCAGACAACATTATTCACAATTCACTTAAACAAATAGCCCCAAGTTGCTGCCAAAGCACAC 
ACACGTACACACGAACAACACTCGTTTTTATGTGGACAATGGCAACTAGTGTTCAACTCTTCTTGAATTTTAGTATTCTATTCTATTTTTATGCC 
AACAAACACACTGCAAATACGTAGTGAAAACAAACAGCAGGCTGTGGCACTTCCACACACTTCTATTCCTTGAGCGCGAATTTCTCGGCTCTTTG 
CTAATGGCAGCAAATTAAGTTTCCTTGTCCCTTGCAATTCGCTCGAGCGGAATCTACATACATACATATATATATATAGATATGTAGGTGAGTAA 
GCTGATACAGGCCGGATGGATGGATGGCTGGATGGCTGGATAGATAGATAGGATACGGGGGACACTTGGAGACCGGGAACACGTGTGCGTGGGTG 
ACACACACGACGCCGACAACTACCATGCTACCATTACCTGCAGTACCTTTTTATGTGTACAAACAACCAAGTCGTAACCAAGTCATTCCCAAGTC 
GTTCTTTATTGTCCTGTTGCGCTTTTAGCTCATTTCGACTGTGAGCTGGCCTCCATTCTCTATCTGCTAGCTGCCTTTGCGTCGGTGTTAACAAA 
TTTATGAAGTTTCTGTTGCCATTTGCTACAAGATATTCGCTTACGCCCAGCGGCCGAAGTCCAATTCACGGCGGTCTGCAGTCATGAGTTGTTTG 
GCTTGTTGTCTCCGCCGGCAGCAGAATGTGGCAATCAAAATCAAAAGCAGAGGGTAAGGAAAAGAGGTGGTTGCCCAGATTCTTAACTCGGCTCA 
GAATTCGATTTCCGCTTGGTGTCACTTTATTTTCGGCTAGCTTCTCATGTCATTCGCTCATTTGCTTATCTGCCACTCAAAAAGCTTTCAATATA 
CTTATAAGTTATATTTATACCTATATGCTTTCCGCAATAAAATCACCTCATAATGCTGACACACCCAGCCGGATGGACTCGAAAAAATATATTTG 
CAGTAAGGCAGTTCTCTTTTAAATCGCTGCTGTTCATCATACATATATCGATTATAATGGCAAGACTAATTTTGTGAGAAAACCAAATAATATAT 
GTAATATATATTAAATTTGCAATATCATATCTTATCTATCATATCAATAGCAATTTCTTTAAATAATTCATACTTTTATCTATGATATAGGGGAC 
TTCCATGTATTTTTCTATTTCTTTGAAATCTTGATTCCTGAAAGTTTCCCACCTAGTGAGGTCTCACTGTATTTCTTGCTTTAATTGCCTAATTA 
TTTGTGCGTGACCAATAAAGTTGATGGCTTTACGTCCGTTTGCCAAATTGATTTGCATAAAACGAGACCGTGTACCTTACAGACAACATTCTGTC 
ACACAGCCCTGTCATCCCGAGCGACAGGCCATCAAAAGGGCCACTCCCCAAGTGTGTGTGTCCATTGAGTGTTTGCCGGGACAATTAAACCATTA 
AATGCCATTTTAGAATGCTTTCGAAGCGCTTAATGAGCGCTGCAGATGAAAAGCGGCAGTGCAGTTTCCACAAACTACGTGGTGTATTTAACCAA 
AAATGTTCGAACCGTTCGTGCCGCAGACAAAGCGAATGAGCCGCCTCTCCTCCGGCTGGTGCAGTGGACCAAAAAGTGGCACGTCCGTGTTCCTA 
AGTCCGAAGAGCCAGTCTCTGCAACTGTGCTAAAAGAGAACTCACTCCACATCCATCATACATACATATGTGAGCTGAGCTGTGTGTGGATTGAT 
CTTTTGCTCGGCGGCCAGCTTCAAAGACTGTCCATTTGGTCGGTTGGTCGGTCGGTTTCTTATTGAAAAGCGTTTAGGCCAGCAATGACGTGAGA 
AAAGAATTACCCACAGAACGCAAACATACTCATACATCATAGCCCAGCATCGATGGGCTTCCAAAATAATCAATTACGGGCGAAATCCGAAAACG 
AAGGTGTTCCAGTGAAAGAAACCACCATTGATCTGGGTCGTCGTCGCCGTCGTCGTCGTCGTGTGTGGGAAAGTTACTCCCAAGATTGGATTCTA 
AAAGCTTTTTAGCAGCTTAGAAAAATCCAGGAATTTTCCCCCGAAATTCATCGTTTCATTTTTTAGCCCCAAAATGTGTGATAAACAATAGTCGA 
ACGTAAACAGCAGAACAAGAAATCAATTATGAAGGCGCATGAAGAGCATTTTCAACTTATTGGCTCTGTTTGCTTATTTCGAAGCATGGAAACGC 
TTTTAATTTTTCGTTTGCACTGCAAATAATGTCCAACTTAAAGTTCCGCCAATGAAGACGACCGGCCAATAACCGCAGGAGGTGAGAGTGAAGTC 
ACAGTTGTGCGTTCTGCGCATTCAAATGAAACGCATCAAATGATTATTACTTAACCAGTTTTAATGGCCAGCTGAGAGGCTACTCATCTCATTGG 
ATTGCATCGATGGTTATGTCAACAGGCGGCAGCAAGTCAGCAGGCATTTCGGCATGCAAACAAATTTCTTGCCTTTTGCCGTAAAATACTCGTAT 
TTTCAC T TTCAGT T GGGGCAAAAAT GAAAAC AAC AAAAACAACGCC ACAAT AAAAAT GGTGGCGGT TGCGGAAAC AAAAC AACAACGAAAGTTGC 
CGCACCAGCTCGCCAAAAAAAAACATCCTTTTTTGACCAAAAGTCCAGGCATTTGGCCGAAAACCAGTCGCCACTCACACTGACATTTCGGCCGG 
CGGAAAGGAAAAGAAAGATGCAAGCAAAAAACCAACAAAATATTGGTAATCGGTAGGCAGAGAGAGAAAAACCCGCTCATTTTTTGGCCTGTAAG 
CGCGCTCGTTACATACTCGTGATTAATTTTGTTGGCCAAAAAACGCGTTTTGAGCCAAACGGACGCGGCATGGGGCACACGTGTGACATTCGATC 
TTCGACTCGATTGCACCGGCTACCACCAGTTAGTTAGTTGCTTTTGGTGTTGAAAATATTCCAGCTATTTACACGGCGTTGTTGCGTTTTCGTTC 
ATTTTGCTGCTTGACAAATACTTAATGCGATTTACAGTTGGGTTTTTTCATTCACCAGAGTCAGAGCCATGCCTGGTCAATCATCTTTCACATGG 
TAGGCAGCGCGTTGTGGCAACACCAGTTAGCGGTACAATGTCACCGGCGGAACGGCTCCGTAGTGGCAGTGAGTCTGGCCAAAACGTGTTCTGGC 
ACACTCCGTCGCGTTTTTCACACCGTTGCGCGAACCCAAAGCCCAATGCCCAGAAATACGCACTCCGAGACGACAGAGAAATGTCGGCTAAAACC 
ATGTAGTAAGACAAATATTAGGCGAGGTGTTGCTGCCACCAACTTCTGGTCATTCCCATTCCCATTCCCATTCCCATTGCCAACAACTTGTTGGC 
AGGAAAAGAACCAGCTGGCGAGCTGAGCGATGGAAGCCGGCTTTTGAATGGAATTTAACCCAGTTTGCTTTCTTCTTTCTCGCTGGCTGGCAGAT 
AGCAGATAACTGCTGCTAGCCGGCTTGTTGTCTTCATTTCTACCACGACAACTGGCTTTTCAGTTTTGCTTTTGGGCGTCAGGGGCCGCTCTAAC 
TAAATGGCCTTTATATTGCTATGGTCAGTTATAACGAGCCAATGCAGAGAGACAGGTGACTTTGTTTGGAAATTTGCATGGAGCTGAAATCGGGT 
TAAATGGTATTAATATTGCACGCTCTATTGTCACGAGAACGAGCCAAGTATAATAAGGATTTTAACCTAAATCTATCTTATAAGAAGTGTGTGCT 
TCAAGCTTTTTAATTTACATTTGCCTATAAGGTATGGTTTATATCTTCTTCACTTATGGCAGAACTAAAATTAACCTTGACAGCTGCACTGACGT 
AGGCACAAATATTTTTAGACATTATTTGACCGTATCAAAATTGTTTTAGCCTTATCTGGCTGAACATGAACATATATAAAACCTACATATTCTCT 
GGTTATATCAGCACAGTCAAGTACTAGGATTATAATTCAATGGGTAATCTAATCAGCCTAGCTTATCATCGGCAAGTATGGAAAGTTCAACTGCC 
CAAGGGCAAACTTATGATTGGGCCATAAAAATGCATCCGTTGGCCGAAGGGATGGGATCCCCAATTAATTAGCCAAAAAGCGAGAGAGCGAGAGC 
GTGTGCATCAATCACCATAAACGTTTATTAAGCCGACAATAGCGGAACACAAGTGCAGATAAAACTTCAGCCAGCCAGCGCCAAATAAAAGATAT 
ATATCTCTCTATATTATATGCCCGGGCCACAAACAGAGTCGTAAACAACGGCAAAAAACCATAAGCTCAAGTGCCGTTTTAAGCGATTTCGAATT 
TGGTTTTCATGTGAAACGAGCGGAATTCGCTGTTGCACCTTATACATTCACACCATCACTATTGCTTTTTCTTTTTTTTTTCCAAATGCTTTAAT 
TGGGCGGTGGGCGGTGCAACATGTGGTATATGCACCATGTGTGTGGGCGACCAGGAAGTTGTCCGTGTTGTCCATTTTGAGGAGTGCAAATCAAA 
GAATATTCGAGGCTGTGTGCTGCAAGCTGCAATTATCGGGGGCAAGATACAGTTTCAGATTTATGCAACCTTTTATTGAACTTGTTGGATAATAA 
TGTGTGTCTGTGTGTGTGTGCCTCGGACGTCGGATGATTATTATTATGGTTTGTGGCACCGTGTGTGTTTGTGCTTTTAATTGTTAACGGAAGCC 
TTGGCGCGCAGTTATGCAAATCTTTTTTGCTTTAGTTTGGAATGTTTGTTTTTTTCTGTTTTGTGTTGTTGTTTTTTTTTTTTTTCAGGCGTTTA 
CCGTCTTTTCTTTGCCATGCAGCTCCTGCCTGTTGTTTCACCTTTTCCTTTGCGGCACAGCACATGTTTAACAAAATTTTATTTCGTGTTAAGCA 
AATAATTTAGCAAAGTAAAATGTTGTTGCCATTAGATGTTGCCGGGGCAGGAAGCAACAACGGCAACAGCTGCAGCTACAATAGCAACAGCAACA 
TCAGCAGCAAGCTCACGCATATTGCACAACTTTTAGTGGGAGGATGAGGTGGGAGCTGGCAATTTCTTTTGCCAGTCTGTTTTTTTTTTGTAACA 
ACAGCAATTATTAGCTGTTAGTTGTAGGCAACATGGAAATTCGATATCAAATTGAAAAAACCCGAGGGGTAAGGAAAAGCGAATGTTGTTTTGGC 
GGAGGCGGGGACAATTAACGCATACTAAACTAAAAACCGCTATGCATCACGAACACGCGGAAATTGCCAGTGCACATCTCATGTATTTATCAGCT 
AAGCTCAAAGTAATAATAATAATAATAACAAACATTGCATAATCAATTAAATCTGACAAGGAGGCCTCACTTGCTATCAGCCTTCGAGATCACCT 
TCAGCAGTTCATTGCACGTTCGTTCGCTGGCTGGGTAGATGGGTTGGACGACTAACTGCCAGTCGGTCTATAACTAAGTATAGTTTAATGGCCCA 
TGTTATGTCACCGTCATCATCCTCGTGCCCCTGCCCCTGCCCCTTCCCCCAGCCACTGCCCATCCATCGATTCATTTTAAAGACCCCGTAATGCT 
GTTCTCACGAACTGAGAGACTCTAGGACTCTTGGCTTTAATGACATCGGACGACAGCAAAAGTTTACGATTACACGACCGGCGTCGTTTCTCGGC 
CATAAAAAGCATTCAAAATCGAACACGTTCTACATATTGAATTATAACAATGTCACACAATAGTAACAATCCAGCCATTCCAGCTATACAACCGT 
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CATGTGTAGGATGACTCGGCAACTGGGGTATATTCTTTTCAGTTTTCAGTTCCAGCAGTTTGAGGTGGGTCGTATCATACGACTTGCATACGCTT 
AAGGGTTCGTGAGCGATATCTTTGGGAAGTGCTGCTCCTACTGCTCACCTCTGAAATTAATAATAAACTGGACTACGACGTGCCTGCCATGGATA 
CGGAATGGCTGTTTGTTTAGGTGAGGCGCACGACGACGACGACGACAACAAGACATTCTTGGCTTGGATCGAACCTGATTTGTTTATGGGCTTGC 
CCAGCTGCATATTTCCCCCTAGATCATTCGAAATCACTCTAAACGAAGAGAACGTCGACGACCCCCGCCAAGACGAAAAATTATAGTACAACTAT 
TACCTATTACCTATCCGCCGATCCGAAGGTCATAATTATTCATATTCGTATTCTTGGAGATATCAATCATTTGCGGTTGTGGAATGTGCTGAAAT 
GGGCAATCCGCTATATATATATATAATGACTTAATAATTTGCCGAACAATATATTAGACTGATTTATAGTTGATAGAATGCTATCTCACTCCGGC 
TTCACCTCTATTTGTGTACTTTACGATACTCCGCTATAGCAGACCGTACAATAATAAAATTAAAAGCCGCTTAACTTGTTTACTTTTGCATCAAT 
GGCAT G GC AAAAG AC AT AAT AAAT GG GT T AAGC AAAC AC ACT C T C AC AAAC GC AC C T C AAAAAAAAAAAAAAT AAAAT AAAAC AAAAAAAAC C AG 
CAACGTCATTAACCGACCCCACGTTTCGGGCCCGTTGAGTGCCTTCAGATAGTCGAGGGATCTTAGATTGCAGATTGTACATACATATCTATGTA 
TATATATATGTACATATAACTAGCACTATTCACGTAGCGATTGCATGGGCATTACGCTTTTCAGTTATTTCGTTTTCTCTAATATTCGGCGCGGA 
AAAAACTCCCCAAGGTAGTTCCGCAAAAACTCTGTCGTTTAAAAATTTTAACGGGGTGCCTTAGAAAGTGTTAGAAACACCACTCACACGACTGA 
GCTTATAATAGACTCACAGGGGAGATACATATGCATATATATTCAGAAATTATCATAAAACCTAGTGCACTTTTATAGCCAGTCGGTATGATGCA 
AGCAAATGCATTCCAAACATGTTCCTCTTTCCGTTTTGTTATCAGTACACGTTAAAACTTCTGATACAAGAACTAAAGCTACAATCCATATTTAT 
AAGCACAGTGCCATTGCCACACCGAGTATGAGAAATAAATTAATCTTCGATACATGGTTAAGGCAATTAAATGGATAAATGTAACACAAAATCAA 
TTAAAGACGTCTTGTTGTACATGAGCAGCAGCTATAGACACCTCTATATATATCTATATATTTATATATATATATGGGTATATGTATCTATATCG 
ACAAGTTGCCGCCTCTAACAAACTGTGCTAAATACCAGAAATGAAATAAAAACAAAAACCGCACAAGACAGTAGCAAACTGTTAATTGGATGTGG 
CGACTGTCAATCGATAGGTGGAAGTGAAAAGCCAGAACTGAAGCGTAGATACATATATGTATCTGTACGACTTGCAGTGCGTTTTTCAAGTACAT 
ACATACGTCAAAAAGGGGAGGGATTAGGAGGGGTGTGCTGCAAAATAATTTGACCTTAGAGCCACTAAATTGATCGAATTGCCAATTGAAAGGGG 
CTTCGATGGGGTGGGAAAATACACAAACAAATTTATCGAACAGAGAAGCTAGTTTTTCAGTTTTAATTTCAAACCAATCCATTCGATTCTACGGC 
GCCAGACAATCCCTTCTTGCCACTAAGCTCTTTGATCACTTATGCACGGCGAGTAAGTCATACAATTTAATTGCTTTTATCGCACGCCTTAAATC 
GCAAACATCAATGGATGTGGGCTCTGTCAATTAAATTAGAAAAGCACGCGTGTCTGTGAGTAGTACCTTTTTGCCACACAGCAGTTGCACTCACA 
CACATTCACCACCACCACCACCAACAACAACAACAACAACAGATGCATTTGGCCAGTTGTGACACGTCCCTGTGATTAAAATCAACATAAAATCC 
AGACACACGCCTTCCGCTGGCTTACCCATCCCCATCCCCGTTCCTAGGTTTTCATTAAATGACAACAACAAGCACTTCCGTTTCCATCGCTTTTC 
TCCGGCTATTCGACTAAGTGTGGCGCATTTGCCATAAGCAGTCCCAGCGCCCATGTCGCCACCATTCCACCATTGGCCCAAAGACCAAACAGTCC 
AAGCCGATCCAATCAAGACGACTTGGGTCGCTTCGGATCGGATTGGATCGGTGTCCGTTGCATGGTGACCATTAAAGGGTCACTGGCATGCTGCT 
GGCGATTAGCACAAGCAAGATGCCATGCGAAAGGGTTTTCGCCTCCCTATTTGGCTAGAGAAAACTCCGACAAGATGGGAAAGTTGCAAGGCAAC 
ACAGAAATGGGGAATACCCTATCTTTGCTCAACTAAGAGAGATTTATAGAAGTTTTTACTATCCTTATAACCTTATGATAATGTATGTTACTATC 
CAC T T TAG AAT C G AAT C T T TT AAT AT C CT T T T AAGGCT GG C AAGG AT CAT G AGT G TC C AT CT T GG AG AAAG GGT AT TT C AGCT AGAG AAAC G AC G 
AGTCTCTTGAGAGATCTTTGGCACAAAGAGCTGCATTCGAAATCCACGCAATTTCCATACCCAGAACGGCTCGAACTTTGTCGGCTTGTTAAGTG 
CGCGATCGGGTACGAAATGTAATTGGTTTGGCGCGTTAATAATCAATTGGCGTTTACATTTTAAATGTGGCATGCCGCGGACAGGGAAAGGCGGC 
TGCGTTTGTTCTCCCAGCGAGACACCGAGGAATGCCAGCCAGCCAGCCAACCAGCCAGCCAGGCCAAACGGAGCCAGCACTCTAAACTCTCCATT 
GAGATCGTGGCGACGTCTGCCCTTGATGAAGTGCCACATGCTGCCGGTTGCCACAGTAGCCAGTCCGTATGATGGAAGACCCTTGCCCCCGCCCT 
TTTCTCCCCCCGCCACCATCACCGCCAGTTAGCGTTTGTGCAACTCATGTCAAGTGGGCCGGACTGGTCGGCGACTACACGTGTCGCCCAACGTC 
ATCGCCATCGCCAGCACCCAACTAGTTCTAATCGAAGCCGGATCGTATTGCACGTTAAAAAAAGAATATCAACTACATGGATAATAGTTTAAAAA 
ATTGGGTTATCTTTCAATACCTATTTAAGATTTTTAGTTTGAGCTTCCATTTCTGTAAGCTGAATCAATCTTAAGGTTTAAAGTCTTTAACTATT 
GTATTTTCCAAGATAAGTAATTCAATTCCTGAGTGTTAATTGAAGATTACTTCAATCTGAAACTGATTTTACTTAAAAGTGTCAACTTTTGTGAG 
GAAAAACTTATAAAATATGTGCAAAACTTAGTTCAAGTCTGCAGATCAGATCATCTAAAAGCCCTCTTTTAATCTGTTGTAATGTTCTGTTAGTT 
CAATCTAATAAGATTTTGTTCTGAGTGCAGAAACCAGTTGGACTGGTGGCGATTGGTGGCGTTTCAAGCTCAAGTCCATCAATCAGCGGCTTGTT 
TGGGAGCTGAGTGAGCCATGGGCTGCAGACCCGGGCTAAAATAGCTTTTTAATTGTCGCAAATATTTATAGGAAATTTCTCACGCACAGCCGAAT 
GGAGTCGTGCAAATTACCTGGTCTCAGTTACCTCAGTAGTCACTCGAATGTCGTGCAATTAAATCCAAGGATTTGACAATAAGGAACGAAAAAAA 
GCTCACCTGAAAGCAGAGAGAAGGAAAAACATTTATTAATTCGGTGCTACGCGTGCGATGGCAATGAACTTATATTCGTTTAACAATCTACTTTT 
GATAATTAAAATGTGCGCATAAATTATTGTACACTTCATGAGCAATCAGAAACAGAAACAAAGTGCAAACGCTCCACATAAAGAAAGTAGAGAAG 
AAAG AAACAT AAAAAC AAC AC AC AAT AAAC AATT T AATTACGC AT CGT T T GT GC ATTGAGTGTATAACC CGC AC AAAAGTGAAAT GC AGTTATAA 
ACACTCAAAAAATGAATAATCTTATATTGCCAAAGCAAATAAACAAAGACACGGCCCACAACAGTTGGCTGCACATTTCAAATTGGCCAAGAGAG 
CGGACAAGTAAGCCCCGCTACGAATTCATTACGTTTTCGAATGTGAGAAAATTTTCGAGATGAATTCCTGACTCTAGTCGCCTGGAAGTTTTTTA 
ATTCATAAACACGAGGGCAATAAATTACTTTGTATTGGAAAAACCTACGCAGATGTGCCTGCTGCTTACATGTCGCGTGTTAGTTGGTTCAAAAA 
GAAAGTCTGCACTATATATATGATTGCTTTGTTTCGTTTTCCACACCATTGTTAGTTAGCGGTTTTCCAAATATTTCAGCTCCTTTGGCAAATTA 
TGTTATGTCGAAATCAAGTTTAAAATAAATCAGAAAACTTAGTCAAAAAACCCAATCTCAACTGCAACATCTGCATAGCATCTTGTCATGTCGCT 
TGTCGCTCAACTTTTCCAACGCGCTCCTCCTCTGTTTTTATTACTCCATTGCCGCGGAGGGATGGATATGAATGGTTGAATGGTCGGATAGTTGG 
ATGGTCCAATGGCTGGATCACTGGATGGGTGTCTGACTGGAGGAGAGCCTGCGACATGTCAACTATTTCAAGGCTAAATCAGCGAGGCAGTCAGT 
CGTCGTGTGTCAACAGTGTGTTTCACCCTCGCCAACCGAGCGAGGTCATCGAACTCCGGACACTGCAGCTCGGATTTATGGAACAAGTCGGAATG 
CATTTGCCTGTCATTTCGTTAATTGAAAGTTGGTGACAACAAACAGCAGAGGCAGTCGAGAGCCGGGTCCCGTCTCGGTGAGTAATTTCATTTGG 
ATATCCTTCATTCAGGCACTCCGAATTACGGCCCAGATGGAATGCAAACAAATAAGTACAAGCAGACTAACTAAATGCAAACTCGCAATCAATTA 
GAACAAACAGAGCTCTGTTTTTCCCAGGCCCATTTGGCGACTCGTTAAAATAGCTTACTCACAAACAAGCACCGCGCCGCACCCTCACTGCCACA 
AAAAAAAATGTTGCTCATTATTTTTCGGTCTTTTAGCGTTTTTTTGTTTTGGTTTTTCTGCTGTGTTTTTCCAATGCGTTCGAAAATAAATAAAT 
AATAATAATAAAGTTGCACGACACCCATATTCGTATGAATAAAGGCCGAGTATCTCGAAATTTATTTAATTTTGTACCTCTTTTTTTTGGGCACA 
TTTGCTGGGAGCAGCCGTCAATTCGTCAGATGCGTTTTAGTTGCCTCATAAAAAAAAAACCCAGTCTGGGCTTAGAAAATGCAACACAGGAAAAT 
AATAACATTTCGTGGTTTCTGGCGCCGTGACAAATTTAAATGAGCTCTACTTTTGGGTGCAGCCCCTGCGACACCAAAACAACAACAACAACAGC 
AAGTAAAAAAAAAACGAAAAAGGGGCAACAGCATATATATATATAGTCATAATAATGGGGTCATATATTACTAATGTGTTTGGATCGCCGGCGTC 
TGCCCGAAGCTCCAAAATGAATCAATGCAAAACTATGAAATGCATATCGCAACGACAGCAACAACAACAGTGACAACACAGTTGACAAGCGCTGC 
CCGCGGTGCACAGTGGTCAAAACACTTGAATTATCGTGTAAAATAATAGTAATTATAAAGTTAAATCTCTTGCCATTGCTAAGCTAATAACAAGA 
CAATTTTATTGTAAAAATCGTAATAAGAGATTATAAAAACCTGTTAAGGGTCTAAAGCTAGTTTTTCCACAGTAGCTATTGAGTAATGAATTGTT 
GAAATATAAGCTATAATTTATAATTGCGTATAACAATTATCATATACCCATTACCGGACCCACTGTGCTGCAACACGCTGGCGATGAATCAGTTC 
TACAGCGGATGGAGGCTGGGCAATCATCGTCGTCGTCGTCGTCGTCAAGGGGCAATATGCTCAAGGCCTTGTGTCAGAATGCATTTTTATGAATG 
AACGCGCGAAGGCATACAATTTACACGTATTGAGTGCGCTTAAATGCTCCGCCAATCGCCGGCGAGCGAAACTGCTGCAGATAGCTGGGATCTCA 
AAAGCTAAACTATTTTTGCCAGCATCCTCGATGATTTCGAAGTTCGCCAGAGTTCGCTGAAAAGCCAATGCAATGCAATATGGCTATGAGATAGC 
GAAGTGCATTGAACTCCGGCCATCGGATGGATCAACCAGTTGAGCAGTCAAACAAACGTTTAAACGCTCAACAGTCAATTCCATGCGACTCCATT 
CAATTCAATTCAATTCAACTTGATTCGATTCGAGTATCTGCTAAAAGCAGATATTACCAACTGGCTTTGGCCAGCCCGTTTTTTGGCCGTTGCCT 
ATTTGGTAGGATATCGTGTTGGCCGCGGCAGTTCGCTTTTTGGGCGGCCCCATCCTCCAAAAAGCGAGTGAGCGCGCCCAGCCGGTCACGTTTCA 
ACTATCAACTGGAAATGGCTTTTTGATTGGCCATCCTCCCCAGAGCAATGCCTATATAGTTCTCGAAATGCGGAAGTTATGAGCCTAGCCAAGAT 
TAACCTGCGCCCTCCTCCTCCTCCTCCCTTCGCTAATGGCGTATATTTTCTAATCAATTTTCGAACACACTGAAAAGTGCCTTGCAGCGTTAAAT 



FIGURE SHEET 771 



CGATAAATTGCGATGGGGCAGCTATATGTGGCGATGTTGTTGCTGCTGGTGTTGTTGTTGTCTTCCCCATTAGACAACAGGCTGTGCAATCTTTA 
GTGTCGTTTGTCACCATATTTGCCACTGCAGTTGACACTGATGGGGGACCAACAACCGAGAACCACACAGAAACCCACCCAGACAGCCAGCCTCT 
CTGCTGTCAGCTTCGTCTGGGTCTATGTTTGCACTGAAAGAAATGCACTTGGAAGTCGTATTTATGGTTTCAATATACGGAAACCTATATAAAAG 
CTATCTTTTTATGCATTATCAATATGATATTGATTTGCATATGCAACTCCCCCGTAAATTCAAAATTTTCCCTCAGTGCATTGAAGCCACCACCA 
AACTTGTTGCTTGCCTGCGCCCTTCATTAATCGTGAAGCTGGTTGGGAAAACACAACTGTGTGTGCGGTGATGGCATTTGATGTGATCCTGAACC 
TGCGGGCTCTGACTGACTCCCCTGGGTGGTGGTGAGCACAGCGCAGGTTTTTGGTTGGAATTTGATTATGCACCAGACCAGAGCCAGTGTTTGCA 
GCAACAAGAGCTCCCCGTACCATTCCCATTACCATTTCCTATTCCCATTCCCATTCCCATTCCCTGGTTGCATCTGCGCAAAAGCGTCGAGTGCA 
ACTTGTCATTGTTTATCTGCTGTGGCAGTGCCAAACGATAGCTTCGGCTTCTGCTTCTGCTTCTGCTCCAGCTTCTGATTCTGATTCCGCTTCCG 
CTGGATGGTGGTGGAGCTGAGATGTTCCTGCTGCAGCTGCCTACAATGCGGCTTGCTTGTGCCATTTCCATTTCCATTTTCTTTGCCTGCCCAAA 
TGTGCGCAACTGCAGCTTATCTAAAAAAAAAAAAAACGAAGGGGCACAAGAAGGGTTTGGCGAAGCCGGGGTTCCATGTCTTGCCATTCACCACT 
GCTTCTATTTAGGGTTCTTCTCTGCCACTGAAGAGGCGGCCATAGAAGTGCCCGCGTCCAAAGTTCTCAGCTGCTGCTAATTAAAAGAGCCTGCC 
AGAGACAAGATACAGATACATGCGGGATCCACTGCGAAGATCGTGCACAGCAAGAAAAACTGGAACCTTTATGCTTTACATATGATTTAAAGCAA 
AG AAG AAG AAGT T C TT TC AT T TG T AT T C AT T T A C A T T AC T AT T T T T AC C AT ATT GT TAT T AGACT C G GT C ATT AT AT AT T TT T T C AG AT ACT GG A 
ATCATTTGATTGCAATCTTTTTTCGAACAATTGCCGCAGTGTGGCTTAGTAAGTGATATCACAGAGAGGCATCACCCAAGACACAGACAAAACAC 
GAGCTGCAACTGCAACTGCAACTGCAACTGTAACTGCATCTGCAGCAACATCAGCCCCAGCAACATCGCAAGCATCCATCGCAAGGGGGCAGCGC 
TTGGGTGGCAGCCTGTTAAATGCATTTGCTGCCCTGACCCCTATATATTGTGACTATCAGGCCATTTGGCGGAAATTTAAATTTAATTAGAGCGC 
TCTGCCTCTGTCCAAGATGTCCGAGAACGGAAAGCACAAACTCCGGGACTCCGGGATCGTAGTATCGCAATCGTATCGGTTCGTTAGGACGGTCG 
T T GGGT CT T TGAT GGC GCT C AAT TG AGG AAT T G AAC AAAGGAAT G AAG AAT AT AC AT AT T AG AAG ACT T C GAT AAT AT TT TTG AT AC TATAC TGG 
AATGGAAAACGACAACGGATGCGTGGGAATACGCAGTCGATGCTATAAACACATATATTCATTTGTTTTCTTTATGTCTCTTTATTTCTGTTTAA 
AGGGAAGATATTCGAGATGTTCATTGTCCCACGGATTTATGATGGCAGATAGTATCTATGTATATTCTGTATGGGGGTGTGGTGCGCATTATAGC 
TGTCATTTTACTATTGTCGTATAGTTTATTTGCATACTTATACCGATTGTTTCATACGTTTCTTTTCGCTGATGAATCGCTGCTGGTGGTGGCAT 
G ACG C AGT T CC C AT CC AT ACT GG C AAT AAAT AT AT AT T CATC TAT GGCC AG AC AAT CGGAAAAAG AG AGCG CCG AAAAGT ACG C AC T T GAGTC AC 
TAAAAAAAATATTATGTGTGATTGAAAGAAATGGGGGAAAGGGTTGCAGAACATAGACATATGATATATGCACGCATAGGTTTCTCAAGGGGTGC 
GCACAACACTTTAGAGAAGGAAATTTTTAAATTTCGAAATTGTCTGACACTCAGACAATTGAGTTATGACAGCCGGCTGAGTGTTTAGTTCTATT 
AAAGTGCCTCCGCAGATGCTCCAACTTGATATGGGATACCTAGATCCATAGATAGATACAAATTCGGATATTTCCGAGCCAGAGTTTGCCTGAGT 
TTCGCCCGTTTGCAGAGTTCGGCCAATCAATTTGCCTCAACAAATTTGCAGCACAAATAGAAACGCTTAGATATTTAACCAAATGGTCAATTGTC 
ACGCCAATTTAAGACGCCAAAGTCTCCTTTGGCGACTGCCGCTTGACGCTGTGATATTTTCCCCATTCAAGCCGATTTGTTTGTTTTATGCCGCG 
AGCCAAAACCAAACTGACCATGAAACAAACAGCCAGTTAGCTCCGAGCGGATAGTTCAATACGAAATATTTCAAAAAAAATATATATGTCTCCGA 
CTTCTCTGCGCCACGCTTGACCGCTTCGAAGGGTATATATTTAGTTCAAGTGATTTGTAAAATGGAAAAAATACAATACAAAAAATACAACATTG 
TGAAGCCGGCGATTTTCTATTTACATTTAGGTGATGGATAGCTTGGGCTAGAGAAACCCCTGGATAAACCACAATTAATTACGATCCAGCGACAG 
TCGATCGATTGCCGACTTTGACTTTGGCTCCGATATTATCTCGACTTCGTCTCGAGTTTTTTTTTGAAAAAAATTCTGGGCATTTGAGTTTTTCA 
ACTTCAACTGTCGGGTATAGTTTTTCAAACAAATACGCACATGGGTGTTGGTTTCCCCTCCTCGATCGGTTCTCGGTTGTAGTTGCGTTTTTCGC 
TGGAAATCATTGGGGGAGAGTGCTTTCTTTTTTTGAGCGATGCGTGCCTACAAAGGCGCCCCACGAGTTCAATGACCATGGTCAGCCAAGATCGC 
TTGTATGTAAGTAAGTATGTCCCATGGGAAGGGGGTGAACCATTGTGTGTATATTGTACATACGAATATGATGTATAGTGTGCGCATATGTTGGG 
CATAAAGCTGCTTTATCCTGGGGATGCGGTCTAGGGCATTTCACTGGGCTTTTGTACTCTGTATTTCTGTATTGTGCGCTCGCAGCAATTCATGC 
ACATTCATTGTTTGCATAGCCAGATGTTTAAGACCGAAAAAATGTCTCGTTAATTGACTCTTACTTAACTCTCCCACTCCCACTGAAAATTGCAA 
TTAGATGACTTAATGGGCGCACGCGGCCAACGGGCTGTTTGCAAAATAGCTGAGTTAATTGGAATTATTCCGATATCAACCCACCTTGCGCAAAG 
TTTTCCTTTTGGCTTAATTGTTTATTTACACCTAGAAAGATAAGAGAATTTGTGGCTGTTTCGCTGTTTACGATCTGGAGAATGTGGAGTGAGAA 
CAATAAAGTATTCCACAGTCTATATATAAAGCTCAATTTATGGATATGTACTCCCCAAATACACCCATTCTTTTCACTGAATATTTGCATAGCCG 
ATAAAGAAGGCCCCAAGTCTGACACATCGGACAACTGACAAAGCCAGGGAGGAGCACAGTCCATGGCGAATGGAGCATCTTCAGGGCGGAGTAGT 
GGAGATTAAGTTAATTACAATTTCGCGCTGCTCTGGCAAAGTCAGCCAAGCTGGCCAAGCTGTACACGCTGGCAACCTGTAACGTGCAACGTGCA 
ATAAGCAACTTGCAACATTGTAGCGATTGCCACTGAATACTTTGTGCTGAACTTGATCGCTGGGCCAGGGCAGTGTGGAAATGCCTATGCCATCG 
ATGGCTATATATGTATGTATGTACGTTGACCAACTTTTGAAGACACCAAACAATACCAAGTGCAAGACGGTATACCCATTCTCCCATTCATTCAT 
AGGTCGTCGACGTCTGGCACCCACGTAGCAAAACAAAGCAAAAAGCGAAACTAAATCGTGATTTATGATGATTATATTTGAATTGCTCATTGTCT 
CTCACACCCCAAACATCCCAGATTCGCCTTTAGATCGTCCAAACATTTGGCTCTAACGAACGATTTTATTTTTTTTTTTATCTCATTGCAAATAA 
ATAAAAATCAAAATTGGGACTCACTGGAAGTGATGGTGGCCTGACCATGACCATGTTGCAGCTGCTGCAAATGCAGTTTTCCATCGAGCAGCGAT 
AGTTGGAAAAGCCCCCTACCACCAAAAAAAAAAAAAGGGAGCTCTAGTGCAGTTTCACTTTGACTCTCCCTTCGGGGTTTAAGTTCCTCGTGGCG 
TTGCATTGGGGGGTCTGGGTCTGTAGGGCGCCGGGAAAGTTAATTAATTTTTCAGCGAAAGTTGCCTTAATTAATGCACATCAGAGACTGCATTT 
TGGGGTCATCTACTAAACTCGCCAAGCATAAACCTTAATCCACTACGGCCAGCAAGCAACCCCTTAAAGTTCGTATTACCAAAGTCCTGGCATTT 
GCTATTAGCAGCAGCGTTTTAAATTACAGTTTCACTTTTTAATTAGACCCCGCCCGGGCCAGACGTCTATGCAAAGTCGAATAGAGAGCACCGGA 
GGCAGCAGCTCATGGAAAGGAAATTTTAATTAAACAGTCTGCGAGAGGGTTGCCGCAAACAGTAACAAAAAAAACCTAAATAAAAACCAGCACAT 
CGCACAACGAACTCCAATGGTCGGAGTCACACACAACTGCGAGTGCTGGTCAGTTGCGAAAAATGTTTCGAGGGCTTGGCCAGGAAAAGGAGGCG 
GGCCCATCCTTTTCAGAGGGTAAAATACAAAAAACACACTATTACTCGTACTAGACTGTACTGGAAAACTTGAAATACAAAGTGCTGCCAAAGTC 
CAAACTCATCAATGATTTGTATAGTAAAGACTGCATTCCTTAAAATACACAATTGTGAGTAAACCTCTAGAGCAAGCTCACAAGTAAAATGCTTA 
GAAAATCGCTGTTGTATTATGCTCATAATATATATATATATGCTCAGTAAAAGCATAGAACACAAATAACTAAACTCAACTGCAAATGTTAAGGA 
ACTCGATCGATACATTTTTCACCATCTTTGTTTGGGTATTTCAATTTCAGTTTTTCGCAGTGTAAGAGCCAGCGATGAACTAAGTCATTTGCAGA 
CCTGACTGAGGGATTCGAATAGGGCACGGCATTCAGCTTTCGGCTCAGATCGGTTTGGCTCGCTTTGGCCGGGGTTCCTGGTTTAATAGCTTCGC 
TCTTGGCATTGAAATATTTATCATGTCGTGCTAATATTGCCCAGAACCTCGAGGACTGGACACGCTCGATTGGACACTTGGATAGGGCGCAGCTA 
GCTCTGGAGGAGCTGGCCAAAGTAAAAGCCAGGTCGCTTGCCTGGCTGCCTGGTTGGTTAGTTGTAGTCAGTCCGTCTATCTATGTATTCCAGAG 
ACTCCGAATCCCTTCCCCCTTTCGACTGCTTCTTGCGTTGCCCCTGTCATCATTGACACGCGAATTGCAAAAGTTAATTTAAGTAGTTTTGGCCA 
AAAGACGACAACAAGTCAGGGAAACCAAACTGCACTGCACTCCTGAAGGCTCTGAGCTTCGGCTTTGGCTAGTCATGGGCAGATTTATATGGCCA 
TTTGGCTAATGAATTGCGGCCAGAGGCGAAGGAAGAAATAAGAGCTGATCCATGGGCGCATATAAATCATGAGAACTCAACTTCGTTGCTGCCTT 
TAAAAACTTCGATTTCATTCCTGAGAAGCCAGATCATCGATGAGCTCGTTTTTTTTTTTTTTGTTTTTTCCAGCCATCAGCATATTTTGTTTGGG 
CCCGCATTGGTTGTTGTTGTTTTTTTTTTTCGAATGCCATTTGCTTATTTTTGTTGGCCATCTCGTCGCACGCCATGTGAATTATGTGGGAGTTT 
TGTTGGCCCTGCGCAAACATTTCAATTCGAATCGTTTGTAATTATTGCGCAAATGTGACAAGACAAACAGTTTGTTGAACACATGTCTGTGAGAA 
AGGCGAAACCATCGAGTAAAACAGAAGAAACAACCTAAATGGATGGCAAGGTCACAAACAGCAGCGAGCTGAAATCCCTTCGAAAAGGTTTCTTT 
TTTCGTGGCTTTATGGCCGAAACAAAAATAATGAATGCGGGCCCCAAAGGAAATGCCCAGAATATACATGTACTGAGTCATCGCACCGTCAAAAG 
ATCTTGCAAAATAGGAAATAAGCATTAAGATCGCACTGCTAAACAAAGCGAGTCCCCACAAATTAAGGCAACGTCAAGCGGCGTGAAATGAAAAG 
GCAAAATTCGAAATAAATTCACGAGACGAATCGCCGCTAAAGTTGAGCAAACGAAACCAGCCCCAAACACCTCTCATAAACAAAACATATTTATA 
TTTATATTTATATCGAGAATATTCTCCGAGCTGCATCCCATCTGATCTGATCCAATCCATCCACCGGGCAAGTAGCACATCGTGCATTTATTACA 
TTTTTGACTCACAGAAATAATCAACGTGAAATATTTGGCACACATTCAACCAAGAGGAGAAAAAAAGCGATTGGGGCACGCGACGTTTGGCGGGG 
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GCCTGGCGTCCCGATCAATAAAAATAGCCACTCGCAATCCGCGGCAAGACGTGGGCGAGCAACATCATCATCATCATCATCTCTAGTATCCGAAA 
CCGAATCGAAAACAACATCATCATCGCCGGATGATGGCCCGATGGAATGCCATCTATGCCACAGTTAAGAGGCAAACAAAGTGCAAGGACGTCCC 
AAGGGGCCCGTACTCATCCCTCCCCGTTTGGCTGGATATAAAAATATCTTCTTCTCGGTATCTTTGGCCACTCCACTCTCCAGAGGTGAATGAAT 
CCGAAAATATTTCTGCACGCAAGTTGTGCGCGTAATTGCACGTTGCTGCACCTCGCTCTCGGTTATGGGAGTCACTCAGTCGCAGCCGTGGAGTA 
GGAGCCATCCAGATTGGGATCCAGACTCTGTGGGCTGTGCGACGGACGGCATCTTCTTCTGGCTTAACCCACATACAGGCAGGCAAACTTCCCTT 
TGCAACACATTCCTAGCGCTTTTACTGCGACATGCTGATTGGCGGCAGCATCCCTGCCGCAGTATCTAACTGCATCTAAATGGAGTTGTGGGCTT 
CAGTTCTTTGAACCAGCCAGCCTAGGAGCACTAGGATCCACGAAAGTAGTGAATATGACAGTATTACCGCACTTCTGGAAAGTAACTAACTGTTA 
CTTTCGGTAACGAGTAACGAGTGACCAAAACTAAATATGTCAAAGAACGTATATCAATCTTAAAAGTGCATAGTATTCATAGTATTCATAGTACT 
CAGTTTGTGGAGAAGAACTATCAACTATTTTGCATACTACTACACTGAAATAAATACACAGTCAGGTTTCCCCTTTTCGCACAAAAATGTCGCAA 
TTTATGGGCTCTAATATGAAATTCATTGCTTTTCTCGCCCGCTCCGCATTTTTCGCCGACTTGGCTCCTCGAAAATGCAGCAACTCGTTAGACGC 
ATTAGGGCCTGGGACATGGGACATCAGTTCACTTCCCTGCGGATGACATATGGCCAAGTGGGGATCTGCTTGCCTCCCAGCTCACATACCTGTGG 
TGGTGGCGGCGCGTCTTATTAGAATTTATGATTCCCATCTTTTTCTTTTTTGCTTTCTTCGGCGCCGTCGTTTCGTGGACTTTGGTTTACTTGCT 
TTGCTTTGCTGGCTGGCTGGCTGCCTTTGCCCAGCTGGCATTTACTCTTAGTTTATTATTTATCAGGCACTTTATGGAGCCCCGAACAGCACACG 
TATATATTTGCATTTATAATTTCAAATTGTTTATGGCCAAGCGTGAAAATGCCTCACAAGACAACAAGAAAATGGCAAAACTGCAAGCTTAAAAC 
AAAAAAAAAAAATCAGAAAAGAAAAGTGGGCAGCAACAAATCTTTTCCTAATGCATGAAATTTAATGTGCATGACGCATCCGACACATGAACTAA 
AGAACTGAATATATTTACAACTTATACGAGAGTCAACTAATTGAATTCGATTATTCAGCAGGAAGCACTTGTAGTGGATAAGCCGCGGATTTGCA 
ATAAATTTCCCACCCGCTCAGCAGTTTATGAAGATAAATACACAAAACAGCTGCGATCAAAATCAAAGCAGCGTTTGTATAAAATAAAAAAAAAT 
TGAAAAAAAATTAAATGTTTGTACAACTAATGAACCTTCAATTGGCTTAATTATGAAAACATTTGGTTAATGCCTGAAGATTAATTAATGACAAA 
TTTGGGCGGTCGTATCGCAATTAATGGGACAGCTGATCTAATAAATTACCTGTTCCTTGATTTTGCAATCATGGGCATGTGCAAATAAATACCCA 
ATACGTTGCGCTTATCAATGAAAATCGTATTTCGCAATACACCACTGCAATGACTAGAAGAATTCTATTTCACAGCAATAAATCTATTATTAATG 
AATGAAACGCTACGCTGGATGCCCACGTAGGCGAAACAAAGCACAAATATTTGCATATTTCAAAATCATATTTACGAGCTCGACGACTTCAATCA 
ATTAACGCATTAACATGGTTTTATTATGTATCGCATTCAAATTGGGCAAATCGGCGGTAGCCAAAGTGCGAAAAAACTTGTTCTCGGCGCCATTA 
CCAAAAAAATATAAAAAAAAATAAAACAGCGAACTACACAAATGTTGCCAACGTGGGCGATCGATCGATCCCAAAGTGAGTAAGTATTATAACTT 
AATTAGCTGGCTAGCGATTTGCACAGTTGCTTGAATGGAAATCAAATATTTTCGTTGCTCATTGTGAAGTGTTTACGGTAAATATTTAGCTGTGT 
ATATATTTTTGGCCCATTGCATAACTTGTTTTGTTTGCAATTTCTTTCAAAGAGTGCAAAATTCATTTGGGCTTTTGTTATTGAAATCGCCGCCG 
GTTGGTTTGATTAGTGCGTTAAGCACTTTTACTTTTTTCGAACCAAAGACAAAGCCTTTGGATATGCAAATGGGATTTCAATTTCTCACATTTAT 
AGGCCCGAGGCCCTAACTAAGCAAACTTTGTTTTTAGCTTGGACATTAAGTATTTTTAATCAAGCCCAAAGGTAAGATGCATTATAACATTCGAG 
GGAAATTAATTTGGGGCGCATCAGGGTTCGTCTATAACAAATAAGCGCTTTTTCTGCTGAATATGTTCGGTTTCGCTGTAAACATTTGCATGCAC 
ATCAAAAGTAACTGTAACCGAATGTTTGCCTGGCCAGCAAAAAAGTAAGAAACAACCAGCAGACGAGGCAGGCAACAACAACAACAACAGCAGCA 
ACAAAAGATCAAAATCAATTGAAAAATTGTTTGGCGAAATGGTAAGACCAAATCGAAGATGGAAAATGTGTGTGCAGCAGGCGGCAGCAACGGGA 
CAAAAGGAGGGAAAAACGAGTGGCCAGCGGAAGAAACAAACCAACCACTCCCGGACCAATGCAGATGTTGCAGTTGCAGCAACTGCTGCTGCTGC 
TGCTGCTGCTGCTGCTGCTGATGTTGCTGTTGCAGTTTCTAAATGCCGCCGCGCACCAATTCACCGACAGTGAGGCAGGCACAGGCAGCGAGAAT 
GCAAACTTTGCATAATTTATGGCAATCAATTTCGCGTGAACATCAACACGGTTGCAACGCAAAGAGATACAAAGCCGTGCCAAAAGCAAAACGCA 
AAGATGGCGAAAAAGAAAAGAAGAAATTGTGGCAGCAACAACAGCAAATGCAACAGCAACGGCAACTGCATGCCATCCGTCTATCGAATGCCACT 
ATCAACTTATTTAGTTCTACTTAAATGTTTACTTTCATCAATTCGCAACGAATGCGACTGCCAGCGGTGCAATGTCAATTGGCTGCGGATCGAAT 
CGGATCGGCTTGGATTGTCTGATGTGGCTTTTTGGTCACAAAACTGTAGCTGGTCAAGAGCGACTAAGACTCTCTAGTCGCCTCAACAGGATGCC 
CGCCATCGCAGCTAC AGATAC AT T C ACTT T GCCAACTCCGTCGCTTAGTGG AAC ACTTTTCGC AAAAACAAAAAAAAAACAAAAAAGGAT GAAAA 
AAAATCATGCAAAGGCAGCCGGGGAGTACGATTTCGCCTGGCCCTAATCTAATTGCCATCGCCTAACCCAATTTCCCAGTTCTTCAATTGCAACT 
TAGAACTTTCACTCGCTGCCTTGGCCAGCGCGAATCAACCAACCACAATTGTTTTGCACTTGCAAGAGGATGGCTTTGAGGTCTCCTCGCTCAAA 
ATACATATCATTTTACATAATAAATTTAATGCCGCACAATCCGATCTTCTTTGTCTGCAATTTGCTCACATAGGAACCCCTCATTGTGTCTACTG 
TAGTTATCAAATAAATACTAAGCACTTTAGTCAAACTCTACTCTACTTCCATTCTGCTTGAGTGCCGTCTACACTGGCGTTATGCATTCACCGTT 
GCTTTTGGCCCACTTACTAACAACACTTACTTACGAAAACTGAAAACTGAAAAAGAAAAAACCAAACTTGAACGAAAAAACCAAGTTTTTCCTGA 
TTTATGTCCGAACAAATTGTGTGTTGGCTTGGCGTTTTCCGTATGCGACATGATTTAGCTAAATTATGTTGTGATCTTTTGATATGTACAATGAA 
AACGAATGAAAACGGTTCCTCTCATCTCCGATTAAAATCGGGAATATGCAGAGGTCGTTATTAATTTTCCCACTTATGGACGCTTTCCCGCCTGA 
TTAACCCCATCTGTGTGAGGTTAAAATCCGCCAGCTTCTATGCATCTTGCAGCTAAACCGCATCGTTTACAGTTGGTATCCGTTGCACGCAACAG 
ATGGAAATAGATGGGAAATTGTGAAAGAGTTTTAATCAAAGTGGGAACCGCTTGCTGCTCCTCCGTTTGATATCTTTCTCGCGACTGCCTTGACA 
CGCAATTGAGCAGCAGTTCCCACCCAGGGCCAAAAATAAATCTCGAAGAGATTCTTCATTTCGAAATATATGTATCTTTGGGAGGGAGCAGAAAT 
TGGGGCCAAAAGCGTGGCAGTCGAGATGAAAATCCATGCGTGCACTTGACGCCGCTGAACCCACAAATTGGTCGTGTCATTAGGCAGATATTGAT 
TTGAATTTATTGGCAATTCAATTAACTCTAACCGAATGATCGGGGGCTGGCTTCAATTATTTTCCTTCTATTTTTTATCGATATGAAGGGAAGGG 
GTGAAGTTTTTTCTACTGCTGCTGCAGACATTAATTGTCTGGCAGCTGGCGCATTACAATGCGGAAATGCAGTTGAATCGAATGGATCCGAATGG 
AATGGAATCGAACGCACTTCACACGCAAATAAACAGTCCGTTTGTTGGTTTGGTCTGGAAGTTATCTTCGGGATGCACGGCAGCCATATATTCTC 
CATCGAGGGGTGAAGTGAACTGGGCATATTGAATGGGGGGGGCAACTGCGAACTGGGAATGGCGAGCAGTTATGCAAATCAGCAGGTAGTCTCCC 
CCGGAAAGGTAGAGGAACAGAACCATACATAGGTCGGTCATTCACCGCTAGATTCACAAGTATCCATGAGATACGCGCAACTCGCATGCAATTTG 
CGGCATTAGAATATGCGCACATGAATAGATAGAAAGGATCCTTGTGTTGTGTTGACCACGAATGAATGGAGCAGATTTCCATTTAACCGCAGCTC 
ATTGTGCACATTCATTAGACAACTGAAATATAATTGCAAGTAAGGTTGCCATTGGTGCTGTTGGGAAAGCTGGGTGAAAATAATGGCAACAAATA 
ACCGAATAACGCCCGTGAAACACAATAAACATTCACGAAAGAAATGTGTAAATATCACTTAAAAACGAAAATGAAAGTGAAACCGTTTTCATTCA 
T CC AT T C AG AC AT AT G AAT CT G AAAAAAT AG AT G AG AAAG T T C AA T AT AAC T C GGGT GAAA GAGTTGTCGTATCATT T AG AAAAGC T T GGT GT T C 
ACATTGATTTCAATTTCCACTACCATGTCCCCCCCCATCGATCTATGCAGCTAACTAACCCATCATCTAGATGGCCGTGTACATTGTACCAGTAC 
CAATGTCACTGTTGATTGCATAATCACCTGACCGTCACTAATTCCTACGTAATTGGAAAGCCATAATGTCAGAGTCTTTGTGCCGGTCGTTTGGG 
CCCTCCTCCCATCGCATCATGCATCGCATCGCATTGCATCGCCTCATATGCCATGCACCACATCTGTTGCAACTTATTAATTTTTGGGGCCAACA 
ACCGTTCCGAGCTGTAAGCGAAATGCCCAAATAGCAACCCGCAACAAAGCAGCTTACACTCTTAAGTGTTGGGCCGAGGCTAAAACTAAACGCTA 
TTAAATTTTACACACGGTTTCACTGCAGTTCGAATGCGTTTTTCATGAGCTCTCCGCTCGGAATTTTTTTTTATAGCTCCCAGCCAGCTAGCTAA 
GCTGCAGTGGCAATGGGTTTGGATTTTTGAGTCAGCTCTGGCTCTGGTCTGGGCCATTGTTTTGCAATTTTGTTTCATTTTGGCCCAGAGAGAAG 
GGGGGGGGGGGGTGGGATGATGCGGCGGCGGGGCGGCATGATGATGATAATGAATGATGAGCCGGCCGAACAACCGCCACAATACCTATTGAGTA 
ATTACAAAAATTTTAAAGTTTAATTAAAATTGTGTAAATAATCGCATTTGTTTGGCTTTGTTACCGTTGGCGGGTGTTACCGTTGCGAAAACGAA 
AGTGCCATCAAATTATGATGCTCAATTGTGGGAACAACAGCTGCCTTTTGACAAGCTCACCGCCTGCCTGCCACTCGATGAATTTCAAATGCTTG 
TCATTGAGGCGGCAAAGTATCTGCATGGATCCTATGACTAATGGCAGGGAGTGAAAATCATCAAACAGATTGATCACTTCATTTAACAGTGTACC 
AAAATATTTCTTATCTGAGACATGCGCCCACGAGCAAGATACTTTTCAGCGAGATACGTGTCAATGATTTAAAAATAAAAAAATGAAACCAATTG 
TAAAAACTAAAGAATTGATTCTTATGCTTATTCAAGTGCGGAATTGTCACTTACCTTGTAAACCTCCGAGAAGAATCCCGATCCAATTTTCTCCT 
TGACGAAGTCATCCACGGAGTATAATGCTGAGACGGCAGTGCGCAGCGCCCGACACGAGCTGCCGGTGACCAGGCGATCCGATGATATGGTCCGC 
GTCACGGGCGCCGGCGGAAGACCATTTCCGATCCAGCCGCCGTTGTGGAGCGGCTGATGTTTCACTGGTGCTTCCGCGGTGGCGTCTCCATTGGT 
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CGATGTTGTGACGGTGGTGGCGGTGGTGCACATACCCGCTGTGCTCGGCCAGATGAAAGGGGAGGCGGCCGATGAGTCCACCTGGTCCACCGATA 
ATCCGCAGATCTCCGACGGCGGGCCGGGTCCACTGTGCTGATTCAGGCTCATGTTTATCAGCTACGACGTATGTTGCTGTCGTAACCGGATGTGT 
ATGTGAAGTTGGTTTAAGTTTGTTTTTTTTTGGCGTTGGACGACCTCGCCTTGCCTGCTTGTTGTGGGGGTGATGGATCTCTGGGGATCGAGTTG 
GGAACTGTGTGGTTTGGCTAATGCTGATGTTCTCTTGGCTACACCATGGTGTCGTTCATCGCAAGACAATAGTTAGCGGATTGACGCGACAGTTA 
GACGGGCACGATTTGTTCAACGGATGATATATTTTAGTGTTTGCTTATGGAACGCAGCACGAAACATTTATGGGACCCATAGGTCTGAATTTGAC 
AATTTTAAAAATTTTGAAACAGGAAAAAAAATTGTGGAGAACTAACTATCAATCACTTTTTGCTTGCTTCGTAATAGAGAAACTTGTGTATTTAG 
TTGTTGGATATTTGGAATGCATTTGATTGTTCTTCGATTCTTGGCACGTTGCAATTGTGTTGGAAGCTGCACTGGAAGCGCTCGCAGTTGGAAGT 
CTGAAAGAAATGCAGGCAATAGTTAAAAAGTTAATAAAATTCAATCAAACTCGTCGCTGCTCTAATGAAATGTAAACAAATGCGTCTATAGATAG 
GTATATATTTGCTCGAAAAAACAGCGAAGCGGGCCGCGTGATCAACATTTCTGTGTTTATTCCTTTTTTATGAGGTGCAAATTAAAATACTTCAA 
AGAATTCGCCTTATTTTCTGTTGTCACGCGCCACTCAAAGTAATATCCCAGTGTCAGCAATAAATTCTAGAATATAGGAACATAAGATATTTAAA 
AAGTATAAAAGGGGCAGAAATCAAGACAATGCAAATTTATTCAGGCCTAATAAGAGCTAAAAGTTTAATATTTTAAATGGTGTAGATAGGAATCT 
GATTCTTACTCGGAAGAAGCTGATAAGTTTTTTCGAAGAAACTTAAGCCTCTTCATCACGAAATCGTTTCTTCATTTAATACTCAAACAGCGGCC 
TTATTTATTTATCATTCCAGTTTTTCGTAGACATTTTGCATTAGCACATGGCGAAAAAAAACATCGCAAGCTAACAAAACTAACAAGAATGAATT 
AATCGACCCCGCTGGAAATTGCTAATAAATATTCAGGGTCAAAGTCGACAAAATTGGCAAAGTTTTGCTTATCAGCGGGCTCTTGCGGTGTGATA 
GCCGCTTAATATATGTACAGGTATTTGTTGCCGTTTGATAAGGCGGCCTTTTTAAGTGCGATAAAGCACACAACCCATCAAATGTCCCCAAAGCC 
AGCAGACTTCATTCAGCATTTGCTTTTTTTTCGAACTAACAATGTAATTTAGCAGCAATTTTGTTATCATTACGCTTACTTTTGTCGTCGTCGTT 
TGTTTTTCGTCGTTGTCTTGAGAGGGGGGCAAATGAGTTCAGTTCATCAGCAAATAATTGAGATAAACACGATCACTAAAACCTTGGCCTTGCGC 
CTTGAAAACTACGTCTGTTTTGAGTGGCTTCAAGGAGTTAGTTACATTCGGCAAGGGAGGTCGCTGAAGAGTTCCACAATCGGAAAAACACCACC 
ACCCACCGCGAAGTACTCAGTTTCAGTTTTTTGAGATAAGTTCGGAAATGATATAACCAAATATTTTGCTGCTTGCTATTAGACCGGCGAACAAT 
GGC AAC AG GTT GG AAT T T T T T GAT TTGCCATTCCT T AT C GCAAT AACC AC AT AT AT AT T C GG C G AT AAG AT AC T G AG AAAT AGC T TTT GGGCG AC 
ACAACAAAATTGAATTTGGCTCGTAAAGGATCATGCGAGTGGCCAAAAGGCGATCATGCCCACAGAGATAACAGCTCAAAGCCACAAAGCTCACA 
ATTAAAACTATAACAAAATAATTACACCAAATTTCGACATAATGATAATGATAGCAGCGAACGGACACAGAGCTCTCACTTTTCCGCTGCTGGCC 
AAAGTCAAAATAAAACTCAAATTACCAAGCACATTTTTGCACTCGGCAAATAAACGGGGAAGTTATGCGAATTGGGATTTGCTATTTGCTATTTG 
CCGTTTTGGCATTTTGTGTGTGTCTTTCCTTCTCTTGCGAAATCTGTCCAAATAGCCAGTTGAAGTCAAAGCCGGCAGCAGGCCATAATTAGGGT 
CAATTGCGGCTAACCGCCGCCAATTGACGACAACGAAAGCTTTTGGCCAAATGAAGCGGCCGGGGAAAAAGTGCTACACTTCATTAGGTGCCATG 
ACAAACGAAGAGCTGCAGCAAAAAGTGGGCTTGTGGCTTCGACTTAATAGCACCCAAGGATCGTTTTCGCGCCCACCACGCCCCCAGAATACAGT 
TGTCCAATTACAGGCAGGTCTTTGTCAAATTATCTCGTTTTGAGCTGTGAAATGAATACGTGCTATCTTCACCAGTCAGCTGTGAAAGTTTCCAT 
ACACCCTAATGAGTGGCAAACTTAAATATATTTTTAAAAAAAAATTGTAAACATTAAACTCGGCTATCATGGTGAAGAGTTAGCTGGGGAGAGAG 
T GC CT AT C ATCG ATG AAG AGG AAAATCC AT T T C GTGGC C C AC AAAAAT GAGT T T AGC AC GCT C CGC G AAAC T AAGT C AAGGC CC C AAGC AAC GAT 
TTTCGTTACCGATTTGGGTGAGCAAATCTATCGAGTCGAGAAAAACACTTAAGCGGTCGCTTTTATTTCGACTCTTGCATACATTTATTTTCCCA 
TCACGTTTCAAACTATATACACATGTATTCCGCGTGGAAAGCCGCAGCAGTAAAGCATCAAAATATTTATGTTTCAATAGGAATGGCAGCCAAAC 
AAAT GTCAGTCCGCATCCGTGGTTGTT GCC AAAC AT AT GG A AAAT AG AT GAT G AGCC C C AC AAC AC C GG AATG C TGC AAT C T T AAAAG T C A GC AG 
TTCCACGAAGATTAAATGCCCTAAAGGCATTGAAGCAGCTGAATATAATTTAAAAATCTAAAAAAAAAAAAGGCGGAACACGACGTGCAATAAAT 
TTAGGCAATTTTAGAAAATACATCTTGTGGTGTCGCAAGAACTCAAAACTAAAGAAATCACGTTCAAAACAGCAGATAATAGGTATTTCTACATC 
AACAAGTTGACTTAGGCAAGAAAGTTTCTATATCTGAAAGTTGGTGGGATTCTGCACCGACTTAAGAACAACTTTTCTCCAAATATTAACTCTAC 
TAATCGTTCAAAATCGAGGCTATTTAATTAATACATTTCCTAATATGTTTGCTAGAGAGAAACCAATGAAATTAATTTCAGCACACCACTTATCG 
CCGTTTGCTTTCGATGGCGTTTATGGCATTTTCCAAAACGAATGTTAAACCGTTAACCGCCACAAGGTTCGAGTCAATAAACCGCTGTCGACTGC 
GGTGACTTTGTTGCCGTGCTGTGAGATCAAGTTGCATTTTTATGGGGCCAAGGGTCGTCGCGGCCACCACCGTCTGCATCATCATCATCATCAGC 
AGCAGCATCATCGTGGTATCTTCGCCTACGTAGACGCGTGCCAGGAGAATTGTGTGCAATGCACACTGCATCGTTTCAGCTGGGCCATTAAAGAG 
GCATAAATCTTGGTTATGAAATTGGAAATGATTTGCTGTTCAGGCACATTCAAGTAGGGAGCAAGTAACGCACCGAGAAGTGATGCGTTCAGGAA 
GCGTCTTGCGTTTAGAGTCCGAAAAAAGCGTTAATGAAAAGGAAAGTTGGTCAGCGGCAGAAAAAATGCCACTTCATTGCCGTGTAGAAAGAAAA 
ACAAAGACGAGGAGACGGCGAAAAGAAGCCAAGGCAGCTCAAGAAGAAGAAATTTCTATAAACAAAGAGATGAGACGCCGAGGCAGAGCAAAATT 
ATCAAAGAACAACAAAAACCGAGAAACACAAAAAAAAAAGAAGTGGAAAATGGGACATGGCACACTCTTGGGCCCACAGAAACAACAAAGACTGA 
ATGAAATTCATGTCGTTGTCGCGCTCATAAATTCTTCTACGCCTATATATCGCTGGCTATCTGAGAGATACAGATACACCCACACTTGGAAAAAC 
TACCAACTCATTCAGCCGGGGAGGTGCTGGCTGCATCCGCTCTCTACCTATATGTATGCCTACTTATGAAAAAGAATCTATGTATGTAAATTTGG 
TGTATTTTGAATAAATTCCAAGCCCATCGAAGGTGTGAGGTGCCAGTTATCGTTAGTTTTTTATTCTCTTGAGTTTAAAGTTTAGACTCCAGAGG 
CCGTACAACAAGTGATTCACGTAACGAAATGGAAAGCGAGGAGAAATATGTGTGTATACAAATCTACCAAATAAATTCAAAGGCAATGGAATTTT 
CGACTGAAAGCATAAGATAAGCTTAATTAGCCGGGCAGACAAAAATTTCACTTTCCTTTGGTACAGAACCGTCGTTTTCTAAACTATTTCTGAAT 
GTCCAACGATCACTGCGCCATTTAGACATTTCGACTAATTGGCAGCCAGTCGGCTTTCGGGGGGAGGGCACTCACGTGCCGCCCATCGATCTGCA 
GCAAAACTATACGTAAATAAACTTTAGGATCGAAGAAATCCAACAAGAACGTCAACAAAAACGAAAACCAAATGGGTTTATTTTCAGCCTCAAGT 
GC AAG T CC T AT AAG AAG AT C G AAC AAAAC AAAAC AC TT GAT G AAT GC AAC C GAAGGAGGTGC AAAAAT C G AAC AA T AA C AAAT A C AA T AGGAAGG 
GCGCGTAGAGTGAGCACAAATCGTTTCGTTTATTTTATTTATTTTATTCCCCTCTATGCGGACACACACAAATGAAAACAAATTAGCTTAGGCAG 
CATTAACAGCCAGCTGCCAAGGGCGGAAGTACGGGGGAGTTTACTGTTAATTTTATAAACACCACGAAAGACCACAACGAAAAAAAAAAAAACAC 
CACAAAAATGCAGCAGTAGCAACAGCCAACAACCACCAAACCGGCAAAATCCCAATAGGGCAGCTGCATTTCGGGAGGAAACCCAAGTGGAATGT 
GCATTCGAATTGGGAGCTATAGCTTCTCGATCCCCGGATAATGAGCCGTGAACATCTGTCGCAGTTTTACGTGGAAAACTGTCTTTCCGCTGAAT 
CGACCGAATCGGAAAAAAAGGGAGAAACTGAGACTAAGGAGAGAGCGAATGCTGGGAAAACCAGCACCAACATCGGAAGCATCTTCATCATAACT 
TGCAAATGTGTTTGTGTTATTTTTACATTTCCGCTGCTTACGAAAATATTTACGCTCCCCTTCGCACAAAACCTTCCCCATTTCGAATTTCAAAT 
TTCGAATTCCAGACGCTCGCACACACGCACGTGTTAGAGATGAGCCAGATTTCCGAGTGCATCGCATGGTGACAACAGGTTCGATTCAAATGAAT 
ATGGAAATGATTCTGCTGAGCTCACAAAAAATTTAAGCATTTGCTTTTCAAATTAGATCATATGGCTATGCTATTCGAAACAGATGTGAAAAAGT 
AACAAGTTTTAGCTCTACAATGTTTACTTTAAATTTAGCATTTACGAAGTATAAATAGACACATTATTTACGCATTATGTTTTATTATATTTAAG 
ACAAAATACAATCAACGTAAGTTAAAATATAAGATTGCAAACCTATTTACCCTCAATTCTAAATGTTTAAATATACAATAGTACATGACAATTTT 
GCATTTCCCGTTACGAAATTATATTTGTGTTTTTATCTCGGTAGAACCTTGAAAACCATGTTGTGTCAAGATGCAGCGAAAGCAGCGGGTGGCAA 
CTTTTGGCAGGCAGTGTGCGAAAGCTAAGGTCAGGCCCATCTCTAGGTTGCTTCCGCACACAGACACGCACGCACACAGCGATGGGGAAATGCCA 
ACGATAGCACAGGTCTAATCCATTTAATTCTGGCTTTTAACCGATTTCTGCTAATACTATTTATTTACCCGTTTGGTTTTGGCGACTGCTCTCTC 
TCTCTCCCTCCGTTCTTATTGCCGTGTGGCCATAATCCAATTATGCTTGTGTTTGTGTGTATGTGTCTGCTCTCTATTGCAATTGCAGTTATTCG 
CATCCGCCGTGATATGCAACCAACACATTCGCACTCGCGCACTCACACGCACCCGCAGACATCACAAATGCGAGGACCGAAAATCCTGAGCGTCA 
ACAAAATAAATATAGCTCTGGTGTTTTAAAATAGGTTTTCACTCGATTCTGCGTTTGCTGTCTCGTCTTGTTTATGTGCATTCGTCGTTGGTTGT 
GTGTACTTGTTTTTTTTTTTTTTTGTTAGACGCGACTGTCCATTCAACATTACCACTCAGCCGACGCAGAGCGAGATTATAATTCGGTAACCGAT 
TCACTGGCACTGCTCTAACGGGCTATAGCCTATGATCCAGTCTGGAATGGAGCTAAAGATATGTTGCTTTCATTTATGTCTGCACTCGCGAAAAA 
GCGTCCACAAATTAACAGTTCTTCGGGAGCGACGCGCAGCTAGACGTCCGACTTTGTTTTCCGACTCGATTAGAGTGACCGTTGTGGGATGGTGG 
AAAAAAGTGGAGGGCCCATGCTGAAACCGATAACAAAAATACCAGATGTGCGATATACACTATCGATAACGAGCACAATTTCTTAATTCTAATTT 
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AAAACAATTATATAACGGGCTTTCGTGTTTTATTTAACAAATAGTGAATAAAATGCGGCTACTTCTTTTTGATGTACTTCATGTACTTCTTGGCG 
TTGAAGTTGTCCTCCACCTCCTCCATGATCTTGACCTTCTTGCGCGGCTTGCGGGCCTCCAATCTCGCTCTGCGCTCCGCATCGGTTAAGGTGCT 
CAGATCCAAGGGAAGCTAGTAGATGGTACTCGTTAATTATGGTTTCGTTAAGTTATAGTAACTAACGTTACCTTGATAATCTTCCTGGGCTCGTG 
ATAGCGGATGTTTATCGTGGATCCATCGGGTTGCACCACAACTGTGGGATATAGCCGGCAGTAAACCGAGCGATGCAGCCTGGTAACCGCCGTCG 
TGGCCGATGAGATGCAGCGTATCTGCTGAACAGCCTGGGGCAACTGTTTCAGCAACATTTTAATTGATAATCACAATATTTACACTAAATAAGAG 
CTACACTTAAATCGTGTTTTTGCTGCCAGGCAGCTGATAGTGTGACCGACCCGAGGGTAAGATTTAAAGACTCCTAAATTAGTTATTAAGAATCT 
GTGGTAGCCCTGGTTCAGTTTCCACATTATGTGGTATTTTCCCCAGCTCTGGTCACATTCGTACTCAAAGTGTAAGTGTTGGTTGCAGAGGCAGA 
AATTCTTATTTTTCTCCTTGAAAATATAAGTACGTTACGTTAAACACTGCAAAAATAATAAAAAAAAAGCGCAGGGTTGGGCTACAACAGCGATT 
AGTCTGCAAGCCTCGAGGAAAGGACTGCAAACGCGTTTTTAGTTTGAAGGAACATTGAGGATTGAAAAAAATTACATAATTTTTGCGAGCATAAC 
TCGCCGTGCAT TAT GTC GAT AAGTCTGCAACT AAA 
(SEQ ID NO: 1435) 

Exon: 49575.. 48994 
Exon: 43605.. 42710 
Exon: 16140.. 16019 
Exon: 5796.. 5598 
Exon: 5538.. 5453 
Exon: 5379.. 5124 
Exon: 5063.. 3295 
Exon: 2755.. 1001 

Start ATG: 43087 (Reverse strand: CAT) 
Transcript No. : CT18533 

GGTCACTCTAATCGAGTCGGAAAACAAAGTCGGACGTCTAGCTGCGCGTCGCTCCCGAAGAACTGTTAATTTGTGGACGCTTTTTCGCGAGTGCA 
GACATAAATGAAAGCAACATATCTTTAGCTCCATTCCAGACTGGATCATAGGCTATAGCCCGTTAGAGCAGTGCCAGTGAATCGGTTACCGAATT 
ATAATCTCGCTCTGCGTCGGCTGAGTGGTAATGTTGAATGGACAGTCGCGTCTAACAAAAAAAAAAAAAAACMGTACACACAACCAACGACGAA 
TGCACATAAACAAGACGAGACAGCAAACGCAGAATCGAGTGAAAACCTATTTTAAAACACCAGAGCTATATTTATTTTGTTGACGCTCAGGATTT 
TCGGTCCTCGCATTTGTGATGTCTGCGGGTGCGTGTGAGTGCGCGAGTGCGAATGTGTTGGTTGCATATCACGGCGGATGCGAATAACTGCAATT 
GCAATAGAGAGCAGACACATACACACAAACACAAGCATAATTGGATTATGGCCACACGGCAATAAGAACGGAGGGAGAGAGAGAGAGCAGTCGCC 
AAAACCAAACGGACTTCCAACTGCGAGCGCTTCCAGTGCAGCTTCCAACACAATTGCAACGTGCCAAGAATCGAAGAACAATCAAATGCATTCCA 
AATATCCAACAACTAAATACACAAGTTTCTCTATTACGAAGCAAGCAAAAAGTGATTGATAGTTAGTTCTCCACAATTTTTTTTCCTGTTTCAAA 
ATTTTTAAAATTGTCAAATTCAGACCTATGGGTCCCATAAATGTTTCGTGCTGCGTTCCATAAGCAAACACTAAAATATATCATCCGTTGAACAA 
AT C GT GC CC GT C T AAC T GT CG C GT C AAT C C GCT AAC T AT T GT C T T GC GAT G AAC G AC ACC AT GGTG T AGCC AAG AG AAC ATC AGC AT T AGC C AAA 
CCACACAGTTCCCAACTCGATCCCCAGAGATCCATCACCCCCACAACAAGCAGGCAAGGCGAGGTCGTCCAACGCCAAAAAAAAACAAACTTAAA 
CCAACTTCACATACACATCCGGTTACGACAGCAACATACGTCGTAGCTGATAAACATGAGCCTGAATCAGCACAGTGGACCCGGCCCGCCGTCGG 
AGATCTGCGGATTATCGGTGGACCAGGTGGACTCATCGGCCGCCTCCCCTTTCATCTGGCCGAGCACAGCGGGTATGTGCACCACCGCCACCACC 
GTCACAACATCGACCAATGGAGACGCCACCGCGGAAGCACCAGTGAAACATCAGCCGCTCCACAACGGCGGCTGGATCGGAAATGGTCTTCCGCC 
GGCGCCCGTGACGCGGACCATATCATCGGATCGCCTGGTCACCGGCAGCTCGTGTCGGGCGCTGCGCACTGCCGTCTCAGCATTATACTCCGTGG 
ATGACTTCGTCAAGGAGAAAATTGGATCGGGATTCTTCTCGGAGGTTTACAAGGTCACACATCGTACAACCGGCCAGGTGATGGTGCTCAAGATG 
AATCAGTTGCGGGCCAATCGACCGAATATGCTGCGGGAGGTGCAGCTGTTGAATAAACTTTCGCATGCGAATATATTGAGCTTCATGGGCGTTTG 
CGTGCAGGAGGGCCAACTGCACGCCCTGACCGAGTACATCAACGGCGGCTCCCTGGAGCAGCTGCTGGCCAACAAGGAGGTGGTGCTCAGCGCCA 
CGCAGAAGATAAGACTGGCGCTGGGTATAGCGCGCGGCATGTCCTACGTTCACGACGCGGGCATCTTTCACAGGGATCTGACGAGCAAGAATGTA 
CTTATTCGCAATCTGGCCAACGATCAGTACGAAGCTGTAGTGGGTGACTTTGGTCTAGCCGCAAAGATACCAGTCAAATCCAGAAAATCACGCCT 
TGAGACAGTCGGCTCACCGTATTGGGTCAGTCCGGAGTGCCTGAAGGGCCAGTGGTACGATCAGACAAGCGATGTCTTCTCCTTTGGCATCATTC 
AATGCGAGATCATAGCCCGCATCGAAGCGGACCCGGATATGATGCCGCGCACGGCGTCGTTTGGACTGGACTACCTGGCATTTGTAGAATTGTGT 
CCGATGGACACTCCTCCTGTGTTTTTACGTCTGGCCTTCTACTGCTGTCTGTACGATGCCAAGAGCCGACCCACGTTCCACGATGCCACCAAAAA 
GTTGACTTTGCTGCTGGAGAAATACGAGCACGAGTCGTCGGCCGGAGCAAGTCCACTCAGTTCCCTGGAGTTGATCAACTGCTCACCAGGAGGCA 
GCGATTCGGAGAACGTAACGTTGACACCCCGAGCATCAGTACCCACGGTGGCCACTATTTGTACTGCTCCTGCTGGCACTTTAACACCGCGCCAT 
CTACAGGAAGTGGACGAGGACGGCTTCCGATTTCCAAGTAAAAAGTCTGCTTCGGCTGCGCTTCCGGGCACGCCAAATGGATCTGCTGGAAGCAG 
TGGCTTTGATGATCATATACTGAAGTCTAGAGTGCGACGGGCTACTCAGAAGTTGGAGGCCGAGATCTTGTTGCACCGGCGTTCGCTAAGCGAGA 
ACATCATCCACTTCCCCATGCACACGTCGCCAAGTGACAAGGCGCGGTGTCATCAGATGCAGCGACAGCGCTCCTCAACACACTCACCAACGCCC 
CCAAGGCAGCTTTCCTCGGTTCAGTTGCCCGCCACGTCTATTCCAGATCCCCCACCGCTTCCCAAGCCCGTTCCAGCGGTACTCACACTGCGCAA 
AGTGGGCGATCCACAGTACAAGCCGACATCTGAAAAGGATCACAACGGCTCTAAACCCAATCCGTTTGAAGCGCTGGCGCAACTGCGTGGTGTCA 
AGAAAATCCTGGACGCCAATCCGAAAACCTATGCAGCAGGGGTGGGAGATCTATTCAGCTCATGCTTCGAGATGCTAGCGCCGTTCTTCAAAGAA 
CTGGCCGCAATGCAGAGCAAAGCGGGGGCTCAGGGCGGAGGAACGAGTACGGAGGAAACATCAACGGGTAACAAGGCTTCGGCAGCAACGCCCAG 
TGAACCAAAGAGCCTGCCAGTGTCGCCGCATATGACGCGCAAATACAGCGCCATCGAGCTTCGACTTCCTCAGCGAATCAGCAATGCTGCTCCTC 
CAGCAGTTACCAGTGCTCCACCACCATCAATAGCATTGCCAACCGATGAGGATGACTGCGATGATTGCAATTCGGAAGGTGAGGATCCCGATGGC 
GGTGGCAACTGTGCAGATGCGGAATCAAGCAGCGATGACAGCGCCGTGCGTCCGTCCACTGTGCCAACCGCTCGAAAATACCGCGCGAATTCCTT 
GTATGCGCATCCGCTTTTTCGCGGCTCCAATGGCACCAGCATGAAAAGTACCAGTACTGGTGAATTCGAGAGTCCAACTCTACTGAAGGCCAGCG 
CATCTACAATCACGTTGACTGCGTCAGTGGGTGCGACCAATTCGTGTGAGGATCTGACTGAAAGCGCCAGAAAATCAAAAGAGCAAGCGAGTAGC 
GGTGCAACAGCGGCGGTGCAGGCTTTGCCCACTCCCATGTCTGGCTGCTCCACCAGGTTGGTGGAAACCCCGGTGTCCGTTTCATCCTGCACTGA 
TCTTGAGCTCGACGAACTGAAAAACACCTGCGATCTGAACGAATTGAAGAACTCGTGTGAGCAGCTTCCCTTGAAGACAAACCTAACGAGGCGGG 
ACTCTATAGAAAGTGGCTTCTTCTCGTGCTTTAACGAAGAGTCCGATGCGGCGATAACCAACAACAGTTTCGGACGGCAGCTGAACGGGATTGGA 
TCGTTGGGGTTTGGTTGCGGCGGCGGGGGCAACAGTAGCAACTGCTGCTATGAGAAATTGAAGACGCAGCTTATGCTCGATGCATGCGTCAGCGC 
AGATAGTGAAATCAACGATAAACTGGCGAGTTTGTGTCGCTGCTGCTACTACGCCACCTCGACATCGAATGCCGCCGTAACTGCAGCCGCTGCTG 
CGGCCCAGCAACAACAGCAGAGCCACTTGCTCAACGATACGTCCTCGACCTCATCGTCGGTGCTGCTGATCGACAATGATCCCGCTGTGATCGGT 
GGAAATACCTCATCATCCACAGCGTTGGACTCGATGGATGACTTGGACGCGGACATTGTGGCATCGCGGCGCACACACCAGCGACGATACACTTG 
CCACCATGCGTTGCTCAGCGGGAATGGCAGCGAATCCCAAATGGCGGCTACGGCGAGCCTGATCAATCGTCTGGCTTTAGACTCGGAGATACATA 
CTCTGCTTCAGCGTAGTCCTTTTGCAACGAATCAACTATTGTACTGTAAGAACCGCACCTCGTCCATATACACGGATAGCTCCGATGATATAAGT 
AGCTTGGCCGGCTCTGACTCCCTGCTCTGGGACGACCGCTCCTTCACGGCACTGCCCAGTACAAGATCCGCACAGATTGCCAAGATAGTGGAGTA 
TTTTGAGCGAAAGCGACAGGACTCCTCGCCGCCTTCGGTGCACCATCGCAATGGGAACGGGACGGTAGCAGCTGCTGTGGCTGCCTCTGCCCTGG 
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GTTCCAACTACCTTTCGTACGAGTCGCGTCGCTATTCGGACTTCAAGCGGCACCTGAACGCGGATGGCCTGTGCTTCGATCTGGACAAGAAGCCG 

GCGCAGCAAAGGATGATGGTGTGCGAGGGTGCGGTCAAGTCGAAGCTTCCCATATTCGATAAAAAGAAGCAACAGCAAGGCGGCTAGCTGTGGTG 

GGGACCTTGTGATGAAAACCCCGACCTGATTCATATTTAATAACTTATTTCTTCGATTCTTTGAGATTTTTATCCCCATACACGATTCCTTTATA 

CCTGGCAGTACCACTTGATAAAAATACAAAAAATAATCGCAAATAATTTTGCCCACAAAAGTTGGTTAGTTTACGATGCCTAAAGGTTTAATTTG 

TAAGTAAATAGTTATTTCCTTATTGTCAAGTGTATAGAGCGCAATTGTAAGTAAGCCAAAATGTTCTACTAAAAATTATGCACGCTCTTACAATA 

AGAAACCCAATTTCGTTTGTATAAGTTGAAAAATTATATATAATTACCATTTATAAGCATATAGATAAACATAATCATACGTCGTACTGGTTCCT 

AGTCCTTCAAATTTACTTCGCTACATACATTATGCTAAATAGCGAGTATTCAAAAACCAATCATGGAAAGTATTTATTCTTAATAATTATATCTG 

TCGCTGGTAAACTCCTATGAAATGGCATAGCTGTTCTTTATGAATACATTTACATATAATTTTAAGTTTATCGCGCTATTTATTTGATTTTGTCT 

AAACTCTGTATAAAAATATTATGTTTAAAATACAATACAACAAATATACACACATTTATACATAAAATGGCATATTGCAATGTATACATTTTACC 

TGTAACGATTAATTACCGCGTAAGGTCCAATTTAAATAATTACATAGACTTACCGACTTGAGGTTTTAAGACTTAAAGCCCCCTTTCCCTCGAAA 

ATTGTACAGATGATCCCGGATCGATTTGAGAAGCAAGAACACTTATCATTGGCCAATCCCGTTCGGCTTAAACACGACTTATGACTTTAGTTTTA 

GATTCATACGTTACAAATATGTGGGTATCTGCATTACAAATAAATACATAAGTATACATT 

(SEQ ID NO: 1436) 

Start ATG: 1101 (Reverse strand: CAT) 

MSLNQHSGPGPPSEICGLSVDQVDSSAASPFIWPSTAGMCTTATTVTTSTNGDATAEAPVKHQPLHNGGWIGNGLPPAPVTRTISSDRLVTGSSC 
RALRTAVSALYSVDDFVKEKIGSGFFSEVYKVTHRTTGQVMVLKMNQLRANRPNMLREVQLLHKLSHANILSFMGVCVQEGQLHALTEYINGGSL 
EQLLANKEVVLSATQKIRLALGIARGMSYVHDAGIFHRDLTSKNVLIRNLANDQYEAWGDFGLAAKIPVKSRKSRLETVGSPyWVSPECLKGQW 
YDQTSDVFSFGIIQCEIIARIEADPDMMPRTASFGLDYLAFVELCPMDTPPVFLRLAFYCCLYDAKSRPTFHDATKKLTLLLEKYEHESSAGASP 
LSSLELINCSPGGSDSENVTLTPRASVPTVATICTAPAGTLTPRHLQEVDEDGFRFPSKKSASAALPGTPNGSAGSSGFDDHILKSRVRRATQKL 
EAEILLHRRSLSENIIHFPMHTSPSDKARCHQMQRQRSSTHSPTPPRQLSSVQLPATSIPDPPPLPKPVPAVLTLRKVGDPQYKPTSEKDHNGSK 
PNPFEALAQLRGVKKILDANPKTYAAGVGDLFSSCFEMLAPFFKELAAMQSKAGAQGGGTSTEETSTGNKASAATPSEPKSLPVSPHMTRKYSAI 
ELRLPQRISNAAPPAVTSAPPPSIALPTDEDDCDDCNSEGEDPDGGGNCADAESSSDDSAVRPSTVPTARKYRANSLYAHPLFRGSNGTSMKSTS 
TGEFESPTLLKASASTITLTASVGATNSCEDLTESARKSKEQASSGATAAVQALPTPMSGCSTRLVETPVSVSSCTDLELDELKNTCDLNELKNS 
CEQLPLKTNLTRRDSIESGFFSCFNEESDAAITNNSFGRQLNGIGSLGFGCGGGGNSSNCCYEKLKTQLMLDACVSADSEINDKLASLCRCCYYA 
TSTSNAAVTAAAAAAQQQQQSHLLNDTSSTSSSVLLIDNDPAVIGGNTSSSTALDSMDDLDADIVASRRTHQRRYTCHHALLSGNGSESQMAATA 
SLINRLALDSEIHTLLQRSPFATNQLLYCKNRTSSIYTDSSDDISSLAGSDSLLWDDRSFTALPSTRSAQIAKIVEYFERKRQDSSPPSVHHRNG 
NGTVAAAVAASALGSNYLSYESRRYSDFKRHLNADGLCFDLDKKPAQQRMMVCEGAVKSKLPIFDKKKQQQGG* 
(SEQ ID NO: 1437) 

Name: SERINE /THREONINE PROTEIN KINASE 
Classification: protein_kinase 
Gene Symbol: cdi 
FlyBase ID; FBgn0004876 



Celera Sequence No, : 142000013384325 

ACAATTACTCAGCCGAACTGTTTCCCACCGTGGTGCGGAACTCGGCCATGGGATTGGGATCGATGTGTGCCCGTTTGTCGGGAGCCCTTACTCCG 
CTCATTACATTGCTGGACTCATTTGATCCGAAGATACCGGCCGTGCTGTTCGGCGTGGTGGCCCTGATCTCGGGCTTCTGGGTTATGTTCCTGCC 
GGAGACAATGAACCAGCCCATGCCCGAGTCCATTGAGGATGGAGAGAACTTCGGAAAGGGAGATACTTGGTTCAGTCAGTGTGCTGGTCGCAAAA 
AGCGCCAGAATAGCGTATATCCCGACGATCCGGAGCAAATGGTGCCGCTCAAGAACATCGAGAGCAAGTAGAACCTAGCCAAAGACGATTGTTTC 
CCAAGTCCAAAATCCAGCTTAAATTCCTTGTTACCCTGTGTTTTCTTGTTAAGAATTCTACTCTTCTGCCTAGTAGTAAGATTTAATAGTTTCTA 
TTCATTCCAAGCGAATAAAACGATGTGCAGAATATGGGGAATCCTAATTTGTGGATTCACTCATTTCTGAGTGTCCCTTTGTTGTTTGTACGGAT 
TTTTTCTTTTAATTTTTTCGGTGGTAAAATTTAGTTTAAAGGAAATTTAAATGCTTGACGAGCATTTTTACTGATTAGTGTTGTCTACTTTTAGG 
CTCTATCAATTTAGGCGCAGATTCATAGCTTGAATTTCAATTTTCTCTACGGTAAAACCAGTACACACCAAAAAATTATTTCCCAACTATTTCCA 
ATTAAAGTTTATTCTCCGACCGACACAGCCAATTCGGGTTCGTCTGGCCTTGCAAAGTTTCAGTAATTTCCACTCAATTGCACTATCGAATTGCC 
AATTGCCGACAGGGATTTCCAATGTCAAATCCATAACCCATCGCTTCCCTTTGCTCCACATGATGTTTAACCCGCCAATTAAAGTTTGTTTAAAG 
AGAGAGTTTTGGTTTTTCGGTTTTTGTTTCTTGTTTGGTAATTCTTCATTTTGTTTTGTTTGGTACATTTGAGAACATTCATTTCGAATTTGAAT 
ATATTTTCAATTTAAATTTTAACTACACGTAGGCATATAGATTTATAAACGTATGTATGTGTATGCCGATATCTGATGCGATATCGTTACACGAT 
TTCGCTGGCTTAGAATTACATTAACATTTACATAGATATAGTAGCTTTATATCTGGGATTTCGGATCGCGGTTCTCGGTAGCGCCTCTGTCAGCG 
ATTTCCCGTCGAAGGTACACAGAGATATATGTAGATATTGTTAGCAGGTATTTGCGATGGCTTAACATCAATATTCTTACACATAGACACACTCA 
AATCGCGAGATAAACACATCTAGATGCGGTTGTTAGCCAATGAACTTGTATCAGAAGCCAACTGGGGTAAATGCTACGCTTGAAGGAAGGGTGGG 
AAAGTATGTCGGGTATGGGAGTTCGAGTGCCTATTAATACAAACATACGAACGTAGCTAAATATATGTATGTGTAAATTAATCACAATTGAATCG 
GAAAATATTTACAAAAAATACTCAATCGAAGCCCCAACCATCCGCAGAGCATGAGTCCCAACCCGTTGTCCCGATCGTTTGGTTTTACAACTCAG 
GTTTTCAGCTTTAGTTTCGTCACCACCTTATGATTATGCCCGTTCCGGGCACTTGGTCTGGACAATTGGCGCCTTCACCTGCAGTCCCGGCCCGG 
TCTCCGCCAGCATGGCCGTGTGCACAATAATGGCCGGCTGGTCCTCGCTCTGAGGATGCAGCTCGTGCGACTCGCCGGGTGTGAGGCTAGCCAAC 
GCGGCCAAAAGATCATGGTTGACCAGTGGCTCGCCTTCCTCGCGCGGCTCCCAGCCGGCTGGCGGCGAAGCGGGCGGGGAGATGAGGAACTGTTT 
GACCGGCGCCGGTGGCTGCAGGTTCTTGTTGCTGACCGGTGTCACCGGCTGGGCAAAGTAGCAGGTGATAACCGTCTTCTTGTTAAACTCGTACT 
GGTGCAGCTTGATTCGCGCATTTGCCGCCGCAATGGCATTGTCATAGTTAACACGCAGGCGACGGAAGCTGCGCAGCCACTGGAACGTAGCCGAT 
TCGCTGAATGTGCGGAATAACTCCTCCATGGCGTGCTTCAGCTCTGGATTGGCAAACACCTCGGAGTGGATGTTGGTCACGATGATTGAGGTGGG 
TAGGTCGTCGAACGAGTCTGCATCCACCTCCGGCTCTGTGTCGCTATCCACGTCTCCCTCCTTTGGGAGCGACGGATGCTGATTGGGCAGGCCGT 
CGGCGGCGTTGATGAAAATGTCCTGGTCCGGCGACAGCTTGTCTATGCTGCCGCCTCCACTGCTGTTCTGTGTCGACTTCAGTTTGTTCTTGCTT 
CTGCCGTTGGCGCTGTTGTTGTTGTTGTGATTGCCACGCGGAGTTGTCGGAGTGGCTGCATTGGCCGCGTCAGCTTCGCCAGTGGCGTCAGGAGT 
GGTCGGATCAGGTGCATCGGCGGACGCATTGTTGTTGGACTTGGCCGCGTCGGACATGGCTGGCTGAAAAGGAAATGAAAGCTACTTTAAATAAT 
GTTTACAATAAAACGTTTTTATCGAATATGGCGAATAGCAGCCAAGAAATATACTTCTGTTAACCATCCATTTATTATTTACGAAGCATAAACCC 
TTTTTATGTTGTTAAACAACTAATTATTGGGGTCAAATAAATGAATTCCTCGTAAAGTTCATTTAAAAGCTTGTTTTTAATGGCAAATGTATGTG 
GGAAGTGGCCGCTGTTCAAGGATTTATTTTTGTGATAGCCATTAATGCTATATTTATGTTATACGGCCTTTAATTATAGCCCAACGAGCACGGAC 
TTTACAGTAATTGGATTCAACAAGTCAGTGACATTCTCCATAAATAGAACTGTCAGCTAGTTAACCTTCACCTCCAAAACATTTTTAAAAAATTG 
GTAAACAGCACTCGCAAATTAATTCCGCACAAGCGACGAGGGGGTTGGATAAACACCTCAGCACAACGACAGGATAGGGTTTTTGTTTGCGTAAT 
ATATTTGTAACAAATTTCGAGCGCCACGCTACGTCAAAAGAATTGCTACCGCTTTTCACGGAACTGGCAGCCCTTCAAAGCAAACCAATTACGGT 



FIGURE SHEET 776 



ACCAAAAACTACAGACGTGTCTAAGTTTGGGAAGGAGCGGGGGACCACAAGGAAGTGGGTCTCCGGCAGGCGAGGACCTTAAGTAGCGACACGTG 
AAGCTAATCAATAGGTCTATCAAAGGGGTTGGAGTTAAAATAGGGATCTCCGATGGTGACGCACTGTTGATTCACAACTGCTCTTGATTTTATTG 
TTATGCCTGCTAGCCACAAACTTTTGGCGAAGGGAAAGCGCCAGACGGTTGATTAATAGTAGCGAAATTTTTATGAATGAATGAATTAGTGAGGA 
ATTGGCGTCAGAATAAAAATGGTGACGTGTGTCTGCTTCTGCTTCGCACCTGTGACCGCCATGAAACCCATTCCCAACGAATCCAAGGGAACCCT 
TGGTGGGTTTCATTATAATAACTGCCAGTCGTAGTTCTGGATGGGTGCCCATTTGTATGCTATAAAAATAGCATTCTTCTTAGTGTTTTATTGTG 
GCCCATGAATCAGCAATATGCAGTGAGTTAATTATACGCTGTTTACCCGGACAGGGAGTTATTTCAGACAAATGCAAATGGGTTTTTGTGCTAGC 
TTGGGCTAACAAAATTGGCTTAGTTTTTGGGTGCGAAATGTGCGAAGTTTTTATGAGTTTCCATGCAAGGATTCGGCCCAGCAGCTGAAAATGTT 
TGCTTTGTATTTCTTAATTAATAATTTTTGGCATAAATTAAATTTCCAAGAACTTTTAATGAATTTTAATTGGGATTTGGATTTTATGGTTTCCA 
TAACTCAATTGATTTTATGCCGGCAACTATATTATAATTTTGTTTTTGTTATTTCTTAAAAAATGACTGTTAATAGACTAAGTTTTACGTGCTAA 
TGCGATTATATATATTTTTAAAGGAAGGCAAATGGAGCGTTGAAGGATAACTAAAATAGCTTAAGGGTTATTCTTGGCTATCAAATAAAGTAAGA 
AATTCCAATTACTACCTTATCTATGAAATATGTTTTCTTGGAACTAAGCCATTCTTCGTGTGAAGTAAATTGCTTTTTAAAGCGATTAAATTGTA 
ATACCATCTTTGTGTTAAGCTTATACTTATGATTTTCGGAAAATAAATAATCACTTGAAGGAATGACAAGGCCAAGGACTGGGGTTTTAAAGCTC 
ACCAGACGCGACGCAAGAACCTCTTTGGCTACTGAATTCCCAAAGCTTATGCTTTGTTAGTACTTTCACTTTTCTTTTATGGCTATTTTATTTTT 
TGTTGACTTTCACCTCAACGCAGGTCAGCAGAGCAACAACAGTAAGCGCCGTAGCTAACAATAAAAACAAAGGCCAAAAAAGAACCCAACGACGA 
GTTGATCTATTTCCGTATGATTGATTTCCTTGAAAGGACACACCCACGCAATTGCGACTTTTGCTTCCAGCGGCAGTTGGAGATGCTACTGTTGT 
TCATGACGTGTGTACTTTGTTTGCCGTAGAGCTTGATCCCTATCTCAGGAGCAGCATGGGAATGAGGACGCGTGTAAGTGCAAGTCAGCATGAAC 
GAGCTGCTGGCATGCAAATCGCATGGGCCGCGTGTGACGTCACCGTTCCTGCACGCAATCCATAAAAAAAAAACGAAGAAAGCGGGCCGATAGCG 
CACTTTAAACTAGACAAAAGTAGCTCTCAGCCCCGCCCACTTTTGAAAGCGGTGCCGTGGTTTCGACATGATGATGAATTGCCTGCCGGTCGTTA 
GCCAACGACCTTCACTCATTCATAATTGCTGTCATCATTATATCAAGTGGTTGCTGCCCCCCACCATAGGACTCCTGCTCTCTATCCAACCATCC 
ATATCCATGTAATCATTTGCCAATCCATCCATCCTTTTGCTCATCCCGCCATGTCAGGCTTTGCAGATTTTTAGCCATAAGTAGCCCGCATGTAA 
GCCACCTCAAAGACACGCAAAAGCTCTCTAAATCATTTGAGACGGCTTCAGTGGGCCACTCAAAAAAAAATCAGCAACAAAGGGATATAAACAAA 
TTCATGTAACATTTTAAGCTGAACACTAATACAATGCGTTGCCTTTTATTACTTTAATAATATTGATCAGAAGCTCTAGGACATCCTTAAATGTT 
TAAGTTGCCATTTAATTGAAGCTCACGTCCCTCACTTAAAATTGATCGGATGTGTTCGTGGGAATACGAGTGTGAGTGAATTGTGGCCTTATTTA 
CATTATCGTCCCGATTTCGTGACAAAAGGAAAATCCCACGAGTGCGCTTTCCACTCTCCATTACAGCTGCTGCTATTGCATTTGCCGCTGCTGCT 
ACTGCTGCAGTTGCAATTTCAGTTTACCCTATTTGAATGCAAATGCAAATAACGCTGGGGGAAGATTAATTTGATTTTCTGCTGAAATTTTAAAC 
TTTCTCCTGGCAGATACCACGCGTCTGAGCCAGATATTTGCGATTGATGTAAACCGCCTGGAATTTAATTAAATTGTTCCTGGCCTTTATAGAGA 
GCTCAAAGCGGCGCTCTGTTATTACAGCTCCTTGCGGTTGCCGATGGAGGTTTTGCTGCTCATATTTGTGCCTGCAAGTGTCGGCTTCGTTTGCC 
ACATGTCACATGTTGCTGCAACCGGAATCCTTTCAAGGCAACTGCAAAAGCAAAAGGTCAGGCGGTCAGGTACTGCTACAGTTCGGTCACTGCTC 
GTTGGGCTTGCTCGAAGCAAGCATTACATCAGGAGTTCTGTGGCAGGACGACACGCGAATCGTGTGCGAAACTTAAGATATTTTGCTGAAAGCTT 
CGTCAACTAAATTAATTGAATCCTAATTAGTTTGGAACTTCCCTTTGTTCACTGTTCAATCTGGGCTGGATTGCTTCTGCTGTAGGCTTAGAAAT 
TTCCCACACACGATGCTCATTGTTGGAAATTTAAATACTCGTATCTGTGGGCTGCACCTGAAAGTATGCTATTCTTCCCACACACATGGAGATTC 
TCTGTAATTTAGCTCCACTAAACCAAACCAGCTTTGCTTTCACGGCTGCTGTGGCTGGATAATTACTACCAAAGGTGAAACGTTCACCGAAAACT 
CGCCCAGCATCATGAGCGGTTTTGCGTGGGAGCGGCATAGCCTTGGTGAGTGCGTGGTTGTGGCTGGAACGGAACAAATCCTGCTGATTGGCAGG 
ACATTGATTGGCCGCATGAGCCATGTGTGGAAAATGCAAAATGCTTAAAAGCTTAGCTTTGGACTGCAGATCAGCGAGATTTGTATTTGATTTCT 
ATATACACTGCGAAGATATTATTTATTCTATCACTAGAAAAATCACAATGAAACAAGAACGCTAGAGTATTTTAAGTAAACAGTTAAGTAAAAAG 
TTAAATATTTAAGATTTATACCCAGCAGAATAAAGCAGACTACAATAAATTTATAGGAAATCGAATTACCAGGAAAGTTTAATTCCAGTAAACTG 
ACCAATGCAAATTGTAATTATCCCAAATTATGCAATATTTTAAAAATCAATTTTTTTTTATTTTAGCACGTAACTAATATGCTTACTAATTCTAC 
GGGGAACTGTGTGGGTTTGCTTTGTGGAAACATTAAACGTAATCCCTTTATCCAGTGGATAGTTGTTCACCACCCGTCAACCGGTCACAAGATAA 
CAACTATATAGCTTTGTTAATGCCGATACCTTTGTAGGCTGCGCACAGCATCCGAGGGAAAATCACGGAATTTGCTGAATTTACCTTTGGCTTGC 
ATGACAATATCAGGTTTTCGTAATTACCCGACACTTCCAGGCATGAACTGTTAATTAAATCATTTACACACCGATGAATCGCACCGCAATTGGAT 
TGTTAACACATTTGTACCAAGTGAATGTATGAAAAATGGTTTCTCTTTTTGGGGTCAGAAAATTCCTGGCATTTCGATTGACCTTCGACGAAATT 
GTAAATTCTAATGCGGCCCCCATACACATACCATAACAGCTCATCCATCCCCGCCCACTACTTTTACAAATTGGATTGTTATCACTCAAAGTCCT 
TGGGCGATTCCGAGGAGTTTTCACTTGCCTCAGACTCAAACTGATTCGACCAGCCCCTTTGTTGTCGACTGTCACTCGCAACTGTGCGGCTCCAA 
GTGCAAATGTATCTGTGTATCTGTAGGTTTCTGTGTGCGCCCGTCTTGTCAGCGCTTTGACATGAAAATAGTTTGGGCCAACTTGGCTATGCAAA 
AGCGATTTCTATTAGGAGCCGCCATCGCCGACATGTGCTGCCGAAATTCGGTGATTTGGGTAGGATGAAGAGATTACCAAAAGTTTACAGTGGTT 
CTTGCTTCCAAGCAAGGAAGGAATTAAAATCCTGTTCAGCAAATATTCCAATAAATCGCTTTTAAGCAAGTGTTATCAGCAGTTCAATGTTGAGT 
TCAACCGGTTGAAACCCGCTGGTTATCGTAAATCATGGCCACGGTAGTTCGCTCGATTCGCTTCTTATCGCGACGACAGTCCCCACTGACGCACC 
AAATGGACGTATTTAGACTCCATAGATAACATTTGGCTATTGTTACTATGGTTAATTGAGACATTTACAACTCAAGTATACCTCAAGTCAAGCAG 
AACATACGTATGCGCGCATATACAGTCGAATGGAATCGACGTCGCTGCCAATCAGGCCGATGTGTTAAGTGAGTTAGGGGCACTTTTGCTTTTGG 
GACGTGACGTCAGTGCAAACTGCAGCTAGATCCAGTCAAGCCAATTGACGAAAGAGCCGACGGCAGCGGTCTGTACGTATTCAATTCATCAATTG 
GCAAAACCCTGGCAAAAAGGCAAGAAGATGGAAAACAAAACAAAAGCGAACGCAAACAGAAATAGCATAAAATGGACAACAGACGCGACAACACA 
GGCACCAATCGTAGGGGCAAAAATTATGAGTTGCTTTTATTTTAAAATCGTTTCATTTATTTTGCGTCAGCAGCGCCTCTCAATGAGGAAACGCT 
GCGTATACATAATTTTCTGGCCACATATACGCACTCCCATATCTTTCTCCCATCGATTCAGTGGTCGTTCGCAATCAAAGGCACAGACGAGTGAT 
CTTTCCTTGGCCTTTTGCAGTCTCTTAAAGACGGGTTGTTTTCGAGTGTTTTTTGCAGCAGAATGGTAATCAGAATGCTCTCGGCAATTCTTTAT 
TGAACGACCGTTCAATAAATTTCACAAAAGGAAATGTTATCAGCGGGTATTCTTAAAATAACTCAATCTTTAACAACTTATATTTGGAATCAATC 
ACATCAACCCGTAATCAGATAACTGATAACATTTTAGAGGAAATAGTTTGTACCGGCACATTAACAAGTTAATGGTTTTGTTATTCGCCTTATCG 
TGAAATACTGTAAATAAATTTTTTAATTGCTTTTCGAAAGTAGTTGTTTACAACTTAATTTATCAGAGTGTTTTTGTTCTCCGCTCAGAATTAAT 
CACGAAGTGGGGTGGTAATCCAGATTAACATAAGAAAAAAACAATTTTAAAATATAATATTAAGTCGCCATTATGCATGAGTAACAACTAACAAA 
TTGCTATAAATAATAGGAGATAATCTTGAATACTGCTAGAAAGTGTAATACAATGTAAGGCAATTAAATGTGCAATAAAATCAATGTTATTATCA 
GCATGATTGTATATGTCGGAGCGCGACTCTTCGCATTCCAGAAATTCCTTCTGTGATTTCATATTCTGATGAAATTGTAATATTGCGGTAAAATT 
CTGTCGAGCGCTGCGGCAGAGGCACGAACAGCCTCTGCAGCTGAGTGAGGTCCGATCGAGTGGTCATAAATAGTGTGTTAGAGAGAGATGACAAT 
GTGGCACACGCCAGTGTGTATAGCCATTAGAGAATATGATGAAGAAGGGACATGTAAGAAGATCCCTTCAGTGAAGTTTGACTGCTGACGTCGAT 
CGGAACTTGCTGCGCTGACGTACAAAATCGCGAAGTGAATAAATAATATGGATGAGACTCCTGTTTCGCCGACATATACAATAGTGCTCAAGACC 
CTAATGGAATTATACGTTAATAACCAGCCACATTTCTTAGATATTTCTAATATGAGCCATCTGCTGCAGGTTCTTTCCAATATCTAATTCTAGAT 
CTTCTTCGAATACGACCTTTTTGGCCATGAAACGATGATTTGCCACTTCATTCACAAGCATTAATTTGTCATGATTCTCTTAAGCGTGCACTTTA 
TCTGAAAGTCTGAACAGCTGGCTGCGAAATGGATTCCCCGGATTGGAAATGGCAAGTAAATCTGTCCTCGCTACAAACAAGTGGGCACCACTGGG 
CATTCGGGGAATAGGGATATGGGTAGGAATGGGGATATATTGTGGCATTGGCGAAAGGTCGCTATGCAAAGCCAAGGTCGAGACTTACACCCAGG 
ACATTGTGTTGTGCGGCTGGCAGTAGAAACGCCCACGCACACGCCCCTGTCAACTGGCCGGGTAAAAAATAGAATCAAATACATATGTATACAGA 
ACAGGAGTTGAAGCAAAGCTCGCATTTTATTTGCATTTGTGAATTTATTGCTTTCGGTTACGTCTGCTTCGCCTTGCCACTTTTCCAAACTCGTT 
GTGCTGCTGCTTGGGGAATTTTCGGAAATTCGAAGTGATTGGGTTTGGAGCAGTAGCCAAGGGCTTCTACTGTTGCTCAAAGCAATTAATATTTG 
AGAAAAAATAACGAAAACGAAGTAAGGCGGCAGCAACAAAGCGACCGCAACAGTGGCAACAAATCGCCCCACACCGCACGTCTTCAGTTTACAAC 
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CAAGGACTTGGCACGTTTAAAATTCAGCCAGCAGCCGCAGCAACGAACACGGACAGTGGGCGGAATTTAATTTCGAAAAACATATTACTTTATTG 
TTCACATCTTTACCCACCTTCCGATTCGAGTACATACGATTCCACTGAAGCCACTGAAGACTTACTTCACTTAAAGTACATTTTCTTCAACTTCG 
TGGGGTCTTTCTAGAACTTAAGACTTATTCTAACTTAAAAAGACTATAAAGTTTTTAAACATTCCGTTGTTTTGTCAAATTTGAATAATTTACGA 
ATGTGAATTTGTCGCAGAAGTATTGCTGAGTATTATCTTTATAAATTTTTCATTTTTTAAAGTGACTAAAGAAACAGCTTTAATGCGTCTTACTT 
CATATTGGTCATCCCGACCACAGTGCACTGGTTACCAAAGTATGCGATAACAATAACATCGTCCACTATGTGGTCGTTGCTGGGGTTACGGGCAG 
TCAGTCACGGTCACGGCTGCAACCCCTCCGCTCCGGCCAAAACGAGTCCACAGAACAATGGCCAAGGACGAAGGGGGGGGAGGGGGGATTCAATA 
ACAACAGCAGGAACAACAAACATTCTGCTCCCACGCAAGAGCAGCAGGAACTGTGGCAAAAACAGCTGCTGCAGGATCAACGAGGTGGGAGTGGA 
TTCGAGGACTTTAAAACACGAGCCAACTAAAAATATTCAGACACGCCGAAGCGAAAATACCCTTTAGCAGGGGGTTAATTCAAGTGGGGTATTTT 
CCCCTTCTAGTGTTTTTCTTTTACCAGGGTAATCACAGATTCTGTTTCTGTGCACTTAAACGGGGTTTATTCCGTTTCTAAAAATATCAAATGTC 
TGGATTGACTTATTTCTTGGTATTTGACTAAAAATAAAAACAAATATGCAGCTTTTGGGAAACTGATTCCTTGTTGACACAATAGAAACCACCAT 
ATGACCACACCTTCCACGCTACTGCAATAACTAGAACATTTAGAACAACAAACAGAATGGCACTATTATCGGCAAATGATGGCGCATGCATAAAA 
ATGAGGAGATCGAAAGTGGGTGGCTGGGGATGGGATAACCTTGTACTTCTTGAGATAAAAACAGTTCAGTATTTAGCAGCGGCTTATTCCTGAAT 
GTTTATAAACAAATCGAGCGAGAGAGCGATGCATAAATGAAATATGAGGGGGGGTGAAGGGTGGTTGCAAAGAGGGTGACTTTTCGCAGACGCTG 
CGGCACCAACACAAGCAAAGCAACGGCAACAATTTGCACGTAAAACTCAGCTGCTGCTGCTGCTGCTGTTGCCGTTACATAAGAAACTCTTGCAA 
AAAGTGCAAATCTCAATTCGGTATGCAAATGGACCGCTGCCACAATGTACTCACCGTTTTTAGAAACTATTCAGTGGCCAAATTGCGATTCTGCT 
TTTATGCGCCAATAACAAACGGCTCTTCTCCACCGTTGTTGTTGCTCTCCTCCTTGTTGTTGTGGTCGCAGCTACTGCTTAGCGAAGCAGTTGTT 
ATTGTTGTTGCGGCAACAGTAATTTGCAAGTGCACCGCTATTGTTGAATGTTTTTAGACGATTGTCATGCCAGTGGAAGCTTCCAGTCTGCGTCA 
TTCGCGGAGACGCGTAGTTTCACGTTCGACGACACAACCGCTCACAGCGAATTTTGTTTTCGGACTAGAGCTGAGAAACGGCGAATTTTGGTGCT 
TCTGGAAACATCGATTAAACTAAAACCCGATAAGTATGCGACAAGCGGTCACACTCGGCGATGTGCGATAGGATTGCTGCATTCGATTATATAGT 
TATCGAAAAGAGCAAAGAGAAGCCGGAATATACGGCATTAAAAGTGCATTTTGTATTAAAATTAAGAAAATTAGTAACGATATACCGCAGAATAG 
GGAGACATGGCCAACGTGGAATCTATGATTGTGGAGGAAAAGACGCAGGTCAAGCAGATTGACCGCGAGAAGGTTTGTTTGTGACCCCGGTCATG 
TTTACTTCGCGTTTGCTCTAGATGTGTTAATAAGTTTATGTATTTTCCAGACCTGTCCTATGCTGCTCCGTGTCTTCTGCTCTACGGGACGACAC 
CACTCCGTGTCGGAGTATATGTTCGGCAACGTGCCCACCAACGAGCTTCAGATTTACACCTGGCAAGACGCCACCCTGCACGAACTGACCTCTCT 
GGTGC GA GA C GT CAAT CC GGACACC C GG AAG AAGG GC ACCT ACT T T G AC TTTGCTGTCGT GT AC CC C AACT TC C GG AGTAAT C ACTT C C AG AT GC 
GCGAAATCGGAGTGACCTGCACGGGTCAAAAAGGAATCGATGATAATAAGACACTTGCTCAGGCCAAATTCAGCATTGGAGACTTTCTGGACATC 
TCGATTACTCCGCCCAACCGACTGCCGCCAACCGCCAGGCGCCAGCGTCCGTACTGATTCTTCACTTTCATCTATTCTAATCATTTTAATCTTTC 
ACTTGACACTCAATGGAGTATGAACATAATTATAAGTGGAATACAAGTAAGACGAACTACGAGCTGCTTGACTTAAGGGACTATATGTTGTGGGG 
AGTATCAGTTCCTGTAAAAGCCGTTCGGGCGCTTCGACTTCTTCTTGCCCAACTGCGTCATGTCGATCTCCAGACCATTGGGCAGCACGGCGCGA 
TTCTCTTCGT 
(SEQ ID NO: 1438) 

Exon: 11550.. 11265 
Exon: 2534.. 1001 

Start ATG: 2527 (Reverse strand: CAT) 
Transcript No. : CT19005 

AAAATTCGCTGTGAGCGGTTGTGTCGTCGAACGTGAAACTACGCGTCTCCGCGAATGACGCAGACTGGAAGCTTCCACTGGCATGACAATCGTCT 

AAAAACATTCAACAATAGCGGTGCACTTGCAAATTACTGTTGCCGCAACAACAATAACAACTGCTTCGCTAAGCAGTAGCTGCGACCACAACAAC 

AAG GAGGAG AGC AACAAC AAC G GT GGAG AAG AGCC GT T T GT T ATT GG C GC AT AAAAGC AG AAT C GC AAT T T GGC C ACTG AAT AGT TTC T AAAAAC 

GGCCAGCCATGTCCGACGCGGCCAAGTCCAACAACAATGCGTCCGCCGATGCACCTGATCCGACCACTCCTGACGCCACTGGCGAAGCTGACGCG 

GCCAATGCAGCCACTCCGACAACTCCGCGTGGCAATCACAACAACAACAACAGCGCCAACGGCAGAAGCAAGAACAAACTGAAGTCGACACAGAA 

CAGCAGTGGAGGCGGCAGCATAGACAAGCTGTCGCCGGACCAGGACATTTTCATCAACGCCGCCGACGGCCTGCCCAATCAGCATCCGTCGCTCC 

CAAAGGAGGGAGACGTGGATAGCGACACAGAGCCGGAGGTGGATGCAGACTCGTTCGACGACCTACCCACCTCAATCATCGTGACCAACATCCAC 

TCCGAGGTGTTTGCCAATCCAGAGCTGAAGCACGCCATGGAGGAGTTATTCCGCACATTCAGCGAATCGGCTACGTTCCAGTGGCTGCGCAGCTT 

CCGTCGCCTGCGTGTTAACTATGACAATGCCATTGCGGCGGCAAATGCGCGAATCAAGCTGCACCAGTACGAGTTTAACAAGAAGACGGTTATCA 

CCTGCTACTTTGCCCAGCCGGTGACACCGGTCAGCAACAAGAACCTGCAGCCACCGGCGCCGGTCAAACAGTTCCTCATCTCCCCGCCCGCTTCG 

CCGCCAGCCGGCTGGGAGCCGCGCGAGGAAGGCGAGCCACTGGTCAACCATGATCTTTTGGCCGCGTTGGCTAGCCTCACACCCGGCGAGTCGCA 

CGAGCTGCATCCTCAGAGCGAGGACCAGCCGGCCATTATTGTGCACACGGCCATGCTGGCGGAGACCGGGCCGGGACTGCAGGTGAAGGCGCCAA 

TTGTCCAGACCAAGTGCCCGGAACGGGCATAATCATAAGGTGGTGACGAAACTAAAGCTGAAAACCTGAGTTGTAAAACCAAACGATCGGGACAA 

CGGGTTGGGACTCATGCTCTGCGGATGGTTGGGGCTTCGATTGAGTATTTTTTGTAAATATTTTCCGATTCAATTGTGATTAATTTACACATACA 

TATATTTAGCTACGTTCGTATGTTTGTATTAATAGGCACTCGAACTCCCATACCCGACATACTTTCCCACCCTTCCTTCAAGCGTAGCATTTACC 

CCAGTTGGCTTCTGATACAAGTTCATTGGCTAACAACCGCATCTAGATGTGTTTATCTCGCGATTTGAGTGTGTCTATGTGTAAGAATATTGATG 

TTAAGCCATCGCAAATACCTGCTAACAATATCTACATATATCTCTGTGTACCTTCGACGGGAAATCGCTGACAGAGGCGCTACCGAGAACCGCGA 

TCCGAAATCCCAGATATAAAGCTACTATATCTATGTAAATGTTAATGTAATTCTAAGCCAGCGAAATCGTGTAACGATATCGCATCAGATATCGG 

CATACACATACATACGTTTATAAATCTATATGCCTACGTGTAGTTAAAATTTAAATTGAAAATATATTCAAATTCGAAATGAATGTTCTCAAATG 

T AC C AAAC AAAAC AA 

(SEQ ID NO: 1439) 

Start ATG: 294 (Reverse strand: CAT) 

MSDAAKSNNNASADAPDPTTPDATGEADAANAATPTTPRGNHNNNNSANGRSKNKLKSTQNSSGGGSIDKLSPDQDIFIKAADGLPNQHPSLPKE 
GDVDSDTEPEVDADSFDDLPTSIIVTNIHSEVFANPELKHAMEELFRTFSESATFQWLRSFRRLRVNYDNAIAAANARIKLHQYEFNKKTVITCY 
FAQPVTPVSNKNLQPPAPVKQFLISPPASPPAGWEPREEGEPLVNHDLLAALASLTPGESHELHPQSEDQPAIIVHTAMLAETGPGLQVKAPIVQ 
TKCPERA* 

(SEQ ID NO: 1440) 

Classification: known_flybase_gene 
Gene Symbol: nla 
FlyBase ID: FBgn0026629 
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Celera Sequence No, : 142000013384485 

GATAGCCATTCCACCAAGGCGACGACATCTGAAGGACCATCGAGTGGGGACCGCGAGTCCCAGCTTAAGCATTGCTTGGACAATTTTAATTTGAA 

GCTTTTAACAGGAATTTATTCTACATTTGCCAGTTTAATATTCGTATACTTAAGTGTATAAAAATATATGTGTAATGTGTGTTTAAACAGTGCGT 

TTCGCCGATGTAAGACAGTTGTGGAAAAGGCAGTTCGTTTTCTAAATCCAAAGGCATTCAAAACGAAGCAGATTGCAATAGTTCTATGGCCTGGA 

AACAACAAAATATTAATAATAATAAGACCAAACAATTTGTGTAAGCCATAAAAATAAGAGATTTACAAATTCTTGAGAATACAAATGTATTTGTA 

ACAGAACGAAATTTAAGCTCATAACATTTAAGGTCCAATTATCACATACTTTTTGTTTCAAATAAAAATGTTTCAAATATTTCGACAAATACAGG 

GAACTTAAGGTCACAACTATCTTTCAGTGCTGAATCGCTTGTTTAATTCGTTGAGGCATTTTTCATTATGATTTGAAACCAAGTGCGCAAAACAA 

AT T G T AAGT AGGAAT G GCTC AAAT AT G T C AAAAAAGT AAAAT T AGT G ACT AAC ATAAC TT AGG ACT AGG T GACATTCGTTTCCGAGTTTC G ATG C 

TGTTGCATTAGGAATAATTTTGAACACGTGGAATTTACGTTTATGATGAGAAGGCATACATATACAGAATTCAGCATCTTCGCCTAAAACTCATC 

TGGATATGTGGTCTCATGGGATTCGTTGCTTTCTTCGTGCTATCGGTTGAGCATATTCGTCAGATTTGTGTGAAGGTAACGCGTTTCTAAGGTCA 

CTAAGGTGCTAAACATGCCGAGTTCATGTCCATCCAGCGGGGGAATGCATCGGGGTTTCGTCCTGGCCTTCTATTCGAGTCTTTCATTGCAGGTT 

TCTAGTGAAATACTATGCAGTCTTTGCCTGTCCCATTGTGATAGCCGATCCTACAACTGTGCCGCGAGTCCGGAGGCAGAGATGTTGTCGCCAGC 

CCCAGCCGTTTTTCTGGCTTCGCGACACACCAGCACCGGGGCCACACAGATTTCCACCTCTAGCCGTTGCCGGTGACGACCGTACTGGATATACT 

CCCGCCAACAGGGCACGGGACTGGCTGCACCAATGCGCATTCGCGGCGCATCCGCCTGAGCCGAGGTGGCGAAACTGTCATCTAGAACCTGCAGC 

ACTGCCTCCGGGTTAATCTGCGGATGAAAGAATTATCAGTTCAAAACTAAAACATACTACACATAAATGCGTACTCACAAATTGGGATTTGCAGA 

CGTATCGGTGAGCAGTGAGCGCTGCCTTTGCAGCGGCAGCACGTGTGTTTTTCCACTTGGAGCCCGCTGTGGTAAGGATCGCCTGATAGGCCAAC 

GTGTGCACATGTATGCGCGAGATAGAGCGACGCTTGGCATCGCTTGAGCTTCGGTCTTCGTAATCTTCAAGGAGACTAATGAAAACCTGCCGCAT 

CTGGTCAAGGGTGTGAGCTATGCGCGGATTCCAGTCGGTGGCTAAGGTCGTCCGTCCGTGAGCAAGAACCTGCTGCAGGTTAGAGAGCTCTTGTT 

CGTTCATTCCCAGGGAGTCTACGTAGGGCAGTACAAAGTGTCGAAGCTGTTGAAGCAGCTGCAACTCCACGTAAGAAGCCATCTCGAAGTGGTTC 

AGAGTGCCCTGCGGCTGGCTGGTTAGTTGCCTTTGTACTTGCTGCAGGCGCGCCTCTCGCTCACCAGACTTAAAGGTGAACATGTCCATCATTTG 

CAGCCCGCTGACAACCAACAGCTGTGGCTGATACATCTTCAAAGCATCCGTCAACTGCTCCACGGCCCGCAGATGCGGATTATTTCGATCGTTGT 

GCAGAATGTAGCGATTGGCGCGAGGTGCCACATATGGGCCCCATTTGTCACCAGCCTTGTACTCCAGAATAAGATGGATGTCATCGTTTGGAATC 

TCGTCACCAGCCAGACGAATCTCCTTTGGCAGCAGTGGCCGCAGCCTGAAGAAGTTTTTACATTAAGTTCCATGAAATAATTTGTTATTCAGATA 

GGTAATTACTTTCTAGACATGTGGGCGCCCAGTAGCACTTGAGCTCCCTCCATAAAAAAGCGCACGGCCATCAGTGGGGCGTTACCACCCATGTA 

CCAATTGATCCTCTCCTTGTCCATCACCTTGGCGTAGCCCACCAGCTGTGTGAAGAGGGTACTGTTTGCCATCACTCGCCTGAAAAGTGGACCAG 

TATGAAATACTTTGTTAGTAACTTGCATAGCCAATAAAATATTGGATATCCAAGATCCTTCCATGCAACTTACTCCGCTGCAGCGCCGTTCTGAA 

AGTAATAGGCAAAGGACTGCAAGAGCTCGTCCTCGTTGTTCACGACCGCCCGAGATCCTGTAGTGGCTGCTGCGACTGGAATCCGGCGCCCATAG 

TAGCGATCCAGAAATTCAGTGGCGTTTATCTGCAAGTCCGTGCAGGCTCCGTAGCCGATGGCCACCTTGGGTGCCGGTTCCAACGCGGTCCGCTT 

CAGGGATCTGAAATACAAATGCATATATAGCTAGTGCTGCAGTGTAGCCGAATGAACTCACCCCTCGATTGCCAGGAGACCGGTGAGCAGGACAG 

TCGTCCTGTTAAGCGTCTGCAGGGCGATGAATGCCTGCCAAATTATGGAAACAAAGGCCGTTAGGACGGAACAGCCGGTGATCCATCCCATGTAT 

CGCAGGGCGGTCATTGTGGGCGTGGATTGTTGTGTGATTGGTTCGCCTCAGTTTGGGCTTTCCAAACTATTTTGTATGTGACAATCCGCCTTCCG 

GCTACGCCAACTTTCTTATCAGAATTATCGCTCTTTTTCCTTTGAGATGCTTCAAAGCTAGCGATGACACTATCTGCAGTTCTTGGACTACAGCG 

ATAGTCCAACCAACAGTTACATTTGAATGCTATACTGCTATACTATATTTGAATATATTAAAATAAATTTAAATTGAAATGTGCACTTATAAGAG 

CGCCCTCTCAATGTGTTAATTTGTTCAATGCATAGAACAGCTTACATTGTGTTTAAACATCTTTTTTTGAGCATATTTTTTCGCTAACATATTGG 

GAATATGTATGTTTATTT GAT CCTCGTAGTTTATATTTTAATTTTTTAAGCAAAC AAAT AAAATTGCAATTGCTTATCGATGACAACAATGGTTT 

GAGCCACAGCTACTCGAAGCTGGACAGTTCAGTATTGAGCAACTGTAGTTTCGATAAACGATATGTTCCGAAATAACGCCTTCTGCTATCGAAGA 

CAGTCTTGCGTCTGTAAAGTGATAAAATAATTTTACAATTGAGTCGCCTTGGGAATTTCAGTCACTCTCCACACACCAAATACATAGCTCGTGTA 

ACAGCCTGATTTAGTCGAACAGCACATCATTTGCCGGAATCGTTAGCGAATAGCAGCACTCTCCGCTAAGTCGGCACGCTAATTGGCAAAACCGA 

TTCGATTCCAGGGAAGGAGACACACTGATAGACGGGTGAAGAACAAAAGGAGAAGGAAAACAACAGTTAATCGTCAAATTTTACAGGCAAAAGGT 

GAGCAACTACGCCTTGCACACAAACAACCGCATCGACGAGCAAACGCACGCACACACAAGCCTTATCGCCTGCGTAGTGGTTGGTGTATATAAAA 

AATTAAAACAATATTTGCACATCCGATTATTACTATAATATTTCGCGTCCACATATATTAGCAAGTGTGCGTGCGCCTGCCCTGGATTTACGTGC 

CAGTGTTTGTGCGAGCCTAGCGCGTTTGTGTCTGTGTGTGTGTGTAATGAAATGCAATTTTCTCTTGGCATTTTGCGCGCCTCCGATTTTCGCTG 

CCCCTCTGCAAT 

(SEQ ID NO: 1441) 

Exon: 2907 . .2627 
Exon: 2572.. 2354 
Exon: 2264.. 2100 
Exon: 2040.. 1314 
Exon: 1252.. 1001 

Start ATG: 2769 (Reverse strand: CAT) 
Transcript No. : CT20 604 

CTTTGAAGCATCTCAAAGGAAAAAGAGCGATAATTCTGATAAGAAAGTTGGCGTAGCCGGAAGGCGGATTGTCACATACAAAATAGTTTGGAAAG 
CCCAAACTGAGGCGAACCAATCACACAACAATCCACGCCCACAATGACCGCCCTGCGATACATGGGATGGATCACCGGCTGTTCCGTCCTAACGG 
CCTTTGTTTCCATAATTTGGCAGGCATTCATCGCCCTGCAGACGCTTAACAGGACGACTGTCCTGCTCACCGGTCTCCTGGCAATCGAGGGATCC 
CTGAAGCGGACCGCGTTGGAACCGGCACCCAAGGTGGCCATCGGCTACGGAGCCTGCACGGACTTGCAGATAAACGCCACTGAATTTCTGGATCG 
CTACTATGGGCGCCGGATTCCAGTCGCAGCAGCCACTACAGGATCTCGGGCGGTCGTGAACAACGAGGACGAGCTCTTGCAGTCCTTTGCCTATT 
ACTTTCAGAACGGCGCTGCAGCGGAGCGAGTGATGGCAAACAGTACCCTCTTCACACAGCTGGTGGGCTACGCCAAGGTGATGGACAAGGAGAGG 
ATCAATTGGTACATGGGTGGTAACGCCCCACTGATGGCCGTGCGCTTTTTTATGGAGGGAGCTCAAGTGCTACTGGGCGCCCACATGTCTAGAAA 
GCTGCGGCCACTGCTGCCAAAGGAGATTCGTCTGGCTGGTGACGAGATTCCAAACGATGACATCCATCTTATTCTGGAGTACAAGGCTGGTGACA 
AATGGGGCCCATATGTGGCACCTCGCGCCAATCGCTACATTCTGCACAACGATCGAAATAATCCGCATCTGCGGGCCGTGGAGCAGTTGACGGAT 
GCTTTGAAGATGTATCAGCCACAGCTGTTGGTTGTCAGCGGGCTGCAAATGATGGACATGTTCACCTTTAAGTCTGGTGAGCGAGAGGCGCGCCT 
GCAGCAAGTACAAAGGCAACTAACCAGCCAGCCGCAGGGCACTCTGAACCACTTCGAGATGGCTTCTTACGTGGAGTTGCAGCTGCTTCAACAGC 
TTCGACACTTTGTACTGCCCTACGTAGACTCCCTGGGAATGAACGAACAAGAGCTCTCTAACCTGCAGCAGGTTCTTGCTCACGGACGGACGACC 
TTAGCCACCGACTGGAATCCGCGCATAGCTCACACCCTTGACCAGATGCGGCAGGTTTTCATTAGTCTCCTTGAAGATTACGAAGACCGAAGCTC 
AAGCGATGCCAAGCGTCGCTCTATCTCGCGCATACATGTGCACACGTTGGCCTATCAGGCGATCCTTACCACAGCGGGCTCCAAGTGGAAAAACA 
CACGTGCTGCCGCTGCAAAGGCAGCGCTCACTGCTCACCGATACGTCTGCAAATCCCAATTTATTAACCCGGAGGCAGTGCTGCAGGTTCTAGAT 
GACAGTTTCGCCACCTCGGCTCAGGCGGATGCGCCGCGAATGCGCATTGGTGCAGCCAGTCCCGTGCCCTGTTGGCGGGAGTATATCCAGTACGG 
TCGTCACCGGCAACGGCTAGAGGTGGAAATCTGTGTGGCCCCGGTGCTGGTGTGTCGCGAAGCCAGAAAAACGGCTGGGGCTGGCGACAACATCT 
CTGCCTCCGGACTCGCGGCACAGTTGTAG 
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(SEQ ID NO: 1442) 



Start ATG: 139 {Reverse strand: CAT) 

MTALRYMGWITGCSVLTAFVSIIWQAFIALQTLNRTTVLLTGLLAIEGSLKRTALEPAPKVAIGYGACTDLQINATEFLDRYYGRRIPVAAATTG 
SRAVVNNEDELLQSFAYYFQNGAAAERVMANSTLFTQ 

EIPNDDIHLILEYKAGDKWGPYVAPRANRYILHNDRNNPHLRAVEQLTDALKMYQPQLLVVSGLQMMDMFTFKSGEREARLQQVQRQLTSQPQGT 
LNHFEMASYVELQLLQQLRHFVLPYVDSLGl^EQELSNLQQVLAHGRTTLATDWNPRIAHTLDQMRQVFISLLEDYEDRSSSDAKRRSISRIHVH 
TLAYQAILTTAGSKWKNTPJy^KAALTAHRYVCKSQFINPEAVLQVLDDSFATSAQADAPRMRIGAASPVPCWREYIQYGRHRQRLEVEICVAP 
VLVCREARKTAGAGDNISASGLAAQL* 
( SEQ ID HO: 1443) 

Name: transacylase-like 
Classification: enzyme 

Celera Sequence No. : 142000013384485 

GTCTGCGCACGGAAACCAAAGACTGGCGGGAAAACACCCTAAATGTGCCGAAAACTGCGCTTAGCAAAGTTTTAAGCCAGTTGATTTAATTAATA 
TGCATTTTTGTGGTAATTGCCAACGGTCGCACTTAAAATAAAAAAACATTTGTTTTACATTTCGGATCCAAGGAAATAAGCCATTATTGGCTTAA 
ATAAGCTTTAACGACTGCAATCATGAGCACCCCTCGCAAGAACTATAACAACCTGCCGCCCTACTGCCCACAAGGCGGCCAGTACCCATCGCAGT 
CCGCGGATTACAGAGCGCAGCAATTCGGTGGCCAGCAGAAGCCACAAAATCCAAACGTTGGCTTCTACGAGGACAAACAGGGCCAGCAAAATGTT 
CCTCATTTCTACCAGAACAAATCCCCACAGGAGCCGCAGCGCCATCGGCCCTACGGTCAGGGCAGGAATTTCGGACGCGGCGGCAGCTTCAATCG 
CAGGAACCAACAGAACTGGGATCATCCGAGGAGCCACCAGAACAGGGAACAAAACCAGAGCGGCAGCTCCTTCGGCCAGTACTTTCATCCCTCCA 
TGCTGGAGGATCCTTGGCGGGAGCTAATGGAACGGCACGAGGCCATCCACGGACCCAGTACTAGCCGAACTTCACCGAAAGAGGTGGCGGCTTCC 
TAAAATGTAGAATACAACACTAAGAGCTAGTTAAGCCCTGTAGGATAATTTCCCCAAAGCAAAGAAAGAATTTTAATTTTAGCTTTCCGATATTA 
AACAAAAGAATGTGTAAACATAAAAATGGTTTCATCATGAGATCCGAGCCTGCTTTTGAGTGGCACATTACGTGCTGAAAACCGCTCCGATGCCT 
TCAAGAAAGAACTATGGCTAAGGTTGTAACATTTTATACATTTAAGCAACAGTTTCACGTGAATGAAACCCGTGTGTATACTTAAAAATTTAAGA 
CTACCGTAATGGAACACCCCTCAGTGCAATGGGGCTCCCCGAGTGAGCTGCTAATAGTGCTGCTGTTGATGCCGCTTCTTCTCCAGATACTTCTC 
CCAGTCGACGGCAACGCGATTGTAGATACCCATGGCGGCACGTGCATCCTCCACGGAATTATGCTCGCCCGTTTGGATCTCCTGACCCAGAACGG 
CCTTTGTCAGGCGTTTTAAGCTGGGCGTGTGGGTATTAGAGATGAGTTTGCACAGCGGCTTATAATGCGACGTATCACGAATGTCATGGAACGGA 
TGGCGGATGCCCAGTACGGCAAGATCGTTGCGCAGGCCGTGTCCCACCAGAATGCGGCCATGTATCAGCTTCATCACTTCGTTTTGCACGGCGGC 
AAAGTCCTCACCGTTAGCGATGTCCTGCGGCCGAATGCCGGACACGCTGGTGCGGTAGTCGGTGACCTCCTTTCGCGGCTTCACATACTTGTCCA 
GCAGCACGTGTCCCATGCGATTCACGATGGACACGCGTGCCAGCATGTCGTCCCGTGTGTTGTGTCCCACGCCCACCATCTCGCAGTCCATGGCC 
AAAATGCGATTGCGGTGTGCCTTCTTTTTCATGCGCATGCGTGCCGATTTGCTTAGTGGTGCTGACTCCGTCGCCTTGGCATCCTGGTTTTCACT 
TCCAGCTGCTGCTGCTGCTCCACCGCCAGCAGCGGCCATCCAGTTGGAGCCGGCTGCCCGGCGACTTGTGTTCACCTGTTGCACCTGTAGGCGCT 
CCATGGAGGCTTGCAGCTTGCGTTGATTGTGTTGCGGGGAAACCACATTTTTCGTGCTGTCCGTGGTGTTTTTGTTGTTGGCAGACCCGCTTCTG 
CTGCTGTCCAAGTTATTGGCCCGGCTACGATTCATGTCGAACTGCTTCTGGCGTGGCGTAGCGGACATGGCGTCGGGGCATCTATATATTCCAAA 
TAAATATCCAGCTTAAGACAATCTCGAATAACAAATGTGTTTCTTTATTTTAGTAACACCGACCAGTGTGACCGTCTTGGGGTTGCTGGAGTATT 
ACTGTCGACGGGCTTTTGGTATTTAAACAGTGTCAATGGGTGCGTTCAGCAGGTAAATAGCAATCACGCCTTATGTAGCCTGTGAACGCATTCAC 
TGCGTCAATAATATGTGCATTGATCGCTCGAACAAATAAGACCGTAAAAATCGCTATTCATTTACAGTTTCAGACGTTGAAGTAGTGTAAGTTCA 
TTAAAATATTGAAGAAATGTGGGAGTATTGACTAATATTCTAAAATCACTTATGGCGGTTTTATATTTTAACAAATCGTAAAAATATCAAACTAA 
CGGATTTTATACATATGCATGTGCACACTCGAGATTGGGCTTGAAGTTTTGGATATCCACTGCGAAGTGCGCACTTGATAAAGATGTTCGTATGC 
AAGGAGTTTTAAACTGGCTCCCGCGTATTTCGTTCGGAGTGCATCCTGGAGCACATAGATGTCAAGTAAGCGCAGCTTACTTAATTCGCATTAAG 
GCATATGGTCGTCGAGGAAGTAAAACTTTACTTGAAAAGCGGGGCGTGGCCCCTATCTGCATTCGGCCCCTTCGGCGGCCTGGGCAACTTTCCCA 
ATCACATCGAGGATTATTCATTTGAGGAATGCCGTTTGCAGCATTACATGGCTCAACAGGAGGATCGCCTCACTATTTATGAGTTGCAGTATGCT 
CGCGAAATTGACTTGGCCACCATGAAAATTATTTGATTAAGATGTCGCCCCAAAAACTACTGGAAACATTGATTGAAATTTACGATCAGCCTCTA 
ATCGACAGTTCCTCTGTCGCATCCGTCAGGGACCAGACTTCGGACGAGAGAAAATGCCAAGTTCAGCCTCAAAGCTTCCCAGGTTTTCAAACCAA 
CTTTTCAAACCAACTGCCAGTTAAACATCTCCGGTGACACGGCGAGGCCGTCCAACAGAGCATGCTTAAGATGCTGGAAGGCAGCGTGGTTAGCC 
TAACCGACCGGGAACAACTGTTTTAAAACTAATAT 
(SEQ ID NO: 1444) 

Exon: 1980, . 1001 

Start ATG: 1873 (Reverse strand: CAT) 
Transcript No. : CT21153 

CCAAGACGGTCACACTGGTCGGTGTTACTAAAATAAAGAAACACATTTGTTATTCGAGATTGTCTTAAGCTGGATATTTATTTGGAATATATAGA 
TGCCCCGACGCCATGTCCGCTACGCCACGCCAGAAGCAGTTCGACATGAATCGTAGCCGGGCCAATAACTTGGACAGCAGCAGAAGCGGGTCTGC 
CAACAACAAAAACACCACGGACAGCACGAAAAATGTGGTTTCCCCGCAACACAATCAACGCAAGCTGCAAGCCTCCATGGAGCGCCTACAGGTGC 
AACAGGTGAACACAAGTCGCCGGGCAGCCGGCTCCAACTGGATGGCCGCTGCTGGCGGTGGAGCAGCAGCAGCAGCTGGAAGTGAAAACCAGGAT 
GCCAAGGCGACGGAGTCAGCACCACTAAGCAAATCGGCACGCATGCGCATGAAAAAGAAGGCACACCGCAATCGCATTTTGGCCATGGACTGCGA 
GATGGTGGGCGTGGGACACAACACACGGGACGACATGCTGGCACGCGTGTCCATCGTGAATCGCATGGGACACGTGCTGCTGGACAAGTATGTGA 
AGCCGCGAAAGGAGGTCACCGACTACCGCACCAGCGTGTCCGGCATTCGGCCGCAGGACATCGCTAACGGTGAGGACTTTGCCGCCGTGCAAAAC 
GAAGTGATGAAGCTGATACATGGCCGCATTCTGGTGGGACACGGCCTGCGCAACGATCTTGCCGTACTGGGCATCCGCCATCCGTTCCATGACAT 
TCGTGATACGTCGCATTATAAGCCGCTGTGCAAACTCATCTCTAATACCCACACGCCCAGCTTAAAACGCCTGACAAAGGCCGTTCTGGGTCAGG 
AGATCCAAACGGGCGAGCATAATTCCGTGGAGGATGCACGTGCCGCCATGGGTATCTACAATCGCGTTGCCGTCGACTGGGAGAAGTATCTGGAG 
AAGAAGCGGCATCAACAGCAGCACTATTAG 
(SEQ ID NO: 1445) 

Start ATG: 108 (Reverse strand: CAT) 
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MSATPRQKQFDMNRSRANWLDSSRSGSAWNKNTTDSTKNVVSPQHNQRKLQASMERLQVQQVNTSRRAAGSNWMAAAGGGAAAAAGSENQDAKAT 
ESAPLSKSARMRMKKKAHRNRILAMDCEMVGVGHNTRDDMLARVSIVNRMGHVLLDKYVKPRKEVTDyRTSVSGIRPQDIANGEDFAAVQNEVMK 
LIHGRILVGHGLRNDLAVLGIRHPFHDIRDTSHYKPLCKLISKTHTPSLKRLTKAVLGQEIQTGEHNSVEDARAAMGIYNRVAVDWEKYLEKKRH 
QQQHY* 

(SEQ ID NO: 1446) 

Name: EXONUCLEASE-LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384534 

AAATTCGCAGCCTAATTGCTCCTATTATTAATTAGGAATATGGCAAGTTCTCAGCTCGTTAAGGCTCATTTTAGAATGTTTGTCGGGCAATTGTT 

TTCAACATCTTTATGGCATTATTAAATTACTATTGGCGTGCGCCGAAACAGCAGGTGGGGGAAATCCTTATAGAGCGGGTCTCTCCAATTTAAAG 

CGGCTACTTGTTAAATTTATAATGCAATTTGGCGGTGTTAACTATTAACGAGGTTAAGTGGCGCAATTTGAAACCACATTTGAAGGATTTCCCAT 

GGTTTAATGGTGTGCATTTTTAAGCACCCTAGTGCACGGCATTTCTAGTACTTGGCTTAATAATAAATGTAAGAACAAACTTATAGACCAATTAG 

GTTCTAGCAATTTATCAAAACGTTACGACGTGTTGTATTCTTTCTAAGGCGTAGAAAATAGCGAATGGGAAAGCTTCTTATTTATAATTCATTGT 

TTGAGGTAATTTACTATTGAAGGTTTATCACTTAAAAGCGGAAACAGATATGGCTCAGACTTTAACCTAGGCATTATATGTAGATATGAAACCTA 

TTTACAAGTTTTATATTTCTCTCTGAACTCAAAATAGAATTAATATATAGTATTAAAGATATACTGCTGTGCAATATATAGGAACAAAACTAAGC 

TTCCAATTATAATAAATGCAATAAATTAGTATAAATTCACATGAAACCCTTACCTCCGCCGACGAGAATTACCTCCAAAACTCGATTGGCTTTAA 

AAAAAAAACCGGTAATAGAATTGTAACTCGCCTGCTCAAATGTTGTTTAATGATTCAACGAGCGCACTTTCGTTTATAATGAATTCAACCAAAAG 

TCCAGTTTCCAATTGCGATAGCCCAATATCGCAATATCACAGTTCACACGCACAGCTCGAACACTAAAAAAACACTGAGCCAGCTGAGAGGCAAC 

TGCTAACTGCTAACTGTAAACCAGTTCGACTTCAAAACAACAACTAAGTGCCGTTCGCTCTCTTCGCCTTCTCTTTTCTCTCCCGCTTTCTTCTC 

TCCACCTCTTCACTGCTTTGCTGGTCAATCTAGCCTTGTGTGCGTGAGTGTGGGTGCGCCTATCACGATGTACAGGTAAACTTTTATTATTACTA 

AATGTCCAGAAGTCTTTAATTAAAATAATTACTTATTGCGACAATAAGTAAGAAGGTGAAATGTAAGACTTCCCAAATTGCAATAAGAACAATAG 

CTGTGAGAGCAATAATAATCTTCAGCTTTTATCACTGAGCGGAGCGACATAAGCAAGCTGCACTGTACTTCAGATGCAATGTTGTTGTTTCAGTT 

GTTGTAGCTGCTCCTTGGCTTTCACACGCGAGTTGTGGGTGAGCCTTAATCACATTTATTCGATGGTGAGTGGAGAAGGAGGTGCTGAGTGGGCG 

AGGGAGCGGCGCCGAGAGAGCGAACGAGCGGTTGACATTGACAACCCCCCTTTTTTGCTGTTGGGAGCGGGCGCACGCCCATGTGTGAGTAGGCT 

AATATCGCTACACTTCGAAGTTTCGTTGAATTTGCGCTCTGTTTTGTTTGCTTCTCTCTTTATCGCTCTGCGCTTTGGCTCTCCCGTTCGCTATT 

TGTTGTTAATTCGAATACGAATATAACTTCTAGTTTTTATGGTGCCTTTGCGCTCGGTTTTATGACAAAACCATTTTTTTTCCACTGCGGGTAAT 

TAAGAACTCGCTGGCACGAATAACCGAACCAATTCAATGTGAATCCACAGTGGAGGCGATCGAGGAACAAAAACAGTCCGCACCATCATTTATTA 

ACGCCAATTGTCAGTTCAATTCATTTTCTTCAACTGCTTCTGTGCGAAGAAACCGTTTCATTGCGTCACAAAGATGATTATGCTCATATATTATC 

GAACTAATGGGGGGAATTCAAAATTATATAGAGCTGTAAAACAGGAGCCAGCTAATAAATCGGCTAGTATCAACTCCTTATAAATAGCCTCTTTA 

CGCAATACTATTGAATACAGAAAAATAAGCCATCAAAGTCAGCATTATTTGCAGTGTTTGGCGACCACTTCCCCGATAAGCCATACATATAGAGT 

TACGTAACTGGAGATCGGCGACTCGAGTGGCCGGGCTTTGGCTTTATATAGCTAATTAACTGGACGATCGACCAGGAGCACTTGTGTGCAACTCG 

GCAAAATACTATACTTCAGATGTGAAATTGCTAGATTTCTAGAATCGATCAGATTTTCCCCATTCATAAACTGGGTCTGCGACTGTGGCTAATCA 

G C GC AAT ACTG AT T GAT C GAT T G GAAGT GCC AT T GG AC AAT T T AT AG AGC G AT C C AT AAAT CAT AATC GAC TGG T AT T T A T TG T GC GCT AT T C GC 

AACTACTCGAGCCCAGCTTTTAGGGTTTCCGTTCCAGCTGGAGATCTTTGTGGGGACGCAAGGCTTCTGGAAACCGAGACCCCGAGAAAAAGATC 

ACATGATAGAACCCCCCCATATGATTATGATGATTGCATAAAACTCGAAAACTCGCGACTTTGCAAAGAGAGCGCGACTCACAGATACTCTGGAG 

AGTATTATATTCACATGACAAGGGTTCGACTAACACACACACACACTGCGAACGTAGTAGGTGCGTACCTGAGTATCTCAGAGATACTCGGGGCT 

TACGAAAGAGATAGCGCTACGCGTATCTGCCGCTCTGACTTTAGCTGCGACTTCTGGCAGGTGTTTGCACAATGGGTCCCATGACAGGTGTGCCC 

CGCAGCCCCCAGTAAGCTCACTTTTCCGACCAGTAGCTGTCAGCAGCCAGCATCACGTGCCAAAGTGCATTCAAATTGGGCTTAATTGCACATAC 

TCCCTGCTGCCAAAGCACGTGCTTTGGATGGCTAGTTTACAGGGGATATGGGATATATACGGGTGGATAACCCGTGGCACAGCTTTTACACCTCG 

AACCCGCCGGGTTTTTCCAGTTGCTTCGAGTTTTTTCGAGTTTTTCGCTTACCTTCACAGCTCCGAAAACGCTTTTTATGTCAAAGACACACCAG 

ATTCGATTCCCCGCGAGTGCGAGCGATAAATTTAGAACTTGTTTGTTGCCGAGCGAGAGTATCTTTATTCGAATCTGAATCTGAATGGAGCGCAG 

CACACGTCTGCTCACCGCCAGCGATCGACTGCGTTTAAATTTGGCACTCGCTGCCGCGACTTCGGTTCGAAAATCTCGCGATCGCCTCGAAATGC 

CTCGCACTCTCTGTTACTCAGCACACACACCCTGTGAATTCAGGCGGCTCTCTCGGCGCGGGCATGCCAACTATGTATGATCGGTGACCGTTATC 

GCAAATCAGAAATCGACAAATACCCGTACTAAAGTCAAAATCCTCGAGTGCTACATGTGTCTACCAAAAAAATGGTCAGTTTCCAAAAACCTAAT 

CACTCTTAGGATCACCCAATTAATCACCATTAATATGTGATATTTTTGCAGTAACTAGATGTGCTTGTTGATAGTGAGGAAGCCAATCTCAAAAA 

AAAACCAATTGAACTAACACTTCTTACAATTTAAACTTTTAATTAGAATTTAGAATATAAAATAATATTAAGCTAATTTGGTAACAGTTTCATTT 

TAAATTTCAACTGTTTTAAGTGCTGTAAAGTGCATAAAGTGTTAACAGGTGGCTCCACGCACAACGCGTTAGTACCTTACCAAATAGCGTTTTCA 

CTGCACCGCTCACCACGGCCACACTCTGCCTTTGTTTTGTTTTGATATTCCCACTGCATCGCATTTAATTTATTAAAAGCTGTTACCAAATTATG 

TCCTACGCCGGCTATAGCCAGCTGCCCAGCGGCAGCGACCAGGAGCGCAGGCAGGCGCAGCTCGCGGCCAGCACCTACGACGTGCTACAGCGGAC 

GACGGACAGTATCCAGCGATCCAACCAGATTGCCATCGAAACGGAGAACATGGGGGCGGAGGTAATTACCCAGATAGTAGCCATTAACAGGGCTA 

ACTAATGCTCGCTTACGCATCAACAGGTACTCGGCGAACTGGGCGAGCAGAGGGAGTCGCTGCTACGCACCACGCGCCGCCTGGAGGACGCCGAT 

CAGGATCTGTCCAAATCGAGGGTCATCATTCGGAAGTTGAGCAGGGAGGTGCTCTACAACAAGATCATCCTAATCCTGATTATCATTCTGGAGGT 

GGGCATACTCGTCGGCTTGCTGGTGCTGAAGTTCGCTCACCTGTGAGCCAATGCCATTCTATTCCAAAGTCGTAACCACCAAACAGGCGATGGAC 

TATATATACCTATTAATCGTTACGAATTCTTTGTCCTAGTTAATAAGCTTATAAATGTATATACTCCACTAAAATGCACTTCCAATTCAAAAAGG 

TCAGTTTTAGGATATACCCAGCGATGAATCGGGTGGTTGATTATGTTTATTCATGTTGGCTAAAGCGTTAATGCCTTAGACGTCTATAATATGTA 

TTATGGTAGTAAAACACAGATAAAATCGGGTCTGGACATCGGAAGCTCTGCACTCCGAAGAGTTTAGCCCAGTCCCAACGCATATACATACGTAT 

GCTATAACAATGATAATATGTAGTTCGTGCAGCACGCTATGCCACGACTATTTGGCAAAAAGCAACGTTAATTACATTCAGTGGGCGACAACGAG 

ATAGTAGAACATCCTGAACAATGTCTGCATAATACTTGATCTATGATAAACGGTTACATATATGTAAGTATGTGGGCATCGCTTTGGGGATGCTT 

CTTTGTTCGAGAGATGTTTGGGATTTCGCAGAGCTGCGCCTCCTCATTCCTTTGGTGCAGCAGCTCTCCGCTTCAGTTGCCTTCAATTGTCCTTC 

GGAAGAACGATGGACCAGCCGAAAACGCGGCGATCGAAGCCGCAGGAGAAGAGATCGCAGTCCAGCAGCGGCTCCTCCGCCCACGCTACAGACGA 

CACCATCCGTCGGTGGCCCTGCCGACTGGTCCGTGGCAATCGGGCGACTCTCTGCCCGTTCAAATCGAACAGTCGGATCTGTCGATTGTCATGCG 

GTATGGCGATAATTCCACCAGTGGAGACCGCCAAACGATTGACCGAGGAATCCGTTCGAATGGTGGCCAAGGCGGATCGCATGTTGCGCAGCTCC 

C AGAC C T T GAT T GT AC GAT CATC T G ATC 

(SEQ ID NO: 1447) 

Exon: 1001.. 1509 
Exon: 3818.. 3956 
Exon: 4017.. 4158 
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Start ATG: 1506 



Transcript No. : CT23167 

CCGTTCGCTCTCTTCGCCTTCTCTTTTCTCTCCCGC7TTCTTCTCTCCACCTCTTCACTGCTTTGCTGGTCAATCTAGCCTTGTGTGCGTGAGTG 
TGGGTGCGCCTATCACGATGTACAGGTAAACTTTTATTATTACTAAATGTCCAGAAGTCTTTAATTAAAATAATTACTTATTGCGACAATAAGTA 
AGAAGGTGAAATGTAAGACTTCCCAAATTGCAATAAGAACAATAGCTGTGAGAGCAATAATAATCTTCAGCTTTTATCACTGAGCGGAGCGACAT 
AAGCAAGCTGCACTGTACTTCAGATGCAATGTTGTTGTTTCAGTTGTTGTAGCTGCTCCTTGGCTTTCACACGCGAGTTGTGGGTGAGCCTTAAT 
CACATTTATTCGATGGTGAGTGGAGAAGGAGGTGCTGAGTGGGCGAGGGAGCGGCGCCGAGAGAGCGAACGAGCGGTTGACATTGACAACCCCCC 
TTTTTTGCTGTTGGGAGCGGGCGCACGCCCATGTCCAGCTGCCCAGCGGCAGCGACCAGGAGCGCAGGCAGGCGCAGCTCGCGGCCAGCACCTAC 
GACGTGCTACAGCGGACGAGGGACAGTATCCAGCGATCCAACCAGATTGCCATCGAAACGGAGAACATGGGGGCGGAGGTACTCGGCGAACTGGG 
CGAGCAGAGGGAGTCGCTGCTACGCACCACGCGCCGCCTGGAGGACGCCGATCAGGATCTGTCCAAATCGAGGGTCATCATTCGGAAGTTGAGCA 
GGGAGGTGCTCTACAACAAGATCATCCTAA 
(SEQ ID NO: 1448) 

Start ATG: 506 

MSSCPAAATRSAGRRSSRPAPTTCYSGRRTVSSDPTRLPSKRRTWGRRYSANWASRGSRCYAPRAAWRTPIRICPNRGSSFGS* 
(SEQ ID NO: 1449) 



Celera Sequence No. : 142000013384250 

TGTCGCCCACATAGATGCTGTTTCTCTTGTCCGGCGAGGATTCAAACGACGTGGAGGCGGATACGTCCATGGTCGTGTTCAGTTGGGTCAGTTGG 
AGGCCTGGAAGAATCGGTTGCTAGTGGAAATTCGGTTTCGGTTGCTGGCGAGGCGACGTTTAAACATCTGGCGAAGAACTGTGGAGTACCCAGAG 
TTGAAGAGCGATTATCAGAGACATGTATATAGATAATACGTTTGCCTTACTCTCTGCAAACTGCAAATTTGTTTTACTTTTCTGCCTACGTGTCG 
TACATTGCCAGTTGATTATGTTCAGTCGGGTTTTCTCGACTGACTTTAAAGTCTGCGCCACACACGCACACACATTTGTGCTCCACATATTTGCC 
TTAGAATTTGTGTAATTCACCTGGGCATTCTGTGTTTTCTTAATTTACTCACCTGAAAATTTGTGTTTAGTGTGGGTGTTGTTGCACTGCTGGAT 
GTTTGGGGCACATTGGCAGAGGCCTTAATTGACGACTGTCTGCTGATTGAAAACTCAATTTGATTGGCGCTGATTACGTTTTTGATTCAATGCGT 
GCTCTTAATTTAAATAAAAACCCAAAAAACTAAAGAAATTAAATTGTGATTAAATTCTTCTTGCTAGCGTCGACCAGTGTTGCCTAATGCTTAAA 
ATGCAATTATAAGAGCGACTCTAGTGCAAGAAACGAGTAATCGATTCTGGTTCGATAGGCAACGACTTCGATAAGGCGCAAATTAAACTAGTTGT 
AAAGGTTTTACGCTATATGTTGTATGCAAGAAAATCAACATAATTTAGCTATAAATATAGGCTACACTTATTTTTAATTATAAGTCCAAATTTCT 
TAGTGCATAGGAGTGGAATAACTAATTTTATATCAGTTTTTGGAATGCCCCTAACATGTCCGTGGTATTATTATTTAGTAACGTCAGGGTCACAC 
TTTTGTTTTGTTTTGCTTTGCAACTTGACTTTTAAAATTGCAATTGCAGCATGGCCGAAGAACCCGAGGAACCCGCCAAGGTATGGATTACATAA 
ATCTATTGGACATCAGGTGCCTGATTAAGCTTTAACTATGTTTAATCTCCCATAGAGCTTCGTTATCTTCGGACGCGATGTGGCGCAGATCCCGT 
GCTTTCGTAACAGTTTTCTCTACGGAATCAGCGGAGGAATCGGCATCGGCTTGCTGACTTTTCTGGGCACCTCGCGAACCCACCTGTCCACCCAC 
GTGGGCTTCGGCTCCTTCTTCTGCGGCACCATCGCCTACTGGATGACCTGCAGGTAAGTGACTGTGATTAAAACCGTGCGGCGCCCTTATCACTT 
AATTATCAATGTGGCATCCGCAGGTATCAATGGTCCGTCAGGAGATTCGAGCAGCAGCAATTGCGTGAGGCGATGAGACGACAAGCCCTTTATGA 
GGGCACCCAAAGGGAGAGAGACTTAGATCTGAAGTCGGCGTAGCATAAGTCAGATCCTTCATTACTACTTCGATGTTTGTTTTCACTCTAGGATA 
CCAAGTAATCCTGGCArTGTTAAATCCCTTGTGTTGTAAACTACCTTTTGTTACCAAACTTTATAGTAGCCAATATATTATATATATGTTGTAAG 
TGAAATCTTAGCTTGCCACTCGTAGGGAATTAATTATGCCACCATCTTTGATTTGGGATTGACAGAAATCAATCCATCATTTTAATGATTGTGGT 
TCCTAAAACCTGTTGGGCTTTATTGTGCTTTCGATTTGATTTAAGCTGATAGAGCTAGTTGTTTACTTTTGAAGAACCTTGAACCCATTTTGACG 
ACCTTTTAACTTTTTAATAGTTTTTTTTTACCATTTACATAAAAGCAAAATTTACAGTCTAAAAGTTGTATATTCAATACGTTCTGGTACGTAAA 
AGGGTTATTTAATGCACCATTTTTACAACTTTAGATAGTATACTTTTTACTTGAGTGGGAGCCATATCACAAAAATGAAATGAAAGGTTAAAAAA 
GTATCTAAAAGTAAGTTTCAATTAATATTCTGAAAAATAATTGCATTTCATGTACGTTTCTCTGAAATACCCCTAATTCGATACTATTTATAAAT 
CTGGAAGATATATAGCTAGGAAATAGATTTATATATTTGCCTAGTTTGAACACCTTGTGAACATTTTGTTTAAAAAAAAAGCGTATCGTGGTACT 
AGAAAACTTTAAGATACAGATAAATAACTTCACTATACCGAGGTTTTGGTGTTTCTTGAGACTGAAGGACAATCTTCAAAATGGTCTTAATTCTT 
G AG AT AG AAAAGC T G C AGT T CT C AT GCT G AAT AGT G AT C AGTACGATTT T ATC TG AC AG AT T GC TTTCCTAGCT GAC C AC AC C GAT CT C GG AT GG 
GTATCTGCTCTAATCCGAGATAGCAGGTAGTAGAACTGCTGCCTCGTAACCCCTAGACATGGCCATTTCCACGCACATGTAGGCGATCCTGCA 
(SEQ ID NO: 1450) 

Exon: 1001.. 1030 
Exon: 1101.. 1288 
Exon: 1354.. 1468 
Start ATG: 1001 

Transcript No. : CT23225 

ATGGCCGAAGAACCCGAGGAACCCGCCAAGAGCTTCGTTATCTTCGGACGCGATGTGGCGCAGATCCCGTGCTTTCGTAACAGTTTTCTCTACGG 
AATCAGCGGAGGAATCGGCATCGGCTTGCTGACTTTTCTGGGCACCTCGCGAACCCACCTGTCCACCCACGTGGGCTTCGGCTCCTTCTTCTGCG 
GCACCATCGCCTACTGGATGACCTGCAGGTATCAATGGTCCGTCAGGAGATTCGAGCAGCAGCAATTGCGTGAGGCGATGAGACGACAAGCCCTT 
TATGAGGGCACCCAAAGGGAGAGAGACTTAGATCTGAAGTCGGCGTAG 
(SEQ ID NO: 1451) 

Start ATG: 1 

l^EEPEEPAKSFVIFGRDVAQIPCFRNSFLYGISGGIGIGLLTFLGTSRTHLSTHVGFGSFFCGTIAYWMTCRYQWSVRRFEQQQLREAMRRQAL 
YEGTQRERDLDLKSA* 
(SEQ ID NO: 1452) 

Classification: hypothetical 
Gene Symbol: l(3)87Df 
FlyBase ID: FBgn0002354 
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Celera Sequence No. : 142000013383919 

GGTTGGATCCATACTGACAATATGTAAACTTTGGGACTGCCTTCTGTGTCATCTGTATTCATTGGAACACAGTTGCCATCAGCGCGTTTAATTAA 
CTTATCTGAATTGATTTACATGCATACCAAATTCCAAAGATCGCTCACAGAGTATAAAGGAACTTGAAACCGATGTTCGTTTCTTTACAATTATA 
TATTTCAAGTTGCTTGACATATATACGTGCATGCATATATTTGGGCGTGTTTGCGGCCTGTGTTTGTTTGTTTTGGGTGTGAATCTGGAGTTTAT 
GGAAGTGGAAAGCGGGATTGTTCACCTATCCTTACATTTTACGGGGATCACTAGGTGTTAGGAAGCATCACTATTTCTGAATTGATTTGTGCCGA 
TTTCGGAACTTAATTTTCATCAAATGTTTGTGTTTAAATCGATTTATTTAGTGTAATTGTTGAGTTATCTTGTAAATACATACATTTATTATATT 
CCAAGTTTCTCACATAAACTCTATGTTTATACATTTTTTGTCGCTTTCATCTTGCAGTTTTGTCTATACTATTACTAAACCTAGATACGATATTT 
AAGAGGCAACCAAAGAGATCTGGCTTGAGATCCCTGGCAGCCTAGTGCGAAACGGACCGAATGAACAATTTGATTTATTGAATTAGTTTTTTTCC 
TTTGCTTGGGTAATAAGAAATCTTTCTTTTAGTTTACTGGAGTGTTTAATACATATACTAAATGTAAATATATATTGAAAAGGTTCGTGTGTGTA 
TTTTTCTGTCTTCGTGTTCAGTTGTCTGTAAAGTTGGTTCGGTTGTGGGTGTCAGGGAGTGGGTTTCAAATTGGGGTGTGCAAAAGTCCTCCAGA 
GATCTTGAGCGTGCCCACAGCTAAAACCTAAAACAAACGAAAAAGTATGAATATCCCAGTTCACGCTGGTGATTCCTTTAACTTACCAGTCCTCT 
T C G C AT GC C AGT AT GGT AT AGTG GTC ATG AC C AC GC CC C T AGGGC C AATC C TAGTCGGTCTTTGT C TC CGT AAACT C C T C C AG GGC AAAAAAGC T 
GCCGCCGGAGGGGCCACCAGTGCTGCTGGCCGTGCTCGTGGCAACAACGGTGGTGGATGCCGTGCCACTAGATGCCTCCGACACTTGCACTTCCT 
CGCCGGCCGAGTCGGCAGCCGCCTCCTTGTCATCTGCACCTCCGGAAGCGCTGGCTGCCTGCTGTGCGGCCTCCAGGACCTGTTGCTGAACCTGG 
ACGATGGGCTGGCCATCTTCCGTGTACGACACGGACACTGGGATGCGGGTACCATCCTCGCTAATGACATGAGCGCCTTCCTCACCATCCTGTCC 
CTGCTGGATAATTAACTGCCCCTCGGAGTTCAACAGGGAGGCATCGATAAGCTGCTGTTCATTGTTGTCACCATGGACAAGTACCATCTGGCCAC 
TGCCATCGGTCATCATCTGGAGGGTTTCGCCCTCAAGTCCCTCGCCGGACTGCAGAACTCCTACACTTTGGAGCAGCTGCTGTATTTGCTCGGGA 
TCTTGCACAATTATTTGGTTGCCTTCGGCGTCCGTCAGGATGTGGTTGTCCAGCTGGAGGAGATGGCCGTTCTCGTTAACAATGAACTTGATCTG 
GCCACCTTCCACCTGTGCCTGCATGGCTTCGTCAAGTTCCATGGCAACATGCTCGCCACCTGCAGCTTCACCCGCCTCGCCCTCGGCTCCAGTAC 
TGGCAGTAACCTCGCCTTCGCCAGATTCACCGGCTTCAGCGGTCGTGGCATTGGCCTCCGACTCAGTTGTGGGCGTATCTGTAGCCTCACGCTTA 
GCAACTACGTCTCCTTCGTCAGATGGTTGTGGTTCCTCAGCAGCGGCGGCTTCCTTTGACTTGGCGCTTTCTCCAGCCTTGGCCTTGGCTTTCTC 
TTGACCTGCTTTGCTAGCCGTCTTGGTTTCCTCTGAGGCCTCAGCTTCAGCAGCTGCAGCCTCGTTGTCATCATCTAAACTGGGCATGGAAACCG 
GTGCGGCATTGGATTTGCTATCGGCAGAGGAGGCCTCGCCGCTTTCGCCCTTGGCATCGATGCTGCCCTGGGACTGTGATTCGTCTTCGGCTTGA 
TCCAAATTTGGCATGTCAATGCTAACAGAGACAGGTGCTGTGGTATTGGGCGAATCCTTTTGCGATTGGTCTCCGTCCGGTTGCGGGATAGTAAT 
GTTGAAGGTTATGCCTCCACTGCCCTCCTGCGAGGTAATAACCATCTCGCCGCCTTTGCTTCCCGGTTTGGGCGCCTCACCGCTCTCGTTCTTGT 
CGGAGTGGAGCTTTTCAAACAGGCTCTCAATGTCGGCCTCTGCGTTTTGCTCGCCGCTCTCGTCACCGGCCTTACCAGTTGCCTCTCCGGGCTCA 
TCGCTTAGTAAAGCCGTAGCTGCCGAATTTAGGGCAGTGGCTCCGGCCGAAGTTGCCTTCTTGTCACGTCTGTGGGTCTCCATGTGGTGGTTGTA 
GTATGTCTCCTGGATGAAAGTCAGTGAGCAAATGCTGCACGAGAAGAAGCGTATGTTGTTCTCCGACTCGGTGGTTGCCACCACAGCAGCTGTCG 
AAGCAGGCAGGCTATCGCTCAGTGCCTCATCAATGGTCTCTTGAAGGAGCTTCTGGGTGTTCAGGTTCTGGCTAACTGACTGCTTGGGCGATACC 
GTCGATGCGGAAGGACTCTGATGCACTTCGAAGTGCTTAACAAGCATCTCCCGTTTCGGGAAGGTCATCGAACAAGTGCCGCACTTGAAGAAGTT 
GGTCTGATGCTGTTTGGCATGCACAGTAGCAGCTTCGCGCGTCTCAAAGCGCTGAGAGCACAGGCGGCACTTAAAAGGCTTAGGTTTACCAGCGG 
CATGCTTGCGCATATGCTCATCAAAAACGACCTTGTCCGTTGACTTGAACTGGCAATGATTGCATGTGTATAGCGATGTTCCAACCAGATTCTCG 
TTGGCAGCAGCTGCAGCGGCAGCTGCAGCAGCAGCCACGGGAGCAGCTGATGCCGGTATGGTGGTGGTGCTGGTCACCAGTACATTGGTAGCGTG 
ATTCGTCTCCATGTGCTTGCGCAGGCTGTTCTCGTTCACGAAGTTAATGTGACACATGGCACACGTGTTATCCATGGCGGTATTGTGGTCGCGCA 
CCATGTGCTGCTTCATGGCCCCAAAGTGCGGGAAGATTTTGCGGCAGGTCTTGCACTGGAAGGTCTGCTGCTTGCAGGTTTCATAATGCGTGTTG 
AACAGCCGCATATCCGAGGTTTTATAGTCACACGAGGTGCATTTATAAATCTTGGTGGCCTTGCGTTGGACTGTCTGGGTGCCTTGGACGCTGCT 
CTGGGTGGTTCCTGGGGCGATCTTGGACTGGCCAACTGTACGATTGCAGAATATGCATTAGGAAAGCTTTATAATTTCTGTATTTATGTACTTAC 
CTGGGTTTCCCTGCAGGAGTTTACGTGGACGCAGCAAATCCGCACTAGGACTCTCCTCCAACGAGCGATTGGCGGATCCGCCGACCATTTTCTGC 
ATCTTGAGCGGCGGACTGCCTTCCTGGCCCATATCCTCGTTAATGGCATACTTCTTGTTAAGCAAACTGATGAGGTTCGTCTTGACGCGCTCCAC 
ATCGTTTTCCAGGCGGTCGTAGTAGTTCACCAGGTTGGTACAGCGTGAGCACACGTAATCCTCGCCGCAAACAATCACGAGGAAACCCTCGCCCA 
CGAGCTGGGCCAGTTTGTTGGGGAATTTGGTGGAAGTGTGCGTGGTGCACATATCGGTTAGCGAAGTCTGTGGCTCGCAGCGGTCATCGCAGATA 
TAGCATCGCGTGGCTAATCCGCCGATACCCACGCCACCCGACTCTGCATCGTTGACCGCCTGCTGCTGTGCTACGCCGCCAACGGCTGAGGCTCC 
TCCGGCAGAGGCACCTCCTCCTCCGCCACCACCGGGTACCACCATACGCGTCGACTTGAGGATTCGCGGCTGGCTGGGACCGGTCACAGTGACCG 
CTCCGCTACTGAGCAGATTGTTAAACTTGCGGTAGGTGGCCGGATTCAGTTGGTTGTTCATTTCGTGGCGTTTTGGATTCAGGTGAGCACGCCGT 
AGTGATGGTGTTATTCGGTTTGTGCGGGCGCTACACGCGCAACTGCAAGCCGCTACCTTTCAATATTAATCGGCGATTACGACAAACTACACTTC 
TGCACTCCGGTTTGGCAGCTTGCTGGGATTTATGTATCGATCGGGTACGGATGAGCCACTCGAACGAAACAATCTATCTTTTTTGGACACCAGGA 
ACAACAATCTCTTCCGTTTTTAATTTTCGGTTTCGTTCTGAGTTGTCGCGGCGTCACTTTGACTGTACTAAAGACTCGGTGCTGTATTTTTTAGG 
GTTAAAAATGCGTATGTGCTGTAGCAGTTCTCGACGGACGCGTATGGTCGTCTCGCTCTTTAGCATGCTCGCTGCCCTGGCACGGCGGGAAAACA 
TGGGCTAGTCGTTTCACTCGCACGCCACGGAAATGGCGGCTGCCTGGGAGGTCAACCATGGTCAGACAGAGAAAATCTCACAGCGTTGCCGTTCA 
TTACGATAACGTAATTATTTACTTGAAATAATAAAGCATTTATTTTTCTTATTGCGATTGCTGAGTGGTTAAATACAAATAAATACTTACTTGTT 
GTGCTTGTATTTGTAAAAGGTAAATAATTAGCACATTTTGTCTTGAAAAAAGGCTCAGCGTCATTCCTATCATGCAATGTCAAAATTAACAGTGC 
GTTTTGTAGACAGTAGAGGGCGCCTTTGCCCGCCCACTTTAAATTCACAAATATTCTCCTCGTAAATATTTGGCATACCCAACCTGTGAGGTAAG 
CATTAAATACTTGCATTTACGCAGTGATAAAGGCCAACATCACTAATGTGCTAATACATAGGACCGATAAGATTCCGAAATTCCAGTTTTACCTA 
TTCGATTTAGGTATATCTTTTAGCCAAAGGAAAGTTGAAGGGAAGGGGGCTCTCTTATCAGTAGCGATAATATAACATACGACCTGATAAACGTT 
CCAGCCAAACCACAGAATTCAGTTAACGAACGAAACGAAAAGCGAGCGGATACCGCCTCGATCGAAGCGAAACATAAAAAATAAAAGCCCCTGGC 
CAGTTGGCTTTTTGTTATAAGAGCCATTATCGCAAGGTTTATCGGGAAATATAGTAAACGGGTGGCGAACTCAATTGTCCGGCTCGATCGTTCAG 
TTGGCCAACAGACGTCGCGTACTGCAAAAGCGCACTCAACCCGATCGATTTGAGTCCCGCATACGGGGCGTGAGAATATCTCCCAGTATCTGGAA 
ACCACCCCTATATATCGAACAGTCACTGCAAAATGTATTCAGCACCAGCGGCAAAAATCAAGCACTTTGGCTCCCTTAAAACGGTAGCAGCTTGG 
GCCGGAATTC 
(SEQ ID NO: 1453) 

Exon: 4425.. 3421 
Exon: 3360. .1001 

Start ATG: 4051 (Reverse strand: CAT) 
Transcript No. : CT23578 

CGAGACGACCATACGCGTCCGTCGAGAACTGCTACAGCACATACGCATTTTTAACCCTAAAAAATACAGCACCGAGTCTTTAGTACAGTCAAAGT 
GACGCCGCGACAACTCAGAACGAAACCGAAAATTAAAAACGGAAGAGATTGTTGTTCCTGGTGTCCAAAAAAGATAGATTGTTTCGTTCGAGTGG 
CTCATCCGTACCCGATCGATACATAAATCCCAGCAAGCTGCCAAACCGGAGTGCAGAAGTGTAGTTTGTCGTAATCGCCGATTAATATTGAAAGG 
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TAGCGGCTTGCAGTTGCGCGTGTAGCGCCCGCACAAACCGAATAACACCATCACTACGGCGTGCTCACCTGAATCCAAAACGCCACGAAATGAAC 
AACCAACTGAATCCGGCCACCTACCGCAAGTTTAACAATCTGCTCAGTAGCGGAGCGGTCACTGTGACCGGTCCCAGCCAGCCGCGAATCCTCAA 
GTCGACGCGTATGGTGGTACCCGGTGGTGGCGGAGGAGGAGGTGCCTCTGCCGGAGGAGCCTCAGCCGTTGGCGGCGTAGCACAGCAGCAGGCGG 
TCAACGATGCAGAGTCGGGTGGCGTGGGTATCGGCGGATTAGCCACGCGATGCTATATCTGCGATGACCGCTGCGAGCCACAGACTTCGCTAACC 
GATATGTGCACCACGCACACTTCCACCAAATTCCCCAACAAACTGGCCCAGCTCGTGGGCGAGGGTTTCCTCGTGATTGTTTGCGGCGAGGATTA 
CGTGTGCTCACGCTGTACCAACCTGGTGAACTACTACGACCGCCTGGAAAACGATGTGGAGCGCGTCAAGACGAACCTCATCAGTTTGCTTAACA 
AGAAGTATGCCATTAACGAGGATATGGGCCAGGAAGGCAGTCCGCCGCTCAAGATGCAGAAAATGGTCGGCGGATCCGCCAATCGCTCGTTGGAG 
GAGAGTCCTAGTGCGGATTTGCTGCGTCCACGTAAACTCCTGCAGGGAAACCCAGTTGGCCAGTCCAAGATCGCCCCAGGAACCACCCAGAGCAG 
CGTCCAAGGCACCCAGACAGTCCAACGCAAGGCCACCAAGATTTATAAATGCACCTCGTGTGACTATAAAACCTCGGATATGCGGCTGTTCAACA 
CGCATTATGAAACCTGCAAGCAGCAGACCTTCCAGTGCAAGACCTGCCGCAAAATCTTCCCGCACTTTGGGGCCATGAAGCAGCACATGGTGCGC 
G AC C AC AAT AC C GC C AT GG AT AAC AC GT GT GCC AT GT GTC AC AT T AAC T T CGT G AAC G AGAAC AGC CT GCG C AAGC AC ATGG AGAC GAAT C AC GC 
TACCAATGTACTGGTGACCAGCACCACCACCATACCGGCATCAGCTGCTCCCGTGGCTGCTGCTGCAGCTGCCGCTGCAGCTGCTGCCAACGAGA 
ATCTGGTTGGAACATCGCTATACACATGCAATCATTGCCAGTTCAAGTCAACGGACAAGGTCGTTTTTGATGAGCATATGCGCAAGCATGCCGCT 
GGTAAACCTAAGCCTTTTAAGTGCCGCCTGTGCTCTCAGCGCTTTGAGACGCGCGAAGCTGCTACTGTGCATGCCAAACAGCATCAGACCAACTT 
CTTCAAGTGCGGCACTTGTTCGATGACCTTCCCGAAACGGGAGATGCTTGTTAAGCACTTCGAAGTGCATCAGAGTCCTTCCGCATCGACGGTAT 
CGCCCAAGCAGTCAGTTAGCCAGAACCTGAACACCCAGAAGCTCCTTCAAGAGACCATTGATGAGGCACTGAGCGATAGCCTGCCTGCTTCGACA 
GCTGCTGTGGTGGCAACCACCGAGTCGGAGAACAACATACGCTTCTTCTCGTGCAGCATTTGCTCACTGACTTTCATCCAGGAGACATACTACAA 
CCACCACATGGAGACCCACAGACGTGACAAGAAGGCAACTTCGGCCGGAGCCACTGCCCTAAATTCGGCAGCTACGGCTTTACTAAGCGATGAGC 
CCGGAGAGGCAACTGGTAAGGCCGGTGACGAGAGCGGCGAGCAAAACGCAGAGGCCGACATTGAGAGCCTGTTTGAAAAGCTCCACTCCGACAAG 
AACGAGAGCGGTGAGGCGCCCAAACCGGGAAGCAAAGGCGGCGAGATGGTTATTACCTCGCAGGAGGGCAGTGGAGGCATAACCTTCAACATTAC 
TATCCCGCAACCGGACGGAGACCAATCGCAAAAGGATTCGCCCAATACCACAGCACCTGTCTCTGTTAGCATTGACATGCCAAATTTGGATCAAG 
CCGAAGACGAATCACAGTCCCAGGGCAGCATCGATGCCAAGGGCGAAAGCGGCGAGGCCTCCTCTGCCGATAGCAAATCCAATGCCGCACCGGTT 
TCCATGCCCAGTTTAGATGATGACAACGAGGCTGCAGCTGCTGAAGCTGAGGCCTCAGAGGAAACCAAGACGGCTAGCAAAGCAGGTCAAGAGAA 
AGCCAAGGCCAAGGCTGGAGAAAGCGCCAAGTCAAAGGAAGCCGCCGCTGCTGAGGAACCACAACCATCTGACGAAGGAGACGTAGTTGCTAAGC 
GTGAGGCTACAGATACGCCCACAACTGAGTCGGAGGCCAATGCCACGACCGCTGAAGCCGGTGAATCTGGCGAAGGCGAGGTTACTGCCAGTACT 
GGAGCCGAGGGCGAGGCGGGTGAAGCTGCAGGTGGCGAGCATGTTGCCATGGAACTTGACGAAGCCATGCAGGCACAGGTGGAAGGTGGCCAGAT 
CAAGTTCATTGTTAACGAGAACGGCCATCTCCTCCAGCTGGACAACCACATCCTGACGGACGCCGAAGGCAACCAAATAATTGTGCAAGATCCCG 
AGCAAATACAGCAGCTGCTCCAAAGTGTAGGAGTTCTGCAGTCCGGCGAGGGACTTGAGGGCGAAACCCTCCAGATGATGACCGATGGCAGTGGC 
CAGATGGTACTTGTCCATGGTGACAACAATGAACAGCAGCTTATCGATGCCTCCCTGTTGAACTCCGAGGGGCAGTTAATTATCCAGCAGGGACA 
GGATGGTGAGGAAGGCGCTCATGTCATTAGCGAGGATGGTACCCGCATCCCAGTGTCCGTGTCGTACACGGAAGATGGCCAGCCCATCGTCCAGG 
TTCAGCAACAGGTCCTGGAGGCCGCACAGCAGGCAGCCAGCGCTTCCGGAGGTGCAGATGACAAGGAGGCGGCTGCCGACTCGGCCGGCGAGGAA 
GTGCAAGTGTCGGAGGCATCTAGTGGCACGGCATCCACCACCGTTGTTGCCACGAGCACGGCCAGCAGCACTGGTGGCCCCTCCGGCGGCAGCTT 
TTTTGCCCTGGAGGAGTTTACGGAGACAAAGACCGACTAG 
(SEQ ID NO: 1454) 

Start ATG: 375 (Reverse strand: CAT) 

MNNQLKPATYKKFNNLLSSGAVTVTGPSQPRILKSTRMVVPGGGGGGGASAGGASAVGGVAQQQAVNDAESGGVGIGGLATRCYICDDRCEPQTS 
LTDMCTTHTSTKFPNKLAQLVGEGFLVIVCGEDYVCSRCTNLVNYYDRLENDVERVKTNLISLLNKKYAINEDMGQEGSPPLKMQKMVGGSANRS 
LEESPSADLLRPRKLLQGNPVGQSKIAPGTTQSSVQGTQTVQRKATKIYKCTSCDYKTSDMRLFNTHYETCKQQTFQCKTCRKIFPHFGAMKQHM 
VRDHNTAMDNTCAMCHINFVNENSLRKHMET^ 

AAGKPKPFKCRLCSQRFETREAATVHAKQHQTNFFKCGTCSMTFPKREMLVKHFEVHQSPSASTVSPKQSVSQNLNTQKLLQETIDEALSDSLPA 
STAAVVATTESENNIRFFSCSICSLTFIQETYYNHHMETHRRDKECATSAGATALNSAATALLSDEPGEATGKAGDESGEQNAEADIESLFEKLHS 
DKNESGEAPKPGSKGGEMVITSQEGSGGITFNITIPQPDGDQSQKDSPNTTAPVSVSIDMPNLDQAEDESQSQGSIDAKGESGEASSADSKSNAA 
P V SMP SLDDDW E AAAAE AE AS E ET KT AS K AG QE K AK AKAGE S AKS KE AAAAE E P Q PS DE GD V V AKRE AT DT P T T E S E AN AT T AE AGE S GE GE VT A 
STGAEGEAGEAAGGEHVAMELDEAMQAQVEGGQIKFIVNENGHLLQLDNHILTDAEGNQIIVQDPEQIQQLLQSVGVLQSGEGLEGETLQMMTDG 
SGQMVLVHGDNNEQQLIDASLLNSEGQLIIQQGQDGEEGAHVISEDGTRIPVSVSYTEDGQPIVQVQQQVLEAAQQAASASGGADDKEAAADSAG 
EEVQVSEASSGTASTTVVATSTASSTGGPSGGSFFALEEFTETKTD* 
(SEQ ID NO: 1455) 

Name: zinc finger protein 
Classification: trans cription_f actor 



Celera Sequence No. : 142000013383919 

TTGGCGCCGTCCAGGCTCGCGCCATTCACCTGACAAGCCTTCTAGCCAAAGGTAAGGCAATTGTTTATGCAATAGCCACTGAATCTCAAACTGTA 
ATCCCCGCCAGAACGCCGATACACAAACAAACACGAGTGGGTGGAGGTGGTATCCGGCAGCAATGCCATAGTAGGCATCTCCAGCTACGCCCAGG 
AGGCTCTCGGGGATGTGGTGTTCGCCCAACTCCCAGAACCCGGCACGGAACTCAAGCAGGATGACGAATGTGGGGCCCTGGAGAGCGTTAAAGCG 
GCTAGCGAGGTGTATTCACCCGTTAGTGGCAAGGTAATTGAAAAGAATGCCGAGGTGGAGGACACACCGGCCCTAGTCAACAGCAGTTGCTATGA 
GAAGGGTAAGATTTGCGTGAAGACCCCTTTTTAACTTCACTTGAAACACGTACTTTTCTTTTACAGGCTGGCTTTTCAAGGTGGACCTGAAGAAT 
CCCAAGGAACTCGAAGCCCTTATGACCGAGGATCAGTACAAAGCCTTCCTGAGCAGCAGTGGGGATCACTAGAACCAGCAGCTAATCCTAACCCT 
AAAGCATCTTGACTGTAGCCTTTAAAACGAGTAATTGTTAGCATTTGTTATCAGCTGAATAACGCTACACTCATACCCAAATATCACCAACCAAA 
CTCATCGCCATACACACGTTGTTGTAGCTCTTGAAATTTGTTACGAATAAGAATCAACAATAAATCCAAATGTTATATTTAAATTGGGCATCTTG 
ACCTGCGGGTGGCAGACGTCGTCTGCTCCCCCATGGAAACCTTCTAGGTGATAAGACCAAAAGACCGAAGAGAAAACGTGTTGAGAGTTGCTCAA 
AGATAAGAAGTCACTCAACAGAAGATTAAGAGAGCGGACAAGGCACTCAGTTTCATAATAATACAGATCAGCTTAATACGGCGTGCTTGGCTGGA 
ATTAAGAAAAAACATTTAACACTGTTGCTTGGACACGCGATGGCGCACGCCACGGCACTCAGCCATCACCTGATGGATTCGATGATGGACCGATG 
ACACACCTACCTCTCTGGAAAGTTACGCCGTGCTCTGGCGATAAGGATGTCCAAGTCGCGCCTTGAAAATACATATGCCCGATCAAGTTGATAAG 
CCTTACAAAAAGCGCCGCCCATGTCTCATTGAACTTGAAGAGCGTGTTCGCGGAAGCTTTTCCATTCGGAAAGCTATGGCAACGGAGGCGAAAAG 
CTTGGCAGTAGCTCATTCGAATGGTCAGTGCTCAAAAAGAACTCATTCCGTGCGGTGCGAGAAGCGCATGCTGCTCGAGAACCAACCCCTGAAAA 
TAATAAAAACAATCACACAAAGCTAAGGATAATTGCAAAACAGGATAATTGAAAACCTAAAGCAAGGAAAAACAGTTGAAGGACATACACAAATA 
ACTAAAACCACAACTCAATAACAAAATGCCAGCGGAAAAATCAATATACGATCCCAACCCGGCCATCAGCCAAAATGCCTACATATCCAGCTTCG 
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AGGAGTACCAGAAGTTCTACCAGGAGTCACTGGACAATCCGGCCGAGTTCTGGTCCCGCGTGGCCAAGCAGTTCCACTGGGAAACGCCAGCCGAT 
CAGGATAAGTTCCTCAAGTACAACTTCAACATCTCGAAGGGACCCATTTCGATCAAATGGATGGAGGGTGCCTCCACCAATCTGTGCTACAATCT 
CCTAGATCGCAACGTTAGGAACGGCCTGGGCGACCAGATAGCCTACTACTGGTGAGTTCCCTTGCAAATCCTCGCTCACTTTCGCTCTGGGTGGC 
GTAGGTCGCACGAAAACAAAAGTATTTGGTTGGCAGCGCAGGATTATTGATTTTTCGCCTGGTTACGAAACTCTCGCCTTGTGACGTAGAACGGT 
GGT C T C CT C T T GC AC AGG AT TT AC T GC GC AGGC ACC ATAC CG AAAC AG ATG A T AGC AC T C C TT T G T AT AAT T T AAAT AC AT AT GT G C AC AAG G AT 
GCAGCTCAATGGCACATCCCACCTGCCATCGATTTTGGTTGAAGAATCTTCCCTTATTACCTTTGTGCCTGGTAATCTGAGCGATTATTTTCGTT 
TTGATTGTCGCAAATAATTGGTTTAATCGTCTGCATGCCAAATGTGACAGTGGGACTAATGACCGATCAGTTCAACTTTGGGCGGTTTTTGGTTT 
TCCAAATGCGCTTTGCTAGACACTCCGACTCTGGGTCAATGTCAGGGCTACTCTTCGTTCCTCGATACGACACGACCCTCATAAGGGGGTGACCC 
GGCCATGCTTATTTATTCAGTTTAAATCGATTTACAATTAAACCGGTTGGGGAAGCCCTGAAACATAGAAGCTTTGGTCTTATTTGATTTCACAC 
ACTGTTTACTGTGAGTAAGAATGTTTTTCTTTATTCATAAAATTAAATTTAAATGTCGGTTTTTTACCAATACTTCCAATTATTTTAAGTACATT 
TCCTGTGTTTTTGAATGTAAAGCAAAGCTTAAGGGTCATTGTCAGTTGTGTTGTTAAGCTCACTAAAACATTAATAACTAAGGCAGCGTGTGGTT 
TTTTTTATTAGCAGACCGTAATACTTGCCCCAAAAGTACAATATCCGTACTTTCTGATGTCTTATCTAATTGTCAGTCTATAACTCATCTTTGCG 
TTTTTGTTTGTTTCTCCACACGAGGAGGCATTCATTCGTAACAAATTAAGCTAAATTAAACAATCGAATGCGATACTAACTAGGGACTGCTCCCA 
CTGTATATGTTCTTTCCTCTTCAAATCAACATACTTCGGTGACTGCGATTAAGCAACAGACACTCCCTTTCGCTGATTAGCTACCGAGGAGGAGT 
CCAGTGCCGTATATTACCTTATGAGGTTCTCAAGTGCCCACTCAAATCGCAATTAAATTGAAAAGCTCGGACTGGATGGACGGAATAAGTATTTT 
GTTTTGGTTGCGCGAGAAGATTTTTTGTTGTTTTCAGACCCAAAGTTCACCGAGCAATGCGTCAGGGCGCAAAAAAACACAAAGTTAAACAAATG 
TTAATGCTAATGAGCTTATCTCCGGGTTATCATTTGAGAGGCGGGTTAGGGGGCGGGGCACGTGTGCCGGAGGAAATGCAAATGAAAGTCAGGTA 
ACTGAAAGGCAATTACAAGGTCATTGCAAGTATTGGTTCGCATTCAGCCTCGAACTTACGACGGCCCATCACTACTACTATTCTCGGCCATTTCG 
CACTTTTGTTGGGAATTATTTAATCCCAGCCCACAATCTGTCCTCACTTTCCTCAGATAAGATAAAATGGGGAACACAGGCCCAAAAAGGCTGAA 
AAAAAAACGGTGTGCACGTAATGAGTAATAATTAACGCTTTCCGAACCGCCGTCTCAGATAATTTTACTGCAAACCAAAGCAAGCTAAAATCTTA 
AAAGTTATATGTCCCAGTTTTATTGCATAGATAGACGCAGCGAATCGGAAAAGCTGGAGAATCGGTTATTGGGAATTGGCTTTGGCCTGATTAAT 
ATTGCTAAAATTGCGTTTTGGTGCAATTTATGTAATTTTTCAACTTGAAAAACAATATTTTGGGATGTCTTAAATATTTGTGTGCACCTTCCGTC 
GAACTAAACAAGGTTGTTGTACCTTTGTCCCAAAATAACCGTTTCGAACCCAACCGATTTACGCAAACTGTTTTCAATTTCGAGAGTCGTTAGCA 
ACAAAATTAAGTCGAATTAAACACACGAAAGCCTGAAGCGAAAGCCGAGCAATATTATGTAAACGATTTGACAAAGCGCGCAAATTGTTTATTCA 
AATTAATGGAAGCGCGTCTTTGCGTTAAACAGTTTGCTATTTTTCCTTGCCTTTAATTTGTGTGTCAGTCGGCGTCTGGGCAAAAGGTGGATTAT 
AATCGATATATTCACTGGGAAATTGAATAAAAAGATCCAAAACAGTTGAAAGCGTGTAAAAATGAAAAGTTTACCCAAAATTATCGAAATAAGGT 
ATTTCTAATATTCCAGTGAGCGCCAACAGTTTTACAGCCACTGCGCGTCTAGAAAATCACGCGATCACATCGTCTGATTCACTCACAACCAATTA 
GAGAGCCACTGTGCTCCCTCCCTCCACGCACGTTGAAAGACTAAGAGAGAGGTAGTGGTTGTTGTGGAAGGGGGGGGGGGCAGTGCTGCTGCGCT 
AAAGTAGAATTGACTGTCTGTCTTACCTGCTAAGAAATAAGAGAGCCTCAAAAAACAACATGTAGCCAGCTACCTAGGCTACTTTTCTTGACTTA 
CCTCGCTTTGTTATCAGTGTTTCCACCTGGAAAAGGTTGACAGGCCCATTTCCGTAAATTATGTAAGCGACTCTATTCACACAAATCAAAGATGC 
GCCAATCCGTAAAAACAAATAACCGATCTTTAGATAAACATAGTAGATGAGCATCGTACGATAATGTCTCTCAAACTGTAGGGCCCGGGCAGACA 
CGGAGACGCGATAGAGAGAGATAGAAGAAGTCACCGCAAGAAATCAGATGGTTTTGTGGCTAAACAATTTTTGATCTTAGAAAGTCTCAATATTA 
CTGCTTGAGTTTGTTTGGAGAGTATCTACAAGCATTTTGTAACAACTAAAAAAATTCCGTCAGAACGTATGGTAAACAATTCCGTGGGATTTTAA 
AAAATCACTCAGTTTAAGGCCATGTAGGTTACACAAAGTAATTTTTTCCGCTCATAAATACGAGTGCTTGAGAAATTAATAGTAATAGGAAATCT 
AACAACAACCGATGCAATAAAAATCACACATCTTAAGTAAATATAAATTGGTTTCTATCAGCATTTCCTAAGCAAATTATTTTCAGGCGTTTACT 
AATATTCAAAATATAGTAAGCACAAGGATATTTGTACAGTGTAGGAAAGAAACCCAAAACATTCCATGCGATCAATACGACACGCGAATCAACAG 
CTGATGTTATTGTTATTAAGATAAGTCGCCGGCACTTATTGTTTTTTCCATGGATTTTAGGCCATTCCTCATCATTTTTTCATTTTTGGCTCGCA 
CGTGTGGTGAGAGTGGCTGTTTCTAGTATATTTTCGGATTTTTGATTTCTGATAAGCGGGAGACGGCCACGCGTCTTCCCTTGGTCAAGAGGTGA 
ACGGTAGATGGCGGACCGGGACGGTGCATGAGGTGTTTCTAAGGCGGCATGGAAAATTTCTTGTCTGGCGATCCGGCGGCGAGATTAGGAGGCGA 
GTGGCCCGATTACGCCTCTAAATTGGAAATTAATACACTCGTCGCGACACATTTCTCCCACTGGTTTGATTACTCATGCGGAGAAGGAGGAGGCG 
CCTCTGGTGCTCCAAACTCTGATAAGTGGAGAAGACGTCCGCAAACACACAACATAGTTTTCCGTGACTGATGGGAATTTCGGCCCAGTTTGGCA 
ATCACCATATACATATATATTATAAGTTTGATTATATTTATTTGATCTTATCCATAACAGTGATGTTGACGTTAGATGGCTGTCAAGGTGGAGAC 
CAAGGTTAAGGACAAACAGTCTGTTAAGGGTAGACGTTATCAAAATTTGTAAGCCTGGAAAATTCGGCACATTTCAATATAGGCGATATTAATCT 
GTACGAAGGGAAAAATGATCTGTACCTGTCTTCGATTAGTTATGTCTATTGCTTATAACATTCTTTTTATGAAATTGCTGGTTTGTGGGGATTAC 
AAATGACATTAGTTTCTGATCGTGATCCCTTTTTCGGCCTGCGGGCTTCACTTTGGGGATCCAAATCTCCATTGGGGATCCCTCCTTGTTGGCAG 
CTGCGAGGAGTACCGACTCCAGTGCATCGGCATAGCAATCGATGTCCTCCCTGAAGTTGAGCTCCTGGGTTAGTTCAACGCGGGAACTTTCATAC 
TCATTGGTTTTAAGTCAGGTGGTTTCCGGGCTTCGGAGAAGAGTTATCAGCATTGGTAAGTGTTCTCCGATGATAAGTCCAATTGCGTGTTTGCA 
GTTATTAGATTGTGAGACATGTTTCTTGCCACAAAAACCCAATCAATCAAAATTTCTAACATCTGTAGGCCAGTATTTAGGACTGCCTGGTAAAC 
GCAGTCCAGTTTTTATGATTTCAATATGGTTTCCTTTCTTTAGGGGTCGTCGGGCAAAATCCCCCAATGAATACGGGTGTCAGCCCTAATCTCCG 
TCGCATACTGTCATATTTGGCCAAGAGCAAGACTCTAAAGTATAGATAGGGTCCAGGAAGAAAATGACTAAGCTCGAAAATAACAGCTCGAGTCA 
CAATCGAGGGGTGTTTTCATTGCGTATTTTATTTATTTTTACCAACTTACTAATTGACAAGTACAAGTACAAACAGGTATTTCGTTCAAGTGGAA 
TATCATTTCACCGGGGCGGTTTTGGCCCAGTTGAATCAGTCCAAACCCCAAACCCCTTCGCTGGGCGAACTTTGAACACAGCGGTACTTTGACAG 
TCAGCGGTACTAAAATAACCCCCAAAAAACGTCACAAGGGGAGCTTTTGCCGTTGTTCCGACTGCTTCCTGCTTTTCTCTTTGGCCATACACATT 
TTGGGCCAAAATAATAGACGCTTTATTGCCGGCTAATTCTAATGCTTATCGCAGGCGACGGACAACAGGTGATGATTTACACTCCGCTGATTGCA 
TTGTCTGCCTTTCTAGTCCACCTGTCGTGTAGTTTATTACTGTTTTCTGCGTTGCTAATCTGCCTTGACCTCCACAACTGCCATGGAAATGGCTT 
TCCCCATTACTTTCATAGTTTCATGACTGACCGTCGGCGATCGCGAGTGTTTATATCCGTTTTGAACGCCGAAACCGTTCACGATTTACCCCAGT 
CACGAAATTTGTTTACCCATTCAAAGCTTTCAAAGCTCTGACACATTCCATTTAGTGTTTAATTTTATATTCATTCATTTAATAGCAACAATTAG 
GGAAGCATTTACATTGCCTCATATATGTATGTGAATTTTGGATTATTAGGATTGGAAATGCGATTCGGTTTTATTCTTTTATTTATTTTTAGATA 
CATACCTTAGTTGTACATAGTTTTGGTAACTTGTGTACTCGTTCAAAGACTTTTTGCAGCTGCTGTAAATAAAGTAAAGTAGGTTATTGGCACTG 
GCTTCTGTTGCCAAGTTCAGAACAAAAAGGGAAAGGCATACAAAGTATATACAATTACACAATGCTAAGTCATATTACATATGTACACGGTAGGA 
ACACAAGCGGCACACAAAAGGGGATTGGATTTAAATGCATTTTTAAATATTTTGTTGGAAATCTTACCGTAGCTCTAAAATATCTCTTTGAAATT 
AGGTGACAATTTGAGTTCAGAAAATCGAATTAAGTCGGGTGGGCAATTAGGAAGCCTCGCATAAAAGTAAGTGATCCAATTATGGTCGGGGTAAT 
TAGCCCAATTGTGGGGCACCCGGTATGCGGCCACCCATTTGAGAACGCCATCGGCGCGTGCTCAATGTAATACTATAGAAGGAATACTAAGTATG 
TGTTTCCTCACGGTCGTGTTTCTTTTTTGACATTGTTTCAGGGAGGGCAACCACCCCGACGATTATTCCCGGGGTCTCACCTATCGCAAGTTGCT 
GGAGGAGGTGTGCCGCTTCGCCAACGTGCTGAAGGATCATGGCATTCGAAAGGGCGATCGTGTATCCATTTACATGCCCATGATTCTGGAGCTGC 
CTATCGCGATGCTGGCCTGCGCTCGCATTGGAGCCGTGCACTCGATCGTATTCGCCGGATTCTCACCGGACTCACTGGCGGAGCGGATGTTCGAT 
TGCAAGGCCAAGCTGCTGATTACGGCGGATGGAGCCTGGCGTGGAGAGAAACCCCTGTACCTGAAGGCTCTGTGCGACACGGCCTTGGAGAAAGT 
CGAGGAGATGGGGCACTCCGTGGAGAAGTGCATCGTGGTGTCGCATTTGAAGCGAGTGACTCCGTGCCAGCCGGATCATGTCGAGGAGGAAATCC 
CATGGACCGATGACCGCGATTACTGGTGGCACGAGGAGATGGAGGACAAGGAGCCGGCCTGCTATCCGGAGTGGATGGACGCCGAGGATCCATTG 
TTCATGCTCTACACCAGTGGCTCCACTGGCAAGCCCAAGGGAGTGCTCCACACGACTGCCGGATATCTGCTGTATGCGGCCACGACCTTTAAGAT 
CGTATTTGACTATAAGCCAGGGGATATCTACTGGTGCACCGCCGATGTTGGCTGGATCACGGGACACACCTACGTGGTGTACGGACCATTGGCCA 
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ATGGAGCCACTTCAGTGATTGTAAGTGCCTGATTAGATCTAACAAATAATGGATTTTTTCTGACTAACTGTCCCTATTACTTCGTAATTGCAGTT 

CGAAGGCACACCATTCTTCCCTGGAAACGATCGGTACTGGAGTGTCATTGACAAATATAAGGTAACTCAGTTCTACACAGCTCCAACGGCGATTC 

GTGCCCTCATGAAGTTCGGCGAGGGTCCCGTTCTAAAGCACAACCTGAGCGGACTCAAGTACGTTTATTTATACTTCTATAACAATTTTCGTAAT 

GACTTTATTCATTTGCAGAGTTTTGGGCAGCGTGGGTGAGCCCATCAACCCGGAGGCCTGGCTCTGGTATTACAAATACATTGGAAAGGAGCAGT 

GCTCCATTGTGGACACCTTTTGGCAAACGGAAACCGGTGGTCATGTCATTACACCTCTGCCTGGAGCGACTCCCATGAAGCCGGGATCTGCTGTA 

AGATAAATCCTTTGGCTAGTTGATGTCAGTACTAAAATTTATTGTCTTCTACTCTTAGTCATTCCCCTTCTTTGGAGTGAAGCCCACTTTGCTGG 

ATGAGTGCGGCATTGAAATTAAGGGGGAGGGAGAGGGCTACTTGGTCTTCTCGCAACCTTGGCCGGGAATGATGCGCACTTTGTACAACAACCAT 

GAACGCTTCGAGGACACTTATTTCTCCAAGTTCCCAGGCTACTACTGCACTGGCGATGGTGAGTCTTGAAGTTTTAGACTGTTTTTCATTGTGAT 

TTTAAAGTGGATTAGGATTGTTAATTATTTATGTTTGCTTTGATTTTTTTCGCACAGGAGCCCGACGCGATGCTGACGGCTATTTGTGGATCACT 

GGTCGTGTGGACGACATGTTAAACGTGTCCGGGCACCTGATGTCCACTGCCGAGGTAGAGTCGGTTCTCACAGAGCATCCTCGCGTGGCTGAGTC 

CGCCGTGGTTTCCCGTCCGCATCCCGTCAAGGGCGAGTGTCTCTACTGCTTCATAACGCCGAACGAGAACGAGGTATTCGACCAGAAGCTAATTT 

CGGACCTGAAGAAGATGGTGCGAGAACGTATCGGACCTTTTGCCATGCCGGATGTCATTCAAAACGCTCCCGGACTGCCCAAAACTCGTTCCGGC 

AAGATTATGCGTCGTGTGCTGCGCAAGATCGCGGTTAACGATCGCAACGTGGGCGACACATCTACCTTGGCCGATGAGCAAATTGTGGAGCAACT 

GTTCGCCAACCGGCCAGTGGAGGCCAAGTAGATGGTAGCTGTTGGACCAACTGGCAGCAGCCAGGATCCCTTAACCCAAGGCTTTCTCCGGTTGC 

ACTGACCACTCCACTTTGCCATCGCATTTCCACCACTTACGTATCGCGATTACGTATCCTGTAAATCTCCCTCTACTTATATGAATTACCCACTG 

CTTAATGTTTGTTCCGTTCCGGGCTACGGCGTGGTTAGCACCTTTCAAATTCTACTTAGCTCGTAAGCCTAAGCGTGTGCGTAAGTAAGTGACTG 

GATGGTTTCCGATCCCAGTTATATATCGTCTATATGATTTTGTGTACTTGCCTGTTCATCGTAGTTGACATGCTCACCACGCACACATTTTCCAG 

TTCTTTTGTTTCTTCGTTGTAGTCGTAGTTTAAGTTCTCGCTAATTCGTTGTGATACAAATTTAATTATATCTTACTTATGTATGTATTTTTAGT 

AGAATTGATATGAGACAAAATATATACACGATTCCAATTGTCGCCACAATTTACACAGATCACCTTTTTTTAGTTGTATCTTTTTTTTGTAGTGT 

TTCTTTCACACCTTTTCAGATCTGAGGAGATGCGCACGTAAGTTCGGAAACCATAGGGGATATCGGTTTCTTCCCCGTCCCATTAGCCCATAAGT 

TTGAGAACGATACATATCTGTGTGGCATTATGCTTGCCACATTCCTACATACGATCTACATGTTAAAGAGAACCACGCGGATATACTATATACAC 

T AT AT AT AT AT AT AT GT TAT AAGGT C AG AAG ACT T AC TGC AAC G AAAC AT&TC ATTG AAGTT ACT GC T GC AAGC C T ATGG AT AT T A T AGAA GC T T 

TTATTGTTAGAGATCGTTATCTTGAGGGGAGATCGGCTAGAAAGTAATGGATATCTTGGATTGGCTGATTAGGGGCAGCGGTGCCCAGTCTGAAT 

GTAAAAAGATTGTTAGTTATACAGCGATGGCGAACGAAAGATTGCAATTTACATATCTATAAATATATATATATATATTTCTGTATATGAACATA 

TGTGTATAATAAAGTTTATTGTTTTAATGTGTTTGAACCTGCAAGTGCAACGACTATATTTTATAGTTAGAAATTCGTATGAAGTGTCAGTCCTG 

AGAATTAAGATCGCACTAGCCGAGCATGATACTTCCTTGTGAATCAGTCAACTCTCGGCCTTTGTCACACTTGTTTTGCTGAATTTAGGCTCGGT 

CTTTCAACACCTTGGTGGCAGTAACAGATCTTTGTCCGGGTTGTTGTTTGGGGCTGTGGACGAGCACCACCAGAATGGTCGACGAAAAGCTACCA 

CCAAGCAAGGAACCGGCTGAACGGCCTCCAGACCAGACCACCCCCGGAAACGCGGCTACCCGATTCGGCTTGGAGGATCACGAGGCCACTCAGCA 

GGATGCCCGGGTGAACA 

{SEQ ID NO: 1456) 

Exon: 1001.. 1761 
Exon: 7547.. 8285 
Exon: 8359.. 8513 
Exon: 8569.. 8737 
Exon: 8799.. 8988 
Exon: 9083. .9942 
Start ATG: 1451 

Transcript No. : CT23612 

CACGGCACTCAGCCATCACCTGATGGATTCGATGATGGACCGATGACACACCTACCTCTCTGGAAAGTTACGCCGTGCTCTGGCGATAAGGATGT 
CCAAGTCGCGCCTTGAAAATACATATGCCCGATCAAGTTGATAAGCCTTACAAAAAGCGCCGCCCATGTCTCATTGAACTTGAAGAGCGTGTTCG 
CGGAAGCTTTTCCATTCGGAAAGCTATGGCAACGGAGGCGAAAAGCTTGGCAGTAGCTCATTCGAATGGTCAGTGCTCAAAAAGAACTCATTCCG 
TGCGGTGCGAGAAGCGCATGCTGCTCGAGAACCAACCCCTGAAAATAATAAAAACAATCACACAAAGCTAAGGATAATTGCAAAACAGGATAATT 
GAAAACCTAAAGCAAGGAAAAACAGTTGAAGGACATACACAAATAACTAAAACCACAACTCAATAACAAAATGCCAGCGGAAAAATCAATATACG 
ATCCCAACCCGGCCATCAGCCAAAATGCCTACATATCCAGCTTCGAGGAGTACCAGAAGTTCTACCAGGAGTCACTGGACAATCCGGCCGAGTTC 
TGGTCCCGCGTGGCCAAGCAGTTCCACTGGGAAACGCCAGCCGATCAGGATAAGTTCCTCAAGTACAACTTCAACATCTCGAAGGGACCCATTTC 
GATCAAATGGATGGAGGGTGCCTCCACCAATCTGTGCTACAATCTCCTAGATCGCAACGTTAGGAACGGCCTGGGCGACCAGATAGCCTACTACT 
GGGAGGGCAACCACCCCGACGATTATTCCCGGGGTCTCACCTATCGCAAGTTGCTGGAGGAGGTGTGCCGCTTCGCCAACGTGCTGAAGGATCAT 
GGCATTCGAAAGGGCGATCGTGTATCCATTTACATGCCCATGATTCTGGAGCTGCCTATCGCGATGCTGGCCTGCGCTCGCATTGGAGCCGTGCA 
CTCGATCGTATTCGCCGGATTCTCACCGGACTCACTGGCGGAGCGGATGTTCGATTGCAAGGCCAAGCTGCTGATTACGGCGGATGGAGCCTGGC 
GTGGAGAGAAACCCCTGTACCTGAAGGCTCTGTGCGACACGGCCTTGGAGAAAGTCGAGGAGATGGGGCACTCCGTGGAGAAGTGCATCGTGGTG 
TCGCATTTGAAGCGAGTGACTCCGTGCCAGCCGGATCATGTCGAGGAGGAAATCCCATGGACCGATGACCGCGATTACTGGTGGCACGAGGAGAT 
GGAGGACAAGGAGCCGGCCTGCTATCCGGAGTGGATGGACGCCGAGGATCCATTGTTCATGCTCTACACCAGTGGCTCCACTGGCAAGCCCAAGG 
GAGTGCTCCACACGACTGCCGGATATCTGCTGTATGCGGCCACGACCTTTAAGATCGTATTTGACTATAAGCCAGGGGATATCTACTGGTGCACC 
GCCGATGTTGGCTGGATCACGGGACACACCTACGTGGTGTACGGACCATTGGCCAATGGAGCCACTTCAGTGATTTTCGAAGGCACACCATTCTT 
CCCTGGAAACGATCGGTACTGGAGTGTCATTGACAAATATAAGGTAACTCAGTTCTACACAGCTCCAACGGCGATTCGTGCCCTCATGAAGTTCG 
GCGAGGGTCCCGTTCTAAAGCACAACCTGAGCGGACTCAAAGTTTTGGGCAGCGTGGGTGAGCCCATCAACCCGGAGGCCTGGCTCTGGTATTAC 
AAATACATTGGAAAGGAGCAGTGCTCCATTGTGGACACCTTTTGGCAAACGGAAACCGGTGGTCATGTCATTACACCTCTGCCTGGAGCGACTCC 
CATGAAGCCGGGATCTGCTTCATTCCCCTTCTTTGGAGTGAAGCCCACTTTGCTGGATGAGTGCGGCATTGAAATTAAGGGGGAGGGAGAGGGCT 
ACTTGGTCTTCTCGCAACCTTGGCCGGGAATGATGCGCACTTTGTACAACAACCATGAACGCTTCGAGGACACTTATTTCTCCAAGTTCCCAGGC 
TACTACTGCACTGGCGATGGAGCCCGACGCGATGCTGACGGCTATTTGTGGATCACTGGTCGTGTGGACGACATGTTAAACGTGTCCGGGCACCT 
GATGTCCACTGCCGAGGTAGAGTCGGTTCTCACAGAGCATCCTCGCGTGGCTGAGTCCGCCGTGGTTTCCCGTCCGCATCCCGTCAAGGGCGAGT 
GTCTCTACTGCTTCATAACGCCGAACGAGAACGAGGTATTCGACCAGAAGCTAATTTCGGACCTGAAGAAGATGGTGCGAGAACGTATCGGACCT 
TTTGCCATGCCGGATGTCATTCAAAACGCTCCCGGACTGCCCAAAACTCGTTCCGGCAAGATTATGCGTCGTGTGCTGCGCAAGATCGCGGTTAA 
CGATCGCAACGTGGGCGACACATCTACCTTGGCCGATGAGCAAATTGTGGAGCAACTGTTCGCCAACCGGCCAGTGGAGGCCAAGTAGATGGTAG 
CTGTTGGACCAACTGGCAGCAGCCAGGATCCCTTAACCCAAGGCTTTCTCCGGTTGCACTGACCACTCCACTTTGCCATCGCATTTCCACCACTT 
ACGTATCGCGATTACGTATCCTGTAAATCTCCCTCTACTTATATGAATTACCCACTGCTTAATGTTTGTTCCGTTCCGGGCTACGGCGTGGTTAG 
CACCTTTCAAATTCTACTTAGCTCGTAAGCCTAAGCGTGTGCGTAAGTAAGTGACTGGATGGTTTCCGATCCCAGTTATATATCGTCTATATGAT 
TTTGTGTACTTGCCTGTTCATCGTAGTTGACATGCTCACCACGCACACATTTTCCAGTTCTTTTGTTTCTTCGTTGTAGTCGTAGTTTAAGTTCT 
CGCTAATTCGTTGTGATACAAATT 
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(SEQ ID NO: 1457) 



Start ATG: 451 

MPAEKSIYDPNPAISQNAYISSFEEYQKFYQESLDNPAEFWSRVAKQFHWETPADQDKFLKYNFNISKGPISIKWMEGASTNLCYNLLDRNVRNG 
LGDQI AY YWEGNHPDDY SRGLT YRKLLEEVCRFANVLKDHGI RKGDRVS I YMPMILELP I AMLACARI GAVHS I VFAGFS PDSL AERMFDCKAKL 
LITADGAWRGEKPLYLKALCDTALEKVEEMGHSVEKCIVVSHLKRVTPCQPDHVEEEIPWTDDRDYWWHEEMEDKEPACYPEWMDAEDPLFMLYT 
SGSTGKPKGVLHTTAGYLLYAATTFKIVFDYKPGDIYWCTADVGWITGHTYVVYGPLANGATSVIFEGTPFFPGNDRYWSVIDKYKVTQFYTAPT 
AIRALMKFGEGPVLKHNLSGLKVLGSVGEPINPEAWLWYYKYIGKEQCSIVDTFWQTETGGHVITPLPGATPMKPGSASFPFFGVKPTLLDECGI 
EIKGEGEGYLVFSQPWPGMMRTLYWNHERFEDTYFSKFPGYYCTGDGARRDADGYLWITGRVDDMLNVSGHLMSTAEVESVLTEHPRVAESAVVS 
RPHPVKGECLYCFITP1SIENEVFDQKLISDLKKMVRERIGPFAMPDVIQNAPGLPKTRSGKIMRRVLRKIAVNDRNVGDTSTLADEQIVEQLFANR 
PVEAK* 

(SEQ ID NO: 1458) 

Name: acetyl CoA synthase 
Classification: enzyme 
Gene Symbol: AcCoAS 
FlyBase ID: FBgn0012034 



Celera Sequence No. : 142000013384612 

CCGGGAGATGGAAACAATGCGTTTGGGAGAACATGGGAAAGAGATCTGGGGATCTGAAGATACATATGTATATTCAAACAACTCAGGCTGATCAA 
ATAGATTGATAAATTCAAACATATTCCACAGCTGTAGGTGAATTTTTTCGGCCCACTTGTTCTAATAGAGTTAGAGAAATTGGGTTTGCTCAGCT 
TTTTTGGGCACAGGGCCACATATGTACATATCTAATACTCACCGTCTTTTGACGCTTTTTGCTGCTCTCCGGAACTTCACTCATTTCTCGAGGGG 
AACTGTGCTCGCCTATTATGACCCACCAAAGTAGCACGTCCTCCTGCCAAGCAATTCCAAACAAAACTGCGAAAAGTAGTACTTCGAGTCGGAAT 
TCGGTGCAATCGATGAATGAAGTGCGCCACGGAAGTAACCGTTGGCTGGCTTACTGTTTTCCAAATAGGTGTAGCACTCGCAGCAGCTGGACTAT 
TGCACTAAATAGCTTAAAGTCTAGTTATATCGCAAGCAAACGTTTAAATTTGGGATACGAGAACTGTATTATTACTAATTCACATGCACACAAAC 
ACCCACGCACACCAACGCAGCCATCGGCGCTCAGCTGATTAGAGGTGGCCCAAGTGTACGATGGTTAAACGATAGCCCACTACATTGCTAATGTA 
AAAACAAATTTATTAATTTAATTAATATTTTAACTAAACTATCGCATAGTTGTAAGAAAAATTCCAATTGCAAATTAAATTAAAGTCAATACTAC 
TTTTTTATAACCGCTTTTTAGTAACCCATTGAAACATAACTTATGTTCCAATGGATGAAAAAATAGACAAAAAAAGTTGAAACTTTATTAAAATG 
TAGCTGCTAAATATTTAATATTTTTGTAATATCCCAAATCAAATTTGTATGAAAAACAAAGTTCGTTCTGAATGTACGTTTATTGCAAATTCATC 
TGATAAATAAAGGATTAGGACTAGGCTTACGATTAAGAAGTGTCTTCGATCTAGTACGGTGGTGGTTGATACTGCTGCTGGCCCATCATGTTCTG 
CCGCTGGAGCTGAGCCACCAGATTCGGCGTTTGTTGCTGGTTCATGCTCTGCTGCTGCGGCACCATTCCGCCTCCGGCGCCACCGCCAACGGCGC 
CACCCCCCTGCTGAGGCCCGGCTCCCATGCCGCCGCCTGCCCCACCACCCATACCCTGCAACTGCATCATTTGGTGCCTCAGCTGCTGTTGTTGC 
TGTTGCTGCTGCTGGAACTGCTGCTGGGTGTTGAGGACGCCGGGTTTGCCTCCTCCCGCCGACTGCTGACGGGCGTACGGAGGCACCTGGCCGAC 
GACGCCACCCGGCCCCGGACCCATCATGGTGGACATGGTTACGTTGGGCGCCTGGTTCATGTACTGGTTCCGGTTGCCGGCGATCATTCCTCCGG 
GAACAACGCGCTGTTGCTGTTGTTGTTGTTGCTGCTGCTGGGCAACCGCTGCGGCGGCCATAAAGTCGGGATTGTTGCGATTCCCCTGCTGCATC 
TGCGGATTGAAGTTGCCCATCATGTTGGCCTGCTGCTGTTGCTGCTGATTGGGCGCCATGCCGACGCCCACTCCAACGCCTGCTCCACCGCCCTG 
TTGCTGCATGCCCTGCTGTTGCTGCTGCTGGTACTGATTCTGGAAGTTCTGGAACTGCACCTGCTGCTGCGGCTGTTGCTGCGGCTGCTGTTGTA 
TCATGCCTTGCTGAGGCATGCCTCCCATGCCCATGTTGGGATTGTTGCCGGCACCGCCAACGCCGACGCCAACACCAACGCCCACACCGTTGCCT 
CCCTGAGCTCCACCGCTCTGCATCATTTGGTTGGCATTCGGGTTGACCATGCCCACCATATTCTGTGGGATTTGTTGCTGCATCATCGGATTCTG 
CGACATGCCTTGACCTCCCATGTTCATCTGATTCATCTGGTTGGGCGCCATGCCGGGTCGCATGTTGCCGCCGCGCATGAACTGCTGCTGTTGCT 
GCTGGTTGGGATTCATCTGTTGCTGCTGGGCGTTCTGCTGCGGCTGCTGCTGCATGGCGTTGAATGCAGCGACAGACTGAGGATTTTGCTGGGGA 
TTCATTTGCTGCTGGCGCATTTGGTTGGGGTTCATCTGCTGCTGAGGACCGGGCTGTTGTTGCGAAGCCTGTGGTTGTTGCTGCATCGGATTGAA 
ACCACCCCCCGGACCCTGTTGCGCTTGCTGCTGGAATGGTTGTCGTTGTCGTAGCATCTGTGATAGCGCCTGCTTGGAGTTGGCATTCAACGGCG 
GGCGCTCGATGCGATTTGCTGCAATGGTAAGTGACACGGATTACAATCTTTAACCGTATATAATATCAAACACTCGACTTACACTGCATGTTCTG 
CTGATGCGGAGCGTATTGGTTGTGGTACGGTTGTGGCGGCTGCTGCTGCTGCGGCGCATTGTGCCACTGTTGCTGCTGGCCAGGCATACCCTGCG 
GTCCGGCTCCACCGCCCCCTGGTCCCTGCTGCATGAAGTTCATCTGGGGATTTGGCTGCCCGTTCCCACCGCCCACATTCAGCTGCTGCTGCATC 
GGGTTGTTGCCCATTTGCTGCATCTGCTGCTGCTGCTGCTGTTGTAACATGGCATTCTGCTGCATCATGTTGTTCGGATTGGGTGCAAACTGCTG 
CATGTTCATCTGTGGAAAAAATTTATATGAGTTATAATTTAAACTCGAAGGTAACAGAAGAATGGTTTAGCACCGAAAAAGGTTAAGAAAATCTG 
GATTACAAGTTTACAACTTTCGAATTTTTCTGCCATCATGTTGCTTTAAGAGGCTTGCTTCGATTCCTTCATAAACCAAATCTAAGAATTTAACT 
ATTAACGAAAACAATTTTATTACCGGCATGTTCATTGGCATCTGGCCCATTTGGTTGGGCTGCATCTGCTGCTGTTGCATGTGCTGCTGCTGCAT 
GTGCTGCTGTTGTTGCTGCTGCTGTTGCAACTGCTGTTGCTGCACATTCTGCGGCTGTTGAGGCTGCTGTGCCAGCTGAGGCTGCTGTTGCTGAG 
TATTGACCACCGGTGGCGTTTTCGGGTTCTTCGGCTTGCGTTTCCGCGTGGTCGTTCCCTTGCCGCGACCTCTTCCCGTGCCGCCAACCACAGCG 
CTTGGCGACTGATCCACCGACGAGGGCGTGTCCGCCTTCATTTCATCCTTCTACGAAATAGAAACAATGCATTTAGAAAGGGGCAAAGGGAAATG 
CAAAAGGAAATACAAAATGGACTTACTATGCAAATCTTCTCGGGAACAGGCTCAATGTCTTCAGGTGGCAGTGGCAACGGTTCATAGTAATAACT 
ACTGGGCTTGACCAGACTGTGCGTGTGATACTTCAGGTTGCGGTGAGCCTCCTCGTAAGTAAGGGGCTTGCGCTCCAGCTTGACTGCACCAAACC 
ACACCCAGGACAAAGGAGCCGGGTTCTTGTGACCCTCAAGGATGTCCCACACGCTGATACGCTGTTTGTCCGAAATGCGTAGTTGTTTTTTGTCT 
ATATCGCAGATCTTGTTGCCCTTGGTGTCCATTCCAGAGTGTTCGCACGCAATCACCTAAAGGGAGAAATACCACTCAGTTTCCAAGCTCGTTCA 
GTTTTGGGTCATCTACTCACCTCAGTAGGCTGCTTGTAAAGTGGAAGAAGCTGCCGAATAACCCGAATGGAGGCGTTGTTCTTCTCTCCGATTTC 
CTTCTTTAGCTTCTTCATCAGGTTCATGTAGTGCCTCTCGTCATCAGAGACCAGCGTTGAGTGCACCAATGTGGCCAACATGTCCACAACGGTGG 
TGAACAGCTCACGATTGCAGCTGAGATCTACCACGCCCTGGCAAACCAATTGGGCCAGAAGAATAGCCCAGTCTGTGGTGGGGGTACTGTTCTTT 
TGGATAGCCTCGAACATGCCGCCCACCAGCGAAAAGCGCAACTGCAGCGCATCGAGGATCTCCTCGCGCGCCTGCGGCTCATCAATGCCACCGAT 
TGTGTCCAGCTAAAAGGAGTCATGTTACTAGTAAGTTTAACTTTAAGGATAGAAAGTAAACACTCACCTCAGCAAAGGATTGCAGACATTGTGAT 
AACTGGGCATAGAGCGAAACCAGTAAGTTTTCCTTGTACTCGTCCTGGCCTTTCAGGCAAGTTAGGATGAGACCCAGGAAAGGCTGATAATCAAG 
CGGCATCTTCTTGTTGCGTGCGGGCACGCCGCCCAGTCCATAGCTATTGCTACTGTGACAACTGTTGCTCTTCTCGCGCTCGTCACCACTGCCCG 
AGCTATTGCAATCGGACTTGGACACCTTGGAGAAATAGTTCATGCTCTCAAGAACCTGACCAGCCACTCGAAGTATCCTGCCCTGCACCGCTGGT 
GTTAGTTTAGCGATGAGTGGAGCCACTAGCCAGGTGGACGGTTTGGGTTTGTGGGTCGCCTTCACGGCACCCGGCAGGACAACTTCCTCCATGTG 
GAAGACGTCGATTGCCGCGCGTGCCACTGTGTCGAGCCAAACATTAAGCTCCGCGTTGTTGCTCAGCGATTGGCGGTACATAAGCTGAAGATCCA 
GCCACGAGATACGCAGTGTCCACTGTCCGAGATTCTCTAGTATGCGTACGATCATTGAGCGCTGATCCAGTTCGGAGATGATGTTAAACTCAGGC 
TCCGGATAGCAGATCATGTGCAGAACCCGCTGTGCCTGTTTCGCGGTCAGCATTTCGTCGATTATCATGTCGCAGAGTTGCTCTGCGTTCTTTAG 
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GCATCTTTCCAGAACGTGTTCCTGGGCGCAGATCTGCTTGAGCACCGCCTGGGCGAACTCCAACAGCGAGATCTGCTCCATTCCTCCGACCGATG 
GCTGTTGCAGGGCTTCGGGTTCGCTTGTCAAACTGCTGCCCAATTGTAAATCGCTGGTGCCCAGGATCTGGCTGAGATCGGCCGGTCGTTGTTCG 
TTGCTCTTGGGTGTGCTTCCCGGATGGACCGGTGTATTGAAGCCACTACGCTTGCCGCTTCCAAAAGCTCCGCCCAATCCGCCAGAGGAGGTGCC 
GCTGCCGGAAGCCAATGGTGTTTTAAGCGCCGCATTGTCCACAACAATGAGAATGGCCTTGAGCACGGCAAGCACTGCGGCAATGGGAATGGTCT 
TATGGGCACCCACCAGAAGATGACGATCGCAGCTCAGCCGAATGCTAAAGTCGTTTCCCACAGCGTGCAGTGGATTCGGGGAAGTGCTTACCGAG 
TACATGCCAGGTTGAGGAATCTCCAAGGTCTTAAACAGCTTCAGCACCAAGTGACATGTAAGACGAGCACCTGCCTCTGCATCTACACTCAGTTC 
GGCTCCACCAGCGCTTACCGATCGGGCCAATGTAGGCAGGGCGAACTTACTTACAAAGTCGGCTAGTGAGAAGCAATGTCTGGCCACCAGAATGC 
ATACGAACACGGCCAGGGCGTTGTGAGTCTTAAGTTGTCCAATGTCGACCTGTCCGCCCAGGTGCGGATAACGCGGACTCTTACTGCCGCTGCAC 
AGACTCTGCAGTGCGTATATCCACTCCTCAGACAAGACGTTGCAGCTGGCGGTCAGTTCGGCACATGCGAGGGCCACATCGTTCAGCCGCTCGTT 
GTCCGTCTCTCGGCAGACGGCTATTAATGCGTTAGACACGAAACTGTATACATTGGCCGGCGACTCGTGCAGGGTTCCCAGCCAGTGAGGATCCA 
GTTTACCACCGCGTCGAGGGGCCATGAAAAGCTCCTGCAACAACAGTGGGTTGTAACCCTGAGGTGGTAGGCTAAATAAAAAATGGTAATATTTG 
TTAATATTATATATTATATTATTGGGGAAGAACTTGCGATCTAAACTCACTCAAGCTGCTCTGGAGTGTTGAATATGTCCTTGAACCGCTTGATG 
CACTGCAGCTGCTGGTAGTACTCCGTGGATTGTTCCTTTCCCTGGAGCAAAACACACGACTCGTGTAGGTCGCTCAGGTAGGCAATAATGCAGCG 
CTCCGCCGAAGTGCACTCACTCGGATTCGATACGTGCCGAATGGTACGACAGACTCCCTCAAAAACAGACAGCGTCTGCTCCGGTGAAAGAAGAA 
GGCAGCTGTGGTAGCGCCGCAGGATGCTGACGATGTAGAGGGCAAGTGAAGTGGTGTAACTGCGCACTAGGTTTGACTTCTTTAGCTGCAACTGG 
TGCTCCACCTCAGGCAACTCCTTCAACAGACTATCGCACAGTTCCAAAAGACTGTAGATATTAAGAGCCAGCTCCATCAGATCAAAGAGGAACGC 
CACGTGCTCCTGTACCGGCAGGTAGTTGTTGTTCCCTAACGCGAATCCGTTCAGCTGCTCCAGGACATTGGCCGCACATTGGGCTGTGACCACAT 
GCTGGTCGAAATAAGCCATCTGCTGGCACTTGGACGTGGTAGCTTCGAAGTTAAACTCGTTGCGCGAGTGCTTCTTCACATGACCGGCCGCCGCC 
ACATCTATGCTAAACTTTTTTGTGAAAAGCTTTCCAATTTCCTTAGACATCTTCTTGACAGCATGCTTCTTCTCGTCCCTCTCTTTGCCGACTCC 
GAAGAGCAGGATGTATCGCTGGTTGCTTTCACTGCTGTAGCTGCTCATGCTGGGATCATCCTGGGGGATGGGGAAGTGCTTGGTGTACACGTAAT 
GCCGCGATATGCTACCGCCAGGATTGGTTTCTCCATCGCCTGGAGGACCAATCTTGGGACTGTCTGGAGCTTCGTGCTGTTGTCCTTTCTCCTTA 
ATATTTTGCACCAGTTTGTCCAAATCGTCGTCCACATTGGAATCATCAAATTCGTTGTGTTTGAAGTCCAGGCCCCCGCCGAATCCGTCATCATC 
GAAACCGTGCGTCGTCGTTGGTGGTGGAGCTGGAGGTGGCGGCGATGTTTTCGTGGCCGTTGTTCCTGATTTGA'TCACCAGCACCGACTCCAGGA 
GCAGATCTCCTCGTGAAATTAGAGTGTGCATGTACGCGTTGTGCGAAAAGACATCGTGGCGAATCAATGCACTGAACAGCTGAACAAGGTTGGTA 
AATTCCGTGCGCTGCTGTGGACTGCTCACGTGCTCATCGAGGACGGGAGCATCGTGGTCAAGGAAGTGCATTAACACGTGCTGAAACACAGGCAG 
GCCATCAATTAAGCCGGCGCCCGAAGCTAGCGAGTCCTTGTCATCCTTATCACTGGATTGTTGATCCGCCGGCGTGCTTGTCACGTCGATCTGGC 
GCTTGTCAAGTAGAATGGCCACCACCATGGCTCTGTGTTCTCCCCAACGCTGCCCCGACACAGCCCACTCGCACAGAATGCGTACCGTGTCCTTG 
TCTTGTTTGGGTTCATAAGGCGGACGAGGTTCCACCTGATCCTCCTCCCGCCTGCGACTGACAGTTGGGCTGGGAAATATTTGTGCGTATAGGGT 
ATCGATACTATTATTGTGTTCCATGCGATCGAAGCAGTGCGTGTCCAAGTGGTCCAGTGTAGCCAGCACACTGGTGTATTGATTCTTGCCGGCAC 
TCAGCCACTTGGCTGCAAACCAGCGCTGCTCTGCATGCTGAGTGCGCAATACGATGTCCGACTCGGCGGCCCGCAATTGACGTCGAATCTCGTGA 
TTGGTTCGCGGACAGCGTGTGGGCATCGGCAAGACTGAGGGCGCCAAAGGCAAATGGTCCAGCGGACTACCCAACAACGAGGAGGGTGCTCGGTG 
AGCTGCTATCCCACTCCAGACCAGAGCTGTGGGGCACTCGATTGTGATGATCTGTAGGATCGTGCTTAAATACAGCACTATATCCCGATGATGGG 
GACAACTCATTTGTTCCTGAAGCGCCAGCTCATAGGGATCTTGTTGCAACTTCGGCTCGTCCAACTCTTTGATGGTCTGCTGTTCTACGACAGTG 
TTCAGCAACTGGGCCAGTTTTTTGGACACTATGTAGGCCATCTTTCTGCAAAGACGCTCCGACTGGACGAAGTCGTGCATGTACTGAAGGGCAAA 
GCTGAGCACCAATTTCTTCAACGGTTCGTCGAAGCTTGCCTGCGTACGCATCTTGTCCAAAAGGTCGAGGATCCAGTTTAGAAATTCCTGTCGGT 
CCAAAAGTGATTCCTCGTACATGTACTTGCTCAATTGGTGGCAGTATTTCCAATACTTAAGCGCATTGCGACTATCTTCGAACTGGCTTCCAATT 
CCAGAAATTGTAGAGCCTGGTGGAACTGCAGCTCCTGCACCCGCTGCGCCTCCTACCACAGAAGACCCCGGCCCACCGACTACTCCGCCAAGGCT 
TCCCGTCGTCGGCATGACACTGCCGGCATTCACTCCTCCGCCAGGAGCTGCCTGTTGGCCATTAGCTGGGCTATGAATCGGTGGCAGTGGACTGG 
CCATGCTGGGTACCGGGATTACGTTCGTGGAAGCACTGCTACCGGTGACGGAGTTAATAGAGCTGGTGCCAGTGGAGCCATTGCTGGCAGGACCA 
TTACCGGCGGCGGTCAGGCTTCCGGAAGTGGTGCCATTCGAAGAGGACTTGTCGTGGTCTAAAATCAAGAAGTTGAATAAGGGAGTTGATAGATT 
ATAGAACTTAAGGACTGTACTCACTCTGCTGATAATACTCTTGGAGCTTGGGAAGCAGCTCTTTCATGAACTTGATCATGTTGCCGGTCCACTCG 
GCAGCCGGGTCGTAAATGGATCGCTTTTTGTTCTTAGACTCGGTGAAGCTAAGCGTATAGGCGGCGCTCAGCTTGATGAACCAGGTGGCGCGTTG 
CATGTTCACCTGGTTCTCGCACAGAGTAATGAAGATCTCCTCCTTCTTGTTGAACGATGGAGCCCGTTTGGCTAGGCTAAGTAGGGGTTTGTTAC 
CGGCGAGGTCCTTAAACCAGGCGTCCACCGTACACTTGCGACGCGGCGAAACGGGCCAGAAGTTGTCCTTGCAGTTGATCTGCTGCTTCTTGCGC 
CCCGTGTCCGGCAGGGTCATCAGCTCCTCCTTCTTGGCCAGGACGCCGCTAAAGAATGCGCTCACTTTGCTGGCGTTCACATTCGAGTTGTGGGC 
CGTACCGAATTCATCGGACAGCGGTGGCGTCGTCGTGAATCCGTGCTTCACATTCGTGGGCGTCAGCTCGTCCTCTCGCTGCTTCGCGTCCTGCG 
GATAGATGTCCGGCGGACCCAGACGGGTGCGCTTCAGCGGACGCTTCTCCTGCAGCATGGACAGCATCTTCGCGGCCCTCTAGTTGCGTTTTATA 
AACAGATTTTCTTAGCGAATTCCCGATTTCCTCTGTTCGCCAACCCAACAACAAATCAAAATATCAGTGGTGGGGCGAAAAAATGCGATGAGCCA 
TCGATAGTTCGATAACATCCCGCGAACAATCTAGCGGATGCAATCGAATTTAAGCTAAGTGTTAAATGGTTGTAAATATTACAAATGTAATCTTA 
TCATGTTCAGCCACACATCCCCAATCAACCTGATACAGTACTTTAAATATGACGTAATTTTTTAATTATGCAGTGAAAAAGTTACATCGTTGTGC 
ACTAACAAAAGAAATACCACTCAAAGTGGTAAGATCACGAATAAAGCTGCGTATAAATATTAAATAATTTACGTTGTATTTTTGGTAATGATTGA 
AGAAACATTCGTTGTTAAACGAATAAGGCCTCACAAGGCTTGGCGATCGAATTCCGGCTCATCGGCATAAAAGTGTCCGCCGCTTGTCATTCCAT 
CGCTGCACTCGAAGTTCAAACAGAGCATTTTCAGGGGAAATTCGTGCGGTAGCTGCAAACGAAGCACAAAGTGTCACATTAACATAAGGACTTTC 
GAATATATGTATGTGCTTTAAATATTACTCACCCGTTCCATGATGTCATGCCTAATGTCAGTGGCCGCGGCAAATAGCTCTAGGGTTTGATGAGC 
TGAGCGTTTCGATAAGCTGGCCAGCGTGTCCAAGACGAAGGCGTCTGTGATGCCCAGGCAATAGCAAATGTTCAGATACCGAAGGCGCTCATTGT 
TGGCCACCAAAAGGGAGAGCTGCGTTTCTCCAATCTCACGAGCGGACTGTATGGAAAGGCACTCTAATCGACTCAACTTCGCCAAGTGGTGAACA 
CACAGATCGTCACGAGGCTTGCAGACCAGCCACCGGAGACACTTCAAAGCACTGATGTACTGCGCCTGCTCCTCGTCAATCCAGCGTTTGCGGTG 
C AGG AT G AGGT AT T C C AAC T G A 
(SEQ ID NO: 1459) 



Exon: 9187.. 8575 

Ex on: 8513.. 5846 

Exon: 5771.. 4058 

Exon: 3999. .3631 

Exon: 3571,. 3257 

Exon: 3185.. 2874 

Exon: 2669.. 2363 

Exon: 2298.. 1676 

Exon: 1614,. 1001 

Start ATG: 9178 (Reverse strand: CAT) 
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Transcript No. : CT24825 

ATGCTGTCCATGCTGCAGGAGAAGCGTCCGCTGAAGCGCACCCGTCTGGGTCCGCCGGACATCTATCCGCAGGACGCGAAGCAGCGAGAGGACGA 
GCTGACGCCCACGAATGTGAAGCACGGATTCACGACGACGCCACCGCTGTCCGATGAATTCGGTACGGCCCACAACTCGAATGTGAACGCCAGCA 
AAGTGAGCGCATTCTTTAGCGGCGTCCTGGCCAAGAAGGAGGAGCTGATGACCCTGCCGGACACGGGGCGCAAGAAGCAGCAGATCAACTGCAAG 
GACAACTTCTGGCCCGTTTCGCCGCGTCGCAAGTGTACGGTGGACGCCTGGTTTAAGGACCTCGCCGGTAACAAACCCCTACTTAGCCTAGCCAA 
ACGGGCTCCATCGTTCAACAAGAAGGAGGAGATCTTCATTACTCTGTGCGAGAACCAGGTGAACATGCAACGCGCCACCTGGTTCATCAAGCTGA 
GCGCCGCCTATACGCTTAGCTTCACCGAGTCTAAGAACAAAAAGCGATCCATTTACGACCCGGCTGCCGAGTGGACCGGCAACATGATCAAGTTC 
ATGAAAGAGCTGCTTCCCAAGCTCCAAGAGTATTATCAGCAGAACCACGACAAGTCCTCTTCGAATGGCACCACTTCCGGAAGCCTGACCGCCGC 
CGGTAATGGTCCTGCCAGCAATGGCTCCACTGGCACCAGCTCTATTAACTCCGTCACCGGTAGCAGTGCTTCCACGAACGTAATCCCGGTACCCA 
GCATGGCCAGTCCACTGCCACCGATTCATAGCCCAGCTAATGGCCAACAGGCAGCTCCTGGCGGAGGAGTGAATGCCGGCAGTGTCATGCCGACG 
ACGGGAAGCCTTGGCGGAGTAGTCGGTGGGCCGGGGTCTTCTGTGGTAGGAGGCGCAGCGGGTGCAGGAGCTGCAGTTCCACCAGGCTCTACAAT 
TTCTGGAATTGGAAGCCAGTTCGAAGATAGTCGCAATGCGCTTAAGTATTGGAAATACTGCCACCAATTGAGCAAGTACATGTACGAGGAATCAC 
TTTTGGACCGACAGGAATTTCTAAACTGGATCCTCGACCTTTTGGACAAGATGCGTACGCAGGCAAGCTTCGACGAACCGTTGAAGAAATTGGTG 
CTCAGCTTTGCCCTTCAGTACATGCACGACTTCGTCCAGTCGGAGCGTCTTTGCAGAAAGATGGCCTACATAGTGTCCAAAAAACTGGCCCAGTT 
GCTGAACACTGTCGTAGAACAGCAGACCATCAAAGAGTTGGACGAGCCGAAGTTGCAACAAGATCCCTATGAGCTGGCGCTTCAGGAACAAATGA 
GTTGTCCCCATCATCGGGATATAGTGCTGTATTTAAGCACGATCCTACAGATCATCACAATCGAGTGCCCCACAGCTCTGGTCTGGAGTGGGATA 
GCAGCTCACCGAGCACCCTCCTCGTTGTTGGGTAGTCCGCTGGACCATTTGCCTTTGGCGCCCTCAGTCTTGCCGATGCCCACACGCTGTCCGCG 
AACCAATCACGAGATTCGACGTCAATTGCGGGCCGCCGAGTCGGACATCGTATTGCGCACTCAGCATGCAGAGCAGCGCTGGTTTGCAGCCAAGT 
GGCTGAGTGCCGGCAAGAATCAATACACCAGTGTGCTGGCTACACTGGACCACTTGGACACGCACTGCTTCGATCGCATGGAACACAATAATAGT 
ATCGATACCCTATACGCACAAATATTTCCCAGCCCAACTGTCAGTCGCAGGCGGGAGGAGGATCAGGTGGAACCTCGTCCGCCTTATGAACCCAA 
ACAAGACAAGGACACGGTACGCATTCTGTGCGAGTGGGCTGTGTCGGGGCAGCGTTGGGGAGAACACAGAGCCATGGTGGTGGCCATTCTACTTG 
ACAAGCGCCAGATCGACGTGACAAGCACGCCGGCGGATCAACAATCCAGTGATAAGGATGACAAGGACTCGCTAGCTTCGGGCGCCGGCTTAATT 
GATGGCCTGCCTGTGTTTCAGCACGTGTTAATGCACTTCCTTGACCACGATGCTCCCGTCCTCGATGAGCACGTGAGCAGTCCACAGCAGCGCAC 
GGAATTTACCAACCTTGTTCAGCTGTTCAGTGCATTGATTCGCCACGATGTCTTTTCGCACAACGCGTACATGCACACTCTAATTTCACGAGGAG 
ATCTGCTCCTGGAGTCGGTGCTGGTGATCAAATCAGGAACAACGGCCACGAAAACATCGCCGCCACCTCCAGCTCCACCACCAACGACGACGCAC 
GGTTTCGATGATGACGGATTCGGCGGGGGCCTGGACTTCAAACACAACGAATTTGATGATTCCAATGTGGACGACGATTTGGACAAACTGGTGCA 
AAATATTAAGGAGAAAGGACAACAGCACGAAGCTCCAGACAGTCCCAAGATTGGTCCTCCAGGCGATGGAGAAACCAATCCTGGCGGTAGCATAT 
CGCGGCATTACGTGTACACCAAGCACTTCCCCATCCCCCAGGATGATCCCAGCATGAGCAGCTACAGCAGTGAAAGCAACCAGCGATACATCCTG 
CTCTTCGGAGTCGGCAAAGAGAGGGACGAGAAGAAGCATGCTGTCAAGAAGATGTCTAAGGAAATTGGAAAGCTTTTCACAAAAAAGTTTAGCAT 
AGATGTGGCGGCGGCCGGTCATGTGAAGAAGCACTCGCGCAACGAGTTTAACTTCGAAGCTACCACGTCCAAGTGCCAGCAGATGGCTTATTTCG 
AC C AGC ATGTGGT C AC AGCC C AATG T GC GGC C AAT GT CC T GG AGC AGCT G AAC GGAT T C GC GT T AGGG AAC AAC AAC T AC CT GC C GGT AC AGG AG 
CACGTGGCGTTCCTCTTTGATCTGATGGAGCTGGCTCTTAATATCTACAGTCTTTTGGAACTGTGCGATAGTCTGTTGAAGGAGTTGCCTGAGGT 
GGAGCACCAGTTGCAGCTAAAGAAGTCAAACCTAGTGCGCAGTTACACCACTTCACTTGCCCTCTACATCGTCAGCATCCTGCGGCGCTACCACA 
GCTGCCTTCTTCTTTCACCGGAGCAGACGCTGTCTGTTTTTGAGGGAGTCTGTCGTACCATTCGGCACGTATCGAATCCGAGTGAGTGCACTTCG 
GCGGAGCGCTGCATTATTGCCTACCTGAGCGACCTACACGAGTCGTGTGTTTTGCTCCAGGGAAAGGAACAATCCACGGAGTACTACCAGCAGCT 
GCAGTGCATCAAGCGGTTCAAGGACATATTCAACACTCCAGAGCAGCTTGACCTACCACCTCAGGGTTACAACCCACTGTTGTTGCAGGAGCTTT 
TCATGGCCCCTCGACGCGGTGGTAAACTGGATCCTCACTGGCTGGGAACCCTGCACGAGTCGCCGGCCAATGTATACAGTTTCGTGTCTAACGCA 
TTAATAGCCGTCTGCCGAGAGACGGACAACGAGCGGCTGAACGATGTGGCCCTCGCATGTGCCGAACTGACCGCCAGCTGCAACGTCTTGTCTGA 
GGAGTGGATATACGCACTGCAGAGTCTGTGCAGCGGCAGTAAGAGTCCGCGTTATCCGCACCTGGGCGGACAGGTCGACATTGGACAACTTAAGA 
CTCACAACGCCCTGGCCGTGTTCGTATGCATTCTGGTGGCCAGACATTGCTTCTCACTAGCCGACTTTGTAAGTAAGTTCGCCCTGCCTACATTG 
GCCCGATCGGTAAGCGCTGGTGGAGCCGAACTGAGTGTAGATGCAGAGGCAGGTGCTCGTCTTACATGTCACTTGGTGCTGAAGCTGTTTAAGAC 
CTTGGAGATTCCTCAACCTGGCATGTACTCGGTAAGCACTTCCCCGAATCCACTGCACGCTGTGGGAAACGACTTTAGCATTCGGCTGAGCTGCG 
ATCGTCATCTTCTGGTGGGTGCCCATAAGACCATTCCCATTGCCGCAGTGCTTGCCGTGCTCAAGGCCATTCTCATTGTTGTGGACAATGCGGCG 
CTTAAAACACCATTGGCTTCCGGCAGCGGCACCTCCTCTGGCGGATTGGGCGGAGCTTTTGGAAGCGGCAAGCGTAGTGGCTTCAATACACCGGT 
CCATCCGGGAAGCACACCCAAGAGCAACGAACAACGACCGGCCGATCTCAGCCAGATCCTGGGCACCAGCGATTTACAATTGGGCAGCAGTTTGA 
CAAGCGAACCCGAAGCCCTGCAACAGCCATCGGTCGGAGGAATGGAGCAGATCTCGCTGTTGGAGTTCGCCCAGGCGGTGCTCAAGCAGATCTGC 
GCCCAGGAACACGTTCTGGAAAGATGCCTAAAGAACGCAGAGCAACTCTGCGACATGATAATCGACGAAATGCTGACCGCGAAACAGGCACAGCG 
GGTTCTGCACATGATCTGCTATCCGGAGCCTGAGTTTAACATCATCTCCGAACTGGATCAGCGCTCAATGATCGTACGCATACTAGAGAATCTCG 
GACAGTGGACACTGCGTATCTCGTGGCTGGATCTTCAGCTTATGTACCGCCAATCGCTGAGCAACAACGCGGAGCTTAATGTTTGGCTCGACACA 
GTGGCACGCGCGGCAATCGACGTCTTCCACATGGAGGAAGTTGTCCTGCCGGGTGCCGTGAAGGCGACCCACAAACCCAAACCGTCCACCTGGCT 
AGTGGCTCCACTCATCGCTAAACTAACACCAGCGGTGCAGGGCAGGATACTTCGAGTGGCTGGTCAGGTTCTTGAGAGCATGAACTATTTCTCCA 
AGGTGTCCAAGTCCGATTGCAATAGCTCGGGCAGTGGTGACGAGCGCGAGAAGAGCAACAGTTGTCACAGTAGCAATAGCTATGGACTGGGCGGC 
GTGCCCGCACGCAACAAGAAGATGCCGCTTGATTATCAGCCTTTCCTGGGTCTCATCCTAACTTGCCTGAAAGGCCAGGACGAGTACAAGGAAAA 
CTTACTGGTTTCGCTCTATGCCCAGTTATCACAATGTCTGCAATCCTTTGCTGAGCTGGACACAATCGGTGGCATTGATGAGCCGCAGGCGCGCG 
AGGAGATCCTCGATGCGCTGCAGTTGCGCTTTTCGCTGGTGGGCGGCATGTTCGAGGCTATCCAAAAGAACAGTACCCCCACCACAGACTGGGCT 
ATTCTTCTGGCCCAATTGGTTTGCCAGGGCGTGGTAGATCTCAGCTGCAATCGTGAGCTGTTCACCACCGTTGTGGACATGTTGGCCACATTGGT 
GCACTCAACGCTGGTCTCTGATGACGAGAGGCACTACATGAACCTGATGAAGAAGCTAAAGAAGGAAATCGGAGAGAAGAACAACGCCTCCATTC 
GGGTTATTCGGCAGCTTCTTCCACTTTACAAGCAGCCTACTGAGGTGATTGCGTGCGAACACTCTGGAATGGACACCAAGGGCAACAAGATCTGC 
GATATAGACAAAAAACAACTACGCATTTCGGACAAACAGCGTATCAGCGTGTGGGACATCCTTGAGGGTCACAAGAACCCGGCTCCTTTGTCCTG 
GGTGTGGTTTGGTGCAGTCAAGCTGGAGCGCAAGCCCCTTACTTACGAGGAGGCTCACCGCAACCTGAAGTATCACACGCACAGTCTGGTCAAGC 
CCAGTAGTTATTACTATGAACCGTTGCCACTGCCACCTGAAGACATTGAGCCTGTTCCCGAGAAGATTTGCATAAAGGATGAAATGAAGGCGGAC 
ACGCCCTCGTCGGTGGATCAGTCGCCAAGCGCTGTGGTTGGCGGCACGGGAAGAGGTCGCGGCAAGGGAACGACCACGCGGAAACGCAAGCCGAA 
GAACCCGAAAACGCCACCGGTGGTCAATACTCAGCAACAGCAGCCTCAGCTGGCACAGCAGCCTCAACAGCCGCAGAATGTGCAGCAACAGCAGT 
TGCAACAGCAGCAGCAACAACAGCAGCACATGCAGCAGCAGCACATGCAACAGCAGCAGATGCAGCCCAACCAAATGGGCCAGATGCCAATGAAC 
ATGCCGATGAACATGCAGCAGTTTGCACCCAATCCGAACAACATGATGCAGCAGAATGCCATGTTACAACAGCAGCAGCAGCAGCAGATGCAGCA 
AATGGGCAACAACCCGATGCAGCAGCAGCTGAATGTGGGCGGTGGGAACGGGCAGCCAAATCCCCAGATGAACTTCATGCAGCAGGGACCAGGGG 
GCGGTGGAGCCGGACCGCAGGGTATGCCTGGCCAGCAGCAACAGTGGCACAATGCGCCGCAGCAGCAGCAGCCGCCACAACCGTACCACAACCAA 
TACGCTCCGCATCAGCAGAACATGCAGTCAAATCGCATCGAGCGCCCGCCGTTGAATGCCAACTCCAAGCAGGCGCTATCACAGATGCTACGACA 
ACGACAACCATTCCAGCAGCAAGCGCAACAGGGTCCGGGGGGTGGTTTCAATCCGATGCAGCAACAACCACAGGCTTCGCAACAACAGCCCGGTC 
CTCAGCAGCAGATGAACCCCAACCAAATGCGCCAGCAGCAAATGAATCCCCAGCAAAATCCTCAGTCTGTCGCTGCATTCAACGCCATGCAGCAG 
CAGCCGCAGCAGAACGCCCAGCAGCAACAGATGAATCCCAACCAGCAGCAACAGCAGCAGTTCATGCGCGGCGGCAACATGCGACCCGGCATGGC 
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GCCCAACCAGATGAATCAGATGAACATGGGAGGTCAAGGCATGTCGCAGAATCCGATGATGCAGCAACAAATCCCACAGAATATGGTGGGCATGG 

TCAACCCGAATGCCAACCAAATGATGCAGAGCGGTGGAGCTCAGGGAGGCAACGGTGTGGGCGTTGGTGTTGGCGTCGGCGTTGGCGGTGCCGGC 

AACAATCCCAACATGGGCATGGGAGGCATGCCTCAGCAAGGCATGATACAACAGCAGCCGCAGCAACAGCCGCAGCAGCAGAGGGCGGTGGAGCA 

GGCGTTGGAGTGGGCGTCGGCATGGCGCCCAATCAGCAGCAACAGCAGCAGGCCAACATGATGGGCAACTTCAATCCGCAGATGCAGCAGGGGAA 

TCGCAACAATCCCGACTTTATGGCCGCCGCAGCGGTTGCCCAGCAGCAGCAACAACAACAACAGCAACAGCGCGTTGTTCCCGGAGGAATGATGG 

CCGGCAACCGGAACCAGTACATGAACCAGGCGCCCAACGTAACCATGTCCACCATGATGGGTCCGGGGCCGGGTGGCGTCGTCGGCCAGGTGCCT 

CCGTACGCCCGTCAGCAGTCGGCGGGAGGAGGCAAACCCGGCGTCCTCAACACCCAGCAGCAGTTCCAGCAGCAGCAACAGCAACAACAGCAGCT 

GAGGCACCAAATGATGCAGTTGCAGGGTATGGGTGGTGGGGCAGGCGGCGGCATGGGAGCCGGGCCTCAGCAGGGGGGTGGCGCCGTTGGCGGTG 

GCGCCGGAGGCGGAATGGTGCCGCAGCAGCAGAGCATGAACCAGCAACAAACGCCGAATCTGGTGGCTCAGCTCCAGCGGCAGAACATGATGGGC 

CAGCAGCAGTATCAACCACCACCGTACTAG 

{SEQ ID NO: 1460) 

Start ATG: 10 (Reverse strand: CAT) 

MLQEKRPLKRTRLGPPDIYPQDAKQREDELTPTNVKHGFTTTPPLSDEFGTAHNSNVNASKVSAFFSGVLAKKEELMTLPDTGRKKQQINCKDKF 
WPVSPRRKCTVDAWFKDLAGNKPLLSIAKRAPSFNKKEEIFITLCENQVNMQRATWFIKLSAAYTLSFTESKNKKRSIYDPAAEWTGNMIKFMKE 
LLPKLQEYYQQNHDKSSSNGTTSGSLTAAGNGPASNGSTGTSSINSVTGSSASTNVIPVPSMASPLPPIHSPANGQQAAPGGGVNAGSVMPTTGS 
LGGVVGGPGSSVVGGAAGAGAAVPPGSTISGIGSQFEDSRNALKYWKYCHQLSKYMYEESLLDRQEFLNWILDLLDKMRTQASFDEPLKKLVLSF 
ALQYMHDFVQSERLCRKMAYIVSKKLAQLLNTVVEQQTIKELDEPKLQQDPYELALQEQMSCPHHRDIVLYLSTILQIITIECPTALVWSGIAAH 
RAPSSLLGSPLDHLPLAPSVLPMPTRCPRTNHEIRRQLRAAESDIVLRTQHAEQRWFAAKWLSAGKWQYTSVLATLDHLDTHCFDRMEHNNSIDT 
LYAQIFPSPTVSRRREEDQVEPRPPYEPKQDKDTVRILCEWAVSGQRWGEHRAMVVAILLDKRQIDVTSTPADQQSSDKDDKDSLASGAGLIDGL 
PVFQHVLMHFLDHDAPVLDEHVSSPQQRTEFTNLVQLFSALIRHDVFSHNAYMHTLISRGDLLLESVLVIKSGTTATKTSPPPPAPPPTTTHGFD 
DDGFGGGLDFKHNEFDDSNVDDDLDKLVQNIKEKGQQHEAPDSPKIGPPGDGETNPGGSISRHYVYTKHFPIPQDDPSMSSYSSESNQRYILLFG 
VGKERDEKKHAVKKMSKEIGKLFTKKFSIDVAAAGHVKKHSRHEFNFEATTSKCQQMAYFDQHVVTAQCAANVLEQLNGFALGNNNYLPVQEHVA 
FLFDLMELALNIYSLLELCDSLLKELPEVEHQLQLKKSNLVRSYTTSLALYIVSILRRYHSCLLLSPEQTLSVFEGVCRTIRHVSNPSECTSAER 
CIIAYLSDLHESCVLLQGKEQSTEYYQQLQCIKRFKDIFNTPEQLDLPPQGYNPLLLQELFMAPRRGGKLDPHWLGTLHESPANVYSFVSNALIA 
VCRETDNERLNDVALACAELTASCNVLSEEWIYALQSLCSGSKSPRYPHLGGQVDIGQLKTHNALAVFVCILVARHCFSLADFVSKFALPTLARS 
VSAGGAELSVDAEAGARLTCHLVLKLFKTLEIPQPGMYSVSTSPNPLHAVGNDFSIRLSCDRHLLVGAHKTIPIAAVLAVLKAILIWDNAALKT 
PLASGSGTSSGGLGGAFGSGKRSGFNTPVHPGSTPKSNEQRPADLSQILGTSDLQLGSSLTSEPEALQQPSVGGMEQISLLEFAQAVLKQICAQE 
HVLERCLKNAEQLCDMIIDEMLTAKQAQRVLHMICYPEPEFNIISELDQRSMIVRILENLGQWTLRISWLDLQLMYRQSLSNNAELNVWLDTVAR 
AAIDVFHMEEVVLPGAVKATHKPKPSTWLVAPLIAKLTPAVQGRILRVAGQVLESMNYFSKVSKSDCNSSGSGDEREKSNSCHSSNSYGLGGVPA 
RNKKMPLDYQPFLGLILTCLKGQDEYKEWLLVSLYAQLSQCLQSFAELDTIGGIDEPQAREEILDALQLRFSLVGGMFEAIQKWSTPTTDWAILL 
AQLVCQGVVDLSCNRELFTTVVDMLATLVHSTLVSDDERHYMNLMKKLKKEIGEKNWASIRVIRQLLPLYKQPTEVIACEHSGMDTKGNKICDID 
KKQLRISDKQRISVWDILEGHKNPAPLSWVWFGAVKLERKPLTYEEAHRKLKYHTHSLVKPSSYYYEPLPLPPEDIEPVPEKICIKDEMKADTPS 
SVDQSPSAVVGGTGRGRGKGTTTRKRKPKNPKTPPVVNTQQQQPQLAQQPQQPQNVQQQQLQQQQQQQQHMQQQHMQQQQMQPNQMGQMPMNMPM 
NMQQFAPNPNNMMQQNAMLQQQQQQQMQQMGNNPMQQQLNVGGGNGQPNPQMNFMQQGPG^ 

HQQNMQSNRIERPPLNANSKQALSQMLRQRQPFQQQAQQGPGGGFNPMQQQPQASQQQPGPQQQMNPNQMRQQQMNPQQNPQSVAAFNAMQQQPQ 
QNAQQQQMNPNQQQQQQFMRGGlsO^RPGMAPNQl^Ql^MGGQGMSQNPMMQQQIPQNMV 
NMGMGGMPQQGMI QQQPQQQPQQQRAVEQALEWAS AWRP ISSNSSRPT* 
(SEQ ID NO: 1461) 

Classification: receptor 



Celera Sequence No. : 142000013384518 

ATCCATACGAGACATGCAACCACTCTGAGGTCAACCGGTTGTCTAATTTATCTATTCTAGAGAGTTGAAGATCAGCCATTCAGCCATTTGGCCAA 
CTTATTTTTGTTGCCAACTCTATGTGGCGATGTCTCGGCACCCGCAATTATTTAAATGTCCCAGAGACAGAGATTTGTAAATTGCACGGCTACAA 
AGTGCATATTGTATGGTGCCATTAAATGGACCCACAGGGCGTATGCGCAATGTGTCGCACTGAACTTTTTCACCTAGCCCCAGCTCATGAATTAA 
TTCAGCTCTGCATAAATAACAAAGTTATTCAAATTCGCGAACTGTCTACCTAAATCACCAATGACTTTGGACTCCTCT^ATGCATTTCATAGAATT 
TTCCACATTTTTCCAGCAGTATGCAGTATCGTATAAAATGATCTTGAAAGAACCTTAACGTTTGGGGCACAACTATTGGGGTTCTTGACCAACCT 
TTTCTGGTTTTCAATGATAGAGTCAAACATACGTTTGTAAAGAATGGATATGCAAATATTCCATTCGTCGTGTACAACATTAAAAAAAATATATT 
GATTGTTTTTCTGCGGGGATTTTGTTGTTTCGATTTATTCGGCTTCGGTTTTGGATTTCGTTTTATTCGGAATTCTCTTTTTGCGCTTCCGGCTT 
GCTATCTTTGTATCAGTTTTGAGTATCATTGAAGCTGGCCAACTGCCGAAGGTCGCCGATCGGCACTCACCTCAATTGTGGTCCATAATTAGTCG 
TGCAGCTGGGGTGATACCTTCCATATGAAGATTCATAGTGCCATAGTGAAATAGCAATGCGTCCTGCATTTAAATGAGGGAAAGCACTGCCACAG 
AGGAGCACCCACAGATAGATTTGCATCTCCGTGCGGGCTCCACATAGTCGAGACGACAGGCCATAAATCAGAATGAAGAATACTATTGAATGCTA 
ATGTTGTATACTAATTGCCGAATATGTATCTTTTCCGGTTACGCTTGCAGATGTCGCCCAACATAATAGGCAGTACCTTTATATTGGCCGAGACG 
GCCATTGCCGATGGCAACAACAACAAAATGGCTGCCACACCAGCTGCTTCCGCCGCCCCTTCAGCAGCCACGCCCAGCCAGAAGGTGGCCAAGAA 
AGCGGAGGCCAAGACGCCCGAGAACAAGCCCTACGAACTGGAGATCGTCTGGCGCAACGTGGGACTCTTCGTCATCCTCCACTCGATGGCCCTGT 
ACGGTCTGTACTTGGTGTTCGCCGAGAGTGCCTATTGGGAACTCTTGCCAGGTGAGTTTGAGTATCAACACGTTTCGTCTTTGCCTTACTAAGCG 
TCTTGGTTCATCTGCAGTCTATGCCACCATGTTCCTCGGCGGTCTGGGCATCACGGCCGGAGTGCATCGTCTGTGGTCGCACAAGGCCTACAAGG 
CCAAGCTGCCACTGCGCATCTTCCTCATGCTGTGCCAGTCACTGGCCTTCCAGAACAGCATCTGGGAGTGGACCCGTGACCACCGCGTGCATCAC 
AAGTTCACCGACACACACGCCGATCCCCACAACTCCCGACGAGGCTTCTTCTTCGCTCATATGGGCTGGCTGATGTGCAAGAAGCACCCCGATGT 
GACCAGCAAGGGCAAGCAGATCTCCATGGAGGATATTGAGCAGGATCCCGTCGTTATGTTCCAGAAGAAGTGAGTAGTGTGCTATCTATGATTTG 
AAGGCTATAGTATAATAGATCTGAAAGGTTTATACATCCAGGGTGTTCTTGAAGTTTCCCAAGTAATTAATTTGCTCGCCCAAAAGTAGGCAATG 
AACAATAAACAATGTTCTATTGATTCTTAGAAGTTTAACTTTAATCCCTCTATTATATGTACTTATAACACCTCTTATTTCCTTAGAATGTACTT 
TGTGGTTATGCCCATCTGCTGCTTCGCCCTGCCCATGATCTTCCCCTACTACGTGATGGGCAGCTCGCTGCGCGTCTGCTTCTTCACCTGCTCCA 
TGCTGCGCTTCTGCCTGTCGCTGCACTTCACGTGGCTGGTAAACAGCGCCGCCCACTTCTACGGCATGAAGCCCTACGATGTCAATGTGAGCGCC 
ATGAACAACAAGCTGGTATCCACCCTGACCATCGGCGAGGGATGGCACAACTACCACCACGTCTTCCCCTGGGATTACAAGGCAGCCGAGCTGGG 
CACCTACAGTTTCAACTGGACGACGGCCTTTATCGATGTGATGGCTAAGATCGGTAGGTTCTGCTGCGCTCCATGCCCTTGCATATTAACTTTAT 
CTTTATCTCTCATAGGACAAGCCTATGACCTGAAGTTCGTCTCGCAGGAGATGGTCTACAAGCGAGTCCTTCGCACTGGCGATGGCTCCCACATT 
GCCGCCCTGCTGGATGCCAACAACAATAGTGCCATTCCGACCAGCGAGCTGGTGGCCCACCTGGATCACGAGAAGGAGGAGCATGCCATTTGGGG 
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ATGGGACGACAAGGATATCAGCGAGGAGGATCGCAAGGGAGCCAATGTGGTGAACAAGGAATCCGAATGCAAGCTGGATTAGAGATCGGGGATCA 

GTGGGAGCTCACCCAAAACGAACACACTTCTTGCTGCCACATCGGGTCTTCAGTATTTAATTATATTAATTGCCTATTCTCCGTACTGCCTGTGC 

ATCTGAGTGGTGGCTGTAAACTGCAAACTGAAAATAACTACCTAAGGGGGATTCCGATCGTGGGACCTGGTATGATGGTAGCAAGGACATCTTAT 

ATATCAATGGCTTCAATTCCAAATACTACAAAACAAATGAAAACTATTCGAGATTAGTCAATGCAACTCGTTCGCAGCAACAGAGTGTTAAAAAA 

AACCTGCACCCTCTACACAATCCCAATCTCGCACTTGTCGTTCGGGGATAATGAACACATCTGCCACCCTCTCCCTCTCACTATCTCTATCTCTA 

TTTTTATATATATTAATTAACTGTATATTTACGCAAGAGGTTCTAGAATTAAGCGTACCTGCTAAGATCTATTTTAAGCATCGACTTTTATGTGT 

GTGTACATTGTGTTAGTTATGTAGGTCTAAATTGTATATTTCGATCCAACTATTGTTGAAAATTATAACAAAAATTATAAAAAACCAAACAAAAA 

ATATTGTAAGCAACTTATTATCGAAAAGGGATGAAATACGCAATACTTCTCAAGATCCAAATGAGTGTTTAATTGCTTAATTGATTAAATATTTT 

CATAGTGAGAACTAATTGTAATGGTGAGTAAGTACAACATATCAATAAACATTAATACATTTGTTTAACTCGTGCCTACTCTCGGTTTCTAAATG 

TTTACGACTTCTACTGTGTATAATTGCTGTAATTATATTTTGAAGTACAATAAATTTCCCCATTTCAAATTTATTAGGTGTCATTAACCTTTTAA 

CACCTTGAAAGGGCTTTAATTCTGTAGTTAGCTTAAAGTTGGTCGTTAAACTAAAAAGTCAAATGCATTTTGCGATTGCATTGCAATGCATTTGC 

GGATAATAAATAAATATGGCAAAAAGTTTATATTTTT 

(SEQ ID NO: 1462) 

Exon: 1001. .1286 
Exon: 1348.. 1684 
Exon: 1892.. 2238 
Exon: 2296.. 2552 
Start ATG: 1073 

Transcript No. : CT25031 

ATGTCGCCCAACATAATAGGCAGTACCTTTATATTGGCCGAGACGGCCATTGCCGATGGCAACAACAACAAAATGGCTGCCACACCAGCTGCTTC 
CGCCGCCCCTTCAGCAGCCACGCCCAGCCAGAAGGTGGCCAAGAAAGCGGAGGCCAAGACGCCCGAGAACAAGCCCTACGAACTGGAGATCGTCT 
GGCGCAACGTGGGACTCTTCGTCATCCTCCACTCGATGGCCCTGTACGGTCTGTACTTGGTGTTCGCCGAGAGTGCCTATTGGGAACTCTTGCCA 
GTCTATGCCACCATGTTCCTCGGCGGTCTGGGCATCACGGCCGGAGTGCATCGTCTGTGGTCGCACAAGGCCTACAAGGCCAAGCTGCCACTGCG 
CATCTTCCTCATGCTGTGCCAGTCACTGGCCTTCCAGAACAGCATCTGGGAGTGGACCCGTGACCACCGCGTGCATCACAAGTTCACCGACACAC 
AC GCCGATCCC C AC AACTCC CG ACGAGGC TTCTTCTTCGCT C AT AT GGGC T GG C TG AT GT GC AAG AAGC AC C C CG AT GT G AC C AGC AAGG GC AAG 
CAGATCTCCATGGAGGATATTGAGCAGGATCCCGTCGTTATGTTCCAGAAGAAAATGTACTTTGTGGTTATGCCCATCTGCTGCTTCGCCCTGCC 
CATGATCTTCCCCTACTACGTGATGGGCAGCTCGCTGCGCGTCTGCTTCTTCACCTGCTCCATGCTGCGCTTCTGCCTGTCGCTGCACTTCACGT 
GGCTGGTAAACAGCGCCGCCCACTTCTACGGCATGAAGCCCTACGATGTCAATGTGAGCGCCATGAACAACAAGCTGGTATCCACCCTGACCATC 
GGCGAGGGATGGCACAACTACCACCACGTCTTCCCCTGGGATTACAAGGCAGCCGAGCTGGGCACCTACAGTTTCAACTGGACGACGGCCTTTAT 
CGATGTGATGGCTAAGATCGGACAAGCCTATGACCTGAAGTTCGTCTCGCAGGAGATGGTCTACAAGCGAGTCCTTCGCACTGGCGATGGCTCCC 
ACATTGCCGCCCTGCTGGATGCCAACAACAATAGTGCCATTCCGACCAGCGAGCTGGTGGCCCACCTGGATCACGAGAAGGAGGAGCATGCCATT 
TGGGGATGGGACGACAAGGATATCAGCGAGGAGGATCGCAAGGGAGCCAATGTGGTGAACAAGGAATCCGAATGCAAGCTGGATTAG 
(SEQ ID NO: 1463) 

Start ATG: 73 

MAATPAASAAPSAATPSQKVAKKAEAKTPENKPYELEIVWRN 

YKAKLPLRIFLIXILCQSLAFQNSIWEWTRDHRVHHKFTDTHADPHNSRRGFFFAHMGWLMCKKHPDVTSKGKQISMEDIEQDPVVMFQKKMYFVVM 
PICCFALPMIFPYYVMGSSLRVCFFTCSMLRFCLSLHFTWLVNSMHFYGMKPyDVNVSAMNNKLVSTLTIGEGWHNYHHVFPWDYKAAELGTYS 
FNWTTAFIDVMAKIGQAYDLKFVSQEMVYKRVLRTGDGSHIAALLDANNNSAIPTSELVAHLDHEKEEHAIWGWDDKDISEEDRKGANVVNKESE 
CKLD* 

(SEQ ID NO: 1464) 
Classification: enzyme 



Celera Sequence No. : 1420000133838 64 

AATCCAAAATCTTGCCCTGAACTTGACTGTCTGCATGTGATACGGTTGAACGCCTTTTGCGGAAATTATTGCAATTTTCTTGTCTTGTCTTCTAG 
CCGAACACACATAGCACCGTACATTGGGGTTTTGACGGTGGGTTTACTTTCACTTTTGCCGCACTTACTCAACAAGTTGTGGGGGCCCGCTGAAG 
TTTGCTCACAGGGGAAACATTCATCAAAAGATTGAAGAGCTCGTCTTTTCGCTCTAACTCAAATAAGAAGTTAAGTGCATAAGTGAAGTTAGCTG 
CACATAAAAATAGTTTTTTATTGCAGCGCAATTTTTTCTAGTTTCTAGCGAGTTTTAAAAGCGTTGTGTTTACAATCTTTTAAAAAGAAATTTCA 
AAGATTTTTACTACGAAGTATTCTTTGCATTACTCCTTATATAAGTTCTACTTTATATTGAGATCAAATATTAAAGTCTTGTCTTAAAGCTTGAT 
AGAACTTAGTAATGACTTATAAGCTTCAAATTAGAGTATTTAATTACACAGAATAATTAGTTAGTTAAATTGAATTCATTGGTGTGACTTAGAAA 
GCGCCACAATTCCTTTTGTTTATTTTCCCTCACTTTTCCAAACGAGTGCAACGTCAAACCCTTAAACAAAGTCCACTCCACACTGATTGCAACTT 
TGCCAAAGACCAACAAATCATTTAAACAAATCAAACATCAACAGGCGACCGGCGAACAACATGGAGACCTATTTGACGGAGATCAAACAGTTGCG 
CATACCACGCCCGAAATTTGAGCCCAACAGTTGCCATCAGCAGCACCATCATGCGGTGACAGCCGTCGGAGCCACGCCCATCCAGCCGCCCCCGC 
CCCTGCACCACCGTCAGTGGCGACACTTTGTGCGACCCTTTACGCCGCCACGCGATTACCATCTGCCCATGGTGGCCGAGGGCTCCTCCATGCAC 
GACAAGATGTGAGGAGGATTCACATTCAGTGGATGGGCAAACATGATGCATTACTCCCACACATAGCCCCACATAATATACGATACCTACTTAGG 
ATAGTGCTTAAGTCTAGATGTTAGCTTAGTTAACGTACGATAGATACAGATACAGTTGCAGATACAAAGAATTAAAGTATACAAAAAATGGAGAT 
TTCTTTATAATTCGGGTTTTTAAGTAAGTGTGCTGATCATGAAATGAAACTTCATGGCATGAGCAAATGGATTTTTGAGAGTGTTACTTGGAACT 
ATGTTGATAATCTGTTGCTCTACCATTTATAAAATATTATATTTAATACCCAGAGATATCAGAACCTACGACAAAATCACGATCGTTTCGCTTAC 
TCACCGCATCTGGTTCGTATTCCGGTAAAAGGGTTTCGCTGCCCCGGCGAGTTCTGTGCTCATCACAGCCGTATTCCTAAGCCGGGATAAAAGAA 
TTTACAAGATACATTTTTCAGCGGCTTATGTGCTTCTTTGCTCGACGACCAATTCGGTTTGGCACTGGTCTAAATGCGACTTAACAGTATTACAA 
CACTCAATAGCCAATTGGACAACATTTAATAAGGAAGTACAACATAACCGGAAACAGAGACAAAAACGCAAACATCAACCACATTGAGCAGTTGG 
ACAAACATGGAAAAAGATTCAAAAGTATAAAAAACGACAATAGACGTTACACACAACAAGAGACAAAGATACAAAAAGTAGATAGACAGAGAGAG 
ATAAAACCACTAAAGATACAAAATAAAGTAAGACATATACAACAGACAGACAAGAGCTGCAGATTGCAAAACTCGGCGATGTAACAGCCGAATGT 
CAATTTTTTTCAATAAAAGCTTAGAAAAACTTAAAATACAGCATATTCATCAGGCAGAGATTTCGGTGCCAAAAAAAAAATATTGTTAGAAAATA 
AAGAGGAAACTTTATCAGGAGACTCATCGAGTAATTTTCTTCAAAGAGCCTAGATACGAATGTTTCAAGATAGGTATCTTTGACAGATTAAAAGT 
TTTTAGAACAGCTTGCGGAACAGTGAAATTAGACCTTCAGTGGTAACAATATACATATATAGAACTTTTTCAGCTTGCTACCCTATTCGCGCGAC 
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TCGCTTCAGTCCTGACCTGTTGCTGATTGTCTTTGGTCTGCCTGAATTTCAATTGACTTGGTCCCAACGGATGGTGATGATGACGACCGGATCCC 
ACGTTGCCGCCGATACCGCCCAGATGACGCAATGAGGCATGGTGGGGTTGATAGTGTTGCTGCTGCTGTTGTTGTTGTTGTGGAGGCGACAATGA 
TGATGAGGACTGGTTGCTGCCTGTCTGATAATGTGATGCTGGCTGTGAGCGATGCAGTATGGGTCCCGCCGACATGGGACGATGGGGCAACTGTA 
GTGCGGGTCTAGCGTATACATATACACAGGTCAGTTAACGGTTATGATGGGAAATGGGTGCTTAGGGGTCAGTGTTGGGAAATTCCTGAATAATA 
TTTTCGAGCTATATATCTATATAGCTATATATCTATATATATATATCTATATATAGTATATATATATATACTTTTATAATAAGTAAAATTTTAAT 
TTACACAATGCTAAAATTTTGTAGTTCCGAATAAAAGCACTTTTAGTTCTATAAGCATTAATCATCATCATCTTTTCTTTTCACACTATGATAAT 
GTTAAAAACGCATTTGTGAATCTTTTGTAAACTACTACTACCCTATCCAACACTGTTCAGGATGGGATATTTTGGAATATGGGATTATATGGATT 
TTAGGTAAGTGGCTAGGTACCGTTTGTATATATCCGTTACGGTCATGCCCAGGCCCGAGTCCGACGAAGGACTCCGTCCGGTGGCGGCTATTATT 
TCCAAATCATCAGCAGTTACCTGTTGGCGAACGGAACGGAATGGTACGGAATGGAACGAATCGAACGATTTGGTTTGCATACGATTTGCGAGTGA 
GACAGTGACAGCTGAAAGGTTTGACAGTGAGTGAGAGTGAGAGGGGCTCTAGCCTAGGCTTCTGTAGAATTCTACCAAGCATAGACAGAGAGATG 
AAGAGAGAGCGAGTCAGAGAGAGAGCCTAGGTAGAGCGACAAAGATAGAGTGATAGGGAGTGCATTCGTACTTCGATCGGTTGTTAAATCAATTG 
GTATTTTATATATTGGTGCTGATTCATGGTGACTGTTGTGGTTTCTTGGTTATCCTTTTAAGTGCATTTTTAACGTGGGATTACATGAGATCAAT 
TTGATATTGTACAATAATTTTATTACTTAGTGTACTAAATCTAAAGATTACTATTAGGTTGAGCTTTGTCATACCCCAATTATCAAAGCAGATAT 
GTTTTTTATATTCCGTATGTATGTACATCTACAGTTTGGCTCATTTGGAGTCAACTGCACTTTTGAAAATAATTAAATTGGATTAAGTTTCCACA 
AAGCAATTTTTCCGATCGATTTGGTTTTCTTTTTGGCTTAAAATACATAATTTTACAAACAGTCTTCTTCTTATTGCTATAAAATATGTAAAAAC 
TTCGCCACTACAATGTAGAGTAAAGTTGTATGTATTTATATGTGGGTATTGGTTGGAAAAATTGACATTCGATTTGATTGTTTTTGTTTTGAAAT 
GCAATTTTACTTGCAAAGGGCCGAATCAGATGTAAGCGAGTTTCTGAAAGCTTTCTCAAAAGTAAATAAAAGAAAATTACATGCATTTGTTTTGT 
AAATGAAAGAAGCAATAACTGGTTTAGAGTCGGGTGCCTAAAGAAAGTATCAAGTGTACGTATCTCGTTCAACAATTAAACTACAAGTTTTGTTT 
TGATTTTCTTTTGTCGAGTGTTATTTGTATTTTTTCTACAGTTGGAACAAATAAGTTAAAGTAAAAGTTACAAACATACAGTGACAACAATTATT 
AACGACAACAGCAAGAAGAGAAAGATATACTAAACAATAAACACATTGTGGGAATTATATAGTATAGTATAGGTAGTAGAGTAGGTATATAGACG 
AATAGTAGACAGTAGACCGGTTGAACAGTAGTCAAAGTTATTTAGAGTTTATTTGGCTGAACATGCTCAGTGTGTATTTTTGGGGGATATAGTCG 
TGTCTCCCTCTTAACTAAGTGTATATAGTGTCTGTGTCGTAGAATAAATCTTCTAAAAAGCTTACCGGCAAATCACGGGCGGGGTCTCCCTTACG 
CTGTAAAAGATTATAAAGTTTACATATGGGATTAACAGGGTATAGAGATCAACGGGCTTACCCTATGCCTATCCACGCTATAAGGATCGCGTTCT 
CTGCGATAGACGAGCTCATCGTATCGCGTGCGATCCTCGGGGAAAACTAATTCCCGTATTTCCGTCACTAGCGTGTGCTAAGATGGGAAATTGAT 
TGATGTATATTAAGTTTCTCTTATTCTCTTATCAGGTATCCCTTGACTAGGTTAAACACTCACCTTTTCGTACGTATCTAACATCTCTAACAAGA 
CGGCAACAAAGGCATCTATGGTCATTGCTTTCGCACTATATTCGGCAATATAGTAGGAAAGGCTTGCAAAGTGATGACGGGGCAGCAGGGAGCGC 
GCCTTCTCCTCCACCATGGTCGCAATTTGACCTTTTCGGCGTGCTCTGTAAGGCAAAAGCTAGATGTTGTAGAATCTCTTGAAAAGAGATTTTAG 
ATCAGTCGAGACCCACCGTGTGCCCGTTGGACTGTAGGCGTCCCAGGGCTCCATCTCGATGGAGGTGCAGGAGTGGGGCACCTTGCCCACGTCAC 
GTATCACCAGCGACATGCGTTTGTGGrACTTCAACTGACCCACCGCCTCGTCGTGCGTCACATCGAGAAAGGATTGCCCATTGACCTCGAGGATC 
TCGTCGCCAATCATCAGTCCGGATCGATCTGCCACGCTGTCCTTATCTACGCCGGTGACAAAGATCCCCAGGCCGTACTCCACGCCGCCACGGAT 
CATCAGGCCCAGAGACTGACCAGGTTCGATCAGCAGCTCCACGCGACGTATGCTATCGGGTATAGAGTGTTAAGTTGTTAACTAAGAGCTAGAAG 
CTAGATGAGGTAGGAAAAACAGGTATGTATTTAAGGGTAAACCCTATATTTATCGGGACGACATTATCAAATTTTTTAGTTCAATCTCAGAGTAG 
CGAATCCTAAGTATTATGAATATTGGTACATTTTAGTGCCTTTTGTTAAAATTTTTTTTTTTTTTTTCAAAAAGGTCTGCTTTATTTTGAAATAT 
TCTATATATACTCACCGATCCCTTCGCTCGCTGCGCCTGCCCCCGTACTCCATGGGCGGGGAGGCGGGTCGTCCGTGGCGGTCCATCCAGGAGCA 
GGTCTGGCGGAAGGGCAGACCCGCCCCGGAGGCCGCCGCCGCCGCTGCTGCCGCTGCCGCCTGGTTCTGTGGGTGCAGGGGGGGAGCCATCGATG 
CGTACATGGGGTCGCGGGAGGGCGGACCGAAGCCGTGGAGCGGGGCGGTCGAGTTCAGAGTGGGCGGCGCTCGCACCGTCATACTGATTTGACGT 
GA C GA T TT C AAT AT C T GC AC AC ATC T CT GC GT T AAG AAAAAAT GT GGC AAT AG AAG AGAC AC AAAAAAG AG AGG AC AGAT GC AAGC AG AACGG AA 
CGAACCAACAAGAGAAGAGAGAAAGAGAGAGAGAGAGTGGGAAAATGGAGCAAAGTACTAAAGGAAAATTAACAAAATGAAATGGAAATTCGCCA 
ATTGCTTGGACAAAAACGTGTACGTAACGTGGTGCCAGAATTTTCCGATATTATTAATTAACTTTTACTGTGAGCGTGTTCGTTACTCGAAATAC 
CACCCAAAATTTCGCCTGACTTTCATTTGTCGGCGGCCTTTTGCGGTCTGATTTTGTTGTCCGTTCTCAGTGAAGCTTGCCACAATTTGCTGTCA 
TTATTTTAATTGTTTCTGACCAAAAAACTAATGTGAGAATGTGCAAAGTTGTGTTGGTCCATGGAGAAAGGGGCTATGTGCGGGTGCTTTGTGCA 
AATTGGTCTGTGAATTCTCGCAGGTGTTGCCTGCTTTTAGGGGCTTCCAGGTTCGGTTAAAGTCCTAATTTTTCGGTTTGTGGTGAACTTTTCGG 
CCAACTGCACTTATTTTTAATAAATGTATAAGGATTATATCAAACTGAAGTGCAGCATCATAGCATTTGTGGT7AGATTCCTTGACATCCAGCAA 
GGATGTAAATGTGTTGTGAAATGCTTTTCAGCTCAGTTTTCTAGTTAAAAGGATGTGTGGCAGCGTTATGGAATCTTAAATTTATAAAATTCGAG 
CTCTCTACTGTGTTGGCAGAAACGTCAGTCCTTTGCAGTCGGTCAACAGAAGCTAACAACTAAAAGTGTGTAAATAAAAAGCATTTAACAATAAT 
AGAACATACCAAAATAATTCTGATACAAAATGCGACTAATCTATTGCCAATTTTTTCTCAGTGTATATTTGTATTAGTGTATTATTGATTTTATA 
TTCATGGTTTTTCTCAAATATATGTTCGTATTTGGTT 
{ SEQ ID NO: 1465) 
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Start ATG: 5497 (Reverse strand: CAT) 
Transcript No. : CT25276 

ATGACGGTGCGAGCGCCGCCCACTCTGAACTCGACCGCCCCGCTCCACGGCTTCGGTCCGCCCTCCCGCGACCCCATGTACGCATCGATGGCTCC 
CCCCCTGCACCCACAGAACCAGGCGGCAGCGGCAGCAGCGGCGGCGGCGGCCTCCGGGGCGGGTCTGCCCTTCCGCCAGACCTGCTCCTGGATGG 
ACCGCCACGGACGACCCGCCTCCCCGCCCATGGAGTACGGGGGCAGGCGCAGCGAGCGAAGGGATCGCATACGTCGCGTGGAGCTGCTGATCGAA 
CCTGGTCAGTCTCTGGGCCTGATGATCCGTGGCGGCGTGGAGTACGGCCTGGGGATCTTTGTCACCGGCGTAGATAAGGACAGCGTGGCAGATCG 
ATCCGGACTGATGATTGGCGACGAGATCCTCGAGGTCAATGGGCAATCCTTTCTCGATGTGACGCACGACGAGGCGGTGGGTCAGTTGAAGTACC 
ACAAACGCATGTCGCTGGTGATACGTGACGTGGGCAAGGTGCCCCACTCCTGCACCTCCATCGAGATGGAGCCCTGGGACGCCTACAGTCCAACG 
GGCACACGAGCACGCCGAAAAGGTCAAATTGCGACCATGGTGGAGGAGAAGGCGCGCTCCCTGCTGCCCCGTCATCACTTTGCAAGCCTTTCCTA 
CTATATTGCCGAATATAGTGCGAAAGCAATGACCATAGATGCCTTTGTTGCCGTCTTGTTAGAGATGTTAGATACGTACGAAAAGCACACGCTAG 
TGACGGAAATACGGGAATTAGTTTTCCCCGAGGATCGCACGCGATACGATGAGCTCGTCTATCGCAGAGAACGCGATCCTTATAGCGTGGATAGG 
CATAGGCGTAAGGGAGACCCCGCCCGTGATTTGCCGGTAACTGCTGATGATTTGGAAATAATAGCCGCCACCGGACGGAGTCCTTCGTCGGACTC 
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GGGCCTGGGCATGACCTTGCCCCATCGTCCCATGTCGGCGGGACCCATACTGCATCGCTCACAGCCAGCATCACATTATCAGACAGGCAGCAACC 
AGTCCTCATCATCATTGTCGCCTCCACAACAACAACAACAGCAGCAGCAACACTATCAACCCCACCATGCCTCATTGCGTCATCTGGGCGGTATC 
GGCGGCAACGTGGGATCCGGTCGTCATCATCACCATCCGTTGGGACCAAGTCAATTGAAATTCAGGCAGACCAAAGACAATCAGCAACAGGAATA 
CGGCTGTGATGAGCACAGAACTCGCCGGGGCAGCGAAACCCTTTTACCGGAATACGAACCAGATGCGGTATCGTATATTATGTGGGGCTATGTGT 
GGGAGTAA 

(SEQ ID NO: 1466) 

Start ATG: 1 (Reverse strand: CAT) 
MTVRAPPTLNSTAPLHGFGPPSRDPMYASMAPPLHPQ 

PGQSLGLMIRGGVEYGLGIFVTGVDKDSVADRSGLMIGDEILEVNGQSFLDVTHDEAVGQLKYHKRMSLVIRDVGKVPHSCTSIEMEPWDAYSPT 

GTRARRKGQIATMVEEKARSLLPRHHFASLSYYIAEYSAKAMTIDAFVAVLLEMLDTYEKHTLVTEIRELVFPEDRTRYDELVYRRERDPYSVDR 

HRRKGDPARDLPVTADDLEIIAATGRSPSSDSGLGMTLPHRPMSAGPILHRSQPASHYQTGSNQSSSSLSPPQQQQQQQQHYQPHHASLRHLGGI 

GGNVGSGRHHHHPLGPSQLKFRQTKDNQQQEYGCDEHRTRRGSETLLPEYEPDAVSYIMWGYVWE* 

(SEQ ID NO: 1467) 

Classification: enzyme 



Celera Sequence No. : 1420000128062 90 

TCTGACATCACTCCTGCCTTATCTCATTACATATGTACGTATGTGAATTCGGTTTTTACATTCCTTTTTGTACTTGTTAATCATTTTCATTTGAT 
ACTTGATGAACGTGAGCTGGGGCTTCGCTTCGATTTATGCTTTTGTCGGCAAATTGTAATCAGCTTGAAAATTTCCACTGTCAAATTGATAGAGC 
TTAAAAGCTTCGACAGTCCCGAATTTTTCAATTGTTGTGTTTGTTTATCACAAAAATATACCTATCATTATAATTGGAATCATTGTTGCTTCCGA 
TCACATTACGCTGAGATAATCTCGAAATTCCACACTTGCTGGTGTGTAAGGGGTATCTGATAGTCGAGACAGTCTTTACCTGCGTTTACTCTAGT 
TTAAATTTTCTCTTAATTCAAATATCTTAACGGAATAAGAATATATTATTATTAAAATAGTTTACCTAACTACTTACTTTTGCTTAATCCATTGA 
CAGTCGACCTAGTCGTTCCATAATAATTTCACGTATTTATTTGATTTCTCATTAATAATTTATGGGCTTAAGAATTTACAACTAGTTTGCTTAGA 
GATGATAATTATTCTCATTTATCGTATATTGTTTGTTTCGTTATCAATCAAACATTTAACAATTATAGCACGTTCAGGGAAACTACGAGTAGTAC 
ATACATACTGATATTACATAATACGACCGTTGAGAAGGCAACTCAAAAATTGCACTAAAACATTCTTTTACACAGAACCAATTCCATTCGAAATG 
AAAAC AAAAT C AAT T T T AC AT GT AG AAT T T AT AC AGGGC AAAAAC ATTGGGATTCC ACC AC AC TC TT TT TT TTTTTTTXTTTTTTTTTTTGGT GG 
GCGTGGCTGGTGTTGCCTACTTTGGGGATGCCCCATTAACCACTTCGGCTCGTCGCGTTTCATTAGCTACCTCGAATACAATATGTTCTTAATTA 
ATTTTAATAAATTCACGCTTTGATTGCTATTATCCCATCCACTCCTCCGCTCACTCCATGGCCTGCGAAACGCTCTGCTGCGGAGATTGGGCAGG 
TG7y\AGGCTGCGGGGCGTTGCCGTGGCGCCGCCCGAATCCCCAATTCCGATCCCACTGCCATTTCCGTTTCCGTTGGCGGCCGCCCTCTGTTGAT 
TCTGGGCCTGGCTGAGAAGCTCCTTGTTCAGCCGGTAGCTGCTCATGACGTTGGCACAACTCTGGTGGAAGTAAAAGTTACGGCCGCGCAGAAAC 
GCATCCGTGTTGAGATCCTTTAAGCCTGTCTTGAAATACACAAGAAAAGCGGTCGCAAATTATAGAAGTTAGAGATGAAGATTTACGTCTTAAGT 
GTGCATTGACTAGTTAAAATTGCTAGAGTGGAAAGCGAAAGTTAGGGGTTCTCGGGATTGTGCATAGATCGAGCTGATCGAGGTAAAGTGAAAAC 
TGAAGCAAACTGAGAAATTACCTTACAAATGAAACATTTGTTTTCATCGTCCGGTTTTAAATCCGAACTAATCCCCACCTTTGTGTCCAATTGCG 
TTAATCCCGGCACGCCCATCGACGACAACTGCGACAGGCTGTTCTCCATGCTCTCTGCGTATTTCGTCGACGGATTGTTGTTGTTATTACTGTTG 
CTATTGTTGTTGTTGTTGTTGTTGTTATTGTTGTTGTTGTGAGATATGTCCATGGGTCCGGGTTCGGTGCTGCTCACCTGCGGCTCCGACCGCGG 
TCGCTCCTCCACCGGTGAGATGTTCTCGAGGGATCTGTAAGTGATAAATATTTTGGGATAAATACTTGGCGCAGAACATGCCCACCAGCAGCTAT 
CGTGCACAAAAAAAGATACTGATTCCAAAAGCAATAAGAATATGCAGAAAAAGCAGACTTTAATCTTTATCTTTTTTTTTTTAACAAATGTTAAC 
CTTTTGAATAAATAAATACACCAGAAGTGTCCTTTGTTACGCAGTATGCACAAGTTTGAATAAACTGTATAAACCTAAACATATTTATATAGGTA 
TCTCCGTACTTAAGAAATCTTTAAACGCTTTAAATCTTTCAGTAAGCATTACTTATATACAAATAATAAGATTTGATTTCATCATTATCGAAAAA 
ATCTTGCAATACACCACCAAGCAAGTTACTAAAGTTTTGGTTTCTAGGAAGTAATTGTTATTATTCGAACGACTGGAACGACCGCCTCACCTGCT 
GCTGCCGACGTTCTGCTCGTCCTGGCGGCGCTGAATCTTGCCATAGTTGATTTTGCTCGTCAGCTTCACGGTGTCGTCGTCGTCCACAATCACGT 
CCTCGTCGCTGTCGTTTTGGCAGGAGGCCGTGTCCATCATCAGATCCTTGTTGCTGCCACCGCCGTTGTTGTGGGTGCTGCTGCTGCTGTTGGTG 
TCGTTTACGCCCACGCCGCATGAGTCCGCGTTGTAGTAGTGGTGTGTGGTGGATGACAGCTCGCTGACGGTTTTAACTGCGGGTTAAGTGGAATG 
GGACAATTTGCATAAGTTGCGGGACATAGATATAAGTCTCATAAAACAAACCAAATCTAAATAACTGATATGGTTTTGGAAATCTTTGGATTTGG 
TTCATTTCAACTTCTAATCAGTAATATTCTTTGATAAAACTAAGCTTATATTTTGAATGTAGATTTTGGATTCAAGCAATATAGGACATTTGAAT 
CTGACAACTGGAAACCCACCACTGATATGTGCCTGGGTTGGTGCCTCTCCCGGCCAACTACTGCTGCTCGCCGTTGTCACATTGGTGCTCACGTC 
ATGCTCGTCGGGATGCGGATCCTTCTGGCTGAGCACACAGCCCGTCCGGTAGAGAATCCCGTCCAGCTTGCCCGGCTGCGAGTTGCTGTTGTGCA 
TACTGGAGGGCGGCATAACCGTGCCGGATGACCCAGAACTCGGAGTAGCCAGCTCCGAACCTGAGTCGGGCACTTCGTCGTTGCAGCTCTCCACA 
TCCACCGTTTCGATCTCCTCCTGCGATTTGGCATCGTCCTGGAAATATAAAGGAGTTCATTTTAATATTAATATTGAGATATTTTAATATCTAGA 
ACTTTGTATTCCTTTAAATTTAATTTAAGACTGATATTTATTGTAGGACATTCCACTTACCTCCGGAGGAATCTTCCCCGTCTCCAAGGCGTACT 
TCCTCTTGCAGATCGGACAACTGCTACAGTAAAGACTCTCCATCATGTACATTTCGCCGTACTTGCGCTTTCGCACTTTGAGTCTCAGTCGGTTC 
TGGCGATTGTTACGTATTCTCTGAAAGGTCTGGGGAAATCGAGAAGATAAATGGTTAGATTCGCACCAGATTAGCTCCTGAGGAACACCCACCTC 
CCAGCGGTTGTGACTGGTGCGAATGGCGTTCTCCTGCGCGGGACATGGAGGAGCAAACATGTAGGTGGCCTGCAGACTGGCCCTCTCGAATCCCG 
CACTCAGCTTGTAGAGCCTGTCCAGTTCGGTGAGGAAGTGGGTGTGCCACTCCTCGGCGCTCAGCTTCAGTCCGCAAACGGGGCACATCTCGGGA 
ATAGCACCGCCGGCAGCTGCCGCCTGCAGCCCACTGACCAGGCTCAATCCGCCGGGAACTGCTCGCAGACCCGCCATGCTCAAGCTGGCCAAGGC 
CGCGTGGTGGGCGGCGTGGTGGTGCGACAGCGGATGGGGATGGGCCATGTGGTGGTGCGTCGTCACCGCCGTGGACACCGGAGGTGCCCCCACAT 
GGTGGCCCACGTGGTGATGGCTATGGGCGTGCGGATGGGCCAACGGATGCGGGTGCGAGTGGGCGTGCGTGTGGGGGTGATGGTGCGGCGAGGAG 
GCGGCCAGGTGGCTGGCCATCGAAACGGCAACGCTGCTGGTCGCCGTGCTGATGGCGGGATTGCTGGTGGAGCAGGAGGAGGACGAGGAGCTGGA 
CGCCGACGAGGAAGTGGCCAGTGCCTGGTGGCTCTTCCCCGCATTATGCAGATCTGTGTGGGCGGGAAGAGGAGAAGACATCGTTGAATTCCAGT 
CAAGTGCGCAATCATTAATCATGAGTTGATACAATTGAAATGGAAAACACAATCTAGCATGGATTGGCTGGCACTTAAAGAAATATTAAATAGTA 
TCTACAAATTTCTAGAATCTAGAATTGTTAGCTAACCCTATATACTTTAATTAAAGCTCAAATAGAAGTAAATTTATAATGGATCAATCTTATAT 
GAATTTATCTTAAATCGATTTTTCCAGTGCAATTTCCACCTTTTTTTTCCAGCCAATAAATCTGCCAGGCATTTTCTAATTGCCTCATGCCTCTT 
TTCAGCACATCGGTGGAAATTAATTGGAGCTCGTAATTCTTAAAGCGAATCTGGTTGCCACCAACTGGGGATCAGGGAATCCAGGAGTCCGCTGC 
AGGCTTCCTGATGTCAGACACACGTCGACCCCATGTATTCCGGGGTCGAAATACCACTGGCGAAAAAATGAATAAACCACCGGCAACCACAATAA 
ATACAAACTGTTTGCTGCGAACCCAGCAAACAAAATGTGTCCGCCGTTCTGCACGTGACTGCGGCCGGCAATTAAAATAATATTACAATAAACAT 
ATCGCAGTACGTTCCGAACAGGCCCAGTCTAGCAGCGGTCCAACAGTAAAACAGTTTTATTTATATTCTTCTAAATTATTGCCCAACAAAATATG 
TCGGTTAACCGCGCCCATCACCGCCACTCGAATGGATATGCTTAACATAAATGGAGATACATTCGAGTCGTGACTCCGGTTCAAATTAGCTGGCA 
GTTTGGTGCTCCACATCTCTCACTTTAGCTGGGGTTGCTCCTGGTTACCATTTAGCTAGTTCATAAACTTCACTTTTGAGATCAGAATTTGGAAA 
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ACTAAGCGAAATTTAATTTTTGTGGCTATTTTGTTGTATGTATTTCATAAGAAAATTTAGCTCTATATTAAAACAGCAACTGTTGAAAGAAGAAA 
TACCATTTGAGTAAGATACAATCGAGAGCAATACAGATTACAACTAGTTTCAGTAGTTTTATTTTTGTGATTAGCTAAATCTGCATAATAACTTC 
TTGGAACCCATGGGGCATTGTATAAACTCTTTGTAGTATGCAATATATATAGTACTCATGACTGGCATAACTGAAAATATGTCCAGGTTCGATGA 
CCACCCACTCCACGCCGATTTGTGACACGTGCTCAACGATGCTATTTCATAATCCTCCCCGTCCAAGGAATGGGCGTTTCATATATCGCCGGATG 
AAATATGACTTTGCCATTAATATTACGGAAATTAATATGAAAATTCCAAGTGTGCATTCAATATAATACATCATCGGCGAACAAAAACCGCGGAG 
CAGAAGTAATAGCAATAATAACGCTCGTCATAACAATGGCAAACAATGCAATCAAAGCGATTCGCAACACTTGCGCATCCTTTGAGCCAACAAAT 
CGCAGAGTAAACCGAAAAAGTGTGTCCCCCAGCCGGCGATAAATGAGCAAACATTATGGAATCTCGAGGGTGGCTACACAATGCCTCCGGAAGAG 
TTGCGGCAATGTCCTGGATGAGTAATTAGCCGCCACTGGAGAGGACGTCTTCGTGTGCCGAGCACGAGCCGCCCGAAAGGATGTGCGGTCGGTTT 
GACTTTGTAGTCAAGTGGTCGGGTTGCCTGGAATGGGGTTGTTGTAACATTTCATTGGGACCACTCCAGTCAAGTATCCTTGTATCCTTACACGT 
ATATATCCTTGTGCCATTTAAAACACTTACGACATGCACCCCTTCTCTTAACCCTTTGCCGCCCTCTCATTCCTCGGATGCCCTTTTGACTTCTG 
CCCCGCAGTGGGGTTAAATTGGCCTGTTCGTCGCTAACCTTCAGTTAGTTTAACCCATTGATGAATGCTGGCCTGCGCTGCAGCTTCAGCTAATT 
AAAAGTGTATTATTTATCGAAATTAATTTATGGCGCTCCTGTCTATCCACGACTACACGCTAATCTAAAAGTTAAACACAAATATCCACATGTTG 
AGTATGGGTGGTGGGTTCATGTGGCGAGGGAAAACTGGTTTTCCGGCTTAAGTTGAAAGGCAAATAATGGCCGAAAGATAAACAGAGTTGTCACT 
GTTTTAAGCAGAGGTGTTGGAAAGGGGGGGTCGACAAGTGAATATCATTAAATAAATTTTACATTTTATGTGTAATATTTATTGCAAAAATCTAA 
CATTTCATGTTCCAACTCCCGGTTTTCAACTGAATATTCTAATTGGTATATCTTCTAATTAGAGTGCCTTATAATGCCTACATAATGTAACATTT 
AGGAAACCCTTTTGAACTCTCTTGGCACTACGCCCCTGTCATTTATCCCATTTGCTTTATTCATTTGCATTCATATTTATCATTTTCATATGCTG 
CAGCCCGATTTTTGTCATAAGGCGCTTAACTCGATCAGCTTTACTAGACCAATTAGCAACGCATTTGCAGACGAGACGGTTTAGCCTACTTTTTG 
GTAAGGTTGGGATGGAAATTTGGGGAAAACGGAAAAGCTGGAACAGAAACCACCCCCAGGTAAATAAAAGAGAATTTAAGCAACATTTATCTGGA 
CTCGAGAAAACTAATAAAAAACGAATTGAAGTGCAGGAGATGAGAGGTCCTGTCGCTGTCCTTACGCCTGCCAGTCAAAGTTAACCTACTGAAGC 
GTTAAAATTGCTCGGCAAAAGTAAAATTGAAACAAATACGCCCGGCGATTTTTATCAGCAAAAAAAAAAACGGGAGTTCGAAAGGGCAGAACTTT 
CCAATTTTTCCTTTTATCGCAAACATGCAAATTTTTTTGTGCGATGTCGATGTAAATTTGCTAAATAAATAAATGCGGTGCAAGCAACCCTACTT 
ACGTCTCAAATCCCTGGGATCCTGAGAATTATTGCTGCGCGGACTTTCACCCTTTTCAACGGCATCGTAATCCTGTGATCCCTCCATGGATTCGT 
GCGATGTGCATGTGGGGCTGGCCGAATCCGTGCGAAArGATTCGCTGCCATTGAACAGATTCTGCCGGATGGAAATCGAAGTGCAAATCCCATTA 
TGACACGTGTCAATCTTGCGCCGATTAACAGCTCATAGCATCAGCGGATTCCGCAGCGGATTCCGATTGGATACCATCCCAACCCGATCCCAACT 
CACCTGATCCAGGGATATGCCGATGTACTTGGGCGGCGAGAAGGCCGACGGCAGCGAGAAGTTGGACAGCACCGACGGCTTGCCCAGCGGCGAGC 
GGTCACTGGCAAAGTAGCTGGCCGGTCCCGCTCCCGCCGCCGCCGCTGCCGCCGCCGCTCCGTTGGCCACCAGCATCCCGTGGTTGAGCAGCAGG 
TGCGGATTGAAGTAGTTGCCGACTAGTGACATATTGTGCAGATCTTTGGCGTCGGGTAGCTGCAAGAGGGGGAAATCGAAAGGGATTACCTAGGA 
AGCACAGTTGGGGCGAGAGAAATGGGGAAAGTATTGTAGAATCGTGGCATTAGCAAGTTTTCATTCATTTTATATTACTGGATTTTCGTTGGGTA 
CCATTATACATATATCAAGGTCGATGCATCTATTCAGTTTATGAGTGGGAGCAAATCCAACAAAATATTTACCAAAAATTTATAGATACTTCTTT 
GT GT G AG AG AAGAAAGC AAAACTTT AAT AC TGC T AAC T AAAAC TGCT AAC T T AAACAAA TT TAT AT GT AT T CAAAT AAAT T T GC AT AC AT AT T T A 
TTATATACTTTTAATACTATTGTTAAATAACAAATCGATTTATTAAACCGAAGTGGAATTTTATAGTTAAGCTAATGCCAAAAATGTATTGTATT 
CGTTGCATAATTAAGAAATAGTAATAGTAGTTGTTATAATCTAGATTAAGTTCCTTTAAAGAATCCTTCAGGTATTTTATCCTACCTTCCCTTAT 
CTAAGCTTGTAATTCTTTAAAACGTGAGAATCTAAATATATAAATATGAGATCTAAATATCTATAAATTACAAGAGGATATTTAACCGCAAATAA 
GAATGGAAATAAAAATGCTAGGAAGCATGCTTAGTAAGCATCTAAGCCTAGAGTTTTTAAAAGTTAAGTGAGCGTATAATTTAAGCGTTAAATAT 
GCGCATTACTCGAGTCTGAGCATCGGAACGCGCAACAAATTATGGGAGCTACATGACTTGGCGTTAAAATTAGCCAATTATGGAAGGTGGAGGCC 
CAAACGCGGATGTGTGTAACATATGCCAAGGCTTGGAAAGGTTTGGAAAAGCAGGAAAACTGCGATACCTAATCCCATTCCCCTGGAGAAAAGAT 
GGAAGAATCTGCGAAGGGCGGCGTCGATTTGCATCGATACAAATTGTGAACAAAAATGTTTAATTTACGACGAATAAATGTTAAATTGAGAGATT 
TTAGAATTACGATATCGAACATTGCCATAAACTGTGTCAATATCGAAGGGAAGTGGCGGATAGCTCACTTCCAATGGGGTGTAAAATAACGAGAA 
AAACGCTTTGGCCCCCGCCCCAAACTTTCTCCGATCCAAAATTCACACAAGGACTCTGGTAAATGATATGGGTGAAGTGGGCAGGGTGAGTCCGC 
TGTAGTCCGTGGTCTGGGGCAGATAACCCACAGGACTCACAGACAGCGTTCCCAACAAATTGAGGCTGACGAGTGAGCTAAGCGGCGGTTGCCGC 
ACAGACAAGATACGAAGATACGAGTAGAACATAAGCGGCGACATGTGTGACAAGTATTAACCCTCACCCACACCACCGTCACTCCAACGACCCCT 
CAGCCACGGCAAACTTGGGAACAGCAAAATTTACATACAAATTTGTCTGCCAGGAAGTATTTTTACGGCGAAAAATGCGAGCAAATTATGGGATC 
AAT GTTTTTGCAGGAAACTGTAGGTT ATT CC ATTTAAAATCCAATCT AAAT ATT ATCCATAACGAATATATTTAACTGGTTATAGTAAATAGTAG 
TGGTTAGTAGTAGTTCTAAACTTATGTGCTACTTGATAAACTAAAAAATCGATTATAGTTGTATTGAAGTTATGTTTTCAGTACGGAATTGACAC 
TTTAGTCAATCAATCTCATTAATTACAGTTGATTAATCTGGTCCCAAGTCCATCGTAATCCTGGTCACACGCTGCACCCAAGGCGAGTCCAATAA 
ATTGGGGATTATCAAGTGAAATTAGCACCTTCACTTGGGCGCATATGTTGCCACATCGGATCTGAATAATCACCACATCACATCATCCCCATCTT 
CATTAGGCTCGAATGTTACAAGTCAGTGGGAGCAGGAGGAGATTGGGGTATTGCAACCGCATGAGTTGCACTTTGTCTAAGTCTGTACATTGGTC 
CACGCGTGTTTAGGGGCTCCATTCCGGGGTTGGAATCGAAAACCCCAACAGAAGTGGTCAGCCACTTGGCACTCCCATTCCAAACGACTTTTTAA 
TGCAATGCGTGATGGAGGATTTTCAGCGTGACTTTCGCCGATTGATTTATGCCCGTTAATCAGTGCAATAAATGCCATGCACAGGGAAATGCTTC 
TGATTAACCAATTTGGATAGTGCACTGAAAGTTTAAATGTTCTAATGGCTTAATGGTATCTATTGTATATTTCAAAGTTTGATTCATTTAGAGTG 
TGATTGTGTAACTTTAGACAAAGGCATTAAAATAGTTAAGAGATAAAAAACAAAATCTAAAATGTTTAAATTCTCATTTAACTAAGCACTTTGGA 
TGTCTACAATTCAGATTTGAAACCCATTCTCCATAGAGGTGAATACAAAAGTATTATAAGATGGCACTATGGATTAATTCATTTTTCCCAGTGCA 
GGAGCAGGTGCTCGTTTTATCCCTTGGCTGGAGTGAGGAGTTTGGTTCTACTCCCGCTTTGTTTGCTTTTCTATTAGCGCCGGGCGGGTGATTTG 
TTATCCCAAATTGTAGCATTCCTTATCCGGCGAGGGGGATGATGACTGCGACCAGATTTGCATAAATTCGTTCTGTTTGCCCTTAGCACTTGAGT 
TATTGGCGTTTGCCACTCGCCCCGCGGAGGGGACGCACTTATTCGGTGCAACTCTCAGCTGCATCACATACATACCAGCAACAACAATAACAGCT 
TAATTACAACAACACGATAACGAGTGGCAAATAGCAATTGTCCACATACAATTGCCAATTTCGCTTAACTTGGCCATAAAACACGGAAAGCGACG 
GCAGACGGCCCAGAAGAGCAAAGCAAACAACTTGGCCTCCGCGCCACTCCGCAAATAAATCTCAATTTGTGGCACTAACACGGAGTAAACGAGGC 
TCAAATCGAACCGAAGAGCGGCTGGCGATGCTAAAAACGCTTAGCAATTGGCAATTTTCGCAAACTAAATGTTTATGGCTGGCCTCAACCTGTTC 
GACTTTGGCCTTTTCCGTGCCCCAAAGCTACGGAACTCAAGGAGCAGAAAATTGCGATGCCAGTGGATGGCGTAGCATACCTCTGGGCGTCTTTT 
AAGTTTGGCGAGAGCGACGGGAGTTGTGTGTTACTTTAATCTAATACTTTTAAATTAAAAATATTGTATTTGATTAATGTTTCAGTTCACAAACA 
ATCTTTGCCAGCATAGATGACAAAAAAGCCAAGAATGTGCAACGCTTTTGTTTAGTTTCATTATAGCATCATCATAAATGCGTATACGTAATAAT 
AATAATTTTAAGTTGGCAACTAACTTGCGGATGTGCATAGGCAACAAGGCCTATTTATATATGAGGATTATTCCGATTTGATGGTTAAGTTCATA 
TGATTTAACCCAAAAATAAAACGGTCCCGATATTAATTCGCCTGGTGATAAAATAAAGTTGCTGTGACCAACTCAAAACAACACCATTTCTAACT 
TCATAACTGAGTTGGGGTTTTCAATTAGGTGATACGACTTTAACACTTGCAAATTTCTTTTTTGATATTGATATTTTCCTTAATTCATGTATTTG 
TACATACATATGTTCATTTCAAATAAGAAACCGCTTGTAAGGAACATGATATCACCTCTCCCTATCTCTACATTCTGTTGGAATTCTCTTGGCTA 
ACTATCACTGACCATTTGGACATAATACTCCATTTGGTAGTAAGTACTGCTGGTTAAACGCCTCGCTGCCTCCGCATTTCGCTGCTTGGTTCCTA 
GCTGCCTTCCTGATAAATTGCCATAAATTAAATTAGGCTCAGACACACACACACACACACACACAGATAAATATAGAGACACACACTCGCATATG 
CAACACAACTCAACTACTCATACTTAGTGGCAATAAGTGGCAAAAAAATGAATTATTTTTCATATGTTGGCCCGCCTCAGTGCCATAACTCTGCC 
CAACCCAACTTGACTGCCATTTTCTGAAGGGGTCAGTCTGTTTACCTTTTGTTGTTCCTGCTGCCGTCGTGGTTACAGAATTGTTATGGTGCCTG 
GTGGGTTTCTTTTCCGCTGCCCTTCACCCACCCAAACAATCAGGTCGGTGGTTGAAGGTTCGTTGGCAAGCGGAGTGGAGGAGTGGAGGAGGCAG 
TTCCACTGGAAGCAGGCCCATATTAAACCATTTGACATTGCTTTTAATATGCTCGCTCGGCTTGTTTTTATTAATTTTTATGGTTTGCCCCATGC 
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CCAATTGTAATCGGTTTCATAAATTATGCTAGATAAGCGACTTGTGCAGTGGCCTTAGAAGGAATCGTCATTGGAAGAAGAAATGGCAAAGAAAG 

C T G AAG AAAC T AAT ACT C AGC T ATGG AAT GC AG AG AAAGCT AG C T G AAG AT G T AT GT AT GCT AAAAG AT AT T T AAAG AT ATTG AAAAGT AAAAAT 

TTACTTCTTGTTCTTCTTATATAACAGATTTAAACCTATATCTAGTATTGCATTTTATGATGATTATTTATTAAGAACTTTATTGAAAATATTTA 

AGTGACATTGAGGCGGCTAAAACTTGATAGGTATCAATATAGGCCCTGTTTCGCGACAAGAATTCCTCAAAAGCAAGGTAAATACTTGATCTTTC 

TAAGCGATATTCCTGCTTTTCCACTGTCTTCCGTGGCGAAAATCGGTTCCTCACAACCCAACTCGCATCTGAACGCCGAGGACATATTGTTGTCA 

TGTTTAACATATCTTAACGCTACTCAATGCTATGCAGGGCAGAGATACATATGTACGTTCCTATTCGGACTGCAGCTCCGATCGGAATAACCAGT 

TTCGCCAAGCGGACTGTTCGAACATCATCAGCGCCCGAGTTCTTCATCATTTGTAGCATCAGCGTCAAGATCATTGCTCATGTTTACAATGATGA 

CAGGCAGCAGAATGAGCAGAAGCTGGCCGGGAATCGACTTCCAGCCAAATCAAATGGGTCTGCTCTGTGGGATCGGTATTGGGACTGGAATCGGA 

ATCGGGCCAGTAGTTAACTGCGGCCCGCTTCCACTTCCACTCCATCAGGCCCGACTCGCAGTTCGGAATTGAAGCTATCATAAATCTGTTCGTCG 

CAAAAATCTACACACAATGGGGCGTTTGCAAACGCATTCGCCGATCGGAATGCGAAGACGAACTCTCTGAATCTGCCTCTTGACTGACTGCCGCA 

TTGGAAATGCGATCGTGATCGGGCAGTTCTTTTGGGATATAGGAAAGAACAGAAATTGTATGAGAGGATGGAAAAATTCGACTATTGAACTATTT 

AGAAGTGTTCCTATAGAGCTTTTATCTTTGCTTTAAATAGGAACGCGAAATAAAAACACCCATATTCTCTCAATGGAGAAGTTCATTCAAACTAT 

GTTCAATTTAGGGTCCAGATACCGAAAATTTGATTATCGTTGTAGTTGTTTCTTTTTTTTTTTTGTAAATTAACATTATGAGATCTATCATTTTG 

AACAAGGATACTGGCTTCCTTTATGTACATATTATATATAATAACTCCATTTTAACCTATGCTCCATCTAAATAAAGTAGCTCACATTATACACT 

GCACTTAAACGAGCTCTCACAGCTTTCTCCTCACTTTCTCTCTACGTAATGGATCCTTAATTGAGTGCATGATCGGCAATCACGCTGTCATCCCT 

AGGCGCATCGCCAACTGGAAGTTCACTTGGAACTTCTGGTCGGATTTGTTGTTTTACAATGCAGCATTGTTTTGCTTGCTGTTGGGGTTTTGGCC 

GCCGTTTCGCGATTCCTTTGCCTCGCTAGCGGTGTGCTGATTTATGGCATCTTTCGGCATTTCGGCTACCGGCCCCCAGTCTTTATGATTTGTCA 

TAAATATGTCCCCTCTCCGTCCGATCAGAATCGATCGGCTGGGCGATTCCAATCAGCGGCTGTCCGGATGGTTTGGGCTGAGGTTGAGTTTCTGG 

GCTCTACTTACTCTCGATGGAAACGGACGTTGTCCACGTAGAGCCTCGTGCAGACCGGAAGCGGCTTCCGAGAGGACAACGAGCGCTTGGGACGG 

CAAATGGCTGCTGCCAAATGCGTTGCTTTCCATCTGCAGCTCGTTTGTGTTTTGTTTTGGTTTTGGTTTAGACTAATTTGGCACTAGGGGGCACT 

CATAACCTCACTTTCGCTTGTGTTTCAATGTTGGCACATGCAAAGATCACTCGGTATCGGTGATAATCGGTTGGGAATTGTATTTGCGCGTCGGT 

AAAACGGTAAAGCTCGGATCAGCTGACAGCTCGCTCGCTCTCCTTCTCTCTAAAACTCTCTCGCTCGCTCGACACAGGACACCCGCACGTACGCG 

CTGCAGAAATCCAACTGAAGCCAAATGGAAACCAGTTGCCGTTAGTTGGCCCGTGTTCTTCCTCTTCTCCAGCCTTGCCGCTGCGCAGTCGGCGT 

CGGCGTCGCCGCGTAACGCCTCACGACCTCTGATTGGCCAGCGGCCACAACCGGTTTGTCTACACTGTCGCCATCCCGCTCTCGCGCACGCCCAC 

TCCCACTGCGGCAGTGTCTTTACATTATGTTCTCACCTCCTTGGGCGCGCAGTTTGTTGTTGCCGCTGCATTATTATCCCAAAATGGTTGCGTTT 

GTCCATCCGTTGGCCATTGAACGCCTGTTTGTCTGGCAGCACCTGGTACTCGCACTCGAACTGGTACTGGCACTGGTACTGGCACTCTCCGGGTT 

TCCCCGCCCCCACAACCATTTTCCACCTCGATCAGCTCCCCTCCCTTGGACGTAAACTTCTGACATTTTGTGTAAATAAACCGTATGGGAAACAT 

TTTTACTTTTGGCTTTGGCATTTTGTTGTTGTTTGTCGGACACTGTCCCGGACAAAAGTATCTTGGAAGTAGTCTCTTTCCTTTTTTCTGCCCCT 

GCAGCTGGGTTTTTGCACACCGTCCACTTAATATAGCGCATTACTGTGGTATTTATTGGGCGCGAAGTGGCGAAAATATTTCGGAAGCAGGGTGT 

CAAGAAAATTGGAATTTAATAGGCCGCGACTGCTATTCTGGTTAAAAGAACCAAGTCAATTGCACAACGACAAAATATTGTTA 

(SEQ ID NO: 1468) 

Exon: 13428.. 13312 
Exon: 7374.. 7129 
Exon: 6996.. 6843 
Exon: 3948. .3323 
Exon: 3259,. 3101 
Exon: 2983.. 2680 
Exon: 2451.. 2181 
Exon: 1744.. 1504 
Exon: 1264,. 1001 

Start ATG: 13428 {Reverse strand: CAT) 
Transcript No. : CT25312 

ATGGAAAGCAACGCATTTGGCAGCAGCCATTTGCCGTCCCAAGCGCTCGTTGTCCTCTCGGAAGCCGCTTCCGGTCTGCACGAGGCTCTACGTGG 
ACAACGTCCGTTTCCATCGAGACTACCCGACGCCAAAGATCTGCACAATATGTCACTAGTCGGCAACTACTTCAATCCGCACCTGCTGCTCAACC 
ACGGGATGCTGGTGGCCAACGGAGCGGCGGCGGCAGCGGCGGCGGCGGGAGCGGGACCGGCCAGCTACTTTGCCAGTGACCGCTCGCCGCTGGGC 
AAGCCGTCGGTGCTGTCCAACTTCTCGCTGCCGTCGGCCTTCTCGCCGCCCAAGTACATCGGCATATCCCTGGATCAGAATCTGTTCAATGGCAG 
CGAATCATTTCGCACGGATTCGGCCAGCCCCACATGCACATCGCACGAATCCATGGAGGGATCACAGGATTACGATGCCGTTGAAAAGGGTGAAA 
GTCCGCGCAGCAATAATTCTCAGGATCCCAGGGATTTGAGACATCTGCATAATGCGGGGAAGAGCCACCAGGCACTGGCCACTTCCTCGTCGGCG 
TCCAGCTCCTCGTCCTCCTCCTGCTCCACCAGCAATCCCGCCATCAGCACGGCGACCAGCAGCGTTGCCGTTTCGATGGCCAGCCACCTGGCCGC 
CTCCTCGCCGCACCATCACCCCCACACGCACGCCCACTCGCACCCGCATCCGTTGGCCCATCCGCACGCCCATAGCCATCACCACGTGGGCCACC 
ATGTGGGGGCACCTCCGGTGTCCACGGCGGTGACGACGCACCACCACATGGCCCATCCCCATCCGCTGTCGCACCACCACGCCGCCCACCACGCG 
GCCTTGGCCAGCTTGAGCATGGCGGGTCTGCGAGCAGTTCCCGGCGGATTGAGCCTGGTCAGTGGGCTGCAGGCGGCAGCTGCCGGCGGTGCTAT 
TCCCGAGATGTGCCCCGTTTGCGGACTGAAGCTGAGCGCCGAGGAGTGGCACACCCACTTCCTCACCGAACTGGACAGGCTCTACAAGCTGAGTG 
CGGGATTCGAGAGGGCCAGTCTGCAGGCCACCTACATGTTTGCTCCTCCATGTCCCGCGCAGGAGAACGCCATTCGCACCAGTCACAACCGCTGG 
GAGACC TT T C AG AG AAT AC GT AAC AAT C GC C AG AAC C G AC T GAG ACT C AAAGT GC G AAAGC GC AAGT AC GG C G AAAT GT AC AT G AT GG AG AGTCT 
TTACTGTAGCAGTTGTCCGATCTGCAAGAGGAAGTACGCCTTGGAGACGGGGAAGATTCCTCCGGAGGACGATGCCAAATCGCAGGAGGAGATCG 
AAACGGTGGATGTGGAGAGCTGCAACGACGAAGTGCCCGACTCAGGTTCGGAGCTGGCTACTCCGAGTTCTGGGTCATCCGGCACGGTTATGCCG 
CCCTCCAGTATGCACAACAGCAACTCGCAGCCGGGCAAGCTGGACGGGATTCTCTACCGGACGGGCTGTGTGCTCAGCCAGAAGGATCCGCATCC 
CGACGAGCATGACGTGAGCACCAATGTGACAACGGCGAGCAGCAGTAGTTGGCCGGGAGAGGCACCAACCCAGGCACATATCAGTGTTAA7VACCG 
TCAGCGAGCTGTCATCCACCACACACCACTACTACAACGCGGACTCATGCGGCGTGGGCGTAAACGACACCAACAGCAGCAGCAGCACCCACAAC 
AACGGCGGTGGCAGCAACAAGGATCTGATGATGGACACGGCCTCCTGCCAAAACGACAGCGACGAGGACGTGATTGTGGACGACGACGACACCGT 
GAAGCTGACGAGCAAAATCAACTATGGCAAGATTCAGCGCCGCCAGGACGAGCAGAACGTCGGCAGCAGCAGATCCCTCGAGAACATCTCACCGG 
TGGAGGAGCGACCGCGGTCGGAGCCGCAGGTGAGCAGCACCGAACCCGGACCCATGGACATATCTCACAACAACAACAATAACAACAACAACAAC 
AACAACAATAGCAACAGTAATAACAACAACAATCCGTCGACGAAATACGCAGAGAGCATGGAGAACAGCCTGTCGCAGTTGTCGTCGATGGGCGT 
GCCGGGATTAACGCAATTGGACACAAAGACAGGCTTAAAGGATCTCAACACGGATGCGTTTCTGCGCGGCCGTAACTTTTACTTCCACCAGAGTT 
GTGCCAACGTCATGAGCAGCTACCGGCTGAACAAGGAGCTTCTCAGCCAGGCCCAGAATCAACAGAGGGCGGCCGCCAACGGAAACGGAAATGGC 
AGTGGGATCGGAATTGGGGATTCGGGCGGCGCCACGGCAACGCCCCGCAGCCTTTCACCTGCCCAATCTCCGCAGCAGAGCGTTTCGCAGGCCAT 
GGAGTGA 

(SEQ ID NO: 1469) 



FIGURE SHEET 795 



Start ATG: 1 (Reverse strand: CAT) 



MESNAFGSSHLPSQALVVLSEAASGLHEALRGQRPFPSRLPDAKDLHNMSLVGNYFNPHLLLNHGMLVANGAAAAAAAAGAGPASYFASDRSPLG 

KPSVLSNFSLPSAFSPPKYIGISLDQNLFNGSESFRTDSASPTCTSHESMEGSQDYDAVEKGESPRSNNSQDPRDLRHLHNAGKSHQALATSSSA 

SSSSSSSCSTSNPAISTATSSVAVSMASHLAASSPHHHPHTHAHSHPHPLAHPHAHSHHHVGHHVGAPPVSTAVTTHHHMAHPHPLSHHHAAHHA 

ALASLSMAGLRAVPGGLSLVSGLQAAAAGGAIPEMCPVCGLKLSAEEWHTHFLTELDRiYKLSAGFERASLQATYMFAPPCPAQENAIRTSHNRW 

ETFQRIRNNRQNRLRLKVRKRKYGEMYMMESLYCSSCPICKRKYALETGKIPPEDDAKSQEEIETVDVESCNDEVPDSGSELATPSSGSSGTVMP 

PSSMHNSNSQPGKLDGILYRTGCVLSQKDPHPDEHDVSTNVTTASSSSWPGEAPTQAHISVKTVSELSSTTHHYYNADSCGVGVNDTNSSSSTHN 

NGGGSNKDLMMDTASCQNDSDEDVIVDDDDTVKLTSKINYGKIQRRQDEQNVGSSRSLENISPVEERPRSEPQVSSTEPGPMDISHNNNNNNNNN 

NNNSNSNNNNNPSTKYAESMENSLSQLSSMGVPGLTQLDTKTGLKDLNTDAFLRGRNFYFHQSCANVMSSYRLNKELLSQAQNQQRAAANGNGNG 

S G I G I GD SGG AT AT PRS LS P AQ S P QQS V S Q AME * 

(SEQ ID NO: 1470) 

Name: Om protein 



Celera Sequence No. : 142000013384150 

GTTATCGGTGGCTTTAAATTCGTAATAAATCTATTGTTGAATAAAATATTCCATTCCCCCCGAAATTTGAAAAATGAGAATCGCATACAGGCACA 
TTTCGAGTTAACATTATTTTTATTTTTCATACAATTCGTACAATGCCTACCTATAATTAATATTAAATCGCAAAATATTGGCTTTACCTAGTGAA 
AAGTACTTAGCATCCTAATAGCCGATAATATAAATTTCTGTGAAACATTTCGTATACTTTAGGAATAATATGTGTATATTTGCGTGACTTCCTAA 
AGGTGAAGGTTCCGGAACCTACAAAAATATATAAAATGATTGTTTGAACTGGTTATCGTAACACTAATTAAGGTAGTTCTTTGTTACACGCTAGA 
TTATATATTTCTATACATATCGGTCGAGCTGTATATTAATTATGAACAATTAACGATTCTGAAAACTAAGAACTCGTTGGGGACTGTTCAAGCTT 
TGCGAAATAATAGAGATATTTGTACATTTATATAAACCTAATGATTTAAGCGATCCGCATTAATACCAATCAGCATTAACACATCAAAAATATGG 
ATCGCTCTATATATAATTATAGATTCTACTTGTGGATTACAAGAAGATTAAGAACGAAAAGATTTCGATTCCCAAGTATATTTACAATTTAAATT 
GAGCTGGCGCTTTGAAAAAATTGAAAAAATTAACTATTTTAAAGTAGGGGGATGCAATAAATTTGTCTAGCTGTGACCGATATGCCGATGTGCCC 
GAGACCCTATAAACTCATTCGTTCCGGCATGCCCGATTGAATTGACGAACTCTATGGACTAGTTTCGCGTTGGTATATCATAAGAAATTGGCGCG 
GCCTACAACATCAATGGCCAATGACATTGCGCAGTGGCAATGTCATACCCATACGTATTTGATAACAGCTCGCATAAATTCTTTTGTGACTATAC 
AGCGAGTCTGAGACCTTATCTCAGCTCTACAGCAACCAAAACACTCGGCGCTAGGACTTATTACTATTGCAGCTATTGACATTGTTATTGTTGTA 
CAAGTATGGTATAAGATTGGGCTCAATCTCCAGCTGCTGCCTGAGTATCGAGCTCATGACGAACTCCGTTGAAACGATATGACACTTGGGATTTC 
CGTGCCGCGTGAGATCCGCACAGAGATGCATGTCCGTGGGACAGGTTACTATGATGTAGGAATCGGGCGCTTGCATGTGTGTTTCCGCCACAGAT 
GCACCGCTTCTCCGCTTCGGCTCCACTTTACCACCAGACGATTCAATCATTCGTATAATTTCTTCGCGAGCTGGGAACACGTCCGGCGTCACATG 
GAAGTATTTGCCGGCAAACAGCGTTGAACGAGTGGGGGCGCACAGCACGGTATTCAGATTGAACTGCAGGTTCTCGTCCACCGGTATGTGCTGAA 
TGCGGTATGGATCTGTAGGCACAAATTTACCAGCTTTGGCACTATCAGCAATCCAGCTCGACTTCAGCACATAATCGACGTGGCAGCAGGCCTGG 
ATTAGCTTGCATGTACGACTCTCGCGTGTCATTACCAGGTGAGTGGCATCTGCAGGGCTGTCCACCACAATTCCACCCAAAATTCTAAAAAAAAA 
AAAAGGGAAAAGTTAAAAAACGAAACTAAGAATACGTTGTATTATACTTACAACACAGCTTTTTTAAGAGCCTCGGCATCTGCGACCTGGGAAAA 
TATGACCTTTGGTGGTTTGGTGGTAGTTGGATATTCTACGCAGACAATTTGCTCTGGCAGCTGCTCCTGCTGGAATGCGTTCATCTTCTGGCGCT 
TAAGTTTTTGCTCGTTGCCGTAAGGATCGGATAAGTGTCGCTTGACCCGTTCGTGTGCCTCCTGAGTTAGATTGATGGGCGCCTTCCAAGCAGCT 
GTTAAGAAACAAATCGTCAAATGATTTTACAATATAATACGAGTTTAGTTTGTACTTACTTAGCAGATGAGCCACCAGATTGCATTCGATGCGGA 
AGGGAGCGACCAAGTTGTACTGCTGATACTTGGGATTCTCATACTGGGAGAGGCCACTAAGGTTACCGATGCACACGTCGCTTAACCAAAGCGCA 
TTGACCATCGGTATGTTCCACTCCTTGGCGGCATTAAACTTGTTGCCCTCGAGCTGCTTGCACACGACCACTGTGTTCACCTTGGACAGATAGGA 
TGTGTAGATGGCACCGCACTCCTCTGCCATCTGCTGTAGTCTCACAACCTCCTCGCCTTCAAATCCCTCGGAGGTTATGATGTAGCGTTCTAGGG 
GCTTACGATAGCCAAACTGGCTGGGGAACGGTAGGTGCAACGGCTGCCACGGTGGCATCAGTTGGCGCTTTAAGCAAATATCACTCAGCCAGTAG 
GCAGTGACGCAGCGCTTGGCATCCCGCAACGCCTGCATCACCACGCCATGACGCTGAGTGCGACAAATAACATGGGTTACTCGGGGGCAGTAAAC 
ACGTTCGATGTCACCGCCAAACTGTCTAATCGTATCTAGCCAAATGGGCAGTTCATCACCGTCTGTTTCATCGTATTCCACAATGTAGAACGTAC 
AGCCCACCAAAAAGAGATCGGCTGGTAACTTTAGGTTCGGATTGTGCCCGTAGAACTGCGGCATAGGGATCGGTTGCTGACCAGGACGAGTTTGA 
ATGTAGCCGTGAGGAGTCCGCTGCACTGGCACACCCACGCCCACTCCCACAGCCATCTGTTGCTGCTGCTGGGGGGACTGTGTCGGAGCGACTAC 
GATCTGTTGTTGGACTAGATGTTGCTGCGGTGCCTGCTGCTGTTGTGGCAGGGCGACAACTCTGCGCATTTGCTGCTGTTGCAGCAGTTGTTGTT 
GCTGCTGTTGCTGCAACATGAGCAACTCCTGCTGAGTTCGCGGTGGTCCAGGAACTCCGGCTGTTGCGTTATGTTGCTGGCCCATCATTCTAAGA 
GCACCAGCTGCCGACGGTGGCTCGACGTGTTGCTGTGGTGGTAATTGCTGTTGGACAACCATTTGTTGTTGTTGCTGCTGCTGCTGTTGCTGTTG 
TAATAGTAATTGTTGCTGCTGCTGCTGAATGGCTCCGGTCACCACCACCTGTTGTACAATCTGCTGTTGCTGTGGCTGCTGTCCATCGGGAAGTT 
GAATTATTTGCTGTTGTGGTGTTTGAGCTCCATTATTTCCTAAGCGATTGCTCAACATGTTGGCCAGGGCTGTCTTCGTTTTGAGGTTCATTTGC 
TGCGGTGATTGTTGCGGCTGTCCATCGGGAGCTGTGCAAAAGGAATTGGTTACTATGTGCATTGTCCTTTTTTTAATGATTAAAGATGATTCGTA 
TAATTGTAAAAGTCACTGTAGCAGACAAATGCATTTACTTACTCGTTGATGTCATGACCAAAGTTTGCGTCATTATGTTGGCATTTCCGGGAAGA 
ACACCTCGTGCTCCTGGAGTTGTAGGTGTGCCTGGTGCGGACTGCTGCTGCGGCTGTTGGACGATGGTCTGTTGTTCAATCAGCTGACCATTGGG 
TCCGGTGATAAGCTGCACTGTTTGCTTGCCCAGCTGCTGAGGTCCCTGTTGCTGAATGGGAATCTGCTGGACTCGCAACTGCTGCCGCTGCATCA 
GCATTTGCTGGTTGGGATCATTGGGGTTGAAGGGCTGCTGTTGAGGAGACTGCTGCTGTTGCTGTGGTCCCGCTCCTGGCTGCTGTTGAATCGGA 
CTCAAGCCAGGTGTTCCGGCGCGTACAACAACCTGTCGTTTTCCCTGCTGCTGTTGCAGCCAGAGGACTTGTTGCTGCTGTGTTAATCCGGGTGG 
CATTTGGTGTTGGACAATAATGTGCCCGGGCGGAACCGGACCTGCCGGTCGCGTCGCTACTATGCTGCCCGGTGCTCCAGGTCGCGGCTGGTAGG 
CAACCTGTTGGCGAAGCATTACTCGAGGTTTGTTCTGCTGATTCAACTTAGTAATATATTCGGCTCTTTTGTGAGCATCCAATGACATGAGGTGT 
GCATGCGTTTTTTCGTCCACCACGATTTTTTGTTGCTGTGGCTGTTGCATTACCAACTGTTGTTGCTGACCAAGATGCTGTTGCTGCTGCTGCGG 
CTGTTGTTGTTGCGGTGGCGGCAGCACGATCTGTTGCTGTTGTTGAGTGGGTTGAGGCTGTTGCTGTTGAGGCTGGCCGGGCCAAGAGGTTTGCG 
GCCGCAATCCACGCGGAACAAATTGCTGCTGAGGCTGCGGCTGCGGCGTTGTTGGTCCACCGGGCAGGGGTGATTGCATCCCAACACTGGTGGGT 
GTGCCTGGAGTGCTGGACGCCATTTGACCGCCAGCAGGACTTTGCGGTGACCAATGCTGCTGCTGCATTTGTGGATGGTGTTGTGGAGACTGCTG 
TTGTTGCTGAAGCAACAACTGTTGTTGTTGCTGCTGCTGCGGAGGCGGCGTGCGTTGAGGCAGTTGTTGCTGCGTGATAAGCTGCATTATTTGTT 
GCTGCTGATTATCGTTGGACAGTTTTTGCTGCAGTATCTGCTGTTGTTGGGGCGTTTGCTGCTGTTGTATAAGGATTTGCTGTTGTTGTTGTCCC 
ACTAGCTGTTGTTGTTGTTGCTGCTGCTGCTGCTGTTGTTGTTGTTGCTGTTGTTGCTGCTGCTGTTGCTGTTTTTGCATTAACTGCTGTTGTTG 
TAGGTACTGCAAGCGCTGTGGATCACTGAGTATTTGCTGGTTAACAACAATCTGCTGCGGCTGCTGCTGCATTTGCATCCTCTGCTGTTGCTGCT 
GTTGTGGTATAATCTGCTGCTGCTGCAACATTTGTTGTTGCTGCACAAGTTGCTGTTGTGTCTGATTGACCACGTAAACGTTTTGGCTCTGCGGA 
TTTTGGGCTAGGATTTGCTGTTGCTGTTGTACGCTAAGCTGATTAAAGTGCTGCTGGCTAATTATAAAGGTCTTTTGTTGCTGCTGCTGTTGTTG 
TGGCTGCTGAACTCCAGAAATTTGCTGGTGTGAATCAAACGAAAATTTATTGGGGTAAATATTTAAAAAAAAAGACAAAAAACTAAGAGTAGGCA 
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TACCTGCTGTGGTTGTTGTTGCAGCTGCTGTTGCTGGGGAAATTGACCAATAATCTGCTGTTGTTGTTGTTGTGGAACCTGCTGCTGTTGCTGAA 

CATTGGTTGGTGGACGTTGTATGGTTTGTTGAATTATCCGCACTTGACCACCAGCTCCAAGTGCTATTTGCTGCTGTTGTGTAAACTGAACTTGT 

TGCGGTTGTTGTTGTTGAACTTGGTGCTGTTGCGGAAGCTGTTGCTGCTGCTGTGCCGGTGGCAGTTGCTGCACTTGAGTAATCTGTTGTTGCTG 

CTGCGGCTGCTGTTGTGCTTGTTGTTGAACGTGCAACAATTGCTGCTGCTTCTGCAAGGCCAAAGACATATGACTTTGAATGATTTGTTGTCCTT 

GAGCCGTGGATGTGTTTATAACATGGCGCTGAGTGATGACTTGTTGCTGTTGTCCAGGCTGTGGCTGTTGCTGCGTCAGGACCTGCTGTTGTTGC 

TGCTGCTGTTGTTGTTGCTGCTGCTGCTGCTGTTGTTGTTGTTGTTGCTGCTGTATTTGCTAAAAAGAAGTAATCATTAATTTCCGGTAAAAAAT 

AATATATGTTCTTTATTTTACTTGAGCGTTTTGTTGGTTTGGCTGCTGTTGCAAGGTGGGGGTAGTAGGTTGCTGTTGCAGTGGCGATTGTCGCG 

GAGAAGGGGTGTGCTGCTGTTGTGGTTGCTGCTGCTGCTGCTGTTGCGGTTGCTGTGCCATTTGCTGCTGCTGGGGCTGCGGTGCCATTTGTTGT 

TGCTGTTGCATCTGTTGTTGCTGCTTCAATTGTTGCTGCCTCATTATCAACGAGTCCTGATTGCTGCTCAATGCAGATCGACTGAGCACTTGAGC 

TACCTGAACGGGGTCCGTTGGGTCGATAGGCTTCTGCTGGGAGAATGATTGCTGCGGCTGAGTGGCTTGTCCAGGAACCACCTGATGCTGATGCA 

GCTTTGCCTGCATGGGTGGAGTTTGAGAAATCTGGGGCGATTGCGGAGAAATATGCTGGGGACTCTGCTGTTGCTGTTGTTGCTGCTGTTGGGCC 

AATTGTTGTTGCTGCTGCAATTGCATGCGCTGCTGTTGATGGACGGGCTGTTGAAAGGGCTGCATCTGTTGCTGCTGTTGCGGCGACATCATAGA 

TTGTGGTTGAGGTTGTTGTGGCGAGCCCATCATGTGCTGTTGTCGATTCTGTTGCATTTGCTGTTGCTGGCGCAAATGTTGTTGCTGGGCGTTTT 

GCGGCGAGGGCGGAGGTGGCGGAGTCATTTGGGGAGCTGGGGATTGCAGTTGTCGCGGCGATTGTGGCGGCAACATATTGCTAGTTGCATGATGC 

TGCTGTTGCTGCAACGATGGGGATGGCGTACTTGGTGGCATTACCGGCGAACGGTTTTGCACCATCCGAGGCGACTGTTGTTGTTGTAGATTCTG 

TTGCTGTTGTTGCTGCTGCATTTGCATCTGTAATTGCTGCTGCTGCTGCATAACTTGTTGCAGCTGTTGTTGAATTTGTCTCTGTTGCTGCTGCT 

GTTGCGGCGTCAATGCTTGTCCGCTTTGCACGATTTGCTGCAATTGCATTTGATTCTGGTTCAAAATTTGTTGCTGCCGCATTAGATGTTGGTGT 

ATGATTTGCGGCGAAACCGGCACAGATTGTTGTTGTGGCTGCTGTTGCTGAACAACTTGCTGCTGAACAATTTGCTGTTGCTGAACTATCTGCTG 

CTGCTGTACCATCGTTTGCTGCTGGCCCACTTGTTGCTGCACCACCTGTTGCTGTTGAACTATCTGCTGGTGGTGCTGAGGGGATTGCACGGGCT 

GCGGCGTTGTAGATGGTGTACGTGGTGTCTGCCGGCCTGTTGGAGTCGTTGGCAGTCGGGGCGACTGCTGTTGCTGAGGAATTTGTTGTGGATGT 

TGTTGAGAAAGTTGCTGTGGAAGTTGCTGTGCGATTTGTTGCGGGTGTTGTTGCGCAAGCTGCTGTTGAAGTGGTTGAGGATGCGGGCTGTTGC 

(SEQ ID NO: 1471) 

Exon: 6757.. 5627 
Exon: 5569.. 5039 
Exon: 4964.. 3368 
Exon: 3261. .1960 
Exon: 1898.. 1667 
Exon: 1604.. 1001 

Start ATG: 6757 (Reverse strand: CAT) 
Transcript No. : CT25344 

ATGGTACAGCAGCAGCAGATAGTTCAGCAACAGCAAATTGTTCAGCAGCAAGTTGTTCAGCAACAGCAGCCACAACAACAATCTGTGCCGGTTTC 
GCCGCAAATCATACACCAACATCTAATGCGGCAGCAACAAATTTTGAACCAGAATCAAATGCAATTGCAGCAAATCGTGCAAAGCGGACAAGCAT 
TGACGCCGCAACAGCAGCAGCAACAGAGACAAATTCAACAACAGCTGCAACAAGTTATGCAGCAGCAGCAGCAATTACAGATGCAAATGCAGCAG 
CAACAACAGCAACAGAATCTACAACAACAACAGTCGCCTCGGATGGTGCAAAACCGTTCGCCGGTAATGCCACCAAGTACGCCATCCCCATCGTT 
GCAGCAACAGCAGCATCATGCAACTAGCAATATGTTGCCGCCACAATCGCCGCGACAACTGCAATCCCCAGCTCCCCAAATGACTCCGCCACCTC 
CGCCCTCGCCGCAAAACGCCCAGCAACAACATTTGCGCCAGCAACAGCAAATGCAACAGAATCGACAACAGCACATGATGGGCTCGCCACAACAA 
CCTCAACCACAATCTATGATGTCGCCGCAACAGCAGCAACAGATGCAGCCCTTTCAACAGCCCGTCCATCAACAGCAGCGCATGCAATTGCAGCA 
GCAACAACAATTGGCCCAACAGCAGCAACAACAGCAACAGCAGAGTCCCCAGCATATTTCTCCGCAATCGCCCCAGATTTCTCAAACTCCACCCA 
TGCAGGCAAAGCTGCATCAGCATCAGGTGGTTCCTGGACAAGCCACTCAGCCGCAGCAATCATTCTCCCAGCAGAAGCCTATCGACCCAACGGAC 
CCCGTTCAGGTAGCTCAAGTGCTCAGTCGATCTGCATTGAGCAGCAATCAGGACTCGTTGATAATGAGGCAGCAACAATTGAAGCAGCAACAACA 
GATGCAACAGCAACAACAAATGGCACCGCAGCCCCAGCAGCAGCAAATGGCACAGCAACCGCAACAGCAGCAGCAGCAGGAACCACAACAGCAGC 
ACACCCCTTCTCCGCGACAATCGCCACTGCAACAGCAACCTACTACCCCCACCTTGCAACAGCAGCCAAACCAACAAAACGCTCAACAAATACAG 
CAGCAACAACAACAACAACAGCAGCAGCAGCAGCAACAACAACAGCAGCAGCAACAACAGCAGGTCCTGACGCAGCAACAGCCACAGCCTGGACA 
ACAGCAACAAGTCATCACTCAGCGCCATGTTATAAACACATCCACGGCTCAAGGACAACAAATCATTCAAAGTCATATGTCTTTGGCCTTGCAGA 
AGCAGCAGCAATTGTTGCACGTTCAACAACAAGCACAACAGCAGCCGCAGCAGCAACAACAGATTACTCAAGTGCAGCAACTGCCACCGGCACAG 
CAGCAGCAACAGCTTCCGCAACAGCACCAAGTTCAACAACAACAACCGCAACAAGTTCAGTTTACACAACAGCAGCAAATAGCACTTGGAGCTGG 
TGGTCAAGTGCGGATAATTCAACAAACCATACAACGTCCACCAACCAATGTTCAGCAACAGCAGCAGGTTCCACAACAACAACAACAGCAGATTA 
TTGGTCAATTTCCCCAGCAACAGCAGCTGCAACAACAACCACAGCAGCAAATTTCTGGAGTTCAGCAGCCACAACAACAGCAGCAGCAACAAAAG 
ACCTTTATAATTAGCCAGCAGCACTTTAATCAGCTTAGCGTACAACAGCAACAGCAAATCCTAGCCCAAAATCCGCAGAGCCAAAACGTTTACGT 
GGTCAATCAGACACAACAGCAACTTGTGCAGCAACAACAAATGTTGCAGCAGCAGCAGATTATACCACAACAGCAGCAACAGCAGAGGATGCAAA 
TGCAGCAGCAGCCGCAGCAGATTGTTGTTAACCAGCAAATACTCAGTGATCCACAGCGCTTGCAGTACCTACAACAACAGCAGTTAATGCAAAAA 
CAGCAACAGCAGCAGCAACAACAGCAACAACAACAACAGCAGCAGCAGCAGCAACAACAACAACAGCTAGTGGGACAACAACAACAGCAAATCCT 
TATACAACAGCAGCAAACGCCCCAACAACAGCAGATACTGCAGCAAAAACTGTCCAACGATAATCAGCAGCAACAAATAATGCAGCTTATCACGC 
AGCAACAACTGCCTCAACGCACGCCGCCTCCGCAGCAGCAGCAACAACAACAGTTGTTGCTTCAGCAACAACAGCAGTCTCCACAACACCATCCA 
CAAATGCAGCAGCAGCATTGGTCACCGCAAAGTCCTGCTGGCGGTCAAATGGCGTCCAGCACTCCAGGCACACCCACCAGTGTTGGGATGCAATC 
ACCCCTGCCCGGTGGACCAACAACGCCGCAGCCGCAGCCTCAGCAGCAATTTGTTCCGCGTGGATTGCGGCCGCAAACCTCTTGGCCCGGCCAGC 
CTCAACAGCAACAGCCTCAACCCACTCAACAACAGCAACAGATCGTGCTGCCGCCACCGCAACAACAACAGCCGCAGCAGCAGCAACAGCATCTT 
GGTCAGCAACAACAGTTGGTAATGCAACAGCCACAGCAACAAAAAATCGTGGTGGACGAAAAAACGCATGCACACCTCATGTCATTGGATGCTCA 
CAAAAGAGCCGAATATATTACTAAGTTGAATCAGCAGAACAAACCTCGAGTAATGCTTCGCCAACAGGTTGCCTACCAGCCGCGACCTGGAGCAC 
CGGGCAGCATAGTAGCGACGCGACCGGCAGGTCCGGTTCCGCCCGGGCACATTATTGTCCAACACCAAATGCCACCCGGATTAACACAGCAGCAA 
CAAGTCCTCTGGCTGCAACAGCAGCAGGGAAAACGACAGGTTGTTGTACGCGCCGGAACACCTGGCTTGAGTCCGATTCAACAGCAGCCAGGAGC 
GGGACCACAGCAACAGCAGCAGTCTCCTCAACAGCAGCCCTTCAACCCCAATGATCCCAACCAGCAAATGCTGATGCAGCGGCAGCAGTTGCGAG 
TCCAGCAGATTCCCATTCAGCAACAGGGACCTCAGCAGCTGGGCAAGCAAACAGTGCAGCTTATCACCGGACCCAATGGTCAGCTGATTGAACAA 
CAGACCATCGTCCAACAGCCGCAGCAGCAGTCCGCACCAGGCACACCTACAACTCCAGGAGCAGGAGGTGTTCTTCCCGGAAATGCCAACATAAT 
GACGCAAACTTTGGTCATGACATCAACGACTCCCGATGGACAGCCGCAACAATCACCGCAGCAAATGAACCTCAAAACGAAGACAGCCCTGGCCA 
ACATGTTGAGCAATCGCTTAGGT^AATAATGGAGCTCAAACACCACAACAGCAAATAATTCAACTTCCCGATGGACAGCAGCCACAGCAACAGCAG 
ATTGTACAACAGGTGGTGGTGACCGGAGCCATTCAGCAGCAGCAGCAACAATTACTATTACAACAGCAACAGCAGCAGCAGCAACAACAACAAAT 
GGTTGTCCAACAGCAATTACCACCACAGCAACACGTCGAGCCACCGTCGGCAGCTGGTGCTCTTAGAATGATGGGCCAGCAACATAACGCAACAG 
CCGGAGTTCCTGGACCACCGCGAACTCAGCAGGAGTTGCTCATGTTGCAGCAACAGCAGCAACAACAACTGCTGCAACAGCAGCAAATGCGCAGA 
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GTTGTCGCCCTGCCACAACAGCAGCAGGCACCGCAGCAACATCTAGTCCAACAACAGATCGTAGTCGCTCCGACACAGTCCCCCCAGCAGCAGCA 
ACAGATGGCTGTGGGAGTGGGCGTGGGTGTGCCAGTGCAGCGGACTCCTCACGGCTACATTCAAACTCGTCCTGGTCAGCAACCGATCCCTATGC 
CGCAGTTCTACGGGCACAATCCGAACCTAAAGTTACCAGCCGATCTCTTTTTGGTGGGCTGTACGTTCTACATTGTGGAATACGATGAAACAGAC 
GGTGATGAACTGCCCATTTGGCTAGATACGATTAGACAGTTTGGCGGTGACATCGAACGTGTTTACTGCCCCCGAGTAACCCATGTTATTTGTCG 
CACTCAGCGTCATGGCGTGGTGATGCAGGCGTTGCGGGATGCCAAGCGCTGCGTCACTGCCTACTGGCTGAGTGATATTTGCTTAAAGCGCCAAC 
TGATGCCACCGTGGCAGCCGTTGCACCTACCGTTCCCCAGCCAGTTTGGCTATCGTAAGCCCCTAGAACGCTACATCATAACCTCCGAGGGATTT 
GAAGGCGAGGAGGTTGTGAGACTACAGCAGATGGCAGAGGAGTGCGGTGCCATCTACACATCCTATCTGTCCAAGGTGAACACAGTGGTCGTGTG 
CAAGCAGCTCGAGGGCAACAAGTTTAATGCCGCCAAGGAGTGGAACATACCGATGGTCAATGCGCTTTGGTTAAGCGACGTGTGCATCGGTAACC 
TTAGTGGCCTCTCCCAGTATGAGAATCCCAAGTATCAGCAGTACAACTTGGTCGCTCCCTTCCGCATCGAATGCAATCTGGTGGCTCATCTGCTA 
ACTGCTTGGAAGGCGCCCATCAATCTAACTCAGGAGGCACACGAACGGGTCAAGCGACACTTATCCGATCCTTACGGCAACGAGCAAAAACTTAA 
GCGCCAGAAGATGAACGCATTCCAGCAGGAGCAGCTGCCAGAGCAAATTGTCTGCGTAGAATATCC7VACTACCACCAAACCACCAAAGGTCATAT 
TTTCCCAGGTCGCAGATGCCGAGGCTCTTAAAAAAGCTGTGTTAATTTTGGGTGGAATTGTGGTGGACAGCCCTGCAGATGCCACTCACCTGGTA 
ATG ACAC GC GAG AG TCGT AC ATGC AAGC TAAT C C A GGC CT GC T GC C AC GT C GAT T AT GT GCT G AAGT C G AGCT G GATT GC T G AT AGT GC C AAAGC 
TGGTAAATTTGTGCCTACAGATCCATACCGCATTCAGCACATACCGGTGGACGAGAACCTGCAGTTCAATCTGAATACCGTGCTGTGCGCCCCCA 
CTCGTTCAACGCTGTTTGCCGGCAAATACTTCCATGTGACGCCGGACGTGTTCCCAGCTCGCGAAGAAATTATACGAATGATTGAATCGTCTGGT 
GGTAAAGTGGAGCCGAAGCGGAGAAGCGGTGCATCTGTGGCGGAAACACACATGCAAGCGCCCGATTCCTACATCATAGTAACCTGTCCCACGGA 
CATGCATCTCTGTGCGGATCTCACGCGGCACGGAAATCCCAAGTGTCATATCGTTTCAACGGAGTTCGTCATGAGCTCGATACTCAGGCAGCAGC 
TGGAGATTGAGCCCAATCTTATACCATACTTGTACAACAATAACAATGTCAATAGCTGCAATAGTAATAAGTCCTAG 
(SEQ ID NO: 1472) 

Start ATG: 1 (Reverse strand: CAT) 

MVQQQQIVQQQQIVQQQVVQQQQPQQQSVPVSPQIIHQHLMRQQQILNQNQMQLQQIVQSGQALTPQQQQQQRQIQQQLQQVMQQQQQLQMQMQQ 
QQOQQNLQQQQSPRMVQNRSPVMPPSTPSPSLQQQQHHATSNMLPPQSPRQLQSPAPQMTPPPPPSPQNAQQQHLRQQQQMQQNRQQHMMGSPQQ 
PQPQSmSPQQQQQMQPFQQPVHQQQRMQLQQQQQLAQQQQQQQQQSPQHISPQSPQISQTPPMQAKLHQHQVVPGQATQPQQSFSQQKPIDPTD 
PVQVAQVLSRSALSSNQDSLIMRQQQLKQQQQMQQQQQMAPQPQQQQMAQQPQQQQQQQPQQQHTPSPRQSPLQQQPTTPTLQQQPNQQNAQQIQ 
QQQQQQQQQQQQQQQQQQQQQVLTQQQPQPGQQQQVITQRHVINTSTAQGQQIIQSHMSLALQKQQQLLHVQQQAQQQPQQQQQITQVQQLPPAQ 
QQQQLPQQHQVQQQQPQQVQFTQQQQIALGAGGQVRIIQQTIQRPPTNVQQQQQVPQQQQQQIIGQFPQQQQLQQQPQQQISGVQQPQQQQQQQK 
TFIISQQHFNQLSVQQQQQILAQNPQSQNVYVVNQTQQQLVQQQQMLQQQQIIPQQQQQQRMQMQQQPQQIVVNQQILSDPQRLQYLQQQQLMQK 
QQQQQQQQQQQQQQQQQQQQQQLVGQQQQQILIQQQQTPQQQQILQQKLSNDNQQQQIMQLITQQQLPQRTPPPQQQQQQQLLLQQQQQSPQHHP 
QMQQQHWSPQSPAGGQMASSTPGTPTSVGMQSPLPGGPTTPQPQPQQQFVPRGLRPQTSWPGQPQQQQPQPTQQQQQIVLPPPQQQQPQQQQQHL 
GQQQQLVMQQPQQQKIVVDEKTHAHLMSLDAHKRAEYITKLNQQNKPRVMLRQQVAYQPRPGAPGSIVATRPAGPVPPGHIIVQHQMPPGLTQQQ 
QVLWLQQQQGKRQVVVRAGT PGLS P I QQQPGAGPQQQQQS PQ QQ PFN PND PNQQMLMQRQQL R VQQ I P I QQQGPQQLGKQT VQL I TGPNGQL IEQ 
QTIVQQPQQQSAPGTPTTPGAGGVLPGNANIMTQTLVMTSTTPDGQPQQSPQQMNLKTKTALANMLSNRLGNNGAQTPQQQIIQLPDGQQPQQQQ 
IVQQVVVTGAIQQQQQQLLLQQQQQQQQQQQMVVQQQLPPQQHVEPPSAAGALRMMGQQHWATAGVPGPPRTQQELLMLQQQQQQQLLQQQQMRR 
VVALPQQQQAPQQHLVQQQIVVAPTQSPQQQQQMAVGVGVGVPVQRTPHGYIQTRPGQQPIPMPQFYGHWPNLKLPADLFLVGCTFYIVEYDETD 
GDELPIWLDTIRQFGGDIERVYCPRVTHVICRTQRHGVVMQALRDAKRCVTAYWLSDICLKRQLMPPWQPLHLPFPSQFGYRKPLERYIITSEGF 
EGEEWRLQQMAEECGAIYTSYLSKVNTVVVCKQLEGNKFNAAKEWNIPMVNALWLSDVCIGNLSGLSQYENPKYQQYNLVAPFRIECNLVAHLL 
TAWKAPINLTQEAHERVKRHLSDPYGNEQKLKRQKMNAFQQEQLPEQIVCVEYPTTTKPPKVIFSQVADAEALKKAVLILGGIVVDSPADATHLV 
MTRESRTCKLIQACCHVDYVLKSSWIADSAKAGKFVPTDPYRIQHIPVDEMLQFNLNTVLCAPTRSTLFAGKYFHVTPDVFPAREEIIRMIESSG 
GKVEPKRRSGASVAETHMQAPDSYIIVTCPTDMHLCADLTRHGNPKCHIVSTEFVMSSILRQQLEIEPNLIPYLYNNNNVNSCNSNKS* 
{ SEQ ID NO: 1473) 

Classification: hypothetical 



Celera Sequence No. : 142000013384150 

CATTAATTGTAATTATTTGATTGGGTATTTTCGATAAGCTACAAATTCAGGGATTTCGTTGTATACCTTTTAGATTTTCATGCTGCTTGATTACA 
TATACTTTTTTTGTTGCTGCCACTTTTGTGCACTTTTTTTTTCACACACAAACTCGAAACAGAAAATGCCATTAATGATACTGTTGTTGTTGTTG 
TTGTCTTGAGCACTGGAAATTTTCGTTACGCTTTGGCAGACATTACGAGCCACGCTCCGCATCCGCGACCCCCAATCCACAACATCCGCAATCCG 
CTAGCGATATGATGATGGCTATATCTATATTGTTGTTGCCGCGGTTTTTTTTTTTTGGACTACTGGGATCCGAACTCGGACTCGTCTATCTTTGG 
TTACACGTAGTTTGCGACTGAGCTGAGAGCACCTGGCGAAGCACACGCCGAACTCAAAACTGAAGCTGGTCCCGTCTTTTGGCCAAGAGCCTCAC 
TCTCGCTCTCACTCTCTCTCTCTCTCTCTGCGGATGTTAACAAAGGCAGGCGGCTGCAGCGGAGAGCCAGGTAGCTTTGATTTTTTGACCAAACG 
ATCTACCTGGGTAAATAGCACCACCGAGGCCCATGTTAAAAAGATCCAGCGAATATCGTAGCATTGCTATACCTGTACGGATAAGTAAGAATCTC 
TTTGACACCATAGAGTTCTGATCGAGACTCGTACTCCCTCCCAATTCCAATTGATATTAGAAACACTACTGTTGATGATCAATCGCCTGACAGTC 
ACAGTTACAGTTCCGATTTTACAACTATTTAAACCAAGATCAAAAGATCTTAAGATCTTAATAAAAACTAATGTCTCGATCATTCACACTTTAAA 
CATCTAACTGTAAACTGTTTGGTGATAGCGACAGTTGTAAAAAAAAGCAATCCAATTTGACCAAGCCCAATGTTTAATTAACCAGTAAGTGATTG 
ACTTTAAAATGGAGAGAAAGCCAATTTATTTTATTTACTACTATTTACAGTCAATGTTTGCTCTTCTTCTTCTTCTTCTTGGCCTTGTGCTTGGA 
TTCTTCGCTCTTTTTCGATTTTTTCTTGGACTTCAGCTTCGATTTCTTAGACTTTTTCGAGCTGGAGCCCTTGTGCGATTTCTTGGGCTTACTGA 
GTTTGGATCTCTTGTCATTGTCATCAGAAGAATCGGATGATGAGGATTCGTTGCTAGAACTAGCATCGCTATCTTCGCTGCCGGATATGCATTTC 
TTTTCCACCCACTTTTCAGCTTTGCGCTTTTTAGGCGCGTGCTGCTGCCAAAGCTGTTGAAGTCTGTCGTCAATATTCGCCTCTGCCTGAATCTC 
TGGAGCAGCCGGCTTTTGAGGAGGTGCTGTCGGTAACTTTGGTCCATAAATATCCTCTGGCGCCACCTCAGCCATTGCCAACTCCCTATCGTTTC 
TCAGTCGCTCCTTTCGTGGTTTGTATGCGATTTTCAGTTTGCTTCCCTCGATGGGACCAAACTTGGTCGGTGCTGGTTCTTTGGCAGCATCCTTT 
ATCTTAGGACTAAATAGGGAAGCAAAGATTCCTCTGGGTGGCGAAGTGTTTCTCAGGACATTTTGGGCGGCAGCTGAGGTGGTCGATGTTACGGG 
TAAGCCAAAAGACTCTTGCAGGATAGCTAATTTATCCTGCTCGGATAGTTCCGTCTCTTCCTCATCGCTGTCCTCGAAAATGCTTTTGAACAGGT 
CCACTTTCTCTGAGATGGGTTTGTTGCGGGATTCATCTACAGCCTGCTCCAGGGGAGTTTTCGGCACGAAAGTATGCTTAGACGGCTCTTCTTTT 
GTATTAGGTTTATCCTGTATCTCAGCTTCTGGTTGAGGAGCTGGTGGCGGGGATGGTGGTGTTTCTACCTTTTCCGGAATTGATTTTTCGATGTG 
CTTGGGAAAAATAGAGGGGGTTTCGAAGTTTGCTTTAGTGTTTACGGATGTCTCTAGATAATCAAATACTGAAATTTTTGCCTTGGCTTTCAGTT 
CCTTTTCTGGTTCCAGCATGGCACCACCAAAGGGCTCGGCTATATTGTACCGCTTGCACAGCAGTGCGGTGGGTTTCCACATGGTCTTGGTCCTC 
TCCATCACGATCTTTCGCTCCTCCGGCGGCTTCTCTTGCTGCTCTACTTTCTCTGCTTGAACATTGGCTTCCGAAACGAACCTGTCATTCATAAG 
GCCAACTAATGGTCGATATATCTTTGCCGCCTGAATAAACTCCTTCTTCTCCATCTCCCTATCCCACAGAGATAGTGTAACTGGTTGCATTCTCG 
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CCAGGAACTCCGTAATTTCCTTGTCATCCGTAAGCTTTGATGATACGAATTTCTCATATCGCAGCTGCTTAGCTTCATCAGCTATAAAAGGTTTA 

AAAGCGCTTGAAGAATTCATAGCAGAAGCTGAAATAATTGAAATGTTATAATATAATAATTTCTAGGCACTGTAAATTTTGCTTACGCAAGTCCT 

TCGTCTTGAGGGCTGCTCTCTCTTGTATGGCGGCATTTGCCTCCTTTACTTCCTTGGCTAGTTCAGTTTGCTGCTCCTCGCCATTTTCGGTTATC 

ACTCCGCCCTTGGTAAAGCCCTCAGTTCTGGCATTAATGCGTTCCATTAAAGATTTACTATGATCCTTAAAGGGATTTCGTTTTGGCTGCTCTTC 

CGCAGTCTTTTCCTCTTGTTGTTGCTCGCCTAAGATCTGCGCACGTTGATCCGGATTGAGATCGTGTCTTCCCAGACCAGACCGTTTGTACTCTG 

AGGCTGTCTCCAGTTTCTGGGCTCTTTCTTTGTCCATCGGTGCAAAGCGGCTTCGTCGCTGCAGCCAGTTTCTGGGCTGAAAATCCCTAGGTAAA 

TCAATGGCATAAGGCCTCTGCAGCACTGCCGGCGAGTTGTCCTCACTGAAGCCATCGATGACATGACGCTGTTGGACATGCTGCTGCTTTTTTTG 

CTTGGGTTTCTTGTCCGCCAGGGAGAAATCGTAGCGGGTCATATCATCGCGGGCATATATATCCTCGTCCTCTTCCTCAAAGGCTCCCACTCCGA 

AGGCTTGACCTCGTATGGATAGTTGCTTTTTGTTAGCCTGCGCCTCCATCTCACCAAATAGATTGATGTGTTGCATAGGCTTGGCAGAAGAAGAA 

GACGATTTGGACAGAATGGGATCACGGTTGAGACCGGAGTAGCTCATGCCAAAGCGGTTCTCCTTGGGTGTGTAAAAGATCGGCTCGTAGTCGTC 

CGGAGCAAATGTAATATCTTCATCGTCTTCGTCCTCATTATTGCTATCTTCTTCTCCTTTGTTCTCTTTATTAGATGGTAGTCCCTCCGCGCCGT 

AGTGTTCCATAAGATACTGCTCTTTGGAATTTCTGGCCGTAGCTTGGCGCTTCTCCTTTCGAGTTTGCCGCGGGCCTACACCTTGGCCAGGTTTC 

CAGCCCATGCTTTTAAGTATTCGCACGGCCACTTTATCTCGAACCGGTCGTAGAAGTTGCTCCAGCACTGGAAGACCTGGTATGGCCCCGACACC 

CAACTCAGGCTGCATTAACTTGCGGCGCCTCTGATCAGAACGTTGCTTCTGCTCGTCCTCATTGGCGAACTCATCACGCGTGCGAATGCCTTGCG 

GTGCGATGCCAAACTCGCCCAGATCCTCCTGGTCCATAAAGTCTTCCGGTTTCAGCTGAGCCTTTGACGACGCCCGCTCGCCCCGCGAACTTTTA 

AAGGTCTGCGGCGTCCAGCCCTCCAGTGAGCCAACCGTGTTCCAGAAGCCCGCACTGAAGCCGCCGGTGAAGGCTCCATGGAATCTCCGCTTTCC 

ATTTTCGTCCTTCACAATTTGATCCTCGATGGCCACTGGCTTTTTAGCCGGCACAACATCTGTAAAAGAGGTTTTAATGGTATTGGGAAAGCTGA 

TAAATATTTATTACAGTACTCACCCTTTTGCAAGGCTGGCAGCGGAGTTCCAAATCGGTGTAAATGCTCCTCCTCGTCCATTCTATGCTACAAAT 

TCCATATATTAGCGAAACAATTTGTTTGGTCTGGGCGCCAGTGAGTGCACACCACATTACAGATGACAGCTGACTGGAGTGTTCGCCTATCGGTC 

ACGTTTTGCTGAAAACTGTGACCGGAACTTTAATACGAAATTATTAAAAAAAATTAAAACAAGAAATAAGCTAATTTATTGAAATACTAAAATAC 

TAAAGTTTTAAAATTAAAAATAAAAAACTACAGTAATACTTTAAAAAATGTTACAAATTATGTTAATGTTAATTTTTAAAGCCGGCGGCTTACTT 

GTCTGAACTAAATTATTAAAATACCGTTTACTTAATTATTTATCAATCTGGTATTAAAAAAAAACTTGTGAAGAAATAATAAGCAATATTTCAAT 

TACTATTGATGTTTAACTGACCATTGCAATCACGGCTTTGAGTAGCGATATTATTTGCAATCCGACAGGTGAGTTCTGCCATCTCTAGTTCAACC 

G C AC GGTC AC ATTT AAGAC AT TGC AAAT AT T T T A GAGGAA CT AT T T C GT T GC AAT T T T C AGCT GC CT GC AAC GG AAAT AC AGT GC AAAT GGT AAG 

CGCAACGGCGGTGACAAAGAAAGGCACACCTATCCGATTGGAAAACCAGGTAAATACCGAGAATAAGACCAGAACGGGAGTCGCGACAGACTGGA 

GCTACCTGTGCTGCCTGTATGGGTGACTAATGATGTCAACATTTTCTGTTTTCTCTCCTCAAGCCGACGGCCATTTTTGGCTAGCTAAGACGTGA 

TTATTGGCCCGAAAGCACAAGAAGTGCAGCGCAGTAAGTAGGCTGGCATTCCGTGTCCAGGGAACCCGATAACCACAGCTCCATTCCCAACGAAC 

AGCAATCATAAGTACATCAGTTATACCCACATCCTGGTCTCATACACGCACACAAACACAGCGAGAGCGAGATGTCCGAGAAAAACCTGAAAGTG 

GGCGCCCGGGTCGAGCTGACCGGCAAGGATCTGCTTGGCACGGTTGCCTACGTGGGGATGACCAGCTTCGCCGTCGGCAAGTGGG 

(SEQ ID NO: 1474) 

Exon: 4025.. 3919 
Exon: 3859. .2462 
Exon: 2403.. 1001 

Start ATG: 397 6 (Reverse strand: CAT) 
Transcript No. : CT25382 

CCCAGACCAAACAAATTGTTTCGCTAATATATGGAATTTGTAGCATAGAATGGACGAGGAGGAGCATTTACACCGATTTGGAACTCCGCTGCCAG 
CCTTGCAAAAGGATGTTGTGCCGGCTAAAAAGCCAGTGGCCATCGAGGATCAAATTGTGAAGGACGAAAATGGAAAGCGGAGATTCCATGGAGCC 
TTCACCGGCGGCTTCAGTGCGGGCTTCTGGAACACGGTTGGCTCACTGGAGGGCTGGACGCCGCAGACCTTTAAAAGTTCGCGGGGCGAGCGGGC 
GTCGTCAAAGGCTCAGCTGAAACCGGAAGACTTTATGGACCAGGAGGATCTGGGCGAGTTTGGCATCGCACCGCAAGGCATTCGCACGCGTGATG 
AGTTCGCCAATGAGGACGAGCAGAAGCAACGTTCTGATCAGAGGCGCCGCAAGTTAATGCAGCCTGAGTTGGGTGTCGGGGCCATACCAGGTCTT 
CCAGTGCTGGAGCAACTTCTACGACCGGTTCGAGATAAAGTGGCCGTGCGAATACTTAAAAGCATGGGCTGGAAACCTGGCCAAGGTGTAGGCCC 
GCGGCAAACTCGAAAGGAGAAGCGCCAAGCTACGGCCAGAAATTCCAAAGAGCAGTATCTTATGGAACACTACGGCGCGGAGGGACTACCATCTA 
ATAAAGAGAACAAAGGAGAAGAAGATAGCAATAATGAGGACGAAGACGATGAAGATATTACATTTGCTCCGGACGACTACGAGCCGATCTTTTAC 
ACACCCAAGGAGAACCGCTTTGGCATGAGCTACTCCGGTCTCAACCGTGATCCCATTCTGTCCAAATCGTCTTCTTCTTCTGCCAAGCCTATGCA 
ACACATCAATCTATTTGGTGAGATGGAGGCGCAGGCTAACAAAAAGCAACTATCCATACGAGGTCAAGCCTTCGGAGTGGGAGCCTTTGAGGAAG 
AGGACGAGGATATATATGCCCGCGATGATATGACCCGCTACGATTTCTCCCTGGCGGACAAGAAACCCAAGCAAAAAAAGCAGCAGCATGTCCAA 
CAGCGTCATGTCATCGATGGCTTCAGTGAGGACAACTCGCCGGCAGTGCTGCAGAGGCCTTATGCCATTGATTTACCTAGGGATTTTCAGCCCAG 
AAACTGGCTGCAGCGACGAAGCCGCTTTGCACCGATGGACAAAGAAAGAGCCCAGAAACTGGAGACAGCCTCAGAGTACAAACGGTCTGGTCTGG 
GAAGACACGATCTCAATCCGGATCAACGTGCGCAGATCTTAGGCGAGCAACAACAAGAGGAAAAGACTGCGGAAGAGCAGCCAAAACGAAATCCC 
TTTAAGGATCATAGTAAATCTTTAATGGAACGCATTAATGCCAGAACTGAGGGCTTTACCAAGGGCGGAGTGATAACCGAAAATGGCGAGGAGCA 
GCAAACTGAACTAGCCAAGGAAGTAAAGGAGGCAAATGCCGCCATACAAGAGAGAGCAGCCCTCAAGACGAAGGACTTGCCTTCTGCTATGAATT 
CTTCAAGCGCTTTTAAACCTTTTATAGCTGATGAAGCTAAGCAGCTGCGATATGAGAAATTCGTATCATCAAAGCTTACGGATGACAAGGAAATT 
ACGGAGTTCCTGGCGAGAATGCAACCAGTTACACTATCTCTGTGGGATAGGGAGATGGAGAAGAAGGAGTTTATTCAGGCGGCAAAGATATATCG 
ACCATTAGTTGGCCTTATGAATGACAGGTTCGTTTCGGAAGCCAATGTTCAAGCAGAGAAAGTAGAGCAGCAAGAGAAGCCGCCGGAGGAGCGAA 
AGATCGTGATGGAGAGGACCAAGACCATGTGGAAACCCACCGCACTGCTGTGCAAGCGGTACAATATAGCCGAGCCCTTTGGTGGTGCCATGCTG 
G AAC C AGAAAAGG AAC T GAAAGC C AAG GC AAAAAT T TC AGT ATT TG AT T ATC T AG AG AC ATC C GT AAAC AC TAAA GC AAAC TT C G AAAC C C CC TC 
TATTTTTCCCAAGCACATCGAAAAATCAATTCCGGAAAAGGTAGAAACACCACCATCCCCGCCACCAGCTCCTCAACCAGAAGCTGAGATACAGG 
ATAAACCTAATACAAAAGAAGAGCCGTCTAAGCATACTTTCGTGCCGAAAACTCCCCTGGAGCAGGCTGTAGATGAATCCCGCAACAAACCCATC 
TCAGAGAAAGTGGACCTGTTCAAAAGCATTTTCGAGGACAGCGATGAGGAAGAGACGGAACTATCCGAGCAGGATAAATTAGCTATCCTGCAAGA 
GTCTTTTGGCTTACCCGTAACATCGACCACCTCAGCTGCCGCCCAAAATGTCCTGAGAAACACTTCGCCACCCAGAGGAATCTTTGCTTCCCTAT 
TTAGTCCTAAGATAAAGGATGCTGCCAAAGAACCAGCACCGACCAAGTTTGGTCCCATCGAGGGAAGCAAACTGAAAATCGCATACAAACCACGA 
AAGGAGCGACTGAGAAACGATAGGGAGTTGGCAATGGCTGAGGTGGCGCCAGAGGATATTTATGGACCAAAGTTACCGACAGCACCTCCTCAAAA 
GCCGGCTGCTCCAGAGATTCAGGCAGAGGCGAATATTGACGACAGACTTCAACAGCTTTGGCAGCAGCACGCGCCTAAAAAGCGCAAAGCTGAAA 
AGTGGGTGGAAAAGAAATGCATATCCGGCAGCGAAGATAGCGATGCTAGTTCTAGCAACGAATCCTCATCATCCGATTCTTCTGATGACAATGAC 
AAGAGATCCAAACTCAGTAAGCCCAAGAAATCGCACAAGGGCTCCAGCTCGAAAAAGT 
AAAG AGC G AAG AAT C C AAGC AC AAGGC C AAG AAG AAG AAG AAG AAG AGC AAAC AT T G A 
(SEQ ID NO: 1475) 

Start ATG: 50 (Reverse strand: CAT) 
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MDEEEHLHRFGTPLPALQKDVVPAKKPVAIEDQIVKDENGKRRFHGAFTGGFSAGFWNTVGSLEGWTPQTFKSSRGERASSKAQLKPEDFMDQED 
LGEFGIAPQGIRTRDEFANEDEQKQRSDQRRRKLMQPELGVGAIPGLPVLEQLLRPVRDKVAVRILKSMGWKPGQGVGPRQTRKEKRQATARNSK 
EQYLMEHYGAEGLPSNKENKGEEDSNNEDEDDEDITFAPDDYEPIFYTPKENRFGMSYSGLNRDPILSKSSSSSAKPMQHIKLFGEMEAQANKKQ 
LSIRGQAFGVGAFEEEDEDIYARDDMTRYDFSLADKKPKQKKQQHVQQRHVIDGFSEDNSPAVLQRPYAIDLPRDFQPRNWLQRRSRFAPMDKER 
AQKLETASEYKRSGLGRHDLNPDQRAQILGEQQQEEKTAEEQPKRNPFKDHSKSLMERINARTEGFTKGGVITENGEEQQTELAKEVKEANAAIQ 
ERAALKTKDLPSAMNSSSAFKPFIADEAKQLRYEKFVSSKLTDDKEITEFLARMQPVTLSLWDREMEKKEFIQAAKIYRPLVGLMNDRFVSEANV' 
QAEKVEQQEKPPEERKIVMERTKTMKPTALLCKRYKIAEPFGGAMLEPEKELKAKAKISVFDYLETSVNTKANFETPSIFPKHIEKSIPEKVET 
PPSPPPAPQPEAEIQDKPNTKEEPSKHTFVPKTPLEQAVDESRNKPISEKVDLFKSIFEDSDEEETELSEQDKLAILQESFGLPVTSTTSAAAQN 
VLRNTSPPRGIFASLFSPKIKDAAKEPAPTKFGPIEGSKLKIAYKPRKERLRNDRELAMAEVAPEDIYGPKLPTAPPQKPAAPEIQAEANIDDRL 
QQLWQQHAPKKRKAEKWVEKKCISGSEDSDASSSNESSSSDSSDDNDKRSKLSKPKKSHKGSSSKKSKKSKLKSKKKSKKSEESKHKAKKKKKKS 
KH* 

(SEQ ID NO: 1476) 



Celera Sequence No. ; 142000013384150 

ACATTTGTGTCCAATGGATCCTCGCAACCAACTATCTCGAAGGGTATTAATGGGTTCATAGTACCTCTTCTGCAATGGGATCTACTCAGATTTGA 
GTTTGATTTATCGAGCTGGAGGAGCTGGGAAAAGAACTGAAAACTCTGCTCACACTTGTGACGGAAGTGAAAGGCATTTTGAGCGGCCACCAGGC 
AGGTAGGACAGATCTTTTGTGGCATTTCATCCCGTTTCACGCGACAGATCTCCGGCAGAATTGAGTTCAACATTTCCCAAATGGGGGGATCAGAG 
ATCTCCTTGGAGAAAAGGTCCACCAGTATGGATGAGTGTTCGGCGCAGGCTCGGCAGTTCTTTCCCATGCTGGCGTCCACTGATCTTTCCATTTT 
TCCCGATGATCTCCCAACCATAGAGGCCGGAATTTCAAAAGAATCAACAAATGTTGGGCACTTGCTCTCCTGCCAAGTAGTCAATGTGAACGTAT 
TCGCACAGAGTTGTCAGCTGCATCCGATGATTACTCGATAGTCGATAAGGTTTCATATGCCATCTCCATTAACGTCGTGAGGTTAACTGTGACTG 
GGAACAAATCTGCGGCATTCAAAACAATGGTGTCGATAGGTAGGATTCCAATCATATTTAAATCAAAATGTATGTAAATCTATATTAAAGGCATT 
AAC AAT GAAAAT AAG AAT ATT AG AG AT AAT T T T AATT T T AAT AAAT AAT T T AC AAAT C AAGT AAAGTT AC AAAT C AGT CTGT ATT AT AAAAACGT 
TTAATTAACATATAATCTTTAACAAGCGTGCATGCAATGATTTAGGTATTAAAGGTTATCATTTTAATATTTATCTTCATATTTAAGTTGAAATT 
TGCTAACTTTGCCAACAGCTGCAATAATTACTTTCACTTTCGCTTTCACAGTCACAAGATTGACTCATCGGCTTAAAAATACCAGTGACTCGCCA 
CGGAATCGCGGATATCGCTCATCTCTATCAGCTTTGAGCAAAATTAGTATCTTGTATTGGCCAAACAGACAAGCGAGCGAAGGCCTTTTTGTTGT 
CGTTTCTGGCAATGATTTAGGTCTTCATCAAGTGTTATTAAGCTAGGCATAAAGTTTTACCCATCGCTGGACATGGATATCCTGGACATCGATGA 
GGCACTAAAGGATGATTCATATATGTGAGTGAAACCACTCCTGTTGTTTTTGTTCCTGTTGTTGTTGCTGCTGCAGCTGCTGCTACTGCTGCTGG 
CACTTCACAGGCAGAGGCATCGATGCTATAAATACTCATATGCCTGGTACTCCTTGGCAGATTTCTGGCGGATAAGACCCACGACGATGTCTCCA 
ACATATTGCAGCAGTGGAAGACGAACAATCAGCCGCCGTTGCAGCAATTGCACTTGCAGTACAACAACGAGGGTAAGCCTTTTCCACACATACCG 
ATGCACCCGATCACAAACATACACACATGTGCAGCACATCATGCAGTTGCGCTTTCTAAATCAAATTTGGGTTAAACTGGCGGCTACGTTGACAA 
AGTTCAAGTCGCATGTGGGCAGCATTACTTGGCCAGTAGTTACCGTTGTCTTTCAAAGTGGCCGATTGCCATAATTGGCAACAGCGCCAATGTTA 
TTGTTACGGTAATTAATCAAGTCTCAATACGCACACACACACACGCGTTCCCATTGCAAACAGCTGTGCGTGGTACGGCAATAATATTGAAGGAA 
AAAAACTGGCTATGCTAATATGGCAAGAATCTCGCATTATACGCAAACATTATTATATAAATCCAATTTACTTCCTGCCCAGCCAGCGAAAGTCA 
ATCAAGCAATACGCAAATAAACTATTTGCGTACAATTAAATAAACAATTACCACCGAACTAAAGACGAGGATCAAAAATCGATTAGGCGATTTAA 
TTTGAGTGAAATATTGTCATGCTGAGCTATTTTTAATGATTTTCCAAAGTGGAATGCCAGAATTCCAAATATTGCTAAAAATATAAGTTAAATAA 
AACTATTTACAACAAAGTTCATACTTAATAGTCATTCCTGTAAATGTATGCAATGGTTAAACTCAGCATAGATAAATATTTTAAATAGCTTAAAT 
AAT AT AT AC AAAAT AT TC T T T T AAG C GAT AAC AAT AG T AG AAGTG GAAGT AAGAAAA T T AC G T T AAC AAAC AAC AA T T A C T TAT AAT TAT AT AT G 
AAAGCTTCAGGGCGTTTATCCAGTAAGTTAAAAAATAGCTAAATGCTAACTTAACGGTAAAATATCAAATAGTGTGCTGAACACATCGCTTTTGC 
GAAGCGTCCTTTGGATCTTTACTAACACTAAAAAAATTGTAAACTCCCGGAGATTGCTAGTAGTGTGCACGGAATTTAATTACTGGGTTTATTTC 
CTCAAAAGATTTTGTAATTTTACTAGTTTCCTGAAAACTTTGCGATGCAAATATTTGCCACTTTCAAATCCCACGAGCCGGACAGCCGCTTGATC 
CAAAACTTCAAAAAGCACGTGAATTCGCAGTCGCCACTTGACGATTTGCGAAAAAATCTAAATGAAAAAAGTAAAATTCGAGAATATGTACAAGA 
GCTGATTGGAAATGAGCAACAAATGGGATCCGTGCTGGACACGATCTCTTCCACCGAAGAGGACCACCGCGTGGCCACTGTGACCGGATGGTCCA 
CGGTGAACATCGCCAACTGCGCCGGCGTTTCCAATTCTTCAATTTTTGTGGCACTGAAGTGGGTGATGATGCCCATTAAGGTGGAGATATTCAAC 
TTCAAGCGAAGTGGACCGGAAGATAATATGGCACGTGGACAATTTCTGTTGGACAACCTGTTAAGCGCAAATCGCCTAAAACGCTTGCAAGAGGA 
AGTAGCAGTTATTAACACGTGCCAGAAAATCGAGGTGCGTGTCACACCCGGTCTGCCCAAATCGATGATGTGTGCCCAGATAACAGACGAATTTA 
CAACTGCCATTCAGGAGGCCCTTAGATCACGCTATGAGGTGGATTCACGCACCTTAGATCTCTCCAGGTTCCACGCCAGTCCGGAGTTGTCTCTG 
CACTTTTGTCCGCTGCATATGGTCAAGCTGCTAGAAACAGTGCTAGTTCTTTCAAACCATTTATTTCCCCATGTGACCAGCCTAGTACTGAGCAA 
CAATTATTTGTGTTCCCTGAAAGCGTTTGCGGGAAATTCCCAAAGTTTTGCCAGCCTAGAGCGGTTGGACATAAGCGCAAACAGGATTCAAGATT 
TGGGAGAACTCAATTACATCAACAAACTGAGTTGCAAGACGATCTTCTTGGCAGGAAACGGTTTGGCCAAGCTAAGCGTGGACGTCATTCGAAAA 
ATGCTACCACAGTTGAAAAATGTTCATGGCTGTGTGCAGTTGGGGGAAAGTACAGAAGCTGTTGATAATATTCCGAAGTCTCAACAACTTCAAGG 
CGGTGGGACTAATGGTTTAAAGTTTTGCCAGGACTTCATAAGTTCATACTACACTTTCTTTGACGATACCGAAGAGCGTTTCAAACTTAAGAAGT 
ATTACGACGACCAAGCCATGTTCTCGCTTAGCGTGCCGGTACAACTAAATTATGTCTACGGCTACAAGCTGTACAACAGAAACCAGAAGCGCCAG 
CATTCCTCGTTTGCACAAAACGCCAAGCTGCAAGTGGGTAGAGCTGCGCTACTTCTCGCTTTAAGCCGCTTGCCTTTGATGCATACAGACTTAGA 
GAATGTCGGGCTAGACATCCAGGTGTTCACTTCGAGTCTACGCATCTTTACGTTGACGGGATACTTTAAGGAGATATCATCGGATACTCTGGAAC 
CACGACGCTTTCAGCGCACATTCGTATTGCAGACTTCCAATAGCCCCGGCTGGTTGATAACCAACGACATGCTGTGCATAACTTCGACTATGCCG 
GACCCGAAAAAAACCATTAAATTTAAACCAGAAACTAATATGATAAATACCGACCCGTCGGTAGAAAAAATAAATAAACCTGCAGTGACAGTTTT 
AG AC AG AAAGT C T AAAG AGG AT GT C C AAAT T T C TAG A C C C CT AT C AG G AAAAG AGC T G AAT C C T CT T AAT AAGGC AGTGC AG AAAATG AGCC T T T 
CCCAGGTGGACCATCTTCCCAGTGAGAACTTCAACGATATGCCACCATTAGTAGCCATTGGTCCATTGACTACCGGGCAAAATTGTTTGGACTAT 
GAAATGGAGGATATTTTAGTGGCCGATGAGAATAACTTTGACTTGGACATTGATGATGACATCCTGTTTGGATGCGATGACCTTTAAATATGAAC 
AAAAAAAATTAAACAATTCAAACAAATAGCTTTTAGAAACCTTGCACCTATCGGTTAAGACACCATACATTACTATTCATTACTTTAACGAGTGC 
AGAACTCAAGGATCTAATTTATTTTTAAACTTTTATATTTTACGAAAGCCTTCAAAAACAATTCGAAAACCTTTACTCGCCCAAAGCGTCGGAAT 
CTGGATTTCGTAAGTACAAATAAGCAATTGCAACTAATTTGTATATTAAAGTATCTTATTTTTTCAGAGCCAAAATGGAGCGGGAAATGACAGCG 
GTGTTGGAACCGGGTCGGTCGCGGGTGCCCCAGCAATGCTTCAGCTGGAAATTGGCGGTCTTGTCACGACGAATATGCACAAATCCTTCCTGCAA 
GACACCTCACCGCCGTCTTCCATCTGCTCCTCCATGAACACTGAACGGATGAACAACTCGCTAATCACCTCAGCAGACTTTACGAACTTGAACTT 
CCTTCAATCCACAATTGGTCTTGAGCAGGATCCAAACCTTACCACTACGCTTACCAACCAGACAACTGATAACATGGGAATCGGCGGCAGCATAG 
GCGGATATACGCTCAGTGACATGATCCGGGATCGCAAGGCGCTGGAGTCGCTGACCATGTGTGAGGACGATGGGACTCTTATCAAAGACTCGCAC 
ATCGGGCAAATTGACGAAATATCGTTGACACTGAGCAAAACTGCTTCCGGCTGCAGCACAATGGACAACAGCACAGACAGTATGTCCATTCCCCT 
GGCCGCAGATCGAACGATGCCCGTTGTGGTATCGGCAGTTTGTCCTGCCCTACAGCGTACCATGATTTTAGGGGAGGAGATGATCGGCGACACAA 
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CTTTTAACCTCGTGGATAGCCTTACAACTTCTGCGCTGCAATCAGAGTCTGAGTCTCTGCCAGTCGATGGAAATGCGACATTCAAAAGACCCACT 
GCGAGTGCTACTACAGCGGACGAAACCCAGGTGCTAACAGGTAGGCAGATGAATACTACCTTCACCGACGGTTGCAACACTCCTGAGGGCCGCTG 
TGAAACGCCTGAAAATATTGACAGGAAGCTGGCCTTGCTCACCATGGAATCCTCCACACCACTTACAACAAACATGCGCTCACACTGCTACCACA 
ATAACAACAATAATAACAAGGCAGGATATACCCCTACCCTAAAGGGAAGGGGAGATATGAACCTGTCCCCAATTGTTGGCGCAACACCGCAAAAA 
CCAACTGGAACCGCACCAGGGCGATTAAACAACACCTTTGAGCCCGTGGCAAAAACAGCTCCGTTCAACGGCGAGAAGTTCGTGCTGGACACTAT 
GGAGCTCTTGGAGCAGATCGAACAGCCGCTGGATGGCACTTACAACCTGCAAATGTCTGAACAGCACAGGCAGATGCAATGCGTCATGGACTTGG 
CTGAGGCGGAAGTGGAGATGCTGGCCCAGCAGGGCGACGAAGAGCAATTCGAGAATATGCTTGCCGAGCTGGGCAAGGTGAATACCCTAAACGAG 
GAGCAACTGAAGATGCAAAAGTCGCTAGATACCATCAAACGGCGCTTCCATAGAGATGAGGCGGAGGCAGAGGAGCGTGAAGAGGAATTGCAGCA 
GTCGGCAGCCGAAAAGGTGGACACCCCAGTTCTAAATCAGAGCCACTCCAGCAGCAACAGCAGCGGCGGTGGTGGCGAACGGCTGCTCAGTCGCC 
GAAGTCGXCTCTACGACGATGTCAACCTTAGTGCAATGCACGGCAGTAATGGAAGTTCGACCAGCACCAACTCTGCCTCATTCATCGTCCAGCGT 
AGAGATATGCCACAAGTGGAGCAGCCTGCACCAGATCCTGATCCCGAACCAGCGGAAGAGCCTCCAAAGTCCCAGGTGACAGCAGAGATTGATCC 
GGCAAGCTATAAGCTGCCGGAGCGAAGAGAGCGGGATCGCGATCGCTTCAAGACCATTAAGATTTCGAAAGAAATGCGACTGCAGCCGGAAATTA 
TTGTGCCTTGTATAGACGATGAACCACAGCAACTCGAGGACGAGCAAAAGCAATTAGTAGAAGATGTTCGGTACGAGGAGGATCAGCGACAGGGT 
TCACCACCGGGTCGACTTTCTCGTCGAGATCAGACCACAGCAATTAGCAACAACGCAGATGCTAGTGCGAGTAACTATCTGACCTACAAAAAACC 
CAAGGAAAAGAGCCTTATCCAACGACGTCCGCAGCAGCAGGAAGCGCCTTTATTAGCCGAGCCAGCGCCTGCGTCGACTCTTCCCCAGCCGCGTT 
CCTTGTCCAGGCCTCGCTACATCAGTGGTCTTCAGAAGTTTACTACGGTGAGCAAGGCAACTTCAGCCGGAGCGGGACTAAACGCCACTCCAGCC 
GCTGCTATGCCGACTACTGTTACAGTGCCGGCCAACGGTAACGGGGAGCTGAAGAGCCCTATGGGTATTAAATCCAAATCTTTCCACAACTTGTC 
CTCCACAATGAGCGGCGTTGGCTCTGGAGCCCTGCCAAGACCTAGTTTGGGCGGTGGACTGCGGCGTCCCAGTGCGCAGCCGAGCAAGCTTATGA 
GCGGATTGCGTGGACCAACTCAAGCCCAAGAGGTTCGTTTCGAAACTATTTTTCTTATCGACCTGTTTAATTCTCATATTTTTGACTTTTCAGGA 
TGACTCCGCTGTATTTAAAGTACCGAAATTGGTGAGTGGGCTGCGTGCTCCTGGCCTTGCCCCGAAACGAGCAGGAGCTAATGGTTTAGCTCGAC 
CTTCGTCCGGATATTACAGTCTGAGTGTTAAGACGGCACCAGAGGCAGACACTCCTGAGGTTAGTTAATGCAAGATTTGTTAAGGGAACGCTCAA 
GATCTAGCTTTTGTGCTTAGTTAAAACTAAAATTGGAAACTGCCAATCTTGGCGCAAAATATTTCAGAGCCTCTCCTCAGCCTCTTCGCGTGGCA 
GCCTCTATGGCCACAAGGACTCCGGTAGAACCGTTAACGGCGGAAATGTTCCTCAACAGAATCCACAGCAAACCTTCGACATGCAGGCGCTTACC 
TCTAAGCTGACCCAGGTGACTACGGGAGCGACCGGCATACCAAAGCCCTCGGGGTTGAGACCACCATCGCAAATAAAGCGTAGCGGTCTGCCGCG 
ACCTTCCAGCATTGTTAGGCGCTGAGAGCTGATCACTTCCTCCGCCGCCGATTGTATGCATATGTCACACACCCGCCTGAACCGACTTTCCATGT 
TATCAGTAAAATAGTCTAAATCTATTCTTTTTGTACTTTGTGTTCAGTTCTATTTTTGTGTTTGTCATCCCAGCATAGATGCTCATCTTGCCTAG 
TGGTTTCAAGGCACTAGCAACTATGATTTTGAATTAATATTACGACAAATTGTGCATTTAGTTGAACGATACATAAATAGAGAACTTGGCGACTT 
GTAAAACAATTACGCACTCCCAGAACCGCTTACATTTATATATAGAACAAATACATCAACACGAACTTTGCCTTACGAACCATAATAATAAAACA 
AACGATTTAGGGGATGAGAAAGAGAAGATTGAAGCGAGAAATAATAACGATAATGAATTGATAATAACTGTATATTTTTGTAAAGCGATGCAAAA 
ATTCTACGAACGTTATTTTTTGTTGGAGCGGCAAAGCAAGCAAGCAAATCGAATAAATATTACTAAATAACGAATATACAATAAGATCCTGCTCA 
GTGGCCACTAAACCACAAAAAAAAAGGAACGTATAAAACCAAACTGAAATTGATAGTAACTGATAAGAGTATTGAGAAACGTGTTAAAAATCGAA 
CTTACAATATTATCTATTATTATCGCATATATATACAAATACAAAAAAGAAAAlAAAACGACAGCAATATACATTGACTCATCAGATACTTAGACC 
AGAAGGTTAATGACCTGTTTATATTTTGCTATTGCTAGTTCTGAAACGAAAATGATCATACTTGTAATTTCATGAGGTGCACAGTATATTACAAT 
TCCCTTTGTACCCCAATGAAAATTATAAACTTCGGCTCTGCTGATGTGCTATATACATACATATTATATAGATTTTTGTAAACCATCTGAAAAAT 
TAAGTTTTCAGTTATTTTCGGTTGCCAAATTGAAAAGTGAAACTAAACTTTTGTTAGTTACCATCGAAAAATGTCTTAATAAATCTATGCCGTAT 
TGGTCAAAAATAAAATTCCTAAAGATCTTCAATCTATTGAAGCCTTTGTCCATCACACATGCATATCCTAAGTTAAGTTTAGTGCGCATTGTTTA 
ACCCTAATGTAGAATTAGTTTAATCTGTAAGGGACAAGTATTGTTTCACCCTCATCAACTAACCAATAACCACATAACTAACCCAAACCAAGCCT 
CTATTTTGTATTCTAGCGCTAACCCATTAAGCTTTAATCCTATTCGGTGATGATTTGCATGGATCGAATTTAGTTAGCTACTAAGACTAGTATGG 
ATTATTATTTTTGCTAACGCTTTCATATGAGCCTTATA 
{SEQ ID NO: 1477) 

Exon: 1001. .1164 

Exon: 1296.. 1402 

Exon: 4419.. 4474 

Exon: 4533.. 6872 

Exon: 6934.. 7089 

Exon: 7193. .7778 

Start ATG: 1118 

Transcript No. : CT257 98 

CTTGTATTGGCCAAACAGACAAGCGAGCGAAGGCCTTTTTGTTGTCGTTTCTGGCAATGATTTAGGTCTTCATCAAGTGTTATTAAGCTAGGCAT 
AAAGTTTTACCCATCGCTGGACATGGATATCCTGGACATCGATGAGGCACTAAAGGATGATTCATATATATTTCTGGCGGATAAGACCCACGACG 
ATGTCTCCAACATATTGCAGCAGTGGAAGACGAACAATCAGCCGCCGTTGCAGCAATTGCACTTGCAGTACAACAACGAGGCCTTCAAAAACAAT 
TCGAAAACCTTTACTCGCCCAAAGCGTCGGAATCTGGATTTCAGCCAAAATGGAGCGGGAAATGACAGCGGTGTTGGAACCGGGTCGGTCGCGGG 
TGCCCCAGCAATGCTTCAGCTGGAAATTGGCGGTCTTGTCACGACGAATATGCACAAATCCTTCCTGCAAGACACCTCACCGCCGTCTTCCATCT 
GCTCCTCCATGAACACTGAACGGATGAACAACTCGCTAATCACCTCAGCAGACTTTACGAACTTGAACTTCCTTCAATCCACAATTGGTCTTGAG 
CAGGATCCAAACCTTACCACTACGCTTACCAACCAGACAACTGATAACATGGGAATCGGCGGCAGCATAGGCGGATATACGCTCAGTGACATGAT 
CCGGGATCGCAAGGCGCTGGAGTCGCTGACCATGTGTGAGGACGATGGGACTCTTATCAAAGACTCGCACATCGGGCAAATTGACGAAATATCGT 
TGACACTGAGCAAAACTGCTTCCGGCTGCAGCACAATGGACAACAGCACAGACAGTATGTCCATTCCCCTGGCCGCAGATCGAACGATGCCCGTT 
GTGGTATCGGCAGTTTGTCCTGCCCTACAGCGTACCATGATTTTAGGGGAGGAGATGATCGGCGACACAACTTTTAACCTCGTGGATAGCCTTAC 
AACTTCTGCGCTGCAATCAGAGTCTGAGTCTCTGCCAGTCGATGGAAATGCGACATTCAAAAGACCCACTGCGAGTGCTACTACAGCGGACGAAA 
CCCAGGTGCTAACAGGTAGGCAGATGAATACTACCTTCACCGACGGTTGCAACACTCCTGAGGGCCGCTGTGAAACGCCTGAAAATATTGACAGG 
AAGCTGGCCTTGCTCACCATGGAATCCTCCACACCACTTACAACAAACATGCGCTCACACTGCTACCACAATAACAACAATAATAACAAGGCAGG 
ATATACCCCTACCCTAAAGGGAAGGGGAGATATGAACCTGTCCCCAATTGTTGGCGCAACACCGCAAAAACCAACTGGAACCGCACCAGGGCGAT 
TAAACAACACCTTTGAGCCCGTGGCAAAAACAGCTCCGTTCAACGGCGAGAAGTTCGTGCTGGACACTATGGAGCTCTTGGAGCAGATCGAACAG 
CCGCTGGATGGCACTTACAACCTGCAAATGTCTGAACAGCACAGGCAGATGCAATGCGTCATGGACTTGGCTGAGGCGGAAGTGGAGATGCTGGC 
CCAGCAGGGCGACGAAGAGCAATTCGAGAATATGCTTGCCGAGCTGGGCAAGGTGAATACCCTAAACGAGGAGCAACTGAAGATGCAAAAGTCGC 
TAGATACCATCAAACGGCGCTTCCATAGAGATGAGGCGGAGGCAGAGGAGCGTGAAGAGGAATTGCAGCAGTCGGCAGCCGAAAAGGTGGACACC 
CCAGTTCTAAATCAGAGCCACTCCAGCAGCAACAGCAGCGGCGGTGGTGGCGAACGGCTGCTCAGTCGCCGAAGTCGTCTCTACGACGATGTCAA 
CCTTAGTGCAATGCACGGCAGTAATGGAAGTTCGACCAGCACCAACTCTGCCTCATTCATCGTCCAGCGTAGAGATATGCCACAAGTGGAGCAGC 
CTGCACCAGATCCTGATCCCGAACCAGCGGAAGAGCCTCCAAAGTCCCAGGTGACAGCAGAGATTGATCCGGCAAGCTATAAGCTGCCGGAGCGA 
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AGAGAGCGGGATCGCGATCGCTTCAAGACCATTAAGATTTCGAAAGAAATGCGACTGCAGCCGGAAATTATTGTGCCTTGTATAGACGATGAACC 
ACAGCAACTCGAGGACGAGCAAAAGCAATTAGTAGAAGATGTTCGGTACGAGGAGGATCAGCGACAGGGTTCACCACCGGGTCGACTTTCTCGTC 
GAGATCAGACCACAGCAATTAGCAACAACGCAGATGCTAGTGCGAGTAACTATCTGACCTACAAAAAACCCAAGGAAAAGAGCCTTATCCAACGA 
CGTCCGCAGCAGCAGGAAGCGCCTTTATTAGCCGAGCCAGCGCCTGCGTCGACTCTTCCCCAGCCGCGTTCCTTGTCCAGGCCTCGCTACATCAG 
TGGTCTTCAGAAGTTTACTACGGTGAGCAAGGCAACTTCAGCCGGAGCGGGACTAAACGCCACTCCAGCCGCTGCTATGCCGACTACTGTTACAG 
TGCCGGCCAACGGTAACGGGGAGCTGAAGAGCCCTATGGGTATTAAATCCAAATCTTTCCACAACTTGTCCTCCACAATGAGCGGCGTTGGCTCT 
GGAGCCCTGCCAAGACCTAGTTTGGGCGGTGGACTGCGGCGTCCCAGTGCGCAGCCGAGCAAGCTTATGAGCGGATTGCGTGGACCAACTCAAGC 
CCAAGAGGATGACTCCGCTGTATTTAAAGTACCGAAATTGGTGAGTGGGCTGCGTGCTCCTGGCCTTGCCCCGAAACGAGCAGGAGCTAATGGTT 
TAGCTCGACCTTCGTCCGGATATTACAGTCTGAGTGTTAAGACGGCACCAGAGGCAGACACTCCTGAGAGCCTCTCCTCAGCCTCTTCGCGTGGC 
AGCCTCTATGGCCACAAGGACTCCGGTAGAACCGTTAACGGCGGAAATGTTCCTCAACAGAATCCACAGCAAACCTTCGACATGCAGGCGCTTAC 
CTCTAAGCTGACCCAGGTGACTACGGGAGCGACCGGCATACCAAAGCCCTCGGGGTTGAGACCACCATCGCAAATAAAGCGTAGCGGTCTGCCGC 
GACCTTCCAGCATTGTTAGGCGCTGAGAGCTGATCACTTCCTCCGCCGCCGATTGTATGCATATGTCACACACCCGCCTGAACCGACTTTCCATG 
TTATCAGTAAAATAGTCTAAATCTATTCTTTTTGTACTTTGTGTTCAGTTCTATTTTTGTGTTTGTCATCCCAGCATAGATGCTCATCTTGCCTA 
GTGGTTTCAAGGCACTAGCAACTATGATTTTGAATTAATATTACGACAAATTGTGCATTTAGTTGAACGATACATAAATAGAGAACTTGGCGACT 
TGTAAAACAATTACGCACTCCCAGAACCGCTTACATTTATATATAGAACAAATACATCAACACGAACTTTGCCTTACGAACCAT 
{SEQ ID NO: 1478) 

Start ATG: 118 

MDILDIDEALKDDSYIFLADKTHDDVSNILQQWKTNNQPPLQQLHLQYNNEAFKNNSKTFTRPKRRNLDFSQNGAGNDSGVGTGSVAGAPAMLQL 
EIGGLVTTNMHKSFLQDTSPPSSICSSMNTERMNNSLITSADFTNLNFLQSTIGLEQDPNLTTTLTNQTTDNMGIGGSIGGYTLSDMIRDRKALE 
SLTMCEDDGTLIKDSHIGQIDEISLTLSKTASGCSTMDNSTDSMSIPLAADRTMPVVVSAVCPALQRTMILGEEMIGDTTFNLVDSLTTSALQSE 
SESLPVDGNATFKRPTASATTADETQVLTGRQMNTTFTDGCNTPEGRCETPENIDRKLALLTMESSTPLTTNMRSHCYHKNNNNNKAGYTPTLKG 
RGDMNLSPIVGATPQKPTGTAPGRLNNTFEPVAKTAPFNGEKFVLDTMELLEQIEQPLDGTYNLQMSEQHRQMQCVMDLAEAEVEMLAQQGDEEQ 
FENMLAELGKVNTLNEEQLKMQKSLDTIKRRFHRDEAEAEEREEELQQSAAEKVDTPVLNQSHSSSNSSGGGGERLLSRRSRLYDDVNLSAMHGS 
NGSSTSTNSASFIVQRRDMPQVEQPAPDPDPEPAEEPPKSQVTAEIDPASYKLPERRERDRDRFKTIKISKEMR1QPEIIVPCIDDEPQQLEDEQ 
KQLVEDVRYEEDQRQGSPPGRLSRRDQTTAISNNADASASNYLTYKKPKEKSLIQRRPQQQEAPLLAEPAPASTLPQPRSLSRPRYISGLQKFTT 
VSKATSAGAGLNATPAAAMPTTVTVPANGNGELKSPMGIKSKSFHNLSSTMSGVGSGALPRPSLGGGLRRPSAQPSKLMSGLRGPTQAQEDDSAV 
FKVPKLVSGLRAPGLAPKRAGANGLARPSSGYYSLSVKTAPEADTPESLSSASSRGSLYGHKDSGRTVNGGNVPQQNPQQTFDMQALTSKLTQVT 
TGATGIPKPSGLRPPSQIKRSGLPRPSSIVRR* 
(SEQ ID NO: 1479) 



Celera Sequence No. : 142000013384010 

TTTTATAAGTTCAGCAATGAAAATAGACTAAAAGTGTCTCATTCACGAGACCCTAATGGAAGGCCTAATTGATTTACATATGTCAGGGGAGCACC 
CTCACATAAAAGAAATGCGTGGGCAGACACATAAAACCAAAACCCACTCACAAATGAGTCGAGTGGTGAATAATGTCCCATATTCAATTGAGTGC 
GGAAGAGGCCCTTATAAATCTGTGTGTGAAATATCTTTTATAATAAAAAACAATTTAACTAATAATACCAATTCTAGTTCGTAAGGCTATAAATA 
TTGAACTGAACAGACCAGAGGCTATAACCTATCGACTTCGATAAGATAAATATTTAAAAAGCTGAGACGGAAAAAGTGTTTGCCAAACTCCTTGA 
AATTCACTTGAGCGCCACGAGGCTCAGTGGGAAATTTGATTTTGGGAGATGGGATGCTTCCCGTGCATGTGATCATGTCTCGAGGGGCTGACACC 
CGTTTTGACTTTTGTCACCGCCAGTGGCGCCAAAACTAATCAAAGCAACTAGGACCGACGCTAAATCGAAACCACAACTACAATCACTGCGGAAT 
ACCAAAATGCAATCAAAATATGCAAGCCACGTCGATCAGGAGAGAGAGAGAGAGTGAGAGAGAGACCGGGGCTATTCTAGAAACCTAACGCGAGT 
TCAGAACGCTGAATTATGTATACCCTTTAAAGTACTAACAGAGCAGAAAACTTAACTGTTTTATGTGGTCGTTCATTAGTAGTAAAAATGGGATG 
ATTTTATTTTATTCTGCTTTACTGTTATTTCTGTTTTCTGTTTTTATATACTGGCCTCTTGAATATATAACATTCTGATAACATTACAAACGCTC 
CACTAACATGTCTATATGTCTTGGAAGGGAATTCAAACAACCAATCTCTCTCTTTTTCTTTTTCTCTCTCTCTCTTTTTCTGTCGCCCCGATCAA 
GTCTTAACGGCAAGTTGCAGCAACAAACATTTTCATTCGTTTTCGCATCGTCGAAGCGTACGGTTCATAGGAGGAACGGAACGAAGCGGGGAACG 
CGACGGAAACTAGTTGCTGTTTTTTTGTCCGTGTTAAATAATTGACACAAGAAAATTTAGCTACACTTAAGCAAAGTCCGCGAAAAATCTATTAA 
AAATCGTTCGTCGTTTTGTGTGTGTGACCACGAAAAAAGTGCCCCGATCGGAGGATTTTAATAAATTCAATTAAGGTTCCGTCCCAACGATCGTT 
TTTCATTGTCTGACGCTCACGCGTGATGTACAAATGAAAAAGTAAAAATTTAAATAAGATCAAAGAAAGAAAGATCCCAGTAAAATTTAAAGTGG 
ATTGACTGCACAAGAAAAAGAAAAGTTCCTTACCTCCTAGCCAGAAGTCAAAAGTGAAGCGGAAAACAGAGTGGAGATAACAATTAACGGTTAGT 
TGTAAAGCTAAAACTACACAATAAACATATCATGAAAAACTTTATAAAACATAAGAAGGGGGGCATTTTATTATTTTGGGTATCAGCATTTACAT 
CACCTTGTTTCGAATCAAACTGATTTTAACATGCATTTGGACCAAACTAACAACGTTTCGAATGTTATCTCTTATGGAAATGGTATTGCTATATT 
ATACCATCATTAATTCCGATTCCACCTGAGATTAGCAATTAATATGAAATTTCATAAACATATATTATATATTCGGATCATGCATTCTATTTACA 
CATTACTTTCCAATTAAACGTTCATTTATTGCCGCGCTTCGATTAGCAACAAAAACAAATCGTTAATTTTCTATATTTTTCCACAGGGGCTTCCA 
TACCAAAGTTCCTTGTTTGTAGAGATCATTGTAATCAGCGCAGATCGAAGCCATATATAAAATGATTTCGCACAGTCCGCCCATATTCAGTCACA 
GTCCACCCGTTAGTTTTCTGGCGGACTTCATGAGCGGACATCCCCGTCAATCGTTGAAGTAAGTTTACAAAAGTATATAACTCATAAATAAATAT 
AGCCAGCAATTGGACTAATTTAACACAGAATTGCTCTAAGTGAATCCACGAGTTTCGGCAGCCAATCAAAACAAATTAAATAGCCAAACATCATT 
GAAAGAGTACCAATTCTCAGGTTTTTTCATGTCGAAAGAAAATATTTTTTTAACTTTCTTATCGATTGATAAGGAACTAGAAGCATTTTCTGATG 
CATCATCTTTTATGAATTTAACAAGTGGTGTACGTGTTAATAAGAGAAACGATATAAATGTCTGATAACCTGAAAAACCATGTGATAGTAAATGA 
ATTAAACTGAGAGATAAAAAATACATAACAGATA^feAAATGATTAATACC 

TAAACATTCAAATTTAAAACAAATTTCAAAATACCCAAACATAACCCACATTAACAAAATAAACAGACCGGAATGCCGTAATTCCATTTAATGAA 
TATAAGAACTCAGTTGTTTACACAATCCAGAAAAGGAATATTATCGATCTCTCGAGGGCTGCCCATATTGATTCACATGAAACACCTGTTTCAAG 
TGTCGTCTATGGAAATTTTAACACCTGATCAAAGGCAGGAAACATGTCTTACCCAATACATAGTATTAAAACCTGAGAACTATTCGAAAACACAA 
TGAATGCAGGTACTAAAAATATTCGTGCTAAGGCTCGTTTTTGGTTGACAACAACAGATTGGCAAAATATGAAAAGCAAGCATTCAAGATCAAGG 
CAAGACAGAATACTGAACCAAAACAAACATTTCGCCGATTCTGAGGTGATCTTTCGCTGGTGGAGCCAACTGAGATTTGAAGATTCGTTCACGAG 
TCGTGGGGGTGACGCCAAAAATTTGCTTGGCGACAATTTGCGCGTGGCGCTTGACGCAACTTTTTTTCTCAGGGGCACACGAGCAAGCCGAAAAT 
TTGACAAATTGGTAAAGGGAAGAAGGGTAATATCTATATATTATATTTATACAGAGGCGAGAGCAATTGATTGAATTAAAGCAATTTGCCGTGCT 
TCATTTGCTGCACGCGCTGCATTTTCATCATATTTGTTGAGAGTATTTTATTTTTGGGAATGATAAATATACAATACCGAAAATTCGTATAACTT 
ATATTCGTATATATGAGGACTTCAGCTAAAAATAAATCTATATCTAGAGTATGGCCAAGCAAAACAAACGCTACAAGAACCGGTAAAAAACTAAA 
TAATCAGTTAAGATTAAAGAGGTTCAGTAGATAAAACTGTGGGAGGTAATACATGCGAAAGCCCTACTGCGATCCCTACAACTATGAACATTTCT 
TATCGGGGCACGTGAGCGGCATGCCTTCAAACAAACCGAGTACTCCAACCAACTCAAGTAACTCAGCTGCTAATGCGTCAACTCACTTGAAATTT 
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CACAGCGTGATTTTACAACAGTTTACCCCACCCAATACCTAGAACTACCCACGTAACTGGGCGGGTCGTCATCGAAAATGATTGAATTCTTTTGG 
TTGCCAACTCAAGCACTAAATATGCTGATATTGTCTCCAGACAGCGCCAATTAGCCAAAAAAAGTTGATGATAACAGCAGAGCAGACGGAAATGT 
TCCCATAGCCAGGCCCACGCCGAAAAAGACGTATCAGCATGCCGTCAACATATGTTTAGCATGCAAAAGTCCACGCGAACCCCATTGAACGATCC 
TTATCAACCATTGACTTTTAGTTTTATGAATGTCTAGCCGAAAAATTATGATAACTACAAATGGGATGTGAACTGTGCCAAATGAGTAAGACCAG 
CGAGTCTCTCCGACTATACAATTTTAGAAATGACTCACTGAAGAACCCAAACAAATATGCTTCATATTCATACTATACATTTCCCACCCACGGCA 
AACCCGAATTAAGTGAGGAGGTCGCGCGCAACTCCTCTTATCAGCGATGTTTCTTTATAGACAATATACACGTATCAGTAGGACTACACAAGTAC 
AAGTACAAATACAAGTAGTACATGGGCCTCCGAACAATGAAGGCAGGCATTGAGGCAGTAAAAATTAAAGCCACCACTCTTATCAGGACTGATAA 
AAGGCAGCTTAATGCGTAGATGCTTGAAATAAAAGACTGATCTTGGGAATTTCTTTTCTGCTTTCCAAACGAAAAAAGAATCACAAAAAATGGTC 
AAAACAATTGAAAGATGGCTACCATCTGCTGATGTTTAAATATAGAAATTATATTTTCAATTAGTGGTGTTAGAGATATATATTTTAACTGATTG 
TGATATATTCAACCTTCGATATTCAACCTTTTGATTTTTTGAACGAATTCAACACTCATTTAAAAGGACAGAATAGTAACGAAATTAAGTTATGA 
AATAAATATTAAATAAATTTATTTTAAGAAACATTTTTATAGTTATTAATTTAAGTTTTGAAATACAAAAATAAACCAATTAAACTTGCATACCT 
TGTTTCAGCTACAATGACGAGCCCAACAGAGCGTCGCGCTACTGCCGGCCCCAAGTGATAACGCTGGCCTCCAAAGCCATGGTCAACAACGCCTC 
GGCGGCGGTCCAGGGTCAGGGTCCCAGTCAGCAGAAGGCGGGTGCCACGCCCCTGGGCGTGGGCAGTCGTTCCGGACGGAGTTACCTGGAGCGAA 
TGGCCTCGGCGCCGCTACTCACCAATCAGCCCAAATCCTGTCTCAGCCGCAAATCCTGCCTGCACAAATCCCAGAACACTACAGCCAGTTCCGCA 
GATCTCTCGCCGACGGACACCGATGTCGCGGACAGTGCCAGTCATAGCTGTATCTGCAGCAAGGATAGTGACAGCAACGGCGAAGTGAGGTGAGT 
GTGCGGGCGATAAGATTACGTAATGTAGGTAACTCGATACAGCGGCGCCGAAACGATCAAGAAGGCTGTAAATGCCGGCGGACTCAGTCACTTTG 
ACGTCAGGCCTTCAAAGATTATGGCGCCGCACTTATTAAACCACCAACTGAACTAGATGGGAACTTAGTGGTGTGAGAGCCTGGGTAATTGTGGT 
CAGAAT T GGGTGTATCAACCT ATAAC AGAT T C ATTAAAT AGT AAGAT GTCAATTGAGGGGAAATCT TAAGGAAAGT CCGAGAAAGTAAC AC AAGA 
AGCTGTAGATGGTTTTCATAGCGCATCTTAACTATGTAGACTGTGGTCAAAAAGAAATCGAAAGCTTATGATTCAAACTTAAATCTTTGAATTTT 
AATATGGGGAGGATAGGATTTTGGTTCTAAAAATTGGAACGTGAAAAGATCATTTCTTATAAAACTGCGAAAATACCACTTCCGCCATAATATTT 
TATAAACAAAGATTAATCGTTTATAAGACATTTGCAATAGTTCTAGTAAAAGGAATAAGAAAATCTGTTGGCAGTCGAACAACTTTCCCACAGAC 
GTCATAGCCGAGCACATGTCATTTATTAAATAAGCTGAGAAAGCCAATTGCAGACGGCCGCCAGGCTAAACAAATGCAAAAAGATCATGAAATAT 
ACTT GAAAAT AT AT TTTAT AT ATTT TTAAGAGC ATGCAATTTCAC AAAAACAAAATAT AAAT TATCTAAATGACT GT AGAAAT AAGCCAAAATAA 
AAACACGCCACTGCATCGCGAGTCGAGAATCTGTATCTCGTATATCTGAGATAAAAAATCTCACATTTTAAGACTCAAATGGGAATAGACTGGTA 
TTTAGTATAGACCCCAACAGCGAGGCGCCATTATTTTCGTTTTCCATTTCCATTTATTCGCTATATATAGGGAAGATACATTATCGATTCATGTG 
CAGGCCATGAAAATATTTGGGTTATCGATGTCTCGCAGATATTTTGTAAAATTGTATACATATGTATATTTCGCTTTCAACGAGCCCTAGAAATA 
ATTTCCTTTTTGCACTTGAATCTGACATTTTGAGAAATGCCTAAAAATACATATGCCACGAGATAGAGAAAATAATTTATACGTTCATTTGCGTC 
AAATGCGAACACTATTTCTATTTGGGCTGCGTCTCTAGACAAGAGATCAGGCTTAAAGCTACGCAGCTACTCAAACGAATGCAATTAACCATAAT 
ATTATCAATGTAAACCAGACGTGTGCTTGTTGCTGTTGGAGATTTTGTGTACACTTTGCGGTTGCGTATTTTGTATGCCTCGTTTTATCAGTTAC 
GAAATCCACTGGGAAAAAATTAATTTACCAATTCAAATCTTCAGAGTAAATTATAATTCATTTAAGTTAACACAAGAACGTAAAGCAATGCAATA 
GAAAAATTATTAAGAATTATATTTATTGGAGGCTTGATTTAAGAATTGAAGACCTTATTCATCACCTTCCAAAGCATATCGCATCTGCTCGTCAC 
CCATTTGGCCCCACATTTTACAATGAGTTTTGACAGCCAATGAAAGATCTTTCTTTTGTCTTAATTAGCCTCCCCAAAAATCATCAAGATCAACC 
ACAAAACGAGGCAGACGGTTCACGTCAAGGCCGAAGAATTATTAGCATAGAAGCGTGATCAAGCGAAAGGCGTCATCTCATAAAAGACACAATTT 
TTTCGGGTGTAATGTAACTAAAAATATTTCAACATTACAAGCCTCGGAATACGAATGCACACAAACGTAAGGTTATATATAGAGATGGGGAGATA 
TGTCTGATGTTCATGCAAAGATCACGTGAATCGATCGGAAAAGTGAGTCTGGGGCACATGGGTCTGCTGTAAAAATGTTGGCGCACCTTTAGCTG 
AAAAATGTGAAAGAATTTCGTACATTTACAACAATTTTATTTCATTCGTAATTAAATTTATATTTATTGGTAGCGATCGAGATAATTAAGAAAAA 
CCAAATCCACGACAGAATTTTCCGAGTTAAAACTATTTTCAGTCTCTTACTATTGGACTTTGCTTAAAATTTCAGAAAAATATAAACGATAGGTA 
TATTTGGCGAAACTGAAATTCTTATGGGGGAAATAAAATAGAAAATACAGAAAATTTGGCGACATACAATACGGAACTACCTTGATTCGTTTACA 
AAAAAGTAGTTCGATTGTAAGGTCAACCAGAATGACCATCAAATCAGTCAGTTGTAAAAATTTCAACAGGTCTTGCCAGAATATTTTTATCGGAC 
AGGTTATTGACTTTAGAAAATGAGGGAACTCGAACAAGGTCTGCAAATTGAAAACTGAGGAAATTAAAAAGTATTTTTAAACACAATTTTCATTT 
ACATTACAAGCGTTTCGATTTTAAAAAGCCCTTTTGGTGCAGCCAAAACTTGTTTCTCCTGCACCCATCAACGATCTTAATGCGGCTTGCATATT 
GAGGAGTTTACAAATTTATGGACTTTAAACCGGACCCACGAAAATAAATGCGTTCCGTAACGCATACGGAATTGTGACTGCCTTTGGGTTTTAAG 
AAAAAACAGTTGATAGTCGTATCTCGTTTCCCTTGGAGAAAATGTAAGCCAAGTCGGGGAAAATCAATGAAAATGTAAATGATTTTTTTTTTTTA 
ATTTGATTTTTCGGGTGATCTCTTTCAAGTTGACAAGTTTTAGGAAAGCGGTCGGTAAAAAATGAGACCCGTCCGCGGTCTCCGGTTTCTGGTTG 
GCGGCCTGCTGGGCGTTGAGACCCACCCCACACACACCAGATAAGCGATCGCTAAGAAACCTGTTGAAGAAAATGCAGCAACAGCAGGCGCACAC 
CGATCATGCCCTCGATGCGTGATGTCGTCGATGTCGGTAGTCGGTAGTCGTTACGATCCGATTCTGGCCGAGTCGGGTGGTACGATCTGCCATCT 
CAGATACACCATCACCATCACCGATCTCTGGCAATACGGCTGGGGTACAATCAACAGAGAGCCAGTCTGTCGATAGCCATTTCAATTGTTGGCTA 
TTTGTACTTCCCATATATATTTAGCACTCGATGCGTTCCCGCCGACTCTCCTCCAATTTGCAGACCAAATAGAGTCTGCTCTGTGTGGGGCCATT 
AGTCGTAGGTCTGCTCCAACTGACGGAAACTTTTCGCTTTGGGTTTGGCAACAGCAAAGAACCATACAAAAATCGAGCCCCTACCGAAGCTGATT 
AGCCAAAATTAGTCGCTATATCTTATCGAATTTATGGCCAACTTTCTAAATAGAACTTGTATGCGTATTTTTTTCAGTAATATGAAAGTCAAAAG 
GGCGGAACATTTGTTCGAAGAGTTTGGAAAGTAATTAACTGCAGCAAACATTTGTATTGAACACAATATATTATTATATTGTATTATTCGTATCT 
GCATTTAAATTAGGTAAAATTTTAAAAGTCCAATGAATTTCAGATTCCAACTAAATTTCGTTGAATAACTTATATATTTTGAATCACGAGTTATC 
CTCTATATTTTATATATCTTAGTGGGTGTGTTTTTATTTACATTTAAGCAAAAGAACCGTGGGAAGAGTCGCCCAATAACATGGGTGCAAAAAGA 
AAGGAAAAGCAAAAGTAGGCGTGATTTACATATAGATTAAGGCAAATCTTTGGCAAAGCCGTACGTGACTTCTAAAAATAAAACCAACAAAAAAA 
TATACAATACACATTACTCTTTTTATGACGCAGGGAATTTTTCGGGAAACGCAAAGGAATAAAMTGATCAGCAGAATAGAAAAAAAAAAACACT 
AAAGGAAAAGTTTCACTCGACCGGGATTTTATATTACCAATTGTCAGACAATCGACAACTTTGGCACATATAGTAGGAGCTTTCAAAAATGGTTT 
TAAAAACTGCATTTGGCAAAGACGCAGTTTGCTTAACTAAAGTAAA 

AGATGCTGAGTCAATGACCACAAAGAAATAAAATAAATAATTCTGAATGGGGGGTCAAGTCAAACTGATAAAGCCAGGGTCAAATCTATATTTGA 
TAATGTATATTTTAATAATTAATAATTTGCTTATAATAGATTGCGATGTACTCAATGGATGGTATTTCTTTTTTTACTTTTTGCAGCCGTGGCCC 
CAAAAAGCATGTGATCTTCGCTGATGATGAGGGTCTATCGCTAACCGAGGTTCGTGTGATGTCCGAACCCTCGAATGTGCCGCCATATTGGAGCA 
TGAAGTTTCTGGAGCAAATTACACAGGGCCTGGTCAGCCCACATCCTCCGGATCAGTGGACAGTGGACTTCAAGCAGCCAGCCTCAGACTATCTA 
TCATTTAGGTTGGTACCCTATTTACACAACTGTAAAATATATACGAGTATTCTAAATATTTGTATTTACACAGACAAAAGATAGAACGCGACTTT 
GTTTCCTTGGAGAATGTGATCGTCAAGGATGAGGAATCGATTGTCGTGGGCACCATCAAGGTGAAGAATATCGACTTCCAGAAGGAGATCATTGT 
CCGTGTGACCTGGGATGATTGGAAGAGCCAGCAGGATATCTTCTGTACATACGCCAGGGCCTATGGACCGGCTACTTGTGCTCATGTAGTCTTCG 
ATACATTCTCCTTCAAAATCACCTTGCCGCCATCCTCCAAGCGCCTGGAATTCTGCATCTGCTATCGCACCAACGAAACAGAATATTGGGATAAT 
AATGATGTGAGTTTGGCATCAAAATAACTTAAAAAATATTTTACATAATTTGTATTTGCTTACAGGGTAAAAACTACACCATTAGCAAGCGATCG 
CCCTTTTATTACAACGCCTTGTCGCCCTATGATAAGGGTCAGAATCGCAACTCGAGCCAGCAAATAAGGTCCACCCTGACCGATGCCTTGGCCAA 
GGTGCAAGACCAAAATGGCTGGCACCAGGAGCCACATACGCCATATTGGTGAAATCGCCTGGGAACGGGATTGGATTTTCCTGACCCAAGCGATT 
GAATAAAGTTTCTCATTTAGGCCACATGAACAGAAAGAAACAGAATTGGAAAAGATTTAAAGTGTTATAATCAGTTTCATTGACAGAGGGAGAGT 
CCAAAAGATTTTACCACTAACGAAACTTTTTGACCTAAGCTAATACCCGGTACAAATATCCCAATTTTAATGTGCCATTGACTTGAGGTTTACTC 
AATATAAAATGAGATAGGAGGGCTTATAACTACTTCTACTGAGCGAAGCCCGAATTTGTATGTCTTGTCTTTAGTCAACTTGATGGAATTTCAAA 
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ATCTATGTATAACTTTTTAAAATGTTTTATGTTTGACTGTAAAGAATTTTTGGAAAAGATAGATTTATATGTTTACCTGTATGCCTGTGTGATCG 
TCTACATCTATATAACTGATACTAGATTTATGTTAACTAAAAACGAAAAAAAAAACCTTATAAAAATGAAAGGAGATGAATTTTGAGGCGTGAAG 
CAT T CAAGCAAACC AT T GTTTCGAAATTT ACTGC AACTGC AAAATAAAGAAAACAAAAACC AACAAAAC AC ACATT AT G AAAGAACAC AAGAAAA 
GCAACAAAATTGTAGCCCTAAGTTTATTCGTTTTATTACTTTTTATTTCTATTCTTTTTATTTCATTATACACTTTTATCTTTTTGGTTACTTGA 
AACGCTAAGACTGAAATGGCTTGTTATGGAGAAGAAAACACGTCCTGCAATACATTATATAATAAAAAGAACCCAGCAATTTTCGCATCACCACA 
TTACAGACAACAAAGTACTTTGTCCTCATCGTTCTGTGTGATCCAGATCCCCGACTATATATGATATATACACCTAGCTAGATTTAAGCGATTTG 
TACATACAGAAGCAAAAGCAAAACAATATACCAGAAAATTGCCTGTTAAATACATTTATATACACATATGAATTTAGTTATATCACGATGACTAG 
TTTTTAG 

(SEQ ID NO; 1480) 

Exon: 1001.. 1419 

Exon: 1797.. 1958 

Exon: 4474.. 4839 

Exon: 9017. .9223 

Exon: 9289.. 9601 

Exon: 9661.. 9837 

Start ATG: 1867 

Transcript No. : CT26380 

TCGAAGCGTACGGTTCATAGGAGGAACGGAACGAAGCGGGGAACGCGACGGAAACTAGTTGCTGTTTTTTTGTCCGTGTTAAATAATTGACACAA 

GAAAATTTAGCTACACTTAAGCAAAGTCCGCGAAAAATCTATTAAAAATCGTTCGTCGTTTTGTGTGTGTGACCACGAAAAAAGTGCCCCGATCG 

GAGGATTTTAATAAATTCAATTAAGGTTCCGTCCCAACGATCGTTTTTCATTGTCTGACGCTCACGCGTGATGTACAAATGAAAAAGTAAAAATT 

TAAATAAGATCAAAGAAAGAAAGATCCCAGTAAAATTTAAAGTGGATTGACTGCACAAGAAAAAGAAAAGTTCCTTACCTCCTAGCCAGAAGTCA 

AAAGTGAAGCGGAAAACAGAGTGGAGATAACAATTAACGGGGCTTCCATACCAAAGTTCCTTGTTTGTAGAGATCATTGTAATCAGCGCAGATCG 

AAGCCATATATAAAATGATTTCGCACAGTCCGCCCATATTCAGTCACAGTCCACCCGTTAGTTTTCTGGCGGACTTCATGAGCGGACATCCCCGT 

CAATCGTTGAACTACAATGACGAGCCCAACAGAGCGTCGCGCTACTGCCGGCCCCAAGTGATAACGCTGGCCTCCAAAGCCATGGTCAACAACGC 

CTCGGCGGCGGTCCAGGGTCAGGGTCCCAGTCAGCAGAAGGCGGGTGCCACGCCCCTGGGCGTGGGCAGTCGTTCCGGACGGAGTTACCTGGAGC 

GAATGGCCTCGGCGCCGCTACTCACCAATCAGCCCAAATCCTGTCTCAGCCGCAAATCCTGCCTGCACAAATCCCAGAACACTACAGCCAGTTCC 

GCAGATCTCTCGCCGACGGACACCGATGTCGCGGACAGTGCCAGTCATAGCTGTATCTGCAGCAAGGATAGTGACAGCAACGGCGAAGTGAGCCG 

TGGCCCCAAAAAGCATGTGATCTTCGCTGATGATGAGGGTCTATCGCTAACCGAGGTTCGTGTGATGTCCGAACCCTCGAATGTGCCGCCATATT 

GGAGCATGAAGTTTCTGGAGCAAATTACACAGGGCCTGGTCAGCCCACATCCTCCGGATCAGTGGACAGTGGACTTCAAGCAGCCAGCCTCAGAC 

TATCTATCATTTAGACAAAAGATAGAACGCGACTTTGTTTCCTTGGAGAATGTGATCGTCAAGGATGAGGAATCGATTGTCGTGGGCACCATCAA 

GGTGAAGAATATCGACTTCCAGAAGGAGATCATTGTCCGTGTGACCTGGGATGATTGGAAGAGCCAGCAGGATATCTTCTGTACATACGCCAGGG 

CCTATGGACCGGCTACTTGTGCTCATGTAGTCTTCGATACATTCTCCTTCAAAATCACCTTGCCGCCATCCTCCAAGCGCCTGGAATTCTGCATC 

TGCTATCGCACCAACGAAACAGAATATTGGGATAATAATGATGGTAAAAACTACACCATTAGCAAGCGATCGCCCTTTTATTACAACGCCTTGTC 

GCCCTATGATAAGGGTCAGAATCGCAACTCGAGCCAGCAAATAAGGTCCACCCTGACCGATGCCTTGGCCAAGGTGCAAGACCAAAATGGCTGGC 

ACCAGGAGCCACATACGCCATATTGGTGA 

(SEQ ID NO: 1481) 

Start ATG: 4 90 

MISHSPPIFSHSPPVSFLADFMSGHPRQSLNYNDEPNRASRYCRPQVITLASKAMVNNASAAVQGQGPSQQKAGATPLGVGSRSGRSYLERMASA 
PLLTNQPKSCLSRKSCLHKSQNTTASSADLSPTDTDVADSASHSCICSKDSDSNGEVSRGPKKHVIFADDEGLSLTEVRVMSEPSNVPPYWSMKF 
LEQITQGLVSPHPPDQWTVDFKQPASDYLSFRQKIERDFVSLENVIVKDEESIVVGTIKVKNIDFQKEIIVRVTWDDWKSQQDIFCTYARAYGPA 
TCAHVVFDTFSFKITLPPSSKRLEFCICYRTNETEYWDNNDGKNYTISKRSPFYYNALSPYDKGQNRNSSQQIRSTLTDALAKVQDQNGWHQEPH 
TPYW* 

{SEQ ID NO: 1482) 

Name: putative protein phosphatase 
Classification : protein_phosphatase 



Celera Sequence No. : 142000013384010 

TTTTATAAGTTCAGCAATGAAAATAGACTAAAAGTGTCTCATTCACGAGACCCTAATGGAAGGCCTAATTGATTTACATATGTCAGGGGAGCACC 
CTCACATAAAAGAAATGCGTGGGCAGACACATAAAACCAAAACCCACTCACAAATGAGTCGAGTGGTGAATAATGTCCCATATTCAATTGAGTGC 
GGAAGAGGCCCTTATAAATCTGTGTGTGAAATATCTTTTATAATAAAAAACAATTTAACTAATAATACCAATTCTAGTTCGTAAGGCTATAAATA 
TTGAACTGAACAGACCAGAGGCTATAACCTATCGACTTCGATAAGATAAATATTTAAAAAGCTGAGACGGAAAAAGTGTTTGCCAAACTCCTTGA 
AATTCACTTGAGCGCCACGAGGCTCAGTGGGAAATTTGATTTTGGGAGATGGGATGCTTCCCGTGCATGTGATCATGTCTCGAGGGGCTGACACC 
CGTTTTGACTTTTGTCACCGCCAGTGGCGCCAAAACTAATCAAAGCAACTAGGACCGACGCTAAATCGAAACCACAACTACAATCACTGCGGAAT 
ACCAAAATGCAATCAAAATATGCAAGCCACGTCGATCAGGAGAGAGAGAGAGAGTGAGAGAGAGACCGGGGCTATTCTAGAAACCTAACGCGAGT 
T C AGAAC GCT GAAT T AT GTATACCCTTT AAAGT ACTAAC AG AGC AG AAAAC T T AAC TG T T TT AT GT G GT C GT TCAT T AGT AGT AAAAAT GGG ATG 
ATTTTATTTTATTCTGCTTTACTGTTATTTCTGTTTTCTGTTTTTATATACTGGCCTCTTGAATATATAACATTCTGATAACATTACAAACGCTC 
CACTAACATGTCTATATGTCTTGGAAGGGAATTCAAACAACCAATCTCTCTCTTTTTCTTTTTCTCTCTCTCTCTTTTTCTGTCGCCCCGATCAA 
GTCTTAACGGCAAGTTGCAGCAACAAACATTTTCATTCGTTTTCGCATCGTCGAAGCGTACGGTTCATAGGAGGAACGGAACGAAGCGGGGAACG 
CGACGGAAACTAGTTGCTGTTTTTTTGTCCGTGTTAAATAATTGACACAAGAAAATTTAGCTACACTTAAGCAAAGTCCGCGAAAAATCTATTAA 
AAATCGTTCGTCGTTTTGTGTGTGTGACCACGAAAAAAGTGCCCCGATCGGAGGATTTTAATAAATTCAATTAAGGTTCCGTCCCAACGATCGTT 
TTT CAT T GTCTGAC GCT C ACGCGTGATGT ACAAATGAAAAAGTAAAAATT T AAAT AAGAT CAAAG AAAGAAAGATCCC AGTAAAATTTAAAGTGG 
ATT GAC T GC ACAAGAAAA AG AAAAGT T C C TT A C C TC CT AGCC AG AAGTC AAAAGT G AAGC GG AAAA C AGA GT GGAGAT AAC AAT T AAC GGT T AGT 
TGTAAAGCTAAAACTACACAATAAACATATCATGAAAAACTTTATAAAACATAAGAAGGGGGGCATTTTATTATTTTGGGTATCAGCATTTACAT 
CACCTTGTTTCGAATCAAACTGATTTTAACATGCATTTGGACCAAACTAACAACGTTTCGAATGTTATCTCTTATGGAAATGGTATTGCTATATT 
ATACCATCATTAATTCCGATTCCACCTGAGATTAGCAATTAATATGAAATTTCATAAACATATATTATATATTCGGATCATGCATTCTATTTACA 
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CATTACTTTCCAATTAAACGTTCATTTATTGCCGCGCTTCGATTAGCAACAAAAACAAATCGTTAATTTTCTATATTTTTCCACAGGGGCTTCCA 
TACCAAAGTTCCTTGTTTGTAGAGATCATTGTAATCAGCGCAGATCGAAGCCATATATAAAATGATTTCGCACAGTCCGCCCATATTCAGTCACA 
GTCCACCCGTTAGTTTTCTGGCGGACTTCATGAGCGGACATCCCCGTCAATCGTTGAAGTAAGTTTACAAAAGTATATAACTCATAAATAAATAT 
AGCCAGCAATTGGACTAATTTAACACAGAATTGCTCTAAGTGAATCCACGAGTTTCGGCAGCCAATCAAAACAAATTAAATAGCCAAACATCATT 
GAAAGAGTACCAATTCTCAGGTTTTTTCATGTCGAAAGAAAATATTTTTTTAACTTTCTTATCGATTGATAAGGAACTAGAAGCATTTTCTGATG 
CATCATCTTTTATGAATTTAACAAGTGGTGTACGTGTTAATAAGAGAAACGATATAAATGTCTGATAACCTGAAAAACCATGTGATAGTAAATGA 
ATTAAACTGAGAGATAAAAAATACATAACAGATAAAAATGATTAATACCCTTTTGCATTAGGCGTTCTCTCTGGCGAATATTTAAACAATAATTT 
TAAACATTCAAATTTAAAACAAATTTCAAAATACCCAAACATAACCCACATTAACAAAATAAACAGACCGGAATGCCGTAATTCCATTTAATGAA 
TATAAGAACTCAGTTGTTTACACAATCCAGAAAAGGAATATTATCGATCTCTCGAGGGCTGCCCATATTGATTCACATGAAACACCTGTTTCAAG 
TGTCGTCTATGGAAATTTTAACACCTGATCAAAGGCAGGAAACATGTCTTACCCAATACATAGTATTAAAACCTGAGAACTATTCGAAAACACAA 
TGAATGCAGGTACTAAAAATATTCGTGCTAAGGCTCGTTTTTGGTTGACAACAACAGATTGGCAAAATATGAAAAGCAAGCATTCAAGATCAAGG 
CAAGACAGAATACTGAACCAAAACAAACATTTCGCCGATTCTGAGGTGATCTTTCGCTGGTGGAGCCAACTGAGATTTGAAGATTCGTTCACGAG 
TCGTGGGGGTGACGCCAAAAATTTGCTTGGCGACAATTTGCGCGTGGCGCTTGACGCAACTTTTTTTCTCAGGGGCACACGAGCAAGCCGAAAAT 
TTGACAAATTGGTAAAGGGAAGAAGGGTAATATCTATATATTATATTTATACAGAGGCGAGAGCAATTGATTGAATTAAAGCAATTTGCCGTGCT 
TCATTTGCTGCACGCGCTGCATTTTCATCATATTTGTTGAGAGTATTTTATTTTTGGGAATGATAAATATACAATACCGAAAATTCGTATAACTT 
ATATTCGTATATATGAGGACTTCAGCTAAAAATAAATCTATATCTAGAGTATGGCCAAGCAAAACAAACGCTACAAGAACCGGTAAAAAACTAAA 
TAATCAGTTAAGATTAAAGAGGTTCAGTAGATAAAACTGTGGGAGGTAATACATGCGAAAGCCCTACTGCGATCCCTACAACTATGAACATTTCT 
TATCGGGGCACGTGAGCGGCATGCCTTCAAACAAACCGAGTACTCCAACCAACTCAAGTAACTCAGCTGCTAATGCGTCAACTCACTTGAAATTT 
CACAGCGTGATTTTACAACAGTTTACCCCACCCAATACCTAGAACTACCCACGTAACTGGGCGGGTCGTCATCGAAAATGATTGAATTCTTTTGG 
T T GC CAAC T C AAGC AC T AAAT AT GC T GAT AT T GT CTC C AG AC AG C GC CAAT T AGCC AAAAAAAGT T GAT GAT AAC AGC AG AGC AG AC GG AAAT GT 
TCCCATAGCCAGGCCCACGCCGAAAAAGACGTATCAGCATGCCGTCAACATATGTTTAGCATGCAAAAGTCCACGCGAACCCCATTGAACGATCC 
TTATCAACCATTGACTTTTAGTTTTATGAATGTCTAGCCGAAAAATTATGATAACTACAAATGGGATGTGAACTGTGCCAAATGAGTAAGACCAG 
CGAGTCTCTCCGACTATACAATTTTAGAAATGACTCACTGAAGAACCCAAACAAATATGCTTCATATTCATACTATACATTTCCCACCCACGGCA 
AACCCGAATTAAGTGAGGAGGTCGCGCGCAACTCCTCTTATCAGCGATGTTTCTTTATAGACAATATACACGTATCAGTAGGACTACACAAGTAC 
AAGTACAAATACAAGTAGTACATGGGCCTCCGAACAATGAAGGCAGGCATTGAGGCAGTAAAAATTAAAGCCACCACTCTTATCAGGACTGATAA 
AAGGCAGCTTAATGCGTAGATGCTTGAAATAAAAGACTGATCTTGGGAATTTCTTTTCTGCTTTCCAAACGAAAAAAGAATCACAAAAAATGGTC 
AAAACAAT T GAAAGAT GGC T AC CAT C T GCT GAT GT TT AAAT AT AG AAAT T AT AT TT T CAAT T AGT GGT GT T AGAG AT AT AT AT TT T AACT GAT T G 
TGATATATTCAACCTTCGATATTCAACCTTTTGATTTTTTGAACGAATTCAACACTCATTTAAAAGGACAGAATAGTAACGAAATTAAGTTATGA 
AATAAATATTAAATAAATTTATTTTAAGAAACATTTTTATAGTTATTAATTTAAGTTTTGAAATACAAAAATAAACCAATTAAACTTGCATACCT 
TGTTTCAGCTACAATGACGAGCCCAACAGAGCGTCGCGCTACTGCCGGCCCCAAGTGATAACGCTGGCCTCCAAAGCCATGGTCAACAACGCCTC 
GGCGGCGGTCCAGGGTCAGGGTCCCAGTCAGCAGAAGGCGGGTGCCACGCCCCTGGGCGTGGGCAGTCGTTCCGGACGGAGTTACCTGGAGCGAA 
TGGCCTCGGCGCCGCTACTCACCAATCAGCCCAAATCCTGTCTCAGCCGCAAATCCTGCCTGCACAAATCCCAGAACACTACAGCCAGTTCCGCA 
GATCTCTCGCCGACGGACACCGATGTCGCGGACAGTGCCAGTCATAGCTGTATCTGCAGCAAGGATAGTGACAGCAACGGCGAAGTGAGGTGAGT 
GTGCGGGCGATAAGATTACGTAATGTAGGTAACTCGATACAGCGGCGCCGAAACGATCAAGAAGGCTGTAAATGCCGGCGGACTCAGTCACTTTG 
ACGTCAGGCCTTCAAAGATTATGGCGCCGCACTTATTAAACCACCAACTGAACTAGATGGGAACTTAGTGGTGTGAGAGCCTGGGTAATTGTGGT 
CAGAAT TGGGT GT ATC AACCT AT AACAGATTC AT T AAAT AGTAAGATGTC AAT TGAGGGG AAATCTTAAGGAAAGTC C GAGAAAGTAACACAAGA 
AGCTGTAGATGGTTTTCATAGCGCATCTTAACTATGTAGACTGTGGTCAAAAAGAAATCGAAAGCTTATGATTCAAACTTAAATCTTTGAATTTT 
AATATGGGGAGGATAGGATTTTGGTTCTAAAAATTGGAACGTGAAAAGATCATTTCTTATAAAACTGCGAAAATACCACTTCCGCCATAATATTT 
TATAAACAAAGATTAATCGTTTATAAGACATTTGCAATAGTTCTAGTAAAAGGAATAAGAAAATCTGTTGGCAGTCGAACAACTTTCCCACAGAC 
GTCATAGCCGAGCACATGTCATTTATTAAATAAGCTGAGAAAGCCAATTGCAGACGGCCGCCAGGCTAAACAAATGCAAAAAGATCATGAAATAT 
ACTTGAAAATATATTTTATATATTTTTAAGAGCATGCAATTTCACAAAAACAAAATATAAATTATCTAAATGACTGTAGAAATAAGCCAAAATAA 
AAACACGCCACTGCATCGCGAGTCGAGAATCTGTATCTCGTATATCTGAGATAAAAAATCTCACATTTTAAGACTCAAATGGGAATAGACTGGTA 
TTTAGTATAGACCCCAACAGCGAGGCGCCATTATTTTCGTTTTCCATTTCCATTTATTCGCTATATATAGGGAAGATACATTATCGATTCATGTG 
CAGGCCATGAAAATATTTGGGTTATCGATGTCTCGCAGATATTTTGTAAAATTGTATACATATGTATATTTCGCTTTCAACGAGCCCTAGAAATA 
ATTTCCTTTTTGCACTTGAATCTGACATTTTGAGAAATGCCTAAAAATACATATGCCACGAGATAGAGAAAATAATTTATACGTTCATTTGCGTC 
AAATGCGAACACTATTTCTATTTGGGCTGCGTCTCTAGACAAGAGATCAGGCTTAAAGCTACGCAGCTACTCAAACGAATGCT^ATTAACCATAAT 
ATTATCAATGTAAACCAGACGTGTGCTTGTTGCTGTTGGAGATTTTGTGTACACTTTGCGGTTGCGTATTTTGTATGCCTCGTTTTATCAGTTAC 
G AAAT C CAC T GGG AAA AAAT T AATT T AC CAAT T C AAATCT T C AG AGT AAAT T AT AATT C AT TT AAGT T AAC AC AAG AAC G T AAAGCAAT GC AAT A 
GAAAAATTATTAAGAATTATATTTATTGGAGGCTTGATTTAAGAATTGAAGACCTTATTCATCACCTTCCAAAGCATATCGCATCTGCTCGTCAC 
CCATTTGGCCCCACATTTTACAATGAGTTTTGACAGCCAATGAAAGATCTTTCTTTTGTCTTAATTAGCCTCCCCAAAAATCATCAAGATCAACC 
ACAAAACGAGGCAGACGGTTCACGTCAAGGCCGAAGAATTATTAGCATAGAAGCGTGATCAAGCGAAAGGCGTCATCTCATAAAAGACACAATTT 
TTTCGGGTGTAATGTAACTAAAAATATTTCAACATTACAAGCCTCGGAATACGAATGCACACAAACGTAAGGTTATATATAGAGATGGGGAGATA 
TGTCTGATGTTCATGCAAAGATCACGTGAATCGATCGGAAAAGTGAGTCTGGGGCACATGGGTCTGCTGTAAAAATGTTGGCGCACCTTTAGCTG 
AAAAATGTGAAAGAATTTCGTACATTTACAACAATTTTATTTCATTCGTAATTAAATTTATATTTATTGGTAGCGATCGAGATAATTAAGAAAAA 
CCAAATCCACGACAGAATTTTCCGAGTTAAAACTATTTTCAGTCTCTTACTATTGGACTTTGCTTAAAATTTCAGAAAAATATAAACGATAGGTA 
TATTTGGCGAAACTGAAATTCTTATGGGGGAAATAAAATAGAAAATACAGAAAATTTGGCGACATACAATACGGAACTACCTTGATTCGTTTACA 
AAAAAGTAGTTCGATTGTAAGGTCAACCAGAATGACCATCAAATCAGTCAGTTGTAAAAATTTCAACAGGTCTTGCCAGAATATTTTTATCGGAC 
AGGTTATTGACTTTAGAAAATGAGGGAACTCGAACAAGGTCTGCAAATTGAAAACTGAGGAAATTAAAAAGTATTTTTAAACACAATTTTCATTT 
ACATTACAAGCGTTTCGATTTTAAAAAGCCCTTTTGGTGCAGCCAAAACTTGTTTCTCCTGCACCCATCAACGATCTTAATGCGGCTTGCATATT 
GAGGAGTTTACAAATTTATGGACTTTAAACCGGACCCACGAAAATAAATGCGTTCCGTAACGCATACGGAATTGTGACTGCCTTTGGGTTTTAAG 
AAAAAACAGTTGATAGTCGTATCTCGTTTCCCTTGGAGAAAATGTAAGCCAAGTCGGGGAAAATCAATGAAAATGTAAATGATTTTTTTTTTTTA 
ATTTGATTTTTCGGGTGATCTCTTTCAAGTTGACAAGTTTTAGGAAAGCGGTCGGTAAAAAATGAGACCCGTCCGCGGTCTCCGGTTTCTGGTTG 
GCGGCCTGCTGGGCGTTGAGACCCACCCCACACACACCAGATAAGCGATCGCTAAGAAACCTGTTGAAGAAAATGCAGCAACAGCAGGCGCACAC 
CGATCATGCCCTCGATGCGTGATGTCGTCGATGTCGGTAGTCGGTAGTCGTTACGATCCGATTCTGGCCGAGTCGGGTGGTACGATCTGCCATCT 
CAGATACACCATCACCATCACCGATCTCTGGCAATACGGCTGGGGTACAATCAACAGAGAGCCAGTCTGTCGATAGCCATTTCAATTGTTGGCTA 
TTTGTACTTCCCATATATATTTAGCACTCGATGCGTTCCCGCCGACTCTCCTCCAATTTGCAGACCAAATAGAGTCTGCTCTGTGTGGGGCCATT 
AGTCGTAGGTCTGCTCCAACTGACGGAAACTTTTCGCTTTGGGTTTGGCAACAGCAAAGAACCATACAAAAATCGAGCCCCTACCGAAGCTGATT 
AGCCAAAATTAGTCGCTATATCTTATCGAATTTATGGCCAACTTTCTAAATAGAACTTGTATGCGTATTTTTTTCAGTAATATGAAAGTCAAAAG 
GGCGG AAC ATTTGTTCGAAGAGTTTGGAAAGT AATT AACTGCAGCAAACATTTGTATTGAACACAATATATTATTATATTGTATTATTCGTATCT 
GCATTTAAATTAGGTAAAATTTTAAAAGTCCAATGAATTTCAGATTCCAACTAAATTTCGTTGAATAACTTATATATTTTGAATCACGAGTTATC 
CTCTATATTTTATATATCTTAGTGGGTGTGTTTTTATTTACATTTAAGCAAAAGAACCGTGGGAAGAGTCGCCCAATAACATGGGTGCAAAAAGA 
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AAGGAAAAGCAAAAGTAGGCGTGATTTACATATAGATTAAGGCAAATCTTTGGCAAAGCCGTACGTGACTTCTAAAAATAAM 

T ATACAATACACATTAC TCTTT TTATGACGCAGGGAATTT TT CGGGAAACGC AAAGGAAT AAAAAT GATCAGC AGAAT AGAAAAAAAAAAACACT 

AAAGGAAAAGTTTCACTCGACCGGGATTTTATATTACCAATTGTCAGACAATCGACAACTTTGGCACATATAGTAGGAGCTTTCAAAAATGGTTT 

TAAAAACTGCATTTGGCAAAGACGCAGTTTGCTTAACTAAAGTAAAAACAAACCAATAAAATGAAACTGTTAAGGATAGAAATCCGGGCAAAGAA 

AGATGCTGAGTCAATGACCACAAAGAAATAAAATAAATAATTCTGAATGGGGGGTCAAGTCAAACTGATAAAGCCAGGGTCAAATCTATATTTGA 

TAATGTATATTTTAATAATTAATAATTTGCTTATAATAGATTGCGATGTACTCAATGGATGGTATTTCTTTTTTTACTTTTTGCAGCCGTGGCCC 

CAAAAAGCATGTGATCTTCGCTGATGATGAGGGTCTATCGCTAACCGAGGTTCGTGTGATGTCCGAACCCTCGAATGTGCCGCCATATTGGAGCA 

TGAAGTTTCTGGAGCAAATTACACAGGGCCTGGTCAGCCCACATCCTCCGGATCAGTGGACAGTGGACTTCAAGCAGCCAGCCTCAGACTATCTA 

TCATTTAGGTTGGTACCCTATTTACACAACTGTAAAATATATACGAGTATTCTAAATATTTGTATTTACACAGACAAAAGATAGAACGCGACTTT 

GTTTCCTTGGAGAATGTGATCGTCAAGGATGAGGAATCGATTGTCGTGGGCACCATCAAGGTGAAGAATATCGACTTCCAGAAGGAGATCATTGT 

CCGTGTGACCTGGGATGATTGGAAGAGCCAGCAGGATATCTTCTGTACATACGCCAGGGCCTATGGACCGGCTACTTGTGCTCATGTAGTCTTCG 

ATACATTCTCCTTCAAAATCACCTTGCCGCCATCCTCCAAGCGCCTGG7^ATTCTGCATCTGCTATCGCACCAACGAAACAGAATATTGGGATAAT 

AATGATGTGAGTTTGGCATCAAAATAACTTAAAAAATATTTTACATAATTTGTATTTGCTTACAGGGTAAAMCTACACCATTAGCAAGCGATCG 

CCCTTTTATTACAACGCCTTGTCGCCCTATGATAAGGGTCAGAATCGCAACTCGAGCCAGCAAATAAGGTCCACCCTGACCGATGCCTTGGCCAA 

GGTGCAAGACCAAAATGGCTGGCACCAGGAGCCACATACGCCATATTGGTGAAATCGCCTGGGAACGGGATTGGATTTTCCTGACCCAAGCGATT 

G AAT AAAGT T T C T C AT T T AGGC C AC AT G AAC AG AAAG AAAC AG AAT T GGAAAAGAT TT AAAGT GT T AT AATC AGT TTC ATTGAC AG AGGG AG AG T 

CCAAAAGATTTTACCACTAACGAAACTTTTTGACCTAAGCTAATACCCGGTACAAATATCCCAATTTTAATGTGCCATTGACTTGAGGTTTACTC 

AATATAAAATGAGATAGGAGGGCTTATAACTACTTCTACTGAGCGAAGCCCGAATTTGTATGTCTTGTCTTTAGTCAACTTGATGGAATTTCAAA 

ATCTATGTATAACTTTTTAAAATGTTTTATGTTTGACTGTAAAGAATTTTTGGAAAAGATAGATTTATATGTTTACCTGTATGCCTGTGTGATCG 

TCTACATCTATATAACTGATACTAGATTTATGTTAACTAAAAACGAAAAAAAAAACCTTATAAAAATGAAAGGAGATGAATTTTGAGGCGTGAAG 

CAT T CAAGC AAAC CAT T GT T TCG AAAT T T AC T GC AAC TGCAAAAT AAAG AAAACAAAAAC C AAC AAAACAC ACATTATGAAAGAAC ACAAG AAAA 

GCAACAAAATTGTAGCCCTAAGTTTATTCGTTTTATTACTTTTTATTTCTATTCTTTTTATTTCATTATACACTTTTATCTTTTTGGTTACTTGA 

AACGCTAAGACTGAAATGGCTTGTTATGGAGAAGAAAACACGTCCTGCAATACATTATATAATAAAAAGAACCCAGCAATTTTCGCATCACCACA 

TTACAGACAACAAAGTACTTTGTCCTCATCGTTCTGTGTGATCCAGATCCCCGACTATATATGATATATACACCTAGCTAGATTTAAGCGATTTG 

TACATACAGAAGCAAAAGCAAAACAATATACCAGAAAATTGCCTGTTAAATACATTTATATACACATATGAATTTAGTTATATCACGATGACTAG 

TTTTTAG 

(SEQ ID NO: 1483) 

Exon: 1001. .1419 
Exon: 4474.. 4839 
Exon: 9017.. 9223 
Exon: 9289.. 9601 
Exon: 9661.. 9837 
Start ATG: 4544 

Transcript No. : CT26388 

TCGAAGCGTACGGTTCATAGGAGGAACGGAACGAAGCGGGGAACGCGACGGAAACTAGTTGCTGTTTTTTTGTCCGTGTTAAATAATTGACACAA 
GAAAATTTAGCTACACTTAAGCAAAGTCCGCGAAAAATCTATTAAAAATCGTTCGTCGTTTTGTGTGTGTGACCACGAAAAAAGTGCCCCGATCG 
GAGGATTTTAATAAATTCAATTAAGGTTCCGTCCCAACGATCGTTTTTCATTGTCTGACGCTCACGCGTGATGTACAAATGAAAAAGTAAAAATT 
TAAATAAGATCAAAGAAAGAAAGATCCCAGTAAAATTTAAAGTGGATTGACTGCACAAGAAAAAGAAAAGTTCCTTACCTCCTAGCCAGAAGTCA 
AAAGTGAAGCGGAAAACAGAGTGGAGATAACAATTAACGCTACAATGACGAGCCCAACAGAGCGTCGCGCTACTGCCGGCCCCAAGTGATAACGC 
TGGCCTCCAAAGCCATGGTCAACAACGCCTCGGCGGCGGTCCAGGGTCAGGGTCCCAGTCAGCAGAAGGCGGGTGCCACGCCCCTGGGCGTGGGC 
AGTCGTTCCGGACGGAGTTACCTGGAGCGAATGGCCTCGGCGCCGCTACTCACCAATCAGCCCAAATCCTGTCTCAGCCGCAAATCCTGCCTGCA 
CAAATCCCAGAACACTACAGCCAGTTCCGCAGATCTCTCGCCGACGGACACCGATGTCGCGGACAGTGCCAGTCATAGCTGTATCTGCAGCAAGG 
AT AGT G AC AG C AAC GGC G AAGT G AGC C GT GGC C C C AAAAAGC AT GT G AT CTTCGCTGATGATGAGGGTCTAT C GC TAAC C G AGGT T C GT GT G AT G 
TCCGAACCCTCGAATGTGCCGCCATATTGGAGCATGAAGTTTCTGGAGCAAATTACACAGGGCCTGGTCAGCCCACATCCTCCGGATCAGTGGAC 
AGTGGACTTCAAGCAGCCAGCCTCAGACTATCTATCATTTAGACAAAAGATAGAACGCGACTTTGTTTCCTTGGAGAATGTGATCGTCAAGGATG 
AGGAATCGATTGTCGTGGGCACCATCAAGGTGAAGAATATCGACTTCCAGAAGGAGATCATTGTCCGTGTGACCTGGGATGATTGGAAGAGCCAG 
CAGGATATCTTCTGTACATACGCCAGGGCCTATGGACCGGCTACTTGTGCTCATGTAGTCTTCGATACATTCTCCTTCAAAATCACCTTGCCGCC 
ATCCTCCAAGCGCCTGGAATTCTGCATCTGCTATCGCACCAACGAAACAGAATATTGGGATAATAATGATGGTAAAAACTACACCATTAGCAAGC 
GATCGCCCTTTTATTACAACGCCTTGTCGCCCTATGATAAGGGTCAGAATCGCAACTCGAGCCAGCAAATAAGGTCCACCCTGACCGATGCCTTG 
GCCAAGGTGCAAGACCAAAATGGCTGGCACCAGGAGCCACATACGCCATATTGGTGA 
(SEQ ID NO: 1484) 

Start ATG: 490 

MVNNASAAVQGQGPSQQKAGATPLGVGSRSGRSYLERMASAPLLTNQPKSCLSRKSCLHKSQNTTASSADLSPTDTDVADSASHSCICSKDSDSN 

GEVSRGPKKHVIFADDEGLSLTEVRVMSEPSNVPPYWSMKFLEQITQGLVSPHPPDQWTVDFKQPASDYLSFRQKIERDFVSLENVIVKDEESIV 

VGTIKVKNIDFQKEIIVRVTWDDWKSQQDIFCTYARAYGPATCAHVVFDTFSFKITLPPSSKRLEFCICYRTNETEYWDNNDGKNYTISKRSPFY 

YNALSPYDKGQNRNSSQQIRSTLTDALAKVQDQNGWHQEPHTPYW* 

(SEQ ID NO: 1485) 

Name: putative protein phosphatase 
Classification : protein__phosphatase 



Celera Sequence No. : 142000013385203 

CGTAAGAGCGAACCGGCTTTTCTACCATTTGTTATTATTACTTTCACAGGATCTGTTAATATTTCGGAACCGAAGAGAAACCCAGTCCCGATACA 
TGTCTATTTTAATCTAAACCTATCAGATTCTTTTGATTCCTGTGTAACCCTCAGTAGCATTACACTTGTTCCCAGAAAAATGGAATCTGGTATCT 
TAAAAAGAAAACTGAAAGTAATTTATTCTGAACCCAATACCCTTCTAGATTTTGGTGATCAAACATAATAAATTGTTAGATTAGGGACTCGTTCT 
ACGAAAATCTGCTTATCGATTTAATCATAGCTAATGTTGGAATTTATTTTTAATACAATTGCTCGATACCTTAGCTAGAATAATTCAAAAACGTT 
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CTGAATTGTTTATCCTTCATTTTTAAACTGTAAAATTTGAGCTTATTCAGAGTGGAGTAGTATAAGAAAATGTTGAATATTTTGATACCGGATCC 
GATAGTTATAGAACCGGAACCTACGGATCCCGTTGACCTTTTTCTTATCGCACTCTGATATTCCAACCGGTCGGACACATGTGTTTTTCATCCGT 
GAAGTCGGCCCAGAATATTTCCATCATTCAATATTTACCACCTTTTATAGTTTTGTTTACTATCTTGCCGACTTTTTTGTCTTGCCGTCTTGCTC 
TCCTCTCCATCCTAAAAATACTCTTCCTGCGCTCCATCTGGCAATGCCGCTCGACAACTGTACTTCAAAGAGCGTTGCCAGGCAGGTCACTTCTC 
TCATAGATACACAGATACAGGCGCGCACGCACACCAGCACGCAAAATAACCACCACCAGCAGTACAGAAATTACACATTTTCTCCCCGAGTAGAG 
AACTGAAAAACTGGCCGCGCTGGCAGCGCCGGAGCAGCGAAATTCGATCGAAACCATTTACTCAGTCTCAGTTCGAGTTATCCAACGTTGGCGAG 
CGGACAAACAAGCAGGAGAAAAAAGCGTACGGGCAAGAGAAAGAGAAGGCACAAAATCACCCACAGCCACTGAAATTTTCTTTTGCGCTCCGGTG 
AGAAGATAATTTCACAAACCAAAACCGAACCAACAAACAAAAAAAGTGCGCTCCGTGAGCGGGAAAACAATAACAAAATCAAGAGACATGCGATT 
ATAAAACGAACAAAATAACTGAGTGACAAAAAAAAATTAAATGCGTGTCTCATGGCTTGACTGTACTGTGTATTAAAAAATGTGATTGTGTCGGC 
TGTCCGTCTGTCTGAGTTTGTGCGCGTGTGTGTGCTGTTGGCGTTTTTAAATGTGGCGCGCTATTTTTGTGTGATTTTTGTGGGCAGCAACGAAT 
TTTGAAACATCAATTGTATACTAGTTGTCAGTCCCCTTGTGAAAAAGTCTTCTCCAACAACAAAATTCCTGTGAGCAGCCTACACACATGCGTTG 
TCTCGCTCTCGCGCGCTCTGGCCCCCAAAGAGTGCGAGAGCGGGAGAGTGAGACCGAGAAAGAGAGAGAGAGAGAGAGAGCTAATAACTAACGTC 
TGTACTCTTTTTCTTTTCCATTTTTCATTTCAGTTTCGAGAAAGCAGCGCGCCAAGCAGCAAGAAGAGGAAGAGCAAAGGGAGTTGGAAAAGCAG 
TGGCGTCGTGGAGGAGGAGTAGAAAGGGGATTTTTGCGAAAAATCCCAACTTGAAGAGAATTTTCCCCGTAACCGTTAAACATTAAATCGTCGTG 
TTGGAGACAAATAGCAGCGAATACGCTAGCAACGCTGGGGAGGAACAACAATTGTTGGATTTGGACGCGACCGCTTTGATGTCGCTGCCAATGAA 
TTCGCTGTATTCGCTCACCTGGGGCGATTACGGCACCAGCCTCGTATCGGCAATCCAATTGTTGCGCTGCCATGGCGACCTCGTCGATTGCACGT 
TGGCCGCCGGCGGGCGGAGTTTTCCCGCCCACAAGATAGTCCTGTGCGCCGCCTCCCCCTTTCTGCTGGACTTGCTAAAGGTAAGTGCTCGGTTC 
GG AAAC ACT GCC T AAAAC ACC TT T C AAT AAG AG AGCT T GT AC AT AAC C T C AAAC GT T TC G T AC AGT T T GT AAAC AG AG AAAT C C AATCGC AGAAA 
TAGGTTAATGTTTGGAGCGCAGAGAAACTGACACTCGCACGCAAATGCATACATATGTATAAATAACGCAGATTTACGCTCCGCTCCAAAAGTTG 
TCTAGAGATTTTTGTACAAGTTTTCCACTGTCAATTGAGTTGAATGAGTGGCTCCGCCCCCCGCATTCCCGCCGTCTATCTTGCTTATCAGCTTT 
TTCTTGTTGCAAAGTTCACCGGAGAACGAGAGAGTGAGACGTAGAGACAGAGCGCAGACAGAGAGGGAGAGAGCGAGAGAACCAGTTGGTTCGGT 
CGACAGAAAAGTGGTTTTGGGCCCGGTGGTGTAACAGAGAGGGGAGAGCCGAAAGCTGGTGCGCCAACTTTTAAATCGCATGTGTTAGAGCGAGC 
GCAAACGAGAGCGCCAAAAGCAGCGCGGCAACTGTTGTAGTTGTTGGATGGTTGGAGCTTTCGTCTTGCCTTTTGGTTCTTGCTGTTGTTGGGTG 
CACACGAGCGCGACCTCTTTCTTCACAGACAGAACTGTACAGTTGGCTCTGGCTAGGGGGAACTTGAAATATTGGGGGGAATATGGATACATATA 
CAAAGATAATTAAAATTAAATTTGTATTGATTGGTTTGATTTCCTTAACCCATTTCCTAAGCTAAGTCTTTTGTATTTTGTATGTCACCATTTTA 
GAT T GCT AG AT T G ATT GT C T AAC AGC AAAC T T C GCT C AAT T AAAAGT T C C TGG AC TTT AAT AG ACT TAT C TAC T T C AAA T C T T TAT T T AAA CT TA 
TAACATAGTGACCAAGTCCAGATTGTACCAAAGATAAGCAAGCCTGTCTGGTGCGCTTCCCTCGTATCGAATCCAACCCGTATTTCTATAAACTA 
AAGCATTTCATATTAGGAAGGCGCACTTTTAAATGGAGCTTGCAAGTTTTTTCATGAGCTGTATTGCGATACTGTACTTTTTAACAACTCATTTG 
CACTTTAACTTTGACGCTTTTCCGCCACTCTGTCGATTGTTCAAAGTGTGCGCTTAAGTATGAAAGTTGGACCGTCTTCAATGCGTGGCCCTATC 
GTAATGGAAATACTGACCAAAACAGAGCGCGGCCAGCGAAATGAAAACACCTGATGCGACGCCGACAGCGGCTACTCTGCGCTTCGCTCCCCCTT 
ATTCTCAGCGCTCCTCATCTCTCCTCTCCTTTTCTGGGCCGTGAA2VAAGTAAACAGAGCGAAATTTTTGAAAACGCCGCTCAGAGCCGTCGTGGG 
GAAATGTCAAAGAAAAAAAAAAAAAAACAAAAAGAACGGGACCAATGCGGCTGTGAACTCTGCAAGAGTCCGAAAAGTACAAGATGTACTGATCT 
CGTTACACTCACACACGCACGCAAAAACACATGGCGGTCGCTCTTTGTTGCTCTTGCTCCCTCTCCCTTGCTCCTACGGCAAATGGAATGCAACT 
CTGAAAAATCGATGAGCGAGACTTTGGTTGGAGGCAGAGTTCAAAGAGCGCAAATGGGAGTGCGGTGAGATGTGTCATTTCAAGAGATTGGCATT 
GGATGAAGGGAGGGTGTTGGTTGGTGAGTCCGTAAATGGAGAGGGGTTGTTTTCTGAGCAATGCTATATAATTAAATGTTTGGAGCGGGAGCGGG 
AGCGGGAGAGGACACAGAAGAAGAACAGCTAAGCGCTCTTTCTTTAAGCGCTGCTCTGTTTTCTTTTTCAGCATCTGAAATCTCAAAATCCTGCT 
GCAGAATGTGCAGAGCGCGCAAAAGTAAAAGCCGGTTGTCTTTTGTCTGTGTCACTGTGTGCGTGAGCGGAGGAGAGTGTGAGCGACGCGTCGCA 
CCGCGAGTTTCCATTATTTCCATCTCCGTCCGACTGTTTTTGTTATTGTTTTTATTTGGATTGGCGGACGGACGAACGGGCCTCTCAATTCACGC 
C AAAAAT GT G AGAC GAT T T C AAAAC A AAC AC AAC T CT CT AC G AT G AAG T T CGAGC C AAG AG AAAT GT TGG AT G AAG AGC CT AC G AAC T TT GC AAA 
GTGTCAAGAAGTTTTTCTTATAGAACATTTTTAGATTTCAATTAGAAAATTTCTATAACTTACTAGAAACCGGTTTTGGTTATCGTACATACCTT 
AAACAATAGAGAAATATTTATATATCATTTAAAATCAATTTATCTTGCCCTATTCTTAATAACTCCTTTTAATCTCATTAGTTTTACTTATATAA 
ATACAAACTTATCAGCTCTAAATAAATTAGATTATATATGTCTTCTCGATATTTACAGAATACACCATGCAAGCATCCAGTGGTTATGTTGGCTG 
GCGTCAATGCGAACGATCTGGAGGCGCTGCTGGAGTTTGTGTACCGCGGAGAGGTGAGCGTGGATCACGCCCAGCTGCCGTCGCTACTGCAGGCT 
GCCCAGTGCCTGAACATCCAGGGACTGGCACCGCAGACGGTTACCAAGGACGACTACACCACGCACTCGATACAACTGCAGCACATGATTCCACA 
ACATCACGACCAAGACCAACTGATTGCCACGATCGCCACGGCTCCACAGCAAACGGTTCATGCCCAGGTGGTGGAGGACATCCATCATCAGGGCC 
AGATTCTCCAGGCAACGACCCAGACCAACGCAGCAGGACAACAGCAGACCATTGTGACAACCGACGCGGCTAAACATGACCAGGCAGTGATTCAG 
GCTTTTCTTCCGGCACGCAAACGCAAACCACGCGTAAAGAAAATGTCACGTAAGTTAAAAACTTATTTTTGTAAACAGTTGATTCAGAAAATTTG 
TCCAATAAACTCTGAACGTATTGGGTAAGAATTAGAAACAAAAAATAATAACATGCATTTCTACTTGTGCAGCTACGGCACCGAAAATAAGCAAA 
GTTGAAGGAATGGATACGATTATGGGCACACCGACCTCTTCACACGGCTCTGGATCCGTGCAGCAGGTGCTTGGCGAAAATGGAGCCGAGGGCCA 
ACTGCTATCATCCACACCGATCATCAAGAGCGAAGGACAAAAGGTAGAGACTATTGTGACCATGGACCCCAACAACATGATACCGGTAACGTCGG 
CTAATGCGGCAACTGGCGAGATAACACCAGCACAAGGAGCCACTGGCTCATCTGGCGGCAATACAAGCGGCGTCCTATCCACGCCAAAGGCAAAA 
CGCGCTAAACATCCGCCCGGAACAGAGAGTGAGTTTTAATTTCGGTGTATTCACTTGCCGGCTTAAAGTTATTACTAATTATAAGTATATATTTG 
AGAATAATGCATTTTGTTAGAGCCAGTGAATACCCAATATACCATCCCACAACGTGTTCCAAGAACTAAAGAACGCTATTTTCCGTTTTTAGAAC 
CACGTTCACGATCACAATCTGAACAACCTGCTACTTGCCCCATTTGCTATGCTGTCATTCGTCAATCCCGGAACCTGCGGCGCCATCTCGAGCTG 
CGGCATTTTGCCAAACCCGGCGTGAAGAAGGAGAAGAAAAGTAAGTCCGGTAACGATACAACCCTAGACTCCAGCATGGAGATGAACACCACGGC 
AGAGGGCGACAACACAGTGGGCAGTGATGGAGCAGGCGGAGCAGGATCAGCAGGAGGACAATCATCTGGTACGACGCCAACGAGAGTGATATCGA 
ATGCACCGCAGGCCGCGGGGGCGCCGGCCATCCTGGCCCAAGGTGTGCTGCCCCAGCAGCAGCAACAGCAGCAACTGCAGCAGCAACACCAGCAG 
CACTTGACAGCAACATTAGCGGGTGGTGGACAGGCATACATCAAACATGAGGGCGGCGGCGGGGGCGGCACTGGGCAGCAGCAGCAACAGCAAGC 
GGCACAACAGCAGGGCATGCAGAACGTCATACACATTGTGGGCGATCAGGTCTTCATACCGCAGCAGCAACAGCCGCAGCCGCAGTAGCCACCTC 
CAGCCGAGCCTTCAATCATTCCAACGCACCAACGACACCATCATCCACATTTCCAAAAAAACATTAAAAAAAAAAACATAACACTCACAAAAACG 
ATTTGTAAATAACAATCAGCAAACAAACAAATTCCCCGTGATCTGATCGAATGATTTGTGTTCATTTCCTGTATCATTTTTAATTTTTCCATAAT 
TAATTGCTGCGTAAGTTTATATTTTGCATAACTTTTCAAAATTGCGCTTTTTAACATATAACATTTTCTTATGCAGTATTTTCTTGCATATTGTA 
GCACAGAGAAAAGGACAAAAAGAGAGAATTCCGTATGTCTCCACCTAAATGAATACTCGATTAACTTACTTAAATAATCTAATTAGTAGTGTCGC 
AAATAATTGTTGTAAAAATTAATTGTACAATTACAATCGAGGATCGCAATCGGTTTAAGTTTCTCTATTTTAGATATTATATTACAAATTGTTTA 
ATTAATAAAATATTATATATTTTTTTATATAACAAAAACAAAACCAAAATTAATCTCTGCCTGTGGCACGGACTTTAATTTACTGCAATTGGCTG 
TTTATAATTATTGTTTCTCTTAGTTGCAAGAAAATACTGTTGGTCGATTTCCAAACCTACTAACTTCGTACTGATCTTTGTAATGTTTCGCAATT 
CCTCTGGAGCAAATCTTGCGCATTTAGAATTGACAATCTGGCTGATAATCGTATTTAAAAGTCTTGAATTAACGTTGAATATTGTTACGTTGTTT 
TTGTATCTTTTTTAATTATAACTACACGTTTGTTTTATTTTTTAACAGTTTTTCACTAGATTTTTTTTATTTCTATTATTAGTTTTGTTATTTGT 
TGTTCATATTGATTTAAACCTATATTAATATTTGTTTTGTCTTGCGTTTTTAAGTTTTTTGTTTGTAACACCTTTCATCTACACAATCAACACAT 
CTTCATGTCCTGTTTCCGGCTGTAGTCCTTAAAGGCCGCGTTGTTTCTCTTTTGGGCCAGAGATGTTGCTGTAGTGTCGTCCTCTCGCTCTGTAT 
TACTCCTACTATCCAACAGTCTACCTTGGCTAACTCCAGCGGCAGTCTTCTCACCAGCAACTCATTCCTTCCTTGCAGCAACCAGTGGCAAGAAG 
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TCTAGTAGCGGATCCAGCGGTTCAGGCAGTGGGGCACTGAGCTCCAGTGGCAGTGTGCCGCAGGTCCAGACGGTTCAATCCCTGCACACTTTACA 
GGGTGTGCAAGTCAAGAAGGATCCGGATGCCCAGCAACAGCAGCAGCAGCAGCAGCAGCAACAACAACAACAACAGCAGCAAGCAATGACAGTGA 
G C GG T GC GAC GGG C GGA C AA GT GC AAC AAC AGGT GC AGC AAGT GC AGC AG C AGGTCCAGCAGCAACAGCAGC AG CAGCAAC A AC AACAACAAC AG 
CTGCAACATCACCAAATCATAGACTCGTCGGGCAACATTACAACAGCAACAACGAGTGCTCAAGCGGCAGCTGCCGCACAGCAACAGGCGGCTGG 
CCAGCAGCAGCAACTGGTGGCACAATCCGATGGCAGCGAGAGTGGAGCTCCGCTTTCAATTGCCCAGGTGCAGACGCTGCAGGGCCATCAGATCA 
TAGGCAATTTAAATCAGGTGAATATAACTGACTTTCAACAACZ^ACAACCGCAGCAGCAACAACAGCAGCAGCAACAACAGCAGCAACAACAACAA 
CAGCAACAGCAAACACAACAGACGCTCTGATCGGACCGAAGTCTTAGATCCATCTTGTACTATTCTATACAATTATTAATTTGGGTTTTTGCTAA 
ACGCTTTAAATATTCTCTTTGCGGAATGCGTAGACAAATTCTAAATTAGTTTTAGAAAGCCAGAAAGGCATATTTAGTTGTTTACCGTTTTGTTG 
TACAATCTTATGACTTTAAATGTTGATTTACCCCTCCTACAACTTATTTTGTTGCTAGACAAACGCTTAAGTGCTCTATGGTTATTGGAAATTAC 
GTGCATATTTTACATACTTTATAAATATTATCATTTTGTATGGTTTTACAAATGTCCCAATGCTAGAGGACCTCTAGTCATTTTGAAGTCCTGCA 
GATCAGTAAACACCTTAATCCAACAAGGATCCCTTGCTCACGTCCAGATTTTGGCCATGAACGAATGCACCTTGATTTAAAAAATATTAACAAAT 
ACATTAAGTTTAAAAATCCTTGAGTTACAAAGATTTACTTAAGCAGCTCAAAAGTAATTGATCTGATCAAAAGAGAGTTCGCTCAACAAGGTTTC 
TAAAAACCCACCCACCAACAATCACTTAAGTAATCAATCAACCAATCAATAAATCAACTCGATATCTCTGCTTTCGTTTTTGTTTAAGAACTAGG 
CGTTCTATGTTTAAATTGAACCAATTTTATATGTATATTACTTAATTTTAATAGTTGCAAATTTTTCCTAAAATTTAGACCCCGTTGTACTAGTT 
T T GG ATG ATG AAAT AT AC AT AT T CC T GG AAACT AT GAT TT AAT TAT AT AAAT T T GAACG AAAGG GT AT ATT TC ATC T ACCT AAAC T GG AC GAAT T 
ATTGTTAGTCTTCTTACACACTTAAATAAAATCTAATTTTATTTAAAACCAATTACAAAAAATAAGTTTAATAAAACAAAACTTTTTCA7VACAAA 
ATTATTATTAGTATTTTCTTTTGTTTTTCGCTTTTCGGTGATGGAACTTTTTGATTTAATTTTTCAATTAATTTACTCATAGTCCCTTTAGCAAC 
CAACCTTACCTATTAATTAAGTATTTCTGTAATTTACAGCATGAATTAACGTTTTAAATATTTTTTGTATATTAGTTTTTTGCTTTTCCTGATTT 
GC AGT AC G GCT GAC CT AAAAAAT GAAGAC AAACT T AAT TG AAAAAT C AAAT AG T G ACT AAT GAC ACT T C AC AC AC C AGC AACGAAAAC T C AC C AA 
CACCACATGCATTCTGCACTGCAGTTTACACCATTTTGCAACACATGAGCAATCATTAACAAATCGTTTTTATCTTGTCTTCTTCTCTTTCTCGC 
TGCCACGTTAATAACAACTAAACGTCATCTAACCAATCAAATTCAATACATCTGTGTGTAAAAACTAAACATTCATGCATCCGACAACAAATCAA 
AATCATTCAAAAAAAAAAAAAAACAGGGCAACAACAAGAACATTTTGGTAAAGT^AGGTTTTCATTTTGAAAGGCGGTAATAATACGTAATGATGA 
TCCAACGATATGTCCAGCGGCCCTGGAGGAGGAGGAGGAGGAGGAGAAGGTGGAGGCGGCGGCAGCGGGCAACAGCAGCATAACACCAGCAGCTC 
G AGC C AC AAC AAC AGC AGC AAC AAC T GCGGC ACG GC AGC AGC AACT C AA T T GAGC A GC AGC AGC AAC AGC AG AAGC AT C AC AC CAGCAAC AT C GA 
CATCATCGACTGGAACTGGAACAGGAACCACATCCACGCTGCAGCGTCGTATACTGCGCGGTCATGCGGCGCAAACGACCAGCGGCGAGCGTGTC 
CTGCTCATCAACTATGTGCCACAAACAGGAGCATCAGCCACCGCATCCACTCAAAGTACTTCTCAATTGCCCACCCGTAAGATACTCCAGGCCAG 
AGCCTCAGCGGCAGCAGCAGCGGCGGCAGCATCATCGTCATCATCTG 
(SEQ ID NO: 1486) 

Exon: 1001,. 1400 
Exon: 1554.. 1980 
Exon: 4334.. 4799 
Exon: 4918,. 5253 
Exon: 5413, .5550 
Exon: 7109.. 8642 
Start ATG: 1789 

Transcript No. : CT26545 

ACAAAATCACCCACAGCCACTGAAATTTTCTTTTGCGCTCCGGTGAGAAGATAATTTCACAAACCAAAACCGAACCAACAAACAAAAAAAGTGCG 
CTCCGTGAGCGGGAAAACAATAACAAAATCAAGAGACATGCGATTATAAAACGAACAAAATAACTGAGTGACAAAAAAAAATTAAATGCGTGTCT 
CATGGCTTGACTGTACTGTGTATTAAAAAATGTGATTGTGTCGGCTGTCCGTCTGTCTGAGTTTGTGCGCGTGTGTGTGCTGTTGGCGTTTTTAA 
ATGTGGCGCGCTATTTTTGTGTGATTTTTGTGGGCAGCAACGAATTTTGAAACATCAATTGTATACTAGTTGTCAGTCCCCTTGTGAAAAAGTCT 
TCTCCAACAACAAAATTCCTTTTCGAGAAAGCAGCGCGCCAAGCAGCAAGAAGAGGAAGAGCAAAGGGAGTTGGAAAAGCAGTGGCGTCGTGGAG 
GAGGAGTAGAAAGGGGATTTTTGCGAAAAATCCCAACTTGAAGAGAATTTTCCCCGTAACCGTTAAACATTAAATCGTCGTGTTGGAGACAAATA 
GCAGCGAATACGCTAGCAACGCTGGGGAGGAACAACAATTGTTGGATTTGGACGCGACCGCTTTGATGTCGCTGCCAATGAATTCGCTGTATTCG 
CTCACCTGGGGCGATTACGGCACCAGCCTCGTATCGGCAATCCAATTGTTGCGCTGCCATGGCGACCTCGTCGATTGCACGTTGGCCGCCGGCGG 
GCGGAGTTTTCCCGCCCACAAGATAGTCCTGTGCGCCGCCTCCCCCTTTCTGCTGGACTTGCTAAAGAATACACCATGCAAGCATCCAGTGGTTA 
TGTTGGCTGGCGTCAATGCGAACGATCTGGAGGCGCTGCTGGAGTTTGTGTACCGCGGAGAGGTGAGCGTGGATCACGCCCAGCTGCCGTCGCTA 
CTGCAGGCTGCCCAGTGCCTGAACATCCAGGGACTGGCACCGCAGACGGTTACCAAGGACGACTACACCACGCACTCGATACAACTGCAGCACAT 
GATTCCACAACATCACGACCAAGACCAACTGATTGCCACGATCGCCACGGCTCCACAGCAAACGGTTCATGCCCAGGTGGTGGAGGACATCCATC 
ATCAGGGCCAGATTCTCCAGGCAACGACCCAGACCAACGCAGCAGGACAACAGCAGACCATTGTGACAACCGACGCGGCTAAACATGACCAGGCA 
GTGATTCAGGCTTTTCTTCCGGCACGCAAACGCAAACCACGCGTAAAGAAAATGTCACCTACGGCACCGAAAATAAGCAAAGTTGAAGGAATGGA 
TACGATTATGGGCACACCGACCTCTTCACACGGCTCTGGATCCGTGCAGCAGGTGCTTGGCGAAAATGGAGCCGAGGGCCAACTGCTATCATCCA 
C AC C GAT C A T C AAG AGC G AAG G AC AAAAGGT AG AGAC T AT T GT GAC CAT GG AC C C C AAC AAC AT GAT ACC G GT AAC G T C GGC T AAT GC GGC AAC T 
GGCGAGATAACACCAGCACAAGGAGCCACTGGCTCATCTGGCGGCAATACAAGCGGCGTCCTATCCACGCCAAAGGCAAAACGCGCTAAACATCC 
GCCCGGAACAGAGAAACCACGTTCACGATCACAATCTGAACAACCTGCTACTTGCCCCATTTGCTATGCTGTCATTCGTCAATCCCGGAACCTGC 
GGCGCCATCTCGAGCTGCGGCATTTTGCCAAACCCGGCGTGAAGAAGGAGAAGAAAACAACCAGTGGCAAGAAGTCTAGTAGCGGATCCAGCGGT 
TCAGGCAGTGGGGCACTGAGCTCCAGTGGCAGTGTGCCGCAGGTCCAGACGGTTCAATCCCTGCACACTTTACAGGGTGTGCAAGTCAAGAAGGA 
TCCGGATGCCCAGCAACAGCAGCAGCAGCAGCAGCAGCAACAACAACAACAACAGCAGCAAGCAATGACAGTGAGCGGTGCGACGGGCGGACAAG 
TGC AAC AAC AGGT GCAGC AAGT GCA GC AGC AGGT CC AGC AGC AAC AGC AGC AGC AGCAAC AAC AAC AAC AAC AGCTGCAAC AT CACCAAATCAT A 
GACTCGT CGGGC AAC ATT ACAACAGCAACAACGAGT GCT CAAGCGGCAGCT GCCGC ACAGCAACAGGCGGCTGGCC AGC AGC AGC AACT GGTGGC 
ACAATCCGATGGCAGCGAGAGTGGAGCTCCGCTTTCAATTGCCCAGGTGCAGACGCTGCAGGGCCATCAGATCATAGGCAATTTAAATCAGGTGA 
ATATAACTGACTTTCAACAACAACAACCGCAGCAGCAACaACAGCAGCAGCAACAACAGCAGCAACAACAACAACAGCMCAGCAAACACAACAG 
ACGCTCTGATCGGACCGAAGTCTTAGATCCATCTTGTACTATTCTATACAATTATTAATTTGGGTTTTTGCTAAACGCTTTAAATATTCTCTTTG 

cggaatgcgtagacaaattctaaattagttttagaaagccagaaaggcatatttagttgtttaccgttttgttgtacaatcttatgactttaaat 
gttgatttacccctcctacaacttattttgttgctagacaaacgcttaagtgctctatggttattggaaattacgtgcatattttacatacttta 
taaatattatcattttgtatggttttacaaatgtcccaatgctagaggacctctagtcattttgaagtcctgcagatcagtaaacaccttaatcc 
aacaaggatcccttgctcacgtccagattttggccatgaacgaatgcaccttgatttaaaaaatattaacaaatacattaagtttaaaaatcctt 
gagttacaaagatttacttaagcagctcaaaagtaattgatctgatcaaaagagagttcgctcaacaaggtttctaaaaacccacccaccaacaa 
tcacttaagtaatcaatcaaccaatcaataaatcaactcgatatctctgctttcgtttttgtttaagaactaggcgttctatgtttaaattgaac 
caattttatatgtatattacttaattttaatagttgcaaatttttcctaaaatttagaccccgttgtactagttttggatgatgaaatatacata 
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TTCCTGGAAACTATGATTTAATTATATAAATTTGAACGAAAGGGTATATTTCATCTACCTAAACTGGACGAATTATTGTTAGTCTTCTTACACAC 

TTAAATAAAATCTAATTTTATTTAAAACCAATTACAAAAAATAAGTTTAATAAAACAAAACTTTTTCAAAC 

<SEQ ID NO: 1487} 

Start ATG: 636 

MSLPMNSLYSLTWGDYGTSLVSAIQLLRCHGDLVDCTLAAGGRSFPAHKIVLCAASPFLLDLLKNTPCKHPVVMLAGVNANDLEALLEFVYRGEV 

SVDHAQLPSLLQAAQCLNIQGLAPQTVTKDDYTTHSIQLQHMIPQHHDQDQLIATIATAPQQTVHAQWEDIHHQGQILQATTQTNAAGQQQTIV 

TTDAAKHDQAVIQAFLPARKRKPRVKKMSPTAPKISKVEGMDTIMGTPTSSHGSGSVQQVLGENGAEGQLLSSTPIIKSEGQKVETIVTMDPNNM 

IPVTSANAATGEITPAQGATGSSGGNTSGVLSTPKAKRAKHPPGTEKPRSRSQSEQPATCPICYAVIRQSRNLRRHLELRHFAKPGVKKEKKTTS 

GKKSSSGSSGSGSGALSSSGSVPQVQTVQSLHTLQGVQVKKDPDAQQQQQQQQQQQQQQQQQAMTVSGATGGQVQQQVQQVQQQVQQQQQQQQQQ 

QQQLQHHQIIDSSGNITTATTSAQAAAAAQQQAAGQQQQLVAQSDGSESGAPLSIAQVQTLQGHQIIGNLNQVNITDFQQQQPQQQQQQQQQQQQ 

QQQQQQQTQQTL* 

(SEQ ID NO: 1488) 

Name: Trithorax-like 
Classification : transcription_f actor 
Gene Symbol: Trl 
FlyBase ID: FBgn00132 63 



Celera Sequence No. : 142000013384405 

CCAGTCTGCAAATATCCATGGAATTATTTACTACGGGCATCATTATTAACCACCGCGGTTGCTACTCACGCGAGCTTGATGAGATTGGGGCTGGT 
CAGCAGCTGCGGACAACTGTGCCCATTGCTCACCGTCTCTATGAAGTGCTTAAGCTGATCCAACAGGTTCTGGCGCAGCATTTTGTCGTCCTAGG 
CATCGGGAACGCAGCACCGAAGTTGGCCAAGTGAGCAAATGCATCGCAGCAGCGGGTCGCTGCGTCTTTCACTCGGAATACCAAAACGACGGGGC 
ACAACTGCAGGCATTTGGAATCCGCCGCCACCTGGCCACACTAAGGCGGCAGAAAATTCGAAATTCTTTAATATCTGAAAATCATAAAATTAACA 
GAATAATAAATATCTTAGCACTCTAACGTGTCATTACATTCATTATAAACTAGCACCTGAGATTAGTTTAAGATGACACTTTACATTCAAGTTAA 
ATTAAGCTGGAAAAGATTTCTAATCGG71ATTATGTGTTTACTACAATGCTAACTTTCGAATCGAGGATGTAATTACTGAGTAATCAAACTGTTTG 
CTCTTTTTAATACTAAGTAAACGTACTTATAGCTCTATTTTCTAATTTTATAATATTTAACGTGTTTTTGTTTTAAAATATTATCGTAACAGTAG 
GGAGTGCGAAAAACGGTAACACCAAAACCCAGTTCGTGCATTATTGCACACTCTTTTGGGGCGAATCAAGAGCGAAAGACAACATGGCGTCGAAG 
CGGCGAATCTGTCTCGCAGTTTGTCGTGCTGTTTGCTTTTTTCTGCTCTGATTTGTACAAATATTCATTCAAGGTGCTCCAGTTTTGCCAAGTAT 
ATGTTAATAGAGTACCATAACAATGGTGGCGTCACAGGGGTGCTAGAGTTCGATATCGATAACAGCTCACATTCTCGTCATATGCAACTATCGCT 
GTACTCTCTGGCCGAGTATCGATACGCAGATATTATCGTCGTGGTCAGACCAACGCTCTAGCGTCGGTCCGCGGAAAATAGCGAAAATTACAATT 
AT C C GGGAAG AT AT AT T GGT C G AAAAC T AACT C GT GAT C C C CC AAT AAAT AT GC G C AAG AG AAAATT GT AT AT AAAC T G A TC G AATGC AGC CAG C 
GTGCTAAGCACTAAAAGCGAAAAACGCCAGAAACAGAAACAAGGAATTGCAGCGTGAGTGTATAACTTTTTGGTTTTTGCCCCCGCGCTCTTTCT 
CTCTCTTTGCTGCTCTCTCTCTCCCGCTCTCTTTTGCCCTCTTGCGGCTTCTGTGTTGTCTGCGTGCTAGTGTGTGTGTGTTTGAATTTTACTGT 
GAATGTGACCGACGGTGTGTTCTCACGTACTCCGTGGATGATTTGCGGCCTTGCCAGGGAATCGGACTATATATCCCGGCAATAGCCCATATTGG 
ATCCCGATTTTGAATACAACTCGGCTGATTTGCTTCTTTCAGTGTCTCCCCTTTTATCGCACTTTATGCAACAACGCATACCGTCGTGTTGAATT 
TCCTCGCTCTCCGCTGTGTGTGGTTTGTAGGTGTGCGTGTGTTCGAGGTGAGTTAGCCGTCGTCTGTTTTTCTTTTCCCTGCGTTTTGTGTGCGT 
TTCAATGAAATGCAAAATCGCCAGCGAATATCGCGGAAATGCGCTGCTCCTAAATATTATGTATACATAAATAAAGTATCGAATCAAAACAAAAT 
CGGCTTGCAACAACAACAAAAACATTGCAGAGTGTGGGTGGCGCGTGTAGCGATCAAAATTGCGTGCATGTGTAATGTGTGTGGTGGAGAGACGG 
CATTTTTCTTTACCACACGCCTCTTCTTCTTTCCCACAATACACTGCAATTTTTGCGATTCTGTTTTGCGCCTGCCATTCGCAATAATTTTATTG 
AATGAACCGCAAAAGAGACGGCCCAAAAAGCGATGTACTTGCCAGACGTTGTGTCTGTGTGCGTCCAAGTGTCTGTAGCGCGTTGTGTGTGTATC 
GCGTATGAAAATACATCMCTGTTGGGCATCAACAAAAAAACGGAATCAAAAATTTATAATAAAAAATGAAAGGAAATTAGCTGACTTTGTAAT^ 
GCAATCGGAGAGTGTTTACTTGCTAAATTTTCGAGTTCTCCGCCCGTCTCTTTCCCTTCATCCCTCCTGCCTTCACCGCCAATACCACCCCAAAA 
AATTGCATACATGCACACACGCCTTCGGTTGTGCGAGTGGTTTAGTGTGTGAGCGCGCGCGTATTCTGCACACATTGTCATGCATGCAAAAAACA 
AGATACGACGCGTTCGCAATCGTTAGATTCCGTCTCCCCCAATTCATGGCGTTTCATTGCTGGGGGCGAAAGTAAGAAGATAACGAAAGGGATAT 
GCGAACCACTTCCTAATGAAGTCCTGCATCCTTAGCTTTTGGATTCCCACCTCTTCGCCTTCTCCTTCTTCACACGGACACTTGCAGCGTTCGCG 
CCCATGCGTGTGTATGTGTGCGAAGAACAGGAAGAGAGCAGCGAGACTCCAATTATCCATTTACAGTTGCGGTTGAAATGATGGAGCATAGTTGT 
ATTATTTGTATTCACACAATTACCGGTGGTATTTAAAGTCTTAACAATATTTATTAATTTCCTGCTACCTTCCTAAATTTTATTCATAGTTTTAA 
TACTGTACCCAAGTACCTTAAGTTCCCTAATCAGATGTAAACTTGTTGATAGGCATAATTCCTATCTTAGTTTGTATTTTTGTCTACTATAACCT 
AAACTTAATTCTACTACGAATTTAATTACAACATACTAAGACTGTAATCTACAAACCAAACGAAAATGCCTTGAATAAAAGAGGTCTAGTGATTC 
ATAATACGATTTGCACACGTGTTATTAACTAGGTAGTAGCGAAATAGAGTCAAAAAGGCCACCGACTTGTTCACAACTGTATCTCTTACTAACTT 
CCCTATAAAAACTAATTGTTGCTGGGGCGAAAAAATACTGCCAAGTTGTCATATTTGCCGGACTTTATTGATGACTCGACTCCAGAGCTCTATAT 
TTCGCACACAGATTTTCTCTCGGCACTTTTCGCAACCTTCTGCGGCCGCTGGTTATTAATAAAAATTAATAATAAACGAAATCTGTGGCGCTGTG 
CGGCACTTGGACACACTTATGTACACATGAGCATTCTCGTGTTTTTCGCAGCTGATTGCGTAAACGAAACCGCTCGCTACGTTTGTTTTGAATTG 
CTACTTTTGTTTTGGTACCGATTCAGACTTGGAATCCGATTCAGTTACAGATTGAGATTGAGATTGAGATACTCAGGCTCTCCGCCTGGGCTTCA 
ATTCTCCTGCGGGCGTTATCAGTGGCCTATGGCGCTTTATTGGAATTACCATGGAAGTGGGAACTTAATCGCTGTGCAAATCGCTTCGAAACAGC 
GCAGCACGCATTTTCCCCGACGTCTTGCAACCTTGGGCGAGTGGCACTGCGGGCGGTGGGTGGTGAGCGGTTGGGTGATCTAGAGTGACGTAAAT 
ATTTGTGGTGGACAGTCAACATTGAGGTGGCGCTCTTCAACGATGACGTAGCTTTTGCCACTTCTGTTTACTCGCACATTATTTCACACACTTCT 
CCATTGATTTTCGCCGTGCAGATAACGAAATAGTGGACTGATTGTTAACTGTGTATAAATGAAAATATGGGTATGATATGTGTTTATTTACACTT 
TTTTTCTGTGTACGAGAATGGTTAGATTACTCCCCTTCCTCGTTTTGAGTCACCCGCTCTCTGGCCCTCGCCGCTGTTTGTTTTCTTCGCGCGCG 
TGTTTGCTCTCGCCGTGGGGCGAAAGCGCAGCGATAATAGTGGCTCAGATATAGTGTATACTATATGTATAGTAAACCTACTCATACCGAAAACC 
CCATCGCACAGATACGGATACATTTACAGATACACGCGCGCACTGCGGAAGGCCTAGATCTAATGGAAAACGCTAAATGTTTGTAAACAGACCGG 
GCACATTTCTTTTCTTTTCTTAATTATTTTTTGCATTTCGCATTTTAGTTGGCATGGTCTTCTCACGGCTTCTCTCTCTTCACACATCTTTCTTC 
CGAAAATACACTCACACACACGCACACCGAACGTAGCTTTCACATTTTCATTCATCGCAATCTTCGCCATGAAGTGTTGCAAAGGCCCCAAGATA 
AGATATAGGATGCACTTGGATACCTTAACCTCACCTTTTTGCGACAATAGCACACGAATAATCCAATTTTTTGTATGACGTTGTTGGGCGAACAA 
TTTTATAAGTCTTGGAAAGGAATTTCTGTGCTGTGTTGATACACAAAACAACAAAATAATAAAAAATTATAAATGGTATTGAACTACTGTACTAG 
TGAATAGTTTGAACCAGACTTTAAATTAAAGGCAATTATGATGTTGAAAATATTTAAGTACCTTTATGCATGCATATAGTTAAAATACAATTTTT 
T A GA T AAC AAA CAAC AT AT GC GT T TAG T GT C AT T AAAAAAGGG AT T AT A AAAC C AT AAA AAAAGG AT TT CG AAAAAGT AGTG T T T T T T AA T A TT G 
TATGATCTTTGAACGTTTTTTTTTTTAATTGTTTTATTTTCCTAACTTGCAGTGCAGTCTCGTCGTGCGCCCATCCCAATCCCATCAGACCCCAC 
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CCGGCGTTGGAGTGCAAGAATAACATGGATATAGAAGTCAAGCATAAAGATTAGCCGAATCCTTTGAATTTTAAATTCAAGATCCTAATTTAAAT 
TAGCTAAGTAGCATAGCCTAGCCAATAGTCCCTAGCCAGAGGAACGGTTCCAAGCCGCAAGCACTCGCGTTCATAGTCCCGATCGAAATTCAAAC 
GAATCCCAAGGGCACCAACAGCGTGTAGAACCGCAGAAGCACAGCAAGGAGGAACATCATCATGACGACTGACACCCGCCGACGCGTCAAGCTGT 
ACGCGCTGAATGCGGAACGGCAATGGGATGATCGTGGCACTGGCCACGTGTCATCCACCTATGTGGAAAGACTAAAAGGTGGGTTTACGAAATTA 
ATGAAGCCTTGCGAAGTGATTTAAGTTTCCTATTGCTCACTAACAACTTACAGGCATCTCGCTTTTGGTGCGAGCCGAATCCGATGGATCTCTAC 
TTTTGGAGAGCAAGATACAACCGGACACCGCATACCAGAAGCAACAGGATACGCTGATCGTGTGGTCCGAGGGGGATAACTTTGATCTGGCCCTG 
AGTTTCCAGGAAAAGGCGGGCTGCGACGAAATATGGGAAAAGATATGTCAGGTAAGCGTCGTTGAATAAGCATATGCAGTTATAAATGTGCTTAT 
CTATATATCAATGAACTTAAACAATTTCTATATGGTTTTCACATCAATTGCCAACTAAAACTTTTGCAGGTGCAAGGCAAGGATCCATCTGTAGA 
AATAACTCAAGACATCGTCGAAGAGTCTGAGGATGAGCGGTTTGAGGACATGTCGGATACTGCGCCACCCATTGAGCTACCACCCTGCGAGCTTT 
CCCGGCTGGAGGACATCTCGGAAACGATACAAAGCTGTCTCTCCACGCCGCTGCGAAAGGAAAAGTTGTCGATGGCTCTGGAGTCGGAGAGCTAT 
ATCAAGAAGCTGCTGAACTTGTTCCACGTGTGTGAGGACCTGGACAATACCGAGGGCCTGCATCACCTTTTTGAGATCTTCAAGAACATCTTCTT 
GCTAAATAAGAACGCGCTCTTTGAGATTATGTTTGCGGATGATACCATCTTTGATGTGGTCGGGTGCCTGGAGTACGATCCGAGTGTGTCTCAGC 
CCAAAAAGCATCGACAGTATCTGAAGCAGTTAGCCAAGTTCCGCGAGGCAGTGCCGATTAAAAACCTGGACCTGCTGGCTAAGATTCACCAGACG 
TTCCGGGTGCAGTACATCCAGGACATTATCCTGCCCACACCATCGGTCTTTGTCGAAGACAACATGCTGAACACCCTGTCCAGCTTCATCTTCTT 
CAACAAGGTGGAGATTGTCACGATGATCCAGGACGACGAGCGGTATCTCCTGGATGTGTTTGCTGTTCTCACCGATCCCACCACCGGAGACGCGA 
AACGCCGCGACACAGTTCTCTTCCTCAAGGAGTTCTGCAACTATGCCCAGAACCTGCAGCCGCAGGGCAAGGACTCGTTCTACAAAACTCTCACC 
TGCCTAGGCATTCTGCAGGCGCTGGAACTCACTCTCGTGATGAACGACAAGAAGACCAAATCGGCCTCCATCGACATCCTCACGGCCATCGTAGA 
GTTCTCGCCTCTGGTAGTGCGCAATTACACGCTCAACCAGGCCAATCGGCCAGAAGTGGTAAGTTAACCTGGAATATGAAGATAGTCTAGATATG 
ACTATGATTTACATGATTTCTTTTTTTTTTCACACCAGGAGCGTATGCTACTGAACATTGCCATCGAACAAATGCTGAATGACTCGGAGCCGGAG 
CTTGGAATTGCGGTGCAGCTAATGGGTATCGTCAAGATTCTGTTGGAGCCGGAAAACATGCTCACCGAGAAGGGCGATTTTCTCAACTTCTTTTA 
CAAATACAGTGTGCAAACGTTAGTGGGTAAGTGCGAGTACTCTATACCAGCATATTCTAGAGACTAACTGCTTAATGGGTTATAGCCCCGGTGAT 
ACTTAACACGATAGGCGACCGTCCGCAAAACGAGGACTATCAGACGGCCCAGCTGCTGGGAATCGTCCTGGACATTCTTTCGTTCTGCGTGGAAC 
ACCACAGTTATCACATTAAAAACTTTTTGTTGCAAAAGGATCTCCTCAAGCGAATACTGGTGCTAATGAAGAGCACACATACGTTCCTTGTCCTC 
GGCGCTCTGCGATTACTGCGCAAGATAATTGCACTTAAAGATGAGTTTTACAATAGGTAAGCGTCCCCAAAAACTCTTATGGTTAAACAATATAA 
CCAACTTACCTTACTTACAGACACATTGTCAAGTGCAATCTGTTTGCGCCGGTGGTAGATGCCTTCATTCGCAACAACGGCCGATACAACCTTCT 
GGAGTCAGCAATTCTGGAGCTGTTTGAGTTTATAAAACTCGAGGATATCCGCACTCTGTGCGTTTACTTCGTGGAGAACTTCAGCAAGATATTTG 
ATGAAATCGAATACGTGCAGACATTTAAGTACTTGAAGAATCGCTACGATCAGTACCAGGATCGGCTTAAGGACCGCGACAAGATGGAAAACCGA 
ACTGACGGGTAAGTTTTGCTGCAGCGCCTTGTGCTACAACACAATGTGAAAATATTTGGCTGTATTATCTTAAAGTGGCCTGCCCATCATCCGTA 
GCGGTGGCCGGTTTCGTCGGGATCAGCGGCAAATGGAGGAGGAAGAGGAAATGTGGTTTAACGAGGAGGACGATTTCACCGAGGAAATCGACACA 
TAC AAC AATGT C ATG AAATGT TAGTAC T AAAT T AAT AT CGTTTTAGCAT T ACC AAAC C AAAT AT AT T GT GGC G AAT AT CT AAGC AAAAAAGCT AA 
TATTCAAAAACATTACATTATTATTTATGTACCATTGATGAATTGTAAATATATTCCGATCTCGACTTGCAGCCGTCAGCGAAAAGAATGGCCCG 
CAAACGCAAAACCAGCAGAAATCCTCTCCGCCACACAGCACGTCGCCGCACAGTGGATTGTTGGGAAGCCTGAGCACCACCGCGTCATCCACTGC 
CACATCCGCCACGTCAGGTGCACCGGTGGCCAGCGGCAGCAGTAGCCCCGAGGCCATCTCCGCCGACGAGCAGACGCAGGCGGCGGTGCATTTGG 
CTGCCGCTGCCCTTCAGCACCACCAGCAACAGCAACAGCAGCAGCAGCAGAACCCATTCCAGCAACAGACACAGCCCGAGATCGCGGAGCTACAA 
CAGCAGCTGAGCAGCGTGGAAGCGCCTCAAAGCCAGGAGTTGGAGCTGTCACAATCGGCCGCCGCCAGTGCATCGCCCACTTCGTCATCCTCGTC 
TCTGGAAGCCTCGACGTCCTCTTCATCGGCGTCATCGTCTTCATCCTCGTCGTCCTCGTCCTCGCCGCCCGGCTCATCGGCCGCAGCATCGCTAT 
GCGATTCGGCGACAGTAGCCGCCGTGGCAGCATCACAGTTTCTCTCAACGATCGCCACAGCCATGGCGGCCTCTGTGACGGCGGCGGCGGCAACA 
AACAGCTCCCCGAGCATTTCGCCAGCTCCAGCTGTGTCCAGTCCGGATATCGAGAACGCCGACGCCCAGTTGCCGCCCAGCGATGACGCCAGCAG 
CCCGGCCAGTGGGGAGCAAGACGCTAACAGCACGGAAGGTACCAGCAGTGAGGCCGACAAGACGACGGCCAAAAAGGTAAGTTGTGGAACCCATA 
AAATGGGAATGTCATAGTGTTTTCTGTAGTATTACTTGTGAATTGTGGATTTTATCTTCAAAATGCGTGCTTAAATATTTCTATAATGAATCAAT 
GTTTAAGTTGTAAATATAGGAGCATTGGCAAATATTTAAATAAAAAGCTAAAATTCCCTCACAAACTTGAAGCAACAATATGCCTGCATGTATGG 
GTATACAATTACCTTCTATTCTGTGACGATTAAAGTGCACGACTATTATAAGAAGTTTAAGTAGCGTTACGTATGAATAACAAATGTGTGGATGC 
ATTAATTATATTATTTTCGTTACCATAAACAGGGTCTGGTGGACTATGAGAGTGATTCTGGCGAAGATGACTACGAGGAGGATGAGTACTCCGAG 
G GGC C AC AAG C ACAAAA GC GC GC GC GT C A GGC AT AGT T GGC C GAT G AC GAGC AC AGC GGC AT C AAT TAT AAAT TAGTAC AC AC AT AAC AG AAT AC 
AAGAAACCACGGCAAACAACAAGCAACAAACAACACACAGACCCCACACACCAACACCAACGAATGTTAGGATTGCATTGGCGGCAACGACATAC 
AGCACAGTAGGCAAGCAAGCAAATAGCCGTACGATGTAAGATCGAACAAGAAAGCATAAATTTAAATTTTAGCATAAAATTTATTTTAAAGGAGA 
AGCAAAAACACCTAAAGTAAACGAGAAATTATAAATCTTACTTTACTAAATTAAATAACTACATATAGATATATATATATATATTCAAATATATA 
TGCATATGAAAACACAGATAACTATAACTGATCAGCATCCACAAGCAGAGATGTAGTTGAGTGCGTGTTGTTGGAAAGTTGGCAAAGCAAAGCAT 
TAACGAAACAAACCAAAATAAATTGTATTTAGTAGTTAAATCCCCCTCTCTCCAAACTGATATGAACAGCAGCAGCAGCGCCAGCGCAATGGAGG 
TGTGGTCAGGTGGTGAGGGAGATTTGTATACAGCCAGCGCCAGCACATCCATTTGCAATGTCTATCGTATGAGATCGTATGAGTTAACAGATCCT 
GTGAAGTTGGCTGCTGGCCCAGAATGTTTGTGCAATACTTTCGTGTCGATATTTAAATAACAATTGCAAAAGGGTGTATTTAGTTCTAAAGCCTA 
GACTTAACCATTTTTACGATCCTTCTCACCATTGAGATGGATTTTCTGCCAATTGTACTATCACTATCACGATACAGCGCTGTTTTCACTTGACA 
ATTTCATTAAATAATTCTTATCGCGAAAGCATTTTATGAAAGTTTCTATTGCAAGCAGTGCTGTAAATAGATGTTTTTAATTTTAAAATTTAAAC 
TATTCCCACAGCAGTAAAATGTTTTTTTTTCCATTAGGATTGCTTCTGTTTTTTGTGTAATTGCGATTGTTTTCTGAATATCAACCAGATTGCCA 
GGGCCAACTTGACGAAGTTTGTTAGCCGGCCCCTTGAGTTAAATGCAGATAACACTTGAACCCAATGAACCTTGTATATAATTTAGTCACTCTCA 
ATGCGCGATTAGTTGATAACTGCCCTTGCACCACTGACGCCGACATTAAACGCATATATTTGCACTTGGAGATATCGAAATCTTTGCACATTATC 
GTGCTGTTCGAAATCCAGAGTCTTGAGATGTGTTGTCTGTGTAATTGACTGTGTAATTACAAACAATTGTTGCGCTATCCTTGCATTATTGAAAC 
ACACAAACAAAAACACAACACACATCAATAAATTATTAATGATAATGATAACAAATAGGACGTAAAGTTAAACTAATGTGGGGCAAACAGACATC 
AGCAAGCAGAGATAACAATAAAATAAAGTCTTAAGTAAAATAGTCTTCTTGAGTTTTCGAGTAAGTTTTTTTCGTATTTAACTATAATATTAATT 
TGTATTTAACGAATGT^pTAGGGCTCACTTGGAACCAATCCAATCCAATTCAATCCACACTCATA 
(SEQ ID NO: 1489) 
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Exon: 7578. .8341 
Exon: 8 678.. 9232 
Exon: 9291. .9325 
Start ATG: 4907 

Transcript No. : CT2 6561 

CAACGCTCTAGCGTCGGTCCGCGG^^TAGCGAAAATTACAATTATCCGGGAAGATATATTGGTCGAAAACTAACTCGTGATCCCCCAATAAAT 
ATGCGCAAGAGAAAATTGTATATAAACTGATCGAATGCAGCCAGCGTGCTAAGCACTAAAAGCGAAAAACGCCAGAAACAGAAACAAGGAATTGC 
AGCTGCAGTCTCGTCGTGCGCCCATCCCAATCCCATCAGACCCCACCCGGCGTTGGAGTGCAAGAATAACATGGATATAGAAGTCAAGCATAAAG 
ATTAGCCGAATCCTTTGAATTTTAAATTCAAGATCCTAATTTAAATTAGCTAAGTAGCATAGCCTAGCCAATAGTCCCTAGCCAGAGGAACGGTT 
CCAAGCCGCAAGCACTCGCGTTCATAGTCCCGATCGAAATTCAAACGAATCCCAAGGGCACCAACAGCGTGTAGAACCGCAGAAGCACAGCAAGG 
AGGAACATCATCATGACGACTGACACCCGCCGACGCGTCAAGCTGTACGCGCTGAATGCGGAACGGCAATGGGATGATCGTGGCACTGGCCACGT 
GTCATCCACCTATGTGGAAAGACTAAAAGGCATCTCGCTTTTGGTGCGAGCCGAATCCGATGGATCTCTACTTTTGGAGAGCAAGATACAACCGG 
ACACCGCATACCAGAAGCAACAGGATACGCTGATCGTGTGGTCCGAGGGGGATAACTTTGATCTGGCCCTGAGTTTCCAGGAAAAGGCGGGCTGC 
GACGAAATATGGGAAAAGATATGTCAGGTGCAAGGCAAGGATCCATCTGTAGAAATAACTCAAGACATCGTCGAAGAGTCTGAGGATGAGCGGTT 
TGAGGACATCTGTCTCTCCACGCCGCTGCGAAAGGAAAAGTTGTCGATGGCTCTGGAGTCGGAGAGCTATATCAAGAAGCTGCTGAACTTGTTCC 
AC GT GTGT G AGG AC CT GG AC AAT ACCGAGGGCCTGCAT C AC CT T T T T G AG AT CT T C AAG AAC AT CTT C TT GCT AAAT AAG AAC G C GC T CT T TG AG 
ATTATGTTTGCGGATGATACCATCTTTGATGTGGTCGGGTGCCTGGAGTACGATCCGAGTGTGTCTCAGCCCAAAAAGCATCGACAGTATCTGAA 
GCAGTTAGCCAAGTTCCGCGAGGCAGTGCCGATTAAAAACCTGGACCTGCTGGCTAAGATTCACCAGACGTTCCGGGTGCAGTACATCCAGGACA 
TTATCCTGCCCACACCATCGGTCTTTGTCGAAGACAACATGCTGAACACCCTGTCCAGCTTCATCTTCTTCAACAAGGTGGAGATTGTCACGATG 
ATCCAGGACGACGAGCGGTATCTCCTGGATGTGTTTGCTGTTCTCACCGATCCCACCACCGGAGACGCGAAACGCCGCGACACAGTTCTCTTCCT 
CAAGGAGTTCTGCAACTATGCCCAGAACCTGCAGCCGCAGGGCAAGGACTCGTTCTACAAAACTCTCACCTGCCTAGGCATTCTGCAGGCGCTGG 
AACTCACTCTCGTGATGAACGACAAGAAGACCAAATCGGCCTCCATCGACATCCTCACGGCCATCGTAGAGTTCTCGCCTCTGGTAGTGCGCAAT 
T ACACGCTCAACCAGGCCAATCGGCCAGAAGTGGAGCGTAT GCT ACTGAACATTGCCATCGAAC AAAT GCT GAATGACTCGGAGCCGGAGCTTGG 
AATTGCGGTGCAGCTAATGGGTATCGTCAAGATTCTGTTGGAGCCGGAAAACATGCTCACCGAGAAGGGCGATTTTCTCAACTTCTTTTACAAAT 
ACAGTGTGCAAACGTTAGTGGCCCCGGTGATACTTAACACGATAGGCGACCGTCCGCAAAACGAGGACTATCAGACGGCCCAGCTGCTGGGAATC 
GTCCTGGACATTCTTTCGTTCTGCGTGGAACACCACAGTTATCACATTAAAAACTTTTTGTTGCAAAAGGATCTCCTCAAGCGAATACTGGTGCT 
AATGAAGAGCACACATACGTTCCTTGTCCTCGGCGCTCTGCGATTACTGCGCAAGATAATTGCACTTAAAGATGAGTTTTACAATAGACACATTG 
TCAAGTGCAATCTGTTTGCGCCGGTGGTAGATGCCTTCATTCGCAACAACGGCCGATACAACCTTCTGGAGTCAGCAATTCTGGAGCTGTTTGAG 
TTTATAAAACTCGAGGATATCCGCACTCTGTGCGTTTACTTCGTGGAGAACTTCAGCAAGATATTTGATGAAATCGAATACGTGCAGACATTTAA 
GTACTTGAAGAATCGCTACGATCAGTACCAGGATCGGCTTAAGGACCGCGACAAGATGGAAAACCGAACTGACGGTGGCCTGCCCATCATCCGTA 
GCGGTGGCCGGTTTCGTCGGGATCAGCGGCAAATGGAGGAGGAAGAGGAAATGTGGTTTAACGAGGAGGACGATTTCACCGAGGAAATCGACACA 
TACAACAATGTCATGAAATCCGTCAGCGAAAAGAATGGCCCGCAAACGCAAAACCAGCAGAAATCCTCTCCGCCACACAGCACGTCGCCGCACAG 
TGGATTGTTGGGAAGCCTGAGCACCACCGCGTCATCCACTGCCACATCCGCCACGTCAGGTGCACCGGTGGCCAGCGGCAGCAGTAGCCCCGAGG 
CCATCTCCGCCGACGAGCAGACGCAGGCGGCGGTGCATTTGGCTGCCGCTGCCCTTCAGCACCACCAGCAACAGCAACAGCAGCAGCAGCAGAAC 
CCATTCCAGCAACAGACACAGCCCGAGATCGCGGAGCTACAACAGCAGCTGAGCAGCGTGGAAGCGCCTCAAAGCCAGGAGTTGGAGCTGTCACA 
ATCGGCCGCCGCCAGTGCATCGCCCACTTCGTCATCCTCGTCTCTGGAAGCCTCGACGTCCTCTTCATCGGCGTCATCGTCTTCATCCTCGTCGT 
CCTCGTCCTCGCCGCCCGGCTCATCGGCCGCAGCATCGCTATGCGATTCGGCGACAGTAGCCGCCGTGGCAGCATCACAGTTTCTCTCAACGATC 
GCCACAGCCATGGCGGCCTCTGTGACGGCGGCGGCGGCAACAAACAGCTCCCCGAGCATTTCGCCAGCTCCAGCTGTGTCCAGTCCGGATATCGA 
GAACGCCGACGCCCAGTTGCCGCCCAGCGATGACGCCAGCAGCCCGGCCAGTGGGGAGCAAGACGCTAACAGCACGGAAGGTACCAGCAGTGAGG 
CCGACAAGACGACGGCCAAAAAGGGTCTGGTGGACTATGAGAGTGATTCTGGCGAAGATGACTACGAGGAGGATGAGTACTCCGAGGGGCCACAA 
GCACAAAAGCGCGCGCGTCAGGCATAGTTGGCCGATGACGAGCACAGCGGCATCAATTATAAATTAGTACACACATAACAGAATACAAGAAACCA 
CGGCAAACAACAAGCAACAAACAACACACAGACCCCACACACCAACACCAACGAATGTTAGGATTGCATTGGCGGCAACGACATACAGCACAGTA 
GGCAAGCAAGCAAATAGCCGTACGATGTAAGATCGAACAAGAAAGCATAAATTTAAATTTTAGCATAAAATTTATTTTAAAGGAGAAGCAAAAAC 
ACC T AAAGT AAACG AG AAAT TAT AAATC T T AC TT TACT AAAT T AAAT AACT AC AT AT AG AT AT AT AT AT AT AT ATT C AAATAT AT ATGC AT ATG A 
AAACACAGATAACTATAACTGATCAGCATCCACAAGCAGAGATGTAGTTGAGTGCGTGTTGTTGGAAAGTTGGCAAAGCAAAGCATTAACGAAAC 
AAACCAAACAGCGCCAGCGCAATGGAGGTGTGGTCAGGTGGTG 
(SEQ ID NO: 1490) 

Start ATG: 488 

MTTDTRRRVKLYALNAERQWDDRGTGHVSSTYVERLKGISLLVRAESDGSLLLESKIQPDTAYQKQQDTLIVWSEGDNFDLALSFQEKAGCDEIW 
EKICQVQGKDPSVEITQDIVEESEDERFEDICLSTPLRKEKLSMALESESYIKKLLNLFHVCEDLDNTEGLHHLFEIFKNIFLLNKKALFEIMFA 
DDTIFDVVGCLEYDPSVSQPKKHRQYLKQLAKFREAVPIKNLDLLAKIHQTFRVQYIQDIILPTPSVFVEDWMLNTLSSFIFFNKVEIVTMXQDD 
ERYLLDVFAVLTDPTTGDAKRRDTVLFLKEFCNYAQNLQPQGKDSFYKTLTCLGILQALELTLVMNDKKTKSASIDILTAIVEFSPLVVRNYTLN 
QANRPEVERMLLKIAIEQMLNDSEPELGIAVQLMGIVKILLEPENMLTEKGDFLNFFYKYSVQTLVAPVILNTIGDRPQNEDYQTAQLLGIVLDI 
LSFCVEHHSYHIKNFLLQKDLLKRILVLMKSTHTFLVLGALRLLRKIIALKDEFYNRHIVKCKLFAPVVDAFIRNNGRYNLLESAILELFEFIKL 
EDIRTLCVYFVENFSKIFDEIEYVQTFKYLKNRYDQYQDRLKDRDKMENRTDGGLPIIRSGGRFRRDQRQMEEEEEMWFNEEDDFTEEIDTYHNV 
MKSVSEKNGPQTQNQQKSSPPHSTSPHSGLLGSLSTTASSTATSATSGAPVASGSSSPEAISADEQTQAAVHLAAAALQHHQQQQQQQQQNPFQQ 
QTQPEIAELQQQLSSVEAPQSQELELSQSAAASASPTSSSSSLEASTSSSSASSSSSSSSSSSPPGSSAAASLCDSATVAAVAASQFLSTIATAM 
AASVTAAAATNSSPSISPAPAVSSPDIENADAQLPPSDDASSPASGEQDANSTEGTSSEADKTTAKKGLVDYESDSGEDDYEEDEYSEGPQAQKR 
ARQA* 

(SEQ ID NO: 1491) 
Classification: hypothetical 



Celera Sequence No. : 142000013384405 

AACGCCTCTCTCTGCAGGAAATACGCCACGACACGTAAGTAGGAATATTCCTTGATTGCACTTCATGATAACTGATTGTTATAATCTGCTTTTAG 
TTGGTTCCGATCCGCACCGGTTAAGACCGGCCCACCGATACCCATTCCCCCCCTGAAGGGTGACAAATACCGCAACTCCACGGTGATACCTTACT 
TGGAAGCTTACCACTACGGCACCCAGGAGGAGGATGTCTACTTTACAGAACACGACGTAAATCGTACGTACCAACATCAATCTAATGCCGAGTGA 
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ACTACTTTCATGGTTACCTTTACAGAGGAGCTCGCCCGCCAAGCGGCAGCTGCTGCCTCCGAAATTCGGGCCAAACAGTCGGCGGCAGCCCTAGC 

CGCCTGCCACACCTACGAACCGCCCTCCACAAGTGCCGCAGCCGCCAGCAATTCGCTGGGCAACGGTAGCAGAGAGGAGGCGCCCGTCAAGAAGA 

AGGGATCGGCACTGAAGAGGCGCGCCAAGAAGCTGACGTCCTGCATCTCCGTGCGCAAGCTGAGCCACTGCCGAACTTCGTAGGCGCCTTAGATG 

CCTCCACTAAACATAGATGCTAAGATTTATCGAAAGGAATAACTTTGTTAACAAATTGTGAGCAACTTTTTGGTTTTGTTTTCCCGTTTTTATTC 

TGTTGTCCTACATTCGCCAAGTATTTGTAACCATAGAATCATGTAGGAAATTCAGGCGCAACCTCGTACCGCCCGAGCAGCAGGAGCAGCAGCGG 

TCGCATCAACAGCAGCAACAGCCCGCATCAGCGGAGTTGGCTGTGGAATTGGGGAGCTATGGGAGCGACACGAAAGCTATATATCCATGTGACAA 

TACTGTGTATTATATGAAACAAATGAATGCGTAAATTAGTCGTTAATATATTATGATGTACTGAAGGCAGCCGAGGATTGCCGGAGGAGACGGTG 

CCCATTGGTGCGTCGCACACATAAAGAAATATGTTCAAGTGAAATTGTTTGTTGGCTTCGTTATATTCTAAACTACACACGACGAATGATGCGAT 

TTTCAGGCGCCGTCGGTACAATAACTGCTGGACAGTATTTGCAAGGCCTAGGTTGAGATTTTAATTTTTACAAACCAATGAAACGAAAATAAAAG 

AACCCATTGAATATCAAAATACAAACACTGCATTTAGTTCGCACATTTCGCACATTCCAAATTCCACACTCAACACTCGCAGACGCTCCATTATT 

CCATGCACACACGCTCATGCTCACACTACAACATTAGGCTAGCTAACTACGTTTGCTTAATCTATTTC71ACGATCTGCAGCCGGGCCGTCGGATC 

TGGCTCTGAGTCGCCCTCCATCAGCTCGCCCATGATGAACATCTGATGGTAGTGGGGGCCCAGGGTGGACAGCTTACGCTGCATTAGGTGGCGCA 

ACGGGTCCACATATTGCGGCGGCACCTTGGCCAGCTGTGTATTCGTGTTGCGCCAGTGCAGCCCGTTGTTGTCCACGTGAAACAGC GCCGGCGAG 

GGCACGTGATCTTTAAGATAGTTCCAGATGCACTCGCGCAGCAGCAGGGCAATCTGGAAATGTCATACATTAGTTGGTGCTACTACACAGTTTGG 

GCAAATAATGGATCTACCTCTGCCGATTCAAAGTGCTCGATGATGAGATGCAGATTGGTGAGTGGAGGTGGAATTAGTTGGCTCATAGCCAGCAG 

T T GC AGC ATC T TC T TGGTG CAC A GC AT C AGC T CAT C GT T C GC AGC AGTGGGC T GG AAT C GAT T G AC T GC AAACG T AT AAT AT G AT T AGGC AC CT T 

T AT AGAAAAT AAGC AAGT AAAGT G AC AAGCT C AC C ATT CT T GC C CC GCC AGT GAT GC AAGC AG GT G GC G C ACAGTGCGCGCAGT AGCAC GGAGG A 

ATAGCAAAGCGCCGGCCTATGACCGGCGATCAGTCCCAAAACAGCCCACAGCACTGGGGAGTTGATGAACGCTCTCCTGATTAGCATGTCCCGCT 

CCATTGTGACCCTGGTGAACTCCTCATCCGGAAAGGGCAGGCCGTTGTACATGACATCCGTGGAGACCAGCTCGACCAGCAGCAGCGAGACCAGC 

GAGTAGCCCTCCACGTTGTTGGGCTTCTCCAGATCGGCGCAGCATGCATTCAATGCCTTAAGATACAGCGCCTGCTTCTCCGCATCGCAGTTCCG 

CTCGCTGGGGGGCAGCACTGGTCGCAGACCTCTGCCTATAATACCGGAATGCAGCACGGAGCGATTCGAGTTGGCGCCAGAGCTGGTGTGCAGAT 

TCTGCAGGATGAGGTGTTCGTGCTCGGGTATCTCAAGCTCATCGAGCTGATACACCTCCAGCTGACCAAGCAAGTGACCGTGATAAAAGAGCGCG 

CGCTCCACCAGCTCGCGCAGGCAGACGGTTCTCGCGATTTTCCGGGCGGAACACTGCCTCTGCAACAGCGTGTAGGAGCGATGCGAAGCCGCTGC 

TCGACGGATCACAGCTAGAATGAGATATGGTTAATATATTGAGGCAAGGCTAAGGAACAACTATTGCGCTTACTCTTCTCGGTTAAGCCCACAAA 

GAAGTACTTTACGCTTAGCGTGGCCAGCTTGAGGACTAATGGAGTGCGCAGGACAGTTGTCCGGCTGCCGGCTTCAATGTTGTTCAACAGATCCA 

CAATGAGATGGGCCAAGCGCATGCGGGTAGCGTCCTCATCATCCACCTTATCTTCTCCCGGCTCCAATTTTGTGCGCTTTCCGTAATCCTTGCCA 

TTGCACAATTCATTGTTGTTATTGCTATTTTTACTTTGCTGCGCCCGGAATTTATCCGCTATCCGCTCGGCGGCATCCCTGTCTATCCTCAGGAC 

CAGTTTGTTAACGGTGCAGTCGAAGATTTGAAGGATTTTGGGAAGACATTCGTTGAGGGCTTGGTTCAAATGCTGCGCTTTGATGTACGGAGCCA 

CATTCTTGCTCTTGTGCATCCTGATGGAGATTATGAATTTGAAACACTAATTATATGTATCTAACAATAGTAATAATTACCTGGCAAGTATGTTT 

AGGTTGGAGAAAGTGTTAAAGGCTTCTGTGGGTGTGCCCGGCGGAACTGGCAATCGATTGTTAACTTGAACATCCAACGTGATGTCCTCCAAGAC 

CATCTTGAAACCTGGCGTGAATCCCGGAAAGTTAACCATGCCGCTTTGTCCGCCAAGTAAGCGAATTAATGCGCCAATGCCATTGGGATTTTGAG 

CATACGACTTCTTTAGAAGTTGGGATATAGTTTGGTAGTAAATATTCGGGTGCTGGGATGCTAGAAAAATAGTTCGTAAAGTCTGCCATTTCAAT 

ATCTGAAGAACTTTCTACTCACAGACCACCAGCAGTAATCTCACCGCTGTTTGCTGCTCGAAGAGATTGGTGCTGAGGCAGCTCTTCCACAGCAT 

GTACTTGGTTTCCTCATAGGCCAGATCGGCCAGCAACGGGAAGTTCCGCTGGTTGAGCACCCACTGTTCCAGCTCCTGCATCGAATTGTCCAGGA 

AATCCTGACACCAAACATACTTCTCGGCCATTTTGGCTGCCGTAAGGAGCAACGTCGCTCCACGCTCGTTCAGGCGCAGCACGGCATTATTCAGC 

GTGGGCTGCATACCGGCAAACATTTTGTTGCTCAGCCACAACGGCGCCTTCTGCACTATCACCTCGCACATGTCCTCGGTATCTACATGACATTG 

TGTTAGAGAAATAATATATCTCTTTATTCAACCAGTACTTACAGATTTCGACTAGCACATTGACCAAGCTCTGCAGGATCGTGTCCGAGTAGTTG 

GTGGTGATCAGCAGGAAAGGAACAACGGCCACGTCCAGCGGCTTCTTCGGTGCGAATCCCTCCCTAAGCATCTCACGAAAGGCTGCCCTAAGGTG 

CTGTTCGTGGCACGAGGTCATATAATCGAGGATGCCCGCCTCCGAATCCAGGTGGCAGCGGTAGTCATCGACGAATCGCTTGATGCCGCACTGTA 

GAATCTGCATGATGATCTTGTAAGGGAAACAGCCGCCCAATCTCGCCACAATCCAGTCGAAGGTCATGGGAAACCGGTGAAAGGCATTGAGCATG 

ATCTCCACGCAGGCCTCCTCCTCCGGTTGGGTTAACTTTCGGAAACAACTATTTATAAGGGTCATCAGCCCGTGCATGGCGCTGCACTCCAGCCA 

ATAGTTGCAGCTGGCCGTGATGGTCATCCGCCGCTGGGTGTACTTATCGGACAGCTTGGCCACTAGTTCCAGACTCCAGTTGGCGATGAGCGGCG 

ACCAGGCGGGCGGTCCCTTCCACACCAGACTCTGCAGCGCGGTCTTTACCAGATTGAAAGCCTCGTATTCCGGAGCACGTTGCTGCTGCTGGCGA 

TTCCCGGCCAGCTTCACCTGGGATACCGGCATACCGGTCTTTGGGTCGGCCATCTCCGGCGACACATACAGCTGCACACTGATTTCGGCCAGCAG 

GGCGAAGTATTCGAACACAATGTCTCGCGTGGCGGGCAGCTCCTCCAGGAATCCCAGTCTGCAAATATCCATGGAATTATTTACTACGGGCATCA 

TTATTAACCACCGCGGTTGCTACTCACGCGAGCTTGATGAGATTGGGGCTGGTCAGCAGCTGCGGACAACTGTGCCCATTGCTCACCGTCTCTAT 

GAAGTGCTT AAGCT GATCCAACAGGTT CTGGCGCAGCATTTTGTCGTCCT AGGC ATCGGGAACGCAGCACCGAAGTTGGCCAAGTGAGCAAATGC 

ATCGCAGCAGCGGGTCGCTGCGTCTTTCACTCGGAATACCAAAACGACGGGGCACAACTGCAGGCATTTGGAATCCGCCGCCACCTGGCCACACT 

AAGGCGGCAGAAAATTCGAAATTCTTTAATATCTGAAAATCATAAAATTAACAGAATAATAAATATCTTAGCACTCTAACGTGTCATTACATTCA 

TTATAAACTAGCACCTGAGATTAGTTTAAGATGACACTTTACATTCAAGTTAAATTAAGCTGGAAAAGATTTCTAATCGGAATTATGTGTTTACT 

ACAATGCTAACTTTCGAATCGAGGATGTAATTACTGAGTAATCAAACTGTTTGCTCTTTTTAATACTAAGTAAACGTACTTATAGCTCTATTTTC 

TAATTTTATAATATTTAACGTGTTTTTGTTTTAAAATATTATCGTAACAGTAGGGAGTGCGAAAAACGGTAACACCAAAACCCAGTTCGTGCATT 

ATTGCACACTCTTTTGGGGCGAATCAAGAGCGAAAGACAACATGGCGTCGAAGCGGCGAATCTGTCTCGCAGTTTGTCGTGCTGTTTGCTTTTTT 

CTGCTCTGATTTGTACAAATATTCATTCAAGGTGCTCCAGTTTTGCCAAGTATATGTTAATAGAGTACCATAACAATGGTGGCGTCACAGGGGTG 

CTAGAGTTCGATATCGATAACAGCTCACATTCTCGTCATATGCAACTATCGCTGTACTCTCTGGCCGAGTATCGATACGCAGATATTATCGTCGT 

GGTCAGACCAACGCTCTAGCGTCGGTCCGCGGAAAATAGCGAAAATTACAATTATCCGGGAAGATATATTGGTCGAAAACTAACTCGTGATCCCC 

CAATAAATATGCGCAAGAGAAAATTGTATATAAACTGATCGAATGCAGCCAGCGTGCTAAGCACTAAAAGCGAAAAACGCCAGAAACAGAAACAA 

GGAATTGCAGCGTGAGTGTATA 

(SEQ ID NO: 1492) 



Exon: 4722. .4588 

Exon: 4523.. 3748 

Exon: 3692.. 3598 

Exon: 3519.. 3348 

Exon: 3290. .3026 

Exon: 2965.. 2544 

Exon: 2484.. 1840 

Exon: 1775. .1633 

Exon: 1573.. 1001 

Start ATG: 4 423 (Reverse strand: CAT) 
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Transcript No, : CT27084 

CTGCGTTCCCGATGCCTAGGACGACAAAATGCTGCGCCAGAACCTGTTGGATCAGCTTAAGCACTTCATAGAGACGGTGAGCAATGGGCACAGTT 
GTCCGCAGCTGCTGACCAGCCCCAATCTCATCAAGCTCGCACTGGGATTCCTGGAGGAGCTGCCCGCCACGCGAGACATTGTGTTCGAATACTTC 
GCCCTGCTGGCCGAAATCAGTGTGCAGCTGTATGTGTCGCCGGAGATGGCCGACCCAAAGACCGGTATGCCGGTATCCCAGGTGAAGCTGGCCGG 
GAATCGCCAGCAGCAGCAACGTGCTCCGGAATACGAGGCTTTCAATCTGGTAAAGACCGCGCTGCAGAGTCTGGTGTGGAAGGGACCGCCCGCCT 
GGTCGCCGCTCATCGCCAACTGGAGTCTGGAACTAGTGGCCAAGCTGTCCGATAAGTACACCCAGCGGCGGATGACCATCACGGCCAGCTGCAAC 
TATTGGCTGGAGTGCAGCGCCATGCACGGGCTGATGACCCTTATAAATAGTTGTTTCCGAAAGTTAACCCAACCGGAGGAGGAGGCCTGCGTGGA 
GATCATGCTCAATGCCTTTCACCGGTTTCCCATGACCTTCGACTGGATTGTGGCGAGATTGGGCGGCTGTTTCCCTTACAAGATCATCATGCAGA 
TTCTACAGTGCGGCATCAAGCGATTCGTCGATGACTACCGCTGCCACCTGGATTCGGAGGCGGGCATCCTCGATTATATGACCTCGTGCCACGAA 
CAGCACCTTAGGGCAGCCTTTCGTGAGATGCTTAGGGAGGGATTCGCACCGAAGAAGCCGCTGGACGTGGCCGTTGTTCCTTTCCTGCTGATCAC 
CACCAACTACTCGGACACGATCCTGCAGAGCTTGGTCAATGTGCTAGTCGAAATCTATACCGAGGACATGTGCGAGGTGATAGTGCAGAAGGCGC 
CGTTGTGGCTGAGCAACAAAATGTTTGCCGGTATGCAGCCCACGCTGAATAATGCCGATTTCCTGGACAATTCGATGCAGGAGCTGGAACAGTGG 
GTGCTCAACCAGCGGAACTTCCCGTTGCTGGCCGATCTGGCCTATGAGGAAACCAAGTACATGCTGTGGAAGAGCTGCCTCAGCACCAATCTCTT 
CGAGCAGCAAACAGCGGTGAGATTACTGCTGGTGGTCTCATCCCAGCACCCGAATATTTACTACCAAACTATATCCCAACTTCTAAAGAAGTCGT 
ATGCTCAAAATCCCAATGGCATTGGCGCATTAATTCGCTTACTTGGCGGACAAAGCGGCATGGTTAACTTTCCGGGATTCACGCCAGGTTTCAAG 
ATGGTCTTGGAGGACATCACGTTGGATGTTCAAGTTAACAATCGATTGCCAGTTCCGCCGGGCACACCCACAGAAGCCTTTAACACTTTCTCCAA 
CCTAAACATACTTGCCAGGATGCACAAGAGCAAGAATGTGGCTCCGTACATCAAAGCGCAGCATTTGAACCAAGCCCTCAACGAATGTCTTCCCA 
AAATCCTTCAAATCTTCGACTGCACCGTTAACAAACTGGTCCTGAGGATAGACAGGGATGCCGCCGAGCGGATAGCGGATAAATTCCGGGCGCAG 
CAAAGTAAAAATAGCAATAACAACAATGAATTGTGCAATGGCAAGGATTACGGAAAGCGCACAAAATTGGAGCCGGGAGAAGATAAGGTGGATGA 
TGAGGACGCTACCCGCATGCGCTTGGCCCATCTCATTGTGGATCTGTTGAACAACATTGAAGCCGGCAGCCGGACAACTGTCCTGCGCACTCCAT 
TAGTCCTCAAGCTGGCCACGCTAAGCGTAAAGTACTTCTTTGTGGGCTTAACCGAGAAGACTGTGATCCGTCGAGCAGCGGCTTCGCATCGCTCC 
TACACGCTGTTGCAGAGGCAGTGTTCCGCCCGGAAAATCGCGAGAACCGTCTGCCTGCGCGAGCTGGTGGAGCGCGCGCTCTTTTATCACGGTCA 
CTTGCTTGGTCAGCTGGAGGTGTATCAGCTCGATGAGCTTGAGATACCCGAGCACGAACACCTCATCCTGCAGAATCTGCACACCAGCTCTGGCG 
CCAACTCGAATCGCTCCGTGCTGCATTCCGGTATTATAGGCAGAGGTCTGCGACCAGTGCTGCCCCCCAGCGAGCGGAACTGCGATGCGGAGAAG 
CAGGCGCTGTATCTTAAGGCATTGAATGCATGCTGCGCCGATCTGGAGAAGCCCAACAACGTGGAGGGCTACTCGCTGGTCTCGCTGCTGCTGGT 
CGAGCTGGTCTCCACGGATGTCATGTACAACGGCCTGCCCTTTCCGGATGAGGAGTTCACCAGGGTCACAATGGAGCGGGACATGCTAATCAGGA 
GAGCGTTCATCAACTCCCCAGTGCTGTGGGCTGTTTTGGGACTGATCGCCGGTCATAGGCCGGCGCTTTGCTATTCCTCCGTGCTACTGCGCGCA 
CTGTGCGCCACCTGCTTGCATCACTGGCGGGGCAAGAATGTCAATCGATTCCAGCCCACTGCTGCGAACGATGAGCTGATGCTGTGCACCAAGAA 
GATGCTGCAACTGCTGGCTATGAGCCAACTAATTCCACCTCCACTCACCAATCTGCATCTCATCATCGAGCACTTTGAATCGGCAGAGATTGCCC 
TGCTGCTGCGCGAGTGCATCTGGAACTATCTTAAAGATCACGTGCCCTCGCCGGCGCTGTTTCACGTGGACAACAACGGGCTGCACTGGCGCAAC 
ACGAATACACAGCTGGCCAAGGTGCCGCCGCAATATGTGGACCCGTTGCGCCACCTAATGCAGCGTAAGCTGTCCACCCTGGGCCCCCACTACCA 
TCAGATGTTCATCATGGGCGAGCTGATGGAGGGCGACTCAGAGCCAGATCCGACGGCCCGGCTGCAGATCGTTGAAATAGATTAAGCAAACGTAG 
TTAGCTAGCCTAATGTTGTAGTGTGAGCATGAGCGTGTGTGCATGGAATAATGGAGCGTCTGCGAGTGTTGAGTGTGGAATTTGGAATGTGCGAA 
ATGTGCGAACTAAATGCAGTGTTTGTATTTTGATATTCAATGGGTTCTTTTATTTTCGTTTCATTGGTTTGTAAAAATTAAAATCTCAACCTAGG 
CCTTGCAAATACTGTCCAGCAGTTATTGTACCGACGGCGCCTGAAAATCGCATCATTCGTCGTGTGTAGTTTAGAATATAACGAAGCCAAC 
(SEQ ID NO: 1493) 

Start ATG: 236 (Reverse strand: CAT) 

MADPKTGMPVSQVKLAGNRQQQQRAPEYEAFNLVKTALQSLVWKGPPAWSPLIANWSLELVAKLSDKYTQRRMTITASCNYWLECSAMHGLMTLI 
NSCFRKLTQPEEEACVEIMLNAFHRFPMTFDWIVARLGGCFPYKIIMQILQCGIKRFVDDYRCHLDSEAGILDYMTSCHEQHLRAAFREMLREGF 
APKKPLDVAVVPFLLITTNYSDTILQSLVNVLVEIYTEDMCEVIVQKAPLWLSNKMFAGMQPTLNNADFLDNSMQELEQWVLNQRNFPLLADLAY 
EETKYMLWKSCLSTNLFEQQTAVRLLLVVSSQHPNIYYQTISQLLKKSYAQNPNGIGALIRLLGGQSGMVNFPGFTPGFKMVLEDITLDVQVNNR 
LPVPPGTPTEAFNTFSNLWILARMHKSKNVAPYIKAQHLNQALNECLPKILQIFDCTVNKLVLRIDRDAAERIADKFRAQQSKNSNNNNELCNGK 
DYGKRTKLEPGEDKVDDEDATRMRLAHLIVDLLNNIEAGSRTTVLRTPLVLKLATLSVKYFFVGLTEKTVIRRAAASHRSYTLLQRQCSARKIAR 
TVCLRELVERALFYHGHLLGQLEVYQLDELEIPEHEHLILQNLHTSSGANSNRSVLHSGIIGRGLRPVLPPSERNCDAEKQALYLKALWACCADL 
EKPNNVEGYSLVSLLLVELVSTDVMYNGLPFPDEEFTRVTMERDMLIRRAFIWSPVLWAVLGLIAGHRPALCYSSVLLRALCATCLHHWRGKNVN 
RFQPTAANDELMLCTKKMLQLLAMSQLIPPPLTNLHLIIEHFESAEIALLLRECIWNYLKDHVPSPALFHVDNNGLHWRNTNTQLAKVPPQYVDP 
LRHLMQRKLSTLGPHYHQMFIMGELMEGDSEPDPTARLQIVEID* 
(SEQ ID NO: 1494) 



Celera Sequence No. : 142000013384600 

CTGGCTAAGATATTTTTATGGTCCCTAGCGCTACACACTCAGAAATGCACTCTTAAAAAGAAAAGCACTCTATATGTATATATATATACAAATGT 
ATATATGAGATCCAATTACTCACCTTGTTTGAATTGAAAATCGGCTAATCCACAAATAGTTGGCTGGTACAGTGGAAATCCTTCGGAACTCCCGT 
TCGACTCCGATATATGTACCTAGTGAGCTTGGGTCGCGTTTGAAGACTGAAAATACAAAGCTGCTTGTCAAATCGTTTTATAGCCGCCCCTTTTA 
TGCACTGCGTGCATTGCACATATGTATTGTACGACAAAGTTATAGTTATCAAGATATATACATAGGTCGGTCGGAATAGTCGGGCACTCGTAAGG 
CCAGCGCGACTTAATAGGTCATTATAAGCGACCGATTCGAGTTAAGAGCTCTAAAGATCACGTCGCGAATCGCCGCATGGGATCTTCTTTTTTTT 
TTTTGGCTTGTTTTGCATAAGTTCACCTGAAAATGGTTCACTGAGCTCAGCATTAATTTTATGTTATTTCCATTGTTTTTAGTACGTTAAAGTGC 
CAAGACAAGTGCGGTATTTCTCACGGTGCAGCGCACTCAATCGATTTTCCACAGCAATTAACATTATTAAATATAATGTTGCAGAAGGTACACTG 
AAAGTAAGTGAGTACTACAAAATTCAAGAACTTTCTTCACTTTGTGTGCACTTTTAGAAATGTTCTTGACTTAAAACAATATAGTTATTATTATT 
ATTATTGAAAAAATCCGTACTTTCTATCAGGAAATCCACTTTGCTATATTTTGTAACTATATACACTTTGTAACTTGTACTGTTTAGCACTTTCG 
AGAAATTCTCATCTGCCACTTGCGAAAAGTCCACTTCCTATATTTGTTCTAGAATGTAAACGAAGGAATATTTTCATTTCTCGTCGCAAAATACG 
AAATTTTTAAGTTAGCTATATAGCATCGGGGCACTTTCGCAATCGCGGCACAGCTTTATATATGTATATATATTTATATTAAGTGCGTGTTCTGA 
GCGTTTCGCACAGTGTATCGTGGTTAATCTTCTGATGGTTACGCGACTTGGATGCAATTAGACTTGAACTTTTGAACGCTCCGTACACGAGTAGT 
TTAAAGACTAATCGCCTTCGCCACTGTGACTGAGTGAAAAACGAGTTTGTAATCCCTCTTACGGATCGTAAGTGTGTAAGTTATTGCTTTGGCTA 
TCACTCCTCCCGTTCGACACAACAGTAACTTTCTGCTTTTCTGCCGGACACTAGCTAATGTACCTTCTTTCGACTGGCGTGTCCGCGTTCCACGA 
TCAAGTGATCAACTGACTGACGGGGGCTGTGGGTAATTCGAGCTCATTTGCTTTTAGTTTCAAGCACGAGCAATTAATTAAGTATTTCACTGCTC 
GCCGTTTCCAAATGTATTTCCATGCCATTTCTTGCGCTTGCCTGAGCCCATTCTGCGTGTCGGACCGCCATTCCGCCATTTAGCCATTCGGTCAT 
TCCGTCCTCCAGTCAGACGATCACCTTCGAGCACGGAGCTACGTCAGCCTTTAAAGTCCGGCGGAATTTCGTTGGGCAGCCAAGTGCAATTAGCT 
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AAGTCGAAGTACAGAACGCTGAAAGTTAAAAGTAACCCACTGATGGCATTAATCAATTTACGAACACGCAATCCCAGCGACAGCTGTTGATACGC 
ACAGCGCGCTTCACATGGACATACAGACAGACAGACGGACAGACACTCCATTATCTATTAGGTATTTACATGCATTTAATTTGAATGTCAAATTC 
AGATTTATGGCCTAGCCTATGAGAGTTACAAACAATGAGAGTCGATTACATATGAATTTACACCTATTTCTAGTTCTACGTATGCATATCGATTC 
AATTCATCCATTCATTCCGTTGTATTGTCGAGTATAAAACGTACATTTGACTAATCATATAGAGATATTGTGCAGACAATTGTTGCTAGATATTG 
CTTACGATGAACTACAAAAGCGTTCTCGCCTGTTTTATACGTTTAAAGCTTAGACTACGTACAAAAGGTTTGGGTTAGTAATATGTCAACGATAT 
TCAATGCTAAAGTCTTAGGTAGCGCTTAAGTGTACGGATGAGTTACTCTTAAGTACAGATCAAATCTATGCGAAGGAGAATTACGTATTTACAAT 
ACAATTGTCGGAGACTCCTGGTTAGATCACTGGAGTCGGCCGAATCCTCGCTCTGCGACTCCGTCGCCTCCTCTGCAATCATTTGGGCCGAATCT 
TCTCGTACATGAATTTCTGCACGTGCTTGTTGTCCGGCTGCATGAGCTTCTTCTTGTTGAACTCGAATTGGGCCAGCGCGAAATCGTACAGATCG 
TTTTCCATTTGCCAGATTTTTGTCTTTTGAATGGATTTAATTGTCGATTCGCTGGGCGGAAGCTTGGAAGATGTCACCCGCAGATGAGATTTGTT 
T GAATT AT GATAGTGCT CACGAAAGCCGTGAAAAATTCTGC AAAAGAAAAAAAAAGAAC AGGTC AAT GAAGC AACTGGAGTATT CT AAAACAGAC 
AATAACATATGGTTGGGAGGCACACCTTGGAAGGGATCTCTCGAGCAGATCCACAAACTCGTACATCTGCTCTGTGACTCCGACTAGGAAGTATT 
CGTTGACTAGATTGCGCTTGGCCTGGTCCAAGGCCCAACTGCTGCCGGGTTCCCAGCACTCGGCTGCATGGCCACAGAAGAAGGGTATCTGCAGC 
CACATGTTTTTGGGATCACAGTCGGGTTGTTTCTGCACCACGCACTCATCGAAGGTCTGCAAAGATGTGTGAGGGCTTTGGTTACAGGGAGGTTA 
ATACCTTGATGCTCTACCTACAATTTTATTGCCCGCCTTCTTGCGGACTAAATTCGGTCGGTAGTTGTCGCCAAAGCGTAGAAAGTAATAGTAGG 
AGACAAGTCTGTCGAGCGGTTTGCGCACCAAATTGATGTAGATGGGCTTGTGGGCGATTTGGAATCTGCAAAGGAGTAATGCAATTGAGACGGGC 
TCTGCTGGATTGGCTAGATGTTACCTACTTTGAGAAGTCTAGGAAGGCCATGTGGCCGTGATAAAGAGCTGGCTTCATCTCGTGCCACCTGGAAA 
CATTGCGCACGAATTGGATTTGATTGGGAAGCGAGAGGACGTGCATGTTGGCAGTGACATTGATGTGCAGCACATGGAATTTATTGGGCTTGCAC 
AGATCGTATGCTATGTTAACAAAGCTGGTGGATCCCGTTTTCGGTACGCGATTGTAAAGCACCACTAGGTGCTCCTCGAAGTCAAAGTCATCGGT 
GGTGGACGAGGCATGCTGATCTGGGCTCAGAGAATCGCCCAGCTTGGAAAGCGGCTTGAAGGCTGCACAAAGTAGGTAATGGGTGTAAAAGTGAT 
ATGGTTTTGCCTAACTACCGGACTCCACTTACCATGTTCCAGGCGAATCTCTGACCAAAGCAACCAGTAACCTGCACAGGTGACCGCGCACAGGG 
CTATCAGGATCAGCCAGTGGGTGGGTCGGAGCAGAATCCACATCTTAAGCAGTTTCCTAAACATGCCTCGTGTCGGTTTGCCCTACAGGTCAGCA 
CGGTAGCTCATATCAGCGTATTTATTTCAGCATAATCTCAGGTCGTTGTTTTCCTTGTTTTTTTTCTGCAACCAACATCGAGCAGTGTTGGTCAG 
CAGCTGCTGGACAGTGCTGCAAGGCACTGAAATCTGGTGAATCTGTGGGAATGTGCAGCTGTAGCTGAGATGACTAAATTAGTGATGGAAAATAG 
CTCAAAATGCTAGCTCGGATAATTGCATTTAATTTCCAGCTAGGAATCCGTAAATTAATTGCCTTATATTTATAACTTCAATAAATATGTTTTTC 
CTAAAACACAACACAGAATATTTTAGGCCAGTCACAACCAGATCTGCATCACCCTTCAGCAACCCTTTTGCCTTGAGCATCACTGTTTTTATTAG 
TGTGACCACCCTTGCCGGCAAATAATACCAAGCAATGCCAAAATTCTAGCGCCCATTTTTGCGAGGXTAAAATAAATTTCGCAAGCCGGCTGCAA 
TC GC AAAG T AC AG T GCGAC T TAT AT AC AC T C GT AT AAAATAAT T C C ATT T AAAAGC AAG CAT GT C C AC C T T T AAC T T C GC C AGC AT GGCC GC CC A 
ACTGGGCCAGGAGCAGGGAATATCATTCGAGAACAAGGTGCTTTCCTGGAACACAGCTGCCGATGGTAAGTTTGGGTGACGCTTTGGGTACTCCA 
TGCAGTGGCTATATTGAGGCTTTTTACAGTCCAGGATGTGGTGGATGCCCTTAACAAACAGACCACCGTGCACTATCTGAATCTGGACGGGAACA 
CACTGGGCGTTGAGGCCGCCAAGGCGATTGGTGAGGGTCTGAAGCGTCATCCAGAGTTTCGGAAGGCGCTGTGGAAGAACATGTTTACTGGTCGT 
CTCATATCGGAGATTCCGGAGGCACTCAAGCACCTGGGAGCCGCGCTAATTGTCGCGGGCGCCAAACTGACAGTCCTGGATCTCAGCGACAATGC 
CTTAGGACCGAATGGCATGCGAGGCTTAGAGGAGTTACTGCGATCCCCGGTCTGCTACTCGCTGCAGGAGCTGCTGCTGTGCAATTGTGGCCTTG 
GTCCCGAGGGCGGTAGTATGCTGTCCCGGGCTCTGATCGATCTGCATGCCAATGCCAACAAGGCGGGCTTCCCGCTCCAGCTGCGTGTGTTCATA 
GGTTCGCGCAATCGTCTCGAGGATGCCGGTGCTACGGAAATGGCAACCGCATTCCAAACCCTCAAGACCTTCGAGGAGATTGTTCTGGAGCAAAA 
CTCCATTTACATCGAAGGCGTCGAGGCCCTTGCCGAATCCTTCAAGCATAATCCTCATCTACGAGTGCTAAACATGAACGACAATACTCTAAAGT 
CCGAGGGAGCTGAAAAAATAGCTGAGGCTCTTCCCTTCTTGCCACTGTGAGTCGGTTATAAAGTTTTTGATGAGGGAGAGTGAGTGAGTGTTATG 
TGATATCTTTTTGCTTTGCAGGCTGCGTGAAATGAGCTTTGGAGACTGCCTGATCAAAACTAATGGCGCCTACCACTTCGGTGAGGCTCTGGAGA 
GAGGAAACGAACGACTGGAAGTTATCGACTTAGGTTTTAACGAAATCAACAGCGACGGCGGCTTGGTGTTGGTGAATGCTATGGGAAACAAGCCC 
AAGCTACGCATCTTGAATCTAGATGGCAATAGCTTTGGAGAAGAAGGCAGCGAGAAGATAATCAGCGAGATGAGTAAGTTGCCAACTGCTGCCGC 
ACTGCAACCGTTTCAGCACCAGGAAGAGGAGGATTTGGAAGATGAATACCAGGCTGACAAGCAGGACGCAGATTACGAAGAGGAAGAGGAAGTAC 
ACGAGCACGCCAACGATACTACCGAAGAAGCAGATGAGGATAGCGAGGGCGACGAGGACGACGAGGAAGACGAGGGAGACGAGGAGTACAGCAAC 
GTCGCGGAGGAGACTGCCTATGTCACTACGAATGCCTACACGACCAAGGTTAGTTTATACTTCGGAGTACTGTGCAAATGGTAACAGCAATTAAT 
CGAATTATCTTTAATTTTAAATTTTCACAGCTTTTTAACGACACAACCAACTCGATGGCCAGCGAAACTTTTGCGGTCGCGAACAAGACGATCAG 
CCAAAAATGCACTCCAGAGAAGTTCTGTTTGAGCCAGAAACCCTGCTCCCAGGAAGATTTCGATTCGCTAGATATGGATAACAAACTTGAGGCTT 
TGCAGTCGATTGTCAACGTAAGATATTCTCAATAATTAGAATGCAATTAGAGTAACGTAACATTTTTGTACAGCAATTCACCGGCGACAACCATT 
TGCTACTGCTCGTCTTCACCACCTTGAAGTGCGCGCATTTGTCGCAATCCTCGAAAGCTGCGTTGGATCTGGCCGTCTCCTTGTACCAGGCCACC 
TTTGACTATGCCATCAAGACAAAGCAGGAGACACGTGTACTCAACTATGTACTGATGCAGCTCCGTTTGTTGCCCTGCAAGGAGGTATTCCATTC 
GGACTACGATGTCAAGAACTGTCGATTTGCTCTTCGCGAGGCTCTCAAGCAACCAACGTTTGCCAACGACAACATTAAGAATTCCTTTAAGACTT 
TCCTGGAGGGTGCGGAGTCGTAAAGAGAATTAGAGAAGAGATTACCTTTATTTCCCACTTTCCGATTTGTGTAGCTTTCGCGAATAAAAACGTAT 
TATTCCTAACCCGTAGTGGTGTTTTTTTTTTACCCAAAAAACCAATCGAAATTTAATGTATTAGCCATTCGGCAAAAGTGATTTATTTACATGAA 
GAAAACTCGTATGAGACTACAAACTATCTTTATGGAACATGTTAATACATCGATATTACGTTTGTAAATTCCTCTTGGTGGCTTTCTTGCTGCTC 
TGGAACAGAATCAGCTACAGTGAACGAATGTAGCGAAGTGTTTATCTATAGTTAATAAAAACACGAATTATAAAACATTGACGTTAACATCGAGG 
AGAACATTGAAAGATTATAAAATAACGAACGATCGTGGCGAGCAAGTCCGCAAAAAAAAGTGCCTCCTCGAGATTCTCGATGCGGATAATGCTGA 
TTCTAAGAGTCCATCTTCTATCAATATCCTGTGTGGAAATGTGTTCCTTAACAATTAAAATCAGAATCATCATCATCATCTTCATCAATCTCGAT 
ATCATCATCATCGCTTTCAATTACAGCTCGCCTCTTCTTGGGCGGAGCTTTCTTTTTGGCTCCTCCACTGCTCTCATCCTCTCGTCCACGCTTAG 
CTGGACGCTTGGGCGAATCATCATCGCTGGCATTGCCATCATCATCAGAGCCAACGTTGCCACCGTCTTCCTCCTCATCCGAGAAGAGCGAGCTG 
TAGTCTATCTTCTTACTGGCCTGTCGACGTCCAGGACGCTCTGCACGAGGAGTTACTTCCTCCACGTCATCGTCGCTCTTTTTCGCCTAAAGATG 
AAAAATATTAATTGACAATTAAATACAGATTTGGGTTTAAAAACAAATATGGGCAATTACTGAAACACTTACCTTGGCCTTGCTGAAGTCAATCT 
TGCTCTGCTTGAGCCCGCCGCCGTTCTCCTTCTTTTGGCGCGGCTTTTTTTCTCCCGGCTCCTTCTTGACAACCGCCTTCTTGGCCTTGGGTGAG 
GTCTTAGAGCCGCCCTCGACCATTGCATCAAATTCGTCCACCTCATCGCCGCTGGCA 
(SEQ ID NO: 1495) 

Exon: 1001.. 1215 

Exon: 3912.. 4245 

Exon: 4305. .4986 

Exon: 5057.. 5558 

Exon: 5636.. 5812 

Exon: 58 69.. 6277 

Start ATG: 4146 

Transcript No. t CT28175 
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CAGCTTTATATATGTATATATATTTATATTAAGTGCGTGTTCTGAGCGTTTCGCACAGTGTATCGTGGTTAATCTTCTGATGGTTACGCGACTTG 

GATGCAATTAGACTTGAACTTTTGAACGCTCCGTACACGAGTAGTTTAAAGACTAATCGCCTTCGCCACTGTGACTGAGTGAAAAACGAGTTTGT 

AATCCCTCTTACGGATCGTAAGTGTAATATTTTAGGCCAGTCACAACCAGATCTGCATCACCCTTCAGCAACCCTTTTGCCTTGAGCATCACTGT 

TTTTATTAGTGTGACCACCCTTGCCGGCAAATAATACCAAGCAATGCCAAAATTCTAGCGCCCATTTTTGCGAGGTTAAAATAAATTTCGCAAGC 

CGGCTGCAATCGCAAAGTACAGTGCGACTTATATACACTCGTATAAAATAATTCCATTTAAAAGCAAGCATGTCCACCTTTAACTTCGCCAGCAT 

GGCCGCCCAACTGGGCCAGGAGCAGGGAATATCATTCGAGAACAAGGTGCTTTCCTGGAACACAGCTGCCGATGTCCAGGATGTGGTGGATGCCC 

TTAACAAACAGACCACCGTGCACTATCTGAATCTGGACGGGAACACACTGGGCGTTGAGGCCGCCAAGGCGATTGGTGAGGGTCTGAAGCGTCAT 

CCAGAGTTTCGGAAGGCGCTGTGGAAGAACATGTTTACTGGTCGTCTCATATCGGAGATTCCGGAGGCACTCAAGCACCTGGGAGCCGCGCTAAT 

TGTCGCGGGCGCCAAACTGACAGTCCTGGATCTCAGCGACAATGCCTTAGGACCGAATGGCATGCGAGGCTTAGAGGAGTTACTGCGATCCCCGG 

TCTGCTACTCGCTGCAGGAGCTGCTGCTGTGCAATTGTGGCCTTGGTCCCGAGGGCGGTAGTATGCTGTCCCGGGCTCTGATCGATCTGCATGCC 

AATGCCAACAAGGCGGGCTTCCCGCTCCAGCTGCGTGTGTTCATAGGTTCGCGCAATCGTCTCGAGGATGCCGGTGCTACGGAAATGGCAACCGC 

ATTCCAAACCCTCAAGACCTTCGAGGAGATTGTTCTGGAGCAAAACTCCATTTACATCGAAGGCGTCGAGGCCCTTGCCGAATCCTTCAAGCATA 

ATCCTCATCTACGAGTGCTAAACATGAACGACAATACTCTAAAGTCCGAGGGAGCTGAAAAAATAGCTGAGGCXCTTCCCTTCTTGCCACTGCTG 

CGTGAAATGAGCTTTGGAGACTGCCTGATCAAAACTAATGGCGCCTACCACTTCGGTGAGGCTCTGGAGAGAGGAAACGAACGACTGGAAGTTAT 

CGACTTAGGTTTTAACGAAATCAACAGCGACGGCGGCTTGGTGTTGGTGAATGCTATGGGAAACAAGCCCAAGCTACGCATCTTGAATCTAGATG 

GCAATAGCTTTGGAGAAGAAGGCAGCGAGAAGATAATCAGCGAGATGAGTAAGTTGCCAACTGCTGCCGCACTGCAACCGTTTCAGCACCAGGAA 

GAGGAGGATTTGGAAGATGAATACCAGGCTGACAAGCAGGACGCAGATTACGAAGAGGAAGAGGAAGTACACGAGCACGCCAACGATACTACCGA 

AGAAGCAGATGAGGATAGCGAGGGCGACGAGGACGACGAGGAAGACGAGGGAGACGAGGAGTACAGCAACGTCGCGGAGGAGACTGCCTATGTCA 

CTACGAATGCCTACACGACCAAGCTTTTTAACGACACAACCAACTCGATGGCCAGCGAAACTTTTGCGGTCGCGAACAAGACGATCAGCCAAAAA 

TGCACTCCAGAGAAGTTCTGTTTGAGCCAGAAACCCTGCTCCCAGGAAGATTTCGATTCGCTAGATATGGATAACAAACTTGAGGCTTTGCAGTC 

GATTGTCAACCAATTCACCGGCGACAACCATTTGCTACTGCTCGTCTTCACCACCTTGAAGTGCGCGCATTTGTCGCAATCCTCGAAAGCTGCGT 

TGGATCTGGCCGTCTCCTTGTACCAGGCCACCTTTGACTATGCCATCAAGACAAAGCAGGAGACACGTGTACTCAACTATGTACTGATGCAGCTC 

CGTTTGTTGCCCTGCAAGGAGGTATTCCATTCGGACTACGATGTCAAGAACTGTCGATTTGCTCTTCGCGAGGCTCTCAAGCAACCAACGTTTGC 

CAACGACAACATTAAGAATTCCTTTAAGACTTTCCTGGAGGGTGCGGAGTCGTAAAGAGAATTAGAGAAGAGATTACCTTTATTTCCCACTTTCC 

GATTTGTGTAGCTTTCGCGAATAAAAACGTATTATTCCT 

(SEQ ID NO: 1496) 

Start ATG: 450 

MSTFNFASMAAQLGQEQGISFENKVLSWHTAADVQDVVDALNKQTTVHYLNLDGNTLGVEAAKAIGEGLKRHPEFRKALWKNMFTGRLISEIPEA 

LKHLGAALIVAGAKLTVLDLSDNALGPNGMRGLEELLRSPVCYSLQELLLCNCGLGPEGGSMLSRALIDLHANANKAGFPLQLRVFIGSRNRLED 

AGATEMATAFQTLKTFEEIVLEQNSIYIEGVEALAESFKHNPHLRVLHMWDNTLKSEGAEKIAEALPFLPLLREMSFGDCLIKTNGAYHFGEALE 

RGKERLEVIDLGFNEINSDGGLVLVNAMGNKPKLRILNLDGNSFGEEGSEKIISEMSKLPTAAALQPFQHQEEEDLEDEYQADKQDADYEEEEEV 

HEHANDTTEEADEDSEGDEDDEEDEGDEEYSNVAEETAYVTTKAYTTKLFNDTTNSMASETFAVANKTISQKCTPEKFCLSQKPCSQEDFDSLDM 

DNKLEALQSIVNQFTGDNHLLLLVFTTLKCAHLSQSSKAALDLAVSLYQATFDYAIKTKQETRVLNYVLMQLRLLPCKEVFHSDYDVKNCRFALR 

EALKQPT FANDN IKNS FKT FLEGAE S * 

(SEQ ID NO: 1497) 

Name: RAN-GTPASE ACTIVATING PROTEIN 
Classification : signal_transduction 
Gene Symbol: Sd 
FlyBase ID: FBgnO003346 



Celera Sequence No. : 142000013384249 

TTGGATATTTCCGTTTCCTGGCTGTGCATTCTCCTTGTCGTCCGCTTGCATTCCTTTCTCCAGCGCCATTAAGGTGCTGTTTCCGAAGTTGAACG 
ACTGCGAGAATGCCATTGCGCCGGTGCGTTTTTATACTGAACTAAGAATTGTTGCAAACTGCTTGTTACAAGTGAAATGAATAAAATGAAAACGA 
AACGTCGCGCGGTCAATGTGGCCGTCTTATGATTAGTTTCAAATATACTATTCATGTTTGTGACACTTTGAATGAACTGTTTACACCGCCAGCGG 
TCCCACTGAAACCTTGTTCACACTGCACGCGACTTTCTTACAAAATATCTTAATTAAACAGATTTTATTTATACCGTTACTTTCCGAATTCCATA 
TGCGTGCTATGGGGCATTTAGCACAAAGCTAAGGTCCCACTAAAACCTTGTTCACACTGCACGCGCCTTTCTTACAAAATATCTTAATTAGACAG 
ATTTTATTTATACCGTTACTTTCCGAATTCCATATGCGTGCTATAGGGCATTTAGCACAAAGCTAAGGTCCCACTAAAACCTTGTTCACACTGCA 
CGCGCCTTTCTTACAAAATATCTTAATTAGACAGATTTTATTTATACCGTTACTTTCCGAATTCCATATGCGTGCTATAGGGCATTTAGCACAAA 
GCTAAGGTCCCACTAAAACCTTGTTCACACTGCACGCGCCTTTCTTACAAAATATCTTGATTAGACTGATTTCATTTATACCTTTACTTTCTTGC 
T AAGTGT AAGCT AAGT GT AGTT AAGT GT AAGCCTGAAAT CCAAGCCC AAGACC AAAACTGAAAGCAAAAGACC CAC AGC AAAAACT CAAAAAACT 
CCACGACGCAACAAACTCGCGTTCGAATCGCAAGCAAGCTGCGCTGAAATCGCTAACAAGCCAACCAAAAGCCGCACGAAAGTTGCGCCCAAGCC 
GCAATGAATTTAACGCTGACTTTATGAACTTTAAAAGAAACCTTTGGCTTATTTTTGTTTATTTCTTTATTTGTATATATCCTTAGAATAGTTTG 
TGTTGTTCAAGATTACATAAAATTTGTTTACATATCTATTAGATCAATATCCCATTTACATAGTTAGCATTATACAAAATCGCGCGCCGCAAGCT 
GATGACATTCCATTTTAGTTGGTTTTTTTTTATAATATGGTTGTTGGTGGATAGTGTGTGGTTGTGGTTAATTAGCCGCGATACAAATTTGGATA 
TTATGCTAGACGACAAATGTGCGACATTATAAAATAAATAACATGGTTACTGCTCGCCGCCCTTGCCGGATATCTTTTGATGGCCACTCCTTTTG 
GTTTCTGGGGTTAGGCGGAATGGATCCTCATCATCTGGCCATCATCTACATCTTATAGGTGGCGGGCACGGATCGGGGATAGTCCAGATCGTACA 
TCTGGGCCTTGATGAACGCCAGCTTATTGACCGGCTCTGGACGAACAGTGGCCAATCCGTTTTTGTAGGCGTACTCCACAATCCTTTCGGCAATG 
GCCATCGAACAGCTGACAATGGAGCTGAGTGGTGGATACAAGCTGCCCTTGGCCAGGTCGTCCTTGGAGACCAGCTCCGCCAAGCGCTCGGCGGC 
GACCAAGAATACCTGCTCGGGAATGTTCAGCATGCCGGCACACAGAACACCCAGTGCCACGCCAGGGAAAATGTACGAGTTGTTGCCCTGACCCG 
GATAGAACTTCTTGTTGTTGTACGTCACAGGAGCAAAAGGCGAACCGCTGGCGAAGATGCAGCGCCCCTTGGTGTACGTATACGCCTCCTCGGCG 
GTGCACTCCGCCTTGCTGGTCGGATTGGACAGTGCAAAGATGATCGGCGTCTCATTAATATCGGCCATCAGCTCAAGGATCTCCTGGTTGAAGGC 
GCCGCCCTGCGCAGCCGCTCCAATCAGGACATTGGGACGCACCTTTCGCACCGCCTCTGCCAAAGTATCGATGGGTTCGTGCAGCTGGGCAAAGT 
GCAGCTTGTGTTCGGTGAGTCCGCCCTTTGGACGATCGCGGGTGATGACACCACGGCTACATAGAAAGCATCTATAAGTATCAGGATGTATTTTG 
ATTATATATCGACCTACCTATCCACCATCCAGATGCGGGCCTTGGCCTCCTCCTCGGTGAGACCCTCCACCTTCATGGCCATCAGGCACAGGTTG 
GCAATACCAAGAGCCGCTTCTCCGGCGCCCAGGAACAACAGCGTGTTATCCTTCAGCTGGGTCTTCTTGATCTTTAGCGAGGCCAGCAGACCAGC 
CACGGCCACCGACGCGGTTCCTTGAATATCGTCGTTGAAGGTGCAGAAGGAGTCGCGGTATTTGGACAACAGCCTGAAGGCATTGGCGTTGGCAA 
AGTCCTCGAACTGGATTAGGCAGTTTTGACCAAAGCGACGAACGCAGGCATGCATGAACTCATCGATGAACTCATCGTACAGATCTCCAGTGGCC 
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CTGCGTTCGCGCAGACCGATGTACAGGGGATCCTCCAGGATGGATTCGGTATTGGTGCCCACATCCAAGGTGATGGGCAGGCACTGCGATGGCTT 
AATGCCCGCCAAGGCCGTATACAGGGACAGTTTGCCCACGGGTATACCCATTCCGTTGGCGCCCAGATCTCCCAGTCCCAGGATGCGCTCGCCGT 
CCGTGACAACGATGGCACGCACATCCGTTTCCGGCCAGTTCTTTAGCACGTCGTAGATGTGTCCCTTGTCCTTGATGGATATGAACATGCCCTTG 
GCGTTCTGGTGGATCAAACTGTAGCGCTGGCAGGCCAATCCCACGGTGGGCGTGTACACCAGTGGCATCATGTAGGCGATGTCTGAGCTGAGCAC 
GTTGTAGAACAGACGCTCGTTCCGCTCCGATAGGCTGATCAGGTACATGTACTTGTCCAGATCCTGATCCAGTCGCGCCAGCAGAGCGCGGCAGT 
GCTCCACCTGCTCACTGGGCTCACGGACCACATAGGGCAGCATGCCATGGATGCCCAACTGCTGGCGCTCCTCGTGTGTGAAGGCCAGGCCCTGA 
AATTGAAAGGGATATGCATTTTAGTTACCAATACCTACTATTAATATCTGTAGGATTCGTAATAGGATGGCATGATCGCCTCATTGAATGATTAT 
AGAATCTCCTCTCAAATCAAAACAAACAAAATGTAGGATGGTGGAAAAGTTTGACCCGCTTATCATGGACTTGAATTCCCAGGCACGCGCCCCTA 
TTCATATAGAGTATTCCACGGCTACTAGCCACCGAGACAGAGGCAGTTATTACCGATGCTTCGCAGATCCGAGGAGATAGATCCCTCGACTTCAC 
GAGTCGGGCATCAAGTTCAAAGTCGTTTTCTAGCGCCCAGTCGCTTGAAAACATATTTGCAATCGCAGCTTTAATTAAAACACATTTTTTTTCGA 
GGGGGAAGTGAATCAGCGCTGGCCTCGATTCACACGCACAGCTAATATGTTTAGTACGAGATTTTGAAAGTTTTTGCTCCGTAATTGCTGCGGTT 
ATTATCGAAATGGACAAAGCAGGCGGGGATTATGTTGGGGGCGTTGCTGCCGACATTAAATAAAAAAATACGTGCAAGTTGGCATTCAAATAGAA 
ATCACGCTGGATTATGTGGTTTTTGTTTTTAAACTACGGATGTACCTGCTAATATGCACATTAGGGAATAATTACTCGTGCTTTGGGTTACTTTT 
CCGAGTGGCTGCTGTTTTCAAATCGTGCTCATTACAAAGGTTTTCTTAATGAGTCTATTTTAAGCTTCTTGGGACATTTGATTTAGCTTAATAAT 
AAATGCACATTTTTATTTCCAAGCGATATATCAAAATGCAACCAATTCCAAATCTGTTGTGTAAGTTTTCCGAATTATTTTTGTAAATTGAATAT 
TTTGTTAGTATGTCAAATTAAGTTATTATTGGAAATTATACTTGATAGGTTTTCTATCATTTATCATCATTTAAAATTAATCTCGTTTGTTTCGG 
TCACGCAATAATAAATTTATAGTTTTTCATATGAATTTTTAGTTTTCCGATAAAAACAAACTATTCGTATTCTAGTTAATTGATTCGTAATTCGT 
ATTCTAGTTAACGTTTTTGATGCCTATCACTATGAATAAGCAATATTTAAAATAATTTTGGTCAGCTTGCAAAATATCATATATCATATTAAGAG 
CTGACACAGATAAATAATATTGTTGTGTTTTTAAGGAATTTTAGCTGGAAATCGAGCTGATTCCCCTTTTTTTTTTTAAGTTTGCGAAGTAAAAT 
ACATATGTATTTACATACGCATCAAATTCAATTAGCTCTGAACTGGCTGTTTGGACAATAAACACTGAAGTGATCCCCATAAACAATTGTTTACT 
GC C AGAAT T GC C T C G AACT T AAC CAT T ATC GT GT GGCT AC GT C AT CG GT G GGCT GGT G AAATAC AT CC AT ATGT AC AT AC AT AC GT T AAAAAAAG 
ATTGAGAGAACGACACCGTTGACATTGAAAGCGACACACGACCGTCGATAAGCGTGCGAAAGTGTTTTCGAACCGCCTGGCAACAAGTTCTCCCC 
ATTCTCCAGCACATATATGTACATACATATATATATGATGGATTAATATTCATGTATTCATGCCCCCAGCAATTGCCTCCAAACTTGCCACATCA 
TGCCTAAGTCATATGAACTAAACTATAGTTTCATCATCCTCATCACCTTTTCAAGTTCTGACTTTAGTTTGTATTATATTAGTAGTTATAGCAAA 
TCATGTAAGATTGCTTATAAAGAGGGGGCACCAAAAATCGAATCCAGACATGCGTATACGTAATGTGCAACTCCGATATTACGCATACACATTTC 
GCATGGTATTGCAAAAAGGCAAAACGAAACGAAAACTGACAAAAGTCGCAGCCACAAGCAAAACCAAACCAAAAAATCACAAAAATGAGCTGGTT 
CATATTTTTTTTTACACTGTCTGGCGCGACCAGTTCGATCATCATCATCGCAGCGGTGATCATCATTATAGCAATGGCTAATCATTAAGTCGACT 
ATCAACGACCGTCTGTGTCTGAACTTTGAATACGGACTCGGATCGGATGACGAAAGCGTCGCTTTAATCAGCAACAGTTGAGCAGACACACAGAC 
AACGGGCAATCTCAGCTGATTGTTTCCCTTCTGGCAAAGGTGAGCTGATGAAGGGTTTTTTTTTTTCTGTACCAGCCTCCCTCCCAGCAGGTGAA 
ACGCTGCTGCTCTTTTGTGGGTCATACGATGTGGAATGATGACACAGGATGAATGCGCCACTGAATCGGAGCATAATAGTTTATGATGCACACTG 
CAAAAATGCGCGTAATCTTTGAGTAATTTAGGTTATCCAAAAGGAAAACATAAATTCCAACCTATATATGTAAGCTATTTCAAAGCATTTCAGCC 
ATTAATTGCAATTGACAAGCACACTATTATATATTATTCTTTCTTTAGTTGCCCAATATAAGCAATCGATAAATCATTGGGTAAACAATGAAATA 
ATCCAACTATTTCAACAAGTTTCGAGCTCGCCCCGCTCTTTCGGCACATTGACCGCCTGAATTTGTTTAAATTTTTAATTTCCTCAGAGCGCCAA 
TAAATAACAACAAAGAGCAGGGGAATATAATTCTATTCGCCAACTGCATGTGGTGTGAGCAAACCACACAGATAAATATTTAATGTGTGTATATA 
CATATATAGGGCTGATACCAAGTGTTTTTGTGGTGATTATCGTCATTCGGTTGTTTATTTCTGCGGTTCGAAACAGAGTTAAATTTATTGCTGAA 
TGTAGGTCAGTTTTCGCGGACTGGCTTTAATTTTGTTTCTTATTTTTTTCATTGGAGATAGGCACTCGAAAACTTTCCATGGGTAAATCAAAACA 
AAGCTCAGCTGTTCGGGGAAGGAGGTTGAGATAAACAAGAGTCACTTGTTGGCAGCGTGTAAGAAGTACATAATAACAATCTGTAGTAAACAAGT 
GCCAGCCCCTACTCGATAAGGAAACCCAAGATAATTTCTCGCTAAATTGTTAAATTATTATTTTTAAGTCTTTTCGTCCGATTGTGCCGTGACGT 
AAGCGGGAAAAAAATCCACTTGGTTGTCATGGGGAAACACCATTCTGGATGGCCCTCCATTATTCTCAGTGGAGCTTCTGTTTACAATGCCAAAT 
GGTTGGCCCCTTTTGGTCAGTTTTTTTTTTTAGTAGCTACACGGGGGAAAATAATGTCTTTACTAGAAATTACGTAATTTCTTTAGATTCCTAGC 
ATTTTAAATTGTAATATCCGTTCTAACCATTACCACCTTTACATTAAAACTCTGTTTTTTCAGTGCAACTGTTTTAAATAAACAAACGCAAAGCC 
CATGCGCGCTCTCTCTTCTCTCTTTCAAAAATGCTCTCTAATTTTATATTTTGTGAAACAATTCGTGCATTAGCTCATTCATGTCTGCGTGGGAA 
AAAAGCGTAGAAGCCAATGCAAGCTGAAAAACTGCACAAGCATTGAACTTGAAAATGCAATCTGAGATAAATTTGTAAACGAAATCTGCGTGACC 
GATTCGTGTCAACTATTTCATACCGTACAATTTCGGCATGTGCCACAGACAGCTAATTGCCTCAATTATAAAAGTTGGTCTATCGATTTTCTTCG 
GCTAAGGCACCAGAAAAAAAATGAAAAAAAGCAACAAAAAAAACTGACAAAAACAATGAGCTTTGGTCTTCCAGAGTGGCGCGTGTGCCATTTGC 
CATCCCTTTCACTCGCTATGTCATCTAAAGGTCACCCACACCACTTGCCGAAAAATGAAGCCACTGCCAAAAAGTATCTTTTTTTCGCTCTGAAC 
TCAGCAGCTGACGTTTCCGCCTAATAGTTATTTATGTATTTATGATTTGGAATTTTTTTATTCTCGTGCTTGTGGTAAAAATAGCAGGTGGCCTA 
GAATAAATTCACACTTTTTCAAGAAGAAATTCCAACATTGGAACTCACTTCTAGCGAAATTTCACATAATTAACGATTATTAGTAAATATCTTTC 
AGATACATATGGATGGCTAAGGTAAGTTCTAAGGTAAGGCTTAAGTTCTTCAAATTCTAGACCAATTAAACTTCTTCTAATAGTCTATAATTATG 
ACTCTACCTCTAAATATTTACCCAAACACGGTTCACTAACACAATTCGCCGATATGGGGAGCGGAAAAAAAAACCGAATGCGAACCCACCCTTAA 
AAACCTTTGCCTGGTATTTTTCCACTAAGTTCACGGAACTGGGCAACTTAAAAGCGTATTCGCATATTCCTACTTGTGGAACTCACCTTTGAGTA 
GCGCTTGTCCAAGAGTCGGGTGAATCCGGAGAGGCTGCCGACCACCTCATTGTCACCAGTGCCCCAAAGGCCCAGGCGATCGCGCTGCGCGTATT 
TATCTAGTTTCGAATCGGGTTTGCTCATCTTGTCTTCCAAATTTGTTTCGCTAGTGGAGCTGTATGAACGTAGTTGCTGTTGTTGTTGCTGTTGC 
TGGTGGTGAAGGGCGTGCAAACTGCTCTGCAGCTGATTCCAGTGGTGTCGCGAGGATAACGATAACGGCAGACCCTTGGGCAGTTCCAAATCGGT 
GGCTAAGGCTATATTACCCGGATTCGAATCTCTGTCCCTATGCGTTTCCGTATCCGTATGCGGCGATGTGTCCACTTCGGCGGCACTTTGCGTTG 
TGATTCGTGTCGTTAGTTTTGCGAAATTGGAAAGTTCGTATTCGAGTGTGGTTGTCGTTGTGGTCGTATGATTTTTACACACACGACTACTACAA 
CAGCTCTGACTCTCTCTCTCATGGCTATTGTTGTTGTTGCTGTTGCTGTTTCTCTTCGTGTAACAGCCGAGCTCGCGCTCTTTGCCGCGACTATA 
TGAGGAAGGACTCTGCGAATTGTTGGCGGTGGGATAAACCGGCGTGCTGAGGGAGAGAGAGAGAGCAGACAATATGTGAATGAATGTCAATGCGG 
GAGTGACAATAACAACCAATTCGTCACCAACATTCGGATACTCTTTCTGAAAGAACCGATGTTAACATTTTTAATTGAATTTTAATCGTACACAC 
TCCACGTTCCCACACTAACAATTGTTAATAATATATATATTACATTTTAATAGTATATATATCTTATTTGAATAGTATATTATATAACAGTATAT 
AATAATATATGAATTTCACCATAAAAAGCTTTTTGTAATAGCCAAAAAGTCGTAGAGCAATAAAATGGTATTTAGGGTTAAAAGAGGCCCTTAAA 
GGGTGGGTACTCTTCGTTTCAGATTCCTTATCAAGGTTTTCAGAACCTTTGAAATGCGCTGATATGTTTGAACGTATGTGCATGGTCATATGATG 
AGGC G AT GC AT C GC A G AC ACG C CT T C CAG AT GG AAACC GAT AAGT GGT GC C AAAGT G AC AC T GATT AT T GCC AAT T C AG AAAC AC AC C C T CTT GA 
TCTGTGGTCCAATCGTATGACACCACATTATACTTGTTATTCTGAAATTCTTGCATTCGACTCACGGTCCAAAAGTTTTTCTGATACCAGAACAG 
CGTCGTAGGATCGAGTACATGGCGGTGCTCTTAAAAGTCTTCTATATTATATACTTATTAGTATATTGATTTGCCGACAATTATCTCACTGAGTA 
TTTGACTTTTGTCTGCGTTGCTGCAGCTTTCGTGCTGAACTGAACCGCAGTATTCGATTTGTTTTGCTACCACCATATGTATACACTTTGCGATT 
TTGTTTTGTTTCCGCGCTCTTTGGCTCTTTCTCTCCGATTAGTTTCAGTTGTGTGGGGTATATCAGCTGACTGATAAGGGCGGTGCAAACACTAT 
CTGCACATATAAACGCCATTAAACAACGATTATTACCATCAAATCATTTGTCTGGGACCTGGAAAACATTGCGATGCGGCGTGAATAGCATGCGA 
AAAGTTTCCGTGAGAACAGCAGCGGTCGAAATAATAGTATTCAAATTAGACTATAGAGTATTGCAGTTAGCAATTTCTTATCTACTATTATTTAT 
AATCAGTAGCTTTGATTTTAATGTAAACTTAAGTACAACGTGCGTGCATTACTTGAATTAAATAAACCGAACCTTTTCTATTAGAGATTATTAGA 
CTCGATGGCGTTAAATATAAACCAATAATACCTCCTGAACTTGTGGCATCTGACTAAGTATAAATTATTTAAAGAGACGCCTCAAGGTAGAAGCC 
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CCAAACCCCAACCTTAACCTTCAGTGTCCCGAATCGCTTCAGTTCCCGCCCCACTTTGCACTTGACTCACTCATACGACCGGTGGGCCCGTTCCG 

CTTTTGCTCGGATGTCTGTTTGATATATTCAATTATTACACAAAAATTGAAAGTAAAACACGGCAGGCGAATAAATTGGAATATATCACACATGT 

TTGTGTGTGTGCCCGAGCAACCTTCATAAATATTAACAATTTACTTCAGCTTTGCCAACGAATTACATTTTCCAAGTACCTTCGTGAGCAAAACA 

CACCTGTCGCTCTAACTTTTTTTTTTCTTTTTACTTCGAAAATCGTTAATTAAGAAGGTGGTGCTCTAATTTTTTATCGATGGCGTGTGTCAGAA 

GTTATTCTTTTTTATCGCAACGCCGACGAGTCGCCTGTAAATCGTTGATAAAACACTAGTAAAACGAATCATTGACCACT 

(SEQ ID NO: 1498) 

Exon: 8675.. 8521 
Exon: 7931. .7307 
Exon: 3036.. 2108 
Exon: 2051.. 1001 

Start ATG: 8570 (Reverse strand: CAT) 
Transcript No. : CT28469 

AAAGCTGCAGCAACGCAGACAAAAGTCAAATACTCAGTGAGATAATTGTCGGCAAATCAATATACTAATAAGTATATAATATAGAAGACTTTTAA 
GAGCACCGCCATGTACTCGATCCTACGACGCTGTTCTGGTATCAGAAAAACTTTTGGACCCACGCCGGTTTATCCCACCGCCAACAATTCGCAGA 
GTCCTTCCTCATATAGTCGCGGCAAAGAGCGCGAGCTCGGCTGTTACACGAAGAGAAACAGCAACAGCAACAACAACAATAGCCATGAGAGAGAG 
AGTCAGAGCTGTTGTAGTAGTCGTGTGTGTAAAAATCATACGACCACAACGACAACCACACTCGAATACGAACTTTCCAATTTCGCAAAACTAAC 
GACACGAATCACAACGCAAAGTGCCGCCGAAGTGGACACATCGCCGCATACGGATACGGAAACGCATAGGGACAGAGATTCGAATCCGGGTAATA 
TAGCCTTAGCCACCGATTTGGAACTGCCCAAGGGTCTGCCGTTATCGTTATCCTCGCGACACCACTGGAATCAGCTGCAGAGCAGTTTGCACGCC 
CTTCACCACCAGCAACAGCAACAACAACAGCAACTACGTTCATACAGCTCCACTAGCGAAACAAATTTGGAAGACAAGATGAGCAAACCCGATTC 
GAAACTAGATAAATACGCGCAGCGCGATCGCCTGGGCCTTTGGGGCACTGGTGACAATGAGGTGGTCGGCAGCCTCTCCGGATTCACCCGACTCT 
TGGACAAGCGCTACTCAAAGGGCCTGGCCTTCACACACGAGGAGCGCCAGCAGTTGGGCATCCATGGCATGCTGCCCTATGTGGTCCGTGAGCCC 
AGTGAGCAGGTGGAGCACTGCCGCGCTCTGCTGGCGCGACTGGATCAGGATCTGGACAAGTACATGTACCTGATCAGCCTATCGGAGCGGAACGA 
GCGTCTGTTCTACAACGTGCTCAGCTCAGACATCGCCTACATGATGCCACTGGTGTACACGCCCACCGTGGGATTGGCCTGCCAGCGCTACAGTT 
TGATCCACCAGAACGCCAAGGGCATGTTCATATCCATCAAGGACAAGGGACACATCTACGACGTGCTAAAGAACTGGCCGGAAACGGATGTGCGT 
GCCATCGTTGTCACGGACGGCGAGCGCATCCTGGGACTGGGAGATCTGGGCGCCAACGGAATGGGTATACCCGTGGGCAAACTGTCCCTGTATAC 
GGCCTTGGCGGGCATTAAGCCATCGCAGTGCCTGCCCATCACCTTGGATGTGGGCACCAATACCGAATCCATCCTGGAGGATCCCCTGTACATCG 
GTCTGCGCGAACGCAGGGCCACTGGAGATCTGTACGATGAGTTCATCGATGAGTTCATGCATGCCTGCGTTCGTCGCTTTGGTCAAAACTGCCTA 
ATCCAGTTCGAGGACTTTGCCAACGCCAATGCCTTCAGGCTGTTGTCCAAATACCGCGACTCCTTCTGCACCTTCAACGACGATATTCAAGGAAC 
CGCGTCGGTGGCCGTGGCTGGTCTGCTGGCCTCGCTAAAGATCAAGAAGACCCAGCTGAAGGATAACACGCTGTTGTTCCTGGGCGCCGGAGAAG 
CGGCTCTTGGTATTGCCAACCTGTGCCTGATGGCCATGAAGGTGGAGGGTCTCACCGAGGAGGAGGCCAAGGCCCGCATCTGGATGGTGGATAGC 
CGTGGTGTCATCACCCGCGATCGTCCAAAGGGCGGACTCACCGAACACAAGCTGCACTTTGCCCAGCTGCACGAACCCATCGATACTTTGGCAGA 
GGCGGTGCGAAAGGTGCGTCCCAATGTCCTGATTGGAGCGGCTGCGCAGGGCGGCGCCTTCAACCAGGAGATCCTTGAGCTGATGGCCGATATTA 
ATGAGACGCCGATCATCTTTGCACTGTCCAATCCGACCAGCAAGGCGGAGTGCACCGCCGAGGAGGCGTATACGTACACCAAGGGGCGCTGCATC 
TTCGCCAGCGGTTCGCCTTTTGCTCCTGTGACGTACAACAACAAGAAGTTCTATCCGGGTCAGGGCAACAACTCGTACATTTTCCCTGGCGTGGC 
ACTGGGTGTTCTGTGTGCCGGCATGCTGAACATTCCCGAGCAGGTATTCTTGGTCGCCGCCGAGCGCTTGGCGGAGCTGGTCTCCAAGGACGACC 
TGGCCAAGGGCAGCTTGTATCCACCACTCAGCTCCATTGTCAGCTGTTCGATGGCCATTGCCGAAAGGATTGTGGAGTACGCCTACAAAAACGGA 
TTGGCCACTGTTCGTCCAGAGCCGGTCAATAAGCTGGCGTTCATCAAGGCCCAGATGTACGATCTGGACTATCCCCGATCCGTGCCCGCCACCTA 
TAAGATGTAGATGATGGCCAGATGATGAGGATCCATTCCGCCTAACCCCAGAAACCAAAAGGAGTGGCCATCAAAAGATATCCGGCAAGGGCGGC 
GAGCAGTAACCATGTTATTTATTTTATAATGTCGCACATTTGTCGTCTAGCATAATATCCAAATTTGTATCGCGGCTAATTAACCACAACCACAC 
ACTATCCACCAACAACCATATTATAAAAAAAAACCAACTAAAATGGAATGTCATCAGCTTGCGGCGCGCGATTTTGTATAATGCTAACTATGTAA 
AT G GG AT AT T G AT C T AATAG AT ATGT AAAC AAATT T T A T GT AA T CT T G AA C AA C ACAAAC T AT T C T AAGG AT AT AT ACAAAT AAAG AAAT AAAC A 
AAAAT 

(SEQ ID NO: 1499) 

Start ATG: 106 (Reverse strand: CAT) 

MYSILRRCSGIRKTFGPTPVyPTAWNSQSPSSYSRGKERELGCYTKRWSNSNNNNSHERESQSCCSSRVCKNHTTTTTTTLEYELSNFAKLTTRI 
TTQSAAEVDTSPHTDTETHRDRDSNPGNIALATDLELPKGLPLSLSSRHHVJNQLQSSLHALHHQQQQQQQQLRSYSSTSETNLEDKMSKPDSKLD 
KYAQRDRLGLWGTGDNEVVGSLSGFTRLLDKRYSKGLAFTHEERQQLGIHGMLPYVVREPSEQVEHCRALLARLDQDLDKYMYLISLSERNERLF 
YNVLSSDIAY^PLVYTPTVGLACQRYSLIHQWAKGMFISIKDKGHIYDVLKNWPETDVRAIVVTDGERILGLGDLGAMGMGIPVGKLSLYTALA 
GIKPSQCLPITLDVGTNTESILEDPLYIGLRERRATGDLYDEFIDEFMHACVRRFGQNCLIQFEDFANANAFRLLSKYRDSFCTFNDDIQGTASV 
AVAGLLASLKIKKTQLKDNTLLFLGAGEAALGIANLCLMAMKVEGLTEEEAKARIWMVDSRGVITRDRPKGGLTEHKLHFAQLHEPIDTLAEAVR 
KVRPNVLIGAAAQGGAFKIQEILELMADINETPIIFALSNPTSKAECTAEEAYTYTKGRCIFASGSPFAPVTYNNKKFYPGQGNNSYIFPGVALGV 
LCAGMLNIPEQVFLVAAERLAELVSKDDLAKGSLYPPLSSIVSCSMAIAERIVEYAYKNGLATVRPEPVNKLAFIKAQMYDLDYPRSVPATYKM* 
(SEQ ID NO: 1500) 

Name: MALATE OXIDOREDUCTASE HOMOLOG 
Classification: enzyme 



Celera Sequence No. : 142000013384351 

TCACGTTCTCCTGGCAAAATGAAACAAACGCAGTGCCGCTGCAAAACTAGATATTCTCCGTCATTTGCGATTCCTGCATTAATGAAGCACTCGCT 
GTGCGGCGCGCGAATGTGCGATAATTTATTAGTGCCACATTTGCGACACAGATTTCGATTTCGGACCGCCGCCGGTGGCCACCAAGTCAATTCAA 
TTTCCTCACGCGCACACCGCCTGACCCCTTTTGGCCACAAATAATAATGCCGGGCCTATGTCAGTCTATAAGTCTATCAGCACCAAATTTCGCCA 
AATCCCGTGGCCCATCGACACGATCCATTTAGCCACCCATCCATCCATCCGTCATCCGCCACCTGTTGCTCGAATCGCACCAGCTGAACTTCGAA 
TCCGCGCGAATGTCGGGCATGCAAAAGAGGTGACGACCCTGATCCTCCTCTGAGTCGCTGGTGAAATCAATTGTATTTCACAAAACAAATAGAGC 
ACCGACCGTATTCGAAAACAATTTGTTTCTATCACAAACACTCGACACGCGTCGTCTTGTTTGCATTCCAGTCCGCTCCACTTCCAATCCGACTG 
TCCGCAGCTTCGCGCGCGTCTTGAAAGTCCCGCACTCGCAAACACACTCGCGTCCAGTGGGACCGCGCCGAAAGCGCTGGAAAATACCAAGCGAC 
TTGCAAATAAATACCACAGTTCTCAACACCATTGCAGTTTCAGAAAATACCACTTATATTTCTATACCACCAAATTCAAATTAATTATTAAAAGC 
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AAAATCAAAATCAAAATGCACGCTAGACAACCTCTATAAGATAGATAGTAAGATCTTTCTTTTCTTAGCATTTTTGCATTTCACCGACGCGAATT 
TGACCCTTATCGATTTTTTCGGAATACCTTGCTTACTTACTTTAAAAACGTAACAAAAATTACAATAATAAATGTTTCAATTTAATTTGTGTATA 
ATATATTATTTATTTAATACGTCATTTGGTGAAACTAATTAAATTCACTGCAGCACTTCAGTTTAACTTTCACTGAAAACCAGAACCCCACATTA 
CACTGCGCTGATAAGGTCCACTCTAGCGGAAAATGCGTTAGTGCGTGTTGTTTACCGGCATGTCCGAGGTCGAAAGAGCTCTGGATGTTCTGCTC 
CAGGAAGCGGAGGAGCTGTGCATCGGGAGCAGCGTAGTGGAGCTGGACAGGATTCCAACCGCCTTGGAGTTTTGTCGGGAATTCTACTCCAAAAA 
CCAGCCGGTGGTCATTCGCAAGGCACTCAATTGGCCCGCGATTGGCAAATGGACGCCAAAGTACCTGATCGAGGCTCTCGGGGACAGGAGCGTGG 
ATGTGGCAATCACCCCGAATGGCTATGCCGACGGCTTGGCCACTCAAAATGGACAGGAGTACTTCGTTTTGCCGCTGGAGACGAAAATGAAGCTG 
TCCGAGGTGGTCCGCCGCCTGGATGATCCCACTGGTGCTGTCCACTACATACAAAAGCAGAACTCTAACCTCAGTGTGGACCTACCGGAACTGGC 
TGCTGATCTGCGGGTCAGTGATCTCGACTTTGCGCAGCAGTCCTTTAACAAACCGCCTGATGCTGTTAACTTTTGGCTGGGCGACGAGCGCGCCG 
TTACCTCCATGCACAAGGATCCATACGAGAACGTGTACTGCGTGATATCCGGTCACAAGGACTTCGTTCTCATCCCGCCGCACCAGCTTAGCTGC 
GTACCGCGAGGAATTTACCCTACAGGCGTTTACSlAGACCTCAGATAGTGGTCAGTTCTACATTGAACCGTTGAGGGATGAGGAAGGCAGCGATCA 
GTTCACAGAATGGGTCAGCGTTGATCCCTTGTCACCCGACCTGGCCAAATATCCGGAGTACGCCCGGGCCAAGCCTCTGAAAGTACGTGTTCACG 
CGGGAGACATCCTATACCTGCCCAACTACTGGTTCCACCATGTGTCCCAAAGCCACAAGTGCATCGCCGTGAACTTCTGGTACGATCTGGACTAC 
GATAGCCGGTACTGCTACTACCGCATGCTGGAGCAGATGACGTCCGCAAGGAGTGGCTAATAGAATCCGGAGAAGTATTTATTCTCAGGGATTGT 
TAGGTGAATTCAAAACCCTTCTTGTTTTCAGTGTATTTATATATATTATTGTTACATAATTTAGTGTTAAATAAAAAACTGATACCCAAAATGAG 
TTGCGAACTACGACTAAGGACACGACCATAAAAGATTTATCTTTTCCGAAATTCGTAAATGAAATATGAAATATCCGTACAGATGATGACTTCTT 
AAATAAGGAATGTTTGCTTTGTTGGGTTATTCTCTAGTCTATTCGAGACTGACGTCCTGTTTTTCTGCGCCGGAAGGCTCCGCTGCAGCCTGTCC 
ATCTGCCTCCTCGTCGGGATTTTCGTTGTTCTCGCACTCCAGTTCGCCATTTCCCGCGGAGGAGACGTTCTTTTGGTCCGCCTCGGCCAAGTCTG 
CATCGTTATTCGAGTCCTGCGACTGGGATTGGGACTGTTTGCGCACACGCTGGTTCTTCAGCTTGGCCGGACTGGCTCCAGAAGGTGTCACCTTC 
GACTTGGTCTTGCAGCTATCCTGCGGCTGTTCCACCACAAACAGCTGCTCCAAATAAGAACGGTACCAAAGCAGGAACTCGTCCCGCTGTCGTGG 
TATGTCCATCTTCTTGATCTTGGCATATATGCGTTTGGGCGAAATTAGTGGCGAGCTGGCGGTCGAATTCTGGGCAGCAGTGGCCGTCAGCTTTG 
ATACCTCTGCCAGTAGGGAGTCATTGATGACGGTGTTGTTGTGCGTCTGATTGCGAATGCGTTTGATAATGTACTCGATTAGTTCGTGATCCGGT 
GCCAGTTCTATGCTGGTCCCTGGTCCrGATCCCGAGCCAGAGGCTTGTTCTAGTTTTCCCTCGAGGATTTGCTGGTAAGGAAGACGCAATTATGA 
GTTAATCGATCCAGTTTACGAAAGCCTTTCTTACCTCTTCGTTCTCGGTCCTAAGAGAAGAGATAACAGCAGAAGCACTGTCCTCCATTTCATCC 
AGGTGCTGCCCGTTC 
ESEQ ID NO: 1501) 

Exon: 1001. .2055 
Start ATG: 1105 

Transcript No. : CT28505 

CAGCACTTCAGTTTAACTTTCACTGAAAACCAGAACCCCACATTACACTGCGCTGATAAGGTCCACTCTAGCGGAAAATGCGTTAGTGCGTGTTG 
TTTACCGGCATGTCCGAGGTCGAAAGAGCTCTGGATGTTCTGCTCCAGGAAGCGGAGGAGCTGTGCATCGGGAGCAGCGTAGTGGAGCTGGACAG 
GATTCCAACCGCCTTGGAGTTTTGTCGGGAATTCTACTCCAAAAACCAGCCGGTGGTCATTCGCAAGGCACTCAATTGGCCCGCGATTGGCAAAT 
GGACGCCAAAGTACCTGATCGAGGCTCTCGGGGACAGGAGCGTGGATGTGGCAATCACCCCGAATGGCTATGCCGACGGCTTGGCCACTCAAAAT 
GGACAGGAGTACTTCGTTTTGCCGCTGGAGACGAAAATGAAGCTGTCCGAGGTGGTCCGCCGCCTGGATGATCCCACTGGTGCTGTCCACTACAT 
ACAAAAGCAGAACTCTAACCTCAGTGTGGACCTACCGGAACTGGCTGCTGATCTGCGGGTCAGTGATCTCGACTTTGCGCAGCAGTCCTTTAACA 
AACCGCCTGATGCTGTTAACTTTTGGCTGGGCGACGAGCGCGCCGTTACCTCCATGCACAAGGATCCATACGAGAACGTGTACTGCGTGATATCC 
GGTCACAAGGACTTCGTTCTCATCCCGCCGCACCAGCTTAGCTGCGTACCGCGAGGAATTTACCCTACAGGCGTTTACAAGACCTCAGATAGTGG 
TCAGTTCTACATTGAACCGTTGAGGGATGAGGAAGGCAGCGATCAGTTCACAGAATGGGTCAGCGTTGATCCCTTGTCACCCGACCTGGCCAAAT 
ATCCGGAGTACGCCCGGGCCAAGCCTCTGAAAGTACGTGTTCACGCGGGAGACATCCTATACCTGCCCAACTACTGGTTCCACCATGTGTCCCAA 
AGCCACAAGTGCATCGCCGTGAACTTCTGGTACGATCTGGACTACGATAGCCGGTACTGCTACTACCGCATGCTGGAGCAGATGACGTCCGCAAG 
GAGTGGC TAA 
(SEQ ID NO: 1502) 

Start ATG: 105 

MSEVERALDVLLQEAEELCIGSSVVELDRIPTALEFCREFYSKNQPVVIRKALNWPAIGKWTPKYLIEALGDRSVDVAITPNGYADGLATQNGQE 

YFVLPLETKMKLSEVVRRLDDPTGAVHYIQKQNSNLSVDLPELAADLRVSDLDFAQQSFKKPPDAVNFWLGDERAVTSMHKDPYENVYCVISGHK 

DFVLIPPHQLSCVPRGIYPTGVYKTSDSGQFYIEPLRDEEGSDQFTEWVSVDPLSPDLAKYPEYARAKPLKVRVHAGDILYLPNYWFHHVSQSHK 

CIAVNFWYDLDYDSRYCYYRMLEQMTSARSG* 

(SEQ ID NO: 1503) 



Celera Sequence No. : 142000013384598 

TTTTAAGTAGTTTACCCAACTCCTCCGCCTCCTGACTCCTGGATTTTAAGCAGATGGAGCCCGGAAATGAGATATCACACACGACTTAATCGCAG 
GAACACGAACAGGAAACAGATGGAATCGTCGATGATGGAAGTCTACTACGTATGAGTAGGTATAAGTTATTACCACTGAAGTATTAAAAAACCAC 
ATTATTAAGTAATGACAGAACACCTTCTGTGAATTACACGGTCATGTTTCATACAAATACATAGCTTGCATATCAACGATTTCCTTATAAAAGAT 
ATTCTTAGTTAGAAAGAAAACTTTCACCCAATAGCAGTTTTCTCCATTAACTTTGGTATATGAACAACTAAAACAGTATTAACTGTTTATTTTAT 
TACTGTACTTACTATTTACTTACTTATCAAGTCCTCGAGCATTTTTAACATTTGAAAAATAAGAACCAGTGCTAAGGAATAATTATTCTTCGCTA 
TGCACATGCAATATTAATCTGCTCTTACCCAATTGGATAATATATTTACGTACAATAAGATCTTTATGAATGCATCATTGCAAATGCAGATGGTT 
AACTAAGTTTGTAATTCACAAATATTCTTCCAAAGGGGCTTTACTCAACATTTAATGTCACTGACCATTCAGCCAAGTAAATCACCTTCCCAAAC 
TAATTTACCAGTGATTAATTGGGCTAGCAAGTGTAGGTGATTGGAGTATACTAGCCTTATCTACCTTCGGATCATTCCTTATCAAAAGCACGGTT 
GTTTGGGAGAGCCCGCCAAAAAGCCGGACTCTATATGTAGACGGAAAGCTGCTTGATGGAGGGTCTACTGACCCGGTGTCTTCTTACAATCACTG 
TTTCTCAAATAAATTGGTAGGGACAACACACTAAATACACTGTAGATTTGAAACCAAGATGGTTCCACATTAGAAGGGCCCGGTGACATTAGTGG 
ATGTAGTTACTTGTTTGTAGGTAGTAGCTGTTTCCCAATGAAGCACGCTGTTTATTTGAAATTAAAAAAGAACTGGATGCATATATTTCGTCTGC 
TTATCTTGGCGATTCCTCGACGCCAGAGCGGAACGCACTGGGGCGCGAGTCCGGCTAGATTGGTAGACTCCATATGGAGATGTCAGGTGGTGCAC 
CCAGTGCTCCTCTCCTCCGGCTGGACGAGACCTCTACTTCCTGTTCTGGTTGTTGCGGGCCATGTCCTTGGGGAAGGAACGCAGTGGCGGCGTGC 
GGATGCGGCGGGCCTCGCGCGAACGGTTGCGCACCACCTTGGAGTGGATTGCGCAGGACACGCAGTAGTGGAGCTTCGCATAGAGCTTGGGCAGC 
ACGTACGAGTCCCAGATGCTGGCCTCCGTGATGTCGCGCACGGCGGCAGCCTCCACGATGTTGCGGATGACGAACTTCTTGATTGCCTTGTCCTT 
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GGGCACGCAGCGGGCGCAGTTGGTGCAGCGCACGGGCTTCACATGGCCGCGATTGTGCTTGTTGCGTCCTCCGTTACGGCGCTTCTTGGTCATCT 
TGGCGGCTGCTTGTCAGTGGGGGAAACAGTTTGCACGGCCATTAGCATTTAGCACTGCCTTTGCTCAGCGGATGGCTGCCATAATTACCTTATTT 
ATCAATTTCAAAATCCGCCCAGCGAAATCCGCGTGTGTTCACGTTTCGGGAAACGAAAAGAAAGAAAATAGTGTGGACGCACCCGGCGAAACTCG 
GATAAACGGCCTAAAAAATACCTAGCCAGAGCGACTAGCGTGTTTGGCAACCAGGGCTGACATTTTTACCCGCTAAAACTGGTTTGGTATTCCAA 
GTTCTTTGCTTTCCTTTTTAATCTATCAGACTCTTTTAATATATTGGAAATTCGAACTGCTTACACTAAAAAATGTCACCTGAAGAATATTAGTA 
AAAACGATAAAGGTTACACATTTAGCTATATATCCATATCTTATTAGCTAGATACAGCTTTAATCTGATTAGAAGTTGGCAACGCAACATGGTGT 
GGCCAGGTTCCACATCGAAGTAAAATTGTTTTTAGCGGGACATTTCTTTAAAAAAGAGATGAATAACATAACTATAAATTGCTTAATTTAATGTA 
CTAAATATAATATATCCTTCGATATTCCTGAAATTACCCGTATTTTTATAGCAATCTTACGATTTTAAATTTGTTTTAGAAATTTACTTGTAAAT 
TACATTACTAAATTTGGTATAACAATAAATAATCTTCCGTTCAAAAAATTCCAGACAACTTCAAAAACGTATAATTTAGACTACAGCTTTTTAAG 
AAAATTTGTATAAAATTTCAACAAGAAAAAAAATTAATAATAGGCGCAAGCAAAGCCTAAATTGAATTTTTTAAAGTTATTTTTTTTAATTAAGC 
AT CAAAT TTT AGAT TTTTTT AAAAAAT T TAAAACGAAT AAGTCAGT AAAT AAAT AT T TTTTGT ATCGGTGAAAATAAAAATGGAATTATAATAT A 
TTCATTATTATAAATGATAAATATTTATAAGTGTTAAAGACCTTATATAGTATATAATCTGGTAATGTTCGTCCGTCTGTTCGTATGAACGAAGT 
G AAATC AAG AAT TAT T AC AAGC T AGAAG T T T AAAATT AAGC AT ACT T AT T AT AAAAAC GGAAAT TTTTTT GG AAG AT G T CCC AC AAAC AT CC C AA 
ATCAGTTATTC 
(SEQ ID NO: 1504) 

Exon: 1671 ..1609 
Exon: 1528.. 1001 

Start ATG: 1518 (Reverse strand: CAT) 
Transcript No. : CT28945 

T C GT TT C CCGAAACGT GAAC AC ACGC GG AT T T CGC T GGG C GG AT TTT GAAAT TG AT AAAT AAG AGC CGC C AAG A TG AC C AAGAAGCGC C GT AACG 

GAGGACGCAACAAGCACAATCGCGGCCATGTGAAGCCCGTGCGCTGCACCAACTGCGCCCGCTGCGTGCCCAAGGACAAGGCAATCAAGAAGTTC 

GTCATCCGCAACATCGTGGAGGCTGCCGCCGTGCGCGACATCACGGAGGCCAGCATCTGGGACTCGTACGTGCTGCCCAAGCTCTATGCGAAGCT 

CCACTACTGCGTGTCCTGCGCAATCCACTCCAAGGTGGTGCGCAACCGTTCGCGCGAGGCCCGCCGCATCCGCACGCCGCCACTGCGTTCCTTCC 

CCAAGGACATGGCCCGCAACAACCAGAACAGGAAGTAGAGGTCTCGTCCAGCCGGAGGAGAGGAGCACTGGGTGCACCACCTGACATCTCCATAT 

GGAGTCTACCAATCTAGCCGGACTCGCGCCCCAGTGCGTTCCGCTCTGGCGTCGAGGAATCGCCAAGATAAGCAGACGAAATATATGCATCCAGT 

TCTTTTTTAATTT CAAAT AAA 

(SEQ ID NO: 1505) 

Start ATG: 74 (Reverse strand: CAT) 

MTKKRRNGGRNKHNRGHVKPVRCTNCARCVPKDKAIKKFVIRNIVEAAAVRDITEASIWDSYVLPKLYAKLHYCVSCAIHSKVVRNRSREARRIR 
TPPLRSFPKDMARNNQNRK* 
(SEQ ID NO: 1506) 

Classification : ribosomal_protein 
Gene Symbol: RpS26 
FlyBase ID: FBgn0004413 



Celera Sequence No. : 142000013384 600 

TCTTTTTAGTAGTACATTTAATGATCACCTAACAGGCAACAAAAATCCTAAAAGGCACTCACTAATAGATTCCCTTCTCGCCAACAGGGATGACC 
GCCTTCGTACTGTGCATTCTGGCCATGTCCTCGGGACTGGCTGCACTTTTCTCCAGCCGCATCAAGAAGCTAATCACACCATTGCTGAACAAGAC 
GTTCCACAACTTCCTGGGATTCGCGTGCTTTGTGATCGCACTGGTGACCCAGTACTATGGCTACCAGACGGGCTACTTTAAGAGCCGAAGTGAGA 
CGGATTTCCAAATCCTGATGAAGTGCCTCACCCTCATATCGTTGGTCCTGTCGAGCTACGGACCGATGAAGGCACTCTATCAGAAATGCAAAAAT 
ATATCGCAACAGTTTTAGATAGGATGAAAAGAATCACTCGAATGTATGTAGATAAATAAATGTAATTTTTTCTAAAACCCCATTTTATCGACCTA 
TCCAACTATACCAACAACGATTCAAAATTTTATACCTGTTTATTGGTACAACGTGTGATCCATAAACATGACTAAGTATTTTTACAATGCCTCAT 
TAGGGAAAACCCGGTTAACCAGCGGTCAGACTTTCATCGTCCGGTACTCCCCCCTGTAAAAGTTATATAACCACCCTTTTGATTGATTTGATTTG 
ATAAGGTCTGAAAAAGTAATGAATTGACTATTAGATACACTGCAATTTGCATTTAAGTTTAACCGCTCACTCAAAAAGCCAGCATTTGAATTAAT 
CAATCAAAATTATGATCATTTATAATTTAAGTTATACTATTATTCATTTTCCTATAACAGCATTGGGTAGCATGCATCATTTTAAAATCCAAAAC 
ATGACCGGGTTTTTTATCTGATCGGTTTGTGACACTTGGAGGCCGCATAATTGATAGTAACTGTCCCAAATAAGTCCAAACTTTTGAAGCCCCGT 
AAAAAGAAGATCTTGACTTCCGGCACCTTAAATTAAAAAACCGTGTATTTTGCATGGTGCAGTTTCGTATGTTTATTTAGAAAATCCATATCATT 
TCTTCTTAAATCTCTCGCAATAAATAATACTTGGCGCGTGTGTCGCGTTGTGTGTGTGATACGTATATCAGTTAGCAGTAATAATAAATAGCTAG 
CGTTAGGTATTAAAATTCATAGTCAATTGGGTAATACAAGGCTGTCTGGCTTTAAAACCCCCTTTCTGGGTAACTTTTGGTAAGAGAAAGTCAAG 
ATCTCTATATAGAATCGGGTGAACAGATCTTTAGCATGCGGTAGGTAAATAGGAGTTATCAGCTATCGACTAAAGCCCAGTTAAGAATTGTGTGT 
TGGGCAAGGCATATTCGAAATTCGACTTTCTATATCTTTTCATAGTTTCCATAGTTCGCTCCGCTCACAGCTTGTTGCTGCGTCGAATGGCGAAG 
GACATTAGCGCGTGCTCCAGCCGCATGTGGTAGGGTAGCTTGCGCTCCAGAATGACTGGCTCCTGGCCATCTGGACAGCACCGGTTGCGGCAGCA 
CTCGCACTGGCCGTCGTCATCGTCGTATTCCTGCTGCAGTTCCTGCTCCAGTTCGTAGGCCTTCTTGGCAGCCCGCTGCTCGAAGCGCAAATAGA 
AGGCCAGGCAGACAAACAGCATAAATACCAGGGCACACATGGCTCCACTGGCAATGCTCGCCTTCTCCAACATCGGACGCGGCCTCTTGGGCAGG 
TAAGTATTGTCGATCTCCTTGTATTCGCATCGCTGTCCCATAAAGCCAATCGCGCACTCGCAGCTGTAAACCGGTAGATCGGCTATCTTCACCGC 
AAAGCAATGGGCATCGTTCAAACAGTACCAGGCATCGAAGGTTTCCGGACATTTGTATGTGGGGAATGTAATATTGGGCCTGGGCGTGGTCGTCG 
TGGTGGTGCGCATTGTGGTGCTACTAGTGACTGGAGCCTGGGTCGGCGGCAATGCCGTGCCGGACATGGACGAGGAGATCGAAGAGCGCGGCTTG 
GGCACCGTACGGCTGGAGCAGGCCTCGACGTGCCACGGATGTGCCAGACAGCCGATCAGTACCAGGGCAACCAGCCCATGTTGTACACTCATTGT 
GGAATGCATTACAATTTGTAGCTTTTTAGCGTAGCGTATCGGCTTGTTCGTTTTTTGGCGTGCAACGTTTGGTTGTTTAAAAGCTTAAGAATATT 
AAAACTAGTGACTATCACATAAATAAATTAGAATATTCGTTGTCCGCTGCTAGGGCGTCGTCAAATCTCTTTGGAATCGGGTTTCTCTACATAGG 
TATGCTGCTGGTGGAACGTAAACGTTGAGTTAGGGGTGATGTGTGTATCTGTGGGGAAGATGGAAAGACATTAGTGGCCAGTAGTCTCCAAGCAC 
CATCGATGGTTTTTAACTCTAAAACTGTGTACTCAACAGAAGGGGGAAAATCAAACTGCGTAAAATCATAATTATATATGTATGTATTTATAAAA 
GTATAAACTATAATAGATCGCTTTGAATAGTAGTTATGTTACAATCAGCAAATTTTACTCCTTTCTTCACATCCAATTCACTTTCGGTTTAAAAT 
TTCATTTACGTTGTCTCCCGTTGATTATCCTAGCTCATCAAAGCTGGCTGTGATATGCGTTGGGTATCCATATCACAACCACTTGATGAGGCTTA 
GATAATCAACATCGTACAGGTAAGGAACAGTTTCCTTAACTTTATGAATTAGTAACATGGTTAGTAACAATATTGCACCGCATGTGTGTAAGTCA 
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GTTCGATAGGGCCCCCTAAGTCTGAGGTGCCAGTCCACCAGTTGAGCCATTTTTCGGTGCTGTGGCATGCATCTGTCCCCCGTTGTTCCGAATTC 
AGATCTCTGTTAGCCGCGTTTATTTTCCGATTGCATTTCACTACAAATATTATACAGATTTGAGGCGGACATAGTGTGTTTTATTTATGATGGGC 
GTGTGAATTATGTGGCGGGAGGTATTATTATGATTTGATGTACGATACATTACGCTAAGCTCATCAAAGCTGGCTGTGATATGCCGCGCAAAGCG 
GAAATGCATTTCACAACCACTTTGAGAGCCCAGCATGAATGCGCCAATATAGTTTTATTGATTGAGGTTTCGGGATACTTGACTGCGGGAGTTCA 
GAATTTTCAATTTTGGTTTTCATGCTCATGTAATGATATAAACAAAATCAGACACTTGTTACCGTTTTTATACTTTTTGTGTTCAATTATGCTTT 
TTTTTTGGAATTTTCAAATTGTTTTTTGATTGATTTTTTTTTTGCCTTGAGCATCGCGTCGCTCCTCAAAGCTGGCTGTGATATGAATAGGCATA 
AAAAGGC AAAC AT TTCACAACCACTTT GAAGAATG AC AC AAT TCTCCTGTTTCGATTC GAT G AG AGCC G AGGTG C AG ATT C AG AT TC AG AT T C T T 
CTAGGGGTCTTTTTGCAACGTTCTCACGGCCGTCGGCTTTGAGCCTTGTACTTTTTTGCAGCCACGGCACAATAAACAAATATGAAAAAAGTACG 
TAAATAAATAAATGCGACGACGAGACTACAACACGATGATAATGACGATCATGACGACGACGACGATAATGATGATTATGATGATGAGACTACTG 
GTACTGGTGATAGATAATACGCAAATGTTGATTGCAATGAATGGCGACAGAGCGAGCGCAAGTGAATGGGACAATTTCGTACATATCTACGCTAT 
AGACTGGCCATTGTTTCAACAACAACAACAAAATTCAACAGACACAAACGCACACAACACCGAAGAACAACACTGAAAGAGGGCAAAAAGCAATG 
CACCCCAAATAATTATAATAAAAATGTAAACATAATTTTGGCGCCAGCCTTTGGCAATTTGACGCCAAATTTGTTTTTAGATGATTTCTGAGTTC 
AAGTTCGGGAATTCAAACTGATAAGAAATGCAAACGATGAATAACAAAACGGAAGAACCGAATTCAGTTGATCCTCCAGAAGATCATTTAAATAA 
T C GAT T T T AT C G AAAAATT T T AAATGC ATT ACTGAC G AAGT TC AAT C AC T T T AAAA T GT GAG CAAAAC AAAAAC T T T T T AT AT G AAAAG ACG AAT 
GCTGTTCACCATTTCGAGCGCCCACTGTGGTGCAATGCAAAAACAGTTATACGTTGGGTGTGGAGACTTATTTATGTATTAGAATGCTTTTACGG 
TTAACAATTGCTGCACTCTCTACCCTCTCTTTCTCGCTCACGCGCGGCAAGCGGGCAGCTTATTAAAATTTCCGTTATAAAAATTAAATCCACCG 
CCGCTGCTTTTATGTTCATAAATACATGTGATACGTCGCATGCATGTACACATGCACAAATGTATGTATATACACATGTATGTGCGTTTGTGTGC 
G AG AGG AAGTC G AAGT AAGG T AAT G AG CGT AAAGT C AAGTT G AAT AGC C AT AT AAAATG C GC T G AAC T T GC GAGC GT GT A GGT CAAG AT C AGC GG 
CGGCCGTTTATAATATTTTGTTGGTTTTCGCAAATAGCAATTGACGAAAGTTCGATGATCTTGTAGTACATTTCAATAATGGGGGAATGAGCCAA 
CT AGC ACG AT AGC AAC AAC AGC AAT A T AT AC AAGC A T C AC AAT AAT T AC AAC AG C AAC AAC AAC AACGAT C AAC T AT AT T AG AGT GT GG AC G GC A 
AAAAGTGTGCATACATACATGACAACAAAAACACCCGCTAATAAA2W^TACTGTTGTGGCTGCTTCTTCTTCTGCTCTTGCGTCTTTTACAACTT 
CTTCGGCAAGCCAGTTGTTGTTGCTGTAATTTTCATGTTGCCTTGCTCTCTTTCATTCTTGGGCGTCGCTGCGAGTTCGTTGGCTCTTCTCGTTT 
CT GT AT GT GTGT GT GT T T AG AAAACT GT AAGG AAAGC AAAGT GTGTGAAAAATAT GG TT AAGG TAAAAAAT GC C AAT A CAG GAAC AAC AAC AC AG 
CTCGCTCTCTTTTCTCTGCCTGCAGTTTCGAAAAAAAATTTACCGGTTTTTATGTTTTCGCAAAAAAGCTCAAGAAACAAGTCCTACAAGTTTTA 
AATCGAGAGAGGAGAGAGCAGAAAAGGCGTGAGAGCCCAAAGAATGCGCATTTAGCAGAGACAACAACCTTTTGAGGTGTGTTGCGTATACGTGC 
ATATTCGCACACACACACCACCAACGCTTCAGCTCTCGCTCTCTTTTTCTCGCTTCGCCATCTCACTCTTACTCCATGCCTTTCAGGGGATAAAT 
TATGTATGTATATATGTACCATGCGTGTGTGTGTGCGCGGGAATGTTGTGAGGAATTCTAAGAATTTTGCACAAGCGACAAATGAAACGGGGCGC 
GGACCACACATACATATATGTATATTCCAAAACCAAACAAAAGTTTTAATAGTTCAAAAAGAAAATTGATTATGCGTCGGTATGCGAGAAGCGCC 
AACAGCCGAATGCGAAATATATCACACCAACACAAATACAGTGAAAGCAGAAAAGACGCAGAAGATGGGAAAATTCGAAGACACCGCGTGTAGGC 
GTTAAAACCTATTTTTTTTGCCCAACACCAAGCTAAATGAAACACTGCTCACTAAAACACCAATTCACTATGCTTTATCTTCACAAATTACCAAC 
TGTTTGCATTTAAACTTGGACAACATTAGGGGGCAGAACGCCGACAACGCACACCGCGCACACGAAAAAACACAACCGCACCGCCAAAGATTCGA 
AAACCGAATGCCGAACAACAACAAAATTGGTTCTGAAGATGGAGGTGTTTTTCAGCAGAGCGAGATTCCCTGGCTCACGCTGAGCGGGAGAGAAC 
GAGAGCGACAAAGTCTCTCCTCTAAAAATGCGGCATAATAACAACAAAGTGGCGAGCGGCTTTCGAGCTCATGCTGTATGTTGTTGTATACCCGT 
TACTTGGAGAGTTAATTGGGTATACGGTGATGGGCTCTTAGGCAAAGAATTTTTCCAAAATACACTTTGCATAATTACTCAAATAAATTGCCTCA 
GAAGCTACTTTACTCGTTTAAATCTTAAATTAATATTATTCCATTAATGTCAAGCCACACAATACAAAATTCAATAGGAATCACTCACCTATAGC 
TGTCATTGCAAACAAATGATATTACCTATTGGACTAAAGGGTATATCTGAGTAGTTACGCTTCGTAAAAAGGGCTCTCTCTTGTTTTGGTTTTTG 
TTTTGGTGTTGCCAGAAGATTTCGCACGATGTAGGATCAACATGTTATTGACACGACGTCGCCGACGCCGCCACTTTTAAGCTTGGCGGAGCAAA 
AGGCACACAAACAAGAAGCGACGCAGAGCGGCGGTGCAAAAGCACAAAAAAAAAGAGGAAAAATGCGCTCGAGAGCGGAGGGCGTGGCGGTGCGC 
AGGGGCGGGGCAGGTAGCCGGACAAACAGGACGAACAGGATGAAACCCAGACGAGTTCGGCCCCACTGCAGATTTCATGGAACTGGCCGAAGGTT 
GAGGATGCGGATGCGATGCGCAGGCGCGCAAGTAAACAACATTCCATTCCTCGGGAAATTTCTTCTGGCCGCTCGAGAAGCGCGCCAAAAATAAA 
GCTCCGACCGATGTCGTTGCCATAAATATTGCTATTTTGACTTTCCAGATTCTCGCGATTCAACGCCCCTTGAAATGGGAGCAACTTGCTGGAGA 
TTCGCGTATCAGGGCTGGAAGAGGGTTTCTCCAATAACTGCACATTCACTCAAAAATATGGTAAAAAGGTGTGCTGCAGCTTCGAGAGCTCCCAA 
GAGGTAACTTTATGTTATAAAATTACATATATTTATTTTATGTATATATTCTGGGAAAATCCATACGCGATCTAAAACATTTGGGGCGCGGTTTT 
CACAGGTAAATGAGATTTTGGAATTCCTTTACTGAAATGCCAACGTCGATGTCGTCGATTTTTTCAAAAAATCACAAACATCGATGTTTTTCGAG 
GAATCATCGATGTTTCTCCACCTCTAATAAGGCGGATTCGACCTATGTACATGTACTGCATGTATGAAAGCGGCACAACTACCCGTTTCCGGTGG 
GCAGTCGGTCCGCAATGTGGGCCGCAGACACCATTCCACAGCCAATTGACGTCACAAGCGACACAACAGCAACAACCTGGCACACGCTTTCTGGG 
TTATGTCGGAATGCATCCAAAAGCGGGAGAAAAGTAGAAAAAGGCAGGAAAAGAAAGAAACCAAGAGTCACCCGGATGGCAATTCGATTCTGGTG 
AGATTTGG ACAATC AATCGGC TTAGTTCAC AGC CC AGC GGGACGGGACTCAGT GCTACAGGTT CAACTGT AC AAT AAT ACC AAG AAG AT GACG AC 
GATGACGATGACGATGATGATGGGGGATGGGCGATGGGCAAAATGTTAATGGATTGCCCTAGGACCTGCCCTCATTCCGTTTGCCATCTTCGTCG 
CTTCGCTTCTGCATTTGTGCATTCCCATTCCCAATCCGATTCCCCCATCAAGTTGTGCTTCTTTGCCACACCGCTTTTCTGCCCCTTTTGCGGTG 
TTCTGTTGCTTTGTTGATGTTACAATTGTTGCACTTGTTGTGTGCGGCTTAGGACGTTGAACCACAGCCGGCAGAAATGCAGAAATGCACGCAAC 
TGAGTGGCAAGTGTACAGCTGCAGTCACAGCACTAGCAATGCTTTGCGAAGGATTCCTCCAGACCGCTTTGTAATTGGTTTTCTTTCGCGTAATT 
TGGCCGATATTAGTTAGTTCATATCTGCAATCTAAATGTTACGAACCAGCAGAGTTCCTAGAGCCCTAGAGACATTAATGTACTTTTAATTGCCC 
CACTGTATGAATTTTTTTCAGTGCATTCTTGCAATTTGCTAACGTTTTTACCGCCATTACCGTCATTGTCGGTATATATTTGTATGCTCATTCGG 
AAAGCCAATTCGAAGGCAATGCTAATAAAATCAATAAACAACATTACTGTCACGCACACATACATACATACATATGCATATAGCTTGGATGTCCG 
GTTGAAACGATGACGATGAAAGGTCACGACGAAGAATTTTTCGCCAATTGCCTGGTGGGTCGCACAGTCTGGGTCGCGAGACGTCTGACTAGACA 
ACATGCAAATAGTAACGGTAATCATTAGACGCACACACTCGCACCTATTTGCATGCACACTCTCTCTGGCTGTGGTTCTGGTCGTGGGCCCCCAT 
ATTTCATCTATCTCGCGGCCGCTCTTTCTTTCGCTCACCATTTTCTATGCTGAGTTCGACGGCCTTCTTACGTAAAATCAACAACAACGCTCTGC 
GGTATTTCTCTTCTAATAAGCGGGTGCTTGGGGCTTGGGGTTGGGGGGATGTGCACTCGCCGACAGAGTTGCCATATCTAGCCAGACATCAAACT 
CGTTTTCACTGTTTTGGCTCCACACACACACACACACACGCATGTGAGTGGCTTTGCAACTGTACAAGAACAATAAGTAGGGGCAGCATAAAAGC 
AACAACACACGGCGCAGACGCCAAGCAACCGCTCCCCTCCCAACGCTCCACTCAATGTTTGTCTGTCTGTCCGGAACGTCTGTTTATGTATGCTT 
TAGGCGTATACATGCATGAGTGTGTGTGAGCAATTTTTATTTTTCTGAATGTAAAGGAAGTGGCACGTACTGTACTACTCAACTCACCGCTGCCG 
CAACAACAAGAGCAGTGACAGCCGGATCAGGCTGGGAATTCGAATAAGAGCGGAGCGCGTCGCATTCGGATGTTCCGATTCCCCATCGTTCCCAG 
CCGCAAAGCGGTCAGCGTTCCAATGCCAACGGCACAGTGGTTGCGATGCGTAAGGTAACGTGCTACGGAACCGAAACTCGCGGATGATCGTAAGT 
GGGAGATATGTACGTGGGATAAGAGCGAAAGTAGACGGATATCCGATTTGCAAAAAGATGTAGTATTAATTTAAAATGTTTAGAATAACAAAAAT 
AAGCTAACAAAACTATAACCATGACAAACTATGACGATAACAATATATTTGCTATGCGTTACGATTTACAAACTTTGAATACTTTTTAAAAGATG 
CTTATTTAGGATCGATCCATTGTGCAGCGAGATGAGCGGAACGAAAGGCGAAAGGGACGCTGGTTTGCGTCTCCGCTGTACGCCAGACAGACAGA 
CAGACAAGACGCCGCAATATTATAAACAGGAAATGGAATTTACCCACACGGATGCAGCGCGGTGAAGTGTGTGTCTGTGTAAATTGCTGATGAGC 
CGACAAACGGACGTGGTGCTCAATTGACAGATGAGCACGAGAGAGACAGGCAGAGCATAGCACACCGCCGAATGAGCGAAGGAGAAGGCCAGAGC 
GCGCTTCAAAGTTGATTGAAATCAATATACGTACATACGTATGTATACACACATAGCGTGCCGGTGTGAGTGTGCTAAGACCGCTCGTGGAATGT 
TTGAAAAACGCGGTCAAGCGACGGGAAGAGCAAGAATGTTGAACTCTGGAAGTGGGAAGAAGTTTCTGTAAATAGAAGCCAGGTGGAAGGGGCTT 
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TTGCACCTGCCACGCCCAGCAGCCCCTGGAATCAATTAACATGTTGCACAACTGCAGCGGCTCCTATATACATAGTTCATATAATGTACTTGTGA 
TACAGTGACTTACGCACACAGAGGTGCAATTGCAGATGGCGACGAGTGTAGCGTGTAGTTTTTCTCGGAAATGAGCAATGAATGAAATTCGGTTG 
TTTTCAGTACTCACACGCACACGAACGCAGATGCATACACTCGCAATTTGGAGCAAGAGAGAAAGTCTGCACGACATGCGCTCTGGCGTTTGTGT 
GTATACAATTTGCTTACTTTTTGCGCTACAATTGGCAAATAAAACAATAGCGCAATTTTACGCAGCACTTTAGCTTACTAACCCACATTGATGGC 
TCCACAAAGCGAGCAAAATAATTTGTCTGCCTAATTGAAATAAAATAAATAAAACTAAGCAGAAAAAATGTCGGCAAAAAACAAGGCGCAGCTGT 
ATTGTGCGCTAGTGTGTGTAAAAAGGAGACGGCCATAGAAAAAZW^AACACATATACACACAGGAGT^ATTTACTGCTGCTTGCGGCGCCCACACA 
CACATACACAAAAACACTCGCCTATTTAGTACAGCAACTGCGTAAATTCAATAAAAATAGTCATCTGGAGTAAAAAAACCCAGTTTCCTGTCTTC 
ATTTCCCTTTTTCCGTGAAGGTACACAGTCACTACTGTTATAATTTAGCACACACACTCGCGCATTTATGTATGCATTTGGGTTGCTTTCATTTG 
GATGTTTATCACTTACTTTACCACTTACTTTGGGATTTAGCTCTTCGTTTGTCGCACTTTTTTGGCCTGTTTTCATGCCATTTTCTTTTTGAAAA 

actgaaagagtacaattcgttgtgtcgcgtcttaaatgaacaagtgttgccagactttagctggagggagatgacactatcgcgaggcagcataa 
aaatcagtgctgtaaagcgaaattaaaaaaaatatctgataactttaatattagtttatttgtttactgaattttacattttcttatgtactata 
aataaaaatcaattcataaatttatttgagaggagtaaaacaagaagaatccacctaaacattacagacttaagttctaaatatcccaaacgttg 
tttaatttctatttatatttgtaacgattagttagttattagtaataatgaaagaactagtgataaaatccatttgtaatcactgctagaattgg 
cttgaaaatagctagactgacgttaacaatactgttaccagttggcatttcctgatccggcaactccacacgacgccatttcaaaacaaatgcgc 
ggcaacgattttattttaccccgcaacatttagtgagaagtcgcaaaataagtcagattatttgattaaaatgggagcaaaggcaacaaaaatgg 
aaggt gaat atctgcc gg at ac gc cgac ac t aaa c aaaat gcg c c t g gcc agc gaaaagc g ag aac aaat gg aaat g ac g ac gc cg aat cgagag 
caaaacaacgctttgctcgatcctcgatcaccaaatggctgccgcacgcctttaaccgtgagtttataggcattaggtattttttgatattacta 
accctagtaatgatggctgcgttgtagttcttgcaggcgtcaaaacctcgcgttgaggaggaggagcagcatgtgaccaccagtgaaaatgcctt 
ctccatgctgcgaaaacgtttgctcaagggcttctccctgaacgatcctcgctctcctcagctgaatcgcactccactggtcctcgatgaggttc 
gttcgcttaatatggatgacacattcgccgatctcttcgtggacaccagggtgggtacagcacagagagcagtgc 

{SEQ ID NO: 1507) 

Exon: 10380.. 10289 
Exon: 4869.. 4674 
Exon: 2328.. 1001 

Start ATG: 2087 (Reverse strand: CAT) 
Transcript No. : CT29014 

ACACAACGAATTGTACTCTTTCAGTTTTTCAAAAAGAAAATGGCATGAAAACAGGCCAAAAAAGTGCGACAAACGAAGAGCTAAATCCCAAATTT 
TCTAAACACACACACATACAGAAACGAGAAGAGCCAACGAACTCGCAGCGACGCCCAAGAATGAAAGAGAGCAAGGCAACATGAAAATTACAGCA 
ACAACAACTGGCTTGCCGAAGAAGTTGTAAAAGACGCAAGAGCAGAAGAAGAAGCAGCCACAACAGTATTTTTTATTAGCGGGTGTTTTTGTTGT 
CATATACACACATCACCCCTAACTCAACGTTTACGTTCCACCAGCAGCATACCTATGTAGAGAAACCCGATTCCAAAGAGATTTGACGACGCCCT 
AGCAGCGGACAACGAATATTCTAATTTATTTATGTGATAGTCACTAGTTTTAATATTCTTAAGCTTTTAAACAACCAAACGTTGCACGCCAAAAA 
ACGAACAAGCCGATACGCTACGCTAAAAAGCTACAAATTGTAATGCATTCCACAATGAGTGTACAACATGGGCTGGTTGCCCTGGTACTGATCGG 
CTGTCTGGCACATCCGTGGCACGTCGAGGCCTGCTCCAGCCGTACGGTGCCCAAGCCGCGCTCTTCGATCTCCTCGTCCATGTCCGGCACGGCAT 
TGCCGCCGACCCAGGCTCCAGTCACTAGTAGCACCACAATGCGCACCACCACGACGACCACGCCCAGGCCCAATATTACATTCCCCACATACAAA 
TGTCCGGAAACCTTCGATGCCTGGTACTGTTTGAACGATGCCCATTGCTTTGCGGTGAAGATAGCCGATCTACCGGTTTACAGCTGCGAGTGCGC 
GATTGGCTTTATGGGACAGCGATGCGAATACAAGGAGATCGACAATACTTACCTGCCCAAGAGGCCGCGTCCGATGTTGGAGAAGGCGAGCATTG 
CCAGTGGAGCCATGTGTGCCCTGGTATTTATGCTGTTTGTCTGCCTGGCCTTCTATTTGCGCTTCGAGCAGCGGGCTGCCAAGAAGGCCTACGAA 
CTGGAGCAGGAACTGCAGCAGGAATACGACGATGACGACGGCCAGTGCGAGTGCTGCCGCAACCGGTGCTGTCCAGATGGCCAGGAGCCAGTCAT 
TCTGGAGCGCAAGCTACCCTACCACATGCGGCTGGAGCACGCGCTAATGTCCTTCGCCATTCGACGCAGCAACAAGCTGTGAGCGGAGCGAACTA 
TGGAAACTATGAAAAGATATAGAAAGTCGAATTTCGAATATGCCTTGCCCAACACACAATTCTTAACTGGGCTTTAGTCGATAGCTGATAACTCC 
TATTTACCTACCGCATGCTAAAGATCTGTTCACCCGATTCTATATAGAGATCTTGACTTTCTCTTACCAAAAGTTACCCAGAAAGGGGGTTTTAA 
AGCCAGACAGCCTTGTATTACCCAATTGACTATGAATTXTAATACCTAACGCTAGCTATTTATTATTACTGCTAACTGATATACGTATCACACAC 
ACAACGCGACACACGCGCCAAGTATTATTTATTGCGAGAGATTTAAGAAGAAATGATATGGATTTTCTAAATAAACATACGAAACTGCACCATGC 
A 

(SEQ ID NO: 1508) 

Start ATG: 530 (Reverse strand: CAT) 

MSVQHGLVALVLIGCLAHPWHVEACSSRTVPKPRSSISSSMSGTALPPTQAPVTSSTTMRTTTTTTPRPNITFPTYKCPETFDAWYCLNDAHCFA 
VKIADLPVYSCECAIGFMGQRCEYKEIDNTYLPKRPRPMLEKASIASGAMCALVFMLFVCLAFYLRFEQRAAKKAYELEQELQQEYDDDDGQCEC 
CRNRCCPDGQEPVILERKLPYHMRLEHALMSFAIRRSNKL* 
(SEQ ID NO; 1509) 

Name: PROTEIN SPITZ PRECURSOR 
Classification: signal_transduction 
Gene Symbol: spi 
FlyBase ID: FBgn0005672 



Celera Sequence No. : 142000013384600 

CACTGAACAATATTTAAAAAGTTCAAAATCACTGTACAGTACTTGGTATTTTAAACATCCATCACTTTTCCATTCAAAATATATAAATATTGAAA 
AGCATGACATACGATGTTTTTTTTTTTTTATTTTAACTTAGTGTGTCAACTATCTTTATAAAAAAAATTAAAATTTTCTGGGAATCCAATAGAAA 
GCTCAGTCTGGAGATGAATCCTCGAAAGCGGAGCTAATTTGCAAGCAAGACTCGGCATCCAATAGTGGATATCTGCAGATCGTCCGATTTGAGAA 
CTCCGAAATCATATGCTTTTCATTCCTATTTTCTTTAGATCCCGGACAATTGATTTCTGTTTTTTTTCTGTTTTGATTTTCATCATCGATTGATT 
AATTTCCATGAGGACATTGTTTTCAAAGCTTTTAAGACCGCGAATAAATCCAGTCTCATTCAGTAAAGATTCGTGAAGGCAATTTCGTGTGATAT 
CAAGATATATATGTATATTCGGAAATAATGTTCACGTGAACCTCGCGATTTGGAAGTTCAATCCTTTGAAGGATTGAATGATTAGGACGCCATCC 
TTTATATGTTTTAGATATAAATATCACCCTTCACCTTCTCCAGTTTGATACCTATCAATATTACTAGGGAATGCATTACGACGGAATTCAAAACA 
TCTTGAAAAACCTCCAATTACCCCAGTTCAGATTGACCGTTTTATTGCCTCCCATTTTTATAGTTCCGAAGAAAATTACGAACGCAGAGGCATCA 
TTTTAGATTACCTAGATTGCCAAGTTTCATAAAACAAAACCATCTTTAGATTAGGTTTTCAATATTTCGGGTTTGCTGATTTCGATATCGATAAG 
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AAACTGACGTATGTATGTGTGAAGTTTTAATGGGTAAAAATATCGATATTCTGGCGAAATTTTATGTAAAGCAAAATGTGAATGTAAAAATACTA 
AAAGTGGTGAGTTCGATAGAGCCGCGTAAAAATATTTTCATTATCGATAACCGAAGTATTATTAATCTTTGCCAAAGACCAGCTTTGGCATTACA 
ATACAATTTTGGGGATAGATTAACTAGTGCAGCCCAGCAGGCAGCACCGTGATTTGCAACAATTTGAGCCTCACTCATTCAAGAACCTTCCAGCA 
ACTAGTTCGCCTAGTATATATACCTGCTCTCACCCAGCTGCACTTGTATACACAATCGAACGGCCGAACAAAAACTACGCAGTAGTCGAGAGCTC 
CAAATTAATAGAATCCGTCACATAAATTCGCCGCAACAAGTTAAACGAGAATAAAAGCGTCAGCATCGATATATATACATACATATATTCAAAAG 
TCTGTCGAAACTACAGAATACGTTGCGAAAATGCCGGAGAAGCTGTTGAAAATCACCTACCAGGGCGCTGGACCCCAGAAAAAGATAAATGCATA 
CCTGCGGATGCCCTCCCAGAATTACACCATATTGCGTCGCGAAATCGAGCTGTATCTTTTCCAGGAGCGCCAGTTACCCAAATGCGACGTAAGGA 
CCTTCTGGATCGGTGAGTTTTACAATGAAAATGCCGCAGAGTGAATGTTTGCTATACATACAAACAGTTATTCGCAATTGCAATCGAGTCGCAAA 
GATTGCAACATATGTGCATGGTATAAGTACATTCGTTTCCATACATAACTGCATCCGTCCGAGCAGTGCTGACCATTTAGCTTTTAAGTCTCGTT 
ACTTAAAAACCAGCCCAAATTATTGCGTACTTTGCCCAACAAAGCAGGGATTCACCTAGATAGTATTCCTGCTCAATGCTTATCATTGGCAAAAT 
TGTGTTGACACATTAGCTGATAAATATGTACATATATATTTTTATGGTACAAACGTGGTCAACATATTAGATGCACTGTTTCATAAATGGCACAC 
ATACAAAACATTTTTGATTTTACATATTTCTGAATTCTCGAATTATACATATATATTTTAAATACATCTAACGCGCTTTTTAATTTGTAAAGTAT 
GTGTACATGTACTCATATACATTTACATAGATCAAACGGGTTGTGCTGGTACAACCACAACTGTAGCTAATTTGTCTAACTAGAGCCCTCAGTGA 
TTCACCTTTGATAACAAAACTTCACTGATTCATAATTGATTTTATTTTCCTTTGCCTATTCTAACTTTTGCAGACGCTGATAAAGATGAAATCGA 
AATAGTCAACCAAAATGACTATGAGATTTTCCTGGCCAAGTGCGAGAGCAATATGCACGTTCAGGTGGCTCCACTGGCGCCCGTAGAGGAGCCAA 
AGGCCACCAAGCAAGAGGGTTCTTCGGCCAACGCTGAAGCTCCTTCTGTCGACGATCCGAGCAATTTCACCATCCACGACGCCGTTGAATGCGAT 
GGCTGCGGCTTGGCTCCCTTGATTGGGTTCCGCTACAAGTGCGTTCAGTGCAGCAACTATGATCTTTGCCAGAAATGCGAGTTGGCTCACAAGCA 
TCCTGAGCACTTGATGCTACGCATGCCGACCAACAATGGACCCGGTATGGTCGATGCCTGGTTCACAGGTCCAGGATTGGGACGCCGCAGTGGCC 
GGCGCTCCAGGGGACATTGCCCGTTTCAGGAGACGAACCAGGCAGATCCTGCTGGCGAACCCGCAAGGGATAGTCGCCGTGAACGGCGTCAGGCC 
CGCCGCCATGCCGGAGTCTTGACTCAGTTTGTCGAGATGATGACCAACTTGCCGCTGAACACAACAACGGCCACGGCACCAGCCGAACCGCAGAA 
ACCGAAGGCTGCAGAACAAACTGAAAGTCCTCCACAAGCTGAGCCCACTGTTACCGCTGAAAAGGCAGCTGAATCCGAGGCCAAACCAACTGAGC 
CGAAGAAGGTTAACACTGATCAAAGTGTCCCTAGAACCGAAGATCCAGTGACGACTCCACGTTCCACTCAGCCAACTACTCCAGTGATTAATCTG 
GATAACATTTCGCAGATTGTGCCACCCGAGTACATGAGTGCTGGCATCGAAATCCTTAATAACTTCAGCGAAATGTTTTCCAAGATTATCGATAC 
CACTGAGGGCGGTGATTCGGGAATTTTTGCGCCCTCAACGACTCCCAGTGCTGAGAATAAGAAACCAGAAGAGCAGGGTCAATCCAGTGGCCAGT 
CGGGGGCATCCTCGGCCAACCAGTCAGCTGTTCCCTCTGCCGCTCCATCGGCTAATCAGTCTAATGTCCCATCTGCTAACCAGTCGGCCACTCCA 
TCAATTTCTGGTTCGATCCCTGATGCTCAGCTTGAGACAGAGCCCCTGAATCCTAAGCCATCGGAAACCACCACCGAAACAGAGCAGGAAAGACG 
CCGTTCAGACAGCTTGGATCCAGAGTGGCAGCTTATTGACAATGCATACTCTGCAAACAATAGTAACTTGATCAATCTGGACACAACCAATCCCA 
CTGCTGCTCCTCAGGAGCCGGTGCGTGATTTTGGTCAGCTGGGCGAACTATTGCGTCAGCACATGAATGAGGAGGCTCGCGTGGAGCAGGCTTCG 
GCCAACACCCAGACTGCTCAAGTTGATACGGTGAGTACATCGACCTCGACCACATCGGTGACCACCAATAGCGTCGGCACCTCCCCAGCAGCTCC 
CGACGACAAGCGCACGGTCCCCGTCTACCACACTGGTAAGTGCCGAATTAAATAACTAATCGCTTTGTATTAATTATGTCTCCTAATTAGATGAA 
AGCATCAATAAATCGATCCATGCCATGATGGCCATGGGTTTCAGCAACGAGGGCGCCTGGCTAACCCAGCTCCTAGAGTCGGTTCAGGGCAATAT 
CTCAGCTGCCTTGGACGTAATGAACGTATCGCAGAACCGCAACTAAATATTCATATTCAATATCGATATATCTACAATGTATCTTCAATATCTAT 
AGAACCCAATGAAAAAGGGATTTTACTTTATCGTATCAGTTTACTACGTAGCTTAGTTTTTCTGTGGTGCTAGATTTATCTCGTAGCCAAAATAA 
ATGTGAAAATAGATATATGTTATGTTTTAAATAATCAACAAAATGTTATCGTATATATATGTATGTGTGAAATTGCAATAAAGGCACAAAGTAAA 
ATGATTTAAAATGCCTGCTTTATACCTGATATGGTTTTGTTTAGTAGAGTACGATTGACAGACAACAAAAGCCTCCTCCACTTAGATAATGTGTT 
TATACGTATGGTATAAATCGATTCTGGTTTTTCAGAGTTCAAATCCGCAAAGCACTCGAATTATTTCTTGAGTGATACACAAATCCACAATACTG 
TGTTTCGGAATTCGATTCGTTTCCTGTAGTATTGATATTCCAGTGTGAACACTACTCGCTTGTCGAAAGAGCCTCAAAGATCTTCTCTAGCTTCT 
GCTGCATCTGATCCAATAAAATAGTCTGTCGCCAAGCCGGCAACATCTCGCACAAATCTCCGGAGTTCGTAATTTGCAGTCGTAATGGTGGCAAT 
AAATGTGAAAAATTTGAAGAGCTTCTGGTGGTGTGCCTTCCACTGTCGCCGACCGAATCCCGTGTTATATCCGAACATTTGAGATACCATCAGAA 
CGAGGAATCCGGACAATCCGAAAAATCGATGAAGCATTTTCAAGGCGAAGCCACTGTTGATGAAATACAGCGGAAGACCGCTCAGAACGCTACCC 
AGAAGAAGAGCACAACCGATTATACCTGATGGGCGAAAGATGGAGGCATAAATTATCATTATATTTGGGAAAAGAACCGACGAGAACACACCGTA 
AAAAGCATGGTTGCTCTTAAAGTGACGAACGGTGGCATTGGGGTCTTCCCGTTTGCGCTCCAGCTTCTGCCATGTCTTGATTCCGATTCCGCCGG 
CTCCGACCAGGAATGCCAACAGCCCCAGGACGGCGTGCAGGCGGTTGAGGTTCAATGGACCCAACGAGGACTCCAAATATGCGCTATTCCGGACC 
AGC AAT GCT T C GC C C ATG AAAAAC AC AAAC T AT AATG AAAT AG AAG A T AG AT GG AAC AAT GA T T T AGGA C G GGT AAA AAT T T C AGG AG AT AGT AA 
ACCAGTTTATATTTAGGTCTTCCGGGTACAGAAAACTCTCACTCGAGAAATATATATCCTTGAAACCATTAGATTATTTGTTAACCGTGGCCTCA 
AACTTTCACTTTTTCACATATGTTTCTAAATCTTTTAATATCCGAAATCAAAATACTTACGCCGATTGTGCAGTAGAAGGCATGTCCGGCGGTGT 
GCACCAATTTGAGCACCAGACACTTGTAAACAATGGCTACGGTGATCCCCGCCAGAAGCACATGCGAGATCAGTGTGCAGAAGCCAAAGTAGAGG 
CCTCCATAGGAACCTTGCAATTCGGCA 
(SEQ ID NO: 1510) 

Exon: 1001,. 1532 

Exon: 2164.. 3645 

Exon: 3701.. 4347 

Start ATG: 1361 

Transcript No. : CT29100 

CCGAAGTATTATTAATCTTTGCCAAAGACCAGCTTTGGCATTACAATACAATTTTGGGGATAGATTAACTAGTGCAGCCCAGCAGGCAGCACCGT 
GATTTGCAACAATTTGAGCCTCACTCATTCAAGAACCTTCCAGCAACTAGTTCGCCTAGTATATATACCTGCTCTCACCCAGCTGCACTTGTATA 
CACAATCGAACGGCCGAACAAAAACTACGCAGTAGTCGAGAGCTCCAAATTAATAGAATCCGTCACATAAATTCGCCGCAACAAGTTAAACGAGA 
ATAAAAGCGTCAGCATCGATATATATACATACATATATTCAAAAGTCTGTCGAAACTACAGAATACGTTGCGAAAATGCCGGAGAAGCTGTTGAA 
AATCACCTACCAGGGCGCTGGACCCCAGAAAAAGATAAATGCATACCTGCGGATGCCCTCCCAGAATTACACCATATTGCGTCGCGAAATCGAGC 
TGTATCTTTTCCAGGAGCGCCAGTTACCCAAATGCGACGTAAGGACCTTCTGGATCGACGCTGATAAAGATGAAATCGAAATAGTCAACCAAAAT 
GACTATGAGATTTTCCTGGCCAAGTGCGAGAGCAATATGCACGTTCAGGTGGCTCCACTGGCGCCCGTAGAGGAGCCAAAGGCCACCAAGCAAGA 
GGGTTCTTCGGCCAACGCTGAAGCTCCTTCTGTCGACGATCCGAGCAATTTCACCATCCACGACGCCGTTGAATGCGATGGCTGCGGCTTGGCTC 
CCTTGATTGGGTTCCGCTACAAGTGCGTTCAGTGCAGCAACTATGATCTTTGCCAGAAATGCGAGTTGGCTCACAAGCATCCTGAGCACTTGATG 
CTACGCATGCCGACCAACAATGGACCCGGTATGGTCGATGCCTGGTTCACAGGTCCAGGATTGGGACGCCGCAGTGGCCGGCGCTCCAGGGGACA 
TTGCCCGTTTCAGGAGACGAACCAGGCAGATCCTGCTGGCGAACCCGCAAGGGATAGTCGCCGTGAACGGCGTCAGGCCCGCCGCCATGCCGGAG 
TCTTGACTCAGTTTGTCGAGATGATGACCAACTTGCCGCTGAACACAACAACGGCCACGGCACCAGCCGAACCGCAGAAACCGAAGGCTGCAGAA 
CAAACTGAAAGTCCTCCACAAGCTGAGCCCACTGTTACCGCTGAAAAGGCAGCTGAATCCGAGGCCAAACCAACTGAGCCGAAGAAGGTTAACAC 
TGATCAAAGTGTCCCTAGAACCGAAGATCCAGTGACGACTCCACGTTCCACTCAGCCAACTACTCCAGTGATTAATCTGGATAACATTTCGCAGA 
TTGTGCCACCCGAGTACATGAGTGCTGGCATCGAAATCCTTAATAACTTCAGCGAAATGTTTTCCAAGATTATCGATACCACTGAGGGCGGTGAT 
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TCGGGAATTTTTGCGCCCTCAACGACTCCCAGTGCTGAGAATAAGAAACCAGAAGAGCAGGGTCAATCCAGTGGCCAGTCGGGGGCATCCTCGGC 
CAACCAGTCAGCTGTTCCCTCTGCCGCTCCATCGGCTAATCAGTCTAATGTCCCATCTGCTAACCAGTCGGCCACTCCATCAATTTCTGGTTCGA 
TCCCTGATGCTCAGCTTGAGACAGAGCCCCTGAATCCTAAGCCATCGGAAACCACCACCGAAACAGAGCAGGAAAGACGCCGTTCAGACAGCTTG 
GATCCAGAGTGGCAGCTTATTGACAATGCATACTCTGCAAACAATAGTAACTTGATCAATCTGGACACAACCAATCCCACTGCTGCTCCTCAGGA 
GCCGGTGCGTGATTTTGGTCAGCTGGGCGAACTATTGCGTCAGCACATGAATGAGGAGGCTCGCGTGGAGCAGGCTTCGGCCAACACCCAGACTG 
CTCAAGTTGATACGGTGAGTACATCGACCTCGACCACATCGGTGACCACCAATAGCGTCGGCACCTCCCCAGCAGCTCCCGACGACAAGCGCACG 
GTCCCCGTCTACCACACTGATGAAAGCATCAATAAATCGATCCATGCCATGATGGCCATGGGTTTCAGCAACGAGGGCGCCTGGCTAACCCAGCT 
CCTAGAGTCGGTTCAGGGCAATATCTCAGCTGCCTTGGACGTAATGAACGTATCGCAGAACCGCAACTAAATATTCATATTCAATATCGATATAT 
CTACAATGTATCTTCAATATCTATAGAACCCAATGAAAAAGGGATTTTACTTTATCGTATCAGTTTACTACGTAGCTTAGTTTTTCTGTGGTGCT 
AGATTTATCTCGTAGCCAAAATAAATGTGAAAATAGATATATGTTATGTTTTAAATAATCAACAAAATGTTATCGTATATATATGTATGTGTGAA 
ATTGCAATAAAGGCACAAAGTAAAATGATTTAAAATGCCTGCTTTATACCTGATATGGTTTTGTTTAGTAGAGTACGATTGACAGACAACAAAAG 
CCTCCTCCACTTAGATAATGTGTTTATACGTATGGTATAAATCGATTCTGGTTTTTCAGAGTTCAAATCCGCAAAGCACTCGAATTATTTCTTGA 
GTGATACACAAATCCACAATACTGTGTTTCGGAATTCGATTCGTTTCCTGTAGTATTGATATTCCAGTGTGAACACTACTCGCTTGTCGAAAGAG 
C 

(SEQ ID NO: 1511) 
Start ATG: 361 

MPEKLLKITYQGAGPQKKINAYLRMPSQNYTILRREIELYLFQERQLPKCDVRTFWIDADKDEIEIVNQNDYEIFLAKCESNMHVQVAPLAPVEE 

PKATKQEGSSANAEAPSVDDPSNFTIHDAVECDGCGLAPLIGFRYKCVQCSNYDLCQKCELAHKHPEHLMLRMPTNNGPGMVDAWFTGPGLGRRS 

GRRSRGHCPFQETNQADPAGEPARDSRREPJIQARRHAGVLTQFVEMMTNLPLNTTTATAPAEPQKPKAAEQTESPPQAEPTVTAEKAAESEAKPT 

EPKKVNTDQSVPRTEDPVTTPRSTQPTTPVIWLDNISQIVPPEYMSAGIEILNNFSEMFSKIIDTTEGGDSGIFAPSTTPSAENKKPEEQGQSSG 

QSGASSANQSAVPSAAPSANQSNVPSANQSATPSISGSIPDAQLETEPLNPKPSETTTETEQERRRSDSLDPEWQLIDWAYSANNSNLINLDTTN 

PTAAPQEPVRDFGQLGELLRQHMNEEARVEQASANTQTAQVDTVSTSTSTTSVTTWSVGTSPAAPDDKRTVPVYHTDESINKSIHAMMAMGFSNE 

GAWLTQLLESVQGNI SAALDVMNVSQNRN* 

(SEQ ID NO: 1512) 

Classification: known_flybase_gene 
Gene Symbol: ref(2)P 
FlyBase ID: FBgn0003231 



Celera Sequence No. : 142000013384598 

ATGATTTAGATGTTGGCTTACAAAAAAACATGTTTTATTACGGACCCGGTTGGCCCCAATCCATTCTGTCTAAATTCCTCCGATGCGAATACACT 
TTGTTTTCTTCTAAACCACTTTTAAGACTTCTTTTTAGTTTACATATATTAATTCTCCATATGACGCTTCTCAAGCTTAATATATTTGAGTGAAT 
TTTAAATGAATGTTTTCCACTTATGAACACGCTAGTTACACACAAGGTGTTAGGATTGAATCACCTCCTCGCAGAGCCAATACCAAGTGAAGAAC 
GGATCCACCTTGCACTTTGTAGTCCGCCGCGGTTTTATCGTCATTCCTGGGGGGGATCGCATGCGAAATAAGCTAAAATCAATGACCAATAGCCA 
GAAAGACTCACATTTGCTTGCCGGAGAAAATAAGACGCTGTTGCTGGGGCGGAATGCCTTCCTTTTCCTCCACGCGCTCCTTGATGCGATCTACC 
TTGTCCGTGGGCTCAATGTCGATCTCGATTTCCTTGCCGGTCAGGGTCTAATGGAAAGATAATACATATTTACATTAATAATCCAGCCACACATA 
TATTGACTGTAATATTCGACGATTTTGTTGTTGTTCAAAGTGCCCGCTGGGCACGATGGCTGTGATTTATCTTTATCCTACTTTTTTGCTACTCC 
ATATCTTCCACACTAACCTTCACTTTGATCAACATTTTGTGTGTATTTAACTAGGCGCTCCTGTACAAATTCAAAATTGCCGAAATTTTTGCAAA 
TCTCCACAGCTGGTTCTCTTTTTTGGCTGTCACTTTCGGCGTTGGTACGACTGGCAACTATCGATAGTACTGTCGATGACCATCGATTGTTTTCC 
ACACTATTACAAATATAGTATTTTCTCCATCATTCCATCTCTACACAATTCACGTGTAAAAAAATACATTTCGAAAAAGGAGAACTTAATTTATG 
TAAATTTTTTGTAGCTTGATTTATAGTGTGCCCATTACGGTTAGAAGTCTCACATATTTAGAATTCAAATTCTCCTTAAATTATTTTACTCATGA 
GATGCAAAAATCCATTGAATGCCATTACTACATTTTTTAACTTTTGAAAAAAATGGAAGGCGTCACTTTTGAGGGATTTGAATTAAGAGCAATTT 
GTTTATTAATGTTTTATCTCGGATCTTAGAAAATTAAAAAGTATTTTTTTGTCTTTTGTCTTTTCTGTTTTATTAAAATAAGTTGTATGCCTTAA 
TAACTAAATACATTTATTTACAGTACTAAATGGATAACTACTTTGTTCTTTTCGAAGGTACTGAGATCCCCAGCTTAGAAGTTCATCCCTATGAG 
AGCTTCTTCAGTTTGTTCCAGTCGCGAATGGCCTCCCCGATGTGGCGTATATCCCTCGCATAAGTGCGACAGCATCCCCCAATGACCTTGGCCCC 
CAGTTGCGCCCACTCCGGAACGTAGTTCGCCAAGGGCACACAGTGCTCCCTTCCCTGCCAGCCGTTGACCACGTCGTATACCTCTCCGCTATTGG 
GGTACACCACCAACGGTATCTGCTCACCCACTTCGCGGTCACCATTCAAACTCTTAAAGAGCGGGGTCACAAACTTCGGATGCACACAATTTACT 
CCGATGGCCAGGCACTTATCCTGGGCATTTCGTTCAGCCAAGAGGTCCCAAATAGCATTCGCCGCGTCCGCAAAGGTTTCTCCATGCGCCAGGGT 
ATTCTCATCCTTGCACTGGAAGGCCACCCAGAACTTCACATCCGGATAATCATCGCACAGCATCTCCACCAGGGCCTCCGCCTCCATTTGACAGG 
GAATCGTCTCGATGGCCAGCGCATCCACACCAGCCTCCAAGCACGCTTCGATCCTCACGCGATGCCAGTCCGTAATCTCCTTGGCCGGCACAAAG 
TCGGCATAGCTACCGGTGTACTCGGAGCCATCGTGCAGGTGGGCCCCAAAGGGTCCAATGGAGGCAATGATCAAAGGGTATCCTTTAAAAACAAT 
AAATTGGAATATATAATTGGTAAATATTACTATATATAAAATAAAGTATTAAAGAAAGTATTAAAAGTAATGACCCTACCACATCATTCACTTGA 
CCACAATCTAAATTAATCCCGCCCGAGCTGGACCTCATGGATGTCAACTTACCCTCCTGCACCGACAGCTGCGCCTGATAGCACTCGGTGAGATA 
GCGTTCCTTGGCGATGTGGGCCAGGCGGACCGTGTTCTTTATCAGCTCGATGCTCTGCTCCTCGTCCAGCTCCAGGTACTCCATGTAACCATCGA 
CACTGGACTGGTAGGTGTTGGTCAAAATGATATCGGCACCATCTGGTGTTGTCAAGGGGAAAATTCAGTTAGTATGTTGCCAATTGCTACCGGAA 
AACAATTGCTTACTCTGCAAAAAGTCCAGGTGGGTGCTGATAATGGCCGCCGGATTGGTGGCATTGAAGCGGGCACTCCATAGCGGATCCCCATC 
CACAGAGTCACCCACATGGACGGTCATCTGGGTGCCGAAACCGCCATCCTTCACCAGCACGCGCGTCAAACCCATGATTGAAAGATTACTAAGAG 
GGGTGGAGGATGATCCGTTGAAAATTGATTATTAGTTACATTTCTACGTCAGCGGGCTGCGAGATAATTAAATATGTATATTGCGTCTGCCACCT 
GGATTCCACTGCACTCAGGGCGATGTTATACAATGTATCGGAAGAACTAATCTCTATTTTTCAGTTCAACGAACTTGGCAGAACAAAAGACCAGT 
GGCTTAATCTAAAAAACACACTGAGAAAAACAGCTCAAAGTTCTTTAGCTAAGATATATGTTATATGAGTCTTATGATTTATTATAAAAAAGATT 
TAATTCTAGGAGTATTCAGCTTTCACTGAATAGTATACAAATTTAAAACCAAGGTCACATTTTTACGTATCTCAATAAATGGCGAGTTAAAAGCC 
TGAAACATTTTATTCTGTATACGCTAAATTCAAAATATACATCAGCTCTTTAAAATTTGGCAACAATTTTTGAGAAATATATATCTTTTAAAGTG 
AAT CATTTATTACGTTT C AAG AT CT AT T CT T GTT C C AAATT AAAAAAT AT ATT AA T AT T T AAT AT A T G TAAG T AT T CC AAGC AT AACATT T TT AG 
ATTGCTTATAAGATCCCTTTGTTGCACTTTTAAGCATACTCCTTGGATAGTGTATACACCGTCTGTCATCAAAATCTATCGGTAGGGAAGCGAAA 
AACACACGATAAGGTCTCGCACTCACCTTGACAGCTTGTTTATGGATCTTGTAAATGGGCAAATTCAATAACAACTTTATTAGACACTAGACTGG 
CTGGTTGGTTATCAAGTGGTTATCACAAATCTTTCGGAACGGTGGATATGATCAGTCGCAATGGACGCTCGACGAACGTTTAAATGTTTCACGTA 
TCGTCGAGGGGCTTTTATACTTTCCGAAGGGCCACTCAGCTGTGCGGCTCGGCTTATCAGGGCGAAATTTATAATATTTATAATAATTTATAACA 
AAACAAAACGGCAAACACAAACATGAGAGTGCCCCCACCAAGTGGGTGATAAGCCCGGTGCACAGCTGCCAGACGCTGGCCAGCCAGATTGGATT 
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GTCCGGGTGGACTGCGACTCCCCCTCCACGAGGCTGATGAAGCTGATGAAATCGTCAAAGCGCCGCTGTGCGTGGTTCTGGCTTCCGAGTGACTC 

TGCTCGGGATGTCGCATCTGCTAAGTCGACGCCGATAACCTCTGGACGCCGCCTAGGACCAACACTTCAAGATGTACTTGGTTGTTGACTCTGTC 

CTCCTTAAAGGAAAATGGGTGAGCTTCCGACCGCACTCAATAGATTATGGTCCAAAAATAGGTCTACTGATGTCAAAGAGAAAAACAACATTTAA 

CCCCTATTCTGGTAAACAAATCGCTAATGAACAGGAAATAAGACATAAGATATATCAAATAAAATCTCCTCTAAAAGAGATGAGCTTAAAGTGAT 

GATTGTTATTAATATTTGGACAAAAAAAAAATAAATAAATAAATAAATAAATAAAGTAAAATGTAAAGTAAGAGATTGACCGACAAAAGGCTTTC 

AATTTGAAAATGGCCGTTAAAATCATGTTTTTTGAACGAAGTATCCGATTAGGCTGATTGCCAGATTGGGTCAGAGTGCAATAGGTGGACATGTG 

GCTCCACCATCCTCAAAAGCTTCAACAGCCAAAAATGTAAGTACTCTTTTAAAAATGCGGTATGAAGTTCCAAAAAATTTTGAAATATCTTTCAT 

AGTGATAAAGTCGAGAACCGCAAGAATAGAAGGGCATACCCTTCTCTATAAAGTACCTTGCGTACCAGAAGGCGTTTGCCTTGGTACATATGTAA 

GAAGT GAT TAG AT T T GAT G AAT CT 

(SEQ ID NO: 1513} 

Exon: 3394.. 3257 
Exon: 2558.. 2389 
Exon: 2322.. 2143 
Exon: 1982, .1001 

Start ATG: 2545 (Reverse strand: CAT) 
Transcript No. : CT29750 

AGCGTCCATTGCGACTGATCATATCCACCGTTCCGAAAGATTTGTGATAACCACTTGATAACCAACCAGCCAGTCTAGTGTCTAATAAAGTTGTT 

ATTGAATTTGCCCATTTACAAGATCCATAAACAAGCTGTCAAGTAATCTTTCAATCATGGGTTTGACGCGCGTGCTGGTGAAGGATGGCGGTTTC 

GGCACCCAGATGACCGTCCATGTGGGTGACTCTGTGGATGGGGATCCGCTATGGAGTGCCCGCTTCAATGCCACCAATCCGGCGGCCATTATCAG 

CACCCACCTGGACTTTTTGCAGAATGGTGCCGATATCATTTTGACCAACACCTACCAGTCCAGTGTCGATGGTTACATGGAGTACCTGGAGCTGG 

ACGAGGAGCAGAGCATCGAGCTGATAAAGAACACGGTCCGCCTGGCCCACATCGCCAAGGAACGCTATCTCACCGAGTGCTATCAGGCGCAGCTG 

TCGGTGCAGGAGGGATACCCTTTGATCATTGCCTCCATTGGACCCTTTGGGGCCCACCTGCACGATGGCTCCGAGTACACCGGTAGCTATGCCGA 

CTTTGTGCCGGCCAAGGAGATTACGGACTGGCATCGCGTGAGGATCGAAGCGTGCTTGGAGGCTGGTGTGGATGCGCTGGCCATCGAGACGATTC 

CCTGTCAAATGGAGGCGGAGGCCCTGGTGGAGATGCTGTGCGATGATTATCCGGATGTGAAGTTCTGGGTGGCCTTCCAGTGCAAGGATGAGAAT 

ACCCTGGCGCATGGAGAAACCTTTGCGGACGCGGCGAATGCTATTTGGGACCTCTTGGCTGAACGAAATGCCCAGGATAAGTGCCTGGCCATCGG 

AGTAAATTGTGTGCATCCGAAGTTTGTGACCCCGCTCTTTAAGAGTTTGAATGGTGACCGCGAAGTGGGTGAGCAGATACCGTTGGTGGTGTACC 

CCAATAGCGGAGAGGTATACGACGTGGTCAACGGCTGGCAGGGAAGGGAGCACTGTGTGCCCTTGGCGAACTACGTTCCGGAGTGGGCGCAACTG 

GGGGCCAAGGTCATTGGGGGATGCTGTCGCACTTATGCGAGGGATATACGCCACATCGGGGAGGCCATTCGCGACTGGAACAAACTGAAGAAGCT 

CTCATAGGGATGAACTTCTAAGCTGGGGATCTCAGTACCTTCGAAAAGAACAAAGTAGTTATCCATTTAGTACTGTAAATAAATGTATTTAGTTA 

TTAAGGCATACAACTTATTTTAATAAAACAGAAAAGACAAAAGACAAAAAAATACTTTTTAATTTTCTAAGATCCGAGATAAAACATTAATAAAC 

AAATTGCTCTTAATTCAAATCCCTCAAAAGTGACGCCTTCCATTTTTTTCAAAAGTTAAAAAATGTAGTAATGGCATTCAATGGATTTTTGCATC 

T CAT G AGT AAAAT AAT TT AAGG AGAAT T T G AATT CT AAAT AT GT G 

(SEQ ID NO: 1514) 

Start ATG: 152 (Reverse strand; CAT) 

MGLTRVLVKDGGFGTQMTVHVGDSVDGDPLWSARFNATNPAAIISTHLDFLQNGADIILTNTYQSSVDGYMEYLELDEEQSIELIKNTVRLAHIA 
KERYLTECYQAQLSVQEGYPLIIASIGPFGAHLHDGSEYTGSYADFVPAKEITDWHRVRIEACLEAGVDALAIETIPCQMEAEALVEMLCDDYPD 
VKFWVAFQCKDENTLAHGETFADAANAIWDLLAERNAQDKCLAIGVNCVHPKFVTPLFKSLNGDREVGEQIPLVVYPNSGEVYDVVNGWQGREHC 
V PL AN Y V PE WA QL GAK V I GG CC RT YARD I RHI GE A I RDWN KL KKL S * 
(SEQ ID NO: 1515) 

Classification: hypothetical 



Celera Sequence No. : 142000013384637 

TGCGGCAGCAGGAGCAGGTGTTTCTGCAGTCTTTTTAGCCTGGGCTTCCGTATCATCCTCGGCTTCCTCAGCCAAACTGCTACGCTGCGGCTGAA 
CTTCGGCGTAAATGGCGGGAAACTGCGACTGTAGCTTCTGCTTGATGATCCGTATCTTGCGGAATATGTAATCCAGATCCTTCTTCATTTCGCTC 
AGGCATTTCACGTGTCTTTTGAAATCCTCGCTGGCACTTTTCAAACGACTCTGCGATAAGGCATTGCAGTTGAGCAGCATCTCGTTGGTTTTCTC 
AAAGCGTTGCAGCCTGTGGGATATATTTTGGATGTTTATTATAATCTCTGGTTAAATGAAATGGCTTACATTTGCTTTTGAGCCCGTATCATGGT 
CTCCACATCGCCCTGATTCACCATTCCTGCGAGACTTTGAATGAAGGCCTCGGCGGCGGAGTTGCCGAAGCCAGTGAAGGATTCCAGGTCAGGAG 
TGGCATCCTCGGACTGGTGCACATCCGCAAGGCCCAAGTGGCTCATTGTAGCCTAGAATTTATTTGTTTATAATAAGGGAGAACACAAAAATTGA 
TTTGTTTACTTGCTTTTCAAGGTCTCAAGGTTTCTTTGCGACAACTTGAACTCTGCTTTGTGTGCGTTTGCTGGCAGTTTTAGTTTTTTCTATTT 
TTATTTACGTTTTCAAATGGATCTCATGTTTCCATCTATTTTAAAAGTTCTTTTGTTTAGATTTCTTAAAAAAAAAGATATTTTCGTAACAGCTG 
TTTAAGTACTAGTTATAAACCAGTGCTGCAAGCTGCCAAAATGCAAAAGCACTGCCGTAATTAAATTTGAACATTTTACTTTTTAACATAAAATT 
AACACTGTTTTATGACAACATTTGTTCCTATTTAATATAAATCAAAGTTTCTTTAGCTATTTAAACACTTAAAAATATTTAAATAATAAAGCTTA 
GCAGAATGGCATTGGGTATTTTTACGCAAAGCCGTTTTGGTCACACTATGCTCACCGGCTTTTTTTTTATTAAGCGATTGAATTTCCTTATTTAT 
TCTTGATTTAAACGTGC AAAAT GAAGCTCGCTCCGACTGTTAAGCTAAACAATGGCTACGAGATGCCAATTCTGGGCCTAGGAACCTACAATGTA 
AGTCCTTATCATCGGCGATGCTAATAAATCAGCAGGCAGAAATTGTTTATGGAAATTTCTACCATTCAATTCCAATTAAGTGCAATTGTGAAATT 
GAAAGTTAATTATGTTATCTGTAGTTAAAGAAGTCTCGCTGTGAGGCTGCCGTGTGCCACGCCCTCGAAATGGGCTATCGGCATATAGACACCGC 
ATATCTGTACAGGAATGAAGGCATTATAGGCAAGGTTTTAGCTAAACTTATTGGCGACCAGAAACTGAAACGCGAACAGGTGTTTCTGGTCACAA 
AGG T AG GAT AT A GT AC AG ATG GGT T AT AT AT AGT T ACC AGCAC A TG T AAA T AGATC GAAAAT C AAT AT C C AAT T AGC T GT G GG AC AT A T AC C ACG 
AACCCAAGATGGTGAAATACGCCTGTGATATGCAATTAAAGCTACTGGGCGTGGACTATATAGATCTATATCTGATGCATTCGCCGGTGGGCGTG 
GACTACATCTCTGATGAAGATCTGATGCCCCACGAGAATGGCCAGCTGAGGACCAAGTGAGTTTGGGATACAACTAAACATACATAGTTTAGATT 
CAAGAAGCTTAAAGAAACTGATCTTTGGAACTTATAGCGATGTGGACTATGTGGACACCTACAGAAGTATGGAGCAACTGGTGCATCTGGGGCTG 
GTGCGCAGCTTGGGATTGTCCAACTTTAATGCCAATCAGCTGAAGAGATTACTGGAAAACTGCCAAATCAAGCCGGCAAACCTACAAATAGAATG 
TCATCCGGAATTGGTGCAAGTCCCATTAATTGAGCTCTGTAAATTTCACAATATCACCGTGGTTGCCTATTCGCCACTGGGGCGTCCCAAAGCCT 
GCAATCCGCTGCCCGATTACTACACTGATTCCAAGCTACTGGCGTTGGCAGCGAAATACGGCAAGACACCAGCTCAAATCATCCTAAGATACTTG 
GTATAAACGACCTTAGAATTGTAAAGTGATAAAGATATTTATTGGCTAAAAATCTACAGATCGATATTGGCACGGTACCTATTCCAAAAGCTGCC 
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CAGCGAGCCCATCTCGTAGAGAATCTGGATATATTTGATTTCCACCTAACGCCGGAGGAGTTGCTCCAAATGAAGACCTTTAATACAGGGCGCAG 

AATCTGGAAATTCGATAAAGCGAAGAATCATCAGTACTATCCGTACTAGCAAGAGATATTTAGTGTTCGGAAAAAAAGTGTAACTATGTAAATGT 

ATAAGATAGAATCTCGTTAGATAGTTTAGTTTTTACGTTAGTTTTTTTTTAGTACAATATTACAATGTTATAAAATGTCATCATATCAATCGTTA 

TCATATATAAATATACTATATTTTGTGCTAGATATGATCCTATTAACATTTTAAGTGCTCACCTGGGGAAATAATTATTTGTATCTCTCTAAATG 

TCTCTATCAAAGTTTATCGAATCTTTCGTGTGCCTTGCGACGTTGATCAGTTGTATAATTTCGGATATTGTTTCGATTTGTCAATTCGCGGCTCT 

CCGATATGCTCACAATTTCCTGGCCGAAACGAACGTGTTGCTGTGGTCGAAAATAACAATATACTGATCTCAAGAGTCGCATTATAAAACACATA 

T T T GAT T C T GT C AGAAAGG AG AAAGGT T AAT GAT GAAT T T G AT ATT CT CT AT T T GC AGTC G AAGG AC AAC G AAGG C G AAGC CGC T GT G AAAC AT G 

CGATTGATGTGGGCTATCGTCATATAGATACGGCCTATTTCTACCAAAACGAGGCCGAAGTGGGCAAGGCGATTCGGGACAAGATCGCAGAAGGT 

GTGGTCAAGCGAGAGGATATATTTTTGGTCACTAAGGTAAATGCCATGATGATATGTTCTAGATCTATCAATAAGATTATATTTTTGCGCAGCTT 

TGGAACATTTTCCACGATCCAGAGCGCGTTGAGGGCATTTGCCGCAAGCAGTTAAGCAATTTTGGCTTGGACTATATCGATCTGTATCTGATGCA 

TATGCCAGTGGGCTACAAATATGTAGATGACAACACCCTGCTGCCCAAAAATGAGGACGATGTGCTCCAACTGAGCGATGTCGACTATCTGGATA 

CGTACAAAGCCATGGAAAAGCTGGTAAAACTGGGCCTGGTGCGCAGCATCGGAGTATCCAACTTCAACAGCGAGCAGCTGGCACGAGTTTTGGCC 

AACTGCGAGATCAAACCCGTCACTAACCAGGTGGAGTGCTCGCCTGCGCTCAACCAGAAGGCTTTGACAGCCTTCTGCAAGAAGAACGACGTCAC 

CTTGACGGGCTACACGCCTTTGGGAAAGCCCAAGCCGGATATTCAGAAGCCGGACTTTATCTATTCGCCGGAGGTGGCTGTTATTGCCAAAAAAT 

ATGGCAAGACTACACCGCAGATTGTGCTGCGTTACCTGGTAATTGAGGCATCAACCTGACCTCAGCGACAGCATCTAATTAAAATCTCTCTTTTC 

GCAGGTTGGTCTGGGTGTCATTCCCATTCCAAAGTCATCGAATACTAACCGAATTTCTGAGAACTTCGATATCTTCGATTTTGAGCTGACAGCCG 

AGGAAATGGCCGTTCTCGATGGTTATCACACTGGCGAACGGGTGGTGCCATTAAATTTGATTAAGGGCCTCAACCACAAGTACTACCCCTTTTCA 

ATTGAGTTTTAAACCAGATGAGCGCCTTGTTGCTGTATCACGAATATAATGTAATAAATTTGTCATTACATCAATCTCTATGACAACAAATTGAA 

TGCACGAATGTATCTGCATACTTATTCCAAACCCAAATCCAACACAAAGCTCTTAGTGCCTTGATGTTCTCCATGAAATCCGTCCAGTTGCCAAC 

GAT CCGATCCGCACCCCACCGTGGAACTTATTCCCAGAAAT GAGGAGCAAATGGAAGCATCGATCTTTGGCACGGC ACGTTTGACATCC AAGC TA 

AAAGGTGAAAATGTTTGAATAAGACCGAAATATTAAGTATTAAGCTTTTACGACCAATGGTAATGCGAAGTGCTCAAACTTGGAGATCACCAATG 

GAGATTCAAATTTAAGTTACATGTGGAAAAAACTGTATACCTGCATTATTTCAATCATTTCAATAGTTGTTCAGCTTAGACATTGATTAATGAAC 

TAGTGATATTTTTAGAAGGTGCTTCAAGCTTTTCTACAGTAACTTTAGATATAGAAAATAGAAAAAAGTTTAGTAAGTAAATAAGACGTAAAAAA 

GCATATGGATTTAAGATGGATATGTGATTTAAAAAACGTTCTTTTAGAAGAACTTGTGCTGATTGCCAGCTGTTAGCCTAAATTGCTACATCAAT 

TTTAAGGACGCATTTTTTGTTTGCCAACATTCTCTTATCAATGGATCTTCTATTCTGATGTATGGCCAAAAAATATCACGCGGAAGCTATTAGTT 

GCGCCAACCGCAGAAATGAAGCATGAAAGTATTATGGCTATTAGTTGATGTACCTATTACGCCCGACAGGCTAATTATCTTTGGATTGTTGGGTA 

CATTTTGGGTTAATCGCGCCGCCGTAGACCTTTGTAAATAGCACCTCGGCCTGTGGGGTGACGTCAGAGCCAGCCAAGATGAGCACGGGCTGAAT 

GAAATTGGGGTCGTCTAGCCTTTGAGCTGCCCCAAAATAGTGCATGGATGAGTGAATTAAATAGCGGACCGGACTTGGACTTACGTGCTGCTCCA 

CGAGTCGATTAGAGGAGAATCTGTGGC 

(SEQ ID NO: 1516) 

Exon: 1001.. 1137 
Exon; 2814.. 2981 
Exon: 3038.. 3553 
Exon: 3615.. 3872 
Start ATG: 1066 

Transcript No. : CT29792 

CTCACCGGCTTTTTTTTTATTAAGCGATTGAATTTCCTTATTTATTCTTGATTTAAACGTGCAAAATGAAGCTCGCTCCGACTGTTAAGCTAAAC 
AATGGCTACGAGATGCCAATTCTGGGCCTAGGAACCTACAATTCGAAGGACAACGAAGGCGAAGCCGCTGTGAAACATGCGATTGATGTGGGCTA 
TCGTCATATAGATACGGCCTATTTCTACCAAAACGAGGCCGAAGTGGGCAAGGCGATTCGGGACAAGATCGCAGAAGGTGTGGTCAAGCGAGAGG 
ATATATTTTTGGTCACTAAGCTTTGGAACATTTTCCACGATCCAGAGCGCGTTGAGGGCATTTGCCGCAAGCAGTTAAGCAATTTTGGCTTGGAC 
TATATCGATCTGTATCTGATGCATATGCCAGTGGGCTACAAATATGTAGATGACAACACCCTGCTGCCCAAAAATGAGGACGATGTGCTCCAACT 
GAGCGATGTCGACTATCTGGATACGTACAAAGCCATGGAAAAGCTGGTAAAACTGGGCCTGGTGCGCAGCATCGGAGTATCCAACTTCAACAGCG 
AGCAGCTGGCACGAGTTTTGGCCAACTGCGAGATCAAACCCGTCACTAACCAGGTGGAGTGCTCGCCTGCGCTCAACCAGAAGGCTTTGACAGCC 
TTCTGCAAGAAGAACGACGTCACCTTGACGGGCTACACGCCTTTGGGAAAGCCCAAGCCGGATATTCAGAAGCCGGACTTTATCTATTCGCCGGA 
GGTGGCTGTTATTGCCAAAAAATATGGCAAGACTACACCGCAGATTGTGCTGCGTTACCTGGTTGGTCTGGGTGTCATTCCCATTCCAAAGTCAT 
CGAATACTAACCGAATTTCTGAGAACTTCGATATCTTCGATTTTGAGCTGACAGCCGAGGAAATGGCCGTTCTCGATGGTTATCACACTGGCGAA 
CGGGTGGTGCCATTAAATTTGATTAAGGGCCTCAACCACAAGTACTACCCCTTTTCAATTGAGTTTTAAACCAGATGAGCGCCTTGTTGCTGTAT 
CACGAATATAATGTAATAAATTTGTCATTACATC 
(SEQ ID NO: 1517) 

Start ATG: 66 

MKLAPTVKLNNGYEMPILGLGTYNSKDNEGEAAVKHAIDVGYRHIDTAYFYQNEAEVGKAIRDKIAEGVVKREDIFLVTKLWNIFHDPERVEGIC 
RKQLSNFGLDYIDLYLMHMPVGYKYVDDNTLLPKNEDDVLQLSDVDYLDTYKAMEKLVKLGLVRSIGVSNFNSEQLARVLANCEIKPVTNQVECS 
PALNQKALTAFCKKNDVTLTGYTPLGKPKPDIQKPDFIYSPEVAVIAKKYGKTTPQIVLRYLVGLGVIPIPKSSNTNRISENFDIFDFELTAEEM 
AVLDGYHTGERVVPLNLIKGLNHKYYPFSIEF* 
(SEQ ID NO: 1518) 

Name: ALDOSE REDUCTASE LIKE 
Classification: enzyme 



Celera Sequence No. : 142000013384351 

GTTGCAGTTGCTCTTCCTTGGTACACAACTGCTGGCTCTTTTCCGTTATCAGACCCTCCAGCTCGATAATCTTGTCCGCTTGTTCTGACACTCGT 
TCCGCCAGCATCTGATACTGCAGGGCTAAGCCCTCCTTATCCCGTTGCAGGCGACGCATACGCTCGTCGGAATCCTGACGAGGTATGTGCGTCTC 
CAGCCAGTCGCTGATAATGGCCGCCGTGACAGGATCCGGTGCTGGAGCAGCCAGTCCGACCTAAAGTGATAAGCGTAAGTGTAAGCCCTCCGTTT 
CTAGACCTTATCATAGTCAGTGTGACTCACCTGTTGCAGGGCGAGAGCCAAAGTCTTAGCTGTTGACAAAACATTAGTGGCCGAAATTAAGTTGC 
GTTCGCTGAGTGTCAGGGAAGACCCTGCTCCGGATATAGAAACGGATCCCGCCTGTGTCGCGGTACTCGATATAAGGCCGTCCATTTGCTGTAAT 
GCTGCCTCGAGCATTTTGCTGCCCTCCGCCACAGTCGCCATGAAGTCCAAATCAAAGTCTCGCTAATCCCGGGTGATCGATGTGTGGTGTGCTTA 
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TTGCGCGACTAGTGCATTGTTATGTAAGCAGGTCAGGCAGGGCACGTCATTTTGCACTGGTTTAGGGGAAAATGGCATTTATGGTTATTAATTTC 
AGCACTAATTGGGGAAAACTATCACGAAAATAGAAGCTGCTGCACCGTTCTGCTTCTTCTTCTCGCACACCAGCCCAGCCACTTATCGATAGTCA 
GCGAACGCAGAATTGTGTGTAGGAGCAACATCCGATAGTTTGCTAGATGGTCTAAAGTCTACAAAACAAATTTTAAAACAAAATTAATTTTAAGT 
AACGTATAAAAATGCCTAAACACATTTTAATTACATTTTTTGGGGACAAAGTATGCAATATATCTAGATGTGTACAGCTATATTTGGTTTTCCAT 
ACCCGATATCTACTGTACAAAAAAGATACCATCACTAATAAGCAGGTGGAACGGTGGAGTCACGAAAGATGTGTCATCTCCAGCCAGAGCCACAA 
CAAAACAAATATTAAGGATAAAATAACAAATATCTGAAATGAATCGCTTCGCTTATGTGGAGGCCCGTCTGAGTCCCAATGAATCCTTCGTCAGC 
CGCGACAACAGGGTAAAAATATACGATGGCGACCAAAAGGTGAGCAGCTAGTGAATTCGCACAATACGAAACCGGGTCTTTATTAGACTCAAATC 
CTTTCAGACGGACTTTGAAGACGGCGAAGTGGTGCTCACCACACACCGACTGTTTTGGGGTCGCCCTGGAGAGATTGCCCGGGCCGCCGTAACAC 
TTTGCCTGCCACTCAGCTATGTGATATCCGTCAGCGAAGAGACTACTGCCTCTAACTTCTTTGGCCGCAAGACTCGCATAATCATGCATCTGCAT 
CCTCCAACGTCGGACAAAGGACCCGGCCCACTGGACACCAGTCGCGCCACCCACATCAAACTGTCGGGCAAGAATGGCCTGAGTGTCGAGTTCCA 
CAGTGCCTTGCGGGAGACTCTAAATGCCCGCGTCTGGGAAATCCTCTTGACCAGCGAGATCATCATCAATGGCGTGGCCAGCAGTCCAACCTTGG 
AGCCTGCCAACGACAGGCTGGCCCGCATTCAGAAGAGGACAGGAATTGGGGGCATCGAACGGCATTTGGAGGCAAAAGCCAAGGCCACGGATGAG 
AACATAGCACTTGCCTTTCAGGATCTCAGTGTTCTAATGGCCATGGCCAAGGACATGGTAGGAGTGTCCAAGACAATAAGCAGCAAGATACGCGA 
ACAGAAGGGAGAAATATCTGACGATGAAACGGTGCGTTTCAAATCCTACCTCATGAGCCTGGGCATCGATGATCCCGTCACGCGGGACAACTTTA 
CCAGCAATTCTGCCTACTTCAGTAGCCTGGCCCGACAGATTTGCGAAATGCTGCTGGATCCTATCGAGGTAGGAACTGTATTAAAAAGTTATACT 
TTGTATTGACACTTACTTTATTACCCCTTAGGAGCAAGGTGGCATGATGTCCTTGGCTGATGTCTACTGTCGTGTAAACCGCGCTCGTGGTCTGG 
AACTGCTATCGCCTGAGGACTTGCTGCACGCCTGCGAGCAGCTTAGTGGTCCAATACGCCTGCGCAGCTTTCCCAGTGGCGCTAGAGTGCTACAA 
CTGGAGTCCCACGATGATGCCCTAATTGCCGTTGACACATTGGAGAAGGTGGAGGCTGCCGAGTCTCTGGCTGTGGAGGAACTGGCCAAGCAGCT 
GGGAATTTCACTCCTGCTGGCCAAGGAGCGATTGCTAGTGGCCGAACGTTTGGGCAAGGTGTGTCGCGACGAGTCCGTAGAGGGACTGCGGTTCT 
ATCCAAATCTTCTGCTAGGCCGGGACTAATTTCAATGTGAATGTGTGCTTGTTGTTATGGTTAATAAAGATATTTGACTTTTTATTTAAAGCAGG 
AGTTTTGAGTAGAAAGTGAATATCCTTTAAAGGTCCTTACCATTTTCAGTTCCATCGCAATGACAAGACTGAGTTCTTAATTTCATTTCACACTT 
TATTGATTAACGGAGCTTCTGCTTTGACTTTGTCGGCGCTTCATTAAGTTATTTTGGTTTTGTCGTGATTATTTTCATTTTTGTTTAGTATTCTC 
AAGAGTTTTATTCGTAATCAGTTTAATGAAAAGTTTCTACAGTCAGGTATTTTGATAAGCAGTTATTGAATTGAAATTACACATTTATATTAATA 
GTGGTTTTCGTGTTTACATATATATAAATATATTATGCTGCTGCTTAGATTGCTCCGCATTACATTAATAAATCTTTGAATTACGTTTCAATAAC 
CATTTGATCAGGCCACATTACCGCGCATGTATGTATATAACTTTCGTACTCAGGATTTTGGCCAATTGCTTGAGTAACATTTTCCCACTTCGTTA 
TGCTTTCAATCATTCATTCGGTTTACGTAGTACCATGCTTTCATTTTCGTTTATTCATCTTAAATCACATTTATTAATTAATATTAACATTGAGT 
AGTTTATGCTTGAGTGTGTGTTGGATACGGTAGGGAGTACGGGTAGTGGTGAGGTTAATTTGTGTTTTATTTCTTGCTGGCTGAAGAGGTTTATG 
C AG AGG TT TACT TT AAGT GC T GC G C AG CT G AAT C AAAC AAT GAT AAAC AAGT AC AGC AC T CT AGGCGGT AGT AAC AT AAT AAAAT C G T T AC T T AA 
ATAAATAATGTCCATCTACAAAAAAACTGGCTTAGGTGAAAAACAAATATGCTTTGTCGAAAGCGACACTTTTACACATTACTCGACTCGAGTTG 
GCAACTTTTCCATCGTCCAAGCTCATTCAGCTGCTCCTGTCTCTAGTTTCATTGTACATTGAGCTATTACAAATAAATT 
(SEQ ID NO: 1519} 

Exon: 1001,. 1179 
Exon; 1243.. 1968 
Exon: 2027.. 2404 
Start ATG: 1084 

Transcript No. : CT30019 

AC GGT GG AGT C ACG AAAG AT GT GT C AT CTC C AGC C AG AGC C AC AAC AAAAC AAAT AT T AAG GAT AAAAT AACAAAT ATC T G AAAT GAAT C GC T T C 

GCTTATGTGGAGGCCCGTCTGAGTCCCAATGAATCCTTCGTCAGCCGCGACAACAGGGTAAAAATATACGATGGCGACCAAAAGACGGACTTTGA 

AGACGGCGAAGTGGTGCTCACCACACACCGACTGTTTTGGGGTCGCCCTGGAGAGATTGCCCGGGCCGCCGTAACACTTTGCCTGCCACTCAGCT 

ATGTGATATCCGTCAGCGAAGAGACTACTGCCTCTAACTTCTTTGGCCGCAAGACTCGCATAATCATGCATCTGCATCCTCCAACGTCGGACAAA 

GGACCCGGCCCACTGGACACCAGTCGCGCCACCCACATCAAACTGTCGGGCAAGAATGGCCTGAGTGTCGAGTTCCACAGTGCCTTGCGGGAGAC 

TCTAAATGCCCGCGTCTGGGAAATCCTCTTGACCAGCGAGATCATCATCAATGGCGTGGCCAGCAGTCCAACCTTGGAGCCTGCCAACGACAGGC 

TGGCCCGCATTCAGAAGAGGACAGGAATTGGGGGCATCGAACGGCATTTGGAGGCAAAAGCCAAGGCCACGGATGAGAACATAGCACTTGCCTTT 

CAGGATCTCAGTGTTCTAATGGCCATGGCCAAGGACATGGTAGGAGTGTCCAAGACAATAAGCAGCAAGATACGCGAACAGAAGGGAGAAATATC 

TGACGATGAAACGGTGCGTTTCAAATCCTACCTCATGAGCCTGGGCATCGATGATCCCGTCACGCGGGACAACTTTACCAGCAATTCTGCCTACT 

TCAGTAGCCTGGCCCGACAGATTTGCGAAATGCTGCTGGATCCTATCGAGGAGCAAGGTGGCATGATGTCCTTGGCTGATGTCTACTGTCGTGTA 

AACCGCGCTCGTGGTCTGGAACTGCTATCGCCTGAGGACTTGCTGCACGCCTGCGAGCAGCTTAGTGGTCCAATACGCCTGCGCAGCTTTCCCAG 

TGGCGCTAGAGTGCTACAACTGGAGTCCCACGATGATGCCCTAATTGCCGTTGACACATTGGAGAAGGTGGAGGCTGCCGAGTCTCTGGCTGTGG 

AGGAACTGGCCAAGCAGCTGGGAATTTCACTCCTGCTGGCCAAGGAGCGATTGCTAGTGGCCGAACGTTTGGGCAAGGTGTGTCGCGACGAGTCC 

GTAGAGGGACTGCGGTTCTATCCAAATCTTCTGCTAGGCCGGGACTAA 

(SEQ ID NO: 1520) 

Start ATG: 84 

MNRFAYVEARLSPNESFVSRDNRVKIYDGDQKTDFEDGEVVLTTHRLFWGRPGEIARAAVTLCLPLSYVISVSEETTASWFFGRKTRIIMHLHPP 
TSDKGPGPLDTSRATHIKLSGKNGLSVEFHSALRETLNARVWEILLTSEIIINGVASSPTLEPANDRLARIQKRTGIGGIERHLEAKAKATDENI 
ALAFQDLSVLMAMAKDMVGVSKTISSKIREQKGEIS 

YCRVNRARGLELLSPEDLLHACEQLSGPIRLRSFPSGARVLQLESHDDALIAVDTLEKVEAAESLAVEELAKQLGISLLLAKERLLVAERLGKVC 
RDESVEGLRFYPNLLLGRD* 
(SEQ ID NO: 1521) 

Classification: hypothetical 



Celera Sequence No. : 142000013384 603 

TATAGTTGTGTCCGTGGCCGTGGAAATTGTTGCACTTGCCGAAGACTTCCAGATTCTCGGCGTCGCTCAATTGGGGACTGAAGAGAAAATAGATG 
CAGCAATCAATTTAAATCGAACTGAATTGGTTTGCCAGCCGTGATTGAATCAGGCAATTGAATGATTCAATTTGATTCAATCTAATTACCTATGG 
AGACGATGGCAGGCGCTGAAAGTTTCGCGACGGGTTAAAAAGGCAACAGGTTGCTGCGACATTTCAAGTCCGATTTATACTATATATCTGGATCT 
GTATCTGGATGTGGATTTATATATGCGGCTGATCAGCTGCGCGCTTTGGCAATGTTTTACAATGAATGCAAAACCTGCTTTTGAGCCGCGCTTAT 
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TTATATGGTACTTGGATCGAGGGAACAAGGAAATATAAGCCACAAATGCGGCGACTGCGATATCAACACGCTCGGAACTAGTTTCTCTGAACTTT 
TCGAGTGCGGCTAATTGCAGCTGATTAGCGATTGATGTTCCCTCGTTTTCTGTTGCAAGTGCCTATGCGTGCTGCAAAGACAGGGTGTGGTATGC 
TATAGCCATGGTAACAAACTCAATTCAGCAAGGAACTCACCTTTAATTGCTATGGATTTTCAATAATATTTACTTTTGTCTACGTTGCTCGTCAA 
CAGCTGCTCCCGATAACAAGGCAGCGCGTATCGGTTATCGGCCAGAGCTGAGCAGTTTGCCGCCACAACTGTTCTCAGTGATTTACAACTGCACA 
AATAGACTATTGTATGGGTAGAAATCCTTACACTATTACAATTTCGCTATATATTTATTTATTTCATGTTAGTTCTTTGTCAATTATCTCATAAT 
TTAAATGAGCTGATTAACAGAGTTAGCTAACAGAGTATGTAAGTGCTGTTCATCCCTGGCTCGTGGTGGCGTTGCCGGACCTCACGCCGCTCACT 
GTAATCAACCCTGCTTTGCTATCGATCGCCGCCCTTCACTAGAAAAAAAATAACAAACGACGCCGCGCAATAATTCGAAACTCTTCGGCGTTATT 
TAAAAAACGAGCAGAGATACCTAAATAATAATCTAGACATGAGCAACAAGAGCACTCCGGTGCGGCAGCGCATCCAGCAGTTCCAGACGCGAGGA 
GTCCACAAATCCACGCCGGCATCCGAGATGCGCAAGAAGGTGCTGATGACGCGGGCGGAGGATCGGGAGCACCGGGATCGTCCCGAACAGCTGCT 
GAACACTGCATTCAGCGGGAGCACGCCACAGCCGAAGCCCAAGCCCACGGCCCTGAATGCCTGCTACACACCCTCGTCGCTGTACAGGAATGCGA 
ACTCTACTCCTGGAAGAGCAAAGACGCCCGGCACCGGGAAATCCAGCTGCAGCAGGACAAAGGAGCGGGACTCTCTGATGGAGTCCTGCCTCAGC 
GTGTCCGAGGAGAGCAACATGATTGTGGCGGTGCGAGTGCGGCCACTGAATGCCCTGGAATGCACACGTGGTCAGGTCACCAATGTGGTCCAGGT 
GCACGGAAACAGCAACGAGCTGACTGTCCAAGCGGGCAGCAGTGCTGATGCCTCCGCCGGCGTGACGCACTTCTTCAGCTACGACCAGGTCTACT 
ACTCGTGCGATCCCGAACGAAAGAACTTCGCCTGCCAGGCCAAAGTCTTTGAGGGCACAGCGCGTCCCTTGATCGACACTGCCTTCGAGGGCTAC 
AATGCCTGCTTGTTTGCCTACGGCCAGACGGGCTCGGGCAAATCGTACAGCATGATGGGCATTGAAGCATTGGATGACGCCGCTCTGGATGGCGG 
ACCGCCGCACGATGAGGCGGGCATCATTCCACGCTTCTGCCACGAGCTATTCCGCCGCATAGAGGCTGTGAAAAGTCAGCAGCAGCTGCAGGTGG 
AGGTGGAGGTCAGCTACTTTGAGATCTACAACGAGAAGATACACGATCTGCTGAGTGTCCAGCATGCGGCAGCTGCCACCGGGGAATCCACGCCC 
ATCCAGCAGCAACAACAGCAGCAGCGTCCGGCTCTAAAGGTCAGGGAGCATCCCATCTTCGGGCCCTACGTAGTGGACTTGAGTGCCCACTCGGT 
GGACTCGTACTCGGCACTCCGCAACTGGCTTGCCGTGGGCAACTCCCAGCGGGCCACTGCCTCCACGGCCATGAACGACAAGAGCTCGCGCTCAC 
ACTCCATCTTCAACATTGTCCTAAACCTCACCGATCTGAGCAGTGACGATGGCCTGTCCTCGGATACGGACTCGAGCACGGCCTCCTCCCTGCGG 
CAGACGCGGCGCAGCAAGATTAGCCTGGTGGACCTGGCGGGCAGCGAACGCATTAGCGTCTCCGGATCGAACGGCGAGCGGATCCGCGAGGGCGT 
CAGCATCAACAAGAGCCTGCTGACGCTGGGGAAAGTAATTGCTGCTCTCGCCGATTCCCGCAAGGCGATCGCCAATGGACCCCTGGGCTCTGGGA 
CGCCCAGCACCTTCGTTCCCTATCGCGAGAGCGTGCTCACCTGGCTGTTGCGGGTAAGTGTTACGAAATAGTTCTTAGCTGCGTTTGTACTTATC 
AACCGGGTCAGTAGGTTGGTCTACTTTCGGCTAATGATAACTCTTTCTTATCCCCTTGGTTTTTGAAATCAATAATTGTACCCAGAAGTGATTGC 
TAAGTATGTGTTTCGAATAAGTTCAGCTCAGTAAAACCTTATAACCTCATGTCTTCAATATAAATGCATATGTATATTCGCAATGCTTTCCGCAC 
TTTTGCCACCACCGATAGGATTGCTTATCAACACTTTCGCTTCGATTAGGCAGTGGCCCACGGAATGGGGTCCCCTTGCAATCAGTAATTACCCG 
GAAAGCAATCAGTACGTAGAACTGCACTGCAACATGCAAATGACACGTTGACCCCTGCGGAAAGTACCCACAAAATGTTCGTACGGTCAAATGTT 
TACTTAACTGGCTGGCAGATATTCGTCTTTTCAACAGATCTCGTGATGAGCTATGAAGCCCACTGTCTACAGAGATATCATCTCGACTCGCGCGA 
CACGCGAATTCTGTTTTAATGATGTCTGCACTGCGAGGAGCAGCTGCTAGACAGATTTATGTTGGCTCTATCAATTAACTGGGGTGCTAATTACT 
TTAATTAAAATAAAGCGTCAAATTGTCCGAAAAAAACAAGGAAAAGAAGACGACAACATCAGCCAGAGATTAGATTGAGTTTATCTGCGCTTTAT 
AGGACTAAATATGAGAGATCTCGAGTTCTAAACTCGAACCTTGTTATCATTCGTTTGTTTATTTTGTTTGCTAAGATTGCCAGCAGACTTAGTGT 
ATTTTTTATTATTATTATTTCCTTATCATTTTCGATAGGAAAGCCTTTTATTAAGGTCTACCAAATCAAAGACCTCCCGCACCTCAATCAATTGA 
GTGATTAAAATTTTGGCCTCTTATCATATCTATTTAATCTATTGTTGTTGCTTTTCCTATGTGACCATAAAATTTATGACCTGATACAAAGGCGC 
ACGTGTAAACTTTCTAGAAATTTATTTTAAGAACGCGAATAAAAAGATAGCAGTGTCTTCACCCGGGTAATATAATTATTTTGTTATAAAACCTC 
AGCAAATGCTTACATAGCATTAGCAGGAAAATTGTTATAAACTTGTCATGAGACTTGAATGTTGTGCATTAGTTAACATATTAGCTACTTGTTTC 
ACGATTGTTTGAATTGAGTAACGACTGCGTTTTCTTTAGCAATCAATGTAACATACTTATGTGTGCATTTTAAGTTAACAACTGCTCTTAAGCGA 
ACGCCACACGTTCGTAATAACTTGATTGTTGGAATATTACCAGGCTAGTTAATCCGAAAGTGAGGGTTTTATATAATACAAGTATCTGGCTTCTA 
AAAAAAGGCGCACTCGACATTTGCGACGTAATTTATAATCATTCGACTGCATTCAATGAGTAATTACATGGCGCCCACAAGTGAGAGGCTCTTGA 
TCGAAAGTCCAAACACTTGTCTTTTTTCCCGAAACCCGTTCCACTGAGTGTTATTTAGTGTCCAGTGTCCACTTTGCTTTTCGTTACGGTAGTAC 
GGCCCATAAAATGGCTCACATGATTTATTTCCGTACGCGGCCTTTGCCATAAAAGGTGAAGACAAATATTTCTTGGAATGGCGTCGTAGGTCCGT 
AGGGGACTCTCGCAGCCAGCGTAGTGACGAAACGCACCAGATTGTGGTTGCCCAATCCATCTTTAAAATCATTGAAAGATTTAGATGGTTTAGAT 
TTAGGGTTTGTATGTTGATTGTTACTAGTTGTTTTCACCCCTTTAAAAGGAGGGGGGTTTTGTTGTGTTTTTTTTTTATATATCAGCCGACCTTG 
TTTTTTGCCTCTGATATTTTTGGGCTTCCATGGTTCCTATCTATCCGTGAAGTACTTTTCCTATCTTTGCATACGACACAAAAGTTGCATCTACA 
TCTTCACTTGCCACCCCATAGATTTAAAAATACTTTGCAGATAATGCTTTATTTGGTATTTTTGTGCGGTACTTCGGCACTAAAACATTATAACA 
CTTGATACGTTATCGCGTCTCCTAAGTTGCGACACCAACAATTTCAATTGTTGCTGTCGTTGTTGACGTTTTCCACATTTACACGCTGAAGGCTC 
GCATTTGCAATTGCCCTTGCTCAGCTTTTTTCCATCCTTAAGAGCTGCAATAACAATTTGTACACAAAATGAATAATTTATATACAGATAGTTAG 
GATGGTAACTAAACGTTTACGAAGTGTATACAAGCCAATAAAGGTAATTGATGGCAGAGTATGTTACAATTAGCAAATATGTTTAGGCCAAGTAA 
AATCTAAAGTTAACCAGGATTTGGGTACAATGTAAGTAATGGGAGATGCTTTATAATCAAAATTGGTTGGTGTTTTATAACTTTAATTTTTAACA 
TTAGTTGTGATTTGACATTTGCAGTCCGTTTTGTTTATTAGCACAGACTCGTCAATTTAAAGTCATATTTAAGTTCATTATCCAAAAAGCTCATT 
ATTCGAAACAAAAAAAGAATACTGTTAATACTGTTATTTAACTGGCTTAATAACAGTCCACTTATATGAAAATGGTTTTGCACTACGGCTAGGTT 
TTGTAAAAGAATAATTCAATTTTTGATAGGTAATTAGAATAGTTGACATATTTTGATGTAAGTATTTCACTTTAAATACCATGGCCTAGCTATTA 
AGTTCACCTTGCGACTTGACTCCTCTGGAACTTTTCAGTTTTTGTAACAAGCGGTTCAGCATTTTTTGAAATGCAAAGGGATATATTCTGTTCTC 
CAAGAAATAAGTCCTTTTTTATTCATTAAACTATAGTTTTTTCTCAATTTTTATTAGTTTTGAGAACTCGTTGCGACCTTCACATGACTGGGGGA 
ACTTTGGAACTGGAAATGCGGAGTCCCAAGCCAGCTTTCTAAATAAATTCCAAATTAACTTGCTCGTGTGCAAAGCCCTAGAAACCGCACTGAGT 
TCACAATCAGCATTTTCCGAGTTGGATTTGGAAAGCCACAAACACCGTTTAGATAGCCAGGGAATGTGGGATTTTGAATGAAATCCAGAAACATC 
GAATTGAATGCAAATGACGTAGGTTCACACGATTTCAAATTCGATTGATCTGCTGTTTCTAATACGTGTCTAATACGAAAAGCGTGCCACTGAGG 
ATACGTCAGTTAATTATTTTCGTAATTAGCAACAGGCGAGTTGTCTGTCGCCGTTATGAATTTTCAAATAAGCCACGTTTTCAGTTCACAAATGA 
ATGCTAATTATGTGAACTGAAAAAATTTGACATACGAGTAGTAACCAGATTAATTATTCACAAGATTTCGACTAACTGAGCCATTCAATAATGTA 
AATAGTAGTAGTTATAAATAGTATAGTATAGTATAGTAGTTATAAATTCAAAAGTTTGTATCTCTGACCCACGTTGGCATAACAAAATGCTTCGA 
ACGTGTGCTCTCCGTTCCAGTTTCCCGCATAACAACAATTGTAATTTAAGCCACACTGTTTAATAATATATTTAGCCCCAACCAAGTGGTTAGCA 
TTGGAATATAAAATGAGTCGAATTCGATGCACAAACAACTGAATATGTATATGGCTTCCTCCCCGCGAGTTGTTTCTCTCGAACATCGCTTTCCC 
CGTCGCCCCATTGCACATATACCCTTTTGCACACTCATGAAAAAATATGCAAATGGAATGAATGGTTTTCGTGTGGGGGGATGCGCTCAGTTCTC 
AGTTTGATCAGTTGAATTTTTTTGATTCGCCTTTCATGATTAAGGCACGGATGATCTCATCTACCCGTGCCAAACGTTCGTTGCTCTAATTCAAT 
TCGGCTTAGTTGACTGGGGTTTCATTTGGATCACTTGGGAGCTGGCATCCCTGGACATTCGCTTGCAAGGTGGGATAGTGCATGGTCAAGTTCCA 
GGTCGGTTTATTGGGGCTCGAGGTGCTTTCGCTTAGTTCCCAGTTTAGTGCCTAAAACCTGACCTTAGCCATCAAGTACTTACTTATCATATTTA 
ACGTTTTCAAGCGGCGGAACATTTTGAGTTCAGACAGACAAAAGACAGATTAACTTCAATTAGTAAACTGGTGCGAGACGGAAAGATTTCCGTAA 
AGATTTGCATAAGAAATTAAGACTTTTATTTGGAAAATTTAAGTCCAAACTGACCCACCTCAACGTATGTAATTGGAACAGGTGTCTAACTTTTA 
GAAATCGTAGCAACTATGAATGATATCTCCTTTCTTTTAATCGTAATTTCCTCAAGTAACCCTCAAGATACAAGTACGCATTTTAACTTAAATTT 
TTCAATCATTTGTCGTTTTATTCGGTTTACATATAATTTAAACTAACCATTAATATATAATATATATGATATATAACATTGTATAAATTGAAAAT 
TTTCTTGGAACCTGCAAGCAAATTTTACACAAATCATTACATTTTGATCAATTACTTAAATGACTAATGCAATTGTTAAATCGTTTTTATAATAT 
TTCCGTCCGTCTTTTTGCACAGCTGAGTCTGAACACCAAAAATTAATAACAATTTCTGTTCTGTAACGACAAGTTGAGTTAACAAAATGTAACCA 
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GGCATAAACTTTTGAACAATTTCAACTTTTACATACATATTTCGATTTGATTTCATTTTGACACTTGCTCGGACTGTGCAACGCTCAGTTGGTAT 
TACAATAAATATAGGGGAATGTCTAATATAGACAAGGATACAATAAAAATCGGGGTCCAAAGTATGTACTACGCATATATCGAGAAAATAACAGT 
TGCAAAACCTTGATTGATCGTTATAGTTTTCATTATTTCTTTCTTATATGGTAATACCTGTAACTGATCCAATTGGGTCCAAACTAAAGCAACGA 
AAATAACAAAAATTTCGCTTAAAAATTTTCTACTGCAATTTACTCAGTCTCGTATGCAAAATAGAAAATTCAGTTTTTGCTTCCCTTGTGTTACA 
TATATAATTTTTTTTTTTTTTTTTTGCCTAAGGTCAAAAAATGCGTTGGTCAAGCGGGGCAGGTTTTCGTTACTTGATTTTTTCGTCGTGCGCTA 
TTGTTTGTCGGTAAACAAAGTGCCGTTTGTAGTTGTACATAACTGCGAAGTTGGTGTAAAAACGAAGGGCTAGATAATGAAAAAGGTAGTTGTAA 
ATTGGTATCAAGTGGTTCTGGCTTTTGTAAATGGTAGTGTATTCGCAACGCGGAAATGCTGGGCAACCATTTTAAAAAAAATAGGGGTACAGTTG 
CCTTAGATTATTGTAGCTGCAGGCTGCTGTCTCGCTCTCTCGCGCAATCTTCGTCTTCGACAGCTTTTTCTCTTTTCTCCTCGGGTTTTCTTTGT 
TTTTGTTTTTGTTCTTACTCTTTCAGGTTTCCTTTCTTCTGGGTTAGGTTACACATCTGTATACATATATGGCATTTCGCGAATGCTGCGGGCAC 
CGTCCGCAGTAGATTCGTCTCGCTCCCACTCACACCTTGCAGCTTTGCGGGGCGGCAAGGACGGTCGACTGGCTGTGATTGTGGGTGTGCTCCAG 
CTCATGCTCGTGCTCGTGCATGTCCTCCTCGTCCTCGCAGATGCTGTCATGGTGCTTCCTTTGCTCCCGCTTGCGCATGTAGTTGATGAGCGGCC 
GCTTCTTGTGCTCCTTGCAGAATGGGCAGTCCTTGCGCATCTGCTTCTCGGACTTTATCTGCTTGCACTTTTCTGCAATGGGAGCAATGTGATAG 
GTCACCGGTTAGTGAAGGTCGGCACTGAATGTTCACCCTCGTCTGTTGTTTCCCGCATTCGCTTCCACTCTGCCTTTTAGCCTACGTGCCCTTCT 
GGGCCGTCGAACAGCTGATTTGACTACGTGACGTCGCTGCGTCTACGTTCACGACGTCACGTCGCCACCGATGGCGCCCCTCCCCACGCTCGCAC 
AATTTTTCCCGCTACTCGACGTGTGACCCTGTGACGGAGGCAGCTATATATTTAATATTATCATGCACTGGATTTTAGAAACCTATTTTAAAAAG 
TTGAATCTATCTAATCTTTGGTTTTAATGAAAACATTGTTGTCTTTCGACGACTGTAACACAACTACTACATTGAGCACAACTGTACTTACAGCT 
TGTGTGCTTTTGTGACCTGCCTCCGTTAATTAAACCATTGCTTTGATTGCTCAATTGACCGTGTTAAAAGCTAAACGGTTATTATTGATTAAAGC 
C C AGC C GAT C C AT G AGGAAAGT CT AT T T AT AGCC GCC G AT AAGC G AC CT T GC AT AAT C TC T AAAT GTG T TC C AC AC T GGGGGAAAAAAGT GC T T T 
ATAAACTGGATATTTACACACATATGCATGCTACCATTCAAATACGAATTGAGTTGTGTATGTACAGACATGTGAATCTAAAGTTTAAAGTAAGC 
TAGATATTTGCCTTGCATAATTTTTCACTTGTTTTTTTTTTATTTCTCTGCAACCCGATAATGCACTACGCGTATATGTATACTATATGGACGAA 
AACAGCACGAAGGTTTAACATTTACACACCTCAGTTTCGTTATTGCGCTTCAGTTTTGGCACTCAAACACAATGAAACCCTGCTTTGTTTGTTCT 
TCCCTATAAAAGAATCCATTGTGCGACCACCTGCTTAACTTTTGAGAGCCGATTCTGGAGGTACAGCCAAGTCTTAGTGCTTGCTTAGTTGTCTG 
TGTAATTAATAATCGTAAAGTATTCGAGTGCAGAGTGTGTCTTAATGGACTTGAGTAGTGACCTCGATAAGAGCGGGTTTAACCACCTCTGAAAT 
TGCCACAAAAAGTATATCTTTGCCTTTTATGATTTTCCACGAGTGACAAAGTAACGATATACTCTTTTTGTATGCCCATCTGAATTGCGCTTTAA 
CTTTGGAATGCCGGAAATTCGTAAGGAACCTTGTGGGCTTTTAGCCGCTTTGTGTGTGCAATTAAGCGCGAATATCTTTAGCACGTGGATTCAAG 
CCGTAAATAAAAACAAAATCGAGGGGGTGTGGAACCTACTTGATAAGGTGACTTGCGCCGCGAAACTTTGGACACCCTGTGCTTGTGCGCAAAAA 
TTAAGGCGTGTTCTACACACACACACACGCACACACACACTTATGCACATGGGGCTAAGAAATCAGGCTGGCTATTTATAAATGGTTGCAAAAAC 
AACAAAACAACACACGCAATACACAAAAACAAGTGAGAAACGCGTCTGCTTTCAATCTCGCCCCTTTTTTGCCAGTGCATTCTGAGCGCTATCTG 
TTTTTTCGCCAAAAATTACCTTGCACTCGACTGTAGCAGTCCTTGACCGCGTGTGAGGAAGTGGTGGACTCCAGGGTACCATAGGGGAACTCCTC 
GTAGTCGCGGTAGTAGCCGCTGCGCATGGACCCGCCATATGCCATTGTGGTGTCGCTGACGCAGAATCCCCAAACATCCATGTCTGTGCTGTGCG 
GTGTTTCGTTACTATTTTTGCGAACTGCAGTGCCAATCAACGGGATGTGAATTGTGTTTGTAATGTGGGTGTGTGTTTTTCTTTCTTTCTCTCTT 
TTTGCGCGCGTTGTGCGAGGTCGTTTCACTTTGTCTCTCGATTGGCTCGGGCGTCCAATGAACCTGGCTTTTAAAATCACTTTGTTTTGTTTTTT 
TAACAACGCTTTTTGGTGCACTTTTGCGTGGAGGTGACTCGCAGAGGTTTTCCTTCTTTCTTCTTCTTGAGCTGCAAATCTCTCGCCGAAATTCG 
ATGTATTCTGTTGGAGGTCTCTCGTGAAACTGAAAGCCGAACACTAGAATTACTGCATTTTTGTAGCTCTCGTGCGCTGGCGATACTTCGGCTCT 
TGTTTTGCTGCCCACTTCGGGTCTCCCCTATATTTACTTCTCATGTGCCTACGTGTGAATGTGTGTGGACAGTCGCGGGAAAACGTTTAGGGCAC 
GTGTATATGAATGAATGGACTTTAATTTGTTTTAAATATTCTTTAATGTCTGTAGTACTTGGAATAGGTGTATATACTATGCTGCCCGCACTTGT 
ACTAAAATATTTGTGTGCCTGGTTGTAAACACGTATGTGTGGCATTGTGTTTTGTTGTGTTTTCGCGTCTCGTGCTTTTGTCAGAGTTTCACCTT 
CCACCCGACCCCTCCCCTTTCGAGTGCCACACACCTCTCGCTTGGGGCACATTGGTTCGCTACTCTCGAAATCACCTCTGCCGTGGAAGCAAGCA 
TTTTCCAAGTGATTCACCCTGTGTTAGTTTGAAAATTTTTAATAAATATTAATTAGCCGGGGCATGAGGAATACTCTTGGAAGTTTTTCTTATTA 
GCGGCGAATCGCTAATGCCGCGGAGATGACGAAAAAGTAATCATGAGAGTGATCGGACGGATGTTCGACGGCTGATCATCTGGAGGATGGGATGG 
ACATTTGTGTTTTTATAAGCTGCGCTAATCACGGTCTAGACTGCAGGCCGACCATGCCGTGATTTAGCCTTATTTCTGAAACCAAATTTGGTTGC 
CAAGCGGATTGAACACTTTTGAGAGCCCAAATTGGAGTGTTCTTTTTTTTTATAAACTTTCCATGGTCAAAATGGACTGTATTTCCAGCGTGTGG 
CATTCTAGGCGTGTAAACCACCAGCTGCACTTTTTCACTACTTTGCCTAGAGGTTTTTGAACATTTGAAAAATTTGTTTAAATCTAGTGTTAATT 
GGCAGTCTACGTCATTTTAGCCGGTTCCGATCAAGGTTTTGCTGTTTTGGCTCAAAAAGCATACCGCCTATATTAGCGCCTTCCTGCGGTTTATG 
TTTTGGTGCACGCTCATTTCAGTTACTTTTGAGGAGATCGTGACTGATTTCTAAAGGTGGGAGGACGTAGAGGGCTATATTTAGGAAAAAAGATT 
TTTTTTAATGTGGTACGCTTTGGTACGTGAATGACTAACAGAGAGTGTCGCCATTAATTGTATTAATTTGCGGTTAACGGACCACGGACCCAGAT 
TCTGACCAGACGACACTGACGGGCCCCTAAGGTCACCTATGGTAGCTGTTTTTCAAGGCATATTTCTGCACTTGCGACCACTGTGCACGCCTACG 
CCGCTGCTGCTGCGCAGTTTGTGTAATCAACCATCCTCTTTTCGGTGCCCGCTCTTGTTTGGCCCACGGTGCTGACCTGCTTAATCAAACAAGTG 
TCACTTACGCACACTGCCAAGAATGTGTGCTAACTAGCTAATACGAAGCTTTATTTGAATACATACATATATACATACGTATGCATACACTACAC 
TACAAAAAAAAAAAAACTATCAGATAAAAATGCAACGGATGGCCAATGCATTTGTAATTAGGTAAACAAATATCTTACTAAACGAATATGTAACC 
AACATTTTGAAAGCACTGCGATGCACCAATTAGTGAGTTTTATTTCTGAGCCTATTTCTTCCGTGTACTTGCATGTGGATGCATTTTGTTTGCAC 
AACAACTTTTCGATTCCCGCACAGCGGTTGCTGATAACGACCATCAGTAGGTGCCTTTGTCACGGTAGCTTTACGCAAACAGTTGGGCGTTGTCT 
TATCATTTCGTCCCTCCTTATCGCACCGAGTCCACAATCGAAGCACACACTCGAAGGTTTAGATTGTCAATTTTCCATTCCTTGAAATTTTCACA 
CATCTGTTCTGAGTGCTTTCGAGTGCATTATAGTGTTCACCGATGATCGATAATCATCGTACCTCTTGCATAGTGGGGTATATTCCTTTTTGCCA 
CATATGTATGCATATAGGGTTGGTGTGAAAAACCACTTTCCTTATCAACTTGGGTGCCCATCCAGTTCTTTAATTGATTGGAAAATATGTTTATT 
GGGATTTCTGTTTTATTTTCTTCAGTCCATTAAATCATTATTATCACATTGAGATTAAATGGACCATTTCCAATAACAATTCCATACAGGAACCA 
TTAGGATTATATATATCGGTTTATGTTGTTCCTTAAAAGATATTCAGCCGACTCTATAAGCTACTAAAAGTAGTGCATTCAAAATGCCTGATTTA 
TTCATCAATGAAATGTTGTTTCGGTTTTACGTGCTAGATTAAAATAAAATAAAATGCATATAAAAATTATATTTAGGTTTATACTACTGTTACTG 
ATTCACATGCCCTTGTTTGTGATTTATTTCATTTTAAAAGGTGTTAAAAAATATTGTTATGAATGTGAGAAAGCCAAGCTACACGTTGTATACCA 
CAGCCATTTGCCATATTGACCCTGCCAATAATAAATAACTTGATTTACCCCAGATATTAACATCAAAACAATACAATTACTCACCCAATCCAAAG 
ATGCATCACAGAGCTTTTAATATAACTGACGTTGTTCCTTCATTCCAGGAAAACCTCGGCGGGAACTCGAAGACGGTCATGTTGGCCACCATTTC 
ACCGGCCAGCATCCATGCGGATGAGACCCTAGCCACTTTGCGTTACGCCTGCAAGGCGCGTTCCATCGTCAACCGGGTGAAGGTAAACGAATCCC 
CCCACGACAAGATCATACGGGATCTGCGCGCCGAGGTGGATCGCCTGAAGTCGCTGAGGAACGAGTACGAGCGCCAGCGGCGCTTGTCCGGCAAT 
AGTAACAATCCCGTGCCACGTAAGATCATCATTGAAACCTCCGTGGACGAGACCGAGGTGGAGGCACTGCGTCAGCAGCTGGCAGAACGGGAAAG 
GGAGCTGAGTCGAGCGCAAAAGTCATGGATGGAGAAGCTCAAGGAGGCGGAGGATCAGCGCAAATCGGAACTTCGTGTCCTGAAACGCCGGGGAT 
TGGCTCTGGAACTAACTGCTGAGCAGAAGCAGGCCTGTCTGGTTAATTTAACAGCGGATCCCATACTAAGTGGCACCCTGTTCTATCTGTTGCCC 
CAAGGACTAGTGCGGATAGGACGAGGTCGTCTGCCAGGAGGGAGTTCTAGTTCGCAGCCTGATATTGTACTGGATGGTCCACTGGTGGCCTTGCA 
GCACTGCAGCATCGAGCATGAACGTGGTGGCAAGTTGTATGTCATCCCCGGCAGCGAGGACTTCGAGACCTACGTGAACGGAGAACTACTGAAGG 
ATCGTCGCCAGCTGTTTCATGGCGACCGTCTCGTCATAGGTGGCTCCCACTACTTTCGCATTTCCAATCCCTTCTGCTCGCAAAGAGGCAAGGCC 
GATCATCCGGTAGATTTCCAGCTGGCCCATCAGGAGATTTTGCAAAAGCAGGAGCAGCAACTTCGCTCCGAATTGGAGGCGGAAAAGCGAGCCGC 
TCTCACGAAGATTGAACAGGAGCGGGCGCAGCATGCCCGGGACTTTGAGGAGCGTCTGCAGTGCCTCGAACTAGAGCAGTTCAAGTACAAATGCA 
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ACAGCGAGATGCTGGAGACGGAACGCCAGGCCTTGGCCTTGGCCCAACAACAGGAACACACTCCGCTAAGGCATGAAGATGCAGTATCAACTCCA 

GCACAGAAGTCGACTATACTGGAGGACATTCAACGCATCATGCTGAATCCCTCGGAGGAGAGTCTTCACAAGACCCAGTTGATGGTCAAAGAGGC 

CACCCAGCGTTGTCGCCAGTTGGACCTGCCATTGGAGTTCCGGCAAACTCAGACGCCAGATGAATTTGGTCTCCTGCGCACTGTTATCTTGATTC 

TGGATAAGCAGCGGGGATTGAAGGCCGAGTGGCCCACAGCTAGACTGGGTGTGTGGTTGGATTTGGTCAGGGATAATGCTGAACAGCAGGAAAAG 

CTGAACGCTACAACCATTTTCCAGAGTGTTGAGGTGGATTGGGAGCCACTGGACGCGGATCTCAATGAGACACTGAGTGACACACACAACTCCAG 

TCGCATTGCCCTTAATCTGAGTGCCATGAAGGATGTTCTGTTGAATAAACCGCTGAAACGCCTGCTCAACGCATCCAGCAGCAAGAGCAATACAC 

CGCGACAGACATCCACTCCCTCACCTGGAAGCCAGCTGGTGCACTTCACCAAACGCCTGCTGACCTACGATCCGGAGGAGATACCTGAAAGCCAG 

AGTAGTTTCACAACGCTCGTCCAGCAGGAGCTGCAAATCATGCAGCGATCCGCCCAGCGTTTACGGCGACACTGCGAGGCTGCCCTACTCGAGCG 

TGAAAACCATCAGGACAACGGCGTCCTGGCCGTCAGCCTGCAGGAGGCTCTGGCACGAATGGACACCGTGCTGAACGGGATGCACACTTCGTTGG 

CGAGAGCGGAGGCCACTGCATGTGTCACCTCGCCGACGAAAACACAGAAGGCTGTGCGCTTTCTTATCGATTGACGGAGATTAGTTCGCACCTTG 

GACTAGATTTTAATGCTGCAATGTGCTTTGTGTTGTTTTTATGTTTTTAAATATATATTTTTTGTATATTGATTTTTAACAATAAAATGTATTGA 

TAGCACCTTCAAGTGAAACTGGCTTTTGGGAAATATTTTAAAGTGGCGGTACAAAAAAAGACTTTTCAATTAGTTTATCAAATAAATCACTAGCT 

TTAGTTTGTTTTTGTAATGATTTTCTGGTCTTACATAACTTAATATCCAAATCATGCTTGTACGTAAGTGACATAATAGTTTGCTCTGGCATTAA 

AATAATTATTTACTTCATTATCACCTCTTCGAAAGTTTATCACGGTTTTCGAACAACTTTGTGTCAACATCCAAATATTGGTTTATAAAAATATT 

AAAAGTACCGCAGGCTAATAACCATGTGTTCCCTCTGCCTTTCTTGTGGAAATCTTCAACCTTGGGCCTCCTCCTCTTCCACGCCTTCAGTGGAC 

TCCAGCTCCTTATCGCCAGCCAATTGCTCAAGTAGTTTTGTGTTCAGATCAACTACCCAATCGCAGAACTGTCCGCCCTTTAGGCTTACGAAGTC 

GGCCAGGAACACGTTTACCCTTGGCTCGTCCTTTGGCTCGAAGGGACCTGGAATCTGTTCCTGAATCCAAGGCTGCAGCTTCTTGTCCACCCGTT 

TGGCTGTGCTCTTCAGGGTGAAGAAGAGCCGGAAGGCGATATAGCGACCAGTGGGCGTGATAAGGCACTGGGAGACATAACCCTGTTGGGGCTGT 

CGCGAGAGTAGGGAATCCTCCAGAAAGGACTGCAGCTTCTTGACACTCGCTTTATTGGGCCACGGTGTGGGCCACGCATAACTGGGCCAGAATCT 

CAGCGGCAGTGGAATGGGACATCGCCTGTAGATGATCACCACGCTGCGCTGCATTTCCAAGCATCTATTCAGAGTACAATCCTTGAGTGAACCAT 

CCACTGTGGAGTACAGACGATGGGCGAAGAACTGGATCAGATCCTTGTGCAGCTTCTGGTGATGAGCCACTGTCATGGCGTAAAAGTGCTGCAGA 

TCCAGGATTACCACCTCCTCTGGATGGGTGTCCACGAACTGGCGAATCTCCAGAAGTGGCTCAAAAAT 

{ SEQ ID NO: 1522) 

Exon: 1001.. 2523 
Exon: 12874.. 14933 
Start ATG: 1084 

Transcript No. : CT30035 

TAAC AAAC G AC GC C GC G C AAT AA T T C GAAAC T CT T CGGC GT T AT T T AAAAAACG AGC AGAG AT ACCT AAAT AAT AAT CT AGAC AT G AGC AAC AAG 

AGCACTCCGGTGCGGCAGCGCATCCAGCAGTTCCAGACGCGAGGAGTCCACAAATCCACGCCGGCATCCGAGATGCGCAAGAAGGTGCTGATGAC 

GCGGGCGGAGGATCGGGAGCACCGGGATCGTCCCGAACAGCTGCTGAACACTGCATTCAGCGGGAGCACGCCACAGCCGAAGCCCAAGCCCACGG 

CCCTGAATGCCTGCTACACACCCTCGTCGCTGTACAGGAATGCGAACTCTACTCCTGGAAGAGCAAAGACGCCCGGCACCGGGAAATCCAGCTGC 

AGCAGGACAAAGGAGCGGGACTCTCTGATGGAGTCCTGCCTCAGCGTGTCCGAGGAGAGCAACATGATTGTGGCGGTGCGAGTGCGGCCACTGAA 

TGCCCTGGAATGCACACGTGGTCAGGTCACCAATGTGGTCCAGGTGCACGGAAACAGCAACGAGCTGACTGTCCAAGCGGGCAGCAGTGCTGATG 

CCTCCGCCGGCGTGACGCACTTCTTCAGCTACGACCAGGTCTACTACTCGTGCGATCCCGAACGAAAGAACTTCGCCTGCCAGGCCAAAGTCTTT 

GAGGGCACAGCGCGTCCCTTGATCGACACTGCCTTCGAGGGCTACAATGCCTGCTTGTTTGCCTACGGCCAGACGGGCTCGGGCAAATCGTACAG 

CATGATGGGCATTGAAGCATTGGATGACGCCGCTCTGGATGGCGGACCGCCGCACGATGAGGCGGGCATCATTCCACGCTTCTGCCACGAGCTAT 

TCCGCCGCATAGAGGCTGTGAAAAGTCAGCAGCAGCTGCAGGTGGAGGTGGAGGTCAGCTACTTTGAGATCTACAACGAGAAGATACACGATCTG 

CTGAGTGTCCAGCATGCGGCAGCTGCCACCGGGGAATCCACGCCCATCCAGCAGCAACAACAGCAGCAGCGTCCGGCTCTAAAGGTCAGGGAGCA 

TCCCATCTTCGGGCCCTACGTAGTGGACTTGAGTGCCCACTCGGTGGACTCGTACTCGGCACTCCGCAACTGGCTTGCCGTGGGCAACTCCCAGC 

GGGCCACTGCCTCCACGGCCATGAACGACAAGAGCTCGCGCTCACACTCCATCTTCAACATTGTCCTAAACCTCACCGATCTGAGCAGTGACGAT 

GGCCTGTCCTCGGATACGGACTCGAGCACGGCCTCCTCCCTGCGGCAGACGCGGCGCAGCAAGATTAGCCTGGTGGACCTGGCGGGCAGCGAACG 

CATTAGCGTCTCCGGATCGAACGGCGAGCGGATCCGCGAGGGCGTCAGCATCAACAAGAGCCTGCTGACGCTGGGGAAAGTAATTGCTGCTCTCG 

CCGATTCCCGCAAGGCGATCGCCAATGGACCCCTGGGCTCTGGGACGCCCAGCACCTTCGTTCCCTATCGCGAGAGCGTGCTCACCTGGCTGTTG 

CGGGAAAACCTCGGCGGGAACTCGAAGACGGTCATGTTGGCCACCATTTCACCGGCCAGCATCCATGCGGATGAGACCCTAGCCACTTTGCGTTA 

CGCCTGCAAGGCGCGTTCCATCGTCAACCGGGTGAAGGTAAACGAATCCCCCCACGACAAGATCATACGGGATCTGCGCGCCGAGGTGGATCGCC 

TGAAGTCGCTGAGGAACGAGTACGAGCGCCAGCGGCGCTTGTCCGGCAATAGTAACAATCCCGTGCCACGTAAGATCATCATTGAAACCTCCGTG 

GACGAGACCGAGGTGGAGGCACTGCGTCAGCAGCTGGCAGAACGGGAAAGGGAGCTGAGTCGAGCGCAAAAGTCATGGATGGAGAAGCTCAAGGA 

GGCGGAGGATCAGCGCAAATCGGAACTTCGTGTCCTGAAACGCCGGGGATTGGCTCTGGAACTAACTGCTGAGCAGAAGCAGGCCTGTCTGGTTA 

ATTTAACAGCGGATCCCATACTAAGTGGCACCCTGTTCTATCTGTTGCCCCAAGGACTAGTGCGGATAGGACGAGGTCGTCTGCCAGGAGGGAGT 

TCTAGTTCGCAGCCTGATATTGTACTGGATGGTCCACTGGTGGCCTTGCAGCACTGCAGCATCGAGCATGAACGTGGTGGCAAGTTGTATGTCAT 

CCCCGGCAGCGAGGACTTCGAGACCTACGTGAACGGAGAACTACTGAAGGATCGTCGCCAGCTGTTTCATGGCGACCGTCTCGTCATAGGTGGCT 

CCCACTACTTTCGCATTTCCAATCCCTTCTGCTCGCAAAGAGGCAAGGCCGATCATCCGGTAGATTTCCAGCTGGCCCATCAGGAGATTTTGCAA 

AAGCAGGAGCAGCAACTTCGCTCCGAATTGGAGGCGGAAAAGCGAGCCGCTCTCACGAAGATTGAACAGGAGCGGGCGCAGCATGCCCGGGACTT 

TGAGGAGCGTCTGCAGTGCCTCGAACTAGAGCAGTTCAAGTACAAATGCAACAGCGAGATGCTGGAGACGGAACGCCAGGCCTTGGCCTTGGCCC 

AACAACAGGAACACACTCCGCTAAGGCATGAAGATGCAGTATCAACTCCAGCACAGAAGTCGACTATACTGGAGGACATTCAACGCATCATGCTG 

AATCCCTCGGAGGAGAGTCTTCACAAGACCCAGTTGATGGTCAAAGAGGCCACCCAGCGTTGTCGCCAGTTGGACCTGCCATTGGAGTTCCGGCA 

AACTCAGACGCCAGATGAATTTGGTCTCCTGCGCACTGTTATCTTGATTCTGGATAAGCAGCGGGGATTGAAGGCCGAGTGGCCCACAGCTAGAC 

TGGGTGTGTGGTTGGATTTGGTCAGGGATAATGCTGAACAGCAGGAAAAGCTGAACGCTACAACCATTTTCCAGAGTGTTGAGGTGGATTGGGAG 

CCACTGGACGCGGATCTCAATGAGACACTGAGTGACACACACAACTCCAGTCGCATTGCCCTTAATCTGAGTGCCATGAAGGATGTTCTGTTGAA 

TAAACCGCTGAAACGCCTGCTCAACGCATCCAGCAGCAAGAGCAATACACCGCGACAGACATCCACTCCCTCACCTGGAAGCCAGCTGGTGCACT 

TCACCAAACGCCTGCTGACCTACGATCCGGAGGAGATACCTGAAAGCCAGAGTAGTTTCACAACGCTCGTCCAGCAGGAGCTGCAAATCATGCAG 

CGATCCGCCCAGCGTTTACGGCGACACTGCGAGGCTGCCCTACTCGAGCGTGAAAACCATCAGGACAACGGCGTCCTGGCCGTCAGCCTGCAGGA 

GGCTCTGGCACGAATGGACACCGTGCTGAACGGGATGCACACTTCGTTGGCGAGAGCGGAGGCCACTGCATGTGTCACCTCGCCGACGAAAACAC 

AGAAGGCTGTGCGCTTTCTTATCGATTGACGGAGATTAGTTCGCACCTTGGACTAGATTTTAATGCTGCAATGTGCTTTGTGTTGTTTTTATGTT 

TTTAAATATATATTTTTTGTATATTGATTTTTAACAATAAAATGTATTGATAGCACCTTCAAGTGAAA 

{SEQ ID NO: 1523) 

Start ATG: 84 
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MSNKSTPVRQRIQQFQTRGVHKSTPASEMRKKVLMTRAEDREHRDRPEQLLWTAFSGSTPQPKPKPTALNACYTPSSLYRNANSTPGRAKTPGTG 

KSSCSRTKERDSLMESCLSVSEESNMIVAVRVRPLNALECTRGQVTNVVQVHGNSNELTVQAGSSADASAGVTHFFSYDQVyYSCDPERKNFACQ 

AKVFEGTARPLIDTAFEGYNACLFAYGQTGSGKSYSMMGIEALDDAALDGGPPHDEAGIIPRFCHELFRRIEAVKSQQQLQVEVEVSYFEIYNEK 

IHDLLSVQHAAAATGESTPIQQQQQQQRPALKVREHPIFGPYVVDLSAHSVDSYSALRNWLAVGNSQRATASTAMNDKSSRSHSIFNIVLMLTDL 

SSDDGLSSDTDSSTASSLRQTRRSKISLVDLAGSERISVSGSNGERIREGVSINKSLLTLGKVIAALADSRKAIANGPLGSGTPSTFVPYRESVL 

TWLLRENLGGNSKTVMLATISPASIHADETLATLRyACKARSIVNRVKVNESPHDKIIRDLRAEVDRLKSLRKEYERQRRLSGNSNNPVPRKIII 

ETSVDETEVEALRQQLAERERELSRAQKSWMEKLKEAEDQRKSELRVLKRRGLALELTAEQKQACLVNLTADPILSGTLFYLLPQGLVRIGRGRL 

PGGSSSSQPDIVLDGPLVALQHCSIEHERGGKLYVIPGSEDFETYVNGELLKDRRQLFHGDRLVIGGSHYFRISNPFCSQRGKADHPVDFQLAHQ 

EILQKQEQQLRSELEAEKRAALTKIEQERAQHARDFEERLQCLELEQFKYKCNSEMLETERQALALAQQQEHTPLRHEDAVSTPAQKSTILEDIQ 

RIMLNPSEESLHKTQLMVKEATQRCRQLDLPLEFRQTQTPDEFGLLRTVILILDKQRGLKAEWPTARLGVWLDLVRDNAEQQEKLNATTIFQSVE 

VDWEPLDADLNETLSDTHNSSRIALNLSAMKDVLLNKPLKRLLNASSSKSNTPRQTSTPSPGSQLVHFTKRLLTYDPEEIPESQSSFTTLVQQEL 

QIMQRSAQRLRRHCEAALLERENHQDNGVLAVSLQEALARMDTVLNGMHTSLARAEATACVTSPTKTQKAVRFLID* 

{SEQ ID NO: 1524) 

Name: KINESIN MOTOR PROTEIN 
Classification: motor_protein 
Gene Symbol: neb 
FlyBase ID: FBgn0004374 



Celera Sequence No. : 142000013384351 

AATGTTTTTTTGTATTGAAAGTATATTTAAAATATCTCAAGTTCAAACTTTGATTTTTGTAAGAAAGATTACGCTCTTCGTTATTTTTTGACATC 
TAGCCCCATTATTTTACCACTAAGTATGATCGGATTTTTTAACAATGTCAGAGGAGCCAATTTTCATCACCCACTCATGAGGTGCCCAACATATG 
AATATGTCAACTTTTAGTCCCACCGAAACCAACGTCTTAGACCCGGCTGTGATAATTAGCGCAATTCGTTTTTCGTCTTCTGCACTCGAGCGCCT 
TTCTAGCTATCTTAATTTATGTACTTACGTTTAGCACAATTTACGTTTATTTTTTCTGAAACTTTAGCAACCTCCGAGGCTCTCAAACTGCAAGT 
TACAAACACAATCCTTTCGCTTTCACGCTCTCTCACAAGCACACGTCCACACATTCTAGTAATCTAAGCCAAGTTTTTATAAATATTGTAATTAT 
AC AC CTG AAC ACGC AC AC AC AC TTGCAC AC AAAAAGAC A GGGT G AAG AC CT ACC AAACC AAAAAAAAAAAAAAAC AC AAT T T T C AAAGGAAT AAA 
TGTTTAAGAAGCCAAAGAAATTGTGCGTGTTTAGCTAATTTTCCCGATCGCAACCAAATGGAGAAAAGTAATCGGCGAGCTATGCAAAAACTAGT 
ATATTTCATTTCATAATACAATAATTTATGTACGCTGGCGTACTTTACATATCTTTGTGTTTCCTTTCGTAGTTCGTAGTTCCAGTAGTATAAAA 
ATTGTTCGAAAAACACATACGAAAATGATAGTGGGGCAAAGTAGTTGAATATCCATGATATGATGATTAGCAAACACTTTTGGCTTAAAATTAAA 
GACATGACATGAGGTGTGCGGGGAGGGCGGGGAAAAAGACATAAATATCTAAATGTAAACTGTTGCACTTAAATGAAATGAAGCCAGGGCGCTGC 
TGGCTGCTCTCCCTGGATAAGTATAAAAATATGCTTCACGGCCGGCGCATTTAAATGTTTTCCACGTGCCACACCTTGGTGTTGCAGTCCTGTCC 
GGTGGAGATCACAGTGTTATTGTCCAGCCAGACCAACCGAGTGATTTGCGACTGCGGATGGGCATCTGGAGTGGGCAGAAAACAATATGTCAGAT 
ATCGCTTAAAGCTTACTACTCTTTTTGAAATACTTACTCTTGATGATAGTATGTTTGGCCGGATTGGCCACGGACCAGATAATGATAGTAGTGTC 
CAATGATCCACTAGCAACCAGCAGCGAGTTGGGCGACCAGGCCACCGTATTTACGCGAGCACTGTGGAAGCCCCACTCCTTGTTGTGTGCAGGCT 
TTTAAAATACACTCGATTAGTAATGGAATTAAGACATTTATGATTCCTTAATACCTTGTACTCCTCCACTGAGTAGAGCACCACCTTGCGATGGG 
CATCGCAAGCCACCAAATACTTCAAGTCCGGCGAATAAGAAACATCCGTGACAGCTCCCAAGTGATCCAGCTCCACTTTGGGCTCCAATACTCCG 
CCCTTCAGGGTGTAGATGTGCAACTTTTGGTCATCCCCGCCGACAGCAACATCGGAAGTGTCCGCATTAACGGCAATCGAGCTAGCCTCGTACTT 
AATGGGCAGCGAAAAGATCTTCTTCTGGTCCTGCACGAGAGTCAACTCTTTGATGCAAGCAAGTGCAATGATGTTCTCGTTGCGCAAAATGGCCA 
AGCCACGTGGCTGGCAGTTAAGCTTCACCACATAGTCCGTGTAGGAGTTGCCCTCCACGCTGAACTGGCGTAGGCTGTCGTCGATGCCACAGGTA 
TAGACAAAGTCACCCCAGGCGGCAATGCCGTTGATCTGGTTTCCGTGTCCAGTGCCCGTGATGCGATCGTTGGTGCCGCTGCCCGAGTTCCAGTT 
GGTCACCACGCCATCGTGGCTACCCGTGTATATGGTGCTGCGGTCATCGCTCAGTCCCAGCACGGTGATGGGCTTATTGTGGCCCTTCACCACAC 
GCAGCGGTTTCGAGGGATCCGCCACATTGAGGTAGGTGATCACGCCCGAAAGCGAGACGGTGATTAGGTTGTCTCCTTGCCACAAGCAGGACACC 
TGCTGGTCGTCCACAGTGGTGCCCATCACAAACTCGCTAACCAGCTCGCGCGACTCCACTGTCCAAAGGCGGCACGTCTTGTCGCCGGAGCAGGT 
TAGCAGTTGGGTGCTGTCTGGTTTCCAGGCCAATGCGTAAACGCCGCCCTTGTGGGCAGGAGATCCAAATTCGCCCACCAGTTCAGAGCTGGTGC 
CATCGTACAGAAAGACTTTGCCATCAAAGCCGGCGGAGGCGAAGAATTTGCCATCGGGCGAGTAGCGCACAGCCTGGACGAAGCGAGAGTGATCC 
TGCTTGGTCATCTTGAACTTGAACGGGGGTCCCTCGAACACGGCGATAGTATTGTCTTCGCTGCCTGTAATAACAAGTGTGTGTGTGTAAGTAAG 
TTTATTGCATTTTGGGACTATTATGTGAACTCACCCGTAACAATGCGGAATGGCCTGGCTGGCCGGAAGTCCGCCGAGTTAATAGACTTGGACTG 
GCCACTGATCTCGCCCACAGACGTGCCGGTTTCAGACATAAAGACATGACCGAAACGTTCGCGTCCTTCACCCACGGCAACGATGCGCTGATTGT 
CTGGCGACCAGCTAATATCCTTGATGGGTCCTGCAATGGGCTGGAACTCATTTTTAAGCAGGTGCTCCTTGTTGACCGTGTCCCAAATGCGAATC 
TTGCCGGAGGCGTCTGGTGAGAAGAGGAGAAAGCCAAATAAATAACATTGCTCGCTTCAAAACCGCAGATAAGATAAAGAAACCGCTCCAAATAA 
GGAAATGCTAATCATGACCGTGGCGTGACTCATAGTGGTCACGATTTTTATGCTCTGTTGCCGAAAGCCGGGTGCTCCAAGTGTGAAATTGCTGG 
GCGAGACTTACCACCAGATGCAATATAAAATCCGCTGGGCGAGTACTTGGCCACGTTCACTGCACACGAGTGCTCCGTGTAGACATCGGCAATGG 
CTGGATTCTATGGTGCGGAAGAAAGGGAAAAGCATACATACATATTTTAGAGACACTTTTAGCAAATTAAAACAAATTAATGGTAAATGGGATCC 
TAATTATGAACATAGGTGTTTGGTTTTGAATTTGTTATGTATGTTTCCTTCATTAACATTTCATTTTCCAGAACCTGAATATCTTTTGGCGGAAG 
ATTTTATTTCTTGATCATTAAAGATACCATAGAAGTGGCGGGATTATCGGGACAAAAGACAAAGAAACATGGAAATATTTTGAATTTATACCGAT 
TCAATGGGTGAAAGATAGGCAAACACATAGCTGCTGTGAAGAATCACTTTGACCAGATGAAAACATCATAATGATGATGCGTGGGCGGCAAACAA 
GAAATGTATCTATGTGCCAGTCTGTGTTTTGCCGGTTGGCCTGTGGAATGGGAATCTGTGTTTTGCGCCTTATATACTGTACATATATATATAAA 
TAAATATAGAACAGCTGCCCGAAAGCCAGAAAGCACACACAAACTGGCAACAGAAACTGCAGCCTGTCCAAAAATAAAACACCATTTGGACGTAA 
ATGCCATGGATTTATGGCAGGAACGGAGCAGCAGGCTGCAGCATATATCAGCTTACCTCGATGTTCCGGATGATCACAGAGTTTCCATTGGTGTA 
CAGGAAGTTCTTGCCCTTGGGATCGGCACCCAAAACGATGGGTTGGCCGCGTTGGGTGCGCGGCAGAGTTGCGTAGATGTTTTCTGCAATTAAGC 
GATTATTTCGATTAGAAATGGGCTGTGACTGCTGCTACTCAGATAACGCCGGCAAATTGGAGTGGGTGGAAAATAGGGTGCGGCTTCGTTTTTGG 
GGAGCGAACGGGAACGGGGTCGCTCCAACGGCTCCACACGCGCCTCCTCTTACTGCTTAAAAAGGTCTTTCATATGCATCTATGCCCATTTATGG 
GCATAATCACGGTGCTTCCGGCGATTGCTTTCTGCTTTTACCGTTTTTCGCCTGTTGACTCATCCCCCTCTCTACGTATCACATTCGGAGTGGGA 
TCGGATTCATGACTCCAGTTTATCTCTGTGGGATTCCAACTCACTGTTTTCGTACGCCGGTGGTTGGGGTTGTGCCATTCTGGTGATTTCTCGGT 
GCTCCTGAAGCCTTTACTAATTAGTGCGAAATTCCGCGGATAAATGAAAACACCAAACTCAAGTTCGCCGACGATTAGTGGTGGGCTAAGATCGA 
TGTTTTCACCATCGTCGTCATCGATGGTTTTTTCTTTGCTTTATATCGACTTTGCGCTGCGTTTACCCCTTTTTAACAGCGAAGTGAACTGGAAG 
GAAATTAAAAATATATTGTTCTGTGTTTTATACTAACAGTAACTACTAATTGCTACCGTTTTTAATTATACACTAAAAAATTGTTTTTTTTTTTG 
GGATTGAGTTTTCAATTTCCTATATTTGAAAAGGAAATATATAATCAAAATTGTATTTTTATCTAATTTAATAAAATGTATATTTTTCAAAAACT 
ATCTAAATATCGATAAATTGCGCCAAATAAATTAATCGGTCTCTGTGGTTCCCCACCTCTACCATTGTTTATATTTTTCATTTTCCGCAAGAATG 
ATGCAAAGATTTCCGACGCAGGCGACCGTGGAGAATCCACGTTATGGACAGCTAAXCATAGGACCTCCTGGTTCTGGGAAAACCACCTATTGCGG 
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TGAGGCCCTTAAGTTCTATCGCGAACTGGGCCGTCAGGTGGGCGTTGTGAATCTGGATCCAGCCAACGAAAACATGTCCTACGAACCGGTGCTAA 
GTGTGATGGAGCTAATCACTGTGGAGGATTGCATGGAGCACCTGAAGCTGGGACCAAACGGAGCTCTAATGCACTGCGCTGAGTATCTGGCCGAC 
CATTTGGAGGATTGGCTGTTGCCGGCACTGCGCAAACTAAGCGCCACTTACAACTACTTCCTGTTCGACTGTCCCGGCCAAGTTGAGCTATATAC 
CCATCACAACGCTATGGCCCGGATTTTCGAGCGTTTGGAGCGTGAAAGATATAGCCTGGTCACCGTTAACCTTATAGATTCCCATTACTGCTCGG 
AGCCTGCCAAATTCATTGCCACTTTGCTGATGGCATTGAACACCATGTTGCGGATGAGTCTACCCCACGTTAATGTGCTTTCCAAGGCCGATCTG 
CTAAAGAAGCACGAGACCAAGT 
(SEQ ID NO: 1525) 

Exon: 4247.. 4130 
Exon: 3788.. 3667 
Exon: 3047.. 2957 
Exon: 2768.. 2505 
Exon: 2439.. 1385 
Exon: 1328.. 1178 
Exon: 1110.. 1001 

Start ATG: 4163 (Reverse strand: CAT) 
Transcript No. : CT30041 

CGAACTTGAGTTTGGTGTTTTCATTTATCCGCGGAATTTCGCACTAATTAGTAAAGGCTTCAGGAGCACCGAGAAATCACCAGAATGGCACAACC 
CCAACCACCGGCGTACGAAAACAAAAACATCTACGCAACTCTGCCGCGCACCCAACGCGGCCAACCCATCGTTTTGGGTGCCGATCCCAAGGGCA 
AGAACTTCCTGTACACCAATGGAAACTCTGTGATCATCCGGAACATCGAGAATCCAGCCATTGCCGATGTCTACACGGAGCACTCGTGTGCAGTG 
AACGTGGCCAAGTACTCGCCCAGCGGATTTTATATTGCATCTGGTGACGCCTCCGGCAAGATTCGCATTTGGGACACGGTCAACAAGGAGCACCT 
GCTTAAAAATGAGTTCCAGCCCATTGCAGGACCCATCAAGGATATTAGCTGGTCGCCAGACAATCAGCGCATCGTTGCCGTGGGTGAAGGACGCG 
AACGTTTCGGTCATGTCTTTATGTCTGAAACCGGCACGTCTGTGGGCGAGATCAGTGGCCAGTCCAAGTCTATTAACTCGGCGGACTTCCGGCCA 
GCC AGGC CAT T C C GC AT T GT T AC GGGC AGCGAAGAC AAT AC T A TC GC C GT GT T C G AGGG AC C CC C GT T C AAGT T C AAG AT G ACC AAGC AGG AT C A 
CTCTCGCTTCGTCCAGGCTGTGCGCTACTCGCCCGATGGCAAATTCTTCGCCTCCGCCGGCTTTGATGGCAAAGTCTTTCTGTACGATGGCACCA 
GCTCTGAACTGGTGGGCGAATTTGGATCTCCTGCCCACAAGGGCGGCGTTTACGCATTGGCCTGGAAACCAGACAGCACCCAACTGCTAACCTGC 
TCCGGCGACAAGACGTGCCGCCTTTGGACAGTGGAGTCGCGCGAGCTGGTTAGCGAGTTTGTGATGGGCACCACTGTGGACGACCAGCAGGTGTC 
CTGCTTGTGGCAAGGAGACAACCTAATCACCGTCTCGCTTTCGGGCGTGATCACCTACCTCAATGTGGCGGATCCCTCGAAACCGCTGCGTGTGG 
TGAAGGGCCACAATAAGCCCATCACCGTGCTGGGACTGAGCGATGACCGCAGCACCATATACACGGGTAGCCACGATGGCGTGGTGACCAACTGG 
AACTCGGGCAGCGGCACCAACGATCGCATCACGGGCACTGGACACGGAAACCAGATCAACGGCATTGCCGCCTGGGGTGACTTTGTCTATACCTG 
TGGCATCGACGACAGCCTACGCCAGTTCAGCGTGGAGGGCAACTCCTACACGGACTATGTGGTGAAGCTTAACTGCCAGCCACGTGGCTTGGCCA 
TTTTGCGCAACGAGAACATCATTGCACTTGCTTGCATCAAAGAGTTGACTCTCGTGCAGGACCAGAAGAAGATCTTTTCGCTGCCCATTAAGTAC 
GAGGCTAGCTCGATTGCCGTTAATGCGGACACTTCCGATGTTGCTGTCGGCGGGGATGACCAAAAGTTGCACATCTACACCCTGAAGGGCGGAGT 
ATTGGAGCCCAAAGTGGAGCTGGATCACTTGGGAGCTGTCACGGATGTTTCTTATTCGCCGGACTTGAAGTATTTGGTGGCTTGCGATGCCCATC 
GCAAGGTGGTGCTCTACTCAGTGGAGGAGTACAAGCCTGCACACAACAAGGAGTGGGGCTTCCACAGTGCTCGCGTAAATACGGTGGCCTGGTCG 
CCCAACTCGCTGCTGGTTGCTAGTGGATCATTGGACACTACTATCATTATCTGGTCCGTGGCCAATCCGGCCAAACATACTATCATCAAGAATGC 
CCATCCGCAGTCGCAAATCACTCGGTTGGTCTGGCTGGACAATAACACTGTGATCTCCACCGGACAGGACTGCAACACCAAGGTGTGGCACGTGG 
AAAACATTTAA 
(SEQ ID NO: 1526) 

Start ATG: 85 {Reverse strand: CAT) 

MAQPQPPAYENKNIYATLPRTQRGQPIVLGADPKGKNFLYTNGNSVIIRNIENPAIADVYTEHSCAVNVAKYSPSGFYIASGDASGKIRIWDTVN 

KEHLLKNEFQPIAGPIKDISWSPDNQRIVAVGEGRERFGHVFMSETGTSVGEISGQSKSINSADFRPARPFRIVTGSEDNTIAVFEGPPFKFKMT 

KQDHSRFVQAVRYSPDGKFFASAGFDGKVFLYDGTSSELVGEFGSPAHKGGVYALAWKPDSTQLLTCSGDKTCRLWTVESRELVSEFVMGTTVDD 

QQVSCLWQGDNLITVSLSGVITYLNVADPSKPLRVVKGHNKPITVLGLSDDRSTIYTGSHDGVVTNWNSGSGTNDRITGTGHGNQINGIAAWGDF 

VYTCGIDDSLRQFSVEGNSYTDYVVKLNCQPRGLAILRNENIIALACIKELTLVQDQKKIFSLPIKYEASSIAVNADTSDVAVGGDDQKLHIYTL 

KGGVLEPKVELDHLGAVTDVSYSPDLKYLVACDAHRKVVLYSVEEYKPAHNKEWGFHSARVNTVAWSPNSLLVASGSLDTTIIIWSVANPAKHTI 

IKNAHPQSQITRLVWLDNNTVISTGQDCNTKVWHVENI* 

(SEQ ID NO: 1527) 

Name: ACT IN- INTERACTING PROTEIN? 
Classification: actin__binding 

Celera Sequence No. : 142000013384599 

TCTAATTCGGCATACGCCAGTGATTTTTTAAATATAAACAATGGTTTCGTCTAGCTGAAACTAGCTTCCTTTCTGATTTTTAATTATACAAATTC 
C AAAATT C ATTGGAAAAC AAAAT GTTC CGAAACTC AAAAGAATGAAAAT T TTGG AAGCCC AT AT T AAAAAAAAAAACGAGACCTGTAATC ATTTC 
C AGT TC T GGT T C C CAT T C AAT T GAT G AC AC T T GT G C G AAAT C AG AGTT T AT GC G AAT AC AAAT AAT C GT T TAT T GC G AAAC AAAC C C G AAAAT AA 
ACGCGGATGATAATCTTTTGACTCACTGCACTGGTTCTATGCATGGGGCTTAGCCCCCAAAAAATGGCTAGATAGTACAAGTTACCCCACGACCA 
GTGGCGCGGTTTCCACTGCTTCCGCTCTTTGTAGCCCTTTTTCGGGCTCTTCTGCTGCGCTTCGTTCTTCGTTCGTGCTTCTTCTTCTCCGACTT 
CATCGCCGCCGTCTCCTTTGCCGCTCTCTTTGTAAGTGCAGCTCACTCCCACTCGCATATGTACATACAGAGCCCGGAATACTTGGAGTGCACTG 
CTCCCTCGCGCAATTTATGTTCTGTTCGAGTACATATGCCGCATACATATGTACGTGTGCGAGTGTAGGAGGTTGTCAGAGGTCAGTCTCATCGC 
CGCATGTATGTAGGTATGTACATGCATACAAACACCCGTTCCAATTGCCAATCCATTCCCCGTGGCGTTCTGCAGAAACTAGATTTCCCGCTCTC 
TCTCGGTCCCGTTCGCTGGTTCTTTGTCTCTCTCGCTCTCCCGCTGGCTAAAACTGGTTGAGTGAGTTTTTTCGGCTGGCCGCTTTTGTCGGCTT 
GGTTCGTTCGTTCCGTTCCCGTTCGTTTCCGTTCCTCGTTCCGCGTCTTGTAAAATTCTTCTTCGCCATGCTGGTTCATTCTCATTCAGTTCTTA 
ACACTCGCCGGATAAACAACAGTTTGGCGCTTGTTCGCGTTCACTTCCCGTCGCGTTCAGAGAGAATACACTACTACTGCTGCTGCTAGTGTGTG 
TGCGTGCGTGTGAACTGTGCGAAAAGAAGCAATATAAAGTATTAATACTAGTGGAATATTAAGTAAATCTAAGTGCAACAGAAACATACACATAC 
CTAGCCTATGTACGTATGCTACAAGATATACATATCTGCAGCAAATAAATGTTAACTCAGTATACAAGCGAACAAAAGCACCAGAAACAAATTGT 
TTAGTTTTGCTGCTCATTCTTCTTCTTCCACCGATGCCGTAGAGTTCATCAACTTGTTTAGAATCGCGTACGTGACTGTATGTATGCCTGTGTGA 
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GTGAGTGACAGTGTGTTGGTGCTGCCTTTTTCCTGCTGCGCTTCCTCTTGCTCGTGCATCTATCTCTTTCTGGCGGCGCATCACTTGCATTGTTG 
CTGTTGCTGTTTGTTGCAATGCAACAAAACATTTTATTCTTGCCTTTTCCCCCTCCTGGTCTTCGCCTCCTTTGGCGTTTGTTTATGAGCGTGTT 
TGTGTATGTGTGTTTGTTCTTTTCGACGTCCGCGCGCATTGTTGTTATTTGCTTGCATTAACTTCTACGTTTCTACGCTTTTGTTATTATTATCT 
CATTTCTTATTTAGCTCTGCGCCACAGTCTGCTCTCCACAGTCACGGCCAGCAGTTGGTGTAAAAGTGTAATGTTGATGCTCGCCAAAGACATAA 
ATAAAATAAGAGTGAAAAATGTAGAAAATCTAAGATATTTTTTCAGTTTGTGCAATACTACAAATTCGCTTGTACATATCCCCTCGACCTTAACT 
TTGCTTTCATATTCTCTTTCTGCGGAATACTATTCTGTATATTCCCCTGGTCAATAACCACTGTATTTGCAAGAGCAAACAATAATCTGATAATG 
ATGAAATGGTTTATCAACCTCTGAAGCCTAGAGGTGTGCACACTTCCTTCAACTCCAATGCGCGACGACCTTCCCCCGGCCAAAAAGTCCGAAAG 
TCGAGCCTATTTATCGCGACACAACTGTTCAAACAATTCCCCAACAACATTCGATTCGCGTCGGCAGCGACTGTGTTCATGGGTATTGTTGTGGA 
TATCGTTATTGTTATTGCATTTGTTGTTGCTGTTGCCGCTGCGATGCGGAAGAATAAAACACGAGTTTGTTTTCTTGTTTCTGTGTGTATTTTAA 
ACCAAAATTATTACGGCAATGTTCTTTAAACTTAATCACTGTAGACTGACAGACTGATCGACGTGCAAACCACGCTTGGGGCTTCTCATTAATAT 
ACGAACTGCCGGTTCGAGTGAGTTAGTGAGTGCCTGTTTTATTTACGCCGAAAGCAAAAACACAAGAGGCGAAAACAAAACCGAAACCAAAATCT 
AAAAACGTGTTTCAAATTTGTTCCTATCTGTGTTGTGTCTCTGGGTCCAGTATTTGGGGTTTGGCGTACAAGCATGTGGATATGGATACGAGCAG 
AACGGAAAGACGAAACATAACATATCGAATGCTATTTACTCCATGTGTCTTGTTTCACGCTCGATTTGCGTCGCCAGCAGAGCTAAGAAATAAAA 
AACTACGCTTACTGATTAAAAAAGCTGTCGCCGGGCTTTATATTTTGCGTCGAACTGATTGTGTGCAGTGATTACTCGGAAGCGGGAATTAGAAA 
GGACCCCGGCCAGATATTTATTTGATTAAAAAATGCAAAAGCATGCTGCTAGCAACAGAAAAGAAGATTGTTTAAATTAATCATAAAATTGCACT 
AGTTTTTGCACTAGTTTACACTTTATACTTTATCGTAAACACCAGCTCGGGCAACCTTTTTGAAAACTCGAGAATTTACTGTTTATAAATAGGTA 
CAACTGTACCCTAAACAAACAGAAGGGTTGAACAATTATAATTTTATGTTTTTTTCCTTGTTTTTCTAGCTCGTTACTGTATTGTTTGGAGCAGT 
C G AAAGT AAT C T TCT TGGG T T T AC C T AAAG AC T T AC AAGT CT GAT C T AAT T C AC TT T TAT T C G CAT T C GC C C AG T G AT C AC T GT AG AT T AC C AGC 
TGCTTTGAAGATTTTCTGATTTACAATTCAGATACAATTGACAACGATCCTTCTTTTATCAACAGCGATTTGGGCCATCTCCAATCGATTCGCCA 
TCGCAACCCGAACGCCTTGTGTAAATGCGTTTCCTGTGCTGTTTTGTTACTTTCTGAGATTTTCTAGAACTCGGTCGGGGTCGTTCTTCTTCCCT 
CTTCTGTCCCCGCCAATCTTTGTCGTACCAATTTCGTTGTGTTTTGCTCTGCTTTGTTTTGGGTTCATGTGCGAGTTTTCATTATCAGAGAGCGT 
TGCTTACCACCACGTAATAGATAACGCAGGCAACCCGAAGTTGGTACATCTCTAATTGAAAATGTTAAAGTAGATAGCAAATATTTATCGAAAGT 
T AAC AC G AAAAT GAGGC GT AC AGAG AAC ACGT T C C AAT T G AT T AGAT G AT GT T GCAC C AC AAGAC G AT GG CAT G T GC AT C T GCGT T T AAAC T T CC 
TTTATGAGGTGAAATATTTAGGGCCTATTTAAGCTTGTAGATACTTTTAGTTAATTTGATCTTTTGCACACTTGGAAATACAGTTTGCTCTCACA 
TCTAACCTTGGAAAAAAACAAAGGAAGTGAGTTTGCTCTCGACTATTTTGAAGTAAAGAGTGCGCGAATCTACATAGAAAGTCCTTATATGGTAT 
TTCGTTTTACTTGACCAAAATTCTGAATGGAATGGAATATTCCCAGCCGACACTGTACCTGTGCAAGGTTGCGTCGATATTTTCTACCACCACTG 
TGCAGGACACACAGACACTTGCCATCTCGCTCGCCCTCGCTCACAGTGCAGCAGGAGAGTGGGGGAAGAAGCGGCAACTAACGAACCACAGACAC 
TACAAAGCGCAGTAAGAAAGAAAAATGTTGTGCAGAATGAAAAAAAAAACAAGAACAGACAACAGAAAC^ 

GACGCCGTTGACCTGCGGGGCCTTTGATGCGAGTGGGAAAGAGAAGGAGATGGATGGTCGTGGGAGAACGGCCCGTGGAGAAGAGCGAGCGGAGA 
GTGTGAAAGAGGCAGAGGAGAGGAGCGTCTGGGATCTCGGGACCCTCTGCTAAATTGGATGCTGCTATTCTTCTTCTTCTGCTTTACCTCTTCCT 
TCATTCCCTCGCCACCCATGCATCGCGCGACTACAATAAAACTCTCTTGTGCAGATTTTGTTGTTCTTGGATTTTCGCCCATTGTCTCCTTCAAT 
AAAATGCGTCAAGTCAATTTCATTCACCAGCAGCTGCTGCTTTGGGGGTCGCTTCGCTGCTGACGCATCGCTGGCGTCGTCTGATAAAAAAAAAT 
TCAACAAAAACCCAAAAACAAAAAAATACCAGAGAAGCAATCGTCGCAGAATTCGACGAAATTCACAAGGGAAAACAGTAAAATAGTTAGGCAAG 
AACAGGAGATACATTCCAGTACCCCTTCCCTGTCACTTGTCTTGAGTCAACTTGTCTTACCTAGCTCACGAAACGCAAAACAATATAAAGACTGG 
AGATATGAAATTACATCCGTTCAATAATAAAAGTTTACATATGCTAAGTTATAAGATGAAGACGAGGAGCAACGACATGATAAAGGATTAAAAGA 
TTCTAATACCAACCAATGGTGGTATTAATCGGATTAGACTTTTTGAAATACTTTGTTCTTTCCAATCCTAACCGAATTTTCGTTCACAATGATTA 
ACATCAAAAGAAGAAGATTCATGATAATATTGATATCCAGATCCTATCTTAAAAACATTTGCTCTTTGCCAATATCTACTGTAGCGTAATTTGCA 
TCAAACCAAGGCAGCTGCGGTCGTGATGCTGGCGGTGCCTTTGCGATGAAATCGAGGATGCGAATGGAATGGGATGGTCGCTGTAGGATGGGATG 
AAGATTGGAATTGGGATCGGGACGGATATTTTATCTGCCGAGCGACGGCATTTTTGCTGCTGTCTCGCTTAGAAGAGTGTGTGTATGTATTTGTG 
GATTTGTGTGCCTGTCAAGCAGGTAAAGGTAGTTTCTTTGTTCTGTTTTATGGCTTGCTCTTTGTGCTGGGCTTCTTTTACTTTTCGCTGCATTT 
GAGTACGTCTGTATCCTCGACATAAAGTTGCGGTCAGAATATTGGGACTAAATCACTGAATGATGTTTAAGTGGTTGCAAGAGCACCATATACAG 
CTGAATGTTGCAAAGTAATTTATATATTATTTTAAAACGAGATTCGCTCTCATTAGCTTTTAATATTAAACATGTATAGAGACCATTCAACTGTT 
CGATAAATCGTAAACAAGTGTAAATCTATTTTTCTATTAAAACTTGGGACTATTTTGCATTTTGACATACGCACCGCACAATATTATGCCCACCA 
CTGTTGGCAGTGGGTGTGTGTGTCATGGTAATTGCCTTTGTTTTTGCTGCCGCTATTGCTGCTTTTGGTACCATTGAACTCTGCTGCAGTTGCAA 
CCCCAAAAACATTCCGCAGGGGAATGCATGGAAGTTGAACACAGCCTTGTTTATTTTGCTGCACAACCAGAACGAACGCCCTGCCTCCTCTAGCC 
CCATCTCTTTGTAGTTGTTTTTGTGGGTTTATGCATTTTTGCGGAATTACAGTGAAGCCTCATAAAATCGTCTAAAATTCTCATAAGTCTTTTCA 
AAGATGTAAAGCACTGTACAGTAGGGTGCACCATCGATGACTGGTTCTAAGTTCACGATAGGAGCGTTCCACTGTTGTTTCACGTACAGTCAGCA 
CCCCCTATTTTCTCCCACACACCATCTTGATTTGGCTTGAACTTGGTGCTGCGCGCTCTTGTTCCCGATCATTATTTTTTCCATAATCATCGTCA 
TCAGCAGCACATCGCAATGCCTCGCTTTGTTTTCTCTTCAGTTCTTCTGAGCCATGAAATTTATAATCTTGTACAAATTGTTTTGCATTAGTAAT 
GCCTGTGCGCAGTTTGTTCCACTTCTCACATGGGGTAACCACGTCGGTTCTTTAGCTCCGCTTCGCTTTGCTCCCTTTCTTGTATTCTTATTCGT 
TCTTCCGCCGTCCCTCGGTTCTTCTTCTTTTCTTCTCGTGCTTAATTATGGTGGGAGGAGGGCGGCAGGGTGTTCGGGCACGCCCTCAGTGCACT 
GCACTCGCATTTCTGATTATTAGCCCGCTCTATTTTTAAACACTTTTACAAGCGGTCTGGAATTTGTCCGTACACCCCCTGCTACACTTCCATCC 
ATCCACCAAAAAAAAAAAATACTGCTAAGTAAAACATATGGCACAGACATTGAGCTCCACTCCCTCTTCTCGAAAAGTGGAGGCGATTCCATTCT 
TCTGATGGACTCGTGGCCCCAAATGATATTAATCTTGTCATGATATACAATACCAACTGTCAGTTTGATTTGGCCCGAACCGTTTTACCACTCTT 
CCGTTTGTTCTCCGTCAACTAGTCAACTATCTCCCTiAAATTTGCTAATCTAATCGATAGACAACTCGTTCATTATCAATTCCCCTCGTTCCCCCC 
GCCAAGAAACGCTAAATTATTGTTTTTGATGCTCTAATTAGGGGTGTTGCACTTTTGACATGTTGGAAGTGTGTATTTTAGACGCCTAACTAAAT 
ATAGTATAGCAGGTTTACTTACATAAAGATTATAATAACTTTATTAATCAATTATACTATAAATAAAATAATAGGTTAATATAATAGAATAAAGT 
CAGAGTAAAGTGAATTTTTGTGATTCAAGGGTATTTCAGAAGTTCATGAAGAAAAAAACCCATGTTTATGTTTGGTTTCCACTTTGAACGGTGAC 
GTTTTGGTGCTATTCTTGTATTTATTAACTAATTTGTTGAAATTACCGGTGGTAGATTTATGAGTCAGGTTTTTTTTTTATTCCTACGCTCATGT 
ATGTGCGTTTCAGGTGTGATCGATACCTTTTGAAGATCACCGCTTGTTTTTTTGTTGCCCTCCCTCATTTTCCGCGATTCGCAACAGCTCGTAAT 
GTTGCAGAAGCAACATTAAAATGCCGCTATTTTACACGTGTAAATTTGTATTTCTTTTGAATGTGTGTGTGCGAATTTATGTGCCTGTCGTCTGG 
CTATTTTCCCCTCATCTCTTCCGCTCTTCGCCCTCTTGGGTCTTTTGTCTCTTTTCCGCTGCCGCCTTTGTCGGTATACAGAAATTTTCGCTATG 
CAGTTATGTTGATGAGATGTGAATGCGATGATGCTGCTGCAACGTCATTTCTAGCATACTTGTCTCCTCTCTTCCTCTCGAAAGTTCTGCGGCAA 
CATAGAACAGAGAAATGCTGAGTGAAAGGAATAGGCAAAGCAGAGGGAGAGAGGCTGGTGGCCGAGAGAACGTTTGGTTGTTATTGCTTTTGGAT 
GCGGTCGTTGCCCAAATTGGAAAAAAACATGCGCACAGAGTACACTGAAAAGAACCAGTTTTGTAATATAAATAAAAGTCGATATTATTTGGCAT 
AGAG AAT AT T T T GAT T T T TG AT T AGAG T ATGGG AAAAAAT T T GCAT T AAG AT AT AC T CT C C T AT T T T AA T GT AT C T T T T AT C AAC AC GGG AT G AA 
ATTTCTTTTCGCAAAGTCTGCTACGTTTTTGCTGTGTGTATTTATCGTTTATCGCTTCCTGCTTTTGTTGCGCTTTTCTCTTGACACTCGCTCTT 
CGCTGCTCTCTCCACGCTCTCTTTCGTTCTCTCACGCAACCTGTAGCGCGCGCAAAAAAAAGTGTATTTTGGCGTGGAAAATTGTTGCTCTTTCT 
GGGGGGAAGTTTCCTCTTTTTATTTATTTTGTTGTGGGTGACTCGCCGCCATATTGTTTTAGGGGTTGCTAAAACATAAGCGCAATGGGGGTGGT 
GATGTGTGCAGTTGTATTGTTGCTTTTGCAGCCGGCATTCAACAGTTCGGCAAAAAGAGCGCTCATTTTGGGAATCGTCTCGCGTTGGTCGCTCT 
CTTAGTAAACGTAAATAGGGTGTATTGATTTTTCCCGGGTTTTTCGCCACTTTGTTTTCCTGTCTTTCTGGAAGTTGCTCGGTTAACAGGCAGCC 
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AAAAAATTCTCAGACTGTCTGTGGAGTGTGACATTGCATTGCATGGCTGCAGTTGCGGAGTAGTTGTTGTTGCTGGGAGCTGGAAAATCCCCAGG 
ACATTTGCAGCTCTCTGTCGCTCCGCTGTCGCGGCCACTGCCGGCGTCGCTGCTTTTGCGGGGACCTGAGCTTTACCCGGGTCGCTGCATTATGC 
GCAGAGTTAGTTTTGCAGGGAGCCGCGTAGCATTCACGTCACGCAATTTGTACATAACTAGCGAAGATTTTAACATTACATTAGTTTGCAGTGTC 
CTTGCCGACGCACGACGTCACTGTAATGCATAAAATAAACCAAAAACAAGAGCTCTCAATTAACCTTGCATTCAATTATCCTTCCTAATAACCAT 
TTTTCTCTCTCTTCCCATTGCAGCCTTTTGTGCGCGCGACGCAATTATCTGACAGCACTTGAAAGTATGCTACACAAATTTGTTTCAACTTGTTA 
CTGCAACAATAACAACAACCAAAACAACGGCAACCATCCCACGCAATTTATCATATAGACTGCGTAGCAAGCAAGAGCGAGACAGCAACACGGCG 
CAACCAGCAGCGAGAAACAACAACAACAAGCAGAAATCAGGAAACCAGGAAACGAACTGGAATATATAGAAAATTCTACAACAAACTACAACAAC 
AAAGCAGTAAAGGTAAGTCGGCTTATCTCCGTTTAATGAGTTTTACGTTTAGTTTTCGTCGTTGCACAGTTTTCCCTTTGTTGGAAATTTACCGG 
TAGATATATTTTACATACATACATTCATATATGCAGCTCTTTATCAGGGATTTCTCGCGCCCATATTTGGTATAGCTATTCAGATTAGGCGGCAA 
CTAAACTGTAAATATAATTTAGCTTTATAATTTGAAGTCCCTAATTATTTGCACAGGAACTGGAAGTTGGAGATAAATGTTTTTATTCTTGTAAA 
TTAAATAATTATTTACCTTTATTTACCGTTATTTACCTTTAATGGCCTTCCGTATTCTTCTTTTTATTTGTAACAAACTTACTATTCCAATGTTT 
CTAAGAATGGATTAGCATACCTGTACCTACGCCTAGAGGGCTATTTTGGGCAACCCAATTAACCTCAAAGTTAAACAAAGTAATATCCTTCACGA 
CGAAAAAGTGAATATTTGTGGAGCTAGAAAGGACTTTCTAGGGAGCATTGGGAAAAGGCATTAGGTCTGGGTGTATGTGCATGATAAATGTACAT 
CCAGTCTAGACAGTTGCAGTCACCCTTCCCATCCCTCTGACATATACAACCTTCCCCCCTTGGGCGCATTTTTCGGGGTTGATTTTCAGCTGCCT 
CTCTTTGCTTTCACTTGGATTGCCTTTATGTTGGCTTTCTTGTTGACGAAAGTTTCGCCTGGTCTGGAAATCTAGAGAAGTTTCTCCGGGGCTCC 
ACAGTTTTCCTTATTTTGTTTTTTTTTTTGTTTACTTTTTTGCGACGGGGGCTTGTTGTGAAACTCCCCGTTCGTTTTCCCTGCTTTTACTGCTC 
GCATTCTACCCAACTTCAGATTCGCCGTCCTTCTCGAAGGCAACGAACCGCGAAGGCAGCGAAAAGGGGAAGTAAAATTATGATTACGGGGTGTC 
CTGTCCTGTCCTGTCCTGTCCGTCTGTCTGTCTGTGCGTGCGTTTATTGCATTTGTGTCCGTCTCCGCGCAAAGGATAAAGGTTAGGGGGTAGTA 
GATGTAGACGGGCAGCGATTTCCTCCCCGGTCAAAGTCCTTTTATCCAGGCGTACAGCTTCACTTTTAGCCCCGCTTCTTCACTCTGCCGTAAGA 
AATTGGAATTGTGTTTAAAAGCATTCGGTGGGGGCGCGAATCCTAGTTTGATTCGATTCAGTCTGTTGGGCTGGGGAAATAGTAATACCAAGGGG 
AATACACAAGCCACATACCTTTCAACTGTCGTCTTCGAAAATGTTTAATAAAACACGGTTCACAGCTGTCGCGCTTTGGGGGATTAGATGGAGCA 
TGGAGCAGGATGAAAGCAGGATTGGAAGGAGGCAATTAACTACAAATTATGATAACAAAGGGGATGACTAAGGAAAAGGGGCCGTTGTTTGTGGG 
GTGTGGCACATTTGTTTGTTTGGAGGGGCCATGGAAATGCCACAACGTTTTGGTTCAAGTGGAGGAAATGCTGACTCACAAGGATAAAAGACAAG 
GACCGCAGGTGACTCATGGGTCAGTCGATTTGAGGGGCGAAAGTGGCGGAGAGTAGGCGAAAAAAAAACACGTGGTTGGCACAGGAAGCATTGAA 
ACCATTGCTAAGATTATTCACCTTCGTTGTGGGGAACTTAAATCGATTAGATTATAAGTCCGTATTTTCCAATAAATGTATTCATTGTCTACATT 
GTTTAATTAATCCCTCCACTTGTTCTTTCTCCTTGCAGCTGTTACTCCTAAAGGTAAACGGAGCCACCGACGGCACTTACAACATAAAGATGGCG 
T CCT C AC C G AC AC C ATC T C T GG AC T C GAT GC GG GGC GGGGC G AAC T C G AT T G AAT C GT AC G AAC AC GC C GGCT AT C T AT C AG ACT C TCCGCTCAC 
GTTGAGCGGTTCATCGCCACCGGCAAGCGATTCGGCAATCTGCAGTGATGAGTACACGGGCGGCAGCAGCGTTAAATCACGCTCTGAGGTGACTG 
TAATCAATGGCCATCATCCCATTTCGGCCTCCGTATCGTCCAGCTCTTCGGCCAGCTCTTCATCCTGCTCGTCGTCATCGTCCTCATCCTCGTCT 
TCGTCATCGTCCAGCTCCTCGACCAGCGGCCTCAGCGGTTGCGGCAGTACCAGCAGCAGTGTGATCAGCGCCAACAACGTTGCGAGCAGCAATGG 
ACCAGGCGTTATCGGCAGCAATCTTCAGTCCTCAAACAATGGCGGCAACTCGGGCATCAGCAGTCTGGTCGTGGGTGCGGGCAAGGGAAGCAACA 
GCAGCAGCAACAGCTCCTCGAGCAACACATCCGCCAATGGATCTCCGCCACGTTGCACCGCCTGCAAGAGCAAGTGCAGCGATGCGGTGGCCAAG 
TGCTTCGAATGCCAGAGCTACCTGTGCGCCAATTGTGTAACTGCTCATGAGTTCATGCACTGCTTCAACGGACACAATGTGTGCCTGATTAAGGG 
TTTTGAGGCGAGCACCACTGGCACTCCGCTGTCGGTTGGATCGCCTAGCAACAATCCGGCGTCGAATGAGTTCAAGTACGCCAGTTCGCTGACCA 
TGATGCTGCAGCAGCAGCAACAGCTCGACTCGCAACAACAGCAACAGCAGCAACAATTGCTGCCGGCGCAGCCTATGTCGCAACTCTCGAAGATT 
GTACTGGCAGCCGCCGCGCAGGCAAATTCCCAGGAGCAACAGCGAGAGGATAGCATCTATGGTTCACTGCACCCACAACAACAACAACAGCAGCA 
GCAACAGCAGCAGCGACAACTCTTCTGCCCCCGACACAAACAGGAGCTGCTCAAGTTCAGTTGCCGCACCTGCTGCATCCTGGTGTGCAAGGAGT 
GCATTGTACTGGAGCACTCCACAGGTCTCCACGAACTGGAGAACGTGCAGTCCCCCGGAATGACAACCTCTACGGGATCCACAGCCAACGAGTCA 
GCTTTGCAGACTCTGCTCGCCGATATGCGTGGCAAAATTGGCGAGATTGTCGGCATTGCTGGAAACTCGGACCAGAATCTCACCAAGGTGAAGCT 
GCAGTACCAAAAGGCTCACAACGAGCTGAACGAGACGCACCAGTTCTTCGCTTCCATGCTGGACGAGCGCAAGACCGAATTGCTCAAGGAACTGG 
AGACGCTGTACACGGCCAAGGTTAACAGCAACAATTCGTGGCAGCAACGCTCCCGCGACTTGATCGACAAGGGACTGGCCACCTGCGAGGCGGTC 
GAGCGGAGTCCTGCTCCGCCATCATCCTTGCTGACAGAGGCCCTTCTCCTGCGCAAATCCCTGGAGCAACAACTACAGACGGGCATTCAGGAGAT 
GCAGCTACCATTCGAGATCGAATTTATGTCCAACTATCAGTCGATTCAGGCCGGCGTGCGCAACACCTTTGGTTACATTCGGGCAAACAGCTCAG 
ATGGTGGACCTACTGGCATGAGCCTGACGAGCAATGGCCACGGAAAGCAGCCGCCAATTGCCCGACCAACACAGAGCGCCAGCAACAGCAGTGCG 
AGCAGCGCGGGCAGTGGTCACCATGGCCACCACCAACAGTCACACCATCATGGCCACCACAATCATCACCAGACGGCCCATCATCAACAGCTGCA 
GGCGCAATCGTCGCTGCATGGCTTGGGTCTTGGATTGAGTGGCGCCAGTTTGTTGGATAGCAGCTCTAGCGCAGGCGGAGCAGTGGGAGCCTTCA 
GCAACGGAGGACTTCTGCTGGGCGGTCGCGATCGGAACGCTCTGGCAGTGGAACAGCATTTCGGTGAGCTGATGCCCAAGCGAGGAGGAGGTGGC 
TACACGGGCAGCAATGGATCGGCCACCAGTGCCGTGGCCCACTACAATCCCTACGAGAAGTGGAGCAACGGCGGTAGCGACAATCTGTTTTCATC 
GGTGACAAGCGGCGTGTCCGGTAGCTCTGCAGTGGCCGATGCATTTGCCAGCCTGTCTGCTGTTGGAGGATCCGTGGTTAGTGGCGCTGGAGCTG 
GCGGAAGTACTGTGAGCTCAGAGTCGTTGCTGGACCTCACCAACAAGCTGCTTTCGGCCACTATCTATCCGCCCAAGTCGCAGATCAAGCGACAG 
AAGATGATTTACCACTGCAAGTTTGGCGAATTCGGCGTGATGGAGGGCCAGTTCACGGAGCCCAGCGGTGTGGCGGTAAATGCCCAGAATGACAT 
CATTGTTGCGGATACGAACAACCATCGCATTCAGATCTTCGACAAGGAGGGACGCTTCAAGTTCCAGTTTGGCGAGTGCGGCAAGCGTGACTCGC 
AGCTGCTCTATCCGAACCGCGTGGCCGTGGTGCGCAATTCCGGCGATATTATTGTCACCGAGCGCTCGCCCACACACCAGATACAGATCTACAAT 
CAGTACGGCCAGTTTGTGAGGAAATTCGGGGCCACCATTCTGCAGCATCCTCGCGGCGTGACCGTGGACAACAAGGGACGGATCATTGTGGTGGA 
ATGCAAGGTGATGCGTGTGATCATCTTTGATCAGAACGGCAATGTGCTGCACAAGTTCGGGTGCTCTAAGCACCTCGAGTTCCCCAACGGCGTGG 
TGGTCAACGACAAGCAGGAGATCTTCATCAGCGACAATCGGGCGCACTGTGTCAAGGTGTTCAACTACGAGGGCCAGTATCTGCGGCAGATTGGC 
GGCGAGGGTATTACCAACTACCCAATTGGCGTCGGGATCAACTCGAACGGGGAGATCCTCATCGCGGACAACCACAACAACTTCAACCTGACGAT 
CTTCACGCAGGACGGGCAGCTGATCTCGGCGCTCGAGTCGAAGGTGAAGCACGCGCAGTGCTTCGATGTGGCGCTCATGGACGACGGCAGTGTGG 
TGCTGGCCAGCAAGGACTATCGGCTCTACATATATCGCTATGTCCAACTGGCGCCAGTGGGTATGTAAACACACAGACACACACACTCCATGGAT 
GTGAGTCCGGAAAGGAGGAGGATCTGTGGTCCGGCACTCACTACTTCTATAGGCTAAGTTCTCAATAGTCTGACTTGCTCGGTCTCTCCAGCTCT 
AATTCTAAATCTACACAATCGAAGTTGAAATCGAAATCTATAAGATCCCTACTATCTCCCCCGATCTGTCATCTCTCATAGCCTTAGACTTTGCT 
CGCTATGTAATAACACACACGCACACTGTTGGAATGATTTTTATTATCTTTTAAATTATTGTTGAATTTTGTTTTATTTTTTTTATATATATTTA 
CACTGTTAAGCTTAGTTTTTATTTGTGCGTACTTGTTTATGATAAATATATATTATGTGTATTCATGTTCATGTTCAAATCGGATAGTGCCGCCG 
CCTCTCTATATCTTTGCACACTTCTCACTTCAAACCCTCGTATCTCTATCTCGCGTGCAAGTCGTATTTTCAACAAACACAAGCAAACAACCGAA 
ACAAACAAAAAAAATGACACTTTTTAACTATATATAGTAGATACCAGAAGAAATCCCTCCACTTTTTTTATGTTCGTTGCCGACTGCAAGGAAAA 
AAAAGG AAAAAAAT T AT G AAAAAAAGAAGTTGGT ATAAAATAAACT TAAC AAAT AACCGCC AACAAAATGGAGC C GCGTATCGAGCATT AATT AA 
TTAATAATTGATTGTAATTTTTAGATATATGATTGCGCAAATTGTTTGTTATTATTATTTTTGATTGCATAAACATTTGATACATTATATATTAT 
ATAGCTGTTTTTTACGTTTATAAATTATTTTTGATTATACCACTTATTGATTACTGATTATTTATATTTATATACATTAATTAATTATCCTCTAA 
GACATGTTTGCTACTTTTATACGAAATCAACCGAACCCAAAACCGAACCGCGACACAAACAACAATTCACAATTTTTATACTCGCTTAGAATTTA 
TGTTTTCCATTGTTTTATTGTAACCACGACAGCGGAAAGAAAAGATGGAATTACCTACCTACACCGTAATAATAATGCAGACAAAAACCCACTGG 
CGATTCTTTAATTTAATTTAAACAAAAACAACTGAACGCACTGAGAATAGTTGAAATTTTTGTTCTTGATTGTTGAACTATTTTTTGTTTAGGCA 
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TTTATATTTTAAAATAATTTCTTCGTTCGCAAGGAAGACGGACGAATTTGTTGTTTTGCCCCAAAATATATTACTATATTATATGCGAAAATTAT 
TTGTTGTCAAATTGAATTTAAATAAACATGGAAAAATGTGTATATCAAACGAAAACAAGGCGATATTTATGTGCACCAAGTTAGTTAGTGGTAAC 
TTCGAGTGCGAATTCAAAGTATCTTGAGGATGTGTGTGCATAGATCTAATTGAGTATTTAAAAGTAACATTTAACTAAAAGTTGGCAATCAGCGC 
AGAAGACTTGAAACATTTATAACTATTATAATTATACAATTAAAGTATGTCTATTAATATACGATAAACACTTAAATCAGTAAATCAAAAATCTC 
AAAT AAT T G AAT AGT T AT AAAGAG AAACC GC G AAT AGGG AAAC AAC AT C C C AGC C AAG C C AGAC C AAT C GC C C AC T T AT GTT TACTTCCTT G AC A 
G AAAT AGG G AC T AAAT AC GAAAGC C AC AC GGT GG GC AAC AAT C AG GAT AT G GT ACT C G AAG GAATC GT C AC AT AT C C T C T TG AT CTGTTGCG C AC 
CGTGTGCACATGGACTTCTAATTGCAATGATTTAGTTAAAGCATTTACTTGGATAGGCCTCAAATGATGATTGCAGAGCTAACAAAAGTGTTCTT 
TAAGTAAGGGTTAGTTAAAAGCAGTTAGGACGTTAGATATTTAAAGTGAACATACTATTAAGGCAAGGAAACACAACTTTTTGAGACTACAAACG 
CAACATTAACAGGAACAGCATTTAAAAAATGTTTATTTTGTAATTTTAAAAGTAATCACTATGTATTTTTGTTGGTTTTGCTTGGAACGCGACGC 
GAACCCCGCCCCCTTGACCCCATTGACACGCCCACACGTAAGCAGGTATTAGATTTCTTGAATATAATTTAACAATATTGTATTAAACACGTAGT 
ATTTAACTAATATAGTTTCCCCCCTTCCTCATCCTCTTTATATTCCCTTGTTTGTTTTGAAAAAACGCCTTGCTCTGTGTAATTTCTGATTTGCT 
T G AC C T AT T ATT T T AAT TAT T T GAT AAT TACCTATGTTTTGTGTGTATCTACT T AAGT G AGAT T AT T AAAT AT AC G AAAAT AT AT TTTTTAGTTC 
TTTCTT 

(SEQ ID NO: 1528) 

Exon: 1001.. 1388 

Exon: 8472.. 8752 

Exon: 10540.. 14966 

Start ATG: 10540 

Transcript No. : CT30047 

TCGCGTTCAGAGAGAATACACTACTACTGCTGCTGCTAGTGTGTGTGCGTGCGTGTGAACTGTGCGAAAAGAAGCAATATAAAGTATTAATACTA 
GTGGAATATTAAGTAAATCTAAGTGCAACAGAAACATACACATACCTAGCCTATGTACGTATGCTACAAGATATACATATCTGCAGCAAATAAAT 
GTTAACTCAGTATACAAGCGT^ACAAAAGCACCAGAAACAAATTGTTTAGTTTTGCTGCTCATTCTTCTTCTTCCACCGATGCCGTAGAGTTCATC 
AACTTGTTTAGAATCGCGTACGTGACTGTATGTATGCCTGTGTGAGTGAGTGACAGTGTGTTGGTGCTGCCTTTTTCCTGCTGCGCTTCCTCTTG 
CTCGTGCAATTGCAGCCTTTTGTGCGCGCGACGCAATTATCTGACAGCACTTGAAAGTATGCTACACAAATTTGTTTCAACTTGTTACTGCAACA 
ATAACAACAACCAAAACAACGGCAACCATCCCACGCAATTTATCATATAGACTGCGTAGCAAGCAAGAGCGAGACAGCAACACGGCGCAACCAGC 
AGCGAGAAACAACAACAACAAGCAGAAATCAGGAAACCAGGAAACGAACTGGAATATATAGAAAATTCTACAACAAACTACAACAACAAAGCAGT 
AAAGATGGCGTCCTCACCGACACCATCTCTGGACTCGATGCGGGGCGGGGCGAACTCGATTGAATCGTACGAACACGCCGGCTATCTATCAGACT 
CTCCGCTCACGTTGAGCGGTTCATCGCCACCGGCAAGCGATTCGGCAATCTGCAGTGATGAGTACACGGGCGGCAGCAGCGTTAAATCACGCTCT 
GAGGTGACTGTAATCAATGGCCATCATCCCATTTCGGCCTCCGTATCGTCCAGCTCTTCGGCCAGCTCTTCATCCTGCTCGTCGTCATCGTCCTC 
ATCCTCGTCTTCGTCATCGTCCAGCTCCTCGACCAGCGGCCTCAGCGGTTGCGGCAGTACCAGCAGCAGTGTGATCAGCGCCAACAACGTTGCGA 
GCAGCAATGGACCAGGCGTTATCGGCAGCAATCTTCAGTCCTCAAACAATGGCGGCAACTCGGGCATCAGCAGTCTGGTCGTGGGTGCGGGCAAG 
GG AAGC AAC AGC AGC AGC AAC AGC T C C T C G AGC AAC AC AT C C GC C AAT GGAT CTCCGCCACGTT GC AC C G C C T GCAAG AGC AAG T GC AGC GAT G C 
GGTGGCCAAGTGCTTCGAATGCCAGAGCTACCTGTGCGCCAATTGTGTAACTGCTCATGAGTTCATGCACTGCTTCAACGGACACAATGTGTGCC 
TGATTAAGGGTTTTGAGGCGAGCACCACTGGCACTCCGCTGTCGGTTGGATCGCCTAGCAACAATCCGGCGTCGAATGAGTTCAAGTACGCCAGT 
TCGCTGACCATGATGCTGCAGCAGCAGCAACAGCTCGACTCGCAACAACAGCAACAGCAGCAACAATTGCTGCCGGCGCAGCCTATGTCGCAACT 
CTCGAAGATTGTACTGGCAGCCGCCGCGCAGGCAAATTCCCAGGAGCAACAGCGAGAGGATAGCATCTATGGTTCACTGCACCCACAACAACAAC 
AACAGCAGCAGCAACAGCAGCAGCGACAACTCTTCTGCCCCCGACACAAACAGGAGCTGCTCAAGTTCAGTTGCCGCACCTGCTGCATCCTGGTG 
TGCAAGGAGTGCATTGTACTGGAGCACTCCACAGGTCTCCACGAACTGGAGAACGTGCAGTCCCCCGGAATGACAACCTCTACGGGATCCACAGC 
CAACGAGTCAGCTTTGCAGACTCTGCTCGCCGATATGCGTGGCAAAATTGGCGAGATTGTCGGCATTGCTGGAAACTCGGACCAGAATCTCACCA 
AGGTGAAGCTGCAGTACCAAAAGGCTCACAACGAGCTGAACGAGACGCACCAGTTCTTCGCTTCCATGCTGGACGAGCGCAAGACCGAATTGCTC 
AAGGAACTGGAGACGCTGTACACGGCCAAGGTTAACAGCAACAATTCGTGGCAGCAACGCTCCCGCGACTTGATCGACAAGGGACTGGCCACCTG 
CGAGGCGGTCGAGCGGAGTCCTGCTCCGCCATCATCCTTGCTGACAGAGGCCCTTCTCCTGCGCAAATCCCTGGAGCAACAACTACAGACGGGCA 
TTCAGGAGATGCAGCTACCATTCGAGATCGAATTTATGTCCAACTATCAGTCGATTCAGGCCGGCGTGCGCAACACCTTTGGTTACATTCGGGCA 
AACAGCTCAGATGGTGGACCTACTGGCATGAGCCTGACGAGCAATGGCCACGGAAAGCAGCCGCCAATTGCCCGACCAACACAGAGCGCCAGCAA 
CAGCAGTGCGAGCAGCGCGGGCAGTGGTCACCATGGCCACCACCAACAGTCACACCATCATGGCCACCACAATCATCACCAGACGGCCCATCATC 
AACAGCTGCAGGCGCAATCGTCGCTGCATGGCTTGGGTCTTGGATTGAGTGGCGCCAGTTTGTTGGATAGCAGCTCTAGCGCAGGCGGAGCAGTG 
GGAGCCTTCAGCAACGGAGGACTTCTGCTGGGCGGTCGCGATCGGAACGCTCTGGCAGTGGAACAGCATTTCGGTGAGCTGATGCCCAAGCGAGG 
AGGAGGTGGCTACACGGGCAGCAATGGATCGGCCACCAGTGCCGTGGCCCACTACAATCCCTACGAGAAGTGGAGCAACGGCGGTAGCGACAATC 
TGTTTTCATCGGTGACAAGCGGCGTGTCCGGTAGCTCTGCAGTGGCCGATGCATTTGCCAGCCTGTCTGCTGTTGGAGGATCCGTGGTTAGTGGC 
GCTGGAGCTGGCGGAAGTACTGTGAGCTCAGAGTCGTTGCTGGACCTCACCAACAAGCTGCTTTCGGCCACTATCTATCCGCCCAAGTCGCAGAT 
CAAGCGACAGAAGATGATTTACCACTGCAAGTTTGGCGAATTCGGCGTGATGGAGGGCCAGTTCACGGAGCCCAGCGGTGTGGCGGTAAATGCCC 
AGAATGACATCATTGTTGCGGATACGAACAACCATCGCATTCAGATCTTCGACAAGGAGGGACGCTTCAAGTTCCAGTTTGGCGAGTGCGGCAAG 
CGTGACTCGCAGCTGCTCTATCCGAACCGCGTGGCCGTGGTGCGCAATTCCGGCGATATTATTGTCACCGAGCGCTCGCCCACACACCAGATACA 
GATCTACAATCAGTACGGCCAGTTTGTGAGGAAATTCGGGGCCACCATTCTGCAGCATCCTCGCGGCGTGACCGTGGACAACAAGGGACGGATCA 
TTGTGGTGGAATGCAAGGTGATGCGTGTGATCATCTTTGATCAGAACGGCAATGTGCTGCACAAGTTCGGGTGCTCTAAGCACCTCGAGTTCCCC 
AACGGCGTGGTGGTCAACGACAAGCAGGAGATCTTCATCAGCGACAATCGGGCGCACTGTGTCAAGGTGTTCAACTACGAGGGCCAGTATCTGCG 
GCAGATTGGCGGCGAGGGTATTACCAACTACCCAATTGGCGTCGGGATCAACTCGAACGGGGAGATCCTCATCGCGGACAACCACAACAACTTCA 
ACCTGACGATCTTCACGCAGGACGGGCAGCTGATCTCGGCGCTCGAGTCGAAGGTGAAGCACGCGCAGTGCTTCGATGTGGCGCTCATGGACGAC 
GGCAGTGTGGTGCTGGCCAGCAAGGACTATCGGCTCTACATATATCGCTATGTCCAACTGGCGCCAGTGGGTATGTAAACACACAGACACACACA 
CTCCATGGATGTGAGTCCGGAAAGGAGGAGGATCTGTGGTCCGGCACTCACTACTTCTATAGGCTAAGTTCTCAATAGTCTGACTTGCTCGGTCT 
CTCCAGCTCTAATTCTAAATCTACACAATCGAAGTTGAAATCGAAATCTATAAGATCCCTACTATCTCCCCCGATCTGTCATCTCTCATAGCCTT 
AGACTTTGCTCGCTATGTAATAACACACACGCACACTGTTGGAATGATTTTTATTATCTTTTAAATTATTGTTGAATTTTGTTTTATTTTTTTTA 
TATATATTTACACTGTTAAGCTTAGTTTTTATTTGTGCGTACTTGTTTATGATAAATATATATTATGTGTATTCATGTTCATGTTCAAATCGGAT 
AGTGCCGCCGCCTCTCTATATCTTTGCACACTTCTCACTTCAAACCCTCGTATCTCTATCTCGCGTGCAAGTCGTATTTTCAACAAACACAAGCA 
AACAACCGAAACAAACAAAAAAAATGACACTTTTTAACTATATATAGTAGATACCAGAAGAAATCCCTCCACTTTTTTTATGTTCGTTGCCGACT 
GCAAGGAAAAAAAAGGAAAAAAATTATGAAAAAAAGAAGTTGGTATAAAATAAACTTAACAAATAACCGCCAACAAAATGGAGCCGCGTATCGAG 
CATTAATTAATTAATAATTGATTGTAATTTTTAGATATATGATTGCGCAAATTGTTTGTTATTATTATTTTTGATTGCATAAACATTTGATACAT 
TATATATTATATAGCTGTTTTTTACGTTTATAAATTATTTTTGATTATACCACTTATTGATTACTGATTATTTATATTTATATACATTAATTAAT 
TATCCTCTAAGACATGTTTGCTACTTTTATACGAAATCAACCGAACCCAAAACCGAACCGCGACACAAACAACAATTCACAATTTTTATACTCGC 
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TTAGAATTTATGTTTTCCATTGTTTTATTGTAACCACGACAGCGGAAAGAAAAGATGGAATTACCTACCTACACCGTAATAATAATGCAGACAAA 

AACCCACTGGCGATTCTTTAATTTAATTTAAACAAAAACAACTGAACGCACTGAGAATAGTTGAAATTTTTGTTCTTGATTGTTGAACTATTTTT 

TGTTTAGGCATTTATATTTTAAAATAATTTCTTCGTTCGCAAGGAAGACGGACGAATTTGTTGTTTTGCCCCAAAATATATTACTATATTATATG 

CGAAAATTATTTGTTGTCAAATTGAATTTAAATAAACATGGAAAAATGTGTATATCAAACG 

{ SEQ ID NO: 1529) 

Start ATG: 670 

MASSPTPSLDSMRGGANSIESYEHAGYLSDSPLTLSGSSPPASDSAICSDEYTGGSSVKSRSEVTVIMGHHPISASVSSSSSASSSSCSSSSSSS 
SSSSSSSSSTSGLSGCGSTSSSVISANNVASSNGPGVIGSHLQSSNNGGNSGISSLVVGAGKGSNSSSNSSSSNTSANGSPPRCTACKSKCSDAV 
AKCFECQSYLCANCVTAHEFMHCFNGHNVCLIKGFEASTTGTPLSVGSPSNNPASNEFKYASSLTMMLQQQQQLDSQQQQQQQQLLPAQPMSQLS 
KIVLAAAAQANSQEQQREDSIYGSLHPQQQQQQQQQQQRQLFCPRHKQELLKFSCRTCCILVCKECIVLEHSTGLHELENVQSPGMTTSTGSTAN 
ESALQTLLADMRGKIGEIVGIAGNSDQNLTKVKLQYQKAHNELNETHQFFASMLDERKTELLKELETLYTAKVNSNNSWQQRSRDLIDKGLATCE 
AVERSPAPPSSLLTEALLLRKSLEQQLQTGIQEMQLPFEIEFMSNYQSIQAGVRNTFGYIRANSSDGGPTGMSLTSNGHGKQPPIARPTQSASNS 
SASSAGSGHHGHHQQSHHHGHHNHHQTAHHQQLQAQSSLHGLGLGLSGASLLDSSSSAGGAVGAFSNGGLLLGGRDRNALAVEQHFGELMPKRGG 
GGYTGSNGSATSAVAHYNPYEKWSNGGSDNLFSSVTSGVSGSSAVADAFASLSAVGGSVVSGAGAGGSTVSSESLLDLTNKLLSATIYPPKSQIK 
RQKMIYHCKFGEFGVMEGQFTEPSGVAVNAQNDIIVADTNNHRIQIFDKEGRFKFQFGECGKRDSQLLYPNRVAVVRNSGDIIVTERSPTHQIQI 
YNQYGQFVRKFGATILQHPRGVTVDNKGRIIVVECKVMRVIIFDQNGNVLHKFGCSKHLEFPNGVVVNDKQEIFISDNRAHCVKVFNYEGQYLRQ 

IGGEGITNYPIGVGINSNGEILIADNHNNFNLTIFTQDGQLISALESKVKHAQCFDVALMDDGSVVLASKDYRLYIYRYVQLAPVGM* 
(SEQ ID NO: 1530) 

Name: BRAIN TUMOR 

Classification: transcription_f actor 
Gene Symbol: brat 
FlyBase ID: FBgn0010300 



Celera Sequence No. : 142000013384351 

AAAGCTGGAGAAAACCTTCAAAGAGAAAGGCTTTCTCATCCAAACCCAGCAGCTTCAATCCGCCGAAGGTGCTACCTATTGCAAATTTAGACAGC 
TAAAGAAGTTCACGCGCTACCTTTTCCGCAACTGGAAGGATCACATACCCGAGAGCGATCTGGCCAAGCACCAAAATGCCTCGCGGGAGGATAGG 
CTGCCAAATCTGGAAACCCAAACCGCCAACCAGACGACATAGAGCTGCGGCCAGGACATTCAGGAAACATCCACACACAGCACTGCCAAAGATTT 
AAGTTTAAGATCAGCTAATGGCCGGCCCAGCTTCGGTTGCGGTTGCCAGTTGAGGGTGGAGAACAGCAACCCATGCCGATTCCGTCCATGCGGGC 
TGTCCAACAACGCGATCAATGATCCTCAGCCACGCTTCCTAGTCGTTCATCAGTTTTTTTAGTGCCTAATAAAATGTTATTGTTAACAAATAAAC 
TCAATGTTTAACCATATGCAAATGTATATTGTTTGGTTGTAAAAGGCATCGTAGTATGTATTTTACAAAATATCCGTCAATACAGTTATGTGTTA 
GTACAAAAGATAAACTCCAAAATGGGCGATAAGTCTATAGGGGTTATGCTCGCTCTCAAGGTCGTTTCAATGCCATTGCGAATACTTCTTGTTAT 
CATCGTTATATGCTTATTTAGAAGATGTCATTACATCCCGTTTCCAGATTAGCCTAAATGTTAGTTACAGATAATATAAAAATTTCCAAAATGTT 
TGCTGGTATGCTTAGGTCGACGAAAGAATTTTGACTCGAATTCATTCGAAAGATGTTTGGAAGTTTGGGCGTGGCCTTGATAAGATAAATTCTCA 
CTAATATTGTCAATTTTGTGGATTTGGTTGCATAATTGGTTCAGGTGGCAGTTCATTCAGTTTGGGGATCCTTGGCCAGGTTTTGGGCTTACAAC 
TAAAAAACATATTTGCTTTCAATCGATTTGTGCTCCTTTCGACTTCGTCTTTAGTTCGAACTCATGGAGCTCTCATGCTCCTTAGCCGTATCAGC 
TAGATATGTCTCCGGCACACTGGCATTCATGGGACACACGTACTCCGTCCAGTCTACGTCACTCTGGCTCTTCGAGCTCTTGCGACGCTTTAACG 
AAAACTGTGTGCGTTTCACCGGCTTCAGCTTAGCTGTAATGCTTAGCGGTTGGTACCCGGGTGCCGATTCGGCATCTCGCTTACTCAAGATCACT 
GGCCGACCGACTAGCTCCTTAAAGTGCATGGCCAGGTGGCGACGCAGCAGGGACTTGCTCGGTGGTATCGACAGCAGGTCGGCAAGCAAGTCAGC 
ATTGAAACGCGGTTCATAAAGCATCAGAGCCCCGTGGACACCAGCACCACGCAGATTGGGCGTGTATTCAGACAGGTCGATGGTCTGCAGCCATC 
CCATAATGCGATGCGCTGTCCACAGCTCCACATTGTCTCTGTCGGGAGTGGCTGCGGCTTCATCTTCACTGACAACTTTAATGGAGCGTCGAATT 
AGACCTTCCGCGTTCCACGATAAGCGGCGCATGCATAAGATTCCCGCTCGCAAGGAGGCTATGTGCAAACAAGAGCTGACGTGCAATTGGGAGAG 
GTCCTCGAGGGTGAGACGATGCAGCATTCGACCGTCAATCTTGGCCTGCATAAAGTAATCTTTATACTGTGGCAAGCCGATGTCATCCAGCCAGC 
GTAGAACCCATGTCACATCCAAGCTGGATGCCTTCAGTGTCAGGTCATCAAACTCCTTCCCAGTTAGGTCGTCGATGGCCAACAAGATTTTTTTC 
CGATGAAGTGGCATCTTTAGGTTTAGCTCTCGTTCGATGTCGACAGGCGAGGCGGTGAAGAACGACACTGATGGATTGGCATTCAACCACTTTCT 
GTAGGCGGAAATTGTATTTAGTAACTGGGTTCTTCTTGGATTTTGATGACAAAACTTACCGGCAATTGTCCTCATAGCAGCCTAGACCCATGTAT 
GCCAACCAATTGCACACCTCCTGCGCTCCCCAGGTTGTCCAATGTTTTTCATTCTCACCGAACAACGGAGTCTGAGCACTCCACTCAATTCGACC 
GCCTGCCGTAGCTCGGGCTCCTCCTCGTCGAAACTCGGGCTCTGCCTGCTCCAAGGCGGTCAGCTCGCAGTTGCTGCTGTTGCTGCGTCGTAGCT 
TACCAAGTATGTTGCGCAAGCCACGGGCAGATCTTTCCCTCAGCTTTGGTGAGGGTGTAAACTGCGACATAAAGCTGGAAGTCATTGCATCCTCC 
GCTTGATGATGATTGTCATCAATCAAAATGTGTTCGTTCGAAGCAAAGGCCACGTTTCGCTGCTTGCCGCTGCTGTTGGCATTGACGTCTAAGGC 
ACGATTGTTATTGTTGTTCACAAACTTGCCATGCGATCTTGGCAGGCTGTGGAACTGCGGTTTGATTTGGTGCCGCAGTGCAGAGGGCGGTGTCT 
TGGGCGATCCCATAAAGGCCTTCTGATTAATGTTGCTCAAACTGCCGTAAGGAATATTGGCGGAAATGGCGCTACCTCCATCGTTAGCCTGCTCT 
AATGCAGTACGGTGCAGTTTTAGTTCACTTAAGGCACTCATTAGCTCCAGCTTTTGCGTCTCGAGGGCGGAACTAGAAATCATCTGTCGTTGCAG 
TTGCTCTTCCTTGGTACACAACTGCTGGCTCTTTTCCGTTATCAGACCCTCCAGCTCGATAATCTTGTCCGCTTGTTCTGACACTCGTTCCGCCA 
GCATCTGATACTGCAGGGCTAAGCCCTCCTTATCCCGTTGCAGGCGACGCATACGCTCGTCGGAATCCTGACGAGGTATGTGCGTCTCCAGCCAG 
TCGCTGATAATGGCCGCCGTGACAGGATCCGGTGCTGGAGCAGCCAGTCCGACCTAAAGTGATAAGCGTAAGTGTAAGCCCTCCGTTTCTAGACC 
TTATCATAGTCAGTGTGACTCACCTGTTGCAGGGCGAGAGCCAAAGTCTTAGCTGTTGACAAAACATTAGTGGCCGAAATTAAGTTGCGTTCGCT 
GAGTGTCAGGGAAGACCCTGCTCCGGATATAGAAACGGATCCCGCCTGTGTCGCGGTACTCGATATAAGGCCGTCCATTTGCTGTAATGCTGCCT 
CGAGCATTTTGCTGCCCTCCGCCACAGTCGCCATGAAGTCCAAATCAAAGTCTCGCTAATCCCGGGTGATCGATGTGTGGTGTGCTTATTGCGCG 
ACTAGTGCATTGTTATGTAAGCAGGTCAGGCAGGGCACGTCATTTTGCACTGGTTTAGGGGAAAATGGCATTTATGGTTATTAATTTCAGCACTA 
AT T GGGG AAAAC T AT C AC G AAAAT AG AAGCT GC T G C AC CGTTCTGCTTCTTCTTCT C GC AC AC C AGC C C AGC C AC T T AT C G AT AG T C AGC G AAC G 
CAGAATTGTGTGTAGGAGCAACATCCGATAGTTTGCTAGATGGTCTAAAGTCTACAAAACAAATTTTAAAACAAAATTAATTTTAAGTAACGTAT 
AAAAATGCCTAAACACATTTTAATTACATTTTTTGGGGACAAAGTATGCAATATATCTAGATGTGTACAGCTATATTTGGTTTTCCATACCCGAT 
ATCTACTGTACAAAAAAGATACCATCACTAATAAGCAGGTGGAACGGTGGAGTCACGAAAGATGTGTCATCTCCAGCCAGAGCCACAACAAAACA 
AATATTAAGGATAAAATAACAAATATCTGAAATGAATCGCTTCGCTTATGTGGAGGCCCGTCTGAGTCCCAATGAATCCTTCGTCAGCCGCGACA 
ACAGGGTAAAAATATACGATGGCGACCAAAAGGTGAGCAGCTAGTGAATTCGCACAATACGAAACCGGGTCTTTATTAGACTCAAATCCTTTCAG 
ACGGACTTTGAAGACGGCGAAGTGGTGCTCACCACACACCGACTGTTTTGGGGTCGCCCTGGAGAGATTGCCCGGGCCGCCGTAACACTTTGCCT 
GCCACTCAGCTATGTGATATCCGTCAGCGAAGAGACTACTGCCTCTAACTTCTTTGGCCGCAAGACTCGCATAATCATGCATCTGCATCCTCCAA 
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CGTCGGACAAAGGACCCGGCCCACTGGACACCAGTCGCGCCACCCACATCAAACTGTCGGGCAAGAATGGCCTGAGTGTCGAGTTCCACAGTGCC 

TTGCGGGAGACTCTAAATGCCCGCGTCTGGGAAATCCTCTTGACCAGCGAGATCATCATCAATGGCGTGGCCAGCAGTCCAACCTTGGAGCCTGC 

CAACGACAGGCTGGCCCGCATTCAGAAGAGGACAGGAATTGGGGGCATCGAACGGCATTTGGAGGCAAAAGCCAAGGCCACGGATGAGAACATAG 

CACTTGCCTTT 

(SEQ ID NO: 1531) 

Exon: 3381.. 2969 
Exon: 2903.. 1960 
Exon: 1898.. 1001 

Start ATG: 3169 (Reverse strand: CAT) 
Transcript No. : CT30105 

AGAAGAAGAAGCAGAACGGTGCAGCAGCTTCTATTTTCGTGATAGTTTTCCCCAATTAGTGCTGAAATTAATAACCATAAATGCCATTTTCCCCT 

AAACCAGTGCAAAATGACGTGCCCTGCCTGACCTGCTTACATAACAATGCACTAGTCGCGCAATAAGCACACCACACATCGATCACCCGGGATTA 

GCGAGACTTTGATTTGGACTTCATGGCGACTGTGGCGGAGGGCAGCAAAATGCTCGAGGCAGCATTACAGCAAATGGACGGCCTTATATCGAGTA 

CCGCGACACAGGCGGGATCCGTTTCTATATCCGGAGCAGGGTCTTCCCTGACACTCAGCGAACGCAACTTAATTTCGGCCACTAATGTTTTGTCA 

ACAGCTAAGACTTTGGCTCTCGCCCTGCAACAGGTCGGACTGGCTGCTCCAGCACCGGATCCTGTCACGGCGGCCATTATCAGCGACTGGCTGGA 

GACGCACATACCTCGTCAGGATTCCGACGAGCGTATGCGTCGCCTGCAACGGGATAAGGAGGGCTTAGCCCTGCAGTATCAGATGCTGGCGGAAC 

GAGTGTCAGAACAAGCGGACAAGATTATCGAGCTGGAGGGTCTGATAACGGAAAAGAGCCAGCAGTTGTGTACCAAGGAAGAGCAACTGCAACGA 

CAGATGATTTCTAGTTCCGCCCTCGAGACGCAAAAGCTGGAGCTAATGAGTGCCTTAAGTGAACTAAAACTGCACCGTACTGCATTAGAGCAGGC 

TAACGATGGAGGTAGCGCCATTTCCGCCAATATTCCTTACGGCAGTTTGAGCAACATTAATCAGAAGGCCTTTATGGGATCGCCCAAGACACCGC 

CCTCTGCACTGCGGCACCAAATCAAACCGCAGTTCCACAGCCTGCCAAGATCGCATGGCAAGTTTGTGAACAACAATAACAATCGTGCCTTAGAC 

GTCAATGCCAACAGCAGCGGCAAGCAGCGAAACGTGGCCTTTGCTTCGAACGAACACATTTTGATTGATGACAATCATCATCAAGCGGAGGATGC 

AATGACTTCCAGCTTTATGTCGCAGTTTACACCCTCACCAAAGCTGAGGGAAAGATCTGCCCGTGGCTTGCGCAACATACTTGGTAAGCTACGAC 

GCAGCAACAGCAGCAACTGCGAGCTGACCGCCTTGGAGCAGGCAGAGCCCGAGTTTCGACGAGGAGGAGCCCGAGCTACGGCAGGCGGTCGAATT 

GAGTGGAGTGCTCAGACTCCGTTGTTCGGTGAGAATGAAAAACATTGGACAACCTGGGGAGCGCAGGAGGTGTGCAATTGGTTGGCATACATGGG 

TCTAGGCTGCTATGAGGACAATTGCCGAAAGTGGTTGAATGCCAATCCATCAGTGTCGTTCTTCACCGCCTCGCCTGTCGACATCGAACGAGAGC 

TAAACCTAAAGATGCCACTTCATCGGAAAAAAATCTTGTTGGCCATCGACGACCTAACTGGGAAGGAGTTTGATGACCTGACACTGAAGGCATCC 

AGCTTGGATGTGACATGGGTTCTACGCTGGCTGGATGACATCGGCTTGCCACAGTATAAAGATTACTTTATGCAGGCCAAGATTGACGGTCGAAT 

GCTGCATCGTCTCACCCTCGAGGACCTCTCCCAATTGCACGTCAGCTCTTGTTTGCACATAGCCTCCTTGCGAGCGGGAATCTTATGCATGCGCC 

GCTTATCGTGGAACGCGGAAGGTCTAATTCGACGCTCCATTAAAGTTGTCAGTGAAGATGAAGCCGCAGCCACTCCCGACAGAGACAATGTGGAG 

CTGTGGACAGCGCATCGCATTATGGGATGGCTGCAGACCATCGACCTGTCTGAATACACGCCCAATCTGCGTGGTGCTGGTGTCCACGGGGCTCT 

GATGCTTTATGAACCGCGTTTCAATGCTGACTTGCTTGCCGACCTGCTGTCGATACCACCGAGCAAGTCCCTGCTGCGTCGCCACCTGGCCATGC 

ACTTTAAGGAGCTAGTCGGTCGGCCAGTGATCTTGAGTAAGCGAGATGCCGAATCGGCACCCGGGTACCAACCGCTAAGCATTACAGCTAAGCTG 

AAGCCGGTGAAACGCACACAGTTTTCGTTAAAGCGTCGCAAGAGCTCGAAGAGCCAGAGTGACGTAGACTGGACGGAGTACGTGTGTCCCATGAA 

TGCCAGTGTGCCGGAGACATATCTAGCTGATACGGCTAAGGAGCATGAGAGCTCCATGAGTTCGAACTAA 

(SEQ ID NO: 1532) 

Start ATG: 213 (Reverse strand: CAT) 

MATVAEGSKMLEAALQQMDGLI S STATQAGSVS I SGAGSSLTLSERNLI SATNVLSTAKTLALALQQVGLAAPAPDPVTAAI I SDWLETHI PRQD 

SDERMRRLQRDKEGLALQYQMLAERVSEQADKIIELEGLITEKSQQLCTKEEQLQRQMISSSALETQKLELMSALSELKLHRTALEQANDGGSAI 

SANIPYGSLSNINQKAFMGSPKTPPSALRHQIKPQFHSLPRSHGKFVNNNNNRALDVNANSSGKQRNVAFASNEHILIDDNHHQAEDAMTSSFMS 

QFTPSPKLRERSARGLRNILGKLRRSNSSNCELTALEQAEPEFRRGGARATAGGRI EWSAQTPLFGENEKHWTTWGAQEVCNWLAYMGLGCYEDN 

CRKWLNANPSVSFFTASPVDIERELNLKMPLHRKKILLAIDDLTGKEFDDLTLKASSLDVTWVLRWLDDIGLPQYKDYFMQAKIDGRMLHRLTLE 

DLSQLHVSSCLHIASLRAGILCMRRLSWNAEGLIRRSIKVVSEDEAAATPDRDNVELWTAHRIMGWLQTIDLSEYTPNLRGAGVHGALMLYEPRF 

NADLLADLLSIPPSKSLLRRHLAMHFKELVGRPVILSKRDAESAPGYQPLSITAKLKPVKRTQFSLKRRKSSKSQSDVDWTEYVCPMNASVPETY 

LADTAKEHES SMS SN* 

(SEQ ID NO: 1533) 

Name: LIPRIN-BETAl 
Classification: cell adhesion 
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